Hack-Proof Artificial Intelligence Supply Chains Using Open Source Security

Practical ways to protect against AI software attacks

Tim Miller

Michael Lieberman

August 5, 2024

As security professionals, we are always working to guard against backdoors and vulnerabilities throughout the software development lifecycle. Developers try to pull in open source code that is secure, but dependencies run several layers deep, projects are inconsistently maintained, and now AI enters the picture. How can the industry keep up with the increasing pace of attacks, especially in the relatively novel artificial intelligence space?

Why is security a challenge for the AI software supply chain? 

The perils of Artificial Intelligence (AI) software supply chains mirror those of the broader software landscape, with some added intricacies. This is particularly true when integrating large language models (LLMs) or machine learning (ML) models. Traditional software supply chains are concerned with code and its dependencies; AI supply chains add the complication of the dataset used to train the model. The same model architecture trained on two different datasets can produce dramatically different output.

Individual organizations as well as the broader AI technology ecosystem are seeing some alarming trends. Recent studies suggest an inverse correlation between the security stance of open source AI software tools and their popularity. In other words: as these tools gain wider adoption, they may also increase the risk to users. It’s not just that the code itself could contain an exploitable vulnerability; the model data presents risks as well.

For instance, consider a scenario where a company uses AI models for screening job applicants. State and federal laws forbid discrimination on the basis of race, sex, veteran status, and other attributes. Companies must meticulously assess the software and training data supply chains behind their AI models to avoid biases that could lead to legal issues down the road. This isn’t just speculation. As far back as 2018, Amazon stopped using its AI recruiting tool when it discovered that the tool discriminated against women.

The proliferation of AI models poses substantial legal and regulatory hazards when the models are trained on potentially illegal or unethical data. This underscores the urgent need for stronger safeguards within the AI supply chain to keep users safe and secure. Clearing these hurdles is key to responsible adoption and to realizing and sustaining the potential of AI.

One of the main concerns with generative AI models at the moment is understanding the provenance of the data: does the model have the input necessary to give reasonable output, or will the model tell us to put glue on pizza? A fully open model is the only way to be sure. It’s good to see governments aligning in favor of open innovation, from the EU’s AI Act legislation to the US NTIA’s policy recommendations.

Actions for security professionals to take

Protecting the AI software supply chain takes diligence and a holistic look at the problem across multiple avenues. Only then can you make decisions that truly protect your organization. 

The first step is to use truly open source AI models — that is to say, a model open to inspection, modification, and redistribution, as well as an openly accessible training dataset with transparent origins, offering the same freedoms for scrutiny and utilization. After all, you can’t trust — or fix — what you can’t inspect.
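Making that inspectability operational can start with something simple: pin exactly which model artifacts you pulled in and fail the build if they drift. The sketch below assumes the model is hosted on Hugging Face and uses the huggingface_hub package; the repository ID, revision, and digests are placeholders for whatever you vetted, not a recommendation of any particular model.

```python
# Sketch: pin an open model to an exact revision and verify file digests
# before it enters your pipeline. All identifiers below are placeholders.
import hashlib
from pathlib import Path

from huggingface_hub import snapshot_download  # assumes the huggingface_hub package is installed

EXPECTED_SHA256 = {
    # file name -> digest you recorded when you first vetted the model (placeholder value)
    "model.safetensors": "0" * 64,
}

def fetch_and_verify(repo_id: str, revision: str) -> Path:
    """Download a pinned revision and fail closed if any tracked file changed."""
    local_dir = Path(snapshot_download(repo_id=repo_id, revision=revision))
    for name, expected in EXPECTED_SHA256.items():
        digest = hashlib.sha256((local_dir / name).read_bytes()).hexdigest()
        if digest != expected:
            raise RuntimeError(f"{name}: digest {digest} does not match pinned {expected}")
    return local_dir

if __name__ == "__main__":
    # Replace with the model and commit you actually reviewed.
    fetch_and_verify("example-org/fully-open-model", revision="<commit-sha-you-vetted>")
```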

Second, implement security best practices internally and advocate for greater transparency and accountability within the open source community. Make it the minimum requirement in your organization to have essential security metadata, such as Software Bills of Materials (SBOMs), SLSA (Supply-chain Levels for Software Artifacts) provenance, and SARIF (Static Analysis Results Interchange Format) reports. Many of the projects you rely on are maintained by volunteers, so bring help to improve the practices in your upstreams — don’t just make demands of them.
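In practice, a “minimum requirement” can be a pipeline gate that refuses to ship anything without the expected metadata. Here is a minimal sketch of such a gate for a CycloneDX-format SBOM; the file path and the specific fields enforced are assumptions to adapt to your own policy.

```python
# Sketch of a CI gate that enforces "SBOM required" as a minimum bar.
# Assumes a CycloneDX JSON SBOM produced by your build tooling.
import json
import sys
from pathlib import Path

def check_sbom(path: str) -> int:
    sbom_file = Path(path)
    if not sbom_file.exists():
        print(f"FAIL: no SBOM found at {path}")
        return 1
    sbom = json.loads(sbom_file.read_text())
    if sbom.get("bomFormat") != "CycloneDX":
        print("FAIL: not a CycloneDX SBOM")
        return 1
    # Flag components that lack the identifiers your policy requires.
    problems = [
        component.get("name", "<unnamed>")
        for component in sbom.get("components", [])
        if not component.get("version") or not component.get("purl")
    ]
    for name in problems:
        print(f"WARN: component {name} is missing a version or purl")
    return 1 if problems else 0

if __name__ == "__main__":
    sys.exit(check_sbom(sys.argv[1] if len(sys.argv) > 1 else "sbom.cdx.json"))
```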

Third, adopt open source security tools into your workflow. Projects such as Allstar, GUAC, and in-toto attestations provide tools you can incorporate to observe and verify your software stack’s security posture. Google, our partner in the development of GUAC, recently released a report that shares how they secure their AI supply chain using provenance information and provides guidance for other organizations looking to do the same. 
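As a taste of what these tools let you verify, the sketch below checks that an in-toto attestation carrying the SLSA provenance v1 predicate actually refers to the artifact in hand, by comparing subject digests. It deliberately skips signature verification, which in practice you would delegate to tooling such as slsa-verifier or cosign; the file names are illustrative.

```python
# Minimal sketch: confirm an in-toto/SLSA provenance statement refers to the
# artifact you are about to use. This only shows the subject-digest check;
# real verification must also validate the attestation's signature.
import hashlib
import json
from pathlib import Path

SLSA_PROVENANCE_V1 = "https://slsa.dev/provenance/v1"

def artifact_matches_attestation(artifact_path: str, statement_path: str) -> bool:
    statement = json.loads(Path(statement_path).read_text())
    if statement.get("predicateType") != SLSA_PROVENANCE_V1:
        return False
    artifact_digest = hashlib.sha256(Path(artifact_path).read_bytes()).hexdigest()
    return any(
        subject.get("digest", {}).get("sha256") == artifact_digest
        for subject in statement.get("subject", [])
    )

if __name__ == "__main__":
    ok = artifact_matches_attestation("model.tar.gz", "provenance.statement.json")
    print("provenance matches artifact" if ok else "MISMATCH: do not deploy")
```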

Lastly, invest in open source contributions and funding. Support organizations like the Open Source Security Foundation (OpenSSF), which develops specifications, tools, and initiatives to secure critical open source projects. Donate time and money to the projects in your supply chain that you depend on — it’s far more affordable than writing all that software yourself.

There is no silver bullet to address security, and even the most careful organizations can find themselves on the wrong end of a compromise. The addition of AI models into the software supply chain only adds more complexity. But there’s no need to panic — you can improve your AI supply chain’s observability with tools and practices available today. Once you understand your supply chain, you can secure it.
