Securing AI Model Weights: Preventing Theft and Misuse of Frontier Models
Abstract:
The goal of this report is to improve the security of frontier artificial intelligence (AI) and machine learning (ML) models. (Frontier models are those that match or exceed the capabilities of the most advanced AI models at the time of their development.) Our analysis focuses on foundation models, specifically large language models and similar multimodal models. We focus on the critical leverage point at the core of a model's intelligence and capabilities: its weights, a term used here to refer to all learnable parameters derived by training the model on massive datasets. These parameters stem from large investments in data, algorithms, compute (i.e., the processing power and resources used to process data and run calculations), and other resources; compromising the weights would give an attacker direct access to the crown jewels of an AI developer's work and the ability to exploit them for their own use.