Publication:

Structuring Incentives in the Development and Use of Artificial Intelligence

Loading...
Thumbnail Image

Date

2024-01-30

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Shavit, Yonadav Goldwasser. 2023. Structuring Incentives in the Development and Use of Artificial Intelligence. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Abstract

The success of machine learning (ML) has, in addition to its direct impacts, significantly reshaped the incentives of the humans developing, using, and being targeted by ML systems. This dissertation explores technical methods for identifying and reshaping these incentives to ensure they are serving the interests of users and society. The first two chapters examine the incentives created by narrow ML systems that make consequential decisions about users. Chapter 1 provides a method for identifying the incentives produced by black-box models like neural networks. Chapter 2 provides methods for selecting linear regression rules in the presence of decision-recipients who will actively game the chosen rule, and whose true outcomes may change in turn. The later two chapters concern the incentives for companies and governments to develop and misuse powerful general-purpose AI systems. They cover technical approaches for identifying misconduct by ML developers, in order to increase their incentives for honesty. Chapter 3 proposes a framework for international inspectors to verify a company or government’s large-scale ML development via data center hardware inspections, inspired by the model of the IAEA. Chapter 4 examines how an auditor could efficiently verify what training data had been used to produce a large ML model, and provides an efficient method that works on current open-source large language models.

Description

Other Available Sources

Research Data

Keywords

AI Policy, AI Safety, Economics & Computer Science, Machine Learning, Computer science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories