Publication:

DEADEYE: Differential Expressivity As Dataset fairnEss/usabilitY Estimator

Loading...
Thumbnail Image

Date

2022-05-23

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Fishman, Leor Barak. 2022. DEADEYE: Differential Expressivity As Dataset fairnEss/usabilitY Estimator. Bachelor's thesis, Harvard College.

Abstract

Over the past several years, significant research has gone into analyzing algorithmic fairness -- the problem of ensuring ML algorithms do not exhibit biases against protected groups. That research demonstrated that, given a fair ground truth dataset, one could produce algorithms that maintained that fairness (for various definitions of fairness). Additionally, that research gave several holistic ways in which datasets themselves might be unfair. We provide a new metric for dataset fairness, \textit{Differential Expressivity}, which puts dataset fairness on the same formal grounding as algorithmic fairness. Additionally, we show several hardness results for this new metric, as well as algorithms for calculating it in certain subcases. Finally, we test the metric on COMPAS recidivism data and show that empirically it points out underlying fairness issues on real data.

Description

Other Available Sources

Research Data

Keywords

Fairness, Machine Learning, Theory of Computer Science, Computer science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories