Publication:
Analysis of the Harvard Computer Society Email Archives: An Exploration of Differential Privacy in Practice

No Thumbnail Available

Date

2024-11-26

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Cooper, William Chen. 2024. Analysis of the Harvard Computer Society Email Archives: An Exploration of Differential Privacy in Practice. Bachelor's thesis, Harvard University Engineering and Applied Sciences.

Research Data

Abstract

This thesis provides a rudimentary introduction to differential privacy as a framework for modern data privacy, using the Harvard Computer Society email list archives as an investigative medium. The differentially private analysis of this dataset includes but is not limited to: time series of list usage, email topic modeling, and sentiment analysis. OpenDP’s Python package for differential privacy is used extensively to execute computations, and the API is evaluated as a standalone programming framework within itself. Novel graph differential private algorithms are both implemented and empirically assessed. Lastly, this thesis discusses a significant inherent challenge in balancing contrasting aspects of differential privacy and exploratory data analysis.

Description

Other Available Sources

Keywords

Computer science

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories