Publication:

The Ad-Hoc Uncertainty Principle of Patient Privacy

Loading...
Thumbnail Image

Date

2017

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

American Medical Informatics Association
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Klann, Jeffrey G., Matthew Joss, Rohan Shirali, Marc Natter, Sebastian Schneeweiss, Kenneth D. Mandl, and Shawn N. Murphy. 2017. “The Ad-Hoc Uncertainty Principle of Patient Privacy.” AMIA Summits on Translational Science Proceedings 2017 (1): 132-138.

Abstract

The Health Information Portability and Accountability Act (HIPAA) allows for the exchange of de-identified patient data, but its definition of de-identification is essentially open-ended, thus leaving the onus on dataset providers to ensure patient privacy. The Patient Centered Outcomes Research Network (PCORnet) builds a de-identification approach into queries, but we have noticed various subtle problems with this approach. We censor aggregate counts below a threshold (i.e. <11) to protect patient privacy. However, we have found that thresholded numbers can at times be inferred, and some key numbers are not thresholded at all. Furthermore, PCORnet’s approach of thresholding low counts introduces a selection bias which slants the data towards larger health care sites and their corresponding demographics. We propose a solution: instead of censoring low counts, introduce Gaussian noise to all aggregate counts. We describe this approach and the freely available tools we created for this purpose.

Description

Research Data

Keywords

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories