Publication:
A database of human exposomes and phenomes from the US National Health and Nutrition Examination Survey

Thumbnail Image

Open/View Files

Date

2016

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

Nature Publishing Group
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Patel, Chirag J., Nam Pho, Michael McDuffie, Jeremy Easton-Marks, Cartik Kothari, Isaac S. Kohane, and Paul Avillach. 2016. “A database of human exposomes and phenomes from the US National Health and Nutrition Examination Survey.” Scientific Data 3 (1): 160096. doi:10.1038/sdata.2016.96. http://dx.doi.org/10.1038/sdata.2016.96.

Research Data

Abstract

The National Health and Nutrition Examination Survey (NHANES) is a population survey implemented by the Centers for Disease Control and Prevention (CDC) to monitor the health of the United States whose data is publicly available in hundreds of files. This Data Descriptor describes a single unified and universally accessible data file, merging across 255 separate files and stitching data across 4 surveys, encompassing 41,474 individuals and 1,191 variables. The variables consist of phenotype and environmental exposure information on each individual, specifically (1) demographic information, physical exam results (e.g., height, body mass index), laboratory results (e.g., cholesterol, glucose, and environmental exposures), and (4) questionnaire items. Second, the data descriptor describes a dictionary to enable analysts find variables by category and human-readable description. The datasets are available on DataDryad and a hands-on analytics tutorial is available on GitHub. Through a new big data platform, BD2K Patient Centered Information Commons (http://pic-sure.org), we provide a new way to browse the dataset via a web browser (https://nhanes.hms.harvard.edu) and provide application programming interface for programmatic access.

Description

Keywords

Diagnostic markers, Databases, Epidemiology

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories