Publication:
Functional characterization of genetic variation with in silico predictions of cell-type-specific regulatory elements

No Thumbnail Available

Date

2020-09-10

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Amariuta, Tiffany. 2020. Functional characterization of genetic variation with in silico predictions of cell-type-specific regulatory elements. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Research Data

Abstract

Genome-wide association studies (GWAS) have implicated thousands of complex trait-variant associations, an estimated 90% of which reside in the noncoding genome. While noncoding variants generally have poorly understood regulatory function, previous work has shown that disease-driving genetic variation often affects cell-type-specific gene regulation, such as transcription factor (TF) binding. However, maps of TF-mediated cell-type-specific regulation are currently incomplete due to limited amounts of experimental data. In this thesis, I introduce a novel strategy to annotate the noncoding genome with cell-type-specific regulatory element probabilities via integration and modeling of thousands of publicly available epigenetic datasets. I show that these functional annotations in the disease-driving cell type are more highly enriched for disease heritability than experimentally derived functional annotations. Next, I use these functional annotations to prioritize disease-relevant variants in the context of polygenic risk score (PRS) models. I show that this approach improves the trans-ethnic portability of PRS by reducing the confounding effects of population-specific linkage disequilibrium. Lastly, I introduce a novel strategy to leverage the unprecedented resolution of single cell data to elucidate cell-state-specific activity of trait-driving variants identified by polygenic fine-mapping data from GWAS. This strategy consists of calculating cell-specific enrichments of genome-wide genetic variation in functional regions and then associating these enrichments with polygenic regulatory programs. I show that this approach identifies heterogeneity of risk variant accessibility, nominating putatively causal cell states and regulatory mechanisms. Altogether, this work demonstrates the importance of comprehensive functional annotations to better understand disease and trait etiology.

Description

Other Available Sources

Keywords

Autoimmune disease, Biomedical Informatics, Complex traits and diseases, Genomic functional annotation, Population genetics, Statistical genetics, Bioinformatics, Genetics, Statistics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories