Publication: Towards a Systematic Approach for Characterizing Regulatory Variation
No Thumbnail Available
Date
2015-11-03
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Barrera, Luis A. 2016. Towards a Systematic Approach for Characterizing Regulatory Variation. Doctoral dissertation, Harvard University, Graduate School of Arts & Sciences.
Research Data
Abstract
A growing body of evidence suggests that genetic variants that alter gene expression are responsible for many phenotypic differences across individuals, particularly for the risk of developing common diseases. However, the molecular mechanisms that underlie the vast majority of associations between genetic variants and their phenotypes remain unknown. An important limiting factor is that genetic variants remain difficult to interpret, particularly in noncoding sequences. Developing truly systematic approaches for characterizing regulatory variants will require: (a) improved annotations for the genomic sequences that control gene expression, (b) a more complete understanding of the molecular mechanisms through which genetic variants, both coding and noncoding, can affect gene expression, and (c) better experimental tools for testing hypotheses about regulatory variants.
In this dissertation, I present conceptual and methodological advances that directly contribute to each of these goals. A recurring theme in all of these developments is the statistical modeling of protein-DNA interactions and its integration with other data types. First, I describe enhancer-FACS-Seq, a high-throughput experimental approach for screening candidate enhancer sequences to test for in vivo, tissue-specific activity. Second, I present an integrative computational analysis of the in vivo binding of NF-kappaB, a key regulator of the immune system, yielding new insights into how genetic variants can affect NF-kappaB binding. Next, I describe the first comprehensive survey of coding variation in human transcription factors and what it reveals about additional sources of genetic variation that can affect gene expression. Finally, I present SIFTED, a statistical framework and web tool for the optimal design of TAL effectors, which have been used successfully in genome editing and can thus be used to test hypotheses about regulatory variants. Together, these developments help fulfill key needs in the quest to understand the molecular basis of human phenotypic variation.
Description
Other Available Sources
Keywords
Biology, Bioinformatics, Biology, Genetics
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service