Publication: The Landscape and Consequences of Structural Variation in the Human Genome
No Thumbnail Available
Open/View Files
Date
2022-09-01
Authors
Published Version
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Collins, Ryan Lewis. 2022. The Landscape and Consequences of Structural Variation in the Human Genome. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.
Research Data
Abstract
Structural variants (SVs), defined as any rearrangement of ≥50 DNA nucleotides, are a major source of human genetic diversity that determines the content, order, and orientation of the six billion nucleotides in every human genome. Despite their outsized impact on genome biology and human phenotypes, surprisingly little is known about SVs compared to better-studied classes of genetic variation, such as single-nucleotide variants. In this dissertation, I examine the patterns and consequences of SVs in the global human population by aggregating and analyzing large genomic datasets. In Chapter 1, I review the foundational properties of SVs that were established at the outset of my dissertation research. In Chapter 2, I present GATK-SV, a scalable computational pipeline for jointly discovering and genotyping SVs in whole-genome sequencing (WGS) of large cohorts, which we subsequently applied 14,891 individuals from five major continental populations to construct a new reference catalog of SVs as part of the Genome Aggregation Database. In Chapter 3, I recap four parallel efforts to probe the roles of SVs in human disease, including a meta-analysis of rare deletions and duplications in nearly one-million individuals across 54 disorders. In Chapter 4, I investigate the predicted consequences of SVs on coding and noncoding functional loci and document how natural selection shapes the distributions of these SVs in human populations. I conclude by proposing how extensions of this dissertation may eventually enable genome-wide maps of evolutionary constraint against changes in DNA copy number—also known as dosage sensitivity—at kilobase resolution, present a prototype computational framework for dosage sensitivity mapping, and describe ongoing genomic data aggregation efforts towards this goal. Taken collectively, this work sheds new light on a variety of topics in contemporary human genomics, which I hope will provide value to the basic, translational, and clinical research communities.
Description
Other Available Sources
Keywords
Bioinformatics, Copy-Number Variation, Genetic Variation, Genomics, Population Genetics, Structural Variation, Genetics, Bioinformatics
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service