Publication:
The Landscape and Consequences of Structural Variation in the Human Genome

No Thumbnail Available

Date

2022-09-01

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Collins, Ryan Lewis. 2022. The Landscape and Consequences of Structural Variation in the Human Genome. Doctoral dissertation, Harvard University Graduate School of Arts and Sciences.

Research Data

Abstract

Structural variants (SVs), defined as any rearrangement of ≥50 DNA nucleotides, are a major source of human genetic diversity that determines the content, order, and orientation of the six billion nucleotides in every human genome. Despite their outsized impact on genome biology and human phenotypes, surprisingly little is known about SVs compared to better-studied classes of genetic variation, such as single-nucleotide variants. In this dissertation, I examine the patterns and consequences of SVs in the global human population by aggregating and analyzing large genomic datasets. In Chapter 1, I review the foundational properties of SVs that were established at the outset of my dissertation research. In Chapter 2, I present GATK-SV, a scalable computational pipeline for jointly discovering and genotyping SVs in whole-genome sequencing (WGS) of large cohorts, which we subsequently applied 14,891 individuals from five major continental populations to construct a new reference catalog of SVs as part of the Genome Aggregation Database. In Chapter 3, I recap four parallel efforts to probe the roles of SVs in human disease, including a meta-analysis of rare deletions and duplications in nearly one-million individuals across 54 disorders. In Chapter 4, I investigate the predicted consequences of SVs on coding and noncoding functional loci and document how natural selection shapes the distributions of these SVs in human populations. I conclude by proposing how extensions of this dissertation may eventually enable genome-wide maps of evolutionary constraint against changes in DNA copy number—also known as dosage sensitivity—at kilobase resolution, present a prototype computational framework for dosage sensitivity mapping, and describe ongoing genomic data aggregation efforts towards this goal. Taken collectively, this work sheds new light on a variety of topics in contemporary human genomics, which I hope will provide value to the basic, translational, and clinical research communities.

Description

Other Available Sources

Keywords

Bioinformatics, Copy-Number Variation, Genetic Variation, Genomics, Population Genetics, Structural Variation, Genetics, Bioinformatics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories