Publication:
A Map of Human Genome Variation from Population Scale Sequencing

Thumbnail Image

Date

2010

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

Nature Publishing Group
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Durbin, Richard M., David Altshuler, Gonçalo R. Abecasis, David R. Bentley, Aravinda Chakravarti, Andrew G. Clark, Francis S. Collins et al. 2010. A map of human genome variation from population-scale sequencing. Nature 467(7319): 1061-1073.

Research Data

Abstract

The 1000 Genomes Project aims to provide a deep characterization of human genome sequence variation as a foundation for investigating the relationship between genotype and phenotype. Here we present results of the pilot phase of the project, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms. We undertook three projects: low-coverage whole-genome sequencing of 179 individuals from four populations; high-coverage sequencing of two mother–father–child trios; and exon-targeted sequencing of 697 individuals from seven populations. We describe the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants, most of which were previously undescribed. We show that, because we have catalogued the vast majority of common variation, over 95% of the currently accessible variants found in any individual are present in this data set. On average, each person is found to carry approximately 250 to 300 loss-of-function variants in annotated genes and 50 to 100 variants previously implicated in inherited disorders. We demonstrate how these results can be used to inform association and functional studies. From the two trios, we directly estimate the rate of de novo germline base substitution mutations to be approximately \(10^{−8}\) per base pair per generation. We explore the data with regard to signatures of natural selection, and identify a marked reduction of genetic variation in the neighbourhood of genes, due to selection at linked sites. These methods and public data will support the next phase of human genetic research.

Description

Keywords

wide association, gene-expression, gene-expression, recombination hotspots, meiotic recombination, genotype imputation, rare variants, haplotype map, nucleotide, diseases

Terms of Use

This article is made available under the terms and conditions applicable to Open Access Policy Articles (OAP), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories

Story
A Map of Human Genome Variation… : DASH Story 2012-12-03
Thank you!! Thank you very much for the open access... Please continue to share open access materials to us researchers who might not otherwise have access to these articles.. :)
Story
A Map of Human Genome Variation… : DASH Story 2013-06-03
Open access to scientific papers is vital if the public are to be properly informed, and armed with the information allowing them to defend themselves against being misled, either by politicians or by pressure groups for various ideologies. I rely upon open access in the continued battle to inform the public about genuine scientific findings, in the face of duplicitous anti-science propaganda by, for example, various creationist organisations. Without open access, combating the lies would be considerably more difficult.