Analysis of 6,515 exomes reveals a recent origin of most human protein-coding variants
O'Connor, Timothy D.
Kang, Hyun Min
Leal, Suzanne M.
Nickerson, Deborah A.
Bamshad, Michael J.
Akey, Joshua M.Note: Order does not necessarily reflect citation order of authors.
MetadataShow full item record
CitationFu, W., T. D. O'Connor, G. Jun, H. M. Kang, G. Abecasis, S. M. Leal, S. Gabriel, et al. 2012. “Analysis of 6,515 exomes reveals a recent origin of most human protein-coding variants.” Nature 493 (7431): 216-220. doi:10.1038/nature11690. http://dx.doi.org/10.1038/nature11690.
AbstractEstablishing the age of each mutation segregating in contemporary human populations is important to fully understand our evolutionary history1,2 and will help facilitate the development of new approaches for disease gene discovery3. Large-scale surveys of human genetic variation have reported signatures of recent explosive population growth4-6, notable for an excess of rare genetic variants, qualitatively suggesting that many mutations arose recently. To more quantitatively assess the distribution of mutation ages, we resequenced 15,336 genes in 6,515 individuals of European (n=4,298) and African (n=2,217) American ancestry and inferred the age of 1,146,401 autosomal single nucleotide variants (SNVs). We estimate that ~73% of all protein-coding SNVs and ~86% of SNVs predicted to be deleterious arose in the past 5,000-10,000 years. The average age of deleterious SNVs varied significantly across molecular pathways, and disease genes contained a significantly higher proportion of recently arisen deleterious SNVs compared to other genes. Furthermore, European Americans had an excess of deleterious variants in essential and Mendelian disease genes compared to African Americans, consistent with weaker purifying selection due to the out-of-Africa dispersal. Our results better delimit the historical details of human protein-coding variation, illustrate the profound effect recent human history has had on the burden of deleterious SNVs segregating in contemporary populations, and provides important practical information that can be used to prioritize variants in disease gene discovery.
Citable link to this pagehttp://nrs.harvard.edu/urn-3:HUL.InstRepos:11717639
- HMS Scholarly Articles