| Title: | Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster |
| Author: |
Kulathinal, Rob J.; Bergman, Casey M.; Quinlan, Aaron R.; Dopman, Erik B.; Marth, Gabor T.; Carneiro, Mauricio Oliveira; Hartl, Daniel L.; Clark, Andrew G.; Sackton, Timothy
Note: Order does not necessarily reflect citation order of authors. |
| Citation: | Sackton, Timothy B., Rob J. Kulathinal, Casey M. Bergman, Aaron R. Quinlan, Erik B. Dopman, Mauricio Carneiro, Gabor T. Marth, Daniel L. Hartl, and Andrew G. Clark. 2009. Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster. Genome Biology and Evolution 1:449-465. |
| Full Text & Related Files: |
2839279.pdf (780.4Kb; PDF)
|
| Abstract: | Short-read sequencing techniques provide the opportunity to capture genome-wide sequence data in a single experiment. A current challenge is to identify questions that shallow-depth genomic data can address successfully and to develop corresponding analytical methods that are statistically sound. Here, we apply the Roche/454 platform to survey natural variation in strains of Drosophila melanogaster from an African (n = 3) and a North American (n = 6) population. Reads were aligned to the reference D. melanogaster genomic assembly, single nucleotide polymorphisms were identified, and nucleotide variation was quantified genome wide. Simulations and empirical results suggest that nucleotide diversity can be accurately estimated from sparse data with as little as 0.2× coverage per line. The unbiased genomic sampling provided by random short-read sequencing also allows insight into distributions of transposable elements and copy number polymorphisms found within populations and demonstrates that short-read sequencing methods provide an efficient means to quantify variation in genome organization and content. Continued development of methods for statistical inference of shallow-depth genome-wide sequencing data will allow such sparse, partial data sets to become the norm in the emerging field of population genomics. |
| Published Version: | doi:10.1093/gbe/evp048 |
| Other Sources: | http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2839279/pdf/ |
| Terms of Use: | This article is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#OAP |
| Citable link to this page: | http://nrs.harvard.edu/urn-3:HUL.InstRepos:4454185 |
Contact administrator regarding this item (to report mistakes or request changes)