Publication:

Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium

Loading...
Thumbnail Image

Date

2013

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Grove, M. L., B. Yu, B. J. Cochran, T. Haritunians, J. C. Bis, K. D. Taylor, M. Hansen, et al. 2013. “Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium.” PLoS ONE 8 (7): e68095. doi:10.1371/journal.pone.0068095. http://dx.doi.org/10.1371/journal.pone.0068095.

Abstract

Genotyping arrays are a cost effective approach when typing previously-identified genetic polymorphisms in large numbers of samples. One limitation of genotyping arrays with rare variants (e.g., minor allele frequency [MAF] <0.01) is the difficulty that automated clustering algorithms have to accurately detect and assign genotype calls. Combining intensity data from large numbers of samples may increase the ability to accurately call the genotypes of rare variants. Approximately 62,000 ethnically diverse samples from eleven Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium cohorts were genotyped with the Illumina HumanExome BeadChip across seven genotyping centers. The raw data files for the samples were assembled into a single project for joint calling. To assess the quality of the joint calling, concordance of genotypes in a subset of individuals having both exome chip and exome sequence data was analyzed. After exclusion of low performing SNPs on the exome chip and non-overlap of SNPs derived from sequence data, genotypes of 185,119 variants (11,356 were monomorphic) were compared in 530 individuals that had whole exome sequence data. A total of 98,113,070 pairs of genotypes were tested and 99.77% were concordant, 0.14% had missing data, and 0.09% were discordant. We report that joint calling allows the ability to accurately genotype rare variation using array technology when large sample sizes are available and best practices are followed. The cluster file from this experiment is available at www.chargeconsortium.com/main/exomechip.

Description

Research Data

Keywords

Biology, Computational Biology, Population Genetics, Genetic Polymorphism, Genetics, Heredity, Genotypes, Human Genetics, Genomics, Genome Databases, Mutation Databases, Genome Analysis Tools, Genome Sequencing, Population Biology

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories