Publication:

Identification of pathogen genomic variants through an integrated pipeline

Loading...
Thumbnail Image

Open/View Files

Date

2014

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

BioMed Central
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Manary, Micah J, Suriya S Singhakul, Erika L Flannery, Selina ER Bopp, Victoria C Corey, Andrew Taylor Bright, Case W McNamara, John R Walker, and Elizabeth A Winzeler. 2014. “Identification of pathogen genomic variants through an integrated pipeline.” BMC Bioinformatics 15 (1): 63. doi:10.1186/1471-2105-15-63. http://dx.doi.org/10.1186/1471-2105-15-63.

Abstract

Background: Whole-genome sequencing represents a powerful experimental tool for pathogen research. We present methods for the analysis of small eukaryotic genomes, including a streamlined system (called Platypus) for finding single nucleotide and copy number variants as well as recombination events. Results: We have validated our pipeline using four sets of Plasmodium falciparum drug resistant data containing 26 clones from 3D7 and Dd2 background strains, identifying an average of 11 single nucleotide variants per clone. We also identify 8 copy number variants with contributions to resistance, and report for the first time that all analyzed amplification events are in tandem. Conclusions: The Platypus pipeline provides malaria researchers with a powerful tool to analyze short read sequencing data. It provides an accurate way to detect SNVs using known software packages, and a novel methodology for detection of CNVs, though it does not currently support detection of small indels. We have validated that the pipeline detects known SNVs in a variety of samples while filtering out spurious data. We bundle the methods into a freely available package.

Description

Research Data

Keywords

Malaria, Sequencing, Genome, Polymorphism, Variant

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories