Publication:

An Analytical Method for Inferring Library Identity From Illumina NGS Read Groups

Loading...
Thumbnail Image

Date

2017-04-04

Published Version

Published Version

Journal Title

Journal ISSN

Volume Title

Publisher

The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Taylor, Bradley Ryan. 2017. An Analytical Method for Inferring Library Identity From Illumina NGS Read Groups. Master's thesis, Harvard Extension School.

Abstract

When working with genetic sequence data, it is important to know the individual, sample, and sequencing library from which the data derives. While multiple software packages exist that can detect when data has been swapped between samples, there are presently no methods to detect library mislabeling. Such mislabellings can negatively impact downstream analyses. Here, we present a tool for reconstructing library relationships from aligned sequence read groups. The basic approach relies on quantifying the similarity of read groups based on their distributions of duplicated insert molecules. Several similarity measures were considered. Library-identity decisions based on similarity-modeling resulted in >91% sensitivity for identifying pairs of read groups from the same library and >98% specificity for identifying pairs from different libraries. Further improvements were seen through unbiased clustering of read groups. Without analytical methods to detect library misidentification, there is no way to know how pervasive this problem is in sequencing data sets. The present tool addresses this unmet need, and provides re- searchers with unique insight into their data’s chain of evidence.

Description

Other Available Sources

Research Data

Keywords

Biology, Bioinformatics, Biology, Genetics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories