reGenotyper: Detecting mislabeled samples in genetic data
Snoek, Basten L.
Van der Velde, K. Joeri
Swertz, Morris A.
Kammenga, Jan E.
Jansen, Ritsert C.
Li, YangNote: Order does not necessarily reflect citation order of authors.
MetadataShow full item record
CitationZych, K., B. L. Snoek, M. Elvin, M. Rodriguez, K. J. Van der Velde, D. Arends, H. Westra, et al. 2017. “reGenotyper: Detecting mislabeled samples in genetic data.” PLoS ONE 12 (2): e0171324. doi:10.1371/journal.pone.0171324. http://dx.doi.org/10.1371/journal.pone.0171324.
AbstractIn high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the “ideal” genotype and identify “best-matched” labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a “data cleaning” step before standard data analysis.
Citable link to this pagehttp://nrs.harvard.edu/urn-3:HUL.InstRepos:31731722
- HMS Scholarly Articles