Assessing Biases in the Evaluation of Classification Assays for HIV Infection Recency
Wu, Julia W.
MetadataShow full item record
CitationPatterson-Lomba, Oscar, Julia W. Wu, and Marcello Pagano. 2015. “Assessing Biases in the Evaluation of Classification Assays for HIV Infection Recency.” PLoS ONE 10 (10): e0139735. doi:10.1371/journal.pone.0139735. http://dx.doi.org/10.1371/journal.pone.0139735.
AbstractIdentifying recent HIV infection cases has important public health and clinical implications. It is essential for estimating incidence rates to monitor epidemic trends and evaluate the effectiveness of interventions. Detecting recent cases is also important for HIV prevention given the crucial role that recently infected individuals play in disease transmission, and because early treatment onset can improve the clinical outlook of patients while reducing transmission risk. Critical to this enterprise is the development and proper assessment of accurate classification assays that, based on cross-sectional samples of viral sequences, help determine infection recency status. In this work we assess some of the biases present in the evaluation of HIV recency classification algorithms that rely on measures of within-host viral diversity. Particularly, we examine how the time since infection (TSI) distribution of the infected subjects from which viral samples are drawn affect performance metrics (e.g., area under the ROC curve, sensitivity, specificity, accuracy and precision), potentially leading to misguided conclusions about the efficacy of classification assays. By comparing the performance of a given HIV recency assay using six different TSI distributions (four simulated TSI distributions representing different epidemic scenarios, and two empirical TSI distributions), we show that conclusions about the overall efficacy of the assay depend critically on properties of the TSI distribution. Moreover, we demonstrate that an assay with high overall classification accuracy, mainly due to properly sorting members of the well-represented groups in the validation dataset, can still perform notoriously poorly when sorting members of the less represented groups. This is an inherent issue of classification and diagnostics procedures that is often underappreciated. Thus, this work underscores the importance of acknowledging and properly addressing evaluation biases when proposing new HIV recency assays.
Citable link to this pagehttp://nrs.harvard.edu/urn-3:HUL.InstRepos:23474095
- SPH Scholarly Articles