Publication:
A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease

Thumbnail Image

Date

2017

Journal Title

Journal ISSN

Volume Title

Publisher

Oxford University Press
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Han, Lichy, Mateusz Maciejewski, Christoph Brockel, William Gordon, Scott B Snapper, Joshua R Korzenik, Lovisa Afzelius, and Russ B Altman. 2017. “A probabilistic pathway score (PROPS) for classification with applications to inflammatory bowel disease.” Bioinformatics 34 (6): 985-993. doi:10.1093/bioinformatics/btx651. http://dx.doi.org/10.1093/bioinformatics/btx651.

Research Data

Abstract

Abstract Summary Gene-based supervised machine learning classification models have been widely used to differentiate disease states, predict disease progression and determine effective treatment options. However, many of these classifiers are sensitive to noise and frequently do not replicate in external validation sets. For complex, heterogeneous diseases, these classifiers are further limited by being unable to capture varying combinations of genes that lead to the same phenotype. Pathway-based classification can overcome these challenges by using robust, aggregate features to represent biological mechanisms. In this work, we developed a novel pathway-based approach, PRObabilistic Pathway Score, which uses genes to calculate individualized pathway scores for classification. Unlike previous individualized pathway-based classification methods that use gene sets, we incorporate gene interactions using probabilistic graphical models to more accurately represent the underlying biology and achieve better performance. We apply our method to differentiate two similar complex diseases, ulcerative colitis (UC) and Crohn’s disease (CD), which are the two main types of inflammatory bowel disease (IBD). Using five IBD datasets, we compare our method against four gene-based and four alternative pathway-based classifiers in distinguishing CD from UC. We demonstrate superior classification performance and provide biological insight into the top pathways separating CD from UC. Availability and Implementation PROPS is available as a R package, which can be downloaded at http://simtk.org/home/props or on Bioconductor. Contact rbaltman@stanford.edu Supplementary information Supplementary data are available at Bioinformatics online.

Description

Keywords

Systems Biology

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories