Publication: iBBiG: iterative binary bi-clustering of gene sets
Open/View Files
Date
2012
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
Oxford University Press
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Gusenleitner, Daniel, Eleanor A. Howe, Stefan Bentink, John Quackenbush, and Aedín C. Culhane. 2012. iBBiG: iterative binary bi-clustering of gene sets. Bioinformatics 28(19): 2484-2492.
Research Data
Abstract
Motivation: Meta-analysis of genomics data seeks to identify genes associated with a biological phenotype across multiple datasets; however, merging data from different platforms by their features (genes) is challenging. Meta-analysis using functionally or biologically characterized gene sets simplifies data integration is biologically intuitive and is seen as having great potential, but is an emerging field with few established statistical methods. Results: We transform gene expression profiles into binary gene set profiles by discretizing results of gene set enrichment analyses and apply a new iterative bi-clustering algorithm (iBBiG) to identify groups of gene sets that are coordinately associated with groups of phenotypes across multiple studies. iBBiG is optimized for meta-analysis of large numbers of diverse genomics data that may have unmatched samples. It does not require prior knowledge of the number or size of clusters. When applied to simulated data, it outperforms commonly used clustering methods, discovers overlapping clusters of diverse sizes and is robust in the presence of noise. We apply it to meta-analysis of breast cancer studies, where iBBiG extracted novel gene set—phenotype association that predicted tumor metastases within tumor subtypes.
Description
Other Available Sources
Keywords
Systems Biology
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service