Defining an Informativeness Metric for Clustering Gene Expression Data

Mar, Jessica; Wells, Christine A.; Quackenbush, John

View/Open

3072547.pdf (365.5Kb)

Author

Mar, Jessica

Wells, Christine A.

Quackenbush, John HARVARD

0000-0002-2702-5879

Published Version

https://doi.org/10.1093/bioinformatics/btr074

Metadata

Show full item record

Citation

Mar, Jessica C., Christine A. Wells, and John Quackenbush. 2011. Defining an informativeness metric for clustering gene expression data. Bioinformatics 27(8): 1094-1100.

Abstract

Motivation: Unsupervised ‘cluster’ analysis is an invaluable tool for exploratory microarray data analysis, as it organizes the data into groups of genes or samples in which the elements share common patterns. Once the data are clustered, finding the optimal number of informative subgroups within a dataset is a problem that, while important for understanding the underlying phenotypes, is one for which there is no robust, widely accepted solution. Results: To address this problem we developed an ‘informativeness metric’ based on a simple analysis of variance statistic that identifies the number of clusters which best separate phenotypic groups. The performance of the informativeness metric has been tested on both experimental and simulated datasets, and we contrast these results with those obtained using alternative methods such as the gap statistic. Availability: The method has been implemented in the Bioconductor R package attract; it is also freely available from http://compbio.dfci.harvard.edu/pubs/attract_1.0.1.zip. Contact: jess@jimmy.harvard.edu; johnq@jimmy.harvard.edu. Supplementary information: Supplementary data are available at Bioinformatics online.

Other Sources

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3072547/pdf/

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at http://nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of-use#LAA

Citable link to this page

http://nrs.harvard.edu/urn-3:HUL.InstRepos:8579873

Collections

SPH Scholarly Articles [6362]

Contact administrator regarding this item (to report mistakes or request changes)