Show simple item record

dc.contributor.authorHsu, Jessie J.en_US
dc.contributor.authorFinkelstein, Dianne M.en_US
dc.contributor.authorSchoenfeld, David A.en_US
dc.date.accessioned2015-12-04T18:15:45Z
dc.date.issued2015en_US
dc.identifier.citationHsu, Jessie J., Dianne M. Finkelstein, and David A. Schoenfeld. 2015. “Outcome-Driven Cluster Analysis with Application to Microarray Data.” PLoS ONE 10 (11): e0141874. doi:10.1371/journal.pone.0141874. http://dx.doi.org/10.1371/journal.pone.0141874.en
dc.identifier.issn1932-6203en
dc.identifier.urihttp://nrs.harvard.edu/urn-3:HUL.InstRepos:23845370
dc.description.abstractOne goal of cluster analysis is to sort characteristics into groups (clusters) so that those in the same group are more highly correlated to each other than they are to those in other groups. An example is the search for groups of genes whose expression of RNA is correlated in a population of patients. These genes would be of greater interest if their common level of RNA expression were additionally predictive of the clinical outcome. This issue arose in the context of a study of trauma patients on whom RNA samples were available. The question of interest was whether there were groups of genes that were behaving similarly, and whether each gene in the cluster would have a similar effect on who would recover. For this, we develop an algorithm to simultaneously assign characteristics (genes) into groups of highly correlated genes that have the same effect on the outcome (recovery). We propose a random effects model where the genes within each group (cluster) equal the sum of a random effect, specific to the observation and cluster, and an independent error term. The outcome variable is a linear combination of the random effects of each cluster. To fit the model, we implement a Markov chain Monte Carlo algorithm based on the likelihood of the observed data. We evaluate the effect of including outcome in the model through simulation studies and describe a strategy for prediction. These methods are applied to trauma data from the Inflammation and Host Response to Injury research program, revealing a clustering of the genes that are informed by the recovery outcome.en
dc.language.isoen_USen
dc.publisherPublic Library of Scienceen
dc.relation.isversionofdoi:10.1371/journal.pone.0141874en
dc.relation.hasversionhttp://www.ncbi.nlm.nih.gov/pmc/articles/PMC4643008/pdf/en
dash.licenseLAAen_US
dc.titleOutcome-Driven Cluster Analysis with Application to Microarray Dataen
dc.typeJournal Articleen_US
dc.description.versionVersion of Recorden
dc.relation.journalPLoS ONEen
dash.depositing.authorHsu, Jessie J.en_US
dc.date.available2015-12-04T18:15:45Z
dc.identifier.doi10.1371/journal.pone.0141874*
dash.contributor.affiliatedHsu, Hao-Ru
dash.contributor.affiliatedSchoenfeld, David
dash.contributor.affiliatedFinkelstein, Dianne


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record