Vector Algebra in the Analysis of Genome-Wide Expression Data

DSpace/Manakin Repository

Vector Algebra in the Analysis of Genome-Wide Expression Data

Show simple item record Kuruvilla, Finny G Park, Peter J. Schreiber, Stuart L. 2010-10-04T18:50:22Z 2002
dc.identifier.citation Kuruvilla, Finny G., Peter J. Park, and Stuart L. Schreiber. 2002. Vector algebra in the analysis of genome-wide expression data. Genome Biology 3(3): research0011.1-research0011.11. en_US
dc.identifier.issn 1465-6906 en_US
dc.description.abstract Background: Data from thousands of transcription-profiling experiments in organisms ranging from yeast to humans are now publicly available. How best to analyze these data remains an important challenge. A variety of tools have been used for this purpose, including hierarchical clustering, self-organizing maps and principal components analysis. In particular, concepts from vector algebra have proven useful in the study of genome-wide expression data. Results: Here we present a framework based on vector algebra for the analysis of transcription profiles that is geometrically intuitive and computationally efficient. Concepts in vector algebra such as angles, magnitudes, subspaces, singular value decomposition, bases and projections have natural and powerful interpretations in the analysis of microarray data. Angles in particular offer a rigorous method of defining 'similarity' and are useful in evaluating the claims of a microarray-based study. We present a sample analysis of cells treated with rapamycin, an immunosuppressant whose effects have been extensively studied with microarrays. In addition, the algebraic concept of a basis for a space affords the opportunity to simplify data analysis and uncover a limited number of expression vectors to span the transcriptional range of cell behavior. Conclusions: This framework represents a compact, powerful and scalable construction for analysis and computation. As the amount of microarray data in the public domain grows, these vector-based methods are relevant in determining statistical significance. These approaches are also well suited to extract biologically meaningful information in the analysis of signaling networks. en_US
dc.description.sponsorship Chemistry and Chemical Biology en_US
dc.language.iso en_US en_US
dc.publisher BioMed Central en_US
dc.relation.isversionof en_US
dc.relation.hasversion en_US
dash.license LAA
dc.title Vector Algebra in the Analysis of Genome-Wide Expression Data en_US
dc.type Journal Article en_US
dc.description.version Accepted Manuscript en_US
dc.relation.journal Genome Biology en_US Schreiber, Stuart L. 2010-10-04T18:50:22Z

Files in this item

Files Size Format View
88809.pdf 356.9Kb PDF View/Open

This item appears in the following Collection(s)

  • FAS Scholarly Articles [6463]
    Peer reviewed scholarly articles from the Faculty of Arts and Sciences of Harvard University

Show simple item record


Search DASH

Advanced Search