Person: Hide, Winston
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
First Name
Name
Search Results
Publication ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level
(Oxford University Press, 2010) Rocca-Serra, Philippe; Brandizi, Marco; Maguire, Eamonn; Sklyar, Nataliya; Taylor, Chris; Begley, Kimberly; Field, Dawn; Harris, Stephen; Hide, Winston; Hofmann, Oliver; Neumann, Steffen; Sterk, Peter; Tong, Weida; Sansone, Susanna-AssuntaSummary: The first open source software suite for experimentalists and curators that (i) assists in the annotation and local management of experimental metadata from high-throughput studies employing one or a combination of omics and other technologies; (ii) empowers users to uptake community-defined checklists and ontologies; and (iii) facilitates submission to international public repositories. Availability and Implementation: Software, documentation, case studies and implementations at http://www.isa-tools.org
Publication Prenatal Lead Levels, Plasma Amyloid β Levels, and Gene Expression in Young Adulthood
(National Institute of Environmental Health Sciences, 2012) Mazumdar, Maitreyi; Xia, Weiming; Hofmann, Oliver; Gregas, Matthew; Sui, Shannan Ho; Hide, Winston; Yang, Ting; Needleman, Herbert L.; Bellinger, DavidBackground: Animal studies suggest that early-life lead exposure influences gene expression and production of proteins associated with Alzheimer’s disease (AD). Objectives: We attempted to assess the relationship between early-life lead exposure and potential biomarkers for AD among young men and women. We also attempted to assess whether early-life lead exposure was associated with changes in expression of AD-related genes. Methods: We used sandwich enzyme-linked immunosorbent assays (ELISA) to measure plasma concentrations of amyloid β proteins Aβ40 and Aβ42 among 55 adults who had participated as newborns and young children in a prospective cohort study of the effects of lead exposure on development. We used RNA microarray techniques to analyze gene expression. Results: Mean plasma Aβ:42 concentrations were lower among 13 participants with high umbilical cord blood lead concentrations (≥ 10 μg/dL) than in 42 participants with lower cord blood lead concentrations (p = 0.08). Among 10 participants with high prenatal lead exposure, we found evidence of an inverse relationship between umbilical cord lead concentration and expression of ADAM metallopeptidase domain 9 (ADAM9), reticulon 4 (RTN4), and low-density lipoprotein receptor-related protein associated protein 1 (LRPAP1) genes, whose products are believed to affect Aβ production and deposition. Gene network analysis suggested enrichment in gene sets involved in nerve growth and general cell development. Conclusions: Data from our exploratory study suggest that prenatal lead exposure may influence Aβ-related biological pathways that have been implicated in AD onset. Gene network analysis identified further candidates to study the mechanisms of developmental lead neurotoxicity.
Publication Population Differences in Transcript-Regulator Expression Quantitative Trait Loci
(Public Library of Science, 2012) Bushel, Pierre R.; McGovern, Ray; Liu, Liwen; Hofmann, Oliver; Huda, Ahsan; Lu, Jun; Hide, Winston; Lin, XihongGene expression quantitative trait loci (eQTL) are useful for identifying single nucleotide polymorphisms (SNPs) associated with diseases. At times, a genetic variant may be associated with a master regulator involved in the manifestation of a disease. The downstream target genes of the master regulator are typically co-expressed and share biological function. Therefore, it is practical to screen for eQTLs by identifying SNPs associated with the targets of a transcript-regulator (TR). We used a multivariate regression with the gene expression of known targets of TRs and SNPs to identify TReQTLs in European (CEU) and African (YRI) HapMap populations. A nominal p-value of <1 x 10(^{-6})revealed 234 SNPs in CEU and 154 in YRI as TReQTLs. These represent 36 independent (tag) SNPs in CEU and 39 in YRI affecting the downstream targets of 25 and 36 TRs respectively. At a false discovery rate (FDR) = 45%, one cis-acting tag SNP (within 1 kb of a gene) in each population was identified as a TReQTL. In CEU, the SNP (rs16858621) in Pcnxl2 was found to be associated with the genes regulated by CREM whereas in YRI, the SNP (rs16909324) was linked to the targets of miRNA hsa-miR-125a. To infer the pathways thatregulate expression, we ranked TReQTLs by connectivity within the structure of biological process subtrees. One TReQTL SNP (rs3790904) in CEU maps to Lphn2 and is associated (nominal p-value = 8.1 x 10(^{-7})) with the targets of the X-linked breast cancer suppressor Foxp3. The structure of the biological process subtree and a gene interaction network of the TReQTL revealed that tumor necrosis factor, NF-kappaB and variants in G-protein coupled receptors signaling may play a central role as communicators in Foxp3 functional regulation. The potential pleiotropic effect of the Foxp3 TReQTLs was gleaned from integrating mRNA-Seq data and SNP-set enrichment into the analysis.
Publication Integrating Murine Gene Expression Studies to Understand Obstructive Lung Disease due to Chronic Inhaled Endotoxin
(Public Library of Science, 2013) Lai, Peggy; Hofmann, Oliver; Baron, Rebecca; Cernadas, Manuela; Meng, Quanxin Ryan; Bresler, Herbert S.; Brass, David M.; Yang, Ivana V.; Schwartz, David A.; Christiani, David; Hide, WinstonRationale: Endotoxin is a near ubiquitous environmental exposure that that has been associated with both asthma and chronic obstructive pulmonary disease (COPD). These obstructive lung diseases have a complex pathophysiology, making them difficult to study comprehensively in the context of endotoxin. Genome-wide gene expression studies have been used to identify a molecular snapshot of the response to environmental exposures. Identification of differentially expressed genes shared across all published murine models of chronic inhaled endotoxin will provide insight into the biology underlying endotoxin-associated lung disease. Methods: We identified three published murine models with gene expression profiling after repeated low-dose inhaled endotoxin. All array data from these experiments were re-analyzed, annotated consistently, and tested for shared genes found to be differentially expressed. Additional functional comparison was conducted by testing for significant enrichment of differentially expressed genes in known pathways. The importance of this gene signature in smoking-related lung disease was assessed using hierarchical clustering in an independent experiment where mice were exposed to endotoxin, smoke, and endotoxin plus smoke. Results: A 101-gene signature was detected in three murine models, more than expected by chance. The three model systems exhibit additional similarity beyond shared genes when compared at the pathway level, with increasing enrichment of inflammatory pathways associated with longer duration of endotoxin exposure. Genes and pathways important in both asthma and COPD were shared across all endotoxin models. Mice exposed to endotoxin, smoke, and smoke plus endotoxin were accurately classified with the endotoxin gene signature. Conclusions: Despite the differences in laboratory, duration of exposure, and strain of mouse used in three experimental models of chronic inhaled endotoxin, surprising similarities in gene expression were observed. The endotoxin component of tobacco smoke may play an important role in disease development.
Publication The Stem Cell Discovery Engine: An Integrated Repository and Analysis System for Cancer Stem Cell Comparisons
(Oxford University Press, 2011) Begley, Kimberly; Reilly, Dorothy; McGovern, Ray; Rocca-Sera, Philippe; Maguire, Eamonn; Altschuler, Gabriel M.; Culhane, Aedín C.; Correll, Mick; Sansone, Susanna-Assunta; Ho Sui, Shannan; Chapman, Brad; Hansen, Terah; Sompallae, Ramakrishna Rao; Krivtsov, Andrei; Shivdasani, Ramesh; Armstrong, Scott; Hofmann, Oliver; Hide, WinstonMounting evidence suggests that malignant tumors are initiated and maintained by a subpopulation of cancerous cells with biological properties similar to those of normal stem cells. However, descriptions of stem-like gene and pathway signatures in cancers are inconsistent across experimental systems. Driven by a need to improve our understanding of molecular processes that are common and unique across cancer stem cells (CSCs), we have developed the Stem Cell Discovery Engine (SCDE)—an online database of curated CSC experiments coupled to the Galaxy analytical framework. The SCDE allows users to consistently describe, share and compare CSC data at the gene and pathway level. Our initial focus has been on carefully curating tissue and cancer stem cell-related experiments from blood, intestine and brain to create a high quality resource containing 53 public studies and 1098 assays. The experimental information is captured and stored in the multi-omics Investigation/Study/Assay (ISA-Tab) format and can be queried in the data repository. A linked Galaxy framework provides a comprehensive, flexible environment populated with novel tools for gene list comparisons against molecular signatures in GeneSigDB and MSigDB, curated experiments in the SCDE and pathways in WikiPathways. The SCDE is available at http://discovery.hsci.harvard.edu.
Publication Integrated Genomic Analysis of Diverse Induced Pluripotent Stem Cells from the Progenitor Cell Biology Consortium
(Elsevier, 2016) Salomonis, Nathan; Dexheimer, Phillip J.; Omberg, Larsson; Schroll, Robin; Bush, Stacy; Huo, Jeffrey; Schriml, Lynn; Ho Sui, Shannan; Keddache, Mehdi; Mayhew, Christopher; Shanmukhappa, Shiva Kumar; Wells, James; Daily, Kenneth; Hubler, Shane; Wang, Yuliang; Zambidis, Elias; Margolin, Adam; Hide, Winston; Hatzopoulos, Antonis K.; Malik, Punam; Cancelas, Jose A.; Aronow, Bruce J.; Lutzko, CarolynSummary The rigorous characterization of distinct induced pluripotent stem cells (iPSC) derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community.
Publication A comprehensive promoter landscape identifies a novel promoter for CD133 in restricted tissues, cancers, and stem cells
(Frontiers Media S.A., 2013) Sompallae, Ramakrishna; Hofmann, Oliver; Maher, Christopher A.; Gedye, Craig; Behren, Andreas; Vitezic, Morana; Daub, Carsten O.; Devalle, Sylvie; Caballero, Otavia L.; Carninci, Piero; Hayashizaki, Yoshihide; Lawlor, Elizabeth R.; Cebon, Jonathan; Hide, WinstonPROM1 is the gene encoding prominin-1 or CD133, an important cell surface marker for the isolation of both normal and cancer stem cells. PROM1 transcripts initiate at a range of transcription start sites (TSS) associated with distinct tissue and cancer expression profiles. Using high resolution Cap Analysis of Gene Expression (CAGE) sequencing we characterize TSS utilization across a broad range of normal and developmental tissues. We identify a novel proximal promoter (P6) within CD133+ melanoma cell lines and stem cells. Additional exon array sampling finds P6 to be active in populations enriched for mesenchyme, neural stem cells and within CD133+ enriched Ewing sarcomas. The P6 promoter is enriched with respect to previously characterized PROM1 promoters for a HMGI/Y (HMGA1) family transcription factor binding site motif and exhibits different epigenetic modifications relative to the canonical promoter region of PROM1.
Publication Gateways to the FANTOM5 promoter level mammalian expression atlas
(BioMed Central, 2015) Lizio, Marina; Harshbarger, Jayson; Shimoji, Hisashi; Severin, Jessica; Kasukawa, Takeya; Sahin, Serkan; Abugessaisa, Imad; Fukuda, Shiro; Hori, Fumi; Ishikawa-Kato, Sachi; Mungall, Christopher J; Arner, Erik; Baillie, J Kenneth; Bertin, Nicolas; Bono, Hidemasa; de Hoon, Michiel; Diehl, Alexander D; Dimont, Emmanuel; Freeman, Tom C; Fujieda, Kaori; Hide, Winston; Kaliyaperumal, Rajaram; Katayama, Toshiaki; Lassmann, Timo; Meehan, Terrence F; Nishikata, Koro; Ono, Hiromasa; Rehli, Michael; Sandelin, Albin; Schultes, Erik A; ‘t Hoen, Peter AC; Tatum, Zuotian; Thompson, Mark; Toyoda, Tetsuro; Wright, Derek W; Daub, Carsten O; Itoh, Masayoshi; Carninci, Piero; Hayashizaki, Yoshihide; Forrest, Alistair RR; Kawaji, HideyaThe FANTOM5 project investigates transcription initiation activities in more than 1,000 human and mouse primary cells, cell lines and tissues using CAGE. Based on manual curation of sample information and development of an ontology for sample classification, we assemble the resulting data into a centralized data resource (http://fantom.gsc.riken.jp/5/). This resource contains web-based tools and data-access points for the research community to search and extract data related to samples, genes, promoter activities, transcription factors and enhancers across the FANTOM5 atlas. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0560-6) contains supplementary material, which is available to authorized users.
Publication edgeRun: an R package for sensitive, functionally relevant differential expression discovery using an unconditional exact test
(Oxford University Press, 2015) Dimont, Emmanuel; Shi, Jiantao; Kirchner, Rory; Hide, WinstonSummary: Next-generation sequencing platforms for measuring digital expression such as RNA-Seq are displacing traditional microarray-based methods in biological experiments. The detection of differentially expressed genes between groups of biological conditions has led to the development of numerous bioinformatics tools, but so far, few exploit the expanded dynamic range afforded by the new technologies. We present edgeRun, an R package that implements an unconditional exact test that is a more powerful version of the exact test in edgeR. This increase in power is especially pronounced for experiments with as few as two replicates per condition, for genes with low total expression and with large biological coefficient of variation. In comparison with a panel of other tools, edgeRun consistently captures functionally similar differentially expressed genes. Availability and implementation: The package is freely available under the MIT license from CRAN (http://cran.r-project.org/web/packages/edgeRun). Contact: edimont@mail.harvard.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Publication The Stem Cell Commons: an exemplar for data integration in the biomedical domain driven by the ISA framework
(American Medical Informatics Association, 2013) Sui, Shannan Ho; Merrill, Emily; Gehlenborg, Nils; Haseley, Psalm; Sytchev, Ilya; Park, Richard; Rocca-Serra, Philippe; Corlosquet, Stephane; Gonzalez-Beltran, Alejandra; Maguire, Eamonn; Hofmann, Oliver; Park, Peter; Das, Sudeshna; Sansone, Susanna-Assunta; Hide, WinstonComparisons of stem cell experiments at both molecular and semantic levels remain challenging due to inconsistencies in results, data formats, and descriptions among biomedical research discoveries. The Harvard Stem Cell Institute (HSCI) has created the Stem Cell Commons (stemcellcommons.org), an open, community-based approach to data sharing. Experimental information is integrated using the Investigation-Study-Assay tabular format (ISA-Tab) used by over 30 organizations (ISA Commons, isacommons.org). The early adoption of this format permitted the novel integration of three independent systems to facilitate stem cell data storage, exchange and analysis: the Blood Genomics Repository, the Stem Cell Discovery Engine, and the new Refinery platform that links the Galaxy analytical engine to data repositories.
- «
- 1 (current)
- 2
- 3
- »