Person: Chen, Cho-Yi
Loading...
Email Address
AA Acceptance Date
Birth Date
Research Projects
Organizational Units
Job Title
Last Name
Chen
First Name
Cho-Yi
Name
Chen, Cho-Yi
Search Results
Now showing 1 - 2 of 2
Publication Regulatory network changes between cell lines and their tissues of origin(BioMed Central, 2017) Lopes-Ramos, Camila; Paulson, Joseph; Chen, Cho-Yi; Kuijjer, Marieke; Fagny, Maud; Platig, John; Sonawane, Abhijeet; Demeo, Dawn; Quackenbush, John; Glass, KimberlyBackground: Cell lines are an indispensable tool in biomedical research and often used as surrogates for tissues. Although there are recognized important cellular and transcriptomic differences between cell lines and tissues, a systematic overview of the differences between the regulatory processes of a cell line and those of its tissue of origin has not been conducted. The RNA-Seq data generated by the GTEx project is the first available data resource in which it is possible to perform a large-scale transcriptional and regulatory network analysis comparing cell lines with their tissues of origin. Results: We compared 127 paired Epstein-Barr virus transformed lymphoblastoid cell lines (LCLs) and whole blood samples, and 244 paired primary fibroblast cell lines and skin samples. While gene expression analysis confirms that these cell lines carry the expression signatures of their primary tissues, albeit at reduced levels, network analysis indicates that expression changes are the cumulative result of many previously unreported alterations in transcription factor (TF) regulation. More specifically, cell cycle genes are over-expressed in cell lines compared to primary tissues, and this alteration in expression is a result of less repressive TF targeting. We confirmed these regulatory changes for four TFs, including SMAD5, using independent ChIP-seq data from ENCODE. Conclusions: Our results provide novel insights into the regulatory mechanisms controlling the expression differences between cell lines and tissues. The strong changes in TF regulation that we observe suggest that network changes, in addition to transcriptional levels, should be considered when using cell lines as models for tissues. Electronic supplementary material The online version of this article (10.1186/s12864-017-4111-x) contains supplementary material, which is available to authorized users.Publication Tissue-aware RNA-Seq processing and normalization for heterogeneous and sparse data(BioMed Central, 2017) Paulson, Joseph; Chen, Cho-Yi; Lopes-Ramos, Camila; Kuijjer, Marieke; Platig, John; Sonawane, Abhijeet; Fagny, Maud; Glass, Kimberly; Quackenbush, JohnBackground: Although ultrahigh-throughput RNA-Sequencing has become the dominant technology for genome-wide transcriptional profiling, the vast majority of RNA-Seq studies typically profile only tens of samples, and most analytical pipelines are optimized for these smaller studies. However, projects are generating ever-larger data sets comprising RNA-Seq data from hundreds or thousands of samples, often collected at multiple centers and from diverse tissues. These complex data sets present significant analytical challenges due to batch and tissue effects, but provide the opportunity to revisit the assumptions and methods that we use to preprocess, normalize, and filter RNA-Seq data – critical first steps for any subsequent analysis. Results: We find that analysis of large RNA-Seq data sets requires both careful quality control and the need to account for sparsity due to the heterogeneity intrinsic in multi-group studies. We developed Yet Another RNA Normalization software pipeline (YARN), that includes quality control and preprocessing, gene filtering, and normalization steps designed to facilitate downstream analysis of large, heterogeneous RNA-Seq data sets and we demonstrate its use with data from the Genotype-Tissue Expression (GTEx) project. Conclusions: An R package instantiating YARN is available at http://bioconductor.org/packages/yarn. Electronic supplementary material The online version of this article (10.1186/s12859-017-1847-x) contains supplementary material, which is available to authorized users.