Publication: Screening for interaction effects in gene expression data
Open/View Files
Date
2017
Published Version
Journal Title
Journal ISSN
Volume Title
Publisher
Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.
Citation
Castaldi, Peter J., Michael H. Cho, Liming Liang, Edwin K. Silverman, Craig P. Hersh, Kenneth Rice, and Hugues Aschard. 2017. “Screening for interaction effects in gene expression data.” PLoS ONE 12 (3): e0173847. doi:10.1371/journal.pone.0173847. http://dx.doi.org/10.1371/journal.pone.0173847.
Research Data
Abstract
Expression quantitative trait (eQTL) studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs) and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.
Description
Other Available Sources
Keywords
Biology and Life Sciences, Genetics, Gene Expression, Physical Sciences, Mathematics, Probability Theory, Probability Distribution, Normal Distribution, Physics, Particle Physics, Elementary Particle Interactions, Fundamental Interactions, Strong Interaction, Quantum Mechanics, Quantum Chromodynamics, Mathematical and Statistical Techniques, Statistical Methods, Regression Analysis, Linear Regression Analysis, Statistics (Mathematics), Computational Biology, Genome Analysis, Genome-Wide Association Studies, Genomics, Human Genetics, Biology and life sciences, Biochemistry, Proteins, DNA-binding proteins, Transcription Factors, Gene Regulation, Regulatory Proteins, Gene Identification and Analysis, Genetic Screens, Random Variables, Covariance
Terms of Use
This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service