Publication:
Screening for interaction effects in gene expression data

Thumbnail Image

Open/View Files

Date

2017

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Castaldi, Peter J., Michael H. Cho, Liming Liang, Edwin K. Silverman, Craig P. Hersh, Kenneth Rice, and Hugues Aschard. 2017. “Screening for interaction effects in gene expression data.” PLoS ONE 12 (3): e0173847. doi:10.1371/journal.pone.0173847. http://dx.doi.org/10.1371/journal.pone.0173847.

Research Data

Abstract

Expression quantitative trait (eQTL) studies are a powerful tool for identifying genetic variants that affect levels of messenger RNA. Since gene expression is controlled by a complex network of gene-regulating factors, one way to identify these factors is to search for interaction effects between genetic variants and mRNA levels of transcription factors (TFs) and their respective target genes. However, identification of interaction effects in gene expression data pose a variety of methodological challenges, and it has become clear that such analyses should be conducted and interpreted with caution. Investigating the validity and interpretability of several interaction tests when screening for eQTL SNPs whose effect on the target gene expression is modified by the expression level of a transcription factor, we characterized two important methodological issues. First, we stress the scale-dependency of interaction effects and highlight that commonly applied transformation of gene expression data can induce or remove interactions, making interpretation of results more challenging. We then demonstrate that, in the setting of moderate to strong interaction effects on the order of what may be reasonably expected for eQTL studies, standard interaction screening can be biased due to heteroscedasticity induced by true interactions. Using simulation and real data analysis, we outline a set of reasonable minimum conditions and sample size requirements for reliable detection of variant-by-environment and variant-by-TF interactions using the heteroscedasticity consistent covariance-based approach.

Description

Keywords

Biology and Life Sciences, Genetics, Gene Expression, Physical Sciences, Mathematics, Probability Theory, Probability Distribution, Normal Distribution, Physics, Particle Physics, Elementary Particle Interactions, Fundamental Interactions, Strong Interaction, Quantum Mechanics, Quantum Chromodynamics, Mathematical and Statistical Techniques, Statistical Methods, Regression Analysis, Linear Regression Analysis, Statistics (Mathematics), Computational Biology, Genome Analysis, Genome-Wide Association Studies, Genomics, Human Genetics, Biology and life sciences, Biochemistry, Proteins, DNA-binding proteins, Transcription Factors, Gene Regulation, Regulatory Proteins, Gene Identification and Analysis, Genetic Screens, Random Variables, Covariance

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories