Publication:
HD CAGnome: A Search Tool for Huntingtin CAG Repeat Length-Correlated Genes

Thumbnail Image

Open/View Files

Date

2014

Journal Title

Journal ISSN

Volume Title

Publisher

Public Library of Science
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Galkina, Ekaterina I., Aram Shin, Kathryn R. Coser, Toshi Shioda, Isaac S. Kohane, Ihn Sik Seong, Vanessa C. Wheeler, James F. Gusella, Marcy E. MacDonald, and Jong-Min Lee. 2014. “HD CAGnome: A Search Tool for Huntingtin CAG Repeat Length-Correlated Genes.” PLoS ONE 9 (4): e95556. doi:10.1371/journal.pone.0095556. http://dx.doi.org/10.1371/journal.pone.0095556.

Research Data

Abstract

Background: The length of the huntingtin (HTT) CAG repeat is strongly correlated with both age at onset of Huntington’s disease (HD) symptoms and age at death of HD patients. Dichotomous analysis comparing HD to controls is widely used to study the effects of HTT CAG repeat expansion. However, a potentially more powerful approach is a continuous analysis strategy that takes advantage of all of the different CAG lengths, to capture effects that are expected to be critical to HD pathogenesis. Methodology/Principal Findings We used continuous and dichotomous approaches to analyze microarray gene expression data from 107 human control and HD lymphoblastoid cell lines. Of all probes found to be significant in a continuous analysis by CAG length, only 21.4% were so identified by a dichotomous comparison of HD versus controls. Moreover, of probes significant by dichotomous analysis, only 33.2% were also significant in the continuous analysis. Simulations revealed that the dichotomous approach would require substantially more than 107 samples to either detect 80% of the CAG-length correlated changes revealed by continuous analysis or to reduce the rate of significant differences that are not CAG length-correlated to 20% (n = 133 or n = 206, respectively). Given the superior power of the continuous approach, we calculated the correlation structure between HTT CAG repeat lengths and gene expression levels and created a freely available searchable website, “HD CAGnome,” that allows users to examine continuous relationships between HTT CAG and expression levels of ∼20,000 human genes. Conclusions/Significance: Our results reveal limitations of dichotomous approaches compared to the power of continuous analysis to study a disease where human genotype-phenotype relationships strongly support a role for a continuum of CAG length-dependent changes. The compendium of HTT CAG length-gene expression level relationships found at the HD CAGnome now provides convenient routes for discovery of candidates influenced by the HD mutation.

Description

Keywords

Biology and Life Sciences, Computational Biology, Genome Analysis, Transcriptome Analysis, Genome Expression Analysis, Genetics, Genetic Dominance, Autosomal Dominant Diseases, Huntington Disease, Autosomal Dominant Traits, Genomics, Human Genetics

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Referenced By

Related Stories