Block-based Bayesian epistasis association mapping with application to WTCCC type 1 diabetes data

DSpace/Manakin Repository

Block-based Bayesian epistasis association mapping with application to WTCCC type 1 diabetes data

Citable link to this page


Title: Block-based Bayesian epistasis association mapping with application to WTCCC type 1 diabetes data
Author: Zhang, Yu; Zhang, Jing; Liu, Jun

Note: Order does not necessarily reflect citation order of authors.

Citation: Zhang, Yu, Jing Zhang, and Jun S. Liu. 2011. “Block-Based Bayesian Epistasis Association Mapping with Application to WTCCC Type 1 Diabetes Data.” The Annals of Applied Statistics 5, no. 3: 2052–2077.
Full Text & Related Files:
Abstract: Interactions among multiple genes across the genome may contribute to the risks of many complex human diseases. Whole-genome single nucleotide polymorphisms (SNPs) data collected for many thousands of SNP markers from thousands of individuals under the case–control design promise to shed light on our understanding of such interactions. However, nearby SNPs are highly correlated due to linkage disequilibrium (LD) and the number of possible interactions is too large for exhaustive evaluation. We propose a novel Bayesian method for simultaneously partitioning SNPs into LD-blocks and selecting SNPs within blocks that are associated with the disease, either individually or interactively with other SNPs. When applied to homogeneous population data, the method gives posterior probabilities for LD-block boundaries, which not only result in accurate block partitions of SNPs, but also provide measures of partition uncertainty. When applied to case–control data for association mapping, the method implicitly filters out SNP associations created merely by LD with disease loci within the same blocks. Simulation study showed that this approach is more powerful in detecting multi-locus associations than other methods we tested, including one of ours. When applied to the WTCCC type 1 diabetes data, the method identified many previously known T1D associated genes, including PTPN22, CTLA4, MHC, and IL2RA. The method also revealed some interesting two-way associations that are undetected by single SNP methods. Most of the significant associations are located within the MHC region. Our analysis showed that the MHC SNPs form long-distance joint associations over several known recombination hotspots. By controlling the haplotypes of the MHC class II region, we identified additional associations in both MHC class I (HLA-A, HLA-B) and class III regions (BAT1). We also observed significant interactions between genes PRSS16, ZNF184 in the extended MHC region and the MHC class II genes. The proposed method can be broadly applied to the classification problem with correlated discrete covariates.
Published Version: doi:10.1214/11-AOAS469
Terms of Use: This article is made available under the terms and conditions applicable to Other Posted Material, as set forth at
Citable link to this page:
Downloads of this work:

Show full Dublin Core record

This item appears in the following Collection(s)


Search DASH

Advanced Search