Publication:

WISARD: workbench for integrated superfast association studies for related datasets

Loading...
Thumbnail Image

Open/View Files

Date

2018

Journal Title

Journal ISSN

Volume Title

Publisher

BioMed Central
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Lee, Sungyoung, Sungkyoung Choi, Dandi Qiao, Michael Cho, Edwin K. Silverman, Taesung Park, and Sungho Won. 2018. “WISARD: workbench for integrated superfast association studies for related datasets.” BMC Medical Genomics 11 (Suppl 2): 39. doi:10.1186/s12920-018-0345-y. http://dx.doi.org/10.1186/s12920-018-0345-y.

Abstract

Background: A Mendelian transmission produces phenotypic and genetic relatedness between family members, giving family-based analytical methods an important role in genetic epidemiological studies—from heritability estimations to genetic association analyses. With the advance in genotyping technologies, whole-genome sequence data can be utilized for genetic epidemiological studies, and family-based samples may become more useful for detecting de novo mutations. However, genetic analyses employing family-based samples usually suffer from the complexity of the computational/statistical algorithms, and certain types of family designs, such as incorporating data from extended families, have rarely been used. Results: We present a Workbench for Integrated Superfast Association studies for Related Data (WISARD) programmed in C/C++. WISARD enables the fast and a comprehensive analysis of SNP-chip and next-generation sequencing data on extended families, with applications from designing genetic studies to summarizing analysis results. In addition, WISARD can automatically be run in a fully multithreaded manner, and the integration of R software for visualization makes it more accessible to non-experts. Conclusions: Comparison with existing toolsets showed that WISARD is computationally suitable for integrated analysis of related subjects, and demonstrated that WISARD outperforms existing toolsets. WISARD has also been successfully utilized to analyze the large-scale massive sequencing dataset of chronic obstructive pulmonary disease data (COPD), and we identified multiple genes associated with COPD, which demonstrates its practical value. Electronic supplementary material The online version of this article (10.1186/s12920-018-0345-y) contains supplementary material, which is available to authorized users.

Description

Research Data

Keywords

Family-based design, Genome-wide association analyses, Next generation sequencing, Multi-threaded analyses, Related samples

Terms of Use

This article is made available under the terms and conditions applicable to Other Posted Material (LAA), as set forth at Terms of Service

Endorsement

Review

Supplemented By

Related Stories