Publication:

Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data

Loading...
Thumbnail Image

Date

2021-10-05

Journal Title

Journal ISSN

Volume Title

Publisher

Research Square Platform LLC
The Harvard community has made this article openly available. Please share how this access benefits you.

Research Projects

Organizational Units

Journal Issue

Citation

Zhang, Jinye, Kangcheng Hou, Kushal Dey, Saori Sakaue, Karthik Jagadeesh, Kathryn Weinand, Aris Taychameekiatchai et al. "Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data." No Journal No Volume. DOI: 10.21203/rs.3.rs-933790/v1

Abstract

Gene expression at the individual cell-level resolution, as quantified by single-cell RNA-sequencing (scRNA-seq), can provide unique insights into the pathology and cellular origin of diseases and complex traits. Here, we introduce single-cell Disease Relevance Score (scDRS), an approach that links scRNA-seq with polygenic risk of disease at individual cell resolution; scDRS identifies individual cells that show excess expression levels for genes in a disease-specific gene set constructed from GWAS data. We determined via simulations that scDRS is well-calibrated and powerful in identifying individual cells associated to disease. We applied scDRS to GWAS data from 74 diseases and complex traits (average N=341K) in conjunction with 16 scRNA-seq data sets spanning 1.3 million cells from 31 tissues and organs. At the cell type level, scDRS broadly recapitulated known links between classical cell types and disease, and also produced novel biologically plausible findings. At the individual cell level, scDRS identified subpopulations of disease-associated cells that are not captured by existing cell type labels, including subpopulations of CD4+ T cells associated with inflammatory bowel disease, partially characterized by their effector-like states; subpopulations of hippocampal CA1 pyramidal neurons associated with schizophrenia, partially characterized by their spatial location at the proximal part of the hippocampal CA1 region; and subpopulations of hepatocytes associated with triglyceride levels, partially characterized by their higher ploidy levels. At the gene level, we determined that genes whose expression across individual cells was correlated with the scDRS score (thus reflecting co-expression with GWAS disease genes) were strongly enriched for gold-standard drug target and Mendelian disease genes.

Description

Other Available Sources

Research Data

Keywords

Terms of Use

Metadata Only

Endorsement

Review

Supplemented By

Related Stories