Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data

Nat Genet. 2022 Oct;54(10):1572-1580. doi: 10.1038/s41588-022-01167-z. Epub 2022 Sep 1.

Abstract

Single-cell RNA sequencing (scRNA-seq) provides unique insights into the pathology and cellular origin of disease. We introduce single-cell disease relevance score (scDRS), an approach that links scRNA-seq with polygenic disease risk at single-cell resolution, independent of annotated cell types. scDRS identifies cells exhibiting excess expression across disease-associated genes implicated by genome-wide association studies (GWASs). We applied scDRS to 74 diseases/traits and 1.3 million single-cell gene-expression profiles across 31 tissues/organs. Cell-type-level results broadly recapitulated known cell-type-disease associations. Individual-cell-level results identified subpopulations of disease-associated cells not captured by existing cell-type labels, including T cell subpopulations associated with inflammatory bowel disease, partially characterized by their effector-like states; neuron subpopulations associated with schizophrenia, partially characterized by their spatial locations; and hepatocyte subpopulations associated with triglyceride levels, partially characterized by their higher ploidy levels. Genes whose expression was correlated with the scDRS score across cells (reflecting coexpression with GWAS disease-associated genes) were strongly enriched for gold-standard drug target and Mendelian disease genes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Expression Profiling / methods
  • Genome-Wide Association Study*
  • Multifactorial Inheritance / genetics
  • RNA-Seq
  • Single-Cell Analysis* / methods
  • Triglycerides

Substances

  • Triglycerides