scNET: learning context-specific gene and cell embeddings by integrating single-cell gene expression data with protein-protein interactions

Nat Methods. 2025 Apr;22(4):708-716. doi: 10.1038/s41592-025-02627-0. Epub 2025 Mar 17.

Abstract

Recent advances in single-cell RNA sequencing (scRNA-seq) techniques have provided unprecedented insights into the heterogeneity of various tissues. However, gene expression data alone often fails to capture and identify changes in cellular pathways and complexes, as they are more discernible at the protein level. Moreover, analyzing scRNA-seq data presents further challenges due to inherent characteristics such as high noise levels and zero inflation. In this study, we propose an approach to address these limitations by integrating scRNA-seq datasets with a protein-protein interaction network. Our method utilizes a unique dual-view architecture based on graph neural networks, enabling joint representation of gene expression and protein-protein interaction network data. This approach models gene-to-gene relationships under specific biological contexts and refines cell-cell relations using an attention mechanism. Next, through comprehensive evaluations, we demonstrate that scNET better captures gene annotation, pathway characterization and gene-gene relationship identification, while improving cell clustering and pathway analysis across diverse cell types and biological conditions.

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology* / methods
  • Gene Expression Profiling* / methods
  • Humans
  • Neural Networks, Computer
  • Protein Interaction Maps* / genetics
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis* / methods