SCPortalen: human and mouse single-cell centric database

Nucleic Acids Res. 2018 Jan 4;46(D1):D781-D787. doi: 10.1093/nar/gkx949.


Published single-cell datasets are rich resources for investigators who want to address questions not originally asked by the creators of the datasets. These datasets may have been generated with different protocols and diverse analysis strategies. The main challenge in using such single-cell data is making the various large-scale datasets comparable and reusable in a different context. To address this issue, we developed the single-cell centric database 'SCPortalen'. The current version of the database covers human and mouse single-cell transcriptomics datasets that are publicly available from the INSDC sites. The original metadata were manually curated, and single-cell samples were annotated with standard ontology terms. Common quality assessment procedures were then conducted to check the quality of the raw sequences. Furthermore, primary processing of the raw data, followed by advanced analyses and interpretation, was performed from scratch using our pipeline. In addition to the transcriptomics data, SCPortalen provides access to single-cell image files whenever available. The target users of SCPortalen are all researchers interested in specific cell types or population heterogeneity. Through the SCPortalen web interface, users can easily search, explore and download the single-cell datasets of interest to them.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Data Accuracy
  • Data Curation
  • Databases, Genetic*
  • Datasets as Topic*
  • Gene Expression
  • Gene Ontology
  • Humans
  • Mice / genetics*
  • Molecular Sequence Annotation
  • Single-Cell Analysis*
  • Transcriptome*
  • User-Computer Interface
  • Workflow