A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor
- PMID: 27909575
- PMCID: PMC5112579
- DOI: 10.12688/f1000research.9501.2
A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor
Abstract
Single-cell RNA sequencing (scRNA-seq) is widely used to profile the transcriptome of individual cells. This provides biological resolution that cannot be matched by bulk RNA sequencing, at the cost of increased technical noise and data complexity. The differences between scRNA-seq and bulk RNA-seq data mean that the analysis of the former cannot be performed by recycling bioinformatics pipelines for the latter. Rather, dedicated single-cell methods are required at various steps to exploit the cellular resolution while accounting for technical noise. This article describes a computational workflow for low-level analyses of scRNA-seq data, based primarily on software packages from the open-source Bioconductor project. It covers basic steps including quality control, data exploration and normalization, as well as more complex procedures such as cell cycle phase assignment, identification of highly variable and correlated genes, clustering into subpopulations and marker gene detection. Analyses were demonstrated on gene-level count data from several publicly available datasets involving haematopoietic stem cells, brain-derived cells, T-helper cells and mouse embryonic stem cells. This will provide a range of usage scenarios from which readers can construct their own analysis pipelines.
Keywords: Bioconductor; RNA-seq; Single cell; bioinformatics; workflow.
Conflict of interest statement
Figures
Similar articles
-
Single-Cell RNA Sequencing Analysis: A Step-by-Step Overview.Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19. Methods Mol Biol. 2021. PMID: 33835452
-
Analysis of ChIP-seq Data in R/Bioconductor.Methods Mol Biol. 2018;1689:195-226. doi: 10.1007/978-1-4939-7380-4_17. Methods Mol Biol. 2018. PMID: 29027176
-
Analysis of Technical and Biological Variability in Single-Cell RNA Sequencing.Methods Mol Biol. 2019;1935:25-43. doi: 10.1007/978-1-4939-9057-3_3. Methods Mol Biol. 2019. PMID: 30758818
-
Single-Cell RNA-Seq Technologies and Related Computational Data Analysis.Front Genet. 2019 Apr 5;10:317. doi: 10.3389/fgene.2019.00317. eCollection 2019. Front Genet. 2019. PMID: 31024627 Free PMC article. Review.
-
Machine learning and statistical methods for clustering single-cell RNA-sequencing data.Brief Bioinform. 2020 Jul 15;21(4):1209-1223. doi: 10.1093/bib/bbz063. Brief Bioinform. 2020. PMID: 31243426 Review.
Cited by
-
Arid3c identifies an uncharacterized subpopulation of V2 interneurons during embryonic spinal cord development.Front Cell Neurosci. 2024 Oct 16;18:1466056. doi: 10.3389/fncel.2024.1466056. eCollection 2024. Front Cell Neurosci. 2024. PMID: 39479525 Free PMC article.
-
Define and visualize pathological architectures of human tissues from spatially resolved transcriptomics using deep learning.Comput Struct Biotechnol J. 2022 Aug 24;20:4600-4617. doi: 10.1016/j.csbj.2022.08.029. eCollection 2022. Comput Struct Biotechnol J. 2022. PMID: 36090815 Free PMC article.
-
Modeling plasticity and dysplasia of pancreatic ductal organoids derived from human pluripotent stem cells.Cell Stem Cell. 2021 Jun 3;28(6):1105-1124.e19. doi: 10.1016/j.stem.2021.03.005. Epub 2021 Apr 28. Cell Stem Cell. 2021. PMID: 33915078 Free PMC article.
-
The astroglial and stem cell functions of adult rat folliculostellate cells.Glia. 2023 Feb;71(2):205-228. doi: 10.1002/glia.24267. Epub 2022 Sep 12. Glia. 2023. PMID: 36093576 Free PMC article.
-
Single cell transcriptomes and multiscale networks from persons with and without Alzheimer's disease.Nat Commun. 2024 Jul 10;15(1):5815. doi: 10.1038/s41467-024-49790-0. Nat Commun. 2024. PMID: 38987616 Free PMC article.
References
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
