Models of global gene expression define major domains of cell type and tissue identity

Nucleic Acids Res. 2017 Mar 17;45(5):2354-2367. doi: 10.1093/nar/gkx054.

Abstract

The current classification of cells in an organism is largely based on their anatomic and developmental origin. Cells types and tissues are traditionally classified into those that arise from the three embryonic germ layers, the ectoderm, mesoderm and endoderm, but this model does not take into account the organization of cell type-specific patterns of gene expression. Here, we present computational models for cell type and tissue specification derived from a collection of 921 RNA-sequencing samples from 272 distinct mouse cell types or tissues. In an unbiased fashion, this analysis accurately predicts the three known germ layers. Unexpectedly, this analysis also suggests that in total there are eight major domains of cell type-specification, corresponding to the neurectoderm, neural crest, surface ectoderm, endoderm, mesoderm, blood mesoderm, germ cells and the embryonic domain. Further, we identify putative genes responsible for specifying the domain and the cell type. This model has implications for understanding trans-lineage differentiation for stem cells, developmental cell biology and regenerative medicine.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cell Differentiation
  • Cell Lineage / genetics*
  • Ectoderm / cytology
  • Ectoderm / growth & development
  • Ectoderm / metabolism*
  • Endoderm / cytology
  • Endoderm / growth & development
  • Endoderm / metabolism*
  • Gene Expression Regulation, Developmental*
  • Gene Ontology
  • Gene Regulatory Networks*
  • Mesoderm / cytology
  • Mesoderm / growth & development
  • Mesoderm / metabolism*
  • Mice
  • Molecular Sequence Annotation
  • Organ Specificity
  • Principal Component Analysis
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • Transcription Factors