Machine Learning Analysis of Longevity-Associated Gene Expression Landscapes in Mammals

Int J Mol Sci. 2021 Jan 22;22(3):1073. doi: 10.3390/ijms22031073.


One of the important questions in aging research is how differences in transcriptomics are associated with the longevity of various species. Unfortunately, at the level of individual genes, the links between expression in different organs and maximum lifespan (MLS) are yet to be fully understood. Analyses are complicated further by the fact that MLS is highly associated with other confounding factors (metabolic rate, gestation period, body mass, etc.) and that linear models may be limiting. Using gene expression from 41 mammalian species, across five organs, we constructed gene-centric regression models associating gene expression with MLS and other species traits. Additionally, we used SHapley Additive exPlanations and Bayesian networks to investigate the non-linear nature of the interrelations between the genes predicted to be determinants of species MLS. Our results revealed that expression patterns correlate with MLS, some across organs, and others in an organ-specific manner. The combination of methods employed revealed gene signatures formed by only a few genes that are highly predictive towards MLS, which could be used to identify novel longevity regulator candidates in mammals.

Keywords: cross-species analysis; longevity; mammals; maximum lifespan; transcriptomics.

MeSH terms

  • Aging
  • Algorithms
  • Animals
  • Bayes Theorem
  • Brain / metabolism
  • Computational Biology
  • Gene Expression
  • Gene Expression Profiling*
  • Humans
  • Linear Models
  • Liver / metabolism
  • Longevity / genetics*
  • Machine Learning*
  • Mammals / genetics*
  • Models, Genetic
  • RNA-Seq
  • Regression Analysis
  • Tissue Distribution
  • Transcriptome