Predicting environmentally responsive transgenerational differential DNA methylated regions (epimutations) in the genome using a hybrid deep-machine learning approach

BMC Bioinformatics. 2021 Nov 30;22(1):575. doi: 10.1186/s12859-021-04491-z.

Abstract

Background: Deep learning is an active bioinformatics artificial intelligence field that is useful in solving many biological problems, including predicting altered epigenetics such as DNA methylation regions. Deep learning (DL) can learn an informative representation that addresses the need for defining relevant features. However, deep learning models are computationally expensive, and they require large training datasets to achieve good classification performance.

Results: One approach to addressing these challenges is to use a less complex deep learning network for feature selection and Machine Learning (ML) for classification. In the current study, we introduce a hybrid DL-ML approach that uses a deep neural network for extracting molecular features and a non-DL classifier to predict environmentally responsive transgenerational differential DNA methylated regions (DMRs), termed epimutations, based on the extracted DL-based features. Various environmental toxicant induced epigenetic transgenerational inheritance sperm epimutations were used to train the model on the rat genome DNA sequence and use the model to predict transgenerational DMRs (epimutations) across the entire genome.

Conclusion: The approach was also used to predict potential DMRs in the human genome. Experimental results show that the hybrid DL-ML approach outperforms deep learning and traditional machine learning methods.

Keywords: Artificial intelligence; DNA methylation; Deep learning; Epigenetics; Epimutation; Machine learning; Transgenerational.

MeSH terms

  • Animals
  • Artificial Intelligence*
  • DNA
  • DNA Methylation*
  • Epigenesis, Genetic
  • Genome, Human
  • Humans
  • Machine Learning
  • Rats

Substances

  • DNA