Machine learning and clinical epigenetics: a review of challenges for diagnosis and classification

Clin Epigenetics. 2020 Apr 3;12(1):51. doi: 10.1186/s13148-020-00842-4.

Abstract

Background: Machine learning is a sub-field of artificial intelligence, which utilises large data sets to make predictions for future events. Although most algorithms used in machine learning were developed as far back as the 1950s, the advent of big data in combination with dramatically increased computing power has spurred renewed interest in this technology over the last two decades.

Main body: Within the medical field, machine learning is promising in the development of assistive clinical tools for detection of e.g. cancers and prediction of disease. Recent advances in deep learning technologies, a sub-discipline of machine learning that requires less user input but more data and processing power, has provided even greater promise in assisting physicians to achieve accurate diagnoses. Within the fields of genetics and its sub-field epigenetics, both prime examples of complex data, machine learning methods are on the rise, as the field of personalised medicine is aiming for treatment of the individual based on their genetic and epigenetic profiles.

Conclusion: We now have an ever-growing number of reported epigenetic alterations in disease, and this offers a chance to increase sensitivity and specificity of future diagnostics and therapies. Currently, there are limited studies using machine learning applied to epigenetics. They pertain to a wide variety of disease states and have used mostly supervised machine learning methods.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • DNA Methylation
  • Diagnosis*
  • Disease / classification*
  • Epigenesis, Genetic
  • Epigenomics*
  • Humans
  • Machine Learning*
  • Precision Medicine
  • Supervised Machine Learning
  • Unsupervised Machine Learning