Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Aug 26;17(8):e1009291.
doi: 10.1371/journal.pcbi.1009291. eCollection 2021 Aug.

Review of machine learning methods for RNA secondary structure prediction

Affiliations
Review

Review of machine learning methods for RNA secondary structure prediction

Qi Zhao et al. PLoS Comput Biol. .

Abstract

Secondary structure plays an important role in determining the function of noncoding RNAs. Hence, identifying RNA secondary structures is of great value to research. Computational prediction is a mainstream approach for predicting RNA secondary structure. Unfortunately, even though new methods have been proposed over the past 40 years, the performance of computational prediction methods has stagnated in the last decade. Recently, with the increasing availability of RNA structure data, new methods based on machine learning (ML) technologies, especially deep learning, have alleviated the issue. In this review, we provide a comprehensive overview of RNA secondary structure prediction methods based on ML technologies and a tabularized summary of the most important methods in this field. The current pending challenges in the field of RNA secondary structure prediction and future trends are also discussed.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1
RNA primary (left), secondary (middle), and tertiary structures (right). The RNA folding process is hierarchical, i.e., the RNA secondary structure forms rapidly from linear RNA (primary structure) with a large energy loss, while the formation of a complex tertiary structure is usually much slower.
Fig 2
Fig 2. Framework for RNA secondary structure prediction methods with ML-based score schemes. Wet lab data, RNA sequence data, or RNA structure data can be employed to train an ML model to obtain a score scheme.
Using this score scheme, an RNA secondary structure can be predicted using a traditional score-based approach from a single RNA sequence.
Fig 3
Fig 3. Framework for RNA secondary structure prediction methods with ML-based preprocessing or postprocessing.
In RNA secondary structure prediction, ML models (trained by sequence data, in green) can be also used in pretreatment for selecting an appropriate prediction method or a group of appropriate parameters; ML models (trained by structure data, in brown) also can provide a means of determining the most likely structures among the outcomes.
Fig 4
Fig 4. Framework for the RNA secondary structure prediction methods with ML-based prediction process.
ML models (trained by wet lab, RNA sequence, or RNA structure data) are directly used to predict RNA secondary structures in an end-to-end way or followed by a filter or optimizer to obtain the optimal RNA secondary structure.

Similar articles

Cited by

References

    1. Fu Y, Xu ZZ, Lu ZJ, Zhao S, Mathews DH. Discovery of Novel ncRNA Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures. PLoS ONE. 2015;10(6):e0130200. doi: 10.1371/journal.pone.0130200. - DOI - PMC - PubMed
    1. Consortium TEP. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012;489(7414):57–74. doi: 10.1038/nature11247 . - DOI - PMC - PubMed
    1. Consortium TF. The transcriptional landscape of the mammalian genome. Science. 2006;311(5768):1713. doi: 10.1126/science.1121522. - DOI - PubMed
    1. Doudna JA, Cech TR. The chemical repertoire of natural ribozymes. Nature. 2002;418(6894):222–8. doi: 10.1038/418222a . - DOI - PubMed
    1. Higgs PG, Lehman N. The RNA World: molecular cooperation at the origins of life. Nat Rev Genet. 2015;16(1):7–17. doi: 10.1038/nrg3841 . - DOI - PubMed

Publication types

Grants and funding

This work was supported in part by the Fundamental Research Funds of Northeastern University (N181903008 - Q.Z.); the Research Start-up Fund for Talent of Dalian Maritime University (02500348 - Z.Z.); the Doctoral Scientific Research Foundation of Liaoning Province of China (2019-BS-108 - Q.M.); the Youth Seedling Project of Educational Department of Liaoning Province of China (LQN202002- Q.M.); the Fundamental Research Funds for the Central Universities (82232019 - X.Y.F.); and the National Natural Science Foundation of China (62002056 - Q.Z., 31801623 - Q.M., 81871219 - Z.W.Y). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.