Survey of Natural Language Processing Techniques in Bioinformatics

Comput Math Methods Med. 2015;2015:674296. doi: 10.1155/2015/674296. Epub 2015 Oct 7.


Informatics methods, such as text mining and natural language processing, are always involved in bioinformatics research. In this study, we discuss text mining and natural language processing methods in bioinformatics from two perspectives. First, we aim to search for knowledge on biology, retrieve references using text mining methods, and reconstruct databases. For example, protein-protein interactions and gene-disease relationship can be mined from PubMed. Then, we analyze the applications of text mining and natural language processing techniques in bioinformatics, including predicting protein structure and function, detecting noncoding RNA. Finally, numerous methods and applications, as well as their contributions to bioinformatics, are discussed for future use by text mining and natural language processing researchers.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Data Mining / methods
  • Databases, Factual
  • Databases, Genetic
  • Databases, Protein
  • Humans
  • Natural Language Processing*
  • Protein Interaction Maps
  • PubMed
  • RNA, Untranslated / chemistry
  • RNA, Untranslated / genetics
  • Sequence Alignment
  • Surveys and Questionnaires


  • RNA, Untranslated