An introduction to information retrieval: applications in genomics

Pharmacogenomics J. 2002;2(2):96-102. doi: 10.1038/sj.tpj.6500084.


Information retrieval (IR) is the field of computer science that deals with the processing of documents containing free text, so that they can be rapidly retrieved based on keywords specified in a user's query. IR technology is the basis of Web-based search engines and plays a vital role in biomedical research, because it is the foundation of software that supports literature search. Documents can be indexed both by the words they contain and by the concepts that can be matched to domain-specific thesauri; concept matching, however, poses several practical difficulties that make it unsuitable for use by itself. This article provides an introduction to IR and summarizes various applications of IR and related technologies to genomics.
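
As an illustration of the two kinds of indexing described above, the following is a minimal sketch and not the article's own method. It assumes a toy in-memory corpus (`DOCS`) and a hypothetical three-entry thesaurus (`THESAURUS`); the function names are likewise invented for this example. It builds a word index and a concept index, then answers a keyword query by intersecting the sets of matching documents.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_word_index(docs):
    """Map each word to the set of ids of the documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            index[word].add(doc_id)
    return index


def build_concept_index(docs, thesaurus):
    """Map each thesaurus concept to the documents containing one of its terms."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for word in tokenize(text):
            if word in thesaurus:
                index[thesaurus[word]].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every keyword in the query."""
    words = tokenize(query)
    if not words:
        return set()
    result = set(index.get(words[0], set()))
    for word in words[1:]:
        result &= index.get(word, set())
    return result


# Hypothetical toy corpus and thesaurus, for illustration only.
DOCS = {
    1: "BRCA1 mutations and breast cancer risk",
    2: "Gene expression profiling in breast tumors",
    3: "Information retrieval for genomics literature",
}
THESAURUS = {"cancer": "neoplasms", "tumor": "neoplasms", "tumors": "neoplasms"}

WORD_INDEX = build_word_index(DOCS)
CONCEPT_INDEX = build_concept_index(DOCS, THESAURUS)
```

In this sketch, the query "breast cancer" matches only document 1 through the word index, whereas the concept "neoplasms" groups documents 1 and 2 even though they use different words. A single-word lookup table like this one cannot handle multi-word terms or ambiguous words, which hints at why concept matching is hard to rely on by itself.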

Publication types

  • Research Support, U.S. Gov't, P.H.S.
  • Review

MeSH terms

  • Animals
  • Databases, Genetic / trends
  • Genomics / methods*
  • Genomics / trends
  • Humans
  • Information Storage and Retrieval / methods*
  • Information Storage and Retrieval / trends