A survey of multiple sequence comparison methods

Bull Math Biol. 1992 Jul;54(4):563-98. doi: 10.1007/BF02459635.

Abstract

Multiple sequence comparison refers to the search for similarity in three or more sequences. This article presents a survey of the exhaustive (optimal) and heuristic (possibly sub-optimal) methods developed for the comparison of multiple macromolecular sequences. Emphasis is given to the different approaches of the heuristic methods. Four distance measures derived from information engineering and genetic studies are introduced for the comparison between two alignments of sequences. The use of entropy, which plays a central role in information theory as measures of information, choice and uncertainty, is proposed as a simple measure for the evaluation of the optimality of an alignment in the absence of any a priori knowledge about the structures of the sequences being compared. This article also gives two examples of comparison between alternative alignments of the same set of 5SRNAs as obtained by several different heuristic methods.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Base Sequence
  • Mathematics
  • Molecular Sequence Data
  • RNA, Ribosomal, 5S / genetics
  • Sequence Alignment / methods*
  • Sequence Alignment / statistics & numerical data
  • Software

Substances

  • RNA, Ribosomal, 5S