Comprehensive study on iterative algorithms of multiple sequence alignment

M Hirosawa; Y Totoki; M Hoshida; M Ishikawa

doi:10.1093/bioinformatics/11.1.13

Comprehensive study on iterative algorithms of multiple sequence alignment

Comput Appl Biosci. 1995 Feb;11(1):13-8. doi: 10.1093/bioinformatics/11.1.13.

Authors

M Hirosawa¹, Y Totoki, M Hoshida, M Ishikawa

Affiliation

¹ Institute for New Generation Computer Technology, (ICOT), Tokyo, Japan.

PMID: 7796270
DOI: 10.1093/bioinformatics/11.1.13

Abstract

Multiple sequence alignment is an important problem in the biosciences. To date, most multiple alignment systems have employed a tree-based algorithm, which combines the results of two-way dynamic programming in a tree-like order of sequence similarity. The alignment quality is not, however, high enough when the sequence similarity is low. Once an error occurs in the alignment process, that error can never be corrected. Recently, an effective new class of algorithms has been developed. These algorithms iteratively apply dynamic programming to partially aligned sequences to improve their alignment quality. The iteration corrects any errors that may have occurred in the alignment process. Such an iterative strategy requires heuristic search methods to solve practical alignment problems. Incorporating such methods yields various iterative algorithms. This paper reports our comprehensive comparison of iterative algorithms. We proved that performance improves remarkably when using a tree-based iterative method, which iteratively refines an alignment whenever two subalignments are merged in a tree-based way. We propose a tree-dependent, restricted partitioning technique to efficiently reduce the execution time of iterative algorithms.

Publication types

Comparative Study

MeSH terms

Algorithms*
Amino Acid Sequence
Evaluation Studies as Topic
Molecular Sequence Data
Proteins / genetics
Sequence Alignment / methods*
Sequence Alignment / statistics & numerical data
Sequence Homology, Amino Acid
Software*

Substances

Proteins