A benchmark of multiple sequence alignment programs upon structural RNAs

Paul P Gardner; Andreas Wilm; Stefan Washietl

doi:10.1093/nar/gki541

A benchmark of multiple sequence alignment programs upon structural RNAs

Nucleic Acids Res. 2005 Apr 28;33(8):2433-9. doi: 10.1093/nar/gki541. Print 2005.

Authors

Paul P Gardner¹, Andreas Wilm, Stefan Washietl

Affiliation

¹ Department of Evolutionary Biology, University of Copenhagen Universitetsparken 15, 2100 Copenhagen Ø, Denmark. PPGardner@bi.ku.dk

Abstract

To date, few attempts have been made to benchmark the alignment algorithms upon nucleic acid sequences. Frequently, sophisticated PAM or BLOSUM like models are used to align proteins, yet equivalents are not considered for nucleic acids; instead, rather ad hoc models are generally favoured. Here, we systematically test the performance of existing alignment algorithms on structural RNAs. This work was aimed at achieving the following goals: (i) to determine conditions where it is appropriate to apply common sequence alignment methods to the structural RNA alignment problem. This indicates where and when researchers should consider augmenting the alignment process with auxiliary information, such as secondary structure and (ii) to determine which sequence alignment algorithms perform well under the broadest range of conditions. We find that sequence alignment alone, using the current algorithms, is generally inappropriate <50-60% sequence identity. Second, we note that the probabilistic method ProAlign and the aging Clustal algorithms generally outperform other sequence-based algorithms, under the broadest range of applications.

Publication types

Evaluation Study
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Nucleic Acid Conformation
RNA / chemistry*
RNA, Untranslated / chemistry
Reproducibility of Results
Sequence Alignment / methods*
Sequence Analysis, RNA / methods*

Substances

RNA, Untranslated
RNA