R-PASS: A Fast Structure-based RNA Sequence Alignment Algorithm

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2011 Dec 31;2011:618-622. doi: 10.1109/BIBM.2011.74.


We present a fast pairwise RNA sequence alignment method using structural information, named R-PASS (RNA Pairwise Alignment of Structure and Sequence), which shows good accuracy on sequences with low sequence identity and significantly faster than alternative methods. The method begins by representing RNA secondary structure as a set of structure motifs. The motifs from two RNAs are then used as input into a bipartite graph-matching algorithm, which determines the structure matches. The matches are then used as constraints in a constrained dynamic programming sequence alignment procedure. The R-PASS method has an O(nm) complexity. We compare our method with two other structure-based alignment methods, LARA and ExpaLoc, and with a sequence-based alignment method, MAFFT, across three benchmarks and obtain favorable results in accuracy and orders of magnitude faster in speed.

Keywords: RNA pairwise structural alignment; bipartite graph matching; constraint sequence alignment; structure motif.