Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles
- PMID: 11700055
- DOI: 10.1006/jmbi.2001.5102
Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles
Abstract
We present here a new approach to the problem of defining RNA signatures and finding their occurrences in sequence databases. The proposed method is based on "secondary structure profiles". An RNA sequence alignment with secondary structure information is used as an input. Two types of weight matrices/profiles are constructed from this alignment: single strands are represented by a classical lod-scores profile while helical regions are represented by an extended "helical profile" comprising 16 lod-scores per position, one for each of the 16 possible base-pairs. Database searches are then conducted using a simultaneous search for helical profiles and dynamic programming alignment of single strand profiles. The algorithm has been implemented into a new software, ERPIN, that performs both profile construction and database search. Applications are presented for several RNA motifs. The automated use of sequence information in both single-stranded and helical regions yields better sensitivity/specificity ratios than descriptor-based programs. Furthermore, since the translation of alignments into profiles is straightforward with ERPIN, iterative searches can easily be conducted to enrich collections of homologous RNAs.
Copyright 2001 Academic Press.
Similar articles
-
A method for aligning RNA secondary structures and its application to RNA motif detection.BMC Bioinformatics. 2005 Apr 7;6:89. doi: 10.1186/1471-2105-6-89. BMC Bioinformatics. 2005. PMID: 15817128 Free PMC article.
-
Computing expectation values for RNA motifs using discrete convolutions.BMC Bioinformatics. 2005 May 13;6:118. doi: 10.1186/1471-2105-6-118. BMC Bioinformatics. 2005. PMID: 15892887 Free PMC article.
-
Alignment of RNA base pairing probability matrices.Bioinformatics. 2004 Sep 22;20(14):2222-7. doi: 10.1093/bioinformatics/bth229. Epub 2004 Apr 8. Bioinformatics. 2004. PMID: 15073017
-
Energy-based RNA consensus secondary structure prediction in multiple sequence alignments.Methods Mol Biol. 2014;1097:125-41. doi: 10.1007/978-1-62703-709-9_7. Methods Mol Biol. 2014. PMID: 24639158 Review.
-
The art of editing RNA structural alignments.Methods Mol Biol. 2014;1097:379-94. doi: 10.1007/978-1-62703-709-9_17. Methods Mol Biol. 2014. PMID: 24639168 Review.
Cited by
-
Computational small RNA prediction in bacteria.Bioinform Biol Insights. 2013;7:83-95. doi: 10.4137/BBI.S11213. Epub 2013 Mar 7. Bioinform Biol Insights. 2013. PMID: 23516022 Free PMC article.
-
A novel sigma factor reveals a unique regulon controlling cell-specific recombination in Mycoplasma genitalium.Nucleic Acids Res. 2015 May 26;43(10):4923-36. doi: 10.1093/nar/gkv422. Epub 2015 Apr 29. Nucleic Acids Res. 2015. PMID: 25925568 Free PMC article.
-
Computational discovery of folded RNA domains in genomes and in vitro selected libraries.Methods. 2010 Oct;52(2):133-40. doi: 10.1016/j.ymeth.2010.06.005. Epub 2010 Jun 8. Methods. 2010. PMID: 20554049 Free PMC article. Review.
-
RSEARCH: finding homologs of single structured RNA sequences.BMC Bioinformatics. 2003 Sep 22;4:44. doi: 10.1186/1471-2105-4-44. BMC Bioinformatics. 2003. PMID: 14499004 Free PMC article.
-
Small regulatory RNAs are mediators of the Streptococcus mutans SloR regulon.J Bacteriol. 2023 Sep 26;205(9):e0017223. doi: 10.1128/jb.00172-23. Epub 2023 Sep 11. J Bacteriol. 2023. PMID: 37695854 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
