NAPP: the Nucleic Acid Phylogenetic Profile Database

Nucleic Acids Res. 2012 Jan;40(Database issue):D205-9. doi: 10.1093/nar/gkr807. Epub 2011 Oct 8.


Nucleic acid phylogenetic profiling (NAPP) classifies coding and non-coding sequences in a genome according to their pattern of conservation across other genomes. This procedure efficiently distinguishes clusters of functional non-coding elements in bacteria, particularly small RNAs and cis-regulatory RNAs, from other conserved sequences. In contrast to other non-coding RNA detection pipelines, NAPP does not require the presence of conserved RNA secondary structure and therefore is likely to identify previously undetected RNA genes or elements. Furthermore, as NAPP clusters contain both coding and non-coding sequences with similar occurrence profiles, they can be analyzed under a functional perspective. We recently improved the NAPP pipeline and applied it to a collection of 949 bacterial and 68 archaeal species. The database and web interface available at enable detailed analysis of NAPP clusters enriched in non-coding RNAs, graphical display of phylogenetic profiles, visualization of predicted RNAs in their genome context and extraction of predicted RNAs for use with genome browsers or other software.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bacillus subtilis / genetics
  • Base Sequence
  • Conserved Sequence
  • Databases, Nucleic Acid*
  • Genome, Archaeal
  • Genome, Bacterial
  • Phylogeny
  • RNA, Archaeal / classification
  • RNA, Archaeal / genetics*
  • RNA, Bacterial / classification
  • RNA, Bacterial / genetics*
  • RNA, Small Untranslated / genetics*
  • Regulatory Sequences, Ribonucleic Acid*


  • RNA, Archaeal
  • RNA, Bacterial
  • RNA, Small Untranslated
  • Regulatory Sequences, Ribonucleic Acid