The Papillomavirus Episteme: A Central Resource for Papillomavirus Sequence Data and Analysis

Nucleic Acids Res. 2013 Jan;41(Database issue):D571-8. doi: 10.1093/nar/gks984. Epub 2012 Oct 23.


The goal of the Papillomavirus Episteme (PaVE) is to provide an integrated resource for the analysis of papillomavirus (PV) genome sequences and related information. The PaVE is a freely accessible, web-based tool created around a relational database, which enables storage, analysis and exchange of sequence information. From a design perspective, the PaVE adopts an Open Source software approach and stresses the integration and reuse of existing tools. Reference PV genome sequences have been extracted from publicly available databases and reannotated using a custom-created tool. To date, the PaVE contains 241 annotated PV genomes, 2245 genes and regions, 2004 protein sequences and 47 protein structures, which users can explore, analyze or download. The PaVE provides scientists with the data and tools needed to accelerate scientific progress for the study and treatment of diseases caused by PVs.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural

MeSH terms

  • Databases, Genetic*
  • Genome, Viral
  • Genomics
  • Internet
  • Molecular Sequence Annotation
  • Papillomaviridae / genetics*
  • Sequence Analysis
  • User-Computer Interface
  • Viral Proteins / chemistry
  • Viral Proteins / genetics

Substances

  • Viral Proteins