Protegen: a web-based protective antigen database and analysis system

Nucleic Acids Res. 2011 Jan;39(Database issue):D1073-8. doi: 10.1093/nar/gkq944. Epub 2010 Oct 19.


Protective antigens are specifically targeted by the acquired immune response of the host and are able to induce protection in the host against infectious and non-infectious diseases. Protective antigens play important roles in vaccine development, as biological markers for disease diagnosis, and for analysis of fundamental host immunity against diseases. Protegen is a web-based central database and analysis system that curates, stores and analyzes protective antigens. Basic antigen information and experimental evidence are curated from peer-reviewed articles. More detailed gene/protein information (e.g. DNA and protein sequences, and COG classification) are automatically extracted from existing databases using internally developed scripts. Bioinformatics programs are also applied to compute different antigen features, such as protein weight and pI, and subcellular localizations of bacterial proteins. Presently, 590 protective antigens have been curated against over 100 infectious diseases caused by pathogens and non-infectious diseases (including cancers and allergies). A user-friendly web query and visualization interface is developed for interactive protective antigen search. A customized BLAST sequence similarity search is also developed for analysis of new sequences provided by the users. To support data exchange, the information of protective antigens is stored in the Vaccine Ontology (VO) in OWL format and can also be exported to FASTA and Excel files. Protegen is publically available at

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Antigens / chemistry*
  • Antigens / genetics*
  • Antigens / immunology
  • Antigens, Bacterial
  • Antigens, Protozoan
  • Antigens, Viral
  • Databases, Protein*
  • Internet
  • Molecular Sequence Annotation
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / immunology*
  • Sequence Alignment
  • User-Computer Interface
  • Vaccines / immunology


  • Antigens
  • Antigens, Bacterial
  • Antigens, Protozoan
  • Antigens, Viral
  • Proteins
  • Vaccines