SUBA: the Arabidopsis Subcellular Database

Nucleic Acids Res. 2007 Jan;35(Database issue):D213-8. doi: 10.1093/nar/gkl863. Epub 2006 Oct 28.


Knowledge of protein localisation contributes towards our understanding of protein function and of biological inter-relationships. A variety of experimental methods are currently being used to produce localisation data that need to be made accessible in an integrated manner. Chimeric fluorescent fusion proteins have been used to define subcellular localisations with at least 1100 related experiments completed in Arabidopsis. More recently, many studies have employed mass spectrometry to undertake proteomic surveys of subcellular components in Arabidopsis yielding localisation information for approximately 2600 proteins. Further protein localisation information may be obtained from other literature references to analysis of locations (AmiGO: approximately 900 proteins), location information from Swiss-Prot annotations (approximately 2000 proteins); and location inferred from gene descriptions (approximately 2700 proteins). Additionally, an increasing volume of available software provides location prediction information for proteins based on amino acid sequence. We have undertaken to bring these various data sources together to build SUBA, a SUBcellular location database for Arabidopsis proteins. The localisation data in SUBA encompasses 10 distinct subcellular locations, >6743 non-redundant proteins and represents the proteins encoded in the transcripts responsible for 51% of Arabidopsis expressed sequence tags. The SUBA database provides a powerful means by which to assess protein subcellular localisation in Arabidopsis (

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis Proteins / analysis*
  • Arabidopsis Proteins / chemistry
  • Databases, Protein*
  • Internet
  • Proteome / analysis
  • Proteome / chemistry
  • Sequence Analysis, Protein
  • User-Computer Interface


  • Arabidopsis Proteins
  • Proteome