Gene Ontology annotation of sequence-specific DNA binding transcription factors: setting the stage for a large-scale curation effort

Database (Oxford). 2013 Aug 27:2013:bat062. doi: 10.1093/database/bat062. Print 2013.

Abstract

Transcription factors control which information in a genome becomes transcribed to produce RNAs that function in the biological systems of cells and organisms. Reliable and comprehensive information about transcription factors is invaluable for large-scale network-based studies. However, existing transcription factor knowledge bases are still lacking in well-documented functional information. Here, we provide guidelines for a curation strategy, which constitutes a robust framework for using the controlled vocabularies defined by the Gene Ontology Consortium to annotate specific DNA binding transcription factors (DbTFs) based on experimental evidence reported in literature. Our standardized protocol and workflow for annotating specific DNA binding RNA polymerase II transcription factors is designed to document high-quality and decisive evidence from valid experimental methods. Within a collaborative biocuration effort involving the user community, we are now in the process of exhaustively annotating the full repertoire of human, mouse and rat proteins that qualify as DbTFs in as much as they are experimentally documented in the biomedical literature today. The completion of this task will significantly enrich Gene Ontology-based information resources for the research community. Database URL: www.tfcheckpoint.org.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • DNA-Binding Proteins / genetics*
  • Data Mining / methods*
  • Gene Expression Regulation
  • Gene Ontology*
  • Genes, Reporter
  • Humans
  • Mice
  • Molecular Sequence Annotation*
  • Protein Binding
  • RNA Polymerase II / metabolism
  • Rats
  • Transcription Factors / genetics*

Substances

  • DNA-Binding Proteins
  • Transcription Factors
  • RNA Polymerase II