Nucleic Acids Res. 2002 Jan 1;30(1):17-20. doi: 10.1093/nar/30.1.17.


The GenBank sequence database incorporates publicly available DNA sequences of more than 105 000 different organisms, primarily through direct submission of sequence data from individual laboratories and large-scale sequencing projects. Most submissions are made using the BankIt (web) or Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Data exchange with the EMBL Data Library and the DNA Data Bank of Japan helps ensure comprehensive worldwide coverage. GenBank data is accessible through NCBI's integrated retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical literature via PubMed. Sequence similarity searching is provided by the BLAST family of programs. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. NCBI also offers a wide range of World Wide Web retrieval and analysis services based on GenBank data. The GenBank database and related resources are freely accessible via the NCBI home page at

MeSH terms

  • Animals
  • Base Sequence
  • Data Collection
  • Databases, Nucleic Acid*
  • Expressed Sequence Tags
  • Genome
  • Humans
  • Information Storage and Retrieval
  • Internet
  • National Library of Medicine (U.S.)
  • Sequence Analysis, DNA*
  • Sequence Homology, Nucleic Acid
  • Sequence Tagged Sites
  • United States