LigBase: a database of families of aligned ligand binding sites in known protein sequences and structures

Bioinformatics. 2002 Jan;18(1):200-1. doi: 10.1093/bioinformatics/18.1.200.


A database comprising all ligand-binding sites of known structure aligned with all related protein sequences and structures is described. Currently, the database contains approximately 50000 ligand-binding sites for small molecules found in the Protein Data Bank (PDB). The structure-structure alignments are obtained by the Combinatorial Extension (CE) program (Shindyalov and Bourne, Protein Eng., 11, 739-747, 1998) and sequence-structure alignments are extracted from the ModBase database of comparative protein structure models for all known protein sequences (Sanchez et al., Nucleic Acids Res., 28, 250-253, 2000). It is possible to search for binding sites in LigBase by a variety of criteria. LigBase reports summarize ligand data including relevant structural information from the PDB file, such as ligand type and size, and contain links to all related protein sequences in the TrEMBL database. Residues in the binding sites are graphically depicted for comparison with other structurally defined family members. LigBase provides a resource for the analysis of families of related binding sites.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Binding Sites
  • Computational Biology
  • Databases, Protein*
  • Internet
  • Ligands
  • Proteins / chemistry*
  • Proteins / genetics*
  • Sequence Alignment / statistics & numerical data


  • Ligands
  • Proteins