SBSPKSv2: structure-based sequence analysis of polyketide synthases and non-ribosomal peptide synthetases

Nucleic Acids Res. 2017 Jul 3;45(W1):W72-W79. doi: 10.1093/nar/gkx344.


Genome guided discovery of novel natural products has been a promising approach for identification of new bioactive compounds. SBSPKS web-server has been a valuable resource for analysis of polyketide synthase (PKS) and non-ribosomal peptide synthetase (NRPS) gene clusters. We have developed an updated version - SBSPKSv2 which is based on comprehensive analysis of sequence, structure and secondary metabolite chemical structure data from 311 experimentally characterized PKS/NRPS gene clusters with known biosynthetic products. A completely new feature of SBSPKSv2 is the inclusion of features for search in chemical space. It allows the user to compare the chemical structure of a given secondary metabolite to the chemical structures of biosynthetic intermediates and final products. For identification of catalytic domains, SBSPKS now uses profile based searches, which are computationally faster and have high sensitivity. HMM profiles have also been added for a number of new domains and motif information has been used for distinguishing condensation (C), epimerization (E) and cyclization (Cy) domains of NRPS. In summary, the new and updated SBSPKSv2 is a versatile tool for genome mining and analysis of polyketide and non-ribosomal peptide biosynthetic pathways in chemical space. The server is available at:

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Biosynthetic Pathways / genetics
  • Catalytic Domain
  • Genomics
  • Internet
  • Peptide Synthases / chemistry*
  • Peptide Synthases / genetics
  • Polyketide Synthases / chemistry*
  • Polyketide Synthases / genetics
  • Secondary Metabolism / genetics
  • Sequence Analysis
  • Software*


  • Polyketide Synthases
  • Peptide Synthases
  • non-ribosomal peptide synthase