Naïve Bayes classification using 2D pharmacophore feature triplet vectors

J Chem Inf Model. 2008 Jan;48(1):166-78. doi: 10.1021/ci7003253. Epub 2008 Jan 10.

Abstract

A naïve Bayes classifier, employed in conjunction with 2D pharmacophore feature triplet vectors describing the molecules, is presented and validated. Molecules are described using a vector where each element in the vector contains the number of times a particular triplet of atom-based features separated by a set of topological distances occurs. Using the feature triplet vectors it is possible to generate naïve Bayes classifiers that predict whether molecules are likely to be active against a given target (or family of targets). Two retrospective validation experiments were performed using a range of actives from WOMBAT, the Prous Integrity database, and the Arena screening library. The performance of the classifiers was evaluated using enrichment curves, enrichment factors, and the BEDROC metric. The classifiers were found to give significant enrichments for the various test sets.

MeSH terms

  • Artificial Intelligence*
  • Bayes Theorem
  • Computational Biology / methods*
  • Databases, Protein
  • Drug Evaluation, Preclinical / methods*
  • Endopeptidases / classification
  • Endopeptidases / metabolism
  • Phosphotransferases / classification
  • Phosphotransferases / metabolism
  • Receptors, G-Protein-Coupled / classification
  • Receptors, G-Protein-Coupled / metabolism
  • Reproducibility of Results

Substances

  • Receptors, G-Protein-Coupled
  • Phosphotransferases
  • Endopeptidases