Quantitative and qualitative models for carcinogenicity prediction for non-congeneric chemicals using CP ANN method for regulatory uses

Mol Divers. 2010 Aug;14(3):581-94. doi: 10.1007/s11030-009-9190-4. Epub 2009 Aug 15.


The new European chemicals regulation, Registration, Evaluation, Authorization and Restriction of Chemicals (REACH), entered into force in June 2007 and accelerated the development of quantitative structure-activity relationship (QSAR) models for a variety of endpoints, including carcinogenicity. Here, we present quantitative (continuous) and qualitative (categorical) models for predicting the carcinogenic potency of non-congeneric chemicals. A dataset of 805 substances was obtained after a preliminary screening of rodent carcinogenicity findings for 1,481 chemicals accessible via the Distributed Structure-Searchable Toxicity (DSSTox) Public Database Network and originating from the Lois Gold Carcinogenic Potency Database (CPDB). Twenty-seven two-dimensional MDL descriptors were selected using Kohonen mapping and principal component analysis. The counter propagation artificial neural network (CP ANN) technique was applied. Quantitative models were developed to explore the relationship between the experimental and predicted carcinogenic potency, expressed as the tumorigenic dose TD(50) for rats. The resulting models showed low predictive power, with a correlation coefficient below 0.5 for the test set. In the next step, qualitative models were developed. The qualitative models exhibited good accuracy for the training set (92%). The model also demonstrated good predictive performance for the test set, with an accuracy of 68%, a sensitivity of 73%, and a specificity of 63%. We believe that the CP ANN method is a good in silico approach for modeling and predicting rodent carcinogenicity for non-congeneric chemicals and may find application for other toxicological endpoints.
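
The accuracy, sensitivity, and specificity reported for the qualitative model are standard binary classification statistics. As a minimal sketch of how such figures are computed from experimental and predicted classes, the Python snippet below assumes labels coded as 1 (carcinogen) and 0 (non-carcinogen). The function name `classification_metrics` and the toy labels are hypothetical illustrations; they are not the authors' code or data.

```python
def classification_metrics(y_true, y_pred):
    """Return accuracy, sensitivity, and specificity for binary labels.

    Assumed coding (not taken from the paper): 1 = carcinogen, 0 = non-carcinogen.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")

    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": tp / (tp + fn),  # fraction of carcinogens detected
        "specificity": tn / (tn + fp),  # fraction of non-carcinogens detected
    }


# Hypothetical toy labels: 10 carcinogens followed by 10 non-carcinogens.
y_true = [1] * 10 + [0] * 10
y_pred = [1] * 8 + [0] * 2 + [0] * 6 + [1] * 4

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

In this toy case 8 of 10 carcinogens and 6 of 10 non-carcinogens are classified correctly. Reporting sensitivity and specificity alongside accuracy shows whether a model favors one class over the other.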

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Carcinogenicity Tests
  • Carcinogens / toxicity*
  • Databases, Factual
  • Drug and Narcotic Control / methods*
  • Humans
  • Neural Networks, Computer*
  • Principal Component Analysis
  • Quantitative Structure-Activity Relationship*
  • ROC Curve
  • Rats


Substances

  • Carcinogens