C-QSAR: a database of 18,000 QSARs and associated biological and physical data

J Comput Aided Mol Des. 2003 Feb-Apr;17(2-4):187-96. doi: 10.1023/a:1025322008290.

Abstract

The C-QSAR program is used to develop and search a database of over 18,000 equations that relate biological or physico-chemical properties of molecules to various molecular descriptors. The data used to derive the quantitative structure activity relationships (QSAR) are taken from various high quality journals. C-QSAR comprises two databases, one for structure-activity information biological systems (n = 9200) and the other for physical organic systems. Users can search the data in 20 different fields; for example by structure or substructure of the compounds involved, by the type of property correlated, by molecular properties, or by properties of the QSAR equation. Various ways in which information can be obtained is briefly discussed. Initially the database is often used for data mining, to search lead molecules, for substituent selection and "model mining" for lateral validation. The regression analysis is useful when the user wants to derive a new QSAR using his structures and activity data.

MeSH terms

  • Chemical Phenomena
  • Chemistry, Physical
  • Databases, Bibliographic*
  • Databases, Factual*
  • Drug Design*
  • Molecular Structure
  • Quantitative Structure-Activity Relationship*
  • Software