The prediction of human oral absorption for diffusion rate-limited drugs based on heuristic method and support vector machine

J Comput Aided Mol Des. 2005 Jan;19(1):33-46. doi: 10.1007/s10822-005-0095-8.


Support vector machine (SVM), as a novel machine learning technique, was used for the prediction of the human oral absorption for a large and diverse data set using the five descriptors calculated from the molecular structure alone. The molecular descriptors were selected by heuristic method (HM) implemented in CODESSA. At the same time, in order to show the influence of different molecular descriptors on absorption and to well understand the absorption mechanism, HM was used to build several multivariable linear models using different numbers of molecular descriptors. Both the linear and non-linear model can give satisfactory prediction results: the square of correlation coefficient R(2) was 0.78 and 0.86 for the training set, and 0.70 and 0.73 for the test set respectively. In addition, this paper provides a new and effective method for predicting the absorption of the drugs from their structures and gives some insight into structural features related to the absorption of the drugs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Administration, Oral
  • Diffusion
  • Humans
  • Intestinal Absorption
  • Pharmacokinetics*
  • Quantitative Structure-Activity Relationship