Benefit of Retraining pKa Models Studied Using Internally Measured Data

Peter Gedeck; Yipin Lu; Suzanne Skolnik; Stephane Rodde; Gavin Dollinger; Weiping Jia; Giuliano Berellini; Riccardo Vianello; Bernard Faller; Franco Lombardo

doi:10.1021/acs.jcim.5b00172

J Chem Inf Model. 2015 Jul 27;55(7):1449-59. doi: 10.1021/acs.jcim.5b00172. Epub 2015 Jun 29.

Affiliations

¹ Novartis Institute for Tropical Diseases Pte. Ltd., 10 Biopolis Road, #05-01 Chromos, Singapore 138670, Singapore.
² Novartis Institute for Biomedical Research, 5300 Chiron Way, Emeryville, California 94608, United States.
³ Novartis Institute for Biomedical Research, 250 Massachusetts Ave, Cambridge, Massachusetts 02139, United States.
⁴ Novartis Institute for Biomedical Research, Postfach, CH-4002 Basel, Switzerland.

PMID: 26052622
DOI: 10.1021/acs.jcim.5b00172

Abstract

The ionization state of a drug influences many of its pharmaceutical properties, such as solubility, permeability, and biological activity. It is therefore important to understand the structure-property relationship for the acid-base dissociation constant pKa during lead optimization in order to make better-informed design decisions. Computational approaches, such as those implemented in MoKa, can help with this; however, their prediction errors are often too large, especially for proprietary compounds. In this contribution, we examine how retraining greatly reduces prediction error. Using a longitudinal study with data measured over 15 years in a drug discovery environment, we assess the impact of model training on prediction accuracy and examine model degradation over time. Using the MoKa software, we demonstrate that regular retraining is required to address changes in chemical space, which lead to model degradation over six to nine months.
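
The kind of time-split evaluation the abstract describes can be illustrated with a minimal sketch. It is not the authors' protocol and does not use MoKa or its API: the mean-value predictor `fit_mean_model`, the helper `rmse_by_age`, and the drifting synthetic data are all hypothetical stand-ins, chosen only to show how prediction error can be tracked as a function of model age and compared against a retrained model.

```python
import math
from collections import defaultdict


def fit_mean_model(train):
    """Hypothetical stand-in model: always predicts the mean training pKa."""
    mean = sum(pka for _, pka in train) / len(train)
    return lambda record: mean


def rmse_by_age(records, cutoff_month, fit=fit_mean_model, bucket=3):
    """Train on measurements before cutoff_month, then return the RMSE on
    later measurements, keyed by model age in months (bucket-month bins)."""
    train = [r for r in records if r[0] < cutoff_month]
    predict = fit(train)
    squared_errors = defaultdict(list)
    for record in records:
        month, pka = record
        if month >= cutoff_month:
            age_bin = (month - cutoff_month) // bucket
            squared_errors[age_bin].append((predict(record) - pka) ** 2)
    return {
        age_bin * bucket: math.sqrt(sum(errs) / len(errs))
        for age_bin, errs in sorted(squared_errors.items())
    }


# Synthetic, illustrative data only: (month, measured pKa). The slow upward
# drift is an assumption that mimics projects moving into new chemical space.
records = [(m, 7.0 + 0.1 * m + (0.2 if m % 2 else -0.2)) for m in range(24)]

stale = rmse_by_age(records, cutoff_month=12)      # trained once at month 12
retrained = rmse_by_age(records, cutoff_month=18)  # retrained at month 18

for age, rmse in stale.items():
    print(f"stale model, age {age}-{age + 2} months: RMSE {rmse:.2f}")
print(f"retrained model, age 0-2 months: RMSE {retrained[0]:.2f}")
```

On this synthetic data the error of the model trained at month 12 grows with each three-month bin, and the model retrained at month 18 has a lower error on months 18 to 20 than the stale model does on the same months. With real measurements, `fit` would be replaced by whatever training routine is in use.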

MeSH terms

Chemical Phenomena*
Machine Learning*
Models, Theoretical*
Reproducibility of Results