The SAMPL6 challenge on predicting aqueous pKa values from EC-RISM theory
- PMID: 30073500
- DOI: 10.1007/s10822-018-0140-z
The SAMPL6 challenge on predicting aqueous pKa values from EC-RISM theory
Abstract
The "embedded cluster reference interaction site model" (EC-RISM) integral equation theory is applied to the problem of predicting aqueous pKa values for drug-like molecules based on an ensemble of tautomers. EC-RISM is based on self-consistent calculations of a solute's electronic structure and the distribution function of surrounding water. Following-up on the workflow developed after the SAMPL5 challenge on cyclohexane-water distribution coefficients we extended and improved the methodology by taking into account exact electrostatic solute-solvent interactions taken from the wave function in solution. As before, the model is calibrated against Gibbs energies of hydration from the "Minnesota Solvation Database" and a public dataset of acidity constants of organic acids and bases by adjusting in total 4 parameters, among which only 3 are relevant for predicting pKa values. While the best-performing training model yields a root-mean-square error (RMSE) of 1 pK unit, the corresponding test set prediction on the full SAMPL6 dataset of macroscopic pKa values using the same level of theory exhibits slightly larger error (1.7 pK units) than the best test set model submitted (1.7 pK units for corresponding training set vs. test set performance of 1.6). Post-submission analysis revealed a number of physical optimization options regarding the numerical treatment of electrostatic interactions and conformational sampling. While the experimental test set data revealed after submission was not used for reparametrizing the methodology, the best physically optimized models consequentially result in RMSEs of 1.5 if only improved electrostatic interactions are considered and of 1.1 if, in addition, conformational sampling accounts for quantum-chemically derived rankings. We conclude that these numbers are probably near the ultimate accuracy achievable with the simple 3-parameter model using a single or the two best-ranking conformations per tautomer or microstate. Finally, relations of the present macrostate approach to microstate pKa results are discussed and some illustrative results for microstate populations are presented.
Keywords: EC-RISM; Integral equation theory; Quantum chemistry; SAMPL6; Solvation model; pK a.
Similar articles
-
SAMPL7 physical property prediction from EC-RISM theory.J Comput Aided Mol Des. 2021 Aug;35(8):933-941. doi: 10.1007/s10822-021-00410-9. Epub 2021 Jul 19. J Comput Aided Mol Des. 2021. PMID: 34278539 Free PMC article.
-
The SAMPL5 challenge for embedded-cluster integral equation theory: solvation free energies, aqueous pK a, and cyclohexane-water log D.J Comput Aided Mol Des. 2016 Nov;30(11):1035-1044. doi: 10.1007/s10822-016-9939-7. Epub 2016 Aug 23. J Comput Aided Mol Des. 2016. PMID: 27554666
-
The SAMPL6 challenge on predicting octanol-water partition coefficients from EC-RISM theory.J Comput Aided Mol Des. 2020 Apr;34(4):453-461. doi: 10.1007/s10822-020-00283-4. Epub 2020 Jan 24. J Comput Aided Mol Des. 2020. PMID: 31981015 Free PMC article.
-
Biomolecular Simulations with the Three-Dimensional Reference Interaction Site Model with the Kovalenko-Hirata Closure Molecular Solvation Theory.Int J Mol Sci. 2021 May 11;22(10):5061. doi: 10.3390/ijms22105061. Int J Mol Sci. 2021. PMID: 34064655 Free PMC article. Review.
-
The pKa Cooperative: a collaborative effort to advance structure-based calculations of pKa values and electrostatic effects in proteins.Proteins. 2011 Dec;79(12):3249-59. doi: 10.1002/prot.23194. Epub 2011 Oct 15. Proteins. 2011. PMID: 22002877 Free PMC article. Review.
Cited by
-
Improving Small Molecule pK a Prediction Using Transfer Learning With Graph Neural Networks.Front Chem. 2022 May 26;10:866585. doi: 10.3389/fchem.2022.866585. eCollection 2022. Front Chem. 2022. PMID: 35721000 Free PMC article.
-
Implementation and Optimization of the Embedded Cluster Reference Interaction Site Model with Atomic Charges.J Phys Chem A. 2022 Apr 21;126(15):2417-2429. doi: 10.1021/acs.jpca.1c07904. Epub 2022 Apr 8. J Phys Chem A. 2022. PMID: 35394778 Free PMC article.
-
Asymmetric Interplay Between K+ and Blocker and Atomistic Parameters From Physiological Experiments Quantify K+ Channel Blocker Release.Front Physiol. 2021 Oct 29;12:737834. doi: 10.3389/fphys.2021.737834. eCollection 2021. Front Physiol. 2021. PMID: 34777005 Free PMC article.
-
A Joint Venture of Ab Initio Molecular Dynamics, Coupled Cluster Electronic Structure Methods, and Liquid-State Theory to Compute Accurate Isotropic Hyperfine Constants of Nitroxide Probes in Water.J Chem Theory Comput. 2021 Oct 12;17(10):6366-6386. doi: 10.1021/acs.jctc.1c00582. Epub 2021 Sep 13. J Chem Theory Comput. 2021. PMID: 34516119 Free PMC article.
-
SAMPL7 physical property prediction from EC-RISM theory.J Comput Aided Mol Des. 2021 Aug;35(8):933-941. doi: 10.1007/s10822-021-00410-9. Epub 2021 Jul 19. J Comput Aided Mol Des. 2021. PMID: 34278539 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
