Predictive Modeling of Estrogen Receptor Binding Agents Using Advanced Cheminformatics Tools and Massive Public Data
- PMID: 27642585
- PMCID: PMC5023020
- DOI: 10.3389/fenvs.2016.00012
Predictive Modeling of Estrogen Receptor Binding Agents Using Advanced Cheminformatics Tools and Massive Public Data
Abstract
Estrogen receptors (ERα) are a critical target for drug design as well as a potential source of toxicity when activated unintentionally. Thus, evaluating potential ERα binding agents is critical in both drug discovery and chemical toxicity areas. Using computational tools, e.g., Quantitative Structure-Activity Relationship (QSAR) models, can predict potential ERα binding agents before chemical synthesis. The purpose of this project was to develop enhanced predictive models of ERα binding agents by utilizing advanced cheminformatics tools that can integrate publicly available bioassay data. The initial ERα binding agent data set, consisting of 446 binders and 8307 non-binders, was obtained from the Tox21 Challenge project organized by the NIH Chemical Genomics Center (NCGC). After removing the duplicates and inorganic compounds, this data set was used to create a training set (259 binders and 259 non-binders). This training set was used to develop QSAR models using chemical descriptors. The resulting models were then used to predict the binding activity of 264 external compounds, which were available to us after the models were developed. The cross-validation results of training set [Correct Classification Rate (CCR) = 0.72] were much higher than the external predictivity of the unknown compounds (CCR = 0.59). To improve the conventional QSAR models, all compounds in the training set were used to search PubChem and generate a profile of their biological responses across thousands of bioassays. The most important bioassays were prioritized to generate a similarity index that was used to calculate the biosimilarity score between each two compounds. The nearest neighbors for each compound within the set were then identified and its ERα binding potential was predicted by its nearest neighbors in the training set. The hybrid model performance (CCR = 0.94 for cross validation; CCR = 0.68 for external prediction) showed significant improvement over the original QSAR models, particularly for the activity cliffs that induce prediction errors. The results of this study indicate that the response profile of chemicals from public data provides useful information for modeling and evaluation purposes. The public big data resources should be considered along with chemical structure information when predicting new compounds, such as unknown ERα binding agents.
Keywords: QSAR modeling; bioassay profiling; biosimilarity; endocrine disrupting chemicals; estrogen receptor α.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures
Similar articles
-
Differentiation of AmpC beta-lactamase binders vs. decoys using classification kNN QSAR modeling and application of the QSAR classifier to virtual screening.J Comput Aided Mol Des. 2008 Sep;22(9):593-609. doi: 10.1007/s10822-008-9199-2. Epub 2008 Mar 13. J Comput Aided Mol Des. 2008. PMID: 18338225
-
The importance of molecular structures, endpoints' values, and predictivity parameters in QSAR research: QSAR analysis of a series of estrogen receptor binders.Mol Divers. 2010 Nov;14(4):687-96. doi: 10.1007/s11030-009-9212-2. Epub 2009 Nov 17. Mol Divers. 2010. PMID: 19921452
-
Developing Enhanced Blood-Brain Barrier Permeability Models: Integrating External Bio-Assay Data in QSAR Modeling.Pharm Res. 2015 Sep;32(9):3055-65. doi: 10.1007/s11095-015-1687-1. Epub 2015 Apr 11. Pharm Res. 2015. PMID: 25862462 Free PMC article.
-
From QSAR to QSIIR: searching for enhanced computational toxicology models.Methods Mol Biol. 2013;930:53-65. doi: 10.1007/978-1-62703-059-5_3. Methods Mol Biol. 2013. PMID: 23086837 Free PMC article. Review.
-
Prediction reliability of QSAR models: an overview of various validation tools.Arch Toxicol. 2022 May;96(5):1279-1295. doi: 10.1007/s00204-022-03252-y. Epub 2022 Mar 10. Arch Toxicol. 2022. PMID: 35267067 Review.
Cited by
-
MATH: A Deep Learning Approach in QSAR for Estrogen Receptor Alpha Inhibitors.Molecules. 2023 Aug 3;28(15):5843. doi: 10.3390/molecules28155843. Molecules. 2023. PMID: 37570812 Free PMC article.
-
Data-Driven Quantitative Structure-Activity Relationship Modeling for Human Carcinogenicity by Chronic Oral Exposure.Environ Sci Technol. 2023 Apr 25;57(16):6573-6588. doi: 10.1021/acs.est.3c00648. Epub 2023 Apr 11. Environ Sci Technol. 2023. PMID: 37040559 Free PMC article.
-
Predicting Prenatal Developmental Toxicity Based On the Combination of Chemical Structures and Biological Data.Environ Sci Technol. 2022 May 3;56(9):5984-5998. doi: 10.1021/acs.est.2c01040. Epub 2022 Apr 22. Environ Sci Technol. 2022. PMID: 35451820 Free PMC article.
-
Identification of tolerance levels on the cold-water coral Desmophyllum pertusum (Lophelia pertusa) from realistic exposure conditions to suspended bentonite, barite and drill cutting particles.PLoS One. 2022 Feb 22;17(2):e0263061. doi: 10.1371/journal.pone.0263061. eCollection 2022. PLoS One. 2022. PMID: 35192627 Free PMC article.
-
Elucidation of Agonist and Antagonist Dynamic Binding Patterns in ER-α by Integration of Molecular Docking, Molecular Dynamics Simulations and Quantum Mechanical Calculations.Int J Mol Sci. 2021 Aug 29;22(17):9371. doi: 10.3390/ijms22179371. Int J Mol Sci. 2021. PMID: 34502280 Free PMC article.
References
-
- Breiman L. Random forests. Mach Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. - DOI
-
- Committee on Toxicity Testing and Assessment of Environmental Agents N.R.C. Toxicity Testing in the 21st Century: A Vision and a Strategy. Washington, DC: The National Academies Press; 2007.
-
- Dalgaard P. Introductory Statistics with R. New York, NY: Springer Science & Business Media; 2008.
Grants and funding
LinkOut - more resources
Full Text Sources