Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Mar 17;30(11):3710-3716.e4.
doi: 10.1016/j.celrep.2020.02.094.

Machine Learning Uncovers Food- and Excipient-Drug Interactions

Affiliations

Machine Learning Uncovers Food- and Excipient-Drug Interactions

Daniel Reker et al. Cell Rep. .

Abstract

Inactive ingredients and generally recognized as safe compounds are regarded by the US Food and Drug Administration (FDA) as benign for human consumption within specified dose ranges, but a growing body of research has revealed that many inactive ingredients might have unknown biological effects at these concentrations and might alter treatment outcomes. To speed up such discoveries, we apply state-of-the-art machine learning to delineate currently unknown biological effects of inactive ingredients-focusing on P-glycoprotein (P-gp) and uridine diphosphate-glucuronosyltransferase-2B7 (UGT2B7), two proteins that impact the pharmacokinetics of approximately 20% of FDA-approved drugs. Our platform identifies vitamin A palmitate and abietic acid as inhibitors of P-gp and UGT2B7, respectively; in silico, in vitro, ex vivo, and in vivo validations support these interactions. Our predictive framework can elucidate biological effects of commonly consumed chemical matter with implications on food- and excipient-drug interactions and functional drug formulation development.

Keywords: data science; drug delivery; excipient-drug interactions; food-drug interactions; inactive ingredients; machine learning; pharmacokinetics; pharmacology; virtual screening.

PubMed Disclaimer

Conflict of interest statement

Declaration of Interests Complete details of all relationships for profit and not for profit for G.T. can be found at the following link: https://www.dropbox.com/sh/szi7vnr4a2ajb56/AABs5N5i0q9AfT1IqIJAE-T5a?dl=0. Complete details for R.L. can be found at the following link: https://www.dropbox.com/s/yc3xqb5s8s94v7x/Rev%20Langer%20COI.pdf?dl=0. D.R., R.L., and G.T. are co-inventors on a provisional patent application encompassing systems and algorithms capable of quantifying and providing IIG burden in medications and their potential biological implications.

Figures

Figure 1.
Figure 1.. Inactive Ingredients and GRAS Compounds Resemble FDA-Approved Drugs and Exert Known or Potentially Novel Bioactivities
(A) Schematic visualizing the general workflow of the study and the utilized datasets. Briefly, CAS numbers for generally recognized as safe (GRAS) and inactive ingredient (IIG) compounds were extracted and curated from the FDA website (https://www.fda.gov) and translated into SMILES structural representations using the CACTUS NIH webserver (https://cactus.nci.nih.gov). These chemical representations were then harnessed to calculate physicochemical properties (http://rdkit.org) and compare the property distributions with approved drugs (https://www.drugbank.ca). Biological activity data were extracted from ChEMBL22 (http://ebi.ac.uk/chembl) to identify previously reported activities for GRAS/IIG compounds and build machine learning models (https://scikit-learn.org) to predict additional biological activities of GRAS/IIG compounds. (B) Distribution of molecular weight (MW), calculated logP, and the fraction of rotational bonds (rot bonds) among GRAS (light blue) and IIG (dark blue) compared to FDA-approved drugs in the DrugBank database (DRUGS, orange). Summary statistics represented through boxplots show considerable overlap in the three distinct distributions. (C) Visualization of chemical space spanned by GRAS (light blue) and IIG (dark blue) compared to approved drugs stored within the DrugBank 5.0 database (orange). Projection based on t-Distributed Stochastic Neighbor Embedding (t-SNE) using Morgan fingerprints (r = 4, 2,048 bits; RDKit) is shown. (D) Pharmacology network of GRAS and IIG. Compounds are shown as light blue (GRAS) or dark blue (IIG) nodes; protein targets (ChEMBL22) are shown in red.A compound and a target are connected either when the compound has been previously measured to interact with the protein (black edge) or when machine learning models predicted that the compound is likely to interact with the protein (Z score > 4; gray edge). (E and F) Distribution of number of previously reported (left, E) and computationally predicted (right, F) activities on the level of different protein families (inner pie charts). Top seven families are labeled. Outer pie charts visualize the number of reported or predicted activities per protein. Proteins for which more than 10 GRAS or IIG compounds have been reported or predicted to modulate their activity have been annotated.
Figure 2.
Figure 2.. Gum Rosin and Abietic Acid Inhibit UGT2B7 Activity
(A) Chemical structures of abietic acid (1) and training data compound isolongifolic acid (2). (B) In vitro validation shows that gum rosin (black circles) and abietic acid (orange squares) inhibit UGT2B7 activity in microsomes. (C) The effect of abietic acid (orange) on UGT activity was confirmed in complex tissue liver lysates, where it slowed the conversion of a proprietary UGT substrate (Biovision K692; gray). (D) Computational docking indicates that abietic acid has the potential to interact with UGT2B7 at the interface of the substrate- (gold) and the co-factor-binding (cyan) domains.
Figure 3.
Figure 3.. Vitamin A Palmitate Modulates P-gp Activity
(A) P-gp is involved in the transport of 8% of all approved drugs, spanning a broad range of different indications (DrugBank 5.0). Complete bar corresponds to 170 approved drugs. Only sub-classifications with at least three drugs are visualized on the right. (B) Structure of vitamin A palmitate (3). (C) Vitamin A palmitate inhibits P-gp activity in HepG cells with an IC50 of 2.9 ± 3.6 µM. (Biovision K507) Data plotted as mean and standard deviation, curve fitted in Prism using the standard three parameter equation for “inhibitor vs. response”. (D) Schematic of ex vivo tissue permeability experiment in (E). (E) Vitamin A palmitate increases the permeability of the four known P-gp substrates irinotecan, ranitidine, colchicine, and loperamide across porcine intestinal tissue. p ≤ 0.001; two-tailed t test with Holm-Sidak correction. (F) Schematic of in vivo experiment in (G). (G) Vitamin A palmitate shows a mild increase of systemic warfarin, a known P-gp substrate, after oral delivery in mice. p = 0.04; one-tailed t test. (H) Computational docking suggests that vitamin A palmitate can bind the ATPase site of P-gp (blue mesh) with a stabilizing hydrogen bond formed with ARG1047 (dashed yellow line; see black arrow).

Similar articles

Cited by

References

    1. Apponi LH, Leung SW, Williams KR, Valentini SR, Corbett AH, and Pavlath GK. (2010). Loss of nuclear poly(A)-binding protein 1 causes defects in myogenesis and mRNA biogenesis. Hum. Mol. Genet 19, 1058–1065. - PMC - PubMed
    1. Arnold K, Bordoli L, Kopp J, and Schwede T. (2006). The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22, 195–201. - PubMed
    1. Bélanger AS, Caron P, Harvey M, Zimmerman PA, Mehlotra RK, and Guillemette C. (2009). Glucuronidation of the antiretroviral drug efavirenz by UGT2B7 and an in vitro investigation of drug-drug interaction with zidovudine. Drug Metab. Dispos 37, 1793–1796. - PMC - PubMed
    1. Bento AP, Gaulton A, Hersey A, Bellis LJ, Chambers J, Davies M, Krüger FA, Light Y, Mak L, McGlinchey S, et al. (2014). The ChEMBL bioactivity database: an update. Nucleic Acids Res 42, D1083–D1090. - PMC - PubMed
    1. Binas B, Danneberg H, McWhir J, Mullins L, and Clark AJ. (1999). Requirement for the heart-type fatty acid binding protein in cardiac fatty acid utilization. FASEB J 13, 805–812. - PubMed

Publication types

MeSH terms