In silico prediction of siRNA ionizable-lipid nanoparticles In vivo efficacy: Machine learning modeling based on formulation and molecular descriptors

Abdelkader A Metwally; Amira A Nayel; Rania M Hathout

doi:10.3389/fmolb.2022.1042720

In silico prediction of siRNA ionizable-lipid nanoparticles In vivo efficacy: Machine learning modeling based on formulation and molecular descriptors

Front Mol Biosci. 2022 Dec 21:9:1042720. doi: 10.3389/fmolb.2022.1042720. eCollection 2022.

Authors

Abdelkader A Metwally^{1

2}, Amira A Nayel^{3

4}, Rania M Hathout²

Affiliations

¹ Department of Pharmaceutics, Faculty of Pharmacy, Health Sciences Center, Kuwait University, Kuwait City, Kuwait.
² Department of Pharmaceutics and Industrial Pharmacy, Faculty of Pharmacy, Ain Shams University, Cairo, Egypt.
³ Clinical Pharmacy Department, Alexandria Ophthalmology Hospital, Alexandria, Egypt.
⁴ Department of Clinical Pharmacy and Pharmacy Practice, Faculty of Pharmacy, Alexandria University, Alexandria, Egypt.

Abstract

In silico prediction of the in vivo efficacy of siRNA ionizable-lipid nanoparticles is desirable as it can save time and resources dedicated to wet-lab experimentation. This study aims to computationally predict siRNA nanoparticles in vivo efficacy. A data set containing 120 entries was prepared by combining molecular descriptors of the ionizable lipids together with two nanoparticles formulation characteristics. Input descriptor combinations were selected by an evolutionary algorithm. Artificial neural networks, support vector machines and partial least squares regression were used for QSAR modeling. Depending on how the data set is split, two training sets and two external validation sets were prepared. Training and validation sets contained 90 and 30 entries respectively. The results showed the successful predictions of validation set log (siRNA dose) with R_val ²= 0.86-0.89 and 0.75-80 for validation sets one and two, respectively. Artificial neural networks resulted in the best R_val ² for both validation sets. For predictions that have high bias, improvement of R_val ² from 0.47 to 0.96 was achieved by selecting the training set lipids lying within the applicability domain. In conclusion, in vivo performance of siRNA nanoparticles was successfully predicted by combining cheminformatics with machine learning techniques.

Keywords: QSAR; in vivo; ionizable lipids; machine learning; nanoparticles; siRNA.