A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

Delora Baptista; Pedro G Ferreira; Miguel Rocha

doi:10.1371/journal.pcbi.1010200

A systematic evaluation of deep learning methods for the prediction of drug synergy in cancer

PLoS Comput Biol. 2023 Mar 23;19(3):e1010200. doi: 10.1371/journal.pcbi.1010200. eCollection 2023 Mar.

Authors

Delora Baptista^{1

2}, Pedro G Ferreira^{3

4

5

6}, Miguel Rocha^{1

2}

Affiliations

¹ CEB - Centre of Biological Engineering, University of Minho, Braga, Portugal.
² LABBELS - Associate Laboratory, Braga, Guimarães, Portugal.
³ Department of Computer Science, Faculty of Sciences, University of Porto, Porto, Portugal.
⁴ INESC TEC, Porto, Portugal.
⁵ Ipatimup - Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal.
⁶ i3s - Instituto de Investigação e Inovação em Saúde da Universidade do Porto, Porto, Portugal.

Abstract

One of the main obstacles to the successful treatment of cancer is the phenomenon of drug resistance. A common strategy to overcome resistance is the use of combination therapies. However, the space of possibilities is huge and efficient search strategies are required. Machine Learning (ML) can be a useful tool for the discovery of novel, clinically relevant anti-cancer drug combinations. In particular, deep learning (DL) has become a popular choice for modeling drug combination effects. Here, we set out to examine the impact of different methodological choices on the performance of multimodal DL-based drug synergy prediction methods, including the use of different input data types, preprocessing steps and model architectures. Focusing on the NCI ALMANAC dataset, we found that feature selection based on prior biological knowledge has a positive impact-limiting gene expression data to cancer or drug response-specific genes improved performance. Drug features appeared to be more predictive of drug response, with a 41% increase in coefficient of determination (R2) and 26% increase in Spearman correlation relative to a baseline model that used only cell line and drug identifiers. Molecular fingerprint-based drug representations performed slightly better than learned representations-ECFP4 fingerprints increased R2 by 5.3% and Spearman correlation by 2.8% w.r.t the best learned representations. In general, fully connected feature-encoding subnetworks outperformed other architectures. DL outperformed other ML methods by more than 35% (R2) and 14% (Spearman). Additionally, an ensemble combining the top DL and ML models improved performance by about 6.5% (R2) and 4% (Spearman). Using a state-of-the-art interpretability method, we showed that DL models can learn to associate drug and cell line features with drug response in a biologically meaningful way. The strategies explored in this study will help to improve the development of computational methods for the rational design of effective drug combinations for cancer therapy.

Copyright: © 2023 Baptista et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Deep Learning*
Humans
Machine Learning
Neoplasms* / drug therapy

Grants and funding

This study was supported by the Portuguese Foundation for Science and Technology (FCT), through a PhD scholarship (SFRH/BD/130913/2017 awarded to DB) and under the scope of the strategic funding of UIDB/04469/2020 unit. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.