Using Machine Learning to Individualize Treatment Effect Estimation: Challenges and Opportunities

Clin Pharmacol Ther. 2024 Apr;115(4):710-719. doi: 10.1002/cpt.3159. Epub 2024 Jan 12.

Abstract

The use of data from randomized clinical trials to justify treatment decisions for real-world patients is the current state of the art. It relies on the assumption that average treatment effects from the trial can be extrapolated to patients with personal and/or disease characteristics different from those treated in the trial. Yet, because of heterogeneity of treatment effects between patients and between the trial population and real-world patients, this assumption may not be correct for many patients. Using machine learning to estimate the expected conditional average treatment effect (CATE) in individual patients from observational data offers the potential for more accurate estimation of the expected treatment effects in each patient based on their observed characteristics. In this review, we discuss some of the challenges and opportunities for machine learning to estimate CATE, including ensuring identification assumptions are met, managing covariate shift, and learning without access to the true label of interest. We also discuss the potential applications as well as future work and collaborations needed to further improve identification and utilization of CATE estimates to increase patient benefit.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Causality
  • Humans
  • Machine Learning*