Comment

Jingxiang Chen; Yufeng Liu; Donglin Zeng; Rui Song; Yingqi Zhao; Michael R Kosorok

doi:10.1080/01621459.2016.1200914

J Am Stat Assoc. 2016;111(515):942-947. doi: 10.1080/01621459.2016.1200914. Epub 2016 Oct 18.

Authors

Jingxiang Chen¹, Yufeng Liu², Donglin Zeng¹, Rui Song³, Yingqi Zhao⁴, Michael R Kosorok⁵

Affiliations

¹ Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
² Department of Statistics and Operations Research, Department of Biostatistics, Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
³ Department of Statistics, North Carolina State University, Raleigh, NC, USA.
⁴ Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
⁵ Department of Biostatistics, Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.

Abstract

Xu, Müller, Wahed, and Thall proposed a Bayesian model to analyze an acute leukemia study involving multi-stage chemotherapy regimes. We discuss two alternative methods, Q-learning and O-learning, that address the same problem from a machine learning point of view. Numerical studies show that these methods can be flexible and, in some situations, have advantages in handling treatment heterogeneity while remaining robust to model misspecification.

Keywords: Dynamic treatment regimes; Multi-stage chemotherapy regimes; O-learning; Q-learning.
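
For readers unfamiliar with Q-learning in the dynamic treatment regime setting, the following is a minimal sketch of the general backward-induction idea, not the authors' implementation. It assumes a hypothetical two-stage trial with binary covariates (`x1`, `x2`), binary randomized treatments (`a1`, `a2`), and a single final outcome `y`; the Q-functions are estimated by simple cell means rather than the regression models a real analysis would use. The simulated data, the function names, and the effect sizes are all illustrative assumptions.

```python
import random
from collections import defaultdict


def cell_means(keys, values):
    """Average `values` within each distinct key (a saturated, tabular regression)."""
    sums, counts = defaultdict(float), defaultdict(int)
    for key, value in zip(keys, values):
        sums[key] += value
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def best_action(q, history):
    """Action in {0, 1} with the larger estimated Q-value; unseen cells count as -inf."""
    return max((0, 1), key=lambda a: q.get(history + (a,), float("-inf")))


def q_learning_two_stage(rows):
    """rows: iterable of (x1, a1, x2, a2, y) with binary x's and a's and real-valued y."""
    rows = list(rows)
    # Stage 2: estimate Q2(history, a2) = E[Y | x1, a1, x2, a2] by cell means.
    q2 = cell_means([r[:4] for r in rows], [r[4] for r in rows])
    # Pseudo-outcome: estimated value of acting optimally at stage 2.
    pseudo = [q2[r[:3] + (best_action(q2, r[:3]),)] for r in rows]
    # Stage 1: estimate Q1(x1, a1) by averaging the pseudo-outcome within (x1, a1).
    q1 = cell_means([r[:2] for r in rows], pseudo)
    # Estimated rules: pick the action with the larger Q-value for each history.
    rule2 = {h: best_action(q2, h) for h in {k[:3] for k in q2}}
    rule1 = {h[0]: best_action(q1, h) for h in {k[:1] for k in q1}}
    return rule1, rule2


def simulate(n, seed=0):
    """Hypothetical trial: matching treatment to the current covariate helps (assumption)."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        x1, a1, x2, a2 = (rng.randint(0, 1) for _ in range(4))
        y = 0.5 * (a1 == x1) + 1.0 * (a2 == x2) + rng.gauss(0, 0.5)
        rows.append((x1, a1, x2, a2, y))
    return rows


rule1, rule2 = q_learning_two_stage(simulate(4000))
```

In this toy setup the outcome is highest when each treatment matches the covariate observed at that stage, so with a reasonably large sample the estimated rules should recommend `a1 = x1` and `a2 = x2`. O-learning takes a different route, estimating the rule directly through a weighted classification problem, and is not sketched here.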
