Data-driven choice of a model selection method in joinpoint regression

J Appl Stat. 2022 Apr 18;50(9):1992-2013. doi: 10.1080/02664763.2022.2063265. eCollection 2023.

Abstract

Selecting the number of change points in segmented line regression is an important problem in trend analysis, and there have been various approaches proposed in the literature. We first study the empirical properties of several model selection procedures and propose a new method based on two Schwarz type criteria, a classical Bayes Information Criterion (BIC) and the one with a harsher penalty than BIC (BIC3). The proposed rule is designed to use the former when effect sizes are small and the latter when the effect sizes are large and employs the partial R2 to determine the weight between BIC and BIC3. The proposed method is computationally much more efficient than the permutation test procedure that has been the default method of Joinpoint software developed for cancer trend analysis, and its satisfactory performance is observed in our simulation study. Simulations indicate that the proposed method performs well in keeping the probability of correct selection at least as large as that of BIC3, whose performance is comparable to that of the permutation test procedure, and improves BIC3 when it performs worse than BIC. The proposed method is applied to the U.S. prostate cancer incidence and mortality rates.

Keywords: Bayesian information criterion; change-point; probability of correct selection; segmented line regression; weighted.

Grants and funding

H.-J. Kim's research was partially supported by U. S. National Institute of Health Contract HHC 26120150003B and also a part of the research was conducted while H.-J. Kim was visiting the U.S. National Cancer Institute under the support of the Intergovernmental Personnel Act program.