ESKD Risk Prediction Model in a Multicenter Chronic Kidney Disease Cohort in China: A Derivation, Validation, and Comparison Study

J Clin Med. 2023 Feb 14;12(4):1504. doi: 10.3390/jcm12041504.

Abstract

Background and objectives: In light of the growing burden of chronic kidney disease (CKD), it is of particular importance to create disease prediction models that can assist healthcare providers in identifying cases of CKD individual risk and integrate risk-based care for disease progress management. The objective of this study was to develop and validate a new pragmatic end-stage kidney disease (ESKD) risk prediction utilizing the Cox proportional hazards model (Cox) and machine learning (ML).

Design, setting, participants, and measurements: The Chinese Cohort Study of Chronic Kidney Disease (C-STRIDE), a multicenter CKD cohort in China, was employed as the model's training and testing datasets, with a split ratio of 7:3. A cohort from Peking University First Hospital (PKUFH cohort) served as the external validation dataset. The participants' laboratory tests in those cohorts were conducted at PKUFH. We included individuals with CKD stages 1~4 at baseline. The incidence of kidney replacement therapy (KRT) was defined as the outcome. We constructed the Peking University-CKD (PKU-CKD) risk prediction model employing the Cox and ML methods, which include extreme gradient boosting (XGBoost) and survival support vector machine (SSVM). These models discriminate metrics by applying Harrell's concordance index (Harrell's C-index) and Uno's concordance (Uno's C). The calibration performance was measured by the Brier score and plots.

Results: Of the 3216 C-STRIDE and 342 PKUFH participants, 411 (12.8%) and 25 (7.3%) experienced KRT with mean follow-up periods of 4.45 and 3.37 years, respectively. The features included in the PKU-CKD model were age, gender, estimated glomerular filtration rate (eGFR), urinary albumin-creatinine ratio (UACR), albumin, hemoglobin, medical history of type 2 diabetes mellitus (T2DM), and hypertension. In the test dataset, the values of the Cox model for Harrell's C-index, Uno's C-index, and Brier score were 0.834, 0.833, and 0.065, respectively. The XGBoost algorithm values for these metrics were 0.826, 0.825, and 0.066, respectively. The SSVM model yielded values of 0.748, 0.747, and 0.070, respectively, for the above parameters. The comparative analysis revealed no significant difference between XGBoost and Cox, in terms of Harrell's C, Uno's C, and the Brier score (p = 0.186, 0.213, and 0.41, respectively) in the test dataset. The SSVM model was significantly inferior to the previous two models (p < 0.001), in terms of discrimination and calibration. The validation dataset showed that XGBoost was superior to Cox, regarding Harrell's C, Uno's C, and the Brier score (p = 0.003, 0.027, and 0.032, respectively), while Cox and SSVM were almost identical concerning these three parameters (p = 0.102, 0.092, and 0.048, respectively).

Conclusions: We developed and validated a new ESKD risk prediction model for patients with CKD, employing commonly measured indicators in clinical practice, and its overall performance was satisfactory. The conventional Cox regression and certain ML models exhibited equal accuracy in predicting the course of CKD.

Keywords: chronic kidney disease; machine learning; prediction model; progression.

Grants and funding

This study was supported by grants from the National Key R&D Program of the Ministry of Science and Technology of China (2020YFC2005003), the National Natural Science Foundation of China (72125009, 91846101, 81771938, 81900665, 82003529, and 82090021), the Chinese Scientific and Technical Innovation Project 2030 (2018AAA0102100), the University of Michigan Health System–Peking University Health Science Center Joint Institute for Translational and Clinical Research (BMU2018JI012 and BMU2019JI005), the CAMS Innovation Fund for Medical Sciences (2019-I2M-5-046), and the PKU-Baidu Fund (2019BD017, 2020BD004, 2020BD005, 2020BD032, 2020BD026, and 2020BD044). Project 2020-JKCS-009 was supported by the Non-profit Central Research Institute, project 2018YFC1314004 was supported by the National Key Research and Development Program of China.