Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
, 2019, 8321942

Using Machine Learning to Predict Progression in the Gastric Precancerous Process in a Population From a Developing Country Who Underwent a Gastroscopy for Dyspeptic Symptoms


Using Machine Learning to Predict Progression in the Gastric Precancerous Process in a Population From a Developing Country Who Underwent a Gastroscopy for Dyspeptic Symptoms

Susan Thapa et al. Gastroenterol Res Pract.


Background: Gastric cancer is the fourth most common cancer and the third most common cause of cancer deaths worldwide. Morbidity and mortality from gastric cancer may be decreased by identification of those that are at high risk for progression in the gastric precancerous process so that they can be monitored over time for early detection and implementation of preventive strategies.

Method: Using machine learning, we developed prediction models for gastric precancerous progression in a population from a developing country with a high rate of gastric cancer who underwent gastroscopies for dyspeptic symptoms. In the data imputed for completeness, we divided the data into a training and a validation test set. Using the training set, we used the random forest method to rank potential predictors based on their predictive importance. Using predictors identified by the random forest method, we conducted best subset linear regressions with the leave-one-out cross-validation approach to select predictors for overall progression and progression to dysplasia or cancer. We validated the models in the test set using leave-one-out cross-validation.

Results: We observed for all models that complete intestinal metaplasia and incomplete intestinal metaplasia were the strongest predictors for further progression in the precancerous process. We also observed that a diagnosis of no gastritis, superficial gastritis, or antral diffuse gastritis at baseline was a predictor of no progression in the gastric precancerous process. The sensitivities and specificities were 86% and 79% for the general model and 100% and 82% for the location-specific model, respectively.

Conclusion: We developed prediction models to identify gastroscopy patients that are more likely to progress in the gastric precancerous process, among whom routine follow-up gastroscopies can be targeted to prevent gastric cancer. Future external validation is needed.


Figure 1
Figure 1
Steps of data analyses for prediction of gastric precancerous progression.

Similar articles

See all similar articles


    1. Herrero R., Park J. Y., Forman D. The fight against gastric cancer – the IARC working group report. Best Practice & Research Clinical Gastroenterology. 2014;28(6):1107–1114. doi: 10.1016/j.bpg.2014.10.003. - DOI - PubMed
    1. Ferlay J., Soerjomataram I., Dikshit R., et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. International Journal of Cancer. 2015;136(5):E359–E386. doi: 10.1002/ijc.29210. - DOI - PubMed
    1. Howlader N. N., Noone A. M., Krapcho M., et al., editors. SEER Cancer Statistics Review, 1975–2014. Bethesda, MD, USA: National Cancer Institute; 2016.
    1. Leung W. K., Ho H. J., Lin J. T., Wu M. S., Wu C. Y. Prior gastroscopy and mortality in patients with gastric cancer: a matched retrospective cohort study. Gastrointestinal Endoscopy. 2018;87(1):119–127.e3. doi: 10.1016/j.gie.2017.06.013. - DOI - PubMed
    1. Matsumoto S., Ishikawa S., Yoshida Y. Reduction of gastric cancer mortality by endoscopic and radiographic screening in an isolated island: a retrospective cohort study. Australian Journal of Rural Health. 2013;21(6):319–324. doi: 10.1111/ajr.12064. - DOI - PubMed

LinkOut - more resources