Development and validation of a risk prediction model to diagnose Barrett's oesophagus (MARK-BE): a case-control machine learning approach

Avi Rosenfeld; David G Graham; Sarah Jevons; Jose Ariza; Daryl Hagan; Ash Wilson; Samuel J Lovat; BEST2 study group; Sarmed S Sami; Omer F Ahmad; Marco Novelli; Manuel Rodriguez Justo; Alison Winstanley; Eliyahu M Heifetz; Mordehy Ben-Zecharia; Uria Noiman; Rebecca C Fitzgerald; Peter Sasieni; Laurence B Lovat

doi:10.1016/S2589-7500(19)30216-X

Development and validation of a risk prediction model to diagnose Barrett's oesophagus (MARK-BE): a case-control machine learning approach

Lancet Digit Health. 2020 Jan 1;2(1):E37-E48. doi: 10.1016/S2589-7500(19)30216-X. Epub 2019 Dec 5.

Authors

Avi Rosenfeld^{1

2}, David G Graham^{2

3}, Sarah Jevons², Jose Ariza^{2

3}, Daryl Hagan², Ash Wilson², Samuel J Lovat²; BEST2 study group; Sarmed S Sami^{2

3}, Omer F Ahmad^{2

3}, Marco Novelli⁴, Manuel Rodriguez Justo⁴, Alison Winstanley⁴, Eliyahu M Heifetz⁵, Mordehy Ben-Zecharia⁵, Uria Noiman⁵, Rebecca C Fitzgerald⁶, Peter Sasieni^{7

8}, Laurence B Lovat^{2

3}

Affiliations

¹ Department of Industrial Engineering Jerusalem College of Technology (JCT), Jerusalem, Israel.
² GENIE GastroENterological IntervEntion Group, Department for Targeted Intervention, University College London (UCL), London, United Kingdom.
³ Gastrointestinal Services, University College London Hospital (UCLH), London, United Kingdom.
⁴ Dept of Pathology, University College London Hospital (UCLH), London, United Kingdom.
⁵ Department of Health Informatics, Jerusalem College of Technology (JCT), Jerusalem, Israel.
⁶ MRC Cancer Unit, University of Cambridge, Cambridge, United Kingdom.
⁷ Cancer Prevention Trials Unit, Queen Mary University of London, London, United Kingdom.
⁸ School of Cancer & Pharmaceutical Sciences, King's College London, London, United Kingdom.

Abstract

Background: Screening for Barrett's Oesophagus (BE) relies on endoscopy which is invasive and has a low yield. This study aimed to develop and externally validate a simple symptom and risk-factor questionnaire to screen for patients with BE.

Methods: Questionnaires from 1299 patients in the BEST2 case-controlled study were analysed: 880 had BE including 40 with invasive oesophageal adenocarcinoma (OAC) and 419 were controls. This was randomly split into a training cohort of 776 patients and an internal validation cohort of 523 patients. External validation included 398 patients from the BOOST case-controlled study: 198 with BE (23 with OAC) and 200 controls. Identification of independently important diagnostic features was undertaken using machine learning techniques information gain (IG) and correlation based feature selection (CFS). Multiple classification tools were assessed to create a multi-variable risk prediction model. Internal validation was followed by external validation in the independent dataset.

Findings: The BEST2 study included 40 features. Of these, 24 added IG but following CFS, only 8 demonstrated independent diagnostic value including age, gender, smoking, waist circumference, frequency of stomach pain, duration of heartburn and acid taste and taking of acid suppression medicines. Logistic regression offered the highest prediction quality with AUC (area under the receiver operator curve) of 0.87. In the internal validation set, AUC was 0.86. In the BOOST external validation set, AUC was 0.81.

Interpretation: The diagnostic model offers valid predictions of diagnosis of BE in patients with symptomatic gastroesophageal reflux, assisting in identifying who should go forward to invasive testing. Overweight men who have been taking stomach medicines for a long time may merit particular consideration for further testing. The risk prediction tool is quick and simple to administer but will need further calibration and validation in a prospective study in primary care.

Funding: Charles Wolfson Trust and Guts UK.

Publication types

Research Support, Non-U.S. Gov't
Validation Study

MeSH terms

Aged
Barrett Esophagus / diagnosis*
Case-Control Studies
Female
Forecasting
Humans
Machine Learning*
Male
Middle Aged
Prospective Studies
Risk Assessment / standards*
United Kingdom

Abstract

Publication types

MeSH terms

Grants and funding