Elucidating Celecoxib's Preventive Effect in Capecitabine-Induced Hand-Foot Syndrome Using Medical Natural Language Processing

JCO Clin Cancer Inform. 2025 Aug:9:e2500096. doi: 10.1200/CCI-25-00096. Epub 2025 Aug 12.

Abstract

Purpose: Capecitabine, an oral anticancer agent, frequently causes hand-foot syndrome (HFS), affecting patients' quality of life and treatment adherence. However, such symptomatic toxicities are often difficult to detect in structured electronic health record (EHR) data. This study primarily aimed to validate a natural language processing (NLP) approach to identifying capecitabine-induced HFS from unstructured clinical text and demonstrate its application in evaluating medication-associated adverse event trends in real-world settings.

Methods: We conducted a retrospective cohort study using EHRs from the University of Tokyo Hospital (2004-2021). HFS cases were identified using the MedNERN-CR-JA NLP model. After propensity score matching, we compared capecitabine users with and without celecoxib and assessed time to HFS onset using Cox proportional hazards models. NLP-based HFS detection was validated through manual annotation of aggregated clinical notes. Negative control and sensitivity analyses ensured robustness.

Results: Among 44,502 patients with cancer, 669 capecitabine users were analyzed. HFS incidence was significantly higher among capecitabine users (hazard ratio [HR], 1.93 [95% CI, 1.48 to 2.52]; P < .001) compared with nonusers. Celecoxib use showed a suggestive association with a reduced HFS risk (HR, 0.51 [95% CI, 0.24 to 1.07]; P = .073). The NLP model demonstrated high accuracy in identifying HFS, achieving a precision of 0.875, recall of 1.000, and F1 score of 0.933, based on manual annotation of patient-level clinical notes. Outcome trends remained consistent when using manually annotated HFS case labels instead of NLP-detected events, supporting the method's robustness.

Conclusion: These findings demonstrate the effectiveness of NLP in detecting HFS from real-world clinical records. The application to celecoxib-HFS detection illustrates the potential utility of this approach for retrospective safety analysis. Further work is needed to evaluate generalizability across diverse clinical settings.

MeSH terms

  • Aged
  • Antimetabolites, Antineoplastic* / adverse effects
  • Capecitabine* / adverse effects
  • Celecoxib* / therapeutic use
  • Electronic Health Records
  • Female
  • Hand-Foot Syndrome* / epidemiology
  • Hand-Foot Syndrome* / etiology
  • Hand-Foot Syndrome* / prevention & control
  • Humans
  • Male
  • Middle Aged
  • Natural Language Processing*
  • Neoplasms* / drug therapy
  • Retrospective Studies

Substances

  • Capecitabine
  • Celecoxib
  • Antimetabolites, Antineoplastic