Evaluating the clinical utility of multimodal large language models in rare maculopathy

Sci Rep. 2025 Dec 3;16(1):54. doi: 10.1038/s41598-025-29299-2.

Abstract

This study aimed to assess how multimodal large language models (MLLM) diagnose and differentiate Pentosan Polysulfate (PPS) Maculopathy from other phenotypic mimics. A retrospective review of clinical records and multimodal retinal imaging was conducted with patients from the Shiley Eye Institute and Casey Eye Institute. Four MLLMs (ChatGPT-4o, Claude 3.5 Sonnet, Google Gemini 1.5 Pro, Perplexity Llama 3.1 Sonar/Default) along with human retinal specialists answered prompts based on retinal imaging and demographic data. Performance was evaluated using accuracy, sensitivity and specificity estimates. The study included 126 eyes from 63 patients, with 36 eyes with PPS maculopathy, 50 eyes with Stargardt disease, and 40 eyes with PRPH2-associated multifocal pattern dystrophy. MLLMs showed improved accuracy and sensitivity when answer choices were restricted, with ChatGPT consistently performing best when all imaging modalities were prompted together. The inclusion of demographic data further enhanced performance in prompts with limited answer choices. Human retinal specialist evaluations aligned with MLLM performance trends and also improved with demographic data. While MLLMs show diagnostic potential, further refinement is needed before clinical implementation. These findings highlight the importance of prompt design and demographic data to optimize MLLM performance with retinal imaging modalities.

Keywords: Pattern dystrophy; Pentosan polysulfate sodium; Stargardt.

MeSH terms

  • Adult
  • Aged
  • Diagnosis, Differential
  • Female
  • Humans
  • Large Language Models
  • Macular Degeneration* / diagnosis
  • Macular Degeneration* / diagnostic imaging
  • Male
  • Middle Aged
  • Multimodal Imaging / methods
  • Retina / diagnostic imaging
  • Retina / pathology
  • Retrospective Studies
  • Sensitivity and Specificity
  • Stargardt Disease / diagnosis