Self-regulating the use of large language models in clinical practice: a risk-stratified approach

BMJ Health Care Inform. 2026 May 6;33(1):e101921. doi: 10.1136/bmjhci-2025-101921.

Abstract

The rapid integration of large language models (LLMs) into clinical practice offers promising benefits, including assistance with documentation, decision support and patient communication. However, these advantages are tempered by concerns around model accuracy, data privacy, clinician trust and regulatory responsibility. In this comment, we propose a clinician-led, risk-stratified framework to guide responsible adoption of LLMs in healthcare. The framework categorises applications into four risk tiers: low risk, moderate risk, high risk and critical risk. Each tier demands tailored oversight, validation and governance, with increasing levels of statutory regulatory scrutiny (eg., European Union Artificial Intelligence Act and U.S. Food and Drug Administration) and clinical supervision. We argue that proactive self-regulation combined with ongoing quality management and clinician education is essential to safely integrate LLMs into ongoing care.

Keywords: Artificial intelligence; Decision Support Systems, Clinical; Health Information Management; Implementation Science; Information Technology.

MeSH terms

  • Artificial Intelligence
  • Electronic Health Records*
  • Humans
  • Language*
  • Large Language Models
  • Risk Assessment