Dosimetric assessment of deep learning based organ-at-risk segmentation: insights from the HaN-Seg challenge

Radiother Oncol. 2026 Apr:217:111387. doi: 10.1016/j.radonc.2026.111387. Epub 2026 Jan 23.

Abstract

Background and purpose: To extend the previously reported geometric analysis of HaN-Seg: The Head and Neck Organ-at-Risk CT and MR Segmentation Challenge by integrating a dosimetric evaluation, thereby offering a comprehensive assessment of challenge results with practical insights into their clinical applicability.

Materials and methods: Participating teams of the HaN-Seg challenge were tasked to auto-segment 30 organs-at-risk (OARs) in the head and neck region using paired contrast-enhanced computed tomography and T1-weighted magnetic resonance images. The teams were ranked according to their geometric performance, measured by the Dice similarity coefficient (DSC) and 95th-percentile Hausdorff distance (HD95). Here, we extend this evaluation with a forward dosimetric analysis, also known as dosimetric impact approximation, including the verification of OAR dosimetric restriction compliance, assessment of OAR priority ratings, evaluation of segmentation performance relative to tumor proximity, and correlation analysis between geometric and dosimetric metrics.

Results: All six teams from the previous geometric analysis were assessed for dosimetric performance on the original 14 test cases. Dosimetric analysis revealed minor performance differences among teams, with the best- and worst-performing teams achieving dosimetric compliance in 70.7% and 67.7% of OAR auto-segmentations, respectively. Most teams successfully met priority 1 dosimetric restrictions including the spinal cord, brainstem, optic chiasm, and optic nerves in 11 out of 14 test cases. The lowest compliance rates were observed for the oral cavity and submandibular glands. Correlation analysis revealed no clear relationship between geometric and dosimetric metrics.

Conclusion: The high dosimetric compliance highlights the practical utility of deep learning OAR auto-segmentation methods. Lower compliance for the oral cavity and submandibular glands most probably stems from their proximity to tumors and the corresponding steep dose gradients, where certain dosimetric constraints are inherently challenging to meet in clinical practice, or from the limitations of the forward dosimetric analysis. These findings underpin the critical need for both geometric and dosimetric evaluations of OAR auto-segmentation tools to ensure robust validation. Such a comprehensive assessment will be essential as commercial deep learning tools become increasingly integrated into the radiotherapy planning workflow.

Keywords: Computational challenge; Computed tomography; Deep learning; Dosimetric assessment; Dosimetric evaluation; Dosimetric restrictions; Head and neck cancer; Magnetic resonance; Organs-at-risk; Radiotherapy; Segmentation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Head and Neck Neoplasms* / diagnostic imaging
  • Head and Neck Neoplasms* / radiotherapy
  • Humans
  • Magnetic Resonance Imaging / methods
  • Organs at Risk* / diagnostic imaging
  • Organs at Risk* / radiation effects
  • Radiometry
  • Radiotherapy Dosage
  • Radiotherapy Planning, Computer-Assisted* / methods
  • Tomography, X-Ray Computed / methods