Multicenter Multivendor Evaluation of Dose Volume Histogram Creation Consistencies for 8 Commercial Radiation Therapy Dosimetric Systems

Pract Radiat Oncol. 2024 May-Jun;14(3):e236-e248. doi: 10.1016/j.prro.2023.09.009. Epub 2023 Oct 31.


Purpose: To evaluate dose volume histogram (DVH) construction differences across 8 major commercial treatment planning systems (TPS) and dose reporting systems for clinically treated plans of various anatomic sites and target sizes.

Methods and materials: Dose files from 10 selected clinically treated plans with a hypofractionation, stereotactic radiation therapy prescription or sharp dose gradients such as head and neck plans ranging from prescription doses of 18 Gy in 1 fraction to 70 Gy in 35 fractions, each calculated at 0.25 and 0.125 cm grid size, were created and anonymized in Eclipse TPS, and exported to 7 other major TPS (Pinnacle, RayStation, and Elements) and dose reporting systems (MIM, Mobius, ProKnow, and Velocity) systems for comparison. Dose-volume constraint points of clinical importance for each plan were collected from each evaluated system (D0.03 cc [Gy], volume, and the mean dose were used for structures without specified constraints). Each reported constraint type and structure volume was normalized to the value from Eclipse for a pairwise comparison. A Wilcoxon rank-sum test was used for statistical significance and a multivariable regression model was evaluated adjusting for plan, grid size, and distance to target center.

Results: For all DVH points relative to Eclipse, all systems reported median values within 1.0% difference of each other; however, they were all different from Eclipse. Considering mean values, Pinnacle, RayStation, and Elements averaged at 1.038, 1.046, and 1.024, respectively, while MIM, Mobius, ProKnow, and Velocity reported 1.026, 1.050, 1.033, and 1.022, respectively relative to Eclipse. Smaller dose grid size improved agreement between the systems marginally without statistical significance. For structure volumes relative to Eclipse, larger differences are seen across all systems with a range in median values up to 3.0% difference and mean up to 10.1% difference.

Conclusions: Large variations were observed between all systems. Eclipse generally reported, at statistically significant levels, lower values than all other evaluated systems. The nonsignificant change resulting from lowering the dose grid resolution indicates that this resolution may be less important than other aspects of calculating DVH curves, such as the 3-dimensional modeling of the structure.

Publication types

  • Multicenter Study

MeSH terms

  • Head and Neck Neoplasms / radiotherapy
  • Humans
  • Radiometry / methods
  • Radiotherapy Dosage*
  • Radiotherapy Planning, Computer-Assisted* / methods
  • Radiotherapy, Intensity-Modulated / methods