Evaluating performance of a user-trained MR lung tumor autocontouring algorithm in the context of intra- and interobserver variations

Eugene Yip; Jihyun Yun; Zsolt Gabos; Sarah Baker; Don Yee; Keith Wachowicz; Satyapal Rathee; B Gino Fallone

doi:10.1002/mp.12687

Evaluating performance of a user-trained MR lung tumor autocontouring algorithm in the context of intra- and interobserver variations

Med Phys. 2018 Jan;45(1):307-313. doi: 10.1002/mp.12687. Epub 2017 Dec 15.

Authors

Eugene Yip^{1

2}, Jihyun Yun^{1

2}, Zsolt Gabos^{3

4}, Sarah Baker^{3

4}, Don Yee^{3

4}, Keith Wachowicz^{1

2}, Satyapal Rathee^{1

2}, B Gino Fallone^{1

2}

Affiliations

¹ Department of Medical Physics, Cross Cancer Institute, 11560 University Avenue, Edmonton, AB, T6G 1Z2, Canada.
² Medical Physics Division, Department of Oncology, University of Alberta, 11560 University Avenue, Edmonton, AB, T6G 1Z2, Canada.
³ Department of Radiation Oncology, Cross Cancer Institute, 11560 University Avenue, Edmonton, AB, T6G 1Z2, Canada.
⁴ Radiation Oncology Division, Department of Oncology, University of Alberta, 11560 University Avenue, Edmonton, AB, T6G 1Z2, Canada.

PMID: 29159957
DOI: 10.1002/mp.12687

Abstract

Purpose: Real-time tracking of lung tumors using magnetic resonance imaging (MRI) has been proposed as a potential strategy to mitigate the ill-effects of breathing motion in radiation therapy. Several autocontouring methods have been evaluated against a "gold standard" of a single human expert user. However, contours drawn by experts have inherent intra- and interobserver variations. In this study, we aim to evaluate our user-trained autocontouring algorithm with manually drawn contours from multiple expert users, and to contextualize the accuracy of these autocontours within intra- and interobserver variations.

Methods: Six nonsmall cell lung cancer patients were recruited, with institutional ethics approval. Patients were imaged with a clinical 3 T Philips MR scanner using a dynamic 2D balanced SSFP sequence under free breathing. Three radiation oncology experts, each in two separate sessions, contoured 130 dynamic images for each patient. For autocontouring, the first 30 images were used for algorithm training, and the remaining 100 images were autocontoured and evaluated. Autocontours were compared against manual contours in terms of Dice's coefficient (DC) and Hausdorff distances (d_H ). Intra- and interobserver variations of the manual contours were also evaluated.

Results: When compared with the manual contours of the expert user who trained it, the algorithm generates autocontours whose evaluation metrics (same session: DC = 0.90(0.03), d_H = 3.8(1.6) mm; different session DC = 0.88(0.04), d_H = 4.3(1.5) mm) are similar to or better than intraobserver variations (DC = 0.88(0.04), and d_H = 4.3(1.7) mm) between two sessions. The algorithm's autocontours are also compared to the manual contours from different expert users with evaluation metrics (DC = 0.87(0.04), d_H = 4.8(1.7) mm) similar to interobserver variations (DC = 0.87(0.04), d_H = 4.7(1.6) mm).

Conclusions: Our autocontouring algorithm delineates tumor contours (<20 ms per contour), in dynamic MRI of lung, that are comparable to multiple human experts (several seconds per contour), but at a much faster speed. At the same time, the agreement between autocontours and manual contours is comparable to the intra- and interobserver variations. This algorithm may be a key component of the real time tumor tracking workflow for our hybrid Linac-MR device in the future.

Keywords: MRI guidance; autocontouring; linac-MRI hybrid; observer variability; respiratory motion management; tumor tracking.

Publication types

Evaluation Study

MeSH terms

Algorithms*
Carcinoma, Non-Small-Cell Lung / diagnostic imaging
Humans
Image Interpretation, Computer-Assisted / methods*
Lung / diagnostic imaging
Lung Neoplasms / diagnostic imaging*
Magnetic Resonance Imaging* / methods
Observer Variation