Variability in observer performance studies experimental observations

Acad Radiol. 2005 Dec;12(12):1527-33. doi: 10.1016/j.acra.2005.08.010.

Abstract

Rationale and objectives: The aim of the study is to assess variance components in observer performance studies and the possible impact on study results and conclusions.

Materials and methods: Two previously performed retrospective receiver operating characteristic-type observer performance studies to evaluate the performance of seven radiologists in detecting interstitial disease on conventional posteroanterior chest films and nine radiologists in detecting interstitial disease on a high-resolution workstation were reanalyzed by using the Beiden, Wagner, and Campbell nine-component model to estimate the different variance components. We estimated case-, reader-, and mode-related components of the variance for the group as a whole and after excluding (round robin) each reader. Overall variance was evaluated, and the effect of individual readers on overall study conclusions was assessed.

Results: Overall results and conclusions of the reanalysis agreed with the original one in that, as a group, radiologists performed significantly better when using conventional films (P < .05) in both studies. Reader variability was large compared with all other components, and in one study, it was substantially larger for the workstation reading mode. Reader variability was affected substantially by one observer in each study, and in one study, reader-by-mode variability was affected by another reader who performed better on the workstation.

Conclusion: Estimates of variance components can shed light on the appropriateness of study design, as well as the sensitivity of results to the inclusion (or exclusion) of individual observers.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Humans
  • Image Interpretation, Computer-Assisted / methods*
  • Lung Diseases / diagnostic imaging*
  • Observer Variation*
  • Quality Assurance, Health Care / methods*
  • ROC Curve*
  • Radiography
  • Reproducibility of Results
  • Retrospective Studies
  • Sensitivity and Specificity
  • Task Performance and Analysis*
  • United States