Visual-statistical interpretation of (18)F-FDG-PET images for characteristic Alzheimer patterns in a multicenter study: inter-rater concordance and relationship to automated quantitative evaluation

AJNR Am J Neuroradiol. 2014 Feb;35(2):244-9. doi: 10.3174/ajnr.A3665. Epub 2013 Aug 1.

Abstract

Background and purpose: The role of (18)F-FDG-PET in the diagnosis of Alzheimer disease is increasing and should be validated. The aim of this study was to assess the inter-rater variability in the interpretation of (18)F-FDG-PET images obtained in the Japanese Alzheimer's Disease Neuroimaging Initiative, a multicenter clinical research project.

Materials and methods: This study analyzed 274 (18)F-FDG-PET scans (67 mild Alzheimer disease, 100 mild cognitive impairment, and 107 normal cognitive) as baseline scans for the Japanese Alzheimer's Disease Neuroimaging Initiative, which were acquired with various types of PET or PET/CT scanners in 23 facilities. Three independent raters interpreted all PET images by using a combined visual-statistical method. The images were classified into 7 (FDG-7) patterns by the criteria of Silverman et al and further into 2 (FDG-2) patterns.

Results: Agreement among the 7 visual-statistical categories by at least 2 of the 3 readers occurred in >94% of cases for all groups: Alzheimer disease, mild cognitive impairment, and normal cognitive. Perfect matches by all 3 raters were observed for 62% of the cases by FDG-7 and 76 by FDG-2. Inter-rater concordance was moderate by FDG-7 (κ = 0.57) and substantial in FDG-2 (κ = 0.67) on average. The FDG-PET score, an automated quantitative index developed by Herholz et al, increased as the number of raters who voted for the AD pattern increased (ρ = 0.59, P < .0001), and the FDG-PET score decreased as those for normal pattern increased (ρ = -0.64, P < .0001).

Conclusions: Inter-rater agreement was moderate to substantial for the combined visual-statistical interpretation of (18)F-FDG-PET and was also significantly associated with automated quantitative assessment.

Publication types

  • Multicenter Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Alzheimer Disease / complications
  • Alzheimer Disease / diagnostic imaging*
  • Artificial Intelligence
  • Cognitive Dysfunction / complications
  • Cognitive Dysfunction / diagnostic imaging*
  • Data Interpretation, Statistical
  • Female
  • Fluorodeoxyglucose F18*
  • Humans
  • Image Enhancement / methods
  • Image Interpretation, Computer-Assisted / methods*
  • Japan
  • Male
  • Middle Aged
  • Observer Variation
  • Pattern Recognition, Automated / methods*
  • Positron-Emission Tomography / methods*
  • Radiopharmaceuticals
  • Reproducibility of Results
  • Sensitivity and Specificity

Substances

  • Radiopharmaceuticals
  • Fluorodeoxyglucose F18