Comparison of methods for classifying Hispanic ethnicity in a population-based cancer registry

Am J Epidemiol. 1999 Jun 1;149(11):1063-71. doi: 10.1093/oxfordjournals.aje.a009752.


The accuracy of ethnic classification can substantially affect ethnic-specific cancer statistics. In the Greater Bay Area Cancer Registry, which is part of the Surveillance, Epidemiology, and End Results (SEER) Program and of the statewide California Cancer Registry, Hispanic ethnicity is determined by medical record review and by matching to surname lists. This study compared these classification methods with self-report. Ethnic self-identification was obtained by surveying 1,154 area residents aged 20-89 years who were diagnosed with cancer in 1990 and were reported to the registry as being Hispanic or White non-Hispanic. Predictive value positive, sensitivity, and relative bias were used to assess the accuracy of Hispanic classification by medical record and surname. Among those persons classified as Hispanic by either or both of these sources, only two-thirds self-identified as Hispanic (predictive value positive = 66%), and about one-third of self-identified Hispanics were not classified as Hispanic (sensitivity = 68%). Classification based on either medical record or surname alone had a lower sensitivity (59% and 61%, respectively) but a higher predictive value positive (77% and 70%, respectively). Ethnic classification by medical record alone resulted in an underestimate of Hispanic cancer cases and incidence rates. Bias was reduced when medical records and surnames were used together to classify cancer cases as Hispanic.
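
The three accuracy measures named in the abstract can be illustrated with a short Python sketch. It assumes self-report is the reference standard and that relative bias is defined as (number classified Hispanic minus number self-identified Hispanic) divided by the number self-identified Hispanic; the abstract does not give the study's exact formula, so that definition, the function names, and the example counts are all assumptions made for illustration. Under that definition, relative bias equals sensitivity divided by predictive value positive, minus one.

```python
# Illustrative sketch only: function names and the relative-bias definition
# are assumptions, not taken from the study.
# tp: classified Hispanic by the registry and self-identified Hispanic
# fp: classified Hispanic by the registry but not self-identified Hispanic
# fn: self-identified Hispanic but not classified Hispanic by the registry


def predictive_value_positive(tp, fp):
    """Share of registry-classified Hispanics who self-identify as Hispanic."""
    return tp / (tp + fp)


def sensitivity(tp, fn):
    """Share of self-identified Hispanics whom the registry classifies as Hispanic."""
    return tp / (tp + fn)


def relative_bias(tp, fp, fn):
    """Assumed definition: (classified count - self-identified count) / self-identified count."""
    classified = tp + fp
    self_identified = tp + fn
    return (classified - self_identified) / self_identified


def relative_bias_from_rates(sens, ppv):
    """Same quantity from rates: classified / self-identified = sensitivity / PPV."""
    return sens / ppv - 1.0


if __name__ == "__main__":
    # Hypothetical counts, NOT the study's data.
    tp, fp, fn = 60, 20, 40
    print(f"PPV           = {predictive_value_positive(tp, fp):.2f}")
    print(f"Sensitivity   = {sensitivity(tp, fn):.2f}")
    print(f"Relative bias = {relative_bias(tp, fp, fn):+.2f}")

    # Rounded percentages quoted in the abstract: (sensitivity, PPV).
    reported = {
        "medical record or surname": (0.68, 0.66),
        "medical record alone": (0.59, 0.77),
        "surname alone": (0.61, 0.70),
    }
    for method, (sens, ppv) in reported.items():
        print(f"{method}: implied relative bias = {relative_bias_from_rates(sens, ppv):+.1%}")
```

Applied to the rounded percentages in the abstract, this arithmetic gives a negative value for medical record alone and a value close to zero for the combined method, in line with the reported underestimate and the reduced bias. These implied values are only consequences of the assumed definition and the rounded inputs, not results reported by the study.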

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • California / epidemiology
  • Female
  • Hispanic Americans / statistics & numerical data*
  • Humans
  • Incidence
  • Male
  • Neoplasms / ethnology*
  • Population Surveillance / methods*
  • Registries / statistics & numerical data*
  • San Francisco / epidemiology
  • Sensitivity and Specificity