Limitations of claims and registry data in surgical oncology research

Ann Surg Oncol. 2008 Feb;15(2):415-23. doi: 10.1245/s10434-007-9658-3. Epub 2007 Nov 7.


Studies based on large population-based data sets, such as administrative claims data and tumor registry data, have become increasingly common in surgical oncology research. These data sets can be acquired relatively easily, and they offer larger sample sizes and improved generalizability compared with institutional data. There are, however, significant limitations that must be considered in the analysis and interpretation of such data. Invalid conclusions can result when insufficient attention is paid to issues such as data quality and depth, potential sources of bias, missing data, type I error, and the assessment of statistical significance. This article reviews some important limitations of population-based data sets and the methods used to analyze them. The candid reporting of these issues in the literature and an increased awareness among surgical oncologists of these limitations will ensure that population-based studies in the surgical oncology literature achieve high standards of methodological quality and clinical utility.

MeSH terms

  • Bias
  • Biomedical Research
  • Causality
  • Confounding Factors, Epidemiologic
  • Humans
  • Insurance Claim Reporting
  • Medical Oncology*
  • Medicare
  • Models, Statistical
  • Multivariate Analysis
  • Outcome Assessment, Health Care* / standards
  • Registries*
  • Risk Adjustment
  • SEER Program / standards
  • Specialties, Surgical* / standards
  • United States