Evaluating the quality of anonymous record linkage using deterministic procedures with the New York State AIDS registry and a hospital discharge file

Stat Med. 1995;14(5-7):499-509. doi: 10.1002/sim.4780140511.

Abstract

Linkage of same-person records across multiple databases relies on high-quality, uniformly available identifying information. These data quality issues become increasingly important when personal names are not available for record linkage. Using deterministic decision criteria, we linked records from two population-based files in the absence of personal names. The sensitivity of anonymous record linkage procedures ranged from 32 to 85 per cent for the two years studied, and the positive predictive value (PPV) ranged from 14 to 99 per cent. Decreasing sensitivity and PPV were primarily attributed to (1) errors in computerized identifying information and (2) the deterministic decision criteria specified for record linkage. An evaluation of the contribution of personal names to the quality of record linkage found no measurable impact.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Acquired Immunodeficiency Syndrome / epidemiology*
  • Algorithms
  • Databases, Factual*
  • Disease Notification
  • Female
  • Hospital Information Systems
  • Humans
  • Male
  • Medical Record Linkage / methods
  • Medical Record Linkage / standards*
  • New York / epidemiology
  • Patient Discharge / statistics & numerical data*
  • Population Surveillance
  • Predictive Value of Tests
  • Quality Control
  • Random Allocation
  • Registries*