Preserving Privacy when Querying OMOP CDM Databases

Stud Health Technol Inform. 2022 Aug 31:298:163-164. doi: 10.3233/SHTI220930.

Abstract

Anonymisation is currently one of the biggest challenges when sharing sensitive personal information. Its importance depends largely on the application domain, but when dealing with health information, this becomes a more serious issue. A simpler approach to avoid inadequate disclosure is to ensure that all data that can be associated directly with an individual is removed from the original dataset. However, some studies have shown that simple anonymisation procedures can sometimes be reverted using specific patients' characteristics. In this work, we propose a secure architecture to share information from distributed databases without compromising the subjects' privacy. The anonymiser system was validated using the OMOP CDM data schema, which is widely adopted in observational research studies.

Keywords: Data anonymisation; OHDSI; OMOP CDM; Privacy preserving; k-Anonymity; l-Diversity.

MeSH terms

  • Databases, Factual
  • Humans
  • Personally Identifiable Information*
  • Privacy*