A globally optimal k-anonymity method for the de-identification of health data
- PMID: 19567795
- PMCID: PMC2744718
- DOI: 10.1197/jamia.M3144
A globally optimal k-anonymity method for the de-identification of health data
Abstract
Background: Explicit patient consent requirements in privacy laws can have a negative impact on health research, leading to selection bias and reduced recruitment. Often legislative requirements to obtain consent are waived if the information collected or disclosed is de-identified.
Objective: The authors developed and empirically evaluated a new globally optimal de-identification algorithm that satisfies the k-anonymity criterion and that is suitable for health datasets.
Design: Authors compared OLA (Optimal Lattice Anonymization) empirically to three existing k-anonymity algorithms, Datafly, Samarati, and Incognito, on six public, hospital, and registry datasets for different values of k and suppression limits. Measurement Three information loss metrics were used for the comparison: precision, discernability metric, and non-uniform entropy. Each algorithm's performance speed was also evaluated.
Results: The Datafly and Samarati algorithms had higher information loss than OLA and Incognito; OLA was consistently faster than Incognito in finding the globally optimal de-identification solution.
Conclusions: For the de-identification of health datasets, OLA is an improvement on existing k-anonymity algorithms in terms of information loss and performance.
Figures
Similar articles
-
Protecting privacy using k-anonymity.J Am Med Inform Assoc. 2008 Sep-Oct;15(5):627-37. doi: 10.1197/jamia.M2716. Epub 2008 Jun 25. J Am Med Inform Assoc. 2008. PMID: 18579830 Free PMC article.
-
Attribute Utility Motivated k-anonymization of datasets to support the heterogeneous needs of biomedical researchers.AMIA Annu Symp Proc. 2011;2011:1573-82. Epub 2011 Oct 22. AMIA Annu Symp Proc. 2011. PMID: 22195223 Free PMC article.
-
A computational model to protect patient data from location-based re-identification.Artif Intell Med. 2007 Jul;40(3):223-39. doi: 10.1016/j.artmed.2007.04.002. Epub 2007 Jun 1. Artif Intell Med. 2007. PMID: 17544262
-
Patient Privacy in the Era of Big Data.Balkan Med J. 2018 Jan 20;35(1):8-17. doi: 10.4274/balkanmedj.2017.0966. Epub 2017 Sep 13. Balkan Med J. 2018. PMID: 28903886 Free PMC article. Review.
-
Securing electronic health records without impeding the flow of information.Int J Med Inform. 2007 May-Jun;76(5-6):471-9. doi: 10.1016/j.ijmedinf.2006.09.015. Epub 2007 Jan 3. Int J Med Inform. 2007. PMID: 17204451 Review.
Cited by
-
The Costs of Anonymization: Case Study Using Clinical Data.J Med Internet Res. 2024 Apr 24;26:e49445. doi: 10.2196/49445. J Med Internet Res. 2024. PMID: 38657232 Free PMC article.
-
Enabling Health Data Sharing with Fine-Grained Privacy.Proc ACM Int Conf Inf Knowl Manag. 2023 Oct;2023:131-141. doi: 10.1145/3583780.3614864. Epub 2023 Oct 21. Proc ACM Int Conf Inf Knowl Manag. 2023. PMID: 37906633 Free PMC article.
-
Multiple modes of data sharing can facilitate secondary use of sensitive health data for research.BMJ Glob Health. 2023 Oct;8(10):e013092. doi: 10.1136/bmjgh-2023-013092. BMJ Glob Health. 2023. PMID: 37802544 Free PMC article. Review.
-
RespectM revealed metabolic heterogeneity powers deep learning for reshaping the DBTL cycle.iScience. 2023 Jun 8;26(7):107069. doi: 10.1016/j.isci.2023.107069. eCollection 2023 Jul 21. iScience. 2023. PMID: 37426353 Free PMC article.
-
Algorithms to anonymize structured medical and healthcare data: A systematic review.Front Bioinform. 2022 Dec 22;2:984807. doi: 10.3389/fbinf.2022.984807. eCollection 2022. Front Bioinform. 2022. PMID: 36619476 Free PMC article.
References
-
- Ness R. Influence of the HIPAA privacy rule on health research J Am Med Assoc 2007;298(18):2164-2170. - PubMed
-
- Institute of Medicine Health research and the privacy of health information—The HIPAA privacy rule, 2008http://www.iom.edu/CMS/3740/43729.aspx 2007. Accessed August 4, 2009.
-
- Institute of Medicine 2006. Effect of the HIPAA privacy rule on health research: Proceedings of a workshop presented to the National Cancer Policy Forum.
-
- Association of Academic Health Centers HIPAA creating barriers to research and discovery 2008.
-
- Wilson J. Health insurance portability and accountability Act privacy rule causes ongoing concerns among clinicians and researchers Ann Intern Med 2006;145(4):313-316. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
