Practical Implementation and Challenges of Artificial Intelligence-Driven Electronic Health Record Evaluation: Protected Health Information

Adam P Tashman

doi:10.1053/j.ackd.2022.05.003

Practical Implementation and Challenges of Artificial Intelligence-Driven Electronic Health Record Evaluation: Protected Health Information

Adv Chronic Kidney Dis. 2022 Sep;29(5):427-430. doi: 10.1053/j.ackd.2022.05.003.

Author

Adam P Tashman¹

Affiliation

¹ University of Virginia, School of Data Science, Director, Online MSDS Program, Associate Professor of Data Science, Charlottesville, Virginia. Electronic address: apt4c@virginia.edu.

PMID: 36253025
DOI: 10.1053/j.ackd.2022.05.003

Abstract

Detecting protected health information in electronic health record systems is often an early step in health care analytics, and it is a nontrivial problem. Specific challenges include finding clinician names and diseases, which lack a fixed format and are often context-dependent. The general problem of finding entities, termed named-entity recognition, has received a substantial amount of attention in the natural language processing and deep learning communities. This paper begins by outlining recent methods for finding protected health information, and it then introduces a hybrid system which combines regular expressions with a natural language processing framework called FLAIR. FLAIR is open-source, it includes state-of-the-art deep learning models, and it supports straightforward development of new models for language tasks including named-entity recognition. Finally, there is a discussion of how to apply the system to structured text in a database table as well as unstructured text in clinical notes.

Keywords: Deep learning; Named-entity recognition; Protected health information.

Publication types

Review

MeSH terms

Artificial Intelligence*
Electronic Health Records*
Humans
Language
Natural Language Processing