Identifying and categorizing spurious weight data in electronic medical records

Sunny Chen; William A Banks; Meera Sheffrin; William Bryson; Marissa Black; Stephen M Thielke

doi:10.1093/ajcn/nqx056

Identifying and categorizing spurious weight data in electronic medical records

Am J Clin Nutr. 2018 Mar 1;107(3):420-426. doi: 10.1093/ajcn/nqx056.

Authors

Sunny Chen¹, William A Banks^{2

3}, Meera Sheffrin⁴, William Bryson⁵, Marissa Black^{1

3}, Stephen M Thielke^{1

5}

Affiliations

¹ Geriatric Research, Education, and Clinical Center and Research and Development, Puget Sound VA Medical Center, Seattle, WA.
² Research and Development, Puget Sound VA Medical Center, Seattle, WA.
³ Division of Gerontology and Geriatric Medicine, Department of Medicine and Department of Psychiatry and Behavioral Sciences, University of Washington School of Medicine, WA.
⁴ Geriatric Medicine, Division of General Medical Disciplines, Stanford University School of Medicine, Stanford, CA.
⁵ Department of Psychiatry and Behavioral Sciences, University of Washington School of Medicine, WA.

PMID: 29566188
DOI: 10.1093/ajcn/nqx056

Abstract

Background: Spurious weights compromise the validity of summary measures, such as averages and trends. Even rare errors in weight records can undermine the utility of electronic medical record (EMR) data.

Objective: We sought to estimate the prevalence of spurious weight values in a large EMR, to ascertain the likely causes, and to develop and test straightforward algorithms for identifying spurious weight data.

Design: Using EMR data from 10,000 randomly selected patients aged ≥65 y in the VA system, we examined the percentage of weight change across various time intervals, from 1 to 3000 d. We examined descriptive results and developed 3 algorithms to categorize degree of weight change over time. On the basis of distributions, we identified cases that were most likely spurious. We manually reviewed these and categorized the type of error.

Results: The data followed the expected distributions. The algorithms reliably identified spurious weight. Approximately 0.8% of all weights in the record appeared to be spurious and ∼1 in 5 patient charts included ≥1 spurious weight value. The most common type of error involved the misentry of a single digit (e.g., 148 for 178).

Conclusions: Spurious weights are common in EMRs. Straightforward algorithms can identify and remove them, and thus enhance the reliability of EMR data.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Aged
Algorithms
Body Weight*
Body Weights and Measures / standards*
Databases, Factual
Electronic Health Records / standards*
Humans
Models, Theoretical
Reproducibility of Results
Veterans