Detecting earlier indicators of homelessness in the free text of medical records

Stud Health Technol Inform. 2014:202:153-6.

Abstract

Early warning indicators to identify US Veterans at risk of homelessness are currently only inferred from administrative data. References to indicators of risk or instances of homelessness in the free text of medical notes written by Department of Veterans Affairs (VA) providers may precede formal identification of Veterans as being homeless. This represents a potentially untapped resource for early identification. Using natural language processing (NLP), we investigated the idea that concepts related to homelessness written in the free text of the medical record precede the identification of homelessness by administrative data. We found that homeless Veterans were much higher utilizers of VA resources producing approximately 12 times as many documents as non-homeless Veterans. NLP detected mentions of either direct or indirect evidence of homelessness in a significant portion of Veterans earlier than structured data.

MeSH terms

  • Data Mining / methods*
  • Electronic Health Records / classification*
  • Electronic Health Records / organization & administration
  • Humans
  • Ill-Housed Persons / classification*
  • Ill-Housed Persons / statistics & numerical data
  • Machine Learning
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods*
  • United States
  • Vocabulary, Controlled*