Minimalistic Approach to Coreference Resolution in Lithuanian Medical Records

Comput Math Methods Med. 2019 Mar 20;2019:9079840. doi: 10.1155/2019/9079840. eCollection 2019.


Coreference resolution is a challenging part of natural language processing (NLP) with applications in machine translation, semantic search and other information retrieval, and decision support systems. Coreference resolution requires linguistic preprocessing and rich language resources for automatically identifying and resolving such expressions. Many rarer and under-resourced languages (such as Lithuanian) lack the required language resources and tools. We present a method for coreference resolution in Lithuanian language and its application for processing e-health records from a hospital reception. Our novelty is the ability to process coreferences with minimal linguistic resources, which is important in linguistic applications for rare and endangered languages. The experimental results show that coreference resolution is applicable to the development of NLP-powered online healthcare services in Lithuania.

MeSH terms

  • Algorithms
  • Computational Biology
  • Data Mining / methods
  • Electronic Health Records / statistics & numerical data*
  • Humans
  • Language
  • Linguistics / statistics & numerical data
  • Lithuania
  • Machine Learning / statistics & numerical data
  • Mathematical Computing
  • Natural Language Processing*
  • Pattern Recognition, Automated / statistics & numerical data
  • Semantics