An evaluation of natural language processing methodologies

C Friedman; G Hripcsak; I Shablinsky

An evaluation of natural language processing methodologies

Proc AMIA Symp. 1998:855-9.

Authors

C Friedman¹, G Hripcsak, I Shablinsky

Affiliation

¹ Computer Science Department, Queens College CUNY, USA.

PMID: 9929340
PMCID: PMC2232366

Abstract

Medical language processing (MLP) systems that codify information in textual patient reports have been developed to help solve the data entry problem. Some systems have been evaluated in order to assess performance, but there has been little evaluation of the underlying technology. Various methodologies are used by the different MLP systems but a comparison of the methods has not been performed although evaluations of MLP methodologies would be extremely beneficial to the field. This paper describes a study that evaluates different techniques. To accomplish this task an existing MLP system MedLEE was modified and results from a previous study were used. Based on confidence intervals and differences in sensitivity and specificity between each technique and all the others combined, the results showed that the two methods based on obtaining the largest well-formed segment within a sentence had significantly higher sensitivity than the others by 5% and 6%. The method based on recognizing a complete sentence had a significantly worse sensitivity than the others by 7% and a better specificity by .2%. None of the methods had significantly worse specificity.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, P.H.S.

MeSH terms

Evaluation Studies as Topic
Medical Records / classification*
Methods
Natural Language Processing*
Sensitivity and Specificity

Abstract

Publication types

MeSH terms

Grants and funding