Determining prominent subdomains in medicine

AMIA Annu Symp Proc. 2005;2005:46-50.


We discuss an automated method for identifying prominent subdomains in medicine. The motivation is to enhance the results of natural language processing by focusing on sublanguages associated with medical specialties concerned with prevalent disorders. At the core of our approach is a statistical system for topical categorization of medical text. A method based on epidemiological evidence is compared to another that considers frequency of occurrence of Medline citations. We suggest the isolation of UMLS terminology peculiar to individual medical specialties as a way of enhancing natural language processing systems in the biomedical domain.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Abstracting and Indexing / methods*
  • Disease / classification
  • Electronic Data Processing*
  • Information Storage and Retrieval
  • Medical Subject Headings
  • Medicine / classification*
  • Natural Language Processing*
  • Specialization*
  • Terminology as Topic*
  • Unified Medical Language System