Heterogeneous database integration in biomedicine

J Biomed Inform. 2001 Aug;34(4):285-98. doi: 10.1006/jbin.2001.1024.

Abstract

The rapid expansion of biomedical knowledge, reduction in computing costs, and spread of internet access have created an ocean of electronic data. The decentralized nature of our scientific community and healthcare system, however, has resulted in a patchwork of diverse, or heterogeneous, database implementations, making access to and aggregation of data across databases very difficult. The database heterogeneity problem applies equally to clinical data describing individual patients and biological data characterizing our genome. Specifically, databases are highly heterogeneous with respect to the data models they employ, the data schemas they specify, the query languages they support, and the terminologies they recognize. Heterogeneous database systems attempt to unify disparate databases by providing uniform conceptual schemas that resolve representational heterogeneities, and by providing querying capabilities that aggregate and integrate distributed data. Research in this area has applied a variety of database and knowledge-based techniques, including semantic data modeling, ontology definition, query translation, query optimization, and terminology mapping. Existing systems have addressed heterogeneous database integration in the realms of molecular biology, hospital information systems, and application portability.
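As an illustration of the mediator idea described above, the following minimal Python sketch shows two sources with different schemas and vocabularies being wrapped into one uniform conceptual schema, with a terminology map applied before a query is answered across both. Everything named here (`TERM_MAP`, `Diagnosis`, `HOSPITAL_A`, `HOSPITAL_B`, the wrapper functions, and the sample rows) is hypothetical and invented for illustration; it is not taken from any system surveyed in the article.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Iterator, List, Tuple

# Hypothetical terminology map: local labels -> one shared term.
TERM_MAP: Dict[str, str] = {
    "mi": "myocardial infarction",
    "heart attack": "myocardial infarction",
    "dm2": "type 2 diabetes",
}


@dataclass(frozen=True)
class Diagnosis:
    """Uniform conceptual schema that every source is mapped into."""
    patient_id: str
    condition: str
    source: str


def normalize(term: str) -> str:
    """Map a local label to the shared term; unknown labels are lowercased."""
    key = term.strip().lower()
    return TERM_MAP.get(key, key)


# Two made-up sources with different schemas and vocabularies.
HOSPITAL_A: List[dict] = [{"pid": 17, "dx": "MI"}, {"pid": 42, "dx": "DM2"}]
HOSPITAL_B: List[Tuple[str, str]] = [("p-9", "Heart attack"), ("p-3", "asthma")]


def wrap_a(rows: Iterable[dict]) -> Iterator[Diagnosis]:
    """Wrapper for source A: dict rows with 'pid' and 'dx' fields."""
    for row in rows:
        yield Diagnosis(f"A:{row['pid']}", normalize(row["dx"]), "hospital_a")


def wrap_b(rows: Iterable[Tuple[str, str]]) -> Iterator[Diagnosis]:
    """Wrapper for source B: (patient id, free-text label) tuples."""
    for pid, label in rows:
        yield Diagnosis(f"B:{pid}", normalize(label), "hospital_b")


def query_condition(condition: str) -> List[Diagnosis]:
    """Answer one query over the unified view of both sources."""
    target = normalize(condition)
    results: List[Diagnosis] = []
    for wrapper, rows in ((wrap_a, HOSPITAL_A), (wrap_b, HOSPITAL_B)):
        results.extend(d for d in wrapper(rows) if d.condition == target)
    return results


if __name__ == "__main__":
    for record in query_condition("MI"):
        print(record)
```

In this sketch the wrappers resolve schema differences and `normalize` resolves terminology differences, so the query itself is written once against the shared schema. Real systems would also need query translation and optimization, which are omitted here.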

MeSH terms

  • Computational Biology*
  • Computer Simulation
  • Database Management Systems
  • Databases, Factual*
  • Decision Support Techniques
  • Hospital Information Systems
  • Language
  • Molecular Biology
  • Terminology as Topic