Retrieval, alignment, and clustering of computational models based on semantic annotations

Mol Syst Biol. 2011 Jul 19;7:512. doi: 10.1038/msb.2011.41.

Abstract

The exploding number of computational models produced by Systems Biologists over the last years is an invitation to structure and exploit this new wealth of information. Researchers would like to trace models relevant to specific scientific questions, to explore their biological content, to align and combine them, and to match them with experimental data. To automate these processes, it is essential to consider semantic annotations, which describe their biological meaning. As a prerequisite for a wide range of computational methods, we propose general and flexible similarity measures for Systems Biology models computed from semantic annotations. By using these measures and a large extensible ontology, we implement a platform that can retrieve, cluster, and align Systems Biology models and experimental data sets. At present, its major application is the search for relevant models in the BioModels Database, starting from initial models, data sets, or lists of biological concepts. Beyond similarity searches, the representation of models by semantic feature vectors may pave the way for visualisation, exploration, and statistical analysis of large collections of models and corresponding data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computer Simulation*
  • Database Management Systems
  • Databases, Factual
  • Information Storage and Retrieval / methods*
  • Models, Biological
  • Natural Language Processing
  • Semantics*
  • Sequence Alignment / methods*
  • Systems Biology / methods*