Towards FAIRer Biological Knowledge Networks Using a Hybrid Linked Data and Graph Database Approach

J Integr Bioinform. 2018 Aug 7;15(3):20180023. doi: 10.1515/jib-2018-0023.

Abstract

The speed and accuracy of new scientific discoveries - be it by humans or artificial intelligence - depends on the quality of the underlying data and on the technology to connect, search and share the data efficiently. In recent years, we have seen the rise of graph databases and semi-formal data models such as knowledge graphs to facilitate software approaches to scientific discovery. These approaches extend work based on formalised models, such as the Semantic Web. In this paper, we present our developments to connect, search and share data about genome-scale knowledge networks (GSKN). We have developed a simple application ontology based on OWL/RDF with mappings to standard schemas. We are employing the ontology to power data access services like resolvable URIs, SPARQL endpoints, JSON-LD web APIs and Neo4j-based knowledge graphs. We demonstrate how the proposed ontology and graph databases considerably improve search and access to interoperable and reusable biological knowledge (i.e. the FAIRness data principles).

Keywords: FAIR data principles; bio-ontologies; biological knowledge networks; data integration; graph databases; linked data; semantic web.

MeSH terms

  • Computational Biology / methods*
  • Computer Graphics*
  • Databases, Factual
  • Gene Regulatory Networks*
  • Genome, Human*
  • Genome-Wide Association Study
  • Humans
  • Knowledge
  • Software*