Knowledge-Based Biomedical Data Science

Annu Rev Biomed Data Sci. 2020 Jul:3:23-41. doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7.

Abstract

Knowledge-based biomedical data science involves the design and implementation of computer systems that act as if they knew about biomedicine. Such systems depend on formally represented knowledge, often in the form of knowledge graphs. Here we survey recent progress in systems that use formally represented knowledge to address data science problems in both clinical and biological domains, as well as progress on approaches for creating knowledge graphs. Major themes include the relationships between knowledge graphs and machine learning, the use of natural language processing to construct knowledge graphs, and the expansion of novel knowledge-based approaches to clinical and biological domains.

Keywords: Semantic Web; knowledge discovery; knowledge graph; knowledge graph embeddings; natural language processing; ontology.