Representing a Heterogeneous Pharmaceutical Knowledge-Graph with Textual Information

Front Res Metr Anal. 2021 Jul 1:6:670206. doi: 10.3389/frma.2021.670206. eCollection 2021.

Abstract

We deal with a heterogeneous pharmaceutical knowledge-graph containing textual information built from several databases. The knowledge graph is a heterogeneous graph that includes a wide variety of concepts and attributes, some of which are provided in the form of textual pieces of information which have not been targeted in the conventional graph completion tasks. To investigate the utility of textual information for knowledge graph completion, we generate embeddings from textual descriptions given to heterogeneous items, such as drugs and proteins, while learning knowledge graph embeddings. We evaluate the obtained graph embeddings on the link prediction task for knowledge graph completion, which can be used for drug discovery and repurposing. We also compare the results with existing methods and discuss the utility of the textual information.

Keywords: drug database; heterogeneous networks; knowledge graph completion; knowledge graph embedding; textual information.