An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomains

Scientometrics. 2023;128(3):1567-1582. doi: 10.1007/s11192-023-04642-4. Epub 2023 Jan 31.

Abstract

The study of topic evolution aims to analyze the behavior of research fields using features such as the relationships between articles. In recent years, many published papers have addressed more than one field of study, which has led to a significant increase in the number of inter-field and interdisciplinary articles. Consequently, the similarity or dissimilarity and the convergence or divergence of research fields can be analyzed through topic analysis of the published papers. Our research aims to create a methodology for studying the evolution of research fields. In this paper, we propose an embedding approach that models each research topic as a multidimensional vector. Using this model, we measure the distances between topics over the years and investigate how topics evolve over time. The proposed similarity metric showed several advantages over alternatives such as Jaccard similarity, yielding better stability and accuracy. As a case study, we applied the proposed method to subdomains of computer science, and the results were comprehensible and coherent.
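
To make the idea of tracking topic distances over the years concrete, the following is a minimal sketch and not the paper's implementation. The abstract does not specify how the topic vectors are built or which distance is used, so cosine similarity is an assumption here, the two-dimensional vectors are invented toy values, and the names `topic_vectors` and `similarity_by_year` are hypothetical. A set-based Jaccard similarity is included only because the abstract names it as an alternative.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def jaccard_similarity(a, b):
    """Jaccard similarity between two non-empty collections, treated as sets."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)


# Hypothetical toy data: one vector per topic per year (assumed, not from the paper).
topic_vectors = {
    2010: {"machine_learning": [1.0, 0.0], "databases": [0.0, 1.0]},
    2020: {"machine_learning": [1.0, 1.0], "databases": [0.0, 1.0]},
}


def similarity_by_year(vectors, topic_a, topic_b):
    """Return {year: cosine similarity of the two topics' vectors in that year}."""
    return {
        year: cosine_similarity(by_topic[topic_a], by_topic[topic_b])
        for year, by_topic in sorted(vectors.items())
    }


trend = similarity_by_year(topic_vectors, "machine_learning", "databases")
```

In this toy data the similarity rises from 2010 to 2020, which is the kind of pattern one would read as two topics converging. Jaccard similarity, by contrast, only compares set membership (for example, shared keywords) and ignores how strongly each feature is weighted.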

Keywords: Data mining; Informetrics; Scientometrics; Similarity metrics; Topic embedding; Topic evolution.