Improving SDG Classification Precision Using Combinatorial Fusion

Sensors (Basel). 2022 Jan 29;22(3):1067. doi: 10.3390/s22031067.

Abstract

Combinatorial fusion algorithm (CFA) is a machine learning and artificial intelligence (ML/AI) framework for combining multiple scoring systems using the rank-score characteristic (RSC) function and cognitive diversity (CD). When measuring the relevance of a publication or document with respect to the 17 Sustainable Development Goals (SDGs) of the United Nations, a classification scheme is used. However, this classification process is a challenging task due to the overlapping goals and contextual differences of those diverse SDGs. In this paper, we use CFA to combine a topic model classifier (Model A) and a semantic link classifier (Model B) to improve the precision of the classification process. We characterize and analyze each of the individual models using the RSC function and CD between Models A and B. We evaluate the classification results from combining the models using a score combination and a rank combination, when compared to the results obtained from human experts. In summary, we demonstrate that the combination of Models A and B can improve classification precision only if these individual models perform well and are diverse.

Keywords: LDA; cognitive diversity; combinatorial fusion algorithm (CFA); rank combination; rank-score characteristic (RSC) function; score combination; semantic web; sustainable development goals (SDGs); topic model.

MeSH terms

  • Artificial Intelligence*
  • Global Health
  • Humans
  • Machine Learning
  • Sustainable Development*
  • United Nations