Image classification with symbolic hints using limited resources

PLoS One. 2024 May 21;19(5):e0301360. doi: 10.1371/journal.pone.0301360. eCollection 2024.

Abstract

Typical machine learning classification benchmark problems often ignore the full input data structures present in real-world classification problems. Here we aim to represent additional information as "hints" for classification. We show that under a specific realistic conditional independence assumption, the hint information can be included by late fusion. In two experiments involving image classification with hints taking the form of text metadata, we demonstrate the feasibility and performance of the fusion scheme. We fuse the output of pre-trained image classifiers with the output of pre-trained text models. We show that calibration of the pre-trained models is crucial for the performance of the fused model. We compare the performance of the fusion scheme with a mid-level fusion scheme based on support vector machines and find that these two methods tend to perform quite similarly, albeit the late fusion scheme has only negligible computational costs.

MeSH terms

  • Algorithms
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Machine Learning
  • Support Vector Machine*

Grants and funding

LT and LKH: DIREC (direc.dk): Bridge project Deep Learning and Automation of Imaging-Based Quality of Seeds and Grains -- Innovation Fund Denmark (innovationsfonden.dk) grant number 9142-00001B LKH:Danish Pioneer Centre for AI (aicentre.dk), DNRF grant number P1 The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.