Exploring UMAP in hybrid models of entropy-based and representativeness sampling for active learning in biomedical segmentation

Comput Biol Med. 2024 Jun:176:108605. doi: 10.1016/j.compbiomed.2024.108605. Epub 2024 May 16.

Abstract

In this work, we study various hybrid models of entropy-based and representativeness sampling techniques in the context of active learning in medical segmentation, in particular examining the role of UMAP (Uniform Manifold Approximation and Projection) as a technique for capturing representativeness. Although UMAP has been shown viable as a general purpose dimension reduction method in diverse areas, its role in deep learning-based medical segmentation has yet been extensively explored. Using the cardiac and prostate datasets in the Medical Segmentation Decathlon for validation, we found that a novel hybrid combination of Entropy-UMAP sampling technique achieved a statistically significant Dice score advantage over the random baseline (3.2% for cardiac, 4.5% for prostate), and attained the highest Dice coefficient among the spectrum of 10 distinct active learning methodologies we examined. This provides preliminary evidence that there is an interesting synergy between entropy-based and UMAP methods when the former precedes the latter in a hybrid model of active learning.

Keywords: Active learning; Biomedical segmentation; UMAP; Uncertainty and representativeness sampling.

MeSH terms

  • Deep Learning
  • Entropy*
  • Heart
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Male
  • Prostate / diagnostic imaging
  • Supervised Machine Learning