A lighter hybrid feature fusion framework for polyp segmentation

Sci Rep. 2024 Oct 5;14(1):23179. doi: 10.1038/s41598-024-72763-8.

Abstract

Colonoscopy is widely recognized as the most effective method for the detection of colon polyps, which is crucial for early screening of colorectal cancer. Polyp identification and segmentation in colonoscopy images require specialized medical knowledge and are often labor-intensive and expensive. Deep learning provides an intelligent and efficient approach for polyp segmentation. However, the variability in polyp size and the heterogeneity of polyp boundaries and interiors pose challenges for accurate segmentation. Currently, Transformer-based methods have become a mainstream trend for polyp segmentation. However, these methods tend to overlook local details due to the inherent characteristics of Transformer, leading to inferior results. Moreover, the computational burden brought by self-attention mechanisms hinders the practical application of these models. To address these issues, we propose a novel CNN-Transformer hybrid model for polyp segmentation (CTHP). CTHP combines the strengths of CNN, which excels at modeling local information, and Transformer, which excels at modeling global semantics, to enhance segmentation accuracy. We transform the self-attention computation over the entire feature map into the width and height directions, significantly improving computational efficiency. Additionally, we design a new information propagation module and introduce additional positional bias coefficients during the attention computation process, which reduces the dispersal of information introduced by deep and mixed feature fusion in the Transformer. Extensive experimental results demonstrate that our proposed model achieves state-of-the-art performance on multiple benchmark datasets for polyp segmentation. Furthermore, cross-domain generalization experiments show that our model exhibits excellent generalization performance.

Keywords: Deep learning; Generalization; Polyp segmentation; Transformer.

MeSH terms

  • Algorithms
  • Colonic Polyps* / diagnostic imaging
  • Colonic Polyps* / pathology
  • Colonoscopy* / methods
  • Colorectal Neoplasms / diagnostic imaging
  • Colorectal Neoplasms / pathology
  • Deep Learning*
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Neural Networks, Computer