Dual encoder-decoder-based deep polyp segmentation network for colonoscopy images

Sci Rep. 2023 Jan 21;13(1):1183. doi: 10.1038/s41598-023-28530-2.

Abstract

Detection of colorectal polyps through colonoscopy is an essential practice in prevention of colorectal cancers. However, the method itself is labor intensive and is subject to human error. With the advent of deep learning-based methodologies, and specifically convolutional neural networks, an opportunity to improve upon the prognosis of potential patients suffering with colorectal cancer has appeared with automated detection and segmentation of polyps. Polyp segmentation is subject to a number of problems such as model overfitting and generalization, poor definition of boundary pixels, as well as the model's ability to capture the practical range in textures, sizes, and colors. In an effort to address these challenges, we propose a dual encoder-decoder solution named Polyp Segmentation Network (PSNet). Both the dual encoder and decoder were developed by the comprehensive combination of a variety of deep learning modules, including the PS encoder, transformer encoder, PS decoder, enhanced dilated transformer decoder, partial decoder, and merge module. PSNet outperforms state-of-the-art results through an extensive comparative study against 5 existing polyp datasets with respect to both mDice and mIoU at 0.863 and 0.797, respectively. With our new modified polyp dataset we obtain an mDice and mIoU of 0.941 and 0.897 respectively.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Colonoscopy
  • Electric Power Supplies
  • Female
  • Generalization, Psychological
  • Humans
  • Image Processing, Computer-Assisted
  • Labor, Obstetric*
  • Neural Networks, Computer
  • Polyps*
  • Pregnancy