Classification of cervical lesions based on multimodal features fusion

Comput Biol Med. 2024 Jul:177:108589. doi: 10.1016/j.compbiomed.2024.108589. Epub 2024 May 10.

Abstract

Cervical cancer is a severe threat to women's health worldwide with a long cancerous cycle and a clear etiology, making early screening vital for the prevention and treatment. Based on the dataset provided by the Obstetrics and Gynecology Hospital of Fudan University, a four-category classification model for cervical lesions including Normal, low-grade squamous intraepithelial lesion (LSIL), high-grade squamous intraepithelial lesion (HSIL) and cancer (Ca) is developed. Considering the dataset characteristics, to fully utilize the research data and ensure the dataset size, the model inputs include original and acetic colposcopy images, lesion segmentation masks, human papillomavirus (HPV), thinprep cytologic test (TCT) and age, but exclude iodine images that have a significant overlap with lesions under acetic images. Firstly, the change information between original and acetic images is introduced by calculating the acetowhite opacity to mine the correlation between the acetowhite thickness and lesion grades. Secondly, the lesion segmentation masks are utilized to introduce prior knowledge of lesion location and shape into the classification model. Lastly, a cross-modal feature fusion module based on the self-attention mechanism is utilized to fuse image information with clinical text information, revealing the features correlation. Based on the dataset used in this study, the proposed model is comprehensively compared with five excellent models over the past three years, demonstrating that the proposed model has superior classification performance and a better balance between performance and complexity. The modules ablation experiments further prove that each proposed improved module can independently improve the model performance.

Keywords: Acetowhite opacity; Cervical lesion classification; Colposcopy image; Efficientnet-B3; HPV; Lesion segmentation mask; Multimodal fusion; Self-attention mechanism.

MeSH terms

  • Adult
  • Cervix Uteri / diagnostic imaging
  • Cervix Uteri / pathology
  • Colposcopy / methods
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted / methods
  • Uterine Cervical Neoplasms* / diagnostic imaging
  • Uterine Cervical Neoplasms* / pathology