Serum Raman spectroscopy combined with multiple classification models for rapid diagnosis of breast cancer

Photodiagnosis Photodyn Ther. 2022 Dec:40:103115. doi: 10.1016/j.pdpdt.2022.103115. Epub 2022 Sep 10.

Abstract

Breast cancer is the malignant tumor with the highest incidence in women. Current diagnostic methods are time-consuming, costly, and dependent on physician experience. In this study, we combined serum Raman spectroscopy with multiple classification algorithms to implement an auxiliary diagnostic method for breast cancer, to aid early diagnosis. We analyzed the serum Raman spectra of 171 patients with invasive ductal carcinoma (IDC) and 100 healthy volunteers. The analysis showed differences in the concentrations of nucleic acids, carotenoids, amino acids, and lipids in their blood. These differences provide a theoretical basis for this experiment. First, we used adaptive iteratively reweighted penalized least squares (airPLS) for baseline correction and Savitzky-Golay (SG) filtering for smoothing and denoising, to reduce the effect of noise on the experiment. Then, principal component analysis (PCA) was used to extract features. Finally, we built four classification models: support vector machine (SVM), decision tree (DT), linear discriminant analysis (LDA), and Neural Network Language Model (NNLM). The LDA, SVM, and NNLM models achieved 100% accuracy. As a supplement, we repeated the classification experiment on the raw data. Comparing the results of the two sets of experiments, we concluded that the NNLM was the best model. The results show the reliability of combining serum Raman spectroscopy with classification models under large-sample conditions.
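
The preprocessing and classification steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the spectra are synthetic, a per-spectrum polynomial fit is used as a simple stand-in for airPLS baseline correction, the neural network model is omitted, and every function name and parameter value (window length, polynomial orders, number of principal components, fold count) is an assumption chosen only for illustration.

```python
import numpy as np
from scipy.signal import savgol_filter
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def make_synthetic_spectra(n_per_class=60, n_points=400, seed=0):
    """Hypothetical data: one Gaussian peak whose height differs by class,
    plus a random linear baseline and Gaussian noise."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0.0, 1.0, n_points)
    spectra, labels = [], []
    for label, peak_height in ((0, 1.0), (1, 1.3)):
        for _ in range(n_per_class):
            peak = peak_height * np.exp(-((x - 0.5) ** 2) / 0.002)
            baseline = rng.uniform(0.5, 1.5) * x + rng.uniform(0.0, 0.5)
            noise = rng.normal(0.0, 0.05, n_points)
            spectra.append(peak + baseline + noise)
            labels.append(label)
    return x, np.array(spectra), np.array(labels)


def preprocess(x, spectra, baseline_order=2, window_length=11, sg_order=3):
    """Baseline correction followed by Savitzky-Golay smoothing.

    The polynomial baseline is an assumed stand-in for airPLS, which is
    not part of SciPy or scikit-learn.
    """
    corrected = np.empty_like(spectra)
    for i, spectrum in enumerate(spectra):
        coeffs = np.polyfit(x, spectrum, baseline_order)
        corrected[i] = spectrum - np.polyval(coeffs, x)
    return savgol_filter(corrected, window_length, sg_order, axis=1)


def evaluate_models(n_components=10, cv=5):
    """Return the mean cross-validated accuracy of each classifier."""
    x, spectra, labels = make_synthetic_spectra()
    features = preprocess(x, spectra)
    classifiers = {
        "LDA": LinearDiscriminantAnalysis(),
        "SVM": SVC(),
        "DT": DecisionTreeClassifier(random_state=0),
    }
    scores = {}
    for name, classifier in classifiers.items():
        # PCA sits inside the pipeline so it is refit on each training fold.
        model = make_pipeline(PCA(n_components=n_components), classifier)
        scores[name] = cross_val_score(model, features, labels, cv=cv).mean()
    return scores
```

Calling evaluate_models() returns a dictionary mapping each model name to its mean accuracy on the synthetic data. Those numbers say nothing about real serum spectra; the sketch only shows how the stages fit together. Fitting PCA inside the cross-validation pipeline is a design choice of this sketch to avoid leaking test-fold information into the extracted features.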

Keywords: Breast cancer; Classification; Raman spectrum; Serum.

MeSH terms

  • Algorithms
  • Breast Neoplasms* / diagnosis
  • Discriminant Analysis
  • Female
  • Humans
  • Photochemotherapy* / methods
  • Principal Component Analysis
  • Reproducibility of Results
  • Spectrum Analysis, Raman / methods
  • Support Vector Machine