Analysis and comparison of machine learning methods for blood identification using single-cell laser tweezer Raman spectroscopy

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Sep 5:277:121274. doi: 10.1016/j.saa.2022.121274. Epub 2022 Apr 18.

Abstract

Raman spectroscopy, a "fingerprint" spectrum of substances, can be used to characterize various biological and chemical samples. To allow for blood classification using single-cell Raman spectroscopy, several machine learning algorithms were implemented and compared. A single-cell laser optical tweezer Raman spectroscopy system was established to obtain the Raman spectra of red blood cells. The Boruta algorithm extracted the spectral feature frequency shift, reduced the spectral dimension, and determined the essential features that affect classification. Next, seven machine learning classification models are analyzed and compared based on the classification accuracy, precision, and recall indicators. The results show that support vector machines and artificial neural networks are the two most appropriate machine learning algorithms for single-cell Raman spectrum blood classification, and this finding provides essential guidance for future research studies.

Keywords: Artificial neural network; Blood identification; Machine learning methods; Raman spectroscopy; Single-cell; Support vector machine.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Machine Learning
  • Neural Networks, Computer
  • Optical Tweezers*
  • Spectrum Analysis, Raman* / methods
  • Support Vector Machine