Deep learning-based method for automatic resolution of gas chromatography-mass spectrometry data from complex samples
- PMID: 36641940
- DOI: 10.1016/j.chroma.2022.463768
Deep learning-based method for automatic resolution of gas chromatography-mass spectrometry data from complex samples
Abstract
Modern gas chromatography-mass spectrometry (GC-MS) is the workhorse for the high-throughput profiling of volatile compounds in complex samples. It can produce a considerable amount of two-dimensional data, and automatic methods are required to distill chemical information from raw GC-MS data efficiently. In this study, we proposed an Automatic Resolution method (AutoRes) based on pseudo-Siamese convolutional neural networks (pSCNN) to extract the meaningful features swamped by the noises, baseline drifts, retention time shifts, and overlapped peaks. Two pSCNN models were trained with 400,000 augmented spectral pairs, respectively. They can predict the selective region (pSCNN1) and elution region (pSCNN2) of compounds in an untargeted manner. The accuracies of the pSCNN1 model and the pSCNN2 model on their test sets are 99.9% and 92.6%, respectively. Then, the chromatographic profile of each component was automatically resolved by full rank resolution (FRR) based on the predicted regions by these models. The performance of AutoRes was evaluated on the simulated and plant essential oil datasets. Compared to AMDIS and MZmine, AutoRes resolves more reasonable mass spectra, chromatograms, and peak areas to identify and quantify compounds. The average match scores of AutoRes (925 and 936) outperformed AMDIS (909 and 925) and MZmine (888 and 916) when resolving mass spectra from overlapped peaks on the Set Ⅰ and Set Ⅱ of plant essential oil dataset and matching them against the NIST17 library. It extracted peak areas and mass spectra automatically from 10 GC-MS files of plant essential oils, and the entire process was completed in 8 min without any prior information or manual intervention. It is implemented in Python and is available as an open-source package at https://github.com/dyjfan/AutoRes.
Keywords: Automatic resolution; Deep neural network; GC–MS; Multivariate curve resolution.
Copyright © 2022 Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Similar articles
-
Fully automatic resolution of untargeted GC-MS data with deep learning assistance.Talanta. 2022 Jul 1;244:123415. doi: 10.1016/j.talanta.2022.123415. Epub 2022 Mar 26. Talanta. 2022. PMID: 35358897
-
Deep-Learning-Assisted multivariate curve resolution.J Chromatogr A. 2021 Jan 4;1635:461713. doi: 10.1016/j.chroma.2020.461713. Epub 2020 Nov 13. J Chromatogr A. 2021. PMID: 33229011
-
Peak alignment of gas chromatography-mass spectrometry data with deep learning.J Chromatogr A. 2019 Oct 25;1604:460476. doi: 10.1016/j.chroma.2019.460476. Epub 2019 Aug 22. J Chromatogr A. 2019. PMID: 31488294
-
Analysis of the essential oils of Coriandrum sativum Using GC-MS coupled with chemometric resolution methods.Chem Pharm Bull (Tokyo). 2011;59(1):28-34. doi: 10.1248/cpb.59.28. Chem Pharm Bull (Tokyo). 2011. PMID: 21212543
-
compMS2Miner: An Automatable Metabolite Identification, Visualization, and Data-Sharing R Package for High-Resolution LC-MS Data Sets.Anal Chem. 2017 Apr 4;89(7):3919-3928. doi: 10.1021/acs.analchem.6b02394. Epub 2017 Mar 27. Anal Chem. 2017. PMID: 28225587 Free PMC article.
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Miscellaneous
