Combining comparative genomic analysis with machine learning reveals some promising diagnostic markers to identify five common pathogenic non-tuberculous mycobacteria
- PMID: 34019733
- PMCID: PMC8313281
- DOI: 10.1111/1751-7915.13815
Combining comparative genomic analysis with machine learning reveals some promising diagnostic markers to identify five common pathogenic non-tuberculous mycobacteria
Abstract
Non-tuberculous mycobacteria (NTM) can cause various respiratory diseases and even death in severe cases, and its incidence has increased rapidly worldwide. To date, it's difficult to use routine diagnostic methods and strain identification to precisely diagnose various types of NTM infections. We combined systematic comparative genomics with machine learning to select new diagnostic markers for precisely identifying five common pathogenic NTMs (Mycobacterium kansasii, Mycobacterium avium, Mycobacterium intracellular, Mycobacterium chelonae, Mycobacterium abscessus). A panel including six genes and two SNPs (nikA, benM, codA, pfkA2, mpr, yjcH, rrl C2638T, rrl A1173G) was selected to simultaneously identify the five NTMs with high accuracy (> 90%). Notably, the panel only containing the six genes also showed a good classification effect (accuracy > 90%). Additionally, the two panels could precisely differentiate the five NTMs from M. tuberculosis (accuracy > 99%). We also revealed some new marker genes/SNPs/combinations to accurately discriminate any one of the five NTMs separately, which provided the possibility to diagnose one certain NTM infection precisely. Our research not only reveals novel promising diagnostic markers to promote the development of precision diagnosis in NTM infectious, but also provides an insight into precisely identifying various genetically close pathogens through comparative genomics and machine learning.
© 2021 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Conflict of interest statement
The authors declare that they are inventors on various patent applications covering several of the methods and results reported here.
Figures
Similar articles
-
Prevalence of nontuberculous mycobacteria in a tertiary hospital in Beijing, China, January 2013 to December 2018.BMC Microbiol. 2020 Jun 12;20(1):158. doi: 10.1186/s12866-020-01840-5. BMC Microbiol. 2020. PMID: 32532202 Free PMC article.
-
Evaluation of the Speed-Oligo Mycobacteria assay for the identification of nontuberculous mycobacteria.J Med Microbiol. 2015 Mar;64(Pt 3):283-287. doi: 10.1099/jmm.0.000025. Epub 2015 Jan 16. J Med Microbiol. 2015. PMID: 25596120
-
Evaluation of the new GenoType NTM-DR kit for the molecular detection of antimicrobial resistance in non-tuberculous mycobacteria.J Antimicrob Chemother. 2017 Jun 1;72(6):1669-1677. doi: 10.1093/jac/dkx021. J Antimicrob Chemother. 2017. PMID: 28333340
-
Human infections due to nontuberculous mycobacteria: the infectious diseases and clinical microbiology specialists' point of view.Future Microbiol. 2015;10(9):1467-83. doi: 10.2217/fmb.15.64. Epub 2015 Sep 7. Future Microbiol. 2015. PMID: 26344005 Review.
-
Nontuberculous Mycobacteria-Overview.Microbiol Spectr. 2017 Jan;5(1). doi: 10.1128/microbiolspec.TNMI7-0024-2016. Microbiol Spectr. 2017. PMID: 28128073 Review.
Cited by
-
Comparative genomic analysis reveals differential genomic characteristics and featured genes between rapid- and slow-growing non-tuberculous mycobacteria.Front Microbiol. 2023 Sep 21;14:1243371. doi: 10.3389/fmicb.2023.1243371. eCollection 2023. Front Microbiol. 2023. PMID: 37808319 Free PMC article.
-
Harmonization of supervised machine learning practices for efficient source attribution of Listeria monocytogenes based on genomic data.BMC Genomics. 2023 Sep 22;24(1):560. doi: 10.1186/s12864-023-09667-w. BMC Genomics. 2023. PMID: 37736708 Free PMC article.
-
Phylogenomics of nontuberculous mycobacteria respiratory infections in people with cystic fibrosis.Paediatr Respir Rev. 2023 Jun;46:63-70. doi: 10.1016/j.prrv.2023.02.001. Epub 2023 Feb 10. Paediatr Respir Rev. 2023. PMID: 36828670 Review.
-
Genomic analysis of Mycobacterium brumae sustains its nonpathogenic and immunogenic phenotype.Front Microbiol. 2023 Jan 5;13:982679. doi: 10.3389/fmicb.2022.982679. eCollection 2022. Front Microbiol. 2023. PMID: 36687580 Free PMC article.
-
Comparative genome analysis reveals high-level drug resistance markers in a clinical isolate of Mycobacterium fortuitum subsp. fortuitum MF GZ001.Front Cell Infect Microbiol. 2023 Jan 4;12:1056007. doi: 10.3389/fcimb.2022.1056007. eCollection 2022. Front Cell Infect Microbiol. 2023. PMID: 36683685 Free PMC article.
References
-
- Aitken, M.L. , Limaye, A. , Pottinger, P. , Whimbey, E. , Goss, C.H. , Tonelli, M.R. , et al. (2012) Respiratory outbreak of Mycobacterium abscessus subspecies massiliense in a lung transplant and cystic fibrosis center. Am J Respir Crit Care Med 185: 231–232. - PubMed
-
- Blanchet, L. , Vitale, R. , van Vorstenbosch, R. , Stavropoulos, G. , Pender, J. , Jonkers, D. , et al. (2020) Constructing bi‐plots for random forest. Tutorial. Anal Chim Acta 1131: 146–155. - PubMed
-
- Bramer, M. (2013) Ensemble classification. Principles of Data Mining. Undergraduate Topics in Computer Science. London: Springer, pp. 209–220.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
