Clustering and classification of virus sequence through music communication protocol and wavelet transform

Genomics. 2021 Jan;113(1 Pt 2):778-784. doi: 10.1016/j.ygeno.2020.10.009. Epub 2020 Oct 16.

Abstract

The coronavirus pandemic became a major risk in global public health. The outbreak is caused by SARS-CoV-2, a member of the coronavirus family. Though the images of the virus are familiar to us, in the present study, an attempt is made to hear the coronavirus by translating its protein spike into audio sequences. The musical features such as pitch, timbre, volume and duration are mapped based on the coronavirus protein sequence. Three different viruses Influenza, Ebola and Coronavirus were studied and compared through their auditory virus sequences by implementing Haar wavelet transform. The sonification of the coronavirus benefits in understanding the protein structures by enhancing the hidden features. Further, it makes a clear difference in the representation of coronavirus compared with other viruses, which will help in various research works related to virus sequence. This evolves as a simplified and novel way of representing the conventional computational methods.

Keywords: Coronavirus; Haar wavelet; MIDI; Protein music; SVM.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • COVID-19 / virology*
  • Cluster Analysis
  • Coronavirus / classification
  • Coronavirus / genetics
  • Ebolavirus / classification
  • Ebolavirus / genetics
  • Genome, Viral*
  • Humans
  • Middle East Respiratory Syndrome Coronavirus / classification
  • Middle East Respiratory Syndrome Coronavirus / genetics
  • Music*
  • Orthomyxoviridae / classification
  • Orthomyxoviridae / genetics
  • Pandemics
  • RNA, Viral / genetics
  • SARS-CoV-2 / classification*
  • SARS-CoV-2 / genetics*
  • Severe acute respiratory syndrome-related coronavirus / classification
  • Severe acute respiratory syndrome-related coronavirus / genetics
  • Viral Proteins / genetics
  • Wavelet Analysis*

Substances

  • RNA, Viral
  • Viral Proteins