Scoring amino acid mutation to predict pandemic risk of avian influenza virus

BMC Bioinformatics. 2019 Jun 10;20(Suppl 8):288. doi: 10.1186/s12859-019-2770-0.

Abstract

Background: Avian influenza virus can directly cross species barriers and infect humans with high fatality. As antigen novelty for human host, the public health is being challenged seriously. The pandemic risk of avian influenza viruses should be analyzed and a prediction model should be constructed for virology applications.

Results: The 178 signature positions in 11 viral proteins were firstly screened as features by the scores of five amino acid factors and their random forest rankings. The Supporting Vector Machine algorithm achieved well performance. The most important amino acid factor (Factor 5) and the minimal range of signature positions (63 amino acid residues) were also explored. Moreover, human-origin avian influenza viruses with three or four genome segments from human virus had pandemic risk with high probability.

Conclusion: Using machine learning methods, the present paper scores the amino acid mutations and predicts pandemic risk with well performance. Although long evolution distances between avian and human viruses suggest that avian influenza virus in nature still need time to fix among human host, it should be notable that there are high pandemic risks for H7N9 and H9N2 avian viruses.

Keywords: Amino acid mutation; Avian influenza virus; Machine learning; Pandemic risk.

MeSH terms

  • Algorithms
  • Amino Acids / genetics*
  • Animals
  • Birds / virology*
  • Computer Simulation
  • Databases as Topic
  • Genome, Viral
  • Influenza in Birds / epidemiology*
  • Influenza in Birds / virology*
  • Machine Learning
  • Mutation / genetics*
  • Pandemics*
  • Reassortant Viruses / genetics
  • Risk Factors

Substances

  • Amino Acids