Profile Comparer Extended: phylogeny of lytic polysaccharide monooxygenase families using profile hidden Markov model alignments

F1000Res. 2019 Oct 31:8:1834. doi: 10.12688/f1000research.21104.1. eCollection 2019.

Abstract

Insight into the inter- and intra-family relationship of protein families is important, since it can aid understanding of substrate specificity evolution and assign putative functions to proteins with unknown function. To study both these inter- and intra-family relationships, the ability to build phylogenetic trees using the most sensitive sequence similarity search methods (e.g. profile hidden Markov model (pHMM)-pHMM alignments) is required. However, existing solutions require a very long calculation time to obtain the phylogenetic tree. Therefore, a faster protocol is required to make this approach efficient for research. To contribute to this goal, we extended the original Profile Comparer program (PRC) for the construction of large pHMM phylogenetic trees at speeds several orders of magnitude faster compared to pHMM-tree. As an example, PRC Extended (PRCx) was used to study the phylogeny of over 10,000 sequences of lytic polysaccharide monooxygenase (LPMO) from over seven families. Using the newly developed program we were able to reveal previously unknown homologs of LPMOs, namely the PFAM Egh16-like family. Moreover, we show that the substrate specificities have evolved independently several times within the LPMO superfamily. Furthermore, the LPMO phylogenetic tree, does not seem to follow taxonomy-based classification.

Keywords: HMM; Hidden Markov Model; LPMO; Lytic Polysaccharide Mono-oxygenase; phylogeny.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Mixed Function Oxygenases*
  • Phylogeny*
  • Polysaccharides*
  • Proteins

Substances

  • Polysaccharides
  • Proteins
  • Mixed Function Oxygenases

Grants and funding

The Netherlands Organisation for Scientific Research (NWO) supported this research in the framework of an ERA-IB project FilaZyme (053.80.721/EIB.14.021).