Functional metagenomics reveals novel β-galactosidases not predictable from gene sequences

PLoS One. 2017 Mar 8;12(3):e0172545. doi: 10.1371/journal.pone.0172545. eCollection 2017.

Abstract

The techniques of metagenomics have allowed researchers to access the genomic potential of uncultivated microbes, but there remain significant barriers to determination of gene function based on DNA sequence alone. Functional metagenomics, in which DNA is cloned and expressed in surrogate hosts, can overcome these barriers, and make important contributions to the discovery of novel enzymes. In this study, a soil metagenomic library carried in an IncP cosmid was used for functional complementation for β-galactosidase activity in both Sinorhizobium meliloti (α-Proteobacteria) and Escherichia coli (γ-Proteobacteria) backgrounds. One β-galactosidase, encoded by six overlapping clones that were selected in both hosts, was identified as a member of glycoside hydrolase family 2. We could not identify ORFs obviously encoding possible β-galactosidases in 19 other sequenced clones that were only able to complement S. meliloti. Based on low sequence identity to other known glycoside hydrolases, yet not β-galactosidases, three of these ORFs were examined further. Biochemical analysis confirmed that all three encoded β-galactosidase activity. Lac36W_ORF11 and Lac161_ORF7 had conserved domains, but lacked similarities to known glycoside hydrolases. Lac161_ORF10 had neither conserved domains nor similarity to known glycoside hydrolases. Bioinformatic and structural modeling implied that Lac161_ORF10 protein represented a novel enzyme family with a five-bladed propeller glycoside hydrolase domain. By discovering founding members of three novel β-galactosidase families, we have reinforced the value of functional metagenomics for isolating novel genes that could not have been predicted from DNA sequence analysis alone.

MeSH terms

  • Bacteria / classification
  • Bacteria / enzymology
  • Bacteria / genetics
  • Cloning, Molecular
  • Computational Biology / methods
  • Gene Expression
  • Metagenome
  • Metagenomics* / methods
  • Models, Molecular
  • Molecular Sequence Annotation
  • Open Reading Frames
  • Phylogeny
  • Protein Conformation
  • Sequence Analysis, DNA
  • Soil Microbiology
  • beta-Galactosidase / chemistry
  • beta-Galactosidase / genetics*
  • beta-Galactosidase / metabolism

Substances

  • beta-Galactosidase

Grants and funding

This work was supported by Natural Sciences and Engineering Research Council of Canada Strategic Projects Grant STPGP 381646 - 2009. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.