Peroxidase gene discovery from the horseradish transcriptome

BMC Genomics. 2014 Mar 24:15:227. doi: 10.1186/1471-2164-15-227.


Background: Horseradish peroxidases (HRPs) from Armoracia rusticana have long been used as reporters in diagnostic assays and histochemical staining. Despite their increasing importance in the life sciences and their proposed uses in medical applications, chemical synthesis and other industrial applications, the HRP isoenzymes, their substrate specificities and their enzymatic properties remain poorly characterized. Because sequence information on the natural isoenzymes is lacking and HRP expression levels in heterologous hosts are low, commercially available HRP is still extracted as a mixture of isoenzymes from the roots of A. rusticana.

Results: In this study, a normalized, size-selected A. rusticana transcriptome library was sequenced using 454 Titanium technology. De novo assembly of the resulting reads yielded 14871 isotigs with an average length of 1133 bp. Peroxidase genes were identified among these isotigs using sequence database searches, ORF finding and ORF characterization. The candidate sequences were manually reviewed and verified by Sanger sequencing of PCR-amplified genomic fragments, resulting in the discovery of 28 secretory peroxidases, 23 of them previously unknown. A total of 22 isoenzymes, including allelic variants, were successfully expressed in Pichia pastoris and showed peroxidase activity with at least one of the substrates tested, thus enabling their development into pure commercial isoenzyme preparations.
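The ORF-finding step mentioned above can be illustrated with a minimal sketch. This is not the authors' pipeline, whose tools and parameters are not given in the abstract; it is a generic six-frame scan for ATG-to-stop open reading frames. The function names, the `min_codons` threshold and the toy sequence are assumptions made purely for illustration.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 3) -> List[Tuple[str, int, str]]:
    """Scan all six reading frames for ORFs running from ATG to a stop codon.

    Returns (strand, frame, orf_sequence) tuples. `min_codons` is a
    hypothetical length cut-off counting the codons before the stop;
    a real analysis would use a much larger value.
    """
    orfs = []
    seq = seq.upper()
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # index of the first ATG of the currently open ORF
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, s[start:i + 3]))
                    start = None
    return orfs


if __name__ == "__main__":
    # Toy sequence, not taken from the horseradish transcriptome.
    toy_isotig = "CCATGAAATTTGGGTAACC"
    for strand, frame, orf in find_orfs(toy_isotig):
        print(strand, frame, orf)
```

In practice, candidate ORFs found this way would still need to be compared against sequence databases and checked manually, as the abstract describes.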

Conclusions: This study demonstrates that transcriptome sequencing combined with sequence motif searching is a powerful approach for the discovery and rapid supply of new enzymes and isoenzymes from any plant or other eukaryotic organism. The identification and manual verification of the sequences of 28 HRP isoenzymes not only contributes a set of peroxidases for industrial, biological and biomedical applications, but also provides valuable information on the reliability of the approach for identifying and characterizing a large group of isoenzymes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Armoracia / genetics*
  • Databases, Genetic
  • Gene Library
  • Genes, Plant*
  • Isoenzymes / classification
  • Isoenzymes / genetics
  • Molecular Sequence Data
  • Peroxidase / classification
  • Peroxidase / genetics*
  • Phylogeny
  • Plant Proteins / genetics
  • Sequence Analysis, DNA
  • Transcriptome*


Substances

  • Isoenzymes
  • Plant Proteins
  • Peroxidase

Associated data

  • GENBANK/HE963800
  • GENBANK/HE963801
  • GENBANK/HE963802
  • GENBANK/HE963803
  • GENBANK/HE963804
  • GENBANK/HE963805
  • GENBANK/HE963806
  • GENBANK/HE963807
  • GENBANK/HE963808
  • GENBANK/HE963809
  • GENBANK/HE963810
  • GENBANK/HE963811
  • GENBANK/HE963812
  • GENBANK/HE963813
  • GENBANK/HE963814
  • GENBANK/HE963815
  • GENBANK/HE963816
  • GENBANK/HE963817
  • GENBANK/HE963818
  • GENBANK/HE963819
  • GENBANK/HE963820
  • GENBANK/HE963821
  • GENBANK/HE963822
  • GENBANK/HE963823
  • GENBANK/HE963824
  • GENBANK/HE963825