The Allergenic Protein Tha P 2 of Processionary Moths of the Genus Thaumetopoea (Thaumetopoeinae, Notodontidae, Lepidoptera): Characterization and Evolution

Gene. 2015 Dec 15;574(2):317-24. doi: 10.1016/j.gene.2015.08.019. Epub 2015 Aug 11.


The allergenic Tha p 2 protein has been extracted recently from the urticating setae of the pine processionary moth Thaumetopoea pityocampa. In the present paper, we test for the occurrence of this protein in other Thaumetopoeinae, with a particular focus on members of the genus Thaumetopoea, as well as unrelated moth species, to better understand the physicochemical properties of the protein, the nature of encoding genes and their evolutionary history. Tha p 2 is encoded by the intronless gene Tha p 2 that is restricted to the processionary moths (Thaumetopoeinae, Notodontidae, Lepidoptera). Most of the species present two isoforms of Tha p 2 that can be interpreted as the result of heterozygosity in the single gene. The only exception is represented by Thaumetopoea wilkinsoni, in which 20 different isoforms occur in a single specimen, leading to the conclusion that, at least in this species, multiple copies of Tha p 2 exist. Serine, glycine, cysteine and leucine are abundant in Tha p 2, a protein well conserved among processionary moths. The predicted secondary structures of Tha p 2 indicate the presence of 3 α-helices and six β-barrels. Finally, the evolution of the gene and the protein was characterized by a combination of positive and negative selection, with the latter being more evident.

Keywords: Allergenic protein Tha p 2; Molecular evolution; Processionary moths; Secondary structure prediction; Thaumeotopoeinae; Thaumetopoea.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Allergens / chemistry
  • Allergens / genetics*
  • Allergens / immunology
  • Amino Acid Sequence
  • Animals
  • Cloning, Molecular
  • Evolution, Molecular*
  • Insect Proteins / chemistry
  • Insect Proteins / genetics*
  • Insect Proteins / immunology
  • Molecular Sequence Data
  • Moths / genetics*
  • Phylogeny
  • Protein Structure, Secondary
  • Sequence Analysis, DNA


  • Allergens
  • Insect Proteins