The peroxisomal targeting signal type 1 (PTS1) is a seemingly simple peptide sequence at the C-terminal end of most peroxisomal matrix proteins. PTS1 can be described as a tripeptide with the consensus motif [S/A/C] [K/R/H] L. However, this description is neither necessary nor sufficient. It does not cover all cases of PTS1 proteins, and some proteins in accordance with this consensus do not target to the peroxisome. In order to find new PTS proteins in yeast and to arrive at a more complete description of the PTS1 consensus motif, we developed a machine learning approach that involves orthologue expansion of the set of known peroxisomal proteins. We performed a genome-wide in silico screen, characterised several PTS1-containing peptides and identified two new peroxisomal matrix proteins, which we named Pxp1 (Yel020c) and Pxp2 (Yjr111c). Based on these in silico and in vivo analyses, we revised the yeast PTS1 consensus which now includes all known PTS1 proteins.
Keywords: PTS1; Pxp1; Pxp2; Yel020c; Yjr111c; amino acid analysis; machine learning; organelle targeting; peroxisomal targeting signal 1; peroxisome.
© 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.