Drosophila proteins involved in metabolism of uracil-DNA possess different types of nuclear localization signals

FEBS J. 2010 May;277(9):2142-56. doi: 10.1111/j.1742-4658.2010.07630.x.

Abstract

Adequate transport of large proteins that function in the nucleus is indispensable for cognate molecular events within this organelle. Selective protein import into the nucleus requires nuclear localization signals (NLS) that are recognized by importin receptors in the cytoplasm. Here we investigated the sequence requirements for nuclear targeting of Drosophila proteins involved in the metabolism of uracil-substituted DNA: the recently identified uracil-DNA degrading factor, dUTPase, and the two uracil-DNA glycosylases present in Drosophila. For the uracil-DNA degrading factor, NLS prediction identified two putative NLS sequences [PEKRKQE(320-326) and PKRKKKR(347-353)]. Truncation and site-directed mutagenesis using YFP reporter constructs showed that only one of these basic stretches is critically required for efficient nuclear localization in insect cells. This segment corresponds to the well-known prototypic NLS of SV40 T-antigen. An almost identical NLS segment is also present in the Drosophila thymine-DNA glycosylase, but no NLS elements were predicted in the single-strand-specific monofunctional uracil-DNA glycosylase homolog protein. This latter protein has a molecular mass of 31 kDa, which may allow NLS-independent transport. For Drosophila dUTPase, two isoforms with distinct features regarding molecular mass and subcellular distribution were recently described. In this study, we characterized the basic PAAKKMKID(10-18) segment of dUTPase, which has been predicted to be a putative NLS by in silico analysis. Deletion studies, using YFP reporter constructs expressed in insect cells, revealed the importance of the PAA(10-12) tripeptide and the ID(17-18) dipeptide, as well as the role of the PAAK(10-13) segment in nuclear localization of dUTPase. We constructed a structural model that shows the molecular basis of such recognition in three dimensions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Cell Line
  • Computational Biology
  • Conserved Sequence
  • DNA / metabolism*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism*
  • Drosophila Proteins / chemistry
  • Drosophila Proteins / genetics
  • Drosophila Proteins / metabolism*
  • Drosophila melanogaster / chemistry
  • Drosophila melanogaster / metabolism*
  • Humans
  • Isoenzymes / metabolism
  • Models, Molecular
  • Molecular Sequence Data
  • Mutation
  • Nuclear Localization Signals / chemistry
  • Nuclear Localization Signals / genetics
  • Nuclear Localization Signals / metabolism*
  • Protein Structure, Quaternary
  • Pyrophosphatases / chemistry
  • Pyrophosphatases / genetics
  • Pyrophosphatases / metabolism*
  • Sequence Alignment
  • Spodoptera
  • Uracil / metabolism*

Substances

  • DNA-Binding Proteins
  • Drosophila Proteins
  • Isoenzymes
  • Nuclear Localization Signals
  • UDE protein, Drosophila
  • Uracil
  • DNA
  • Pyrophosphatases
  • dUTP pyrophosphatase