Sequencing analysis of a putative human O-sialoglycoprotein endopeptidase gene (OSGEP) and analysis of a bidirectional promoter between the OSGEP and APEX genes

Gene. 2002 Feb 20;285(1-2):101-8. doi: 10.1016/s0378-1119(02)00429-8.


We performed cDNA and genomic cloning, sequencing and promoter analysis of the putative human O-sialoglycoprotein endopeptidase gene OSGEP (a homologue of gcp, a Pasteurella haemolytica A1 glycoprotease). The cloned OSGEP cDNA is 1311 nucleotides long, and encodes a protein consisting of 335 amino acids with predicted molecular mass of 36.4 kDa. The amino acid sequence of OSGEP showed 29.7% identity with that of P. haemolytica glycoprotease. The OSGEP gene is 7.75 kb long, consists of 11 exons and 10 introns, and lies immediately adjacent to the APEX gene (which encodes APEX nuclease, a multifunctional DNA repair enzyme) in 5'-to-5' orientation. The promoter region of the OSGEP gene lacks the typical TATA box, but has putative regulatory elements in the CpG island. Northern blot analysis showed ubiquitous expression of the OSGEP gene in several tissues, and we observed similarities in expression patterns between OSGEP and APEX. In order to study the regulation of OSGEP gene expression, we analyzed the OSGEP promoter region by luciferase assay using HeLa cells. A functional region required for full transcription activity was narrowed down to a 23 bp region containing a CCAAT box. It has been reported that this CCAAT box promotes basal transcription in the APEX direction. We thus conclude that a bidirectional promoter containing a CCAAT box regulates transcription of both the OSGEP and APEX genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Blotting, Northern
  • Carbon-Oxygen Lyases / genetics*
  • DNA / chemistry
  • DNA / genetics
  • DNA, Complementary / chemistry
  • DNA, Complementary / genetics
  • DNA-(Apurinic or Apyrimidinic Site) Lyase*
  • Exons
  • Female
  • Gene Expression
  • Genes / genetics
  • HeLa Cells
  • Humans
  • Introns
  • Luciferases / genetics
  • Luciferases / metabolism
  • Metalloendopeptidases / genetics*
  • Molecular Sequence Data
  • Promoter Regions, Genetic / genetics*
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Recombinant Fusion Proteins / genetics
  • Recombinant Fusion Proteins / metabolism
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid


  • DNA, Complementary
  • RNA, Messenger
  • Recombinant Fusion Proteins
  • DNA
  • Luciferases
  • Metalloendopeptidases
  • O-sialoglycoprotein endopeptidase
  • Carbon-Oxygen Lyases
  • APEX1 protein, human
  • DNA-(Apurinic or Apyrimidinic Site) Lyase

Associated data

  • GENBANK/AB047823
  • GENBANK/AB050442