Primary structure of human pepsinogen C gene

J Biol Chem. 1988 Jan 25;263(3):1382-5.


The entire human pepsinogen C gene has been isolated from a cosmid genomic library. The nucleotide sequences of all the exons and of the 5'- and 3'-flanking regions of the gene are presented. The organization of the gene is fundamentally compatible with that of the genes for other aspartic proteinases, allowing us to conclude that the genes of these aspartic proteinases, including pepsinogen C, are derived from a common ancestral gene. The predicted 388-residue amino acid sequence of human pepsinogen C consists of a 16-residue signal sequence, a 43-residue activation peptide, and a 329-residue mature pepsin containing the two active-site aspartic acids. In light of current notions about eukaryotic gene expression, possible regulatory roles of oligonucleotide sequences in the promoter region of the gene are discussed.
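The segment lengths quoted in the abstract can be checked with a short calculation. The sketch below is illustrative only and is not taken from the paper: it assumes residues are numbered from 1 at the first residue of the signal sequence, which may differ from the numbering used in the article, and the names `SEGMENTS` and `segment_ranges` are hypothetical.

```python
# Segment lengths (in residues) as stated in the abstract.
SEGMENTS = [
    ("signal sequence", 16),
    ("activation peptide", 43),
    ("mature pepsin", 329),
]


def segment_ranges(segments):
    """Return (name, start, end) tuples with 1-based inclusive positions.

    Assumption: numbering starts at 1 on the first residue of the
    precursor; the paper itself may use a different convention.
    """
    ranges = []
    start = 1
    for name, length in segments:
        end = start + length - 1
        ranges.append((name, start, end))
        start = end + 1
    return ranges


RANGES = segment_ranges(SEGMENTS)
TOTAL = sum(length for _, length in SEGMENTS)

if __name__ == "__main__":
    for name, start, end in RANGES:
        print(f"{name}: residues {start}-{end}")
    print(f"total: {TOTAL} residues")
```

Under that numbering assumption, the three segments tile the precursor without gaps, and their lengths sum to the 388 residues reported for the predicted sequence.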

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Base Sequence
  • Cosmids
  • Humans
  • Molecular Sequence Data
  • Pepsinogens / genetics*
  • Promoter Regions, Genetic


Substances

  • Pepsinogens

Associated data

  • GENBANK/J03508
  • GENBANK/M18659
  • GENBANK/M18660
  • GENBANK/M18661
  • GENBANK/M18662
  • GENBANK/M18663
  • GENBANK/M18664
  • GENBANK/M18665
  • GENBANK/M18666
  • GENBANK/M18667