Structure of the mouse nucleolin gene. The complete sequence reveals that each RNA binding domain is encoded by two independent exons

J Mol Biol. 1988 Apr 20;200(4):627-38. doi: 10.1016/0022-2836(88)90476-7.

Abstract

Nucleolin is a multifunctional nucleolar protein involved in the synthesis, packaging and maturation of pre-rRNA in eukaryotic cells. We describe the molecular organization and complete sequence of the mouse nucleolin gene, the first higher eukaryotic gene encoding a protein that is both an RNA binding protein involved in rRNA processing and a specific nucleolar protein. The nucleolin gene extends over 9000 base-pairs and is split into 14 exons that encode the 706 amino acid residues of the protein. The promoter sequence is G + C-rich (67% G + C) with four G/C boxes; it lacks bona fide TATA and CAAT boxes and shows capping site heterogeneity. The existence of pyrimidine-rich motifs, similar to those found in the promoter of ribosomal protein genes, could be relevant to the co-regulation of genes whose products are involved in ribosome biogenesis. Nucleolin contains four RNA binding domains, each about 80 amino acid residues long, which include the 11-residue core ribonucleoprotein consensus motif. Each domain is encoded by two exons, with an intervening sequence interrupting the conserved core motif at roughly the same amino acid position. This latter result suggests that the RNA binding domains are composed of two independent subdomains, whose functions remain to be determined.
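As an illustration of how a figure such as "67% G + C" is obtained, G + C content is the share of G and C bases among all bases in a sequence. The sketch below is a minimal example under stated assumptions: the function name `gc_percent` and the toy sequence are hypothetical and are not taken from the paper or from the nucleolin promoter.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Characters other than A, C, G and T (gaps, N, whitespace) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Toy sequence for illustration only (not the nucleolin promoter).
toy = "GGCGCCATGCGC"
print(round(gc_percent(toy), 1))
```

Applied to the actual promoter region from the deposited sequence (GenBank X07699), the same calculation would be expected to give a value near the 67% reported in the abstract, depending on how the promoter boundaries are chosen.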

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Binding Sites
  • Carrier Proteins / genetics*
  • DNA
  • Exons*
  • Genes*
  • Introns
  • Mice
  • Molecular Sequence Data
  • Nuclear Proteins / genetics*
  • Nuclear Proteins / metabolism
  • Nucleolin
  • Phosphoproteins / genetics*
  • Phosphoproteins / metabolism
  • RNA, Messenger
  • RNA, Ribosomal / genetics*
  • RNA-Binding Proteins
  • Repetitive Sequences, Nucleic Acid
  • Terminator Regions, Genetic

Substances

  • Carrier Proteins
  • Nuclear Proteins
  • Phosphoproteins
  • RNA, Messenger
  • RNA, Ribosomal
  • RNA-Binding Proteins
  • DNA

Associated data

  • GENBANK/X07699