Complete structure of the human alpha-albumin gene, a new member of the serum albumin multigene family

Proc Natl Acad Sci U S A. 1996 Jul 23;93(15):7557-61. doi: 10.1073/pnas.93.15.7557.

Abstract

The nucleotide sequence of the human alpha-albumin gene, including 887 bp of the 5'-flanking region and 1311 bp of the 3-flanking region (24,454 in total), was determined from three overlapping lambda phage clones. The sequence spans 22,256 bp from the cap site to the polyadenylylation site, revealing a gene structure of 15 exons separated by 14 introns. The methionine initiation codon ATG is within exon 1; the termination codon TGA is within exon 14. Exon 15 is entirely untranslated and contains the polyadenylylation signal AATAAA. The deduced polypeptide chain is composed of a 21-amino-acid leader peptide, followed by 578 amino acids of the mature protein. There are seven repetitive DNA elements (Alu and Kpn) in the introns and 3-flanking region. The sizes of the 15 alpha-albumin exons match closely those of the albumin, alpha-fetoprotein, and vitamin D-binding protein genes. The exons are symmetrically placed within the three domains of the individual proteins, and they share a characteristic codon splitting pattern that is conserved among members of the gene family. The results provide strong evidence that alpha-albumin belongs to, and most likely completes with, the serum albumin gene family. Based on structural similarity, alpha-albumin appears to be most closely related to alpha-fetoprotein. The complete structure of this family of four tandemly linked genes provides a well-characterized approximately 200 kb locus in the 4q subcentromeric region of the human genome.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Albumins / biosynthesis
  • Albumins / chemistry
  • Albumins / genetics*
  • Amino Acid Sequence
  • Bacteriophage lambda
  • Base Sequence
  • Centromere
  • Chromosome Mapping
  • Chromosomes, Human, Pair 4*
  • Cloning, Molecular
  • Exons
  • Humans
  • Introns
  • Molecular Sequence Data
  • Multigene Family*
  • Protein Structure, Secondary
  • RNA Splicing
  • Regulatory Sequences, Nucleic Acid
  • Repetitive Sequences, Nucleic Acid
  • Restriction Mapping
  • Sequence Homology, Amino Acid
  • Serum Albumin / chemistry
  • Serum Albumin / genetics*
  • TATA Box
  • Vitamin D-Binding Protein / chemistry
  • alpha-Fetoproteins / chemistry

Substances

  • Albumins
  • Serum Albumin
  • Vitamin D-Binding Protein
  • alpha-Fetoproteins
  • alpha-albumin

Associated data

  • GENBANK/U51243