Complete primary structure and genomic organization of the mouse Col14a1 gene

Matrix Biol. 2004 Jan;22(7):595-601. doi: 10.1016/j.matbio.2003.11.005.

Abstract

The entire mouse cDNA sequence for type XIV collagen was determined using overlapping PCR products. The 6456-nucleotide (nt) cDNA sequence contains a 5391-nt open reading frame encoding 1797 amino acid residues. The amino terminus has a 28-residue signal peptide that is followed by the mature polypeptide of 1769 amino acid residues with a calculated molecular mass of 193.2 kDa. The mouse alpha1(XIV) collagen chain is predicted to contain all the structural domains described for the polypeptide in chicken and human. These include fibronectin type III repeats, von Willebrand factor A domains, thrombospondin-N-terminal-like domains and two triple-helical domains similar to those of other collagen family members. The amino acid sequence of mouse alpha1(XIV) collagen showed an overall identity of 74% to the chicken sequence and 88% to the human sequence. The complete genomic structure of the mouse gene was determined and comprises 48 exons. Alternatively spliced forms of mouse type XIV collagen were not identified, consistent with the findings for the human form.
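The lengths quoted in the abstract can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original study: the constant and function names are hypothetical, and it assumes that the stated 5391-nt open reading frame excludes the stop codon, which the abstract does not say explicitly.

```python
# Figures taken directly from the abstract.
CDNA_NT = 6456            # full cDNA length, nucleotides
ORF_NT = 5391             # open reading frame length, nucleotides
TOTAL_RESIDUES = 1797     # residues encoded by the ORF
SIGNAL_PEPTIDE = 28       # signal peptide residues
MATURE_RESIDUES = 1769    # mature polypeptide residues
MATURE_MASS_KDA = 193.2   # calculated mass of the mature polypeptide


def check_abstract_numbers():
    """Return simple consistency checks on the reported lengths.

    Assumption (not stated in the abstract): ORF_NT counts coding
    codons only, without the stop codon.
    """
    codons, remainder = divmod(ORF_NT, 3)
    return {
        # An ORF should be a whole number of codons.
        "orf_is_whole_codons": remainder == 0,
        "codons": codons,
        "codons_match_residues": codons == TOTAL_RESIDUES,
        # Signal peptide plus mature chain should equal the full product.
        "signal_plus_mature": SIGNAL_PEPTIDE + MATURE_RESIDUES,
        # Nucleotides of the cDNA lying outside the ORF.
        "non_coding_nt": CDNA_NT - ORF_NT,
        # Average mass per residue of the mature chain, in daltons.
        "mean_residue_mass_da": MATURE_MASS_KDA * 1000 / MATURE_RESIDUES,
    }


if __name__ == "__main__":
    for name, value in check_abstract_numbers().items():
        print(f"{name}: {value}")
```

Under that assumption the numbers agree with one another: the ORF divides evenly into codons, and the signal peptide and mature chain together account for every encoded residue.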

Publication types

  • Corrected and Republished Article
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Alternative Splicing
  • Amino Acid Sequence
  • Animals
  • Collagen / genetics*
  • DNA, Complementary / genetics
  • Drosophila Proteins
  • Genome*
  • Glycoproteins
  • Mice / genetics*
  • Molecular Sequence Data
  • Molecular Weight
  • Open Reading Frames
  • Phosphatidate Phosphatase
  • Protein Structure, Tertiary / genetics
  • Sequence Homology, Amino Acid

Substances

  • COL14A1 protein, human
  • Col14a1 protein, mouse
  • DNA, Complementary
  • Drosophila Proteins
  • Glycoproteins
  • Collagen
  • Phosphatidate Phosphatase
  • Wun2 protein, Drosophila

Associated data

  • GENBANK/AY221110