Phylogenetic analysis and in silico characterization of the GARS-AIRS-GART gene which codes for a tri-functional enzyme protein involved in de novo purine biosynthesis

Mol Biotechnol. 2009 Jul;42(3):306-19. doi: 10.1007/s12033-009-9160-1. Epub 2009 Mar 20.

Abstract

Human GARS-AIRS-GART encodes a fused tri-functional enzyme protein involved in de novo purine biosynthesis, aberrant function being implicated in Down syndrome and Leukemia. We performed phylogenetic analysis to discern evolutionary relationships and in silico characterization to identify elements potentially important for gene regulation. We report that murine, bovine and chimpanzee sequences are the nearest neighbors of human GARS-AIRS-GART and that endo-duplication of the AIRS protein is restricted to insect orthologs. Convergent evolution of mono-functional bacterial orthologs to bi-functional, partly fused, yeast orthologs is observed from the rooted-NJ tree topology that bears bootstrap values exceeding 9000 in majority of the nodes. Sequence alignments reveal that introns 11-15 of human GARS-AIRS-GART are conserved among vertebrates. An inverse correlation is observed between intron size and intron density without bias for intron position. The generation time of organisms is independent of intron density. Human, bovine and murine sequences possess similar GC content with CpG islands in promoter regions. The long isoforms of cow and chicken transcripts and short isoforms of human, bovine and murine mRNA form energetically stable stem-like structures in the 3'-UTR and may regulate translational stability of GARS-AIRS-GART transcripts. Glycine-rich loops important for enzyme structure and ATP-, folate-binding residues are partially conserved.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Composition
  • Carbon-Nitrogen Ligases / genetics*
  • Carbon-Nitrogen Ligases / metabolism
  • Cluster Analysis
  • Computer Simulation
  • CpG Islands
  • Gene Expression Regulation
  • Humans
  • Models, Genetic
  • Molecular Sequence Data
  • Phosphoribosylglycinamide Formyltransferase / genetics*
  • Phosphoribosylglycinamide Formyltransferase / metabolism
  • Phylogeny
  • Sequence Alignment
  • Statistics, Nonparametric
  • Untranslated Regions

Substances

  • Untranslated Regions
  • Phosphoribosylglycinamide Formyltransferase
  • Carbon-Nitrogen Ligases
  • phosphoribosylaminoimidazole synthase
  • phosphoribosylamine-glycine ligase