MHC class II pseudogene and genomic signature of a 32-kb cosmid in the house finch (Carpodacus mexicanus)

Genome Res. 2000 May;10(5):613-23. doi: 10.1101/gr.10.5.613.

Abstract

Large-scale sequencing studies in vertebrates have thus far focused primarily on the genomes of a few model organisms. Birds are of interest to genomics because of their much smaller and highly streamlined genomes compared to mammals. However, large-scale genetic work has been confined almost exclusively to the chicken; we know little about general aspects of genomes in nongame birds. This study examines the organization of a genomic region containing an Mhc class II B gene in a representative of another important lineage of the avian tree, the songbirds (Passeriformes). We used a shotgun sequencing approach to determine the sequence of a 32-kb cosmid insert containing a strongly hybridizing Mhc fragment from house finches (Carpodacus mexicanus). There were a total of three genes found on the cosmid clone, about the gene density expected for the mammalian Mhc: a class II Mhc beta-chain gene (Came-DAB1), a serine-threonine kinase, and a zinc finger motif. Frameshift mutations in both the second and third exons of Came-DAB1 and the unalignability of the gene after the third exon suggest that it is a nonfunctional pseudogene. In addition, the identifiable introns of Came-DAB1 are more than twice as large as those of chickens. Nucleotide diversity in the peptide-binding region of Came-DAB1 (Pi = 0.03) was much lower than polymorphic chicken and other functional Mhc genes but higher than the expected diversity for a neutral locus in birds, perhaps because of hitchhiking on a selected Mhc locus close by. The serine-threonine kinase gene is likely functional, whereas the zinc finger motif is likely nonfunctional. A paucity of long simple-sequence repeats and retroelements is consistent with emerging rules of chicken genomics, and a pictorial analysis of the "genomic signature" of this sequence, the first of its kind for birds, bears strong similarity to mammalian signatures, suggesting common higher-order structures in these homeothermic genomes. The house finch sequence is among a very few of its kind from nonmodel vertebrates and provides insight into the evolution of the avian Mhc and of avian genomes generally.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Avian Proteins*
  • Base Composition
  • Base Sequence
  • Birds
  • Chickens
  • Cosmids / genetics*
  • Evolution, Molecular
  • Genes, MHC Class II / genetics*
  • Genetic Variation
  • Molecular Sequence Data
  • Multigene Family
  • Phylogeny
  • Proteins / genetics
  • Pseudogenes / genetics*
  • Repetitive Sequences, Nucleic Acid / genetics
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Songbirds / genetics*

Substances

  • Avian Proteins
  • Came-DAB1 protein, Carpodacus mexicanus
  • Proteins

Associated data

  • GENBANK/AF205032
  • GENBANK/AF241546
  • GENBANK/AF241547
  • GENBANK/AF241548
  • GENBANK/AF241549
  • GENBANK/AF241550
  • GENBANK/AF241551
  • GENBANK/AF241552
  • GENBANK/AF241553
  • GENBANK/AF241554
  • GENBANK/AF241555
  • GENBANK/AF241556
  • GENBANK/AF241557
  • GENBANK/AF241558
  • GENBANK/AF241559
  • GENBANK/AF241560
  • GENBANK/AF241561
  • GENBANK/AF241562
  • GENBANK/AF241563
  • GENBANK/AF241564
  • GENBANK/AF241565