Large-scale sequencing of two regions in human chromosome 7q22: analysis of 650 kb of genomic sequence around the EPO and CUTL1 loci reveals 17 genes

Genome Res. 1998 Oct;8(10):1060-73. doi: 10.1101/gr.8.10.1060.


We have sequenced and annotated two genomic regions located in the Giemsa negative band q22 of human chromosome 7. The first region defined by the erythropoietin (EPO) locus is 228 kb in length and contains 13 genes. Whereas 3 genes (GNB2, EPO, PCOLCE) were known previously on the mRNA level, we have been able to identify 10 novel genes using a newly developed automatic annotation tool RUMMAGE-DP, which comprises >26 different programs mainly for exon prediction, homology searches, and compositional and repeat analysis. For precise annotation we have also resequenced ESTs identified to the region and assembled them to build large cDNAs. In addition, we have investigated the differential splicing of genes. Using these tools we annotated 4 of the 10 genes as a zonadhesin, a transferrin homolog, a nucleoporin-like gene, and an actin gene. Two genes showed weak similarity to an insulin-like receptor and a neuronal protein with a leucine-rich amino-terminal domain. Four predicted genes (CDS1-CDS4) CDS that have been confirmed on the mRNA level showed no similarity to known proteins and a potential function could not be assigned. The second region in 7q22 defined by the CUTL1 (CCAAT displacement protein and its splice variant) locus is 416 kb in length and contains three known genes, including PMSL12, APS, CUTL1, and a novel gene (CDS5). The CUTL1 locus, consisting of two splice variants (CDP and CASP), occupies >300 kb. Based on the G, C profile an isochore switch can be defined between the CUTL1 gene and the APS and PMSL12 genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Base Composition
  • Chromosomes, Human, Pair 7 / genetics*
  • Cloning, Molecular / methods
  • Erythropoietin / genetics*
  • Exons / genetics*
  • Expressed Sequence Tags
  • GTP-Binding Proteins / genetics
  • Homeodomain Proteins / genetics
  • Humans
  • Molecular Sequence Data
  • Nuclear Proteins / genetics*
  • Phylogeny
  • Repressor Proteins / genetics*
  • Sequence Analysis, DNA / methods*
  • Transcription Factors


  • CUX1 protein, human
  • Homeodomain Proteins
  • Nuclear Proteins
  • Repressor Proteins
  • Transcription Factors
  • Erythropoietin
  • GTP-Binding Proteins

Associated data

  • GENBANK/AF006752
  • GENBANK/AF024533
  • GENBANK/AF024534
  • GENBANK/AF030453
  • GENBANK/AF047825
  • GENBANK/AF053356