Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
, 5 (10), R84

A Genome Annotation-Driven Approach to Cloning the Human ORFeome


A Genome Annotation-Driven Approach to Cloning the Human ORFeome

John E Collins et al. Genome Biol.


We have developed a systematic approach to generating cDNA clones containing full-length open reading frames (ORFs), exploiting knowledge of gene structure from genomic sequence. Each ORF was amplified by PCR from a pool of primary cDNAs, cloned and confirmed by sequencing. We obtained clones representing 70% of genes on human chromosome 22, whereas searching available cDNA clone collections found at best 48% from a single collection and 60% for all collections combined.


Figure 1
Figure 1
Summary of the ORF cloning method.
Figure 2
Figure 2
Sequence characteristics of cloned ORFs. (a) Plot of the distribution of the 398 chromosome 22 ORFs by GC content (%) and length (bases). Closed circles are the 331 ORFs that were isolated as acceptable clones (278) or as clones with the correct ORF but currently with a problem in the sequence (53). Dotted circles are the rest of the ORFs which were not amplifiable or clonable (67). (b) Overlap of chromosome 22 ORF clones isolated here with cDNA collections. Analysis of GC content and length for 398 chromosome 22 ORFs, split according to whether the gene has been isolated only by the strategy described here (SANGER, red circles), only in the cDNA collections (OTHER, green triangles), in both (BOTH, black circles), or not at all (NOT, yellow triangles).
Figure 3
Figure 3
Schematic Venn diagram showing the relationships of the set of ORF clones isolated here compared with the full-length cDNA clones in current high-throughput clone collections (227 maintain the correct reading frame at the amino acid level from Table 1) for the 398 annotated full-length chromosome 22 ORFs. The four different classes of genes are labeled as in the text and Figure 2b.

Similar articles

See all similar articles

Cited by 10 articles

See all "Cited by" articles


    1. Vidal M. A biological atlas of functional maps. Cell. 2001;104:333–339. doi: 10.1016/S0092-8674(01)00221-5. - DOI - PubMed
    1. Reboul J, Vaglio P, Rual JF, Lamesch P, Martinez M, Armstrong CM, Li S, Jacotot L, Bertin N, Janky R, et al. C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression. Nat Genet. 2003;34:35–41. doi: 10.1038/ng1140. - DOI - PubMed
    1. Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, Lockshon D, Narayan V, Srinivasan M, Pochart P, et al. A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature. 2000;403:623–627. doi: 10.1038/35001009. - DOI - PubMed
    1. Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl Acad Sci USA. 2001;98:4569–4574. doi: 10.1073/pnas.061034498. - DOI - PMC - PubMed
    1. Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. Detecting protein function and protein-protein interactions from genome sequences. Science. 1999;285:751–753. doi: 10.1126/science.285.5428.751. - DOI - PubMed

Publication types

LinkOut - more resources