Characterization of cDNA clones selected by the GeneMark analysis from size-fractionated cDNA libraries from human brain

DNA Res. 1999 Oct 29;6(5):329-36. doi: 10.1093/dnares/6.5.329.

Abstract

We have conducted a sequencing project of human cDNAs that encode large proteins in brain. To select cDNA clones for sequencing in this project, clones have been examined experimentally by in vitro transcription/translation prior to sequencing. In this study, we tested an alternative approach for picking cDNA clones with a high probability of carrying a protein-coding region. This approach used 5'-end single-pass sequence data and the GeneMark program to assess protein-coding potential, and allowed us to select 74 clones out of 14,804 redundant cDNA clones. The complete sequences of these 74 clones revealed that all of them carried possible protein-coding sequences, as expected, and that 45% encoded proteins of more than 500 amino acid residues. The results indicate that GeneMark analysis of the 5'-end sequences of cDNAs is a simple and effective means of selecting cDNA clones with protein-coding potential, although the sizes of the encoded proteins cannot be predicted.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • 5' Untranslated Regions / genetics
  • Brain / metabolism*
  • Cloning, Molecular*
  • DNA, Complementary / genetics*
  • Gene Expression Profiling
  • Gene Library
  • Humans
  • Molecular Sequence Data
  • Physical Chromosome Mapping
  • Proteins / genetics*
  • Proteins / metabolism
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sequence Analysis, DNA / methods*

Substances

  • 5' Untranslated Regions
  • DNA, Complementary
  • Proteins

Associated data

  • GENBANK/AB032945
  • GENBANK/AB032946
  • GENBANK/AB032947
  • GENBANK/AB032948
  • GENBANK/AB032949
  • GENBANK/AB032950
  • GENBANK/AB032951
  • GENBANK/AB032952
  • GENBANK/AB032953
  • GENBANK/AB032954
  • GENBANK/AB032955
  • GENBANK/AB032956
  • GENBANK/AB032957
  • GENBANK/AB032958
  • GENBANK/AB032959
  • GENBANK/AB032960
  • GENBANK/AB032961
  • GENBANK/AB032962
  • GENBANK/AB032963
  • GENBANK/AB032964
  • GENBANK/AB032965
  • GENBANK/AB032966
  • GENBANK/AB032967
  • GENBANK/AB032968
  • GENBANK/AB032969
  • GENBANK/AB032970
  • GENBANK/AB032971
  • GENBANK/AB032972
  • GENBANK/AB032973
  • GENBANK/AB032974