Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice

Science. 2003 Jul 18;301(5631):376-9. doi: 10.1126/science.1081288.


We collected and completely sequenced 28,469 full-length complementary DNA clones from Oryza sativa L. ssp. japonica cv. Nipponbare. Through homology searches of publicly available sequence data, we assigned tentative protein functions to 21,596 clones (75.86%). Mapping of the cDNA clones to genomic DNA revealed that there are 19,000 to 20,500 transcription units in the rice genome. Protein informatics analysis against the InterPro database revealed the existence of proteins presented in rice but not in Arabidopsis. Sixty-four percent of our cDNAs are homologous to Arabidopsis proteins.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • Amino Acid Sequence
  • Cloning, Molecular
  • Computational Biology
  • DNA, Complementary
  • Databases, Nucleic Acid
  • Databases, Protein
  • Genes, Plant
  • Genome, Plant*
  • Molecular Sequence Data
  • Open Reading Frames
  • Oryza / genetics*
  • Plant Proteins / chemistry
  • Plant Proteins / genetics
  • Plant Proteins / physiology
  • Protein Structure, Tertiary
  • RNA, Antisense / genetics
  • Sequence Analysis, DNA*
  • Sequence Homology, Amino Acid
  • Sequence Homology, Nucleic Acid
  • Transcription Factors / chemistry
  • Transcription Factors / genetics
  • Transcription, Genetic


  • DNA, Complementary
  • Plant Proteins
  • RNA, Antisense
  • Transcription Factors