The genome sequence and structure of rice chromosome 1

Nature. 2002 Nov 21;420(6913):312-6. doi: 10.1038/nature01184.


The rice species Oryza sativa is considered to be a model plant because of its small genome size, extensive genetic map, relative ease of transformation and synteny with other cereal crops. Here we report the essentially complete sequence of chromosome 1, the longest chromosome in the rice genome. We summarize characteristics of the chromosome structure and the biological insight gained from the sequence. The analysis of 43.3 megabases (Mb) of non-overlapping sequence reveals 6,756 protein coding genes, of which 3,161 show homology to proteins of Arabidopsis thaliana, another model plant. About 30% (2,073) of the genes have been functionally categorized. Rice chromosome 1 is (G + C)-rich, especially in its coding regions, and is characterized by several gene families that are dispersed or arranged in tandem repeats. Comparison with a draft sequence indicates the importance of a high-quality finished sequence.

MeSH terms

  • Arabidopsis / genetics
  • Base Composition
  • Chloroplasts / genetics
  • Chromosomes, Plant / genetics*
  • Computational Biology
  • Contig Mapping
  • DNA Transposable Elements / genetics
  • Exons / genetics
  • Expressed Sequence Tags
  • Genes, Plant / genetics
  • Genome, Plant*
  • In Situ Hybridization, Fluorescence
  • Introns / genetics
  • Molecular Sequence Data
  • Multigene Family / genetics
  • Oryza / genetics*
  • Physical Chromosome Mapping*
  • Plant Proteins / classification
  • Plant Proteins / genetics
  • Sensitivity and Specificity
  • Sequence Analysis, DNA
  • Sequence Homology
  • Tandem Repeat Sequences / genetics


  • DNA Transposable Elements
  • Plant Proteins

Associated data

  • GENBANK/BA000010