Sequencing of a rice centromere uncovers active genes

Nat Genet. 2004 Feb;36(2):138-45. doi: 10.1038/ng1289. Epub 2004 Jan 11.


Centromeres are the last frontiers of complex eukaryotic genomes, consisting of highly repetitive sequences that resist mapping, cloning and sequencing. The centromere of rice Chromosome 8 (Cen8) has an unusually low abundance of highly repetitive satellite DNA, which allowed us to determine its sequence. A region of approximately 750 kb in Cen8 binds rice CENH3, the centromere-specific H3 histone. CENH3 binding is contained within a larger region that has abundant dimethylation of histone H3 at Lys9 (H3-Lys9), consistent with Cen8 being embedded in heterochromatin. Fourteen predicted and at least four active genes are interspersed in Cen8, along with CENH3 binding sites. The retrotransposons located in and outside of the CENH3 binding domain have similar ages and structural dynamics. These results suggest that Cen8 may represent an intermediate stage in the evolution of centromeres from genic regions, as in human neocentromeres, to fully mature centromeres that accumulate megabases of homogeneous satellite arrays.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Centromere / genetics*
  • Chromosomes, Artificial, Bacterial
  • DNA, Satellite
  • Genes
  • Molecular Sequence Data
  • Open Reading Frames
  • Oryza / genetics*
  • Retroelements


  • DNA, Satellite
  • Retroelements

Associated data

  • GENBANK/AY360384
  • GENBANK/AY360385
  • GENBANK/AY360386
  • GENBANK/AY360387
  • GENBANK/AY360388
  • GENBANK/AY360389
  • GENBANK/AY360390
  • GENBANK/AY360391
  • GENBANK/AY360392
  • GENBANK/AY360393
  • GENBANK/AY360394
  • GENBANK/AY438639