Genome-wide identification of NBS genes in japonica rice reveals significant expansion of divergent non-TIR NBS-LRR genes

Mol Genet Genomics. 2004 May;271(4):402-15. doi: 10.1007/s00438-004-0990-z. Epub 2004 Mar 10.

Abstract

A complete set of candidate disease resistance ( R) genes encoding nucleotide-binding sites (NBSs) was identified in the genome sequence of japonica rice ( Oryza sativaL. var. Nipponbare). These putative R genes were characterized with respect to structural diversity, phylogenetic relationships and chromosomal distribution, and compared with those in Arabidopsis thaliana. We found 535 NBS-coding sequences, including 480 non-TIR (Toll/IL-1 receptor) NBS-LRR (Leucine Rich Repeat) genes. TIR NBS-LRR genes, which are common in A. thaliana, have not been identified in the rice genome. The number of non-TIR NBS-LRR genes in rice is 8.7 times higher than that in A. thaliana, and they account for about 1% of all of predicted ORFs in the rice genome. Some 76% of the NBS genes were located in 44 gene clusters or in 57 tandem arrays, and 16 apparent gene duplications were detected in these regions. Phylogenetic analyses based both NBS and N-terminal regions classified the genes into about 200 groups, but no deep clades were detected, in contrast to the two distinct clusters found in A. thaliana. The structural and genetic diversity that exists among NBS-LRR proteins in rice is remarkable, and suggests that diversifying selection has played an important role in the evolution of R genes in this agronomically important species. (Supplemental material is available online at http://gattaca.nju.edu.cn.)

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Arabidopsis / genetics
  • Binding Sites / genetics*
  • Biological Evolution
  • Gene Duplication
  • Genetic Variation*
  • Genome, Plant*
  • Leucine-Rich Repeat Proteins
  • Molecular Sequence Data
  • Oryza / genetics*
  • Phylogeny
  • Plant Diseases / genetics
  • Proteins / genetics*
  • Receptors, Interleukin-1 / genetics*
  • Repetitive Sequences, Amino Acid

Substances

  • Leucine-Rich Repeat Proteins
  • Proteins
  • Receptors, Interleukin-1