Manually curated database of rice proteins

Nucleic Acids Res. 2014 Jan;42(Database issue):D1214-21. doi: 10.1093/nar/gkt1072. Epub 2013 Nov 7.


'Manually Curated Database of Rice Proteins' (MCDRP) available at is a unique curated database based on published experimental data. Semantic integration of scientific data is essential to gain a higher level of understanding of biological systems. Since the majority of scientific data is available as published literature, text mining is an essential step before the data can be integrated and made available for computer-based search in various databases. However, text mining is a tedious exercise and thus, there is a large gap in the data available in curated databases and published literature. Moreover, data in an experiment can be perceived from several perspectives, which may not reflect in the text-based curation. In order to address such issues, we have demonstrated the feasibility of digitizing the experimental data itself by creating a database on rice proteins based on in-house developed data curation models. Using these models data of individual experiments have been digitized with the help of universal ontologies. Currently, the database has data for over 1800 rice proteins curated from >4000 different experiments of over 400 research articles. Since every aspect of the experiment such as gene name, plant type, tissue and developmental stage has been digitized, experimental data can be rapidly accessed and integrated.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Protein*
  • Genes, Plant
  • Internet
  • Oryza / genetics*
  • Oryza / growth & development
  • Plant Proteins / chemistry
  • Plant Proteins / genetics
  • Plant Proteins / physiology*


  • Plant Proteins