Computational and experimental identification of C. elegans microRNAs

Mol Cell. 2003 May;11(5):1253-63. doi: 10.1016/s1097-2765(03)00153-9.

Abstract

MicroRNAs (miRNAs) constitute an extensive class of noncoding RNAs that are thought to regulate the expression of target genes via complementary base-pair interactions. To date, cloning has identified over 200 miRNAs from diverse eukaryotic organisms. Despite their success, such biochemical approaches are skewed toward identifying abundant miRNAs, unlike genome-wide, sequence-based computational predictions. We developed informatic methods to predict miRNAs in the C. elegans genome using sequence conservation and structural similarity to known miRNAs and generated 214 candidates. We confirmed the expression of four new miRNAs by Northern blotting and used a more sensitive PCR approach to verify the expression of ten additional candidates. Based on hypotheses underlying our computational methods, we estimate that the C. elegans genome may encode between 140 and 300 miRNAs and potentially many more.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Base Sequence / genetics*
  • Caenorhabditis elegans / genetics*
  • Caenorhabditis elegans / metabolism
  • Databases, Genetic
  • Evolution, Molecular
  • Gene Expression Regulation / genetics*
  • Medical Informatics Computing
  • MicroRNAs / analysis*
  • MicroRNAs / genetics*
  • Molecular Sequence Data
  • Molecular Structure
  • Phylogeny
  • Reproducibility of Results
  • Sequence Homology, Nucleic Acid*

Substances

  • MicroRNAs