miRNA target recognition using features of suboptimal alignments

Int J Data Min Bioinform. 2015;13(2):171-80. doi: 10.1504/ijdmb.2015.071523.

Abstract

MicroRNAs (miRNAs) are a class of short RNA molecules that regulate gene expression by binding directly to messenger RNAs. Conventional approaches to miRNA target prediction estimate the accessibility of target sites and the strength of the binding miRNA by finding optimums of some energy models, which involves O(n3) computations. Alternatively, we narrow down potential binding sites of miRNAs to suboptimal hits of a pairwise alignment algorithm called Fitting Alignment in O(n2). We invoke a same algorithm, once for all candidate sites to measure the site accessibilities. These features are applied to a binary classifier being learned to predict true associations between miRNAs and target genes. Training the classifier requires the negative samples indicating non-affected genes. The experiments verifying such negative associations have been rarely performed, so we exploit tissue-specific gene expression data to impute the negative associations. The recall rate of our method is above 70% (at precision 85%).

MeSH terms

  • Algorithms*
  • Base Sequence
  • Binding Sites
  • Data Mining / methods
  • Databases, Genetic
  • Machine Learning
  • MicroRNAs / genetics*
  • Molecular Sequence Data
  • Pattern Recognition, Automated / methods*
  • RNA, Messenger / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*

Substances

  • MicroRNAs
  • RNA, Messenger