Alternate measure of information useful for DNA sequences

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 May;83(5 Pt 1):051918. doi: 10.1103/PhysRevE.83.051918. Epub 2011 May 20.

Abstract

We propose an alternate measure of information, called superinformation, which has been found to be very effective for analyzing the coding and noncoding regions of the DNA. This superinformation is actually a measure of the "randomness of randomness." It has been found to be highly accurate in classifying coding and noncoding regions of human DNA. In the proposed method, no prior training is required. This technique exhibits higher accuracy than previously reported techniques in distinguishing between the coding and the noncoding portions of the DNA. Superinformation can also be used to analyze the untranslated regions in various genes.

MeSH terms

  • Base Sequence
  • Computational Biology / methods*
  • DNA / genetics*
  • Humans
  • Stochastic Processes
  • Untranslated Regions / genetics

Substances

  • Untranslated Regions
  • DNA