Alternate measure of information useful for DNA sequences

Ranjan Bose; Sonali Chouhan

doi:10.1103/PhysRevE.83.051918

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 May;83(5 Pt 1):051918. doi: 10.1103/PhysRevE.83.051918. Epub 2011 May 20.

Authors

Ranjan Bose¹, Sonali Chouhan

Affiliation

¹ Department of Electrical Engineering, IIT Delhi, Hauz Khas, New Delhi, India.

PMID: 21728582

Abstract

We propose an alternate measure of information, called superinformation, for analyzing the coding and noncoding regions of DNA. Superinformation is a measure of the "randomness of randomness." It requires no prior training and classifies coding and noncoding regions of human DNA with higher accuracy than previously reported techniques. Superinformation can also be used to analyze the untranslated regions in various genes.
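
The abstract does not give the formula for superinformation, so the following Python sketch is only one possible reading of "randomness of randomness" and should not be taken as the authors' method. It assumes the sequence is cut into non-overlapping blocks, the Shannon entropy of each block is computed, those entropy values are quantized into a few levels, and the entropy of the resulting level distribution is reported. The function names and the `window` and `bins` parameters are hypothetical choices made for illustration.

```python
import math
from collections import Counter


def shannon_entropy(symbols):
    """Shannon entropy in bits of the empirical distribution of `symbols`."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def superinformation_sketch(seq, window=4, bins=4):
    """Entropy of the quantized per-block entropies of `seq`.

    Assumption: this two-level construction is an illustrative guess at
    "randomness of randomness", not the definition used in the paper.
    """
    if window < 2:
        raise ValueError("window must be at least 2")
    # Non-overlapping blocks; a trailing partial block is dropped.
    blocks = [seq[i:i + window] for i in range(0, len(seq) - window + 1, window)]
    entropies = [shannon_entropy(b) for b in blocks]
    # Largest entropy a block over the 4-letter DNA alphabet can reach.
    h_max = math.log2(min(4, window))
    # Quantize each block entropy into one of `bins` levels.
    levels = [min(int(h / h_max * bins), bins - 1) for h in entropies]
    return shannon_entropy(levels)
```

Under these assumptions, a sequence whose blocks all look alike (for example `"AAAAAAAA"`) scores 0, while a sequence that mixes uniform and highly varied blocks (for example `"ACGTAAAA"`) scores higher, because the block entropies themselves are spread over several levels.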

MeSH terms

Base Sequence
Computational Biology / methods*
DNA / genetics*
Humans
Stochastic Processes
Untranslated Regions / genetics

Substances

Untranslated Regions
DNA