Genomes are covered with ubiquitous 11 bp periodic patterns, the "class A flexible patterns"

BMC Bioinformatics. 2005 Aug 24:6:206. doi: 10.1186/1471-2105-6-206.

Abstract

Background: The genomes of prokaryotes and lower eukaryotes display a very strong 11 bp periodic bias in the distribution of their nucleotides. This bias is present throughout a given genome, both in coding and non-coding sequences. Until now this bias remained of unknown origin.

Results: Using a technique for analysis of auto-correlations based on linear projection, we identified the sequences responsible for the bias. Prokaryotic and lower eukaryotic genomes are covered with ubiquitous patterns that we termed "class A flexible patterns". Each pattern is composed of up to ten conserved nucleotides or dinucleotides distributed into a discontinuous motif. Each occurrence spans a region up to 50 bp in length. They belong to what we named the "flexible pattern" type, in that there is some limited fluctuation in the distances between the nucleotides composing each occurrence of a given pattern. When taken together, these patterns cover up to half of the genome in the majority of prokaryotes. They generate the previously recognized 11 bp periodic bias.

Conclusion: Judging from the structure of the patterns, we suggest that they may define a dense network of protein interaction sites in chromosomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computers, Molecular
  • DNA / chemistry
  • DNA / genetics
  • Eukaryotic Cells / chemistry
  • Eukaryotic Cells / metabolism
  • Genome*
  • Models, Genetic
  • Nucleic Acid Conformation
  • Nucleotide Mapping*
  • Pattern Recognition, Automated / methods*
  • Periodicity
  • Probability
  • Prokaryotic Cells / chemistry
  • Prokaryotic Cells / metabolism
  • Repetitive Sequences, Nucleic Acid*
  • Sequence Alignment / methods*

Substances

  • DNA