Mathematical model to predict regions of chromatin attachment to the nuclear matrix

Nucleic Acids Res. 1997 Apr 1;25(7):1419-25. doi: 10.1093/nar/25.7.1419.

Abstract

The potentiation and subsequent initiation of transcription are complex biological phenomena. The region of attachment of the chromatin fiber to the nuclear matrix, known as the matrix attachment region or scaffold attachment region (MAR or SAR), are thought to be requisite for the transcriptional regulation of the eukaryotic genome. As expressed sequences should be contained in these regions, it becomes significant to answer the following question: can these regions be identified from the primary sequence data alone and subsequently used as markers for expressed sequences? This paper represents an effort toward achieving this goal and describes a mathematical model for the detection of MARs. The location of matrix associated regions has been linked to a variety of sequence patterns. Consequently, a list of these patterns is compiled and represented as a set of decision rules using an AND-OR formulation. The DNA sequence was then searched for the presence of these patterns and a statistical significance was associated with the frequency of occurrence of the various patterns. Subsequently, a mathematical potential value,MAR-Potential, was assigned to a sequence region as the inverse proportion to the probability that the observed pattern population occurred at random. Such a MAR detection process was applied to the analysis of a variety of known MAR containing sequences. Regions of matrix association predicted by the software essentially correspond to those determined experimentally. The human T-cell receptor and the DNA sequence from the Drosophila bithorax region were also analyzed. This demonstrates the usefulness of the approach described as a means to direct experimental resources.

MeSH terms

  • Base Sequence
  • Binding Sites
  • Chromatin / metabolism*
  • Chromosome Mapping
  • DNA Topoisomerases, Type II / metabolism
  • Deoxyribonuclease I / metabolism
  • Globins / genetics
  • Humans
  • Models, Molecular
  • Models, Theoretical*
  • Molecular Sequence Data
  • Nuclear Matrix / metabolism*
  • Software

Substances

  • Chromatin
  • Globins
  • Deoxyribonuclease I
  • DNA Topoisomerases, Type II