A structure-based Multiple-Instance Learning approach to predicting in vitro transcription factor-DNA interaction

BMC Genomics. 2015;16 Suppl 4(Suppl 4):S3. doi: 10.1186/1471-2164-16-S4-S3. Epub 2015 Apr 21.

Abstract

Background: Understanding the mechanism of transcriptional regulation remains an inspiring stage of molecular biology. Recently, in vitro protein-binding microarray experiments have greatly improved the understanding of transcription factor-DNA interaction. We present a method - MIL3D - which predicts in vitro transcription factor binding by multiple-instance learning with structural properties of DNA.

Results: Evaluation on in vitro data of twenty mouse transcription factors shows that our method outperforms a method based on simple-instance learning with DNA structural properties, and the widely used k-mer counting method, for nineteen out of twenty of the transcription factors. Our analysis showed that the MIL3D approach can utilize subtle structural similarities when a strong sequence consensus is not available.

Conclusion: Combining multiple-instance learning and structural properties of DNA has promising potential for studying biological regulatory networks.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Animals
  • Artificial Intelligence
  • Computational Biology / methods*
  • DNA / chemistry*
  • DNA / metabolism*
  • In Vitro Techniques
  • Mice
  • Protein Binding
  • Transcription Factors / chemistry
  • Transcription Factors / metabolism*

Substances

  • Transcription Factors
  • DNA