Functional 5' UTR motif discovery with LESMoN: Local Enrichment of Sequence Motifs in biological Networks

Nucleic Acids Res. 2017 Oct 13;45(18):10415-10427. doi: 10.1093/nar/gkx751.

Abstract

Biological networks are rich representations of the relationships between entities such as genes or proteins, and they have become increasingly complete thanks to high-throughput experimental network mapping approaches. Here, we propose a method that uses such networks to guide the search for functional sequence motifs. Specifically, we introduce Local Enrichment of Sequence Motifs in biological Networks (LESMoN), an enumerative motif discovery algorithm that identifies 5' untranslated region (UTR) sequence motifs whose associated proteins form unexpectedly dense clusters in a given biological network. When applied to the human protein-protein interaction network from BioGRID, LESMoN identifies several highly significant 5' UTR sequence motifs, including both previously known motifs and uncharacterized ones. The vast majority of these motifs are evolutionarily conserved, and the genes containing them are significantly enriched for various Gene Ontology terms, suggesting new associations between 5' UTR motifs and a number of biological processes. We validate in vivo the role of three motifs identified by LESMoN in regulating protein expression.
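
The general idea of network-guided enumerative motif discovery can be illustrated with a minimal sketch. It is not the published LESMoN implementation: the abstract does not specify the clustering statistic or the null model, so the sketch assumes a simple stand-in (the fraction of directly interacting protein pairs, compared against random protein sets of the same size). All names here (`kmers`, `edge_density`, `motif_enrichment`, `min_support`, `n_random`) are hypothetical, and motifs are restricted to exact k-mers for brevity.

```python
import random
from itertools import combinations


def kmers(seq, k):
    """Return the set of distinct k-mers found in a sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def edge_density(proteins, network):
    """Fraction of protein pairs that interact directly.

    `network` is assumed to be a symmetric adjacency map:
    protein -> set of interacting proteins.
    """
    pairs = list(combinations(sorted(proteins), 2))
    if not pairs:
        return 0.0
    linked = sum(1 for a, b in pairs if b in network.get(a, set()))
    return linked / len(pairs)


def motif_enrichment(utrs, network, k=4, min_support=3, n_random=1000, seed=0):
    """Score every k-mer by how densely its proteins cluster in the network.

    Returns {kmer: (observed_density, empirical_p_value)}.
    The density statistic and the random-set null model are assumptions
    made for illustration, not the statistic used by LESMoN.
    """
    rng = random.Random(seed)

    # Enumerate motifs: map each k-mer to the proteins whose 5' UTR contains it.
    support = {}
    for protein, seq in utrs.items():
        for kmer in kmers(seq, k):
            support.setdefault(kmer, set()).add(protein)

    all_proteins = sorted(utrs)
    results = {}
    for kmer, proteins in support.items():
        if len(proteins) < min_support:
            continue  # too few proteins to assess clustering
        observed = edge_density(proteins, network)
        # Monte Carlo null: random protein sets of the same size.
        hits = sum(
            1
            for _ in range(n_random)
            if edge_density(rng.sample(all_proteins, len(proteins)), network)
            >= observed
        )
        results[kmer] = (observed, (hits + 1) / (n_random + 1))
    return results
```

A realistic analysis would also need degenerate motifs, correction for multiple testing across all enumerated motifs, and a null model that accounts for network degree; none of these is attempted here.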

Publication types

  • Validation Study

MeSH terms

  • 5' Untranslated Regions / genetics*
  • Algorithms*
  • Binding Sites / genetics
  • Computational Biology / methods*
  • Gene Expression Regulation*
  • Gene Ontology
  • Gene Regulatory Networks*
  • Genetic Association Studies
  • Humans
  • Mutation
  • Protein Interaction Maps / genetics
  • Regulatory Elements, Transcriptional*
  • Transcription Factors / metabolism

Substances

  • 5' Untranslated Regions
  • Transcription Factors