Networks of motifs from sequences of symbols

Phys Rev Lett. 2010 Oct 22;105(17):178702. doi: 10.1103/PhysRevLett.105.178702. Epub 2010 Oct 19.

Abstract

We introduce a method to convert an ensemble of sequences of symbols into a weighted directed network whose nodes are motifs, while the directed links and their weights are defined from statistically significant co-occurences of two motifs in the same sequence. The analysis of communities of networks of motifs is shown to be able to correlate sequences with functions in the human proteome database, to detect hot topics from online social dialogs, to characterize trajectories of dynamical systems, and it might find other useful applications to process large amounts of data in various fields.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs*
  • Amino Acid Sequence*
  • Blogging
  • Humans
  • Nonlinear Dynamics
  • Proteome / chemistry
  • Social Support

Substances

  • Proteome