Bidirectional Long Short-Term Memory Networks for predicting the subcellular localization of eukaryotic proteins

IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):441-446. doi: 10.1109/tcbb.2007.1015.

Abstract

An algorithm called Bidirectional Long Short-Term Memory Networks (BLSTM) for processing sequential data is introduced. This supervised learning method trains a special recurrent neural network to use very long ranged symmetric sequence context using a combination of nonlinear processing elements and linear feedback loops for storing long-range context. The algorithm is applied to the sequence-based prediction of protein localization and predicts 93.3 percent novel non-plant proteins and 88.4 percent novel plant proteins correctly, which is an improvement over feedforward and standard recurrent networks solving the same problem. The BLSTM system is available as a web-service (http://www.stepc.gr/~synaptic/blstm.html).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Molecular Sequence Data
  • Neural Networks, Computer*
  • Proteome / chemistry*
  • Proteome / metabolism*
  • Sequence Alignment / methods*
  • Sequence Analysis, Protein / methods*
  • Structure-Activity Relationship
  • Subcellular Fractions / metabolism*

Substances

  • Proteome