A hidden Markov model for predicting transmembrane helices in protein sequences
- PMID: 9783223
Abstract
A novel method to model and predict the location and orientation of alpha helices in membrane-spanning proteins is presented. It is based on a hidden Markov model (HMM) with an architecture that corresponds closely to the biological system. The model is cyclic, with seven types of states: helix core, helix caps on either side, loop on the cytoplasmic side, two loops for the non-cytoplasmic side, and a globular domain state in the middle of each loop. The two loop paths on the non-cytoplasmic side are used to model short and long loops separately, which corresponds biologically to the two known membrane insertion mechanisms. The close mapping between the biological and computational states allows us to infer which parts of the model architecture are important for capturing the information that encodes the membrane topology, and to gain a better understanding of the mechanisms and constraints involved. Models were estimated both by maximum likelihood and by a discriminative method, and a method for reassignment of the membrane helix boundaries was developed. In a cross-validated test on single sequences, our transmembrane HMM, TMHMM, correctly predicts the entire topology for 77% of the sequences in a standard dataset of 83 proteins with known topology. The same accuracy was achieved on a larger dataset of 160 proteins. These results compare favourably with existing methods.
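To illustrate how a cyclic HMM can yield both helix location and orientation, the following is a minimal sketch of Viterbi decoding over a much-reduced topology model. It is not the TMHMM architecture or its trained parameters: the four states (`i`, `Mio`, `o`, `Moi`), the three residue classes, and every probability in `START`, `TRANS` and `EMIT` are hypothetical values chosen for illustration. The enrichment of lysine and arginine in the cytoplasmic loop is an added assumption (the "positive-inside" tendency), not something stated in the abstract. The function name `viterbi_topology` is likewise illustrative.

```python
import math

# Toy cyclic model: inside loop -> helix -> outside loop -> helix -> inside loop.
# Two helix states are used so that the path remembers its direction of travel.
STATES = ["i", "Mio", "o", "Moi"]
LABEL = {"i": "i", "Mio": "M", "o": "o", "Moi": "M"}

# All numbers below are hypothetical, not parameters from the paper.
START = {"i": 0.5, "o": 0.5}
TRANS = {
    "i": {"i": 0.9, "Mio": 0.1},
    "Mio": {"Mio": 0.9, "o": 0.1},
    "o": {"o": 0.9, "Moi": 0.1},
    "Moi": {"Moi": 0.9, "i": 0.1},
}
EMIT = {
    "i": {"hydrophobic": 0.20, "positive": 0.30, "other": 0.50},
    "M": {"hydrophobic": 0.85, "positive": 0.02, "other": 0.13},
    "o": {"hydrophobic": 0.20, "positive": 0.05, "other": 0.75},
}


def residue_class(aa):
    """Collapse the 20 amino acids into three coarse classes (an assumption)."""
    if aa in "AILMFVWC":
        return "hydrophobic"
    if aa in "KR":
        return "positive"
    return "other"


def safe_log(p):
    """Natural log that maps zero probability to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi_topology(seq):
    """Return the most probable label string over {i, M, o} for a sequence."""
    if not seq:
        return ""
    obs = [residue_class(aa) for aa in seq.upper()]
    # Initialise with start probabilities and the first emission.
    score = {
        s: safe_log(START.get(s, 0.0)) + safe_log(EMIT[LABEL[s]][obs[0]])
        for s in STATES
    }
    back = []
    for symbol in obs[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            # Best predecessor of state s at this position.
            prev = max(
                STATES,
                key=lambda p: score[p] + safe_log(TRANS[p].get(s, 0.0)),
            )
            new_score[s] = (
                score[prev]
                + safe_log(TRANS[prev].get(s, 0.0))
                + safe_log(EMIT[LABEL[s]][symbol])
            )
            pointers[s] = prev
        back.append(pointers)
        score = new_score
    # Trace back from the best final state.
    state = max(STATES, key=score.get)
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return "".join(LABEL[s] for s in reversed(path))


if __name__ == "__main__":
    print(viterbi_topology("KRKRKR" + "L" * 20 + "SSSSSS"))
```

Because the only allowed transitions follow the cycle, a decoded path can never jump from `i` to `o` without passing through a helix, which is how the cyclic design ties helix location to orientation. A fuller model in the spirit of the abstract would add cap states, separate short and long non-cytoplasmic loops, and globular states, with parameters estimated from data.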