CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations
- PMID: 25064567
- PMCID: PMC4201158
- DOI: 10.1093/bioinformatics/btu500
CCMpred--fast and precise prediction of protein residue-residue contacts from correlated mutations
Abstract
Motivation: Recent breakthroughs in protein residue-residue contact prediction have made reliable de novo prediction of protein structures possible. The key was to apply statistical methods that can distinguish direct couplings between pairs of columns in a multiple sequence alignment from merely correlated pairs, i.e. to separate direct from indirect effects. Two classes of such methods exist, either relying on regularized inversion of the covariance matrix or on pseudo-likelihood maximization (PLM). Although PLM-based methods offer clearly higher precision, available tools are not sufficiently optimized and are written in interpreted languages that introduce additional overheads. This impedes the runtime and large-scale contact prediction for larger protein families, multi-domain proteins and protein-protein interactions.
Results: Here we introduce CCMpred, our performance-optimized PLM implementation in C and CUDA C. Using graphics cards in the price range of current six-core processors, CCMpred can predict contacts for typical alignments 35-113 times faster and with the same precision as the most accurate published methods. For users without a CUDA-capable graphics card, CCMpred can also run in a CPU mode that is still 4-14 times faster. Thanks to our speed-ups (http://dictionary.cambridge.org/dictionary/british/speed-up) contacts for typical protein families can be predicted in 15-60 s on a consumer-grade GPU and 1-6 min on a six-core CPU.
Availability and implementation: CCMpred is free and open-source software under the GNU Affero General Public License v3 (or later) available at https://bitbucket.org/soedinglab/ccmpred.
© The Author 2014. Published by Oxford University Press.
Figures
Similar articles
-
MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins.Bioinformatics. 2015 Apr 1;31(7):999-1006. doi: 10.1093/bioinformatics/btu791. Epub 2014 Nov 26. Bioinformatics. 2015. PMID: 25431331 Free PMC article.
-
bbcontacts: prediction of β-strand pairing from direct coupling patterns.Bioinformatics. 2015 Jun 1;31(11):1729-37. doi: 10.1093/bioinformatics/btv041. Epub 2015 Jan 23. Bioinformatics. 2015. PMID: 25618863
-
The evolution of contact prediction: evidence that contact selection in statistical contact prediction is changing.Bioinformatics. 2020 Mar 1;36(6):1750-1756. doi: 10.1093/bioinformatics/btz816. Bioinformatics. 2020. PMID: 31693112
-
High precision in protein contact prediction using fully convolutional neural networks and minimal sequence features.Bioinformatics. 2018 Oct 1;34(19):3308-3315. doi: 10.1093/bioinformatics/bty341. Bioinformatics. 2018. PMID: 29718112 Free PMC article.
-
Predicting accurate contacts in thousands of Pfam domain families using PconsC3.Bioinformatics. 2017 Sep 15;33(18):2859-2866. doi: 10.1093/bioinformatics/btx332. Bioinformatics. 2017. PMID: 28535189
Cited by
-
Accurate contact predictions using covariation techniques and machine learning.Proteins. 2016 Sep;84 Suppl 1(Suppl Suppl 1):145-51. doi: 10.1002/prot.24863. Epub 2015 Aug 14. Proteins. 2016. PMID: 26205532 Free PMC article.
-
Harnessing generative AI to decode enzyme catalysis and evolution for enhanced engineering.Natl Sci Rev. 2023 Dec 28;10(12):nwad331. doi: 10.1093/nsr/nwad331. eCollection 2023 Dec. Natl Sci Rev. 2023. PMID: 38299119 Free PMC article. Review.
-
Applying PyRosetta molecular energies to separate properly oriented protein models from mirror models, obtained from contact maps.J Mol Model. 2016 May;22(5):111. doi: 10.1007/s00894-016-2975-3. Epub 2016 Apr 23. J Mol Model. 2016. PMID: 27107578 Free PMC article.
-
Detecting distant-homology protein structures by aligning deep neural-network based contact maps.PLoS Comput Biol. 2019 Oct 17;15(10):e1007411. doi: 10.1371/journal.pcbi.1007411. eCollection 2019 Oct. PLoS Comput Biol. 2019. PMID: 31622328 Free PMC article.
-
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.Bioinformatics. 2020 Apr 1;36(7):2105-2112. doi: 10.1093/bioinformatics/btz863. Bioinformatics. 2020. PMID: 31738385 Free PMC article.
References
-
- Dunn SD, et al. Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction. Bioinformatics. 2008;24:333–340. - PubMed
-
- Ekeberg M, et al. Improved contact prediction in proteins: using pseudolikelihoods to infer potts models. Phys. Rev. E. 2013;87:012707. - PubMed
-
- Jones DT, et al. PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments. Bioinformatics. 2012;28:184–190. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources
