Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 27;38(4):947-953.
doi: 10.1093/bioinformatics/btab761.

Deep graph learning of inter-protein contacts

Affiliations

Deep graph learning of inter-protein contacts

Ziwei Xie et al. Bioinformatics. .

Abstract

Motivation: Inter-protein (interfacial) contact prediction is very useful for in silico structural characterization of protein-protein interactions. Although deep learning has been applied to this problem, its accuracy is not as good as intra-protein contact prediction.

Results: We propose a new deep learning method GLINTER (Graph Learning of INTER-protein contacts) for interfacial contact prediction of dimers, leveraging a rotational invariant representation of protein tertiary structures and a pretrained language model of multiple sequence alignments. Tested on the 13th and 14th CASP-CAPRI datasets, the average top L/10 precision achieved by GLINTER is 54% on the homodimers and 52% on all the dimers, much higher than 30% obtained by the latest deep learning method DeepHomo on the homodimers and 15% obtained by BIPSPI on all the dimers. Our experiments show that GLINTER-predicted contacts help improve selection of docking decoys.

Availability and implementation: The software is available at https://github.com/zw2x/glinter. The datasets are available at https://github.com/zw2x/glinter/data.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Overview of the GLINTER architecture. L1and L2 are the lengths of the two protein chains, K is the number of channels in a CaConv layer and 144 is the total number of heads in the row attention weights generated by Facebook’s MSA Transformer (Rao et al., 2021)
Fig. 2.
Fig. 2.
Comparison of top-10 precision of three models: ESM, Residue+Atom+Surface and Residue+Atom+Surface+ESM. (A) compares Residue+Atom+Surface and ESM, (B) compares Residue+Atom+Surface+ESM and ESM, and (C) compares Residue+Atom+Surface and Residue+Atom+Surface+ESM.
Fig. 3.
Fig. 3.
The average quality (measured by TMscore) of the selected decoys by top predicted contacts. The x-axis is the number of top decoys selected. In the legend, ‘top-10’, ‘top-25’ and ‘top-50’ represent that top 10, 25 and 50 predicted contacts are used to select docking decoys, respectively. ‘best decoy’ indicates the quality of the best decoys generated by HDOCK

Similar articles

Cited by

References

    1. Baldassi C. et al. (2014) Fast and accurate multivariate Gaussian modeling of protein families: predicting residue contacts and protein-interaction partners. PLoS One, 9, e92721. - PMC - PubMed
    1. Bitbol A.-F. et al. (2016) Inferring interaction partners from protein sequences. Proc. Natl. Acad. Sci. USA, 113, 12180–12185. - PMC - PubMed
    1. Burger L., van Nimwegen E. (2008) Accurate prediction of protein–protein interactions from sequence alignments using a Bayesian method. Mol. Syst. Biol., 4, 165. - PMC - PubMed
    1. Cong Q. et al. (2019) Protein interaction networks revealed by proteome coevolution. Science, 365, 185–189. - PMC - PubMed
    1. Dai B., Bailey-Kellogg C. (2021) Protein interaction interface region prediction by geometric deep learning. Bioinformatics, 37, 2580–2588. - PMC - PubMed

Publication types