SAXSDom: Modeling multidomain protein structures using small-angle X-ray scattering data

Proteins. 2020 Jun;88(6):775-787. doi: 10.1002/prot.25865. Epub 2019 Dec 27.

Abstract

Many proteins are composed of several domains that pack together into a complex tertiary structure. Multidomain proteins can be challenging for protein structure modeling, particularly those for which templates can be found for the individual domains but not for the entire sequence. In such cases, homology modeling can generate high-quality models of the domains but not of the orientations between them. Small-angle X-ray scattering (SAXS) reports the structural properties of entire proteins and has the potential to guide homology modeling of multidomain proteins. In this article, we describe a novel multidomain protein assembly modeling method, SAXSDom, that integrates experimental knowledge from SAXS with a probabilistic Input-Output Hidden Markov Model to assemble the structures of individual domains. Four SAXS-based scoring functions were developed and tested, and the method was evaluated on multidomain proteins from two public datasets. Incorporation of SAXS information improved the accuracy of domain assembly for 40 out of 46 Critical Assessment of protein Structure Prediction (CASP) multidomain protein targets and 45 out of 73 multidomain protein targets from the ab initio domain assembly dataset. The results demonstrate that SAXS data can provide useful information to improve the accuracy of domain-domain assembly. The source code and tool packages are available at https://github.com/jianlin-cheng/SAXSDom.
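
To illustrate how a SAXS profile can score a candidate domain arrangement, the following is a minimal sketch, not one of the four SAXSDom scoring functions, which the abstract does not specify. It assumes a coarse-grained model with one point scatterer per residue and a constant form factor, computes the theoretical profile with the Debye formula, and compares it with an experimental profile through a reduced chi value after least-squares scaling. The function names `debye_intensity` and `chi_score` are hypothetical.

```python
import numpy as np


def debye_intensity(coords, q_values, form_factor=1.0):
    """Debye formula: I(q) = sum_i sum_j f_i f_j sin(q r_ij) / (q r_ij).

    Assumption: every scatterer shares the same constant form factor.
    """
    coords = np.asarray(coords, dtype=float)
    q_values = np.asarray(q_values, dtype=float)
    # Pairwise distance matrix r_ij, shape (N, N).
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    # q * r_ij for every q value, shape (Q, N, N).
    qr = q_values[:, None, None] * dist[None, :, :]
    # np.sinc(x) = sin(pi x) / (pi x), so sinc(qr / pi) = sin(qr) / qr,
    # and it evaluates to 1 where qr == 0.
    return form_factor ** 2 * np.sinc(qr / np.pi).sum(axis=(1, 2))


def chi_score(i_model, i_exp, sigma):
    """Chi between model and experimental profiles after optimal scaling."""
    i_model = np.asarray(i_model, dtype=float)
    i_exp = np.asarray(i_exp, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    weights = 1.0 / sigma ** 2
    # Least-squares scale factor c that minimizes sum(((i_exp - c*i_model)/sigma)^2).
    scale = np.sum(weights * i_model * i_exp) / np.sum(weights * i_model ** 2)
    residuals = (i_exp - scale * i_model) / sigma
    return float(np.sqrt(np.mean(residuals ** 2)))
```

In a domain assembly search, a score of this kind would be evaluated for each sampled inter-domain orientation, with lower values indicating better agreement with the measured scattering curve.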

Keywords: CASP; SAXS; domain assembly; machine learning; probabilistic model; protein structure; small-angle X-ray scattering.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bacterial Proteins / chemistry*
  • Bacterial Proteins / genetics
  • Bacterial Proteins / metabolism
  • Binding Sites
  • Caspases / chemistry*
  • Caspases / genetics
  • Caspases / metabolism
  • Crystallography, X-Ray
  • Escherichia coli / chemistry
  • Escherichia coli Proteins / chemistry*
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism
  • Humans
  • Markov Chains
  • Membrane Proteins / chemistry*
  • Membrane Proteins / genetics
  • Membrane Proteins / metabolism
  • Models, Molecular
  • Monte Carlo Method
  • Protein Binding
  • Protein Conformation, alpha-Helical
  • Protein Conformation, beta-Strand
  • Protein Interaction Domains and Motifs
  • Protein Structure, Tertiary
  • Rhodobacter capsulatus / chemistry
  • Scattering, Small Angle
  • Software*
  • Structural Homology, Protein
  • Thermodynamics
  • X-Ray Diffraction

Substances

  • Bacterial Proteins
  • Escherichia coli Proteins
  • FtsA protein, E coli
  • Membrane Proteins
  • PutA protein, Bacteria
  • Caspases