Identifying RNA-binding residues based on evolutionary conserved structural and energetic features

Nucleic Acids Res. 2014 Feb;42(3):e15. doi: 10.1093/nar/gkt1299. Epub 2013 Dec 16.

Abstract

Increasing numbers of protein structures are solved each year, but many of these structures belong to proteins whose sequences are homologous to sequences in the Protein Data Bank. Nevertheless, the structures of homologous proteins belonging to the same family contain useful information because functionally important residues are expected to preserve physico-chemical, structural and energetic features. This information forms the basis of our method, which detects RNA-binding residues of a given RNA-binding protein as those residues that preserve physico-chemical, structural and energetic features in its homologs. Tests on 81 RNA-bound and 35 RNA-free protein structures showed that our method yields a higher fraction of true RNA-binding residues (higher precision) than two structure-based and two sequence-based machine-learning methods. Because the method requires no training data set and has no parameters, its precision does not degrade when applied to 'novel' protein sequences unlike methods that are parameterized for a given training data set. It was used to predict the 'unknown' RNA-binding residues in the C-terminal RNA-binding domain of human CPEB3. The two predicted residues, F430 and F474, were experimentally verified to bind RNA, in particular F430, whose mutation to alanine or asparagine nearly abolished RNA binding. The method has been implemented in a webserver called DR_bind1, which is freely available with no login requirement at http://drbind.limlab.ibms.sinica.edu.tw.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acids / chemistry*
  • Binding Sites
  • DNA-Binding Proteins / chemistry
  • Evolution, Molecular
  • Humans
  • Protein Binding
  • Protein Conformation
  • RNA / chemistry
  • RNA / metabolism
  • RNA-Binding Proteins / chemistry*
  • RNA-Binding Proteins / metabolism
  • Software
  • Static Electricity

Substances

  • Amino Acids
  • DNA-Binding Proteins
  • RNA-Binding Proteins
  • RNA