Sequence coevolution between RNA and protein characterized by mutual information between residue triplets

PLoS One. 2012;7(1):e30022. doi: 10.1371/journal.pone.0030022. Epub 2012 Jan 18.

Abstract

Coevolving residues in a multiple sequence alignment provide evolutionary clues of biophysical interactions in 3D structure. Despite a rich literature describing amino acid coevolution within or between proteins and nucleic acid coevolution within RNA, to date there has been no direct evidence of coevolution between protein and RNA. The ribosome, a structurally conserved macromolecular machine composed of over 50 interacting protein and RNA chains, provides a natural example of RNA/protein interactions that likely coevolved. We provide the first direct evidence of RNA/protein coevolution by characterizing the mutual information in residue triplets from a multiple sequence alignment of ribosomal protein L22 and neighboring 23S RNA. We define residue triplets as three positions in the multiple sequence alignment, where one position is from the 23S RNA and two positions are from the L22 protein. We show that residue triplets with high mutual information are more likely than residue doublets to be proximal in 3D space. Some high mutual information residue triplets cluster in a connected series across the L22 protein structure, similar to patterns seen in protein coevolution. We also describe RNA nucleotides for which switching from one nucleotide to another (or between purines and pyrimidines) results in a change in amino acid distribution for proximal amino acid positions. Multiple crystal structures for evolutionarily distinct ribosome species can provide structural evidence for these differences. For one residue triplet, a pyrimidine in one species is a purine in another, and RNA/protein hydrogen bonds are present in one species but not the other. The results provide the first direct evidence of RNA/protein coevolution by using higher order mutual information, suggesting that biophysical constraints on interacting RNA and protein chains are indeed a driving force in their evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Amino Acids / genetics
  • Codon / chemistry
  • Codon / genetics*
  • Entropy
  • Evolution, Molecular*
  • Models, Molecular
  • Nucleic Acid Conformation
  • Protein Conformation
  • Proteins / chemistry
  • Proteins / genetics*
  • RNA / chemistry
  • RNA / genetics*
  • RNA, Ribosomal, 23S / chemistry
  • RNA, Ribosomal, 23S / genetics
  • Ribosomal Proteins / chemistry
  • Ribosomal Proteins / genetics

Substances

  • Amino Acids
  • Codon
  • Proteins
  • RNA, Ribosomal, 23S
  • Ribosomal Proteins
  • RNA