Integrating Phylogenetics With Intron Positions Illuminates the Origin of the Complex Spliceosome

Mol Biol Evol. 2023 Jan 4;40(1):msad011. doi: 10.1093/molbev/msad011.

Abstract

Eukaryotic genes are characterized by the presence of introns that are removed from pre-mRNA by a spliceosome. This ribonucleoprotein complex is comprised of multiple RNA molecules and over a hundred proteins, which makes it one of the most complex molecular machines that originated during the prokaryote-to-eukaryote transition. Previous works have established that these introns and the spliceosomal core originated from self-splicing introns in prokaryotes. Yet, how the spliceosomal core expanded by recruiting many additional proteins remains largely elusive. In this study, we use phylogenetic analyses to infer the evolutionary history of 145 proteins that we could trace back to the spliceosome in the last eukaryotic common ancestor. We found that an overabundance of proteins derived from ribosome-related processes was added to the prokaryote-derived core. Extensive duplications of these proteins substantially increased the complexity of the emerging spliceosome. By comparing the intron positions between spliceosomal paralogs, we infer that most spliceosomal complexity postdates the spread of introns through the proto-eukaryotic genome. The reconstruction of early spliceosomal evolution provides insight into the driving forces behind the emergence of complexes with many proteins during eukaryogenesis.

Keywords: eukaryogenesis; introns; spliceosome.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Eukaryota / genetics
  • Evolution, Molecular
  • Introns
  • Phylogeny
  • RNA Splicing*
  • Spliceosomes* / genetics

Associated data

  • figshare/10.6084/m9.figshare.20653575