Striking similarities in diverse telomerase proteins revealed by combining structure prediction and machine learning approaches

Pac Symp Biocomput. 2008:501-12.

Abstract

Telomerase is a ribonucleoprotein enzyme that adds telomeric DNA repeat sequences to the ends of linear chromosomes. The enzyme plays pivotal roles in cellular senescence and aging, and because it provides a telomere maintenance mechanism for approximately 90% of human cancers, it is a promising target for cancer therapy. Despite its importance, a high-resolution structure of the telomerase enzyme has been elusive, although a crystal structure of an N-terminal domain (TEN) of the telomerase reverse transcriptase subunit (TERT) from Tetrahymena has been reported. In this study, we used a comparative strategy, in which sequence-based machine learning approaches were integrated with computational structural modeling, to explore the potential conservation of structural and functional features of TERT in phylogenetically diverse species. We generated structural models of the N-terminal domains from human and yeast TERT using a combination of threading and homology modeling with the Tetrahymena TEN structure as a template. Comparative analysis of predicted and experimentally verified DNA and RNA binding residues, in the context of these structures, revealed significant similarities in nucleic acid binding surfaces of Tetrahymena and human TEN domains. In addition, the combined evidence from machine learning and structural modeling identified several specific amino acids that are likely to play a role in binding DNA or RNA, but for which no experimental evidence is currently available.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Artificial Intelligence
  • Binding Sites / genetics
  • Computational Biology
  • Computer Simulation
  • Conserved Sequence
  • DNA / chemistry
  • DNA / metabolism
  • Databases, Protein
  • Humans
  • Macromolecular Substances
  • Models, Molecular
  • Molecular Sequence Data
  • Protein Conformation
  • Protein Structure, Tertiary
  • RNA / chemistry
  • RNA / metabolism
  • Saccharomyces cerevisiae / enzymology
  • Saccharomyces cerevisiae / genetics
  • Sequence Homology, Amino Acid
  • Telomerase / chemistry*
  • Telomerase / genetics
  • Telomerase / metabolism
  • Tetrahymena / enzymology
  • Tetrahymena / genetics

Substances

  • Macromolecular Substances
  • RNA
  • DNA
  • Telomerase