A novel method for GPCR recognition and family classification from sequence alone using signatures derived from profile hidden Markov models

SAR QSAR Environ Res. 2003 Oct-Dec;14(5-6):413-20. doi: 10.1080/10629360310001623999.

Abstract

G-protein coupled receptors (GPCRs) constitute a broad class of cell-surface receptors, including several functionally distinct families, that play a key role in cellular signalling and regulation of basic physiological processes. GPCRs are the focus of a significant amount of current pharmaceutical research since they interact with more than 50% of prescription drugs, whereas they still comprise the best potential targets for drug design. Taking into account the excess of data derived by genome sequencing projects, the use of computational tools for automated characterization of novel GPCRs is imperative. Typical computational strategies for identifying and classifying GPCRs involve sequence similarity searches (e.g. BLAST) coupled with pattern database analysis (e.g. PROSITE, BLOCKS). The diagnostic method presented here is based on a probabilistic approach that exploits highly discriminative profile Hidden Markov Models, excised from low entropy regions of multiple sequence alignments, to derive potent family signatures. For a given query, a P-value is obtained, combining individual hits derived from the same family. Hence a best-guess family membership is depicted, allowing GPCRs' classification at a family level, solely using primary structure information. A web-based version of the application is freely available at URL: http:/bioinformatics.biol.uoa.gr/PRED-GPCR.

MeSH terms

  • Databases, Factual
  • Drug Design*
  • Drug Prescriptions
  • Forecasting
  • Humans
  • Internet
  • Markov Chains
  • Quantitative Structure-Activity Relationship
  • Receptors, G-Protein-Coupled / physiology*
  • Sequence Analysis, DNA*
  • Signal Transduction

Substances

  • Receptors, G-Protein-Coupled