Systematic analysis of human kinase genes: a large number of genes and alternative splicing events result in functional and structural diversity

BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S20. doi: 10.1186/1471-2105-6-S4-S20.

Abstract

Background: Protein kinases are a well defined family of proteins, characterized by the presence of a common kinase catalytic domain and playing a significant role in many important cellular processes, such as proliferation, maintenance of cell shape, apoptosis. In many members of the family, additional non-kinase domains contribute further specialization, resulting in subcellular localization, protein binding and regulation of activity, among others. About 500 genes encode members of the kinase family in the human genome, and although many of them represent well known genes, a larger number of genes code for proteins of more recent identification, or for unknown proteins identified as kinase only after computational studies.

Results: A systematic in silico study performed on the human genome, led to the identification of 5 genes, on chromosome 1, 11, 13, 15 and 16 respectively, and 1 pseudogene on chromosome X; some of these genes are reported as kinases from NCBI but are absent in other databases, such as KinBase. Comparative analysis of 483 gene regions and subsequent computational analysis, aimed at identifying unannotated exons, indicates that a large number of kinase may code for alternately spliced forms or be incorrectly annotated. An InterProScan automated analysis was performed to study domain distribution and combination in the various families. At the same time, other structural features were also added to the annotation process, including the putative presence of transmembrane alpha helices, and the cystein propensity to participate into a disulfide bridge.

Conclusion: The predicted human kinome was extended by identifying both additional genes and potential splice variants, resulting in a varied panorama where functionality may be searched at the gene and protein level. Structural analysis of kinase proteins domains as defined in multiple sources together with transmembrane alpha helices and signal peptide prediction provides hints to function assignment. The results of the human kinome analysis are collected in the KinWeb database, available for browsing and searching over the internet, where all results from the comparative analysis and the gene structure annotation are made available, alongside the domain information. Kinases may be searched by domain combinations and the relative genes may be viewed in a graphic browser at various level of magnification up to gene organization on the full chromosome set.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Alternative Splicing*
  • Amino Acid Motifs
  • Animals
  • Apoptosis
  • Cell Proliferation
  • Chromosome Mapping
  • Computational Biology / methods*
  • Databases, Genetic
  • Databases, Protein
  • Disulfides
  • Exons
  • Gene Expression Regulation, Enzymologic*
  • Genetic Variation
  • Genome
  • Humans
  • Internet
  • Mice
  • Models, Genetic
  • Molecular Sequence Data
  • Multigene Family
  • Phosphotransferases / genetics*
  • Protein Structure, Tertiary
  • Proteome
  • Sequence Analysis
  • Software
  • Structure-Activity Relationship

Substances

  • Disulfides
  • Proteome
  • Phosphotransferases