Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Jul 1;34(Web Server issue):W350-5.
doi: 10.1093/nar/gkl159.

DILIMOT: Discovery of Linear Motifs in Proteins

Affiliations
Free PMC article

DILIMOT: Discovery of Linear Motifs in Proteins

Victor Neduva et al. Nucleic Acids Res. .
Free PMC article

Abstract

Discovery of protein functional motifs is critical in modern biology. Small segments of 3-10 residues play critical roles in protein interactions, post-translational modifications and trafficking. DILIMOT (DIscovery of LInear MOTifs) is a server for the prediction of these short linear motifs within a set of proteins. Given a set of sequences sharing a common functional feature (e.g. interaction partner or localization) the method finds statistically over-represented motifs likely to be responsible for it. The input sequences are first passed through a set of filters to remove regions unlikely to contain instances of linear motifs. Motifs are then found in the remaining sequence and ranked according to a statistic that measure over-representation and conservation across homologues in related species. The results are displayed via a visual interface for easy perusal. The server is available at http://dilimot.embl.de.

Figures

Figure 1
Figure 1
The server process and output. (A) Schematic showing how submitted sequences are filtered, motifs found and arranged into a ranked list sorted by P (left). When the species is provided, sequences are assigned to the orthologous groups, species–specific probabilities for over-represented motifs are calculated (coloured box) the list resorted by SCONS (right). (B) Example of server output. A list of putative motifs is reported in an interactive table (left), which gives general details for each of them. Clicking on each motif launches an additional page (right) showing sequences containing the motif, where the motif is found in them and the degree to which the motif is conserved in related species. Motif locations (red bars) and other features found in the sequences, such as domains, are shown graphically and detailed below each image.
Figure 2
Figure 2
The EB1 motif SxIP detected by the server. (A) A sequence logo (27) for the EB1 binding motif, generated using all instances of the motif in the input set. (B) Examples of EB1 binding proteins from the input set (represented as boxes) and multiple alignments of putative motif containing regions. Dark blue regions in the boxes denote those removed by the domain and redundancy filters. A known EB1 binding region (in APC) lies at the C-terminus of a Pfam domain. To avoid its removal, we simply cut the sequence down to this region alone (switching the Pfam filter off will have similar effect). Sequences for the motif-containing region are shown aligned to the best homologues in closely related species. Amino acids in the alignments are coloured according to residue type: blue, positive; red, negative; light-blue, small; yellow, hydrophobic; green, aromatic; magenta, polar; Proline, orange. Positions within the predicted motif are denoted by red triangles. Species abbreviations: Hsa, H.sapiens; Mmu, M.musculus; Rno, R.norwegicus; Gga, G.gallus; Fru, F.rubripes; Cgi, Candida glabrata; Kla, Kluyveromyces lactis; Kwa, Kluyveromyces waltii; Ego, Eremothecium gossypii; Sce, Saccharomyces cerevisiae; Dha, Debaryomyces hansenii.
Figure 3
Figure 3
Features of known linear motifs. (A) Distributions of length (red), number of specified (i.e. non-‘x’; green) and invariant (i.e. a single specific residue; blue) positions for 120 known linear motifs extracted from the ELM database (7). Note that four motifs with lengths of 13–18 are not shown in the first (red) plot for clarity. (B) Degree to which residues are over-represented in known motifs. Numbers show the ratio of the abundance of the residue within the 120 motifs from ELM to the abundance in globular domains as computed from the protein databank [PDB; (28)]. ‘ALL’ includes all 120, ‘LIG’ are the 66 ligand binding, ‘TRG’ the 16 targeting and ‘MOD’ the 30 modification site motifs. For 7 of 40 residues in the latter two categories there were too few counts to obtain a confident measurement (i.e. <5); these are denoted by an asterix. Note that we have not included a fourth ELM category CLV, which includes protein cleavage sites, as there were too few examples to compute meaningful numbers. Colour scheme: red, strongly favoured in linear motifs compared to globular proteins; orange, moderately favoured; light-blue moderately disfavoured; blue strongly disfavoured.

Similar articles

See all similar articles

Cited by 53 articles

  • An intrinsically disordered proteins community for ELIXIR.
    Davey NE, Babu MM, Blackledge M, Bridge A, Capella-Gutierrez S, Dosztanyi Z, Drysdale R, Edwards RJ, Elofsson A, Felli IC, Gibson TJ, Gutmanas A, Hancock JM, Harrow J, Higgins D, Jeffries CM, Le Mercier P, Mészáros B, Necci M, Notredame C, Orchard S, Ouzounis CA, Pancsa R, Papaleo E, Pierattelli R, Piovesan D, Promponas VJ, Ruch P, Rustici G, Romero P, Sarntivijai S, Saunders G, Schuler B, Sharan M, Shields DC, Sussman JL, Tedds JA, Tompa P, Turewicz M, Vondrasek J, Vranken WF, Wallace BA, Wichapong K, Tosatto SCE. Davey NE, et al. F1000Res. 2019 Oct 15;8:ELIXIR-1753. doi: 10.12688/f1000research.20136.1. eCollection 2019. F1000Res. 2019. PMID: 31824649 Free PMC article.
  • Resources to Discover and Use Short Linear Motifs in Viral Proteins.
    Hraber P, O'Maille PE, Silberfarb A, Davis-Anderson K, Generous N, McMahon BH, Fair JM. Hraber P, et al. Trends Biotechnol. 2020 Jan;38(1):113-127. doi: 10.1016/j.tibtech.2019.07.004. Epub 2019 Aug 16. Trends Biotechnol. 2020. PMID: 31427097 Free PMC article. Review.
  • The carboxy-terminus, a key regulator of protein function.
    Sharma S, Schiller MR. Sharma S, et al. Crit Rev Biochem Mol Biol. 2019 Apr;54(2):85-102. doi: 10.1080/10409238.2019.1586828. Epub 2019 May 20. Crit Rev Biochem Mol Biol. 2019. PMID: 31106589 Free PMC article. Review.
  • Computational and experimental analysis of bioactive peptide linear motifs in the integrin adhesome.
    O'Brien KT, Golla K, Kranjc T, O'Donovan D, Allen S, Maguire P, Simpson JC, O'Connell D, Moran N, Shields DC. O'Brien KT, et al. PLoS One. 2019 Jan 28;14(1):e0210337. doi: 10.1371/journal.pone.0210337. eCollection 2019. PLoS One. 2019. PMID: 30689642 Free PMC article.
  • LMDIPred: A web-server for prediction of linear peptide sequences binding to SH3, WW and PDZ domains.
    Sarkar D, Jana T, Saha S. Sarkar D, et al. PLoS One. 2018 Jul 12;13(7):e0200430. doi: 10.1371/journal.pone.0200430. eCollection 2018. PLoS One. 2018. PMID: 30001346 Free PMC article.
See all "Cited by" articles

References

    1. Letunic I., Copley R.R., Schmidt S., Ciccarelli F.D., Doerks T., Schultz J., Ponting C.P., Bork P. SMART 4.0: towards genomic data integration. Nucleic Acids Res. 2004;32:D142–D144. - PMC - PubMed
    1. Bateman A., Birney E., Cerruti L., Durbin R., Etwiller L., Eddy S.R., Griffiths-Jones S., Howe K.L., Marshall M., Sonnhammer E.L. The Pfam protein families database. Nucleic Acids Res. 2002;30:276–280. - PMC - PubMed
    1. Eddy S.R. Profile hidden Markov models. Bioinformatics. 1998;14:755–763. - PubMed
    1. Madera M., Gough J. A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res. 2002;30:4321–4328. - PMC - PubMed
    1. Bork P., Gibson T.J. Applying motif and profile searches. Meth. Enzymol. 1996;266:162–184. - PubMed

Publication types

Substances

LinkOut - more resources

Feedback