Skip to main page content
Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Jun;83(6):1005-13.
doi: 10.1002/prot.24803. Epub 2015 Apr 22.

Key Challenges for the Creation and Maintenance of Specialist Protein Resources

Free PMC article

Key Challenges for the Creation and Maintenance of Specialist Protein Resources

Gemma L Holliday et al. Proteins. .
Free PMC article


As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, thus increasing the importance of good access to these data that vary from the supplemental material of individual articles, all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about how these issues impact our resources and illustrate ways in which our working together could enhance their accuracy, currency, and overall value.

Keywords: big data; biocuration; key challenges; longevity; misannotation; specialist protein resource.

Similar articles

See all similar articles

Cited by 7 articles

See all "Cited by" articles


    1. Lombard V, Golaconda Ramulu H, Drula E, Coutinho PM, Henrissat B. The carbohydrate-active enzymes database (CAZy) in 2013. Nucleic Acids Res. 2014;42:D490–D495. . [PMC][10.1093/nar/gkt1178] [24270786] - PMC - PubMed
    1. Kaas Q, Yu R, Jin AH, Dutertre S, Craik DJ. ConoServer: updated content, knowledge, and discovery tools in the conopeptide database. Nucleic Acids Res. 2012;40:D325–D330. - PMC - PubMed
    1. Wang CK, Kaas Q, Chiche L, Craik DJ. CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering. Nucleic Acids Res. 2008;36:D206–D210. - PMC - PubMed
    1. Lenfant N, Hotelier T, Velluet E, Bourne Y, Marchot P, Chatonnet A. ESTHER, the database of the alpha/beta-hydrolase fold superfamily of proteins: tools to explore diversity of functions. Nucleic Acids Res. 2013;41:D423–D429. - PMC - PubMed
    1. Tsaousis GN, Tsirigos KD, Andrianou XD, Liakopoulos TD, Bagos PG, Hamodrakas SJ. ExTopoDB: a database of experimentally derived topological models of transmembrane proteins. Bioinformatics. 2010;26:2490–2492. - PubMed

Publication types

LinkOut - more resources