Key challenges for the creation and maintenance of specialist protein resources

Gemma L Holliday; Amos Bairoch; Pantelis G Bagos; Arnaud Chatonnet; David J Craik; Robert D Finn; Bernard Henrissat; David Landsman; Gerard Manning; Nozomi Nagano; Claire O'Donovan; Kim D Pruitt; Neil D Rawlings; Milton Saier; Ramanathan Sowdhamini; Michael Spedding; Narayanaswamy Srinivasan; Gert Vriend; Patricia C Babbitt; Alex Bateman

doi:10.1002/prot.24803

Proteins. 2015 Jun;83(6):1005-13. doi: 10.1002/prot.24803. Epub 2015 Apr 22.

Authors

Gemma L Holliday¹, Amos Bairoch², Pantelis G Bagos³, Arnaud Chatonnet⁴,⁵, David J Craik⁶, Robert D Finn⁷, Bernard Henrissat⁸,⁹, David Landsman¹⁰, Gerard Manning¹¹, Nozomi Nagano¹², Claire O'Donovan⁷, Kim D Pruitt¹⁰, Neil D Rawlings⁷,¹³, Milton Saier¹⁴, Ramanathan Sowdhamini¹⁵, Michael Spedding¹⁶, Narayanaswamy Srinivasan¹⁷, Gert Vriend¹⁸, Patricia C Babbitt¹, Alex Bateman⁷

Affiliations

¹ Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, 94158.
² SIB-Swiss Institute of Bioinformatics, University of Geneva, Geneva, Switzerland.
³ Department of Computer Science and Biomedical Informatics, University of Thessaly, Lamia, 35100, Greece.
⁴ INRA, UMR866 Dynamique Musculaire et Métabolisme, Montpellier, F-34000, France.
⁵ Université Montpellier, Montpellier, F-34000, France.
⁶ Institute for Molecular Bioscience, The University of Queensland, Brisbane, Queensland, 4072, Australia.
⁷ European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.
⁸ Architecture et Fonction des Macromolécules Biologiques, CNRS, Aix-Marseille Université, Marseille, 13288, France.
⁹ Department of Biological Sciences, King Abdulaziz University, Jeddah, Saudi Arabia.
¹⁰ National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, 20892.
¹¹ Department of Bioinformatics & Computational Biology, Genentech, 1 DNA Way, South San Francisco, California, 98010.
¹² Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, 135-0064, Japan.
¹³ Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.
¹⁴ Department of Molecular Biology, University of California at San Diego, La Jolla, California, 92093.
¹⁵ National Centre for Biological Sciences, TIFR, GKVK Campus, Bellary Road, Bangalore, 560065, India.
¹⁶ Chair NC-IUPHAR, Spedding Research Solutions SARL, 6 Rue Ampere, Le Vesinet, 78110, France.
¹⁷ Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India.
¹⁸ Centre for Molecular and Biomolecular Informatics (CMBI), Radboud University Medical Center, Geert Grooteplein Zuid 26-28, 6525 GA, Nijmegen, The Netherlands.

Abstract

As the volume of data relating to proteins increases, researchers rely more and more on the analysis of published data, which increases the importance of good access to these data. Sources range from the supplemental material of individual articles all the way to major reference databases with professional staff and long-term funding. Specialist protein resources fill an important middle ground, providing interactive web interfaces to their databases for a focused topic or family of proteins, using specialized approaches that are not feasible in the major reference databases. Many are labors of love, run by a single lab with little or no dedicated funding, and there are many challenges to building and maintaining them. This perspective arose from a meeting of several specialist protein resources and major reference databases held at the Wellcome Trust Genome Campus (Cambridge, UK) on August 11 and 12, 2014. During this meeting, some common key challenges involved in creating and maintaining such resources were discussed, along with various approaches to address them. In laying out these challenges, we aim to inform users about how these issues impact our resources and to illustrate ways in which our working together could enhance their accuracy, currency, and overall value.

Keywords: big data; biocuration; key challenges; longevity; misannotation; specialist protein resource.

Publication types

Editorial
Research Support, N.I.H., Extramural
Research Support, N.I.H., Intramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
