Mining databases for protein aggregation: a review

Amyloid. 2017 Sep;24(3):143-152. doi: 10.1080/13506129.2017.1353966. Epub 2017 Jul 18.


Protein aggregation is an active area of research in recent decades, since it is the most common and troubling indication of protein instability. Understanding the mechanisms governing protein aggregation and amyloidogenesis is a key component to the aetiology and pathogenesis of many devastating disorders, including Alzheimer's disease or type 2 diabetes. Protein aggregation data are currently found "scattered" in an increasing number of repositories, since advances in computational biology greatly influence this field of research. This review exploits the various resources of aggregation data and attempts to distinguish and analyze the biological knowledge they contain, by introducing protein-based, fragment-based and disease-based repositories, related to aggregation. In order to gain a broad overview of the available repositories, a novel comprehensive network maps and visualizes the current association between aggregation databases and other important databases and/or tools and discusses the beneficial role of community annotation. The need for unification of aggregation databases in a common platform is also addressed.

Keywords: Protein aggregation; amyloid; amyloidogenesis; amyloidosis; database.

Publication types

  • Review

MeSH terms

  • Alzheimer Disease / metabolism*
  • Amyloid / metabolism*
  • Animals
  • Data Mining*
  • Databases, Factual*
  • Diabetes Mellitus, Type 2 / metabolism*
  • Humans
  • Protein Aggregates*
  • Protein Aggregation, Pathological / metabolism*


  • Amyloid
  • Protein Aggregates