The BAD project: data mining, database and prediction of protein adsorption on surfaces

Lab Chip. 2009 Apr 7;9(7):891-900. doi: 10.1039/b813475h. Epub 2008 Dec 24.

Abstract

Protein adsorption at solid-liquid interfaces is critical to many applications, including biomaterials, protein microarrays and lab-on-a-chip devices. Despite this general interest, and a large amount of research in the last half a century, protein adsorption cannot be predicted with an engineering level, design-orientated accuracy. Here we describe a Biomolecular Adsorption Database (BAD), freely available online, which archives the published protein adsorption data. Piecewise linear regression with breakpoint applied to the data in the BAD suggests that the input variables to protein adsorption, i.e., protein concentration in solution; protein descriptors derived from primary structure (number of residues, global protein hydrophobicity and range of amino acid hydrophobicity, isoelectric point); surface descriptors (contact angle); and fluid environment descriptors (pH, ionic strength), correlate well with the output variable-the protein concentration on the surface. Furthermore, neural network analysis revealed that the size of the BAD makes it sufficiently representative, with a neural network-based predictive error of 5% or less. Interestingly, a consistently better fit is obtained if the BAD is divided in two separate sub-sets representing protein adsorption on hydrophilic and hydrophobic surfaces, respectively. Based on these findings, selected entries from the BAD have been used to construct neural network-based estimation routines, which predict the amount of adsorbed protein, the thickness of the adsorbed layer and the surface tension of the protein-covered surface. While the BAD is of general interest, the prediction of the thickness and the surface tension of the protein-covered layers are of particular relevance to the design of microfluidics devices.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adsorption
  • Amino Acid Sequence
  • Databases, Factual*
  • Hydrogen-Ion Concentration
  • Hydrophobic and Hydrophilic Interactions
  • Isoelectric Point
  • Linear Models
  • Neural Networks, Computer
  • Osmolar Concentration
  • Proteins / chemistry*
  • Proteins / metabolism
  • Surface Properties

Substances

  • Proteins