Effects of the difference in similarity measures on the comparison of ligand-binding pockets using a reduced vector representation of pockets

Biophys Physicobiol. 2016 Jul 14:13:139-147. doi: 10.2142/biophysico.13.0_139. eCollection 2016.

Abstract

Comprehensive analysis and comparison of protein ligand-binding pockets are important to predict the ligands which bind to parts of putative ligand binding pockets. Because of the recent increase of protein structure information, such analysis demands a fast and efficient method for comparing ligand binding pockets. Previously we proposed a fast alignment-free method based on a simple representation of a ligand binding pocket with one 11-dimensional vector, which is suitable for such analysis. Based on that method, we conducted this study to expand and revise similarity measures of binding pockets and to investigate the effects of those modifications with two datasets for improving the ability to detect similar binding pockets. The new method exhibits higher detection performance of similar binding pockets than the previous methods and another existing accurate alignment-dependent method: APoc. Results also show that the effects of the modifications depend on the difficulty of the dataset, implying some avenues for methods of improvement.

Keywords: alignment-free comparison; large-scale comparison; multidimensional scaling; triangle descriptor.