Finding a set of molecules, which closely resemble a given lead molecule, from a database containing potentially billions of chemical structures is an important but daunting problem. Similar molecular shapes are particularly important, given that in biology small organic molecules frequently act by binding into a defined and complex site on a macromolecule. Here, we present a new method for molecular shape comparison, named ultrafast shape recognition (USR), capable of screening billions of compounds for similar shapes using a single computer and without the need of aligning the molecules before testing for similarity. Despite its extremely fast comparison rate, USR will be shown to be highly accurate at describing, and hence comparing, molecular shapes.
(c) 2007 Wiley Periodicals, Inc.