Automatic identification and representation of protein binding sites for molecular docking

J Ruppert; W Welch; A N Jain

doi:10.1002/pro.5560060302

Automatic identification and representation of protein binding sites for molecular docking

Protein Sci. 1997 Mar;6(3):524-33. doi: 10.1002/pro.5560060302.

Authors

J Ruppert¹, W Welch, A N Jain

Affiliation

¹ Arris Pharmaceutical Corporation, South San Francisco, California 94080, USA.

Abstract

Molecular docking is a popular way to screen for novel drug compounds. The method involves aligning small molecules to a protein structure and estimating their binding affinity. To do this rapidly for tens of thousands of molecules requires an effective representation of the binding region of the target protein. This paper presents an algorithm for representing a protein's binding site in a way that is specifically suited to molecular docking applications. Initially the protein's surface is coated with a collection of molecular fragments that could potentially interact with the protein. Each fragment, or probe, serves as a potential alignment point for atoms in a ligand, and is scored to represent that probe's affinity for the protein. Probes are then clustered by accumulating their affinities, where high affinity clusters are identified as being the "stickiest" portions of the protein surface. The stickiest cluster is used as a computational binding "pocket" for docking. This method of site identification was tested on a number of ligand-protein complexes; in each case the pocket constructed by the algorithm coincided with the known ligand binding site. Successful docking experiments demonstrated the effectiveness of the probe representation.

MeSH terms

Algorithms
Bacterial Proteins / chemistry
Binding Sites
Models, Molecular
Molecular Probes
Protein Binding*
Streptavidin
Tetrahydrofolate Dehydrogenase / chemistry
Trypsin / chemistry

Substances

Bacterial Proteins
Molecular Probes
Streptavidin
Tetrahydrofolate Dehydrogenase
Trypsin