Convex-PL

 

Method Validation Downloads Tests References

About

Convex-PL is a knowledge-based scoring function for protein-ligand interactions.

Convex-PLR augments Convex-PL with conformational flexibility and solvation descriptors for better performance in affinity prediction and virtual screening tasks.

Method

Convex-PL

We suppose the interaction potential to be decomposed into a polynomial basis with expansion coefficients that can be found from a knowledge base with a convex optimization procedure. To do this, we assume that for a set of interaction sites located on the protein and ligand atoms, there is a linear functional dependent from the pairwise distances between the sites that indicates whether a configuration of a receptor-ligand complex is a native or a non-native one. The linear functional itself takes a simple form of a scalar product between a features “structure” vector and a scoring vector.

For a set of training protein-ligand complexes taken from the PDBBindCN 2015 database, we generate decoys with rigid-body deformations of ligand in the binding pocket preserving the ligand RMSD value. The training procedure includes the following stages (Fig. 1):

  1. Extraction of geometrical features, or the site-site distance distributions. For the typization of atomic sites we use our Knodle tool.
  2. These continuous features are projected into a set of discrete features.
  3. A Support Vector Machine quadratic optimization problem is solved to find the interaction potentials in form of a scoring vector.

To score an unknown configuration, a vector of its geometrical features is multiplied by the scoring vector, resulting in a relative binding energy prediction.

flowchart-02

Figure 1. Schematic representation of the training stage (top) and the scoring stage (bottom) of the method.

Convex-PLR

To better account for the conformational entropy and solvation energy changes, we augmented Convex-PL with additional terms and trained their weights with a linear ridge regression model.

VinaCPL

It is also possible to directly use Convex-PL for molecular docking. For this purpose, we created VinaCPL, where Convex-PL is integrated it as a scoring function into AutoDock

Vina. We suggest to use it in combination with the latest version of Convex-PL or with Convex-PLR for subsequent re-scoring. Specifically for re-scoring, we have modified the num_saved_min and num_modes parameters to get more output poses. It is also possible to provide a user-defined threshold for output poses clustering.

Validation

Convex-PL v0.2-v0.3

Method validation was done on the Comparative Assessment of Scoring Functions (CASF) 2013 benchmark. It included four tests that were already applied for 20 popular scoring functions assessment by the benchmark authors.

Compared to those 20 scoring functions, Convex-PL showed the best result in pose prediction, 5th result in the scoring test and 1st in ranking. Nevertheless, its results in the screening test representing the virtual screening tasks, which  involve higher diversity of chemical compounds for each target receptor, were average.

Convex-PLR

Convex-PL was validated on the CASF-2013 and 2016 benchmarks, and on the DUD virtual screening benchmark. In the CASF benchmarks it showed top-ranked results in the pose prediction and scoring tests. Inclusion of additional descriptors improved its virtual screening abilities compared with Convex-PL.

On the DUD Benchmark Convex-PLR shows state-of-the-art results, similar to those we obtained with AutoDock Vina and Vinardo, slightly outperforming them in the early enrichment metrics.

Download

VinaCPL for Linux

Convex-PL v0.4 and Convex-PLR for Linux

Convex-PL v0.3 for Linux

Convex-PL v0.3 for MacOS

Convex-PL v0.2 for Linux

Test cases

Convex-PL v0.2-v0.3

10gs protein with one ligand, 3ozt protein with 100 decoys and a native ligand (from the CASF 2013 benchmark)

Convex-PL v0.4 and Convex-PLR

4dli protein with one ligand and decoys from the CASF benchmark

VinaCPL

4dli protein and ligand with configurational files for Convex-PL and AutoDock Vina’s default scoring function

References

Convex-PL: a novel knowledge-based potential for protein-ligand interactions deduced from structural databases using convex optimization. Maria Kadukova, Sergei Grudinin. Journal of Computer-Aided Molecular Design, Springer Verlag, 2017, 〈10.1007/s10822-017-0068-8〉 https://hal.inria.fr/hal-01591154/file/2017-Kadukova-Convex-PL%3A%20a%20novel%20knowledge-based%20potential%20for%20protein-ligand%20interactions%20deduced%20from%20structural%20databases%20using%20convex%20optimization.pdf BibTex
Docking of small molecules to farnesoid X receptors using AutoDock Vina with the Convex-PL potential: lessons learned from D3R Grand Challenge 2. Maria Kadukova, Sergei Grudinin. Journal of Computer-Aided Molecular Design, Springer Verlag, 2018, 32 (1), pp.151-162, 〈10.1007/s10822-017-0062-1〉 https://hal.inria.fr/hal-01591157/file/2017-Kadukova-Docking%20of%20small%20molecules%20to%20farnesoid%20X%20receptors%20using%20AutoDock%20Vina%20with%20the%20Convex-PL%20potential%3A%20lessons%20learned%20from%20D3R%20Grand%20Challenge%202.pdf BibTex
Predicting Binding Poses and Affinities in the CSAR 2013―2014 Docking Exercises Using the Knowledge-Based Convex-PL Potential. Sergei Grudinin, Petr Popov, Emilie Neveu, Georgy Cheremovskiy. Journal of Chemical Information and Modeling, American Chemical Society, 2016, 56 (6), pp.1053-1062, 〈10.1021/acs.jcim.5b00339〉 BibTex
Predicting binding poses and affinities for protein-ligand complexes in the 2015 D3R Grand Challenge using a physical model with a statistical parameter estimation. Sergei Grudinin, Maria Kadukova, Andreas Eisenbarth, Simon Marillet, Frédéric Cazals. Journal of Computer-Aided Molecular Design, Springer Verlag, 2016, 30 (9), pp.791-804, 〈10.1007/s10822-016-9976-2〉 https://hal.inria.fr/hal-01377738/file/2016-Grudinin-Predicting%20binding%20poses%20and%20affinities%20for%20protein%20-%20ligand%20complexes%20in%20the%202015%20D3R%20Grand%20Challenge%20using%20a%20physical%20model%20with%20a%20statistical%20parameter%20estimation-post-print.pdf BibTex

Comments are closed.