3.1. Virtual screening
The present study aimed to identify compounds that could potentially inhibit Cbl-b protein by targeting its TKB domain using a structured based virtual screening approach. The structure of the target protein with a bound inhibitor was obtained and prepared as described. Prior to molecular docking, the ligand was redocked to the active site using built-in scoring functions of smina software. It was found that the redocked inhibitor, with vinardo scoring function, best reproduced the experimental binding pose with a docking score of -12.1 kcal/mol. The RMSD value between the original and redocked poses was 0.17 Å (<2) indicating the reliability of the docking procedure and the ability to use smina to perform molecular docking experiments. The alignment between the co-crystallized and redocked poses is shown in Fig. S1. A structurally diverse library of recently synthesized compounds was retrieved and prepared for molecular docking experiment. They have the following properties: 191 < molecular weight < 482, -4.4 < clogP < 4.9, 0 < HBA < 9 and 0 < HBD < 5, indicating their compliance with drug likeness rule (Lipinski Rule of 5). 49930 ligands were docked into the target active site and the generated poses were ranked based on their estimated binding energies. The docking scores ranged between -3.24 and -10.19 kcal/mol indicating that none of the docked compounds had better affinity than the original inhibitor (From here will be termed “Z3N”).
Scoring functions are used by molecular docking algorithms to evaluate the binding affinity of poses generated by the searching algorithm (Li, Fu and Zhang 2019). These “approximate” scoring functions don’t usually correlate well with the experimental binding affinities, increasing the likelihood of false-positives and false negatives appearing in the top ranked list (Rastelli and Pinzi 2019). To improve the accuracy of our virtual screening approach, by discriminating between binders and non-binders with high precision i.e., filter out false positives, a machine learning based scoring function (RF-Score-VS 1.0) was employed to re-score the top 1% docked poses (500 compounds, docking score < -8.61 kcal/mol). This scoring function was shown to outperform classical scoring functions such as X-score and Autodock vina (Li, Leung et al. 2015, Li, Lu et al. 2021) and has been employed successfully in other virtual screening studies (Ramesh, Shin and Veerappapillai 2021, Ramesh and Veerappapillai 2022, Krishnamoorthy and Karuppasamy 2023, Phengsakun, Boonyarit et al. 2023). 6 compounds had higher RF-Score than the reference inhibitor (6.34). Molecular docking and rescoring results together with the structures of the lead 6 compounds are depicted in Table 1. This indicated that both scoring functions complemented each other and provided more reliable screening results.
Table 1 Molecular Docking and rescoring results.
3.2. Protein-Ligand interaction analysis
To better understand the factors that would contribute to the binding of the potential inhibitors, their interactions with the targeted protein were analyzed and compared to the reference ligand. Fig. S2 shows the alignment of binding poses of the reference and lead compounds in the protein active site. Protein-ligand interactions are provided in Table S1 and represented in Fig. 2. The co-crystallized ligand Z3N, an N-aryl isoindolin-1-one inhibitor mediated three hydrogen bonds with Y224, F227 and Y312 at distances of 3.58, 3.04, and 3.97 Å respectively. Moreover, it formed hydrophobic interactions with T108, L112, T183, L186, F227 and T229. A parallel p-stacking and salt bridge were formed with Y312 and E232 respectively. Comparative analysis with the reference ligand revealed that the top hits mediated various hydrogen bonds and hydrophobic interactions with the active site residues. The polar aromatic Y312 was the most common residue involved in hydrogen bonding with the compounds (except for Z2077032497). It should be noted that the hydrogen bond donor-acceptor distances with Y312 were found to be shorter for the docked ligands, except for Z5011956847, compared to Z3N. The same was observed with F227 and Y224 suggesting a more favorable orientation inside the binding site. Z3396345178 and Z2077032497 formed the highest number of hydrogen bonds (five) which might be correlated with their high binding affinity (-9.72 and -10.06 kcal/mol respectively). The hydrophobic interactions are mediated most by F227 (except for Z5011956847) and K109 (except for Z1102107341 and Z5011956847). Z5011956847, having the lowest docking score (-8.86 kcal/mol) formed nine hydrophobic interactions and was the only compound to make a pi-stacking interaction with Y363, like Z3N. Results indicated that several residues contributed to ligands’ binding through formation of various interactions, mainly hydrogen bonding and hydrophobic interactions. The stability of these interactions needed to be further analyzed to better assess ligand binding efficiency.
3.3. ADME-T analysis
Evaluation of the absorption, distribution, metabolism, excretion, and toxicity is very important to assess the quality of a potential drug candidate. The pharmacokinetic and toxicity profiles of the top hits were determined using ADMETlab server and compared with the reference compound. Detailed results are listed in Table S2. Z3N had a QED score of 0.52, passed Lipinski rule but violated other drug likeness rules (GSK and Pfizer). In addition, it showed a Caco-2 permeability score of -5.47 which is not optimal and low HIA score indicating poor intestinal absorption. Pgp inhibition and BBB penetration scores were also high. Toxicity analysis revealed that it could potentially cause respiratory toxicity and block hERG, in addition to low probability of causing human hepatotoxicity. The QED score of the lead compounds ranged between 0.34 and 0.71 with Z5011956847 having the highest score indicating its attractiveness. All compounds passed Lipinski and Pfizer filters in addition to Z1102107341 which also satisfied GSK rules. They exhibited excellent Caco2-permeability and low HIA. Only Z3396339411 had higher HIA score than Z3N. Z5011956847 and Z2077032497 had the lowest Pgp inhibition probability, and the BBB penetration score was low with all top hits. Evaluation of potential toxicity showed that the lead compounds had high hERG blocking capabilities (except for Z5011956847). Z3396345178, Z2077032497 and Z5011956847 were predicted to be carcinogenic. Results indicated that the potential ligands had good pharmacokinetic properties, but caution must be taken because of possible side effects that would need to be monitored carefully. Based on molecular docking scores, binding interaction analysis and ADMET evaluation, the three compounds Z5011956847, Z2077032497 and were selected for further analysis.
3.4. MD simulations
MD simulations of the CBL-B protein in complex with the three lead compounds were performed alongside the reference ligand to better assess the system stability and conformational changes of CBL-B induced by potential inhibitor binding. Initially, RMSD plots of the protein backbone over the 100 ns were generated and depicted in Fig. 3A. Apo CBL-B was stable throughout the simulation run with average RMSD of 0.29 nm. CBL-B in complex with Z2077032497 had an average RMSD value of 0.38 nm with several increments above 0.4 nm indicating less stability. Other complexes exhibited similar RMSD pattern with average values ranging between 0.27 and 0.32 nm showing stability like the apo protein. RMSD of the ligands from their initial positions were also analyzed and shown in Fig. 3B. The reference ligand Z3N was stable for 50 ns with RMSD < 0.3 nm. The RMSD started to increase gradually and didn’t become stable until it reached 0.71 nm at the simulation end. Z3396339411 which had the highest docking score, exhibited major drift in RMSD at the beginning of the simulation from 0.2 to 0.8 nm. Till the end of the simulation, the ligand kept fluctuating around 0.8 nm. Z5011956847 was stable for 40 ns at 0.3 nm then fluctuates till it reaches around 0.68 nm at 50 ns. It then remained stable till the end of the run. Z2077032497 was somewhat similar to Z5011956847. It showed stability till 30 seconds at 0.3 nm after which RMSD increased till 0.5 nm at 50 ns. Stability was observed till the simulation end with an average RMSD of 0.55 nm. In general, RMSD analysis suggested that the selected ligands had no significant effect on CBL-B stability except for Z2077032497 which exhibited the highest RMSD value. The observed fluctuations in ligands’ RMSD plot indicated multiple conformations adopted by the ligands inside the binding pocket allowing possibly for different interactions with surrounding residues. This was evident from the snapshots taken for the ligands at 100 ns where different orientations from the initial positions were clearly observed (Fig. S3). Visual inspection also revealed the stability of the potential inhibitors inside the protein active site.
RMSF was computed to assess the fluctuations in CBL-B residues upon ligand binding, with lower values indicating lower flexibility and higher stability (Fig. 4A). Residues lining the binding site (within 5 Å of bound ligand) showed low RMSF values in the apo CBL-B implying high stability. High fluctuations were only observed in the far C-terminal region (residues 350-360) with RMSF reaching 0.3 nm. This is not surprising given its loop conformation. Similar RMSF pattern was exhibited in other complex systems indicating that all inhibitors did not influence the stability of the protein active site residues. Only, the region between residues 300-310 showed an increase in RMSF when bound to the reference ligand Z3N and Z2077032497. Additionally, the impact of inhibitor binding on protein compactness and folding nature was examined by measuring Rg. CBL-B in the apo state showed average Rg value of 2.24 nm indicating tightly packed structure (Fig. 4B). Similar pattern was observed in ligand-bound CBL-B with average Rg values ranging between 2.27 and 2.29 nm. In case of Z3N complex, fluctuations were observed in the last 30 ns with Rg increasing to 2.34 nm suggesting a less compact structure.
Overall, results indicated that bound inhibitors had no significant effects on protein overall stability or compactness. Notably, RMSD analysis indicated changes in ligands orientation enabling the formation of new interactions not observed in the initial docked poses.
Hydrogen bonds contribute significantly to the protein-ligand interaction, and hence the complex stability. Time evolution formation of hydrogen bonds for the studied complexes was monitored and depicted in Fig. 5. The reference compound Z3N exhibited a maximum number of 8 bonds with an average value of 3 bonds. The highest number of H-bonds was observed for Z2077032497 (10) followed by Z5011956847 (9) and Z3396339411 (8). It is worth mentioning that for 40 ns, Z5011956847 formed an average of 2 hydrogen bonds. After this, the number of hydrogen bonds increased till the simulation end. This can be attributed to the change in the compound conformation previously detected in the RMSD plot (Fig. 3B). In general, H-bond analysis indicated tight interactions of the lead compounds with the protein and hence more stability inside the binding pocket.
3.5. Post MD simulations interaction analysis
The last frames were retrieved from the 100 ns MD trajectories and the inhibitors’ interactions with CBL-B were analyzed (Fig. S4). Z3N formed one hydrophobic interaction with T229, one hydrogen bond with R105 and one salt bridge with E232. R105 interacted with fluoride atom of Z3N through halogen bonding which was not observed previously. Z3396339411 mediated one hydrophobic interaction with Y224 and three hydrogen bonds, two with R105 and one with F227. Six hydrogen bonds were established between Z5011956847 and CBL-B through R105, K109, S182, F227 and Y312. In addition, Z5011956847 formed four hydrophobic interactions with I113, A226, F227 and L228. The largest number of interactions was observed with Z2077032497 which interacted with CBL-B through five hydrogen bonds, with T229, Y230 and R250. It also formed five hydrophobic interactions with R105, T108, F227, L228 and Y230. Finally, a pi-stacking and pi-cation interactions with Y230 and R105 respectively were observed. Post MD simulation analysis indicated that all ligands exhibited interactions different to some extent from those detected after molecular docking procedure. This underlies the importance of MD simulations to examine protein-ligand interactions in a more dynamic context given the static nature of the docked poses. These observed changes might affect the affinity of the lead compounds towards CBL-B. Hence, it was crucial to estimate the binding energy of lead compounds using a more accurate way. This was performed in the next step.
3.6. Binding free energy calculations using MM-GBSA
Table 2 lists the various energy terms and the binding free energies calculated through MM-GBSA approach. Z2077032497 exhibited the highest affinity, showcasing a binding energy of -50.78 kcal/mol, followed by Z5011956847 (-42.48 kcal/mol) and Z3396339411 (-23.03 kcal/mol). In all complexes, electrostatic interactions provided the largest favorable contribution to the protein-ligand binding, followed by van der Waals forces. Notably, the binding energy of the positive control Z3N was significantly lower than other compounds, which can be attributed to the highly unfavorable polar solvation energy resulting in the overall decreased binding energy and hence lower affinity. These findings support the idea of utilizing these compounds as potential inhibitors of CBL-B. Moreover, per-residue decompositions analysis was performed to delineate the key residues contributing favorably to the overall binding free energy (Fig. 6). In general, amino acids with energy contributions greater than -1 kcal/mol can be considered as hot spot residues. For Z2077032497, R250 was the major contributor to the ligand binding with an energy of -38.1 kcal/mol followed by T229 (-16.8 kcal/mol) and Y230 (-8.4 kcal/mol). Other residues include C253, R105, L228, T108, F227, P35, K109, T254, L112 and S252 with energies ranging between -3.5 and -1.5 kcal/mol. From previous analysis, Z2077032497 formed hydrogen bonds with R250, T229 and Y230, which explains these favorable interaction energies. Concerning Z5011956847, the major contribution came from K109 with a binding energy of -32.8 kcal/mol. It was shown to mediate hydrogen bond with the ligand during MD simulation. Through VdW and electrostatic interactions, F227 contributed with an energy of -4.6 kcal/mol. Other key residues included M315, Y312, D185, L112, L186, S182, A226, M225 and I113. The interaction between Z3396339411 and CBL-B was primarily contributed by M315 and K109 with binding energies of -5.7 and -5.3 kcal/mol, respectively caused mainly by VdW forces. Notably, electrostatic interactions contributed favorably to K209-Z3396339411 binding with energy of -20 kcal/mol. However, polar solvation energy (19 kcal/mol) opposed this effect leading to a decreased binding energy. F227, A226, M225, L112, Y224 and T108 can be regarded also as hot spot residues (DG: between -2.5 and -1.2 kcal/mol). In conclusion, decomposition analysis revealed the significant role of several amino acids towards maintaining stable interactions with the lead compounds and explains the disparities in binding energies, and hence in binding affinities towards CBL-B observed among them. R250, T229, Y230, K109, F227 and M315 seemed to be the major hot spot residues, and development of CBL-B inhibitors is suggested to focus on mediating interactions with these amino acids.
Table 2 Binding free energies of CBL-B complexes computed via MM-GBSA method. Units are expressed in kcal/mol.
Inhibitor
|
DEele
|
DEVdW
|
DEgb
|
DEsurf
|
DGgas
|
DGsolv
|
DGtotal
|
Z3N
|
-103.30
|
-25.22
|
125.97
|
-4.78
|
-128.53
|
121.19
|
-7.33
|
Z3396339411
|
-129.34
|
-35.92
|
147.08
|
-4.85
|
-165.25
|
142.23
|
-23.03
|
Z5011956847
|
-247.13
|
-32.49
|
243.03
|
-5.89
|
-279.62
|
237.14
|
-42.48
|
Z2077032497
|
-206.06
|
-40.75
|
201.75
|
-5.72
|
-246.81
|
196.03
|
-50.78
|