3.1. Tanimoto Coefficient
The similarity search algorithm based on molecular fingerprint is a significant tool usually employed in retrospective benchmarking studies. According to the similarity-property principle, increased chemical similarity correlates with a greater possibility that two molecules shares similar activity [35]. Therefore the degree of resemblance between the known inhibitor and the small molecules from the approved subset of the DrugBank database were measured using the Tanimoto coefficient (Tc). The overall structural similarity of all the compounds in our study ranged from 0.16 to 0.52and thus the mean value of the tanimoto co-efficient was used as selection criteria for screening compounds. So, compounds with the threshold value of 0.2 and above were considered for further molecular docking. A total of 1432 compounds possessed a structural similarity of 0.2 and above were used for further analyses.
3.2. Virtual Screening
The approved set of compounds with Tc ≥ 0.2 was retrieved and they were preprocessed using the LigPrep module of the Schrödinger suite. Subsequently, molecular docking was performed for all the compounds to identify the potential lead molecules against mIDH1 protein. Initially, all the compounds (1432 compounds) were subjected to the HTVS mode of docking. As the HTVS method indulges in faster prediction of the appropriate binding mode and ranks the ligand-based on its empirical scoring functions [27]. The top-scoring ligands were selected from HTVS level and used for SP docking. Towards the end, XP docking process was initiated with the high scoring compounds respectively. For instance, 299 molecules were subjected for XP docking and the binding score of each compound was compared with the reference compound Vorasidenib that had a binding score of -5.403 kcal/mol. In comparison, about 59 molecules showed increased binding affinity towards mIDH1 protein. The binding score of the screened compound was ranging from − 5.433 kcal/mol to − 9.964 kcal/mol. The Glide XP GScore of the docked complexes was depicted in Table S2.
3.3. Binding Free energy Analysis
The accuracy of docking was further validated by binding free energy analysis using MM-GBSA algorithm. The reference molecule Vorasidenib exhibited binding free energy (ΔGbind) of -56.44kcal/mol. Therefore, ΔGbind of Vorasidenib was used as the threshold to screen molecules with an enhanced binding affinity towards the target protein mIDH1. A total of 16 molecules from the approved set of DrugBank database showed lower binding energies than the reference compound. The results of the compounds were represented in Table 1. It is evident from the table that the binding energies of the compounds vary from − 57.42 kcal/mol to -77.33 kcal/mol. Further the major contributions to the enhanced binding of the hits are due to the exceptionally strong lipophilic and van der Waals interaction. For instance, shreds of literature evidence also highlight the significance of Coulomb, lipophilic energy, electrostatic solvation and van der Waals interactions in the effective binding of protein-ligand complex [36, 37]. Therefore, in addition to ΔGbind, the above mentioned parameters were also included in selection criteria. Notably, only 10 molecules namely, DB01076, DB13751, DB04868, DB00229, DB01698, DB06769, DB12001, DB00183, DB06590 and DB00492 possessed appropriate energy values. Other screened molecules such as DB01326, DB05039, DB03310, DB13874, DB00293 and DB12789 were eliminated because of its disfavored Coulomb, van der Waals and solvation energies with the values less than − 7.97 kcal/mol, -27.84 kcal/mol, and 11.88 kcal/mol respectively. These selected 10 molecules were then examined for its drug-likeliness screening.
Table 1
The energetic of the reference and the hits from MM-GBSA analysis
S.No. | Compound | MMGBSA dG Bind (kcal/mol) | MMGBSA dG Bind Coulomb (kcal/mol) | MMGBSA dG Bind Lipo (kcal/mol) | MMGBSA dG Bind Solv GB (kcal/mol) | MMGBSA dG Bind vdW (kcal/mol) |
1. | Vorasidenib | -56.44 | -7.97 | -27.84 | 11.88 | -29.58 |
2. | DB00492 | -77.33 | -13.60 | -41.43 | 25.26 | -48.79 |
3. | DB06590 | -75.12 | -34.31 | -35.1 | 42.11 | -47.68 |
4. | DB12789 | -71.77 | -20.69 | -47.58 | 18.68 | -29.50 |
5. | DB00183 | -69.44 | -8.64 | -38.60 | 20.3 | -46.73 |
6. | DB12001 | -69.26 | -9.23 | -41.92 | 20.91 | -44.45 |
7. | DB00293 | -68.84 | -28.75 | -21.27 | 20.04 | -31.48 |
8. | DB06769 | -68.83 | -18.14 | -40.93 | 22.08 | -30.83 |
9. | DB01698 | -68.69 | -19.00 | -39.01 | 17.07 | -31.45 |
10. | DB00229 | -67.25 | -8.27 | -31.51 | 14.82 | -36.34 |
11. | DB13874 | -66.98 | -16.66 | -28.68 | 11.46 | -35.72 |
12. | DB03310 | -65.49 | -34.80 | -24.7 | 44.61 | -48.42 |
13. | DB04868 | -64.92 | -8.88 | -34.56 | 18.95 | -43.26 |
14. | DB05039 | -62.92 | -21.96 | -25.49 | 13.55 | -24.02 |
15. | DB01326 | -61.75 | -19.46 | -28.16 | 12.69 | -28.13 |
16. | DB13751 | -61.09 | -18.42 | -28.98 | 24.47 | -34.85 |
17. | DB01076 | -57.42 | -19.21 | -34.80 | 20.95 | -30.75 |
3.4. In-silico drug bioavailability screening
In search of an efficient drug candidate for glioma treatment, the resultant compounds with high binding affinity and binding energies were analyzed for their drug-likeliness and pharmacokinetic properties using the QikProp module of the Schrödinger suite. QikProp module is equipped with a varying range of descriptors that broadly categorizes the absorption, distribution, metabolism and excretion properties of the chemical structure. The inclusion criteria considered for the screening of desired small molecules include the blood-brain barrier (log BB), stars, human oral absorption (HOA) and CNS. The FDA-approved small molecule ivosidenib was restricted from use only due to its poor blood-brain penetrating ability with the logBB value of -1.112 [38]. Thus, the screened molecules with increased logBB value were taken into account. The star descriptor identifies a number of physiochemical features each screened molecule failed to abide within the acceptable limits. The compounds with less than 5 star values were selected as hit compounds. The HOA values of reference and the hits were analyzed and compounds with an HOA value of 3 were scrutinized.The CNS value of the molecules lies between the range of -2 to 2. In glioma the neoplastic cells are predominantly present in the neuronal cells; therefore an effective drug molecule should penetrate into the brain cells to inhibit the activity of tumor cells. Hence, an active CNS score of zero and above was considered for screening. The PK/PD properties of all the screened molecules are shown in Table 2. Amongst the 10 hit molecules, only DB12001 was found to have satisfactory QPlogBB, stars, HOA and CNS properties with the value of 0.14, 1, 3 and 1 respectively. Further, the biological activity of the hit compound was predicted using the PASS server [31]. The server predicts the biological activity by calculating the probability of active and inactive. The potent hit molecule should possess a higher probability of active (Pa) score when compared with the probability of inactivity (Pi). Interestingly, the identified hit was found to have anti-cancerous activity with a Pavalue of 0.5 (Table S3). Finally, the toxicological endpoints of the Vorasidenib and DB12001 were examined using the ProTox-II server [30]. From Table 3 we infer that the reference compound was found to exhibit organ toxicity, especially hepatotoxicity. Interestingly, DB12001 was predicted to have a lethal dose level of 2000 mg/kg and categorized under class IV toxicity. Further, the molecular insights of the reference and the hit were inspected through the interaction pattern and structural scaffolds.
Table 2
The Pharmacokinetics and drug-likeliness properties of the reference and the screened hit molecules
S.No. | Compound | | Descriptors |
Lipinski rule of five (Ro5) | Jorgensen’s rule of three (Ro3) | | Others |
MWi | HBAii | HBDiii | QplogPo/Wiv | NRBv | QPPCacovi | QPlogSvii | #Starviii | CNSix | HOAx | QPlogBBxi |
1. | Vorasidenib | 414.74 | 5.00 | 2.00 | 4.786 | 7 | 2832.41 | -6.728 | 3 | 1 | 1 | 0.328 |
2. | DB00492 | 563.67 | 12.00 | 1.00 | 4.13 | 17 | 86.59 | -5.21 | 0 | -2 | 2 | -1.77 |
3. | DB06590 | 420.49 | 9.70 | 5.00 | -1.427 | 9 | 0.906 | -3.42 | 0 | -2 | 2 | -2.52 |
4. | DB00183 | 767.89 | 13.25 | 4.75 | 3.31 | 28 | 0.55 | -6.00 | 12 | -2 | 1 | -4.66 |
5. | DB12001 | 506.60 | 9.00 | 1.00 | 4.51 | 7 | 103.70 | -5.81 | 1 | 1 | 3 | 0.14 |
6. | DB06769 | 358.26 | 4.50 | 1.00 | 4.49 | 10 | 182.89 | -5.36 | 1 | -1 | 3 | -0.75 |
7. | DB00229 | 525.61 | 13.25 | 3.25 | -1.68 | 12 | 0.36 | -2.99 | 1 | -2 | 1 | -2.91 |
8. | DB04868 | 529.52 | 8.00 | 2.00 | 5.90 | 8 | 617.10 | -9.33 | 3 | -2 | 1 | -1.04 |
9. | DB13751 | 822.94 | 21.30 | 6.00 | 1.92 | 15 | 0.009 | -6.034 | 12 | -2 | 1 | -5.47 |
10. | DB01076 | 558.64 | 6.90 | 3.00 | 6.55 | 16 | 48.853 | -7.521 | 3 | -2 | 1 | -1.99 |
11. | DB01698 | 610.52 | 20.55 | 9.00 | -2.38 | 16 | 1.10 | -2.70 | 9 | -2 | 1 | -4.89 |
MWi -Molecular weight |
HBAii- Hydrogen Bond acceptor |
HBDiii - Hydrogen Bond donor |
QplogPo/wiv- Predicted water/octanol partition co-efficient |
NRBv– Number of rotatable bonds |
QPPCacovi- Predicted apparent Caco-2 cell permeability in nm/sec |
QplogSvii- Predicted aqueous solubility |
#starsviii - Number of property or descriptor values that fall outside the 95% range of similar values for known drugs |
CNSix - Predicted central nervous system activity on a -2 (inactive) to + 2 (active) scale |
HOAx- Predicted human oral absorption |
QPlogBBxi- Predicted brain/blood partition coefficient |
Table 3
Toxicity endpoint analysis of the reference and hit using ProTox-II server
S. No | Compound | Hepatotoxicity | Mutagenicity | Carcinogenicity | Predicted LD50(mg/kg) | Predicted toxicity class |
1. | Vorasidenib (reference) | Active | Inactive | Inactive | 1700 | 4 |
2. | DB12001 | Inactive | Inactive | Inactive | 2000 | 4 |
3.5. Interactions of the Protein-ligand complex
Initially, the binding scheme of all the available PDB codes of mIDH1 protein along with their respective ligand was analyzed using PLIP [39]. Figure 2 highlights the important interacting residues of mIDH1 protein along with their PDB codes. It is evident from the Fig. 2 that most of the mIDH1 proteins were forming hydrophobic interactions with residues such as TRP124, VAL255 which are highlighted in red color. Predominantly, hydrogen bond interactions were formed by the residues GLN277, ASP279, SER280. Similarly, the binding pattern of the Vorasidenib and DB12001 was analyzed and key interactions were identified. The atomic interactions of the hit and the reference molecules were highlighted in different color representations. The interaction of Vorasidenib and DB12001 is highlighted in Fig. 3. The blue color line depicts the existence of hydrogen bond interactions. From the results, we infer that the Vorasidenib was capable of forming two hydrogen bond interactions with the residue GLN277. Furthermore, Vorasidenib was able to form halogen bond interaction between ASP273 residue and chloro-pyridine moiety which is highlighted in cyan color. Comparably, DB12001 was also observed to create two hydrogen bond interactions with the residue GLN277. In addition, DB12001 was forming hydrophobic interactions (dashed line) with some of the conserved residues present in the catalytical site of mIDH1 protein such as TRP124, ASP252, VAL276 and SER280 [16]. Overall, from the above results, we deduce that the additional hydrophobic interaction might contribute to the increased binding affinity of the hit molecule against the mIDH1 protein.
3.6. Scaffold Analysis
Shreds of literature evidence report that amino triazine and chloropyridine moiety of Vorasidenib enhances its binding affinity towards mIDH1 protein [16, 38]. Similarly, the structure of DB12001 was analyzed to identify the important structural moiety that contributes to the increased binding affinity. The benzimidazole group of DB12001 was reported to exhibit anti-cancerous activity especially against glioma [40]. The 2-dimensional structure of both the reference and the hit are depicted in Fig. 4. The generic name of DB12001 is Abemaciclib. It is an anti-cancerous agent that is reported to dually inhibit cyclin-dependent kinase 4 and 6 (CDK 4 and CDK 6). Abemaciclib is an FDA-approved drug used for the treatment of HR-positive and HER-negative metastatic breast cancer in combination with Fulvestrant [41]. Further, the drug has been used in various other trials including for the treatment of melanoma, lymphoma and solid tumors [42, 43]. All these literature evidence highlights that Abemaciclib could also be repurposed for the inhibition of mIDH1 in glioma treatment.Thus, the structural dynamics of Vorasidenib and Abemaciclib were investigated using molecular dynamics simulation studies to enrich the prediction accuracy.
3.7. Molecular Dynamics Simulation
3.7.1. Root Mean Square Deviation (RMSD)
The conformational stability of the protein-ligand complexes were evaluated using MD simulation. The gmx rms utility of gromacs was employed to quantitatively estimate the conformational changes and the stability of the system that occur within the stipulated time boundaries [44]. In our study, the average root mean square deviation (RMSD) of the reference and the hit complexes were calculated for backbone atoms of the protein. Figure 5 highlights the RMSD plots of mIDH1-Vorasidenib (reference) and mIDH1-Abemaciclib (hit) complex. From Fig. 5, it is evident that the reference complex exhibited least deviation from 0 to 27ns. However, the average RMSD of the reference complex increased from ~ 0.41 to ~ 0.65 nm between 30 to 100ns. Although the hit complex showed increased deviation from 0 to 60 ns with the RMSD value of ~ 0.49 nm, the complex attained the state of equilibrium within 70 ns time frame. Interestingly, at the end of the 100ns simulation the complex showed the average RMSD value of ~ 0.48 nm. Thus, from RMSD plot we infer that the mIDH1-Abemaciclib complex proclaimed lesser backbone deviation when compared to mIDH1-Vorasidenib complex.
3.7.2. Root Mean Square Fluctuation (RMSF)
The root mean square fluctuations of individual amino acids were analyzed using gmx rmsf tool of gromacs. In the mIDH1-Vorasidenib complex, PRO147, PRO149, GLY150, ILE154, THR155, ASP160, GLY161, PRO384 and SER415 were few majorly flexible residues. While residues such as GLU17, LYS72, ALA193, SER195, SER196, PHE197, GLN198 and MET199 were relatively stable and expressed subtle fluctuations. In case of mIDH1-Abemaciclib complex LYS3, LYS81, VAL146, PRO147, GLY148, PRO149, THR157, PRO158, SER159, ASP160, GLY161, GLU174, GLY175, GLY176 and GLY177 were found to exhibit high flexibility. On the other hand the residues MET13, GLN14, ASP16, ILE26, LYS27, ILE31, GLU110, ILE113, ILE128, ASP137, SER196, MET199, ILE266, TRP267, THR292, GLU306 and TYR316 were found to exhibit less flexibility as evident from its RMSF value less than ~ 0.03 nm. Of note, active site residues such as ILE113, MET199, TRP267 and THR292 exhibited restricted fluctuation which highlights its involvement in the binding of Abemaciclib (Fig. 6).
3.7.3. Hydrogen bond
The gmx hbond was employed to ascertain the specific inter-molecular interactions between protein-ligand complexes. The stability of the H-bond created between reference and the hit complex was deduced by extracting the time dependent hydrogen bond pattern observed throughout 100ns simulation. The results from the trajectory (Fig. 7) revealed that the reference complex formed an average of ~ 6 hydrogen bonds during the simulation. Whilst the hit complex was capable of forming 3 hydrogen bond interaction with the target protein. From interaction studies, we understand that the hit complex was actively forming hydrophobic interactions with the crucial binding site residues of mIDH1 protein (Fig. 3).
3.7.4. Radius of Gyration
The structural compactness of the reference and hit complex were analyzed using gmx gyrate. The inbuilt gyrate (Rg) tool of gromacs calculates the weighted root mean square distance of collective Cα atoms from the center of mass. Thus, Rg imparts insights on the overall dimensions and the folding state of the target protein [45]. Increased fluctuation in Rg value highlights the unfolding of the target protein. Figure 8 illustrates the radius of gyration plot for the Cα atoms of the mIDH1-Vorasidenib and mIDH1- Abemaciclib complex. Initially, both the complexes were found to have increased RG value of ~ 2.31 nm. However, at the end of 100ns simulation period the Rg value of Vorasidenib and Abemaciclib complex was found to be ~ 2.08 nm and ~ 2.11 nm respectively. It is evident from the results that the reference and hit complexes were stable throughout the simulation time [46].
3.7.5. Solvent Accessible Surface Area (SASA)
SASA estimates the interacting surface area of target protein along with its solvent molecules [47]. The gmx sasa tool was employed to measure the average SASA value of mIDH1-Vorasidenib and mIDH1-Abemaciclib complex throughout the time period of 100ns. The results from SASA plot (Fig. 9) illustrates that the mean SASA value of Vorasidenib and Abemaciclib was 197.39 nm2 and 200.71 nm2 respectively. The increased SASA value of the hit complex signifies that the internal residues of the mIDH1-Abemaciclib complex are disclosed to the solvent molecules for interactions.The free energy of solvation for hit complex is similar to that of the reference. Thus, the results from SASA analysis highlight the stable binding of the hit complex.
3.7.6. Principal Component Analysis (PCA)
Essential dynamics / PCA aids in identifying the most dominant and probable conformational changes that occurs in the target protein at the time ligand binding. This study allows us to quantify the effect of functionally critical movement upon ligand binding [48]. The important conformational subspace with crucial amount of collective motions of the reference and hit complex are confined within first few eigenvectors. Notably, the first two principal components (PCs) were used for detailed study. Each point on the subspace illustrates a specific conformation of the protein ligand complex. The flexibility of mIDH1-Vorasidenib and mIDH1-Abemaciclib complex was evaluated using the trace value. The trace value for the reference and the hit complex was found to be 26.08 nm2 and 14.12 nm2 respectively. The higher trace value of the reference compound suggests that there is an increased flexibility and expansion in the collective motion than the hit complex. The 2d projection of the MD trajectories was depicted in Fig. 10 (a) illustrates the concreted motion of system in the phase space spanned by the first two principal components of the reference and hit complex. Interestingly, the results revealed that the Abemaciclib complex occupied smaller phase space along the PC when compared to the reference complex implicating higher flexibility of the reference compound [49]. Notably, the results of 2d projection were found to be in agreement with trace values of the covariance matrix.
Furthermore, covariance calculation also aids to correlate the collective motion of the protein. For instance, when two atoms moves unidirectional they are termed as positively correlated and if the atoms moves in opposite directions they are termed as anti-correlated motion. The positively correlated motion of atoms is highlighted in red regions and the negatively correlated motion in blue region. From Fig. 10, we conclude that both the complexes showed net anti-correlated motion. Overall, less trace value and smaller conformational phase space of the hit molecule suggest its stable binding than the reference compound.
3.7.7. Free Energy Landscape (FEL)
Finally, the conformational states of the PCs were examined in terms of free energy landscape to gain insights on the protein folding. The FEL is merely post-processing of PCs where the difference between free energy is evaluated from probability of energy state occupancy [50]. The conformational states of each complex are represented in different color codes. For instance, the global energy minima are represented in blue and the meta-stable states are depicted in green and cyan. For mIDH1-Vorasidenib, a single narrow energy minima basin was identified, while mIDH1- Abemaciclib complex showed a broader free energy minima basin. Figure 11 highlights the FEL plot of reference and hit complex. From the FEL analysis, we infer that the Abemaciclib complex is thermodynamically stable than mIDH1-reference complex.