3.1. Docking Validation
Docking studies provide valuable insights into the mechanism of interactions between drug and proteins. To confirm the accuracy, we repeated the docking of the co-crystallized inhibitor within its initial structure. Remarkably, this redocking process resulted in a low RMSD value of 0.2–0.4 Å. The docking outcomes were thoroughly examined, focusing on the binding conformation and interaction analysis detailed in (Supplementary Table 1, 2, 3).
3.2 Docking studies
The docking studies involved diverse conformers of 4-hydroxyisoleucine and its mimetics to assess their binding affinity with key diabetes targets. These investigations identified the top 2 ligands for each target out of a total of 23 ligands based on their docking score and interactions resembling those of the respective target co-crystal ligands. The chosen compounds demonstrated substantial interactions with the key active site residues; analysing these detailed binding interactions and evaluating the therapeutic potential of the top-selected compounds can help guide further exploration and development of these promising drug candidates. This in-depth understanding of the compound-target interactions is crucial for assessing the viability of these compounds for continued drug development efforts.
Upon docking studies, it was revealed that out of 23 ligands used in the study (supplementary material Table 2), ligand 4-OHIL-4 has the best docking score of -8.3 kcal/mol. As depicted from Fig. 1 (2b) ligand, 4-OHIL-4 shows many hydrogen bond interactions, salt bridge, and pi-pi stacking interactions with the target alpha glucosidase protein. Pi-pi stacking is at the distance of 5.13Å with the residue PHE649; hydrogen bonding interactions with the residues ASP404, ASP518, ARG600, HIE674, and LEU678 within the space of 1.94 Å, 1.65 Å, 2.12 Å, 2.22 Å, 2.54Å respectively. Compared with the co-crystal ligand Fig. 1 (1a), acarbose shows additional interaction. The amine group of the ligand shows pi-pi interaction at 5.13 Å with the aromatic ring of PHE649—two salt bridge formations with ASP518 and ASP616 at distances of 3.07 and 4.73 Å, respectively.
The co-crystalized ligand of the target alpha-amylase Fig. 1 (2a) shows respective hydrogen bonds and pi-pi stacking interactions with the protein. Pi-pi stacking is at the distance of 3.85 Å with the residue TRP59; hydrogen bonding interactions with the residues GLN63, ASP197, and GLH233 within the space of 2.1, 1.5, 2.1, and 2.6 Å, respectively. 2R-3S-4R-4OHIL shows additional pi-cation and salt bridge interactions compared with the co-crystal ligand. The amine group of the ligand shows pi-cation interaction at 6.3Å with the aromatic ring of TYR62. Two salt bridge formations with ASP300 at 3.5 and 3.6 Å. The amino and carboxy groups of VAL-L-4-OHIL form salt bridges with negative and positively charged residues ASP197 and ARG195 of 2.68 and 4.76 Å. These results highlight the potential of these compounds as promising candidates for further investigation in drug development efforts.
The co-crystal ligand of aldose reductase (4QX4) forms an essential Hydrogen bonding interaction with the polar and hydrophobic residues Fig. 1 (3a) HIE110, TRP111, TYR48, LEU300, and aromatic hydrogen bond with the VAL47. Compared with the co-crystal ligand, the 4-OHIL amide − 2 w a better binding affinity of -7.6 kcal/mol.
3.3 MM-GBSA
The post – molecular docking analysis, which is essential for determining the stability of receptor-ligand complexes, was computed using the Prime MM-GBSA approach. Recent studies have shown that docking score findings can be consistently verified using the computation of binding free energy. The calculations for the energy required to bind are expressed as ΔG. In mathematical terms, a stable protein-ligand complex is characterized negative values of the Gibbs free energy (ΔG), while incorrect docking outcomes are identified by positive ΔG values. The six derivatives of the 4-OHIL formed a stable complex with Alpha-glucosidase, Alpha-amylase, and Aldose reductase. 4-OHIL-Amide-2 ,4-OHIL -Amide-3, 4-OHIL-4, 4-OHIL-5, 4-OHIL-6, 4-OHIL-7 exhibited binding free energy of -64.3Kcal/mol, -45.7Kcal/mol, -49.6Kcal/mol, -66.6Kcal/ mol, -55.4Kcal/mol and − 60.0 Kcal/mol (Fig. 2).
3.4 Molecular Dynamic Analysis
The stability of interactions and conformations within physiological environments can be assessed by using molecular dynamic simulations. To evaluate the structural and flexibility properties of the top protein-ligand complexes and the reference compound for each antidiabetic target underwent a were 100 ns molecular dynamics simulation. The conformational and interaction stability of receptor-ligand complexes were assessed using root mean square deviation (RMSD) and root mean square fluctuation (RMSF). Notably, complexes involving 4-OHIL-4 and 4-OHIL of target Alpha Glucosidase(5NN8), 2R-3S-4R-4OHIL and 4-OHIL of target alpha-amylase (4GQR), 4-OHIL-Amide-2 and 4-OHIL of target aldose reductase (4QX4) exhibited remarkable stability RMSD when compared to known compounds. RMSD values of all the above-mentioned complexes fell between 0.2 to 0.4nm. RMSF analysis demonstrated minimal variation in residues across all complexes (Fig. 3), highlighting their constant stability.
3.5 Hydrogen Bonding Analysis
The stabilization of the protein-ligand complex is greatly aided by hydrogen bonding. Figure 4 shows the number of hydrogen bonds formed in complexes during the last 100 ns trajectory. The binding affinity between the protein and ligand greatly influenced by their polar interactions. The occupancy percentages for hydrogen bonds indicate how many trajectories were involved in the formation of hydrogen bonds during a simulation. In the case of the Alpha Glucosidase ligand, 4-OHIL-4 shows many hydrogen bond interactions, with the specific residues ASP404, ASP518, ARG600, HIE674, and LEU678 within the space of 1.94 Å, 1.65 Å, 2.12 Å, 2.22 Å, 2.54Å respectively. Hydrogen bonding interaction with the co-crystal ligand characteristic residues ASP404, ASP518, ARG600, HIE674, and LEU678 within a range of 2.5 Å. The co-crystallized form of alpha-glucosidase contained 23 hydrogen bonds, in which the ligand shows the occupancy rate with Asp 616 and Asp 282 at 76.13% and 76.95%. Meanwhile, the 4-OHIL-4 exhibited a higher occupancy rate than the co-crystal ligand.
The co-crystalized ligand of the target alpha amylase shows hydrogen bonding interactions with the residues GLN63, ASP197, and GLH233 within the space of 2.1, 1.5, 2.1, and 2.6 Å, respectively, 2R-3S-4R-4OHIL shows Hydrogen bonding interaction with characteristic residues ASP197, ARG195, GLH233, ASN 298 within a range of 2 Å, VAL-L-4-OHIL forms hydrogen bonding interaction at 1.7, 1.7, 1.8,1.9, 1.7 Å with characteristic residues such as ASP197, ARG195, GLH233, HIE299, ASP300. The co-crystallized form of alpha-amylase contained 32 hydrogen bonds, in which the ligand shows the occupancy rate with the Asp 300, Glu 233, and Asp 356 with 26.94%, 8.91%, and 11.05%. At the same time, the 2R-3S-4R-4OHIL was found to show a higher occupancy rate with the Asp 300, Glu 233, and Asp 356 with 30.77%, 20.26%, and 24.04% Compared with the co-crystal ligand. The co-crystal ligand of aldose reductase forms an essential Hydrogen bonding interaction with the polar and hydrophobic residues HIE110, TRP111, TYR48, LEU300, and aromatic hydrogen bond with the VAL47. The presence of significant number of hydrogen bonds with various target protein-ligand complexes indicates these are suitable inhibitors. It is possible that these residues could have a significant impact on the interaction between the protein-ligand complex. Specific residues involved in molecular interactions can be identified by analyzing hydrogen bond interactions.
3.6 Free energy landscape
Gmx Covar, Gmx Anaeig, and Gmx Sham were utilized to calculate the Gibbs free energy landscape, using projections from their own first (PC1) and second (PC2) eigenvectors. In (Fig. 5), the Gibbs free energy landscape is shown in a color-coded manner. Trajectories are used to analyze the direction of fluctuation for all C atoms in complex structures is investigated by the Gibbs free energy landscape through the use of trajectories. Lower energy is indicated by a deeper blue colour corresponds to the free energy contour map. After the binding of these compounds, the primary free energy in the global free energy region was found to have undergone a complete change. These molecules stable conformational states are strongly indicated by these free energies. Different global minima of various targets are observed during the 100ns of MD simulations due to the binding of ligands to the respective targets. A small energy barrier separates the metastable conformational states from several distinguishable minima on the energy landscape. The most metastable conformational states were seen in the binding in case of (Alpha Glucosidase )5NN8 with 4-OHIL and 5NN8 with 4-OHIL-4, (Alpha amylase) 4GQR with 4-OHIL, 4GQR with 2R-3S-4R, (Aldose Reductase) 4QX4-4-OHIL, 4QX4-4OHIL - Amide 2 in which local minima were distributed to about three to four regions within the energy landscape. The above-mentioned complexes formed just two to three metastable conformations during the whole trajectories.
4. Shape- Based Generative Modeling for de Novo Drug Design
Generative modelling provides an alternative approach to molecular discovery by reformulating molecular design as an inverse design problem. The number of compounds that have ever been synthesized lies around 10^8 while the total number of theoretically feasible compounds lies between 10^23 and 10^60. Conventional discovery methods are only capable of exploring a small fraction of chemical space. By identifying a function that maps a set of structures, generative models can rapidly identify diverse sets of molecules that are highly optimised for specific application. Generative modelling can be designed by sequence based and shape based. The sequence based are more biased towards the design of compounds obtainable by small molecule chemical modifications.
The shape based provide generative novel scaffold inspired from structure-based design (spatial information is considered). Shape variational encoder using convolutional neural network to autoencoder compound representation, Combination of CNN and long short-term memory network to generate smile string. Variational autoencoder is a type of neural network that encodes shapes, create puzzles and decodes.
Training, Featurization and Model Training (Ligdream)
Canonical smiles notations were collected from Zinc 15 database, by using the RDkit random conformer generated, optimization of 3D structures using MMFF94 forcefield, random splitting into training and test sets, protein targets taken from DUDE database, docking. This approach has some challenges such as performance drops as the target sequence gets longer, H-bond donars and Hbond acceptors are difficult to recover as these properties are not specifically marked and are mainly dependent on its surrounding. This approach is applied to the top obtained molecules against each diabetes target, generative structures are subjected to further docking process against alpha glucosidase, alpha amylase, aldose reductase target shown in supplementary data (Table 4, Fig. 1,2,3)[32, 33].