Re-docking results of ligand molecules with receptor targets
Before initiating large-scale virtual screening, the validity of parameter settings should be assessed using the RMSD values, derived from the comparison of docked ligand molecules with their original counterparts. An RMSD value equal to or less than 2 Å signifies the reasonableness of the docking parameter settings, reflecting a high overlap between the docked and original ligand molecules [26]. This correlation implies that a higher overlap corresponds to a smaller RMSD value. The redocked ligand aligns significantly with the original ligand, evidenced by the RMSD value of 0.164 Å for ADP (refer to Supporting Information Fig. S1). Such an observation substantiates the reliability of the chosen docking methodology.
Structure-based virtual screening (SBVS) of natural products
A comprehensive search for novel chemical types was performed through several screening rounds, which resulted in the selection of three natural compounds out of a total of 213,176 compounds. These compounds were derived from three different natural product databases: InterBioScreen, The Natural Product Atlas, and ZINC. Given the functional and structural similarities between NEK7 and NEK9, we used Dabrafenib, a potent NEK9 inhibitor (with an IC50 value of 1–9 nM), as an active reference compound. The docking score of Dabrafenib with NEK7 served as the benchmark for our virtual screening process. Natural products from the databases mentioned underwent two rounds of docking with varying precision levels, namely semi-flexible docking and flexible docking, to ascertain their potential affinity with NEK7. From the semi-flexible docking process, we identified 56,375 compounds that exhibited higher affinity with NEK7, surpassing the − 8.909 kcal/mol docking score of Dabrafenib with NEK7. Additionally, the top 5,000 compounds, which demonstrated binding energies ranging from − 12.0 to -17.0 kcal/mol, were classified as high-ranking compounds. These compounds potentially showcase a more potent inhibitory impact on NEK7.
In the subsequent analysis, several amino acids located on NEK7's active loop, specifically Ala61, Ala165, Asp118, Gly117, Ala116, Arg121, Lys63, Gly43al48, Gly41, Leu111, Leu113, Phe168, Asp115, Ala114, Asp179, Ile40, Asn166, and Ile40, were identified for their flexibility, which aids in the docking of high-ranking compounds with NEK7. Three natural product molecules, namely, scutellarin, digallic acid, and (-)-balanol, were singled out for their high NEK7 affinity and multiple interactions. These selections were made based on docking score rankings and visual screening, with docking scores of -11.547, -13.059, and − 15.054 kcal/mol, respectively. In comparison, the flexible docking score of dabrafenib, the active control compound, with NEK7 was − 9.071 kcal/mol. The molecular structures of the SBVS shortlisted compounds, along with their docking scores and interacting residues, are detailed in Table 1 and Fig. 3. An in-depth interaction analysis will be presented in the following section.
Examining their structure, the natural products that bind tightly to NEK7 are structurally diverse, and the three aforementioned molecules have unique skeletal configurations. Of particular note, recent research has found scutellarin to influence genes associated with tumor cell proliferation, migration, angiogenesis, and metabolism via the regulation of cellular signaling pathways. Moreover, scutellarin alters the tumor microenvironment to promote an enhanced immune cell response and curb excessive inflammation, a mechanism leveraged by tumor cells for their growth [44]. Digallic acid, a polyphenolic compound isolated from Pistacia lentiscus L. fruits, selectively triggers apoptosis in cancer cells [45, 46]. Furthermore, (-)-Balanol, a natural product originally isolated from Verticillium balanoides, mimics ATP and inhibits protein kinase C (PKC) isozymes and cAMP-dependent protein kinase (PKA) with minor selectivity. The frequent implication of PKCε in cancer progression suggests its potential as an anticancer drug target [47]. Given the established activities of these compounds, they show promise for cancer chemotherapy. With the support of NEK7-based virtual screening results, we propose that they may contribute to anticancer activities by acting on NEK7.
Table 1
The complete detail of shortlisted compounds.
PubChem CID | Molecular Formula | Compounds Names | Molecular Weight | Origin Organism | Origin Genus | Origin Species |
185617 | C21H18O12 | Scutellarin | 462.4 | Plant | Erigeron | breviscapus |
341 | C14H10O9 | Digallic acid | 322.22 | Plant | Pistascia | lentiscus |
5287736 | C28H26N2O10 | (-)-Balanol | 550.5 | Fungus | Verticillium | balanolides |
3.3. Molecular docking analysis of candidate molecules
To uncover the binding potential and mode of action of the shortlisted natural products with NEK7, we evaluated an interaction analysis of their molecular docking. This evaluation aimed to improve our understanding of their pharmacological activity and binding mechanism against NEK7. In these studies, each ligand demonstrated several binding interactions with the receptor (Table 2). The docking order, based on scores, was as follows: (-)-balanol > digallic acid > scutellarin.
First, we observed the binding mode of dabrafenib with NEK7's active pocket (docking score=-9.071 kcal/mol). Dabrafenib primarily bound to the catalytic loop, interacting with several key amino acids (Fig. 4A). The 2-aminopyrimidine ring of dabrafenib formed a hydrogen bond with GLU112, which partially represents the strength of the bond. Another hydrogen bond was detected between the benzene sulfonamide fragment and Ile40. Additionally, certain nonpolar amino acids in the catalytic loop, such as Ile40, Phe45, Val48, formed hydrophobic interactions with dabrafenib.
Next, the docking conformation of scutellarin with NEK7's active pocket resulted in a stable protein-ligand complex (docking score − 11.547 kcal/mol). Scutellarin exhibited significant molecular interactions, such as hydrogen bonds and hydrophobic interactions, due to its polyhydroxy and polycyclic structure. These interactions stabilize the protein-ligand complex. The glucose part of scutellarin formed hydrogen bonds with Gln44, Ser46, Asp179, and Leu180. Furthermore, the phenolic hydroxyl of the flavone part also formed hydrogen bond interactions with Glu112 and Ala114, allowing scutellarin better NEK7 binding ability than Dabrafenib (Fig. 4B).
The docking conformation of digallic acid revealed strong hydrophobic and hydrophilic interactions with the active site amino acid residues (Fig. 5A). Digallic acid forms six hydrogen bonds with various amino acids in the active site, all related to the hydroxyl groups on the benzene ring. However, in comparison to scutellarin, digallic acid had fewer hydrophobic interactions with the active loop.
Finally, the molecular interactions of (-)-Balanol involved crucial amino acid residues in the target protein's active pocket. (-)-Balanol exhibited significant molecular interactions that contribute to the binding affinity and binding score of the protein-ligand complex. Unlike dabrafenib, (-)-Balanol has more hydrophilic interactions and fewer hydrophobic interactions. These interactions significantly enhance the binding energy. Figure 5B presents the potential two-dimensional and three-dimensional binding mechanisms of (-)-Balanol.
ADME/T predictions of the hit compounds selected from virtual screening
Table 3. Predicted physiochemical and druglikeness properties of the reference dabrafenib and top-hit natural compounds.
Molecular property
|
Dabrafenib
|
Scutellarin
|
Digallic acid
|
(-)-Balanol
|
nHA a
|
7
|
12
|
9
|
12
|
nHD b
|
3
|
7
|
6
|
7
|
nRot c
|
6
|
4
|
4
|
9
|
TPSA d
|
111.59
|
207.35
|
164.75
|
202.72
|
Molecular Weight
|
519.1
|
462.08
|
322.03
|
550.16
|
logP e
|
4.15
|
0.751
|
1.352
|
3.253
|
Lipinski rulef
|
Accepted
|
Rejected
|
Accepted
|
Rejected
|
a Number of hydrogen bond acceptors. b Number of hydrogen bond donors. c Number of rotatable bonds. d Topological polar surface area. e Log of the octanol/water partition coefficient. f If two properties are out of range, a poor absorption or permeability is possible, one is acceptable.
The physicochemical attributes of prospective compounds are pivotal parameters closely associated with the preliminary phase of drug discovery. A molecule's likelihood of being a successful drug (drug-likeness) frequently hinges on these characteristics. Table 3 presents the physicochemical profiles of the natural product candidates and active compounds. The Lipinski's Rule of Five, a guideline for identifying potential orally bioavailable drugs, sets forth the following criteria: molecular weight below 500 Da, fewer than five hydrogen bond donors, fewer than ten hydrogen bond acceptors, and a calculated logP (octanol-water partition coefficient) below 5 [48]. Dabrafenib and digallic acid comply with Lipinski's Rule, but scutellarin and (-)-balanol exceed its thresholds. In detail, scutellarin infringes upon the rule regarding the count of hydrogen bond donors and acceptors, and (-)-balanol breaches the restrictions on molecular weight and hydrogen bond donors and acceptors, likely attributable to the preponderance of hydroxyl groups within their molecular structures. This suggests potential absorption or permeation issues. Notwithstanding, the logP values of all candidate compounds, including dabrafenib, reside within acceptable limits, suggesting an optimal equilibrium between permeability and first-pass elimination. The total polar surface area and quantity of rotatable bonds can dictate a molecule's passive transport, enabling the prediction of the drug's transport attributes [49]. Except for dabrafenib, the total polar surface areas of most compounds exceed 140 Å2 [50], indicating potential challenges in absorption and transport for these candidate compounds.
Table 4
Predicted ADME properties of the reference dabrafenib and top-hit natural compounds.
Parameter | Dabrafenib | Scutellarin | Digallic acid | (-)-Balanol |
Absorption |
Caco-2 permeability a | -5.816 | -6.389 | -6.063 | -6.462 |
MDCK permeability b | 8.7e− 5 | 1.2e− 5 | 5e− 6 | 5e− 6 |
Pgp-inhibitor c | 0.973 | 0 | 0.002 | 0.003 |
Pgp-substrate d | 0.001 | 0.544 | 0.009 | 0.424 |
Distribution |
PPB e | 100.6% | 91.38% | 90.72% | 95.67% |
VD f | 0.509 | 0.662 | 0.524 | 0.961 |
Metabolismg |
CYP1A2 inhibitor | 0.339 | 0.054 | 0.053 | 0.538 |
CYP1A2 substrate | 0.350 | 0.032 | 0.036 | 0.034 |
CYP2C19 inhibitor | 0.908 | 0.029 | 0.017 | 0.115 |
CYP2C19 substrate | 0.061 | 0.045 | 0.029 | 0.032 |
CYP2C9 inhibitor | 0.917 | 0.013 | 0.47 | 0.583 |
CYP2C9 substrate | 0.097 | 0.207 | 0.036 | 0.056 |
CYP2D6 inhibitor | 0.848 | 0.016 | 0.012 | 0.120 |
CYP2D6 substrate | 0.085 | 0.134 | 0.090 | 0.115 |
CYP3A4 inhibitor | 0.885 | 0.014 | 0.030 | 0.191 |
CYP3A4 substrate | 0.863 | 0.003 | 0.013 | 0.088 |
Excretion |
CL h | 1.55 | 1.267 | 8.182 | 1.988 |
T1/2i | 0.237 | 0.921 | 0.97 | 0.862 |
a Optimal: higher than − 5.15 Log unit. b Low permeability: < 2 × 10− 6 cm/s. Medium permeability: 2–20 × 10− 6 cm/s. High passive permeability: > 20 × 10− 6 cm/s. c Category 1: inhibitor; Category 0: non-inhibitor. The output value is the probability of being Pgp-inhibitor. d Category 1: substrate; Category 0: non-substrate. The output value is the probability of being Pgp-substrate. e Plasma protein binding. Optimal: < 90%. Drugs with high protein-bound may have a low therapeutic index. f Volume distribution. Optimal: 0.04-20 L/kg. g Category 1: inhibitor or substrate; Category 0: non-inhibitor or non-substrate. The output value is the probability of being inhibitor or substrate. h Clearance. High: > 15 mL/min/kg; moderate: 5–15 mL/min/kg; low: < 5 mL/min/kg. i Category 1: long half-life; Category 0: short half-life. long half-life: > 3 h; short half-life: < 3 h. The output value is the probability of having long half-life.
Table 5
Predicted toxicity properties of the reference dabrafenib and top-hit natural compounds.
Molecular property | Dabrafenib | Scutellarin | Digallic acid | (-)-Balanol |
hERG blockers a | 0.345 | 0.149 | 0.015 | 0.370 |
H-HT b | 0.997 | 0.138 | 0.407 | 0.139 |
DILI c | 0.994 | 0.975 | 0.811 | 0.986 |
Skin sensitization d | 0.051 | 0.346 | 0.931 | 0.084 |
Carcinogen city e | 0.17 | 0.092 | 0.018 | 0.066 |
Eye corrosion f | 0.003 | 0.003 | 0.058 | 0.003 |
Eye irritation g | 0.008 | 0.076 | 0.907 | 0.011 |
a Category 1: active; Category 0: inactive; The output value is the probability of being active. b Human hepatotoxicity. Category 1: H-HT positive (+); Category 0: H-HT negative (-); The output value is the probability of being toxic. c Drug induced liver injury. Category 1: drugs with a high risk of DILI; Category 0: drugs with no risk of DILI. The output value is the probability of being toxic. d Category 1: sensitizer; Category 0: non-sensitizer; The output value is the probability of being sensitizer. e Category 1: carcinogens; Category 0: non-carcinogens; The output value is the probability of being toxic. f Category 1: corrosives; Category 0: non-corrosives; The output value is the probability of being corrosives. g Category 1: irritants; Category 0: non-irritants; The output value is the probability of being irritants.
An ideal drug candidate should meet ADME and toxicology standards, reflected by acceptable molecular descriptor values [51, 52]. Employing the SBVS method, we identified certain hit molecules and evaluated their pharmacokinetic properties. We used ADMETlab2.0, an online tool, to compute the pharmacokinetic properties of three candidate compounds and dabrafenib, the results of which are presented in Table 4. The predicted permeability of the Caco-2 cell line, a model of human colon carcinoma, seems to mirror the intestinal absorption capacity of the ligand. Our observations revealed that all hit and positive compounds exhibited Caco-2 permeability values below 5.15, suggesting that these compounds may not be suitable for oral administration without modification to mitigate adverse properties. Regarding Madin-Darby Canine Kidney (MDCK) permeability, all candidate natural products fall within the recommended range, implying some permeability despite sub-optimal absorption. Furthermore, although high Plasma Protein Binding (PPB) can interfere with drug bioavailability, it may also prolong a drug's half-life, and vice versa. The PPB values of these potential compounds exceeded the recommended range, implying possible bioavailability interference but potential half-life enhancement, a hypothesis supported by the clearance (CL) and half-life (T1/2) predictions. The drug reaches other body parts via bloodstream circulation, followed by hepatic metabolism [53]. A set of cytochrome P450 family enzymes can metabolize and expel these compounds from the body as bile and urine. Certain bioactive substances serve as substrates for these enzymes, undergoing metabolism by the corresponding CYP450 enzymes [54]. In contrast, some bioactive substances inhibit these enzymes, disrupting the biodegradation process. The results highlight that the active compounds inhibit CYP2C19, CYP2C9, CYP2D6, and CYP3A4. Scutellarin, a natural product, may show affinity for CYP2C9; Digallic acid could be a CYP2C9 inhibitor; and (-)-Balanol might antagonize both CYP1A2 and CYP2C9.
The potential toxicities of active and candidate compounds, displayed in Table 5, identify a shared risk of potential drug-induced liver injury. In addition, scutellarin displayed moderate skin sensitization, digallic acid exhibited moderate human hepatotoxicity alongside notable skin sensitization and eye irritation, and (-)-Balanol is a mild Herg blocker. These findings suggest oral administration might be the least toxic method, subject to further experimental verification.
Table 6
FMOs energetic parameters for the reference dabrafenib and top-hit natural compounds in water phase.
Compound | EHOMO (eV) | ELUMO (eV) | ΔEgap (eV) | Potential ionization I (eV) | Electron affinity A (eV) |
Dabrafenib | -0.222 | -0.062 | 0.161 | 0.222 | 0.062 |
Scutellarin | -0.212 | -0.063 | 0.149 | 0.212 | 0.063 |
Digallic acid | -0.220 | -0.051 | 0.169 | 0.220 | 0.051 |
(-)-Balanol | -0.221 | -0.074 | 0.147 | 0.221 | 0.074 |
Table 7
Global reactivity descriptors for the reference dabrafenib and top-hit natural compounds in water phase.
Compound | Hardness (η) | Softness (σ) | Electronegativity (χ) | Chemical potential (µ) | Electrophilicity index (ω) |
Dabrafenib | 0.080 | 6.22 | 0.142 | -0.142 | 0.125 |
Scutellarin | 0.074 | 6.72 | 0.138 | -0.138 | 0.127 |
Digallic acid | 0.085 | 5.91 | 0.135 | -0.135 | 0.108 |
(-)-Balanol | 0.074 | 6.80 | 0.148 | -0.148 | 0.148 |
Frontier molecular orbital analysis (FMO)
High-ranking natural product molecules and the reference compound, dabrafenib, were analyzed using frontier molecular orbital methods to delineate key electronic properties [55], inclusive of molecular geometry optimization, HOMO/LUMO analysis, electrostatic potential, frontier molecular orbital energy, energy parameters, and global reactivity descriptors. Frontier Molecular Orbitals (FMOs), specifically HOMO and LUMO, elucidate the molecular characteristics and reactivity in theoretical chemistry. HOMO and LUMO represent a compound's capacity to donate and accept or withdraw electrons, respectively. These factors critically influence the compound's chemical stability. The band gap, computed by juxtaposing the HOMO and LUMO states of a compound, serves to ascertain its chemical reactivity and kinetic stability. A smaller gap corresponds to elevated compound reactivity [56]. All compounds, except for digallic acid (ΔEgap=0.169 eV), demonstrated smaller bandgap values than dabrafenib, suggestive of their molecular reactivity. This reactivity potentially results in potent and stable interactions with the NEK7 protein. The HOMO and LUMO sites were plotted on the molecular surface orbitals, thereby determining the molecule's acceptability. Figure 6 exhibits the HOMO and LUMO maps (blue for the positive phase, red for the negative phase) of dabrafenib and three natural product molecules.
Additional insight into the reactivity and stability of compounds arises from factors like electronegativity, ionization potential, electron affinity, hardness, and softness [57]. Table 6 displays the collective reactivity descriptors of both drugs. Chemical hardness and softness also govern a molecule's stability and chemical reactivity. Natural products, scutellarin, and (-)-balanol, are likely to be more susceptible to chemical reactions or interactions due to their chemical softness and hardness ranking (Table 7). Electronegativity characterizes a compound's electron acceptance capability, while the electrophilic index (omega) quantifies the affinity of electron acceptors for additional charges from the environment. The natural compound (-)-balanol exhibits the highest electronegativity and electrophilic index, signifying its potential as a superior electron acceptor.
Additionally, the MEP of these four compounds was calculated to discern and explore the stereo-electronic complementarity between the ligand and its receptor, an essential feature for molecular recognition in protein-ligand interactions. It highlights the most electrophilic (susceptible to nucleophilic attack) and electronegative (prone to electrophilic attack) sites. These sites incorporate data about atoms capable of non-covalent interactions within the compound. Figure 7 illustrates the molecular electrostatic potentials of the three natural products and dabrafenib, with the compounds' surface MEP values denoted by gradient colors. Red, blue, and gray correspond to areas with the highest electronegative (electrophilic attack-prone), electropositive (nucleophilic attack-prone), and zero electrostatic potential, respectively. The electrostatic potential contour maps of the compounds reveal potential sites for electrophilic and nucleophilic attacks. Predominantly, the blue sites are located in the carbonyl region, while the red sites cluster within hydroxyl or amino structures.
Molecular dynamics simulation
Relative to molecular docking, molecular dynamics simulations, albeit more computationally demanding, offer a reliable and precise method to compute the temporal behavior of proteins, and to predict their future positions and momentum. This approach is employed in the exploration of the structure, dynamics, and thermodynamics of biomolecules and their complexes. It yields granular information concerning the fluctuations and conformational modifications observed in docked complexes, including protein-ligand interactions [57, 58]. Consequently, founded on the outcomes of molecular docking, molecular dynamics simulations spanning 100ns were conducted on four complexes to assess the stability of protein-ligand binding.
Root mean square deviation (RMSD)
The RMSD values of the complexes reveal conformational alterations relative to the initial structural simulation, serving as a tool to assess the stability and convergence of the simulation process [59]. For all four complexes studied, their RMSD variations remained within acceptable limits [60]. This finding suggests that the triad of natural products and dabrafenib established relatively stable structural interactions with the NEK7 protein. Fluctuations observed in the protein backbone during the simulation could be indicative of minor conformational shifts in NEK7 in the solution (refer to Fig. 8A). Specifically, the RMSD of the scutellarin-NEK7 complex (depicted in green) demonstrated a trend towards stabilization post 45 ns, stabilizing around 0.25 nm. Meanwhile, the digallic acid-NEK7 complex (illustrated in blue) exhibited a stable RMSD of 0.22 nm at 75 ns. The (-)-balanol-NEK7 complex (marked in purple) witnessed an overall stabilization following an initial period of acceptable fluctuations, with the RMSD stabilizing at around 0.20 nm post 45 ns. Notably, the dabrafenib-NEK7 complex (denoted in red) displayed minor backbone fluctuations between 55–70 ns, with RMSD peaking at approximately 0.35 nm. This observation suggests potential minor changes in the overall protein conformation, which finally achieved stability post 75 ns and sustained at approximately 0.23 nm until the simulation's conclusion.
Root mean square fluctuation (RMSF)
RMSF serves to expose fluctuations in the average position of protein residues throughout a simulation, thereby facilitating the investigation of the complex system's flexibility and binding effects within the active site [59]. Typically, regions exhibiting minimal fluctuations within the complex suggest a well-structured area with limited deformation. Figure 8B depicts the comparable dynamic fluctuation behaviors across all four systems. Notably, in comparison with the dabrafenib-NEK7 complex, the lower RMSF values associated with the residues related to the active pocket in the dynamic trajectory denote equal stability in the interactions of the three natural products with the NEK7 protein. Consequently, these natural products emerge as potential NEK7 inhibitors.
Radius of gyration (Rg)
The Rg is a crucial parameter in delineating the outcomes of molecular dynamics simulations [61]. It provides an assessment of a protein's compactness, where a lower Rg signifies a more compact structure in the protein-ligand complex throughout the simulation process, thereby denoting enhanced complex stability. As depicted in Fig. 9A, the Rg fluctuations for complexes formed by three natural products and dabrafenib with NEK7 are outlined. The simulated Rg for these selected natural compounds, including dabrafenib, ranged from 1.95 to 2.10 nm, demonstrating a steady state over the 100 ns dynamic trajectory. This suggests that the structural integrity of these molecules remained unaffected upon binding. Consequently, the close packing post-ligand binding leads to the formation of a stable protein complex.
Hydrogen bond (H-bond)
Hydrogen bonding significantly contributes to the binding strength between a ligand and a protein among various interaction methods. To examine the stability of the complex, the complete protein-ligand simulation trajectory is utilized to compute the mean quantity of hydrogen bonds within the complex [62]. The hydrogen bonding patterns of three identified natural compounds and dabrafenib in association with NEK7 are illustrated across the full simulation period, as indicated in Fig. 9B. The average hydrogen bond count calculated in the dabrafenib-NEK7, scutellarin-NEK7, digallic acid-NEK7, and (-)-balanol-NEK7 complexes are 0.719, 2.863, 3.291, and 5.227, respectively. Despite minor increases or decreases in the number of protein-ligand hydrogen bonds during the simulation, the three natural products formed, on average, more hydrogen bonds with NEK7 than dabrafenib did. This indicates that the robust binding ability of the natural products with NEK7 may be attributed to hydrogen bonding, aligning with the outcomes of the molecular docking process.
Free energy landscape (FEL)
FEL analysis is utilized to investigate the conformational distribution and stability of protein-ligand complex structures, inferred from the simulation trajectory. The complex stability is gauged by the minimal relative energy where lower free energy corresponds to heightened conformational stability [63]. Areas of lower energy are depicted in darker shades of blue. In this study, a free energy contour map has been formulated for the systems involving kinase NEK7 and four distinct ligands. The Gibbs Free Energy Landscape is derived from principal components serving as reaction coordinates, illustrated in Fig. 10. Throughout the simulation, the various accessible conformational states of the four complexes (NEK7-dabrafenib, NEK7-scutellarin, NEK7-digallic acid, and NEK7-(-)-balanol) can be delineated on the free energy landscape, accompanied by two principal components, PC1 and PC2. As depicted in Fig. 17, each of the four complexes exhibits one or more distinct basins of global energy minima, each corresponding to their conformational states. The dabrafenib-NEK7 system shows considerable conformational alterations, as evidenced by the broad and dispersed basin on the FEL, thereby underlining the instability of this complex. Compared to other complexes, the conformations of scutellarin, digallic acid, and (-)-balanol with NEK7 are more compact and centralized on the FEL, suggesting these ligands bind closely with NEK7, thereby adopting stable conformations.
Vector movements
We further employ Porcupine analysis, grounded in Principal Component Analysis (PCA), to scrutinize the fundamental protein movements vital for biological function [64]. MD simulations combined with PCA-informed FEL analysis facilitate an expansive exploration of the conformational dynamics of protein-ligand complexes. The Porcupine plot identifies the dominant features of movement, revealing the direction and magnitude of selected eigenvectors for every backbone atom (Fig. 11). Within the plot, cones symbolize the movement direction, and their length denotes the amplitude of the motion. The regions with residue indices 11–21, 88–95, 138–145, 200–209, 226–231, 280–296, 327–335, and 348–355 exhibit the most noticeable basic motions of the backbone residues, mirroring the RMSF of NEK7. Observations reveal a lesser amplitude of motion in the residues of scutellarin-NEK7, digallic acid-NEK7, and (-)-balanol-NEK7 complexes than in other complexes, with (-)-balanol showing the minimal amplitude. Loop movements predominate in all systems, which validates our simulations that the stability of scutellarin-NEK7, digallic acid-NEK7, and (-)-balanol-NEK7 complexes surpasses that of the dabrafenib-NEK7 complex.
Dynamic cross-correlation matrices analysis (DCCM)
A DCCM analysis was implemented to examine the correlated movements of structural domains, thereby stabilizing and confirming the MD trajectories following the ligand interaction of Mur enzymes [65]. The dynamic cross-correlation diagram highlighted the protein's residual fluctuations, indicative of the positive and negative correlations between residue displacements throughout the simulation. The NEK7 enzyme complex, in combination with hit natural products and dabrafenib, was evaluated for positive, negative, and non-correlated movements. The color-coding scheme used to denote correlation strength represented positive correlations in blue and negative correlations in red. Figure 12 provides a visual representation of the correlation diagrams, showcasing the interactions between the compound and structural domains during the simulation process that subsequently impact the stability of protein-ligand complexes. Although four types of complexes exhibited correlated and anti-correlated movements, the three natural molecules forming complexes with NEK7 demonstrated superior stability compared to dabrafenib, as their residues displayed less extreme positive and negative correlated movements.
Secondary structure profile analysis
Structural discrepancies contribute to differences in protein stability [66]. We employed the DSSP module in GROMACS to explore alterations in secondary structural (SS) elements' patterns (α-Helix, β-Sheet, β-Bridge, and Turn) post-ligand binding during simulation, affirming their conformational behavior and folding mechanisms. Figure 13 delineates the secondary structural elements' distribution following complex formation between NEK7 and the target molecules. The DSSP analysis implies that the secondary structural elements in the NEK7-target molecule complexes predominantly preserve as unbroken coils and invariant α-Helix and β-Sheet, with scarce evidence of twisted, turned, or β-Bridge conformations (see Fig. S2). Additionally, we discerned a notable trend in the dynamic trajectory: the structural domain formulated by residues 35–40 presented as a coil in the NEK7-dabrafenib complex, whereas in the other three natural product-NEK7 complexes, these structures were more orderly. This observation potentially signals enhanced stability for the natural molecule-NEK7 complexes during the simulation.
Molecular mechanics Poisson-Boltzmann surface area (MM/PBSA)
To validate and delve into the simulation study findings, we performed calculations of binding free energy, predicting the average binding free energy for protein-ligand complexes [67]. This enabled us to understand better the energy contributions of residues interacting within the binding free energy curve. Generally, a lower binding energy correlates with increased system stability. Table S1 details the binding energy parameters, encompassing Van der Waals, electrostatic, polar solvent, nonpolar solvent, total gas-phase free energy, total solvation free energy, and total binding energy. The computations revealed that the binding free energies for scutellarin, digallic acid, and (-)-balanol are − 35.1397, -40.7140, and − 43.8438 kcal/mol, respectively. The hit compounds' binding free energy is significantly lower than that of dabrafenib (Fig. 14), implying stronger binding of these compounds to NEK7 than dabrafenib's binding to NEK7. This suggests that these compounds might have potent inhibitory impacts on NEK7.
Among all energy components of protein-ligand complexes, the Van der Waals forces emerge as the principal interaction. Notably, electrostatics also significantly contributes to the system's stability. Conversely, the polar solvation energy, being positive, adversely affects the binding energy. This effect is especially pronounced in the three natural products, which possess greater polar solvation energies than dabrafenib. While the nonpolar solvation energy's importance is less pronounced, its negative value implies a favorable contribution to the binding energy.
Based on the MD simulation results and binding free energy, we executed energy decomposition on amino acid residues and analyzed individual residue contributions. Overall, the key residues' energy of the hit natural products is lesser than that of dabrafenib. Specifically, residues Ile 20, Val 28, Ile 75, Leu91, Glu 92, Leu 93, Ala94, Asp95, Lys 143, Phe 148, Asp 159, Leu 160, and Thr 161 significantly influence the binding effect (see in Fig. 15). Among these, residues Ile 20, Val 28, Ile 75, Leu 93, Ala 94, Lys 143, Phe 148, Leu 160, and Thr 161 facilitate compound binding most effectively, whereas residues Glu 92 and Asp 95 are unfavorable for compound binding to NEK7.