Structural Parameter Variations with Method. Single crystals of commercially available compounds, including 2, 9, 10, 11, 14, 15, and 16, were obtained at room temperature for X-ray diffraction. The crystal data, data collection, and structure refinement details are described in published structure reports50–60. Structural parameters, such as bond lengths and angles, for these compounds, were calculated at the B3LYP level of theory using different basis sets (6-31G(d,p), 6-311G(d,p), and 6-311 + + G(d,p)). A comparison was made between the computational results and the experimental data to assess the method's precision54–61. This analysis evaluated any discrepancies or variations between the calculated and experimental data50–61. The structures of compounds 2, 9, 10, 11, 14, 15, and 16, along with their respective bond lengths and angles obtained from the selected methods, are depicted in Fig. 1 (refer to Figure S1 to S18 and Table S1 to S19 in the SI). The summary of the computed and experimental values is presented in Table 1 (See Table S21 in the SI).
Upon comparing the optimized and crystal structures of the compounds, we observed that the position of the carbon six (C6) group attached to the chromone ring differed, resulting in calculated bond lengths in the chromone ring that deviated from the crystal data55–60. To investigate the impact of EDG and EWG at the C6 position of the chromone ring, we conducted ab initio calculations. Our findings demonstrated that the bond lengths and angles of the EWG series (Y = F, Cl, or Br) at the C6 position increased from fluorine to bromine, with the bromine atom stabilizing the chromone ring due to its high polarizability. In the gas phase, compounds 2 and 10 exhibited mean deviations (MDs) in bond distances ranging from 0.677 Å to 0.728 Å, while compounds 9 and 11 displayed MDs ranging from 0.812 Å to 0.863 Å. All tested basis sets at the B3LYP level of theory exhibited similar performance in predicting the bond lengths and angles of compounds 2, 9, 10, and 11. It was observed that the inclusion of diffuse functions in the basis set had minimal impact on the geometries of these compounds at the B3LYP/6-311 + + G(d,p) level of theory. Specifically, the B3LYP/6-31G(d,p) method provided good agreement with experimental results61–67 in estimating the bond lengths and angles of compounds 2, 9, 10, 11, 14, 15, and 16 (Tables 1 and S20 in the SI). Hence, B3LYP/6-31G(d,p) can yield acceptable results while reducing computational time.
Table 1 Selected Bond Distances (Å) and Angles (deg) for 3-Formyl Chromone Derivativesa,b
a The values in parenthesis represent the difference between the experimental and calculated values
b MD is the mean deviation using units Å and deg. c Reference 1–7.
Analysis of Frontier Molecular Orbitals. Investigating the FMO in the 3-formyl chromone derivatives (1–16) offers vital insights into their stability and reactivity61. The energy gap (Egap) between the HOMO and LUMO known as the HOMO-LUMO energy gap, is a critical factor that determines their chemical reactivity, hardness, softness, chemical potential, and electrophilic index. A narrow Egap indicates softness, signifying high reactivity but low stability, while a wide Egap indicates high stability and low reactivity. The energy levels of the HOMO and LUMO orbitals help in understanding the compounds electron-donating and accepting properties, respectively. Our study presents the findings of FMO analysis, which are summarized in Table 2 and illustrated in Fig. 2 (see Figures S1 to S18 in the Supplementary Information), providing a comprehensive understanding of the FMOs of the compounds.
The FMO analysis of 3-formyl chromone derivatives (1–16) reveals essential insights into their stability and reactivity. Among the 6-substituted derivatives (9, 10, and 11), the HOMO-LUMO gap decreases in the order F > Cl > Br, ranging from 4.557 eV to 4.542 eV (Table 2). Soft ligands (Y = Br) exhibit a smaller HOMO-LUMO gap due to the greater polarizability of their valence electrons compared to fluorine (Table 2). This enhanced polarizability contributes to the electron cloud distortion in C6-halide substituted 3-formyl chromone derivatives. The soft base (Br), characterized by high polarizability and low electronegativity, and further decreases the HOMO-LUMO gap in C6-X substituted chromone rings. Conversely, the hard base (F) with higher electron density exhibits a larger HOMO-LUMO gap. A smaller gap indicates softness, associated with high reactivity and low stability, while a wider gap implies high stability and low reactivity. The energy levels of the HOMO and LUMO orbitals provide insights into the compounds' electron acceptor and donor properties, respectively.
Table 2
The HOMO and LUMO energies, IP, EA, electronegativity (χ), chemical potential (µ), global hardness (η), softness (σ), electrophilicity (ω), and dipole moment (Debye) of all compounds (1–16) at 298.15 K calculated using B3LYP/6-31G(d,p).a,b
Ligand | EHOMO (ev) | ELUMO (ev) | EGap (ev) | IP (ev) | EA (ev) | χ (ev) | µ (ev) | η (ev) | σ (ev) | ω (ev) | Dipole (D) |
1 | -5.926 | -2.053 | 3.874 | 5.926 | 2.052 | 3.989 | -3.989 | 1.936 | 0.516 | 4.108 | 3.425 |
2 | -6.424 | -1.822 | 4.601 | 6.424 | 1.822 | 4.122 | -4.122 | 2.300 | 0.434 | 3.694 | 6.457 |
3 | -6.422 | -1.829 | 4.592 | 6.422 | 1.829 | 4.126 | -4.126 | 2.296 | 0.435 | 3.707 | 6.401 |
4 | -6.421 | -1.835 | 4.585 | 6.421 | 1.835 | 4.128 | -4.128 | 2.292 | 0.436 | 3.716 | 6.313 |
5 | -6.430 | -1.856 | 4.573 | 6.430 | 1.856 | 4.143 | -4.143 | 2.286 | 0.437 | 3.752 | 7.784 |
6 | -6.335 | -1.814 | 4.521 | 6.335 | 1.814 | 4.075 | -4.075 | 2.260 | 0.442 | 3.672 | 7.763 |
7 | -6.295 | -1.789 | 4.505 | 6.295 | 1.789 | 4.042 | -4.042 | 2.252 | 0.443 | 3.626 | 7.730 |
8 | -5.925 | -1.692 | 4.233 | 5.925 | 1.692 | 3.809 | -3.809 | 2.116 | 0.472 | 3.428 | 6.944 |
9 | -6.619 | -2.062 | 4.557 | 6.619 | 2.062 | 4.340 | -4.340 | 2.278 | 0.438 | 4.134 | 6.410 |
10 | -6.681 | -2.137 | 4.543 | 6.681 | 2.137 | 4.409 | -4.409 | 2.271 | 0.440 | 4.279 | 6.409 |
11 | -6.672 | -2.129 | 4.542 | 6.672 | 2.129 | 4.401 | -4.401 | 2.271 | 0.440 | 4.263 | 6.172 |
12 | -6.899 | -2.467 | 4.432 | 6.899 | 2.467 | 4.683 | -4.683 | 2.216 | 0.451 | 4.949 | 7.531 |
13 | -6.941 | -2.927 | 4.014 | 6.941 | 2.927 | 4.934 | -4.934 | 2.007 | 0.498 | 6.067 | 7.482 |
14 | -6.655 | -2.117 | 4.538 | 6.655 | 2.117 | 4.386 | -4.386 | 2.269 | 0.441 | 4.239 | 4.986 |
15 | -6.843 | -2.342 | 4.502 | 6.843 | 2.342 | 4.592 | -4.592 | 2.251 | 0.444 | 4.685 | 4.822 |
16 | -6.802 | -2.308 | 4.493 | 6.802 | 2.308 | 4.555 | -4.555 | 2.247 | 0.445 | 4.618 | 5.823 |
vitexin | -5.920 | -1.746 | 4.175 | 5.920 | 1.746 | 3.833 | -3.833 | 2.087 | 0.479 | 3.519 | 4.274 |
myricetin | -5.679 | -1.722 | 3.957 | 5.679 | 1.722 | 3.700 | -3.700 | 1.978 | 0.505 | 3.460 | 6.758 |
D | -5.879 | -0.551 | 5.328 | 5.879 | 0.551 | 3.215 | -3.215 | 2.664 | 0.375 | 1.939 | 1.999 |
aCalculated using equations in the method section. bD (Reference drug: Dapagliflozin) |
For the 6-hydroxy substituted chromone derivatives, the HOMO-LUMO gap of compound 5 (6-hydroxy) is 4.573 eV, lower than that of compound 4 (6-OCH2CH3, 4.505 eV) and compound 3 (6-NH2, 4.233 eV), due to the electronic effects of the electron-rich chromone ring in 3-formyl chromone derivatives. Additionally, the presence of C6 alkyl substituents (-CH3, -CH2CH3,-CH (CH3)2) leads to a decreasing trend in the HOMO-LUMO gap, likely due to steric effects (Table 2). Compounds 2–16 exhibit higher energy gaps than the parent compound 1, indicating changes caused by EDG and EWG at the 6-substituted position. Vitexin and myricetin display lower energy gaps than the other compounds (2–16). The reference drug, dapagliflozin (D), has a higher energy gap than the other compounds. The energy gap values in this series (compounds 1 to 16) decrease in the following order: D (5.328 eV) > 2 (4.601 eV) > 3 (4.592 eV) > 4 (4.585 eV) > 5 (4.573 eV) > 9 (4.557 eV) > 10 (4.543 eV) > 11 (4.542 eV) > 14 (4.538 eV) > 15 (4.502 eV) > 16 (4.493 eV) > 12 (4.432 eV) > 8 (4.233 eV) > vitexin (4.175 eV) > 13 (4.014 eV) > myricetin (3.957 eV) > 1 (3.874 eV).
The results indicate that compounds (2–16) have higher reactivity than the reference drug D, as evidenced by their higher electronegativity (χ), chemical potential (µ), global hardness (η), and electrophilicity index (ω), as well as their lower softness (σ). The electrophilicity index62 is a commonly used parameter for predicting biological activity and identifying reactive sites by measuring the energy lowering resulting from electron transfer between the HOMO and LUMO. Meanwhile, the lower electrophilicity index of compound 8 (3.428 eV) and reference drug D (1.939 eV) provides a basis for further analysis of its potential biological activity through molecular docking with a suitable protein. The higher dipole moment of compounds (2–16) than the reference drug suggested that they may have a better binding affinity. These findings make our compounds promising candidates for further investigation of their biological activity through molecular docking studies.
NBO analysis can provide insights into the bonding and stability of molecules by examining the interactions between atoms and the distribution of atomic charges. The stabilization energies of the studied molecules are affected by intramolecular charge transfer, as shown in Figures S1 to S18 in the SI. MEP maps in Fig. 2 (S1 to S18 in the SI) visually represent the electrostatic potential of the molecules and may be used to define areas of electrophilicity and nucleophilicity. The colors used in the MEP maps (blue for electrophilic regions and red for nucleophilic regions) indicate the strength of the electrical potential.
Table 3
In silico prediction of physicochemical parameters for the derivatives of 3-formyl chromone (1 − 16)a
Ligand | MW | LogP | HBA | HBD | nroth | TPSA (Å2) |
Lipinski* | ≤ 500 | ≤ 5 | ≤ 5 | ≤ 10 | ≤ 5 | ̶ |
Veber** | ̶ | ̶ | ̶ | ̶ | ≤ 10 | ≤ 140 |
1 | 174.15 | 1.46 | 3 | 0 | 1 | 47.28 |
2 | 188.18 | 1.79 | 3 | 0 | 1 | 47.28 |
3 | 202.21 | 2.09 | 3 | 0 | 2 | 47.28 |
4 | 216.23 | 2.4 | 3 | 0 | 2 | 47.28 |
5 | 190.15 | 1.03 | 4 | 1 | 1 | 67.51 |
6 | 204.18 | 1.44 | 4 | 0 | 2 | 56.51 |
7 | 218.21 | 1.78 | 4 | 0 | 3 | 56.51 |
8 | 199.16 | 0.89 | 4 | 0 | 1 | 71.07 |
9 | 192.14 | 1.76 | 4 | 0 | 1 | 47.28 |
10 | 208.60 | 1.98 | 3 | 0 | 1 | 47.28 |
11 | 253.05 | 2.08 | 3 | 0 | 1 | 47.28 |
12 | 189.17 | 1.2 | 3 | 1 | 1 | 73.30 |
13 | 219.15 | 0.65 | 5 | 0 | 2 | 93.10 |
14 | 253.05 | 2.08 | 3 | 0 | 1 | 47.28 |
15 | 243.04 | 2.5 | 3 | 0 | 1 | 47.28 |
16 | 245.06 | 2.23 | 3 | 0 | 1 | 43.37 |
vitexin | 432.38 | -0.02 | 10 | 7 | 3 | 181.05 |
myricetin | 318.24 | 0.79 | 8 | 6 | 1 | 151.59 |
D | 408.87 | 2.17 | 6 | 4 | 6 | 99.38 |
a *Lipinski reference values; **Veber reference values; MW, molecular weight; |
LogP, lipophilicity (O/W); HBD, number of hydrogen bond donors; HBA, number
of hydrogen bond acceptors; nroth, Number of rotatable bonds;TPSA, topological
polar surface area (Å2). D (Reference drug: Dapagliflozin)
Analysis of Physicochemical and Pharmacokinetic Properties. To determine compliance with Lipinski and Veber's criteria, it is essential to evaluate the physicochemical characteristics of 3-formyl chromone derivatives (1–16). Before a chemical can be administered orally, it must meet the five criteria as per Lipinski's recommendations: (a) molecular weight (MW) of 500 g/mol or less; (b) octanol-water partition coefficient (log P ≤ 5); (c) no more than five H-bond donors (HBD); no more than ten H-bond acceptors (HBA); and (e) no more than 140 Å2 of topological polar surface area (TPSA) 63–64. Additionally, Veber has proposed two more requirements for drug bioavailability: (a) TPSA must be less than or equal to 140 Å2 (as per Lipinski's standards), and (b) the number of rotatable bonds (nrotb) in the molecules should be less than 10. The compounds (1–16) with the most promising biological activity were analyzed using SwissADME to assess their compliance with the Lipinski and Veber criteria. All compounds (1–16) were found to meet the prescribed limit ranges specified by the Lipinski and Veber criteria (Table 3). Additionally, there was a strong agreement (less than 6) for bioactive compounds regarding molecular weight (< 500 g/mol), MLOGP (4.15), and Log S (ESOL) within the specified limit ranges. Moreover, all compounds (1–16) that obtained a drug-like (bioavailability) score of 1 demonstrated their adherence to the criteria, providing robust theoretical support for developing innovative novel drugs.
The drug-likeness parameters calculated using molinspiration for all 16 compounds are presented in Table 4, which includes their potential as G protein-coupled receptor (GPCR) ligands, ion channel modulators (ICM), kinase inhibitors (KI), nuclear receptor ligands (NRL), protease inhibitors (PI), and enzyme inhibitors (EI). Among these, compound 4 showed a GPCR value of -0.66, and compared to 0.15 for the reference drug D (dapagliflozin).
Table 4
Drug-likeness assessment of 3-formyl chromone derivatives (1 − 16) by molinspiration
Ligand | GPCR | ICM | KI | NRL | PI | EI |
1 | −1.03 | −0.88 | −1.06 | −0.91 | −1.64 | −0.36 |
2 | -0.97 | -0.94 | -1.01 | -0.81 | -1.56 | -0.39 |
3 | -0.75 | -0.70 | -0.90 | -0.57 | -1.28 | -0.21 |
4 | -0.66 | -0.66 | -0.74 | -0.43 | -1.17 | -0.16 |
5 | -0.81 | -0.70 | -0.79 | -0.48 | -1.46 | -0.17 |
6 | -0.84 | -0.82 | -0.80 | -0.62 | -1.42 | -0.28 |
7 | -0.78 | -0.77 | -0.77 | -0.48 | -1.30 | -0.27 |
8 | -0.74 | -0.72 | -0.64 | -0.47 | -1.26 | -0.15 |
9 | -0.85 | -0.80 | -0.86 | -0.70 | -1.52 | -0.29 |
10 | −0.91 | −0.79 | −0.94 | −0.80 | −1.51 | −0.34 |
11 | −1.13 | −0.98 | −1.00 | −1.04 | −1.71 | −0.45 |
12 | -0.81 | -0.67 | -0.68 | -0.86 | -1.31 | -0.12 |
13 | −0.89 | −0.70 | −0.84 | −0.68 | −1.38 | −0.33 |
14 | −1.21 | −0.92 | −1.12 | −1.10 | −1.82 | −0.46 |
15 | −0.91 | −0.89 | −0.87 | −0.67 | −1.33 | −0.25 |
16 | -0.74 | -0.61 | -0.89 | -0.59 | -1.33 | -0.22 |
vitexin | 0.13 | -0.14 | 0.19 | 0.23 | 0.03 | 0.46 |
myricetin | -0.06 | -0.18 | 0.28 | 0.32 | -0.20 | 0.30 |
D | 0.15 | -0.07 | -0.05 | 0.09 | 0.06 | 0.25 |
D (Reference drug: Dapagliflozin) |
In drug development, it is crucial to assess the pharmacokinetic properties, such as absorption, distribution, metabolism, excretion, and toxicity (ADMET), to ensure innovative drugs' efficient and economical creation. This study utilized SwissADME (http://www.swissadme.ch/index.php) and admetSAR (http://lmmd.ecust.edu.cn/admetsar2/) software to evaluate the ADMET properties of all 16 compounds (1–16). The evaluation involved seven essential ADMET characteristics, as listed in Table 5, including cytochrome P450 enzymes (CYP3A4 and CYP2C19) inhibition, hERG inhibition, plasma protein binding (PPB), blood-brain barrier (BBB) penetration, human intestinal absorption (HIA), and synthetic accessibility (SA) score. It should be noted that drugs that affect the central nervous system (CNS) should have good blood-brain barrier penetration, whereas those that do should not penetrate the BBB. Low absorption is defined as < 0.1, medium absorption as 0.1-2, and high BBB penetration as > 264. The results indicate that all compounds (1–16) meet the ADMET standards for drug-likeness (bioavailability), promising for developing novel drugs.
Table 5
In silico prediction of selected ADMET parameters for the 3-formyl chromone derivatives (1 − 16)a
Ligand | bHIA | bBBB | bPPB | bCYP3A4 inhibition | bCYP2C19 inhibition | bhERG_ pIC50 | cSynthetic Accessibility score |
1 | 0.9939 | -0.6250 | 0.751 | -0.7657 | 0.5874 | -0.7838 | 2.43 |
2 | 0.9952 | -0.5750 | 0.846 | -0.7427 | -0.6462 | -0.7722 | 2.51 |
3 | 0.9950 | -0.5750 | 0.723 | -0.8213 | 0.6574 | -0.5211 | 2.52 |
4 | 0.9948 | -0.5 | 0.921 | 0.8012 | 0.6092 | -0.4744 | 2.62 |
5 | 0.9815 | -0.8500 | 0.769 | -0.7920 | -0.6630 | -0.8597 | 2.34 |
6 | 0.9909 | -0.6500 | 0.791 | -0.6403 | 0.6478 | -0.6015 | 2.43 |
7 | 0.9930 | -0.6750 | 0.716 | -0.8523 | 0.9446 | -0.5680 | 2.5 |
8 | 0.9944 | 0.5500 | 0.754 | 0.5078 | 0.5218 | -0.7564 | 2.3 |
9 | 0.9947 | -0.5750 | 0.787 | -0.7213 | 0.5464 | -0.7399 | 2.34 |
10 | 0.9944 | -0.5500 | 0.874 | -0.6170 | 0.6182 | -0.8247 | 2.34 |
11 | 0.9932 | -0.5750 | 0.879 | -0.5316 | 0.5415 | -0.7425 | 2.46 |
12 | 0.9938 | -0.6000 | 0.839 | 0.5140 | -0.5565 | -0.6906 | 2.66 |
13 | 0.9691 | -0.5250 | 0.72 | -0.6990 | -0.6065 | -0.9129 | 2.51 |
14 | 0.9932 | -0.5750 | 0.815 | -0.5316 | 0.5415 | -0.7636 | 2.55 |
15 | 0.9944 | -0.5500 | 0.839 | -0.6170 | 0.6182 | -0.7854 | 2.46 |
16 | 0.9944 | -0.5500 | 0.806 | -0.6170 | 0.6182 | -0.8011 | 2.60 |
vitexin | 0.6665 | -0.7000 | 0.845 | -0.8310 | -0.9240 | -0.4762 | 5.12 |
myricetin | 0.9071 | -0.7750 | 0.991 | 0.6951 | -0.9025 | -0.7812 | 3.27 |
D | 0.6268 | 0.6250 | 0.736 | -0.8763 | 0.5166 | 0.7719 | 4.52 |
aHIA: Human Intestinal Absorption (%); BBB: Blood-Brain Barrier penetration; |
PPB: plasma protein binding; CYP3A4: Cytochrome P4503A4; CYP2C19: Cytochrome P4502C19; |
hERG: human ether-a-go-go-related gene, hERG inhibition potential (pIC50), the potential risk for inhibitors ranges 5.5 − 6. bThe values are using admetSAR.cThe values are using swissADME. D (Reference drug: Dapagliflozin) |
Based on our study, most compounds showed BBB penetration values between − 0.5 and − 0.7, similar to vitexin (-0.70) and myricetin (-0.77). The range of pIC50 predictions for potential risk from hERG activity inhibitors is between − 0.4 to -0.8265. Our research showed that compound 4 had the lowest hERG pIC50 value of -0.4744, while all other compounds had values below the reference range. To assess the synthetic accessibility (SA) score of the drug-like compounds (1–16), which ranges from 1 (very simple) to 10 (very difficult), we employed a unique method [61]. The SA scores of all the compounds (1–16) ranged from 2.4 to 2.6, which was lower than the reference drug, with a high SA score of 4.52. Vitexin and myricetin have higher SA scores than compounds (1–16). Our findings suggest that all compounds (1–16) meet the ADMET standards, indicating drug-likeness (bioavailability) and potential for developing novel medications.
Analysis of Pharmacological Activities. For a comprehensive investigation of the potential pharmacological effects of the 3-formyl chromone derivatives (1–16), Multilevel Neighborhoods of Atoms (MNA) descriptors were applied using the Prediction of Activity Spectra for Substances (PASS) method. The use of MNA descriptors allowed for a unique and detailed characterization of the chemical structures, which aided in clarifying the compounds' potential biological functions. The PASS method can simultaneously predict various biological activities, such as mutagenicity, carcinogenicity, teratogenicity, embryotoxicity, and primary and side pharmacological effects. A compound's biological activity is influenced by its structural and physicochemical characteristics, the biological entity (such as species, gender, age, etc.), and the treatment approach (such as dose, route of administration, etc.). The MNA descriptors are used by PASS to determine the probable activity (Pa) and probable inactivity (Pi) for the anticipated activity spectrum of a drug. These probabilities range from 0.000 to 1.000, where Pa + Pi ≠ 1. PASS predictions can be interpreted in several ways. For example, if Pa > 0.7, there is a high likelihood of finding the activity experimentally. If Pa is between 0.5 and 0.7, the chemical is likely to demonstrate the activity in an experimental setting, but it is likely different from recognized pharmaceutical drugs. The likelihood of finding the activity experimentally is lower if Pa < 0.5.
3-formyl chromone derivatives (1–16), except some of them exhibited Pa values greater than 0.5, as presented in Table 6. The histidine kinase inhibitor values ranged from Pa = 0.614 to Pa = 0.830, nearly identical to those of the vitexin (Pa = 0.819) and myricetin (Pa = 0.892). But the reference drug, dapagliflozin, has a lower Pa (0.439) value as a histidine kinase inhibitor. Compound 4 had a Pa value of 0.669 and a Pi value of 0.010 as a histidine kinase inhibitor, indicating that it was good in promoting antibacterial activity by targeting bacterial histidine kinase, although more potent than dapagliflozin. Dysregulated kinase activity is frequently associated with multiple disorders, including cancer. Kinases are enzymes that are essential for many biological functions, including cell proliferation, differentiation, and signaling. Moreover, all compounds, except for compounds 9 and 13, showed insulysin inhibitor values between Pa = 0.515 to Pa = 0.729, indicating that these compounds could potentially be therapeutic agents for diabetes treatment. Using a clearance mechanism anchored on degradation, IDE controls the amount of circulating insulin in several organs66. The reference drug shows no activity as an insulysin inhibitor. HIF1α expression is frequently increased in cancer cells and is linked to metastasis and tumor formation. Compound 4 was found to have a Pa value of 0.738 and a Pi value of 0.016 as a HIF1α expression inhibitor, indicating moderate potency in inhibiting the expression of HIF1α. Additionally, some compounds showed stronger activity as alcohol dehydrogenase inhibitors than the reference drugs.
Our findings revealed that the Pa values of the studied compounds, acting as both insulin inhibitors (anti-diabetic) and alcohol dehydrogenase inhibitors, ranged from 0.5 to 0.7, with only a few exceptions. This indicates a strong potential for these compounds to inhibit both insulin and alcohol dehydrogenase enzymes effectively. Additionally, our compounds exhibited apoptosis agonist activity, which is well-known for inducing cancer cell death.[67]
Table 6
Predicted biological activity of the 3-formyl chromone derivatives (1 − 16) using PASS
| Alcohol Dehydrogenase (NADP+) inhibitor (Anti-microbial) | Apoptosis Agonist (Anti-cancer) | HIF1α expression inhibitor (Anti-tumor) | Insulysin inhibitor (Anti-diabetic) | Histidine kinase inhibitor (Anti-bacterial) |
Ligand | Pa | Pi | Pa | Pi | Pa | Pi | Pa | Pi | Pa | Pi |
1 | 0.737 | 0.004 | 0.675 | 0.017 | 0.626 | 0.029 | 0.711 | 0.007 | 0.774 | 0.005 |
2 | 0.500 | 0.009 | 0.631 | 0.023 | 0.549 | 0.044 | 0.729 | 0.005 | 0.715 | 0.007 |
3 | 0.623 | 0.005 | 0.557 | 0.031 | 0.542 | 0.045 | 0.578 | 0.029 | 0.695 | 0.008 |
4 | 0.395 | 0.014 | 0.585 | 0.028 | 0.738 | 0.016 | 0.515 | 0.045 | 0.669 | 0.010 |
5 | 0.788 | 0.003 | 0.698 | 0.015 | 0.768 | 0.014 | 0.727 | 0.005 | 0.830 | 0.003 |
6 | 0.459 | 0.011 | 0.682 | 0.017 | 0.647 | 0.026 | 0.700 | 0.008 | 0.695 | 0.008 |
7 | 0.607 | 0.005 | 0.568 | 0.030 | 0.467 | 0.069 | 0.679 | 0.011 | 0.661 | 0.011 |
8 | 0.489 | 0.009 | 0.568 | 0.030 | 0.695 | 0.021 | 0.541 | 0.038 | 0.626 | 0.013 |
9 | 0.690 | 0.004 | 0.530 | 0.035 | 0.631 | 0.028 | 0.461 | 0.063 | 0.679 | 0.009 |
10 | 0.620 | 0.005 | 0.544 | 0.033 | 0.485 | 0.062 | 0.687 | 0.010 | 0.792 | 0.004 |
11 | 0.763 | 0.004 | 0.622 | 0.024 | 0.390 | 0.107 | 0.620 | 0.021 | 0.659 | 0.011 |
12 | 0.489 | 0.009 | 0.568 | 0.030 | 0.695 | 0.021 | 0.541 | 0.038 | 0.626 | 0.013 |
13 | 0.270 | 0.024 | 0.574 | 0.029 | 0.441 | 0.080 | 0.455 | 0.066 | 0.614 | 0.014 |
14 | 0.770 | 0.004 | 0.553 | 0.032 | 0.378 | 0.114 | 0.604 | 0.024 | 0.638 | 0.012 |
15 | 0.398 | 0.014 | 0.426 | 0.062 | 0.316 | 0.162 | 0.592 | 0.026 | 0.749 | 0.005 |
16 | 0.507 | 0.007 | 0.507 | 0.309 | 0.439 | 0.081 | 0.624 | 0.020 | 0.748 | 0.005 |
vitexin | 0.382 | 0.014 | 0.737 | 0.012 | 0.940 | 0.004 | - | - | 0.819 | 0.004 |
myricetin | 0.912 | 0.002 | 0.915 | 0.004 | 0.969 | 0.002 | 0.603 | 0.024 | 0.892 | 0.002 |
D | - | - | 0.292 | 0.131 | 0.469 | 0.068 | - | - | 0.439 | 0.041 |
D (Reference drug: dapagliflozin) |
In Silico Molecular Docking. Molecular docking investigation and in silico docking was performed on 3-formyl chromone compounds (1–16) and reference standards of vitexin to inhibit targeted proteins, including CAD (3TWO), BHK (3DGE), IDE (6BF8), HIF-α (2WA4), p53 (7EAX), COX (6Y3C), and Mpro of SARS-CoV2 (6LU7), as presented in Table 7. Results revealed that compound 4 had a strong binding affinity for IDE (-8.5 kcal mol− 1). Additionally, all docking values obtained were superior to the reference drug dapagliflozin (IDE: -7.9 kcal mol− 1), indicating their potential for diabetes treatment. Furthermore, compounds 1–13 showed good binding affinity, while compounds 14, 15, and 16, which had additional changes in their chemical structure, exhibited weaker binding affinity. Although Vitexin has a lower binding affinity (-8.3 kcal mol− 1) than compound 4, myricetin has a binding affinity (-8.5 kcal mol− 1) similar to compound 4.
The findings of the molecular docking study indicated that compounds 1–16 exhibited a higher binding affinity for IDE, BHK, and COX, while their binding affinity for human CAD, p53, Mpro, and HIF-α was relatively lower. These results suggest that these compounds have the potential to be used as therapeutic agents for diseases related to the dysregulation of IDE, BHK, and COX. While molecular docking is a valuable tool for drug discovery, it is crucial to consider a compound's pharmacokinetics, toxicity, and metabolism, in addition to its binding affinity. In vitro and in vivo research is necessary to comprehend how compounds behave in living organisms and evaluate their safety and efficacy. By employing a combination of computational and experimental techniques, researchers can optimize the compounds and create dependable and effective therapeutic drugs.
Table 7
Molecular docking simulation results for 3-formyl chromone derivatives (1–16) against seven targets:
Binding affinity (kcal mol− 1) |
Ligand | CAD (3TWO) | IDE (6BF8) | P53 (7EAX) | BHK (3DGE) | HIF-α (2WA4) | Mpro (6LU7) | COX (6Y3C) |
1 | -6.7 | -7.1 | -5.7 | -7.3 | -6.2 | -5.8 | -6.7 |
2 | -6.9 | -7.9 | -6.2 | -8.0 | -6.4 | -5.7 | -7.3 |
3 | -7.2 | -8.1 | -6.2 | -8.1 | -6.8 | -6.0 | -7.6 |
4 | -7.6 | -8.5 | -6.3 | -8.5 | -6.9 | -6.1 | -8.1 |
5 | -6.8 | -7.6 | -6.2 | -7.4 | -6.2 | -6.1 | -7.0 |
6 | -6.8 | -7.9 | -6.2 | -7.6 | -6.2 | -5.7 | -7.2 |
7 | -6.6 | -7.7 | -5.7 | -7.9 | -6.2 | -5.7 | -7.2 |
8 | -6.8 | -7.6 | -5.5 | -7.4 | -6.2 | -5.7 | -6.9 |
9 | -6.8 | -7.7 | -5.7 | -7.7 | -6.0 | -6.0 | -7.3 |
10 | -6.5 | -7.7 | -5.7 | -7.4 | -6.2 | -5.6 | -7.2 |
11 | -6.6 | -7.7 | -5.5 | -7.0 | -5.9 | -5.3 | -7.0 |
12 | -7.2 | -8.1 | -6.0 | -8.5 | -6.5 | -6.6 | -7.6 |
13 | -7.3 | -8.2 | -5.7 | -8.3 | -6.7 | -6.4 | -7.7 |
14 | -6.3 | -7.2 | -5.7 | -7.7 | -5.9 | -5.9 | -6.1 |
15 | -6.4 | -6.7 | -5.6 | -8.0 | -6.4 | -5.8 | -6.1 |
16 | -6.5 | -7.6 | -5.9 | -7.5 | -6.0 | -6.2 | -7.6 |
vitexin | -7.7 | -8.3 | -8.1 | -9.8 | -7.7 | -7.9 | -8.9 |
myricetin | -7.9 | -8.5 | -7.4 | -8.2 | -7.9 | -7.4 | -8.6 |
D | -8.0 | -7.9 | -7.2 | -8.9 | -7.4 | -7.1 | -7.1 |
D (Reference drug: dapagliflozin) |
Furthermore, Figs. 3 and 4 present 2D diagrams for compound 4 in IDE and COX, depicting the ligand's interactions with the protein and how it affects the pathogens' active components. The bond lengths and residue numbers are indicated in Fig. 3d and Fig. 4d, the ligand-protein interaction of compound 4 through the amino acid residues of IDE is demonstrated, revealing around eight different bonds. Four of these bonds are hydrophobic, including Pi-Pi stacking on residue TYR A: 269, Pi-alkyl on residue LEU A: 156, and alkyl on residue ALA A: 434. Three conventional hydrogen bonds are at residues THR A: 271, ARG A: 432, and THR A: 163. The last bond is Pi-anion on residue GLU A: 160. The protein-ligand interaction of compound 4 in COX, shown in Fig. 4, exhibits five different interactions. Four of them are hydrogen bonds, including a conventional hydrogen bond at residue HIS A: 207, a carbon-hydrogen bond with THR A: 206, and a Pi-donor hydrogen bond with HIS A: 388. One bond is hydrophobic, including amide-Pi stacking with ALA A: 202.
Based on the results of molecular docking and interaction analysis, it has been concluded that compound 4 exhibits a stronger binding affinity towards the IDE protein. Additionally, all compounds also show good interactions with the IDE protein. The 2D diagram and bond interactions of the IDE protein with all compounds are shown in SI (Table S2). Thus, the in silico study suggests that compound 4 has promising potential as a insulin-degrading enzyme protein inhibitor.
In Silico Molecular Dynamics. The goal of the MD simulations was to examine the stability and interactions of compound 4 with the IDE and COX proteins for a duration of 20 ns. The molecular docking analysis had previously shown that compound 4 had a strong binding affinity of − 8.5 kcal mol ̶ 1 with the IDE protein and a favorable binding affinity of -8.1 kcal mol− 1 with COX, indicating its potential as a drug candidate. The MD simulation results were evaluated using RMSD (Root Mean Square Deviation), RMSF (Root Mean Square Fluctuation), and Rg values, potential energies, temperature, and hydrogen bonding to gain a comprehensive understanding of the system's behavior over time.
The RMSD values derived from the MD simulations offer insights into ligand-protein complexes' stability and conformational changes, as presented in Fig. 5. For the ligand-IDE protein complex (green curve), the RMSD values were in the range of 0.2 to 0.5 nm, while for the ligand-COX protein complex (blue curve), the values were between 0.1 to 0.2 nm, indicating that both complexes are stable and exhibit minimal deviation from their initial positions during the simulation. However, the RMSD values for water and ions in the ligand-IDE protein complex (yellow curve) were approximately 7.7 nm, while those in the ligand-COX protein complex (brown curve) were about 7.3 nm. These observations suggest potential differences in the stability and dynamics of the solvent molecules in the two complexes, which could be due to the size and shape of the binding sites, as well as the specificity and strength of ligand-protein interactions. It is important to note that the RMSD values for water and ions can be affected by various simulation parameters, including force field, simulation time, and binding site definition. Therefore, further analyses, such as solvent density profiles, hydrogen bonding patterns, and residence times, are recommended to elucidate the underlying factors contributing to the differences in RMSD values.
The RMSF is an important measure of the flexibility and mobility of protein-ligand complexes in molecular dynamics simulations. In this study, separate RMSF values were calculated for the protein and ligand in the IDE (6BF8) and COX (6Y3C) protein-ligand complexes with compound 4, and the results are shown in Fig. 6. A lower RMSF value indicates greater rigidity of a particular residue or atom, whereas a higher RMSF value indicates greater flexibility or mobility. The COX protein-ligand complex had an RMSF value of 0.1 nm (red line), indicating that it is relatively rigidly held in place, while the IDE protein-ligand complex had an RMSF value of 0.3 nm (black line), indicating that it is more flexible. A fluctuation observed in the IDE protein-ligand complex at the 15000 range amino acid atom with a value of around 1.5 nm could be due to a number of factors, such as the inherent flexibility of the amino acid or its interaction with the ligand or solvent molecules. Additional analysis, such as examining the specific interactions of this residue with other parts of the protein or ligand, may provide further insights into the cause of this fluctuation. The differences in RMSF values between the COX and IDE protein-ligand complexes suggest that they have varying degrees of flexibility or mobility, which may have implications for their biological function. The fluctuation observed in the IDE protein-ligand complex at atom 15000 highlights the importance of careful analysis of simulation results and emphasizes the potential complexity of protein-ligand dynamics.
The radius of gyration (Rg) is a measure of molecular compactness and is often used to monitor conformational changes in MD simulations. In this study, the Rg values for compound 4 bound to the IDE and COX proteins, were compared over a 20 ns simulation period, as shown in Fig. 7. It was observed that the Rg values for the IDE (protein-ligand) complex exhibited slight fluctuations within the range of 3.4–3.5 nm, while the Rg value for the COX complex remained consistently at 3.1 nm throughout the simulation period. This suggests that the COX complex is more stable and less prone to conformational changes compared to the IDE complex. Additionally, the Rg value for the ligand alone was found to be 0.3 nm in both cases, indicating that the ligand assumes a more compact conformation when bound to the proteins. Furthermore, the Rg value for the water-ion was larger than that of the protein-ligand complex, indicating a more loosely packed structure for water-ions compared to the ligand-protein complex. Specifically, the Rg values for the water-ion were 5.72 nm (red line) and 5.485 nm (black line) in the COX and IDE protein-ligand complexes, respectively, over the simulation period. Understanding the behavior of protein-ligand complexes can be valuable for the rational design of drugs with improved binding affinity and specificity.
Hydrogen bonds play a crucial role in protein-ligand interactions, providing valuable information about binding strength and specificity. In this study, MD simulations were used to investigate the number of hydrogen bonds formed between compound 4 and the active site of two proteins, IDE and COX, as shown in Fig. 8.
The number of hydrogen bonds between the ligand and the active site of the IDE protein fluctuated between 0 and 2 during the 20 ns simulation period (Fig. 8a). This suggests that hydrogen bond formation between the ligand and the IDE protein needs to be consistently maintained and is influenced by many factors. Consequently, the IDE protein and ligand interaction is dynamic rather than static. On the other hand, the number of hydrogen bonds formed between the ligand and the active site of the COX protein ranged from 0 to 1 throughout the 20 ns simulation (Fig. 8b). This indicates that the hydrogen bond formation between the COX protein and the ligand is relatively stable and consistent, implying a more static and less dynamic interaction compared to the IDE protein-ligand interaction.
These findings demonstrate that multiple factors, including conformational changes, ligand movement, and the specific characteristics of the protein, influence the number of hydrogen bonds formed between a protein and a ligand. The observed dynamic nature of hydrogen bond formation in the IDE protein-ligand interaction holds significant implications for drug design and optimization. It emphasizes the importance of considering the dynamic aspects of protein-ligand interactions in rational drug design processes.
In MD simulations, monitoring the temperature of the system is crucial to ensure its stability. In this study, the IDE and COX proteins' temperature was relatively stable, fluctuating between 298 and 302 Kelvin during the 20,000 ps simulation period (Figs. 9a and 9c). This indicates that the simulation was well-controlled and the system remained within an appropriate temperature range. However, the system's potential energy, which reflects the interactions between atoms, showed fluctuations throughout the simulation period. For the IDE protein, the potential energy fluctuated between − 1.680e+ 06 to -1.677e+ 06 kJ mol− 1 (Fig. 9b), while for the COX protein, it varied between − 1.325e+ 06 to -1.323e+ 06 kJ mol− 1 (Fig. 9d). These fluctuations imply that the interactions between the atoms in the system are dynamic and continuously changing. In summary, the stable temperature observed in the study indicates a well-controlled simulation, but the fluctuation in potential energy highlights the dynamic nature of protein-ligand interactions.
PCA Analysis. Conformational principal component analysis (PCA) was performed on the simulated molecular dynamics (MD) trajectories of the IDE protein-ligand complex with compound 4 at 300K. The aim was to identify the variability, collective motions, and changes in protein conformations observed in subsets of the primary components throughout the MD simulations. The PCA analysis was conducted using the Bio3D program68–70. The resulting eigenvalues versus eigenvector plots are displayed in Fig. 10. The first three eigenvectors, namely PC1, PC2, and PC3, were utilized to compare the dominant motions within the smaller trajectory subgroup. Colored dots represent the captured variance by the eigenvectors. Regarding the internal motions observed in the MD trajectory, the protein-ligand complex simulation at 300K exhibited the most extensive variability in PC1, accounting for 26.13% of the total variance. PC2 showed a lower percentage of variation (16.3%), and PC3 showed 15.48% of the conflict. These three components accounted for 57.9% of the total variance (Fig. 10).
Furthermore, the simulation appeared to have converged, as evidenced by the cosine content value of the eigenvectors, which was computed to be 0.32 based on the MD trajectory. The conformational PCA analysis provided insights into the dominant motions and variability within the IDE protein-ligand complex during the MD simulation at 300K. These findings contribute to a better understanding of the dynamic behavior and conformational changes occurring in the protein-ligand complex.