Stability Analysis of MPro Protein during MD Simulation
We measured the stability of the MPro protein by calculating the RMSD (Root Mean Square Deviation) of the Cα atoms during MD simulations. As shown in Fig. 2A, the RMSD plot demonstrated high stability of the protein with deviations ranging between 0.05 and 0.08 nm. The RMSF (Root Mean Square Fluctuations) analyses of the backbone, backbone + Cβ, and sidechain atoms of MPro protein was subsequently performed to reveal the flexibility of the residues during simulations. As shown in Fig. 2B, a significant difference in RMSF values was observed between the backbone and sidechain atoms, as well as between the backbone + Cβ and sidechain atoms. The sidechain atoms showed fluctuations between 0.04 and 0.20 nm, while the backbone and backbone + Cβ atoms displayed more limited fluctuations in the range of 0.03 to 0.10 nm. Among the subsites forming residues, relatively higher fluctuations were observed in the sidechain atoms of Thr45, Asn119, Asn142, Arg188, and Gln189, which exhibited RMSF values in the range of 0.15 nm to 0.19 nm. The highest fluctuations of the sidechain chain atoms were observed for Thr45 forming the S2 site with RMSF value of 0.19 nm. Asn142, Arg188, and Gln189 have previously been reported28,29,30 to play a crucial role in ligand interactions. The high flexibility of these residues could allow the binding site to adapt to different ligand structures and sizes, facilitating the recognition and binding of various inhibitors.
Pocket Dynamics Analysis
To optimize the selection of protein conformations for virtual screening, we employed pocket-based clustering of the ensembles obtained from simulations. POVME31,32 was utilized to calculate the binding site volumes of 60 distinct frames of MPro were obtained from each simulation. These frames were sampled at 5 ns intervals during the 300 ns simulation. Active site pocket volumes ranged from 190 to 498 Å3, with surface areas varying between 188 to 380 Å2 during the MD simulations. The active site volumes in our study were slightly greater than previously reported33,34 due to the widening of subsite pockets and the formation of new pockets in the active site.
The POVME clustering workflow classified the binding sites within the sampled frames into five clusters, each representing frequently observed pocket shapes. The representative structure from each cluster is shown in Fig. 3. Differences between the binding site shape in each cluster originate from the opening or closing of regions that constitute the active site. For Clusters 1 through 4, a gradual widening of the S1 pocket is observed; however, this widening coincides with a concurrent reduction in the pocket's depth, resulting in a shallower S1 pocket configuration (shown in Fig. 3A-D). In Cluster 5, the shape and size of the S1 subsite is decreased significantly compared to the other clusters (shown in Fig. 3E). The S2 subsite, on the other hand, maintained its small size in all conformations, with a portion of it being deeply embedded and thus inaccessible for ligand binding. New subsites, termed the S2’ subsite, emerged adjacent to the S1’ subsite in structures corresponding to Clusters 1, 3, 4, and 5 (shown in Fig. 3A,C-E). The S1’ pocket showed the least divergence when compared to the average pocket geometry by retaining a similar shape in Clusters 1–3 but was notably diminished in size and volume in the structures of Cluster 4 and 5. Whereas S4 exhibited the most pronounced dissimilarity in shape across all five clusters.
Molecular Docking
The CMAUP database14 was screened to identify ligands capable of binding to each of the five active site conformations of MPro. We identified nearly 500 phytochemicals with docking scores above − 7.5 kcal/mol across five protein conformations. The criteria for the docking score cutoff was based on the docking scores observed for reported small-molecule MPro inhibitors35,36,37. Our analysis revealed a preference for the binding site within Cluster 3, as approximately 200 compounds bound to this conformation with scores above the set threshold. In contrast, only 25 compounds docked to the active site of Cluster 5 with docking scores above the threshold.
Table 1 lists the 20 phytochemicals with high docking scores across all five conformations. Additional details of the phytochemicals, including structure, PubChem ID, and the plant sources are listed in Supplementary Table 1. Of these 20 phytochemicals, four were identified as aglycones, while the remaining compounds were glycosides. When discussing phytochemical, the aglycone is of particular interest because it typically holds the pharmacological or biological activity. Whereas the glycosidic bond could affect the compound's solubility, stability, and bioavailability.
Table 1
Top phytochemicals with high docking scores (in kcal/mol) across all five MPro conformations.
Phytochemical Name | Docking scores |
| Conformation 1 | Conformation 2 | Conformation 3 | Conformation 4 | Conformation 5 |
1,3,6-Tri-O-Galloyl-Beta-D-Glucose | -7.6 | -8.8 | -11.1 | -7.5 | -10.1 |
2'-Acetylacteoside | -8.6 | -9.5 | -12.1 | -13.4 | -7.6 |
2''-O-Acetylrutin | -10.3 | -9.6 | -12.2 | -10.8 | -10.4 |
*AHDPH | -8.1 | -9.0 | -11.6 | -7.5 | -9.0 |
Balanophotannin E | -7.5 | -11.0 | -12.9 | -9.9 | -8.4 |
**DDHHG | -9.7 | -8.3 | -10.9 | -11.3 | -8.6 |
***DHMMP-TRTH-TMMO-Chr-One | -9.7 | -10.5 | -10.7 | -9.1 | -9.9 |
Eriodictyol 7-O-Sophoroside | -12.6 | -9.3 | -10.0 | -11.1 | -10.0 |
Forsythiaside | -10.3 | -12.6 | -14.3 | -14.6 | -9.2 |
Hyperin 6''-[glucosyl-(1->3)-rhamnoside] | -9.7 | -10.9 | -15.9 | -12.1 | -11.9 |
Kaempferol 3-(3R-glucosylrutinoside) | -10.0 | -10.6 | -12.0 | -11.1 | -8.5 |
Luteolin 7-rutinoside | -9.8 | -9.4 | -14.4 | -12.0 | -9.9 |
Narcissin | -9.7 | -10.5 | -10.7 | -9.1 | -9.9 |
Pectolinarin | -8.9 | -7.7 | -13.9 | -8.5 | -7.5 |
Plantagineoside C | -9.4 | -10.3 | -13.3 | -10.4 | -9.3 |
Quercetin 3-glucoside2''-gallate | -7.8 | -9.2 | -12.1 | -10.6 | -7.5 |
Quercetin-3-o-rutinose | -12.2 | -11.0 | -11.1 | -11.5 | -11.4 |
Salvianolic Acid L (SAL) | -9.1 | -8.2 | -13.3 | -11.3 | -7.6 |
Shikonin | -8.1 | -8.4 | -8.6 | -8.9 | -9.5 |
Shimobashiric Acid C (SAC) | -8.2 | -8.7 | -10.5 | -9.6 | -10.2 |
*AHDPH = (3R,5R)-3-Acetoxy-5-Hydroxy-1,7-Bis(3,4-Dihydroxyphenyl)Heptane. |
**DDHHG = (3R,5R)-3,5-Dihydroxy-1-(3,4-Dihydroxyphenyl)-7-(4-Hydroxyphenyl)-Heptane 3-O-Beta-D-Glucopyranoside. |
***DHMMP-TRTH-TMMO-Chr-One = 5,7-Dihydroxy-2-(4-Hydroxy-3-Methoxyphenyl)-3-[(2S,3R,4S,5S,6R)-3,4,5-Trihydroxy-6-[[(3R,4R,5R,6S)-3,4,5-Trihydroxy-6-Methyloxan-2-Yl]Oxymethyl]Oxan-2-Yl]Oxychromen-4-One. |
The four aglycones included shimobashiric acid C (SAC), salvianolic acid L (SAL), AHDPH, and shikonin. The binding modes of these phytochemicals were then ranked for the most favorable binding site conformation by docking score (Fig. 4). SAC, SAL, and AHDPH exhibited a highest binding affinity of when bound to the third binding site conformation with a docking score of -10.5 kcal/mol, -13.3 kcal/mol, and − 11.6 kcal/mol. SAC is a complex molecule featuring a cyclobutane core with multiple hydroxyphenyl groups. As shown in Fig. 4A-C the S2 pocket is predicted to accommodate one of the four hydroxyphenyl groups linked to the cyclobutene core. The three other hydroxy groups from the phenyl moieties interact with the peripheral residues of the S1, S4, and newly formed S2’ pocket. The catalytic residue, His41 is engaged in a pi-pi interaction with one of the hydroxyphenyl groups and residues Cys44, Glu166, and Asp187 established hydrogen bonds with the hydroxy groups. Furthermore, residues Gly143, Ser144, Cys145, and Gln189 formed hydrogen bonds with the oxygen atoms of the two propanoic acid units of the phytochemical. SAL consists of a naphthalene molecule with three 3,4-dihydroxyphenyl substituents and a 3-(3,4-dihydroxyphenyl) propanoic acid moiety connected via a carboxyethyl linker. As depicted in Fig. 4D-F the three 3,4-dihydroxyphenyl groups occupied the S1’, S1, and S4 pockets. The naphthalene ring interacted with S2’ pocket. Six hydrogen bonds were formed between the molecule and protein residues, including Cys44, Gly143, Cys145, His164, Glu166, Arg188, and Gln192. In the case of AHDPH, S1 and S2 pockets remained unoccupied, while S1’ and S4 accommodated the two dihydroxyphenyl groups of the compound. Three residues, Thr26, Arg188, and Gln192 formed hydrogen bonds with the hydroxy groups of the phenyl while Glu 166 formed a hydrogen bond with the hydroxy group of the carbon chain of the phytochemical (Fig. 4G-I). For shikonin, the best binding was achieved with the fifth protein conformation, with docking scores of -9.5 kcal/mol, respectively. The naphthalene ring of shikonin occupied the narrow S4 pocket, with its hydroxy groups interacting with charged pocket residues—Glu166, Asp187, Arg188, and Gln189. Additionally, the phytochemical's pentyl chain interacted with the hydrophobic residues forming the S2 pocket (Fig. 4J-L).
Considering the metabolic cleavage of glycosides to aglycones in the body we investigated docking of such metabolites (Table 2). In each case, aglycones exhibited moderate binding capabilities and displayed a preference for the active site of a specific cluster conformation, except for quercetin that showed good binding to all conformations with a docking score > 7.0 kcal/mol. Dihydrocaffeic acid, brevifolincarboxylic acid, DDHH, pinoresinol, kaempferol, luteolin, isoharmnetin, pectolinarigenin, and secoisolariciresinol preferentially bound to at least two conformations with a docking score > = 7.0 kcal/mol. Hesperetin and eriodictyol bounded favorably with the first conformation, while hydroxytyrosol preferred the third conformation. Gallic acid, and caffeic acid exhibited a low docking score across all conformations compared to its glycoside. Some of these aglycones have been previously reported for their anti-SARS-Cov-2 activity38,39,40,41,42.
Table 2
Docking Scores (in kcal/mol) of metabolites for the top phytochemicals.
Glycoside Name | Metabolite | Docking scores |
| | Conformation 1 | Conformation 2 | Conformation 3 | Conformation 4 | Conformation 5 |
1,3,6-Tri-O-Galloyl-Beta-D-Glucose | Gallic acid | -5.8 | -5.3 | -5.8 | -6.3 | -5.4 |
2'-Acetylacteoside/Forsythiaside | Caffeic acid | -4.9 | -4.3 | -4.0 | -5.5 | -4.4 |
2'-Acetylacteoside/Forsythiaside | Dihydrocaffeic acid | -5.3 | -6.0 | -8.2 | -8.0 | -6.3 |
2'-Acetylacteoside/Forsythiaside | Hydroxytyrosol | -4.8 | -4.3 | -7.5 | -6.7 | -5.0 |
2''-O-Acetylrutin/ Hyperin 6''-[glucosyl-(1->3)-rhamnoside]/ Quercetin 3-glucoside2''-gallate/ Quercetin-3-o-rutinose | Quercetin | -7.8 | -7.2 | -7.3 | -7.6 | -7.0 |
Balanophotannin E | Brevifolincarboxylic acid | -6.4 | -6.4 | -8.0 | -7.1 | -6.7 |
Balanophotannin E | Gallic acid | -5.8 | -5.3 | -5.8 | -6.3 | -5.4 |
**DDHHG | ****DDHH | -7.5 | -6.5 | -10.2 | -6.8 | -6.5 |
***DHMMP-TRTH-TMMO-Chr-One | Hesperetin | -7.0 | -5.7 | -6.5 | -5.2 | -5.3 |
Eriodictyol 7-O-Sophoroside | Eriodictyol | -7.0 | -5.6 | -6.4 | -6.7 | -6.7 |
Forsythiaside | Pinoresinol | -5.0 | -6.2 | -8.7 | -7.8 | -6.0 |
Kaempferol 3-(3R glucosylrutinoside) | Kaempferol | -7.0 | -5.4 | -7.9 | -5.1 | -5.5 |
Luteolin 7-rutinoside | Luteolin | -7.3 | -6.6 | -8.0 | -6.9 | -5.4 |
Narcissin | Isoharmnetin | -7.3 | -5.9 | -7.0 | -5.9 | -6.1 |
Pectolinarin | Pectolinarigenin | -7.0 | -5.8 | -7.6 | -5.4 | -3.7 |
Plantagineoside C | Secoisolariciresinol | -7.8 | -5.5 | -8.8 | -7.3 | -6.7 |
**DDHHG = (3R,5R)-3,5-Dihydroxy-1-(3,4-Dihydroxyphenyl)-7-(4-Hydroxyphenyl)-Heptane 3-O-Beta-D-Glucopyranoside. |
***DHMMP-TRTH-TMMO-Chr-One = 5,7-Dihydroxy-2-(4-Hydroxy-3-Methoxyphenyl)-3-[(2S,3R,4S,5S,6R)-3,4,5-Trihydroxy-6-[[(3R,4R,5R,6S)-3,4,5-Trihydroxy-6-Methyloxan-2-Yl]Oxymethyl]Oxan-2-Yl]Oxychromen-4-One. |
****DDHH = 3,5-dihydroxy-1-(3,4-dihydroxyphenyl)-7-(4-hydroxyphenyl)-heptane. |
In addition to examining the top 20 phytochemicals and their respective metabolites, we extended our analysis to include four phytochemicals that exhibited high docking scores for at least one protein conformation. These included cynarin, demethoxycurcumin, hexahydrocurcumin, and withaferin A (Table 3).
Table 3
Phytochemicals with high docking scores (in kcal/mol) for at least one MPro conformation.
Phytochemical Name | Docking scores |
| Conformation 1 | Conformation 2 | Conformation 3 | Conformation 4 | Conformation 5 |
Cynarin | -10.7 | -5.7 | -9.4 | -3.9 | -6.9 |
Demethoxycurcumin | -5.9 | -8.0 | -6.9 | -5.8 | -4.9 |
Hexahydrocurcumin | -3.4 | -4.7 | -9.9 | -7.1 | -5.6 |
Withaferin A | -3.9 | -4.8 | -7.4 | -4.6 | -8.4 |
Cynarin had moderate to low docking scores across all binding site conformations, except for the conformations representing Cluster 1 and 3, where its docking scores exceeded − 9.0 kcal/mol. As shown in Fig. 5A-C, the core cyclic structure of dihydrocyclohexane occupied the S1 pocket, while the carboxylic acid group attached to the cyclohexane extended towards the S1’ site. One of the two hydroxyphenyl groups occupied the S4 pocket, while the other hydroxyphenyl group, although not positioned within the S1’ pocket, interacted with the residues forming the pocket. The phytochemical establishes hydrogen bonds with five key residues—Thr26, Asn142, Gly143, Glu166, and Thr190.
Demethoxycurcumin exhibits a docking score of approximately − 8.0 kcal/mol when binding to the representative protein conformation of Cluster 2, the highest among other protein conformations. The hydroxyphenyl group of demethoxycurcumin occupies the region between the S2 and S1’ pockets, with the S2 pocket accommodating the hepta-1,6-diene-3,5-dione bridge. Simultaneously, the hydroxy-3-methoxy group of the phytochemical effectively occupies the S1 pocket. This binding arrangement leads to the formation of hydrogen bonds with four significant residues—Thr26, Leu141, Ser144, and Gly143 (refer to Fig. 5. D-F).
Hexahydrocurcumin had its highest binding score (~ 10.0 kcal/mol) when interacting with the representative conformation of Cluster 3. One of the two hydroxy methoxyphenyl groups of the phytochemical occupy the S2-S4 pocket, interacting with Glu166, Asp187, and Thr190. The other hydroxy methoxy phenyl group interacts with the residues Gly143 and Cys 145, fitting into the S1 pocket (refer to Fig. 5G-I). Interestingly, despite the structural similarity between the two curcumin derivatives, their predicted affinities toward specific protein conformations and their adopted binding orientations are distinct.
Withaferin A preferred the representative structure of Cluster 5 conformation, binding with a docking score of -8.4 kcal/mol. In this binding orientation the dihydropyran ring and the hydroxymethyl substituent occupied the S4 pocket. The main cyclic structure of withaferin A (oxapentacyclooctadec-4-en-3-one) interacted with the S2 and S1’ sites of the binding pocket. Crucial interactions also included MPro residuesThr26, Asn119, and Glu166 interacting with the hydroxy groups of withaferin A (Fig. 5J-L).
MM-GBSA Prediction
To improve the accuracy of our inhibitor binding predictions, the initially identified 39 phytochemicals were rescored based on binding energies calculated with Prime/MM-GBSA. For these binding energy calculations, the protein-ligand conformation with the highest docking score was selected. Table 4 summarizes the MM-GBSA energies for the selected 20 phytochemicals (Table 1), their corresponding aglycones (Table 2), and phytochemicals exhibiting strong docking scores for at least one protein conformation (Table 3), a total of 39 phytochemicals. The MM-GBSA energy calculations revealed that the binding mode of all the phytochemicals was primarily driven by negative ΔG values for ΔGcoulomb, ΔGhbond, ΔGlipo, ΔGpacking, and ΔGvdw. These scores indicate the presence of attractive Coulombic interactions, hydrogen bonding, and strong hydrophobic interactions, including lipophilic and van der Waals forces. However, the positive ΔGcovalent and ΔGsolv_GB scores suggest that covalent bond formation and binding of the phytochemicals in an aqueous environment are energetically unfavorable.
Table 4
Prime MM_GBSA energies (in kcal/mol) of phytochemicals.
Phytochemical Name | ΔGbind | ΔGcoulomb | ΔGcovalent | ΔGhbond | ΔGlipo | ΔGpacking | ΔGsolv_GB | ΔGvdW |
1,3,6-Tri-O-Galloyl-Beta-D-Glucose | -63.8 | -45.1 | 7.4 | -5.3 | -15.3 | -2.2 | 48.3 | -51.5 |
2'-Acetylacteoside | -59.0 | -30.9 | 8.2 | -4.4 | -20.8 | -2.2 | 41.9 | -50.9 |
2''-O-Acetylrutin | -53.3 | -43.3 | 14.5 | -5.7 | -11.7 | -3.1 | 47.1 | -51.3 |
*AHDPH | -58.3 | -27.4 | 3.9 | -5.0 | -15.1 | -2.2 | 29.9 | -42.5 |
Balanophotannin E | -70.9 | -57.1 | 17.4 | -5.5 | -15.9 | -1.5 | 43.0 | -51.3 |
Brevifolincarboxylic acid | -40.0 | -14.3 | 0.0 | -2.3 | -9.5 | -2.3 | 18.2 | -29.7 |
Caffeic acid | -26.8 | 11.3 | 3.8 | -2.2 | -8.1 | -0.2 | -13.5 | -17.9 |
Cynarin | -34.1 | -2.6 | 7.6 | -5.2 | -13.1 | -1.8 | 18.2 | -37.1 |
****DDHH | -49.9 | -27.4 | 1.9 | -3.4 | -11.1 | -2.2 | 22.2 | -29.9 |
**DDHHG | -70.2 | -51.9 | 6.1 | -5.8 | -12.4 | -1.5 | 32.8 | -37.4 |
Demethoxycurcumin | -57.2 | -39.5 | 6.0 | -2.8 | -12.3 | -0.3 | 27.6 | -35.9 |
***DHMMP-TRTH-TMMO-Chr-One | -54.9 | -32.1 | 5.9 | -4.0 | -11.0 | -3.0 | 24.5 | -35.2 |
Dihydrocaffeic acid | -41.1 | -26.1 | 1.0 | -2.7 | -9.6 | -0.8 | 17.3 | -20.2 |
Eriodictyol | -35.5 | -32.2 | 3.6 | -3.2 | -4.6 | -1.3 | 29.1 | -26.9 |
Eriodictyol 7-O-Sophoroside | -60.6 | -50.5 | 5.8 | -6.7 | -13.1 | -1.4 | 44.9 | -39.7 |
Forsythiaside | -82.2 | -55.0 | 6.9 | -6.0 | -18.4 | -0.8 | 39.4 | -48.2 |
Gallic acid | -24.5 | 11.8 | 4.3 | -2.8 | -6.1 | -0.3 | -15.7 | -15.7 |
Hesperetin | -43.1 | -33.1 | 3.5 | -3.2 | -5.9 | -1.3 | 26.2 | -29.3 |
Hexahydrocurcumin | -67.6 | -40.4 | 3.9 | -3.8 | -16.4 | -1.7 | 27.5 | -36.7 |
Hydroxytyrosol | -54.9 | -35.2 | 1.4 | -3.7 | -14.7 | -0.9 | 15.9 | -17.7 |
Hyperin 6''-[glucosyl-(1->3)-rhamnoside] | -76.6 | -59.1 | 8.4 | -8.8 | -15.0 | -1.9 | 36.0 | -36.1 |
Isoharmnetin | -47.2 | -22.8 | 0.7 | -2.4 | -7.5 | -2.8 | 21.7 | -34.1 |
Kaempferol | -52.6 | -34.1 | 1.1 | -2.9 | -4.7 | -2.3 | 24.8 | -34.5 |
Kaempferol 3-(3R-glucosylrutinoside) | -35.4 | -21.1 | 14.6 | -5.5 | -12.8 | -0.9 | 27.7 | -37.3 |
Luteolin | -39.4 | -33.7 | 2.8 | -3.3 | -4.0 | -1.4 | 26.6 | -26.5 |
Luteolin 7-rutinoside | -80.2 | -58.1 | 7.0 | -6.6 | -12.5 | -2.5 | 39.0 | -46.5 |
Narcissin | -56.0 | -42.0 | 20.4 | -3.0 | -15.4 | -2.4 | 34.7 | -48.4 |
Pectolinarigenin | -45.1 | -27.2 | 6.3 | -3.1 | -6.9 | -2.8 | 21.2 | -32.5 |
Pectolinarin | -71.1 | -45.6 | 8.0 | -5.4 | -14.1 | -2.9 | 37.3 | -48.2 |
Pinoresinol | -59.1 | -34.4 | 4.7 | -4.2 | -13.7 | -2.8 | 22.2 | -31.0 |
Plantagineoside C | -62.3 | -41.1 | 5.5 | -6.3 | -17.5 | -2.3 | 32.1 | -32.8 |
Quercetin | -36.4 | -32.8 | 2.9 | -3.3 | -4.5 | -1.3 | 28.9 | -26.3 |
Quercetin 3-glucoside2''-gallate | -76.7 | -61.6 | 13.2 | -6.9 | -11.9 | -3.0 | 51.1 | -57.5 |
Quercetin-3-o-rutinose | -45.3 | -38.8 | 3.1 | -5.8 | -8.4 | -1.9 | 44.4 | -37.8 |
Salvianolic Acid L (SAL) | -42.1 | 25.3 | 11.3 | -5.2 | -24.2 | -2.4 | 2.7 | -49.6 |
Secoisolariciresinol | -53.8 | -32.6 | 6.9 | -4.0 | -11.0 | -3.0 | 25.5 | -35.5 |
Shikonin | -52.9 | -25.8 | 6.9 | -2.8 | -15.1 | 0.0 | 19.8 | -35.9 |
Shimobashiric Acid C (SAC) | -39.0 | 2.0 | 8.3 | -7.2 | -5.0 | -1.5 | -1.3 | -34.4 |
Withaferin A | -51.8 | -19.8 | 4.0 | -1.7 | -14.1 | 0.0 | 22.6 | -42.8 |
*AHDPH = (3R,5R)-3-Acetoxy-5-Hydroxy-1,7-Bis(3,4-Dihydroxyphenyl)Heptane. |
**DDHHG = (3R,5R)-3,5-Dihydroxy-1-(3,4-Dihydroxyphenyl)-7-(4-Hydroxyphenyl)-Heptane 3-O-Beta-D-Glucopyranoside. |
***DHMMP-TRTH-TMMO-Chr-One = 5,7-Dihydroxy-2-(4-Hydroxy-3-Methoxyphenyl)-3-[(2S,3R,4S,5S,6R)-3,4,5-Trihydroxy-6-[[(3R,4R,5R,6S)-3,4,5-Trihydroxy-6-Methyloxan-2-Yl]Oxymethyl]Oxan-2-Yl]Oxychromen-4-One. |
****DDHH = 3,5-dihydroxy-1-(3,4-dihydroxyphenyl)-7-(4-hydroxyphenyl)-heptane. |
Among the selected phytochemicals, Forsythiaside had the best calculated binding energy, ΔGbind = -82.2 kcal/mol. Luteolin 7-rutinoside ranked second with a ΔGbind value of -80.2 kcal/mol, followed by quercetin 3-glucoside 2”-gallate and hyperin 6’-[glucosyl-(1 > 3)-rhamnoside] with a ΔGbind value of approximately − 76.0 kcal/mol. All of these molecules also showcased high docking scores above − 12.0. kcal/mol.
Pinoresinol, one of the metabolites of forsythiaside exhibited the highest binding energy of ~ -59.0 kcal/mol among all the aglycones studied. It also showcased a significant docking score of -8.7 kcal/mol. Secoisolariciresinol was the second-best aglycone with a binding energy of -53.8 kcal/mol, followed by kaempferol. Overall, the binding energy calculations aligned well with the computed docking scores of top 20 phytochemicals (Table 1), except for SAC and kaempferol 3-(3R-glucosylrutinoside), which deviated with relatively low binding energies (ΔGbind < -40 kcal/mol), despite securing high docking scores across all protein conformations. Cynarin, while displaying a significantly high docking score for the first protein conformation, exhibited a low binding energy value (ΔGbind = -36.1 kcal/mol). Additionally, the aglycones luteolin and quercetin demonstrated lower binding energies (ΔGbind = -39.4 and − 36.4 kcal/mol, respectively), although they showcased moderate docking scores in at least one protein conformation.
We studied the protein-ligand interaction analysis of the top ten phytochemicals based on docking scores and MM-GBSA energies. The heatmap illustrated in Fig. 6 highlights the significance of specific binding site residues, including His41, Asn142, Gly143, Cys145, Met165, Glu166, Arg188, and Glu189, in stabilizing the phytochemicals within the binding site. These residues formed van der Waals and Coulombic interactions with at least five out of the ten phytochemicals, emphasizing their crucial role in ligand binding.
Bioavailability Prediction
To gain insights into the bioavailability of the selected phytochemicals, we conducted an in silico ADMET study using ADMET Predictor by Simulations Plus43. This calculation generates a ADMET Risk score that reflects potential compound liabilities based on 20 rules within three Risk models (Absn_Risk, CYP_Risk, and Tox_Risk) of their ADMET Predictor. Additionally, the ADMET Risk score models pharmacokinetic properties termed fraction unbound (fu) and high-steady-state volume of distribution (Vd). Each Risk score is paired with a mnemonic Code list that identifies the rules that have been violated. Of the 39 phytochemicals predicted to bind strongly to MPro, ten had ADMET Risk scores of < = 1 (Table 5). On the other hand, six phytochemicals, including 1,3,6-Tri-O-galloyl-beta-D-glucose, 2-acetylacteoside, quercetin 3-glucoside2'' -gallate, balanophotannin E, SAL, and SAC had problematic ADEMT Risk scores exceeding 6.
Table 5
ADMET Prediction of phytochemicals.
Phytochemical ID | ADMET Risk | ADMET Code | Liver Microsomes | Hepatocytes | Systemic | THalf_hum-100.0 | S + MDCK-LE |
| | | %Fa | %Fb | %Fa | %Fb | %Fa | %Fb | | |
1,3,6-Tri-O-Galloyl-Beta-D-Glucose | 7.0 | Size; HBD; HBA; ch; Kow-; Peff+; Sw-; fu-; Vd-; CL+ | 28.3 | 23.4 | 28.3 | 23.4 | 28.2 | 24.9 | 1.2 | Low |
2-Acetylacteoside | 6.5 | Size; RotB; HBD; HBA; ch; Peff; CL+ | 40.1 | 34.4 | 40.1 | 34.4 | 40.0 | 36.9 | 0.8 | Low |
2''-O-Acetylrutin | 5.5 | Size; HBD; HBA; ch; Peff; CL- | 24.0 | 19.1 | 24.0 | 19.1 | 23.9 | 21.2 | 4.8 | Low |
*AHDPH | 3.4 | RotB; HBD; ch; CL | 99.9 | 93.7 | 99.9 | 93.7 | 99.8 | 99.8 | 1.8 | Low |
Balanophotannin E | 7.0 | Size; HBD; HBA; ch; Kow-; Peff+; Sw-; fu-; Vd-; CL+ | 5.1 | 4.5 | 5.1 | 4.5 | 5.1 | 4.8 | 1.3 | Low |
Brevifolincarboxylic acid | 1.5 | HBD; ch | 95.0 | 82.9 | 95.0 | 82.9 | 95.8 | 95.1 | 1.6 | Low |
Caffeic acid | 1.0 | HBD | 99.5 | 86.9 | 99.3 | 87.2 | 97.6 | 84.0 | 1.0 | High (96%) |
Cynarin | 5.0 | Size; HBD; HBA; ch; Peff | 44.6 | 40.1 | 44.6 | 40.1 | 44.4 | 42.0 | 3.8 | Low |
****DDHH | 2.2 | HBD; CL | 99.8 | 82.2 | 99.8 | 82.2 | 99.6 | 99.6 | 3.5 | Low |
**DDHHG | 5.0 | Size; RotB; HBD; HBA; ch | 54.9 | 48.3 | 54.7 | 48.2 | 53.3 | 47.6 | 3.5 | Low |
Demethoxycurcumin | 1.1 | fu; CL | 97.7 | 87.0 | 97.7 | 87.0 | 97.6 | 97.6 | 10.6 | High (99%) |
***DHMMP-TRTH-TMMO-Chr-One | 4.5 | Size; HBD; HBA; ch; CL- | 22.9 | 18.7 | 22.9 | 18.7 | 22.9 | 20.4 | 4.5 | Low |
Dihydrocaffeic Acid | 0.0 | | 99.6 | 83.6 | 99.6 | 83.6 | 99.8 | 99.7 | 0.9 | High |
Eriodictyol | 1.3 | HBD; CL | 99.9 | 90.1 | 99.9 | 90.1 | 99.6 | 99.5 | 1.9 | High (82%) |
Eriodictyol 7-O-Sophoroside | 6.0 | Size; HBD; HBA; ch; Kow-; Peff; CL- | 17.1 | 14.7 | 17.1 | 14.7 | 17.1 | 15.8 | 1.0 | Low |
Forsythiaside | 6.0 | Size; RotB; HBD; HBA; ch; Peff; CL- | 33.5 | 28.4 | 33.5 | 28.4 | 33.4 | 30.2 | 1.0 | Low |
Gallic Acid | 0.5 | HBD | 99.0 | 78.1 | 99.0 | 78.1 | 92.0 | 91.0 | 0.4 | High (85%) |
Hesperetin | 0.9 | CL | 99.9 | 92.7 | 99.9 | 92.7 | 92.5 | 91.9 | 5.7 | High (93%) |
Hexahydrocurcumin | 2.0 | RotB; CL | 99.9 | 82.0 | 99.9 | 82.0 | 99.9 | 99.9 | 4.6 | High (99%) |
Hydroxytyrosol | 0.0 | | 99.9 | 63.2 | 99.9 | 63.2 | 98.6 | 98.2 | 1.0 | High (99%) |
Hyperin 6''-[glucosyl-(1->3)-rhamnoside] | 6.0 | Size; HBD; HBA; ch; Kow-; Peff; CL- | 4.9 | 4.0 | 4.9 | 4.0 | 4.9 | 4.4 | 1.0 | Low |
Isorhamnetin | 0.5 | HBD; ch | 98.8 | 90.3 | 98.8 | 90.3 | 98.6 | 98.5 | 2.1 | High (88%) |
Kaempferol | 0.5 | HBD | 99.9 | 92.0 | 99.9 | 92.0 | 99.7 | 99.6 | 2.2 | High (93%) |
Kaempferol 3-(3R-glucosylrutinoside) | 6.0 | Size; HBD; HBA; ch; Kow-; Peff; CL- | 5.3 | 4.2 | 5.3 | 4.2 | 5.3 | 4.7 | 1.1 | Low |
Luteolin | 0.5 | HBD; CL | 93.9 | 87.5 | 93.9 | 87.5 | 93.7 | 93.7 | 3.8 | High (93%) |
Luteolin 7-rutinoside | 5.5 | Size; HBD; HBA; ch; Peff; CL- | 20.8 | 16.9 | 20.8 | 16.9 | 20.8 | 18.3 | 4.1 | Low |
Narcissin | 5.5 | Size; HBD; HBA; ch; Peff; CL- | 22.9 | 18.7 | 22.9 | 18.7 | 22.9 | 20.4 | 1.0 | Low |
Pectolinarigenin | 2.0 | Sw; CL | 23.6 | 21.7 | 23.6 | 21.7 | 23.6 | 23.6 | 5.2 | High (99%) |
Pectolinarin | 5.5 | Size; HBD; HBA; ch; Peff; CL- | 33.4 | 26.7 | 33.4 | 26.7 | 33.3 | 29.9 | 1.6 | Low |
Pinoresinol | 1.0 | CL | 99.9 | 76.2 | 99.9 | 76.2 | 99.8 | 99.8 | 3.7 | High (99%) |
Plantagineoside C | 5.5 | Size; HBD; HBA; ch; Peff; CL- | 70.9 | 64.8 | 70.9 | 64.8 | 70.2 | 69.0 | 0.8 | Low |
Quercetin | 2.0 | HBD; ch; CL | 98.1 | 88.3 | 98.1 | 88.3 | 97.8 | 97.7 | 1.5 | High (76%) |
Quercetin 3-glucoside2''-gallate | 6.5 | Size; HBD; HBA; ch; Kow-; Peff; Sw-; CL- | 24.2 | 21.6 | 24.2 | 21.6 | 24.1 | 22.9 | 1.6 | Low |
Quercetin-3-o-rutinose | 6.0 | Size; HBD; HBA; ch; Kow-; Peff; CL- | 16.9 | 13.9 | 16.9 | 13.9 | 16.9 | 15.2 | 1.1 | Low |
Salvianolic acid L (SAL) | 8.0 | Size; RotB; HBD; HBA; ch; Kow-; Peff; Sw-; fu-; CL+ | 48.5 | 43.9 | 48.5 | 43.9 | 48.3 | 45.6 | 1.2 | Low |
Secoisolariciresinol | 1.3 | RotB; HBD; CL | 99.9 | 80.4 | 99.9 | 80.4 | 99.8 | 99.8 | 3.7 | Low |
Shikonin | 0.0 | | 99.9 | 93.7 | 99.9 | 93.7 | 99.9 | 99.9 | 7.7 | High (88%) |
Shimobashiric Acid C (SAC) | 8.0 | Size; RotB; HBD; HBA; ch; Kow-; Peff; Sw-; fu-; CL+ | 45.8 | 43.4 | 45.8 | 43.4 | 45.8 | 43.4 | 14.0 | Low |
Withaferin A | 1.5 | Size; ch | 80.4 | 62.3 | 80.4 | 62.3 | 80.3 | 80.3 | 11.7 | High (99%) |
Red hues signify improved predictive scores, and blue hues represent poorer predictive scores for three clearance models. |
RotB = rotatable bonds; HBD = H-bond donors; HBA = H-bond acceptors; ch = charge; Kow = lipophilicity; Peff = permeability; Sw = water solubility; fu = fraction unbound; Vd = volume of distribution; CL = high microsomal clearance. |
%Fa = fraction absorbed; %Fb = fraction bioavailable. |
*AHDPH = (3R,5R)-3-Acetoxy-5-Hydroxy-1,7-Bis(3,4-Dihydroxyphenyl)Heptane. |
**DDHHG = (3R,5R)-3,5-Dihydroxy-1-(3,4-Dihydroxyphenyl)-7-(4-Hydroxyphenyl)-Heptane 3-O-Beta-D-Glucopyranoside. |
***DHMMP-TRTH-TMMO-Chr-One = 5,7-Dihydroxy-2-(4-Hydroxy-3-Methoxyphenyl)-3-[(2S,3R,4S,5S,6R)-3,4,5-Trihydroxy-6-[[(3R,4R,5R,6S)-3,4,5-Trihydroxy-6-Methyloxan-2-Yl]Oxymethyl]Oxan-2-Yl]Oxychromen-4-One. |
****DDHH = 3,5-dihydroxy-1-(3,4-dihydroxyphenyl)-7-(4-hydroxyphenyl)-heptane. |
We calculated absorption and bioavailability characteristics of these phytochemicals using three human clearance models (liver microsomes, hepatocytes, and systemic) at a dose of 100 mg for an immediate release tablet and observed significant variability in the calculated pharmacokinetics parameters (Table 5). Out of 39 phytochemicals examined, 20 exhibited high absorption (%Fa) and minimal metabolism or elimination in the liver during the first-pass effect (%Fb). A slight decrease in %Fb values were observed in the hepatocytes for some of these phytochemicals, suggesting that a portion of the compound is subject to metabolism.
In addition to absorption and bioavailability estimations, we calculated phytochemical plasma half-life (T-half) in humans and cell permeability (Madin-Darby canine kidney; MDCK), the latter serving as an estimate for intestinal epithelial absorption (Table 5). Demethoxycurcumin, shikonin, SAC, and withaferin, demonstrated an extended plasma half-life, surpassing 7 hours. Phytochemicals with low ADMET risks also exhibited high MDCK permeability, except for secoisolariciresinol and DDHH.
Cytotoxicity and Viral Replication Assay
We performed cell-based antiviral assay to evaluate the inhibitory effects of five highly performing phytochemicals (demethoxycurcumin, hydroxytyrosol, kaempferol, shikonin, and withaferin A) on SARS-CoV-2 replication. These compounds were selected based on their high overall performance in terms of docking score (above − 7.5 kcal/mol for at least one protein conformation), MM-GBSA binding energy (ΔGbind > -50 kcal/mol), and ADMET properties. Glycosides were excluded from the viral replication assay as their metabolic modification in the body would render the results less relevant. Although pinoresinol and hexahydrocurcumin fulfilled the criteria for top-performing molecules, meeting the specified factors, we opted not to include them in the study due to their current unavailability for immediate testing.
Viral assay demonstrated the SARS-Cov-2 replication inhibitory activity of three compounds, shikonin (EC50 = ~ 10 µM), demethoxycurcumin (EC50 = ~ 8.8 µM), and withaferin,(EC50 = ~ 2.8 µM) (Fig. 7). Importantly, in contrast to shikonin and withaferin, demethoxycurcumin showed no apparent cytotoxicity.
The inhibitory effects of these three compounds against MPro were previously speculated 44, 45, 46, 47, 48,49,50,51, however, the referenced studies primarily relied on computational or enzyme-based assays. Our study presents a robust cell-based antiviral data, providing a more direct and clinically relevant perspective on the inhibitory potential of these compounds against SARS-CoV-2 replication.
While our findings contribute to the ongoing discussion on effective viral inhibition potential of demethoxycurcumin, shikonin, and withaferin A, we also recognize the necessity for further research to elucidate the intricate mechanisms underlying this inhibition. Our work adds valuable insights to this evolving dialogue, underscoring the importance of continued investigation in this area.