Functional groups analysis
To calculate the frequency of different functional groups of IL6-mediated STAT3 inhibitors (positive dataset) and non-inhibitors (negative dataset), we utilized the ChemmineR package [29]. We analysed average frequency values and found that in the positive dataset, the abundance of rings, and aromatic groups is significantly higher as compared to non-inhibitors. On the other hand, the frequency of secondary amines (R2NH), tertiary amines (R3N), and ester (ROR) groups is significantly elevated in non-inhibitors compounds, i.e., STAT3 non-inhibitors, as represented in Figure 2.
Additionally, we also observed that the occurrence of the rings and aromatic groups in few existing STAT3 inhibitors such as Napabucasin (BBI608), an FDA-approved drug used for treating advanced malignancies, and STAT3 Inhibitor VII (STAT3-IN-8 ) drug is used for head and neck cancer treatment. Some indirect STAT3 inhibitors like AZD-1480 and Ruxolitinib (FDA-approved) also show similar trends. Figure 3 shows the presence of these functional groups in the chemical 2-D-structures of known STAT3 inhibitors, i.e., STAT3 Inhibitor VII, Ruxolitinib, AZD-1480, and BBI608. These findings suggest that the researcher can further utilize this analysis to design novel drug candidates to be used as an inhibitor for the STAT3 signaling pathway.
Prediction models
One of the major challenge in this type of study is to classify chemicals based on the 2-D, 3-D, and FP descriptors is that all descriptors are not relevant. Thus, several feature selection techniques have been used to get the best set of features for the classification. After selecting best feature, we developed several prediction models with the help of machine learning based classifiers such as RF, DT, LR, XGB, SVM, and GBM. The complete architecture of the study is shown in Figure 4.
Performance of classification models
2-D descriptors
We compute 1444 2-D descriptors for the positive and negative dataset. After removing low variance and highly correlated features, we get 74 features. With this feature set, we have developed the classification models. RF attains maximum performance with balanced sensitivity and specificity on the training dataset (AUC = 0.84) and validation (AUC = 0.84) dataset, and the complete information is available in Supplementary Table S1. Further, we obtained 41 2-D descriptors with the help of the SVC-L1 method. After reducing the features, there is a slight change in the AUC 0.83 and 0.84; accuracy 76.35% and 75.46% on training and validation dataset with the RF classifier (Table 1).
Table1: Performance of machine-learning models on training and validation dataset with best 41 2-D descriptors.
Classifier
|
Training Dataset
|
Validation Dataset
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
DT
|
64.15
|
64.29
|
64.22
|
0.69
|
0.28
|
72.24
|
59.74
|
66.20
|
0.73
|
0.32
|
RF
|
76.10
|
76.58
|
76.35
|
0.83
|
0.53
|
74.63
|
76.36
|
75.46
|
0.84
|
0.51
|
LR
|
69.68
|
69.00
|
69.32
|
0.75
|
0.39
|
71.64
|
69.01
|
70.37
|
0.77
|
0.41
|
XGB
|
71.55
|
71.80
|
71.68
|
0.78
|
0.43
|
72.54
|
70.93
|
71.76
|
0.80
|
0.44
|
KNN
|
70.33
|
70.40
|
70.36
|
0.77
|
0.41
|
70.75
|
70.93
|
70.83
|
0.79
|
0.42
|
GNB
|
65.20
|
66.13
|
65.69
|
0.70
|
0.31
|
69.55
|
68.05
|
68.83
|
0.73
|
0.38
|
SVM
|
74.80
|
73.79
|
74.27
|
0.81
|
0.49
|
71.34
|
74.76
|
72.99
|
0.81
|
0.46
|
3-D descriptors
Based on nine selected 3-D descriptors, RF performs best among all the classifiers with balanced sensitivity, specificity on training (AUC = 0.75), and validation dataset (AUC = 0.74), as shown in Supplementary Table S2. After removing four features with the help of SVC,-L1, the performance is computed on the best five 3-D descriptors. In this case, RF outperforms all other classifiers with highest AUC (0.741 and 0.729) on training and testing data. Whereas, XGB and performers quite well AUC~0.73 on training data and AUC~0.71 on validation data, as shown in Table 2.
Table 2: Performance of ML-based models 5 selected 3-D descriptors on training and validation dataset
Classifier
|
Training Dataset
|
Validation Dataset
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
DT
|
64.80
|
62.00
|
63.33
|
0.68
|
0.27
|
67.16
|
51.76
|
59.72
|
0.66
|
0.19
|
RF
|
67.15
|
66.35
|
66.73
|
0.74
|
0.34
|
66.27
|
65.18
|
65.74
|
0.73
|
0.31
|
LR
|
65.77
|
65.54
|
65.65
|
0.71
|
0.31
|
65.67
|
64.54
|
65.12
|
0.70
|
0.30
|
XGB
|
65.29
|
66.94
|
66.15
|
0.73
|
0.32
|
65.67
|
66.13
|
65.90
|
0.72
|
0.32
|
KNN
|
68.21
|
67.01
|
67.58
|
0.74
|
0.35
|
69.85
|
62.62
|
66.36
|
0.73
|
0.33
|
GNB
|
65.85
|
65.69
|
65.77
|
0.71
|
0.32
|
67.46
|
61.98
|
64.82
|
0.70
|
0.30
|
SVM
|
66.91
|
66.50
|
66.69
|
0.73
|
0.33
|
66.87
|
65.18
|
66.05
|
0.71
|
0.32
|
FP descriptors
Further, we developed classification models based on FP descriptors. Firstly, we used 1622 features after removing low variance and highly correlated descriptors. RF algorithm achieves maximum performance with AUC (0.86) on balanced sensitivity and specificity on both training and validation dataset. In this case, SVM also achieve comparable performance i.e., AUC (training data = 0.84 and testing data = 0.85), results of XGB, GBM, LR, DT, KNN is provided in Supplementary Table S3. We also developed models 116 features selected using SVC-L1 method and achieved nearly same performance (Table 3). These results shows that FP based models outperform all the classification models based on 2-D and 3-D chemical features.
Table 3: The performance of machine learning models on 116 FP based features on training and validation dataset
Classifier
|
Training Dataset
|
Validation Dataset
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
DT
|
64.96
|
65.24
|
65.11
|
0.71
|
0.30
|
67.46
|
61.66
|
64.66
|
0.70
|
0.29
|
RF
|
78.46
|
77.61
|
78.01
|
0.86
|
0.56
|
79.40
|
77.96
|
78.70
|
0.86
|
0.57
|
LR
|
75.85
|
76.66
|
76.28
|
0.83
|
0.53
|
72.84
|
76.68
|
74.69
|
0.81
|
0.50
|
XGB
|
77.32
|
77.54
|
77.43
|
0.84
|
0.55
|
77.91
|
80.83
|
79.32
|
0.86
|
0.59
|
KNN
|
76.18
|
75.04
|
75.58
|
0.83
|
0.51
|
77.02
|
73.80
|
75.46
|
0.83
|
0.51
|
GNB
|
73.98
|
74.08
|
74.03
|
0.81
|
0.48
|
69.55
|
73.80
|
71.61
|
0.79
|
0.43
|
SVM
|
78.62
|
78.35
|
78.48
|
0.86
|
0.57
|
77.31
|
80.19
|
78.70
|
0.86
|
0.58
|
Performance of Hybrid model
In order to improve the performance, we combine 2-D (41 features), 3-D (5 features), and FP(116 features) descriptors and developed the models using 162 features. The performance of RF based models using combined features was 0.87 and 0.88 AUC on training and validation dataset respectively (See Supplementary Table S4). We further perform feature ranking on the combined 162 features with the help of feature selector algorithm. Finally, we obtained a minimum set of features which have almost similar performance as the above mentioned models. First we rank the features and then check the performance of top-10, 20, 30,…..162 features. Finally, we select top-49 descriptors (i.e., 14 2-D, 1 3-D and 34 FP) out of 162 feature-set as represented in Supplementary Table S4. Top-49 features perform almost similar as 162 features. RF obtained maximum AUC of 0.87, and accuracy >78.5 on both training and testing dataset with minimum sensitivity and specificity difference. The results of all other classifiers i.e., SVM, DT, KNN, LR, XGB and GBM is shown in the Table 4.
Table 4: The performance of machine learning based on hybrid model (2-D+3-D+FP) descriptors on training and validation dataset
Classifier
|
Training Dataset
|
Validation Dataset
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
Sensitivity
|
Specificity
|
Accuracy
|
AUC
|
MCC
|
DT
|
68.22
|
68.03
|
68.12
|
0.74
|
0.36
|
66.67
|
72.70
|
69.91
|
0.74
|
0.39
|
RF
|
78.42
|
78.61
|
78.52
|
0.87
|
0.57
|
79.00
|
78.16
|
78.55
|
0.87
|
0.57
|
LR
|
77.00
|
76.34
|
76.66
|
0.84
|
0.53
|
75.67
|
77.87
|
76.85
|
0.83
|
0.54
|
XGB
|
77.31
|
77.10
|
77.20
|
0.85
|
0.54
|
80.00
|
75.29
|
77.47
|
0.85
|
0.55
|
KNN
|
74.94
|
75.89
|
75.43
|
0.83
|
0.51
|
78.00
|
75.58
|
76.70
|
0.83
|
0.53
|
GNB
|
74.23
|
74.00
|
74.11
|
0.81
|
0.48
|
75.33
|
72.99
|
74.07
|
0.80
|
0.48
|
SVM
|
77.71
|
77.55
|
77.63
|
0.86
|
0.55
|
78.33
|
76.72
|
77.47
|
0.85
|
0.55
|
Repurposing of FDA-approved drugs to target STAT3
In order to identify the potential drug candidates for the inhibition of the IL6/STAT3 pathway, we downloaded 1102 FDA-approved drug molecules from the Drug Bank database [30]. Then we trace the PubChem CID (compound ID) of the FDA-approved drugs. Out of 1102 drugs the 2-D structures were available for only 842 drugs. Further, we use SDF files of 842 molecules, for the identification of potential drug candidates. We have utilized “Predict” module of our web server “STAT3In” (with default parameters, i.e., Random Forest Threshold =0.48). Our model is able to predict 19 potential drug candidates that can be used as STAT3 inhibitors. Several past studies also support our findings that these drugs act as potential inhibitors in various diseases which are linked with IL6/STAT3 activation [31-35]. We identify eight potential drugs (such as warfarin, dexpanthenol, perindopril, tamoxifen, pentagastrin, duloxetine, ledipasvir, and, olopatadine) which are utilized in the treatment of severe disease conditions like tumor progression, angiogenesis, COVID-19 progression, by inhibiting IL6/STAT3 pathway, as depicted in Table 5.
Table 5: Potential FDA-approved drug candidates predicted by our web server (STAT3In) for STAT3 inhibition
Drug Bank ID
|
FDA-Approved
Drugs
|
STAT3In
Prediction
|
Functions
|
DB00682
|
Warfarin
|
Inhibitor
|
Inhibition of IL6/STAT3-dependent fibrin production in severe listeriosis [32]
|
DB09357
|
Dexpanthenol
|
Inhibitor
|
Inhibition of LPS-induced neutrophils influx, protein leakage, and release of TNF-α and IL6 in bronchoalveolar lavage fluid (BALF) in acute lung injury [33]
|
DB00790
|
Perindopril
|
Inhibitor
|
It regulates the inflammatory mediators, NF-κB/TNF-α/IL6, and apoptosis in renal diseases [34] and inhibit the activation of STAT3 [35]. ACE inhibitor perindopril-inhibited tumor growth was associated with the suppression of angiogenesis [36].
|
DB00675
|
Tamoxifen
|
Inhibitor
|
Treatment of ER‐positive breast cancer with tamoxifen by inhibiting the IL6/STAT3 signal pathway, Inhibition of tumor growth and angiogenesis [37, 38]. Anticancer drugs that have shown potential activity in both MERS and SARS-CoV [31].
|
DB00183
|
Pentagastrin
|
Inhibitor
|
Anti-malarial, anti-fungal, anti-bacterial, and anti-inflammatory [39].
|
DB00476
|
Duloxetine
|
Inhibitor
|
Inhibit overexpression of IL6 mRNA in anxiety- and major depressive disorder (MDD), anti-inflammatory action against IL6 [40-42].
|
DB09027
|
Ledipasvir
|
Inhibitor
|
Anti-viral activity against COVID-19 [43], (sofosbuvir, and ledipasvir) inhibited STAT3 protein levels to cure HCV infections [44].
|
DB00768
|
Olopatadine
|
Inhibitor
|
Inhibit CHMCs activation and release of IL6, tryptase, and histamine and use as anti-allergy drug [45].
|
Webserver Implementation
In order to assist the scientific society, we have developed a webserver, “STAT3In,” with the capability to classify STAT3 inhibitors. We have used HTML5, JAVA, CSS3, and PHP scripts to build the web server’s front- and back end. The STAT3In web server is compatible with various devices such as mobile, iPad, tablet, and desktop, and different browsers. We have implemented the random forest model developed using hybrid chemical descriptors as the input features, in the back-end of the server. There are three major modules in the web server, defined as “Predict,” “Draw,” and “Analog design”. The comprehensive description of each module is proffered below.
Predict
The predict module allows users to classify the uncharacterized chemical compound as STAT3 inhibitor or non-inhibitor. This module can accept chemical compounds in various formats, such as SDF, SMILES, and MOL, from the users and also allow them to select the desired threshold. The users can enter either a single or multiple compounds and can also upload a file consisting of multiple chemical compounds. The output page provides the class(es) of the submitted compound(s) as STAT3 inhibitor or non-inhibitor, along with their machine learning score. The result is downloadable in comma-separated value (CSV) format and also allows users to search or sort the output table.
Draw
This module allows users to draw or alter the chemical molecule structure and provide it to the prediction model to classify the molecule as a STAT3 inhibitor or non-inhibitor. In order to make the process interactive, we have implemented Ketcher [46], which is an open-source web-based chemical structure editor. The user is allowed to select the threshold as per their suitability. The output page shows the predicted class of the molecule in the tabular form, which is downloadable in CSV format.
Analog Design
The analog design module allows users to generate the analogs using combination of submitted scaffolds, building blocks, and linkers. We have implemented SmiLib [47] software to generate the analogs. Subsequently, the generated analogs are classified into STAT3 inhibitors or non-inhibitors based on the selected threshold. The result page exhibits the class of the generated analogs as inhibitors and non-inhibitors along with their machine learning score in the tabular form, which is downloadable in the CSV format.