Identifcation of DEGs
The dataset GSE123568 including 30 ONFH samples and 10 normal control samples was used for the analysis and identification of DEGs. Comparing with the genes in the normal control samples, we identified a total of 568 DEGs in the ONFH samples, including 221 down-regulated and 347 up-regulated genes. Next, a heat map and volcano plot analysis were used to visualize these DEGs, as shown in Fig. 2a-b. The median of each sample was basically on one horizontal line, so the degree of normalization among the samples was great (Fig. 2c).
Enrichment analysis
Functional and pathway enrichment analysis was performed using the R software clusterProfiler Package. The screening criteria for significant gene sets were p<0.05 and False discovery rate (FDR) <0.25. We observed that most of the enriched gene sets were associated with the innate immune responses, thrombosis and signal transduction (Table 1). Next, GO, KEGG pathway and Reactome enrichment analyses were performed for DEGs using the R software clusterProfiler package. We will assess the DEGs enrichment pathways from multiple perspectives. Based on the Q value < 0.05, we selected the top two, three, one and four biological processes (BP), cellular components (CC), molecular functions (MF), and KEGG pathways and displayed them in chordal curve, circle diagram, bubble plots, and histogram (Fig. 3a-e). The GO enrichment analysis of DEGs also shows that ONFH samples have a stronger neutrophil response than control samples, including neutrophil activation, neutrophil activation involved in immune response, neutrophil mediated immunity, and neutrophil degranulation. KEGG pathway enrichment analysis showed that DEGs were enriched for osteoclast differentiation, leishmaniasis, the interaction of viral proteins with cytokines and cytokine receptors, and chemokine signaling pathways. Particularly, nine pathways related to immunity and thrombosis were visualized in Fig. 3f.
PPI network construction, MCODE cluster modules and hub gene identifcation
The interaction network between proteins encoded by DEGs was constructed by STRING, consisting of 174 nodes and 1,510 edges, and visualized by Cytoscape (Fig. 4a). In this network, we identified four modules using MCODE plugin based on filtering criteria, as shown in Fig. 4b-e. Cluster 1 has the highest cluster score (24 nodes and 133 edges), followed by cluster 2 (11 nodes and 53 edges), cluster 3 (18 nodes and 55 edges), and cluster 4 (11 nodes and 15 edges). Fifteen hub genes such as cytochrome b-245 beta chain (CYBB) and CD86 molecule (CD86) were selected by intersecting the results of five algorithms of cytohubba, including MCC, DMNC, degree, MNC, and clustering coefficients as showed in Table 2 (16). The fifteen hub genes may play an imprtant role in the pathogenesis of ONFH. Finally, we analysed the correlation among the fifteen hub genes by using ggplot2 R package. We found that most genes were positively correlated with each others (Fig. 4f).
Table 2
15 hub genes identifed by cytoHubba in PPI network.
Gene symbol
|
Description
|
log2FC
|
Q value
|
Regulation
|
CYBB
|
cytochrome b-245 beta chain
|
1.200
|
0.002
|
Up
|
CD86
|
CD86 molecule
|
1.226
|
2.65E-04
|
Up
|
TYROBP
|
transmembrane immune signaling adaptor TYROBP
|
1.180
|
5.42E-05
|
Up
|
TLR4
|
toll like receptor 4
|
1.149
|
8.58E-04
|
Up
|
LILRB2
|
leukocyte immunoglobulin like receptor B2
|
1.200
|
2.21E-04
|
Up
|
TLR2
|
toll like receptor 2
|
1.381
|
9.83E-05
|
Up
|
TLR8
|
toll like receptor 8
|
1.329
|
9.60E-05
|
Up
|
CCR1
|
C-C motif chemokine receptor 1
|
1.123
|
0.003
|
Up
|
TLR1
|
toll like receptor 1
|
1.098
|
2.57E-04
|
Up
|
ITGAX
|
integrin subunit alpha X
|
1.143
|
1.97E-04
|
Up
|
NCF2
|
neutrophil cytosolic factor 2
|
1.242
|
1.22E-04
|
Up
|
TREM1
|
triggering receptor expressed on myeloid cells 1
|
1.465
|
1.25E-04
|
Up
|
IRF8
|
interferon regulatory factor 8
|
1.072
|
3.31E-05
|
Up
|
NOD2
|
nucleotide binding oligomerization domain containing 2
|
1.039
|
0.006
|
Up
|
PLEK
|
pleckstrin
|
1.008
|
1.77E-04
|
Up
|
FC: fold change, Q value: adjust P-value. |
Construction of the miRNA-mRNA and protein-chemical interaction networks
We used the NetworkAnalyst database to predict the target miRNAs and chemicals of hub genes. We obtained 81 target miRNAs for 7 central genes (Fig. 5a) and 8 target chemicals for 2 central genes (Fig. 5b).
ROC curve of 15 hub genes in the peripheral secrum samples (GSE123568)
The pROC and ggplot2 R package were used to analyse and draw the ROC curve of 15 hub genes in the peripheral secrum samples. Finally, the hub genes with hige diagnostic value (area under the curve, AUC > 0.8) were selected as key genes for further analysis, including CD86 (AUC=0.847), IRF8 (AUC=0.877), ITGAX (AUC=0.847), LILRB2 (AUC=0.880), NCF2 (AUC=0.887), PLEK (AUC=0.863), TLR1 (AUC=0.847), TLR2 (AUC=0.860), TLR4 (AUC=0.837), TREM1 (AUC=0.863), and TYROBP (AUC=0.873) (Fig. 6). Hence, we hypothesized they were the biomarkers for the diagnosis of ONFH based on our present samples.
The expression of 15 hub genes in the hip cartilage samples (GSE74089)
We analysed the expression of 15 hub genes in the hip cartilage samples, and identitied 4 key genes NOD2 (P value: 0.002 in GSE123568, 0.007 in GSE 74089), PLEK (P value: <0.001 in GSE123568, 0.001 in GSE 74089), TLR2 (P value: <0.001 in GSE123568, 0.038 in GSE 74089), TREM1 (P value: <0.001 in GSE123568, <0.001 in GSE 74089). As showed in Fig. 7a-c, the expression of four key genes were all increased in the peripheral secrum samples (GSE123568) and hip cartilage samples (GSE74089). Finally, three hub genes (PLEK, TREM1, TLR2) whose AUC (ROC)>0.8 and expression trend was verified in GSE74089 was selected as key genes in the procession of ONFH.
Tissue and intracellular distribution of PLEK, TREM1 and TLR2
We compared the gene expression score of three key genes in Bgee database and the gene expression level of three key genes in HPA database. We found that, in gene expression level, PLEK was highly expressed in bone marrow and lowly in cartilage, TLR2 and TREM1 were highly expressed in blood and lowly in skeletal muscle tissue (Fig. 8a). In gene expression scores, PLEK was highly expressed in bone marrow and lowly in muscle tissues, TLR2 and TREM1 were highly expressed in blood and lowly in muscle tissue (Fig. 8b). Finally, we explored the intracellular distribution of key genes, the results showed that PLEK mainly expressed in nucleus, TREM1 in dictyosome, and TLR2 in nucleoplasm and mitochondria (Fig. 8c).
Prediction of target ncRNAs and construction of ceRNA networks
It is well known that miRNAs induce gene silencing and down-regulate the gene expression by binding mRNAs. However, circRNA and lncRNA can up-regulate the expression of gene by binding miRNAs and competing with mRNAs. Such interactions between mRNAs, miRNAs, circRNAs, and lncRNAs are known as ceRNA networks (12). We selected ncRNAs present in most miRNA prediction results as our predicted circRNAs and lncRNAs. Finally, we obtained 13 target lncRNAs and 27 target circRNAs for PLEK 4 target miRNAs; and 13 target circRNAs and 19 target lncRNAs for TLR2 8 target miRNAs. But the predicted miRNAs of TREM1 was absent in NetworkAnalyst database. So only two ceRNA networks were constructed by Cytoscape based on the prediction results in the end (Fig. 9a-b). Subsequently, we conducted a literature search and selected two reproted down-regulated lncRNAs and one down-regulated miRNA in ONFH, one up-regulated miRNA in senescence of vascular smooth muscle cells, one down-regulated miRNA in processe of osteoblast differentiation, and one circRNA that presented in most miRNA prediction results of TLR2. Finally, we proposed four potential RNA regulatory pathways of ONFH: MALAT1 (metastasis associated lung adenocarcinoma transcript 1)-miR-146b-5p-TLR2, MALAT1- miR-664b-3p-PLEK, NORAD (non-coding RNA activated by DNA damage )-miR-106b-5p-TLR2, and MSMO1(methylsterol monooxygenase 1)-miR-106b-5p-TLR2 (Fig. 9c).