Identification of Aberrantly Expressed Genes
Raw data of the microarray datasets were downloaded from GEO database and then further processed with GEO2R tool. As a consequence, a total number of 3282 DEGs were screened out from the dataset GSE121367 with 1819 up-regulated and 1463 down-regulated genes. The volcano plot of GSE121367 is shown in Fig. 1a. The red plots represent the up-regulated mRNAs, green plots represent down-regulated mRNAs and black plots represent mRNAs without differentially expression based on the threshold value of P-value < 0.05 and |log fold change (FC)| > 2. The differential results of each comparison group were shown in Additional file 1. The top 250 of DEGs with significant fold change were also drawn with heatmap (Fig. 1b). Red pane shows high expression level and green pane shows low expression level.
Gene Functional Enrichment Analysis
To illustrate the functional annotations of DEGs, GO term enrichment analysis was carried out for the DEGs with DAVID(Additional file 2). Three categories of GO terms including biological process (BP), cellular component (CC), and molecular function (MF) results were presented in Fig. 1c-e and Table 1–2. The color of the bubble indicates the –log 10 P-value of the term and the size of the bubble indicates the number of the DEGs. GO analysis suggested that changes in BP term of top 250 key genes were significantly enriched in “negative regulation of DNA binding”, “type I interferon signaling pathway” and “neuron migration” (Fig. 1c), As for CC term, these genes showed enrichment in “anchored component of membrane”, “extracellular space” and “multivesicular body” (Fig. 1d), Besides, MF term indicated enrichment predominantly at “protein dimerization activity”, “Notch binding” and “transporter activity” (Fig. 1e). To further analyze the DEG-enriched pathways, KEGG pathway analysis was subsequently conducted༈Additional file 3༉. As shown in Fig. 1f, it covered “Cell adhesion molecules pathway” and “Endocytosis pathway”. This functional investigation identified that these DEGs had close association with changes of DNA binding and cell adhesion pathway. Furthermore, we also analyzed the pathway of differential genes by the website of Metascape (Fig. 1g-h). The Reactome pathway analysis revealed that differential genes were enriched in “mesenchymal cell differential”, “cell-cell adhesion mediated by cadherin” and “negative regulation of DNA binding and cell proliferation”.
Table 1
GO enrichment analysis of DEGs
Category | Term | Count | P Value |
GOTERM_BP_DIRECT | GO:0021983 ~ pituitary gland development | 5 | 2.52E-04 |
GOTERM_BP_DIRECT | GO:0043392 ~ negative regulation of DNA binding | 5 | 2.52E-04 |
GOTERM_BP_DIRECT | GO:0060337 ~ type I interferon signaling pathway | 6 | 7.45E-04 |
GOTERM_BP_DIRECT | GO:0010628 ~ positive regulation of gene expression | 11 | 7.56E-04 |
GOTERM_BP_DIRECT | GO:0001764 ~ neuron migration | 7 | 0.001169 |
GOTERM_CC_DIRECT | GO:0031225 ~ anchored component of membrane | 8 | 0.000235 |
GOTERM_CC_DIRECT | GO:0005615 ~ extracellular space | 29 | 0.000690 |
GOTERM_CC_DIRECT | GO:0005771 ~ multivesicular body | 4 | 0.001670 |
GOTERM_CC_DIRECT | GO:0005576 ~ extracellular region | 31 | 0.002465 |
GOTERM_CC_DIRECT | GO:0016324 ~ apical plasma membrane | 10 | 0.004640 |
GOTERM_MF_DIRECT | GO:0046983 ~ protein dimerization activity | 7 | 0.006223 |
GOTERM_MF_DIRECT | GO:0008201 ~ heparin binding | 7 | 0.008454 |
GOTERM_MF_DIRECT | GO:0005112 ~ Notch binding | 3 | 0.016286 |
GOTERM_MF_DIRECT | GO:0046872 ~ metal ion binding | 34 | 0.016772 |
GOTERM_MF_DIRECT | GO:0005215 ~ transporter activity | 7 | 0.023963 |
Table 2
Pathway enrichment analysis of DEGs
Category | Term | Count | P Value |
KEGG_PATHWAY | hsa04514:Cell adhesion molecules (CAMs) | 7 | 0.0046 |
KEGG_PATHWAY | hsa04612:Antigen processing and presentation | 5 | 0.0096 |
KEGG_PATHWAY | hsa05168:Herpes simplex infection | 7 | 0.0153 |
KEGG_PATHWAY | hsa04144:Endocytosis | 8 | 0.0166 |
KEGG_PATHWAY | hsa04145:Phagosome | 6 | 0.0246 |
Data Processing and Gene Set Enrichment Analysis (GSEA)
Although differential expression of individual genes could play a critical role in mechanistic aspects of cellular regulation, many compounds and genes are regulated complicatedly. For the sake of categorizing such modules of cellular regulation, bioinformatics approaches for “gene set enrichment” (GSEA) statistics have been developed [22]. The consequences of GSEA analysis revealed that 428 gene sets were upregulated in the IshikawaPR cell line with P-value < 0.05, among which 145 gene sets were significantly enriched at nominal P-value < 0.01. A total of 116 gene sets were upregulated in the Ishikawa cell line with P-value < 0.05, among which 34 gene sets were significantly enriched at nominal P-value < 0.01 (Additional file 2). As is shown in Table 3, pathways including interferon gamma response, TNF-a signaling via NF-KB, epithelial mesenchymal transition, interleukin1 beta production and negative regulation of response to drug were significantly enriched in IshikawaPR cell line (Fig. 2b). While in Ishikawa cell line (Table 4), the consequences showed that pathways about mesenchymal to epithelial transition, negative regulation of insulin secretion, apical junction assembly and plasma membrane receptor complex compounds were highly enriched (Fig. 2c). All the differential results of gene sets were visualized by Enrichment Map plug-in of Cytoscape (Fig. 2a). Altogether, these data suggested that when Ishikawa cells were stimulated and selected by MPA for almost 10 months, the functions of cell signal transduction such as nuclear receptor activity and cytokine biosynthetic process including interferon gamma, interleukin1 production and epithelial cell polarity were dramatically changed.
Table 3
Upregulated gene sets in the IshikawaPR cell line
Gene Sets | SIZE | ES | NES | NOM p-value | FDR q-value |
Interferon gamma response | 196 | 0.55 | 2.07 | 0.00 | 0.00 |
TNF-a signaling via NF-KB | 196 | 0.46 | 1.72 | 0.00 | 0.01 |
Epithelial mesenchymal transition | 195 | 0.43 | 1.62 | 0.00 | 0.01 |
Hypoxia | 191 | 0.39 | 1.47 | 0.00 | 0.04 |
Complement | 193 | 0.44 | 1.67 | 0.00 | 0.01 |
Negative regulation of regulated secretory pathway | 23 | 0.68 | 1.73 | 0.00 | 0.27 |
Chronic inflammatory response | 18 | 0.69 | 1.72 | 0.01 | 0.27 |
Interleukin1 production | 90 | 0.49 | 1.69 | 0.00 | 0.30 |
Interferon gamma mediated signaling pathway | 87 | 0.51 | 1.74 | 0.00 | 0.30 |
Negative regulation of response to drug | 25 | 0.63 | 1.68 | 0.00 | 0.30 |
Table 4
Upregulated gene sets in the Ishikawa cell line
Gene Sets | SIZE | ES | NES | NOM p-value | FDR q-value |
ATP dependent microtubule motor activity plus end directed | 26 | -0.65 | -1.76 | 0.00 | 0.62 |
Mesenchymal to epithelial transition | 20 | -0.69 | -1.75 | 0.00 | 0.58 |
Phospholipid catabolic process | 38 | -0.59 | -1.75 | 0.00 | 0.55 |
Respiratory chain complex IV | 15 | -0.72 | -1.73 | 0.01 | 0.56 |
Transcytosis | 18 | -0.69 | -1.72 | 0.01 | 0.52 |
Positive regulation of protein localization to cell periphery | 60 | -0.54 | -1.69 | 0.00 | 0.59 |
Negative regulation of insulin secretion | 36 | -0.55 | -1.59 | 0.01 | 0.66 |
Apical junction assembly | 58 | -0.48 | -1.52 | 0.01 | 0.88 |
Cell cell adhesion via plasma membrane adhesion molecules | 253 | -0.36 | -1.39 | 0.01 | 0.94 |
Plasma membrane receptor complex | 184 | -0.36 | -1.37 | 0.01 | 0.94 |
PPI Network Construction of DEGs and Verification of Hub genes
The top 250 DEGs (P < 0.05) of GSE121367 were used to construct PPI network by the database of STRING and visualized in Cytoscape software (Fig. 3a). The red color of a node reflects the upregulated gene and the green means the downregulated gene. The size of the node indicates the connectivity degree and the width of edge displays the combined score. PPI network analysis had been studied by using the threshold value of confidence > 0.4 and connectivity degree ≥ 10. In this network, it contained 159 nodes and 244 edges. The plug-in of cytoHubba in Cytoscape was used to screen hub genes, then a significant submodule was obtained (Fig. 3b), from which we chose the hub genes with high scores. Finally, 10 common hub genes (CDH1, JAG1, PTGES, EPCAM, CNTNAP2, TBX1, MSX1, KRT19, OAS1 and DAB2) were identified in the subnetwork (Table 5 and Table 6). Next, we observed the mutation and DNA copy number alterations of 10 key genes (Fig. 3c). As is shown in the OncoPrint tab, it demonstrated a visual summary of the different alterations of ten hub genes across all sets of uterine corpus endometrial carcinoma samples based on a query of the ten genes. Each row represents a gene, and each column represents a tumor sample. Red bars indicate gene amplifications, blue bars are deep deletions, grey bars are no alterations and green squares are missense mutation. The genomic alteration rates of hub genes were < 10% in all enrolled endometrial cancer cases. Furthermore, 10 hub genes were validated in database TCGA to compare gene expression between endometrial carcinoma samples and normal samples (Fig. 3d). The genes of CDH1, EPCAM, MSX1, KRT19 and OAS1 were overexpressed in tumor tissues (Additional Fig. 2), while genes including JAG1, TBX1 and DAB2 were downregulated in cancer tissues. There were no differences in the expression of PTGES and CNTNAP2 in cancer and normal samples (results not shown).
Table 5
Identification of hub genes by cytoHubba.
name | Betweenness | Bottle Neck | Closeness | Clustering Coefficient | Degree | DMNC | Ec Centricity | EPC | MCC | MNC | Radiality | Stress |
CDH1 | 7440.58 | 76 | 66.299 | 0.085 | 27 | 0.227 | 0.100 | 43.793 | 96 | 17 | 11.806 | 17516 |
JAG1 | 1707.27 | 14 | 52.316 | 0.321 | 8 | 0.428 | 0.100 | 39.736 | 26 | 6 | 11.457 | 3744 |
PTGES | 510.00 | 1 | 38.399 | 1.000 | 2 | 0.308 | 0.082 | 21.129 | 2 | 2 | 10.558 | 500 |
EPCAM | 423.05 | 6 | 52.267 | 0.378 | 10 | 0.339 | 0.090 | 41.186 | 58 | 10 | 11.312 | 1750 |
CNTNAP2 | 2582.66 | 10 | 38.916 | 0.000 | 5 | 0.000 | 0.100 | 15.668 | 5 | 1 | 10.609 | 4672 |
TBX1 | 1156.20 | 12 | 52.016 | 0.333 | 7 | 0.454 | 0.100 | 39.344 | 20 | 5 | 11.438 | 3108 |
MSX1 | 944.50 | 4 | 43.892 | 0.167 | 4 | 0.308 | 0.100 | 25.883 | 4 | 2 | 11.039 | 1976 |
KRT19 | 405.98 | 5 | 49.634 | 0.389 | 9 | 0.334 | 0.090 | 39.797 | 46 | 9 | 11.198 | 1452 |
OAS1 | 81.05 | 1 | 39.275 | 0.500 | 8 | 0.408 | 0.100 | 29.613 | 58 | 8 | 10.419 | 450 |
DAB2 | 751.752 | 2 | 45.945 | 0.267 | 6 | 0.379 | 0.112 | 32.237 | 10 | 4 | 11.191 | 1378 |
Table 6
The information of ten hub genes from GSE121367 datasets.
Gene name | logFC | P Value |
CDH1(cadherin 1) | -5.41 | 2.16E-10 |
JAG1(jagged 1) | -5.44 | 2.35E-10 |
PTGES(prostaglandin E synthase) | 7.77 | 2.42E-11 |
EPCAM(epithelial cell adhesion molecule) | -5.39 | 2.66E-10 |
CNTNAP2(contactin associated protein-like 2) | -7.10 | 1.76E-11 |
TBX1(T-box 1) | 7.61 | 2.71E-10 |
MSX1(msh homeobox 1) | -5.45 | 5.79E-10 |
KRT19(keratin 19) | -6.60 | 4.00E-10 |
OAS1(2'-5'-oligoadenylate synthetase 1) | 5.83 | 1.24E-10 |
DAB2(disabled homolog 2, mitogen-responsive phosphoprotein) | 9.00 | 3.02E-10 |
Hub Gene‑Associated MicroRNAs Network Analysis
For further investigation of the hub genes, gene-related miRNAs were predicted by miRTarBase. Main miRNAs with interactions of more than two genes were listed in Table 7. Moreover, hub gene-relevant miRNAs network was also constructed by Cytoscape (Fig. 4a). There were a total number of 8 genes, 118 miRNAs and 128 genemiRNA pairs enrolled in the network. Some miRNAs were found to play an important role in regulating essential genes. Has-miR-335-5p was predicted to regulate PTGES, OAS1, KRT19 and DAB2, has-miR-26b-5p may regulate DAB2, CDH1 and JAG1. Furthermore, genes of PTGES and JAG1 were regulated by has-miR-124-3p, whose high expression may be associated with worse survival (Additional Fig. 2), suggesting that it may be involved in tumor resistance and may become a prognostic indicator for endometrial cancer.
Table 7
The main related MicroRNAs of hub genes
MicroRNAs | Genes | Count |
has-miR-335-5p | PTGES, OAS1, KRT19, DAB2 | 4 |
has-miR-26b-5p | DAB2, CDH1, JAG1 | 3 |
has-miR-9500 | MSX1, PTGES | 2 |
has-miR-124-3p | PTGES, JAG1 | 2 |
has-miR-129-5p | DAB2, CDH1 | 2 |
has-miR-199a-5p | CDH1, JAG1 | 2 |
has-miR-193b-3p | CDH1, KRT19 | 2 |
Core Transcriptional Factors Mediation Network Analysis of Hub Genes
To identify the transcriptional regulation of the hub genes and assess the effect of TFs on the expression of the hub genes, gene-TFs regulation network was performed by using a Network Analyst network-based service. Totally, 143 TFs were included in the network, constructing 203 gene–TFs interaction pairs (Fig. 4b). In this network, MAZ was considered as the key TF to regulate five hub genes: CDH1, EPCAM, KRT19, MSX1, and TBX1. In addition, TFDP1 plays a second important role in regulating CDH1, CNTNAP2, KRT19 and MSX1 (Table 8). Furthermore, we explored the correlation of hub genes and core TFs of MAZ and TFDP1in endometrial carcinoma using TCGA datasets, respectively. From these results, we found that MAZ and TFDP1 had positive correlations with CDH1, EPCAM, MSX1, KRT19, OAS1 and had negative correlations with JAG1, TBX1 and DAB2 (Fig. 5a-b). Additionally, we found that MAZ most positively related with EPCAM and negatively related to DAB2. While TFDP1 positively associated with CDH1 and negatively associated with JAG1 significantly.
Table 8
The main related TFs of hub genes
TFs | Genes | Count |
MAZ | CDH1, EPCAM, KRT19, MSX1, TBX1 | 5 |
TFDP1 | CDH1, CNTNAP2, KRT19, MSX1 | 4 |
PPARG | CDH1, EPCAM, KRT19 | 3 |
NR2F1 | CDH1, KRT19, TBX1 | 3 |
DMAP1 | CDH1, KRT19, OAS1 | 3 |
EZH2 | CDH1, MSX1, TBX1 | 3 |
ELF3 | CDH1, EPCAM, KRT19 | 3 |
CHD1 | CDH1, KRT19, PTGES | 3 |
BCOR | EPCAM, PTGES, TBX1 | 3 |
E2F5 | KRT19, PTGES, TBX1 | 3 |
JUND | EPCAM, KRT19, TBX1 | 3 |
SMARCA5 | KRT19, PTGES, TBX1 | 3 |
Methylation Status and Expression Validation of Hub Genes in HPA
The initiation of cancer resistance was controlled by both genetic and epigenetic events. Epigenetic changes also make an important impact on occurrence of drug resistance. Therefore, we decided to detect the methylation status of hub genes. As is performed in Fig. 6a, the genes of CDH1, JAG1, EPCAM and MSX1 were aberrantly methylated, which were in consistent with their expression respectively, on the basis of Ualcan website (Fig. 6b). In addition, Immunohistochemistry (IHC) staining obtained from the HPA database showed the dysregulation of the expression of hub genes (Fig. 6c), among which CDH1, EPCAM and MSX1 were upregulated in endometrial carcinoma samples, while the expression of JAG1 were downregulated.
Tissue Specificity Analysis and Prognostic Value Evaluation of Hub Genes
To further evaluate the expression level of hub genes in different human carcinoma tissues, we explored the HPA website. From Fig. 7a-d, it indicated that the selected four hub genes had various RNA expression levels in different cancer tissues including glioma, thyroid cancer, lung cancer, colorectal cancer, head and neck cancer, stomach cancer, liver cancer, pancreatic cancer, renal cancer, urothelial cancer, prostate cancer, testis cancer, breast cancer, cervical cancer, endometrial cancer, ovarian cancer and melanoma. Moreover, CDH1 and EPCAM displayed moderate expression level in carcinoma of endometrium, while JAG1 had relatively low level. Nevertheless, MSX1 displayed the highest expression level in endometrial cancer, which showed high tissue specificity. Meanwhile, we explored the prognostic values of the four essential genes further, which were obtained in GEPIA and displayed in Fig. 7e-h. Overall survival for endometrial cancer patients was analyzed in correspondence with the low or high expression of each gene. As is shown, high mRNA expression of CDH1 (P = 0.01) was associated with better overall survival for endometrial cancer patients, along with EPCAM (P = 0.045), JAG1 (P = 0.02), MSX1 (P = 0.001).