Supplementary Figure 2 summarizes the technical roadmap of this research, and the results are split into five sections as follows.
3.1 Identification of DEGs between HGSOC and LGSOC
Among the 136 GSEs screened, we evaluated a total of four series that comprised HGSOC and LGSOC broad gene expression microarray data, namely GSE73638, GSE73551, GSE27651, and GSE14001. Because the series GSE73638 and GSE73551 belong to the same study, the current study included only GSE73638 because it had a larger sample size. Table 1 summarizes the essential information for the three GSEs.
Table 1
Sample details of the selected GEO Series.
GEO Series
|
Publication date
|
Platform
|
Samples
|
Source
|
Cell type
|
Case number
|
GSE73638
|
8-Nov-16
|
GPL20967
|
102
|
Primary tumor
|
Clear cell
|
12
|
Endometrioid
|
9
|
Mucinous
|
9
|
Serous low-grade
|
7
|
Serous high-grade
|
13
|
Ovarian cells
|
|
52
|
GSE27651
|
4-Mar-11
|
GPL570
|
49
|
Primary tumor
|
Serous low-grade
|
13
|
Serous high-grade
|
21
|
Low-malignant tumor of the ovary
|
9
|
Normal ovarian surface epithelials cells
|
6
|
GSE14001
|
31-May-09
|
GPL570
|
23
|
Primary tumor
|
Serous low-grade
|
10
|
Serous high-grade
|
10
|
Normal ovarian surface epithelials cells
|
3
|
We utilized the GEO2R online program to identify 1465, 9914, and 230 distinct genes from GSE73638, GSE27651, and GSE14001, respectively (Figure 2A, adj.P 0.01, |logFC| >1). The DEGs in the GSE14001 are restricted by the aforementioned threshold. To guarantee the availability of sufficient DEGs for further research, 9500 DEGs in GSE14001 were identified with a threshold adj.P 0.05 and utilized to confirm the overlapping DEGs between GSE73638 and GSE27651. According to the Venn diagram, overlapping DEGs between GSE73638 and GSE27651 comprised 157 upregulated and 204 downregulated genes (Figure 2B). Following validation by GSE14001, 79 upregulated and 85 downregulated genes were chosen for the current study (Figure 2C). We used heat maps to depict the distribution of screened gene expression in each GSE between HGSOC and LGSOC (Figure 2D).
3.2 Enrichment analysis for DEGs
We used the WebGestalt web tool to perform GO and KEGG enrichment analyses to identify the most important biological processes (BPs) and pathways. In total, 164 DEGs were mostly enriched in the BPs associated with mitotic cell cycle, organelle fission, and nuclear division (Figure 3A) and pathways such as hepatitis C, micro RNAs in cancer, and chronic myeloid leukemia (Figure 3B). In addition, we used GSEA to validate the GOBP and KEGG pathways. The GOBP or KEGG pathways that were substantially represented with a normalized p value of <0.05 were presented. In the HGSOC group, 363, 8, and 31 BPs were enriched in GSE73638, GSE27651, and GSE14001, respectively, and three BPs were enriched in all three GSEs (Figure 3C). In the LGSOC group, 88, 78, and 83 BPs were enriched in GSE73638, GSE27651, and GSE14001, respectively, whereas 10 BPs were enriched in two GSEs (Figure 3C). Of the three GSEs, only GSE73638 was enriched in 13 KEGG pathways in the HGSOC group (Figure 3D). In the LGSOC group, the number of KEGG pathways enriched in GSE73638, GSE27651, and GSE14001 was 1, 1, and 5, respectively, and no overlap was found among the GSEs (Figure 3D).
Meiotic cell cycle process, homologous chromosome segregation, and meiosis I cell cycle process were the BPs confirmed by the three GSEs (Figure 4). Cell cycle process and chromosomal segregation were the GOBPs confirmed through WebGestalt and GSEA. Chemokine signaling route and oocyte meiosis were the KEGG pathways confirmed through WebGestalt and GSEA.
3.3 PPI network construction and significant module identification
We utilized the String database to estimate the protein-level connection of the overlapped DEGs (Figure 5A). We improved the visualization with Cytoscape software and constructed a PPI network with 115 nodes and 894 edges (Figure 5B). MCODE was used to divide the PPI network into four modules (Figure 5C); the first module had 38 nodes and 661 edges (MCODE score 35.730), the second module had 7 nodes and 21 edges (MCODE score 7.000), the third module had 5 nodes and 10 edges (MCODE score 5.000), and the fourth module had 3 nodes and 3 edges (MCODE score 3.000). We utilized Cytohubba to filter the top 10 Hubba nodes, namely BIRC5, CDC20, CDK1, CDKN3, MKI67, NUSAP1, RRM2, TOP2A, TPX2, and UBE2C, for additional investigation using Radiality topological techniques (Figure 5D).
The expression differences between HGSOC and LGSOC could be displayed more naturally in the three GSEs by using the mountain map (Supplementary Figure 3A) and the grouped violin map (Supplementary Figure 3B).
3.4 Analysis of the hub genes
First, we used the GTEx website and the Oncomine database to evaluate the differential expression of 10 hub genes in normal ovarian tissues and OC tissues. Except for NUSAP1, which was moderately expressed in normal ovarian tissues, the expression levels of the other nine hub genes were low (Figure 6A). Other normal female reproductive organs, such as the breast, cervix, and endometrium, exhibited a minimal expression of these 10 hub genes. All the 10 hub genes were highly expressed in OC tissues, similar to those in other female common malignancies such as breast cancer and cervical cancer (Figure 6B).
Second, we used the GEPIA database to investigate the connection between the 10 hub genes and OC OS/staging. Among the 10 hub genes, only BIRC5 was found to be favorably linked with OS in OC (pHR = 0.014, Figure 7A), and only RRM2 was found to be negatively correlated with OC staging (p = 0.0251, Figure 7B).
Third, we used the cBioportal website to examine the alterations of the 10 OC hub genes. The oncoPrint indicated that the amplification of BIRC5, RRM2, and CDC20 has no evident concomitance, whereas the amplification of CDK1, MKI67, UBE2C, and TPX2 has a possible concomitance (Figure 8A). In OC, BIRC5 displayed more visible amplification and gain mutations, whereas RRM2 displayed less visible amplification and gain mutations (Figure 8B).
3.5 Re-analysis of the genes with a potential clinical value
Based on the aforementioned results, BIRC5 and RRM2 with potential therapeutic utility were chosen from a list of 10 hub genes for further investigations.
First, we examined the co-expressed genes of BIRC5 and RRM2 in the Oncomine database and compared these genes with 164 overlapping DEGs. The DEGs, NUF2, KIF23, DEPDC1, TOP2A, and GPSM2, were found to co-express with BIRC5 (Supplementary Figure 4A), whereas DLGAP5, CDKN3, ZWINT, MELK, and CDC20 were found to co-express with RRM2 (Supplementary Figure 4B).
Second, we performed IHC on BIRC5 and RRM2 by using the HPA database. By using antibodies HPA002830 and CAB004270, we found that the expression of BIRC5 in OC tissues was greater than that in normal ovarian tissues (Figure 9A-B). RRM2 expression in OC tissues could not be identified with the antibody HPA056994 in both OC tissues and normal ovarian tissues (Figure 9A&C).
Third, we utilized the miRWalk and Targetscan databases to identify BIRC5 and RRM2 upstream miRNAs. The intersection of the two datasets revealed that 88 miRNAs regulate BIRC5 expression, whereas 76 miRNAs regulate RRM2 expression. Simultaneously, four miRNAs were found to control both BIRC5 and RRM2 expressions (Figure 10A). The mRNA–miRNA map was constructed using Cytoscape, with the size of the node representing the strength of the connection (Figure 10B). Then, we validated overlapping miRNAs by using GSE61741, which contains miRNA profiles from 24 OC blood samples and 94 normal blood samples. The GEO2R online tool was used to identify 185 differential miRNAs from GSE61741 (Figure 10C, adj.P 0.05). The Venn diagram (Figure 10D) depicts the distribution of intersection miRNAs for BIRC5 and RRM2 across the GSE61741, miRWalk database, and Targetscan database (Figure 10E). Although 5 overlapping miRNAs were screened, we inferred that the upstream miRNAs of BIRC5 and RRM2 exhibit low expressions in OC tissues due to the negative regulatory connection between miRNA and mRNA. In addition, by using the heat map, hsa-miR-520a-3p, hsa-miR-1323, and hsa-miR-324-3p were found to match the requirements. Cytoscape was used to visualize the miRNA–mRNA map (Figure 10F).