Integrated bioinformatics analysis has been extensively applied to identify potential biomarkers closely associated with the diagnosis, treatment, and prognosis of GBM. For example, Bo et al identified potential key genes associated with the progression of GBM from GSE31262 gene expression profile data using a bioinformatics approach incorporating DEG screen, pathway enrichment analysis and co‑expression network construction [14]; Long et al identified hub genes for diagnosis and prediction of GBM from GSE90886 dataset by developing an integrated method including DEG screen, pathway enrichment analysis, PPI network construction and survival analysis [15]; Tang et al identified new insights into molecular mechanisms underlying GBM and suggested the candidate targets for the diagnosis and treatment of GBM by developing an integrated method including DEG integration, hierarchical clustering, Functional category enrichment analysis, transcription factor and target miRNAs enrichment analysis, and construction of gene/protein interaction network and analysis.[17] Bo et al further identified key genes in the recurrence of GBM from GSE32466 dataset using a bioinformatics approach incorporating differentially expressed miRNAs screen, hierarchical clustering analysis, target gene prediction of DEMs and construction of regulatory co-expression network [13]; Li et al identified hub DEGs and pathways for GBM by a network-based method that extracted from KEGG pathway database [16]; Kunkle et al revealed potential gene–environment interactions, and generated new data for hypothesis generation in GBM through a comprehensive bioinformatics method using genetic variations (copy number variations and small-scale variations) and environmental data integration that links with Glioblastoma (GEG) [38]; Celiku et al even reported a database called Glioblastoma Bio Discovery Portal (GBM-BioDP) which is a free web-accessible resource that hosts a subset of the glioblastoma TCGA data and enables an intuitive query and interactive display of the resultant data. Moreover, this resource provides visualization tools for the exploration of gene, miRNA, and protein expression, differential expression within the subtypes of GBM, and potential associations with clinical outcome, which are useful for virtual biological validation.[39] Whereas, most of these studies mainly focused on a single database with small sample size or a single bioinformatics approach which results may not that convincing. To overcome these limitations, the current study not only integrated microarray data with relative large sample size from multiple GEO datasets and RNA sequencing data from TCGA, but also constructed PPI networks and a Cox proportional hazards model to identify potential diagnostic and prognostic biomarkers in GBM.
In the present study, nine microarray datasets were integrated with RNA sequencing data downloaded from TCGA, and 512 DEGs between GBM and normal samples were identified, consisting of 212 down-regulated and 300 up-regulated genes. The functional enrichment analysis showed that the down-regulated genes were primarily implicated in various biological processes associated with modulation of chemical synaptic transmission including regulation of trans-synaptic signaling, synaptic vesicle cycle, regulation of vesicle-mediated transport, signal release and regulation of neuron projection development. For the up-regulated genes, they mainly played important roles in extracellular matrix and cell cycle regulation, such as extracellular structure organization, extracellular matrix organization, skeletal system development and cell cycle G1/S phase transition, nuclear division, organelle fission, G1/S transition of mitotic cell cycle, regulation of cell cycle phase transition, and regulation of mitotic cell cycle phase transition. Particularly, many up-regulated genes were enriched in pathways in cancer, such as PI3K-Akt signaling pathway, Cell cycle, p53 signaling pathway, and Cellular senescence, which suggested these genes might be important in carcinogenesis and metastasis of GBM. While the down-regulated genes mainly participated in synaptic chemical signal transduction associated signaling pathways, like cAMP signaling pathway, Neuroactive ligand-receptor interaction, Synaptic vesicle cycle, Endocytosis, Glutamatergic synapse, Adrenergic signaling in cardiomyocytes, and GABAergic synapse. Our findings in the functional enrichment analysis are in line with previous works.[40-42]
We also identified nine hub genes in the PPI network, namely, TP53, FN1, EGFR, MYC, RRM2, EZH2, FOXM1, CD44 and MMP2, and all of them were up-regulated genes in GBM. The TP53 gene mutations are a common mechanism for glial cell neoplasms in the 18-45-year age group but are unrelated to progression and advanced histological grade.[43] Moreover, the mutant TP53 is not only strongly associated with a poor prognosis for overall survival in patients with glioblastoma, but also may decrease the chemosensitivity of glioblastoma to temozolomide by increasing O6-methylguanine DNA-methyltransferase (MGMT) expression.[44] Besides, miR-25 and -32 might serve as positive regulators of p53, underscoring their role in tumorigenesis in glioblastoma.[45] The FN1 gene has been identified as a key gene of GBM [15] and one of the cis-expression quantitative trait locus (eQTL).[46] It’s also reported that FN1 is closely related to cell adhesion and invasion in glioblastoma.[47] EGFR is known to contribute to the malignant properties of glioblastoma multiforme (GBM), such as uncontrolled cell proliferation and evasion of apoptosis.[48] The EGFR signaling pathway is thought to play a crucial role in GBM pathogenesis as well, initiating the early stages of tumor development, sustaining tumor growth, promoting infiltration, and mediating resistance to therapy.[49] c-Myc, a crucial regulator of the Warburg effect [50], was regulated by an unexpected Akt-independent role for mTORC2 in GBM metabolic reprogramming. Importantly, overexpression of c-Myc is associated with significantly shorted survival in GBM patients.[51] It has been demonstrated that Myc-mediated regulation of glioblastoma multiforme cell differentiation [52], which might attribute to either promote or reinforce an undifferentiated phenotype required for glioma cells to respond to the oncogenic effects of elevated Ras and Akt activity.[53] Meanwhile, the Myc oncogene has been suggested to cooperatively regulate multiple downstream targets leading to changes in chromosome stability, gene mutations, and/or modulation of tumor growth in GBM.[54] Overexpression of RRM2 has been confirmed to enhance the proliferation, migration, and invasion of cancer cells in GBM [55, 56] and a potential prognostic biomarker with functional significance in GBM patients.[57] Recent study identified a novel role of BRCA1 as a transcriptional co-activator of RRM2, whereby BRCA1-mediated RRM2 expression protects GBM cells from endogenous replication stress, DNA damage and apoptosis.[58] Furthermore, RRM2 was determined to be correlated with glioma grade.[59] EZH2, the lysine methyl transferase of Polycomb repressive complex 2, is a negative prognostic factor and exhibits pro-oncogenic activity in glioblastoma.[60] EZH2 depletion strongly represses the expression of c-myc [61] and reverses the silencing of Polycomb target genes and diminishes STAT3 activity thus suppressing the tumorigenicity of glioblastoma stem-like cells.[62] Additionally, miR-101 and miR-137, are associated with a poor prognostic phenotype of GBM patients, were down-regulated in glioblastoma resulting in EZH2-induced proliferation, migration, and angiogenesis.[63, 64] FoxM1 plays a critical role in the development and progression of GBM by regulating key factors involved in cell proliferation, epithelial to mesenchymal transition (EMT), invasion, angiogenesis and upregulating Wnt/β-catenin signalling.[65, 66] Strongly elevated expression of FoxM1 is associated with poor prognosis, the regulation of metastasis and the preservation of neural, progenitor, and GBM stem cells.[66] Further, FoxM1 maintains the self-renewal and tumorigenicity of glioma stem-like cells through up-regulating the PDGF-A and STAT3 pathway [67], and promotes clonogenic growth and radiation resistance of GBM via FoxM1-Sox2 signaling axis.[68, 69] CD44 is a major cell surface hyaluronan receptor and cancer stem cell marker that participates in the progression of GBM. CD44 depletion blocks GBM growth and sensitizes GBM cells to cytotoxic drugs in vivo.[70] Lower levels of CD44 expression particularly correlated with lower survival of GBM patients which is a promising candidate for further development as a prognostic and therapeutic tool.[71] Specifically, CD44s-targeted treatment with specific mAb may prevent the progression of highly invasive GBMs.[72] It has been demonstrated that the complex balance between TIMP-2 and MMP-2 is a critical determinant of glioblastoma invasion.[73] Additionally, MMP2 is essential for cancer neovascularization and cancer invasion in promoting extracellular matrix degradation. [74]
The current study identified 22 pivotal genes associated with GBM prognosis and constructed a prognostic gene signature comprised of these genes. Among these 22 genes, RAB33B, KIAA1199, EVC, SOD2, hCG_40738, CHD9, GCSH, SUHW1, RPS6KA5, KCNG1, DECR1, PPCS, NAT10, OR2W1 and HIC2 were identified as protective prognostic genes. The prognostic value of RPS6KA5, DECR1, NAT10 and HIC2 in GBM has been evaluated before. RPS6KA5 has been identified as a protective prognostic gene in GBM based on bioinformatics approach and is significantly associated with overall survival (OS) of GBM patients.[75] DECR1, an enzyme that required for the mitochondrial β oxidation of lipids[76, 77], is highly up-regulated in GBM.[78] Moreover, a latest study based on co-expression network analysis observed that overexpression of DECR1 is relevant to the overall survival time of patients with GBM.[79] NAT10 belongs to a family of N-terminal acetyltransferases (NATs) which are arranged in complexes and believed to target ~80-90 % of all soluble human proteins. Increased expression of NAT10 correlated negatively with patient survival in the group of “all gliomas” patients.[80] HIC2 gene encodes a hypermethylated in cancer 2 protein in human which functions involve in DNA binding, protein C-terminus binding, metal ion binding and nucleic acid binding and so on. Recent study reported that frequent deletion in 9p11.2 (FOXD4L2 and AQP7P3) and 22q11.21 (HIC2) are significantly associated with short-term survivor of GBM patients based on a deep genomic comparative analysis of a cohort of patients receiving standard therapy.[81] However, the prognostic value of RAB33B, KIAA1199, EVC, SOD2, CHD9, KCNG1 and PPCS in GBM has not been validated in previous studies. RAB33B is concordantly overexpressed in GBM patients [82], whereas, the prognostic value of RAB33B has not been validated in previous studies. KIAA1199 is a newly found gene with significance in senescence, progression, distant metastasis and poor prognosis of cancer patients. Importantly, the KIAA1199 gene is located on chromosome 15q25, where shows a strong linkage to familial gliomas.[83] Motaln et al showed that hMSCs were responsible for the impairment of GBM cell proliferation, invasion and growth by affecting KIAA1199 genes.[84] However, the prognostic value of KIAA1199 in GBM remain buried. EVC transcript were identified in GBM and LGG but not in normal brain tissue, and the expression data suggests that EVC is expressed more in GBM and LGG tumors when compared with normal samples, although these data did not pass their stringent 2-fold difference in expression.[85] SOD2, encoding an intramitochondrial free radical scavenging enzyme located in the mitochondria, was identified as a novel primary GBM-specific marker.[86] Although limited to GBMs, there was a significant relationship between iNOS and SOD1 expression. The induction of both iNOS and SOD1 would be functionally and precisely controlled within the iNOS-inducing tumors.[87] CHD9 genes is highly upregulated in glioblastoma in a silicon analysis using the TCGA [88] and correlated with GBM outcomes by contributing to stress response.[89] Besides, KCNG1 and PPCS remains largely uncharacterized genes. KCNG1 encodes a member of the potassium channel, voltage-gated and subfamily G, thus regulating the potassium channel activity, voltage-gated potassium channel activity, ion channel activity and voltage-gated ion channel activity. It is reported that KCNG1 encodes extracellular antigenic epitopes with the highest difference between ovarian tumors and normal tissues.[90] PPCS is an enzyme that catalyzes the chemical reaction which constitutes the second of five steps involved in the conversion of pantothenate to Coenzyme A. The molecular function of PPCS consists of regulating phosphopantothenate--cysteine ligase activity and ligase activity. Furthermore, the prognostic value of hCG_40738, GCSH, SUHW1 and OR2W1 has not been absolutely clarified and further studies are still demanded to validate our findings, the importance of these four genes should not be underestimated.
With regard to the seven risky prognostic genes (CXCR4, TMSB10, TEK, PDCD4, ZG16, PIR and SERPINF1), the correlation of TMSB10 with GBM has been investigated before. Existing studies presented that TMSB10 is identified as a negative prognostic gene of GBM via Robust Likelihood-Based Survival Modeling Analysis.[91] CXCR4 is a cell surface chemokine receptor that have been identified as a robust mediator of glioma cell proliferation and invasion.[92] CXCR4 is proven to play a key part in glioma cell migration through HIF-1α in the pseudopalisading tumor cells.[93] Additionally, CXCL12 has been known to modulate the CXCR4 and CXCR7 activity in human glioblastoma stem-like cells and regulate the tumor microenvironment.[94] It has been reported that inhibition of CXCL12/CXCR4 autocrine/paracrine loop reduces viability of human glioblastoma stem-like cells affecting self-renewal activity. Moreover, recent evidence suggested that miR-663 negatively regulated CXCR4 to inhibit its oncogenic effect. TEK, also known as EC-specific receptor tyrosine kinase Tie2 (EC, endothelial cell), whose activation is positively and negatively modulated by angiopoietin-1 (Ang1) and angiopoietin-2 (Ang2), respectively. The modulation of Tie2/Tek activation by Ang1 and Ang2 contributes to GBM vascularization and overall growth. Recent studies have found an up-regulation in expression of Ang1, Ang2 and Tie2/Tek with increasing malignancy grade of astrocytomas.[95] It has been reported that a robust increase in Tie2/Tek expression, restricted to the EC of GBMs compared to low-grade astrocytomas and normal brain.[95] Similarly, it is found that Tie2 signal transduction plays a significant regulatory role in the pathological vascular growth of GBMs.[96] Additionally, purified ExTek treatment in intracranial models of GBMs increases the overall survival by approximately 35%. PDCD4, a well-known tumor-suppressor gene, has been identified as a functional target of mir-21.[97] It is demonstrated that downregulation of PDCD4 by mir-21 facilitates glioblastoma proliferation.[98, 99] And the loss of PDCD4 expression was closely associated with the progression and prognosis of various tumors, including glioblastomas.[100] Besides, the loss of PDCD4 contributes to enhanced chemoresistance in GBM via de-repression of Bcl-xL translation.[101] As for PIR, it has been identified as a protective prognostic gene and been reported that high expression of PIR (p = 0.025, HR 0.62 (0.41–0.95); Q1 cut-off) predicts superior overall survival of GBM patients based on univariate analysis.[102] However, unlike the previous study, our results suggest that high expression of PIR (p = 0.018, HR 1.089(1.015-1.168)) predicts inferior overall survival of GBM patients. This discrepancy may lie in the reasons as follows: (1) Previous study mainly focused on a specific subventricular zone (SVZ) of GBM patients; (2) All clinical data were obtained through review of medical charts and gathered in Department of Neurosurgery (University Hospital, Heidelberg, Germany) between 1998 and 2011; (3) Previous prognostic significance was determined using univariate and multivariate Cox regression analyses and log-rank tests. In terms of ZG16 and SERPINF1 genes, little is known about their prognostic value in GBM. ZG16, which is significantly decreased in colorectal cancer (CRC) samples compared to adjacent non-tumor tissues and associated with prognosis of CRC patients. Overexpression of ZG16 significantly inhibits growth and sphere formation of stem-like CRC cells.[103] Multiple mutations were found in the SERPINF1 (Serpin Peptidase Inhibitor, Clade F) gene which encodes PEDF (pigment epithelium-derived factor), a potent inhibitor of angiogenesis and known regulator of bone density.[104] Although the status of ZG16 and SERPINF1 and their correlation with prognosis in GBM have seldom been reported before, they could provide helpful evidence for potential prognostic biomarkers in future studies.
The limitations of our study were as follows: (1) we performed our study based on integrated bioinformatics analysis, our results need further biological experiments validation. (2) we obtained the data used in present study from public databases and we cannot evaluate the quality of these data; (3) our study mainly focused on the genes commonly identified as significantly altered ones in multiple datasets, some biological information (characteristic details such as gender, age, race, tumor grade and stage, etc.) may be overlooked in our study.