Identification of significantly differentially expressed genes
Gene expression profiles for bladder cancer and paracancerous tissues were obtained from the GEO (https://www.ncbi.nlm.nih.gov/geo/) and TCGA databases(https://portal.gdc.cancer.gov/). After data filtering, normalisation, and ID transformation, two expression matrices were obtained from 414 bladder cancer samples and 19 adjacent bladder tissues in TCGA and 165 primary bladder cancer samples and 9 normal bladder tissues in the GSE13507 dataset. The two datasets were used to screen for DEGs based on the established criteria and results were visualised as heat maps and volcano plots, as shown in Fig. 2. In total, 3461 and 1069 DEGs met the screening criteria in the TCGA-BC and GSE13507 datasets.
Weighted gene co-expression network construction
To identify potential mechanisms underlying the development and progression of bladder cancer and to explore candidate biomarkers, the WGCNA package was used to construct a weighted gene co-expression network for both datasets (TCGA-BC and GSE13507). Seven gene modules were identified in TCGA-BC (Fig. 3) and 12 gene modules were identified in the GSE13507 dataset (Fig. 4, with different colours representing different gene modules). Subsequently, correlations between modules and clinical characters (tumour and normal) in TCGA-BC and GSE13507 were visualised by heatmaps (Fig. 3B, 4B). We identified modules that were highly associated with normal tissues (blue module in TCGA: r = 0.85, p < 1e − 200; tan module in GSE13507: r = 0.64, p = 9.6e − 11) and tumour tissues (green module in TCGA: r = 0.7, p = 1.7e − 95; yellow module in GSE13507: r = 0.49, p = 3.6e − 65). The overlap between genes in modules and DEGs in the two clinical groups (tumour and normal) were visualised by a Venn diagram. There were 11 overlapping genes in the normal group and 76 genes in the tumour group (Fig. 5). These 87 overlapping genes were used for subsequent analyses.
Functional annotation of overlapping DEGs
The potential functions of 11 overlapping genes in the normal group and 76 genes in the tumour group were explored by GO and KEGG pathway enrichment analyses. As shown in Fig. 6, in the BP category, 11 overlapping genes in the normal group were mainly enriched in neurotransmitter uptake, long-term synaptic potentiation, and serotonin transport (Fig. 6A), and 76 genes in the tumour group were mainly related to mitotic nuclear division, nuclear division, sister chromatid segregation, and regulation of chromosome segregation (Fig. 6B), which are important processes in the development of tumours. In the CC category, overlapping genes in the normal group were related to growth cone and the site of polarised growth, and genes in the tumour group were related to chromosomal regions, condensed chromosomes, centromeric regions, and condensed chromosome kinetochores. In the MF category, genes in the normal groups were mainly involved in SNARE binding and iron ion binding, and overlapping genes in the tumour group were mainly enriched in ATPase activity and 3′ to 5' DNA helicase activity. A KEGG pathway enrichment analysis showed that intersecting genes in the normal group were linked to arachidonic acid metabolism and synaptic vesicle cycle and those in the tumour group were related to mitotic nuclear division, regulation of mitotic nuclear division, and metaphase/anaphase transition of mitotic cell cycle (supplemental online Fig. 1).
PPI network construction and hub gene identification
PPI networks for the normal and tumour groups were constructed using the STRING database(https://www.string-db.org/) (Fig. 7) and the CytoHubba plugin was then used to calculate scores based on the MCC algorithm in order to select hub genes in the cytosol. As shown in Fig. 7A, the top 20 genes with the highest scores in the tumour group, including KIF2C, NUSAP1, MELK, PBK, KIF20A, AURKA, NCAPG, TPX2, KIF4A, ASPM, AURKB, CDC20, CCNB1, BUB1B, CCNB2, CCNA2, BUB1, TOP2A, UBE2C, and TTK, and the top 11 genes with the highest scores in the normal group. including GPM6B, CYP2U1, SNCA, PLP1, LGI4, RELN, MPZ, CDH19, GFRA3, COBL, and SNAP25, were identified as hub genes.
Prognostic marker screening and validation of the expression of proteins encoded by hub genes
To gain further insight into the prognostic value of hub genes, the R survival package was used to analyse 31 hub genes in both normal and tumour groups. Nine genes were associated with the prognosis of bladder cancer (Fig. 8).
CDH19, RELN, PLP1, and TRIB3 were significantly associated with prognosis (P < 0.05). Moreover, these four hub genes were closely related to the clinical stage of bladder cancer (supplemental online Fig. 2). The expression levels of the four hub genes were verified in TCGA-BC dataset (supplemental online Fig. 3), CDH19, RELN, and PLP1 levels in normal bladder tissues were significantly higher than those in bladder cancer tissues, while the expression of TRIB3 was obviously higher in bladder cancer tissues than in normal bladder tissues. Furthermore, the protein expression level of TRIB3 was explored using the HPA online database(http://www.proteinatlas.org/). As shown in supplemental online Fig. 4, the expression of TRIB3 in normal bladder tissues was higher than that in tumour tissues, contrary to the results for RNA expression; further studies are needed to validate these findings.
Identification of hub gene-related pathways by a gene set enrichment analysis
Pathways involving four hub genes in bladder cancer were evaluated by a GSEA. The expression levels of hub genes were categorised as high and low in accordance with the cutoff value for hub genes. Then, a GSEA was performed using two datasets for four hub genes based on the normalised enrichment score (NES), nominal P-value (NOM P-val), and false discovery rate (FDR). The enrichment results for the MSigDB gene sets were significantly different according to the GSEA results (NOM P-val < 0.05, FDR < 0.05). As shown in Figure supplemental online Fig. 5A, the MAPK, VEGF, and cell adhesion molecule pathways were significantly enriched in the CDH19-related phenotype. The cell adhesion molecules, MAPK signalling pathway, JAK-STAT signalling pathway, and cancer-related pathways were significantly enriched in the PLP1-related phenotype (supplemental online Fig. 5B). The WNT signalling pathway, cancer-related in pathways, VEGF signalling pathway, and MAPK signalling pathway were significantly enriched in the RELN-related phenotype (supplemental online Fig. 5C). Moreover, TRIB3 was significantly related to bladder cancer, cell cycle, and P53 signalling pathways (supplemental online Fig. 5D).
Associations between hub genes and immune infiltration or immune therapy
The relationships between hub genes and immune cell infiltration were explored using the TIMER algorithm(http://cistrome.dfci.harvard.edu/TIMER/) 16. As shown in supplemental online Fig. 6 and supplemental online Table 1, multiple types of copy number alterations in the four hub genes, especially deletions, were significantly related to immune cells in bladder cancer, and the deletion of hub genes can reduce the level of immune infiltration in multiple immune cells. Negative correlations were detected, as shown in supplemental online Fig. 7, between hub genes and tumour purity (CDH19, r = -0.26, p = 4.29e-07; PLP1, r = -0.311, p = 9.63E-10; RELN, r = -0.382, p = 3.06e-14; TRIB3, r = -0.138, p = 8.14e-03). The statistical significance and Spearman’s correlation coefficients for the relationships between RENL, PLP1, TRIBE, CDH19, and immune cell infiltration signatures are shown in supplemental online Table 1 and Table 2. The immune cell signatures included CD8 + T cells, T cells (general), B cells, monocytes, tumour-associated macrophages (TAMs), M1 macrophages, M2 macrophages, macrophages, neutrophils, natural killer cells, dendritic cells, Th1, Th2, Tfh, Th17, Treg, and T cell exhaustion (*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001). Furthermore, the TISIDB database 17 was used to explore the value of hub genes in bladder cancer immunotherapy. We divided bladder cancer into six groups (wound healing, IFN-gamma-dominant, inflammatory, lymphocyte-depleted, immunologically quiet, and TGF-β-dominant) according to the classification described by Thorsson et al. 18. We found that TRIB3 expression was highest in the IFN-gamma-dominant subtype, CDH19 expression was highest in wound healing, and both PLP1 and RELN levels were high in the inflammatory type (supplemental online Fig. 8). Additionally, using thresholds of |r| > 0.5 and p < 0.05, we found that both PLP1 and RELN were significantly associated with CXCL12 (sup-9A, PLP1, r = 0.542, p < 2.2e-16; RELN, r = 0.513, p < 2.2e-16), an immunoinhibitory factor. In addition, RELN was also strongly related to ENTPD1 (supplemental online Fig. 9B, r = 0.497, p < 2.2e-16). The relationships between PLP1, RENL, ENTPD1, and CXCL12 were verified using the starBase database (supplemental online Fig. 9D,E,F), which yielded consistent results.