1. Immune/stromal score were significantly associated with T pathological stage and PRCC subtypes
We downloaded 323 PRCC samples from the TCGA database. Gene expression patterns and clinical characteristics were included. Based on the ESTIMATE algorithm, immune score (-1952.04 to 3372.51) and stromal score (-1821.04 to 1469.55) of 291 samples were calculated. We attempted to analyze the relations between immune/stromal score and TNM, stage and subtype (Fig. 1A-D). Results showed the differences in the stromal score of T pathological stages were significant (T3 > T2 > T1, T4, P < 0.05). Since only two T4 samples are not representative, we considered that high T pathological grade correlated with high stromal score. And the average immune scores of type 2 were higher than type 1 (P < 0.05). Similarly, type 2 had higher stromal scores compared to type 1 (P < 0.05, Fig. 1E, F). Finally, we divided the samples into two groups on the basis of median immune/stromal score, but no significant differences were found in OS via Kaplan–Meier analysis (Fig. 1G, H).
2. Identification of DEGs with immune/stromal score
To determine whether all genes were associated with immune/stromal score, gene expression profile of 291 patients were analyzed. We mapped volcano plots with the cutoffs of |Log2FC|>1 and FDR < 0.05 (Fig. 2A, B). 983 upregulated and 6 downregulated genes were identified according to immune score. 1142 upregulated and 27 downregulated genes were extracted based on stromal score. Venn diagrams identified 763 upregulated common genes and 4 downregulated common genes (Fig. 2C, D, Supplementary Table S1). GO analysis showed upregulated genes were mostly associated with immune and inflammatory response, chemokine activity, IgG binding and cell adhesion molecule binding (Fig. 2E-G).
3. Association of upregulated common genes with OS
In order to obtain the relationships between each gene and OS, Kaplan–Meier was applied for survival analysis of 763 upregulated genes, and the impact of 120 genes (Supplementary Table S2) on OS was statistically significant.
4. Selection of molecular complexes from protein interaction networks
To study protein interactions of 120 genes, String tool was performed to structure a PPI network comprised of 84 nodes and 297 edges. Then we used MCODE to select prominent modules: module 1, composed of 17 hub genes (CXCR5, CD19, PDCD1, IL21R,GZMB༌TNFRSF9༌CD38༌CXCL10༌SELL, LAG3, CD44, CD80, CCL21, ITGA4, CCL19, IL6, and CXCL13, Fig. 3A) and module 2, including 5 hub genes (BLK, CD79A, FCRLA, MS4A1 and POU2AF, Fig. 3B), which were strongly associated with other genes, suggesting that they might play an important role in PRCC.
5. Functional enrichment analysis of valuable genes
To understand whether 22 genes have an impact on the TME, we performed KEGG analysis by DAVID, and the results showed that these genes mainly participated in the following pathways: cytokine-cytokine receptor interaction, chemokine signaling pathway and primary immunodeficiency (Fig. 3C).
6. Construction and validation of a survival model to predict prognosis of PRCC patients
323 PRCC samples from TCGA and 34 samples from GEO were used as train and test group. Univariate cox regression analysis has approved all of 22 genes were significantly with OS (Supplementary Table S3). Then, we applied multivariate cox regression analysis to evaluate the coefficients between 22 hub genes from train group and OS. 4 genes (CD79A, CXCL13, IL6 and CCL19) significantly with patients’ prognosis were confirmed (Supplementary Table S3). Risk score was calculated based on the formula (risk score= (-0.24020) *CD79A + 0.22031* CXCL13 + 0.12025* IL6 + 0.17997* CCL19). According to median risk score, 323 PRCC samples and 34 samples were divided into high and low risk group, Kaplan–Meier survival curves both indicated patients’ prognosis in high risk group were worse than that in low risk group (p = 0.00326, p = 0.02537, Fig. 4A, B). Receiver operating characteristic (ROC) analysis was performed, and area under curve (AUC) of train group and test group was 0.76 and 0.708 (Fig. 4C, D).