TMB scores and the immunoactivity-related gene sets are conjoined with OC’s overall survival
To answer if a high load of somatic mutation is associated with enhanced tumor immune cell infiltration and result in prolonged overall survival, datasets comprising RNAseq from 379 TCGA tumors and mutation annotation files from 436 TCGA tumors (272 which overlapped with RNAseq data) were obtained from the TCGA Data Portal in March 2019 (https://portal.gdc.cancer.gov/). We evaluated all OC cases with complete gene expression data and clinical information in TCGA. For mutation annotation files, we presented the Word Cloud of gene symbols identified in OC which the mutation sites > 5 based on the WordArt (https://wordart.com). We found that TP53 was the most significant (90% cases) mutated gene[22] (Figure S1A). We then screened the top 25 genes and employed the DISCOVER (Discrete Independence Statistic Controlling for Observations with Varying Event Rates)[23] algorithm to examine co-occurrence and mutual exclusion among these genes and found that TTN and LRP2 are significantly mutual exclusive (p < 0.001) (Figure S1B). In order to better visualized the mutually exclusive genes pairs, we selected 420 (96.33%) of 436 samples and showed in a waterfall plot that the most frequently mutated genes were TP53 and TTN (p < 0.05, FDR < 0.01) (Figure S1C). These analyses of OC samples’ mutation can be used to guide further TMB analysis.
Based on the somatic mutation data across all the OC cases in this study, TMB scores range from 0.0263 to 6.5260. Interestingly, the TMB scores of OC cases at Grade 3 & Grade 4 were significantly higher than those of Grade 1 & Grade 2 OC cases (Figure 2A, n=417, except for the blank values, p=0.048). And then based on the ssGSEA scores of immune cell types, we divided the immunoactivity-related gene sets into three immune infiltration clusters by unsupervised clustering, and some of which were defined as immunityhigh clusters (n=193) and immunitylow clusters (n=149) (Figure 2B). Besides, the tumor purity in immunityhigh group was significantly lower than that in immunitylow group validating the reliability of our classification (Figure S1D). Moreover, the PD-L1 expression level of the immunityhigh group is significantly higher than that of immunitylow group (Figure 2B). We further accessed if the TMB or immune infiltration profile could affect the overall survival. Kaplan-Meier survival curves showed that the overall survival time of TMBhigh or immunityhigh group was significantly longer than TMBlow or immunitylow group (Figure 2C 2.58 years vs. 1.92 years, p=0.0295 in the log-rank test. Figure 2D 2.50 years vs. 2.45 years, p=0.00082 in log-rank test). We then asked if there is any association between the TMB group and the immunity group. However, we did not see any correlation between these two factors across 379 OC cases (Figure 2E).
We further studied the relationship between the immunity group and the immune cell infiltration in OC. So We further studied the abundance of 22 different immune cells subpopulations in the high&low immunity group or the TMB group of cancers. However, only memory B cells and NK cells were increased in TMBhigh group while there are seven immune cell subpopulations were enriched in high immune scores group, that was most of these immune cells have a higher infiltration degree in immunityhigh group than in immunitylow group of OC (Figure S2A and 2B).
KEGG pathway and GO term analysis of DEGs associated with OC.
Next, as to reveal the correlation between the TMB, the immunoactivity-related gene sets, and global gene expression profiles, and wonder if individual genes that are upregulated in both TMBhigh and immunehigh groups are involved in anti-tumor related biological pathways. To this end, we overlapped TMBhigh gene expression signature (top 200 genes) with immunityhigh gene expression signature (427 genes) by Venn diagrams and finally obtained 14 differentially expressed genes (DEGs) that may be closely predicted the response to checkpoint blockade-based OC immunotherapy (Figure 3A). Heatmaps of the expression levels of 14 up-regulated DEGs in the TMB group and immunity group were shown in Figure S2C. Next, we conducted unsupervised hierarchical clustering of KEGG pathways related to Up-DEGs (Figure 3B) and found that most of these KEGG pathways are immune-related, such as “Cytokine cytokine receptor interaction”, “Chemokine signaling pathway”, as well as “Primary immunodeficiency”. Moreover, in order to summarize the potential function of the DEGs, we performed functional enrichment analysis of the 14 up-regulated genes between TMBhigh and immunityhigh group. GO terms and KEGG pathway analysis also demonstrate that the Up-DEGs function in the immune and inflammatory response, cytokine activities and T cell proliferation (Figure 3C and 3D). These results indicated that the gene sets shared by both TMBhigh and immunehigh groups have active immune functions.
Protein-protein interactions among genes associated with prognosis in OC
In order to understand the interactions between the identified DEGs, we performed a PPI networking study using the Search Tool for the Retrieval of Interacting Gene (STRING) tool. All 14 DEGs were uploaded to STRING as a whole Up-DEGs module (Figure 4) to construct PPI networks. A network consisting of 9 nodes and 10 edges were generated when the minimum required interaction score was set 1.5 and disconnected nodes were hidden in the network in the STRING website (Figure 4A). By counting the number of connections (Figure 4B), it shows that CXCL13 has the most key link with other members of the module in the network. Besides CXCL13, several key genes associated with T cell proliferation, inflammation, and immune response were located at the center of the module, such as PLA2G2D and FCRLA. These results indicated that PLA2G2D, MS4A1, ADAMDEC1, FCRLA, CXCL13 interact more with other molecules, suggesting that these genes are more likely to be key genes in regulating tumor restricting functions in OCs.
Correlation of expression of individual DEGs in overall survival
Since both TMBhigh group and immunityhigh group were correlated with better survival, we reasoned that there could be potential prognosis predicting biomarkers among the 14 shared DEGs defined above. To this end, we analyzed the association of individual DEGs with overall survival time. Among them, four genes, including PLA2G2D, CXCL13, FCRLA, and MS4A1, were validated to be significantly relevant to good survival and prognosis outcomes (log-rank test, p < 0.05) (Figure 5A-D). To validate our results, we downloaded and analyzed gene expression data from a different cohort from the GEO database. Meta-analysis further indicated that OC cases that have high expression of these four genes showed a significant survival benefit and all the four genes are independent predictors for the prognosis of OCs (Figure 5E-H). These results indicated that CXCL13, FCRLA, PLA2G2D, and MS4A1 could be a useful biomarkers for predicting responses to immunotherapy of OC.
Correlation of four genes expression with immune infiltration level in OC and pathway analysis.
Tumor purity was usually taken into account when obtaining immunotherapy expression markers from transcriptome data. Some studies have shown that tumor samples with low purity tend to have more immune cells and a higher mutation load, because inflammatory responses induced by immune cells can increase the mutation rate of tumor cells, and the effect of immunotherapy may be better[24, 25]. Therefore, to study if the four signature genes can predict tumor immune infiltration, we used TIMER (Tumor Immune Estimation Resource, https://cistrome.shinyapps.io/timer/), an online database, to analyze the infiltration of different immune cells in tumor tissues based on available RNA-seq data. Our results demonstrated that CXCL13, FCRLA, MS4A1, and PLA2G2D have a negative correlation with tumor purity in OC (Figure 6A-D). Besides CXCL13 expression showed a very weak relationship with B cell infiltration level in OC (Figures 6A) while FCRLA, MS4A1, PLA2G2D have no significant correlations with B cell infiltration level in OC (Figure 6B-D). Moreover, there were moderate to strong positive relationships between the expression levels of PLA2G2D, FCRLA, MS4A1, CXCL13, and infiltration level of CD4+ T cells, as well as have a prominent positive correlation between expression level and infiltration level of CD8+ T cells in OC, especially CXCL13, MS4A1, PLA2G2D, FCRLA (Figures 6A-D, Table S1). In conclusion, CXCL13, FCRLA, MS4A1, and PLA2G2D might have multiple and closely function in immune infiltration.
Next, we conducted gene set enrichment analysis (GSEA) to analyze the four genes and found that all of four genes were related to immune cell signaling pathways. They were all associated with B cell receptor, chemokine, cytokine-cytokine receptor interaction, natural killer cell-mediated cytotoxicity and T cell receptor signaling pathway (Figure 6E-H). Interestingly, CXCL13, FCRLA, and PLA2G2D were also associated with primary immunodeficiency (Figure 6E-H). MS4A1 was associated with autoimmune thyroid disease (Figure 6G). All of these findings revealed that CXCL13, FCRLA, MS4A1 and PLA2G2D’s expression was closely related to immune function were enriched for hallmarks of OC. More information can be got from Table S2. These results indicated that CXCL13, FCRLA, PLA2G2D, and MS4A1 were validated as prognostic biomarkers for the immunotherapy of OC. These results indicate that CXCL13, FCRLA, PLA2G2D, and MS4A1 are positively correlated with immune cell infiltration, and these genes are enriched to be associated with immune pathways by GSEA enrichment method.
Prognostic evaluation of four genes in melanoma patients treated with immunocheckpoint inhibitors.
Immune infiltration and active inflammations are prerequisites of anti-tumor immune reaction and better response to immunotherapy. We have shown that CXCL13, FCRLA, PLA2G2D, and MS4A1 are upregulated in the immunehigh group and involved in inflammation and T cell function. We then asked if these four genes can be used as biomarkers to predict responses to immune checkpoint blockade-based immunotherapy. Since the data of OC response to immunotherapy are not available, to address that, we explored an online database, TIDE (Tumor Immune Dysfunction and Exclusion, http://tide.dfci.harvard.edu/query/) and analyzed the data collected from melanoma. As shown in Figure 7, our results showed that among the melanoma patients treated with anti-PD-L1 or anti-CTLA4, the CXCL13, MS4A1, FCRLA, and PLA2G2D were positively correlated with the number of CTL infiltration. The two-sided t-test p values for correlations in CXCL13, MS4A1, FCRLA, PLA2G2D and mean are1.43×10-10, 8.12×10-2, 6.6×10-10 and 2.1×10-4, respectively. And the high expression levels of these genes were associated with the better response of melanoma patients.
Furthermore, we also investigated the protein expression of CXCL13, FCRLA, and MS4A1 in OC and normal ovarian tissues based on the IHC (immunohistochemistry) staining in The Human Protein Atlas (https://www.proteinatlas.org). We found that the staining intensity of CXCL13 and MS4A1 was slightly different in normal ovarian tissues and OC tissues, the protein expression levels in cancer were slightly lower than in adjacent tissues, but the difference was not significant. And the level of FCRLA protein was up-regulated in OC tissues compared to the normal tissues (Figure S3). There are no PLA2G2D IHC data available in the database currently. Considering the positive association between PLA2G2D mRNA level and prognosis data in OC, it is worth developing antibodies for future studies.