Differentially expressed m6A regulators in DLBCL
The expression of the 22 m6A regulators in 48 DLBCL and 337 normal controls were compared. As shown in Figure 2A, the heatmap displayed the expression distribution of the 22 m6A regulators. And the violin plot in Figure 2B visualized the expression difference of each m6A regulator. In summary, 19 out of 22 regulators were differentially expressed, among which, 10 were up-regulated (EIF3A, HNRNPA2B1, YTHDC2, ZC3H13, ALKBH5, RBM15, METTL5, YTHDF1, RBM15B, YTHDF2) and 9 were down-regulated (METTL3, FTO, FMR1, YTHDC1, HNRNPC, WTAP, YTHDF3, IGF2BP2, KIAA1429) in DLBCL. In Figure 2C, the Spearman correlation analysis was achieved for the 19 differentially expressed m6A regulators. We can observe that the correlations among the m6A regulators were complicated. For example, the “Writers” can be positively correlated such as METTL3 and RMB15B (coef=0.46), or negatively correlated such as METTL3 and KIAA1429 (coef=-0.32). The two “Erases” (ALKBH5 and FTO) were positively associated with each other (coef=0.58). And the “Readers” can also be positively related like YTHDF3 and YTHDC1 (coef=0.44), or negatively related such as YTHDF3 and HNRNPA2B1 (coef=-0.51). Similarly, a “Writer” can be positively related to a “Reader” such as METTL3 and HNRNPA2B1 (coef=0.68), or negatively related to another “Reader” such as METTL3 and YTHDF3 (coef=-0.61).
m6A-sorted DLBCL clusters and clinical implication
As we can observe in Figure 3A-3C and Figure S1, when clustering the samples using a specified cluster count (k) from 2 to 9, only k=2 showed the best performance to distinguish the DLBCL samples by the differentially expressed m6A regulators. In Figure 3D-3E, the heatmap showed that the samples in cluster 1 were highly correlated, so were those in cluster 2. There was barely any correlation significant between cluster 1 and cluster 2, which indicated that m6A regulators were capable to differentiate the DLBCL samples into molecularly distinguishable subtypes. The finding was validated using a PCA analysis, which was displayed in Figure 3F. The samples in cluster 1 or cluster 2 were highly clustered, which further proved that m6A-sorted DLBCL clusters are reliable. The clinical significance of the two m6A-sorted DLBCL clusters was also investigated and demonstrated in Figure 4. Figure 4A displayed the expression heatmap of the differentially expressed m6A regulators in m6A sorted clusters and their correlation with DLBCL clinical parameters. The gender, age, ECOG performance status, number of extranodal sites, and survival status showed statistical differences in cluster 1 and cluster 2 (p<0.05). And in Figure 4B, the survival of DLBCL patients in cluster 1 and cluster 2 also showed statistical differences. The overall survival rate of DLBCL patients in cluster 2 outperformed than those in cluster 1 (p<0.001).
m6A signature and clinical implication
Univariate Cox regression algorithm was carried out for the 19 differentially expressed m6A regulators. Six out of 19 m6A regulators (ALKBH5, FMR1, HNRNPC, RBM15B, YTHDC2, and YTHDF1) show statistically prognostic value in DLBCL patients (Figure 5A). Through further Lasso regression, all the six prognostic m6A regulators were determined for prognostic signature construction (Figure 5B-5C). The calculated risk score curve and survival information of the patients in the training set were displayed in Figure 5D. The K-M survival curve suggested the patients with m6A low-risk have higher overall survival than those in m6A high-risk group in the training set (Figure 5E). ROC curve revealed the AUC at 1-, 2-, and 5-year was 0.674, 0.699, and 0.691 which suggested a certain predictive capacity of the m6A signature in the training set.
The m6A prognostic signature in the training set was validated using a testing set and a combined data of the training set and testing set as a whole. In Figure 6A, the risk score curve and survival information of patients in the testing set was visualized. Although there was no statistical significance when conducting a K-M survival analysis for the samples in the testing set (p=0.081), it still showed an obvious trend that patients with m6A low-risk have better survival (Figure 6B-6C). The AUC of the 1-, 2-, and 5-year ROC curve achieved 0.552, 0.584, and 0.611 in the testing samples. Figure 6D displayed the calculated risk score and survival information of the combined samples of the training set and testing set. A statistically significant difference was observed in the K-M survival curve analysis for the combined samples (p<0.001). Patients with m6A low-risk showed better survival than those in m6A high-risk group (Figure 6E). Figure 6F displayed the predictive efficacy of the m6A signature, which yielded an AUC for 1-, 2-, and 5-year of 0.605, 0.640, and 0.652, respectively.
Whether m6A signature is an independent factor in DLBCL was investigated. Displaying in Figure 7A, the univariate Cox regression revealed that m6A signature (HR=1.994, p<0.001), age (HR=1.768, p=0.001), molecular subtype (HR=2.207, p<0.001), ECOG performance status (HR=2.660, p<0.001), stage (HR=1.890, p<0.001), and LDH ratio (HR=2.641, p<0.001) were all statistically significant prognostic predictors. And multivariate Cox regression analysis displayed in Figure 7B further verified that m6A signature (HR=1.994, p<0.001), age (HR=1.574, p=0.017), molecular subtype (HR=1.888, p=0.003), ECOG performance status (HR=1.981, p=0.001), stage (HR=1.719, p=0.007), and LDH ratio (HR=1.877, p=0.003) were all independent prognostic predictors in DLBCL.
Estimation of immune cell infiltration
The immune cell infiltration in m6A low/high-risk groups was explored. The bar plot in Figure 8A visualized the proportion of the 22 immune cells in each DLBCL sample on the whole. And Figure 8B also displayed the 22 immune cells’ distribution in m6A low/high-risk groups as a heatmap. The immune cell infiltration difference in m6A low-risk vs. high-risk groups was compared by statistics. As shown in Figure 9A, ten out of 22 immune cells showed a statistical difference between m6A low-risk and high-risk groups. The B cell naïve, NK cells resting, NK cells activated, and Mast cells activated were significantly up-regulated in m6A high-risk group. The Plasma cells, T cells CD4 memory activated, T cells gamma delta, Dendritic cells resting, Mast cells resting, and Eosinophils were significantly down-regulated in m6A high-risk group. The correlations among the 10 differentially distributed immune cells were depicted in Figure 9B. We can notice that T cells CD4 memory activated was positively correlated to T cells gamma delta (coef=0.41), while negatively correlated to B cell naïve (coef=-0.35). The T cells gamma delta was negatively related to B cell naïve (coef=-0.44) and NK cells resting (coef=-0.49).
Immune checkpoint is a critical mechanism of tumor immune escape, which makes it a prospective target for cancer immunotherapy. Therefore, the expression of nine well-studied immune checkpoint genes in low/high-risk groups was analyzed and displayed in Figure 9C. We found that PDCD1 (P=0.045) and KIR3DL1 (P=0.012) were significantly increased in m6A high-risk group, while TIGIT (p<0.001), IDO1 (p=0.016), and BTLA (p=0.008) was significantly decreased. Regards LAG3, CTLA4, VISTA, and HAVCR2, no statistical significance was observed.
Enriched pathways in m6A low/high-risk DLBCL
To reveal the potential biological function difference between the m6A low-risk and high-risk groups, GSEA analysis was carried out. The top 10 significantly enriched terms in the m6A low-risk group and high-risk group were selected for displaying, respectively. As we can see in Figure 10, the top 10 significantly enriched terms in m6A high-risk group included Telomerase holoenzyme complex, Mitotic G2-M transition checkpoint, Blastocyst formation, Regulation of transcription by RNA polymerase III, Saga type complex, Base excision repair, ATPase complex, DNA replication checkpoint, Histone H4 acetylation, INO80 type complex. And GDP binding, Double stranded RNA binding, Positive regulation of endothelial cell apoptotic process, Nitric oxide synthase biosynthetic process, Barbed end actin filament capping, Positive regulation of viral genome replication, Regulation of defense response to virus, Positive regulation of interferon Alpha production, Ubiquitin dependent ERAD pathway, and Endoplasmic reticulum tubular network organization were the top 10 significantly enriched items in m6A low-risk group (Figure 11).