3.1 Macrophage exhibited the highest IRGs AUCell score in single-cell transcriptomic atlas of BRCA
The scRNA-seq data of 26 BRCA samples in GSE176078 was downloaded, after cell filtration, standardization, dimension reduction, and cell clustering analysis, 13 cell clusters were obtained (Fig. 1A). According to the expressions of classical maker genes in each cell cluster, 6 cell types were classified (Fig. 1B), including T cell (marked with CD2, CD3D, CD3E, and CD69), Epithelial cell (marked with EPCAM, KRT18, and KRT8), Macrophage (marked with CD68, CD163, AIF1, FCER1G, LYZ, and TYROBP), Endothelial cell (marked with PECAM1, PLVAP, and VWF), Fibroblast (marked with COL1A1, COL1A2, DCN, and LUM), and B cell (marked with CD79A, DERL3, JCHAIN, MZB1, and MS4A1) (Fig. 1C). Furthermore, the abundance of each cell type in 26 BRCA samples was compared, showing that T cell accounted for a high proportion in the samples (Fig. 1D). In addition, Macrophage exhibited the highest IRGs AUCell score among 6 cell types (Fig. 1E).
Fig. 1 The single-cell transcriptomic atlas of breast cancer (BRCA) in GSE176078 dataset. (A) UMAP plot of different cell clusters. (B) UMAP plot of different cell types. (C) Bubble plot of classical marker genes in each cell type. (D) The proportion of cell types in 26 BRCA samples. (E) AUCell enrichment scores of immune-related genes (IRGs) in different cell types.
3.2 Macrophage C1 highly expressed the marker genes of M2 macrophages
The Macrophage was re-clustered by UMAP clustering and separated into two subpopulations: C1 and C2 (Fig. 2A), which was marked as Macrophage C1 (CD163, FGL2, and MSR1) and Macrophage C2 (CIB1 and KYNU) (Fig. 2B). The IRGs AUCell score of Macrophage C1 was higher than that of Macrophage C2 (Fig. 2C). Moreover, Macrophage C1 highly expressed the marker genes of M2 macrophages such as CD163, FGL2, and MSR1 (Fig. 2D). Thus, Macrophage C1 was selected as M2 macrophages for further study. KEGG enrichment analysis indicated that the specific high expression genes of M2 macrophages were notably enriched in Phagosome, IL-17 signaling pathway, Chemokine signaling pathway, NK-kappa B signaling pathway, Antigen processing and presentation, TNF signaling pathway, Apoptosis, and so on (Fig. 2E).
Fig. 2 The heterogeneity of Macrophage. (A) UMAP plot of Macrophage. (B) The marker genes with high expression in Macrophage C1 and Macrophage C2. (C) AUCell scores of IRGs in Macrophage C1 and Macrophage C2. (D) Violin plot of expression levels of marker genes in M2 macrophages. (E) KEGG pathway enrichment analysis of marker genes in M2 macrophages.
3.3 903 genes highly relevant to M2-like TAMs were identified by WGCNA
The ssGSEA showed that the M2-like TAMs score of tumor group was noticeably lower than that of normal group in TCGA-BRCA dataset (Fig. 3A), manifesting that M2-like TAMs were dysregulated in the tumor samples. According to the Kaplan-Meier survival curve, the OS rate of BRCA patients in high M2-like TAMs score group was higher than that in low M2-like TAMs score group within 10 years, yet the OS rate after 10 years showed the opposite outcome (Fig. 3B). Further, WGCNA was applied to identify the key module correlated with M2-like TAMs of BRCA patients in TCGA-BRCA dataset. The optimal soft threshold (power) was determined to be 6 by pickSoftThreshold function (Fig. S1) and 9 modules were acquired (Fig. S2). The number of genes contained in each module was displayed by lollipop plot (Fig. 3C). The heatmap of module-trait relationships showed that the black module exhibited a remarkable positive correlation with M2-like TAMs score (cor:0.88, p:0.00) (Fig. 3D). Then, 903 genes highly relevant to M2-like TAMs were screened based on the |GS|>0.3 and |MM|>0.3 (Fig. 3E).
Fig. 3 Identification of M2-like tumor-associated macrophages (TAMs) related genes by WGCNA in TCGA-BRCA dataset. (A) M2-like TAMs scores in tumor and normal samples by ssGSEA. ** indicates p < 0.01. (B) OS survival curves between high and low M2-like TAMs score groups. (C) The number of genes included in each module identified by WGCNA. (D) The heatmap of module-trait relationships. (E) Scatter plot of module membership (MM) in black module versus gene significance (GS) for M2-like TAMs score.
3.4 RiskScore model based on 10 M2-like TAMs-related prognostic signatures showed robust performance
First, the prognosis genes were screened from the 903 M2-like TAMs related genes by univariate and LASSO Cox regression analysis, and 17 genes were selected for subsequent study with lambda=0.0172 (Fig. 4A, B). Next, 10 prognostic signatures associated with M2-like TAMs were obtained, including 6 “Protective” genes (ARHGAP26, RILP, KLRB1, CSTA, KLHDC7B, and PSMB8) and 4 “Risk” genes (KYNU, RNASE1, LONRF3, and TRPM2) (Fig. 4C). Then, the RiskScore model was constructed, namely “RiskScore=0.42*TRPM2+0.295*RNASE1-0.266*RILP-0.146*PSMB8+0.353*LONRF3+0.197*KYNU-0.252*KLRB1-0.153*KLHDC7B-0.166*CSTA-0.287*ARHGAP26”. Furthermore, high-risk group had lower OS rate and shorter time of survival than the low-risk group in TCGA-BRCA dataset (Fig. 4D). The area under ROC curve (AUC) analysis suggested that the Riskscore model was reliable with 1-year AUC of 0.73, 3-years AUC of 0.76, 5-years AUC of 0.72, and 7-years AUC of 0.73 (Fig. 4D). The robustness of RiskScore model was validated in the validation set GSE20685, and the results were similar to the TCGA-BRCA dataset (Fig. 4E).
Fig. 4 Establishment and validation of RiskScore model. (A) The coefficients of each independent variable changing with lambda. (B) The confidence interval under lambda. (C) The prognostic signatures of RiskScore model. (D) The ROC and OS survival curves of RiskScore model in TCGA-BRCA dataset. (E) The ROC and OS survival curves of RiskScore model in validation set GSE20685.
3.5 Nomogram exhibited good performance for predicting the prognosis of BRCA
The results of univariate and multivariate Cox regression analyses showed that stage, Age, and RiskScore were the independent prognostic factors (Fig. 5A, B). Then, a nomogram was developed by integrating stage, Age, and RiskScore (Fig. 5C). Further, the predictive calibration points for 1, 3, 5, and 7 years of the calibration curve were close to the standard curve (Fig. 5D), manifesting that the nomogram had strong predictive performance. The net benefit of nomogram was higher than the extreme curve by DCA curve (Fig. 5E), which demonstrated that the nomogram was dependable.
Fig. 5 Construction and validation of nomogram. (A) Univariate Cox regression analysis on RiskScore and clinicopathological features in TCGA-BRCA dataset. (B) Multivariate Cox regression analysis on RiskScore and clinicopathological features in TCGA-BRCA dataset. (C) Construction of nomogram by combining stage, Age, and RiskScore. (D) The calibration curve of nomogram. (E) The DCA curve of nomogram.
3.6 The BRCA patients with low RiskScore were more sensitive to chemotherapeutic drugs
The StromalScore, ImmuneScore, and ESTIMATEScore of high-risk group were considerably lower than the low-risk group in TCGA-BRCA dataset (Fig. 6A), suggesting that the decrease of immune scores may be related to the tumor progression and poor prognosis of patients. Moreover, the correlation between 10 immune cells and RiskScore was assessed by MCP-counter algorithm, founding that most immune cells exhibited a negative correlation with RiskScore (Fig. 6B). In addition, there was a remarkable correlation between RiskScore and IC50 of 8 chemotherapeutic drugs (FDR<0.05 and |cor|>0.3) (Fig. 6C), such as Ribociclib_1632 (R=0.34) (Fig. 6D). The IC50 of Ribociclib_1632 in high-risk group was higher compared to low-risk group (Fig. 6E), which conversely indicated that the BRCA patients with low RiskScore were more sensitive to Ribociclib_1632.
Fig. 6 Correlation analysis between RiskScore and TME or chemotherapeutic drugs in TCGA-BRCA dataset. (A) StromalScore, ImmuneScore, and ESTIMATEScore of high/low-risk groups assessed by ESTIMATE algorithm. (B) Correlation analysis between RiskScore and 10 immune cells infiltration calculated by MCP-counter algorithm. (C) Correlation analysis between RiskScore, key model genes and 8 chemotherapeutic drugs. (D) Correlation analysis between RiskScore and Ribociclib_1632. (E) IC50 values of Ribociclib_1632 between high- and low-risk groups. **** indicates p < 0.0001; *** indicates p < 0.001; ** indicates p < 0.01; * indicates p < 0.05.