1. Baseline characteristics of patients
The clinical information of 1083 patients were downloaded from the TCGA database, see Table 1 for details. The expression of RGCC was significantly correlated with Age (P = 0.004), T stage (P = 0.003), Histological type (P < 0.001), PAM50 (P < 0.001), Anatomic neoplasm subdivisions (P = 0.048), radiation therapy (P = 0.035), OS event (P = 0.028). In addition, we also proved by univariate logistic regression analyses that there are clinicopathological differences between the RGCC high expression group and the low expression group, including age (p = 0.003), Histological type (P < 0.001), HER2 status (P = 0.049), PAM50 (P < 0.001), radiation therapy (P = 0.030), Anatomic neoplasm subdivisions (P = 0.042) (Table 2).
Table 1
Clinicopathological characteristics of high- and low-RGCC expression groups.
Characteristic
|
Low expression of RGCC
|
High expression of RGCC
|
p
|
n
|
541
|
542
|
|
Age, n (%)
|
|
|
0.004
|
<=60
|
276 (25.5%)
|
325 (30%)
|
|
> 60
|
265 (24.5%)
|
217 (20%)
|
|
T stage, n (%)
|
|
|
0.003
|
T1
|
112 (10.4%)
|
165 (15.3%)
|
|
T2
|
333 (30.8%)
|
296 (27.4%)
|
|
T3
|
73 (6.8%)
|
66 (6.1%)
|
|
T4
|
21 (1.9%)
|
14 (1.3%)
|
|
N stage, n (%)
|
|
|
0.292
|
N0
|
246 (23.1%)
|
268 (25.2%)
|
|
N1
|
187 (17.6%)
|
171 (16.1%)
|
|
N2
|
61 (5.7%)
|
55 (5.2%)
|
|
N3
|
32 (3%)
|
44 (4.1%)
|
|
M stage, n (%)
|
|
|
0.914
|
M0
|
462 (50.1%)
|
440 (47.7%)
|
|
M1
|
11 (1.2%)
|
9 (1%)
|
|
Pathologic stage, n (%)
|
|
|
0.076
|
Stage I
|
74 (7%)
|
107 (10.1%)
|
|
Stage II
|
318 (30%)
|
301 (28.4%)
|
|
Stage III
|
126 (11.9%)
|
116 (10.9%)
|
|
Stage IV
|
9 (0.8%)
|
9 (0.8%)
|
|
Histological type, n (%)
|
|
|
< 0.001
|
Infiltrating Ductal Carcinoma
|
420 (43%)
|
352 (36%)
|
|
Infiltrating Lobular Carcinoma
|
60 (6.1%)
|
145 (14.8%)
|
|
PR status, n (%)
|
|
|
0.567
|
Negative
|
178 (17.2%)
|
164 (15.9%)
|
|
Indeterminate
|
2 (0.2%)
|
2 (0.2%)
|
|
Positive
|
335 (32.4%)
|
353 (34.1%)
|
|
ER status, n (%)
|
|
|
0.157
|
Negative
|
111 (10.7%)
|
129 (12.5%)
|
|
Indeterminate
|
2 (0.2%)
|
0 (0%)
|
|
Positive
|
403 (38.9%)
|
390 (37.7%)
|
|
HER2 status, n (%)
|
|
|
0.116
|
Negative
|
263 (36.2%)
|
295 (40.6%)
|
|
Indeterminate
|
7 (1%)
|
5 (0.7%)
|
|
Positive
|
88 (12.1%)
|
69 (9.5%)
|
|
PAM50, n (%)
|
|
|
< 0.001
|
Normal
|
9 (0.8%)
|
31 (2.9%)
|
|
LumA
|
235 (21.7%)
|
327 (30.2%)
|
|
LumB
|
161 (14.9%)
|
43 (4%)
|
|
Her2
|
55 (5.1%)
|
27 (2.5%)
|
|
Basal
|
81 (7.5%)
|
114 (10.5%)
|
|
Menopause status, n (%)
|
|
|
0.301
|
Pre
|
111 (11.4%)
|
118 (12.1%)
|
|
Peri
|
16 (1.6%)
|
24 (2.5%)
|
|
Post
|
362 (37.2%)
|
341 (35.1%)
|
|
Anatomic neoplasm subdivisions, n (%)
|
|
|
0.048
|
Left
|
298 (27.5%)
|
265 (24.5%)
|
|
Right
|
243 (22.4%)
|
277 (25.6%)
|
|
Radiation therapy, n (%)
|
|
|
0.035
|
No
|
228 (23.1%)
|
206 (20.9%)
|
|
Yes
|
252 (25.5%)
|
301 (30.5%)
|
|
OS event, n (%)
|
|
|
0.028
|
Alive
|
452 (41.7%)
|
479 (44.2%)
|
|
Dead
|
89 (8.2%)
|
63 (5.8%)
|
|
DSS event, n (%)
|
|
|
0.156
|
Alive
|
479 (45.1%)
|
499 (46.9%)
|
|
Dead
|
49 (4.6%)
|
36 (3.4%)
|
|
PFI event, n (%)
|
|
|
0.210
|
Alive
|
460 (42.5%)
|
476 (44%)
|
|
Dead
|
81 (7.5%)
|
66 (6.1%)
|
|
Table 2
Associations of RGCC expression with clinicopathological characteristics of patients (n = 1083)
Characteristics
|
Total(N)
|
Odds Ratio(OR)
|
P value
|
T stage (T3&T4 vs. T1&T2)
|
1,080
|
0.822 (0.593–1.137)
|
0.236
|
N stage (N1&N2&N3 vs. N0)
|
1,064
|
0.885 (0.696–1.126)
|
0.320
|
M stage (M1 vs. M0)
|
922
|
0.859 (0.343–2.095)
|
0.738
|
Pathologic stage (Stage III&Stage IV vs. Stage I&Stage II)
|
1,060
|
0.890 (0.672–1.177)
|
0.413
|
Age (> 60 vs. <=60)
|
1,083
|
0.695 (0.546–0.884)
|
0.003
|
Histological type (Infiltrating Lobular Carcinoma vs. Infiltrating Ductal Carcinoma)
|
977
|
2.884 (2.078–4.044)
|
< 0.001
|
PR status (Positive vs. Negative)
|
1,030
|
1.144 (0.882–1.483)
|
0.311
|
ER status (Positive vs. Negative)
|
1,033
|
0.833 (0.623–1.112)
|
0.215
|
HER2 status (Positive vs. Negative)
|
715
|
0.699 (0.488–0.997)
|
0.049
|
PAM50 (LumA&LumB&Her2&Basal vs. Normal)
|
1,083
|
0.279 (0.124–0.568)
|
< 0.001
|
radiation_therapy (Yes vs. No)
|
987
|
1.322 (1.028–1.702)
|
0.030
|
Anatomic neoplasm subdivisions (Right vs. Left)
|
1,083
|
1.282 (1.010–1.628)
|
0.042
|
Menopause status (Post vs. Pre&Peri)
|
972
|
0.842 (0.635–1.116)
|
0.233
|
2. Low expression of RGCC in BRCA
A total of 33 cancer types were included in the pan-cancer analysis. The results showed that there were significant differences in the expression of RGCC in 23 tumors, of which 12 were up-regulated in tumor tissues and 11 were down-regulated. (Fig. 2A). RGCC is significantly lower expressed in breast cancer tissues compared to normal tissues. (p < 0.001) (Fig. 2B). Meanwhile, the expression of RGCC in 112 pairs of breast cancer tissues was significantly lower than that in adjacent tissues.(p < 0.001) (Fig. 2C). Then we studied the expression of RGCC protein in BRCA in the UALCAN database and compared it with normal breast tissue, and found that the expression level of RGCC protein in BRCA was significantly lower than that in normal breast tissue (p < 0.001) (Fig. 2D) .The above results are verified in GSE35525 (p < 0.001) (Fig. 2E). In addition, IHC staining data from the HPA database indicated that RGCC was moderately expressed in normal breast tissue and lowly expressed in BRCA tissue. (Fig. 2F、G).
3. The correlation between RGCC expression and immune infiltration.
Expression of RGCC was correlated with iDC (Spearman R = 0.424, p < 0.001), Macrophages (Spearman R = 0.404, p < 0.001), Neutrophils (Spearman R = 0.399, p < 0.001), CD8 T cells (Spearman R = 0.384, p < 0.001), and Th1 cells (Spearman R = 0.378, p < 0.001) were significantly positively correlated with immune cell infiltration levels. (Fig. 3A). In addition, the enrichment scores of iDC, Macrophages, Neutrophils, CD8 T cells and Th1 cells in the RGCC high expression group were significantly higher than those in the RGCC low expression group (p < 0.001) (Fig. 3B-K).
4. Correlation between RGCC expression and clinical features.
RGCC mRNA expression was significantly correlated with Age (Fig. 4A), Histological type (Fig. 4B), T stage (Fig. 4C), HER2 status (Fig. 4D) and PAM50 (Fig. 4E). However, no statistically significant correlations were found between RGCC expression levels and N stage (Fig. 4F), PR status (Fig. 4G), ER status (Fig. 4H) and M stage (Fig. 4I). The above results are consistent with the results of univariate logistic regression analyses.
5. Diagnostic value of RGCC expression in BRCA
In order to evaluate the diagnostic value of RGCC, we constructed receiver operating characteristic (ROC) curves, the area under the curve (AUC) was 0.934, which proved that it has high diagnostic value, as shown in Fig. 5A. Next, we constructed the ROC curve for each clinical subgroup, and the result was AUC values of 0.934 for stage I/II (Fig. 5B), 0.952 for stage III/IV (Fig. 5C), 0.942 for M0 (Fig. 5D), 0.937 for T1/T2 (Fig. 5E), 0.948 for T3/T4 (Fig. 5F), 0.930 for N0 (Fig. 5G) and 0.946 for N1-N3 (Fig. 5H), which proved that All have high diagnostic value.
6. Effect of RGCC expression on prognosis in breast cancer patients
Kaplan-Meier survival analysis was used to evaluate the Correlation between RGCC expression and prognosis of breast cancer patients. The results demonstrated that the low RGCC expression group exhibited a worse prognosis than the high RGCC expression Group as OS [HR = 0.60 (0.43–0.83), P = 0.002] and DSS [HR = 0.61 (0.40–0.94), P = 0.026]. (Fig. 6A、B). Univariate Cox analysis indicated that a low RGCC expression was significantly correlated with poor OS (HR = 0.597, 95% CI = 0.432–0.825, p = 0.002) and the different cancer stage categories: T stage(T3&T4 vs. T1&T2, P = 0.012), N stage (N1&N2&N3 vs. N0, P < 0.001), M stage (M1 vs. M0, P < 0.001), Pathologic stage (Stage III&Stage IV vs. Stage I&Stage II, P < 0.001), Age (> 60 vs. <=60, P < 0.001) and Menopause status (Post vs. Pre&Peri, P < 0.001). (Fig. 6C; Table 3). These data suggest that the expression of RGCC affects the prognosis of breast cancer patients with different pathological stages.
Table 3
Univariate and multivariate Cox regression analyses of the clinical characteristics associated with overall survival in BRCA in The Cancer Genome Atlas (TCGA).
Characteristics
|
Total(N)
|
Univariate analysis
|
Multivariate analysis
|
Hazard ratio (95% CI)
|
P value
|
Hazard ratio (95% CI)
|
P value
|
T stage (T3&T4 vs. T1&T2)
|
1079
|
1.608 (1.110–2.329)
|
0.012
|
1.944 (0.861–4.386)
|
0.109
|
N stage (N1&N2&N3 vs. N0)
|
1063
|
2.239 (1.567–3.199)
|
< 0.001
|
1.030 (0.400–2.650)
|
0.951
|
M stage (M1 vs. M0)
|
922
|
4.254 (2.468–7.334)
|
< 0.001
|
1.939 (0.510–7.377)
|
0.331
|
RGCC (High vs. Low)
|
1082
|
0.597 (0.432–0.825)
|
0.002
|
1.175 (0.601–2.296)
|
0.638
|
Pathologic stage (Stage III&Stage IV vs. Stage I&Stage II)
|
1059
|
2.391 (1.703–3.355)
|
< 0.001
|
4.351 (1.501–12.613)
|
0.007
|
Age (> 60 vs. <=60)
|
1082
|
2.020 (1.465–2.784)
|
< 0.001
|
2.606 (1.253–5.417)
|
0.010
|
Histological type (Infiltrating Lobular Carcinoma vs. Infiltrating Ductal Carcinoma)
|
977
|
0.827 (0.526–1.299)
|
0.410
|
|
|
PR status (Positive vs. Negative)
|
1029
|
0.732 (0.523–1.024)
|
0.068
|
0.686 (0.281–1.674)
|
0.407
|
ER status (Positive vs. Negative)
|
1032
|
0.712 (0.495–1.023)
|
0.066
|
0.486 (0.184–1.283)
|
0.145
|
HER2 status (Positive vs. Negative)
|
715
|
1.593 (0.973–2.609)
|
0.064
|
0.710 (0.314–1.605)
|
0.410
|
PAM50 (LumA&LumB&Her2&Basal vs. Normal)
|
1082
|
0.830 (0.388–1.773)
|
0.630
|
|
|
Menopause status (Post vs. Pre&Peri)
|
971
|
2.348 (1.428–3.860)
|
< 0.001
|
2.677 (1.008–7.113)
|
0.048
|
Anatomic neoplasm subdivisions (Right vs. Left)
|
1082
|
0.766 (0.554–1.057)
|
0.105
|
|
|
radiation_therapy (Yes vs. No)
|
986
|
0.576 (0.394–0.841)
|
0.004
|
0.470 (0.240–0.922)
|
0.028
|
7. RGCC expression and chemokines and/or chemokine receptors.
To investigate the role of ROCC in the immune microenvironment of breast cancer, the correlation between the expression levels of RGCC and immune cell chemokines and chemokine receptors was analyzed in Pan-cancer via TISIDB. The results indicated that several chemokines and chemokine receptors were significantly associated with the expression of RGCC in BRCA. (Fig. 7A、B). The above results suggest that RGCC may be related to tumor immunity. To further elucidate the relationship between RGCC and immune cell migration, we analyzed the correlation between RGCC expression and chemokine/chemokine receptors. Scatter plot results displaying that RGCC expression was significantly positively correlated with CXCL12 (r = 0.409, p < 2.2e-16), CXCL2 (r = 0.343, p < 2.2e-16), CX3CL1 (r = 0.36, p < 2.2e-16), CCL11 (r = 0.382, p < 2.2e-16), CCL4 (r = 0.323, p < 2.2e-16), CCL2 (r = 0.326, p < 2.2e-16), CXCR4 (r = 0.336, p < 2.2e-16), and CCR10 (r = 0.314, p < 2.2e-16)(Fig. 7C-J).
8. RGCC expression and immunoinhibitors and immunostimulators.
We used the TISIDB database to analyzed the correlation between RGCC and expression of immunoinhibitors and immunostimulators in 30 human cancers. (Fig. 8A、B). The analysis results showed that RGCC was significantly positively correlated with the immunoinhibitors TGFB1 (r = 0.379, p < 2.2e-16) and the immunostimulators C10orf54 (r = 0.52, p < 2.2e-16), CD40 (r = 0.392, p < 2.2e-16), CXCL12 (r = 0.409, p < 2.2e-16), CXCR4 (r = 0.336, p < 2.2e-16), IL6 (r = 0.36, p < 2.2e-16), NT5E (r = 0.345, p < 2.2e-16), TNFRSF4 (r = 0.41, p < 2.2e-16), TNFRSF8 (r = 0.38, p < 2.2e-16) and TNFSF9 (r = 0.341, p < 2.2e-16). (Fig. 8C-L). In conclusion, RGCC may be associated with tumor immunity.
9. Construction and validation of a nomogram based on RGCC expression.
In order to evaluate the prognosis of breast cancer patients, we constructed a prognostic model based on OS as an independent factor according to the expression of RGCC and several other clinical factors. Multivariate analysis was performed by integrating RGCC, Pathologic stage and M stage in the nomogram. A higher the point in the figure, the worse the prognosis. (Fig. 9A). In addition to this, we constructed a calibration curve to evaluate the performance of the RGCC nomogram. The C index of this model is 0.681 (95% CI = 0.653–0.710), suggesting that it has a certain accuracy in predicting OS in breast cancer patients. (Fig. 9B-D).
10. Identification of hub genes and their prognostic value in BRCA.
First, the most important clusters in the PPI network were identified using STRING database (Fig. 11C). We used cytoHubba (ranked by degree) to find 5 hub genes in the above PPI network, which are HLA-DMA, HLA-DRB1, HLA-DPB1, CD74 and HRA-DRA respectively (Fig. 10A). Then, we used Kaplan-Meier Plotter to evaluate the prognostic value of hub gene, and the results showed that high expression of the above 5 bub genes was significantly correlated with better prognosis (Fig. 10B-F).
11. Co-expression Gene Analysis of RGCC in BRCA
In order to explore the biological function of RGCC in BRCA, we used the LinkFinder module of LnkedOmics website to analyze the co-expression of RGCC in BRCA in TCGA database. Set | Spearman cor | >0.35 and adjusted p-value < 0.05, 1352 RGCC related genes were screened. We displayed the top 50 co-expression genes positively or negatively correlated with RGCC expression in BRCA in the heatmap(Fig. 11A、B). A total of 1666 DEGs were acquired with the threshold values of |log2 fold-change (FC)| > 1.5 and adjusted p-value < 0.05, including 97 upregulated genes and 1569 downregulated genes (Fig. 11D).
Then, we performed GO term and KEGG pathway analyses for the above 100 genes, revealing that the primary biological process (BP) contained gland development, regulation of cell-cell adhesion, embryo implantation, and positive regulation of osteoblast proliferation. The cellular component (CC) was mainly enriched in collagen-containing extracellular matrix, extrinsic component of membrane, extrinsic component of plasma membrane and cavella. The molecular function (MF) was primarily involved in insulin-like growth factor binding (Fig. 12B).
In order to screen the genes associated with survival prognosis, 1352 genes with the highest RGCC correlation were crossed with 2691 up-regulated genes associated with survival in BRCA, and 105 genes associated with RGCC and BRCA survival were detected at the crossing points (Fig. 12A). Next, GO functional enrichment and KEGG pathway analysis were performed on these 105 genes, showing that the primary biological process (BP) contained regulation of lymphocyte activation, positive regulation of cell adhesion and positive regulation of cell-substrate adhesion. The cellular component (CC) was mainly enriched in MHC protein complex, MHC class II protein complex and integral component of lumenal side of endoplasmic reticulum membrane. The molecular function (MF) was primarily involved in MHC protein complex binding, MHC class II protein complex binding and peptide antigen binding. The KEGG pathway enrichment was mainly related to Hematopoietic cell lineage, Viral myocarditis, and Asthma (Fig. 12C).
12. The expression of RGCC was detected by scRNA sequence analysis
In this study, three single-cell data sets were introduced and scTIME Portal was used to analyze the expression of RGCC. Figure 13A is the histogram of RGCC expression in each immune cell in GSE114725, Fig. 13B is the UMAP of the subcell population identified by this data set, and Fig. 13C is the corresponding expression of RGCC in each subcell population. The above results suggest that RGCC is highly expressed in Macrophage. Next, we used the CancerSEA website to verify the potential function of RGCC in two single-cell datasets. In GSE75367, the expression of RGCC was significantly positively correlated with Inflammation, Angiogensis and Differentiation, and negatively correlated with DNA repair and DNA damage (Fig. 14A). In the dataset GSE86978, the expression of RGCC has positive correlation with Quiescence and Angiogensis, and negative correlation with Metastasis (Fig. 14B). In addition, T-SNE diagram showed the RGCC expression profile of BRCA at the single-cell level (Fig. 14C、D).