EPOR expression in LUAD patients
The mRNA expression of EPOR in LUAD tissues was analyzed by TIMER database for the first time. EPOR expression was higher in bladder urothelial carcinoma (BLCA), cholangiocarcinoma (CHOL), head and neck squamous cell carcinoma (HNSC), renal clear cell carcinoma (KIRC), gastric adenocarcinoma (STAD) and thyroid carcinoma (THCA) tissues than in the adjacent normal tissues, and lower in breast invasive carcinoma (BRCA) and lung squamous cell carcinoma (LUSC) tissues than in normal tissues (Fig. 1a). In the UALCAN database, EPOR expression was not significantly different in LUAD primary tumor tissues from normal lung tissues (Fig. 1b). However, in GEPIA, EPOR expression was lower in LUAD tissues than in normal lung tissues (Fig. 1c). Subsequently, EPOR expression in LUAD tissues and in adjacentnormal tissues was analyzed using data obtained directly from TCGA. The expression of EPOR in LUAD was significantly decreased (Fig. 1d). After sample matching, 57 paired LUAD tumor specimens and adjacent normal tissue specimens were obtained, and EPOR expression was significantly downregulated in LUAD tissues (Fig. 1e). In addition, the EPOR mRNA expression was further detected by Oncomine database, and the expression of EPOR in LUAD tissues of six cohorts was downregulated compared to normal lung tissues, including "Bhattacharjee Lung" and "Stearman Lung" and so on (Supplementary Fig. 1). These findings were based on mRNA levels, but protein levels seems closer to the real situation. Therefore, we further studied the expression of EPOR protein in LUAD tissues and found that the protein levels of EPOR in LUAD tissues was higher than that in paired normal lung tissues (t = -10.184, P < 0.001) (Fig. 2a, b).
Gene mutations affecting EPOR expression and the correlation of EPOR expression with ALK, EGFR and PD-L1 in LUAD
We were curious about the gene mutations affecting EPOR expression in LUAD, and to this end, explored which gene mutations in LUAD lead to downregulation/upregulation of EPOR expression. The muTarget database showed that mutations of DDX60L, LGR6, POTEB3, RIF1 and SOX5 genes resulted in the downregulation of EPOR expression (Fig. 3a), and mutations of C1orf168, DBX2, EIF5B, FNDC1, KIAA0430, LRRC16A, MGAT3, PTPRM, TPH2 and UNC80 genes resulted in the upregulation of EPOR expression (Fig. 3b). The presence of ALK gene rearrangements and EGFR mutations in LUAD, both of which are tumor driver genes with targeted drug therapy and are of great significance to LUAD patients. PD-L1 is often closely associated with immune escape in cancer, and PD-L1 is also one of the common immunotherapy targets in lung cancer. Based on our clinicopathological data of tissue microarrays containing FISH data of ALK and EGFR and IHC data of PD-L1, we first compared the expression of EPOR in the ALK/EGFR/PD-L1 groups with high and low expression using the TCGA database. The results showed that the expression level of EPOR was significantly higher in the ALK high expression group than in the ALK low expression group, and there was no significant difference in EGFR and PD-L1 groups between high and low expression.In our pathological data, no difference was found in the expression of EPOR between ALK/EGFR negative and positive groups and between PD-L1 high and low expression groups (Fig. 2c).
EPOR-interacting genes and proteins, EPOR co-expressed genes and genetic alterations
The PPI network of EPOR was generated using the STRING database. There were 46 edges and 11 nodes, including EPO, JAK2 and STAT5B, etc (Fig. 4a). We also constructed correlations between EPOR genes and genes expressing these proteins. EPOR genes were more correlated with STAT5B and PTPN6 genes (R > 0.2), and was correlated with EPO, KIT and GRB2 (Fig. 4c). The EPOR-interacting genes network was constructed by GeneMANIA. EPOR interacted physically with EPO, CCDC150 and CFAP161, with the strongest interactions with EPO, and had weak genetic interaction with PCNA (Fig. 4b). The top 40 EPOR co-expressed genes were screened by GEPIA database and a single gene co-expression heat map was constructed (Fig. 4d). The three datasets, LUAD (TCGA, Nature 2014), LUSC (TCGA, Nature 2012) and NSCLC (TRACERx, NEJM&Nature 2017), were first analyzed in the cBioPortal database, and the results showed that the alteration frequencies of EPOR in LUSC and NSCLC were more than 2% and less than 1% in LUAD (Supplementary Fig. 2a). The EPOR gene alterations in the three LUAD datasets were further analyzed, and the average alteration frequency was only 0.7%, with “Deep Deletion” being the more common type (Supplementary Fig. 2b, d). Due to the lack of alteration data, the Kaplan-Meier plotter and log-rank test could only obtain the correlation between OS and EPOR altered/unaltered groups, and the result showed that the difference is not statistically significant (Supplementary Fig. 2c).
Correlation of EPOR expression with tumor purity, immune cells, SCNA and tumor microenvironment
In the TIMER database, we analyzed the correlation of EPOR expression with tumor purity and six infiltrating immune cells including B cells, CD8+ T cells, CD4+ T cells, macrophages, neutrophils, and dendritic cells (DCs).The results showed that EPOR expression levels were not significantly correlated with tumor purity and infiltration of macrophages, neutrophils, DCs, negatively correlated with infiltration of CD8+ T cells, and positively correlated with infiltration of B cells and CD4+ T cells, where the correlation coefficientwith CD4+ T cells was close to 0.3 (Fig. 5a).We further analyzed the correlation of EPOR expression with gene markers of immune cells in the TIMER database.The results after adjusting for tumor purity showed that EPOR expression correlated with some gene markers of B cells, M1 macrophages, neutrophils, natural killer (NK) cells, DCs, type 1 T - helper (Th1) cells, type 2 T - helper (Th2) cells, type 17 T - helper (Th17) cells, follicular helper T (Tfh) cells, regulatory T (Treg) cells, effector memory T (Tem) cells and natural killer T (NKT) cells. Among them, the correlation coefficientwith CD19 (B cell), CD11c (DC), STAT6 (Th2), BCL6 (Tfh) and TGFB1 (Treg) was more than 0.2, but the overall showed a weak correlation between EPOR expression and immune cell markers in LUAD (Table 1). The results of the SCNA module showed that significant differences in infiltration levels of B cells, CD4+ T cells, macrophages, neutrophils and DCs at the "Arm-level Deletion" somatic copy number state of EPOR in LUAD compared to normal levels,in addition, the infiltration levels of macrophages in the "Arm-level Gain" somatic copy number state of EPOR was significantly different compared to the normal level (Fig. 5b). In addition, we further investigated the correlation between EPOR expression and tumor microenvironment (TME) in LUAD. TME can be evaluated by ImmuneScore, StromalScore and ESTIMATEScore, which indicate tumor immune cell infiltration, presence of tumor tissue mesenchyme and tumor purity, respectively. The results showed no significant correlation between EPOR mRNA expression levels and the three scores ,with |R| ≤ 0.2 (Fig. 5c).
Table 1
Correlation analysis between EPOR and gene markers of immune cells in TIMER
Description
|
Gene markers
|
|
|
LUAD
|
|
|
|
|
None
|
|
Purity
|
|
|
Cor
|
P
|
|
Cor
|
P
|
B cell
T cell (general)
CD8+ T cell
Monocyte
TAM
M1
M2
Neutrophil
NK
DC
Th1
Th2
Th17
Tfh
Treg
Tem
NKT
|
CD19
CD79A
CD3D
CD3E
CD2
CD8A
CD8B
CD86
CD115(CSF1R)
CCL2
CD68
IL10
IRF5
COX2(PTGS2)
INOS(NOS2)
CD163
VSIG4
MS4A4A
CC16b(CEACAMB)
CC11b(ITGAM)
CCR7
KIR2DL1
KIR2DL3
KIR2DL4
KIR3DL1
KIR3DL2
KIR3DL3
KIR2DS4
HLA-DPB1
HLA-DQB1
HLA-DRA
HLA-DPA1
BDCA-1(CD1C)
BDCA-4(NRP1)
CD11c(ITGAX)
TBX21
STAT1
STAT4
IFN-γ(IFNG)
TNF-α(TNF)
STAT5A
STAT6
GATA3
IL13
STAT3
IL17A
BCL6
IL21
FOXP3
CCR8
TGFB1
DUSP4
GZMK
GZMA
ZBTB16
KLRB1
|
0.138
0.046
-0.079
-0.012
-0.025
-0.077
-0.094
-0.087
0.024
-0.088
-0.085
-0.05
0.153
0.041
0.061
-0.06
-0.078
-0.084
0.165
0.07
0.104
-0.019
-0.09
-0.142
0.023
-0.068
-0.047
-0.006
0.037
0.063
-0.062
-0.015
0.094
0.093
0.228
0.047
-0.105
0.075
-0.091
0.082
0.075
0.289
-0.022
0.116
0.107
-0.019
0.263
-0.03
0.042
-0.025
0.178
-0.07
-0.017
-0.121
0.121
-0.086
|
**
0.299
0.075
0.784
0.568
0.079
*
*
0.586
*
0.051
0.254
***
0.359
0.169
0.174
0.078
0.056
***
0.110
*
0.664
*
**
0.595
0.123
0.288
0.899
0.398
0.156
0.163
0.735
*
*
***
0.286
*
0.088
*
0.064
0.090
***
0.626
**
*
0.664
***
0.491
0.347
0.571
***
0.111
0.692
**
**
0.052
|
|
0.218
0.107
-0.043
0.045
0.026
-0.046
-0.073
-0.058
0.069
-0.067
-0.057
-0.02
0.185
0.033
0.063
-0.03
-0.059
-0.056
0.173
0.11
0.184
-0.011
-0.076
-0.138
0.039
-0.047
-0.035
0.002
0.086
0.097
-0.026
0.028
0.132
0.099
0.3
0.091
-0.099
0.113
-0.072
0.137
0.129
0.295
0.012
0.136
0.101
0.001
0.276
-0.012
0.091
0.014
0.217
-0.066
0.038
-0.1
0.134
-0.046
|
***
*
0.346
0.322
0.569
0.311
0.105
0.200
0.126
0.140
0.203
0.652
***
0.463
0.161
0.500
0.190
0.218
***
*
***
0.804
0.090
**
0.381
0.302
0.435
0.965
0.056
*
0.566
0.529
**
*
***
*
*
*
0.109
**
**
***
0.783
**
*
0.977
***
0.791
*
0.752
***
0.146
0.398
*
**
0.312
|
*P < 0.05, **P < 0.01, ***P < 0.001 |
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis and gene set enrichment analysis (GSEA) of DEGs
Data mining techniques in the TCGA database were used to screen the top 300 EPOR differentially expressed genes (DEGs) in LUAD. To explore the potential biological functions of DEGs, GO and KEGG enrichment analyses were performed. The results showed that MFs of DEGs were significantly enriched in passive transmembrane transporter activity, channel activity, substrate-specific channel activity, ion channel activity and sodium ion transmembrane transporter activity. In CCs, DEGs mainly distributed in neuronal cell body, presynapse, collagen-containing extracellular matix, neuron projection terminus and axon terminus. In BPs, signal release, response to metal ion, neurotransmitter transport, hormone metabolic process and amine transport dominated (Fig. 6a). And DEGs were enriched in neuroactive ligand-receptor interaction pathway and protein digestion and adsorption pathway (Fig. 6b). To make the enrichment pathways more comprehensive, we further performed gene set enrichment analysis (GSEA) on all DEGs. The results showed that they were significantly enriched in KEGG pathways such as calcium signaling pathway, pathways in cancer, cytokine-cytokine receptor interaction, MAPK signaling pathway and neuroactive ligand-receptor interaction (Fig. 6c),and significantly enriched in Wiki pathways such as focal adhesion PI3K/Akt/mTOR signaling pathway, sudden infant death syndrome-sids susceptibility pathways, MAPK signaling pathway, PI3K/Akt signaling pathway and nuclear receptors meta pathway (Fig. 6d).
Prognostic analysis of EPOR in LUAD patients
We first analyzed the prognosis of EPOR in LUAD patientsin KM Plotter database. In OS, 37986_at and 216999_at probes showed that patients with high EPOR mRNA expression in LUAD had a better prognosis, and 209962_at, 209963_s_at and 396_f_at probes showed that patients with high EPOR expression in LUAD had a worse prognosis (Fig. 7a). In PFS, the 37986_at, 209962_at and 209963_s_at probes showed that patients with high EPOR mRNA expression in LUAD had a better prognosis, and the 396_f_at probe showed that patients with high EPOR expression in LUAD had a worse prognosis (Fig. 7b). We then performed a further prognostic analysis in the PrognoScan database, which showed that high EPOR expression was associated with better OS in the GSE13213, GSE31210, jacob-00182-MSK and jacob-00182-UM LUAD datasets, and in the jacob-00182-CANDF dataset, high EPOR expression was associated with poorer OS (Fig. 7c).
Validation of EPOR prognostic value based on clinicopathological data of patients
To validate the prognostic value of EPOR expression in LUAD patients, we performed a prognostic analysis using detailed clinicopathological data from our 92 cases of lung adenocarcinoma tissue microarrays (HLugA180Su04). KM analysis showed poorer OS in the EPOR high expression group (29.5 vs 46 months) (Fig. 8a).To test the independence of EPOR as a prognostic factor, we performed the Cox risk regression analysis.Multivariate Cox risk regression analysis showed that T3-T4 stage, AJCC stage III/IV, high expression of EPOR and ALK (+) were associated with OS (Table 2). KM analysis and multivariate Cox analysis demonstrated better agreement on EPOR being a risk factor for OS in LUAD.Subsequently, based on the results of multivariate Cox risk regression analysis, we constructed a prediction model, and the nomogram more visually demonstrated the effect of EPOR on 5-year survival rate (C-index = 0.711) (Fig. 8b). The calibration curve showed good agreement between our results and predicted values in the 5-year survival rate (Fig. 8c).
Table 2
Univariate and multivariateanalyses of prognostic factors for OS
Characteristics
|
Total(N)
|
Univariate analysis
|
Multivariate analysis
|
HR (95% CI)
|
P value
|
HR (95% CI)
|
P value
|
Gender
|
58
|
|
|
|
|
Male
|
34
|
Reference
|
|
|
|
Female
|
24
|
0.733 (0.384-1.398)
|
0.345
|
|
|
Age
|
58
|
|
|
|
|
<65
|
34
|
Reference
|
|
|
|
≥65
|
24
|
1.773 (0.944-3.328)
|
0.075
|
1.984 (0.977-4.028)
|
0.058
|
T stage
|
58
|
|
|
|
|
T1-T2
|
41
|
Reference
|
|
|
|
T3-T4
|
17
|
1.985 (1.019-3.866)
|
0.044
|
0.256 (0.085-0.770)
|
0.015
|
N stage
|
58
|
|
|
|
|
N0
|
27
|
Reference
|
|
|
|
N1
|
13
|
1.470 (0.635-3.402)
|
0.369
|
0.390 (0.087-1.760)
|
0.221
|
N2
|
17
|
3.281 (1.542-6.978)
|
0.002
|
0.189 (0.028-1.262)
|
0.085
|
N3
|
1
|
4.357 (0.550-34.494)
|
0.163
|
0.425 (0.028-6.358)
|
0.535
|
M stage
|
58
|
|
|
|
|
M0
|
56
|
Reference
|
|
|
|
M1
|
2
|
1.742 (0.414-7.338)
|
0.449
|
|
|
AJCC stage
|
58
|
|
|
|
|
Ⅰ
|
21
|
Reference
|
|
|
|
Ⅱ
|
16
|
1.440 (0.598-3.467)
|
0.416
|
4.100 (0.869-19.342)
|
0.075
|
Ⅲ
|
19
|
4.283 (1.898-9.669)
|
<0.001
|
59.422 (5.978-590.716)
|
<0.001
|
Ⅳ
|
2
|
3.347 (0.711-15.765)
|
0.127
|
13.110 (2.149-79.995)
|
0.005
|
EPOR
|
58
|
|
|
|
|
Low
|
24
|
Reference
|
|
|
|
High
|
34
|
2.108 (1.076-4.128)
|
0.030
|
2.857 (1.202-6.792)
|
0.017
|
ALK
|
58
|
|
|
|
|
-
|
51
|
Reference
|
|
|
|
+
|
7
|
2.640 (1.092-6.384)
|
0.031
|
3.937 (1.372-11.299)
|
0.011
|
EGFR
|
58
|
|
|
|
|
-
|
43
|
Reference
|
|
|
|
+
|
15
|
1.229 (0.622-2.428)
|
0.553
|
|
|
PD-L1
|
58
|
|
|
|
|
Low
|
30
|
Reference
|
|
|
|
High
|
28
|
1.121 (0.598-2.102)
|
0.721
|
|
|
Bold text indicated P < 0.05
|