Pan-cancer and HCC SRD5A3 expression analysis
We first assessed SRD5A3 expression in pan-cancer data from TCGA and GTEx. The data revealed that SRD5A3 expression was significantly higher in 29 types of tumors than those corresponding normal tissues, including ACC, BLCA, BRCA, CESC, CHOL, COAD,DLBC, ESCA, GBM, KICH, KIRC, KIPP, LAML,LIHC,LGG, LUAD, LUSC, OV, PAAD, PRAD, READ, SARC, SKCM, STAD, TGCT, THCA, THYM, UCEC, UCS(all P<0.05). (Fig.1a). We further compared the expression of SRD5A3 in GTEX combined with TCGA normal samples and TCGA hepatocellular carcinoma samples, 50 paracancerous and 371 HCC samples from TCGA, 50 HCC samples and their corresponding paired paracancerous samples from TCGA. Finally, it was found that SRD5A3 was significantly overexpressed in HCC samples, and the results were statistically significant (P<0.001). (Fig.1b-d). In addition, the ROC curve with an area under the receiver operating characteristic curve value of 0.849 (95% CI 0.804 to 0.895, p< 0.001), which showed high diagnostic accuracy of SRD5A3 in HCC., as is shown in (Fig.1e).
Differentially expressed genes (DEGs) analysis
We performed a differentially expressed genes (DEGs) analysis using TCGA cohort data, patients with HCC were divided into the high expression group and the low expression group according to the expression level of SRD5A3. A total of 242 differentially expressed genes were found using screening, 184 were highly expressed and 58 were lowly expressed, as is shown in Fig.2a. We constructed a gene expression heatmap to get an overview for the top 10 genes with the greatest differences in expression in HCC (Fig.2c).
Association with SRD5A3 expression and clinicopathologic variables
As shown in Table 1, 371 primary tumors with both clinical and gene expression data were downloaded from TCGA data. There were 250 male and 121 female with median age 61 years in the cohort, of the 371 specimens. Correlation analysis showed that SRD5A3 protein expression was significantly associated with race (p = 0.001), T stage (p < 0.001), pathologic stage (p < 0.001), tumor status (p = 0.008), and residual tumor (p = 0.017). No correlation was found between SRD5A3 expression and other clinicopathologic features, detail information was showed in table 1. Univariate analyses using logistic regression revealed that up-regulation of SRD5A3 in HCC is significantly associated with race (p < 0.001), BMI (p = 0.049), pathologic stage (p < 0.001), tumor status (p = 0.006) and residual tumor (p = 0.016). (Table 2).
As shown in Fig 3, the higher expression of SRD5A3 correlated significantly with gender (p<0.01), tumor status (p<0.01), residual tumor (p<0.01), race(Asian vs. White, p<0.01), T stage (T1 vs. T2, p<0.001;T1 vs. T3, p<0.05 ), and pathologic tumor (stage Ⅰ vs. stage Ⅱ , p<0.01; stage Ⅰ vs. stage Ⅲ, p<0.01).
Table 1 Relationship between SRD5A3 expression and clinicopathological characteristics of patients [n (%)]
Characters
|
level
|
Low expression
of SRD5A3
|
High expression
of SRD5A3
|
P value
|
Gender (%)
|
Female
|
52(28.0%)
|
69(37.3%)
|
0.071‡
|
|
Male
|
134(72.0%)
|
116(62.7%)
|
|
Race (%)
|
Asian
|
96(53.6%)
|
62(34.4%)
|
0.001*, ‡
|
|
Black or African American
|
7(3.9%)
|
10(5.6%)
|
|
|
White
|
76(42.5%)
|
108(60.0%)
|
|
Age (median [IQR])
|
|
61.00[51.00,68.00]
|
61.00[52.00,69.00]
|
0.673§
|
Height (median [IQR])
|
|
168.00[161.00,174.00]
|
167.00[160.25,174.00]
|
0.610§
|
Weight (median [IQR])
|
|
68.00[59.50,79.00]
|
72.00[58.00,85.00]
|
0.344§
|
BMI (median [IQR])
|
|
24.01[21.90,27.54]
|
25.28[21.45,29.75]
|
0.296§
|
T stage (%)
|
T1
|
112(60.5%)
|
69(37.7%)
|
<0.001*, ɤ
|
|
T2
|
33(17.8%)
|
61(33.3%)
|
|
|
T3
|
34(18.4%)
|
46(25.1%)
|
|
|
T4
|
6(3.2%)
|
7(3.8%)
|
|
N stage (%)
|
N0
|
136(98.6%)
|
116(98.3%)
|
1.000 ɤ
|
|
N1
|
2(1.4%)
|
2(1.7%)
|
|
M stage (%)
|
M0
|
141(99.3%)
|
125(97.7%)
|
0.348 ɤ
|
|
M1
|
1(0.7%)
|
3(2.3%)
|
|
Pathologic stage (%)
|
Stage I
|
108(60.3%)
|
63(37.5%)
|
<0.001*, ɤ
|
|
Stage II
|
33(18.4%)
|
53(31.5%)
|
|
|
Stage III
|
36(20.1%)
|
49(29.2%)
|
|
|
Stage IV
|
2(1.1%)
|
3(1.8%)
|
|
Tumor status (%)
|
Tumor free
|
115(64.2%)
|
86(49.7%)
|
0.008*, ‡
|
|
With tumor
|
64(35.8%)
|
87(50.3%)
|
|
Residual tumor (%)
|
R0
|
173(97.7%)
|
151(91.5%)
|
0.017*, ɤ
|
|
R1
|
4(2.3%)
|
13(7.9%)
|
|
|
R2
|
0(0.0%)
|
1(0.6%)
|
|
Histologic grade (%)
|
G1
|
26(14.0%)
|
29(16.1%)
|
0.379 ɤ
|
|
G2
|
88(47.3%)
|
89(49.4%)
|
|
|
G3
|
63(33.9%)
|
59(32.8%)
|
|
|
G4
|
9(4.8%)
|
3(1.7%)
|
|
Child-Pugh grade (%)
|
A
|
125(93.3%)
|
92(87.6%)
|
0.107 ɤ
|
|
B
|
8(6.0%)
|
13(12.4%)
|
|
|
C
|
1(0.7%)
|
0(0.0%)
|
|
Fibrosis ishak score (%)
|
0
|
35(32.1%)
|
39(37.9%)
|
0.697‡
|
|
1/2
|
18(16.5%)
|
13(12.6%)
|
|
|
3/4
|
16(14.7%)
|
12(11.7%)
|
|
|
5/6
|
40(36.7%)
|
39(37.9%)
|
|
Adjacent hepatic tissue inflammation (%)
|
Mild
|
54(42.5%)
|
45(42.1%)
|
0.845 ɤ
|
|
None
|
62(48.8%)
|
55(51.4%)
|
|
|
Severe
|
11(8.7%)
|
7(6.5%)
|
|
Vascular invasion (%)
|
No
|
112(68.3%)
|
94(62.3%)
|
0.314‡
|
|
Yes
|
52(31.7%)
|
57(37.7%)
|
|
TP53 status (%)
|
Mut
|
53(29.1%)
|
49(27.8%)
|
0.880‡
|
|
WT
|
129(70.9%)
|
127(72.2%)
|
|
Total bilirubin(mg/dl) (median [IQR])
|
|
0.60[0.50,0.90]
|
0.70[0.50,1.02]
|
0.151§
|
Albumin(g/dl)
(median [IQR])
|
|
4.00[3.50,4.30]
|
4.00[3.30,4.40]
|
0.651§
|
AFP (ng/ml)
(median [IQR])
|
|
11.00[4.00,322.25]
|
22.50[5.00,231.50]
|
0.273§
|
Prothrombin time
(median [IQR])
|
|
1.10[1.00,8.70]
|
1.10[1.00,10.20]
|
0.457§
|
*Statistically significant; § Wilcoxon rank sum test; ɤ Fisher exact test; ‡ χ2 test
Table2 SRD5A3 expression associated with clinical pathological characteristics (logistic regression)
Characteristics
|
Total(N)
|
Odds Ratio (OR)
|
P value
|
Gender (Female vs. Male)
|
371
|
1.53(0.99-2.38)
|
0.056
|
Race (Asian vs. Black or African American &White)
|
359
|
0.45(0.30-0.69)
|
<0.001
|
Age (<=60 vs. >60)
|
370
|
0.90(0.60-1.35)
|
0.603
|
Height (< 170 vs. >=170)
|
339
|
1.05(0.68-1.61)
|
0.842
|
Weight (<=70 vs. >70)
|
344
|
0.68(0.44-1.04)
|
0.073
|
BMI (<=25 vs. >25)
|
335
|
0.65(0.42-1.00)
|
0.049
|
T stage (T1&T2 vs. T3&T4)
|
368
|
0.68(0.42-1.08)
|
0.106
|
N stage (N0 vs. N1)
|
256
|
0.85(0.10-7.20)
|
0.875
|
M stage (M0 vs. M1)
|
270
|
0.30(0.01-2.34)
|
0.294
|
Pathologic stage (Stage I vs. Stage II &Stage III &Stage IV)
|
347
|
0.39(0.25-0.61)
|
<0.001
|
Tumor status (Tumor free vs. With tumor)
|
352
|
0.55(0.36-0.84)
|
0.006
|
Residual tumor (R0 vs. R1&R2)
|
342
|
0.25(0.07-0.71)
|
0.016
|
Histologic grade (G1&G2 vs. G3&G4)
|
366
|
1.20(0.79-1.84)
|
0.397
|
Child-Pugh grade (A vs. B&C)
|
239
|
0.51(0.20-1.23)
|
0.138
|
Fibrosis ishak score (0 vs. 1/2&3/4&5/6)
|
212
|
1.29(0.73-2.28)
|
0.380
|
Adjacent hepatic tissue inflammation (Mild vs. None &Severe)
|
234
|
0.98(0.58-1.65)
|
0.943
|
Vascular invasion (No vs. Yes)
|
315
|
0.77(0.48-1.22)
|
0.261
|
TP53 status (Mut vs. WT)
|
358
|
0.94(0.59-1.49)
|
0.789
|
Total bilirubin(mg/dl) (<2 vs. >=2)
|
301
|
0.56(0.20-1.49)
|
0.250
|
Albumin(g/dl) (<3.5 vs. >=3.5)
|
297
|
1.51(0.88-2.60)
|
0.137
|
AFP (ng/ml) (<=400 vs. >400)
|
278
|
1.21(0.70-2.14)
|
0.497
|
Prothrombin time (<=4 vs. >4)
|
294
|
0.72(0.44-1.19)
|
0.203
|
Clinical value of SRD5A3 in prognosis.
To confirm the correlation between the SRD5A3 expression and the prognosis of HCC, survival rates between the high and low SRD5A3 level groups were compared. The Kaplan–Meier survival analysis found that HCC patients in the high SRD5A3 expression group had poorer overall survival (OS, HR=2.26(1.58-3.24), p<0.001) (Fig. 4). T stage (p < 0.001), M stage (p = 0.018), Pathologic stage (p < 0.001), Tumor status (p < 0.001) and SRD5A3 expression were included in Cox proportional risk regression model for multivariate analyses. Multivariate analyses showed that SRD5A3 remained independently associated with overall survival (HR=2.382(1.447-3.920), p<0.05), along with tumor status, TP53 status, T stage (all p p<0.05), as is shown in Table 3.
Based on Cox proportional risk regression model, tumor status, TP53 status, T stage and the expression of SRD5A3 were included in the nomogram (Fig.5a). The C-index of the prognostic model was 0.692(95%CI:0.666-0.719). We constructed calibration plots evaluating the agreement between the predicted and the actual OS for the prognosis model, and the results showed that the predicted results of the nomogram were reliable (Fig. 5b).
Table3 Univariate and multivariate analyses of overall survival in HCC patients from TCGA.
|
Univariate analysis
|
|
Multivariate analysis
|
|
Characteristics
|
HR (95% CI)
|
P value
|
HR (95% CI)
|
P value
|
Gender (Female vs. Male)
|
1.225(0.860-1.746)
|
0.260
|
|
|
Race (Asian vs. Black or African American &White)
|
0.760(0.525-1.101)
|
0.146
|
|
|
Age (<=60 vs. >60)
|
0.802(0.565-1.136)
|
0.214
|
|
|
Height (< 170 vs. >=170)
|
0.828(0.570-1.201)
|
0.319
|
|
|
Weight (<=70 vs. >70)
|
1.091(0.762-1.562)
|
0.634
|
|
|
BMI (<=25 vs. >25)
|
1.223(0.843-1.775)
|
0.289
|
|
|
T stage (T1&T2 vs. T3&T4)
|
0.394(0.277-0.560)
|
<0.001
|
0.499(0.280-0.888)
|
0.018
|
N stage (N0 vs. N1)
|
0.499(0.122-2.037)
|
0.333
|
|
|
M stage (M0 vs. M1)
|
0.248(0.078-0.789)
|
0.018
|
0.712(0.168-3.021)
|
0.645
|
Pathologic stage (Stage I vs. Stage II &
Stage III &Stage IV)
|
0.482(0.330-0.705)
|
<0.001
|
0.763(0.404-1.442)
|
0.405
|
Tumor status (Tumor free vs. With tumor)
|
0.424(0.291-0.617)
|
<0.001
|
0.521(0.310-0.875)
|
0.014
|
Residual tumor (R0 vs. R1&R2)
|
0.637(0.322-1.258)
|
0.194
|
|
|
Histologic grade (G1&G2 vs. G3&G4)
|
0.893(0.623-1.281)
|
0.539
|
|
|
Child-Pugh grade (A vs. B&C)
|
0.619(0.305-1.254)
|
0.183
|
|
|
Fibrosis ishak score (0&1/2 vs. 3/4&5/6)
|
1.320(0.794-2.197)
|
0.285
|
|
|
Adjacent hepatic tissue inflammation
(None vs. Mild &Severe)
|
0.815(0.501-1.325)
|
0.409
|
|
|
Vascular invasion (No vs. Yes)
|
0.742(0.490-1.124)
|
0.159
|
|
|
TP53 status (Mut vs. WT)
|
1.434(0.972-2.115)
|
0.069
|
1.818(1.083-3.050)
|
0.024
|
Total bilirubin(mg/dl) (<2 vs. >=2)
|
0.858(0.347-2.118)
|
0.740
|
|
|
Albumin(g/dl) (<3.5 vs. >=3.5)
|
1.085(0.665-1.771)
|
0.743
|
|
|
AFP (ng/ml) (<=400 vs. >400)
|
0.947(0.579-1.548)
|
0.827
|
|
|
Prothrombin time (<=4 vs. >4)
|
0.752(0.496-1.140)
|
0.179
|
|
|
SRD5A3 (High vs. Low)
|
2.261(1.580-3.236)
|
<0.001
|
2.382(1.447-3.920)
|
<0.001
|
Gene sets enriched in SRD5A3 expression phenotype and protein-protein interaction
The functions of SRD5A3 were predicted by analyzing GO and KEGG in Metascape. The top 20 GO enrichment items were classified into three functional groups: biological process group (11 items), cellular component group (3 items), and molecular function group (6 items) (Fig. 6a, b). Regarding to the KEGG pathway analysis as shown in Fig.6c. Mineral absorption, gastric cancer, Salivary secretion, Protein digestion and absorption, parathyroid hormone synthesis, secretion and action were the significant metabolic pathways.
In addition, to better understand the potential biological function of SRD5A3 in HCC, we performed a Metascape protein–protein interaction enrichment analysis. The protein–protein interaction network and MCODE components identified in the gene lists are shown in Fig. 6d. The four most significant MCODE components were extracted from the protein–protein interaction network. After pathway and process enrichment analysis was independently applied to each MCODE component, the results showed that biological function was mainly related to comification, intermediate filament, keratinization, O-glycan processing, protein O-lined glycosylation, macromolecule glycosylation, G protein-coupled receptor binding, cellular response to lipid, anchored component of membrane. A single gene may be correlated with multiple
target genes, which may form functionally interactive modules involved in regulation of SRD5A3 in HCC. Through the application of molecular complexity detection (MCODE) algorithm to identify the densely connected network, we obtained 22 hub genes, including KRT19, KRT1, KRT6C, KRT80, KRT4, KRT12, MUC5B, MUC21, MUC5AC, MUC1, CHST4, CEACAM7, PRSS21, YPD68, GP2, SAA1, PENK, SSTR5, CXCL5, FGF23, FDF5, CALML3, as is shown in Fig. 6e. These hub genes mainly belong to the keratin and MUC family.
To identify signaling pathways that are differentially activated in HCC, we further conducted GSEA between low and high SRD5A3 expression data sets. Using MSigDB Collection (c2.cp.v7.0.symbols.gmt and c5.all.v7.0.symbols.gmt), we selected the most significantly enriched signaling pathways based on their normalized enrichment score (NES) (Fig. 7, Table 4). Due to the limited space, only six pathways of high and expression are listed here. GSEA identified cell cycle mitotic (Fig. 7a), cell cycle checkpoints (Fig. 7b), mitotic nuclear division (Fig. 7c), O-glycan processing (Figure 7d), mitotic prometaphase (Fig. 7e), protein O-linked glycosylation (Fig. 7f) were differentially enriched in the high SRD5A3 expression phenotype pathway.
Table 4. GSEA analysis of SRD5A3 related enriched gene set with high expression of SRD5A3
MsigDB collection
|
Gene set name
|
NES
|
p.adjust
|
FDR
|
c2.cp.v7.0.symbols.gmt
c5.all.v7.0.symbols.gmt
|
REACTOME_CELL_CYCLE_MITOTIC
|
3.02
|
0.016
|
0.01
|
REACTOME_CELL_CYCLE_CHECKPOINTS
|
2.998
|
0.016
|
0.01
|
GO_MITOTIC_NUCLEAR_DIVISION
|
2.983
|
0.013
|
0.008
|
GO_O_GLYCAN_PROCESSING
|
2.979
|
0.013
|
0.008
|
REACTOME_MITOTIC_PROMETAPHASE
|
2.918
|
0.016
|
0.01
|
GO_PROTEIN_O_LINKED_GLYCOSYLATION
|
2.897
|
0.013
|
0.008
|
NES: normalized enrichment score; FDR: false discovery rate. Gene sets with FDR<0.25 and p.adjust<0.05 are considered as significant.
Relationship between SRD5A3 expression and tumor-infiltrating immune cells
Analysis and characterisation of proteins and genes involved in cancer development at the molecular level, could add to our knowledge of potential prognostic factors [11]. Numerous studies had now documented a link between the immune infiltrate in several human carcinoma types and prognosis and response to therapy [12]. Therefore, we tried to find whether SRD5A3 expression was associated with immune infiltration in HCC. Spearman correlation was employed to show the association between the expression level (TPM) of SRD5A3 and immune cell infiltration level quantified by GSEA in the HCC tumor microenvironment. Additionally, our findings strongly underline the significant role of SRD5A3 in immune infiltration. SRD5A3 was negatively correlated Th17(R = -0.238, p < 0.001) (Fig. 8a), Cytotoxic cells (R = -0.234, p < 0.001) (Fig. 8b), whereas it was positively correlated with Th2 cells (R = 0.258, p < 0.001) (Fig. 8c).