Correlation of stromal and immune scores with different clinical features in patients with non-small-cell lung cancer
A total of 196 patients with NSCLC were included in this study, of which 107 (54.6%) were males, and 89 (45.4%) were females. Of the 196 patients, 106, 66, and 24 had LUAD, LUSC, and large cell neuroendocrine carcinoma, respectively. Regarding the clinical classification stage, 40 cases were in stage IA, 90 cases in stage IB, 6 cases in stage IIA, 29 cases in stage IIB, 21 cases in stage IIIA, 6 cases in stage IIIB, and 4 cases in stage IV (Table 1).
Table 1
Clinical features of patients in GSE37745
Clinical features
|
|
Count (%)
|
Age (y)
|
|
|
|
<=65
|
102(52%)
|
|
> 65
|
94(48%)
|
Gender
|
|
|
|
Male
|
107(55%)
|
|
Female
|
89(45%)
|
Status
|
|
|
|
survive
|
51(26%)
|
|
died
|
145(74%)
|
Histology
|
|
|
|
LUAD
|
106(54%)
|
|
LUSC
|
66(34%)
|
|
LCNEC
|
24(12%)
|
Stage
|
|
|
|
IA
|
40(20%)
|
|
IB
|
90(46%)
|
|
IIA
|
6(3%)
|
|
IIB
|
29(15%)
|
|
IIIA
|
21(11%)
|
|
IIIB
|
6(3%)
|
|
IV
|
4(2%)
|
LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; LCNEC, Large cell neuroendocrine carcinoma |
Then, we used the ESTIMATE algorithm to examine the distribution of stromal and immune scores for each patient. The results showed that LUAD and early clinical stages of NSCLC were associated with high immune scores (P < 0.05, Fig. 1A, B). Otherwise, patients with high immune scores and high stromal scores showed better survival rate (P < 0.05, Fig. 1C, D). Survival analysis showed that the OS in high and low stromal/immune score groups was not significantly different (P > 0.05, Fig. 1E, F).
Screening of differentially expressed genes between different score groups
In total, 893 genes were differentially expressed between high and low stromal score groups in the GSE37745 dataset, including 738 upregulated and 155 downregulated genes. Similarly, 787 genes were differentially expressed between high and low immune score groups, including 635 upregulated and 152 downregulated genes. The top 20 DEGs from different score groups were plotted, respectively (Fig. 2A, B). The number of overlapping DEGs between the stromal score groups and the immune score groups was 437, including 380 upregulated and 57 downregulated genes (Fig. 2C, D).
Identification of prognostic genes from differentially expressed genes
The Kaplan–Meier method was used to analyze the relationship between OS and the 437 overlapping DEGs in the GSE37745 dataset, and 43 genes were found to be closely associated with OS (Supplementary table 1). Therefore, PLAUR, PSAT1, and RAD54B were identified as prognosis risk genes in NSCLC (Fig. 2E).
Validation of prognostic genes and selected hub genes
The expression of PSAT1 was verified using TCGA dataset and GEPIA2, which demonstrated that only PSAT1 was highly expressed in LUAD and LUSC (Fig. 3A) and upregulated with increasing tumor stage (P = 1.991e-05, Fig. 3B). Kaplan–Meier Plotter data showed that higher expression of PSAT1 led to lower survival time in NSCLC (Fig. 3C). The results were further validated by verifying the expression level of PSAT1 using qRT-PCR. As expected, in comparison with matched adjacent nontumor tissues, PSAT1 was significantly upregulated in cancer tissues (Fig. 3D and Supplementary table 2). In addition, we examined the expression of PSAT1 in NSCLC tissues and normal tissues by WB. A higher level of PSAT1 was observed in the NSCLC tissues and tumor samples, whereas a lower expression of PSAT1 was detected in normal tissues (Fig. 3E). The protein expression of PSAT1 was also investigated by immunohistochemical staining using tissue sections obtained from patients with NSCLC at our cancer center, which showed that PSAT1 was more highly expressed in tumor tissues than in peritumor tissues (Fig. 3F). Given the results of above, we evaluated hub genes' levels in a cohort of 99 NSCLC patients with median 5-y follow-up in which high-score patients displayed poor OS (Fig. 3G and Supplementary table 3).
Enrichment analysis of PSAT1 co-expression genes
To further explore the molecular mechanism of the PSAT1 gene in tumorigenesis, we identified the top 20 PSAT1-binding proteins using the STRING database (Fig. 4A). GO term enrichment showed that PSAT1 co-expression genes were mainly involved in the alpha-amino acid metabolic and biosynthetic process, cellular amino acid metabolic process, carboxylic acid biosynthetic process, organic acid biosynthetic process, and cellular amino acid biosynthetic process (Fig. 4B). The KEGG pathway enrichment analysis indicated that biosynthesis of amino acids; glycine, serine and threonine metabolism; and carbon metabolism are involved in the effect of PSAT1 on tumor pathogenesis (Fig. 4C and Supplementary table 4). Remarkably, the top 20 positively co-expressed genes showed a high probability of becoming high-risk markers of NSCLC (Fig. 4D).
Transcription factor–miRNA–target network construction
To understand the regulatory factors of PSAT1 in NSCLC, we analyzed the miRNAs and transcription factors (TFs) associated with PSAT1. This network consisted of 24 nodes and 36 interactions, from which we were able to identify 12 TFs—PAX9, TP53, ETS1, FOXI1, IRF3, NR2F1, NFYA, E2F1, NKX2-3, GATA5, SRF, and EBF1and 12miRNAs—including hsa-miR-1914-5p, hsa-miR-192-5p, hsa-miR-7109-5p, hsa-miR-874-3p, hsa-miR-7113-3p, hsa-miR-1224-5p, hsa-miR-4656, hsa-miR-6886-5p, hsa-miR-3619-3p, hsa-miR-1207-5p, hsa-miR-665 and hsa-miR-659-3p.that potentially interact with PSAT1 (Fig. 4E and Supplementary table 5).
Immune infiltration analysis of PSAT1 in NSCLC
We evaluated the correlation between PSAT1 expression and immune invasion in NSCLC using the TIMER database. The results revealed that PSAT1 expression levels were correlated with infiltration of B cells (LUAD: r = -0.072, P = 1.14e− 01; LUSC: r = -0.024, P = 5.98e− 01), CD8+ T cells (LUAD: r = 0.081, P = 7.24e− 02; LUSC: r = -0.065, P = 1.57e− 01), CD4+ T cells (LUAD: r = -0.193, P = 1.89e− 05; LUSC: r = -0.298, P = 3.56e− 11), macrophages (LUAD: r = -0.117, P = 9.94e− 03; LUSC: r = -0.197, P = 1.54e− 05), neutrophils (LUAD: r = -0.018, P = 6.95e− 01; LUSC: r = -0.239, P = 1.30e− 07), and dendritic cells (LUAD: r = -0.115, P = 1.07e− 02; LUSC: r = -0.209, P = 4.68e− 06; Fig. 4F).
Pan-cancer analysis of PSAT1
Gene expression analysis indicated a significant difference in PSAT1 expression between healthy tissues and tumor tissues of breast invasive carcinoma (BRCA), cervical squamous cell carcinoma and endocervical adenocarcinoma, Colon adenocarcinoma, lymphoid neoplasm diffuse large B-cell lymphoma, glioblastoma multiforme, Brain lower grade glioma, ovarian serous cystadenocarcinoma, prostate adenocarcinoma, rectal adenocarcinoma, stomach adenocarcinoma (STAD), thymoma, and uterine corpus endometrial carcinoma (UCEC; Fig. 5A, P < 0.05). We also found that PSAT1 expression is closely associated with the pathological stages of BRCA, kidney chromophobe, kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), STAD, testicular germ cell tumor, thyroid carcinoma (THCA), and UCEC (Fig. 5B, P < 0.05). Next, we assessed the prognostic value of PSAT1 expression across cancers in Kaplan-Meier Plotter, and results showed that PSAT1 plays a detrimental role in BRCA, KIRC, KIRP, liver hepatocellular carcinoma (LIHC), pancreatic adenocarcinoma (PAAD), sarcoma (SARC), THCA, and UCEC (Fig. 5C, P < 0.05).