1. Expression of cuproptosis-related genes (CRGs) in AML patients
A total of 172 AML patients in TCGA database were enrolled in this analysis after filtered as mentioned above. In addition, 136 samples from GSE37642 cohort were recruited as validation group. The detailed clinical information (age, gender, FAB subtype, WBC, etc.) for TCGA and GSE37642 cohorts were gathered and listed in Table 1.
Table 1
Clinical characteristics of AML patients in TCGA and GEO cohorts.
Characteristics
|
TCGA
|
GSE37642
|
Gender (number)
Male
Female
|
92
80
|
|
Age (years)
Median
Range
|
58
18-88
|
59.5
18-85
|
FAB (number)
M0
M1
M2
M3
M4
M5
M6
M7
NA
|
16
44
38
16
34
17
2
3
2
|
8
29
47
7
17
19
7
1
1
|
WBC
Median
Range
|
16.5
0.4-297.4
|
|
BM-BLAST
Median
Range
|
72.5
30-100
|
|
Living status (number)
Dead
Alive
|
113
59
|
98
38
|
Overall survival (months)
Median
Range
|
17.25
0.1-118.1
|
14.68
0.07-130.63
|
We extracted 14 CRGs through screening relative articles published. The expression of CRGs in AML patients were compared with 70 control samples from GTEx database. As seen in Fig. 1a, ATP7A, ATP7B, DLAT, DLD, FDX1, GCSH, LIAS, PDHA1, SLC25A3, and SLC31A1 were decreased, while DBT, DLST and LIPT1 were increased with P values all less than 0.001. No obvious difference was found in PDHB expression.
2. Prognostic values of CRGs in TCGA AML cohort
Next, we explored the prognostic values of CRGs. Kaplan-Meier analyses were performed for each CRGs in TCGA AML cohort and the results were visualized as forest plots to display P values and hazard ratios (HR). According to the forest plots, patients with a low-expression of GCSH or LIPT1, or a high-expression of DLAT exhibited markedly worse overall survival rate (Fig. 1b).
3. Construction of a prognostic gene signature in AML
Since PDHB showed no difference of expression in AML patients compared with controls, only 13 CRGs were retained for further analysis except for PDHB. LASSO Cox regression analysis was applied to develop a prognostic model. As a result, a 3-gene signature was generated (Fig. 2a, b). The risk score was calculated through the following formula: risk score = (-0.01161) * GCSH expression + (-0.40387) * LIPT1 expression + (0.248985) * PDHA1 expression.
To evaluate the performance of the 3-gene signature, 172 AML patients from TCGA were stratified equally into low- and high-risk subgroups based on the median score calculated by the formula (Fig. 2c). Patients in the latter group had more deaths than in the former group (Fig. 2d). In addition, Kaplan-Meier analysis revealed that the OS time was notably different between the two groups. Relative to high-risk group, patients in low-risk group had an advantage of higher OS (P = 0.003, Fig. 2e). We further performed time dependent ROC analysis to evaluate the specificity and sensitivity of the gene signature. The area under the ROC curve (AUC) was 0.665, 0.644 and 0.611 for 1, 2, and 3-year survival, separately (Fig. 2f).
We next evaluated whether the risk model could serve as an independent prognostic factor. As shown in Fig. 2g and h, both the univariate and multivariate Cox regression analyses implied that the risk score was an independent prognostic factor (p = 0.0034, 0.0027, separately). These results demonstrated that a high risk-score was able to predict independently poor survival of AML patients.
4. Validation of the risk model in GEO datasets
GSE37642 cohort was utilized as external validation. A total of 136 AML patients were classified equally into the low- and high-risk groups according to the median risk score calculated (Fig. 3a). Same to the TCGA cohort, patients who had a high risk-score in GSE37642 were observed to have shorter survival times than those in low-risk group (Fig. 3b). The OS rate in high-risk group was significantly poorer than in low-risk group, as the Kaplan-Meier curves indicated (P = 0.0216, Fig. 3c). In addition, time dependent ROC analysis showed similar AUC: 0.602, 0.621 and 0.636 for 1, 2, and 3-year survival, separately (Fig. 3d).
5. Functional analysis of DEGs between low- and high-risk groups
To further explore risk score-associated differences of gene expression and biological functions and pathways, we screened for DEGs between the two subgroups using R package “limma”. The criteria were set as mentioned above, and as a result, 240 genes were identified. Among them, 159 genes were upregulated while 81 were downregulated in the high-risk group. The result was displayed in a volcano plot (Fig. 4a).
As shown in Fig. 4b & c, GO and KEGG functional enrichment analysis were performed. The 240 DEGs were mainly enriched in the following terms or pathways: positive regulation of inflammatory response, amide binding, peptide binding, osteoclast differentiation, and B cell receptor signaling pathway, etc.
A PPI network of 240 DEGs were constructed in STRING database and analyzed in Cytoscape software. The top 3 nodes of most importance were selected: S100A9, S100A8, and LILRB2 (Fig. 4d).
6. Prediction of drug sensitivity based on the risk model
We further tested the impact of risk score on cells’ response to different drugs based on GDSC drug sensitivity data. A total of 41 drugs were found to have a significant difference of IC50 values between low- and high-risk groups. AML patients in the high-risk group showed a worse response to all of the 41 drugs, the top 5 of which were Parthenolide, SL.0101.1, Sunitinib, KU.55933, and RDEA119, with P values of 7.71E-06, 1.49E-05, 4.99E-05, 1.07E-04, and 1.21E-04, separately (Fig. 5a).
7. Results of immune cell infiltration analysis
The immune cell infiltration analysis was also performed to evaluate the correlation of risk scores with immune cells and immune-related pathways in the TCGA cohort. We discovered that aDCs, pDCs, and tumour-infiltrating lymphocytes (TILs) were significantly enriched in the high-risk group (Fig. 5b). Compared to the low-risk group, some immune pathways were upregulated in high-risk group, such as APC co-inhibition, CCR, Check-point, HLA, MHC class Ⅰ, Parainflammation, and T cell co-inhibition (Fig. 5c).