DEmRNAs, DElncRNAs and DEmiRNAs
The flow diagram of the current study is shown in Figure 1.
A total of 5373 DEmRNAs (2886 upregulated and 2487 downregulated), 355 DEmiRNAs (217 upregulated and 138 downregulated) and 1159 DElncRNAs (819 upregulated and 340 downregulated) were obtained through comparing COAD and normal tissues in TCGA-COAD database. The volcano maps and heatmaps showed the DEmRNAs, DEmiRNAs and DElncRNAs expression of COAD respectively. (Figures 2A-F).
Construction of a Pyroptosis-Related ceRNA Network in COAD
To construct the pyroptosis-related ceRNA network, several databases were used. Based on the miRcode database, the lncRNA-miRNA interactions including 218 DElncRNAs and 38 DEmiRNA were identified. The interaction of miRNA-mRNA including 38 miRNAs and 1533 miTGs was acquired based on TargetScan, mirTarBase and miRDB databases. Then, we screened out 5 pyroptosis-related genes by taking the intersection of 1533 miTGs, 5373 DEmRNAs and 155 pyroptosis-related genes (Figure 3A). Finally, 5 DEmRNAs, 7 DEmiRNAs and 132 DElncRNAs were gained, then we constructed a pyroptosis-related network of COAD. (Figure 3B, Supplement Table S2).
Construction of a PRlncRNA Prognostic Risk model
We screened 11 survival-related lncRNAs by employing univariate Cox regression analysis (p<0.05, Supplement Table S3). Among them, 8 lncRNAs (HOTAIR, LINC00402, SFTA1P, LINC00461, DSCR8, CYP1B1-AS1, LINC00330, ALMS1-IT1) were risk genes with HR>1 and other 3 lncRNAs (ZRANB2-AS1, MYB-AS1, TP53TG1) were protective genes with HR<1 (Figure 4A, Table 2). Subsequently, the 11 lncRNAs were incorporated into LASSO regression analysis to avoid overfitting, and a prognostic PRlncRNA risk model was constructed including the above 11 lncRNAs (Figures 4B, C). The risk score was computed as the following formula: risk score = (0.0013*HOTAIR exp) + (0.0174*LINC00402 exp) + (0.0186*SFTA1P exp) + (0.0373*LINC00461 exp) + (-0.2108*ZRANB2-AS1 exp) + (-0.0012* TP53TG1 exp) + (-0.0647* MYB-AS1 exp) + (0.0032* DSCR8 exp) + (0.0084* LINC00330 exp) + (0.0156* CYP1B1-AS1 exp) + (0.0053*ALMS1-IT1 exp). COAD patient was categorized into high or low risk groups in terms of median risk value. According to the previous ceRNA hypothesis, 7 miRNA (hsa-mir-155, hsa-mir-21, hsa-mir-182, hsa-mir-96, hsa-mir-152, hsa-mir-17, hsa-mir-106a) and 5 mRNAs (CEBPB, IL1B, SESN2, ALK, TXNIP) were obtained, and then we re-established a ceRNA network based on them (Figure 4D).
Functional Enrichment Analysis of the Pyroptosis-Related ceRNA Genes
To explore potential functions of these 11 lncRNAs, 7miRNAs and 5mRNAs in biological processes, GO and KEGG function enrichment analyses were performed with P value < 0.05 as the threshold. For GO analysis, these genes were primarily enriched in the cellular response to biotic stimulus and lipid catabolic process in biological processes (BPs); TORC2 complex, GATOR2 complex and TOR complex in cellular components (CCs); ubiquitin-like protein ligase binding in molecular function (MF) (Figure 4E). KEGG pathway analysis suggested these genes were markedly concentrated in the IL-17 signaling pathway, TNF signaling pathway, Tuberculosis, and NOD-like receptor signaling pathway (Figure 4F).
Validation of PRlncRNA Prognostic Risk Model
In the prognostic PRlncRNA risk model, 8 lncRNAs (HOTAIR, LINC00402, SFTA1P, LINC00461, DSCR8, CYP1B1-AS1, LINC00330, ALMS1-IT1) were upregulated while other 3 lncRNAs (ZRANB2-AS1, MYB-AS1, TP53TG1) were downregulated in tumor tissues (Figure 5A). Kaplan-Meier analysis showed that patients in the high-risk group had shorter survival times (Figure 5B). Likewise, patients were separated into two risk groups based on the median risk score (Figure 5C). As the risk score increased, the patient's survival time decreased gradually (Figure 5D). ROC analysis indicated that the PRlncRNA risk model was able to excellently predict the 1-year (0.744), 3-year (0.696) and 5-year (0.623) survival of COAD patients, respectively (Figure 5E).
Independent Prognostic Analysis of the Prognostic PRlncRNA Risk Model
To explore whether the prognostic PRlncRNA risk model can be an independent prognostic factor, univariate and multivariate Cox regression analyses were employed. The univariate Cox regression analysis showed that the hazard ratio (HR) of this risk model was 3.054 (95% CI: 2.185-4.269) (Figure 6A). And we got consistent results using the multivariate Cox model (HR: 2.647, 95%CI:1.841-3.804) (Figure 6B). Eventually, these results implied that our prognostic PRlncRNA risk model can be an independent prognostic factor independent of other clinical characteristics. Moreover, the heatmap of clinical characteristics implied that the survival status of patients was differentially distributed between low- and high-risk subgroups (p<0.05, Figure 6C).
Construction of a Predictive Nomogram
To further assess whether this prognostic PRlncRNA risk model had optimal predictive capabilities, we collected clinical characteristics, including age, gender and stage, as the candidate predictive biomolecular indicators. We constructed a nomogram with risk, age, gender and stage (Figure 7A). The calibration curves suggested that the nomogram had good predictive power (Figures 7B-D). The above findings showed a promising capacity of the PRlncRNA risk model for patient prognosis and survival prediction.
Immunoinfiltration Analysis
Pyroptosis plays an important role in regulating the tumor-immune microenvironment (TIME). In the current study, the CIBERSOR algorithm was utilized to investigate the relevance of risk scores to immune infiltrating cells (Figure 8A). B cells naïve, T cells CD8, T cells follicular helper, T cells regulatory (Tregs) and Macrophages M1 were higher expressed while T cells CD4 memory resting, Dendritic cells activated and Mast cells activated were lower expressed in the high-risk group by using Wilcoxon rank-sum test (Fig 8B). Additionally, we performed correlation analysis between prognostic risk model with the above eight significant difference immune infiltrating cells, and the results showed that the risk score positively related to the immune infiltration of B cells naïve (R=0.19, p=2e-04), Macrophages M1 (R=0.17, p=0.0012), T cells CD8 (R=0.21, p=4.7e-05) and T cells follicular helper (R=0.13, p=0.011), while negatively associated with T cells CD4 memory resting infiltration (R=-0.19, p=0.00015) (Figure 8C).
Relationship of the Prognostic PRlncRNA Risk Model and Immune Checkpoint Blockades (ICBs)
ICB-related immunotherapy had become a promising modality for patients with COAD. To investigate the response of COAD samples to immunotherapy, we examined the association of ICB-related genes (i.e., PD-1, CTLA4, HAVCR2, etc) and risk score (Figure 9A). The results showed that risk score was significantly positively correlated with PD-1 (R=0.245, p=1.44e-06), PD-L1 (R=0.24,P=2.5e-06), PD-L2 (R=0.15, p=0.0025), GITR (R=0.13,p=0.012), HAVCR2 (R=0.17,p=0.00065) and CTLA4 (R=0.18, p=0.00044), while negatively associated with SOAT1 (R=-0.1, p=0.042), indicating that risk score model was useful in assessing patients' response to immunotherapy (Figure 9B).