Identification of disulfidptosis-related lncRNA signature using RNA-Sequencing and Bioinformatics Analysis in Head and Neck Squamous Cell Carcinoma

doi:10.21203/rs.3.rs-4321726/v1

Download PDF

Article

Identification of disulfidptosis-related lncRNA signature using RNA-Sequencing and Bioinformatics Analysis in Head and Neck Squamous Cell Carcinoma

https://doi.org/10.21203/rs.3.rs-4321726/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

As a novel form of regulated cell death, disulfidptosis generating a favorable opportunity in better understanding of tumor pathogenesis and therapeutic strategies. Long non-coding RNAs (lncRNAs) regulate the biology functions of tumor cells with combination of variable targets. However, the prognostic value of disulfidptosis-related lncRNAs (DRlncRNAs) in head and neck squamous cell carcinoma (HNSC) remains largely unknown. Hence, we aimed to delve into the molecular characteristics of HNSC, focusing on disulfidptosis-related long non-coding RNAs (DRLs), and explore potential therapeutic strategies. We conducted lncRNA-mRNA RNA-Seq analyses on HNSC cell lines and utilized The Cancer Genome Atlas (TCGA) database, comprising RNA sequencing, clinical data, and gene mutation information from 522 HNSC tumors and 44 normal tissues. Bioinformatics analyses were employed to identify DRLs associated with disulfidptosis and assess their prognostic significance. Additionally, a risk model based on three selected DRLs was constructed to investigate its correlation with patient outcomes. Tumor mutation burden and chemotherapeutic responses were also explored in relation to the risk model. Our analysis pinpointed three DRLs (LINC02434, AC245041.2, and LINC02762) significantly correlated with HNSC prognosis. qRT-PCR results validated the consistency of LINC02434 and AC245041.2. The risk model, based on these DRLs, effectively stratified patients into high- and low-risk groups, revealing distinct survival patterns. Furthermore, the risk model demonstrated independent prognostic value in HNSC. Examination of the tumor microenvironment highlighted differences between high- and low-risk groups, suggesting potential implications for immunotherapeutic approaches. Specific chemotherapeutic agents with varying sensitivities across risk groups and molecular subtypes were identified. The identified three DRLs signature emerges as a novel biomarker with predictive value for HNSC prognosis. This study provides insights into potential therapeutic avenues and contributes to our understanding of the molecular landscape of HNSC.

Biological sciences/Cancer/Head and neck cancer

Health sciences/Biomarkers/Predictive markers

Head and neck squamous cell carcinoma

Disulfidptosis

LncRNA

Prognosis

Drug sensitivity

Head and neck squamous cell carcinomas (HNSCs) primarily arise from the mucosal epithelium in the oral cavity, pharynx, and larynx, predominantly manifesting as squamous cell carcinomas^1,2. Ranking as the seventh most prevalent cancer worldwide, HNSC is associated with a poor prognosis, witnessing over 660,000 new cases and 325,000 deaths annually ^1,2. Epidemiological studies have identified various risk factors contributing to HNSC, including the consumption of carcinogen-containing substances such as tobacco, areca nut, betel quid, and alcohol, exposure to environmental pollutants, as well as infections with human papillomavirus (HPV) and Epstein-Barr virus (EBV)^1,3,4. Multiple mechanisms have been proposed to underlie the invasion and metastasis of HNSC tumor cells ^5,6. These encompass DNA mutations, genetic impairments, disrupted epigenetic changes, heightened inflammation responses, and immune evasion facilitated by the activation of inhibitory checkpoint pathways that curtail the immune response^7-12. Typically, patients with HNSC receive curative treatments including surgery, adjuvant radiation or chemotherapy in combination with radiation, and immunotherapy, guided by factors such as the anatomical subsite, disease stage, functional considerations, and patient preferences¹³. However, despite these treatment approaches, the 5-year overall survival rate remains less than 50% due to the molecular heterogeneity of HNSC, challenges associated with early detection, and the lack of reliable predictive biomarkers^14,15. Consequently, there is an urgent need to develop a robust prognostic model tailored specifically for HNSC, which would significantly improve patient outcomes and guide treatment strategies.

The metabolic reprogramming of cancer cells, driven by the bioenergetic and biosynthetic demands for cell survival, is a well-established hallmark of tumor initiation and progression¹⁶. Targeting cancer metabolism has gained significant attention and has been extensively explored in cancer treatment strategies^17,18. Regulated cell death (RCD), a common type of cell death, has emerged as a key aspect of cancer metabolic therapy ^19,20. Multiple forms of RCD have been identified, including apoptosis, necroptosis, ferroptosis, autophagy-dependent cell death, immunogenic cell death, lysosome-dependent cell death, oxeiptosis, disulfidptosis, and alkaliptosis, and others^21,22. Of note, a novel mechanism of disulfidptosis has recently been discovered, involving disulfide bond reactions between intracellular and extracellular protein molecules of the cytoskeleton²³. The conformational changes and alterations induced by excessive intracellular cystine accumulation and disulfide formation ultimately trigger the collapse of the histone skeleton and cell death²⁴. Furthermore, emerging studies have placed emphasis on the relationship between disulfidptosis and cancer^25,26. For instance, in hepatocellular carcinoma and thyroid carcinoma, the low disulfidptosis subtype has been associated with better patient prognosis and characterized by high infiltration of immune cells^27,28. Hence, maintaining a balanced disulfidptosis state may hold potential as a therapeutic target to improve treatment response rates and survival outcomes for HNSC patients ^23,29. However, its correlation with the prognosis and drug therapy outcomes in HNSC merit further clarification.

Long non-coding RNAs (LncRNAs) constitute a substantial portion of the genomes in complex organisms, encompassing non-coding transcripts that exceed 200 nucleotides in length and have the ability to modulate the expression of various infrastructural RNAs³⁰. With the advances in genomics and transcriptomics recently, the biological functions of lncRNAs are being unveiled, revealing their involvement in cell differentiation, development regulation, as well as their role as oncogenes or tumor suppressors in cancer initiation and progression^31,32. Li et al. indicated that disulfidptosis-related lncRNAs (DRLs) such as LINC01352, AC093673.1, AL606834.1, AL365181.2, and MHENCR serve as prognostic indicators and are relevant to immune responses in lung adenocarcinoma. These lncRNAs likely exert their functional effects by modulating the structural composition of the cytoskeleton, thereby influencing actin binding and activation³³. In another study, DRLs including ZEB1-AS1, SNHG16, and ALMS1-IT1 were found to be highly expressed in colon adenocarcinoma samples, predicting prognosis and influencing the response to immunotherapy and chemotherapy in patients³⁴. Additionally, the upregulation of FOXD2-AS1 and AC002070.1 in the immune microenvironment of kidney renal cells suggests their potential role in disulfidptosis in kidney renal clear cell carcinoma³⁵. The identification of DRLs holds fundamental implications for unraveling the specific mechanisms underlying oncogenesis and predicting the prognosis of HNSC. By investigating the involvement of these lncRNAs, we can gain valuable insights into the underlying molecular mechanisms and develop potential therapeutic strategies for HNSC patients.

In this study, we used RNA Sequencing (RNA-Seq) to identify differentially expressed DRLs and mRNA in HNSC cell lines and developed a novel prognostic model of HNSC based on independent expression patterns of these DRLs. Our model incorporates not only survival analysis but also explores the underlying immune landscapes and chemotherapeutic efficacy within the context of HNSC. Through our high throughput sequencing and comprehensive analysis, we have successfully illuminated the significanft role of DRLs in prognosis prediction and response to immunotherapy in HNSC. This signature represents a major step forward in predicting patient prognosis and guiding new targets of clinical management for HNSC.

LncRNA-mRNA sequencing revealing involvement of disulfide-induced cell death in HNSC

Utilizing lncRNA-mRNA RNA-Seq, we identified 1,151 upregulated lncRNAs and 2,529 upregulated mRNAs in the HNSC cell group compared to the controls. Additionally, 2,277 lncRNAs and 2,603 mRNAs were downregulated, exhibiting a fold change > 2.0 and P < 0.05 in the HNSC cell group (Figure 1A-B). Comprehensive details about the differentially expressed RNAs can be found in (Supplementary Table S2-S3).

Given that the functions of lncRNAs primarily involve the regulation of coding gene expression³⁶, we performed GO enrichment and KEGG pathway analyses to elucidate the molecular functions of the DE lncRNAs. Remarkably, significant GO enrichment terms in all DE mRNAs encompassed cell-cell adhesion, microtubule cytoskeleton, response to oxidative stress, and dynein complex (Figure 1C). Results from the KEGG pathway enrichment analysis of the DE mRNAs revealed clusters of functional pathways, including axon guidance, cell adhesion molecules, proteoglycans in cancers, the Hippo signaling pathway, and the PI3K-AKT pathway (Figure 1D). Within the context of oxidative stress, cells undergo a shift to form protein disulfide bonds, leading to the collapse of the cytoskeleton network—a process crucial for promoting disulfide stress-induced cell death. Notably, we identified representative DE mRNAs associated with the microtubule cytoskeleton, response to oxidative stress, and three key genes linked to disulfidptosis pathways (SLC7A11, MYH10, and TLN1) (Figure 1B, E). These findings collectively suggest the potential involvement of disulfide-induced cell death in HNSC cell lines.

Identification of three DRLs and construction of the risk model

To delve into the role of disulfidptosis in HNSC, we compiled a set of eighteen previously reported disulfidptosis-related genes from existing literature (Supplementary Table S4) and extracted their expression data from HNSC samples (Supplementary Table S5). The visual representation of the study workflow through a flowchart was shown in Figure S1. Employing Pearson correlation analysis, we identified a total of 3,573 lncRNAs. Module identification was conducted using the dynamic tree clipping method, and modules with at least 50 lncRNAs and a similarity greater than 0.75 were merged. The resulting module LncRNAs tree diagram is presented in Figure 2A. Correlation analysis between modules and trait groupings generated by clustering was performed, and a heat map of the module and trait data was plotted (Figure 2B). LncRNAs in the yellow module were selected for further analysis (Supplementary Table S6), demonstrating correlations with one or more disulfidptosis-related genes (Figure 2C).

To ensure the reliability of our findings, we randomly assigned a total of 519 HNSC patients into a training set (n = 260) and a testing set (n = 259). The clinical characteristics of these patient sets are detailed in Table 1, revealing no significant differences in clinical traits between the training and testing sets. Utilizing the univariate Cox regression analysis method, we screened and identified 14 lncRNAs significantly associated with prognosis, labeling them as disulfidptosis-related prognostic lncRNAs (p < 0.05) (Figure 2D). To streamline variables and prevent overfitting in the training cohort, the LASSO Cox regression analysis method was applied, incorporating the lambda value (Figure 2E-F). Ultimately, three lncRNAs were identified according to our lncRNA sequencing results, and a predictive signature risk model was developed based on their expression patterns. The coefficients obtained through regression analysis determined the weighting for each lncRNA in the risk model. The correlation between the three lncRNAs in the signature and disulfidptosis-related genes is illustrated in Figure 2G. Additionally, the risk scores for all samples, calculated based on the expression levels of the three key DRLs, are provided in Supplementary Table S7.

Validation of the three DRLs by qRT-PCR

In the validation cohort, the levels of key DRL genes, namely LINC02434, AC245041.2, and LINC02762, were assessed using qRT-PCR. As illustrated in Figure 2H, all three genes exhibited differential expression in FaDu cells compared to NP69 cells (p < 0.0001). Except for LINC02762, the expression pattern of these three genes consistently aligned with both our RNA-sequencing and database analysis results.

Survival analysis of the risk model and three DRLs

In the survival analysis, we classified the training patients into high-risk and low-risk groups based on their median risk scores. This same categorization was applied to the testing and all patients. In the training set, patients classified as low-risk exhibited significantly better OS compared to those in the high-risk group (p < 0.001) (Figure 3A). This trend persisted consistently observed in both the testing and all patient sets (p = 0.003 and p < 0.001, respectively) (Figure 3B-C). When examining PFS, patients classified as low-risk in the training set demonstrated a higher survival rate compared to those in the high-risk group (p = 0.003) (Figure 3D). This finding was similarly observed in the testing and all patient sets (p = 0.031 and p < 0.001, respectively) (Figure 3E-F). Additionally, for DSS, patients in the low-risk group in the training set had a higher survival rate than those in the high-risk group (p = 0.002) (Figure 3G). This association remained consistent in both the testing and all patient sets (p = 0.013 and p < 0.001, respectively) (Figure 3H-I). These survival analyses highlight the prognostic significance of the risk model, providing valuable insights into their associations with OS, DSS, and PFS in HNSC patients.

Risk curve, survival status, heatmap and independent prognostic value of the risk model

The mortality rate of HNSC patients increased as the risk scores escalated in the training set (Figure 4A-D). This trend was consistently observed in both the testing (Figure 4B, 4E) and all patient sets (Figure 4C, 4F). Expression analysis revealed that LINC02434, AC245041.2 and LINC02762 were more highly expressed in the high-risk group compared to the low-risk group in the training, testing, and all patient sets (Figure 4G-I), suggesting that these DRLs could serve as poor prognostic predictors. By performing both univariate and multivariate Cox regression analyses, we determined that the risk score, calculated based on the risk model, served as an independent prognostic factor for HNSC patients in the training set. The univariate analysis demonstrated a high risk (HR) of 1.784 with a p-value less than 0.001 (Figure 5A), while the multivariate analysis showed an HR of 1.696 with a p-value less than 0.001 (Figure 5D). These findings were further validated in the testing set, with the univariate analysis yielding an HR of 1.549 and a p-value of 0.003 (Figure 5B), and the multivariate analysis resulting in an HR of 1.643 with a p-value of 0.001 (Figure 5E). Similar results were observed in the analysis of the all patients set, where the univariate analysis showed an HR of 1.670 with a p-value less than 0.001 (Figure 5C), and the multivariate analysis revealed an HR of 1.643 with a p-value less than 0.001 (Figure 5F). These results demonstrate that the risk score, derived from the risk model based on the identified DRLs, represents an independent and reliable prognostic factor for HNSC patients. This highlights the potential clinical utility of the risk score in predicting patient outcomes and guiding treatment decisions.

Diagnosis and prognosis of the risk model

To assess the diagnostic value of the risk model, ROC curve analysis was employed. In the training set, the Area Under Curve (AUC) for 1-, 3-, and 5-year survival was 0.636, 0.680, and 0.666, respectively (Figure 6A). While these values did not surpass 0.7, they demonstrated superior performance compared to individual clinical characteristics such as age, gender, grade, and clinical stage in predicting survival at 1, 3, and 5 years, respectively (Figure 6B-D). Similar results were observed in both the testing set (AUC for 1-, 3-, and 5-year was 0.683, 0.683, and 0.619) and the all patients set (AUC for 1-, 3-, and 5-year was 0.660, 0.680, and 0.627) (Figure 6E-L). Thus, the diagnostic value of the risk model surpassed that of individual clinical traits.

Moreover, the C-index, a measure of discrimination, demonstrated higher scores for the risk model compared to other clinical variables in both the training, testing, and all patient sets (Figure S2A-C). This indicates that the risk score derived from the model could serve as a robust prognostic factor, particularly for long-term clinical outcomes in HNSC patients. For practical application, a nomogram was developed, integrating clinical variables and risk scores to predict probabilities of 1-, 3-, and 5-year OS for individual patients (Figure S2D). The nomogram's predictions were validated through calibration plots, which demonstrated good agreement between the predicted and observed clinical outcomes (Figure S2E). Furthermore, survival probabilities and clinical features of HNSC patients across different variables, including age, gender, grade, T stage, N stage, M stage, and clinical stage, were compared. The results consistently showed that high-risk patients had shorter OS compared to low-risk patients across most clinical variables (Figure S3). These findings highlight the applicability of our risk model across different clinical scenarios. In conclusion, our risk model demonstrates both diagnostic and prognostic value in HNSC patients. It outperforms individual clinical traits in predicting survival outcomes, as evidenced by the AUC values, C-index scores, and nomogram predictions.

Samples of HNSC were divided into three tumor clusters

To employ a consensus clustering algorithm based on the expression of the three DRLs, we identified three distinct molecular subtypes within the cohort of 519 HNSC samples (Figure 7A) (Supplementary Table S8). Subsequent survival analysis revealed that patients in cluster 2 exhibited a significantly higher survival rate compared to those in clusters 1 and 3 (p < 0.001) (Figure 7B). This observation can be attributed to the grouping of patients, as most of cluster 2 belonged to the low-risk group, which is associated with a more favorable prognosis (Figure 7C).

To further assess the molecular differences between the identified tumor clusters, we employed PCA and T-distributed stochastic neighbor embedding (tSNE) visualization techniques. Both PCA and tSNE plots demonstrated clear distinctions among the three tumor clusters, indicating robust separability based on the identified molecular subtypes (Figure 7D-E). In addition, we conducted difference analysis among the clinical features of clusters 1, 2 and 3, displaying the proportion of their respective clinical features. We found a statistical difference in N stage (p = 0.021) (Figure 7F). These findings highlight the presence of distinct molecular subtypes within HNSC, contributing to our understanding of the heterogeneity of the disease and offering insights for personalized treatment approaches.

PCA, TMB and TIDE analysis

The PCA method was employed to assess the efficiency of the constructed signature in determining the risk status of patients. The analysis revealed that the risk status could be efficiently determined using the signature derived from the identified DRLs (Figure S5D). However, analyzing only the whole genome expression data (Figure S5A), the disulfidptosis-related genes (Figure S5B) or the DRLs alone (Figure S5C) did not provide an efficient discrimination of the risk status values.

To scrutinize the alterations in somatic mutations between high- and low-risk groups, we retrieved somatic mutation data from the TCGA database. The resulting waterfall plots depict the top 15 mutation genes for both high- and low-risk groups (Figure 8A and Figure 8B). According to the survival analysis, patients in the low-TMB group exhibited a prolonged survival period compared to those in the high-TMB group (p = 0.006) (Figure 8C). Moreover, subgroup survival analysis revealed that patients with both low-TMB and low-risk had the highest survival probability compared to other subgroups (p < 0.001) (Figure 8D). Simultaneously, the low-risk group demonstrated a lower TIDE score than the high-risk group (p < 0.05), suggesting a potentially more effective response to immunotherapy due to a reduced likelihood of immune escape (Figure 8E).

Screening potential drugs for HNSC based on the two risk groups and the three tumor clusters

Employing the "oncoPredict" package, we delved into identifying potentially effective chemotherapeutic agents for HNSC, considering both the two risk groups and the three distinct tumor clusters, which proved meaningful in both contexts. The results, systematically organized in alphabetical order, are presented in (Figure S4A-P). Significantly, Alpelisib, Axitinib, AZD1208, Doramapimod, GSK269962A, JQ1, Sinularin, and Tozasertib demonstrated higher sensitivity (lowest IC50) in low-risk groups (or the highest sensitivity in cluster 2). This implies that patients in low-risk groups (or cluster 2) with HNSC are more likely to experience therapeutic benefits from these specific drugs (All p < 0.001). Further details on the potential drugs can be found in the supplementary materials. These findings illuminate potential treatment options tailored to specific molecular subtypes, paving the way for personalized therapeutic interventions in HNSC patients.

In this study, we conducted a comprehensive analysis of disulfidptosis patterns in HNSC cell lines using high throughput sequencing and bioinformatics analysis. In total, differential expression of 3,428 lncRNAs and 5,132 mRNAs were detected. These DE RNAs were distinctively classified by the different groups in hierarchical clusterin, suggesting a potential use for these lncRNAs in distinguishing HNSC from healthy controls. GO and KEGG pathway analysis based on the differentially expressed mRNAs indicated that several pathways may play important roles in HNSC pathogenesis, such as cell-cell adhesion, calcium ion binding, MAPK signaling pathway, PI3K-AKT pathway, and IL-17 signaling pathway. This functional annotation provides bioinformatics-based evidence regarding the potential underlying mechanism driving HNSC tumorigenesis and offers new insight into the molecular mechanism of lncRNAs in HNSC.

Prevailing notions suggest a correlation between disulfidptosis and disulfide stress, primarily occurring between intracellular and extracellular protein molecules of the cytoskeleton²³. This phenomenon triggers conformational changes and disrupts normal protein function, ultimately resulting in cell death, such as disulfidptosis²⁴. Notably, our study revealed intriguing insights into the enriched pathways of cytokine-cytokine receptor interaction and cell adhesion molecules, which are likely involved in the regulation of cell death in HNSC and, consequently, the overall cancer development^28,37. However, the precise mechanisms by which these proteins related to disulfide bond formation induce cell death, as well as the potential therapeutic targeting of this pathway through cytoskeleton disruption in HNSC cells, warrant further investigation. Unraveling these aspects will deepen our understanding of the complex interplay between disulfidptosis, cell death pathways, and the development of effective therapeutic strategies for HNSC.

Currently, the regulatory mechanisms of disulfidptosis remain insufficiently understood, particularly in the field of lncRNAs²⁷. We identified two distinct DRLs, namely LINC02434 and AC245041.2, which may exhibit significant roles in the development of HNSC using lncRNA sequencing and bioinformatics analysis. We performed qPCR experiments to verify the significant abnormal expression of selected DRLs in HNSC cell lines. To delve into more specific details, all of them were found to be associated with poor prognosis of HNSC, indicating the potential involvement of disulfidptosis in HNSC. The upregulation of LINC02434 gene has been proposed in HNSC cell lines, suggesting its potential utility as a valuable tumor tissue marker for diagnosing HNSC patients³⁸. LncRNA AC245041.2 and mRNA LAMA3 were identified to exhibite a strong expression correlation in pancreatic adenocarcinoma, reinforcing its potential role in tumor progression³⁹. Additionally, the upregulation of the lncRNA AC245041.2 has been reported as a novel ferroptosis-related lncRNA signature for prognosis prediction in gastric cancer, providing a promising novel strategy for cancer treatment⁴⁰. Importantly, the risk scoring and subgroup analysis of these identified lncRNAs further support their potential as prognostic factors in HNSC patients, adding to the growing body of evidence in this field. The precise mechanisms by which the identified DRLs regulate tumor development and the immune processes in HNSC remain speculative and require further validation.

In addition, our findings confirmed the validity of subtypes on the basis of disulfidptosis-related genes and construction of a prognostic risk model for HNSC patients. Furthermore, we identified potential drugs for different patient groups by analyzing transcriptome profiles, clinical data, and gene mutation data. While the role of disulfide stress in tumors has limited evidence, clues related to disulfide stress strongly suggest its significance and clinical value in predicting prognosis and offering therapeutic strategies for HNSC.

The crucial role of immune cells within the tumor microenvironment (TME) in the development of various tumors has been widely recognized⁴¹. Current research in HNSC immunotherapies has revealed that targeting specific immune checkpoints could rescue T cell responses and establish promising treatment strategies^42-44. Here, we employed a consensus clustering algorithm based on the expression of distinct DRLs and identified three molecular subtypes within HNSC samples. Of note, cluster 3, representing the low-risk group, exhibited the highest expression levels of immune checkpoint genes and displayed more favorable clinical outcomes. Furthermore, our risk signature demonstrated a strong predictive value for OS and could serve as an independent prognostic indicator in HNSC due to its potential to mitigate immune escape mechanisms⁴⁵. Thus, our findings underscore the associations between DRL subtypes and changes in the immunological tumor microenvironment in HNSC, highlighting their significance in tailoring treatment strategies. However, the current treatment strategies for HNSCs remain inadequate and often result in drug resistance. To address this challenge, the development and identification of novel therapeutic strategies, as well as specific molecular or cellular markers, are essential for improving treatment outcomes and predicting the survival of HNSC patients⁴⁶. In this context, our study provides valuable insights into more targeted treatment strategies for HNSCs, considering the distinct molecular subtypes identified in our analysis.

Although our study provides valuable insights into the potential roles of these DRLs, additional experimental and clinical investigations are warranted to elucidate their target mRNAs and confirm their functional significance in HNSC. Addressing these limitations through future studies with diverse clinical cohorts and functional experiments will enhance the robustness and applicability of our findings, providing a more comprehensive understanding of the involvement of DRLs in HNSC progression.

In aggregate, our research delved into the molecular variances of HNSC, shedding light on important insights for this complex disease. We successfully constructed a novel and robust disulfidptosis-associated signature, offering a valuable tool for evaluating prognosis and suggesting tailored therapeutic strategies for diverse groups of HNSC patients. However, it is crucial to note that to obtain more compelling and conclusive results, further prospective studies and basic research are necessary. Future investigations should strive for high-quality data, large sample sizes, and sufficient follow-up periods to strengthen the reliability and generalizability of the results. By addressing these requirements, we can enhance the reliability and generalizability of our findings, ultimately advancing our understanding of HNSC and facilitating the development of more effective interventions for patients.

RNA library preparation, sequencing, and data processing

Total RNA extraction utilized the RNAeasy™ Animal RNA Isolation Kit (Beyotime, R0026). Sequencing libraries were created with the RNA Library Prep Kit following the manufacturer's guidelines, incorporating index codes for sample attribution. Briefly, regulatory ncRNA and mRNA were purified from total RNA by utilizing probes to eliminate rRNA. Fragmentation was conducted using divalent cations at an elevated temperature in the First Strand Synthesis Reaction Buffer (5X). For the first strand cDNA synthesis, a random hexamer primer and M-MuLV Reverse Transcriptase (RNaseH) were employed. Subsequently, second strand cDNA synthesis was achieved using DNA Polymerase I and RNase H. Exonuclease/polymerase activities were employed to convert remaining overhangs into blunt ends. After adenylation of DNA fragments' 3’ ends, NEBNext Adaptor with a hairpin loop structure was ligated for hybridization preparation. To select cDNA fragments of optimal length (370-420 bp), the library fragments underwent purification with the AMPure XP system. A 3 µL USER Enzyme was applied to size-selected, adaptor-ligated cDNA at 37°C for 15 min followed by 5 min at 95 °C before PCR. PCR was carried out with Phusion High-Fidelity DNA polymerase, Universal PCR primers, and Index Primer. Finally, PCR products underwent purification, and library quality was assessed on the Agilent 5400 system, with quantification by QPCR (1.5 nM). Qualified libraries were pooled and sequenced on Illumina platforms with PE150 strategy in Novogene Bioinformatics Technology Co., Ltd (Beijing, China), based on effective library concentration and required data amount. The original fluorescence image files from the Illumina platform were transformed into short reads (Raw data) through base calling, recorded in FASTQ format containing sequence information and corresponding sequencing quality information. Sequence artifacts, such as reads with adapter contamination, low-quality nucleotides, and unrecognizable nucleotides (N), pose a barrier for subsequent reliable bioinformatics analysis⁴⁷. Therefore, quality control is imperative and was implemented using Fastp (version 0.23.1) for basic statistics on the raw reads' quality. The data processing steps included discarding paired reads with adapter contamination, more than 10% uncertain bases in either read, or a proportion of low-quality bases (Phred quality <5) exceeding 50% in either read.

Cell culture and qRT-PCR

Human HNSC cell line (FaDu, CL-0083) and nasopharyngeal epithelial cell line (NP69SV40T, CL-0804) were sourced from Procell Life Science&Technology Co., Ltd (Wuhan, China). FaDu and NP69 cells were cultured in specific cell media (CM-0804 and CM-0083) from Procell Life Science&Technology Co., Ltd (Wuhan, China) at 37 °C and 5% CO2. Total RNA extraction was carried out using the RNAeasy™ Animal RNA Isolation Kit (Beyotime, R0026). For cDNA synthesis, equal amounts of RNA were subjected to the PrimeScript RT Reagent Kit (Takara, RR037A). Real-time PCR was conducted in a total volume of 25 µl using TB Green Premix Ex Taq II (Tli RNaseH Plus) (Takara, RR820A) and detected by the LightCycler® 96 System (Roche). Primer sequences for target genes are provided in Supplemental Table S1, with GAPDH serving as an internal standard. Relative transcript levels of target genes were calculated using the2^-ΔΔCT method.

Data Acquisition

We retrieved RNA sequencing data, clinical data, and gene mutation data of HNSC from The Cancer Genome Atlas (TCGA), accessed on 17 May 2023. The dataset encompassed a total of 522 HNSC tumors and 44 normal tissues. Utilizing the Perl programming language (version Strawberry-perl-5.30.0; https://www.perl.org accessed on 17 May 2023), we extracted and organized the RNA-seq data, clinical data, and gene mutation data. To ensure the robustness of our findings, we categorized 519 patients (excluding those without transcript data) into two cohorts: a training cohort (n = 260) and a testing cohort (n = 259) in a 1:1 ratio. The classification process was executed randomly to minimize potential bias during the signature identification evaluation (refer to Table 1). This approach aimed to conduct a comprehensive assessment and validation of our proposed signature, ensuring its reliability and generalizability across the HNSC patient population.

Acquisition of DRLs

In the existing literature, a total of eighteen genes associated with disulfidptosis have been documented^23,24. We compiled these genes and categorized them as disulfidptosis-related genes. An in-depth analysis was conducted to assess the expression levels and co-expression patterns of both these genes and lncRNAs. The "Limma" package was employed for the analysis, considering correlations with |correlation coefficients| greater than 0.1 and p-values below 0.05. Further identification of modular genes in tumor and normal tissues was performed using the weighted gene co-expression network analysis (WGCNA) method with the "WGCNA" package, with a soft power of 4. Univariate Cox regression analysis was then applied to identify DRLs with prognostic potential, setting a threshold of p-values below 0.05. To visually represent the relationship between disulfidptosis genes and DRLs, a sankey diagram was generated using the "ggplot2" and "ggalluvial" packages, providing a comprehensive and intuitive illustration of the connections between disulfidptosis genes and the identified DRLs.

Construction of risk model and validation

Data overfitting for the case of the training cohort was eliminated using the Least Absolute Shrinkage and Selection Operator (LASSO) regression. The multivariate Cox analysis algorithm was used to streamline the amount and calculate the coefficients corresponding to the DRLs. Three prognostic DRLs (AC245041.2, LINC02762 and LINC02434) were selected for the construction of the risk model. The coefficients were determined using the Cox pro-portional hazard regression analysis algorithm, and the coefficients were used for calculating the risk score using the following formula:

Risk score =coefi × disulfidptosis − relatedlncRNAexpression.

The “survival” and “survminer” packages were used for Kaplan–Meier curves to validate the risk model between high and low risk groups based on training, testing and entire cohort (overall survival (OS), progression- free survival (PFS), disease- specific survival (DSS)). The Kaplan–Meier curves were also conducted to compare the OS values of clinicopathologic features (age, gender, grade, T and N stages, metastasis and clinical stage). The prognostic value of the risk model was analyzed using the Cox regression analysis. We used a nomogram to predict the 1- year, 3- year, and 5- year OS of HNSC patients. Performance of the prognostic model was further validated by the calibration plot, time- dependent receiver operating characteristic (ROC) curves and C- index. The performance of the risk model in determining the risk status was studied using the principal component analysis (PCA). Heatmaps were analyzed to determine the lncRNA levels in the risk groups.

Tumor Mutation Burden (TMB)

The mutated genes were identified using the “maftools” package, and the top 15 genes with maximum mutation frequencies were displayed in waterfall charts. TMB was determined by dividing the sum of mutations by the exome size⁴⁸. Visualization of difference analysis and correlation analysis was carried out using the “ggpubr,” “limma,” and “reshape2” packages. OS values for different subgroups were evaluated using the “survival” and “survminer” packages.

Tumor immune dysfunction and exclusion (TIDE) and chemotherapy

TIDE scoring files were downloaded from http://tide.dfci.harvard.edu, accessed on 24 May 2023, and difference analysis was conducted between high and low-risk groups. IC50 values of drugs for HNSC treatment in high-risk and low-risk sets were predicted using the “oncoPredict” package⁴⁹.

Tumor clusters

The “ConsensusClusterPlus” package was utilized to categorize tumor samples into three clusters. A sankey diagram depicted the relationship between three clusters and risk groups using the “ggalluvial” package. Survival analysis, identification of potential drugs, and T-distributed stochastic neighbor embedding (tSNE) were performed using the methods mentioned earlier, with the addition of the “Rtsne” package for tSNE.

Statistical analysis

All data analysis, including co-expression analysis, difference analysis, survival analysis, univariate and multivariate Cox regression analysis, correlation analysis, and LASSO regression analysis, was conducted using R (version 4.2.2). Statistical methods included the Chi-square test or the Wilcoxon signed-rank test. Relevant charts were generated using R, with statistical significance set at p < 0.05.

Data availability

RNA-sequencing data and clinical data of HNSC were downloaded from The Cancer Genome Atlas (TCGA) (https://portal.gdc.cancer.gov/). Further inquiries can be directed to the corresponding author.

Author contributions

Qi Chen and Xiao Shi designed, and conceived the project, analyzed the data, and wrote the manuscript. Yuanyuan Bao collected the information from the databases and organized the related data. Yue Chen designed the project, and contributed to data processing, made the charts and figures, and edited the manuscript.

Competing interests

The authors declare no competing interests.

Johnson, D. E. et al. Head and neck squamous cell carcinoma. Nature reviews. Disease primers6, 92, doi:10.1038/s41572-020-00224-3 (2020).
Sung, H. et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: a cancer journal for clinicians71, 209-249, doi:10.3322/caac.21660 (2021).
Isayeva, T., Li, Y., Maswahu, D. & Brandwein-Gensler, M. Human papillomavirus in non-oropharyngeal head and neck cancers: a systematic literature review. Head and neck pathology6 Suppl 1, S104-120, doi:10.1007/s12105-012-0368-1 (2012).
Wong, I. C., Ng, Y. K. & Lui, V. W. Cancers of the lung, head and neck on the rise: perspectives on the genotoxicity of air pollution. Chinese journal of cancer33, 476-480, doi:10.5732/cjc.014.10093 (2014).
Pai, S. I. & Westra, W. H. Molecular pathology of head and neck cancer: implications for diagnosis, prognosis, and treatment. Annual review of pathology4, 49-70, doi:10.1146/annurev.pathol.4.110807.092158 (2009).
Lui, V. W. et al. Frequent mutation of the PI3K pathway in head and neck cancer defines predictive biomarkers. Cancer discovery3, 761-769, doi:10.1158/2159-8290.Cd-13-0103 (2013).
Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature517, 576-582, doi:10.1038/nature14129 (2015).
Alsahafi, E. et al. Clinical update on head and neck cancer: molecular biology and ongoing challenges. Cell death & disease10, 540, doi:10.1038/s41419-019-1769-9 (2019).
Rocco, J. W. & Ellisen, L. W. p63 and p73: life and death in squamous cell carcinoma. Cell cycle (Georgetown, Tex.)5, 936-940, doi:10.4161/cc.5.9.2716 (2006).
Lu, H. et al. TNF-α promotes c-REL/ΔNp63α interaction and TAp73 dissociation from key genes that mediate growth arrest and apoptosis in head and neck cancer. Cancer research71, 6867-6877, doi:10.1158/0008-5472.Can-11-2460 (2011).
Zhang, Z., Filho, M. S. & Nör, J. E. The biology of head and neck cancer stem cells. Oral oncology48, 1-9, doi:10.1016/j.oraloncology.2011.10.004 (2012).
Virós, D. et al. Prognostic role of MMP-9 expression in head and neck carcinoma patients treated with radiotherapy or chemoradiotherapy. Oral oncology49, 322-325, doi:10.1016/j.oraloncology.2012.10.005 (2013).
Iyer, N. G. et al. Randomized trial comparing surgery and adjuvant radiotherapy versus concurrent chemoradiotherapy in patients with advanced, nonmetastatic squamous cell carcinoma of the head and neck: 10-year update and subset analysis. Cancer121, 1599-1607, doi:10.1002/cncr.29251 (2015).
Elbers, J. B. W. et al. Immuno-radiotherapy with cetuximab and avelumab for advanced stage head and neck squamous cell carcinoma: Results from a phase-I trial. Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology142, 79-84, doi:10.1016/j.radonc.2019.08.007 (2020).
Vos, J. L. et al. Neoadjuvant immunotherapy with nivolumab and ipilimumab induces major pathological responses in patients with head and neck squamous cell carcinoma. Nature communications12, 7348, doi:10.1038/s41467-021-26472-9 (2021).
Hsieh, Y. T., Chen, Y. F., Lin, S. C., Chang, K. W. & Li, W. C. Targeting Cellular Metabolism Modulates Head and Neck Oncogenesis. International journal of molecular sciences20, doi:10.3390/ijms20163960 (2019).
Martínez-Reyes, I. & Chandel, N. S. Cancer metabolism: looking forward. Nature reviews. Cancer21, 669-680, doi:10.1038/s41568-021-00378-6 (2021).
Stine, Z. E., Schug, Z. T., Salvino, J. M. & Dang, C. V. Targeting cancer metabolism in the era of precision oncology. Nature reviews. Drug discovery21, 141-162, doi:10.1038/s41573-021-00339-6 (2022).
Wabnitz, G. H. et al. Mitochondrial translocation of oxidized cofilin induces caspase-independent necrotic-like programmed cell death of T cells. Cell death & disease1, e58, doi:10.1038/cddis.2010.36 (2010).
Carneiro, B. A. & El-Deiry, W. S. Targeting apoptosis in cancer therapy. Nature reviews. Clinical oncology17, 395-417, doi:10.1038/s41571-020-0341-y (2020).
Zheng, T., Liu, Q., Xing, F., Zeng, C. & Wang, W. Disulfidptosis: a new form of programmed cell death. Journal of experimental & clinical cancer research : CR42, 137, doi:10.1186/s13046-023-02712-2 (2023).
Tang, D., Kang, R., Berghe, T. V., Vandenabeele, P. & Kroemer, G. The molecular machinery of regulated cell death. Cell research29, 347-364, doi:10.1038/s41422-019-0164-5 (2019).
Liu, X. et al. Actin cytoskeleton vulnerability to disulfide stress mediates disulfidptosis. Nature cell biology25, 404-414, doi:10.1038/s41556-023-01091-2 (2023).
Zheng, P., Zhou, C., Ding, Y. & Duan, S. Disulfidptosis: a new target for metabolic cancer therapy. Journal of experimental & clinical cancer research : CR42, 103, doi:10.1186/s13046-023-02675-4 (2023).
Zhao, S. et al. Crosstalk of disulfidptosis-related subtypes, establishment of a prognostic signature and immune infiltration characteristics in bladder cancer based on a machine learning survival framework. Frontiers in endocrinology14, 1180404, doi:10.3389/fendo.2023.1180404 (2023).
Qi, C., Ma, J., Sun, J., Wu, X. & Ding, J. The role of molecular subtypes and immune infiltration characteristics based on disulfidptosis-associated genes in lung adenocarcinoma. Aging15, 5075-5095, doi:10.18632/aging.204782 (2023).
Wang, T. et al. Disulfidptosis classification of hepatocellular carcinoma reveals correlation with clinical prognosis and immune profile. International immunopharmacology120, 110368, doi:10.1016/j.intimp.2023.110368 (2023).
Feng, Z. et al. Identification a unique disulfidptosis classification regarding prognosis and immune landscapes in thyroid carcinoma and providing therapeutic strategies. Journal of cancer research and clinical oncology, doi:10.1007/s00432-023-05006-4 (2023).
Meng, Y., Chen, X. & Deng, G. Disulfidptosis: a new form of regulated cell death for cancer treatment. Molecular biomedicine4, 18, doi:10.1186/s43556-023-00132-4 (2023).
Mercer, T. R., Dinger, M. E. & Mattick, J. S. Long non-coding RNAs: insights into functions. Nature reviews. Genetics10, 155-159, doi:10.1038/nrg2521 (2009).
Chen, L., Zhu, Q. H. & Kaufmann, K. Long non-coding RNAs in plants: emerging modulators of gene activity in development and stress responses. Planta252, 92, doi:10.1007/s00425-020-03480-5 (2020).
Wierzbicki, A. T., Blevins, T. & Swiezewski, S. Long Noncoding RNAs in Plants. Annual review of plant biology72, 245-271, doi:10.1146/annurev-arplant-093020-035446 (2021).
Li, W. et al. Disulfidptosis-related lncRNA signatures predict prognosis and immune relevance of lung adenocarcinoma. doi:10.21203/rs.3.rs-3083164/v1 (2023).
Xue, W. et al. Disulfidptosis-associated Long Non-Coding RNA signature predicts the prognosis, tumor microenvironment, and immunotherapy and chemotherapy options in colon adenocarcinoma. doi: https://doi.org/10.21203/rs.3.rs-2903764/v1 (2023).
Shen, C. et al. Identification and validation of fatty acid metabolism-related lncRNA signatures as a novel prognostic model for clear cell renal cell carcinoma. Scientific reports13, 7043, doi:10.1038/s41598-023-34027-9 (2023).
Mattick, J. S. et al. Long non-coding RNAs: definitions, functions, challenges and recommendations. Nature reviews. Molecular cell biology24, 430-447, doi:10.1038/s41580-022-00566-8 (2023).
Liberzon, A. et al. The Molecular Signatures Database (MSigDB) hallmark gene set collection. Cell systems1, 417-425, doi:10.1016/j.cels.2015.12.004 (2015).
Jiang, H. et al. A Novel Three-lncRNA Signature Predicts the Overall Survival of HNSCC Patients. Annals of surgical oncology28, 3396-3406, doi:10.1245/s10434-020-09210-1 (2021).
Tian, C., Li, X. & Ge, C. High expression of LAMA3/AC245041.2 gene pair associated with KRAS mutation and poor survival in pancreatic adenocarcinoma: a comprehensive TCGA analysis. Molecular medicine (Cambridge, Mass.)27, 62, doi:10.1186/s10020-021-00322-2 (2021).
Wei, J., Zeng, Y., Gao, X. & Liu, T. A novel ferroptosis-related lncRNA signature for prognosis prediction in gastric cancer. BMC cancer21, 1221, doi:10.1186/s12885-021-08975-2 (2021).
Wei, X. et al. Analysis of the role of the interleukins in colon cancer. Biological research53, 20, doi:10.1186/s40659-020-00287-2 (2020).
Chen, L. & Han, X. Anti-PD-1/PD-L1 therapy of human cancer: past, present, and future. The Journal of clinical investigation125, 3384-3391, doi:10.1172/jci80011 (2015).
Chen, S. M. Y. et al. Tumor immune microenvironment in head and neck cancers. Molecular carcinogenesis59, 766-774, doi:10.1002/mc.23162 (2020).
Zou, W., Wolchok, J. D. & Chen, L. PD-L1 (B7-H1) and PD-1 pathway blockade for cancer therapy: Mechanisms, response biomarkers, and combinations. Science translational medicine8, 328rv324, doi:10.1126/scitranslmed.aad7118 (2016).
Wang, X. et al. An Immunogenic Cell Death-Related Classification Predicts Prognosis and Response to Immunotherapy in Head and Neck Squamous Cell Carcinoma. Frontiers in immunology12, 781466, doi:10.3389/fimmu.2021.781466 (2021).
Martin, D. et al. The head and neck cancer cell oncogenome: a platform for the development of precision molecular therapies. Oncotarget5, 8906-8923, doi:10.18632/oncotarget.2417 (2014).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics (Oxford, England)34, i884-i890, doi:10.1093/bioinformatics/bty560 (2018).
Chan, T. A. et al. Development of tumor mutation burden as an immunotherapy biomarker: utility for the oncology clinic. Annals of oncology : official journal of the European Society for Medical Oncology30, 44-56, doi:10.1093/annonc/mdy495 (2019).
Maeser, D., Gruener, R. F. & Huang, R. S. oncoPredict: an R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Briefings in bioinformatics22, doi:10.1093/bib/bbab260 (2021).

Table 1 is available in the Supplementary Files section.

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Identification of disulfidptosis-related lncRNA signature using RNA-Sequencing and Bioinformatics Analysis in Head and Neck Squamous Cell Carcinoma

Status:

Version 1

Abstract

Figures

Introduction

Results

Discussion

Conclusion

Materials And Methods

Declarations

References

Tables

Additional Declarations

Status:

Version 1