Prognostic value of anoikis-related genes revealed using multi-omics analysis and machine learning based on lower-grade glioma features and tumour immune microenvironment

doi:10.21203/rs.3.rs-2370831/v1

Download PDF

Research Article

Prognostic value of anoikis-related genes revealed using multi-omics analysis and machine learning based on lower-grade glioma features and tumour immune microenvironment

https://doi.org/10.21203/rs.3.rs-2370831/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background: Lower-grade glioma (LGG) is a prevalent glial cell-derived brain tumor that is aggressive and infiltrative. Anoikis, a new and distinct form of cell death, is a catch-all phrase describing cells losing their ability to adhere to the extracellular matrix (ECM) and nearby cells, followed by the inducing of apoptosis. However, what role the mechanisms associated with anoikis play in LGG have not been thoroughly discovered.

Methods: The Cancer Genome Atlas (TCGA), Gene Expression Omnibus (GEO), and Chinese Glioma Genome Atlas (CGGA) are three large databases that provide sequencing information for LGG patients, as well as the corresponding clinical data, were included in this study as the training set and multi-group validation set for the data. Application of ConsensusClusterPlus Consensus Clustering for molecular subtype classification of LGG patients based on anoikis-related genes (ARGs)with prognostic value. Subsequently, we screened genes significantly associated with patient prognosis using different machine learning algorithms. Risk profiles are constructed and assessed based on these screened genes.

Results: Patients with LGG were classified into two distinct molecular subtypes based on a clustering approach, each characterized by their prognosis, clinical features, and tumor microenvironment. A 6-ARG prognostic signal (EGFR, SIX1, SP1, ANGPTL2, PDCD4, and BMP2) was subsequently constructed, and the signature genes showed good predictive performance not only in the training set but also in multiple validation sets. Additionally, we go into great depth about how high-risk and low-risk groups differ from one another in terms of attributes, including immune characteristics, tumor mutation characteristics, and drug sensitivity showing significant differences in the risk subgroups. Finally, this risk score is combined with multiple LGG clinicopathological features to create an at-a-glance nomogram for quantitatively predicting the probability of clinical survival in individuals with LGG, and the AUC values and decision curve analysis (DCA) of this nomogram suggest that the model can benefit patients from clinical treatment strategies.

Conclusion: Overall, ARG signs can be used as a valid indicator of prognosis prediction and response to immunotherapy in patients with LGG.

anoikis

lower-grade glioma

prognostic signature

immunotherapy

tumor microenvironment

Gliomas originate from glial cells and are the brain's most common primary malignant tumor, accounting for nearly 80% of malignant brain tumors, with an age-adjusted death rate of 4.19 cases per 100,000 per year[1]. Lower-grade gliomas (LGG) are a subgroup of all gliomas, including WHO grade II and III astrocytomas, oligodendrogliomas, and oligodendrogliomas, which make up 15% of gliomas and 5% of primary brain tumors, respectively[2, 3]. LGG is relatively slow growing, has a relatively long course, and a relatively good prognosis, with survival times of over ten years[4, 5]. However, LGG usually recurs after initial treatment. After tumor recurrence, LGG can transform into a malignant higher-grade glioma with a median overall survival (OS) of only 2.4 years after malignant transformation[6]. Currently, LGG treatment is based on maximum safety surgical resection, followed by radiotherapy, chemotherapy, or a combination of treatments, individualized according to the surgery outcome, tumor location, histological diagnosis, molecular pathological features, and individual patient characteristics[7, 8]. But its highly aggressive and infiltrative nature makes the various treatments for LGG less effective[9, 10]. It is therefore critical to understand the underlying mechanisms of LGG tumor infiltration and invasion, and to identify reliable multi-biomarkers to accurately forecast the outlook of LGG patients.

Infiltration of gliomas is associated with abnormal changes in the adhesion junctions of brain glial cells[11]. This view links gliomas to anoikis. Anoikis is essentially an apoptotic event, and It is a specific presence of programmed cell death caused by loss of cell adhesion to the extracellular matrix (ECM) or improper cell adhesion[12, 13], and they have a crucial role in the growth of the organism, the maintenance of healthy tissue, the progression of disease, and the spread of tumors. In other words, the ECM and other supporting cells make create the milieu in which normal cells can live. This stereo scaffold supplies the necessary biochemical and mechanical signals for the growth, differentiation, and maintenance of other normal physiological processes of the attached cells [14]. Whereas this specific form of programmed apoptosis results from the disruption of ECM-mediated loss of cell adhesion during cell shedding or microenvironmental changes, its activation is used to remove isolated cells[15]. At this time, Anoikis regulates cell death primarily through two pathways, one using mitochondrial events induced by cellular stress (intrinsic pathway) and the other mediated by tumor necrosis factor (TNF) and first apoptotic signal (Fas) - ligands (extrinsic pathway)[16–18]. Although earlier research has consistently shown that anoikis is inevitably connected to the growth of many cancers[19–21], prognostic indicators based on the association of anoikis have rarely been analyzed in LGG. Therefore, we have done studies in this area.

We first used the combined TCGA and GTEx datasets in this study for variant analysis. We examined the differential expression of anoikis-related genes (ARGs) in LGG samples using weighted gene co-expression network analysis (WGCNA). Then we explained the differential expression of ARGs in LGG samples, gene interactions, somatic mutation incidence, genetic loci, and CNV. Subsequently, on the one hand, we identified two well-defined clusters based on ARGs expression. We analyzed the differences between these clusters regarding clinical features, tumors, and immune microenvironments.On the other hand, we used machine learning (including Lasso regression and Random Forest (RF) algorithms and multi-factor COX regression to screen for anoikis-associated solid genes and construct a risk score model and validate its stability in internal and external validation groups consisting of multiple cohorts such as TCGA, CGGA, and GEO. Under this risk score, potential associations between risk subtypes and clinical features, TME, immune microenvironment, and somatic mutations are comprehensively assessed. The value of doing so is that our comprehensive analysis not only demonstrates that the anoikis correlation set can accurately assess the prognosis of LGG patients but also attempts to explain the mechanism of the function of anoikis in the onset and progression of LGG disease. We have also used this risk subtype as a basis for our efforts to find chemotherapeutic agents that can improve the prognosis of LGG patients. Of course, we have created an at-a-glance nomogram to assist better clinicians and LGG patients in assessing and predicting prognosis. Finally, we have also analyzed the ARG signal in our study by immunohistochemistry and single-cell analysis. Our results have repeatedly confirmed the efficacy of this ARG signal as a new and effective marker in the diagnosis, treatment, and assessment of prognosis in LGG patients.

2.1 Inclusion of patients

Gene expression RNA-seq data for LGG tissues and normal tissues that we need to be downloaded from the "TCGA TARGET GTEx" cohort compiled by Toil Recalculation on the UCSC Xena website (https://xena.ucsc.edu/)[22]. We downloaded clinical data for LGG patients from the TCGA database (https://portal.gdc.cancer.gov). 105 normal samples from the cerebral cortex and 497 LGG samples were kept for examination after samples with no survival data, tumor grade, and non-primary LGG were eliminated. In addition, LGG samples from another three RNA-seq cohorts from the CGGA and GEO databases were screened for external validation in this study, these including CGGA325 (172 cases), CGGA693 (420 cases), and GSE16011 (106 cases) respectively.

2.2 Anoikis-Relevant Gene Selection

We searched the Genecards (https://www.genecards.org/) website for related genes using the keyword anoikis. The GeneCards Inferred Function Score (GIFtS) was used as the gene-related score for this website[23], and we selected genes with GIFtS > 30 to continue the study and included a total of 739 ARGs.

2.3 Detection of Differentially Expressed Genes (DEGs) and LGG-Related Genes

We performed differential analysis to identify differential genes between the normal tissue group and the LGG tumor group using the R package limma. To be more precise, we split the various gene expression patterns into two groups based on fold change (FC), and then we computed the mean expression of the genes to get t-values. Based on t-values, P-values were generated and then modified using the false discovery rate (FDR) approach. We filtered the DEGs at ∣logFC ∣>1 and adjusted for p < 0.05. we obtained the final significance of the difference in the expression of each gene. In addition, the Visualisation of DEGs by mapping volcanoes. WGCNA[24] is a system biology approach to describe patterns of gene association between different samples and is often used to identify highly synergistic genomes. In order to identify gene modules associated with LGG in DEGs, we applied the WGCNA package based on R software to identify highly synergistic genes. We obtained a number of modules depicting the link between cancers and normal tissues using the correlation coefficients and accompanying p-values. We chose the top LGG-related modules based on the most significant association coefficients from all the modules obtained by variance analysis and WGCNA. We extracted information about the genes from these modules. We identified the intersections between ARGs and the best LGG-related modules through the online website (http://www.ehbio.com/test/venn/#/)[25] and we drew Venn diagrams to visualize the results of the intersections.

2.4 Integrated multivariate analysis of ARGs

Firstly, the R package pheatmap was used to visualize the expression levels of the 77 ARGs. We sought to identify potential interactions between these genes and subsequently used the R package STRINGdb to develop a protein-protein interaction network (PPI), with correlations between genes detected by the R package corrploy. Next, the somatic mutations of these genes in patients are described by the R package maftool[26]. And from each gene's copy number variation (CNV) status, chromosomal information and gain or loss status are obtained and visualized in the Circos diagram.

2.5 Different pattern recognition for anoikis based on unsupervised clustering

To reduce the dimensionality of the clustering analysis, an initial screening of genes of predictive value in the cohort based on univariate Cox regression analysis. For the cluster analysis, only prognosis-related genes strongly associated with anoikis were retained as dimensions and parameters.Unsupervised clustering was performed using the ConensusClusterPlus package[27], using the following parameter settings: the clustering hierarchy clustering algorithm was chosen as partitioning around medoid (PAM), using clusters of Pearson correlation distances. In addition, 1000 iterations were performed to ensure the stability of the classification.

2.6 Characterisation between molecular subtypes

For the clustering results, the Prognostic performance of the various subgroups was assessed by Kaplan-Meier survival curves. Fisher's test tested differences in the composition of clinical features between subtypes, and we used the heatmap package to combine clinical information from LGG. We used the heatmap package in conjunction with clinical information from LGG. The 22 immune cell infiltrates in each LGG sample were then examined using the CIBERSORT method, with simultaneous use of four immune cell infiltration algorithms, MCPCOUNTER, QUANTISEQ, EPIC, and TIMER, to further assess the immune cell composition and immune profile between subtypes of molecules. The R package estimation was also used to calculate and score these components of immunity, stroma, estimation, and tumor purity for each LGG sample. The “will-cox.test” function was then put into place to look into and display these component differences between subtypes.

2.7 Machine learning-based identification of risk signatures for optimal multi-gene combinations

First, the training cohort was created by randomly screening 70% of the total LGG samples from the TCGA cohort. The remaining 30% of the randomly selected LGG samples were designated to the Test1 cohort as internal validation. In addition, the Test2 queue is made up of all samples from the TCGA queue as another internal validation. At the same time, LGG samples from the CGGA-325, CGGA-693, and GSE-16011 datasets were assigned to the Test3, Test4, and Test5 cohorts as external validation.

The combined analysis of the two algorithms in the training cohort was used to select ARGs for the putative prognosis. The random forest (RF) algorithm was implemented by the R program randomForestSRC package with a feature tree number of 1000 and a random split number of 1. The R program glmnet package implemented the lasso with 10-fold cross-validation. We intersected the two gene lists to obtain the signature gene for our study. Based on the expression of the risk genes, a Cox proportional risk regression model was applied. The cox regression coefficients estimated by considering the expression and correlation of signature genes were used to calculate the risk score formula: ARG RiskScore = (Coef Gene1* Exp Gene1) + (Coef Gene2* Exp Gene2) + ....+ (Coef GeneN-1* Exp GeneN-1) + (Coef GeneN* Exp GeneN).

The median determined the thresholds for grouping LGG patients into high- or low-risk groups based on the estimated risk scores, and the risk score distribution for each sample was plotted. ROC analysis of the RiskScore prognostic grading was performed using the Time ROC package to analyze the classification efficiency of the 1-year, 3-year, and 5-year prognostic predictions. We compared OS for both sets of patients in the cohort using the Kaplan-Meier method and the log-rank test.

To evaluate the model's robustness, all samples in the Test1-5 cohort were divided into different risk subgroups for further study (same steps as before).

2.8 Characterisation between risk subtypes

We are curious about the relationship between ARGs' predictive and clinicopathological features, so the fisher test was applied to show differences in survival status, gender, tumor grade, IDH1 status, and MGMT promoter methylation status distribution between different risk populations. We used the R software ESTIMATE package to calculate Stromal Score, Immune Score, Estimate Score, and Tumour Purity to assess differences in the tumor microenvironment with the low- and high-risk populations. Five immune cell infiltration algorithms, CIBERSORT, TIMER, QUANTISEQ, EPIC, and MCPCOUNTER algorithm, were also compared to assess immune cell composition and characteristics between risk populations and investigated using the "wil-cox.test" algorithm. We investigated the differences in these five immune infiltration scores, 47 immune checkpoint genes[28], and HLA gene expressions between subtypes. The correlation between the CIBERSORT immune infiltration algorithm and the six gene expression and risk scores in the Anoikis correlation predictive model was then assessed using Spearman's rank correlation coefficients.

We then took the mutation annotation format from the TCGA database with the maftools R package to visualize differences in mutations in LGG patients between various risk groups. We also calculated the TMB score for each LGG sufferer in the entire TCGA cohort, using the "wil-cox.test" algorithm to investigate differences in TMB scores between subtypes. We also assessed the correlation between TMB and risk scores using Spearman's rank correlation coefficient.

2.9 Risk subtype gene set enrichment analysis and Gene set enrichment analysis (GSEA)

To determine the differentially expressed biofunctional phenotypes between the high-risk and low-risk groups, we performed DEGs enrichment analysis, and Gene set enrichment analysis (GSEA) between the two groups. For differential analysis, we utilized the R package limma, and we selected ∣logFC∣>1 and P < 0.05 as filtering criteria to obtain DEGs between the low- and high-risk groups. In addition, DEGs were visualized by plotting volcanoes using the ggplot package in the R software.

For the functional enrichment analysis of the DEGs gene set, we first used the R software org.Hs.eg.db package and the clusterProfiler package to map the DEGs to the background set using the GO annotation as a background. Next, the latest KEGG Pathway gene annotations were obtained from the KEGG rest API (https://www.kegg.jp/kegg/rest/keggapi.html), and gene enrichment results were obtained using the R software clusterProfiler package. Finally, for Gene set enrichment analysis (GSEA), we obtained the GSEA software (version 4.0) from the GSEA website (http://software.broadinstitute.org/gsea/index.jsp)[29], and downloaded the c2.cp.kegg.v7.4.symbols (http://www.gsea-msigdb.org/gsea/downloads.jsp)[30]. Based on the complete gene Expression profiles and we analyzed subtype groupings. This allowed a more detailed investigation of the functional biological pathways involved, and p-values of < 0.05 were considered statistically significant.

2.10 Chemotherapeutics Forecast and Molecular Docking

OncoPredict[31], an R package that comprehensively analyses drug response and drug response markers, was used to predict the 50% inhibitory concentration (IC50) values of LGG samples to various antitumor drugs in the Cancer Treatment Response Portal (CTRP) and then to explore the differences in sensitivity between the two groups of drugs based on Anoikis-related risk subtype, to explore the differences in drug sensitivity between the two groups and work towards finding relevant chemotherapeutic agents.

A connectivity map (CMap; https://clue.io) is a program that predicts potential drugs that may induce a biological state encoded by a specific gene expression profile. We imported DEGs screened by Anoikis-related risk subtypes into the CMap database to explore small molecule drugs that might treat LGG. A negative mean score indicated that the drug reversed the desired biological properties and was of potential therapeutic value (p < 0.05 for statistical significance). We then used the AutoDock Vina software to calculate the free binding energy between the candidate macromolecular protein and the molecule compound. Finally, we visualized the results of molecular docking using Pymol software.

2.11 Creation and verification of Nomogram

We collected clinical information on LGG patients in all of the above training cohorts and validation cohorts, including the age of patients, gender, tumor grade, IDH1 status, receipt of radiotherapy and receipt of chemotherapy information, and analyzed them together with an ARGs-based RiskScore by multivariate Cox regression analysis, resulting in the construction of nomograms containing polygenic features and other independent prognostic factors. Nomogram plots were displayed by the replot package, the timeROC R package performed ROC analysis, and the nomograms were calibrated at 1, 3, and 5 years. The calibration chart is used to assess the efficacy of the treatment accurately.

2.12 Immunohistochemical analysis of six Anoikis-related model genes

The Human Protein Atlas (HPA; https://www.proteinatlas.org) is a database containing immunohistochemical (IHC)-premised protein expression patterns from cell lines, normal and cancerous tissues. In the current investigation, we used this database to obtain IHC pictures of the protein expression of ARG model genes in clinical samples from LGG patients.

2.13 Tumor immuno-single cell analysis

A comprehensive study of the different datasets and cell types of TME heterogeneity was carried out using the TISCH database.

2.14 Statistics analysis

Using R software, statistical analyses and graphs were created. In order to compare differences between the two groups, the Wilcoxon test was used. For all statistical calculations, a P-value of 0.05 or below was regarded as statistically significant.

3.1 Variation Analysis and WGCNA of DEGs

Differential expression analysis revealed that we obtained 4311 DEGs, indicating that the amount of these genes that were up- and down-regulated compared to normal tissues was 2480 and 1831, respectively(Fig. 2A). The WGCNA analysis was based on 4311 DEGs performed. The minimum soft threshold for constructing a scale-free network is 10, so 10 can be chosen as the optimal soft threshold for subsequent analysis (Fig. 2B). We set minModuleSize = 30 and MEDissThres = 0.25 as clustering criteria and identified eight gene modules (Fig. 2C, D). After analysis, we displayed the blue module to have the highest association between normal and tumor tissue (r = ± 0:9, p = 2e − 216) (Fig. 2E).

The Venn diagram shows the intersection between the best relevant blue modules and ARGs for LGG (n = 77) for subsequent analysis (Fig. 2F).

3.2 Characteristics of the ARGs

We analyzed the expression patterns of these 77 ARGs. Most ARGs were highly expressed in LGG, but a few genes remained lowly expressed in LGG tissues, including SMARCB1, FYN, and ADGRG1 (Fig. 3A). In addition, we performed PPI network analysis using the STRING database to reveal further the linkage between these ARG-associated genes (Fig. 3B). To get a comprehensive picture of the relationship between ARGs in LGG, we conducted a regression analysis (Fig. 3C). The results showed strong correlations between BRMS1 and GSTP1 (r = 0.769, p < 0.001), NUDT1 and BRMS1 (r = 0.757, p < 0.001), and SLC39A6 and CSNK2A1 (r = 0.7, p < 0.001), and the findings suggest that the three pairs of genes are likely to have similar biological functions.

CNV alterations in ARGs were visible on chromosomes (Fig. 3D), and the survey showed a prevalence of CNV-associated mutations. Copy number increases were most common, such as MYC, SMARCE1, and NOTCH1, which showed extensive CNV amplification. In contrast, NOS3, CASP3, and SMAD4 showed copy number loss (Fig. 3E). and in addition, We investigated the incidence of somatic mutations in 77 ARGs in LGG. Among them, TP53 had the highest mutation rate (up to 45.3%), with higher mutation rates in NOTCH1 (7.4%), PIK3CA (7.4%), and EGFR (6.2%), while the other genes had relatively low mutation rates (Fig. 3F).

3.3 Anoikis Patterns in LGG

To determine which ARGs had prognostic value, we first performed a univariate Cox analysis, and this analysis screened 50 of the 77 ARGs (Fig. 4A). A composite illustration of the complex relationship between the 50 ARGs and the prognostic value of the LGG was displayed as a network diagram (Fig. 4B). The data suggest a significant positive correlation between ARGs with prognostic impact and that there may be complex crosstalk between ARGs, which has important implications for patient prognosis.

Next, we identified two modification patterns based on the expression of 50 ARGs for patients with qualitatively different Anoikis patterns using consensus clustering (Fig. 4C, D), including 383 cases in Cluster1 and 114 cases in Cluster2. Predictive analysis showed that Cluster1 had a better prognosis than Cluster2 (Fig. 4E). As shown in Fig. 4F, analysis of compositional differences regarding clinical characteristics showed that compared to C1, the C2 species had more deceased patients (p = 2.5e-11), tumor grade G3 patients (p = 3.6e-06), IDH1-wild type patients (p = 1.2e-69), and MGMT promoter unmethylated patients (p = 1.2e-23), all of which suggest that patients in the C2 group may have a worse prognosis. Next, we used heat maps to illustrate ARGs' expression patterns and show that ARGs were differentially expressed in the two clusters (Fig. 4G).

In addition, TME was calculated in C1 and C2 separately, and the results of Fig. 4H showed that C1 had a lower ImmuneScore, StromalScore, and EstimateScore, but higher tumor purity. Next, we assessed the difference in immune infiltration of immune cells between the two subtypes based on the gene expression profile of LGG, using the CIBERSORT algorithm combined with the LM22 signature matrix and thus inferring the proportion of 22 tumor-infiltrating immune cells in each subtype, of which we significantly infiltrated 13 immune cells between the two subtypes (Fig. 4I, J). The immune cell composition and immune profile between molecular subtypes were further assessed using four additional immune cell infiltration algorithms, MCPCOUNTER, QUANTISEQ, EPIC, and TIMER (Figure S1). These findings imply that the C1 subtype and the cold immune phenotype are correlated with the C2 subtype and immunological hot phenotype, respectively.

3.4 Building a prognostic risk model using ARGs

In this study, data from 497 cases of LGG were randomly split into a training set of 348 cases and a validation set of 149 cases in a 7:3 ratio. We included 50 significant genes with a significant association with the overall survival of patients in LASSO and RF regression. In the LASSO regression analysis, we set the Lambda value to 0.02547996, resulting in 15 genes (Fig. 5A); the RF algorithm screened 42 genes associated with clinical outcomes in LGG patients (Fig. 5B). We then obtained the intersection of these two algorithms, with 12 genes present simultaneously in the results of both regression analyses (Fig. 5C). We performed a multivariate Cox regression analysis of the crossover genes to reduce the dimensionality further, resulting in the selection of six genes (Fig. 5D). We then created a six-gene signature. The model developed was as follows: ARG RiskScore = 0.106966*expEGFR + 0.383904*expSIX1 + 1.353163*expSP1-0.22767*expANGPTL2-0.69252*expBMP2-0.45003*expPDCD4.

3.5 Predictive value of 6 genetic markers

We calculated the expression level of the risk score for each sample and divided the 348 patients into a high-risk group and a low-risk group using the median. The risk score distribution of the samples was also plotted, with higher risk scores on the OS graph than the low-risk scores, suggesting that samples with high RiskScore had a poorer prognosis. Among the different samples, changes in SP1, EGFR, and SIX1 gene expression increased the risk values, and therefore the high expression of these genes was identified as a risk factor. In contrast, changes in ANGPTL2, BMP2, and PDCD4 gene expression decreased the risk values, and therefore the high expression of these genes was identified as a protective factor (Fig. 5E). Survival analysis showed that high-risk patients had poorer OS (Fig. 5H). The prognostic grading of the risk score was also analyzed by ROC curves over time using the R software Time ROC package. We analyzed the classification efficiency for 1, 3, and 5-year predictions. We can see that the model has a high AUC offline region with an AUC above 0.8 and an AUC offline region of 0.87 for the 1-year prediction (Fig. 5K).

Subsequently, we used TCGA (Test1, Test2), CGGA-325 (Test3), CGGA-693 (Test4), and GSE-16011 (Test5) as the validation cohorts and similarly calculated the expression of risk scores for each sample. The 149 LGG patients in the Test1 cohort, 497 LGG patients in the Test2 cohort, 172 LGG patients in the Test3 cohort, 420 LGG patients in the Test4 cohort, and 106 LGG patients in the Test5 cohort were divided into two groups based on the median RiskScore of the validation cohort. The survival analysis results were similar to the previous ones, with patients in the high-risk group being more likely to have shorter OS and higher mortality. Risk score distribution plots display that a high-risk score leads to poor survival times. Heatmaps were created indicating the presence of genes with high expression levels (SP1, EGFR, and SIX1) and genes with diminished expression (ANGPTL2, BMP2, and PDCD4) in the high-risk subgroup. AUC analysis of the risk score ROC curves indicated a high diagnostic value of the risk score in these five validation cohorts. Kaplan-Meier survival curves, RiskScore distribution plots, and risk score ROC curves for the five validation cohorts are shown in Figs. 5F, G, I, J, L, M, and Figure S2A-L, respectively.

3.6 Characterisation between risk subtypes

As shown in Fig. 6A, the high-risk group had a higher proportion of deceased patients, tumor grade G3 patients, IDH1 wild-type patients, and MGMT promoter non-methylated patients (all P < 0.05), all of this suggests poorer prognosis for patients in the high-risk group. We have adopted the ESTIMATE algorithm to study the specific performance of ARGs in TME. Our calculations suggested increased levels of immune fraction, stromal fraction, and estimated fraction but reduced levels of tumor purity in the high-risk group (Fig. 6B).

To further investigate the prevalence of infiltrating immune cells in TME, we used five methods to assess the proportion of immune cell infiltration in high- and low-risk LGG populations. As Fig. 6C shows, according to the CIBERSORT algorithm, the scores of T cell CD4 + memory resting, Macrophage M1, and Neutrophil immune cell infiltration levels are higher in high-risk groups. In contrast, Neutrophil and B cell memory immune cell infiltration levels were relatively low. We also analyzed four other immune infiltration algorithms to understand the immune profile between risk subtypes comprehensively. The level of multiple immune cells infiltrates generally higher in the high-risk group in the TIMER algorithm (Fig. 6D), QUANTISEQ algorithm (Fig. 6E), EPIC algorithm (Fig. 6F), and MCPCOUNTER algorithm (Fig. 6H). The increased immune cell infiltration is most likely a compensatory consequence of the low local immune response. We confirmed this conjecture of ours not only in the multiple immune infiltration algorithm but also in the examination of 47 immune checkpoint gene expressions and human leukocyte antigen (HLA) gene expressions in both risk subtypes, with most HLA genes and immune checkpoints being upregulated in the high-risk group and, conversely, a trend towards downregulation in the low-risk group (Fig. 6H, I). These findings imply that the immunological hot type is linked to the high-risk group, whereas the immune cold type is linked to the low-risk group.

Given these significant immune-related biological features, we investigated the association between Anoikis-associated prognostic model genes and risk scores with the tumor microenvironment separately and further. First, we examined the relationship between the CIBERSORT immune infiltration algorithm score and the expression of six Anoikis-associated prognostic model genes (Fig. 7A), indicating that all six Anoikis-associated prognostic model genes have an impact on the immune microenvironment. Correlations between CIBERSORT scores and ARG risk scores were also assessed (Fig. 7B), and results yielded positive correlations between T.cells.CD4.memory.resting, Macrophages.M1, Macrophages.M0 and T.cells.regulatory.Tregs. and ARG risk scores (Fig. 7C, D). correlated (Fig. 7C,D,E,F); whereas NK.cells.activated, B.cells.memory, Mast.cells.activated, and Monocytes were negatively correlated with the ARG risk score (Fig. 7G, H, I, J).

The difference in the two groups was considerable, as evidenced by our estimated TMB scores (p < 0.05), moreover, TMB scores and risk scores were positively correlated, indicating good efficacy of our risk scores in assessing the tumor microenvironment (Fig. 7K). In addition, in looking at the tumor mutational load in different risk groups (Fig. 7L), we found that the highest frequently mutated in the high-risk group were IDH1, TPRB, and ATRX. The incidence of IDH1 mutations was significantly lower in patients in the high-risk group (61.7%) than in those in the low-risk group (91.4%).

3.7 Identification and biological functional analysis of DEGs between risk subtypes

To analyze the molecular biological processes guided by the six Anoikis-associated prognostic biomarkers, The analysis of differences first obtained 133 DEGs under our set fold change = 1 and P < 0.05, of which 85 were upregulated and 48 downregulated (Fig. 8A). GSEA results for risk subtypes suggest that the high-risk group was enriched for Fc epsilonri signaling pathway, Fc gammar mediated phagocytosis and Mismatch repair. In contrast, the low-risk group was enriched for Ribosome, Selenoamino acid metabolism, and Taurine and hypotaurine metabolosm (Fig. 8B). Subsequent KEGG analysis showed that these DEGs were enriched in numerous Cellular Processes, Environmental Information Processing, Human Diseases, and Organismal Systems (Fig. 8C, E, G).GO analysis showed that more DEGs were enriched in immune-related biological pathways (Fig. 8D, F, H). The results suggest that these numerous molecular biological pathways profoundly impact the anoikis biological pathway.

3.8 ARG risk predicts new chemotherapy regimens

To find chemotherapeutic agents that could improve the prognosis of LGG patients, on the one hand, we used the R software oncoPredict package to analyze the differences in sensitivity of commonly used chemotherapeutic agents between the two risk subtypes. We found that the eight chemotherapeutic agents with the most significant differences in drug sensitivity were LFM.A13, S.Trityl.L.cysteine, Rapamycin, Parthenolide, QS11, PF.562271, Roscovitine and MP470 (Fig. 9A).

On the other hand, based on screening robust DEGs between high and low subgroups, which DEGs imported into Cmap, we screened the four most statistically significant small molecule drugs, including Risperidone, Pipamperone, FR-180204, and Erastin (Fig. 9B). Notably, a negative Score indicated that the drug reversed the desired biological properties and had potential therapeutic value. We then used AutoDock Vina to molecularly dock these four small molecule compounds to their target macromolecular proteins. The highest binding affinity after molecular docking was then visualized using pymol for DRD2 and Risperidone (-8.1 kcal/mol), DRD2 and Pipamperone (-6.9 kcal/mol), MAPK1 and FR-180204 (-7.4 kcal/mol), VDAC2 and Erastin (-7.6 kcal/mol). The binding sites and interaction results for the best-selected conformations showed that three hydrogen bonds in the binding of DRD2 and Risperidone (Fig. 9C), DRD2 and Pipamperone (Fig. 9D), VDAC2 and Erastin (Fig. 9E); and four hydrogen bonds were found in the binding of MAPK1 and FR-180204 (Fig. 9F).

3.9 Building and evaluating survival models for nomograms

To test if RiskScore based on ARGs may be an independent prognostic indicator, multivariate Cox regression studies were done to create and assess a nomogram survival model. The analysis showed that RiskScore was an independent prognostic factor for LGG patients in the TCGA training set (HR = 1.063, 95% CI: 1.037–1.090, p < 0.001) and was validated in multiple validation cohorts (Fig. 10A). Multivariate Cox was used to build a nomogram model in the TCGA cohort to estimate OS at 1-, 3- and 5 years. Age, gender (male, female), tumor grade (GII, GIII), IDH1 status (mutated, wild), receipt of additional adjuvant therapy such as radiation and chemotherapy (radiation and chemotherapy, radiation or chemotherapy, no adjuvant therapy), ARGs-based RiskScore was included in the model. AUC analysis of the model's ROC curve showed that the nomogram incorporated factors of high diagnostic value, both in the TCGA training cohort and the remaining five validation cohorts, with a C-index value of 0.82 (95% CI: 0.77–0.87) for the training cohort model (Fig. 10B, D, F, H, J, L). The model's accuracy in forecasting the 1-, 3-, and 5-year survival rates was demonstrated using calibration curves. The results showed that the column line plot accurately predicted 1-, 3- and 5-year survival rates for LGG patients (Fig. 10C, E, G, I, K, M). Nomograms for the TCGA queue are shown. (Fig. 11A).

3.10 Immunohistochemical analysis of six Anoikis-related model genes

We retrieved IHC-stained images of six Anoikis model gene-associated proteins from the HPA database in LGG and normal brain tissue. We used them to determine whether these six ARGs exhibited differentially high protein expression levels in LGG. Consistent with the above findings, the analysis revealed that protein expression levels of EGFR, SIX1, SP1, ANGPTL2, and PDCD4 were significantly higher in LGG samples than in normal samples (Fig. 12A, B, C, D, E).

3.11 Single-cell analysis of six Anoikis-related model genes

We used the single-cell data GSE89567 from the TISCH database of gliomas to investigate the expression of six ARGs in TME. This dataset has 17 cell group annotations and four intermediate cell types (Fig. 13A, B). EGFR, SIX1, SP1, ANGPTL2 PDCD4 and BMP2 were all expressed in cancer cells, and notably PDCD4 was expressed in all four intermediate cell types (Fig. 13C,D)

LGG is a group of primary aggressive brain tumors that develop from supporting glial cells and are very common in adults. However, as its pathogenesis is still unclear and treatment outcomes are not satisfactory for the time being, the underlying disease mechanisms are yet to be studied in depth, and the search for treatment modalities that can provide predictive benefits to clinical LGG patients is imminent. Anoikis, a new and unique form of cell death, is highly likely to provide a new treatment strategy for LGG. The activation of apoptosis in response to loss of adherence to the ECM and surrounding cells is known as anoikis. Steven M. Frisch and Hunter Francis initially clarified and outlined the connection between cell adhesion and apoptosis in 1994[13]. According to their research, epithelial cell lines experienced cell death following separation. A phrase derived from the Greek word for homeless was later used to describe this phenomenon: "anoikis." Anoikis is essential in avoiding improper cell attachment and translocation, which might result in aberrant development in an ectopic setting. Due to the heterogeneity in the prognosis of LGG, it emphasizes the need to develop accurate and practical biomarkers for early intervention or even prophylactic treatment of high-risk patients who may have a poor prognosis. We, therefore, aimed to identify and validate a new and validated multigene biomarker and further classify it to predict prognosis and treatment response in patients with ARG-based LGG.

In this study, we provide a comprehensive view of the gene set consisting of 77 anoikis-associated genes with differential expression patterns in LGG samples, first obtained using variance analysis and WGCNA methods in the combined TCGA and GTEx datasets, and observed the characteristics of these genes from a multi-omics perspective. Regarding gene expression patterns, most ARGs are highly expressed in LGGs, and there is some correlation between these genes. The survey showed that CNV-associated mutations were prevalent and copy number increases were most common, with varying degrees of somatic mutation frequency, with TP53 having the highest mutation rate at 45.3%, as observed from somatic mutation patterns.

We then identified 50 ARGs with predictive value for unsupervised consistency clustering and feature construction. First, we found two molecular subgroups based on ARG that had significantly different clinicopathological characteristics, tumor immune microenvironment, and prognoses. The subtype with a better prognosis, the C1 cluster, comprises more surviving G2, IDH1 mutant, and MGMT promoter methylation patients. TME is a complex combination of multiple components, so ImmuneScore, StromalScore, EstimateScore, and Tumourpurity were calculated to infer the respective components in each patient. The results suggest that LGG patients in cluster C1 may have greater stromal, immune, and extracellular abundance, and LGG patients in cluster C2 may have more tumor purity components. TME components play a crucial role in the initiation and progression of cancer, and targeting TME remodeling may offer a promising treatment strategy to slow tumor growth[32]. The biological activity of tumors is significantly impacted by the immunological microenvironment[33]. As a result, multiple immune cell infiltration scores and differences between the different clusters were visualized. These findings imply that the C1 subtype connects with the cold immune phenotype, whereas the C2 subtype correlates with the immunological hot phenotype, suggesting that the C2 cluster population may benefit from immunotherapy.

We then used machine learning algorithms such as Lasso regression and RF algorithms to screen for robust anoikis-related genes, construct risk score models, and validate their stability in internal validation groups and external validation groups consisting of multiple cohorts such as TCGA, CGGA, and GEO. This new ARG signature involves EGFR, SIX1, SP1, ANGPTL2, PDCD4, and BMP2, forming a robust risk-scoring signature. Among these is EGFR, a member of the group of tyrosine kinases known as the epidermal growth factor and transmembrane receptors (EGF)[34]. AREG, BTC, EGF, EREG, HBEGF, and TGF are the currently known EGFR ligands. Previous research has demonstrated the critical role of the EGFR signaling cascade response in the development of cancer[35]. EGFR regulates anoikis resistance in gliomas through downstream pathways such as PI3K/AKT pathway, MAPK pathway, and PLCγ/PKC pathway[21, 36–38]. Specifically, in normal cells, Bim is one of the only BH3 proteins that, when detached, is upregulated by downregulating EGFR signaling, which triggers the intrinsic pathway of loss-of-nest apoptosis. Specifically, the EGFR/MAPK signaling event targets Bim, a member of the BH3-only Bcl-2 family in normal cells. When cells are shed, they are upregulated by downregulating EGFR signaling, which triggers the intrinsic pathway of loss-of-nest apoptosis[39]. In vitro experiments have shown that overexpression of EGFR inhibits BIM via the MAPK pathway, thereby inducing anoikis resistance[40]. In the PI3K/AKT pathway, NF-kB is a downstream target of EGFR and an inducer of anoikis resistance[41]. In addition to the above pathways, EGFR amplification activates NF-kB via TMEM43/LUMA, leading to glioma resistance to anoikis apoptosis[42]. SIX1 is a developmentally relevant transcription factor that promotes cell proliferation and represses apoptosis in organoid embryonic development[43]. Overexpression of SIX1 has been observed in various malignancies, including gliomas[44, 45]. SIX1 enhances the adhesion of tumor cells to ECM molecules by upregulating the expression of α5β1, enhancing the invasion of tumor cells to target organs, and enhancing the anti-anoikis ability of tumor cells, thus promoting tumor metastasis[46]. SP1 is a well-known transcription factor family member and is essential for normal embryonic development[47]. Aberrant overexpression of the SP1 gene and disruption of the transcriptional activity of the protein it encodes is associated with numerous cancers, including lung, breast, gastric, and glioma[48]. SP1 upregulates the expression of the endogenous apoptotic pathway suppressor survivin, and the subsequent increase in survivin protects cancer cells against anoikis by blocking intrinsic apoptotic activity[49]. ANGPTL2 is considered to be an aggravating factor for metastasis in a variety of cancers. Resistance to anoikis is inhibited in cancer cells expressing ANGPTL2[50]. PDCD4 is a tumor suppressor gene that exerts anti-tumor effects by promoting apoptosis and inhibiting the proliferation and metastasis of tumor cells[51]. PDCD4 is regulated by mir-21 targeting and promotes anoikis resistance in cancer cells[52]. BMP2 is a pluripotent factor and a member of the transforming growth factor-β (TGF-β) superfamily, associated with embryonic development and homeostasis of tissues and organs[53]. BMP2 and BMP9 inhibit anchoring non-dependent survival and promote anoikis[54].

In both cohorts of the risk model we created based on this signature, a poorer prognosis was observed for patients in the high-risk subgroup, where more fantastic immune, stromal and extracellular abundance resided. The high-risk subgroup had higher immune cell infiltration scores. In addition, increased expression of immune checkpoint genes showed an association with poor survival prognosis, suggesting that immune checkpoint inhibitors may provide clinical benefit to these populations. Our signatures are of good value. Age, tumor grade, IDH1 status, receipt of additional adjuvant treatments such as radiotherapy and chemotherapy, and the ARGs-based RiskScore were shown to act as independent prognostic indicators. Then, nomograms with the aforementioned independent prognostic indications were drawn and utilized to calculate the likelihood that patients with LGG will survive. Regarding the nomogram's comprehensive validation, the ROC curves' AUC values are likewise good, demonstrating the nomogram's strong predicting ability.

There are several limitations to this study. Firstly, although validated in many ways to the best of our ability, the ARG-based signature was created and validated based on retrospective data from a database. Its effectiveness and usefulness still need to be evaluated in more large-scale prospective clinical investigations. In addition, additional and more rigorous basic research experiments are needed to stress the critical role of the ARG signature gene in LGG genesis and development.

In conclusion, our study provides a critical approach and a new perspective on the role of anoikis in LGG patients. It sheds light on the potential mechanisms by which anoikis influence LGG progression.

LGG: Lower-grade glioma; ECM: Extracellular matrix; TCGA: The Cancer Genome Atlas; GEO: Gene Expression Omnibus; CGGA: Chinese Glioma Genome Atlas; ARGs: Anoikis-related genes; TNF: Tumor necrosis factor; Fas: First apoptotic signal; WGCNA: Weighted gene co-expression network analysis; RF: Random Forest; TME: Tumor microenvironment; GIFtS: GeneCards Inferred Function Score; DEGs: Differentially Expressed Genes; FC: Fold change; FDR: False discovery rate; PPI: Protein-protein interaction network; CNV: Copy number variation; PAM: Partitioning around medoid; GSEA: Gene set enrichment analysis; IC50: 50% Inhibitory concentration; CTRP: Cancer Treatment Response Portal; Cmap: Connectivity map; HPA: Human Protein Atlas; TGF-β: Transforming growth factor-β;

Availability of data and material

Data and download URLs involved in this study had been described in detail in the materials and methods section. The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Code availability

Bioinformatics analysis of this study was based on R 4.2.2, and the involved R packages were described in detail in the materials and methods section. We uploaded the R code in the supplementary file.

Ethics approval and consent to paticipate
All datasets used in the present study were downloaded from public databases, including TCGA, GEO and CGGA database. These public databases allow researchers to download and analyze public datasets for scientific purposes and thus no ethical approval nor informed consent was required. The current research follows the TCGA, GEO and CGGA data access policies and publication guidelines. All methods/protocols were performed in accordance with the relevant guidelines and regulations.

Consent to participate

All authors voluntarily participated in this study.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Funding

No funding.

Author Contributions

AA & AM: Conceptualization, Methodology, Validation, Investigation, Supervision, Visualization, Writing - original draft, Writing- Reviewing. ZW, QF, SL, YL, GF, YS, YM, YW, and QZ participated in the coordination of data acquisition and data analysis and reviewed the manuscript.

Acknowledgments

We are grateful to the contributors to the public databases used in this study.

Ostrom QT, Patil N, Cioffi G, Waite K, Kruchko C, Barnholtz-Sloan JS: CBTRUS Statistical Report: Primary Brain and Other Central Nervous System Tumors Diagnosed in the United States in 2013–2017. Neuro Oncol 2020, 22(12 Suppl 2):iv1-iv96.
Nunna RS, Khalid S, Ryoo JS, Sethi A, Byrne RW, Mehta AI: Radiotherapy in adult low-grade glioma: nationwide trends in treatment and outcomes. Clinical & translational oncology: official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico 2021, 23(3):628–637.
Louis DN, Perry A, Wesseling P, Brat DJ, Cree IA, Figarella-Branger D, Hawkins C, Ng HK, Pfister SM, Reifenberger G et al: The 2021 WHO Classification of Tumors of the Central Nervous System: a summary. Neuro Oncol 2021, 23(8):1231–1251.
Tabrizi S, Shih HA: The path forward for radiation therapy in the management of low-grade gliomas. Neuro Oncol 2020, 22(6):748–749.
Jooma R, Waqas M, Khan I: Diffuse Low-Grade Glioma - Changing Concepts in Diagnosis and Management: A Review. Asian journal of neurosurgery 2019, 14(2):356–363.
Tom MC, Park DYJ, Yang K, Leyrer CM, Wei W, Jia X, Varra V, Yu JS, Chao ST, Balagamwala EH et al: Malignant Transformation of Molecularly Classified Adult Low-Grade Glioma. International journal of radiation oncology, biology, physics 2019, 105(5):1106–1112.
Forst DA, Nahed BV, Loeffler JS, Batchelor TT: Low-grade gliomas. The oncologist 2014, 19(4):403–413.
Nabors LB, Portnow J, Ahluwalia M, Baehring J, Brem H, Brem S, Butowski N, Campian JL, Clark SW, Fabiano AJ et al: Central Nervous System Cancers, Version 3.2020, NCCN Clinical Practice Guidelines in Oncology. Journal of the National Comprehensive Cancer Network: JNCCN 2020, 18(11):1537–1570.
Stupp R, Mason WP, van den Bent MJ, Weller M, Fisher B, Taphoorn MJ, Belanger K, Brandes AA, Marosi C, Bogdahn U et al: Radiotherapy plus concomitant and adjuvant temozolomide for glioblastoma. The New England journal of medicine 2005, 352(10):987–996.
Jiang T, Nam DH, Ram Z, Poon WS, Wang J, Boldbaatar D, Mao Y, Ma W, Mao Q, You Y et al: Clinical practice guidelines for the management of adult diffuse gliomas. Cancer letters 2021, 499:60–72.
Gritsenko PG, Atlasy N, Dieteren CEJ, Navis AC, Venhuizen JH, Veelken C, Schubert D, Acker-Palmer A, Westerman BA, Wurdinger T et al: p120-catenin-dependent collective brain infiltration by glioma cell networks. Nature cell biology 2020, 22(1):97–107.
Chiarugi P, Giannoni E: Anoikis: a necessary death program for anchorage-dependent cells. Biochemical pharmacology 2008, 76(11):1352–1364.
Frisch SM, Francis H: Disruption of epithelial cell-matrix interactions induces apoptosis. The Journal of cell biology 1994, 124(4):619–626.
Frantz C, Stewart KM, Weaver VM: The extracellular matrix at a glance. Journal of cell science 2010, 123(Pt 24):4195–4200.
Han HJ, Sung JY, Kim SH, Yun UJ, Kim H, Jang EJ, Yoo HE, Hong EK, Goh SH, Moon A et al: Fibronectin regulates anoikis resistance via cell aggregate formation. Cancer letters 2021, 508:59–72.
Simpson CD, Anyiwe K, Schimmer AD: Anoikis resistance and tumor metastasis. Cancer letters 2008, 272(2):177–185.
Amoedo ND, Rodrigues MF, Rumjanek FD: Mitochondria: are mitochondria accessory to metastasis? The international journal of biochemistry & cell biology 2014, 51:53–57.
Paoli P, Giannoni E, Chiarugi P: Anoikis molecular pathways and its role in cancer progression. Biochimica et biophysica acta 2013, 1833(12):3481–3498.
Jin L, Chun J, Pan C, Kumar A, Zhang G, Ha Y, Li D, Alesi GN, Kang Y, Zhou L et al: The PLAG1-GDH1 Axis Promotes Anoikis Resistance and Tumor Metastasis through CamKK2-AMPK Signaling in LKB1-Deficient Lung Cancer. Molecular cell 2018, 69(1):87–99.e87.
Wang J, Luo Z, Lin L, Sui X, Yu L, Xu C, Zhang R, Zhao Z, Zhu Q, An B et al: Anoikis-Associated Lung Cancer Metastasis: Mechanisms and Therapies. Cancers (Basel) 2022, 14(19).
Zhu Z, Fang C, Xu H, Yuan L, Du Y, Ni Y, Xu Y, Shao A, Zhang A, Lou M: Anoikis resistance in diffuse glioma: The potential therapeutic targets in the future. Front Oncol 2022, 12:976557.
Vivian J, Rao AA, Nothaft FA, Ketchum C, Armstrong J, Novak A, Pfeil J, Narkizian J, Deran AD, Musselman-Brown A et al: Toil enables reproducible, open source, big biomedical data analyses. Nature biotechnology 2017, 35(4):314–316.
Yang F, Wang T, Yan P, Li W, Kong J, Zong Y, Chao X, Li W, Zhao X, Wang J: Identification of pyroptosis-related subtypes and establishment of prognostic model and immune characteristics in asthma. Front Immunol 2022, 13:937832.
Langfelder P, Horvath S: WGCNA: an R package for weighted correlation network analysis. BMC bioinformatics 2008, 9:559.
Chen T, Zhang H, Liu Y, Liu YX, Huang L: EVenn: Easy to create repeatable and editable Venn diagrams and Venn networks online. Journal of genetics and genomics = Yi chuan xue bao 2021, 48(9):863–866.
Mayakonda A, Lin DC, Assenov Y, Plass C, Koeffler HP: Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome research 2018, 28(11):1747–1756.
Wilkerson MD, Hayes DN: ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics (Oxford, England) 2010, 26(12):1572–1573.
Wang J, Ren J, Liu J, Zhang L, Yuan Q, Dong B: Identification and verification of the ferroptosis- and pyroptosis-associated prognostic signature for low-grade glioma. Bosn J Basic Med Sci 2022, 22(5):728–750.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES et al: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences of the United States of America 2005, 102(43):15545–15550.
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdóttir H, Tamayo P, Mesirov JP: Molecular signatures database (MSigDB) 3.0. Bioinformatics (Oxford, England) 2011, 27(12):1739–1740.
Maeser D, Gruener RF, Huang RS: oncoPredict: an R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Briefings in bioinformatics 2021, 22(6).
Lim AR, Rathmell WK, Rathmell JC: The tumor microenvironment as a metabolic barrier to effector T cells and immunotherapy. eLife 2020, 9.
Angelova M, Mlecnik B, Vasaturo A, Bindea G, Fredriksen T, Lafontaine L, Buttard B, Morgand E, Bruni D, Jouret-Mourin A et al: Evolution of Metastases in Space and Time under Immune Selection. Cell 2018, 175(3):751–765.e716.
Kelly DM, Li L, Burgess AI, Poole DL, Duerden JM, Rothwell PM: Associations of blood biomarkers with glomerular filtration rate in patients with TIA and stroke: population-based study. Stroke and vascular neurology 2021, 6(1):48–56.
Sabbah DA, Hajjo R, Sweidan K: Review on Epidermal Growth Factor Receptor (EGFR) Structure, Signaling Pathways, Interactions, and Recent Updates of EGFR Inhibitors. Current topics in medicinal chemistry 2020, 20(10):815–834.
Kumagai S, Koyama S, Nishikawa H: Antitumour immunity regulated by aberrant ERBB family signalling. Nature reviews Cancer 2021, 21(3):181–197.
Runkle KB, Kharbanda A, Stypulkowski E, Cao XJ, Wang W, Garcia BA, Witze ES: Inhibition of DHHC20-Mediated EGFR Palmitoylation Creates a Dependence on EGFR Signaling. Molecular cell 2016, 62(3):385–396.
Yarden Y, Shilo BZ: SnapShot: EGFR signaling pathway. Cell 2007, 131(5):1018.
Quadros MR, Connelly S, Kari C, Abrams MT, Wickstrom E, Rodeck U: EGFR-dependent downregulation of Bim in epithelial cells requires MAPK and PKC-delta activities. Cancer biology & therapy 2006, 5(5):498–504.
Buchheit CL, Angarola BL, Steiner A, Weigel KJ, Schafer ZT: Anoikis evasion in inflammatory breast cancer cells is mediated by Bim-EL sequestration. Cell death and differentiation 2015, 22(8):1275–1286.
Shao N, Lu Z, Zhang Y, Wang M, Li W, Hu Z, Wang S, Lin Y: Interleukin-8 upregulates integrin β3 expression and promotes estrogen receptor-negative breast cancer cell invasion by activating the PI3K/Akt/NF-κB pathway. Cancer letters 2015, 364(2):165–172.
Jiang C, Zhu Y, Zhou Z, Gumin J, Bengtsson L, Wu W, Songyang Z, Lang FF, Lin X: TMEM43/LUMA is a key signaling component mediating EGFR-induced NF-κB activation and tumor progression. Oncogene 2017, 36(20):2813–2823.
Kumar JP: The sine oculis homeobox (SIX) family of transcription factors as regulators of development and disease. Cellular and molecular life sciences: CMLS 2009, 66(4):565–583.
Chen G, Chen Z, Zhao H: MicroRNA-155-3p promotes glioma progression and temozolomide resistance by targeting Six1. Journal of cellular and molecular medicine 2020, 24(9):5363–5374.
Fang ZX, Li CL, Wu Z, Hou YY, Wu HT, Liu J: Comprehensive analysis of the potential role and prognostic value of sine oculis homeobox homolog family in colorectal cancer. World journal of gastrointestinal oncology 2022, 14(11):2138–2156.
Liu D, Zhang XX, Wan DY, Xi BX, Ma D, Wang H, Gao QL: Sine oculis homeobox homolog 1 promotes α5β1-mediated invasive migration and metastasis of cervical cancer cells. Biochemical and biophysical research communications 2014, 446(2):549–554.
Vizcaíno C, Mansilla S, Portugal J: Sp1 transcription factor: A long-standing target in cancer chemotherapy. Pharmacology & therapeutics 2015, 152:111–124.
Ivanenko KA, Prassolov VS, Khabusheva ER: [Transcription Factor Sp1 in the Expression of Genes Encoding Components of MAPK, JAK/STAT, and PI3K/Akt Signaling Pathways]. Molekuliarnaia biologiia 2022, 56(5):832–847.
Mak CS, Yung MM, Hui LM, Leung LL, Liang R, Chen K, Liu SS, Qin Y, Leung TH, Lee KF et al: MicroRNA-141 enhances anoikis resistance in metastatic progression of ovarian cancer through targeting KLF12/Sp1/survivin axis. Molecular cancer 2017, 16(1):11.
Takeshita Y, Motohara T, Kadomatsu T, Doi T, Obayashi K, Oike Y, Katabuchi H, Endo M: Angiopoietin-like protein 2 decreases peritoneal metastasis of ovarian cancer cells by suppressing anoikis resistance. Biochemical and biophysical research communications 2021, 561:26–32.
Lu K, Chen Q, Li M, He L, Riaz F, Zhang T, Li D: Programmed cell death factor 4 (PDCD4), a novel therapy target for metabolic diseases besides cancer. Free radical biology & medicine 2020, 159:150–163.
Zhao MY, Wang LM, Liu J, Huang X, Liu J, Zhang YF: MiR-21 Suppresses Anoikis through Targeting PDCD4 and PTEN in Human Esophageal Adenocarcinoma. Current medical science 2018, 38(2):245–251.
Li TT, Lai YW, Han X, Niu X, Zhang PX: BMP2 as a promising anticancer approach: functions and molecular mechanisms. Investigational new drugs 2022, 40(6):1322–1332.
Shonibare Z, Monavarian M, O'Connell K, Altomare D, Shelton A, Mehta S, Jaskula-Sztul R, Phaeton R, Starr MD, Whitaker R et al: Reciprocal SOX2 regulation by SMAD1-SMAD3 is critical for anoikis resistance and metastasis in cancer. Cell reports 2022, 40(4):111066.

No competing interests reported.

FigureS1.tif
(A) Two clustering components of 8 immuno-infiltrative cells in EPIC algorithm. (B) Two clustering components of 6 immuno-infiltrative cells in TIMER algorithm. (C) Two clustering components of 11 immuno-infiltrative cells in QUANTISEQ algorithm. (D) Two clustering components of 10 immuno-infiltrative cells in MCPCOUNTER algorithm.
FigureS2.tif
Distribution of risk scores, survival times, and gene expression panels, CGGA-325(A), CGGA-693 (B), GSE-16011 (C). Kaplan-Meier curves for LGG OS based on risk scores, CGGA-325(D), CGGA-693 (E), and GSE-16011 (F). The ROC curves show the predictive efficiency of the risk scores for 1, 3, and 5 years, CGGA-325(G), CGGA-693 (H), and GSE-16011 (I).

Download PDF

Version 1

posted

You are reading this latest preprint version

Prognostic value of anoikis-related genes revealed using multi-omics analysis and machine learning based on lower-grade glioma features and tumour immune microenvironment

Status:

Version 1

Abstract

Figures

1. Introduction

2. Methods

2.1 Inclusion of patients

2.2 Anoikis-Relevant Gene Selection

2.3 Detection of Differentially Expressed Genes (DEGs) and LGG-Related Genes

2.4 Integrated multivariate analysis of ARGs

2.5 Different pattern recognition for anoikis based on unsupervised clustering

2.6 Characterisation between molecular subtypes

2.7 Machine learning-based identification of risk signatures for optimal multi-gene combinations

2.8 Characterisation between risk subtypes

2.9 Risk subtype gene set enrichment analysis and Gene set enrichment analysis (GSEA)

2.10 Chemotherapeutics Forecast and Molecular Docking

3. Results

3.1 Variation Analysis and WGCNA of DEGs

3.2 Characteristics of the ARGs

3.3 Anoikis Patterns in LGG

3.4 Building a prognostic risk model using ARGs

3.5 Predictive value of 6 genetic markers

3.6 Characterisation between risk subtypes

3.7 Identification and biological functional analysis of DEGs between risk subtypes

3.8 ARG risk predicts new chemotherapy regimens

3.9 Building and evaluating survival models for nomograms

3.10 Immunohistochemical analysis of six Anoikis-related model genes

4. Discussion

5. Conclusion

Abbreviations

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1