Signature Construction and Molecular Subtype Identification Based on Immune-Related Genes for Better Prediction of Prognosis in Hepatocellular Carcinoma

doi:10.21203/rs.3.rs-1634804/v1

Download PDF

Research Article

Signature Construction and Molecular Subtype Identification Based on Immune-Related Genes for Better Prediction of Prognosis in Hepatocellular Carcinoma

https://doi.org/10.21203/rs.3.rs-1634804/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 14 Jun, 2023

Read the published version in BMC Medical Genomics →

You are reading this latest preprint version

Objective: Hepatocellular carcinoma (HCC) immunotherapy is a focus of current research. We established a model that can effectively predict the prognosis and efficacy of HCC immunotherapy by analyzing the immune genes of HCC.

Methods: Through the data mining of hepatocellular carcinoma in The Cancer Genome Atlas (TCGA), the immune genes with differences in tumor and normal tissues are screened, and then the univariate regression analysis is carried out to screen the immune genes with differences related to prognosis. The prognosis model of immune related genes is constructed by using the minimum absolute contraction and selection operator (lasso) Cox regression model in the TCGA training set data, The risk score of each sample was calculated, and the survival was compared with the Kaplan Meier curve and the receiver operating characteristic (ROC) curve to evaluate the predictive ability. Data sets from ICGC and TCGA were used to verify the reliability of signatures. The correlation between clinicopathological features, immune infiltration, immune escape and risk score was analyzed.

Results: Seven immune genes were finally determined as the prognostic model of liver cancer. According to these 7 genes, the samples were divided into the high and low risk groups, and the results suggested that the high-risk group had a poorer prognosis, lower risk of immune escape, and better immunotherapy effect. In addition, the expression of TP53 and MSI was positively correlated in the high-risk group. Consensus clustering was performed to identify two main molecular subtypes (named clusters 1 and 2) based on the signature. It was found that compared with cluster 1, better survival outcomes was observed in cluster 2.

Conclusion: signature construction and molecular subtype identification of immune-related genes could be used to predict the prognosis of HCC, which may provide a specific reference for the development of novel biomarkers for HCC immunotherapy.

hepatocellular carcinoma

prognostic model

bioinformatics

immune microenvironment

immunotherapy

Hepatocellular carcinoma (HCC) is currently the sixth most common tumor in the world and the fourth most common in terms of mortality⁽¹⁾. HCC is expected to become the third leading cause of cancer-related deaths by 2030, according to epidemiology⁽²⁾. Despite known risk factors for HCC, including hepatitis B, alcoholism and cirrhosis, the incidence of HCC remains high worldwide. At present, there are many ways to treat hepatocellular carcinoma, among which surgical resection is the main method, and intervention, targeted drugs and immunotherapy are also important treatment methods. However, the therapeutic effect of hepatocellular carcinoma is still poor, and its recurrence rate and mortality rate have not been effectively controlled⁽³⁾. Only 30% of HCC patients are reported to be in the early stages suitable for radical surgery. In addition, the efficacy of chemotherapeutic agents and targeted agents for advanced HCC is still limited⁽⁴⁾.

Although currently targeted drug therapy for unresectable HCC has been proven to be effective⁽⁵⁾. Such as sorafenib, lovastinib and the vascular endothelial growth factor(VEGF) inhibitor ramuzumab are widely used in clinical applications^(6–8). However, we found that all of these drugs had drug reactions of varying degrees, such as skin itching, gastrointestinal reactions, and elevated blood pressure⁽⁹⁾. In addition, long-term use of the body is easy to produce drug resistance, its treatment effect is not as expected⁽¹⁰⁾. In recent years, new therapeutic strategies such as tumor immunosuppressive therapy have extended patients' lives, and the combination of immune checkpoint inhibitors (ICIs) and VEGF inhibitors is currently positioned as the first-line treatment for advanced HCC.

Immunotherapy has opened a new era of tumor therapy, and immunocheckpoint inhibitors (ICIs), including programmed cell death 1(PD-1)/programmed cell death ligand 1(PD-L1) inhibitors, have become a breakthrough in tumor therapy. ICIs are typical immunotherapies that activate anti-tumor immunity by inhibiting negative regulatory receptors such as PD-1 and cytotoxic T lymphocyte antigen 4(CTLA4)⁽¹¹⁾.

At present, immunotherapy for hepatocellular carcinoma has received more and more attention. The occurrence and development of hepatocellular carcinoma is closely related to tumor microenvironment⁽¹²⁾. Tumor microenvironment is a dynamic system composed of tumor cells, complex cytokine environment, extracellular matrix and immune cell subsets⁽¹³⁾.

In this study, we constructed and validated an immune-related prognostic model based on the TCGA-LIHC dataset and ICGC-LIRI-JP dataset. In addition, we explored the relationship between the constructed prognostic model and the clinical and pathological features of HCC patients. We analyzed the characteristics of tumor immune microenvironment, including tumor-infiltrating cell composition, immune escape, TP53 mutation rate, and tumor microenvironment. These findings may provide new insights into novel therapeutic targets for hepatocellular carcinoma

Data Preparation and Processing

From TCGA (https://portal.gdc.cancer.gov/) download the mRNA expression data and clinical data of LIHC. We obtained the data of 374 tumor specimens and 50 normal specimens. Excluding HCC samples with a survival time of less than 30 days, we finally obtained the data of 342 HCC patients. 342 TCGA tumor samples were randomly divided into two equal parts: training set (Set1) and verification set (set2). The total samples of TCGA were used as another verification set (set3). From ICGC (https://dcc.icgc.org/) download the data of the Japan Institute of liver cancer (ICGC-LIRI-JP), exclude patients with metastatic liver cancer and survival of less than 30 days, and finally 229 patients with HCC were included in the study. These sample data were used as an external validation set (Set4).

From the Gene List module of the Immunology Database and Analysis Portal (ImmPort) database, we downloaded complete gene names directly, totaling 1793 immune-related genes.

Differential Expression Analysis and Prognostic Gene Screening

Based on the data of LIHC in TCGA, we analyzed the mRNA expression differences between 374 tumor samples and 50 normal samples. The "Limma" R package was used to screen out differential genes (DEGs) according to adjust P < 0.01 and absolute value of logFC > 2. The differentially expressed immune genes (DEIGs) were obtained by the intersection of immune-related genes and DEGs of TCGA. Then DEIGs were obtained by univariate Cox analysis to explore the relationship between overall survival (OS) and gene expression level. When P value < 0.05, genes were considered to have significant prognostic potential.

Copy Number Variation and Functional Enrichment Analysis

The "RCircos" R package was used to show the mutation locations of these prognostic DEIGs on 23 chromosomes. The gain or loss of these genes were visualized. The "org.hs.eg.db", "Enrichment plot" and "clusterProfiler" packages in R were used to analyze GO and KEGG enrichment of prognostic DEIGs in TCGA to explore potential molecular mechanisms and biological functions.

Establishment of Immune Risk Scoring Signature (IRSS) for Prognosis

In the TCGA train set (set1), LASSO regression was adopted to process prognostic DEIGs to further identify differentially expressed genes with independent prognostic value. Multivariate cox regression analysis was then used to evaluate whether these genes could be used as independent prognostic predictors and finally determine the genes to construct the model. Next, The following formula was used to calculate the risk score for each patient:

risk score = expression for each gene ∗ coefficient for each gene

The sample was divided into high-risk and low-risk groups based on the median value of the risk score.

Validation of The Risk Score with TCGA and ICGC Datasets

According to the established IRSS scoring system, the risk score of each sample was calculated. Set1, set2 and set3 groups were divided into high risk group and low risk group respectively according to the median value of risk score. Then ROC curve and Kaplan-Meier curve were drawn to verify the prediction accuracy of the risk scoring model. Nomogram was used to assess the survival risk of HCC patients in TCGA, including gender, age, TNM stage, IRSS and other clinical information. Calibration curves (1, 2 and 3 years) were drawn to assess the accuracy of Nomogram predictions. To further verify the accuracy of this model, we compared it with published prediction models of HCC.

Genomic Alterations Analyses

To determine whether risk score levels were associated with specific genomic traits, Copy number variation (CNV) analysis was performed using the TCGA dataset.

Association between Microsatellite Instability (MSI) and Constructed Predictive Models

The whole TCGA dataset was further analyzed based on the constructed prognostic model after removing the samples without microsatellite status information. Then, according to the microsatellite status information extracted from the phenotypic data, we first compared whether there was any difference in the expression level of MSI in the high and low risk groups, and then divided the total samples into MSI-high and MSI-low groups through the expression level of MSI in each sample, and compared whether there was a significant difference in OS between the two groups. Thus, the relationship between high and low risk groups and OS can be further determined.

Analysis of Tumor Mutation Burden (TMB) and Immune Escape

To further verify the relationship between the constructed model and tumor microenvironment and immunotherapy, correlation boxplot was constructed by Pearson correlation analysis to study the impact of risk score on TMB. TIDE scores were compared between the high and low risk groups to analyze whether there was a difference in the efficacy of immune checkpoint blocking treatment between the high and low risk groups.

Consensus Clustering of Prognostic Genes

To investigate the functions of the seven selected prognostic DEIGs, we clustered the HCC into different groups with “ConsensusClusterPlus” (50 iterations, resample rate 80% and Pearson correlation). Kaplan-Meier survival curve was used to analyze the OS of each subtype, and the clinical data and gene expression levels between subgroups were compared and analyzed, as well as the immune cell infiltration and immune cell content between subgroups.

Statistical Analysis

The data were analyzed by Excel software, and the visualization of DEGs was completed by "ggplot2", "Cairo" and "ggrepel" R packages. The difference in overall survival was calculated by Kaplan-Meier method, and the significant difference was determined by R. Spearman regression analysis by Cox. ROC curve of R package survival was drawn and visualized to calculate AUC⁽¹⁴⁾. Somatic mutation and CNV data were downloaded from TCGA database. Copy number changes associated with risk scores were analyzed using GISTIC 2. All data and statistical analysis were based on R software 4.1.2, and P < 0.05 was considered statistically significant.

Prognostic DEIGs Screening

The overall process of the study is shown in Fig. 1. We first analyzed the differences between 374 tumor samples and 50 normal samples in TCGA. According to adjust P < 0.01 and absolute value of logFC > 2, 2874 differential genes were screened out (Table S1), including 362 down-regulated genes and 2512 up-regulated genes. Visualization using heat and volcano maps (Figs. 2A,B). Then, 154 differential expression immune genes(DEIGs) were obtained through the intersection of immune-related genes (Fig. 2C). Univariate Cox proportional risk analysis was performed on DEIGs, and 28 DEIGs related to prognosis were obtained (Fig. 2D, Table S2).

Prognostic DEIGs Functional Enrichment and Genetic Alterations

GO and KEGG enrichment analysis was performed on 28 prognostic DEIGs. GO enrichment results showed that prognostic DEIGs were mainly enriched in Epithelial cell proliferation, gland development, regulation of cysteine-type endopeptidase activity, regulation of cysteine-type endopeptidase activity involved in apoptotic process, regulation of endopeptidase activity. (Fig. 3A). KEGG results showed that prognostic DEIGs were enriched in Bladder cancer, Cytokine-cytokine receptor interaction, IL-17 signaling pathway, Melanoma, Neuroactive ligand-receptor interaction (Fig. 3B).

We also studied the copy number variation of 28 prognostic genes and summarized the CNV variation frequency of 28 prognostic DEIGs in TCGA-LIHC (Fig. 3C). Visualized the position of CNV variation on chromosomes (Fig. 3D).

Construction and Prognostic Value of IRSS

In order to calculate the risk score, patients with no survival information and survival time less than 30 days were excluded. Finally, 342 HCC patients were included in the study, and the total sample set (Sum) was randomly divided into the train set and the test set, with 171 patients in each group. Chi-square test was used to determine that there was no statistically significant difference in clinical characteristics among each group (Table 1). LASSO regression analysis was performed on 28 prognostic DEIGs in a training set containing 171 patients, and the model fitted best when the penalty index was 10 (Fig. 4A,B). Then, 7 prognostic genes were obtained through multivariate Cox regression analysis :GAL, NR0B1, MAPT, CCR3, GLP1R, BIRC5 and IL-17B(Table S3). Combined with the corresponding regression coefficients, the final IRSS is established:

IRSS=(GAL exp * 0.359 )+(NR0B1 exp * 0.454)+ (MAPT exp*0.495)+(CCR3 exp * 0.641)+(GLP1R exp * 0.306)+(BIRC5 exp * 0.428)+(IL17B exp * 2.100)

The mRNA expression differences of seven genes in the train group were represented by heat maps (Fig. 4C). The risk score for each patient was calculated based on IRSS, and the sample was divided into high-risk and low-risk groups using the median. In the train set, the probability of OS was lower in the high-risk group than in the low-risk group (P < 0.05) (Fig. 4D). As the risk score increased, the survival time decreased and the number of patients in the state of death increased gradually (Fig. 4E). The accuracy of the model in predicting OS of HCC patients was evaluated by ROC curve, and the AUC values at 1, 2 and 3 years were 0.845, 0.823 and 0.808, respectively (Fig. 4F).The mRNA expression differences of 7 prognostic DEIGs in tumor and normal tissues in the TCGA data set (Figure S1).

Table 1

Chi-square test results of the TCGA training set, the test set and the ICGC cohort
Characteristics		TCGA		ICGC n = 229	P-value
Characteristics	Train (set1) n = 171	Test (set2) n = 171	Sum (set3) n = 342	ICGC n = 229	P-value
Gender
Male	125(73.1%)	110(64.3%)	233(68.1%)	168(73.4%)	0.103
Female	46(26.9%)	61(35.7%)	109(31.9%)	61(26.6)
Age
<=65	108(63.2%)	109(63.2%)	216(63.2%)	88(38.4%)	0.911
> 65	63(36.8%)	62(36.8%)	126(36.8%)	141(61.6%)
Status
Alive	113(66.1%)	110(64.3%)	223(65.2%)	189(82.5%)	0.733
Dead	58(33.9%)	61(35.7%)	119(34.8%)	40(17.5%)

Validation of The Risk Score with TCGA and ICGC Datasets

In the test set, according to the risk score calculated by IRSS, the samples were divided into high and low risk groups using the median value. The analysis found that there was a significant difference in survival probability between the high and low risk groups (P < 0.05), and the high risk group had a lower survival rate (Fig. 5A). The AUC values assessed by ROC curve at 1, 2 and 3 years were 0.748, 0.740 and 0.684, respectively(Fig. 5B). The mRNA expression differences of prognostic genes in the test set were represented by heat maps (Figure S2A).

In TCGA sum set, the total sample is also divided into high and low risk groups according to IRSS calculated risk score. Analysis showed that there was a significant difference in overall survival between the two groups (P < 0.05), and the survival rate was lower in the high-risk group (Fig. 5C). The AUC values of 1, 2 and 3 years were 0.798, 0.756 and 0.706, respectively (Fig. 5D). The mRNA expression differences of prognostic genes in TCGA sets were represented by heat maps (Figure S2B).

According to IRSS, ICGC data were divided into high risk group and low risk group, and the survival rate difference between the two groups was also statistically significant (P < 0.05)(Fig. 5E). The AUC values of 1, 2 and 3 years were 0.757, 0.759 and 0.772, respectively (Fig. 5F). After multiple validation, we find that the model has high robustness and accuracy. The mRNA expression differences of prognostic genes in the ICGC dataset were represented by heat maps (Figure S2C). Meanwhile, in order to further verify the reliability of the model, we compared it with four published prediction models of HCC^(15–18), and the results showed that our model had higher accuracy and reliability (Fig. 6).

Independent Prognostic Analysis

To verify the reliability of the risk factors, independent prognostic analyses were performed. In univariate Cox analysis, risk score, Stage and T Stage were significantly correlated with OS (P < 0.05)(Fig. 7A). In multivariate Cox analysis, only risk score was confirmed as an independent predictor of OS (Fig. 7B). The above results again demonstrate the stability of the IRSS established by us. In order to further evaluate individual patients, Nomograms were used to simplify the statistical prediction model to comprehensively predict the prognosis of HCC patients by calculating the scores of clinical data and risk scores (Fig. 7C,D).

In addition, we demonstrated the correlation between clinical features and risk in heat map (Fig. 7E). Boxplot was used to show the clinical indicators and risk scores and the differences between the high and low risk groups. There were no significant differences in age, gender, N stage and M stage in the high and low risk groups (P > 0.05). Stage, Grade and T Stage were later with the increase of risk score (Figure S3).

Genomic Alterations Analyses

To determine whether risk score levels were associated with specific genomic traits, CNV and somatic mutation analyses were performed using the TCGA data set. According to the risk score levels, TP53(40%), CTNNB1(21%), TTN(21%) and MUC16(20%) had the highest mutation frequency in the high-risk group (Fig. 8A,B). In the low-risk group, CTNNB1(27%), TTN(26%), MUC16 and TP53(14%) were more frequent, and the mutation rate of TP53 was significantly higher in the high-risk group (Fig. 8C,D). TP53 is a well-known tumor suppressor gene, which is usually associated with poor prognosis⁽¹⁹⁾. Therefore, we also conducted survival analysis on TP53 mutation data, and the results showed that the survival rate of TP53 mutation group was significantly lower (P < 0.05) (Fig. 8E).

Correlation Analysis between Tumor Microenvironment and Stem Cells

Analysis of tumor microenvironment suggested that the expression level of tumor microsatellite instability(MSI) was significantly different in the high-risk group, and higher in the high-risk group (Fig. 9A). In order to explore the relationship between MSI and survival of HCC, we divided the samples into the high MSI expression group and the low MSI expression group according to the expression level of MSI. Survival analysis of the two groups showed that the high MSI expression group had a lower survival rate (P < 0.05)(Fig. 9B), which further confirmed the low survival rate of the high-risk group. There was no significant difference in tumor mutation burden(TMB) between high and low risk groups (Fig. 9C).

In recent years, tumor stem cells have been considered as the root cause of tumor occurrence, metastasis and recurrence. We analyzed the association between risk score and tumor stem cells and found that there was a significant correlation between the two. The higher the risk score, the higher the score of tumor stem cells (P < 0.05)(Fig. 9D). It can be inferred that the higher the risk score, the lower the degree of tumor differentiation.

Immunotherapy Analysis

The infiltration of immune cells and stromal cells in HCC tissues was analyzed according to the risk score groups, and the results showed that there was significant difference in stromal cell score between high-risk and low-risk groups (P < 0.05), but no significant difference in immune cells (P > 0.05)(Fig. 10A). Comparison of TIDE scores showed that there were fewer dysfunction and immune rejection T cells in the high-risk group for HCC (Fig. 10B). In order to explore the relationship between the efficacy of immune checkpoint inhibitors and risk score, the expression of PD-1 and CTLA-4 in the high-risk group was significantly higher than that in the low-risk group (P < 0.05) by analyzing the expression differences of immune checkpoint between the high-risk group and the low-risk group (Fig. 10C,D). There was no significant difference in PD-L1 between the two groups (P > 0.05)(Fig. 10E).

Consensus Clustering of Seven Prognostic Genes

Consensus clustering of the seven prognostic DEIGs identified two clusters of HCC in the TCGA and CGGA datasets with distinct clinical outcomes, clinical features and pathological features(Fig. 11A,B). According to expression similarity, k = 2 was selected with clustering stability rising from k = 2 to 10 in the TCGA and ICGC datasets. A contingency table showed consistency between clustered groups and risk groups in both TCGA and ICGC datasets(Figure S4). In the TCGA and ICGC datasets, the survival difference between the two clusters was significant(Fig. 11C,D). Between groups, PCA distribution was clearly separated in the TCGA and CGGA datasets(Figure S5A,B). The relationship between each subtype and risk score and prognosis is shown in Figure S5C,D.

Differential Analysis of Immune Cell Composition

Firstly, the content of immune cells in each sample in the TCGA dataset was analyzed and shown in a histogram (Fig. 12A). By analyzing these seven immune genes and immune function, we found that risk score was correlated with immune cell regulation, and in the high-risk group, B cells naive, T cells CD4 memory resting, NK cells activated, monocyte, Macrophages M1, Macrophages M2, Mast cells resting significantly increased (P < 0.05), B cells memory, T cells CD4 memory activated, T cells follicular helper, T cells regulatory(Tregs), M0 of Macrophages was significantly decreased (P < 0.05) (Fig. 12B). The correlation between immune cells and 7 prognostic genes is shown in Figure S6. In addition, we used heat maps to show differences in immune cell content between the high and low risk groups (Fig. 12C).

By studying and analyzing the differential immunity genes related to prognosis in HCC, we aim to further explore the targets that can effectively improve prognosis and therapeutic effect of hepatocellular carcinoma. The treatment of HCC is a global challenge, and its molecular pathogenesis varies with different genotoxic lesions and diseases. Although our understanding of the pathophysiology and drivers of disease has improved, this knowledge has not yet been translated into clinical practice⁽²⁰⁾. About 25% of HCC patients have operable mutations, but the incidence of most mutations is less than 10%, which makes the study more complicated⁽²¹⁾.

Currently, systemic therapies including immune checkpoint inhibitors (ICIs), tyrosine kinase inhibitors (TKI), and monoclonal antibodies challenge the use of conventional therapies for HCC. Tumors often upregulate immune checkpoints to avoid detection and killing by the host immune system. Activation of checkpoint cascades, such as those controlled by PD-1 or CTLA-4, leads to tumor-specific T cell inactivation and immune evasion^{(22, 23)}. Treatment with anti-PD-1, anti-PD-L1 or anti-CTLA-4 revitalizes T cells and allows the adaptive immune system to target tumor cells^{(24, 25)}. Inhibitors of PD1, PDL1, and CTLA-4 are pillars of clinical practice or systemic therapies under development for HCC. Data from the CheckMate-040 Phase I/II trial, presented at the American Society of Clinical Oncology Annual Meeting 2019, The combination of Opdivo (PD-1 Antibody) and Yervoy (CTLA-4 Antibody) yielded an objective response rate of 31% and a response time of 17.5 months⁽²⁶⁾. This result suggests that immunotherapy has a potential and surprising effect on HCC.

Tumor microenvironment (TME) of HCC refers to a complex and spatially structured mixture of liver parenchymal resident cells, tumor cells, immune cells and tumor-associated fibroblasts⁽²⁷⁾. These populations interact dynamically through intercellular contact and the release or recognition of cytokines and other soluble factors. This complex cellular interaction has a significant impact on tumor immune evasion. TMB is often used as a proxy for the number of neoantigens because the probability of recognizing neoantigen-specific T lymphocytes is associated with TMB⁽²⁸⁾. The frequency of these genetic changes for each type varies greatly between individual tumors and between tumor types⁽²⁹⁾. TMB can be used to predict Immune checkpoint blockade(ICB) efficacy and has become a useful biomarker for multiple cancer types to identify patients who will benefit from immunotherapy⁽³⁰⁾.

GAL, NR0B1, MAPT, CCR3, GLP1R, BIRC5 and IL17B were the Prognostic DEIGs included in the prognostic signature. which can preliminarily predict the prognosis and immunotherapy effect of HCC patients by analyzing their relationship with tumor microenvironment, immune invasion and immunotherapy. It is helpful for the systemic treatment of HCC. Anti-GAL antibodies have been confirmed to play a role in the immunotherapy of pancreatic cancer⁽³¹⁾. NR0B1 has also been confirmed to be related to the occurrence and development of a variety of tumors, for example, the transformed phenotype of Ewing's sarcoma requires sustained NR0B1 expression⁽³²⁾. Knockdown of NR0B1 can reduce the tumorigenic and anti-apoptotic potential of lung adenocarcinoma⁽³³⁾. MAPT is a microtubule-related protein tau, which can inhibit the function of taxanes, and its high expression reduces the sensitivity to taxanes, which is of great significance in breast cancer research⁽³⁴⁾. Studies have also confirmed that MAPT is often methylated, and hypermethylation is associated with poor prognosis in patients with stage II colorectal cancer⁽³⁵⁾. Both CCR3 and IL17B have been found to be related to the role of tumor microenvironment in regulating tumor growth and metastasis, and may be new immunotherapy targets^{(36, 37)}. Previous studies have shown that glucagon-like peptide-1 receptor (GLP-1R) is essential for the regulation of glucose homeostasis, and in recent years, it has been found to be related to the occurrence and development of tumors^{(38, 39)}. BIRC5 has been confirmed to be related to the occurrence of a variety of tumors, and its role in the progression of liver cancer has also been confirmed by studies⁽⁴⁰⁾. However, there are few studies on its immunity to liver cancer, which is worth exploring.

In addition, by exploring tumor immune microenvironment and gene mutations, our study found that mutation rates of MSI and TP53 may also be independent prognostic indicators. The higher the risk score, the lower the survival rate. However, patients also had a lower immune escape frequency fraction, suggesting that the high-risk group may have a better effect on immunotherapy. The expression levels of PD-1 and CTLA-4 were higher in the high-risk group. Therefore, the treatment effect of anti-PD-1 and anti-CTLA-4 is better, which is expected to improve the prognosis of patients. Up to now, there are many prognostic models for HCC, but due to the complexity of HCC, no one model has been considered as the gold standard. Our study focused on the prognosis and immunoassay of hepatocellular carcinoma, combined with the current hot spot of immunotherapy, aiming to find effective immunotherapy targets for hepatocellular carcinoma. The disadvantages of this study are as follows: The clinical data in TCGA and ICGC databases did not specify in detail whether patients had received chemotherapy or radiotherapy, which may have certain influence on patient survival data and immunoassay results.

In summary, a robust immune-related prognosis model was constructed and tumor microenvironment and immune function were analyzed, providing potential targets for immunotherapy of HCC.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Ethics Committee of the Second Affiliated Hospital of Nanchang University. No personal information is involved, so informed consent is not required.

Consent for publication

All the authors report no disclosures relevant to the manuscript.

Competing interests

The authors declare that the have no competing of interest.

Availability of data and materials

The dataset supporting the conclusions of this article is included within the article and its additional file. TCGA-LIHC is available at https://portal.gdc.cancer.gov/; ICGC-LIRI-JP is available at https://dcc.icgc.org/releases/current/Projects/LINC-JP.

Author contributions

Liang Sun, Zhengyi Wu and Cairong Dong have contributed equally to this work.

LS , ZW and CD conceived the study, performed the data analysis, and wrote the manuscript. SY and HH performed bioinformatics analyses and assisted with analyzing other data. ZC, ZW, and XY helped to revise the manuscript. All authors have read and approved the final manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (81760439).

Acknowledgements

Not applicable.

Bray F, Ferlay J, Soerjomataram I, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394-424.
Rahib L, Smith BD, Aizenberg R, et al. Projecting cancer incidence and deaths to 2030: the unexpected burden of thyroid, liver, and pancreas cancers in the United States. Cancer Res. 2014;74(11):2913-21.
McGlynn KA, Petrick JL, London WT. Global epidemiology of hepatocellular carcinoma: an emphasis on demographic and regional variability. Clin Liver Dis. 2015;19(2):223-38.
Xu F, Jin T, Zhu Y, et al.. Immune checkpoint therapy in liver cancer. J Exp Clin Cancer Res. 2018;37(1):110.
Ricke J, Klümpen HJ, Amthauer H, et al. Impact of combined selective internal radiation therapy and sorafenib on survival in advanced hepatocellular carcinoma. J Hepatol. 2019;71(6):1164-74.
Bruix J, Qin S, Merle P, et al. Regorafenib for patients with hepatocellular carcinoma who progressed on sorafenib treatment (RESORCE): a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet. 2017;389(10064):56-66.
Kudo M, Finn RS, Qin S, et al. Lenvatinib versus sorafenib in first-line treatment of patients with unresectable hepatocellular carcinoma: a randomised phase 3 non-inferiority trial. Lancet. 2018;391(10126):1163-73.
Hinshaw DC, Shevde LA. The Tumor Microenvironment Innately Modulates Cancer Progression. Cancer Res. 2019;79(18):4557-66.
Greten TF, Lai CW, Li G, et al. Targeted and Immune-Based Therapies for Hepatocellular Carcinoma. Gastroenterology. 2019;156(2):510-24.
Huang A, Yang X-R, Chung W-Y, et al. Targeted therapy for hepatocellular carcinoma. Signal Transduct Target Ther. 2020;5(1):146.
Tolba MF. Revolutionizing the landscape of colorectal cancer treatment: The potential role of immune checkpoint inhibitors. Int J Cancer. 2020;147(11):2996-3006.
Jiang Y, Han Q-J, Zhang J. Hepatocellular carcinoma: Mechanisms of progression and immunotherapy. World J Gastroenterol. 2019;25(25):3151-67.
Chew V, Lai L, Pan L, et al. Delineation of an immunosuppressive gradient in hepatocellular carcinoma using high-dimensional proteomic and transcriptomic analyses. Proc Natl Acad Sci U S A. 2017;114(29):E5900-E9.
Robin X, Turck N, Hainard A, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:77.
Gu X, Guan J, Xu J, et al. Model based on five tumour immune microenvironment-related genes for predicting hepatocellular carcinoma immunotherapy outcomes. J Transl Med. 2021;19(1):26.
Ju M, Jiang L, Wei Q, et al. A Immune-Related Signature Associated with TME Can Serve as a Potential Biomarker for Survival and Sorafenib Resistance in Liver Cancer. Onco Targets Ther. 2021;14:5065-83.
Peng Y, Liu C, Li M, et al. Identification of a prognostic and therapeutic immune signature associated with hepatocellular carcinoma. Cancer Cell Int. 2021;21(1):98.
Wang Z, Pan L, Guo D, et al. A novel five-gene signature predicts overall survival of patients with hepatocellular carcinoma. Cancer Med. 2021;10(11):3808-21.
Olivier M, Hollstein M, Hainaut P. TP53 mutations in human cancers: origins, consequences, and clinical use. Cold Spring Harb Perspect Biol. 2010;2(1):a001008.
Bruix J, Sherman M. Management of hepatocellular carcinoma. Hepatology. 2005;42(5):1208-36.
Wu Y, Liu Z, Xu X. Molecular subtyping of hepatocellular carcinoma: A step toward precision medicine. Cancer Commun (Lond). 2020;40(12):681-93.
Brown KE, Freeman GJ, Wherry EJ, et al. Role of PD-1 in regulating acute infections. Curr Opin Immunol. 2010;22(3):397-401.
Iwai Y, Ishida M, Tanaka Y, et al. Involvement of PD-L1 on tumor cells in the escape from host immune system and tumor immunotherapy by PD-L1 blockade. Proc Natl Acad Sci U S A. 2002;99(19):12293-7.
Li X, Shao C, Shi Y. et al. Lessons learned from the blockade of immune checkpoints in cancer immunotherapy. J Hematol Oncol. 2018;11(1):31.
Rowshanravan B, Halliday N, Sansom DM. CTLA-4: a moving target in immunotherapy. Blood. 2018;131(1):58-67.
Del Pozo Martín Y. 2019 ASCO Annual Meeting. Lancet Oncol. 2019;20(7):909-10.
Nakano S, Eso Y, Okada H, et al. Recent Advances in Immunotherapy for Hepatocellular Carcinoma. Cancers (Basel). 2020;12(4).
Chan TA, Yarchoan M, Jaffee E, et al. Development of tumor mutation burden as an immunotherapy biomarker: utility for the oncology clinic. Ann Oncol. 2019;30(1):44-56.
Chalmers ZR, Connelly CF, Fabrizio D, et al. Analysis of 100,000 human cancer genomes reveals the landscape of tumor mutational burden. Genome Med. 2017;9(1):34.
Sugawara S, Lee JS, Kang JH, et al. Nivolumab with carboplatin, paclitaxel, and bevacizumab for first-line treatment of advanced nonsquamous non-small-cell lung cancer. Ann Oncol. 2021;32(9):1137-47.
Tanemura M, Miyoshi E, Nagano H, et al. Role of α-gal epitope/anti-Gal antibody reaction in immunotherapy and its clinical application in pancreatic cancer. Cancer Sci. 2013;104(3):282-90.
Kinsey M, Smith R, Lessnick SL. NR0B1 is required for the oncogenic phenotype mediated by EWS/FLI in Ewing's sarcoma. Mol Cancer Res. 2006;4(11):851-9.
Susaki Y, Inoue M, Minami M, et al. Inhibitory effect of PPARγ on NR0B1 in tumorigenesis of lung adenocarcinoma. Int J Oncol. 2012;41(4):1278-84.
Ikeda H, Taira N, Hara F, et al. The estrogen receptor influences microtubule-associated protein tau (MAPT) expression and the selective estrogen receptor inhibitor fulvestrant downregulates MAPT and increases the sensitivity to taxane in breast cancer cells. Breast Cancer Res. 2010;12(3):R43.
Wang C, Liu Y, Guo W, et al. promoter CpG island hypermethylation is associated with poor prognosis in patients with stage II colorectal cancer. Cancer Manag Res. 2019;11:7337-43.
Liu F, Wu H. CC Chemokine Receptors in Lung Adenocarcinoma: The Inflammation-Related Prognostic Biomarkers and Immunotherapeutic Targets. J Inflamm Res. 2021;14:267-85.
Fabre J, Giustiniani J, Garbar C, et al. Targeting the Tumor Microenvironment: The Protumor Effects of IL-17 Related to Cancer Type. Int J Mol Sci. 2016;17(9).
Kanda R, Hiraike H, Wada-Hiraike O, et al. Expression of the glucagon-like peptide-1 receptor and its role in regulating autophagy in endometrial cancer. BMC Cancer. 2018;18(1):657.
He L, Zhang S, Zhang X, et al. Effects of insulin analogs and glucagon-like peptide-1 receptor agonists on proliferation and cellular energy metabolism in papillary thyroid cancer. Onco Targets Ther. 2017;10:5621-31.
Zhang M, Yan X, Wen P, et al. CircANKRD52 Promotes the Tumorigenesis of Hepatocellular Carcinoma by Sponging miR-497-5p and Upregulating BIRC5 Expression. Cell Transplant. 2021;30:9636897211008874.

No competing interests reported.

Additionalfile1.xlsx
Additional file 1Table S1: Difference analysis between normal and tumor groups of LIHC.
Additionalfile2.xlsx
Additional file 2Table S2: Univariate cox regression for DEIGs.
Additionalfile3.xlsx
Additional file 3Table S3: The multivariate Cox regression analysis.
Additionalfile4.docx
Additional file 4Figure S1: The expression of seven IRSS genes between normal and tumor.
Additionalfile5.docx
Additional file 5Figure S2: Heat map results of seven IRSS genes expression levels.
Additionalfile6.docx
Additional file 6Figure S3: The differences in risk scores across clinical features.
Additionalfile7.docx
Additional file 7Figure S4: Identification of consensus clusters by prognostic genes.
Additionalfile8.docx
Additional file 8Figure S5: Clustering analyses in TCGA and ICGC.
Additionalfile9.docx
Additional file 9Figure S6: Correlation between immune cells and seven genes in TCGA.

Download PDF

Journal Publication

published 14 Jun, 2023

Read the published version in BMC Medical Genomics →

Editorial decision: Major revision
07 Dec, 2022
Reviews received at journal
04 Dec, 2022
Reviewers agreed at journal
23 Nov, 2022
Reviews received at journal
16 Jun, 2022
Reviewers agreed at journal
16 Jun, 2022
Reviewers invited by journal
03 Jun, 2022
Editor assigned by journal
19 May, 2022
Editor invited by journal
10 May, 2022
Submission checks completed at journal
10 May, 2022
First submitted to journal
08 May, 2022

You are reading this latest preprint version

Signature Construction and Molecular Subtype Identification Based on Immune-Related Genes for Better Prediction of Prognosis in Hepatocellular Carcinoma

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Materials And Methods

Data Preparation and Processing

Differential Expression Analysis and Prognostic Gene Screening

Copy Number Variation and Functional Enrichment Analysis

Establishment of Immune Risk Scoring Signature (IRSS) for Prognosis

Validation of The Risk Score with TCGA and ICGC Datasets

Genomic Alterations Analyses

Association between Microsatellite Instability (MSI) and Constructed Predictive Models

Analysis of Tumor Mutation Burden (TMB) and Immune Escape

Consensus Clustering of Prognostic Genes

Statistical Analysis

Results

Prognostic DEIGs Screening

Prognostic DEIGs Functional Enrichment and Genetic Alterations

Construction and Prognostic Value of IRSS

Validation of The Risk Score with TCGA and ICGC Datasets

Independent Prognostic Analysis

Genomic Alterations Analyses

Correlation Analysis between Tumor Microenvironment and Stem Cells

Immunotherapy Analysis

Consensus Clustering of Seven Prognostic Genes

Differential Analysis of Immune Cell Composition

Discussion

Conclusion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1