Predicting the efficacy of immune checkpoint inhibitors monotherapy in advanced non-small cell lung cancer: a machine learning method based on multidimensional data

doi:10.21203/rs.3.rs-1688580/v1

Research Article

https://doi.org/10.21203/rs.3.rs-1688580/v1

This work is licensed under a CC BY 4.0 License

Version 1

Background: Immunotherapy has improved the prognosis of patients with advanced non-small cell lung cancer (NSCLC), but only a small subset of patients achieves clinical benefit. This study aimed to integrate multidimensional data using a machine learning method to predict the therapeutic efficacy of immune checkpoint inhibitors (ICIs) monotherapy in patients with advanced NSCLC.

Methods: We retrospectively enrolled 112 patients with stage IIIB-IV NSCLC receiving ICIs monotherapy. The random forest (RF) algorithm was used to establish efficacy prediction models based on five input datasets: precontrast computed tomography (CT) radiomic data, postcontrast CT radiomic data, the combination of the two CT radiomic datasets, clinical data, and the combination of radiomic and clinical data. Five-fold cross validation was used to train and test the random forest classifier. Model performance was assessed by the area under the receiver operating characteristic (ROC) curve (AUC). Survival analysis was performed to compare progression-free survival (PFS) between the two groups defined by the prediction label generated by the combined model.

Results: The radiomic model based on the combination of precontrast and postcontrast CT radiomic features and the clinical model produced AUCs of 0.92 ± 0.04 and 0.89 ± 0.03, respectively. By integrating radiomic and clinical features, the combined model achieved the best performance, with an AUC of 0.94 ± 0.02. The survival analysis showed that the two groups had significantly different PFS (P < 0.0001).

Conclusion: Baseline multidimensional data, including CT radiomic and multiple clinical features, were valuable in predicting the efficacy of ICIs monotherapy in patients with advanced NSCLC.

Non-small cell lung cancer (NSCLC)

Immune checkpoint inhibitors

Monotherapy

Radiomics

Clinical features

Machine learning

Lung cancer is the most common cancer worldwide, seriously endangering human health and life, and non-small cell lung cancer (NSCLC) accounts for 85% of lung cancers. Multiple clinical trials [1–3] have demonstrated that immune checkpoint inhibitors (ICIs) targeting the programmed death 1 (PD-1)/programmed death ligand 1 (PD-L1) signaling pathway significantly improve survival in patients with advanced NSCLC, and ICIs are recommended by treatment guidelines for driver gene-negative advanced NSCLC [4]. However, only about 20% of patients with advanced NSCLC respond to immune monotherapy in an unselected population [1, 2]. Additionally, patients who do not respond to immunotherapy not only incur high drug costs but may also suffer serious immune-related adverse events. Therefore, it is crucial to identify potential beneficiaries of immunotherapy early.

PD-L1 expression is the most widely used and evidence-based positive predictor of immunotherapy efficacy in NSCLC patients [5, 6]. However, the predictive value of PD-L1 expression is controversial in clinical practice, because a survival benefit can also be observed in the PD-L1 negative subgroup [1] and the expression of PD-L1 in tumors is spatially and temporally heterogeneous [7]. In addition, the tumor mutation burden (TMB) [8] is currently recognized as another immunotherapy biomarker for screening potential beneficiaries, but it is limited by inconsistent thresholds and high detection cost. Therefore, it is necessary to identify inexpensive, noninvasive, and easily available biomarkers to predict the efficacy of immunotherapy.

Many previous studies [9–13] have demonstrated that computed tomography (CT) radiomics can be used as a noninvasive imaging marker to predict the outcomes of immunotherapy for NSCLC. In addition, multiple clinical factors are considered to be related to the prognosis of immunotherapy, such as histologic type [14], liver metastasis [15], and some peripheral blood inflammatory indicators [16]. Yang et al. [9] combined CT radiomics and clinicopathological characteristics to predict the clinical outcome of immunotherapy in lung cancer patients, but the study cohort was heterogeneous because it included both monotherapy and immunotherapy in combination with chemotherapy, and the mechanism of action of chemotherapy is completely different from that of immunotherapy. Some studies [10–13] predicted the efficacy of monotherapy, but did not incorporate clinical factors into the analysis. Given tumor heterogeneity and the complex anti-tumor mechanism of immunotherapy, radiomics alone characterizes only the internal heterogeneity of tumor tissue, whereas multidimensional data can more comprehensively evaluate the biological behavior of the tumor and the physiological status of the body.

Moreover, outcomes based on the duration of benefit reflect the real benefit of immunotherapy more faithfully, because they exclude patients with rare short-term responses [17]. In clinical practice, durable clinical benefit (DCB), defined as progression-free survival (PFS) of more than 6 months, is usually used to measure immunotherapy efficacy.

We hypothesize that the integration of multidimensional data, including CT radiomics, demographic characteristics, clinical characteristics, and peripheral blood indicators, using a machine learning method will be able to predict immunotherapy efficacy, defined as DCB in our study. In order to identify advanced NSCLC patients who will benefit from immunotherapy itself, our study only enrolled patients treated with PD-1/PD-L1 monotherapy to reduce the heterogeneity of the study cohort. A flow chart of our study is shown in Fig. 1.

Patients

We retrospectively included 180 patients with pathologically confirmed NSCLC at Zhejiang Cancer Hospital between January 2016 and September 2020. Inclusion criteria were as follows: (1) older than 18 years; (2) treated with PD-1/PD-L1 monotherapy; (3) stage IIIB/IIIC or IV, according to the 8th edition TNM staging system formulated by the International Association for the Study of Lung Cancer (IASLC); (4) follow-up time more than 6 months before progressive disease; (5) no artifacts in CT images. The exclusion criteria were as follows: (1) no postcontrast CT image within 30 days before the start of immunotherapy; (2) no measurable (long diameter >10 mm) primary lung cancer lesions; (3) advanced NSCLC with postoperative recurrence and metastasis; (4) loss to imaging follow-up after treatment; (5) death 10 days after the first treatment due to severe immune-related pneumonia. The study ultimately enrolled 112 eligible patients. The inclusion and exclusion diagram is shown in Fig. 2.

This study was approved by the Ethics Committee of Zhejiang Cancer Hospital. Informed consent was waived as private information on these patients was hidden in the retrospective data.

Clinical features

We included a total of 20 clinical features, which were all previously reported to be associated with the prognosis of immunotherapy. The clinical features in our study included baseline demographic characteristics, clinical characteristics, and peripheral blood indicators, as follows:

The demographic and clinical characteristics included age [18], sex [19], body mass index (BMI, kg/m²) [20], smoking history [21], chronic obstructive pulmonary disease (COPD) [22], Eastern Cooperative Oncology Group (ECOG) score [23], histologic type [14], type of ICIs [24], therapy line [25], tumor stage [26], bone metastasis [27], brain metastasis [28], liver metastasis [15], and pleural effusion [29].

Laboratory data were obtained within 2 weeks prior to the first ICIs treatment. The final peripheral blood indicators for data analysis included hemoglobin (g/dL) [30], serum albumin (g/dL) [31], lactate dehydrogenase (LDH) (U/L) [32], and composite inflammation indicators including the neutrophil-to-lymphocyte ratio (NLR) [33], platelet-to-lymphocyte ratio (PLR) [33], and lymphocyte-to-monocyte ratio (LMR) [34].
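
The three composite inflammation indicators are simple ratios of routine blood counts. The following is a minimal sketch, assuming absolute counts are supplied in the same unit (for example 10^9 cells/L); the function name and the example values are hypothetical and are not taken from the study data.

```python
def inflammation_ratios(neutrophils, lymphocytes, monocytes, platelets):
    """Return NLR, PLR and LMR computed from absolute blood counts.

    All four counts are assumed to share one unit (e.g. 10^9 cells/L),
    so the ratios are dimensionless.
    """
    if lymphocytes <= 0 or monocytes <= 0:
        raise ValueError("lymphocyte and monocyte counts must be positive")
    return {
        "NLR": neutrophils / lymphocytes,  # neutrophil-to-lymphocyte ratio
        "PLR": platelets / lymphocytes,    # platelet-to-lymphocyte ratio
        "LMR": lymphocytes / monocytes,    # lymphocyte-to-monocyte ratio
    }


# Illustrative values only (not from the study cohort).
ratios = inflammation_ratios(
    neutrophils=4.8, lymphocytes=1.5, monocytes=0.6, platelets=260.0
)
```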

Efficacy evaluation criteria and follow-up

If the patients received multi-line immunotherapy, the analysis was performed using the first immunotherapy. Response assessment included complete response (CR), partial response (PR), stable disease (SD) and progressive disease (PD) based on the Response Evaluation Criteria in Solid Tumors (RECIST), version 1.1 [35]. When the results of PD, retrospectively determined according to RECIST 1.1, were inconsistent with the results determined by clinicians based on the conditions of patients, PD cases identified by clinicians in real time were regarded as events. The therapeutic efficacy was defined as DCB (CR, PR or SD lasting > 6 months) and no durable benefit (NDB: PD or SD lasting ≤ 6 months) [17]. PFS was defined as the time from the first ICIs treatment to disease progression or death from any cause, and patients without progression were censored at the time of the last clinical visit.
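
The DCB/NDB endpoint can be expressed as a small labeling rule. The sketch below assumes that each patient is summarized by a PFS time in months and a flag indicating whether progression or death was observed; the function and constant names are hypothetical, and the handling of patients censored before 6 months (returned as unlabeled) is an assumption that mirrors inclusion criterion (4) rather than a rule stated in the text.

```python
from typing import Optional

DCB_THRESHOLD_MONTHS = 6.0  # threshold stated in the text


def label_benefit(pfs_months: float, progressed: bool) -> Optional[str]:
    """Label a patient as DCB or NDB from PFS.

    pfs_months: time from first ICIs treatment to progression, death,
                or last clinical visit.
    progressed: True if progression or death was observed, False if the
                patient was censored at the last visit.
    """
    if pfs_months > DCB_THRESHOLD_MONTHS:
        return "DCB"  # benefit lasted more than 6 months
    if progressed:
        return "NDB"  # event at or before 6 months
    return None       # censored too early to be labeled


# Illustrative calls only.
labels = [
    label_benefit(11.9, True),
    label_benefit(1.9, True),
    label_benefit(6.0, True),
    label_benefit(4.0, False),
]
```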

Image acquisition

The CT scans of all patients were acquired with a 16 or 64 row multi-slice spiral CT (Siemens SOMATOM Sensation 16; Siemens SOMATOM Definition Flash 64; GE Optima CT680). During the scan, the patients were instructed to hold their breath at the end of deep inhalation to avoid breathing motion artifacts. The tube voltage was 120 kV, and the tube current was 150-200 mAs with automatic adjustment. The scanning range was continuous from the lung apex to the lung bottom, and the pitch was 1.2-1.375. The slice thickness and slice spacing were both 5 mm. The CT images were reconstructed with a 512 × 512 matrix. In contrast scanning, a high-pressure syringe was used to inject non-ionic contrast agent into the anterior elbow vein. The injection rate was 2.0-2.5 mL/s, and the injection volume was 80-100 mL. The contrast scanning was delayed by 38-40 s.

Image segmentation and feature extraction

Image segmentation and feature extraction were performed with YITU AI Enabler, using Python pyradiomics (version 3.0.1). All imaging data were preprocessed by resampling to a voxel size of 1 mm × 1 mm × 1 mm to minimize the impact of different scanning protocols or equipment on quantitative radiomics analysis. Manual segmentation can ensure the accuracy of the region of interest (ROI) and is the gold standard for clinical segmentation. The primary lung lesions were delineated as ROIs layer-by-layer for the entire tumor by a radiologist (NL, 5 years of experience in diagnosing thoracic tumors). Another senior radiologist (LS, 15 years of experience in diagnosing thoracic tumors) then confirmed and adjusted the outlined boundary. Both radiologists were blinded to the therapeutic efficacy.
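
To make the effect of isotropic resampling concrete, the sketch below computes the voxel grid that results from resampling a volume to 1 mm spacing. It is arithmetic only and does not perform interpolation; the in-plane pixel spacing of 0.7 mm and the 60-slice volume are assumed values for illustration (the text states only the 512 × 512 matrix and the 5 mm slice thickness), and the function name is hypothetical.

```python
def resampled_shape(shape, spacing_mm, target_mm=(1.0, 1.0, 1.0)):
    """Grid size after resampling a volume to the target voxel spacing.

    shape:      number of voxels along (x, y, z).
    spacing_mm: original voxel spacing along (x, y, z) in millimetres.
    The physical extent (voxels * spacing) is preserved per axis.
    """
    return tuple(
        max(1, round(n * s / t))
        for n, s, t in zip(shape, spacing_mm, target_mm)
    )


# Assumed example: 512 x 512 matrix, 0.7 mm pixels, 60 slices of 5 mm.
new_shape = resampled_shape((512, 512, 60), (0.7, 0.7, 5.0))
```

With 5 mm slices, the through-plane axis is upsampled fivefold, so most resampled slices are interpolated rather than measured.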

ROIs were delineated on the postcontrast CT images to avoid blood vessels and atelectasis as far as possible, and then the ROIs were copied to the precontrast CT images. Nine hundred and sixty features were first extracted from each patient based on precontrast and postcontrast CT images, respectively. Then a feature stability check was performed with minor changes of ROIs to filter out unstable features using intraclass correlation coefficients (ICC) between the features extracted within the lesion ROIs and the extended lesion ROIs. The extended lesion ROIs were produced by extending the boundary of ROIs by 1 image pixel. The features with an ICC greater than 0.8 were preserved as stable features.
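
The stability check can be sketched as follows. The text does not state which ICC form was used, so this example assumes ICC(3,1) (two-way mixed effects, consistency, single measurement) with the original and extended ROIs as the two "raters". The feature names and values are hypothetical.

```python
def icc_consistency(x, y):
    """ICC(3,1) for two paired measurement series (assumed ICC form)."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two equally long series with >= 2 subjects")
    k = 2  # two measurements per subject: original ROI and extended ROI
    grand = (sum(x) + sum(y)) / (n * k)
    row_means = [(a + b) / k for a, b in zip(x, y)]
    col_means = [sum(x) / n, sum(y) / n]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for v in list(x) + list(y))
    ss_error = max(ss_total - ss_rows - ss_cols, 0.0)
    ms_rows = ss_rows / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    denom = ms_rows + (k - 1) * ms_error
    return (ms_rows - ms_error) / denom if denom > 0 else 0.0


def stable_features(original, extended, threshold=0.8):
    """Keep features whose ICC between the two ROI versions exceeds threshold."""
    return [
        name for name in original
        if icc_consistency(original[name], extended[name]) > threshold
    ]


# Hypothetical feature values for five patients.
original = {
    "shape_volume": [10.0, 20.0, 30.0, 40.0, 50.0],
    "glcm_noise": [1.0, 2.0, 3.0, 4.0, 5.0],
}
extended = {
    "shape_volume": [11.0, 21.5, 31.0, 42.0, 52.0],
    "glcm_noise": [5.0, 1.0, 4.0, 2.0, 3.0],
}
stable = stable_features(original, extended)
```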

In precontrast CT images, there were 790 stable features (Supplementary Fig. S1) from each patient including 14 shape features, 167 first-order statistics features, 213 gray level co-occurrence matrix (GLCM) features, 131 gray level difference matrix (GLDM) features, 155 gray level run length matrix (GLRLM) features and 110 gray level size zone matrix (GLSZM) features. In postcontrast CT images, there were 767 stable features (Supplementary Fig. S2) from each patient including 14 shape features, 161 first-order statistics features, 196 GLCM features, 141 GLDM features, 151 GLRLM features, and 104 GLSZM features.

Model construction

We used recursive feature elimination (RFE) to select the 10 radiomic features most related to therapeutic efficacy from each of the precontrast and postcontrast radiomic datasets. The scikit-learn package (version 1.0.2) in Python (version 3.9.7) was used for model construction and evaluation. We performed random over-sampling (imblearn package; version 0.9.0) of the minority class and used these balanced datasets for developing machine learning models. All code is available at https://github.com/BioAI-kits/RadClin.
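
Random over-sampling duplicates minority-class samples until the classes are balanced. The sketch below is a standard-library stand-in for that idea, not the imblearn implementation used in the study; the function name and seed are hypothetical, and the class sizes mirror the 39 DCB and 73 NDB patients reported for this cohort.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority samples until classes are balanced."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())  # size of the largest class
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, lab in zip(rows, labels) if lab == cls]
        for _ in range(target - n):  # zero iterations for the majority class
            out_rows.append(rng.choice(pool))
            out_labels.append(cls)
    return out_rows, out_labels


# Patient indices stand in for feature rows.
labels = ["DCB"] * 39 + ["NDB"] * 73
rows = list(range(len(labels)))
balanced_rows, balanced_labels = random_oversample(rows, labels)
balanced_counts = Counter(balanced_labels)
```

The text does not state whether over-sampling was applied before or inside the cross-validation loop. A common design choice is to over-sample only the training folds, so that duplicated patients cannot appear in both the training and test portions of a split.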

In order to select the most suitable machine learning method for fitting radiomic data, efficacy classification models were constructed based on radiomic features using different machine learning algorithms, including logistic regression (LR), support vector machine (SVM), multi-layer perceptron (MLP), and random forest (RF). For each machine learning algorithm, we used a three-step approach to build the model. First, we constructed models using various combinations of tunable hyperparameters that were adjusted depending on the algorithm. Second, we tested the performance of the model for each hyperparameter combination using the average AUC from 5-fold cross validation. Finally, we selected the hyperparameters with the highest average AUC for each algorithm. We then rebuilt the model of each algorithm with the best hyperparameters and evaluated these models with multiple metrics, including AUC, balanced accuracy, specificity, and sensitivity, to select the optimal algorithm.
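
The evaluation metrics named above can be written out explicitly. The following is a minimal sketch in plain Python, assuming binary labels (1 = DCB, 0 = NDB), a decision threshold of 0.5 and toy scores; it computes the AUC as the probability that a positive case is ranked above a negative one, and balanced accuracy as the mean of sensitivity and specificity. All names and values are illustrative.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity, balanced accuracy) for binary labels."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity, (sensitivity + specificity) / 2


def roc_auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example: 3 positives, 4 negatives.
y_true = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]

sens, spec, bal_acc = sensitivity_specificity(y_true, y_pred)
auc = roc_auc(y_true, scores)
```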

Based on the optimal machine learning algorithm, we further constructed five RF models with different input datasets. The dataset and corresponding model were as follows: precontrast CT radiomic features, precontrast model; postcontrast CT radiomic features, postcontrast model; precontrast and postcontrast CT radiomic features, radiomic model; clinical features, clinical model; combined clinical and radiomic features, combined model. The construction of the models was consistent with the methods above. We evaluated prediction performance of different models using the AUC in the ROC curves. In addition, the calibration curves were generated as a supplement to the model evaluation to visualize the goodness of fit of predictive models. The patients were divided into two groups with the prediction label (predicted DCB vs. predicted NDB), which was finally generated from the combined model. Survival analysis was then performed on the PFS of these two groups of patients.
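
The comparison of RF models across input datasets can be sketched with scikit-learn as below. This is an illustration on synthetic data, not the authors' pipeline: the data come from `make_classification`, the split of columns into "radiomic" and "clinical" blocks is arbitrary, and the hyperparameters and random seeds are assumptions, so the resulting AUCs are not comparable with those reported in this study.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

# Synthetic stand-in: 112 samples, 40 features, roughly 35% positives.
X, y = make_classification(
    n_samples=112, n_features=40, n_informative=8,
    weights=[0.65, 0.35], random_state=0,
)

# Hypothetical column blocks playing the role of the input datasets.
feature_sets = {
    "radiomic": slice(0, 20),
    "clinical": slice(20, 40),
    "combined": slice(0, 40),
}

cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
results = {}
for name, cols in feature_sets.items():
    model = RandomForestClassifier(n_estimators=100, random_state=0)
    # One AUC per fold; summarize as mean and standard deviation.
    aucs = cross_val_score(model, X[:, cols], y, cv=cv, scoring="roc_auc")
    results[name] = (float(aucs.mean()), float(aucs.std()))

for name, (mean_auc, sd_auc) in results.items():
    print(f"{name}: AUC {mean_auc:.2f} ± {sd_auc:.2f}")
```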

Statistical analysis

Clinical features were compared using SPSS 26.0. Continuous variables are presented as mean (standard deviation, SD) or median (interquartile range, IQR) and were compared by the independent-samples t test or the Mann-Whitney U test, as appropriate. Categorical variables were compared by the Chi-square or Fisher’s exact test, as appropriate. Kaplan-Meier analysis was used to generate survival curves, and the log-rank test was performed to compare PFS between the two groups in R (survminer; version 0.4.9). All statistical tests were two-sided, and differences were considered statistically significant at P < 0.05.
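
For readers unfamiliar with the log-rank test, the sketch below implements the two-group version in plain Python. It is a stand-in for the R analysis described above, not a reproduction of it; the PFS times (in months) and event flags are toy values, and the P value uses the chi-square distribution with one degree of freedom.

```python
import math


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group log-rank test; returns (chi-square statistic, P value).

    events_*: 1 if progression or death was observed, 0 if censored.
    """
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e})

    observed_a = expected_a = variance = 0.0
    for t in event_times:
        at_risk_a = sum(1 for u, _, g in data if u >= t and g == 0)
        at_risk_b = sum(1 for u, _, g in data if u >= t and g == 1)
        n = at_risk_a + at_risk_b
        d = sum(1 for u, e, _ in data if u == t and e)
        d_a = sum(1 for u, e, g in data if u == t and e and g == 0)
        observed_a += d_a
        expected_a += d * at_risk_a / n  # expected events in group A
        if n > 1:
            # Hypergeometric variance of the event count in group A.
            variance += d * (at_risk_a / n) * (at_risk_b / n) * (n - d) / (n - 1)

    if variance == 0:
        raise ValueError("log-rank statistic is undefined for these data")
    chi2 = (observed_a - expected_a) ** 2 / variance
    p_value = math.erfc(math.sqrt(chi2 / 2.0))  # chi-square survival, 1 df
    return chi2, p_value


# Toy data: group A progresses late, group B progresses early.
chi2, p_value = logrank_test(
    times_a=[8, 12, 14, 20, 24], events_a=[1, 1, 0, 1, 0],
    times_b=[1, 2, 2, 3, 4], events_b=[1, 1, 1, 1, 1],
)
```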

The clinical features of patients

The baseline demographic characteristics, clinical characteristics, and peripheral blood indicators of the 112 patients are presented in Table 1. Thirty-nine (34.82%) patients achieved DCB, and the overall median PFS time was 2.8 months. The mean age of the patients was 59.43 years (± 7.98). There were 85 males (75.89%) and 27 females (24.11%). Fifty (44.64%) patients were diagnosed with squamous cell carcinoma (SCC), and 62 (55.36%) patients were diagnosed with non-squamous cell carcinoma (NSCC). All patients received ICIs monotherapy, including 32 (28.57%) patients treated with anti-PD-L1 drugs and 80 (71.43%) patients treated with anti-PD-1 drugs. Thirteen (11.61%) patients were treated with first-line ICIs, 85 (75.89%) patients with second-line ICIs, and 14 (12.50%) patients with third-line or above ICIs. A higher proportion of patients had stage IV tumors (90, 80.36%) than stage IIIB/IIIC tumors (22, 19.64%). There were significant differences in histologic type, tumor stage, and hemoglobin (P = 0.026, P = 0.005, and P = 0.044, respectively). The PD-L1 status was known in a small percentage of patients, including 18 (16.07%) patients with positive status and 5 (4.46%) patients with negative status. No statistical analysis of PD-L1 status was performed in this study.

Table 1

Demographic characteristics, clinical characteristics, and peripheral blood indicators of patients
Characteristics	Total (N = 112)	DCB (N = 39)	NDB (N = 73)	P
Age, mean (SD)	59.43 (7.98)	59.72 (7.96)	59.27 (8.04)	0.781
Sex, N (%)				0.115
male	85 (75.89)	33 (84.62)	52 (71.23)
female	27 (24.11)	6 (15.38)	21 (28.77)
BMI, mean (SD)	22.62 (2.62)	23.22 (3.36)	22.30 (2.08)	0.129
Smoking history, N (%)				0.611
yes	77 (68.75)	28 (71.79)	49 (67.12)
no	35 (31.25)	11 (28.21)	24 (32.88)
COPD, N (%)				0.683
yes	29 (25.89)	11 (28.21)	18 (24.66)
no	83 (74.11)	28 (71.79)	55 (75.34)
ECOG, N (%)				0.465
0	19 (16.96)	8 (20.51)	11 (15.07)
1	93 (83.04)	31 (79.49)	62 (84.93)
Histologic type, N (%)				0.026*
SCC	50 (44.64)	23 (58.97)	27 (36.99)
NSCC	62 (55.36)	16 (41.03)	46 (63.01)
Type of ICIs, N (%)				0.950
anti-PD-L1	32 (28.57)	11 (28.21)	21 (28.77)
anti-PD-1	80 (71.43)	28 (71.79)	52 (71.23)
Therapy line, N (%)				0.468
1st	13 (11.61)	6 (15.38)	7 (9.59)
2nd	85 (75.89)	27 (69.23)	58 (79.45)
≥ 3rd	14 (12.50)	6 (15.38)	8 (10.96)
Tumor stage, N (%)				0.005*
IIIB	14 (12.50)	10 (25.64)	4 (5.48)
IIIC	8 (7.14)	1 (2.56)	7 (9.59)
IVA	43 (38.39)	17 (43.59)	26 (35.62)
IVB	47 (41.96)	11 (28.21)	36 (49.32)
Bone metastasis, N (%)				0.196
none	76 (67.86)	29 (74.36)	47 (64.38)
single	11 (9.82)	5 (12.82)	6 (8.22)
multiple	25 (22.32)	5 (12.82)	20 (27.40)
Brain metastasis, N (%)				0.914
none	95 (84.82)	34 (87.18)	61 (83.56)
single	6 (5.36)	2 (5.13)	4 (5.48)
multiple	11 (9.82)	3 (7.69)	8 (10.96)
Liver metastasis, N (%)				0.625
none	98 (87.50)	36 (92.31)	62 (84.93)
single	3 (2.68)	1 (2.56)	2 (2.74)
multiple	11 (9.82)	2 (5.13)	9 (12.33)
Pleural effusion, N (%)				0.283
none	73 (65.18)	28 (71.79)	45 (61.64)
yes	39 (34.82)	11 (28.21)	28 (38.36)
Hemoglobin, mean (SD)	12.30 (1.49)	12.67 (1.45)	12.09 (1.48)	0.044*
Albumin, median (IQR)	4.16 (3.88,4.35)	4.12 (3.88,4.38)	4.16 (3.91,4.34)	0.898
LDH, median (IQR)	224.50 (193,298.75)	209 (184,293)	239 (198,335)	0.110
NLR, median (IQR)	3.24 (2.29,4.66)	3.44 (2.25,4.63)	3.14 (2.34,5.00)	0.995
PLR, median (IQR)	173.00 (129.38,229.82)	166.15 (131.50,216.67)	185.63 (128.25,241.00)	0.330
LMR, median (IQR)	2.41 (1.91,3.31)	2.25 (1.83,3.40)	2.50 (2.00,3.25)	0.561
* P < 0.05
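
As a worked check of one row of Table 1, the sketch below applies the Pearson Chi-square test (without continuity correction, which is an assumption about the SPSS output that was reported) to the histologic type counts. The function name is hypothetical; the four counts are taken from the table.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson Chi-square test for a 2 x 2 table [[a, b], [c, d]].

    Returns (statistic, P value) with one degree of freedom and no
    continuity correction.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p_value = math.erfc(math.sqrt(chi2 / 2.0))  # chi-square survival, 1 df
    return chi2, p_value


# Rows: DCB, NDB. Columns: SCC, NSCC (counts from Table 1).
chi2_hist, p_hist = chi_square_2x2(23, 16, 27, 46)
print(f"chi-square = {chi2_hist:.3f}, P = {p_hist:.3f}")
```

Under these assumptions the statistic is about 4.973 and the P value rounds to 0.026, in agreement with the value listed for histologic type.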

Machine learning model comparison

The common machine learning algorithms, including LR, SVM, MLP, and RF, were compared to select the most suitable machine learning algorithm for radiomics data. In this study, we found that RF showed better performance than the other machine learning algorithms.

Recursive feature elimination was used to select the 10 features most relevant to efficacy from each of the precontrast and postcontrast CT radiomic datasets (Fig. 3a and 3b). Four machine learning algorithms were used to construct efficacy classification models based on the 20 selected radiomic features. The ROC curves and the average AUC are shown in Fig. 3c, and all evaluation metrics are recorded in Table 2. The RF model achieved the best predictive performance (AUC, 0.92 ± 0.04; balanced accuracy, 0.80 ± 0.03; specificity, 0.84 ± 0.10; sensitivity, 0.76 ± 0.09).

Table 2

The evaluation metrics of different machine learning models based on radiomics data
Method	AUC	Balanced accuracy	Specificity	Sensitivity
LR	0.58 ± 0.09	0.56 ± 0.08	0.57 ± 0.12	0.57 ± 0.13
SVM	0.71 ± 0.07	0.64 ± 0.04	0.83 ± 0.13	0.46 ± 0.12
MLP	0.64 ± 0.05	0.52 ± 0.02	0.81 ± 0.21	0.26 ± 0.22
RF	0.92 ± 0.04	0.80 ± 0.03	0.84 ± 0.10	0.76 ± 0.09

Building the random forest model with different input data

We further constructed five RF models with different input data: precontrast CT radiomic features, postcontrast CT radiomic features, combined radiomic features, clinical features, and combined radiomic and clinical features. The evaluation metrics of these models are shown in Table 3, and the ROC curves of each model are shown in Figs. 3d and 4a. The mean AUCs of the two models trained using only precontrast or only postcontrast features were 0.88 ± 0.05 and 0.87 ± 0.06, respectively. Of note, the radiomic model constructed with combined precontrast and postcontrast radiomic features showed better predictive performance (AUC, 0.92 ± 0.04; balanced accuracy, 0.80 ± 0.03; specificity, 0.84 ± 0.10; sensitivity, 0.76 ± 0.09). The clinical model constructed from the clinical features had an AUC of 0.89 ± 0.03 (balanced accuracy, 0.81 ± 0.04; specificity, 0.80 ± 0.04; sensitivity, 0.81 ± 0.07). Furthermore, when clinical features were added to the radiomic features in the combined model, the predictive performance was further improved (AUC, 0.94 ± 0.02; balanced accuracy, 0.82 ± 0.03; specificity, 0.86 ± 0.09; sensitivity, 0.79 ± 0.12). We also assessed the goodness of fit of the radiomic model, clinical model and combined model using calibration curves (Fig. 4b).

Table 3

The evaluation metrics of random forest models based on different input datasets
Model	AUC	Balanced accuracy	Specificity	Sensitivity
Precontrast model	0.88 ± 0.05	0.80 ± 0.03	0.86 ± 0.10	0.76 ± 0.09
Postcontrast model	0.87 ± 0.06	0.71 ± 0.05	0.87 ± 0.08	0.56 ± 0.11
Radiomic model	0.92 ± 0.04	0.80 ± 0.03	0.84 ± 0.10	0.76 ± 0.09
Clinical model	0.89 ± 0.03	0.81 ± 0.04	0.80 ± 0.04	0.81 ± 0.07
Combined model	0.94 ± 0.02	0.82 ± 0.03	0.86 ± 0.09	0.79 ± 0.12

Applying the combined model to prognosis analysis

To quantify the contribution of individual clinical variables to efficacy prediction, we performed an interpretability analysis of the combined model. The results indicated that BMI, LMR, NLR, age and PLR were the five most important clinical variables in efficacy prediction (Fig. 4c). Furthermore, the patients were classified into two groups by the combined model: the predicted DCB group and predicted NDB group. The survival analysis showed that PFS time between the two groups was significantly different (P < 0.0001), with a median PFS time of 11.9 (95% CI: 10.47–24.80) months and 1.9 (95% CI: 1.43–2.07) months, respectively (Fig. 4d).

The results of the present study confirmed that CT radiomics and multiple clinical data were both valuable for the efficacy prediction of immunotherapy, and the combination of the two resulted in improved prediction. Furthermore, only patients treated with PD-1/PD-L1 monotherapy were included in this study to rule out interference due to other treatments. Currently, studies predicting the efficacy of ICIs monotherapy using multidimensional data are rarely reported.

Yang et al. [36] previously used deep learning models based on multidimensional data to distinguish responders and non-responders to ICIs monotherapy at 60- and 90-days post-treatment. In terms of the time point of efficacy prediction, we assessed the therapeutic efficacy at 6 months. The efficacy was evaluated according to DCB and NDB, which is a practical and simple method for clinically classifying those who benefit from immunotherapy. The duration of the benefit time can more clearly capture the main contributor to benefit, which is persistence. Compared with treatment response defined by best response, DCB can not only exclude short-term responders, but also accurately assess benefit in those with SD, a population with heterogeneous immunotherapy benefit profiles [17].

CT is most commonly used for tumor staging and response assessment for NSCLC in the clinic. We used two types of CT radiomic features for modeling, and our findings demonstrated that radiomic models based on precontrast or postcontrast CT radiomic features both predicted efficacy. It is known that precontrast CT radiomic features are associated with the heterogeneity of tissue density due to necrosis, hemorrhage and myxoid changes [37], and postcontrast CT radiomic features can provide information on the spatial heterogeneity of microvascular distribution and permeability [38]. Thus, radiomic features at the macroscopic level can reflect the underlying tumor pathophysiology.

Considering that the combination of two types of CT radiomic features has the potential to comprehensively reflect tumor heterogeneity, we combined precontrast and postcontrast CT radiomic features and the combination further improved prediction performance. The results indicated that the simultaneous application of the two types of CT radiomic features was more reliable in predicting immunotherapy efficacy. Compared with the study by Wu et al. [39], our findings further confirmed the potential advantage of the combined radiomics model to predict the efficacy of ICIs monotherapy. The immunotherapy efficacy of cohorts with heterogeneous immunotherapy regimens including monotherapy and combined therapy may have been affected by other treatments in the study by Wu et al. In addition, Wu et al. did not combine currently known clinical biomarkers with radiomic features.

Immunotherapy efficacy in tumors is affected by a variety of biological factors, which have a complex impact on the development of tumors and immune responses. A previous study [40] indicated that the predictive ability of biomarkers might be improved by the combination of different biomarkers to reduce the assumed risk associated with each one. In order to confirm the hypothesis that individual treatment response may be the comprehensive result of the interaction of various factors, we integrated multidimensional data including radiomic features, demographic characteristics, clinical characteristics, and peripheral blood indicators to predict immunotherapy efficacy. The results showed that the combination of radiomic and clinical features was better than radiomic or clinical features alone.

Machine learning methods can combine different types of features in a non-linear fashion and are able to overcome the limitations of predictors that rely on a single feature [41]. The study by Chowell et al. [41] revealed that, in a non-linear combination, individual features contributed to different degrees to the overall prediction of response. In our study, the five most important clinical features associated with therapeutic efficacy in the RF classifier were BMI, LMR, NLR, age and PLR, which are known to provide information on nutritional, immune, and inflammatory status. A previous study [42] demonstrated that nutritional status can affect tumor development and response to treatment, and is closely related to the prognosis of cancer patients. Elderly patients tend to develop immunosenescence, which is characterized by a decline in immune capacity with increasing age [43]. Inflammation can promote or induce tumor initiation, progression and metastasis by regulating the tumor microenvironment [44]. An increasing number of studies [33, 34] have demonstrated that LMR, NLR and PLR are biomarkers that reflect the level of systemic inflammation and can represent the balance between tumor-promoting inflammation and antitumor immune function. Additionally, these potential prognostic factors are noninvasive, inexpensive and routinely obtained in clinical practice.

Moreover, we compared the performance of various machine learning methods for building classification models and found that the RF algorithm performed best in our study. The RF algorithm is robust to over-fitting and noise, can handle high-dimensional input variables, and is able to quantify the importance of each feature in the classification [45].

Our study had several limitations. First, this was a single-center retrospective study with a small sample size, and there was no external validation dataset. As it was difficult to identify a PD-1/PD-L1 monotherapy cohort with complete clinical and imaging data at baseline, we were unable to obtain homogeneous data from multiple centers. However, we believe that our findings will motivate more researchers to further examine this issue. Second, overall survival (OS) data were not used in the analysis, as the majority of patients received subsequent multi-line therapy after immunotherapy.

This preliminary exploratory study demonstrated that CT features combined with multiple biological factors were valuable for predicting the efficacy of PD-1/PD-L1 monotherapy in patients with advanced NSCLC. The results are expected to provide a basis for the establishment of a multidimensional model based on clinical and laboratory indicators in addition to imaging features for subsequent researchers.

NSCLC: non-small cell lung cancer

ICIs: immune checkpoint inhibitors

CT: computed tomography

AUC: area under the curve

ROC: receiver operating characteristic

PFS: progression-free survival

PD-1: programmed death 1

PD-L1: programmed death ligand 1

TMB: tumor mutation burden

DCB: durable clinical benefit

NDB: no durable benefit

IASLC: International Association for the Study of Lung Cancer

BMI: body mass index

COPD: chronic obstructive pulmonary disease

ECOG: Eastern Cooperative Oncology Group

LDH: lactate dehydrogenase

NLR: neutrophil-to-lymphocyte ratio

PLR: platelet-to-lymphocyte ratio

LMR: lymphocyte-to-monocyte ratio

CR: complete response

PR: partial response

SD: stable disease

PD: progressive disease

RECIST: Response Evaluation Criteria in Solid Tumors

ROI: region of interest

ICC: intraclass correlation coefficient

GLCM: gray level co-occurrence matrix

GLDM: gray level difference matrix

GLRLM: gray level run length matrix

GLSZM: gray level size zone matrix

RFE: recursive feature elimination

LR: logistic regression

SVM: support vector machine

MLP: multi-layer perceptron

RF: random forest

SD: standard deviation

IQR: interquartile range

SCC: squamous cell carcinoma

NSCC: non-squamous cell carcinoma

OS: overall survival

Funding

This work was supported by grants from the Natural Science Foundation of Zhejiang Province (Grant Number: Y22H227294) and the Medical Science and Technology Project of Zhejiang Province (Grant Numbers: 2022KY097, 2022RC114).

Competing Interests

The authors have no relevant financial or non-financial interests to disclose.

Acknowledgements

We thank International Science Editing (http://www.internationalscienceediting.com) for editing this manuscript.

Author contributions

JX, ZBS and LS contributed to the conception and design of the study. Data collection was performed by NL, BQZ, JJS and JTY. BLL conducted data processing and analysis. NL and LL wrote the first draft of the manuscript, and NL contributed to rewriting the revised manuscript. All authors read and approved the final accepted version of the manuscript.

Availability of data and material

The datasets analyzed during the current study are available from the corresponding author on reasonable request.

Ethics approval

This study was approved by the Ethics Committee of Zhejiang Cancer Hospital (Ethics approval number: IRB-2021-408).

Consent to participate

Informed consent was waived in view of the retrospective nature of the study.
