COVID-19 Mortality Risk Prediction using Clinical and Laboratory Examination: Machine Learning Approach for Implementation

doi:10.21203/rs.3.rs-2152771/v1

Download PDF

Article

COVID-19 Mortality Risk Prediction using Clinical and Laboratory Examination: Machine Learning Approach for Implementation

https://doi.org/10.21203/rs.3.rs-2152771/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 10 Feb, 2023

Read the published version in Scientific Reports →

You are reading this latest preprint version

Background and Aim: We aimed to propose a mortality risk prediction tool to facilitate COVID-19 patient management and allocation for the frontline physician on admission day.

Methods: We used a dataset of confirmed COVID-19 patients admitted to three general hospitals in Tehran. Clinical and laboratory values on admission were gathered. Different machine learning methods were used to assess the risk of in-hospital mortality, including logistic regression, k-nearest neighbor (KNN), gradient boosting classifier, random forest, support vector machine, and deep neural network (DNN). Least absolute shrinkage and selection operator (LASSO) regression and Boruta feature selection methods were used for feature selection. The proposed model was selected using the area under the receiver operator curve (AUC). Furthermore, a dataset from the fourth hospital was used for external validation.

Results: 5320 hospitalized COVID-19 patients were enrolled in the study with a mean age of 61.6± 17.6 years and a fatality rate of 17.24% (N=917). All methods showed fair performance with AUC>80%, except for the KNN method. The feature selection method selected ten laboratories and eight clinical features. Our proposed DNN and LASSO feature selection methods showed AUC scores of 83.4% and 82.8% in internal and external validation, respectively. Furthermore, our imputer worked fairly when two out of ten laboratory parameters were missing (AUC=81.8%).

Conclusion: We worked intimately with healthcare professionals to provide a tool that can solve real-world needs. Our proposed model showed promising results and confirms the potential of ML methods for use in clinical practice as a decision-support system. Future studies are warranted to investigate barriers to the implementation of ML tools.

Health sciences/Medical research

Health sciences/Risk factors

Biological sciences/Microbiology

Biological sciences/Microbiology/Virology/Sars cov 2

COVID-19

Prognosis

Machine Learning

Deep Learning

Hospital Mortality

As of 25 September 2022, 612 million confirmed cases and 6.5 million deaths had been reported globally (WHO, 2022). Many studies have investigated the predictors of death and severity of COVID-19 (1). Assessing the risk of death after COVID-19 is useful for guiding clinical decisions for patients and evaluating the effectiveness of prevention strategies (2). New analytic approaches may enhance risk prediction with existing data beyond traditional methods.

Machine learning (ML) as a novel approach can enhance policy-making, forecasting, screening, drug development, diagnosis, and risk stratification during the COVID-19 pandemic. Artificial intelligence (AI) can fill the gap between healthcare resources and patient load by reducing human workload. In the shortage of intensive care unit (ICU) beds, risk stratification of patients can identify the most vulnerable (3). Although many ML algorithms strived to help physicians, no ML tool has yet been implanted in clinical practice. This issue is partly because of clinicians' hardship in using and interpreting computational models. Herby, creating an interpretable model should be a vital part of the practice and can be achieved by healthcare professionals' feedback.

Even after the introduction of COVID-19 vaccines, the peaks in the incidence of COVID-19 are evident in many countries. COVID-19 deaths also endured in countries with low vaccine coverage (4). Moreover, training a generalizable ML needs precise variable and population selection. The ML training data set should represent the actual population using the model (5).

Accordingly, this study used clinical and laboratory data from three general hospitals to predict the mortality risk of hospitalized COVID-19 patients and determined the external validation of the model. We aimed to propose a mortality risk prediction tool to facilitate COVID-19 patient management and allocation for the frontline physician on admission day.

Data collection

We enrolled 5320 confirmed COVID-19 patients admitted to three general hospitals in Tehran, Iran, from March 2020 to March 2021. A Medical team reviewed patients' medical records and gathered patients' demographics, symptoms, comorbidities, admission vital signs, and outcomes. Laboratory results were collected for all patients on the first day of admission through the hospital information system. Confirmation of cases was based on real-time polymerase chain reaction (RT-PCR) for SARS-CoV-2 of nasal or oropharyngeal swab samples on the first days of admission. The outcome of the study was death versus discharge from the hospital. We previously explored the epidemiology of the cohort used in this study in detail (6).

Data cleaning and imputation

Patients with any missing categorical variable or missing more than two numerical features were removed from the dataset. Out of 88 features collected from cohort patients, including 52 categorical features and 29 continuous features, none of the categorical features contained missing data. Conversely, seven numerical features were dropped due to a proportion of missing values greater than 5 percent; then, other missing values were imputed using Python's Sci-kit learn iterative imputer.

Feature selection

Feature selection is a crucial method in developing machine learning models. It can prevent overfitting, a great problem in machine learning, by eliminating redundant collinear features. We recognized the most predictive values using the least absolute shrinkage and selection operator (LASSO) regression and Boruta feature selection methods. LASSO confirmed 37 features containing 25 categorical and 12 nominal features, and Boruta selected 24 features, all of which were nominal. We used both of these groups separately as our training data features and compared the performances.

Model development

Six machine learning classification models were trained and fine-tuned, including support vector machine (SVM) with Radial Basis Function (RBF) as kernel and the degree set to 3, logistic regression (LR), k-nearest neighbors (KNN) with number of neighbors set to 5 and weights to uniform, random forest (RF) with number of estimators set to 100 and criterion set to gini, gradient boosting decision tree (GBDT) with number of estimators set to 100, learning rate set to 0.1, and loss set to log_loss, and deep neural network (DNN) to calculate the risk of mortality in admitted covid patients. SVM, and LR were regularized using the L2-regularization (Ridge regression) method. After fine-tuning, the neural network contained two hidden layers with 128 units for the first and 64 units for the second hidden layer. Moreover, all layers were activated using rectified linear unit (ReLU) activation function, and the output layer contained a unit with a sigmoid activation function. All layers except the output layer had 50% Dropout, a deep neural network compiled with binary cross-entropy as loss function and stochastic gradient descent with learning rate, decay, momentum, and Nesterov set to 0.01, 1e-7, 0.9, and true respectively as optimizer. The machine learning pipeline of proposed DNN model and its implementation is depicted in Fig. 1.

Model training and evaluation

Two data sets were created using features confirmed by each feature selection method. Then datasets were randomly split into training and validation sets in a ratio of 7:3 while preserving the same proportion of mortality in all datasets due to the small percentage of mortality in datasets.

Using accuracy for evaluating model performance was inappropriate due to the skewness of the data. Precision, Recall, F1-Score, sensitivity, specify, and area under the curve (AUC) of the receiver operating characteristic (ROC) score was calculated to evaluate model performance on validation datasets, and the ROC curve visualized model performance.

We tested our trained models' performances on an external dataset from another province's tertiary hospital to evaluate the generalizability of our models to evaluate the models' generalizability.

Effect of using iterative imputer on models' performances

One of the most critical issues that every machine learning and deep learning project on tabular data must overcome is dealing with missing data. There are several ways to solve the missing values problem, including filling with median, mean, arbitrary value, previous/next value, using the most common value, and imputing the missing values using machine learning models. In this study, we used an iterative imputer, a multivariate imputer -- It estimates the missing values in each feature using all other features in the dataset. This is one of the most commonly used machine learning strategies for missing values. In this study, we evaluated the effect of the iterative imputer on machine learning models' performances and compared it with models trained on datasets without missing values. For this comparison, we randomly removed 20% of the numerical values in our training datasets and trained the same machine learning models with the same hyperparameters on these datasets. Then we evaluated the performance metrics of these models on the main testing dataset to compare them with the main models' performances.

Statistical analysis

Data analysis and visualization were performed using the R program. Kolmogorov-Smirnov normality test is used to evaluate the normal distribution of a variable. The Fisher exact test was used to determine the significance of categorical features, and the Mann-Whitney U test was used to evaluate significant non-parametric variables. An Independent t-test was used to find the significance of parametric features. The categorical variables are presented as number and percent, and numerical variables are presented as mean and standard deviation (SD)

Ethics

All methods were performed in accordance to Helsinki protocol. The Institutional Review Board (IRB) at the Shahid Beheshti University of Medical Science approved the study and waived informed consent gathering (IR.SBMU.RIGLD.REC.1400.014). Data were anonymized before analysis, and patients' confidentiality and data security were concerned. This study is part of an observational, retrospective, multicentric research project to investigate the epidemiological characteristics of COVID-19 patients.

Basic Characteristics

After excluding 1703 patients due to missing categorical variables or missing more than two nominal variables, 5320 hospitalized COVID-19 patients were enrolled in the study with a mean (SD) age of 61.6 (17.6) years. The fatality rate in the enrolled cohort was 17.24% (N = 917), and patients who died due to covid-19 were significantly older than those who survived (70.3 (15.1) versus 58.6 (17.1), P < 0.001). The basic characteristics of survived and mortality cohort is presented in Supplementary Table S1.

Factors Associated with Mortality

As depicted in Supplementary Table S2, age, history of myalgia, loss of consciousness, vertigo and vomiting, skin lesions, alcohol consumption, history of gastrointestinal problems, rheumatoid arthritis, Neurologic disorders, leukocytosis, thrombocytopenia, low hemoglobin level, high CRP, low HCO3, high CPK level, low oxygen saturation, pulse rate, and respiratory rate at the time of admission, were demonstrated as factors associated with a higher risk of mortality in admitted covid patients using cox proportional hazards model. The most important features associated with mortality were alcohol consumption and loss of consciousness at the time of admission. Table 1 depicts the mean difference and hazard ratio of selected features.

Table 1

Mean comparison and Cox Regression of selected variables for inclusion in the model.
	Cox Regression				Mean Comparison*
Feature	HR	Lower 95% CI	Upper 95% CI	P-Value	Mortality Cohort	Survived Cohort	P-Value
Demographic and Habitual History
Age	1.028	1.023	1.034	0.001	74.00 (61.00,83.00)	60.00(47.00,71.00)	0.001
Opium	0.827	0.581	1.178	0.293	43.0(4.69%)	135.0(1.06%)	0.39
Alcohol consumption	2.599	1.235	5.469	0.012	10.0(1.09%)	11.0(0.09%)	0.022
Comorbidities
DM	1.09	0.936	1.27	0.266	346.0(37.73%)	784.0(6.17%)	0.001
IHD	1.101	0.927	1.309	0.272	214.0(23.34%)	394.0(3.10%)	0.001
Cancer	1.253	0.966	1.626	0.089	78.0(8.51%)	128.0(1.01%)	0.001
CHF	1.129	0.761	1.675	0.546	31.0(3.38%)	52.0(0.41%)	0.01
COPD	1.181	0.755	1.849	0.466	22.0(2.40%)	47.0(0.37%)	0.133
CVA	1.207	0.957	1.522	0.112	101.0(11.01%)	134.0(1.06%)	0.001
GI problems	1.797	1.037	3.113	0.037	15.0(1.64%)	35.0(0.28%)	0.271
Hepatitis C	1.348	0.185	9.805	0.768	1.0(0.11%)	4.0(0.03%)	0.625
Alzheimer	1.038	0.776	1.387	0.802	63.0(6.87%)	48.0(0.38%)	0.001
Psychological problems	1.636	1.073	2.495	0.022	24.0(2.62%)	39.0(0.31%)	0.017
Parkinson	1.106	0.72	1.7	0.645	25.0(2.73%)	24.0(0.19%)	0.001
Medical Exam and History
Respiratory rate (/min)	1.009	1.002	1.016	0.016	19 (18.00,22.00)	18 (18.00,20.00)	0.001
Fever	0.936	0.774	1.133	0.5	343 (37.40%)	1312 (10.33%)	0.001
Sore throat	0.828	0.481	1.426	0.496	14 (1.53%)	73 (0.57%)	0.046
Headache	0.881	0.668	1.164	0.374	58 (6.32%)	379 (2.98%)	0.001
Vomiting	0.83	0.696	0.99	0.038	180 (19.63%)	767 (6.04%)	0.001
Myalgia	0.825	0.688	0.988	0.037	181 (19.74%)	895 (7.05%)	0.001
Cough	0.946	0.811	1.104	0.481	373(40.68%)	1402 (11.04%)	0.001
Arthralgia	0.992	0.555	1.775	0.979	14 (1.53%)	40 (0.32%)	0.515
Insomnia	0.925	0.38	2.253	0.864	5 (0.55%)	54.0(0.43%)	0.001
Loss of consciousness	1.499	1.253	1.794	0.001	233 (25.41%)	179.0(1.41%)	0.001
Rhinorrhea	1.892	0.926	3.868	0.08	9 (0.98%)	20.0(0.16%)	0.303
Laboratory Values
Ph (VBG)	0.651	0.413	1.024	0.063	7.36(7.29,7.41)	7.38(7.34,7.42)	0.001
HCo3 (VBG)	0.971	0.957	0.986	0.001	23.70(20.20,27.40)	26.00(23.20,28.70)	0.001
Calcium	0.979	0.919	1.042	0.501	8.50(8.00,9.10)	8.70(8.20,9.23)	0.001
Hemoglobin (CBC)	0.962	0.931	0.995	0.025	11.80(10.00,13.30)	12.40(11.00,13.60)	0.001
White blood cell (CBC)	1.008	1.002	1.015	0.015	9.20(6.30,13.30)	6.80(4.90,9.70)	0.001
Neutrophil (%) (CBC)	1.019	1.003	1.036	0.019	85.00(78.00,90.00)	80.00(70.00,85.00)	0.001
INR	1.1	0.954	1.267	0.188	1.14(1.00,1.30)	1.07(1.00,1.20)	0.001
Potassium	1.04	0.991	1.091	0.111	4.20(3.80,4.60)	4.00(3.80,4.40)	0.0001
Creatinine	1.041	1	1.085	0.051	1.40(1.10,2.20)	1.10(0.90,1.40)	0.001
Magnesium	1.02	0.836	1.243	0.848	2.00(1.80,2.20)	1.90(1.80,2.10)	0.001
Footnote: * Mann–Whitney U test was performed for evaluating difference in mean values. VBG: Venous blood gas, DM: Diabetic mellites, INR: international normalized ratio, CBC: complete blood count, IHD: ischemic heart disease, CHF: chronic heart failure, COPD: chronic obstructive pulmonary disease, CVA: cerebrovascular accident

Feature Selection Methods and Variable Importance

LASSO and Brouta feature selection methods were used for variable importance, and results are visualized in Supplementary Figures S1 and S2. Twenty-four features out of eighty-one were confirmed by the Boruta method, consisting mostly of laboratory tests (Supplementary Figure S1). The most important features among them are oxygen saturation at admission, age, neutrophil count, serum level of creatinine, troponin and loss of consciousness. Thirty-seven features were confirmed by the LASSO regression method, including 25 categorical features and 12 continuous variables (Supplementary Figure S2). Among these, 23 features were positively associated with mortality, and 14 were negatively correlated with covid patients' mortality. Internal and External Validation

The details of the model's performance in the test datasets are summarized in Table 2, and Fig. 2 shows the ROC curve of the models. Most of the trained models showed promising performance for internal validation (AUC score > 80%) except K nearest neighbor, which had the lowest AUC score among all selected models in both datasets. Deep neural networks showed the best performance, with an AUC score of 83.4% in the LASSO-selected validation dataset and 82.6% in the Boruta dataset. The multivariate imputation showed a good performance on the primary test set when 2 out of 10 laboratory variables were missing. The change in model performance ranged from − 1.4% (GBDT with LASSO features) to 4.2% (KNN with LASSO variables). The performance of the DNN model with LASSO features decreased by 1.6%. The generalized performance of the DNN model using LASSO variables was confirmed in the external validation (83.4–82.8%), and the model performance change ranged between 0.7% increase (GDBT with LASSO features) to 11.9% decrease (SVM with Brouta features) in AUC. The confusion matrix of the proposed model (DNN using LASSO features) in the external validation dataset is presented in Fig. 3 using binary and ternary classification (using cut-off points offered by an expert clinician).

Table 2

Model internal and external validation; and validation of imputer model for 2 out of 10 missing lab value.
	Feature selection method	Model	AUC score	Sensitivity	specificity	PPV	NPV
Internal Validation	LASSO Regression	DNN	83.4	62.2	92.2	70.2	89.2
		SVM	81.6	40.6	93.9	66.3	84.2
		RF	80.6	66.6	81.8	52.1	89.2
		GBDT	78.9	58.1	83.8	51.6	87.1
		KNN	69.6	31.5	88.3	44.4	81.3
		LR	82.3	44.2	90.1	57.0	84.5
	Boruta	DNN	82.7	51.2	88.0	59.2	84.1
		SVM	81.7	42.1	90.1	59.1	82.1
		RF	82.5	43.2	91.6	63.6	82.6
		GBDT	82.0	44.0	90.1	60.1	82.5
		KNN	70.5	38.18	89.5	55.2	81.0
		LR	82.7	41.09	90.7	60.1	81.9
Imputer Validation (Two out of ten missing lab values)	LASSO Regression	DNN	81.8	60.6	86	72	79.2
		SVM	80	37.6	93.4	62.6	83.4
		RF	81.3	43	90.5	57.2	84.3
		GBDT	80.3	55.7	83.9	50.5	86.5
		KNN	65.4	33.3	89.4	48.2	81.9
		LR	79.1	44.2	90.3	57.4	84.5
	Boruta	DNN	81.6	48.7	90.9	65.9	83.2
		SVM	79.1	37.1	93.6	67.6	80.6
		RF	80.5	46.6	89.8	62.2	82.4
		GBDT	79.3	47.1	88.5	59.6	82.3
		KNN	70.6	31.9	92.1	59.2	79
		LR	79.3	42.4	91.9	65.3	81.6
External Validation	LASSO Regression	DNN	82.8	98.1	23.7	79.2	80.7
		SVM	72.1	47.4	78	38.9	21.6
		RF	78.6	44	75.6	34.8	21.1
		GBDT	79.6	9.5	63.2	43.3	19.1
		KNN	60.1	9	75.9	52.6	22
		LR	82.4	6.4	68.6	37.7	19.8
	Boruta	DNN	75.3	94.5	25.7	79	61.1
		SVM	69.8	73.3	81.3	53.7	22.8
		RF	71.4	5.8	82.2	49.5	22.7
		GBDT	71.8	89.1	74.2	50.6	21.6
		KNN	59.6	10.4	78.6	59	22.8
		LR	74	6	73.2	39.8	20.8
Footnote: DNN: deep neural network, SVM: supervector machine, RF: random forest, GDBT: gradient booster decision tree, KNN: k-nearest neighbor, LR: logistic regression

As of March 2022, the coronavirus has caused five global peaks in the number of patients and deaths from COVID-19 through different strains. It is critical to monitor and allocate patients to increase the efficacy of the health system. The high capabilities of artificial intelligence and machine learning algorithms in information processing can help us improve patient management. In this study, we worked intimately with healthcare professionals to provide a tool that can solve real-world needs. For this, we developed a model to predict the mortality risk of COVID-19 inpatients at admission using clinical and laboratory data. In addition, a set of eight clinical and ten cheap, available laboratories were selected in our model. Furthermore, an imputation tool is used to impute the not-available labs, and a ternary outcome classification (low, high, and very high risk) was proposed as healthcare experts' suggestion which is helpful during peaks of disease.

The results of this study are promising and applicable for managing COVID-19 inpatients with the current and upcoming COVID-19 variants. The internal validation, validation with 20% missing laboratories, and external validation showed promising results (AUC > 80). Validation with 20% missing data indicates the approved potential of our model in cases when extracting some of the patient's data is not feasible and needs to be imputed. Moreover, the model's generalization was investigated using data from the fourth hospital in a different province. The AUC of 82.8% was achieved in external validation, which further confirmed the model performance for global application.

Finally, we selected a deep neural network model trained on features determined by the lasso regression method as our proposed model based on its performance on the external dataset (AUC = 83.4%). Despite the susceptibility of neural networks to overfitting, our neural network models performed well on the external validation dataset due to feature selection methods and large sample sizes. Several studies have developed machine learning models to predict COVID-19 patients' mortality risk. However, as demonstrated in Table 3, models with high AUC scores are most likely trained on a small dataset or the data gathered from a single medical center which can indicate that these models may not be generalized and their performance can drop in a dataset from a different center (7–12). Furthermore, our proposed model performed relatively better when compared to models trained on a larger multicentral dataset. This higher performance may be due to the large number of input features, which can simultaneously analyze different aspects of a patient's health (13–16).

Table 3

Current studies with external validation in the literature predicting prognosis of COVID-19 using clinical and laboratory retrieved from search in PubMed and Scopus databases and review articles (29, 30).
Author, Publish Date,	Training dataset sources, Country	Number of patients for model development	Variable for prediction	Outcome	Proposed Model	Internal (In) and External (Ex) Validation AUROC (95% CI)
Our Model, Iran	3 centers	5320	27 clinical (history and examination) and 10 laboratory variables	In-hospital mortality	Deep neural network, LASSO	In: 83.8% Ex: 82.8%
Singh et al, Dec 2021, (13)	3 centers	8,427	10 markers selected from 57 laboratory, clinical, and demographic variables	Disease severity*	minimum redundance maximum relevance, hybrid feature selection	In: 78% Ex: 74%
Noy et al, Feb 2022(7)	1 centers, Israel	417	Static and dynamic features including demographics, background disease, vital signs and lab measurements	deterioration within the next 7–30 h	CatBoost (ensemble decision tree)	In: 84% Ex: 74%
Chen et al, Apr 2021, (14)	7 centers, China	6415	4 Clincal and 4 Laboratory Variables	In-hospital Mortality	Random forrest, LASSO	In: 90% Ex: 89%, 90%, 81%
Clift et al, Oct 2020, (15)	910 practices, UK	6,083,102	age, ethnicity, deprivation, body mass index, and a range of comorbidities	In-hospital Mortality	regression coefficients, LASSO	AUROC is not reported, R squared = 73.1%
Vaid et al, Sep 2020, (8)	1 center, USA	1514	Age and 8 laboratory markers	In-hospital Mortality (following 1,3,5,7 days)	XGBoost, LASSO	In: 89% at 3 days, 85% at 5 and 7 days Ex: 80% at 3 days, 79% at 5 days, 80% at 7 days
Ko et al, Nov 2020, (9)	1 center, China	361	Age, gender, and 28 blood biomarkers	In-hospital mortality	deep neural network and random forest models	In: accuracy = 93% Ex: accuracy = 92%
Gao et al, Oct 2020 (10)	2 centers, China	1506	6 clinical and 2 laboratory biomarkers	mortality risk stratification	Logistic Regression, Support Vector Machine, Gradient Boosted Decision Tree, and Neural Network	In: 92.4%, Ex: 95.5%, 87.9%
Bertsimas et al, Dec 2020, (16)	33 centers	3,927	Age and 9 laboratory biomarkers	In-hospital mortality	XGBoos	In: 90% Ex: 87%, 92%, 80%
Guan et al, Jan 2021 (11)	2 centers, China	1270	2 clinical and 4 laboratory features	In-hospital mortality	Simple-tree XGBoost	In:99.1% Ex: 99.7%
Hu et al, Sep 2020 (12)	1 center, China	183	Age and 4 laboratory variables	In-hospital mortality	Logistic Regression	int:89.5% Ex: 88.1%
Footnote: AUROC: ;LASSO: ; * Severity level 0 (no respiratory problem) to level 4 (in-hospital ≤ 30-day mortality)

The application of machine learning models in the clinic depends on the features based on which the machine predicts. Ease of access and the possibility of easy measurement of these features to predict with high accuracy at the right time is of great importance at the bedside of patients. Predictors were selected by the use of 2 different feature selection methods and their further comparison. Selected features in the present study include 18 factors: age, history of myalgia, loss of consciousness, vertigo and vomiting, dramatic lesions, alcohol consumption, history of GI problems, rheumatoid arthritis, neurologic disorders, leukocytosis, thrombocytopenia, low hemoglobin level, high CRP, low HCO3, high CPK level, low O₂ saturation, pulse rate, and respiratory rate at the time of admission. Previous studies included many of our selected features for prognosis prediction (7–9, 13, 14).

Our predictors are easily accessible and routinely checked with a simple history, physical examination, and blood test in all hospitals. Our results can better explain COVID-19 poor prognostics since all these factors are associated with high mortality risk. Multicollinearity may bring about redundancy in the model performance. Feature selection methods dismissed parameters with a high level of correlations and collinearity. In previous studies, laboratory markers, patient demographics, medical history, and vital signs have been used as effective features in predicting the mortality of patients with COVID-19 (7, 13, 17–22). Some studies used factors including different inflammatory cytokines (23–26), which are not part of patients' routine admission measurements and cannot be obtained in settings with congested resources in contrast to our predictors. Our model may prompt individualized treatments due to distinguishing patients' prognoses based on their different clinical characteristics. This can lead to optimal decision-making of physicians.

There are some limitations to this work that should be noted. First, even though we had a relatively large patient population, our study was retrospective. Prospective validation of our study is required to ascertain the results. The hospitals in our study are all in a developing country (Iran). The scarcity of medical resources in hospitals may bring about inadequate service allocated to patients. This condition can thereby increase the mortality rate in such countries in contrast to countries with effective medical systems. Additionally, the current model does not encompass imaging, microbiological, and histological data, which could contribute to a more precise prognosis prediction despite the inconvenience. Socioeconomic and racial differences, which were investigated in some studies (27, 28), might as well play a role in prognosis.

In conclusion, this study shows that using machine learning methods can predict the mortality risk of COVID-19 patients on admission. This confirms the potential of ML methods for use in clinical practice as a decision-support system. However, effective machine learning models should satisfy the real-world needs of healthcare experts to increase the chance of implementation in practice. Further studies are suggested to investigate the current barriers to implementing ML in practice.

Author Contribution

MAP and SAASN was responsible for conceptualization. MAP, HH, and SAASN was responsible for administration. MAP was responsible for funding acquisition. SAASN was responsible for data curation, SSB was responsible for deep learning and algorithm development, with the help of AS and SA, AT, SI, FS, and SE was responsible for investigation. FS, AT, and SI was responsible for writing the original draft with the help of SSB, SAASN. HH and MAP was responsible to grant access for data

Data Availability statement

The datasets used in the current study are available from the corresponding author on reasonable request. The dataset would be unreservedly available for use as a validation dataset of other research projects, after sending the request to the corresponding author.

Conflict of Interests

SAASN is a founder of MedicAi startup; and SAASN and SSB received compensation as a member of research and development unit of AiMedic.co. The authors declare no other conflict of interests related to this work.

Funding

This study was conducted in the Gastroenterology and Liver Diseases Research Centre of Shahid Beheshti University of Medical Sciences and supported by grant number 29041.

Li J, Huang DQ, Zou B, Yang H, Hui WZ, Rui F, et al. Epidemiology of COVID-19: A systematic review and meta-analysis of clinical characteristics, risk factors, and outcomes. J Med Virol. 2021;93(3):1449–58. DOI: 10.1002/jmv.26424.
Girum T, Lentiro K, Geremew M, Migora B, Shewamare S. Global strategies and effectiveness for COVID-19 prevention through contact tracing, screening, quarantine, and isolation: a systematic review. Tropical Medicine and Health. 2020;48(1):91. DOI: 10.1186/s41182-020-00285-w.
Lalmuanawma S, Hussain J, Chhakchhuak L. Applications of machine learning and artificial intelligence for Covid-19 (SARS-CoV-2) pandemic: A review. Chaos Solitons Fractals. 2020;139:110059. DOI: 10.1016/j.chaos.2020.110059.
Our World in Data. Daily new confirmed COVID-19 cases & deaths per million people (Accessed on 29 August 2022) Acces Date: [Available from: https://ourworldindata.org/explorers/coronavirus-data-explorer?uniformYAxis=0&Interval=7-day+rolling+average&Relative+to+Population=true&country=USA~AUS~ITA~CAN~DEU~GBR~FRA&Metric=Cases+and+deaths&Color+by+test+positivity=false
Chowdhury MZI, Turin TC. Variable selection strategies and its importance in clinical prediction modelling. Fam Med Community Health. 2020;8(1):e000262. DOI: 10.1136/fmch-2019-000262.
Hatamabadi H, Sabaghian T, Sadeghi A, Heidari K, Safavi-Naini SAA, Looha MA, et al. Epidemiology of COVID-19 in Tehran, Iran: A Cohort Study of Clinical Profile, Risk Factors, and Outcomes. Biomed Res Int. 2022;2022:2350063. DOI: 10.1155/2022/2350063.
Noy O, Coster D, Metzger M, Atar I, Shenhar-Tsarfaty S, Berliner S, et al. A machine learning model for predicting deterioration of COVID-19 inpatients. Sci Rep. 2022;12(1):2630. DOI: 10.1038/s41598-022-05822-7.
Vaid A, Somani S, Russak AJ, De Freitas JK, Chaudhry FF, Paranjpe I, et al. Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation. J Med Internet Res. 2020;22(11):e24018. DOI: 10.2196/24018.
Ko H, Chung H, Kang WS, Park C, Kim DW, Kim SE, et al. An Artificial Intelligence Model to Predict the Mortality of COVID-19 Patients at Hospital Admission Time Using Routine Blood Samples: Development and Validation of an Ensemble Model. J Med Internet Res. 2020;22(12):e25442. DOI: 10.2196/25442.
Gao Y, Cai G-Y, Fang W, Li H-Y, Wang S-Y, Chen L, et al. Machine learning based early warning system enables accurate mortality risk prediction for COVID-19. Nature Communications. 2020;11(1):5033. DOI: 10.1038/s41467-020-18684-2.
Guan X, Zhang B, Fu M, Li M, Yuan X, Zhu Y, et al. Clinical and inflammatory features based machine learning model for fatal risk prediction of hospitalized COVID-19 patients: results from a retrospective cohort study. Annals of Medicine. 2021;53(1):257–66. DOI: 10.1080/07853890.2020.1868564.
Hu C, Liu Z, Jiang Y, Shi O, Zhang X, Xu K, et al. Early prediction of mortality risk among patients with severe COVID-19, using machine learning. International Journal of Epidemiology. 2020;49(6):1918–29. DOI: 10.1093/ije/dyaa171.
Singh V, Kamaleswaran R, Chalfin D, Buno-Soto A, San Roman J, Rojas-Kenney E, et al. A deep learning approach for predicting severity of COVID-19 patients using a parsimonious set of laboratory markers. iScience. 2021;24(12):103523. DOI: 10.1016/j.isci.2021.103523.
Chen Z, Chen J, Zhou J, Lei F, Zhou F, Qin J-J, et al. A risk score based on baseline risk factors for predicting mortality in COVID-19 patients. Current Medical Research and Opinion. 2021;37(6):917–27. DOI: 10.1080/03007995.2021.1904862.
Clift AK, Coupland CAC, Keogh RH, Diaz-Ordaz K, Williamson E, Harrison EM, et al. Living risk prediction algorithm (QCOVID) for risk of hospital admission and mortality from coronavirus 19 in adults: national derivation and validation cohort study. BMJ. 2020;371:m3731. DOI: 10.1136/bmj.m3731.
Bertsimas D, Lukin G, Mingardi L, Nohadani O, Orfanoudaki A, Stellato B, et al. COVID-19 mortality risk assessment: An international multi-center study. PLOS ONE. 2020;15(12):e0243262. DOI: 10.1371/journal.pone.0243262.
Banoei MM, Dinparastisaleh R, Zadeh AV, Mirsaeidi M. Machine-learning-based COVID-19 mortality prediction model and identification of patients at low and high risk of dying. Crit Care. 2021;25(1):328. DOI: 10.1186/s13054-021-03749-5.
Jamshidi E, Asgary A, Tavakoli N, Zali A, Setareh S, Esmaily H, et al. Using Machine Learning to Predict Mortality for COVID-19 Patients on Day 0 in the ICU. Front Digit Health. 2021;3:681608. DOI: 10.3389/fdgth.2021.681608.
Moulaei K, Shanbehzadeh M, Mohammadi-Taghiabad Z, Kazemi-Arpanahi H. Comparing machine learning algorithms for predicting COVID-19 mortality. BMC Med Inform Decis Mak. 2022;22(1):2. DOI: 10.1186/s12911-021-01742-0.
Fernandes FT, de Oliveira TA, Teixeira CE, Batista AFM, Dalla Costa G, Chiavegatto Filho ADP. A multipurpose machine learning approach to predict COVID-19 negative prognosis in Sao Paulo, Brazil. Sci Rep. 2021;11(1):3343. DOI: 10.1038/s41598-021-82885-y.
Laatifi M, Douzi S, Bouklouz A, Ezzine H, Jaafari J, Zaid Y, et al. Machine learning approaches in Covid-19 severity risk prediction in Morocco. J Big Data. 2022;9(1):5. DOI: 10.1186/s40537-021-00557-0.
Dabbah MA, Reed AB, Booth ATC, Yassaee A, Despotovic A, Klasmer B, et al. Machine learning approach to dynamic risk modeling of mortality in COVID-19: a UK Biobank study. Sci Rep. 2021;11(1):16936. DOI: 10.1038/s41598-021-95136-x.
Mehta P, McAuley DF, Brown M, Sanchez E, Tattersall RS, Manson JJ, et al. COVID-19: consider cytokine storm syndromes and immunosuppression. Lancet. 2020;395(10229):1033–4. DOI: 10.1016/S0140-6736(20)30628-0.
Babajani A, Hosseini-Monfared P, Abbaspour S, Jamshidi E, Niknejad H. Targeted Mitochondrial Therapy With Over-Expressed MAVS Protein From Mesenchymal Stem Cells: A New Therapeutic Approach for COVID-19. Front Cell Dev Biol. 2021;9:695362. DOI: 10.3389/fcell.2021.695362.
Conti P, Ronconi G, Caraffa A, Gallenga CE, Ross R, Frydas I, et al. Induction of pro-inflammatory cytokines (IL-1 and IL-6) and lung inflammation by Coronavirus-19 (COVI-19 or SARS-CoV-2): anti-inflammatory strategies. J Biol Regul Homeost Agents. 2020;34(2):327–31. DOI: 10.23812/CONTI-E.
Jamshidi E, Babajani A, Soltani P, Niknejad H. Proposed Mechanisms of Targeting COVID-19 by Delivering Mesenchymal Stem Cells and Their Exosomes to Damaged Organs. Stem Cell Rev Rep. 2021;17(1):176–92. DOI: 10.1007/s12015-020-10109-3.
Abrams LS, Moio JA. Critical Race Theory and the Cultural Competence Dilemma in Social Work Education. Journal of Social Work Education. 2013;45(2):245–61. DOI: 10.5175/jswe.2009.200700109.
Bai AD, Li XX, Alsalem M, Khan S, Smieja M, Mertz D, et al. Utility of asymptomatic inpatient testing for COVID-19 in a low-prevalence setting: A multicenter point-prevalence study. Infect Control Hosp Epidemiol. 2020;41(10):1233–5. DOI: 10.1017/ice.2020.349.
Bottino F, Tagliente E, Pasquini L, Napoli AD, Lucignani M, Figà-Talamanca L, et al. COVID Mortality Prediction with Machine Learning Methods: A Systematic Review and Critical Appraisal. Journal of Personalized Medicine. 2021;11(9):893.
Miller JL, Tada M, Goto M, Chen H, Dang E, Mohr NM, et al. Prediction models for severe manifestations and mortality due to COVID-19: A systematic review. Academic Emergency Medicine. 2022;29(2):206–16. DOI: https://doi.org/10.1111/acem.14447.

No competing interests reported.

Supplemantary.docx

Download PDF

Journal Publication

published 10 Feb, 2023

Read the published version in Scientific Reports →

Editorial decision: Major revision
28 Dec, 2022
Reviews received at journal
25 Dec, 2022
Reviewers agreed at journal
11 Nov, 2022
Reviews received at journal
10 Nov, 2022
Reviewers agreed at journal
10 Nov, 2022
Reviewers invited by journal
04 Nov, 2022
Editor assigned by journal
04 Nov, 2022
Editor invited by journal
20 Oct, 2022
Submission checks completed at journal
20 Oct, 2022
First submitted to journal
10 Oct, 2022

You are reading this latest preprint version

COVID-19 Mortality Risk Prediction using Clinical and Laboratory Examination: Machine Learning Approach for Implementation

Status:

Journal Publication

Version 1

Abstract

Figures

Introduction

Material And Method

Data collection

Data cleaning and imputation

Feature selection

Model development

Model training and evaluation

Effect of using iterative imputer on models' performances

Statistical analysis

Ethics

Results

Basic Characteristics

Factors Associated with Mortality

Feature Selection Methods and Variable Importance

Discussion

Declarations

References

Additional Declarations

Supplementary Files

Status:

Journal Publication

Version 1