Outcome prediction for adult mechanically ventilated patients using machine learning models and comparison with conventional statistical methods: a single-centre retrospective study

doi:10.21203/rs.3.rs-3632094/v1


Research Article


This work is licensed under a CC BY 4.0 License

Version 1


In this retrospective single-centre study spanning five years (2016–2021) and involving 2,368 adult Intensive Care Unit (ICU) patients requiring over four hours of mechanical ventilation (MV) in a tertiary care hospital, we investigated the feasibility and accuracy of using machine learning (ML) models in predicting outcomes post-ICU discharge compared to conventional statistical methods (CSM). The study aimed to identify associated risk factors impacting these outcomes. Poor outcomes, defined as ICU readmission, mortality, and prolonged hospital stays, affected 40.2% of the discharged MV patients. The Extreme Gradient Boost (XGBoost) ML model showed superior performance compared to CSM (Area under the receiver operating characteristic curve: 0.693 vs. 0.667; p-value = 0.03). At 95% specificity, XGBoost displayed enhanced sensitivity (30.6% vs. 23.8%) and accuracy (0.706 vs. 0.703) compared to CSM. Risk factors such as Glasgow Coma Score at ICU discharge, GCS best motor score during ICU admission, MV duration, ICU length of stay, and Charlson Comorbidity Index were identified. While both ML and CSM exhibited moderate accuracy, the study suggests ML algorithms have the potential for better predictive capabilities and individual risk factor identification, potentially aiding in the improvement of patient outcomes by identifying high-risk patients requiring closer monitoring. Further validation in larger studies is necessary, but the study underscores the potential for real-time application of ML algorithms developed from the increasing availability of electronic medical records (EMR).

Patients admitted to intensive care units (ICUs) who require mechanical ventilation (MV) have significantly high morbidity and mortality, even after a successful discharge from the ICU.¹ This includes the risk of re-admission to ICU after discharge to general wards, longer hospital length of stay (LOS), and higher risk of in-hospital mortality.¹ Previous retrospective studies suggest that more than 10% of ICU readmissions are potentially avoidable and preventable.^2,3 Patients who experience unplanned ICU readmissions have increased mortality, longer LOS, and cost compared with those not requiring ICU readmission during their hospital stay.^3,4 Hospital LOS has been shown to be a good clinical indicator of the quality of hospital management with direct and indirect consequential effects on the cost of hospitalization and patient satisfaction.⁵

Clinical determination of patients’ readiness for ICU discharge remains a regular challenge for any ICU care team, often subjective and relying on clinical intuition.² The number of ICU beds available may additionally influence the clinician's daily decision-making.³ For these reasons, improving risk stratification for post-ICU discharge patients at risk of suffering from poor clinical outcomes in the general ward may have significant benefits.^4–5

Historically, statistical scoring systems using conventional statistical methods (CSM) have been widely used to predict patients’ outcomes after ICU discharge.^4–5 However, the tools developed to date have used only a fraction of the data available through existing EMRs.⁴ In addition, the validation studies have demonstrated poor accuracy.^4–6 In other fields of medicine, new complex algorithms using machine learning (ML) have leveraged big data to create accurate predictive models.⁶ ML methods use computer algorithms to learn relationships among different data elements to inform outcomes without explicitly specifying the exact relationship, in contrast to CSM.^10,12 The CSM approach of dichotomizing continuous variables for outcome prediction may provide suboptimal stratification.^7–9 The ML models allow for the assessment of various variables in a non-linear fashion which overcomes the limitations of currently employed regression models.⁷ Additionally, ML can incorporate more elements into patient selection and provide more granularity that would likely add to the predictive value.^8–9 Therefore, ML has emerged as a powerful computational method to predict clinical outcomes better and play an important role in clinical practice.¹⁰

Numerous studies have shown that ML methods might be more accurate than CSM for early warning scores predicting clinical deterioration in hospitalised patients, for outcome prediction among cardiology patients,¹¹ and for recognising patients’ responses to cardiac resynchronisation therapy.¹² However, published data on ML applications for predicting outcomes among MV patients remain scarce. Previous small studies have attempted to develop ML models to predict outcomes based on a large array of clinical variables among mechanically ventilated critically ill patients.¹³ Our study aimed to compare poor outcome prediction for post-ICU discharge patients using ML methods versus CSM. We also sought to identify the risk factors associated with these poor outcomes. The poor outcomes assessed included all-cause ICU readmission within 48 hours, all-cause in-hospital general ward mortality within 14 days, and prolonged hospital stay longer than 14 days.^14–16

Study design, setting, and participants:

We performed a single-centre retrospective study at a 34-bed combined medical and surgical ICU in Ng Teng Fong General Hospital, a 700-bed tertiary care teaching hospital in Singapore. Clinical data were obtained from the electronic medical records (EMR), and a secure electronic research data system was used to capture, collect, and analyze study data in a pseudo-anonymous fashion. All consecutive adult patients (≥ 18 years of age) requiring mechanical ventilation in the ICU and who were discharged alive to the general ward between January 1st, 2016, and December 31st, 2021, were included. The study was approved by the National Healthcare Group (NHG) Domain-Specific Review Board (DSRB) medical ethics committee with a waiver of informed consent due to the non-interventional retrospective study design (NHG DSRB reference number: 2020/01167). The study was performed in accordance with the Helsinki Declaration of 1975.

Data Collection:

We utilized data extraction from EMR, ensuring minimal data loss. We collected data involving patient characteristics and demographics (age, gender, body mass index [BMI]), comorbidities (congestive heart failure, stroke, chronic pulmonary disease, chronic kidney disease, cirrhosis, and Charlson Comorbidity Index [CCI]), vital signs (heart rate, respiratory rate, and oxygen saturation [SpO₂]), laboratory values (pH, partial pressure of arterial carbon dioxide [PaCO₂], PaO₂/FiO₂ ratio, bicarbonate), and the cumulative urine output. We collected data for severity illness scoring systems such as APACHE II and sequential organ failure assessment (SOFA) scores on admission as well as the time of ICU discharge involving Glasgow Coma Score (GCS), ICU length of stay (LOS), and duration of MV. Using EMR, ICU readmission, death outcome in the general ward, and non-ICU prolonged hospital stay were identified using location-stamped vital signs and location of death as transcribed in both transfer and discharged/death summaries.

Inclusion criteria, exclusion criteria, and patient outcomes:

We included all adult (aged ≥ 18 years) ICU patients who required intubation and MV for more than 4 hours and who were successfully extubated and discharged to the general ward. We excluded patients who were: 1) terminally discharged and had a do-not-resuscitate (DNR) clinical status at the time of ICU discharge, 2) intubated and mechanically ventilated for less than four hours. The 4-hour threshold was chosen to differentiate patients who required only short-term MV for elective procedures or surgeries, were deemed well, and were unlikely to benefit from outcome prediction from those who needed longer-term MV.¹⁷

The outcomes assessed were a composite of all-cause ICU readmission within 48 hours, all-cause in-hospital general ward mortality within 14 days, and prolonged hospital stay longer than 14 days. ICU readmission was defined as transferring patients back to the ICU after being initially discharged from the ICU to general wards within 48 hours.^{4–6, 18} All-cause in-hospital general ward mortality was defined as death from any illness during admittance to the general ward after being discharged from the ICU within 14 days.¹⁹ Prolonged hospital stay was defined as prolonged admittance to the general ward for longer than 14 days post-discharge from an ICU setting.^18–20 Risk factors associated with poor outcomes post-ICU discharge were also assessed.

Data Analysis:

Descriptive statistics were used to characterize the patient population and distributions of these predictor variables. For continuous variables, a t-test for independent samples was used to assess differences between groups. Categorical variables were analysed using chi-squared tests. Furthermore, to evaluate the associations between the factors of ICU readmission and our study outcomes, we employed logistic regression analysis. For all statistical tests conducted, we have included the corresponding p-values to quantify the significance of the observed associations.
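As an illustration of the categorical comparison described above, the following minimal sketch applies a Pearson chi-squared test to the stroke counts reported in Table 1. It uses only the Python standard library; the function name `chi2_2x2` is hypothetical, and the absence of a continuity correction is an assumption, since the text does not state which variant was used.

```python
import math

def chi2_2x2(a, b, c, d):
    """Pearson chi-squared test (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # With 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Stroke row of Table 1: 51 of 1415 (good outcome) vs 124 of 953 (poor outcome).
chi2, p_value = chi2_2x2(51, 1415 - 51, 124, 953 - 124)
print(f"chi2 = {chi2:.1f}, p < 0.001: {p_value < 0.001}")
```

Under these assumptions the p-value falls well below 0.001, in line with the value reported for this row of Table 1.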

We compared the following four ML models with the CSM model (logistic regression) for clinical feature screening and model construction in the training data set: Random Forest (RF), Explainable Boosting Machine (EBM), Extreme Gradient Boost (XGBoost), and Multilayer Perceptron (MLP). The primary outcome measure was the area under the receiver operating characteristic curve (AUROC) of each model’s prediction of poor outcomes post-ICU discharge, compared with CSM. The secondary outcome measures were the accuracy, sensitivity, and specificity of each model’s prediction compared with CSM, together with the identification of risk factors associated with poor outcomes post-ICU discharge.
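The two headline metrics can be sketched in a few lines. The example below is a minimal illustration on made-up labels and scores, not the study's evaluation code: `auroc` uses the rank-based definition (the probability that a randomly chosen poor-outcome patient receives a higher score than a randomly chosen good-outcome patient), and `sensitivity_at_specificity` reports the best sensitivity among thresholds meeting a specificity floor. Both function names and the toy data are assumptions.

```python
def auroc(y_true, scores):
    """AUROC as P(score of a positive > score of a negative); ties count as 0.5."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def sensitivity_at_specificity(y_true, scores, min_specificity=0.95):
    """Highest sensitivity over thresholds whose specificity is at least min_specificity."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    best = 0.0
    for threshold in sorted(set(scores)):
        # A patient is predicted "poor outcome" when the score exceeds the threshold.
        specificity = sum(n <= threshold for n in neg) / len(neg)
        if specificity >= min_specificity:
            sensitivity = sum(p > threshold for p in pos) / len(pos)
            best = max(best, sensitivity)
    return best

# Toy data (hypothetical): 1 = poor outcome, 0 = good outcome.
y_true = [0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.1, 0.2, 0.3, 0.6, 0.4, 0.5, 0.7, 0.9]
print(auroc(y_true, scores))
print(sensitivity_at_specificity(y_true, scores, 0.95))
```

Fixing specificity first and then reading off sensitivity mirrors how the models are compared at 95% specificity in this study.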

Prediction Modelling for Conventional Statistical Methods (CSM) using Logistic Regression

Logistic Regression is a well-established and widely recognized statistical technique, particularly suited for binary and categorical outcome prediction which has been used in many clinical research studies.^8,15 Logistic Regression estimates the probability of a specific event occurring, in our case, the prediction of clinical severity outcomes (Readmission to ICU), based on the relationships between multiple predictor variables. In this study, factors with a p-value < 0.10 in univariate analysis were entered into the logistic regression model.
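For reference, the standard form of a multivariable logistic regression model is written out below. The notation is generic and is not taken from the study: $Y = 1$ denotes a poor outcome, $x_1, \dots, x_k$ are the predictors retained after univariate screening, and $\beta_0, \dots, \beta_k$ are the fitted coefficients.

```latex
\[
P(Y = 1 \mid x_1, \dots, x_k)
  = \frac{1}{1 + \exp\!\left(-\left(\beta_0 + \sum_{j=1}^{k} \beta_j x_j\right)\right)},
\qquad
\mathrm{OR}_j = e^{\beta_j}
\]
```

Here $\mathrm{OR}_j$ is the odds ratio associated with a one-unit increase in $x_j$, holding the other predictors fixed.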

Prediction Modelling for Machine Learning (ML)

The model was made using a decision-tree-based technique. Variables with poor-outcome occurrences accounting for less than 10% were excluded from the machine learning analysis.^{20, 21} The data set was randomly split into two data sets: a training (80%) data set, which was used to develop the models (training group), and an internal test (20%) data set, which was used to validate the constructed models (test group). Based on the ML models’ predictions, we additionally assessed the precision (also called positive predictive value) and compared it across models in the test cohort.²¹ For ML model data processing, we also integrated a feature importance ranking technique to identify risk factors and generated partial plots to identify cut-offs.
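The random 80/20 split and the precision metric can be sketched as follows. This is a standard-library illustration only: the helper names, the seed, and the choice to round the test-set size down are assumptions, and the study's own splitting code is not described in the text.

```python
import random

def split_indices(n, test_fraction=0.2, seed=0):
    """Shuffle 0..n-1 and return (train, test) index lists.

    The test size is floor(n * test_fraction); this rounding rule is an assumption.
    """
    indices = list(range(n))
    random.Random(seed).shuffle(indices)
    n_test = int(n * test_fraction)
    return indices[n_test:], indices[:n_test]

def precision(y_true, y_pred):
    """Positive predictive value: true positives divided by all predicted positives."""
    true_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    predicted_pos = sum(y_pred)
    return true_pos / predicted_pos if predicted_pos else float("nan")

train_idx, test_idx = split_indices(2368)
print(len(train_idx), len(test_idx))
print(precision([1, 0, 1, 1], [1, 1, 0, 1]))
```

With 2,368 patients, rounding the test size down gives 1,895 training and 473 test patients, the cohort sizes reported in the results.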

All data processing and prediction modelling were performed in Python (v.3.9.12, Python Software Foundation, Wilmington, Delaware), and the statistical tests were conducted using SciPy (v1.7.3) package libraries.

During the five-year study period, a total of 3,682 ICU patients were mechanically ventilated for more than four hours. Of these, 1,314 (35.6%) patients were excluded from the analysis (Figure 1).

Figure 1: Flow chart of the ventilated patient population and their outcomes.

The remaining 2,368 patients were included, of whom 1,895 (80%) were assigned to the training cohort and 473 (20%) to the test cohort. The baseline characteristics are shown in Table 1. Among the included patients, the rates of ICU readmission, general ward mortality, and prolonged hospital stay were 12.7%, 3.1%, and 24.4%, respectively. The median length of hospital stay after ICU discharge was 5.0 days in the good outcome group versus 14.9 days in the poor outcome group.

Table 1: Baseline characteristics of the patient population
	Total (n=2368)	Good Outcome (n=1415)	Poor Outcome (n=953)	p-value
*Demographics*
Age [median (IQR)]	63.0 (52.0-72.0)	61 (49-71)	65 (55-74)	0.001**
Male gender - n (%)	1547 (65.3%)	908 (64.2%)	639 (67.1%)	0.150
Admission diagnosis (Medical) - n (%)	1371 (57.9%)	805 (56.9%)	566 (59.4%)	0.226
CCI [median (IQR)]^#	2.0 (1.0-2.0)	1.0 (0.0-2.0)	1.0 (0.0-2.0)	<0.001**
BMI [median (IQR)]	24.0 (20.9-27.7)	24.1 (21.0-27.8)	23.8 (20.6-27.4)	0.241
APACHE II at ICU admission [median (IQR)]	17.0 (13.0-22.0)	17.0 (12.0-23.0)	17.0 (13.0-22.0)	0.589
SOFA Score at ICU admission [median (IQR)]^#	3.0 (2.0-5.0)	3.0 (2.0-5.0)	3.0 (2.0-5.0)	<0.001**
Vasopressor support during ICU stay - n (%)	2366 (99.9%)	1414 (99.9%)	952 (99.9%)	0.779
Heart Failure as the cause of index ICU admission - n (%)	270 (11.4%)	183 (12.9%)	87 (9.1%)	0.004**
Pneumonia as the cause of index ICU admission - n (%)	473 (20.0%)	270 (19.1%)	203 (21.3%)	0.187
Duration of MV, hours [median (IQR)]	27.5 (15.0-55.0)	20.0 (10.0-43.0)	29.0 (6.0-81.0)	<0.001**
ICU LOS (hours) - [median (IQR)]	69.4 (43.2-125.9)	62.0 (41.3-100.0)	89.9 (47.5-168.7)	<0.001**
*Comorbidities n (%)*
COPD	95 (4.0%)	72 (5.1%)	23 (2.4%)	0.001**
Other Chronic respiratory diseases - Severe TB/ ILD/ Chest wall deformity/OSA/ Bronchiectasis	95 (4.0%)	48 (3.4%)	47 (4.9%)	0.061
Stroke	175 (7.4%)	51 (3.6%)	124 (13.0%)	<0.001**
Chronic Kidney Disease	145 (6.1%)	63 (4.5%)	82 (8.6%)	<0.001**
Cirrhosis	52 (2.2%)	31 (2.2%)	21 (2.2%)	0.984
Immunocompromised state	5 (0.2%)	3 (0.2%)	2 (0.2%)	0.992
*ICU discharge*
Heart rate at ICU discharge [median (IQR)]	84.0 (74.0-94.0)	84.0 (74.0-94.0)	85.0 (75.0-95.0)	0.040*
Respiratory rate at ICU discharge [median (IQR)]	19.0 (18.0-22.0)	19.0 (18.0-22.0)	19.0 (18.0-22.0)	0.103
GCS at ICU discharge [median (IQR)]	15.0 (14.0-15.0)	15.0 (15.0-15.0)	15.0 (13.0-15.0)	<0.001**
Motor score of GCS at ICU discharge [median (IQR)]^#	6.0 (6.0-6.0)	6.0 (6.0-6.0)	6.0 (6.0-6.0)	<0.001**
SOFA score at ICU discharge [median (IQR)]	2.0 (2.0-4.0)	2.0 (2.0-3.0)	2.0 (2.0-4.0)	0.002**
Respiratory secretions required assistance at ICU discharge, n (%)	174 (7.3%)	106 (7.5%)	68 (7.1%)	0.741
Fluid balance in the last 48 hours at ICU discharge, ml [median (IQR)]	2225.0 (-872.0-7024.0)	1951.1 (-562.0-6248.2)	2620.0 (-1705.0-8372.6)	0.978
FiO₂ at ICU discharge [median (IQR)]	30.0 (25.0-35.0)	30.0 (25.0-35.0)	30.0 (26.0-35.0)	0.788
SpO₂ at ICU discharge [median (IQR)]	97.0 (95.0-99.0)	97.0 (95.0-99.0)	97.0 (95.0-99.0)	0.070
pH at ICU discharge [median (IQR)]	7.43 (7.39-7.47)	7.42 (7.38-7.46)	7.44 (7.40-7.47)	<0.001**
PaO₂ at ICU discharge [median (IQR)]	88.2 (71.0-128.4)	89.8 (72.0-137.3)	89.8 (70.3-132.4)	0.310
PaO₂/FiO₂ ratio at ICU discharge [median (IQR)]	316.0 (248.0-386.0)	314.0 (247.0-386.0)	322.0 (250.0-392.0)	0.449
PaCO₂, mmHg at ICU discharge [median (IQR)]	36.1 (32.0-40.4)	36.6 (32.5-40.8)	35.2 (31.4-39.7)	<0.001**
Bicarbonate, mmol/L at ICU discharge [median (IQR)]	22.0 (20.0-25.0)	23.0 (20.0-25.0)	22.0 (20.0-25.0)	<0.001**

Abbreviations: APACHE - Acute Physiology and Chronic Health Evaluation, BMI – Body mass index, CCI - Charlson Comorbidity Index, COPD- chronic obstructive pulmonary disease, FiO2 – fraction of inspired oxygen, ICU – Intensive care unit, ILD – interstitial lung disease, IQR – Interquartile range, MV – Mechanical ventilation, OSA – obstructive sleep apnea, PaO2 - partial pressure of arterial oxygen, PaCO2 - partial pressure of arterial carbon dioxide, SD – Standard Deviation, SOFA - Sequential Organ Failure Assessment, SpO2 – oxygen saturation, TB - tuberculosis

* p-value<0.05; ** p-value<0.01

# Although the groups display similar median values, the test compares the entire distributions and can detect differences in shape and spread beyond central tendency, which accounts for the statistically significant difference between the groups.

A total of 22 variables were used in the machine learning analysis after excluding variables with poor-outcome occurrences accounting for less than 10%. The machine learning feature importance ranking (Figure 2) identified the five most important predictor variables (GCS at the time of ICU discharge, duration of MV, ICU LOS, GCS motor response, and CCI), which significantly enhanced our model's predictive accuracy.

Figure 2: Twenty-two most important variables in the XGBoost model. The features represent each variable's relative importance.

XGBoost had the highest AUROC of 0.693 and demonstrated the best precision and accuracy, followed by Random Forest (AUROC 0.679), EBM (AUROC 0.677), CSM Logistic Regression (AUROC 0.667), and lastly, multilayer perceptron (AUROC 0.646), as shown in Table 2 and Figure 3. Three ML models (XGBoost, Random Forest, and EBM) were more accurate than the CSM model in the test cohort (p-value <0.01). Furthermore, at a specificity of 95%, the XGBoost model had the highest sensitivity at 27.3% and additionally highest accuracy at 70.6%.

Table 2: Area under the receiver operating characteristic curve (AUROC) comparison of different machine learning models in the internal validation report.

Model	AUROC	Standard Deviation	Accuracy	Sensitivity when Specificity > 95%
XGBoost	0.693*	0.0042	0.706	0.273
EBM	0.677*	0.0037	0.705	0.243
RF	0.679*	0.0076	0.708	0.252
Multilayer Perceptron	0.646	0.0097	0.668	0.277
Logistic Regression	0.667	0.0068	0.703	0.238

Abbreviations: AUROC - Area under the receiver operating characteristic curve, EBM - Explainable Boosting Machine, XGBoost - Extreme Gradient Boost, RF - Random Forest.

*p-value<0.05

Figure 3: Comparison of area under the receiver operating characteristic curve (AUROC) curves, overview comparison of the five models.

Partial plots revealed significant associations between specific important features and an increased incidence of the composite poor outcome post-ICU discharge. A GCS lower than 13 with a motor GCS score of less than 5, a duration of MV longer than 100 hours, an extended ICU LOS of more than 400 hours, as well as CCI scores of 3 or more exhibited a substantial correlation with higher rates of adverse outcomes (Figure 4).

Figure 4: Partial plot of the effect of (A) Glasgow Coma Score (GCS), (B) Mechanical ventilation in hours, (C) ICU length of stay, (D) Charlson Comorbidity Index (CCI), (E) GCS Motor Score on the risk of poor outcome post-ICU discharge across different values in the XGBoost model.
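The idea behind a partial plot can be sketched as follows: one feature is forced to each value on a grid for every patient, and the model's predictions are averaged at each grid value. The code below is a minimal illustration only. `toy_model` is a hypothetical stand-in for a fitted model, its thresholds merely echo the cut-offs reported above, and the patient rows are invented.

```python
def partial_dependence(predict, rows, feature, grid):
    """Average prediction over all rows when `feature` is forced to each grid value."""
    curve = []
    for value in grid:
        predictions = [predict({**row, feature: value}) for row in rows]
        curve.append(sum(predictions) / len(predictions))
    return curve

def toy_model(row):
    """Hypothetical risk model: risk rises when GCS < 13 or MV duration > 100 hours."""
    risk = 0.2
    if row["gcs"] < 13:
        risk += 0.3
    if row["mv_hours"] > 100:
        risk += 0.2
    return risk

# Invented patient rows, for illustration only.
rows = [
    {"gcs": 15, "mv_hours": 20},
    {"gcs": 14, "mv_hours": 150},
    {"gcs": 10, "mv_hours": 60},
    {"gcs": 15, "mv_hours": 30},
]
grid = [9, 12, 13, 15]
curve = partial_dependence(toy_model, rows, "gcs", grid)
for value, risk in zip(grid, curve):
    print(f"GCS = {value}: mean predicted risk = {risk:.2f}")
```

A step in the averaged curve, here between GCS 12 and 13, is the kind of feature that would be read off a partial plot as a candidate cut-off.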

ML models such as XGBoost, compared to CSM, could significantly improve the ability to distinguish ICU patients at risk for poor outcomes post-ICU discharge while identifying associated risk factors. In an effort to bring precision medicine closer to reality, ML model results can be used to determine risk probability in real time from data in the EMR.^{5,18, 22} Such high-risk patients could benefit from closer monitoring in the general wards post-ICU discharge.

ML models are increasingly used in various fields of medicine to improve diagnosis and prognosis. The impact of ML on radiologists has been discussed in the literature.²³ A systematic review and meta-analysis found that ML was comparable to healthcare professionals in detecting diseases from EMR and medical imaging.²⁴ Another study found that ML algorithms provide efficient and effective data analysis models to uncover hidden patterns and other meaningful information from the data.²⁵ ML has become the main tool in many hospitals worldwide for automatic COVID-19 classification and detection using chest X-ray images or other types of images.²⁶

The Precision Medicine Initiative proposes that medical professionals avoid oversimplification and consider individual variability to improve the clinical decision-making process.²⁷ Ongoing difficulties with the reproducibility, explainability, and replicability of ML-driven clinical research may undermine stakeholder confidence in ML integration into clinical research.¹⁴ However, increased reliability and trustworthiness may be built by running ML models in clinical research contexts together with traditional research methods to show that the ML methods perform at least as well as conventional approaches like logistic regression techniques.^10,12-15 Therefore, our study adds to the understanding of existing literature by comparing the two approaches for outcomes of ICU post-discharge.^6-8

The ML model in our study identified a low GCS score at ICU discharge, a high CCI score at hospital admission, prolonged ICU LOS, and prolonged MV duration as predictors of poor outcomes. Many of these clinical features, such as the GCS score, SOFA score, prolonged ICU LOS, and CCI score, have been widely used to determine patients’ progression and status for many purposes other than outcome prediction after ICU discharge.^2,4,6-8

Our study has a few limitations. As a single-centre retrospective study, potential unidentified confounders exist. The usual limitations of ML remain, including data biases, logistics of prospective validation, and the ethical issues associated with machines making decisions in a research context.²⁸ Previous studies have shown that algorithms may perform significantly differently in test data sets than in training data sets.^14,33 Therefore, the results of this study should be validated among larger study populations.

Although our analysis yielded a statistically significant p-value of <0.01 in the AUROC values among the various models studied, we acknowledge that further analyses, including clinical validation and cost-benefit assessments, are required to better understand how such models can impact patient care and healthcare systems. We also emphasise the importance of ethical considerations, collaboration with clinicians, and a patient-centred approach to ensure that future research contributes meaningfully to the field of precision medicine. Whether ML is adopted by most intensivists and critical care researchers when discharging patients from the ICU will likely depend on the successful resolution of concerns fueling hesitancy to embrace ML compared to existing scoring systems or CSM.

ML algorithms can predict ICU discharge outcomes and identify individual risk factors better than CSM, although both methods have moderate accuracy. Improved outcome prediction modelling may better identify high-risk patients requiring closer monitoring. Also, with the increasing availability of EMR, the models developed with ML algorithms can be deployed for prospective real-time applications. However, further validation in larger, multi-centre studies is required to establish the utility of ML models.

Ethics approval and consent to participate

The study was approved by the National Healthcare Group (NHG) Domain-Specific Review Board (DSRB) medical ethics committee with a waiver of informed consent due to the non-interventional retrospective study design (NHG DSRB reference number - 2020/01167). The study has been performed in accordance with the Helsinki Declaration of 1975.

Consent for publication

Not applicable

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Competing interests

The authors declare that they have no competing interests.

Funding

This study received no financial support. All authors have disclosed that they do not have any conflicts of interest.

Authors' contributions

Kansal, Ong, and How helped with conceptualization, methodology, data curation, resources, investigation, and formal analysis. Kansal, Chong, Ong, How, Khan, and Ngiam helped write the original draft. Kansal helped with project administration and supervision. Ong and How helped in data validation and visualization. All authors helped in writing—reviewing, and editing.

Acknowledgements

The authors would like to thank Ms Eleanor Dela Pena, Ms Patricia Leong and all Respiratory Therapists and Nurses from the Intensive Care Unit, Ng Teng Fong General Hospital, for their support with data collection.

Schneeweiss S. Learning from Big Health Care Data. N Engl J Med. 2014;370:2161–3. 10.1056/NEJMp1401111.
Long J, Wang M, Li W, et al. The risk assessment tool for intensive care unit readmission: A systematic review and meta-analysis. Intensive Crit Care Nurs. 2023;76(12):103378. 10.1016/j.iccn.2022.103378;47.
Deo RC. Machine learning in medicine. Circulation. 2015;132:1920–30. 10.1161/circulationaha.115.001593.
Rapsang AG, Shyam DC. Scoring systems in the intensive care unit: A compendium. Indian J Crit Care Med. 2014;18(4):220–8. 10.4103/0972-5229.130573.
Rojas JC, Carey KA, Edelson DP, et al. Predicting Intensive Care Unit Readmission with Machine Learning Using Electronic Health Record Data. Ann Am Thorac Soc. 2018;15(7):846–53. 10.1513/AnnalsATS.201710-787OC.
Shi K, Ho V, Song JJ, Bechler K, Chen H. J. Predicting Unplanned 7-day Intensive Care Unit Readmissions with Machine Learning Models for Improved Discharge Risk Assessment. AMIA Jt Summits Transl Sci Proc. 2022;2022:446–455.
Collins GS, de Groot JA, Dutton S, et al. External validation of multivariable prediction models: a systematic review of methodological conduct and reporting. BMC Med Res Methodol. 2014;14:1–11. 10.1186/1471-2288-14-40.
Churpek MM, Yuen TC, Winslow C, et al. Multicenter comparison of machine learning methods and conventional regression for predicting clinical deterioration on the wards. Crit Care Med. 2016;44:368–74. 10.1097/CCM.0000000000001571.
Giordano C, Brennan M, Mohamed B, et al. Accessing Artificial Intelligence for Clinical Decision-Making. Front Digit Health. 2021;3:645232. 10.3389/fdgth.2021.645232.
Magunia H, Lederer S, Verbuecheln R, et al. Machine learning identifies ICU outcome predictors in a multicenter COVID-19 cohort. Crit Care. 2021;25(1):295. 10.1186/s13054-021-03720-4.
Wasfy JH, Singal G, O'Brien C, et al. Enhancing the prediction of 30-day readmission after percutaneous coronary intervention using data extracted by querying the electronic health record. Circ Cardiovasc Qual Outcomes. 2015;8:477–85. 10.1161/CIRCOUTCOMES.115.001855.
Motwani M, Dey D, Berman DS, et al. Machine learning for prediction of all-cause mortality in patients with suspected coronary artery disease: a 5-year multicentre prospective registry analysis. Eur Heart J. 2017;38(7):500–7. 10.1093/eurheartj/ehw188.
Kalscheur MM, Kipp RT, Tattersall MC, et al. Machine learning algorithm predicts cardiac resynchronization therapy outcomes: lessons from the companion trial. Circ Arrhythm Electrophysiol. 2018;11(1):e005499. 10.1161/CIRCEP.117.005499.
Tefera GM, Feyisa BB, Umeta GT, et al. Predictors of prolonged length of hospital stay and in-hospital mortality among adult patients admitted at the surgical ward of Jimma University Medical Center, Ethiopia: prospective observational study. J Pharm Policy Pract. 2020;13:24. 10.1186/s40545-020-00230-6.
Sanatinia R, Burns A, Crome P, et al. Factors associated with shorter length of admission among people with dementia in England and Wales: retrospective cohort study. BMJ Open. 2021;11:e047255. 10.1136/bmjopen-2020-047255.
Fogg C, Griffiths P, Meredith P, et al. Hospital outcomes of older people with cognitive impairment: An integrative review. Int J Geriatr Psychiatry. 2018;33(9):1177–97. 10.1002/gps.4919.
Cuthill JA, Jarvie L, McGovern C, et al. The effects of sedation cessation within the first four hours of intensive care unit admission in mechanically ventilated critically ill patients - a quality improvement study. E Clin Med. 2020;26:100486. 10.1016/j.eclinm.2020.100486.
Badawi O, Breslow MJ. Readmissions and Death after ICU Discharge: Development and Validation of Two Predictive Models. PLoS ONE. 2012;7(11):e48758. 10.1371/journal.pone.0048758.
Cabral CDR, Teixeira C, Rosa RG, et al. Mortality, morbidity, and quality-of-life outcomes of patients requiring ≥ 14 days of mechanical ventilation: a 12-month post-intensive-care-unit cohort study. Rev Bras Ter Intensiva. 2019;31(3):425–7. 10.5935/0103-507X.20190058.
Jaotombo F, Pauly V, Fond G, et al. Machine-learning prediction for hospital length of stay using a French medico-administrative database. J Mark Access Health Policy. 2022;11(1):2149318. 10.1080/20016689.2022.2149318.
Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G. Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data. JAMA Intern Med. 2018;178(11):1544–7. 10.1001/jamainternmed.2018.3763.
Desautels T, Das R, Calvert J, et al. Prediction of early unplanned intensive care unit readmission in a UK tertiary care hospital: a cross-sectional machine learning approach. BMJ Open. 2017;7(9):e017199. 10.1136/bmjopen-2017-017199.
Li Y, Yao C, Ma T, et al. Readmission prediction via deep contextual embedding of clinical concepts. PLoS ONE. 2018;13:e0195024. 10.1371/journal.pone.0195024.
Adams SA, Petersen C. Precision medicine: opportunities, possibilities, and challenges for patients and providers. J Am Med Inform Assoc. 2016;23(4):787–90. 10.1093/jamia/ocv215.
Ahuja AS. The impact of artificial intelligence in medicine on the future role of the physician. PeerJ. 2019;7:e7702. 10.7717/peerj.7702. PMID: 31592346; PMCID: PMC6779111.
Liu X, Faes L, Kale AU, Wagner SK, Fu DJ, Bruynseels A, et al. A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis. Lancet Digit Health. 2019;1:e271–97. 10.1016/S2589-7500(19)30123-2.
Shamshirband S, Fathi M, Dehzangi A, Chronopoulos AT, Alinejad-Rokny H. A review on deep learning approaches in healthcare systems: Taxonomies, challenges, and open issues. J Biomed Inform. 2021;113:103627. 10.1016/j.jbi.2020.103627. Epub 2020 Nov 28. PMID: 33259944.
Alzubaidi L, Zhang J, Humaidi AJ, et al. Review of deep learning: concepts, CNN architectures, challenges, applications, future directions. J Big Data. 2021;8:53. https://doi.org/10.1186/s40537-021-00444-8.
