Study population
The dataset used in this study was obtained from the Department of Physical Medicine and Rehabilitation at OO University OO Hospital. The medical records of 333 visits by 162 patients with CVD who underwent CPET for CR between March 2020 and May 2022 were retrospectively analyzed. The analyzed medical records comprised an initial postoperative evaluation and up to five follow-up assessments during one year, with patients advised to visit every 2–3 months. The exclusion criteria were: (i) significant orthopedic conditions or pain that limited participation in the CPET, (ii) unstable cardiopulmonary conditions, and (iii) severe cognitive impairment. This study was approved by the Institutional Review Board of OO University Hospital (IRB no. 2022AN0365) and conducted according to the principles of the Declaration of Helsinki.
Study design
Two tasks were designed to predict VO2 peak using the frameworks illustrated in Fig. 1. Task 1 estimated VO2 peak at the same visit point using only clinical information and functional assessments, excluding CPET data. Task 2 predicted VO2 peak for the next visit point (post-CR VO2 peak) and change in VO2 peak between two visits (ΔVO2 peak, Positive: recovery, Negative: deterioration) based on the amount of exercise performed between visits and pre-CR data, including CPET results. For Task 1, the data utilized included demographic information and physical measurements collected at the initial visit, along with medical history. Additionally, questionnaire data and functional assessments involving strength and endurance tests conducted at each visit were used in Task 1. In Task 2, in addition to the top eight variables shown to have high importance in Task 1, the CPET results and CR information collected through exercise logs were analyzed.
The detailed items utilized for each task analysis are listed in Table 1. The clinical information was categorized into demographic and disease-related information. Variables such as sex, underlying CVD, history of cardiac procedures and surgeries, and use of cardiovascular medication, were encoded as binary data. Ejection fraction was assessed via echocardiography during hospitalization, and categorized as follows: normal (≥ 50%), mildly reduced (35–49%), and reduced (< 35%).
Table 1
Summary of factors used to predict VO2 peak
Demographic information | Age Sex Height Weight Body Mass Index (BMI) |
Disease-related information | Underlying cardiovascular disease Cardiac surgery history Cardiac procedure history Cardiovascular medication Ejection fraction (EF) on echocardiography |
Self-reported measures | Korean Activity Scale Index (KASI) EuroQol-5 dimension (EQ-5D) Drinking history Smoking history Previous exercise history |
Performance-based measures | Six-minute walk distance (6MWD) Hand grip strength (HGS) |
CPET information* | Peak oxygen consumption (VO2 peak) Peak ventilatory threshold (VT peak) Peak heart rate (HR peak) Ventilatory equivalent for carbon dioxide (VE/VCO2) Peak O2 Pulse Peak systolic BP (SBP peak) Peak diastolic BP (DBP peak) Peak Respiratory Exchange Ratio (RER peak) Peak Rate Pressure Product (RPP peak) Peak Rate Perceived Exertion (RPE peak) Total exercise duration |
CR exercise information* | Exercise type (resistance exercise, aerobic exercise (walking, cycling, others)) Duration (minutes per day) Frequency (days per week) Total exercise time (minutes per week) |
* indicates that it was used only in the Task 2 study. |
CPET, cardiopulmonary exercise test; CR, Cardiac rehabilitation. |
At each visit, patients underwent functional assessments consisting of self-reported measures, performance-based measures, and CPET. Performance-based measures included hand grip strength (HGS), which was measured using a JAMAR PLUS hand dynamometer. Measurements were taken alternately from the left and right hands twice each, with the highest values recorded17. The 6MWD was assessed by instructing the patients to walk as far as possible within six minutes, maintaining an intensity level between 3 (moderate) and 4 (somewhat strong) on the Borg CR 10 scale18.
The self-reported Korean Activity Scale Index (KASI) was used to evaluate the feasibility of 15 daily activities by assigning a weighted score to each item19. The EuroQol-5 Dimension (EQ-5D) was used to assess the quality of life and general health status across five dimensions: mobility, self-care, usual activities, pain/discomfort, and anxiety/depression20. From the second visit onwards, the patients completed questionnaires regarding their CR exercises, the details of which are specified in Table 1.
Patients performed symptom-limited progressive treadmill exercises as part of the standardized CPET. The exercise testing protocol was terminated at the patient's request or upon signs of gait instability or cardiovascular decompensation, following the guidelines of the American College of Sports Medicine21. The parameters derived from each assessment are listed in Table 1.
Following the American Association of Cardiovascular and Pulmonary Rehabilitation guidelines, patients were classified into low-, moderate-, and high-risk groups and prescribed target metabolic equivalents (METs) and heart rates (HR)22. During CR, patients maintained exercise diaries that included the duration of aerobic and resistance exercises, resting and peak HR, Rating of Perceived Exertion (RPE), and Respiratory Disturbance Index (RDI).
ML modeling
To ensure accurate validation and comparison of the dataset, regression and ML models were employed. Specifically, for Linear Regression models, basic linear regression is used along with various regularizations, such as LASSO23, RIDGE24, and SGD25. Additionally, six ML models were employed for comparison with the linear regression models. Support Vector Regression (SVR)26 was used because of its robustness against outliers and generalizations. Ensemble methods have been utilized, including GradientBoost27, RandomForest28, CatBoost29, XGBoost30, and LightGBM31. The optimal model was selected and parameter tuning was conducted accordingly.
For rigorous validation of each model's performance, five rounds of ML analyses were conducted, with each analysis randomly splitting the data into training and validation datasets. Given the small size of the dataset, the K-Fold Validation technique was employed to ensure that all data were used for both training and validation. This approach allows the performance of the model to be validated without data loss, offering a more reliable assessment than using a single validation set.
Among the metrics used to evaluate the regression model, the Sum of Squared Error (SSE), Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE) were selected to prevent outliers and overestimation. These indicators enable an objective comparison of performance, especially when the values are either positive or negative.
Feature importance refers to the techniques that determine the extent to which each feature in a dataset contributes to the predictive power of the model. The SHapley Additive explanation (SHAP) algorithm32 is a technique in Explainable AI that calculates the SHAP value, which is the Shapley value for the conditional expectation function of an ML model, to analyze how much each feature contributes to individual prediction outcomes. It assumes that if the predictive performance changes significantly with the removal of a specific variable, then that variable is highly important. This approach consistently provides coherent interpretations by identifying variable importance and their positive or negative relationship with the target metric, taking into account correlations between variables.