Study design and ethics
This study incorporated a single-center, prospective, cohort design. All participants underwent ultrasound imaging of the diaphragm preoperatively and at 1 and 2 weeks postoperatively, in addition to routine evaluation. This study was approved by the Institutional Review Board of Akita University Graduate School of Medicine (approval number: 2692) and conducted in accordance with the Declaration of Helsinki. The objective and content of the study were explained to the participants orally, as well as in written documents. Written informed consent was obtained after the participants were informed that they could participate based on their own free will and that their privacy would be reasonably protected.
Participants
Patients with esophageal cancer who were scheduled to undergo thoracoscopic esophagectomy at Akita University Hospital between June 2021 and May 2024 and received pre- and postoperative rehabilitation were enrolled. The exclusion criteria were as follows: salvage esophagectomy, open thoracic esophagectomy, transhiatal esophagectomy, laryngopharyngeal esophagectomy, two-stage surgery, scheduled to undergo tracheostomy, diagnosis of hemiplegia owing to cerebrovascular disease, diagnosis of dementia, and need for assistance with daily living.
Measurements
Ultrasound imaging of the diaphragm
We assessed the diaphragm thickening fraction (DTF) using ultrasound imaging 18. Diaphragm thickness (DT) was measured (with the patient in the supine position) using B-mode ultrasonography with a 10–5 MHz linear transducer (iViz air, FUJIFILM SonoSite, Inc.) at the zone of apposition between the eighth or ninth right intercostal space at the anterior axillary and midaxillary line. In this area, the diaphragm appears as a three-layered structure, viz. a non-echogenic central layer bordered by two echogenic layers, i.e., the peritoneum and diaphragmatic pleurae. DT at rest (DTRE), end of the maximal inspiration (DTEI), and end of maximal expiration (DTEE) was measured as the distance from the deep edge of the peritoneum to the superficial edge of the diaphragmatic pleura (Fig. 1) using an electronic caliper to the nearest 0.1 mm. The values of three subsequent measurements were averaged. The DTF was calculated using the following formula:
DTF = (DTEI – DTEE) / DTEE × 100.
Measurements were performed preoperatively, and 1 (7 to 9 postoperative days) and 2 weeks postoperatively (14 to 16 postoperative days). One investigator with more than 10 years’ experience in ultrasonography of the diaphragm performed the measurements.
Outcomes
The primary outcome was all PPCs, including atelectasis/sputum expectoration difficulty (e.g., need for bronchoscopy), pneumonia, or reintubation for respiratory failure. Postoperative pneumonia was defined as the presence of new or progressive infiltrates on chest radiography or computed tomography and the presence of at least one of the following parameters per the revised Uniform Pneumonia Score: temperature ≥ 38°C or ≤ 36°C and white blood cell count ≤ 4,000 or ≥ 10,000/µL 19. Other patients with closely approximated above criteria and who were diagnosed as pneumonia by a physician were also included. All PPCs were classified as grade II or higher according to the Clavien–Dindo classification 20.
Collection of clinical and confounding data
Demographic data (sex, age, height, weight, body mass index, smoking and alcohol status, comorbidities, blood test results, nutritional status, and pulmonary function), tumor-specific data (cancer histology, tumor location, clinical staging according to the Union for International Cancer Control 8th edition tumor–node–metastasis classification 21, and neoadjuvant therapy), operative details (surgical procedure, surgical time, blood loss volume, and reconstruction route), postoperative complications, and postoperative length of hospital stay were collated from the patient medical records.
Smoking status was assessed using the Brinkman index. Comorbidity status was assessed using the Charlson Comorbidity Index (CCI). Data on comorbidities including hypertension, diabetes mellitus, and chronic respiratory disease (chronic obstructive pulmonary disease, asthma, or interstitial lung disease) were collected. Hemoglobin, serum albumin and serum C-reactive protein levels were obtained from the hematological data. Malnutrition was assessed using the Global Leadership Initiative on Malnutrition (GLIM) criteria. Data on pulmonary function, including forced vital capacity (FVC), forced expiratory volume in 1 s (FEV1), and forced expiratory volume % in one second (FEV1/FVC), were collected and the percentage of the predicted value was calculated. Anastomotic leakage, recurrent laryngeal nerve palsy (RLNP), arrhythmia, and surgical site infection were designated as major postoperative complications of esophagectomy and designated as Clavien-Dindo grade ≥ 2. The exception was RLNP, which was recorded as grade 1 or higher.
Surgical procedure
Our standard surgical procedure consisted of right thoracoscopic (including robot-assisted surgery) esophagectomy with two-field or extended three-field lymphadenectomy, including the bilateral cervical, mediastinal, and abdominal lymph nodes. Standard reconstruction was performed with a gastric tube (or pedicled colon) through an open laparotomy via the retrosternal or posterior mediastinal route 12,22. Almost all of the patients were extubated in the operating room and admitted to the intensive care unit. Intraoperative ventilation was performed with one-lung ventilation using bronchial blockers and is regulated by the anesthesiologist according to lung protection strategies. The tidal volume was set ≤ 6–8 mL/kg with reference to ideal body weight. Peak inspiratory pressure was set ≤ 30 cmH2O whenever possible to minimize driving pressure. A reasonable level of positive end-expiratory pressure of 5–10 cmH2O was used, also considering the surgical visualization. Mild hypercapnia (partial pressure of carbon dioxide < 60 torr) was accepted, and the fraction of inspired oxygen was set as low as possible to maintain oxygen saturation ≥ 95%. Lung recruiting maneuvers were not routinely performed but were used as needed to relieve hypoxemia. Patients were extubated in the operating room and admitted to the intensive care unit.
Perioperative rehabilitation program
All patients underwent a similar perioperative rehabilitation program 12, which was conducted for 1–3 sessions before surgery and consisted of breathing exercises mainly for diaphragmatic breathing, sputum expectoration exercises (e.g., active cycle breathing technique), and orientation to postoperative rehabilitation, especially early mobilization. The postoperative rehabilitation program was initiated on the day after surgery and out-of-bed mobilization performed as early as possible. Positioning and sputum expectoration assistance were provided in the early postoperative period. Patients were typically discharged from the intensive care unit 5 days after surgery. Thereafter, aerobic exercise, such as with a bicycle ergometer, was commenced as soon as possible (approximately 7–10 days after surgery) and continued until hospital discharge.
Statistical analysis
All analyses were performed within a Bayesian framework with Markov chain Monte Carlo (MCMC) methods to calculate the posterior distributions of the outcomes. Posterior distributions were summarized using mean values as point estimates and percentile-based 95% credibility intervals (CrIs). We also extracted the probabilities of different effects from the posterior distributions. The MCMC estimation in each model was performed with 5,000 iterations, using the initial 2,500 iterations as a burn-in and four chains with random initial chain values. Convergence was confirmed visually using trace plots and the Gelman–Rubin convergence diagnostic (Rhat) < 1.1 for each variable. To accommodate missing data, a complete case analysis was planned if data were missing in less than 5% of all participants. If data were missing in more than 5% of all participants, multiple imputation methods were used.
First, we examined the longitudinal changes in the DTF using linear mixed models. The fixed effect included the measurement time point (preoperatively and 1 and 2 weeks postoperatively). The random effect consisted of the random intercept, allowing individual differences to be reflected at baseline. The prior distribution of the change in the DTF was assumed to be normal with mean of 0 and variance of 10000 as a weakly informative prior distribution.
Second, we examined the relationship between the DTF and PPCs using a logistic regression model. The dependent variable for the model was the PPC, and the independent variable DTF. We calculated the generalized propensity score for the degree of the DTF using a general linear model to estimate balancing weights. In this generalized propensity score model, a response variable was DTF and explanatory variables were the following characteristics that were selected as potential confounders for DTF and PPCs based on previous studies: age, smoking status (Brinkman index), comorbidity status (CCI), malnutrition (GLIM criteria), pulmonary function (FEV1/FVC), advanced cancer stage (clinical stage ≥ II) and postoperative RLNP. Stabilized inverse probability weights calculated using the generalized propensity score were used for the balancing weights. The Spearman’s correlation coefficient was used to assess the balance after weighting for covariates; the correlation coefficient < 0.1 was considered well-balanced 23. We conducted two logistic regression models, which were unweighted and weighted, to calculate the odds ratios (OR). The prior distribution of the coefficients of DTF was assumed to be normal with a mean of 0 and a variance of 100 as a weakly informative prior distribution.
Finally, a receiver operating characteristic (ROC) curve was plotted to validate the optimal cutoff of the DTF to predict PPCs. The prediction accuracy of PPCs was analyzed using the area under curve (AUC). The 95% confidence interval (CI) for the AUC were calculated with 2000 stratified bootstrap sampling. Additionally, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated using the standard formula. We selected the optimal cutoff values, based on the Youden’s index (sensitivity + specificity -1), and the threshold where the specificity was the highest in the range where the sensitivity (recall) > 80% for coverage.
All analyses were performed using R version 4.2.2 with the Tidyverse, the WeightIt, the brms, and the pROC packages.