Study aim, design, and setting. Our aim was to evaluate lung ultrasound as an alternative to CT for monitoring COVID-19 pneumonia lung involvement on the ICU. We conducted a prospective observational cohort study of laboratory-confirmed COVID-19 cases in two academic adult ICUs (Amsterdam UMC, location VUmc, the Netherlands and LUMC, Leiden, the Netherlands). Bedside ultrasound evaluations are regularly performed in these centers, providing there is a relevant clinical indication and an available certified ultrasound physician. The local ethics boards approved the study and usage of data gathered during routine ultrasound without informed consent. This trial was registered in Dutch Trial Registry (ID: NL8584) and was drafted in compliance with the STROBE guidelines (17).
Participants and outcome variables. Adult (>18 years) patients admitted to the ICU and diagnosed with COVID-19 between April 1st and May 30th were screened. They were included when a clinically indicated 12-zone lung ultrasound had was performed and recorded within 48 hours of a chest CT-scan. Baseline characteristics (age, sex, height, weight), ventilator settings, arterial blood gas values, and Sequential Organ Failure Assessment score (SOFA) were collected from the electronic patient database as close to time of CT as possible. The ratio of arterial oxygen partial pressure to fractional inspired oxygen (P/F ratio) was calculated based on arterial blood gas values and concurrent ventilator oxygen setting. We used the Kigali Modification of the Berlin Definition of ARDS (so non-ventilated patients could also be classified) to classify COVID-19 cases as mild, moderate, and severe (18). P/F ratio for non-ventilated patients on low-flow oxygen was estimated using an established conversion method (19). Follow-up started at intubation or, for non-ventilated patients, at ICU admission. Patients were followed for the longest possible follow-up until discharge, death, or, when still admitted, until drafting of this manuscript. An inclusive composite outcome of death or ICU stay >30 days was calculated.
Lung Ultrasound. Images were acquired or supervised by certified clinicians (n=8) using the Sonosite-EDGE II or Philips Lumify ultrasound system. Certification entailed a two-day course and thereafter supervision by a physician with extensive ultrasound experience (>5 years) until sufficient expertise was reached (a minimum of 30 exams) prior to this study (20). All measurements were performed on supine patients using a 10-5 MHz linear transducer (VUmc) or a Lumify 4-1MHZ MHz S4-1 broadband phased array transducer (LUMC) with the lung examination setting with a depth of >6 centimeters (21). Measurements were conducted according to the 12-zone LUS protocol: one superior and inferior zone on ventral, lateral, and dorsal areas of each hemithorax (22). Offline analyses of ultrasound images were performed by researchers blinded to the patient’s CT results. The offline reviewers determined the LUS of involvement: normal=0, well-separated B-lines=1; coalescent B-lines, small consolidation or quad sign (< 1 cm) =2, consolidation, large consolidation or quad sign (> 1cm)=3; of each zone(22). A global score was calculated by summing the scores of all 12 lung regions, ranging from 0 (i.e. all zones with normal aeration) to 36 (i.e. all zones with large consolidation or large quad signs). Regional scores were calculated by summing the field scores of ventral, lateral, and dorsal regions (ranging from 0 to 12) or superior and inferior regions (ranging from 0 to 18). A ventral-1-lateral score (3 views per hemithorax) was derived by summing the ventral and lateral scores without the ventroinferior points (23). Missing scores values from one or more regions that were non-examinable were resolved by expressing the lung ultrasound score as an ‘involvement index’(LUSI): (actual score / total score achievable) × 100. The number of potential regions was at most 12, 6, or 4 for the LUSI and regional scores, respectively. As such, an involvement percentage of 0% would represent normal aeration on all lung fields and a score of 100% would represent consolidation on all lung fields.
Chest computed tomography. Chest CT was performed on two multidetector CT scanners: Siemens Somatom Drive (Siemens Healthineers, Erlangen, Germany), and a GE Discovery 750 HD (GE Healthcare, Milwaukee, MI). All patients underwent CT scanning of the chest in the supine position during end-inspiration. Slice thickness for all scanners was between 0.625-1.25 mm. HD lung (GE Healthcare) kernel, pulmonary Br59F kernel (Siemens Healthineers) were applied. The chest CT was performed for clinical reasons at any point after the definitive diagnosis was made, on indication of the treating physician, and evaluated by a radiologist blinded for lung ultrasound results. Further follow-up CT scans were performed because of non-resolving or worsening clinical picture. The radiologists in the Netherlands determined a CT-SS based on a previously validated study in severe acute respiratory syndrome (24). The five lobes of the lung were each scored for involvement with ground glass or consolidation: 0% (0 points), 1-5% (1 point), 5-25% (2 points), 25-50% (3 points), 50-75% (4 points), or >75% (5 points). Data on the CT-SS, ranging from 0 to 25, was collected from the radiology report. A CT-SS ‘involvement index’ (CTSI), with 0% representing no involvement, and 100% representing >75% involvement on all five lobes, was also calculated for the CT-SS (CTSI).
Statistical analysis. Statistical analyses were performed using SPSS IBM version 22 (SPSS Inc., Chicago, IL, USA) and the R language for statistical computing with the tidyverse suite of packages (25). Demographic, clinical, and outcome variables were presented as means ± standard deviations (±SD), medians and interquartile range [IQR], or numbers (percent %) when appropriate. A Shapiro-Wilk’s test, visual inspection of histograms, and Q-Q plots were used to determine data distribution.
Baseline and different zones. An ANOVA one-way (or Kruskal Wallis if non-parametric) test was used to compare baseline characteristics across categories of ARDS severity. The same test was used to determine whether there were differences in (regional) LUSI, CTSI, and across ARDS severity categories.
Primary outcome: correlation, agreement and concordance. The Spearman’s rank test was used to assess the correlation coefficient (r) between LUSI and CTSI on all examinations. We used the same test to assess the correlation between different zone regions of LUSI and CTSI for all examinations and only for unique patients. A correlation coefficient between 0.10 and 0.39 indicates weak, 0.4 and 0.69 moderate, and 0.70 and 0.89 a strong positive relationship (26). A Bland-Altman plot was created to assess agreement. The change in LUSI was assessed by correlating the difference (Δ) of sequential LUSI and CTSI examinations with a Spearman’s rank test. The overall concordance was assessed by allocating full concordance (1) to changes in the same direction, discordance (0) to changes opposite directions, or tie (0.5) when either LUSI of CTSI did not change.
Secondary outcome: prediction of outcomes. A logistic regression analysis was performed to assess the prediction of LUSI and CTSI on the outcomes of all unique patients. Five independent variables were selected as candidate predictors: age, P/F ratio, SOFA score, LUSI, and CTSI. As LUSI and CTSI are percentages of lung involvement and not strictly continuous variables, they were dichotomized to high involvement (≥50%) and low involvement, reflecting the ‘severe illness’ category in the National Institutes of Health guidelines for the management of COVID-19 (27). A univariate analysis was made for death, ICU stay >30 days, and their composite. A multivariate analysis was performed for the composite outcome.
Sample size. A previous study that correlated CT tissue density with LUS for ARDS found a strong correlation coefficient of 0.79 (28). Considering a two-sided α of 0.05 and a β of 0.05 this study would require a sample size of 14 to determine that the correlation coefficient differs from zero (29). Cases were collected until a sufficient sample for clinical evolution was also reached.