Trial population
The VALIDATE-SWEDEHEART (bivalirudin versus heparin in ST-Segment and Non-ST-Segment Elevation Myocardial Infarction Registry Trial) was a multicentre, controlled, registry-based randomized clinical trial comparing treatment effects between bivalirudin and heparin in conjunction with the PCI for AMI.6, 7 A composite of death from any-cause, AMI and bleeding at 180 days was the primary endpoint. Twenty-five of Sweden´s 29 PCI centers participated in the trial. Of the 12561 patients screened for participation, 6555 were not randomized and 6006 (48%) were randomized and referred to in this paper as the trial population (TP).
Screened not enrolled
Of the 6555 non-randomized patients, 150 had a missing treatment assignment in the registry and were excluded, leaving 6405 in the screened not-enrolled (SNE) population, a secondary population of interest in this study. Of those 6405, 3422 did not meet the inclusion criteria for various reasons: not able to give informed consent (22%), not treated with ticagrelor or prasugrel (21%), indicated as other reason for non-inclusion in the registry (19%), 5000 U of heparin before arrival or 3000 U in the lab before angiography (13%), ST-segment elevation myocardial infarction/non-ST-segment elevation myocardial infarction (STEMI/NSTEMI) (9%), reduced kidney function (5%), planned glycoprotein IIb/IIIa inhibitors (GP) (3%), life expectancy of one year or less (3%), contraindication for either heparin or bivalirudin in the trial (2%), continuous bleeding (2%), uncontrolled hypertension, thrombocytopenia, and endocarditis (0.9%), not being 18 years old (0.1%). Thus, 2983 remained as the screened not enrolled population with fulfilled inclusion criteria (SNEF). The reason for the SNEF not being randomized was not documented. The VALIDATE-SWEDEHEART trial was powered for evaluation of STEMI and NSTEMI separately with enrolment of an equal number of each type of ACS. Thus 3001 patients with NSTEMI and 3006 STEMI were enrolled. In the general SWEDEHEART population however, there were more NSTEMI then STEMI, which made the SNEF even more unbalanced when it came to the number of NSTEMI compared to STEMI. Additionally, some of the hospitals did not include patients during off-hours, i.e. night shifts or the weekends and some physicians did not participate in the study.
The study populations
The two patient populations were combined into a single dataset for further analysis, 6006 randomized in the VALIDATE trial and 2983 with known treatment assignment and satisfying inclusion criteria. One assumption, in order to assess the generalizability of trial results, was to remove those who had zero chance of being selected to participate in the trial.8 Therefore, those patients who did not meet the inclusion criteria were removed, leaving the (SNEF) (Figure 1).
In total, 8989 patients were included in this study. Due to the randomization, the TP was balanced on the observed covariates; the SNEF patients were not randomized to bivalirudin or heparin, thus treatment groups were not balanced with respect to the observed covariates (Table 1).
An additional analysis of all 12411 screened patients with known treatment assignments was also performed (Appendix).
Endpoints
In the VALIDATE study (TP), bleeding events were adjudicated and classified by an endpoint committee, but bleeding events were not adequately reported in the SWEDEHEART registry for the SNEF in the present study. Thus, an analysis with the composite endpoint of death, AMI or bleeding was not possible. We therefore used death at 30 and 180 days as the primary endpoint.
Methods
A baseline table, Kaplan-Meier curves, and log-rank test were produced to describe the study data (Table 1, Figures 2-4).
A propensity score method, using logistic regression with inclusion to VALIDATE as the outcome to generate scores, was used to evaluate population similarity with respect to prognostic factors between TP and the SNEF.8 This was done in a stratified analysis of STEMI and NSTEMI as well. Traditionally, propensity scores are used to mimic randomized trials with observational data through balancing observed covariates between treatment groups in observational studies.9 The lower the propensity-score difference, the more similar the populations. Standardized mean differences exceeding 0.25 were considered large and indicated dissimilarity between populations as defined by Stuart et al.8 Logistic regression modelled the probability of being selected in the VALIDATE trial with variables of interest. These covariates were potentially associated with differences in VALIDATE trial selection. Additional variables were included to ensure balance.10 The model included the following variables: STEMI/NSTEMI, age, sex, low weight, smoking, diabetes, hypertension, hyperlipidaemia, previous AMI, previous PCI, previous coronary artery bypass graft (CABG), previous stroke, renal failure, thrombectomy, ticagrelor before inclusion, clopidogrel before inclusion, cardiopulmonary resuscitation (CPR), puncture location, creatinine (umol/L), ACE inhibitors, Killip class and off-hours vs. regular hours (Appendix). Off-hours, signified the night or weekend shifts. Incidence proportions for off-hours and regular hours were calculated for the different groups and endpoints. Variables that had 3% or more missing were altered to include an extra category for missing; these variables were smoking, CPR, ACE inhibitors, Killip class and off-hours vs. regular hours.
Propensity scores were then used in an inverse-probability weighted (IPW) Cox regression analysis using the TP only to estimate the difference in mortality (DIM) as it would have been had the trial included the SFC.7 The point of the weights is to weight down patients who were very likely to be included as well as weight up those who had a very low probability of being in the trial. The patients with a very low probability of being in the trial were rare and most comparable to the non-randomized population11 in terms of the covariates included in the propensity score. A person with a high probability to be included in the trial who had a large propensity score received a weight equal to the inverse of their propensity score, a low weight. If the patient’s propensity score were 0.8, then their weight would be 1/0.8=1.25. A person who was very unlikely to be included in the trial, but was included anyway would receive a larger weight; a person with a propensity score of 0.2 would, therefore, have a weight of five. These weights were checked via trimmed and stabilized weighting.12
All the variables that were included in the IPW analysis were checked in separate Cox models as part of an interaction term with the treatment variable. A sensitivity analysis solely including the significant variables from this subgroup analysis were produced to see if the mean propensity score difference changed.
To investigate potential differences between the TP and the SNEF in terms of mortality, a Cox regression with a treatment x trial selection interaction term was applied. The inclusion of the interaction term helped produce hazard ratios (HR) between treatments for the SNEF and TP separately as well as the HR for the treatment effect in the SNEF divided by the corresponding HR in the TP. The interaction term also produced HR’s comparing mortality across treatments, i.e. heparin treatment in the SNEF compared to heparin treatment in the TP. The same was produced for bivalirudin vs. bivalirudin.
The analysis was adjusted for: STEMI/NSTEMI, age, gender, low weight, smoking, diabetes, hypertension, hyperlipidaemia, previous AMI, previous PCI, CABG, previous stroke, renal failure, thrombectomy, ticagrelor before trial, clopidogrel before trial, CPR, puncture location, creatinine (umo/L), ACE inhibitors, Killip class and regular vs. off-hours. We refer to this analysis as the adjusted Cox model (Tables 2-3).
These analyses were repeated as a sensitivity analysis with separate models for STEMI and NSTEMI populations to see if the treatment effects changed by type of myocardial infarction.
All analyses were performed in STATA/MP version 15. All tests were two-sided and the alpha-level was set to 0.05.