In prospectively-planned analyses of HCSD conducted after the results of RESTART were known, we found that HCSD had reasonable diagnostic accuracy when compared with adjudicated outcomes, and that HCSD alone gave an identical estimate of treatment effect on the primary outcome of recurrent ICH (which became statistically significant) and a similar estimate of treatment effect on the secondary outcome of MACE. This suggests that biases in the ascertainment of events were similar for the two methods.
Our study has strengths: we included participants from two UK nations in the only RCT of antiplatelet therapy after ICH. HCSD performed well despite the population being multimorbid with multiple cardiovascular diseases. Whereas previous studies in this area have compared HCSD to the trial outcome,(32) we undertook a thorough process of outcome adjudication, and were able to compare HCSD to a best-available outcome, to give a truer estimate of the sensitivity and specificity of HCSD.
The weaknesses of our study are that we were not able to analyze the HCSD for the full original trial population of our relatively small trial, due to administrative and governance issues, so did not link data for the 41 people living in Northern Ireland and Wales. The study design is susceptible to incorporation bias, which occurs when the reference standard incorporates the test under study. Our reference standard included HCSD with the adjudicated trial outcome, but this did not apply to the primary outcome and only affected the secondary outcome (we included three MACE outcomes identified by HCSD in the reference standard). Analyses may have been affected by unquantified migration of patients between and from UK nations, which would lead to an underestimate of the accuracy of HCSD, but we expect migration rates to be low in relatively elderly population. There is no agreement about how accurate HCSD need to be for use in RCT follow up, so our values for PPV and sensitivity must be interpreted in the context of similarities in treatment effects. Our reference standard is imperfect, because traditional trial outcome ascertainment with adjudication has limitations. It is reliant on reporting of events by site investigators; source documentation can be incomplete or inconsistent; and adjudicators often disagree, underlining the inherent subjectivity in the process.(33)
Our results are relevant to MACE in a secondary prevention RCT after ICH. There may be condition-specific issues with HCSD which reduce the generalizability of our results. In observational studies, definitions used for MACE in HCSD vary, prohibiting comparison, aggregation and replication of findings.(34) Further work is needed to comprehensively understand the accuracy of coding of cardiovascular disease outcomes in modern healthcare systems. Countries and regions have different systems and incentives for recording healthcare data which may bias results,(35) and our results only apply to England and Scotland.
Previous studies have shown that the result of a clinical trial obtained through HCSD may be closer to null than when obtained through traditional follow up methods, implying that larger sample sizes might be needed if planning follow up with HCSD.(16) More recently, a comparison of major bleeding events recorded as adjudicated outcomes in the ASCEND trial compared to HCSD found no clinically important differences in the relative risk with treatment of events depending on data source used.(36) The result we obtained for RESTART using HCSD for the primary outcome had an identical hazard ratio to the original trial, and for secondary outcomes the hazard ratio was similar. It may be that improvements in electronic healthcare records over the past decade have increased the accuracy of HCSD. The accuracy of recording of outcome events in HCSD for a study of bladder cancer in England improved substantially between 2011 and 2017, with remuneration policies likely driving the improvement in data quality.(37) Many of the studies of HCSD in cardiovascular disease were conducted years ago, such as the West of Scotland Coronary Prevention Study, which originally began recruiting in 1995.(32)
In the current study, HCSD identified both false positive and false negative events, compared to the best-available data. The relatively large number of false positives reduced the PPV of HCSD. Our experience suggests that re-admissions can often attract incorrect coding, leading to false positives. Future work could focus on how to identify these events without sacrificing completeness. Two primary outcome events and seven secondary outcome events were false negatives due to miscoding. Access to alternative sources of information, such as national audit data or primary care data, would allow triangulation of outcomes and potentially increase the accuracy of HCSD.
Our sample size was too small to explore whether HCSD missed mild events that did not result in hospital admission or death, but were detected by RESTART. These events could be detected with access to primary care records, but linking RCT data to primary care records is currently not possible in England due to data governance and data ownership constraints.(38) Widening the scope of access could improve the accuracy of HCSD for all clinical research.
HCSD is being used in RCTs to enhance data collection, as in the RECOVERY platform trial in Covid-19;(39) for very long-term follow up;(40) and as exclusive outcome follow up.(41, 42) Although digitally-enabled RCTs seem to offer benefits, researchers in the BladderPath RCT of bladder cancer treatment pathways found rapid access to outcomes from digital sources was “too cumbersome and expensive” in England for incorporation into the RCT design. The same authors were able to use HCSD for one-off long term follow up of another RCT.(43) The MRC Clinical Trials Unit has described the protracted process of applications for HCSD from NHS Digital and the National Institute for Cardiovascular Outcomes Research, which in the case of the PATCH trial took several years.(44)
In this study, we have demonstrated that HCSD has good accuracy for identifying cardiovascular outcome events and effect estimates that are comparable to adjudicated outcome events. A Delphi consensus has prioritized the remaining challenges for use of HCSD in RCTs.(45) Further work is needed to confirm the accuracy of HCSD for cardiovascular research in other healthcare systems, and to enable other types of informative HCSD to be efficiently incorporated into RCTs.