Despite the solid evidence that screening can decrease colorectal cancer (CRC) incidence and CRC mortality [1, 2], and the availability of multiple screening test options with a range of attributes [3], screening for CRC and its precursors remains underutilized around the world among persons who are eligible for screening. Research and development of novel tests to screen for CRC and its precursors is a dynamic field [4–8], fueled by the goals to improve the sensitivity and specificity of available tests, and to address patient concerns over the attributes of existing tests (e.g. the stool handling that is required by fecal test-based strategies, or the invasive nature of colonoscopy and the required colon cleansing before colonoscopy), and by the commercial opportunities for industry in the large CRC screening market. From the public health perspective, novel tests should ideally deliver improved outcomes at acceptable incremental costs and ease of use.
The recently published update to the World Endoscopy Organization (WEO) CRC Screening Committee Expert Working Group (EWG) guiding principles on the evaluation of novel non-invasive screening tests proposes a four-phased approach, and considers where decision analytic modeling may be useful [9]. Phases I and II consist, respectively, of small studies assessing the test’s ability to discriminate between CRC and non-cancer states, and the test’s accuracy across the continuum of neoplastic lesions in neoplasia-enriched populations. Progress to Phase III requires promising results in Phases I and II. Phase III consists of prospective studies to determine single round intention-to-screen outcomes, and confirmation of the test positivity threshold. The ambitious Phase IV involves evaluation over repeated screening rounds with monitoring for missed lesions. The consensus document states that modeling studies that mimic randomized controlled trials and are based on high-quality observational data can be informative, but there is no explicit recommendation on how they might be incorporated in the early phases of test development.
The rich published literature on the cost-effectiveness of CRC screening typically focuses on screening tests with solidly ascertained test performance characteristics, usually in randomized controlled trials or large prospective studies (i.e. usually Phase III studies) [10]. While Phases III and IV provide the most reliable data to model test impact on CRC incidence and CRC mortality in scenarios that are not addressed directly in the real-world trials, exploratory modeling can contribute to research and development efforts at earlier stages in the development of novel tests.
During the planning and preparation of the recent WEO consensus document, one of its lead authors, Graeme Young, asked whether early-stage proxies/surrogates for long-term programmatic effectiveness and cost-effectiveness could serve as a form of rudimentary early-stage modeling to inform test development. The aim of developing such early-stage proxies/surrogates would be to incorporate a quantitative estimation of the potential clinical and economic impact of emerging tests during the early phases of test development (Phases 1 and II), as opposed to only during later phases.
Preliminary exploration of two potential surrogates, the number needed to colonoscope to detect one CRC or advanced precancerous lesion (APL), and the cost to detect one CRC or APL in one (the initial) round of screening, was presented by one of the authors (UL) at the WEO CRC Screening Committee 2022 meeting in San Diego. During the discussion, another author (ILV) asked whether proxies/surrogates were necessary if comprehensive modeling could be undertaken instead. The aim of the current paper is to explore these two related questions.
The paper consists of two distinct parts. First, we explored two potential early-stage surrogates of effectiveness and cost-effectiveness by comparing them to the long-term predictions of four established decision analytic models of CRC screening in the US population (MOSAIC [11], and three models of the CISNET consortium, MISCAN, CRC-SPIN and SIMCRC [12]). In order to provide the greatest transparency, we used recently published estimates from the four models in two publications [11, 12] that explored the potential effectiveness and cost-effectiveness of novel blood-based biomarkers of colorectal neoplasia. These publications focused on the potential long-term effectiveness and cost-effectiveness of a hypothetical test that meets the minimum thresholds set by the Centers for Medicare and Medicaid Services (CMS) of CRC sensitivity ≥74% and specificity ≥90%, but they also examined other existing or hypothetical tests [13]. In the current study, to reflect clinical effectiveness, we compared the models’ published predictions for long-term CRC mortality reduction to the newly calculated number needed to colonoscope (NNC) to detect one CRC or one APL in round 1 of screening for a series of tests, based on test performance characteristics. To reflect cost-effectiveness, we compared to the models’ published predictions for long-term cost-effectiveness to the newly calculated cost to detect one CRC or one APL in round 1 of screening for the series of tests, based on test performance characteristics and associated costs. The model’s results for discounted cost/quality-adjusted life-year [QALY] gained vs. no screening were used as the measure of cost-effectiveness.
Second, as a first exploration of the feasibility of performing comprehensive modeling during early stages of test development using existing tools, we performed first-pass modeling with a web-based tool of the EU-TOPIA project (https://eu-topia.org) [14], which is based on MISCAN, and compared the results to those recently published for a model that was adapted from the previous version of MOSAIC [15]. The case study for this exercise was the potential effectiveness of screening with annual fecal immunochemical testing (FIT) with various starting and stopping ages in Israel, with a focus on the potential incremental benefit of lowering the screening initiation age from 50 to 45 years. Given that we were interested in the feasibility of two different methods (i.e. use of surrogate markers or rapid modeling), it was not necessary for the two parts of the current study to be based on the same scenarios. Time accounting for modeling with the EU-TOPIA tool was performed in order to reflect the potential effort required to carry out exploratory modeling.