Descriptive summary of study characteristics
From the 1,955 articles identified in the literature search, screening of bibliographies (of the 27 articles found eligible for full-text screening in the literature search), and random search, 16 articles were included in the review (Fig. 1). A summary of the study characteristics of the included studies is presented in Additional File 1. Of these included articles, 10 were retrospective studies assessing either historical outbreaks or data of cases of gastrointestinal illness and data signals for early detection of waterborne outbreaks (18–27), and six were simulation studies evaluating the system performance of different SyS systems (28–32). The included studies originated from Sweden (n = 2), France (n = 5), the USA (n = 4), and the United Kingdom (n = 4), with one study assessing data from several European countries suggesting a common surveillance approach (25), covering an overall study period of 1997 to 2013, with multiple agents causing waterborne outbreaks or illness. All of the included studies were published in the period 2010 to 2018—except for one that was published in 2006 (21) (Additional File 1).
Among the excluded articles, the majority were data signal studies, including investigations of water quality data or disturbances in the distribution systems, in combination with other signals from the health sector (33–42). One study used web queries to estimate the burden of disease due to gastrointestinal illness related to pipe breaks (7), while another study assessed the relationship between precipitation and waterborne diseases (43). Common in these studies was the fact that, despite that they demonstrated promising correlations, they did not report on the experienced effectiveness or value of using the same signals in their surveillance system explicitly. The other excluded studies addressed SyS but in the context of describing or reviewing such systems in a general manner (44, 45) (Additional File 1).
Data synthesis
The data extracted from the included articles is synthesized in Table 1. When reported, the sensitivity of the SyS in the retrospective studies was below 50%, with one exception reporting a sensitivity of 89% (19). In the simulation studies, the sensitivity was reported above 70% when using different aberration adjustments.
Table 1
Synthesis of data from the included articles (n = 16)
Data signal
|
Reference
|
Timeliness
|
Sensitivity/Specificity
|
Pros
|
Cons
|
Over-the-counter (OTC) sales of pharmacy sales
|
Kirian et al., 2011
|
NI
|
Sensitivity: 4–14%, specificity: 97–100%.
|
It may capture symptoms in the population before a person with gastrointestinal illness seeks health care.
|
It does not necessarily indicate the buyer’s location, their demographic status, or the reason for the purchase. Those who purchase OTC medications for their illness may not be representative of the sick population as a whole. Hoarding behaviour will also affect the outcome.
|
Reimbursement of prescription drugs
|
Bounoure et al., 2011
|
NI
|
Reported sensitivity of 89% and specificity 89% to distinguish acute gastrointestinal cases based on drug sales.
|
Prescription drug data can be considered for the development of a detection system of waterborne outbreaks given its ability to describe an epidemic signal. It could support authorities in slow developing outbreaks.
|
The algorithm cannot be used directly in other countries because of their different health systems, types and sources of data, and medical practices. The accuracy depends on the medical consultation rate in the impacted population.
|
Mouly et al., 2016
|
NI
|
Sensitivity: 6% and 21% for two examined outbreaks.
|
Calls to health advice line (‘telehealth’)
|
Bjelkmar et al., 2017
|
~ 6 days
|
NI
|
It may capture symptoms in the population before seeking health care.
|
The alarm does not contain information regarding the cases’ medical status to validate the cause of the alarm. Moderate and low outbreaks (< 1000 cases) are unlikely to be detected. The detection ability varies seasonally. Telehealth may, in general, be driven by media bias.
|
Cooper et al., 2006
|
Unlikely to detect local outbreak
|
NI
|
Emergency care data; medical dispatch, ambulance medical service, emergency department chief complaints
|
Ziemann et al., 2014
|
NI
|
NI
|
This system could detect changes in local trends and clusters of statistical alarms.
|
It is not likely to detect local gastrointestinal outbreaks with few, mild, or dispersed cases. The probability of detecting an outbreak increases with the outbreak size. The results cannot be generalized to region-level data or very sparse time series.
|
Over-the-counter (OTC), web queries, calls to health advice line
|
Andersson et al., 2014
|
NI
|
Calls to health advice line: sensitivity: 40–50%, specificity: 99%, web queries and OTC: no signal.
|
SyS can serve as an early warning, especially with telephone triage data with sufficient temporal and spatial resolution. It may be suited to detecting widespread rises in syndromes and, rarely, small-scale outbreaks.
|
The alarm does not contain information on the cases’ medical status to validate the cause of the alarm. Moderate and low outbreaks (< 1000 cases) are unlikely to be detected.
|
Telehealth, in-hours and out-of-hours GP, ED visits
|
Smith et al., 2010
|
Peak of calls coincides with outbreak (95% CI) in one area
|
NI
|
Multiple syndromic data streams are an advantage.
|
Telehealth may, in general, be driven by media bias.
|
Combined health, spatial and environmental data
|
Coly et al., 2017
|
NI
|
Detected outbreaks < 100 cases.
|
Increases sensitivity and timely detection of waterborne outbreaks.
|
These systems are expensive in terms of resources and shared expertise in incorporating local knowledge regarding both environmental and health data.
|
Rambaud et al., 2016
|
NI
|
NI
|
Combining two complementary methods protects against false positives, e.g. confusion of cases stemming from exposure from other types of food or swimming, for example.
|
Pilot-study and not tested on a larger scale.
|
Multiple SyS and
environmental data
|
Burkom et al., 2011
|
NI
|
Sensitivity: 80%, specificity: 99%
|
Use of multiple syndromic data streams is an advantage. The number of false alarms is greatly reduced.
|
Simulation results must generally be improved with real epidemiological data.
|
Colón-Gonzales et al., 2018
|
Unlikely to detect outbreaks < 1000 cases
|
NI
|
Framework applicable for other SyS systems.
|
The detection ability varies seasonally.
|
Mouly et al., 2018
|
NI
|
Sensitivity: 73%, PPV: 90.5%
|
Space-time increases the likelihood of detecting outbreaks.
|
The probability of detecting outbreaks increases with the outbreak size.
|
Noufaily et al., 2012
|
NI
|
Unknown
|
Able to reduce the number of false alarms and, in some cases, increase the sensitivity.
|
Week-to-week variation in outbreaks may prove unhelpful.
|
Zhou et al., 2014
|
3.3 to 6.1 days
|
When reported, the sensitivity ranged from 24 to 77%, and the PPV was 90.5%.
|
Sensitivity and timeliness increase with stratification.
|
Study population perhaps not representative.
|
Xing et al., 2011
|
NI
|
Of the simulated models, the regression method had higher sensitivity (range 6–14% improvement of sensitivity in the surveillance system).
|
Demonstrates possible improvement in the surveillance system to increase sensitivity.
|
Simulations based on small number of data points.
|
Note: NI = not identified, PPV = positive predictive value
Some of the included studies addressed the same surveillance system but with different study purposes. In France, a national surveillance system based on administrative health data from the French National Health Insurance on the reimbursement of prescriptive drugs has been functioning since the late 1990s (19). The system contains information on the medications for gastrointestinal illness, which are reimbursable, prescribed by a general practitioners (GPs) and dispensed in a pharmacies covering approximately 98% of the French population. (24). All the included articles originating from French study data were related to this health administrative database.
In the UK, the SyS at Public Health England (PHE) is based on four National Health Service (NHS) healthcare settings: telehealth, in- and out-of-hours, unscheduled care general practitioner consultations, and emergency department (ED) attendances (30). This system has been examined, together with the of the Health Protection Agency (HPA) and QSurveillance, a national surveillance system set up by the University of Nottingham, and the Egton Medical Information System, which consists of a network of GPs (23). Meanwhile, Noufaily et al. assessed weekly counts of samples sent to the HPA. One of the studies in this review included an older version of the SyS in the UK (21).
In the US, several surveillance systems exist (39, 44, 45), and, in this review, we included publications addressing the Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE) (28). Additionally, two studies assessed the US Centers for Disease Control and Prevention BioSense surveillance system using emergency department chief complaint data (32) and daily syndrome counts from the outpatients of the U.S. Department of Veteran Affairs’ Veteran Health Administration (31). Moreover, both of the two included studies from Sweden addressed data signals from Swedish Health Care Direct 1177 (Vårdguiden 1177), along with signals such as web queries and over-the-counter pharmacy sales in one of the study (18, 22).
Single data signal SyS system
Five of the included studies addressed a single preclinical data signal for outbreak detection and gastrointestinal illness. Kirian et al. (2011) evaluated the ability of drug sales in predicting endemic and epidemic gastrointestinal disease in the San Francisco area and found no significant correlations between drug sales and illness case counts, outbreak counts, or the number of outbreak-associated cases and reported a low sensitivity (4–14%) and high specificity (97–100%) in the study (26).
In the UK study included in this review, calls made to the health helpline (NHS Direct) were assessed based on whether the number of calls about diarrhoea exceeded a statistical threshold (21). The authors of the study predicted a 4% chance of detection when assumed that one-twentieth of cryptosporidiosis cases telephoned the helpline, which rose to a 72% chance when assumed nine-tenths of cases telephoned. They concluded that NHS Direct was currently unlikely to detect an event similar to the cryptosporidiosis outbreak used in the study and may be most suited to detecting more widespread increases in symptoms (21).
Bjelkmar et al. (2017) extended on such a system for nurse health calls proposed by Andersson (18). The authors compared phone call patterns to the Swedish Health Care Direct 1177 during the outbreak in Skellefteå in different water distribution areas, suggesting that the systematic monitoring of phone calls made to health services could have limited the outbreak from 18,500 cases to approximately 2,300 cases by detecting the outbreak approximately six days earlier than actually detected (22).
Multiple data signal SyS systems
For establishing a national SyS system, Andersson et al. (18) evaluated the efficiency of alternate data sources for the early detection of nine investigated outbreaks in Sweden, including telephone triage, web-queries, and over-the-counter (OTC) pharmacy sales. The authors suggested that SyS can serve as an early warning of waterborne outbreaks, especially with telephone triage data with sufficient temporal and spatial resolution (40–50% sensitivity and 99% specificity); however, data was lacking for outbreaks of moderate size (300-1,000 cases) (18).
Smith et al. (23) evaluated the value of SyS in monitoring small waterborne outbreaks using data from a SyS system featuring a direct telephone helpline and QSurveillance national SyS using clinical diagnosis data extracted from the GP clinical information system (23). The authors reported that, for the first time, such a SyS system was helping to monitor a small-scale waterborne outbreak; however, the peaks of calls to the helpline observed may have been influenced by the media as a boil water advisory was issued during the outbreak (23).
Using routine emergency data based on an inventory of sub-national emergency data available in 12 European countries, Ziemann et al. (2013) proposed a framework of definitions for specific symptoms and a SyS system design applying cumulative sum and spatial-temporal cluster analyses for the detection of local gastrointestinal outbreaks in four countries. Based on the suggested system, the authors identified two gastrointestinal outbreaks in two countries, and 1 out of the 147 confirmed outbreaks in the studied countries was detected (25).
Combined SyS systems with environmental data
Two articles included in this review combined water quality data and information on supply zones in the SyS in France. A pilot study was conducted to assess the utility of using a health insurance database for the automated detection of waterborne outbreaks of acute gastroenteritis (27). Overall, 193 clusters were identified, with 10% of the municipalities involved in at least one cluster and less than 2% in several (27). To improve the detection of waterborne outbreaks, Coly et al. (24) developed an integrated approach to detect any study clusters of acute gastrointestinal infection in geographical areas with a homogeneous exposure to drinking water. They used data from the French SyS system, geographical and population data, and environmental data based and the application of a space-time detection method identified 11 potential waterborne disease outbreaks. The outbreaks identified were not investigated, but the risk factors of exposure were examined (24).
Method evaluations via simulations
Three of the included articles concerned simulations of SyS systems in the US. Burkom et al. (28) studied an integrated approach for the fusion of water quality data (e.g., faecal indicator bacteria, chlorine, pH, conductivity, and turbidity) with health monitoring data (ESSENCE) using probabilistic Bayesian networks. The simulations indicated a sensitivity of 80% and specificity of 99% for the symptoms “nausea/vomit” (28), however, further component simulations and the multidisciplinary development of realistic data scenarios would be needed (28). Xing et al. (32) compared timeliness of the SyS system using five regression models, and found that the sensitivity for ‘nausea and vomiting’ was calculated to approximately 55% (32). The simulations in the study of Xing et al (32) had a number of limitations, including a low number of data points.
Zhou et al. (31) examined the performance of the U.S. Centers for Disease Control and Prevention’s BioSense SyS system by injecting multi-day signals stochastically drawn from lognormal distributions into time series of aggregated daily visit counts for the outpatients at the Department of Veterans Affairs’ Veterans Health Administration (VHA) (31). The authors reported that, with a daily background alert rate of 1% and 2%, the sensitivities and timeliness in the SyS ranged from 24 to 77% and 3.3 to 6.1 days, respectively (31).
In the UK, two published studies presented measures to improve the method performance of national SyS systems. Noufaily et al. (46) reviewed the performance of aberrances among the weekly counts of isolates reported to the Health Protection Agency. By simulating different contrasting scenarios, the authors suggested several improvements related to the treatment of trends, seasonality, and the re-weighting of baselines. They claimed that the suggested system was able to reduce the number of false reports while retaining good power to detect genuine outbreaks. However, no explicit results regarding sensitivity and specificity related to detection of waterborne outbreaks were reported in the article (46).
Colón-Gonzales et al. (30) investigated how the characteristics of different outbreaks affected outbreak detection and the utility of SyS in detecting outbreaks using modelling and probability/statistics for two possible scenarios, including a localized outbreak of cryptosporidiosis. The authors reported that small gastrointestinal outbreaks (e.g., cryptosporidiosis) were unlikely to be detected unless the number of cases was over 1,000, with the detection of waterborne outbreaks varying by season (30). Multiple data streams (e.g., emergency attendance) are an advantage of influenza detection but not for outbreaks of cryptosporidiosis. However, the proposed framework of Colón-Gonzales et al. (2018) could, according to the authors, be applicable for the evaluation of any SyS system (30).
Mouly et al. 2018 (29) evaluated the performance of an algorithm using the French SyS system for waterborne outbreak detection through a simulation-based study using multivariate regression to identify the factors associated with outbreak detection. Almost three-quarters of the simulated outbreak were detected (sensitivity of 73%), and more than 9 out of the 10 detected signals corresponded to a waterborne outbreak (positive predictive value of 90.5%). The probability of was found to increase with the outbreak size (29).
Risk of bias and cumulative body of evidence
The risk of bias of the included studies was overall assessed to be moderate to serious (Additional File 1). Due to the heterogeneity of the articles included, the cumulative body of evidence was partly assessed using the PRECEPT framework. The evidence was graded as high due to the low risk of publication bias.