Model segmentation captures dynamical trends in COVID-19 datasets
Shelter-in-place orders (SIPOs) are premised on a hypothesis that if COVID-19 spreads primarily through interactions between infected and susceptible individuals, then reducing the number of these interactions will reduce the number of individuals that develop an infection. By 20 April 2020, at least 40 state governors had issued SIPOs, and a statistical analysis of data ac- quired for a 3-week period between 8 March and 17 April linked these policy decisions to a 44% reduction in the number of cumulative infections that followed policy implementation [14]. Thus, SIPO compliance appears to correlate with a reduction in COVID-19 cases, possibly through a reduction in population mobility. This prompts the question of whether a population mobility mechanism can be used to link dynamical features in COVID-19 time series data with SIPO dates. We cannot directly incorporate a reduction of population mobility into our compart- mentalized model because our assumption that accumulation or depletion rates are “reaction limited” eliminates their explicit spatial dependence. In our model, new COVID-19 infections emerge from interactions between susceptible and infected individuals, so we could achieve the effects of reduced population mobility by altering the accessibility of susceptible individuals to infected ones. As shown in the Supplementary Material, at a date in the time series associated with a SIPO, we partition a fraction, !, of the susceptible population at time t, St, into one or more subsets, (1 − !)&!, which remain inaccessible to infected individuals for the duration of the SIPO period. We note that this methodology lacks an associated period of compliance, which is at least as long as the COVID-19 latency [14].
We fit our deterministic model to new daily infection data using a Bayesian approach, which defines a probability distribution over the model parameters given daily new infection data. The distribution depends on the nonlinear form of our model and does not follow any standard form. Computationally, we employ a combination of maximum a posteriori (MAP) esti- mation and Markov chain Monte Carlo (MCMC) sampling. Our approach first identifies a set of model parameters that maximize the likelihood of observing the reported COVID-19 time series data and then explores the entire posterior distribution to characterize the uncertainty in the model parameters. Posterior samples of the model parameters get propagated through the model equations to characterize the model’s predictive uncertainty. This Bayesian approach re- quires the definition of a prior probability distribution for the model variables and a statistical error model for the difference between predictions and observations. We adapt a previously re- ported prior distributions for the model parameters [11], while using a combination of log-normal and uniform distributions to represent our prior knowledge of the initial conditions of the mod- eled subpopulations. We then make the common assumption that differences at time t between model predictions and data are normally distributed, with a constant variance that is estimated in a hierarchical Bayesian formulation alongside the model parameters.
We focus our model optimization on the last 28-day segment of time-series infection da- tasets for each of the three states we examined. As human behavior can be dynamic, the model fit to these truncated data provide a picture of the current disposition of a state’s residents in relation to SIPOs. This can be seen in our fits for Massachusetts, New Jersey, and Washington, in which the deterministic model was fit to time-series data from 22 January 2020 to 31 May 2020 (Figure 1). Here, black circles illustrate the new daily reported infections, whereas the red curves illustrate the forecast associated with the median of the ensemble trajectories identified through our Bayesian curve-fitting method. Finally, the blue squares illustrate the daily reported infec- tions since the model’s training, from 01 June 2020 to 22 June 2020. Massachusetts and New Jersey have tracked well with the model’s predictions, while Washington has seen a positive trend in new cases since the beginning of June not predicted by the model. This suggests either a change in SIPOs policies in Washington, such as the introduction of the phased returns that began 01 June 2020 [15], or changes in adherence and population behavior, resulting in out- breaks such as those in agricultural businesses and long-term care facilities such as those seen in eastern Washington [16], driving the increase. This highlights the primary limitation of the model: the resulting forecasts are only applicable given a continuance of SIPOs policy and social behaviors.
Linking policy decisions to additional infection waves
Model fits to new daily case data are greatly improved if we sequester a fraction of sus- ceptible individuals to limit access to infected ones (Fig. 1), but clearly these individuals do not indefinitely remain immobile. If a goal of SIPOs is to minimize the extent and severity of COVID- 19 infections, then it seems reasonable to lift them once infections fall below an acceptable level. What is this level and what happens if SIPOs are removed too early or ignored, as we have seen in Washington and other states?
The Spanish Flu pandemic of 1918-1919 offers some insight, specifically in that it pre- sented as three successive waves of infections across the world [17, 18]. Mathematical modeling of this disease at a population level suggests that its three infectious waves were rooted in the time dependence of its population-averaged transmission rate [19]. Social factors, such as changes in population mobility, in addition to others, can facilitate these trends. However, the Spanish Flu pandemic is thought to have been affected by antigenic drift, which could also con- tribute changing immunity and transmission rates [20, 21]. A more general survey of mechanisms that might reproduce multiple infection waves for influenza pandemics suggests that a time-de- pendent transmission rate is sufficient to fit long-term infection trends with low error [22]. This study also suggested that only very strong intervention (i.e., approx. 90% reduction in initial sus- ceptibility) eliminated the appearance of second waves. Although the molecular details of these viruses differ, these studies suggest that concepts such as a time-dependent transmissibility and manipulation of susceptible populations are general enough to describe how multiple waves of COVID-19 infections could evolve in the future.
We demonstrate a hypothetical second wave scenario for Massachusetts, New Jersey, and Washington in Figure 2. A fraction of the population (70%, in this example) is reserved in isolation early in the simulation, then released on 15 July 2020 when the SIPOs are relaxed in these scenarios. We then forecast through the end of November for each state. States then see the peak of the second wave in mid-October to mid-November 2020. Note that, without the re- implementation of social distancing, the second waves are more severe than the first, reaching over twice the number of active cases in the peak days compared to the first wave (Figure 2, black). We also test the scenario in which states stand up SIPOs starting 2 weeks into the second wave, upheld through the end of the simulation. These policies result in a nearly-complete atten- uation of the second wave (Figure 2, red). This result highlights the necessity to analyze current policies and prepare populations to socially distance themselves as quickly as possible once an- other wave is detected.
The effective reproductive number, Re, is a metric of the transmissibility of the virus; vi- ruses with Re greater than 1 are likely to spread while those with R0 less than 1 are likely to die. An early review of Re estimates for China found the average to be 3.28, with a median of 2.79 [23]. We have implemented a “next-generation” matrix method to estimate the R0 for each day [11, 24]. This allows us to dynamically track the Re and determine how changes in policy affect the transmissibility of the virus in state level populations.
We demonstrate in Figure 3 the daily changes in the Re for the completely open and shel- tered-in-place second wave scenarios for the three states. Initially, the Re ranges between 2-3.5 before dropping due to changes in social distancing policy and behavior in the model. These then gradually decrease until Re is less than 1. If current conditions are maintained, Re=1 quantifies a threshold below which the population will eventually return to a disease-free state. Once the SIPO is lifted on 15 July 2020, the Re increases back to approximately 1-2 due to an influx of people from isolation, peaking in advance of secondary infection waves. These Re values fall to below 1 again in the “open state” scenario, trending monotonically down throughout the extent of the second wave. However, reimplementation of the SIPO 2 weeks after the second wave begins, sees Re values fall back below 1 where they persist, hearkening back to the attenuation of the curve we demonstrated in Figure 2. This suggests that observation of a dynamic Re could be used to monitor current and future waves of infections and to justify changes in state policy and social behavior to mitigate public health impacts.