medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Quantifying the Benefits of Targeting for Pandemic Response Sergio Cameloa, Dragos F. Ciocanb, Dan A. Iancub,c, Xavier S. Warnesc, and Spyros I. Zoumpoulisd aInstitute for Computational and Mathematical Engineering, Stanford University bTechnology and Operations Management, INSEAD cOperations, Information & Technology, of Business, Stanford University dDecision Sciences, INSEAD

ABSTRACT

To respond to pandemics such as COVID-19, policy makers have relied on interventions that target specific age groups or activities. Such targeting is potentially contentious, so rigorously quantifying its benefits and downsides is critical for designing effective and equitable pandemic control policies. We propose a flexible modeling framework and algorithms to compute optimally targeted interventions that coordinate across two dimensions of heterogeneity: age of different groups and the specific activities that individuals engage in during the course of a day. We showcase a complete implementation in a case study focused on the COVID-19 epidemic in the Île-de-France region of France, based on hospitalization, community mobility, social contacts and economic data. We find that optimized dual-targeted policies generate substantial complementarities that lead to Pareto improvements, reducing the number of deaths and the economic losses overall and reducing the time in confinement for each age group, compared to less targeted interventions. These policies have a simple and explainable structure. Since dual-targeted policies could lead to increased discrepancies in the confinements faced by distinct groups, we also quantify the impact of requirements that explicitly limit such disparities, and find that satisfactory trade-offs may be achievable through limited targeting.

Keywords: Pandemic management, Confinement, Targeted interventions, Optimization, COVID-19

Introduction The COVID-19 pandemic has forced policy makers worldwide to rely on a range of large-scale population confinement measures in an effort to contain the disease spread. In determining these measures, a key recognition has been that substantial differences exist in the health and economic impact produced by different individuals engaged in distinct activities. Targeting confinements to account for such heterogeneity could be an important lever to mitigate a pandemic’s impact, but could also lead to potentially contentious and discriminatory measures. This work is aimed at developing a rigorous framework to quantify the benefits and downsides of such targeted interventions, and applying it to the COVID-19 pandemic as a real-world case study. One real-world contentious example of targeting has been to differentiate confinements based on age groups, e.g., sheltering older individuals who might face higher health risks if infected, or restricting younger groups who might create higher infection risks. Such measures, focusing on confinements or other interventions, and targeting age or other population characteristics, have been studied in the literature (1; 2; 3; 4; 5; 6; 7; 8; 9; 10) and implemented in several settings – e.g., with stricter confinements applied to older groups in Finland (11), Ireland (12), Israel (13) and Moscow (14), or curfews applied to children and youth in Bosnia and Herzegovina (15) and Turkey (16) – but some of the measures were eventually deemed unconstitutional and overturned (13; 15). A different example of targeting has stemmed from the recognition that different activities (more specifically, population interactions in locations of certain activities), such as work, schooling, transport, leisure, result in significantly different patterns of social contacts and new infections. This has been shown to be critical when modeling pandemic spread (17; 18; 19) and has been recognized in numerous implementations that differentially confine various activities (e.g., closures of schools, workplaces, recreation venues, etc.), and even some that differentiate based on both age groups and activities (e.g., dedicated hours when only the senior population was allowed to shop at supermarkets (20)). As these examples suggest, targeted confinements have merits but also pose potentially significant downsides. On the one hand, targeting can generate improvements in both health and economic outcomes, giving policy makers an improved lever when navigating difficult trade-offs. Additionally, explicitly considering multiple dimensions of targeting simultaneously – activities and age groups – could overturn some of the prevailing insight that specific age groups should uniformly face stricter confinements. However, such granular policies are more difficult to implement, and could lead to discriminatory and potentially unfair measures. Given that some amount of targeting of activities and age groups is already in place in existing real-world

NOTE: This preprint reports new research that has not been certified by peer review and should not be used to guide clinical practice. medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

policy implementations, it seems critical to quantify the relative merits of a policy that (i) targets both age-based population groups and activities, and (ii) identifies optimal interventions. We propose a rigorous modeling framework and develop a set of associated algorithms that compute optimally targeted interventions that target both age groups and activities. The framework provides quantifiable answers to the following natural research questions: How large are the health and economic benefits of dual-targeted confinements? Would dual targeting lead to significant synergies, and why? Could dual targeting reduce time in confinement for every age group? What is the relationship between the effectiveness and the level of targeting allowed across distinct groups? We showcase a proof of concept for our framework through a case study calibrated on Île-de-France data – a region of France encompassing Paris with a population of approximately 12 million. The implementation, publicly available at http://insead.arnia.ro, is flexible and portable to other geographies.

Methods Our framework relies on a flexible model that captures several important real-world considerations. We extend a version of the discretized SEIR (Susceptible-Exposed-Infectious-Recovered) epidemiological model (18; 21; 22) with multiple population groups that interact with each other (SI §2). We augment the model with controls that target based on (i) age groups, and (ii) types of activities that individuals engage in. Different policy interventions can be embedded as controls: we focus on time-dependent, targeted confinements, but extend the model to mass testing in the SI; vaccinations can also be accommodated. Interventions modulate the rate of social contacts and the economic value generated, and the objective of the control problem is to minimize a combination of health and economic losses caused by deaths, illness, and activity restrictions. The model captures important resource constraints (such as hospital and ICU), and allows explicitly controlling the amount of targeting through “limited disparity” constraints that limit the difference in the extent of confinement imposed on distinct age groups. Epidemiological Model and Controls. We segment the population by age into nine groups g G ; the youngest group g 2 captures individuals with age 0-9 and the oldest those aged 80 or above. For each g, the compartmental model includes states for susceptible, exposed, infectious, quarantined infectious, recovered, and deceased individuals. We also reserve separate states for individuals who are hospitalized due to being infected, in either general hospital wards or in intensive care units (ICU). We use T to denote the time horizon of the control problem, and X t to denote the entire vector of epidemiological states at time 0 t T.   Individuals interact in activities belonging to the set A = work,transport,leisure,school,home,other . These interac- { } tions generate social contacts which drive the rate of new infections. We control the SEIR dynamics by adjusting the confinement intensity in each group-activity pair over time: we let a `g(t) [0,1] denote the activity level allowed for group g and activity a at time t, expressed as a fraction of the activity level 2 a under normal course of life (no confinement). We denote `g(t)=[`g(t)]a A , ut =[`g(t)]g G and ut:t =[ut ,...,ut ]. 2 2 0 0 We propose a parametric model to map activity levels to social contacts. We use cg,h(`g,`h) to denote the mean number of total daily contacts between an individual in group g and individuals in group h across all activities when their activity levels are `g,`h, respectively. Varying the activity levels changes the social contacts according to

a a a1 a a2 cg,h(`g,`h)= Â Cg,h (`g) (`h) , (1) a A · · 2 a where Cg,h denote the mean number of daily contacts in activity a under normal course (i.e., without confinement), and a a1,a2 R are parameters. We retrieve values for C from the data tool (23), which is based on the French social contact 2 g,h survey data in (24), and we estimate a1,a2 from health outcome data (25) and Google mobility data (26). When the number of patients requiring hospitalization or ICU treatment exceeds the respective capacity of available beds, we assume that patients are turned away from each age group according to a proportional rule.1 Objective. Our objective captures two criteria. The first quantifies the total deaths directly attributable to the pandemic, which we denote by Total Deaths(u0:T 1) to reflect the dependency on the specific confinement policy u0:T 1 followed. The second captures the economic losses due to the pandemic, denoted by Economic Loss(u0:T 1) and stemming from work restrictions of activity (SI §2). For example, lowering `30 39 y.o.(t) causes lost in g = 30 39 y.o., while lowering `school (t) causes lost schooling costs for g = 10 19 y.o. 10 19 y.o. To allow policy makers to weight the importance of the two criteria, we associate a cost c to each death, which we express in multiples of French GDP per capita. Our framework can capture a multitude of policy preferences by considering a wide range of c values, from completely prioritizing economic losses (c = 0) to completely prioritizing deaths (c •). ! 1Our framework allows implementing any turn-away rule or even optimizing over these decisions.

2/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Optimization Problem. We seek control policies u0:T 1 that minimize the sum of economic and mortality costs. More formally, we solve:

minu0:T 1 Economic Loss(u0:T 1)+c Total Deaths(u0:T 1) (2) · subject to constraints⇥ that (i) the state trajectory follows the SEIR dynamics,⇤ and (ii) the controls and states do not exceed the available capacities of ICU beds and hospital beds. Re-Optimization with Linearized Dynamics — ROLD. Minimizing (2) is challenging to solve to optimality even for small time horizons due to the non-linear and non-convex nature of the SEIR dynamics. To overcome this, we design a model predictive control (27) approximate algorithm that remains tractable for realistic time horizons and problem sizes. The algorithm proceeds sequentially over the time horizon. When the SEIR state is X t at time t, it builds an approximation of the SEIR dynamics from t to T that is linear in the controls ut:T 1. This linearity allows us to find the best control ut⇤:T 1 with respect to these linearized dynamics. The algorithm uses ut⇤ as the intervention in the real SEIR system to advance to state X t+1, and iterates by updating the linearization around the new state. Re-solving at each time t is aimed to prevent the linearization errors from growing too large (details in SI §3). Parametrization and Model Calibration. We adopt the confidence regions for SEIR parameters reported in (22) for the Île-de-France region, which we complement with Google mobility data to approximate the mean effective lockdowns for activities during the horizon of interest. We estimate values of all parameters by comparing the simulated SEIR model against real data on hospital and ICU utilization and deaths. We calibrate our economic model using data from France (and where available Île-de-France) on full time equivalent wages and rates, and sentiment surveys on business activity levels during confinement. Experimental Setup. We use a horizon of T = 90 in the experiments reported in the main paper, and allow up to T = 360 days in additional experiments reported in SI. We allow the confinement decisions to change every two weeks. We use a capacity of 2900 beds for ICU in Île-de-France, and infinite beds for general hospital wards. We consider several ROLD policies that differ in the level of targeting allowed, which we compare over a wide range of values for c, from 0 to 1000 the annual GDP per ⇥ capita in France2. In SI §7 we also discuss the additional benefits of targeted administration of viral tests across age groups. We provide all the details for the ROLD heuristic, calibration, parameter specification, and experimental setup in the SI, where we also report results from sensitivity and robustness analyses on the fitted parameters.

Results We use this framework to address our main research questions. How large are the gains from dual targeting? To isolate the benefits of each type of targeting, we compare four versions of ROLD that differ in the level of targeting allowed: no targeting whatsoever (“NO-TARGET”), targeting age groups only (“AGE”), activities only (“ACT”), or both (“AGE-ACT”, or simply “ROLD” when no confusion can arise). Figure 1a records each policy’s performance in several problem instances parameterized by the cost of death c. A striking feature is that each of the targeted policies actually Pareto-dominates the NO-TARGET policy, and the improvements are significant: relative to NO-TARGET and for same number of deaths, economic losses are reduced by EUR 0-2.9B (0%-35.9%) in AGE, by EUR 0.4B-2.1B (4.5%-49.8%) in ACT, and by EUR 3.3B-5.3B (35.7%-80.0%) in AGE-ACT. This Pareto-dominance is unexpected, since it is not a property that we explicitly ask for in our optimization procedure, and it underlines that any form of targeting can lead to significant improvements in terms of both health and economic outcomes. When comparing the different types of targeting, neither AGE nor ACT Pareto-dominate each other, and neither policy dominates in terms of the total loss objective (SI Figure S3). In contrast and crucially, AGE-ACT Pareto-dominates all other policies, and moreover leads to super-additive improvements in almost all cases: for the same number of deaths, AGE- ACT reduces economic losses by more than AGE and ACT added together (SI Figure S4). This suggests that substantial complementarities may be unlocked through targeting both age groups and activities, which may not be available under less granular targeting. To confirm the significance of these gains, we also compare ROLD AGE-ACT with various practical benchmark policies in Figure 1b. Benchmarks ICU-t and Hybrid-t AND/Hybrid-t OR mimic implementations in the U.S. Austin area (28) and, respectively, France (29). These policies switch between a stricter and a relaxed confinement level based on conditions related to hospital occupancy and the rate of new infections (SI §5). We also consider two extreme benchmarks corresponding to enforcing “full confinement” (FC) or remaining “fully open” (FO); these can be expected to perform well when completely prioritizing one of the two metrics of interest, with FC minimizing the number of deaths and FO ensuring low economic losses.

2We quantify the cost of death c as a multiple of the annual GDP per capita in France, and use the shorthand notation n to denote a value of n times this ⇥ annual GDP per capita.

3/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

ROLD Pareto-dominates all these benchmarks, decreasing economic losses by EUR 5.3B-16.9B (71.0%-82.6%) relative to Hybrid-t AND, by EUR 7.1B-11.6B (62.2%-82.8%) relative to Hybrid-t OR, and by EUR 5.4B-11.6B (62.2%-78.0%) relative to ICU-t for the same number of deaths. Additionally, ROLD meets or exceeds the performance of the two extreme policies: for a sufficiently large c, ROLD exactly recovers the FC policy, resulting in 890 deaths and economic losses of EUR 27.6B; for a sufficiently low c, ROLD actually Pareto-dominates the FO policy, reducing the number of deaths by 16,688 (76.7%) and reducing economic losses by EUR 1.6B (65.3%). The latter result, which may seem surprising, is driven by the natural premise captured in our model that deaths and illness generate economic loss because of lost productivity; thus, a smart sequence of confinement decisions can actually improve the economic loss relative to FO. Among all the policies we tested, ROLD AGE-ACT was the only one capable of Pareto-dominating the FO benchmark, confirming that dual targeting is critical and powerful. The Pareto-dominance of ROLD AGE-ACT implies its dominance in terms of the total loss objective (SI Figure S3). These results are robust under more problem instances (SI Figure S5).

(a) Comparison between four ROLD policies with different levels of (b) Comparison between ROLD with dual targeting and benchmark targeting policies

Figure 1. The total number of deaths and the economic losses generated by ROLD policies with different levels of targeting and by the benchmark policies. Panel (a) compares the four versions of ROLD that differ in the level of targeting allowed. Panel (b) compares the ROLD policy that targets age groups and activities with the benchmark policies. Each marker corresponds to a different problem instance parametrized by the cost of death c. We include 128 distinct values of c from 0 to 990 , and panel (b) also includes a very large value (c = 1016 ). ⇥ ⇥ How do gains arise from dual targeting? We examine the structure of the optimal ROLD AGE-ACT confinement decisions. We focus our discussion on the value c = 50 , which is in the mid-range of estimates used in the economics literature on ⇥ COVID-19 (30) and is representative of the behavior we observe across all experiments (SI §7). Figure 2a visualizes the optimized confinement policy. Generally, the ROLD policy maintains high activity levels for those groups with a high ratio of marginal economic value to total social contacts in the activity, i.e., a high a dvg(ut )/d`g(t) “econ-to-contacts-ratio” := a , Âh G Ch,g 2 where vg(ut ) is the economic value created by an individual in age group g when activity levels are ut . For example, in work, ROLD completely opens up the 40-69 y.o. groups, while confining the 20-39 y.o. groups during the first two weeks and the 10-19 y.o. groups for the first ten weeks. This is explainable since the 40-69 y.o. age groups produce the highest econ-to-contacts-ratio in work, while the younger groups have progressively lower ratios. Similarly, ROLD prioritizes activity in transport, then other, then leisure, in accordance with the relative econ-to-contacts-ratio of these activities. To confirm the robustness of this insight, we also conduct a more thorough study where we compute optimal ROLD policies for several problem instances, and then train regression decision trees to predict the optimal ROLD activity levels as a function of several features (SI §7). The results are captured in Figure 2b and SI Figure S10. These simple trees can predict the optimal ROLD activity levels quite well (with root MSE values in the range 0.10-0.22), and they confirm our core insight that the econ-to-contacts ratio is the most salient feature when targeting confinements, as it is used as a split variable in the root node of each tree, with higher ratios leading to higher activity levels in all activities considered. To understand how complementarities arise in this context, note that the ability to separately target age groups and activities allows the ROLD policy to fully exploit the fact that distinct age groups may be responsible for the largest econ-to-contacts

4/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

ratio in different activities. As an example, the 20-69 y.o. groups have the highest ratio in work, whereas the 0-19 y.o. and 70+ y.o. groups have the highest ratio in leisure. ROLD coordinates confinements to account for this: groups 20-69 y.o. remain more open in work but face confinement in leisure for the first ten weeks, whereas the 10-19 y.o. group is confined in work for a long period while remaining open in leisure, as remain the 70+ y.o. groups. These complementary confinement schedules allow ROLD to reduce both the number of deaths and economic losses, with the added benefit that no age group is completely confined.

samples = 9,240 activity level = 0.00% samples = 1,575 activity level = 8.25% Decision tree for "work" RMSE: 0.18439 econ-to-contacts-ratio <= 60.6 time <= 63.0 samples = 630 samples = 11,445 samples = 2,205 True activity level = 100.00% activity level = 6.64% activity level = 34.46% econ-to-contacts-ratio <= 176.9 samples = 27,720 samples = 2,010 activity level = 55.85% time <= 21.0 econ-to-contacts-ratio <= 765.4 activity level = 43.45% False samples = 16,275 samples = 4,650 activity level = 90.45% activity level = 69.48% samples = 2,640 activity level = 89.31%

econ-to-contacts-ratio <= 305.9 samples = 625 samples = 11,625 activity level = 81.60% activity level = 98.83%

samples = 11,000 activity level = 99.81%

samples = 7,875 activity level = 0.00% samples = 1,520 activity level = 5.71% Decision tree for "leisure" RMSE: 0.21767 econ-to-contacts-ratio <= 51.4 time <= 49.0 samples = 1,140 samples = 10,535 samples = 2,660 True activity level = 86.79% activity level = 10.22% activity level = 40.46% econ-to-contacts-ratio <= 113.3 samples = 27,720 samples = 1,640 activity level = 56.67% time <= 21.0 econ-to-contacts-ratio <= 226.5 activity level = 23.87% False samples = 17,185 samples = 4,910 activity level = 85.14% activity level = 60.86% samples = 3,270 activity level = 79.41%

econ-to-contacts-ratio <= 213.8 samples = 3,475 samples = 12,275 activity level = 82.51% activity level = 94.86%

samples = 8,800 activity level = 99.74%

Decision tree for "other" RMSE: 0.13489 samples = 2,250 activity level = 89.86% True samples = 7,875 activity level = 0.00% econ-to-contacts-ratio <= 158.4 χ <= 20.5 samples = 27,720 samples = 2,835 samples = 585 activity level = 68.25% time <= 7.0 activity level = 77.33% activity level = 29.12% False samples = 19,845 activity level = 95.33% time <= 21.0 samples = 2,835 samples = 17,010 activity level = 91.69% activity level = 98.33%

samples = 14,175 activity level = 99.66% (a) Optimized ROLD confinement policy and its estimated impact (b) Decision trees for work, leisure and other activities

Figure 2. The optimized ROLD AGE-ACT policies for problem instances with a 90-day optimization horizon starting on October 21, 2020 (see SI for optimized ROLD policies with a 360-day optimization horizon). Figure (a) corresponds to a problem instance where the cost of death c is 50 . The seven panels depict the time evolution for the occupation of hospital ⇥ and ICU beds (top panel), the number of actively infectious individuals and the cumulative number of deceased individuals in the population (panel 2), and the confinement policy imposed by ROLD in each age group and activity (panels 3-7). In panels 3-7, the values correspond to the activity levels allowed for the respective age group, and are color-coded so that darker shades capture a stricter confinement. Figure (b) depicts decision trees approximating the optimized ROLD confinement decisions for work, leisure and other (trained with 27,720 samples), with a horizon of T = 90 days. Each node in the tree records several pieces of information: a logical condition based on which all the samples in the node are split, with the upper sub-tree corresponding to the logical condition being true (e.g., “econ-to-contacts-ratio 176.9” for the root node in the work tree), the  number of training samples falling in the node (“samples”), and the average activity level for all the samples in the node. The nodes are color-coded based on the activity level, with darker colors corresponding to stricter confinement.

Can dual targeting reduce time in confinement for each age group? We calculate the fraction of time spent by each age group in confinement under each ROLD policy, averaged over the activities relevant to that age group (SI §7). The results are visualized in Figure 3, which depicts boxplots for the fractions of time in confinement across all problem instances parameterized by c. We find that the dual-targeted AGE-ACT policy is able to reduce the confinement time quite systematically for every age group, relative to all other policies. Specifically, it results in the lowest confinement time for every age group in 70% of all problem instances when compared with NO-TARGET, in 60% of instances when compared with AGE, in 83% of instances when compared with ACT, and in 50% of instances when compared with all other policies. Moreover, the fraction of confinement time achieved by AGE-ACT is within 5% (in absolute terms) from the lowest confinement time achieved by any policy for every age group, in 76% of all instances; within 10% in 80% of the instances; and within 14% in all instances.

5/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Thus, even when the dual-targeted policy confines certain age groups more, it does not do so by much. These outcomes are quite unexpected as they are not something that the ROLD framework explicitly optimizes for, but rather a by-product of a dual-targeted confinement policy that minimizes the total loss objective (2). It is worth noting that although ROLD AGE-ACT reduces confinements for every age group compared to less targeted policies, it does not do so uniformly. Instead, it can lead to a larger discrepancy in the confinements faced by different age groups: those aged 10-59 are generally more confined than those aged 0-9 or 60+.

Figure 3. Average time in confinement for the ROLD policies with different targeting types. Each boxplot depicts the fraction of time the age group spends in confinement under the respective policy averaged over the activities relevant to that age-group, for different problem instances parameterized by the cost of death c.

The impact of limited disparity requirements. That targeted policies confine some age groups more than others could be perceived as disparate or unfair treatment, so it is important to quantify how an intervention’s effectiveness is impacted when requiring less differentiation across age groups. We embed a set of “limited disparity” constraints in ROLD that allow the activity levels of distinct age groups to differ by at most D in absolute terms, in each activity and at any point of time (SI §7). D = 0 corresponds to a strictly non-discriminatory policy, whereas a larger value of D allows more targeting, with D = 1 corresponding to a fully targeted policy. For every value of D, we record the total loss incurred by a ROLD policy with the limited disparity constraints, and calculate the increase in total loss relative to a fully targeted ROLD policy. We repeat the experiment for different problem instances parametrized by c, and Figure 4 depicts boxplots of all the relative increases in total loss, as a function of D. The results suggest that limited disparity requirements may be costly: on average, completely eliminating disparity in confinements would increase the total losses by EUR 1.2B (21.6%) and produce an additional 506 deaths (16.6%) and an extra EUR 0.5B of economic losses (18.9%) compared to a fully targeted policy. In certain instances, the increase in total loss could be as high as 63%. The high losses persist even when some limited discrepancy is allowed, dropping at an initially slow rate as D increases from 0 and eventually at a slightly faster rate as it approaches 1. This suggests that to fully leverage the benefits of targeting, a high level of disparity must be accepted, but reasonable trade-offs can be achieved with some intermediate disparity.

Figure 4. The impact of limited disparity requirements. The plot shows the relative (%) increase in total loss generated by a ROLD policy, compared to a fully targeted policy, as a function of the disparity parameter D that measures the maximum allowed difference in activity levels for distinct age groups. The experiments are run using several values of c, which are used to generate each of the boxplots. Eleven values of D are tested, ranging from 0 to 1.

6/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Discussion Our case study suggests that an optimized intervention targeting both age groups and activities carries significant promise for alleviating a pandemic’s health, economic and even psychological burden, but also points to certain challenges that require care in a real-world setting. Why consider optimized dual-targeted interventions? The first reason are the significantly better health and economic outcomes: for the same or a lower number of deaths, dual- targeted confinements can reduce economic losses more than any of the simpler interventions that uniformly confine age groups or activities. Furthermore, the super-additive gains imply that significant synergies can be generated through finer targeting, with the ability to target along activities improving the effectiveness of targeting along age groups, and vice-versa. The second reason is the intuitive nature of the optimized targeted confinement policy, which is consistent with a simple “bang-for-the-buck” rule: impose less confinement on group-activity pairs that generate a relatively high economic value prorated by (activity-specific) social contacts. This simple intuition combined with the reliance on just a few activity levels are appealing practical features, as they provide transparency into how targeted confinement decisions could be made. The simple “bang-for-the-buck” intuition is also related to the third benefit of dual targeting: the ability to impose less restrictive confinements across all age groups. As different age groups may be responsible for generating a larger economic value prorated by social contacts in distinct activities, dual-targeted confinements may enable all age groups to remain more active, resembling normal life more closely compared to less differentiated confinements. This could result in more socially acceptable restrictions, and a more appealing policy intervention overall. Lastly, we note that although dual targeting allows and can result in differences in confinements across age groups, such interventions are actually not far from many real-world policy implementations, which have been more or less explicit in their age-based discrimination. Dual targeting can arise implicitly in interventions that only seem to target activities. As an example, France implemented a population-wide 6 p.m. to 6 a.m. curfew (31), while maintaining school and work activities largely de-confined. This is effectively implementing restrictions similar to ROLD AGE-ACT: since a typical member of the 20-65 y.o. group is engaged in work until the start of the curfew, their leisure and other activities are implicitly limited; moreover, since most individuals aged above 65 are not in active employment, they are not that restricted in these last two activities. These examples show that some amount of targeting of activities and age groups is already in place and is perhaps unavoidable for effective pandemic management. Given this state of affairs, our framework highlights the significant benefits in explicitly and transparently modelling targeting and identifying the interventions that rigorously optimize overall societal welfare, given some allowable amount of differentiation across age groups. Challenges and limitations. An immediate practical challenge is data availability. Social contact matrices by age group and activity may be available from surveys on social behavior, which have been conducted in a number of countries; however, further data collection might be required to obtain these matrices for more refined population group or activity definitions. Similarly, economic data is reported by industry activities, but we are not aware of a dataset that splits economic value into separate (group, activity) contributions. Disparate data sources may be difficult to align: for example, social contact surveys and Google mobility reports use different activity categories, which requires non-trivial fitting (SI §4). Availability of data also constrains our model’s structure in several ways. In our model social contacts between age groups only depend on confinements in the same activity, since the available social contacts dataset (24) only reports interactions in the same activity. However, contacts occur as individuals are engaged in different activities (e.g., a professional in the services industry interacts during work with individuals who are engaged in leisure activities). A more refined contact mixing model that captures such interactions would be more appropriate for this study, provided that relevant data are available. Our economic model similarly ignores cross effects, such as young age groups engaged in school producing value in conjunction with educational staff engaged in work. Another challenge with targeted interventions is the perception that they lead to unfair outcomes, as certain age groups face more confinement than others. Such discrimination does arise in the optimized dual-targeted policies; our framework can partially address these concerns through explicit constraints that limit the disparities across groups. Our requirement that limited disparity hold for every time period and every activity is quite strict, and a looser requirement based, e.g., on time-average confinements could lead to smaller incremental losses. Alternatively, one can impose fairness requirements based on the intervention’s outcomes, e.g., requiring that the health or economic losses faced by different groups satisfy certain axiomatic fairness properties (32). Although we focus on confinement policies, a direction for future research is to investigate how these can be optimally combined with other types of targeted interventions. SI §7 reports experiments where we optimize a targeted policy based on confinements and randomized testing and quarantining. The framework is sufficiently flexible to accommodate interventions such as contact tracing and also vaccinations, although a careful implementation would require work beyond the scope of this article.

7/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Acknowledgments The authors thank Ciprian Badescu,˘ Florin Georgescu, Alexandru Hulea, Andrei Leica, and Vlad Paunescu˘ from Arnia Software for their efforts in implementing the framework and hosting it at http://insead.arnia.ro.

Authors’ contributions DFC, DAI and SIZ contributed equally to the research design, conceptualisation, modeling, analysis, and writing. SC and XSW contributed equally to the modeling, software implementation, data analysis and creating visualizations. DFC, DAI and SIZ verified all the data used in this study.

Declaration of interests All authors declare no competing interests.

References 1. Acemoglu, D., Chernozhukov, V., Werning, I. & Whinston, M. D. Optimal targeted lockdowns in a multi-group SIR model. Working Paper 27102, NBER (2020). DOI: 10.3386/w27102.

2. Matrajt, L., Eaton, J., Leung, T. & Brown, E. R. Vaccine optimization for COVID-19, who to vaccinate first? (2020).

3. Goldstein, J. R., Cassidy, T. & Wachter, K. W. Vaccinating the oldest against COVID-19 saves both the most lives and most years of life. Proc. Natl. Acad. Sci. 118, DOI: 10.1073/pnas.2026322118 (2021). https://www.pnas.org/content/118/ 11/e2026322118.full.pdf.

4. Bertsimas, D. et al. Optimizing vaccine allocation to combat the COVID-19 pandemic (2020).

5. Favero et al., C. A. Restarting the economy while saving lives under COVID-19 (2020).

6. Birge, J. R., Candogan, O. & Feng, Y. Reducing economic losses with targeted closures. Working Paper 2020-57, University of Chicago, Becker Friedman Institute for Economics (2020).

7. Chang, S. et al. Mobility network models of COVID-19 explain inequities and inform reopening. Nature 589, 1–6 (2020).

8. Evgeniou, T. et al. Epidemic models for personalised COVID-19 isolation and exit policies using clinical risk predictions, DOI: 10.1101/2020.04.29.20084707 (2020). https://www.medrxiv.org/content/early/2020/05/12/2020.04.29.20084707.full. pdf.

9. Giordano, G. et al. Modeling vaccination rollouts, SARS-CoV-2 variants and the requirement for non-pharmaceutical interventions in Italy. Nat. Medicine 27, 993–998, DOI: 10.1038/s41591-021-01334-5 (2021).

10. Bastani, H. et al. Deploying an artificial intelligence system for COVID-19 testing at the Greek border. Available at SSRN (2021).

11. Tiirinki, H. et al. COVID-19 pandemic in Finland–preliminary analysis on health system response and economic consequences. Heal. policy technology 9, 649–662 (2020).

12. Harrison, S. Coronavirus: Ireland’s restrictions eased for over 70s (2020). Access. 01/12/2021.

13. Magid, J. Cabinet approves removal of age restriction for those returning to work (2020). Accessed January 12, 2021.

14. Foy, H. Russian businesses prepare for fresh lockdowns as Covid-19 cases soar (2020). Accessed January 12, 2021.

15. Reuters Staff. Bosnian region eases lockdown on seniors, children after court ruling (2020). Accessed January 12, 2021.

16. Kanbur, N. & Ankgül, S. Quaranteenagers: A single country pandemic curfew targeting adolescents in Turkey. J. Adolesc. Heal. 67 (2020).

17. Kucharski, A. J. et al. Effectiveness of isolation, testing, contact tracing, and physical distancing on reducing transmission of sars-cov-2 in different settings: a mathematical modelling study. The Lancet Infect. Dis. 20, 1151–1160, DOI: 10.1016/S1473-3099(20)30457-6 (2020).

8/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

18. Prem, K. et al. The effect of control strategies to reduce social mixing on outcomes of the COVID-19 epidemic in Wuhan, China: a modelling study. The Lancet Public Heal. 5, e261–e270, DOI: 10.1016/S2468-2667(20)30073-6 (2020).

19. Di Domenico, L., Pullano, G., Sabbatini, C. E., Boëlle, P.-Y. & Colizza, V. Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies. BMC medicine 18, 1–13 (2020). 20. Aguilera, J. Some supermarkets are launching senior-only hours during the coronavirus pandemic. Not all retailers think that’s a good idea (2020). Accessed January 12, 2021.

21. Anderson, R. M. & May, R. M. Infectious Diseases of Humans: Dynamics and Control (Oxford University Press, 1992). 22. Salje, H. et al. Estimating the burden of SARS-CoV-2 in France. Science 38, DOI: 10.1126/science.abc3517 (2020). 23. Wille, L. et al. SOCRATES: an online tool leveraging a social contact data sharing initiative to assess mitigation strategies for COVID-19. BMC Res. Notes 13 (2020).

24. Béraud, G. et al. The French connection: The first large population-based contact survey in France relevant for the spread of infectious diseases. PLOS ONE 10, 1–22, DOI: 10.1371/journal.pone.0133203 (2015). 25. French Government. Données hospitalières relatives à l’épidémie de COVID-19. https://www.data.gouv.fr/fr/datasets/ donnees-hospitalieres-relatives-a-lepidemie-de-covid-19/ (2020). Accessed October 21, 2020.

26. Google. COVID-19 community mobility report. https://www.google.com/covid19/mobility/ (2020). Accessed October 21, 2020.

27. Borrelli, F., Bemporad, A. & Morari, M. Predictive control for linear and hybrid systems (Cambridge University Press, 2017).

28. Duque, D. et al. Timing social distancing to avert unmanageable COVID-19 hospital surges. Proc. Natl. Acad. Sci. 117, 19873–19878 (2020).

29. Lehot, M. & Borgne, B. L. Covid-19 : ces chiffres qui montrent que Paris a dépassé le seuil d’alerte maximale depuis le 25 septembre (2020). Accessed October 5, 2020.

30. Alvarez, F. E., Argente, D. & Lippi, F. A simple planning problem for COVID-19 lockdown. Working Paper 26981, National Bureau of Economic Research (2020). DOI: 10.3386/w26981.

31. Reuters. France goes under nationwide 6pm curfew as Covid-19 death toll surpasses 70,000 (2021). Accessed January 12, 2021.

32. Young, H. P. Equity (Princeton University Press, 1994).

9/9 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Supplementary Information for

Quantifying The Benefits of Targeting for Pandemic Response

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis

Emails of authors: [email protected], fl[email protected], [email protected], [email protected], [email protected]

This PDF file includes: Supplementary text Figs. S1 to S10 Tables S1 to S8 SI References

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 1 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Supporting Information Text 1. Some Notation We denote scalars by lower-case letters, as in v, and vectors by bold letters, as in v. We use square brackets to denote the concatenation into vectors: v := [v0,v1]. For a time series of vectors v1,...,vn, we use the notation vi:j := [vi,...,vj ] to denote the concatenation of vectors vi through vj . Lastly, we use v€ to refer to the transpose of v.

2. Model and Optimization Problem In this section, we propose a modified version of the discretized SEIR (Susceptible - Exposed - Infectious - Recovered) population-based epidemiological model with multiple population groups, which interact with each other. In our model, the SEIR dynamics are controllable via (i) choosing confinement patterns for dierent groups and (ii) choosing how to apportion testing among the groups. Then, we build a economic model on top of the SEIR system, which factors in the cost of lost output, together with the cost of deaths. We conclude by formulating the overall optimization problem.

A. Epidemiological Model. Time. Time is discrete, indexed by t =0, 1,...,T, and measured in days. We assume that no infections are possible beyond day T .

Age Groups. We split the population into age groups. We use to denote the collection of considered age groups. In our G case study for Île-de-France we use nine age groups, split in 10-year buckets, where the youngest group are the 0-9 year olds (y.o.), and the oldest group are the 80+ year olds. Since individuals in separate groups have social contacts with each other, the dynamics of each group depend on the number of infections in other groups.

Compartmental Model and States. At any given time, the population of an age group is divided into susceptible, exposed, infectious, recovered, and deceased as in a classical SEIR model. Figure S1 represents the compartmental model and the SEIR transitions for a specific group g. All the states represent the number of individuals in a compartment of the model in the beginning of the time period. State Sg(t) is the number of individuals in group g that are susceptible to be infected at time t. State Eg(t) is the number of individuals in group g that have been exposed to the SARS-CoV-2 virus at time t, but are not yet infectious. Ig(t) is the number of individuals in group g that are infectious at time t, and have not yet been tested, or recovered, or transferred to the hospital or the ICU. Susceptible individuals get infected and transition to the exposed state at a rate determined by the number of social contacts as well as transmission rate —(t). Exposed individuals transition to the infectious state at a rate ‡. q Ig (t) is the number of individuals in group g who have been confirmed to be infectious through viral testing at time t, and are thus quarantined. We subdivide Iq(t) into Iq (t) for j a, ps, ms, ss to model dierent degrees of severity of symptoms: g j,g œ{ } infectious individuals can be asymptomatic, paucisymptomatic, have mild symptoms, or have severe symptoms. We assume that an infectious individual in group g will exhibit symptoms of degree j with probability pj,g. Infectious individuals transition out of the infectious state at a rate µ. Individuals with severe symptoms need hospitalization. Hg(t) is the number of individuals in group g that are in the general hospital wards at time t, and ICUg(t) is the number of individuals in group g that are in the intensive care unit at time t.We H ICU assume that an infectious individual will need to be hospitalized in the general wards (go to ICU) with probability pg (pg ). H ICU 1 We have pss,g = pg + pg . On average, patients who are treated in the general hospital wards spend ⁄H≠ days in the hospital; 1 patients who are treated in the ICU spend ⁄ICU≠ days in the ICU. Infectious individuals with severe symptoms may decease. Dg(t) is the number of individuals in group g that have died from COVID-19 up until time t. We assume that an infectious individual with severe symptoms in group g will decease with probability pD (and recover with probability pR =1 pD). g g ≠ g Infectious individuals usually recover. Rg(t) is the number of individuals in group g who have recovered until time t, but q have not been confirmed to have recovered. Rg(t) is the number of individuals in group g who have recovered until time t, and have been confirmed to have recovered and to have had the virus either through testing, or because they recovered during their stay in the hospital or the ICU. We keep track of the total number of individuals in group g who are not confirmed infectious, are not confirmed recovered, are not in the hospital or the ICU, and have not died: Ng(t):=Sg(t)+Eg(t)+Ig(t)+Rg(t).Welet

X(t)= Sg(t),Ig(t),...,Dg(t) g œG denote the full state of the system at time t =0, 1,...,T# 1,T, i.e., the number$ of individuals in each of the states, across ≠ groups. We denote the number of compartments by . Then the dimension of X(t) is 1. |X | |G||X | ◊ We assume that Dg(0) = 0 for all g . œG

2 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Fig. S1. Compartmental SEIR model for a specific group g

q q Ij,g ICUg Rg

q Sg Eg Ig Iss,g Hg Dg

SEIR transition Rg Transition due to viral testing

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 3 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

A.1. Controls/Decision Variables. We allow for three levers for the policy maker’s response: targeted testing, targeted confinements, and targeted patient turn-away decisions.

Confinements and social mixing model. We refer to the activity level decision for a group g as the vector

home work school transport leisure other 6 ¸g(t):= ¸ (t),¸ (t),¸ (t),¸ (t),¸ (t),¸ (t) [0, 1] g g g g g g œ that captures the level of each activity# that an individual in group g is allowed to perform$ in period t, in relation to the normal activity level under normal course (no confinement), as a result of imposed confinement restrictions. We denote the set of considered activities by := home, work, school, transport, leisure, other . A { } work Normal course, i.e., no confinement, is modeled using ¸g = 1 =[1, 1, 1, 1, 1, 1]. As an example, ¸g =0.7 can be interpreted as a restriction for individuals in group g to spend at the workplace only 70% of the baseline time they spend at the workplace. We provide more details about how the baseline is defined for our numerical experiments and calibration in Section 4 B. When unambiguous, we use the terms activity levels and confinements to refer to the same underlying decision of ¸g(t). The confinement decision variables for the policy maker are the activity levels of each group, in each time period: ¸g(t) g ,t=0:T 1. { } œG ≠ In our proposed implementation for Île-de-France we allow the confinement decisions to update every two weeks. Confinements aect the amount of social contacts between individuals of dierent groups. We build a model to reflect this dependence which we describe below. For activity a and groups g, h, we denote Ca the number of contacts in a day between an individual in group g and œA g,h individuals in group h in activity a under normal course (no confinement). We retrieve the contact matrices per activity Ca,a from the social contact data tool by (1), using the survey on social contacts in France by (2). œA Let cg,h(¸g, ¸h) denote the mean number of contacts per day between an individual in group g and individuals in group h, when groups g, h are confined according to confinement patterns ¸g, ¸h, respectively. We use the following multiplicative extrapolation of social mixing matrices, with parameter –1,–2 R: œ

a a –1 a –2 cg,h(¸g, ¸h)= C ¸ (¸ ) . [1] g,h · g · h a ÿœA ! " We remark that this parametrization is similar to a Cobb-Douglas production function (3), using the confinement patterns as inputs, and the number of social contacts as output. In Section 4 B, we describe our procedure for fitting the social mixing parameters from data on health outcomes and mobility data.

Testing. We model viral diagnostic tests. Individuals who are found to be infected through a diagnostic test are quarantined. We assume that there is access to perfect tests (i.e., the sensitivity and specificity are 100%). We assume that a viral test yields a positive result when conducted on an infectious individual (compartment Ig), and a negative result when conducted on an individual who is in a non-infectious state. Vtest We use Ng (t) to denote the number of viral tests allocated to group g in period t. The testing decisions for the policy Vtest maker are then Ng (t) g ,t=0:T 1.We assume that, given a testing allocation decision across groups, the individuals { } œG ≠ to receive the allocated tests in group g are randomly chosen among individuals in Ng(t). For example, the outflow from Vtest performing Ng (t) viral tests in group g, which happens out of compartment Ig(t), is equal to

Ig(t) Vtest Ng (t). [2] Ng(t) ·

In our proposed implementation for Île-de-France we allow the testing decisions to change weekly. Figure S1 represents the compartmental model and the flows resulting from testing from one compartment to the other for a specific group g.

Turning patients away. When the patient inflow into the hospital or the ICU exceeds the remaining number of available beds, then the decision maker needs to decide how to prioritize the admission of patients to the available beds. Our general formulation allows the decision maker to optimize the number of patients from each group turned away from the general hospital wards and from the ICU, at each time period. We denote the number of patients from age group g who need to be H admitted to the general hospital wards but are turned away in period t by Bg (t). We denote the number of patients from age ICU group g who need to be admitted to the ICU but are turned away in period t by Bg (t). The turn-away decision variables for H ICU the policy maker are then Bg (t),Bg (t) g ,t=0:T 1. We assume that all patients who are turned away decease immediately. { } œG ≠ In our experiments we use a proportional turn-away rule, which is defined in Section A.3.

Let Vtest H ICU u(t)= ¸g(t),N (t),B (t),B (t) g g g g œG denote the vector of all the decisions/controls at# time t =0, 1,...,T 1, i.e., the$ confinement and viral test decisions for all ≠ the groups, as well as the decisions on turning patients away. We denote the number of dierent decisions for a given group at agiventimeby . Then the dimension of u(t) is 1. |U| |G||U| ◊

4 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

A.2. Dynamics of the Controlled SEIR Model. Having defined the states and controls, we write down a set of discrete time dynamics for the controlled SEIR model. We use notation X(t) to denote X(t +1) X(t). For all groups g , we write ≠ œG

Vtest Ig(t) H ICU Ng(t)= Ng (t) µ (pg + pg ) Ig(t) [3] ≠ · Ng(t) ≠ · · infected going to ICU, H

¸ ˚˙ Ih(˝t) Sg(t)= —(t)Sg(t) cg,h(¸g(t),¸h(t)) q [4] ≠ · Nh(t)+R (t) Ah h B ÿœG Ih(t) Eg(t)=—(t)Sg(t) cg,h(¸g(t),¸h(t)) q ‡Eg(t) [5] · Nh(t)+R (t) ≠ Ah h B ÿœG H ICU H ICU Vtest Ig(t) Ig(t)=‡Eg(t) µ (pg + pg ) Ig(t) µ (1 pg pg ) Ig(t) Ng (t) [6] ≠ · · ≠ · ≠ ≠ · ≠ · Ng(t) go to ICU, H recover q outflow to I ,g from positive V tests · H¸ ICU ˚˙ ˝ ¸ ˚˙ ˝ Rg(t)=µ (1 pg pg ) Ig(t) ¸ ˚˙ ˝ [7] · ≠ ≠ · q Vtest Ig(t) q Ij,g(t)= pj,g Ng (t) µ Ij,g(t), j a, ps, ms [8] · · Ng(t) ≠ · ’ œ{ } recover inflow from Ig from positive V tests

q Vtest Ig(t) ¸ ˚˙ ˝ q Iss,g(t)= ¸pss,g Ng ˚˙ (t) ˝ µ Iss,g(t) [9] · · Ng(t) ≠ · go to ICU, H, R, or D inflow from Iss,g from positive V tests

q q H R ¸ ˚˙ICU ˝ R Rg(t)=µ ¸ I˚˙ (t)+⁄g ˝ pg Hg(t)+⁄g pg ICUg(t) [10] · j,g · · · · j a,ps,ms œ{ ÿ } recovery of known infected cases H ¸ H H ˚˙ pg q H ˝ Hg(t)= ⁄g Hg(t)+µ pg Ig(t)+ H ICU Iss,g(t) Bg (t) [11] ≠ · · pg + pg · ≠ 3 4 ICU ICU ICU pg q ICU ICUg(t)= ⁄g ICUg(t)+µ pg Ig(t)+ H ICU Iss,g(t) Bg (t) [12] ≠ · · pg + pg · ≠ 3 4 H D ICU D H ICU Dg(t)=⁄ p Hg(t)+⁄ p ICUg(t)+B (t)+B (t). [13] g · g · g · g · g g

In Eq. (7) and Eq. (10), note that we do not have terms for the population turned away from hospital/ICU which may eventually recover. Instead, we assume the turned away patients will go into the deceased state. In Eq. (13), we are assuming that if a patient is turned away from the ICU, they transition into deceased, instead of being allocated a hospital bed if one is available.

We can summarize the SEIR dynamics with

X(t +1)=X(t)+Ft (X(t), u(t)) , [14]

where the function Ft captures the dynamics in Eq. (3)-Eq. (13).

We now provide justification for how we account for social contacts and, in particular, for the expressions in Eq. (4) q and Eq. (5). We assume that individuals in Rg can interact with members of Sg,Eg,Ig and Rg but cannot infect.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 5 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Fix a person i in age group g ,instateSg. Then we can write œG

Pr i got infected through socializing with age group h [15] { } Ah B €œG

= Pr i got infected through socializing with individuals in Sh,Eh,Ih or Rh [16] { } Ah B €œG =1 Pr( i did not get infected through socializing with individuals in any Sh,Eh,Ih or Rh for any h ) [17] ≠ { œG} =1 Pr( i did not get infected through socializing with individuals in Sh,Eh,Ih or Rh ) [18] ≠ { } h ŸœG Nh(t) q cg,h Ih(t) Nh(t)+R (t) =1 1 —(t) h [19] ≠ ≠ Nh(t) h 3 4 ŸœG Nh(t) Ih(t) 1 1 —(t) q cg,h [20] ¥ ≠ ≠ · Nh(t)+R (t) · · Nh(t) h 3 h 4 ŸœG Ih(t) —(t) cg,h q , [21] ¥ Nh(t)+R (t) h h ÿœG where in Eq. (19) we use the following reasoning. Having fixed person i in age group g, the number of her contacts in age group h is given by cg,h = cg,h(¸g(t),¸h(t)). The number of contacts cg,h reflects the number of contacts of individual i from group h, assuming all individuals in age group h are put under the same confinement pattern. However, recall that individuals q q in Rh(t) cannot infect individuals in Sh,Eh,Ih or Rh. Therefore, we scale cg,h by the ratio of (S, E, I, R, R ) individuals in Nh(t) age group h that are in states (S, E, I, R),i.e., N t Rq t , to account for the role of the relative size of groups of individuals h( )+ h( ) in dierent confinement patterns within an age group. Finally, getting infected as the result of any contact within group h is considered to be an independent event. By taking the expectation of random variable

i got infected through socializing , { } i S ÿœ g we retrieve the expressions in Eq. (4) and Eq. (5).

A.3. Resources and Constraints. We use KH(t) (KICU(t)) to denote the capacity of beds in the general hospital wards (in the ICU) on day t. When the patient inflow into the hospital or the ICU exceeds the remaining number of available beds, then the policy maker needs to decide how many patients to turn away from each group. Although our framework allows to optimize over the decisions of turning away patients, we mainly look at a specific turn-away rule: let in patients from each age group proportionally to the arrivals from that age group, up to capacity, and turn away the remaining patients. This specific turn-away rule can be expressed by setting

H H pg q µ pg Ig(t)+ H ICU Iss,g(t) H · pg +pg · Bg (t)= pH 1 H h q 2 µ p Ih t I t h h ( )+ pH +pICU ss,h( ) · h h · 1 2 q proportion of inflow into H from group g + ¸ ˚˙ ˝ H H ph q H H Q µ p Ih(t)+ I (t) K (t) 1 ⁄ Hh(t) R [22] · h · pH pICU · ss,h ≠ ≠ ≠ g h + h A B c h 3 4 h d cÿ ÿ ! " d c total inflow into H available beds in H d a ICU b ICU pg q µ¸ pg Ig(t)+ H ˚˙ ICU Iss,g(t) ˝ ¸ ˚˙ ˝ ICU · pg +pg · Bg (t)= pICU 1 ICU h q 2 µ p Ih t I t h h ( )+ pH +pICU ss,h( ) · h h · 1 2 + q ICU ICU ph q ICU ICU µ p Ih(t)+ I (t) K (t) 1 ⁄ ICUh(t) [23] · h · pH + pICU · ss,h ≠ ≠ ≠ g A h 3 h h 4 A h BB ÿ ÿ ! " We further assume a given capacity for viral tests each day, which we denote by KVtest(t) on day t. We assume that viral tests used to test individuals with severe symptoms that enter the ICU or hospital, as well as viral tests that test hospitalized

6 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

individuals to confirm they have recovered, come from a dierent pool of tests and do not consume the capacity for viral testing in the non-hospitalized population. We can now write the following constraints for the optimization problem:

H (“Hospital capacity”) Hg(t) K (t), t [24] Æ ’ g ÿ H H H pg q (“Hospital turn away”) Bg (t) µ pg Ig(t)+ H ICU Iss,g(t) 0, g, t [25] ≠ · pg + pg · Æ ’ 3 4 ICU (“ICU capacity”) ICUg(t) K (t), t [26] Æ ’ g ÿ ICU ICU ICU pg q (“ICU turn away”) Bg (t) µ pg Ig(t)+ H ICU Iss,g(t) 0, g, t [27] ≠ · pg + pg · Æ ’ 3 4 (“Viral testing capacity”) N Vtest(t) KVtest(t), t [28] g Æ ’ g ÿ (“Fractional lockdown”) ¸a(t) 1, g, a, t [29] g Æ ’ (“Non-negative decisions”) N Vtest(t),¸a(t) 0,BH (t) 0,BICU(t) 0 g, a, t [30] g g Ø g Ø g Ø ’ We denote by (X(t)) the feasible set described by Eq. (24)-Eq.(30) for the vector of decisions u(t) at time t. C B. Economic Model and Optimization Objective. Here, we describe in more detail the objective we use in our optimization problem. This objective seeks to combine two dierent losses due to the pandemic: (i) the measurable economic loss stemming from quarantining, confinements and wages lost due to death and illness as well as (ii) the mortality from the pandemic. We first describe how we account for economic loss in our model.

B.1. Economic Loss. These losses come from three separate sources:

Eect of quarantining. We capture the eect of quarantine and hospitalization by assuming that if at some time pe- q riod an individual in group g is in one of the SEIR chambers Ij,g, j a, ps, ms, ss ,Hg, ICUg, then they generate no q ’ œ{ } economic value. At the same time, we assume that individuals in Rg generate economic value as they would under no confinement.

Eect of confinement. To account for confinement in the non-quarantined population, we make the economic value per day generated by an individual in group g in the remaining (non-quarantined) SEIR chambers explicitly depend on the enforced confinement in the population. Recall that for a group g, the activity levels ¸g specify the level of each activity allowed for that group as compared to normal course, and ¸ = ¸g g . We denote the economic value generated by a member of g per day by { } œG vg(¸). We remark that vg(1) corresponds to the economic value generated by an individual under normal circumstances. The vg(¸) specific to a group can be of two types: (a) wages from employment and (b) future wages from employment due to schooling. Naturally, depending on the age group, both, one, or neither of these will actually contribute to economic value. Distinguishing whether the specific group is comprised of school age, employable or retired population, we define

schooling vg (¸) if g =0-9y.o. employment schooling vg (¸)+vg (¸) if g =10-19y.o. vg(¸):=Y [31] vemployment(¸) if g =20-29,30-39,40-49,50-59,60-69y.o. _ g ]0 otherwise.

_ schooling employment We down the definitions[ of vg (¸) and vg (¸) below:

employment • Value from employment vg (¸). The value generated from employment is a function of the confinement level in the work activity, but also of the confinement levels in leisure, transport,aswellasother activities. As an example, we expect the economic value of those employed in restaurants, retail stores, etc. to depend on foot trac levels, which in turn are driven by the confinement levels in leisure, transport and other activities across all groups. employment Our model for employment value is a linear parametrization of these confinement decisions; specifically, vg (¸) is linear in ¸work and the average of ¸transport, ¸leisure and ¸other across these three activities and all groups g : g œG

transport leisure other employment work work other activities 1 ¸h + ¸h + ¸h fixed v (¸):=wg ‹ ¸ + ‹ + ‹ . [32] g · · g · 3 A A |G| h B B ÿœG Additionally, ‹work,‹other activities and ‹fixed are activity level sensitivity parameters such that ‹work 1+‹other activities 1+ fixed · · ‹ =1; under fully open activity, they induce a multiplier of 1 in Eq. (32). Then, wg measures the overall employment employment value of a typical member of group g under no confinement, and is equal to vg (1).

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 7 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

We estimate the coecients of this model from data, as we describe in detail in Section 4.

schooling • Value from scho oling vg (¸). A day of schooling for the individuals in relevant groups results in economic value, equal to a day of wages that members of these groups would gain in the future. We use the of the 20-29 year-old group, multiplied by a factor, and we discount for a number of years corresponding to the dierence between the midpoint of the age group and the beginning of the 20-29 year-old group. For instance, for the 0-9 year olds, we would discount over 15 years, and for 10-19 year olds we discount over five years. That is, the discounting factor we apply is 15 (1 + r)≠ if g = 0-9 y.o. 5 ”g := (1 + r)≠ if g = 10-19 y.o., Y ]0 otherwise, where r is the discount rate. We further multiply[ the by fg, which is the fraction of group g that is in school. Lastly, we also use a multiplicative factor ◊. We use this for sensitivity analysis: ◊ reflects that an additional day of schooling may have a multiplier eect in future wages, as well as the fact that schooling can be continued online during lockdowns. We provide ranges for ◊ in Section 6. Thus, the definition for value of school days becomes

schooling employment school v (¸):=◊ fg ”g v (1) ¸ . [33] g · · · 20-29 y.o · g

Eect of lost wages due to death. We account for a deceased individual’s loss of wages which they would have earned from their current age until age, given the prevailing wage curve under status quo conditions given by vg(1) g . { } œG For group g, we set the current age to the mid-point of the age group. We discount the resulting cash flows by an annualized life interest rate. We denote the resulting lost wages amount by vg . For instance, for someone in age group 30-39 y.o., we calculate this cash flow by†

69 1 vlife 30 39 y.o. := · 35 ≠ (1 + r) ≠ · ÿ=35 1 employment 1 employment 35 · 39 v30 39 y.o. (1)+ 40 · 49 v40 49 y.o. (1) · { Æ Æ }· ≠ { Æ Æ }· ≠ 1 employment 1 employment + 50 · 59 v50 59 y.o. (1)+ 60 · 69 v60 69 y.o. (1) . [34] ! { Æ Æ }· ≠ { Æ Æ }· ≠ We are now ready to define the economic loss component of the objective. We first define a quantity V "which represents the economic value that would be generated in all groups g under a “no pandemic” scenario. More precisely, to calculate V we œG assume that at time t =0all the infected or exposed population is instantaneously healed and able to generate full economic value vg(1). Thus,

T 1 ≠ q q V := vg(1) Ng(0) + I (0) + Hg(0) + ICUg(0) + R (0) . [35] · Q j,g g R t=0 g j a,ps,ms,ss ÿ ÿœG œ{ ÿ } a b We note that this term is a constant and does not depend on the policy followed by the policy maker. The economic loss is then defined as the dierence between V and the (potentially negative) value generated during the pandemic, and which depends on the policy followed by the policy maker:

T 1 ≠ q life Economic Loss(u0:T 1):=V vg (¸(t)) (Sg(t)+Eg(t)+Ig(t)+Rg(t)) + vg(1) Rg(t) + vg Dg(T ), [36] ≠ ≠ · · · t=0 g g ÿ ÿœG ! " ÿœG B.2. Total Deaths. We associate each death with a non-pecuniary cost parametrized by ‰. Similarly to (4), our approach is to characterize the frontier between deaths and economic losses. For any cost of death ‰, maximizing the objective function for that ‰ will yield a particular point on the frontier. One can trace the frontier by varying ‰. We focus on characterizing the frontier, rather than selecting an optimal point along the frontier. Selecting an optimum would entail determining a choice for the value of life, which is not desirable, as there is disagreement about the right value. We define

Total Deaths(u0:T 1):= Dg(T ). [37] ≠ g ÿœG This is due to the fact that a small fraction of the members of the 10-19 year old group are already in workforce. We do not count the value of lost schooling for them. †Note that we are counting the entire value for the 60-69 y.o. age group; this is due to the fact that, as we explain in Section 4, this value has already incorporated that only a fraction of the population in the 60-69 y.o. age group have work-eligible ages (60-64 y.o.), while the rest are retired.

8 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

C. Optimization Problem. We can now formulate the overall optimization problem we are solving. Our objective captures the trade-o between the cost of the pandemic in terms of lost lives, and the cost in terms of foregone economic output (the economic loss). In addition, we require that the state trajectory follow the SEIR dynamics, and that the controls and state trajectory respect the capacity and feasibility constraints discussed above. We can then formulate this problem as

min Economic Loss(u0:T 1)+‰ Total Deaths(u0:T 1) [38] u0:T 1 ≠ · ≠ ≠ s.t. X(t)+Ft (X(t), u(t)) , 0 t T 1 [39] ’ Æ Æ ≠ u(t) (X(t)), 0 t T 1. [40] œC ’ Æ Æ ≠

3. ROLD: Re-Optimization with Linearized Dynamics In this section, we describe our approach to solving problem Eq. (38)-Eq. (40) via a linearization heuristic. Before delving into this, we make a few comments on why is it dicult to solve this problem to optimality. Observe that a key dynamic in any SEIR-type model is the rate of new infections that multiplies the current susceptible population in a given group with the infected population in another group and which already introduces non-linearity in the state trajectory. This can be illustrated with our dynamic for the evolution of the susceptible population in group g,Eq.(4), which we replicate here:

Ih(t) Sg(t)= —(t) Sg(t) cg,h(¸g(t),¸h(t)) q . ≠ · · · Nh(t)+R (t) Ah h B ÿœG Vtest It can be easily seen that expanding out Sg(t) produces a nested fraction of polynomials in all the past decisions ¸(·),N (·) for 0 · t 1‡. This function has no identifiable structure that would make the resulting optimization problem tractable via Æ Æ ≠ convex optimization techniques. Similarly, the objective also involves products of states and controls, and suers from the same non-linearity. Moreover, this issue persists even if one focuses on holding testing fixed and optimizing over the confinement decisions, or vice-versa. With this in mind we focus on developing heuristics that can tractably yield good policies, and propose an algorithm, which we call Re-optimization with Linearized Dynamics, or ROLD, that builds a control policy by incrementally solving linear approximations of the true SEIR system.

A. Linearization. The key idea here is to solve the problem in a shrinking-horizon fashion, where at each time step k =0,...,T we linearize the system dynamics and objective (over the remaining horizon), determine optimal confinement and testing decisions for all k, . . . , T, and only implement the decisions for the current time-step. We first describe the linearization of the dynamics. As a slight abuse of notation which helps readability, we index time with subscripts, i.e. we write Xt := X(t) and similarly, ut := u(t). Note that we can write the true evolution of our dynamical system as:

Xt = Ft(Xt, ut), [41] t

where the (multi-)function Ft is given by Eq. (3)-Eq. (13) and is non-linear. The typical approach in dynamical systems is to linearize the system dynamics around a particular “nominal” trajectory. More precisely, assume that at time k we have access to a nominal control uˆk:T 1 and let Xˆk:T denote the resulting nominal system trajectory under the true dynamic Eq. (41) and ≠ under uˆk:T 1. We approximate the original dynamics through a Taylor expansion around Xˆt, uˆt : ≠

Xt ! " Ft(Xˆt, uˆt)+ X Ft(Xˆt, uˆt)(Xt Xˆt)+ uFt(Xˆt, uˆt)(ut uˆt), [42] t ¥ Ò ≠ Ò ≠

where X Ft and uFt denote the Jacobians with respect to Xt and ut, respectively. Note that these Jacobians are evaluated Ò Ò at points on the nominal trajectory, so Eq. (42) is indeed a linear expression of Xt and ut. Additionally, by induction, every state Xt under dynamic Eq. (42) will be a linear function of u· for ·

Linearization procedure. We use the following heuristic for obtaining an approximate control at time k, for k =0,...,T 1: ≠

1. Given the current Xk, use the true dynamic Eq. (41) to build a forecast for the trajectory Xˆt,t k +1,...,T using œ{ } some nominal control uˆk:T 1. Apply the capping function fi( ) to ensure the trajectory is feasible. ≠ · ‡ In our social contacts model, cg,h(¸g (t), ¸h(t)) is a polynomial.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 9 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

2. Use Eq. (42) to approximate the state dynamic for periods t k, . . . , T around the forecasted trajectory Xˆt. The œ{ } calculation of the Jacobians is provided in Section 8 B. Note that under this approximation, all the states Xt will be linear functions of the controls uk,...,uT 1. ≠

3. Consider the objective-to-go in the sub-problem over periods k, . . . , T. Since vg(¸(t)) is linear in ut and the states Xg(t) are also linear in controls uk,...,ut 1, this objective will contain bilinear terms. More precisely, the objective can be ≠ written compactly as

T 1 ≠ ut€MXt + “€Xt + ÷€XT , [43] t=k ÿ ! " for some matrix M with dimensions , and vectors “ and ÷ of dimensions 1, which we precisely |G||U| ◊ |G||X | |G||X | ◊ compute in Appendix C. We also linearize it using a Taylor approximation, i.e., we consider the following objective instead:

T 1 ≠ uˆ€MXˆt + Xˆ €M €(ut uˆt)+uˆ€M(Xt Xˆt)+“€Xt + ÷€XT . [44] t t ≠ t ≠ t=k ÿ ! " This objective is now linear in all the decisions uk,...,uT 1. ≠ ,(k) ú 4. Determine decision variables uk:T 1 to maximize the objective-to-go Eq. (44) subject to all the relevant constraints. Similarly, all constraints are linear≠ in these decisions.

,(k) ú 5. Update the true states using the optimal ukú := uk and using the proportional turn-away rule Eq. (22)-Eq.(23).

The linearization described above is run for all periods k =0,...,T 1 sequentially to output a full control policy ukú:T 1. H ≠ ICU ≠ We emphasize that although we allow the turn-away variables Bg (t) and Bg (t) to be optimized when building and optimizing the linearized model above, when we apply utú in the real dynamical system, we discard the turn-away values generated by the linearization and instead set the turn-away variables to their proportional values from Eq. (22), Eq. (23) as reported in Section 2 A.3.§ Our algorithm needs to set an initialization for the nominal control uˆk:T 1 at period k. In the algorithmic details that ≠ (k) follow (Section 3 B), we specify a choice for the period k initialization, which we denote by uˆk:T 1. Trust region implementation. In our experiments, we have found that the linearized model described≠ above may diverge significantly from the real dynamical system when the controls diverge suciently from the nominal-controls. This linearization (k) error leads to a large sensitivity to the initialization uˆ0:T 1 around which we construct the Taylor approximations. For example, if we construct the Taylor approximation around a policy≠ of full confinement, the linearized model would underestimate the eect of completely relaxing all the confinements, with respect to the real dynamics. Intuitively this is because the linearization (k) (k) errors are increasing in u0:T 1 uˆ0:T 1 . || ≠ ≠ ≠ || We overcome this diculty by employing an iterative procedure for optimizing within each period k; this procedure resembles a trust region optimization method. The key idea is that, in order to avoid the large approximation errors, we run the linearization iteratively, with each linearization only allowed to take a small step towards the optimum within a trust region of an ‘-ball around uˆk:T 1, and with the updated control of each linearization being used as seed for the next. This leads to a ≠ procedure that is much more robust to the initial guess of control sequence, albeit at the expense of increased computation time.

B. Algorithmic Details. In this section, we clarify the algorithmic details of the linearization procedure described above. We first focus on how we build a linear model given k, Xk and uˆk:T 1. ≠

Linearized Dynamics. In step 2, the algorithm builds a approximation of the state dynamics that is linear in the controls uk,...,uT 1. Here, we compute the coecients in front of each ut explicitly. Let us introduce the notation: ≠

At := I +t X Ft(Xˆt, uˆt) [45] ·Ò Bt := t uFt(Xˆt, uˆt) [46] ·Ò ct := t Ft(Xˆt, uˆt) X Ft(Xˆt, uˆt) Xˆt uFt(Xˆt, uˆt) uˆt , [47] · ≠Ò · ≠Ò ·

where matrix At has dimensions # , matrix Bt has dimensions ,$ and vector ct has dimensions |G||X | ◊ |G||X | |G||X | ◊ |G||U| 1. With this, we have |G||X | ◊ Xt = AtXt + Btut + ct,t=0,...T 1. +1 ≠ §We note that Eq. (22) - Eq. (23) are non-linear dependencies which we could also linearize in the heuristic, at the cost of increased complexity. We instead chose a simpler architecture for the heuristic, which keeps the turn-away variables free in the linearization.

10 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

¶ We can then express the state Xt as:

t 1 t 1 ≠ ≠ Xt = A· Xk + Ai B· u· + Ai c· . [48] A·=t 1,t 2,...,k B ·=k Ai=t 1,t 2,...,·+1 B ·=k Ai=t 1,t 2,...,·+1 B ≠ Ÿ≠ ÿ ≠ Ÿ≠ ÿ ≠ Ÿ≠

It is now possible to express both objective and constraints linearly in the decision ut.

Constraint coecients. We can write each of the constraints Eq. (24), Eq. (26) and Eq. (28) in the form

Lt K(t), where Lt := “x€ Xt + “u€ ut, t k, k +1,...,T 1 , [49] Æ · · ’ œ{ ≠ } for some (time-invariant) “x, “u. Since Xt is linear in uk,...,ut 1, to represent one such constraint we just need to store the ≠ coecients corresponding to all decision variables (i.e., uk,...,uT 1) and the free terms/constants that appear in Lt. ≠ In particular, in the LHS “x€ Xt + “u€ ut of such a constraint, the decision u· , for k · t, will have coecients · · Æ Æ

“u€, for · = t [50] € “x i=t 1,t 2,...,·+1 Ai B· , for · = t 1,t 2,...,k I ≠ ≠ ≠ ≠ 1 2 To make calculations ecient, we note thatr the coecients can be obtained recursively as in the Calculate ConstraintCo- efficients function defined in Algorithm 1.

Algorithm 1 CalculateConstraintCoefficients function.

Require: X¯k := Xk for t = k, k +1,...,T 1 do ≠ Calculate the constants in Lt as: “x€X¯t Set X¯t+1 := AtX¯t + ct Calculate coecients for u· in Lt as:

“u€, for · = t

“x€A¯t,· B· , for · = t 1,t 2,...,k ; ≠ ≠ where I, for · = t 1 A¯t,· := ≠ [51] A¯t,· A· , for · = t 2,t 3,...,k. ; +1 +1 ≠ ≠

return Coecients for u· in Lt in each t = k, k +1,...,T 1. ≠

Objective coecients. Up to constants, Eq. (44) can be written as

T

dt€Xt + et€ut [52] t=k ÿ ! " with

uˆt€M + “€, if t

In Eq. (44) the decisions ut, for k t T 1, will have objective coecients: Æ Æ ≠ T 1 ≠ Xˆt€M € + uˆ·€M + “€ Ai Bt ·=t+1 Ai=· 1,· 2,...,t+1 B ÿ ! " ≠ Ÿ≠

+ ÷€ Ai Bt. [54] Ai=T 1,T 2,...,t+1 B ≠ Ÿ≠

Then, the objective coecients in front of each ut can be calculated in a recursive manner similar to the one we use for the constraint coecients. The detailed function CalculateObjectiveCoefficients is defined in Section 8 C.1.

Specifics of Iterative Linearization Procedure.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 11 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Algorithm 2 Linearization function.

Require: time period k, starting state Xk, nominal control initialization uˆk:T 1 ≠ Calculate forecast Xˆk:T from Xk and uˆk:T 1 ≠ Run CalculateObjectiveCoefficients and CalculateConstraintCoefficients to compute linear coecients of decisions ut,t= k, . . . , T 1 ≠ ,(k) Solve resulting linear program with additional trust region constraints uk:T 1 B‘Œ(uˆk:T 1) to obtain controls uú ≠ ≠ k:T 1 ,(k) œ ≠ ú return uk:T 1. ≠

12 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Having defined these functions, the Linearization function is described in Algorithm 2. This function builds the linear approximation for the remaining trajectory of the system, and optimizes it in a trust region of an infinity-norm ‘-ball around the initial nominal control uˆk:T 1. We denote that ‘-ball by B‘Œ(uˆk:T 1). ≠ ≠ Within each period k, we call the Linearization function iteratively up to a termination condition, using the output control to initialize the nominal control and the trust region for the next call of the function. This still requires us to choose an (0) initialization of the k =0nominal control uˆ0:T 1; in our experiments we initialize this with a heuristic solution generated via a gradient method. The pseudo-code for this procedure≠ is Algorithm 3. For the termination conditions of the initial iterations, we combine a fixed upper bound on the number of iterations with a ,(k) ú condition that we do not repeat control sequences uk , in order to avoid cycles. The fixed upper bound on the number of ,(k 1) ú ≠ iterations is set so as to ensure that for each k, every confinement decision in uk:T 1 can be changed to any value in [0, 1] ≠ mult with ‘-length steps. In particular, for all the runs presented, we take the upper bound of steps to be ‘ ,with‘ =0.05 and mult =2, resulting in an upper bound of 40 runs for the inner loop. We experimented with dierent values of ‘ between 0.01 and 0.5 and values of mult between 1 and 5. As expected, lower values of ‘ resulted in a more stable and higher performing heuristic, while higher values of mult improved the heuristic only up to around 2, after which point the non-cycling termination condition was triggered almost always. On the other hand, reducing ‘ did have a significant impact on the run-time of the linearization algorithm. We chose the combination of ‘ and mult that gave us the best trade-o between the quality of the solution and the total run-time.

Algorithm 3 Iterative linearization algorithm. (0) Require: initial state X0 and initial nominal control for first period uˆ0:T 1 for k =0,...,T 1 do ≠ ≠ Initialize (0) (k) uˆ0:T 1, if k =0 uˆk:T 1 = ,(k≠ 1) [55] ≠ ú ≠ uk:T 1 , otherwise ; ≠ ˆ X0, if k =0 Xk = ,(k 1) [56] ˆ ú ≠ Calculate Xk from Xk 1 and uk 1 , otherwise ; ≠ ≠ while termination condition? do ,(k) (k) ú ˆ uk:T 1 = Linearization(k, Xk, uˆk:T 1) (k) ≠ ,(k) ≠ ú uˆk:T 1 = uk:T 1 ≠ ≠ ,(k) ú Set ukú equal to the confinement and testing decisions of uk , and use the proportional turn-away rule defined in Eq. (22)-Eq.(23)

4. Parametrization and Calibration of the Model for Île-de-France We parametrize the model using SEIR parameter estimates from (5). We calibrate the model using data on health outcomes, Google mobility data, as well as economic output data. Overall, we use data specific to the the Île-de-France region, or specific to France when Île-de-France-specific data is not available.

A. Basic SEIR Model Parameters. The SEIR model parameters that pertain across the age groups are summarized in Table S1. The age-group specific parameters are reported in Table S2. We start with the parameters as reported in (5), and then we 1 1 ICU D allow the values of parameters —,⁄≠ ,⁄≠ , pg g , pg g to change through time, as detailed in Section 4 B.We H ICU { } œG { } œG allow these parameters to change in time in order to model changes in the way hospitals manage COVID patients, as well as changes in mandates for using masks) and other measures that reduce* transmission. 1 For R0 and ⁄H , the reported uncertainty ranges are 95% confidence intervals. For ‡≠ (i.e., the mean stay in compartment E), the uncertainty range is calculated as 4 0.8 0.6, where 0.6 is half the width of the 95% confidence interval for the ± · incubation period reported in (6), and 0.8 accounts for the fact that the stay in compartment E is 4/5 of the mean incubation 1 time in (5). For µ≠ (i.e., the mean stay in an infectious state), the uncertainty range is calculated as 4 0.43, where 0.43 is ± half the width of the 95% confidence interval for the serial interval reported by (7). For the average stay in the ICU, we add to the mean stay of 20.46 days for Île-de-France, another 1.5 day, which is the mean time spent in hospital prior to ICU admission (5).

¶In the expressions in Eq. (48), Eq. (50), Eq. (54), we follow the convention that a product of matrices over an empty set of indices results in the identity matrix. We retrieve the parameter values as reported before (5) got updated on July 8, 2020. We note that (7) estimate the serial interval, and not the infectious period, to be 3.96 days. We borrow their confidence interval for the serial interval estimation and use it as an uncertainty range for our infectious period, which is of about the same length as their estimated serial interval.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 13 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Table S1. SEIR model parameters

Value Variable Description Reference [Uncertainty Range] 2.9 R0 Basic reproduction number (5) [2.8 - 2.99]

1 4.0 days ୠLatency period (5, 6) [3.57 - 4.43]

1 4.0 days µ≠ Infectious period (5, 7) [3.52 - 4.48]

H 1 14.94 days (⁄ )≠ Average time spent in Hospital (5) [14.55 - 15.32]

ICU 1 21.96 days (⁄ )≠ Average time spent in ICU (5) [21.38 - 22.6]

Table S2. Age-group specific SEIR model probability parameters

ICU H D pss,g pg pg pg Prob. of severe symptoms Prob. of ICU Prob. of Hospital Prob. of Death Age group g given infection given infection given infection given severe symptoms 0.002 0.000444 0.001556 0.006 0-9 [0.001 - 0.003] [0.000192 - 0.000765] [0.000235 - 0.002808] [0.003 - 0.013] 0.002 0.000444 0.001556 0.006 10-19 [0.001 - 0.003] [0.000192 - 0.000765] [0.000235 - 0.002808] [0.003 - 0.013] 0.006 0.00069 0.00531 0.011 20-29 [0.004 - 0.01] [0.000404 - 0.00132] [0.00268 - 0.009596] [0.007 - 0.016] 0.013 0.002067 0.010933 0.019 30-39 [0.008 - 0.02] [0.001168 - 0.00346] [0.00454 - 0.018832] [0.015 - 0.023] 0.017 0.003774 0.013226 0.033 40-49 [0.01 - 0.027] [0.0021 - 0.006345] [0.003655 - 0.0249] [0.029 - 0.037] 0.035 0.00966 0.02534 0.065 50-59 [0.021 - 0.054] [0.005565 - 0.015498] [0.005502 - 0.048435] [0.06 - 0.07] 0.071 0.021868 0.049132 0.126 60-69 [0.042 - 0.11] [0.012516 - 0.03498] [0.00702 - 0.097484] [0.12 - 0.132] 0.113 0.028137 0.084863 0.21 70-79 [0.067 - 0.175] [0.016147 - 0.04515] [0.02185 - 0.158853] [0.203 - 0.218] 0.32 0.01792 0.30208 0.316 80+ [0.19 - 0.494] [0.01007 - 0.029146] [0.160854 - 0.48393] [0.309 - 0.324]

Calculating the transmission rate — from R0. We obtain — by linearizing the dynamics for Eg,Ig around a point where Sh Nh,Ih 0, h. More precisely, we have: ¥ ¥ ’ ˆEg —(t) cghIh(t) ‡Eg(t) ˆt ¥ ≠ h ÿ ˆIg ‡Eg(t). ˆt ¥ T Then, with Y (t):= E1(t),E2(t),...,E ,I1(t),...,I , we can write Y˙ (t)=( + )Y (t), where |G| |G|

# $ 0 [cgh]g,h = — œG [57] · 0 0 5 6 and diag(‡) 0 = ≠ [58] diag(‡) diag(µ) 5 ≠ 6 1 Then R can be identified as the spectral radius (i.e., the largest absolute value of the eigenvalues) of the matrix ≠ (8, 9). 0 ≠ Since the eigenvalues of a matrix — A are simply — multiples of the eigenvalues of A, we can therefore determine — as R0 · 1 divided by the spectral radius of the matrix ( /—) ≠ . ≠

14 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

B. Epidemiological Model Parameter Fitting Using Health Outcomes and Mobility Data. We use data on health outcomes from the French Public Health Agency (10), as well as Google mobility data (11), to estimate the unknown parameters in our model. The data on health outcomes includes counts for individuals who are in the hospital, in the ICU, and have died, by age group, and is maintained and updated daily by the French Public Health Agency (Santé publique France). The Google mobility data reports changes in activity at dierent places compared to a baseline, and is calculated using aggregate and anonymized data. For both health outcomes and mobility, we use data specific to the Île-de-France region. The calibration exercise has two purposes: (a) to further refine the SEIR parameters reported in the literature to the data observed in Île-de-France and (b) to estimate our other parameters for which we do not have existing references. We then use the values of the estimated parameters in our experiments and simulations. We first describe the set of parameters to be estimated, which we denote by . P • Date of patient zero. We assume that the SEIR process starts with an infected individual of the 40-49 y.o. age group (12). We wish to estimate the date when this infection occurs. • Epidemiological parameters. We use the epidemiological parameters of (5) to initialize the SEIR model. We allow these parameters to change in time in order to model changes in the way hospitals manage COVID patients, as well as changes in mandates for using masks and other measures that reduce transmission. We assume that on date d, each parameter 1 1 ICU D from the set = —,⁄≠ ,⁄≠ , pg g , pg g changes with respect to its initial value (as reported in Section S H ICU { } œG { } œG A), according to the relationship ) s d =*ms s d,s , [59] after_ · before_ œS where ms is a multiplier pertaining to parameter s. We assume the same multiplier m ICU for all groups g , and pg œG similarly for m D . We seek to determine the date of change d as well as the multipliers ms,s . pg œS • Confinement patterns. To estimate activity levels for the activities in our social mixing model, we use Google mobility data (11). The mobility data reports changes of activity (visits and length of stay) for each day, compared to a baseline value. The baseline used corresponds to the median value for the corresponding day of the week, during the five-week period January 3 - February 6 2020. We fix the “home” activity level to be equal to 1, throughout time. We estimate the level of the other activities using the corresponding activities from the Google mobility data, as shown in Table S3.

Table S3. Mapping between the activities in our model and the activities in the Google mobility data

Activity in Our Model Activity in Google Mobility Data Work Workplaces Transport Transit stations Leisure Retail & recreation Other – Retail & recreation +(1 – ) Grocery & pharmacy other· ≠ other ·

What remains to be estimated in the calibration is the weight parameter –other, as well as the school activity levels. We calibrate the level of schooling activity for four dierent time periods. These periods are chosen to reflect (i) the dates when the French government closed down schools, and (ii) the French school calendar and summer recess. • Social mixing parameter. To reduce the number of parameters to be calibrated, we simplify our mixing model in Eq. (1) by constraining –1 = –2. We seek to determine this mixing parameter. We next describe the details of the fitting procedure that we set up in order to retrieve an optimal parameter fitting. The mixing dynamics of the SEIR model are driven by the vector of activity levels ¸g of each age group (as described in Section A). Data on activity levels can be noisy; we model this uncertainty by assuming the vector of activity levels is a random vector, distributed as follows: t t 2 2 ¸˜g(t)=¸¯g(t)(1 + Z ),Z N(0,‡ I),‡=10≠ . [60] ≥ The value ¸¯g(t) is obtained from the Google activity data at time t; this dataset does not dierentiate activity by age, so ¸¯g(t)=¸¯h(t) for all g, h ; in other words, all groups are assigned the same activity level. Recall that Hg(t),ICUg(t),Dg(t) œG denote the hospital utilization, ICU utilization and cumulative number of deaths according to the SEIR model. We denote these quantities with HP (t, ¸˜),ICUP (t, ¸˜),DP (t, ¸˜) to emphasize the dependence of the SEIR process on parameters and g g g P the vector of activity levels ¸˜. We aggregate these quantities over all age groups:

HP (t, ¸˜):= HgP (t, ¸˜) [61] g ÿœG ICUP (t, ¸˜):= ICUgP (t, ¸˜) [62] g ÿœG DP (t, ¸˜):= DgP (t, ¸˜) [63] g ÿœG Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 15 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Denote Hobs(t),ICUobs(t),Dobs(t) as the general ward hospital beds utilization, ICU beds utilization, and cumulative deaths, respectively, at time t, as observed from the Île-de-France real data from (10). We calculate the relative fitting error of the SEIR model at time t for each of these three quantities as

obs HP (t, ¸˜) H (t) P (t, ¸˜):= ≠ [64] EH Hobs(t) - - obs - ICUP (t, ¸˜) ICU - (t) P (t, ¸˜):=- ≠ - [65] EICU - ICUobs(t) - - - obs - DP (t, ¸˜) D (t) - P (t, ¸˜):=- ≠ . - [66] ED - Dobs(t) - - - - - We define the total expected fitting error as a sum of these- errors over dierent- time intervals: - - Peak ˜ Peak ˜ Peak ˜ P :=E˜ w HP (t, ¸)+ w ICUP (t, ¸)+ w DP (t, ¸) E ¸ ·E ·E ·E Ct Peak_H t Peak_ICU t Peak_D œ ÿ œ ÿ œ ÿ Tail Tail Tail + w P (t, ¸˜)+w P (t, ¸˜)+w P (t, ¸˜) [67] H ·EH ICU ·EICU D ·ED t Tail œÿ ! " Tail Tail Tail + (1 w ) P (t, ¸˜)+(1 w ) P (t, ¸˜)+(1 w ) P (t, ¸˜) , [68] ≠ H ·EH ≠ ICU ·EICU ≠ D ·ED t Main D œÿ ! " where the expectation is taken with respect to random vector ¸˜, and where the time intervals are defined in Table S4 and comprise the entire period between March 17 2020 and October 21 2020. Our approach penalizes the errors at the peak times for hospitalizations, beds utilization and deaths; it also penalizes errors over the last 14 days of the considered period, to ensure an accurate fit at the end of the calibration horizon. We use dierent weights to account for the dierent errors. We use Peak Tail Tail Tail w =1/6 to account for the relatively smaller period of the Peaks. We use wH = wD =1/3, but set wICU to 2/3. The reason for this choice is that we are interested in low error at the tail predictions of ICU utilization, as this number plays an important role in the dynamics of the model right after October 21st.

Table S4. Time intervals used in the calibration

Label Period March 17 - October 7 2020 Main (except for April 8, April 14, May 14) Peak_ICU April 8 2020 Peak_H April 14 2020 Peak_D May 14 2020 Tail October 8 - 21 2020

We seek to determine the set of parameters that minimize P in Eq. (67). We approximate the expectation in Eq. (67) P E through a Monte Carlo sample-average approximation, using 100 samples. The set contains both discrete and continuous P parameters. To minimize P , we first do a grid search over all possible combinations of the discrete parameters, and then for E each such combination, we perform a gradient descent procedure over the space of the continuous parameters. Our calibration procedure yields the parameter fitting summarized in Table S5. We compare the fitted values of the SEIR model with the values reported by the French National Agency of Public Health in Figure S2.

C. Economic Model Parameter Fitting. We obtain data on population, employment, and wages from the French National Institute of Statistics and Economic Studies (INSEE). Where relevant, we discount all cash flows at 3% annualized rate. We set the retirement age to be 65 (i.e., 64 is the last working year of age.) We first obtain the initial population data Ng(0) for each age group in Île-de-France at the end of 2019 from (13).

Estimation of wg. Recall that wg in Eq. (32) corresponds to the employment value for a member of group g, under normal conditions. To estimate wg, we use two datasets from INSEE: • Yearly full time equivalent (FTE)†† wages and employed population count for Île-de-France in 2016, broken up into the age groups “under 26 years old”, “26 to 49 years old” and “more than 50 years old” (14).

• FTE employment rates across the entire economy for the fourth quarter of 2019, bucketed by age groups “15 to 24 years old”, “25 to 49 years old”, “50 to 64 years old”, and “55 to 64 years old” (15).

††This is a normalization to account for employees doing part time work.

16 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Table S5. Calibration: fitted epidemiological and economic parameters and sensitivity analysis ranges

Parameter Estimated Value Estimation Interval Date of patient zero Jan 1 2020 Dec 8 2019 - Jan 27 2020 Date d when epidemiological parameters change April 16 2020 Mar 1 2020 - Apr 30 2020

Multiplier m— 0.60 0.5-2.0

Multiplier m ICU 1.66 0.5-2.0 pg

Multiplier m D 1.13 0.5-2.0 pg

Multiplier m⁄H 1.50 0.5-2.0

Multiplier m⁄ICU 2.00 0.5-2.0

Weight for activities –other 0.96 0.0-1.0 School activity ¸school Mar 17 - May 10 2020 0.0 0.0-0.1 School activity ¸school May 11 - Jun 30 2020 0.0 0.0-1.0 School activity ¸school Jul 1 - Aug 31 2020 0.2 0.0-0.2 School activity ¸school Sep 1 - Oct 21 2020 1.0 0.5-1.0

Social mixing –1 = –2 0.39 0.1-2.0 Sensitivity of econ. value ‹work 0.50 Sensitivity of econ. value ‹fixed 0.47

(a) ICU Beds Utilization (b) Hospital Beds Utilization (c) Cumulative Deaths

Fig. S2. Predictions of the fitted SEIR model vs. reported values by the French Public Health Agency for three different quantities: (a) ICU Beds Utilization, (b) Hospital Beds Utilization and (c) Cumulative Deaths.

Since we do not have a consolidated data source for economic data split by our exact age group definitions, we use the above datasets to interpolate values for wg. At a high level, we derive wage curves across age ranges. We next explain the general procedure, as well as the additional assumptions we have made for the interpolation. First, for the construction of wage curve by age bucket:

• We assume that the national level employment rates from (15) are equal to those of the Île-de-France region. Because the age bucketing for our age groups is finer than the age bucketing in the data, we use interpolation. Specifically, we fit a piece-wise linear model (consisting of three pieces) to the four employment rates reported for the “15 to 24 years old”, “25 to 49 years old”, “50 to 64 years old”, and “55 to 64 years old” groups. We take the mid-point of the age group as the x value of the datapoint; for example, for “50 to 64 years old” we use a midpoint of 57.5. With this model, we can infer an employment rate for any arbitrary age and construct an employment rate curve.

• We perform a similar fitting procedure for the age group wage information from (14); since the wage progression by age is much smoother, we use simple linear regression to construct a wage curve for each one of our age buckets.

• The previous wage curve only accounts for the employed population, whereas our age groups count the entire population. We thus combine the wage curve with employment rate and population data to arrive at a wage number blended across an entire age group’s population. When doing this, we treat the 10-19 y.o. and 60-69 y.o. age groups specially by assuming the employment rates are reported only with respect to the work-eligible population in that bucket (15-19 and 60-64 year olds, respectively). We also set the work-eligible population for the 0-9, 70-79, and 80+ age buckets to 0. The formula we use is

average_wage = employed_average_wage employment_rate fraction_work_eligible_population [69] g g ◊ g ◊ g

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 17 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

• The interpolations we use introduce errors: in particular, if we aggregate the wages inferred by our constructed curve

across the entire population, we overestimate the real total wages by 5.12%. We shade down all wages average_wageg proportionally so as to retrieve the real total wage amount wg. Table S6 summarizes the year-based employment contribution parameters we use in the objective function per age group. We note that we divide these year-based values by 365 when using them in the objective of the optimization problem, in order to capture employment value on a daily basis.

Table S6. Year-based economic value parameters under normal activity, per age group (in e)

Age Group gwg 365 ◊ 0-9 0.00 10-19 2084.79 20-29 11 743.20 30-39 25 799.29 40-49 31 746.68 50-59 32 573.79 60-69 12 640.83 70-79 0.00 80+ 0.00

Estimation of ‹work,‹other activities,‹fixed. We move on to the estimation of parameters ‹work,‹other activities,‹fixed in Eq. (32). These measure the sensitivity of economic value to the confinement pattern ¸(t). We estimate them from data on lost economic output during the first lockdown phase employed in Île-de-France, and in particular using the month of April 2020. We break up the approach into a few steps: • We use survey data of French managers regarding business activity during the lockdown starting March 17 2020. This is sentiment data where managers are asked to compare current business conditions to normal conditions for the same relevant time period (16, 17). These data are reported by industry, and we aggregate them into a single number, weighting by industry size. We use FTE wages and employed population count for the Île-de-France region in 2016 (18) to figure out the appropriate weights to use in the aggregation. We then use these monthly readings as proxies for the economic activity level due to confinements in the month of April 2020, as compared to normal activity. The economic activity level for the month of April is 58.51%.

• A requirement for our estimation are the precise levels of confinement in April 2020. We retrieve these from Google mobility data (11), as explained in Section 4 B. To simplify estimation, we set ‹other activities =0and then determine parameters ‹work,‹fixed solving the system of equations ‹work + ‹fixed =1 [70] work work fixed ‹ ¸April + ‹ =0.5851, [71]

work work where ¸April corresponds to the average value of ¸ (t) through the month of April 2020. In our experiments, we also test our algorithm in alternative scenarios where we set ‹other activities > 0, keep the value for ‹fixed from the system Eq. (70)- Eq. (71), and adjust ‹work =1 ‹other activities ‹fixed. The specific values we test are ‹other activities 0, 0.1, 0.2 . ≠ ≠ œ{ } 5. Benchmark Policies Benchmark policies for lockdown. We compare ROLD to several simpler classes of policies drawing inspiration by real life confinement management rules: • ICU admissions trigger policy — ICU-t. This class of policies is similar in spirit to the trigger rule proposed by (19) for the Austin metropolitan area. This rule places all age groups and activities (except home) at a strict level of confinement when the average seven-day inflow of hospital admissions exceeds a pre-determined threshold, and then relaxes the confinement to a relaxed level when the average seven-day hospital admissions and the hospital utilization rates drop below pre-determined thresholds. Since (19) does not dierentiate between hospital and ICU beds, we define our policy class on ICU admissions and utilization instead of hospitalizations. Specifically, the ICU admissions trigger policy is defined as in Algorithm 4.

We optimize over the parameters ¸strict, ¸relaxed, ·admissions and ·utilization via grid search with the goal of minimizing the objective in Eq. (38) corresponding to the total economic and life loss due to the pandemic, and we report the performance of the best policy.

18 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Algorithm 4 ICU Admissions Trigger Policy — ICU-t.

Require: parameters ¸strict, ¸relaxed, ·admissions, ·utilization Initialize ¸a(0) = ¸ for all a work, school, transport, leisure, other ,g g relaxed œ{ } œG for t 1 ...T 1 , g do œ{ ≠ } œG if 7 day average ICU admissions >·admissions then Set ¸a(t)=¸ for all a work, school, transport, leisure, other ,g g strict œ{ } œG ICU else if (7 day average ICU admissions · )AND ICUg(t)/K (t) · then Æ admissions g Æ utilization Set ¸a(t)=¸ for all a work, school, transport, leisure, other ,g g relaxed œ{ 1q } œG 2 else Set ¸a(t)=¸a(t 1) for all a work, school, transport, leisure, other ,g g g ≠ œ{ } œG

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 19 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

• Hybrid trigger policy — Hybrid-t. This policy resembles the rule used in France for declaring a region in “maximum alert”‡‡ (20). Like the previous policy, this policy also switches between uniform strict versus relaxed confinement levels, but the trigger condition combines ICU utilization with infection prevalence rates in the population. Specifically, the policy switches to strict confinement if the seven-day incidence rate in the population, defined§§ as

t 6 · t g Ig(·)/ g Ng(0), is greater than a threshold flincidence, and the incidence rate in age groups corresponding ≠ Æ Æ to the population that is 60 y.o. and above t 6 · t g 60 y.o. Ig(·)/ g Ng(0) is greater than a threshold flincidence_60+, q q q ≠ Æ Æ Ø and ICU utilization rate is greater than flICU_utilization. The specific rule is described in Algorithm 5. We optimize over all parameters ¸strict, ¸relaxed, flincidence, flincidence_60+q andqflICU_utilization withq the goal of minimizing the total loss objective in Eq. (38), and we report the performance of the best policy. This is the Hybrid-t AND policy, and it is described in Algorithm 5. We also test a stricter version of this policy that takes the logical or of the three conditions (Hybrid-t OR) instead of taking the and.

Algorithm 5 Hybrid Trigger Policy — Hybrid-t AND.

Require: parameters ¸strict, ¸relaxed, flincidence, flincidence_60+, flICU_utilization Initialize ¸a(0) = ¸ for all a work, school, transport, leisure, other ,g g relaxed œ{ } œG for t 1 ...T 1 , g do œ{ ≠ } œG if t 6 · t g Ig(·)/ g Ng(0) >flincidence AND t 6 · t g 60y.o. Ig(·)/ g Ng(0) >flincidence_60+ AND ≠ Æ Æ ≠ Æ Æ Ø 1 ICU 2 1 2 g ICUqg(t)/K q(t) >flICU_utilizationq then q q q Set ¸a(t)=¸ for all a work, school, transport, leisure, other ,g 1q g strict œ{ 2 } œG else Set ¸a(t)=¸ for all a work, school, transport, leisure, other ,g g relaxed œ{ } œG

• Fully open — FO. This corresponds to the normal conditions where ¸a(t)=1for all a home, work, school, transport, leisure, other , g œ{ } g and t 0,...,T 1 . œG œ{ ≠ } a • Full confinement — FC. In this policy, all activities except home are fully restricted. That is, we set ¸g (t)=0for all a work, school, transport, leisure, other , g ,t 0,...,T , and ¸home(t)=1for all g and t 0,...,T 1 œ{ } œG œ{ } g œG œ{ ≠ } Benchmark policies for testing. We compare the testing decisions of ROLD against a proportional testing policy, which allocates testing proportionally to each age group’s census population, up to capacity.

6. Experimental and Optimization Setup We run experiments over a range of values for the parameters of our model. Table S7 summarizes the values we use for each parameter in our experimental setup, as well as the details of the optimization setup. The “Baseline Value” column reports the values of the parameters used in our baseline setting, as reported in results in the main paper and the SI, unless specified otherwise. We also run sensitivity analysis on the calibrated parameters: we run experiments over a range for each of the fitted parameters, as reported in Table S8. We make some further comments on ICU and hospital capacity, as well as testing capacity. For the ICU capacity, we do a sensitivity analysis using values that range up to 3200 beds. We note that the ICU capacity in Île-de-France started at 1,200 beds (21) and was enhanced during the crisis, with a reported peak ICU utilization during the spring of 2020 of 2,668 beds, on April 8 (22). In our experiments we set the capacity for general ward beds to infinity. When we run experiments with viral testing, we consider a baseline capacity of 60,000 tests daily for Île-de-France (23, reported on May 11, 2020). We run experiments for a range of values for the cost of death ‰, accounted as multiples of the GPD per capita of France. For the GDP per capita of France, we use the figure for 2019, converting US dollars to euros using the exchange rate on June 17, 2020.

7. Additional Results

A. Benefits of Dual Targeting. We consider four versions of ROLD that dier in the level of targeting of confinements allowed: no targeting whatsoever (NO-TARGET), targeting age groups only (AGE), targeting activities only (ACT), or targeting both (AGE-ACT). To obtain each of the four variants (NO-TARGET, AGE, ACT, AGE-ACT), we run constrained versions of the optimization problem.

‡‡For reference, we provide here the French rule. For a geographic “department” in France to be declared under maximum alert, three criteria need to be satisfied. First criterion: the incidence rate, i.e. the number of positive cases per 100,000 inhabitants over seven days, must be greater than 250 per 100,000 inhabitants. Second criterion: the incidence rate among those over 65, the most vulnerable population, must exceed 100 cases per 100,000 inhabitants. Third criterion: more than 30% of resuscitation beds must be occupied by patients with COVID-19. §§For the incidence rate, we count new cases using the Infectious state (and not the Exposed state) of the SEIR model; this is aligned with the assumption that a positive case can be detected through a viral test if and only if the individual is in the Infectious state.

20 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Table S7. Parameter values for experimental and optimization setup

Parameter Description Notation Values in Experiments Baseline Value Cost of death ‰ 121 values in [0, 990] GDP per capita of France 50 ◊ ◊ GDP per capita of France e37199.03 ICU capacity KICU 2000, 2300, 2600, 2900, 3200 2900 Hospital capacity KH Œ Viral testing capacity KVtest 243 values in the range of 0 to 1.2 million daily 0 Sensitivity of econ. value on confinement ‹other activities 0, 0.1, 0.2 0.1 Discount rate r 0.03 1 for g = 0-9 y.o. Fraction going to school fg 0.907 for g = 10-19 y.o. Mult. factor for value of schooling ◊ 0.5, 1, 5 0.5 Starting time for optimization October 21 2020 Optimization horizon T 90, 180, 360 days 90 days { } Frequency of confinement decisions 14 days Frequency of testing decisions 7 days

a a NO-TARGET: ¸ (t)=¸ Õ (t) g, gÕ , a, aÕ , t [72] g gÕ ’ œG ’ œA ’ a aÕ AGE: ¸g (t)=¸g (t) g , a, aÕ , t [73] a a ’ œG ’ œA ’ ACT: ¸ (t)=¸ (t) g, gÕ , a , t [74] g gÕ ’ œG ’ œA ’ We impose no additional constraints for AGE-ACT. All four variants are allowed to change the confinement policy through time. We assume no testing for all four variants. For each variant, ROLD gets initialized at the solution of the gradient descent heuristic with the corresponding constraints. Figure S3a plots the total losses as a function of the cost of death ‰ for each of the four variants. Figure S3b compares the total losses of ROLD with dual targeting (AGE-ACT) against the total losses of the benchmark policies, as a function of the cost of death ‰.

(a) Comparison between four ROLD policies with different levels of targeting (b) Comparison between ROLD with dual targeting and benchmark policies

Fig. S3. Total losses generated by each policy at different values of the cost of death ‰. (a) The impact of targeting confinements by age group and activity. The figure depicts the performance of four versions of ROLD that differ in the level of targeting allowed: no targeting whatsoever (NO-TARGET), targeting age-groups only (AGE), targeting activities only (ACT), or targeting both (AGE-ACT). The performance is calculated for different values of the parameter ‰ corresponding to the cost of death. A total of 128 values of ‰ are tested, ranging from 0 to 990 times the annual GDP per capita. (b) Comparison of the ROLD policy that targets age groups and activities with the benchmark policies. A total of 128 values of ‰ are tested, ranging from 0 to 990 times the annual GDP per capita. In both panels, all policies implement zero viral tests, so only confinement decisions are optimized and compared.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 21 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

AGE-ACT not only Pareto-dominates AGE, ACT, and NO-TARGET, but also leads to super-additive improvements in almost all cases: for the same number of deaths, AGE-ACT reduces economic losses (with respect to NO-TARGET) by more than AGE and ACT added together. Furthermore, in almost all cases, for a given ‰, AGE-ACT reduces total losses (with respect to NO-TARGET) by more that AGE and ACT added together. Figure S4 visualizes these comparisons.

(a) The difference between the improvement in economic losses from ROLD AGE- (b) The difference between the improvement in total losses from ROLD AGE-ACT ACT and the sum of the improvements from ROLD AGE and ROLD ACT, for the and the sum of the improvements from ROLD AGE and ROLD ACT, for the same same number of deaths value of ‰

Fig. S4. The super-additivity of ROLD AGE-ACT. The figures compare the improvement from AGE-ACT with the sum of the improvements from AGE and ACT. All improvements are with respect to NO-TARGET. Panel (a) compares the improvements in economic losses for the same number of deaths. Panel (b) compares the improvements in total losses for the same cost of death ‰.

B. Robustness Analyses. We analyze additional problem instances by changing the value of each of 13 estimated parameters within a sensitivity range, as shown in Table S8. For each parameter, we sample 40 values uniformly at random from its specified sensitivity range. In each problem instance, one parameter is changed from its estimated value, for a total of 13 40 = 520 ◊ problem instances.

Table S8. Robustness analysis: parameters and sensitivity analysis ranges

Parameter Estimated Value Sensitivity Range

Multiplier m— 0.60 0.45-0.75

Multiplier m ICU 1.66 1.51-1.81 pg

Multiplier m D 1.13 0.98-1.28 pg

Multiplier m⁄H 1.50 1.35-1.65

Multiplier m⁄ICU 2.00 1.85-2.15

Weight for activities –other 0.96 0.76-1.0 School activity ¸school Mar 17 - May 10 2020 0.0 0.0-0.2 School activity ¸school May 11 - Jun 30 2020 0.0 0.0-0.2 School activity ¸school Jul 1 - Aug 31 2020 0.2 0.0-0.4 School activity ¸school Sep 1 - Oct 21 2020 1.0 0.8-1.0

Social mixing –1 = –2 0.39 0.19-0.59 work Work activity ¸April April 2020 0.213 0.113-0.313 Economic activity level April 2020 58.51% 48.51%-68.51%

Figure S5 shows robustness results for six values of the economic cost of death ‰: [0, 15, 25, 50, 100, 150] the annual GDP ◊ per capita in France. The shown boxplots summarize results over the 520 problem instances, for each value of ‰. These results reinforce our findings on the gains of dual targeting, as well as the observation that dual targeting unlocks complementarities which may not be available under targeting age groups or activities separately.

C. Testing and Quarantining. Value of targeting testing. We next assess the value that can be generated by targeting testing based on age groups, with subsequent quarantining of positive cases. We compare the performance of two testing policies, while keeping all activities open, and varying the testing capacity: targeting testing to dierent age groups using ROLD, versus allocating testing proportionally to the census population of each age group. Figure S6 reports the performance of these two testing policies. Figure S7 reports the dierence in deaths, economic losses, and total losses between targeted testing and proportional testing.

22 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

(a) Comparison between ROLD policies with different levels of targeting in terms of (b) The difference between the improvement in total losses from ROLD AGE-ACT total losses and the sum of the improvements from ROLD AGE and ROLD ACT

Fig. S5. Robustness analyses showing the superiority of ROLD AGE-ACT over ROLD policies with less granular targeting, and the super-additive improvements of ROLD AGE-ACT over the sum of the improvements of ROLD AGE and ROLD ACT, for different values for the cost of death ‰, over a wide set of problem instances. All improvements are with respect to ROLD NO-TARGET. For each value of ‰, the boxplots summarize results over 520 different problem instances.

At a cost of death ‰ =50 the annual GDP per capita, the value of targeted testing with a capacity of 60,000 daily tests is ◊ an improvement by EUR 2.9B (7.8%) in terms of total losses (1488 or 7.8% less deaths; EUR 0.1B or 8.3% less economic loss). The improvement in deaths and in the total loss due to targeting testing would be maximized (3210 or 26.7% less deaths, EUR 6.3B or 26.9% less total losses) with a daily testing capacity of around 238,700 tests, corresponding to more than 1.9% of the Île-de-France population. This is more than four times larger than the real testing capacity in Île-de-France (24). Figures S6 and S7 report results for a wide range of testing capacities. We note that for targeted testing, the improvement in total losses from higher testing capacity starts slowing down around 230,000 daily tests, as shown in Figure S6; that’s also the testing capacity at which the dierence between targeted and proportional testing peaks, as shown in Figure S7. At that testing capacity, there is no more infection in the community. There are still benefits from a larger testing capacity, as the point in time at which the infection is killed would come sooner, however the benefits are diminishing. For proportional testing, the improvement in total losses from higher testing capacity starts slowing down around 400,000 daily tests, as shown in Figure S6. First, it makes sense that the proportional testing curve would start slowing down for a larger testing capacity than the targeted testing curve. Second, that’s also the testing capacity at which the drop in the dierence between targeted and proportional testing tapers o, as shown in Figure S7. At large testing capacities, for which the infection is killed both for targeted testing and for proportional testing, targeted testing is still able to extract more value from abundant testing capacity than proportional testing.

Fig. S6. Comparison of the performance of testing-and-quarantining-only interventions (with ROLD targeted testing vs. with proportional testing), and confinement-only interventions (with ROLD confinement decisions), for different testing capacities, and for different levels of effective confinement time. The experiments are run using ‰ =50 ◊ the annual GDP per capita and an ICU capacity of 2900 beds. The number of values tested is 243 and 541 values of testing capacity for the targeted and proportional testing policies respectively, and 99 values of the percentage of total effective time in confinement for the confinements-only policy. The range for the number of tests is from 0 tests to 10% of the population of Île-de-France daily. (The population of Île-de-France is 12278210.)

Value of confinement versus value of testing. We use our framework to compare (a) an intervention that only uses confinement and no testing to (b) an intervention that only uses testing and subsequent quarantining while keeping all

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 23 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

(a) Difference in deaths (b) Difference in econ. losses (c) Difference in total losses

Fig. S7. Comparison of targeted testing using ROLD versus proportional allocation of testing to age groups, in terms of deaths, economic losses, and total losses. The experiments are run while keeping all activities open, and using ‰ =50 the annual GDP per capita and an ICU capacity of 2900 beds. The number of data points in each ◊ plot is 100. activities open. For the confinement-only intervention, we constrain ROLD to use a specified “budget” of total eective time in confinement. For the testing-only intervention, we constrain ROLD to use a specified testing capacity. We run experiments where we vary the eective confinement time for the confinement-only policy, as well as the testing capacity for the testing-only policy, and report the performance of ROLD. In both cases we allow ROLD to target its decisions to dierent age groups. Note that for the confinement-only policy, we do not allow ROLD to target activities, so as to keep the comparison between the confinement intervention and the testing intervention fair: both can only target age groups. For the testing-and-quarantining-only intervention, we run ROLD assuming no confinement, for a range of testing capacities, allowing for optimized allocation of tests to dierent age groups. We run ROLD for a wide range of daily testing capacity KVtest, from no testing up to testing 10% of the population daily. We also run ROLD assuming no testing, allowing for optimized targeting of confinements to dierent age groups, subject to a constraint of “total confinement time”. To model total confinement time, we add the following constraint to ROLD:

T 1 ≠ a (“Total confinement time”) ¸ (t)Ng(0) (1 plock) T Ng(0), a . [75] g Ø ≠ · ’ œA t g g ÿ=0 ÿ ÿ where p is the percentage of time spent in confinement. We provide numerical results for a range of values of p [0, 1]. lock lock œ Figure S6 reports the comparison in the performance (in terms of the total loss objective) between the confinement-only and testing-only interventions, for a cost of death ‰ =50 the annual GDP per capita. The plot shows that all the benefits ◊ from confinement can be accomplished with a total eective confinement time that corresponds to 19.2% of the total horizon window, or 17.3 days of eective confinement time within a 90-day horizon, as long as the confinement is optimally targeted to dierent age groups. To attain the same total loss (EUR 8.6B) without confinement, an extraordinary amount of (targeted) testing would be needed: more than 632,500 tests would need to be administered daily, which corresponds to more than 5.1% of the Île-de-France population. The plot shows that testing without confinement can in theory reach performance levels that are unattainable by confinement without testing, however this only happens when the daily testing capacity exceeds 5.1% of the Île-de-France population. This is about eleven times larger than the real testing capacity in Île-de-France (24).

D. The Proposed Policy. Comments on the simplicity of the policy. We make some other observations about the simplicity of the optimized ROLD policy: it relies on only a few dierent values for activity levels, and it relaxes confinements as we get closer to the end of the optimization horizon. Furthermore, at any fixed time, as the cost of death ‰ increases, there is generally a nestedness in the confinement decisions: if a certain age group is confined in a particular activity at a value of ‰, that group will face a confinement that is at least as strict at a larger value of ‰. The proposed policy with testing. We allow ROLD to also optimize for testing decisions. Figure S8a shows the ROLD confinement and testing policy, for a cost of death ‰ =50 and a 90-day optimization horizon, under a daily testing capacity ◊ of 60,000 tests. ROLD cleverly mixes confinement decisions and targeted testing. For example, it distributes the testing capacity to the 40-69 age groups, which are also kept open in work. Intuitively, this is because the work activity generates more social contacts (and thus new infections) than other activities. Moreover, of all the age groups (10-69) that are active in work, although the 40-69s do not have the largest number of social contacts, they do have the most contacts with the high risk 70+ age groups, so most of the testing capacity is focused on them. In other words, the tests are used to detect and quarantine the individuals which are most likely to create new infections in high risk individuals through their social contacts in open activities. Adding in testing can substantially change the confinement policy by permitting less total confinement of the population. Figure S8a show that the testing reduces the overall confinement in work, leisure and other. Also, although testing is mostly concentrated in the 40-49 y.o. and 50-59 y.o., it also serves to reduce confinement in some groups which are not tested, such as the confinement in work for the 10-19 y.o. group.

24 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

(a) ‰ =50 the annual GDP per capita (b) ‰ =150 the annual GDP per capita ◊ ◊

Fig. S8. The optimized ROLD confinement and testing decisions and their impact on hospitalizations, infections, and deaths, for different values of the cost of death, for a 90-day optimization horizon, with a testing capacity of 60,000 daily tests.

Dependence of the proposed policy on problem parameters. ROLD decisions generally depend quite strongly on the specific problem parameters, such as the cost of death ‰. We make a few other observations on how the ROLD policies depend on critical problem parameters. Cost of death ‰: At any fixed time, as ‰ increases there is generally a nestedness in the confinement decisions followed by ROLD: if a certain age group is confined in a particular activity at a fixed time for a value of ‰, that group will face a confinement that is at least as strict in that activity, at that time, for a larger value of ‰. As a comparison to the policy for ‰ =50 , Figure S8b reports the decisions produced by ROLD for a cost of death ‰ =150 the annual GDP per capita, using ◊ ◊ an optimization horizon of T =90days. This property can be very useful for decision making, especially in cases where a policy maker cannot or prefers not to assign a specific value for ‰, but instead is comfortable providing a range for ‰. It is important to note that this property of nested confinements does not immediately follow, e.g., as ‰ increases and a policy decides to impose more strict confinements on specific age groups, it could relax the confinements of other groups. Time horizon T : In order to understand the eect of the time horizon, we also run experiments for an optimization horizon of T =360days. These reinforce the behavior observed for T =90days. Figure S9 reports the decisions produced by ROLD for an optimization horizon of T =360days, using a cost of death ‰ =50 the annual GDP per capita. Comparing the policy ◊ for T =360days with the policy for T =90days, we see that for T =360days the confinement policy is not monotonic over time. However, the policy still relaxes confinements as we get close to the end of the horizon. Furthermore, the confinement for a fixed (age group, activity) pair and time t will generally be at least as strict as the optimization horizon increases. Exceptions to this general “rule” can occur at early times of a long optimization horizon.

E. Additional Details for the Regression Trees for the ROLD Policy. To understand how the ROLD policies target confinements across dierent age groups and activities, we train an explainable machine learning model – a regression decision tree – to predict the optimal ROLD confinement in each activity as a function of several features. To build a training set, we first create a larger set of problem instances by considering dierent values for the cost of death (‰), the relative importance of each activity in generating economic value (captured by the parameters ◊, ‹work, ‹other activities), and the ICU capacity. All instances use a horizon of T =90days, adopted to reduce the computational burden of the

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 25 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Fig. S9. The optimized ROLD confinement and testing decisions and their impact on hospitalizations, infections, and deaths for a 360-day optimization horizon, in a problem instance with cost of death ‰ =50 and a testing capacity of 60,000 daily tests. ◊ experiments. For each problem instance, we compute the optimal ROLD activity levels, i.e., the decisions ¸a(t) for g ,a , 0 t g œG œA Æ Æ T 1, which we then use to simulate the SEIR dynamics and calculate all the corresponding values of the SEIR states. Lastly, ≠ we create a set of training set samples — with one sample for each time period t, each age group g, and each activity a relevant to that age group — where we include several direct and derived features based on the parameter values characterizing the a instance and the induced SEIR states, and a target corresponding to the optimal ROLD decision ¸g (t). Specifically, we include the following as features:

other work life • the activity name a, the age group g, the parameters ‰, ‹ , ‹ , vg , the ICU capacity, the time t

a • "econ-gradient(g)": the gradient of the economic value with respect to the activity level ¸g (t)

• "I-perc(t)": the percentage of the population estimated by the model to be in the infectious state at time t,i.e., q g Ig(t)/[ g Ng(t)+Rg(t)] œG œG q q 26 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

• "R-perc(t)": the percentage of the population estimated by the model to be in the recovered state at time t,i.e., q g Rg(t)/[ g Ng(t)+Rg(t)] œG œG q q • "H-perc(t)": the percentage of the population in the hospital due to COVID-19 at time t,i.e., g Hg(t)/[ g Ng(t)+ q œG œG Rg(t)] q q • "ICU-utilization(t)": percentage of ICU beds occupied at time t

a • "contacts-received(g)": the number of social contacts received by group g from other groups in activity a,i.e., h Cg,h œG q • "contacts-received-into-deaths(g)": contacts received by group g multiplied by their probability of dying conditional on a D being infected, i.e., h Cg,h pg ; œG · q • "infections-received(g)": contacts received by group g multiplied by the probability of those contacts coming from infected a q people, i.e., h Cg,h Ih(t)/[ g Nh(t)+Rh(t)]; œG · œG q q • "deaths-received(g)": "infections-received(g)" multiplied by the group g’s probability of dying conditional on being infected a q D by those contacts, i.e., h Cg,h Ih(t)/[ g Nh(t)+Rh(t)] pg ; œG · œG ·

q q life • "deaths-econ-received(g)": "deaths-received(g)" multiplied by the economic value of the losses of those deaths ‰ + vg , a q D life i.e., h Cg,h Ih(t)/[ g Nh(t)+Rh(t)] pg [‰ + vg ]; œG · œG · ·

q q a • "contacts-given(g)": number of social contacts given to other groups by group g in the specific activity a,i.e., h Ch,g; œG q • "contacts-given-into-deaths(g)": the sum of terms of the form Ch,g multiplied by group h’s probability of dying conditional a D on being infected, i.e., h Ch,g ph ; œG · q • "infections-given(g)": the number of "contacts-given(g)" multiplied by the fraction of the population in age group g being a q infected, i.e., h Ch,g Ig(t)/[ g Ng(t)+Rg(t)]; œG · œG q q • "deaths-given(g)": infections-given(g) multiplied by the probability of dying conditional on being infected for each age a q D group, i.e., h Ch,g Ig(t)/[ g Ng(t)+Rg(t)] ph ; œG · œG ·

q q life a • "deaths-econ-given(g)": "deaths-given(g)" multiplied by the economic cost of those deaths ‰ + vh ,i.e., h Ch,g q D life œG · Ig(t)/[ g Ng(t)+Rg(t)] ph [‰ + vh ]; œG · · q

q D • "death-prob(g)": group g’s probability of dying conditional on being infected pg ;

• "econ-contacts-ratio": the ratio of "econ-gradient(g)" and "contacts-given(g)".

This results in a training set of 27,720 unique samples for each of the work, leisure and other activities. Using this data, we then use the scikit-learn Python library to train separate, activity-specific regression decision trees to predict the optimal ROLD confinement decisions in the respective activity as a function of the considered features. We train trees with dierent maximum depth ranging from three to six, and using the traditional mean-squared-error (MSE) criterion as a goodness of fit metric. The trees reported in the main body of the paper were all trained with a depth of three and using the econ-to-contacts-ratio, the cost of death ‰, the age, and time as features. Other than confirming that the econ-to-contacts ratio is the most salient feature when targeting confinements, the trees also confirm that time and the cost of death ‰ play an important role: the optimal ROLD policy tends to enforce stricter confinements in each activity in earlier periods and subsequently relax these through time, and the confinements become stricter for higher values of ‰. In addition, we also report in Figure S10 trees of depth three trained using all the features described above. Note that the "econ-to-contacts-ratio" continues to be the most relevant feature, used as a split variable in the root node of each activity tree, and also subsequently used for splits in several sub-trees.

F. Can Dual Targeting Reduce Time in Confinement for Each Age Group?. We calculate the fraction of time each age group spends in confinement under each ROLD policy, averaged over the activities relevant to that age-group. Specifically, these are

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 27 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

samples = 9,240 activity level = 0.00% samples = 1,575 samples = 7,875 activity level = 8.25% activity level = 0.00% samples = 1,520 Decision tree for "work" activity level = 5.71% Decision tree for "leisure" RMSE: 0.17235 contacts-received <= 4.3 time <= 63.0 samples = 630 RMSE: 0.21671 econ-gradient <= 215.9 time <= 49.0 True samples = 11,445 samples = 2,205 samples = 1,140 activity level = 100.00% True samples = 10,535 samples = 2,660 activity level = 6.64% activity level = 34.46% activity level = 86.79% econ-to-contacts-ratio <= 166.5 activity level = 10.22% activity level = 40.46% econ-to-contacts-ratio <= 113.3 samples = 27,720 samples = 1,527 activity level = 55.85% samples = 27,720 samples = 4,250 R-perc <= 0.2 econ-to-contacts-ratio <= 765.4 activity level = 31.26% activity level = 56.67% deaths-econ-given <= 1039.6 econ-to-contacts-ratio <= 213.8 activity level = 73.46% False samples = 16,275 samples = 3,482 False samples = 17,185 samples = 16,070 activity level = 90.45% activity level = 61.74% samples = 1,955 activity level = 85.14% activity level = 90.20% samples = 11,820 activity level = 85.56% activity level = 96.22%

econ-to-contacts-ratio <= 305.9 R-perc <= 0.2 samples = 718 samples = 1,074 samples = 12,793 samples = 1,115 activity level = 73.40% activity level = 8.92% activity level = 98.26% activity level = 12.27%

samples = 12,075 samples = 41 activity level = 99.74% activity level = 100.00%

Decision tree for "other" RMSE: 0.10849 samples = 17,635 activity level = 99.34% True samples = 7,875 activity level = 0.00% econ-to-contacts-ratio <= 172.5 deaths-econ-given <= 326.6 samples = 27,720 samples = 18,663 samples = 1,028 activity level = 68.25% deaths-econ-given <= 487.1 activity level = 98.74% activity level = 88.46% False samples = 19,845 activity level = 95.33% econ-to-contacts-ratio <= 399.3 samples = 534 samples = 1,182 activity level = 10.12% activity level = 41.51%

samples = 648 activity level = 67.37%

Fig. S10. Decision trees for work, leisure and other activities trained on the optimal ROLD policy for different values of problem parameters. Darker shading in a node corresponds to lower activity level decisions for samples in that node. Each tree is trained with a total of 27,720 unique samples. defined as:

T 1 1 ¸a(t), if g =0-9y.o. [76] ≠ 4T g t=1 a school,leisure,other,transport ÿ œ{ ÿ } T 1 1 ¸a(t), if g =10-19y.o. [77] ≠ 5T g t=1 a work,school,leisure,other,transport ÿ œ{ ÿ } T 1 1 ¸a(t), if g =20-29,30-39,40-49,50-59,60-69y.o. [78] ≠ 4T g t=1 a work,leisure,other,transport ÿ œ{ ÿ } T 1 1 ¸a(t), otherwise. [79] ≠ 3T g t=1 a leisure,other,transport ÿ œ{ ÿ }

Further observations. We make some further comments about the results shown in Figure 3 in the main paper. It is worth noting that although ROLD AGE-ACT reduces confinements for every age group compared to less targeted policies, it does not do so uniformly, and it can lead to an even larger discrepancy in the confinements faced by dierent age groups: those aged 10-59 are generally more confined than those aged 0-9 or 60 and above. Such disparate treatment is non-existent in the NO-TARGET policy by definition, and is less apparent in the AGE and ACT policies. Interestingly, while AGE generally exhibits a similar discrimination profile to AGE-ACT, ACT exhibits an almost opposite profile, with more confinement applied to those aged 70 and above. This is explainable by considering again the “econ-to-contacts” ratio: for instance, since leisure is responsible for the smallest (population-weighted average) econ-to-contacts ratio among all activities, it is the first activity that ACT targets for confinement, and this disproportionately impacts the groups aged 70 and above, who are active in fewer activities overall than other age groups and have leisure as one of their activities.

G. Limited Disparity Requirements. We impose limited disparity constraints on confinement decisions across pairs of age groups, for a given activity. For work, we impose constraints for age groups 10-19 y.o., . . . , 60-69 y.o. For transport, leisure and other, we impose constraints for all age groups. No constraints are imposed for home and for school. Given a disparity tolerance 0, we formalize the constraints as follows: Ø

work work ¸ (t) ¸ (t)+, t 0,...,T 1 , (g, gÕ):g = gÕ,g,gÕ / 0-9 y.o., 70-79 y.o., 80+ y.o. g Æ gÕ ’ œ{ ≠ } ’ ” œ{ } work work ¸ t ¸ t , t ,...,T , g, gÕ g gÕ,g,gÕ / g ( ) gÕ ( ) 0 1 ( ): = 0-9 y.o., 70-79 y.o., 80+ y.o. a Øa ≠ ’ œ{ ≠ } ’ ” œ{ } ¸ t ¸ t , t ,...,T , a , , , g, gÕ g gÕ g ( ) gÕ ( )+ 0 1 leisure transport other ( ): = a Æ a ’ œ{ ≠ } ’ œ{ } ’ ” ¸ (t) ¸ (t) , t 0,...,T 1 , a leisure, transport, other , (g, gÕ):g = gÕ. g Ø gÕ ≠ ’ œ{ ≠ } ’ œ{ } ’ ” Smaller values of imply more stringent non-disparity constraints. We provide numerical results for a range of values of in [0, 1].

28 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

8. Further Calculations and Algorithmic Details for ROLD

A. Capping the Turn-away Variables. Because our controls ut include decisions on how many people are turned away for each group from the hospital and ICU, we need to guarantee that when calling Ft(Xt, ut), the turn-away decisions are feasible,i.e., that they satisfy the conditions in Eq. (25), Eq. (27). In order to ensure this, we perform a modification of the controls that accomplishes two goals:

1. Cap all turn-away decisions to the incoming flow to the hospital (ICU).

2. Increase the turned away proportionally to the excess contribution of each group if necessary to respect the capacity.

H H pg q To accomplish the first goal, we simply cap all the controls by the inflow, µ pg Ig(t)+ H ICU Iss,g(t) (and equivalently · pg +pg · H ICU H for ICU). To accomplish the second goal, we define new controls Bg (t) and1Bg (t), based on the given controls2 B˜g (t) and ICU ¶¶ B˜g (t),as:

H H pg q H µ p Ig t I t B˜ t g ( )+ pH +pICU ss,g( ) g ( ) BH t B˜H t · g g · ≠ g ( )= g ( )+ H [80] 1 H ph q 2 H µ p Ih(t)+ I (t) B˜ (t) h · pH + pICU · ss,h ≠ h h 3 h h 4 ÿ + H H,R H,D H ph q H H 1 ⁄ ⁄ Hh(t)+µ p Ih(t)+ I (t) B˜ (t) K (t) [81] · ≠ h ≠ h h · pH + pICU · ss,h ≠ h ≠ AA h 3 h h 4 B B ÿ ! " ICU ICU pg q ICU µ p Ig t I t B˜ t g ( )+ pH +pICU ss,g( ) g ( ) BICU t B˜ICU t · g g · ≠ g ( )= g ( )+ ICU [82] 1 ICU ph q 2 ICU µ p Ih(t)+ I (t) B˜ (t) h · pH + pICU · ss,h ≠ h h 3 h h 4 ÿ + ICU ICU,R ICU,D ICU ph q ICU ICU 1 ⁄ ⁄ ICUh(t)+µ p Ih(t)+ I (t) B˜ (t) K (t) · ≠ h ≠ h h · pH + pICU · ss,h ≠ h ≠ AA h 3 h h 4 B B ÿ ! " [83]

¶¶To simplify the notation, we set

H,R R H ⁄ = p ⁄ g g · g H,D D H ⁄ = p ⁄ g g · g ICU,R R ICU ⁄ = p ⁄ g g · g ICU,D D ICU ⁄ = p ⁄ . g g · g

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 29 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

B. Deriving the Jacobians. Here, we give the exact formulas for the Jacobians we use for the ROLD procedure.

N,g Vtest Ig(t) H ICU f (Xt, ut):= Ng (t) µ (pg + pg ) Ig(t) [84a] ≠ · Ng(t) ≠ · · N,g ˆf Vtest Ig(t) = Ng (t) 2 [84b] ˆNg Ng (t) N,g Vtest ˆf Ng (t) H ICU = µ (pg + pg ) [84c] ˆIg ≠ Ng(t) ≠ · N,g ˆf Ig(t) Vtest = [84d] ˆNg (t) ≠ Ng(t) [84e]

S,g Ih(t) f (Xt, ut):= —(t)Sg(t) cg,h(¸) q [84f] ≠ · Nh(t)+R (t) Ah h B œG S,g ÿ ˆf Ih(t) — t S t c ¸ , h = ( ) g( ) g,h( ) q 2 [84g] ˆNh · (Nh(t)+Rh(t)) ’ S,g ˆf Ih(t) = —(t) cg,h(¸) q [84h] ˆSg ≠ · Nh(t)+R (t) Ah h B ÿœG S,g ˆf Ih(t) — t S t c ¸ , h q = ( ) g( ) g,h( ) q 2 [84i] ˆR · (Nh(t)+R (t)) ’ h 3 h 4 ˆfS,g 1 = —(t)Sg(t) cg,h(¸) q , h [84j] ˆIh ≠ · Nh(t)+R (t) ’ 3 h 4 S,g ˆf ˆcg,h(¸) Ih(t) = —(t)Sg(t) q , a [84k] ˆ¸g,a ≠ · ˆ¸g,a Nh(t)+R (t) ’ Ah h B œG S,g ÿ ˆf ˆcg,h(¸) Ih(t) = —(t)Sg(t) q , h, a [84l] ˆ¸h,a ≠ · ˆ¸h,a Nh(t)+Rh(t) ’ ’ [84m]

E,g Ih(t) f (Xt, ut):=—(t)Sg(t) cg,h(¸) q ‡Eg(t) [84n] · Nh(t)+R (t) ≠ Ah h B ÿœG ˆfE,g ˆfS,g = [84o] ˆSg ≠ ˆSg ˆfE,g ˆfS,g = , X N,Rq,I , h [84p] ˆXh ≠ ˆXh ’ œ{ } ’ ˆfE,g ˆfS,g = , a [84q] ˆ¸g,a ≠ ˆ¸g,a ’ ˆfE,g ˆfS,g = , h, a [84r] ˆ¸h,a ≠ ˆ¸h,a ’ ’ ˆfE,g = ‡ [84s] ˆEg ≠

30 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

I,g Vtest Ig(t) f (Xt, ut):=‡Eg(t) µ Ig(t) Ng (t) [85a] ≠ · ≠ · Ng(t) ˆfI,g = ‡ [85b] ˆEg ˆfI,g N Vtest(t) = µ g [85c] ˆIg ≠ ≠ Ng(t) I,g ˆf Ig(t) N Vtest t = g ( ) 2 [85d] ˆNg (Ng(t)) I,g ˆf Ig(t) Vtest = [85e] ˆNg (t) ≠ Ng(t) [85f] R,g H ICU f (Xt, ut):=µ(1 p p )Ig(t) [85g] ≠ h ≠ g R,g ˆf H ICU = µ(1 pg pg ) [85h] ˆIg ≠ ≠ [85i]

I,j,g Vtest Ig(t) q f (Xt, ut):=pj,gNg (t) µIj,g(t), j as, ps, ms, ss [85j] Ng(t) ≠ ’ œ I,j,g Vtest ˆf Ng (t) = pj,g [85k] ˆIg Ng(t) I,j,g ˆf Ig(t) p N Vtest t = j,g g ( ) 2 [85l] ˆNg ≠ (Ng(t)) ˆfI,j,g q = µ [85m] ˆIj,g ≠ I,j,g ˆf Ig(t) Vtest = pj,g [85n] ˆNg (t) Ng(t) [85o]

Rq,g q H,R ICU,R f (Xt, ut):=µ I (t)+⁄ Hg(t)+⁄ ICUg(t) [85p] · j,g g · g · j a,ps,ms œ{ } q ÿ ˆfR ,g q = µ, j [85q] ˆIj,g ’ Rq,g ˆf H,R = ⁄g [85r] ˆHg Rq,g ˆf ICU,R = ⁄g [85s] ˆICUj,g [85t]

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 31 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

H H,g H,R H,D H pg q H f (Xt, ut):= ⁄g + ⁄g Hg(t)+µ pg Ig(t)+ H ICU Iss,g(t) Bg (t) [86a] ≠ · · pg + pg · ≠ 3 4 H,g ˆf ! H,R H,D " = ⁄g + ⁄g [86b] ˆHg ≠ H,g ˆf ! H " = µ pg [86c] ˆIg · H,g H ˆf pg = µ H ICU [86d] ˆIss,g · pg + pg ˆfH,g H = 1 [86e] ˆBg ≠ [86f] ICU ICU,g ICU,R ICU,D ICU pg q ICU f (Xt, ut):= ⁄g + ⁄g ICUg(t)+µ pg Ig(t)+ H ICU Iss,g(t) Bg (t) [86g] ≠ · · pg + pg · ≠ 3 4 ICU,g ˆf ! ICU,R ICU,D " = ⁄g + ⁄g [86h] ˆICUg ≠ ICU,g ˆf ! ICU " = µ pg [86i] ˆIg · ICU,g ICU ˆf pg q = µ H ICU [86j] ˆIss,g · pg + pg ˆfICU,g ICU = 1 [86k] ˆBg ≠ [86l] D,g H,D ICU,D H ICU f (Xt, ut):=⁄ Hg(t)+⁄ ICUg(t)+B (t)+B (t) [86m] g · g · g g D,g ˆf H,D = ⁄g [86n] ˆHg D,g ˆf ICU,D = ⁄g [86o] ˆICUg ˆfD,g H =1 [86p] ˆBg ˆfD,g ICU =1. [86q] ˆBg

Partials of cg,h with respect to ¸ for the multiplicative mixing model:

a –1+–2 1 ( ) a ≠ (–1 + –2)Cg,h ¸g if u = g = h ˆcg,h(¸g, ¸h) a –1· 1 ( ) a ≠ a –2 a = –1Cg,h ¸g (¸h) if u = g = h [87a] ˆ¸u Y (a) · a –1 ! a "– 1 ” _– C ¸ (¸ ) 2≠ if g = h = u ] 2 g,h · ! g " h ” C. Calculation of Objective Coefficients. [_ ! "

C.1. Recursion for the Objective Coefficients. Algorithm 6 calculates the coecients of each decision ut.

C.2. Calculation of M, “ and ÷. We recall the objective

T 1 ≠ q V vg (¸(t)) (Sg(t)+Eg(t)+Ig(t)+Rg(t)) + vg(1) R (t) ≠ · · g t=0 g ÿ ÿœG ! " life + v + ‰ Dg(T ), g · g ÿœG ! " as well as the economic model: schooling vg (¸) if g =0-9y.o. employment schooling vg (¸)+vg (¸) if g =10-19y.o. vg(¸):=Y vemployment(¸) if g =20-29,30-39,40-49,50-59,60-69y.o. _ g ]0 otherwise, _ [_ 32 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

Algorithm 6 CalculateObjectiveCoefficients algorithm.

Initialization: X¯k := Xk for t = k, k +1,...,T 1 do ≠ Calculate coecients for u· in the period t summand of Eq. (52)as:

et€, for · = t [88] dt€A¯t,· B· , for · = t 1,t 2,...,k ; ≠ ≠ where I, for · = t 1 A¯t,· := ≠ [89] A¯t,· A· , for · = t 2,t 3,...,k. ; +1 +1 ≠ ≠

for t = T do Calculate coecients for u· in the period T summand of Eq. (52)as:

dT €A¯T,·B· , for · = T 1,T 2,...,k [90] ≠ ≠ where I, for · = T 1 A¯T,· := ≠ [91] A¯T,· A· , for · = T 2,T 3,...,k. ; +1 +1 ≠ ≠

Sum up the terms attributed to a common u· :

T

u· = e· € + dt€A¯t,· B· . [92] t · =ÿ+1 return Coecients for u· for each · = k, k +1,...,T 1 ≠

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 33 of 35 medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

where

other leisure transport employment work work other activities 1 ¸h + ¸h + ¸h fixed v (¸):=wg ‹ ¸ + ‹ + ‹ g · · g · 3 A h B |G| œG work work ÿ =¸g wg‹ [93]

a other activities 1 + ¸ wg‹ [94] Q hR 3 h a other,leisure,transport |G| ÿœG œ{ ÿ } a fixed b + wg‹ , [95] and

schooling employment school v (¸):=◊ fg ”g v (1) ¸ . g · · · 20-29 y.o · g From the equations above, we can write M (where the rows are indexed by the controls and the columns indexed by the compartments) as

work work work work work wg‹ , if g =10-19,...,60-69y.o. M[¸g ,Sg]=M[¸g ,Eg]=M[¸g ,Ig]=M[¸g ,Rg]= ≠ 0, otherwise. ; w ‹other activities ≠ g a a a a 3 , if g =10-19,...,60-69y.o.,h , a other, leisure, transport M[¸h,Sg]=M[¸h,Eg]=M[¸h,Ig]=M[¸h,Rg]= |G| œG œ{ } I0, otherwise. employment school school school school ◊fg”gv20-29 y.o (1), if g =0-9,10-19y.o. M[¸g ,Sg]=M[¸g ,Eg]=M[¸g ,Ig]=M[¸g ,Rg]= ≠ 0, otherwise. ; M[ , ]=0, otherwise. · · Similarly, we can write “ (indexed by the compartments for each group) as

fixed wg‹ , if g =10-19,...,60-69y.o. “[Sg]=“[Eg]=“[Ig]=“[Rg]= ≠ [96] 0, otherwise. ; q “[R ]= vg(1), for g [97] g ≠ œG “[ ]=0, otherwise. [98] · Finally, we have € ÷ = 0, , 0, (vlife + ‰) , 0, , S ··· g ···T compartments of group g W Xg U V œG where the only non-zero values are in the indices corresponding¸ ˚˙ to compartment˝ Dg of each group g.

References 1. L Wille, et al., SOCRATES: an online tool leveraging a social contact data sharing initiative to assess mitigation strategies for covid-19. BMC Res. Notes 13 (2020). 2. G Béraud, et al., The French connection: The first large population-based contact survey in France relevant for the spread of infectious diseases. PLOS ONE 10,1–22(2015). 3. A Mas-Colell, MD Whinston, JR Green, , et al., Microeconomic theory. (Oxford University Press New York) Vol. 1, (1995). 4. D Acemoglu, V Chernozhukov, I Werning, MD Whinston, Optimal targeted lockdowns in a multi-group SIR model, (NBER), Working Paper 27102 (2020). 5. H Salje, et al., Estimating the burden of SARS-CoV-2 in France. Science 38 (2020). 6. Q Bi, et al., Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. The Lancet Infect. Dis. 20 (2020). 7. Z Du, et al., Serial interval of covid-19 among publicly reported confirmed cases. Emerg. Infect. Dis. 26 (2020). 8. O Diekmann, JAP Heesterbeek, MG Roberts, The construction of next-generation matrices for compartmental epidemic models. J. The Royal Soc. Interface 7,873–885(2010). 9. A Perasso, An introduction to the basic reproduction number in mathematical epidemiology. ESAIM: Proc. Surv. 62, 123–138 (2018).

34 of 35 Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis medRxiv preprint doi: https://doi.org/10.1101/2021.03.23.21254155; this version posted June 18, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license .

10. French Government, Données hospitalières relatives à l’épidémie de COVID-19 (https://www.data.gouv.fr/fr/datasets/ donnees-hospitalieres-relatives-a-lepidemie-de-covid-19/)(2020)AccessedOctober21,2020. 11. Google, COVID-19 community mobility report (https://www.google.com/covid19/mobility/)(2020)AccessedOctober21,2020. 12. H Mohammad, Coronavirus : un habitant de Bobigny considéré comme le nouveau “patient zéro” (France Bleu) (2020) Accessed January 9, 2021. 13. INSEE, Estimation de population par département, sexe et âge quinquennal - années 1975 à 2020 (https://www.insee.fr/fr/ statistiques/fichier/1893198/estim-pop-dep-sexe-aq-1975-2020.xls) (2020) Accessed June 19, 2020. 14. INSEE, T404 : Salaire brut en équivalent temps plein, par tranche d’âge simplifiée, catégorie socioprofessionnelle simplifiée et région (https://www.insee.fr/fr/statistiques/fichier/4204500/T404.xls) (2016) Accessed June 19, 2020. 15. INSEE, Au quatrieme trimestre 2019, le taux de chômage passe de 8,5 % a 8,1 % (https://www.insee.fr/fr/statistiques/4309346) (2019) Accessed June 19, 2020. 16. Banque de France, Point sur la conjoncture française à fin avril 2020 (https://www.banque-france.fr/sites/default/files/media/ 2020/06/10/point-conjoncture_avril-2020.pdf) (2020) Accessed June 19, 2020. 17. Banque de France, Point sur la conjoncture française à fin mai 2020 (https://www.banque-france.fr/sites/default/files/media/ 2020/06/11/point-conjoncture-09_juin-2020-20200609.pdf) (2020) Accessed June 19, 2020. 18. INSEE, T402 : Salaire brut en équivalent temps plein, par secteur d’activité, région et département (https://www.insee.fr/fr/ statistiques/fichier/4204500/T402.xls) (2016) Accessed June 19, 2020. 19. D Duque, et al., Timing social distancing to avert unmanageable covid-19 hospital surges. Proc. Natl. Acad. Sci. 117, 19873–19878 (2020). 20. M Lehot, BL Borgne, Covid-19 : ces chires qui montrent que Paris a dépassé le seuil d’alerte maximale depuis le 25 septembre (2020) Accessed October 5, 2020. 21. S Godeluck, Coronavirus : plus de 2.000 patients en réanimation en Île-de-France (Les Echos) (2020) Accessed June 30, 2020. 22. C Sterlé, V Alexandre, S de Livonnière, Coronavirus en Île-de-France : 13 000 malades encore hospitalisés, déjà 40 000 tests positifs (Le Parisien) (2020) Accessed June 30, 2020. 23. Agence Régionale de Santé — Île-de-France, Déconfinement : stratégie de dépistage et recherche de cas-contacts (2020) Accessed June 30, 2020. 24. C Sterlé, Île-de-France : y aura-t-il assez de tests Covid avant Noël ? (https://www.leparisien.fr/societe/sante/ ile-de-france-y-aura-t-il-assez-de-tests-covid-avant-noel-18-12-2020-8414952.php) (2021) Accessed January 28, 2021.

Sergio Camelo, Dragos F. Ciocan, Dan A. Iancu, Xavier S. Warnes, and Spyros I. Zoumpoulis 35 of 35