<<

Measurability of the reproduction number in data-driven contact networks Quan-Hui Liua,b,c, Marco Ajellic,d, Alberto Aletae,f, Stefano Merlerd, Yamir Morenoe,f,g, and Alessandro Vespignanic,g,1

aWeb Sciences Center, University of Electronic Science and Technology of China, Chengdu 611731, Sichuan, People’s Republic of China; bBig Data Research Center, University of Electronic Science and Technology of China, Chengdu 611731, Sichuan, People’s Republic of China; cLaboratory for the Modeling of Biological and Socio-Technical Systems, Northeastern University, Boston, MA 02115; dBruno Kessler Foundation, 38123 Trento, Italy; eInstitute for Biocomputation and Physics of Complex Systems, University of Zaragoza, 50018 Zaragoza, Spain; fDepartment of Theoretical Physics, University of Zaragoza, 50009 Zaragoza, Spain; and gISI Foundation, 10126 Turin, Italy

Edited by Simon A. Levin, Princeton University, Princeton, NJ, and approved October 16, 2018 (received for review June 27, 2018)

The basic reproduction number is one of the conceptual corner- an infectious individual at time t), thus relaxing the hypothesis of stones of mathematical . Its classical definition as a fully susceptible population (10).

the number of secondary cases generated by a typical infected R0 and Tg are mathematically well defined in the early stages individual in a fully susceptible population finds a clear analytical of the epidemic in homogeneous models and are widely used in expression in homogeneous and stratified mixing models. Along predictive approaches (10, 11). However, several studies have with the generation time (the interval between primary and cautioned on the importance of the local contact structures secondary cases), the reproduction number allows for the charac- (e.g., households, extended families, communities) in estimat- terization of the dynamics of an epidemic. A clear-cut theoretical ing the value of R0 (12–17). Theoretical work has explored picture, however, is hardly found in real data. Here, we infer the definition of the reproduction number in models entailing from highly detailed sociodemographic data two multiplex con- community and household structure (18–24), although an oper- tact networks representative of a subset of the Italian and Dutch ational way to compute R0 is still lacking. For more complex populations. We then simulate an process models closely resembling the actual structure of the human pop- on these networks accounting for the natural history of influenza ulations (e.g., accounting for households, schools, workplaces) and calibrated on empirical epidemiological data. We explicitly (25–34), the estimation of R0 mainly relied on the direct count measure the reproduction number and generation time, record- of secondary cases generated by the of the outbreak ing all individual-level transmission events. We find that the index (R0 ) and/or the analysis of the growth rate of the simulated classical concept of the basic reproduction number is untenable . However, both methods have limitations and show in realistic populations, and it does not provide any conceptual marked differences in the estimated values (31, 32). Indeed, understanding of the epidemic evolution. This departure from the index R0 is computed from the analysis of the index case of an classical theoretical picture is not due to behavioral changes and outbreak, which is not necessarily representative of a “typi- other exogenous epidemiological determinants. Rather, it can be cal” infectious individual. In addition to this, growing evidence simply explained by the (clustered) contact structure of the pop- shows that the classic exponential early growth of the epidemic ulation. Finally, we provide evidence that methodologies aimed is an oversimplification often not backed by real-world data (35, at estimating the instantaneous reproduction number can oper- 36). For this reason, statistical methods were developed for the ationally be used to characterize the correct epidemic dynamics analysis of R(t), assuming that the variations of this quantity from data. are mostly due to the impact of the performed intervention strategies and behavioral changes in the population (10, 16, 17, computational modeling | infectious | multiplex networks | 37). Unfortunately, for these methods, disentangling the role of reproduction number | generation time Significance athematical and computational models of infectious dis- Meases are increasingly recognized as relevant quantitative The analysis of real epidemiological data has raised issues of support to epidemic preparedness and response (1–3). Indepen- the adequacy of the classic homogeneous modeling frame- dent of the type of modeling approach, our understanding of work and quantities, such as the basic reproduction number epidemic models is generally tied to two fundamental concepts. in real-world situations. Based on high-quality sociodemo- One is the basic reproduction number R0, which is the average graphic data, here we generate a multiplex network describ- number of secondary cases generated by a typical infectious indi- ing the contact pattern of the Italian and Dutch populations. vidual over the entire course of the in a fully By using a microsimulation approach, we show that, for susceptible population (4). The other is the generation time Tg, epidemics spreading on realistic contact networks, it is not the time interval between the infection time of the infector and possible to define a steady exponential growth phase and a her/his infectees (5, 6). These quantities are the cornerstones basic reproduction number. We show the operational use of of our understanding of basic epidemic models, as they encom- the instantaneous reproduction rate as a good descriptor of pass the condition for the occurrence of an epidemic outbreak the transmission dynamics. (R0 > 1) and the epidemic doubling time. Both the reproduction number and generation time are determined by the biological Author contributions: Q.-H.L., M.A., A.A., S.M., Y.M., and A.V. designed research, characteristics of the pathogen (e.g., probability of transmission performed research, analyzed data, and wrote the paper.y given a contact), the pathogen– system (e.g., timeline of The authors declare no conflict of interest.y pathogen replication inside the host), and the contact patterns of This article is a PNAS Direct Submission.y the population in which the infection spreads (4, 7). The concept This open access article is distributed under Creative Commons Attribution-NonCommercial- of the reproduction number has been extended to stratified mod- NoDerivatives License 4.0 (CC BY-NC-ND).y els (8) and to heterogeneous contact networks (9) to account for 1To whom correspondence should be addressed. Email: [email protected] more complex interaction patterns. Furthermore, the definition This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. of R0 has been generalized by the effective reproduction number 1073/pnas.1811115115/-/DCSupplemental.y R(t) (i.e., the average number of secondary cases generated by Published online November 21, 2018.

12680–12685 | PNAS | December 11, 2018 | vol. 115 | no. 50 www.pnas.org/cgi/doi/10.1073/pnas.1811115115 Downloaded by guest on September 27, 2021 Downloaded by guest on September 27, 2021 tde aetu a oue ntehre ako nlzn the time. analyzing over of number task reproduction harder the the of on dynamics focused for far thus patterns have contact studies compute human operational clear to on a summary, way In data population. the microscale due of challenging fractions of large is lack number reproduction the the to of definition changes the behavioral in and interventions from patterns connectivity i tal. et a Liu to according are indi- compartment Infectious they removed stage. the as to infectious soon move the then as to viduals and proceed they individuals, infected, infectious with through Ess- infection contacts 1C). the acquire (Fig. can scheme individuals susceptible (SIR) entially, removed infectious, susceptible, a (43–45). members, basis household regular (e.g., a contacts on of colleagues) set schoolmates, same the the meet highlights individuals to where tend readily populations, human individuals of clustering strong between typical rep- links This 1B). of characterized (Fig. are resentation distributions degree layers other different four any remarkably The infecting by population]. of the probability of (low) each individual certain [i.e., network a complete has a individual assume in contacts we encodes where layer community, fourth the a accounts and layer workplaces, third in individu- A contacts other school). for with same represents only the contact in node attending/working has every als and teacher (i.e., or school student in one other one the contacts to second represents represents A only household. layer layer linked own her/his is household of and members the representing population 26, nodes in real (15, the node 1A) of Each (Fig. individual 42). community 41, general the 34, in encounters and casual colleagues, work contacts schoolmates, network of members, network household multiplex the between representing weighted layers four a by same made defines the (38–40) spread in effectively individuals can This two infection of settings. copresence the the by which determined through are contacts Dutch and effective Italian The the Methods of and subset (Materials a populations about of of representative patterns multi- each contact two agents, the generate describing to networks data plex sociodemographic detailed use We Results that provided dynamics, of epidemic estimates reliable complex tools understanding valuable the represent for of time over instan- structure the number of contact reproduction analyses taneous natural Bayesian that the show we by Finally, population. explained be strategies, also control inflection or could changes epidemic behavioral that to suggests ascribed often temporal study points, Our complex theory. the classical the given definitions of their dynamics of adequacy measuring the in challenges the highlights R models, null from nontrivial results a determine of colleagues) variability household work between classmates, contacts (e.g., members, patterns contact human of ing such quantities, the basic on measure as data We epidemiological networks. empirical the multiplex of inferred match history to trans- natural calibrated influenza the and simple for a accounting sociodemographic of model progression mission real-world the simulated from We where data. inferred model multiplex networks synthetic transmission high-resolution (25–30). contact two infection on chain) spreads an pathogen transmission the the consider entire of we interactions the all Namely, and of track agents access keeping complete simulated syn- (e.g., have data can detailed we microlevel highly where to a agents, of on population based thetic process transmission infection R 0 (t ee esuytevr ento n esrblt of measurability and definition very the study we Here, h nunalk rnmsindnmc r endthrough defined are dynamics transmission influenza-like The R and ), and 0 , R (t Tg and ), Tg nra-ol ytm.Ti assqetosabout questions raises This systems. real-world in R R (t ihdt-rvnmcoiuainmdl fthe of models microsimulation data-driven with (t Tg ) ) R and and n hwta h eeoeet n cluster- and heterogeneity the that show and , ssillcig n opeitv modeling predictive no and lacking, still is Tg Tg Tg r available. are (t (t oho hc ihydvaefrom deviate highly which of both ), hseiec,cnrse ihthe with contrasted evidence, This ). icse h methodology). the discusses 0,000 500, R 0 , ihtpclvle eotdfriflez nteltrtr (46)]. literature the in influenza for (R reported case values index typical with the repro- of the number that such duction rates transmission to layer-specific linked Moreover, these 33). are set 27, we (26, occurring community) influenza the transmission all in occurring of to transmission 33% linked influenza and 19% are workplaces, all schools, infections in in of influenza occurring 18% all transmission setting, of to household linked transmis- are to the values linked infections literature in are with of infections occurring agreement influenza fractions sion in all the of are that 30% layers way (namely, four a the such transmission in in infection she/he cases calibrated the if is layer, only each nodes school In an between given there. instance, a works For in or infection populations. studies the human transmit cluster- of can the typical agent for accounting contacts and of on), ing so and characteris- specific school, a own or to her/his membership household infectiousness, with individual age, each (e.g., individual, tics specific complex each answer for to Of strategies. control needed effects. of impact are confounding the as avoid models such to questions, refined on) more so distinction hospital- course, and the individuals, compartment, individuals, asymptomatic latent ized and a symptomatic as such between introduction classes, the avoiding other (e.g., possible of transmis- as the period) simple keep as infectious We scheme recovering]. or sion before time stage removal infectious expo- the (the an in time spends of individual amount infectious an nential that [such rate removal the susceptible. S, of for removed; R, time examples infectious; generation I, with chains. and transmission along number simulated model the reproduction (C individual transmission of layers. infection computation workplace the and of are distributions household, edges Degree tation (B) all school, illustration. not the the although of readability in graph, of complete sake the a for Note is visible population. layer synthetic community the of the individuals that 10,000 of subsample a senting 1. Fig. esmlt h rnmsindnmc sasohsi process stochastic a as dynamics transmission the simulate We iulzto ftemlilxntokrepre- network multiplex the of Visualization (A) structure. Model PNAS | eebr1,2018 11, December 0 index | o.115 vol. s13[nagreement [in 1.3 is ) ceai represen- Schematic ) | o 50 no. | 12681

POPULATION BIOPHYSICS AND BIOLOGY COMPUTATIONAL BIOLOGY Finally, without loss of generality, we fix the removal time to 3 d A (15, 47, 48). In SI Appendix, we report all of the details of the transmis- sion model (SI Appendix, section 1.1). In the text, we also report the corresponding analysis of R(t) and Tg(t) for a stochastic implementation of a homogeneous mixing SIR model at the indi- vidual level, where all individuals are identical and are in contact with the same fixed probability. Along with the homogeneous SIR model, we also studied a set of alternative null models with an increasing level of complexity from the homogeneous model to the data-driven model (details are in SI Appendix, sections 1.2–1.5). Null models include annealed (edges are constantly rewired) and quenched (edges are fixed over time) configura- tion models. Results reported in the text refer to the synthetic B population for Italy.

Effective Reproduction Number and Generation Time. The first quantities generally investigated in epidemic models are the inci- dence (of new infections) as a function of time and the associated growth rate of the epidemic. In homogeneous models, the num- ber of new cases increases exponentially at a nearly constant rate r (5, 49, 50) during the early phase of the epidemic. This is not the case in the data-driven model, where we find a non- monotonous behavior: an increasing trend over the initial phase of the epidemic occurs followed by a marked decrease about 20 d before the epidemic peak (Fig. 2A). Such a result is in sharp C contrast with the classic theory, where the epidemic growth rate is expected to slowly and monotonically decrease over time in the early epidemic phase (Fig. 2A). This suggests that, in con- trast to simple SIR models where the basic reproduction number can readily be defined through the relation R0 = 1 + rTg (5), it is difficult to find a proper definition of the basic reproduc- tion number in populations characterized by realistic connectivity patterns. The daily effective reproduction number and generation time can be computed from the microsimulations by keeping track of the exact number of secondary infections generated by each indi- vidual infected at time t in the simulations (Fig. 1C). We find that Fig. 2. Fundamental epidemiological indicators. (A) Mean daily exponen- R(t) increases over time in the early phase of the epidemic, start- tial epidemic growth rate, r, over time of the data-driven and homogeneous ing from Rindex = 1.3 to a peak of about 2.1 (Fig. 2B). In contrast, models. The colored area shows the density distribution of r(t) values 0 obtained in the single realizations of the data-driven model. Results are in the homogeneous model, which lacks the typical structures based on 50,000 realizations of each model. Results are aligned at the epi- of human populations, R(t) is nearly constant in the early epi- demic peak, which corresponds to time t = 0. Inset shows the logarithm demic phase and then rapidly declines before the epidemic peak of the mean daily incidence of new influenza infections over time, which (Fig. 2A) as predicted by classical mathematical epidemiology does not follow a linear trend. (B) Mean R(t) of data-driven and homoge- theory (4). The pattern found in the data-driven model can neous models. The colored area shows the density distribution of R(t) values also be partly explained by the variation of the average degree obtained in the single realizations of the data-driven model. (C) The three induced by the infection of individuals with a higher number lines represent the mean Tg(t) of data-driven and homogeneous models. of adequate contacts [an effect already observed in heavy-tailed The colored area shows the density distribution of Tg(t) values obtained networks (51)], thus leading to an average growth of the repro- in the single realizations of the data-driven model. The horizontal dotted gray line represent the constant value of the duration of the infectious duction number. The temporal dynamics of R(t) does not show period. a constant phase, implying that R0 loses its meaning as a funda- mental indicator in favor of R(t). Although we report the results averaged over 50,000 realizations of the model, this result is sup- of the deviations of R(t) and Tg(t) from the classical theory ported by our analysis of the outcome of each single simulation, (Fig. 3). Specifically, we found that the average degree of infec- highlighting that the early time increase of R(t) is a common pat- tious nodes as well as R(t) tend to peak in the workplace layer tern in the performed simulations (SI Appendix, section 2.1). In (SI Appendix, section 2.2), and at least to some extent, the same Fig. 2C, we show an analogous analysis of the estimated genera- happens in the school layer. However, R(t) generally decreases tion time in the data-driven model. We find that the generation in the household and community layers (Fig. 3A). Indeed, R(t) time is considerably shorter than the duration of the infectious in the household layer tends to be more uniform across the period (i.e., 3 d in our simulations). The estimated average Tg different nodes and thus, follows a general decreasing trend sim- over the whole epidemic is 2.67 d (≈ 11% shorter than the the- ply led by the depletion of susceptible contacts. We also found oretical value), with a more marked shortening right before the that, in the household layer, Tg is remarkably shorter than in epidemic peak (Fig. 2C). This differs from what is predicted by all other layers (Fig. 3B)—with an average fluctuating around the classic theory and the analysis of the homogeneous model 2.6 d, close to the value reported by analyzing real data for house- (Fig. 2C), where the length of the infectious period corresponds hold transmission (26). To provide a simple illustration of the to the generation time (6). saturation effect in households, let us consider a household of A closer look at the transmission process in the different lay- three, with one index case and two susceptible members. If, at ers of the multiplex network helps in understanding the origin time t, the index case infects exactly one of the two susceptibles,

12682 | www.pnas.org/cgi/doi/10.1073/pnas.1811115115 Liu et al. Downloaded by guest on September 27, 2021 Downloaded by guest on September 27, 2021 ucpiiiyb g 3,5,5) hsi nlddi h model the in included is differential this a 54); of 53, presence (34, the age was by susceptibility H1N1 signatures characteristic 2009 the the of 2009 of One model, the Italy. model transmission in to epidemic pandemic framework influenza modeling realistic data-driven more the a used we in results Italy. in the Pandemic of Influenza H1N1 2009 The distributions). in gamma reported are and analyses exponential These tested (we time the removal both of terms in in transmission (measured spreading influenza intensity populations), infection synthetic Dutch the and simulated Italian pop- we the of 3B). (i.e., structure (Fig. sociodemographic ulation the community in of changes general to patterns respect the observed the of of exception All the are with effects tings, Saturation shortening (or studies. for time responsible household also generation from the interval) of serial estimates empirical effects competition providing household when observed within evi- why This considering period. for explain infectious calls to the dence 52 than shorter ref. are in times used generation was argument any infectious. similar be infect still A cannot may she/he she/he although because member, household simply other gen- shorter, resulting is the time infection; eration the transmit to individual infectious i tal. et Liu time at then of distribution density the realizations. single shows the area in colored obtained The values layers. four the Mean in (B) realizations. of single distribution the density in the shows obtained area colored The layers. four the 3. Fig. B A Mean (A) patterns. Layer-specific t 1 + h ne aehst opt ihteother the with compete to has case index the , R etos2.4–2.6. sections Appendix, SI Tg R index (t nteohrtasiso set- transmission other the in R(t ) ,adtedsrbto fthe of distribution the and ), and Tg(t o h aadie oe in model data-driven the for ) o h aadie model data-driven the for ) ots h robustness the test To Tg (t ) r outwith robust are R(t values ) Tg(t ) etwt htotie nmdlsmltos(i.4 the (Fig. simulations before model weeks in consis- is obtained 3 data that the about with in observed the tent drop trend of The sharp 4B). phase (Fig. pandemic peak a early epidemic influenza the by H1N1 during followed 2009 trend epidemic the increasing of an reported course shows cases clearly the (ILI) over estimated illness The Italy influenza-like 4A). in (Fig. the pandemic from the of rate end growth the at age by rates (55). in pandemic provided influenza are H1N1 details 2009 Monte Model the after chain and before Markov Italy collected in age Bayesian by data seroprevalence a on approach through (MCMC) Carlo calibrated rates. transmission are layer-specific They four the parameters: unknown ei n a eylmtdutk nIay(acnto cover- ( Italy in uptake not limited pan- age very is the a of had Vaccination tail and the (55). during demic only data started vaccination serological as pop- literature considered, the to the in age in according by reported ulation prepandemic as consider Italy also for We (55). estimated values using by model. dis- density 2009 the of shows the area tribution on colored calibrated The (C model data. seroprevalence Italy. data-driven influenza the in H1N1 (r of open rate analysis growth regularly the epidemic were from exponential schools resulting weekly period 2009, mean the the in over of 51 avail- pattern that, week Temporal the are Note to over Data (old.iss.it/flue/). 35 axis). Italy website week right in Influnet from the cases ISS on ILI the (scale new at pandemic able of H1N1 2009 incidence influenza the Weekly H1N1 of 2009 2009 axis). in course the left 41 week of the to course 35 on week the from (scale over model linear Italy best-fitting the in and cases pandemic ILI new of dence nunapnei nIay(5 n setmtdb h airtdmodel. calibrated H1N1 the time 2009 by over the estimated rate of as growth end and Epidemic the (55) (B) at Italy conducted in serosurvey pandemic a influenza in observed as age by 4. Fig. C B A h airtdmdli bet elcpueteseropositive the capture well to able is model calibrated The < 5) si h rvosscin h oe a four has model the section, previous the in As (56). 1%) eorvlnerates Seroprevalence (A) Italy. in pandemic influenza H1N1 2009 The r (t ausotie ntesnl elztoso h data-driven the of realizations single the in obtained values ) PNAS | eebr1,2018 11, December eto 1.1. section Appendix, SI r (t setmtdfo h ekyinci- weekly the from estimated as ) | o.115 vol. | o 50 no. .Ftiga Fitting C). | 12683 ) )

POPULATION BIOPHYSICS AND BIOLOGY COMPUTATIONAL BIOLOGY linear regression model to the estimated growth rate over time A in the ILI data results in an estimated coefficient of 0.064 (SE of 0.033), while the mean value obtained with a linear regression model of case incidence from all stochastic realizations is 0.064 (95% CI, −0.207–0.305). An increase in the initial growth rate of the epidemic has also been observed for other diseases and countries (57–59). Although alternative explanations may exist (60), our results provide a plausible explanation for these pat- terns based on the intrinsic structure of human contact networks. In terms of the dynamics of R(t) and Tg(t), the model calibrated on the 2009 influenza pandemic confirms the noticeable devia- B tions of the data-driven model from the homogeneous modeling framework (SI Appendix, section 2.3).

Discussion Our simulation results clearly highlight how the heterogene- ity and clustering of human interactions (e.g., contacts between household members, classmates, work colleagues) alter the stan- dard results of fundamental epidemiological indicators, such as the reproduction number and generation time over the course C of an epidemic. Our results seem to be consistent in different countries (SI Appendix, section 2.4), suggesting that the observed patterns are due to the structure and clustering of human con- tact patterns rather than country-specific features. Furthermore, the analysis of alternative null models, such as degree-preserving and layer-preserving configuration models (SI Appendix, section 2.7), shows markedly different behaviors than those exhibited by the data-driven model, suggesting that the multiplex structure and the strong clustering effect typical of human populations, not Fig. 5. Estimation of R(t). (A) Daily R(t) as inferred from the daily incidence captured by null models, are at the root of the observed behavior. of new infections for one stochastic model realization. Tg is assumed to be Our numerical study questions the measurability of R0 in exponentially distributed with an average of 3 d. (B) The same as A but the realistic populations and its adequacy even as an approximate distribution of Tg has been derived from the analysis of the transmission tree descriptor of the epidemic dynamics. However, it is still possible of the selected model simulation. (C) The same as A but using the distribution to characterize the dynamics of an epidemic in terms of its effec- of Tg over time as derived from the analysis of the transmission tree of the R(t) selected model simulation. The goodness of the estimate of R(t) increases with tive reproduction number over time by estimating from the a more precise knowledge of the generation time value and distribution. daily number of new cases by using Bayesian approaches (11, 17) (Materials and Methods). In Fig. 5, we show the compari- α α son between R(t) as inferred from the time series of cases and layer-specific adjacency matrix A , where elements aij = wα > 0 if there is α R(t) resulting from the microsimulation data of the transmission a link between nodes i and j in layer α and aij = 0 otherwise. chain for one stochastic model realization (other realizations are shown in SI Appendix, section 2.8). The overall good agreement Homogeneous Mixing Network. This model assumes a single fully connected between the estimated and actual values shows that it is possible network. Following the notation introduced above, we have aij = w for each = / − to operationalize the estimation of R(t) to provide projections of i and j, where w 1 (N 1) and N is the total number of individuals in the population. the epidemic dynamics. The goodness of the obtained estimates depends on the knowledge of the distribution of the generation The Epidemic Transmission Model. On each of the introduced networks, we time (Fig. 5) and on the availability of a reliable time series of simulate the influenza transmission process as an SIR model. The SIR model cases not markedly affected by noise and underreporting (10). assumes that individuals can be in one of the following three states: sus- ceptible, infectious, and removed. Two types of transitions between states Conclusion are possible: (i) from susceptible to infectious and (ii) from infectious to The analysis presented here takes advantage of “in silico” numer- removed. The transition from susceptible to infectious requires a contact between an infectious individual and a . Specifically, ical experiments to open a window of understanding in the given that, at time step t, node j is infectious and its neighbor node i is analysis of realistic epidemic scenarios. Although a lot of theo- susceptible, the probability that j infects i (i.e., i changes its status from retical work has been done to define the reproduction number in susceptible to infectious) is given by β = pw, where p is the transmission nonhomogeneous models (19–24), a unified theory is still lack- probability per contact and w is the contact weight. However, the transition ing. While we are not providing a theoretical framework for the from infectious to removed does not involve contact between individuals. computation/definition of the reproduction number and the gen- Given that, at time step t, node i is in the infectious state, it has a probability eration time on realistic contact networks, we provide evidence γ to recover at time step t + 1. that estimates of R(t) can be used to characterize the epidemic without resorting to untenable assumptions that may eventu- Estimation of R (t ) from Transmission Events Time Series. Following the same approach used in refs. 11 and 17, we assume that the daily number of new ally mislead our understanding of the epidemic transmission cases C(t) at time t can be approximated by a Poisson according to the potential and temporal dynamics. following equation:

t ! X Materials and Methods C(t) ≈ Pois R(t) φ(s)C(t − s) , Data-Driven Contact Network. The model considers a weighted multiplex s=1 ~ network (38) G = (GH, GS, GW , GC ), where H, S, W, and C represent house- hold, school, workplace, and community layers, respectively. Let us define V where φ is the generation time distribution and R(t) is the effective repro- as the set of all nodes, which is common to all layers, and Eα as the set of duction number at time t. The likelihood L of the observed time series of edges in layer α. We can thus characterize each layer Gα = (V, Eα) by the cases from day 1 to T is thus given by

12684 | www.pnas.org/cgi/doi/10.1073/pnas.1811115115 Liu et al. Downloaded by guest on September 27, 2021 Downloaded by guest on September 27, 2021 5 acee ,e l 21)Rl fsca ewrsi hpn ies transmission disease shaping in and networks social structured of Role in (2011) al. (t) remedies. et and S, R Biases Cauchemez epidemics: number emerging 15. in Estimation reproduction (2018) GS The Tomba T, (2009) Britton G 14. Chowell T, Burr 13. 6 jliM ta.(05 h 04Eoavrsdsaeotra nPjhn iraLeone: Sierra Pujehun, in outbreak disease 2014 The (2015) al. et M, Ajelli 16. first The Africa: West in Cintr disease 12. virus Ebola (2014) Team Response respiratory Ebola acute WHO severe for 11. curves epidemic Different (2004) P Teunis J, Wallinga 10. 0 rsrC(07 siaigidvda n oshl erdcinnmesi an in numbers reproduction household and individual Estimating and transmission (2007) on C distribution household Fraser of 20. effect The (1995) K Dietz NG, models. Becker epidemic network 19. for challenges Eight in (2015) al. epidemic et Ebola L, Pellis the 18. of dynamics Spatiotemporal (2016) al. et M, Ajelli 17. 2 odti ,e l 20)Rpoutv ubr,eiei pedadcnrli a in control and spread epidemic numbers, Reproductive (2009) al. household et explicit with E, models Goldstein epidemic 22. Deterministic (2008) MJ Keeling T, House 21. 0 uael ,Ael ,Mre ,Fruo M acee 21)Mdlbsdcom- Model-based (2016) S Cauchemez NM, Ferguson influenza S, an Merler M, of Ajelli containment L, Fumanelli layered targeted 30. Modeling (2008) al. Italy: et in ME, influenza pandemic Halloran for measures 29. Mitigation (2008) al. et ML, pandemic. Atti influenza Degli an Ciofi mitigating 28. for Strategies (2006) al. et NM, source. Ferguson pandemic the influenza emerging 27. an at containing for influenza Strategies (2005) al. pandemic et NM, Containing Ferguson (2005) 26. al. et repro- IM, household Longini and rate 25. growth Epidemic (2011) C Fraser population NM, a in Ferguson dynamics L, disease Pellis of Calculation 24. (2010) MJ Keeling T, House JV, Ross 23. i tal. et Liu CCsampling. MCMC rate observing known of a probability the (i.e., tion here, where .Pso-aorsR atlaoC a ige ,Vsinn 21)Epidemic (2015) A Vespignani P, Mieghem Van com- C, the Castellano and definition R, the Pastor-Satorras On A 9. (1990) infections: JA viral Metz respiratory JAP, acute Heesterbeek of O, periods Diekmann Incubation 8. (2009) al. et repro- J, basic Lessler the relationship 7. of estimation the Model-consistent shape (2007) intervals J generation Heesterbeek M, How Roberts (2007) 6. M Lipsitch J, Wallinga Africa 5. West (1991) B the Anderson of RM, and modeling May RM, Mathematical Control Anderson (2015) Disease 4. DB George for S, Centers Riley JP, the Chretien from 3. Results (2016) al. et M, Biggerstaff 2. .Vbu ,e l 21)TeRPD bl oeatn hleg:Snhssadlessons and Synthesis challenge: forecasting ebola RAPIDD The (2018) al. et C, Viboud 1. uigacmuiyotra f20 11pnei influenza. pandemic H1N1 2009 USA of outbreak community a during arXiv:1803.01688. populations. nonstructured data. outbreak disease Eng from number reproductive effective the of mation pdmooyadipc finterventions. of impact and Epidemiology projections. forward and epidemic the 1495. of months 9 measures. control of impacts 516. similar reveal syndrome in diseases infectious networks. for complex in models processes in R0 ratio reproduction populations. basic heterogeneous the of putation infection. review. systematic emerging an of incidence the 803–816. from number duction numbers. reproductive and rates growth between Control and epidemic. Ebola Challenge. Season Influenza 2013–2014 16:357. the Predict Prevention’s mrigepidemic. emerging diseases. infectious highly of control 62. modeling computational A elimination: disease analysis. and vaccination for implications and learnt. omnt fhouseholds. of community structure. rhnieaayi fsho lsr oiisfrmtgtn nunaeieisand epidemics influenza mitigating for policies . closure school of analysis prehensive States. United the in pandemic scenarios. different considering model based individual An 442:448–452. Asia. Southeast in 309:1083–1087. workplaces. and schools households, 63:691–734. of communities in number duction households. of 6:261–282. 108:2825–2830. nAisA Castillo-Ch A, on-Arias ´ Epidemics M Med BMC ahBiosci Math LSCmu Biol Comput PLoS P Ofr nvPes Oxford). Press, Univ (Oxford (k .Tepseirdsrbto of distribution posterior The λ). LSOne PLoS , eLife stepoaiiyms ucino oso distribu- Poisson a of function mass probability the is λ) 22:13–21. Nature actIfc Dis Infect Lancet L 14:130. LSOne PLoS = 4:e09186. 213:29–39. Y t =1 T 5:e9666. 437:209–214. ahBiosci Math vzC etnor M ly L ak 20)Teesti- The (2009) H Banks AL, Lloyd LM, Bettencourt C, avez ´ P ahBoc Eng Biosci Math ahBiol Math J

12:e1004681. 2:e758. rcNt cdSiUSA Sci Acad Natl Proc C e o Phys Mod Rev (t ), 9:291–300. ahBiosci Math R(t 221:11–25. ) 28:365–382. X s=1 k M Med BMC t netosDsae fHmn:Dynamics Humans: of Diseases Infectious 6:239–259. vnsi hs vnsocrwith occur events these if events 87:925–979. φ(s)C 127:207–219. rcBo Sci Biol Proc (t R(t 13:281. 105:4639–4644. − ste xlrdusing explored then is ) s) nlJMed J Engl N mJEpidemiol J Am LSOne PLoS ! , 274:599–604. rcNt cdSci Acad Natl Proc Epidemics ahBiol Math J M netDis Infect BMC 3:e1790. ahBiosci Math ahBiol Math J 371:1481– 160:509– Science Nature 10:58– 55: 0 oet ,Ael ,Mre 21)Teefc frs ecpino h 09H1N1 2009 the on perception risk of in effect outbreak The (2011) S 2017 Merler M, The Ajelli (2018) P, Poletti E Hernandez-Vargas 60. CP, Rojas VK, Nguyen interval serial 59. the and number reproductive the of epidemic Estimation the (2009) of al. et estimation LF, bayesian White time 58. Real and (2008) risk RM Italy. Ribeiro Age, in pandemic LM, (H1N1) Italy: Bettencourt a influenza in 57. 2009 the A/H1N1pdm to Response (2010) influenza al. et Pandemic C, Rizzo (2013) 56. al. et S, Merler 55. 4 acee ,e l 20)Hueodtasiso f20 admciflez A influenza pandemic 2009 Early of (H1N1): transmission A Household influenza (2009) of al. strain et a S, of Cauchemez potential Pandemic 54. (2009) al. et C, Fraser 53. Svensson G, Tomba Scalia 52. 1 Barth estimating 51. of cons and Pros (2010) C Castillo-Chavez M, Safan G, Chowell H, Nishiura The (2004) 50. JM seasonal Hyman and PW, Fenimore pandemic C, Castillo-Chavez of NW, serial epidemiology Hengartner the G, Comparative of Chowell (2010) Estimation 49. (2009) al. GM et Leung BJ, JM, Peiris Cowling S, 48. Riley VJ, the Fang of Estimates BJ, (2014) Cowling L 47. Finelli M, Gambhir of C, spread Reed the S, to Cauchemez relevant M, patterns Biggerstaff contact 46. Estimating (2017) M Litvinova M, Ajelli 45. structure the Inferring (2012) S Merler A, Vespignani of P, Manfredi spread M, the Ajelli to L, relevant Fumanelli patterns mixing 44. and contacts Social (2008) al. et J, Mossong 43. networks. multilayer of formulation Mathematical (2013) al. et M, Domenico De 39. respiratory Kivel acute severe of 38. control and dynamics Transmission (2003) al. et M, characterize Lipsitch to model generalized-growth 37. A (2016) G Chowell L, repro- Simonsen the C, Viboud Characterizing 36. (2016) SM Moghadas L, Simonsen C, Viboud G, spatiotem- Chowell the of 35. Determinants (2011) NM Ferguson A, Pugliese mobility M, human Ajelli and heterogeneity S, population Merler of 34. role The (2010) M Ajelli S, Merler 33. 2 acee ,Cra ,Vbu ,Vleo ,Bel 20)ABysa MCMC Bayesian A (2004) P Boelle A, Valleron C, Viboud F, in Carrat contexts social S, different of Cauchemez role The 42. (2014) S Merler A, Melegaro P, Poletti M, Ajelli 41. (2018) Y Moreno F, Rodrigues G, Arruda de Ferraz E, Cozzo 40. for strategies Mitigation (2006) CA Macken in IM, component Longini contacts unstructured K, the Kadau of TC, impact Germann The 32. (2008) S Merler M, Ajelli 31. I21-71-.Tefneshdn oei td ein aacollection data design, manuscript. study the in of Grant preparation role Funds or Regional no analysis, Desarrollo and had de of funders Europeo Fondo The Government and FIS2017-87519-P. the MINECO as from well as support partial acknowledges Arag Y.M. scheme. Econom ity de Formaci acknowledges Ministerio A.A. the acknowl- from MIDAS-U54GM111274. of A.V. Grant support and NIH M.A. the of support 201606070059. the Grant edge Council Scholarships China ACKNOWLEDGMENTS. admciflez dynamics. influenza pandemic bioRxiv:10.1101/247569. modelling. epidemic and descriptions Data Madagascar: USA. the in pandemic A/H1N1 influenza 2009 the Respir of phase early in diseases. infectious emerging of potential Surveill susceptibility. population HN)vrsi h ntdStates. United the in virus (H1N1) findings. diseases. communicable for times 223:24–31. generation observing on siderations networks. scale-free in outbreaks epidemic 92:178701. of spread hierarchical h erdcinnme rmeryeiei rwhrt fiflez (H1N1) A influenza of rate growth epidemic early 2009. from number reproduction The the measures: health public of effects Uganda. and the Congo and of Ebola cases of number reproductive basic households. in A influenza influenza. of interval systematic A influenza: literature. zoonotic the and of pandemic, review seasonal, for number reproduction Russia. in diseases infectious spread. diseases infectious of Biol analysis the Comput in PLoS data demographic from contacts social of diseases. infectious syndrome. outbreaks. disease infectious of phase ascending early the dynamics. growth subexponential early with Interface epidemics real-time of for number Implications duction Europe: in pandemic H1N1 2009 the modelling. of dynamics poral influenza. pandemic of spread the in praht td rnmsino nuna plcto ohueodlongitudinal household to Application influenza: of data. transmission study to approach pandemic. 2009 the during transmission influenza shaping Cham, (Springer, Complexity in Briefs Springer Switzerland). Properties, Structural and Formalism X Rev States. United the in influenza pandemic modeling. pandemic influenza n pi hog rn t h ru F group the (to grant a through Spain on, ´ ,e l 21)Mliae networks. Multilayer (2014) al. et M, a ¨ lm ,Bra ,Pso-aorsR epgaiA(04 eoiyand Velocity (2004) A Vespignani R, Pastor-Satorras A, Barrat M, elemy ttMed Stat ´ ho ilMdModel Med Biol Theor 3:041022. 15:19744. Science 13:20160659. Science LSCmu Biol Comput PLoS 3:267–276. 23:3469–3487. PNAS 324:1557–1561. 8:e1002673. 300:1966–1970. LSMed PLoS Epidemiology .HL cnwegsspotfo rga fthe of Program from support acknowledges Q.-H.L. M netDis Infect BMC | LSOne PLoS ,AiannT isceJ(00 oemdlbsdcon- based model Some (2010) J Giesecke T, Asikainen A, ˚ nlJMed J Engl N eebr1,2018 11, December ho Biol Theor J ho Biol Theor J 7:1. 7:e1002205. nProa netgdrDcoa Fellowship Doctoral Investigador Personal on ayCmeiiia MNC)adismobil- its and (MINECO) Competitividad y ´ ıa 5:e74. ´ LSOne PLoS LSOne PLoS nlJMed J Engl N 8:e74785. 20:344. rcBo Sci Biol Proc 14:480. 6:e16460. 362:2175–2184. 419:1–7. 3:e1519. 229:119–126. rcNt cdSiUSA Sci Acad Natl Proc LSOne PLoS ope Netw Complex J 2009:2619–2627. 277:557–565. | sc Estad ´ ısica 3:e2185. o.115 vol. Epidemics utpe ewrs Basic Networks: Multiplex c Rep Sci 2:203–271. | siayN Lineal) No y ´ ıstica 103:5935–5940. o 50 no. 4:7218. nunaOther Influenza 15:27–37. hsRvLett Rev Phys ahBiosci Math | Soc R J 12685 Euro Phys

POPULATION BIOPHYSICS AND BIOLOGY COMPUTATIONAL BIOLOGY