Arxiv:2109.00050V1 [Stat.AP] 31 Aug 2021 2 Methodology
Total Page:16
File Type:pdf, Size:1020Kb
USE OF ALTERNATIVE DATA:HIGH FREQUENCY READOUT OF THE SITUATION - COVID POLICIES, MOBILITY AND R-NUMBER Ashutosh Mani Dixit Suraj Regmi Economist Data Scientist [email protected] [email protected] ABSTRACT The role of alternative data in the crisis was recognized even before the COVID-19 pandemic[1]. Now, the months of stalemate made it more urgent to understand the importance of high-frequency data to inform the policy responses [2]. In Nepal, the Government has exerted stay put measures, and physical data collection activities are suspended. The confirmed cases of COVID-19 has reached more than 560,000[3] and the country is on high alert . In this impasse, the number of secondary cases one would produce over the course of outbreak - the reproduction number (R0) is useful to monitor the transmissibility of COVID-19 [4]. As the R-value is rapidly changing, it can be affected by a range of factors, including not just how infectious a disease is but how Government responds to it, and how the population behaves1. The World Health Organization (WHO) has suggested to the Government of Nepal several recommendations to contain the further spread of COVID-19. To get a sense of how Nepal is coping with the coronavirus pandemic we look at the alternative data sets to get a better understanding of the pandemic policies, mobility, and R-value during COVID. Keywords Alternative data · COVID-19 · R-Number 1 Objective (a) To get the high frequency read out of the COVID situation in Nepal We calculate effective reproduction number (R-value) from OWID data (smoothed)[5], and gain additional insights from COVID-19 community mobility reports2, the Oxford Coronavirus Government response tracker - Oxford stringency index3 and Google search trends.[7] (b) Make available the source code for extracting alternative data The data, and source code, along with frequently updated dashboard monitoring the R-value will be made open and available for public use. arXiv:2109.00050v1 [stat.AP] 31 Aug 2021 2 Methodology 2.1 Effective Reproduction Number The real time reproduction number is estimated using Bayesian approach, assuming the new number of daily cases satisfies the Poisson paradigm. The work[8] done by Kevin Systrom at the US state level is replicated here. Systrom used the modified version of a solution created by [9] to estimate a real time reproduction number. As with changing conditions (behavior of people, government policies, etc), the value of Rt changes. The effective reproduction number 1https://www.weforum.org/agenda/2020/05/covid-19-what-is-the-r-number/ 2Google community mobility report was launched in April to showcase change in mobility trends in COVID situations. 3Policy responses come from the Oxford Coronavirus Government Response Tracker (OxCGRT). The tracker is published by researchers at the Blavatnik School of Government at the University of Oxford[6] Alternative Data: COVID depends on yesterday’s (or previous) reproduction number and number of daily new cases. [9] use Bayes’ rule to update the real time value of Rt from the number of daily new cases and prior value of reproduction number. The new number of cases are seen everyday. This number of new cases says us something about the tranmissibility. Also, the Rt value of today has relation with Rt−1 value of yesterday, and every previous value of Rt−m. [9] use Bayes’ rule to update the true value of Rt based on the number of new cases daily. Mathematically, P (kjR ) · P (R ) P (R jk) = t t t P (k) So, if we see k new cases, the distribution of Rt is equal to the likelihood of seeing k new cases given Rt times the prior beliefs of the value of P (Rt) without the data divided by the probability of seeing this many cases in general. Now, every day that passes, we use yesterday’s prior P (Rt−1) to estimate today’s prior P (Rt). The distribution of Rt is assumed to be a Gaussian centered around Rt−1, i.e. P (RtjRt−1) = N (Rt−1; σ), where σ is a hyperparameter. Choosing a Likelihood Function P (kjRt) A likelihood function function says how likely we are to see k new cases, given a value of Rt. We model the probability of seeing k new cases according to Poisson distribution, with arrival rate λ equal to number of new cases each day. λke−λ P (kjλ) = k! Connecting λ and Rt The connection between Rt and λ is given in the paper as: γ(Rt−1) λ = kt−1e where γ is the reciprocal of the serial interval. The serial interval is about 7 days according to CDC. As we know the number of new cases on the previous day, we can reformulate the likelihood function as a Poisson parameterized by fixing k and varying Rt. γ(Rt−1) λ = kt−1e λke−λ P (kjR ) = t k! 3 Limitations 3.1 Google mobility data There are blind spots in alternative data, in particular data coming from mobile phones. Lower smartphone penetration rate among older people, and rural population, may not give a complete picture of the mobility. Additionally, the Google mobility report states that their data comes only from android smartphone users who allowed the device to track their location. 3.2 Oxford Government Response tracker Oxford Coronavirus Government response tracker does not aim to measure the appropriateness or effectiveness of a country’s response. So a higher index should not be interpreted as the efficacy or effectiveness of the policy. 2 Alternative Data: COVID 4 Background After several months of relatively low COVID cases in Nepal , COVID-19 cases began to rapidly spike in mid-April 2021 following a steep upwards trajectory (Figure 1). Figure 1: Cases, stringency index4, and residential mobility5 Source: (Our World in Data); (Google LLC, 2020); (University of Oxford - Blavatnik School of Government, 2020) 4“The Stringency Index is an aggregate score/composite measure made up of a particular combination of policy indicators/response metrics from the codebook and their values (for the Stringency Index these are C1-8 and H1). The OxCGRT aggregates these policy indicator values into a common “Stringency Index” that runs from 0 -100.”- www.bsg.ox.ac.uk 5Stay at home requirements: 0 - no measures , 1 - recommend not leaving house , 2 - require not leaving house with exceptions for daily exercise, grocery shopping, and ’essential’ trips , 3 - require not leaving house with minimal exceptions (eg allowed to leave once a week, or only one person can leave at a time, etc) , Blank - no data 3 Alternative Data: COVID It was on 29th April 2021 after the cases started to surge, the transmission rate shooted upto 2.480 on 21st April - highest observed by the country till date (Figure 2). Nepal reinstated the control of social behavior, the stringency index which measures the severity of Government response increased from 30.56 to 91.67, as the Government increased the stay home requirements from 0 to 3 i.e from “no measures” to “require not leaving house with minimal exceptions”. People spent more time at home, and there was a spike in “residential percentage change from the baseline6” from 1% on 28th April 2021 to 19% on 5th May 2021. Figure 2: R-value and stringency index 6All lines represent a 7-day moving average. Baseline values were established using a median of the corresponding day of the week from the period between January 3 and February 6, 2020. 4 Alternative Data: COVID Grocery and pharmaceuticals - generally experiencing high mobility, also recorded slump in the movement on 5th May 2021 and thereon (Figure 3). On Wednesday 28th April 2021, the change in mobility in grocery and pharmaceuticals in Nepal was 81 percent, whereas on Wednesday 5th May 2021 it receded to -21 percent. This can also be interpreted as “people were 21 percent less likely to be in grocery and pharmaceuticals on Thursday 5th May” than they were in the baseline (median of the corresponding day between 3rd January and 6th February). Similarly, the people were 81 percent more likely to be in grocery and pharmaceuticals on 28th April 20217. This was a result of restriction on movement as the Government reduced the opening hours of Grocery. The index of restriction on internal movement went up to 2. Figure 3: Grocery and pharma percent change from baseline, and restriction on internal movement Source: (Google LLC, 2020)[10]; (University of Oxford - Blavatnik School of Government, 2020) 7Because of the privacy, google did not release the absolute number. 5 Alternative Data: COVID The movement in transit and stations changed from 56 percent in 28th April to -41 percent in 5th May from the baseline, approximately 43 percent points increase. The closure of public transport, and restrictions in internal movement imposed by the Government decreased the frequency of visits in the transit stations such as bus parks and airports. However, some movements were allowed with travel passes (Figure 4). Figure 4: Transit and station Source: (Google LLC, 2020); (University of Oxford - Blavatnik School of Government, 2020) 6 Alternative Data: COVID In the Google mobility report, the baseline for the weekend is the median of weekends falling between January 3 and February 6. As the weekend visits get closer to normal value the relative change becomes smaller, as such we see recurrent spikes in workplace mobility (Figure 5). In Nepal, while most of the corporate offices, IT companies, NGOs and INGOs, switched to work from home8, there were banks and financial institutions which continued to operate as per the regulatory instructions9 and remained open even during the lockdown with limited staff and reduced hours of operations.