Geographically and Temporally Weighted Regression Modeling in Analyzing Factors Affecting the Spread of Dengue Fever in Malang
Total Page:16
File Type:pdf, Size:1020Kb
Geographical & Temporal Weighted Regression Model in 71 The Spread of Dengue Fever in Malang (Indrayani et al) Geographically and Temporally Weighted Regression Modeling in Analyzing Factors Affecting the Spread of Dengue Fever in Malang A. Fahmi Indrayani1*, Henny Pramoedyo2, Atiek Iriany2 1Master Program of Mathematics, Faculty of Mathematics and Natural Sciences, University of Brawijaya, Malang, Indonesia 2Department of Mathematics, Faculty of Mathematics and Natural Sciences, University of Brawijaya, Malang, Indonesia Abstract Geographically and Temporally weighted regression (GTWR) modeling has been developed to evaluate spatial heterogeneity and temporal heterogeneity in factors influencing the spread of dengue fever in Malang city. By using the monthly data in 2012-2015 as the temporal unit of each urban village in Malang and village is considered as a spatial unit. GTWR model is compared with the GWR model using several statistical criteria. GTWR model shows that the relationship between dengue incidence with population density and monthly average temperature significantly affects each Village in Malang. Keywords: dengue hemorrhagic fever, Geographically and Temporally weighted regression, spatiotemporal pattern. INTRODUCTION dengue fever and rainfall varies greatly in each Dengue hemorrhagic fever (DHF) is a disease location of observation. As a result, its conclu- caused by the dengue virus that is found in tropi- sions only look at the spatial effects and one fac- cal and subtropical areas. The number of suffer- tor, not paying attention to the influence of time ers and the extent of their spreading areas in- and other factors that can also affect the spread creases with increasing mobility and population of dengue in the area. density. In Indonesia, the disease transmitted by In many cases, data structures are not only Aedes aegypti mosquitoes was first discovered in owned by geographic location but also related to Surabaya and Jakarta in 1968 [1]. the time factor. One method for examining spa- For the last three years, patients with dengue tial and temporal influences is geographically and fever in Malang continue to increase (Health Of- temporally weighted regression (GTWR). This fice of City Government of Malang, 2016, un- statistic method is developed from GWR. It is published). Various programs have been imple- expexted that this model can accommodate spa- mented for disease prevention and control, but tial influence, and also able to analyze the tem- DHF in Malang remains high. High dengue cases poral influence which present in the data simul- require serious efforts to prevent the spread of taneously [4]. DHF endemic disease by analyzing the risk factors The purpose of this study is to extend the associated with dengue virus infection. Many GTWR modeling framework in analyzing factors research on the factors that cause the spread of causing dengue extensively by observing the dengue virus in the community were conducted presence of spatial heterogeneity and temporal [2]. heterogeneity in the modeling. The GTWR model Previous study used the Geographically obtained is compared with the GWR model using Weighted Regression (GWR) model in its analysis; several statistical tests. The results of this study where GWR statistical methods adjusted the will practically be useful for the authorities in global framework with local regression models regulating the prevention program of DHF infec- allowing estimation regression parameters for tion locally and at certain times. each spatial unit [3]. And it is reported that rain- MATERIAL AND METHOD fall significantly affects the spread of DHF. How- Study Area ever, this research is limited to spatial effects and Location of this research is Malang City, East rainfall factors, where the relationship between Java, which consists of 57 urban villages. Total area of Malang is 252.10 km2 which is located on o o Java island about 7 06 '- 8 07' latitude and Correspondence address: o o A. Fahmi Indrayani 112 06 '- 112 07' longitude [5]. Village is used as Email : [email protected] a spatial unit in this study, there are 57 villages Address : Dept. Mathematics, Faculty of Mathematics and and the month as a temporal unit, consisting of Natural Science, University of Brawijaya, Veteran 48 months. Malang, Malang 65145. J.Exp. Life Sci. Vol. 8 No. 2, 2018 ISSN. 2087-2852 E-ISSN. 2338-1655 Geographical & Temporal Weighted Regression Model in 72 The Spread of Dengue Fever in Malang (Indrayani et al) Data Collection observation at location ith. The first step for Data on the number of DHF patients in each GTWR modeling to define the coordinates of Village and each month during the year 2012 - Longitude and Latitude for each Village at t time. 2015 was obtained from the Health Office of Ma- Geographical coordinates are used to determine lang city government. Monthly data as a tem- the euclidean distance between villages observed poral unit in each Village, one Village is defined at t time as one spatial unit. For population density data, 2 2 2 the number of Puskesmas (health center) in each dij ui u j vi v j ti t j ……3 Village, as well as the height of the place in each Village obtained from Malang Central Agency on λ and μ are scale factors used to measure Statistics publication [5-8]. Data of rainfall and location and time difference. Distance in equa- average temperature obtained from Karangploso tion (3) as a basis in weighting data to estimate Agency for Meteorological, Climatological and GTWR model parameters. The closer the distance Geophysics. between the expected points each village, the Number of DHF patients selected as response greater the weight of the data during the estima- variable (Y), population density (X1), Number of tion. In this study, data weighting is done by us- health center auxiliary (X2), rainfall (X3), average ing the adaptive Gaussian kernel function temperature (X4) and altitude of place from sea 2 ………………………………….4 wij expdij / h level (X5) as predictors variable. Statistical Methods Where, h is the constant bandwidth per- The relationship between DHF incidence and formed by the Incremental Spatial Autocorrela- factors considered to affect the spread of DHF tion (ISA) method [9]. The Adaptive Gaussian was analyzed by comparing the results of GWR kernel function is used in forming a weighted model and GTWR model. If GTWR model is better matrix. than GWR model, the relation between respond- Wui ,vi ,ti …………………..5 ent variable and the predictor variables is spatial- diagW1ui ,vi ,ti ,,Wn ui ,vi ,ti ly and temporally different for each Village. GWR model is a simple model and is usually Each observed data has one weighted matrix used in research. The spatial heterogeneity of the Wui ,vi ,ti in estimating the parameters. Using relationship between the occurrence of DHF and the algebraic matrix approach, and using the factors influencing it was analyzed by the GWR Weighted least square (WLS) method the param- model allowing estimation of different regression eter estimation can be written [4] parameters in each Village [3]. ˆ βui ,vi ,ti ……6 p Y u , v u , v X …..1 1 i 0 i i k 1 k i i ik i X 'Wui ,vi ,ti X XWui ,vi ,ti Y Significant test of GTWR model is given on Where u ,v and u ,v is the co- 0 i i k i i [10]. Where the null hypothesis of test indicates efficient of GWR in each Village i; u ,v is the i i the function k ui , vi for each coordinate location point of the Village is the definition of ui ,vi of the research location. If the null hy- the longitude and latitude coordinates ; and i pothesis is rejected, then the GTWR model is is an error that is assumed to be normal distribu- accepted to describe the data. Meanwhile, tests tion with mean 0 and its variant known. for the variation of each set of for the entire The GTWR model is formed by using the spa- k tial weight function and the temporal weighting study site will be used in F3 statistics [10]. The function in estimating its parameters [4]. identification of the GTWR model will also be shown by coefficient determination (R2). The p Y u ,v ,t u ,v ,t X ……2 it 0 i i i k1 k i i i ik i comparison of the GWR model and the GTWR model will be shown with Root mean square er- Where u ,v ,t is the location of Village ith i i i ror. According to RMSE the regression model that and the observation time for the observation has the smallest RMSE value is defined as the location. Longitude and Latitude coordinate de- best model. fined by ; is a response variable, Yit X ik is a factor that may affect and i is an error of J.Exp. Life Sci. Vol. 8 No. 2, 2018 ISSN. 2087-2852 E-ISSN. 2338-1655 Geographical & Temporal Weighted Regression Model in 73 The Spread of Dengue Fever in Malang (Indrayani et al) RESULT AND DISCUSSION this model the value of obtained R2 = 51.1% and Result of GWR Model RMSE = 13.13. Further analysis of the model indi- Using the spatial weighting, the parameter cates that the error approaches normal distribu- estimation ratios (regression coefficients) for the tion. obtained GWR model are shown in Table 1. In Table 1. Summary for Coefficient of GWR Model Parameter Minimun Median Maximum β0 0.9138619705260 0.9138767081168 0.9138948801913 β1 0.0000010683522 0.0000010684401 0.0000010685115 β2 -0.2121692197872 -0.2121678107572 -0.2121659143796 β3 0.0004221212435 0.0004221221469 0.0004221234641 β4 2.0687979109140 2.0687998269883 2.0688021917249 β5 282.8140409500290 282.8165895808340 282.8184768167160 altitude of the sea surface contribute positively Result of GTWR Model to the response variable, this indicates that if By using ISA [9], GTWR modeling has band- these variables increase, then the number of width Optimum h = 3947.54.