A Bayesian Computation for the Prediction of Football Match Results Using Artificial Neural Network
Total Page:16
File Type:pdf, Size:1020Kb
International Journal of Scientific Knowledge (Computing and Information Technology) Volume 1 Issue 2, Oct 2012 @2012 IJSK & K.J.A. All rights reserved ISSN 2305-1493 www.ijsk.org A Bayesian Computation for The Prediction Of Football Match Results Using Artificial Neural Network Mehmet Ali Cengiz, Department of Statistics, 19 Mayıs University, Turkey Naci Murat, Department of Statistics, 19 Mayıs University, Turkey Haydar Koç, Department of Statistics, 19 Mayıs University, Turkey Talat Şenel Department of Statistics, 19 Mayıs University, Turkey Email: [email protected], [email protected], [email protected], [email protected] Abstract. The problem of modelling football data has become increasingly popular in the last few years and many different models have been proposed with the aim of estimating the characteristics that bring a team to lose or win a game, or to predict the score of a particular match. We propose a Bayesian model to test its predictive strength on data for the Turkish Super League 2008-2009. We compare the predictive performance of Bayesian approach with a Poisson Loglinear approach and Artificial Neural Network approaches. First statistical model for soccer predictions was 1 Introduction proposed by [8]. He showed that both Poisson distribution and negative binomial distribution Statistical soccer prediction is a method used in provided an adequate fit to results of soccer sports betting, to predict the outcome of soccer games.[9] analyzed the numbers of ball passing matches by means of statistical tools. The between players during matches using negative problem of modelling football [soccer] data has binomial distribution. become increasingly popular in the last few years [7] proposed the first model predicting outcomes and many different models have been proposed of soccer matches between teams with different with the aim of estimating the characteristics that skills using the Poisson distribution. He bring a team to lose or win a game, or to predict developed a model in which the home and away the score of a particular match. team scores follow independent Poisson distributions, with means reflecting the attacking 31 International Journal of Scientific Knowledge (Computing and Information Technology) Volume 1 Issue 2, Oct 2012 @2012 IJSK & K.J.A. All rights reserved ISSN 2305-1493 www.ijsk.org and defensive capabilities of the two teams. [3] gth game of the season (g 1,...,G 306) as y employ a similar approach as [7] to develop a g1 forecasting model capable of generating previous and y g2 respectively. The vector of observed match outcome probabilities. All parameters are counts y (y , y ) is modelled as estimated using data from previous matches only. g1 g2 In the same way, [10] assume that the time- independent Poisson: varying attacking and defensive parameters of all (ygj gj ~ Poisson ( gj )) (1) teams vary randomly over time. where the parameters (g1,g2 ) represent [2] proposed the methods for modelling the home the scoring intensity in the g -th game for the field advantage factor. [6] showed empirical low team playing at home ( j 1) and away ( j 2) , levels of correlation between the numbers of respectively. We model these parameters goals scored in a single game by the two according to a formulation that has been used opponents. [5] advocated the use of a bivariate widely in the statistical literature [5], [6] Poisson distribution that has a more complicated assuming a log-linear random effect model: formulation for the likelihood function, and includes an additional parameter explicitly logθg1 home att h(g) defa(g) accounting for the covariance between the goals (2) scored by the two competing teams. They specify logθg2 att a(g) defh(g) the model in a frequents framework, and their The parameter home represents the advantage for main purpose is the estimation of the effects used the team hosting the game and we assume that to explain the number of goals scores [1]. this effect is constant for all the teams and Bayesian methods can be used to update the prior throughout the season. In addition, the scoring estimates of these parameters as new match intensity is determined jointly by the attack and results information is received, and Markov chain defence ability of the two teams involved, Monte Carlo iterative simulation techniques are represented by the parameters att and def , used for inference. respectively. The nested indexes h(g), a(g) 1,...,T identify the team that is In this paper we use the same model in [5], which is a Bayesian approach in WinBUGS [11], which playing at home (away) in the g th game of the has become the standard software for Bayesian season. analysis for predicting match outcomes for The data structure for the model is presented in Turkish super League for the season 2008-2009 TABLE I and consist of the name and code of the and compare the results with a Poisson Loglinear teams, and the number of goals scored for each approach and Artificial Neural Network game of the season. As is possible to see, the approaches. indexes h(g) and a(g) are uniquely associated with one of the 18 teams. 2 Model In line with the Bayesian approach, we have to specify some suitable prior distributions for all We use the same model in [5]. We first consider the random parameters in our model. The variable Turkish super League for the season 2008-2009. home is modelled as a fixed effect, assuming a The league is made by a total of T = 18 teams, standard flat prior distribution (notice that we use playing each other twice in a season (one at home here the typical notation to describe the Normal and one away). We indicate the number of goals distribution in terms of the mean and the precision): scored by the home and by the away team in the home ~ Normal(0,0.0001) (3) 32 International Journal of Scientific Knowledge (Computing and Information Technology) Volume 1 Issue 2, Oct 2012 @2012 IJSK & K.J.A. All rights reserved ISSN 2305-1493 www.ijsk.org Conversely, for each t 1,...,T , the team- Models in sports are used mainly for prediction. specific effects are modelled as exchangeable Here we briefly illustrate how we can obtain from a common distribution: predictions for six future games. The approach can be easily generalized to additional games attt ~ Normal( att , att ) , using the same approach. Posterior probabilities of each predicted outcome are summarized in def ~ Normal( , ) t def def Table 3. Outcome probabilities indicate As suggested by various works, we need to Antalyaspor’s probability of winning the game impose some identifiably constraints on the team- against MKE Ankaragücü is about 43%. specific parameters. In line with [5], we use a Concerning the second game (Trabzonspor vs. sum-to-zero constraint, that is Fenerbahçe), the posterior model probabilities confirm that the posterior probability of away T att 0 , T def 0. (4) t1 t t1 t team is greater than the posterior probability of However, we also assessed the performance of the home team. model using a corner constraint instead, in which the team-specific effect for only one team are set In order to compare the predictive accuracy, Poisson loglinear approach, two Artificial Neural att1 0 def1 0 to 0, for instance and . Even if Network approaches were also considered. The this latter method is slightly faster to run, the main concept is to use the number of goals interpretation of these coefficients is incremental scored by the home and by the away team and with respect to the baseline identified by the team being home team effect as input parameters, to associated with an attacking and defending predict the number of goals that will be scored by strength of 0 and therefore is less intuitive. the home and by the away team for future game Finally, the hyper-priors of the attack and defence (output parameter). In our study a feed forward effects are modelled independently using again multi layer network architecture (ANN1) and a flat prior distributions: radial basis function approach (ANN2) were ~ Normal(0,0.000.1) , employed which are widely used in ANN att applications. For training and testing the proposed def ~ Normal(0,0.000.1) ANN model the overall data set was randomly (5) divided into two separate sets. The first set, att ~ Gamma (0.1,0.1) , namely the training set, consisted of 75% data records while the remaining 25% of the data ~ Gamma (0.1,0.1) def records formed the test set. For comparison, the overall data set was divided as the same way with 3 Application ANN approach for Bayesian approaches. Table 4 provides RMSE and MAPE values for all models Posterior summaries of the Poisson log-linear with three different approaches. model parameters are provided in TABLE I. As we can see from the estimated model Fenerbahce had the highest attacking parameter while 4 Conclusions Kayserispor had the lowest (i.e. best) defensive parameter. In this study, we give some details for a Bayesian Approach given by Karlis and Ntzoufras (2003) Posterior credible intervals of attacking and to predict future match scores for Turkish super defensive parameters for each team are depicted League [5]. We obtained fairly good posterior in Fig. 1 and 2 model probabilities for unplayed six matches. We also used a Poisson Loglinear approach and two Artificial Neural Network approaches to compare 33 International Journal of Scientific Knowledge (Computing and Information Technology) Volume 1 Issue 2, Oct 2012 @2012 IJSK & K.J.A. All rights reserved ISSN 2305-1493 www.ijsk.org with the Bayesian Poisson Loglinear approach 8. Moroney M J (1956).