SEAM Methodology for Context-Rich Player Matchup Evaluations in Baseball
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Regimes in Baseball Players' Career Data
Data Min Knowl Disc (2017) 31:1580–1621 DOI 10.1007/s10618-017-0510-5 Regimes in baseball players’ career data Marcus Bendtsen1 Received: 28 February 2016 / Accepted: 12 May 2017 / Published online: 27 May 2017 © The Author(s) 2017. This article is an open access publication Abstract In this paper we investigate how we can use gated Bayesian networks, a type of probabilistic graphical model, to represent regimes in baseball players’ career data. We find that baseball players do indeed go through different regimes throughout their career, where each regime can be associated with a certain level of performance. We show that some of the transitions between regimes happen in conjunction with major events in the players’ career, such as being traded or injured, but that some transitions cannot be explained by such events. The resulting model is a tool for managers and coaches that can be used to identify where transitions have occurred, as well as an online monitoring tool to detect which regime the player currently is in. 1 Introduction Baseball has been the de facto national sport of the United States since the late nine- teenth century, and its popularity has spread to Central and South America as well as the Caribbean and East Asia. It has inspired numerous movies and other works of arts, and has also influenced the English language with expressions such as “hitting it out of the park” and “covering one’s bases”, as well as being the origin of the concept of a rain-check. As we all are aware, the omnipresent baseball cap is worn across the world. -
Summaries of Research Using Retrosheet Data
Acharya, Robit A., Alexander J. Ahmed, Alexander N. D’Amour, Haibo Lu, Carl N. Morris, Bradley D. Oglevee, Andrew W. Peterson, and Robert N. Swift (2008). Improving major league baseball park factor estimates. Journal of Quantitative Analysis in Sports, Vol. 4 Issue 2, Article 4. There are at least two obvious problems with the original Pete Palmer method for determining ballpark factor: assumption of a balanced schedule and the sample size issue (one year is too short for a stable estimate, many years usually means new ballparks and changes in the standing of any specific ballpark relative to the others). A group of researchers including Carl Morris (Acharya et al., 2008) discerned another problem with that formula; inflationary bias. I use their example to illustrate: Assume a two-team league with Team A’s ballpark “really” has a factor of 2 and Team B’s park a “real” factor of .5. That means four times as many runs should be scored in the first as in the second. Now we assume that this hold true, and that in two-game series at each park each team scores a total of eight runs at A’s home and two runs a B’s. If you plug these numbers into the basic formula, you get 1 – (8 + 8) / 2 = 8 for A; (2 + 2) / 2 = 2 for B 2 – (2 + 2) / 2 = 2 for A; (8 + 8) / 2 = 8 for B 3 – 8 / 2 = 4 for A; 2 / 8 = .25 for B figures that are twice what they should be. The authors proposed that a simultaneous solving of a series of equations controlling for team offense and defense, with the result representing the number of runs above or below league average the home park would give up during a given season. -
Bill James - Wikipedia
2/11/2021 Bill James - Wikipedia Bill James George William James (born October 5, 1949) is an American baseball writer, historian, and statistician whose work has been Bill James widely influential. Since 1977, James has written more than two dozen books devoted to baseball history and statistics. His approach, which he termed sabermetrics in reference to the Society for American Baseball Research (SABR),[1] scientifically analyzes and studies baseball, often through the use of statistical data, in an attempt to determine why teams win and lose. In 2006, Time named him in the Time 100 as one of the most influential people in the world.[2] In 2003, James was hired as senior advisor on Baseball Operations for the Boston Red Sox.[3] James in 2010 Contents Born George William James Early life October 5, Career The Bill James Baseball Abstracts 1949 Post-Abstracts work Holton, STATS, Inc. Kansas, U.S. Innovations Alma mater University of Acceptance and employment in mainstream baseball Kansas In culture Occupation Historian, Controversies statistician Dowd Report controversy Paterno controversy Known for Sabermetrics Personal life Bibliography Books about James See also Notes References Further reading External links Early life https://en.wikipedia.org/wiki/Bill_James 1/11 2/11/2021 Bill James - Wikipedia James was born in Holton, Kansas; his mother died in 1954 when he was five. His father was a janitor and a handyman. After four years at the University of Kansas (KU) residing at Stephenson Scholarship hall, James joined the Army in 1971. He was the last person in Kansas to be sent to fight in the Vietnam War, although he never saw action there. -
Predicting the Potential of Professional Soccer Players
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Lirias Predicting the Potential of Professional Soccer Players Ruben Vroonen1, Tom Decroos1, Jan Van Haaren2, and Jesse Davis1 1 KU Leuven, Department of Computer Science, Celestijnenlaan 200A, 3001 Leuven, Belgium 2 SciSports, Hengelosestraat 500, 7251 AN Enschede, The Netherlands Abstract. Projecting how a player's skill level will evolve in the future is a crucial problem faced by sports teams. Traditionally, player projec- tions have been evaluated by human scouts, who are subjective and may suffer from biases. More recently, there has been interest in automated projection systems such as the PECOTA system for baseball and the CARMELO system for basketball. In this paper, we present a projec- tion system for soccer players called APROPOS which is inspired by the CARMELO and PECOTA systems. APROPOS predicts the potential of a soccer player by searching a historical database to identify similar play- ers of the same age. It then bases its prediction for the target player's progression on how the similar previous players actually evolved. We evaluate APROPOS on players from the five biggest European soccer leagues and show that it clearly outperforms a more naive baseline. 1 Introduction With more than 250 million players, soccer is the most popular sport in the world. Due to technological advances, new soccer data sources such as event streams and optical tracking from matches are rapidly becoming available. This has lead to an explosion of interest in the area of soccer analytics. Most research tends to focus on analyzing soccer gameplay (e.g., [11, 7, 8, 9, 10]). -
C 2014 RYAN XAVIER PINHEIRO ALL RIGHTS
c 2014 RYAN XAVIER PINHEIRO ALL RIGHTS RESERVED EFFICIENT FREE AGENT SPENDING IN MAJOR LEAGUE BASEBALL A Thesis Presented to The Graduate Faculty of The University of Akron In Partial Fulfillment of the Requirements for the Degree Master of Science Ryan Xavier Pinheiro May, 2014 EFFICIENT FREE AGENT SPENDING IN MAJOR LEAGUE BASEBALL Ryan Xavier Pinheiro Thesis Approved: Accepted: Advisor Dean of the College Dr. James P. Cossey Dr. Chand Midha Faculty Reader Dean of the Graduate School Dr. Stefan Forcey Dr. George R. Newkome Faculty Reader Date Dr. Nao Mimoto Department Chair Dr. Timothy Norfolk ii ABSTRACT During the 2012-2013 offseason, the Cleveland Indians signed a number of high pro- file free agents. Our goal is to determine whether these free agent signings were efficient. We will focus on Michael Bourn and Mark Reynolds for our paper, though our methods would easily apply to other free agents. To accomplish this we use statis- tical analysis to predict each player’s future performance, and we use game theoretic models to analyze the efficiency of the contracts. iii ACKNOWLEDGEMENTS Thanks to my parents, my sister, my extended family around the world, and all my friends for the support and encouragement you have always offered me. I would never have made it to this point without you. Thanks to my professors as well for challenging me intellectually and inspiring with your passion. A special thanks to my advisor, Dr. Cossey, for mentoring me throughout the process of this thesis. It has been more than a pleasure. Lastly, thank you to the Cleveland Indians. -
Baseball Prediction Using Ensemble Learning by Arlo Lyle
Baseball Prediction Using Ensemble Learning by Arlo Lyle (Under the direction of Dr. Khaled Rasheed) Abstract As the salaries of baseball players continue to skyrocket and with the ever-increasing popularity of fantasy baseball, the desire for more accurate predictions of players’ future performances is building both for baseball executives and baseball fans. While most existing work in performance prediction uses purely statistical methods, this thesis showcases research in combining multiple machine learning techniques to improve on current prediction systems by increasing the accuracy of projections in several key offensive statistical categories. By using the statistics of players from the past thirty years, the goal of this research is to more accurately learn from this data how a player’s performance changes over time and apply this knowledge to predicting future performance. Results have shown that using machine learning techniques to predict a player’s performance is comparable to the accuracy seen by some of the best prediction systems currently available. Index words: Machine Learning, Ensemble Learning, Baseball Prediction, Model Trees, Artificial Neural Networks, Support Vector Machines, Bagging, Boosting, Stacking Baseball Prediction Using Ensemble Learning by Arlo Lyle B.S., The University of Tulsa, 2005 A Thesis Submitted to the Graduate Faculty of The University of Georgia in Partial Fulfillment of the Requirements for the Degree Master of Science Athens, Georgia 2007 c 2007 Arlo Lyle All Rights Reserved Baseball Prediction Using Ensemble Learning by Arlo Lyle Approved: Major Professor: Dr. Khaled Rasheed Committee: Dr. Walter D. Potter Dr. Donald Nute Electronic Version Approved: Maureen Grasso Dean of the Graduate School The University of Georgia May 2007 Acknowledgments I would like to thank my advisor Dr.