openWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball Benjamin S. Baumer Shane T. Jensen Smith College The Wharton School
[email protected] University of Pennsylvania
[email protected] Gregory J. Matthews Loyola University Chicago
[email protected] March 25, 2015 Abstract Within sports analytics, there is substantial interest in comprehensive statistics intended to capture overall player performance. In baseball, one such measure is Wins Above Replace- ment (WAR), which aggregates the contributions of a player in each facet of the game: hitting, pitching, baserunning, and fielding. However, current versions of WAR depend upon propri- etary data, ad hoc methodology, and opaque calculations. We propose a competitive aggregate measure, openW AR, that is based on public data, a methodology with greater rigor and trans- parency, and a principled standard for the nebulous concept of a \replacement" player. Finally, we use simulation-based techniques to provide interval estimates for our openW AR measure that are easily portable to other domains. Keywords: baseball, statistical modeling, simulation, R, reproducibility 1 Introduction In sports analytics, researchers apply statistical methods to game data in order to estimate key quantities of interest. In team sports, arguably the most fundamental challenge is to quantify the contributions of individual players towards the collective performance of their team. In all sports the arXiv:1312.7158v3 [stat.AP] 24 Mar 2015 ultimate goal is winning and so the ultimate measure of player performance is that player's overall contribution to the number of games that his team wins. Although we focus on a particular measure of player contribution, wins above replacement (WAR) in major league baseball, the issues and approaches examined in this paper apply more generally to any endeavor to provide a comprehensive measure of individual player performance in sports.