India's Take on Sports Analytics

Total Page:16

File Type:pdf, Size:1020Kb

India's Take on Sports Analytics PSYCHOLOGY AND EDUCATION (2020) 57(9): 5817-5827 ISSN: 00333077 India’s Take on Sports Analytics Rohan Mehta1, Dr.Shilpa Parkhi2 Student, Symbiosis Institute of Operations Mangement, Nashik, India Deputy Director, Symbiosis Institute of Operations Mangement, Nashik, India Email Id: [email protected] ABSTRACT Purpose – The aim of this paper is to study what is sports analytics, what are the different roles in this field, which sports are prominently using this, how big data has impacted this field, how this field is shaping up in Indian context. Also, the aim is to study the growth of job opportunities in this field, how B-schools are shaping up in this aspect and what are the interests and expectations of the B-school grads from this sector. Keywords Sports analytics, Sabermetrics, Moneyball, Technologies, Team sports, IOT, Cloud Article Received: 10 August 2020, Revised: 25 October 2020, Accepted: 18 November 2020 Design Approach analysis, he had done on approximately 10000 deliveries. Another writer, for one of the US The paper starts by explaining about the origin of magazines, F.C Lane was of the opinion that the sports analytics, the most naïve form of it, then batting average of the individual doesn’t reflect moves towards explaining the evolution of it over the complete picture of the individual’s the years (from emergence of sabermetrics to the performance. There were other significant efforts most advanced applications), how it has spread made by other statisticians or writers such as across different sports and how the applications of George Lindsey, Allan Roth, Earnshaw Cook till it has increased with the advent of different 1969. In this year The Baseball Encyclopedia was enabling technologies. Then the Indian context is published by Macmillan Inc, founded by George studied, in terms of opportunities, supply and Brett. This encyclopedia was a comprehensive demand, growth of the field, how the B-schools collection of the major-league baseball statistics are faring in this context, initiatives by right from the year 1871. This paved the way for independent institutions and lastly the survey of further research which was carried out by different the B-school grads to understand their knowledge writers. Bill James was one among them, who about the sector, interests, exposure and coined the term sabermetrics. Sabermetrics also expectations from it. known as SABRmetrics (Society for American Introduction Baseball Research) can be understood as analysis of the actual (empirical) data to assist in decision Sports analytics in nutshell, is capturing the making. Bill James’s Baseball Abstract from 1982 required data with the help of technology, then till 1988 included his work on sabermetrics and its running the data through statistical model, tools fundamentals. There were other sabermetricians and visualization to provide insights into player such as Pete Palmer, who along with John Thorm, performances and assist in giving published The Hidden Game of Baseball – which recommendations to the player or the team. had a summary of the sabermetric principles. Though Henry Chadwick ,one of the pioneer Apart from this it also had few associations with writers of baseball, who was inducted in the the work done by F.C Lane. In 2002, Bill James national baseball hall of fame in 1938 [31] wrote Win Shares, which he summarized the invented the box score in 1859 which is used to performance of every player in the major league evaluate players, the initial efforts in sports by a single number for every season, which analytics (emerged from baseball) started in the succeeded Palmer’s Total Player Rating. early years of 20th century, with Hugh Fullerton predicting the outcome of the world series [26]. In the year 2003, Michael Lewis wrote Moneyball Later on, in the year 1910, he published an article that gave insights on the work done by Oakland “The Inside Game”, where he gave insights to the athletics and its general manager Billy Beans. It 5817 www.psychologyandeducation.net PSYCHOLOGY AND EDUCATION (2020) 57(9): 5817-5827 ISSN: 00333077 was from this moment, where teams understood while he was working for Roke Manor research. the importance of sabermetrics and how teams can Hawk-eye uses cameras (for triangulation and improve their performances by analyzing the data. creating 3d image), speed gun for measuring the By 2012, almost all the Major league Baseball speed of the ball, thereby predicts the trajectory of teams had employed at least one statistician the ball. Over the years other applications of (sabermetrician). analytics have been introduced, such as Literature Review performance analysis of the players from a specific country in IPL using cluster analysis[11], Having looked at the emergence of sports player ranking with respect to the performance in analytics, now the further parts will concentrate different formats of the game [15], team selection on the nuances of the field, divergence to different by using a multi-criteria and multi-objective sports, impact of technologies and other points as decision making approach [15]. Another mentioned previously. The study focuses on the application is in predicting the out-comes of the team sports which are considered to be highly game (the dynamic win percentage of the teams valued or rated in terms of financial metrics [2]. contesting) or predicting the score of the team From the application point of view the work done batting. WASP (Winning And Score Predictor) is by researchers, major leagues in the sports – one such technique used to predict the scores and standards adopted by them, individually on team probable outcomes of the match [19]. Other basis or as a league and the work done by the technique predicts the outcome of the game companies who have published the white papers considering the player strength and weaknesses, has been explored. comparing it with the opposition and recommends The Sports analytics market team selection [9]. Another application is predicting or calculating the correct/winning score The worldwide sports market is expected to reach for the team batting second (in limited overs approximately the value of US $4 billion by 2022 cricket) in case the match is called off due to [47] and US $5.2 billion by 2024 [48]. Sports inclement weather. ICC has formally adopted the analytics is majorly concerned with Data driven DLS (Duckwroth Lewis Stern method) since decision making (to assist coaches, managers, before the 2015 world-cup [39]. Over the years players etc) and predictive analytics (to predict the the different statistical measures or KPIs are used outcome of games, performance of individuals to assess the player’s performance – strengths, etc). The market, based on the application is weakness etc. From batting average to finding out prominently stratified into performance analysis, the most preferred scoring shot or the favorable player safety & fitness, valuation, fan engagement scoring shot of the player [espn] different metrics and broadcast management. The performance are used by coaches, players, oppositions to devise analysis and fan engagement are the two areas a suitable strategy. growing at a rapid pace [48], due to the increase in demand for the relevant metrics and data analysis b. Football by coaches for improving the team or player Football being more past paced game than cricket performance and by board management for has more need of real time data analysis than increasing the revenues by engaging the fans in cricket. Also, the way the sport has been the most effective way. The market is also structured with teams playing at least 30 league stratified on the basis of the sport – team sport & games in a season (for leagues that have at least individual sport. The team sport market is 15 teams), other domestic matches, champions expected to grow more rapidly than the individual league and players also representing their national sport segment [48]. side, the data being generated is huge and it needs Different sports and its applications in-memory analytics [37]. Approximately 6000 videos are added into the database of a company a. Cricket every month and around 500 people are employed The early forms of visual analytics in cricket was to analyze it frame by frame [30]. English Premier observed with the emergence of hawk-eye in 2001 league, had a tie-up with opta sports as a media [36]. Hawk-eye developed by Dr. Paul Hawkins, data partner for 2017-18 season, where opta sports could collect data and was also made publicly 5818 www.psychologyandeducation.net PSYCHOLOGY AND EDUCATION (2020) 57(9): 5817-5827 ISSN: 00333077 available to different media [27]. Opta had earlier effective fan engagement is also one of the introduced an advanced metric system which had areas that come under sports analytics. David parameters such as expected goals, expected Johnson once said that it is important for NHL and assists etc which calculates the expected goals or broadcasters to incorporate data to tell stories, assists a player should have had during the match instead of considering coach’s or analysts opinion based on different parameters. This is somewhere as gospel. He wanted data to back the storyline between predictive analytics and descriptive [28]. Up to 2010 analytics in NHL wasn’t looked analytics. Also, other applications include finding upon with great respect. [2]. But now most of the out or exploiting the weakness of the opposition NHL teams have analytics staff and some have team by analyzing their matches against other even recruited academic statisticians for the same. teams who were able to do well against them, There are even few R packages available that creating game plans for specific instances or provide the data for processing NHL games. One scoring opportunities – during a freekick or a of the areas that has been explored in NHL is the corner [30].
Recommended publications
  • The Numbers Game: Baseball's Lifelong Fascination with Statistics (Book Review)
    The Numbers Game: Baseball's Lifelong Fascination with Statistics (Book Review) The American Statistician May 1, 2005 | Cochran, James J. The Numbers Game: Baseball's Lifelong Fascination with Statistics. Alan SCHWARZ. New York: Thomas Dunne Books, 2004, xv + 270 pp. $24.95 (H), ISBN: 0‐312‐322222‐4. I am amazed at how frequently I walk into a colleague's office for the first time and spot a copy of The Baseball Encyclopedia or Total Baseball on a bookshelf. How many statisticians developed and nurtured their interest in probability and statistics by playing Strat‐O‐ Matic[R] and/or APBA[R] baseball board games; reading publications like the annual The Bill James Baseball Abstract; or scanning countless box scores and current summary statistics in the Sporting News or in the sports section of the local newspaper, looking for an edge in a rotisserie baseball league? For many of us, baseball is what initially opened our eyes to the power of probability and statistics, and the sport continues to be an integral part of our lives (for evidence of this, attend the sessions or business meeting of the ASA's Statistics in Sports section at the next Joint Statistical Meetings). This is why so many probabilists and statisticians will enjoy reading Alan Schwarz's The Numbers Game: Baseball's Lifelong Fascination with Statistics. In this book, Schwarz effectively chronicles the ongoing association between baseball and statistics by describing the evolution of the sport from the mid‐nineteenth century through the beginning of the twenty‐first century, and looking at several of its most influential characters.
    [Show full text]
  • Signature Redacted Author
    1 Using Machine Learning to Derive Insights from Sports Location Data by Joel Brooks Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY June 2018 @ Massachusetts Institute of Technology 2018. All rights reserved. Signature redacted Author.. - -- 'A 1- - .... ..................... A5 epartment of Electrical Engineering and Computer Science April 2, 2018 CertifiedCetiid byySignature redacted ....... ... ... .. ..... John Guttag Dugald C. Jackson Professor Thesis Supervisor Signature redacted Accepted by ........... I ((J Leslie A. Kolodziejski Professor of Electrical Engineering and Computer Science Chair, Department Committee on Graduate Students MASSACHUSETTS INSTITUTE OF TECHNOLOGY JUN 18 2018 LIBRARIES ARCHIVES 77 Massachusetts Avenue Cambridge, MA 02139 MITLibraries http://Iibraries.mit.edu/ask DISCLAIMER NOTICE Due to the condition of the original material, there are unavoidable flaws in this reproduction. We have made every effort possible to provide you with the best copy available. Thank you. The images contained in this document are of the best quality available. 2 Using Machine Learning to Derive Insights from Sports Location Data by Joel Brooks Submitted to the Department of Electrical Engineering and Computer Science on April 2, 2018, in partial fulfillment of the requirements for the degree of Doctor of Philosophy Abstract Historically, much of sports analytics has aimed to find relationships between discrete events and outcomes. The availability of high-resolution event location and tracking data has led to many new opportunities in sports research. However, it is often challenging to apply machine learning to understand a particular aspect of a sport.
    [Show full text]
  • Sabermetrics Over Time: Persuasion and Symbolic Convergence Across a Diffusion of Innovations
    SABERMETRICS OVER TIME: PERSUASION AND SYMBOLIC CONVERGENCE ACROSS A DIFFUSION OF INNOVATIONS BY NATHANIEL H. STOLTZ A Thesis Submitted to the Graduate Faculty of WAKE FOREST UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS Communication May 2014 Winston-Salem, North Carolina Approved By: Michael D. Hazen, Ph. D., Advisor John T. Llewellyn, Ph. D., Chair Todd A. McFall, Ph. D. ii Acknowledgments First and foremost, I would like to thank everyone who has assisted me along the way in what has not always been the smoothest of academic journeys. It begins with the wonderful group of faculty I encountered as an undergraduate in the James Madison Writing, Rhetoric, and Technical Communication department, especially my advisor, Cindy Allen. Without them, I would never have been prepared to complete my undergraduate studies, let alone take on the challenges of graduate work. I also want to thank the admissions committee at Wake Forest for giving me the opportunity to have a graduate school experience at a leading program. Further, I have unending gratitude for the guidance and patience of my thesis committee: Dr. Michael Hazen, who guided me from sitting in his office with no ideas all the way up to achieving a completed thesis, Dr. John Llewellyn, whose attention to detail helped me push myself and my writing to greater heights, and Dr. Todd McFall, who agreed to assist the project on short notice and contributed a number of interesting ideas. Finally, I have many to thank on a personal level.
    [Show full text]
  • 投稿類別:英文寫作 篇名: Data Science and Analysis Are Changing
    投稿類別:英文寫作 篇名: Data Science and Analysis are Changing the Baseball 作者: 林俊沂。私立僑泰高中。高二 1 班 指導老師: 胡雯俐老師 Data Science and Analysis are Changing the Baseball I. Introduction 1. Motivation Whoever relishes baseball would spend valuable time paying highly constant attention to it. I am no exception. Reading articles or watching programs like MLB almost occupy my free time. I notice that understanding the statistics about baseball is essential because statistics is the most objective ways to define player’s capability. Although baseball statistics sometimes takes a lot of time to grasp, it is actually fun to strengthen the baseball knowledge and to acquaint the influence on players with data science. Therefore, I decide to learn more about baseball statistics and try to show its phenomena and effects through this research. 2. Purpose This research aims at the revolution of the baseball statistics with its origin and the use in the few years. I also sort out some opposite of baseball statistics to distinguish two different views of the topic to annotate it deeper. Through two-side arguments, the research aims to present the influence of the baseball statistics and explain it. II. Body 1. Sabermetrics Sabermetrics is a baseball statistics that can make objective analysis of baseball activities, as for the interpretation and evaluation of baseball statistics during baseball games. The term coined by Bill James, is derived from the acronym SABR which stands for the Society for America Baseball Research and is rooted with metrics. 1.1. The early history of Sabermetrics The first baseball statistics way to describe the baseball activity called box scores, developed by Henry Chadwick in 1858.
    [Show full text]
  • SABR Biblio News Son and Dover Publications About Possible Books Dover Might Include in a Projected Reprint Series
    Society for American Baseball Research BIBLIOGRAPHY COMMITTEE NEWSLETTER September 2008 (08—3) ©2008 Society for American Baseball Research Opinions expressed by contributors do not necessarily reflect the position or official policy of SABR or its Bibliography Committee Editor: Ron Kaplan (23 Dodd Street, Montclair, NJ 07042, 973-509-8162, [email protected]) In the last newsletter, I included a note from Paul Dick- SABR Biblio News son and Dover publications about possible books Dover might include in a projected reprint series. So far, the nomi- nees have included Every Diamond Doesn’t Sparkle (Fresco Comments from the Chair Thompson and Cy Rice), Dodger Daze and Knights (Tom- my Holmes), Baseball and the Cold War (Howard Senzel), Cleveland proved to be another excellent convention, Percentage Baseball (Earnshaw Cook), Ban Johnson: Czar of Baseball (Eugene Murdock), and 100 Years of Baseball with a decent hotel with a wide range of places to eat nearby (Lee Allen). If you have any further suggestions, please and a pleasant walk to the ballpark. Our committee meeting was graced with the presence send them to Paul ([email protected]) with a copy to me ([email protected]). of Frank Phelps, our founding chair, who hadn’t been able to get to a convention in some years. Frank attended the .Andy McCue Chair, Bibliography Committee committee meeting and took part in a number of convention activities. On Thursday evening, attendees could go over to the Western Reserve Historical Society for a tour of the SABR archives and the Frank Phelps Collection of baseball research materials, which Frank donated some years ago.
    [Show full text]
  • Major League Baseball Yesterday and Today
    THE CHANGING GAME REVISITED: Major League Baseball Yesterday and Today. BOB SAWYER ABSTRACT Retrosheet’s expanding data base no provides data about Reaching base On Error and Innings batted back to 1916. Section Two demonstrates how this information enables deduction of base running outs for teams and leagues Section Three extends the format of The Baseball Encyclopedia’s “The Changing Game” to opposition pitching and fielding . The six tables create a statistical profile of play during the 1916, 1921, 1971 and 2019 seasons. The tables identify areas in which Major League baseball changed rapidly between 1916 and 1921 and then continued changing at a more evolutionary pace for another century. Section One: THE VOCABULARY OF TABLETOP SUCCESS: Sports Illustrated Baseball™ is a board game in which statistics-based player-charts interact in simulation of baseball.1 Board gamers control of batting orders and substitutions, with dice rolls determining the outcomes of their choices to bunt or take a gamble on the base paths. Winning SI Baseball is the result of brains and luck rather than ability to throw, catch or hit a real baseball. And yet the game is so well-designed that good tactics for real baseball are nearly always just as useful for an SI Baseball manager. Fly outs, ground outs, singles, and doubles are subdivided in SI Baseball to allow differing amounts of potential advancement by base runners. The safe on error result designated by “E” has the same effect on batters and base runners as the type of single designated by “1”. The results of “WP”, “PB”, and BK(for Balk) are likewise interchangeable in Si Baseball.
    [Show full text]
  • Out of Left Field Evidence for and Against Baseball’S Conventional Wisdom
    Olin College Out of Left Field Evidence For and Against Baseball’s Conventional Wisdom Christopher Joyce 10/4/2012 This project aims to map baseball conventional wisdom to facts, and see if the things that ‘everybody knows’ about baseball are supported or contradicted by statistical evidence. I found that most baseball conventional wisdom does not line up with statistics, and in some cases is outright contradicted. Contents Background & Dataset .................................................................................................................................. 2 Exploration #1: Walks per Country .............................................................................................................. 4 Exploration #2: Good-Fielding Shortstops ................................................................................................... 6 Exploration #3: Knuckleballers and Control ................................................................................................. 8 Conclusions ................................................................................................................................................. 10 Bibliography ................................................................................................................................................ 11 Background & Dataset Baseball is a sport with a 140 year old history, but it’s only in the past 30 years or so that there’s been much focus on analyzing baseball with statistics (an approach called sabermetrics), rather than
    [Show full text]
  • How to Make a Fortune in Bull, Bear,And Black
    Thinking Outside the Box 5 How to hit home runs: I swing as hard as I can, and I try to swing right through the ball . The harder you grip the bat, the more you can swing it through the ball, and the farther the ball will go. I swing big, with everything I’ve got. I hit big or I miss big. I like to live as big as I can. —Babe Ruth What is striking is that the leading thinkers across varied felds—including horse betting, casino gambling, and investing—all emphasize the same point. We call it the Babe Ruth effect: even though Ruth struck out a lot, he was one of baseball’s greatest hitters. —Michael J. Mauboussin1 Lenny [Dykstra] didn’t let his mind mess him up. Only a psychological freak could approach a 100-mph Since the frst edition of Trend Following, sports analytics has fastball aimed not all that exploded. In the last decade professional sports have undergone a far from his head with total confdence. “Lenny remodeling, with teams scrambling to change strategies to accommodate was so perfectly designed, untold new trends in statistical analysis. There haven’t necessarily been emotionally, to play the major rule changes, nor have there been any substantial changes to the game of baseball. venues or the equipment. Instead, the renaissance is rooted in an uncon- He was able to instantly ventional process k nown as sabermetrics.3 forget any failure and draw strength from every Today, every major professional sports team either has an analyt- success.
    [Show full text]
  • Baumer and Zimablist Sabermetric Revolution
    The Sabermetric Revolution Benjamin Baumer, Andrew Zimbalist Published by University of Pennsylvania Press Benjamin Baumer. and Andrew Zimbalist. The Sabermetric Revolution: Assessing the Growth of Analytics in Baseball. Philadelphia: University of Pennsylvania Press, 2013. Project MUSE. Web. 21 Aug. 2015.http://muse.jhu.edu/. For additional information about this book http://muse.jhu.edu/books/9780812209129 Access provided by University of Michigan @ Ann Arbor (4 Dec 2015 21:10 GMT) PREFACE ichael Lewis wrote Moneyball because he fell in love with a story. Te Mstory is about how intelligent innovation (the creative use of statistical analysis) in the face of market inefciency (the failure of all other teams to use available information productively) can overcome the unfairness of baseball economics (rich teams can buy all the best players) to enable a poor team to slay the giants. Lewis is an engaging storyteller and, along the way, introduces us to intriguing characters who carry forward the rags-to-riches plot. By the end, the story of the Oakland A’s and their general manager, Billy Beane, is so well told that we believe its portrayal of baseball history, economics, and competitive success. Te result is a new Horatio Alger tale that reinforces a beloved American myth and, all the better, applies to our national pastime. Te appeal of Lewis’s Moneyball was sufciently strong that Hollywood wanted a piece of the action. With a compelling script, smart direction, and the handsome Brad Pitt as Beane, Moneyball became part of mass culture and its perceived validity—and its legend—only grew. Tis book will attempt to set the record straight on Moneyball and the role of “analytics” in baseball.
    [Show full text]
  • Open Victoria Decesare Thesis - Final.Pdf
    THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF STATISTICS USING CONVENTIONAL AND SABERMETRIC BASEBALL STATISTICS FOR PREDICTING MAJOR LEAGUE BASEBALL WIN PERCENTAGE VICTORIA DECESARE SPRING 2016 A thesis submitted in partial fulfillment of the requirements for a baccalaureate degree in Science with honors in Statistics Reviewed and approved* by the following: Andrew Wiesner Lecturer of Statistics Thesis Supervisor David Hunter Department Head of Statistics Honors Adviser * Signatures are on file in the Schreyer Honors College. i ABSTRACT Major League Baseball is dominated by statistical analysis; one cannot watch a baseball game on the television without hearing and seeing a plethora of statistics such as batting average, runs batted in, earned run average, and the list goes on. In addition to these popular stats that most people are familiar with, there are several, more complex baseball statistics – known as “sabermetric” statistics – that have been developed over the past few decades that seek to evaluate players and the game more scientifically and comprehensively. However, with all of these stats available, it is easy to get caught up in the data and overlook the main goal of MLB teams: to win games. With this in mind, the goal of this research is to explore some of the numerous baseball statistics available, both the traditional and modern ones, and observe which ones are truly the best at predicting wins. Encompassing this, is it better to use the more complex methods in analyzing how teams win, or does it hold true that “less is more”? This research seeks to answer these questions and to provide a unique perspective for fans and managers alike when trying to make use of the ever-growing world of baseball data.
    [Show full text]
  • Alesandrini, Zach 22007 Caughron.Pdf (1.404Mb)
    NORTHERN ILLINOIS UNIVERSITY How Winning Teams Become Championship Teams: a Baseball Team Architectural Analysis A Thesis Submitted to the University Honors Program In Partial Fulfillment of the Requirements of the Baccalaureate Degree With University Honors Department of Management By Zach Alesandrini DeKalb, Illinois May, 2007 ABSTRACT The purpose of this project was to understand why certain Major League Baseball teams are extremely successful during Major League Baseball's regular season, but often do not have that same success in the play-offs. Similarly, why do certain teams have tremendous success in the play-offs? This project analyzed the architecture of Major League Baseball teams that had success in the regular season to teams that had success in the post season. More specifically, the teams were separated into two groups: teams that were Regular Season champions and teams that were World Series champions. The teams were analyzed and compared on the basis of how they distributed their payroll and statistics within their rosters. The analysis showed that there were differences between the two groups. There were also many similarities between the two groups which proved that successful teams are built in many of the same ways. Finally, recommendations were made on how a team can build its roster to have not only a winning team, but a championship team. 11 TABLE OF CONTENTS Page LIST OF APPENDICES iii INTRODUCTION : 1 REVIEW OF LITERATURE 4 METHODS 17 RESULTS 32 DISCUSSION 43 CONCLUSIONS 49 BffiLIOGRAPHY 53 111 TABLE OF APPENDICES Page APPENDIX A - Team Rosters and Base Data 55 APPENDIX B - Individual Player Payroll Information and Distribution 70 APPENDIX C - IndividualPlayer Statistical Information and Distribution 86 Alesandrini 1 INTRODUCTION Every year, in late February, Major League Baseball (MLB) players report to their respective camps in Arizona or Florida for Spring Training.
    [Show full text]
  • A Regression Model Using Common Baseball Statistics to Project Offensive and Defensive Efficiency
    REGRESSION PLANES TO IMPROVE THE PYTHAGOREAN PERCENTAGE A regression model using common baseball statistics to project offensive and defensive efficiency by Dennis Moy A thesis submitted in fulfillment of the requirements for the degree of honors in Statistics University of California - Berkeley 2006 UNIVERSITY OF CALIFORNIA - BERKELEY ABSTRACT Prediction Planes for the Pythagorean Percentage by Dennis Moy Advisor: Professor David Aldous Department of Statistics In 1985, Bill James, arguably the most renowned analytical baseball statistician, devised a very simple, but effective formula that predicted a team’s winning percentage given its runs scored and runs allowed. Despite its remarkable accuracy, this model, coined Pythagorean expectation, was used primarily on seasons of the past rather than performance forecasts. This thesis develops prediction models for runs scored and runs allowed that will be converted by Pythagorean expectation to winning percentages. Data from the past twenty years were taken from four different sources of baseball statistics via the internet to produce 562 arrays that underwent computations through GRETL to create two different ordinary least-squares regression planes (offense and defense). The GRETL outputs yielded robust models that had strong positive R2 results with significant F-statistics from the Wald test that evaluated the planes’ goodness of fit, which with a potentially adjusted Pythagorean expectation, can now forecast future winning percentages. Armed with this knowledge and a little calculus,
    [Show full text]