Sabermetrics - Statistical Modeling of Run Creation and Prevention in Baseball Parker Chernoff [email protected], [email protected]
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
NCAA Division I Baseball Records
Division I Baseball Records Individual Records .................................................................. 2 Individual Leaders .................................................................. 4 Annual Individual Champions .......................................... 14 Team Records ........................................................................... 22 Team Leaders ............................................................................ 24 Annual Team Champions .................................................... 32 All-Time Winningest Teams ................................................ 38 Collegiate Baseball Division I Final Polls ....................... 42 Baseball America Division I Final Polls ........................... 45 USA Today Baseball Weekly/ESPN/ American Baseball Coaches Association Division I Final Polls ............................................................ 46 National Collegiate Baseball Writers Association Division I Final Polls ............................................................ 48 Statistical Trends ...................................................................... 49 No-Hitters and Perfect Games by Year .......................... 50 2 NCAA BASEBALL DIVISION I RECORDS THROUGH 2011 Official NCAA Division I baseball records began Season Career with the 1957 season and are based on informa- 39—Jason Krizan, Dallas Baptist, 2011 (62 games) 346—Jeff Ledbetter, Florida St., 1979-82 (262 games) tion submitted to the NCAA statistics service by Career RUNS BATTED IN PER GAME institutions -
Sports Analytics: Maximizing Precision in Predicting Mlb Base Hits
SPORTS ANALYTICS: MAXIMIZING PRECISION IN PREDICTING MLB BASE HITS Pedro Miguel Gonçalves André Alceo Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management NOVA Information Management School Instituto Superior de Estatística e Gestão de Informação Universidade Nova de Lisboa SPORTS ANALYTICS: MAXIMIZING PRECISION IN PREDICTING MLB BASE HITS Pedro Miguel Gonçalves André Alceo Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, Specialization Knowledge Management and Business Intelligence Advisor: Roberto André Pereira Henriques February 2019 ACKNOWLEDGEMENTS The completion of this master thesis was the culmination of my work accompanied by the endless support of the people that helped me during this journey. This paper would feel emptier without somewhat expressing my gratitude towards those who were always by my side. Firstly, I would like to thank my supervisor Roberto Henriques for helping me find my passion for data mining and for giving me the guidelines necessary to finish this paper. I would not have chosen this area if not for you teaching data mining during my master’s program. To my parents João and Licínia for always supporting my ideas, being patient and for along my life giving the means to be where I am right now. My sister Rita, her husband João and my nephew Afonso, whose presence was always felt and helped providing good energies throughout this campaign. To my training partner Rodrigo who was always a great company, sometimes an inspiration but in the end always a great friend. My colleagues Bento, Marta and Rita who made my master’s journey unique and very enjoyable. -
Sabermetrics: the Past, the Present, and the Future
Sabermetrics: The Past, the Present, and the Future Jim Albert February 12, 2010 Abstract This article provides an overview of sabermetrics, the science of learn- ing about baseball through objective evidence. Statistics and baseball have always had a strong kinship, as many famous players are known by their famous statistical accomplishments such as Joe Dimaggio’s 56-game hitting streak and Ted Williams’ .406 batting average in the 1941 baseball season. We give an overview of how one measures performance in batting, pitching, and fielding. In baseball, the traditional measures are batting av- erage, slugging percentage, and on-base percentage, but modern measures such as OPS (on-base percentage plus slugging percentage) are better in predicting the number of runs a team will score in a game. Pitching is a harder aspect of performance to measure, since traditional measures such as winning percentage and earned run average are confounded by the abilities of the pitcher teammates. Modern measures of pitching such as DIPS (defense independent pitching statistics) are helpful in isolating the contributions of a pitcher that do not involve his teammates. It is also challenging to measure the quality of a player’s fielding ability, since the standard measure of fielding, the fielding percentage, is not helpful in understanding the range of a player in moving towards a batted ball. New measures of fielding have been developed that are useful in measuring a player’s fielding range. Major League Baseball is measuring the game in new ways, and sabermetrics is using this new data to find better mea- sures of player performance. -
2017 Altoona Curve Final Notes
EASTERN LEAGUE CHAMPIONS DIVISION CHAMPIONS PLAYOFF APPEARANCES PLAYERS TO MLB 2010, 2017 2004, 2010, 2017 2003, 2004, 2005, 2006, 144 2010, 2015, 2016, 2017 THE 19TH SEASON OF CURVE BASEBALL: Altoona finished the season eight games over .500 and won the Western 2017 E.L. WESTERN STANDINGS Division by two games over the Baysox. The title marked the franchise's third regular-season division championship in Team W-L PCT GB franchise history, winning the South Division in 2004 and the Western Division in 2010. This year, the Curve led the Altoona 74-66 .529 -- West for 96 of 140 games and took over sole possession of the top spot for good on August 22. The Curve won 10 of Bowie 72-68 .514 2.0 their final 16 games, including a regular-season-best five-game winning streak from August 20-24. Altoona clinched the Western Division regular-season title with a walk-off win over Harrisburg on September 4 and a loss by Bowie at Akron 69-71 .493 5.0 Richmond that afternoon. The Curve advanced to the ELCS with a three-game sweep over the Bowie Baysox in the Erie 65-75 .464 9.0 Western Division Series. In the Championship Series, the Curve took the first two games in Trenton before beating Richmond 63-77 .450 11.0 the Thunder, 4-2, at PNG Field on September 14 to lock up their second league title in franchise history. Including the Harrisburg 60-80 .429 14.0 regular season and the playoffs, the Curve won their final eight games, their best winning streak of the year. -
Understanding Advanced Baseball Stats: Hitting
Understanding Advanced Baseball Stats: Hitting “Baseball is like church. Many attend few understand.” ~ Leo Durocher Durocher, a 17-year major league vet and Hall of Fame manager, sums up the game of baseball quite brilliantly in the above quote, and it’s pretty ridiculous how much fans really don’t understand about the game of baseball that they watch so much. This holds especially true when you start talking about baseball stats. Sure, most people can tell you what a home run is and that batting average is important, but once you get past the basic stats, the rest is really uncharted territory for most fans. But fear not! This is your crash course in advanced baseball stats, explained in plain English, so that even the most rudimentary of fans can become knowledgeable in the mysterious world of baseball analytics, or sabermetrics as it is called in the industry. Because there are so many different stats that can be covered, I’m just going to touch on the hitting stats in this article and we can save the pitching ones for another piece. So without further ado – baseball stats! The Slash Line The baseball “slash line” typically looks like three different numbers rounded to the thousandth decimal place that are separated by forward slashes (hence the name). We’ll use Mike Trout‘s 2014 slash line as an example; this is what a typical slash line looks like: .287/.377/.561 The first of those numbers represents batting average. While most fans know about this stat, I’ll touch on it briefly just to make sure that I have all of my bases covered (baseball pun intended). -
The Rules of Scoring
THE RULES OF SCORING 2011 OFFICIAL BASEBALL RULES WITH CHANGES FROM LITTLE LEAGUE BASEBALL’S “WHAT’S THE SCORE” PUBLICATION INTRODUCTION These “Rules of Scoring” are for the use of those managers and coaches who want to score a Juvenile or Minor League game or wish to know how to correctly score a play or a time at bat during a Juvenile or Minor League game. These “Rules of Scoring” address the recording of individual and team actions, runs batted in, base hits and determining their value, stolen bases and caught stealing, sacrifices, put outs and assists, when to charge or not charge a fielder with an error, wild pitches and passed balls, bases on balls and strikeouts, earned runs, and the winning and losing pitcher. Unlike the Official Baseball Rules used by professional baseball and many amateur leagues, the Little League Playing Rules do not address The Rules of Scoring. However, the Little League Rules of Scoring are similar to the scoring rules used in professional baseball found in Rule 10 of the Official Baseball Rules. Consequently, Rule 10 of the Official Baseball Rules is used as the basis for these Rules of Scoring. However, there are differences (e.g., when to charge or not charge a fielder with an error, runs batted in, winning and losing pitcher). These differences are based on Little League Baseball’s “What’s the Score” booklet. Those additional rules and those modified rules from the “What’s the Score” booklet are in italics. The “What’s the Score” booklet assigns the Official Scorer certain duties under Little League Regulation VI concerning pitching limits which have not implemented by the IAB (see Juvenile League Rule 12.08.08). -
A Statistical Study Nicholas Lambrianou 13' Dr. Nicko
Examining if High-Team Payroll Leads to High-Team Performance in Baseball: A Statistical Study Nicholas Lambrianou 13' B.S. In Mathematics with Minors in English and Economics Dr. Nickolas Kintos Thesis Advisor Thesis submitted to: Honors Program of Saint Peter's University April 2013 Lambrianou 2 Table of Contents Chapter 1: The Study and its Questions 3 An Introduction to the project, its questions, and a breakdown of the chapters that follow Chapter 2: The Baseball Statistics 5 An explanation of the baseball statistics used for the study, including what the statistics measure, how they measure what they do, and their strengths and weaknesses Chapter 3: Statistical Methods and Procedures 16 An introduction to the statistical methods applied to each statistic and an explanation of what the possible results would mean Chapter 4: Results and the Tampa Bay Rays 22 The results of the study, what they mean against the possibilities and other results, and a short analysis of a team that stood out in the study Chapter 5: The Continuing Conclusion 39 A continuation of the results, followed by ideas for future study that continue to project or stem from it for future baseball analysis Appendix 41 References 42 Lambrianou 3 Chapter 1: The Study and its Questions Does high payroll necessarily mean higher performance for all baseball statistics? Major League Baseball (MLB) is a league of different teams in different cities all across the United States, and those locations strongly influence the market of the team and thus the payroll. Year after year, a certain amount of teams, including the usual ones in big markets, choose to spend a great amount on payroll in hopes of improving their team and its player value output, but at times the statistics produced by these teams may not match the difference in payroll with other teams. -
Baseball U Maryland Defensive Philosophy
Baseball U Maryland Defensive Philosophy Defense LONG TOSS AS MUCH AS POSSIBLE Individual skill work is a priority GET BETTER everyday Team Defensive Concepts: We will be fundamentally sound defensively- both physically and mentally. 1. Win the Free Base War : BB,HBP, errors, extra bases, mental mistakes 2. Prevent the Big Inning : 3 or more runs 3. Keep Double Play a pitch away (keep runners at 1B) 4. Communicate, Communicate, Communicate Goal: .970 fielding percentage (relates to 1 per game) - majority of errors should be IF play (3b,SS, 2b)- 1B, C, P, and OF cannot attribute to errors. Catching a. Catcher needs to be the most athletic guy on the field…VERY TOUGH b. Catcher has to be a LEADER…MUST BE VOCAL…confidence is key a. MUST be constantly communicating the defense b. MUST know each pitcher and situation c. Before the 1st pitch is thrown, get to know the umpire-“Good Morning, my name is …..” d. Calling Pitches: a. When giving signs, they must be hidden b. Be consistent when calling pitches (i.e. 1-FB, 2 CB, etc.)(Be consistent with a runner on base (i.e. 2nd sign, etc.) c. Know what the Pitchers BEST pitch is THAT DAY d. Avoid tendencies (i.e. every 0-2 count, throw CB) e. Defensive tenets: a. Receiving (80%) (1) be on time, (2) manipulate the ball, (3) keep strikes strikes i. 1 knee vs. 2 knee stances…comfort, try it all in bullpens for best results b. Blocking (15%) (1) know the pitcher, (2) anticipate block, (3) control the baseball i. -
Making It Pay to Be a Fan: the Political Economy of Digital Sports Fandom and the Sports Media Industry
City University of New York (CUNY) CUNY Academic Works All Dissertations, Theses, and Capstone Projects Dissertations, Theses, and Capstone Projects 9-2018 Making It Pay to be a Fan: The Political Economy of Digital Sports Fandom and the Sports Media Industry Andrew McKinney The Graduate Center, City University of New York How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/gc_etds/2800 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] MAKING IT PAY TO BE A FAN: THE POLITICAL ECONOMY OF DIGITAL SPORTS FANDOM AND THE SPORTS MEDIA INDUSTRY by Andrew G McKinney A dissertation submitted to the Graduate Faculty in Sociology in partial fulfillment of the requirements for the degree of Doctor of Philosophy, The City University of New York 2018 ©2018 ANDREW G MCKINNEY All Rights Reserved ii Making it Pay to be a Fan: The Political Economy of Digital Sport Fandom and the Sports Media Industry by Andrew G McKinney This manuscript has been read and accepted for the Graduate Faculty in Sociology in satisfaction of the dissertation requirement for the degree of Doctor of Philosophy. Date William Kornblum Chair of Examining Committee Date Lynn Chancer Executive Officer Supervisory Committee: William Kornblum Stanley Aronowitz Lynn Chancer THE CITY UNIVERSITY OF NEW YORK I iii ABSTRACT Making it Pay to be a Fan: The Political Economy of Digital Sport Fandom and the Sports Media Industry by Andrew G McKinney Advisor: William Kornblum This dissertation is a series of case studies and sociological examinations of the role that the sports media industry and mediated sport fandom plays in the political economy of the Internet. -
Sports Analytics Algorithms for Performance Prediction
Sports Analytics Algorithms for Performance Prediction Paschalis Koudoumas SID: 3308190012 SCHOOL OF SCIENCE & TECHNOLOGY A thesis submitted for the degree of Master of Science (MSc) in Data Science JANUARY 2021 THESSALONIKI – GREECE -i- Sports Analytics Algorithms for Performance Prediction Paschalis Koudoumas SID: 3308190012 Supervisor: Assoc. Prof. Christos Tjortjis Supervising Committee Mem- Assoc. Prof. Maria Drakaki bers: Dr. Leonidas Akritidis SCHOOL OF SCIENCE & TECHNOLOGY A thesis submitted for the degree of Master of Science (MSc) in Data Science JANUARY 2021 THESSALONIKI – GREECE -ii- Abstract This dissertation was written as a part of the MSc in Data Science at the International Hellenic University. Sports Analytics exist as a term and concept for many years, but nowadays, it is imple- mented in a different way that affects how teams, players, managers, executives, betting companies and fans perceive statistics and sports. Machine Learning can have various applications in Sports Analytics. The most widely used are for prediction of match outcome, player or team performance, market value of a player and injuries prevention. This dissertation focuses on the quintessence of foot- ball, which is match outcome prediction. The main objective of this dissertation is to explore, develop and evaluate machine learning predictive models for English Premier League matches’ outcome prediction. A comparison was made between XGBoost Classifier, Logistic Regression and Support Vector Classifier. The results show that the XGBoost model can outperform the other models in terms of accuracy and prove that it is possible to achieve quite high accuracy using Extreme Gradient Boosting. -iii- Acknowledgements At this point, I would like to thank my Supervisor, Professor Christos Tjortjis, for offer- ing his help throughout the process and providing me with essential feedback and valu- able suggestions to the issues that occurred. -
{Download PDF} the Sabermetric Revolution Assessing the Growth of Analytics in Baseball 1St Edition Ebook, Epub
THE SABERMETRIC REVOLUTION ASSESSING THE GROWTH OF ANALYTICS IN BASEBALL 1ST EDITION PDF, EPUB, EBOOK Benjamin Baumer | 9780812223392 | | | | | The Sabermetric Revolution Assessing the Growth of Analytics in Baseball 1st edition PDF Book Citations should be used as a guideline and should be double checked for accuracy. The Milwaukee Brewers have made a sabermetric shift under GM David Stearns, who took over in the fall of , and the new front office team of the Minnesota Twins also has sabermetric tendencies. The Sabermetric Revolution sets the record straight on the role of analytics in baseball. However, over the past two decades, a wider range of statistics made their way into barroom debates, online discussion groups, and baseball front offices. This is a very useful book. Subscribe to our Free Newsletter. But, be on guard, stats freaks: it isn't doctrinaire. He is the Robert A. Frederick E. Rocketed to popularity by the bestseller Moneyball and the film of the same name, the use of sabermetrics to analyze player performance has appeared to be a David to the Goliath of systemically advantaged richer teams that could be toppled only by creative statistical analysis. Also in This Series. Prospector largest collection. But how accurately can crunching numbers quantify a player's ability? Goodreads helps you keep track of books you want to read. The book is ideal for a reader who wishes to tie together the importance of everything they have digested from sites like Fangraphs , Baseball Prospectus , Hardball Times , Beyond the Box Score , and, even, yes, Camden Depot. Ryne rated it liked it May 27, Not because I rejected new age stats, but because I never sought them out. -
Open Evan Bittner Thesis.Pdf
THE PENNSYLVANIA STATE UNIVERSITY SCHREYER HONORS COLLEGE DEPARTMENT OF STATISTICS PREDICTING MAJOR LEAGUE BASEBALL PLAYOFF PROBABILITIES USING LOGISTIC REGRESSION EVAN J. BITTNER FALL 2015 A thesis submitted in partial fulfillment of the requirements for a baccalaureate degree in Statistics with honors in Statistics Reviewed and approved* by the following: Andrew Wiesner Lecturer of Statistics Thesis Supervisor Murali Haran Associate Professor of Statistics Honors Adviser * Signatures are on file in the Schreyer Honors College. i ABSTRACT Major League Baseball teams are constantly assessing whether or not they think their teams will make the playoffs. Many sources publish playoff probabilities or odds throughout the season using advanced statistical methods. These methods are somewhat secretive and typically advanced and difficult to understand. The goal of this work is to determine a way to calculate playoff probabilities midseason that can easily be understood and applied. The goal is to develop a method and compare its predictive accuracy to the current methods published by statistical baseball sources such as Baseball Prospectus and Fangraphs. ii TABLE OF CONTENTS List of Figures .............................................................................................................. iii List of Tables ............................................................................................................... iv Acknowledgements .....................................................................................................