Markov League Baseball
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Statistical Study Nicholas Lambrianou 13' Dr. Nicko
Examining if High-Team Payroll Leads to High-Team Performance in Baseball: A Statistical Study Nicholas Lambrianou 13' B.S. In Mathematics with Minors in English and Economics Dr. Nickolas Kintos Thesis Advisor Thesis submitted to: Honors Program of Saint Peter's University April 2013 Lambrianou 2 Table of Contents Chapter 1: The Study and its Questions 3 An Introduction to the project, its questions, and a breakdown of the chapters that follow Chapter 2: The Baseball Statistics 5 An explanation of the baseball statistics used for the study, including what the statistics measure, how they measure what they do, and their strengths and weaknesses Chapter 3: Statistical Methods and Procedures 16 An introduction to the statistical methods applied to each statistic and an explanation of what the possible results would mean Chapter 4: Results and the Tampa Bay Rays 22 The results of the study, what they mean against the possibilities and other results, and a short analysis of a team that stood out in the study Chapter 5: The Continuing Conclusion 39 A continuation of the results, followed by ideas for future study that continue to project or stem from it for future baseball analysis Appendix 41 References 42 Lambrianou 3 Chapter 1: The Study and its Questions Does high payroll necessarily mean higher performance for all baseball statistics? Major League Baseball (MLB) is a league of different teams in different cities all across the United States, and those locations strongly influence the market of the team and thus the payroll. Year after year, a certain amount of teams, including the usual ones in big markets, choose to spend a great amount on payroll in hopes of improving their team and its player value output, but at times the statistics produced by these teams may not match the difference in payroll with other teams. -
Improving the FIP Model
Project Number: MQP-SDO-204 Improving the FIP Model A Major Qualifying Project Report Submitted to The Faculty of Worcester Polytechnic Institute In partial fulfillment of the requirements for the Degree of Bachelor of Science by Joseph Flanagan April 2014 Approved: Professor Sarah Olson Abstract The goal of this project is to improve the Fielding Independent Pitching (FIP) model for evaluating Major League Baseball starting pitchers. FIP attempts to separate a pitcher's controllable performance from random variation and the performance of his defense. Data from the 2002-2013 seasons will be analyzed and the results will be incorporated into a new metric. The new proposed model will be called jFIP. jFIP adds popups and hit by pitch to the fielding independent stats and also includes adjustments for a pitcher's defense and his efficiency in completing innings. Initial results suggest that the new metric is better than FIP at predicting pitcher ERA. Executive Summary Fielding Independent Pitching (FIP) is a metric created to measure pitcher performance. FIP can trace its roots back to research done by Voros McCracken in pursuit of winning his fantasy baseball league. McCracken discovered that there was little difference in the abilities of pitchers to prevent balls in play from becoming hits. Since individual pitchers can have greatly varying levels of effectiveness, this led him to wonder what pitchers did have control over. He found three that stood apart from the rest: strikeouts, walks, and home runs. Because these events involve only the batter and the pitcher, they are referred to as “fielding independent." FIP takes only strikeouts, walks, home runs, and innings pitched as inputs and it is scaled to earned run average (ERA) to allow for easier and more useful comparisons, as ERA has traditionally been one of the most important statistics for evaluating pitchers. -
Determining the Value of a Baseball Player
the Valu a Samuel Kaufman and Matthew Tennenhouse Samuel Kaufman Matthew Tennenhouse lllinois Mathematics and Science Academy: lllinois Mathematics and Science Academy: Junior (11) Junior (11) 61112012 Samuel Kaufman and Matthew Tennenhouse June 1,2012 Baseball is a game of numbers, and there are many factors that impact how much an individual player contributes to his team's success. Using various statistical databases such as Lahman's Baseball Database (Lahman, 2011) and FanGraphs' publicly available resources, we compiled data and manipulated it to form an overall formula to determine the value of a player for his individual team. To analyze the data, we researched formulas to determine an individual player's hitting, fielding, and pitching production during games. We examined statistics such as hits, walks, and innings played to establish how many runs each player added to their teams' total runs scored, and then used that value to figure how they performed relative to other players. Using these values, we utilized the Pythagorean Expected Wins formula to calculate a coefficient reflecting the number of runs each team in the Major Leagues scored per win. Using our statistic, baseball teams would be able to compare the impact of their players on the team when evaluating talent and determining salary. Our investigation's original focusing question was "How much is an individual player worth to his team?" Over the course of the year, we modified our focusing question to: "What impact does each individual player have on his team's performance over the course of a season?" Though both ask very similar questions, there are significant differences between them. -
Gonzo Honored to Join Latino Baseball Hall of Fame by Josh Rawitch
│ Optimizing the Diamondbacks Lineup: vs. LHP Gonzo honored to join Latino Baseball Hall of Fame By Joseph Jacquez / Venom Strikes By Josh Rawitch / Arizona Diamondbacks http://venomstrikes.com/2015/02/08/optimizing- http://m.dbacks.mlb.com/news/article/108697656/d-backs- diamondbacks-lineup-vs-lhp/ luiz-gonzalez-honored-to-join-latino-baseball-hall-of-fame Luis Gonzalez Enters the Arizona Sports Hall of Fame D-backs done dealing for now, likely out on Shields By Thomas Lynch / Venom Strikes By Steve Gilbert / MLB.com http://venomstrikes.com/2015/02/07/luis-gonzalez-enters- http://m.dbacks.mlb.com/news/article/108553930/arizona- arizona-sports-hall-fame/ diamondbacks-indicate-they-dont-plan-to-make-more-deals- before-spring-training Diamondbacks RHP Touki Toussaint Hires Agent By Joseph Jacquez / Venom Strikes D-backs celebrate first graduates of Dominican academy http://venomstrikes.com/2015/02/06/diamondbacks-rhp- By Josh Rawitch / Arizona Diamondbacks touki-toussaint-hires-agent/ http://m.dbacks.mlb.com/news/article/108558196/arizona- diamondbacks-celebrate-graduation-at-academy-in-dominican- Diamondbacks Preparing for Arbitration Hearings with republic Trumbo, and Reed By Joseph Jacquez / Venom Strikes Diamondbacks to have crowded rotation competition http://venomstrikes.com/2015/02/06/diamondbacks- By Nick Piecoro / Arizona Republic preparing-arbitration-hearings-trumbo-reed/ http://www.azcentral.com/story/sports/mlb/diamondbacks/2 015/02/07/diamondbacks-to-have-crowded-rotation- 13 Days Until Diamondbacks Pitchers & Catchers Report -
Bayesian Statistics and Baseball
Pomona College Senior Thesis in Mathematics Bayesian Statistics and Baseball Author: Advisor: Guy Stevens Jo Hardin Submitted to Pomona College in Partial Fulfillment of the Degree of Bachelor of Arts April 5, 2013 Contents 1 Introduction 2 2 Variables and Data 2 3 Methodology 3 3.1 Logistic Regression . 3 3.2 Bayesian Statistics . 3 3.3 Hierarchical Bayesian Logistic Regression . 4 3.3.1 Model Specifications . 4 3.3.2 Parameter Priors . 5 3.3.3 Joint Distributions . 5 3.4 Metropolis Hastings and the Gibbs Sampler . 6 3.5 Proof of Convergence . 7 4 Results 10 4.1 Model Output . 10 4.2 Individual Pitchers . 13 5 Discussion 15 6 Appendix A: Code 16 1 Introduction The projection of future performance in professional baseball has long been a serious question of interest to those inside and outside the sport. It has been heavily researched in the academic community and among quantitative analysts within the baseball community. Past academic research has taken a number of distinct approaches to making predictions for hitters, but research on pitchers is a bit more limited. The study of projections often provides valuable insight into aspects of performance used in modeling. One projection engine, created by Tom Tango, is called MARCEL; it uses a weighted average of recent performance and regresses it to the mean. It uses simple, publicly-available data and, in general, performs quite well as a prediction tool for both hitters and pitchers (Tango 2004). However, this approach leaves much to be desired. Tango explains that this model is the simplest possible model that should be accepted; while it stacks up well against some basic systems, it is meant as a simple threshold for considering a model to be effective. -
SF Giants Press Clips Friday, 21, 2018
SF Giants Press Clips Friday, 21, 2018 San Francisco Chronicle A toast to Hunter Pence and his memorable Giants’ carrer John Shea Hunter Pence gets cheered when he steps out of the dugout. When he takes the field. When he’s introduced as a pinch-hitter. When he gets a big hit. Even when he makes an out. Pence is treated like a legend — an overused word, but perhaps fitting in this case — and a two-time champion who was instrumental and inspirational at the height of the Giants’ glory days. Because that’s who he is, even though he’s hitting a career-worst .211. Pence remains a crowd favorite despite his struggles, despite the team’s struggles, and this will be accentuated in the final week of the season. That starts with Monday’s homestand opener and runs through the Sept. 30 finale against the Dodgers, the final opportunities for fans to stand and cheer the immensely popular 35-year-old outfielder. The Giants plan an on-field tribute to Pence on that last day. “I appreciate it every day regardless of conditions,” Pence said of his love affair with fans. “Ultimately, it just goes to show it’s a big reflection of how good a fan base they are and how good a community the Giants have built. It’s a product of that. 1 “In sports and life in general, not everything’s always positive. But there’s a benefit from some of the difficult things. It’s part of life. The ebb and flow of the good times and the bad times.” Pence is winding down the final days of his contract and, considering the Giants’ plan to reshape the roster and get younger next year, the final days of his career in San Francisco. -
Framing the Game Through a Sabermetric Lens: Major League
FRAMING THE GAME THROUGH A SABERMETRIC LENS: MAJOR LEAGUE BASEBALL BROADCASTS AND THE DELINEATION OF TRADITIONAL AND NEW FACT METRICS by ZACHARY WILLIAM ARTH ANDREW C. BILLINGS, COMMITTEE CHAIR DARRIN J. GRIFFIN SCOTT PARROTT JAMES D. LEEPER KENON A. BROWN A DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the College of Communication and Information Sciences in the Graduate School of The University of Alabama TUSCALOOSA, ALABAMA 2019 Copyright Zachary William Arth 2019 ALL RIGHTS RESERVED i ABSTRACT This purpose of this dissertation was to first understand how Major League Baseball teams are portraying and discussing statistics within their local broadcasts. From there, the goal was to ascertain how teams differed in their portrayals, with the specific dichotomy of interest being between teams heavy in advanced statistics and those heavy in traditional statistics. With advanced baseball statistics still far from being universally accepted among baseball fans, the driving question was whether or not fans that faced greater exposure to advanced statistics would also be more knowledgeable and accepting of them. Thus, based on the results of the content analysis, fans of four of the most advanced teams and four of the most traditional teams were accessed through MLB team subreddits and surveyed. Results initially indicated that there was no difference between fans of teams with advanced versus traditional broadcasts. However, there were clear differences in knowledge based on other factors, such as whether fans had a new school or old school orientation, whether they were high in Schwabism and/or mavenism, and how highly identified they were with the team. -
Major Qualifying Project: Advanced Baseball Statistics
Major Qualifying Project: Advanced Baseball Statistics Matthew Boros, Elijah Ellis, Leah Mitchell Advisors: Jon Abraham and Barry Posterro April 30, 2020 Contents 1 Background 5 1.1 The History of Baseball . .5 1.2 Key Historical Figures . .7 1.2.1 Jerome Holtzman . .7 1.2.2 Bill James . .7 1.2.3 Nate Silver . .8 1.2.4 Joe Peta . .8 1.3 Explanation of Baseball Statistics . .9 1.3.1 Save . .9 1.3.2 OBP,SLG,ISO . 10 1.3.3 Earned Run Estimators . 10 1.3.4 Probability Based Statistics . 11 1.3.5 wOBA . 12 1.3.6 WAR . 12 1.3.7 Projection Systems . 13 2 Aggregated Baseball Database 15 2.1 Data Sources . 16 2.1.1 Retrosheet . 16 2.1.2 MLB.com . 17 2.1.3 PECOTA . 17 2.1.4 CBS Sports . 17 2.2 Table Structure . 17 2.2.1 Game Logs . 17 2.2.2 Play-by-Play . 17 2.2.3 Starting Lineups . 18 2.2.4 Team Schedules . 18 2.2.5 General Team Information . 18 2.2.6 Player - Game Participation . 18 2.2.7 Roster by Game . 18 2.2.8 Seasonal Rosters . 18 2.2.9 General Team Statistics . 18 2.2.10 Player and Team Specific Statistics Tables . 19 2.2.11 PECOTA Batting and Pitching . 20 2.2.12 Game State Counts by Year . 20 2.2.13 Game State Counts . 20 1 CONTENTS 2 2.3 Conclusion . 20 3 Cluster Luck 21 3.1 Quantifying Cluster Luck . 22 3.2 Circumventing Cluster Luck with Total Bases . -
Measuring Pitchers' Performance Using Data Envelopment Analysis
Contemporary Management Research Pages 351-384, Vol. 11, No. 4, December 2015 doi:10.7903/cmr.14157 Measuring Pitchers’ Performance Using Data Envelopment Analysis with Advanced Statistics Shihteng Chiu Stella Maris Primary School, Taiwan E-Mail: [email protected] Chiahuei Hsiao National Taipei University E-Mail: [email protected] Huichin Wu National Taipei University E-Mail: [email protected] ABSTRACT This paper evaluates starting pitchers’ pitching performance during the 2008 to 2014 Major League Baseball (MLB) seasons. We use data envelopment analysis (DEA) based on two inputs (i.e., innings pitched [IPs] and per pitched innings) and three outputs (fielding independent pitching, earned run average [ERA], and skill-interactive ERA) to evaluate the performance of the 30 MLB teams’ starting pitchers with IPs greater than 200 in each single season (2008 to 2014, regular season only). We used the CCR models to calculate the overall efficiency, scale efficiency, technical efficiency, efficiency value, and the slack analysis to measure a pitcher’s performance in each single season. The results showed that 4, 3, 4, 3, 5, 3, and 5 pitches reached overall efficiency each year, from 2008 to 2014 (regular season). By analyzing the results and computing performance indexes and benchmarks for each starting pitcher, we determine the true value of each player to help baseball teams select highly indexed players and maximize player efficiency. Keywords: Advanced Statistics, Pitchers, Data Envelopment Analysis, Innings Pitched, Earned Run Average Contemporary Management Research 352 INTRODUCTION Competitive sports interest a large number of people who watch them because of their uncertainty and unpredictability. -
Pitcher Effectiveness, That Uses Statcast Data to Evaluate Starting Pitchers Dynamically, Based on the Results of In-Game Outcomes After Each Pitch
Pitcher Effectiveness: A Step Forward for In Game Analytics and Pitcher Evaluation Christopher Watkins1,*, Vincent Berardi2, Cyril Rakovski3 1Computational and Data Sciences Program, Chapman University, Orange, CA, 2Psychology Department, Chapman University, Orange, CA, 3Mathematics Department, Chapman University, Orange, CA *Corresponding Author E-mail: [email protected] Abstract With the introduction of Statcast in 2015, baseball analytics have become more precise. Statcast allows every play to be accurately tracked and the data it generates is easily accessible through Baseball Savant, which opens the opportunity for improved performance statistics to be developed. In this paper we propose a new tool, Pitcher Effectiveness, that uses Statcast data to evaluate starting pitchers dynamically, based on the results of in-game outcomes after each pitch. Pitcher Effectiveness successfully predicts instances where starting pitchers give up several runs, which we believe make it a new and important tool for the in-game and post-game evaluation of starting pitchers. Keywords: Baseball, MLB, Statcast, Analytics, Prediction 1 Introduction 1.1 Evolution of Statistics in Baseball Baseball has quickly become one of the most analyzed sports with significant growth in the last 20 years (Koseler and Stephan 2017) with an enormous amount of data collected every game that requires professional teams to have a state-of the-art analytics team in order to compete in today’s game. As an example, the Houston Astros lost over 100 games each season from 2011-2013. In 2014, sports writer Ben Reiter predicted the Astros to win a World Series sooner rather than later, in 2017, because of the advanced analytics team they built (Reiter 2014). -
Predicting Baseball Win/Loss Records from Player Projections
Predicting Baseball Win/Loss Records from Player Projections Connor Daly [email protected] November 29, 2017 1 Introduction When forecasting future results in major league baseball (MLB), there are essentially two sources from which you can derive your predictions: teams and players. How do players perform individually, and how do their collaborative actions coalesce to form a team's results? Several methods of both types exist, but often are shrouded with proprietary formulas. Currently, several mature and highly sophisticated player projection systems are used to forecast season results. None are abundantly transparent about their methodology. Here I set out to develop the simplest possible player-based team projection system and try to add one basic improvement. 2 Predicting Wins and Losses 2.1 Team-Based Projections One approach to such forecasting is to analyze team performance in head to head matchups. A common implementation of this approach is known as an elo rating and prediction system. 1 Elo systems start by assigning teams an average rating. After games are played, winning teams' ratings increase and losing teams' ratings decrease relative to the expected outcome of the matchup. Expected outcomes are determined by the difference in rating between the two teams. If a very good team almost loses to a really bad team, its rating will only increase slightly. If an underdog pulls of an upset, however, it will earn relatively more points. As more games are played, older games become progressively less meaningful. Essentially, this prediction method considers only who you played, what the margin of victory was, and where the match was played (home team advantage is adjusted for). -
The Baseball Season Has Almost Reached the Quarter Pole, and the Indians Have Experienced Plenty of Ups and Downs
Poor luck at plate part of Tribe's uneven start By Jordan Bastian MLB.com @MLBastian CLEVELAND -- The baseball season has almost reached the quarter pole, and the Indians have experienced plenty of ups and downs. The rotation has continued to be a strength, the bullpen has struggled to find its footing and the offense has been plagued by extreme hot and cold spells. The result so far has been an 18-18 record, heading into Thursday's off-day. Fortunately, given the state of the American League Central, that ledger is good enough for first place. A year ago, Cleveland endured similar issues early on and had a 31-31 record in mid-June. The Indians then got on a roll, ending with 102 wins and a division crown. The Tribe is hoping for a similar run as this summer progresses, but there are plenty of kinks to iron out in the meantime. With that in mind, what follows is a glance at the season's first six weeks for the Indians through a mix of Statcast™ data. The good Francisco Lindor recently overtook Yonder Alonso for the team lead in Barrels with 16 on the season, tying the All-Star shortstop for the sixth- highest total in the Majors. Sluggers Giancarlo Stanton and Bryce Harper also had 16 on the season going into Thursday, while Alonso and Manny Machado were among those right behind with 15. Lindor also leads Cleveland batters (minimum 50 results) with a .414 expected weighted on-base average (34th in MLB). The bad As a team, the Indians have feasted on fastballs (their .475 slugging percentage on all varieties of heaters ranked fifth in MLB), but they have struggled against offspeed offerings.