Sabermetrics Documentation Release 0.1.1

Total Page:16

File Type:pdf, Size:1020Kb

Sabermetrics Documentation Release 0.1.1 Sabermetrics Documentation Release 0.1.1 Fernando Crema August 08, 2016 Contents 1 sabermetrics 3 1.1 sabermetrics package...........................................3 2 Indices and tables 5 Python Module Index 7 i ii Sabermetrics Documentation, Release 0.1.1 Contents: Contents 1 Sabermetrics Documentation, Release 0.1.1 2 Contents CHAPTER 1 sabermetrics 1.1 sabermetrics package 1.1.1 Submodules 1.1.2 sabermetrics.batting module sabermetrics.batting.avg(h, ab) Batting average is determined by dividing a player’s hits by his total at-bats for a number between zero (shown as .000) and one (1.000). H AV G = AB Parameters • h (int.) – Hits. • ab (int.) – At bats. Returns float – The batting average. Raises ZeroDivisionError sabermetrics.batting.babip(h, hr, ab, k, sf ) BABIP measures a player’s batting average exclusively on balls hit into the field of play, removing outcomes not affected by the opposing defense (namely home runs and strikeouts). H − HR BABIP = AB − K − HR − SF Parameters • h (int.) – Hits. • hr (int.) – Homeruns. • ab (int.) – At bats. • k (int.) – Strikeouts. • sf (int.) – Sacrifice fly. Returns float – Batting average balls in play.. Raises ZeroDivisionError 3 Sabermetrics Documentation, Release 0.1.1 1.1.3 sabermetrics.fielding module sabermetrics.fielding.fpct(a, po, e) How often does a fielder or team make the play when tasked with fielding a batted ball, throwing a ball, or receiving a thrown ball for an out. PO + A FPCT = PO + A + E Parameters • a (int.) – Assists. • po (int.) – Putouts. • e (int.) – Errors. Returns float – The fielding percentage. Raises ZeroDivisionError 1.1.4 sabermetrics.pitching module sabermetrics.pitching.era(er, outs) Represents the number of earned runs a pitcher allows per nine innings – with earned runs being any runs that scored without the aid of an error or a passed ball. 27ER BABIP = Outs Parameters • er (int.) – Earned runs. • outs (int.) – Number of batters and baserunners that are put out while the pitcher is on the mound. Returns float – the return code. Raises ZeroDivisionError 1.1.5 Module contents Sabermetrics: A python package with a set of baseball analytic functions for sabermetric purposes. sabermetrics.start() Initializing modules. 4 Chapter 1. sabermetrics CHAPTER 2 Indices and tables • genindex • modindex • search 5 Sabermetrics Documentation, Release 0.1.1 6 Chapter 2. Indices and tables Python Module Index b batting (Unix, Windows),3 f fielding (Unix, Windows),4 p pitching (Unix, Windows),4 s sabermetrics,4 sabermetrics.batting,3 sabermetrics.fielding,4 sabermetrics.pitching,4 7 Sabermetrics Documentation, Release 0.1.1 8 Python Module Index Index A avg() (in module sabermetrics.batting),3 B babip() (in module sabermetrics.batting),3 batting (module),3 E era() (in module sabermetrics.pitching),4 F fielding (module),4 fpct() (in module sabermetrics.fielding),4 P pitching (module),4 S sabermetrics (module),4 sabermetrics.batting (module),3 sabermetrics.fielding (module),4 sabermetrics.pitching (module),4 start() (in module sabermetrics),4 9.
Recommended publications
  • Sabermetrics: the Past, the Present, and the Future
    Sabermetrics: The Past, the Present, and the Future Jim Albert February 12, 2010 Abstract This article provides an overview of sabermetrics, the science of learn- ing about baseball through objective evidence. Statistics and baseball have always had a strong kinship, as many famous players are known by their famous statistical accomplishments such as Joe Dimaggio’s 56-game hitting streak and Ted Williams’ .406 batting average in the 1941 baseball season. We give an overview of how one measures performance in batting, pitching, and fielding. In baseball, the traditional measures are batting av- erage, slugging percentage, and on-base percentage, but modern measures such as OPS (on-base percentage plus slugging percentage) are better in predicting the number of runs a team will score in a game. Pitching is a harder aspect of performance to measure, since traditional measures such as winning percentage and earned run average are confounded by the abilities of the pitcher teammates. Modern measures of pitching such as DIPS (defense independent pitching statistics) are helpful in isolating the contributions of a pitcher that do not involve his teammates. It is also challenging to measure the quality of a player’s fielding ability, since the standard measure of fielding, the fielding percentage, is not helpful in understanding the range of a player in moving towards a batted ball. New measures of fielding have been developed that are useful in measuring a player’s fielding range. Major League Baseball is measuring the game in new ways, and sabermetrics is using this new data to find better mea- sures of player performance.
    [Show full text]
  • Does Sabermetrics Have a Place in Amateur Baseball?
    BaseballGB Full Article Does sabermetrics have a place in amateur baseball? Joe Gray 7 March 2009 he term “sabermetrics” is one of the many The term sabermetrics combines SABR (the acronym creations of Bill James, the great baseball for the Society for American Baseball Research) and theoretician (for details of the term’s metrics (numerical measurements). The extra “e” T was presumably added to avoid the difficult-to- derivation and usage see Box 1). Several tight pronounce sequence of letters “brm”. An alternative definitions exist for the term, but I feel that rather exists without the “e”, but in this the first four than presenting one or more of these it is more letters are capitalized to show that it is a word to valuable to offer an alternative, looser definition: which normal rules of pronunciation do not apply. sabermetrics is a tree of knowledge with its roots in It is a singular noun despite the “s” at the end (that is, you would say “sabermetrics is growing in the philosophy of answering baseball questions in as popularity” rather than “sabermetrics are growing in accurate, objective, and meaningful a fashion as popularity”). possible. The philosophy is an alternative to The adjective sabermetric has been back-derived from the term and is exemplified by “a sabermetric accepting traditional thinking without question. tool”, or its plural “sabermetric tools”. The adverb sabermetrically, built on that back- Branches of the sabermetric tree derived adjective, is illustrated in the phrase “she The metaphor of sabermetrics as a tree extends to approached the problem sabermetrically”. describing the various broad concepts and themes of The noun sabermetrician can be used to describe any practitioner of sabermetrics, although to some research as branches.
    [Show full text]
  • "What Raw Statistics Have the Greatest Effect on Wrc+ in Major League Baseball in 2017?" Gavin D
    1 "What raw statistics have the greatest effect on wRC+ in Major League Baseball in 2017?" Gavin D. Sanford University of Minnesota Duluth Honors Capstone Project 2 Abstract Major League Baseball has different statistics for hitters, fielders, and pitchers. The game has followed the same rules for over a century and this has allowed for statistical comparison. As technology grows, so does the game of baseball as there is more areas of the game that people can monitor and track including pitch speed, spin rates, launch angle, exit velocity and directional break. The website QOPBaseball.com is a newer website that attempts to correctly track every pitches horizontal and vertical break and grade it based on these factors (Wilson, 2016). Fangraphs has statistics on the direction players hit the ball and what percentage of the time. The game of baseball is all about quantifying players and being able give a value to their contributions. Sabermetrics have given us the ability to do this in far more depth. Weighted Runs Created Plus (wRC+) is an offensive stat which is attempted to quantify a player’s total offensive value (wRC and wRC+, Fangraphs). It is Era and park adjusted, meaning that the park and year can be compared without altering the statistic further. In this paper, we look at what 2018 statistics have the greatest effect on an individual player’s wRC+. Keywords: Sabermetrics, Econometrics, Spin Rates, Baseball, Introduction Major League Baseball has been around for over a century has given awards out for almost 100 years. The way that these awards are given out is based on statistics accumulated over the season.
    [Show full text]
  • Baseball U Maryland Defensive Philosophy
    Baseball U Maryland Defensive Philosophy Defense LONG TOSS AS MUCH AS POSSIBLE Individual skill work is a priority GET BETTER everyday Team Defensive Concepts: We will be fundamentally sound defensively- both physically and mentally. 1. Win the Free Base War : BB,HBP, errors, extra bases, mental mistakes 2. Prevent the Big Inning : 3 or more runs 3. Keep Double Play a pitch away (keep runners at 1B) 4. Communicate, Communicate, Communicate Goal: .970 fielding percentage (relates to 1 per game) - majority of errors should be IF play (3b,SS, 2b)- 1B, C, P, and OF cannot attribute to errors. Catching a. Catcher needs to be the most athletic guy on the field…VERY TOUGH b. Catcher has to be a LEADER…MUST BE VOCAL…confidence is key a. MUST be constantly communicating the defense b. MUST know each pitcher and situation c. Before the 1st pitch is thrown, get to know the umpire-“Good Morning, my name is …..” d. Calling Pitches: a. When giving signs, they must be hidden b. Be consistent when calling pitches (i.e. 1-FB, 2 CB, etc.)(Be consistent with a runner on base (i.e. 2nd sign, etc.) c. Know what the Pitchers BEST pitch is THAT DAY d. Avoid tendencies (i.e. every 0-2 count, throw CB) e. Defensive tenets: a. Receiving (80%) (1) be on time, (2) manipulate the ball, (3) keep strikes strikes i. 1 knee vs. 2 knee stances…comfort, try it all in bullpens for best results b. Blocking (15%) (1) know the pitcher, (2) anticipate block, (3) control the baseball i.
    [Show full text]
  • Applied Operational Management Techniques for Sabermetrics
    Applied Operational Management Techniques for Sabermetrics An Interactive Qualifying Project Report submitted to the faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degree of Bachelor of Science by Rory Fuller ______________________ Kevin Munn ______________________ Ethan Thompson ______________________ May 28, 2005 ______________________ Brigitte Servatius, Advisor Abstract In the growing field of sabermetrics, storage and manipulation of large amounts of statistical data has become a concern. Hence, construction of a cheap and flexible database system would be a boon to the field. This paper aims to briefly introduce sabermetrics, show why it exists, and detail the reasoning behind and creation of such a database. i Acknowledgements We acknowledge first and foremost the great amount of work and inspiration put forth to this project by Pat Malloy. Working alongside us on an attached ISP, Pat’s effort and organization were critical to the success of this project. We also recognize the source of our data, Project Scoresheet from retrosheet.org. The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711. We must not forget our advisor, Professor Brigitte Servatius. Several of the ideas and sources employed in this paper came at her suggestion and proved quite valuable to its eventual outcome. ii Table of Contents Title Page Abstract i Acknowledgements ii Table of Contents iii 1. Introduction 1 2. Sabermetrics, Baseball, and Society 3 2.1 Overview of Baseball 3 2.2 Forerunners 4 2.3 What is Sabermetrics? 6 2.3.1 Why Use Sabermetrics? 8 2.3.2 Some Further Financial and Temporal Implications of Baseball 9 3.
    [Show full text]
  • Determining the Value of a Baseball Player
    the Valu a Samuel Kaufman and Matthew Tennenhouse Samuel Kaufman Matthew Tennenhouse lllinois Mathematics and Science Academy: lllinois Mathematics and Science Academy: Junior (11) Junior (11) 61112012 Samuel Kaufman and Matthew Tennenhouse June 1,2012 Baseball is a game of numbers, and there are many factors that impact how much an individual player contributes to his team's success. Using various statistical databases such as Lahman's Baseball Database (Lahman, 2011) and FanGraphs' publicly available resources, we compiled data and manipulated it to form an overall formula to determine the value of a player for his individual team. To analyze the data, we researched formulas to determine an individual player's hitting, fielding, and pitching production during games. We examined statistics such as hits, walks, and innings played to establish how many runs each player added to their teams' total runs scored, and then used that value to figure how they performed relative to other players. Using these values, we utilized the Pythagorean Expected Wins formula to calculate a coefficient reflecting the number of runs each team in the Major Leagues scored per win. Using our statistic, baseball teams would be able to compare the impact of their players on the team when evaluating talent and determining salary. Our investigation's original focusing question was "How much is an individual player worth to his team?" Over the course of the year, we modified our focusing question to: "What impact does each individual player have on his team's performance over the course of a season?" Though both ask very similar questions, there are significant differences between them.
    [Show full text]
  • How to Do Stats
    EXPLANATION OF STATS IN SCORE BOOK FIELDING STATISTICS COLUMNS DO - Defensive Outs The number of put outs the team participated in while each player was in the line-up. Defensive outs are used in National Championships as a qualification rule. PO - Put out (10.09) A putout shall be credited to each fielder who (1) Catches a fly ball or a line drive, whether fair or foul. (2) Catches a thrown ball, which puts out a batter or a runner. (3) Tags a runner when the runner is off the base to which he is legally entitled. A – Assist (10.10) Any fielder who throws or deflects a battered or thrown ball in such a way that a putout results or would have except for a subsequent error, will be credited with an Assist. E – Error (10.12) An error is scored against any fielder who by any misplay (fumble, muff or wild throw) prolongs the life of the batter or runner or enables a runner to advance. BATTING STATISTICS COLUMNS PA - Plate Appearance Every time the batter completes his time at bat he is credited with a PA. Note: if the third out is made in the field he does not get a PA but is first to bat in the next innings. AB - At Bat (10.02(a)(1)) When a batter has reached 1st base without the aid of an ‘unofficial time at bat’. i.e. do not include Base on Balls, Hit by a Pitched Ball, Sacrifice flies/Bunts and Catches Interference. R – Runs (2.66) every time the runner crosses home plate scoring a run.
    [Show full text]
  • Oklahoma City Dodgers
    Oklahoma City Dodgers Game Information Baseball America’s 2018 Bob Freitas Triple-A Organization of the Year Pacific Coast League Affiliate of the Los Angeles Dodgers Chickasaw Bricktown Ballpark | 2 S. Mickey Mantle Drive | Oklahoma City, OK 73104 | Phone: (405) 218-1000 Alex Freedman: (405) 218-2126 | [email protected] | Lisa Johnson: (405) 218-2143 | [email protected] Oklahoma City Dodgers (29-40) vs. El Paso Chihuahuas (43-27) Game #70 of 140/Road #40 of 70 (19-20) Pitching Probables: OKC-RHP Dennis Santana (2-4, 5.60) vs. ELP-LHP Jerry Keel (5-2, 6.30) Tuesday, June 18, 2019 | Southwest University Park | El Paso, Texas | 8:05 p.m. CT Radio: KGHM AM-1340 The Game, 1340thegame.com, iHeartRadio - Alex Freedman Today’s Game: The Oklahoma City Dodgers close out their seven-game road trip with an 8:05 p.m. game against the El Paso Chihuahuas at OKC Dodgers Trends Southwest University Park. Tonight’s game is part of MiLB’s Copa de la Diversión, and the Dodgers will take the field as Cielo Azul de Oklahoma Overall Record ........................29-40 Home Record..........................10-20 City against the El Paso Margaritas...OKC leads the current series, 2-1, and has won 10 of its last 11 road games. Road Record.................... ......19-20 Current Streak.................... ........W1 Last Game: A key two-run double by Connor Joe and go-ahead home run by Will Smith paired with a great effort from the pitching staff led to Current Road Trip............... .........5-1 Last Road Trip..................... .......7-7 a 3-2 win for the Dodgers over El Paso Monday night at Southwest University Park.
    [Show full text]
  • Sabermetrics Over Time: Persuasion and Symbolic Convergence Across a Diffusion of Innovations
    SABERMETRICS OVER TIME: PERSUASION AND SYMBOLIC CONVERGENCE ACROSS A DIFFUSION OF INNOVATIONS BY NATHANIEL H. STOLTZ A Thesis Submitted to the Graduate Faculty of WAKE FOREST UNIVERSITY GRADUATE SCHOOL OF ARTS AND SCIENCES in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS Communication May 2014 Winston-Salem, North Carolina Approved By: Michael D. Hazen, Ph. D., Advisor John T. Llewellyn, Ph. D., Chair Todd A. McFall, Ph. D. ii Acknowledgments First and foremost, I would like to thank everyone who has assisted me along the way in what has not always been the smoothest of academic journeys. It begins with the wonderful group of faculty I encountered as an undergraduate in the James Madison Writing, Rhetoric, and Technical Communication department, especially my advisor, Cindy Allen. Without them, I would never have been prepared to complete my undergraduate studies, let alone take on the challenges of graduate work. I also want to thank the admissions committee at Wake Forest for giving me the opportunity to have a graduate school experience at a leading program. Further, I have unending gratitude for the guidance and patience of my thesis committee: Dr. Michael Hazen, who guided me from sitting in his office with no ideas all the way up to achieving a completed thesis, Dr. John Llewellyn, whose attention to detail helped me push myself and my writing to greater heights, and Dr. Todd McFall, who agreed to assist the project on short notice and contributed a number of interesting ideas. Finally, I have many to thank on a personal level.
    [Show full text]
  • 2018-19 Mo Warcats Defensive Stats # Name G TC a PO E DP FPCT Catcher INN Catcher PB Catcher SB Catcher CS Catcher PIK 20 Esswein 7474 5 67 2 0 97.3% 0 0 0 0 0
    2018-19 Mo Warcats Offensive Stats # Name G PA AB H 1B 2B 3B HR RBI R HBP ROE FC BB K AVG OBP SLG OPS PA/BB BB/K C% K-L SB CS XBH TB BA/RISP PS/PA SAC SF LOB QAB QAB% 20 Esswein 74 207 194 86 71 13 2 0 27 51 1 6 7 10 6 0.443 0.473 0.531 1.004 20.70 1.67 0.969 1 8 0 15 103 0.533 3.09 1 0 61 117 56.52% G = Games played RBI = Runs batted in OBP = On-base percentage CS = Caught stealing PA = Plate appearances R = Runs scored SLG = Slugging percentage XBH = Extra Base Hits AB = At bats HBP = Hit by pitch OPS = On-base % plus slugging % TB = Total bases H = Hits ROE = Reached on error PA/BB = Plate appearances per walk BA/RISP = Batting average with runners in scoring position 1B = Singles FC = Hit into fielder's choice BB/K = Walks per strikeout SAC = Sacrifice hits & bunts 2B = Doubles BB = Base on balls (walks) C% = Contact Percentage SF = Sacrifice flies 3B = Triples K = Strikeouts K-L = Strikeouts Looking LOB = Runners left on base when batter is out HR = Home runs AVG = Batting average SB = Stolen bases QA = Quality At Bats QA% = Quality At Bat Percentage 2018-19 Mo Warcats Pitching Stats # Name G IP GP GS W L SV BS H R ER BB SO HBP ERA WHIP BF #P TS TB P/IP P/BF LOO S% BB/IN LOBB WP SM SM% K/G K/BB GO FO BAA HR 20 Esswein 74 19.1 5 4 2 0 0 0 29 23 8 6 7 0 2.897 1.81 99 332 227 105 17.172 3.354 11 0.684 0.31 1 3 19 0.057 2.534 1.167 21 25 0.319 1 G = Games played BS = Blown Saves ERA = Earned run average (season) P/BF = Pitches per batter faced SM% = Swing & Miss Percent IP = Innings Pitched H = Hits allowed WHIP = Walks plus hits per
    [Show full text]
  • The Impact of Hitters on Winning Percentage and Salary Using Sabermetrics
    Mandrik 1 The Impact of Hitters on Winning Percentage and Salary Using Sabermetrics Zach Mandrik Professor Deprano 4/11/2017 Economics 490 Directed Research Abstract This paper addresses the question if advanced sabermetrics and traditional statistics are efficient predictors in estimating a hitter’s salary entering free agency. In addition the research also looks to answer if these same statistics can significantly predict a team’s winning percentage. Using data from 1985-2011, the models found that both traditional and advanced baseball metrics are significant in relation to salary and winning percentage. However, all of the regression models have low explanatory power in their ability to forecast results. I suggest that MLB clubs use these models as a checkpoint for free agent player salaries and/or ability to contribute to a team (winning percentage), rather than as an absolute determination of these variables. Mandrik 2 Overview Introduction Literature Review Statistics Review wOBA wRC+ BsR Regression Models The Data What are the outputs telling us? Forecasts and Analysis Conclusion Suggestions for the Sabermetrics Community References Appendix; Graphs and Tables Mandrik 3 Introduction One of the major goals for a baseball franchise, or any professional sports franchise in general, is to ultimately win a championship to bring in fans. Winning as a result typically brings an inflow of revenue, which is an owner’s desire. A portion of building a winning baseball team is centered on statistics and analytics. Thanks to the works of Bill James and many other baseball analysts, the development of sabermetrics has revolutionized the way business is done in baseball.
    [Show full text]
  • The Real Sabermetrics | the Harvard Independent Page 1 of 3
    The Real Sabermetrics | The Harvard Independent Page 1 of 3 Popu News The Real Sabermetrics Today's: Where baseball meets statistics. Tomorrow Forum Tomorrow By DAVID ROHER | 02/12/09 Give Blago a Break A Cartoony 'Tis the Se I should love Bobby Abreu. Valentine's Day Special: A New and At least I should according to Ed Wade, the GM of the Houston Astros. Wade told Improved Holiday the New York Times that the former all-star is “a sabermetrician’s dream, from the Last view standpoint of what he produces statistically.” Fortunately, Wade is not my most A Cartoony Valentine's Day Special: Are You a trusted source for baseball analysis. The only dreams real sabermetricians have Tomorrow Wifey or a Jump-Off? about the right fielder these days are nightmares about his horrifying outfield Tomorrow defense. Training th Valentine's Day Special: How Dateable Are You? But if we're using the unfortunate mainstream definition of sabermetrics, Wade is right on the money. Thanks in part to Michael Lewis’ 2003 bestseller, Moneyball, most baseball fans have a limited definition of the term. It mostly conjures images Arts of slow, colossal power hitters blessed with a 6th sense for taking pitches out of I'm Into It the strike zone. A lot of them generate enough wind from their frequent swings and misses to provide electricity for a small town. And for a player to truly be Romance Amidst the Rubble considered sabermetric, he has to be hated by scouts and anyone else who subscribes to a “conventional” method of analysis.
    [Show full text]