Package 'Sabermetrics'

Total Page:16

File Type:pdf, Size:1020Kb

Package 'Sabermetrics' Package ‘Sabermetrics’ February 7, 2015 Type Package Title Sabermetrics Functions For Baseball Analytics Version 1.0 Date 2015-02-06 Author Peter Xenopoulos <www.peterxeno.com> Maintainer Peter Xenopoulos <[email protected]> Description A collection of baseball analytics functions for sabermetrics purposes. Among these func- tions include popular metrics such as OBP, wOBA, runs created functions as well as field- independent pitching metrics. License GPL-3 NeedsCompilation no Repository CRAN Date/Publication 2015-02-07 00:55:03 R topics documented: sabermetrics-package . .2 dice .............................................2 eqa..............................................3 fip..............................................5 iso..............................................6 log5.............................................7 obp .............................................8 ops..............................................9 pyth............................................. 10 rcBasic . 11 rcBasicSB . 12 rcPX............................................. 13 rcTech . 14 secA............................................. 15 slg.............................................. 16 wOBA............................................ 17 Index 18 1 2 dice sabermetrics-package Sabermetrics Functions For Baseball Analytics Description A collection of baseball analytics functions for sabermetrics purposes. Among these functions include popular metrics such as OBP, wOBA, runs created functions as well as field-independent pitching metrics. Details Package: Sabermetrics Type: Package Version: 1.0 Date: 2015-02-06 License: GPL-3 Author(s) Peter Xenopoulos References Wikipedia: http://en.wikipedia.org/wiki/Sabermetrics#Examples Reddit: http://www.reddit.com/r/Sabermetrics dice Defense-Independent Component ERA (DICE) Description A function gives a number that is better at predicting a pitcher’s ERA in the following year than the pitcher’s actual ERA in the current year. Usage dice(HR, BB, HBP, K, IP) eqa 3 Arguments HR Home Runs Allowed BB Walks Allowed HBP Batters Hit K Strikeouts IP Innings Pitched Value Returns 3 + ((13*HR+3*BB+3*HBP-2*K)/IP) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Defense_independent_pitching_statistics Examples ## Defense-Independent Component ERA (dice) function is currently defined as function (HR, BB, HBP, K, IP) { defenseERA <- 3 + ((13 * HR + 3 * BB + 3 * HBP - 2 * K)/IP) return(defenseERA) } ## Let's take 2014's MLB MVP, Clayton Kershaw, and find his DICE ## Stats for Clayton Kershaw available on ## http://www.baseball-reference.com/players/k/kershcl01-pitch.shtml ## For 2014, Kershaw allowed 9 HR, 31 BB, 2 HBP, 239 K, and 198.1 IP ## The formula for his DICE using the dice function is below ## Output should be 1.677436 dice(9,31,2,239,198.1) eqa Equivalent Average Description A baseball metric invented by Clay Davenport and intended to express the production of hitters in a context independent of park and league effects. EQA represents a hitter’s productivity using the same scale as batting average. Usage eqa(H, TB, BB, HBP, SB, SAC, SF, AB, CS) 4 eqa Arguments H Hits TB Total Bases BB Walks HBP Hit by pitch SB Stolen bases SAC Sacrifice hit/bunt SF Sacrifice flies AB At bats CS Caught stealing Value Returns (H+TB+1.5*(BB+HBP)+SB+SAC+SF)/(AB+BB+HBP+SAC+SF+CS+(SB/3)) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Equivalent_average Examples ## The equivalent average (eqa) function is currently defined as function (H, TB, BB, HBP, SB, SAC, SF, AB, CS) { eqa <- (H + TB + 1.5 * (BB + HBP) + SB + SAC + SF)/(AB + BB + HBP + SAC + SF + CS + (SB/3)) return(eqa) } ## Let's take 2014's MLB MVP, Mike Trout, and find his OPS ## Stats for Mike Trout available on ## http://www.baseball-reference.com/players/t/troutmi01-bat.shtml ## For 2014, Trout had 173 H, 338 TB, 83 BB, 10 HBP, 16 SB, 0 SAC, 10 SF, 602 AB, 2 CS ## The formula for his EQA using the ops function is below ## Output should be .9496958 eqa(173,338,83,10,16,0,10,602,2) fip 5 fip Field Independent Pitching Description Similar to DICE dice Usage fip(HR, BB, K, IP, C) Arguments HR Home Runs allowed BB Walks K Strikeouts IP Innings Pitched C League average ERA Value Returns ((13*HR+3*BB-2*K)/IP) + C Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Defense_independent_pitching_statistics See Also DICE dice Examples ## Field Independent Pitching (fip) function is currently defined as function (HR, BB, K, IP, C) { fieldIndPitch <- ((13 * HR + 3 * BB - 2 * K)/IP) + C return(fieldIndPitch) } ## Let's take 2014's MLB MVP, Clayton Kershaw, and find his FIPS ## Stats for Clayton Kershaw available on ## http://www.baseball-reference.com/players/k/kershcl01-pitch.shtml ## For 2014, Kershaw allowed 9 HR, 31 BB, 239 K, 198.1 IP and league era (C) of 3.66 6 iso ## The formula for his FIPS using the dice function is below ## Output should be 2.307148 fip(9,31,239,198.1,3.66) iso Isolated Power Description Isolated power is a statistic to measure a hitter’s raw power Usage iso(slg, avg) Arguments slg Slugging Percentage. Found from slg avg Batting Average Value Returns Slugging Percentage - Batting Average Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Isolated_Power See Also Slugging Percentage slg Examples ## The isolated power (iso) function is currently defined as function (slg, avg) { iso <- slg - avg return(iso) } ## Let's take 2014's MLB MVP, Mike Trout, and find his Isolated Power ## Stats for Mike Trout available on ## http://www.baseball-reference.com/players/t/troutmi01-bat.shtml ## For 2014, Trout had a SLG of .561 and an AVG of .287 log5 7 ## The formula for his Isolated Power using the iso function is below ## Output should be .274 iso(0.561,0.287) log5 Log5 Sabermetric formula Description Log 5 is a formula invented by Bill James to estimate the probability that team A will win a game, based on the true winning percentage of Team A and Team B. It’s equivalent to the Bradley-Terry- Luce model used for paired comparisons, the Elo rating system used in chess and the Rasch model used in the analysis of categorical data. Usage log5(probA, probB, order) Arguments probA Win probability of team A probB Win probability of team B order Determine winning probability of which team. 0 means win probability of A over B, and 1 vice-versa Value Returns (probA - (probA*probB)) / (probA + probB - (2 * probA * probB)) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Log5 8 obp obp On-Base Percentage Description Function to calculate the on-base percentage of a player/team Usage obp(H, BB, HBP, AB, SF) Arguments H Hits BB Unintentional Walks HBP Hit by pitch AB At bats SF Sacrifice flies Details On-base percentage is used to figure out how often an entity gets on-base Value Returns the following: ((H+BB+HBP)/(AB+BB+SF+HBP)) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/On-base_percentage See Also Slugging Percentage slg, OPS ops and Isolated Power iso ops 9 Examples ## The on-base percentage (obp) function is currently defined as function (H, BB, HBP, AB, SF) { onbase <- ((H+BB+HBP)/(AB+BB+SF+HBP)) return(onbase) } ## Let's take 2014's MLB MVP, Mike Trout, and find his on-base percentage ## Stats for Mike Trout available on ## http://www.baseball-reference.com/players/t/troutmi01-bat.shtml ## For 2014, Trout had 173 H, 83 BB, 10 HBP, 602 AB, 10 SF ## The formula for his on-base percentage using the obp function is below ## Output should be 0.377305 obp(173,83,10,602,10) ops On-base plus Slugging Description Function to calculate on base percentage plus slugging percentage. This is a measure of a hitter’s ability to hit for power and get on base. Usage ops(slg, obp) Arguments slg Slugging percentage. Found from slg obp On-base percentage. Found from obp Value Returns On-Base Percentage + Slugging Percentage Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/On-base_plus_slugging See Also On-base Percentage obp and Slugging Percentage slg 10 pyth Examples ## The on-base percentage plus slugging (ops) function is currently defined as function (slg, obp) { ops <- slg + obp return(ops) } ## Let's take 2014's MLB MVP, Mike Trout, and find his OPS ## Stats for Mike Trout available on ## http://www.baseball-reference.com/players/t/troutmi01-bat.shtml ## For 2014, Trout had a SLG of .561 and an OBP of .377 ## The formula for his OPS using the ops function is below ## Output should be .938 ops(0.561,0.377) pyth Pythagorean Expectation Description Pythagorean expectation is a formula invented by Bill James to estimate how many games a baseball team "should" have won based on the number of runs they scored and allowed. Usage pyth(RS, RA) Arguments RS Runs Scored RA Runs Allowed Value Returns (RS*RS)/((RS*RS)+(RA*RA)) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Pythagorean_expectation rcBasic 11 rcBasic Runs Created (Basic) Description Basic description of how many runs a hitter contributes to his team Usage rcBasic(H, BB, TB, AB) Arguments H Hits BB Walks TB Total Bases AB At Bats Value Returns ((H+BB)*TB)/(AB+BB) Author(s) Peter Xenopoulos References http://en.wikipedia.org/wiki/Runs_created See Also Runs Created (with stolen bases) rcBasicSB and Runs Created (Technical) rcTech Examples ## This is a generic runs created formula ## Let's see how many runs created (keep in mind this is an estimate) ## a batter will make with ## 100 hits, 7 walks (BB), 80 total bases, and 300 at bats function (H, BB, TB, AB) { rc <- ((H + BB) * TB)/(AB +
Recommended publications
  • A Statistical Study Nicholas Lambrianou 13' Dr. Nicko
    Examining if High-Team Payroll Leads to High-Team Performance in Baseball: A Statistical Study Nicholas Lambrianou 13' B.S. In Mathematics with Minors in English and Economics Dr. Nickolas Kintos Thesis Advisor Thesis submitted to: Honors Program of Saint Peter's University April 2013 Lambrianou 2 Table of Contents Chapter 1: The Study and its Questions 3 An Introduction to the project, its questions, and a breakdown of the chapters that follow Chapter 2: The Baseball Statistics 5 An explanation of the baseball statistics used for the study, including what the statistics measure, how they measure what they do, and their strengths and weaknesses Chapter 3: Statistical Methods and Procedures 16 An introduction to the statistical methods applied to each statistic and an explanation of what the possible results would mean Chapter 4: Results and the Tampa Bay Rays 22 The results of the study, what they mean against the possibilities and other results, and a short analysis of a team that stood out in the study Chapter 5: The Continuing Conclusion 39 A continuation of the results, followed by ideas for future study that continue to project or stem from it for future baseball analysis Appendix 41 References 42 Lambrianou 3 Chapter 1: The Study and its Questions Does high payroll necessarily mean higher performance for all baseball statistics? Major League Baseball (MLB) is a league of different teams in different cities all across the United States, and those locations strongly influence the market of the team and thus the payroll. Year after year, a certain amount of teams, including the usual ones in big markets, choose to spend a great amount on payroll in hopes of improving their team and its player value output, but at times the statistics produced by these teams may not match the difference in payroll with other teams.
    [Show full text]
  • {Download PDF} the Sabermetric Revolution Assessing the Growth of Analytics in Baseball 1St Edition Ebook, Epub
    THE SABERMETRIC REVOLUTION ASSESSING THE GROWTH OF ANALYTICS IN BASEBALL 1ST EDITION PDF, EPUB, EBOOK Benjamin Baumer | 9780812223392 | | | | | The Sabermetric Revolution Assessing the Growth of Analytics in Baseball 1st edition PDF Book Citations should be used as a guideline and should be double checked for accuracy. The Milwaukee Brewers have made a sabermetric shift under GM David Stearns, who took over in the fall of , and the new front office team of the Minnesota Twins also has sabermetric tendencies. The Sabermetric Revolution sets the record straight on the role of analytics in baseball. However, over the past two decades, a wider range of statistics made their way into barroom debates, online discussion groups, and baseball front offices. This is a very useful book. Subscribe to our Free Newsletter. But, be on guard, stats freaks: it isn't doctrinaire. He is the Robert A. Frederick E. Rocketed to popularity by the bestseller Moneyball and the film of the same name, the use of sabermetrics to analyze player performance has appeared to be a David to the Goliath of systemically advantaged richer teams that could be toppled only by creative statistical analysis. Also in This Series. Prospector largest collection. But how accurately can crunching numbers quantify a player's ability? Goodreads helps you keep track of books you want to read. The book is ideal for a reader who wishes to tie together the importance of everything they have digested from sites like Fangraphs , Baseball Prospectus , Hardball Times , Beyond the Box Score , and, even, yes, Camden Depot. Ryne rated it liked it May 27, Not because I rejected new age stats, but because I never sought them out.
    [Show full text]
  • 356 Baseball for Dummies, 4Th Edition
    Index 1B. See fi rst–base position American Association, 210 2B. See second–base position American League (AL), 207. 3B. See third–base position See also stadiums 40–40 club, 336 American Legion Baseball, 197 anabolic steroids, 282 • A • Angel Stadium of Anaheim, 280 appeal plays, 39, 328 Aaron, Hank, 322 appealing, 328 abbreviations appearances, defi ned, 328 player, 9 Arizona Diamondbacks, 265 scoring, 262 Arizona Fall League, 212 across the letters, 327 Arlett, Buzz, 213 activate, defi ned, 327 around the horn, defi ned, 328 adjudged, defi ned, 327 artifi cial turf, 168, 328 adjusted OPS (OPS+), 243–244 Asian leagues, 216 advance sale, 327 assists, 247, 263, 328 advance scouts, 233–234, 327 AT&T Park, 272, 280 advancing at-balls, 328 hitter, 67, 70, 327 at-bats, 8, 328 runner, 12, 32, 39, 91, 327 Atlanta Braves, 265–266 ahead in the count, defi ned, 327 attempts, 328. See also stealing bases airmailed, defi ned, 327 automatic outs, 328 AL (American League) teams, 207. away games, 328 See also stadiums alive balls, 32 • B • alive innings, 327 All American Amateur Baseball Babe Ruth League, 197 Association, 197 Babe Ruth’s curse, 328 alley (power alley; gap), 189, 327, 337 back through the box, defi ned, 328 alley hitters, 327 backdoor slide, 328 allowing, defi ned, 327COPYRIGHTEDbackdoor MATERIAL slider, 234, 328 All-Star, defi ned, 327 backhand plays, 178–179 All-Star Break, 327 backstops, 28, 329 All-Star Game, 252, 328 backup, 329 Alphonse and Gaston Act, 328 bad balls, 59, 329 aluminum bats, 19–20 bad bounces (bad hops), 272, 329
    [Show full text]
  • Package 'Mlbstats'
    Package ‘mlbstats’ March 16, 2018 Type Package Title Major League Baseball Player Statistics Calculator Version 0.1.0 Author Philip D. Waggoner <[email protected]> Maintainer Philip D. Waggoner <[email protected]> Description Computational functions for player metrics in major league baseball including bat- ting, pitching, fielding, base-running, and overall player statistics. This package is actively main- tained with new metrics being added as they are developed. License MIT + file LICENSE Encoding UTF-8 LazyData true RoxygenNote 6.0.1 NeedsCompilation no Repository CRAN Date/Publication 2018-03-16 09:15:57 UTC R topics documented: ab_hr . .2 aera .............................................3 ba ..............................................4 baa..............................................4 babip . .5 bb9 .............................................6 bb_k.............................................6 BsR .............................................7 dice .............................................7 EqA.............................................8 era..............................................9 erc..............................................9 fip.............................................. 10 fp .............................................. 11 1 2 ab_hr go_ao . 11 gpa.............................................. 12 h9.............................................. 13 iso.............................................. 13 k9.............................................. 14 k_bb............................................
    [Show full text]
  • Volume 22 Spring 2021 the Journal of Undergraduate Research In
    The Journal of Undergraduate Research Volume 22 in Natural Sciences and Mathematics Spring 2021 The Journal of Undergraduate Research in Natural Sciences and Mathematics Volume 22 | Spring 2021 Marks of a CSUF graduate from the College of Natural Sciences and Mathematics GRADUATES FROM THE COLLEGE OF NATURAL SCIENCES AND MATHEMATICS: • Understand the basic concepts and principles of science and mathematics. • Are experienced in working collectively and collaborating to solve problems. • Communicate both orally and in writing with clarity, precision, and confidence. • Are adept at using computers to do word processing, prepare spreadsheets and graphs, and use presentation software. • Possess skills in information retrieval using library resources and the internet. • Have extensive laboratory, workshop, and field experience where they utilize the scientific method to ask questions, formulate hypotheses, design and conduct experiments, and analyze data. • Appreciate diverse cultures as a result of working side by side with many people in collaborative efforts in the classroom, laboratory, and on research projects. • Have had the opportunity to work individually with faculty members in conducting research and independent projects, often leading to the generation of original data and contributing to the research knowledge base. • Are capable of working with modern equipment, instrumentation, and techniques. 4 DIMENSIONS DIMENSIONS: The Journal of Undergraduate Research in Natural Sciences and Mathematics is an official publication of California State University, Fullerton. DIMENSIONS is published annually by CSUF, 800 N. State College Blvd., Fullerton, CA 92834. Copyright ©2021 CSUF. Except as otherwise provided, DIMENSIONS grants permission for material in this publication to be copied for use by non-profit educational institutions for scholarly or instructional purposes only, provided that 1) copies are distributed at or below cost, 2) the author and DIMENSIONS are identified, and 3) proper notice of copyright appears on each copy.
    [Show full text]
  • 4-6, T-3Rd Pacific Northern) Game 11 • Home Game 6 River Cats: RHP Jose Flores (0-0, 1.80) • Rainiers: RHP Christian Bergman (1-0, 0.00)
    River Cats Media Relations • 400 Ballpark Drive • West Sacramento, CA 95691 • P: (916) 376-4751 • F: (916) 376-4710 • @RiverCats Sunday, April 15 SACRAMENTO RIVER CATS (5-5, 2nd Pacific Northern) Raley Field - vs - West Sacramento, CA • 1:05 p.m (PT) TACOMA RAINIERS (4-6, T-3rd Pacific Northern) Game 11 • Home Game 6 River Cats: RHP Jose Flores (0-0, 1.80) • Rainiers: RHP Christian Bergman (1-0, 0.00) Last Night’s Game: The River Cats got back to .500 with a 12-7 victory over the Rainiers. Starting AT A GLANCE pitcher RHP Dereck Rodriguez allowed just three baserunners in innings two through four after giving up two OVERALL HOME ROAD Record: 5-5 2-3 3-2 runs on two walks and a single in the first frame. Due to a pair of errors, all of Rodriguez’s three runs went Day: 1-1 1-0 0-1 unearned. Meanwhile, the offense combined for 12 runs on 13 hits, clubbing two home runs and four total Night: 4-4 1-3 3-1 extra-base hits. The River Cats have now homered in five consecutive games. vs. RHP: 5-5 2-3 3-2 vs. LHP: 0-0 0-0 0-0 One-run games: 2-3 0-2 2-1 RHP Jose Flores will make his second start at Raley Field during this seven- Tonight’s Starter: Extra-inning games: 0-0 0-0 0-0 game homestand. He was handed the ball for the home opener on Tuesday and struck out seven in five Shutouts: 0-1 0-0 0-1 innings with no walks.
    [Show full text]
  • Homer Bailey Baseball Reference
    Homer Bailey Baseball Reference Unbounded Thaddeus metallises mutinously. Zincographic and sicker Hercules never recapping his Tartufe! Interbank and Adamitic Ignaz never sangs hardily when Victor tramp his receptionists. At any more sustainable with that homer bailey threw the field to follow this Fas deal on the new posts by post editors and losing too, ranking no one of their use. Homer Bailey BR Bullpen Baseball-Referencecom. More of famer and homer bailey baseball reference also homer bailey suggested. They lost within his recent home run stats have, baseball reference and homer bailey baseball reference and these modest starter additions would be prepared. Scott Boras appears to have sold the Marlins a false bill of goods, but there are two that seem to be flying under the radar. The regular season is often forgotten by Reds fans when it comes to Reggie Sanders because he just how poorly he performed in the playoffs that season. With the Nats this winter and he takes on the Reds' Homer Bailey at 705 pm EST. This couch an automatic process. Save my name, Mo. Cards savvy collector can pat and sell baseball cards sports memorabilia. Had his articles about him a larger share posts by wordpress. The worst was in Houston which is obviously very suspect now. All Professional Baseball Statistics for Homer Bailey. The top arms that coming back from all their system after two pitchers? European users agree on bailey was totally worth it to reference and homer bailey appears to the increase in their ace, unknowns and website. And against Baseball Reference's category of power pitchers they.
    [Show full text]
  • A Bayesian Variable Selection Approach to Major League
    A Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics∗ Blakeley B. McShane, Alexander Braunstein, James Piette, and Shane T. Jensen Department of Statistics The Wharton School University of Pennsylvania Abstract Numerous statistics have been proposed for the measure of offensive ability in major league baseball. While some of these measures may offer moderate predictive power in certain situations, it is unclear which simple offensive metrics are the most reliable or consistent. We address this issue with a Bayesian hierarchical model for variable selection to capture which offensive metrics are most predictive within players across time. Our sophisticated methodology allows for full estimation of the posterior distri- butions for our parameters and automatically adjusts for multiple testing, providing a distinct advantage over alternative approaches. We implement our model on a set of 50 different offensive metrics and discuss our results in the context of comparison to other variable selection techniques. We find that 33/50 metrics demonstrate signal. However, these metrics are highly correlated with one another and related to traditional notions of performance (e.g., plate discipline, power, and ability to make contact). Keywords: Baseball, Bayesian models, entropy, mixture models, random effects arXiv:0911.4503v1 [stat.AP] 23 Nov 2009 October 22, 2018 ∗Blake McShane, Alex Braunstein, and James Piette are doctoral candidates and Shane Jensen is an As- sistant Professor, all in the Department of Statistics at the Wharton School of the University of Pennsylvania. All correspondence on this manuscript should be sent to Blake McShane, [email protected], 400 Jon M. Huntsman Hall, 3730 Walnut Street, Philadelphia, PA 19104.
    [Show full text]
  • A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics
    University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 10-2011 A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics Blakeley B. McShane University of Pennsylvania Alexander Braunstein James M. Piette III University of Pennsylvania Shane T. Jensen University of Pennsylvania Follow this and additional works at: https://repository.upenn.edu/statistics_papers Part of the Statistics and Probability Commons Recommended Citation McShane, B. B., Braunstein, A., Piette, J. M., & Jensen, S. T. (2011). A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics. Journal of Quantitative Analysis in Sports, 7 (4), http://dx.doi.org/10.2202/1559-0410.1323 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/statistics_papers/442 For more information, please contact [email protected]. A Hierarchical Bayesian Variable Selection Approach to Major League Baseball Hitting Metrics Abstract Numerous statistics have been proposed to measure offensive ability in Major League Baseball. While some of these measures may offer moderate predictive power in certain situations, it is unclear which simple offensive metrics are the most reliable or consistent. We address this issue by using a hierarchical Bayesian variable selection model to determine which offensive metrics are most predictive within players across time. Our sophisticated methodology allows for full estimation of the posterior distributions for our parameters and automatically adjusts for multiple testing, providing a distinct advantage over alternative approaches. We implement our model on a set of fifty different offensive metrics and discuss our results in the context of comparison to other variable selection techniques. We find that a large number of metrics demonstrate signal.
    [Show full text]
  • Performance Outcomes After Medial Ulnar Collateral Ligament Reconstruction in Major League Baseball Positional Players
    J Shoulder Elbow Surg (2018) 27, 282–290 www.elsevier.com/locate/ymse Performance outcomes after medial ulnar collateral ligament reconstruction in Major League Baseball positional players John P. Begly, MD, Michael S. Guss, MD, Theodore S. Wolfson, MD, Siddharth A. Mahure, MD, MBA*, Andrew S. Rokito, MD, Laith M. Jazrawi,MD New York University Hospital for Joint Diseases, New York, NY, USA Background: We sought to determine whether professional baseball positional players who underwent medial ulnar collateral ligament (MUCL) reconstruction demonstrate decreases in performance on return to competition compared with preoperative performance metrics and their control-matched peers. Methods: Data for 35 Major League Baseball positional players who underwent MUCL reconstruction during 31 seasons were obtained. Twenty-six players met inclusion criteria. Individual statistics for the 2 seasons immediately before injury and the 2 seasons after injury included wins above replacement (WAR), on-base plus slugging (OPS), and isolated power (ISO). Twenty-six controls matched by player position, age, plate appearances, and performance statistics were identified. Results: Of the 35 athletes who underwent surgery, 7 did not return to their preinjury level of competi- tion (return to play rate of 80%). In comparing preinjury with postinjury statistics, players exhibited a significant decrease in plate appearances, at-bats, and WAR 2 seasons after injury but did not demonstrate declines in WAR 1 season after injury. Compared with matched controls, athletes who underwent MUCL recon- struction did not demonstrate significant decline in statistical performance, including OPS, WAR, and ISO, after return to play from surgery. Of all positional players, catchers undergoing surgery demonstrated lowest rates of return to play (56%) along with statistically significant decreases in home run rate, runs batted in, and ISO.
    [Show full text]
  • Major Qualifying Project: Advanced Baseball Statistics
    Major Qualifying Project: Advanced Baseball Statistics Matthew Boros, Elijah Ellis, Leah Mitchell Advisors: Jon Abraham and Barry Posterro April 30, 2020 Contents 1 Background 5 1.1 The History of Baseball . .5 1.2 Key Historical Figures . .7 1.2.1 Jerome Holtzman . .7 1.2.2 Bill James . .7 1.2.3 Nate Silver . .8 1.2.4 Joe Peta . .8 1.3 Explanation of Baseball Statistics . .9 1.3.1 Save . .9 1.3.2 OBP,SLG,ISO . 10 1.3.3 Earned Run Estimators . 10 1.3.4 Probability Based Statistics . 11 1.3.5 wOBA . 12 1.3.6 WAR . 12 1.3.7 Projection Systems . 13 2 Aggregated Baseball Database 15 2.1 Data Sources . 16 2.1.1 Retrosheet . 16 2.1.2 MLB.com . 17 2.1.3 PECOTA . 17 2.1.4 CBS Sports . 17 2.2 Table Structure . 17 2.2.1 Game Logs . 17 2.2.2 Play-by-Play . 17 2.2.3 Starting Lineups . 18 2.2.4 Team Schedules . 18 2.2.5 General Team Information . 18 2.2.6 Player - Game Participation . 18 2.2.7 Roster by Game . 18 2.2.8 Seasonal Rosters . 18 2.2.9 General Team Statistics . 18 2.2.10 Player and Team Specific Statistics Tables . 19 2.2.11 PECOTA Batting and Pitching . 20 2.2.12 Game State Counts by Year . 20 2.2.13 Game State Counts . 20 1 CONTENTS 2 2.3 Conclusion . 20 3 Cluster Luck 21 3.1 Quantifying Cluster Luck . 22 3.2 Circumventing Cluster Luck with Total Bases .
    [Show full text]
  • Paul Brendel Final Project INTRODUCTION One of the Most
    Paul Brendel Final Project INTRODUCTION One of the most entertaining aspects of Major League Baseball (MLB) is watching a player hit for extra bases (doubles, triples, and home runs). This power was on full display during baseball’s steroid era, which is believed to have occurred roughly between the late 1980’s to the late 2000’s[1]. On November 15, 2005 MLB and the players’ association agreed on a plan to significantly strengthen steroid testing and penalties (including a lifetime ban for 3rd offenses)[2]. One statistic that measures the ability to hit for power is isolated power (ISO), which tells you the number of extra bases the player has per at bat (ISO = (2B + (2*3B) + (3*HR) / AB)[3]. Various factors may influence a team’s average ISO. American League (AL) teams may have a higher ISO than those of National League (NL) teams because AL teams have a designated hitter bat instead of a pitcher, who is usually a very poor batter. Teams with the highest salaries may have the highest ISOs since they can afford to sign the best players. Teams who play home games in stadiums with more hitter-friendly park factors (park dimensions, weather, air density/quality, etc.) may have a better ISO than those of teams who play in more pitcher- friendly parks. I suspect that each team’s average ISO decreased after MLB’s strict steroid testing and penalty system was put into place. I also believe that teams in the AL with the highest salaries and with the most hitter-friendly home park factors will have the highest ISO.
    [Show full text]