Pitch Quantification Part 1: Between Pitcher Comparisons of QOP with Conventional Statistics" (2016)

Total Page:16

File Type:pdf, Size:1020Kb

Pitch Quantification Part 1: Between Pitcher Comparisons of QOP with Conventional Statistics Biola University Digital Commons @ Biola Faculty Articles & Research 2016 Pitch quantification arP t 1: between pitcher comparisons of QOP with conventional statistics Jason Wilson Biola University Follow this and additional works at: https://digitalcommons.biola.edu/faculty-articles Part of the Sports Studies Commons, and the Statistics and Probability Commons Recommended Citation Wilson, Jason, "Pitch quantification Part 1: between pitcher comparisons of QOP with conventional statistics" (2016). Faculty Articles & Research. 393. https://digitalcommons.biola.edu/faculty-articles/393 This Article is brought to you for free and open access by Digital Commons @ Biola. It has been accepted for inclusion in Faculty Articles & Research by an authorized administrator of Digital Commons @ Biola. For more information, please contact [email protected]. | 1 Pitch Quantification Part 1: Between-Pitcher Comparisons of QOP with Conventional Statistics Jason Wilson1,2 1. Introduction The Quality of Pitch (QOP) statistic uses PITCHf/x data to extract the trajectory, location, and speed from a single pitch and is mapped onto a -10 to 10 scale. A value of 5 or higher represents a quality MLB pitch. In March 2015 we presented an LA Dodgers case study at the SABR Analytics conference using QOP that included the following results1: 1. Clayton Kershaw’s no hitter on June 18, 2014 vs. Colorado had an objectively better pitching performance than Josh Beckett’s no hitter on May 25th vs. Philadelphia. 2. Josh Beckett’s 2014 injury followed a statistically significant decline in his QOP that was not accompanied by a significant decline in MPH. These, and the others made in the presentation, are big claims. Can they be substantiated? Is QOP a valid statistic? Could it be a reliable new stream of information about pitch quality? The purpose of this paper is to provide evidence to MLB analysts, analyst-types, and fans of the affirmative. We originally developed QOP with the desire to provide a more comprehensive measurement for individual pitches that could be used by pitching coaches for player development and by scouts to identify player potential. The analytical and medical applications in our presentation came later. Before we can go about the business of developing QOP for such high-level applications, however, the MLB analysts with whom we have spoken are asking for evidence of the validity of QOP. How does it compare with conventional statistics like ERA and WHIP and FIP, or BABIP and OPS? Other results? Is it consistent? Is it better? This paper presents that comparison. I will make a case that while QOP values can be shown to associate with traditional pitch classification at a macro level, direct comparisons of QOP with conventional statistics have low correlation due to the difference between the statistics and the loss of variation inherent in averaging hundreds of pitching and hitting events. However, when QOP is compared with raw results data -- number of bases earned per hit -- and conditioning by pitch designation, it is shown to strongly associate with results. Additionally, I demonstrate a positive way to relate QOP to ERA is as a predictor. QOP has a 76 to 97% success rate for predicting a pitchers’ drop in ERA or FIP in subsequent seasons for pitchers with QOP average over 5.0. In order to be as clear as possible at the outset, let me remark on what QOP is, and is not. My colleagues and I have designed the QOP function to calculate the three key properties of a pitch which 1 Biola University, 13800 La Mirada, CA 90639; [email protected] 2 © Greiner Agencies Inc., 1478 Welbourn Drive, Edmonton, Alberta, Canada, T6M 2M1 | 2 are: trajectory, location, and speed. Specifically, these properties are determined from the following components, shown with their relationship to QOP: 1. Increased rise lower QOP (for curveballs) 2. Increased total break higher QOP 3. Increased late vertical break higher QOP 4. Increased horizontal break higher QOP 5. Closeness to corners of strike zone higher QOP 6. Increased velocity higher QOP Thus, the calculation of QOP is completely objective because it incorporates the above objectively measurable pitch components. There are no subjective evaluations in these calculations2. As the old English measurement system had inconsistent lengths between the “foot” measurement of different merchants, so contemporary major league baseball has inconsistent assessments of pitch quality between observers. As the “foot” inconsistency was resolved by standardized ruler, so we would like QOP to become a “ruler” of pitch quality for baseball. Therefore, the QOP model does not include pitch tempo, pitch sequence, or pitch intention. Neither do we consider pitcher deception or batter anticipation to be measurable properties for inclusion in QOP. We include only measurable components in the QOP “ruler”. In our theory of pitch quantification, we distinguish between pitch quality (QOP), pitch context (count, tempo, sequence, base runners, etc.), and pitch effectiveness (results). Pitch quality is the foundational issue. To the degree that the measurement of pitch quality is flawed, the estimation of batter effects and prediction of results will be flawed. We hypothesize that when accurate pitch quality is correctly combined with pitch context, then pitch effectiveness/results can be accurately modeled. QOP is our attempt to measure pitch quality accurately. In this paper, we are not addressing the relationship between pitch quality and pitch context. Such development of our theory remains for future work. In order to get there, the prior issue is the establishment within the baseball analytics community of the validity of QOP, which is the focus of this paper. Once QOP, an accurate measurement of pitch quality, is established as a valid metric, future development can occur. 2. Overview of QOP I begin with a brief description of how QOP is calculated and how it differs from conventional statistics. Those already familiar with QOP may skip this section. The formula for QOP is a multiple regression model with the following form (see Figure 1 for a graphical representation of components): QOP = β1*rise + β2*break.point + β3*vert.break + β4*horiz.break + β5*location + β6*MPH where 1. rise: the number of feet the ball rises vertically to the maximum height 2. break.point: the horizontal distance (in feet) from the release point to the maximum height 3. vert.break: the number of vertical feet from the maximum ball height to the point it crosses the plate | 3 4. horiz.break: the number of horizontal feet, at home plate, from the “straight line” crossing point of the ball to the actual crossing point 5. location: a measurement of how easy the pitch is to hit, on average, based upon the location in the strike zone (the higher the value, the easier to hit, see Figure 2) 6. MPH: pitch speed Figure 1. Curveball trajectory diagram. The trajectory components of a pitch used to develop the initial QOP model. In the model shown, the parameters were derived from the pitch measurements using a low-tech method. The current model uses PITCHf/x data. # Residuals: # Min 1Q Median 3Q Max # -2.1066 -0.4282 -0.0656 0.3274 3.3782 # # Coefficients: # Std. Error t value Pr(>|t|) # rise 0.69040 -2.809 0.00519 ** # break.point 0.01837 12.956 < 2e-16 *** # vert.break 0.01964 50.342 < 2e-16 *** # loc 0.01537 -65.053 < 2e-16 *** # horiz.break 0.08130 4.050 6.1e-05 *** # --- # Residual standard error: 0.6724 on 419 degrees of freedom # Multiple R-squared: 0.9578, Adjusted R-squared: 0.9573 # F-statistic: 1903 on 5 and 419 DF, p-value: < 2.2e-16 Table 1. Statistical software output for QOP model. It may be noticed that MPH does not appear in the output. This is because we combine this trajectory and location parameter model with MPH separately. The rise and location coefficients are negative, while the break.point, vert.break, horiz.break, and MPH are all positive. This makes sense, because a rising pitch and bad location make for a low quality pitch whereas a big break and fast speed make for a good quality pitch. Under extensive model testing, there were no quadratic relationships or interactions found. It is noteworthy that | 4 this is a zero intercept model. The model fit is excellent with a p-value < 10 for all coefficients (except rise, which is 0.005), and adjusted R2 = 0.957. All QOP work has been−5 done in R3 and the model output is in Table 1. Unfortunately, the patent pending model is proprietary and the coefficients cannot be released at this time. Figure 2. Diagram used for developing the location variable in the current QOP model. Different colors indicate different points assigned to the location parameter. The best location is near the strike zone corners, and near the edge of the strike zone. The worst location is the exact center of the strike zone, and far out of the strike zone. The edge of the strike zone, inside and out, rates well because it can lure the batter into swinging, while still difficult to hit. This is a conceptual diagram only; this is NOT the exact way location is calculated for the QOP model. 3. Results How does QOP compare with pitching statistics like FIP and batting statistics like OPS? These questions form the subject of the first two sub-sections, below. Since QOP provides a number for every pitch, we can look at either an average or the individual pitch values. To distinguish these, we let QOPV refer to a set of individual pitch values and QOPA refer to their average. While QOPV is normally on a -10 to 10 scale, there are no theoretical limits to the endpoints, since it is a regression model.
Recommended publications
  • 2015 MLB Season in Review Using Pitch Quantification and the QOP1 Metric
    Biola University Digital Commons @ Biola Faculty Articles & Research 2016 2015 MLB Season in Review Using Pitch Quantification and the QOP1 Metric Jason Wilson Biola University Follow this and additional works at: https://digitalcommons.biola.edu/faculty-articles Part of the Sports Studies Commons, and the Statistics and Probability Commons Recommended Citation Wilson, Jason, "2015 MLB Season in Review Using Pitch Quantification and the QOP1 Metric" (2016). Faculty Articles & Research. 392. https://digitalcommons.biola.edu/faculty-articles/392 This Article is brought to you for free and open access by Digital Commons @ Biola. It has been accepted for inclusion in Faculty Articles & Research by an authorized administrator of Digital Commons @ Biola. For more information, please contact [email protected]. | 1 The 2015 MLB Season in Review Using Pitch Quantification and the QOP1 Metric Jason Wilson2 and Wayne Greiner3 1. Introduction The purpose of this paper is to provide SABR with the work we would like to present, should we be selected as presenters at the 2016 SABR Analytics Conference. Our subject is Quality of Pitch (QOP). QOP is a statistic calculated from the trajectory, location, and speed of a single pitch (see Appendix 1 for how QOP is calculated). QOP was introduced at last year’s SABR Analytics Conference (2015), after which the primary question received from analysts was, “How does QOP compare with conventional MLB statistics?” Section 2 answers this question. Having provided evidence for the validity of QOP, the meat of the presentation would be Sections 3, which explores the following questions: 1. Which MLB players threw the highest quality pitches in 2015? 2.
    [Show full text]
  • Seattle Mariners Opening Day Record Book
    SEATTLE MARINERS OPENING DAY RECORD BOOK 1977-2012 All-Time Openers Year Date Day Opponent Att. Time Score D/N 1977 4/6 Wed. CAL 57,762 2:40 L, 0-1 N 1978 4/5 Wed. MIN 45,235 2:15 W, 3-2 N 1979 4/4 Wed. CAL 37,748 2:23 W, 5-4 N 1980 4/9 Wed. TOR 22,588 2:34 W, 8-6 N 1981 4/9 Thurs. CAL 33,317 2:14 L, 2-6 N 1982 4/6 Tue. at MIN 52,279 2:32 W, 11-7 N 1983 4/5 Tue. NYY 37,015 2:53 W, 5-4 N 1984 4/4 Wed. TOR 43,200 2:50 W, 3-2 (10) N 1985 4/9 Tue. OAK 37,161 2:56 W, 6-3 N 1986 4/8 Tue. CAL 42,121 3:22 W, 8-4 (10) N 1987 4/7 Tue. at CAL 37,097 2:42 L, 1-7 D 1988 4/4 Mon. at OAK 45,333 2:24 L, 1-4 N 1989 4/3 Mon. at OAK 46,163 2:19 L, 2-3 N 1990 4/9 Mon. at CAL 38,406 2:56 W, 7-4 N 1991 4/9 Tue. CAL 53,671 2:40 L, 2-3 N 1992 4/6 Mon. TEX 55,918 3:52 L, 10-12 N 1993 4/6 Tue. TOR 56,120 2:41 W, 8-1 N 1994 4/4 Mon. at CLE 41,459 3:29 L, 3-4 (11) D 1995 4/27 Thurs.
    [Show full text]
  • Language Arts
    2013 EDUCATION DAYS At Wednesday, April 10th @ 10:05 am Monday, April 29th @ 10:05 am Wednesday, May 15th @ 10:05 am Curriculum Guide Including Kentucky’s Learning Goals and Academic Expectations IMPORTANT: PLEASE READ BEFORE CONTINUING This curriculum guide was created to provide you with a learning “compass” to help guide your students through a truly rewarding “out of class” experience. The ideas and lesson plans included within stand to serve as a guide to educators to create educational, appropriate ideas to assist in the “Education Day” at Whitaker Bank Ballpark. The curriculum has been designed by educators for educators. It is intended to help provide activities toward meeting Kentucky’s Learning Goals and Academic Expectations. For your convenience, the Learning Goals and Academic Expectations are listed within the curriculum. Regardless of the activity, the procedures and methods can easily be substituted to fit your appropriate grade level. This curriculum has been organized with the following in mind: . Educators tailor their lesson plans to meet the needs of their students. Educators may use bits and pieces of a lesson plan to create their own lesson plans. Educators have the flexibility to use this curriculum to integrate ideas into current studies. This curriculum guide has been divided into four basic areas of study: MATHEMATICS, SCIENCE, SOCIAL STUDIES, and LANGUAGE ARTS. Within each of these four areas of study, we have further divided them into two distinct sub-headings: FOCUSED LESSON PLANS and EXTENDED IDEAS. Focused Lesson Plans: With at least three samples per section, these lesson plans have been designed as a complete, top-to-bottom lesson plan.
    [Show full text]
  • Testing the Minimax Theorem in the Field
    Testing the Minimax Theorem in the Field: The Interaction between Pitcher and Batter in Baseball Christopher Rowe Advisor: Professor William Rogerson Abstract John von Neumann’s Minimax Theorem is a central result in game theory, but its practical applicability is questionable. While laboratory studies have often rejected its conclusions, recent field studies have achieved more favorable results. This thesis adds to the growing body of field studies by turning to the game of baseball. Two models are presented and developed, one based on pitch location and the other based on pitch type. Hypotheses are formed from assumptions on each model and then tested with data from Major League Baseball, yielding evidence in favor of the Minimax Theorem. May 2013 MMSS Senior Thesis Northwestern University Table of Contents Acknowledgements 3 Introduction 4 The Minimax Theorem 4 Central Question and Structure 6 Literature Review 6 Laboratory Experiments 7 Field Experiments 8 Summary 10 Models and Assumptions 10 The Game 10 Pitch Location Model 13 Pitch Type Model 21 Hypotheses 24 Pitch Location Model 24 Pitch Type Model 31 Data Analysis 33 Data 33 Pitch Location Model 34 Pitch Type Model 37 Conclusion 41 Summary of Results 41 Future Research 43 References 44 Appendix A 47 Appendix B 59 2 Acknowledgements I would like to thank everyone who had a role in this paper’s completion. This begins with the Office of Undergraduate Research, who provided me with the funds necessary to complete this project, and everyone at Baseball Info Solutions, in particular Ben Jedlovec and Jeff Spoljaric, who provided me with data.
    [Show full text]
  • The Logan Banner, Thursday, June 20, 2013 Obituaries
    A1 LOG ONTO WWW.LOGANBANNER.COM FOR ARCHIVE • GAMES • FeaTURES • E-EDITION • POLLS & MORE HE OGAN ANNER COMMENTARYT WEATHER L SPORTS B ONLINE Public opinion for Sunny. High mid 80s. Hooping it Busy? Find us statewide growth Low in the lower 60s up.... Page online, anytime at: .... Page A4 .... Page A3 B1 loganbanner.com Vol. 126, Number 75 THURSDAY, JUNE 20, 2013 50 cents daily 2 lawsuits filed over runoff from reclaimed mines HUNTINGTON, W.Va. Pocahontas Land Corp. former Southeast Ridge new strategy from the en- myth that they no lon- for Hernshaw Partners, (AP) — Environmental- and Hernshaw Partners Mine and a former Chafin vironmental community, ger threaten the environ- based in Gilbert. ists are going after the LLC in U.S. District Court Branch Coal Co. site. The which has long sued coal ment when they’re done The complaints ask a owners of two more for- in Huntington Wednesday lawsuits say state and companies over pollution producing coal. judge to stop the unpermit- mer mountaintop remov- over alleged violations of federal regulators are do- but rarely sued after their Pocahontas Land Corp. ted discharges and to fine al mines in West Virginia, the Clean Water Act. ing nothing to end water operations are reclaimed of Bluefield is a subsidiary the defendants $37,500 hoping to hold them re- The groups filed simi- pollution from old surface per day for each violation. of Virginia-based Norfolk sponsible for water pollu- lar complaints over three mines after they’ve been and released from bonding They also demand moni- tion from the sites.
    [Show full text]
  • Sugar Land Skeeters (19-9) Vs. Oklahoma City Dodgers (12-17) RHP Peter Solomon (2-0, 4.34 ERA) Vs
    Sugar Land Skeeters (19-9) vs. Oklahoma City Dodgers (12-17) RHP Peter Solomon (2-0, 4.34 ERA) vs. RHP Markus Solbach (0-1, 10.80 ERA) Tuesday, June 8th, 2021 • Chickasaw Bricktown Ballpark • Oklahoma City, OK • 7:05 p.m. • Audio: sugarlandskeeters.com I Video: MiLB.TV Game #29 Away #18 LAST NIGHT: The Skeeters split their doubleheader BLANCO FUEGO: Ronel Blanco converted his fourth ABOUT THE RECORD in Oklahoma City, losing Game 1, 4-2, and winning straight save opportunity Monday night. Blanco has Game 2, 3-2. Abraham Toro's two-run homer in a save in each of the Skeeters' last two wins and Overall Record.....................................19-9 Game 2 helped lead the Skeeters to the split. ranks tied for fourth in Triple A West in saves. Home Record......................................10-1 Current Streak......................................W 1 DON'T MESS WITH THE BULL: Abraham Toro has MiLB HONORS: RHP Brett Conine was named the Current Road Trip..................................2-3 hit .400 (10x25) with a home run and eight RBIs over Triple A West Pitcher of the Month for May by Minor Last Road Trip.......................................7-5 his last eight games. Toro was activated from the League Baseball. Conine has gone 4-0 with a 0.95 Last Homestand...................................10-1 Minor League IL and assigned to the Skeeters on ERA (2 ER/19 IP) through four games (three starts). May 27. He's recorded the lowest ERA in Triple A West and Last 5 Games.........................................2-3 leads the league in wins, opponent batting average Last 10 Games.......................................6-4 HOT STREAK: The Skeeters have won 12 of their (.167) and WHIP (0.95).
    [Show full text]
  • "What Raw Statistics Have the Greatest Effect on Wrc+ in Major League Baseball in 2017?" Gavin D
    1 "What raw statistics have the greatest effect on wRC+ in Major League Baseball in 2017?" Gavin D. Sanford University of Minnesota Duluth Honors Capstone Project 2 Abstract Major League Baseball has different statistics for hitters, fielders, and pitchers. The game has followed the same rules for over a century and this has allowed for statistical comparison. As technology grows, so does the game of baseball as there is more areas of the game that people can monitor and track including pitch speed, spin rates, launch angle, exit velocity and directional break. The website QOPBaseball.com is a newer website that attempts to correctly track every pitches horizontal and vertical break and grade it based on these factors (Wilson, 2016). Fangraphs has statistics on the direction players hit the ball and what percentage of the time. The game of baseball is all about quantifying players and being able give a value to their contributions. Sabermetrics have given us the ability to do this in far more depth. Weighted Runs Created Plus (wRC+) is an offensive stat which is attempted to quantify a player’s total offensive value (wRC and wRC+, Fangraphs). It is Era and park adjusted, meaning that the park and year can be compared without altering the statistic further. In this paper, we look at what 2018 statistics have the greatest effect on an individual player’s wRC+. Keywords: Sabermetrics, Econometrics, Spin Rates, Baseball, Introduction Major League Baseball has been around for over a century has given awards out for almost 100 years. The way that these awards are given out is based on statistics accumulated over the season.
    [Show full text]
  • 2018 Media-Guide.Pdf
    SALTDOGS BASEBALL 2018 TABLE OF CONTENTS Contact Information/Local Media 2 All-Time Highs and Lows 48 About the Lincoln Saltdogs 3 2017 American Association Final Standings 49 Haymarket Park 3 2017 American Association Team Stats 50 About Lincoln Pro Baseball 3 2017 American Association Leaders 51 About Lincoln, Nebraska 4 2017 Lincoln Saltdogs Game-by-Game Results 53 Saltdogs in MLB organizations 5 2017 Lincoln Saltdogs Team Stats 54 Media Information 6 Saltdogs Individual Game Records 55 Saltdogs Manager Bobby Brown 7 Saltdogs Team Game Records 58 Lincoln Saltdogs Coaching Staff 8 Saltdogs Individual Season Records 61 Lincoln Saltdogs 2018 Player Bios 9 Saltdogs Team Season Records 66 Front Office Staff 35 Saltdogs Individual Career Records 71 Medical Staff 38 Saltdogs Longest Hit Streaks 78 2018 Lincoln Saltdogs Schedule 39 Saltdogs All-Time Roster 79 History of the American Association 40 2018 Lincoln Saltdogs Roster/Pronunciation Guide 83 Commissioner Miles Wolff 41 American Association Staff 42 Roster Rules 43 Current American Association Franchises 44 Saltdogs All-Time Wins-Losses Breakdown 47 32 players signed to MLB clubs since 2011 Four former Saltdogs have reached “The Show” LINCOLN SALTDOGS CONTACT INFORMATION: VISITING TEAM HOTEL: RADIO STATIONS: 403 Line Drive Circle, Suite A Graduate—Lincoln ESPN 1480 Lincoln, NE 68508 141 N 9th St 402-466-3776 Main Office: (402) 474-BALL (2255) Lincoln, NE 68508 ESPN1480.com Fax: (402) 474-2254 Phone: (402) 475-4011 www.saltdogs.com [email protected] KFOR 1240 LOCAL NEWSPAPER: Press Box:
    [Show full text]
  • Kansas City Royals OFFICIAL GAME NOTES Cleveland Indians (2-4) @ Kansas City Royals (3-4) Kauffman Stadium - Saturday, April 14, 2012 - 6:10 P.M
    Kansas City Royals OFFICIAL GAME NOTES Cleveland Indians (2-4) @ Kansas City Royals (3-4) Kauffman Stadium - Saturday, April 14, 2012 - 6:10 p.m. Game #8 - Home Game #2 Fox Sports Kansas City (HD) and KCSP Radio (610 Sports) UPCOMING PITCHING PROBABLES Sunday, April 15 vs. Cleveland Indians: RHP Ubaldo Jimenez (0-0, 2.86) vs. RHP Luis Mendoza (0-1, 1.59), 1:10 p.m., FSKC (HD) & 610 Sports Monday, April 16: RHP Justin Verlander (0-1, 2.20) vs. LHP Danny Duffy (1-0, 0.00), 7:10 p.m., FSKC (HD) & 610 Sports Tuesday, April 17: LHP Drew Smyly (0-0, 2.25) vs. LHP Bruce Chen (0-0, 1.54), 7:10 p.m., FSKC (HD) & 610 Sports Wednesday, April 18: RHP Max Scherzer (0-1, 10.38) vs. TBA, 7:10 p.m., FSKC (HD) & 610 Sports Thursday, April 19: Off day at home Tonight’s game is being broadcast in Kansas City on KCSP Radio (610 Sports) and on the Royals Radio Network with Denny Matthews, Steve Physioc and Steve Stewart, while Don Free controls the dials as the producer/engineer...tonight’s game is being televised in High Definition on Fox Sports Kansas City with Ryan Lefebvre and Rex Hudler...the pre and postgame show, Royals Live, hosted by Joel Goldberg, who is joined by Royals Hall of Famer Jeff Montgomery Royals vs. Indians ROUGH FIRST LEADS TO HOME OPENER SETBACK--The Cleveland Indians spoiled an KC’s recent scuffles with the Tribe continued yesterday with otherwise glorious day at “The K” on Friday, scoring early and often with a 7-run first inning en route the 8-3 loss in the home opener...the Royals were just 3-6 here vs.
    [Show full text]
  • Between-‐Pitcher Comparisons of QOP With
    | 1 ͳǣ - Jason Wilson1,2 1. Introduction The Quality of Pitch (QOP) statistic uses PITCHf/x data to extract the trajectory, location, and speed from a single pitch and is mapped onto a -10 to 10 scale. A value of 5 or higher represents a quality MLB pitch. In March 2015 we presented an LA Dodgers case study at the SABR Analytics conference using QOP that included the following results1: 1. ůĂLJƚŽŶ<ĞƌƐŚĂǁ͛ƐŶŽŚŝƚƚĞƌŽŶ:ƵŶe 18, 2014 vs. Colorado had an objectively better pitching ƉĞƌĨŽƌŵĂŶĐĞƚŚĂŶ:ŽƐŚĞĐŬĞƚƚ͛ƐŶŽŚŝƚƚĞƌŽŶDĂLJϮϱth vs. Philadelphia. 2. :ŽƐŚĞĐŬĞƚƚ͛Ɛ2014 injury followed a statistically significant decline in his QOP that was not accompanied by a significant decline in MPH. These, and the others made in the presentation, are big claims. Can they be substantiated? Is QOP a valid statistic? Could it be a reliable new stream of information about pitch quality? The purpose of this paper is to provide evidence to MLB analysts, analyst-types, and fans of the affirmative. We originally developed QOP with the desire to provide a more comprehensive measurement for individual pitches that could be used by pitching coaches for player development and by scouts to identify player potential. The analytical and medical applications in our presentation came later. Before we can go about the business of developing QOP for such high-level applications, however, the MLB analysts with whom we have spoken are asking for evidence of the validity of QOP. How does it compare with conventional statistics like ERA and WHIP and FIP, or BABIP and OPS? Other results? Is it consistent? Is it better? This paper presents that comparison.
    [Show full text]
  • Combining Radar and Optical Sensor Data to Measure Player Value in Baseball
    sensors Article Combining Radar and Optical Sensor Data to Measure Player Value in Baseball Glenn Healey Department of Electrical Engineering and Computer Science, University of California, Irvine, CA 92617, USA; [email protected] Abstract: Evaluating a player’s talent level based on batted balls is one of the most important and difficult tasks facing baseball analysts. An array of sensors has been installed in Major League Baseball stadiums that capture seven terabytes of data during each game. These data increase interest among spectators, but also can be used to quantify the performances of players on the field. The weighted on base average cube model has been used to generate reliable estimates of batter performance using measured batted-ball parameters, but research has shown that running speed is also a determinant of batted-ball performance. In this work, we used machine learning methods to combine a three-dimensional batted-ball vector measured by Doppler radar with running speed measurements generated by stereoscopic optical sensors. We show that this process leads to an improved model for the batted-ball performances of players. Keywords: Bayesian; baseball analytics; machine learning; radar; intrinsic values; forecasting; sensors; batted ball; statistics; wOBA cube 1. Introduction The expanded presence of sensor systems at sporting events has enhanced the enjoy- ment of fans and supported a number of new applications [1–4]. Measuring skill on batted balls is of fundamental importance in quantifying player value in baseball. Traditional measures for batted-ball skill have been based on outcomes, but these measures have a low Citation: Healey, G. Combining repeatability due to the dependence of outcomes on variables such as the defense, the ball- Radar and Optical Sensor Data to park dimensions, and the atmospheric conditions [5,6].
    [Show full text]
  • Batter Handedness Project - Herb Wilson
    Batter Handedness Project - Herb Wilson Contents Introduction 1 Data Upload 1 Join with Lahman database 1 Change in Proportion of RHP PA by Year 2 MLB-wide differences in BA against LHP vs. RHP 2 Equilibration of Batting Average 4 Individual variation in splits 4 Logistic regression using Batting Average splits. 7 Logistic regressions using weighted On-base Average (wOBA) 11 Summary of Results 17 Introduction This project is an exploration of batter performance against like-handed and opposite-handed pitchers. We have long known that, collectively, batters have higher batting averages against opposite-handed pitchers. Differences in performance against left-handed versus right-handed pitchers will be referred to as splits. The generality of splits favoring opposite-handed pitchers masks variability in the magnitude of batting splits among batters and variability in splits for a single player among seasons. In this contribution, I test the adequacy of split values in predicting batter handedness and then examine individual variability to explore some of the nuances of the relationships. The data used primarily come from Retrosheet events data with the Lahman dataset being used for some biographical information such as full name. I used the R programming language for all statistical testing and for the creation of the graphics. A copy of the code is available on request by contacting me at [email protected] Data Upload The Retrosheet events data are given by year. The first step in the analysis is to upload dataframes for each year, then use the rbind function to stitch datasets together to make a dataframe and use the function colnames to add column names.
    [Show full text]