Why Do Games Take So Long?

Total Page:16

File Type:pdf, Size:1020Kb

Why Do Games Take So Long? Why do games take so long? By David W. Smith Presented June 23, 2018 SABR48, Pittsburgh, Pennsylvania Game length has been a common topic in the baseball media for several years and many have noted increasing game time over some short interval of a few years. Various explanations have been offered and different aspects of the game emphasized depending on who is doing the analysis. I decided to take a long view to examine many years to look for patterns and trends that can be measured quantitatively. The data for this study come from Retrosheet of course and here is a summary of what I had available. 183,224 Games 108 Seasons: 1908 to 2017 minus 1918 and 1919 The exclusion of 1918 and 1919 reflects the unavailability of time of game for those two seasons. I checked several newspaper sources to no avail. This coincides with the demise of the Sporting Life weekly newspaper and the monopoly that resulted for The Sporting News beginning in 1918. The Sporting Life reported time of game faithfully, but The Sporting News did not fill this need until 1920 although it was useful thereafter. Most of my analysis will have those two seasons omitted. In order to make fair comparisons, it is necessary to remove extra inning games since they would naturally be expected to take longer and skew the overall average. Games that ended early due to rain, curfew, or the like were also removed. The remainder can be termed “regulation-length” games and they are divided by whether or not the home team bats in the bottom of the 9th. The home team wins about 54% of these regulation-length games. Regulation-length Games Games Percentage of total games 8.5 80968 44.3 9.0 83516 45.7 All Regulation-length 164484 90.0 How has the length of the average game changed? Figure 1 shows the data from 1908-2017, excluding 1918 and 1919, but including the extra innings games this time to see the extreme values. Figure 1. Average minutes for all games, 1908-2017 There have been the expected annual variations and periods of rise as well as decline. However, when a linear regression is performed to determine the best fit line, the result shows an extremely strong direct relationship with the R2 value indicating that 94% of the variance in the game length is accounted for by the passage of the years. Highlights along this 110 year trip are here. Landmarks in game length First year with average of over two hours 1934 (123.6 minutes) First year with average of over two and a half hours 1954 (150.3 minutes) First year with average of over three hours 2000 (181.4 minutes) Longest average game time 2017 (188.7 minutes) The patterns of annual variation for the regulation-length games by themselves are parallel with similar high values of R2 in the linear regression. Since my focus is on the regulation-length games, I explored the time differences between the 8.5 and 9.0 games in more detail. Figure 2 has the differences between these two types of games in minutes over the 108 seasons Figure 2. Annual differences in game length between 8.5 and 9.0 inning games, 1908-2017 The average difference has increased, but there is much year to year variation. The linear regression R2 value of 0.51 shows a moderate relation, but certainly less than for the game times. There has been a general increase in the time difference, from around 6 minutes until 1930 to over 10 for most years since 2011. This result is not unexpected since increased game length likely means that each inning is getting somewhat longer. Of course, the only half inning for which we can do this exact calculation is the bottom of the 9th. Interestingly, if we take this current value of 10 minutes for the bottom of the 9th and assume that it applies to all 18 half innings, then we find an expected average of 180 minutes (3 hours) which fits the observed data well. I began researching this topic last summer during SABR47 in New York when my friend Steve Steinberg asked me what the relation was between number of pitches and game length. First I need to make a digression on the data we have. Retrosheet’s pitch data has two distinct components. The answer to Steve’s question is shown in Figure 3 for all regulation-length games from this entire period. Figure 3. Regulation game length and number of pitches. There are several points to make about this result. The R2 value of 0.73 means that the number of pitches in a game explains nearly three quarters of the variance in the time of game for this set of over 64,000 games covering more than 18 million pitches. That is a strong relation, although we would always like it to be more. I did analyze the 8.5 and 9 inning games separately and also the Roth games separately from the modern ones. The Roth data fits in extremely well with the modern information so there is no need to present separate graphs. Also the calculated slopes of the lines for 8.5 and 9 inning games are only slightly different and I therefore combined them all into this one figure. Examination of this figure shows very large ranges in both pitch totals and game times. These extremes and the averages are summarized as follows With ranges this large, it is essential to have big data sets to find statistically meaningful differences. Playing the bottom of the 9th adds an average of 10 minutes to the game. The average number of pitches for these 10 minutes is 15. With this clear importance of the number of pitches on the time of game, I then set about looking for explanations of what would make the number of pitches increase. Here are the quantitative, measurable causes of game actions I considered. Activities with possible effect on number of pitches in a game. Amount of scoring and other offensive actions Changes in proportions of different types of events The level of scoring is presented in Figure 4 along with several other offensive statistics, all for regulation-length games. Figure 4. Major offensive categories, 1908-2017 Runs are, of course, the net result of all offensive action. They are presented in the blue line here. Scoring has varied over the last 110 years, but there is no obvious upward trend to match the time of game. We have still not returned to the level of scoring seen in the first 15 years of the lively ball era although the average game length then was more than an hour less than it is now. So more scoring doesn’t give us our answer. The orange line has the average number of hits per game and the changes there are pretty close to the pattern for runs, but once again there is no systematic upward trend. Walks, shown here in green, take more pitches than other kinds of event (more details on that in a moment), but they also show little systematic change. On the other hand, strikeouts, shown in red, have changed dramatically. As the lively ball era began, the number of strikeouts per game fell, being less than six per game for both teams combined until 1930. The average stayed in the mid-7 range until 1952 when it began a steady increase to a peak of 11.6 in 1967. After the mound was lowered in 1969, the average began to drop, reaching 9.2 in 1981. However, since then there has been a steady rise (with some short-term oscillations) and the value really took off in 2006. The strikeout rate in 2017 was 16.2 per game, the first time it has passed 16. We must address home runs as well and those annual rates are in Figure 5. Figure 5. Home runs per regulation-length game, 1908 – 2017, both teams combined . Home runs have certainly increased since 1908, but there have been boom and bust years here as well. As expected, there was a surge with the introduction of the lively ball in 1920, but that ended dramatically in 1940, with a drop of 42% to 0.7 per game in 1943, perhaps reflecting changes in the construction of the ball due to wartime shortages. However, that slack time was followed by a dramatic upsurge from 1945 to 1961 when it reached 1.9 per game. The next dramatic point was in 1987 which has been written about a great deal. There is no satisfactory explanation for this 16% spurt in a single year although there was much speculation at the time about a “juiced” ball. Sports Illustrated published a study in which the physical properties of the 1987 ball were studied and nothing was detected to account for this large increase. The decline of 28% the next year is equally mysterious. At any rate, the next sustained increase was from 1992 to 1999, followed by a slow decline to 2014 when it was 1.7. In the four seasons since (2014 to 2017), we have seen an extraordinary 46% increase to last year’s all-time high of just under 2.5 per game. The R2 shows a strong relationship over time. I go through all this detail to make the point that there is a strong relationship between home run increase and strikeout increase. This is shown clearly in Figure 6.
Recommended publications
  • 2020 MLB Ump Media Guide
    the 2020 Umpire media gUide Major League Baseball and its 30 Clubs remember longtime umpires Chuck Meriwether (left) and Eric Cooper (right), who both passed away last October. During his 23-year career, Meriwether umpired over 2,500 regular season games in addition to 49 Postseason games, including eight World Series contests, and two All-Star Games. Cooper worked over 2,800 regular season games during his 24-year career and was on the feld for 70 Postseason games, including seven Fall Classic games, and one Midsummer Classic. The 2020 Major League Baseball Umpire Guide was published by the MLB Communications Department. EditEd by: Michael Teevan and Donald Muller, MLB Communications. Editorial assistance provided by: Paul Koehler. Special thanks to the MLB Umpiring Department; the National Baseball Hall of Fame and Museum; and the late David Vincent of Retrosheet.org. Photo Credits: Getty Images Sport, MLB Photos via Getty Images Sport, and the National Baseball Hall of Fame and Museum. Copyright © 2020, the offiCe of the Commissioner of BaseBall 1 taBle of Contents MLB Executive Biographies ...................................................................................................... 3 Pronunciation Guide for Major League Umpires .................................................................. 8 MLB Umpire Observers ..........................................................................................................12 Umps Care Charities .................................................................................................................14
    [Show full text]
  • Chadwick Documentation Release 0.9.0
    Chadwick Documentation Release 0.9.0 T. L. Turocy Jan 04, 2021 Contents 1 Introduction 1 2 Command-line tools 3 i ii CHAPTER 1 Introduction Chadwick is a collection of command-line utility programs for extracting information from baseball play-by-play and boxscore files in the DiamondWare format, as used by Retrosheet (http://www.retrosheet.org). 1.1 Author Chadwick is written, maintained, and Copyright 2002-2020 by Dr T. L. Turocy (ted.turocy <aht> gmail <daht> com) at Chadwick Baseball Bureau (http://www.chadwick-bureau.com). 1.2 License Chadwick is licensed under the terms of the GNU General Public License. If the GPL doesn’t meet your needs, contact the author for other licensing possibilities. 1.3 Development The Chadwick source code is managed using git, at https://github.com/chadwickbureau/chadwick. Bugs in Chadwick should be reported to the issue tracker on github at https://github.com/chadwickbureau/chadwick/ issues. Please be as specific as possible in reporting a bug, including the version of Chadwick you are using, the operating system(s) you’re using, and a detailed list of steps to reproduce the issue. 1.4 Community To get the latest news on the Chadwick tool suite, you can: • Subscribe to the Chadwick Baseball Bureau’s twitter feed (@chadwickbureau); 1 Chadwick Documentation, Release 0.9.0 • Like the Chadwick Baseball Bureau on Facebook; • Read the Chadwick Baseball Bureau’s blog at (http://www.chadwick-bureau.com/blog/) 1.5 Acknowledgments The author thanks Sports Reference, LLC, the Society for American Baseball Research, and XMLTeam, Inc.
    [Show full text]
  • Time Between Pitches: Cause of Long Games? by David W
    Time Between Pitches: Cause of Long Games? By David W. Smith Presented June 29, 2019 SABR49, San Diego, California The length of the average game continues to be a major topic for MLB and the baseball press as it has been above three hours for several years. Many factors have been suggested to account for the longer games and I addressed several of these last year by looking at patterns over the past 110 seasons (https://www.retrosheet.org/Research/SmithD/WhyDoGamesTakeSoLong.pdf). The two strongest connections I found were increases in the number of strikeouts and the overall number of pitches. One possibility that has received a great deal of attention is the time between pitches and in fact MLB has considered instituting a 20-second clock with the bases empty although that has not been implemented. At last year’s SABR convention, Eliza Richardson Malone presented the results of her study of 31 starting pitchers in 2017. Although her data set was limited, her conclusion was clear, namely that she found very few pitch intervals exceeding 20 seconds. Therefore the proposal to force pitchers to throw within 20 seconds would not have a significant impact on game length. With the help of Major League Baseball Advanced Media (MLBAM), I have the precise time down to the second that each pitch was thrown in every game in 2018 with the exceptions of the two games played in Puerto Rico and the three in Mexico. Therefore I had 717,410 pitches to study. By the way, that works out to just under 148 pitches per team per game.
    [Show full text]
  • National Pastime a REVIEW of BASEBALL HISTORY
    THE National Pastime A REVIEW OF BASEBALL HISTORY CONTENTS The Chicago Cubs' College of Coaches Richard J. Puerzer ................. 3 Dizzy Dean, Brownie for a Day Ronnie Joyner. .................. .. 18 The '62 Mets Keith Olbermann ................ .. 23 Professional Baseball and Football Brian McKenna. ................ •.. 26 Wallace Goldsmith, Sports Cartoonist '.' . Ed Brackett ..................... .. 33 About the Boston Pilgrims Bill Nowlin. ..................... .. 40 Danny Gardella and the Reserve Clause David Mandell, ,................. .. 41 Bringing Home the Bacon Jacob Pomrenke ................. .. 45 "Why, They'll Bet on a Foul Ball" Warren Corbett. ................. .. 54 Clemente's Entry into Organized Baseball Stew Thornley. ................. 61 The Winning Team Rob Edelman. ................... .. 72 Fascinating Aspects About Detroit Tiger Uniform Numbers Herm Krabbenhoft. .............. .. 77 Crossing Red River: Spring Training in Texas Frank Jackson ................... .. 85 The Windowbreakers: The 1947 Giants Steve Treder. .................... .. 92 Marathon Men: Rube and Cy Go the Distance Dan O'Brien .................... .. 95 I'm a Faster Man Than You Are, Heinie Zim Richard A. Smiley. ............... .. 97 Twilight at Ebbets Field Rory Costello 104 Was Roy Cullenbine a Better Batter than Joe DiMaggio? Walter Dunn Tucker 110 The 1945 All-Star Game Bill Nowlin 111 The First Unknown Soldier Bob Bailey 115 This Is Your Sport on Cocaine Steve Beitler 119 Sound BITES Darryl Brock 123 Death in the Ohio State League Craig
    [Show full text]
  • Applied Operational Management Techniques for Sabermetrics
    Applied Operational Management Techniques for Sabermetrics An Interactive Qualifying Project Report submitted to the faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degree of Bachelor of Science by Rory Fuller ______________________ Kevin Munn ______________________ Ethan Thompson ______________________ May 28, 2005 ______________________ Brigitte Servatius, Advisor Abstract In the growing field of sabermetrics, storage and manipulation of large amounts of statistical data has become a concern. Hence, construction of a cheap and flexible database system would be a boon to the field. This paper aims to briefly introduce sabermetrics, show why it exists, and detail the reasoning behind and creation of such a database. i Acknowledgements We acknowledge first and foremost the great amount of work and inspiration put forth to this project by Pat Malloy. Working alongside us on an attached ISP, Pat’s effort and organization were critical to the success of this project. We also recognize the source of our data, Project Scoresheet from retrosheet.org. The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at 20 Sunset Rd., Newark, DE 19711. We must not forget our advisor, Professor Brigitte Servatius. Several of the ideas and sources employed in this paper came at her suggestion and proved quite valuable to its eventual outcome. ii Table of Contents Title Page Abstract i Acknowledgements ii Table of Contents iii 1. Introduction 1 2. Sabermetrics, Baseball, and Society 3 2.1 Overview of Baseball 3 2.2 Forerunners 4 2.3 What is Sabermetrics? 6 2.3.1 Why Use Sabermetrics? 8 2.3.2 Some Further Financial and Temporal Implications of Baseball 9 3.
    [Show full text]
  • Triple Plays Analysis
    A Second Look At The Triple Plays By Chuck Rosciam This analysis updates my original paper published on SABR.org and Retrosheet.org and my Triple Plays sub-website at SABR. The origin of the extensive triple play database1 from which this analysis stems is the SABR Triple Play Project co-chaired by myself and Frank Hamilton with the assistance of dozens of SABR researchers2. Using the original triple play database and updating/validating each play, I used event files and box scores from Retrosheet3 to build a current database containing all of the recorded plays in which three outs were made (1876-2019). In this updated data set 719 triple plays (TP) were identified. [See complete list/table elsewhere on Retrosheet.org under FEATURES and then under NOTEWORTHY EVENTS]. The 719 triple plays covered one-hundred-forty-four seasons. 1890 was the Year of the Triple Play that saw nineteen of them turned. There were none in 1961 and in 1974. On average the number of TP’s is 4.9 per year. The number of TP’s each year were: Total Triple Plays Each Year (all Leagues) Ye a r T P's Ye a r T P's Ye a r T P's Ye a r T P's Ye a r T P's Ye a r T P's <1876 1900 1 1925 7 1950 5 1975 1 2000 5 1876 3 1901 8 1926 9 1951 4 1976 3 2001 2 1877 3 1902 6 1927 9 1952 3 1977 6 2002 6 1878 2 1903 7 1928 2 1953 5 1978 6 2003 2 1879 2 1904 1 1929 11 1954 5 1979 11 2004 3 1880 4 1905 8 1930 7 1955 7 1980 5 2005 1 1881 3 1906 4 1931 8 1956 2 1981 5 2006 5 1882 10 1907 3 1932 3 1957 4 1982 4 2007 4 1883 2 1908 7 1933 2 1958 4 1983 5 2008 2 1884 10 1909 4 1934 5 1959 2
    [Show full text]
  • The 3-6-3 Double Play
    The 3-6-3 Double Play By Christopher Chestnut January 5, 2010 E-mail: [email protected] Purpose The purpose of this study is to determine which major league first baseman turned the most 3-6-3 double plays. Data Sources The information used in this study was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at www.retrosheet.org. Definitions 3-6-3 Double Play – the first baseman fields a ground ball and throws to the shortstop to retire the runner on first. The shortstop returns the throw to the first baseman to retire the batter/runner. Opportunity – all ground balls (including bunts, excluding base hits) fielded by the first baseman with a runner on first (additional runners may be on second and/or third) and less than two outs. Attempt – all opportunities where the first play by the first baseman is a throw to the shortstop. Methodology I used play-by-play data from Retrosheet to support my analysis. The available data includes the American and National Leagues for 1952-2009. I processed the data for each season to generate a list of all players with at least one opportunity in any one of these eight situations: No outs, runner on first No outs, runners on first and second No outs, runners on first and third No outs, bases full One out, runner on first One out, runners on first and second One out, runners on first and third One out, bases full 1 of 7 The 3-6-3 Double Play January 5, 2010 Results Filtering out the non-ground balls and base hits, I found: 1,319 players
    [Show full text]
  • The First Baseman Fields a Ground Ball and Throws to the Shortstop to Retire the Runner on First
    The 3-6-3 Double Play By Christopher Chestnut July 2, 2008 Definitions 3-6-3 Double Play – the first baseman fields a ground ball and throws to the shortstop to retire the runner on first. The shortstop returns the throw to the first baseman to retire the batter/runner. Opportunity – all situations with a runner on first (additional runners may be on second and/or third), less than two outs, and a ground ball fielded by the first baseman. This excludes line drives and pop-ups, but includes bunted grounders. In addition to the double play, other play results include base hits, errors, or some combination of runners retired. Data Sources The information used here was obtained free of charge from and is copyrighted by Retrosheet. Interested parties may contact Retrosheet at www.retrosheet.org. The available Retrosheet data includes the National League for 1911, 1921, 1922, and 1953 and the American and National Leagues for 1954-2007. Additional data was used from www.mlb.com and www.baseball-reference.com. Summary The available data shows 4,538 3-6-3 double plays turned by 1,297 different first basemen in 72,097 opportunities. The overall percentage of 3-6-3 double plays turned is 6.29%. Left-handed first basemen converted 6.51% (1,784 out of 27,387) while right-handers converted 6.16% (2,754 of 44,710). Gil Hodges holds the single season record for most opportunities with 117 in 1954 (he converted six). Albert Pujols has converted the most in a single season, with 13 in 2007 (in 58 opportunities).
    [Show full text]
  • Greenberg and Interleague Play Tigers in Danger from Train Fire
    Official Publication of Retrosheet, Inc. Volume 5, Number 4 December 1, 1998 Greenberg and Interleague Play As we come to the close of another year, it is conventional to summarize the big events of the last 12 months. We have done a lot this The last issue of TRS carried an article concerning interleague play year, for example, as discussed in the late 1950s. Hank Greenberg was the originator passing the 50,000 of the idea and our crack staff has tracked down the information. In View from mark in total May 1954, Greenberg, then GM of the Indians, offered a plan for games entered, games that count in the standings to be played between all NL and the Vault AL clubs. gathering David Smith, thousands more President Greenberg’s plan included four games between each AL and NL game accounts club, with two at home and two away. The intraleague contests from many sources would be cut from eleven to nine (home and away) to and increasing our visibility with the general accommodate these new games. However, that arrangement public as well as many Major League teams. would have to be modified each year since the eight game inter- league sets would add 32 games while the intraleague reductions However, this is also a good opportunity to would only account for 28 games. Greenberg said that the details think of ways to improve the organization in could be worked out later but the idea was to have a home and the coming year. To me there is one area away engagement with each club.
    [Show full text]
  • Newsletter Vol 6 Nbr 3.Pub
    6HSWHPEHU ,QVLGH 9ROXPH 1XPEHU 0HHWLQJ 5HSRUW &RXUWHV\ 5XQQHU 6WUDQJH 3OD\V The Retro Sheet 1HZ 'HEXW 'DWH 2IILFLDO 3XEOLFDWLRQ RI 5HWURVKHHW ,QF There are three big items of news since the last *DPH $FFRXQW $FTXLVLWLRQV issue of TRS: 1) we have completed the acquisi- tion and computer entry of all games from 1974- By David W. Smith forward; 2) thanks to the gen- erosity of Steve Gietschier The news continues to be good in this area. For quite some time we have needed and The Sporting News, we 13 games from 1975 and 1976, all played in Atlanta. In late July I received an e- have microfilm copies of the mail message from Brian Westgate, who lives in Texas. Brian has hundreds of daily totals for NL players for game accounts on paper and a sizable number on audio tape. He had seen the almost all of the 20th century; lists of games we needed in our "Most Wanted" feature on the web page and very 3) we received permission generously donated copies of four of his scoresheets plus audio tapes of five oth- David W. Smith ers. We were able to obtain the last few we needed from another source so that from Total Sports to post President we now have complete coverage of all games from 1974-forward, a tremendously 1984-1990 event files on our pleasing set of games. The "most wanted" page has paid off for us at least four web site and the first two separate times; we are keeping it updated. seasons went up in July.
    [Show full text]
  • Major Qualifying Project: Advanced Baseball Statistics
    Major Qualifying Project: Advanced Baseball Statistics Matthew Boros, Elijah Ellis, Leah Mitchell Advisors: Jon Abraham and Barry Posterro April 30, 2020 Contents 1 Background 5 1.1 The History of Baseball . .5 1.2 Key Historical Figures . .7 1.2.1 Jerome Holtzman . .7 1.2.2 Bill James . .7 1.2.3 Nate Silver . .8 1.2.4 Joe Peta . .8 1.3 Explanation of Baseball Statistics . .9 1.3.1 Save . .9 1.3.2 OBP,SLG,ISO . 10 1.3.3 Earned Run Estimators . 10 1.3.4 Probability Based Statistics . 11 1.3.5 wOBA . 12 1.3.6 WAR . 12 1.3.7 Projection Systems . 13 2 Aggregated Baseball Database 15 2.1 Data Sources . 16 2.1.1 Retrosheet . 16 2.1.2 MLB.com . 17 2.1.3 PECOTA . 17 2.1.4 CBS Sports . 17 2.2 Table Structure . 17 2.2.1 Game Logs . 17 2.2.2 Play-by-Play . 17 2.2.3 Starting Lineups . 18 2.2.4 Team Schedules . 18 2.2.5 General Team Information . 18 2.2.6 Player - Game Participation . 18 2.2.7 Roster by Game . 18 2.2.8 Seasonal Rosters . 18 2.2.9 General Team Statistics . 18 2.2.10 Player and Team Specific Statistics Tables . 19 2.2.11 PECOTA Batting and Pitching . 20 2.2.12 Game State Counts by Year . 20 2.2.13 Game State Counts . 20 1 CONTENTS 2 2.3 Conclusion . 20 3 Cluster Luck 21 3.1 Quantifying Cluster Luck . 22 3.2 Circumventing Cluster Luck with Total Bases .
    [Show full text]
  • Clutch and Choke Hitters in Major League Baseball: Romantic Myth Or Empirical Fact
    Clutch and Choke Hitters in Major League Baseball: Romantic Myth or Empirical Fact. By: Elan Fuld 1st Released draft May 19th 2005 Page 2 ABSTRACT For a long time now, sabermetricians (baseball statisticians) believed that, at the Major League level, hitters who are truly clutch or choke performers1 do not really exist, even while the many managers, players, and announcers insist that they do exist. We performed statistical analyses to determine if clutch and/or choke hitters exist in the major leagues using all regular season play during the 1974-1992 seasons. We use a Gompertz regression2, a regression similar to logistic regression, but without symmetry around the point of inflection. Our independent variable is a probabilistically constructed measure of the ex ante importance of each plate appearance, and our dependent variable is based upon the value that OPS (on-base percentage plus slugging percentage) would assign to the outcome of the plate appearance.3 We find that clutch/choke effects do exist at the major league level. We also find that a relatively small proportion of major leaguers exhibit such effects. According to our findings, only about 2% of major leaguers are in fact either clutch or choke hitters.4 Furthermore, we find that clutch hitters are not always the players commonly thought to be clutch hitters. In some cases, they are the last people thought to be. For example, one of only two men to show up as clutch at the .001 significance level, regardless of how we treat reaching on an error in the regression, is a man best known for his choke performance defensively: Bill Buckner.
    [Show full text]