Data Mining and Its Application to Baseball Stats CSU Stanislaus
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
NCAA Division I Baseball Records
Division I Baseball Records Individual Records .................................................................. 2 Individual Leaders .................................................................. 4 Annual Individual Champions .......................................... 14 Team Records ........................................................................... 22 Team Leaders ............................................................................ 24 Annual Team Champions .................................................... 32 All-Time Winningest Teams ................................................ 38 Collegiate Baseball Division I Final Polls ....................... 42 Baseball America Division I Final Polls ........................... 45 USA Today Baseball Weekly/ESPN/ American Baseball Coaches Association Division I Final Polls ............................................................ 46 National Collegiate Baseball Writers Association Division I Final Polls ............................................................ 48 Statistical Trends ...................................................................... 49 No-Hitters and Perfect Games by Year .......................... 50 2 NCAA BASEBALL DIVISION I RECORDS THROUGH 2011 Official NCAA Division I baseball records began Season Career with the 1957 season and are based on informa- 39—Jason Krizan, Dallas Baptist, 2011 (62 games) 346—Jeff Ledbetter, Florida St., 1979-82 (262 games) tion submitted to the NCAA statistics service by Career RUNS BATTED IN PER GAME institutions -
Sabermetrics: the Past, the Present, and the Future
Sabermetrics: The Past, the Present, and the Future Jim Albert February 12, 2010 Abstract This article provides an overview of sabermetrics, the science of learn- ing about baseball through objective evidence. Statistics and baseball have always had a strong kinship, as many famous players are known by their famous statistical accomplishments such as Joe Dimaggio’s 56-game hitting streak and Ted Williams’ .406 batting average in the 1941 baseball season. We give an overview of how one measures performance in batting, pitching, and fielding. In baseball, the traditional measures are batting av- erage, slugging percentage, and on-base percentage, but modern measures such as OPS (on-base percentage plus slugging percentage) are better in predicting the number of runs a team will score in a game. Pitching is a harder aspect of performance to measure, since traditional measures such as winning percentage and earned run average are confounded by the abilities of the pitcher teammates. Modern measures of pitching such as DIPS (defense independent pitching statistics) are helpful in isolating the contributions of a pitcher that do not involve his teammates. It is also challenging to measure the quality of a player’s fielding ability, since the standard measure of fielding, the fielding percentage, is not helpful in understanding the range of a player in moving towards a batted ball. New measures of fielding have been developed that are useful in measuring a player’s fielding range. Major League Baseball is measuring the game in new ways, and sabermetrics is using this new data to find better mea- sures of player performance. -
Understanding Advanced Baseball Stats: Hitting
Understanding Advanced Baseball Stats: Hitting “Baseball is like church. Many attend few understand.” ~ Leo Durocher Durocher, a 17-year major league vet and Hall of Fame manager, sums up the game of baseball quite brilliantly in the above quote, and it’s pretty ridiculous how much fans really don’t understand about the game of baseball that they watch so much. This holds especially true when you start talking about baseball stats. Sure, most people can tell you what a home run is and that batting average is important, but once you get past the basic stats, the rest is really uncharted territory for most fans. But fear not! This is your crash course in advanced baseball stats, explained in plain English, so that even the most rudimentary of fans can become knowledgeable in the mysterious world of baseball analytics, or sabermetrics as it is called in the industry. Because there are so many different stats that can be covered, I’m just going to touch on the hitting stats in this article and we can save the pitching ones for another piece. So without further ado – baseball stats! The Slash Line The baseball “slash line” typically looks like three different numbers rounded to the thousandth decimal place that are separated by forward slashes (hence the name). We’ll use Mike Trout‘s 2014 slash line as an example; this is what a typical slash line looks like: .287/.377/.561 The first of those numbers represents batting average. While most fans know about this stat, I’ll touch on it briefly just to make sure that I have all of my bases covered (baseball pun intended). -
The Rules of Scoring
THE RULES OF SCORING 2011 OFFICIAL BASEBALL RULES WITH CHANGES FROM LITTLE LEAGUE BASEBALL’S “WHAT’S THE SCORE” PUBLICATION INTRODUCTION These “Rules of Scoring” are for the use of those managers and coaches who want to score a Juvenile or Minor League game or wish to know how to correctly score a play or a time at bat during a Juvenile or Minor League game. These “Rules of Scoring” address the recording of individual and team actions, runs batted in, base hits and determining their value, stolen bases and caught stealing, sacrifices, put outs and assists, when to charge or not charge a fielder with an error, wild pitches and passed balls, bases on balls and strikeouts, earned runs, and the winning and losing pitcher. Unlike the Official Baseball Rules used by professional baseball and many amateur leagues, the Little League Playing Rules do not address The Rules of Scoring. However, the Little League Rules of Scoring are similar to the scoring rules used in professional baseball found in Rule 10 of the Official Baseball Rules. Consequently, Rule 10 of the Official Baseball Rules is used as the basis for these Rules of Scoring. However, there are differences (e.g., when to charge or not charge a fielder with an error, runs batted in, winning and losing pitcher). These differences are based on Little League Baseball’s “What’s the Score” booklet. Those additional rules and those modified rules from the “What’s the Score” booklet are in italics. The “What’s the Score” booklet assigns the Official Scorer certain duties under Little League Regulation VI concerning pitching limits which have not implemented by the IAB (see Juvenile League Rule 12.08.08). -
FROM BULLDOGS to SUN DEVILS the EARLY YEARS ASU BASEBALL 1907-1958 Year ...Record
THE TRADITION CONTINUES ASUBASEBALL 2005 2005 SUN DEVIL BASEBALL 2 There comes a time in a little boy’s life when baseball is introduced to him. Thus begins the long journey for those meant to play the game at a higher level, for those who love the game so much they strive to be a part of its history. Sun Devil Baseball! NCAA NATIONAL CHAMPIONS: 1965, 1967, 1969, 1977, 1981 2005 SUN DEVIL BASEBALL 3 ASU AND THE GOLDEN SPIKES AWARD > For the past 26 years, USA Baseball has honored the top amateur baseball player in the country with the Golden Spikes Award. (See winners box.) The award is presented each year to the player who exhibits exceptional athletic ability and exemplary sportsmanship. Past winners of this prestigious award include current Major League Baseball stars J. D. Drew, Pat Burrell, Jason Varitek, Jason Jennings and Mark Prior. > Arizona State’s Bob Horner won the inaugural award in 1978 after hitting .412 with 20 doubles and 25 RBI. Oddibe McDowell (1984) and Mike Kelly (1991) also won the award. > Dustin Pedroia was named one of five finalists for the 2004 Golden Spikes Award. He became the seventh all-time final- ist from ASU, including Horner (1978), McDowell (1984), Kelly (1990), Kelly (1991), Paul Lo Duca (1993) and Jacob Cruz (1994). ODDIBE MCDOWELL > With three Golden Spikes winners, ASU ranks tied for first with Florida State and Cal State Fullerton as the schools with the most players to have earned college baseball’s top honor. BOB HORNER GOLDEN SPIKES AWARD WINNERS 2004 Jered Weaver Long Beach State 2003 Rickie Weeks Southern 2002 Khalil Greene Clemson 2001 Mark Prior Southern California 2000 Kip Bouknight South Carolina 1999 Jason Jennings Baylor 1998 Pat Burrell Miami 1997 J.D. -
A Statistical Study Nicholas Lambrianou 13' Dr. Nicko
Examining if High-Team Payroll Leads to High-Team Performance in Baseball: A Statistical Study Nicholas Lambrianou 13' B.S. In Mathematics with Minors in English and Economics Dr. Nickolas Kintos Thesis Advisor Thesis submitted to: Honors Program of Saint Peter's University April 2013 Lambrianou 2 Table of Contents Chapter 1: The Study and its Questions 3 An Introduction to the project, its questions, and a breakdown of the chapters that follow Chapter 2: The Baseball Statistics 5 An explanation of the baseball statistics used for the study, including what the statistics measure, how they measure what they do, and their strengths and weaknesses Chapter 3: Statistical Methods and Procedures 16 An introduction to the statistical methods applied to each statistic and an explanation of what the possible results would mean Chapter 4: Results and the Tampa Bay Rays 22 The results of the study, what they mean against the possibilities and other results, and a short analysis of a team that stood out in the study Chapter 5: The Continuing Conclusion 39 A continuation of the results, followed by ideas for future study that continue to project or stem from it for future baseball analysis Appendix 41 References 42 Lambrianou 3 Chapter 1: The Study and its Questions Does high payroll necessarily mean higher performance for all baseball statistics? Major League Baseball (MLB) is a league of different teams in different cities all across the United States, and those locations strongly influence the market of the team and thus the payroll. Year after year, a certain amount of teams, including the usual ones in big markets, choose to spend a great amount on payroll in hopes of improving their team and its player value output, but at times the statistics produced by these teams may not match the difference in payroll with other teams. -
OFFICIAL GAME INFORMATION Lake County Captains (14-15) Vs
High-A Affiliate OFFICIAL GAME INFORMATION Lake County Captains (14-15) vs. Dayton Dragons (16-13) Sunday, June 6th • 1:30 p.m. • Classic Park • Broadcast: WJCU.org Game #30 • Home Game #12 • Season Series: 3-2, 19 Games Remaining RHP Mason Hickman (1-2, 3.45 ERA) vs. RHP Spencer Stockton (2-0, 3.57 ERA) YESTERDAY: The Captains’ three-game winning streak ended with a 15-4 loss to Dayton on Saturday night. Kevin Coulter surrendered seven runs on 10 hits over 1.2 innings to take the loss in a spot start. Dragons centerfielder Quin Cotton hit two home runs and drove in six High-A Central League runs to lead the Dayton offense. Dragons starter Graham Ashcraft earned the win with seven strong innings, in which he allowed just one run on two hits and struck out nine. East Division W L GB COMING ALIVE: After scoring just 12 runs and suffering a six-game sweep last week at West Michigan, the Captains have already scored 29 runs in the first five games of this series against Dayton. Will Brennan has gone 7-for-18 (.389) with two home runs, two doubles, 10 RBI and West Michigan (Detroit) 16 12 -- a 1.254 OPS. Joe Naranjo has gone 3-for-10 with a team-leading five walks for a .533 on-base percentage. Dayton (Cincinnati) 16 13 0.5 BRENNAN BASHING: Captains OF Will Brennan leads the High-A Central League (HAC) lead in doubles (11). He is second in batting average (.326), fourth in wRC+ (154), fifth in on-base percentage (.410), sixth in OPS (.920), sixth in extra-base hits (13) and ninth in slugging Great Lakes (Los Angeles - NL) 15 14 1.5 percentage (.511). -
MLB Statistics Feeds
Updated 07.17.17 MLB Statistics Feeds 2017 Season 1 SPORTRADAR MLB STATISTICS FEEDS Updated 07.17.17 Table of Contents Overview ....................................................................................................................... Error! Bookmark not defined. MLB Statistics Feeds.................................................................................................................................................. 3 Coverage Levels........................................................................................................................................................... 4 League Information ..................................................................................................................................................... 5 Team & Staff Information .......................................................................................................................................... 7 Player Information ....................................................................................................................................................... 9 Venue Information .................................................................................................................................................... 13 Injuries & Transactions Information ................................................................................................................... 16 Game & Series Information .................................................................................................................................. -
Does Sabermetrics Have a Place in Amateur Baseball?
BaseballGB Full Article Does sabermetrics have a place in amateur baseball? Joe Gray 7 March 2009 he term “sabermetrics” is one of the many The term sabermetrics combines SABR (the acronym creations of Bill James, the great baseball for the Society for American Baseball Research) and theoretician (for details of the term’s metrics (numerical measurements). The extra “e” T was presumably added to avoid the difficult-to- derivation and usage see Box 1). Several tight pronounce sequence of letters “brm”. An alternative definitions exist for the term, but I feel that rather exists without the “e”, but in this the first four than presenting one or more of these it is more letters are capitalized to show that it is a word to valuable to offer an alternative, looser definition: which normal rules of pronunciation do not apply. sabermetrics is a tree of knowledge with its roots in It is a singular noun despite the “s” at the end (that is, you would say “sabermetrics is growing in the philosophy of answering baseball questions in as popularity” rather than “sabermetrics are growing in accurate, objective, and meaningful a fashion as popularity”). possible. The philosophy is an alternative to The adjective sabermetric has been back-derived from the term and is exemplified by “a sabermetric accepting traditional thinking without question. tool”, or its plural “sabermetric tools”. The adverb sabermetrically, built on that back- Branches of the sabermetric tree derived adjective, is illustrated in the phrase “she The metaphor of sabermetrics as a tree extends to approached the problem sabermetrically”. describing the various broad concepts and themes of The noun sabermetrician can be used to describe any practitioner of sabermetrics, although to some research as branches. -
Name of the Game: Do Statistics Confirm the Labels of Professional Baseball Eras?
NAME OF THE GAME: DO STATISTICS CONFIRM THE LABELS OF PROFESSIONAL BASEBALL ERAS? by Mitchell T. Woltring A Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Science in Leisure and Sport Management Middle Tennessee State University May 2013 Thesis Committee: Dr. Colby Jubenville Dr. Steven Estes ACKNOWLEDGEMENTS I would not be where I am if not for support I have received from many important people. First and foremost, I would like thank my wife, Sarah Woltring, for believing in me and supporting me in an incalculable manner. I would like to thank my parents, Tom and Julie Woltring, for always supporting and encouraging me to make myself a better person. I would be remiss to not personally thank Dr. Colby Jubenville and the entire Department at Middle Tennessee State University. Without Dr. Jubenville convincing me that MTSU was the place where I needed to come in order to thrive, I would not be in the position I am now. Furthermore, thank you to Dr. Elroy Sullivan for helping me run and understand the statistical analyses. Without your help I would not have been able to undertake the study at hand. Last, but certainly not least, thank you to all my family and friends, which are far too many to name. You have all helped shape me into the person I am and have played an integral role in my life. ii ABSTRACT A game defined and measured by hitting and pitching performances, baseball exists as the most statistical of all sports (Albert, 2003, p. -
"What Raw Statistics Have the Greatest Effect on Wrc+ in Major League Baseball in 2017?" Gavin D
1 "What raw statistics have the greatest effect on wRC+ in Major League Baseball in 2017?" Gavin D. Sanford University of Minnesota Duluth Honors Capstone Project 2 Abstract Major League Baseball has different statistics for hitters, fielders, and pitchers. The game has followed the same rules for over a century and this has allowed for statistical comparison. As technology grows, so does the game of baseball as there is more areas of the game that people can monitor and track including pitch speed, spin rates, launch angle, exit velocity and directional break. The website QOPBaseball.com is a newer website that attempts to correctly track every pitches horizontal and vertical break and grade it based on these factors (Wilson, 2016). Fangraphs has statistics on the direction players hit the ball and what percentage of the time. The game of baseball is all about quantifying players and being able give a value to their contributions. Sabermetrics have given us the ability to do this in far more depth. Weighted Runs Created Plus (wRC+) is an offensive stat which is attempted to quantify a player’s total offensive value (wRC and wRC+, Fangraphs). It is Era and park adjusted, meaning that the park and year can be compared without altering the statistic further. In this paper, we look at what 2018 statistics have the greatest effect on an individual player’s wRC+. Keywords: Sabermetrics, Econometrics, Spin Rates, Baseball, Introduction Major League Baseball has been around for over a century has given awards out for almost 100 years. The way that these awards are given out is based on statistics accumulated over the season. -
October, 1998
By the Numbers Volume 8, Number 1 The Newsletter of the SABR Statistical Analysis Committee October, 1998 RSVP Phil Birnbaum, Editor Well, here we go again: if you want to continue to receive By the If you already replied to my September e-mail, you don’t need to Numbers, you’ll have to drop me a line to let me know. We’ve reply again. If you didn’t receive my September e-mail, it means asked this before, and I apologize if you’re getting tired of it, but that the committee has no e-mail address for you. If you do have there’s a good reason for it: our committee budget. an e-mail address but we don’t know about it, please let Neal Traven (our committee chair – see his remarks later this issue) Our budget is $500 per year. know, so that we can communicate with you more easily. Giving us your e-mail address does not register you to receive BTN by e- Our current committee member list numbers about 200. Of the mail. Unless you explicitly request that, I’ll continue to send 200 of us, 50 have agreed to accept delivery of this newsletter by BTN by regular mail. e-mail. That leaves 150 readers who need physical copies of BTN. At four issues a year, that’s 600 mailings, and there’s no As our 1998 budget has not been touched until now, we have way to do 600 photocopyings and mailings for $500. sufficient funds left over for one more full issue this year.