
machine learning GOING LONG ON MACHINE LEARNING How AWS and the NFL teamed up to transform a 100-year-old league TABLE OF CONTENTS Introduction 03 Letter from Michelle McKenna, SVP and CIO of the NFL What are the odds? 04 Opening story What happened? 05 The data behind the stat The factors 06 How is Completion Probability calculated? The mechanics 08 How does Next Gen Stats calculate Completion Probability? The formula 09 How the machine learning models are trained The journey 11 The NFL’s machine learning journey The problem 13 Why machine learning? The end results 14 Measuring the impact Ready for more? 15 Taking machine learning beyond the end zones Get in the game 16 Resources Glossary 17 More stats 2 INTRODUCTION If you grew up in a Web Services. Working with AWS just made sense because of their football family like I did, flexibility, security, and ability to scale. you might already know that the AWS has the broadest offering of NFL is America’s largest sports cloud services for our builders to We’re excited to share organization, with over 188 million build upon—giving us the ideal this portion of our fans worldwide. We’re a big content platform to grow. “ creator—NFL games accounted for journey with you and For me, getting started with machine 38 of last year’s top 50 telecasts. learning was not a question of “why” help you see a little bit We’re also a big data creator— but of “how.” I knew machine learning every week our league generates of what we’ve been up would transform our company. I also 3terabytes, equivalent to 1,500 knew our technology projects needed to at the NFL. hours, of data. to be supported at the top level of As CIO of the NFL, I’m responsible the organization to get both the for ensuring we leverage our data to business and technical teams We’re working to build a better create the best and most efficient working together and sharing playing, coaching, and viewing technology solutions that will evolve the same priorities. experience. Thanks to AWS machine our game, engage our fans, and learning, we’re revolutionizing a Implementing machine learning protect and develop our players. 100-year-old league. benefits the entire company, not just the technology department. Machine learning has made the NFL’s production teams more efficient— transforming previously tedious roles Transforming a like video labeling into an automated, 100-year-old league streamlined process. Coaches can use “ the technology to officiate playbook is not an easy job. formations and automatically draw Likewise, the NFL’s out plays, saving them time on the sidelines. With the power of machine machine learning learning on AWS, we can better journey has not been a understand fan engagement, how a game is presented, the potential straightforward path. impact of adjusting the rules of play, how the game is called, and player performance and safety. Next Gen Michelle McKenna Next Gen Stats, one of our machine Stats allows us to use this real-time Senior Vice President and CIO, NFL learning projects—is the product of data to engage, inform, and empower our working partnership with Amazon fans in new and unique ways. 3 WHAT ARE THE ODDS? It was Sunday night, week one in the 2018 season, fourth quarter. The Green Bay Packers were down 3 to 20 against longtime rival the Chicago Bears. Aaron Rodgers, who sat out much of the first half from a knee injury, was back in the game, but things weren’t looking good. Second and 2, and Corey Linsley snaps to Rodgers. Man-to-man coverage. All eyes are on Rodgers, who appears to have time, except nobody down the field looks open. Rodgers steps back and launches. FPO Not all passes are created equal. When players defy the odds, we are exposed to how talented they truly are. But this often doesn’t get represented by traditional box score stats, which would score Rodgers the same whether his pass traveled three yards behind the line of scrimmage to an open running back or whether his pass did what happened next. The ball sails 39 yards down the field toward the back-right corner of the Bears’ end zone, and the Bears’ Kyle Fuller is all over Geronimo Allison, the target of the last several of Rodgers’ first-down throws. Everyone can see it’s an unlikely catch as it arcs toward the pylon. But how unlikely? Next Gen Stats, powered by machine learning models built on Amazon SageMaker from AWS, had just launched another new metric for the 2018 season called Completion Probability, which leverages tracking data to improve upon the limitations of raw box score stats and add context to each passing play. Next Gen Stats calculated the pass had just a 14.7% Completion Probability—the most improbable completion that week. Allison leaps with Fuller on his back, in full reach, fingers wide to swat the ball, and misses by what seems like inches. The ball lands right in the pocket as Allison cradles it close and plants two feet with full control before sliding out of bounds for a touchdown. This is the beginning of the end for the Bears, who ultimately lose the game 23-24 to a significant fourth-quarter comeback for the Packers. 4 WHAT HAPPENED? The data behind the stat By Matt Swensson, Vice President of Emerging Products and Technology of the NFL Completion Probability is measured All of those factors, among several using more than 10 different in-play others, had a direct relationship with factors starting with data transmitted the likelihood Rodgers’ pass would by RFID chips in the football and on be complete or incomplete. We players’ shoulder pads all collected by can evaluate these relationships by RF receivers around the stadium. plotting each in-play factor against the actual completion percentage In the case of Rodgers, the data to better understand each factor’s shows the pass traveling 60.3 yards in effect on the outcome of a play and the air from the location of Rodgers contextualize the difficulty of a throw. at the time of the throw to Allison at the time of the catch. Rodgers Let’s review some of these factors had 2.1 yards of separation from and examine how the predictive Jonathan Bullard when he released models were trained. the ball, and Allison had 0.9 yards of separation from Kyle Fuller at the moment of the catch. 5 MLUNDERSTANDING DIFFERENCE THE FACTORS Completion Probability’s top factors 1.0 Air Distance 0.8 The further the ball has to travel, the lower the likelihood of completion. This 1 is measured by the air distance – the true 0.6 distance from the location the ball is thrown to where it is caught. Passes traveling xComp 0.4 more than 40 air distance yards have approximately 20% chance of completion. 0.2 1.0 0.0 0 10 20 30 40 50 60 70 Air distance 0.8 0.6 Target Separation xComp As the distance between the receiver and 0.4 nearest defender increases, the likelihood 2 of a completion also increases. The larger 0.2 circles at lower target separation show that it’s more common for receivers to have close defenders. 0.0 0 2 4 6 8 10 12 14 Target separation at pass arried Sideline Separation As the distance between the receiver and the sideline decreases, the likelihood of a 3 completion also decreases. The probability of a completed pass decreases rapidly at five yards of sideline separation. Controlling for all other factors, passes to the sideline xComp just inside the white paint have a roughly 30% chance of a completion. After about 10 yards, we see diminishing returns. Separation from sideline 6 MLUNDERSTANDING DIFFERENCE THE FACTORS 1.0 Pass Rush Separation 0.8 As the distance between the quarterback and nearest pass rusher at the time of 4 the throw decreases, the likelihood of a 0.6 completion also decreases. A quarterback xComp throwing with no defenders around has 0.4 a higher probability of a completed pass compared to a quarterback with a pass 0.2 rusher within a few yards at the time of the throw. 0.0 0 1 2 3 4 5 Closest to b 1.0 0.8 Passer Speed As the speed of the quarterback at the time 0.6 of the throw increases, the likelihood of a 5 completed pass decreases. Speeds below xComp 8 MPH have little effect on the probability 0.4 of a completion. However, as the speed of the quarterback increases above 8 MPH, the 0.2 chance of a completion decreases. 0.0 0 2 4 6 8 10 12 14 16 18 1.0 Passer speed at pass forward 0.8 Time to Throw 0.6 Most passes occur between 2 and 3 seconds after the snap. As the duration xComp 6 of time increases from snap to throw, the 0.4 likelihood of a completed pass decreases. The probability of a completion declines 0.2 significantly after 3 seconds. 0.0 0 1 2 3 4 5 6 Time to throw These are just a few of the data points measured and fed into machine learning model to develop the Next Gen Stats Completion Probability metric. Next we’ll explore why the NFL decided to use machine learning. 7 THE MECHANICS How does Next Gen Stats calculate Completion Probability? Amazon SageMaker Building and training machine learning models used to be By Jarvis Lee, AWS Data Scientist and Tyler Mullenbach, AWS Practice Manager locked in the ivory towers of elite developers and data By leveraging AWS’ broad range Stats team to reflect the trends and scientists.
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages19 Page
-
File Size-