The Power of Prediction: Predictive Analytics, Workplace Complements, and Heterogeneous Firm Performance 1
Erik Brynjolfsson
Stanford University and NBER
[email protected]

Wang Jin
MIT Sloan School of Management
[email protected]

Kristina McElheran
University of Toronto
[email protected]
Abstract

Anecdotes abound concerning the promise of predictive analytics tools and techniques in an age of increasingly ubiquitous digital information. However, large-scale microdata on this fast-emerging practice and its nuances have been lacking. To address this gap, we worked with the U.S. Census Bureau to collect detailed responses from over 30,000 establishments in the U.S. manufacturing sector on predictive analytics and key workplace complements. This effort yielded the first statistics on the widespread adoption of predictive analytics in a large, representative panel of establishments. Focusing on plant-level productivity over time, we find a significant increase in firm performance among adopters – but almost entirely among plants that have also pursued specific tangible and intangible investments. For workplaces with high IT investment, educated employees, a high-efficiency production strategy, and certain management practices, multi-factor productivity is more than two to four percent higher with the use of predictive analytics. Instrumental variables estimates and the timing of performance gains with respect to adoption suggest a causal relationship. While the use of predictive analytics is diffusing rapidly across a wide range of firms, there are understudied strategic and organizational contingencies that determine how productive it is in practice.
Keywords: predictive analytics, productivity, complementarity, strategic commitment, management practices, information technology, digitization
1 Disclaimer: Any opinions and conclusions expressed herein are those of the authors and do not necessarily represent the views of the U.S. Census Bureau. All results have been reviewed to ensure that no confidential information is disclosed. All errors are our own.
“What information consumes is rather obvious: it consumes the attention of its recipients…The real design problem is not to provide more information to people, but [to design] intelligent information-filtering systems.”
- Herbert A. Simon (1971), Designing Organizations for an Information-Rich World

1. Introduction
According to Ransbotham et al. (2016), “the hype around data and analytics has reached a fever pitch.” The increased volume, variety, and timeliness of digital information produced by sources ranging from social media to the Internet of Things (IoT) have dramatically shifted what is measurable within firms and markets (McAfee and Brynjolfsson 2012). At the same time, exponential growth in computing power and the rise of new computational methods (in particular, new machine learning algorithms) have fueled excitement about tools and practices to extract more value from this data. According to a 2019 IDC report, worldwide revenues for big data and business analytics are forecast to reach $274.3 billion by 2022, a five-year compound annual growth rate of 13.2%.2 Research in strategic management is increasingly oriented towards understanding the impacts of digitization on firm strategy and firm performance (e.g., Greenstein et al. 2013, Adner et al. 2019, Bennett 2020). Ironically, however, rising big-data enthusiasm is not grounded in big-sample evidence. Critical gaps persist in our understanding of the prevalence, business impacts, and limitations of these tools and techniques among heterogeneous firms. We worked to address these gaps by collaborating with the U.S. Census Bureau to gather the first large-scale, representative data on the adoption of predictive analytics and potential tangible and intangible complements such as IT investment, worker education, production strategy, and structured management practices. We link this new survey to a detailed panel of administrative data to estimate productivity at the establishment level. Leveraging these novel measures, instrumental variables estimates based on plant-level variance in government-mandated data collection, and key details of timing for distinct waves of adopters, we provide new insights into the prevalence and variance in predictive analytics use, the causal relationships at work, and the
2 For more details, please see IDC website: https://www.idc.com/getdoc.jsp?containerId=IDC_P33195.
workplace complements that both constrain and augment the benefits achievable with these rapidly-emerging tools and techniques. In our representative sample of over 30,000 U.S. manufacturing establishments, we find that adoption of predictive analytics is both widespread and significantly associated with at least two to four percent higher multi-factor productivity. This shift in how firms extract value from data translates into an average $830,000 difference in sales between adopters and non-adopters, accounting for production inputs and a wide range of other factors. Critically, these gains are only present among establishments that have also pursued high levels of IT capital, employ educated workers at high rates, have committed to a high-efficiency production strategy, or rely on certain management practices centered on tracking key performance indicators. Predictive analytics, unlike other methods for making sense of data inputs,3 is a set of techniques ranging from data mining to statistical modeling (including machine learning and “artificial intelligence”) that analyzes historical and current data in order to make predictions about future events.4 Long-standing theories in economics and management have modeled how better information can support and improve human decision-making, creating value at both the individual and organizational levels (Raiffa 1968; Blackwell 1953; Gorry and Scott Morton 1971; March 1994). By extracting core information from large and complex data, predictive analytics shrinks the dimensions of relevant information, reduces the cognitive costs of decision-making, and speeds the execution of higher-quality decisions, thereby boosting returns from digital information. Significant research attention has been paid in recent years to “data-driven decision making” (DDD) practices among managers and the rise of predictive analytics as a related but distinct phenomenon.
The adoption of DDD and data analytics has been linked to higher Tobin’s q and profits (Brynjolfsson et al. 2011; Saunders and Tambe 2015), higher productivity (Müller et al. 2018; Brynjolfsson and McElheran 2019), and innovation (Wu et al. 2019). However, evidence on the causal link between predictive analytics and organizational performance from large-scale, representative studies remains lacking. Building on prior literature, we address this gap by exploring the
3 For instance, a recent study by Berman and Israeli (2020) examines the impacts of descriptive analytics. All of our results separately control for management practices that are “data-driven” according to other questions on the survey (Brynjolfsson and McElheran 2019), so we are specifically isolating this particular set of techniques. 4 Please see Wikipedia and citations within at: https://en.wikipedia.org/wiki/Predictive_analytics.
performance implications of predictive analytics in a sample covering more than 50% of the U.S. manufacturing economy. We address open questions about causality by employing both instrumental variables estimation and timing tests to rule out reverse causality. Our findings provide evidence that predictive analytics is causally related to higher productivity, on average. However, important boundary conditions apply related to firm heterogeneity and strategic commitments that have received little or no attention to date. In particular, our study reveals how these non-trivial returns on the use of predictive analytics depend almost entirely on the presence of other tangible and intangible firm investments. Anecdotal evidence is mounting that many firms struggle to realize benefits from their investments in data analytics (Schrage 2014). Lack of complementary skills and unaligned corporate strategy are reported to be key challenges (Ransbotham et al. 2015). Earlier academic work on IT productivity emphasizes significant heterogeneity across industries and firms in the benefits of generic IT investment (e.g., Stiroh 2002, Brynjolfsson and Hitt 1995). A robust theoretical literature argues that this heterogeneity may arise from investments in complementary assets and managerial practices (Kandel and Lazear 1992; Milgrom and Roberts 1990, 1995; Holmstrom and Milgrom 1994; Brynjolfsson and Milgrom 2013; Brynjolfsson et al. 2018). A number of empirical studies have confirmed this intuition with respect to general-purpose IT and computer use (Black and Lynch 2001; Caroli and Van Reenen 2001; Bresnahan et al. 2002; Aral and Weill 2007; Bloom et al. 2012; Aral et al. 2012) as well as for data-centered management practices (Tambe 2014; Brynjolfsson and McElheran 2019; Dubey et al. 2019).
Yet tools and techniques for leveraging digital information are evolving rapidly, detailed data on their use in practice are difficult to obtain for large samples of firms, and many open questions remain. In particular, large-scale evidence on what specific complements should be considered – much less how much they matter – is almost entirely lacking. Also missing is a rigorous empirical approach for disentangling causality and potential confounds across a large and economically important sample of firms.5 Our study makes progress on these dimensions and sheds light on heretofore understudied organizational complements such as elements of the firm's strategic positioning and day-to-day management practices.
5 A few studies in the management information systems literature provide conceptual frameworks for understanding how predictive analytics can create synergies in organizations and improve performance (Côrte-Real et al. 2014; Grover et al. 2018).
This study builds on the frameworks for organizational complementarity developed in Athey and Stern (1998) and Brynjolfsson and Milgrom (2013) and contributes to a small but growing empirical literature on complementarities among technology, workplace characteristics, and managerial practices (e.g., Black and Lynch 2001; Aral et al. 2012; Tambe et al. 2012; Bloom et al. 2012; Brynjolfsson and McElheran 2019). Insights from these and related works guided our development of new survey questions specifically designed with this research question in mind. Our findings contribute to a growing strategic management literature emphasizing not only the average impacts of digitization and digital strategies, but also the heterogeneity in those effects related to varying organizational and market-based contingencies and constraints. This literature dates at least as far back as Tushman and Anderson’s (1986) exploration of how technology interacts with a given firm’s competence to more-recent explorations of internal and external influences on digital strategy choices and their implications (e.g., Mithas et al. 2013, McElheran and Jin 2020). This effort is part of a small but non-trivial set of collaborations between academics and administrative agencies to combine the coverage and consistency of administrative data collection with insights and questions from the frontier of academic research (see Buffington et al. 2017 and Zolas et al. 2020). Our partnership with Census yielded detailed establishment-level information on management and organizational practices, worker education, the availability and use of data, extensive and intensive margins of predictive analytics use, specific use cases, and related questions on the organizational design of data collection – including government influence on this process.
Linking this new survey with longstanding administrative data on production inputs and outputs, we achieve unusually detailed insights for such a large sample (over 30,000 establishments). Thus, a key contribution of this study is to provide the first-ever descriptive statistics on the prevalence and contexts in which these practices have diffused in the U.S. Through systematic analysis, our study further reveals how core dimensions of process design and workplace practices may determine the returns to this increasingly widespread set of tools. In addition to contributing to the frontier of academic research in this area, our findings provide evidence-based guidance for managers looking to achieve better returns on investment in data and predictive analytics and help them stay competitive in an increasingly digital age.
2. Conceptual Motivation
The Performance Effect of Predictive Analytics
Given the high ratio of interest-to-evidence surrounding predictive analytics, it is more important than ever to understand what impact predictive analytics has on business performance and how firms can achieve a higher return on their investments in “big data” (Grover et al. 2018). Existing theory in economics and management predicts performance gains from increasingly sophisticated tools for extracting useful insights from large sets of structured and unstructured data. All else equal, better information is argued to support and improve human decision-making, creating value at both the individual and organizational levels (Raiffa 1968; Blackwell 1953; Gorry and Scott Morton 1971; March 1994). By extracting essential insights from large and complex data, predictive analytics shrinks the dimensions of relevant information, reduces the cognitive costs of decision-making, and speeds execution of higher-quality decisions, thereby boosting returns from digital information. Following prior literature focusing on the business value of data and analytics (Brynjolfsson et al. 2011; Tambe 2014; Saunders and Tambe 2015; Müller et al. 2018; Brynjolfsson and McElheran 2019), we argue that the adoption of predictive analytics is likely to have a positive impact on business performance and can lead to a significant gain in efficiency and productivity, all else equal.
Predictive Analytics and Potential Complements
In practice, important factors are typically not equal across firms. A prevalent theme from the frontier of economics and strategic management research is the widespread and increasing heterogeneity in firm performance and workplace conditions.6 Prior theory and empirical studies highlight how complementarities among IT investments and organizational characteristics can lead to differences among firms that may persist and even grow over time (Milgrom and Roberts 1990, 1995; Black and Lynch 2001; Caroli and Van Reenen 2001; Bresnahan et al. 2002; Aral and Weill 2007; Bloom et al. 2012; Aral et al. 2012; Brynjolfsson and Milgrom 2013). While laggard firms
6 While high cross-sectional heterogeneity in firm performance is long-established (e.g., Syverson 2004, 2011; Hopenhayn 2014), recent studies point to increasing firm heterogeneity along a number of economically important dimensions (Andrews et al. 2015; Van Reenen 2018; Song et al. 2019; Decker et al. 2020; Autor et al. 2020; Bennett 2020a). This phenomenon is not restricted to the United States (e.g., Berlingieri et al. 2017).
may catch up along certain dimensions (Brynjolfsson and McElheran 2019), trends towards increasing dominance of “superstar firms” (Autor et al. 2020), as well as rising concentration and inequality in workplace conditions and employee earnings, have been linked to technology investment at the industry and firm level (Bessen 2017; Bennett 2020b; Lashkari et al. 2020; Barth et al. 2020). In addition, attention is increasingly focused on difficult-to-measure intangible features of firms and markets that may amplify these dynamics (Saunders and Brynjolfsson 2016; Haskel and Westlake 2018). Building on these insights and techniques, we explore potential tangible and intangible complements to predictive analytics that might be expected to drive economically important heterogeneity in returns to these tools and techniques.
IT Capital Stock
The successful implementation of predictive analytics is widely expected to depend on the firm’s IT infrastructure. The collection, storage, and communication of input data for predictive modeling all require robust tangible investments in sensors, transmission equipment (e.g., routers), data storage hardware, and computing resources for analyzing data. For instance, Bosch’s manufacturing factories use predictive analytics tools for the maintenance of equipment such as robots, heat exchangers, and spindles using data generated from sensors.7 General Electric, another manufacturing giant, augments its jet engines with sensors to monitor temperature, fuel consumption, and many other daily operation tasks, generating 500 gigabytes of data per engine for each flight.8 Unsurprisingly, it is also investing heavily in predictive analytics to utilize such volumes of data (Winnig 2016). Furthermore, building, training, and implementing these analytic tools require corresponding data processing hardware and software. IT-savvy firms that are more prepared for the industrial Internet of Things (IoT) – a particularly important dimension of the “big data” trend for our manufacturing context – may have fully-depreciated investments in sensors and other IT infrastructure to collect and analyze data, giving them a tangible advantage when it comes to analytics.
Intangible complements may also be associated with these easier-to-measure IT capital
7 See reports at https://blog.bosch-si.com/industry40/industry-4-0-predictive-maintenance-use-cases-in-detail/ and https://amfg.ai/2019/03/28/industry-4-0-7-real-world-examples-of-digital-manufacturing-in-action/ for more details. 8 https://venturebeat.com/2015/06/18/here-comes-the-industrial-internet-and-enormous-amounts-of-data/
investments over time. Firms with long-standing experience in developing and deploying IT resources and related practices are expected to enjoy disproportionate returns due to accumulated IT capabilities, appropriate organizational design, human capital accumulation, and other dimensions of “learning by doing” (e.g., Bharadwaj 2000; Saunders and Brynjolfsson 2016). In addition, firms with greater accumulation of IT capital may enjoy a greater capacity to adopt and effectively deploy new techniques and related business processes (e.g., Han et al. 2011). We hypothesize that firms with higher measured IT capital stock will have lower costs and greater opportunities to leverage predictive analytics to boost productivity in their workplaces.
Educated Workers
While IT capital stocks may proxy for a range of critical complements to predictive analytics, it is useful to disentangle this cluster of interrelated activities as finely as possible. In particular, it is widely accepted that more-skilled and better-educated workers are key drivers of growth in both manufacturing productivity (Black and Lynch 2001; Moretti 2004; Waldman 2016) and returns to IT (Brynjolfsson and Hitt 2000; Bresnahan et al. 2002). With increasingly digitalized organizations and the growing prevalence of business applications for data and predictive analytics, demand for workers with complementary analytic skills is also growing. Firms increasingly demand workers who know how to deploy “smart” technologies in production settings (Helper et al. 2019), as well as those who can help translate digital information into business insights. While competition for these workers may drive up wages to a point where they do not manifest excess returns in the production function, we expect that their presence will boost the estimated returns from adopting predictive analytics due to these complementarities.
Production Strategy

Production strategy has received limited attention – even in the strategic management literature – and has not, to our knowledge, been linked to IT or digitization. Yet the relevance of this concern is conceptually very straightforward and arguably first-order in our manufacturing setting. At its most fundamental level, predictive analytics uses historical and current data to predict future outcomes. Thus, the effectiveness of this forward-looking approach depends on: 1) the richness of past data inputs to characterize meaningful states of the world, and 2) sufficient stationarity of the environment so that past performance may be a useful indicator of future
conditions. Not all production processes will score high on these criteria. In general, workplaces with greater automation and reduced variance will provide richer data inputs and better alignment with the value proposition of predictive analytics compared to other, more dynamic, production environments. Yet dynamism versus stationarity of a production environment is rooted in a host of economic and strategic considerations that are very difficult to adjust, potentially limiting or augmenting the value of predictive analytics in particular settings in persistent ways. For organizational, strategic, cognitive, and technological reasons, the orientation of a business unit’s production process towards efficiency versus flexibility tends towards one extreme or the other and is fundamentally difficult to reverse (e.g., Ghemawat and Ricart Costa 1993). It is correlated with a range of specific management practices (McElheran et al. 2020), and typically aligned with key dimensions of the firm’s market context and positioning (Hayes and Wheelwright 1979, Safizadeh et al. 1996). Thus, investment in more-automated and more-efficient production processes versus more innovative and dynamic activities is indicative of an organizational environment that is subject to lower variance. This holds for features of both the external market context (e.g., market demand and supply chain processes), as well as for internal operations (equipment utilization, product quality, cycle times, etc.). Following the influential work of Hayes and Wheelwright (1979), we conceptualize a high-efficiency production process in the manufacturing sector as one where a plant operates primarily on a continuous-flow production basis.9 Compared to other processes (e.g., job shops for custom work or prototyping for product development), continuous-flow processes tend to be more capital-intensive,10 continuously monitored, and managed so as to minimize variance.
This promotes stationarity and increases the value of predictions based on past information. The richness of the information available to be analyzed also tends to be higher. Higher-volume, lower-mix production tends to involve more tightly-coupled systems where downtime and low
9 The comparison is between establishments with a continuous-flow production process and job shops or establishments with a batch process. See Hayes and Wheelwright (1979) for a more complete discussion of the process design features captured by the measures we use here. 10 Based on the industry-level productivity database from the National Bureau of Economic Research (NBER), these industries have the highest average capital stock within the manufacturing sector. Database and statistics are available upon request.
capital utilization are more costly or even dangerous, promoting systematically greater use of instrumentation and sensors. The data generated from these sensors are increasingly digitalized, transmitted, and analyzed at a high frequency, providing rich inputs to predictive analytics practices when they are adopted. Thus, we anticipate lower costs and higher returns to predictive analytics in workplaces with a high-efficiency production strategy.
KPI monitoring
The old adage that “what gets measured gets managed” holds as well as ever in the digital age. Manufacturing firms have long been developing and monitoring key performance indicators (KPIs) in a variety of operational practices such as Lean manufacturing and Total Quality Management. Firms track capacity utilization rates, manufacturing cycle times, and production downtime, among many others. These KPIs help firms quantify and monitor the performance of operations and thus are highly correlated with the breadth and intensity of data collection and analysis at the establishment (Brynjolfsson and McElheran 2019). Schrage and Kiron (2018) stress the importance of KPI use in the digital era with the rise of big data and computational and algorithmic innovation. Firms with well-established KPI practices will tend to have richer inputs to feed their predictive models. In other words, “what gets measured gets into predictive modeling.” Given advances in reinforcement learning and neural networks, predictive analytics, with its increasing capability and precision, has the potential to help firms make better decisions in rather complex business processes (Khurana et al. 2018; Khan et al. 2018).
Yet more-sophisticated predictions will only provide performance gains to the extent that they are acted on in practice. Again, we anticipate that prior use of managerial routines around KPI monitoring will support greater organizational capacity for leveraging outputs from predictive models, leading to important productive interdependencies between this class of managerial practices and new predictive analytics techniques.
3. Data and Key Measures
The lack of large-scale data on business adoption of predictive analytics has been a significant barrier to empirical research in this area. We made progress by collaborating with the U.S. Census Bureau to design and field a purpose-built survey that was sent to a large and representative sample of establishments in the U.S. manufacturing sector. Leveraging the extensive reach,
expertise, and administrative infrastructure of the Census Bureau, we developed, tested, and deployed a set of questions that were added to the 2015 Management and Organizational Practice Survey (MOPS).11 Our sample includes roughly 30,000 establishments that provided data on predictive analytics adoption and frequency of use in 2015, along with recall data for 2010. Responses to the survey are required by law, yielding a response rate of 70.9%. 12 Therefore, our study is significantly less affected by the respondent bias and lack of representativeness that typically plague other custom-designed surveys. 13
Predictive Analytics Use

Adding questions to Census surveys requires that they be applicable across a wide range of industry and firm settings and that they pass rigorous cognitive testing designed and implemented by dedicated Census employees and reviewed by stakeholders across a range of positions and backgrounds. After multiple rounds of development and cognitive testing, the focal question on the MOPS survey asks, “How frequently does this establishment typically rely on predictive analytics (statistical models that provide forecasts in areas such as demand, production, or human resources)?” Respondents may mark all that apply among Never, Yearly, Monthly, Weekly, and Daily, with separate columns for 2010 (recall) and 2015. We construct a measure of the extensive margin of predictive analytics use by identifying establishments that adopt with any frequency and comparing them to those that report “Never.” To capture the intensive margin, we first assign numeric values from 0 to 4 to each frequency category, with higher values representing higher frequencies of predictive analytics use. In cases where plants report multiple frequencies, we default to the highest one. We also explore using a normalized score based on taking the average of multiple responses for a given establishment (see Bloom et al. 2019).14 To explore different use cases for predictive analytics, we further combine information from
11 This survey was conducted in both 2010 and 2015 as a supplement to the Annual Survey of Manufactures (ASM). See Bloom et al. (2019) and Buffington et al. (2017) for more details on the MOPS survey. 12 For more details, please see the website for the MOPS: https://www.census.gov/programs-surveys/mops/technical-documentation/methodology.html#:~:text=The%202015%20MOPS%20had%20a,(URR)%20of%2070.9%25. 13 See Zolas et al. (2020) for a more-comprehensive discussion of the issues and related survey work. 14 The normalization is helpful for easy interpretation of coefficients in the regression analysis, and taking an average of the scores helps to account for the multiple responses given by a given establishment. Nevertheless, we find the results from both measures are consistent.
another MOPS question regarding where data analytics was applied at the plant. This question focuses on three main business functions: supply chain management (SCM), demand forecasting (DF), and product design (PD). We construct the corresponding use cases for predictive analytics conditional on the plant reporting having adopted predictive analytics and using data analytics in the corresponding function.15 Figures 1 and 2 present the adoption of predictive analytics by state and industry (3-digit NAICS). Both figures reveal a surprising fact: predictive analytics was widely diffused among manufacturing plants across almost all states and industries as early as 2010, with average adoption rates well over 70%. Further exploration using a balanced sample of roughly 18,000 establishments (those with data for both years)16 indicates a rather small change in the adoption of predictive analytics between 2010 and 2015. That is, the average rate of growth in the use of predictive analytics observed in our sample is under 1.5% per year.17 This slow rate of change is due in no small part to the high baseline adoption in 2010. If we think about predictive analytics as a “management technology” following Bloom et al. (2016), its diffusion likely reached a relatively high level of maturity – the “late majority” stage of the technology adoption S-curve (see Rogers 2010) – and thus its growth has slowed among the remaining population of firms. Remaining non-adopters will tend to be firms with low net benefits of adoption, due either to high costs or low anticipated returns. In addition to being informative about the timing and penetration of predictive analytics adoption, this empirical fact has important implications for our research approach. In particular, standard models for addressing time-invariant organizational heterogeneity through the use of firm or establishment fixed effects are of limited use in our empirical setting.
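As a concrete illustration of how the extensive- and intensive-margin measures described above can be constructed, consider the following sketch. The plant identifiers and response lists are hypothetical stand-ins, not actual MOPS microdata, and the function names are our own:

```python
# Hypothetical "mark all that apply" frequency responses for four plants;
# these are illustrative values, not actual MOPS survey records.
responses = {
    1: ["Never"],
    2: ["Yearly"],
    3: ["Monthly", "Daily"],
    4: ["Weekly"],
}

# Numeric intensity score assigned to each frequency category (0-4),
# with higher values representing more frequent use.
SCORE = {"Never": 0, "Yearly": 1, "Monthly": 2, "Weekly": 3, "Daily": 4}

def extensive_margin(resp):
    """1 if the plant reports any use at all (anything other than 'Never')."""
    return int(any(r != "Never" for r in resp))

def intensive_margin_max(resp):
    """Default rule from the text: take the highest reported frequency."""
    return max(SCORE[r] for r in resp)

def intensive_margin_norm(resp):
    """Alternative: average across multiple responses, then normalize to
    [0, 1], in the spirit of the normalized scores in Bloom et al. (2019)."""
    return sum(SCORE[r] for r in resp) / len(resp) / 4

for plant_id, resp in responses.items():
    print(plant_id, extensive_margin(resp),
          intensive_margin_max(resp), intensive_margin_norm(resp))
```

The "highest frequency" default mirrors the construction described in the text, while the normalized average implements the alternative scoring; both rank plants identically when each plant gives a single response.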
Recall that panel data methods only identify coefficients off of establishments that transition from not using predictive analytics to adopting it in our sample. Thus, they would be applicable to a very small and highly-selected group of plants whose defining feature is being in this “laggard” group of adopters (those that adopted between 2010 and 2015). Frontier users of these practices would be differenced out
15 Both questions are presented in Appendix Table A1. 16 The smaller number of observations in 2010, as well as in the balanced panel, is due to the sample rotation of the ASM. The ASM sample frame changes in years ending in “4” and “9”, which means that the establishments in our 2010 and balanced samples have to be in both the 2009-2013 and 2014-2018 sample frames. 17 The adoption rate of predictive analytics increased from 73% in 2010 to 80% in 2015, implying a 1.4% growth rate per year.
of any panel data estimation, likely biasing the average productivity attributed to these practices. Moreover, the 5-year gap in our two-year panel generates considerable measurement imprecision concerning the exact year of adoption. Therefore, we focus our empirical approach primarily on pooled OLS models for our two discrete years of data and include both industry-year fixed effects and a rich set of controls (many of which capture fixed or quasi-fixed organizational characteristics). For completeness, we report the less-informative results from plant fixed-effects models in the online appendix.

[Insert Figures 1 and 2 here]

Performance Data and Workplace Complements

Using plant-level identifiers maintained by the Census Bureau, we merge the MOPS with the Annual Survey of Manufactures (ASM), the Census of Manufactures (CMF), and the Longitudinal Business Database (LBD) to bring in information on detailed inputs (including accumulated and depreciated capital stocks, as well as input costs for labor, materials, and energy), outputs (total value of shipments and value-added), age, and whether the establishment belongs to a multi-establishment firm. We restrict the sample to observations with complete information on sales, labor, materials, energy, and the total number of employees for technical and disclosure-avoidance reasons. To explore potential organizational complements, we leverage several other measures in the ASM and the MOPS. To examine the relationship to tangible IT investments, we calculate IT capital stocks using capital expenditure on computer and peripheral data processing equipment from the ASM and CMF panel dating back to 2002, using a standard perpetual inventory approach and an industry-level deflator for hardware from the Bureau of Economic Analysis (BEA). We impute values for years in which values are missing18 and depreciate at the rate of 35% per year following Bloom et al.
(2014), Brynjolfsson and McElheran (2016), and Jin and McElheran (2019). 19 We
18 We impute the missing values using the average of IT investment from the closest preceding and following years with non-missing values, for establishment-years where capital expenditure on computer and data processing equipment is missing. For instance, if IT investment in 2008 is missing, we impute it using the average of the establishment's IT investment in 2007 and 2009, or the 2007 and 2010 values if 2009 is also missing. Similar logic applies to missing values in other years. A similar method was used in Bloom et al. (2014); see the appendix for more detail. 19 Non-IT capital is calculated using a similar approach, but with total non-IT capital expenditure (total capital expenditure less IT capital investment) instead. The industry-level deflator for the non-IT capital stock is obtained from the ASM Total Factor Productivity databases.
test the complementarity between IT and predictive analytics using the IT capital stock variable. The advantage of this measure is that it accounts for the overall stock: if it takes time for firms to adjust to and utilize novel IT investment, the stock measure captures this lagged effect. A reasonable concern is that we are unable to capture the effect of capitalized software (e.g., ERP investment), which might also play a significant role in facilitating the implementation of predictive analytics or otherwise boost productivity (Barth et al. 2020). To address this concern, we conduct several robustness tests in both the baseline performance analysis and the complementarity tests. First, we control separately for software and IT services expenditures from the ASM, in addition to IT capital stock. Alternatively, we use a measure summing all IT investments (hardware, software, and services) in place of the IT capital stock variable. In both cases, our results remain robust and consistent.

Next, we use information from the background characteristics in the MOPS regarding the percentages of managers and non-managers with a bachelor's degree. Combined with the total number of employees (from the ASM) and the number of managers (from the MOPS), we calculate the weighted average percentage of employees (both managers and non-managers) with a bachelor's degree, following Bloom et al. (2019). This measure is similar to those in the prior literature that use education as a proxy for human capital (Card 1999; Bresnahan et al. 2002; Arcidiacono et al. 2010). Another measure we exploit in the MOPS is an indicator of how the production process is organized.
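To make these variable constructions concrete, the following is a minimal sketch, not the authors' code: the gap-imputation rule from footnote 18, the perpetual-inventory IT capital stock with 35% depreciation, and the employment-weighted bachelor's-degree share following Bloom et al. (2019). Function names, the zero initial capital stock, and the unit deflator in the example are our own assumptions.

```python
def impute_gaps(series):
    """Fill missing (None) investment values with the average of the
    nearest non-missing values before and after (footnote 18's rule)."""
    out = list(series)
    for i, v in enumerate(series):
        if v is None:
            before = next((series[j] for j in range(i - 1, -1, -1)
                           if series[j] is not None), None)
            after = next((series[j] for j in range(i + 1, len(series))
                          if series[j] is not None), None)
            known = [x for x in (before, after) if x is not None]
            out[i] = sum(known) / len(known) if known else None
    return out

def perpetual_inventory(investments, deflators, delta=0.35):
    """K_t = (1 - delta) * K_{t-1} + I_t / P_t, assuming (our choice)
    a zero stock before the first observed year."""
    k, stocks = 0.0, []
    for inv, p in zip(investments, deflators):
        k = (1 - delta) * k + inv / p
        stocks.append(k)
    return stocks

def employee_degree_share(total_emp, n_managers, pct_mgr_ba, pct_nonmgr_ba):
    """Employment-weighted share of workers with a bachelor's degree,
    combining the MOPS manager and non-manager percentages."""
    n_nonmgr = total_emp - n_managers
    return (n_managers * pct_mgr_ba + n_nonmgr * pct_nonmgr_ba) / total_emp
```

For example, `impute_gaps([10.0, None, 14.0])` returns `[10.0, 12.0, 14.0]`, and a plant with 10 managers (80% with degrees) among 100 employees (non-managers at 10%) gets a weighted share of 0.17.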
We focus on establishments with high-flow production processes (including both cellular and continuous-flow manufacturing).20 These processes are characterized by a low product mix and high volume per product, and they also tend to be more capital-intensive, with higher levels of automation (Safizadeh et al. 1996). This contrasts with job shops, batch manufacturing facilities, and R&D-focused establishments, all of which are more likely to have a "jumbled flow" process design supporting flexible, high-mix but generally low-volume operations. The latter type is typically central to product design and prototyping (Hayes and Wheelwright 1979) and appears less amenable to highly structured approaches to management (McElheran et al. 2020).21
20 See Kiran (2019) for a detailed description of cellular manufacturing. 21 We grouped establishments characterized by batch production together with job shops. These establishments lie in between "job shops" and continuous-flow production in terms of their capital intensity, level of machine
We test empirically whether establishments with high-flow process designs are better positioned to capture larger benefits from the adoption of predictive analytics. Note that the production process is usually quasi-fixed, since the cost of transitioning is high. This is largely consistent with the empirical evidence in our data, where we observe almost no change in this variable: the percentage of plants transitioning into a continuous-flow production process is less than 0.7% per year between 2010 and 2015. Arguably, this stability is a useful feature for testing complementarity, since it allows us to estimate the effect of predictive analytics on performance given pre-existing cross-establishment differences (Aral et al. 2012).

Lastly, we construct an indicator for establishments that report monitoring 10 or more key performance indicators (KPIs). In the MOPS survey, establishments are categorized by the number of KPIs monitored: 0, 1-2, 3-9, or 10 or more. We take the top category as a proxy for high-intensity KPI monitoring to test our hypothesis that the practice of KPI monitoring is a complement to predictive analytics.

Table 1 presents the summary statistics for our baseline sample.22 Overall, we have a pooled sample of 51,000 observations from 2010 and 2015 with complete information for our empirical analysis (about 20,000 for 2010 and 31,000 for 2015). Table 2 reports the summary statistics for all key variables in this baseline sample. The average adoption rate for predictive analytics is well above 70%, with the majority of establishments reporting annual and/or monthly use (i.e., a frequency value of roughly 1.12). The average log total value of shipments and log total employment for establishments in our sample are around 10.37 and 4.56, respectively, which translate into about $32,000,000 in annual shipments and an average size of 96 employees.23 The mean age of these establishments is more than 24 years.
These establishments have, on average, about $175,000 of IT capital stock, and slightly over 15% of their workers hold a bachelor's degree. Finally, about 35% of the establishments reported having a continuous-flow production process, and
automation, as well as product standardization. Including them in the control group is likely to add measurement error, biasing our estimates downward; we would thus be estimating a lower bound of the effect rather than suffering omitted-variable bias (as we would if we did not control for them in the base group). We also control for R&D facilities separately. 22 Table A1 presents the key question from the MOPS regarding predictive analytics. See also the questionnaire for the 2015 MOPS on the U.S. Census Bureau website: https://www2.census.gov/programs-surveys/mops/technical-documentation/questionnaires/ma-10002_15_final_3-2-16.pdf 23 To cover a larger percentage of the manufacturing economy and all industries, the ASM selects a set of establishments into the certainty sample frame. For instance, establishments with employment greater than or equal to 1,000 in the 2012 Census, or establishments classified within an industry with less than
around 44% of them reported tracking 10 or more KPIs. With this discussion in mind, we now turn to our empirical methods.
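As a quick check, the level figures quoted above follow from exponentiating the reported log means, assuming the value of shipments is recorded in thousands of dollars (consistent with the $32 million figure; the variable names below are ours):

```python
import math

mean_log_shipments = 10.37   # log total value of shipments ($ thousands)
mean_log_employment = 4.56   # log total employment

shipments_usd = math.exp(mean_log_shipments) * 1_000  # ~ $31.9 million
employees = math.exp(mean_log_employment)             # ~ 95.6 employees
```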
4. Empirical Methods
Adoption and Correlation Tests

After providing descriptive statistics on adoption, our empirical analysis explores correlations between predictive analytics use and potential workplace complements, including IT capital stock, an indicator for plants with a more continuous-flow production process, the percentage of educated workers, and top-category KPI monitoring. If complementarities exist, we should observe higher adoption of predictive analytics among establishments that also report these characteristics (Brynjolfsson and Milgrom 2013). We estimate both linear probability and probit models, and explore a wide range of establishment and firm characteristics to address potential omitted-variable bias. We also control for geographic differences and industry-year fixed effects to account for any transitory industry-specific shocks.

Performance Analysis

To estimate the performance impact of predictive analytics, we take a conventional approach to modeling the establishment's production function (Brynjolfsson and Hitt 2000; Brynjolfsson and Hitt 2003; Bloom et al. 2012; Tambe and Hitt 2012). As our baseline specification, we consider a log-transformed Cobb-Douglas production function in equation (1):