Cascading Power Outages Propagate Locally in an Influence Graph That Is

1 Cascading Power Outages Propagate Locally in an Influence Graph that is not the Actual Grid Topology Paul D. H. Hines, Senior Member, IEEE, Ian Dobson, Fellow, IEEE, Pooya Rezaei, Student Member, IEEE Abstract—In a cascading power transmission outage, com- as disease propagation [4], [5]. Similar topological contagion ponent outages propagate non-locally; after one component models have been suggested as a tool for power systems outages, the next failure may be very distant, both topologically analysis (e.g., [6], [7]). and geographically. As a result, simple models of topological contagion do not accurately represent the propagation of cascades This approach has two problems. First, in power grids it is in power systems. However, cascading power outages do follow more appropriate to focus first on line (edge) outages, since patterns, some of which are useful in understanding and reducing line outages are an order of magnitude more likely than node blackout risk. This paper describes a method by which the data (substation) outages. More importantly, as is well known to from many cascading failure simulations can be transformed power systems engineers, cascading outages in power systems into a graph-based model of influences that provides actionable information about the many ways that cascades propagate in propagate non-locally as well as locally. In a power grid, the a particular system. The resulting “influence graph” model is next component to fail after a particular line outages may be Markovian, in that component outage probabilities depend only very distant, both geographically and topologically [8]. This on the outages that occurred in the prior generation. To validate can be seen very clearly from the sequence of events in the the model we compare the distribution of cascade sizes resulting Western US blackout of 1996, shown in Fig. 1, in which the from n − 2 contingencies in a 2896 branch test case to cascade sizes in the influence graph. The two distributions are remarkably failure sequence jumps across long distances at several points similar. In addition, we derive an equation with which one can in the cascade. Cascades in power networks are more similar quickly identify modifications to the proposed system that will to the random fuse model [9] from statistical physics, in which substantially reduce cascade propagation. With this equation one non-local propagation does occur. can quickly identify critical components that can be improved to substantially reduce the risk of large cascading blackouts. Index Terms—Cascading failure, big data, complex networks I. INTRODUCTION OWER systems are generally robust to small disturbances, P but unexpected combinations of failures sometimes ini- tiate long chains of cascading outages, which can result in massive and costly blackouts. Because of the low-probability high-impact nature of cascading failures, there is limited empirical data from which to understand the many ways that cascades propagate through a power system. Simulations of cascading mechanisms can produce data, but there has been insufficient progress in extracting insights and useful statistical information from these data. One approach to obtaining useful information is to try Fig. 1. An illustration of the event sequence for the Western US blackout on arXiv:1508.01775v2 [physics.soc-ph] 6 Jun 2016 and leverage successes from network science, which has an July 2, 1996 (from [10]). The sequence jumps across hundreds of kilometers extensive literature on cascading failure and (more generally) at several points, such as from 3 - 4 and from 7 - 8 . contagion. Models of topological contagion in which failures spread locally from a node to its immediate neighbors [1], On the other hand, simulation models that capture as- [2], [3] have provided insight into a variety of problems, such pects of detailed power grid physics and engineering are useful for understanding how cascades propagate. Impor- P.H. was funded in part by US NSF Award Nos. ECCS-1254549 and DGE- tantly these models do show the non-local propagation that 1144388, and US DTRA Award No. HDTRA110-1-0088. I.D. was funded in is apparent in historical cascades. Examples of physics- part by NSF grant CPS-1135825. P. Hines and P. Rezaei are with the School of Engineering and the Complex based models of cascading include quasi-steady state models, Systems Center, University of Vermont, Burlington, VT 05405, USA; e-mail: such as DCSIMSEP [11], [12], OPA [13], [14], Manchester [email protected], [email protected]. model [15], and TRELSS [16], and dynamic models, such as I. Dobson is with the ECpE department, Iowa State University, Ames IA 50011 USA; email: [email protected]. COSMIC [17] and the hybrid model in [18]. While engineering models are useful, the data that result from this type of 2 simulation are complicated and difficult to summarize, even or more component outages. Each initiating event will perturb for a single cascading failure simulation. In order to obtain the state of the network, which may result in excessive stress useful statistical information, thousands, or even millions of on other components. Because of the laws of power flow, simulations are often required, resulting in enormous sets of these stressed components may be topologically distant from complicated output data. To summarize and understand big the initiating events. Excessive stress may cause one or more data from detailed simulations, there is a need for higher-level dependent outages, which may subsequently cause additional models of cascading in large-scale systems that are simple stress and additional outages. Together this sequence is a enough to provide statistical insight, without abstracting away cascading failure. physical details in a way that could lead to erroneous con- Prior work [27] has shown that one can gain substantial clusions. Only a few such statistical models, which do not insight into the statistical properties of cascading failure by neglect the non-local nature of cascading, have been proposed grouping sequences of component outages into generations, in the literature. Reference [19] finds critical clusters of lines and looking at the growth (or propagation) rates among in simulated cascade data using a synchronization matrix, generations. Typically, the first generation represents the ex- which determines the critical clusters as sets of lines that ogenously caused initiating events. Subsequent generations frequently overload in the same cascade and in a cascade that can be thought of as “children” of prior “parent” outage leads to a large blackout. This approach does not consider the generations. In this branching process model, each generation order in which the lines overload during the cascade, but does of failures produces some number of dependent failures and indicate combinations of critical lines that are associated with the rate at which prior generation (parent) outages cause new blackouts. The general idea of building an influence graph (child) outages is known as the propagation rate, λ. If jZ0j describing ways that component failures might spread goes is the total number of outages (over many cascades) in the back to [20], [21]. set of all initiating events over many cascades and jZ1j is the This form of influence graph couples Markov processes number of outages in the first dependent generation, then the at each node, but it remains unknown how to construct propagation rate from generation zero into the first generation these influence graphs from data. Stochastic models in [22], is: λ0 = jZ1j=jZ0j. More generally: [23] use Markov-chains to represent key characteristics of jZm+1j simulated cascades. In prior work, the authors proposed a λ = (1) m jZ j “line-interaction graph” approach to modeling cascades from m data [24]. A matrix-based approach, which is in some ways However, the approach of [27] simply counts outages with- similar to ours, was used to describe the probability of a out discriminating which component outages will result from a cascade propagating from one network node to another in [25]. particular prior outage, or how the components relate to other Ref. [25] is primarily motivated by avalanches of neurons components. It is clear that component outages vary in their firing, but also mentions blackouts, and analyzes the the frequency and impact on other components. For example, the statistics of the number of generations in a cascade as well outage of a large transmission line carrying a large amount of as the number of failures. One difference is that [25] restores power is likely to result in more dependent outages than the nodes in the generation after they fail, which allows cascades failure of a smaller line. Thus, it is useful to consider different to propagate through previously failed nodes and prolongs the components as having different propagation rates. With this cascades. The ideas in [24] were extended in [26] to build an in mind, our model assumes that each component i produces interaction model of cascading that can reproduce statistical a random number, Ki;m, of child outages according to the properties of simulated cascades. following conditional probability function: This paper presents a new influence graph method, also f[kji; m] = Pr[ k outages in generation m + 1, given based on the concepts in [24], in which we take large amounts a single outage of i in generation m] (2) of data from cascade simulations and synthesize the data into a Markovian network model. Importantly, the resulting model In this paper we assume that f[kji; m] is a Poisson distribution, has a network structure through which cascades propagate with λi;m representing the mean number of outages propagated locally, but that network structure is dramatically different by the outage of i in generation(s) m. from that of the original power network. That is, the outages Finally, if component i outages and causes a dependent propagate along the influence graph, which is completely outage in the next generation, some components are more different than the graph topology of the grid.

Cascading Power Outages Propagate Locally in an Influence Graph That Is

Prevention of Cascading Outages Using Sparse Wide Area Synchrophasor Measurements

Controlling Cascading Failures with Cooperative Autonomous Agents

Final Blackout Report Chapters 7-10

Revealing Cascading Failure Vulnerability in Power Grids Using

Augmented Power Dispatch for Resilient Operation Through Controllable Series Compensation and N-1-1 Contingency Assessment

Special Issue on “Thermal Safety of Chemical Processes”

Comparing a Transmission Planning Study of Cascading with Historical Line Outage Data Ian Dobson Iowa State University, [email protected]

DOE Technical Support Document

Power Grid Cascading Failure Mitigation by Reinforcement Learning

Cascading Failures in Power Grids – Analysis and Algorithms

A Modeling and Simulation Framework for the Reliability/Availability

The Cost of Cascading Failure Risk and Resilience Within UK Infrastructure Networks