arXiv:1312.7712v1 [stat.ME] 30 Dec 2013 iyadcmlxt fteerhuk phenomenon. earthquake the con- of diver- complexity been of and because sity have remain though uncertainties discussions large even ducted, and However, analysis it. detailed with associated earthquake, anisms mech- major seismic remark- important every elucidated progressed have as After researchers has geophysics. increased in earthquakes steadily ably of has study data the relevant avail- of The of understanding significantly. ability our increased 1960s, has late earthquakes the since science rseto atqaePrediction Ogata Yosihiko Earthquake of Research Prospect A 03 o.2,N.4 521–541 4, DOI: No. 28, Vol. 2013, Science Statistical er-u oy 5-55e-mail: Komaba, 153-8505 4-6-1 Tokyo Professor, Megro-Ku, Visiting Science, Tokyo, Industrial of of University Institute and 190-8562 Tachikawa,Tokyo Midori-cho, 10-3 Organization, System Research and Information Mathematics, Statistical
oiioOaai rfso mrts nttt of Institute Emeritus, Professor is Ogata Yosihiko c ttsia Science article Statistical original the the of by reprint published electronic an is This ern iesfo h rgnli aiainand pagination in detail. original typographic the from differs reprint hog eakbedvlpet nsldEarth solid in developments remarkable Through nttt fMteaia Statistics Mathematical of Institute 10.1214/13-STS439 .INTRODUCTION 1. nttt fMteaia Statistics Mathematical of Institute , rhclsaetm TSmdl,poaiiyfrcss p forecasts, probability changes. stress models, gains, ETAS mo (ETAS) space–time sequence archical aftershock epidemic-type constraints, eerht elz rcia prtoa oeatn nt phrases: and in words forecasting Key operational practical realize conside earthquake to of is research prospect the risk describes manuscript of earthqua this modeling fore, the quantitative earthquak to unless for urgency information incomplete t assigning useful potentially precursors in of as are discovery well phenomena unc as abnormal of earthquakes, nece whether amount which large identifying a considered, in particular, be In compl must prediction. prediction, elements stochastic effective uncertain determinist For and apparent, difficult. tions the readily is of not prediction most are quake Because mechanisms crust. their Earth’s and the in stress accumulated Abstract. 03 o.2,N.4 521–541 4, No. 28, Vol. 2013, 2013 , atqae cu eas farp lp nfut u to due faults on slips abrupt of because occur Earthquakes [email protected] bomlpeoea simcsi,Bayesian slip, aseismic phenomena, Abnormal This . . in 1 o h td fErhuk Predictability Earthquake of as Study known the study for interna- cooperative An research predictability. tional organized earthquake an on develop program to seismologists for manner. this should in power models evolve forecasting predictive Earthquake whether improved. be is determine should models to seismicity over evaluated standard information in claim used useful those that potentially models incorporate prediction to New merit. lack presented may arguments otherwise, evaluation; such for a mn onre rn omjrearthquakes major to prone countries among way http://www.cseptesting.org/ (Jor- controversial effec- ( is al. the techniques et however, dan these basis types; the of various on tiveness of proposed anomalies been of have earthquakes ing effective for considered pro- be prediction. the earthquake to reflect earthquakes of faithfully and cesses must diverse scenarios all because complex determin- prediction in earthquake challenges unachievable istic to leads This eety hr a engoigmomentum growing been has there Recently, nteohrhn,svrltcnqe o predict- for techniques several hand, other the On 2011 ) hrfr,ojciiyi required is objectivity Therefore, )). ena future. near he rdcinis prediction e predictability ranylies ertainty e.There- red. es hier- dels, robability xcondi- ex cearth- ic efaults se e Any ke. ssitates large o scretyunder currently is ) Collaboratory (CSEP; 2 Y. OGATA for exploring possibilities in earthquake prediction has been referred to as (relative) entropy, which is (e.g., Jordan (2006)). An immediate objective of the essentially similar to the log-likelihood. project is to encourage the development of statisti- 2.2 Space–Time-Magnitude Forecasting of cal models of seismicity, such as those subsequently Earthquakes discussed in Section 2, and to evaluate their predic- tive performances in terms of probability. Baseline models should be set to compare with In addition, the CSEP study aims to develop a and evaluate all predictability models. Based on em- scientific infrastructure to evaluate statistical signif- pirical laws, we can predict standard reference prob- icance and probability gain (Aki (1981)) of various ability of earthquakes in a space–time-magnitude methods used to predict large earthquakes by using range on the basis of the time series of present and observed abnormalities such as seismicity anomaly, past earthquakes. The framework of CSEP, which transient crustal movements and electromagnetic has evaluated performances of submitted forecasts anomaly. Here probability gain is defined as the ra- of respective regions (Jordan (2006); Zechar, Ger- tio of probability of a large earthquake estimated stenberger and Rhoades, 2010; Nanjo et al. (2011)), based on an anomaly to the underlying probability is similar to that of the California Regional Earth- without anomaly. Section 3 describes this important quake Likelihood Models (RELM) project for spatial concept, and then discusses statistical point-process forecast (Field (2007); Schorlemmer et al. (2010)). models to examine the significance of causality of Different space–time models were submitted to the anomalies and also to evaluate the probability gains CSEP Japan Testing Center at the Earthquake Re- conditional on the anomalous events. search Institute, University of Tokyo, for the one- For prediction of large earthquakes with a higher day forecast applied to the testing region in Japan probability gain, comprehensive studies of anoma- (Nanjo et al. (2012)). This means that the model lous phenomena and observations of earthquake forecasts the probability of an earthquake at each mechanisms are essential. Several such studies are space–time-magnitude bin. However, the CSEP pro- summarized in Sections 4–6. Particularly, I have tocols have to be improved to those in terms of been interested in elucidating abnormal seismic ac- point-processes on a continuous time axis for the tivities and geodetic anomalies to apply them for evaluation including a real-time forecast (Ogata promoting forecasting abilities, as described in these et al. (2013)). sections. Almost all models incorporated the Gutenberg– 2. PROBABILITY FORECASTING OF Richter (G–R) law for forecasting the magnitude BASELINE SEISMICITY factor, and take different variants of the space–time epidemic-type aftershock sequence (ETAS) model 2.1 Log-Likelihood for the Evaluation Score of (Nanjo et al. (2012), and Ogata et al. (2013)). In Probability Forecast the following sections, I will outline these models. Through repeated revisions, CSEP attempts to es- 2.2.1 Magnitude frequency distribution Guten- tablish standard models to predict probability that berg and Richter (1944) determined that the num- conform to various parts of the world. Here I mean ber of earthquakes increased (decreased) exponen- the prediction/estimation of probability as predict- tially as their magnitude decreased (increased). De- ing/estimating conditional probabilities given the scribing this theory in terms of point processes, the past history of earthquakes and other possible pre- intensity of magnitude M is cursors. The likelihood is used as a reasonable measure of prediction performance (cf. Boltzmann 1 λ0(M) = lim Prob(M < Magnitude ≤ M + ∆) (1878); Akaike (1985)). The evaluation method for ∆→0 ∆ (1) probabilistic forecasts of earthquakes by the log- = 10a−bM = Ae−βM likelihood function has been proposed, discussed and implemented (e.g., Kagan and Jackson (1995); for constants a and b. In other words, the magnitude Ogata (1995); Ogata, Utsu and Katsura, 1996; Vere- of each earthquake will obey an exponential dis- −β(M−Mc) Jones (1999); Harte and Vere-Jones (2005); Schor- tribution such that f(M)= β e , M ≥ Mc, lemmer et al. (2007); Zechar, Gerstenberger and where β = b ln 10, and Mc is a cutoff magnitude Rhoades, 2010; Nanjo et al. (2012); Ogata et al. value above which all earthquakes are detected. Tra- (2013)). In some such studies, the evaluation score ditionally, the b-value had been estimated graph- EARTHQUAKE PREDICTION 3
Fig. 1. Top panel shows Delaunay tessellation upon which the piecewise linear function is defined. The smoothness con- straint is posed in the sum of squares of integrated slopes. Bottom panel shows b-value estimates of the G–R formula in equation (1) estimated from the data for earthquakes of M ≥ 5.4 from the Harvard University global CMT catalog (http: // www. globalcmt. org/ CMTsearch. html ). One conspicuous feature is that b-values are large in oceanic ridge zones, but small in plate subduction zones. ically, however, more efficient estimation is per- ther varies with time. Temporal and spatial b-value formed by the maximum likelihood method. Utsu changes have attracted the attention of many re- (1965) derived it by the moment method. Later, searchers ever since Suyehiro (1966) reported a dif- Aki (1965) demonstrated that this is a maximum- ference between b-values of foreshocks and after- likelihood estimate (MLE) and provided the error shocks in a sequence. estimate. It should be noted that the magnitudes in Here, we consider that β can vary with time, most catalogs are given in the interval of 0.1 (dis- space and space–time according to a function such crete magnitude values), hence, care should be taken as β(t), β(x,y,z) or β(t,x,y). Various nonparamet- for avoiding the bias of the b estimate in likelihood- ric smoothing algorithms such as kernel methods based estimation procedures (Utsu (1969)). have been proposed (Wiemer and Wyss, 1997). Al- Although the coefficient b in a wide region is ternatively, the β value can be parameterized by generally slightly smaller than 1.0, Gutenberg and smooth cubic splines (Ogata, Imoto and Katsura, Richter (1944) further determined that the b-value 1991; Ogata and Katsura (1993)) or piece-wise lin- varies according to location in smaller seismic re- ear expansions on Delaunay tessellated space (Ogata gions. The b-value varies even within Japan and fur- (2011c); see also Figure 1, e.g.). In such a case, a 4 Y. OGATA penalized log-likelihood (Good and Gaskins (1971)) where M is the magnitude of an aftershock and t is is used whereby the log-likelihood function is asso- the time following a main shock of magnitude M0; ciated with penalty functions in which the coeffi- the parameters are independently estimated by the cients are constrained for smoothness of the β func- maximum-likelihood method for respective empiri- tion. For the optimal estimation of the β function, cal laws. the weights in the penalty function are objectively After a large earthquake occurs, the Japan Me- adjusted in a Bayesian framework, as suggested by teorological Agency (JMA) and the United States Akaike (1980a). Geological Survey (USGS) have undertaken opera- 2.2.2 Aftershock analysis and probabilistic fore- tional probability forecast of the aftershocks. How- casting Typical aftershock frequency decays accord- ever, the forecast is announced after the elapse of ing to the reverse power function with time (Omori 24 h or more. This is due to the deficiency of after- (1894); Utsu, 1961, 1969). First, let N(s,t) be the shock data due to overlapping of seismograms after number of aftershocks in an interval (s,t). Then, the the main shock. In particular, the parameter a is occurrence rate of aftershocks at the elapsed time t crucial for the early forecast, but difficult to estimate since the main shock is in an early period, whereas the other parameters can 1 be default values for the early forecast [Reasenberg ν(t) = lim P {N(t,t + d∆) ≥ 1| and Jones (1994); Earthquake Research Committee ∆→0 ∆ (ERC), 1998]. The difficulty is because the parame- (2) Mainshock at time 0} ter a can substantially differ even if the magnitudes K of the main shocks are almost the same: for exam- = (t + c)p ple, the numbers of the aftershocks of M ≥ 4.0 of for constants K, c and p. This is known as the two nearby main shocks of the same M6.8 differ by Omori–Utsu (O–U) law. 6–7 times (JMA (2009)). Traditionally, estimates of the parameter p have It is notable that the strongest aftershocks oc- been obtained since the study of Utsu (1961) in curred within 24 h in most sequences (JMA (2008)). the following manner. The numbers of aftershocks Therefore, despite adverse conditions during data in a unit time interval n(t) are first plotted against collection, probabilistic aftershock forecasts should elapsed time on doubly logarithmic axes, and then be delivered as soon as possible within 24 h after are fit to an asymptotic straight line. The slope of the main shock to mitigate secondary disasters in this line is an estimate for p. The values of c can affected areas. be determined by measuring the bending curve im- For this purpose, it is necessary to estimate time- mediately after the main shock. Such an analysis is dependent missing rates, or detection rates, of af- based on the time series of counted numbers of af- tershocks (Ogata and Katsura, 1993, 2006; Ogata, tershocks. By such a plot, we can find aftershock 2005c) because they enable probabilistic forecasting sequences for which the formula (2) lasts a long pe- immediately after the main shock (Ogata, 2005c; riod, more than 120 years, for example (Utsu (1969); Ogata and Katsura (2006)). The detection rate of Ogata (1989); Utsu, Ogata and Matsu’ura, 1995). earthquakes is described by a probability function To efficiently estimate the three parameters di- q(M) of magnitude M such that 0 ≤ q(M) ≤ 1. The rectly on the basis of occurrence time records of intensity λ(M) for actually observed magnitude fre- aftershocks, assuming nonstationary Poisson pro- quency is described by λ(M)= λ0(M)q(M), corre- cess with intensity function (2), Ogata (1983) sug- sponding to thinning or random deletion. An ex- gested the maximum-likelihood method, which en- ample of the detection rate function is the cumu- abled the practical aftershock forecasting. Reasen- lative of Gaussian distribution or the so-called er- berg and Jones (1989) proposed a procedure based ror function q(M) = erf{M|µ,σ}. The parameter on the joint intensity rate of time and magnitude µ represents the magnitude at which earthquakes of aftershocks (Utsu (1970)) according to the G–R are detected at a rate of 50%, and σ represents law (1), a range of magnitudes in which earthquakes are partially detected. Let a data set of magnitudes λ(t, M)= λ0(M)ν(t) (3) {(ti, Mi); i = 1,...,N} be given at a period imme- 10a+b(M0−M) diately after the main shock. Assume that the pa- = (a, b, c, p; constant), (t + c)p rameters are time-dependent during the period such EARTHQUAKE PREDICTION 5 that exogenous variables. The ETAS model is applied to 10a+b(M0−M) data from the target time interval [S, T ]. The occur- (4) λ(t, M)= q{M|µ(t),σ} rence history H during the precursor period [0,S] (t + c)p S is used for sustaining stationarity of the process af- with an improving detection rate µ(t). An additional ter the time S. parametric approach proposed by Omi et al. (2013) Then, the model’s effectiveness in fitting an earth- uses the state–space representation method for real- quake sequence can be evaluated by comparing the time forecasting within the 24 h period. cumulative number N(S,t) of earthquakes with the 2.2.3 Epidemic-type aftershock sequence (ETAS) rate predicted by the model model The epidemic-type aftershock sequence t (ETAS) model describes earthquake activity as a (7) Λ(S,t)= λ(u|Hu) du point process (Ogata (1986), 1988) and includes the ZS O–U law for aftershocks as a descendant process. in the time interval S