<<

arXiv:1312.7712v1 [stat.ME] 30 Dec 2013 iyadcmlxt fteerhuk phenomenon. the con- of diver- complexity been of and because sity have remain though uncertainties discussions large even ducted, and However, analysis it. detailed with associated earthquake, anisms mech- major seismic remark- important every elucidated progressed have as After researchers has geophysics. increased in steadily ably of has study data the relevant avail- of The of understanding significantly. ability our increased 1960s, has late earthquakes the since science rseto atqaePrediction Ogata Yosihiko Earthquake of Research Prospect A 03 o.2,N.4 521–541 4, DOI: No. 28, Vol. 2013, Science Statistical er-u oy 5-55e-mail: Komaba, 153-8505 4-6-1 Tokyo Professor, Megro-Ku, Visiting Science, Tokyo, Industrial of of University Institute and 190-8562 Tachikawa,Tokyo Midori-cho, 10-3 Organization, System Research and Information Mathematics, Statistical

oiioOaai rfso mrts nttt of Institute Emeritus, Professor is Ogata Yosihiko c ttsia Science article Statistical original the the of by reprint published electronic an is This ern iesfo h rgnli aiainand pagination in detail. original typographic the from differs reprint hog eakbedvlpet nsldEarth solid in developments remarkable Through nttt fMteaia Statistics Mathematical of Institute 10.1214/13-STS439 .INTRODUCTION 1. nttt fMteaia Statistics Mathematical of Institute , rhclsaetm TSmdl,poaiiyfrcss p forecasts, probability changes. stress models, gains, ETAS mo (ETAS) space–time sequence archical epidemic-type constraints, eerht elz rcia prtoa oeatn nt phrases: and in words forecasting Key operational practical realize conside earthquake to of is research prospect the risk describes manuscript of earthqua this modeling fore, the quantitative earthquak to unless for urgency information incomplete t assigning useful potentially precursors in of as are discovery well phenomena unc as abnormal of earthquakes, nece whether amount which large identifying a considered, in particular, be In compl must . prediction, elements stochastic effective uncertain determinist For and apparent, difficult. tions the readily is of not prediction most are quake Because mechanisms crust. their Earth’s and the in stress accumulated Abstract. 03 o.2,N.4 521–541 4, No. 28, Vol. 2013, 2013 , atqae cu eas farp lp nfut u to due faults on slips abrupt of because occur Earthquakes [email protected] bomlpeoea simcsi,Bayesian slip, aseismic phenomena, Abnormal This . . in 1 o h td fErhuk Predictability Earthquake of as Study known the study for interna- cooperative An research predictability. tional organized earthquake an on develop program to seismologists for manner. this should in power models evolve forecasting predictive Earthquake whether improved. be is determine should models to seismicity over evaluated standard information in claim used useful those that potentially models incorporate prediction to New merit. lack presented may arguments otherwise, evaluation; such for a mn onre rn omjrearthquakes major to prone countries among way http://www.cseptesting.org/ (Jor- controversial effec- ( is al. the techniques et however, dan these basis types; the of various on tiveness of proposed anomalies been of have earthquakes ing effective for considered pro- be prediction. the earthquake to reflect earthquakes of faithfully and cesses must diverse scenarios all because complex determin- prediction in earthquake challenges unachievable istic to leads This eety hr a engoigmomentum growing been has there Recently, nteohrhn,svrltcnqe o predict- for techniques several hand, other the On 2011 ) hrfr,ojciiyi required is objectivity Therefore, )). ena future. near he rdcinis prediction e predictability ranylies ertainty e.There- red. es hier- dels, robability xcondi- ex cearth- ic efaults se e Any ke. ssitates large o scretyunder currently is ) Collaboratory (CSEP; 2 Y. OGATA for exploring possibilities in has been referred to as (relative) entropy, which is (e.g., Jordan (2006)). An immediate objective of the essentially similar to the log-likelihood. project is to encourage the development of statisti- 2.2 Space–Time-Magnitude Forecasting of cal models of seismicity, such as those subsequently Earthquakes discussed in Section 2, and to evaluate their predic- tive performances in terms of probability. Baseline models should be set to compare with In addition, the CSEP study aims to develop a and evaluate all predictability models. Based on em- scientific infrastructure to evaluate statistical signif- pirical laws, we can predict standard reference prob- icance and probability gain (Aki (1981)) of various ability of earthquakes in a space–time-magnitude methods used to predict large earthquakes by using range on the basis of the of present and observed abnormalities such as seismicity anomaly, past earthquakes. The framework of CSEP, which transient crustal movements and electromagnetic has evaluated performances of submitted forecasts anomaly. Here probability gain is defined as the ra- of respective regions (Jordan (2006); Zechar, Ger- tio of probability of a large earthquake estimated stenberger and Rhoades, 2010; Nanjo et al. (2011)), based on an anomaly to the underlying probability is similar to that of the Regional Earth- without anomaly. Section 3 describes this important quake Likelihood Models (RELM) project for spatial concept, and then discusses statistical point-process forecast (Field (2007); Schorlemmer et al. (2010)). models to examine the significance of causality of Different space–time models were submitted to the anomalies and also to evaluate the probability gains CSEP Japan Testing Center at the Earthquake Re- conditional on the anomalous events. search Institute, University of Tokyo, for the one- For prediction of large earthquakes with a higher day forecast applied to the testing region in Japan probability gain, comprehensive studies of anoma- (Nanjo et al. (2012)). This means that the model lous phenomena and observations of earthquake forecasts the probability of an earthquake at each mechanisms are essential. Several such studies are space–time-magnitude bin. However, the CSEP pro- summarized in Sections 4–6. Particularly, I have tocols have to be improved to those in terms of been interested in elucidating abnormal seismic ac- point-processes on a continuous time axis for the tivities and geodetic anomalies to apply them for evaluation including a real-time forecast (Ogata promoting forecasting abilities, as described in these et al. (2013)). sections. Almost all models incorporated the Gutenberg– 2. PROBABILITY FORECASTING OF Richter (G–R) law for forecasting the magnitude BASELINE SEISMICITY factor, and take different variants of the space–time epidemic-type aftershock sequence (ETAS) model 2.1 Log-Likelihood for the Evaluation Score of (Nanjo et al. (2012), and Ogata et al. (2013)). In Probability Forecast the following sections, I will outline these models. Through repeated revisions, CSEP attempts to es- 2.2.1 Magnitude frequency distribution Guten- tablish standard models to predict probability that berg and Richter (1944) determined that the num- conform to various parts of the world. Here I mean ber of earthquakes increased (decreased) exponen- the prediction/estimation of probability as predict- tially as their magnitude decreased (increased). De- ing/estimating conditional probabilities given the scribing this theory in terms of point processes, the past history of earthquakes and other possible pre- intensity of magnitude M is cursors. The likelihood is used as a reasonable measure of prediction performance (cf. Boltzmann 1 λ0(M) = lim Prob(M < Magnitude ≤ M + ∆) (1878); Akaike (1985)). The evaluation method for ∆→0 ∆ (1) probabilistic forecasts of earthquakes by the log- = 10a−bM = Ae−βM likelihood function has been proposed, discussed and implemented (e.g., Kagan and Jackson (1995); for constants a and b. In other words, the magnitude Ogata (1995); Ogata, Utsu and Katsura, 1996; Vere- of each earthquake will obey an exponential dis- −β(M−Mc) Jones (1999); Harte and Vere-Jones (2005); Schor- tribution such that f(M)= β e , M ≥ Mc, lemmer et al. (2007); Zechar, Gerstenberger and where β = b ln 10, and Mc is a cutoff magnitude Rhoades, 2010; Nanjo et al. (2012); Ogata et al. value above which all earthquakes are detected. Tra- (2013)). In some such studies, the evaluation score ditionally, the b-value had been estimated graph- EARTHQUAKE PREDICTION 3

Fig. 1. Top panel shows Delaunay tessellation upon which the piecewise linear function is defined. The smoothness con- straint is posed in the sum of squares of integrated slopes. Bottom panel shows b-value estimates of the G–R formula in equation (1) estimated from the data for earthquakes of M ≥ 5.4 from the Harvard University global CMT catalog (http: // www. globalcmt. org/ CMTsearch. html ). One conspicuous feature is that b-values are large in oceanic ridge zones, but small in plate subduction zones. ically, however, more efficient estimation is per- ther varies with time. Temporal and spatial b-value formed by the maximum likelihood method. Utsu changes have attracted the attention of many re- (1965) derived it by the moment method. Later, searchers ever since Suyehiro (1966) reported a dif- Aki (1965) demonstrated that this is a maximum- ference between b-values of and after- likelihood estimate (MLE) and provided the error shocks in a sequence. estimate. It should be noted that the magnitudes in Here, we consider that β can vary with time, most catalogs are given in the interval of 0.1 (dis- space and space–time according to a function such crete magnitude values), hence, care should be taken as β(t), β(x,y,z) or β(t,x,y). Various nonparamet- for avoiding the bias of the b estimate in likelihood- ric smoothing algorithms such as kernel methods based estimation procedures (Utsu (1969)). have been proposed (Wiemer and Wyss, 1997). Al- Although the coefficient b in a wide region is ternatively, the β value can be parameterized by generally slightly smaller than 1.0, Gutenberg and smooth cubic splines (Ogata, Imoto and Katsura, Richter (1944) further determined that the b-value 1991; Ogata and Katsura (1993)) or piece-wise lin- varies according to location in smaller seismic re- ear expansions on Delaunay tessellated space (Ogata gions. The b-value varies even within Japan and fur- (2011c); see also Figure 1, e.g.). In such a case, a 4 Y. OGATA penalized log-likelihood (Good and Gaskins (1971)) where M is the magnitude of an aftershock and t is is used whereby the log-likelihood function is asso- the time following a main shock of magnitude M0; ciated with penalty functions in which the coeffi- the parameters are independently estimated by the cients are constrained for smoothness of the β func- maximum-likelihood method for respective empiri- tion. For the optimal estimation of the β function, cal laws. the weights in the penalty function are objectively After a large earthquake occurs, the Japan Me- adjusted in a Bayesian framework, as suggested by teorological Agency (JMA) and the United States Akaike (1980a). Geological Survey (USGS) have undertaken opera- 2.2.2 Aftershock analysis and probabilistic fore- tional probability forecast of the . How- casting Typical aftershock frequency decays accord- ever, the forecast is announced after the elapse of ing to the reverse power function with time (Omori 24 h or more. This is due to the deficiency of after- (1894); Utsu, 1961, 1969). First, let N(s,t) be the shock data due to overlapping of seismograms after number of aftershocks in an interval (s,t). Then, the the main shock. In particular, the parameter a is occurrence rate of aftershocks at the elapsed time t crucial for the early forecast, but difficult to estimate since the main shock is in an early period, whereas the other parameters can 1 be default values for the early forecast [Reasenberg ν(t) = lim P {N(t,t + d∆) ≥ 1| and Jones (1994); Earthquake Research Committee ∆→0 ∆ (ERC), 1998]. The difficulty is because the parame- (2) at time 0} ter a can substantially differ even if the magnitudes K of the main shocks are almost the same: for exam- = (t + c)p ple, the numbers of the aftershocks of M ≥ 4.0 of for constants K, c and p. This is known as the two nearby main shocks of the same M6.8 differ by Omori–Utsu (O–U) law. 6–7 times (JMA (2009)). Traditionally, estimates of the parameter p have It is notable that the strongest aftershocks oc- been obtained since the study of Utsu (1961) in curred within 24 h in most sequences (JMA (2008)). the following manner. The numbers of aftershocks Therefore, despite adverse conditions during data in a unit time interval n(t) are first plotted against collection, probabilistic aftershock forecasts should elapsed time on doubly logarithmic axes, and then be delivered as soon as possible within 24 h after are fit to an asymptotic straight line. The slope of the main shock to mitigate secondary disasters in this line is an estimate for p. The values of c can affected areas. be determined by measuring the bending curve im- For this purpose, it is necessary to estimate time- mediately after the main shock. Such an analysis is dependent missing rates, or detection rates, of af- based on the time series of counted numbers of af- tershocks (Ogata and Katsura, 1993, 2006; Ogata, tershocks. By such a plot, we can find aftershock 2005c) because they enable probabilistic forecasting sequences for which the formula (2) lasts a long pe- immediately after the main shock (Ogata, 2005c; riod, more than 120 years, for example (Utsu (1969); Ogata and Katsura (2006)). The detection rate of Ogata (1989); Utsu, Ogata and Matsu’ura, 1995). earthquakes is described by a probability function To efficiently estimate the three parameters di- q(M) of magnitude M such that 0 ≤ q(M) ≤ 1. The rectly on the basis of occurrence time records of intensity λ(M) for actually observed magnitude fre- aftershocks, assuming nonstationary Poisson pro- quency is described by λ(M)= λ0(M)q(M), corre- cess with intensity function (2), Ogata (1983) sug- sponding to thinning or random deletion. An ex- gested the maximum-likelihood method, which en- ample of the detection rate function is the cumu- abled the practical aftershock forecasting. Reasen- lative of Gaussian distribution or the so-called er- berg and Jones (1989) proposed a procedure based ror function q(M) = erf{M|µ,σ}. The parameter on the joint intensity rate of time and magnitude µ represents the magnitude at which earthquakes of aftershocks (Utsu (1970)) according to the G–R are detected at a rate of 50%, and σ represents law (1), a range of magnitudes in which earthquakes are partially detected. Let a data set of magnitudes λ(t, M)= λ0(M)ν(t) (3) {(ti, Mi); i = 1,...,N} be given at a period imme- 10a+b(M0−M) diately after the main shock. Assume that the pa- = (a, b, c, p; constant), (t + c)p rameters are time-dependent during the period such EARTHQUAKE PREDICTION 5 that exogenous variables. The ETAS model is applied to 10a+b(M0−M) data from the target time interval [S, T ]. The occur- (4) λ(t, M)= q{M|µ(t),σ} rence history H during the precursor period [0,S] (t + c)p S is used for sustaining stationarity of the process af- with an improving detection rate µ(t). An additional ter the time S. parametric approach proposed by Omi et al. (2013) Then, the model’s effectiveness in fitting an earth- uses the state–space representation method for real- quake sequence can be evaluated by comparing the time forecasting within the 24 h period. cumulative number N(S,t) of earthquakes with the 2.2.3 Epidemic-type aftershock sequence (ETAS) rate predicted by the model model The epidemic-type aftershock sequence t (ETAS) model describes earthquake activity as a (7) Λ(S,t)= λ(u|Hu) du point process (Ogata (1986), 1988) and includes the ZS O–U law for aftershocks as a descendant process. in the time interval S

Although several alternative versions to the spa- tial factor given by the bracket of (8), as described by Ogata (1998), are available, the form in (8) fits best in terms of the Akaike information criterion (AIC; Akaike (1974)) for Japanese earthquake data sets. All extensions of the temporal ETAS model are referred to as space–time ETAS models (e.g., Nanjo et al. (2012)). 2.2.5 Hierarchical space–time ETAS model When a region becomes wide or the number of earthquakes becomes sufficiently large, spatial heterogeneity of seismicity becomes conspicuous. For example, many studies have been conducted on regional variation of seismicity-related parameters such as the b-value of the G–R law and p-values of the O–U law (Utsu, 1961, 1969; Mogi (1967)). Regarding space–time ETAS models, the af- Fig. 2. Iso-surface plot of the estimated conditional in- tensity function (8) of the space–time ETAS model (Ogata, tershock productivity K may differ significantly 1998) to the JMA data of shallow earthquakes among locations, even if magnitudes of triggering (depth ≤ 100 km) of magnitude 5.0 or larger from the period earthquakes are similar (see Section 2.2.2). More- 1926–1995; in addition, Utsu’s earthquake catalog for the 40 over, the main shock–aftershock and swarm-type years period before 1926 (1885–1925) was used as the preced- clusters exhibit significantly different activity pat- ing occurrence history of the space–time ETAS model. Space means longitude and latitude, whereas the depth data are ne- terns. Therefore, we applied an extension to the glected. above space–time model to earthquakes in the en- tire region for developing a hierarchical space–time ETAS (HIST–ETAS) model, which is a space–time where Sj is a normalized positive definite symmetric ETAS model in which parameter values µ, K, α, matrix for anisotropic clusters such that p and q can vary depending on location, such as (x,y)S(x,y)t µ(x,y), K(¯xj, y¯j), α(¯xj , y¯j),p(¯xj, y¯j) and q(¯xj, y¯j). (9) Thus, coefficients of parameter functions of the 1 σ σ space–time ETAS model in equation (8) must be = 2 x2 + 2ρxy + 1 y2 . 1 − ρ2 σ1 σ2 evaluated. Coefficients of each parameter function      are defined by values at locations of earth- Here, (¯xjp, y¯j) is an average location of earthquakes quakes and a number of points on the region bound- that are placed in the same cluster as (xj,yj). Both ary. Hence, each function is uniquely defined by lin- (¯xj, y¯j) and coefficients of Sj for a selected set of ear interpolation of values at three nearest points large earthquakes j are identified by fitting a bi- (earthquakes) determined by Delaunay tessellation variate normal distribution to spatial coordinates that is constructed by all the earthquake locations of the cluster occurring within a square of 3.33 × and additional points on the boundary of the region 100.5Mj −2 km side-length and within 100.5Mj −1 days (see Figure 1). after the large event of magnitude Mj , according to For a stable optimal estimation, the freedom of Utsu (1969); but I use 1 h in prediction stage. The lo- coefficients of parameter functions needs to be con- cations (¯xj, y¯j) of all other events, including cluster strained to assign penalties against roughness of the members, remain the same as the epicenter coordi- functions. The coefficients that maximize the pe- nates of the original catalog; and they are associated nalized log-likelihood are then sought, which is the with the identity matrix for Sj, namely, σ1 = σ2 = 1 equivalent of attaining the maximum posterior dis- and ρ = 0. See Figure 2 for an illustrative view of tribution. Here, we adjusted the optimal prior func- the conditional intensity (8). Further details of the tion for the parameter constraints in terms of the algorithm can be found in studies of Ogata (1998, penalty function by an empirical Bayesian method 2011a, 2011b). (Akaike (1980a)). Further details can be found in EARTHQUAKE PREDICTION 7

This equation considers the potential of further model extensions by useful physical concepts in the elastic rebound theory, such as stress interaction from neighboring earthquake ruptures. This phys- ical concept will be subsequently described in Sec- tions 4 and 5. BPT renewal process is based on the following Brownian perturbation process: (11) S(t)= λt + σW (t), t ≥ 0, which includes linearly increasing drift for stress ac- cumulation and diffusion rate σ. An earthquake oc- curs when the path S(t) attains the critical stress level sf , and the accumulated stress is released down to the ground state s0 based on elastic rebound the- ory of earthquakes (Reid (1910)). Random fluctua- Fig. 3. The optimal solution of the µ-values for background tions represent the transient stress changes due to seismicity of the space time ETAS model (8) in terms of min- the effect of other earthquakes in close proximity imum ABIC priors. The model is estimated from the JMA (see Section 4). This model includes four parame- data with earthquakes of M5.0 or larger for the target period ters: the stress accumulation rate λ, perturbation 1926–1995. In addition, Utsu’s earthquake catalog for the 40 rate σ, failure state s , and ground state s . If we years period before 1926 (1885–1925) was used as the precur- f 0 sory occurrence history of the space–time ETAS model. Con- assume that failure and ground states sf and s0, re- tours are equidistant in the logarithmic scale. Stars indicate spectively, are constant, the interval of earthquakes locations of earthquakes of magnitude 6.7 or larger that oc- is independent and identically distributed with the curred during 1996–2009, which mostly occurred in high back- BPT distribution, in which parameters are related ground rates. Note however that there are several large earth- by µ = (sf − s0)/λ and α = σ/ λ(sf − s0). quakes occurred at very low seismicity rates, the issue of which Because of very small sample size available from lead to Section 2.3. each , the mean parameterp µ has been esti- mated using methods other than the MLE. The the studies of Ogata, Katsura and Tanemura (2003) ERC (2001) uses a common α value of 0.24 through- and Ogata (2004a, 2011b). Figure 3 shows the opti- out Japan. This is because better fit of the same α mal solution of background seismic activities µ(x,y), value was shown by the AIC comparison than the which appear useful for long-term prediction of large different α estimates for respective active faults, for earthquakes in and near Japan. Moreover, stochastic a set of occurrence data with moderate sample sizes declustering using the space–time ETAS model can from four active faults (ERC, 2001). Also, the ERC make realizations of background seismicity (Zhuang, has estimated µ in two ways: as the mean of past Ogata and Vere-Jones, 2002; Zhuang et al. (2005b); recurrence intervals and as expected intervals esti- Bansal and Ogata (2013)). mated from the slip data of the fault plane. The 2.3 Long-Term Probability Forecasts of latter estimate is expressed by T = U/V , where U is Characteristic Earthquakes the slip size per earthquake and V is the deforma- tion rate per year, observed from the escarpment of A characteristic earthquake is a repeating large the fault. The ERC selects and applies one of these earthquake that is traditionally defined from pale- estimates for µ of each active fault according to re- oseismology observations. The estimation is made liability of the data. by using recurrence times of a large earthquake on Alternatively, Nomura et al. (2011) propose fol- an active fault or a particular seismogenic region on lowing Bayesian estimation procedure assuming a plate boundary. The Earthquake Research Com- a common prior distribution for fault segments mittee of Japan (ERC, 2001) adopted the Brown- throughout Japan. Consider historical occurrence ian Passage Time (BPT; Matthews, Ellsworth and X j data j = {Xi ; i = 1, 2,...,n} in the jth segment of Reasenberg, 2002) renewal process, in which the m fault segments. Consider a posterior density inter-event probability density function is given by posterior(µj, αj|Tj, φµ, φα) µ (x − µ)2 (12) (10) f(x|µ, α)= exp − . X 2πα2x3 2µα2x = L(µj, αj| j)π1(µj|Tj, φµ)π2(αj|φα), r   8 Y. OGATA where likelihood L is based on the renewal process Bayesian framework can also be used when the taking account of the forward and backward recur- occurrence times are uncertain, especially when we rence times (Daley and Vere-Jones (2003)) and Tj are dealing with geological data (Ogata (1999b); No- is the above-mentioned geologically estimated slip mura et al. (2011)) in addition to the magnitude deformation ratio from slip data. Furthermore, the dependent model (Ogata (2002)) based on the time- values of the hyperparameters φµ and φα character- predictable model (Shimazaki and Nakata (1980)). izing the prior densities of µ and α are common to all considered fault segments. We obtain their esti- 3. PRACTICAL EARTHQUAKE mates by maximizing the integrated posterior dis- FORECASTING tribution Probability gain refers to the ratio of predicted m ∞ ∞ conditional probability relative to baseline earth- Λ(φ)= posterior(µj, αj| quake probability. As far as I know, most probabil- 0 0 j=1 Z Z ity gains of are not very high, even rela- (13) Y tive to the stationary Poisson process model. There- Tj, φµ, φα) dµj dαj, fore, predictions based on a single anomaly data set where the subscript j represents the jth segment alone are not satisfactory for disaster prevention be- of m fault segments. This maximizing procedure cause baseline probability of a large earthquake it- is called the Type II maximum likelihood method self is very small according to the G–R law. Also, (Good (1965)). The selection of the best combi- the BPT renewal process model has been applied nation of the prior distribution factors and the to active fault segments to estimate time-dependent optimal values of the hyper-parameters in (12) probability on the basis of the last earthquake and are carried out to attain the smallest value of stress accumulation rate. In Californian, probability the Akaike Bayesian information criterion (ABIC ; gain showed an improvement of approximately 1.7 Akaike (1980a)) that is defined by ABIC = times over Poisson process model predictions (Jor- −2maxφ log Λ(φ)+2dim(φ), where dim{φ} denotes dan et al. (2011)). the number of hyperparameters. The key for research progress in practical proba- The most common forecast technique is a plug-in bility is to use a multiple pre- method, which is a probability forecast that uses a diction formula (Utsu (1979)) such that total prob- distribution or conditional intensity function with ability gain is approximately equal to the product of a parameter set to its estimated value, such as individual probability gains (Aki (1981)). The rate MLE. This method works well if the estimation er- of probability gain that an individual anomaly was ror is sufficiently small. However, its predictive per- actually a precursor to an earthquake may be calcu- formance can be inadequate when the sample size lated as its success rate of the anomaly divided by is small. Hence, ERC adopts the plug-in method precursor time (Utsu (1979)). Success rate can only only for µ and uses a common α value of 0.24 be determined from accumulation of actual earth- throughout Japan. Alternatively, Rhoades, Van Dis- quake occurrences; and precursor time can be stud- sen and Dowrick (1994), Ogata (1999b, 2002) and ied experimentally and theoretically (Aki (1981)). In Nomura et al. (2011) propose the Bayesian predic- this section I review important suggestions by Utsu tion (Akaike (1985)) (1979) and Aki (1981) and provide some examples ∞ ∞ of causality modeling toward improved accuracy for ˜ X h(y| )= h(y|µ, α){1 − F (y|µ, α)} probability gain. 0 0 (14) Z Z n 3.1 Abnormal and Precursor Phenomena × f(Xi|µ, α) dµ dα, The continuing pursuit of possible algorithms used i Y=1 to predict large earthquakes should consider specific where F (y|µ, α) is the cumulative distribution of the developmental patterns listed in the seismic catalog. density f(y|µ, α) in (10), and h(y|µ, α) is the hazard So far, an alarm-type method of earthquake pre- rate function. This prediction is shown to provide diction (Keilis-Borok et al., 1988; Keilis-Borok and a systematically better performance in the sense of Malinovskaya (1964); Rundle et al. (2002); Shebalin expected entropy criterion (Akaike (1985)) than the et al. (2006); Sobolev (2001); Tiampo, et al., 2002) plug-in predictor in case of very small sample sizes based on seismicity patterns has been operationally of data (Nomura et al. (2011)). implemented, in which predictions are conveyed to EARTHQUAKE PREDICTION 9 seismologists through e-mail. Some predictions in greater than M in a specified area, under the con- each year are published in an official document such dition that N anomalies A,B,C,...,S appeared si- as the Center for Analysis and Prediction, State multaneously. Assuming that anomalies are condi- Seismological Bureau, China (1990–2003). In addi- tionally independent on EM and the complement of tion, many papers have been published on earth- EM , Aki (1981) derived the following equation of quake predictions. Some of these predictions may be Utsu (1977, 1979) using Bayes’ theorem: statistically significant against the stationary Pois- son process that is assumed for normal seismicity. P (EM |A,B,C,...,S) Such alarm-type predictions have been further eval- 1 1 1 (15) = 1+ − 1 − 1 − 1 · · · uated by Zechar and Zhuang (2010), Jordan et al. PA PB PC (2011), Zhuang and Ogata (2011) and Zhuang and      1 1 N−1 −1 Jiang (2012). · − 1 − 1 , Comprehensive studies of anomalous phenomena P P  S   0   and observations of earthquake mechanisms are es- . where P = P (E ), P = P (E |A), P = P (E |B), sential for predicting large earthquakes with high 0 M A M B M ...,P = P (E |S). Note that this formula can be probability gains. However, it is difficult to deter- S M written as a linear relation of logit functions of prob- mine whether detected abnormalities are precursors abilities [see equation (23) in Section 3.4.2]. These to large earthquakes. Nevertheless, we aspire to be- probabilities for a short time interval become very come able to say that the probability of occurrence of a large earthquake, in a certain period and a cer- small so that (15) can be approximated by tain region, has increased a certain extent as com- PA PB PC PS (16) P (EM |A,B,C,...,S) ≈ P0 · · · . pared with the reference probability. Therefore, it is P0 P0 P0 P0 necessary to estimate uncertainty of the nature and urgency of abnormal phenomena relative to their The above relation shows that for multiple inde- roles as precursors to major earthquakes. For this pendent precursors, the conditional rate of earth- purpose, it is necessary to study a large number of quake occurrence can be obtained by multiplying anomalous cases for potential precursory links to the unconditional rate P0 with ratios of conditional large earthquakes. Thus, incorporation of this in- probability to unconditional probability P0. Each formation in the design of a prediction model for ratio is defined as the probability gain of a precursor. probability that exceeds the underlying probability Utsu (1979) retrospectively reported a high proba- is important. bility forecast of the 1978 Izu–Oshima–Kinkai earth- quake of M7.0 using the multiple independent pre- 3.2 Conditional Probability of an Earthquake for cursor formula. This is based on each probability as- Multiple Independent Precursors sessment of the anomalous phenomena consisting of As previously described, although an individual uplift in the Izu Peninsula, swarm, and a composite precursory anomaly is insufficient for providing a of a anomaly, anomalous water table change forecast of an earthquake with a high probabil- and volumetric strain anomaly. Each of such prob- ity, forecasting probability can be enhanced if sev- abilities was not very high. Aki (1981) summarized eral anomalies are simultaneously observed (Utsu, the Utsu report and further explained similar possi- 1977, 1979, 1982; Aki (1981)). The probability of ble calculations for successful prediction of the 1975 an anomaly being a precursor of a large earth- Haicheng earthquake of M7.3 in China by consid- quake should be estimated through comprehensive ering long-term, intermediate-term, short-term and observations. Then, it provides medium- or short- imminent precursory phenomena. term probability forecasts depending on the time 3.3 Improving Probability Gains by Seeking scale of enhanced earthquake probability following Statistically Significant Phenomenon the anomaly. For example, identification of fore- shocks (Section 3.4.2) and seismicity quiescence Here I would like to describe several point process (Section 5.1) belongs to short- and medium-term models which can enhance the probability gains. To forecasting, respectively. examine whether certain abnormal phenomena af- Let us find the probability P (EM |A,B,C,...,S) fect changes in the baseline rate of earthquake occur- of occurrence of an earthquake, with a magnitude rences, Ogata and Akaike (1982), Ogata, Akaike and 10 Y. OGATA

Katsura (1982) and Ogata and Katsura (1986) an- An example is the data of unusual intensities of alyzed causal relationships between earthquake se- ground electric potential, which were observed in ries from two different seismogenic regions, A and the vicinity of Beijing, China, during a 16-year pe- A B B. Let Nt and Nt be the number of earthquakes riod beginning in 1982. The issue was whether or above a certain magnitude thereshold in the time in- not these factors were useful as precursors to strong terval (0,t) in regions A and B, respectively. Then, earthquakes of M ≥ 4.0. Electricity anomalies could consider a model of the intensity function of point have been aftereffects of strong earthquakes. How- A process Nt for earthquake occurrences in the region ever, by comparing the goodness of fit of models (17) A, conditional on the history of earthquake occur- by AIC, anomalies were deemed statistically signif- A B rences Ht and Ht in both regions: icant as precursors to earthquakes (Zhuang et al. (2005a)). Moreover, the conditional intensity rate of J t A B j A declustered earthquakes M ≥ 4.0 or larger within a λA(t|Ht ,Ht )= µ + ajt + g(t − s) dNs radius of 300 km from the Huailai ground-electricity j=1 Z0 (17) X station was given by t B t + h(t − s) dNs , a 0 λ(t|Ht)= µ + h(t − s)ξ(s) ds Z S where the first two terms on the right-hand side (19) Z t of the equation represent the Poisson process of a −0.142(t−j) 0.69 trend, the third term represents the cluster com- = 0.00702 + 0.000117e ξj j S ponent within region A including aftershocks and X= swarms, and the last term represents the effect of (event/day) in the study of Ogata and Zhuang earthquake occurrences in region B. (2001), in which successively occurring M ≥ 4 earth- Here, it must be noted that even if a significant quakes within five days and 30 km distance were re- correlation is observed between the two series of moved from the data to account for the self-exciting events, it is insufficient from the standpoint of pre- effect in equation (18). According to this model, the diction, and it is necessary to identify causality. rate of M ≥ 4 earthquakes varies from a half to 10 Thus, we must examine the opposite causality by times the average occurrence of 0.0126 event/day. interchanging A and B in equation (17). If both Furthermore, the time series of electric anomaly direction models hold, this process is mutually ex- records were available from three other stations near citing (Hawkes (1971)). Furthermore, the correla- Beijing. If we assume that the four sets of the time tion between A and B regions may be indirect such series are approximately independent, we may con- that activities in both regions may be affected by sider the following conditional intensity rate by ex- additional factors, for which the trend term may tending the multiple precursor in equation (16): be useful if the polynomial can efficiently capture 4 4 m such an effect. According to our analysis of seismic- λAm (t|Ht ) (20) λ t Hm ≈ λˆ ity in two seismogenic regions along the subducting A t A ˆ m=1 ! m=1 λAm Pacific plate interface beneath the central Honshu, \ Y 4 Japan, seismicity causality was found as a one-way for the common region A = m=1 Am among four effect from the deeper to the shallower. The maxi- circular regions Am of 300 km radii from the four mum probability gain of the causal effect was several stations. Retrospective total probabilityT gain varies times the average occurrence rate. in the range 1/10–100 times of the average occur- Similarly, we can examine the causal relationship rence rate λA in the common region (Ogata and from some observed geophysical time series of ξs Zhuang (2001)). (Ogata and Akaike (1982); Ogata, Akaike and Kat- Here, we considered declustered earthquakes near sura (1982)) as follows: Beijing, but if we can consider the original data and model that take the triggered clustering effect into J t A ξ j A account (cf. Zhuang et al., 2005a, 2013), the corre- λA(t|H ,H )= µ + ajt + g(t − s) dN t t s sponding model would become j 0 X=1 Z (18) 4 4 m t m λ(t|Ht ) + h(t − s)f(ξ ) ds. (21) λ t Ht ≈ λ0(t|Ht) . s ! λ0(t|Ht) 0 m=1 m=1 Z \ Y

EARTHQUAKE PREDICTION 11

A similar but more general model is applied in 3.4 Probabilistic Identification of Foreshocks forecasting discussed in the following sec- The study of foreshocks should lead to a short- tion (Section 3.4.2). term forecasting. Although a considerable number Moreover, we examined periodicity, or seasonal- of foreshocks are observed, most are recognized af- ity, of earthquake occurrences (Ogata and Katsura, ter occurrence of a large earthquake. Nevertheless, 1986; Ma and Vere-Jones (1997)). Although such when earthquakes begin to occur in a local region, issues have been frequently discussed in statistical its residents should determine whether or not such , it was difficult to analyze correlations in movement is a precursor of a significantly larger the conventional method because the clustering fea- earthquake. The probability of foreshock type can ture of earthquakes frequently leads to incorrect re- be determined statistically from the data of ongo- sults (Aki, 1956). On the other hand, we found it ef- ing earthquakes in a particular region. Moreover, by fective to apply statistical models of stochastic point using composite identification data of magnitude se- processes that incorporate a clustering component; quence and degree of hypocenter concentration, the further details can be found in the study of Ogata probability gain of prediction is heightened. (1999a), and the references therein. These models can also be applied to examine whether or not var- 3.4.1 Working definitions for foreshock discrimi- ious geophysical anomalies are statistically signifi- nation When an earthquake of M4.0 or larger oc- cant as precursors of an approaching large earth- curs, it must first be determined whether or not the quake: movement is a continuation of nearby earthquakes. J Precisely, the connection to past earthquakes is de- j λθ(t|Ht)= µ + ajt termined by the single-link clustering (SLC) algo- j=1 rithm of Frohlich and Davis (1990). X The largest earthquake in a cluster is designated K 2πkt 2πkt as the main shock. Pre-shocks refer to all earth- (22) + c2k−1 cos + c2k sin quakes preceding the main shock of a cluster. All T0 T0 Xk=1  pre-shocks in a cluster become foreshocks when the t magnitude gap or magnitude difference between the + g(t − s) dNs. largest pre-shock and main shock is 0.45 or greater. 0 Z If the magnitude gap is smaller than 0.45, the cluster From estimated amplitudes of the one-year peri- with pre-shocks is defined as a swarm. An additional odic term with T = 365.24 days, it is evident that 0 type of cluster is the main shock–aftershock type, probability gains vary around the average occur- in which the main shock occurs first in the clus- rence rate of corresponding M ≥ 4.0 and M ≥ 5.0 ter. A magnitude gap of 0.45 or larger between the earthquakes. More extensive studies were reported main shock and largest pre-shock occurs in less than by Matsumura (1986) by using the above model, in approximately 20% of pre-shock clusters in Japan. which the trend-term (first two terms) was used for This 0.45 borderline of foreshock- and the swarm- artificial nonstationarity due to an increasing num- types has been determined by a trade-off between ber of observed earthquakes in the long-term global achievement of a larger magnitude gap, which re- catalog. He detected periodic effects for many mid- sults in better discrimination of the foreshock, and latitude seismic inland regions, whereas the season- a greater number of foreshock clusters, which re- ality was rarely observed in tropical seismic regions sults in better statistics. Here, to characterize these and ocean seismogenic zones. Correlations with pre- features, we note that clusters of foreshock-type cipitation variations were common in these results exclude main shock and subsequent aftershocks, and are most probably due to pore fluid pressure whereas other cluster types include all events in each changes in faults (see Section 4.1 for the physical cluster. This designation is made because real-time mechanism). An extension of the above periodic- recognition of the main shock, which is preceded by ity model, reported by Iwata and Katao (2006), in- foreshocks, is easy owing to the large magnitude gap, cludes a combination of (lunar) synodic and semi- whereas main shocks of other cluster types are dif- synodic periods to examine whether or not and how ficult to recognize until the end of the cluster. certain seismicity is affected by Earth tides. Statis- tical models, applications to validate data from the 3.4.2 Probability forecast by discrimination of earthquake-induced phenomena and their references foreshocks Using the location (x,y) of the first were reviewed by Ogata (1999a). earthquakes from clusters or isolated single earth- 12 Y. OGATA quakes, by the empirical Bayesian logit model, (23) 3 k Ogata, Utsu and Katsura (1996) obtained a map × µ0 + bkγi,j µ(x,y) of a probability that the earthquake will be a i 0, The earth crust and upper mantle lithosphere can respectively, where σ1 = 0.6709 and σ2 = 0.4456 be approximately considered as an elastic body. (km). These become distorted under stress, which in- Suppose that, at the current time, c|n shows the creases steadily in a particular direction. Fault stage where the nth earthquake (n = 2, 3, 4,..., #c) planes are cracks within the lithosphere or plate has occurred in a cluster c, where #c is the number boundary interfaces. Earthquakes occur through of all earthquakes in the cluster c. We propose the distortion of subsurface rocks and both sides of the forecasting probability pc|n by using the following fault plane moving out of alignment. The earth- logistic transformation: Set f = logit p = (1 − p)/p, quake location listed in hypocenter catalogs is lo- or p = 1/(1 + ef ); then, cation at which a fault displacement started, and earthquake magnitude represents eventual size of logit pc|n = logit µ(x1,y1) the displacement. Moreover, some catalogs record 1 the orientations and slips of fault planes of relatively + #(i < j ≤ n) large earthquakes. EARTHQUAKE PREDICTION 13

For each fault plane, stress tensor in lithosphere 4.2 Predicting Seismicity in the Peripheral Area is decomposed into two perpendicular components. by Abrupt Stress Changes The shear stress acts in a direction parallel to fault To explain earthquake induction or suppression shifting, and the normal stress acts perpendicular to of seismicity, it is useful to determine whether or the fault plane. The orientation of each fault plane not ∆CFS was due to a rapid faulting event that determines shear and normal stress, which define caused an earthquake. When a large earthquake oc- Coulomb failure stress (CFS): curs, low-frequency seismic waves and global posi- CFS = (Shear Stress) − (friction coefficient) tioning system (GPS) crustal displacement are ob- (24) served. From such observations, source fault mecha- × (Normal Stress − pore fluid pressure). nisms of the earthquake can be solved, such as size, orientation and vector of the fault slip. Such source CFS increases at a constant rate over time. When parameters are input into a computer program de- CFS exceeds a particular threshold, the fault slips signed by Okada (1992) to calculate ∆CFS in a re- dramatically (an earthquake). Then, the stress re- ceiver fault system on the basis of source fault data. duces to a certain value and accumulates again Thus, studies on induction of earthquakes, based on over decades to result in large earthquakes on plate ∆CFS, have become popular. Special issue volumes boundaries, and over thousands of years to result in on this subject have been edited by Harris (1998) slips on inland active faults (see Section 2.3). and Steacy, Gomberg and Cocco (2005). When an earthquake occurs, the displacement For example, Ogata (2004b) examined regional of the source fault causes sudden Coulomb stress ∆CFS in southwestern Japan by analyzing M7.9 To- changes (∆CFS) on the peripheral receiver faults. nankai and M8.1 Nankai earthquakes in 1944 and The ∆CFS of each receiver fault plane either de- 1946, respectively. Conventionally, some seismic qui- creases or increases depending on its orientations escence in this period was either considered as a relative to the slip angles of the source fault. On the genuine precursor or suspected as an artifact be- faults with increased ∆CFS, earthquakes occur ear- cause of incomplete detection of earthquakes during lier than expected, whereas on those with decreased the Second World War. Positive and negative ∆CFS ∆CFS, forthcoming earthquakes are delayed. When correlated strongly with seismic activation and qui- faults of similar orientations dominate a region, ei- escence, respectively. In particular, this study classi- ther seismic activation or quiescence is expected in fied seismicity anomalies into pre-seismic, coseismic the region. and post-seismic before, during and after massive In equation (24), the pore fluid pressure of the earthquakes, respectively. These scenarios may be gap fault related to CFS is generally a constant. helpful in interpreting seismicity in western Japan However, its changes may be important. For exam- prior to occurrences of expected subsequent large ple, pressure changes in the fluid magma gap affect earthquakes along the . swarm activity in volcanic areas (Dieterich, Cayol and Okubo, 2000; Toda, Stein and Sagiya, 2002). 4.3 Physical Implication of the ETAS Model and In addition, earthquakes can be induced through Seismicity increased pore fluid pressure in a fault system In general, interactions among earthquakes are (Hainzl and Ogata (2005); Terakawa, Hashimoto fairly complex. Once an earthquake occurs in a par- and Matsu’ura (2013)), which is occasionally due to ticular location, CFS of the fault system adjacent heavy rainfall or shaking of the earth crust owing to to the source fault is considerably increased, and propagated strong seismic waves; the latter causes many earthquakes are induced. Traditionally, these dynamic triggering (Steacy, Gomberg and Cocco, earthquakes are called as aftershocks. Some of them 2005, and papers included in the same volume). are induced outside the aftershock region; these are Relevantly, the seasonal nature of seismicity or an- also called as off-fault aftershocks, or aftershocks in nual periodicity has been discussed in Section 3.3 a broad sense. Significant changes in stress result [cf. equation (22)]. Moreover, the ETAS model ap- in many aftershocks; even small changes can induce plications to seismicity changes that were induced aftershocks to some extent. Furthermore, any after- by dynamic triggering or injection of water were shock can change stress, too, causing their after- reported (Lei et al., 2008, Lei, Xie and Fu, 2011). shocks. Because of such complex interactions among 14 Y. OGATA invisible fault segments in the crust, detailed calcu- studies such as aftershocks in Japan and the time lations of such stress changes are difficult and im- evolution of seismicity rate. Because regional charac- practical. teristics of earthquake occurrences can be captured Therefore, statistical models designed to describe and considered as typical seismicity in this model, the actual macroscopic outcome of these stress inter- it has been accepted by seismologists as a standard actions are required. For example, the ETAS model model of ordinary seismicity. The ETAS model is in equation (5), which consists of empirical laws used as a “barometer” for detecting significant de- of aftershocks, quantifies dynamic forecasting of in- viations from normal activities as demonstrated in duced effects. By fitting to the selected data from Figure 4 in Ogata (2005a). the catalog earthquake, this ETAS model deter- mines the parameters by the maximum-likelihood 5. SEISMICITY ANOMALIES FOR method. Thus, prediction of earthquakes conform- INTERMEDIATE-TERM FORECASTS ing to regional diversity is possible. 5.1 Seismicity Quiescence Relative to the ETAS On the other hand, the friction law of Dieterich Model (1994), which was developed on the basis of rock fracture experiments with controlled stress, can be The deviation of actual cumulative number of linked to statistical laws of earthquake occurrences. earthquakes is measured relative to the theoretical In particular, this law reproduces temporal and spa- cumulative function of the earthquake that serves tial distribution of the attenuation rate of after- as an indefinite integral in time for the predicted shocks, such as that determined by the O–U law rate function (7) of the ETAS model. Relative qui- in equation (2). However, because of seismicity di- escence occurs when actual earthquake occurrence versity, predictions adapting well to development of rates are systematically lowered in comparison with seismicity appear to be difficult. the predicted incidence that is determined by the Seismicity anomalies can hardly be detected by ETAS model (Ogata (1992)). Relative quiescence observing conventional plots of earthquake series lasting for many years was observed in a broad because they show a complex generation due to region before great earthquakes of M8 class and successive occurrences of earthquakes or clustering. larger occurred in and near Japan (Ogata (1992), The clustering nature is also the main difficulty 2006b). Similar phenomena were observed before for traditional statistical test analysis. Complexity M9-class large earthquakes in other regions of the due to the clustering feature creates difficulties in world. revealing anomalies of seismicity caused by slight Since 2001, I have reported 25 agendas of var- stress changes, hence, various anomalous signals are ious seismicity anomalies and forecasting propos- missed. als in Japan at the Coordinating Committee for Therefore, some seismologists have devised vari- Earthquake Prediction of Japan (CCEP). Except ous declustering methods that include only isolated the agenda that reported seismic quiescence of af- and largest earthquakes in a clustering group or the tershock activity before the largest aftershock (see main shock, and other earthquakes are excluded. Section 6.2 for detail), all were ex-post analysis re- On the basis of declustered data, statistical signif- port; the agendas were summarized in Ogata (2009). icance of seismic quiescence was tested against the In addition, among 76 aftershock cases in Japan Poisson process. Occasionally, however, analysis re- that I have investigated, relative quiescence was ob- sults depend on the choice of criteria of the adopted served in 34 (see Ogata (2001b), and its appendix declustering algorithm (Van Stiphout, Zhuang and for details of the case studies). Moreover, Section 5.4 Marsan, 2012). Hence, results could be due to arti- includes a discussion on the manner in which re- ficial treatment. In addition, declustering methods sults of this aftershock study will be used for space– result in a significant loss of information because time probability prediction of a neighboring large they discard a large amount of data from the origi- earthquake with a size similar to that of the main nal catalog. shock. The ETAS model uses original earthquake data Here, I will note the results on the aftershock without declustering. As mentioned in Section 2.2.3, research of inland earthquakes of M6.0 or larger the ETAS model is a point process model configured in southwestern Japan that occurred 30 years be- to conform to empirical laws derived from various fore and after the M8.1 Nankai earthquake in 1946. EARTHQUAKE PREDICTION 15

Fig. 4. Shallow earthquakes in the inland rectangular region in (a) were analyzed from August 2002–July 2003 to investigate the effect of the M7.0 earthquake in 2003. CFS increments of this region took the largest values transferred from the M7.0 earthquake that is shown by the small rectangle fault located at (141.7◦E, 38.8◦N) at a depth of 71 km. and latitude versus time are shown in (a) and (b), respectively. The occurrence time of the M7.0 earthquake is shown in each panel as the vertical dotted line indicated by TJ. The ETAS model was fitted to the data from the target period (TS, TJ). The panel (c) shows cumulative numbers and magnitude versus ordinary time; and the panel (d) shows these values against the transformed time determined in equation (7) by the estimated ETAS model. The black and grey cumulative function in panels (c) and (d) show the empirical cumulative function (step function) and the theoretical one (curve and straight line) estimated and then predicted by the ETAS model, respectively. At the southeastern corner of the inland rectangular region, a large M6.2 earthquake (occurrence time indicated by TM) and its largest M5.5 foreshock (indicated by TF) occurred on July 26, 2003. The panel (d) shows that the foreshock activity was more active than was expected, which is seen from the steepest slope of the cumulative function in (d). In contrast, the aftershock activity of the M6.2 earthquake, during the period TM-Tend, appears to be similar to predicted rates in (d). Dotted parabola-like envelope curves show twofold standard deviations (95% error bands) of cumulative numbers of the transformed time.

Among six earthquakes which occurred before 1946, and future aftershock activity of similar large inland relative quiescence was observed in five aftershock earthquakes. sequences. In contrast, among seven earthquakes af- 5.2 Aseismic Slip, Stress Change and Seismicity ter 1946, relative quiescence was not observed in Anomalies six aftershock sequences, and these aftershock se- quences were on track as expected. Since the ERC Since a dense GPS observation network was es- forecasts the next large earthquake for the next 30 tablished in Japan, aseismic fault motions or slow years 60–70% (see Section 2.3, also see Ogata, 2001b slips that could not be detected by and 2002), it would be worthy to monitor recent have been successively identified in the plate bound- 16 Y. OGATA

(a) (b)

Fig. 5. (a) Seismic activity for the eight years period before the M6.8 earthquake of 2004. Dominating fault slip orientations of earthquakes in this area are similar to those of the main shock and its aftershocks. The small black rectangle in the center of the magnified geographical map shows the main shock fault model determined by the GPS observations. The regions of thin and thick contours show positive and negative Coulomb failure stress (CFS) increments, respectively, assuming that slow slips in the source have occurred for some time. These define four subregions N, S, W and E used for the following ETAS analysis. (b) The four panels show the empirical cumulative curve (thin black) of the sequence of earthquakes of magnitude 2 or larger in each of the four divided subregions from 1997 until the M6.8 earthquake (downward arrows). Thick gray curves show estimated and predicted cumulative functions before and after each change-point time, respectively. Activation and quiescence relative to those predicted by the ETAS model agrees with increase and decrease in CFS, respectively. ary regions. We can take occurrence of such motion 5.3 Variation of Local Stress Deduced from into account in discussing the relationship between Spatio-Temporal Variation of Aftershocks seismicity anomalies (quiescence or activation) and Local anomalies occur in space–time locations of stress changes. most of the aftershocks. To elucidate these anoma- Specifically, it can be assumed that slow slips on a lies, we firstly apply the O–U formula (2) for after- focal fault or its adjacent part have occurred during shock decay to data of occurrence times and convert a particular period. Then, depending on dominat- these times by the estimated theoretical cumulative ing orientations of receiver faults neighboring the function (7). We then examine whether space–time focal fault, CFS could decrease or increase. Accord- coordinates on a projected line, such as longitude ingly, we expect that seismicity there decrease or and latitude, against the converted time remained increase relative to the expected occurrence rate by temporally uniform or not. If nonuniformity in a cer- the ETAS model. Such seismicity anomalies are re- tain portion of space–time conversion is observed, vealed before some recent large earthquakes (Ogata, this implies discrepancies between theoretical and 2005b, 2007, 2010a, 2011c; and Kumazawa, Ogata actual aftershock occurrences in such a place. Sev- and Toda, 2010). See Figure 5 for an example. By eral possible scenarios for such discrepancies are of- assuming slow slip on the source fault, the peripheral fered: Secondary aftershocks that follow a large af- regions were classified as either of the increasing or tershock are obvious once seen as a cluster. Such a decreasing CFS. Then, each region can theoretically cluster shows traces of a new local rupture to ex- correspond to an area that either promoted or sup- tend the peripheral portion of the fault of the main pressed seismicity. Such anomaly patterns of seis- shock. Moreover, when a nonuniform portion other micity relative to the ETAS model (5) are in good than the secondary aftershocks is observed, it is cru- agreement with those of CFS increment. cial for us to explore the reasons. EARTHQUAKE PREDICTION 17

Based on recent accurate aftershock location data, of the distance between stations is basically linear Ogata (2010b) revealed local relative quiescence and with time because the subducting plate converges activations; these can occur associated with post- with constant speed. However, several years prior to or pre-slips of a large aftershock. These anomalies large inland earthquakes of M7 class, the time series were systematically investigated assuming that they of the baseline distance variation around the fault were related to changes in the CFS rate. In addi- was observed with systematic deviation from a lin- tion, assuming several scenarios of stress changes ear trend (Ogata, 2007, 2010a, 2011c; Kumazawa, due to slow slips, Ogata and Toda (2010) and Ogata Ogata and Toda, 2010; GSI, 2009). Each deviation (2010b) performed simulations to reproduce seismic- of these baselines was consistently explained by slow ity anomalies of relative activation and quiescence slips on the earthquake source fault or on its down- within aftershocks on the basis of the rate/state fric- dip extension. These results were due to post-hoc tion law of Dieterich (1994). analysis based on knowledge of the source fault ob- 5.4 Space–Time Probability Gain of a Large tained by coseismic displacement. Earthquake Under Relative Quiescence of From a predictive perspective, it is highly desir- Aftershocks able to estimate such a fault slip in near real-time to The probability of relative quiescence being pre- that of occurrence. So far, several estimates of suf- cursor to a large earthquake must be evaluated with ficiently large slips on plate boundaries have been their likely time and location. Because these in- obtained from GPS records by inversion analysis. volve many conditions and a number of parameters, GSI has regularly reported such estimates of coseis- they cannot be easily stated. However, by statisti- mic, post-seismic and large-size habitual slips, at cal studies of aftershock sequences in Japan (Ogata, the CCEP meeting. However, it is difficult to ob- 2001a), what I can say about a probability gain that tain fine images of small slips, particularly in in- a large earthquake will occur is as follows: First, if a land, even though inland GPS stations are arranged large earthquake occurred in a particular location, closely. This is attributed to high seismicity rather the probability per unit area that another earth- than GPS observation errors. Because strong earth- quake of similar magnitude will occur in the vicin- quakes occur frequently, various effects of slow slips ity is greater than that which will occur in a distant in GPS records are mixed up with such stronger area. This is the result of simple statistics regarding changes. Hence, development of statistical models the self-similarity feature (inverse-power law corre- and methods to separate such signals is crucial. To lations), and also physically suggests that the neigh- estimate slow slips more precisely, combined model- boring earthquake will be more probably induced by ing and analysis of seismicity and geodetic anomalies a sudden stress change on the periphery because of will be useful. Analyzing both seismicity and tran- the abrupt slip of the earthquake. Moreover, if after- sient geodetic movements in a number of areas and shock activity becomes relatively quiet, it becomes locating the area of aseismic slip is very important more likely that large aftershocks will occur around for increasing the probability gain of a large earth- the boundary of the aftershock area. Furthermore, if quake. relative quiescence lasts for a sufficiently long time more than a few months, the probability that an- 6.2 Considering the Scenario of an Earthquake other earthquake of similar magnitude will increase from Aseismic Slip within six years in the vicinity of the aftershock area Observed anomalies of crustal movement and seis- within 200 km distance. micity assume fault mechanisms and locations of 6. SEISMICITY AND GEODETIC ANOMALIES slip precursors for prediction probability; therefore, their uncertainty must be estimated. In addition, 6.1 Aseismic Slip and Crustal Deformation probabilities of considered scenarios must be esti- The Geological Survey Institute (GSI) of Japan mated. Such tasks are difficult. A possible method compiles daily geodetic locations of global position- is to consider the logic tree of various scenarios re- ing system (GPS) stations throughout Japan, and garding destruction of the fault system by attaching baseline distances between GPS stations can be cal- appropriate subjective or objective probabilities to culated from data in the GPS catalog. The geode- tree components, as was performed for long-term tic time series show that contraction or extension predictions in California and Japan. Hence, such a 18 Y. OGATA scenario ensemble gives a forecast probability. Simi- boundary, where the next great earthquakes are ex- larly, medium- and short-term prediction logic trees pected. of various scenarios must be considered. At the CCEP meeting on April 6, 2005, I reported 7. CONCLUSIONS relative quiescence of aftershocks of the Fukuoka– To predict the future of a complex and diverse Oki earthquake (Ogata, 2005d). In addition, I ex- earthquake generation process, probability fore- amined potential slow slip areas on nearby active casting cannot be avoided. The likelihood (log- faults that may have created a stress shadow (re- likelihood) is rational to measure the performance gion of decreased CFS) to cause relative quiescence of the prediction. To provide a standard stochastic in the aftershock sequence. Among these areas, the prediction of seismic activity in long term and short Kego fault, traversing Fukuoka City, had a large term, it is necessary to construct proper point pro- positive ∆CFS because of the main shock rup- cess models and revise those that conform to each ture, which showed evidence of possible slow-slip in- region. By the appearance of the anomaly, we need duction. Furthermore, the seismogenic zone along to evaluate the probability that it will be a pre- the Kego fault had already activated before the cursor to a large earthquake. Namely, we need to Fukuoka–Oki earthquake occurred (Ogata (2010a)). forecast that the probability in a space–time zone will increase to an extent, relative to those of the ref- Therefore, because a slow-slip scenario on this fault erence probability. For this, we make use of a point was possible, I examined the pattern causing stress process model for the causality relationship. variation in the aftershock area. However, no stress It is desired to search any anomaly phenomena shadow in the aftershock area was found. Therefore, that enhance the probability gains. Having such I determined that probability of slow slip on the anomalies, application of the multiple element pre- Kego fault was quite low. I also examined whether diction formula increases a precursory probability. other possible slow slips in neighboring active faults A comprehensive physical study between precursory could create a stress shadow that covered the af- phenomena and earthquake mechanisms is essential tershock region. However, no large earthquake has for composing useful point process models. These occurred in those faults thus far. elements must be incorporated to achieve predicted Approximately one month later, however, the probability exceeding predictions of typical statisti- largest aftershock occurred at the southeast end cal models. of the aftershock zone. Post-mortem examination Furthermore, to determine urgency and uncer- based on information of the fault mechanism of this tainty of major earthquakes against abnormal phe- aftershock and detailed aftershock data revealed a nomena, numerous research examples must be ac- detailed scenario. This means that by assuming a cumulated. On the basis of these examples, possi- slow slip into the gap between the fault of the largest ble prediction scenarios must be presented. Further- aftershock and main shock, relative quiescence of ac- more, to adapt well to diversity of earthquake gen- tivity in the deeper part of the aftershock zone can eration, it is useful to adopt Bayesian predictions be clearly explained (Ogata (2006a)). Moreover, the (Akaike (1980b); Nomura et al. (2011)) and consider slip can explain relative quiescence in the induced region-specific models. My experiences thus far confirm that the method swarm activity that occurred away from the after- of statistical science is essential to elucidate move- shock area (Ogata (2006a)). ment leading to prediction of a complex system. This setting as a prediction of future scenarios is There is a need for development of a forecasting much more vague and difficult to explain, even if it model that reflects diversity of the vast amount includes an ex-post scenario. Moreover, the time of of information on seismicity and various covariate occurrence must be predicted in addition to loca- data. I believe that these will be developed by in- tion, which is more difficult. Occurrence of slow slip venting an appropriate hierarchical Bayesian model. does not always indicate a proximate precursor of Space–time models for seismicity have become in- fault corruption. Nevertheless, it is desired to keep creasingly complicated (Ogata, 1998, 2004a, 2011b; observing GPS data to form scenarios of forthcom- Ogata, Katsura and Tanemura (2003); Ogata and ing large earthquake. For example, using Bayesian Zhuang (2006)). inversion by using GPS records, Hashimoto et al. A similar evolution is required for statistical mod- (2009) estimated the locked zones on the plate els of geodetic GPS data. EARTHQUAKE PREDICTION 19

ACKNOWLEDGMENTS Dieterich, J. (1994). A constitutive law for rate of earth- quake production and its application to earthquake clus- I am grateful to the Japan Meteorological Agency tering. J. Geophys. Res. 99 2601–2618. (JMA), the National Research Institute for Earth Dieterich, J., Cayol, V. and Okubo, P. G. (2000). The Science and Disaster Prevention, and universities use of earthquake rate changes as a stress meter at Kilauea 408 for the providing hypocenter data. I am also grate- volcano. Nature 457–460. Earthquake Research Committee (1998). Regarding meth- ful to the anonymous referee, Associate Editor and ods for evaluating probility of aftershocks. Available at the Editor for their careful reviews and suggestions, http://www.jishin.go.jp/main/yoshin2/yoshin2.htm. which led to a significant revision of the present Earthquake Research Committee (2001). Regarding meth- manuscript. This work was supported by JSPS ods for evaluating long-term probility of earthquake oc- currence. Available at http://www.jishin.go.jp/main/ KAKENHI Grant Number 23240039, and by the choukihyoka/01b/chouki020326.pdf. Aihara Innovative Mathematical Modelling Project, Field, E. (2007). Overview of the working group for the de- the “Funding Program for World-Leading Innova- velopment of regional earthquake likelihood models. Seis- tive R&D on Science and Technology (FIRST Pro- mology Research Letters 78 7–16. gram),” initiated by the Council for Science and Frohlich, C. and Davis, S. D. (1990). Single-link cluster analysis as a method to evaluate spatial and temporal prop- Technology Policy. erties of earthquake catalogues. Geophys. J. Int. 100 19–32. Good, I. J. (1965). The Estimation of Probabilities. MIT Press, Cambridge, MA. MR0185724 REFERENCES GSI (2009). Crustal movements in the Tohoku district. Rep. Coord. Comm. Earthq. Predict. 83 59–81. Akaike, H. (1974). A new look at the statistical model iden- Good, I. J. and Gaskins, R. A. (1971). Nonparametric tification. IEEE Trans. Automat. Control. AC-19 716–723. roughness penalties for probability densities. Biometrika MR0423716 58 255–277. MR0319314 Akaike, H. (1980a). Likelihood and the Bayes procedure. In Gutenberg, B. and Richter, C. (1944). Frequency of earth- Bayesian Statistics (Valencia, 1979) (J. Bernard, M. De quakes in California. Bull. Seismol. Soc. Amer. 34 185–188. Groot, D. Lindley and A. Smith, eds.) 143–166. Univ. Hainzl, S. and Ogata, Y. (2005). Detecting fluid signals in Press, Valencia. MR0638876 seismicity data through statistical earthquake modeling. Akaike, H. (1980b). On the use of the predictive likelihood J. Geophys. Res. 110 B05S07. 32 of a Gaussian model. Ann. Inst. Statist. Math. 311–324. Harris, R. A. (1998). Introduction to special section: Stress MR0609025 triggers, stress shadows, and implications for seismic haz- Akaike, H. (1985). Prediction and entropy. In A Celebration ard. J. Geophys. Res. 103 24347–24358. of Statistics (A. Atkinson and S. Fienberg, eds.) 1–24. Harte, D. and Vere-Jones, D. (2005). The entropy score Springer, New York. MR0816143 and its uses in earthquake forecasting. Pure Appl. Geophys. Aki, K. (1956). A review on statistical seismology. Zisin II 162 1229–1253. 8 (J. Seismol. Soc. Japan) 205–228. Hashimoto, C., Noda, A., Sagiya, T. and Matsu’ura, M. Aki, K. (1965). Maximum likelihood estimate of b in the for- (2009). Interplate seismogenic zones along the Kuril–Japan mula log N = a−bM and its confidence limits. Bull. Earthq. trench inferred from GPS data inversion. Nature Geo- 43 Res. Inst. 237–238. science 2 14–144. Aki, K. (1981). A probabilistic synthesis of precursory phe- Hawkes, A. G. (1971). Spectra of some self-exciting and nomena. In Earthquake Prediction (Maurice Ewing Series, mutually exciting point processes. Biometrika 58 83–90. 4) (D. Simpson and P. Richards, eds.) 566–574. Ameri- MR0278410 can Geophysical Union, Washington, DC. Iwata, T. and Katao, H. (2006). Correlation between the Bansal, A. and Ogata, Y. (2013). Non-stationary epidemic phase of the moon and the occurrences of microearthquakes type aftershock sequence model for seismicity prior to the in the Tamba region through point-process modeling. Geo- 26 December 2004, M9.1 Sumatra-Andaman Islands mega- physical Research Letters 33 L07302. earthquake. J. Geophys. Res. 118 1–14. JMA (2008). Japan Meteorological Agency, Seismic activity Boltzmann, L. (1878). Weitere Bemerkungen uber einige in and around Kanto and Chubu Districts (May 2007–April Plobleme der mechanischen Warmetheorie. Wiener 2008). Rep. Coord. Comm. Earthq. Predict. 80 80–99. Berichte 78 7–46. JMA (2009). The Iwate-Miyagi Nairiku earthquake in 2008. Center for analysis and prediction (1990–2003). Study on the Rep. Coord. Comm. Earthq. Predict. 81 101–131. seismic tendency in China (for the year 1989,..., 2002 and Jordan, T. H. (2006). Earthquake predictability, brick by 2003) (in Chines). Beijing, Seismological Press. brick. Seismol. Res. Lett. 77 3–6. Cox, D. R. (1975). Partial likelihood. Biometrika 62 269–276. Jordan, T. H., Chen, Y. T., Gasparini, P., Madaria- MR0400509 ga, R., Main, I., Marzocchi, W., Papadopoulos, G., Daley, D. J. and Vere-Jones, D. (2003). An Introduction to Sobolev, G., Yamaoka, K. and Zschau, J. (2011). Op- the Theory of Point Processes. Vol. I: Elementary Theory erational earthquake forecasting. State of knowledge and and Methods, 2nd ed. Springer, New York. MR1950431 guidelines for utilization. Ann. Geophys. DOI:10.4401/ 20 Y. OGATA

ag-5350. Available at http://www.annalsofgeophysics. Ogata, Y. (1992). Detection of precursory relative quies- eu/index.php/annals/article/view/5350/5371. cence before great earthquakes through a statistical model. Kagan, Y. and Jackson, D. (1995). New seismic gap J. Geophys. Res. 97 845–919. hypothesis–5 years after. J. Geophys. Res. B: Solid Earth Ogata, Y. (1995). Evaluation of probability forecasts of 100 3943–3959. events; invited discussion as a commentary on “Forecasting Keilis-Borok, V. and Malinovskaya, L. (1964). One regu- Earthquakes and Earthquake Risk” by Prof. D. Vere-Jones. larity in the occurrence of strong earthquakes. J. Geophys. Int. J. Forecasting 11 539–541. Res. 70 3019–3024. Ogata, Y. (1998). Space–time point-process models for Keilis-Borok, V., Knopoff, L., Rotwain, I. and earthquake occurrences. Ann. Inst. Statist. Math. 50 379– Allen, C. (1988). Intermediate-term prediction of occur- 402. rence times of strong earthquakes. Nature 335 690–694. Ogata, Y. (1999a). Seismicity analysis through point-process Kumazawa, T., Ogata, Y. and Toda, S. (2010). Precur- modeling: A review. Pure Appl. Geophys. 155 471–507. sory seismic anomalies and transient crustal deformation Ogata, Y. (1999b). Estimating the hazard of rupture using prior to the 2008 Mw = 6.9 Iwate-Miyagi Nairiku, Japan, uncertain occurrence times of paleoearthquakes. J. Geo- earthquake. J. Geophys. Res. 115 B10312. phys. Res. 104 17995–18014. Lei, X., Xie, C. and Fu, B. (2011). Remotely triggered seis- Ogata, Y. (2001a). Biases and uncertainties when estimating micity in Yunnan, southwestern China, following the 2004 the hazard of the next Nankai earthquake. Chigaku Zasshi Mw9.3 Sumatra earthquake. J. Geophys. Res. 116 B08303. (J. Geography) 110 602–614. Lei, X., Yu, G., Ma, S., Wen, X. and Wang, Q. (2008). Ogata, Y. (2001b). Increased probability of large earth- Earthquakes induced by water injection at ∼ 3 km depth quakes near aftershock regions with relative quiescence. within the Rongchang gas field, Chongqing, China. J. Geo- J. Geophys. Res. 106 8729–8744. phys. Res. 113 B10310. Ogata, Y. (2002). Slip-size-dependent renewal processes and Ma, L. and Vere-Jones, D. (1997). Application of M8 and Bayesian inferences for uncertainties. J. Geophys. Res. 107 Lin–Lin algorithms to New Zealand earthquake data. New 2268. Zealand Journal of Geology and Geophysics 40 77–89. Ogata, Y. (2004a). Space–time model for regional seismicity Matsumura, K. (1986). On regional characteristics of sea- and detection of crustal stress changes. J. Geophys. Res. sonal variation of shallow earthquake activities in the 109 B03308. World. Bull. Disas. Prey. Res. Inst., Kyoto Univ. 36 43–98. Ogata, Y. (2004b). Seismicity quiescence and activation in Matthews, M. V., Ellsworth, W. and Reasen- western Japan associated with the 1944 and 1946 great berg, P. A. (2002). A Brownian model for recurrent earth- earthquakes near the Nankai trough. J. Geophys. Res 109 quakes. Bull. Seismol. Soc. Amer. 92 2233–2250. B04305. Mogi, K. (1967). Regional variation of aftershock activity. Ogata, Y. (2005a). Detection of anomalous seismicity as a Bull. Earthquake Res. Inst. Univ. Tokyo 45 125–173. stress change sensor. J. Geophys. Res. 110 B05S06. Nanjo, K., Tsuruoka, H., Hirata, N. and Jordan, T. Ogata, Y. (2005b). Synchronous seismicity changes in and (2011). Overview of the first earthquake forecast testing around the northern Japan preceding the 2003 Tokachi-oki experiment in Japan. Earth Planets Space 63 159–169. earthquake of M8.0. J. Geophys. Res. 110 B08305. Nanjo, K., Tsuruoka, H., Yokoi, S., Ogata, Y., Fal- Ogata, Y. (2005c). Simultaneous estimation of b-values and cone, G., Hirata, N., Ishigaki, Y., Jordan, T., Kasa- detection rates of earthquakes for the application to after- hara, K., Obara, K., Schorlemmer, D., Shiomi, K. and shock probability forecasting (in Japanese). Rep. Coord. Zhuang, J. (2012). Predictability study on the aftershock Comm. Earthq. Predict. 73 666–669. sequence following the 2011 Tohoku-Oki, Japan, Earth- Ogata, Y. (2005d). Seismicity changes in and around Kyushu quake: First results. Geophysical Journal International 191 District before the 2005 earthquake of M7.0 in the western 653–658. offshore of Fukuoka Prefecture (in Japanese). Rep. Coord. Nomura, S., Ogata, Y., Komaki, F. and Toda, S. (2011). Comm. Earthq. Predict. 74 523–528. Bayesian forecasting of the recurrent earthquakes and its Ogata, Y. (2006a). Monitoring of anomaly in the aftershock predictive performance for a small sample size. J. Geophys. sequence of the 2005 earthquake of M7.0 off coast of the Res. 116 B04315. western Fukuoka, Japan, by the ETAS model. Geophys. Ogata, Y. (1983). Estimation of parameters in the modified Res. Let. 33 L01303. Omori formula for aftershock frequencies by the maximum Ogata, Y. (2006b). Seismicity anomaly scenario prior to the likelihood procedure. J. Phys. Earth. 31 115–124. major recurrent earthquakes off the east coast of Miyagi Ogata, Y. (1986). Statistical models for earthquake occur- Prefecture, northern Japan. Tectonophysics 424 291–306. rences and residual analysis for point processes. Mathemat- Ogata, Y. (2007). Seismicity and geodetic anomalies in a ical Seismology (I) (M. Saito, ed.). Institute of Statistical wide preceding the Niigata-Ken-Chuetsu earthquake of 23 Mathematics, Tokyo. October 2004, central Japan. J. Geophys. Res. 112 B10301. Ogata, Y. (1988). Statistical models for earthquake occur- Ogata, Y. (2009). 40 Years Activities of the Coordinat- rences and residual analysis for point processes. J. Amer. ing Committee for Earthquake Prediction; (4) The Insti- Statist. Assoc. 83 9–27. tute of Statistical mathematics, Research Organization of Ogata, Y. (1989). Statistical model for standard seismicity Information and Systems, Inter-University Research Inst and detection of anomalies by residual analysis. Tectono- Corporation, Edited by the Coordinating Committee for physics 169 159–174. Earthquake Prediction, Geographical Institute of Japan, EARTHQUAKE PREDICTION 21

pp. 374. Available at http://www.ism.ac.jp/~ogata/ Ogata, Y., Utsu, T. and Katsura, K. (1995). Statistical yotiren/Yotiren2008ISM.pdf. features of foreshocks in comparison with other earthquake Ogata, Y. (2010a). Anomalies of seismic activity and tran- clusters. Geophys. J. Int. 121 233–254. sient crustal deformations preceding the 2005 M7.0 earth- Ogata, Y., Utsu, T. and Katsura, K. (1996). Statistical quake west of Fukuoka. Pure and Applied Geophysics 167 discrimination of foreshocks from other earthquake clus- 1115–1127. ters. Geophys. J. Int. 127 17–30. Ogata, Y. (2010b). Space–time heterogeneity in aftershock Ogata, Y. and Zhuang, J. (2001). Statistical examination activity. Geophys. J. Int. 181 1575–1592. of anomalies for the precursor to earthquakes, and the Ogata, Y. (2011a). Operational probability foreshock fore- multi-element prediction formula: Hazard rate changes of casts up until Tohoku-Oki earthquake (in Japanese). Rep. strong earthquakes (M ≧ 4) around Beijing area based on Coord. Comm. Earthq. Predict. 86 123–125. the ultra-low frequency ground electric observation (1982– Ogata, Y. (2011b). Significant improvements of the space– 1997). Rep. Coord. Comm. Earthq. Predict. 66 562–570. time ETAS model for forecasting of accurate baseline seis- Ogata, Y. and Zhuang, J. (2006). Space–time ETAS model micity. Earth, Planets and Space 63 217–229. and an improved extension. Tectonophysics 413 13–23. Ogata, Y. (2011c). Pre-seismic anomalies in seismicity and Ogata, Y., Katsura, K., Falcone, G., Nanjo, K. and crustal deformation: Case studies of the 2007 Noto Hanto Zhuang, J. (2013). Comprehensive and topical evaluations earthquake of M6.9 and the 2007 Chuetsu-oki earthquake of earthquake forecasts in terms of number, time, space, of M6.8 after the 2004 Chuetsu earthquake of M6.8. Geo- and magnitude. Bull. Seismol. Soc. Amer. 103 3. phys. J. Int. 186 331–348. Okada, Y. (1992). Internal deformation due to shear and Ogata, Y. and Akaike, H. (1982). On linear intensity mod- tensile faults in a half-space. Bull. Seismol. Soc. Amer. 82 els for mixed doubly stochastic Poisson and self-exciting 1018–1040. point processes. J. R. Stat. Soc. Ser. B Stat. Methodol. 44 Omi, T., Ogata, Y., Hirata, Y. and Aihara, K. (2013). 102–107. MR0655379 Forecasting large aftershocks within one day after the main Ogata, Y., Akaike, H. and Katsura, K. (1982). The shock. Scientific Reports 3 Article No. 2218. application of linear intensity models to the investiga- Omori, F. (1894). On the aftershocks of earthquake. J. Coll. tion of causal relations between a point process and an- Sci. Imp. Univ. Tokyo. 7 111–200. other stochastic process. Ann. Inst. Statist. Math. 34 373– Reasenberg, P. and Jones, L. (1989). Earthquake hazard 387. after a mainshock in California. Science 243 1173–1176. Ogata, Y., Imoto, M. and Katsura, K. (1991). 3-D spatial Reasenberg, P. and Jones, L. (1994). Earthquake after- variation of b-values of magnitude-frequency distribution shocks: Update. Science 265 1251–1252. beneath the Kanto District, Japan. Geophys. J. Int. 104 Reid, H. F. (1910). The Mechanics of the Earthquake. The 135–146. California Earthquake of April 18, 1906, Report of the Ogata, Y., Jones, L. and Toda, S. (2003). When and where State Investigation Commission 2. Carnegie Institution, the aftershock activity was depressed: Contrasting decay Washington. patterns of the proximate large earthquakes in southern Rhoades, D. A., Van Dissen, R. J. and Dowrick, D. J. California. J. Geophys. Res. 108 2318. (1994). On the handling of uncertainties in estimating the Ogata, Y. and Katsura, K. (1986). Point-process mod- hazard of rupture on a fault segment. J. Geophys. Res. 99 els with linearly parametrized intensity for application 13701–13712. to earthquake data. J. Appl. Probab. 23A 291–310. Rundle, J., Tiampo, K., Klein, W. and Martins, J. MR0803179 (2002). Self-organization in leaky threshold systems: The Ogata, Y. and Katsura, K. (1993). Analysis of temporal influence of near-mean field dynamics and its implications and spatial heterogeneity of magnitude frequency distribu- for earthquakes, neurobiology, and forecasting. Proc. Natl. tion inferred from earthquake catalogues. Geophys. J. Int. Acad. Sci. USA 99 2514–2521. 113 727–738. Schorlemmer, D., Gestenberger, M. C., Wiemer, S., Ogata, Y., Katsura, K. and Tanemura, M. (2003). Mod- Jackson, D. and Rhoades, D. (2007). Earthquake likeli- elling heterogeneous space–time occurrences of earthquakes hood model testing. Seism. Res. Lett. 78 17–29. and its residual analysis. J. Roy. Statist. Soc. Ser. C 52 Schorlemmer, D., Zechar, J., Werner, M., Field, E., 499–509. MR2012973 Jackson, D., Jordan, T. and Relm Working Group Ogata, Y. and Katsura, K. (2006). Immediate and updated (2010). First results of the regional earthquake likelihood forecasting of aftershock hazard. Geophys. Res. Lett. 33 models experiment. Pure Appl. Geophys. 167 859–876. L10305. Shebalin, P., Kellis-Borok, V., Gabrielov, A., Zali- Ogata, Y. and Katsura, K. (2012). Prospective foreshock apin, I. and Turcotte, D. (2006). Short-term earth- forecast experiment during the last 17 year. Geophys. J. quake prediction by reverse analysis of lithosphere dynam- Int. 191 1237–1244. ics. Tectonophysics 413 63–75. Ogata, Y. and Toda, S. (2010). Bridging great earthquake Shimazaki, K. and Nakata, T. (1980). Time predictable doublets through silent slip: On- and off-fault aftershocks recurrence model for large earthquakes. Geophys. Res. Lett. of the 2006 Kuril Island subduction earthquake toggled by 7 279–282. a slow slip on the outer-rise normal fault the 2007 great Sobolev, G. (2001). The examples of earthquake preparation earthquake. J. Geophys. Res. 115 B06318. in Kamchatka and Japan. Tectonophysics 338 269–279. 22 Y. OGATA

Steacy, S., Gomberg, J. and Cocco, M. (2005). Introduc- www.corssa.org/articles/themev/van stiphout et al/ tion to special section: Stress transfer, earthquake trigger- vanstiphoutetal2012.pdf. ing, and time-dependent . J. Geophys. Res. Vere-Jones, D. (1999). Probabilities and in formation gain 110 B05S01. for earthquake forecasting. Computational Seismology 30 Suyehiro, S. (1966). Difference between aftershocks and fore- 248–263. shocks in the relationship of magnitude to frequency of oc- Wiemer, S. and Wyss, M. (1997). Mapping the frequency- currence for the great Chilean earthquake of 1960. Bull. magnitude distribution in asperities: An improved tech- Seismol. Soc. Amer. 56 185–200. nique to calculate recurrence times? J. Geophys. Res. 102 Tiampo, K., Rundle, J., Mcginnis, S., Gross, S. and 15. Klein, W. (2002). Mean-field threshold systems and phase Working Group on California Earthquake Probabilities dynamics: An application to earthquake fault systems. Eu- (2012). The uniform California earthquake rupture fore- rophys. Lett. 60 481–487. cast, Version 3 (UCERF3) project plan. Available at Terakawa, T., Hashimoto, C. and Matsu’ura, M (2013). http://www.wgcep.org/. Changes in seismic activity following the 2011 Tohoku-oki Zechar, J., Gerstenberger, M. and Rhoades, D. earthquake: Effects of pore fluid pressure. Earth Planet. (2010). Likelihood-based tests for evaluating space–rate- Sci. Lett. 365 17–24. magnitude earthquake forecasts. Bull. Seismol. Soc. Amer. Toda, S., Stein, R. and Sagiya, T. (2002). Evidence from 100 1184–1195. the A.D. 2000 Izu Islands swarm that seismicity is governed Zechar, J. and Zhuang, J. (2010). Risk and return: Evalu- by stressing rate. Nature 419 58–61. ating reverse tracing of precursors earthquake predictions. Utsu, T. (1961). A statistical study on the occurrence of Geophys. J. Int. 182 1319–1326. aftershocks. Geophys. Mag. 30 521–605. Zhuang, J. C. and Jiang, C. S. (2012). Evaluation of the Utsu, T. (1965). A method for determining the value of b in a prediction performance of the Annual Cosultation Meet- formula log n = a − bM showing the magnitude-frequency’ ing on Earthquake Tendency by using the gambling score. relation for earthquakes. Geophys. Bull. Hokkaido Univ. 13 Chinese Journal of Geophysics (in Chinese with English 99–103. Abstract) 55 1695–1709. Utsu, T. (1969). Aftershocks and earthquake statistics (1). Zhuang, J., Ogata, Y. and Vere-Jones, D. (2002). J. Fac. Sci. Hokkaido Univ. 2 129–195. Stochastic declustering of space–time earthquake occur- Utsu, T. (1970). Aftershocks and earthquake statistics (II)— rences. J. Amer. Statist. Assoc. 97 369–380. MR1941459 Further investigation of aftershocks and other earthquake Zhuang, J. and Ogata, Y. (2011). Evaluation of warning sequences based on a new classification of earthquake se- forecasts by a gambling score. Rep. Coord. Comm. Earthq. quences. J. Fac. Sci. Hokkaido Univ., Ser. 7 3 197–266. Predict. 85 451–452. Utsu, T. (1977). Probability in earthquake prediction. Zisin Zhuang, J., Vere-Jones, D., Guan, H., Ogata, Y. and (J. Seismol. Soc. Japan, 2nd Ser.) 30 179–185. Ma, L. (2005a). Preliminary analysis of observations on Utsu, T. (1979). Calculation of the probability of success of the ultra-low frequency electric field in a region around an earthquake prediction (In the case of Izu-Oshima-Kinkai Beijing. Pure and Applied Geophysics 162 1367–1396. earthquake of 1978). Rep. Coord. Comm. Earthq. Predict. Zhuang, J., Chang, C., Ogata, Y. and Chen, Y. (2005b). 21 164–166. A study on the background and clustering seismicity in the Utsu, T. (1982). The probability in earthquake prediction Taiwan region by using point process models. J. Geophys. (The second paper). Bull. Earthq. Res. Inst., Univ. Tokyo Res. 110 B5. 57 499–524. Zhuang, J., Ogata, Y., Vere-Jones, D., Ma, L. and Utsu, T., Ogata, Y. and Matsu’ura, R. (1995). The cen- Guan, H. (2013). Statistical modeling of earthquake occur- tenary of the Omori formula for a decay law of aftershock rences based on external geophysical observations: With an activity. J. Phys. Earth 43 1–33. illustrative application to the ultra-low frequency ground Van Stiphout, T., Zhuang, J. and Marsan, D. (2012). electric signals observed in the Beijing region. In Imaging, Declustering; The community online resource for statisti- Modeling and Assimilation in Seismology, Vol. II (Y. Li, cal seismicity analysis (CORSSA). Available at http:// ed.) De Gruyter, Berlin.