arXiv:1808.05010v3 [math.PR] 22 May 2020 upre npr yteRB rn 19-01-00356. Grant RFBR funded the Engineering by part Data-Centric in on supported Programme the by sponsored theory. ergodic infinite inducing, chain, Markov cha embedded Markov entrance distribution, stationary measure, invariant 28D05. 60F05, 60G40, Mi upre yteESCgatE/031/ n elwhpa Fellowship a and EP/P003818/1 grant EPSRC the by supported is AM phrases. and words Key 2010 ..Daiyo nrneadei akvcan 11 13 8 2 inducing of method the via chains Markov chains exit Markov and exit Entrance and entrance of 3. chains Duality Markov exit and 2.3. Entrance duality their and notation 2.2. chains Basic Markov exit and 2.1. Entrance paper the of 2. Structure questions results related main and 1.3. the Motivation of description and 1.2. Introduction 1.1. Introduction 1. fteetotpso akvcan ntecs hnteiiilchain initial the when case the in chains Markov of types two these of akvchain Markov Abstract. xttmsfrom times exit complement nain esr.Udrcranrcrec-yeassumptions recurrence-type certain Under measure. invariant loyed,i aua a,tevrin fterslsaoe(pro above results the of versions the way, natural classical the a in yields, also hr esuyteMro hi foesot bv h eolev zero intere the above mostly overshoots are of We chain between Markov oscillates the measure. study Lebesgue we the where under started chains xlctfrua o nain esrso hs his hnw st we Then chains. these that of assuming ergodicity measures invariant and for formulas explicit rv eta ii hoe o h ubro ee rsig for crossings level increments. of of number variance the finite for and theorem mean limit central a prove ahmtc ujc Classification. Subject Mathematics NUIG N EE-RSIG FRNO WALKS RANDOM OF LEVEL-CROSSINGS AND INDUCING, u prahi ae ntetcnqeof technique the on based is approach Our egv plctost admwlsin walks random to applications give We TTOAYETAC AKVCHAINS, MARKOV ENTRANCE STATIONARY nue akvchains Markov induced A o akvchain Markov a For LKADRMIJATOVI ALEKSANDAR bandb sampling by obtained c iial,cnie the consider Similarly, . A c −∞ to ee rsig admwl,oesot nesot oa time local undershoot, overshoot, walk, random crossing, Level A n + and hspprpoie rmwr o nlsn nain measures invariant analysing for framework a provides paper This . ∞ Y eso htti hi segdc n s hsrsl to result this use and ergodic, is chain this that show We . stplgclyrcret reuil,adwa Feller. weak and irreducible, recurrent, topologically is Y . Y ihvle naPls pc,cnie the consider space, Polish a in values with rmr:6J0 05,3A0 eodr:6J5 60G10, 60J55, secondary: 37A50; 60G50, 60J10, Primary: Contents ttemmnswe tetr xdset fixed a enters it when moments the at xtMro chain Markov exit N LDSA VYSOTSKY VLADISLAV AND C ´ 1 R d inducing hc ergr s“ttoay Markov “stationary” as regard we which , n xtMro hi,idcdMro chain, Markov induced chain, Markov exit in, rmifiieegdcter.This theory. ergodic infinite from bandb sampling by obtained , yLodsRgse onain Vis VV Foundation. Register Lloyd’s by ( Y a be can lo admwl that walk random a of el admwlswt zero with walks random ie nteppr for paper) the in vided tdi ieso one, dimension in sted Y d hi uniqueness their udy h lnTrn Institute, Turing Alan The t a known a has transient A ,w give we ), Y entrance frno walk, random of rmits from σ tthe at -finite 9 8 7 6 2 2 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
3.1. Setup and notation 13 3.2. Invariant measures of general induced, entrance, and exit Markov chains 17 3.3. Existence and uniqueness for weak Feller chains 24 4. Applications to random walks in Rd 29 5. Futher results on level-crossings for one-dimensional random walks 32 5.1. The limit theorem for the number of level-crossings 32 5.2. Expected occupation times between level-crossings 36 Acknowledgements 37 Appendix A. Induced transformations in infinite ergodic theory 37 References 42
1. Introduction
1.1. Introduction and description of the main results. Let S = (Sn)n 0 with Sn = S + X + ... + X be a non-degenerate random walk in Rd, where d 1, with≥ independent 0 1 n ≥ identically distributed (i.i.d.) increments X1,X2,... and the starting point S0 that is a random vector independent of the increments. Define the state space of the walk S, denoted by , as the minimal topologically closed subgroup of (Rd, +) containing the topological Z support of the distribution of X1. Assume throughout that S0 . Let λ be the normalized Haar measure on ( , +); in dimension one this means that λ([0∈, Z x)) = x for positive x . Z ∈ Z Consider the case d = 1 and assume that either EX1 =0 or EX1 does not exist. This is equivalent to assuming that the random walk S oscillates, that is lim sup Sn = lim inf Sn = + a.s. as n (see Section 4). In particular, S can be transient. Define− the crossing times∞ of the zero→ ∞level by := 0 and T0 n := inf k> n 1 : Sk 1 < 0,Sk 0 or Sk 1 0,Sk < 0 , n N, T { T − − ≥ − ≥ } ∈ and let
n := S n , n := S n 1, n N (1) O T U T − ∈ be the corresponding overshoots and undershoots; put = := S . It is easy to see, using O0 U0 0 that the n’s are stopping times, that the sequence := ( n)n 0 is a Markov chain. The T O O ≥ sequence := ( n)n 0 also is a Markov chain but this is far less intuitive (cf. Lemma 2.1). ThisU paperU was≥ motivated by our interest in stationarity and stability properties of the Markov chain of overshoots . In our companion paper [33] we essentially showed that this chain has an invariant measureO
c1 ½ π(dx) := ½[0, )(x)P(X1 > x)+ ( ,0)(x)P(X1 x) λ(dx), x , (2) 2 ∞ −∞ ≤ ∈ Z
where either c1 := 2/E X1 if E X1 < (and d = 1) or c1 := 1 otherwise. Note that π is finite if and only if E| X| < | , in| which∞ case by the assumption of oscillation we have | 1| ∞ EX1 =0 and so π is a probability. We found this measure in [33] computing it in a special case using an ergodic averaging argument and assuming that an invariant measure exists. Then we proved invariance of π in general case using quite interesting but yet complicated and apparently ad-hoc arguments STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 3
based on time-reversibility. The same approach of computing or even guessing and then proving invariance was used in a number of other works (e.g. [8, 28, 36], commented below) concerning stability of certain Markov chains. In all these cases this neither gives insights on the form of invariant measures nor shows how to find them. In particular, this does not explain why the Haar measure, which was introduced in [33] only to simplify the notation, appears in formula (2). Moreover, this approach does not allow one to prove uniqueness. A universal approach for proving uniqueness of the invariant measure is in establishing some type of distributional convergence of n towards π as n (when EX1 = 0) starting S from every x . In this paper we set thisO delicate problem→∞ aside. It is considered, under ∈ Z additional smoothness assumptions on the distribution of X1, in the companion paper [33] using the methods that are entirely different from the ones used here. However, we were not able to establish convergence in full generality. The main difficulty is that no standard criteria of convergence apply to : in particular, this chain in general is neither weak Feller (Remark 3.2) nor ψ-irreducible.O By the same reasoning, a priori it is unclear if the chain has a stationary distribution regardless of moment assumptions on S. O This paper presents a new approach which allows us to find invariant measures and obtain their uniqueness and ergodicity in much more general context than level-crossings of one-dimensional random walks. In order to proceed to this general setting, note that the chain of overshoots has periodic structure since its values at consecutive steps have different signs. Therefore, it suffices to consider the non-negative Markov chain O = (On)n 0 of overshoots at up-crossings defined ≥ by On := 2n ½(S0<0) for n 1 and starting at O0 := S0, and the analogous negative chain O − ≥ O↓ of overshoots at down-crossings. Similarly, define the negative-valued chain U =(Un)n 0 ≥ of undershoots at up-crossings given by Un := 2n ½(S0<0) for n 1 and U0 := S0. This chain played an important role in the proof of equalityU − (2) in [33]. ≥ Observe that the Markov chain of overshoots O at up-crossings above the zero level is obtained by sampling the one-dimensional random walk S at the moments it enters the set [0, ) from ( , 0). Similarly, for any Markov chain Y we can consider the entrance ∞ −∞ A Markov chain, denoted by Y → , constructed by sampling Y at the moments of entry into an arbitrary fixed set A from its complement Ac. We also consider the exit Markov chain, Ac c denoted by Y →, obtained by sampling Y at the exit times from A to A; the Markov property of this sequence is not obvious and we refer to Lemma 2.1 for its proof. In this [0, ) ( ,0) notation, we have O = S→ ∞ and U = S −∞ →. Note that alternating the values of the entrance and exit chains gives the Markov chain of overshoots over the boundary ∂A analogous to but we will not give it any consideration. We will showO (Theorems 2.1 and 3.1) that for any Borel set A in a Polish space and any time-homogeneous Markov chain Y that takes values in and has an invariant σX- A Ac X finite measure µ, the entrance chain Y → and the exit chain Y → have respective invariant measures
entr c exit c µ (dx)= P (Yˆ A )µ(dx) on A, µ c (dx)= P (Y A)µ(dx) on A , (3) A x 1 ∈ A x 1 ∈
where Yˆ is a Markov chain dual to Y with respect to µ and Px(Y0 = Yˆ0 = x) = 1, assuming c exit that Y visits both sets A and A infinitely often Px-a.s. for µAc -a.e. x, and the same holds 4 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY true for Yˆ (in particular, A and Ac are recurrent sets for Y and Yˆ ). We will also show that A Ac the chains Y → and Y → are recurrent and ergodic if so is Y when “started” under µ. The assumptions above are satisfied (Remark 3.1.b) if Y is recurrent started under µ and it can get from Ac to A. We stress that this not a requirement, that is Y can be transient. Ac c It turns out that the exit chain Y → of Y from A to A is dual to the entrance chain ˆ Ac ˆ c exit Y → of Y into A from A with respect to the measure µAc (Theorem 2.1). This immediately exit Ac entr implies that µAc is invariant for the exit chain Y →. This in turn yields invariance of µA A for the entrance chain Y → since formulas (3) are antisymmetric in the sense that their right-hand sides interchange if we swap the chain Y and the set A with the dual chain Yˆ and the complement set Ac. Therefore, we can concentrate on entrance Markov chains. Our results apply for two very wide classes of Markov chains with a known invariant measure, namely reversible chains and random walks on topological groups with the Haar measure. In this paper we will only consider applications with concrete examples for random walks on (Rd, +) (see Section 4). Let us briefly describe possible applications for the first class of Markov chains. Recall that a chain Y is reversible if it is self-dual with respect to entr a σ-finite measure µ, which has to be invariant. For such chains, formula (3) for µA is particularly simple since we can take Yˆ1 = Y1. The assumptions can be verified as follows. A simple necessary and sufficient condition for recurrence of countable reversible Markov chains is due to Lyons [30]. For transient chains other than random walks, we are aware of only one particular example with explicit characterization of all recurrent sets: by Gantert et al. [19, Theorem 1.7], a planar simple random walk conditioned on never hitting the origin visits any infinite subset of Z2 infinitely often a.s. Necessary and sufficient conditions for recurrence of a set for a countable state space Markov chain can be found in Bucy [9], Murdoch [34], and Menshikov et al. [31, Theorem 2.5.8]. Let us stress that the above simple probabilistic explanation of invariance of the measures in (3) neither clarifies how to find them and why they have the form given, nor allows us to prove their uniqueness. Therefore, we use an entirely different method. Our approach is built on inducing, a basic tool of ergodic theory, introduced by S. Kaku- tani in 1943. We need to use infinite ergodic theory since in many cases of interest the invariant measure µ of the chain Y is infinite. In particular, so is the Haar measure λ on the subgroup of Rd. Since λ is invariant for the random walk S and the dual of S with respect to λ isZ S, the first formula in (3) reads as − λentr(dx)= P(X x Ac)λ(dx). A 1 ∈ − This example provides all applications of inducing presented in this paper (see Section 4). Our main interest is in level-crossings of random walks in dimension one, corresponding to A = [0, ). We also briefly consider other choices of A, two of which are discussed below. First,± ∞ let A be such that both sets A and Ac have non-empty interior, and assume ⊂ Z entr that the walk S is topologically recurrent on , so d =1 or d = 2. Then the measure λA AZ is always invariant for the entrance chain S→ into A and is finite if A is bounded. Second, A is either the non-negative or the negative orthant in Rd. Assume that both events Sn 0 and Sn < 0 occur infinitely often a.s.; we always mean that inequalities between{ points≥ } in Rd {hold coordinate-wise.} Notice that in dimension one this assumption is STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 5
equivalent to oscillation of S. Let us again stress that S can be transient. Then the measure
π+(dx) := c1 ½[0, )d (x)(1 P(X1 x))λ(dx), x , (4) ∞ − ≤ ∈ Z entr [0, )d which satisfies π+ = c1λ[0, )d , is invariant for the entrance chain S→ ∞ into the non- ∞ negative orthant [0, )d. In particular, for d = 1 this means that π is invariant for the chain ∞ + O. Combining this with the analogous result for the chain O↓ of overshoots at down-crossings, entr 1 1 which is invariant under π := c1λ d , we obtain that the measure π = π+ + π is − ( ,0) 2 2 − invariant for the chain of overshoots −∞. Let us mention that distributions of the same form O as π+ in d = 1 appear on many occasions, as discussed in detail in [33, Sections 2.1 and 2.2]. Our further result (Theorem 3.2) implies that under the topological assumptions of recurrence, irreducibility, and weak Feller property of the chain Y and, essentially, non- emptiness of the interiors of the sets A and Ac, the questions of existence of an invariant measure, its ergodicity and uniqueness (up to a constant factor) in the class of locally finite A Ac Borel measures have the same answer for each of the chains Y , Y → , Y →. In particular, since the Haar measure is known to be ergodic and unique locally finite invariant measure of the topologically recurrent random walk S on , this yields uniqueness and ergodicity of the invariant measure π for the chain of overshootsZ . More generally (Theorem 4.2), the same entr A O holds for the measure λA and the chain Y → if we assume for simplicity that λ(∂A) = 0. Theorem 3.2 also is a useful tool for proving existence (Proposition 3.2) of a locally finite invariant measure of the weak Feller chain Y when its path empirical distributions are not tight and so the classical Bogolubov–Krylov theorem does not apply. This is based on Kac-type formulas of Proposition 3.1, which in a sense are inverse to those of Theorem 3.1 obtained by inducing. NB: after this paper was finished, we found the works by Lin [29] and Skorokhod [42] on invariant distributions of weak Feller topologically recurrent Markov chains, with a stronger existence result (see Proposition 3.3 and the preceding discussion). Our approach also provides a natural venue to study induced Markov chains (sometimes referred to as embedded chains), obtained by sampling a Markov chain when it belongs to an arbitrary fixed set. These chains are closely related to the entrance and exit chains, and some of our results can actually be obtained by sampling the bivariate chain Z on , formed by the pair of two consecutive values of Y , when Z belongs to the set Ac A. InducedX × X chains are discussed, for example, in Revuz [40, Chapters 2.4 and 3.2], where× they appear in the context of probabilistic potential theory. For completeness of exposition, we also give versions of all our statements on existence and uniqueness of invariant measures for general induced chains. Some of these results, stated in Parts 1 of respective assertions, essentially are not new but it is hard to extract them from the literature even for recurrent chains. Based on our experience, the use of inducing in probability theory is mostly reverse, being related to Kac-type formulas for stationary processes and even more specifically, for Markov chains but only under the omnipresent assumption of recurrence. We stress that our main results Theorems 2.1, 3.1 and Proposition 3.1 also apply to transient Markov chains. We are not aware of any applications of (direct) inducing in problems specifically related to random walks. In the context of level-crossings by one-dimensional random walks, the classical and universal tool is the Wiener–Hopf factorization. Among the vast literature on the topic, the works of Baxter [3], Borovkov [8], and Kemperman [26] are the most relevant 6 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY to the questions considered here. These papers rely on the Wiener–Hopf factorization (which does not yield much for our problem!), in contrast to our entirely different approach. As far as we know, fluctuation theory for random walks in Rd is limited to a single paper by Greenwood and Shaked [20], and our formulas for invariant measures such as (4) are the only explicit high-dimensional results available. The idea to study properties of random walks regarding them as “stationary” processes starting from the Haar measure is novel to us, and we have never seen it in the vast literature on random walks. We are not aware of any works concerning entrance and exit Markov chains of any type. We conclude the paper with a number of one-dimensional results on level-crossings of random walks presented in Section 5. The main result there is the limit theorem for the number of level-crossings motivated below in Section 1.2. Crucially, this theorem requires no assumption other than that the increments are zero-mean and have finite variance. We also present several formulas for expected occupation times between level-crossings. 1.2. Motivation and related questions. This section concerns only the one-dimensional case. There are several good reasons to study overshoots of random walks besides purely theoretical interest. First, there is connection to the local time. Namely, the Markov chain features in Perkins’s [38] definition of the local time of a random walk. There is no conventionalO definition of this notion, see Cs¨org˝oand R´ev´esz [13] for other versions. Let L := max k 0 : n (5) n { ≥ Tk ≤ } be the number of zero-level crossings of the walk by time n, and let ℓ0 be the local time at level 0 at time 1 of a standard Brownian motion. Perkins [38, Theorem 1.3] proved that for 2 2 a zero mean random walk S with finite variance σ := EX1 and starting at S0 = 0, one has Ln 1 d k σℓ0. (6) √n |O | n−→ k →∞ X=1 To the best of our knowledge, all the other limit theorems for the local time (under either definition) of a random walk with finite variance require additional smoothness assumptions on the distribution of increments. Under the above assumptions on the random walk, by ergodicity of the Markov chain 1 n O (Theorem 4.2), we have n k=1 k x π(dx) a.s. for π-a.e. starting point S0 = x ; we will show that this convergence|O |→ actuallyZ | holds| for every x. Hence (6) immediately gives∈ Z P R a limit theorem for the number of level crossings Ln divided by √n (Theorem 5.1). Under 2 the optimal moment assumption EX1 < , limit theorems of such type were first obtained in the early 1980s by A.N. Borodin, who∞ studied more general questions of convergence of additive functionals of consecutive steps of random walks; see Borodin and Ibragimov [7, Chapter V] and references therein. However, Borodin’s method assumes that the distribu- tion of increments of the walk is either aperiodic integer-valued or has a square-integrable characteristic function, and hence it is (Kawata [25, Theorem 11.6.1]) absolutely continuous. We stress that our result, Theorem 5.1, does not require any smoothness assumptions. Second, the Markov chain O appeared in the study of the probabilities that the integrated random walk (S1 + ... + Sk)1 k n stays positive; see Vysotsky [46, 47]. The main idea of the approach of [46, 47] is≤ in≤ a) splitting the trajectory of the walk into consecutive STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 7
“cycles” between the up-crossing times; and b) using that for certain particular distributions of increments, e.g. in the case when the distribution P(X X > 0) is exponential, 1 ∈ ·| 1 the overshoots (On)n 1 are stationary (actually, i.i.d.) regardless of the starting point S0. The current paper was≥ originally motivated by the question whether this approach can be extended to general distributions of increments. Third, there is a close connection to so-called switching random walks. Define the switching ladder times
′ ′ inf k > n′ 1 : Sk S n−1 , if S n−1 0, 0′ := 0, n′ := { T − ≤ T } T ≥ n N. T T inf k > : S S ′ , if S ′ < 0, ∈ ( n′ 1 k n−1 n−1 { T − ≥ T } T
and the switching ladder heights Zn := S n′ , n 0. The random sequence Z = (Zn)n 0 belongs to a special type of Markov chainsT that≥ we call random walks with switch at zero,≥ whose distributions of increments depend only on the sign of the current position of the chain; the other authors call them oscillating random walks but this can be easily confused with oscillation in the sense used in Section 1.1. More precisely, the transition probabilities P (x,dy) of such a chain Y are of the form P (x,dy)= P (dy x) for x = 0 and P (0,dy)= sign x − 6 αP+(dy)+ (1 α)P (dy), where P+ and P are two probability distributions on R and − − − 1 α [0, 1]. In the symmetric case when P+(dy) = P ( dy) and α = , the sequence ∈ − − 2 ( Yn )n 0 is called a reflected random walk. | |Random≥ walks with switch were introduced by Kemperman [26], and then considered in a few works including that by Borovkov [8]. Reflected random walks received much more attention; see Peign´eand Woess [37] for the most recent and comprehensive list of references and generalizations to processes of iterated i.i.d. random continuous mappings. Their relevance to the present paper is that the overshoots above zero level of the switching ladder heights chain Z coincide with those of the random walk S. Recently we learned about the unpublished work by Peign´eand Woess [36] who found the invariant measure for a reflected random walk ( Yn )n 0 sampled at the moments of reflection at zero. In our terminology, this is the random| | sequence≥ of absolute values of non-strict overshoots above the zero level by Y . One can check using the Wiener–Hopf factorization that in the special case when Y is the switching ladder heights chain Z generated by a random walk S on = R with symmetrically distributed increments, the invariant measure of [36, Theorem 4.7]Z equals 2 π( ). The other notable fact is that, if EX1 = 0 and EX1 < , then π is an invariant distribution|·| for a random walk with switch at zero defined by P∞ = P(X X < 0), + 1 ∈ ·| 1 P = P(X1 X1 > 0), and α = 1; this can be shown using a stationary distribution for− Y found∈ in ·| [8]. We will use our inducing approach to explore these connections and show further relations to classical stationary distributions of renewal theory in the separate paper [48].
1.3. Structure of the paper. In Section 2 we carefully define the entrance and exit chains and use probabilistic arguments to check their Markov property and the crucial property of duality. In Section 3 we study stationarity of induced, entrance and exit chains using the idea of inducing from ergodic theory: in Section 3.1 we provide a self-contained setup needed to apply inducing in the context of Markov chains; in Section 3.2 we show its use 8 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY to find invariant measures for the three types of chains in question, obtained from a general Markov chain on a topological space; and in Section 3.3 we study existence and uniqueness of these invariant measures for a more specific class of weak Feller chains on Polish spaces. In Section 4 we give applications to the entrance chains sampled from random walks in arbitrary dimension, including the chains of overshoots above a level in dimension one. In Section 5 present further one-dimensional results on level-crossings of random walks, including the limit theorem for the number of level-crossings. We conclude the paper with the Appendix, where we give the basics on the use of inducing in infinite ergodic theory. We present them in a nearly self-contained way and with proofs to ensure assumptions more general than those we found in the literature.
2. Entrance and exit Markov chains and their duality 2.1. Basic notation. Throughout this paper will be a topological space. We always equip subsets of with the subspace (induced)X topology. All the measures on (and its subsets) consideredX here will be Borel measures, that is defined on the Borel σ-algebraX ( ). B X Let Y = (Yn)n 0 be a time-homogeneous Markov chain taking values in . By saying this, we assume that≥ Y is defined on some generic probability space (Ω, , P)X and Y has a transition kernel P on under P. We also assume that (Ω, ) is equippedF with a family of X F probability measures Px x such that: Y is a Markov chain with the transition kernel P { } ∈X under Px and Px(Y0 = x) = 1 for every x , and the function x Px(Y B) is measurable N0 ∈ X 7→ ∈ for any set B ( )⊗ , where N0 := N 0 . By the Ionescu Tulcea extension theorem (Kallenber [24,∈ Theorem B X 6.17]), such family∪{ of} measures always exists for any transition kernel on . For any Borel σ-finite measure µ on , denote Pµ := Px( )µ(dx). Then Y0 X X X · has “distribution” µ under Pµ, in which case we say that Y starts under µ. Although µ is not necessarily a probability, we prefer to (ab)use probabilistic notationR and terminology as above, instead of using respective notions of general measure theory. Denote by Ex and Eµ respective expectations (Lebesgue integrals) over Px and Pµ. We say that a transition kernel Pˆ on is dual to P with respect to a σ-finite measure µ on if the following equality of measuresX on ( ) ( ) is satisfied: X B X ⊗ B X µ(dx)P (x,dy)= µ(dy)Pˆ(y,dx), x,y . (7) ∈ X This equality is called the detailed balance condition. We will also use this definition of duality for Markov chains on subsets of of full measure µ. If is separable, there cannot be two distinct mod µ kernels dual to PX(see Revuz [40, LemmaX 4.7 in Chapter 2]); if is a Polish space, then a dual kernel always exists and is unique mod µ (see Section 3.1). TwoX Markov chains Y and Yˆ on are dual if so are their transition operators. In other words, X assuming for convenience that Yˆ0 = Y0, we have the time-reversal equality P ((Y ,Y ) B)= P ((Yˆ , Yˆ ) B), B ( ) ( ). µ 0 1 ∈ µ 1 0 ∈ ∈ B X ⊗ B X Note that this equality implies that the measure µ is invariant both chains Y and Yˆ , that is P (Y )= P (Yˆ )= µ. More generally, for any k 1 we have µ 1 ∈· µ 1 ∈· ≥ (k+1) P ((Y ,...,Y ) B)= P ((Yˆ ,..., Yˆ ) B), B ( )⊗ . (8) µ 0 k ∈ µ k 0 ∈ ∈ B X STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 9
2.2. Entrance and exit Markov chains. Fix a Borel set A . Define the entrance c A ⊂ X times to A from A of the chain Y by T0→ := 0 and A A c c Tn→ := inf k > Tn→ 1 : Yk 1 A ,Yk A on Yk A i.o.,Yk A i.o. , n N, { − − ∈ ∈ } { ∈ ∈ } ∈ and the sequences of entrances to A from Ac and exits from Ac to A for Y , respectively, by
A Ac c A A Yn→ := YTn→ and Yn → := YTn→ 1 on Yk A i.o.,Yk A i.o. , n N, − { ∈ ∈ } ∈ A Ac and Y0→ = Y0 → := Y0, where “i.o.” stands for “infinitely often”. We will need to clarify for which initial values Y0 of the Markov chain Y the i.o. events occur. To this end, we define the set NA′,Y (in short, NA′ ) as follows:
N ′ := x : P (Y A′ i.o.) = 1 , A′ ( ). A ,Y { ∈ X x k ∈ } ∈ B X Conditions when the sets NA and NA NAc are non-empty will be discussed in Section 3. c ∩ c c Define the exit set of Y from A , denoted by (A )ex,Y (in short, Aex): Ac := x Ac : P (x, A) > 0 . (9) ex,Y { ∈ } A Ac The following result justifies referring to the sequences Y → and Y → as entrance and exit Markov chains. Lemma 2.1. Let Y be a Markov chain taking values in a topological space . Let A be X A ⊂ X a Borel set such that NA NAc is non-empty. Then the entrance sequence Y → and the exit c A ∩ c c c sequence Y → are Markov chains on the respective sets A NA NA and Aex NA NA , with the probability transition kernels ∩ ∩ ∩ ∩ entr A P (x,dy) := P (Y → dy), x,y A N N c A x 1 ∈ ∈ ∩ A ∩ A and
exit Ac c P c (x,dy) := P (Y → dy)P (Y dz Y A), x,y A N N c . (10) A z 1 ∈ x 1 ∈ | 1 ∈ ∈ ex ∩ A ∩ A ZA A Ac We say that a Borel measure ν on is proper for the chains Y → and Y → if ν is Xc supported on NA NAc , i.e. ν((NA NAc ) ) = 0. We will start these chains only under their proper measures. ∩ ∩
Proof. We will use that for any A′ ( ), the set N ′ is absorbing for Y , in the sense that ∈ B X A P (Y N ′ )=1, x N ′ . (11) x 1 ∈ A ∈ A Indeed, for every x N ′ we have ∈ A 1= P (Y A′ i.o.) = P (Y A′ i.o. Y = y)P (Y dy)= P (Y A′ i.o.)P (x,dy). x k ∈ x k ∈ | 1 x 1 ∈ y k ∈ ZX ZX Hence P (Y A′ i.o.) = 1, i.e. y N ′ , for P (x, )-a.e. y. That is, P (Y N ′ ) = 1. y k ∈ ∈ A · x 1 ∈ A It follows from equality (11) that the non-empty set NA NAc is absorbing for Y . Then c ∩ clearly A NA NAc and A NA NAc are non-empty. Hence for every x NA NAc , the c ex ∩ A ∩ ∩A ∩ c c ∈ ∩ c sequences (Yn →)n 1 and (Yn→ )n 1 belong respectively to Aex NA NA and A NA NA , ≥ ≥ ∩ ∩A ∩ ∩ and thus have infinitely many terms, Px-a.s. Then the sequence (Tn→ )n 1 of entrance times c ≥ to A from A is infinite Px-a.s. This is a sequence of stopping times with respect to Y . Then 10 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
A A A it follows from the equality Y → = (YTn→ )n 0 that Y → is a Markov chain under Px. We omit the proof of this standard fact, which≥ is based on the strong Markov property of Y . entr A entr The formula for the transition kernel PA of Y → is evident; PA is a probability kernel on A NA NAc by (11). ∩ ∩ Ac A The Markov property of the exit sequence Y → is not evident since (Tn→ 1)n N are exit − ∈ ex not stopping times with respect to Y . We see from definition (10) that P c (in short, P c ) c A A c c A is a probability kernel on Aex NA NA by (11). To prove that Y → is a Markov chain on c c ∩ ∩ ex c Aex NA NA with the transition kernel PAc , it suffices to show that for any x NA NA , ∩ ∩ c ∈ ∩ integer n 2, and Borel sets B , B ,... A N N c , we have ≥ 1 2 ⊂ ex ∩ A ∩ A
Ac Ac Ac ex ex Px(Y1 → B1,...,Yn → Bn)= Px(Y1 → dx1) PAc (x1, dx2) ... PAc (xn 1, dxn). ∈ ∈ ∈ − ZB1 ZB2 ZBn A The proof is by induction. Let n = 2. Since T1→ is finite Px-a.s., it is true that
c c ∞ c A A A A Px(Y1 → B1,Y2 → B2)= Px(T1→ = k,Yk 1 B1,Y2 → B2) ∈ ∈ − ∈ ∈ k X=1 ∞ c A A = Px T1→ >k 1,Yk 1 B1,Yk A, Y2 → B2 − − ∈ ∈ ∈ k=1 X ∞ A = Px(T1→ >k 1,Yk 1 dx1) − − ∈ k=1 ZB1 X Ac A Px Yk A, Y2 → B2 Yk 1 = x1, T1→ >k 1 . × ∈ ∈ − −
By the Markov property of Y , for Px(Yk 1 )-a.e. x1 B1 and every k 1 we have − ∈· ∈ ≥ Ac A Ac Px Yk A, Y2 → B2 Yk 1 = x1, T1→ >k 1 = Px1 (Y1 A, Y2 → B2) ∈ ∈ − − ∈ ∈ Ac = P (Y → B )P (Y dz). z 1 ∈ 2 x1 1 ∈ ZA ex On the other hand, from definition (10) of PAc we see that
Ac ex P (Y → B )P (Y dz)= P (Y A)P c (x , B ). (12) z 1 ∈ 2 x1 1 ∈ x1 1 ∈ A 1 2 ZA Putting everything together, we obtain
c c ∞ A A A Px(Y1 → B1,Y2 → B2)= Px(T1→ >k 1,Yk 1 dx1)Px1 (Y1 A)P (x1, B2) ∈ ∈ − − ∈ ∈ k B1 X=1 Z Ac ex = P (Y → dx ) P c (x , dx ), (13) x 1 ∈ 1 A 1 2 ZB1 ZB2 as required in the case n = 2. This proves the basis of induction. STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 11
To prove the inductive step, we proceed exactly as above and arrive at Ac Ac P (Y → B ,...,Y → B ) x 1 ∈ 1 n+1 ∈ n+1 ∞ c c A A A = Px(T1→ >k 1,Yk 1 dx1) Pz(Y1 → B2,...,Yn → Bn+1) Px1 (Y1 dz). − − ∈ ∈ ∈ ∈ k B1 A X=1 Z Z c We now use the assumption of induction for the probability under A when z NA NA . c c c∈ ∩ Since Px1 (Y1 NA NA ) = 1 for every x1 NA NA because the set NA NA is absorbing for Y , we get∈ ∩ ∈ ∩ R ∩
Ac Ac Ac P (Y → B ,...,Y → B ) P (Y dz)= P (Y dz) f(x )P (Y → dx ), z 1 ∈ 2 n ∈ n+1 x1 1 ∈ x1 1 ∈ 2 z 1 ∈ 2 ZA ZA ZB2 where f is a non-negative measurable function on B2 given by
ex ex f(x2) := PAc (x2, dx3) ... PAc (xn, dxn+1). ZB3 ZBn+1 We claim that for any x B and any non-negative measurable function g on B , 1 ∈ 1 2 Ac ex P (Y dz) g(x )P (Y → dx )= P (Y A) g(x )P c (x , dx ). (14) x1 1 ∈ 2 z 1 ∈ 2 x1 1 ∈ 2 A 1 2 ZA ZB2 ZB2 ex Indeed, for indicator functions g this holds by definition (10) of PAc ; cf. (12). Hence, (14) holds for simple functions (i.e. finite linear combinations of indicator functions) by additivity of the three integrals in (14). Finally, since any non-negative measurable function g can be represented as pointwise limit of a pointwise non-decreasing sequence of simple functions, equality (14) follows from the monotone convergence theorem. Putting everything together and applying (14) with g = f establishes the inductive step (exactly as we obtained (13) applying (12) in the case n = 2). 2.3. Duality of entrance and exit Markov chains. The following assertion is crucial. Theorem 2.1. Let Y and Yˆ be Markov chains that take values in a topological space , are X dual with respect to a σ-finite invariant Borel measure µ, and satisfy Y0 = Yˆ0. Let A ( ) c c ∈ BAX be a set such that N := NA,Y NA ,Y NA,Yˆ NAc,Yˆ is non-empty. Then the exit chain Y → Ac∩ ∩ ∩ c and the entrance chain Yˆ → are dual with respect to the Borel measure on A given by exit c
µ˜ c (dx) := ½ (x)P (Y A)µ(dx), x A . A ,Y N x 1 ∈ ∈ Recall that there always exists a chain Yˆ dual to Y if is a Polish space (see Section 3.1). Note in passing that in the special case when Y is an oscillatingX random walk S on = R and A = [0, ), we actually have (see [33, Section 2.4]) a quite surprising represenZtation ∞ A Ac of the transition kernels of the chains S→ and S → (i.e. O and U) as products of two entr − − kernels reversible (self-dual) with respect to λA,S = π+; see the Introduction for the notation.
A A Corollary 2.1. The entrance chain Y → and the exit chain Yˆ → are dual with respect to exit the measure µ˜A,Yˆ on A. This follows if we apply Theorem 2.1 to the chain Yˆ and the set Ac instead of Y and A. 12 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
exit Ac Corollary 2.2. The measure µ˜Ac,Y is invariant for the exit chain Y → and the measure exit A µ˜A,Yˆ is invariant for the entrance chain Y → . The follows immediately from the dualities in Theorem 2.1 and Corollary 2.1. exit entr Corollary 2.3. The measures µAc and µA (defined in (3)) are invariant for the respective Ac A exit c entr c c chains Y → and Y → when µAc (N )= µA (N )=0 (e.g., this holds if µ(N )=0). exit exit exit c c This follows from Corollary 2.2 since the assumption yieldsµ ˜A ,Y = µA andµ ˜A,Yˆ = entr c exit entr µA . These inequalities hold if µ(N ) = 0 since µAc µ and µA µ. Note that in general, they may cease to be true, for example, if the set≪N is empty. ≪
Ac Ac Proof of Theorem 2.1. By Lemma 2.1, the sequences Y → and Yˆ → are Markov chains c c exit on the respective sets Aex,Y N and A N. Since the measureµ ˜Ac,Y is supported on Ac N, the duality stated∩ will follow once∩ we check the detailed balance condition ex,Y ∩ exit exit exit entr c µ˜ c (dx)P c (x,dy)=˜µ c (dy)Pˆ c (y,dx), x,y A N. A ,Y A A ,Y A ∈ ex,Y ∩ exit c By definition ofµ ˜ c , this amounts to showing that for any Borel sets B , B A N, A ,Y 1 2 ⊂ ex,Y ∩ exit entr P c (x, B )P (Y A)µ(dx)= Pˆ c (y, B )P (Y A)µ(dy). (15) A 2 x 1 ∈ A 1 y 1 ∈ ZB1 ZB2 Let us transform the l.h.s. The chain Yˆ starts under µ since Yˆ0 = Y0 by assumption. exit Then, using at first definition (10) of the transition operator PAc , we get
Ac LHS (15) = µ(dx) P (Y → B )P (Y dz) z 1 ∈ 2 x 1 ∈ ZB1 ZA ∞ k+m+1 k c m 1 = P (Y ) B (A N) (A N) − B (A N) µ n n=0 ∈ 1 × ∩ × ∩ × 2 × ∩ k,m=1 X ∞ k+m+1 c m 1 k = P (Yˆ ) (A N) B (A N) − (A N) B , µ n n=0 ∈ ∩ × 2 × ∩ × ∩ × 1 k,m=1 X where the third equality followed from duality of Y and Yˆ by (8), and in the second equality we used that the set N is absorbing for the chain Y by (11), the assumption B1 N, and A ⊂ the fact that T → is finite P -a.s. for every x N by N N N c . We transform the 2 x ∈ ⊂ A,Y ∩ A ,Y last sum above using the same reasoning applied to the dual chain Yˆ instead of Y :
entr ½ ½ ˆ ˆ ˆ c ˆ LHS (15) = Eµ[ A N (Y0) B2 (Y1)PA (Y1, B1)] ∩
entr ½ ½ ˆ c = Eµ[ A N (Y1) B2 (Y0)PA (Y0, B1)] = RHS (15), ∩ where in the second equality we again used duality of Y and Yˆ . This finishes the proof. Remark 2.1. Assuming that the transition kernel P of the chain Y admits a dual kernel Pˆ with respect to µ, define the entrance set into A of Y started under µ: A (µ) := x A : Pˆ(x, Ac) > 0 . (16) en,Y { ∈ } STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 13
It is readily seen from the time-reversal equality (see (8) for k = 1) that this set supports entr entr c the entrance measure µA , given by µA = Pµ(Y0 A ,Y1 A ), in the sense that entr entr ∈ ∈ ∩· µ (A A (µ)) = 0 and µ (A′) > 0 for every Borel set A′ A (µ) such that A \ en,Y A ⊂ en,Y µ(A′) > 0. This ensures that Aen,Y (µ) does not depend mod µ on the dual kernel Pˆ chosen. We now can simplify the assumptions of Corollary 2.3 using the equivalences
exit c entr c c c µ c (N )=0 µ (N )=0 µ(A N en ′ )= µ(A N ′ )=0 for Y ′ Y, Yˆ , A ⇔ A ⇔ ex∩ A ,Y en∩ Aex,Y ∈{ } c c where Aen = Aen,Y (µ) and Aex = (A )ex,Y . These equivalences follow rather straightfor- wardly and we will not prove them in full. The fact that the third condition implies the first two is essentially shown in the proof of Part 2b of Theorem 3.1 below. Notice that the third condition is symmetric in the sense that it does no change if we ˆ c c substitute (Y, A) by (Y , A ). This simply interchanges the sets Aen and Aex since by (16), c c Aen,Y (µ)= Aex,Yˆ mod µ and Aex,Y = Aen,Yˆ (µ) mod µ. (17)
3. Entrance and exit Markov chains via the method of inducing In this section we study stationarity of general entrance and exit Markov chains using the results of infinite ergodic theory. The application of these results is rather straightforward for recurrent Markov chains and amounts to working with measure preserving shifts on sequences. For transient chains, we need to introduce additional construction allowing the time to run backwards in order to let us work with invertible two-sided shifts on bilateral sequences. The simple time-reversal argument used in the proof of Theorem 2.1 suggests that this extension is indeed natural. For completeness of exposition, we give all results on entrance and exit chains together with analogous statements for closely related induced Markov chains. This addition is very natural within the context used, and it does not take a lot of effort to provide it. The corresponding results (presented in Parts 1 of Theorems 3.1, 3.2 and Proposition 3.1) are not new, morally or essentially, but it is hardly possibly to provide matching references, especially due to the very general assumptions we use.
3.1. Setup and notation. Recall that Y is a Markov chain on the topological space . Y X Denote by Pµ := Pµ(Y ) the “distribution” of Y started under a Borel measure µ on the ∈·N0 Y Y space of sequences ( )⊗ , and denote by Eµ the “expectation” (integral) over Pµ . For the rest ofB SectionX 3.1 we assume that µ a σ-finite non-zero invariant measure of Y . N0 For any x , denote by xi the i-th coordinate of x. Similarly, for any non-empty set ∈ X I I N0, denote by xI the projection of x onto . Let θ be the one-sided shift operator on ⊂N0 X defined by (θx)i := xi+1 for i N0. Then θ is a measure preserving transformation of X N0 ∈ N0 Y the σ-finite measure space ( , ( )⊗ , Pµ ). XN0 B X For any set C ( )⊗ , the shift θ defines the first hitting time τC of C, the first
∈ B τXC τC ½X\C hitting mapping ϕC := θ of C,ϕ ˜C := θ · , and the induced shift θC := (ϕC ) C on C; k | see the Appendix for details. For any k 1 and B ( )⊗ , define the cylindrical set ≥ ∈ B X N0 CB := x :(x0,...,xk 1) B . { ∈ X − ∈ } 14 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
We will use the short notation τB′ := τCB , which matches in the case k = 1 the traditional probabilistic notation for the hitting time of the set B. For arbitrary k, we can think of τB′ k as of the hitting time of B by the multivariate Markov chain (Yn,...,Yn+k 1)n 0 on . A − ≥ X We can use the new notation to write the entrance chain Y → into a set A ( ) and Ac c ∈ B X the exit chain Y → from A , defined in Section 2.2, as
Ac A n 1 c (Y →,Y → )= θ −c ϕ˜ c (Y ) , on Y A i.o.,Y A i.o. , n N. (18) n n CA ×A CA ×A 0,1 k k ◦ { } { ∈ ∈ } ∈ We will also consider the induced sequence Y A obtained by restricting the chain Y to A: A n 1 Y := θ − ϕ (Y ) on Y A i.o. , n N . (19) n CA ◦ CA 0 { k ∈ } ∈ 0 A It is easy to show, using the strong Markov property of Y , that Y is a Markov chain on A A NA (cf. Lemma 2.1). We will say that a Borel measure ν on is proper for Y if ∩ c X ν(NA) = 0; we will start induced chains only under their proper measures. The powerful idea of ergodic theory is that the induced shift θC is a measure preserving N0 Y transformation of the induced space (C, ( )⊗ C, (P ) ), under certain recurrence-type B X ∩ µ |C assumptions on Y and C; see Lemmas A.1, A.1′, and A.2. Below we introduce the definitions needed to apply these general results of ergodic theory in the context of Markov chains. We also refer the reader to Kaimanovich [23, Section 1] for a brief account of relevant results on invariant Markov shifts, and to Foguel [18] for a detailed one. The Markov chain Y is called recurrent starting under µ if for every Borel set B ⊂ X such that µ(B) < , we have Px(τB′ (Y ) < ) = 1 for µ-a.e. x B. Clearly, we can drop the assumption µ(∞B) < , as readily seen∞ from σ-additivity of∈µ. It follows easily from ∞ invariance of µ that this definition is equivalent to Px( Yn B i.o. )= 1 for µ-a.e. x B; see (48). Following Kaimanovich [23], we say that Y is transient{ ∈ starting} under µ if for every∈ B ( ) such that µ(B) < , we have P ( Y B i.o. )=0 for µ-a.e. x B. We stress ∈ B X ∞ x { n ∈ } ∈ that for transient Y , it may be that Pµ B ( Yn B i.o. ) > 0 when µ(B)= . There is the usual recurrence–transience dichotomy,| see{ Condition∈ } 5 in Lemma 3.1 below.∞ We say that Y is topologically recurrent if Px(τG′ (Y ) < ) = 1 for every open set G and every x G. Warning: this definition matches the∞ ergodic-theoretic one, while⊂ the X Markov chains∈ literature often defines topological recurrence by taking every x instead of every x G (with G = ∅). In certain cases these two definitions are equivalent;∈ X see (32). Furthermore,∈ we say6 that Y is ergodic starting under µ or, synonymously, µ is an ergodic Y invariant measure of Y , if the (Pµ -preserving) shift θ is ergodic. The chain Y is called irreducible starting under µ if every invariant set of Y is µ-trivial, that is for any A (X), c ∈ B the equality Px(Y1 A)= ½A(x) mod µ implies that either µ(A)=0or µ(A ) = 0. Warning: this shall not be confused∈ with µ-irreducibility. We say that Y is topologically irreducible if Px(τG′ (Y ) < ) > 0 for every x and every non-empty open set G . Let us give∞ necessary and sufficient∈ X conditions for recurrence and ergodicity⊂ X of Y . Lemma 3.1. Let Y be a Markov chain that takes values in a topological space and has a σ-finite invariant Borel measure µ. The following statements hold true. X N0 N0 Y 1) Y is recurrent starting under µ iff the shift θ on ( , ( )⊗ , P ) is conservative. X B X µ 2) Y is recurrent starting under µ iff there exists a sequence of sets Bn n 1 ( ) such { } ≥ ⊂ B X that = n 1Bn mod µ, and Pµ Bn (τB′ n (Y )= )=0 and µ(Bn) < for every n 1. X ∪ ≥ | ∞ ∞ ≥ STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 15
k 3) Y is recurrent starting under µ if for some k 1 there exists a set B ( )⊗ such ≥ ∈ B X that P (τ ′ (Y ) = )=0 and P ((Y ,...,Y ) B) < . In particular, this holds if µ B ∞ µ 1 k ∈ ∞ Px(τG′ (Y ) < )=1 for an open set G of finite measure µ and every x . 4) Y is ergodic and∞ recurrent iff it is irreducible⊂ X and recurrent, all properties starting∈ X under µ. 5) If Y is irreducible, then it is either recurrent or transient, all properties starting under µ. The assumption in Condition 3 is not vacuous since in general, a Borel measure on may be infinite on every non-empty open set. For example, if is separable, take a Xsum of δ-measures at the points of a dense countable subset of ;X see Lemma 3.3 below for conditions to exclude such pathologies. Condition 5 extendsX the standard recurrence– transience dichotomy for countable space chains. Proof. 1) For the direct implication, note that since µ is σ-finite, N0 can be exhausted X by countably many cylindrical sets CBn with bases Bn ( ) of finite measure. Each Y ∈ B X set has measure Pµ (CBn ) = µ(Bn) < and is recurrent for θ by recurrence of Y . Then θ is conservative by the Conditions for∞ conservativity from the Appendix. For the reverse implication, every measurable cylindrical set CB is recurrent for θ by conservativity of θ, hence Y is recurrent. 2) These are the necessary and sufficient conditions appeared in the proof of Condition 1. 3) This holds by Maharam’s recurrence theorem from the Appendix restated for Y using Condition 1. Here τ is finite PY -a.e. and PY (C )= P ((Y ,...,Y ) B) < . CB µ µ B µ 1 k ∈ ∞ 4) The direct implication holds since every θ-invariant cylindrical set CB with one- dimensional base B ( ) is PY -trivial. The reverse one is in [23, Proposition 1.7]. ∈ B X µ 5) This is stated in [23, Theorem 1.2]. Since there is neither formal proof nor exact reference given there, let us comment that this claim follows from Foguel [18, Chapter II]. In more detail, we have = C D, where C and D are respectively conservative and dissipative parts, which are disjointX and∪ measurable. It follows (see [18, p. 17]) from irreducibility of Y
that Px(Y1 C) = ½C (x) mod µ. Then either C = X mod µ, in which case Y is recurrent
by [18, Eq.∈ (2.4)], or D = X mod µ, in which case Y is transient by repeating the argument ½ after [18, Eq. (2.4)] (for any B ( ) such that µ(B) < , take f = ½B and u = BM , d ∈ B Xk ∞ ∞ where BM := x B : k=0 Pµ B (Y dx) M , and let M ). ∈ dµ | ∈ ≤ →∞ We will also considerP non-recurrent Markov chains, in which case we shall work with invertible measure preserving transformations on spaces of sequences. The shift θ on one- sided sequences in N0 is not invertible, therefore we shall extend time to negative integers. This corresponds toX the natural extension in ergodic theory. It can be constructed using a standard argument (see Doob [16, Chapter X.1]) based on Kolmogorov’s consistency theo- rem. However, this theorem may cease to hold if the space is not Polish (see Bogachev [6, Example 7.7.3]). Therefore, here we proceed differently. X Z Z Consider the space of two-sided sequences equipped with the σ-algebra ( )⊗ . ¯ X Z ¯ Z B X The two-sided shift θ on is given by (θx)i := xi+1, where x , i Z. It is invertible 1 X ∈ X Z∈ and θ¯− is measurable, since the collection of measurable sets B such that θ(B) is measurable is a σ-algebra which contains all cylindrical sets. ⊂ X Z Z Define the time-reversal operator R : , given by R(x)i := x i, where i Z, Z NX → X − ∈ for x . Denote the projection onto 0 by x+ := (x , x ,...), and define the mapping ∈ X X 0 1 16 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
N0 N N0 V : − by V (y) i := yi, where i N, for y . It is easy to see that R, V , X + → X − ∈ ∈ X and ( ) are measurable mappings. Note also that for every x , the restriction V C x 0 { 0} · ∈ X | 1 is bijective and its inverse is measurable by the same argument as we used above for θ¯− . Assume now that there exists a transition kernel Pˆ dual to P with respect to µ. This is always true when is Polish space, in which case a dual kernel is unique mod µ (we will X always denote it by Pˆ). Indeed, if µ is a probability measure, then this claim is nothing but the disintegration theorem combined with existence of regular conditional distributions for probability measures on Polish spaces; see Kallenberg [24, Theorems 6.3, 6.4, and A1.2] or Aaronson [1, Theorem 1.0.8]. This extends to σ-finite measures by σ-additivity. Recall that Z Z for a Polish space , we have ( ) ( )= ( ), ( )⊗ = ( ), etc. X B X ⊗ B X N0B X × X B X B X Denote by Pˆ the distribution on ( )⊗ of a Markov chain with the transition kernel x0 B X Pˆ and starting at x0 ; it exists by the Ionescu Tulcea theorem ([24, Theorem 6.17]). We say that the probability∈ X measure Y 1 Y Z P¯ := (Pˆ V − ) P on ( )⊗ , x0 x0 ◦ ⊗ x0 B X Z ( N) N0 is an extended law of Y starting at x , where ( )⊗ is understood as ( )⊗ − ( )⊗ . 0 B X B X ⊗B X We can assume that there is a sequence Yˆ =(Yˆn)n 0 of random elements of , defined ≥ X on the same measurable space (Ω, ) with Y , such that Yˆ0 = Y0 and for every x0 , a) Yˆ F ˆ ∈ X ˆ is a Markov chain with the transition kernel P under probability Px0 ; and b) Y and Y are N0 N0 independent, as random elements of ( , ( )⊗ ), under probability P . X B X x0 Then Yˆ is a Markov chain dual to Y with respect to µ. Moreover, Yˆ is recurrent starting under µ iff so is Y ; this follows from Conditions 1 and 2 of recurrence and Remark A.1.b. Z Z Indeed, we can define Y and Yˆ on the canonical space (Ω, )=( , ( )⊗ ) with + ˆ + Z P F ¯Y X B X ω = x by taking Y (ω)= x and Y (ω)= R(x) for x and x0 = Px0 for x0 . Then N0 ∈ X ∈ X for any x0 and any sets B1, B2 ( )⊗ C x0 , we have ∈ X ∈ B X ∩ { } Y + + Px (Yˆ B ,Y B )= Px (V (Yˆ ) V (B ),Y B )= P¯ V (R(x) ) V (B ), x B , 0 ∈ 1 ∈ 2 0 ∈ 1 ∈ 2 x0 ∈ 1 ∈ 2 where the first equality holds true and makes sense since the function V C x is bijective and { 0} | + + its inverse is measurable (hence V (B1) is measurable). By the equality x =(V (R(x) ), x ) ¯Y and the definition of the extended law Px0 , we get Y 1 Y Px (Yˆ B ,Y B )= P¯ (V (B ) B )=(Pˆx V − )(V (B )) P (B ). 0 ∈ 1 ∈ 2 x0 1 × 2 0 ◦ 1 · x0 2 N 1 Since B1 = x0 B1′ for some B1′ ( )⊗ , we have V − (V (B1)) = B1′ , hence ˆ 1 { } × ˆ ˆ∈ B X X × Px0 V − (V (B1)) = Px0 (B1) because Px0 is supported on C x0 . Therefore, we get { } Y Px (Yˆ B ,Y B )= Pˆx (B ) P (B ), 0 ∈ 1 ∈ 2 0 1 · x0 2 ˆ which implies independence of Y and Y under Px0 , as required. Furthermore, arguing as above, we can obtain Y P¯ = Px (V (Yˆ ),Y ) , x . x0 0 ∈· 0 ∈ X ¯Y Z Notice that the function x0 Px0 ( B) is measurable for every B ( )⊗ since the col- lection of the sets B with this7→ property is a σ-algebra (this follows∈ B fromX the monotone STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 17
convergence theorem) which contains measurable rectangles. Then we can define
Y Z P¯ (B) := P (V (Yˆ ),Y ) B µ(dx ), B ( )⊗ , (20) µ x0 ∈ 0 ∈ B X ZX an extended law of Y starting under µ, which is simply the law of (V (Yˆ ),Y ) under Pµ. When is a Polish space, such extended law always exists and is unique since the same holds for Pˆ. X We now claim that the two-sided shift θ¯ is an invertible measure preserving transforma- Z Z ¯Y ¯ 1 tion of the σ-finite measure space ( , ( )⊗ , Pµ ). Then so is the measurable mapping θ− . ¯ X B X ¯Y It suffices to show that θ preserves the measure Pµ on cylindrical sets whose bases are measurable rectangles. To this end, it suffices to check that for any sets B0, B1,... ( ) and any integers 1 k n, we have ∈ B X ≤ ≤ Pµ(Y0 B0,...,Yn Bn)= Pµ(Yˆk B0,..., Yˆ1 Bk 1,Y0 Bk,...,Yn k Bn). (21) ∈ ∈ ∈ ∈ − ∈ − ∈
Denote f(x0) := Px0 (Y1 Bk+1,...,Yn k Bn) if k < n and f(x0) := 1 if k = n. Then ∈ − ∈
Pµ(Y0 B0,...,Yn Bn)= µ(x0) f(xk)Px0 (Y0 B0,...,Yk dxk) ∈ ∈ Bk ∈ ∈ ZX Z
= E [½(Y B ,...,Y B )f(Y )] µ 0 ∈ 0 k ∈ k k
= E [½(Yˆ B ,..., Yˆ B )f(Yˆ )] µ k ∈ 0 0 ∈ k 0 ˆ ˆ = Px0 (Yk B0,..., Y1 Bk 1)f(x0)µ(dx0), ∈ ∈ − ZBk
where in the first equality follows from the Markov property of Y (under Px0 ) and the third one follows from time-reversal equality (8). This yields required equality (21) by independence of Y and Yˆ under P for every x . x0 0 ∈ X 3.2. Invariant measures of general induced, entrance, and exit Markov chains. The proof of our first result shows that the method of inducing allowsus to compute invariant measures of the Markov chains mentioned. c c Recall that the exit set Aex of the chain Y from A was defined in (9), and the entrance set Aen of Y into A was defined, in the case when Y has a dual, in (16). Theorem 3.1. Let Y be a Markov chain that takes values in a topological space and has a σ-finite invariant Borel measure µ. Let A be a Borel set. X ⊂ X 1) Assume that µ(A) > 0. Then the induced chain Y A has a proper non-zero invariant Borel measure µA := µ A on A if either of the following conditions is true: a) Y is recurrent| starting under µ; ˆ b) P A (τ ′ (Y )= )=0 and the same holds (instead of Y ) for a Markov chain Y that is µ A ∞ dual to Y with respect to µ and satisfies Yˆ0 = Y0;
c) PµA (τA′ (Y )= )=0 and µ(A) < ; Moreover, if Y is∞ recurrent (resp. recurrent∞ and ergodic) starting under µ, then Y A is recurrent (resp. recurrent and ergodic) starting under µA. 18 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
A Ac 2) Assume that Pµ Ac (Y1 A) > 0. Then the entrance chain Y → and the exit chain Y → have respective proper| ∈ non-zero invariant Borel measures
entr exit c µA := Px(Y1 )µ(dx) on A and µAc (dx) := Px(Y1 A)µ(dx) on A , c ∈· ∈ ZA if either of the following conditions is true: a) Y is recurrent starting under µ; c b) Pµ Ac (τA′ en (Y ) = ) = Pµ Aen (τA′ (Y ) = )=0 and the same holds (instead of Y ) | ex ∞ | ex ∞ for a Markov chain Yˆ that is dual to Y with respect to µ and satisfies Yˆ0 = Y0; c c) Pµ Ac (τA′ (Y )= )= Pµ Ac (τA′ (Y )= )=0 and Pµ Ac (Y1 A) < . | ex ∞ | ex ex ∞ | ∈ ∞ A Moreover, if Y is recurrent (resp. recurrent and ergodic) starting under µ, then Y → and Ac entr exit Y → are recurrent (resp. recurrent and ergodic) starting respectively under µA and µAc .
Theorem 3.1 applies to recurrent chains merely if Pµ Ac (Y1 A) > 0, which means that Y can get from Ac to A. However, we stress that Y is not| required∈ to be recurrent; instead, there shall exist a dual chain Yˆ satisfying Assumptions 1b and 2b. The result of Part 1a is not new and it is especially well-known for finite µ. However, the best explicit reference we found, Revuz [40, Proposition 2.9 in Chapter 3], assumes that Y is Harris-recurrent (this is stronger than recurrence starting under µ). We do not know references for transient Y . For Part 2b, we essentially give a second proof of Theorem 2.1 using the inducing approach (Remark 2.1 in Section 2.3 explains the assumptions), which we will need later for Part 2b of Proposition 3.1.
Remark 3.1. The assumption of recurrence (in Parts a) is the strongest and the assumptions of Parts b are the weakest, in the following precise sense. a) Parts 1c and 2c are secondary but we stated them for the purpose of completeness (on practise, it can be difficult to check irreducibility of Y ). Indeed, if Y is recurrent starting under µ, then Assumptions 1c and 2c are clearly satisfied. In the opposite direction, if Y is irreducible starting under µ, then either of these assumptions implies, by Condition 5 in Section 3.1, that Y is recurrent starting under µ, since Y cannot be transient. entr b) Assume that is a Polish space. Then the measure µA , as defined in Theorem 3.1 in a more general setting,X has the form given by (3); either of Assumptions 1a or 1c implies Assumption 1b; and either of Assumptions 2a or 2c implies Assumption 2b. Indeed, on a Polish space, there always exists a Markov chain Yˆ dual to Y (see Sec- entr tion 3.1). Then the formula for µA simplifies immediately by the duality. Furthermore, if Y is recurrent starting under µ, then Assumption 1b for Y is satisfied, and Assumption 2b for Y holds by Condition 1 in Section 3.1. Also, if Y is recurrent starting under µ, then so is the dual chain Yˆ (see Section 3.1). Then Assumption 1b for Yˆ holds true as just shown. By the same reasoning applied for the set Ac instead of A, we see that Assumption 2b for ˆ c Y is satisfied since Pµ A (Y1 A ) > 0 by the duality and we can write the sets Aen = Aen,Y | c c ∈ ˆ and Aex = (A )ex,Y in terms of Y using equalities (17). Finally, Assumption 1c (resp. 2c) implies Assumption 1b (resp. 2b) by Remark A.1.b. STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 19
c) The results of Theorem 3.1 (as well as of Theorem 2.1 and Proposition 3.1) are purely measure-theoretic and can be restated for any σ-algebra on instead of ( ). The X B X assumptions of Parts a, b, c correspond respectively to those of Lemmas A.2, A.1, A.1′. Proof. 1) a) By Condition 1 in Section 3.1, recurrence of Y is equivalent to conservativity N0 N0 Y Y of the measure preserving shift θ on ( , ( )⊗ , P ). We have P (C )= µ(A) > 0 and X B X µ µ A by Lemma A.2, the induced shift θCA is a measure preserving conservative transformation N0 Y of the induced space (C , ( )⊗ C , (P ) A ). Then for any Borel set B A, A B X ∩ A µ |C ⊂ µ (B)= P (Y B)=(PY ) (C ) A µ 1 ∈ µ |CA B = (PY ) x N0 : θ (x) C = P (θ (Y )) B = P (Y A B), µ |CA ∈ X CA ∈ B µA CA 0 ∈ µA 1 ∈ where in the last equality we used definition (19) of Y A and the equality θ = (ϕ ) . CA CA CA A A Thus, the measure µA is invariant for the induced chain Y . This measure is proper for Y by implication (48) and the fact that (PY ) (τ = ) = 0, which holds by conservativity µ |CA CA ∞ of θ. Clearly, µA is non-zero since µ(A) > 0. A Recurrence of Y starting under µA follows trivially from that of Y starting under µ. It remains to obtain ergodicity of the induced chain from ergodicity and recurrence of Y . Use A Y A representation (19) to write the law of the induced chain Y starting under µA as PµA = Y 1 N0 2 (Pµ ) CA ψ− , where ψ : CA A is defined by ψ(x) := (x0, (θCA (x))0, (θCA (x))0,...). Note| ◦ that ψ(θ (x)) = θ→(ψ(x)) for every x C , implying CA ∈ A 1 1 1 1 N0 θ− (ψ− B)= ψ− (θ− B), B (A) . (22) CA ∈ B A N0 N0 N0 Y In particular, this yields that θ (restricted to A ) is measure preserving on (A , (A)⊗ , PµA ). A B We already know this since µA is an invariant measure for the chain Y and we have N0 N0 N0 (A)⊗ = ( )⊗ A , which follows from the definition of the induced topology B B X ∩ N N on A. To show ergodicity of θ on A 0 , consider an invariant set B (A) 0 , that is 1 Y A 1 1 1 Y ∈ B θ− B = B mod PµA or, equivalently, ψ− (θ− B)= ψ− B mod (Pµ ) CA . By (22), this gives 1 1 1 Y 1 | θC−A (ψ− B) = ψ− B mod (Pµ ) CA , meaning that ψ− B is an invariant set for θCA on CA. | 1 Y Since θCA is ergodic by Lemma A.2, the set ψ− B is (Pµ ) CA -trivial, implying that B is Y A | A PµA -trivial. This proves ergodicity of the induced Markov chain Y starting under µA. b) The two-sided shift θ¯ is an invertible measure preserving transformation of the mea- Z Z Y 1 sure space ( , ( )⊗ , P¯ ), and so is θ¯− . Denote C¯ := C . This is a cylindrical set in X B X µ A A Z with no constraints on negative coordinates. Hence P¯Y (C¯ ) = PY (C )= µ(A) > 0 and X µ A µ A ¯Y ¯ ¯ k ¯ ¯Y Z Pµ CA k 1θ− (CA) = Pµ (x : x0 A, τC¯A = ) \ ∪ ≥ ∈ X ∈ ∞ Y N0 = P x : x A, τ = = P (τ ′ (Y )= )=0. (23) µ ∈ X 0 ∈ CA ∞ µA A ∞ A In particular, by (48) this implies that the measure µA is proper for Y . Similarly, use representation (20) to get
¯Y ¯ ¯k ¯ ¯ Y Yˆ Pµ CA k 1θ (CA) = Pµ x : x0 A, x 1, x 2,... A = Pµ x : x0 A, τCA = =0. \ ∪ ≥ ∈ − − 6∈ ∈ ∞ 20 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
¯ ¯ Z ¯ ¯Y From Lemma A.1, the induced shift θC¯A is measure preserving on (CA, ( )⊗ CA, (Pµ ) C¯A ). Hence for any Borel set B A, B X ∩ | ⊂ Y Y Y µ (B)=(P¯ ) ¯ (C¯ )=(P¯ ) ¯ x : θ¯¯ (x) C¯ = (P ) x : θ (x) C . A µ |CA B µ |CA CA ∈ B µ |CA CA ∈ B A The last probability was already shown in (the proof of) Part 1a to be PµA (Y B). 1 ∈ c) We have PY (C ) > 0 and PY x C , τ = = 0 arguing as in Part 1b, and the µ A µ ∈ A CA ∞ claim follows as in Part 1a if we apply Lemma A.1′ instead of Lemma A.2. 2) a) We argue as above in Part 1a. The cylindrical set CAc A with two-dimensional × c Y c base A A satisfies Pµ (CA A) = Pµ Ac (Y1 A) > 0. The induced shift θCAc A on × | × × N0 Y ∈ c c c (CA A, ( )⊗ CA A, (Pµ ) CA ×A ) is measure preserving and conservative. In particular, × B X ∩ × | Y c c implication (48) and the equality (Pµ ) CA ×A (θCA ×A = ) = 0, which holds by conservativ- ity of θ, give us | ∞
Y k c c c c c 0=(Pµ ) CAc A ( θ CA A i.o. )= Pµ( Y0 A ,Y1 A Yk A i.o.,Yk A i.o. ). | × { ∈ × } { ∈ ∈ }∩{ ∈ ∈ } c exit entr Hence, by the definitions of the sets NA, NA and the measures µAc , µA , we get c c 0= P (Y A (N N c ),Y A)+ P (Y A ,Y A (N N c )) µ 0 ∈ \ A ∩ A 1 ∈ µ 0 ∈ 1 ∈ \ A ∩ A exit c entr = µ c (A (N N c )) + µ (A (N N c )). A \ A ∩ A A \ A ∩ A exit entr Ac A Thus, both measures µAc and µA are proper for the chains Y → and Y →. They are non-zero by entr exit c c µA (A)= µAc (A )= Pµ(Y0 A ,Y1 A)= Pµ Ac (Y1 A) > 0. ∈ ∈ | ∈ We now give the key computation which shows the use of inducing in finding invariant measures. By Lemma A.2, for any measurable set B Ac A, ⊂ × Y Y N0 P ((Y ,Y ) B)=(P ) c (C )=(P ) c x : θ c (x) C µ 0 1 ∈ µ |CA ×A B µ |CA ×A ∈ X CA ×A ∈ B c P c = µ (Y0,Y1) A A, (θCA ×A (Y )) 0,1 B . ∈ × { } ∈
Combining this with representation (18) and the identity θCAc A =(ϕCAc A ) c , we obtain × × CA ×A c Ac A P ((Y ,Y ) B)= P (Y ,Y ) A A, (Y →,Y → ) B . µ 0 1 ∈ µ 0 1 ∈ × 2 2 ∈ Let us take B = Ac B , where B A is an arbitrary Borel set. Then × 1 1 ⊂ entr c A µ (B )= P (Y ,Y ) A A, Y → B A 1 µ 0 1 ∈ × 2 ∈ 1 A = µ(dx0) Px0 (Y2→ B1 Y1 = x1)Px0 (Y1 dx1) c ∈ | ∈ ZA ZA A = µ(dx0) Px1 (Y1→ B1)Px0 (Y1 dx1) c ∈ ∈ ZA ZA entr entr = PA (x1, B1)µA (dx1), (24) ZA where in the third equality we used the Markov property of Y and in the last one we used entr A entr that the measure µA is proper for the entrance chain Y → (its transition kernel is PA ). entr A Thus, µA is invariant for Y → . STATIONARY ENTRANCE MARKOV CHAINS, INDUCING, AND LEVEL-CROSSINGS 21
Similarly, let us take B = B A, where B Ac is an arbitrary Borel set. Then 0 × 0 ⊂ exit c Ac µ c (B )= P (Y ,Y ) A A, Y → B A 0 µ 0 1 ∈ × 2 ∈ 0 Ac = µ(dx0) Px1 (Y1 → B0)Px0 (Y1 dx1) c ∈ ∈ ZA ZA Ac = Px0 (Y1 A)µ(dx0) Px1 (Y1 → B0)Px0 (Y1 dx1 Y1 A) c ∈ ∈ ∈ | ∈ ZAex ZA exit exit = PAc (x0, B0)µAc (dx0), (25) c ZAex exit where in the last equality we used formula (10) for the transition kernel PAc of the entrance Ac exit Ac exit chain Y → and the fact that the measure µAc is proper for Y →. Thus, µAc is invariant Ac for Y →. Recurrence of the entrance chain trivially follows from conservativity of the induced A entr c transformation θCA ×A if we argue as above in (24) to start Y → under µA . Similarly, the exit exit chain is recurrent starting under µAc ; cf. (25). c N N As for ergodicity, define the functions ψ0 : CAc A (A ) and ψ1 : CAc A A by × → × →
2 c ψi(x) := (xi, (θCAc A (x))i, (θCAc A (x))i,...), x CA A, i 0, 1 . × × ∈ × ∈{ } A A c The entrance chain (Yn→ )n 1 starts from Y1→ , which is Y1 on the event Y0 A ,Y1 A . Since for any Borel set B ≥A it is true that { ∈ ∈ } ⊂
entr c Y c µA (B)= Pµ(Y0 A ,Y1 B) = Pµ x CA A : x1 B , ∈ ∈ ∈ × ∈ A A entr Y 1 c we can write the law of (Yn→ )n 1 with Y1→ distributed according to µA as (Pµ ) CA ×A ψ1− . ≥ Ac Ac exit Y | ◦ 1 c Similarly, the law of the exit chain (Yn →)n 1 with Y1 → following µA is (Pµ ) CA ×A ψ0− . A A≥c | ◦ Then ergodicity of the chains Y → and Y → follows from that of Y exactly as in Part 1. ¯ c ¯Y ¯ c b) We prove as in Part 1b. The cylindrical set CA A satisfies Pµ (CA A)= Pµ Ac (Y1 × × | ∈ c c A) > 0. Arguing similarly to (23) and using that τA′ A(Y )= τA′ ex Aen (Y ) mod Pµ, we get × ×
Y k ¯ ¯ c ¯ ¯ c Pµ CA A k 1θ− (CA A) × \ ∪ ≥ × c = Pµ Y0 A ,Y1 A, τA′ c A(Y )= ∈ ∈ × ∞ = P (Y Ac ,Y A ,Y ,Y ,... Ac ) µ 0 ∈ ex 1 ∈ en 2 3 6∈ ex ∞ + P Y Ac ,Y A ,Y ,...,Y Ac ,Y Ac ,Y ,Y ,... A µ 0 ∈ ex 1 ∈ en 2 k 6∈ ex k+1 ∈ ex k+2 k+3 6∈ en k=1 X c Pµ Aen (τA′ (Y )= )+ Pµ Ac (τA′ en (Y )= )=0. (26) ≤ | ex ∞ ∞· | ex ∞ exit entr In particular, this implies (as in Part 1b) that both measures µAc and µA are proper for Ac A the chains Y → and Y →. 22 ALEKSANDAR MIJATOVIC´ AND VLADISLAV VYSOTSKY
1 Further, by representation (20) and invariance of the right two-sided shift θ¯− , we get Y k ¯ ¯ c ¯ ¯ c Pµ CA A k 1θ (CA A) × \ ∪ ≥ × ¯Y Z c c c = Pµ x :(x0, x1) A A, (x 1, x0) A A, (x 2, x 1) A A, . . . ∈ X ∈ × − 6∈ × − − 6∈ × ¯Y Z c c c = Pµ x :(x0, x 1) A A , (x 1, x 2) A A , (x 2, x 3) A A ,... ∈ X − ∈ × − − 6∈ × − − 6∈ × ˆ ˆ c ˆ = Pµ Y0 A, Y1 A , τA′ Ac (Y )= =0, ∈ ∈ × ∞