Depend. Model. 2021; 9:1–12 Research Article Open Access Jeonghwa Lee* Generalized Bernoulli process with long-range dependence and fractional binomial distribution https://doi.org/10.1515/demo-2021-0100 Received October 23, 2020; accepted January 22, 2021 Abstract: Bernoulli process is a nite or innite sequence of independent binary variables, Xi , i = 1, 2, ··· , whose outcome is either 1 or 0 with probability P(Xi = 1) = p, P(Xi = 0) = 1 − p, for a xed constant p 2 (0, 1). We will relax the independence condition of Bernoulli variables, and develop a generalized Bernoulli process that is stationary and has auto-covariance function that obeys power law with exponent 2H − 2, H 2 (0, 1). Generalized Bernoulli process encompasses various forms of binary sequence from an independent binary sequence to a binary sequence that has long-range dependence. Fractional binomial random variable is dened as the sum of n consecutive variables in a generalized Bernoulli process, of particular interest is when its variance is proportional to n2H , if H 2 (1/2, 1). Keywords: Bernoulli process, Long-range dependence, Hurst exponent, over-dispersed binomial model MSC: 60G10, 60G22 1 Introduction Fractional process has been of interest due to its usefulness in capturing long lasting dependency in a stochas- tic process called long-range dependence, and has been rapidly developed for the last few decades. It has been applied to internet trac, queueing networks, hydrology data, etc (see [3, 7, 10]). Among the most well known models are fractional Gaussian noise, fractional Brownian motion, and fractional Poisson process. Fractional Brownian motion(fBm) BH(t) developed by Mandelbrot, B. and Van Ness, J. (1968) [11] is zero- mean, increment stationary Gaussian process that has self-similarity, i.e., for any a > 0, law H fBH(at), t 2 Rg = fa BH(t), t 2 Rg law where 0 < H < 1 is called Hurst parameter, and = means equality of all the nite distributions. FBm has auto-covariance function var(BH(1)) 2H 2H 2H cov(BH(s), BH(t)) = jtj + jsj − jt − sj . 2 2H Especially, the variance of BH(t) is var(BH(1))jtj . If H = 1/2, then fBm becomes standard Brownian motion which has independent increment. The one time dierence of fBm, Yj = BH(j) − BH(j − 1) for j 2 Z, is called fractional Gaussian noise(fGn) which is mean-zero, stationary Gaussian process with auto-covariance function var(Y0) 2H 2H 2H γ(j) := cov(Y , Yj) = jj + 1j − 2jjj + jj − 1j . 0 2 The auto-covariance function obeys power law with exponent 2H − 2 for large lag, 2H−2 γ(j) ∼ var(Y0)H(2H − 1)j as j ! ∞. (1.1) *Corresponding Author: Jeonghwa Lee: Department of Statistics, Truman State University, USA, E-mail: [email protected] Open Access. © 2021 Jeonghwa Lee, published by De Gruyter. This work is licensed under the Creative Commons Attribution alone 4.0 License. 2 Ë Jeonghwa Lee If 1/2 < H < 1, fGn has long-range dependence, X cov(Y0, Yj) = ∞. (1.2) j2N Note that a covariance function that decreases as slowly as the power law with exponent between 0 and -1 for large lag leads to long-range dependence of the process. The time fractional Poisson process (TFPP) developed by Laskin, N. (2003) is a fractional generalization of Poisson process. Poisson process N(t) is used for counting the number of events in time-interval [0, t] where inter-arrival times between events are independent and exponentially distributed with parameter λ. As a re- sult, it has Markov property with expected number of events E(N(t)) = λt. In [9], TFPP, Nµ(t), 0 < µ ≤ 1, is dened with Mittag-Leer distributed waiting times where the inter-arrival distribution has heavy tail that decreases as slowly as 1/tµ+1, and this makes TFPP non-Markov process, possesses long-range dependence, µ and has expectation that is proportional to the fractional exponent of the time, E(Nµ(t))/t . For other frac- tional generalizations of Poisson process, such as mixed fractional Poisson process and their properties, see [2, 4, 5, 8]. In this paper, we propose a generalized Bernoulli process(GBP) that is stationary and has long-range de- pendence, and fractional binomial distribution as the sum of consecutive variables in a generalized Bernoulli process. Bernoulli process is a nite or innite sequence of independent binary variables, Xi , i = 1, 2, ··· , whose outcome is either 1 or 0 with probability P(Xi = 1) = p, P(Xi = 0) = 1−p, for a xed constant p 2 (0, 1). If we interpret Xi = 1 as a success in i-th trial, and Xi = 0 as a failure in i-th trial, then binomial variable that is the sum of n binary variables in Bernoulli process counts the number of successes among n trials, and the binomial variable has expectation np and variance np(1 − p). When applying binomial model in real data analysis, over-dispersion is frequently observed. Over- dispersion is a term referring to the phenomenon that larger variance is observed than the nominal variance under some presumed model. It occurs when assumptions in the presumed model are violated. In binomial model, it is assumed that trials are independent and each has the same probability of success. Due to depen- dence among trials or heterogeneous probability of success in trials, over-dispersed binomial model is widely used in application. See [6] for more account for over-dispersion in various statistical models. In this paper, we will relax the independence condition of Bernoulli variables so that binary variables have dependence with the auto-covariance which decreases with power law with exponent 2H − 2, for H 2 (0, 1). If H 2 (1/2, 1), the generalized Bernoulli process possesses long-range dependence, therefore, fractional binomial random variable, which is dened as the sum of n successive variables in a GBP,has larger variance than ordinary binomial random variable. Over-dispersion parameter ψn > 0 is dened and incorporated in binomial model as follows: E(Bn) = np, Var(Bn) = np(1 − p)(1 + ψn), (1.3) where Bn is over-dispersed binomial variable that has n number of trials and probability of success p. Among methods that incorporate over-dispersion in binomial distribution are beta-binomial model and generalized linear mixed model, [1, 6]. It turned out that the fractional binomial variable dened from a GBP shows over- 2H−1 dispersion (1.3) with various ψn . In particular, ψn is proportional to fractional exponent of n, ψn / n , when H 2 (1/2, 1). The contents of the paper are as follows. In section 2, we construct a generalized Bernoulli process whose auto-covariance function decreases with the power law of exponent 2H − 2. In section 3, some interesting properties of a GBP are investigated regrading conditional probability. In section 4, fractional binomial vari- able is dened from a GBP, and the connection of fractional binomial distribution to the over-dispersed bi- nomial distribution is investigated. 0 0 0 Throughout this paper, fi, i0, i1, ··· g, fi , i0, i1, ··· g ⊂ N, and for any set A = fi0, i1, ··· , ing, jAj = n+1, the number of elements in the set A, with jAj = 0, if A = ;. We also dene the maximum of empty set as 0, i.e., max ; = 0. Generalized Bernoulli process and fractional binomial distribution Ë 3 2 Generalized Bernoulli process We will dene stationary process, fXi , i 2 Ng, where each Xi takes one of two possible outcomes, 0 or 1, with P(Xi = 1) = p, P(Xi = 0) = 1 − p, and 0 2H−2 cov(Xi , Xj) = c ji − jj , i ≠ j, 0 for some constants c 2 R+, H 2 (0, 1). If H 2 (1/2, 1), n 0 X c 2H−1 cov(X , Xi) ∼ jnj , 1 2H − 1 i=1 which diverges as n increases, and the process is said to have long-range dependence. Below is how we will proceed to dene such stationary process. Let Xi be a binary variable with P(Xi = 1) = p, P(Xi = 0) = 1 − p, for a xed p 2 (0, 1). (2.1) * Dene the following function P with constants c > 0, H 2 (0, 1), and for any i0, i1, ··· , in , P* X p ( i0 = 1) := , P* X X p p cji i j2H−2 ( i0 = 1, i1 = 1) := + 1 − 0 , P* X X X p p cji i j2H−2 p cji i j2H−2 ( i0 = 1, i1 = 1, i2 = 1) := + 1 − 0 + 2 − 1 , and in general, P* X X X p p cji i j2H−2 p cji i j2H−2 ( i0 = 1, i1 = 1, ··· , in = 1) := + 1 − 0 + 2 − 1 2H−2 × · · · × p + cjin − in−1j . (2.2) Furthermore, for any disjoint sets, A, B ⊂ N, A ≠ ;, B ≠ ;, dene jBj * * X X k * P (\i02BfXi0 = 0g) \ (\i2AfXi = 1g) := P (\i2AfXi = 1g) + (−1) P (\i2B0[AfXi = 1g) k=1 B0⊂B 0 jB j=k X X P* \ fX g P* \ fX g P* \ fX g = ( i2A i = 1 ) − ( i2A[fjg i = 1 ) + ( i2A[fj1 ,j2g i = 1 ) j2B j1 <j2 j1 ,j22B jBj * + ··· (−1) P (\i2A[BfXi = 1g), (2.3) and jBj * X X k * P (\i2BfXi = 0g) := 1 + (−1) P (\i2B0 fXi = 1g) k=1 B0⊂B 0 jB j=k X X P* fX g P* \ fX g jBjP* \ fX g = 1 − ( i = 1 ) + ( i2fj1 ,j2g i = 1 ) + ··· (−1) ( i2B i = 1 ). (2.4) i2B j1 <j2 j1 ,j22B * If we can show that P is non-negative in (2.3-2.4) for any disjoint sets, A, B ⊂ N, then fXi , i 2 Ng is stationary process with probability P = P*.
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages12 Page
-
File Size-