University of Texas Rio Grande Valley ScholarWorks @ UTRGV
Mathematical and Statistical Sciences Faculty Publications and Presentations College of Sciences
2020
The quantization of the standard triadic Cantor distribution
Mrinal Kanti Roychowdhury The University of Texas Rio Grande Valley
Follow this and additional works at: https://scholarworks.utrgv.edu/mss_fac
Part of the Mathematics Commons
Recommended Citation Roychowdhury, Mrinal K. 2020. “The Quantization of the Standard Triadic Cantor Distribution.” HOUSTON JOURNAL OF MATHEMATICS 46 (2): 389–407.
This Article is brought to you for free and open access by the College of Sciences at ScholarWorks @ UTRGV. It has been accepted for inclusion in Mathematical and Statistical Sciences Faculty Publications and Presentations by an authorized administrator of ScholarWorks @ UTRGV. For more information, please contact [email protected], [email protected]. arXiv:1809.07913v7 [math.DS] 21 Nov 2019 eeae by generated rpsto 1.1. Proposition ( to ieso,qatzto coefficient. quantization dimension, α fi xss scle the called is (0 exists, it if fi xss scle the called is exists, it if uhaset a Such in fqatzto hoy n a e G,G,Z o t epb deep its for Z] GN, Let [GG, technology. see may engineering and One an theory theory. results, theoretical quantization measur more of probability for tions a with [GL1,GL3,GL4,GL5,P] measure to probability refer given a of approximation by e of set oeteEciennr on norm Euclidean the note esr notmlstof set optimal an measure R ntedrcino pia esof sets optimal of direction the in number i ) min , ⊂ e od n phrases. and words Key 2010 Mathematics of Journal Houston appear, To n ftemi ahmtclam fteqatzto rbe st st to is problem quantization the of aims mathematical main the of One a + V P hnt n te lmn in element other any to than R ∞ ( ( a P M d n ∈ ahmtc ujc Classification. Subject Mathematics H UNIAINO H TNADTIDCCANTOR TRIADIC STANDARD THE OF QUANTIZATION THE apnssc that such mappings mto fagvnpoaiiydsrbto yapoaiiydsrbto th distribution probability given a a by For distribution points. probability many given finitely a of imation Abstract. P ytesmlrt mappings similarity the by when ieso exists. dimension n ,tenumber the ), h ooo eingnrtdby generated region Voronoi the , ; α -means ( ≥ α sauiu oe rbblt esr on measure probability Borel unique a is k a .Te,the Then, ). α x | .W ute hwta h uniaincecetde o exist not does coefficient quantization the that show further We 2. α k − o hc h nmmocr n otisn oethan more no contains and occurs infimum the which for )) a ,w netgt h pia esof sets optimal the investigate we 3, = a n sdntdby denoted is and , ∈ > k 2 α 0 dP h uniainshm npoaiiyter el ihfidn best a finding with deals theory probability in scheme quantization The i.e., , , Let ( ( ii x ) α sotnrfre oa the as to referred often is ) n atrst rbblt itiuin pia es uniainer quantization sets, optimal distribution, probability set, Cantor P S M hqatzto ro,dntdby denoted error, quantization th ea pia e of set optimal an be uniaindimension quantization j V s n ( ( ( -dimensional ∂M n x masawy a exactly has always -means a = ) R inf = | α d RNLKNIROYCHOWDHURY KANTI MRINAL ( S = ) a 2 o any for j k | 1 α − α o 1 for D n n 1 e snwsaetefloigpooiin(e [GG,GL1]). (see proposition following the state now us Let . )=0 = )) α { k mas n srfre o[R R,G2 R 1R] The R1–R5]. RR, GL2, DR1, [CR, to referred is one -means, x V ( x n P ≥ + DISTRIBUTION ( 0x,2A0 94A34. 28A80, 60Exx, 1. ∈ := P =lim := ) ≤ ,let 2, 2( d 2 R n R ; uniaincoefficient quantization k j Introduction lim , j →∞ a α α − − d ≥ d ( 1) 1 : ) ≤ n n ∈ iii eoethe denote : R ( { →∞ ,and 1, n P k n o all for S k α ) α uhthat such -means, x j nti ae,frtepoaiiymeasure probability the for paper, this In . 2 s .I skonta o otnosprobability continuous a for that known is It ). 1 n a 1 : V stesto l lmnsin elements all of set the is − ⊂ − masadthe and -means n = of a ( cost ≤ R log x log 2 P k n E d j P ∈ ) , min = ∈ ( , V n ≤ P R X a card( d or n n sdntdby denoted is and N n let and , n lmns(e G1) osesm work some see To [GL1]). (see elements k dmninlEcienspace, Euclidean -dimensional ( ∈ a upr h atrstgenerated set Cantor the support has } P : o nt set finite a For . itrinerror distortion X α eastof set a be V ) b α ∈ and , , n α ∈ ) := n k P,2 o rmsn applica- promising for [P1,P2] d ≤ M hqatzto rosfrall for errors quantization th P x V for n − = M ( n a o ( huhtequantization the though n b | k 1 k ( P . α k} P a cgon ninformation in ackground P onsi aldan called is points otatv similarity contractive ,i endby defined is ), )) | ie nt subset finite a Given . α . j k ti upre on supported is at . ffiiespot We support. finite of e =1 ) hn o every for Then, eteVrniregion Voronoi the be for D P α d h ro nthe in error the udy R ( ◦ ⊂ d P α S hc r closer are which n sdenoted is and , j .Frany For ). R − o,quantization ror, 1 d approx- Then, . h number the , P k · k , optimal a ∈ s de- α ∈ , 2 Mrinal Kanti Roychowdhury
From the above proposition, we can say that if α is an optimal set of n-means for P , then each a ∈ α is the conditional expectation of the random variable X given that X takes values in the Voronoi region of a. Sometimes, we also refer to such an a ∈ α as the centroid of its own Voronoi region. In this regard, interested readers can see [DFG, DR2, R1]. Let k ≥ 2 be a positive integer, and let {Sj : 1 ≤ j ≤ k} be a set of contractive similarity 1 2(j−1) R 1 k −1 mapping such that Sj(x)= 2k−1 x + 2k−1 for all x ∈ , and let P = k j=1 P ◦ Sj . Then, P is a unique Borel probability measure on R, and P has support the Cantor set C generated by P k the similarity mappings Sj for 1 ≤ j ≤ k, and C satisfies the invariance equality C = ∪ Sj(C) j=1 (see [F,H]). The Cantor set C generated by the k similarity mappings is called the k-adic Cantor set, more specifically the standard k-adic Cantor set, and the probability measure P is called the k-adic Cantor distribution, more specifically the standard k-adic Cantor distribution. If k = 2, 1 1 2 R then we have two similarity mappings given by S1(x) = 3 x and S2(x) = 3 x + 3 for all x ∈ , 1 −1 1 −1 and then the probability measure P is given by P = 2 P ◦ S1 + 2 P ◦ S2 , which has support the classical Cantor set C satisfying C = S1(C) ∪ S2(C). For this dyadic Cantor distribution, in [GL2], Graf and Luschgy determined the optimal sets of n-means and the nth quantization errors for all n ≥ 2. They also showed that the quantization dimension D(P ) of P exists, and equals the Hausdorff dimension of the invariant set C, but the quantization coefficient does not exist. By a word σ of length n, where n ≥ 1, over the alphabet {1, 2, 3}, it is meant that σ := n σ1σ2 ··· σn, where σj ∈{1, 2, 3} for all 1 ≤ j ≤ n. By {1, 2, 3} , we denote the set of all words over the alphabet {1, 2, 3} of some finite length n ≥ 0. Notice that the empty word ∅ has length n zero. For σ := σ1σ2 ··· σn ∈ {1, 2, 3} , by Sσ it is meant that Sσ := Sσ1 ◦···◦ Sσn . For the empty word ∅, by S∅ it is meant the identity mapping on R. Write ∞ {1, 2, 3}∗ := ∪ {1, 2, 3}n, n=0 i.e., {1, 2, 3}∗ denotes the set of all words over the alphabet {1, 2, 3} including the empty word ∅. Let X be a random variable with probability distribution P . For words β, γ, ··· , δ in {1, 2, 3}∗, by a(β, γ, ··· , δ) we mean the conditional expectation of the random variable X given Jβ ∪ Jγ ∪···∪ Jδ, i.e., 1 a(β, γ, ··· , δ)= E(X|X ∈ J ∪ J ∪···∪ J )= xdP (x). β γ δ P (J ∪···∪ J ) β δ ZJβ∪···∪Jδ Definition 1.2. For n ∈ N with n ≥ 3 let ℓ(n) be the unique natural number with 3ℓ(n) ≤ ℓ(n)+1 n < 3 . Write α2 := {a(1, 21), a(22, 23, 3)} and α3 := {a(1), a(2), a(3)}. For n ≥ 3, define αn := αn(I) as follows: ℓ(n) ℓ(n) ℓ(n) {a(ω): ω ∈{1, 2, 3} \ I} ∪ Sω(α2) if 3 ≤ n ≤ 2 · 3 , ω∈I αn(I)= ℓ(n) ℓ(n) ℓ(n)+1 {Sω(α2): ω ∈{1, 2, 3} \ I} ∪ Sω(α3) if 2 · 3 In this paper, in Section 3 we show that for all n ≥ 2, the sets αn given by Definition 1.2 form the optimal sets of n-means for the standard triadic Cantor distribution P . In Section 4, we show that the quantization coefficient does not exist though the quantization dimension 1 D(P ) exists. Notice that the probability measure P is symmetric about the point 2 , i.e., if two 1 intervals of equal lengths are equidistant from the point 2 , then they have the same probability. Thus, it seems, which is true in the case of dyadic Cantor distribution (see [GL2]), that if the closed interval [0, 1] is partitioned in the middle, then the conditional expectations of the left 1 1 half [0, 2 ], and the right half [ 2 , 1] will form the optimal set of two-means. In Proposition 2.6, we show that it is not true in the case of triadic Cantor distribution. The result in this paper The quantization of the standard triadic Cantor distribution 3 extends the well-known result for the dyadic Cantor distribution given by Graf-Luschgy, and we are grateful to say that the work in this paper was motivated by their work (see [GL2]). 2. Preliminaries R 1 2 Let Sj for 1 ≤ j ≤ 3 be the contractive similarity mappings on given by Sj(x)= 5 x+ 5 (j−1) R n for all x ∈ . For σ := σ1σ2 ··· σn ∈{1, 2, 3} , set Jσ := Sσ([0, 1]), where Sσ := Sσ1 ◦···◦Sσn . For the empty word ∅, write J := J∅ = S∅([0, 1]) = [0, 1]. Then, the set C := n∈N σ∈{1,2,3}n Jσ is known as the Cantor set generated by the mappings Sj, and equals the support of the probability 3 1 −1 ∗ T S 1 measure P given by P = j=1 3 P ◦ Sj . For σ = σ1σ2 ··· σn ∈{1, 2, 3} , n ≥ 0, write pσ := 3n and s := 1 . σ 5n P Let us now give the following lemmas. The proofs are similar to the similar lemmas in [GL2]. Lemma 2.1. Let f : R → R+ be Borel measurable and k ∈ N. Then, 1 fdP = (f ◦ S )(x)dP (x). 3k σ k Z σ∈{X1,2,3} Z Lemma 2.2. Let X be a random variable with probability distribution P . Then, E(X) = 1 1 R 2 1 2 2 and V := V (X)= 9 , and for any x0 ∈ , (x − x0) dP (x)= V +(x0 − 2 ) . From Lemma 2.1 and Lemma 2.2 the followingR corollary is true. ∗ Corollary 2.3. Let σ ∈{1, 2, 3} and x0 ∈ R. Then, 2 2 1 2 (1) (x − x0) dP (x)= pσ sσV +(Sσ( ) − x0) . J 2 Z σ Remark 2.4. From the above lemma it follows that the optimal set of one-mean is the expected value and the corresponding quantization error is the variance V of the random variable X. For n 1 σ ∈{1, 2, 3} , n ≥ 1, we have a(σ)= E(x : X ∈ Jσ)= Sσ( 2 ). ℓ(n) ℓ(n) Proposition 2.5. Let αn := αn(I) be the set given by Definition 1.2. If 3 ≤ n ≤ 2 · 3 , 3ℓ(n) then the number of such sets is Cn−3ℓ(n) , and the corresponding distortion error is given by 1 V (P ; α (I)) = (2 · 3ℓ(n) − n)V +(n − 3ℓ(n))V (P ; α ) . n 75ℓ(n) 2 ℓ(n) ℓ(n)+1 3ℓ(n) If 2 · 3