<<

arXiv:1505.07176v2 [math.NA] 19 Oct 2015 1-66 aa ( Japan 113-8656, Keywords h reitga of integral true the nti ae esuymliait nerto ffntosdefined functions of integration multivariate study we s paper this In Introduction 1 classifications MSC sets point ley dmninlui ue o nitgal integrable an For cube. unit -dimensional ∗ † rdaeSho fEgneig h nvriyo oy,7- Tokyo, of University The Engineering, of School Graduate hswr a upre yJP rn-nAdfrYugScien Young for Grant-in-Aid JSPS by supported was work This The rwho h ubro onswt h ieso.Moreover, ex i dimension. sets an the point b of with Hammersley points expense symmetrized two-dimensional of the sider number at the integrands of converge smooth growth order for high error achieve which tegration (h symmetrized rules component-by-component good lattice finding to polynomial for applied construction Korobov be or can tion result The space. etpsil re of order possible best ro fsmerzddgtlnt over nets digital symmetrized of error ot al QC nerto.I hspprw osdrapp consider point over QMC we of nets paper class this ital certain theor a discrepancy In to the technique of integration. symmetrization area (QMC) the Carlo in Monte known well is principle, bevto egnrlz h oino ymtiainfro symmetrization base of arbitrary notion the generalize operations compact we arithmetic observation a dyadic in with technique group geometric abelian we a connected as cube, symmetrization unit at multi-dimensional look the in technique geometric n rv httemnmmDc egti ag nuht ach to enough large is weight Dick minimum the that prove and , h oino ymtiain lokona aepr’ refl Davenport’s as known also symmetrization, of notion The us-ot Carlo, Quasi-Monte : b ai ymtiaino iia esfor nets digital of symmetrization -adic [email protected]) us-ot al integration Carlo quasi-Monte b Z 13,65C05. 11K38, : f ∈ b lhuhsmerzto a enrcgie sa as recognized been has symmetrization Although . by N , b L ≥ p I etme ,2018 1, September .Sbeunl,w td h M integration QMC the study we Subsequently, 2. iceac o l 1 all for discrepancy ( f aah Goda Takashi = ) b ai ymtiain iia es Hammers- nets, digital symmetrization, -adic Abstract Z [0 , 1 1] s Z f b ( x narpouigkre Hilbert kernel reproducing a in d ) † x ≤ . < p f - og,Bny-u Tokyo Bunkyo-ku, Hongo, 3-1 ∞ [0 : . , it No.15K20964. tists 1] ae nthis on Based . ae2t an to 2 base m c ftein- the of nce s rm base prime n n quasi- and y ieanother give alddig- called s ge order) igher → oal dis- totally ∗ ponential construc- R econ- we eethe ieve yn a lying edenote we , ection vrthe over We consider an approximation of I(f) given in the form 1 I(f; P )= f(x), P x | | X∈P where P [0, 1]s denotes a finite point set in which we count points according to their multiplicity.⊂ I(f; P ) is called a quasi-Monte Carlo (QMC) integration rule of f over P . Obviously the absolute error I(f; P ) I(f) depends only on a point set P for a given f. There are two prominent| classes− of| point sets: digital nets [6, 17] and integration lattices [17, 24]. We are concerned with digital nets in this paper. The quality of a point set has been often measured by the so-called dis- crepancy [16, 17]. For t = (t ,...,t ) [0, 1]s, we denote by [0, t) the anchored 1 s ∈ axis-parallel rectangle [0,t1) [0,t2) [0,ts). The local discrepancy function ∆( ; P ) : [0, 1]s R is defined× by ×···× · → 1 s ∆(t; P )= 1 0 (x) t , P [ ,t) − j x j=1 | | X∈P Y where 1[0,t) denotes the characteristic function of [0, t). Now for 1 p the L discrepancy of P is defined as the L -norm of ∆( ; P ), i.e., ≤ ≤ ∞ p p · 1/p p Lp(P ) := ∆(t; P ) dt , s | | Z[0,1] ! with the obvious modification for p = . When f has bounded variation V (f) on [0, 1]s in the sense of Hardy and Krause,∞ the absolute error is bounded by

I(f; P ) I(f) V (f)L (P ). | − |≤ ∞ This inequality is called the Koksma-Hlawka inequality [17, Chapter 2]. An inequality of similar type also holds for 1 p < , see for instance [25]. This ≤ ∞ is why we consider the Lp discrepancy as a quality measure of a point set. It is, however, not an easy task to construct point sets with low Lp discrep- ancy. As an example, let us consider the two-dimensional Hammersley point sets in base b defined as follows. Definition 1. Let b 2 be a positive . For m N, the two-dimensional Hammersley point set≥ in base b consisting of bm points∈ is defined as a a a a P := 1 + + m , m + + 1 : a 0, 1,...,b 1 . H b ··· bm b ··· bm i ∈{ − } n  o It is known that PH has optimal order of the L∞ discrepancy, while it does not have optimal order of the L discrepancy for all p [1, ), see for instance p ∈ ∞ [18]. Here the general lower bounds on the Lp discrepancy for all s N have been shown by Roth [21] for p 2, Schmidt [22, 23] for p > 1 and Hal´asz∈ [10] ≥ for p = 1. Note that the optimal orders of the L1 and L∞ discrepancy are still unknown for s 3. ≥ There are several ways to modify PH such that the modified point set has optimal order of L discrepancy for p [1, ), see for instance [1, 7, 11]. p ∈ ∞

2 The notion of symmetrization (also known as Davenport’s reflection principle) [2] is one of the best-known remedies in order to achieve optimal order of Lp discrepancy and has been thoroughly studied in the literature, see for instance [11, 12, 13, 14, 20]. We also refer to [5] in which symmetrized point sets are studied in the context of QMC rules using integration lattices. Although the original symmetrization due to Davenport has been recog- nized as a geometric technique in the s-dimensional unit cube, in this paper, we give another look at symmetrization as that in a compact totally disconnected with dyadic arithmetic operations. This implies that symmetriza- tion fits quite well with dyadic structure of point sets and also with the tools used for analyzing the point sets. Therefore, it is quite reasonable to consider symmetrized two-dimensional Hammersley point sets in base 2 [11, 13], or more generally, symmetrized digital nets in base 2 [14]. Although there are some exceptions as in [12, 20] where symmetrization is shown to be helpful even if point sets have not dyadic structure, it must be interesting to find a geometric symmetrization technique which acts on a compact totally disconnected abelian group with b-dic arithmetic operations for b 2. The aim of this paper is two-fold: to generalize≥ the notion of symmetriza- tion from base 2 to an arbitrary base b N, b 2 and to obtain some basic results on the QMC integration error of symmetrized∈ ≥ digital nets in base b. In particular, we study the worst-case error of symmetrized digital nets in base b in a reproducing kernel Hilbert space (RKHS) in Section 4. This result for digital nets can be regarded as an analog of the result for lattice rules obtained in [5, Section 4.2] where only the sum of the half-period cosine space and the Korobov space is considered as a RKHS. In Section 4, we also study the mean square worst-case error with respect to a random digital shift in a RKHS. Fur- thermore, in Section 5, we prove that symmetrized Hammersley point sets in base b achieve the best possible order of Lp discrepancy for all 1 p< . Notation. Let N be the set of positive and N := ≤N 0∞. Let 0 ∪{ } C be the set of all complex numbers. For a positive integer b 2, Zb denotes the residue class ring modulo b, which is identified with the set≥0, 1,...,b 1 equipped with addition and multiplication modulo b. For x {[0, 1], its b-adic− } expansion x = ∞ ξ b−i with ξ Z is unique in the sense∈ that infinitely i=1 i i ∈ b many of the ξi are different from b 1 except for the endpoint x = 1 for which P − 0 all ξi’s are equal to b 1. Note that for 1 N we use the b-adic expansion 1 b , whereas for 1 [0, 1]− we use the b-adic expansion∈ (b 1)(b−1 + b−2 + ...).· It will be clear from∈ the context which expansion we use.−

2 Preliminaries

Here we recall necessary background and notation, including infinite direct prod- ucts of Zb, Walsh functions and digital nets (with infinite digit expansions) over Zb. We essentially follow the exposition of [9, Section 2].

2.1 Infinite direct products of Zb N Let us define G = (Zb) , which is a compact totally disconnected abelian group with the product topology where Zb is considered to be a discrete group. We denote by and addition and subtraction in G, respectively. Let ν be ⊕ ⊖

3 the product measure on G inherited from Zb, that is, for every cylinder set E = n Z Z with Z Z for 1 i n, we have ν(E) = i=1 i × i≥n+1 b i ⊆ b ≤ ≤ ( n Z )/bn. i=1Q i Q A character| | on G is a continuous group homomorphism from G to z C : zQ = 1 , which is a multiplicative group of complex numbers whose absolute{ ∈ value| | is} 1. We define the k-th character as follows. Definition 2. For a positive integer b 2, let ω := exp(2π√ 1/b) be the ≥ − primitive b-th root of unity. Let z = (ζ1, ζ2,... ) G and k N0 whose b-adic ∈a−1 ∈ expansion is given by k = κ0 + κ1b + + κa−1b with κ0,...,κa−1 Zb. Then the k-th character W : G 1,ω,...,ω··· b−1 is defined as ∈ k →{ } κ0ζ1+···+κa−1ζa Wk(z) := ω . (1)

We note that every character on G is equal to some Wk, see [19]. Let us now consider the higher-dimensional case. Let Gs denote the s-ary Cartesian product of G. Note that Gs is also a compact totally disconnected abelian group with the product topology. The operators and now denote addition and subtraction in Gs, respectively. We denote⊕ by ν⊖the product measure on Gs inherited from ν. The k-th character on Gs can be defined as follows. Definition 3. For a positive integer b 2 and a dimension s N, let z = s ≥ Ns ∈ (z1,...,zs) G and k = (k1,...,ks) 0. Then the k-th character Wk : Gs 1,ω ,...,ω∈ b−1 is defined as ∈ →{ b b } s

Wk(z) := Wkj (zj ). j=1 Y s We note again that every character on G is equal to some Wk as with the one-dimensional case. The group G is related to the unit interval [0, 1] through the following maps π : G [0, 1] and σ : [0, 1] G. Let z = (ζ1, ζ2,... ) G, and let x → → ∞ −i ∈ Z ∈ [0, 1] with its unique b-adic expansion x = i=1 ξib with ξi b. Then the ∞ −i ∈ projection map π is defined as π(z) := ζib and the section map σ is Pi=1 defined as σ(x) := (ξ1, ξ2,... ). By definition, π is surjective and σ is injective. For the s-dimensional case, both the projectionP and section maps are applied componentwise, through which the group Gs is related to the unit cube [0, 1]s. We note that π is a continuous map and that π σ = id[0,1]s . We summarize some important facts below [9, Lemma 4, Propositions◦ 3 and 5]. Proposition 4. The following holds true: 1. For k N , we have ∈ 0 1 if k =0, Wk(z) dν(z)= 0 otherwise. ZG ( 2. For k, l Ns, we have ∈ 0 1 if k = l, Wk(z)Wl(z) dν(z)= s 0 otherwise. ZG (

4 3. For f L1(Gs), we have ∈ f(z) dν(z)= f(σ(x)) dx. s s ZG Z[0,1] 4. For f L1([0, 1]s), we have ∈ f(x) dx = f(π(z)) dν(z). s s Z[0,1] ZG 5. Let H := z = (ζ , ζ ,... ) G : ζ = ζ = = ζ =0 . Then we have n { 1 2 ∈ 1 2 ··· n } sn s b if z Hn, Wk(z)= ∈ Ns (0 otherwise. k∈ 0 Xn kj

Definition 5. For a positive integer b 2, let ωb = exp(2π√ 1/b). We denote the b-adic expansion of k N ≥by k = κ + κ b + −+ κ ba−1 ∈ 0 0 1 ··· a−1 with κ0,...,κa−1 Zb. Then the k-th b-adic Walsh function bwalk : [0, 1] 1,ω ,...,ωb−1 is∈ defined as → { b b } κ0ξ1+···+κa−1ξa bwalk(x) := ωb , for x [0, 1] with its unique b-adic expansion x = ξ b−1 + ξ b−2 + . ∈ 1 2 ··· This definition can be generalized to the higher-dimensional case. Definition 6. For a positive integer b 2 and a dimension s N, let x = s ≥ Ns ∈ (x1,...,xs) [0, 1] and k = (k1,...,ks) 0. Then the k-th b-adic Walsh ∈ s b−1 ∈ function walk : [0, 1] 1,ω ,...,ω is defined as b →{ b b } s

bwalk(x) := bwalkj (xj ). j=1 Y Since we shall always use Walsh functions in a fixed base b, we omit the subscript and simply write walk or walk in this paper. From the definitions of characters on Gs and Walsh functions, we see that for any x [0, 1]s ∈ walk(x)= Wk(σ(x)). (2) Ns Since the system walk : k 0 is a complete orthonormal system in L ([0, 1]s) [6, Theorem{ A.11], we have∈ a} Walsh series expansion for f L ([0, 1]s) 2 ∈ 2 fˆ(k)walk, k Ns X∈ 0 where the k-th Walsh coefficient is given by

fˆ(k)= f(x)walk(x) dx. s Z[0,1] We refer to [6, Appendix A.3] and [8, Lemma 18] for a discussion about the pointwise absolute convergence of the Walsh series to f.

5 2.3 Digital nets Here we define digital nets in Gs by using infinite-column generating matrices, i.e., generating matrices whose each column can contain infinitely many non-zero entries. This definition has been recently given in [9]. N ZN×m m Definition 7. For m,s , let C1,...,Cs b . For 0 n

if k ⊥, Wk(z)= |P| ∈P z (0 otherwise. X∈P 3 The b-adic symmetrization

In this section we generalize the notion of symmetrization from base 2 to an arbitrary base b N, b 2. Before that, we recall the original symmetrization introduced by Davenport∈ ≥ [2] and give another look at it as a geometric technique in Gs with b = 2. Let P [0, 1]2 be a finite point set. Then the symmetrized point set in the sense of Davenport⊂ is defined as

P sym,D := (x, y) (x, 1 y) : (x, y) P . { ∪ − ∈ } It is often the case that the symmetrized point set is defined as

P sym := (x, y) (x, 1 y) (1 x, y) (1 x, 1 y) : (x, y) P . { ∪ − ∪ − ∪ − − ∈ }

6 In the remainder of this paper we only consider the symmetrized point set defined in the latter sense. For the higher-dimensional case, the symmetrized point set of P [0, 1]s is defined as ⊂ P sym := sym (x): x P,u 1,...,s , { u ∈ ⊆{ }}

where symu(x) denotes the s-dimensional vector whose j-th coordinate is 1 xj if j u and x otherwise, that is, sym (x) = (y ,...,y ) with − ∈ j u 1 s

1 xj if j u, yj = − ∈ (xj otherwise.

By definition, we have P sym =2s P . Here we give another| look| at the| | original symmetrization. Let b = 2 and z = (ζ1, ζ2,...) G with ζi Z2. By denoting e = (1, 1,...) G, we have z e = (1 ζ , 1∈ ζ ,...) G∈ and thus ∈ ⊕ − 1 − 2 ∈ 1 ζ 1 ζ π(z e)= − 1 + − 2 + ⊕ 2 22 ··· 1 1 ζ ζ = + + 1 + 2 + =1 π(z). 2 22 ··· − 2 22 ··· −     For z Gs, we denote by symG(z) the s-dimensional vector whose j-th coor- ∈ u dinate is zj e if j u and zj otherwise. Then the symmetrized point set of Gs can⊕ be given∈ by P ⊂ sym := symG(z): z ,u 1,...,s . P u ∈P ⊆{ } As a natural extension from b = 2 to an arbitrary positive integer b 2, ≥ we now introduce the notion of the b-adic symmetrization. For l Zb, we Zs ∈ write el = (l, l, . . .) G. For a vector l = (l1,...,ls) b, we write el = (e ,...,e ) Gs. ∈ ∈ l1 ls ∈ Definition 10. For a point set Gs, its symmetrized point set is defined as P ⊂ sym s := z el : z , l Z . P { ⊕ ∈P ∈ b} For a point set P [0, 1]s, its symmetrized point set is defined as ⊂ P sym = π [(σ(P ))sym] .

By definition, we have sym = bs and P sym = bs P . |P | |P| | | | | 3.1 Symmetrized digital nets s Zs In the following let be a digital net in G . We write = el : l b , which is also a digitalP net in Gs. From Definition 10, the symmetrizedQ { point∈ set} sym can be regarded as the direct sum of two digital nets and . Thus, it isP obvious that the following holds true. P Q Lemma 11. For a digital net in Gs, let sym be the symmetrized point set of . Then sym is also a digitalP net in Gs.P P P

7 Remark 12. Consider a digital net in Gs constructed with infinite-row gen- ZN×m P sym s erating matrices C1,...,Cs b . Then is a digital net in G whose ∈ N P generating matrices D ,...,D Z ×(m+s) are given as 1 s ∈ b 1 j 1 j j +1 s ··· − ··· 0 0 1 0 0 ··· ···  0 0 1 0 0  N×s D = (C , E ) with E = ··· ··· Z . j j j j 0 0 1 0 0 ∈ b    ···. . ···.   .. . ..   .    For a point set sym, we have the following orthogonal property. P Lemma 13. Let be a digital net in Gs and sym be its symmetrized point set. For k Ns, weP have P ∈ 0 sym if k ⊥ s, Wk(z)= |P | ∈P ∩E z sym (0 otherwise. ∈PX In the above, := k N0 : δ(k) 0 (mod b) where δ(k) denotes the b-adic sum-of-digits ofE k and{ ∈ is given as δ≡(k) := κ + κ} + for k = κ + κ b + . 0 1 ··· 0 1 ··· Proof. From the definition of the b-adic symmetrization, we have

Wk(z)= Wk(z el) ⊕ z∈Psym z∈P l∈Zs X X Xb = Wk(z) Wk(el). z∈P l∈Zs X Xb Since is a digital net in Gs, the first sum in the last expression equals if k ⊥P and 0 otherwise. On the second sum in the last expression, we have|P| ∈P s s lj δ(kj ) lj δ(kj ) Wk(el)= ωb = ωb . l∈Zs l∈Zs j=1 j=1 l ∈Z Xb Xb Y Y jXb

As ωb denotes the primitive b-th root of unity, the inner sum on the rightmost- side above equals b if δ(k ) 0 (mod b) and 0 otherwise. Thus, j ≡ bs if k s, Wk(el)= ∈E 0 otherwise. l∈Zs ( Xb All together, we obtain

bs if k P ⊥ s, Wk(z)= |P| ∈ ∩E z sym (0 otherwise. ∈PX Since sym = bs , the result follows. |P | |P| Combining Lemmas 9, 11 and 13 implies that

( sym)⊥ = ⊥ s. P P ∩E

8 Remark 14. In fact, the argument in this subsection can be generalized in the following way. Let and ′ be digital nets in Gs. Consider the direct sum P P = z z′ : z , z′ ′ . R { ⊕ ∈P ∈P } Then is also a digital net in Gs and satisfies the orthogonal property R if k ⊥ ( ′)⊥, Wk(z)= |R| ∈P ∩ P z (0 otherwise. X∈R In the remainder of this paper, however, we only consider the case ′ = , which gives us = sym. P Q R P 4 QMC integration over symmetrized digital nets

Let us consider a reproducing kernel Hilbert space (RKHS) with reproducing s s H kernel K : [0, 1] [0, 1] R. The inner product in is denoted by f,g H for f,g and the× associated→ norm is denoted by f H:= f,f . Ith is knowni ∈ H k kH h iH that if K(x, x) dx < the squared worst-case error in the space of [0,1]s ∞ p H a QMC integration over a point set P [0, 1]s is given by R p ⊂ 2

2 e (P,K) :=  sup I(f) I(f; P )  f∈H | − | kfkH≤1   2 1 x y x y x y y x y = K( , ) d d K( , ) d + 2 K( , ), 2s − P s P [0,1] x [0,1] x y Z | | X∈P Z | | X, ∈P (3) Ns see for instance [25]. For k, l 0, the (k, l)-th Walsh coefficient of K is defined by ∈

Kˆ (k, l) := K(x, y)walk(x)wall(y) dx dy. 2s Z[0,1] In the following we always assume K(x, x) dx < . We study the [0,1]s ∞ worst-case error of symmetrized point sets in a RKHS first and then move on R p to the mean square worst-case error with respect to a random digital shift.

4.1 The worst-case error From the proof of [9, Proposition 21], we have the following pointwise absolute convergence of the Walsh series of K. Lemma 15. Let K be a continuous reproducing kernel. We assume

Kˆ (k, l) < . | | ∞ k l Ns ,X∈ 0 For any z, w Gs, we have ∈ K(π(z), π(w)) = Kˆ (k, l)Wk(z)Wl(w). (4) k l Ns ,X∈ 0

9 Under some assumptions on K, the worst-case error is given as follows. Theorem 16. Let , ⊥ be a digital net in Gs and its dual net, respectively, and sym be the symmetrizedP P point set of . Let K be a continuous reproducing kernelP which satisfies P

K(x, x) dx < and Kˆ (k, l) < . s ∞ | | ∞ [0,1] k,l∈Ns Z p X 0 The squared worst-case error of a QMC integration over π( sym) is given by P e2(π( sym),K)= Kˆ (k, l). P ⊥ s 0 k,l∈P X∩E \{ } Although the proof is almost the same with that of [9, Proposition 21], we provide it below for the sake of completeness. Proof. We evaluate the three terms of (3) separately in which π( sym) is sub- stituted into P . The first term of (3) is simply P

K(x, y) dx dy = Kˆ (0, 0), 2s Z[0,1] by the definition of the Walsh coefficients. For the second term of (3), we have 2 K(π(z), y) dy sym s z sym [0,1] |P | ∈PX Z 1 1 = K(π(z), y) dy + K(y, π(z)) dy sym s sym s z sym [0,1] z sym [0,1] |P | ∈PX Z |P | ∈PX Z 1 1 = K(π(z), π(w)) dν(w)+ K(π(w), π(z)) dν(w) sym s sym s z sym G z sym G |P | ∈PX Z |P | ∈PX Z 1 = Kˆ (k, l)Wk(z) Wl(w) dν(w) sym s z∈Psym k l Ns G |P | X ,X∈ 0 Z 1 ˆ k l z w ν w + sym K( , )Wl( ) Wk( ) d ( ) s z sym k l Ns G |P | ∈PX ,X∈ 0 Z 1 1 ˆ k 0 z ˆ 0 l z = K( , ) sym Wk( )+ K( , ) sym Wl( ) k Ns z∈Psym l Ns z∈Psym X∈ 0 |P | X X∈ 0 |P | X = Kˆ (k, 0)+ Kˆ (0, l), ⊥ s ⊥ s k∈PX∩E l∈PX∩E where we use the symmetry of K, Item 4 of Proposition 4, Equation (4), Item 1 of Proposition 4 and Lemma 13 in this order for each of the five equalities. Finally, for the last term of (3), we have 1 z w sym 2 K(π( ), π( )) z w sym |P | , X∈P 1 ˆ k l z w = sym 2 K( , )Wk( )Wl( ) z w sym k l Ns |P | , X∈P ,X∈ 0

10 1 1 ˆ k l z w = K( , ) sym Wk( ) sym Wl( ) k l Ns z sym w sym ,X∈ 0 |P | ∈PX |P | ∈PX = Kˆ (k, l), ⊥ s k,l∈PX∩E where we use Equation (4) and Lemma 13 in the first and third equalities, respectively. Substituting these results into the right-hand side of (3), the result follows. Remark 17. The result of Theorem 16 has some similarity to that of [9, Propo- sition 22]. When a QMC integration over a folded digital net by means of the b-adic tent transformation is considered, the squared worst-case error in a RKHS is given by

Kˆ ( k/b , l/b ), ⌊ ⌋ ⌊ ⌋ ⊥ s 0 k,l∈PX∩E \{ } s where we write x = ( x1 ,..., xs ) for x = (x1,...,xs) R . Since the number of points⌊ of⌋ a folded⌊ ⌋ digital⌊ net⌋ is the same as that of a∈n original digital net, a folded digital net has a cost advantage over a symmetrized digital net.

4.2 The mean square worst-case error Here we study the mean square worst-case error of symmetrized point sets with respect to a random digital shift. Now the error criterion is given by

e˜2(π( sym),K)= e2(π( sym) σ,K) dσ, P s P ⊕ Z[0,1] where the operator is defined for x, y [0, 1]s as ⊕ ∈ x y := π [σ(x) σ(y)] . ⊕ ⊕ From [6, Theorem 12.4], we have the following.

s 2s Lemma 18. For a point set G and a reproducing kernel K L2([0, 1] ), the mean square worst-case errorP ⊂ of a set π( ) with respect to a random∈ digital shift is given by P

e˜2(π( ),K)= e2(π( ),K ), P P ds where Kds is called a digital shift invariant kernel defined as

Kds(x, y) := K(x σ, y σ) dσ, s ⊕ ⊕ Z[0,1] for any x, y [0, 1]s. ∈ Similarly to Lemma 15, we have the following pointwise absolute convergence of the Walsh series of Kds.

11 Lemma 19. Let K be a continuous reproducing kernel. We assume

Kˆ (k, k) < . | | ∞ k Ns X∈ 0 For any z, w Gs, we have ∈

Kds(π(z), π(w)) = Kˆ (k, k)Wk(z)Wk(w). (5) k Ns X∈ 0 ˆ Proof. By the assumption k Ns K(k, k) < , the right-hand side on (5) ∈ 0 | | ∞ converges absolutely. Thus it suffices to show that P

lim Kˆ (k, k)Wk(z)Wk(w)= Kds(π(z), π(w)). n→∞ Ns k∈ 0 Xn kj

In fact we have

Kˆ (k, k)Wk(z)Wk(w) Ns k∈ 0 n kj

= Wk(z)Wk(w) K(x, y)walk(x)walk(y) dx dy Ns [0,1]2s k∈ 0 Z Xn kj

′ ′ ′ ′ ′ ′ = Wk(z)Wk(w) K(π(z ), π(w ))Wk(z )Wk(w ) dν(z ) dν(w ) Ns G2s k∈ 0 Z n kj

′ ′ ′ ′ ′ ′ = K(π(z ), π(w )) Wk((z z ) (w w )) dν(z ) dν(w ) G2s Ns ⊖ ⊖ ⊖ Z k∈ 0 n kj

′ ′ ′ ′ ′ ′ = K(π(z z ), π(w w )) Wk(w z ) dν(z ) dν(w ), G2s ⊕ ⊕ Ns ⊖ Z k∈ 0 n kj

sn ′ ′ ′ ′ b if (z , w ) Jn,2s, Wk(w z )= ∈ Ns ⊖ (0 otherwise. k∈ 0 n kj

Thus we have

Kˆ (k, k)Wk(z)Wk(w) Ns k∈ 0 Xn kj

= bsn K(π(z z′), π(w w′)) dν(z′) dν(w′) ⊕ ⊕ ZJn,2s

12 = bsn K(π(z z′), π(w w′)) dν(z′) dν(w′) s w′ s ⊕ ⊕ ZG Z ⊖Hn K(π(z w′), π(w w′)) dν(w′) as n , → s ⊕ ⊕ → ∞ ZG where the last convergence stems from the facts that K π is continuous since ◦ ′ s both K and π are continuous and that the product measure of the set w Hn equals bsn for any w′ Gs. Finally we have ⊖ ∈

lim Kˆ (k, k)Wk(z)Wk(w) n→∞ Ns k∈ 0 n kj

= K(π(z w′), π(w w′)) dν(w′) s ⊕ ⊕ ZG = K(π(z) π(w′), π(w) π(w′)) dν(w′) s ⊕ ⊕ ZG = K(π(z) σ, π(w) σ) dσ = Kds(π(z), π(w)), s ⊕ ⊕ Z[0,1] where we use Item 3 of Proposition 4 in the third equality. Thus the result follows. Under some assumptions on K, the mean square worst-case error with re- spect to a random digital shift is given as follows. Theorem 20. Let , ⊥ be a digital net in Gs and its dual net, respectively, and sym be the symmetrizedP P point set of . Let K be a continuous reproducing kernelP which satisfies P

K(x, x) dx < and Kˆ (k, k) < . s ∞ | | ∞ [0,1] k∈Ns Z p X0 The mean square worst-case error of a QMC integration over π( sym) with respect to a random digital shift is given by P

e˜2(π( sym),K)= Kˆ (k, k). P ⊥ s 0 k∈P X∩E \{ } sym Proof. We evaluate the three terms of (3) separately in which π( ) and Kds are substituted into P and K, respectively. The first term of (3)P is given by

Kds(x, y) dx dy = Kds(π(z), π(w)) dν(z) dν(w) 2s 2s Z[0,1] ZG

= Kˆ (k, k) Wk(z)Wk(w) dν(z) dν(w) 2s k Ns G X∈ 0 Z = Kˆ (0, 0), where we use Item 1 of Proposition 4 in the third equality. For the second term of (3), we have 2 2 Kds(π(z), y) dy = Kds(π(z), π(w)) dν(w) sym s sym s z sym [0,1] z sym G |P | ∈PX Z |P | ∈PX Z

13 2 = Kˆ (k, k)Wk(z) Wk(w) dν(w) sym s z sym k Ns G |P | ∈PX X∈ 0 Z =2Kˆ (0, 0), where we use Item 4 of Proposition 4, Equation (5) and Item 1 of Proposition 4 in this order for each of three equalities. Finally, for the last term of (3), we have 1 z w sym 2 Kds(π( ), π( )) z w sym |P | , X∈P 1 ˆ k k z w = sym 2 K( , )Wk( )Wk( ) z,w∈Psym k Ns |P | X X∈ 0 1 1 ˆ k k z w = K( , ) sym Wk( ) sym Wk( ) k Ns z∈Psym w∈Psym X∈ 0 |P | X |P | X = Kˆ (k, k), ⊥ s k∈PX∩E where we use Equation (5) and Lemma 13 in the first and third equalities, respectively. Substituting these results into the right-hand side of (3), the result follows. Remark 21. The result of Theorem 20 has some similarity to that of [8, The- orem 16]. When a QMC integration over a “digitally shifted and then folded” digital net is considered, the mean square worst-case error with respect to a random digital shift in a RKHS is given by

Kˆ ( k/b , k/b ), ⌊ ⌋ ⌊ ⌋ ⊥ s 0 k∈P X∩E \{ } s where again we write x = ( x1 ,..., xs ) for x = (x1,...,xs) R . As already mentioned in Remark⌊ ⌋ ⌊ 17,⌋ this approach⌊ ⌋ has a cost advantage∈ over a symmetrized digital net. Remark 22. Following the same arguments as in [9, Sections 4–6], Theo- rems 16 and 20 can be applied to component-by-component (CBC) construction or Korobov construction of good symmetrized (higher order) polynomial lattice rules which achieve high order convergence of the worst-case error in an unan- chored Sobolev space of smoothness α N, α 2. For instance, for m N and a prime b, the CBC construction can find∈ symmetrized≥ higher order poly∈nomial lattice rules with bm+s points which achieve the worst-case error convergence of O(b−αm+ǫ) with an arbitrary small ǫ > 0. Moreover, this construction can be done in O(sαmbαm/2) arithmetic operations using O(bαm/2) memory.

5 Discrepancy bounds for symmetrized Ham- mersley point sets

Finally in this paper we prove that symmetrized Hammersley point sets in prime base b achieve the best possible order of L discrepancy for all 1 p ≤

14 p < . According to Definitions 1 and 10, the two-dimensional symmetrized Hammersley∞ point set in base b is given as follows. Definition 23. Let b 2 be a positive integer. For m N, the two-dimensional symmetrized Hammersley≥ point set in base b is a point∈ set consisting of bm+2 defined as

sym m+2 P := (xa,ya): a = (a ,...,a ) 0, 1,...,b 1 , H 1 m+2 ∈{ − } where 

a1⊕am+1 am⊕am+1 am+1 am+1 xa = b + + bm + bm+1 + bm+2 + , am⊕am+2 ··· a1⊕am+2 am+2 am+2 ··· ya = + + m + m+1 + m+2 + . ( b ··· b b b ··· sym We also consider a truncated version of PH . For a positive integer n m + 2, the two-dimensional truncated symmetrized Hammersley point set in≥ sym,n base b, denoted by PH , is given as P sym,n := (xn,yn): a = (a ,...,a ) 0, 1,...,b 1 m+2 , H a a 1 m+2 ∈{ − } where 

n a1⊕am+1 am⊕am+1 am+1 am+1 xa = b + + bm + bm+1 + + bn , n am⊕am+2 ··· a1⊕am+2 am+2 ··· am+2 y = + + m + m+1 + + n . ( a b ··· b b ··· b sym,n Z Here note that PH is a digital net over b with generating matrices of size n (m + 2) × 1 0 0 1 0 0 0 1 0 1 0 1 ··· 0 1 0 0 ··· 1 0 0 1  . . ···. . . .   . ···. . . . .  ......     C =  0 0 1 1 0  , C =  1 0 0 0 1  . (6) 1  ···  2  ···   0 0 0 1 0   0 0 0 0 1   ···   ···   ......   ......   ......   ......       0 0 0 1 0   0 0 0 0 1   ···   ···      The following theorem is the main result of this section. N sym Theorem 24. For a prime b and m , let PH be the symmetrized Ham- mersley point set in base b consisting∈ of N = bm+2 points. Then, for all sym 1 p < , the Lp discrepancy of PH is of the best possible order. Namely, for≤ any 1∞ p< there exists a constant C > 0 such that ≤ ∞ p

sym √m +2 logb N Lp(PH ) Cp m+2 = Cp . ≤ b p N Our proof consists of two parts. We first prove that the Lp discrepancy of sym,n PH is of the best possible order for any 1 p < when n > 2m. Then ≤ ∞ sym sym,n we show that the difference between the Lp discrepancies of PH and PH is sym small enough that the Lp discrepancy of PH is still of the best possible order for any 1 p< . ≤ ∞

15 5.1 The minimum Dick weight of point sets

To prove the first part, we introduce the Dick weight function µ2 given in [3, 4] and the minimum Dick weight ρ2(P ) for a two-dimensional digital net P .

a1−1 Definition 25. For k N, we denote its b-adic expansion by k = κ1b + a2−1 av−1∈ κ2b + + κvb such that κ1,...,κv 1,...,b 1 and a1 > a2 > >a >···0. Then the Dick weight function µ∈: {N R −is defined} as ··· v 2 0 → a + a if v 2, 1 2 ≥ µ2(k)= a1 if v =1, 0 if k =0.

For vectors k = (k , k ) N2, we define µ (k) := µ (k )+ µ (k ). Moreover, 1 2 ∈ 0 2 2 1 2 2 let P be a two-dimensional digital net over Zb. Then the minimum Dick weight ρ2(P ) is defined as

ρ2(P ) := min µ2(k). k∈P ⊥\{(0,0)}

The following lemma shows how the minimum Dick weight of a digital net connects with a structure of its generating matrices, see [6, Chapter 15].

Lemma 26. For m,n N with n m, let P be a digital net over Zb with generating matrices C ,∈ C of size n ≥ m. For j = 1, 2 and 1 l n, let c 1 2 × ≤ ≤ j,l denote the l-th row vector of Cj . When l>n, cj,l denotes the vector consisting of m zeros. Let ρ be a positive integer such that for all 1 i1,v1 <

min(v1,2) min(v2,2) i + i ρ, 1,l 2,l ≤ Xl=1 Xl=1 c c c c the vectors 1,i1,v1 ,..., 1,i1,1 , 2,i2,v2 ,..., 2,i2,1 are linearly independent over Zb. Then we have ρ2(P ) >ρ. Although Definition 25 and Lemma 26 are considered only for the two- dimensional case, they can be generalized to the s-dimensional case with any s N. Recently, Dick [4] proved that digital nets over Z with large minimum ∈ 2 Dick weight achieve the best possible order of the Lp discrepancy for 1

m Proposition 27. Let P be a digital net over Zb consisting of b points which satisfies ρ (P ) > 2m t for some integer 0 t 2m. Then for all 1 p< 2 − ≤ ≤ ≤ ∞ there exists a constant Cp,t which depends only on t and p such that we have

√m L (P ) C . p ≤ p,t bm

16 From Lemma 26 and Proposition 27, as the first part of the proof of Theo- rem 24, it suffices to show the linear independence properties of the generating sym,n matrices (6) such that the minimum Dick weight of PH is large.

Lemma 28. For m,n N with n > 2m, let C1 and C2 denote the generating matrices (6). For j =1∈, 2 and 1 l n, c denotes the l-th row vector of C . ≤ ≤ j,l j The following sets of the vectors are linearly independent over Zb: 1. c ,..., c , c ,..., c for 0 r m +1, { 1,1 1,r 2,1 2,m+1−r} ≤ ≤ 2. c ,..., c , c and c ,..., c , c for 1 r m, { 1,1 1,m+1 2,r} { 2,1 2,m+1 1,r} ≤ ≤ 3. c1,1,..., c1,r, c2,1,..., c2,m−r, cj,s for j =1, 2, 0 r m, and m +1 {s n, } ≤ ≤ ≤ ≤

4. c1,1,..., c1,r1,2 , c1,r1,1 , c2,1,..., c2,r2,2 , c2,r2,1 for 0 < r1,2 < r1,1 m and{ 0 < r < r m such that r + r +} r + r 2m +1. ≤ 2,2 2,1 ≤ 1,1 1,2 2,1 2,2 ≤ Since the proof is quite similar to that of [7, Lemma 3.1], we omit it. Using the linear independence properties of (6) shown in the above lemma together with Lemma 26 and Proposition 27, we have the following. N sym,n Proposition 29. For m,n with n > 2m, let PH be the truncated symmetrized Hammersley point∈ set in base b consisting of N = bm+2 points. sym,n The minimum Dick weight of PH is larger than 2m +1, that is,

sym,n ρ2(PH ) > 2m +1.

sym,n This implies that the Lp discrepancy of PH is bounded by

sym,n √m +2 logb N Lp(PH ) Cp m+2 = Cp , ≤ b p N for any p [1, ) with a positive constant C depending only on p. ∈ ∞ p sym,n Since the proof on the minimum Dick weight of PH is quite similar to that of [7, Lemma 3.3], we omit it and just give some comments on the Lp discrepancy sym,n m+2 bound instead. As PH consists of b points and the minimum Dick weight sym,n of PH is larger than 2m +1=2(m + 2) 3, the t-value in Proposition 27 of P sym,n always equals 3 for any m N. This− is why we simply write C instead of H ∈ p Cp,t in the above proposition. This t-value is same as that for two-dimensional folded Hammersley point sets as given in [7, Lemma 3.3]. Moreover, if one wants to get an explicit value of Cp, it might be better to use the Littlewood-Paley inequality in conjunction with the Haar coefficients of the local discrepancy sym,n function for PH as done in [11], instead of our proof based on the linear independence properties. However, this is beyond the scope of this paper.

5.2 Effect of the truncation on the Lp discrepancy As the second part of the proof of Theorem 24, we show that the difference sym sym,n between the Lp discrepancies of PH and PH is small enough that the Lp sym discrepancy of PH is still of the best possible order for any 1 p< . Namely we show the following. ≤ ∞

17 N sym Proposition 30. For m,n with n m +2, let PH be the symmetrized ∈ ≥ m+2 sym,n Hammersley point set in base b consisting of N = b points and PH be its truncated point set. Then we have 1 L (P sym) L (P sym,n)+ , (7) p H ≤ p H bm+2(n−m−1)/p for any p [1, ). ∈ ∞ Before providing the proof of Proposition 30, it should be mentioned that we arrive at the result of Theorem 24 by combining Propositions 29 and 30 since the second term on the right-hand side of (7) is small enough for any p [1, ) ∈ ∞ that it does not affect the order of the Lp discrepancy when n> 2m.

Proof. From the definition of the Lp discrepancy we have

1/p sym sym p Lp(PH )= ∆(t; PH ) dt 2 | | Z[0,1] ! 1/p sym sym,n sym,n p = ∆(t; PH ) ∆(t; PH ) + ∆(t; PH ) dt 2 | − | Z[0,1] ! 1/p sym,n sym sym,n p Lp(PH )+ ∆(t; PH ) ∆(t; PH ) dt , ≤ 2 | − | Z[0,1] ! where we use the Minkowski inequality. Thus, we shall focus on the second term on the rightmost-hand side in the following. Now for any t [0, 1]2 we have ∈ sym sym,n 1 n n ∆(t; P ) ∆(t; P )= 1 0 (xa,ya) 1 0 (x ,y ) . H − H bm+2 [ ,t) − [ ,t) a a a∈{0,1,...,b−1}m+2 X  For the summand on the right-hand side, we have

n n 1 if (xa,ya) [0, t) and (xa,ya) / [0, t), n n ∈ n n ∈ 1 0 t (xa,ya) 1 0 t (x ,y )= 1 if (xa,ya) / [0, t) and (x ,y ) [0, t), [ , ) − [ , ) a a − ∈ a a ∈ 0 otherwise.

For a given set a = a1,...,am+2 , we have { } n a1 am+1 am am+1 am+1 am+1 am+1 x xa = ⊕ + + ⊕ + + + + + a ≤ b ··· bm bm+1 ··· bn bn+1 ··· a 1 = xn + m+1 xn + . a bn(b 1) ≤ a bn − Similarly we have

n n am+2 n 1 y ya = y + y + . a ≤ a bn(b 1) ≤ a bn − n n This implies that we never have the case where (xa,ya) [0, t) and (xa,ya) / [0, t). Thus, for any t [0, 1]2 we have ∈ ∈ ∈ ∆(t; P sym) ∆(t; P sym,n) | H − H |

18 m+2 n n a 0, 1,...,b 1 : (xa,ya) / [0, t) and (x ,y ) [0, t) = | ∈{ − } ∈ a a ∈ | bm+2  m+2 n n a 0, 1,...,b 1 : xa

References

[1] D. Bilyk, On Roth’s orthogonal function method in discrepancy theory, Uniform Distrib. Theory, 6 (2011) 143–184. [2] H. Davenport, Note on irregularities of distribution, Mathematika, 3 (1956) 131–135.

19 [3] J. Dick, Walsh spaces containing smooth functions and quasi-Monte Carlo rules of arbitrary high order, SIAM J. Numer. Anal., 46 (2008) 1519–1553. [4] J. Dick, Discrepancy bounds for infinite-dimensional order two digital se- quences over F2, J. Number Theory, 136 (2014) 204–232. [5] J. Dick, D. Nuyens, F. Pillichshammer, Lattice rules for nonperiodic smooth integrands, Numer. Math., 126 (2014) 259–291. [6] J. Dick, F. Pillichshammer, Digital nets and sequences: Discrepancy the- ory and quasi-Monte Carlo integration, Cambridge University Press, Cam- bridge, 2010.

[7] T. Goda, On the Lp discrepancy of two-dimensional folded Hammersley point sets, Arch. Math., 103 (2014) 389–398. [8] T. Goda, K. Suzuki, T. Yoshiki, The b-adic tent transformation for quasi- Monte Carlo integration using digital nets, J. Approx. Theory, 194 (2015) 62–86. [9] T. Goda, K. Suzuki, T. Yoshiki, Digital nets with infinite digit expansions and construction of folded digital nets for quasi-Monte Carlo integration, J. Complexity, DOI:10.1016/j.jco.2015.09.005. [10] G. Hal´asz, On Roth’s method in the theory of irregularities of point distri- butions, in: Recent Progress in Analytic Number Theory, Academic Press, London (1981) pp. 79–94.

[11] A. Hinrichs, R. Kritzinger and F. Pillichshammer, Optimal order of Lp- discrepancy of digit shifted Hammersley point sets in dimension 2, Uniform Distrib. Theory, 10 (2015) 115–133.

r [12] R. Kritzinger, Lp- and SpqB-discrepancy of the symmetrized van der Corput sequence and modified Hammersley point sets in base b, ArXiv Preprint, arXiv:1507.01684.

[13] G. Larcher, F. Pillichshammer, Walsh series analysis of the L2 discrepancy of symmetrisized point sets, Monatsh. Math., 132 (2001) 1–18.

[14] G. Larcher, F. Pillichshammer, On the L2 discrepancy of the Sobol- Hammersley net in dimension 3, J. Complexity, 18 (2002) 415–448.

r [15] L. Markhasin, Lp- and Sp,qB-discrepancy of (order 2) digital nets, Acta Arith., 168 (2015) 139–159. [16] J. Matouˇsek, Geometric discrepancy, Springer, Berlin, 1999. [17] H. Niederreiter, Random number generation and quasi-Monte Carlo meth- ods, CBMS-NSF Series in Applied Mathematics, vol. 63, SIAM, Philadel- phia, 1992.

[18] F. Pillichshammer, On the Lp-discrepancy of the Hammersley point set, Monatsh. Math., 136 (2002) 67–79. [19] L. S. Pontryagin, Topological groups, Gordon and Breach Science Publish- ers, Inc., New York, 1966.

20 [20] P. D. Proinov, Symmetrization of the van der Corput generalized sequences, Proc. Japan Acad. Ser. A, 64 (1988) 159–162. [21] K. F. Roth, On irregularities of distribution, Mathematika, 1 (1954) 73–79. [22] W. M. Schmidt, Irregularities of distribution, VI, Acta Arith., 21 (1972) 45–50. [23] W. M. Schmidt, Irregularities of distribution, X, in: Number Theory and Algebra, Academic Press, New York (1977) pp. 311–329. [24] I. H. Sloan, S. Joe, Lattice methods for multiple integration, Oxford Uni- versity Press, Oxford, 1994. [25] I. H. Sloan, H. Wo´zniakowski, When are quasi-Monte Carlo algorithms efficient for high-dimensional integrals?, J. Complexity, 14 (1998) 1–33.

21