
EXAMINING THE ABSOLUTE RATE OF CONVERGENCE OF SUMMABILITY ASSISTED FOURIER SERIES

A dissertation submitted to Kent State University in partial fulfillment of the requirements for the degree of Doctor of Philosophy

by

Brian M. Wright

March, 2007 Dissertation written by

Brian M. Wright

M.A., Kent State University, 2000

B.S./B.S., Bucknell University, 1992

Approved by

Kazim Khan, Chair, Doctoral Dissertation Committee

Charles Gartland, Members, Doctoral Dissertation Committee

Laura Smithies,

C. C. Lu,

Declan Keane,

Accepted by

Andrew Tonge , Chair, Department of Mathematical Sciences

Jerry Feezel , Dean, College of Arts and Sciences

TABLE OF CONTENTS

Acknowledgements

Introduction
  0.1 History and definitions
  0.2 Overview

1 Introduction to Summability Methods and Some Lemmas
  1.1 Why summability methods?
  1.2 Hausdorff summability
  1.3 Preliminary Lemmas
  1.4 Lemma for Chapter 2
  1.5 Lemma for Chapter 3

2 Bound for Absolute Rate of Convergence for a Hausdorff Method
  2.1 Main Theorem
  2.2 How strict a requirement?
  2.3 Example illustrating the sharpness of the bound

3 Results Concerning a Hausdorff Method's Row Total Variation
  3.1 Introduction
  3.2 Two theorems
  3.3 Proof of theorem 3.2.1
  3.4 Proof of theorem 3.2.2

4 Bounds for Tensor Product Hausdorff Transforms
  4.1 Introduction and definitions
  4.2 Extending lemma 1.4.1
  4.3 Some necessary propositions
  4.4 Main theorem and proof
  4.5 Example illustrating the sharpness of the bound
  4.6 Future endeavors

BIBLIOGRAPHY

Acknowledgements

To call this work “my” dissertation would be an unimaginable conceit. Although its

completion required my blood, sweat and tears (and even a few sleepless nights - though I

tried to avoid that like the plague), I want to gratefully acknowledge all those whose help

was invaluable.

First and foremost, I wish to thank my advisor, Dr. Kazim Khan. You were the perfect

advisor for me - someone who let me work at my own pace, yet was willing to give me a

nudge (or kick in the butt) when necessary. The original concept for research area, and many

of the clever solution techniques found within were the product of your fertile imagination.

I hope that we may continue to collaborate on future projects, as the student has not yet

become the master (PhD notwithstanding)!

Second, I have found Kent State University to be a wonderful place to study. Many of the professors have gone out of their way to help and teach, and I am extremely grateful

for how much they aided in my job search. Also, some of my fellow students have become

friends whose friendship I hope to enjoy the rest of my life. Two friends in particular are

Juan Seoane (el carajote numero uno - or was it numero cero?) and Antonia Cardwell (also

a carajote). During the writing of this dissertation, you both have proofread numerous drafts; encouraged me to keep on keeping on; and just listened when I needed to vent. Hugs to both of you!

Last, but not least, I want to thank my family and friends from home for being emotional support as well. I particularly want to dedicate this dissertation to my mother and

grandparents who have been waiting very patiently (usually) for me to finish up my PhD.

Here it is - finally!

OK, I am sure most readers would like to get past all this mushy stuff and get into the

mathematics. Enjoy!

Introduction

0.1 History and definitions

Approximating an arbitrary function by a series of “nicer” functions is nothing new

in mathematics. Perhaps the most famous example is the Taylor series approximation

which approximates a function by a polynomial. This is advantageous since mathematicians

have studied polynomials for hundreds of years and know many results concerning them.

Another useful approximation technique is Fourier series approximation, particularly when

approximating a periodic function, and it is this on which we will focus. A Fourier series

has the desirable qualities that the functions {1/2, cos x, sin x, cos 2x, sin 2x, . . .} are each 2π-periodic and the set is both orthogonal and complete on any interval of length 2π. As

Zygmund points out in [17], many basic results of the theory of functions have been obtained by mathematicians who were working on trigonometric series. For example, the definition of a Riemann integral in its general form first appeared in Riemann's Habilitationsschrift, which is devoted to trigonometric series; the more modern Lebesgue integral was developed in close connection with the theory of Fourier series; and set theory was created by Cantor in his attempts to solve the problem of the sets of uniqueness for trigonometric series. For an integrable function, f, we will denote its Fourier series by:

(1)  S(f, x) ∼ a_0/2 + Σ_{k=1}^{∞} (a_k cos kx + b_k sin kx),

where the coefficients ak and bk are defined as:

(2)  a_k = (1/π) ∫_{−π}^{π} f(t) cos kt dt,   b_k = (1/π) ∫_{−π}^{π} f(t) sin kt dt,   k = 0, 1, 2, . . . .

These coefficient definitions demonstrate another interesting difference between Fourier

series and Taylor series. In Taylor series, the coefficients require the existence of derivatives of the function, while Fourier series’ coefficients require integrability of the function (a much less strict requirement). The ∼ symbol in (1) above indicates we recognize that the infinite

sum need not converge. This convergence, or lack thereof, immediately became crucial in

the study of Fourier series. As a result, it is useful to define the nth partial sum of the

series:

(3)  S_n(f, x) = a_0/2 + Σ_{k=1}^{n} (a_k cos kx + b_k sin kx)  for n = 1, 2, . . . ,  and  S_0(f, x) = a_0/2.

If we substitute (2) into (3), we can derive another well-known formula for the Fourier series

partial sums (see [17] for example):

(4)  S_n(f, x) = (1/π) ∫_{−π}^{π} f(x − u) D_n(u) du,

where

(5)  D_n(u) = n + 1/2  if u = 2mπ for some integer m,  and  D_n(u) = sin((n + 1/2)u) / (2 sin(u/2))  otherwise.

D_n(u) is called the Dirichlet kernel. An equivalent form for the Dirichlet kernel is:

(6)  D_n(u) = 1/2 + Σ_{j=1}^{n} cos(ju).

The figure below illustrates the shape of the Dirichlet kernel for n = 5 and n = 8.

Figure 1: Typical Shapes of the Dirichlet Kernel (n = 5 and n = 8).
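The two expressions (5) and (6) for the Dirichlet kernel agree everywhere, including at the removable singularities u = 2mπ, and this is easy to confirm numerically. The following Python sketch (the function names are ours, chosen for illustration) evaluates both forms:

```python
import math

def dirichlet_closed(n, u):
    # Closed form (5): (n + 1/2) at u = 2*m*pi, else sin((n + 1/2)u) / (2 sin(u/2)).
    if abs(math.sin(u / 2)) < 1e-12:
        return n + 0.5
    return math.sin((n + 0.5) * u) / (2 * math.sin(u / 2))

def dirichlet_sum(n, u):
    # Equivalent form (6): 1/2 + sum_{j=1}^{n} cos(j*u).
    return 0.5 + sum(math.cos(j * u) for j in range(1, n + 1))

for n in (5, 8):
    for u in (0.0, 0.3, 1.7, -2.5):
        assert abs(dirichlet_closed(n, u) - dirichlet_sum(n, u)) < 1e-9
print("forms (5) and (6) agree")
```

The guard on sin(u/2) handles the u = 2mπ case, where the closed form has a removable singularity and equals n + 1/2.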

If we extend our definitions for an and bn to the other integral values of n in the following way:

a_{−n} = a_n (n > 0),   b_{−n} = −b_n (n > 0),

then we can define a new coefficient:

(7)  c_n = (1/2)(a_n − i b_n),

for any n ∈ Z. By substituting (2) into (7), this is equivalent to:

(8)  c_n = (1/(2π)) ∫_{−π}^{π} f(t) e^{−int} dt.

Using this new coefficient, we can express the Fourier series (1) as (see [6], for example):

(9)  S(f, x) ∼ Σ_{k=−∞}^{∞} c_k e^{ikx}.

The partial Fourier sums (3) may be written [6]:

(10)  S_n(f, x) = Σ_{k=−n}^{n} c_k e^{ikx}.

In chapters 1 through 3, we will use (1) and (3) exclusively. In chapter 4, however, we find

it useful to use the other, equivalent, definitions.
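The equivalence of the real form (3) and the complex form (10) can be illustrated numerically: for any real coefficients a_k and b_k, defining c_k by (7) together with the extensions a_{−k} = a_k and b_{−k} = −b_k makes the symmetric sum (10) real and equal to (3). A small Python sketch (the coefficients are random, purely for illustration):

```python
import cmath, math, random

random.seed(0)
n = 6
x = 0.9
# Arbitrary sample Fourier coefficients (illustrative, not from a specific f).
a = [random.uniform(-1, 1) for _ in range(n + 1)]
b = [0.0] + [random.uniform(-1, 1) for _ in range(n)]

# Real form (3): a_0/2 + sum_{k=1}^{n} (a_k cos kx + b_k sin kx).
s_real = a[0] / 2 + sum(a[k] * math.cos(k * x) + b[k] * math.sin(k * x)
                        for k in range(1, n + 1))

# Complex form (10): sum_{k=-n}^{n} c_k e^{ikx}, with c_k = (a_k - i b_k)/2
# and the extensions a_{-k} = a_k, b_{-k} = -b_k.
def c(k):
    ak = a[abs(k)]
    bk = b[abs(k)] if k >= 0 else -b[abs(k)]
    return (ak - 1j * bk) / 2

s_complex = sum(c(k) * cmath.exp(1j * k * x) for k in range(-n, n + 1))

assert abs(s_complex.imag) < 1e-12
assert abs(s_real - s_complex.real) < 1e-12
print("real and complex partial sums agree")
```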

To draw any conclusions about the convergence of the partial sums, we will require

stronger restrictions on f. Therefore, denote

(11)  Var_a^b(f) = sup_{N, {t_i}} Σ_{i=1}^{N} |f(t_i) − f(t_{i−1})|,

where a = t_0 < t_1 < . . . < t_N = b. Var_a^b(f) is called the total variation of the function

f on the interval [a, b]. If the total variation is finite, then the function is said to be of

bounded variation on that interval. The notation BV[a, b] represents the space of functions of bounded variation over the interval [a, b]. It is well known that f ∈ BV[a, b] if and only if there exist bounded, monotonic functions f_1 and f_2 such that f = f_1 − f_2.

This implies BV[a, b] ⊆ L^∞[a, b], so that BV[a, b] ⊆ L^∞[a, b] ⊆ . . . ⊆ L^2[a, b] ⊆ L^1[a, b].
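The supremum in (11) can be approximated from below by evaluating the sum over a fine uniform partition; for a piecewise monotone function these partition sums converge to the true total variation. A short Python sketch (the grid size is an arbitrary choice):

```python
import math

def variation_on_grid(f, a, b, num):
    # Approximates Var_a^b(f) by summing |f(t_i) - f(t_{i-1})| over a fine
    # uniform partition; for piecewise monotone f this converges to the sup.
    ts = [a + (b - a) * i / num for i in range(num + 1)]
    return sum(abs(f(ts[i]) - f(ts[i - 1])) for i in range(1, num + 1))

# sin has total variation 4 on [0, 2*pi]: it rises by 1, falls by 2, rises by 1.
v = variation_on_grid(math.sin, 0.0, 2 * math.pi, 200000)
assert abs(v - 4.0) < 1e-4
print("total variation of sin on [0, 2*pi] is approximately 4")
```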

For f ∈ BV [−π, π] and 2π-periodic, there is a theorem of Dirichlet-Jordan [17] which states that:

(12)  lim_{n→∞} S_n(f, x) = (1/2)[f(x^+) + f(x^−)],

for every x ∈ [−π, π]. Notice that for a periodic f ∈ BV[a, b], this implies that the Fourier series converges (pointwise) to the function itself at points of continuity, and gives a limit value for the convergence at points of simple discontinuity. The rate of

convergence was given by Bojanic in [3] to be (for n ≥ 1):

(13)  |S_n(f, x) − (1/2)[f(x^+) + f(x^−)]| ≤ (3/n) Σ_{k=1}^{n} Var_0^{π/k}(φ_x),

where

(14)  φ_x(t) = f(x + t) + f(x − t) − [f(x^+) + f(x^−)]  for t ≠ 0,  and  φ_x(0) = 0.

The rate of convergence of Fourier series at smooth regions of the function has been well studied, and we will not go into any details here. The interested reader may consult

[17], [8], or [11]. Similarly, there are many interesting results concerning lack of convergence which we will not go into in this dissertation, and the interested reader should consult [1] or [2].

Rather, not only will this dissertation focus on the convergence of Fourier series at points of discontinuity, but we also wish to focus our field of study on summability-assisted Fourier series. Perhaps the simplest and most commonly used summability technique is the Cesàro transform (also called the (C, 1) transform). Here, instead of looking at the convergence of S_n(f, x), we look at the convergence of (1/(n + 1)) Σ_{k=0}^{n} S_k(f, x). Fejér [13] showed that (1/(n + 1)) Σ_{k=0}^{n} S_k(f, x) converges to f(x) when f is 2π-periodic and continuous over [−π, π]. Riesz [17] extended this result by showing that the Cesàro transform converges to

(1/2)[f(x^+) + f(x^−)] provided f is 2π-periodic, f ∈ L^1[−π, π], and x is a point of continuity or simple discontinuity.

More generally, for α > 0, we can define the (C, α) transform as:

(15)  S_n^α(f, x) = (1/B_n^α) Σ_{k=0}^{n} B_{n−k}^{α−1} · S_k(f, x),

where, for t > −1 and n ≥ 0,

B_n^t = Γ(n + t + 1) / (Γ(n + 1) Γ(t + 1)),

and

Γ(z) = ∫_0^∞ e^{−t} t^{z−1} dt,

for z > 0. The gamma function is such that if z is a positive integer, Γ(z) = (z − 1)!. Also

notice that in the case where α = 1, we have the Ces`aro transform, since

B_{n−k}^{α−1} / B_n^α = B_{n−k}^{0} / B_n^{1} = 1/(n + 1).
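The identity above, and the factorial property of Γ, are easy to confirm numerically from the definitions. A brief Python check using the standard-library gamma function:

```python
import math

def B(t, n):
    # B_n^t = Gamma(n + t + 1) / (Gamma(n + 1) * Gamma(t + 1)), for t > -1, n >= 0.
    return math.gamma(n + t + 1) / (math.gamma(n + 1) * math.gamma(t + 1))

# For alpha = 1 the (C, alpha) weights collapse to the Cesaro averages 1/(n+1).
n = 7
for k in range(n + 1):
    assert abs(B(0, n - k) / B(1, n) - 1 / (n + 1)) < 1e-12

# For integer t, B_n^t is the binomial coefficient C(n + t, n).
assert abs(B(2, 5) - math.comb(7, 5)) < 1e-9
print("B-coefficient identities check out")
```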

Both Zygmund [17], and Bojanic and Mazhar [4] showed that the above theorems of Fej´er and Riesz extend for all (C, α) transforms.

In 1999, Humphreys and Bojanic [7] found a rate of convergence for the absolute (C, α) summed Fourier series. In fact, for f being 2π-periodic and f ∈ BV [−π, π], they found the absolute convergence rate to be:

(16)  |S_n^α(f, x) − (1/2)[f(x^+) + f(x^−)]| ≤ Σ_{k=n+1}^{∞} |S_k^α(f, x) − S_{k−1}^α(f, x)| ≤ (4α/(nπ)) Σ_{k=1}^{n} Var_0^{π/k}(φ_x),

for α > 0, and φx as defined in (14). In 2003, Kunyang and Dan [10] showed that the

Humphreys and Bojanic result only held for α ≥ 1, and they found the bound for 0 < α < 1

to be:

(17)  |S_n^α(f, x) − (1/2)[f(x^+) + f(x^−)]| ≤ Σ_{k=n+1}^{∞} |S_k^α(f, x) − S_{k−1}^α(f, x)| ≤ (100/(α² n^α)) Σ_{k=1}^{n} k^{α−1} Var_0^{π/k}(φ_x).

0.2 Overview

In this dissertation, we wish to expand results (16) and (17). In the first chapter, we

will examine the purpose of summability methods in general. We then define a particular

class of summability methods called Hausdorff methods and discuss some notation. Finally, we will introduce and prove several lemmas which will be used in the later chapters.

In the second chapter, we introduce and prove a theorem concerning a bound for the absolute rate of convergence of a Hausdorff method. We then give an example to illustrate the sharpness of the bound. Finally, we examine the strictness of our assumptions in the theorem, and show that our theorem is in fact an extension of previous work done by

Humphreys and Bojanic, and Kunyang and Dan.

In the third chapter, we introduce and prove two new theorems to examine why the different summability methods have different convergence rates, and provide an application 7

of our results to show the equivalence of the submethods of regular Hausdorff methods.

In chapter four, we look at functions of two variables, and give the extensions of all the previous definitions. We then introduce a new lemma and several propositions and extend a previous lemma in order to extend our bounding theorem of chapter 2 to the multivariate case. After stating and proving this extension, we again illustrate the sharpness of our bound.

CHAPTER 1

Introduction to Summability Methods and Some Lemmas

1.1 Why summability methods?

For a general function, f, its Fourier series need not converge, even at a point of continuity. Poisson seems to have been the first person to try to improve convergence using summability methods. He applied Abel's summation technique to Fourier series in a method which is now referred to as, alternately, the Poisson method, Abel method, or A method.

This is a stronger method than the more common (C,1) method - but it is not a Hausdorff method. Since the developments in this dissertation are specifically for Hausdorff methods, we will say no more about the Abel method (the interested reader may consult [6] or [14], for example). Then, as mentioned in the introduction, in 1904 Fej´er showed that the (C, 1) transform of f converges to f at any point of continuity. Later, Lebesgue showed that the (C, 1) transform of f converges to f almost everywhere [6]. These successes led to the

development of summability methods as an entire field in their own right.

As more and more summability methods were being developed, several characteristics were determined to be important. First, it is considered desirable for the method to be linear. Second, and more importantly, the summability method should be regular. That is, anytime the Fourier series’ partial sums converge, the summability assisted sums also converge to the same value, so that the summability method always does at least as well as the original Fourier series in terms of convergence. Necessary and sufficient conditions for a general linear method to be regular were found by Toeplitz, and can be found in [6].


We will present the conditions required for a Hausdorff method to be regular later in the chapter.

Although better convergence is considered the best benefit of using a summability method, it is not the only one. After all, since we will be restricting ourselves to functions f which are 2π-periodic and of bounded variation on [−π, π], we are not worried about convergence. Recall from the introduction, that there is a theorem of Dirichlet-Jordan (12)

which states that for such functions, the regular Fourier series itself will converge for every x ∈ [−π, π]. So what can we gain from using a Hausdorff summability method? Well, summability methods sometimes have more desirable properties than the original Fourier series. For example, it is well known [14] that a Fourier series exhibits the Gibbs phenomenon near a jump discontinuity of the original function. Sometimes a summability method will be used expressly to eliminate this (although not every summability method eliminates the Gibbs phenomenon). For instance, Gronwall [5] has shown that there exists a constant, c, such that whenever α < c the (C, α) method preserves the Gibbs phenomenon, while when α ≥ c the (C, α) method eliminates it. Gronwall also found this constant to be approximately 0.4395. Hence, someone wishing to preserve the occurrence of the Gibbs phenomenon, and still needing to quantify the rate of convergence, need only choose a (C, α) method with an appropriately small α, while someone wishing to eliminate the Gibbs phenomenon need only choose a (C, α) method with an appropriately large α.

Finally, notice that while Bojanic (13) gives a bound for the rate of convergence of functions of bounded variation, Humphreys and Bojanic (16) and Kunyang and Dan (17) are able to give a bound on the absolute convergence rate of the same type of functions which are (C, α) assisted. Even though we generalize to all Hausdorff methods, we also will

find a bound for the absolute rate of convergence.

1.2 Hausdorff summability

For the rest of this dissertation, we wish to focus on Hausdorff summability methods.

Let f be a 2π-periodic function, and let f ∈ BV[−π, π]. The Hausdorff transform, (H_Φ S)_k, of the partial sums of f is defined to be:

(1.1)  (H_Φ S)_k = Σ_{j=0}^{k} h_{k,j} S_j(f, x),   k = 0, 1, 2, . . . ,

where

(1.2)  h_{k,j} = ∫_0^1 P(X_{k,r} = j) dΦ(r)  for 0 ≤ j ≤ k,  and  h_{k,j} = 0  otherwise,

for Φ ∈ BV[0, 1], where X_{k,r} is a binomially distributed random variable and P(X_{k,r} = j) =

(k choose j) r^j (1 − r)^{k−j} is the probability of getting j successes in k independent trials, each of which results in a success with probability r. A Hausdorff transformation, as defined above, is regular if and only if the following two conditions hold [13]:

• Φ(r) is continuous from the right at r = 0,

• ∫_0^1 dΦ(r) = Φ(1) − Φ(0) = 1.

Without loss of generality, we will assume Φ(0) = 0, in which case the regularity condi- tions become:

• Φ(0+) = Φ(0) = 0.

• Φ(1) = 1.
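Either set of conditions forces every row of the Hausdorff matrix to sum to Φ(1) − Φ(0) = 1, because the binomial probabilities in (1.2) sum to 1 for each fixed r. The Euler methods make this concrete: their Φ is a unit jump at some r = c, so the integral in (1.2) collapses to a single binomial probability. A brief Python sketch (c = 0.5 is an arbitrary illustrative choice):

```python
import math

def euler_row(k, c):
    # Euler-method Hausdorff weights: Phi jumps from 0 to 1 at r = c, so
    # h_{k,j} = P(X_{k,c} = j) = C(k, j) c^j (1 - c)^(k - j).
    return [math.comb(k, j) * c**j * (1 - c) ** (k - j) for j in range(k + 1)]

# Regularity forces each row of the Hausdorff matrix to sum to
# Phi(1) - Phi(0) = 1, since the binomial probabilities sum to 1 for every r.
for k in (1, 4, 9):
    row = euler_row(k, 0.5)
    assert abs(sum(row) - 1.0) < 1e-12
print("each row sums to 1")
```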

Some particular regular Hausdorff methods are the Cesàro (C, α) methods, in which dΦ/dr = α(1 − r)^{α−1}; the Hölder (H, α) methods, in which dΦ/dr = (1/Γ(α)) [ln(1/r)]^{α−1}; and the Euler

methods in which Φ(r) equals zero on [0, c) and equals one on [c, 1] for some fixed c ∈ (0, 1).

The interested reader can verify that this definition of the (C, α) transform corresponds with that given in (15).
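One way to carry out that verification numerically is to evaluate the integral in (1.2) with dΦ/dr = α(1 − r)^{α−1} by quadrature and compare against the (C, α) weights B_{k−j}^{α−1}/B_k^α from (15). A Python sketch using a crude midpoint rule (α = 2, k = 6, and the grid size are arbitrary illustrative choices):

```python
import math

def cesaro_h(k, j, alpha, num=20000):
    # h_{k,j} = int_0^1 C(k, j) r^j (1-r)^(k-j) dPhi(r), with
    # dPhi/dr = alpha (1 - r)^(alpha - 1); midpoint-rule quadrature.
    total = 0.0
    for i in range(num):
        r = (i + 0.5) / num
        total += math.comb(k, j) * r**j * (1 - r) ** (k - j) \
                 * alpha * (1 - r) ** (alpha - 1)
    return total / num

def B(t, n):
    return math.gamma(n + t + 1) / (math.gamma(n + 1) * math.gamma(t + 1))

# The Hausdorff weights for this Phi reproduce the (C, alpha) weights of (15).
alpha, k = 2.0, 6
for j in range(k + 1):
    assert abs(cesaro_h(k, j, alpha) - B(alpha - 1, k - j) / B(alpha, k)) < 1e-6
print("Cesaro Hausdorff weights match (15)")
```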

1.3 Preliminary Lemmas

The following two lemmas will be useful later in this chapter.

Lemma 1.3.1 Let j, k ∈ N, k ≥ 2, and 0 ≤ j ≤ k. Also let X_{k,r} be a binomially distributed random variable. Then

(k − j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1) = k · P(X_{k−1,r} = j).

Proof

If j = k, both sides are trivially zero. For j < k:

(k − j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1)
  = (k − j) (k choose j) r^j (1 − r)^{k−j} + (j + 1) (k choose j+1) r^{j+1} (1 − r)^{k−j−1}
  = [k!(k − j) / (j!(k − j)!)] r^j (1 − r)^{k−j} + [k!(j + 1) / ((j + 1)!(k − j − 1)!)] r^{j+1} (1 − r)^{k−j−1}
  = [k! / (j!(k − j − 1)!)] r^j (1 − r)^{k−j−1} [(1 − r) + r]
  = k · (k−1 choose j) r^j (1 − r)^{k−1−j}
  = k · P(X_{k−1,r} = j).
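Since both sides of the lemma are polynomials in r, a numerical spot check over a few values of j, k, and r is a quick safeguard against transcription errors. A minimal Python sketch:

```python
import math

def pmf(k, r, j):
    # P(X_{k,r} = j) for a binomial random variable with k trials.
    if j < 0 or j > k:
        return 0.0
    return math.comb(k, j) * r**j * (1 - r) ** (k - j)

# Lemma 1.3.1: (k - j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1)
#            = k * P(X_{k-1,r} = j), for 0 <= j <= k.
for k in (2, 5, 9):
    for j in range(k + 1):
        for r in (0.2, 0.5, 0.77):
            lhs = (k - j) * pmf(k, r, j) + (j + 1) * pmf(k, r, j + 1)
            rhs = k * pmf(k - 1, r, j)
            assert abs(lhs - rhs) < 1e-12
print("Lemma 1.3.1 verified numerically")
```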

Lemma 1.3.2 If a_j and b_j are the Fourier coefficients as defined in (2) and φ_x is as defined in (14), then for j ≥ 1:

a_j cos(jx) + b_j sin(jx) = (1/π) ∫_0^π φ_x(t) cos(jt) dt.

Proof

Using the definitions of the Fourier coefficients, we have:

a_j cos(jx) + b_j sin(jx) = (1/π) ∫_{−π}^{π} f(t) cos(jt) cos(jx) dt + (1/π) ∫_{−π}^{π} f(t) sin(jt) sin(jx) dt
  = (1/π) ∫_{−π}^{π} f(t) cos[j(x − t)] dt,

using the trigonometric identity cos(a − b) = cos a cos b + sin a sin b. Making the substitution u = x − t, we obtain:

a_j cos(jx) + b_j sin(jx) = (1/π) ∫_{x−π}^{x+π} f(x − u) cos(ju) du
  = (1/π) ∫_{−π}^{π} f(x − u) cos(ju) du
  = (1/π) ∫_{−π}^{0} f(x − u) cos(ju) du + (1/π) ∫_0^π f(x − u) cos(ju) du.

Making the change of variable v = −u in the first integral yields:

a_j cos(jx) + b_j sin(jx) = (1/π) ∫_0^π f(x + v) cos(−jv) dv + (1/π) ∫_0^π f(x − u) cos(ju) du
  = (1/π) ∫_0^π f(x + v) cos(jv) dv + (1/π) ∫_0^π f(x − u) cos(ju) du.

Changing the dummy variables u and v to t, and combining the two integrals, gives:

a_j cos(jx) + b_j sin(jx) = (1/π) ∫_0^π [f(x + t) + f(x − t)] cos(jt) dt
  = (1/π) ∫_0^π [f(x + t) + f(x − t) − f(x^+) − f(x^−)] cos(jt) dt
  = (1/π) ∫_0^π φ_x(t) cos(jt) dt,

since f(x^+) and f(x^−) are just constants (with respect to t), and the integral of a constant times cosine from 0 to π equals zero.
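Lemma 1.3.2 can also be checked numerically for a concrete function: compute a_j and b_j by quadrature and compare the two sides at a point of continuity, where f(x^+) = f(x^−) = f(x). A Python sketch (the sample function exp(cos t), the point x, and the quadrature resolution are arbitrary illustrative choices):

```python
import math

def f(t):
    # A smooth 2*pi-periodic sample function (illustrative choice).
    return math.exp(math.cos(t))

def midpoint(g, a, b, num=40000):
    # Midpoint-rule quadrature, accurate enough here for a 1e-6 comparison.
    h = (b - a) / num
    return h * sum(g(a + (i + 0.5) * h) for i in range(num))

x, j = 0.8, 3
aj = midpoint(lambda t: f(t) * math.cos(j * t), -math.pi, math.pi) / math.pi
bj = midpoint(lambda t: f(t) * math.sin(j * t), -math.pi, math.pi) / math.pi

# phi_x from (14); at a point of continuity f(x+) = f(x-) = f(x).
phi = lambda t: f(x + t) + f(x - t) - 2 * f(x)
rhs = midpoint(lambda t: phi(t) * math.cos(j * t), 0.0, math.pi) / math.pi

lhs = aj * math.cos(j * x) + bj * math.sin(j * x)
assert abs(lhs - rhs) < 1e-6
print("Lemma 1.3.2 verified numerically")
```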

1.4 Lemma for Chapter 2

Here we introduce and prove a lemma which we will use in chapter 2 in the proof of our main theorem.

Lemma 1.4.1 Let f be a 2π-periodic function, let h_{k,j} be defined as in (1.2), and let φ_x be as defined in (14). For any Φ(r), if (H_Φ S)_k is the Hausdorff transform of the Fourier

series, then:

(H_Φ S)_k − (H_Φ S)_{k−1} = (1/(kπ)) ∫_0^π φ_x(t) Σ_{j=0}^{k} h_{k,j} j cos(jt) dt.

Proof

(H_Φ S)_k − (H_Φ S)_{k−1} = (1/k) [k · (H_Φ S)_k − k · (H_Φ S)_{k−1}]
  = (1/k) [k · (H_Φ S)_k − k Σ_{j=0}^{k−1} S_j ∫_0^1 P(X_{k−1,r} = j) dΦ(r)]
  = (1/k) [k · (H_Φ S)_k − Σ_{j=0}^{k−1} S_j ∫_0^1 k · P(X_{k−1,r} = j) dΦ(r)].

Now we will use lemma 1.3.1 to get:

(H_Φ S)_k − (H_Φ S)_{k−1}
  = (1/k) [k · (H_Φ S)_k − Σ_{j=0}^{k−1} S_j ∫_0^1 [(k − j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1)] dΦ(r)]
  = (1/k) [k Σ_{j=0}^{k} S_j ∫_0^1 P(X_{k,r} = j) dΦ(r) − Σ_{j=0}^{k−1} S_j ∫_0^1 [(k − j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1)] dΦ(r)]
  = (1/k) [k S_k ∫_0^1 P(X_{k,r} = k) dΦ(r) + Σ_{j=0}^{k−1} k S_j ∫_0^1 P(X_{k,r} = j) dΦ(r) − Σ_{j=0}^{k−1} S_j ∫_0^1 [(k − j) P(X_{k,r} = j) + (j + 1) P(X_{k,r} = j + 1)] dΦ(r)]
  = (1/k) [k S_k ∫_0^1 P(X_{k,r} = k) dΦ(r) + Σ_{j=0}^{k−1} j S_j ∫_0^1 P(X_{k,r} = j) dΦ(r) − Σ_{j=0}^{k−1} (j + 1) S_j ∫_0^1 P(X_{k,r} = j + 1) dΦ(r)].

Now making the change of the index of summation i = j + 1 in the last sum gives:

(H_Φ S)_k − (H_Φ S)_{k−1}
  = (1/k) [k S_k ∫_0^1 P(X_{k,r} = k) dΦ(r) + Σ_{j=0}^{k−1} j S_j ∫_0^1 P(X_{k,r} = j) dΦ(r) − Σ_{i=1}^{k} i S_{i−1} ∫_0^1 P(X_{k,r} = i) dΦ(r)]
  = (1/k) [Σ_{j=0}^{k} j S_j ∫_0^1 P(X_{k,r} = j) dΦ(r) − Σ_{i=0}^{k} i S_{i−1} ∫_0^1 P(X_{k,r} = i) dΦ(r)]
  = (1/k) Σ_{j=0}^{k} j (S_j − S_{j−1}) ∫_0^1 P(X_{k,r} = j) dΦ(r)
  = (1/k) Σ_{j=0}^{k} j (a_j cos(jx) + b_j sin(jx)) ∫_0^1 P(X_{k,r} = j) dΦ(r),

by the definition (3) of the partial Fourier sums. Therefore, using lemma 1.3.2:

(H_Φ S)_k − (H_Φ S)_{k−1} = (1/k) Σ_{j=0}^{k} j [(1/π) ∫_0^π φ_x(t) cos(jt) dt] ∫_0^1 P(X_{k,r} = j) dΦ(r)
  = (1/(kπ)) ∫_0^π φ_x(t) Σ_{j=0}^{k} h_{k,j} j cos(jt) dt.
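Up to the final application of lemma 1.3.2, the proof establishes the discrete identity (H_Φ S)_k − (H_Φ S)_{k−1} = (1/k) Σ_{j=0}^{k} j (S_j − S_{j−1}) h_{k,j}, valid for an arbitrary sequence of partial sums. This can be verified numerically for, say, an Euler method, where h_{k,j} = P(X_{k,c} = j). A Python sketch (the random sequence S and c = 0.4 are illustrative choices):

```python
import math, random

def pmf(k, r, j):
    # Binomial probability P(X_{k,r} = j).
    return math.comb(k, j) * r**j * (1 - r) ** (k - j)

random.seed(1)
c = 0.4          # Euler method: Phi is a unit jump at r = c, so h_{k,j} = P(X_{k,c} = j)
S = [random.uniform(-2, 2) for _ in range(11)]   # arbitrary "partial sums" S_0..S_10

for k in (3, 7, 10):
    # (H_Phi S)_k - (H_Phi S)_{k-1}, computed directly from the weights ...
    lhs = sum(pmf(k, c, j) * S[j] for j in range(k + 1)) \
        - sum(pmf(k - 1, c, j) * S[j] for j in range(k))
    # ... equals (1/k) * sum_j j (S_j - S_{j-1}) h_{k,j}.
    rhs = sum(j * (S[j] - S[j - 1]) * pmf(k, c, j) for j in range(1, k + 1)) / k
    assert abs(lhs - rhs) < 1e-12
print("difference identity verified")
```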

1.5 Lemma for Chapter 3

Next, we will introduce and prove a lemma which we will use in chapter 3.

Lemma 1.5.1 Let h_{n,k} be the coefficients for a regular Hausdorff method as defined in (1.1) for the weight function Φ, and let n, k ∈ N with 0 ≤ k ≤ n. Define h_{n,k}^Ψ to be the coefficients for the Hausdorff method with weight function Ψ such that dΨ(r) = r · dΦ(r); that is, the

Radon-Nikodym derivative of Ψ(r) with respect to Φ(r) is r. Then:

|h_{n+1,k} − h_{n,k}| = |h_{n,k}^Ψ − h_{n,k−1}^Ψ|.

Proof

When k = 0,

h_{n+1,0} − h_{n,0} = ∫_0^1 (n+1 choose 0) r^0 (1 − r)^{n+1} dΦ(r) − ∫_0^1 (n choose 0) r^0 (1 − r)^n dΦ(r)
  = ∫_0^1 (1 − r)^n (1 − r − 1) dΦ(r)
  = −∫_0^1 r(1 − r)^n dΦ(r).

While:

h_{n,0}^Ψ − h_{n,−1}^Ψ = ∫_0^1 (n choose 0) r^0 (1 − r)^n dΨ(r) − 0
  = ∫_0^1 r(1 − r)^n dΦ(r).

Comparing the absolute values of the above results shows that the lemma holds for k = 0.

When 0 < k ≤ n, we have:

h_{n+1,k} − h_{n,k} = ∫_0^1 (n+1 choose k) r^k (1 − r)^{n+1−k} dΦ(r) − ∫_0^1 (n choose k) r^k (1 − r)^{n−k} dΦ(r)
  = ∫_0^1 [r^k (1 − r)^{n−k} n! / (k!(n − k)!)] · [(n + 1)(1 − r)/(n + 1 − k) − 1] dΦ(r)
  = ∫_0^1 [r^k (1 − r)^{n−k} n! / (k!(n − k)!)] · [(k − (n + 1)r)/(n + 1 − k)] dΦ(r).

Thus:

(1.3)  h_{n+1,k} − h_{n,k} = ∫_0^1 [r^k (1 − r)^{n−k} n! / (k!(n + 1 − k)!)] · [k − (n + 1)r] dΦ(r).

Meanwhile:

h_{n,k}^Ψ − h_{n,k−1}^Ψ = ∫_0^1 (n choose k) r^k (1 − r)^{n−k} dΨ(r) − ∫_0^1 (n choose k−1) r^{k−1} (1 − r)^{n−k+1} dΨ(r)
  = ∫_0^1 [r^k (1 − r)^{n−k} n! / ((k − 1)!(n − k)!)] · [r/k − (1 − r)/(n − k + 1)] dΦ(r)
  = ∫_0^1 [r^k (1 − r)^{n−k} n! / ((k − 1)!(n − k)!)] · [((n + 1)r − k)/(k(n − k + 1))] dΦ(r)
  = ∫_0^1 [r^k (1 − r)^{n−k} n! / (k!(n − k + 1)!)] · [(n + 1)r − k] dΦ(r).

So:

(1.4)  h_{n,k}^Ψ − h_{n,k−1}^Ψ = −∫_0^1 [r^k (1 − r)^{n−k} n! / (k!(n − k + 1)!)] · [k − (n + 1)r] dΦ(r).

Comparing (1.3) with (1.4), we obtain the proof of lemma 1.5.1.

CHAPTER 2

Bound for Absolute Rate of Convergence for a Hausdorff Method

Now that all the pieces are in place, we are ready to give a bound for the Hausdorff transform.

2.1 Main Theorem

Theorem 2.1.1 If f ∈ BV[−π, π] and 2π-periodic, and if the Hausdorff transform of the sequence (sin jt, j = 0, 1, 2, . . .) is such that |Σ_{j=0}^{k} h_{k,j} sin(jt)| = O((kt)^{−β}) for every t ∈ [π/k, π] and some constant β ∈ (0, 1], then for some constant C

R_n^Φ(f, x) := Σ_{k=n+1}^{∞} |(H_Φ S)_k − (H_Φ S)_{k−1}| ≤ (C/n^β) Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x),   n ≥ 2.

Proof

Using Lemma 1.4.1, we have:

R_n^Φ(f, x) = Σ_{k=n+1}^{∞} |(H_Φ S)_k − (H_Φ S)_{k−1}|
  = Σ_{k=n+1}^{∞} (1/(kπ)) |∫_0^π φ_x(t) Σ_{j=0}^{k} h_{k,j} j cos(jt) dt|
  = Σ_{k=n+1}^{∞} (1/(kπ)) |∫_0^π (Σ_{j=0}^{k} h_{k,j} j cos(jt)) φ_x(t) dt|
  = Σ_{k=n+1}^{∞} (1/(kπ)) |∫_0^π Σ_{j=0}^{k} h_{k,j} sin(jt) dφ_x(t)|

after integrating by parts. Thus:

R_n^Φ(f, x) = Σ_{k=n+1}^{∞} (1/(kπ)) |∫_0^π (Σ_{j=0}^{k} h_{k,j} sin(jt)) dφ_x(t)|
  ≤ Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^π |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)

since we had a Riemann-Stieltjes integral. Therefore,

(2.1)  R_n^Φ(f, x) ≤ Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)
         + Σ_{k=n+1}^{∞} (1/(kπ)) ∫_{π/k}^{π} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x).

Let us look at each part of the right hand side of the inequality separately. For the first term:

Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)
  ≤ Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} Σ_{j=0}^{k} h_{k,j} |sin(jt)| dVar_0^t(φ_x)
  ≤ Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} Σ_{j=0}^{k} h_{k,j} (jt) dVar_0^t(φ_x)
  = Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} Σ_{j=0}^{k} [∫_0^1 P(X_{k,r} = j) dΦ(r)] (jt) dVar_0^t(φ_x)
  = Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} [∫_0^1 Σ_{j=0}^{k} j P(X_{k,r} = j) dΦ(r)] t dVar_0^t(φ_x).

From probability theory, we recognize the sum in the parentheses as the expectation of a

binomial random variable, which is equal to kr. So:

∞ π k ∞ π 1 1 k 1 k h sin(jt) dV art (φ ) ≤ kr dΦ(r) t dV art (φ ). kπ k,j 0 x kπ 0 x k=n+1 Z0 j=0 k=n+1 Z0 Z0  X X X

19

Let c_1 = ∫_0^1 r dΦ(r). Then, we have:

Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x) ≤ (c_1/π) Σ_{k=n+1}^{∞} ∫_0^{π/k} t dVar_0^t(φ_x).

At this point, recall the indicator function, I, is defined as:

(2.2)  I[a] = 1 if a is true, and I[a] = 0 if a is false.

 Using this, we have:

Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x) ≤ (c_1/π) Σ_{k=n+1}^{∞} ∫_0^π I[kt ≤ π] t dVar_0^t(φ_x).

Since I, t, and dVar_0^t(φ_x) are all nonnegative, Fubini's theorem [15] gives:

(2.3)  Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x) ≤ (c_1/π) ∫_0^π Σ_{k=n+1}^{∞} I[kt ≤ π] t dVar_0^t(φ_x).

Now let 0 < ε < π/n. Then notice:

(c_1/π) ∫_ε^π Σ_{k=n+1}^{∞} I[kt ≤ π] t dVar_0^t(φ_x) = (c_1/π) ∫_ε^{π/n} (Σ_{n<k≤π/t} t) dVar_0^t(φ_x)
  ≤ (c_1/π) ∫_ε^{π/n} π dVar_0^t(φ_x)
  ≤ c_1 Var_0^{π/n}(φ_x),

since for t > π/n no k > n satisfies kt ≤ π, and for any t the sum Σ_{n<k≤π/t} t contains at most π/t terms, each equal to t.

Since this is true for any 0 < ε < π/n, (2.3) becomes:

(2.4)  Σ_{k=n+1}^{∞} (1/(kπ)) ∫_0^{π/k} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x) ≤ c_1 Var_0^{π/n}(φ_x).

As for the second part of (2.1), by assumption there is some positive constant c_2 such that:

Σ_{k=n+1}^{∞} (1/(kπ)) ∫_{π/k}^{π} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)
  ≤ Σ_{k=n+1}^{∞} (1/(kπ)) ∫_{π/k}^{π} [c_2/(kt)^β] dVar_0^t(φ_x)
  = Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/k}^{π} (1/t^β) dVar_0^t(φ_x).

So:

(2.5)  Σ_{k=n+1}^{∞} (1/(kπ)) ∫_{π/k}^{π} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)
  ≤ Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] (∫_{π/k}^{π/n} (1/t^β) dVar_0^t(φ_x) + ∫_{π/n}^{π} (1/t^β) dVar_0^t(φ_x)).

Let us look at each of these two terms separately. First, notice that:

Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/k}^{π/n} (1/t^β) dVar_0^t(φ_x) = Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_0^{π/n} I[kt > π] (1/t^β) dVar_0^t(φ_x).

So, again using Fubini's theorem:

(2.6)  Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/k}^{π/n} (1/t^β) dVar_0^t(φ_x) = ∫_0^{π/n} Σ_{k=n+1}^{∞} I[kt > π] [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x).

Now, for any 0 < δ < π/n, we have:

∫_δ^{π/n} Σ_{k=n+1}^{∞} I[kt > π] [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x) ≤ ∫_δ^{π/n} Σ_{k>π/t} [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x)
  = ∫_δ^{π/n} Σ_{k=[π/t]+1}^{∞} [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x),

where [π/t] represents the greatest integer ≤ π/t. Then,

∫_δ^{π/n} Σ_{k=n+1}^{∞} I[kt > π] [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x)
  ≤ ∫_δ^{π/n} [c_2/(π t^β)] Σ_{k=[π/t]+1}^{∞} (1/k^{β+1}) dVar_0^t(φ_x)
  = ∫_δ^{π/n} [c_2/(π t^β)] (1/([π/t] + 1)^{β+1} + Σ_{k=[π/t]+2}^{∞} 1/k^{β+1}) dVar_0^t(φ_x)
  ≤ ∫_δ^{π/n} [c_2/(π t^β)] (1/([π/t] + 1)^{β+1} + ∫_{[π/t]+1}^{∞} du/u^{β+1}) dVar_0^t(φ_x)
  ≤ ∫_δ^{π/n} [c_2/(π t^β)] (t^{β+1}/π^{β+1} + ∫_{π/t}^{∞} du/u^{β+1}) dVar_0^t(φ_x)
  = ∫_δ^{π/n} [c_2/(π t^β)] (t^{β+1}/π^{β+1} + t^β/(βπ^β)) dVar_0^t(φ_x)
  ≤ ∫_δ^{π/n} [c_2/(π t^β)] (t^β/(βπ^β) + t^β/(βπ^β)) dVar_0^t(φ_x),

as 0 < β ≤ 1 < n ≤ π/t. Therefore,

∫_δ^{π/n} Σ_{k=n+1}^{∞} I[kt > π] [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x) ≤ [2c_2/(βπ^{β+1})] ∫_δ^{π/n} dVar_0^t(φ_x)
  ≤ [2c_2/(βπ^{β+1})] ∫_0^{π/n} dVar_0^t(φ_x)
  = [2c_2/(βπ^{β+1})] Var_0^{π/n}(φ_x).

Since this is true for all 0 < δ < π/n, (2.6) becomes:

(2.7)  Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/k}^{π/n} (1/t^β) dVar_0^t(φ_x) ≤ [2c_2/(βπ^{β+1})] Var_0^{π/n}(φ_x).

As for the second term in (2.5):

Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/n}^{π} (1/t^β) dVar_0^t(φ_x) = ∫_{π/n}^{π} Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] (1/t^β) dVar_0^t(φ_x)
  = ∫_{π/n}^{π} [c_2/(π t^β)] (Σ_{k=n+1}^{∞} 1/k^{β+1}) dVar_0^t(φ_x)
  ≤ ∫_{π/n}^{π} [c_2/(π t^β)] (∫_n^{∞} du/u^{β+1}) dVar_0^t(φ_x)
  = ∫_{π/n}^{π} [c_2/(βπ t^β n^β)] dVar_0^t(φ_x)
  = [c_2/(βπ n^β)] ∫_{π/n}^{π} (1/t^β) dVar_0^t(φ_x).

Integrating by parts, we obtain:

Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/n}^{π} (1/t^β) dVar_0^t(φ_x)
  ≤ [c_2/(βπ n^β)] ([(1/t^β) Var_0^t(φ_x)]_{π/n}^{π} + β ∫_{π/n}^{π} [Var_0^t(φ_x)/t^{β+1}] dt)
  = [c_2/(βπ^{β+1} n^β)] Var_0^π(φ_x) − [c_2/(βπ^{β+1})] Var_0^{π/n}(φ_x) + [c_2/(π n^β)] ∫_{π/n}^{π} [Var_0^t(φ_x)/t^{β+1}] dt.

Making the substitution u = π/t in the integral gives:

∫_{π/n}^{π} [Var_0^t(φ_x)/t^{β+1}] dt = (1/π^β) ∫_1^n Var_0^{π/u}(φ_x) u^{β−1} du
  = (1/π^β) Σ_{k=1}^{n−1} ∫_k^{k+1} Var_0^{π/u}(φ_x) u^{β−1} du
  ≤ (1/π^β) Σ_{k=1}^{n−1} Var_0^{π/k}(φ_x) ∫_k^{k+1} u^{β−1} du
  = (1/(βπ^β)) Σ_{k=1}^{n−1} Var_0^{π/k}(φ_x) [(k + 1)^β − k^β]
  ≤ (c_3/(βπ^β)) Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x),

for some constant c_3. Thus,

(2.8)  Σ_{k=n+1}^{∞} [c_2/(π k^{β+1})] ∫_{π/n}^{π} (1/t^β) dVar_0^t(φ_x)
  ≤ [c_2/(βπ^{β+1} n^β)] Var_0^π(φ_x) − [c_2/(βπ^{β+1})] Var_0^{π/n}(φ_x) + [c_2 c_3/(βπ^{β+1} n^β)] Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).

Plugging (2.7) and (2.8) back into (2.5) gives:

(2.9)  Σ_{k=n+1}^{∞} (1/(kπ)) ∫_{π/k}^{π} |Σ_{j=0}^{k} h_{k,j} sin(jt)| dVar_0^t(φ_x)
  ≤ [c_2/(βπ^{β+1} n^β)] Var_0^π(φ_x) + [c_2/(βπ^{β+1})] Var_0^{π/n}(φ_x) + [c_2 c_3/(βπ^{β+1} n^β)] Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).

And, finally, substituting (2.4) and (2.9) into (2.1), gives:

R_n^Φ(f, x) ≤ [c_2/(βπ^{β+1} n^β)] Var_0^π(φ_x) + (c_1 + c_2/(βπ^{β+1})) Var_0^{π/n}(φ_x) + [c_2 c_3/(βπ^{β+1} n^β)] Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).

Notice that the first term above resembles (except for a different multiplicative constant) the k = 1 term in the sum. Therefore, by adjusting the multiplicative constant on the

summation, we can absorb the first term. Thus:

(2.10)  R_n^Φ(f, x) ≤ (c_1 + c_2/(βπ^{β+1})) Var_0^{π/n}(φ_x) + (Ĉ/n^β) Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).

Now, to finish the proof, we need to show that the remaining term outside the sum can

also be absorbed in the summation. To do this, look at the weighted mean Σ_{k=1}^{n} p_k Var_0^{π/k}(φ_x), where Σ_{k=1}^{n} p_k = 1 and 0 ≤ p_k ≤ 1 for every k. By properties of a weighted mean, the mean is always greater than (or equal to) the smallest of the individual data values, and since the total variation is a nondecreasing function, the smallest variation used is Var_0^{π/n}(φ_x); i.e. Var_0^{π/n}(φ_x) ≤ Σ_{k=1}^{n} p_k Var_0^{π/k}(φ_x). Now, we will choose to let p_k = d · k^{β−1}/n^β. Obviously, we can find such a d for a fixed n, but we want to ascertain that d remains bounded as n → ∞.

To see this, notice that:

1 = Σ_{k=1}^{n} p_k = (d/n^β) Σ_{k=1}^{n} k^{β−1} ≥ (d/n^β) ∫_1^n x^{β−1} dx = [d/(β n^β)] (n^β − 1).

d ≤ β n^β/(n^β − 1) = β · (n^β − 1 + 1)/(n^β − 1) = β · (1 + 1/(n^β − 1)).

Provided n ≥ 2, we can bound d by:

d ≤ β · (1 + 1/(2^β − 1)).

Thus,

(2.11)  Var_0^{π/n}(φ_x) ≤ β · (1 + 1/(2^β − 1)) · (1/n^β) Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).

Substituting (2.11) back into (2.10), we get, for some new positive constant C,

R_n^Φ(f, x) ≤ (C/n^β) Σ_{k=1}^{n} k^{β−1} Var_0^{π/k}(φ_x).
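The boundedness of the normalizing constant d established above can be sanity-checked numerically across several values of β and n. A short Python sketch:

```python
# d = n^beta / sum_{k=1}^{n} k^(beta-1) is the normalizing constant in the
# weighted-mean argument; the proof bounds it by beta * (1 + 1/(2^beta - 1))
# for n >= 2.  A quick numeric check of that bound:
for beta in (0.25, 0.5, 0.9, 1.0):
    bound = beta * (1 + 1 / (2**beta - 1))
    for n in (2, 10, 1000):
        d = n**beta / sum(k ** (beta - 1) for k in range(1, n + 1))
        assert 0 < d <= bound
print("d stays bounded as n grows")
```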

2.2 How strict a requirement?

Now that the theorem has been proven, let us examine how strict a requirement it is in theorem 2.1.1 that |Σ_{j=0}^{k} h_{k,j} sin(jt)| = O((kt)^{−β}) for every t ∈ [π/k, π] and some constant β ∈ (0, 1]. In the introduction, we have already discussed the work of Humphreys and

Bojanic [7], and Kunyang and Dan [10]. In [7], Humphreys and Bojanic show in lemma

2.2 that this requirement holds in the (C, α) case with α ≥ 1 for β = 1. In [10], Kunyang

and Dan show in lemma 2 that this requirement holds in the (C, α) case with 0 < α < 1

for β = α. Thus, our theorem can be applied for any (C, α) transform, α > 0, and it

extends both of these results. The natural question which then arises is: Besides the (C, α)

transforms, how can we tell which of the Hausdorff transforms satisfy this requirement? We

will answer this question with the first of two theorems presented in chapter 3.

2.3 Example illustrating the sharpness of the bound

To show the sharpness of the bound, we want to show that there exists a Hausdorff

assisted transform of some function which converges only as quickly as the bound indicates.

The following example was given by Kunyang and Dan in [10]. Define the function, f, by:

(2.12)  f(x) = Σ_{k=1}^{∞} sin(kx)/k = (π − x)/2  when 0 < x < 2π,  and  f(x) = 0  when x = 0,

on [0, 2π], and extend to the entire real line by making f 2π-periodic. Kunyang and Dan show that the convergence rate of this particular function under the (C, α) transform at

x = π/2 when 0 < α < 1 is:

$$R_n^\alpha(f,x) > \frac{1}{2000\,\alpha\, n^\alpha}\sum_{k=1}^{n} k^{\alpha-1}\,\mathrm{Var}_0^{\pi/k}(\phi_x).$$
Hence, the bound given cannot be improved without further assumptions.

CHAPTER 3

Results Concerning a Hausdorff Method’s Row Total Variation

3.1 Introduction

In this chapter, we will introduce and prove two theorems concerning a Hausdorff method's row total variation, but first some remarks are in order. In (1.1), we defined the Hausdorff transform using summation notation. The transform can also be thought of in terms of matrix multiplication, however. Think of $H_\Phi = (h_{k,j})$ as an infinite dimensional matrix, $S$ as a column vector of the Fourier series' partial sums, and $(H_\Phi S)$ as a column vector of the transforms (making $(H_\Phi S)_k$ the $k$th element). Thus we have:
$$H_\Phi = \begin{pmatrix} h_{0,0} & 0 & \cdots & \cdots & \cdots\\ h_{1,0} & h_{1,1} & 0 & \cdots & \cdots\\ h_{2,0} & h_{2,1} & h_{2,2} & 0 & \cdots\\ \vdots & \vdots & \vdots & \vdots & \ddots \end{pmatrix},\quad S = \begin{pmatrix} S_0\\ S_1\\ S_2\\ \vdots \end{pmatrix},\quad \text{and}\quad (H_\Phi S) = \begin{pmatrix} (H_\Phi S)_0\\ (H_\Phi S)_1\\ (H_\Phi S)_2\\ \vdots \end{pmatrix}.$$
For any row $k$, the total row variation of $H_\Phi$ is $\sum_{j=0}^{\infty}|h_{k,j} - h_{k,j-1}|$, where we will define $h_{k,-1} := 0$. Because the Hausdorff transforms have the property that $h_{k,j} = 0$ for $j > k$, the total row variation for a row $k$ can be written as $\sum_{j=0}^{k+1}|h_{k,j} - h_{k,j-1}|$.
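Although no computation appears in the text, the row total variation is easy to probe numerically. The sketch below is an illustrative aside (not part of the dissertation) using the assumed weight $\Phi(r) = r$, i.e. $d\Phi = dr$, the $(C,1)$ case, where the Beta integral collapses to $h_{k,j} = \frac{1}{k+1}$ and the row total variation is exactly $\frac{2}{k+1} = O(\frac{1}{k})$:

```python
# Illustrative sketch (assumes the weight Phi(r) = r, i.e. dPhi = dr, the (C,1) case).
# Then h_{k,j} = C(k,j) * Integral_0^1 r^j (1-r)^{k-j} dr = C(k,j) * B(j+1, k-j+1),
# computed here in log space via lgamma to avoid overflow for large k.
from math import exp, lgamma

def hausdorff_entry(k: int, j: int) -> float:
    """h_{k,j} for dPhi(r) = dr; equals 1/(k+1) for 0 <= j <= k, else 0."""
    if j < 0 or j > k:
        return 0.0
    log_binom = lgamma(k + 1) - lgamma(j + 1) - lgamma(k - j + 1)
    log_beta = lgamma(j + 1) + lgamma(k - j + 1) - lgamma(k + 2)
    return exp(log_binom + log_beta)

def row_total_variation(k: int) -> float:
    """sum_{j=0}^{k+1} |h_{k,j} - h_{k,j-1}| with h_{k,-1} := 0."""
    return sum(abs(hausdorff_entry(k, j) - hausdorff_entry(k, j - 1))
               for j in range(k + 2))

for k in (5, 50, 500):
    print(k, row_total_variation(k), 2 / (k + 1))  # the two columns agree
```

For this particular weight only the boundary terms $j = 0$ and $j = k+1$ contribute, since the interior entries are all equal; a weight whose rows change sign would of course behave differently.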

3.2 Two theorems

Now we are ready to answer the question posed at the end of chapter 2.


Theorem 3.2.1 For any Hausdorff method, if the row total variation is $O(\frac{1}{k})$, then
$$\sum_{j=0}^{k} h_{k,j}\sin(jt) = O\!\left(\frac{1}{kt}\right),$$
for all $t \in [\frac{\pi}{k}, \pi]$. In particular, when $\Phi$ is differentiable with respect to the Lebesgue measure with $\Phi' \in BV[0,1]$, then this result holds.

Before proving this theorem, we will mention that this result has further implications in summability theory and approximation theory. If $H_\Phi$ is a regular Hausdorff method, and if $\{\lambda(k)\}$ is an infinite increasing sequence of positive integers, then $H_\Phi^\lambda := (h_{\lambda(k),j})$ is called a $\lambda$-submethod of $H_\Phi$. Note that $H_\Phi^\lambda$ is another infinite dimensional matrix which is derived by eliminating some of the rows of $H_\Phi$. If $H_\Phi$ sums a sequence then, trivially, $H_\Phi^\lambda$ sums the same sequence to the same limit. However, the converse need not hold in general.

Those sequences {λ(k)} for which the converse holds (over a specified space of sequences) are called condensation sequences. By using our results on row total variation, we provide the following simple condensation test for bounded sequences in any normed linear space.

This extends the results of Osikiewicz [12].

Theorem 3.2.2 Let $H_\Phi = (h_{k,j})$ be a regular Hausdorff method. Let $(h_{k,j}^\Psi)$ be the Hausdorff method for the weight function $d\Psi(r) = r\cdot d\Phi(r)$, with row total variation:
$$(3.1)\qquad \sum_{j=0}^{\infty}\left|h_{k,j}^\Psi - h_{k,j-1}^\Psi\right| = O\!\left(\frac{1}{k^\beta}\right),$$
for some $\beta \in (0,1]$. Let $\{\lambda(k)\}$ be an infinite sequence of positive integers. If
$$(3.2)\qquad \frac{\lambda(k+1)}{\lambda(k)} = 1 + o\!\left(\frac{1}{\lambda(k)^{1-\beta}}\right),$$
then $H_\Phi$ and $H_\Phi^\lambda$ are equivalent over the space of bounded sequences in any normed linear space.

It is interesting to note that when $\Phi' \in BV[0,1]$, then we have $\Psi'(r) = r\cdot\Phi'(r) \in BV[0,1]$, and therefore by theorem 3.2.1, condition (3.1) will hold with $\beta = 1$. For instance, all $(C,\alpha)$ methods with $\alpha \ge 1$ are of this type. However, for $(H,\alpha)$ methods with $\alpha > 1$, $\Phi' \notin BV[0,1]$, but $r\cdot\Phi' \in BV[0,1]$, and once again condition (3.1) will hold with $\beta = 1$.
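Condition (3.2) is easy to test for concrete candidate sequences. The hedged sketch below (an illustrative aside, not from the text) tabulates $\lambda(k)^{1-\beta}\left(\frac{\lambda(k+1)}{\lambda(k)} - 1\right)$, which must tend to $0$; with $\beta = 1$, a polynomial sequence such as $\lambda(k) = k^2$ passes, while a geometric sequence such as $\lambda(k) = 2^k$ fails:

```python
# Numerical look at condensation condition (3.2):
#   lambda(k+1)/lambda(k) = 1 + o(1 / lambda(k)^(1 - beta)),
# i.e. lambda(k)^(1 - beta) * (lambda(k+1)/lambda(k) - 1) -> 0 as k -> infinity.
def condition_32_term(lam, k: int, beta: float) -> float:
    return lam(k) ** (1 - beta) * (lam(k + 1) / lam(k) - 1)

for k in (10, 100, 500):
    poly = condition_32_term(lambda j: j * j, k, beta=1.0)     # tends to 0: passes
    geom = condition_32_term(lambda j: 2.0 ** j, k, beta=1.0)  # stays at 1: fails
    print(k, poly, geom)
```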

3.3 Proof of theorem 3.2.1

Proof

Recall the trigonometric identity $\cos(a-b) - \cos(a+b) = 2\sin a\sin b$. Then:

$$2\sin\frac{t}{2}\sum_{j=0}^{\infty} h_{k,j}\sin(jt) = \sum_{j=1}^{\infty} h_{k,j}\cdot 2\sin(jt)\sin\frac{t}{2} = \sum_{j=1}^{\infty} h_{k,j}\left[\cos\!\left(\Big(j - \tfrac{1}{2}\Big)t\right) - \cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right)\right]$$
$$= \sum_{i=0}^{\infty} h_{k,i+1}\cos\!\left(\Big(i + \tfrac{1}{2}\Big)t\right) - \sum_{j=1}^{\infty} h_{k,j}\cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right)$$
$$= \sum_{j=0}^{\infty} h_{k,j+1}\cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right) - \sum_{j=0}^{\infty} h_{k,j}\cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right) + h_{k,0}\cos\frac{t}{2}$$
$$= h_{k,0}\cos\frac{t}{2} + \sum_{j=0}^{\infty}\left(h_{k,j+1} - h_{k,j}\right)\cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right) = \sum_{j=-1}^{\infty}\left(h_{k,j+1} - h_{k,j}\right)\cos\!\left(\Big(j + \tfrac{1}{2}\Big)t\right),$$
recalling that $h_{k,-1} = 0$. So:

$$\left|2\sin\frac{t}{2}\sum_{j=0}^{\infty} h_{k,j}\sin(jt)\right| \le \sum_{j=-1}^{\infty}\left|h_{k,j+1} - h_{k,j}\right| = \sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right|.$$

For any $t \in (0, \pi]$, $\sin\frac{t}{2} > 0$, so:

$$\left|\sum_{j=0}^{\infty} h_{k,j}\sin(jt)\right| \le \frac{1}{2\sin\frac{t}{2}}\sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right|.$$

In fact, for $t \in (0, \pi]$, $2\sin\frac{t}{2} \ge \frac{2t}{\pi}$. Also, since we are assuming that the row total variation is $O(\frac{1}{k})$, there is a positive constant $\zeta$ such that:

$$\left|\sum_{j=0}^{\infty} h_{k,j}\sin(jt)\right| \le \frac{\pi}{2t}\cdot\frac{\zeta}{k} = O\!\left(\frac{1}{kt}\right).$$

Hence, the first part of theorem 3.2.1 is proved. To prove the second part, assume $\Phi(r)$ is differentiable with respect to the Lebesgue measure, and assume $\Phi'(r) \in BV[0,1]$. Thus, $d\Phi(r) = \Phi'(r)\,dr$. Recall from (1.1) that when $0 \le j \le k$:

$$h_{k,j} = \binom{k}{j}\int_0^1 r^j(1-r)^{k-j}\,d\Phi(r) = \frac{1}{k+1}\cdot\frac{(k+1)!}{j!\,(k-j)!}\int_0^1 r^j(1-r)^{k-j}\,\Phi'(r)\,dr = \frac{1}{k+1}\cdot\frac{\Gamma\big((j+1)+(k-j+1)\big)}{\Gamma(j+1)\,\Gamma(k-j+1)}\int_0^1 r^j(1-r)^{k-j}\,\Phi'(r)\,dr.$$

Now define a = j + 1 and b = k − j + 1. Then:

$$h_{k,j} = \frac{1}{k+1}\cdot\frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\int_0^1 r^{a-1}(1-r)^{b-1}\,\Phi'(r)\,dr.$$

At this point, we claim that the integrand resembles the probability density function of a Beta random variable with parameters $a$ and $b$. Recall that if $Y \sim \mathrm{Beta}(a,b)$, then its probability density function, $f_{a,b}(y)$, and its probability distribution function, $F_{a,b}(t)$, are given by:
$$f_{a,b}(y) := \frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\,y^{a-1}(1-y)^{b-1},\qquad y \in (0,1),$$
$$F_{a,b}(t) := \int_0^t f_{a,b}(y)\,dy,\qquad t \in [0,1].$$

Therefore, for 0 ≤ j ≤ k:

$$(k+1)\,h_{k,j} = \int_0^1 f_{j+1,\,k-j+1}(r)\,\Phi'(r)\,dr.$$
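As an illustrative aside (with an assumed weight, not one from the text): the identity above says $(k+1)h_{k,j} = E[\Phi'(Y)]$ for $Y \sim \mathrm{Beta}(j+1, k-j+1)$. For the hypothetical choice $\Phi(r) = r^2$, so $\Phi'(r) = 2r$, this expectation is $2\,\frac{j+1}{k+2}$ in closed form, which a quick Monte Carlo estimate can confirm:

```python
# Monte Carlo check of (k+1) h_{k,j} = E[Phi'(Y)], Y ~ Beta(j+1, k-j+1),
# for the assumed weight Phi(r) = r^2 (so Phi'(r) = 2r and E[2Y] = 2(j+1)/(k+2)).
import random

def expectation_monte_carlo(k: int, j: int, n: int = 200_000, seed: int = 0) -> float:
    rng = random.Random(seed)
    return sum(2 * rng.betavariate(j + 1, k - j + 1) for _ in range(n)) / n

k, j = 10, 3
exact = 2 * (j + 1) / (k + 2)            # closed form for (k+1) h_{k,j}
approx = expectation_monte_carlo(k, j)
print(exact, approx)                     # the estimate is close to the exact value
```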

And, further:

$$(k+1)\sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right| = (k+1)\,h_{k,0} + (k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right| + (k+1)\,h_{k,k}$$
$$= \int_0^1 f_{1,k+1}(r)\,\Phi'(r)\,dr + \left[(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right|\right] + \int_0^1 f_{k+1,1}(r)\,\Phi'(r)\,dr$$
$$\le \int_0^1 f_{1,k+1}(r)\,\left|\Phi'(r)\right|dr + \left[(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right|\right] + \int_0^1 f_{k+1,1}(r)\,\left|\Phi'(r)\right|dr.$$
Since $\Phi'(r)$ is of bounded variation on $[0,1]$, $|\Phi'(r)|$ is bounded by some positive constant, $\xi$. Thus:

$$(3.3)\qquad (k+1)\sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right| \le 2\xi + \left[(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right|\right].$$

Now look at the term in the brackets:

$$(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right| = \sum_{j=1}^{k}\left|(k+1)h_{k,j} - (k+1)h_{k,j-1}\right| = \sum_{j=1}^{k}\left|\int_0^1 f_{j+1,k-j+1}(r)\,\Phi'(r)\,dr - \int_0^1 f_{j,k-j+2}(r)\,\Phi'(r)\,dr\right|.$$

Integrating both integrals by parts gives:

$$(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right| = \sum_{j=1}^{k}\left|\Phi'(1) - \int_0^1 F_{j+1,k-j+1}(r)\,d\Phi'(r) - \Phi'(1) + \int_0^1 F_{j,k-j+2}(r)\,d\Phi'(r)\right|$$
$$= \sum_{j=1}^{k}\left|\int_0^1\left[F_{j,k-j+2}(r) - F_{j+1,k-j+1}(r)\right]d\Phi'(r)\right|$$
$$(3.4)\qquad \le \sum_{j=1}^{k}\int_0^1\left|F_{j,k-j+2}(r) - F_{j+1,k-j+1}(r)\right|d\left|\Phi'(r)\right|.$$

Now we will present a probabilistic proof to show that the absolute value may be removed from the integrand in (3.4). Let $X_1, X_2, \ldots$ be independent, identically distributed exponential random variables with parameter 1. Then for positive integers $a$ and $b$, we have:
$$\frac{\sum_{i=1}^{a} X_i}{\sum_{i=1}^{a+b} X_i} \sim \mathrm{Beta}(a,b).$$
So, noting that $j + (k-j+2) = k+2 = (j+1) + (k-j+1)$, we can write:

$$F_{j,k-j+2}(r) = P\!\left(\frac{\sum_{i=1}^{j} X_i}{\sum_{i=1}^{k+2} X_i} \le r\right),$$

and,

$$F_{j+1,k-j+1}(r) = P\!\left(\frac{\sum_{i=1}^{j+1} X_i}{\sum_{i=1}^{k+2} X_i} \le r\right).$$
Notice that both random variables have the same denominator, while the former uses fewer random variables in the numerator than the latter. Therefore, the latter is stochastically larger, which implies that $F_{j,k-j+2}(r) \ge F_{j+1,k-j+1}(r)$ for every $r \in [0,1]$. Thus, (3.4) becomes:

$$(k+1)\sum_{j=1}^{k}\left|h_{k,j} - h_{k,j-1}\right| \le \sum_{j=1}^{k}\int_0^1\left(F_{j,k-j+2}(r) - F_{j+1,k-j+1}(r)\right)d\left|\Phi'(r)\right|$$
$$= \sum_{j=1}^{k}\left[F_{j,k-j+2}(1)\cdot|\Phi'(1)| - F_{j,k-j+2}(0)\cdot|\Phi'(0)| - \int_0^1 f_{j,k-j+2}(r)\cdot|\Phi'(r)|\,dr\right.$$
$$\left.\qquad - F_{j+1,k-j+1}(1)\cdot|\Phi'(1)| + F_{j+1,k-j+1}(0)\cdot|\Phi'(0)| + \int_0^1 f_{j+1,k-j+1}(r)\cdot|\Phi'(r)|\,dr\right]$$
$$= \sum_{j=1}^{k}\int_0^1\left[f_{j+1,k-j+1}(r) - f_{j,k-j+2}(r)\right]\cdot|\Phi'(r)|\,dr = \int_0^1\left[\sum_{j=1}^{k}\left(f_{j+1,k-j+1}(r) - f_{j,k-j+2}(r)\right)\right]\cdot|\Phi'(r)|\,dr$$
$$= \int_0^1\left(f_{k+1,1}(r) - f_{1,k+1}(r)\right)\cdot|\Phi'(r)|\,dr \le \int_0^1 f_{k+1,1}(r)\cdot|\Phi'(r)|\,dr + \int_0^1 f_{1,k+1}(r)\cdot|\Phi'(r)|\,dr.$$

Again using $\xi$ as a bound for $|\Phi'(r)|$, (3.3) then becomes:

$$(k+1)\sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right| \le 4\xi,$$

or,

$$\sum_{j=0}^{\infty}\left|h_{k,j} - h_{k,j-1}\right| = O\!\left(\frac{1}{k}\right).$$
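The stochastic-ordering step used in the proof above ($F_{j,k-j+2}(r) \ge F_{j+1,k-j+1}(r)$) can also be seen by coupling: on the same exponential sample, adding $X_{j+1}$ to the numerator can only increase the ratio. A minimal Monte Carlo sketch of this, illustrative only and with arbitrary hypothetical values of $j$, $k$, and $r$:

```python
# Empirical CDFs of sum_{i<=j} X_i / sum_{i<=k+2} X_i  (distributed Beta(j, k-j+2))
# and sum_{i<=j+1} X_i / sum_{i<=k+2} X_i (distributed Beta(j+1, k-j+1)) on a shared
# sample: the smaller-numerator ratio is <= r whenever the larger one is.
import random

def empirical_cdfs(k: int, j: int, r: float, n: int = 50_000, seed: int = 1):
    rng = random.Random(seed)
    below_small, below_large = 0, 0
    for _ in range(n):
        x = [rng.expovariate(1.0) for _ in range(k + 2)]
        total = sum(x)
        below_small += sum(x[:j]) / total <= r       # estimates F_{j, k-j+2}(r)
        below_large += sum(x[:j + 1]) / total <= r   # estimates F_{j+1, k-j+1}(r)
    return below_small / n, below_large / n

F_small_num, F_large_num = empirical_cdfs(k=8, j=3, r=0.5)
print(F_small_num, F_large_num)   # first >= second, as the ordering predicts
```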

3.4 Proof of theorem 3.2.2

Proof

To prove that $H_\Phi$ and $H_\Phi^\lambda$ are equivalent, note that if a sequence is summed by $H_\Phi$, then it is also obviously summed by any of its submethods.

For the converse, assume condition (3.2) holds, and let $\{f_k,\ k = 0, 1, 2, \ldots\}$ be a bounded sequence which is $H_\Phi^\lambda$-summable to some limit, $L$. Consider the sequence $\rho(n)$ consisting of the positive integers which are not picked up by $\lambda(n)$. If $\rho(n)$ consists of only finitely many integers, then trivially $H_\Phi$ and $H_\Phi^\lambda$ are equivalent. Thus, we will assume that $\rho(n)$ contains an infinite number of positive integers. Then there exists an integer $N$ such that for any $n > N$, $\rho(n) > \lambda(0)$. Furthermore, for any $n > N$, we can find a nonnegative integer $m$ such that $\lambda(m) < \rho(n) < \lambda(m+1)$. So, we will let $\rho(n) = \lambda(m) + j$ where $0 < j < \lambda(m+1) - \lambda(m)$. For $n > N$:

$$\left\|(H_\Phi^\rho f)_n - (H_\Phi^\lambda f)_m\right\| = \left\|\sum_{k=0}^{\infty} h_{\rho(n),k}\cdot f_k - \sum_{k=0}^{\infty} h_{\lambda(m),k}\cdot f_k\right\| = \left\|\sum_{k=0}^{\infty}\left(h_{\rho(n),k} - h_{\lambda(m),k}\right)\cdot f_k\right\|$$
$$\le \sup_k\|f_k\|\cdot\sum_{k=0}^{\infty}\left|h_{\rho(n),k} - h_{\lambda(m),k}\right| = \sup_k\|f_k\|\cdot\sum_{k=0}^{\infty}\left|\sum_{l=0}^{j-1}\left(h_{\lambda(m)+l+1,k} - h_{\lambda(m)+l,k}\right)\right|$$
$$\le \sup_k\|f_k\|\cdot\sum_{k=0}^{\infty}\sum_{l=0}^{j-1}\left|h_{\lambda(m)+l+1,k} - h_{\lambda(m)+l,k}\right| \le K\sum_{l=0}^{j-1}\sum_{k=0}^{\infty}\left|h_{\lambda(m)+l+1,k} - h_{\lambda(m)+l,k}\right|,$$
defining $K := \sup_k\|f_k\|$ and switching the order of summation. Using lemma 1.5.1, this becomes:

$$\left\|(H_\Phi^\rho f)_n - (H_\Phi^\lambda f)_m\right\| \le K\sum_{l=0}^{j-1}\sum_{k=0}^{\infty}\left|h^\Psi_{\lambda(m)+l,k} - h^\Psi_{\lambda(m)+l,k-1}\right|.$$
By condition (3.1) of theorem 3.2.2, there exists a positive constant $M > 0$ such that:

$$\left\|(H_\Phi^\rho f)_n - (H_\Phi^\lambda f)_m\right\| \le K\sum_{l=0}^{j-1}\frac{M}{(\lambda(m)+l)^\beta} \le MK\sum_{l=0}^{j-1}\frac{1}{\lambda(m)^\beta} = \frac{MK}{\lambda(m)^\beta}\cdot j$$
$$\le \frac{MK}{\lambda(m)^\beta}\cdot\left[\lambda(m+1) - \lambda(m)\right] = MK\cdot\lambda(m)^{1-\beta}\cdot\left(\frac{\lambda(m+1)}{\lambda(m)} - 1\right) = o(1),$$

by condition (3.2) of theorem 3.2.2. Thus, over the space of bounded sequences, the two methods have the same asymptotic behavior, and in particular are equivalent.

CHAPTER 4

Bounds for Tensor Product Hausdorff Transforms

4.1 Introduction and definitions

In this chapter, we will extend our previous results to the multivariate case. First, we need to update our previous definitions, starting with the function, $f$. Now let $f$ be a function of two variables; i.e., $f(x,y)$. To go along with our previous idea of $f$ being $2\pi$-periodic, we will now define $f$ over $\mathbb{T}^2$. Usually, $\mathbb{T}^2$ is defined to be $[0,2\pi]\times[0,2\pi]$, but as Körner points out in [9], this assigns a special place to $0$ and $2\pi$. Rather, we may think of $\mathbb{T}^2$ as any product space $[a,b]\times[c,d]$ where $d-c = b-a = 2\pi$. In fact, to mirror what we did in chapter 1, we wish to let $\mathbb{T}^2$ be $[-\pi,\pi]\times[-\pi,\pi]$. To generalize so that the domain of $f(x,y)$ may be $\mathbb{R}^2$, we define $f(x,y) = f(\hat{x},\hat{y})$ where $\hat{x} = x \mp 2\pi n_1$ for some $n_1 \in \mathbb{Z}$, $\hat{y} = y \mp 2\pi n_2$ for some $n_2 \in \mathbb{Z}$, and $(\hat{x},\hat{y}) \in \mathbb{T}^2$. We will be looking at a particular $(x,y)$ which is either a point of continuity of $f$, or a point of simple discontinuity. We also define:

(4.1)

$$\phi_{x,y}(u,v) = f(x+u, y+v) + f(x+u, y-v) + f(x-u, y+v) + f(x-u, y-v)$$
$$\qquad\qquad - \left[f(x^+, y^+) + f(x^+, y^-) + f(x^-, y^+) + f(x^-, y^-)\right],$$
when $u$ and $v$ are not both equal to $0$, and defined to equal $0$ when $u = v = 0$.

As before, we will need to have some restrictions on f. The total variation of f(x, y) on

[a, b] × [c, d] is defined as:

(4.2)

$$\mathrm{Var}_{a,c}^{b,d}(f) = \sup_{N_1, N_2, \{s_i\}, \{t_j\}}\ \sum_{i=1}^{N_1}\sum_{j=1}^{N_2}\left|f(s_i, t_j) - f(s_i, t_{j-1}) - f(s_{i-1}, t_j) + f(s_{i-1}, t_{j-1})\right|,$$
where $a = s_0 < s_1 < \cdots < s_{N_1} = b$, and $c = t_0 < t_1 < \cdots < t_{N_2} = d$. First, a comment on notation. Notice that in chapter 1, the variation always appeared in the form $\mathrm{Var}_0^b(\phi_x)$. We will require similar expressions in this chapter, so for notational convenience, define:

$$(4.3)\qquad \mathrm{Var}(b,d) := \mathrm{Var}_{0,0}^{b,d}(\phi_{x,y}).$$
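To make definition (4.2) concrete, the sketch below (an illustrative aside with a hypothetical test function) evaluates the double sum over one fixed uniform partition; the true variation is the supremum over all partitions, so a grid sum is only a lower bound in general. For $f(u,v) = uv$ on $[0,1]^2$ every mixed difference is positive, and the sum telescopes to $f(1,1) - f(1,0) - f(0,1) + f(0,0) = 1$ regardless of the partition:

```python
# Mixed-difference (Vitali-type) sum from (4.2) for f over [a,b] x [c,d],
# evaluated on a fixed uniform n1 x n2 grid (a lower bound for the supremum).
def grid_variation(f, a, b, c, d, n1=100, n2=100):
    s = [a + (b - a) * i / n1 for i in range(n1 + 1)]
    t = [c + (d - c) * j / n2 for j in range(n2 + 1)]
    return sum(abs(f(s[i], t[j]) - f(s[i], t[j - 1])
                   - f(s[i - 1], t[j]) + f(s[i - 1], t[j - 1]))
               for i in range(1, n1 + 1) for j in range(1, n2 + 1))

print(grid_variation(lambda u, v: u * v, 0, 1, 0, 1))  # ~1.0
```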

Next, notice that the natural way to define a function of bounded variation would be a

function whose total variation on [a, b] × [c, d] is finite. Unfortunately, there are some such functions for which expressions of the form f(x+, y+) are not well-defined, as the limit approaching a point of discontinuity may depend on the path taken. Therefore, we will instead use the Hardy-Krause definition for a function of bounded variation. That is, we will call f(x, y) a function of bounded variation if and only if it can be written as:

$$(4.4)\qquad f(x,y) = a\cdot h_1(x,y) + b\cdot h_2(x,y) + c,$$

where a, b, and c are constants, and h1(x, y) and h2(x, y) are bivariate probability distri- butions. Note that this resembles the one variable case where a function is of bounded variation if and only if it can be written as the difference of two bounded, monotonic func- tions. Now, for functions of bounded variation in the Hardy-Krause sense, expressions of the form f(x+, y+) are always well-defined.

As for the Fourier series of $f$, let $f \in L^1(\mathbb{T}^2)$ and define:

$$(4.5)\qquad S(f; x, y) = \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} c_{j,k}\, e^{i(jx+ky)},$$
where the coefficients $c_{j,k}$ are defined as:

$$(4.6)\qquad c_{j,k} = \frac{1}{4\pi^2}\iint_{\mathbb{T}^2} f(u,v)\, e^{-i(ju+kv)}\,du\,dv,\qquad j,k \in \mathbb{Z}.$$

Thus, define the (m, n)th rectangular partial sum to be:

$$(4.7)\qquad S_{m,n}(f; x, y) = \sum_{j=-m}^{m}\sum_{k=-n}^{n} c_{j,k}\, e^{i(jx+ky)}.$$

We choose to use this version of the Fourier series in the two variable case for simplicity

in both notation and in proofs. We will also use the integral form for the partial sum (the two-variable equivalent of (4)). This expression is our first lemma of this chapter.

Lemma 4.1.1 Let $f \in L^1(\mathbb{T}^2)$. The $(m,n)$th rectangular partial Fourier sum of $f(x,y)$

may be written:

$$S_{m,n}(f; x, y) = \frac{1}{\pi^2}\iint_{\mathbb{T}^2} f(x-u, y-v)\, D_m(u)\, D_n(v)\,du\,dv,$$
where $D_m(u)$ and $D_n(v)$ are the Dirichlet kernels defined in (5).

Proof

$$S_{m,n}(f; x, y) = \sum_{j=-m}^{m}\sum_{k=-n}^{n} c_{j,k}\, e^{i(jx+ky)} = \sum_{j=-m}^{m}\sum_{k=-n}^{n}\left(\frac{1}{4\pi^2}\iint_{\mathbb{T}^2} f(u,v)\, e^{-i(ju+kv)}\,du\,dv\right)e^{i(jx+ky)}$$
$$= \frac{1}{4\pi^2}\sum_{j=-m}^{m}\sum_{k=-n}^{n}\iint_{\mathbb{T}^2} f(u,v)\, e^{i[(jx+ky)-(ju+kv)]}\,du\,dv = \frac{1}{4\pi^2}\sum_{j=-m}^{m}\sum_{k=-n}^{n}\iint_{\mathbb{T}^2} f(u,v)\, e^{i[j(x-u)+k(y-v)]}\,du\,dv.$$
Let $s = x - u$ and $t = y - v$. Then:

$$S_{m,n}(f; x, y) = \frac{1}{4\pi^2}\sum_{j=-m}^{m}\sum_{k=-n}^{n}\iint_{\mathbb{T}^2} f(x-s, y-t)\, e^{i(js+kt)}\,ds\,dt.$$
To avoid an overabundance of variables, we will change the integration variables back to $u$ and $v$ (i.e., let $u = s$ and $v = t$). Therefore:

$$S_{m,n}(f; x, y) = \frac{1}{4\pi^2}\sum_{j=-m}^{m}\sum_{k=-n}^{n}\iint_{\mathbb{T}^2} f(x-u, y-v)\, e^{i(ju+kv)}\,du\,dv = \frac{1}{4\pi^2}\iint_{\mathbb{T}^2} f(x-u, y-v)\left(\sum_{j=-m}^{m} e^{iju}\right)\left(\sum_{k=-n}^{n} e^{ikv}\right)du\,dv.$$
Examining the first summation term in parentheses, and recalling the Euler identity $e^{i\theta} = \cos\theta + i\sin\theta$, we have:
$$\sum_{j=-m}^{m} e^{iju} = \sum_{j=-m}^{m}\left[\cos(ju) + i\sin(ju)\right] = \sum_{j=-m}^{m}\cos(ju) + i\sum_{j=-m}^{m}\sin(ju).$$
Since $\sin(0) = 0$ and $\sin(-\theta) = -\sin(\theta)$, the second sum above is zero. Since $\cos(0) = 1$

and $\cos(-\theta) = \cos(\theta)$, we have:
$$\sum_{j=-m}^{m} e^{iju} = 1 + 2\sum_{j=1}^{m}\cos(ju) = 2\left(\frac{1}{2} + \sum_{j=1}^{m}\cos(ju)\right) = 2\,D_m(u),$$

by (6). Repeating the same argument when we replace $m$ with $n$ and $u$ with $v$ gives:
$$S_{m,n}(f; x, y) = \frac{1}{\pi^2}\iint_{\mathbb{T}^2} f(x-u, y-v)\, D_m(u)\, D_n(v)\,du\,dv.$$
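The kernel identity at the heart of the proof, $\sum_{j=-m}^{m} e^{iju} = 2D_m(u)$ with $D_m(u) = \frac{1}{2} + \sum_{j=1}^{m}\cos(ju)$, is easy to confirm numerically (an illustrative aside, with arbitrary sample values of $m$ and $u$):

```python
# Check that sum_{j=-m}^{m} e^{iju} is real and equals 2 * D_m(u),
# where D_m(u) = 1/2 + sum_{j=1}^{m} cos(ju) is the Dirichlet kernel.
import cmath
import math

def exp_sum(m: int, u: float) -> complex:
    return sum(cmath.exp(1j * j * u) for j in range(-m, m + 1))

def dirichlet(m: int, u: float) -> float:
    return 0.5 + sum(math.cos(j * u) for j in range(1, m + 1))

m, u = 25, 0.7
print(exp_sum(m, u).real, 2 * dirichlet(m, u))  # equal up to rounding
print(abs(exp_sum(m, u).imag))                  # essentially zero
```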

As before, we wish to focus on Hausdorff-transformed Fourier series. We define the tensor product Hausdorff transform of $S_{m,n}$ as:

$$(4.8)\qquad S_{m,n}^{\Phi,\Psi}(f; x, y) = \sum_{j=0}^{m}\sum_{k=0}^{n} h^\Phi_{m,j}\cdot h^\Psi_{n,k}\cdot S_{j,k}(f; x, y),\qquad m, n = 0, 1, 2, \ldots,$$
where
$$(4.9)\qquad h^\Phi_{m,j} = \begin{cases}\displaystyle\int_0^1 P(X_{m,r} = j)\,d\Phi(r), & 0 \le j \le m,\\[1.5ex] 0, & \text{otherwise},\end{cases}$$
for $\Phi \in BV[0,1]$, where $X_{m,r}$ is a binomially distributed random variable and $P(X_{m,r} = j) = \binom{m}{j} r^j(1-r)^{m-j}$ is the probability of getting $j$ successes on $m$ independent trials, each of which results in a success with probability $r$. Thus, (4.9) is the same definition used in

(1.2), except now the weight function will be explicitly specified using a superscript (as was

done in chapter 3). Also notice that the notation for the transform has changed slightly from the single variable case. There, the Hausdorff transform was represented as $(H_\Phi S)_n$, whereas a superscript on $S$ was only used for the $(C,\alpha)$ transform (15), a particular type of Hausdorff transform. Here, for simplicity in notation, we will use the superscripts on $S$

to indicate the weight functions of the general Hausdorff transform.
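For intuition, here is a minimal sketch of the tensor-product transform (4.8) under the assumed weights $\Phi(r) = \Psi(r) = r$ (the $(C,1)\times(C,1)$ case, where every row entry is $h^\Phi_{m,j} = \frac{1}{m+1}$): the transform reduces to a double arithmetic mean of the rectangular partial sums.

```python
# Tensor-product Hausdorff mean (4.8) for the (C,1) weights, where each row
# entry is 1/(m+1): a double arithmetic mean of the partial sums S_{j,k}.
def tensor_c1_mean(S, m: int, n: int) -> float:
    """S[j][k] holds S_{j,k}; returns S^{Phi,Psi}_{m,n} for Phi(r) = Psi(r) = r."""
    h_m = [1.0 / (m + 1)] * (m + 1)
    h_n = [1.0 / (n + 1)] * (n + 1)
    return sum(h_m[j] * h_n[k] * S[j][k]
               for j in range(m + 1) for k in range(n + 1))

# Toy double sequence with S_{j,k} = 1 for all j, k: every mean is 1.
S = [[1.0] * 11 for _ in range(11)]
print(tensor_c1_mean(S, 10, 10))
```

A general weight would replace the two uniform rows by the entries from (4.9); the double-sum structure is unchanged.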

4.2 Extending lemma 1.4.1

Next, we will require a bivariate form of lemma 1.4.1.

Lemma 4.2.1 Let $f \in BV[\mathbb{T}^2]$, and let $\Phi(r), \Psi(s) \in BV[0,1]$. If $S_{m,n}^{\Phi,\Psi}(f)$ is the tensor product Hausdorff transform of the double Fourier series, then
$$(4.10)\qquad S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f) = \frac{1}{mn\pi^2}\int_0^\pi\!\!\int_0^\pi \phi_{x,y}(u,v)\, G'_{\Phi,m}(u)\cdot G'_{\Psi,n}(v)\,du\,dv,$$
where
$$G_{\Phi,m}(u) = \sum_{j=0}^{m} h^\Phi_{m,j}\sin(ju).$$

Proof

By definition:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) = \sum_{j=0}^{m}\sum_{k=0}^{n} h^\Phi_{m,j}\cdot h^\Psi_{n,k}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1}\sum_{k=0}^{n} h^\Phi_{m-1,j}\cdot h^\Psi_{n,k}\cdot S_{j,k}(f)$$
$$(4.11)\qquad = \sum_{k=0}^{n} h^\Psi_{n,k}\left[\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f)\right].$$
Notice that:

$$m\cdot\left[\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f)\right] = m\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - m\sum_{j=0}^{m-1}\left(\int_0^1 P(X_{m-1,r} = j)\,d\Phi(r)\right)S_{j,k}(f).$$
Using lemma 1.3.1 on the term in parentheses yields:

$$m\cdot\left[\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f)\right]$$
$$= m\, h^\Phi_{m,m}\, S_{m,k}(f) + m\sum_{j=0}^{m-1} h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{j=0}^{m-1}(m-j)\left(\int_0^1 P(X_{m,r} = j)\,d\Phi(r)\right)S_{j,k}(f) - \sum_{j=0}^{m-1}(j+1)\left(\int_0^1 P(X_{m,r} = j+1)\,d\Phi(r)\right)S_{j,k}(f)$$
$$= m\, h^\Phi_{m,m}\, S_{m,k}(f) + m\sum_{j=0}^{m-1} h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{j=0}^{m-1}(m-j)\, h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{j=0}^{m-1}(j+1)\, h^\Phi_{m,j+1}\, S_{j,k}(f)$$
$$= m\, h^\Phi_{m,m}\, S_{m,k}(f) + \sum_{j=0}^{m-1} j\, h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{j=0}^{m-1}(j+1)\, h^\Phi_{m,j+1}\, S_{j,k}(f) = \sum_{j=0}^{m} j\, h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{j=0}^{m-1}(j+1)\, h^\Phi_{m,j+1}\, S_{j,k}(f).$$

Now making the change of the index of summation $i = j + 1$ in the second sum gives:
$$m\cdot\left[\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f)\right] = \sum_{j=0}^{m} j\, h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{i=1}^{m} i\, h^\Phi_{m,i}\, S_{i-1,k}(f)$$
$$= \sum_{j=0}^{m} j\, h^\Phi_{m,j}\, S_{j,k}(f) - \sum_{i=0}^{m} i\, h^\Phi_{m,i}\, S_{i-1,k}(f) = \sum_{j=0}^{m} j\, h^\Phi_{m,j}\left(S_{j,k}(f) - S_{j-1,k}(f)\right).$$
Applying lemma 4.1.1 and the second form of the Dirichlet kernel (6), we have:

$$m\cdot\left[\sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f)\right]$$
$$= \sum_{j=0}^{m} j\, h^\Phi_{m,j}\left(\frac{1}{\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\left[D_j(u) - D_{j-1}(u)\right]du\,dv\right)$$
$$= \sum_{j=0}^{m} j\, h^\Phi_{m,j}\left(\frac{1}{\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\cos(ju)\,du\,dv\right)$$
$$= \frac{1}{\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\left(\sum_{j=0}^{m} j\, h^\Phi_{m,j}\cos(ju)\right)du\,dv = \frac{1}{\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\, G'_{\Phi,m}(u)\,du\,dv,$$
by definition of $G_{\Phi,m}(u)$. Thus:

$$(4.12)\qquad \sum_{j=0}^{m} h^\Phi_{m,j}\cdot S_{j,k}(f) - \sum_{j=0}^{m-1} h^\Phi_{m-1,j}\cdot S_{j,k}(f) = \frac{1}{m\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\, G'_{\Phi,m}(u)\,du\,dv.$$
Substituting (4.12) back into (4.11) gives:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) = \sum_{k=0}^{n} h^\Psi_{n,k}\left(\frac{1}{m\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, D_k(v)\, G'_{\Phi,m}(u)\,du\,dv\right)$$
$$= \frac{1}{m\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\left(\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v)\right)du\,dv.$$

Repeating the same steps after replacing $n$ with $n-1$ yields:
$$S_{m,n-1}^{\Phi,\Psi}(f) - S_{m-1,n-1}^{\Phi,\Psi}(f) = \frac{1}{m\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\left(\sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v)\right)du\,dv.$$
Therefore:

$$(4.13)\qquad S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)$$
$$= \frac{1}{m\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\left(\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v)\right)du\,dv.$$
Notice that:

$$n\cdot\left(\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v)\right) = n\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - n\sum_{k=0}^{n-1}\left(\int_0^1 P(X_{n-1,r} = k)\,d\Psi(r)\right)D_k(v).$$
Using lemma 1.3.1 on the term in parentheses yields:

$$n\cdot\left(\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v)\right)$$
$$= n\, h^\Psi_{n,n}\, D_n(v) + n\sum_{k=0}^{n-1} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1}(n-k)\left(\int_0^1 P(X_{n,r} = k)\,d\Psi(r)\right)D_k(v) - \sum_{k=0}^{n-1}(k+1)\left(\int_0^1 P(X_{n,r} = k+1)\,d\Psi(r)\right)D_k(v)$$
$$= n\, h^\Psi_{n,n}\, D_n(v) + n\sum_{k=0}^{n-1} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1}(n-k)\, h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1}(k+1)\, h^\Psi_{n,k+1}\, D_k(v)$$
$$= \sum_{k=0}^{n} k\, h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1}(k+1)\, h^\Psi_{n,k+1}\, D_k(v).$$
Now making the change of the index of summation $i = k + 1$ in the second sum gives:
$$n\cdot\left(\sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v)\right) = \sum_{k=0}^{n} k\, h^\Psi_{n,k}\, D_k(v) - \sum_{i=1}^{n} i\, h^\Psi_{n,i}\, D_{i-1}(v) = \sum_{k=0}^{n} k\, h^\Psi_{n,k}\, D_k(v) - \sum_{i=0}^{n} i\, h^\Psi_{n,i}\, D_{i-1}(v)$$
$$= \sum_{k=0}^{n} k\, h^\Psi_{n,k}\left[D_k(v) - D_{k-1}(v)\right] = \sum_{k=0}^{n} k\, h^\Psi_{n,k}\cos(kv) = G'_{\Psi,n}(v),$$
by definition of $G_{\Psi,n}(v)$. Thus:

$$(4.14)\qquad \sum_{k=0}^{n} h^\Psi_{n,k}\, D_k(v) - \sum_{k=0}^{n-1} h^\Psi_{n-1,k}\, D_k(v) = \frac{1}{n}\, G'_{\Psi,n}(v).$$
Substituting (4.14) back into (4.13) gives:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f) = \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{-\pi}^{0} f(x-u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv + \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi} f(x+u, y-v)\, G'_{\Phi,m}(-u)\, G'_{\Psi,n}(v)\,du\,dv + \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi} f(x+u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv + \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi} f(x-u, y-v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv,$$
since $G'_{\Phi,m}(u)$ is an even function in $u$. Therefore:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)$$
$$= \frac{1}{mn\pi^2}\int_{-\pi}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y-v) + f(x-u, y-v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{-\pi}^{0}\!\!\int_{0}^{\pi}\left[f(x+u, y-v) + f(x-u, y-v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv + \frac{1}{mn\pi^2}\int_{0}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y-v) + f(x-u, y-v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{0}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y+v) + f(x-u, y+v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(-v)\,du\,dv + \frac{1}{mn\pi^2}\int_{0}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y-v) + f(x-u, y-v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_{0}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y+v) + f(x-u, y+v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv + \frac{1}{mn\pi^2}\int_{0}^{\pi}\!\!\int_{0}^{\pi}\left[f(x+u, y-v) + f(x-u, y-v)\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv$$
$$= \frac{1}{mn\pi^2}\int_0^{\pi}\!\!\int_0^{\pi} F\; G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv,$$
where $F = f(x+u, y+v) + f(x-u, y+v) + f(x+u, y-v) + f(x-u, y-v)$, and where we used that $G'_{\Psi,n}(v)$ is likewise an even function in $v$. Next notice that for any constant $s$:

$$s\int_0^\pi\!\!\int_0^\pi G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv = s\int_0^\pi G'_{\Psi,n}(v)\left(\int_0^\pi G'_{\Phi,m}(u)\,du\right)dv$$
$$= s\int_0^\pi G'_{\Psi,n}(v)\left(\sum_{j=0}^{m} j\, h^\Phi_{m,j}\int_0^\pi\cos(ju)\,du\right)dv = s\int_0^\pi G'_{\Psi,n}(v)\left(0 + \sum_{j=1}^{m} j\, h^\Phi_{m,j}\cdot 0\right)dv = 0.$$

Hence:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f) = \frac{1}{mn\pi^2}\int_0^\pi\!\!\int_0^\pi\left[F - s\right]G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv.$$

In particular, for $s = f(x^+, y^+) + f(x^+, y^-) + f(x^-, y^+) + f(x^-, y^-)$, we have:

$$S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f) = \frac{1}{mn\pi^2}\int_0^\pi\!\!\int_0^\pi \phi_{x,y}(u,v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv.$$

4.3 Some necessary propositions

To examine the convergence rate, we will need Abel’s summation by parts formula.

Proposition 4.3.1

$$\sum_{k=1}^{n} a_k(s_k - s_{k-1}) = a_n s_n - a_0 s_0 - \sum_{k=1}^{n} s_{k-1}(a_k - a_{k-1}).$$

The proof is trivially done by expanding the left- and right-hand sides and matching similar terms, so we will omit it here.
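Since the proposition is used repeatedly below, a short numerical check does no harm (an illustrative aside with random hypothetical data):

```python
# Numerical check of Abel's summation-by-parts identity on random data:
# sum a_k (s_k - s_{k-1}) = a_n s_n - a_0 s_0 - sum s_{k-1} (a_k - a_{k-1}).
import random

rng = random.Random(42)
n = 20
a = [rng.uniform(-1, 1) for _ in range(n + 1)]
s = [rng.uniform(-1, 1) for _ in range(n + 1)]

lhs = sum(a[k] * (s[k] - s[k - 1]) for k in range(1, n + 1))
rhs = a[n] * s[n] - a[0] * s[0] - sum(s[k - 1] * (a[k] - a[k - 1])
                                      for k in range(1, n + 1))
print(abs(lhs - rhs))  # ~0 up to floating-point rounding
```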

We will also need a bivariate version of the integration by parts formula. For this, we will use [16]:

Proposition 4.3.2 Let $f(u,v)$ be of bounded variation over $[a,b]\times[c,d]$, and let $F_1(u)$ and $F_2(v)$ be differentiable. Then the following integration by parts formula holds:
$$\int_c^d\!\!\int_a^b f(u,v)\, F_1'(u)\, F_2'(v)\,du\,dv$$
$$= F_1(b)\cdot F_2(d)\cdot f(b,d) - F_1(a)\cdot F_2(d)\cdot f(a,d) - F_1(b)\cdot F_2(c)\cdot f(b,c) + F_1(a)\cdot F_2(c)\cdot f(a,c)$$
$$\quad + F_2(c)\int_a^b F_1(u)\,d_u f(u,c) - F_2(d)\int_a^b F_1(u)\,d_u f(u,d) + F_1(a)\int_c^d F_2(v)\,d_v f(a,v) - F_1(b)\int_c^d F_2(v)\,d_v f(b,v)$$
$$\quad + \int_c^d\!\!\int_a^b F_1(u)\, F_2(v)\,d_{u,v} f(u,v).$$

In particular, if $F_1$ and $F_2$ are zero on the boundary, then we have:
$$\int_c^d\!\!\int_a^b f(u,v)\, F_1'(u)\, F_2'(v)\,du\,dv = \int_c^d\!\!\int_a^b F_1(u)\, F_2(v)\,d_{u,v} f(u,v).$$
Furthermore, by splitting $f(u,v)$ into its monotonic components, we have:

$$\left|\int_c^d\!\!\int_a^b f(u,v)\, F_1'(u)\, F_2'(v)\,du\,dv\right| \le \int_c^d\!\!\int_a^b\left|F_1(u)\, F_2(v)\right|d_{u,v}\mathrm{Var}_{a,c}^{u,v}(f).$$

4.4 Main theorem and proof

With all of the above tools now assembled, we are ready to extend theorem 2.1.1.

Theorem 4.4.1 Let $f \in BV[\mathbb{T}^2]$ (in the Hardy-Krause sense). Let $H_\Phi = (h^\Phi_{m,j})$ and $H_\Psi = (h^\Psi_{n,k})$ be two regular Hausdorff methods having the property that, for some constants $\alpha, \beta \in (0,1]$:
$$\left|\sum_{j=0}^{m} h^\Phi_{m,j}\sin(ju)\right| = O\!\left(\frac{1}{(um)^\alpha}\right),\qquad u \in (\tfrac{\pi}{m}, \pi],\quad m = 1, 2, \ldots,$$
$$\left|\sum_{k=0}^{n} h^\Psi_{n,k}\sin(kv)\right| = O\!\left(\frac{1}{(vn)^\beta}\right),\qquad v \in (\tfrac{\pi}{n}, \pi],\quad n = 1, 2, \ldots.$$
Then we have the following result for $M, N \ge 2$:
$$\sum_{m>M}\sum_{n>N}\left|S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)\right| \le \frac{K}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}\, k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right),$$
where $K$ is a constant, $\mathrm{Var}(\frac{\pi}{j}, \frac{\pi}{k})$ is as defined in (4.3) and (4.2), and $\phi_{x,y}$ is as defined in (4.1).

Proof

By lemma 4.2.1:

$$\sum_{m>M}\sum_{n>N}\left|S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)\right| = \sum_{m>M}\sum_{n>N}\frac{1}{mn\pi^2}\left|\int_0^\pi\!\!\int_0^\pi \phi_{x,y}(u,v)\, G'_{\Phi,m}(u)\, G'_{\Psi,n}(v)\,du\,dv\right|,$$
where $G_{\Phi,m}(u)$ and $G_{\Psi,n}(v)$ are defined in the lemma. Applying proposition 4.3.2, and noticing that $\sin(j\pi) = 0$ for every $j \in \mathbb{Z}$, we get:

$$\sum_{m>M}\sum_{n>N}\left|S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)\right| \le \frac{1}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_0^\pi\!\!\int_0^\pi\left|G_{\Phi,m}(u)\, G_{\Psi,n}(v)\right|d_{u,v}\mathrm{Var}(u,v).$$
Or:

$$(4.15)\qquad \sum_{m>M}\sum_{n>N}\left|S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)\right| \le \Delta_1 + \Delta_2 + \Delta_3 + \Delta_4,$$
where:
$$\Delta_1 = \frac{1}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_0^{\pi/n}\!\!\int_0^{\pi/m}\left|G_{\Phi,m}(u)\, G_{\Psi,n}(v)\right|d_{u,v}\mathrm{Var}(u,v),$$
$$\Delta_2 = \frac{1}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_{\pi/n}^{\pi}\!\!\int_0^{\pi/m}\left|G_{\Phi,m}(u)\, G_{\Psi,n}(v)\right|d_{u,v}\mathrm{Var}(u,v),$$
$$\Delta_3 = \frac{1}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_0^{\pi/n}\!\!\int_{\pi/m}^{\pi}\left|G_{\Phi,m}(u)\, G_{\Psi,n}(v)\right|d_{u,v}\mathrm{Var}(u,v),$$
$$\Delta_4 = \frac{1}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_{\pi/n}^{\pi}\!\!\int_{\pi/m}^{\pi}\left|G_{\Phi,m}(u)\, G_{\Psi,n}(v)\right|d_{u,v}\mathrm{Var}(u,v).$$
We will look at each of these regions of integration separately.

First, for $\Delta_1$, we use the bounds $|G_{\Phi,m}(u)| \le c_1 mu$ and $|G_{\Psi,n}(v)| \le c_2 nv$ for some positive constants $c_1$ and $c_2$, to get:
$$\Delta_1 \le \frac{c_1 c_2}{\pi^2}\sum_{m>M}\sum_{n>N}\int_0^{\pi/n}\!\!\int_0^{\pi/m} uv\,d_{u,v}\mathrm{Var}(u,v) = \frac{c_1 c_2}{\pi^2}\sum_{m>M}\sum_{n>N}\int_0^{\pi/N}\!\!\int_0^{\pi/M} I[um \le \pi]\cdot I[vn \le \pi]\,(uv)\,d_{u,v}\mathrm{Var}(u,v),$$
where $I$ is the indicator function as defined in (2.2). Therefore:
$$\Delta_1 \le \frac{c_1 c_2}{\pi^2}\int_0^{\pi/N}\!\!\int_0^{\pi/M}\left(\sum_{M<m\le\pi/u}\ \sum_{N<n\le\pi/v} uv\right)d_{u,v}\mathrm{Var}(u,v) \le c_1 c_2\,\mathrm{Var}\!\left(\frac{\pi}{M}, \frac{\pi}{N}\right),$$
since the inner sums contain at most $\pi/u$ and $\pi/v$ terms, respectively. As in chapter 2, we bound this variation by a weighted mean: with $p_{j,k} = \dfrac{d\cdot j^{\alpha-1} k^{\beta-1}}{M^\alpha N^\beta}$ we have $\mathrm{Var}(\frac{\pi}{M}, \frac{\pi}{N}) \le \sum_{j=1}^{M}\sum_{k=1}^{N} p_{j,k}\,\mathrm{Var}(\frac{\pi}{j}, \frac{\pi}{k})$, and we must check that $d$ remains bounded. Notice that:

$$1 = \sum_{j=1}^{M}\sum_{k=1}^{N} p_{j,k} = \frac{d}{M^\alpha N^\beta}\left(\sum_{j=1}^{M} j^{\alpha-1}\right)\left(\sum_{k=1}^{N} k^{\beta-1}\right) \ge \frac{d}{M^\alpha N^\beta}\left(\int_1^M j^{\alpha-1}\,dj\right)\left(\int_1^N k^{\beta-1}\,dk\right) = \frac{d}{\alpha\beta\, M^\alpha N^\beta}\left(M^\alpha - 1\right)\left(N^\beta - 1\right).$$
Therefore:

$$d \le \alpha\beta\cdot\frac{M^\alpha}{M^\alpha - 1}\cdot\frac{N^\beta}{N^\beta - 1} = \alpha\beta\left(1 + \frac{1}{M^\alpha - 1}\right)\left(1 + \frac{1}{N^\beta - 1}\right) \le \alpha\beta\left(1 + \frac{1}{2^\alpha - 1}\right)\left(1 + \frac{1}{2^\beta - 1}\right),$$
since $M, N \ge 2$. Bounding $\mathrm{Var}\!\left(\frac{\pi}{M}, \frac{\pi}{N}\right)$ in $\Delta_1$ by this weighted mean yields, for some constant $C_1$:

$$(4.16)\qquad \Delta_1 \le \frac{C_1}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$

Next, for $\Delta_2$, we use the bounds $|G_{\Phi,m}(u)| \le c_1 mu$ and $|G_{\Psi,n}(v)| \le \dfrac{c_3}{(nv)^\beta}$ (the assumed bound in the theorem) for some positive constants $c_1$ and $c_3$. This gives:
$$\Delta_2 \le \frac{c_1 c_3}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{mn}\int_{\pi/n}^{\pi}\!\!\int_0^{\pi/m}\frac{mu}{n^\beta v^\beta}\,d_{u,v}\mathrm{Var}(u,v) = \frac{c_1 c_3}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi}\!\!\int_0^{\pi/M}\frac{u}{v^\beta}\, I[um \le \pi]\,d_{u,v}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\pi^2}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi}\!\!\int_0^{\pi/M}\frac{u}{v^\beta}\left(\sum_{M<m\le\pi/u} 1\right)d_{u,v}\mathrm{Var}(u,v) \le \frac{c_1 c_3}{\pi}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_0^{\pi/M}\!\!\int_{\pi/n}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v),$$
since the inner sum contains at most $\pi/u$ terms. This implies that:

$$(4.17)\qquad \Delta_2 \le \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) + \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v).$$
We will look at each of the above terms separately. First:

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) = \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\sum_{n>N}\frac{I[nv > \pi]}{n^{1+\beta}}\right)d_{v,u}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\sum_{n>\pi/v}\frac{1}{n^{1+\beta}}\right)d_{v,u}\mathrm{Var}(u,v) = \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\sum_{n=[\pi/v]+1}^{\infty}\frac{1}{n^{1+\beta}}\right)d_{v,u}\mathrm{Var}(u,v),$$
where $[\frac{\pi}{v}]$ represents the greatest integer $\le \frac{\pi}{v}$. Then:

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\frac{1}{([\pi/v]+1)^{1+\beta}} + \sum_{n=[\pi/v]+2}^{\infty}\frac{1}{n^{1+\beta}}\right)d_{v,u}\mathrm{Var}(u,v)$$
$$\le \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\frac{1}{([\pi/v]+1)^{1+\beta}} + \int_{[\pi/v]+1}^{\infty}\frac{1}{n^{1+\beta}}\,dn\right)d_{v,u}\mathrm{Var}(u,v)$$
$$\le \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\frac{1}{(\pi/v)^{\beta+1}} + \int_{\pi/v}^{\infty}\frac{1}{n^{1+\beta}}\,dn\right)d_{v,u}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\left(\frac{v^{\beta+1}}{\pi^{\beta+1}} + \frac{v^\beta}{\beta\pi^\beta}\right)d_{v,u}\mathrm{Var}(u,v) \le \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\!\!\int_0^{\pi/N}\frac{1}{v^\beta}\cdot\frac{2v^\beta}{\beta\pi^\beta}\,d_{v,u}\mathrm{Var}(u,v),$$
since $0 < \beta \le 1 < N \le \frac{\pi}{v}$. Therefore:

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{2 c_1 c_3}{\beta\pi^{\beta+1}}\int_0^{\pi/M}\!\!\int_0^{\pi/N} d_{v,u}\mathrm{Var}(u,v) = \frac{2 c_1 c_3}{\beta\pi^{\beta+1}}\,\mathrm{Var}\!\left(\frac{\pi}{M}, \frac{\pi}{N}\right).$$

Notice the similarity between this expression and the one which we encountered in $\Delta_1$. By the same argument, we have, for some positive constant $\widehat{C}_1$:

$$(4.18)\qquad \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{\widehat{C}_1}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$
As for the second term in (4.17):

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) = \frac{c_1 c_3}{\pi}\left(\sum_{n>N}\frac{1}{n^{1+\beta}}\right)\int_0^{\pi/M}\!\!\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v)$$
$$\le \frac{c_1 c_3}{\pi}\left(\int_N^{\infty}\frac{1}{n^{1+\beta}}\,dn\right)\int_0^{\pi/M}\!\!\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) = \frac{c_1 c_3}{\beta\pi N^\beta}\int_0^{\pi/M}\!\!\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\beta\pi N^\beta}\sum_{k=1}^{N-1}\int_0^{\pi/M}\!\!\int_{\pi/(k+1)}^{\pi/k}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\sum_{k=1}^{N-1}(k+1)^\beta\int_0^{\pi/M}\!\!\int_{\pi/(k+1)}^{\pi/k} d_{v,u}\mathrm{Var}(u,v)$$
$$= \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\sum_{k=1}^{N-1}(k+1)^\beta\left[\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) - \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k+1}) - \mathrm{Var}(0, \tfrac{\pi}{k}) + \mathrm{Var}(0, \tfrac{\pi}{k+1})\right]$$
$$= \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\sum_{k=1}^{N-1}(k+1)^\beta\left[\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) - \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k+1})\right] - \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\sum_{k=1}^{N-1}(k+1)^\beta\left[\mathrm{Var}(0, \tfrac{\pi}{k}) - \mathrm{Var}(0, \tfrac{\pi}{k+1})\right]$$
$$\le \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\sum_{k=1}^{N-1}(k+1)^\beta\left[\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) - \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k+1})\right],$$
as $\mathrm{Var}(0, \tfrac{\pi}{k}) \ge \mathrm{Var}(0, \tfrac{\pi}{k+1})$, so we were subtracting away a positive amount. Applying Abel's summation by parts formula (proposition 4.3.1) gives:

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\left[-\sum_{k=1}^{N-1}(k+1)^\beta\left(\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k+1}) - \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k})\right)\right]$$
$$= \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\left[-N^\beta\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{N}) + \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{1}) + \sum_{k=1}^{N-1}\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k})\left((k+1)^\beta - k^\beta\right)\right].$$

Since $(k+1)^\beta - k^\beta$ is $O(k^{\beta-1})$, we have, for some positive constant $c_4$:
$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\left[-N^\beta\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{N}) + \mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{1}) + c_4\sum_{k=1}^{N-1} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k})\right]$$
$$\le \frac{c_1 c_3}{\beta\pi^{\beta+1} N^\beta}\left[\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{1}) + c_4\sum_{k=1}^{N-1} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k})\right],$$
since $N^\beta\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{N}) \ge 0$. Notice that the first term in the brackets resembles the $k = 1$ term in the sum, so for some positive constant $\widetilde{C}$:

$$\frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{\widetilde{C}}{N^\beta}\sum_{k=1}^{N-1} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) \le \frac{\widetilde{C}}{N^\beta}\sum_{k=1}^{N} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}).$$

As we did in bounding $\Delta_1$ (or perhaps closer to what we did in chapter 1, since here we are averaging over a single summation), we can look at the weighted mean:
$$\sum_{j=1}^{M} p_j\left(\frac{\widetilde{C}}{N^\beta}\sum_{k=1}^{N} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{j}, \tfrac{\pi}{k})\right).$$
Note that for any fixed $k$, $\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) \le \mathrm{Var}(\tfrac{\pi}{j}, \tfrac{\pi}{k})$ for every $1 \le j \le M$. Therefore,
$$\frac{\widetilde{C}}{N^\beta}\sum_{k=1}^{N} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{M}, \tfrac{\pi}{k}) \le \sum_{j=1}^{M} p_j\left(\frac{\widetilde{C}}{N^\beta}\sum_{k=1}^{N} k^{\beta-1}\,\mathrm{Var}(\tfrac{\pi}{j}, \tfrac{\pi}{k})\right),$$
provided $0 \le p_j \le 1$ and $\sum_{j=1}^{M} p_j = 1$. As we did before, we will let $p_j = \dfrac{d\cdot j^{\alpha-1}}{M^\alpha}$, and we can follow the same reasoning to show that there exists such a $d$ which remains bounded as $M \to \infty$. Hence, for some constant $\widehat{C}_2$, we have:

$$(4.19)\qquad \frac{c_1 c_3}{\pi}\int_0^{\pi/M}\sum_{n>N}\frac{1}{n^{1+\beta}}\int_{\pi/N}^{\pi}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v) \le \frac{\widehat{C}_2}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$

Substituting (4.18) and (4.19) into (4.17) gives, for some positive constant $C_2$:
$$(4.20)\qquad \Delta_2 \le \frac{C_2}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$

As for $\Delta_3$, we use the bounds $|G_{\Phi,m}(u)| \le \dfrac{c_4}{(mu)^\alpha}$ (the assumed bound in the theorem) and $|G_{\Psi,n}(v)| \le c_2 nv$, for some positive constants $c_2$ and $c_4$. Except for the constants, this is the same situation as $\Delta_2$ with the variables reversed. Hence, using the same reasoning, we have for some positive constant $C_3$:
$$(4.21)\qquad \Delta_3 \le \frac{C_3}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$

Finally, for $\Delta_4$, we use the bounds $|G_{\Phi,m}(u)| \le \dfrac{c_4}{(mu)^\alpha}$ and $|G_{\Psi,n}(v)| \le \dfrac{c_3}{(nv)^\beta}$ (the assumed bounds in the theorem). Thus,
$$\Delta_4 \le \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1} n^{\beta+1}}\int_{\pi/n}^{\pi}\!\!\int_{\pi/m}^{\pi}\frac{1}{u^\alpha v^\beta}\,d_{u,v}\mathrm{Var}(u,v)$$
$$(4.22)\qquad \le \delta_1 + \delta_2 + \delta_3 + \delta_4,$$
where,
$$\delta_1 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1} n^{\beta+1}}\int_{\pi/n}^{\pi/N}\!\!\int_{\pi/m}^{\pi/M}\frac{1}{u^\alpha v^\beta}\,d_{u,v}\mathrm{Var}(u,v),$$
$$\delta_2 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1} n^{\beta+1}}\int_{\pi/n}^{\pi/N}\!\!\int_{\pi/M}^{\pi}\frac{1}{u^\alpha v^\beta}\,d_{u,v}\mathrm{Var}(u,v),$$
$$\delta_3 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1} n^{\beta+1}}\int_{\pi/N}^{\pi}\!\!\int_{\pi/m}^{\pi/M}\frac{1}{u^\alpha v^\beta}\,d_{u,v}\mathrm{Var}(u,v),$$
$$\delta_4 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1} n^{\beta+1}}\int_{\pi/N}^{\pi}\!\!\int_{\pi/M}^{\pi}\frac{1}{u^\alpha v^\beta}\,d_{u,v}\mathrm{Var}(u,v).$$

We will start with $\delta_1$:
$$\delta_1 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\frac{1}{m^{\alpha+1}}\int_{\pi/m}^{\pi/M}\frac{1}{u^\alpha}\left(\sum_{n>N}\frac{1}{n^{\beta+1}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v)\right).$$
Examining the term in parentheses, and comparing it to the first term of (4.17), we can see the similarity. By following the same logic we used between (4.17) and (4.18), we have for some positive constant $\widetilde{K}_1$:
$$\delta_1 \le \widetilde{K}_1\sum_{m>M}\frac{1}{m^{\alpha+1}}\int_0^{\pi/N}\!\!\int_{\pi/m}^{\pi/M}\frac{1}{u^\alpha}\,d_{u,v}\mathrm{Var}(u,v).$$
Repeating the same logic with the index of summation $n$ replaced with $m$ and the integration variable $v$ replaced with $u$, and finishing as we did at (4.18), we get for some positive constant $K_1$:

$$(4.23)\qquad \delta_1 \le \frac{K_1}{M^\alpha N^\beta}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1} k^{\beta-1}\,\mathrm{Var}\!\left(\frac{\pi}{j}, \frac{\pi}{k}\right).$$

Next we will work on $\delta_2$:
$$\delta_2 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\frac{1}{m^{\alpha+1}}\int_{\pi/M}^{\pi}\frac{1}{u^\alpha}\left(\sum_{n>N}\frac{1}{n^{\beta+1}}\int_{\pi/n}^{\pi/N}\frac{1}{v^\beta}\,d_{v,u}\mathrm{Var}(u,v)\right).$$

Treating the term in parentheses in the same way as we did for $\delta_1$, we get for some positive constant $\widetilde{K}_2$:
$$\delta_2 \le \widetilde{K}_2\sum_{m>M}\frac{1}{m^{\alpha+1}}\int_0^{\pi/N}\!\!\int_{\pi/M}^{\pi}\frac{1}{u^\alpha}\,d_{u,v}\mathrm{Var}(u,v).$$
Now the remaining expression resembles the second term of (4.17). Following the same logic as we did from (4.18) to (4.19) gives, for some positive constant $K_2$:

\[
(4.24)\qquad \delta_2 \le \frac{K_2}{M^{\alpha}N^{\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right).
\]

As for $\delta_3$, notice that it is equivalent to $\delta_2$ with the variables $u$ and $v$ (and the summation indices) interchanged. So, following the same reasoning, we have for some positive constant $K_3$:
\[
(4.25)\qquad \delta_3 \le \frac{K_3}{M^{\alpha}N^{\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right).
\]

This leaves only $\delta_4$:
\[
\delta_4 = \frac{c_3 c_4}{\pi^2}\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1}n^{\beta+1}}\int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{1}{u^{\alpha}v^{\beta}}\,d_{u,v}\operatorname{Var}(u,v)
= \frac{c_3 c_4}{\pi^2}\int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\left(\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1}n^{\beta+1}}\right)\frac{1}{u^{\alpha}v^{\beta}}\,d_{u,v}\operatorname{Var}(u,v).
\]
Recall that
\[
\sum_{m>M}\frac{1}{m^{\alpha+1}} \le \int_{M}^{\infty}\frac{dm}{m^{\alpha+1}} = \frac{1}{\alpha M^{\alpha}}.
\]
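The same comparison applies to the sum over $n$, so the double tail factors into a product; this is the step that produces the $\frac{1}{M^{\alpha}N^{\beta}}$ in (4.26) below:
\[
\sum_{m>M}\sum_{n>N}\frac{1}{m^{\alpha+1}n^{\beta+1}}
= \left(\sum_{m>M}\frac{1}{m^{\alpha+1}}\right)\!\left(\sum_{n>N}\frac{1}{n^{\beta+1}}\right)
\le \frac{1}{\alpha M^{\alpha}}\cdot\frac{1}{\beta N^{\beta}}.
\]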

Therefore, for some positive constant $\tilde K_4$:
\[
(4.26)\qquad \delta_4 \le \frac{\tilde K_4}{M^{\alpha}N^{\beta}}\int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{1}{u^{\alpha}v^{\beta}}\,d_{u,v}\operatorname{Var}(u,v).
\]

Using our two-variable form of integration by parts (4.3.2), we have:

\[
\int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{1}{u^{\alpha}v^{\beta}}\,d_{u,v}\operatorname{Var}(u,v)
= \frac{1}{\pi^{\alpha+\beta}}\left[\operatorname{Var}(\pi,\pi) - M^{\alpha}\operatorname{Var}\!\left(\frac{\pi}{M},\pi\right) - N^{\beta}\operatorname{Var}\!\left(\pi,\frac{\pi}{N}\right) + M^{\alpha}N^{\beta}\operatorname{Var}\!\left(\frac{\pi}{M},\frac{\pi}{N}\right)\right]
\]
\[
\qquad + \frac{N^{\beta}}{\pi^{\beta}}\int_{\pi/M}^{\pi}\frac{\alpha\operatorname{Var}(u,\frac{\pi}{N})}{u^{\alpha+1}}\,du
+ \frac{M^{\alpha}}{\pi^{\alpha}}\int_{\pi/N}^{\pi}\frac{\beta\operatorname{Var}(\frac{\pi}{M},v)}{v^{\beta+1}}\,dv
- \frac{1}{\pi^{\beta}}\int_{\pi/M}^{\pi}\frac{\alpha\operatorname{Var}(u,\pi)}{u^{\alpha+1}}\,du
- \frac{1}{\pi^{\alpha}}\int_{\pi/N}^{\pi}\frac{\beta\operatorname{Var}(\pi,v)}{v^{\beta+1}}\,dv
+ \int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{\alpha\beta\operatorname{Var}(u,v)}{u^{\alpha+1}v^{\beta+1}}\,du\,dv
\]
\[
\le \frac{1}{\pi^{\alpha+\beta}}\left[\operatorname{Var}(\pi,\pi) + M^{\alpha}N^{\beta}\operatorname{Var}\!\left(\frac{\pi}{M},\frac{\pi}{N}\right)\right]
+ \frac{N^{\beta}}{\pi^{\beta}}\int_{\pi/M}^{\pi}\frac{\alpha\operatorname{Var}(u,\frac{\pi}{N})}{u^{\alpha+1}}\,du
+ \frac{M^{\alpha}}{\pi^{\alpha}}\int_{\pi/N}^{\pi}\frac{\beta\operatorname{Var}(\frac{\pi}{M},v)}{v^{\beta+1}}\,dv
+ \int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{\alpha\beta\operatorname{Var}(u,v)}{u^{\alpha+1}v^{\beta+1}}\,du\,dv,
\]
since all the subtracted terms were positive. Of the remaining five terms, the first is the same as our bound when $j = k = 1$ and can be absorbed into it. The second, third, and fourth terms all resemble terms which we have already bounded (in $\Delta_1$, $\Delta_2$, and $\Delta_3$, respectively). Thus, only the double integral is left to bound. Make the change of variables:

\[
s = \frac{\pi}{u}, \qquad t = \frac{\pi}{v}.
\]

The Jacobian of the inverse transformation is:

\[
J = \frac{\pi^2}{s^2 t^2}.
\]
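In detail: the inverse transformation is $u = \pi/s$, $v = \pi/t$, and since the map is diagonal, the Jacobian determinant is the product of the two derivatives,
\[
J = \det\begin{pmatrix} \partial u/\partial s & \partial u/\partial t \\ \partial v/\partial s & \partial v/\partial t \end{pmatrix}
= \left(-\frac{\pi}{s^2}\right)\left(-\frac{\pi}{t^2}\right) = \frac{\pi^2}{s^2 t^2},
\]
which is positive, so it is also the factor that enters the change-of-variables formula.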

So:

\[
\int_{\pi/N}^{\pi}\int_{\pi/M}^{\pi}\frac{\alpha\beta\operatorname{Var}(u,v)}{u^{\alpha+1}v^{\beta+1}}\,du\,dv
= \int_{1}^{N}\int_{1}^{M}\frac{\alpha\beta\operatorname{Var}\!\left(\frac{\pi}{s},\frac{\pi}{t}\right)}{(\pi/s)^{\alpha+1}(\pi/t)^{\beta+1}}\cdot\frac{\pi^2}{s^2t^2}\,ds\,dt
= \frac{\alpha\beta}{\pi^{\alpha+\beta}}\int_{1}^{N}\int_{1}^{M}\frac{\operatorname{Var}\!\left(\frac{\pi}{s},\frac{\pi}{t}\right)}{s^{1-\alpha}t^{1-\beta}}\,ds\,dt
\]
\[
= \frac{\alpha\beta}{\pi^{\alpha+\beta}}\sum_{m=1}^{M-1}\sum_{n=1}^{N-1}\int_{n}^{n+1}\int_{m}^{m+1}\frac{\operatorname{Var}\!\left(\frac{\pi}{s},\frac{\pi}{t}\right)}{s^{1-\alpha}t^{1-\beta}}\,ds\,dt
\le \frac{\alpha\beta}{\pi^{\alpha+\beta}}\sum_{m=1}^{M-1}\sum_{n=1}^{N-1}\operatorname{Var}\!\left(\frac{\pi}{m},\frac{\pi}{n}\right)\int_{n}^{n+1}\int_{m}^{m+1}\frac{ds\,dt}{s^{1-\alpha}t^{1-\beta}}
\]
\[
= \frac{1}{\pi^{\alpha+\beta}}\sum_{m=1}^{M-1}\sum_{n=1}^{N-1}\operatorname{Var}\!\left(\frac{\pi}{m},\frac{\pi}{n}\right)\left[(m+1)^{\alpha}-m^{\alpha}\right]\left[(n+1)^{\beta}-n^{\beta}\right]
\le \frac{\hat K_4}{\pi^{\alpha+\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right),
\]

for some constant $\hat K_4$, after changing the summation indices as indicated. Thus, plugging this bound, along with the bounds for the four other terms, back into (4.26), we have for some positive constant $K_4$:
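The step from the increments $(m+1)^{\alpha}-m^{\alpha}$ and $(n+1)^{\beta}-n^{\beta}$ to the powers $j^{\alpha-1}$ and $k^{\beta-1}$ uses only the mean value theorem: for some $\xi_m \in (m, m+1)$,
\[
(m+1)^{\alpha} - m^{\alpha} = \alpha\,\xi_m^{\alpha-1} \le \alpha \max\!\left(1, 2^{\alpha-1}\right) m^{\alpha-1},
\]
since $\xi_m^{\alpha-1} \le m^{\alpha-1}$ when $\alpha \le 1$, while $\xi_m^{\alpha-1} \le (2m)^{\alpha-1}$ when $\alpha > 1$. The same argument applies to $(n+1)^{\beta}-n^{\beta}$, and the resulting constants are absorbed into the constant in the final bound.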

\[
(4.27)\qquad \delta_4 \le \frac{K_4}{M^{\alpha}N^{\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right).
\]

Substituting (4.23), (4.24), (4.25), and (4.27) back into (4.22) gives, for some positive constant $C_4$:

\[
(4.28)\qquad \Delta_4 \le \frac{C_4}{M^{\alpha}N^{\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right).
\]

Finally, substituting (4.16), (4.20), (4.21), and (4.28) back into (4.15) gives for some

positive constant K:

\[
\sum_{m>M}\sum_{n>N}\left|S_{m,n}^{\Phi,\Psi}(f) - S_{m-1,n}^{\Phi,\Psi}(f) - S_{m,n-1}^{\Phi,\Psi}(f) + S_{m-1,n-1}^{\Phi,\Psi}(f)\right|
\le \frac{K}{M^{\alpha}N^{\beta}}\sum_{j=1}^{M}\sum_{k=1}^{N} j^{\alpha-1}k^{\beta-1}\operatorname{Var}\!\left(\frac{\pi}{j},\frac{\pi}{k}\right).
\]

4.5 Example illustrating the sharpness of the bound

To show the sharpness of the bound, consider the function $f$ defined by:

\[
f(x,y) = \left(\sum_{j=1}^{\infty}\frac{\sin jx}{j}\right)\left(\sum_{k=1}^{\infty}\frac{\sin ky}{k}\right)
= \begin{cases}
\dfrac{\pi-x}{2}\cdot\dfrac{\pi-y}{2} & \text{when } (x,y)\in(0,2\pi)\times(0,2\pi),\\[4pt]
0 & \text{when } x=0 \text{ or } y=0,
\end{cases}
\]
and extend it to the entire $(x,y)$ plane by defining $f(x,y) = f(\hat x,\hat y)$, where $\hat x = x + 2\pi m$ for some integer $m$, $\hat y = y + 2\pi n$ for some integer $n$, and $(\hat x,\hat y)\in[0,2\pi)\times[0,2\pi)$. Since the variables $x$ and $y$ are separated, the tensor product summability transform is the product of the single-variable transforms of the function given in (2.12). Hence, the bound is simply the product of the single-variable bounds, and, as Kunyang and Dan showed [10], this bound is sharp.
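As a quick numerical sanity check of the closed form above, one can compare truncated partial sums of the sawtooth series with $(\pi - x)/2$ and verify that the two-variable function factors. The helper names below are ours, not from the text, and agreement is only approximate since the series is truncated:

```python
import math

def sawtooth_partial(x, terms=20000):
    # Partial sum of the series sum_{j>=1} sin(j*x)/j,
    # which converges to (pi - x)/2 for x in (0, 2*pi).
    return sum(math.sin(j * x) / j for j in range(1, terms + 1))

def f_partial(x, y, terms=20000):
    # Since the variables separate, the double series for f(x, y)
    # is the product of the two single-variable partial sums.
    return sawtooth_partial(x, terms) * sawtooth_partial(y, terms)

x, y = 1.0, 2.5
print(abs(sawtooth_partial(x) - (math.pi - x) / 2))              # small
print(abs(f_partial(x, y) - (math.pi - x) * (math.pi - y) / 4))  # small
```

The pointwise error away from the jump at $x = 0$ decays like $1/N$ for $N$ terms, so a tolerance of $10^{-2}$ is comfortably met here.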

4.6 Future endeavors

First of all, we believe that the results of Chapter 4 could be extended to higher dimensions in an analogous manner. The proof would follow the one given here with only two necessary modifications. First, notice that simply having functions of bounded variation was insufficient for our needs in the two-variable case: we needed the Hardy-Krause definition of bounded variation so that we could refer to function limits that were path independent, and something similar would need to be done in higher dimensions. Second, to mimic our proof, a higher-order version of integration by parts would be necessary. Such extensions exist in the field of geometric measure theory.

Other possible directions for future work are to relax the requirement that the function be of bounded variation, to examine modes of convergence other than pointwise convergence, and to find the discrete analog for discrete Fourier series.

BIBLIOGRAPHY

[1] R. M. Aron, D. Pérez-García, J. B. Seoane-Sepúlveda, Algebrability of the set of non-convergent Fourier series, Studia Mathematica 175 (2006), pp. 83-90.

[2] F. Bayart, Topological and Algebraic Genericity of Divergence and Universality, Studia Mathematica 167 (2005), pp. 153-160.

[3] R. Bojanic, An estimate of the rate of convergence for Fourier series of functions of bounded variation, Publications de l'Institut Mathématique, Nouvelle Série, tome 26 (40) (1979), pp. 57-60.

[4] R. Bojanic, S. M. Mazhar, An estimate of the rate of convergence of Cesàro means of Fourier series of functions of bounded variation, Proceedings of Mathematical Analysis and its Applications, Kuwait (1985), pp. 17-22.

[5] T. H. Gronwall, Zur Gibbsschen Erscheinung, Annals of Mathematics 31 (1930), no. 2, pp. 233-240.

[6] G. H. Hardy, W. W. Rogosinski, Fourier Series, Third Edition, Cambridge University Press, 1965.

[7] N. Humphreys, R. Bojanic, Rate of Convergence for the Absolutely (C, α) Summable Fourier Series of Functions of Bounded Variation, Journal of Approximation Theory 101 (1999), pp. 212-220.

[8] D. Jackson, The Theory of Approximation, American Mathematical Society, New York, 1930.

[9] T. W. Körner, Fourier Analysis, Cambridge University Press, 1988.

[10] W. Kunyang, Y. Dan, On the Absolute (C, α) Convergence for Functions of Bounded Variation, Journal of Approximation Theory 123 (2003), pp. 300-304.

[11] G. Meinardus, Approximation of Functions: Theory and Numerical Methods, Springer-Verlag, Berlin, 1967.

[12] J. A. Osikiewicz, Equivalence results for Cesàro submethods, Analysis 20 (2000), pp. 35-43.

[13] R. E. Powell, S. M. Shah, Summability Theory and Applications, Van Nostrand Reinhold, London, 1972.

[14] W. W. Rogosinski, Fourier Series, Chelsea Publishing Company, New York, 1950.

[15] W. Rudin, Real & Complex Analysis, Third Edition, McGraw-Hill, 1966.

[16] A. Sard, Linear Approximation, American Mathematical Society, 1963.

[17] A. Zygmund, Trigonometric Series, Vol. I, Cambridge University Press, 1959.