POLYNOMIAL INEQUALITIES ON THE HAMMING CUBE
ALEXANDROS ESKENAZIS AND PAATA IVANISVILI
Abstract. Let (X, k · kX ) be a Banach space. The purpose of this article is to systematically investigate dimension independent properties of vector valued functions f : {−1, 1}n → X on the Hamming cube whose spectrum is bounded above or below. Our proofs exploit contractivity properties of the heat flow, induced by the geometry of the target space (X, k · kX ), combined with duality arguments and suitable tools from approximation theory and complex analysis. We obtain a series of improvements of various well-studied estimates for functions with bounded spectrum, including moment comparison results for low degree Walsh polynomials and Bernstein–Markov type inequalities, which constitute discrete vector valued analogues of Freud’s inequality in Gauss space (1971). Many of these inequalities are new even for scalar valued functions. Furthermore, we provide a short proof of Mendel and Naor’s heat smoothing theorem (2014) for functions in tail spaces with values in spaces of nontrivial type and we also prove a dual lower bound on the decay of the heat semigroup acting on functions with spectrum bounded from above. Finally, we improve the reverse Bernstein–Markov inequalities of Meyer (1984) and Mendel and Naor (2014) for functions with narrow enough spectrum and improve the bounds of Filmus, Hatami, Keller and 4 Lifshitz (2016) on the `p sums of influences of bounded functions for p ∈ 1, 3 .
2010 Mathematics Subject Classification. Primary: 42C10; Secondary: 41A17, 41A63, 46B07. Key words. Hamming cube, heat semigroup, hypercontractivity, Bernstein–Markov inequality, moment comparison.
1. Introduction
Fix n ∈ N and let (X, k · kX ) be a Banach space. If p ∈ [1, ∞), the vector valued Lp norm of a function f : {−1, 1}n → X is defined as 1/p def 1 X p kfk n = kf(ε)k . (1) Lp({−1,1} ;X) 2n X ε∈{−1,1}n
def n n As usual, we denote kfkL∞({−1,1} ;X) = maxε∈{−1,1} kf(ε)kX . For a subset A ⊆ {1, . . . , n} the n Q Walsh function wA : {−1, 1} → {−1, 1} is the Boolean function given by wA(ε) = i∈A εi, where n n ε = (ε1, . . . , εn) ∈ {−1, 1} . Every function f : {−1, 1} → X admits an expansion of the form X f = fb(A)wA, (2) A⊆{1,...,n} arXiv:1902.02406v2 [math.FA] 8 Sep 2020 where def 1 X fb(A) = f(δ)wA(δ) ∈ X. (3) 2n δ∈{−1,1}n For i ∈ {1, . . . , n} the i-th partial derivative of such a function f is given by
def f(ε) − f(ε1, . . . , εi−1, −εi, εi+1, . . . , εn) X ∂if(ε) = = fb(A)wA(ε) (4) 2 A⊆{1,...,n} i∈A
P. I. was partially supported by NSF DMS-1856486 and NSF CAREER-1945102. This work was carried out under the auspices of the Simons Algorithms and Geometry (A&G) Think Tank.
1 2 and satisfies ∂i f = ∂if. Therefore, the hypercube Laplacian of f is defined as n def X X ∆f = ∂if = |A|fb(A)wA. (5) i=1 A⊆{1,...,n}
−t∆ Finally, the action of the discrete heat semigroup {e }t>0 on the function f is given by −t∆ def X −t|A| ∀ t > 0, e f = e fb(A)wA. (6) A⊆{1,...,n} A straightforward calculation shows that the heat semigroup can equivalently be expressed as 1 X X X X e−t∆f(ε) = e−t|B|(1 − e−t)n−|B|f ε e + δ e , (7) 2n j j j j δ∈{−1,1}n B⊆{1...,n} j∈B j∈{1,...,n}rB n where {e1, . . . , en} is the orthonormal basis of R , which by convexity implies that for every t > 0 −t∆ n n and p ∈ [1, ∞], ke fkLp({−1,1} ;X) 6 kfkLp({−1,1} ;X). Identity (7) also has a useful probabilistic n n interpretation. For a fixed ε ∈ {−1, 1} and t > 0, consider a random vector η ∈ {−1, 1} such that 1+e−t each coordinate ηi is chosen independently to coincide with εi with probability 2 and with −εi 1−e−t −t∆ with probability 2 . Then, the value e f(ε) is the expectation of the random vector f(η). The main purpose of this paper is to investigate finer contractivity properties of the heat semi- group under suitable assumptions on the spectrum of the function f and the geometry of the target space (X, k · kX ). The common feature in the proofs of most of the following results is a duality argument inspired by classical work of Figiel (see [MS86, Theorem 14.6]), which allows the self- improvement of contractivity properties of the heat semigroup relying on suitable inequalities from classical approximation theory and complex analysis. All Banach spaces in the ensuing discussion will be assumed to be over the field of complex numbers. In the rest of the introduction, we proceed to describe our results in decreasing order of generality. In Section 1.1, we present estimates for functions with spectrum bounded from above and values in a general Banach space. In Section 1.2, we improve the bounds of Section 1.1 under the additional assumption that the target space (X, k·kX ) is K-convex (we postpone the relevant definitions until Section 1.2). We also present a new proof of a theorem of Mendel and Naor [MN14] related to the heat smoothing conjecture. Finally, in Section 1.3 we present explicit estimates which hold true for scalar valued functions. These include various Bernstein–Markov type inequalities and their reverses, estimates on the influences of bounded functions and moment comparison results.
1.1. Estimates for a general Banach space. Fix n ∈ N and let (X, k · kX ) be a Banach space. If d ∈ {0, 1 . . . , n}, we say that a function f : {−1, 1}n → X has degree at most d if fb(A) = 0 when |A| > d and we say that f belongs in the d-th tail space if fb(A) = 0 when |A| < d. Finally, we say that f is d-homogeneous if fb(A) = 0 when |A|= 6 d.
1.1.1. A lower bound on the decay of the heat semigroup. The following theorem establishes a lower bound on the decay of the heat semigroup acting on functions of low degree.
Theorem 1. Fix n, d ∈ N with d ∈ {1, . . . , n} and let (X, k · kX ) be a Banach space. For every p ∈ [1, ∞] and every function f : {−1, 1}n → X of degree at most d, we have
−t∆ 1 ∀ t 0, ke fk n kfk n , (8) > Lp({−1,1} ;X) > t Lp({−1,1} ;X) Td(e ) where Td is the d-th Chebyshev polynomial of the first kind.
2 We note that a weaker bound in the spirit of Theorem1, attributed partially to Oleszkiewicz, was established in [FHKL16, Lemma 5.4] (see Remark 18 below for a comparison with Theorem1). Using Theorem1 and the hypercontractivity of the discrete heat semigroup (see [Bon70]), we deduce the following moment comparison for functions of low degree.
Corollary 2. Fix n, d ∈ N with d ∈ {1, . . . , n} and let (X, k · kX ) be a Banach space. For every p > q > 1 and every function f : {−1, 1}n → X of degree at most d, we have rp − 1 kfk n T kfk n . (9) Lp({−1,1} ;X) 6 d q − 1 Lq({−1,1} ;X) To the extent of our knowledge, the best previously known moment comparison for general vector valued functions of low degree on the discrete cube can be extracted from an argument in the monograph [KW92] of Kwapie´nand Woyczy´nski.In Remark 19 below, we quantify their argument and show that the bounds of [KW92, Proposition 6.5.1] are weaker than those of Corollary2. Another application of Theorem1 is a refinement of a celebrated inequality on the discrete cube due to Pisier [Pis86]. In connection with his work on nonlinear type, Pisier showed that for every n Banach space (X, k · kX ), every p ∈ [1, ∞] and every function f : {−1, 1} → X, we have n 1 X 1 X X p 1/p f − f(δ) (log n + 1) δi∂if . (10) n n 6 n n 2 Lp({−1,1} ;X) 2 Lp({−1,1} ;X) δ∈{−1,1}n δ∈{−1,1}n i=1
For X = R, the right hand side of (10) is equivalent to the Lp norm of the gradient of f due to Khintchine’s inequality [Khi23], thus (10) can be understood as a vector valued Poincar´einequality. The dependence on the dimension n in Pisier’s inequality (10) for various classes of spaces X is fundamental in investigations in the nonlinear geometry of Banach spaces and the Ribe program (see [Nao12] for a detailed discussion around this topic). For general Banach spaces, Talagrand [Tal93] has proven that the factor log n in (10) is asymptotically optimal for every p ∈ [1, ∞), whereas Wagner [Wag00] has shown that when p = ∞, Pisier’s inequality holds with an absolute constant. Here we show the following refinement of Pisier’s inequality for functions of low degree.
Theorem 3. Fix n, d ∈ N with d ∈ {1, . . . , n} and let (X, k · kX ) be a Banach space. For every p ∈ [1, ∞) and every function f : {−1, 1}n → X of degree at most d, we have n 1 X 1 X X p 1/p f − f(δ) 3(log d + 1) δi∂if (11) n n 6 n n 2 Lp({−1,1} ;X) 2 Lp({−1,1} ;X) δ∈{−1,1}n δ∈{−1,1}n i=1 and the log d factor is asymptotically sharp for every p ∈ [1, ∞). 1.1.2. A Bernstein–Markov type inequality for ∆. A variant of the proof of Theorem1 implies the following Bernstein–Markov type inequality, which had previously appeared in [FHKL16].
Theorem 4. Fix n, d ∈ N with d ∈ {1, . . . , n} and let (X, k · kX ) be a Banach space. For every p ∈ [1, ∞] and every function f : {−1, 1}n → X of degree at most d, we have 2 n n k∆fkLp({−1,1} ;X) 6 d kfkLp({−1,1} ;X). (12) In [FHKL16, Lemma 5.4], the Bernstein–Markov type inequality (12) was stated only for real valued functions and p = 1. In the vector valued setting which is of interest here, inequality (12) is sharp for general Banach spaces. Recall that a Banach space (X, k · kX ) has cotype q ∈ [2, ∞] with constant C ∈ (0, ∞) if for every n ∈ N and vectors x1, . . . , xn ∈ X, we have n n 1 q 1/q 1 1/q X X X q εixi > kxikX . (13) 2n X C ε∈{−1,1}n i=1 i=1
3 The fact that every Banach space (X, k · kX ) has cotype q = ∞ with constant C = 1 follows from the triangle inequality, yet having finite cotype is a meaningful structural property of a given space. We refer to the survey [Mau03] for further information on the rich theory of type and cotype. We will show that any improvement of (12), forces the target space X to have finite cotype.
Theorem 5. Let (X, k · kX ) be a Banach space and p ∈ [1, ∞]. Suppose that there exist some n η ∈ (0, 1) and d ∈ N such that for every n ∈ N with d ∈ {1, . . . , n}, every function f : {−1, 1} → X of degree at most d satisfies the inequality 2 n n k∆fkLp({−1,1} ;X) 6 (1 − η)d kfkLp({−1,1} ;X). (14) Then X has finite cotype.
1.2. Estimates for K-convex Banach spaces. Let (X, k·kX ) be a Banach space. The X-valued n n Rademacher projection Rad : Lp({−1, 1} ; X) → Lp({−1, 1} ; X) is the operator given by n n def X ∀ ε ∈ {−1, 1} , Rad(f)(ε) = fb({i})εi. (15) i=1
We say that (X, k · k ) is K-convex (see also [Mau03]) if sup kRadk n n < X n∈N Lp({−1,1} ;X)→Lp({−1,1} ;X) ∞ for some (equivalently, for all) p ∈ (1, ∞). Pisier’s K-convexity theorem [Pis82] asserts that n ∞ a Banach space (X, k · kX ) is K-convex if and only if X does not contain copies of {`1 }n=1 with distortion arbitrarily close to 1. Moreover, both conditions are equivalent to X having nontrivial type, see [Pis73]. In the proofs of most results of this section, we will crucially use the deep fact that the heat semigroup with values in a K-convex space is a bounded analytic semigroup [Pis82].
1.2.1. The heat smoothing conjecture. In [MN14], Mendel and Naor asked whether for every K- convex Banach space (X, k · kX ) and p ∈ (1, ∞) there exist c(p, X),C(p, X) ∈ (0, ∞) such that for n every n ∈ N, every function f : {−1, 1} → X in the d-th tail space satisfies the estimate −t∆ −c(p,X)dt n n ∀ t > 0, ke fkLp({−1,1} ;X) 6 C(p, X)e kfkLp({−1,1} ;X). (16) In the direction of this question, currently known as the heat smoothing conjecture, they showed the following theorem, partially relying on ideas from [Pis07] (see also the work [HMO17] of Heilman, Mossel and Oleszkiewicz for an optimal result when d = 1 and X = C).
Theorem 6 (Mendel–Naor). Let (X, k · kX ) be a K-convex Banach space. For every p ∈ (1, ∞), there exist c(p, X),C(p, X) ∈ (0, ∞) and A(p, X) ∈ [1, ∞) such that for every n, d ∈ N with d ∈ {0, 1, . . . , n − 1} and every function f : {−1, 1}n → X in the d-th tail space, we have
−t∆ −c(p,X)d min{t,tA(p,X)} n n ∀ t > 0, ke fkLp({−1,1} ;X) 6 C(p, X)e kfkLp({−1,1} ;X). (17) In Section3 below, we present a simple proof of Theorem6 relying on a duality argument.
1.2.2. An improved lower bound on the decay of the heat semigroup. Under the assumption that the target space (X, k · kX ) is K-convex, we get the following improvement over Theorem1 (see also equation (45) in Remark 18 for comparison).
Theorem 7. Let (X, k · kX ) be a K-convex Banach space. For every p ∈ (1, ∞), there exist 1 c(p, X),C(p, X) ∈ (0, ∞) and η(p, X) ∈ 2 , 1 such that for every n, d ∈ N with d ∈ {1, . . . , n} and every function f : {−1, 1}n → X of degree at most d, we have
−t∆ −C(p,X)d max{t,tη(p,X)} n n ∀ t > 0, ke fkLp({−1,1} ;X) > c(p, X)e kfkLp({−1,1} ;X). (18)
4 Theorem7 should be understood as the dual of Mendel and Naor’s bound (17) for the heat smoothing conjecture. We conjecture that one can in fact take η(p, X) = 1 in Theorem7 for every p ∈ (1, ∞) and K-convex Banach space (X, k · kX ), but a proof of such a claim appears intractable with the technique presented here. A real valued version of this conjecture for p = 1 has previously appeared in [FHH+14, Section 5]. As in the case of Theorem1, Theorem7 also implies moment comparison for functions of low degree. We postpone the relevant result (Corollary 31) to Section3.
1.2.3. Bernstein–Markov type inequalities for the hypercube Laplacian and the gradient. Under the additional assumption that the target space (X, k · kX ) is K-convex, we can obtain the following asymptotic improvement of the bound of Theorem4.
Theorem 8. Let (X, k · kX ) be a K-convex Banach space. For every p ∈ (1, ∞), there exist α(p, X) ∈ [1, 2) and C(p, X) ∈ (0, ∞) such that for every n, d ∈ N with d ∈ {1, . . . , n} and every function f : {−1, 1}n → X of degree at most d, we have α(p,X) n n k∆fkLp({−1,1} ;X) 6 C(p, X)d kfkLp({−1,1} ;X). (19) We conjecture that the conclusion of Theorem8 holds true with α(p, X) = 1 for every p ∈ (1, ∞) and K-convex Banach space (X, k · kX ) (see also Remark 33 below). Since every K-convex Banach space has finite cotype, the conclusion of Theorem8 is consistent with Theorem5. However, there exist Banach spaces (e.g. X = `1) which have finite cotype but are not K-convex. It remains an interesting (and potentially challenging) open problem to understand whether the dependence on the degree in the vector valued Bernstein–Markov inequality (12) (even for, say, p = 2) can be 2 improved to o(d ) under the minimal assumption that (X, k · kX ) has finite cotype. We conclude this section by presenting a Bernstein–Markov type inequality where the hypercube Laplacian is replaced by the vector valued gradient appearing in Pisier’s inequality (10).
Theorem 9. Let (X, k · kX ) be a K-convex Banach space. For every p ∈ (1, ∞) there exist α(p, X) ∈ [1, 2) and C(p, X) ∈ (0, ∞) such that for every n, d ∈ N with d ∈ {1, . . . , n} and every function f : {−1, 1}n → X of degree at most d, we have n p 1/p 1 X X α(p,X) δi∂if C(p, X)d (log d + 1)kfk n . (20) n n 6 Lp({−1,1} ;X) 2 Lp({−1,1} ;X) δ∈{−1,1}n i=1 1.2.4. A reverse Bernstein–Markov inequality for ∆. In [MN14], Mendel and Naor asked if for every K-convex Banach space (X, k · kX ) and p ∈ (1, ∞) there exists c(p, X) ∈ (0, ∞) such that for every n n ∈ N and d ∈ {0, 1, . . . , n − 1}, every function f : {−1, 1} → X in the d-th tail space satisfies
n n k∆fkLp({−1,1} ;X) > c(p, X)dkfkLp({−1,1} ;X). (21) Similar estimates for scalar valued functions had also been obtained by Meyer in [Mey84]. The following theorem, whose proof relies on a recent inequality of Erd´elyi,contains a result in this direc- tion under the additional assumption that the spectrum of the function f is also bounded above.
Theorem 10. Let (X, k · kX ) be a K-convex Banach space. For every p ∈ (1, ∞) there exists n c(p, X) ∈ (0, ∞) such that for every n, d, m ∈ N with d+m 6 n and every function f : {−1, 1} → X of degree at most d + m which is also in the d-th tail space, we have d k∆fk n c(p, X) kfk n . (22) Lp({−1,1} ;X) > m Lp({−1,1} ;X) In particular, Theorem 10 provides a positive answer to the question of [MN14] in the special case when m = O(1) and improves upon the previously known bounds of Meyer [Mey84] and Mendel and Naor [MN14] when m is a small enough power of d.
5 1.3. Estimates for scalar valued functions. In this section we will present explicit estimates which hold true for scalar valued functions. Even though several of the following results are special cases of theorems from the previous section, we present the full statements for the convenience of the reader not interested in the general vector valued setting. 1.3.1. The decay of the heat semigroup. The optimal bounds that can be derived from our approach for the action of the heat semigroup on functions with bounded spectrum are the following. √ 2 p−1 Theorem 11. For p ∈ [1, ∞], let θp = 2 arcsin p . Then, for every p ∈ (1, ∞), n, d ∈ N with n d ∈ {0, 1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have π π !d t 2π−θp t 2π−θp −t∆ (e + 1) − (e − 1) n n ∀ t > 0, ke fkLp({−1,1} ;C) > π π kfkLp({−1,1} ;C). (23) (et + 1) 2π−θp + (et − 1) 2π−θp n Moreover, for every function f : {−1, 1} → C in the d-th tail space, we have π π !d −t θp −t θp −t∆ (1 + e ) − (1 − e ) n n ∀ t > 0, ke fkLp({−1,1} ;C) 6 π π kfkLp({−1,1} ;C). (24) (1 + e−t) θp + (1 − e−t) θp The above lower bound (23) on the decay of the heat semigroup combined with classical hy- percontractivite estimates [Bon70] implies the following improved moment comparison result for functions of low degree. √ 2 p−1 Corollary 12. For p ∈ [1, ∞], let θp = 2 arcsin p . Then, for every p > q > 1, every n, d ∈ N n with d ∈ {1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have √ √ π √ √ π !d ( p − 1 + q − 1) 2π−θp + ( p − 1 − q − 1) 2π−θp n n kfkLp({−1,1} ;C) 6 √ √ π √ √ π kfkLq({−1,1} ;C). (25) ( p − 1 + q − 1) 2π−θp − ( p − 1 − q − 1) 2π−θp Even though Bonami’s hypercontractive estimates [Bon70] break down at the endpoint p = 1, a n well known trick (see [O’D14, Theorem 9.22]) implies that if f : {−1, 1} → C has degree at most d, d n n kfkL2({−1,1} ;C) 6 e kfkL1({−1,1} ;C). (26) Relying on works of Beckner [Bec75] and Weissler [Wei79], we prove the following improved bound. n Theorem 13. For every n, d ∈ N with d ∈ {1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have d n n kfkL2({−1,1} ;C) 6 (2.69076) kfkL1({−1,1} ;C). (27) 1.3.2. Bernstein–Markov type inequalities. The bounds (19) in the Bernstein–Markov inequality for the hypercube Laplacian take the following explicit form for scalar valued functions. √ 2 p−1 Theorem 14. For p ∈ [1, ∞], let θp = 2 arcsin p . Then, for every n, d ∈ N with d ∈ n {1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have θ 2− p n π n k∆fkLp({−1,1} ;C) 6 10d kfkLp({−1,1} ;C). (28) n For p ∈ [1, ∞] and a function f : {−1, 1} → C, denote by n 1/2 def X 2 k∇fkL ({−1,1}n; ) = (∂if) . (29) p C L ({−1,1}n; ) i=1 p C In contrast to the vector valued Theorem9, we can prove the following improved Bernstein–Markov type inequality for the gradient of scalar valued functions as a consequence of Theorem 14.
6 √ 2 p−1 Theorem 15. For p ∈ [1, ∞], let θp = 2 arcsin p . Then, for every p ∈ (1, ∞), there exists n Cp ∈ (0, ∞) such that for every n, d ∈ N with d ∈ {1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have
θ 2 − p p pπ n Cpd log(d + 1)kfkLp({−1,1} ;C), if p ∈ 1, 2 k∇fkL ({−1,1}n; ) θ . (30) p C 6 1− p 2π n Cpd kfkLp({−1,1} ;C), if p ∈ [2, ∞) The change in the exponent from the range p ∈ (1, 2) to the range p ∈ [2, ∞) and its relation to discrete Riesz transforms is further discussed in Section4. We also postpone until then the statement of some endpoint (p = ∞ and p = 1) Bernstein–Markov inequalities for the discrete gradient (see Proposition 46 and Remark 49 respectively). We finally turn to a problem studied by Filmus, Hatami, Keller and Lifshitz in [FHKL16] (see n also [BB14]). Following their notation, for p ∈ [1, ∞) and a function f : {−1, 1} → C, denote by n (p) def X p Inf f = k∂ifk n . (31) Lp({−1,1} ;C) i=1 Motivated by a question of Aaronson and Ambrainis [AA14], the authors were interested in obtain- ing bounds of the form (p) p Inf f fp(d)kfk n , (32) 6 L∞({−1,1} ;C) 3−p where fp(d) is a polynomial in d. In this direction, they proved (32) with fp(d) = d when p ∈ [1, 2] and fp(d) = d when p ∈ [2, ∞), the latter of which is sharp. Using Theorem 11, we deduce 4 the following improved bounds for p ∈ 1, 3 . 4 Corollary 16. For every p ∈ 1, 3 there exists a constant Kp ∈ (0, ∞) such that for every n, d ∈ N n with d ∈ {1, . . . , n} and every function f : {−1, 1} → C of degree at most d, we have √ 1 2 p−1 (p) 2− π arcsin p p Inf f Kpd kfk n . (33) 6 L∞({−1,1} ;C) 1.3.3. A reverse Bernstein–Markov type inequality for ∇. For the case of scalar valued functions we will also prove the following variant of Theorem 10 for the discrete gradient.
Theorem 17. For every p ∈ (1, ∞) there exists cp ∈ (0, ∞) such that for every n, d, m ∈ N with n d + m 6 n and every function f : {−1, 1} → C of degree at most d + m which is also in the d-th tail space, we have r d k∇fk n c kfk n . (34) Lp({−1,1} ;C) > p m Lp({−1,1} ;C) Acknowledgements. We are indebted to Assaf Naor for many helpful discussions. We are also very grateful to Tam´asErd´elyifor proving the main result of [Erd20] upon our request and to an anonymous referee for sharing with us an argument which improved Corollary 21. Finally, we would like to thank Franoise Lust-Piquard for valuable feedback.
2. Estimates for a general Banach space We first present the proof of Theorem1, the lower bound on the decay of the heat semigroup acting on functions with values in a general Banach space. Recall that the d-th Chebyshev polyno- mial of the first kind Td(x) is the unique polynomial of degree d such that Td(cos θ) = cos(dθ) for every θ ∈ R. Chebyshev’s inequality [BE95, p. 235] asserts that the d-th Chebyshev polynomial of the first kind is characterized by the extremal property ∀ x ∈ R r [−1, 1], |Td(x)| = max |p(x)| : deg(p) 6 d and kpkC([−1,1]) = 1 , (35)
7 where for a continuous function h : K → C on a compact space K, we set khkC(K) = maxx∈K |h(x)|. n Proof of Theorem1. Fix n ∈ N, d ∈ {1, . . . , n}, p ∈ [1, ∞] and let f : {−1, 1} → X be a function of degree at most d. The contractivity of the heat semigroup which we derived from (7), can be ∆ n n rewritten as kx fkLp({−1,1} ;X) 6 kfkLp({−1,1} ;X) for every x ∈ [0, 1]. However, for x ∈ R, ∆ X |A| X |A| ∆ (−x) f(ε) = (−x) fb(A)wA(ε) = x fb(A)wA(−ε) = (x f)(−ε), (36) A⊆{1,...,n} A⊆{1,...,n} ∆ ∆ n n thus k(−x) fkLp({−1,1} ;X) = kx fkLp({−1,1} ;X). Consequently, ∆ n n ∀ x ∈ [−1, 1], kx fkLp({−1,1} ;X) 6 kfkLp({−1,1} ;X). (37) Let µ be any complex measure on [−1, 1]. Then, averaging over (37), we get 1 1 ∆ ∆ x f dµ(x) kx fk n d|µ|(x) n 6 Lp({−1,1} ;X) ˆ Lp({−1,1} ;X) ˆ −1 −1 (38) (37) 1 n n 6 kfkLp({−1,1} ;X) d|µ|(x) = kµkM([−1,1])kfkLp({−1,1} ;X), ˆ−1 where for a complex measure µ on a compact space K, we denote by kµkM(K) the total variation d of µ. For t > 0, consider the linear functional ϕt : span{1, x, . . . , x }, k · kC([−1,1]) → C given by d d X k def X tk ϕt akx = ake , (39) k=0 k=0 t or ϕt(p) = p(e ) when p is a polynomial with deg(p) 6 d. Then, Chebyshev’s inequality (35) can be rewritten as d t ∀ p ∈ span{1, x, . . . , x }, |ϕt(p)| 6 Td(e )kpkC([−1,1]). (40) Therefore, by the Hahn–Banach theorem and the Riesz representation theorem, there exists a t complex measure µt on [−1, 1] such that kµtkM([−1,1]) 6 Td(e ) and for every polynomial p, we have 1 t deg(p) 6 d =⇒ p(x) dµt(x) = p(e ). (41) ˆ−1
Since fb(A) = 0 when |A| > d, applying (38) for the measure µt satisfying (41), we deduce that 1 (40) t∆ (41) ∆ t ∀ t 0, ke fkL ({−1,1}n;X) = x f dµt(x) Td(e )kfkL ({−1,1}n;X), (42) > p n 6 p ˆ−1 Lp({−1,1} ;X) which is equivalent to the desired inequality (8). Remark 18. A weaker lower bound for the decay of the heat semigroup, attributed partially to Oleszkiewicz, was established in [FHKL16, Lemma 5.4] and asserts that for every function f : {−1, 1}n → X of degree at most d, we have −t∆ −td2 n n ke fkLp({−1,1} ;X) > e kfkLp({−1,1} ;X). (43) Inequality (43) was stated in [FHKL16] only for X = R and p = 1, but the argument presented there works in greater generality. Checking that (43) is weaker than the estimate (8) amounts to d2 showing that for y > 1, we have Td(y) 6 y , which can be easily derived from the identity (y + py2 − 1)d + (y − py2 − 1)d ∀ |y| 1,T (y) = . (44) > d 2 Furthermore, using (44), it is elementary to check that √ t 3 max{t, t}d ∀ t > 0,Td(e ) 6 e , (45)
8 therefore inequality (18) is also an improvement over (8) for t ≈ 0.
n Proof of Corollary2. Fix n ∈ N, d ∈ {1, . . . , n}, p > q > 1 and a function f : {−1, 1} → X of degree at most d. Bonami’s hypercontractive inequality [Bon70] (the straightforward vector valued extension of which is due to Borell, see [Bor79]) asserts that
1 p − 1 −t∆ ∀ t log , ke fk n kfk n . (46) > 2 q − 1 Lp({−1,1} ;X) 6 Lq({−1,1} ;X) 1 p−1 Combining (8) and (46), we deduce that for t > 2 log q−1 ,
(8) (46) t −t∆ t n n n kfkLp({−1,1} ;X) 6 Td(e )ke fkLp({−1,1} ;X) 6 Td(e )kfkLq({−1,1} ;X). (47) 1 p−1 Plugging t = 2 log q−1 in (47), we deduce the moment comparison (9). Remark 19. Inequality (9) for d = 1 with some constant C(p, q) originated in the work [Kah64] of q p−1 Kahane and the implicit constant was later improved to q−1 by Kwapie´nin [Kwa76]. The use of hypercontractivity for moment comparison of vector valued Walsh polynomials was initiated by Borell in [Bor79] (see the exposition [Pis78]), who later showed in [Bor84] that inequality (9) holds true with some constant depending only on p, q and d (this had previously been proven for scalar valued functions by Bourgain in [Bou80] via a square function approach). To the extent of our knowledge, the best known dependence in (9) before the present work could be extracted from an argument in the monograph [KW92, Proposition 6.5.1] which goes as follows. For k ∈ {0, 1, . . . , n} n n let Radk : Lp({−1, 1} ; X) → Lp({−1, 1} ; X) be the Rademacher projection on level k given by
n def X ∀ ε ∈ {−1, 1} , Radk(f)(ε) = fb(A)wA(ε). (48) A⊆{1,...,n} |A|=k