CONVEX GEOMETRIC CONNECTIONS TO INFORMATION THEORY
by
Justin Jenkinson
Submitted in partial fulfillment of the requirements
For the degree of Doctor of Philosophy
Department of Mathematics
CASE WESTERN RESERVE UNIVERSITY
May, 2013 CASE WESTERN RESERVE UNIVERSITY
School of Graduate Studies
We hereby approve the dissertation of Justin Jenkinson, candi- date for the the degree of Doctor of Philosophy.
Signed: Stanislaw Szarek
Co-Chair of the Committee
Elisabeth Werner
Co-Chair of the Committee
Elizabeth Meckes
Kenneth Kowalski
Date: April 4, 2013
*We also certify that written approval has been obtained for any proprietary material contained therein. c Copyright by
Justin Jenkinson
2013 TABLE OF CONTENTS
List of Figures ...... v
Acknowledgments ...... vi
Abstract ...... vii
Introduction ...... 1
CHAPTER PAGE
1 Relative Entropy of Convex Bodies ...... 6
1.1 Notation ...... 7 1.2 Background ...... 10 1.2.1 Affine Invariants ...... 11 1.2.2 Associated Bodies ...... 13 1.2.3 Entropy ...... 15 1.3 Mean Width Bodies ...... 16 1.4 Relative Entropies of Cone Measures and Affine Surface Areas . . 24 1.5 Proof of Theorem 1.4.1 ...... 30
2 Geometry of Quantum States ...... 37
2.1 Preliminaries from Convex Geometry ...... 39 2.2 Summary of Volumetric Estimates for Sets of States ...... 44 2.3 Geometric Measures of Entanglement ...... 51 2.4 Ranges of Various Entanglement Measures and the role of the Di- mension ...... 56 2.5 Levy’s Lemma and its Applications to p-Schatten Norms ...... 60 2.6 Concentration for the Support Functions of PPT and S ...... 64 2.7 Geometric Banach-Mazur Distance between PPT and S ...... 66 2.8 Hausdorff Distance between PPT and S in p-Schatten Metrics . . 67 2.9 Maximum Trace Distance from a PPT state to S ...... 70
iii Summary ...... 74
APPENDIX
A Constants in the Spherical Isoperimetric Inequality ...... 75
Bibliography ...... 81
iv LIST OF FIGURES
FIGURE PAGE
1 The Minkowski sum of a triangle and a circle...... 2
2 The polar of the cube is the tetrahedron...... 3
3 The polar of an ellipse is another ellipse...... 4
1.1 The Gauss map of an ellipse, K...... 8
1.2 The support function of K...... 10
A.1 The plots of q3 and q4...... 79
v ACKNOWLEDGMENTS
Many thanks to my advisors Stanislaw Szarek and Elisabeth Werner for providing so much inspiration and motivation and for leading me through this endeavor. The collaboration with Karol and Michal Horodecki was essential to many of the results in Chapter 3 and I am grateful for their participation and comments. I would also like to thank my committee members for many useful comments and suggestions. I would also like to thank the Department of Mathematics, the School of Graduate
Studies, and Case Western Reserve University in general.
This research has been partially supported by grants DMS-0503642, DMS-0652722 and DMS-0801275 from the National Science Foundation.
vi ABSTRACT
Convex Geometric Connections to Information Theory
by
JUSTIN JENKINSON
Convex geometry is a field of mathematics that has experienced rapid growth in recent years and has proven to be an extremely useful perspective in areas of research. Problems in many different fields can be interpreted geometrically which often leads to powerful and surprising results.
This thesis establishes connections between convex geometry and both classical and quantum information theory. We introduce the mean width bodies and illustrate the geometric interpretation they provide for the relative entropy of cone measures of a convex body and its polar. We define relative entropy for convex bodies and its relation to affine isoperimetric inequalities is considered. Other connections are made by considering quantum information theory.
The fundamental objects in quantum information theory are quantum states. The set of states is convex as are some of its important subsets. Therefore, convex geom- etry provides a natural approach to explore quantum states. Fairly sharp estimates are obtained regarding the geometry of quantum states using basic notions in convex geometry. In particular, the distance between the set of states with positive partial transpose and the set of separable states is explored.
vii Finally, the optimal constants for the spherical isoperimetric inequality are pro- vided and generalizations of a concentration inequality are suggested.
viii INTRODUCTION
Convex geometry is a relatively young field of mathematics, yet it is based on simple notions of volume and distance known even to the ancient Greeks. Modern convex geometry has its roots with Hermann Brunn and Hermann Minkowski around the turn of the 20th century. The body of knowledge that grew from their work became known as the Brunn-Minkowski theory. Rolf Schneider [81] describes the Brunn-
Minkowski theory as “...the result of merging two elementary notions for point sets in Euclidean space: vector addition and volume.” One of the central concepts is
n Minkowski addition. If K,L ⊂ R their Minkowski sum is given by:
K + L = {x + y : x ∈ K , y ∈ L}
(see figure 1). Minkowski addition of sets is a fundamental notion in convex geometry and leads to many important results such as isoperimetric inequalities, Hadwiger’s theorem, and the Brunn-Minkowski inequality. The Brunn-Minkowski inequality
n states that for any compact subsets, K and L, of R
|K + L|1/n ≥ |K|1/n + |L|1/n , or, equivalently, for t ∈ [0, 1]
|tK + (1 − t)L| ≥ |K|t |L|1−t .
Here and in what follows |·|, when applied to a set, denotes the volume of the set in the appropriate dimension. The content of the Brunn-Minkowski inequality can 1 Figure 1: The Minkowski sum of a triangle and a circle.
be summarized nicely (if somewhat informally) by noticing that the statements are equivalent to saying that |·|1/n is concave (or, equivalently, that log(|·|) is concave) with respect to Minkowski addition. The Brunn-Minkowski inequality is in fact re- sponsible for much of the modern theory [23, 81]. It was through this inequality that many important connections to other fields where made. Variants of the Brunn-
Minkowski inequality show up in many fields and have been proven to be extremely useful.
The classical isoperimetric inequality compares the length of a closed curve in the plane and the area which it encloses. This idea has been generalized and variants exist in many fields of mathematics. The classic survey by Osserman [73] shows how pervasive these ideas are. Isoperimetric inequalities will be discussed in more detail in section 1.2 and A. A geometric perspective is extremely useful for many fields of study. A few that have benefitted in particular are functional analysis, probability, graph theory, linear programming, and information theory.
n Another important notion in convex geometry is duality. For a set K ⊂ R , we
2 define the polar of K (or the dual of K) as
◦ n K = {y ∈ R : hx, yi ≤ 1 for all x ∈ K}
Notice that K◦ is always convex (a set L is convex if x, y ∈ L then for all t ∈ [0, 1], tx + (1 − t)y ∈ L). If K is also convex and contains the origin in its interior, then
◦◦ n n K = K. Additionally, the set K0 of convex bodies in R containing the origin as an interior point is closed under the polar operation, K 7→ K◦. The polar operation is also contravariant. That is, if K ⊂ L then L◦ ⊂ K◦ and (rK)◦ = 1/rK◦ for all
n K,L ⊂ R and r ∈ R. For example, figure 2 demonstrates the polar operation on 3 the cube in R and figure 3 demonstrates that the polar of an ellipse is again another ellipse. An important fact is that If X is a normed space with unit ball K, then the dual space, X∗, has unit ball K◦. This is the reason the term “dual” is sometimes used instead of “polar”. This is also illustrated in figure 2: the polar of the l∞ ball is the l1 ball and vise versa. This duality is a powerful tool for exploring the structure of convex sets and normed spaces.
Figure 2: The polar of the cube is the tetrahedron.
3 Figure 3: The polar of an ellipse is another ellipse.
Information theory has its roots with Claude Shannon [86] who wished to quantify information and to discover the best method of propagation. Information is prop- agated by a process of encoding, transmitting, and decoding. Shannon wished to
find bounds on the amount of information that could be passed through circuits and to find encoding schemes that were optimal for these circuits. Being able to dis- tinguish information from noise is crucial to encoding, decoding, and cryptography and quickly leads to questions of distinguishability of probability distributions [13].
Therefore, central notions in information theory involve measures of distinguisha- bility of probability distributions. These notions of distinguishability (Kolmogorov distance, Bhattacharyya coefficient, Shannon distinguishability [13, 22]) give us a new interpretation of certain mathematical objects and have quantum analogs as well. Quantum information theory arose from the work of Richard Feynman who, in
1982 in an entertaining paper [21] asked, “Can physics be simulated by a universal
4 computer?” The field shifted with the pioneering work of Bennett and Brassard [7] who considered a quantum cryptography. This led the way to quantum comput- ing and quantum algorithms. That is, using quantum interactions to propagate and manipulate information. In 1995 Peter Shor [88] showed that a quantum computer could, in theory, factorize prime numbers in polynomial time, a feat that is thought to be impossible with classical (non-quantum) computers. Shor’s groundbreaking result elevated quantum information theory from a novelty to a priority for cryptographers and security administrations. The secret ingredient that makes quantum information theory powerful and unique from classical information theory is the notion of entan- glement. The mathematical aspects of entanglement are discussed below in sections
2.2 and 2.3. An excellent survey article on the matter is [43].
The results from section 1 are part of a joint work with Prof. Werner and will appear in the Transactions of the AMS [48]. The main results from section 2.5 and 2.9 are part of a collaboration with Prof. Szarek, Karol Horodecki, and Michal
Horodecki [44]. This work is still being prepared for publication. While the results of
Appendix A are rather elementary, in view of their utility they will be also submitted for publication (perhaps after further development).
5 CHAPTER 1
RELATIVE ENTROPY OF CONVEX BODIES
Many ideas in information theory have analogs in convex geometric analysis or can be interpreted geometrically. This may be obvious for some examples, but many surprising connections have been discovered only recently. Geometric inequalities may often be used to obtain inequalities for probability densities. For instance, the entropy power inequality can be deduced from the Brunn-Minkowski inequality and vice versa.
In fact, the two inequalities can be seen as consequences of generalizations of Young’s inequality (see, for example [13, 16, 59, 60, 63]). The main avenue of discovery seems to be through affine invariants and the various geometric relationships that give rise to these structures. For instance: affine surface area, which is a basic affine invariant, can be arrived at by considering the floating bodies [83]; the Cramer-Rao inequality
(also known as the information inequality) can be seen as a consequence of relations between the Legendre ellipsoid and a polar L2-projection body [63].
These objects are part of what is now known as the Lp-Brunn-Minkowski theory which plays a central role in modern convex geometry and has seen steady growth in recent years and many results (see, e.g., [25], [27], [29], [35] - [37], [50], [51], [54]
- [65], [68], [69], [71], [79], [82] - [85], [90], [91], [97] - [101], [109]) have spoken to the pervasive nature of the theory. Problems in many different areas can be seen as affine geometric problems. Probably the most famous example is the Busemann-
Petty problem which asks if the volume inequality of symmetric bodies follows from
6 inequality on all central sections of the bodies. Satisfactory results where not obtained until the introduction of intersection bodies, an object of study in the Lp-Brunn- Minkowski theory, by Lutwak in [58]. The problem has since been solved thanks to
[24, 26, 79, 107, 108].
Two important notions of Lp-Brunn-Minkowski theory are the Lp-affine surface area introduced for p > 0 by Lutwak in [59] and Lp-centroid bodies introduced by Lutwak and Zhang in [61]. Paouris and Werner [74] used these notions to show that the exponential of the relative entropy of the cone measures of a symmetric convex body and its polar equals a limit of normalized Lp-affine surface areas. In doing so, they introduce a new affine invariant ΩK . This quantity establishes another link between convex geometry and information theory and is the motivation for the present work.
We make yet another connection between convex geometry and information theory by considering a new class of bodies: the mean width bodies. We use these bodies to arrive at characterization of relative entropy of cone measures (an information theoretic notion). What is especially interesting is that mean width bodies need not be symmetric or even convex. The fact that there are no symmetry assumptions is in contrast to [74] where symmetry is needed. Another surprise is that the volume estimates involved are only first-order estimates where as previous work required second-order estimates. This suggests that the mean width bodies are more sensitive to boundary structure than the Lp-centroid bodies.
1.1 Notation
Most of the notation used in this work is standard and in general we follow the conventions set by Schneider [81]. We record here some of the notational conventions used.
7 n We work in R , which is equipped with a Euclidean structure h· , ·i. We denote n by k · k the corresponding Euclidean norm. B2 (x, r) is the Euclidean ball centered
n n at x with radius r. We write B2 = B2 (0, 1) for the Euclidean unit ball centered at 0 and Sn−1 for the unit sphere. Volume is denoted by | · | and is understood to be in the appropriate dimension. Throughout this chapter, we will assume that all bodies
n n in question belong to K0 the set of convex bodies contained in R with centroid at n the origin. The centroid of a (measurable) set K ⊂ R is defined by
R xdx c = K = (X) K |K| E where X is a random point distributed uniformly in K.
Figure 1.1: The Gauss map of an ellipse, K.
For a point x ∈ ∂K, the boundary of K, NK (x) is the outer unit normal to K at
n−1 x. The map NK : ∂K → S , x 7→ NK (x), is sometimes called the Gauss map. See
8 2 2 figure 1.1. We write K ∈ C+, if K has smooth (C ) boundary ∂K with everywhere
2 strictly positive Gaussian curvature κK . More precisely, K ∈ C+ if, for every x ∈ ∂K, the differential of the Gauss map at x defines a positive definite operator on the tangent space at x of K, the determinant of which defines the Gaussian curvature,
κK (x). Alternatively, by the implicit function theorem, we may view the boundary
2 of K, at least locally, as the graph of some function f. Then K ∈ C+ if the Hessian
2 of f is positive definite everywhere on ∂K. The importance of C+ bodies is that If
2 K is C+ then the Gauss map is invertible and we can define the curvature function, fK (u) as the reciprocal of the Gaussian curvature κK (x) at this point x ∈ ∂K that has u as outer normal.
We denote by µK the usual surface area measure on the boundary of K, ∂K. When necessary, we will use ω for the usual surface area measure on Sn−1 induced ω(A) by Lebesgue measure on n and σ its normalization: σ(A) = for all Borel R ω(Sn−1) measurable sets A ⊂ Sn−1.
n For ξ and x in R , H = H(x, ξ) is the hyperplane through x orthogonal to ξ. The two closed half spaces generated by H are given by
+ + n H = H (x, ξ) = {y ∈ R : hy, ξi ≥ hx, ξi}
− − n H = H (x, ξ) = {y ∈ R : hy, ξi ≤ hx, ξi}
n n−1 Let K be a convex body in R and let u ∈ S . The support function of K in the direction u ∈ Sn−1 is given by
hK (u) = max{hx, ui : x ∈ K} (1.1.1)
The support function measures the distance from the origin to the hyperplane which is orthogonal to u and intersects K only on the boundary of K. See figure 1.2 for example.
9 Figure 1.2: The support function of K.
1.2 Background
This section provides background to the mathematical ideas and objects that are relevant to this chapter. These ideas include affine-invariants, bodies generated from a given body, and the notion of entropy.
Much of the general theory of convex bodies concerns the shape of a convex body and to a lesser extent the size or position of the body. Because of this, a natural ques- tion to ask is what attributes of a body are unchanged by affine transformations? The simplest affine invariant, known even to the ancient Greeks, is volume. Considering
10 only this quantity already yields important results such as the isoperimetric inequal- ity, which can expressed for K ∈ Kn as
n−1 |∂K| |K| n n−1 ≥ n (1.2.1) |S | |B2 | with equality if and only if K is an ellipsoid. The Blaschke-Santal´o inequality is another important result concerning the volume of convex bodies. This is discussed in more detail in section 2.1. Volume inequalities are important; however, sharper and deeper results are often required. To this end, we must obtain affine invariants that more accurately distinguish between the shape or boundary structure of a body.
These affine invariants more accurately describe the shape of a body and provide answers for many important questions.
1.2.1 Affine Invariants
3 The notion of affine surface area, first introduced by Blaschke [9] in R , plays a 3 fundamental role in convex geometry. Blaschke showed that for convex bodies in R with C∞ boundary (infinitely many non-vanishing derivatives), affine surface can be computed by the formula
Z 1 as(K) = κK (x) n+1 dµ(x) . (1.2.2) ∂K
Here κK (x) is the Gaussian curvature and µ is the surface measure of K. Expressions
n to compute affine surface area for arbitrary convex bodies in R were given at the same time by Leichtweiss [53], Lutwak [58] and Sch¨uttand Werner [83]. In fact,
Sch¨uttand Werner showed in [83] that the formula (1.2.2) holds for arbitrary convex
n bodies in R , replacing Gaussian curvature by the generalized Gaussian curvature.
The affine surface area has since been generalized to the family of Lp-affine surface areas defined for all p, the case p = 1 being the classical affine surface area. The case of p > 1 was established in 1996 by Lutwak in [59] while proving the Lp-affine 11 isoperimetric inequality for this case (the Lp-affine isoperimetric inequality will be discussed below). Using a geometric formulation of Lp-affine surface area, Meyer and Werner ([68, 69]) established the case of −n < p < 1 along with a definition for the case p = −n. The remaining cases were settled by Sch¨uttand Werner in [85] using yet another geometric interpretation. For p 6= −n the Lp-affine surface area is given by p Z κ(x) n+p asp(K) = n(p−1) dµ(x) . (1.2.3) ∂K hx, N(x)i n+p For p = 0 this takes the form Z as0(K) = hx, N(x)i dµ(x) = n |K| ∂K and for p = ±∞ (and with sufficiently smooth K) it is given by Z κ(x) ◦ as±∞(K) = n dµ(x) = n |K | . ∂K hx, N(x)i
The Lp-affine surface area of the unit ball coincides with the regular surface area. That is, For all p 6= −n,
n n−1 n asp(B2 ) = S = n |B2 | .
The Lp-affine surface areas are affine invariant. That is, if det(T ) = 1 then asp(T (K)) = asp(K). This follows from the general fact that [46, 59, 85]
n−p asp(T (K)) = (det(T )) n+p asp(K) .
If P is a polytope then asp(P ) = 0 for p > 0 since the generalized Gauss curvature is almost everywhere 0 on a polytope.
As previously mentioned, Lutwak introduced the Lp-affine surface area and es- tablished the Lp-affine isoperimetric inequality for p > 1 in [59]. This version can be stated as follows. n−p n+p asp(K) |K| n ≤ n , asp(B2 ) |B2 | 12 with equality if and only if K is an ellipsoid. This form holds, in fact, for all p ≥ 0 as shown by Hug [46]. If p = 0, equality holds trivially. In [100], Werner and Ye proved
Lp-affine isoperimetric inequalities for all p < 1. If −n < p ≤ 0 then
n−p n+p asp(K) |K| n ≥ n asp(B2 ) |B2 |
2 Again, with equality if and only if K is an ellipsoid. If p < −n and K is C+ then
n−p n+p asp(K) np |K| n+p n ≥ c n . asp(B2 ) |B2 |
Here c is the constant from the reverse Santal´oinequality stated below in (2.1.9) (see
[81] or [71]). The best known value of c is due to Kuperberg [52] where it was shown that c > 1/2 if K is symmetric and c > 1/4 in the general case. The constants in
[52] are actually slightly better but we state them this way for the sake of simplicity.
Mahler conjectured [66] that there is equality in the reverse Santal´oinequality when
K is a simplex. See the discussion following equation (2.1.9) for more details.
Another affine invariant, ΩK , mentioned above, was defined by Paouris and Werner
n [74]. For K ∈ K0 , n+p asp(K) ΩK = lim . (1.2.4) p→∞ n|K◦|
The quantity ΩK is related to the relative entropy of the cone measures of the convex body K and its polar K◦. See [74] for a detailed discussion of cone measures and the connection ΩK makes to information theory.
1.2.2 Associated Bodies
There are many convex bodies that can be derived from other bodies. These bodies are often discovered through attempts to solve problems such as Shepherd’s problem and the Busemann-Petty problem (see Lutwak [58] for example) or to find geometric characterizations of affine invariants. The most relevant such body for the present 13 discussion is the convex floating body. The convex floating body gives a useful charac- |K| terization of affine surface area and was defined in [83] as follows: if 0 ≤ δ < , the 2 + convex floating body Kδ of K is the intersection of all halfspaces H whose defining hyperplanes H cut off a set of volume at most δ from K:
\ + Kδ = H . (1.2.5) |H−∩K|≤δ
The floating bodies Kδ give a ‘parameterization’ of K in the sense that K0 = K, and
Kδ ⊆ K when δ ≥ . The floating body operation is just one of a whole class of operations that can be used to examine the surface structures of a body. The as(K) describes the limiting behavior of the volume differences, |K| − |Kδ|. That is, if Kδ is the floating body of K, it was proved in [83] that
|K| − |Kδ| lim 2 = cnas(K) . δ→0 δ n+1
−2 n−1 ! n+1 1 B where c = 2 . This is a special case of the more general fact n 2 n + 1
|K| − |K| as(K) lim n n = n . →0 |B2 | − |(B2 )| as(B2 ) where K is a ‘parameterization’ of K that satisfies certain conditions. We refer to [98] for the details.
n For a convex body K in R of volume 1 and 1 ≤ p ≤ ∞, the Lp-centroid body
Zp(K) is the convex body that has support function [61]
Z 1/p p hZp(K)(θ) = |hx, θi| dx . (1.2.6) K
The floating body and the Lp-centroid body are just two examples of bodies connected to a given convex body. Others include the illumination body [97], the projection body
[63], the surface body [85], and the intersection body [58].
14 1.2.3 Entropy
An important notion in the present work is the notion of entropy. Entropy was introduced by Shannon [86] to provide a measure of ‘randomness’ of probability dis- tributions. Specifically to determine which probability distributions are best suited to propagate information. If X is a random variable on a measure space, (Ω, µ), with probability density function f, the classical entropy, or Shannon entropy [13] is given by Z H(X) = − f(x) log(f(x))dµ(x) = E(− log(f(X))) . Ω Notice that the values that X takes are irrelevant to the entropy. The entropy is really the expected information content of the probability density. Joint entropy is given by
Z H(X,Y ) = − f(z) log(f(z))dµ(z) = E(− log(f(Z))) Ωx×Ωy where Z = X × Y and f is the joint distribution.
The relative entropy or Kullback-Leibler divergence between two distributions is given by Z p(x) DKL(p||q) = p(x) log dµ(x) . (1.2.7) Ω q(x) The mutual information can then be expressed as
I(X,Y ) = D(p(x, y)||p(x)p(y)) .
See [13] for more details and discussion of the above quantities. The notion of en- tropy has been generalized in several different directions. Most notable are the Von
Neumann entropy and the p-Renyi entropy (See e.g. [72]).
15 1.3 Mean Width Bodies
n The mean width W (K) of a convex body K in R is defined as Z W (K) = 2 hK (u)dσ(u). (1.3.1) Sn−1 where hK is the support function of K defined above by (1.1.1) (see e.g. Schneider [81]). The factor of 2 appears because the width of K in the direction parallel to u (i.e., the distance between the two supporting hyperplanes perpendicular to u) is
n−1 actually max hx, ui−min hx, ui = hK (u)+hK (−u), and averaging this sum over S x∈K x∈K results in a factor of 2. Let M and K be convex bodies such that 0 is the centroid of
K and K ⊂ M. By switching to polar coordinates it is easy to see [30] that Z 2 −(n+1) W (M) − W (K) = n−1 kξk dξ. (1.3.2) |S | K◦\M ◦
◦ Let f : K → R be a positive, integrable function. We generalize (1.3.2) to 2 Z Wf (M) − Wf (K) = n−1 f(ξ)dξ. (1.3.3) |S | K◦\M ◦ −2 Z Here we can define Wf (K) = n−1 f(ξ)dξ, but many important properties arise |S | K◦ only when we consider differences of such quantities. Also, defining Wf (K) separately costs some generality because since the difference of these quantities is given by an integration over K◦ \ M ◦, we only need to consider functions integrable on bounded sets separated from 0. For example, the generalization suggests that if f = k·k−(n+1) Z 2 −(n+1) then Wf (K) = W (K), but the integral n−1 kξk dξ is divergent. |S | K◦ In order to define the mean width bodies we consider a specific M. Namely, for
n x ∈ R , let Kx = [x, K] be the convex hull of x and K. For x ∈ K, Kx = K. Therefore, we will consider only x∈ / K. Let t ≥ 0 following Glasauer and Gruber
[30], we define the following convex bodies:
n K[t] = {x ∈ R : w(x) ≤ t} (1.3.4) 16 where Z 2 −(n+1) w(x) = W (Kx) − W (K) = n−1 kξk dξ. (1.3.5) |S | ◦ ◦ K \Kx The bodies K[t] in equation (1.3.4) have been used by several authors (e.g. by
B¨or¨oczkyand Schneider [10] and Glasauer and Gruber [30]) in connection with ap- proximation of convex bodies by polytopes.
We shall now provide a lemma concerning (1.3.3) for certain classes of functions;
n−1 the α-homogeneous functions. Let α ∈ R, α 6= 0. Let f : S → R be a positive function. f is said to be α-homogeneous, or homogeneous of degree α, if for all r ≥ 0,
f(ru) = rαf(u).
n Lemma 1.3.1. Let K and M be convex bodies in R such that 0 is the centroid of K n−1 and K ⊂ M. Let f : S → R be a positive, integrable function that is homogeneous of degree α.
(i) Let α 6= −n. Then
2 Z 1 1 Wf (M) − Wf (K) = f(u) α+n − α+n dσ(u). (α + n) Sn−1 hK (u) hM (u)
(ii) Let α = −n. Then Z hM (u) Wf (M) − Wf (K) = 2 f(u) log dσ(u). Sn−1 hK (u)
Proof. We convert to polar coordinates and use α-homogeneity to obtain
2 Z Wf (M) − Wf (K) = n−1 f(ξ)dξ |S | K◦\M ◦ 1 Z Z h (u) 2 K n−1 = n−1 f(ru)r drdω(u) |S | Sn−1 1 hM (u) 1 Z Z h (u) 2 K n+α−1 = n−1 f(u)r drdω(u) |S | Sn−1 1 hM (u) Integration then yields (i) and (ii). 17 After finding Lemma 1.3.1, we realize that the difference in width (1.3.3) of bodies may be described in terms of relative entropy (1.2.7). In light of this, we can ask for probability distributions with relative entropy that can be described by the width differences of convex bodies. 1 1 If we let f(u) = n (or f(u) = n ) in Lemma 1.3.1 (ii), then f(ru) = hK (u) hM (u) −n r −n n = r f(u). Thus this f is homogeneous of degree −n. hK (u) n−1 n Let now (X, µ) = (S , ω) and for convex bodies K and M in R put 1 1 pK = ◦ n , pM = ◦ n . (1.3.6) n|K |hK n|M |hM
n−1 Then dPK = pK dω and dPM = pM dω are probability measures on S and Lemma 1.3.1 (ii) becomes
Z n 2 ◦ 1 hM W 1 (M) − W 1 (K) = |K | log dσ hn hn ◦ n n K K n Sn−1 |K |hK hK ◦ Z ◦ 2|K | pK |K | = n−1 pK log + log ◦ dω |S | Sn−1 pM |M | 2|K◦| |K◦| = D (P kP ) + log . |Sn−1| KL K M |M ◦|
For the probability distributions described by (1.3.6) we have the following corol- lary.
n Corollary 1.3.2. Let K and M be convex bodies in R such that K ⊂ M and let pK and pM be the probability densities given in (1.3.6). Then Z 1 dξ |K◦| n ◦ = DKL(PK kPM ) + log ◦ K◦\M ◦ hK (ξ) |K | |M |
We now want to apply the above considerations for a specific M. Namely, M =
◦ Kx. We generalize them as follows. Let f : K → R be a positive, integrable function.
As above, with Kx instead of M, we put 2 Z wf (x) = Wf (Kx) − Wf (K) = n−1 f(ξ)dξ (1.3.7) |S | ◦ ◦ K \Kx 18 and generalize (1.3.4) to
n Kf [t] = {x ∈ R : wf (x) ≤ t}. (1.3.8)
−β Thus, for instance, for β ∈ R and fβ(ξ) = kξk we get
2 Z K [t] = x ∈ n : kξk−βdx ≤ t , (1.3.9) fβ R n−1 |S | ◦ ◦ K \Kx which, in the particular case β = n + 1, gives the bodies (1.3.4) above. x As K = [x, K], K◦ = K◦ ∩{y ∈ n : hy, xi ≤ 1}. Thus, putting H+ , x = x x R kxk2 x {y ∈ n : hy, xi ≤ 1}, K◦ is obtained from K◦ by cutting off a cap K◦∩H− , x R x kxk2 of K◦: x K◦ = K◦ ∩ H+ , x . x kxk2 and x K◦ \ K◦ = K◦ ∩ H− , x . x kxk2 Therefore ( ) 2 Z K [t] = x ∈ n : f(ξ)dξ ≤ t . (1.3.10) f R n−1 |S | K◦∩H− x ,x kxk2
This is the definition of the mean width bodies from [48].
Remarks 1: Properties of Kf [t]
(i) It is clear that for all f and for all t ≥ 0, K ⊂ Kf [t] and that Kfβ [0] = K for all
β. However, it can happen that K is a proper subset of Kf [0]. To see this, let K =
n n ◦ n n B∞ = {(x1, . . . , xn) ∈ R : max1≤i≤n|xi| ≤ 1}. Then K = B1 = {(x1, . . . , xn) ∈ R : n X n |xi| ≤ 1}. Define f : B1 → R,(x1, . . . , xn) → f((x1, . . . , xn)) by i=1 0, xn ≥ 0 f(x) = 1, otherwise. 19 3 3 Then (0,..., 0, ) ∈ K [0] but (0,..., 0, ) ∈/ K. 2 f 2
2 2 (ii) Kf [t] need neither be bounded nor convex. Indeed, let K = B∞. Define f : B1 →
R,(x1, x2) → f((x1, x2)) by 1 , x2 ≥ 0 f(x) = 2 1, otherwise. 1 3 1 If t ≥ , K [t] = 2. If ≤ t < , {(x , x ) ∈ 2 : x ≥ 0} ⊂ K [t]. If π f R 4π π 1 2 R 2 f 1 3 ≤ t < , {(0, x ) ∈ 2 : x ≥ 0} ⊂ K [t]. Thus K [t] is unbounded in those 2π 4π 2 R 2 f f 1 cases. If t < , then K [t] is bounded. 2π f 3 Moreover, with the same K and f: {(x , x ) ∈ 2 : x ≥ 0} ⊂ K [ ] and 1 2 R 2 f 4π 1 3 1 −1 0, − √ ∈ Kf [ ]. Let x0 = √ , √ . Then wf (x0) = 1 − 3/2 4π 1 − 3/2 1 − 3/2 √ √ 3 3 3 1 − 3/16 > . Therefore, K [ ] is not convex. 4π f 4π
(iii) Formulas (1.3.7) and (1.3.10) show that to define Kf [t], we cut off sets of “weighted volume” t of K◦. This is a similar procedure to that of the floating body
n defined in (1.2.5). If M ∈ K , we obtain Mδ from M, by cutting off sets of volume δ from M. A reasonable concern is that if the “weighted volume” is just regular
◦ volume, that is f(x) = 1, then is Kf [t] = (K )δ for some t and δ? For β = 0, we get in formula (1.3.10), 2 Z K [t] = {x ∈ n : dξ ≤ t} f0 R n−1 |S | K◦∩H− x , x kxk2 kxk n−1 n ◦ − x x tω(S ) = x ∈ R : K ∩ H , ≤ kxk2 kxk 2
◦ However, Kf0 [t] is not a convex floating body of K .
n n Indeed, it is easy to see that for the Euclidean ball B = rB2 in R with radius r,
Bf0 [t], for small t, is a Euclidean ball with radius of order
2n 2 r 1 + knr n+1 t n+1 , 20 2 n n+1 1 n(n + 1)|B2 | ◦ where kn = n−1 .(B )δ, for small δ, is a ball with radius of order 2 2|B2 |
1 2n 2 1 − c r n+1 δ n+1 , r n
2 1 n + 1 n+1 where cn = n−1 (see e.g. [83]) and Bδ, for small δ, is a ball with radius 2 |B2 | of order cn 2 n+1 r 1 − 2n δ , r n+1 (see also e.g. [83]).
δ Also, Kf0 [t] resembles the illumination body K which, for δ ≥ 0, is defined as follows [97]:
δ n K = {x ∈ R : |[x, K] \ K| ≤ δ}.
The resemblance is in the fact that the set [x, K]\K is looks similar to the domain of integration in (1.3.10). So if f(x) = 1 it is reasonable to be concerned that we have arrived at an illumination body. In fact, Kf [t] is not a illumination body. Again, this
n n δ can be seen by considering the Euclidean ball rB2 .(rB2 ) , for small δ, is a Euclidean ball with radius of order dn 2 n+1 r 1 + 2n δ , r n+1 2 1 n(n + 1) n+1 where dn = n−1 [97]. 2 |B2 |
We have seen that Kf [t] need not be convex. But it is always star-convex.
n Lemma 1.3.3. Let K be a convex body in R such that 0 is the centroid of K. Let ◦ f : K → R be a positive, integrable function.
(i) Kf [t] is star convex i.e. [0, x] ⊂ Kf [t] for all x ∈ Kf [t]. \ (ii) Kf [t] = Kf [t + s]. s>0 21 Proof. (i) Let x ∈ Kf [t] and let y ∈ [0, x]. Then Ky = [y, K] ⊂ [x, K] = Kx and
◦ ◦ ◦ ◦ ◦ consequently K \ Ky ⊂ K \ Kx. As f ≥ 0 on K , we therefore get
2 Z 2 Z n−1 f(ξ)dξ ≤ n−1 f(ξ)dξ ≤ t |S | ◦ ◦ |S | ◦ ◦ K \Ky K \Kx and thus y ∈ Kf [t]. \ (ii) For all s > 0, Kf [t] ⊂ Kf [t + s]. Therefore, we only need to show that Kf [t + s>0 \ s] ⊂ Kf [t]. Let thus x ∈ Kf [t + s]. Then for all s > 0, wf (x) ≤ t + s. Letting s>0 s → 0, we get wf (x) ≤ t.
Additional conditions on f ensure convexity of Kf [t]. This is shown in the next lemma whose proof is the same as the corresponding one in [10].
n Lemma 1.3.4. Let K be a convex body in R such that 0 is the centroid of K. Let n−1 f : S → R be a positive, integrable function that is homogeneous of degree α. If
α ≤ −(n + 1), then Kf [t] is convex.
Proof. Let x and y be in Kf [t] and let 0 < λ < 1. For t ∈ R, t ≥ 0, the function γ g(t) = t is convex if γ ≥ 1. Therefore, and as K(1−λ)x+λy ⊆ (1 − λ)Kx + λKy, we get for α ≤ −(n + 1)
−(α+n) −(α+n) −(α+n) −(α+n) hK (1 − λ) h + λ h (1 − λ) h + λ h (1−λ)x+λy ≤ Kx Ky ≤ Kx Ky . −(α + n) −(α + n) −(α + n)
22 Hence for α ≤ −(n + 1), 2 Z f(u)h−(α+n) (u)dσ(u) K(1−λ)x+λy −(α + n) Sn−1 2 Z ≤ (1 − λ) f(u)h−(α+n)(u)dσ(u) Kx −(α + n) Sn−1 Z +λ f(u)h−(α+n)(u)dσ(u) Ky Sn−1 Z 2 −(α+n) ≤ (1 − λ) f(u)hK (u)dσ(u) + t −(α + n) Sn−1 Z 2 −(α+n) +λ f(u)hK (u)dσ(u) + t −(α + n) Sn−1 Z 2 −(α+n) = f(u)hK (u)dσ + t. −(α + n) Sn−1
Remark. If α > −(n + 1), then Kf [t] need not be convex. An example is the cube
2 in R and the f given in Remark 1 (ii).
Now we give conditions that guarantee that Kf [t] is bounded and give a desirable boundary condition for Kf [t].
n Lemma 1.3.5. Let K be a convex body in R such that 0 is the centroid of K. Let ◦ f : K → R be a strictly positive, integrable function. Then
(i) Kf [0] = K.
(ii) There exists t0 such that for all t ≤ t0, Kf [t] is bounded.
(iii) Let t ≤ t0, where t0 is as in (ii). Then we have for all x ∈ ∂Kf [t] that wf (x) = t.
Proof.
(i) We only have to show that Kf [0] ⊂ K. Let x ∈ Kf [0]. Then 2 Z wf (x) = n−1 f(ξ)dξ = 0 . |S | ◦ ◦ K \Kx ◦ ◦ ◦ ◦ ◦ As f > 0 on K , this can only happen if m(K \ Kx) = 0. As Kx ⊂ K is closed and
◦ ◦ convex, this can only happen if Kx = K , or, equivalently, Kx = K, or x ∈ K. 23 (ii) This follows immediately from (i), Lemma 1.3.3 (ii) and the fact that, as K is a convex body, there exists α > 0 such that
1 Bn(0, α) ⊂ K ⊂ Bn 0, . (1.3.11) 2 2 α \ As K = Kf [0] = Kf [t], there exists t0 such that for all t ≤ t0, Kf [t] ⊂ 2K ⊂ t>0 2 Bn 0, . 2 α
(iii) Let t ≤ t0 and let x ∈ ∂Kf [t]. Suppose wf (x) < t. Let y ∈ {ax : a ≥ 1}. Z ◦ ◦ Then Kx = [x, K] ⊂ Ky = [y, K], hence Ky ⊂ Kx and therefore f(ξ)dξ ≥ K◦\K◦ Z y f(ξ)dξ. As f > 0 on K◦, we can choose y = ax with a > 1 such that ◦ ◦ K \Kx 2 Z n−1 f(ξ)dξ = t. This implies that x∈ / ∂Kf [t], a contradiction. |S | ◦ ◦ K \Ky
1.4 Relative Entropies of Cone Measures and Affine Surface
Areas
In this section we present new geometric interpretations of important affine invari- ants mentioned in the introduction, namely the Lp-affine surface areas. Many such geometric interpretations have been given (see e.g. [69, 84, 85, 99, 100, 101]). The remarkable fact here is that these geometric interpretations of affine invariants for convex bodies are expressed in terms of not necessarily convex bodies, a phenomenon which already occurred in [101] by Werner and Ye in regards to mixed p-affine surface areas.
We also give new geometric interpretations for the relative entropies of cone mea- sures of convex bodies. Geometric interpretations for those quantities were given
first in [74] in terms of Lp-centroid bodies defined by (1.2.6). However, in the con- text of the Lp-centroid bodies, the relative entropies appeared only after performing
24 a second-order expansion of certain expressions. Now, using the mean width bod- ies, already a first-order expansion makes them appear. Thus, these bodies detect
“faster” more detail of the boundary of a convex body than the Lp-centroid bodies.
n 2 Theorem 1.4.1. Let K be a convex body in R that is in C+ and such that 0 is the ◦ centroid of K. Let f : K → R be a continuous function such that f(y) ≥ c for all y ∈ K◦ and some constant c > 0. Then
Z 2 |Kf [t]| − |K| hx, NK (x)i dµK (x) lim 2 = 1 . t→0 kn t n+1 ∂K f(y(x))κK (x) n+1
2 n n+1 1 n(n + 1)|B2 | ◦ kn = n−1 and y(x) ∈ ∂K is such that hy(x), xi = 1. 2 2|B2 |
Remark. u We put NK (x) = u. Then hx, NK (x)i = hK (u) and y(x) = . As dµK = fK dω, hK (u) we therefore also have
|K [t]| − |K| Z h (u)2dω(u) lim f = K . (1.4.1) 2 n+2 t→0 n+1 n−1 u kn t S fK (u) n+1 f hK (u)
Theorem 1.4.1 leads to the announced new geometric interpretations of the above mentioned quantities which we introduce now. Recall Lp-affine surface area given above in equation (1.2.3);
p Z n+p κK (x) asp(K) = n(p−1) dµK (x) ∂K hx, NK (x)i n+p for real p 6= −n. Then we have
n 2 Corollary 1.4.2. Let K be a convex body in R that is in C+ and such that 0 is the centroid of K.
25 ◦ (i) For p ∈ R, p 6= −n, let pas : ∂K → R be defined by
n+p(n+2) ! n+p hx, NK (x)i pas(y) = 1 , κK (x) n+1 where, for y ∈ ∂K◦, x = x(y) ∈ ∂K is such that hx, yi = 1 Then
p Z n+p |Kpas [t]| − |K| κK (x) dµK (x) lim 2 = n(p−1) = asp(K). t→0 n+1 kn t ∂K hx, NK (x)i n+p
◦ (ii) For β ∈ R, let fβ : K → R be defined by 1 f (y) = = hx, N (x)iβ, β kykβ K where, again, for y ∈ ∂K◦, x = x(y) ∈ ∂K is such that hx, yi = 1 Then Z |Kfβ [t]| − |K| dµK (x) lim 2 = 1 t→0 β−2 kn t n+1 ∂K κK (x) n+1 hx, NK (x)i
2 Proof. As ∂K is in C+, the functions pas and fβ satisfy the conditions of Theorem 1.4.1. The proof of the corollary then follows immediately from Theorem 1.4.1.
Remarks n (i) For β = 0, we get in Corollary 1.4.2 (ii) the p = − L -affine surface area of n + 2 p K.
−(n−1) (ii) As κK (rx) = r κK (x), it makes most sense to put fK (ru) = frK (u) =
n−1 r fK (u) and define n − 1 to be the degree of homogeneity of the function fK . Then 2n(n + p(n + 2)) p is homogeneous of degree and f is homogeneous of degree β. as (n + 1)(n + p) β (n + 1)2 + 1 Thus, by Lemma 1.3.4, K [t] is convex if −n < p ≤ −n and K [t] pas (n + 1)2 + n + 2 fβ is convex if β ≤ −(n + 1).
n 2 Let K a convex body in R that is C+. Let
κK (x) hx, NK (x)i pK (x) = n ◦ , qK (x) = . (1.4.2) hx, NK (x)i n|K | n |K| 26 Then
PK = pK µK and QK = qK µK (1.4.3) are probability measures on ∂K that are absolutely continuous with respect to µK .
Recall now that the normalized cone measure cmK on ∂K is defined as follows: For every measurable set A ⊆ ∂K
1 cm (A) = |{ta : a ∈ A, t ∈ [0, 1]}|. (1.4.4) K |K|
The next proposition is well known. See e.g. [74] for a proof. It shows that the
◦ measures PK and QK defined in (1.4.3) are the cone measures of K and K.
n 2 Proposition 1.4.3. Let K a convex body in R that is C+. Let PK and QK be the probability measures on ∂K defined by (1.4.3). Then
−1 PK = NK NK◦ cmK◦ and QK = cmK , or, equivalently, for every measurable subset A in ∂K
−1 PK (A) = cmK◦ NK◦ NK (A) and QK (A) = cmK (A).
In the next two corollaries we also use the following notations. For a convex body
n K in R and x ∈ ∂K, let ri(x), 1 ≤ i ≤ n − 1 be the principal radii of curvature. We put
r = infx∈∂K min ri(x) and R = sup max ri(x). (1.4.5) 1≤i≤n−1 x∈∂K 1≤i≤n−1 n 2 Note that if K be a convex body in R that is in C+, then 0 < r ≤ R < ∞. Note also that r = R iff K is a Euclidean ball with radius r.
n 2 Corollary 1.4.4. Let K be a convex body in R that is in C+ and such that 0 is the centroid of K. Let r, R be as in (1.4.5). 27 ◦ (i) Let ent1 : ∂K → R be defined by
− n+2 κ (x) n+1 hx, N (x)in+1 ent (y) = K K , 1 2n R |K| κK (x) log 2n ◦ n+1 r |K | hx,NK (x)i where, again, for y ∈ ∂K◦, x = x(y) ∈ ∂K is such that hx, yi = 1 Then
Z 2n |Kent1 [t]| − |K| κK (x) R |K|κK (x) lim 2 = log dµK (x) t→0 n 2n ◦ n+1 kn t n+1 ∂K hx, NK (x)i r |K |hx, NK (x)i R = n|K◦| [D (P kQ ) + 2n log KL K K r ◦ −1 R = n|K | D N N ◦ cm ◦ kcm + 2n log . KL K K K K r
◦ (ii) Let ent2 : ∂K → R be defined by
− 1 κ (x) n+1 ent (y) = K , 2 2n R |K|κK (x) log 2n ◦ n+1 r |K |hx,NK (x)i where, again, for y ∈ ∂K◦, x = x(y) ∈ ∂K is such that hx, yi = 1 Then
Z 2n ◦ n+1 |Kent2 [t]| − |K| r |K |hx, NK (x)i lim 2 = − hx, NK (x)i log dµK (x) t→0 2n kn t n+1 ∂K R |K|κK (x) R = −n|K| D (Q ||P ) − 2n log KL K K r −1 R = −n|K| D cm kN N ◦ cm ◦ − 2n log . KL K K K K r
2 Proof. As ∂K is in C+, 0 < r ≤ R < ∞ and we have for all x ∈ ∂K that
n n B2 (x − rNK (x), r) ⊂ K ⊂ B2 (x − RNK (x),R).
Suppose first that r = R. Then K is a Euclidean ball with radius r and the right hand sides of the identities in the corollary are equal to 0. Moreover, in this case,
ent1 and ent2 are identically equal to ∞. Therefore, for all t ≥ 0, Kent1 [t] = K and
Kent2 [t] = K and hence for all t ≥ 0, |Kent1 [t]| − |K| = 0 and |Kent2 [t]| − |K| = 0. Therefore, the corollary holds trivially in this case. 28 Suppose now that r < R. Then, as
2n 4n R |K| κK (x) R 1 ≤ 2n ◦ n+1 ≤ . r |K | hx, NK (x)i r we get for all x ∈ ∂K that
n−1 ! 2 |K◦|rn−1 fPQ(x) ≥ R > 0. 2 log r
Thus the functions ent1 and ent2 satisfy the conditions of Theorem 1.4.1. The proof of the corollary then follows immediately from Theorem 1.4.1.
In [74], the following new affine invariant ΩK was introduced and its relation to the relative entropies was established.
n Let K a convex body in R with centroid at the origin. n+p asp(K) ΩK = lim . p→∞ n|K◦|
Let pK and qK be the densities defined in (1.4.2). It was proved in [74] that for a
n 2 convex body K in R that is C+. |K| − 1 D (P kQ ) = log Ω n (1.4.6) KL K K |K◦| K and
◦ 1 |K | − n D (Q kP ) = log Ω ◦ . (1.4.7) KL K K |K| K
In [74], geometric interpretations in terms of Lp-centroid bodies were given in the case of symmetric convex bodies for the new affine invariants ΩK . These interpre- tations are in the spirit of Corollary 1.4.2: As p → ∞, the quantities ΩK and the related relative entropies appear in appropriately chosen volume differences of K and its Lp-centroid bodies (1.2.6). However, in the context of the Lp-centroid bodies, a second-order expansion was needed for the volume differences in order to make these 29 terms appear. Now, it follows from Corollary 1.4.4 (i) and (ii) and Corollary 1.4.5 that no symmetry assumptions are needed and that already a first-order expansion gives such geometric interpretations, if one uses the mean width bodies instead of the
Lp-centroid body.
n 2 Corollary 1.4.5. Let K be a convex body in R that is in C+ and such that 0 is the centroid of K. Let the functions ent1 and ent2 be as in Corollary 1.4.2. Then
1 |Kent1 [t]| − |K| 2 ◦ R ◦ |K| − n lim 2 − 2n |K | log = n|K | log Ω . t→0 ◦ K kn t n+1 r |K | and
1 |Kent2 [t]| − |K| 2 R |K| n lim 2 − 2n |K| log = n|K| log Ω ◦ . t→0 ◦ K kn t n+1 r |K |
1.5 Proof of Theorem 1.4.1
To prove Theorem 1.4.1, we need the following lemmas. The first one, Lemma 1.5.1, is well known.
n Lemma 1.5.1. Let En(x0, a) be an ellipsoid in R centered at x0 and with axes parallel to the coordinate axes and of lengths a1, . . . , an. Let 0 < ∆ < an. Let
C(En, ∆) = En ∩ H(x0 + (an − ∆)en, en)
be a cap of En(x0, a) of height ∆. Then
n−1 n+1 ∆ 2 n−1 2 2 1 − |B | n−1 2an 2 Y ai n+1 √ ∆ 2 ≤ |C(E , ∆)| n + 1 a n i=1 n n+1 n−1 n−1 2 2 |B2 | Y ai n+1 ≤ √ ∆ 2 n + 1 a i=1 n
30 In the next few lemmas and throughout the remainder of this section we will use the following notation.
n ◦ Let K be a convex body in R . Let f : K → R be an integrable function and for t ≥ 0, let Kf [t] be a mean width body of K. For x ∈ ∂K, let
xt = {γx : γ ≥ 0} ∩ ∂Kf [t]. (1.5.1)
Let y(x) ∈ ∂K◦ be such that hy(x), xi = 1. Let m be the Lebesgue measure on 2f n and let m be the measure (on K◦) defined by m = m, i.e. for all A ⊂ K◦ R f f |Sn−1|
2 Z mf (A) = n−1 f(ξ)dξ. (1.5.2) |S | A
n 2 Lemma 1.5.2. Let K be a convex body in R that is in C+ and such that 0 is the ◦ centroid of K. Let f : K → R be an integrable function such that f(y) ≥ c for all ◦ y ∈ K and some constant c > 0. Let xt be as in (1.5.1). Then the functions 1 kxtk 2 − 1 t n+1 kxk are uniformly (in t) bounded by an integrable function.
Proof. We can assume that t ≤ t0 where t0 is given by Lemma 1.3.5. Then Kf [t] is bounded and hence
n Kf [t] ⊂ B2 (0, a) (1.5.3) for some a > 0. As f ≥ c on K◦, we get with (1.3.10)
2 Z t ≥ f(ξ)dξ n−1 − |S | ◦ xt x K ∩H 2 , kxtk kxk 2c ◦ − xt x ≥ n−1 K ∩ H 2 , . |S | kxtk kxk
31 2 ◦ 2 As K is in C+, K is in C+. Thus, by the Blaschke rolling theorem (see [81]), there
◦ n ◦ exists r0 > 0 such that for all y ∈ ∂K , B2 (y − r0NK◦ (y), r0) ⊂ K . Let now
◦ x y(x) ∈ ∂K be such that hx, y(x)i = 1. Then N ◦ (y(x)) = and thus K kxk 2c n x − xt x t ≥ n−1 B2 y(x) − r0 , r0 ∩ H 2 , |S | kxk kxtk kxk n+3 n−1 n+1 2 n−1 2 2 2 c r0 B2 1 1 ≥ n−1 − , (n + 1) |S | kxk kxtk n x − xt x where we have used that B2 y(x) − r0 , r0 ∩ H 2 , is the volume kxk kxtk kxk 1 1 kxt − xk n x of a cap of height − = of the ball B2 y(x) − r0 , r0 which kxk kxtk kxtkkxk kxk we have estimated from below using Lemma 1.5.1. We assume also that t is so small 1 1 that − < r0. kxk kxtk kx k kx − xk As x and x are collinear, t − 1 = t and hence t kxk kxk
2 − n−1 n−1 ! n+1 n+1 1 kxtk 1 kxt − xk (n + 1) |S | r0 2 − 1 = 2 ≤ n−1 n+3 kxtk t n+1 kxk t n+1 kxk c B2 2 n+1 2 − n−1 n−1 ! n+1 n+1 (n + 1) |S | r0 ≤ n−1 n+3 a. (1.5.4) c B2 2 n+1
In the last inequality we have used (1.5.3). The expression (1.5.4) is a constant and thus integrable.
n 2 Lemma 1.5.3. Let K be a convex body in R that is in C+ and such that 0 is the ◦ centroid of K. Let f : K → R be a continuous, positive function. Then for all x ∈ ∂K one has
n 2 hx, NK (x)i kxtk hx, NK (x)i lim 2 − 1 = 1 2 , t→0 n kn t n+1 kxk κK (x) n+1 f(y(x)) n+1
2 n n+1 1 n(n + 1)|B2 | ◦ where kn = n−1 and y(x) ∈ ∂K is such that hx, y(x)i = 1. 2 2|B2 | 32 Proof. Let x ∈ ∂K. Let xt be as in (1.5.1). As x and xt are collinear and as (1 + s)n ≥ 1 + ns for s ∈ [0, 1), one has for small enough t, hx, N (x)i kx kn hx, N (x)i kx − xkn K t − 1 = K 1 + t − 1 ≥ ∆(x, t), n kxk n kxk x where ∆(x, t) = ,N (x) kx − xk = hx − x, N (x)i. kxk K t t K Similarly, as (1 + s)n ≤ 1 + ns + 2ns2 for s ∈ [0, 1), one has for t small enough, hx, N (x)i kx kn 2n kx − xk K t − 1 ≤ ∆(x, t) 1 + t . (1.5.5) n kxk n kxk
Hence for ε > 0 there exists tε ≤ t0, t0 from Lemma 1.3.5, such that for all 0 < t ≤ tε n h kxtk i hx, NK (x)i kxk − 1 1 ≤ ≤ 1 + ε. n ∆(x, t)
◦ ◦ By Lemma 1.3.5 (iii) and (1.5.2), mf (K \ Kxt ) = t and thus h n i 2 hx, N (x)i kxtk − 1 m (K◦ \ K◦ ) n+1 K kxk f xt 1 ≤ 2 ≤ 1 + ε. n ∆(x, t) t n+1 N (x) Let now y = y(x) ∈ ∂K◦ be such that hx, yi = 1. Thus y = K and hx, NK (x)i x ◦ N ◦ (y) = . As f is continuous on K , there exists δ > 0 such that for all K kxk n z ∈ B2 (y, δ), f(y) − ε < f(z) < f(y) + ε.
◦ ◦ n We choose t so small that K \ Kxt ⊂ B2 (y, δ). Then
2 (f(y(x)) − ε) ◦ ◦ ◦ ◦ K \ K ≤ mf K \ K |Sn−1| xt xt 2 Z = n−1 f(ξ)dξ |S | K◦\K◦ xt
2 (f(y(x)) + ε) ◦ ◦ ≤ K \ K |Sn−1| xt and we get with (new) absolute constants c1 and c2 that
2 h n i n+1 hx, N (x)i kxtk − 1 2f(y(x)) K◦ \ K◦ K kxk |Sn−1| xt 1 − c1ε ≤ 2 n ∆(x, t) t n+1
≤ 1 + c2ε. (1.5.6) 33 ◦ 2 As K and hence K is in C+, κK◦ (y) > 0. It is well known (see [84]) that then there exists an ellipsoid E = E(y − anNK◦ (y), a) centered at y − anNK◦ (y) and with
◦ half axes of lengths a1 . . . an which approximates ∂K in a neighborhood of y. For the computations that follow, we can assume without loss of generality that NK◦ (y) = en and that the other axes of E coincide with e1 . . . , en−1. Thus (see [84]), for ε > 0 given, there exists ∆ε such that for all ∆ ≤ ∆ε