Convexity, Smoothness, Duality, and Stability
Yao-Liang Yu [email protected] Machine Learning Department Carnegie Mellon University
December 14, 2015
This note is about the interplay between convexity, smoothness, and Stability, through duality. The note is still largely under construction and will update from time to time. Contents
1 Topological background2 3 Uniformly convex and uniformly smooth functions 23 2 Convex Functions 10 1 Topological background
1 Topological background
In this section we collect some useful topological results. A significant portion is devoted to uniform spaces as the writer starts to appreciate this notion hence wishes to learn more about them. Theorem 1.1: Many useful spaces are not pseudo-metrizable
ω The space {0, 1} is metrizable iff |ω| ≤ ℵ0. ω Proof: The only if part is easy. For the if part, note that {0, 1} is not first countable if |ω| > ℵ0. Q Now take some (pseudo) metric spaces {Xγ : γ ∈ Γ} and consider its product γ Xγ . If |Γ| > ℵ0 (and each space Xγ contains two topologically distinct points), then the product space is not (pseudo) metrizable. As we will see, many important topological spaces can be treated as a subspace of a product of metrizable spaces (e.g. [0, 1]J for some index set J). Theorem 1.1 suggests that these spaces may not be metrizable but nevertheless they still enjoy enough “metric” structure. So we want to study and characterize these spaces. Definition 1.2: Topology
X A topology on a set X is a collection of sets (nhoods) {Ux ⊆ 2 : x ∈ X} such that:
(I). U ∈ Ux =⇒ x ∈ U ;
(II). If U ∈ Ux, then there exists some V ∈ Ux such that U ∈ Uy for all y ∈ V ;
(III). U , V ∈ Ux =⇒ U ∩ V ∈ Ux;
(IV). U ∈ Ux, U ⊆ V =⇒ V ∈ Ux.
A set U is called open iff U ∈ Ux for all x ∈ U . Trivially ∅ and X are open. A set is closed iff its complement is open. X The collection {Ux ⊆ 2 : x ∈ X} is called an nhood basis iff
(I). U ∈ Ux =⇒ x ∈ U ;
(II). If U ∈ Ux, then there exists some V ∈ Ux such that for all y ∈ V there exists some W ∈ Uy, W ⊆ U ;
(III). U , V ∈ Ux =⇒ there exists some W ⊆ U ∩ V , W ∈ Ux. Enlarging the basis by including all supersets we recover the nhood. Removing the last condition we get a subbasis of the topology; we can recover the basis by taking all finite intersections.
Theorem 1.3: The closure operator, Kurotowski
Theorem 1.4: Convergence class
Definition 1.5: Uniformity
A uniformity on a product set X × X is a collection of sets {D ⊆ X × X} such that (I). D ∈ D =⇒ D ⊇ ∆ := {(x, x): x ∈ X};
(II). D ∈ D =⇒ D−1 ∈ D; (III). If D ∈ D, then there exists some E ∈ D, E ◦ E ⊆ D;
December 14, 2015 revision: 1 main 2 1 Topological background
(IV). D, E ∈ D =⇒ D ∩ E ∈ D; (V). D ∈ D, D ⊆ E =⇒ E ∈ D, where D−1 := {(y, x):(x, y) ∈ D} and D ◦ E = {(x, y):(x, z) ∈ E, (z, y) ∈ D for some z ∈ X}. Uniformity is introduced to measure the “distance” between two points: There is a clear analogy between (I), (II), (III) and the definition of distance. (IV) and (V) are needed to extract a topology from a uniformity. If we omit (V) (and weaken (IV) a bit) we obtain a basis, while if we omit both (IV) and (V) we obtain a subbasis. Note that ∆ ◦ D = D ◦ ∆ = D. Using (I): E ◦ E ⊆ D =⇒ E ⊆ D. Thus, if (III) is satisfied, then we can strengthen it: for all n ∈ N, there exists some E ∈ D such that E ◦ · · · ◦ E ⊆ D, where w.l.o.g. | {z } n we can assume E is symmetric, i.e. E = E−1. Note also that usually ∆ 6∈ D. A topological space that admits a compatible (see Definition 1.7 below) uniformity will be called uniformizable. The notion uniformity was first introduced in Weil[1937], who allegedly also invented the empty set symbol ∅ (from Norwegian alphabet).
Alert 1.6: Intersection / union of uniformities
Unlike topology, intersection of uniformities need not be a (subbasis of) uniformity: For any x ∈ [0, 1] let Dx be the collection of supersets of ∆ ∪ {1, x} ∪ {x, 1}. Clearly Dx is a uniformity but for x 6= y, Dx∩Dy is not even a subbasis: it is the collection of supersets of ∆ˆ := ∆∪{1, x}∪{x, 1}∪{1, y}∪{y, 1} but ∆ˆ ◦ ∆ˆ = ∆ˆ ∪ {x, y} ∪ {y, x} hence (III) in Definition 1.5 is violated for ∆ˆ . Clearly, intersection of uniformities is a (subbasis of) uniformity iff they all contain a common (subbasis of) uniformity. Thus, for a collection of (subbases of) uniformities, we can define the smallest uniformity that contains all of them (since their union is a subbasis of uniformity).
Definition 1.7: (Uniform) topology from uniformity
Let D be a uniformity on X × X, then its induced topology (a.k.a. uniform topology) on X is defined using nhoods:
∀x ∈ X, Dx := {Dx : D ∈ D}, Dx := {y :(x, y) ∈ D}. (1)
We easily verify that {Dx}x∈X is a (basis, subbasis of) topology on X if D is a (basis, subbasis of) uniformity on X × X.
It is apparent that there is an intimate relation between topology and uniformity, and our central question is to reveal what kind of topology is derived from a uniformity. Example 1.8: Pseudo-metric uniformity
Let (X, d) be a pseudo-metric space, then it admits a natural uniformity whose basis is:
D := {(x, y): d(x, y) < r}r>0 (2)
Not surprisingly, the topology of X induced by the metric d coincides with the one induced by the above uniformity.
Proposition 1.9: Interior preserves uniformity
If D is a member of the uniformity D, then int D ∈ D, where the interior is taken w.r.t. the product of the uniform topology on X. Proof: Since D ∈ D there exists some symmetric E ∈ D such that E ⊆ E ◦ E ◦ E ⊆ D. We claim
December 14, 2015 revision: 1 main 3 1 Topological background
that E ⊆ int D: Indeed, (x, y) ∈ E =⇒ Ex × Ey ⊆ E ◦ E ◦ E ⊆ D.
Therefore, the collection of open symmetric sets D ∈ D is a basis of D. It is now clear that D ∈ D =⇒ D is an nhood of ∆. However, the converse need not hold: R2 1 Consider D := {(x, y) ∈ : |x − y| < 1+|y| } which is an nhood of the diagonal, but D cannot contain any member of the basis in (2).
Proposition 1.10: Closed sets as intersection of open sets
A topological space X is R0 (i.e., for all x, y ∈ X, x has an nhood not containing y iff y has an nhood not containing x) iff all closed set A ⊆ X can be written as the intersection of (a family of) open supersets of A. c Proof: Let X be R0 and A ⊆ X be closed. For any x 6∈ A and y ∈ A, the open set A contains x but not y, hence there is an open set Uy containing y but not x. Therefore the open set ∪y∈AUy contains U but not x. Since x is arbitrary, we know A is the intersection of all open supersets. Conversely, suppose any closed set A is the intersection of a family of open supersets. Take any c x, y ∈ X such that there is an open set U that contains x but not y. Thus the closed set U = ∩αVα c c c for a family of open sets Vα ⊇ U , and obviously y ∈ U , x 6∈ U . Therefore, there exists some α such that y ∈ Vα, x 6∈ Vα.
Note that a topological space is T1 iff it is T0 and R0. Moreover, a regular topological space is R0.
Proposition 1.11: Closure in uniform space T S For any subset A in a uniform space (X, D), cl A = DA, where DA := Dx. Similarly, for D∈D x∈A any B ⊆ X × X, cl B = T D ◦ B ◦ D. D∈D
Proof: x ∈ cl A ⇐⇒ ∀D ∈ D, Dx ∩ A 6= ∅ ⇐⇒ ∀ symmetric D ∈ D, x ∈ DA. Similarly, (x, y) ∈ cl B ⇐⇒ ∀ symmetric D ∈ D, Dx × Dy ∩ B 6= ∅ ⇐⇒ ∀ symmetric D ∈ D, (x, y) ∈ D ◦ B ◦ D.
Clearly, both intersections can be restricted to any symmetric basis (but not subbasis). It follows that for any E ∈ D, cl E ⊆ E ◦ E ◦ E, hence the collection of closed symmetric sets D ∈ D is a basis of D. Thus, the uniform topology is at least regular: Let x ∈ U for any open set U , then there exists a closed symmetric set D such that x ∈ Dx ⊆ U . Since the section map is closed, Dx is closed. Since Dx is an nhood, there is an open set V such that x ∈ V ⊆ Dx ⊆ U . Since the section map is open, DA is an open nhood of A if D is open. Thus, cl A can be written as the intersection of a family of open supersets of A.
Proposition 1.12: T0 = T2 in regular space
A regular topological space is T0 iff T1 iff T2.
Proof: Since a regular space is R0, it is T1 iff T0. If a regular space is T1, then it can separate disjoint points since a point is closed.
T T Therefore, the uniform topology is Hausdorff (T2) iff (cl x =) Dx = x iff D = ∆. D∈D D∈D
Definition 1.13: Uniform continuity
Let (X, D) and (Y, E) be two uniform spaces. We call the function f : X → Y uniformly continuous
December 14, 2015 revision: 1 main 4 1 Topological background
iff for all E ∈ E, the set {(x, y):(f(x), f(y)) ∈ E} ∈ D.
Theorem 1.14: Composition preserves uniform continuity
Let f :(X, D) → (Y, E), g :(Y, E) → (Z, F) be uniform continuous, then g ◦ f is also uniform continuous. Proof: For any F ∈ F,
∈D z }| { n o n o (x, y): g(f(x)), g(f(y)) ∈ F = (x, y):(f(x), f(y)) ∈ (a, b):(g(a), g(b)) ∈ F , | {z } ∈E
since both f and g are uniform continuous.
Theorem 1.15: Uniform continuous is continuous
A uniform continuous function f :(X, D) → (Y, E) is continuous (w.r.t. the uniform topology).
Proof: Fix any x and f(x). For any nhood Ef(x) of f(x), E ∈ E, the set D := {(y, z):(f(y), f(z)) ∈ E} ∈ D. Then f(Dx) ⊆ Ef(x).
The following definitions are slight modifications from their topological counterparts (with continuity enhanced to uniform continuity).
Definition 1.16: Making functions uniformly continuous
Let f : X → (Y, E), then the sets
{(x, y):(f(x), f(y)) ∈ E}E∈E
is easily verified to be a basis of uniformity. Therefore, by including all supersets we construct a coarsest uniformity on X that makes f uniform continuous. Similarly, there exists a coarsest uniformity W on X that makes a family of functions f α : X → α α (Y , E ) all uniformly continuous. Moreover, f :(Z, F) → (X, W) is uniformly continuous iff fα ◦ f is uniformly continuous for all α.
Definition 1.17: Subspace uniformity
Let A be a subset of the uniform space (X, D). We call A a (uniform) subspace of X if it is equipped with the coarsest uniformity such that the inclusion map ι : A → X, a 7→ a is uniform continuous. More precisely, the subspace uniformity on A is (A × A) ∩ D. Not surprisingly, the subspace topology on A coincides with the topology induced by the subspace uniformity.
Definition 1.18: Product uniformity
Let (Y α, Eα) be a collection of uniform spaces, then its product uniform space is defined as the Q α Q α α α Y such that the projections πα : α Y → Y are uniformly continuous. Again, the product topology coincides with the topology induced by the product uniformity. Moreover, the function Q α Q α f :(X, D) → ( α Y , α E ) is uniformly continuous iff πα ◦ f is uniformly continuous for all α.
December 14, 2015 revision: 1 main 5 1 Topological background
Definition 1.19: Quotient uniformity
Definition 1.20: Topological embedding Q Let fα : X → Xα be a family of functions and define the evaluation map e : X → α Xα, with [e(x)]α = fα(x). We usually can choose the space Xα, and we would like to know when the evaluation map e is a topological embedding. The functions fα : X → Xα separate points in X iff for all x 6= y in X there exists some α such that fα(x) 6= fα(y). This is equivalent to the evaluation map being 1-1. The functions fα : X → Xα separate points from closed sets in X iff for all x ∈ X and disjoint closed set A ⊆ X there exists some α such that fα(x) 6∈ cl fα(A).
Theorem 1.21: Topological (uniform) embedding Q The evaluation map e : X → α Xα is a topological (uniform) embedding iff the functions fα : X → Xα separate points and X is equipped with the coarsest topology (uniformity) that makes every fα (uniformly) continuous. Proof: We only prove the uniform case. The topological case is completely analogous. First note that the evaluation map e is 1-1 iff fα separate points. The evaluation map is uniformly continuous iff for all α, Dα ∈ Dα, the sets ( ) n Y Y αo (x, y) ∈ X × X : e(x), e(y) ∈ (u, v) ∈ Xα × Xα :(uα, vα) ∈ D , α α which after simplification are