<<

Christian Parkinson GRE Prep: & Notes 1

Week 6: Topology & Real Analysis Notes To this point, we have covered Calculus I, Calculus II, Calculus III, Differential Equations, Linear Algebra, and Abstract Algebra. These topics probably comprise more than 90% of the GRE math subject exam. The remainder of the exam is comprised of a seemingly random selection of problems from a variety of different fields (topology, real analysis, probability, combinatorics, discrete math, graph theory, algorithms, etc.). We can’t hope to cover all of this, but we will state some relevant definitions and theorems in Topology and Real Analysis.

Topology

The field of topology is concerned with the shape of spaces and their behavior under continuous transformations. Properties regarding shape and continuity are phrased using the concept of open sets.

Definition 1 (Topology / Open Sets). Let X be a and τ be a collection of of X. We say that τ is a topology on X if the following three properties hold:

(i) ∅,X ∈ τ n \ (ii) If T1,...Tn is a finite collection of members of τ, then Ti ∈ τ i=1 [ (iii) If {Ti}i∈I is any collection of members of τ, then Ti ∈ τ i∈I In this case, we call the pair (X, τ) a and we call the sets T ∈ τ open sets.

Note, there are two which we can always place on any set X: the trivial topol- ogy τ = {∅,X} and the discrete topology τ = P(X). Having defined open sets, we are able to define closed sets.

Definition 2 (Closed Sets). Let (X, τ) be a topological space. A set S ⊂ X is called closed iff Sc ∈ τ. That is, S is defined to be closed if Sc is open.

The words open and closed can be a bit confusing here. Often times students mistak- enly assume that a set is either open or closed; that these terms are mutually exclusive and describe all sets. This is not the case. Indeed, sets can be open, closed, neither open nor closed, or both open and closed. In any topological space (X, τ), the sets ∅ and X are both open and closed. By De Morgan’s laws, since finite intersections and arbitrary unions of open sets are open, we see that finite unions and arbitrary intersections of closed sets remain closed.

Example 3. The set of real numbers R becomes a topological space with open sets defined as follows. Define ∅ to be open and define ∅ 6= T ⊂ R to be open iff for all x ∈ T , there Christian Parkinson GRE Prep: Topology & Real Analysis Notes 2 exists ε > 0 such that (x − ε, x + ε) ⊂ T . Prototypical open sets in this topology are the open intervals (a, b) = {x ∈ R : a < x < b}. Indeed, this interval is open because for x ∈ (a, b), we can take ε = min{|x − a| , |x − b|} and we will find that (x − ε, x + ε) ⊂ (a, b). We can combine open sets via unions or (finite) intersections to make more open sets; for example (0, 1) ∪ (3, 5) is also an . Likewise, prototypical closed sets are closed in- tervals [a, b] = {x ∈ R : a ≤ x ≤ b}, and any intersection or (finite) of such sets will remain closed. As was observed above ∅ and R are both open and closed; in fact, in this space, these are the only sets which are both open and closed, though it is easy to construct sets which are neither open nor closed. Consider the set [0, 1) = {x ∈ R : 0 ≤ x < 1}. This set is not open because the point 0 is in the set, but it cannot be surrounded by an which remains in the set. The of this set is (−∞, 0) ∪ [1, ∞). This set is not open since 1 is in the set but cannot be surrounded by an interval which remains in the set. Since the complement is not open, the set [0, 1) is not closed. Note, this topology is called the standard topology on R.

Example 4. While the above example defines the standard topology on R, it is easy to come up with non-standard topologies as well. Indeed, let us now define T ⊂ R to be open if T can be written as a union of sets of the form [a, b) = {x ∈ R : a ≤ b < x}. These open sets comprise a topology on R. In this topology a prototypical open set is of the form [a, b). What other sets are open in this topology? Notice that

∞ [ (a, b) = [a + 1/n, b) n=1 which shows that sets of the form (a, b) remain open in this topology. Also notice that since [a, b) is open, we define [a, b)c = (−∞, a) ∪ [b, ∞) to be closed. However, both (−∞, a) and [b, ∞) are easily seen to be open, so the set (−∞, a) ∪ [b, ∞) is also open as a union of open sets. Since this set is open, it’s complement [a, b) is closed. Hence in this topology, all sets of the form [a, b) are both open and closed. The intervals [a, b] are closed and not open in this topology. Note, this topology is called the lower topology on R.

Notice in these example, the contains as open sets all of the sets which are open in the standard topology. In this way, the lower limit topology has “more” open sets and we can think of the lower limit topology “containing” the standard topology. We define these notions here.

Definition 5 (Finer & Coarser Topologies). Suppose that X is a set and τ, σ are two topologies on X. If τ ⊂ σ, we say that τ is coarser than σ and that σ is finer than τ.

On any space X, the finest topology is the discrete topology P(X) and the coarsest is the {∅,X}. A finer topology is one that can more specifically distinguish between elements. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 3

Definition 6 ( & ). Let (X, τ) be a topological space and let T ⊂ X. The interior of T is defined to be the largest open set contained in T . The closure of T is defined to be the smallest containing T . We denote these by int(T ) and cl(T ) respectively. In symbols, we have [ \ int(T ) = S and cl(T ) = S. S∈τ,S⊂T Sc∈τ,T ⊂S

Other common notations are T˚ for the interior of T , and T for the closure of T .

Example 7. Considering R with the standard topology and a, b ∈ R, a < b, we have int([a, b)) = (a, b) and cl([a, b)) = [a, b].

In both of the examples above, there was some notion of a “prototypical” open set, from which other open sets can be built. We give this notion a precise meaning here.

Definition 8 (Basis () for a Topology). Let X be a set and let β be a collection of subsets of X such that [ (1) X = B, B∈β

(2) if B1,B2 ∈ β, then for each x ∈ B1 ∩ B2, there is B3 ∈ β such that x ∈ B3 and B3 ⊂ B1 ∩ B2. Then the collection of sets  τ = T : T = ∪i∈I Bi for some collection of sets {Bi}i∈I ⊂ β

forms a topology on X. We call this τ the topology generated by β, and we call β a basis for the topology τ.

This is half definition and half theorem: we are defining what it means to be a basis, and asserting that the topology generated by a basis is indeed a topology. If we can identify a basis for a topology, then the basis sets are the “prototypical” open sets, and all other open sets can be built as unions of the basis sets. Morally, basis sets are representatives for the open sets; if you can prove a given property for basis sets, the property will likely hold for all open sets. Often times it is easiest to define a topology by identifying a basis.

Example 9. Above we defined the standard topology on R by saying that a set T is open if for all x ∈ T , there is ε > 0 such that (x − ε, x + ε) ⊂ T . It is important to see this definition of the topology; however, this is a much more analytic than topological definition. The topological way to define the standard topology on R would be to define it as the topol- ogy generated by the sets (a, b) where a, b ∈ R, a < b. Indeed, these two definitions of the standard topology are equivalent as the following proposition shows. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 4

Proposition 10. Suppose that (X, τ) is a topological space and that β is a basis for the topology τ. Then T ∈ τ iff for all x ∈ T , there is B ∈ β such that x ∈ B and B ⊂ T .

It is important to identify when two bases in a topological space generate the same topol- ogy and this next proposition deals with that question.

Proposition 11. Suppose that X is a set and β1, β2 are two bases for topologies τ1 and τ2. Then τ1 ⊂ τ2 iff for every B1 ∈ β1, and for every x ∈ B1, there is B2 ∈ β2 such that x ∈ B2 and B2 ⊂ B1. (Be very careful not to mix up the inclusions in this statement. What this is essentially saying is that β2 generates a larger (finer) topology iff β2 has more (smaller) sets.) Informally, the basis β2 generates a finer topology if we can squeeze basis sets from β2 inside basis sets from β1 (and not only that, but we can construct basis sets from β1 out of basis sets from β2).

Just as all groups have subgroups and all vector spaces have subspaces, there is a natural way to define subspaces of a topological space.

Definition 12 (). Let (X, τ) be a topological space and let Y ⊂ X. Then the collection of sets σ = {Y ∩ T : T ∈ τ} forms a topology on Y . This topology σ is called the subspace topology on Y inherited from (X, τ).

Again, this is part definition and part theorem; we are asserting that such σ does indeed define a topology on Y .

Example 13. Consider [0, 3] ⊂ R with the standard topology on R. Note that the subspace topology on [0, 3] includes standard open sets like (1, 2) since this set is open in R and (1, 2) = [0, 3] ∩ (1, 2).

Now consider the set (1, 3]. This set is not open in R; however, it is open in the subspace topology on [0, 3], because (1, 4) is open in R and (1, 3] = [0, 3] ∩ (1, 4).

Likewise there is a natural way to combine topological spaces in a Cartesian product.

Definition 14 (). Let (X, τ), (Y, σ). Recall the Cartesian product is given by coupling elements of X and Y : X × Y .= {(x, y): x ∈ X, y ∈ Y }. It is tempting to define a topology on X × Y comprised of sets of the form T × S for T ∈ τ, S ∈ σ. However, these do not form a topology on X × Y since a union of sets of this form will not be of this form anymore. So rather, we let β = {T × S : T ∈ τ, S ∈ σ} form the basis for a topology on X × Y . The topology generated by β is denoted τ × σ and the space (X × Y, τ × σ) is Christian Parkinson GRE Prep: Topology & Real Analysis Notes 5 the called the product space of X and Y .1

Example 15. Consider R with the standary topology, which we call τ1. The product space (R×R, τ1 ×τ1) can be visualized by drawing the plane with standard open sets as rectangles (a, b) × (c, d) = {(x, y) ∈ R × R : a < x < b and c < y < d}. Alternatively, we can consider 2 the set R of 2-dimensional vectors. On this space, we consider the topology τ2 generated 2 2 by circles: Br(v) = {z ∈ R : kv − zk < r} for v ∈ R and r > 0 (indeed, this is called the 2 x 2 standard topology on R ). We can identify each vector v = ( y ) ∈ R with the coordinates (x, y) ∈ R × R. Since this is a bijective map, the sets R × R and R2 are really the same. We’d like to know if the topologies τ1 × τ2 and τ2 are the same. To prove they are the same, consider the bases

β1 = {(a, b) × (c, d): a, b, c, d ∈ R, a < b, c < d} and  x 2 β2 = Br(v): v = ( y ) ∈ R , r > 0 .

For any B1 = (a, b)×(c, d) ∈ β1, take any (x, y) ∈ B1 and let r = min{x−a, b−x, y−x, d−y}. x Then for v = ( y ), we will have x ∈ Br(v) ⊂ B1; this shows that for any (x, y) ∈ B1, we can find a set B2 ∈ β2 such that (x, y) ∈ B2 and B2 ⊂ B1. Hence by Proposition 11, τ1×τ1 ⊂ τ2. x z Conversely, let v = ( y ) and r > 0√ and consider the set B2 = Br(v) ∈ β2. For any u = ( w ) ∈ 0 0 0 0 0 Br(v), define r = (r − ku − vk2)/ 2. Then the square B1 = (z − r , z + r ) × (w − r , w + r ) z satisfies u = ( w ) ∈ B1 and B1 ⊂ B2. Thus by Proposition 11, we have τ2 ⊂ τ1 × τ1, and we can conclude that τ2 = τ1 × τ1. (Note, this inclusions of basis sets is pictured in Figure 1.) That is, the standard topology on R2 is the product of two copies of the standard topology on R. More generally, for n ∈ N, we can define the standard topology on Rn to be the topology generated by open balls and we will find that this is the same as the product of n copies of the standard topology on R.

Topology gives us the minimum structure required to discuss limits and continuity. In- deed, in calculus we were only able to discuss these things because R is naturally a topological space with the standard topology. We give the topological definitions of limits and continuity here and discuss some of their properties.

∞ Definition 16 (Limit of a ). Let (X, τ) be a topological space and let {xn}n=1 ∞ be a sequence of values in X. We say that {xn}n=1 converges to a limit x ∈ X if for any open set T ∈ τ such that x ∈ T , there is N ∈ N, such that xn ∈ T for all n ≥ N. We write this as xn → x or lim xn = x. n→∞

1Note, this is only a good definition for the product topology when we are taking the product of a finite number of spaces. Indeed, if {(Xi, τi)}i∈I is an arbitrary collection of topological spaces, it is most natural Q to define the product topology on X = Xi to be the coarsest topology so that the projection maps i∈I Q πi : X → Xi are continuous. The topology defined generated by sets of the form i∈I Ui where Ui ∈ τi is then called the box topology. One can show that for a finite Cartesian product, the product topology and box topology agree with each other; this is not necessarily true for infinite products. (Another way to “correctly” Q define the product topology for an infinite product X = Xi is to let it be generated by sets of the form Q i∈I i∈I Ui where Ui ∈ τi and Ui = Xi for all but finitely many i ∈ I.) Christian Parkinson GRE Prep: Topology & Real Analysis Notes 6

Figure 1: A basis set from either topology τ1 × τ1 or τ2 can be fit around any point in a basis set from the other topology.

Note that this is a generalization of the definition we gave for a limit in calculus; in cal- culus we are always using the standard topology on R. One feature of the limit in calculus is that limits are unique: if xn → x and xn → y, then x = y. This is not true in a general topological space.

Example 17. Consider a space X with the trivial topology τ = {∅,X}. Take any sequence ∞ {xn}n=1 in X and apply the definition of the limit. For any x ∈ X, and any N ∈ N, we see that if T ∈ τ and x ∈ T , then T = X and xn ∈ T for all n ≥ N. Thus in this space, every sequence converges to every point.

There are non-trivial topologies where limits are still non-unique, but our intution tells us that limits should be unique and we can add a simple property to ensure that they are.

Definition 18 (Hausdorff Space). A topological space (X, τ) is called a Hausdorff space (or is said to have the Hausdorff property), if for all x, y ∈ X with x 6= y, there are open sets Tx,Ty ∈ τ such that x ∈ Tx, y ∈ Ty and Tx ∩ Ty = ∅.

Proposition 19. Limits in Hausdorff spaces are unique. That is, if (X, τ) is a Hausdorff ∞ ∞ space and {xn}n=1 is a sequence in X, then {xn}n=1 can have at most one limit x ∈ X.

Now we would like to discuss maps between spaces. As with linear transforms in linear algebra and homomorphisms in abstract algebra, we restrict our discussion to maps which preserve some of the underlying structure of the space. In topology, these are the continuous functions.

Definition 20 (Continuous ). Let (X, τ) and (Y, σ) be two topological spaces. A function f : X → Y is said to be continuous iff f −1(V ) = {x ∈ X : f(x) ∈ V } ∈ τ whenever V ∈ σ. That is, f is continuous if the preimage of every open set in Y is open in X. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 7

This gives us a way to define continuity without ever considering individual points. Con- trast this with the calculus definition of continuity where we first define what it means for a function to be continuous at a point, and then define continuous functions on a domain to be those which are continuous at each point. Of course, topology still has a notion of what it means to be continuous at a point.

Definition 21 (Pointwise Continuity). Let (X, τ) and (Y, σ) be two topological spaces and let f : X → Y . We say that f is continuous at the point x ∈ X, if for all V ∈ σ such that x ∈ V , we have f −1(V ) ∈ τ.

It is a good exercise to prove that when both X,Y are R with the standard topology, this definition of continuity is equivalent to the ε-δ definition of continuity presented in calculus.

Continuity is an important property because it preserves certain features of topological spaces. Indeed, if f : X → Y , we define the image of X under f by f(X) = {f(x): x ∈ X}. −1 We define σf to be the collection of subsets V ⊂ f(X) such that f (V ) is open in X. If f is continuous, this collection will form a topology, and this topology σf on f(X) will be the same as the subspace topology that f(X) inherits from (Y, σ). This shows that continuous functions respect the structure of the underlying spaces; in other words, continuous functions are the morphisms in the category of topological spaces. 2 Above we showed that the spaces (R , τ2) and (R × R, τ1 × τ1) are essentially the same space; it is important to be able to be able to identify equivalent spaces or distinguish be- tween distinct topological spaces and continuity helps us do that. We define a few more properties of topological spaces here.

Definition 22 (Connectedness). Let (X, τ) be a topological space and let C ⊂ X. We say that the set C is disconnected iff there exist non-empty A, B ∈ τ such that C ⊂ A ∪ B and A ∩ B = ∅. We say the space is connected iff it is not disconnected.

Example 23. Intuitively, a set is connected if it is in one whole piece; disconnected sets have separate pieces broken off from each other. Thus for exam- ple, in R with the standard topology, the set [5, 9) is connected while the set {1} ∪ [5, 9) ∪ (10, ∞) is dis- connected. However note, this is only a heuristic! It works very well in R, but even in R2 there are fa- mous examples that challenge this intuition. Indeed, consider the set C ⊂ R2 (pictured right) given by

 1  C = {(0, y): y ∈ [−1, 1]}∪ x, sin x : x ∈ (0, ∞) . This is called the Topolgist’s , and while it is defined as the disjoint union of two sets, it is actu- ally connected. Indeed, any open set containing the Figure 2: Topologist’s Sine Curve vertical strip {(0, y)}y∈[−1,1] will necessarily contain Christian Parkinson GRE Prep: Topology & Real Analysis Notes 8 some of the curve. Thus this set cannot be covered by two disjoint, open sets. (Actually, a stronger notion of connectedness is path-connectedness. Roughly speaking, a set is path-connected if one can draw a path between any two points in the set without leaving the set. At first glance, path-connectedness may seem equivalent to connectedness, but the topologist’s sine curve is connected without being path-connected.)

One important result involving connectedness helps us classify sets which are both open and closed.

Proposition 24. Suppose that (X, τ) is a topological space. Then X is connected iff the only sets which are both open and closed in X are ∅ and X itself.

Definition 25 (Compactness). Let (X, τ) be a topological space and let C ⊂ X. We say that C is compact iff from any collection of open sets {Ui}i∈I such that C ⊂ ∪i∈I Ui, we n can extract a finite collection of sets Ui1 ,...,Uin such that C ⊂ ∪k=1Uik .

Such a collection {Ui}i∈I in the definition of compactness is called an open cover of C. Thus, in words, a set C is compact if every open cover of C admits a finite subcover.

Example 26. Compactness is somehow a generalization of closedness and boundedness. Indeed, we give examples here of a which is not closed and a closed set which is not bounded and prove that neither are compact. Consider R with the standard topol- ogy and consider the bounded set (0, 1). This set is not closed, and we show that it is not ∞ compact. Consider Uk = (0, 1 − 1/k) for k ∈ N. We see that (0, 1) ⊂ ∪k=1Uk so {Uk} forms an open cover of (0, 1). However, if we take any finite subcollection Uk1 ,...,UkN of {Uk} N and let K = max{k1, . . . , kN }. Then (0, 1) 6⊂ ∪n=1Ukn = (0, 1 − 1/K). Thus there cannot be a finite subcover for this particular cover, and so this cover violates the definition of compactness and we conclude that (0, 1) is not compact. Similarly, consider the set [0, ∞). This set is closed, but is not bounded, and we show that it is not compact. Consider the ∞ collection Uk = (−1, k) for k ∈ N. We see [0, ∞) ⊂ ∪k=1Uk, but again, any finite collection N Uk1 ,...,UkN will satisfy [0, ∞) 6⊂ ∪n=1Ukn = (−1,K) where K = max{k1, . . . , kN }, and so this open cover admits no finite subcover, and hence we conclude that (0, ∞) is not compact. By contrast, a closed and bounded of R like [0, 1] is compact, though this is not at all trivial to prove. In general topological spaces, it is easier to show that a set isn’t compact, since this only requires exhibiting one example of an infinite cover that does not admit a finite subcover. In R (and more generally, in topologies), there are nice theorems which give concrete lists of properties which are equivalent to compactness. We cover some of these theorems in the Real Analysis portion of these notes.

Proposition 27. Suppose that (X, τ) and (Y, σ) are topological spaces and that f : X → Y is continuous. Let U ⊂ X and consider the image of U under f defined by f(U) .= {f(x): x ∈ U} ⊂ Y . If U is connected in X, then f(U) is connected in Y . Likewise, if U is compact, then f(U) is compact in Y .

This proposition tells us that these properties of connectedness and compactness are in- Christian Parkinson GRE Prep: Topology & Real Analysis Notes 9 variant under continuous maps. Statements like this help us characterize topological spaces. Note however that the converse is not true: a continuous map could still map a disconnected set to a connected set (for example, the f(x) = x2 on R maps the dis- connected set (−1, 0) ∪ (0, 1) to the connected set (0, 1)), or a non-compact set to a compact set (for example, the continuous map f(x) = sin(x) maps the non-compact set (0, ∞) to the compact set [−1, 1]). If we want a continuous map to not change the structure of a space at all, we need to require something more.

Definition 28 (). Suppose that (X, τ) and (Y, σ) are topological spaces. A function f : X → Y is called a homeomorphism iff the following four properties hold:

1. f is one-to-one,

2. f is onto,

3. f is continuous,

4. f −1 is continuous.

If such a function f exists, the topological spaces (X, τ) and (Y, σ) are called homeomorphic and we write X ∼= Y .

In words, a homeomorphism between two topological spaces is a bicontinuous bijection; it maps each space bijectively and continuously to the other. Homeomorphic spaces share essentially all important properties in common, so when two spaces are homeomorphic we think of them as ”morally the same space.” For this reason it can be important to identify whether two spaces are homemorphic. We make a few final statements about properties that homeomorphic spaces share, and give one example in conclusion.

Proposition 29. Let (X, τ) and (Y, σ) be topological space, let U ⊂ X and let f : X → Y be a homeomorphism. Then

ˆ U is open in X iff f(U) is open in Y ,

ˆ U is closed in X iff f(U) is closed Y ,

ˆ U is connected in X iff f(U) is connected in Y ,

ˆ U is compact in X iff f(U) is compact in Y ,

ˆ X is Hausdorff iff Y is Hausdorff.

Example 30. Consider R with the usual topology. Any open interval (a, b) is homeomorphic to the interval (0, 1) under the map f :(a, b) → (0, 1) defined by x − a f(x) = , x ∈ (a, b). b − a Christian Parkinson GRE Prep: Topology & Real Analysis Notes 10

Likewise, (0, 1) is homeomorphic to R under the map g : (0, 1) → R defined by

1  g(x) = tan π x − 2 , x ∈ (0, 1). Homemorphism is an equivalence relation (if two spaces are homemorphic to the same space, they are homemorphic to each other); indeed we can always compose homemorphic maps and retain a homemorphism. Thus any interval (a, b) is homemorphic to R. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 11

Real Analysis

Real analysis is concerned with the rigorous underpinnings of calculus. However, when we teach calculus, we do everything formally and so everything is assumed to be “nice” (all the common functions from calculus are smooth, for example). Now we make no such assumptions: analysis is largely about tearing down our intuition from calculus and building it back up again with rigor. Accordingly, most real analysis courses start with the basic construction of R; we discuss this formally later, but begin here by establishing properties of , functions, sets, etc. Much of this will overlap with the preceding topology notes, but often the same concepts are tackled in very different ways. Some of this will also be repeated from the Calculus I & II notes.

We’ll start by discussing general metric spaces, giving several definitions and theorems, and later specialize the conversation to R.

Definition 31 (). Let X be a set and let d : X × X → [0, ∞). We call d a metric on X (and call (X, d) a metric space) if the following three properties hold:

1. d(x, x) = 0 for all x ∈ X and d(x, y) > 0 when x, y ∈ X, x 6= y,

2. d(x, y) = d(y, x) for all x, y ∈ X,

3. d(x, z) ≤ d(x, y) + d(y, z) for all x, y, z ∈ X.

Metrics generalize the notion of distance to non-Euclidean spaces.

Example 32. The prototypical example of a metric space is R with the metric d(x, y) = |x − y|. This can be generalized to Rn. Indeed, in Rn, we define the metric

n !1/2 . X 2 n d(x, y) = kx − yk .= (xi − yi) , x, y ∈ R . i=1 Another example: for any set X, we can define the discrete metric d(x, y) = 0 if x = y and d(x, y) = 1 if x 6= y.

Definition 33 (Metric Topology). Let (X, d) be a metric space, x ∈ X and r > 0. Define the ball centered at x of radius r by

Br(x) = {y ∈ X : d(x, y) < r}.

Let τ be the topology generated by the set β = {Br(x): x ∈ X, r > 0}. This is called the metric topology on X. In this topology, a set U ⊂ X is open iff for all x ∈ U, there is r > 0 such that Br(x) ⊂ U. We will also refer to τ as the topology generated by the metric d. Conversely, if we have a topological space (X, τ) and there is a metric d on X that generates τ, then we call τ metrizable. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 12

For some example, the discrete metric on a set will generate the discrete topology. The standard metric d(x, y) = |x − y| on R will generate the standard topology on R. Metric topologies have very nice structure. Most of the topological properties discussed above can be given new definitions only using the metric structure that d lends to X. Thus, the definitions in real analysis and topology may look different at a first glance, but they are always compatible. We discuss some properties of metric topologies now.

Proposition 34. Let (X, d) be a metric space. The metric topology on (X, d) is Hausdorff.

This is very easy to prove: if x, y ∈ X and x 6= y then d(x, y) > 0. Then Bε(x) and Bε(y), where ε = d(x, y)/3, are open neighborhoods of x and y respectively which are dis- joint, proving that the space is Hausdorff.

∞ Definition 35 (). Let (X, d) be a metric space and let {xn}n=1 be a sequence in X. We say that x ∈ X is the limit of xn iff for all ε > 0, there is N ∈ N such that d(x, xn) < ε for all n ≥ N. In this case, we say that xn converges to x and we write xn → x or limn→∞ xn = x.

This definition of the limit is exactly as in calculus but generalized to arbitrary metric spaces. Limits give us a way to characterize closed sets in metric topologies.

Definition 36 (Limit Points). Let (X, d) be a metric space and let U ⊂ X. A point x ∈ X is called a of U if there is a sequence {xn} in U such that xn 6= x for all n ∈ N and xn → x.

Proposition 37. Let (X, d) be a metric space and let C ⊂ X. Then C is closed in the metric topology iff for all sqeuences {xn} in C converging to a limit x ∈ X, we have that x ∈ C. In the terminology of the above definition, a subset of a metric space is closed iff it contains all of its limit points.

Recall, in topology a set is closed iff the complement of the set is open. This theorem gives an alternate definition, and it is often times easier to check that a set contains its limit points than to check that its complement is open.

Definition 38 (Closure). Let U ⊂ X and let L = {x ∈ X \ U : x is a limit point of U.}. Then the closure of U is defined by U = U ∪ L. That is, the closure of U is the set U plus all of the limit points of U.

Note, we already defined the closure in topology to be smallest closed set containing a set. Again, these notions are compatible: U defined in the definition above is the smallest set which is closed in the metric topology and contains U. In many metric spaces, we can build any point in the space by considering a smaller set and taking limits.

Definition 39 (). Let (X, d) be a metric space and let D ⊂ X. We say that D is dense in X iff for all x ∈ X, there is a sequence {xn} in D such that xn → x. Equivalently, Christian Parkinson GRE Prep: Topology & Real Analysis Notes 13

D is dense in X iff for all x ∈ X and ε > 0, there is y ∈ D such that d(x, y) < ε. Again, equivalently, D is dense in X if D = X. One last equivalent statement: D is dense in X if every open subset of X contains a least one point in D.

Intuitively, a dense set is tightly packed into X; it may not include all elements, but the gaps between elements are infinitesimally small. In this way it seems like a dense set must contain “most” of the space, but this is a place where intuition fails. Indeed, a dense set can actually be quite small in a few different senses. We define one sense here and discuss it more when we discuss R later.

Definition 40 (). A set C is countable if there is a injective map f : C → N. Equivalently, a set C is countable if the elements of C can be listed in a ∞ sequence: C = {cn}n=1. A set is C countably infinite if there is a bijective map f : C → N.

Definition 41 (). Let (X, d) be a metric space (or more generally a topological space). We say that X is separable if there is a countable set D ⊂ X which is dense in X: D = X.

Again, intuitively, it may seem like a separable space needs to be “small” because it has a “small” dense set, but this intuition is not true in any meaningful sense. There are highly non-trivial separable spaces.

Definition 42 (). Let (X, d) be a metric space and let {xn} be a sequence in X. We call {xn} a Cauchy sequence iff for all ε > 0, there is N ∈ N such that d(xn, xm) < ε for all n, m ≥ N.

A Cauchy sequence is one that eventually begins to cluster together. Intuitively, we may think that if the sequence clusters together, it must cluster around some point and thus it will converge to that point. However, if the space X is “missing” some points, then the sequence may cluster around a missing point and thus fail to converge to any member of X. Thus we use Cauchy sequences to define a notion of a space not having any “missing” points.

Definition 43 (Complete Space). We call a metric space (X, d) complete iff for all Cauchy sequences {xn} in X, there is x ∈ X such that xn → x.

Complete spaces are nice because to prove a sequence {xn} has a limit one first needs to identify a candidate x and then prove that d(x, xn) becomes small. However, the candidate x may be difficult or impossible to identify. However, if the space is complete, to prove that a sequence converges, one no longer needs to identify a candidate; rather can instead prove that {xn} is a Cauchy sequence and conclude that it converges in that manner. Besides sequences, much of calculus is concerned with functions and their properties like continuity, differentiability and integrability. We can discuss continuity in general metric spaces; the other concepts require some of the structure of R, so we leave them for later.

Definition 44 (Continuity). Let (X, dX ) and (Y, dY ) be two metric spaces, let f : X → Y Christian Parkinson GRE Prep: Topology & Real Analysis Notes 14 and let x ∈ X. We say that f is continuous at x iff for all ε > 0, there is δ = δ(x, ε) > 0 such that for z ∈ X, dX (x, z) < δ =⇒ dY (f(x), f(z)) < ε. We say that f is continuous on X (or merely, f is continuous) iff f is continuous at every point x ∈ X. Equivalently, a function f is continuous on X iff for all x, y ∈ X and all ε > 0, there is δ = δ(x, y, ε) > 0 such that dX (x, y) < δ =⇒ dY (f(x), f(y)) < ε.

This is the exact notion of continuity that we presented in calculus, but generalized to metric spaces. Again, it is a useful exercise to prove that this notion of continuity is equiv- alent to the topological notion of continuity. Because metric spaces have nice structure, we can also characterize continuity in terms of limits of sequences.

Theorem 45 (Sequential Criterion Theorem). Let (X, dX ) and (Y, dY ) be metric spaces and let f : X → Y . Then f is continuous iff for all sequences {xn} in X converging to a point x ∈ X, we have that f(xn) → f(x) in Y .

A function satisfying the latter condition is said to be sequentially continuous, so this proposition tells us that a function between metric spaces is continuous iff it is sequentially continuous. In calculus, this would conclude our discussion of continuity. However, in analysis, we define more stringent definitions of continuity to more finely differentiate between classes of functions.

Definition 46 (Uniform Continuity). Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X → Y . We say that f is uniformly continuous iff for all ε > 0, there is δ = δ(ε) > 0 such that for all x, y ∈ X, dX (x, y) < δ =⇒ dY (f(x), f(y)) < ε.

At first glance this definition looks identical to the definition of continuity, but it is not. The subtle difference is in the order of the quantifiers. In the definition of continuity, the δ is allowed to depend on the particular x and y you are testing; in the definition of uniform continuity, δ cannot depend on x and y: there must be a uniform δ that only depends ε. In logical notation this different is expressed as such: f is continuous iff

(∀ε > 0)(∀x, y ∈ X)(∃δ > 0) ; dX (x, y) < δ =⇒ dY (f(x), f(y)) < ε, whereas f is uniformly continuous iff

(∀ε > 0)(∃δ > 0)(∀x, y ∈ X); dX (x, y) < δ =⇒ dY (f(x), f(y)) < ε.

Thus uniform continuity is a stronger condition: if f is uniformly continuous, then f is continuous, but not vice versa. And even stronger notion of continuity is as follows.

Definition 47 (Lipschitz Continuity). Let (X, dX ) and (Y, dY ) be two metric spaces and let f : X → Y . We say that f is Lipschitz continuous iff there is a constant L > 0 such that for all x, y ∈ X, dY (f(x), f(y)) ≤ L · dX (x, y). In this case, the smallest such L is called the Lipschitz constant of f. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 15

Thus Lipschitz continuous function have an explicit bound on the distance between f(x) and f(y) in terms of the distance between x and y. If f is Lipschitz continuous with constant L, then for any ε > 0, we can take δ = ε/L and we will find that f satisfies the definition of uniform continuity; hence Lipschitz continuity is even stronger.

With this we drop generality and talk specifically about the analytic and topological structure of R. Again, we will not explicitly construct the real numbers, but we’ll present the rough idea, which is to start with rational numbers and define real numbers as limit points of Cauchy sequences of rationals.

Definition 48 (Rational Numbers). Define the set of rational numbers to be those p which can be written as a ratio of . That is, Q = { q : p, q ∈ Z, q 6= 0}. With this definition, the rational numbers form a field: they are an abelian group under addition and, neglecting the additive identity, they are an abelian group under multiplication.

The rational numbers become a metric space with the metric d(a, b) = |a − b| for a, b ∈ Q.

Proposition 49. Q is countably infinite. That is, we can enumerate the rationals in a ∞ sequence Q = {qn}n=1.

This requires what is called a diagonalization argument. One can list the rationals in a two-dimensional table where row k corresponds to the rationals whose denominator is k. Then one can traverse the table down each diagonal, assigning a to each .

Proposition 50. Q is indiscrete in the sense that between any two rational numbers, one can find another rational number. Indeed, if a, b ∈ Q and a < b, then for large enough 1 1 n ∈ N, we have a < a + n < b and a + n is still rational.

From this proposition, it seems that the rational numbers do not have any large holes, and this may lead us to believe that the rational numbers are complete, but this is incorrect. Indeed, the rational numbers do not form a .

2 Proposition 51. There is no x ∈ Q such that x = 2, but there is a Cauchy sequence {xn} 2 in Q such that xn → 2.

This proposition ruins completeness. Define f : Q → Q by f(x) = x2 for x ∈ Q. We see that f(0) = 0 and f(2) = 4, so intuitively because f cannot jump over points, we should be able to find x ∈ Q such that f(x) = 2, but we’ve just asserted that this is impossible. Thus we conclude that either f is discontinuous because it jumps over a point (but it is easy to show that f is indeed continuous), or that Q is missing some points. It is the latter that is true. In the terminology of metric spaces, this shows that Q is not complete. However, for any metric space, we can define a notion of the completion of the space; that is, we can define a new metric space by adding some points, which is compatible with the original but is complete. This is how we define R. That is, the real numbers are defined as those numbers Christian Parkinson GRE Prep: Topology & Real Analysis Notes 16

which can be realized as limit points of Cauchy sequences of rational numbers. Thus any rational is real, but the numbers x such that x2 = 2 are real without being rational. Though this is essentially the definition of R, we state this as a proposition.

Proposition 52. Any is a limit of rational numbers. That is, Q is dense in R, and thus any open set in R contains rational numbers. Furthermore R is complete under the metric d(x, y) = |x − y| , x, y ∈ R.

Definition 53 (Irrational Numbers). The irrational numbers to be those which are real but not rational; that is, the irrational numbers are given by R \ Q = {x ∈ R : x 6∈ Q}.

Thus, for example, the solution to x2 = 2 are irrational, and of course, since f : [0, ∞) → 2 [0, ∞) defined by f(x) = x for x ∈ [0, ∞) is bijective,√ we can define√ an inverse map, and once we’ve done, we denote the solutions to x2 = 2 as 2 and − 2. Other common irrational numbers are π and e. We asserted before that Q is a countable set. It is reasonable to ask if R is still countable since all numbers in R are simply limits of numbers in Q.

Proposition 54. The set R of real numbers is not countable, and thus R\Q is not countable either.

To prove this, one can use another type of diagonalization argument. If we assume that the numbers between 0 and 1 are countable, then we can list their decimal representations, but then it is not difficult to explicitly construct a number between 0 and 1 which is not accounted for in the list. In this sense Q is much smaller than R, but is still manages to be dense in R. While we know there are holes in Q, we might still wonder about the general structure of Q within R.

Proposition 55. Q is disconnected in R. √ √ Indeed, we see that Q ⊂ (−∞, 2) ∪ ( 2, ∞). which shows that Q is contained in two disjoint open sets. We can make this even stronger.

Proposition 56. The irrational numbers R \ Q are dense in R.

Thus in between any two rationals, we can find an irrational, and we can use this to show that if a set of rational numbers has two points, then it cannot be connected; that is, Q is totally disconnected as√ a subset of R. This example that 2 6∈ Q displays something troubling about the structure of Q. Con- sider the set A = {x ∈ Q : x2 < 2}. It is easy to see that this set is bounded (the elements of this set do not get arbitrarily large; for example, for x ∈ A, we will certainly have x < 5), but there is no tight upper bound in Q. That is, for any rational number q such that x < q for any x ∈ A, one could find a smaller rational number p < q such that x < p for all x ∈ A. In short: there is no least upper bound for this set in Q. This is a feature which is fixed by moving to R. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 17

Definition 57 (Supremum & Infimum). Suppose that A ⊂ R. The supremum of A is defined to be the least upper bound of A. That is, the supremum is the number S ∈ R (if such a number exists) such that x ≤ S for all x ∈ A, and if x ≤ M for all x ∈ A, then S ≤ M. Likewise, the infimum of the set A is defined to the be greatest lower bounded of A. That is, the infimum is the number I ∈ R (if such a number exists) such that x ≥ I for all x ∈ A, and if x ≥ m for all x ∈ A, then I ≥ m. When such numbers S,I exist, we write S = sup(A) and I = inf(A).

Definition 58 (Bounded Set). A set A ⊂ R is called bounded, if there is M > 0 such that |x| ≤ M for all x ∈ A.

Proposition 59. Every non-empty bounded set in R has a finite supremum and infimum. (Indeed, if we allow the supemum or infimum to take the values ±∞, then every non- in R has a supremum and infimum.)

This is one more way in which we have “filled in the holes” when moving from Q to R.

We want to further study the analytical and topological properties of R. We have already remarked that R is complete and thus every Cauchy sequence in R has a limit in R. As we said before, one advantage of this is that when testing if a sequence has a limit, we do not need to identify a candidate for the limit to prove convergence. We would like other such tests to characterize when limits in R exist.

Theorem 60 (Monotone Convergence Theorem). Suppose that {xn} is a sequence in R which is increasing (that is, xn ≤ xn+1 for all n ∈ N) and bounded above (that is, there exists M > 0 such that xn ≤ M for all n ∈ N). Then {xn} converges (and in fact, {xn} will converge to the least M such that xn ≤ M for all n ∈ N). Likewise if {xn} is decreasing and bounded below then it converges.

Loosely speaking, there are two possible ways for a sequence not to converge. It could 2 either blow up to ±∞ as with the sequence xn = n or it could oscillate between certain n values as with the sequence xn = (−1) . In the first case, no matter how we look at it, the sequence will always diverge, but in the latter case, if we look only along the even terms 2n x2n = (−1) = 1, then we have a stable sequence which converges. The succeeding defini- tions and theorems deals with this situation.

∞ Definition 61 (Subsequence). Suppose that {xn}n=1 is a sequence in R. A subsequence ∞ is a sequence {xnk }k=1 such that nk < nk+1 for all k ∈ N. Thus {xnk } ⊂ {xn}.

Theorem 62 (Convergence Along Subsequences). Suppose that {xn} is a sequence in R such that xn → x ∈ R as n → ∞. Then every subsequence {xnk } will also satisfy xnk → x as k → ∞.

Theorem 63 (Bolzano-Weierstrass Theorem). Every bounded sequence has a con- vergent subsequence. That is, suppose that {xn} is sequence in R and there is M > 0 such Christian Parkinson GRE Prep: Topology & Real Analysis Notes 18

that |xn| ≤ M for all n ∈ N. Then there exists some susbsequence {xnk } which converges.

Another way to state the above theorem is that if the sequence {xn} resides in the bounded set (a, b), then along a subsequence we can find a limit. If instead we consider the closed set [a, b], then this set contains its limit points and so the limit must also lie in [a, b]. With this is mind, we state a theorem characterizing compact sets in R.

Theorem 64 (Heine-Borel Theorem). A subset of R is compact (in the sense of open covers) iff it is closed and bounded.

Thus compact sets are precisely those which contain all their limit points and are not too large in either direction. With this in mind, the prototypical compact sets in R are the closed intervals [a, b] where a, b ∈ R, a < b. However, this theorem applies more generally in the metric topology on Rn. In light of the Bolzano-Weierstrass Theorem, we can add another equivalent statement.

Proposition 65. Suppose that C ⊂ R. Then the following are equivalent: ˆ C is compact,

ˆ C is closed and bounded,

ˆ every sequence in C has subsequence converging to a point in C.

Sets satisfying the third property are called sequentially compact and this theorem tells us that in R, sets are compact iff they are sequentially compact. Now a valid question is: why is compactness an important property? The definition of compactness is somewhat opaque, but compactness allows you to narrow your focus from infinitey many things to finitely many things. This especially comes in handy when dealing with functions. Indeed, we will move from here to discussing functions on R.

Definition 66 (Bounded Function). Suppose that f : A → R where A ⊂ R. We say that f id bounded iff there is M > 0 such that |f(x)| ≤ M for all x ∈ A. That is, f is bounded if the image {f(x): x ∈ A} is a bounded set in R.

Proposition 67. Continuous functions from compact sets into R are bounded, and achieve maximum and minimum values. Specifically, suppose that C ⊂ R, C is compact and f : C → R is continuous. Then there are xmin, xmax ∈ C such that f(xmin) ≤ f(x) ≤ f(xmax) for all x ∈ C. [Note: this is not only asserting that f remains trapped between two extreme values, it is also asserting the existence of xmin and xmax where f meets those extreme values.]

How does compactness come into play here? Consider, if f is continuous then the sets Un = {x ∈ C : f(x) < n} for n ∈ N are open since they are the pre-image of the open sets (−∞, n). Also since f(x) ∈ R for all x ∈ C, for each x ∈ C we can find n ∈ N such that x ∈ Un. This shows that {Un} is an open cover of C. If C is compact, there is a

finite subcover Un1 , ··· ,UnK . However, these sets are nested, so this shows that C ⊂ UN Christian Parkinson GRE Prep: Topology & Real Analysis Notes 19

where N = max{n1, . . . , nK }. Then for all x ∈ C, we have f(x) < N, which shows that f is bounded from above. What has happened here? We began with infinitely many different bounds f(x) < n for n ∈ N each of which may have applied at different portions of C. Using compactness we were able to pare this down to a finite number of bounds, and then simply pick the largest one. Making a similar argument, we can get a lower bound, and thus the range {f(x): x ∈ C} is bounded. Since the set is bounded, it has an infimum and supremum. The theorem also asserts that f will meet these values. How do we find the point that meets the supremum? We can contruct a maximizing sequence {xn} such that

f(xn) → sup f(x) = sup{f(x): x ∈ C}. x∈C

By sequential compactness, the sequence has a subsequence {xnk } with a limit xmax ∈ C and by continuity, we will have f(xnk ) → f(xmax) and f(xmax) = supx∈C f(x). Thus while compactness helps us arrive at the bound on f, sequential compactness helps us actually find the point where f achieves the bound. Above we defined not only continuity but also uniform continuity and Lipschitz conti- nuity. We would like easy ways to identify which functions satisfy these stronger properties and this is somewhere where compactness can help a bit as well.

Proposition 68. Continuous functions on compact sets are uniformly continuous. That is, suppose that C ⊂ R and f : C → R is continuous. If C is compact, then f is uniformly continuous.

Again, we should observe how compactness comes into play. Fix ε > 0. Recall, if f is continuous at each point x ∈ C, then for each individual point, we can find δx > 0 such that for y ∈ C, |x − y| < δx =⇒ |f(x) − f(y)| < ε. Here we have (possibly) infinitely many different δx > 0, but if we want to satisfy the definition of uniform continuity, we need to have a single δ > 0. If the number δx > 0 works in the definition of continuity at x ∈ C, then 0 any smaller number 0 < δ < δx will also work. Thus one idea is to take the minimum over all such δx > 0, and this minimal δ will work for all x ∈ C. However, the set {δx}x∈C may not have a minimum and its infimum maybe zero, so this doesn’t quite work. But we note that the sets (x − δx, x + δx) form any open cover of C. If C is compact, we can extrace a finite subcover (x1 − δx1 , x1 + δx1 ),..., (xK − δxK , xK + δxK ) which still covers all of C. Now there are only finitely many δxk to choose from; choosing the minimum δ = min{δx1 , . . . , δxK } will provide a δ > 0 which works uniformly over all x ∈ C, proving that f is uniformly contin- uous. Again, we started with an infinite collection, and compactness allowed us to pare it down to a finite collection.

Lipschitz continuity can also be easier to identify via compactness but in a slightly more complicated way. First, recall a few definitions and theorems from calculus (for more expo- sition regarding the calculus topic, one can look back to the calculus notes).

Definition 69 (Lipschitz Continuity). Suppose that f : R → R. We call f Lipschitz continuous if there is a constant L > 0 such that for all x, y ∈ R, |f(x) − f(y)| ≤ L |x − y| . Christian Parkinson GRE Prep: Topology & Real Analysis Notes 20

Definition 70 (Differentiability). Suppose that f : R → R and x ∈ R. We say that f is differentiable at x if the limit f(x + h) − f(x) lim h→0 h exists. In this case we call the limit f 0(x). We say that f is differentiable in a set A ⊂ R if f is differentiable at all x ∈ A.

Theorem 71 (Mean Value Theorem). Suppose that f : R → R is differentiable in R. Then for any a, b ∈ R, a < b, there is c ∈ (a, b) such that f(b) − f(a) = f 0(c). b − a This, in turn, implies that |f(b) − f(a)| = |f 0(c)| |b − a| .

Note the similarity between the last statement and the definition of Lipschitz continuity. There’s a very formal similarity that hints at some connection of the form |f 0(c)| ∼ L. In- deed, we can make this precise.

Proposition 72. Suppose that f : R → R is differentiable on R. If the derivative f 0 : R → R is continuous, then f is Lipschitz continuous on any compact subset of R. If the derivative f 0 is bounded, then f is Lipschitz continuous on all of R. If f 0 is unbounded on some subset of R, then f is not Lipschitz continuous on that subset.

This gives a rough equivalence between Lipschitz continuous functions and continuously differentiable functions. However, based on this continuous differentiability still seems a bit stronger than Lipschitz continuity and indeed, it is. Take for example, the function f(x) = |x| for x ∈ R. This function if Lipschitz continuous with Lipschitz constant 1 because of the reverse triangle inequality:

|f(x) − f(y)| = |x| − |y| ≤ |x − y| .

However, this function is not differentiable on all of R. [In fact, a famous theorem states thata function is Lipschitz continuous functions iff it is differentiable almost everywhere2 and the a.e. derivative is essentially bounded.]

Finally, we discuss sequences of functions and the interplay between sequences of func- tions and the Riemann . Indeed, one of the reasons that the Lebesgue integral and the field of measure theory came about was because the does not play nice with sequences of functions, as we will see shortly.

Definition 73 (). Let A ⊂ R and let {fn} be a sequence of functions fn : A → R. We say that the sequence {fn} converges pointwise to a function 2Here almost everywhere has a technical meaning. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 21

f : A → R iff for all x ∈ A, the sequence {fn(x)} in R converges to f(x). That is, {fn} converges pointwise to f iff for every x ∈ A and ε > 0, there is N = N(x, ε) ∈ N such that |fn(x) − f(x)| < ε for all n ≥ N.

Example 74. Consider the sequences fn, gn :[−1, 1] → R given by q n 2 1 fn(x) = |x| , and gn(x) = x + n for x ∈ [−1, 1]. Note that fn(0) = 0, fn(±1) = 1, for all n ∈ N. If x ∈ [−1, 1] \ {−1, 0, 1}, then log |x| < 0 and so

n n log|x| lim fn(x) = lim |x| = lim e = 0. n→∞ n→∞ n→∞

Thus fn converges pointwise to the function f :[−1, 1] → R defined by 1, x = −1, 1, f(x) = 0, x ∈ (−1, 1).

Next for gn, note that for all x ∈ [−1, 1], √ q 2 2 1 |x| = x ≤ x + n = gn(x).

Conversely, if a, b ≥ 0 then |a2 + b2| ≤ a+b (one can easily verify this inequality by squaring both sides), and so r 1 1 g (x) = x2 + ≤ |x| + √ . n n n Combining the inequalities and subtracting |x|, we see 1 0 ≤ g (x) − |x| ≤ √ , x ∈ [−1, 1]. n n Thus for all x ∈ [−1, 1], we have lim gn(x) = |x| n→∞ and so gn converges pointwise to the absolute value function on [−1, 1].

There are two interesting differences to point out between these examples. In the first example, we had a sequence of continuous functions which converged pointwise to a discon- tinuous function, which is somewhat disconcerting (this is similar to before when we had a sequence of rationals converging to an irrational; morally, this indicates that continiuous functions are “incomplete with respect to pointwise limits”). The other difference is that proving the convergence of fn required special consideration for different values of x, whereas proving the convergence for gn did not. To address both of these, we introduce a stronger notion of convergence. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 22

Definition 75 (). Let A ⊂ R and let {fn} be a sequence of functions fn : A → R. We say that the sequence {fn} converges uniformly to a function f : A → R iff for every ε > 0, there is N = N(ε) ∈ N such that |fn(x) − f(x)| < ε for all x ∈ A and n ≥ N.

Again, at first glance this looks roughly the same as pointwise convergence, but the word “uniform” indicates that the same N in the definition of convergence works for all x in the set. Because there are two different modes of convergence, when considering functions the statement fn → f is vague, and one should always specify what type of convergence is be- ing proven/assumed (there are many other types of convergence as well; these are typically addressed in a first course on measure theory). Uniform convergence is important for the following reason.

Proposition 76. A uniform limit of continuous functions is continuous. That is, if A ⊂ R and {fn} is a sequence of continous functions fn : A → R such that fn converges to f : A → R uniformly, then f is also continuous.

Morally, this proposition states that continuous functions are “complete with respect to uniform convergence.” We would like to place some topological structure on sets of continu- ous functions to make this more rigorous.

Definition 77 (Continuous Functions as a ). Suppose that A ⊂ R. Define the set C(A) = {f : A → R | f is continuous}. This set becomes a vector space over R under pointwise addition and pointwise scalar multiplication. That is, if f, g ∈ C(A) and α ∈ R, define (f + αg): R → R by (f + αg)(x) = f(x) + αg(x), x ∈ A.

Then (f + αg) ∈ C(A).

Definition 78 (/Metric on Continuous Functions). Suppose that A ⊂ R is compact. Define the map k · k : C(A) → [0, ∞) by

kfk = sup |f(x)| , f ∈ C(A). x∈A This map defines a norm on C(A). Thus d : C(A) × C(A) → [0, ∞) defined by

d(f, g) = kf − gk, f, g ∈ C(A) is a metric on C(A).

Using Proposition 76 one can prove that for A ⊂ R compact, the metric space (C(A), d) with d defined as in Definition 78 is a complete metric space (indeed convergence in this metric space is precisely uniform convergence). This fact is very important in differential equations, where one uses Picard iteration to prove existence and uniqueness of certain equations. There are several more theorems regarding the structure of C(A), focusing, for Christian Parkinson GRE Prep: Topology & Real Analysis Notes 23

example, on identifying sets which are compact in the metric topology or identifying the continuous dual space of C(A). These are beyond the scope of these notes.

Lastly, we want to examine the interplay between the Riemann integral and convergence of sequences of functions. We will not recall the definition of the Riemann integral here (see the notes on Calculus I), except to remind the reader that it is essentially “area under the curve” and can be computed using the Fundamental Theorem of Calculus. With this in mind, it is reasonable to think that if fn converges to f (in some sense) then the limit of the Riemann of fn should converge to the Riemann integral of f. However, this is not always the case.

Example 79. We’ll do two examples here where we have functions fn → f (in some sense) but the limit of the integral is not the integral of the limit. This will show that R b R b in many cases a fn(x)dx 6→ a f(x)dx even when fn → f. For the first example, con- sider fn : [0, 1] → R defined by  4n2x, 0 ≤ x < 1 ,  2n   2 1  1 1 fn(x) = 4n n − x , 2n ≤ x < n ,    1 0, n ≥ x ≤ 1. This is graphed in figure 3. Note that for all n ∈ N, fn(0) = 0 and for any x ∈ (0, 1], we have fn(x) = 0 for all n ≥ d1/xe. This shows that fn(x) → 0 for all x ∈ [0, 1] and we conclude that fn converges pointwise to the zero function f ≡ 0. However, calcu- Figure 3: Example 79, fn(x) lating the area under the curve, we see that R 1 1 1 0 f(x)dx = 2 · n · 2n = 1 for all n ∈ N, and so we have a situation where Z 1 Z 1 1 = lim fn(x)dx 6= f(x)dx = 0. n→∞ 0 0 [Note: this (and similar examples) is often referred to as “vertical escape to ∞”; the mass under the curve vanished as n → ∞ because the functions got very large on a very small set] For a second example, consider the functions fn : [0, ∞) → R defined by 0, 0 ≤ x < n,    1 fn(x) = n , n ≤ x ≤ 2n,   0, 2n < x < ∞. Christian Parkinson GRE Prep: Topology & Real Analysis Notes 24

Here we see that 0 ≤ fn(x) ≤ 1/n for all x ∈ [0, ∞) and all n ∈ N, from which it readily follows that fn converges uniformly to the zero function f ≡ 0. However, calculating the R ∞ 1 integral, we find that 0 fn(x)dx = n · (2n − n) = 1 and so once again, we have Z 1 Z 1 1 = lim fn(x)dx 6= f(x)dx = 0. n→∞ 0 0 [Note: this (and similar examples) is referred to as “horizontal escape to ∞”; the mass under the curve vanished as n → ∞ because the of the functions went to ∞.]

R b R b Both of these display cases where we may have a fn(x)dx 6→ a f(x)dx. What was the key feature of each example? In the first example, the convergence was non-uniform; in the second example, the domain was non-compact. If both these are rectified, then we can guarantee that limits of integrals are integrals of limits.

Proposition 80. Suppose that a, b ∈ R and a sequence fn :[a, b] → R converges uniformly to f :[a, b] → R. Then Z b Z b lim fn(x)dx = f(x)dx. n→∞ a a R b This proposition shows that the map f 7→ a f(x)dx is a continuous functional on C([a, b]). One final note: the conditions of Proposition 80 are sufficient but they are n not necessary. Going back to the example fn(x) = |x| for x ∈ (−1, 1) and n ∈ N, we note that fn converges pointwise to the zero function f ≡ 0 and indeed we also have Z 1 2 Z 1 |x|n dx = → 0 = f(x)dx, −1 n + 1 −1 so in this case uniform convergence and a compact domain were not necessary. This was one of the motivating factors for developing measure theory and the Lebesgue integral: in order to find less stringent conditions on the behavior of fn while still guaranteeing that R R X fn(x)dx → X f(x)dx when fn → f. There is one theorem that is particularly helpful for this. Because we do not have the machinery of measure theory, we cannot state the theorem in full generality but we will state a particular case.

Theorem 81 (Lebesgue Dominated Convergence Theorem (Baby Version)). Sup- pose that fn :[a, b] → R is a sequence of continuous functions converging pointwise to f :[a, b] → R. Further suppose that there is a constant M > 0 such that |fn(x)| ≤ M for all n ∈ N and all x ∈ [a, b]. Then Z b Z b lim fn(x)dx = f(x)dx. n→∞ a a This version of the theorem essentially says that so long as there is no possibility of “vertical/horizontal escape to ∞” as in Example 79, we will have the desired behavior for the limit of the integrals.