“measureTheory_v2” 2019/5/1 i i page 61 i i
Chapter 5 Construction of a General Measure Structure
Simple methods will soon lead us to results of far reaching theoretical and practical importance. We shall encounter theoretical conclusions which not only are unexpected but actually come as a shock to intuition and common sense.
W. Feller
What I don’t like about measure theory is that you have to say “almost everywhere” almost everywhere.
K. Friedrichs
Keep in mind that there are millions of theorems but only thousands of proofs, hundreds of proof blocks, and dozens of ideas. Unfortunately, no one has figured out how to transfer the ideas directly yet, so you have to extract them from complicated arguments by yourself. F. Nazarov
Let me do it. You tell me when you want it and where you want it to land, and I’ll do it backwards and tell you when to take off.
K. Johnson
I’ve been giving this lecture to first-year classes for over twenty-five years. You’d think they would begin to understand it by now.
J. Littlewood
Panorama From developing some ideas of measure theory from an experimental point of view, we turn 180◦ to the development of a rigorous general measure theory. The ingredients are a set describing the “universe” of points, a class of “measurable” subsets along with permissible operations on these sets, and the measure itself. After postulating the basic properties of measure theory, we can then develop consequences and applications of those properties to build a rich theory. That is the focus of the rest of this book.
61
i i
i i “measureTheory_v2” 2019/5/1 i i page 62 i 62 Chapter 5. Construction of a General Measure Structure i
However, before starting that path, we have to construct interesting measures satis- fying the assumptions and we have to explain how to compute the measure of compli- cated sets. The latter point is critical as it is impractical to assign the measure of every complicated set. We have to build a systematic method for computing the measures of complicated sets based on the measures of a class of simple sets. Indeed, this is how we approached measure in Chapter 4, where we started with the lengths of intervals. So, after describing the basic properties of the class of measurable sets and the measure, we develop a systematic approach to compute the measure of complicated sets. A number of different approaches to do this have been developed over the decades. We use the approach of Carathéodory because it is simultaneously general yet still closely related to intuition. The construction of measure is carried out in two stages, involving first an “outer measure” stage and then a “premeasure” stage. This is a long process that involves overcoming a number of technical difficulties. It is hard and likely to take some time to understand. This chapter is abstract, beginning with the assumption of a generic “master” set or “universe” X. We supply a number of simple examples that are illustrative but not very practical. The choice of X is important in practice. For example, as with B and I from Chapter 4, it may be related to modeling a physical situation. Its properties have a strong impact on the properties of measure. For example, whether or not X is a metric space and whether or not X is bounded are important. In this chapter, we give a real application of the general theory to systematic construction of measure on a general metric space. In the next chapter, we develop the main application to Euclidean space. We conclude this chapter with a general result on approximation of measures. This is a reality check in the sense that the long, complicated development of measure theory results in a concept of measure that can be computed through an approximation process.
5.1 Sigma algebras
Assume that X is a nonempty set.
We begin by describing the class of “measurable” sets. For convenience, we repeat Definition 2.1.4.
Definition 5.1.1
The family of all subsets of X is called the power set of X and is denoted by PX.
Recall that in Chapter 4, we found that natural questions about sequences of coin flips lead to consideration of countable unions and intersections, as well as complements of simple intervals. In abstract, defining a class of measurable sets involves specifying the permissible set operations that allow combining given measurable sets to get new measurable sets. This definition is very important in practice, since most of the time we build rich collections by starting with collection of simple sets and using the permissible operations. We distinguish the cases of finite and countable numbers of operations. Dealing with finite numbers of operations fits intuition about measuring sizes of sets, but we need to deal with countable numbers of operations to reach the desired generality. For those readers who have an aversion to “structural classification”, the name alge- bra does not imply that we dive into subjects like group theory.
i i
i i “measureTheory_v2” 2019/5/1 i i page 63 i 5.1. Sigma algebras 63 i
Definition 5.1.2: Algebra
An algebra on X is a non-empty collection of subsets M with the following properties,
1. If A ∈ M then Ac ∈ M .(Closed under complements) Sm 2. If A1,A2,...,Am ∈ M then i=1 Ai ∈ M .(Closed under finite unions)
Example 5.1.1
Let X = (0, 1] and M = {∅, finite unions of disjoint intervals that are open on the Sm left and closed on the right}. A typical A ∈ M has the form A = i=1(ai, bi], with ai > bi−1. It is easy to see that M is closed under complements and finite unions after noting that the complement of such an interval consists of two disjoint like intervals and the union of two such intervals that overlap is a like interval. The analogous set is not an algebra if X = R - try the complement condition.
Definition 5.1.3: σ-algebra
A σ- algebra (sigma algebra) on X is a non-empty collection of subsets M with the properties,
1. If A ∈ M then Ac ∈ M .(Closed under complements) ∞ ∞ S 2. If {Ai}i=1 is a collection of sets in M then Ai ∈ M .(Closed under i=1 countable unions)
Two immediate examples (proof is an exercise),
Theorem 5.1.1
PX and {∅, X} are σ- algebras.
Definition 5.1.4
PX is the maximal σ- algebra and {∅, X} is the trivial or minimal σ- algebra.
It is natural to wonder why it is necessary to consider σ- algebras other than PX since it contains all subsets of X and therefore is the “biggest” collection. It turns out n that PX contains too many subsets in the case of X = R . A simple σ-algebra that is not trivial:
Example 5.1.2
If A ⊂ X, the collection {∅, A, Ac, X} is a σ-algebra.
Conditions defining a σ- algebra can use a variety of set properties.
i i
i i “measureTheory_v2” 2019/5/1 i i page 64 i 64 Chapter 5. Construction of a General Measure Structure i
Example 5.1.3
If X is uncountable, M = {A ⊂ X : A is countable or Ac is countable} is a σ- algebra. c First note that (Ac) = A. Thus, if A ∈ M , it is either countable or Ac is c ∞ countable. So, A ∈ M . Let {Ai}i=1 ⊂ M . If all the sets are countable then the countable union is countable and so is in M . If not all the sets are countable, there ∞ c ∞ + c S T c exists an index j ∈ Z such that Aj is countable. Thus, Ai = Ai ⊂ i=1 i=1 c Aj is countable.
An algebra does not have to be a σ- algebra.
Example 5.1.4
Consider the algebra M defined in Example 5.1.1. M is not a σ- algebra. To see this, consider the sets
1 1 1 1 1 1 A = 0, ,A = + , + + , 1 2 2 2 22 2 22 23 1 1 1 1 1 1 1 1 1 A = + + + , + + + + , ··· . 3 2 22 23 24 2 22 23 24 25 S∞ Then, i=1 Ai is a countable union of disjoint intervals that is not a finite union of disjoint intervals.
Next, we present another example built on half-open intervals that is a σ- algebra.
Example 5.1.5
Let X = R and let M = {∅, countable unions of disjoint intervals of the form [i, i+1), and half rays (−∞, i) and [i, ∞), i ∈ Z}. ∅ and R are in M . A countable union of collections of countable unions of disjoint intervals [i, i+1) and indicated rays is a countable union of indicated intervals and rays. To see that M is closed under complements, for example, note that [i, i + 1)c = (−∞, i) ∪ [i + 1, ∞). If j < i, then [j, j + 1) ∪ [i, i + 1) = (∞, j) ∪ [j + 1, i) ∪ [i + 1, ∞). Thus, we can show that the complement of a countable union of indicated intervals and rays, which is the intersection of the complements of such sets, can be written as a countable union of indicated intervals and rays. Thus, M is a σ- algebra.
It is a good exercise to compare Examples 5.1.1 and 5.1.5. The starting point for defining a measure consists of a set and a σ- algebra.
Definition 5.1.5: Measurable space
If X has a σ- algebra M , we call (X, M ) a measurable space. The sets in M are called measurable sets.
i i
i i “measureTheory_v2” 2019/5/1 i i page 65 i 5.1. Sigma algebras 65 i
Example 5.1.6
In Chapter 4, we developed a rough picture of a collection of measurable sets in (0, 1] based on performing set operations such as union, intersection, and com- plement starting with half-open intervals (a, b]. We extend the idea to all of R and make it precise in Chapter 6 to obtain the Borel and closely related Lebesgue σ- algebras. These are certainly two of the most important examples of a σ- algebra, but establishing their existence takes some work. For now, we anticipate that countable unions and intersections of open, closed, and half-open intervals are in the σ- algebras.
Remark 5.1.1
If familar, it may help acceptance of the abstraction of defining a space through set operations by recalling the idea of a topological space, which is a nonempty set of points together with a family of subsets called the open sets that have the properties: (1) the space and the empty set are open; (2) a finite intersection of open sets is open; (3) any union of open sets is open. The properties of a topological space are an expression of the fundamental properties of “openness”, without any additional complications added. A metric space is an example of a topological space, though a metric space has more assumptions. Note that the differences in assumptions between measurable and topological spaces lead to very different constructions.
It is not immediately apparent, but the few assumptions for a σ- algebra have a number of consequences for other set operations.
Theorem 5.1.2: Basic Properties of σ- algebras
1.A σ- algebra on X is an algebra on X. 2. If M is an algebra or σ- algebra on X, then X, ∅ ∈ M . m 3. If M is an algebra on X and {Ai}i=1 is a collection of sets in M then m T Ai ∈ M .(Closed under finite intersections) i=1 ∞ 4. If M is a σ- algebra on X and {Ai}i=1 is a collection of sets in M then ∞ T Ai ∈ M .(Closed under countable intersections) i=1 5. If M is an algebra or σ- algebra on X and if A, B ∈ M , then A\B ∈ M .
Proof. We prove these in order. Result 1 Follows from the definitions. Result 2 Since M is nonempty there is a A ⊂ X such that A ∈ M . Thus Ac ∈ M and X = A ∪ Ac ∈ M . Since X ∈ M , Xc = ∅ ∈ M . m c m Sm c T Sm c c Result 3 Since {Ai }i=1 ⊂ M and i=1 Ai ∈ M , Ai = ( i=1 Ai ) ∈ M . i=1 ∞ c ∞ S∞ c T S∞ c c Result 4 Since {Ai }i=1 ⊂ M and i=1 Ai ∈ M , Ai = ( i=1 Ai ) ∈ M . i=1
i i
i i “measureTheory_v2” 2019/5/1 i i page 66 i 66 Chapter 5. Construction of a General Measure Structure i
Result 5 From above, A\B = A ∩ Bc ∈ M .
Remark 5.1.2
Result 2, ∅, X ∈ M , is often assumed in the definition of algebras and σ- algebras. As shown, the assumption that X is nonempty implies this holds. The following theorem provides a useful way to generate a σ- algebras on a member of a σ- algebra.
Theorem 5.1.3
Let M be a σ-algebra on X and B ⊂ X. Then, the collection MB = {B ∩ A : A ∈ M } is a σ- algebra on B.
Proof. Let B1 ∈ MB, then B1 = B ∩ A1 for some A1 ∈ M . The complement of B1 c c c c c c c in B is B ∩ B1. Now, B ∩ B1 = B ∩ (B ∪ A1) and B ∩ B1 = (B ∩ B ) ∪ (B ∩ A1). c c ∞ So B ∩ B1 = B ∩ A1 ∈ MB. Let {Bi}i=1 be a sequence in MB. Each Bi = B ∩ Ai ∞ ∞ S S for some Ai ∈ M . Therefore, Bi = B ∩ Ai ∈ MB. i=1 i=1
This result is very useful. Below, we construct a σ−algebra on R and easily obtain a σ−algebra on any measurable subset of R. As mentioned, in practice we start with a collection of simple sets and build a σ- algebra using set operations. Obviously, it is easier to define an algebra than a σ- algebra. The next result gives conditions under which an algebra is a σ-algebra.
Theorem 5.1.4
∞ An algebra of sets M on X is a σ- algebra if and only if {Ai}i=1 ⊂ M is a disjoint ∞ S collection implies Ai ∈ M .(Closed under countable disjoint unions) i=1
Proof. ⇒ follows by definition. ∞ For ⇐, consider a collection of sets {Bi}i=1 ⊂ M that may or may not be disjoint. ∞ Following Theorem 2.4.2, we can construct a disjoint collection {Aj}j=1 such that ∞ ∞ S S Bi = Aj ∈ M . i=1 j=1
There is little chance that an arbitrary collection of sets is a σ- algebra. It is natural to wonder if it is possible to construct a σ- algebra starting with a given collection of sets. The following result gives a partial answer.
Theorem 5.1.5
1. The intersection of any collection of σ- algebras on X is a σ- algebra. 2. If A is a collection of subsets in X, there is a unique smallest σ- algebra M containing A in the sense that any σ- algebra containing A also contains
i i
i i “measureTheory_v2” 2019/5/1 i i page 67 i 5.1. Sigma algebras 67 i
M . The result is "partial" in the sense that while it says that the unique smallest σ- algebra exists, the proof does not give a practical procedure to construct it.
Proof. Result 1 Let O be a collection of σ- algebras on X. Define, \ N = M = {A ⊂ X : A ∈ M for all M ∈ O}. M ∈O Suppose A ∈ N . Then, Ac ∈ M for all M ∈ O and therefore Ac ∈ N . In a similar ∞ S way, if {Ai} ⊂ N , then Ai ∈ N . i=1 Result 2 Define M to be the intersection of all σ- algebras containing A . The inter- section is nonempty and is itself a σ- algebra by 1. By definition, M is contained in any σ- algebra that contains A .
Definition 5.1.6
Let A be a collection of subsets of X. The unique smallest σ- algebra containing A is denoted by σ (A ) and is said to be the σ- algebra generated by A .
Example 5.1.7
Let X = {1, 2, 3, 4} and set A1 = {1, 2}, A2 = {2, 3}, and finally define A = {A1,A2}. Then σ(A ) = PX. {2} = A1 ∩ A2, {1} = A1\(A1 ∩ A2), {3} = c A2\(A1 ∩ A2), {4} = (A1 ∪ A2) , and now the other subsets can be obtained using unions.
Example 5.1.8
Let X = N and let A be the collection of sets consisting of a finite number of nonnegative integers. It is an easy exercise to see that σ(A ) = PX.
Example 5.1.9
Let X = {a, b, c, d} and set A1 = {a, b}, A2 = {b}, and finally define A = {A1,A2}. It is a good exercise to show that, σ(A ) = ∅, {a}, {b}, {a, b}, {c, d}, {a, c, d}, {b, c, d}, {a, b, c, d} 6= PX. Proving the following is a good exercise.
Theorem 5.1.6
1. If M is a σ- algebra on X, then M = σ (M ). 2. For any A ⊂ X, we have σ ({A}) = {∅, A, Ac, X}.
i i
i i “measureTheory_v2” 2019/5/1 i i page 68 i 68 Chapter 5. Construction of a General Measure Structure i
3. If A ⊂ B are collections of subsets of X, then σ (A ) ⊂ σ (B).
The following useful result gives conditions under which the σ- algebra generated by a collection of sets inherits some desirable property of those sets.
Theorem 5.1.7: Principle of Inheritance
Let A be a collection of subsets of X. Define C to be a collection of sets generated from A with a given property, i.e.,
C = {A ∈ σ (A ): A has a desired property}.
If A ⊂ C and C is a σ- algebra, then σ (A ) ⊂ C .
Definition 5.1.7
Theorem 5.1.7 is also called the Principle of Appropriate Sets and Principle of Good Sets.
Proof. σ (A ) ⊂ σ (C ) = C .
Example 5.1.10
Let M be a σ- algebra on X and B ∈ M . By Theorem 5.1.3, the collection MB = {B ∩ A : A ∈ M } is a σ- algebra on B. We show that if A is a collection of sets such that σ (A ) = M , then σ (AB) = MB, where AB = {B ∩ A : A ∈ A }. Theorem 5.1.6 implies that σ (AB) ⊂ MB. To show MB ⊂ σ (AB), consider the collection C = {A ∈ M : B ∩ A ∈ σ (AB)}. It is a good exercise to show that C is a σ- algebra. The result now follows from Theorem 5.1.7.
5.2 Measure In this section, we develop the basic properties of measure, which is a means of quanti- fying the “size” of the sets in a given σ- algebra on a set X. In other words, a measure is defined on a measurable space. Size could be the analog of length, area or volume as discussed in Chapter 4. But, it can be something else entirely, e.g. evaluating probabil- ity. In Chapter 4, the probability measure coincided with the measure of “length”, which worked out because of the length of the unit interval is 1 and we require the probability of the entire space to be 1 as well. But, that is a special case. In this section, we develop general properties and consequences. We leave a discus- sion of how measure is actually computed to later.
Let (X, M ) be a measurable space where X is nonempty.
i i
i i “measureTheory_v2” 2019/5/1 i i page 69 i 5.2. Measure 69 i
Definition 5.2.1: Measure
An additive measure on (X, M ) is a set function µ : M → [0, ∞] satisfying, • µ (∅) = 0, m • If {Ai}i=1 is a collection of disjoint sets in M , then
m ! m [ X µ Ai = µ (Ai) . i=1 i=1
A (countably additive) measure on (X, M ) is a set function µ : M → [0, ∞] satisfying, • µ (∅) = 0, ∞ • If {Ai}i=1 is a collection of disjoint sets in M , then
∞ ! ∞ [ X µ Ai = µ (Ai). i=1 i=1
Below, measure refers to a countably additive measure. Finite additivity fits intuition of how a function that measures the size of sets should behave, and countable additivity is the extension to countable collection of sets. Chapter 4 provides motivation for such an extension. Example 5.2.1
On the σ- algebra M = ∅, {a}, {b}, {a, b}, {c, d}, {a, c, d}, {b, c, d}, {a, b, c, d} on X = {a, b, c, d} defined in Example 5.1.9, define µ(∅) = 0, µ({a}) = 1, µ({b}) = 1, µ({a, b}) = 2, µ({c, d}) = 1, µ({a, c, d}) = 2, µ({b, c, d}) = 2, µ({a, b, c, d}) = 3.
It is easy to verify that µ is a measure.
The next example is sufficiently interesting to be described as a theorem. It shows how to turn an ordinary function on a countable set into a measure. As the basis of a measure space, a countable set is relatively easy to deal with since we can use the power set as a σ-algebra.
Theorem 5.2.1
Let X be countable, M = PX, and f : X → [0, ∞] any function. Define X µ (A) = f(x),A ∈ M . x∈A
Then, µ is a measure on (X, M ).
Note that we know that X is equivalent to N, i.e., X ∼ N, and we can write f(i) = ai, i = 0, 1, 2, ··· . Hence, Theorem 5.2.1 defines a measure on the space defined by considering subsets of the elements of a given sequence.
i i
i i “measureTheory_v2” 2019/5/1 i i page 70 i 70 Chapter 5. Construction of a General Measure Structure i
Proof. This is a good exercise. Countability of X and the nonnegativity of f are key.
Definition 5.2.2: Counting measure
If f(x) = 1 for all x ∈ X in Theorem 5.2.1, then µ is called the counting measure. If there is a point x0 ∈ X with f(x0) = 1 and f(x) = 0 for x 6= x0, then µ is called the point mass at x0. Recall that Example 5.1.3 is an example of how a variety of set properties can be used to define a σ- algebra. This is reflected in choices of measure.
Example 5.2.2
Let X be uncountable and let M be the σ- algebra defined by,
c {A ⊂ X : A is countable or A is countable} . It is a good exercise to show that the set function, ( 0, A is countable, µ (A) = 1,Ac is countable,
is a measure.
Example 5.2.3
Let X be an infinite set, M = PX, and define ( 0,A finite, µ (A) = ∞,A infinite.
It is a good exercise to show that µ is a finitely additive measure but it is not countably additive.
Now, we have defined the three ingredients for measure theory.
Definition 5.2.3
If (X, M ) is a measurable space on which there is a measure µ, then the triple (X, M , µ) is called a measure space.
Remark 5.2.1
All of these terms are abused regularly. We might say µ is a measure on X, where M is taken to be the “natural” domain for µ. For example, there is a natural choice in a metric space. If we know M , then we also know X since X is a maximal ele- ment of M . So X may not be mentioned explicitly. We try to be careful in notation because we want to emphasize that using measure theory requires specification of
i i
i i “measureTheory_v2” 2019/5/1 i i page 71 i 5.2. Measure 71 i
a universe, a σ- algebra, and a measure.
Now let (X, M , µ) be a measure space where X is nonempty.
There is a significant difference between the cases when the space has finite mea- sure and when it does not. In the latter situation, we distinguish two particular cases.
Definition 5.2.4
We say that µ is a finite measure if µ (X) < ∞ and that (X, M , µ) is a finite measure space. We say that µ is a probability measure if µ (X) = 1 and (X, M , µ) is a probability space. We say that µ is a σ- finite measure if M contains an increasing sequence ∞ S A1 ⊂ A2 ⊂ A3 ... such that Ai = X and µ (Ai) < ∞ for all i. We say that i=1 (X, M , µ) is a σ- finite measure space.
A finite measure is trivially σ-finite.
Example 5.2.4
Consider X = N and let µ be the measure defined in Theorem 5.2.1. 1. If f(i) = 2−(i+1), the resulting measure is a probability measure. −1 2 P∞ 2 2. If f(i) = C /(i + 1) , with C = i=1 1/(i + 1) , then the resulting measure is a probability measure. 3. If f(i) = 1/(i + 1), the resulting measure is not finite, but it is σ-finite.
∞ [ The idea behind σ- finite follows naturally from considering that R = (−i, i), i=1 which allows the extension of the Lebesgue measure on a finite interval developed in- formally in Chapter 4 to R.
We assume that any measure being considered is σ- finite (which includes finite measures).
Dealing with non-σ- finite measures require additional assumptions and work and some important results do not hold. The following Theorem gives a useful equivalence that is a good exercise.
Theorem 5.2.2
(X, M , µ) is σ- finite if and only if there is a countable disjoint collection of sets ∞ S∞ {Ai}i=1 with finite measure such that X = i=1 Ai.
i i
i i “measureTheory_v2” 2019/5/1 i i page 72 i 72 Chapter 5. Construction of a General Measure Structure i
Example 5.2.5
∞ ∞ [ [ R = (−i, i) = (i, i + 1]. i=1 i=−∞
Definition 5.2.5
For any measure space (X, M , µ), any countable disjoint collection of sets with ∞ S∞ finite measure {Ai}i=1 such that X = i=1 Ai is called a measurable decompo- sition of X. We note above that defining a measure is not the same as giving a practical recipe for computing the measure of sets. Even direct verification of the properties of a measure is difficult outside simple examples. A natural approach is to start with a “measure- like” function that satisfies the key properties on an algebra of relatively simple sets and then undertake some kind of limiting process to expand the domain and range of the “proto-measure”. We discuss this below. Next, we explore consequences of the assumptions made about measures. In par- ticular, we show that a measure, which is a function on sets, behaves continuously with respect to sequences of sets. It may be useful to review Section 2.4 before studying this theorem. Theorem 5.2.3: Properties of Measure
1. µ is finitely additive. 2. If A, B ∈ M and A ⊂ B then µ (A) ≤ µ (B). 3. If A, B ∈ M and A ⊂ B where µ (A) < ∞, then µ (B\A) = µ (B) − µ (A). ∞ 4. If {Ai}i=1 is a collection of sets in M , then
∞ ! ∞ [ X µ Ai ≤ µ (Ai). i=1 i=1
∞ 5. If {Ai}i=1 is a monotone sequence of sets in M such that A1 ⊂ A2 ⊂ A3 ... , then ∞ ! [ µ Ai = lim µ (Ai) . i→∞ i=1 ∞ 6. If {Ai}i=1 is a monotone sequence of sets in M such that A1 ⊃ A2 ⊃ + A3 ... and µ (Am) < ∞ for some m ∈ Z , then
∞ ! \ µ Ai = lim µ (Ai) . i→∞ i=1
Example 5.2.6
Using the preliminary ideas behind the Lebesgue measure µL on (0, 1], the sets
i i
i i “measureTheory_v2” 2019/5/1 i i page 73 i 5.2. Measure 73 i
Ai = (0, 1 − 1/i) for i ≥ 2 are measurable and µL (Ai) = 1 − 1/i. We have S∞ Ai ⊂ Ai+1, limi→∞ µL (Ai) = 1, and i=1 Ai = (0, 1).
Example 5.2.7
Using the preliminary ideas behind the Lebesgue measure µL on (0, 1], the sets Ai = (0, 1 + 1/i) for i ≥ 1 are measurable and µL (Ai) = 1 + 1/i. We have T∞ Ai ⊃ Ai+1, limi→∞ µL (Ai) = 1, and i=1 Ai = (0, 1].
Example 5.2.8
It is a good exercise to verify that the properties of Theorem 5.2.3 hold directly for the measure defined in Theorem 5.2.1.
These properties are essential to computing measures of complicated sets.
Definition 5.2.6
Property 2 is called monotonicity and Property 4 is called subadditivity. Prop- erty 5 is called continuity from below and Property 6 is called continuity from above.
Proof. We prove in order.
Result 1 Let Ai = ∅ for all i larger than a given finite index. Result 2 If A ⊂ B, then µ(B) = µ(A) + µ(B ∩ Ac) ≥ µ(A). We use the fact that A and B ∩ Ac are disjoint. Result 3 Since µ (A) < ∞ we can subtract the µ (A) from µ(B) = µ(A)+µ(B ∩Ac) to obtain µ (B) − µ (A) = µ (B ∩ Ac) = µ (B − A).
Result 4 Following Theorem 2.4.2, we construct a sequence of disjoint sets: B1 = A1 j−1 ∞ S S and Bj = Aj Ai for j ≥ 2. Note that µ (Bj) ≤ µ (Aj) for all j and Ai = i=1 i=1 ∞ S Bi. Hence, i=1
∞ ! ∞ ! ∞ ∞ [ [ X X µ Ai = µ Bi = µ (Bi) ≤ µ (Ai). i=1 i=1 i=1 i=1
S∞ Result 5 First we note that if µ(Ai) = ∞ some i, then µ j=1 Aj = µ(Ai) + S∞ µ j=1 Aj\Ai since ∞ = ∞. Moreover, µ(Aj) = ∞ for j ≥ i. So the result S∞ holds. So we assume that µ(Ai) is finite for all i. Set A0 = ∅. Since i=1 Ai = S∞ i=1(Ai\Ai−1), where the latter union is disjoint,
∞ ! ∞ [ X µ Ai = µ (Ai\Ai−1). i=1 i=1
i i
i i “measureTheory_v2” 2019/5/1 i i page 74 i 74 Chapter 5. Construction of a General Measure Structure i
We write the series as a limit of partial sums and use properties of measure to write,
∞ m m X X X µ (Ai\Ai−1) = lim µ (Ai\Ai−1) = lim µ (Ai) − µ (Ai−1) . m→∞ m→∞ i=1 i=1 i=1 We use the fact that the partial sums “telescope” to conclude
∞ ! m [ X µ Ai = lim µ (Ai) − µ (Ai−1) = lim µ(Am). m→∞ m→∞ i=1 i=1
Result 6 Let Bj = Am\Aj for j > m, and Bj = ∅ for j ≤ m. Then, Bm+1 ⊂ Bm+2 ⊂ ... , µ(Am) = µ(Bj) + µ(Aj) for j > m, and
∞ ∞ [ \ Bj = Am\ Aj . j=m+1 j=m
By Result 5,
∞ ∞ \ \ µ(Am) = µ Aj + lim µ(Bi) = µ Aj + lim µ(Am) − µ(Ai) . i→∞ i→∞ j=m j=m
Since µ(Am) < ∞, we can subtract it from both sides.
The extra assumption in proving continuity from above is necessary. It is a good exercise ∞ to produce an example of a decreasing sequence {Ai}i=1 with µ (Ai) = ∞ for all i, ∞ T but Ai = ∅. i=1
Remark 5.2.2
The proof of Result 2 shows that additive measures are monotone. The proof of Result 4 is a classic type of measure theory argument.
The continuity of a measure is an important property that leads to other useful prop- erties. For example, the following result says continuity together with finite additivity gives countable additivity.
Theorem 5.2.4
Let µ be a finitely additive measure on an algebra A . ∞ S∞ 1. If for every increasing sequence of sets {Ai}i=1 ⊂ A with A = i=1 Ai ⊂ A (so Ai ↑ A), µ(Ai) → µ(A), then µ is countably additive on A . ∞ 2. If for every decreasing sequence of sets {Ai}i=1 ⊂ A with Ai ↓ ∅, µ(Ai) → 0, then µ is countably additive on A .
Proof. ∞ Sm Result 1 As usual, we may assume the {Ai}i=1 are disjoint. If Bm = i=1 Ai, then Pm Bm ↑ A and µ(Bm) → µ(A). But, finite additivity implies µ(Bm) = i=1 µ(Ai) → P∞ i=1 µ(Ai) = µ(A).
i i
i i “measureTheory_v2” 2019/5/1 i i page 75 i 5.3. Sets of measure zero, completion of measure 75 i
∞ S∞ Result 2 Assume {Ai}i=1 ⊂ A is a disjoint collection with A = i=1 Ai ⊂ A Sm and set Bm = i=1 Ai. Since µ(A) = µ(Bm) + µ(A \ Bm) and A \ Bm ↓ ∅, µ(Bm) → µ(A). Now we apply the proof for Result 1.
The next theorem shows that the assumption of σ- finite implies that the space can- not have “too much content”.
Theorem 5.2.5
If (X, M , µ) is σ- finite, then M cannot contain an uncountable, disjoint collec- tion of sets of positive measure.
Proof. Let E = {Aα}α∈A be a disjoint collection of subsets in M such that for each ∞ α ∈ A , µ (Aα) > 0. We show that E is countable. There is a sequence of sets {Bi}i=1 ∞ S such that Bi % X and µ (Bi) < ∞ for each i. For any A ∈ E , A = (A ∩ Bi). i=1 ∞ We use {Bi}i=1 and a countable set of lower bounds to create a countable partition of E into sub-collections of sets where the measures of sets in a given sub-collection are bounded away from zero by one of the lower bounds. For j ∈ N, define 1 Ei,j = A ∈ E : µ (A ∩ Bi) > . j
For all i, j, Ei,j ⊂ E and for any A ∈ E there is a i, j ∈ N such that A ∈ Ei,j. Hence,
∞ [ E = Ei,j. i,j=1
The result follows if we show that Ei,j is finite for all indices. Consider a finite sequence m of sets {Ck}k=1 in a Ei,j. Since, these sets are disjoint,
m m ! m X [ ≤ µ (C ∩ B ) = µ (C ∩ B ) ≤ µ (B ) . j k i k i i k=1 k=1
Thus, m ≤ jµ (Bi), i.e. m is bounded by a constant that depends only on i and j, and therefore Ei,j is finite.
Remark 5.2.3
This proof is a typical σ- finite argument.
5.3 Sets of measure zero, completion of measure The discussion in Section 4.3 hints at the importance of dealing with sets of measure zero. There is a technical issue about such sets that we settle in this section.
Let (X, M , µ) be a measure space where X is nonempty.
i i
i i “measureTheory_v2” 2019/5/1 i i page 76 i 76 Chapter 5. Construction of a General Measure Structure i
Definition 5.3.1: Sets of measure zero
A set E ∈ M with µ (E) = 0 is called a set of measure zero. If a statement about points x ∈ X is true except for x in a set of measure zero, we say that the statement is true almost everywhere (a.e. ).
It is necessary to reconcile this definition with Definition 4.3.4 introduced in Chap- ter 4 for Lebesgue measure. In that chapter, we built up the idea of measure based on measuring the size of intervals. Now, we are dealing with an abstract construction of measure. Note that if we prove there is a Lebesgue measure µL and if A is a Lebesgue mea- ∞ surable set that is covered by a countable set of intervals {Ii}i=1, then Theorem 5.2.3 implies ∞ ! ∞ [ X µL(A) ≤ µL Ii ≤ µL(Ii). i=1 i=1 Now, if we can make the sum on the right smaller than any given δ > 0 by a suitable ∞ choice of {Ii}i=1, then µL(A) = 0. So, Definition 4.3.4 implies that Definition 5.3.1 holds. The reverse implication requires some proof, and we discuss that later. First, recall this claim from Chapter 4. Its proof is a good exercise.
Theorem 5.3.1
• A finite or countable union of sets of measure zero in M has measure zero. • If A is a measurable set of measure zero in M then any measurable subset of A has measure zero.
However, a subset of a measurable set of measure zero is not necessarily measur- able!
Example 5.3.1
Consider the σ- algebra {X, ∅} and the measure µ that is zero on this σ- algebra. Then µ is not defined on any proper, non-empty subset of X.
The fact that a set of measure 0 can contain a nonmeasurable set is not really sur- prising. After all, we have seen that sets of measure 0 can be very complicated, e.g. the Cantor Set (Definition 4.3.6) and the non-Normal numbers Nc. We resolve this annoy- ing point by adding all those subsets of sets in a σ- algebra that have measure 0 to the σ- algebra.
Definition 5.3.2
If M contains all subsets of sets in M with measure 0, then (X, M , µ) is com- plete.
Being complete eliminates some annoying issues and it can always be obtained by enlarging the domain of a given measure to obtain an equivalent measure in the following sense:
i i
i i “measureTheory_v2” 2019/5/1 i i page 77 i 5.3. Sets of measure zero, completion of measure 77 i
Theorem 5.3.2: Completion of a measure
Let N = {N ∈ M : µ (N) = 0}. Define,
M = {A ∪ B : A ∈ M and B ⊂ N for some N ∈ N }.
Then, M is a σ- algebra on X that contains M . Moreover, the unique measure µ on M defined by µ (A ∪ B) = µ (A) for all A ∈ M and B ⊂ N for some N ∈ N makes X, M , µ a complete measure space.
We literally add all the subsets of measurable sets of measure zero to the σ- algebra and define their measure to be 0.
Definition 5.3.3
µ is an example of an extension of µ from M to M . This particular extension is called the completion of µ and M is the completion of M with respect to µ.
Proof. Clearly M ⊂ M . We show that M is a σ- algebra. Let C ∈ M , so C = A∪B with A ∈ M and B ⊂ N for some N ∈ N . Since B ⊂ N, N c ⊂ Bc and Bc is equal to the disjoint union Bc = N c ∪(N ∩Bc). This implies, Cc = (Ac ∩ N c) ∪ (Ac ∩ N ∩ Bc). Now, (Ac ∩ N c) ∈ M since both E,N ∈ M . Also, (Ac ∩ N ∩ Bc) ⊂ N. Hence, Cc ∈ M . ∞ Let {Ci}i=1 be a sequence of sets in M . For each i, Ci = Ai ∪ Bi with Ai ∈ M and Bi ⊂ Ni for some Ni ∈ N . Thus,
∞ ∞ ! ∞ ! [ [ [ [ Ci = Ai Bi , i=1 i=1 i=1
∞ ∞ ∞ ∞ S S S S where Ai ∈ M and Bi ⊂ Ni. From Theorem 5.3.1, Ni ∈ N and hence i=1 i=1 i=1 i=1 ∞ S Ai ∈ M . Thus M is a σ- algebra. i=1 We next verify that µ (A) is a well defined function. If C ∈ M can be written as C = A1 ∪ B1 = A2 ∪ B2 with Ai ∈ M and Bi ⊂ Ni for some Ni ∈ N , then we want to show that µ (A1) = µ (A2). To see this note that A1 ⊂ A1 ∪ B1 = A2 ∪ B2 ⊂ A2 ∪N2 ∈ M . Thus µ (A1) ≤ µ (A2)+µ (N2) = µ (A2). Similarly, µ (A2) ≤ µ (A1). To show that X, M , µ is complete, we show that if C ∈ M satisfies µ (C) = 0, then D ∈ M for any D ⊂ C. With C = A ∪ B with A ∈ M and B ⊂ N for some N ∈ N , we have µ (A) = µ (C) = 0. This implies A ∈ N . But, D = ∅ ∪ D where ∅ ∈ M and D ⊂ A ∪ B ∈ N . Thus D ∈ M .
Example 5.3.2
Returning to Example 5.3.1, to complete X we add all subsets of X to the σ- algebra.
i i
i i “measureTheory_v2” 2019/5/1 i i page 78 i 78 Chapter 5. Construction of a General Measure Structure i
Thus, M = PX.
Remark 5.3.1
Unfortunately, unlike the other parts of measure structure, completion does not interact nicely with maps between measurable spaces. For this reason, it is some- times inconvenient to work with the completion of a given measure. An important example of this situation is the Borel measure on Rn and its completion Lebesgue measure (the rigorous formulations of the ideas behind Lebesgue measure in (0, 1] in Chapter 4). Most of the time, completeness is not an issue, but some key re- sults do require a complete measure. The reader should pay attention to whether or not measure being considered at any given point in a textbook is assumed to be complete.
5.4 Outer measures We have developed the basic properties of a measure on a σ- algebra, but we have presented only illustrative examples. In this section, we develop a systematic method for constructing a measure based on specifying the values of the measure on a class of “simple” sets. This is the same idea that drove the intuitive development of measure in Chapter 4, which was based on defining the measure of an interval I = (a, b] to be m m S P µ (I) = b − a and for a finite union of disjoint intervals {Ii}, µ Ii = µ (Ii). i=1 i=1 That is actually sufficient for the probability computations in Chapter 4. But, the discussion in Chapter 4 also hints that even in one dimension we are likely to want to consider sets that are more complicated than just finite collections of disjoint intervals, e.g. recall the Cantor set and the set of non-normal numbers. Working in higher dimensions introduces the possibility of further complications in boundaries and interior structure of sets. The technical difficulties involved in construction of measure, as well as computing its values, arise mainly from these complications. There are various ways to approach construction of measures that appear to be quite different, though they all end up with the same result. If the reader is looking at other books, they will encounter different approaches. The technical challenges involved with measure means that any approach has aspects that are not very intuitive. We use the “outer measure” approach developed by Carathéodory based on Lebesgue’s devel- opment.
Let X be a nonempty set. Recall that even though we are not assuming a measur- able space, PX is a ready made σ- algebra. We first briefly describe an early approach to defining measure due to Peano and Jordan that developed the ancient idea of approximating complex geometric shapes with simple shapes. This approach is intuitive and also reveals some of the technical issues that led to the more complicated development of Lebesgue.
Definition 5.4.1
An Jordan elementary set in Rn is a finite disjoint union of n-dimensional cubes. We define the measure µ(R) of n-dimensional cube R to be its Euclidean volume
i i
i i “measureTheory_v2” 2019/5/1 i i page 79 i 5.4. Outer measures 79 i
tb
1 m
Aδ
Aδ 3 2 1 2 3 m 0 1
Figure 5.1. Computation of the Jordan measure of an isoceles right triangle
m and the measure of a finite disjoint collection of n-dimensional cubes {Ri}i=1 to m m S P n be µ Ri = µ (Ri). A set A ∈ R is Jordan measurable if for any δ > i=1 i=1 δ δ δ 0, there are elementary sets Aδ and A with Aδ ⊂ A ⊂ A and µ A \ Aδ < δ.
This uses the fact that the set difference of two Jordan elementary sets is another Jordan elementary set. The idea of this definition is that if we take a sequence of δ → 0, we obtain a δ δ sequence of elementary sets {Aδ,A } such that the Jordan measures of Aδ and A δ converge to the same number. We can think of µ (Aδ) and µ A as the “inner” and “outer” content of A. We define the common limit to be the Jordan measure of A. This is a good idea and it can be used to improve the Riemann integral in particular.
Example 5.4.1
Consider the isoceles right triangle spanning (0, 0), (1, 0), and (0, 1) shown in Figure 5.1. We create a partition of the unit square consisting of nonoverlapping squares with sides of length h = 1/m for integer m ≥ 1. The measure of Aδ 2 2 2 1 m−1 δ is (m − 1) · h + (m − 2) · h + ··· + 0 · h = 2 m . The measure of A is 2 2 2 1 m+1 δ 2 1 m · h + (m − 1) · h + ··· + 1 · h = 2 m . Thus, µ A \Aδ = mh = m → 0 (illustrated in Figure 5.1 as the diagonal of lighter shaded squares) as m → ∞. The Jordan measure of the triangle is 1/2.
But it has the problem that the collection of Jordan measurable sets is too restric- tive. Example 5.4.2
It is a good exercise to show that the set of rational numbers Q in the unit interval I is not Jordan measurable. Lebesgue’s development aimed at preserving the properties of the Jordan measure, though extended to countable disjoint collections of Jordan measurable sets. But, it al- tered the Jordan approach to set approximation. It defined an “outer measure” µ∗(A) of
i i
i i “measureTheory_v2” 2019/5/1 i i page 80 i 80 Chapter 5. Construction of a General Measure Structure i
any set A ⊂ Rn to be the infimum of the measures of countable covers of A consisting of Jordan elementary sets. Lebesgue’s definition of measurability can be expressed as saying that for any δ > 0, there is a Jordan elementary set Aδ such that µ∗(A4Aδ) < δ. A notion of “inner measure” can also be defined using the outer measure of the com- plement of A. Since no approximation of a set by interior elementary sets is involved, this approach greatly increases the size of the collection of measurable sets. Indeed, the construction of non-measurable sets becomes quite difficult and depends on some set axioms. We present Carathéodory’s generalization of Lebesgue’s approach because it pro- vides a way to construct measures on a wide variety of spaces. This has similarities to the way we originally defined a set of measure zero in terms of countable covers, Def. 4.3.4.
Definition 5.4.2: Outer measure
∗ An outer measure on X is a set function µ : PX → [0, ∞] such that: 1. µ∗ (∅) = 0; 2. For any sets A ⊂ B, µ∗ (A) ≤ µ∗ (B); (Monotonicity) ∞ ∞ ∞ ∗ S P ∗ 3. For any collection {Ai}i=1 of sets in X, µ Ai ≤ µ (Ai). (Sub- i=1 i=1 additivity)
Note that an outer measure is defined on all of PX, which is the largest σ- algebra on X. So, provided we prove that outer measures exist, they place no restrictions on the subsets. But, this is also a disadvantage in the sense that computationally, we prefer to start with a function defined on some smaller collection of sets and build up the values on more complicated sets as opposed to having to specify values for every subset of X.
Example 5.4.3
∗ Set X = {a1, a2, a3} and define µ by
∗ ∗ µ (∅) = 0, µ ({ai}) = 1, i = 1, 2, 3, 3 µ∗({a , a }) = , µ∗({a , a }) = µ∗({a , a }) = 2, 1 2 2 1 3 2 3 5 µ∗({a , a , a }) = . 1 2 3 2 Then, we can systematically check Definition 5.4.2 to see that µ∗ is an outer mea- ∗ 3 ∗ ∗ sure, e.g. µ ({a1, a2}) = 2 ≤ µ ({a1}) + µ ({a2}) = 2, and so on. ∗ ∗ However, if we define µ ({a1, a2, a3}) = 3, then µ is not an outer measure ∗ ∗ ∗ 5 since µ ({a1, a2, a3}) = 3 > µ ({a1, a2}) + µ ({a3}) = 2
Example 5.4.4
i i
i i “measureTheory_v2” 2019/5/1 i i page 81 i 5.4. Outer measures 81 i
Set X = N and define ( e|A|, |A| < ∞, σ(A) = A ⊂ X. ∞, |A| = ∞,
σ is monotone since ex is a monotone increasing function of x ≥ 0. However, if we check subadditivity for A = {1, 2, ··· , m} using the singleton sets, we find m X that σ(A) = em > σ({i}) = me1 for any m ≥ 1, so σ is not an outer i=1 measure. The issue is that σ assigns too much “size” to the collection A compared to the sizes of the individual components. Note that once we find one violation, we do not need to check the other possibilities for partitioning A into subsets.
Example 5.4.5
Instead of the exponential function in Example 5.4.4, we consider a monomial. Set X = N and for fixed p > 0 define ( |A|p, |A| < ∞, σ(A) = A ⊂ X. ∞, |A| = ∞,
For any p, σ : PX → [0, ∞] and σ(∅) = 0. Moreover, σ is monotone for any p since xp is a monotone increasing function of x ≥ 0 for any p > 0. ∞ Thus, we have to check subadditivity. Let {Ai}i=1 be a collection of sets in ∞ S X and set A = Ai. There are two cases to treat, |A| finite and |A| infinite. In i=1 the finite case, we can assume A = {1, 2, ··· , m} without loss of generality, since |A| does not depend on the values of the members of A. If p > 1, we check subadditivity in the finite case using the singleton sets to p Pm p find that σ(A) = m > i=1 σ({i}) = m1 = m for m > 1. Thus, subadditiv- ity fails if p > 1, and σ is not an outer measure. ∞ ∗ S We next consider p ≤ 1. If any of the |Ai| is infinite then µ Ai = i=1 ∞ ∞ P ∗ P ∗ ∞ = µ (Ai). If |A| is infinite, then µ (Ai) = ∞, hence subadditivity i=1 i=1 ∞ P ∗ also holds. Hence, we assume that m = |A| and µ (Ai) are finite. By Theo- i=1 ∞ rem 2.4.2, we may assume that {Ai}i=1 are disjoint and therefore there are a finite number of sets in the collection. Thus, we have to check subadditivity for all possible partitions of A into a finite number of disjoint subsets. To do this, we use the inequality
mp ≤ xp + (m − x)p, 0 ≤ x ≤ m, 0 < p ≤ 1. (5.1)
The inequality is obvious for p = 1, and for p < 1 we define f(x) = xp + (m − x)p − mp and find the extrema of f to prove the result. The inequality implies that subadditivity holds for any partition of A into two disjoint subsets. Using
i i
i i “measureTheory_v2” 2019/5/1 i i page 82 i 82 Chapter 5. Construction of a General Measure Structure i
induction, we can prove it holds for any finite partition, so σ is subadditive and an outer measure if p ≤ 1. The following theorem presents a systematic way to construct an outer measure.
Theorem 5.4.1
Let E be a non-empty family of subsets of X with ∅, X ∈ E such that there is a set function f : E → [0, ∞] satisfying f(∅) = 0. For any subset A ∈ PX, define
( ∞ ∞ ) ∗ ∗ X [ µ (A) = µf (A) = inf f(Ai): {Ai} ⊂ E and A ⊂ Ai . (5.2) i=1 i=1
∗ Then, µf is an outer measure.
Theorem 5.4.1 suggests there are many ways to construct an outer measure correspond- ing to many possible choices of f. But, it is particularly meaningful when the function f is related to our idea about “area”. We illustrate in Figure 5.2.
A A A
2 Figure 5.2. Illustration of computing the infimum for the outer measure for a set in R in the case that the set function f gives the area of a square. We consider the family of squares and compute the infimum by “refining” the squares that cover the given set.
Definition 5.4.3
∗ We say that the outer measure µf in Theorem 5.4.1 is induced by f. We use the subscript f only when it is important to indicate the function f.
Proof. We begin by showing the infimum exists. The collection of countable covers in (5.2) is not empty since X ∈ E and X covers A. We are computing an infimum over positive real numbers that are bounded below by 0, so the infimum exists. Next, µ∗ (∅) = 0 because ∅ ∈ E is contained in a countable collection of empty sets and f(∅) = 0. Next, we verify monotonicity. If A ⊂ B, then µ∗(A) ≤ µ∗(B) since the collection of countable covers over which the infimum is computed for A includes the collection of countable covers of B. ∞ Let {Ai}i=1 be a collection of sets in X. Given > 0, for each i there is a collection ∞ S∞ of sets {Bi,j}j=1 ⊂ E such that Ai ⊂ j=1 Bi,j and ∞ X ∗ −i f(Bi,j) ≤ µ (Ai) + 2 . j=1
i i
i i “measureTheory_v2” 2019/5/1 i i page 83 i 5.4. Outer measures 83 i
S∞ S∞ If A = i=1 Ai, then A ⊂ i,j=1 Bi,j, and ∞ ∞ ∞ ∞ ∞ ∗ X X X X ∗ −i X ∗ µ (A) ≤ f(Bi,j) = f(Bi,j) ≤ µ (Ai) + 2 ≤ µ (Ai) + . i,j=1 i=1 j=1 i=1 i=1 Since is arbitrary, the result follows.
Remark 5.4.1
Analogs of the trick of specifying 2−i for the accuracy of each successive member of a countable cover of Ai is a standard argument in measure theory.
Example 5.4.6
The measure µ defined on the σ- algebra M on X = {a, b, c, d} in Example 5.2.1 is defined on a proper subset of PX. We treat this measure as a set function and compute the corresponding outer measure. Using the definition, we compute,
µ∗({c}) = µ∗({d}) = 1, µ∗({a, c}) = µ∗({a, d}) = µ∗({b, c}) = µ∗({b, d}) = 2, µ∗({a, b, c}) = µ∗({a, b, d}) = 3, µ∗({a, c, d}) = µ∗({b, c, d}) = 2,
and of course µ and µ∗ agree on M .
∗ Note that the outer measure µf induced by a set function f is not the same as f in general. After all, the outer measure is defined on all subsets of X, not just the family E on which f is defined. Example 5.4.7
In Example 5.4.5, σ is not an outer measure when p > 1. If we define an outer measure µ∗ from σ using (5.2), we find that µ∗({1, 2, ··· , m}) = m. Thus, the outer measure obtained from σ for p > 1 reduces to σ with p = 1.
On the other hand, Theorem 5.4.1 implies
Theorem 5.4.2
Any measure on X whose associated σ- algebra is PX is an outer measure on X.
Example 5.4.8
It is a good exercise to show that if X = R and f is defined as f([a, b]) = b − a for ∗ a, b ∈ R and f(∅) = 0, then the corresponding outer measure satisfies µf ([a, b]) = b − a for any a, b ∈ R.
As noted, outer measures have the advantage that they are defined on all of PX. However, a disadvantage of outer measures is that there is no assumption of countable
additivity. The next step is to identify a subset of PX on which an outer measure satisfies additional properties that make it into a measure. There are several ways to do this. We use Carathéodory’s approach.
i i
i i “measureTheory_v2” 2019/5/1 i i page 84 i 84 Chapter 5. Construction of a General Measure Structure i
Definition 5.4.4: Carathéodory’s Condition
Given an outer measure µ∗ on X, a set A is µ∗ - measurable or outer measurable if µ∗ (E) = µ∗ (E ∩ A) + µ∗ (E ∩ Ac) , (5.3)
for all E ⊂ X. We usually say outer measurable instead of µ∗ - measurable if µ∗ is clear from the context. Note that the sets E used to test (5.3) are not restricted to µ∗ - measurable sets. Below, we use E to denote the testing sets in order to make things a little more readable. There is no other significance to the choice of label.
Remark 5.4.2
The language is a bit confusing because it is possible to compute the outer measure of sets that are not outer measurable!
E U Ac E U E A E
Ac A X
Figure 5.3. Left: Illustration of Carathéodory’s condition. By varying E we can check Carathéodory’s condition in different regions. Right: Zooming in on a portion of A where the boundary is complicated.
Any set A together with its complement Ac forms a “decomposition” of X in the sense that if E ⊂ X then E is equal to the disjoint union (E ∩ A) ∪ (E ∩ Ac), see Figure 5.3. Therefore, A is µ∗ - measurable if µ∗ is additive with respect to all disjoint unions formed using A. We can interpret this assumption as follows. On one hand, µ∗(E ∩ A) is an outer measure of the part of E that lies “inside” A. On the other hand, µ∗(E) − µ∗(E ∩ Ac) can be interpreted as a way to compute the outer measure of that part of E that does not lie “outside” A, so this is an indirect way to compute the outer measure of the part of E “inside” A. So, (5.3) is the requirement that these ways to compute the outer measure of part of the interior of A should match. By varying E, we can check this condition on different parts of A.
Example 5.4.9
We check Carathéodory’s condition for the outer measure defined in Example 5.4.3. ∗ 3 ∗ We start with {1}. Using E = {1, 2}, we obtain µ ({1, 2}) = 2 6= µ ({1}) + µ∗({2}) = 2, so {1} is not outer measurable. Neither is {2} by symmetry. For
i i
i i “measureTheory_v2” 2019/5/1 i i page 85 i 5.4. Outer measures 85 i
{3}, we find that (5.3) is violated using E = {1, 2, 3}. Similarly, we find that none of the two subsets with two elements are outer measurable. Hence, {∅, X} are the outer measurable sets. We want to build a measure out of an outer measure. First a useful result:
Theorem 5.4.3
Let µ∗ be an outer measure on X. A set A is µ∗ - measurable if and only if µ∗ (E) ≥ µ∗ (E ∩ A) + µ∗ (E ∩ Ac) , (5.4)
for any subset E ⊂ X.
The point is that we only have to check (5.4) not (5.3).
Proof. Since µ∗ is sub-additive, we always have µ∗ (E) ≤ µ∗ (E ∩ A) + µ∗ (E ∩ Ac).
Example 5.4.10
We show that the set function σ defined in Example 5.4.5 is an outer measure for p ≤ 1. We now determine the outer measurable sets by checking (5.4). Choose A ⊂ X. First consider finite E with |E| = `, so (5.4) becomes `p ≥ (` − i)p + ip for 0 ≤ i ≤ `. However, (5.1) implies that this inequality does not hold for p < 1, while it does hold for p = 1. When p = 1, it is straightforward to verify that (5.4) holds for infinite E as well. So, when p < 1, the outer measurable sets are {∅, X} and when p = 1, the outer measurable sets are PX. We have the major result:
Theorem 5.4.4: Carathéodory
Let µ∗ be an outer measure on X. The collection M = {A ⊂ X : A is µ∗- ∗ ∗ measurable} of µ - measurable sets is a σ- algebra and the restriction µ |M of µ∗ to is a measure. Moreover, , , µ∗ is a complete measure space. M X M M
Hang on, this is the first major proof we present.
Proof. M is nonempty. We show that any set A of outer measure 0 is outer measurable so ∅ ∈ M since µ∗ (∅) = 0. Choose E ⊂ X. Since E ⊃ E ∩ Ac, µ∗ (E) ≥ µ∗(E ∩ Ac). Similarly, A ⊃ E ∩ A, so µ∗ (A) ≥ µ∗(E ∩ A). Addition gives µ∗ (E) + µ∗ (A) = µ∗ (E) ≥ µ∗ (E ∩ A) + µ∗ (E ∩ Ac), since µ∗ (A) = 0. M is an algebra. The definition of outer measurability is symmetric with respect to a set and its complement, so M is closed under complements. Let A1,A2 ∈ M . Then for any E1,E2 ⊂ X,
∗ ∗ ∗ c µ (E1) = µ (E1 ∩ A1) + µ (E1 ∩ A1), ∗ ∗ ∗ c µ (E2) = µ (E2 ∩ A2) + µ (E2 ∩ A2).
i i
i i “measureTheory_v2” 2019/5/1 i i page 86 i 86 Chapter 5. Construction of a General Measure Structure i
c For E ⊂ X, set E1 = E and E2 = E ∩ A1 to get
∗ ∗ ∗ c µ (E) = µ (E ∩ A1) + µ (E ∩ A1) ∗ c ∗ c ∗ c c µ (E ∩ A1) = µ (E ∩ A1 ∩ A2) + µ (E ∩ A1 ∩ A2).
or, ∗ ∗ ∗ c ∗ c µ (E) = µ (E ∩ A1) + µ (E ∩ A1 ∩ A2) + µ E ∩ (A1 ∪ A2) .
c Now, E ∩ (A1 ∪ A2) = (E ∩ A1) ∪ (E ∩ A1 ∩ A2), so
∗ ∗ ∗ c µ (E ∩ (A1 ∪ A2)) ≤ µ (E ∩ A1) + µ (E ∩ A1 ∩ A2) .
Hence, ∗ ∗ ∗ c µ (E) ≥ µ (E ∩ (A1 ∪ A2)) + µ (E ∩ (A1 ∪ A2) ),
∗ for any set E in X. Hence, A1 ∪ A2 ∈ M . Induction shows that µ is finitely additive on M . M is a σ- algebra. We show that M is closed under countable disjoint unions, so the ∞ result follows from Theorem 5.1.4. Let {Ai}i=1 be a disjoint collection of sets in M Sm S∞ and set Bm = i=1 Ai and B = i=1 Ai. For E ⊂ X,
∗ ∗ ∗ c ∗ ∗ µ (E∩Bm) = µ (E∩Bm ∩Am)+µ (E∩Bm ∩Am) = µ (E∩Am)+µ (E∩Bm−1).
By induction, m ∗ X ∗ µ (E ∩ Bm) = µ (E ∩ Ai). i=1 Thus,
m ∗ ∗ ∗ c X ∗ ∗ c µ (E) = µ (E ∩ Bm) + µ (E ∩ Bm) = µ (E ∩ Ai) + µ (E ∩ Bm). (5.5) i=1
∞ c ∞ Now {Bm}m=1 is an increasing sequence of sets, so {Bm}m=1 is a decreasing sequence of sets. Taking the limit as m → ∞ in (5.5) (noting that all the terms in the sums are nonnegative!) and using monotonicity, we obtain
∞ ∗ X ∗ ∗ c µ (E) ≥ µ (E ∩ Ai) + µ (E ∩ B ) i=1 ∞ ∗ [ ∗ c ≥ µ (E ∩ Ai) + µ (E ∩ B ) i=1 = µ∗(E ∩ B) + µ∗(E ∩ Bc) ≥ µ∗(E).
Therefore, all the inequalities are equalities, so B ∈ M . Taking E = B yields, ∞ ∞ ∗ [ X ∗ µ Ai = µ (Ai). i=1 i=1
∞ for any disjoint collection {Ai}i=1 in M .
i i
i i “measureTheory_v2” 2019/5/1 i i page 87 i 5.4. Outer measures 87 i
M is complete. Choose A ∈ M with µ∗(A) = 0 and let B ⊂ A. Since, B ⊂ X, µ∗ (B) ≤ µ∗ (A) = µ (A) = 0. As proved above, this implies that B ∈ M and µ(B) = 0.
Before continuing with the construction of measures, we present an important ex- ample of outer measure.
Assume that X is a nonempty metric space with metric d. Recall that the class of open sets (the topology) is centrally important to metric spaces.
Definition 5.4.5: Borel σ-algebra
The σ- algebra generated by the open sets of X is called the Borel σ- algebra and is denoted BX = B. The members of BX are called Borel sets.A Borel measure on a metric space is a measure whose domain consists of the Borel sets.
The Borel sets include countable unions and intersections of open and closed sets, and countable unions and intersections of those sets, and so on.
In this section, we work in the measurable space (X, B).
Definition 5.4.6
In reference to the Borel σ- algebra of a metric space X, a countable intersec- tion of open sets is called a Gδ−set, a countable union of closed sets is called a Fσ−set, a countable intersection of Gδ−sets is called a Gδδ−set, a countable union of Gδ−sets is called a Gδσ−set, a countable intersection of Fσ−sets is called a Fσδ−set, and so on. Note that taking countable unions, intersections and complements starting with open (or closed) sets does not generate all the members of B. The point is that these set operations can be repeated in an unlimited fashion, not simply a countable number of times. The existence of a metric makes it possible to “separate” sets.
Definition 5.4.7
Let A, B ⊂ X. The distance between A and B is defined, d(A, B) = inf{d(a, b): a ∈ A, b ∈ B}.
This is well defined since d is nonnegative. In the next definition, we add a condition to the definition of an outer measure.
Definition 5.4.8: Metric outer measure
Let µ∗ be an outer measure on X. We say that µ∗ is a metric outer measure if µ∗(A ∪ B) = µ∗(A) + µ∗(B), (5.6)
i i
i i “measureTheory_v2” 2019/5/1 i i page 88 i 88 Chapter 5. Construction of a General Measure Structure i
Gc G
A 1 1
A2 1/ A 2 3 A 1 A /3 4 1/ X 4
Figure 5.4. Illustration of computing a metric outer measure from the “inside”.
for all A, B ⊂ X with d(A, B) > 0.
Note that if d(A, B) > 0, then A ∩ B = ∅. But more than that, the two sets are “well separated”. This separation should mean that we can compute the outer measure of the union by summing the outer measures of each set. Before stating the main result, we prove a theorem by Carathéodory that discusses the approximation of outer measure from “within”.
Theorem 5.4.5
Let µ∗ be a metric outer measure on X, let G ⊂ X be open, and assume A ⊂ G. c ∗ For each i ≥ 1, set Ai = {x ∈ A : d(x, G ) ≥ 1/i}. Then, limi→∞ µ (Ai) = µ∗(A).
See Figure 5.4.
∞ Proof. Since {Ai}i=1 is an increasing sequence and Ai ⊂ A for all i, we just have to ∗ ∗ show that lim µ (Ai) ≥ µ (A). Each point of A is an interior point of G, therefore S∞ since A ⊃ i=1 Ai, each point of A must belong to Ai for i sufficiently large. Thus, S∞ S∞ A ⊂ i=1 Ai or A = i=1 Ai. Set Bi = Ai+1 \ Ai for i ≥ 1. For each m,
∞ ! ∞ ! ∞ ! [ [ [ A = A2m ∪ Bi = A2m ∪ B2i ∪ B2i+1 . i=2m i=m i=m
i i
i i “measureTheory_v2” 2019/5/1 i i page 89 i 5.5. Hausdorff measure 89 i
Therefore, ∞ ∞ ∗ ∗ X ∗ X ∗ µ (A) ≤ µ (A2m) + µ (B2i) + µ (B2i+1). (5.7) i=m i=m ∗ If both the series in (5.7) converge, then letting m → ∞ and noting that lim µ (A2m) = ∗ ∗ ∗ lim µ (Am), shows µ (A) ≤ lim µ (Am). Otherwise, at least one of the series di- verges. Without loss of generality, assume the first series in (5.7) diverges. Since 1 1 d(B2i,B2i+2) ≥ 2i+1 − 2i+2 > 0,
m−1 ! m−1 ∗ ∗ [ X ∗ µ (A2m) ≥ µ B2i = µ (B2i) → ∞ as m → ∞. i=1 i=1 ∗ ∗ So, lim µ (Am) = ∞ ≥ µ (A).
The next result characterizes the µ∗-measurable sets to be precisely the sets we would hope to be outer measurable.
Theorem 5.4.6
µ∗ is a metric outer measure on X if and only if every Borel set is µ∗-measurable.
Proof. First, we assume µ∗ is a metric outer measure and show that every closed set F is µ∗-measurable. Let E ⊂ X. E\F is contained in the open set F c, so there is a sequence ∞ ∗ ∗ {Ai}i=1 of subsets of E \ F such that d(Ai,F ) ≥ 1/i and lim µ (Ai) = µ (E \ F ). Therefore,
∗ ∗ ∗ ∗ ∗ ∗ µ (E) ≥ µ ((E ∩ F ) ∪ Ai) = µ (E ∩ F ) + µ (Ai) → µ (E ∩ F ) + µ (E \ F ).
Now we assume that every Borel set is µ∗-measurable. Let A, B ⊂ X satisfy d(A, B) > 0. Choose an open set G ⊃ A such that G ∩ B = ∅. By assumption, G is µ∗-measurable, so µ∗(A ∪ B) = µ∗((A ∪ B) ∩ G) + µ∗((A ∪ B) \ G) = µ∗(A) + µ∗(B).
Theorem 5.4.4 implies
Theorem 5.4.7
∗ Let µ be a metric outer measure on X. Then, (X, BX, µ) is a measure space, where µ = µ∗| . BX
5.5 Hausdorff measure This result suggests that abstract measure theory is applicable to a wide range of in- teresting examples, e.g. any metric space. That is, provided there are any interesting metric outer measures in a general metric space! We next show there is a default metric outer measure in a general metric space that is interesting. We being by extending the notion the basic geometric concept of diameter of a sphere:
i i
i i “measureTheory_v2” 2019/5/1 i i page 90 i 90 Chapter 5. Construction of a General Measure Structure i
Definition 5.5.1
The diameter of a set A ⊂ X is d(A) = sup{d(x, y): x, y ∈ A}. A set A ⊂ X is bounded if d(A) < ∞.
Example 5.5.1
This coincides with the usual definition of diameter for a ball in Rn.
Anticipating that the end result is indeed an outer measure, we define
( ∞ ∞ ) ∗ X p [ µH,p,δ(A) = inf d(Ai) : A ⊂ Ai, d(Ai) ≤ δ all i , i=1 i=1
for any A ⊂ X, with the convention that inf ∅ = ∞. It is an exercise to show that the covering sets can be restricted to be either closed or open. ∗ As δ decreases, µH,p,δ(A) increases since the inf is computed over a smaller col- lection of covers. Thus, we define
Definition 5.5.2: Hausdorff outer measure
∗ ∗ The p dimensional Hausdorff outer measure is defined µH,p(A) = lim µH,p,δ(A), δ→0 A ⊂ X.
The role and usefulness of the parameter p is not clear at this point. We note that if A is a “n-dimensional” set in a “n-dimensional space”, then d(A) has units of a single dimension, so we raise the diameter to the power p = n to get the right dimensions for 3 π 3 volume in n-dimensions. For example, the volume of a ball in R of radius ρ is 6 ρ . In general, including p provides a way to define the measure of a surface bounding a volume, which is “lower dimensional” than the volume, and we explore that later. The definition of Hausdorff outer measure of a set as the limit of measures of covers with decreasing diameters provides a way to deal with potential geometric complexities.
Example 5.5.2
Consider X = C([0, 1]) equipped with the usual sup = max metric. Suppose A1 is the set of continuous functions with values between .1 and .2 and A2 is the set of continuous functions with values between .6 and .7. Define A = A1 ∪A2. Then d(A) = .6 > d(A1) + d(A2) = .1 + .1 + .2.
The main result is,
Theorem 5.5.1
∗ µH,p is a metric outer measure on X.
∗ ∗ Proof. Theorem 5.4.1 implies that µH,p,δ is an outer measure, thus µH,p is an outer measure.
i i
i i “measureTheory_v2” 2019/5/1 i i page 91 i 5.5. Hausdorff measure 91 i
∞ Choose A, B ⊂ X with d(A, B) > 0. Choose a cover {Ai}i=1 of A ∪ B with d(Ai) < δ for all i. Then, Ai can have a nonempty intersection with A or B, but not both. We divide the sum , X p X p X p d(Ai) = d(Ai) + d(Ai) .
i Ai∩A6=∅ Ai∩B6=∅
This gives ∗ ∗ X p µH,p,δ(A) + µH,p,δ(B) ≤ d(Ai) i or ∗ ∗ ∗ µH,p,δ(A) + µH,p,δ(B) ≤ µH,p,δ(A ∪ B). We let δ → 0 to get the result.
∗ Theorem 5.4.7 implies that µH,p is a measure when restricted to BX.
Theorem 5.5.2
∗ ( , , µH,p) is a measure space, where µH,p = µ | . X BX H,p BX
Definition 5.5.3: Hausdorff measure
µH,p is the p dimensional Hausdorff measure.
It is an exercise to adapt earlier proofs to show,
Theorem 5.5.3
The Hausdorff measure is regular.
Determining the Hausdorff measure can be a complicated problem because sets can be very complicated.
Example 5.5.3: Sierpinski Carpet
We construct the Sierpinski Carpet using an iterative process. We begin with an equilateral triangle, see Figure 5.5. At step 1, we divide the initial triangle into four equilateral triangles and remove the middle third. At every subsequent step, we divide each equilateral in the current figure into four equilateral triangles, and remove the middle third. 3 i The area of the set at step i is 4 × the area of the initial triangle while the 3 i length of the perimeter of the set is 2 × the perimeter of the initial triangle. Thus, the perimeter of the sets tend to ∞ while their area tends to 0!
∗ The properties of a set is related to the parameter p in µH,p. In fact, there is a critical value of p for any set A ∈ BX.
i i
i i “measureTheory_v2” 2019/5/1 i i page 92 i 92 Chapter 5. Construction of a General Measure Structure i
Figure 5.5. First 6 sets in the construction of the Sierpinski Carpet.
Theorem 5.5.4
Let A ∈ BX. If µH,p(A) < ∞, then µH,q(A) = 0 for all q > p. Vice versa, if µH,p(A) > 0, then µH,q(A) = ∞ for all q < p.
Proof. For the first result, since µH,p(A) < ∞, for any δ > 0, there is a cover {Ai} ⊂ S P p BX with A ⊂ i Ai, d(Ai) < δ for all i, and i d(Ai) ≤ µH,p(A) + 1. For q > p, X q q−p X p q−p d(Ai) ≤ δ d(Ai) ≤ δ (µH,p(A) + 1). i i q−p So, µH,q(A) ≤ δ (µH,p(A) + 1). We let δ → 0. The second result follows immediately from the first result.
This theorem motivates Definition 5.5.4: Hausdorff dimension
Let A ∈ BX. The Hausdorff dimension of A ∈ BX is the common value of
dimH (A) = inf{p ≥ 0 : µH,p(A) = 0} = sup{p ≥ 0 : µH,p(A) = ∞}.
Determining the Hausdorff dimension of a set is generally a complicated compu- tation that involves establishing both lower and upper bounds, see [Fal03, Fol99]. We give a few examples using heuristic arguments.
Example 5.5.4
We give a plausible computation of the Hausdorff measure of a line segment of length l in the square. We use m circles of diameter l/m as shown in Figure 5.6 (d), and compute the Hausdorff measure m1−p as shown. If p < 1, this converges to ∞ as m → ∞. If p > 1, this converges to 0 as m → ∞. Hence, we conclude the Hausdorff dimension is 1.
Example 5.5.5
Next, we provide a plausible estimate of the Hausdorff measure of a unit square 2 in R . In Figure 5.6, we show three covers√ of the square. The cover (a) yields an estimate of the Hausdorff measure as ( 2)p. Because the square is a regular figure with several symmetries, we can consider covers that have symmetries as shown in (b) and (c). Given an integer m > 0, we use circles of diameter 1/m to cover the square in the pattern shown (b). In general, there are 2m(m + 1) circles in the
i i
i i “measureTheory_v2” 2019/5/1 i i page 93 i 5.5. Hausdorff measure 93 i
cover, so the measure of the cover is 2m(m + 1)/mp. If we choose p < 2, this converges to ∞ as m → ∞. If we choose p > 2, this converges to 0. Hence, the Hausdorff dimension is 2. Lastly, to convey the fact that covering a set is complicated business, we show a few members of a nonoverlapping cover in (c), but without the computation!
(a) (b) (c) (d)
Figure 5.6. Computation of the Hausdorff measure. (a), (b), (c) are covers of a square. (d) is a cover of a line segment in a square.
There are different approaches that exploit properties of the set such as self-similarity and symmetry. For example, we derive a scaling property of Hausdorff measure that holds in the metric space Rn.
Theorem 5.5.5
If A ⊂ Rn is a Borel set and α > 0, define αA = {αx : x ∈ A}. Then, p µH,p(αA) = α µH,p(A).
∞ Proof. Let {Ai}i=1 be a cover of A using sets of diameter no more than δ. Then, ∞ ∗ p ∗ {αAi}i=1 is a cover of αA. This gives, µH,p,δ(αA) ≤ α µH,p,δ(A) for any δ > p 0. Letting δ → 0, gives µH,p(αA) ≤ α µH,p(A). We get the reverse inequality by applying the same argument with the substitutions α → 1/α and A → αA.
Theorem 5.5.5 is useful for computing Hausdorff dimensions.
Example 5.5.6
We provide a plausible computation of the Hausdorff dimension of the Cantor set (Definition 4.3.6). C is the disjoint union C = CL ∩ CR of a “left-hand” part 1 2 CL = C ∩ [0, 3 ] and “right-hand” part CR = C ∩ [ 3 , 1]. CL and CR are similar to C, except for being scaled by a factor of 1/3 and a shift. By Theorem 5.5.5,
1p 1p µ (C) = µ (C ) + µ (C ) ≤ µ (C) + µ (C). H,p H,p L H,p R 3 H,p 3 H,p