CHAPTER 1
Measure Theory and Integration
One fundamental result of measure theory which we will encounter in this Chapter is that not every set can be measured. More specifically, if we wish to define the measure, or size of subsets of the real line in such a way that the measure of an interval is the length of the interval then, either we find that this measure does not extend all subsets of the real line or some desirable property of the measure must be sacrificed. An example of a desirable property that might need to be sacrificed is this: the (extended) measure of the union of two disjoint subsets might not be the sum of the measures. The Banach-Tarski paradox dramatically illustrates the difficulty of measuring general sets; it is stated in Section ???????? without proof. It is with this in mind that our discussion of measure and integration begins, in this Section, by making precise what constitutes a measurable set and a measurable function and then an integrable function.
1. Algebras of Sets Let Ω be a set and (Ω) denote the power set of Ω which is the set of all subsets of Ω. TheP reader is referred to Chapter 0, Subsection 1 if any notations or set theoretic notions are unfamiliar. Notation: N0 is the set of nonnegative integers; Q is the set of rational numbers; Z is the set of all integers. Suppose Ω and Ω are two sets and f : Ω Ω is a function with 1 2 1 → 2 domain Ω1 and range in Ω2 (but f need not be onto). For any B Ω2, denote by ⊆ 1 f − (B)= x Ω : f(x) B . { ∈ 1 ∈ } the inverse image of the set B under f. Of course f need not be 1 one-to-one (injective) and so f − need not exist as a function. Some of the elementary properties of the inverse image are worthy of note: If B,B′ Ω then ⊆ 2 1 1 1 f − (B B′) = f − (B) f − (B′) 1 ∩ 1 ∩ 1 f − (B B′) = f − (B) f − (B′) 1∪ c 1 c∪ f − (B ) = f − (B) 25 26 1.MEASURETHEORYANDINTEGRATION
The verification is asked for in the Exercises at the end of the Section. Definition: If A Ω then define the characteristic function χ on ⊆ A Ω by χA(x)=1if x A and χA(x)=0if x Ω A. ∈ 1 ∈ − Exercise: Determine χA− (B) for an arbitrary B R. There are 4 cases, if = A = Ω. ⊆ ∅ Exercise: (De Morgan’s Laws) If Aα : α is a collection of subsets of a set Ω then the operation {of complementation∈ I} has the properties. C
( α Aα) = α (Aα) C ∪ ∈I ∩ ∈I C ( α Aα) = α (Aα) C ∩ ∈I ∪ ∈I C If = N is countable then we can write n NAn = n NBn where I c c ∪ c∈ ∪ c∈ B1 = A1, B2 = A2 A1, ... Bn = An A1 A2 ... An 1. The Bn are then disjoint. ∩ ∩ ∩ ∩ ∩ − Definition: Let An, n N be a countable collection of sets in Ω. The limit supremum of A ∈ is { n} lim sup An = n N k n Ak n ∩ ∈ ∪ ≥ whereas the limit infimum of A is { n} lim inf An = n N k n Ak. n ∪ ∈ ∩ ≥
Intuitively lim supn An is all points that are in infinitely many of the An whereas liminfn An is all those points that are in all except possibly finitely many of the An. If A is a decreasing sequence of sets, A A A ... then n 1 ⊇ 2 ⊇ 3 ⊇ lim sup An = n NAn = liminf An n ∩ ∈ n Definition: A collection of subsets of Ω is an algebra (field) if F a. Ω . b. If A∈ F then Ac . ∈ F ∈ F c. If Aα : α and if is finite then α . { ∈I}⊆F I ∪ ∈I ∈ F Condition c in the definition could equivalently be replaced by either of the following Conditions.
c′. If A, B then A B . ∈ F ∪ ∈ F c′′. If A, B then A B . ∈ F ∩ ∈ F because Condition c′ is equivalent to c; and Condition c′′ in conjunction with Condition b is equivalent to Conditions c′ and b, as can be seen by observing that (A B)= A B. Definition: AC collection∩ Cof subsets∪C of Ω is a σ-algebra (or σ-field) if is an algebra and F F 1. ALGEBRAS OF SETS 27
d. If A : α and if is countable, or finite, then { α ∈I}⊆F I α Aα . ∪ ∈I ∈ F Of course condition d implies condition c in the definition of an algebra. Conditions b and d together are equivalent to b and d′ where
d′. If A : α and if is countable, or finite, then { α ∈I}⊆F I α Aα . ∩ ∈I ∈ F Example 1.1. (i) The power set (Ω) of a set Ω is a σ- algebra. P (ii) , Ω is a σ-algebra. (iii) {∅, A,} Ac, Ω is a σ-algebra, if A is any subset of Ω. {∅ } Example (i) is the largest σ-algebra of subsets of Ω; Example (ii) is the smallest and Example (iii) is the smallest σ-algebra that contains the set A. Example 1.2. Consider the set of all real intervals of the form [a,b) along with all intervals of the form [a, ) and ( ,b) where a and b are real numbers. Define to be the∞ set of all−∞ finite unions of disjoint intervals of this form alongF with the null set and R itself. We claim that is an algebra. Certainly . If A then we must check that AcFis in and that is obvious∅ if ∈A F= or A∈= FR and so F ∅ we may suppose A = 1 j n[aj,bj) where a1 Proposition 1.4. Let α : α be a non empty collection of σ-algebras (resp. algebras) on{F a given∈ set I}Ω indexed by a set . Then I the intersection α α is also a σ-algebra (resp. algebra). ∩ ∈I F Proof. We need only check that α α satisfies the three prop- erties of σ-algebras The set Ω is certainly∩ ∈I inF all σ-algebras and hence c in α α. If A α α then A is in every α. Finally if (An)n N ∩ ∈I F ∈ ∩ ∈I F F ∈ is a sequence of sets in α α then n NAn is in every α and this completes the proof in the∩ case∈I F of σ-algebras∪ ∈ and the proofF for algebras is analogous. Suppose now that (Ω) is any collection of subsets of a set Ω. Then there is a smallestS ⊆Pσ-algebra which contains which we shall denote this σ( ). To see this observe that the set of allSσ-algebras that contain is nonS empty because it contains (Ω) and so the intersection of all σ-algebrasS that contain is a σ-algebra,P by the Proposition and it certainly contains and soS it must be the minimal such σ-algebra. Definition: If S is a σ-algebra then a subset with the property that = σF( ) is said to be a system of generatorsS ⊆ F of . Consider nowF the caseS that Ω = Rn, where n 1. Similar consider-F n ≥ ations apply if Ω is any topological space. On R x =(x1,...,xn), we 2 1/2 ∋ define the norm x =( 1 j n xj ) and the corresponding (usual) topology where| a| base for≤ the≤ | neighborhoods| of a point x Rn is n ∈ Vr(x) : r > 0 where Vr(x) = y R : x y < r is an open ball{ of radius r}centered at x. Then{ ∈ a set U |is said− | to be} open in Rn, if, for every x U there is r > 0 so that Vr(x) U. The set of all open sets is denoted∈ and we define the Borel⊆subsets of Rn to be = σ( ), that is theO smallest σ-algebra that contains . We shall sometimesB O write (Rn) for when wishing to emphasizeO that it is the Borel σ-algebra onB Rn. InB general, if Ω is a topological space and is the set of all open subsets of Ω then the Borel σ-algebra on ΩO is σ( ). ORemark: If denotes the set of all closed subsets of Rn then σ( )= . For recallC that a set is closed if and only if its complement is open.C SinceB contains the open sets and is closed under the operation of taking complements,B it follows that and hence σ( ). Conversely the open sets are contained inB⊇Cσ( ): σ( ).B Therefore ⊇ C = σ( ) σ( ) and so = σ( ). ThereforeC O ⊆ isC a system of BgeneratorsO of⊆ byC the definitionB ofC and is aO second system of generators. B B C Example: The set of all intervals of the form = ( ,a): a R is a system of generatorsS of the Borel σ-algebraS of{ R−∞: = σ( ∈) Indeed,} since consists solely of open sets it is obviousB that B σ( S). S B ⊇ S 1. ALGEBRAS OF SETS 29 It therefore suffices to show that any open set is contained in σ( ). Certainly if a We shall have occasion to use the Monotone Class Lemma when verifying uniqueness of measures. Exercises: (1) Show that, if B,B′ Ω then ⊆ 2 1 1 1 f − (B B′) = f − (B) f − (B′) 1 ∩ 1 ∩ 1 f − (B B′) = f − (B) f − (B′) 1∪ c 1 c∪ f − (B ) = f − (B) (2) Show that, in general, liminfn An lim supn An (3) Recall the definition of the limit supremum⊆ and limit infimum of a sequence of real numbers an. lim sup an = inf sup ak : k n : n 1 and n N { { ≥ } ≥ } ∈ lim inf an = sup inf ak : k n : n 1 n N ∈ { { ≥ } ≥ } where the “value” (resp. ) is possible if the sequence is unbounded above∞ (resp. below).−∞ (See page two of Ash’s book Measure, Integration, and Functional Analysis.) How is limsup χAn related to χB where B = limsup An? What happens if we replace limsup χAn by liminf χAn ? (4) Suppose that A = x R3 : x (0, 0, ( 1)n/n) < 1 . Find n { ∈ | − − | } lim supn An and liminfn An. Compare this with Exercise 3, page 3 of Ash’s book. (5) Show that of Example 1.2 is not a σ-algebra. F (6) Suppose that E Ω is an arbitrary subset of a set Ω and is a collection of subsets⊆ of Ω. Define E = E A : A S . Show that ∩S { ∩ ∈ S} a. If 0 is an algebra of subsets of Ω then E 0 is also an algebraF of subsets of E. ∩ F b. σ(E ) = E σ( ). That is the smallest σ-algebra in (E)∩S containing∩ ES is the intersection of E with the smallestP σ-algebra in∩S(Ω) containing . c. Conclude further that,P if is a σ-algebraS of subsets of Ω, then E is a σ-algebraF of subsets of E. (Reference: Halmos’s∩ F Measure Theory) 32 1.MEASURETHEORYANDINTEGRATION 2. Measurability It is an objective of this Chapter to define the integral of a function. If f is a nonnegative function defined on an interval in the real line then the integral should be able to tell us the area under the graph of f just as the Riemann integral does in calculus. It will tell us much more but even in this limited context we shall discover that f must be restricted: the notion of area under the graph of f will not make sense for every f and we will be forced to restrict the class of functions considered. We introduce in this Section the concept of a “measurable” function. We shall see that f must be measurable if we are to make sense of the notion of area under the graph. Of course this difficulty arises already in the case of Riemann integration: the Riemann integral is not defined for arbitrary functions f. We shall clarify this remark and discuss the Riemann integral and its relation to the “Lebesgue integral,” introduced here, at the end of this Chapter. Definition: Let Ω be a set and be a σ-algebra of subsets of Ω. The we shall refer to (Ω, ) as a measurableF space. The sets in will be referred to as measurableF sets. F Definition: Suppose (Ω , ) and (Ω , ) are two measurable 1 F1 2 F2 spaces. Then a mapping f : Ω1 Ω2 is measurable, with respect 1 → to 1 and 2 if f − (B) 1 whenever B 2. We shall sometimes sayFf is measurableF function∈ F from (Ω , )∈ to F (Ω , ) to clarify the 1 F1 2 F2 choice of σ-algebras 1 and 2. Unless otherwise specified a function n F F f : Ω1 R is said to be measurable or Borel measurable if f is mea- surable→ from (Ω , ) to (Rn, ). 1 F1 B An elementary example of a measurable function is given by the characteristic function χA of a set A . Then χA is measurable as a mapping from (Ω, )to(R, ) if and only if A , that is if and only if A is a measurableF set. FurtherB examples of measurable∈ F functions will be apparent once some elementary properties have been established. Lemma 2.1. Suppose that (Ω , ), (Ω , ) and (Ω , ) are 1 F1 2 F2 3 F3 three measurable spaces and that f : Ω1 Ω2 and g : Ω2 Ω3 are two mappings. If f and g are both measurable→ then the composed→ function g f is measurable from (Ω , ) to (Ω , ). ◦ 1 F1 3 F3 Proof: We must show that, for an arbitrary set C 3, (g 1 ∈ F ◦ f)− (C) . It suffices to show that ∈ F1 1 1 1 (2.1) (g f)− (C)= f − (g− (C)) ◦ 2.MEASURABILITY 33 1 1 1 because g− (C) 2 because g is measurable and so f − (g− (C)) 1 because f is measurable.∈ F The verification of (2.1) is a straightforwar∈ Fd exercise in checking the equality of sets. 2 Proposition 2.2. Suppose that f : Ω1 Ω2 is a function and and are σ-algebras on Ω and Ω respectively.→ Suppose further F1 F2 1 2 that is a system of generators of 2, which is to say 2 = σ( ). Then S F F 1 S f is measurable from (Ω1, 1) to (Ω2, 2) if and only if f − (B) 1 for every B . F F ∈ F ∈S 1 Proof: It is obvious that, if f is measurable then f − (B) 1 for 1 ∈ F every B simply because 2. Conversely, suppose f − (B) 1 ∈S S ⊆ F 1 ∈ F for every B . Let = B : f − (B) so that . We ∈S T { ∈ F2 ∈ F1} T ⊇S shall show that is a σ-algebra which implies that = 2 and so, by the definition ofT , f is measurable. This will completeT theF proof. We check thereforeT that is a σ-algebra. Certainly , because 1 T ∅∈T1 f − ( ) = 1. We check next that if B then f − (B) 1 ∅ ∅1 ∈c F 1 c ∈ T ∈ F and so f − (B ) = f − (B) 1 because 1 is closed under taking 1 ∈c F 1 c F complements. (Recall f − (B ) = f − (B) by an Exercise.) We check 1 finally that if (Bn)n N is a sequence of sets in so that f − (Bn) for each n N then∈ T ∈ T ∈ 1 1 f − ( n NBn)= n Nf − (Bn) 1 ∪ ∈ ∪ ∈ ∈ F because 1 is a σ-algebra. This shows that is closed under countable unions andF is therefore a σ-algebra and theT proof is complete. 2 Remark: This set in the proof constitute “good” sets and the argument that there areT many good sets is an instance of what Ash calls the good sets principle in his text page 5. Remark: The above proof uses a special case of the following observation: If f is a mapping f : Ω1 Ω2 and if (Bα)α is a collection of subsets of Ω indexed by a set then→ ∈I 2 I 1 1 f − ( α Bα) = α f − (Bα) ∈I ∈I 1 ∪ ∪ 1 f − ( α Bα) = α f − (Bα). ∩ ∈I ∩ ∈I On the other hand if (Aα)α is a collection of subsets of Ω1 then ∈I f( α Aα)= α f(Aα) ∪ ∈I ∪ ∈I but it may happen that f( α Aα) = α f(Aα). ∩ ∈I ∩ ∈I Corollary 2.3. A continuous function f : Rm Rn is Borel measurable. → Of course the understood σ-algebras here are the Borel σ-algebra (Rm) and (Rn). B B 34 1.MEASURETHEORYANDINTEGRATION Proof: Since the (Rn) is generated by the open sets, it suffices 1 B m to show that f − (U) (R ) for an arbitrary open set U. The result will then follow from∈B the Proposition. However f is continuous means 1 1 m that f − (U) is open whenever U is and so f − (U) (R ). 2 ∈B The same reasoning applies in a more general setting. Corollary 2.4. If f is a continuous function from one topolog- ical space (X, Σ) to another (Y, ) then f is measurable from (X, (X)) to (Y, (Y )) where (X) is theT Borel σ-algebra which is generatedB by the openB sets Σ of XB and similarly (Y ) is generated by the open sets . B T Proposition 2.5. Suppose that (Ω, ) is a measurable space F and f1, f2, ... fm are m Borel measurable real valued functions, fj : (Ω, ) (R, ), for 1 j m. Suppose that g : (Rm, (Rm)) (R, F(R→)) is measurable.B ≤ Then≤ B → B φ(x)= g(f1(x),f2(x),...,fm(x)) defines a measurable function φ : (Ω, ) (R, (R)). F → B We shall set aside the proof of the Proposition until later and con- sider its consequences. Corollary 2.6. If f : (Ω, ) (R, ), j = 1, 2 are Borel j F → B measurable functions, then f1 + f2 is also Borel measurable. Proof. This is an application of the Proposition with g : R2 R → defined by g(x1,x2) = x1 + x2. Because g is continuous it is Borel measurable. Corollary 2.7. If f : (Ω, ) (R, ), j = 1, 2 are Borel j F → B measurable functions, then the product f1f2 is also Borel measurable. In particular, if k is a real constant then kf2 is Borel measurable. Proof. In this case g(x,y) = xy. The special case follows by defining f1(x)= k which makes f1 measurable because it is continuous. Corollary 2.8. If f : (Ω, ) (R, ), is a Borel measurable function then f is also Borel measurable.F → B | | Proof. In this case g(x)= x . | | Corollary 2.9. If fj : (Ω, ) (R, ), j = 1, 2 are Borel measurable functions, and if f (x) =F 0→for allBx Ω then the ratio 2 ∈ f1/f2 is also Borel measurable. 2.MEASURABILITY 35 Proof. In this case we define x/y if y =0 g(x,y)= 0 if y =0 so that g is defined on all of R2 but of course it is not continuous. We shall check however that g is measurable, as a mapping from 2 2 1 2 (R , (R )) to (R, ). It suffices to show that g− ( ,a) (R ) B B 1 −∞ ∈ B for any real a by Proposition 2.2. If a< 0 then g− ( ,a)= (x,y) R2 : x/y < a is easily seen to be open in R2 and−∞ hence in {(R2).∈ If } 1 2 B a 0 then g− ( ,a) = (x,y) R : x/y < a (x,y) : y = 0 which≥ is the union−∞ of an open{ and a∈ closed set and is} therefore ∪ { in (R2).} This proves that g is measurable and so the above Proposition appliesB which completes the proof. Exercise: Show that if f : (Ω, ) (R, ), j = 1, 2 are Borel j F → B measurable functions then max f1,f2 and min f1,f2 are also mea- surable. Remark: max a,b ={ ( a b} + a + b)/{2 for any} reals a and b. { } | − | It remains to establish the Proposition. Proof of the Proposition: It suffices to show that the map- ping, f say, defined by f(x)=(f1(x),f2(x),...fm(x)) is measurable as a mapping from (Ω, ) to (Rm, (Rm) because the composition of measurable functionsF is measurable.B To verify f is measurable, m it suffices to show that, for an arbitrary a = (a1,a2,...,am) R , 1 ∈ f − ( x1 < a1,x2 < a2,...,xm < am ) by Proposition 2.2. How- ever { } ∈ F 1 1 f − ( x1 f(x)= λjχAj (x) 1 j m ≤ ≤ The set of all simple functions is denotes (Ω, ) or when the context is clear. S F S Observe that a simple function is measurable because it is the sum of measurable functions. Also a simple function takes on only finitely many values, λj, 1 j m and possibly 0. Conversely a measurable function that takes≤ on finitely≤ many values is simple. To see this, sim- 1 ply define, for each λ in the image of f, A = f − ( λ ). The set j j { j} S 36 1.MEASURETHEORYANDINTEGRATION is closed under addition, scalar multiplication and multiplication and therefore forms a linear algebra over R of functions. (Recall that a lin- ear algebra is a vector space with a multiplication operation (f,g) fg that is associative and distributes over addition from the left and → right and, for any scalar α, α(fg)=(αf)g = f(αg). Reference: Naimark’s Normed Algebras 7.) It is further worth noting that the representation § f(x)= 1 j m λjχAj (x) in the definition of f is not unique. ≤ ≤ ∈S Proposition 2.11. Let fn, n N be a sequence of Borel mea- surable real valued functions defined on∈ a measurable space (Ω, ). Sup- pose that F lim fn(x)= f(x) exists in R for every x Ω. n N ∈ ∈ Then the function f : (Ω, ) R, so defined, is measurable. F → This result says, briefly, that the pointwise limit of measurable func- tions is measurable. Proof: We will show that 1 1 (2.2) f − (( ,a]) = p N n N j nfj− (( ,a +1/p]) −∞ ∩ ∈ ∪ ∈ ∩ ≥ −∞ for every a R The set of all sets of the form ( ,a] generate the Borel σ-algebra∈ because any open set is generated.−∞ Therefore, we will have shown fB is measurable by Proposition 2.2. 1 To verify (2.2), let x f − (( ,a]) so that f(x) a. Then, ∈ −∞ ≤ for any p N there is n so that fj(x) a + 1/p for all j n: 1 ∈ ≤ 1 ≥ x fj− (( ,a +1/p]) for all j n or x j nfj− (( ,a +1/p]). ∈ −∞ ≥ ∈ ∩ ≥ −∞ But p was arbitrary and so x belongs to the right side of (2.2) and this 1 shows that f − (( ,a]) is a subset or equal to the right side of (2.2). Conversely suppose−∞ that x belongs to the right hand side of (2.2). Then, for every p N, there exists n N so that fj(x) a +1/p for all j n. Taking∈ limits in this last expression∈ as j ≤ we see that f(x) ≥ a +1/p. Since p is arbitrary f(x) a and this→ shows ∞ the right ≤ 1 ≤ side of (2.2) is in f − (( ,a]) which verifies (2.2). This proves that f is measurable. −∞ 2. The Extended Reals R: We introduce the extended real line R = [ , ], sometimes referred to as the two point compactification of −∞ ∞ the real line, and this is just R with two points adjoined: R = R . This will be a convenience when±∞ discussing convergence.∪ {∞}∪{−∞} We introduce the following topology on R. The neighborhoods of (resp. ) are those sets which contain an interval of the form (a, ∞] for some−∞a R (resp. [ ,a)) and the neighborhoods of x R are the∞ usual: those∈ sets that contain∞ an interval of the form y : ∈y x <δ { | − | } 2.MEASURABILITY 37 for some δ > 0. Of course the topology that R inherits as a subset of R is its usual topology. As a consequence of these definitions we see that R is homeomorphic to the compact interval [ 1, 1] with the (usual) topology it inherits as a subset of R. Indeed − x if x R √x2+1 ∈ (2.3) Φ(x)= 1 if x = 1 if x = ∞ − −∞ is continuous from R to [ 1, 1] with inverse − x if 1