Hyperreal MAT2000 ––Project in

Arne Tobias Malkenes Ødegaard Supervisor: Nikolai Bjørnestøl Hansen Abstract This project deals with doing calculus not by using epsilons and deltas, but by using a system called the hyperreal . The hyperreal numbers is an extension of the normal real numbers with both infinitely small and infinitely added. We will first show how this system can be created, and then show some basic properties of the hyperreal numbers. Then we will show how one can treat the topics of convergence, continuity, limits and differentiation in this system and we will show that the two approaches give rise to the same definitions and results.

Contents

1 Construction of the hyperreal numbers 3 1.1 Intuitive construction ...... 3 1.2 Ultrafilters ...... 3 1.3 Formal construction ...... 4 1.4 Infinitely small and large numbers ...... 5 1.5 Enlarging sets ...... 5 1.6 Extending functions ...... 6

2 The 6 2.1 Stating the transfer principle ...... 6 2.2 Using the transfer principle ...... 7

3 Properties of the hyperreals 8 3.1 Terminology and notation ...... 8 3.2 Arithmetic of hyperreals ...... 9 3.3 Halos ...... 9 3.4 Shadows ...... 10

4 Convergence 11 4.1 Convergence in hyperreal calculus ...... 11 4.2 Monotone convergence ...... 12

5 Continuity 13 5.1 Continuity in hyperreal calculus ...... 13 5.2 Examples ...... 14 5.3 Theorems about continuity ...... 15 5.4 ...... 16

6 Limits and 17 6.1 Limits in hyperreal calculus ...... 17 6.2 Differentiation in hyperreal calculus ...... 18 6.3 Examples ...... 18 6.4 Increments ...... 19 6.5 Theorems about derivatives ...... 19

1 1 Construction of the hyperreal numbers 1.1 Intuitive construction We want to construct the hyperreal numbers as of real numbers hrni = hr1, r2,...i, and the idea is to let sequences where limn→∞ rn = 0 represent infinitely small numbers, or infinitesimals, and let sequences where limn→∞ rn = ∞ represent infinitely large numbers. However, if we simply let each be defined as a of real numbers, and let and be defined as elementwise addition and multiplication of sequences, we have the problem that this structure is not a field, since

h1, 0, 1, 0,...i h0, 1, 0, 1,...i = h0, 0, 0, 0,...i.

The way we solve this is by introducing an equivalence on the of real-valued sequences. We want to identify two sequences if the set of indices for which the sequences agree is a large of N, for a certain technical meaning of large. Let us first discuss some properties we should expect this concept of largeness to have.

• N itself must be large, since a sequence must be equivalent with itself. • If a set contains a large set, it should be large itself. • The ∅ should not be large.

• We want our relation to be transitive, so if the sequences r and s agree on a large set, and s and t agree on a large set, we want r and t to agree on a large set.

1.2 Ultrafilters Our model of a large set is a called an ultrafilter.

Definition 1.1 (Ultrafilters). We define an ultrafilter on N, F, to be a set of of N such that: • If X ∈ F and X ⊆ Y ⊆ N, then Y ∈ F. That is, F is closed under supersets. • If X ∈ F and Y ∈ F, then X ∩ Y ∈ F. F is closed under intersections.

• N ∈ F, but ∅ 6∈ F. • For any subset A of N, F contains exactly one of A and N \ A.

We say that an ultrafilter is free if it contains no finite subsets of N. Note that a free ultrafilter will contain all cofinite subsets of N (sets with finite ) due to the last property of an ultrafilter.

Theorem 1.2. There exists a free ultrafilter on N.

Proof. See [Kei76, p. 49]. 

2 1.3 Formal construction

Let F be a fixed free ultrafilter on N. We define a relation ≡ on the set of real-valued sequences RN by letting

hrni ≡ hsni ⇐⇒ {n ∈ N | rn = sn} ∈ F.

Proposition 1.3 (Equivalence). The relation ≡ is an equivalence relation on RN. Proof. We check all needed properties of an equivalence relation.

Reflexivity Since the set {n ∈ N | rn = rn} = N, and N ∈ F, ≡ is reflexive.

Symmetry The sets {n ∈ N | rn = sn} and {n ∈ N | sn = rn} are the same, so if one belongs to F, so does the other.

Transitivity Assume that hrni ≡ hsni and hsni ≡ htni. Then both {n ∈ N | rn = sn} ∈ F and {n ∈ N | sn = tn} ∈ F. Since {n ∈ N | rn = sn} ∩ {n ∈ N | sn = tn} ⊆ {n ∈ N | rn = tn}, and F is closed under intersections and supersets, {n ∈ N | rn = tn} ∈ F, and so hrni ≡ htni, as desired.  Since ≡ is an equivalence relation, we can define the set of hyperreal numbers ∗R as the set of real-valued sequences modulo the equivalence relation ≡. In symbols, ∗ R = {[r] | r ∈ RN} = RN/ ≡ . We define addition and multiplication of elements in ∗R by doing elementwise addition and multiplication in the related sequences, more formally as

[r] + [s] = [hrni] + [hsni] = [hrn + sni]

[r] · [s] = [hrni] · [hsni] = [hrn · sni] .

We define the ordering relation < by letting

[r] < [s] ⇐⇒ {n ∈ N | rn < sn} ∈ F.

At this , let us introduce some notation to make our arguments easier to read. For two sequences hrni and hsni, we denote the agreement set {n ∈ N | rn = sn} by r = s . We can apply the same notation to other relations, so for J K example we have r < s = {n ∈ N | rn < sn}. J K Proposition 1.4. The operations + and · are well-defined, and so is the relation <.

0 Proof. We first show that + is well-defined. If we have that hrni ≡ hrni and 0 0 0 hsni ≡ hsni, then r = r ∈ F and s = s ∈ F, which means that r = r0 ∩ s = s0 ∈ F.J What weK now needJ to showK is that r + s = r0 + s0 J∈ F. K J K 0 0 J 0 0 K If, for some k ∈ N, both rk = rk and sk = sk, then rk + sk = rk + sk, hence if k ∈ r = r0 ∩ s = s0 , then k ∈ r + s = r0 + s0 , which shows that r = r0 J ∩ s =Ks0 J⊆ r +Ks = r0 + s0 .J Since r = r0 ∩K s = s0 ∈ F, so is Jr + s =K r0 J+ s0 . SoK if Jr ≡ r0 and s ≡ sK0, r + s ≡J r0 + s0K, whichJ showsK that the operationJ is well-defined.K Showing that · is well-defined is similar.

3 We will now show that < is well-defined, which means that we need to show 0 0 0 0 that if hrni ≡ hrni and hsni ≡ hsni, then if r < s ∈ F, then r < s ∈ F. Firstly, assume that r = r0 ∈ F and that s =J s0 ∈K F. Then, weJ need toK prove that if r < s ∈ F thenJ rK0 < s0 ∈ F. SoJ let usK assume that r < s ∈ F, and then proveJ thatK r0 < s0 J∈ F. K J K By our assumptions,J K we have that r = r0 ∩ s = s0 ∩ r < s ∈ F. If 0 0 J K 0 J K0 J K k ∈ r = r ∩ s = s ∩ r < s , then rk = rk, sk = sk and rk < sk, and J 0 K J0 K J0 0 K 0 0 0 0 therefore rk < sk, so k ∈ r < s . So, r = r ∩ s = s ∩ r < s ⊆ r < s , and since F is closed underJ supersets,K J we concludeK J thatK rJ0 < s0 K∈ FJ , whichK J K shows that < is well-defined. 

1.4 Infinitely small and large numbers One of the main reasons for constructing the hyperreals is that we want to have access to infinitely large and infinitely small numbers, and now we can prove their existence.

Theorem 1.5. There exists a number ε ∈ ∗R such that 0 < ε < r for any positive real number r, and there exists a number ω ∈ ∗R such that ω > r for any r.

Proof. First, we need to talk about real numbers in ∗R. The way to do this is that given a real number r ∈ R, we can identify this with a hyperreal number ∗r ∈ ∗R as ∗r = hr, r, . . .i. We will generally omit the ∗-decoration, and simply refer to this number as r.  1   1  Now, let us turn to the actual proof. Let ε = h1, 2 ,...i = h n i . For any 1 positive real number r, the set {n ∈ N | n > r} must be finite, and therefore 1 {n ∈ N | n < r} is cofinite, and hence belongs to our free ultrafilter F. Therefore, 1 we can conclude that ε < r. Also, since {n ∈ N | 0 < n } = N ∈ F, it must be the case that 0 < ε. So the number ε is a hyperreal number which is greater than 0, but smaller than any positive real number. Let ω = [h1, 2,...i] = [hni]. For any real number r, the set {n ∈ N | r ≥ n} is finite, and hence {n ∈ N | r < n} is cofinite, and belongs to F, which means that ω > r. This proves that ω is a hyperreal number greater than any real number. 

1.5 Enlarging sets

For a given subset A of R we can define an “enlarged” subset ∗A of ∗R by saying that a hyperreal number r is an element in ∗A if and only if the set of n such that rn is an element in A is large. Formally this can be defined as

∗ [r] ∈ A ⇐⇒ {n ∈ N | rn ∈ A} ∈ F. Again, we need to check that this is well-defined. Using the ... notation, J K let r ∈ A = {n ∈ N | rn ∈ A}. We have that J K r = r0 ∩ r ∈ A ⊆ r0 ∈ A , J K J K J K so if r ≡ r0 and r ∈ A ∈ F, then r0 ∈ A ∈ F, which shows that enlargements are well-defined.J K J K

4 An example of this is if A = N and ω = h1, 2, 3,...i. Then ω ∈ N = N ∈ F, so ω ∈ ∗N. We will refer to the set ∗N as the hypernaturals. Similarly,J ifKA = (0, 1) and r = h0.9, 0.99, 0.999,...i. Then r ∈ N = N ∈ F, so r ∈ ∗(0, 1). J K 1.6 Extending functions

An important tool in non-standard analysis is to take a f : R → R and extend it to a function ∗f : ∗R → ∗R. This is done by applying the function to each element in the sequence representing the given hyperreal number. We define the extension as follows:

∗ f([hr1, r2,...i]) = [hf(r1), f(r2),...i] . Again, we need to prove that this is well-defined. First, let f ◦ r denote 0 0 0 hf(r1), f(r2),...i. In general, r = r ⊆ f ◦ r = f ◦ r , and so if r ≡ r , then ∗f(r) = f ◦ r ≡ f ◦ r0 = ∗f(r0).J HenceK theJ function is well-defined.K A function f : A → R defined on some subset A of R can also be extended to a function ∗f : ∗A → ∗R, but not in exactly the same way as above. Since r can be in ∗A without all elements of r being in A, there can be indices i for which f(ri) is not defined. In order to get around this, we let f(ri) = 0 whenever ri 6∈ A. More formally, let ( f(rn) if rn ∈ A sn = 0 otherwise

and define ∗ f([hrni]) = [hsni] . Since we have that ∗f(r) = f(r) whenever r ∈ A, ∗f extends f. Therefore we will often simply drop the ∗-decoration, and simply refer to the extended function as f as well. An important subject related to this construction is sequences. A sequence hs1, s2,...i is simply a function s: N → R, and so by this construction can be ∗ ∗ extended to a hypersequence s: N → R, which means that the sn is defined even when n ∈ ∗N \ N.

2 The transfer principle 2.1 Stating the transfer principle One of the most important tools of non-standard analysis is the transfer principle, a way to show that a certain of statement is true when talking about the real numbers if and only if a certain related statement is true when talking about the hyperreal numbers. 1 First, we introduce the set of sentences which the transfer principle applies to. This set is basically the set of all sentences (formulas with no free variables) in a language of first-order which consists of a constant for each real number, a function symbol for each real function, and a relation symbol for each

1This is a rather cursory introduction to the tranfer principle. For a more in-depth explanation, see [Gol98, pp. 35-47].

5 relation on the reals. However, instead of using the quantifiers (∀x) and (∃y), our sentences use quantifiers of the form (∀x ∈ A) and (∃y ∈ B) where A and B are subsets of R. Some examples of such sentences are (∀n ∈ N)(∃m ∈ N)(m > n), (∃x ∈ R)(∀y ∈ R)(x + y = y) which state respectively that there is no biggest and there is an for the reals. Let us call such a sentence an L-sentence. Now, we define the ∗-transform of an L-sentence. We take a sentence ϕ, and create a related sentence ∗ϕ. An L-sentence ϕ contains symbols P , f, and r for relations, functions, and constants on R. To create ∗ϕ, we replace P by ∗P for all relations P , replace f by ∗f for all functions f, and replace r by ∗r for all constants r. Some examples of this are:

• The ∗-transform of the sentence (∀n ∈ N)(∃m ∈ N)(m > n) is (∀n ∈ ∗N)(∃m ∈ ∗N)(m ∗> n).

• The ∗-transform of (∀x ∈ R)(sin(x) < 2) is (∀x ∈ ∗R)(∗sin(x) ∗< ∗2). We will generally follow the conventions that we omit the ∗ for constants, most functions, and simple equalities and inequalities. With these conventions, the above sentences become (∀n ∈ ∗N)(∃m ∈ ∗N)(m > n) and (∀x ∈ ∗R)(sin(x) < 2). Now we state the transfer principle, which we will take as true without proof.

Theorem 2.1 (Transfer principle). An L-sentence ϕ is true if and only if its ∗-transform ∗ϕ is true. Some remarks are in order. It is worth pointing out that one can go in both directions, that is one can go from R to ∗R, and from ∗R to R. If one decides to go in this last direction, it is important that the statement is the ∗-transform of an L-sentence, so for example it can contain no hyperreal constants. A way to get around this is by replacing the constant with a x, and adding the quantifier (∃x ∈ ∗A) for some A ⊆ R in front, which is a technique we will use. In many cases, we will not explicitly write down the full sentence, but rather state things like “since s < n for all natural n, by transfer it also also true for any hypernatural n”.

2.2 Using the transfer principle

Theorem 2.2. The structure h∗R, +, ·,