© COPYRIGHT
by
Amanda Purcell
2013
ALL RIGHTS RESERVED
ULTRAPRODUCTS AND THEIR APPLICATIONS
BY
Amanda Purcell
ABSTRACT
An ultraproduct is a mathematical construction used primarily in abstract algebra and model theory to create a new structure by reducing a product of a family of existing structures using a class of objects referred to as filters. This thesis provides a rigorous construction of ultraproducts and investigates some of their applications in the fields of mathematical logic, nonstandard analysis, and complex analysis. An introduction to basic set theory is included and used as a foundation for the ultraproduct construction. It is shown how to use this method on a family of models of first order logic to construct a new model of first order logic, with which one can produce a proof of the Compactness Theorem that is both elegant and robust. Next, an ultraproduct is used to offer a bridge between intuition and the formalization of nonstandard analysis by providing concrete infinite and infinitesimal elements. Finally, a proof of the Ax-
Grothendieck Theorem is provided in which the ultraproduct and other previous results play a critical role. Rather than examining one in depth application, this text features ultraproducts as tools to solve problems across various disciplines.
ii
ACKNOWLEDGMENTS
I would like to give very special thanks to my advisor, Professor Ali Enayat, whose expertise, understanding, and patience, were invaluable to my pursuit of a degree in mathematics. His vast knowledge and passion for teaching inspired me throughout my collegiate and graduate career and will continue to do so.
iii
TABLE OF CONTENTS
ABSTRACT ...... ii
ACKNOWLEDGMENTS ...... iii
Chapter
1. INTRODUCTION ...... 1
2. FILTERS, ULTRAFILTERS, AND REDUCED PRODUCTS ...... 3
3. ULTRAPRODUCTS AND COMPACTNESS ...... 16
4. ULTRAPRODUCTS AND NONSTANDARD UNIVERSES ...... 31
5. THE AX-GROTHENDIECK THEOREM ...... 47
BIBLIOGRAPHY ...... 59
iv CHAPTER 1
INTRODUCTION
The ultraproduct construction is a method used primarily in model theory and abstract algebra that creates a new structure by reducing a Cartesian product of existing structures. Its value stems from the ability for one to determine properties of the ultraproduct based solely on the properties of the structures within the reduced product and the equivalence relation by which it is reduced. The ultraproduct is attractive in its algebraic nature and the fact that it can be constructed using only basic set theoretic concepts. This thesis begins (Chapter 2) with an outline of the set theory required for the construction of ultraproducts beginning with filters (specific sets of sets). It includes important observations on the existence and uses of certain filters before providing the step-by-step construction of reduced products and describing their elements. This chapter provides the foundation for all applications of ultraproducts discussed ahead. To be able to define the notion of an ultraproduct, and to be able to understand its properties, it is necessary to provide a brief introduction to model theory, which comes in the beginning of Chapter 3. Chapter 3 then defines an ultrapoduct of models and the elements, functions, and relations on this new object. Next, this chapter instructs how to interpret sentences of first order logic in the ultraproduct model by proving the ever-important and applicable Łos’s´ Theorem which will recur throughout the text. Ultimately, Chapter 3 provides a proof of the Compactness Theorem, one of the most fundamental results in first order logic. Chapter 4 then changes gears to provide an examination of basic nonstandard analysis using the tools and elements of ultraproducts developed in earlier sections. The ultraproduct construction allows us to build ordered "nonarchimedean" fields, i.e., fields 1 2 that contain, as concrete objects, infinitely small quantities as well as infinitely large ones. Moreover, these nonarchimedean fields are shown to satisfy the same first order sentences as those that are true in the ordered field of real numbers. In chapter 4 we employ the aforementioned nonarchimedean fields to provide a new way of establishing a number of classical results in Calculus in an intuitively clear manner, with the explicit, unabashed use of infinitesimals. The final chapter (5) offers a proof of the Ax-Grothendieck Theorem, alternative to the proof from Walter Rudin [1] utilizing techniques of complex analysis. This final part highlights the rather metamathematical method of proof used a great deal in disciplines of advanced mathematics: the method in which one takes a problem from some specific area, maps the relevant pieces to a different universe in which the problem can be solved, and maps back to complete the proof. In the end, it should be clear to the reader that the ultraproduct model is not only interesting in and of itself, but is a very powerful tool that can provide answers, or methods to obtain them, in many other disciplines outside of model theory. 3
CHAPTER 2
FILTERS, ULTRAFILTERS, AND REDUCED PRODUCTS
Set Theory and Construction of Filters
We begin with a few basic set theoretic concepts that will be useful in our construction of filters, and ultimately in the construction of ultraproducts.
• The powerset of a set X, denoted P(X), is the set of all possible subsets of X. For example, for X = {A,B}, the powerset of X would be written P(X) = {/0,{A},{B},{A,B}}, where /0denotes the “empty set,” the unique set containing no elements. • A is a subset of a set X, or X contains A (written A ⊆ X), if every element of A is also an element of X. • The union of two sets, A and B, denoted A ∪ B, is the collection of elements that are in A, in B, or in both A and B. • The intersection of two sets A and B, denoted A ∩B, is the collection of elements that are in both A and B. • The complement of a set A in relation to a background set X where A ⊆ X, denoted X\A, is the set of all elements of X that are not in the subset A.
For purposes that arise in a later section, it is also important to understand the concept of cardinality. The cardinality of a set X, denoted |X|, is the number of elements in the set; numbers representing the cardinality of sets are called “cardinal numbers.” For example, the cardinality of the natural numbers (N) is ℵ0, the first infinite cardinal number. The ℵ ℵ cardinality of the real numbers (R) is 2 0 , where 2 0 > ℵ0 by a classical theorem of Cantor. 4
Armed with these basic set theoretic concepts, we are now equipped to discuss the first class of objects important to our construction of the ultraproduct: filters.
Definition 1. Let X be a nonempty set. A filter F over X is a nonempty family of sets F ⊆ P(X) such that:
(i) if A, B ∈ F , then A ∩ B ∈ F , and (ii) if A ∈ F and A ⊆ B ⊆ X, then B ∈ F .
In other words, F is a set of subsets of X that is closed under intersections and supersets. For example, F = P(X) is a filter over X; this is called the “improper filter.” The set containing only X itself, {X}, also forms a filter, called the “trivial filter.” All other filters are referred to as “proper nontrivial filters.”
Proposition 2. A filter F is proper if and only if /0 ∈/ F .
Proof. We prove the ⇒ direction by way of contradiction. Suppose that F is a proper filter over X and that /0 ∈ F . As all subsets A ⊆ X contain /0,
(/0 ⊆ A ⊆ X) ⇒ (A ∈ F ), for all A ⊆ X, and every possible subset of X is a member of F by upward closure. Therefore F = P(X), and F must be improper, contradicting our assumption. Hence, for any proper filter F over a set X, we have /0 ∈/ F .
The proof of the ⇐ direction is trivial.
Also note that for all filters F over X, the background set X will be a member of the filter F . Given that F is nonempty by definition, let A ∈ F for some A ⊆ X. By upward closure, we have
(A ⊆ X) ⇒ (X ∈ F ), 5 and X ∈ F for all filters F over X. Some examples of proper filters include:
• The principal filter: Fix some element i0 ∈ X, and consider the family of subsets
{A ⊆ X |i0 ∈ A}. This defines a filter over X, and is referred to as the principal
filter generated by i0. Any filter that is not principal is referred to as free. • The Fréchet filter: Let X be an infinite set, and let
F = {A ⊆ X |X\A is finite},
i.e., the set of all co-finite subsets of X. This filter is called the Fréchet filter, a very important filter that will be utilized later.
Before we can discuss more interesting properties about filters, we must first define the following property:
Definition 3. A set F of subsets of X is said to have the finite intersection property, or f.i.p., if the intersection of any finite number of those subsets is nonempty, i.e.,
A1 ∩ A2 ∩ ... ∩ An 6= /0 for Ai ∈ F , 1 ≤ i ≤ n, and n ∈ N.
We claim that any proper filter has f.i.p., and for any set of subsets that has f.i.p., there exists a proper filter containing that set. Before we prove our claim, consider our examples of proper filters above. The principal filter generated by i0 clearly has f.i.p. as i0 will be a member of every set in the filter, and thus a member of any finite intersection. The Fréchet filter also clearly has f.i.p as any finite intersection will be the complement of the union of a finite number of finite sets (which will also be finite) in relation to an infinite background set. 6
Claim 4. Let F be a filter over X. Then the following statements are equivalent: (i) F is a proper filter. (ii) /0 ∈/ F . (iii) F has f.i.p.
Proof. To prove that the above statements are equivalent, it is sufficient to show that (i) implies (ii), (ii) implies (iii), and that (iii) implies (i). (i)→(ii). By Proposition 2.
(ii)→(iii). Suppose, by way of contradiction, that /0 ∈/ F and that F does not have f.i.p. Then there exists some finite number of elements, A1,..,An of F such that
A1 ∩ ... ∩ An = /0. By property of filters, this intersection must also be a member of F , contradicting our assumption that /0 ∈/ F . (iii)→(i). Suppose, by way of contradiction, that F has f.i.p and that F is improper. As F is improper, given any element A ⊂ X (where A 6= X), we have both A ∈ F and X\A ∈ F , and therefore by property of filters, A ∩ (X\A) = /0 ∈ F , contradicting our assumption.
Lemma 5. Suppose G is a set of subsets of the background set X such that G has f.i.p. Then G can be extended to a proper filter that contains G .
Proof. Let G be a set of subsets that has f.i.p. and define:
F = {S ⊆ X |S ⊇ A1 ∩ ... ∩ An, whereAi ∈ G for1 ≤ i ≤ n} for some n ∈ N, the collection of all sets S that are the supersets of some finite intersection of sets in G . We claim this set will be a proper filter containing G . By definition of a proper filter, we must show that (i) for A, B ∈ F , we have A ∩ B ∈ F , (ii) for A ∈ F and A ⊆ B ⊆ X, then B ∈ F , and (iii) that F is proper. 7
(i) Let A,B ∈ F . Then by definition, A and B both contain finite intersections of elements from G . Thus, the intersection of A and B will also contain an intersection of finitely many elements from G , and A ∩ B ∈ F . (ii) Let A ∈ F and A ⊆ B ⊆ X. Since A contains a finite intersection of elements from G and B contains A, B must also contain this same finite intersection and therefore B ∈ F . (iii) Suppose F were improper and thus contained /0(by Proposition 2). Then G would also contain /0since S ⊇ /0for S = /0. This contradicts the fact that G has f.i.p., however, since any intersection with the empty set is empty.
Note that for all g ∈ G , we have g ∈ F since for S = g we have S ⊇ g.
Therefore F is a proper filter such that G ⊆ F .
Ultrafilters
Let F be a filter over N. We say that a function f has some property P in reference to F if and only if {n ∈ N| f (n)has propertyP} ∈ F . That is, the set of indices on which the function has the property is a member of the filter. Consider the function 1 n is even f (n) = , −1 n is odd and consider the property P of the values of a function being always greater than or equal to zero (or, conversely, always less than zero). We would like for there to be a clear choice whether function f has property P or not P in reference to F (and not both, and not neither). Unfortunately, in reference to the Fréchet filter, neither is true in this case as f oscillates. Therefore, we need to obtain a finer filter to make this choice. To do so, we introduce another stipulation to our definition of a filter to create the ultrafilter. This new 8 stipulation provides the tool to decide which set of the “evens” or “odds” would be in the
filter on N above to determine if either property P or not P held.
Definition 6. A filter F is called an ultrafilter if, in addition to (i) and (ii) of Definition 1,
(iii) For any subset A ⊆ X, either A ∈ F or (X\A) ∈ F , and not both.
Proposition 7. A filter F over a set X is an ultrafilter if and only if it is maximal, i.e., the only proper filter containing F is F itself.
Proof. (⇒) Let F be an ultrafilter; then for any set A ⊆ X, we have either A ∈ F or (X\A) ∈ F . Now suppose G is a proper filter over X that contains F , G ⊇ F . Let B be a subset of X such that B ∈ G and B ∈/ F . Then, since F is an ultrafilter, we have X\B ∈ F . As F is contained in G,
(X\B ∈ F ) ⇒ (X\B ∈ G ), and
(B ∈ G and (X\B) ∈ G ) ⇒ (B ∩ (X\B) = /0 ∈ G ), contradicting the fact that G is proper. Therefore, G cannot strictly contain F , and we must have F = G . Hence F is maximal.
(⇐) Now let F be a maximal filter over X. It suffices to show that for any subset A ⊆ X, A ∈/ F implies X\A ∈ F . Let A be a subset of X such that A ∈/ F . As F is a proper filter, F has f.i.p. by Claim 4. The set F ∪ {X\A} maintains f.i.p. since A ∈/ F . This is because an empty intersection could only be obtained with {X\A} and A or a subset of A. Clearly A ∈/ F , 9 and for any subset B ⊂ A, if B ∈ F , then by property of filters, A must also be a member of F . By Lemma 5, there exists a proper filter Ff that contains F ∪ {X\A}, which obviously contains F . Thus by the maximality of F , we must have:
F ⊆ F ∪ {X\A} ⊆ Ff⊆ F , which implies that F = F ∪ {X\A}, and {X\A} ∈ F .
A principal ultrafilter is an ultrafilter that is principal (as introduced after
Proposition 2). If F is an ultrafilter over X, and {i0} ∈ F for some i0 ∈ X, then F is the principal ultrafilter generated by {i0}. The following statements about principal ultrafilters are equivalent:
• A subset A of X is a member of the principal ultrafilter F ;
• A ∩ {i0} ∈ F ;
• i0 ∈ A.
Additionally, if F is a principal ultrafilter, then F must be generated by a single element. Otherwise, if F were generated by some subset B (where B is more than one element of X, i.e., F would be the set of all subsets of X containing set B) such that /0 6= B ⊂ A ⊆ X, then the filter generated by B would be a proper filter containing F , contradicting the maximality of F .
Proposition 8. Let F be an ultrafilter. If F is not principal (i.e., F is free), then F contains the Fréchet filter.
Proof. By the ultrafilter “decision” property (Definition 6), it suffices to show that
(S ⊆ X and S is finite) ⇒ (S ∈/ F ), 10 i.e., if no finite set S can be in ultrafilter F , then all co-finite sets X\S must be. We proceed by induction on the size of S. Let F be a non-principal ultrafilter over
X and define Sn := {i1,...,in} where i j ∈ X for 1 ≤ j ≤ n. We wish to show that Sn ∈/ F .
For the case of n = 1, we clearly have a contradiction as S1 ∈ F would imply that F is principal, as defined above.
Now suppose that for n = k, we have Sk ∈/ F and hence X\Sk ∈ F . We now show that Sk+1 ∈/ F . Observe,
Sk+1 = {i1,...ik,ik+1} = {i1,...,ik} ∪ {ik+1},
where Sk = {i1,...,ik} ∈/ F , and {ik+1} ∈/ F by the n = 1 case. Therefore, we have
X\{i1,...,ik} ∈ F and X\{ik+1} ∈ F . Since F is closed under intersections,
(X\{i1,...,ik}) ∩ (X\{ik+1}) = X\({i1,...,ik} ∪ {ik+1}) ∈ F
⇒ X\Sk+1 ∈ F .
Thus, Sk+1 ∈/ F , and by induction, we have Sn ∈/ F for any n ∈ N. Since F contains no finite set and F is an ultrafilter, it must contain the complement of every finite set. In other words, F contains the Fréchet filter.
The Fréchet filter is clearly not an ultrafilter (for example, by the existence of the infinite/co-infinite sets of the even and odd numbers of N as mentioned at the beginning of this section). However, we have seen that it is possible to have a principal filter generated by i0 that is an ultrafilter. Are there other examples of ultrafilters? The answer to this question is yes; in fact, we will show any proper filter can be extended to an ultrafilter. Before we comment on this and make a few other important 11 observations on ultrafilters, we first need to introduce the Axiom of Choice in the form of Zorn’s Lemma, and, in order to do so, we first define the notion of a poset.
Definition 9. A partially ordered set, or poset, is a pair (X,≤) where X is a nonempty set and ≤ is a binary relation on X that is: (i) reflexive, (i.e., x ≤ x for all x ∈ X); (ii) antisymmetric, (i.e., if x ≤ y and y ≤ x then x = y); (iii) transitive, (i.e., if x ≤ y and y ≤ z then x ≤ z).
Any subset C of X, where (X,≤) is a poset, that can be ordered such that x ≤ y or y ≤ x for all x,y ∈ C is called a chain. An element b ∈ X is an upper bound for C if for all x ∈ C we have x ≤ b. An element m is maximal if for any x ∈ X, m ≤ x implies x = m.
Lemma 10. (Zorn’s Lemma) Let (X,≤) be a partially ordered set. If each chain in X has an upper bound, then X has at least one maximal element.
Zorn’s Lemma is equivalent to the Axiom of Choice, which states that there is a “choice function,” f , on any (infinite) set A of nonempty sets such that f (x) ∈ x for each x ∈ A [2]. The Axiom of Choice and Zorn’s Lemma will be referenced when applied in later proofs. We use Zorn’s Lemma in the proof of the following theorem on the existence of ultrafilters, referred to as the “Ultrafilter Theorem.”
Theorem 11. (Ultrafilter Theorem) If F is a proper filter on X, then there is an ultrafilter E on X such that F ⊆ E .
Proof. Let F be a filter over X and define Ffbe the set of all proper filters that contain F . Ffis clearly nonempty as F ∈ Ff. We claim that (Ff,⊆) forms a poset. (i) Clearly, D ⊆ D for all D ∈ Ffby property of inclusion, and ⊆ is reflexive. 12
(ii) Similarly, for D1,D2 ∈ Ff, if D1 ⊆ D2 and D2 ⊆ D1, then D1 = D2, and ⊆ is antisymmetric.
(iii) Finally, for D1,D2,D3 ∈ Ff, if D1 ⊆ D2 and D2 ⊆ D3, then D1 ⊆ D3, and ⊆ is transitive.
Thus, (Ff,⊆) forms a poset. Now let C be a chain in Ff. In order to be able to apply Zorn’s Lemma, we must show that every chain has an upper bound. Consider that any chain C of Ffis a list of elements, or filters, and consider the union of such elements: SC . We claim that this is an upper bound for C . Since for each D ∈ C , /0 ∈/ D (since each D in the chain is a S S proper filter), we are assured that /0 ∈/ C . Given A ∈ C , we know that A ∈ D1 for some proper filter D1 ∈ C . Thus for any superset B of A, A ⊆ B, we have B ∈ D1 and S S therefore B ∈ C . Now given A,B ∈ C , we know that that A,B ∈ D2 for some proper
filter D2 ∈ C as the chain is partially ordered by inclusion. Therefore A ∩ B ∈ D2 and A ∩ B ∈ SC . Hence SC ∈ Ff. By construction, we have D ⊆ SC for each D ∈ C , and SC is an upper bound for the chain C . Therefore, since every chain has an upper bound, we know by Zorn’s Lemma that
Ffhas at least one maximal element, i.e., a maximal filter that contains F . Since this
filter is maximal, it is an ultrafilter by Proposition 7.
Construction of Reduced Products
Now that we are familiar with the notions of filters and ultrafilters, we may begin our construction of reduced products. We shall see that the ultraproduct is just a specific case of the reduced product. 13
Suppose X is a non-empty set and D is a proper filter over X, and for each i ∈ X,
Ai is a non-empty set. Let C = ∏i∈X Ai be the Cartesian product (in the usual sense) of S these sets. Elements of C are thus functions f : X → i∈X Ai such that f (i) ∈ Ai for each i ∈ X, or strings of elements where the ith element comes from Ai.
Given two functions f , g ∈ C, we say f and g are D-equivalent, written f =D g, if and only if {i ∈ X | f (i) = g(i)} ∈ D.
That is, the set of indices on which f and g agree is in D. Note that for any non-principal ultrafilter D, by Proposition 8, this would amount to the fact that f and g agree on all but a finite number of indices, or “almost everywhere.” We claim that C divides naturally into equivalence classes modulo the filter D.
Proposition 12. The relation =D is an equivalence relation over C.
Proof. To show that =D is an equivalence relation, we must show that reflexivity, symmetry, and transitivity hold.
(i) Reflexivity: Given f ∈ C, we have f =D f if and only if
{i ∈ X | f (i) = f (i)} ∈ D.
However, {i ∈ X | f (i) = f (i)} = X where X is necessarily in D, and therefore =D is reflexive.
(ii) Symmetry: Given f ,g ∈ C, we have f =D g if and only if
{i ∈ X | f (i) = g(i)} ∈ D. 14
However, {i ∈ X | f (i) = g(i)} = {i ∈ X |g(i) = f (i)}, and thus
{i ∈ X |g(i) = f (i)} ∈ D,
which implies g =D f , and =D is symmetric.
(iii) Transitivity: Given f ,g,h ∈ C, f =D g and g =D h if and only if
{i ∈ X | f (i) = g(i)} ∈ D and {i ∈ X |g(i) = h(i)} ∈ D.
Because D is a filter, the intersection of these sets must also be in D, and:
{i ∈ X | f (i) = g(i)} ∩ {i ∈ X |g(i) = h(i)} ∈ D
⇐⇒ {i ∈ X | ( f (i) = g(i)) ∧ (g(i) = h(i))} ∈ D.
Since {i ∈ X | ( f (i) = g(i)) ∧ (g(i) = h(i))} ⊆ {i ∈ X | f (i) = h(i)}, we have:
{i ∈ X | f (i) = g(i)} ∩ {i ∈ X |g(i) = h(i)} ∈ D ⇒ {i ∈ X | f (i) = h(i)} ∈ D, by upward closure of D.
Therefore =D is also transitive, and hence an equivalence relation over C.
As =D is an equivalence relation, it can be used to divide C into equivalence classes, which, for convenience, we will denote:
f D = {g ∈ C |{i ∈ X |g(i) = f (i)} ∈ D}. 15
The set of these equivalence classes is referred to as a reduced product, defined specifically as follows:
Definition 13. The reduced product of Ai modulo D is the set of all equivalence classes D { f | f ∈ ∏i∈X Ai}, denoted by ∏D Ai, where the Ai are nonempty sets. The set X is the index set for ∏D Ai. In the case that D is an ultrafilter over X, the reduced product ∏D Ai is an ultraproduct. Additionally, if all sets Ai = A are the same, ∏D Ai may be written as
∏D A and is referred to as an ultrapower.
Now that we have constructed this interesting object, we examine what is means to be a reduced product of models, but first, we provide a brief overview of elementary model theory to give context for this new structure. 16
CHAPTER 3
ULTRAPRODUCTS AND COMPACTNESS
A Brief Introduction to Model Theory
Before we prove the Compactness Theorem, we must first discuss the model construction that we will utilize in the proof, and, before we discuss the model construction, it is necessary to provide a brief introduction to model theory. Model theory is the branch of mathematical logic that concerns the relation between a formal language (syntax) and its interpretation in different structures (semantics) [3].
Syntax. The first order language L consists of finitary relation symbols: R, functions symbols: F, and constant symbols: c. To formalize L , we must also use the following logical symbols: connectives: ∧ (“and”), ∨ (“or”), ¬ (“not”), → (“if _then_”), ⇐⇒ (“_if and only if_”); quantifiers: ∃ (“there exists”), ∀ (“for all”); variables: x0,xi,...,xn,...; and parentheses: (, ).
In addition to these logical symbols, we also have the binary relation for equality,
≡. Note that none of these symbols are part of the language L . In order to meaningfully refer to objects in our domain of discourse, we need to use strings of logical symbols combined with the symbols of L . These strings are called terms. Terms are defined inductively as follows:
(i) Constants and variables are terms.
(ii) If F is an n-ary function symbol and τ1,...,τn are terms, then F(τ1,...,τn) is a term. 17
(iii) A string of symbols is a term if and only if it can be arrived at by a finite number of applications of (i) and (ii).
To be able to say meaningful things about terms (e.g., how they “relate”), we introduce formulas. The most simple formulas are referred to as atomic and defined as follows:
(i) τ1 ≡ τ2 is an atomic formula, where τ1,τ2 are terms of L .
(ii) If R is an n-ary relation symbol and τ1,...,τn are terms, then R(τ1,...,τn) is an atomic formula.
Formulas of L are defined inductively as follows:
(i) An atomic formula is a formula. (ii) If φ and ψ are formulas, then (φ ∧ ψ), (φ ∨ ψ), (φ → ψ), (φ ⇐⇒ ψ), and (¬φ) are formulas. (iii) If x is a variable and φ is a formula, then ∃x(φ) and ∀x(φ) are formulas. (iv) A sequence of symbols is a formula only if it can be shown to be a formula by a finite number of applications of (i), (ii), and (iii).
Formulas with no free variables (variables specified with quantifiers) are called sentences. Syntax is very important, but without instruction on how to interpret in different structures (semantics), it is quite literally meaningless.
Semantics. Sentences of L as developed above can be true in one structure and not in another. Consider the case of the existence of a greatest element. A sentence of L representing this idea is true in finite structures that can be ordered, but not, for example, in the typical structure of the real numbers. Consider also a first order statement of density, that between any two elements exists another. This sentence is true in Q and R, but not in N and Z. Sentences like these are true or false depending on the structure (on the context) in which they are being considered. The structures in which we consider the truth or 18 falsity of a sentence of L are called models, a class of mathematical structures (e.g. groups, fields, graphs, etc.) in which one can apply the tools of mathematical logic. A sentence is satisfiable if it is possible to find a model whose interpretation of that sentence is true in that model. The following section outlines how we can use the truth interpretation of sentences of first order logic in each model Ai in our ultraproduct to determine what is true in the structure of the ultraproduct itself.
Interpretations and Łos’s´ Theorem
The Compactness Theorem of first order logic states that if you have an infinite set Σ of sentences of L , with the property that each finite subset i ⊂ Σ is satisfied in some model Mi (which is dependent on the finite subset i specified), then there is a single model M in which all of Σ is satisfied. The typical mathematical logic proof of this theorem requires the notion of Soundness, as well as Gödel’s Completeness Theorem: two ideas which, themselves, require much careful consideration. It is a proof by contrapositive, the idea being to use the finiteness of proofs and the relationship between satisfiability and deducibility to show that if the infinite set of sentences is not satisfiable, then it has a finite subset of sentences which can lead to the proof of a contradiction (and therefore, the premise is impossible). The proof of the Compactness Theorem we present utilizing ultraproducts will ultimately be completed in two main steps. First we will prove Łos’s´ Theorem to show how true formulas in each model Ai are interpreted in the ultraproduct. We then use filter properties to show that finite satisfiability also transfers to the ultraproduct from each of the elements in the product. In the end, we not only prove the Compactness Theorem, but 19 we are left with a tangible model that actually fulfills this duty of satisfying the infinite set of sentences. In order to provide some consistency in nomenclature, symbols of the language as interpreted in each model are represented using the same symbol of L with a superscript denoting the model (like our equivalence classes, f D, used superscripts to denote the filter). Multiple symbols are indexed in the subscript.
Let X be a nonempty set, D a proper filter over X, and for each i ∈ X, let Ai be the model for language L over the set Ai. Constants c of the language L are interpreted by
Ai Ai n Ai n c ∈ Ai, n-ary relations R by R ⊆ Ai , and functions F by F : Ai → Ai.
Given hAi : i ∈ Xi, we construct a new model of L . The following is an extension of the previously provided definition of reduced products (Definition 13) and specifies how each object of the language L is interpreted component-wise.
Definition 14. The reduced product B = ∏D Ai is the model for L described as follows:
(i) Set: The background set for B is ∏D Ai, (the ultraproduct as constructed D above, where elements are equivalence classes, f , defined by =D). (ii) Relations: Let R be an n-ary relation symbol of L . The interpretation of R
in B = ∏D Ai is the relation such that
B D D Ai R ( f1 ,..., fn ) ⇐⇒ {i ∈ X |R ( f1(i),..., fn(i))} ∈ D,
D i.e., elements (equivalence classes) fk , 1 ≤ k ≤ n, are related in B if and only if the set of indices of elements of the equivalence classes that are related by
R in the ith model, Ai, is in D. 20
(iii) Functions: Let F be an n-ary function symbol of L . Then F is interpreted in B by the function
D E B D D Ai F ( f1 ,..., fn ) = F ( f1(i),..., fn(i)) : i ∈ X , D
i.e., FB is a function on n elements (equivalence classes) of B whose output is a sequence of elements where the ith member of the sequence is the value
of the function F interpreted in the ith model (as FAi ), modulo D.
(iv) Constants: Let c be a constant symbol of L . Then c is interpreted in B
B B Ai by c ∈ ∏D Ai, where c = c : i ∈ X D, i.e., the equivalence class of a
sequence of constants from the Ai’s where the ith term in the sequence is the
constant interpreted in the corresponding Ai.
The following proposition shows that ultraproducts over principal ultrafilters are somewhat less interesting in that they mirror pre-existing components of the product, motivating our examination of products over solely non-principal ultrafilters in the future (so we can form objects that yield new information and do not simply extrapolate information from one of their respective members).
Proposition 15. Suppose D is a principal ultrafilter over X generated by i0. Then ∼ ∏D Ai = Ai0 .
Proof. Given that D is a principal ultrafilter generated by i0, we know {i0} ∈ D. Elements of ∏D Ai can then be grouped into equivalence classes based on their agreement evaluated D only on the i0th element. Let F : ∏D Ai → Ai0 be given by F( f ) = f (i0). Note that F is D well-defined since every element in each equivalence class f agrees on the value of i0. D D Given f ,g ∈ ∏D Ai, we have