<<

RESEARCH STATEMENT

ALEX KRUCKMAN

Introduction. My research lies at the intersection of , combinatorics, and . I am interested in the relationships between properties of random finite structures and the of their infinitary limits. What do I mean by “properties” and “infinitary limits”? It turns out that there are a number of useful ways of associating limit objects to classes of finite structures, which I will describe below, and I am also interested in the relationships between these limits. When I say “properties”, I mean something more precise: those properties which are expressible by sentences of first-order logic. For those unfamiliar with logical terminology, formulas of first-order logic are built up from a vocabulary of basic symbols. My vocabulary will always consist of at most countably many relation symbols (e.g. the edge relation of a graph). These basic relations, applied to variables, can be combined by Boolean operations (¬, ∧, ∨) and quantification (∀x, ∃x) to form formulas. A sentence is a formula in which all the variables have been quantified. A structure A is a set, given with interpretations of the symbols in the vocabulary (e.g. a graph). The notation A |= ϕ, read “A is a model of ϕ”, means that the sentence ϕ is true of the structure A. A (complete) theory is a maximally consistent set of sentences. Let me begin with two examples, illustrating the motivating phenomena. It is well-known that the class G of finite graphs has a first-order zero-one law [GKLT69,Fag76]. That is, letting G(n) be the set of graphs with domain [n] = {1, . . . , n} and µn the uniform measure on G(n), we have that for every sentence ϕ of first-order logic, lim µn({G ∈ G(n) | G |= ϕ}) = 0 or 1. n→∞ The set of all sentences ϕ for which the value of the above limit is 1 is thus a complete theory, the almost- sure theory of G. This theory is countably categorical, meaning that it has a unique countable model up to isomorphism. This model is the Rado graph R, a countably infinite graph which reflects all of the properties which are almost-surely true of large finite graphs. R can be constructed as the Fra¨ıss´elimit of G: it is universal, in the sense that it contains an embedded copy of every finite graph, and homogeneous, in the sense that if two finite subgraphs A and B of R are isomorphic, then there is an automorphism of R moving A to B. Indeed, any class of finite structures which satisfies the hereditary property, joint embedding property, and amalgamation property has a universal homogeneous Fra¨ıss´elimit (see [Hod93, Ch. 7]). We call such a class K a Fra¨ıss´eclass and call the theory its Fra¨ıss´elimit the generic theory of K. Alternatively, R has a probabilistic construction: If we define a graph with underlying set N by assigning each possible edge independently with probability 1/2, then the resulting graph is isomorphic to R with probability 1. Note that this construction is highly symmetric: the probability of seeing a given induced subgraph on a set of vertices A ⊆ N does not depend on the choice of A. Now let’s consider G4, the class of triangle-free graphs. G4 is a Fra¨ıss´eclass, with Fra¨ıss´elimit H, the Henson graph [Hen71]. G4 also has a first-order zero-one law, but its almost-sure theory diverges from its generic theory. Indeed, almost all large finite triangle-free graphs are bipartitle [EKR76, KPR87] and hence do not contain any odd-length cycles, while H contains embedded copies of all triangle-free graphs. The first symmetric probabilistic construction of the Henson graph was given only recently by Petrov and Vershik [PV10]. A theory T is pseudofinite if every sentence ϕ ∈ T has a finite model. The fact that the theory TR of R occurs as the almost-sure theory of G shows that TR is pseudofinite. In fact, for each sentence ϕ ∈ TR, most large finite graphs satisfy ϕ. But this probabilistic argument fails for triangle-free graphs, and the pseudofiniteness of the generic theory of G4 is a notorious open problem (see [Che92, Che11]). My research follows two main threads. The first is the question of pseudofiniteness of theories of Fra¨ıss´e limits, and the coincidence or divergence of generic theories and almost-sure theories. The second is a model- theoretic of symmetric probabilistic constructions. There are analogies between these two settings, which can be viewed as instances of a general measure/category analogy, and in furture work (described below) I plan to connect the two threads. 1 2 ALEX KRUCKMAN Pseudofinite Fra¨ıss´elimits. There are, essentially, two ways to show that a theory T is pseudofinite. One can explicitly construct finite structures which satisfy the sentences in T , or one can employ a probabilistic argument by specifying a sequence of probability measures µn on classes K(n) of finite structures, such that each ϕ ∈ T is satisfied with positive probability for large enough n. The first method has the advantage of being more explicit, and the constructions may be of combinatorial interest. But the second method tells us something more: the sentences of T actually have many finite models as measured by the µn. I have identified a combinatorial sufficient condition, disjoint n-amalgamation, for a countably categorical theory to be pseudofinite, witnessed by a natural sequence of probability measures µn (see [Krua]). A type p(x) in the variables x over a set of parameters A relative to a theory T is the set of all first-order formulas with parameters from A satisfied by some of elements from a model of T . If T has quantifier elimination (as in the case that it is the generic theory of a Fra¨ıss´eclass), then a type is determined by its quantifier-free formulas, i.e. the isomorphism type of the induced substructure on the tuple. A type is nonredundant if it does not contain the formulas xi = xj when i 6= j or xi = a when a ∈ A. A theory has disjoint n-amalgamation if whenever we have of variables x1,..., xn and a system of types {pS | S ( [n]} over A such that pS is a non-redundant type in the variables {xi | i ∈ S}, and pS ⊆ pT when S ⊆ T , then there is some non-redundant type p[n] in the variables {xi | i ∈ [n]} extending the pS. Disjoint 2-amalgamation is a property enjoyed by many combinatorial examples of countably categorical theories. A countably categorical theory T has disjoint 2-amalgamation if and only if it has trivial algebraic closure: For all finite sets A ⊆ M |= T , acl(A) = A, in the sense that no point b ∈ M \ A has a finite orbit under the group of automorphisms fixing A. 1 ([Krua]). If a countably categorical theory T has disjoint n-amalgamation for all n, then T is pseudofinite.

The measures µn witnessing pseudofiniteness describe random structures on with domain [n] “built from below”: For each i ∈ [n], we choose a 1-type uniformly at random, then for every pair {i, j}, we choose a 2-type uniformly at random extending the given 1-types, etc. Disjoint n-amalgamation ensures that such an extension always exists and that we are able to make all random choices as independently as possible. Variations on the theme of n-amalgamation are important in a variety of model-theoretic contexts, in- cluding simple theories [CH99,Kol05,KKT08], definable groupoids [Hru12,GKK15], and abstract elementary classes [Bal09], and I believe that they are an important piece of the puzzle of pseudofinite Fra¨ıss´elimits. Indeed, almost all examples of countably categorical theories with trivial algebraic closure which are known to be pseudofinite either have disjoint n-amalgamation for all n, or are reducts of countably categorical theories with disjoint n-amalgamation for all n (a reduct of a structure or a theory is obtained by forgetting some of the symbols in the vocabulary). The only exceptions (known at the time of this writing) are built from equivalence relations. For example, I have considered two case studies of complicated generic theories of equivalence relations. ∗ Tfeq, introduced by Shelah [She93], is the generic theory of the class of parameterized equivalence relations: the vocabulary consists of a sort of objects, a sort of parameters, and a ternary relation Ex(y, z), where for each parameter a, Ea is an equivalence relation on the objects. TCPZ, introduced by Casanovas, Pel´aez, and Ziegler [CPZ11] is the generic theory of structures in the vocabulary with infinitely many relations En, where En is an equivalence relation on the n-tuples, and all redundant tuples are in the same class. Neither of the theories is a reduct of a theory with disjoint n-amalgamation for all n, but both can be expressed as limits of such reducts in an appropriate sense. This implies that they are both pseudofinite, and, moreover, the pseudofiniteness can be explained via disjoint n-amalgamation. Much of model theory since the advent of Shelah’s classification theory [She90] is concerned with the combinatorics of definable sets (subsets of models definable by formulas), and many dividing lines (stability, simplicity, NIP, etc.) in the complexity of theories have been established on this basis. Let me take a moment to discuss the relationship between the ideas above and these dividing lines. Theorem 2 ([Krua]). If a theory T is a reduct of a countably categorical theory with disjoint 2- and 3- amalgamation, then T is simple. Kim and Pillay [KP98] conjectured that all countably categorical pseudofinite theories are simple. The ∗ theories Tfeq TCPZ are counterexamples to this conjecture. These theories are not simple, but they are NSOP1 (they do Not have the 1-Strong Order Property). The property SOP1, introduced in [DS04], is the first in a heirarchy of dividing lines SOPn, lying between simplicity and the strict order property SOP. RESEARCH STATEMENT 3 Theorem 3 (Folklore). No countably categorical pseudofinite theory has the strict order property. However, very little is known about the pseudofiniteness of countably categorical theories in the region between SOP1 and SOP. As an update to the conjecture of Kim and Pillay, and a vast strengthening of Theorem 3, I suggest the following conjecture:

Conjecture 4. Every pseudofinite countably categorical theory is NSOP1.

Chernikov and Ramsey have given an independence relation criterion for NSOP1 [CR15], which, together with refinements of Hrushovski’s pseudofinite dimensions [Hru13] may be a promising lead toward Conjec- ture 4. Conjecture 4 embraces the equivalence relation examples, but another option is to throw them out, as in the following definition. The motivation is to try to isolate a class of theories for which pseudofiniteness can always be explained by Theorem 1. Definition 5. A primitive combinatorial theory (PCT) is a countably categorical theory with trivial algebraic closure (disjoint 2-amalgamation), such that for every finite set A and every complete type p over A, every A-definable equivalence relation on realizations of p is ∅-definable in the empty vocabulary. Conjecture 6. A PCT is pseudofinite if and only if it is a reduct of a countably categorical theory with n-amalgamation for all n. The triviality of definable equivalence relations only needs to be checked on 1-types [Krub]. This makes it easy to verify that many natural generic theories of Fra¨ıss´eclasses are PCTs.

Theorem 7 ([Krub]). The theory TH of the Henson graph is a PCT. Further, TH is not the reduct of any countably categorical theory with n-amalgamation for all n. The primitive combinatorial theories are of independent model-theoretic interest, since the definition is designed to capture a general notion of “purely unstable” theory. Stable theories are the most well- behaved from a model-theoretic point of view, but examples tend to be algebraic (think of vector spaces or algebraically closed fields) rather than combinatorial. Indeed, the theory of an infinite dimensional over a finite field is pseudofinite, but not for a good probabilistic reason: the finite models of sentences in the theory only occur in certain cardinalities, and must have a very rigid structure. But PCTs have no non-trivial stable behavior. Theorem 8 ([Krub]). In a PCT, any stable formula ϕ(x, y) is equivalent to a boolean of formulas isolating complete 1-types over ∅ and instances of equality. Any stable PCT is interdefinable with a theory of finitely many infinite partitioning unary predicates. In [Krub], I investigate the interaction of PCTs with model-theoretic dividing lines. For example, I characterize when PCTs are simple with trivial forking and when they are distal. Some quesitons remain in this direction. The following conjecture is the most important - replacing “SOP3” with “not simple” is equivalent to the stable forking conjecture for PCTs.

Conjecture 9. If a PCT fails to have disjoint 3-amalgamation, then it is SOP3. Symmetric probabilistic constructions. I will now describe a new kind of infinitary limit of finite structures. Given any finite structure A, we can assign a value P (ϕ; A) ∈ [0, 1] to every quantifier-free formula ϕ(x1, . . . , xn), namely the probability that n elements sampled uniformly and indepedently from A satisfy ϕ. We say that a sequence of finite structures hAni converges if the sequence hP (ϕ; An)i converges for all quantifier-free formulas ϕ. This is very different from convergence in the sense of zero-one laws. Instead of evaluating the truth of sentences with quantifiers globally on structures, we are evaluating quantifier-free formulas locally on elements of structures. The information captured by a convergent sequence amounts to a finitely additive probability measure on the Boolean of quantifier-free formulas in countably many variables {xi | i ∈ N}. This measure extends in a natural and unique way to a Borel probability measure on the space StrL of L-structures with domain N. Moreover, the resulting measure is invariant under the “logic action” of S∞, the on N. From now on I will simply write invariant measure to refer to this sort of probability measure. That there are other useful and equivalent ways to package this information: the exchangeable arrays of Aldous Hoover, and Kallenberg [Ald81, Hoo79, Kal05], the flag of Razborov [Raz07], the graphons 4 ALEX KRUCKMAN of Lovasz and Szegedy in the case of graphs [Lov12] and their generalizations by Austin [Aus08], Nesetril and Ossana de Mendez [NOdM12], and others. But the invariant measures framework allows us to think of a sequence of finite structures as converging to an infinite random structure with domain N, i.e. the sort of symmetric probabilistic construction alluded to in the introduction. We say that an invariant measure concentrates on a structure M if the isomorphism class of M has measure 1. The method due to Petrov and Vershik [PV10] of giving a symmetric probabilistic construction of the Henson graph was extended dramatically by Ackerman, Freer, and Patel to any structure with trivial algebraic closure. Theorem 10 ([AFP12]). There is an invariant measure concentrating on an L-structure M if and only if M has trivial algebraic closure. In fact, the measures constructed in the above theorem are all ergodic, meaning that any almost-surely invariant subset of StrL has measure 0 or 1. In particular, an ergodic measure gives the set of models of any

sentence, even of the infinitary logic Lω1,ω, measure 0 or 1, and it makes sense to talk about the complete theory of the invariant measure, consisting of those sentences which receive measure 1. While the familiar examples of invariant measures concentrate on countable structures (e.g. the Rado graph), there are also simple examples of ergodic invariant measures which fail to concentrate on any partic- ular countable structure [AFNP13]. We call such measures properly ergodic. In joint work with Ackerman, Freer, and Patel, we characterized the behavior of properly ergodic invariant measures.

Theorem 11 ([AFKP]). Let ϕ be a sentence of Lω1,ω. The following are equivalent:

(1) For all countable fragments F of Lω1,ω and all complete F -theories T containing ϕ which have trivial definable closure, there are only countably many F -types consistent with T . (2) Every ergodic measure µ which concentrates on ϕ concentrates on a single model of ϕ, i.e., is not properly ergodic.

The proof goes through a measured version of Morley’s analysis of sentences of Lω1,ω [Mor70], and it also yields the following theorem. Theorem 12 ([AFKP]). If µ is a properly ergodic measure, then Th (µ) has no models of any cardinality. Lω1,ω

However, if F is any countable fragment of Lω1,ω, then ThF (µ) has continuum-many countable models.

There is a general measure/category analogy between the theory of invariant measures on StrL on one hand and Fra¨ıss´etheory on the other. A class K of finite structures with the joint embedding property is the analogue of an ergodic invariant measure: every isomorphism-invariant Borel subset of the K-direct limits is either meager or comeager. A generalization of the usual amalgamation property, called weak amalgamation, is equivalent to the existence of a comeager isomorphism class, and hence corresponds to a measure which concentrates on a single structure up to isomorphism. I explore this analogy in notes on generalized Fra¨ıss´e theory, where I prove the following analogue of Theorem 12. Theorem 13 ([Kruc]). If K is class of finite structure which has the joint embedding property but does not have the weak amalgamation property, then the generic L -theory Th (K) has no models of any ω1,ω Lω1,ω cardinality. However, if F is any countable fragment of Lω1,ω, then the generic F -theory ThF (K) has continuum-many countable models. As the theory of symmetric probabilistic constructions develops, this analogy deserves to be studied further, and I plan to pursue this direction in future work. I will end with a connection to the pseudofiniteness questions in the previous section. Any invariant measure µ induces a sequence of measures µn on the spaces StrL,n of L-structures with domain [n] = {1, . . . , n}, simply by taking the random substructure on [n] induced by the random structure on N. Conversely, these measure µn cohere and induce a unique invariant measure on StrL. Coherence in this sense is a reasonable “naturality” condition on the sequence of measures µn. For example, the “build from below” measures in the proof of Theorem 1 cohere. Question 14. Characterize those invariant measures µ concentrating on the Fra¨ıss´elimit of a class K which induce a sequence µn of measures which witness the pseudofiniteness of the generic theory TK , in the sense that for all ϕ ∈ TK , limn→∞ µn({A ∈ StrL,n | A |= ϕ}) = 1. Which pseudofinite Fra¨ıss´elimits admit such an invariant measure? RESEARCH STATEMENT 5 References [AFKP] Nathanael Ackerman, Cameron Freer, Alex Kruckman, and Rehana Patel, Properly ergodic random structures, in preparation. [AFNP13] Nathanael Ackerman, Cameron Freer, Jaroslav Nesetril, and Rehana Patel, Invariant measures via inverse limits of finite structures, arXiv:1310.8147. [AFP12] Nathanael Ackerman, Cameron Freer, and Rehana Patel, Invariant measures concentrated on countable structures, arxiv:1206.4011. [Ald81] David J. Aldous, Representations for partially exchangeable arrays of random variables, J. Multivariate Anal. 11 (1981), no. 4, 581–598. MR 637937 (82m:60022) [Aus08] Tim Austin, On exchangeable random variables and the of large graphs and hypergraphs, Probab. Surv. 5 (2008), 80–145. MR 2426176 (2010g:60016) [Bal09] John T. Baldwin, Categoricity, University Lecture , vol. 50, American Mathematical Society, Providence, RI, 2009. MR 2532039 (2010m:03068) [CH99] Zo´eChatzidakis and Ehud Hrushovski, Model theory of difference fields, Trans. Amer. Math. Soc. 351 (1999), no. 8, 2997–3071. MR 1652269 (2000f:03109) [Che92] Gregory L. Cherlin, Combinatorial problems connected with finite homogeneity, Proceedings of the International Conference on Algebra, Part 3 (Novosibirsk, 1989), Contemp. Math., vol. 131, Amer. Math. Soc., Providence, RI, 1992, pp. 3–30. MR 1175870 (93m:03054) [Che11] Gregory Cherlin, Two problems on homogeneous structures, revisited, Model Theoretic Methods in Finite Combi- natorics 558 (2011), 319–415. [CPZ11] Enrique Casanovas, Rodrigo Pel´aez,and Martin Ziegler, On many-sorted ω-categorical theories, Fund. Math. 214 (2011), no. 3, 285–294. MR 2845727 (2012k:03091) [CR15] Artem Chernikov and Nicholas Ramsey, On model-theoretic tree properties, arXiv:1505.00454. ∗ [DS04] Mirna Dˇzamonjaand Saharon Shelah, On C -maximality, Ann. Pure Appl. Logic 125 (2004), no. 1-3, 119–158. MR 2033421 (2004j:03044) [EKR76] P. Erd˝os,D. J. Kleitman, and B. L. Rothschild, Asymptotic enumeration of Kn-free graphs, Colloquio Internazionale sulle Teorie Combinatorie (Rome, 1973), Tomo II, Accad. Naz. Lincei, Rome, 1976, pp. 19–27. Atti dei Convegni Lincei, No. 17. MR 0463020 (57 #2984) [Fag76] Ronald Fagin, on finite models, J. Symbolic Logic 41 (1976), no. 1, 50–58. MR 0476480 (57 #16042) [GKK15] John Goodrick, Byunghan Kim, and Alexei Kolesnikov, Type-amalgamation properties and polygroupoids in stable theories, J. Math. Log. 15 (2015), no. 1, 1550004, 45. MR 3367778 [GKLT69] Yu.V. Glebskii, D.I. Kogan, M.I. Liogon’kii, and V.A. Talanov, Range and degree of realizability of formulas in the restricted predicate , Cybernetics 5 (1969), no. 2, 142–154 (English). [Hen71] C. Ward Henson, A family of countable homogeneous graphs, Pacific J. Math. 38 (1971), 69–83. MR 0304242 (46 #3377) [Hod93] Wilfrid Hodges, Model theory, Encyclopedia of and its Applications, vol. 42, Cambridge University Press, Cambridge, 1993. MR 1221741 (94e:03002) [Hoo79] Douglas N Hoover, Relations on probability spaces and arrays of random variables, Preprint, Institute for Advanced Study, Princeton, NJ 2 (1979). [Hru12] Ehud Hrushovski, Groupoids, imaginaries and internal covers, Turkish J. Math. 36 (2012), no. 2, 173–198. MR 2912035 [Hru13] , On pseudo-finite dimensions, Notre Dame J. Form. Log. 54 (2013), no. 3-4, 463–495. MR 3091666 [Kal05] Olav Kallenberg, Probabilistic symmetries and invariance principles, Probability and its Applications (New York), Springer, New York, 2005. MR 2161313 (2006i:60002) [KKT08] Byunghan Kim, Alexei S. Kolesnikov, and Akito Tsuboi, Generalized amalgamation and n-simplicity, Ann. Pure Appl. Logic 155 (2008), no. 2, 97–114. MR 2455560 (2009j:03054) [Kol05] Alexei S. Kolesnikov, n-simple theories, Ann. Pure Appl. Logic 131 (2005), no. 1-3, 227–261. MR 2097228 (2005h:03067) [KP98] Byunghan Kim and Anand Pillay, From stability to simplicity, Bull. Symbolic Logic 4 (1998), no. 1, 17–36. MR 1609191 (99b:03050) [KPR87] Ph. G. Kolaitis, H. J. Pr¨omel,and B. L. Rothschild, Kl+1-free graphs: asymptotic structure and a 0-1 law, Trans. Amer. Math. Soc. 303 (1987), no. 2, 637–671. MR 902790 (88j:05016) [Krua] Alex Kruckman, Disjoint n-amalgamation and pseudofinite countably categorical theories, in preparation, https://math.berkeley.edu/ kruckman/n-amalgamation.pdf. [Krub] , Model theory of finite structures and their limits, PhD thesis, in preparation. [Kruc] , Notes on generalized fra¨ıss´etheory, unpublished notes, https://math.berkeley.edu/ kruckman/fraisse.pdf. [Lov12] L´aszl´oLov´asz, Large networks and graph limits, American Mathematical Society Colloquium Publications, vol. 60, American Mathematical Society, Providence, RI, 2012. MR 3012035 [Mor70] Michael Morley, The number of countable models, J. Symbolic Logic 35 (1970), 14–18. MR 0288015 (44 #5213) [NOdM12] Jaroslav Neˇsetˇriland Patrice Ossona de Mendez, A model theory approach to structural limits, Comment. Math. Univ. Carolin. 53 (2012), no. 4, 581–603. MR 3016428 [PV10] Fedor Petrov and Anatoly Vershik, Uncountable graphs and invariant measures on the set of universal countable graphs, Random Structures 37 (2010), no. 3, 389–406. MR 2724668 (2012a:05277) [Raz07] Alexander A. Razborov, Flag algebras, J. Symbolic Logic 72 (2007), no. 4, 1239–1282. MR 2371204 (2008j:03040) 6 ALEX KRUCKMAN [She90] S. Shelah, Classification theory and the number of nonisomorphic models, second ed., Studies in Logic and the Foundations of Mathematics, vol. 92, North-Holland Publishing Co., Amsterdam, 1990. MR 1083551 (91k:03085) [She93] , The universality spectrum: consistency for more classes, Combinatorics, Paul Erd˝osis eighty, Vol. 1, Bolyai Soc. Math. Stud., J´anosBolyai Math. Soc., Budapest, 1993, pp. 403–420. MR 1249724 (95a:03043)