arXiv:2105.03038v2 [math.CT] 10 May 2021 unl nonee nawd wt fctgrclapplicat compu categorical to of algebra, linear swath and representations wide group a through on encountered quently le oad eocln h w edn aaim fnatu of paradigms [ leading compositional two the reconciling towards alley plcbefe lers hl h eun acl genera calculi sequent the depe While not r does algebras. ones, property free free The applicable the expansions. only all not before pregroups, performed all that follows It bras. Thm. Results 1.2 calculi. sequent of concepts [ [ ste sequents expansion pregroup all free precede the steps supported contraction elimination all where form a a origins, disjoint of structures two these [ semantics combining space in vector tal computatio the in assigning together for used other been have spiders and Pregroups rusae nteohrhn,teagbacsrcueunder structure algebraic the hand, other the on are, groups nvriyo aai oouuH,USA HI, Honolulu Hawaii, of University Pavlovic Dusko uk Pavlovic: Dusko uzosisepstoso i eiinpoeue[ procedure decision his of expositions Buszkowski’s nicknamed lovingly algebras, Frobenius Special Background 1.1 Introduction 1 12 .Temto fcteiiainwsitoue nGengtzen’ in introduced was elimination cut of method The ]. Lambek’s 1 hsc,adcmue science. computer and physics, "Spider" rmwr hteegdwt h eut ugssnwwy to ways analysis. new data suggests and learning results machine the in structures with disj emerged as that algebras spider framework characterized relational be of can characterization general the in cha algebras be ari spider can naturally preordered they themselves where pregroups relations, preordered that proc of out category language turns natural It in together semantics. used been have spiders and abkperusaeFoeisspiders Frobenius are pregroups Lambek asta rgop n one pdr r di are spiders pointed and pregroups that says wthn Lemma Switching [email protected] sancnm of nickname a is 8 , 19 , 24 , 27 ]. , dusko.org pca rbnu algebras Frobenius special [ 37 rp ]sy htaysqeti a in sequent any that says 2] Prop. , Pregroups upre yNFadAFOSR. and NSF by Supported , npreorders in 48 r udmna tutr rmlnusis Pregroups linguistics. from structure fundamental a are spiders , 39 60 11 ff h 7.I a n ae the named and was It 27]. Ch. , 1 , ,ado h qiaec ihtecnetfe languages context-free the with equivalence the of and ], rn rsnain ftesm ait fsqetalge- sequent of variety same the of presentations erent 61 u otercaatrsi omlform normal characteristic their to due etefe eun lersjs ietetr calculi term the like just algebras sequent free the te .Tesrn iga oainhsbe instrumen- been has notation diagram string The ]. dc otenra omweealcnrcin are contractions all where form normal the to educe aiiyadcmlxt [ complexity and tability di t yn oenctgra rmas[ grammars categorial modern lying udmna tutr rmmathematics, from structure fundamental a , a agaepoesn,tedsrbtoa n the and distributional the processing, language ral s twspoe nutvl,a h omo cut of form the as inductively, proved was It ps. a igitc,oefrpeetn h ytx the syntax, the presenting for one linguistics, nal do h nutv wthn rctelimination, cut or switching inductive the on nd ituin fgop.Tecompositional The groups. of unions oint ff efo rmas h te a around, way other The grammars. from se os rmtplgclqatmfil theories, field quantum topological from ions, rn eeso agaemdln,t pnan open to modeling, language of levels erent eia [ seminal s suin fperus hsextends This pregroups. of unions as sig n o ytx h te for other the syntax, for one essing: atrzda one pdr nthe in spiders pointed as racterized nesadadapytebasis the apply and understand free 26 ,adrmie n ftecentral the of one remained and ], rgopcnb omlzdto normalized be can pregroup 23 , 33 SicigLemma" "Switching , 54 , 55 15 , ( , 57 38 34 , ) , r fre- are , 63 37 .Pre- ]. , 59 in ]. generate the free equational algebras, the algebraic reductions, often conveniently presented as categorical dia- gram chasing, apply to both families of general, not necessarily free algebras. Thm. 2 is also a statement about general sequent algebras. It says that the residuation monoids are characterized by the Frobenius conditions, which determine the shape of a spider. Note that the formal grammars were defined as a special family of sequent calculi (those where the generators are partitioned into the terminals and the nonterminals [16]), and that the categorial grammars are the sequent calculi corresponding to the residuation monoids [2, 35, 36, §1.2]. Thm. 3 is a statement about spiders in general, which are more general than those in Thm. 1 because these are not necessarily pointed, but less general than the Frobenius algebras in Thm. 2, because spider algebras are required to satisfy the special isometry condition, in addition to the Frobenius conditions. Thm. 3 says that a general spider can be decomposed into a consistent union of many pregroups; and that the union of any consistent family of pregroups yields a spider. The upshot is that the spidersdo not appear only in the semantical assignments of pregroupsto vector spaces but also as the shapes of pregroups themselves. Any spider of preordered relations is a union of pregroups. A pregroup is precisely a pointed spider.
1.3 Related work The characterization of spiders of relations over preorders extends the characterization [51] of spiders of rela- tions over sets: unions of pregroups disjoint unions of groups = spiders of preordered relations spiders of relations While [51] was mainly concerned with the group structure of the basis elements as a quantum-algorithmic re- source [52], pregroups are a structure used in natural language processing [24, 44, 46]. The structure of the basis elements and the basis sets correspond, respectively, to the syntactic and the semantical aspects. Modern applications require a seamless integration of both aspects [31]. The presented results have been directly in- spired by the DisCoCat program [24, 27, 60, 61], and indirectly by the remarkable headways in the practices of computational linguistics over theory [10]. The author is at an early stage of studying the connections be- tween the many fronts of progress in computational linguistics [43, 47, 48, 59]. Beyond the current horizon, the conceptual veins from spiders’ roots in categorical quantum mechanics [22, 52] to their branchings through the theories of language and communication [27] continue to feed not only the theoretical explorations of sequent algebra [7, 6, 25] but also practical applications and commercial technology transfer [44, 46].
1.4 Why sequent algebra? To relate Lambek’s pregroups and Frobenius’ algebras, both structures are presented in terms of preordered relations. This frameworkcould be viewed as expandingthe scope of the studied structures, or as an unnecessary distraction. In the following, I argue that it is neither. Equational theories are usually presented as pairs (O, E), where O is a set of operations, and E is a set of equations between the terms generated from the operations. Algebra is the practice of deriving other valid equations. If the operations are presented as arrows, the composite operations are the paths along the arrows, the equations are the faces of the directed graphs buit from the arrows, and the sequences of equations can be presented as diagram chases, going back to [36]. Sequent theories are also specified as pairs (O, E), but the operations from O are now given as sequents, and E is a set of implications between some derived sequents. Since the sequents are bulkier than the terms, the sequences of sequent implications are usually written as proof trees, which gives the sequent calculi their typical appearance. Since only relatively simple sequents arise in the present work, I will not build proof trees, but save space by writing the sequent derivations horizontally, with the sequents usually enclosed in parentheses. This should not conceal the fact that we are treading on the well ploughed ground of sequent algebra, going back to Axel Thue. The category PRel is just a convenient categorical framework for it. But the use of sequents goes well beyond algebra. The main techniques were developed by Gerhard Gentzen, Emil Post
2 and Andrey Markov1. Noam Chomsky refined and specialized the sequent theories into grammars [16, 17]. Formal grammars have been instrumental in specifying the syntax of programming languages [49], as well as for their operational semantics [58]. The use of sequents for specifying the grammars of the natural languages goes back to Sanskrit philologist Panini from VI BC [5]. Lambek’s pregroups belong to that tradition. The results show that the sequent view of the spider algebras and of the Frobenius conditions is a natural extension.
1.5 Overview of the paper In Sec. 2 we set the stage for sequent algebra in the category of preorders and preordered relations, shortened to prelations. Sec. 3 describes the prelational monoids, one of the simplest structures of sequent algebras, and characterizes when they are representable as usual preordered monoids. The structure of pregroups is also presented in this framework. Sec. 4 spells out the Frobenius conditions and the special (isometry) condition for prelations and defines the spider algebras in this framework. Theorems 1, 2 and 3 are stated and proved in sections 5, 6, and 7, respectively. Sec. 8 discusses the repercussions of the results, in particular with respect to the goals and the limitations of pregroup grammers [39, Ch. 28].
2 Preorders and prelations
A A preorder is a set A with transitive, reflexive relation ←− ⊆ A × A, which means that A A A A x ←− x and x ←− y ∧ y ←− z =⇒ x ←− z (1) hold for all x,y,z ∈ A. When the underlying set A is clear or irrelevant, we write (x ← y). The preorder equivalence is the symmetric part of a preorder, defined by
(x ↔ y) ⇐⇒ (x ← y) ∧ (y ← x) (2)
A The quotient of A modulo the equivalence relation ←→ is the largest poset (partially ordered set) with a monotone map from A. Since this map is an order isomorphism, every preorder is order-isomorphic to a poset. A prelation Φ : A ց B between preorders A and B is a relation ←−Φ ⊆ A × B which is upper-closed in A and lower-closed in B, i.e.
A B x ←− x′ ∧ x′ ←−Φ y′ ∧ y′ ←− y =⇒ x ←−Φ y (3) holds for all x, x′ ∈ A and y,y′ ∈ B. Given a prelation Ψ : B ց C, the composite (Φ ; Ψ) : A ց C is defined
(Φ ;Ψ) x ←−−−− z ⇐⇒ ∃y. x ←−Φ y ∧ y ←−Ψ z (4)
A The transitivity makes the preordering ←− ⊆ A × A into the identity prelation on A. The category of preorders and prelations is denoted by PRel .
′ Examples. Given a set A, any set of sequents {ai ← ai}i∈J ⊆ A × A determines a preorder on A as its transitive reflixive closure. Given preorders A and B, any set of sequents {ai ← bi}i∈K ⊆ A × B determines A B a prelation A ց B as its transitive closure under ←− on the right and ←− on the left.
1The same one who invented Markov chains and the n-gram model of language, all in the same paper [45].
3 2.1 Monoidal structure The main tensor products of PRel are familiar from their restrictions to PRel’s smaller cousin Rel, the category of sets and relations. For any pair of preorders A and B, • the disjoint union A + B with the preorder
A B A B x ←−−−+ y ⇐⇒ x ←− y ∨ x ←− y (5) is a biproduct in PRel, with the empty set ∅ as the zero; • the cartesian product A × B with the preorder written in the form
AB A B x, y ←−− x′,y′ ⇐⇒ x ←− x′ ∧ y ←− y′ (6)
is a symmetric monoidal product in PRel, with the singleton ½ = {∅} as the unit object.
The diagonals δ : A ց A × A and the projections ! : A ց ½ are defined by
δ AA x ←− y,z ⇐⇒ x, x ←−− y,z and x ←−∅! for all x (7) Just like in the category Rel of sets and relations, the diagonals and the projections do not form natural trans- formations: the former commutes only with the single-valued relations, the latter only with the total ones.
(PRel, ×, ½) is therefore not a cartesian structure, just symmetric monoidal. (PRel, +, ∅) is a bicartesian struc- ture.
2.2 Completions Any preorder X embeds into the complete lattices of its lower sets and upper sets, respectively, with the opposite inclusion orderings:
X ⇓X ⇓X = {L ⊆ X | x ∈ L ∧ x ←− x′ =⇒ x′ ∈ L} with L ←−− L′ ⇐⇒ L ⊇ L′ (8) X ⇑X ⇑X = {V ⊆ X | x ←− x′ ∧ x′ ∈ V =⇒ x ∈ V } with U ←−− U ′ ⇐⇒ U ⊆ U ′ (9) X The embeddings map each x ∈ X into the principal lower set ↓ x = {y| x ←− y }∈⇓X and the principal X upper set ↑ x = {y| y ←− x }∈⇑X. It is easy to see that ⇓X is generated by the unions of the principal lower sets as suprema, and that ⇑X is generated by the unions of the principal upper sets as infima. They are thus X’s supremum and infimum completions, respectively.
2.3 Dualities PRel supports two dualities:
O, ‡ : PRelop −→ PRel (10) where PRelop is the opposite category of PRel, i.e. PRelop(A, B) = PRel(B, A). It is convenient to write Ao instead of O(A) and A‡ instead of ‡(X). The object part of O sends A to the opposite preorder Ao, i.e. Ao A x ←−− x′ ⇐⇒ x′ ←− x . The object part of ‡ is the identity, i.e. A‡ = A. The arrow parts thus map a prelation Φ : A ց B to prelations in the form Φo : Bo ց Ao and Φ‡ : B ց A, defined
o ‡ B A y ←−−Φ x ⇐⇒ x ←−Φ y y ←−−Φ x ⇐⇒ ∀uv. u ←−Φ v ⇒ y ←− v ∧ u ←− x (11)
4
op ½ To understand ‡ : PRel −→ PRel, consider the special case when A = ½. A prelation Φ : ց B can then ‡ be viewed as a lower set Φ ∈ ⇓B, and it is easy to see that Φ : B ց ½, defined as in (11), is the upper set
‡ ‡
½ ½ Φ ∈⇑B of all upper bounds of Φ. When B = ½, then Ψ : A ց is an upper set Ψ ∈⇑A amd Ψ : ց A in (11) corresponds to the lower set Ψ‡ ∈ ⇓A of Ψ’s lower bounds. Iterating the dagger induces a closure operators ‡‡ : ⇓B −→⇓B, and a lower set Φ satisfies Φ=Φ‡‡ if and only if it contains all lower bounds of the set of its upper bounds. Dually for an upper set Ψ. If a general prelation Φ : A ց B is viewed as a lower set Φ ∈ ⇓(Ao × B), then Φ‡ is the set of Φ’s upper bounds. A set of upper bounds is, of course, an element of ⇑(Ao × B), but if it is viewed as the lower set Φ‡ ∈ ⇓(Bo × A) ⇑(Ao × B), then it easier to see that it is a prelation Φ‡ : B ց A. Checking that Φ ⊆ Φ‡‡ and Φ‡ = Φ‡‡‡ always hold is routine, and Φ=Φoo is obvious. The two dualities coincide just on the preorders that happen to be equivalence relations. The duality O gives rise to the internal adjoints in PRel on the level of objects, ‡ on the level of arrows. The former adjoints make PRel into a compact category, whereas the latter provide an internal characterization of monotone maps between preorders.
2.4 Compact structure On the level of the objects of PRel, the fact that every preorder B has a right adjoint2 Bo in PRel is realized by the bijections
PRel(A × B, C) PRel(A, Bo × C) (12)
η o natural in A and C. The adjunction unit ½ −→ B × B arises on the right, corresponding to id : B ց B on o ε oo left, whereas the counit B × B −→ ½ arises on the left corresponding to id : B ց B = B on the right. This makes PRel into a compact category.
2.5 Maps On the level of morphisms, the adjunction conditions are
‡ idA ⊆ Φ;Φ , which means