Basics about algebraic approach to languages
Marc Zeitoun
LSV, ENS Cachan – LaBRI, U. Bordeaux – CNRS – INRIA
GAMES-EPIT Spring School 2011
Carcans-Maubuisson, May 23–27, 2011
Motivation
I Word languages are used in computer science for many purposes.
I Natural models for
I (text) programs,
I inputs of programs,
I behaviors of programs,
I etc.
I Several formalisms to define regular sets of words: logics, regexp, automata... How to compare their expressive power?
This talk gives examples that exploit connections with algebra to answer this question.
[Diagram: logics, regular expressions and automata, and other high-level formalisms, connected through algebraic tools.]
Motivation: some examples
I Main question in this talk: expressiveness. Typical questions:
I Compare the expressive power of logic L1 and logic L2.
I Is temporal logic with past and future operators more expressive than pure-future temporal logic?
I Is temporal logic with only unary operators weaker?
I Are FO(<) and LTL on words equally expressive?
I Is a given regular language L expressible in first-order logic?
I A polynomial is a finite union of languages of the form B0∗ a1 B1∗ ··· ak Bk∗. Given a regular language L, is it decidable whether
I L is a polynomial?
I L is a finite boolean combination of polynomials?
Link between logic and algebra for regular languages
I A language L ⊆ A∗ is recognized by a monoid M if there is a morphism ϕ : A∗ → M such that L = ϕ⁻¹(ϕ(L)).
I In other words: it is enough to look at the image of a word under ϕ to decide its membership in L.
Thm. (Kleene–Büchi) For a language L ⊆ A∗, the following are equivalent:
I to be recognized by a finite automaton,
I to be definable by a regular expression,
I to be definable in Monadic Second Order logic,
I to be recognized by a finite monoid.
Regular languages and monoids
I High-level description: the language L, given e.g. by a regular expression such as ab∗.
I Low-level description: the minimal automaton Amin(L), here with states 0, 1, 2: 0 --a--> 1, 1 --b--> 1 (loop), 0 --b--> 2, 1 --a--> 2, 2 --a,b--> 2 (sink); state 1 accepting.
I In between, the algebra (finite, [K56]): the syntactic monoid M(L) = {ε, a, b, a²}, with relations ab = a, ba = a², b² = b.
I Going from the high-level to the low-level description is "easy"; the converse direction is harder, and the algebra supports the algorithms.
High-level vs. algebraic properties: prominent results
Logical definability      | Language L                  | Amin(L)                  | Synt(L)
FO(<), LTL                | star-free                   | counter-free             | A (aperiodic) [Sch65, McN-P71, K68]
Bool(Σ1)                  | piecewise testable          | very weak + ...          | J (J-trivial) [S75, Th87]
FO2(<), UTL, Σ2 ∩ Π2      | unambiguous products        | partially ordered 2-way  | DA [Sch76, SchThV01, ThW98, EVW97, PW97]
FO(+1)                    | locally threshold testable  | forbidden patterns       | ACom ∗ LI [S85, ThW85, BP89]
Some tools
I Syntactic monoid of a regular language.
I Monoid structure, in terms of ideals.
I Basic and more advanced pumping arguments.
Outline
Motivation
Regular languages and monoids
  Syntactic monoids
Background on finite monoids
  Idempotents
  Green's relations
Logics
  First order logic
  LTL
Expressiveness results
  FO with one variable
  Piecewise testable languages
  First-order logic, star-free languages and aperiodic monoids
Summary
Syntactic monoids
I Notation: monoid (M, ·, 1), with · associative and 1 neutral.
I A∗: the free monoid over A.
I Morphism ϕ : M → N.
I If L ⊆ A∗, its syntactic congruence ∼L ⊆ A∗ × A∗ is defined by
  u ∼L v iff ∀x, y ∈ A∗: xuy ∈ L ⇐⇒ xvy ∈ L.
I It is indeed a congruence, i.e. an equivalence relation compatible with concatenation:
  u ∼L v =⇒ [x(uw)y ∈ L ⇔ x(vw)y ∈ L] =⇒ uw ∼L vw.
I Syntactic monoid M(L) = A∗/∼L, and syntactic morphism ηL : A∗ → M(L).
Thm. (Myhill) A language is regular if and only if its syntactic congruence has finite index.
Syntactic monoids
I Say that M1 divides M2 if M1 is a quotient of a submonoid of M2.
Syntactic monoid and recognition: monoids recognizing L are exactly those that are divided by M(L).
Proof
I ηL recognizes L, since ηL⁻¹(ηL(L)) = L:
  u ∈ ηL⁻¹(ηL(L)) ⇒ ηL(u) ∈ ηL(L) ⇒ ∃v ∈ L, u ∼L v ⇒ u ∈ L.
I If M(L) divides M: there are a submonoid N of M, a one-to-one morphism α : N → M, and a surjective morphism β : N → M(L). Choose ϕ : A∗ → N with β ◦ ϕ = ηL (lift each letter through the surjection β). Then α ◦ ϕ recognizes L.
I Conversely, if ϕ : A∗ → M recognizes L, then ϕ(u) = ϕ(v) implies u ∼L v. Therefore there is a surjective morphism β from the submonoid ϕ(A∗) of M onto M(L).
Syntactic monoids
I M(L) is the monoid of transitions of the minimal automaton of L: each word induces a mapping Q → Q (hence |M(L)| ≤ |Q|^|Q|).
  ⇒ Algorithm to compute M(L) (see the example below).
I In particular, M(A∗ \ L) = M(L).
I M(L1 ∩ L2) and M(L1 ∪ L2) divide M(L1) × M(L2).
I Consequence: properties defined by identities are preserved under boolean combinations.
I For instance, if M(L1) and M(L2) are commutative, then so are M(L1 ∩ L2), M(L1 ∪ L2), and M(L1 \ L2).
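The transition-monoid construction is directly implementable. Below is a minimal sketch (the function name and representation are my own, not from the talk): states are 0, ..., n−1, the action of a word is stored as the tuple of images of the states, and the monoid is the closure of the letter actions under composition.

```python
def transition_monoid(n, alphabet, delta):
    """Transition monoid of a complete DFA with states 0..n-1.

    delta[(q, a)] is the successor of state q on letter a.  Each word
    induces a mapping Q -> Q, stored as a tuple of images; the monoid
    is the closure of the letter actions under composition.
    Returns a dict mapping each action to one word realizing it.
    """
    ident = tuple(range(n))                       # action of the empty word
    gens = {a: tuple(delta[(q, a)] for q in range(n)) for a in alphabet}
    monoid = {ident: ""}
    frontier = [ident]
    while frontier:
        f = frontier.pop()
        for a, g in gens.items():
            fa = tuple(g[f[q]] for q in range(n))  # first f, then letter a
            if fa not in monoid:
                monoid[fa] = monoid[f] + a
                frontier.append(fa)
    return monoid
```

On the example of the next slide (L = A∗abA∗, states renumbered 0, 1, 2), this produces the six elements 1, a, b, c, ab, ba.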
Syntactic monoid: example
I Example: let A = {a, b, c} and L = A∗abA∗.
I The minimal automaton of L: states 1, 2, 3 (3 accepting), with transitions 1 --b,c--> 1, 1 --a--> 2, 2 --a--> 2, 2 --b--> 3, 2 --c--> 1, 3 --a,b,c--> 3.
I Elements as mappings of the states 1 2 3:
    1  : 1 2 3
    a  : 2 2 3
    b  : 1 3 3
    c  : 1 1 3
    ab : 3 3 3
    ba : 2 3 3
I Relations: aa = ca = a; ac = cb = cc = c; bb = bc = b; ab = bab = 0.

Background on finite monoids
Idempotents in monoids
I An idempotent is an element e ∈ M such that e = ee.
I Any m ∈ M has a single power which is idempotent, denoted mω: mω = (mω)².
[Picture: the powers m, m², m³, ..., m^k, m^{k+1}, ... eventually loop back, forming a "lasso".]
I Since the loop has at most |M| elements, mω = m^{|M|!}.
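The "lasso" argument above can be turned into a tiny computation; a sketch of my own, assuming the monoid is given by a multiplication function:

```python
def idempotent_power(m, mul):
    """Return m^omega, the unique idempotent power of m in a finite monoid.

    Walks m, m^2, m^3, ... until the sequence loops, then scans the loop
    for the element e with e*e == e (such an e always exists in the loop).
    """
    seen, powers, x = set(), [], m
    while x not in seen:
        seen.add(x)
        powers.append(x)
        x = mul(x, m)                  # next power of m
    start = powers.index(x)            # the loop is powers[start:]
    for e in powers[start:]:
        if mul(e, e) == e:
            return e
```

For instance, in the multiplicative monoid of integers mod 10, the powers of 2 cycle through 2, 4, 8, 6, and the idempotent in the cycle is 6.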
Ideal theory of monoids: Green's relations
I Automata exhibit some structure: graph representation, strongly connected components, DAG of SCCs, etc.
I One can extract the same kind of information from a monoid M.
I The right action of M on itself yields a labeled graph R(M): an edge from m1 to m1m2 labeled by m2.
I Strongly connected components of R(M) are called R-classes.
I Formally, u and v are R-equivalent, written u R v, if
  1. there exists x ∈ M such that v = ux, and
  2. there exists y ∈ M such that u = vy.
I Or equivalently, if uM = vM: u and v generate the same right ideal.
Green's relations
Example: L = {a, b, c}∗ab{a, b, c}∗, with the minimal automaton of the previous example.
I u R v if uM = vM,
I u L v if Mu = Mv,
I u J v if MuM = MvM,
I H = R ∩ L.
Egg-box diagram of M(L) (J-classes from top to bottom; rows are R-classes, columns are L-classes, ∗ marks idempotents):
  ∗1
  -------
  ∗a | ∗c
  ba | ∗b
  -------
  ∗ab
I (ca = a) ∧ (ac = c) ⇒ a R c.
I (ba = b·a) ∧ (cba = a) ⇒ a L ba.
I Hence b L c R a, so b J a.
I Here, every H-class is trivial.
I In this example, we see that R and L commute, and J = R ◦ L = L ◦ R.
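These definitions can be checked mechanically on a small monoid. A sketch (representation assumed, not from the talk: an explicit element list containing the identity, plus a multiplication function):

```python
def green_classes(M, mul):
    """Green's relations on a finite monoid M (a list containing 1).

    u R v iff uM = vM; u L v iff Mu = Mv; u J v iff MuM = MvM.
    Returns three dicts mapping each element to its class (a frozenset).
    """
    right = {u: frozenset(mul(u, x) for x in M) for u in M}                   # uM
    left = {u: frozenset(mul(x, u) for x in M) for u in M}                    # Mu
    two = {u: frozenset(mul(mul(x, u), y) for x in M for y in M) for u in M}  # MuM

    def classes(ideal):
        return {u: frozenset(v for v in M if ideal[v] == ideal[u]) for u in M}

    return classes(right), classes(left), classes(two)
```

Run on the syntactic monoid of {a, b, c}∗ab{a, b, c}∗ (elements represented as state mappings), this confirms the facts on the slide: a R c, a L ba, and b J a.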
Another example
I u R v if uM = vM,
I u L v if Mu = Mv,
I u J v if MuM = MvM,
I H = R ∩ L.
Automaton: 1 --a--> 2, 2 --b--> 3, 2 --c--> 1, 3 --b--> 2; loops: b, c on state 1; a on state 2; a, c on state 3.
[Egg-box diagram of M(L), ∗ marking idempotents: below {1}, a J-class containing a, b, ab, cb, abcb, c, abc, bab, ba, bcb, babcb, bc, babc, cbab, cba, cbc, grouped into H-classes.]
I Here, there are nontrivial H-classes.
I An H-class containing an idempotent is a group.
I Either H is a group, or HH ∩ H = ∅.
I All H-classes from the same J-class are isomorphic.
An important property
In a finite monoid, J = R ◦ L = L ◦ R.
Proof of J = R ◦ L
⊇ If u (R ◦ L) v, there is w s.t. u R w L v, so u J w J v.
⊆ If u J v, then u = xvt and v = yuz. Let w = uz. From u = x(yuz)t = (xy)u(zt), iterating gives
  u = (xy)ω u (zt)ω, hence (xy)ω u = u and u = u(zt)ω ≤R uz ≤R u.
  Hence u R w, and symmetrically w L v.
I With the same arguments: J-equivalent and R-comparable implies R-equivalent.
Basic but important properties
I H ⊆ R ⊆ J and H ⊆ L ⊆ J.
I R is a left congruence, L is a right congruence.
I One can also consider the associated preorders. For instance:
  I s ≤J t if MsM ⊆ MtM (we say that s and t are J-comparable);
  I s <J t if s ≤J t but not t ≤J s.
Logics

First Order Logic FO(<)
Syntax: FOA(<)
  ϕ ::= ⊥ | a(x) | ¬ϕ | ϕ ∨ ϕ | x ≤ y | ∃x ϕ    (a ∈ A, x, y ∈ Var)
Semantics
I A formula is evaluated on a word w ∈ A∗.
I σ : Var → pos(w) = {1, 2, ..., |w|} is an interpretation of the (free) variables.
I < is interpreted as the usual ordering between positions.
  w, σ |= a(x)    if w_σ(x) = a
  w, σ |= ¬ϕ     if w, σ ⊭ ϕ
  w, σ |= ϕ ∨ ψ  if w, σ |= ϕ or w, σ |= ψ
  w, σ |= x ≤ y  if σ(x) ≤ σ(y)
  w, σ |= ∃x ϕ   if there is a position v ∈ pos(w) such that w, σ[x ↦ v] |= ϕ
Macros: ∀x ϕ := ¬∃x ¬ϕ;  ϕ ∧ ψ := ¬(¬ϕ ∨ ¬ψ);  ϕ ⇒ ψ := ψ ∨ ¬ϕ;  ϕ ⇔ ψ; ...
A sentence ϕ ∈ FO(<) defines the language L(ϕ) = {w | w |= ϕ} ⊆ A∗.
First Order Logic — Examples
Examples of FO(<) formulas
I ϕ = ∀x (a(x) ∨ b(x)) ∧ ∀y ∀z [(z = y + 1) ⇒ (a(y) ⇔ b(z))]
  bababababa |= ϕ    abaabaabaaba ⊭ ϕ
I y = x + 1 is definable: y = x + 1 := (x < y) ∧ ¬∃z (x < z < y). Conversely, < is not expressible in FO(+1) [Th82]. How to prove it?
I first(x) := ¬∃z (z < x).
I (min ⊆ B) := ∨_{b∈B} ∃x [b(x) ∧ first(x)].
I L(ϕ ∧ (first = {a}) ∧ (last = {b})) = (ab)⁺.
I Hence, (ab)⁺ is expressible in FO(<). How to automatically get a formula?
I The language (aa)⁺ cannot be expressed in FO(<)! How to prove it?
Linear Temporal Logic LTL (Pnueli 1977)
Syntax: LTL(A, X, U)
  ϕ ::= ⊥ | a | ¬ϕ | ϕ ∨ ϕ | X ϕ | ϕ U ϕ
Semantics: w ∈ A∗ and i ∈ pos(w)
  w, i |= a        if w_i = a
  w, i |= ¬ϕ      if w, i ⊭ ϕ
  w, i |= ϕ ∨ ψ   if w, i |= ϕ or w, i |= ψ
  w, i |= X ϕ     if w, i + 1 |= ϕ
  w, i |= ϕ U ψ   if ∃k. i ≤ k ∧ w, k |= ψ ∧ ∀j. (i ≤ j < k) ⇒ w, j |= ϕ
ϕ ∈ LTL(A, X, U) defines the language L(ϕ) = {w | w, 1 |= ϕ}.

Useful LTL Macros
I F ϕ = ⊤ U ϕ (eventually).
I G ϕ = ¬F¬ϕ (always in the future).
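The semantics above can be prototyped in a few lines; a sketch using an ad-hoc encoding of my own (formulas as nested tuples), not from the talk:

```python
def holds(w, i, f):
    """w, i |= f, with 1-based positions as on the slide.

    Formulas: ("letter", a), ("not", g), ("or", g, h), ("X", g), ("U", g, h).
    """
    op = f[0]
    if op == "letter":
        return 1 <= i <= len(w) and w[i - 1] == f[1]
    if op == "not":
        return not holds(w, i, f[1])
    if op == "or":
        return holds(w, i, f[1]) or holds(w, i, f[2])
    if op == "X":
        return i + 1 <= len(w) and holds(w, i + 1, f[1])
    if op == "U":   # some k >= i satisfies h, and every j in [i, k) satisfies g
        return any(holds(w, k, f[2]) and all(holds(w, j, f[1]) for j in range(i, k))
                   for k in range(i, len(w) + 1))
    raise ValueError(op)

TOP = ("or", ("letter", "a"), ("not", ("letter", "a")))   # the constant true
F_a = ("U", TOP, ("letter", "a"))                         # the macro F a = true U a
```

With this encoding, L(F a) = A∗aA∗, matching the example given later.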
Examples of LTL properties
Some languages definable in LTL:
I F a defines the language A∗aA∗.
I G a defines the language a⁺.
I (ab)⁺ is also definable: G(a ∨ b) ∧ a ∧ F(b ∧ ¬X⊤) ∧ G(a ↔ X b)
I Can (aa)⁺ be defined in LTL? It does not seem so, but how to prove it?
I On A∗, LTL formulas can easily be translated into FO(<) formulas.
I Does the converse hold? Difficult, because LTL only handles one free variable. Syntactic monoids help!

Expressiveness results

FO with one variable
FO1(<) = FO(<) where only one variable name is allowed.
Example: FO with one variable. The following conditions are equivalent:
1. L is definable by an FO1(<) sentence.
2. L is a boolean combination of languages of the form A∗aA∗.
3. The syntactic monoid of L is idempotent and commutative: ∀s, t ∈ M(L), s = s² and st = ts.

Typical idempotent commutative monoid
I (2^A, ∪, ∅).
I Hasse diagram of the representation in J-classes:
  1
  ∗a  ∗b  ∗c
  ∗ab ∗ac ∗bc
  ∗abc

Proof of 1 ⇒ 2
I In FO1, the binary symbol < is useless: x ≤ x ≡ ⊤.
I Every formula is equivalent to a boolean combination of formulas ∃x ϕ(x), with ϕ quantifier-free.
I Get rid of negation in atoms: ¬b(x) ≡ ∨_{a≠b} a(x); and if a ≠ b, then a(x) ∧ b(x) ≡ ⊥.
I Hence ∧_{b∈B} b(x) ∧ ∧_{c∈C} ¬c(x) reduces to a single predicate a(x) or to ⊥,
I and since ∃x.(a(x) ∨ b(x)) ≡ ∃x.a(x) ∨ ∃x.b(x), everything reduces to boolean combinations of formulas ∃x.a(x).
I L(∃x.a(x)) = A∗aA∗.

Proof of 2 ⇒ 1
I L(∃x.a(x)) = A∗aA∗.
Proof of 2 ⇒ 3
I The syntactic monoid M(A∗aA∗) = {1, a} is idempotent and commutative.
I Idempotency and commutativity are inherited by boolean combinations.

Proof of 3 ⇒ 2
I Let ϕ : A∗ → M recognize L, with M idempotent and commutative.
I L = ∪_{s∈ϕ(L)} ϕ⁻¹(s): it is enough to show that ϕ⁻¹(s) ∈ Bool({A∗aA∗ | a ∈ A}).
I Since M is idempotent and commutative: alph(u) = alph(v) ⇒ ϕ(u) = ϕ(v).
I Let [alph = B] be the set of words whose alphabet is exactly B. Then ϕ⁻¹(s) = ∪_{B∈A_s} [alph = B], where A_s = {B ⊆ A | ∃u ∈ ϕ⁻¹(s), alph(u) = B}.
I [alph = B] = ∩_{b∈B} A∗bA∗ \ ∪_{c∉B} A∗cA∗.

Piecewise testable languages
I For u = a1 ··· an with ai ∈ A, let L(u) = A∗a1A∗a2 ··· A∗anA∗. L(u) is the set of all words having u as a (scattered) subword.
I Write u ⊑ v if v ∈ L(u): u is a (scattered) subword of v.
I A piecewise testable language is a boolean combination of languages L(u), u ∈ A∗.
I Examples:
  I B∗ = A∗ \ ∪_{c∉B} A∗cA∗
  I [alph = B] = ∩_{b∈B} A∗bA∗ \ ∪_{c∉B} A∗cA∗
I Question: how to decide whether a language is piecewise testable?
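Item 3 of the FO1(<) theorem above turns definability into an effective test on the syntactic monoid; a sketch of my own, over the same assumed representation (explicit element list plus multiplication function):

```python
def fo1_definable(M, mul):
    """L is FO1(<)-definable iff its syntactic monoid M = M(L) is
    idempotent and commutative: s*s == s and s*t == t*s for all s, t."""
    return (all(mul(s, s) == s for s in M) and
            all(mul(s, t) == mul(t, s) for s in M for t in M))
```

For instance, (2^A, ∪) passes the test, while the syntactic monoid of A∗abA∗ fails it (ab ≠ ba).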
BΣ1(<) fragment of FO(<)
I Piecewise testable languages have a logical characterization.
I BΣ1(<): fragment of FO(<) consisting of the Boolean closure of formulas of the form ∃x1 ... ∃xk ϕ(x1, ..., xk) with ϕ quantifier-free.
I Clearly BΣ1(<) can express piecewise testability:
  A∗a1A∗a2 ··· A∗anA∗ = L(∃x1 ... ∃xn  ∧_i (xi < xi+1) ∧ ai(xi)).
I Conversely, using disjunctive normal form and ∃x̄ ∨_i ϕi ≡ ∨_i ∃x̄ ϕi, one can start from ∃x̄ ϕ where ϕ is a conjunction of atoms.
I One can get rid of negative atoms.
I Therefore, ϕ fixes conditions on the order of the xi's and their labels.
I This defines a piecewise testable language.

Simon's theorem for piecewise testable languages
Piecewise testable languages, BΣ1, J-trivial monoids (Simon–Thomas). The following conditions are equivalent:
1. L is BΣ1(<) definable.
2. L is piecewise testable.
3. M(L) is finite and J-trivial.
Recall: J-trivial means u J v ⇒ u = v.
Corollary: BΣ1(<) definability is decidable.
Example: (ab)⁺ is not piecewise testable (on any alphabet containing {a, b}).
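Simon's theorem thus reduces BΣ1(<)-definability to computing M(L) and testing J-triviality; a sketch of the test (same assumed representation as before, with the element list containing the identity):

```python
def j_trivial(M, mul):
    """True iff u J v implies u == v, i.e. distinct elements of the finite
    monoid M generate distinct two-sided ideals MuM."""
    ideals = set()
    for u in M:
        ideals.add(frozenset(mul(mul(x, u), y) for x in M for y in M))  # MuM
    return len(ideals) == len(M)
```

A quick sanity check: the two-element chain under min is J-trivial, while any nontrivial group (e.g. Z/2Z) is not.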
Simon's theorem for piecewise testable languages: proof (1)
▶ Recall that L(u) is the set of words having u as a (scattered) subword.
▶ Define u ∼k v if u and v have the same (scattered) subwords of length ≤ k. Example: abba ∼2 baba, but baba ≁2 aabb.
▶ ∼k ⊆ A∗ × A∗ is a congruence of finite index (at most 2^|A^{≤k}| classes).

TFAE:
1. L is piecewise testable.
2. There exists k > 0 such that ∼k ⊆ ∼L.

Proof of 1 ⇒ 2
▶ L = ⋃ (⋂ᵢ L(uᵢ) ∩ ⋂ⱼ (A∗ \ L(vⱼ))), a finite union of finite intersections.
▶ Let k = max{|uᵢ|, |vⱼ|}.
▶ If u ∼k v, then xuy ∼k xvy for all x, y.
▶ So by definition of k, xuy and xvy belong to the same languages L(uᵢ), L(vⱼ).
▶ Hence xuy ∈ L iff xvy ∈ L, therefore u ∼L v.

Proof of 2 ⇒ 1
▶ L = η_L⁻¹(η_L(L)) = ⋃_{s∈η_L(L)} η_L⁻¹(s) is a union of ∼L-classes.
▶ If ∼k ⊆ ∼L, then L is a (finite) union of ∼k-classes.
▶ Enough to show that any ∼k-class is piecewise testable.
▶ The ∼k-class of a word u is ⋂_{w⊑u, |w|≤k} L(w) ∩ ⋂_{w⋢u, |w|≤k} (A∗ \ L(w)).
32/49
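For concrete words, ∼k is directly computable: enumerate all scattered subwords of length ≤ k and compare the two sets. A small sketch under our own encoding (words as strings); itertools.combinations enumerates position subsets in increasing order, hence exactly the scattered subwords:

```python
from itertools import combinations

def subwords(w, k):
    """Set of scattered subwords of w of length at most k."""
    return {"".join(c) for n in range(k + 1) for c in combinations(w, n)}

def equiv_k(u, v, k):
    """u ∼k v: same scattered subwords of length ≤ k."""
    return subwords(u, k) == subwords(v, k)

# The example from the slide: abba ∼2 baba, but baba ≁2 aabb
print(equiv_k("abba", "baba", 2))  # True
print(equiv_k("baba", "aabb", 2))  # False ("ba" ⊑ baba but "ba" ⋢ aabb)
```

The brute-force enumeration is exponential in k; it is meant only to make the definition concrete.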
Simon's theorem for piecewise testable languages: proof (2)
Proof of: L is piecewise testable ⇒ M(L) is J-trivial.
▶ Write ū = η_L(u), where η_L : A∗ → M(L) is the syntactic morphism.
▶ Assume ū J v̄. We want ū = v̄.
▶ By definition of J: ū = x̄v̄t̄ and v̄ = ȳūz̄. Hence ū = x̄ȳūz̄t̄ = (x̄ȳ)ⁿū(z̄t̄)ⁿ for every n.
▶ Let k be such that ∼k ⊆ ∼L.
▶ Let n be such that (xy)ⁿ u (zt)ⁿ ∼k (xy)ⁿ⁺¹ u (zt)ⁿ⁺¹.
▶ Therefore, (xy)ⁿ u (zt)ⁿ ∼k y (xy)ⁿ u (zt)ⁿ z.
▶ Since ∼k ⊆ ∼L, we get ū = (x̄ȳ)ⁿū(z̄t̄)ⁿ = ȳ(x̄ȳ)ⁿū(z̄t̄)ⁿz̄ = v̄.
33/49

Simon's theorem for piecewise testable languages: proof (3)
▶ Recall that s ≤J t if s = xty for some x, y ∈ M.
▶ Note that if M(L) is J-trivial, then ≤J is a partial order on M(L).
▶ By definition of ≤J, multiplying an element yields an element
  ▶ either equal to the original one,
  ▶ or strictly smaller.
34/49

Simon's theorem for piecewise testable languages: proof (4)
M(L) J-trivial ⇒ L piecewise testable: proof of O. Klíma (1)
▶ Let m be the size of the largest ≤J-chain in M(L).
▶ Let u ∼k v, with u = u(1) ··· u(p) and v = v(1) ··· v(q).
▶ u[ℓ₁, ..., ℓ_k] ≝ word made of the letters at positions {ℓ₁, ..., ℓ_k}, read from left to right.
▶ For i ≤ j, u[i • j] ≝ u[i, i+1, ..., j].
▶ Position i in u is blue if η_L(u[1 • (i−1)] u(i)) ≠ η_L(u[1 • (i−1)]): appending u(i) makes the image strictly ≤J-smaller.
▶ There are at most m − 1 blue positions i₁ < ··· < iᵣ in u.
35/49
Simon's theorem for piecewise testable languages: proof (4)
M(L) J-trivial ⇒ L piecewise testable: proof of O. Klíma (2)
▶ Claim 1: any subword of u containing all blue positions is ∼L-equivalent to u.
  Indeed, by definition of blue positions, for any i_ℓ < i < i_{ℓ+1}:
  η_L(u[1 • i_ℓ]) = η_L(u[1 • (i−1)]) = η_L(u[1 • i]) ⇒ η_L(u[1 • i_ℓ] u(i)) = η_L(u[1 • i_ℓ]).
  This is where the hypothesis that M(L) is J-trivial is used.
▶ For the same reason, i₁ < ··· < iᵣ carries the leftmost occurrence in u of the word u[i₁, ..., iᵣ].
▶ u ∼k v ⇒ u[i₁, ..., iᵣ] ⊑ v.
▶ Let v(ĩ₁) ··· v(ĩᵣ) be its leftmost occurrence in v. Call ĩ₁, ..., ĩᵣ blue in v.
▶ Dually, position j in v is red if η_L(v(j) v[(j+1) • q]) ≠ η_L(v[(j+1) • q]): prepending v(j) makes the image strictly ≤J-smaller.
▶ There are at most m − 1 red positions j₁ < ··· < j_s in v.
▶ Corresponding (rightmost) red positions in u: j̃₁, ..., j̃_s.
▶ Claim 2: u[i₁, ..., iᵣ, j̃₁, ..., j̃_s] = v[ĩ₁, ..., ĩᵣ, j₁, ..., j_s].
  ▶ The blue (resp. red) positions carry the same word in u and v.
  ▶ The way they are shuffled in u and v only depends on (small) subwords.
35/49
▶ Consider in u a blue position i_h and a red one j̃_ℓ, with for simplicity u(i_h) ≠ u(j̃_ℓ). Then
  i_h < j̃_ℓ ⇔ u(i₁) ··· u(i_h) u(j̃_ℓ) ··· u(j̃_s) ⊑ u
           ⇔ v(ĩ₁) ··· v(ĩ_h) v(j_ℓ) ··· v(j_s) ⊑ v
           ⇔ ĩ_h < j_ℓ.
35/49

Simon's theorem for piecewise testable languages: proof (4)
M(L) J-trivial ⇒ L piecewise testable: proof of O. Klíma (3)
▶ Claim 1: any subword of u containing all blue positions is ∼L-equivalent to u.
▶ Claim 2: u[i₁, ..., iᵣ, j̃₁, ..., j̃_s] = v[ĩ₁, ..., ĩᵣ, j₁, ..., j_s].
▶ Hence u ∼L u[i₁, ..., iᵣ, j̃₁, ..., j̃_s] = v[ĩ₁, ..., ĩᵣ, j₁, ..., j_s] ∼L v, that is, u ∼L v.
35/49

Star-free languages
▶ Star-free languages: built from ∅ and the letters, using boolean operations and product finitely many times.
1. ∅ ∈ SF(A∗) and {a} ∈ SF(A∗),
2. K, L ∈ SF(A∗) ⇒ K ∪ L ∈ SF(A∗),
3. L ∈ SF(A∗) ⇒ A∗ \ L ∈ SF(A∗),
4. K, L ∈ SF(A∗) ⇒ KL ∈ SF(A∗).
▶ Examples of star-free languages:
  ▶ A∗ = A∗ \ ∅,
  ▶ finite languages,
  ▶ piecewise testable languages,
  ▶ B∗ = A∗ \ ⋃_{c∉B} A∗cA∗,
  ▶ (ab)⁺ = aA∗ ∩ A∗b ∩ (A∗ \ A∗(aa ∪ bb)A∗). This is not piecewise testable, because its syntactic monoid is not J-trivial.
▶ Star-free languages are regular, but (aa)∗ is not star-free. How to prove it?
36/49

Aperiodic monoids
▶ A finite monoid is aperiodic if it has only trivial subgroups, or equivalently:
  ∃n, ∀s ∈ M, sⁿ = sⁿ⁺¹, or equivalently again, ∀s ∈ M, s^ω = s^{ω+1}.
▶ A language is aperiodic if it is recognized by a finite aperiodic monoid.
37/49

The Schützenberger-Kamp-McNaughton-Papert theorem

Theorem (Schützenberger, Kamp, McNaughton-Papert)
For a word language L ⊆ A∗, TFAE:
1. L can be expressed in FO(<).
2. L can be expressed in FO³(<).
3. L can be expressed in LTL with past operators.
4. L can be expressed in pure future LTL.
5. L is star-free.
6. L is aperiodic.
7. L is recognizable and its minimal automaton is counter-free: no loop q →u p →u ··· →u q with u ∈ A⁺ and p ≠ q.

▶ A pool of difficult results.
▶ Elegant proof of 4 ⇔ 6 by Th. Wilke, refined by Diekert, Gastin, Kufleitner.
▶ 6 ⇒ decidable! Complexity of 7: PSPACE-complete (J. Stern).
▶ Some implications are trivial, e.g. 4 ⇒ 3 ⇒ 2 ⇒ 1, or 6 ⇔ 7.
38/49

Example: L = (ab)⁺
▶ Counter-free minimal automaton (sink omitted): 1 −a→ 2 −b→ 3, 3 −a→ 2.
▶ M(L) has 6 elements {1, a, b, a², ab, ba}, and s² = s³ for all s ∈ M.
▶ Star-free: (ab)⁺ = aA∗ ∩ A∗b ∩ (A∗ \ A∗(aa ∪ bb)A∗).
▶ FO(<) and FO³(<):
  ∀x (a(x) ∨ b(x)) ∧ ∀y∀z ((z = y + 1) ⇒ [a(y) ⇔ b(z)]) ∧ first = {a} ∧ last = {b}.
▶ LTL: G(a ∨ b) ∧ a ∧ F(b ∧ ¬X⊤) ∧ G(a ↔ Xb).
39/49
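Conditions 6 and 7 suggest concrete decision procedures. Below is a hedged sketch with our own encodings: a finite monoid is aperiodic iff s^|M| = s^{|M|+1} for every s, and a DFA is counter-free iff every element of its transition monoid has aperiodic powers. (The transition monoid can be exponentially large, consistent with the PSPACE-completeness noted above.)

```python
def is_aperiodic(mul):
    """mul[x][y] = x·y over 0..n-1; aperiodic iff s^n = s^(n+1) for all s
    (the exponent n = |M| always suffices in a finite monoid)."""
    n = len(mul)
    for s in range(n):
        p = s
        for _ in range(n - 1):
            p = mul[p][s]        # after the loop, p = s^n
        if mul[p][s] != p:       # s^(n+1) ≠ s^n: a nontrivial subgroup
            return False
    return True

def is_counter_free(delta, n_states):
    """delta: letter -> tuple of successor states (a transformation).
    Counter-free iff the generated transition monoid is aperiodic."""
    def compose(f, g):           # apply f, then g
        return tuple(g[f[q]] for q in range(n_states))

    # close the letter transformations under composition
    monoid = {tuple(range(n_states))}
    todo = list(monoid)
    while todo:
        f = todo.pop()
        for g in delta.values():
            h = compose(f, g)
            if h not in monoid:
                monoid.add(h)
                todo.append(h)

    for f in monoid:             # powers of f must stabilize, not cycle
        p, seen = f, {f}
        while True:
            q = compose(p, f)
            if q == p:
                break            # idempotent power reached
            if q in seen:
                return False     # nontrivial cycle of powers: a counter
            seen.add(q)
            p = q
    return True

# Minimal automaton of (ab)+ with sink state 3: counter-free
print(is_counter_free({"a": (1, 3, 1, 3), "b": (3, 2, 3, 3)}, 4))  # True
# Automaton of (aa)*: the letter a swaps the two states, a counter
print(is_counter_free({"a": (1, 0)}, 2))                           # False
```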
From Star-free to Aperiodic (easy)

Lemma: star-free languages are aperiodic
Any star-free language is aperiodic. In particular, (aa)∗ is not star-free, since it is not aperiodic.

Proof
Induction on L ∈ SF(A∗): find i(L) such that ∀u ∈ A∗, u^{i(L)} ∼L u^{i(L)+1}.
▶ i(∅) = 0 and i(a) = 2.
▶ i(A∗ \ L) = i(L).
▶ i(K ∪ L) = max(i(K), i(L)).
▶ i(KL) = i(K) + i(L): if w = x u^{i(K)+i(L)} y ∈ KL, then
  ▶ either x u^{i(K)} y′ ∈ K for some prefix x u^{i(K)} y′ of w, whence x u^{i(K)+1} y′ ∈ K, and therefore x u^{i(K)+i(L)+1} y ∈ KL,
  ▶ or the symmetric case.
40/49

From Aperiodic to Star-free (difficult)
Original intuition of the proof from Th. Wilke; also works from aperiodic to LTL.
▶ See M as a transformation monoid, acting on the set Q = M of states.
▶ Induction on (|M|, |A|), ordered lexicographically.
▶ Either all letters induce the identity on Q: easy.
▶ Or some letter a does not act surjectively on Q. [Aperiodicity is used here.]
▶ In this case, decompose words as u₁au₂a ··· auₙ.
▶ The initial and final segments u₁, uₙ are over a smaller alphabet.
▶ The intermediate segments use fewer states.
41/49

Tool for Induction (Aperiodic to Star-free)
▶ Suitable construction for the induction [Diekert, Gastin].
▶ For m ∈ M, define a new internal composition on the set mM ∩ Mm: xm ∘ my = xmy.
▶ One can check that this is a well-defined product.
▶ We have mx ∘ my = mxy, hence (mM ∩ Mm, ∘, m) is a monoid.
▶ If M is aperiodic, then so is (mM ∩ Mm, ∘, m).
▶ If M is aperiodic and m ≠ 1, then |mM ∩ Mm| < |M|. Since in this case 1 ∉ mM:
  ms = 1 ⇒ m^ω s^ω = 1 ⇒ m = m·(m^ω s^ω) = m^{ω+1} s^ω = m^ω s^ω = 1.
42/49

From Aperiodic to Star-free (1)
Proof of V. Diekert, M. Kufleitner (adapted from Th. Wilke)
▶ Fix an aperiodic α : A∗ → M and let L = α⁻¹(α(L)) = ⋃_{s∈α(L)} α⁻¹(s).
▶ Induction on (|M|, |A|), ordered lexicographically.
▶ Enough to show that each α⁻¹(s) is star-free.
▶ Assume first s = 1. Then α⁻¹(1) = {a ∈ A | α(a) = 1}∗, hence star-free. Indeed, in an aperiodic monoid, uv = 1 ⇒ u = v = 1.
43/49

From Aperiodic to Star-free (2)
Proof of V. Diekert, M. Kufleitner (adapted from Th. Wilke)
▶ Assume now s ≠ 1.
▶ If α(u) = s, then u contains a letter a such that α(a) ≠ 1. Write 𝐚 = α(a).
▶ Let B = A \ {a}, 𝐁 = α(B∗), and let β : B∗ → 𝐁 be the restriction of α to B∗.
▶ 𝐁 is a submonoid of M (and will be considered as an alphabet, too).
▶ Factoring u = u₁au₂a ··· with a ∉ alph(u₁u₃) yields
  α⁻¹(s) = ⋃_{a∈A, α(a)≠1} ⋃_{s=s₁s₂s₃} β⁻¹(s₁) · (α⁻¹(s₂) ∩ aA∗ ∩ A∗a) · β⁻¹(s₃).
▶ By the induction hypothesis, each β⁻¹(sᵢ) is star-free (|𝐁| ≤ |M| and |B| < |A|).
▶ Therefore we are left to show that α⁻¹(s) ∩ aA∗ ∩ A∗a is star-free.
44/49
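The localized monoid (mM ∩ Mm, ∘, m) can be experimented with concretely. A hedged sketch with our own encoding (multiplication table over 0..n−1): to compute u ∘ v we pick any witness y with m·y = v and return u·y; that the result does not depend on the witness is exactly the well-definedness claimed above.

```python
def local_monoid(mul, m):
    """Return (elements, table) of (mM ∩ Mm, ∘, m), where xm ∘ my = xmy."""
    n = len(mul)
    mM = {mul[m][y] for y in range(n)}
    Mm = {mul[x][m] for x in range(n)}
    elems = sorted(mM & Mm)           # m itself is the new neutral element
    witness = {}
    for y in range(n):                # some y with m·y = v, for each v ∈ mM
        witness.setdefault(mul[m][y], y)
    circ = {(u, v): mul[u][witness[v]] for u in elems for v in elems}
    return elems, circ

# Multiplication table of the syntactic monoid of (ab)+ (cf. the example
# slide): elements 0:1, 1:a, 2:b, 3:ab, 4:ba, 5:zero (= a·a = b·b)
MUL = [[0, 1, 2, 3, 4, 5],
       [1, 5, 3, 5, 1, 5],
       [2, 4, 5, 2, 5, 5],
       [3, 1, 5, 3, 5, 5],
       [4, 5, 2, 5, 4, 5],
       [5, 5, 5, 5, 5, 5]]
elems, circ = local_monoid(MUL, 3)    # localize at m = ab
print(elems)                          # [3, 5]: {ab, 0}, strictly smaller than M
print(circ[(3, 3)], circ[(3, 5)])     # 3 5 : ab is neutral, 0 absorbing
```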
From Aperiodic to Star-free (3)
Proof of V. Diekert, M. Kufleitner (adapted from Th. Wilke)
▶ Let us show that α⁻¹(s) ∩ aA∗ ∩ A∗a is star-free.
▶ Let 𝐁∗ be the free monoid over the alphabet 𝐁 = α(B∗) (we could have |𝐁| > |A|).
▶ "Decompose" α as aA∗ −σ→ 𝐁∗ −γ→ (𝐚M ∩ M𝐚, ∘, 𝐚), where 𝐚 = α(a), as follows.
▶ σ(au₁au₂ ··· auₙ) = β(u₁) · β(u₂) ··· β(uₙ). (Erase the a's.)
▶ γ is the morphism defined, for b ∈ 𝐁, by γ(b) = 𝐚b𝐚. (Reintroduce them morphically!)
▶ This is almost a factorization of α:
  α⁻¹(s) ∩ aA∗ ∩ A∗a = σ⁻¹(γ⁻¹(s))·a.
▶ Since |𝐚M ∩ M𝐚| < |M|, we get γ⁻¹(s) ∈ SF(𝐁∗).
▶ It remains to show that σ⁻¹ preserves star-freeness.
▶ For a one-letter word b ∈ 𝐁, it holds that σ⁻¹(b) = aβ⁻¹(b): use |B| < |A|.
▶ SF operators commute with σ⁻¹, e.g., σ⁻¹(K ∪ L) = σ⁻¹(K) ∪ σ⁻¹(L).
45/49
Summary
▶ Many classes of regular languages have an algebraic characterization.
▶ A general framework is given by the Eilenberg and Reiterman theorems.
▶ Variety of regular languages V: a map A ↦ A∗V, closed under
  ▶ boolean combinations,
  ▶ inverse images under morphisms ϕ : A∗ → B∗,
  ▶ quotients L ↦ a⁻¹L, L ↦ La⁻¹.
▶ Variety of finite monoids: class closed under (M, N) ↦ M × N and division.
▶ Eilenberg's theorem: bijective correspondence between varieties of regular languages and varieties of finite monoids.
▶ Reiterman's theorem: equational definition of varieties.
46/49

Extensions
▶ To other classes than varieties:
  ▶ drop complement: positive varieties (Pin),
  ▶ drop closure under quotients (Pippenberg),
  ▶ drop both (Polák).
▶ General framework: duality (Gehrke, Grigorieff, Pin): definition by (in)equations/identities.
▶ To other structures, like infinite words, Mazurkiewicz traces, trees.
47/49

Some related references

V. Diekert and P. Gastin. First-order definable languages. In J. Flum, E. Grädel, and T. Wilke, editors, Logic and Automata: History and Perspectives, volume 2 of Texts in Logic and Games, pages 261–306. Amsterdam University Press, 2008.

V.
Diekert, P. Gastin, and M. Kufleitner. A survey on small fragments of first-order logic over finite words. Int. J. Found. Comput. Sci. (IJFCS), 19(3):513–548, 2008.

O. Klíma. Piecewise testable languages via combinatorics on words. In Words'09, 2009.

J.-E. Pin. Syntactic semigroups. In G. Rozenberg and A. Salomaa, editors, Handbook of Formal Languages, vol. 1, pages 679–746. Springer, 1997.

Th. Schwentick, D. Thérien, and H. Vollmer. Partially-ordered two-way automata: a new characterization of DA. In DLT'02, pages 1–28. Springer, 2002.

P. Tesson and D. Thérien. Diamonds are forever: the variety DA. In Semigroups, Algorithms, Automata and Languages (Coimbra, Portugal, 2001), pages 475–500. World Scientific, 2002.

P. Tesson and D. Thérien. Logic meets algebra: the case of regular languages. Logical Methods in Computer Science, 3(1), 2007.

W. Thomas. Languages, automata, and logic. In G. Rozenberg and A. Salomaa, editors, Handbook of Formal Languages, vol. 3: Beyond Words, pages 389–455. Springer, 1997.

Th. Wilke. Classifying discrete temporal properties. In STACS'99, LNCS, pages 32–46. Springer, 1999.
48/49

Questions?
49/49