Alternating Finite Automata and Star-Free Languages

Total Page:16

File Type:pdf, Size:1020Kb

Alternating Finite Automata and Star-Free Languages Theoretical Computer Science 234 (2000) 167–176 www.elsevier.com/locate/tcs View metadata, citation and similar papers at core.ac.uk brought to you by CORE Alternating ÿnite automata and star-free languages provided by Elsevier - Publisher Connector Kai Salomaa ∗;1, Sheng Yu 2 Department of Computer Science, University of Western Ontario, London, Ont., Canada N6A 5B7 Received May 1997; revised November 1997 Communicated by G. Rozenberg Abstract For a given extended regular expression e we construct an equational representation of an alternating ÿnite automaton accepting the language denoted by e. For star-free extended regular expressions the construction yields a loop-free alternating ÿnite automaton. Also the inclusion in the opposite direction holds and, thus, we obtain a new characterization for the class of star-free languages. c 2000 Elsevier Science B.V. All rights reserved Keywords: Finite automata; Alternation; Star-free languages; Language equations 1. Introduction Alternating ÿnite automata (AFA), or Boolean automata, are a natural generaliza- tion of nondeterministic automata which provide a succinct representation for regular languages. This model was ÿrst considered in [4, 3], since then much work has been done on alternating (ÿnite) automata, see e.g., [5, 7–11, 16]. Systems of language equations can be used as a convenient and intuitively clear representation for alternating ÿnite automata [3, 6, 10, 11, 16]. In this paper we present a recursive construction that, for a given extended regular expression e, produces an equational representation Eqr(e) of an AFA that recognizes the language denoted by e. Extended regular expressions allow the use of complementation and intersection in addition to the operations employed by the standard regular expressions. A regular language is said to be star-free [12, 16] if it can be denoted by an extended regular expression without ∗-operators. The star-free languages form the lowest level ∗ Corresponding author. E-mail address: [email protected] (K. Salomaa) 1 Research supported by the Academy of Finland Grant 14018. 2 Research supported by the Natural Sciences and Engineering Research Council of Canada Grant OGP0041630. 0304-3975/00/$ - see front matter c 2000 Elsevier Science B.V. All rights reserved PII: S0304-3975(97)00047-4 168 K. Salomaa, S. Yu / Theoretical Computer Science 234 (2000) 167{176 of the extended star-height hierarchy. It is known that there exist regular languages that are not star-free but it remains an open problem whether there exist languages having extended star-height greater than one [2, 16]. An interesting characterization of star-freeness in terms of the noncounting property was given by Schutzenberger in 1965. A set S ⊆ ∗ is said to be noncounting if there exists an integer n>0 such that for all x; y; z ∈ ∗, xy nz ∈ S if and only if xy n+1z ∈ S. The characterization theorem of Schutzenberger [14, 15] states that a regular language is star-free if and only if it is noncounting. Meyer [13] showed that the family of group-free regular languages consists of exactly the star-free languages. In this paper, we obtain a new characterization for the family of star-free languages in terms of loop-free alternating ÿnite automata. Intuitively, an AFA A is said to be loop-free if there is a total order 6 on the state set of A such that any state p does not depend on any state q where q6p. We establish that if e is a star-free extended regular expression then the corresponding system of equations Eqr(e) represents a loop-free AFA. Also the converse inclusion holds and, thus, the loop-free AFA deÿne exactly the family of star-free languages. 2. Preliminaries Here we brie y recall the deÿnitions of extended regular expressions and alternating ÿnite automata. For a more detailed presentation and examples the reader is referred to [16]. The set of ÿnite words over an alphabet is ∗ and stands for the empty word. For L ⊆ ∗, L is the complement of L with respect to ∗. The cardinality of a ÿnite set A is |A|. We denote the set of non-negative integers by N0. The set of extended regular expressions E over an alphabet is the smallest subset of ( ∪{; ∅; +; ·; ∗; ¬; ∩; (; )})∗ that satisÿes the following conditions: (i) ∪{; ∅} ⊆ E. ∗ (ii) If e1;e2 ∈E then also (e1 + e2); (e1 ∩ e2); (e1 · e2); (¬e1) and (e1 ) belong to E. An expression e ∈ E is star-free if e does not contain any occurrence of the operator star-free ∗. The set of star-free extended regular expressions over is denoted E . The language denoted by an extended regular expression e ∈ E, L(e), is deÿned inductively as follows: (i) L(∅)=∅,L()={}, and L(a)={a}for a ∈ . (ii) Let e1;e2 ∈ . Then L((e1 + e2)) = L(e1) ∪ L(e2), L((e1 ∩ e2)) = L(e1) ∩ L(e2), ∗ ∗ L((e1 · e2)) = L(e1)L(e2), L((e1 )) = L(e1) , and L((¬e1)) = L(e1). We assume that the binary operators have precedence in the order +; ∩; · from lowest to highest and the unary operators are applied before the binary operators. A pair of parentheses may be omitted whenever the omission would not cause confusion. Also, often we do not write the symbol · denoting catenation in regular expressions. Note that the above deÿnition of extended regular expressions is not minimal be- cause the omission of, for instance, the empty set and the intersection operation would K. Salomaa, S. Yu / Theoretical Computer Science 234 (2000) 167{176 169 not change the family of languages deÿned by the (star-free) extended regular expressions. Alternating ÿnite automata, AFA, are an extension of nondeterministic ÿnite au- tomata. Informally, the operation of an AFA can be described as follows. When the automaton reads an input symbol a in a given state q, it will activate all states of the automaton to work on the remaining part of the input in parallel. Once the states have completed their tasks, q will compute its value by applying a Boolean function to the results obtained and pass on the resulting value to the state by which it was activated. A word w is accepted if the starting state computes the value of 1 and it is rejected otherwise. Below we brie y recall the deÿnition of AFA and their equational representation. For more details see [4, 7, 16]. We denote by the symbol B the two-element Boolean algebra B=({0; 1}; ∨; ∧; ¬; 0; 1). Let Q be a ÿnite set. Then BQ is the set of all mappings of Q into B. Note that u ∈ BQ can be considered as a vector of |Q| entries, indexed by elements of Q, with each entry Q being from B. For u ∈ B and q ∈ Q, we write uq to denote the image of q under u. An alternating ÿnite automaton (AFA) is a quintuple A =(Q; ; s; F; g) where Q is the ÿnite set of states, is the input alphabet, s ∈ Q is the starting state, F ⊆ Q is the set of ÿnal states, and g is a function from Q into the set of all functions from × BQ into B. Note that for each state q ∈ Q, g(q) is a function from × BQ into B, which we will denote by gq in the sequel. For each state q ∈ Q and a ∈ , we deÿne gq(a)to be the Boolean function BQ → B such that Q gq(a)(u)=gq(a; u);u∈B: Q Thus, for u ∈ B , the value of gq(a)(u), also gq(a; u), is either 1 or 0. Q Q We deÿne the function gQ : × B → B by putting together the |Q| functions Q Q gq : × B → B, q ∈ Q, as follows. For a ∈ and u; v ∈ B , gQ(a; u)=vif and only if gq(a; u)=vq for each q ∈ Q. For convenience, we will write g(a; u) instead of gQ(a; u) in the following. The characteristic vector of F, f ∈ BQ, is deÿned by the condition fq =1 ⇐⇒ q ∈ F: We extend g to a function of Q into the set of all functions ∗ ×BQ → B as follows: uq if w = ; gq(w; u)= 0 0 0 ∗ gq(a; g(w ;u)) if w = aw with a ∈ and w ∈ ; where w ∈ ∗ and u ∈ BQ. ∗ A word w ∈ is accepted by A if and only if gs(w; f) = 1, where s is the starting state and f is the characteristic vector of F. The language accepted by A is the set ∗ L(A)={w∈ |gs(w; f)=1}: 170 K. Salomaa, S. Yu / Theoretical Computer Science 234 (2000) 167{176 Example 2.1. We deÿne an AFA A =(Q; ; s; F; g) where Q = {q0;q1;q2},={a; b}, s = q0, F = {q2}, and g is given by State a b q0 q1 ∧ q2 0 q1 q2 q1 ∧ q2 q2 ¬(q1) ∧ q2 q1 ∨¬(q2) Let w = aba. Then w is accepted by the AFA A as follows: (aba; f)=g (ba; f) ∧ g (ba; f) gq0 q1 q2 =(gq1(a; f) ∧ gq2 (a; f)) ∧ (gq1 (a; f) ∨¬gq2(a; f)) (; f) ∧ (¬g (; f) ∧ g (; f))) ∧ (g (; f) =(gq2 q1 q2 q2 (; f) ∧ g (; f))) ∨¬(¬gq1 q2 ∧(¬f ∧f )) ∧ (f ∨¬(¬f ∧f )) =(fq2 q1 q2 q2 q1 q2 =(1∧(¬0∧1)) ∧ (1 ∨¬(¬0∧1))=1: Each AFA A =(Q; ; s; F; g) can be represented by a system of language equations of the following form: P Xq = a · gq(a; X )+q;q∈Q; (1) a2 where X is a vector of |Q| variables Xq indexed by the states q ∈ Q, and if q ∈ F; = q 0 otherwise; for each q ∈ Q.
Recommended publications
  • Language and Automata Theory and Applications
    LANGUAGE AND AUTOMATA THEORY AND APPLICATIONS Carlos Martín-Vide Characterization • It deals with the description of properties of sequences of symbols • Such an abstract characterization explains the interdisciplinary flavour of the field • The theory grew with the need of formalizing and describing the processes linked with the use of computers and communication devices, but its origins are within mathematical logic and linguistics A bit of history • Early roots in the work of logicians at the beginning of the XXth century: Emil Post, Alonzo Church, Alan Turing Developments motivated by the search for the foundations of the notion of proof in mathematics (Hilbert) • After the II World War: Claude Shannon, Stephen Kleene, John von Neumann Development of computers and telecommunications Interest in exploring the functions of the human brain • Late 50s XXth century: Noam Chomsky Formal methods to describe natural languages • Last decades Molecular biology considers the sequences of molecules formed by genomes as sequences of symbols on the alphabet of basic elements Interest in describing properties like repetitions of occurrences or similarity between sequences Chomsky hierarchy of languages • Finite-state or regular • Context-free • Context-sensitive • Recursively enumerable REG ⊂ CF ⊂ CS ⊂ RE Finite automata: origins • Warren McCulloch & Walter Pitts. A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 5:115-133, 1943 • Stephen C. Kleene. Representation of events in nerve nets and
    [Show full text]
  • Some Results on the Generalized Star-Height Problem Jean-Eric Pin, Howard Straubing, Denis Thérien
    Some results on the generalized star-height problem Jean-Eric Pin, Howard Straubing, Denis Thérien To cite this version: Jean-Eric Pin, Howard Straubing, Denis Thérien. Some results on the generalized star-height problem. Information and Computation, Elsevier, 1992, 101, pp.219-250. hal-00019978 HAL Id: hal-00019978 https://hal.archives-ouvertes.fr/hal-00019978 Submitted on 2 Mar 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. May 12, 2002 Some results on the generalized star-height problem Jean-Eric Pin∗, H. Straubingyand D. Th´erienz [email protected], [email protected], [email protected] Abstract We prove some results related to the generalized star-height problem. In this problem, as opposed to the restricted star-height problem, com- plementation is considered as a basic operator. We first show that the class of languages of star-height ≤ n is closed under certain operations (left and right quotients, inverse alphabetic morphisms, injective star-free substitutions). It is known that languages recognized by a commutative group are of star-height 1. We extend this result to nilpotent groups of class 2 and to the groups that divide a semidirect product of a commu- tative group by (Z=2Z)n.
    [Show full text]
  • Some Results on the Generalized Star-Height Problem
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Elsevier - Publisher Connector INFORMATION AND COMPUTATION 101, 219-250 (1992) Some Results on the Generalized Star-Height Problem J. E. PIN,* H. STRAUBING,~ AND D. TH~RIEN~ LITP, Universiti de Paris VI. 4 place Jussieu, 75252 Paris Cedex, France We prove some results related to the generalized star-height problem. In this problem. as opposed to the restricted star-height problem, complementation is considered as a basic operator. We first show that the class of languages of star- height <n is closed under certain operations (left and right quotients, inverse alphabetic morphisms. injective star-free substitutions). It is known that languages recognized by a commutative group are of star-height 1. We extend this result to nilpotent groups of class 2 and to the groups that divide a semidirect product of a commutative group by (L/2Z)“. In the same direction. we show that one of the languages that were conjectured to be of star-height 2 during the past ten years is in fact of star-height 1. Next we show that if a rational language L is recognized by a monoid of the variety generated by wreath products of the form Mb (G r N), where M and N are aperiodic monoids, and G is a commutative group, then L is of star-height < 1. Finally we show that every rational language is the inverse image, under some morphism between free monoids. of a language of (resticted) star-height 1. c 1992 Academic Press. Inc. The determination of the star-height of a rational language is an old problem of formal language theory (see Brzozowski, 1980, for an historical survey).
    [Show full text]
  • Languages of Generalised Star Height 1
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Theoretical Computer Science 106 (1992) 327-335 327 Elsevier Note More languages of generalised star height 1 J.M. Robson Department qf‘ Computer Science, GPO Box 4, Canberra, ACT 2601, Australia Communicated by D. Perrin Received September 1991 Revised January 1992 Abstract Robson, J.M., More languages of generalized star height 1, Theoretical Computer Science 106 (1992) 327-335. Suppose that membership of a word w in a language L is specified in terms of the number (modulo some integer k) of occurrences of a fixed word W’ as a factor of w. Then the language obtained for any M by adding arbitrary numbers of copies of a”’ anywhere in the words of L has generalized star height at most 1. 1. Introduction The “star height” of a regular language is defined as the minimum nesting depth of stars in a regular expression denoting the language. This definition can be interpreted in the context of regular expressions using only concatenation, union and star operators (the “restricted” star height), or of expressions using also intersection and complement operators (“generalised” star height). In this paper we are interested only in generalised star height and we will consistently use “star height” or even “height” to mean generalised star height. Schutzenberger [4] gave a simple characterisation of languages of star height 0 (or “star-free”): a language has star height 0 if and only if its syntactic monoid is aperiodic. Correspondence to: J.M.
    [Show full text]
  • Star Height Via Games
    Star height via games Mikołaj Bojanczyk´ (University of Warsaw) Abstract—This paper proposes a new algorithm deciding the I would like to thank Nathanael¨ Fijalkow, Denis Kuperberg, star height problem. As shown by Kirsten, the star height prob- Christof Loding¨ and the anonymous referees for their helpful lem reduces to a problem concerning automata with counters, comments. This work has been supported by the Polish NCN called limitedness. The new contribution is a different algorithm for the limitedness problem, which reduces it to solving a Gale- grant 2014-13/B/ST6/03595. Stewart game with an !-regular winning condition. I. REDUCTION TO LIMITEDNESS OF COST AUTOMATA The star height of a regular expression is its nesting depth This section introduces cost automata, and describes a of the Kleene star. The star height problem is to compute reduction of the star height problem to a problem for cost the star height of a regular language of finite words, i.e. the automata, called limitedness. The exact reduction used in this smallest star height of a regular expression that defines the paper was shown in [Kir05] and is included to make the paper language. For instance, the expression (a∗b∗)∗ has star height self-contained. The idea to use counters and limitedness in two, but its language has star height one, because it is defined the study of star height dates back to the original proof of by the expression (a + b)∗ of star height one, and it cannot Hashiguchi. be defined by a star-free expression. Here we consider regular Cost automata and the limitedness problem: The lim- expressions without complementation, in which case star-free itedness problem concerns an automaton model with coun- regular expressions define only finite languages.
    [Show full text]
  • Generic Results for Concatenation Hierarchies
    GENERIC RESULTS FOR CONCATENATION HIERARCHIES THOMAS PLACE AND MARC ZEITOUN Abstract. In the theory of formal languages, the understanding of concatenation hierarchies of regular languages is one of the most fundamental and challenging topic. In this paper, we survey progress made in the comprehension of this problem since 1971, and we establish new generic statements regarding this problem. 1. Introduction This paper has a dual vocation. The first is to outline progress seen during the last 50 years about concatenation hierarchies of regular languages. The second is to provide generic statements and elementary proofs of some of the core results on this topic, which were obtained previously in restricted cases. In this introduction, we present the historical background, first highlighting the motivations and the key ideas that emerged since the mid 60s. In a second part, we describe the contributions of the paper, which are either new proofs of existing results or generalizations thereof. Historical background: a short survey of 50 years of research. Concatenation hierarchies were introduced in order to understand the interplay between two basic constructs used to build regular languages: Boolean operations and concatenation. The story started in 1956 with Kleene’s theorem [14], one of the key results in automata theory. It states that languages of finite words recognized by finite automata are exactly the ones that can be described by regular expressions, i.e., are built from the singleton languages and the empty set using a finite number of times operations among three basic ones: union, concatenation, and iteration (a.k.a. Kleene star). As Kleene’s theorem provides another syntax for regular languages, it makes it possible to classify them according to the hardness of describing a language by such an expression.
    [Show full text]
  • INSTITUT FÜR INFORMATIK Finite Automata, Digraph Connectivity
    T U M INSTITUT FÜR INFORMATIK Finite Automata, Digraph Connectivity, and Regular Expression Size Hermann Gruber Markus Holzer TUM-I0725 Dezember 07 TECHNISCHE UNIVERSITÄT MÜNCHEN TUM-INFO-12-I0725-0/1.-FI Alle Rechte vorbehalten Nachdruck auch auszugsweise verboten c 2007 Druck: Institut für Informatik der Technischen Universität München Finite Automata, Digraph Connectivity, and Regular Expression Size Hermann Gruber Institut für Informatik, Ludwig-Maximilians-Universität München, Oettingstraße 67, 80538 München, Germany [email protected] Markus Holzer Institut für Informatik, Technische Universität München, Boltzmannstraße 3, D-85748 Garching, Germany [email protected] Abstract We study some descriptional complexity aspects of regular expressions. In particular, we give lower bounds on the minimum required size for the conversion of deterministic finite automata into regular expressions and on the required size of regular expressions resulting from applying some basic language operations on them, namely intersection, shuffle, and complement. Some of the lower bounds obtained are asymptotically tight, and, notably, the examples we present are over an alphabet of size two. To this end, we develop a new lower bound technique that is based on the star height of regular languages. It is known that for a restricted class of regular languages, the star height can be determined from the digraph underlying the transition structure of the minimal finite automaton accepting that language. In this way, star height is tied to cycle rank, a structural complexity measure for digraphs proposed by Eggan and Büchi, which measures the degree of connectivity of directed graphs. This measure, although a robust graph theoretic notion, appears to have remained widely unnoticed outside formal language theory.
    [Show full text]
  • Algorithms for Determining Relative Star Height and Star Height*
    INFORMATION AND COMPUTATION 78, 124-169 (1988) Algorithms for Determining Relative Star Height and Star Height* KOSABURO HASHIGUCHI Department of Information and Computer Sciences, Toyohashi University of Technology, Tempaku, Toyohashi 440, Japan Let W= {R,, . .. R,} be a finite class of regular languages over a finite alphabet Z. Let A = {b,, .. 6,) be an alphabet, and 6 be the substitution from A* into Z* such that S(b,) = R, for all i (1 < id m). Let R be a regular language over Z which can be defined from %?by a finite number of applications of the operators union, concatenation, and star. Then there exist regular languages over A which can be transformed onto R by 6. The relative star height of R w.r.t. V is the minimum star height of regular languages over A which can be transformed onto R by 6. This paper proves the existence of an algorithm for determining relative star height. This result obviously implies the existence of an algorithm for determining the star height of any regular language. 0 1988 Academic PRSS. I~C. 1. 1NTRo~ucT10~ Eggan (1963) introduced the notion of star height, and proved that for any integer k 20, there exists a regular language of star height k. The problem was left open in his paper to determine the star height of any regular language. Dejean and Schutzenberger (1966) showed that for any integer k 2 0, there exists a regular language of star height k over the two letter alphabet. McNaughton (1967) presented an algorithm for determin- ing the loop complexity (i.e., the star height) of any regular language whose syntactic monoid is a group.
    [Show full text]
  • From Finite Automata to Regular Expressions and Back—A Summary on Descriptional Complexity
    From Finite Automata to Regular Expressions and Back—A Summary on Descriptional Complexity Hermann Gruber Markus Holzer knowledgepark AG, Leonrodstr. 68, Institut f¨ur Informatik, Universit¨at Giessen, 80636 M¨unchen, Germany Arndtstr. 2, 35392 Giessen, Germany [email protected] [email protected] The equivalence of finite automata and regular expressions dates back to the seminal paper of Kleene on events in nerve nets and finite automata from 1956. In the present paper we tour a fragment of the literature and summarize results on upper and lower bounds on the conversion of finite automata to regular expressions and vice versa. We also briefly recall the known bounds for the removal of spontaneous transitions (ε-transitions) on non-ε-free nondeterministic devices. Moreover, we report on recent results on the average case descriptional complexity bounds for the conversion of regular expressions to finite automata and brand new developments on the state elimination algorithm that converts finite automata to regular expressions. 1 Introduction There is a vast literature documenting the importance of the notion of finite automata and regular ex- pressions as an enormously valuable concept in theoretical computer science and applications. It is well known that these two formalisms are equivalent, and in almost all monographs on automata and formal languages one finds appropriate constructions for the conversion of finite automata to equivalent regular expressions and back. Regular expressions, introduced by Kleene [68], are well suited for human users and therefore are often used as interfaces to specify certain patterns or languages. For example, in the widely available programming environment UNIX, regular(-like) expressions can be found in legion of software tools like, e.g., awk, ed, emacs, egrep, lex, sed, vi, etc., to mention a few of them.
    [Show full text]
  • The Star-Height of a Finite Automaton and Some Related Questions
    International Journal of Open Information Technologies ISSN: 2307-8162 vol. 6, no. 7, 2018 The star-height of a finite automaton and some related questions B. F. Melnikov Abstract—The paper is related to the star-height for non- was some improved using an approach of the game theory, deterministic finite automata, not for the star-height problem see [8]. And unlike the previous comment, the last proof was for regular languages. We describe an alternative proof of called “elegant”. Although it does improve the understanding Kleene’s theorem, and then our version of the reduction of some problems related to the star-height to nondeterministic of the proof of D. Kirsten, the number of automata being finite automata. For a given regular language, the corresponding examined due to this “elegance” does not decrease. finite automaton is constructed; the method we are considering The second of these mentioned problems is the generalized has the property that the reverse construction gives the original star-height problem. For now, it is unknown the answer regular expression. to the question whether or not this problem is decidable. The publication of this not-so-complicated problem has two goals, the both are related to the future development of the topic The author also does not know significant papers on this under consideration. First, we assume the future development of subject published after [9] (1992). The connection between the topic with the aim of describing a similar approach for gen- the generalized star-height problem and generalized pseudo- eralized regular expressions. Secondly, another generalization automata (see [10] for the last formalism) is to be considered is proposed, namely, consideration of the structures we describe in one of the following publications.
    [Show full text]
  • Open Problems About Regular Languages, 35 Years Later Jean-Eric Pin
    Open problems about regular languages, 35 years later Jean-Eric Pin To cite this version: Jean-Eric Pin. Open problems about regular languages, 35 years later. Stavros Konstantinidis; Nelma Moreira; Rogério Reis; Jeffrey Shallit. The Role of Theory in Computer Science - Essays Dedicated to Janusz Brzozowski, World Scientific, 2017, 9789813148208. 10.1142/9789813148208_0007. hal- 01614375v2 HAL Id: hal-01614375 https://hal.archives-ouvertes.fr/hal-01614375v2 Submitted on 27 Jan 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Open problems about regular languages, 35 years later Dedicated to Janusz A. Brzozowski for his 80th birthday Jean-Eric´ Pin ∗ IRIF, CNRS and Universit´eParis-Diderot, Case 7014, 75205 Paris Cedex 13, France. [email protected] October 2016 Abstract In December 1979, Janusz A. Brzozowski presented a selection of six open problems about regular languages and mentioned two other problems in the conclusion of his article. These problems have been the source of some of the greatest breakthroughs in automata theory over the past 35 years. This survey article summarizes the state of the art on these questions and the hopes for the next 35 years.
    [Show full text]
  • On the Star-Height of Factor Counting Languages and Their Relationship To
    ON THE STAR-HEIGHT OF SUBWORD COUNTING LANGUAGES AND THEIR RELATIONSHIP TO REES ZERO-MATRIX SEMIGROUPS TOM BOURNE AND NIK RUSKUCˇ Abstract. Given a word w over a finite alphabet, we consider, in three special cases, the generalised star-height of the languages in which w occurs as a contiguous subword (factor) an exact number of times and of the languages in which w occurs as a contiguous subword modulo a fixed number, and prove that in each case it is at most one. We use these combinatorial results to show that any language recognised by a Rees (zero-)matrix semigroup over an abelian group is of generalised star-height at most one. 1. Introduction and preliminaries The generalised star-height problem, which asks whether or not there exists an algo- rithm to compute the generalised star-height of a regular language, is a long-standing problem in the field of formal language theory. In particular, it is not yet known whether there exist languages of generalised star-height greater than one; see [10, Section I.6.4] and [9]. The aim of the present paper is to present some new contributions concerning this problem. In Section 2, we take a combinatorial approach and find the generalised star-height of languages where a fixed word w appears as a contiguous subword of the words in our language precisely k times or k modulo n times. In Section 3, we apply these results to prove that languages recognised by Rees (zero-)matrix semigroups over abelian groups are of generalised star-height at most one.
    [Show full text]