Combinatorics on Words – a Tutorial *

Total Page:16

File Type:pdf, Size:1020Kb

Combinatorics on Words – a Tutorial * Combinatorics on Words – A Tutorial ⋆ J. Berstel1 and J. Karhum¨aki2⋆⋆ 1Institut Gaspard-Monge, Universit´ede Marne-la-Vall´ee, 77454 Marne-la-Vall´ee Cedex 2, France, email: [email protected] 2Department of Mathematics and Turku Centre for Computer Science, University of Turku, 20014 Turku, Finland, email: [email protected].fi Table of Contents 1 Introduction ...................................... .................. 2 1.1 History......................................... ............... 3 1.2 Notions and notations............................. .............. 4 2 Connections....................................... .................. 5 2.1 Tomatrices ...................................... .............. 6 2.2 Toalgebra ....................................... .............. 7 2.3 To algorithmics.................................. ............... 8 3 Periodicity....................................... ................... 9 3.1 Fine and Wilf’s theorem............................ ............. 9 3.2 Critical factorization theorem . ................ 11 3.3 Characterizations for ultimately periodic words . ................ 12 4 Dimension properties............................... .................. 15 4.1 Defect theorems .................................. .............. 16 4.2 Ehrenfeucht Compactness Property . ............. 19 5 Unavoidable regularities ........................... ................... 19 5.1 Power-free words ................................. .............. 19 5.2 Testsets andtest words ............................ ............. 23 5.3 Repetition threshold ............................. ............... 23 5.4 Unavoidable patterns............................. ............... 24 5.5 Shirshov’s theorem............................... ............... 25 5.6 Unavoidable sets of words.......................... .............. 27 6 Complexity ........................................ ................. 28 6.1 Subword complexity of finite words . ............ 28 6.2 Subword complexity of infinite words . ............. 29 6.3 Sturmianwords ................................... ............. 30 6.4 Episturmian words................................ .............. 33 6.5 Hierarchies of complexities . ................ 34 6.6 Subword complexity and transcendence . ............. 35 6.7 Descriptive and computational complexity . .............. 36 7 From words to finite sets of words ....................... .............. 38 7.1 Conway’s Problem ................................. ............. 38 ⋆ This is a slightly improved version of the paper that appeared in Bulletin EATCS, 79 (february 2003), p. 178-228. The work of the second author was done while visiting Institut Gaspard-Monge Universit´ede Marne-la-Vall´ee ⋆⋆ Supported by the Academy of Finland under the grant 44087 7.2 Characterization of commuting sets . .............. 41 7.3 Undecidability results for finite sets . ................ 42 8 OpenProblems...................................... ................ 43 1 Introduction During the last two decades research on combinatorial problems of words, i.e., on Combinatorics on Words, has grown enormously. Although there has been important contributions on words starting from the very beginning of last cen- tury, they were scattered and typically needed as tools to achieve some other goals in mathematics. A notable exception is combinatorial group theory, which studies combinatorial problems on words as representing group elements, see [LS77] and [MKS66]. Now, and particularly after the appearance of Lothaire’s book – Combinatorics on Words – in 1983 the topic has become a challenging research topic of its own. In the latest classification of Mathematical Reviews combinatorics on words constitutes its own section under the chapter discrete mathematics related to computer science. Although the applications of words are, by no means, only in computer science the classification catches the basic of the nature of combinatorics on words. Recent developments of the field culminated in Lothaire’s second book – Algebraic Combinatorics on Words – which appeared in 2002. Its more than 500 pages witness the vital stage of the topic. The new book repeats basically nothing from the first one, and actually most of the results were discovered during the last twenty years. A biannual conference – referred to as WORDS – devoted entirely to combinatorics on words has also been created. The fourth event will be in Turku in 2003. A word is a sequence of symbols, finite of infinite, taken from a finite alphabet. A natural environment of a finite word is a free monoid. Consequently, words can be seen as a discrete combinatorial objects or discrete algebraic objects in a noncommutative structure. These two facts – discreteness and noncommutativity – are the two fundamental features of words. At the same time they explain why many problems are so difficult. Words are central objects of automata theory, and in fact in any standard model of computing. Even when computing on numbers computers operate on words, i.e., representations of numbers as words. Consequently, on one hand, it is natural to study algorithmic properties of words. On the other hand, the undecidability of problems is most easily stated in terms of words – the Post Cor- respondence Problem being a splendid example. Both these elements of words – algorithmic aspects and undecidability – are visible, often implicitly, throughout our presentation. The goal of the tutorial is to discuss – without aiming to be exhaustive – several typical problems on words, as well as to try to point out several appli- cations. With a few exceptions the proofs are not presented here, however, in some cases we use examples to illustrate the basic ideas. Open problems form an important part of our presentation. 2 The contents of this tutorial is as follows. At the end of this introductory sec- tion we discuss briefly the history of combinatorics on words, and fix some basic terminology. Then in Section 2 we consider connections to other fields of math- ematics and computer science. Section 3 is devoted to the most fundamental notion of words, namely to periodicity. Dimension properties of words consti- tute Section 4, while Section 5 concentrates one of the most studied and most characteristic feature of words, namely unavoidable regularities. Words, indeed, are very suitable objects to formulate such fundamental properties. In Section 6 complexity issues of infinite words are studied from different points of view. An interesting phenomenon is that what is considered to be complicated in a classical sense, e.g., algebraically, need not be so from the point of view of words. Finally, in Section 7 we discuss about some extensions of the theory to finite sets of words, and in Section 8 collect a list of important open problems, many of those being apparently very difficult. As we said everywhere above algorithmic and decidability issues are present. We conclude by some bibliographic remarks. Combinatorics on words has now become a rich area, with many connections to algorithms, to number the- ory, to symbolic dynamics, and to applications in biology and text processing. Several books have appeared quite recently, or will appear in the next months, that emphasize these connections. What have to be mentioned are the book of Allouche and Shallit [AS03], where the emphasis is on relation to automata theory, and the book by Pytheas-Fogg [PF02] which is a nom de plume for the Marseille group. Quite recently the book by Crochemore and Rytter [CR02] ap- peared as a follow-up book to [CR94]. A detailed introduction into algorithms on words is given in the book [CHL01]. Algorithms on words are also described, from a more biological point of view, in Gusfield’s book [Gus97]. Finally, we should point to algebraic applications of combinatorics on words, as they appear in the book of de Luca and Varricchio [dLV99]. 1.1 History The history of combinatorics on words goes back to the beginning of the last century, when A. Thue started to work on repetition-free words. He proved, among other things, the existence of an infinite square-free word over a ternary alphabet. Interestingly, it seems that Thue had no outside motivation for his research on words. He published his results in two long papers [Th06] and [Th12], but unfortunately in a less known Norwegian journal, so that his results became known only much later, cf. [Be95]. Actually many of those were reproved several times. The notion of word is, of course, so natural that it can be found even in several older mathematical works. Even Gauss considered a problem which was nothing but a problem on combinatorics on words, cf. [Ga00] and [KMPS92], and in 1850s Prouhet [Pr51] introduced the most famous infinite word redefined by Thue. However, Thue was clearly the first to study systematically problems on words, and moreover as problems of their own. 3 After Thue during the first half of the previous century there were only a few isolated works on words, such as those of [Ar37], [Mo21], [Mo38] and [MH38]. In these works, as it has been very typical for the whole field, properties of words were not that much the research topic of itself rather than tools for solving problems in other areas. It took till the second half of the last century when the theory of words arose. This happened more or less simultaneously in France and Russia. In France it grew up from research of M.P. Sch¨utzenberger on theory of codes, see [Sch56]. The Russian school, in turn, developed from the seminal work of P.S. Novikov and S. Adian on
Recommended publications
  • Filling Functions Notes for an Advanced Course on the Geometry of the Word Problem for Finitely Generated Groups Centre De Recer
    Filling Functions Notes for an advanced course on The Geometry of the Word Problem for Finitely Generated Groups Centre de Recerca Mathematica` Barcelona T.R.Riley July 2005 Revised February 2006 Contents Notation vi 1Introduction 1 2Fillingfunctions 5 2.1 Van Kampen diagrams . 5 2.2 Filling functions via van Kampen diagrams . .... 6 2.3 Example: combable groups . 10 2.4 Filling functions interpreted algebraically . ......... 15 2.5 Filling functions interpreted computationally . ......... 16 2.6 Filling functions for Riemannian manifolds . ...... 21 2.7 Quasi-isometry invariance . .22 3Relationshipsbetweenfillingfunctions 25 3.1 The Double Exponential Theorem . 26 3.2 Filling length and duality of spanning trees in planar graphs . 31 3.3 Extrinsic diameter versus intrinsic diameter . ........ 35 3.4 Free filling length . 35 4Example:nilpotentgroups 39 4.1 The Dehn and filling length functions . .. 39 4.2 Open questions . 42 5Asymptoticcones 45 5.1 The definition . 45 5.2 Hyperbolic groups . 47 5.3 Groups with simply connected asymptotic cones . ...... 53 5.4 Higher dimensions . 57 Bibliography 68 v Notation f, g :[0, ∞) → [0, ∞)satisfy f ≼ g when there exists C > 0 such that f (n) ≤ Cg(Cn+ C) + Cn+ C for all n,satisfy f ≽ g ≼, ≽, ≃ when g ≼ f ,andsatisfy f ≃ g when f ≼ g and g ≼ f .These relations are extended to functions f : N → N by considering such f to be constant on the intervals [n, n + 1). ab, a−b,[a, b] b−1ab, b−1a−1b, a−1b−1ab Cay1(G, X) the Cayley graph of G with respect to a generating set X Cay2(P) the Cayley 2-complex of a
    [Show full text]
  • Properties and Generalizations of the Fibonacci Word Fractal Exploring Fractal Curves José L
    The Mathematica® Journal Properties and Generalizations of the Fibonacci Word Fractal Exploring Fractal Curves José L. Ramírez Gustavo N. Rubiano This article implements some combinatorial properties of the Fibonacci word and generalizations that can be generated from the iteration of a morphism between languages. Some graphic properties of the fractal curve are associated with these words; the curves can be generated from drawing rules similar to those used in L-systems. Simple changes to the programs generate other interesting curves. ‡ 1. Introduction The infinite Fibonacci word, f = 0 100 101 001 001 010 010 100 100 101 ... is certainly one of the most studied words in the field of combinatorics on words [1–4]. It is the archetype of a Sturmian word [5]. The word f can be associated with a fractal curve with combinatorial properties [6–7]. This article implements Mathematica programs to generate curves from f and a set of drawing rules. These rules are similar to those used in L-systems. The outline of this article is as follows. Section 2 recalls some definitions and ideas of combinatorics on words. Section 3 introduces the Fibonacci word, its fractal curve, and a family of words whose limit is the Fibonacci word fractal. Finally, Section 4 generalizes the Fibonacci word and its Fibonacci word fractal. The Mathematica Journal 16 © 2014 Wolfram Media, Inc. 2 José L. Ramírez and Gustavo N. Rubiano ‡ 2. Definitions and Notation The terminology and notation are mainly those of [5] and [8]. Let S be a finite alphabet, whose elements are called symbols. A word over S is a finite sequence of symbols from S.
    [Show full text]
  • Here the Base Β Is Real and the Alphabet Is Subset a of Z
    Numeration 2018 22-25 May 2018 Paris France Table of contents Invited talks (Edita Pelantov´a,Robert F. Tichy)3 Establishing and maintaining databases of self-affine tiles, Bandt Christoph [et al.] 1 On the beta-expansion of integers of Z[beta]; connections with self similar tilings, Bertrand-Mathis Anne3 Integer multipliers of real polynomials without nonnegative roots, Brunotte Horst 4 On the magic of some families of fractal dendrites, Cristea Ligia Loretta5 Multiplicative dependence of shifted algebraic numbers, Dubickas Arturas7 Representations of palindromes in the Fibonacci word, Frid Anna9 Random homogeneous beta-expansions and self-similar measures, Hare Kathryn [et al.] 13 Totally positive quadratic integers and numeration, Hejda Tomas [et al.] 14 Toolset for supporting the number system research, Hudoba Peter [et al.] 19 Characterization of rational matrices that admit finite digit representations, Jankauskas Jonas [et al.] 22 1 Unique expansions on fat Sierpinski gaskets, Kong Derong [et al.] 23 Multi-base Representations and their Minimal Hamming Weight, Krenn Daniel [et al.] 24 Infinite families of number systems, Kr´asensk´yJakub [et al.] 26 Ito alpha-continued fractions and matching, Langeveld Niels [et al.] 29 The sum-of-digits function of linearly recurrent number systems and almost primes, Madritsch Manfred 31 Constructing invariant densities for random systems, Maggioni Marta [et al.] 33 Some complexity results in the theory of normal numbers, Airey Dylan [et al.] 36 Mobius Orthogonality for automatic sequences and beyond,
    [Show full text]
  • Sturmian Words and the Permutation That Orders Fractional Parts
    Sturmian Words and the Permutation that Orders Fractional Parts Kevin O’Bryant University of California, San Diego [email protected] http://www.math.ucsd.edu/∼kobryant November 12, 2002 Abstract A Sturmian word is a map W : N → {0, 1} for which the set of {0, 1}-vectors T Fn(W ) := {(W (i),W (i + 1),...,W (i + n − 1)) : i ∈ N} has cardinality exactly n + 1 for each positive integer n. Our main result is that the volume of the simplex whose n + 1 vertices are the n + 1 points in Fn(W ) does not depend on W . Our proof of this motivates studying algebraic properties of the per- mutation πα,n (where α is any irrational and n is any positive integer) that orders the fractional parts {α}, {2α},..., {nα}, i.e., 0 < {πα,n(1)α} < {πα,n(2)α} < ··· < {πα,n(n)α} < 1. We give a formula for the sign of πα,n, and prove that for every irrational α there are infinitely many n such that the order of πα,n (as an element of the symmetric group Sn) is less than n. 1 Introduction A binary word is a map from the nonnegative integers into {0, 1}. The factors of W are the column vectors (W (i),W (i + 1),...,W (i + n − 1))T , where i ≥ 0 and n ≥ 1. In particular, the set of factors of length n of a binary word W is defined by n T o Fn(W ) := (W (i),W (i + 1),...,W (i + n − 1)) : i ≥ 0 .
    [Show full text]
  • Free Lie Algebras
    Last revised 3:11 p.m. April 10, 2020 Free Lie algebras Bill Casselman University of British Columbia [email protected] The purpose of this essay is to give an introduction to free Lie algebras and a few of their applications. My principal references are [Serre:1965], [Reutenauer:1993], and [de Graaf:2000]. My interest in free Lie algebras has been motivated by the well known conjecture that Kac•Moody algebras can be defined by generators and relations analogous to those introduced by Serre for finite•dimensional semi•simple Lie algebras. I have had this idea for a long time, but it was coming across the short note [de Graaf:1999] that acted as catalyst for this (alas! so far unfinished) project. Fix throughout this essay a commutative ring R. I recall that a Lie algebra over R is an R•module g together with a Poisson bracket [x, y] such that [x, x]=0 [x, [y,z]] + [y, [z, x]] + [z, [x, y]]=0 Since [x + y, x + y] = [x, x] + [x, y] + [y, x] + [y,y], the first condition implies that [x, y] = [y, x]. The − second condition is called the Jacobi identity. In a later version of this essay, I’ll discuss the Baker•Campbell•Hausdorff Theorem (in the form due to Dynkin). Contents 1. Magmas........................................... ................................ 1 2. ThefreeLiealgebra ................................ ................................ 3 3. Poincare•Birkhoff•Witt´ .................................... ......................... 5 4. FreeLiealgebrasandtensorproducts ................. .............................. 8 5. Hallsets—motivation
    [Show full text]
  • Fixed Points of Sturmian Morphisms and Their Derivated Words
    FIXED POINTS OF STURMIAN MORPHISMS AND THEIR DERIVATED WORDS KAREL KLOUDA, KATEŘINA MEDKOVÁ, EDITA PELANTOVÁ, AND ŠTĚPÁN STAROSTA Abstract. Any infinite uniformly recurrent word u can be written as concatenation of a finite number of return words to a chosen prefix w of u. Ordering of the return words to w in this concatenation is coded by derivated word du(w). In 1998, Durand proved that a fixed point u of a primitive morphism has only finitely many derivated words du(w) and each derivated word du(w) is fixed by a primitive morphism as well. In our article we focus on Sturmian words fixed by a primitive morphism. We provide an algorithm which to a given Sturmian morphism ψ lists the morphisms fixing the derivated words of the Sturmian word u = ψ(u). We provide a sharp upper bound on length of the list. Keywords: Derivated word, Return word, Sturmian morphism, Sturmian word 2000MSC: 68R15 1. Introduction Sturmian words are probably the most studied object in combinatorics on words. They are aperiodic words over a binary alphabet having the least factor complexity possible. Many properties, characteri- zations and generalizations are known, see for instance [5, 4, 2]. One of their characterizations is in terms of return words to their factors. Let u = u0u1u2 ··· be a binary infinite word with ui ∈ {0, 1}. Let w = uiui+1 ··· ui+n−1 be its factor. The integer i is called an occurrence of the factor w. A return word to a factor w is a word uiui+1 ··· uj−1 with i and j being two consecutive occurrences of w such that i < j.
    [Show full text]
  • Combinatorics, Automata and Number Theory CANT
    Combinatorics, Automata and Number Theory CANT Edited by Val´erie Berth´e LIRMM - Universit´e Montpelier II - CNRS UMR 5506 161 rue Ada, F-34392 Montpellier Cedex 5, France Michel Rigo Universit´e de Li`ege, Institut de Math´ematiques Grande Traverse 12 (B 37), B-4000 Li`ege, Belgium 8 Transcendence and Diophantine approximation Boris Adamczewski CNRS, Universit´e de Lyon, Universit´e Lyon 1, Institut Camille Jordan, 43 boulevard du 11 novembre 1918, F-69622 Villeurbanne cedex, France Yann Bugeaud IRMA - Universit´e de Strasbourg-Math´ematiques - CNRS UMR 7501 7 rue Ren´e Descartes, F-67084 Strasbourg cedex, France. The aim of this chapter is to present several number-theoretic problems that reveal a fruitful interplay between combinatorics on words and Diophantine approximation. Finite and infinite words occur naturally in Diophantine approximation when we consider the expansion of a real number in an inte- ger base b or its continued fraction expansion. Conversely, with an infinite word a on the finite alphabet 0, 1, . , b 1 we associate the real number { − } ξa whose base-b expansion is given by a. As well, with an infinite word a on the infinite alphabet 1, 2, 3, . , we associate the real number ζ whose { } a continued fraction expansion is given by a. It turns out that, if the word a enjoys certain combinatorial properties involving repetitive or symmetric patterns, then this gives interesting information on the arithmetical nature and on the Diophantine properties of the real numbers ξa and ζa. We illustrate our results by considering the real numbers associated with two classical infinite words, the Thue-Morse word and the Fibonacci word, see Example 1.2.21 and 1.2.22.
    [Show full text]
  • On the K-Fibonacci Words
    Acta Univ. Sapientiae, Informatica, 5, 2 (2013) 212–226 DOI: 10.2478/ausi-2014-0011 On the k-Fibonacci words Jos´eL.RAM´IREZ Gustavo N. RUBIANO Instituto de Matem´aticas y sus Departamento de Matem´aticas Aplicaciones Universidad Nacional de Colombia, Universidad Sergio Arboleda, Colombia Colombia email: email: [email protected] [email protected] Abstract. In this paper we define the k-Fibonacci words in analogy with the definition of the k-Fibonacci numbers. We study their properties and we associate to this family of words a family of curves with interesting patterns. 1 Introduction Fibonacci numbers and their generalizations have many interesting properties and applications to almost every fields of science and arts (e.g. see [13]). The Fibonacci numbers Fn are the terms of the sequence 0, 1, 1, 2, 3, 5, . wherein each term is the sum of the two previous terms, beginning with the values F0 = 0,andF1 = 1. Besides the usual Fibonacci numbers many kinds of generalizations of these numbers have been presented in the literature. In particular, a generalization is the k-Fibonacci numbers [11]. For any positive real number k,thek-Fibonacci sequence, say {Fk,n}n∈N is defined recurrently by Fk,0 = 0, Fk,1 = 1 and Fk,n+1 = kFk,n + Fk,n−1,n 1. (1) Computing Classification System 1998: I.3.7, G.2.0 Mathematics Subject Classification 2010: 11B39, 05A05 , 68R15 Key words and phrases: Fibonacci word, k-Fibonacci numbers, k-Fibonacci words, k- Fibonacci curves 212 On the k-Fibonacci words 213 In [11], k-Fibonacci numbers were found by studying the recursive applica- tion of two geometrical transformations used in the four-triangle longest-edge (4TLE) partition.
    [Show full text]
  • Grammar-Compressed Self-Index with Lyndon Words
    Grammar-compressed Self-index with Lyndon Words Kazuya Tsuruta1 Dominik K¨oppl1;2 Yuto Nakashima1 Shunsuke Inenaga1;3 Hideo Bannai1 Masayuki Takeda1 1Department of Informatics, Kyushu University, Fukuoka, Japan, fkazuya.tsuruta, yuto.nakashima, inenaga, bannai, [email protected] 2Japan Society for Promotion of Science, [email protected] 3Japan Science and Technology Agency Abstract We introduce a new class of straight-line programs (SLPs), named the Lyndon SLP, in- spired by the Lyndon trees (Barcelo, 1990). Based on this SLP, we propose a self-index data structure of O(g) words of space that can be built from a string T in O(n lg n) ex- pected time, retrieving the starting positions of all occurrences of a pattern P of length m in O(m + lg m lg n + occ lg g) time, where n is the length of T , g is the size of the Lyndon SLP for T , and occ is the number of occurrences of P in T . 1 Introduction A context-free grammar is said to represent a string T if it generates the language consisting of T and only T . Grammar-based compression [29] is, given a string T , to find a small size descrip- tion of T based on a context-free grammar that represents T . The grammar-based compression scheme is known to be most suitable for compressing highly-repetitive strings. Due to its ease of manipulation, grammar-based representation of strings is a frequently used model for compressed string processing, where the aim is to efficiently process compressed strings without explicit de- compression.
    [Show full text]
  • Combinatorial Group Theory
    Combinatorial Group Theory Charles F. Miller III 7 March, 2004 Abstract An early version of these notes was prepared for use by the participants in the Workshop on Algebra, Geometry and Topology held at the Australian National University, 22 January to 9 February, 1996. They have subsequently been updated and expanded many times for use by students in the subject 620-421 Combinatorial Group Theory at the University of Melbourne. Copyright 1996-2004 by C. F. Miller III. Contents 1 Preliminaries 3 1.1 About groups . 3 1.2 About fundamental groups and covering spaces . 5 2 Free groups and presentations 11 2.1 Free groups . 12 2.2 Presentations by generators and relations . 16 2.3 Dehn’s fundamental problems . 19 2.4 Homomorphisms . 20 2.5 Presentations and fundamental groups . 22 2.6 Tietze transformations . 24 2.7 Extraction principles . 27 3 Construction of new groups 30 3.1 Direct products . 30 3.2 Free products . 32 3.3 Free products with amalgamation . 36 3.4 HNN extensions . 43 3.5 HNN related to amalgams . 48 3.6 Semi-direct products and wreath products . 50 4 Properties, embeddings and examples 53 4.1 Countable groups embed in 2-generator groups . 53 4.2 Non-finite presentability of subgroups . 56 4.3 Hopfian and residually finite groups . 58 4.4 Local and poly properties . 61 4.5 Finitely presented coherent by cyclic groups . 63 1 5 Subgroup Theory 68 5.1 Subgroups of Free Groups . 68 5.1.1 The general case . 68 5.1.2 Finitely generated subgroups of free groups .
    [Show full text]
  • Standard Lyndon Bases of Lie Algebras and Enveloping Algebras
    transactions of the american mathematical society Volume 347, Number 5, May 1995 STANDARD LYNDON BASES OF LIE ALGEBRAS AND ENVELOPING ALGEBRAS PIERRE LALONDE AND ARUN RAM Abstract. It is well known that the standard bracketings of Lyndon words in an alphabet A form a basis for the free Lie algebra Lie(^) generated by A . Suppose that g = Lie(A)/J is a Lie algebra given by a generating set A and a Lie ideal J of relations. Using a Gröbner basis type approach we define a set of "standard" Lyndon words, a subset of the set Lyndon words, such that the standard bracketings of these words form a basis of the Lie algebra g . We show that a similar approach to the universal enveloping algebra g naturally leads to a Poincaré-Birkhoff-Witt type basis of the enveloping algebra of 0 . We prove that the standard words satisfy the property that any factor of a standard word is again standard. Given root tables, this property is nearly sufficient to determine the standard Lyndon words for the complex finite-dimensional simple Lie algebras. We give an inductive procedure for computing the standard Lyndon words and give a complete list of the standard Lyndon words for the complex finite-dimensional simple Lie algebras. These results were announced in [LR]. 1. Lyndon words and the free Lie algebra In this section we give a short summary of the facts about Lyndon words and the free Lie algebra which we shall use. All of the facts in this section are well known. A comprehensive treatment of free Lie algebras (and Lyndon words) appears in the book by C.
    [Show full text]
  • Lyndon Words, Free Algebras and Shuffles
    Can. J. Math., Vol. XLI, No. 4, 1989, pp. 577-591 LYNDON WORDS, FREE ALGEBRAS AND SHUFFLES GUY MELANÇON AND CHRISTOPHE REUTENAUER 1. Introduction. A Lyndon word is a primitive word which is minimum in its conjugation class, for the lexicographical ordering. These words have been introduced by Lyndon in order to find bases of the quotients of the lower central series of a free group or, equivalently, bases of the free Lie algebra [2], [7]. They have also many combinatorial properties, with applications to semigroups, pi-rings and pattern-matching, see [1], [10]. We study here the Poincaré-Birkhoff-Witt basis constructed on the Lyndon basis (PBWL basis). We give an algorithm to write each word in this basis: it reads the word from right to left, and the first encountered inversion is either bracketted, or straightened, and this process is iterated: the point is to show that each bracketting is a standard one: this we show by introducing a loop invariant (property (S)) of the algorithm. This algorithm has some analogy with the collecting process of P. Hall [5], but was never described for the Lyndon basis, as far we know. A striking consequence of this algorithm is that any word, when written in the PBWL basis, has coefficients in N (see Theorem 1). This will be proved twice in fact, and is similar to the same property for the Shirshov-Hall basis, as shown by M.P. Schutzenberger [11]. Our next result is a precise description of the dual basis of the PBWL basis. The former is denoted (Sw), where w is any word, and we show that if w = au is a Lyndon word beginning with the letter a, and that 1 , Sw = (ti!...*II!r 5* o...oS*- x k n if w = l\ ..
    [Show full text]