The Complexity of Unavoidable Word Patterns

Total Page:16

File Type:pdf, Size:1020Kb

The Complexity of Unavoidable Word Patterns The complexity of unavoidable word patterns by PAUL VAN DER MERWE SAUER submitted in accordance with the requirements of DOCTOR OF PHILOSOPHY in the subject OPERATIONS RESEARCH at the UNIVERSITY OF SOUTH AFRICA SUPERVISOR: PROF W L FOUCHÉ CO-SUPERVISOR: PROF P H POTGIETER December 30, 2019 2 Summary The avoidability, or unavoidability of patterns in words over finite alphabets has been studied extensively. The word α over a finite set A is said to be unavoidable for an infinite set B+ of nonempty words over a finite set B if, for all but finitely + + + many elements w of B , there exists a semigroup morphism φ ∶ A → B such that φ(α) is a factor of w. In this treatise, we start by presenting a historical background of results that are related to unavoidability. We present and discuss the most important theorems surrounding unavoidability in detail. We present various complexity-related properties of unavoidable words. For words that are unavoidable, we provide a constructive upper bound to the lengths of words that avoid them. In particular, for a pattern α of length n over an alphabet of size r, we give a concrete function N(n; r) such that no word of length N(n; r) over the alphabet of size r avoids α. A natural subsequent question is how many unavoidable words there are. We show that the fraction of words that are unavoidable drops exponentially fast in the length of the word. This allows us to calculate an upper bound on the number of unavoidable patterns for any given finite alphabet. Subsequently, we investigate computational aspects of unavoidable words. In particular, we exhibit concrete algorithms for determining whether a word is unavoidable. We also prove results on the computational complexity of the problem of determining whether a given word is unavoidable. Specifically, the NP -completeness of the aforementioned problem is established. Key Terms Unavoidability; Avoidability; Combinatorial Algebra; Computational Complexity; Ramsey Theory; Algorithms; NP -completeness; Unavoidable regularity; Word pat- terns; Zimin words 3 4 Contents 1 Preliminaries 7 1.1 First Words . 7 1.2 Monkeys, Typewriters and more . 14 1.3 The History of Avoidability . 16 1.4 New Contributions . 18 1.5 Acknowledgements . 19 2 Formal Introductions 21 2.1 Basic Concepts . 21 2.2 Unavoidability . 22 2.3 Fouché’s Constructive Bounds . 24 2.4 Roadmap . 25 3 Bean, Ehrenfeucht and McNulty in Detail 27 3.1 Groundwork . 27 3.2 Sapir’s Proof of the B.E.M. Theorem . 31 3.3 Some Observations on Sapir’s Construction . 34 4 Complexity of Size 39 4.1 General Bounds for Unavoidable Patterns . 39 4.2 The Size of the Bound . 41 4.3 Tighter Bounds . 42 5 How many Patterns are Unavoidable? 43 5.1 Density and Counting Unavoidable Patterns . 43 5.2 Zimin Words . 44 5.3 Counting . 45 5 Contents 6 Computational Complexity 47 6.1 Preliminaries . 47 6.2 Free Letters and Computation . 49 6.3 Unavoidability and Logic . 53 6.4 Unavoidability and Computational Complexity . 63 6.5 Practical Concerns . 65 7 Conclusion 67 A A Longer Word Avoiding xx 69 B Being Avoidant of xxx 97 C Code 125 C.1 Ternary Patterns with Free Letters . 125 C.2 Binary Patterns with Free Letters . 126 C.3 Size Bounds . 126 C.4 Estimated Counts of Unavoidable Patterns (in tabular form) . 127 D Three Letters 129 E Four Letter Words 163 Bibliography 191 6 Chapter 1 Preliminaries This chapter provides a non-technical overview of the milieu where we intend to work. Readers familiar with the underlying concepts and historical background on Unavoidability may choose to skip it altogether. 1.1 First Words Before we embark on a potentially tedious voyage of mathematical formalism, let us consider the following puzzle. Is it possible to write an arbitrarily long sequence of zeroes and ones that avoids the pattern xx? By the pattern xx we mean two identical blocks of zeroes and ones appear adjacent to each other somewhere in the sequence that we aim to write. By “avoids" we mean the pattern xx does not appear in our sequence. A brief moment of reflection reveals that this is not a very hard puzzle and the answer is “no": Let us try to write down such a sequence. We may as well start with 0, by symmetry. The next symbol (or letter, in our language) must be 1, otherwise our sequence would be 00 which reflects the pattern xx. So our sequence (or word, as we will eventually call it) is now 01. Our next choice must be 0 to avoid the 11 in 011 from once again reflecting xx, leaving us with the word 010. The next letter must be 1, for the same reason as before and we are left with 0101. But now we have failed, because x ↦ 01 means 0101 reflects xx. We say xx is unavoidable over the binary alphabet {0; 1}. When it is possible to write an arbitrarily long sequence avoiding a given pattern, we call such a pattern avoidable. Now let us consider the pattern xyx. Is this pattern avoidable over the binary alphabet? This is perhaps a little more difficult. Starting again with 0, we are at liberty to choose either 0 or 1 for our second letter, it may seem. Regardless of our 7 Chapter 1. Preliminaries choice for the second letter, the third letter must be 1, otherwise both 010 and 000 reflect xyx. Suppose the second letter is 1. Our sequence is now 011. If the fourth letter is 0, then our word 0110 reflects xyx, given by x ↦ 0 and y ↦ 11. If the fourth letter is 1, on the other hand, then our word 0111 also reflects xyx, given by choosing x ↦ 1 and y ↦ 1. So our second letter cannot be 1 and must therefore be 0, leaving us with 001 after three choices. If our fourth choice is 0 then the last three letters 010 of 0010 reflects xyx. If our fourth choice is 1, then we are left with 0011. If the fifth choice is 0, then x ↦ 0 and y ↦ 11 reflects xyx in 00110. Finally, if the fifth choice is 1, then x ↦ 1 and y ↦ 1 reflects xyx in 00111. We therefore see, as before with xx, that xyx is unavoidable over the binary alphabet. Our second problem, though more difficult than the first, is still not very interest- ing as it simply amounts to following fairly rudimentary forced logical consequences. So far this does not look like a gripping set of problems. However, it quickly becomes much harder to solve our avoidability puzzle for more complicated patterns and larger alphabets. In the xyx problem above a certain “degree of freedom" emerged in that we had an arbitrary choice for our second letter. This was not true for the xx prob- lem and caused the additional steps of reasoning related to xyx. We see that more freedoms emerge as patterns become more elaborate and alphabet sizes increase, rapidly leading to a combinatorially intractable situations. The reader may choose verify that the following word on the ternary alphabet avoids xx, given sufficient time. 123 121 312 321 323 132 123 121 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 231 321 312 321 323 132 123 121 323 132 131 232 123 121 312 321 323 132 123 121 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 231 321 312 321 323 132 123 121 323 132 131 232 123 121 312 321 323 132 123 121 312 321 231 213 231 321 312 321 323 132 123 121 323 132 131 232 123 121 323 132 123 121 312 321 323 132 131 232 123 121 312 321 323 132 123 121 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 231 321 312 321 323 132 123 121 323 132 131 232 123 121 312 321 323 132 123 121 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 131 232 132 313 212 312 131 232 123 121 323 132 131 232 132 313 212 312 132 313 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 231 321 231 213 123 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 132 313 213 123 213 231 321 231 213 231 321 312 321 231 213 123 213 231 321 231 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 131 232 132 313 212 312 131 232 123 121 323 132 131 232 132 313 212 312 132 313 213 123 212 312 131 232 132 313 212 312 131 232 123 121 323 132 123 121 312 321 323 132 131 232 123 121 323 132 131 232 132 313 212 312 132 313 213 123 212 312 132 313 212 312 131 232 132 313 213 123 212 312 131 232 132 313 8 Chapter 1.
Recommended publications
  • A Generalization of Square-Free Strings a GENERALIZATION of SQUARE-FREE STRINGS
    A Generalization of Square-free Strings A GENERALIZATION OF SQUARE-FREE STRINGS BY NEERJA MHASKAR, B.Tech., M.S. a thesis submitted to the department of Computing & Software and the school of graduate studies of mcmaster university in partial fulfillment of the requirements for the degree of Doctor of Philosophy c Copyright by Neerja Mhaskar, August, 2016 All Rights Reserved Doctor of Philosophy (2016) McMaster University (Dept. of Computing & Software) Hamilton, Ontario, Canada TITLE: A Generalization of Square-free Strings AUTHOR: Neerja Mhaskar B.Tech., (Mechanical Engineering) Jawaharlal Nehru Technological University Hyderabad, India M.S., (Engineering Science) Louisiana State University, Baton Rouge, USA SUPERVISOR: Professor Michael Soltys CO-SUPERVISOR: Professor Ryszard Janicki NUMBER OF PAGES: xiii, 116 ii To my family Abstract Our research is in the general area of String Algorithms and Combinatorics on Words. Specifically, we study a generalization of square-free strings, shuffle properties of strings, and formalizing the reasoning about finite strings. The existence of infinitely long square-free strings (strings with no adjacent re- peating word blocks) over a three (or more) letter finite set (referred to as Alphabet) is a well-established result. A natural generalization of this problem is that only sub- sets of the alphabet with predefined cardinality are available, while selecting symbols of the square-free string. This problem has been studied by several authors, and the lowest possible bound on the cardinality of the subset given is four. The problem remains open for subset size three and we investigate this question. We show that square-free strings exist in several specialized cases of the problem and propose ap- proaches to solve the problem, ranging from patterns in strings to Proof Complexity.
    [Show full text]
  • Some Recent Results on Squarefree Words
    Some recent results on squarefree words by Jean Berstel Universit~ Pierre et Marie Curie and L.I.T.P Paris "F~r die Entwicklung der logischen Wissenschaften wird es, ohne ROcksicht auf etwaige Anwendungen, yon Bedeutung sein, ausgedehnte Felder f~r Spekulation Ober schwierige Probleme zu finden." Axel Thue, 1912. I. Introduction. When Axel Thue wrote these lines in the introduction to his 1912 paper on squarefree words, he certainly did not feel as a theoretical computer scientist. During the past seventy years, there was an increasing interest in squarefree words and more generally in repetitions in words. However, A. Thue's sentence seems still to hold : in some sense, he said that there is no reason to study squarefree words, excepted that it's a difficult question, and that it is of primary importance to investigate new domains. Seventy years later, these questions are no longer new, and one may ask if squarefree words served already. First, we observe that infinite squarefree, overlap-free or cube-free words indeed served as examples or counter-examples in several, quite different domains. In symbolic dynamics, they were introduced by Morse in 1921 [36] . Another use is in group theory, where an infinite square-free word is one (of the numerous) steps in disproving the Burnside conjecture (see Adjan[2]). Closer to computer science is Morse and Hedlund's interpretatio~ in relation with chess [37]. We also mention applications to formal language theory : Brzozowsky, K. Culik II and Gabriellian [7] use squarefree words in connection with noncounting languages, J. Goldstine uses the Morse sequence to show that a property of some family of languages [22].
    [Show full text]
  • Combinatorics on Words: Christoffel Words and Repetitions in Words
    Combinatorics on Words: Christoffel Words and Repetitions in Words Jean Berstel Aaron Lauve Christophe Reutenauer Franco Saliola 2008 Ce livre est d´edi´e`ala m´emoire de Pierre Leroux (1942–2008). Contents I Christoffel Words 1 1 Christoffel Words 3 1.1 Geometric definition ....................... 3 1.2 Cayley graph definition ..................... 6 2 Christoffel Morphisms 9 2.1 Christoffel morphisms ...................... 9 2.2 Generators ............................ 15 3 Standard Factorization 19 3.1 The standard factorization .................... 19 3.2 The Christoffel tree ........................ 23 4 Palindromization 27 4.1 Christoffel words and palindromes ............... 27 4.2 Palindromic closures ....................... 28 4.3 Palindromic characterization .................. 36 5 Primitive Elements in the Free Group F2 41 5.1 Positive primitive elements of the free group .......... 41 5.2 Positive primitive characterization ............... 43 6 Characterizations 47 6.1 The Burrows–Wheeler transform ................ 47 6.2 Balanced1 Lyndon words ..................... 50 6.3 Balanced2 Lyndon words ..................... 50 6.4 Circular words .......................... 51 6.5 Periodic phenomena ....................... 54 i 7 Continued Fractions 57 7.1 Continued fractions ........................ 57 7.2 Continued fractions and Christoffel words ........... 58 7.3 The Stern–Brocot tree ...................... 62 8 The Theory of Markoff Numbers 67 8.1 Minima of quadratic forms .................... 67 8.2 Markoff numbers ......................... 69 8.3 Markoff’s condition ........................ 71 8.4 Proof of Markoff’s theorem ................... 75 II Repetitions in Words 81 1 The Thue–Morse Word 83 1.1 The Thue–Morse word ...................... 83 1.2 The Thue–Morse morphism ................... 84 1.3 The Tarry-Escott problem .................... 86 1.4 Magic squares ........................... 90 2 Combinatorics of the Thue–Morse Word 93 2.1 Automatic sequences ......................
    [Show full text]
  • CRM Monograph Series ; V
    Volume 27 C CRM R MONOGRAPH M SERIES Centre de Recherches Mathématiques Montréal Combinatorics on Words Christoffel Words and Repetitions in Words Jean Berstel Aaron Lauve Christophe Reutenauer Franco V. Saliola American Mathematical Society Combinatorics on Words Christoffel Words and Repetitions in Words https://doi.org/10.1090/crmm/027 Volume 27 C CRM R MONOGRAPH M SERIES Centre de Recherches Mathématiques Montréal Combinatorics on Words Christoffel Words and Repetitions in Words Jean Berstel Aaron Lauve Christophe Reutenauer Franco V. Saliola The Centre de Recherches Mathématiques (CRM) of the Université de Montréal was created in 1968 to promote research in pure and applied mathematics and related disciplines. Among its activities are special theme years, summer schools, workshops, postdoctoral programs, and publishing. The CRM is supported by the Université de Montréal, the Province of Québec (FQRNT), and the Natural Sciences and Engineering Research Council of Canada. It is affiliated with the Institut des Sciences Mathématiques (ISM) of Montréal, whose constituent members are Concordia University, McGill University, the Université de Montréal, the Université du Québec à Montréal, and the E´ cole Polytechnique. The CRM may be reached on the Web at www.crm.umontreal.ca. American Mathematical Society Providence, Rhode Island USA The production of this volume was supported in part by the Fonds Qu´eb´ecois de la Recherche sur la Nature et les Technologies (FQRNT) and the Natural Sciences and Engineering Research Council of Canada (NSERC). 2000 Mathematics Subject Classification.Primary68R15; Secondary 37B10, 11J70, 68W40. For additional information and updates on this book, visit www.ams.org/bookpages/crmm-27 Library of Congress Cataloging-in-Publication Data Combinatorics on words : Christoffel words and repetitions in words / Jean Berstel ..
    [Show full text]
  • Grasshopper Avoidance of Patterns
    Grasshopper avoidance of patterns Micha lD¸ebski Urszula Pastwa Krzysztof W¸esek Faculty of Mathematics and Information Science Warsaw University of Technology Warsaw, Poland [email protected], [email protected], [email protected] Submitted: Jun 10, 2016; Accepted: Oct 15, 2016; Published: Oct 28, 2016 Mathematics Subject Classifications: 68R15 Abstract Motivated by a geometrical Thue-type problem, we introduce a new variant of the classical pattern avoidance in words, where jumping over a letter in the pattern occurrence is allowed. We say that pattern p 2 E+ occurs with jumps in a word + ∗ ∗ w = a1a2 : : : ak 2 A , if there exist a non-erasing morphism f from E to A and a sequence (i1; i2; : : : ; il) satisfying ij+1 2 fij + 1; ij + 2g for j = 1; 2; : : : ; l − 1, such that f(p) = ai1 ai2 : : : ail : For example, a pattern xx occurs with jumps in a word abdcadbc (for x 7! abc). A pattern p is grasshopper k-avoidable if there exists an alphabet A of k elements, such that there exist arbitrarily long words over A in which p does not occur with jumps. The minimal such k is the grasshopper avoidability index of p. It appears that this notion is related to two other problems: pattern avoidance on graphs and pattern-free colorings of the Euclidean plane. In particular, we show that a sequence avoiding a pattern p with jumps can be a tool 2 to construct a line p-free coloring of R . In our work, we determine the grasshopper avoidability index of patterns αn for all n except n = 5.
    [Show full text]
  • Axel Thue's Papers on Repetitions in Words: a Translation
    Axel Thues pap ers on rep etitions in words a translation J Berstel LITP Institut Blaise Pascal Universite Pierre et Marie Curie Paris France July Contents Preliminaries Notation Co des and enco dings The ThueMorse sequence Symb olic dynamical systems Thues First Pap er Ab out innite sequences of symb ols x x Thues Second Pap er On the relative p osition of equal parts in certain sequences of symb ols Intro ductory Remarks Sequences over two symb ols Sequences over three symb ols First Case aca and bcb are missing Second Case aba and aca are missing Third Case aba and bab are missing Irreducible words over four letters Irreducible words over more than four letters Notes Squarefree morphisms Overlapfree words Avoidable patterns Intro duction In a series of four pap ers which app eared during the p erio d Axel Thue considered several combinatorial problems which arise in the study of sequences of symb ols Two of these pap ers deal with word problems for nitely presented semigroups these pap ers contain the denition of what is now called a Thue system He was able to solve the word problem in sp ecial cases It
    [Show full text]
  • Relations on Words 3
    RELATIONS ON WORDS MICHEL RIGO Abstract. In the first part of this survey, we present classical notions arising in combi- natorics on words: growth function of a language, complexity function of an infinite word, pattern avoidance, periodicity and uniform recurrence. Our presentation tries to set up a unified framework with respect to a given binary relation. In the second part, we mainly focus on abelian equivalence, k-abelian equivalence, combinatorial coefficients and associated relations, Parikh matrices and M-equivalence. In particular, some new refinements of abelian equivalence are introduced. 1. Introduction This paper follows and complements the talk I gave during the conference on Automatic Sequences, Number Theory, and Aperiodic Order held in Delft in October 2015. The aim is to survey various concepts arising in combinatorics on words and present them in a unified and general framework. In Section 3, relatively to a given binary relation over A∗, we define the growth function of a language, the complexity function of an infinite word and the notions of avoidable patterns, periodicity and uniform recurrence. These notions are usually first introduced for the restrictive case of equality of factors, e.g., the complexity function counts the number of factors of length n occurring in a given infinite word but we could count them up to rearrangement of the letters. In the second part of the paper, we review classical binary relations on words where these concepts may be applied. In Section 4, we consider abelian equivalence, then its extension to k-abelian equivalence is presented in Section 5. We pursue in Section 6 with binomial coefficients of words and various equivalence relations that can be associated with.
    [Show full text]
  • Patterns, Pattern Avoidance and Graphs on Words
    Patterns, Pattern Avoidance, and Graphs on Words Thesis submitted in partial fulfilment of the requirements for the award of the degree of Doctor of Philosophy in Computer Science and Engineering by Mrityunjay Singh Under the supervision of Dr. Benny George Kenkireth Department of Computer Science and Engineering Indian Institute of Technology Guwahati Guwahati - 781039 Assam India MAY, 2019 Copyright c Mrityunjay Singh 2019. All Rights Reserved. I dedicate this thesis to my late grand mother. Acknowledgements First and foremost, I thank God Almighty. I would like to take this opportunity to acknowl- edge the support and help of my guide Dr. Benny George Kenkireth. He has motivated, supported, and guided me throughout this work. This work has been a really nice learning experience for me. He has taught me how to communicate mathematics. I would also like to thank my doctoral committee members Prof G. Sajith, Prof. Kalpesh Kapoor, Dr. Deepan- jan Kesh and Prof. K.V. Krishna for their guidance through out the duration of my doctoral studies. I would like to thank to the professors P.A.S. Krishna, Shyamsree Upadhyay, Bhaba Kumar Sharma, Rupam Barman, Bikash Bhattacharya, Deepanjan Kesh, Sushant Karmakar, Pinaki Mitra, Purandar Bhaduri, V. Vijaya Saradhi, Sukumar Nandi, H. Ramesh and T. Venketesh for the various courses they taught me during my stay at IITG. I would also like to thank professors John Tsitsiklis, Vittal Rao, Fedrico Ardila, Arindama Singh, Somenath Biswas and Tim Roughgarden for the various online courses taught by them. I thank the whole IITG fraternity for all the different ways in which they have knowingly and unknowingly help me.
    [Show full text]
  • Some Recent Results on Squarefree Words Publications Du Département De Mathématiques De Lyon, 1985, Fascicule 2B « Compte Rendu Des Journées Infinitistes », , P
    PUBLICATIONS DU DÉPARTEMENT DE MATHÉMATIQUES DE LYON JEAN BERSTEL Some Recent Results on Squarefree Words Publications du Département de Mathématiques de Lyon, 1985, fascicule 2B « Compte rendu des journées infinitistes », , p. 21-36 <http://www.numdam.org/item?id=PDML_1985___2B_21_0> © Université de Lyon, 1985, tous droits réservés. L’accès aux archives de la série « Publications du Département de mathématiques de Lyon » im- plique l’accord avec les conditions générales d’utilisation (http://www.numdam.org/conditions). Toute utilisation commerciale ou impression systématique est constitutive d’une infraction pé- nale. Toute copie ou impression de ce fichier doit contenir la présente mention de copyright. Article numérisé dans le cadre du programme Numérisation de documents anciens mathématiques http://www.numdam.org/ SOME RECENT RESULTS ON SQUAREFREE WORDS by Jean BERSTEL <1 > "Fur die Entwicklung der logischen Wissenschaften wird es, oh ne Rucks icht ouf etwaige Anwendungen, von Bedeutung sein, ausgedehnte Felder fur Spekulation ùber schwierige Problème zu finden. 11 Axel Thue, 1912. 1. Introduction. When Axel Thue wrote these lines in the introduction to his 1912 paper on squarefree words, he certainly did not feel as a theoretical computer scientist. During the past seventy years, there was an increasing interest in squarefree words and more generally in repetitions in words. However, A. Thuefs sentence seems still to hold : in some sense, he said that there is no reason to study squarefree words, excepted that it1s a difficult question, and that it is of primary importance to investigate new domains. Seventy years later, these questions are no longer new, and one may ask if squarefree words served already.
    [Show full text]
  • Avoidable Formulas in Combinatorics on Words
    University of California Los Angeles Avoidable Formulas in Combinatorics on Words A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Mathematics by Ronald James Clark 2001 The dissertation of Ronald James Clark is approved. D. Stott Parker Robert Brown Bruce Rothschild Kirby Baker, Committee Chair University of California, Los Angeles 2001 ii For Azusa, partner-in-crime and partner-in-life iii Table of Contents 1 Preliminaries ............................... 1 1.1 Introduction . 1 1.2 Basic Definitions . 2 1.3 Formulas and Divisibility . 3 1.4 Avoidability . 6 1.5 A Known Algorithm to Determine Avoidability . 10 1.6 Finding the Index of Avoidability . 12 1.7 Locked Formulas . 18 2 Avoidance Bases ............................. 20 2.1 Introduction . 20 2.2 Avoidance Bases . 20 2.3 Finding Avoidance Bases . 23 2.4 Avoidance Bases on Small Alphabets . 27 2.5 Formulas on Three Letters Are 4-Avoidable . 28 3 Minimally Locked Formulas ...................... 44 3.1 Introduction . 44 3.2 Indices of MLF’s on Small Alphabets . 45 3.3 Growth Rates . 48 4 Index 5 Formulas ............................. 53 iv 4.1 Introduction . 53 4.2 The Formula ρ Is 4-Unavoidable . 54 4.3 5-Special Formulas . 54 4.4 5-Special Formulas Are 5-Avoidable . 56 4.5 Index 6? . 65 A Cubefree Words on Two Letters ................... 66 A.1 Introduction . 66 A.2 Fixed Points of Endomorphisms . 67 A.3 A Bound for the Density Function . 71 A.4 Full Density . 73 A.5 Growth Rates . 77 References ................................... 78 v List of Tables 1.1 Cassaigne’s Backtracking Algorithm .
    [Show full text]
  • Appendix a Useful Notions and Facts
    Appendix A Useful notions and facts In this appendix we introduce most of the statistics on permutations and words ap- pearing in this book, as well as several well-known sequences of numbers and certain lattice paths. Also, we state in Table A.7 several classes of restricted permutations together with references and initial values of the corresponding sequences. A.1 Statistics on permutations and words Definition A.1.1. In a permutation π = π1π2 : : : πn, πi is a left-to-right maximum (resp., left-to-right minimum) if πi ≥ πj (resp., πi ≤ πj) for 1 ≤ j ≤ i. Also, πi is a right-to-left maximum (resp., right-to-left minimum) if πi ≥ πj (resp., πi ≤ πj) for i ≤ j ≤ n. Example A.1.2. If π = 3152647 then the left-to-right maxima are 3, 5, 6 and 7, the left-to-right minima are 3 and 1, the right-to-left maximum is 7, and the right-to-left minima are 7, 4, 2 and 1. In Table A.1 we define all permutation statistics appearing in the book. The list of statistics is essentially taken from [259]. Note that many of the statistics defined make sense for words, though not all. For example, the statistic head :i, the position of the smallest letter, is not defined for words as we may have several instances of the smallest letter. The expression π1 appearing in Table A.1 means the permutation beginning with π and ending with a letter that is larger that any letter in π. Remark A.1.3.
    [Show full text]
  • Pattern Occurrence Statistics and Applications to the Ramsey Theory
    Pattern occurrence statistics and applications to the Ramsey theory of unavoidable patterns∗ Jim Tao1 October 29, 2018 Abstract As suggested by Currie, we apply the probabilistic method to problems regarding pattern avoidance. Using techniques from analytic combinatorics, we calculate asymptotic pattern oc- currence statistics and use them in conjunction with the probabilistic method to establish new results about the Ramsey theory of unavoidable patterns in the full word case (both nonabelian sense and abelian sense) and in the partial word case. Keywords: Combinatorics on words; Partial words; Unavoidable patterns; Abelian patterns; Probabilistic method; Analytic combinatorics; Ramsey theory. 1 Introduction In [10], Currie reviews results and formulates a large number of open problems concerning pattern avoidance as well as an abelian variation of it. Given a pattern p over an alphabet and a word w V over an alphabet , we say that w encounters p if there exists a nonerasing morphism h : ∗ ∗ A V →A such that h(p) is a factor of w; otherwise w avoids p. In other words, w encounters p = p p , 1 · · · n where p ,...,p , if w contains u u as a factor, where u , . , u are nonempty words in 1 n ∈ V 1 · · · n 1 n ∗ satisfying u = u whenever p = p . On the other hand, w encounters p = p p in the A i j i j 1 · · · n abelian sense if w contains u u as a factor, where u can be obtained from u by rearranging 1 · · · n i j letters whenever pi = pj; otherwise w avoids p in the abelian sense.
    [Show full text]