Multiset Controlled Grammars: a Normal Form and Closure Properties

Total Page:16

File Type:pdf, Size:1020Kb

Multiset Controlled Grammars: a Normal Form and Closure Properties Indonesian Journal of Electrical Engineering and Computer Science Vol. 8, No. 1, October 2017, pp. 36 ~ 42 DOI: 10.11591/ijeecs.v8.i1.pp36-42 36 Multiset Controlled Grammars: A Normal Form and Closure Properties Salbiah Ashaari1, Sherzod Turaev*2, M. Izzuddin M. Tamrin3, Abdurahim Okhunov4, Tamara Zhukabayeva5 1,2,3Kulliyyah of Information and Communication Technology, International Islamic University Malaysia 53100 Kuala Lumpur, Malaysia 4Kulliyyah of Engineering, International Islamic University Malaysia, 53100 Kuala Lumpur, Malaysia 5Faculty of Information Technology, L.N. Gumilyov Eurasian National University, 010008 Astana, Kazakhstan *Corresponding author, e-mail: [email protected], [email protected], [email protected], [email protected], [email protected] Abstract Multisets are very powerful and yet simple control mechanisms in regulated rewriting systems. In this paper, we review back the main results on the generative power of multiset controlled grammars introduced in recent research. It was proven that multiset controlled grammars are at least as powerful as additive valence grammars and at most as powerful as matrix grammars. In this paper, we mainly investigate the closure properties of multiset controlled grammars. We show that the family of languages generated by multiset controlled grammars is closed under operations union, concatenation, kleene-star, homomorphism and mirror image. Keywords: multiset, regulated grammar, multiset controlled grammar, generative capacity, closure property Copyright © 2017 Institute of Advanced Engineering and Science. All rights reserved. 1. Introduction A regulated grammar is depicted as a grammar with an additional (control) mechanism that able to restrict the use of the productions (a.k.a. rules) during derivation process in order to avoid certain derivations as well as to obtain a subset of the language generated in usual way. The primary motivation for introducing regulated grammars came from the fact that a plenty of languages of interest are seen to be non-context free such as the languages with reduplication, multiple agreement or crossed agreement properties. Therefore, the main aim of regulated grammars is to achieve a higher computational power and yet at the same time does not increase the complexity of the model [1, 2]. It is believed that the first regulated grammar, which is a matrix grammar, was introduced by Abraham in 1965 with the idea such in a derivation step, a sequence of productions are applied together [3]. Since then, a plenty of regulated grammars have been introduced and investigated in several papers such as [1-13], where each has a different control mechanism, and provides useful structures to handle a variety of issues in formal languages, programming languages, DNA computing, security, bioinformatics and many other areas. In this paper, we continue our research on multiset controlled grammars (see [4]); we investigate the closure properties of the family of languages generated by multiset controlled grammars. The study of closure properties is one of a crucial investigation in formal language theory since it provides a meaningful merit in both theory and practice of grammars. This paper is structured as follows. First, we give some basic notations and knowledge concerning to the theory of formal languages in which include grammars with regulated rewriting and set-theoretic operations on languages that will be used throughout the study. Then, we recall the definitions of multiset controlled grammars defined in [4] together with results on their generative power. Then, we demonstrate that for multiset controlled grammars one can construct equivalent normal forms, which will be used in study of the closure properties. In the last section, we put in a nutshell all materials studied previously with some possible future research topics on those grammars. Received June 6, 2017; Revised August 16, 2017; Accepted September 1, 2017 IJEECS ISSN: 2502-4752 37 2. Preliminaries In this section, we present some basic notations, terminologies and concepts concerning to the formal languages theories, multiset and regulated rewriting grammars that will be used in the following sections. For details, the reader is referred to [1, 2, 14-16]. Throughout the paper, we use the following basic notations. Symbols and represent the set membership and negation of set membership of an element to a set. Symbol signifies the set inclusion and marks the strict inclusion. Then, for a two sets and ∈ , ∉ if and . Further, notation | | is used to portray the cardinality of a set in which is⊆ the number of elements in the set⊂ as well as notation 2 is used to depict the 퐴power 퐵set퐴 of⊊ a퐵 set퐴 ⊆. Symbol퐵 퐴 ≠ denotes퐵 the empty set.퐴 The sets of integer,퐴 natural, real and rational퐴 numbers are denoted by , , and , respectively.퐴 퐴 ∅ An alphabet is a finite and nonempty set of symbols or letters, which is ℤdenotedℕ ℝ byℚ and a string (sometimes referred as word) over the alphabet is a finite sequence of symbols (concatenation of symbols) from . The string without symbols is called null or empty string andΣ denoted by . The set of all strings (including ) over the alphabet,Σ is denoted by and = { }. A string is a substringΣ of a string if and only if there exist , such∗ that += ∗ where휆 , , , . String is a휆 proper substring of ifΣ and Σ . The 1 2 lengthΣ Σ of− string휆 , denoted푤 by | |∗, is the number of symbols휈 in the string. Obviously,푢 푢 | | = 0 and 1 2 1 2 |휈 |푢=푤|푢| + | |, 푢, 푢 푤 . 휈A ∈languageΣ is푤 a subset of . A language휈 is푤 ≠ -휆free if 푤 ≠ 휈. For a set , a multiset푤 is a mapping∗ 푤: . The set of all multisets∗ over is denoted by 휆 . Then, the푤휈 set 푤 is 푣a called푤 휈 ∈ theΣ basic set of퐿 . For a multisetΣ and퐿 element휆 휆 ∉⊕퐿 , ( ) represents퐴 the number of occurrences휇 퐴 → ℕ of ⊕in . ⊕ 퐴 퐴 A퐴 Chomsky grammar is a quadruple퐴 = ( , , ,휇 )∈ 퐴where is an alphabet푎 ∈ 퐴 휇 푎of nonterminals, is an alphabet of terminals푎 and휇 = , is the start symbol and is a finite set of productions such that ( ) 퐺( 푁)푇×푆(푃 ) . We푁 simply use notation for a production푇 ( , ) . A direct derivation∗푁 ∩ 푇 relation∅∗ 푆 ∈ over푁 (∗ ) , denoted by 푃 , is defined as provided if and only푃 ⊆ if푁 there∪ 푇 푁is 푁a ∪rule푇 푁 ∪ 푇 such ∗that = and 퐴 =→ 푤 for some , 퐴 푤( ∈ 푃 ) . Since is a relation, then its th,푁 ∪ 푇 0, power is denoted⇒ 1 2 by , its transitive푢 ⇒ 푣 closure by ∗ , and its reflexive and퐴 →transitive푤 ∈ 푃 closure by푢 푥. 퐴A푥 string 1 2 1 2 푣 푥(푛푤푥 ) is a sentential푥 푥 ∈ form푁 ∪ 푇if + . If⇒ , then is called푛 a푛 sentence≥ or ∗a terminal …. ⇒ ∗ ⇒ ∗ ∗ ⇒ string and is said to be a successful derivation. We also use the notations or to 푤 ∈ 푁 ∪ 푇 푆 ⇒ 푤 푤 ∈ 푇 푤 푚 푟0푟1 푟푛 denote the derivation∗ that uses the sequence of rules = … , , 1 . The language 푆ge⇒nerated푤 by , denoted by ( ), is defined as ( ) = { | ⇒ ��}�.� �Two� 0 1 푛 푖 grammars and are called to be equivalent if and only if푚 they푟 generate푟 푟 푟 the∈ 푃 ∗same≤ 푖 ∗language,≤ 푛 i.e., ( ) = ( ). There are퐺 five main types퐿 퐺of grammars depending퐿 퐺 on their푤 ∈ 푇productions푆 ⇒ 푤 forms 1 2 : 퐺 퐺 1 2 a)퐿 퐺 a regular퐿 퐺 if and , 푢 →b)푣 a linear if ∗ ∗ and , c) a context-free푣 ∈ if∗푇 ∪ ∗푁( 푇 ∗ ) 푢and∈ 푁 , d) a context-푣sensitive∈ 푇 ∪ 푇 if 푁푇 ( ∗ 푢 )∈ 푁 ( ) and ( ) where | | | | and e) a recursively enumerable푣 ∈ 푁 ∪ 푇 or unrestricted∗푢 ∈+ 푁 if∗ ( ) and+ ( ) where contains at least one 푢nonterminal∈ 푁 ∪ 푇 symbol.푁 푁 ∪ 푇 푣 ∈ 푁 ∪+ 푇 푢 ≤ ∗푣 The families of languages generated by arbitrary,푢 ∈context푁 ∪ 푇 sensitive,푣 context∈ 푁 ∪ 푇free, regular,푢 linear and finite grammars are denoted by , , , , , and , respectively. For these language families, Chomsky hierarchy holds: 퐑퐑 퐂퐂 퐂퐂 퐑퐑퐑 퐋퐋퐋 � . �Before⊂ 퐑퐑퐑 moving⊂ 퐋퐋퐋 to ⊂the퐂퐂 operations⊂ 퐂퐂 ⊂ 퐑퐑 on languages, we recall some definition of regulated grammars mentioned in this study. A matrix grammar is a quadruple = ( , , , ) where , and are defined as for context-free grammar and is a set of matrices, that are finite sequences of context-free rules from × ( ) . Its language is defined퐺 푁by푇 푆( 푀) = { 푁| 푇 푆 ∗ 푀 ∗ and }. 휋 푁 푁 ∪ 푇 퐿 퐺 푤 ∈ 푇 푆 An additive∗ valence grammar is a 5-tuples = ( , , , , ) where , , , are defined as⇒ 푤for a 휋context∈ 푀 -free grammar and is a mapping from into set (set ). The language generated by the grammar consists of all string퐺 푁 푇 such푆 푃 푣that there푁 푇is 푆a 푃derivation 푣 ∗푃 ℤ ℚ 푤 ∈ 푇 푆 Multiset Controlled Grammars: Normal Form and Closure Properties (Salbiah Ashaari) 38 ISSN: 2502-4752 … where ( ) = 0 . The families of languages generated by matrix and additive 푟1푟2 푟푛 valence grammars 푛(with erasing rues) are denoted by , , ( , ), respectively. 푘=1 푘 Now,������ �we푤 recall some∑ 푣 set푟 -theoretic operations that will be used to investigate휆 휆 the closure properties of a grammar. Let and be two languages.퐌퐌퐌 Then,푎퐕퐌퐋 the퐌퐌퐌 union 푎(퐕퐌퐋), intersection ( ), difference ( ) and concatenation ( ) of and are defined as: 1 2 퐿 퐿 ∪ ∩ 1 2 − = { ⋅ }퐿 퐿 = { } 1 2 1 2 퐿 ∪ 퐿 = {푤 ∶ 푤 ∈ 퐿 표푟 푤 ∈ 퐿 } 1 2 1 2 퐿 ∩ 퐿 = { 푤 ∶ 푤 ∈ 퐿 푎푛푎 푤 ∈ 퐿 } 1 2 1 2 퐿 − 퐿 푤 ∶ 푤 ∈ 퐿 푎푛푎 푤 ∉ 퐿 1 2 1 2 1 1 2 2 The complement퐿 ⋅ 퐿 of푤 푤 ∶ 푤 with∈ 퐿 respect푎푛푎 푤 to∈ 퐿 is defined as = . For a language , the iterations of is defined as∗: ∗ ∗ 퐿 ⊆ Σ Σ 퐿� Σ − 퐿 퐿 =퐿{ }, 0 = , 퐿1 = 휆 , 퐿 2 퐿 퐿 = 퐿퐿 (iteration closure: Kleene star). ∗⋯= 푖 (positive iteration closure: Kleene plus). 푖≥0 퐿+ ⋃ 퐿 푖 푖≥1 Given퐿 ⋃ two퐿 alphabets , , a mapping : is called a morphism or synonymously a homomorphism if and only if: ∗ ∗ 1 2 1 2 (i) for every , there existsΣ Σ such that =ℎ Σ( )→ andΣ is distinct, (ii) for every , ∗ ( ) = ( ) (∗ ) .
Recommended publications
  • A Geometric Hierarchy Beyond Context-Free Languages*
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Theoretical Computer Science 104 (1992) 235-261 235 Elsevier A geometric hierarchy beyond context-free languages* David J. Weir School of Cognitive and Computing Science, University of Sussex, Falmer. Brighton BNI 9QH, UK Communicated by M.A. Harrison Received October 1990 Revised March 1991 Abstract Weir, D.J., A geometric hierarchy beyond context-free languages, Theoretical Computer Science 104 (1992) 235-261. This paper defines a geometric hierarchy of language classes the first member of which is context-free languages. This hierarchy generalizes the difference between context-free languages and the class of languages generated by four weakly equivalent grammar formalisms that are of interest to computa- tional linguists. A grammatical characterization of the hierarchy is given using a variant of control grammars. Each member of the progression is shown to share many of the attractive features of context-free grammars, in particular, we show that each member is an abstract family of languages. We give a progression of automata and show that it corresponds exactly to the language hierarchy defined with control grammars. 1. Introduction Recent work in computational linguistics suggests that more generative capacity than context-free grammars (CFG) is required to describe natural languages [14, 31. This has led to increased interest in grammar formalisms that are able to provide this additional power without sacrificing the attractive features of context-free languages (CFL) such as polynomial recognition, and various closure and decidability proper- ties. Several formalisms have been independently proposed as being suitable in this regard and it has been shown that the same class of languages is generated by four of these formalisms [19,22, 18,211.
    [Show full text]
  • Parsing Based on N-Path Tree-Controlled Grammars
    Theoretical and Applied Informatics ISSN 1896–5334 Vol.23 (2011), no. 3-4 pp. 213–228 DOI: 10.2478/v10179-011-0015-7 Parsing Based on n-Path Tree-Controlled Grammars MARTIN Cˇ ERMÁK,JIR͡ KOUTNÝ,ALEXANDER MEDUNA Formal Model Research Group Department of Information Systems Faculty of Information Technology Brno University of Technology Božetechovaˇ 2, 612 66 Brno, Czech Republic icermak, ikoutny, meduna@fit.vutbr.cz Received 15 November 2011, Revised 1 December 2011, Accepted 4 December 2011 Abstract: This paper discusses recently introduced kind of linguistically motivated restriction placed on tree-controlled grammars—context-free grammars with some root-to-leaf paths in their derivation trees restricted by a control language. We deal with restrictions placed on n ¸ 1 paths controlled by a deter- ministic context–free language, and we recall several basic properties of such a rewriting system. Then, we study the possibilities of corresponding parsing methods working in polynomial time and demonstrate that some non-context-free languages can be generated by this regulated rewriting model. Furthermore, we illustrate the syntax analysis of LL grammars with controlled paths. Finally, we briefly discuss how to base parsing methods on bottom-up syntax–analysis. Keywords: regulated rewriting, derivation tree, tree-controlled grammars, path-controlled grammars, parsing, n-path tree-controlled grammars 1. Introduction The investigation of context-free grammars with restricted derivation trees repre- sents an important trend in today’s formal language theory (see [3], [6], [10], [11], [12], [13] and [15]). In essence, these grammars generate their languages just like ordinary context-free grammars do, but their derivation trees have to satisfy some simple pre- scribed conditions.
    [Show full text]
  • Grammars Controlled by Petri Nets
    CORE Metadata, citation and similar papers at core.ac.uk Provided by The International Islamic University Malaysia Repository ChapterChapter 15 0 Grammars Controlled by Petri Nets J. Dassow, G. Mavlankulov, M. Othman, S. Turaev, M.H. Selamat and R. Stiebe Additional information is available at the end of the chapter http://dx.doi.org/10.5772/50637 1. Introduction Formal language theory, introduced by Noam Chomsky in the 1950s as a tool for a description of natural languages [8–10], has also been widely involved in modeling and investigating phenomena appearing in computer science, artificial intelligence and other related fields because the symbolic representation of a modeled system in the form of strings makes its processes by information processing tools very easy: coding theory, cryptography, computation theory, computational linguistics, natural computing, and many other fields directly use sets of strings for the description and analysis of modeled systems. In formal language theory a model for a phenomenon is usually constructed by representing it as a set of words, i.e., a language over a certain alphabet, and defining a generative mechanism, i.e., a grammar which identifies exactly the words of this set. With respect to the forms of their rules, grammars and their languages are divided into four classes of Chomsky hierarchy: recursively enumerable, context-sensitive, context-free and regular. Context-free grammars are the most investigated type of Chomsky hierarchy which, in addition, have good mathematical properties and are extensively used in many applications of formal languages. However, they cannot cover all aspects which occur in modeling of phenomena.
    [Show full text]
  • Automaten Und Formale Sprachen Allrode (Harz), 27
    HEORIE- TTAG 2011 21. Theorietag Automaten und Formale Sprachen Allrode (Harz), 27. – 29. September 2011 Tagungsband Jürgen Dassow, Bianca Truthe (Hrsg.) Titel: 21. Theorietag „Automaten und Formale Sprachen“, Allrode (Harz), 27. – 29.9.2011, Tagungsband Veröffentlicht durch die Otto-von-Guericke-Universität Magdeburg Fakultät für Informatik Universitätsplatz 2 39106 Magdeburg Herausgegeben von Jürgen Dassow und Bianca Truthe, 2011 Das Copyright liegt bei den Autoren der Beiträge. Vorwort Der Theorietag ist die Jahrestagung der Fachgruppe Automaten und Formale Sprachen der Ge- sellschaft für Informatik. Er wurde vor 20 Jahren in Magdeburg ins Leben gerufen. Seit dem Jahre 1996 wird der Theorietag von einem eintägigen Workshop mit eingeladenen Vorträgen begleitet. Im Laufe des Theorietags findet auch die jährliche Fachgruppensitzung statt. Die bisherigen Theorietage wurden in Magdeburg (1991), in Kiel (1992), auf Schloß Dag- stuhl (1993), in Herrsching bei München (1994 und 2003), auf Schloß Rauischholzhausen (1995), in Cunnersdorf in der Sächsischen Schweiz (1996), in Barnstorf bei Bremen (1997), in Riveris bei Trier (1998), in Schauenburg-Elmshagen bei Kassel (1999), in Wien (2000 und 2006), in Wendgräben bei Magdeburg (2001), in der Lutherstadt Wittenberg (2002 und 2009), in Caputh bei Potsdam (2004), in Lauterbad bei Freudenstadt (2005), in Leipzig (2007), in Wettenberg-Launsbach bei Gießen (2008) und in Baunatal bei Kassel (2010) ausgerichtet. Der diesjährige Theorietag wird nach 1991 und 2001 wieder von der Arbeitsgruppe „For- male Sprachen und Automaten“ der Otto-von-Guericke-Universität Magdeburg organisiert. Er findet mit dem vorangehenden Workshop vom 27. bis 29. September 2011 in Allrode im Harz statt. Auf dem Workshop tragen • Frank Drewes aus Umeå, • Manfred Droste aus Leipzig, • Florin Manea aus Bukarest, • Sergey Verlan aus Paris und • Georg Zetzsche aus Kaiserslautern vor.
    [Show full text]
  • TH`ESE Zhe CHEN the Control System in Formal
    N◦ d'ordre : 1036 THESE` pr´esent´ee`a l'Institut National des Sciences Appliquees´ de Toulouse pour l'obtention du titre de DOCTEUR de l'Universit´ede Toulouse d´elivr´epar l'INSA Sp´ecialit´e:Syst`emesInformatiques par Zhe CHEN Lab. Toulousain de Technologie et d'Ingenierie´ des Systemes` (LATTIS) Laboratoire d'Analyse et d'Architecture des Systemes` (LAAS-CNRS) Ecole´ Doctorale Syst`emes Titre de la th`ese: The Control System in Formal Language Theory and The Model Monitoring Approach for Reliability and Safety Soutenue le 9 Juillet 2010, devant le jury : Rapporteur : Maritta Heisel Professeur `al'Universit¨atDuisburg-Essen, Germany Rapporteur : Fabrice Bouquet Professeur `al'INRIA, Universit´ede Franche-Comt´e Examinateur: Gilles Motet Professeur `al'INSA de Toulouse - Directeur de th`ese Examinateur: Karama Kanoun Directeur de Recherche au LAAS-CNRS Examinateur: Jean-Paul Blanquart Astrium, European Aeronautic Defence and Space (EADS) Acknowledgement Although I have been mostly working independently in the lab during all those years, I would not have been able to do this thesis without the support, advice and encouragement of others, teachers, friends and colleagues. Accordingly, I would like to use this opportunity to express my gratitude to a number of people who over the years have contributed in various ways to the completion of this work. In the first place I would like to record my gratitude to my thesis supervisor Prof. Gilles Motet for guiding me in my research, and at the same time, for letting me the freedom to take initiatives in the development of my lines of research.
    [Show full text]
  • PETRI NET CONTROLLED GRAMMARS Sherzod Turaev ISBN:978-84-693-1536-1/DL:T-644-2010
    UNIVERSITAT ROVIRA I VIRGILI PETRI NET CONTROLLED GRAMMARS Sherzod Turaev ISBN:978-84-693-1536-1/DL:T-644-2010 Facultat de Lletres Departament de Filologies Romàniques Petri Net Controlled Grammars PhD Dissertation Presented by Sherzod Turaev Supervised by Jürgen Dassow Tarragona, 2010 UNIVERSITAT ROVIRA I VIRGILI PETRI NET CONTROLLED GRAMMARS Sherzod Turaev ISBN:978-84-693-1536-1/DL:T-644-2010 Supervisor: Prof. Dr. Jürgen Dassow Fakultät für Informatik Institut für Wissens- und Sprachverarbeitung Otto-von-Guericke-Universität Magdeburg D-39106 Magdeburg, Germany Tutor: Dr. Gemma Bel Enguix Grup de Recerca en Lingüística Matemàtica Facultat de Lletres Universitat Rovira i Virgili Av. Catalunya, 35 43002 Tarragona, Spain UNIVERSITAT ROVIRA I VIRGILI PETRI NET CONTROLLED GRAMMARS Sherzod Turaev ISBN:978-84-693-1536-1/DL:T-644-2010 Acknowledgements It is a pleasure to thank many people who helped me to made this thesis possible. First of all I would like to express my sincere gratitude to my super- visor, Professor Jürgen Dassow. He helped me with his enthusiasm, his inspiration, and his efforts. During the course of this work, he provided with many helpful suggestions, constant encouragement, important advice, good teaching, good company, and lots of good ideas. It has been an honor to work with him. I would like to thank the head of our research group, Professor Car- los Martín-Vide, for his great effort in establishing the wonderful circum- stances of PhD School on Formal Languages and Applications, and for his help in receiving financial support throughout my PhD. This work was made thanks to the financial support of research grant 2006FI-01030 from AGAUR of Catalonia Government.
    [Show full text]
  • Controlled Pure Grammar Systems
    Journal of Universal Computer Science, vol. 18, no. 14 (2012), 2024-2040 submitted: 14/1/12, accepted: 16/4/12, appeared: 28/7/12 © J.UCS Controlled Pure Grammar Systems Alexander Meduna (Brno University of Technology, Faculty of Information Technology IT4Innovations Centre of Excellence, Brno, Czech Republic meduna@fit.vutbr.cz) Petr Zemek (Brno University of Technology, Faculty of Information Technology IT4Innovations Centre of Excellence, Brno, Czech Republic izemek@fit.vutbr.cz) Abstract: This paper discusses grammar systems that have only terminals, work in the leftmost way, and generate their languages under the regulation by control lan- guages over rule labels. It establishes three results concerning their generative power. First, without any control languages, these systems are not even able to generate all context-free languages. Second, with regular control languages, these systems, hav- ing no more than two components, characterize the family of recursively enumerable languages. Finally, with control languages that are themselves generated by regular- controlled context-free grammars, these systems over unary alphabets generate nothing but regular languages. In its introductory section, the paper gives a motivation for in- troducing these systems, and in the concluding section, it formulates several open problems. Key Words: formal languages, pure grammar systems, controlled derivations Category: F.4.2, F.4.3, F.1.1 1 Introduction Indisputably, context-free grammars are central to formal language theory as a whole (see [Rozenberg and Salomaa 1997, Dassow and P˘aun 1989, Meduna 2000, Salomaa 1973]). It thus comes as no surprise that this theory has introduced a broad variety of their modified versions, ranging from simplified and restricted versions up to fundamentally generalized systems based upon these grammars.
    [Show full text]
  • Grammars with Regulated Rewriting
    Grammars with Regulated Rewriting J¨urgen Dassow Otto-von-Guericke-Universit¨atMagdeburg PSF 4120, D – 39016 Magdeburg e-mail: [email protected] Abstract: Context-free grammars are not able to cover all linguistic phenomena. Thus we define some types of grammars where context-free rules are used and restric- tion imposed on the derivations. We illustrate the concepts by example, compare the generative power, give some closure and decidability properties and basic facts on syntactic complexity. 0. Introduction The regular and context-free grammars/languages are the most investigated types of formal languages which, in addition, have a lot of nice properties (see [9, 11, 14] and the corresponding chapters of this volume). However, these types of grammars/languages are not able to cover all aspects which occur in modelling of phenomena by means of formal languages. Here we only mention an example from natural languages. Let us consider the following sequence of a German dialect spoken in Switzerland: S1=Jan s¨aitdas mer em Hans h¨alfed. (Jan says that we helped Hans.) S2=Jan s¨aitdas mer em Hans es Huus h¨alfed aastriche. (Jan said that we helped Hans to paint the house.) S3=Jan s¨aitdas mer d’chind em Hans es Huus l¨ondh¨alfed aastriche. (Jan said that we allowed the children to help Hans to paint the house.) Further, let h be the morphisms which maps Hans and h¨alfed to the letter a, Huus, aastriche, d’chind and l¨ond to b and all other words of the sentences to the empty word.
    [Show full text]
  • Error Tolerance for the Recognition of Faulty Strings in a Regulated Grammar Using Fuzzy Sets
    Sådhanå (2018) 43:134 © Indian Academy of Sciences https://doi.org/10.1007/s12046-018-0833-ySadhana(0123456789().,-volV)FT3](0123456789().,-volV) Error tolerance for the recognition of faulty strings in a regulated grammar using fuzzy sets AJAY KUMAR*, NIDHI KALRA and SUNITA GARHWAL Computer Science and Engineering Department, Thapar Institute of Engineering & Technology, Patiala 147004, India e-mail: [email protected] MS received 25 July 2017; revised 5 January 2018; accepted 8 January 2018; published online 10 July 2018 Abstract. To overcome the limitations of context-free and context-sensitive grammars, regulated grammars have been proposed. In this paper, an algorithm is proposed for the recognition of faulty strings in regulated grammar. Furthermore, depending on the errors and certainty, it is decided whether the string belongs to the 2 language or not based on string membership value. The time complexity of the proposed algorithm is O(|GR|·|w|), where |GR| represents the number of production rules and |w| is the length of the input string, w. The reader is provided with numerical examples by applying the algorithm to regularly controlled and matrix grammar. Finally, the proposed algorithm is applied in the Hindi language for the recognition of faulty strings in regulated grammar as a real-life application. Keywords. Fuzzy regulated grammar; fuzzy sets; Chomsky normal form; regularly controlled grammar; matrix grammar. 1. Introduction inference methods [10–12] are used in the design of fuzzy systems. Context-free grammars are not able to cover all aspects of Motivated by the prior work of recognition of imperfect linguistic phenomena. The membership of a string in a strings in context-free grammar [13] and context-sensitive context-sensitive grammar is decided in exponential time.
    [Show full text]
  • Brno University of Technology Regulated Grammars
    BRNO UNIVERSITY OF TECHNOLOGY VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ FACULTY OF INFORMATION TECHNOLOGY DEPARTMENT OF INFORMATION SYSTEMS FAKULTA INFORMAČNÍCH TECHNOLOGIÍ ÚSTAV INFORMAČNÍCH SYSTÉMŮ REGULATED GRAMMARS: CONCEPTS, PROPERTIES AND APPLICATIONS REGULOVANÉ GRAMATIKY: KONCEPTY, VLASTNOSTI A VYUŽITÍ MASTER’S THESIS DIPLOMOVÁ PRÁCE AUTHOR Bc. PETR BEDNÁŘ AUTOR PRÁCE SUPERVISOR Prof. RNDr. ALEXANDER MEDUNA, CSc. VEDOUCÍ PRÁCE BRNO 2016 Abstract This thesis is investigating regulated grammars. Introduces new modifications of existing regulated grammars. Introduces parsing methods of newlz introduces modifications. Dis- cusses problems of determinism in definition of grammars. It studies the expressive strength of these new modifications applied on regular-controlled grammars. Abstrakt Tato práce se zabývá regulovanými gramatikami. Zavádí nové modifikace existujících reg- ulovaných gramatik. Pro tyto modifikace zavádí metody syntaktické analýzi. Diskutuje problémy determinismu v definici gramatik. Studuje sílu nově uvedených modifikací ap- likovaných na regulárně regulované gramatiky. Keywords regulated grammars, determinism, syntax analysis, regular-controlled grammars, LL gram- mars Klíčová slova regulované gramatiky, determinismus, syntaktická analýza, regulárně regulované gramatiky, LL gramatiky Reference BEDNÁŘ, Petr. Regulated Grammars: Concepts, Properties and Applications. Brno, 2016. Master’s thesis. Brno University of Technology, Faculty of Information Technology. Supervisor Meduna Alexander. Regulated Grammars: Concepts, Properties and Applications Declaration Hereby I declare that this master’s thesis was prepared as an original author’s work under the supervision of prof. RNDr. Alexandra Meduny, CSc. All the relevant information sources, which were used during preparation of this thesis, are properly cited and included in the list of references. ....................... Petr Bednář May 25, 2016 Acknowledgements I would like to express my gratitude to my supervisor prof. RNDr.
    [Show full text]
  • Universität Tübingen 15
    Wilhelm Schickard-Institut für Informatik 15. Theorietag der GI Fachgruppe 0.1.5 Automaten und Forn1ale Sprachen Henning Fernau (Ed.) WSI-2005-16 Universität Tübingen 15. Theorietag der GI Fachgruppe 0.1.5 A uton1aten und Forn1ale Sprachen Henning Fernau (Ed.) WSI-2005-16 Henning Fernau Wilhelm-Schickard-Institut für Informatik Universität Tübingen Sand 13 D-72076 Tübingen Germany E-Mail: f ernau©informatik. uni-tuebingen. de Telefon: (07071) 29-77565 Telefax: (07071) 29-5061 © Wilhelm-Schickard-Institut für Informatik, 2005 ISSN 0946-3852 15. Theorietag der GI Fachgruppe 0.1.5 Automaten und Formale Sprachen Henning Fernau (Ed.) WSI-2005-16 Dies ist nun bereits der 15. "Jahrestag" der GI Fachgruppe 0.1.5 "Automa­ ten und Formale Sprachen," der "reihum" an verschiedenen Orten Deutschlands (und auch Österreichs) stattfindet. Am 28. und am 29. September 2005 fand die­ ses Treffen statt in Lauterbad bei Freudenstadt, organisatorisch betreut von der Formalsprachlergruppe in Tübingen. Der Theorietag gibt die Gelegenheit, Ergebnisse formalsprachlicher Forschun­ gen aus Deutschland und dem angrenzenden Ausland kennenzulernen und natür­ lich eigene Resultate vorzustellen. Wichtiger vielleicht ist noch der persönliche Kontakt, der durch solche Treffen ermöglicht wird. Manch eine Arbeit wäre ver­ mutlich gar nicht zustande gekommen ohne vorherige Treffen, Gespräche und Anregungen. Daneben soll der Theorietag Gelegenheit geben, ein bestimmtes Themenge­ biet aus dem Bereich der Formalen Sprachen (im weitesten Sinne) vertiefend zu behandeln. Dazu dienen eingeladene Vorträge an einem dem "eigentlichen" Theorietag (der sich ja über anderthalb Tage erstreckt) "vorgelagerten" Tag. In diesem Jahr ist das Thema dieser Überblicksvorträge "Lernen von Automaten und Grammatiken." Als Sprecher konnten gewonnen werden: • Colin de la Higuera von der Universität von St.
    [Show full text]
  • Binary Context-Free Grammars
    S S symmetry Article Binary Context-Free Grammars Sherzod Turaev 1,* , Rawad Abdulghafor 2 , Ali Amer Alwan 2 , Ali Abd Almisreb 3 and Yonis Gulzar 4 1 Department of Computer Science & Software Engineering, College of Information Technology, United Arab Emirates University, Al Ain 15551, UAE 2 Department of Computer Science, Faculty of Information and Communication Technology, International Islamic University Malaysia, Gombak, Selangor 53100, Malaysia; [email protected] (R.A.); [email protected] (A.A.A.) 3 Faculty of Engineering and Natural Sciences, International University of Sarajevo, 71210 Sarajevo, Bosnia and Herzegovina; [email protected] 4 Department of Management Information Systems, King Faisal University, Al-Ahsa 31982, Saudi Arabia; [email protected] * Correspondence: [email protected]; Tel.: +971-58-567-6013 Received: 17 June 2020; Accepted: 21 July 2020; Published: 24 July 2020 Abstract: A binary grammar is a relational grammar with two nonterminal alphabets, two terminal alphabets, a set of pairs of productions and the pair of the initial nonterminals that generates the binary relation, i.e., the set of pairs of strings over the terminal alphabets. This paper investigates the binary context-free grammars as mutually controlled grammars: two context-free grammars generate strings imposing restrictions on selecting production rules to be applied in derivations. The paper shows that binary context-free grammars can generate matrix languages whereas binary regular and linear grammars have the same power as Chomskyan regular and linear grammars. Keywords: formal language; binary grammar; context-free grammar; matrix grammar; relational grammar; computation power; chomsky hierarchy 1. Introduction A “traditional” phrase-structure grammar (also known as a Chomskyan grammar) is a generative computational mechanism that produces strings (words) over some alphabet starting from the initial symbol and sequentially applying production rules that rewrite sequences of symbols [1–3].
    [Show full text]