CDM [1Ex]Context-Sensitive Grammars
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  Formal Languages and Automata Theory Géza Horváth, Benedek Nagy Szerzői Jog © 2014 Géza Horváth, Benedek Nagy, University of DebrecenFormal Languages and Automata Theory Géza Horváth, Benedek Nagy Created by XMLmind XSL-FO Converter. Formal Languages and Automata Theory Géza Horváth, Benedek Nagy Szerzői jog © 2014 Géza Horváth, Benedek Nagy, University of Debrecen Created by XMLmind XSL-FO Converter. Tartalom Formal Languages and Automata Theory .......................................................................................... vi Introduction ...................................................................................................................................... vii 1. Elements of Formal Languages ...................................................................................................... 1 1. 1.1. Basic Terms .................................................................................................................... 1 2. 1.2. Formal Systems .............................................................................................................. 2 3. 1.3. Generative Grammars .................................................................................................... 5 4. 1.4. Chomsky Hierarchy ....................................................................................................... 6 2. Regular Languages and Finite Automata ...................................................................................... 12 1. 2.1. Regular Grammars ....................................................................................................... 12 2. 2.2. Regular Expressions ....................................................................................................
- 
												  INF2080 Context-Sensitive LangugaesINF2080 Context-Sensitive Langugaes Daniel Lupp Universitetet i Oslo 15th February 2017 Department of University of Informatics Oslo INF2080 Lecture :: 15th February 1 / 20 Context-Free Grammar Definition (Context-Free Grammar) A context-free grammar is a 4-tuple (V ; Σ; R; S) where 1 V is a finite set of variables 2 Σ is a finite set disjoint from V of terminals 3 R is a finite set of rules, each consisting of a variable and of a string of variables and terminals 4 and S is the start variable Rules are of the form A ! B1B2B3 ::: Bm, where A 2 V and each Bi 2 V [ Σ. INF2080 Lecture :: 15th February 2 / 20 consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! INF2080 Lecture :: 15th February 3 / 20 this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S INF2080 Lecture :: 15th February 3 / 20 but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value.
- 
												  Automata and GrammarsAutomata and Grammars Prof. Dr. Friedrich Otto Fachbereich Elektrotechnik/Informatik, Universität Kassel 34109 Kassel, Germany E-mail: [email protected] SS 2018 Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 1 / 294 Lectures and Seminary SS 2018 Lectures: Thursday 9:00 - 10:30, Room S 11 Start: Thursday, February 22, 2018, 9:00. Seminary: Thursday 10:40 - 12:10, Room S 11 Start: Thursday, February 22, 2018, 10:40. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 2 / 294 Exercises: From Thursday to Thursday next week! To pass seminary: At least two successful presentations in class (10 %) and passing of midtem exam on April 5 (10:40 - 11:40) (20 %) Final Exam: (dates have been fixed!) Written exam (2 hours) on May 31 (9:00 - 11:30) in S 11 Oral exam (30 minutes per student) on June 14 (9:00 - 12:30) in S 10 Written exam (2. try) on June 21 (9:00 - 11:30) in S 11 Oral exam (2. try) on June 28 (9:00 - 12:30) in S 11 The written exam accounts for 50 % of the final grade, while the oral exam accounts for 20 % of the final grade. Moodle: Course ”Automata and Grammars” NTIN071. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 3 / 294 Literature: P.J. Denning, J.E. Dennis, J.E. Qualitz; Machines, Languages, and Computation. Prentice-Hall, Englewood Cliffs, N.J., 1978. M. Harrison; Introduction to Formal Language Theory. Addison-Wesley, Reading, M.A., 1978. J.E. Hopcroft, R. Motwani, J.D. Ullman; Introduction to Automata Theory, Languages, and Computation.
- 
												  Languages Generated by Context-Free and Type AB → BA RulesMagyar Kutatók 8. Nemzetközi Szimpóziuma 8th International Symposium of Hungarian Researchers on Computational Intelligence and Informatics Languages Generated by Context-Free and Type AB → BA Rules Benedek Nagy Faculty of Informatics, University of Debrecen, H-4010 PO Box 12, Hungary, Research Group on Mathematical Linguistics, Rovira i Virgili University, Tarragona, Spain [email protected] Abstract: Derivations using branch-interchanging and language family obtained by context-free and interchange (AB → BA) rules are analysed. This language family is between the context-free and context-sensitive families helping to fill the gap between them. Closure properties are analysed. Only semi-linear languages can be generated in this way. Keywords: formal languages, Chomsky hierarchy, derivation trees, interchange (permutation) rule, semi-linear languages, mildly context-sensitivity 1 Introduction The Chomsky type grammars and the generated language families are one of the most basic and most important fields of theoretical computer science. The field is fairly old, the basic concepts and results are from the middle of the last century (see, for instance, [2, 6, 7]). The context-free grammars (and languages) are widely used due to their generating power and simple way of derivation. The derivation trees represent the context-free derivations. There is a big gap between the efficiency of context-free and context-sensitive grammars. There are very ‘simple’ non-context-free languages, for instance {an^2 |n ∈ N}, {anbncn|n ∈ N}, etc. It was known in the early 70’s that every context-sensitive language can be generated by rules only of the following types AB → AC, AB → BA, A → BC, A → B and A → a (where A,B,C are nonterminals and a is a terminal symbol).
- 
												  Context-Sensitive Grammars and Linear- Bounded AutomataI. J. Computer Network and Information Security, 2016, 1, 61-66 Published Online January 2016 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijcnis.2016.01.08 Context-Sensitive Grammars and Linear- Bounded Automata Prem Nath The Patent Office, CP-2, Sector-5, Salt Lake, Kolkata-700091, India Email: [email protected] Abstract—Linear-bounded automata (LBA) accept The head moves left or right on the tape utilizing finite context-sensitive languages (CSLs) and CSLs are number of states. A finite contiguous portion of the tape generated by context-sensitive grammars (CSGs). So, for whose length is a linear function of the length of the every CSG/CSL there is a LBA. A CSG is converted into initial input can be accessed by the read/write head. This normal form like Kuroda normal form (KNF) and then limitation makes LBA a more accurate model of corresponding LBA is designed. There is no algorithm or computers that actually exists than a Turing machine in theorem for designing a linear-bounded automaton (LBA) some respects. for a context-sensitive grammar without converting the LBA are designed to use limited storage (limited tape). grammar into some kind of normal form like Kuroda So, as a safety feature, we shall employ end markers normal form (KNF). I have proposed an algorithm for ($ on the left and on the right) on tape and the head this purpose which does not require any modification or never goes past these markers. This will ensure that the normalization of a CSG. storage bound is maintained and helps to keep LBA from leaving their tape.
- 
												  Appendix8 Shape Grammars in Chapter 9 There Is a Reference to Possible Applications of Shape Grammars. One Such Application ConcAppendix8 Shape Grammars In Chapter 9 there is a reference to possible applications of shape grammars. One such application concerns artificial biological materials constructed from a finite number of specially created (stable) Holliday type moleculesb. Shape grammars might be of practical value in a number of other applications as well. A shape grammar is defined as follows. SG = (VM, VT, %, I), where VT is a finite set of terminal symbols VM is a finite set of marker symbols (that act like non-terminals) I e Vj V^ VJ is the start symbol. The production rules of the shape grammar % are phrase structured, and the rules may be applied serially, or in parallel (the same as with Lindenmeyer systems). % is composed of rules of the form u => v, where u is a mix of Vf with V^ , and v is a mix of Vj with V^. Objects in VT and VM may have any location, orientation, or scaling (think in terms of location in a Cartesian grid or some other coordinate system, rotation, and size), in two or more dimensions. As an example, VM The marker disks may be at any one of the four branches, and marks an active area. In the example that follows, these marked active areas are where ligases may act. In the following rule for /£, the four cruciform symbols on the right of the production rule are of the same size, none are rotated, and all four have the locations indicated. J. Gips, Shape Grammars and Their Uses: Artificial Perception, Shape Generation and Computer Aesthetics, 1975, Birkhftuser.
- 
												  INF2080 Context-Sensitive LangugaesINF2080 Context-Sensitive Langugaes Daniel Lupp Universitetet i Oslo 24th February 2016 Department of University of Informatics Oslo INF2080 Lecture :: 24th February 1 / 20 Context-Free Grammar Definition (Context-Free Grammar) A context-free grammar is a 4-tuple (V ; Σ; R; S) where 1 V is a finite set of variables 2 Σ is a finite set disjoint from V of terminals 3 R is a finite set of rules, each consisting of a variable and of a string of variables and terminals 4 and S is the start variable Rules are of the form A ! B1B2B3 ::: Bm, where A 2 V and each Bi 2 V [ Σ. INF2080 Lecture :: 24th February 2 / 20 Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! INF2080 Lecture :: 24th February 3 / 20 Context-Sensitive Languages Some believe natural languages reside in the class of context-sensitive languages, though this is a controversial topic amongst linguists. But many characteristics of natural languages (e.g., verb-noun agreement) are context-sensitive! INF2080 Lecture :: 24th February 4 / 20 Context-Sensitive Grammars So, instead of allowing for a single variable on the left-hand side of a rule, we allow for a context: αBγ ! αβγ (1) with α; β; γ 2 (V [ Σ)∗, but β 6= ".
- 
												  An Introduction to the Theory of Formal Languages and AutomataJANUA LINGUARUM STUDIA MEMORIAE NICOLAI VAN WIJK DEDICATA edenda curat C. H. VAN SCHOONEVELD Indiana University Series Minor, 192/1 FORMAL GRAMMARS IN LINGUISTICS AND PSYCHOLINGUISTICS VOLUME I An Introduction to the Theory of Formal Languages and Automata by W. J. M. LEVELT 1974 MOUTON THE HAGUE • PARIS © Copyright 1974 in The Netherlands Mouton & Co. N. V., Publishers, The Hague No part of this book may be translated or reproduced in any form, by print, photoprint, microfilm, or any other means, without written permission from the publishers Translation: ANDREW BARNAS Printed in Belgium by N.I.C.I., Ghent PREFACE In the latter half of the 1950's, Noam Chomsky began to develop mathematical models for the description of natural languages. Two disciplines originated in his work and have grown to maturity. The first of these is the theory of formal grammars, a branch of mathematics which has proven to be of great interest to informa tion and computer sciences. The second is generative, or more specifically, transformational linguistics. Although these disciplines are independent and develop each according to its own aims and criteria, they remain closely interwoven. Without access to the theory of formal languages, for example, the contemporary study of the foundations of linguistics would be unthinkable. The collaboration of Chomsky and the psycholinguist, George Miller, around 1960 led to a considerable impact of transforma tional linguistics on the psychology of language. During a period of near feverish experimental activity, psycholinguists studied the various ways in which the new linguistic notions might be used in the development of models for language user and language acquisi tion.
- 
												  Normal Forms with Ap- Plications Obecné Gramatiky: Normální Formy a Jejich AplikaceBRNO UNIVERSITY OF TECHNOLOGY VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ FACULTY OF INFORMATION TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ DEPARTMENT OF INFORMATION SYSTEMS ÚSTAV INFORMAČNÍCH SYSTÉMŮ GENERAL GRAMMARS: NORMAL FORMS WITH AP- PLICATIONS OBECNÉ GRAMATIKY: NORMÁLNÍ FORMY A JEJICH APLIKACE BACHELOR’S THESIS BAKALÁŘSKÁ PRÁCE AUTHOR DOMINIKA KLOBUČNÍKOVÁ AUTOR PRÁCE SUPERVISOR Prof. RNDr. ALEXANDER MEDUNA, CSc. VEDOUCÍ PRÁCE BRNO 2017 Abstract This thesis deals with the topic of unrestricted grammars, normal forms, and their applica- tions. It focuses on context-sensitive grammars as their special cases. Based on the analysis of the set, an algorithm was designed using the principles of the Cocke-Younger-Kasami algorithm to make a decision of whether an input string is a sentence of a context-sensitive grammar. The final application, which implements this algorithm, works with context- sensitive grammars in the Penttonen normal form. Abstrakt Táto práca sa zaoberá problematikou obecných gramatík, normálnych foriem a ich apliká- cií. Zameriava sa na kontextové gramatiky ako ich špeciálne prípady. Na základe analýzy tejto množiny bol navrhnutý algoritmus využívajúci princípy Cocke-Younger-Kasami algo- ritmu za účelom rozhodnutia, či zadaný reťazec je vetou jazyka definovaného kontextovou gramatikou. Výsledná aplikácia implementujúca toto riešenie je navrhnutá pre prácu s kon- textovými gramatikami v Penttonenovej normálnej forme. Keywords formal languages, grammars, syntax analysis, normal forms, Kuroda, Penttonen, CYK, CKY, Cocke-Younger-Kasami Klíčová slova formálne jazyky, gramatiky, syntaktická analýza, normálne formy, Kuroda, Penttonen, CYK, CKY, Cocke-Younger-Kasami Reference KLOBUČNÍKOVÁ, Dominika. General Grammars: Normal Forms with Applications. Brno, 2017. Bachelor’s thesis. Brno University of Technology, Faculty of Information Technology. Supervisor Prof. RNDr.
- 
												  Chapter 2 Syntax-Based Machine TranslationDesign of a Non-ambiguous Predictive Parser for BangIa Natural Language Sentence with Error Rec!,very Capability by Fazle Elahi Faisal Roll No: 040505037P A thesis submitted for the partial fulfillment of the requirement for the degree of Master of Science in Computer Science and Engineering (M. Sc. Engg.) MASTER OF SCIENCE IN COMPUTER SCIENCE AND ENGINEERING • -, . Department of Computer Science and Engineering (CSE) BANGLADESH UNIVERSITY OF ENGINEERING AND TECHNOLOGY (BUET) Dhaka - 1000, Bangladesh December, 2008 1" < G The thesis titled "Design of a Non-ambiguous Predictive Parser for BangIa Natural Language Sentence with Error Recovery Capability", submitted by Fazle Elahi Faisal, Roll No: 040505037P, Session: April 2005, has been accepted as satisfactory in partial fulfillment of the requirement for the degree of Master of Science in Computer Science and Engineering (M. Sc. Engg.) on December 31, 2008. BOARD OF EXAMINERS 1. Dr. Muhammad Masroor Ali Chairman Professor (Supervisor) Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka - 1000, Bangladesh 9xw-~ 2. Dr'MClSaidur Rahman Member Professor and Head (Ex-officio) Department of Computer Science and Engineering Bangladesh University of Engineering and Technology. Dhaka - 1000, Bangladesh I~_' 3. Dr. M. Kaykobad Member Professor Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka ~ 1000, Bangladesh 4. D . Md. Humayun Kabir Member Associate Professor Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka - 1000, Bangladesh ~ 5. Dr. Mohammad Zahidur Rahman Member Associate Professor (External) D Jahangirnagar University Savar, Dhaka, Bangladesh CANDIDATE'S DECLARATION It is hereby declared that this thesis or any part of it has not been submitted elsewhere for the award if any degree or diploma.
- 
												  Chicago Journal of Theoretical Computer Science the MIT PressChicago Journal of Theoretical Computer Science The MIT Press Volume 1996, Article 4 13 November 1996 ISSN 1073–0486. MIT Press Journals, 55 Hayward St., Cambridge, MA 02142 USA; (617)253-2889; [email protected], [email protected]. Published one article at a time in LATEX source form on the Internet. Pag- ination varies from copy to copy. For more information and other articles see: http://www-mitpress.mit.edu/jrnls-catalog/chicago.html • http://www.cs.uchicago.edu/publications/cjtcs/ • gopher.mit.edu • gopher.cs.uchicago.edu • anonymous ftp at mitpress.mit.edu • anonymous ftp at cs.uchicago.edu • Buntrock and Niemann Weakly Growing CSGs (Info) The Chicago Journal of Theoretical Computer Science is abstracted or in- R R R dexed in Research Alert, SciSearch, Current Contents /Engineering Com- R puting & Technology, and CompuMath Citation Index. c 1996 The Massachusetts Institute of Technology. Subscribers are licensed to use journal articles in a variety of ways, limited only as required to insure fair attribution to authors and the journal, and to prohibit use in a competing commercial product. See the journal’s World Wide Web site for further details. Address inquiries to the Subsidiary Rights Manager, MIT Press Journals; (617)253-2864; [email protected]. The Chicago Journal of Theoretical Computer Science is a peer-reviewed scholarly journal in theoretical computer science. The journal is committed to providing a forum for significant results on theoretical aspects of all topics in computer science. Editor in chief: Janos
- 
												  Context-Sensitive Languages and Linear Bounded AutomataContext-sensitive Languages and Linear Bounded Automata Josh Bax Andre Nies, Supervisor November 15, 2010 Contents 1 Definitions of Context-Sensitive Languages 2 1.1 Introduction . 2 1.2 First Definition of Context Sensitive Languages . 2 1.3 The Chomsky Hierarchy . 4 1.4 Second Equivalent Definition of Context Sensitive Languages . 6 1.5 Third Equivalent Definition of Context Sensitive Languages . 6 2 Properties of Context-Sensitive Languages 10 2.1 Introduction . 10 2.2 Basic closure theorems . 10 2.3 Immerman-Szelepcs´enyi theorem . 12 2.4 Space Complexity . 15 2.5 Computability . 15 2.6 Conclusion . 17 3 References and Acknowledgements 19 1 Definitions of Context-Sensitive Languages 2 1 Definitions of Context-Sensitive Languages 1.1 Introduction The first complete description of the context-sensitive languages (CSLs) was developed by Noam Chomsky in the 1950's as a tool for linguistic analysis. A context-sensitive language is defined by a set of rewriting rules (called a grammar) involving symbols from a given finite set of symbols. These rules take a symbol from one distinct set, the variables, and replace it with one or more symbols from another set, the terminal symbols. The application of these rules depends on the symbols preceding and following the variable symbol, hence they are called context-sensitive. There are other forms of grammar: regular, context-free and unrestricted, classified according to the form of their productions. This classification was made by Chomsky and now forms what is known as the Chomsky hierarchy[1]. Within this hierarchy, the class of context-sensitive languages contains the class of context-free languages, and is itself contained in the recursively enumerable languages.