Automata and Grammars

Total Page:16

File Type:pdf, Size:1020Kb

Automata and Grammars Automata and Grammars Prof. Dr. Friedrich Otto Fachbereich Elektrotechnik/Informatik, Universität Kassel 34109 Kassel, Germany E-mail: [email protected] SS 2018 Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 1 / 294 Lectures and Seminary SS 2018 Lectures: Thursday 9:00 - 10:30, Room S 11 Start: Thursday, February 22, 2018, 9:00. Seminary: Thursday 10:40 - 12:10, Room S 11 Start: Thursday, February 22, 2018, 10:40. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 2 / 294 Exercises: From Thursday to Thursday next week! To pass seminary: At least two successful presentations in class (10 %) and passing of midtem exam on April 5 (10:40 - 11:40) (20 %) Final Exam: (dates have been fixed!) Written exam (2 hours) on May 31 (9:00 - 11:30) in S 11 Oral exam (30 minutes per student) on June 14 (9:00 - 12:30) in S 10 Written exam (2. try) on June 21 (9:00 - 11:30) in S 11 Oral exam (2. try) on June 28 (9:00 - 12:30) in S 11 The written exam accounts for 50 % of the final grade, while the oral exam accounts for 20 % of the final grade. Moodle: Course ”Automata and Grammars” NTIN071. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 3 / 294 Literature: P.J. Denning, J.E. Dennis, J.E. Qualitz; Machines, Languages, and Computation. Prentice-Hall, Englewood Cliffs, N.J., 1978. M. Harrison; Introduction to Formal Language Theory. Addison-Wesley, Reading, M.A., 1978. J.E. Hopcroft, R. Motwani, J.D. Ullman; Introduction to Automata Theory, Languages, and Computation. Addison-Wesley, Boston, 2nd. ed., 2001. H.R. Lewis, Ch.H. Papadimitriou; Elements of the Theory of Computation. Prentice Hall, Englewood Cliffs, N.J., 1981. G. Rozenberg, A. Salomaa (editors); Handbook of Formal Languages, Vol.1: Word, Language, Grammar. Springer, Heidelberg, 1997. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 4 / 294 1. Introduction 1. Introduction Formal Languages (not natural languages): A formal language is a set of (finite) sequences of symbols (words, strings) from a fixed finite set of symbols (alphabet). How to describe a formal language? There are various options: - by a complete enumeration, but: L = f w 2 fa; b; cg∗ j jwj = 20 g : jLj = 320 ∼ 3:5 × 109 - by a mathematical expression, but: how to check that a given word belongs to the language described? - by a generative device: a grammar or a rewriting system. A grammar tells you how to generate (all) the words of a language! - by an analytical device: an automaton or an algorithm. An automaton recognizes exactly the words of a given language! Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 5 / 294 1. Introduction The Roots of the Theory of Formal Languages: - Combinatorics on Words (A. Thue 1906, 1912) - Semigroup and Group Theory (M. Dehn 1911) - Logic (A. Turing 1926, E. Post 1936, A. Church 1936) Church’s Thesis A function is effectively computable iff there is a Turing machine that computes this function. Let f :Σ∗ ; Γ∗ be a (partial) function. dom(f ) = f u 2 Σ∗ j f (u) is defined g : domain range(f ) = f v 2 Γ∗ j 9u 2 Σ∗ : f (u) = v g : range ker(f ) = f u 2 Σ∗ j f (u) = " g : kernel graph(f ) = f u#f (u) j u 2 dom(f ) g : graph f is computable iff there is an “algorithm” that accepts the language graph(f ). Classes of Automata: Restrictions of the Turing machine Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 6 / 294 1. Introduction - Linguistics (N. Chomsky 1956) : phrase structure grammars - Biology (A. Lindenmayer 1968) : L-systems (T. Head 1987) : DNA-computing, H-systems (G. Paun 1999) : Membrane computing, P-systems Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 7 / 294 1. Introduction Overview: Chapter 2: Regular Languages and Finite Automata Chapter 3: Context-free Languages and Pushdown Automata Chapter 4: Turing Machines and Recursively Enumerable and Context-Sensitive Languages Chapter 5: Summary Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 8 / 294 2. Regular Languages and Finite Automata Chapter 2: Regular Languages and Finite Automata Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 9 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms 2.1. Words, Languages, and Morphisms An alphabet Σ is a finite set of symbols or letters. For all n 2 N :Σn = fu :[1; n] ! Σg: set of words of length n over Σ, that is, u = (u(1); u(2);:::; u(n)) = u1u2 ::: un. Σ0 = f"g : " = empty word Σ+ = S Σn : set of all non-empty words over Σ n≥1 Σ∗ = S Σn : set of all words over Σ n≥0 The concatenation · :Σ∗ × Σ∗ ! Σ∗ is defined through u · v = uv. For u 2 Σm and v 2 Σn, uv 2 Σm+n. ∗ The length function j:j :Σ ! N is defined through juj = n for all u 2 Σn; n ≥ 0. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 10 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms Lemma 2.1 (a) The operation of concatenation is associative, that is, (u · v) · w = u · (v · w): (b) For all u 2 Σ∗, u · " = " · u = u. (c) If u · v = u · w, then v = w (Left cancellability). (d) If u · w = v · w, then u = v (Right cancellability). (e) If u · v = x · y, then exactly one of the following cases holds: (1) juj = jxj, u = x, and v = y. (2) juj > jxj, and there exists z 2 Σ+ such that u = x · z and y = z · v. (3) juj < jxj, and there exists z 2 Σ+ such that x = u · z and v = z · y. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 11 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms Abbreviations: uv stands for u · v. u0 = "; u1 = u; un+1 = unu for all u 2 Σ∗; n ≥ 1. Further Basic Notions: The mirror function R :Σ∗ ! Σ∗ is defined through "R = "; (ua)R = auR for all u 2 Σ∗; a 2 Σ: Hence, (abbc)R = c(abb)R = cb(ab)R = cbba. If uv = w, then u is a prefix and v is a suffix of w. If u 6= " (v 6= "), then v is a proper suffix (proper prefix) of w. If uvz = w, then v is a factor of w. It is a proper factor, if uz 6= ". Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 12 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms A language L over Σ is a subset L ⊆ Σ∗. The cardinality of L is denoted by jLj. ∗ Let L; L1; L2 ⊆ Σ . L1 · L2 = f uv j u 2 L1; v 2 L2 g is the product of L1 and L2. It is also called the concatenation of L1 and L2. L0 = f"g; L1 = L; Ln+1 = Ln · L for all (n ≥ 1): L+ = S Ln and L∗ = S Ln are the plus closure and the star closure n≥1 n≥0 (Kleene closure) of L. LR = f w R j w 2 L g is the mirror language of L. Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 13 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms Example: ∗ (a) Σ1 := f0; 1g : "; 0; 10; 110 2 Σ1. i L1 = f 10 j i ≥ 0 g: R j R i L1 = f 0 1 j j ≥ 0 g und L1 · L1 = f 10 1 j i ≥ 0 g. n n L2 = f 0 1 j n ≥ 1 g. (b) Σ2 := fa; b; cg : R ∗ L3 = f wcw j w 2 fa; bg g: marked palindromes of even length ∗ L4 = f ww j w 2 fa; bg g: copy language L5 = f w j jwja ≡ 0 mod 2 and jwjb ≡ 1 mod 3 g. + (c) Σ3 := f0; 1;:::; 9; :; +; −; eg : 112:45; −23e + 17 2 Σ3 L6 = funsigned integer in PASCALg Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 14 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms Let Σ and ∆ be two alphabets. A mapping h :Σ∗ ! ∆∗ is a morphism, if the equality h(uv) = h(u) · h(v) holds for all u; v 2 Σ∗. Lemma 2.2 If h :Σ∗ ! ∆∗ is a morphism, then h(") = ". For a set S, 2S denotes the power set of S. ∗ A mapping ' :Σ∗ ! 2∆ is a substitution, if the two following conditions are satisfied: – 8u; v 2 Σ∗ : '(uv) = '(u) · '(v). – '(") = f"g. For L ⊆ Σ∗, h(L) = f h(w) j w 2 L g and '(L) = S '(w). w2L Prof. Dr. F. Otto (Universität Kassel) Automata and Grammars 15 / 294 2. Regular Languages and Finite Automata 2.1. Words, Languages, and Morphisms Example: Let Σ = fa; b; cg and ∆ = f0; 1g. If h :Σ∗ ! ∆∗ is defined through a 7! 01; b 7! 1; c 7! ", then h(baca) = 10101. ∗ If ' :Σ∗ ! 2∆ is defined through a 7! f01; 001g; b 7! f1i j i ≥ 1g; c 7! f"g; then '(baca) = f 1i 0101; 1i 01001; 1i 00101; 1i 001001 j i ≥ 1 g. A morphism h is called "-free, if h(a) 6= " for all a 2 Σ. A substitution ' is called "-free, if " 62 '(a) for all a 2 Σ. A substitution ' is called finite, if '(a) is a finite set for all a 2 Σ. ∗ h−1 : ∆∗ ! 2Σ is defined through h−1(v) = f u 2 Σ∗ j h(u) = v g.
Recommended publications
  • Formal Languages and Automata Theory Géza Horváth, Benedek Nagy Szerzői Jog © 2014 Géza Horváth, Benedek Nagy, University of Debrecen
    Formal Languages and Automata Theory Géza Horváth, Benedek Nagy Created by XMLmind XSL-FO Converter. Formal Languages and Automata Theory Géza Horváth, Benedek Nagy Szerzői jog © 2014 Géza Horváth, Benedek Nagy, University of Debrecen Created by XMLmind XSL-FO Converter. Tartalom Formal Languages and Automata Theory .......................................................................................... vi Introduction ...................................................................................................................................... vii 1. Elements of Formal Languages ...................................................................................................... 1 1. 1.1. Basic Terms .................................................................................................................... 1 2. 1.2. Formal Systems .............................................................................................................. 2 3. 1.3. Generative Grammars .................................................................................................... 5 4. 1.4. Chomsky Hierarchy ....................................................................................................... 6 2. Regular Languages and Finite Automata ...................................................................................... 12 1. 2.1. Regular Grammars ....................................................................................................... 12 2. 2.2. Regular Expressions ....................................................................................................
    [Show full text]
  • INF2080 Context-Sensitive Langugaes
    INF2080 Context-Sensitive Langugaes Daniel Lupp Universitetet i Oslo 15th February 2017 Department of University of Informatics Oslo INF2080 Lecture :: 15th February 1 / 20 Context-Free Grammar Definition (Context-Free Grammar) A context-free grammar is a 4-tuple (V ; Σ; R; S) where 1 V is a finite set of variables 2 Σ is a finite set disjoint from V of terminals 3 R is a finite set of rules, each consisting of a variable and of a string of variables and terminals 4 and S is the start variable Rules are of the form A ! B1B2B3 ::: Bm, where A 2 V and each Bi 2 V [ Σ. INF2080 Lecture :: 15th February 2 / 20 consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! INF2080 Lecture :: 15th February 3 / 20 this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S INF2080 Lecture :: 15th February 3 / 20 but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value.
    [Show full text]
  • Languages Generated by Context-Free and Type AB → BA Rules
    Magyar Kutatók 8. Nemzetközi Szimpóziuma 8th International Symposium of Hungarian Researchers on Computational Intelligence and Informatics Languages Generated by Context-Free and Type AB → BA Rules Benedek Nagy Faculty of Informatics, University of Debrecen, H-4010 PO Box 12, Hungary, Research Group on Mathematical Linguistics, Rovira i Virgili University, Tarragona, Spain [email protected] Abstract: Derivations using branch-interchanging and language family obtained by context-free and interchange (AB → BA) rules are analysed. This language family is between the context-free and context-sensitive families helping to fill the gap between them. Closure properties are analysed. Only semi-linear languages can be generated in this way. Keywords: formal languages, Chomsky hierarchy, derivation trees, interchange (permutation) rule, semi-linear languages, mildly context-sensitivity 1 Introduction The Chomsky type grammars and the generated language families are one of the most basic and most important fields of theoretical computer science. The field is fairly old, the basic concepts and results are from the middle of the last century (see, for instance, [2, 6, 7]). The context-free grammars (and languages) are widely used due to their generating power and simple way of derivation. The derivation trees represent the context-free derivations. There is a big gap between the efficiency of context-free and context-sensitive grammars. There are very ‘simple’ non-context-free languages, for instance {an^2 |n ∈ N}, {anbncn|n ∈ N}, etc. It was known in the early 70’s that every context-sensitive language can be generated by rules only of the following types AB → AC, AB → BA, A → BC, A → B and A → a (where A,B,C are nonterminals and a is a terminal symbol).
    [Show full text]
  • CDM [1Ex]Context-Sensitive Grammars
    CDM Context-Sensitive Grammars Klaus Sutner Carnegie Mellon Universality 70-cont-sens 2017/12/15 23:17 1 Context-Sensitive Grammars Linear Bounded Automata LBA and Counting Where Are We? 3 Context-free languages based on grammars with productions A ! α are very important since they describe many aspects of programming languages and admit very efficient parsers. CFLs have a natural machine model (PDA) that is useful e.g. to evaluate arithmetic expressions. Properties of CFLs are mostly undecidable, Emptiness and Finiteness being notable exceptions. Postfix Calculators 4 Hewlett-Packard figured out 40 years ago the reverse Polish notation is by far the best way to perform lengthy arithmetic calculations. Very easy to implement with a stack. The old warhorse dc also uses RPN. 10 20 30 + * n 500 10 20 30 f 30 20 10 ^ n 1073741824000000000000000000000000000000 Languages versus Machines 5 Why the hedging about \aspects of programming languages"? Because some properties of programming language lie beyond the power of CFLs. Here is a classical example: variables must be declared before they can be used. begin int x; ... x = 17; ... end Obviously, we would want the compiler to address this kind of situation. Languages versus Machines 6 To deal with problems like this one we need to strengthen our grammars. The key is to remove the constraint of being \context-free." This leads to another important grammar based class of languages: context-sensitive languages. As it turns out, CSL are plenty strong enough to describe programming languages|but in the real world it does not matter, it is better to think of programming language as being context-free, plus a few extra constraints.
    [Show full text]
  • Context-Sensitive Grammars and Linear- Bounded Automata
    I. J. Computer Network and Information Security, 2016, 1, 61-66 Published Online January 2016 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijcnis.2016.01.08 Context-Sensitive Grammars and Linear- Bounded Automata Prem Nath The Patent Office, CP-2, Sector-5, Salt Lake, Kolkata-700091, India Email: [email protected] Abstract—Linear-bounded automata (LBA) accept The head moves left or right on the tape utilizing finite context-sensitive languages (CSLs) and CSLs are number of states. A finite contiguous portion of the tape generated by context-sensitive grammars (CSGs). So, for whose length is a linear function of the length of the every CSG/CSL there is a LBA. A CSG is converted into initial input can be accessed by the read/write head. This normal form like Kuroda normal form (KNF) and then limitation makes LBA a more accurate model of corresponding LBA is designed. There is no algorithm or computers that actually exists than a Turing machine in theorem for designing a linear-bounded automaton (LBA) some respects. for a context-sensitive grammar without converting the LBA are designed to use limited storage (limited tape). grammar into some kind of normal form like Kuroda So, as a safety feature, we shall employ end markers normal form (KNF). I have proposed an algorithm for ($ on the left and on the right) on tape and the head this purpose which does not require any modification or never goes past these markers. This will ensure that the normalization of a CSG. storage bound is maintained and helps to keep LBA from leaving their tape.
    [Show full text]
  • Appendix8 Shape Grammars in Chapter 9 There Is a Reference to Possible Applications of Shape Grammars. One Such Application Conc
    Appendix8 Shape Grammars In Chapter 9 there is a reference to possible applications of shape grammars. One such application concerns artificial biological materials constructed from a finite number of specially created (stable) Holliday type moleculesb. Shape grammars might be of practical value in a number of other applications as well. A shape grammar is defined as follows. SG = (VM, VT, %, I), where VT is a finite set of terminal symbols VM is a finite set of marker symbols (that act like non-terminals) I e Vj V^ VJ is the start symbol. The production rules of the shape grammar % are phrase structured, and the rules may be applied serially, or in parallel (the same as with Lindenmeyer systems). % is composed of rules of the form u => v, where u is a mix of Vf with V^ , and v is a mix of Vj with V^. Objects in VT and VM may have any location, orientation, or scaling (think in terms of location in a Cartesian grid or some other coordinate system, rotation, and size), in two or more dimensions. As an example, VM The marker disks may be at any one of the four branches, and marks an active area. In the example that follows, these marked active areas are where ligases may act. In the following rule for /£, the four cruciform symbols on the right of the production rule are of the same size, none are rotated, and all four have the locations indicated. J. Gips, Shape Grammars and Their Uses: Artificial Perception, Shape Generation and Computer Aesthetics, 1975, Birkhftuser.
    [Show full text]
  • INF2080 Context-Sensitive Langugaes
    INF2080 Context-Sensitive Langugaes Daniel Lupp Universitetet i Oslo 24th February 2016 Department of University of Informatics Oslo INF2080 Lecture :: 24th February 1 / 20 Context-Free Grammar Definition (Context-Free Grammar) A context-free grammar is a 4-tuple (V ; Σ; R; S) where 1 V is a finite set of variables 2 Σ is a finite set disjoint from V of terminals 3 R is a finite set of rules, each consisting of a variable and of a string of variables and terminals 4 and S is the start variable Rules are of the form A ! B1B2B3 ::: Bm, where A 2 V and each Bi 2 V [ Σ. INF2080 Lecture :: 24th February 2 / 20 Why Context-Sensitive? Many building blocks of programming languages are context-free, but not all! consider the following toy programming language, where you can “declare” and “assign” a variable a value. S ! declare v; S j assign v : x; S this is context-free... but what if we only want to allow assignment after declaration and an infinite amount of variable names? ! context-sensitive! INF2080 Lecture :: 24th February 3 / 20 Context-Sensitive Languages Some believe natural languages reside in the class of context-sensitive languages, though this is a controversial topic amongst linguists. But many characteristics of natural languages (e.g., verb-noun agreement) are context-sensitive! INF2080 Lecture :: 24th February 4 / 20 Context-Sensitive Grammars So, instead of allowing for a single variable on the left-hand side of a rule, we allow for a context: αBγ ! αβγ (1) with α; β; γ 2 (V [ Σ)∗, but β 6= ".
    [Show full text]
  • An Introduction to the Theory of Formal Languages and Automata
    JANUA LINGUARUM STUDIA MEMORIAE NICOLAI VAN WIJK DEDICATA edenda curat C. H. VAN SCHOONEVELD Indiana University Series Minor, 192/1 FORMAL GRAMMARS IN LINGUISTICS AND PSYCHOLINGUISTICS VOLUME I An Introduction to the Theory of Formal Languages and Automata by W. J. M. LEVELT 1974 MOUTON THE HAGUE • PARIS © Copyright 1974 in The Netherlands Mouton & Co. N. V., Publishers, The Hague No part of this book may be translated or reproduced in any form, by print, photoprint, microfilm, or any other means, without written permission from the publishers Translation: ANDREW BARNAS Printed in Belgium by N.I.C.I., Ghent PREFACE In the latter half of the 1950's, Noam Chomsky began to develop mathematical models for the description of natural languages. Two disciplines originated in his work and have grown to maturity. The first of these is the theory of formal grammars, a branch of mathematics which has proven to be of great interest to informa­ tion and computer sciences. The second is generative, or more specifically, transformational linguistics. Although these disciplines are independent and develop each according to its own aims and criteria, they remain closely interwoven. Without access to the theory of formal languages, for example, the contemporary study of the foundations of linguistics would be unthinkable. The collaboration of Chomsky and the psycholinguist, George Miller, around 1960 led to a considerable impact of transforma­ tional linguistics on the psychology of language. During a period of near feverish experimental activity, psycholinguists studied the various ways in which the new linguistic notions might be used in the development of models for language user and language acquisi­ tion.
    [Show full text]
  • Normal Forms with Ap- Plications Obecné Gramatiky: Normální Formy a Jejich Aplikace
    BRNO UNIVERSITY OF TECHNOLOGY VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ FACULTY OF INFORMATION TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ DEPARTMENT OF INFORMATION SYSTEMS ÚSTAV INFORMAČNÍCH SYSTÉMŮ GENERAL GRAMMARS: NORMAL FORMS WITH AP- PLICATIONS OBECNÉ GRAMATIKY: NORMÁLNÍ FORMY A JEJICH APLIKACE BACHELOR’S THESIS BAKALÁŘSKÁ PRÁCE AUTHOR DOMINIKA KLOBUČNÍKOVÁ AUTOR PRÁCE SUPERVISOR Prof. RNDr. ALEXANDER MEDUNA, CSc. VEDOUCÍ PRÁCE BRNO 2017 Abstract This thesis deals with the topic of unrestricted grammars, normal forms, and their applica- tions. It focuses on context-sensitive grammars as their special cases. Based on the analysis of the set, an algorithm was designed using the principles of the Cocke-Younger-Kasami algorithm to make a decision of whether an input string is a sentence of a context-sensitive grammar. The final application, which implements this algorithm, works with context- sensitive grammars in the Penttonen normal form. Abstrakt Táto práca sa zaoberá problematikou obecných gramatík, normálnych foriem a ich apliká- cií. Zameriava sa na kontextové gramatiky ako ich špeciálne prípady. Na základe analýzy tejto množiny bol navrhnutý algoritmus využívajúci princípy Cocke-Younger-Kasami algo- ritmu za účelom rozhodnutia, či zadaný reťazec je vetou jazyka definovaného kontextovou gramatikou. Výsledná aplikácia implementujúca toto riešenie je navrhnutá pre prácu s kon- textovými gramatikami v Penttonenovej normálnej forme. Keywords formal languages, grammars, syntax analysis, normal forms, Kuroda, Penttonen, CYK, CKY, Cocke-Younger-Kasami Klíčová slova formálne jazyky, gramatiky, syntaktická analýza, normálne formy, Kuroda, Penttonen, CYK, CKY, Cocke-Younger-Kasami Reference KLOBUČNÍKOVÁ, Dominika. General Grammars: Normal Forms with Applications. Brno, 2017. Bachelor’s thesis. Brno University of Technology, Faculty of Information Technology. Supervisor Prof. RNDr.
    [Show full text]
  • Chapter 2 Syntax-Based Machine Translation
    Design of a Non-ambiguous Predictive Parser for BangIa Natural Language Sentence with Error Rec!,very Capability by Fazle Elahi Faisal Roll No: 040505037P A thesis submitted for the partial fulfillment of the requirement for the degree of Master of Science in Computer Science and Engineering (M. Sc. Engg.) MASTER OF SCIENCE IN COMPUTER SCIENCE AND ENGINEERING • -, . Department of Computer Science and Engineering (CSE) BANGLADESH UNIVERSITY OF ENGINEERING AND TECHNOLOGY (BUET) Dhaka - 1000, Bangladesh December, 2008 1" < G The thesis titled "Design of a Non-ambiguous Predictive Parser for BangIa Natural Language Sentence with Error Recovery Capability", submitted by Fazle Elahi Faisal, Roll No: 040505037P, Session: April 2005, has been accepted as satisfactory in partial fulfillment of the requirement for the degree of Master of Science in Computer Science and Engineering (M. Sc. Engg.) on December 31, 2008. BOARD OF EXAMINERS 1. Dr. Muhammad Masroor Ali Chairman Professor (Supervisor) Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka - 1000, Bangladesh 9xw-~ 2. Dr'MClSaidur Rahman Member Professor and Head (Ex-officio) Department of Computer Science and Engineering Bangladesh University of Engineering and Technology. Dhaka - 1000, Bangladesh I~_' 3. Dr. M. Kaykobad Member Professor Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka ~ 1000, Bangladesh 4. D . Md. Humayun Kabir Member Associate Professor Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka - 1000, Bangladesh ~ 5. Dr. Mohammad Zahidur Rahman Member Associate Professor (External) D Jahangirnagar University Savar, Dhaka, Bangladesh CANDIDATE'S DECLARATION It is hereby declared that this thesis or any part of it has not been submitted elsewhere for the award if any degree or diploma.
    [Show full text]
  • Chicago Journal of Theoretical Computer Science the MIT Press
    Chicago Journal of Theoretical Computer Science The MIT Press Volume 1996, Article 4 13 November 1996 ISSN 1073–0486. MIT Press Journals, 55 Hayward St., Cambridge, MA 02142 USA; (617)253-2889; [email protected], [email protected]. Published one article at a time in LATEX source form on the Internet. Pag- ination varies from copy to copy. For more information and other articles see: http://www-mitpress.mit.edu/jrnls-catalog/chicago.html • http://www.cs.uchicago.edu/publications/cjtcs/ • gopher.mit.edu • gopher.cs.uchicago.edu • anonymous ftp at mitpress.mit.edu • anonymous ftp at cs.uchicago.edu • Buntrock and Niemann Weakly Growing CSGs (Info) The Chicago Journal of Theoretical Computer Science is abstracted or in- R R R dexed in Research Alert, SciSearch, Current Contents /Engineering Com- R puting & Technology, and CompuMath Citation Index. c 1996 The Massachusetts Institute of Technology. Subscribers are licensed to use journal articles in a variety of ways, limited only as required to insure fair attribution to authors and the journal, and to prohibit use in a competing commercial product. See the journal’s World Wide Web site for further details. Address inquiries to the Subsidiary Rights Manager, MIT Press Journals; (617)253-2864; [email protected]. The Chicago Journal of Theoretical Computer Science is a peer-reviewed scholarly journal in theoretical computer science. The journal is committed to providing a forum for significant results on theoretical aspects of all topics in computer science. Editor in chief: Janos
    [Show full text]
  • Context-Sensitive Languages and Linear Bounded Automata
    Context-sensitive Languages and Linear Bounded Automata Josh Bax Andre Nies, Supervisor November 15, 2010 Contents 1 Definitions of Context-Sensitive Languages 2 1.1 Introduction . 2 1.2 First Definition of Context Sensitive Languages . 2 1.3 The Chomsky Hierarchy . 4 1.4 Second Equivalent Definition of Context Sensitive Languages . 6 1.5 Third Equivalent Definition of Context Sensitive Languages . 6 2 Properties of Context-Sensitive Languages 10 2.1 Introduction . 10 2.2 Basic closure theorems . 10 2.3 Immerman-Szelepcs´enyi theorem . 12 2.4 Space Complexity . 15 2.5 Computability . 15 2.6 Conclusion . 17 3 References and Acknowledgements 19 1 Definitions of Context-Sensitive Languages 2 1 Definitions of Context-Sensitive Languages 1.1 Introduction The first complete description of the context-sensitive languages (CSLs) was developed by Noam Chomsky in the 1950's as a tool for linguistic analysis. A context-sensitive language is defined by a set of rewriting rules (called a grammar) involving symbols from a given finite set of symbols. These rules take a symbol from one distinct set, the variables, and replace it with one or more symbols from another set, the terminal symbols. The application of these rules depends on the symbols preceding and following the variable symbol, hence they are called context-sensitive. There are other forms of grammar: regular, context-free and unrestricted, classified according to the form of their productions. This classification was made by Chomsky and now forms what is known as the Chomsky hierarchy[1]. Within this hierarchy, the class of context-sensitive languages contains the class of context-free languages, and is itself contained in the recursively enumerable languages.
    [Show full text]