INF2080 Context-Sensitive Languages

Daniel Lupp

Universitetet i Oslo

15th February 2017

Department of Informatics, University of Oslo

INF2080 Lecture :: 15th February 1 / 20

Context-Free Grammar

Definition (Context-Free Grammar)
A context-free grammar is a 4-tuple (V, Σ, R, S) where
1. V is a finite set of variables,
2. Σ is a finite set of terminals, disjoint from V,
3. R is a finite set of rules, each consisting of a variable and a string of variables and terminals,
4. and S is the start variable.

Rules are of the form A → B1B2B3 ... Bm, where A ∈ V and each Bi ∈ V ∪ Σ.

Why Context-Sensitive?

Many building blocks of programming languages are context-free, but not all! Consider the following toy programming language, where you can “declare” a variable and “assign” it a value.

S → declare v; S | assign v : x; S

This is context-free... but what if we only want to allow assignment after declaration, with an infinite supply of variable names? → context-sensitive!
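To make the point concrete, here is a small Python sketch (the function name and the exact statement syntax are illustrative assumptions, not from the lecture): it accepts a program only if every assignment is preceded by a declaration of the same variable. Tracking an unbounded set of declared names across statements is exactly the cross-dependency a context-free grammar cannot express.

```python
import re

# Hypothetical checker for the toy language: every "assign v : x" must be
# preceded by "declare v" for the same variable name v.
def well_formed(program: str) -> bool:
    declared = set()
    statements = (s.strip() for s in program.split(";"))
    for stmt in filter(None, statements):
        if m := re.fullmatch(r"declare (\w+)", stmt):
            declared.add(m.group(1))  # remember the declared name
        elif (m := re.fullmatch(r"assign (\w+) : (\w+)", stmt)) and m.group(1) in declared:
            pass  # assignment to an already-declared variable is fine
        else:
            return False  # malformed statement or undeclared variable
    return True
```

For example, `well_formed("declare x; assign x : 1")` holds, while `well_formed("assign x : 1")` does not.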

Context-Sensitive Languages

Some believe natural languages reside in the class of context-sensitive languages, though this is a controversial topic amongst linguists. But many characteristics of natural languages (e.g., verb-noun agreement) are context-sensitive!

Context-Sensitive Grammars

So, instead of allowing for a single variable on the left-hand side of a rule, we allow for a context:

αBγ → αβγ (1)

with B ∈ V and α, β, γ ∈ (V ∪ Σ)∗, but β ≠ ε.

Definition (Context-sensitive grammar)
A context-sensitive grammar is a 4-tuple (V, Σ, R, S) consisting of:
a finite set V of variables;
a finite set Σ of terminals, disjoint from V;
a finite set R of rules of the form (1);
a start variable S ∈ V.
If S does not occur on any right-hand side of a rule in R, we also allow for the rule S → ε in R.

Example

Recall that the language {a^n b^n c^n | n ≥ 1} is not context-free (by the pumping lemma for CFLs). It is, however, context-sensitive! A context-sensitive grammar that produces this language:

S → ABC        B′Z2 → B′C
S → ASB′C      BB′ → BB
CB′ → Z1B′     A → a
Z1B′ → Z1Z2    B → b
Z1Z2 → B′Z2    C → c

Noncontracting Grammars

So CSGs can be quite cumbersome... many rules are needed to encode, e.g., the swapping rule cB → Bc.

Definition (Noncontracting Grammars)
A noncontracting grammar is a set of rules α → β, where α, β ∈ (V ∪ Σ)∗ and |α| ≤ |β|. In addition, it may contain S → ε if S does not occur on any right-hand side of a rule.

Examples:

cC → abABc
ab → de
cB → Bc

Note: none of these rules are context-sensitive!

Noncontracting vs. Context-sensitive Grammars

First note: context-sensitive rules αBγ → αβγ are noncontracting, since we required β ≠ ε. So it would seem that noncontracting grammars are quite a bit more expressive than context-sensitive grammars. After all, α → β is far more general...

Theorem
Every noncontracting grammar can be transformed into a context-sensitive grammar that produces the same language.

So, in the spirit of INF2080’s love of abbreviations: NCG = CSG!

Example

The language {a^n b^n c^n | n ≥ 1} described by a CSG:

S → ABC        B′Z2 → B′C
S → ASB′C      BB′ → BB
CB′ → Z1B′     A → a
Z1B′ → Z1Z2    B → b
Z1Z2 → B′Z2    C → c

Example

The language {a^n b^n c^n | n ≥ 1} described by an NCG:

S → abc
S → aSBc
cB → Bc
bB → bb

Due to the equivalence, some people define context-sensitive languages using noncontracting grammars.
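As a sanity check, this NCG can be executed directly. The following Python sketch (a brute-force helper written for illustration, not from the lecture) applies the rules forward and collects every terminal string up to a length bound; since the grammar is noncontracting, a derivation of a string of length n never passes through a longer sentential form, so pruning at max_len loses nothing.

```python
# The four NCG rules for {a^n b^n c^n | n >= 1} from the slide.
RULES = [("S", "abc"), ("S", "aSBc"), ("cB", "Bc"), ("bB", "bb")]

def generate(rules, start="S", max_len=9):
    """Enumerate all terminal strings of length <= max_len by forward rewriting."""
    terminals = set("abc")
    seen = {start}
    frontier = [start]
    found = set()
    while frontier:
        s = frontier.pop()
        if set(s) <= terminals:      # no variables left: a generated word
            found.add(s)
            continue
        for lhs, rhs in rules:       # apply each rule at every position
            i = s.find(lhs)
            while i != -1:
                t = s[:i] + rhs + s[i + len(lhs):]
                if len(t) <= max_len and t not in seen:
                    seen.add(t)
                    frontier.append(t)
                i = s.find(lhs, i + 1)
    return sorted(found)
```

With the default bound, `generate(RULES)` yields exactly abc, aabbcc and aaabbbccc, as expected.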

Kuroda Normal Form

Similar to CFGs' Chomsky normal form, CSGs have a normal form of their own:

Definition (Kuroda Normal Form)
A noncontracting grammar is in Kuroda normal form if all rules are of the form

AB → CD
A → BC
A → B
A → a

for variables A, B, C, D and terminals a.

Theorem
For every context-sensitive grammar there exists a noncontracting grammar in Kuroda normal form that produces the same language.
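Whether a rule set is in Kuroda normal form is easy to check mechanically. The helper below is an illustrative sketch (names and the grammar encoding are assumptions, not from the lecture): rules are (lhs, rhs) pairs of strings over single-character variables and terminals.

```python
# Check the four rule shapes AB -> CD, A -> BC, A -> B, A -> a.
def is_kuroda(rules, variables, terminals):
    V, T = set(variables), set(terminals)
    for lhs, rhs in rules:
        ok = (
            (len(lhs) == 2 and len(rhs) == 2 and set(lhs) <= V and set(rhs) <= V)  # AB -> CD
            or (len(lhs) == 1 and lhs in V and len(rhs) == 2 and set(rhs) <= V)    # A -> BC
            or (len(lhs) == 1 and lhs in V and len(rhs) == 1 and rhs in V | T)     # A -> B, A -> a
        )
        if not ok:
            return False
    return True
```

For instance, the swapping rule cB → Bc is rejected, since its left-hand side mixes a terminal with a variable.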

Linear Bounded Automata

So what type of computational model accepts precisely the context-sensitive languages? → linear bounded automata!

Essentially, LBAs are NFAs with a bounded tape as memory (as opposed to the stack for PDAs).

Definition

A linear bounded automaton (LBA) is a tuple (Q, Σ, Γ, δ, <, >, q0, qa, qr ) where Q, Σ, Γ, δ, q0, qa, qr are defined precisely as in a Turing machine, except that the transition function can neither move the head to the left of the left marker < nor to the right of the right marker >.

An LBA initializes in the configuration <q0 w1 w2 ··· wn>. So, intuitively, the tape of the Turing machine is restricted to the length of the input.

INF2080 Lecture :: 15th February 12 / 20 Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule. Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc. If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.→ this machine is a LBA. Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

INF2080 Lecture :: 15th February 13 / 20 Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc. If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.→ this machine is a LBA. Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule.

INF2080 Lecture :: 15th February 13 / 20 If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.→ this machine is a LBA. Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule. Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc.

INF2080 Lecture :: 15th February 13 / 20 Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.→ this machine is a LBA. Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule. Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc. If the input can thus be rewritten to the start variable S, the machine accepts.

INF2080 Lecture :: 15th February 13 / 20 → this machine is a LBA. Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule. Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc. If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.

INF2080 Lecture :: 15th February 13 / 20 Proof for “⇐” much more involved.

Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda Theorem) A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton with a tape containing the input, nondeterministically guesses a rule in the languages grammar and applies the rule “backwards”, i.e., replaces the input’s symbols occuring on the righthand side (RHS) of the rule with the lefthand side (LHS) of the rule. Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc. If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e. |LHS|≤|RHS|, the string cannot get longer through backwards application of rules. thus the head of the machine never has to leave the scope of the input.→ this machine is a LBA.

INF2080 Lecture :: 15th February 13 / 20 Myhill-Landweber-Kuroda Theorem

Theorem (Myhill-Landweber-Kuroda)
A language is context-sensitive iff there exists a linear bounded automaton that recognizes it.

Proof sketch for “⇒”: Construct an automaton whose tape contains the input. It nondeterministically guesses a rule of the language's grammar and applies the rule “backwards”, i.e., it replaces an occurrence of the rule's right-hand side (RHS) in the current string with the left-hand side (LHS).

Example: Let aabbcc be the input and let the grammar contain the rule bB → bb. Then applying this rule “backwards” on aabbcc yields the string aabBcc.

If the input can thus be rewritten to the start variable S, the machine accepts. Since the grammar is noncontracting, i.e., |LHS| ≤ |RHS|, the string cannot get longer through backwards application of rules; thus the head of the machine never has to leave the scope of the input. → this machine is an LBA.

The proof for “⇐” is much more involved.
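The “⇒” direction can be imitated in code. The sketch below (illustrative, using the NCG for {a^n b^n c^n | n ≥ 1} from earlier) replaces the LBA's nondeterministic guessing with exhaustive search over backwards rule applications; noncontraction guarantees the search stays within strings no longer than the input, mirroring the bounded tape.

```python
# Rules of the noncontracting grammar for {a^n b^n c^n | n >= 1}.
RULES = [("S", "abc"), ("S", "aSBc"), ("cB", "Bc"), ("bB", "bb")]

def accepts(w: str, rules=RULES, start="S") -> bool:
    """Decide membership by applying rules backwards (RHS -> LHS).

    Since |LHS| <= |RHS|, no intermediate string is ever longer than the
    input, so the search space is finite -- just as the LBA's head never
    leaves the input's portion of the tape.
    """
    seen = {w}
    frontier = [w]
    while frontier:
        s = frontier.pop()
        if s == start:               # rewritten all the way back to S
            return True
        for lhs, rhs in rules:       # undo each rule at every position
            i = s.find(rhs)
            while i != -1:
                t = s[:i] + lhs + s[i + len(rhs):]
                if t not in seen:
                    seen.add(t)
                    frontier.append(t)
                i = s.find(rhs, i + 1)
    return False
```

For example, `accepts("aabbcc")` holds, while `accepts("aabbc")` does not.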

Closure properties of CSLs

Union L1 ∪ L2: add a new start variable S and the rule S → S1 | S2 (where S1, S2 are the start variables of grammars for L1 and L2).

Concatenation L1L2: add a new start variable S and the rule S → S1S2.

Kleene star L1∗: add a new start variable S and the rules S → ε | S1 | S1S.

Reversal L1^R = {w^R | w ∈ L1}: create a grammar that contains a rule γ^R B α^R → γ^R β^R α^R for each rule αBγ → αβγ in the grammar of L1.

Intersection L1 ∩ L2: use multitape LBAs (equivalent to LBAs, without proof). Simulate the computation for each language on a separate tape; if both accept, the automaton accepts. Recall that context-free languages are not closed under intersection and complementation!
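The union construction above can be sketched concretely. The grammar encoding below is an illustrative assumption (not from the lecture): a grammar is a tuple (variables, terminals, rules, start), with rules as (lhs, rhs) pairs; the construction presupposes disjoint variable sets.

```python
def union(g1, g2, new_start="S0"):
    """Union construction: a new start S0 with rules S0 -> S1 and S0 -> S2."""
    v1, t1, r1, s1 = g1
    v2, t2, r2, s2 = g2
    # Rename variables beforehand if the two grammars share any.
    assert not (v1 & v2) and new_start not in (v1 | v2)
    rules = list(r1) + list(r2) + [(new_start, s1), (new_start, s2)]
    return (v1 | v2 | {new_start}, t1 | t2, rules, new_start)
```

Only two rules are added; every derivation of the new grammar commits to one of the two original grammars in its first step.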

LBA Problems

But what about the complement of a context-sensitive language? Kuroda phrased two open problems related to LBAs in the 1960s.

First LBA Problem: Are nondeterministic LBAs equivalent to deterministic LBAs? Equivalent complexity-theoretic question: is NSPACE(O(n)) = DSPACE(O(n))?

Second LBA Problem: Is the class of languages accepted by LBAs closed under complementation? Equivalent complexity-theoretic question: is NSPACE(O(n)) = co-NSPACE(O(n))?

The first problem is still an open question, while the second was answered in 1988 by Immerman and Szelepcsényi.

Complement of CSLs

Theorem (Immerman-Szelepcsényi)
NSPACE(O(n)) = co-NSPACE(O(n)).

And hence:

Theorem
The class of context-sensitive languages is closed under complementation.

Decidability spoilers for the next weeks

We will soon have a look at some decidability results of the various classes of languages we have seen:

              x ∈ L   L = ∅   L = Σ∗   L = K   L ∩ K = ∅
regular         ✓       ✓       ✓        ✓        ✓
(DCFL           ✓       ✓       ✓        ✓        ✗)
CFL             ✓       ✓       ✗        ✗        ✗
CSL             ✓       ✗       ✗        ✗        ✗
decidable       ✓       ✗       ✗        ✗        ✗
Turing-rec.     ✗       ✗       ✗        ✗        ✗

Chomsky Hierarchy

Type-0: recursively enumerable, i.e., Turing-recognizable languages.
Type-1: context-sensitive languages.
Type-2: context-free languages.
Type-3: regular languages.

So we've seen/will see:
{Regular Languages} ⊊ {CFLs} ⊊ {CSLs} ⊊ {Turing-rec. Languages} and
{Regular Languages} ⊊ {CFLs} ⊊ {Decidable Languages} ⊊ {Turing-rec. Languages}.

But what is the relationship between {CSLs} and {Decidable Languages}?

Decidable vs. Context-sensitive

Without proof:

Theorem
The class of context-sensitive languages is decidable.

Hence, {CSLs} ⊆ {Decidable Languages}.

However, there exists a decidable language that is not context-sensitive!

Let L = {w | w is a string representation of a CSG G and w ∉ L(G)}.

First of all:
Theorem: L is decidable.
Proof idea: Given an input w, check whether it represents a CSG G. Then use the decider from the previous theorem to check whether w ∉ L(G).
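The decider behind the previous theorem can be made concrete. Because every rule of a CSG is noncontracting (the right-hand side is at least as long as the left-hand side), no sentential form in a derivation of w can be longer than w, so a bounded search over sentential forms terminates. A minimal sketch (not from the lecture; the encoding of a grammar as (lhs, rhs) string pairs with uppercase variables and a start symbol "S" is an assumption made here):

```python
from collections import deque

def csg_membership(rules, start, w):
    """Decide w ∈ L(G) for a noncontracting (context-sensitive) grammar G.

    rules: list of (lhs, rhs) string pairs with len(rhs) >= len(lhs).
    Since rules never shrink a sentential form, any derivation of w
    only visits forms of length <= len(w); that set is finite, so a
    breadth-first search over it is a decision procedure.
    """
    n = len(w)
    seen = {start}
    queue = deque([start])
    while queue:
        form = queue.popleft()
        if form == w:
            return True
        for lhs, rhs in rules:
            # Apply the rule at every occurrence of lhs in the form.
            i = form.find(lhs)
            while i != -1:
                new = form[:i] + rhs + form[i + len(lhs):]
                # Prune forms longer than w: they can never shrink back.
                if len(new) <= n and new not in seen:
                    seen.add(new)
                    queue.append(new)
                i = form.find(lhs, i + 1)
    return False

# The classic CSG for {a^n b^n c^n | n >= 1}:
rules = [("S", "aSBC"), ("S", "aBC"), ("CB", "BC"),
         ("aB", "ab"), ("bB", "bb"), ("bC", "bc"), ("cC", "cc")]
csg_membership(rules, "S", "aabbcc")  # True
csg_membership(rules, "S", "aabbc")   # False
```

This search is wildly inefficient (the pruned space is still exponential in |w|), but decidability, not efficiency, is the point here.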

INF2080 Lecture :: 15th February 19 / 20

A decidable, non-context-sensitive language
Let L = {w | w is a string representation of a CSG G and w ∉ L(G)}.

Theorem: L is not context-sensitive.

Proof idea: Assume L is context-sensitive. Then let w be a string representation of its CSG G.
Question: Is w ∈ L(G) = L?
Assume w ∈ L. Then by definition of L, w is not contained in L(G) = L, i.e., w ∉ L. Contradiction!
Assume w ∉ L. Then w represents a CSG and is not a member of the language it represents. Hence, w ∈ L(G) = L. Contradiction!

⇒ {CSLs} ⊊ {Decidable Languages}!

INF2080 Lecture :: 15th February 20 / 20