Synchronous Dependency Insertion Grammars a Grammar Formalism for Syntax Based Statistical MT

Total Page:16

File Type:pdf, Size:1020Kb

Synchronous Dependency Insertion Grammars a Grammar Formalism for Syntax Based Statistical MT Synchronous Dependency Insertion Grammars A Grammar Formalism for Syntax Based Statistical MT Yuan Ding and Martha Palmer Department of Computer and Information Science University of Pennsylvania Philadelphia, PA 19104, USA {yding, mpalmer}@linc.cis.upenn.edu Abstract In the early 1990s, (Brown et. al. 1993) intro- duced the idea of statistical machine translation, This paper introduces a grammar formalism where the word to word translation probabilities and specifically designed for syntax-based sta- sentence reordering probabilities are estimated from tistical machine translation. The synchro- a large set of parallel sentence pairs. By having the nous grammar formalism we propose in advantage of leveraging large parallel corpora, the this paper takes into consideration the per- statistical MT approach outperforms the traditional vasive structure divergence between lan- transfer based approaches in tasks for which ade- guages, which many other synchronous quate parallel corpora is available (Och, 2003). grammars are unable to model. A Depend- However, a major criticism of this approach is that it ency Insertion Grammars (DIG) is a gen- is void of any internal representation for syntax or erative grammar formalism that captures semantics. word order phenomena within the depend- In recent years, hybrid approaches, which aim at ency representation. Synchronous Depend- applying statistical learning to structured data, began ency Insertion Grammars (SDIG) is the to emerge. Syntax based statistical MT approaches synchronous version of DIG which aims at began with (Wu 1997), who introduced a polyno- capturing structural divergences across the mial-time solution for the alignment problem based languages. While both DIG and SDIG have on synchronous binary trees. (Alshawi et al., 2000) comparatively simpler mathematical forms, extended the tree-based approach by representing we prove that DIG nevertheless has a gen- each production in parallel dependency trees as a eration capacity weakly equivalent to that finite-state transducer. (Yamada and Knight, 2001, of CFG. By making a comparison to TAG 2002) model translation as a sequence of operations and Synchronous TAG, we show how such transforming a syntactic tree in one language into formalisms are linguistically motivated. We the string of the second language. then introduce a probabilistic extension of The syntax based statistical approaches have SDIG. We finally evaluated our current im- been faced with the major problem of pervasive plementation of a simplified version of structural divergence between languages, due to both SDIG for syntax based statistical machine systematic differences between languages (Dorr, translation. 1994) and the vagaries of loose translations in real 1 Introduction corpora. While we would like to use syntactic in- formation in both languages, the problem of non- Dependency grammars have a long history and isomorphism grows when trees in both languages are have played an important role in machine translation required to match. (MT). The early use of dependency structures in ma- To allow the syntax based machine translation chine translation tasks mainly fall into the category approaches to work as a generative process, certain of transfer based MT, where the dependency struc- isomorphism assumptions have to be made. Hence a ture of the source language is first analyzed, then reasonable question to ask is: to what extent should transferred to the target language by using a set of the grammar formalism, which we choose to repre- transduction rules or a transfer lexicon, and finally sent syntactic language transfer, assume isomor- the linear form of the target language sentence is phism between the structures of the two languages? generated. (Hajic et al., 2002) allows for limited non- While the above approach seems to be plausible, isomorphism in that n-to-m matching of nodes in the the transfer process demands intense human effort in two trees is permitted. However, even after extend- creating a working transduction rule set or a transfer ing this model by allowing cloning operations on lexicon, which largely limits the performance and subtrees, (Gildea, 2003) found that parallel trees application domain of the resultant machine transla- over-constrained the alignment problem, and tion system. achieved better results with a tree-to-string model using one input tree than with a tree-to-tree model 5. Simple: it should have a minimal number of using two. different structures and operations so that it will At the same time, grammar theoreticians have be learnable from the empirical data. proposed various generative synchronous grammar In the following sections of this paper, we intro- formalisms for MT, such as Synchronous Context duce a grammar formalism that satisfies the above Free Grammars (S-CFG) (Wu, 1997) or Synchro- properties: Synchronous Dependency Insertion nous Tree Adjoining Grammars (S-TAG) (Shieber Grammar (SDIG). Section 2 gives an informal look and Schabes, 1990). Mathematically, generative at the desired capabilities of a monolingual version synchronous grammars share many good properties Dependency Insertion Grammar (DIG) by address- similar to their monolingual counterparts such as ing the problems with previous dependency gram- CFG or TAG (Joshi and Schabes, 1992). If such a mars. Section 3 gives the formal definition of the synchronous grammar could be learnt from parallel DIG and shows that it is weakly equivalent to Con- corpora, the MT task would become a mathemati- text Free Grammar (CFG). Section 4 shows how cally clean generative process. DIG is linguistically motivated by making a com- However, the problem of inducing a synchronous parison between DIG and Tree Adjoining Grammar grammar from empirical data was never solved. For (TAG). Section 5 specifies the Synchronous DIG example, Synchronous TAGs, proposed by (Shieber and Section 6 gives the probabilistic extension of and Schabes, 1990), which were introduced primar- SDIG. ily for semantics but were later also proposed for translation. From a formal perspective, Syn-TAGs 2 Issues with Dependency Grammars characterize the correspondences between languages by a set of synchronous elementary tree pairs. While 2.1 Dependency Grammars and Statistical MT examples show that this formalism does capture cer- tain cross language structural divergences, there is According to (Fox, 2002), dependency represen- not, to our knowledge, any successful statistical tations have the best phrasal cohesion properties learning method to learn such a grammar from em- across languages. The percentage of head crossings pirical data. We believe that this is due to the limited per chance is 12.62% and that of modifier crossings ability of Synchronous TAG to model structure di- per chance is 9.22%. Observing this fact, it is rea- vergences. This observation will be discussed later sonable to propose a formalism that handles lan- in Section 5. guage transfer based on dependency structures. We studied the problem of learning synchronous What is more, if a formalism based on depend- syntactic sub-structures (parallel dependency treelets) ency structures is made possible, it will have the from unaligned parallel corpora in (Ding and Palmer, nice property of being simple, as expressed in the 2004). At the same time, we would like to formalize following table: a synchronous grammar for syntax based statistical CFG TAG DG MT. The necessity of a well-defined formalism and Node# 2n 2n n certain limitations of the current existing formalisms, Lexicalized? NO YES YES motivate us to design a new synchronous grammar Node types 2 2 1* formalism which will have the following properties: Operation types 1 2 1* 1. Linguistically motivated: it should be able to (*: will be shown later in this paper) capture most language phenomena, e.g. compli- Figure 1. cated word orders such as “wh” movement. The simplicity of a grammar is very important for 2. Without the unrealistic word-to-word isomor- statistical modeling, i.e. when it is being learned phism assumption: it should be able to capture from the corpora and when it is being used in ma- structural variations between the languages. chine translation decoding, we don’t need to condi- 3. Mathematically rigorous: it should have a well tion the probabilities on two different node types or defined formalism and a proven generation ca- operations. pacity, preferably context free or mildly context At the same time, dependency grammars are in- sensitive. herently lexicalized in that each node is one word. 4. Generative: it should be “generative” in a Statistical parsers (Collins 1999) showed perform- mathematical sense. This property is essential ance improvement by using bilexical probabilities, for the grammar to be used in statistical MT. i.e. probabilities of word pair occurrences. This is Each production rule should have its own prob- what dependency grammars model explicitly. ability, which will allow us to decompose the overall translation probability. 2.2 A Generative Grammar? definition of the monolingual Dependency Insertion Grammar. Why do we want the grammar for statistical MT to be generative? First of all, generative models have 3 The DIG Formalism long been studied in the machine learning commu- nity, which will provide us with mathematically rig- orous algorithms for training and decoding. Second, 3.1 Elementary Trees CFG, the most popular formalism in describing Formally, the Dependency Insertion Grammar is natural language phenomena, is generative. Certain defined as a six tuple (C, L, A, B,S, R) . C is a set ideas and algorithms can be borrowed from CFG if of syntactic categories and L is a set of lexical we make the formalism generative. items. A is a set of Type-A trees and B is a set of While there has been much previous work in Type-B trees (defined later). S is a set of the start- formalizing dependency grammars and in its appli- cation to the parsing task, until recently (Joshi and ing categories of the sentences. R is a set of word Rambow, 2003), little attention has been given to the order rules local to each node of the trees.
Recommended publications
  • Functional Unification Grammar: a Formalism for Machine Translation
    FUNCTIONAL UNIFICATION GRAMMAR: A FORMALISM FOR MACHINE TRANSLATION Martin Kay Xerox Palo Alto Research Center 3333 Coyote Hill Road Palo Alto California 94304 and CSLI, Stanford Abstract language--morphological, syntactic, semantic, or whatever--could be stated. A formalism powerful enough to accommodate the Functional Unification Grammar provides an opportunity various different kinds of linguistic phenomena with equal facility to encompass within one formalism and computational system might be unappealing to theoretical linguists because powerful the parts of machine translation systems that have usually been formal systems do not make powerful claims. But the engineering treated separately, natably analysis, transfer, and synthesis. advantages are clear to see. A single formalism would straightfor- Many of the advantages of this formalism come from the fact wardly reduce the number of interpreters to two, one for analysis that it is monotonic allowing data structures to grow differently and one for synthesis. Furthermore, the explanatory value of a as different nondeterministic alternatives in a computation are theory clearly rests on a great deal more than the restriciveness of pursued, but never to be modified in any way. A striking feature its formal base. In particular, the possiblity of encompassing what of this system is that it is fundamental reversible, allowing a to had hitherto been thought to require altogether different kinds of translate as b only if b could translate as a. treatment within a single framework could be theoretically inter- esting. I Overview Another clear improvement on the classical design would A. Machine Translation "result from merging 'the two interpreters associated with a for- malism.
    [Show full text]
  • Wittgenstein and Musical Formalism: a Case Revisited
    Ápeiron. Estudios de filosofía — Monográfico «Wittgenstein. Música y arquitectura» Wittgenstein and Musical Formalism: A Case Revisited Wittgenstein y el formalismo musical: Un caso reconsiderado Hanne Appelqvist University of Helsinki [email protected] Abstract: This article defends a formalist interpretation of Wittgenstein’s later thought on music by com- paring it with Eduard Hanslick’s musical formalism. In doing so, it returns to a disagreement I have had with Bela Szabados who, in his book Wittgenstein as a Philosophical Tone-Poet, claims that the attribution of formalism obscures the role that music played in the development of Wittgenstein’s thought. The paper scrutinizes the four arguments Szabados presents to defend his claim, pertaining to alleged differences be- tween Wittgenstein and Hanslick on their accounts of theory, beauty, rules, and the broader significance of music. I will argue that in each case the similarities between Wittgenstein’s and Hanslick’s respective views outshine possible differences. Ultimately, I will argue that instead of rendering music a marginal phenom- enon suited for mere entertainment, formalism –as presented by Hanslick and Wittgenstein, whom I read as influenced by Kant’s aesthetics– underscores music’s ability to show fundamental features of reality and our relation to it. Music does this precisely as a sensuous yet structured medium that is irreducible to any conceptually determined domain. Keywords: Wittgenstein, Hanslick, Kant, formalism, music. Resumen: Este artículo defiende una interpretación formalista del pensamiento posterior de Wittgenstein so- bre la música comparándolo con el formalismo musical de Eduard Hanslick. Con ese fin, reconsidera un des- acuerdo que he tenido con Bela Szabados.
    [Show full text]
  • Ten Choices for Lexical Semantics
    1 Ten Choices for Lexical Semantics Sergei Nirenburg and Victor Raskin1 Computing Research Laboratory New Mexico State University Las Cruces, NM 88003 {sergei,raskin}@crl.nmsu.edu Abstract. The modern computational lexical semantics reached a point in its de- velopment when it has become necessary to define the premises and goals of each of its several current trends. This paper proposes ten choices in terms of which these premises and goals can be discussed. It argues that the central questions in- clude the use of lexical rules for generating word senses; the role of syntax, prag- matics, and formal semantics in the specification of lexical meaning; the use of a world model, or ontology, as the organizing principle for lexical-semantic descrip- tions; the use of rules with limited scope; the relation between static and dynamic resources; the commitment to descriptive coverage; the trade-off between general- ization and idiosyncracy; and, finally, the adherence to the “supply-side” (method- oriented) or “demand-side” (task-oriented) ideology of research. The discussion is inspired by, but not limited to, the comparison between the generative lexicon ap- proach and the ontological semantic approach to lexical semantics. It is fair to say that lexical semantics, the study of word meaning and of its representation in the lexicon, experienced a powerful resurgence within the last decade.2 Traditionally, linguistic se- mantics has concerned itself with two areas, word meaning and sentence meaning. After the ascen- dancy of logic in the 1930s and especially after “the Chomskian revolution” of the late 1950s-early 1960s, most semanticists, including those working in the transformational and post-transforma- tional paradigm, focused almost exclusively on sentence meaning.
    [Show full text]
  • The Computational Lexical Semantics of Syntagmatic Expressions
    The Computational Lexical Semantics of Syntagmatic Relations Evelyne Viegas, Stephen Beale and Sergei Nirenburg New Mexico State University Computing Research Lab, Las Cruces, NM 88003, USA viegas, sb, sergei©crl, nmsu. edu Abstract inheritance hierarchy of Lexical Semantic Functions In this paper, we address the issue of syntagmatic (LSFs). expressions from a computational lexical semantic perspective. From a representational viewpoint, we 2 Approaches to Syntagmatic argue for a hybrid approach combining linguistic and Relations conceptual paradigms, in order to account for the Syntagmatic relations, also known as collocations, continuum we find in natural languages from free are used differently by lexicographers, linguists and combining words to frozen expressions. In particu- statisticians denoting almost similar but not identi- lar, we focus on the place of lexical and semantic cal classes of expressions. restricted co-occurrences. From a processing view- The traditional approach to collocations has been point, we show how to generate/analyze syntag- lexicographic. Here dictionaries provide infor- matic expressions by using an efficient constraint- mation about what is unpredictable or idiosyn- based processor, well fitted for a knowledge-driven cratic. Benson (1989) synthesizes Hausmann's stud- approach. ies on collocations, calling expressions such as com- 1 Introduction mit murder, compile a dictionary, inflict a wound, etc. "fixed combinations, recurrent combinations" You can take advantage o] the chambermaid 1 is not a or "collocations". In Hausmann's terms (1979) a collocation one would like to generate in the context collocation is composed of two elements, a base ("Ba- of a hotel to mean "use the services of." This is why sis") and a collocate ("Kollokator"); the base is se- collocations should constitute an important part in mantically autonomous whereas the collocate cannot the design of Machine Translation or Multilingual be semantically interpreted in isolation.
    [Show full text]
  • The Semantics of Grammar Formalisms Seen As Computer Languages
    The semantics of grammar formalisms seen as computer languages The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Fernando C. N. Pereira and Stuart M. Shieber. The semantics of grammar formalisms seen as computer languages. In Proceedings of the Tenth International Conference on Computational Linguistics, pages 123-129, Stanford University, Stanford, California, July 2-6 1984. Published Version http://dx.doi.org/10.3115/980491.980518 Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:2309658 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#LAA The Semantics of Grammar Formalisms Seen as Computer Languages Fernando C. N. Pereira and Stuart M. Shieber Artificial Intelligence Center SRI International and Center for the Study of Language and Information Stanford University Abstract a declarative semantics z extended in the natural way from the declarative semantics of context-free grammars. The design, implementation, and use of grammar for- The last point deserves amplification. Context-free ma]isms for natural language have constituted a major grammars possess an obvious declarative semantics in branch of coml)utational linguistics throughout its devel- which nonterminals represent sets of strings and rules rep- opment. By viewing grammar formalisms as just a spe- resent n-ary relations over strings. This is brought out by cial ease of computer languages, we can take advantage of the reinterpretation familiar from formal language theory the machinery of denotational semantics to provide a pre- of context-free grammars as polynomials over concatena- cise specification of their meaning.
    [Show full text]
  • Pragmatic Formalism, Separation of Powers, and the Need to Revisit the Nondelegation Doctrine
    Loyola University Chicago Law Journal Volume 51 Issue 2 Winter 2019 Article 4 2019 Pragmatic Formalism, Separation of Powers, and the Need to Revisit the Nondelegation Doctrine Martin H. Redish Follow this and additional works at: https://lawecommons.luc.edu/luclj Part of the Law Commons Recommended Citation Martin H. Redish, Pragmatic Formalism, Separation of Powers, and the Need to Revisit the Nondelegation Doctrine, 51 Loy. U. Chi. L. J. 363 (2020). Available at: https://lawecommons.luc.edu/luclj/vol51/iss2/4 This Article is brought to you for free and open access by LAW eCommons. It has been accepted for inclusion in Loyola University Chicago Law Journal by an authorized editor of LAW eCommons. For more information, please contact [email protected]. Pragmatic Formalism, Separation of Powers, and the Need to Revisit the Nondelegation Doctrine Martin H. Redish* In a time where the executive branch continues to grow in size and strength, reviving the nondelegation doctrine has become more important than ever. Judicial enforcement of this abandoned rule that Congress cannot delegate its legislative power serves three vital functions. It preserves the separation of powers, prevents tyranny, and promotes democratic accountability. But even if we acknowledge that enforcement of the doctrine is necessary to preserve the American form of government, a more difficult question is how the doctrine ought to be enforced. This Article rejects the formulations of the nondelegation doctrine proposed by both functionalists and formalists and proposes “pragmatic formalism” as a solution that carves a path between the two existing theories. After explaining the theory and applying it to the facts of the Supreme Court’s major nondelegation cases, the Article also proposes a solution to one of the most difficult challenges facing the nondelegation doctrine: delegation during times of national emergency.
    [Show full text]
  • Russian Formalism: a Metapoetics
    Russian Formalism Russian For1nalis1n A Metapoetics by PETER STEINER Cornell University Press I Ithaca I London Copyright © 1984 by Peter Steiner All rights reserved. Except for brief quotations in a review, this book, or parts thereof, must not be reproduced in any form without permission in writing from the publisher. For information, address Cornell University Press, Sage House, 512 East State Street, Ithaca, New York 14850, or visit our website at www.cornellpress.cornell.edu. First published 1984 by Cornell University Press. “Too Much Monkey Business,” by Chuck Berry, copyright © 1956, Arc Music Corp., 110 East 59th Street, New York, N.Y. 10022. International copyright secured. Used by permission. Library of Congress Cataloging-in-Publication Data Steiner, P. (Peter), 1946– Russian formalism. Based on the author’s thesis. Includes index. 1. Formalism (Literary analysis)—Soviet Union. I. Title. PN98.F6S73 1984 801'.95 84-7708 ISBN 978-0-8014-1710-8 (pbk. : alk. paper) The text of this book is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License: https://creativecommons.org/licenses/by-nc-nd/4.0/ Blond hair, good look'n', wants me to marry, get a home, settle down, write a book- ahhhhhhhhhhhh! Too much monkey business, too much monkey business. -CHUCK BERRY Contents Preface 9 1 Who Is Formalism, What Is She? 15 � The Three Metaphors 44 The Machine 44 The Organism 68 The System 99 3 A Synecdoche 1]8 Zaum' 140 Verse 172 Expression 199 4 The Developmental Significance of Russian Formalism 24 2 Index 2 71 7 Preface This book grew out of my earlier comparative study of Rus­ sian Formalism and Prague Structuralism.
    [Show full text]
  • The Algebra of Language Introduction to Syntax
    The$Algebra$of$Language$an$Introduction$to$ Syntax$ Adam$Szczegielniak Rutgers$University [email protected] Ver.$2 Copyright:$Adam$Szczegielniak TOC: Introduction/…………………………………..2 §1Sentence/structure………………………6 §2//Graphs………………………………………14 §3/Word/categories………………………….23 §4/XABar…………………………………………..47 §5/XABar,/Part/2/Functional/Heads……..78 §6CAcommand/and/Binding/……………..100 §7/Movement/Intro…………………………..110 §7.1/Head/Movement………………………..121 §8/DP/movement/……………………………...136 §9/WhAmovement/……………………………..162 1 /Introduction This/is/a/script/designed/to/assist/students/who/are/taking/an/introduction/to/syntax/ class/as/part/of/the/theoretical/linguistics/curriculum./It/is/not/really/designed/as/a/stand/ alone/text/and/is/Best/used/in/conduction/with/class/lectures./The/framework/adopted/in/this/ textBook/is/Generative/Grammar./There/is/no/historical/preview/of/the/Wield./You/will/not/Wind/ in/here/discussion/of/phrase/structure/rules,/which/were/part/of/early/research/in/the/Wield,/ But/you/will/also/not/Wind/more/recent/versions/of/Minimalism./This/is/an/introduction/text/ and/as/such/its/aim/is/to/assist/in/training/you/to/think/like/an/empirical/scientist/and/a/ linguist./For/some,/this/might/Be/a/surprise/that/linguistics/is/aiming/for/the/empiricalA/ scientiWic/method,/as/opposed/to/some/other/more/neBulous/approach/that/is,/often/ mistakenly,/associated/with/the/humanities./This/stems/from/the/very/nature/of/the/ Chomsky/revolution/of/the/1950’s/that/gave/rise/to/Generative/Grammar./Linguistics/is/the/ study/of/what/a/speaker/knows/when/she/he/is/said/to/speak/a/given/language./It/is/a/way/to/
    [Show full text]
  • Form and Formalism in Linguistics
    Form and formalism in linguistics Edited by James McElvenny language History and Philosophy of the Language science press Sciences 1 History and Philosophy of the Language Sciences Editor: James McElvenny In this series: 1. McElvenny, James (ed.). Form and formalism in linguistics. Form and formalism in linguistics Edited by James McElvenny language science press McElvenny, James (ed.). 2019. Form and formalism in linguistics (History and Philosophy of the Language Sciences 1). Berlin: Language Science Press. This title can be downloaded at: http://langsci-press.org/catalog/book/214 © 2019, the authors Published under the Creative Commons Attribution 4.0 Licence (CC BY 4.0): http://creativecommons.org/licenses/by/4.0/ ISBN: 978-3-96110-182-5 (Digital) 978-3-96110-183-2 (Hardcover) DOI:10.5281/zenodo.2654375 Source code available from www.github.com/langsci/214 Collaborative reading: paperhive.org/documents/remote?type=langsci&id=214 Cover and concept of design: Ulrike Harbort Typesetting: James McElvenny Proofreading: Agnes Kim, Andreas Hölzl, Brett Reynolds, Daniela Hanna-Kolbe, Els Elffers, Eran Asoulin, George Walkden, Ivica Jeđud, Jeroen van de Weijer, Judith Kaplan, Katja Politt, Lachlan Mackenzie, Laura Melissa Arnold, Nick Riemer, Tom Bossuyt, Winfried Lechner Fonts: Linux Libertine, Libertinus Math, Arimo, DejaVu Sans Mono Typesetting software:Ǝ X LATEX Language Science Press Unter den Linden 6 10099 Berlin, Germany langsci-press.org Storage and cataloguing done by FU Berlin Contents Preface James McElvenny iii 1 Visual formalisms in comparative-historical linguistics Judith Kaplan 1 2 Alternating sounds and the formal franchise in phonology James McElvenny 35 3 On Sapir’s notion of form/pattern and its aesthetic background Jean-Michel Fortis 59 4 Linguistics as a “special science”: A comparison of Sapir and Fodor Els Elffers 89 5 The impact of Russian formalism on linguistic structuralism Bart Karstens 115 6 The resistant embrace of formalism in the work of Émile Benveniste and Aurélien Sauvageot John E.
    [Show full text]
  • A Formalism and Environment for the Development of a Large Grammar of English
    A Formalism and Environment for the Development of a Large Grammar of English Ted Briscoe, Claire Grover Bran Boguraev, John Carroll Department of Linguistics, University of Lancaster, Computer Laboratory, University of Cambridge, Bailrigg, Lancaster LAI 4YT, England Com Exchange Street, Cambridge CB2 3QG, England ABSTRACT II THE METAGRAMMATICAL NOTATION Natural language grammars with large coverage are typically The metagrammatical notation we have designed is based on the result of many person-years of effort, working with clumsy GPSG, but has been modified and extended to be more flexible formalisms and sub-optimal software support for grammar and expressive and is interpreted somewhat differently. The first development. This paper describes our approach to the task of motivation for these changes is to define a notation which writing a substantial grammar, as part of a collaboration to provides a specialised programming language for specifying produce a general purpose morphological and syntactic analyser grammatical theories and grammars for particular languages, for English. The grammatical formalism we have developed for rather than defining a specific theory directly. This approach both the task is a metagrammatical notation which is a more ensures that developments in syntactic theory will be less likely expressive and computationally tractable variant of Generalized to render the grammar development environment obsolete and Phrase Structure Grammar. We have also implemented a software also provides the grammarian with an expressively flexible and system which provides a highly integrated and very powerful set powerful notation for grammar writing, rather than a formalism of tools for developing and managing a large grammar based on embodying a restrictive theory.
    [Show full text]
  • Knowledge Representation Formalism for Building Semantic Web Ontologies
    Knowledge Representation Formalism For Building Semantic Web Ontologies by Basak Taylan A survey submitted to the Graduate Faculty in Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy, The City University of New York. 2017 Contents 1 INTRODUCTION 6 2 SEMANTIC WEB 11 3 KNOWLEDGE REPRESENTATION 24 3.1 KNOWLEDGE . 24 3.2 KNOWLEDGE REPRESENTATION FORMALISM . 27 3.2.1 Roles Of Knowledge Representation . 27 3.3 KNOWLEDGE REPRESENTATION METHODS . 33 3.3.1 NETWORKED REPRESENTATION . 33 3.3.1.1 SEMANTIC NETWORKS . 33 3.3.1.2 CONCEPTUAL GRAPHS . 39 3.3.2 STRUCTURAL REPRESENTATION . 42 3.3.2.1 FRAMES . 42 3.3.2.2 KL-ONE KNOWLEDGE REPRESENTATION SYSTEMS . 49 3.3.3 LOGIC-BASED REPRESENTATION . 53 3.3.3.1 PROPOSITIONAL LOGIC(PL) . 54 3.3.3.2 FIRST-ORDER LOGIC(FOL) . 58 2 CONTENTS 3 3.3.3.3 DESCRIPTION LOGICS . 64 3.3.3.4 FRAME LOGIC(F-LOGIC) . 76 4 APPLICATIONS 86 4.1 The Open Mind Common Sense Project(OMCS) . 86 4.2 ConceptNet 3: a Flexible, Multilingual Semantic Network for Common Sense Knowledge . 89 4.3 WordNet . 94 4.4 FrameNet . 99 4.5 VerbNet . 102 4.6 The Brandeis Semantic Ontology (BSO) . 107 5 CONCLUSION 109 Abstract The initial idea of Tim Berners Lee when designing The Wold Wide Web was creating a commonly accessible area in the network for information sharing by means of hyperlinks without concerning about platform dependency in late 1980s[28]. Since then, the Web has been growing dramatically and it has became a major information retrieval means.
    [Show full text]
  • Modernism, Formalism, and Structuralism the Beginning of The
    Modernism, Formalism, and Structuralism The beginning of the 20th century marked a significant shift in ideas pertaining to literature, particularly—though not exclusively—in Europe and the United States. Modernism Enduring through the first half of the 20th century, the literary movement known as modernism developed as a virtual denunciation of preceding aesthetic traditions— including Romanticism, a major artistic movement of the 18th and 19th centuries, as well as the 19th-century predominance of realist, naturalist, and Gothic influences displayed in many novels of the Victorian period. Modernism, in contrast, was preoccupied with a new, distinctly pessimistic worldview that emerged among artists who saw themselves responding to a chaotic and unstable contemporary world that was characterized by the social, political, and philosophical upheavals occurring in Europe and the U.S. at the turn of the century. In particular, modernism marked a turn toward innovative uses of traditional literary modes and techniques. New forms of narrative that defied convention were seen in the works of writers such as Gertrude Stein, James Joyce, Virginia Woolf, and Franz Kafka. Joyce’s A Portrait of the Artist as a Young Man (1916) employed an ambitious stream- of-consciousness narrative and experimental style that Joyce developed further in his masterpiece, Ulysses (1922), now considered an iconic modernist novel. British author Virginia Woolf also gave voice to the interior world of her protagonists through her use of stream-of-consciousness in her novels Mrs. Dalloway (1925) and To the Lighthouse (1927). American authors such as Ernest Hemingway and F. Scott Fitzgerald also became pioneers of modernism; Fitzgerald’s nuanced portrayal of 1920s excess in The Great Gatsby (1925) led to the book’s status as one of the great American novels.
    [Show full text]