A Lexical Theory of Phrasal Idioms
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The Generative Lexicon
The Generative Lexicon James Pustejovsky" Computer Science Department Brandeis University In this paper, I will discuss four major topics relating to current research in lexical seman- tics: methodology, descriptive coverage, adequacy of the representation, and the computational usefulness of representations. In addressing these issues, I will discuss what I think are some of the central problems facing the lexical semantics community, and suggest ways of best ap- proaching these issues. Then, I will provide a method for the decomposition of lexical categories and outline a theory of lexical semantics embodying a notion of cocompositionality and type coercion, as well as several levels of semantic description, where the semantic load is spread more evenly throughout the lexicon. I argue that lexical decomposition is possible if it is per- formed generatively. Rather than assuming a fixed set of primitives, I will assume a fixed number of generative devices that can be seen as constructing semantic expressions. I develop a theory of Qualia Structure, a representation language for lexical items, which renders much lexical ambiguity in the lexicon unnecessary, while still explaining the systematic polysemy that words carry. Finally, I discuss how individual lexical structures can be integrated into the larger lexical knowledge base through a theory of lexical inheritance. This provides us with the necessary principles of global organization for the lexicon, enabling us to fully integrate our natural language lexicon into a conceptual whole. 1. Introduction I believe we have reached an interesting turning point in research, where linguistic studies can be informed by computational tools for lexicology as well as an appre- ciation of the computational complexity of large lexical databases. -
Interpreting and Defining Connections in Dependency Structures Sylvain Kahane
Interpreting and defining connections in dependency structures Sylvain Kahane To cite this version: Sylvain Kahane. Interpreting and defining connections in dependency structures. 5th international conference on Dependency Linguistics (Depling), Aug 2019, Paris, France. hal-02291673 HAL Id: hal-02291673 https://hal.parisnanterre.fr//hal-02291673 Submitted on 19 Sep 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Interpreting and defining connections in dependency structures Sylvain Kahane Modyco, Université Paris Nanterre & CNRS [email protected] Abstract This paper highlights the advantages of not interpreting connections in a dependency tree as combina- tions between words but of interpreting them more broadly as sets of combinations between catenae. One of the most important outcomes is the possibility of associating a connection structure to any set of combinations assuming some well-formedness properties and of providing a new way to define de- pendency trees and other kinds of dependency structures, which are not trees but “bubble graphs”. The status of catenae of dependency trees as syntactic units is discussed. 1 Introduction The objective of this article is twofold: first, to show that dependencies in a dependency tree or a similar structure should not generally be interpreted only as combinations between words; second, to show that a broader interpreta- tion of connections has various advantages: It makes it easier to define the dependency structure and to better under- stand the link between different representations of the syntactic structure. -
Some Observations on the Hebrew Desiderative Construction – a Dependency-Based Account in Terms of Catenae1
Thomas Groß Some Observations on the Hebrew Desiderative Construction – A Dependency-Based Account in Terms of Catenae1 Abstract The Modern Hebrew (MH) desiderative construction must obey four conditions: 1. A subordinate clause headed by the clitic še= ‘that’ must be present. 2. The verb in the subordinate clause must be marked with future tense. 3. The grammatical properties genus, number, and person tend to be specified, i.e. if the future tense affix is underspecified, material tends to appear that aids specification, if contextual recovery is unavailable. 4. The units of form that make up the constructional meaning of the desiderative must qualify as a catena. A catena is a dependency-based unit of form, the parts of which are immediately continuous in the vertical dimension. The description of the individual parts of the desiderative must address trans-, pre-, and suffixes, and cliticization. Catena-based morphology is representational, monostratal, dependency-, construction-, and piece-based. 1. Purpose, means and claims The main purpose of this paper is to analyze the Hebrew desiderative construction. This construction is linguistically interesting and challenging for a number of reasons. 1. It is a periphrastic construction, with fairly transparent compositionality. 2. It is transclausal, i.e. some parts of the construction reside in the main clause, and others in the subordinated clause. The complementizer is also part of the construction. 3. The construction consists of more than one word, but it does not qualify as a constituent. Rather the construction cuts into words. 4. Two theoretically 1 I want to thank Outi Bat-El (Tel Aviv University) and three anonymous reviewers for their help and advice. -
Clitics in Dependency Morphology
Clitics in Dependency Morphology Thomas Groß Aichi University, Nagoya, Japan [email protected] tured in much the same fashion as sentences was Abstract proposed first in Williams (1981), and further discussed in the famous “head-debate” between Clitics are challenging for many theories of grammar Zwicky (1985a) and Hudson (1987). In contem- because they straddle syntax and morphology. In most porary morphological theories that attempt to theories, cliticization is considered a phrasal pheno- inform syntax (predominantly within the genera- menon: clitics are affix-like expressions that attach to tive framework) such as Di Sciullo (2005) and whole phrases. Constituency-based grammars in par- the theory of Distributed Morphology (Halle and ticular struggle with the exact constituent structure of such expressions. This paper proposes a solution Marantz 1993, Embick and Noyer 2001, 2007, based on catena-based dependency morphology. This Harley and Noyer 2003, Embick 2003), words theory is an extension of catena-based dependency are now seen as hierarchically structured items. syntax. Following Authors et.al. (in press), a word or Seen in the light of this development, it is time a combination of words in syntax that are continuous for dependency grammar (DG) to make up for its with respect to dominance form a catena. Likewise, a neglect of morphological matters. The assess- morph or a combination of morphs that is continuous ment by Harnisch (2003) that the development of with respect to dominance form a morph catena. Em- a dependency-based morphology requires imme- ploying the concept of morph catena together with a diate attention is accurate. -
A Study of Idiom Translation Strategies Between English and Chinese
ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 3, No. 9, pp. 1691-1697, September 2013 © 2013 ACADEMY PUBLISHER Manufactured in Finland. doi:10.4304/tpls.3.9.1691-1697 A Study of Idiom Translation Strategies between English and Chinese Lanchun Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Shuo Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Abstract—This paper, focusing on idiom translation methods and principles between English and Chinese, with the statement of different idiom definitions and the analysis of idiom characteristics and culture differences, studies the strategies on idiom translation, what kind of method should be used and what kind of principle should be followed as to get better idiom translations. Index Terms— idiom, translation, strategy, principle I. DEFINITIONS OF IDIOMS AND THEIR FUNCTIONS Idiom is a language in the formation of the unique and fixed expressions in the using process. As a language form, idioms has its own characteristic and patterns and used in high frequency whether in written language or oral language because idioms can convey a host of language and cultural information when people chat to each other. In some senses, idioms are the reflection of the environment, life, historical culture of the native speakers and are closely associated with their inner most spirit and feelings. They are commonly used in all types of languages, informal and formal. That is why the extent to which a person familiarizes himself with idioms is a mark of his or her command of language. Both English and Chinese are abundant in idioms. -
Interpreting and Defining Connections in Dependency Structures
Interpreting and defining connections in dependency structures Sylvain Kahane Modyco, Université Paris Nanterre & CNRS [email protected] Abstract This paper highlights the advantages of not interpreting connections in a dependency tree as combina- tions between words but of interpreting them more broadly as sets of combinations between catenae. One of the most important outcomes is the possibility of associating a connection structure to any set of combinations assuming some well-formedness properties and of providing a new way to define de- pendency trees and other kinds of dependency structures, which are not trees but “bubble graphs”. The status of catenae of dependency trees as syntactic units is discussed. 1 Introduction The objective of this article is twofold: first, to show that dependencies in a dependency tree or a similar structure should not generally be interpreted only as combinations between words; second, to show that a broader interpreta- tion of connections has various advantages: It makes it easier to define the dependency structure and to better under- stand the link between different representations of the syntactic structure. We will focus on the possible syntactic combinations (what combines with what), without considering the fact that combinations are generally asymmetric, linking a governor to a dependent. We use the term connection to denote a dependency without the governor-depen- dency hierarchy. Section 2 presents the notions of combination and connection, which are based on the notion of catena (Osborne et al. 2012). The properties of the set of catenae are studied and used to define an equivalence relation on the set of combinations, which is used to define the connections. -
THE LEXICON: a SYSTEM of MATRICES of LEXICAL UNITS and THEIR PROPERTIES ~ Harry H
THE LEXICON: A SYSTEM OF MATRICES OF LEXICAL UNITS AND THEIR PROPERTIES ~ Harry H. Josselson - Uriel Weinreich /I/~ in discussing the fact that at one time many American scholars relied on either the discipline of psychology or sociology for the resolution of semantic problems~ comments: In Soviet lexicology, it seems, neither the tra- ditionalists~ who have been content to work with the categories of classical rhetoric and 19th- century historical semantics~ nor the critical lexicologists in search of better conceptual tools, have ever found reason to doubt that linguistics alone is centrally responsible for the investiga- tion of the vocabulary of languages. /2/ This paper deals with a certain conceptual tool, the matrix, which linguists can use for organizing a lexicon to insure that words will be described (coded) with consistency, that is~ to insure that questions which have been asked about certain words will be asked for all words in the same class, regardless of the fact that they may be more difficult to answer for some than for others. The paper will also dis- cuss certain new categories~ beyond those of classical rhetoric~ which have been introduced into lexicology. i. INTRODUCTION The research in automatic translation brought about by the introduction of computers into the technology has ~The research described herein has b&en supported by the In- formation Systems Branch of the Office of Naval Research. The present work is an amplification of a paper~ "The Lexicon: A Matri~ of Le~emes and Their Properties"~ contributed to the Conference on Mathematical Linguistics at Budapest-Balatonsza- badi, September 6-i0~ 1968. -
Experiments in Idiom Recognition
Experiments in Idiom Recognition Jing Peng and Anna Feldman Department of Computer Science Department of Linguistics Montclair State University Montclair, New Jersey, USA 07043 Abstract Some expressions can be ambiguous between idiomatic and literal interpretations depending on the context they occur in, e.g., sales hit the roof vs. hit the roof of the car. We present a novel method of classifying whether a given instance is literal or idiomatic, focusing on verb-noun constructions. We report state-of-the-art results on this task using an approach based on the hypothesis that the distributions of the contexts of the idiomatic phrases will be different from the contexts of the literal usages. We measure contexts by using projections of the words into vector space. For comparison, we implement Fazly et al. (2009)’s, Sporleder and Li (2009)’s, and Li and Sporleder (2010b)’s methods and apply them to our data. We provide experimental results validating the proposed techniques. 1 Introduction Researchers have been investigating idioms and their properties for many years. According to traditional approaches, an idiom is — in its simplest form— a string of two or more words for which meaning is not derived from the meanings of the individual words comprising that string (Swinney and Cutler, 1979). As such, the meaning of kick the bucket (‘die’) cannot be obtained by breaking down the idiom and an- alyzing the meanings of its constituent parts, to kick and the bucket. In addition to being influenced by the principle of compositionality, the traditional approaches are also influenced by theories of generative grammar (Flores, 1993; Langlotz, 2006) The properties that traditional approaches attribute to idiomatic expressions are also the properties that make them difficult for generative grammars to describe. -
CS474 Natural Language Processing Semantic Analysis Caveats Introduction to Lexical Semantics
CS474 Natural Language Processing Semantic analysis Last class Assigning meanings to linguistic utterances – History Compositional semantics: we can derive the – Tiny intro to semantic analysis meaning of the whole sentence from the meanings of the parts. Next lectures – Max ate a green apple. – Word sense disambiguation Relies on knowing: » Background from linguistics – the meaning of individual words Lexical semantics – how the meanings of individual words combine to form » On-line resources the meaning of groups of words » Computational approaches [next class] – how it all fits in with syntactic analysis Caveats Introduction to lexical semantics Problems with a compositional approach Lexical semantics is the study of – the systematic meaning-related connections among – a former congressman words and – a toy elephant – the internal meaning-related structure of each word – kicked the bucket Lexeme – an individual entry in the lexicon – a pairing of a particular orthographic and phonological form with some form of symbolic meaning representation Sense: the lexeme’s meaning component Lexicon: a finite list of lexemes Lexical semantic relations: Dictionary entries homonymy right adj. located nearer the right hand esp. Homonyms: words that have the same form and unrelated being on the right when facing the same direction meanings – Instead, a bank1 can hold the investments in a custodial account as the observer. in the client’s name. – But as agriculture burgeons on the east bank2, the river will shrink left adj. located nearer to this side of the body even more. than the right. Homophones: distinct lexemes with a shared red n. the color of blood or a ruby. pronunciation – E.g. -
Idioms-And-Expressions.Pdf
Idioms and Expressions by David Holmes A method for learning and remembering idioms and expressions I wrote this model as a teaching device during the time I was working in Bangkok, Thai- land, as a legal editor and language consultant, with one of the Big Four Legal and Tax companies, KPMG (during my afternoon job) after teaching at the university. When I had no legal documents to edit and no individual advising to do (which was quite frequently) I would sit at my desk, (like some old character out of a Charles Dickens’ novel) and prepare language materials to be used for helping professionals who had learned English as a second language—for even up to fifteen years in school—but who were still unable to follow a movie in English, understand the World News on TV, or converse in a colloquial style, because they’d never had a chance to hear and learn com- mon, everyday expressions such as, “It’s a done deal!” or “Drop whatever you’re doing.” Because misunderstandings of such idioms and expressions frequently caused miscom- munication between our management teams and foreign clients, I was asked to try to as- sist. I am happy to be able to share the materials that follow, such as they are, in the hope that they may be of some use and benefit to others. The simple teaching device I used was three-fold: 1. Make a note of an idiom/expression 2. Define and explain it in understandable words (including synonyms.) 3. Give at least three sample sentences to illustrate how the expression is used in context. -
Draft 5 for Printing
Jan Milí č of Kroměř íž and Emperor Charles IV: Preaching, Power, and the Church of Prague Eleanor Janega UCL Thesis Submitted for the degree of PhD in History 1 I, Eleanor Janega, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in my thesis. 2 Abstract During the second half of the fourteenth century Jan Milí č of Krom ěř íž became an active and popular preacher in Prague. The sermons which he delivered focused primarily on themes of reform, and called for a renewal within the church. Despite a sustained popularity with the lay populace of Prague, Milí č faced opposition to his practice from many individual members of the city’s clergy. Eventually he was the subject of twelve articles of accusation sent to the papal court of Avignon. Because of the hostility which Milí č faced, historians have most often written of him as a precursor to the Hussites. As a result he has been identified as an anti-establishment rabble-rouser and it has been assumed that he conducted his career in opposition to the court of the Emperor Charles IV. This thesis, over four body chapters, examines the careers of both Milí č and Charles and argues that instead of being enemies, the two men shared an amicable relationship. The first chapter examines Milí č’s career and will prove that he was well-connected to Charles and several members of his court. It will also examine the most common reasons given to argue that Charles and Milí č were at odds, and disprove them. -
The Art of Lexicography - Niladri Sekhar Dash
LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash THE ART OF LEXICOGRAPHY Niladri Sekhar Dash Linguistic Research Unit, Indian Statistical Institute, Kolkata, India Keywords: Lexicology, linguistics, grammar, encyclopedia, normative, reference, history, etymology, learner’s dictionary, electronic dictionary, planning, data collection, lexical extraction, lexical item, lexical selection, typology, headword, spelling, pronunciation, etymology, morphology, meaning, illustration, example, citation Contents 1. Introduction 2. Definition 3. The History of Lexicography 4. Lexicography and Allied Fields 4.1. Lexicology and Lexicography 4.2. Linguistics and Lexicography 4.3. Grammar and Lexicography 4.4. Encyclopedia and lexicography 5. Typological Classification of Dictionary 5.1. General Dictionary 5.2. Normative Dictionary 5.3. Referential or Descriptive Dictionary 5.4. Historical Dictionary 5.5. Etymological Dictionary 5.6. Dictionary of Loanwords 5.7. Encyclopedic Dictionary 5.8. Learner's Dictionary 5.9. Monolingual Dictionary 5.10. Special Dictionaries 6. Electronic Dictionary 7. Tasks for Dictionary Making 7.1. Panning 7.2. Data Collection 7.3. Extraction of lexical items 7.4. SelectionUNESCO of Lexical Items – EOLSS 7.5. Mode of Lexical Selection 8. Dictionary Making: General Dictionary 8.1. HeadwordsSAMPLE CHAPTERS 8.2. Spelling 8.3. Pronunciation 8.4. Etymology 8.5. Morphology and Grammar 8.6. Meaning 8.7. Illustrative Examples and Citations 9. Conclusion Acknowledgements ©Encyclopedia of Life Support Systems (EOLSS) LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash Glossary Bibliography Biographical Sketch Summary The art of dictionary making is as old as the field of linguistics. People started to cultivate this field from the very early age of our civilization, probably seven to eight hundred years before the Christian era.