'It Works in Practice but Will It Work in Theory?' the Uneasy Relationship Between Lexicography and Matters Theoretical
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Words and Alternative Basic Units for Linguistic Analysis
Words and alternative basic units for linguistic analysis 1 Words and alternative basic units for linguistic analysis Jens Allwood SCCIIL Interdisciplinary Center, University of Gothenburg A. P. Hendrikse, Department of Linguistics, University of South Africa, Pretoria Elisabeth Ahlsén SCCIIL Interdisciplinary Center, University of Gothenburg Abstract The paper deals with words and possible alternative to words as basic units in linguistic theory, especially in interlinguistic comparison and corpus linguistics. A number of ways of defining the word are discussed and related to the analysis of linguistic corpora and to interlinguistic comparisons between corpora of spoken interaction. Problems associated with words as the basic units and alternatives to the traditional notion of word as a basis for corpus analysis and linguistic comparisons are presented and discussed. 1. What is a word? To some extent, there is an unclear view of what counts as a linguistic word, generally, and in different language types. This paper is an attempt to examine various construals of the concept “word”, in order to see how “words” might best be made use of as units of linguistic comparison. Using intuition, we might say that a word is a basic linguistic unit that is constituted by a combination of content (meaning) and expression, where the expression can be phonetic, orthographic or gestural (deaf sign language). On closer examination, however, it turns out that the notion “word” can be analyzed and specified in several different ways. Below we will consider the following three main ways of trying to analyze and define what a word is: (i) Analysis and definitions building on observation and supposed easy discovery (ii) Analysis and definitions building on manipulability (iii) Analysis and definitions building on abstraction 2. -
Greek and Latin Roots, Prefixes, and Suffixes
GREEK AND LATIN ROOTS, PREFIXES, AND SUFFIXES This is a resource pack that I put together for myself to teach roots, prefixes, and suffixes as part of a separate vocabulary class (short weekly sessions). It is a combination of helpful resources that I have found on the web as well as some tips of my own (such as the simple lesson plan). Lesson Plan Ideas ........................................................................................................... 3 Simple Lesson Plan for Word Study: ........................................................................... 3 Lesson Plan Idea 2 ...................................................................................................... 3 Background Information .................................................................................................. 5 Why Study Word Roots, Prefixes, and Suffixes? ......................................................... 6 Latin and Greek Word Elements .............................................................................. 6 Latin Roots, Prefixes, and Suffixes .......................................................................... 6 Root, Prefix, and Suffix Lists ........................................................................................... 8 List 1: MEGA root list ................................................................................................... 9 List 2: Roots, Prefixes, and Suffixes .......................................................................... 32 List 3: Prefix List ...................................................................................................... -
Different but Not All Opposite: Contributions to Lexical Relationships Teaching in Primary School
INTE - ITICAM - IDEC 2018, Paris-FRANCE VOLUME 1 All Different But Not All Opposite: Contributions To Lexical Relationships Teaching In Primary School Adriana BAPTISTA Polytechnic Institute of Porto – School of Media Arts and Design inED – Centre for Research and Innovation in Education Portugal [email protected] Celda CHOUPINA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] José António COSTA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] Joana QUERIDO Polytechnic Institute of Porto – School of Education Portugal [email protected] Inês OLIVEIRA Polytechnic Institute of Porto – School of Education Centre of Linguistics of the University of Porto Portugal [email protected] Abstract The lexicon allows the expression of particular cosmovisions, which is why there are a wide range of lexical relationships, involving different linguistic particularities (Coseriu, 1991; Teixeira , 2005). We find, however, in teaching context, that these variations are often replaced by dichotomous and decontextualized proposals of lexical organization, presented, for instance, in textbooks and other supporting materials (Baptista et al., 2017). Thus, our paper is structured in two parts. First, we will try to account for the diversity of lexical relations (Choupina, Costa & Baptista, 2013), considering phonological, morphological, syntactic, semantic, pragmatic- discursive, cognitive and historical criteria (Lehmann & Martin-Berthet, 2008). Secondly, we present an experimental study that aims at verifying if primary school pupils intuitively organize their mental lexicon in a dichotomous way. -
Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph
mathematics Article Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph Tajana Ban Kirigin 1,* , Sanda Bujaˇci´cBabi´c 1 and Benedikt Perak 2 1 Department of Mathematics, University of Rijeka, R. Matejˇci´c2, 51000 Rijeka, Croatia; [email protected] 2 Faculty of Humanities and Social Sciences, University of Rijeka, SveuˇcilišnaAvenija 4, 51000 Rijeka, Croatia; [email protected] * Correspondence: [email protected] Abstract: This paper describes a graph method for labeling word senses and identifying lexical sentiment potential by integrating the corpus-based syntactic-semantic dependency graph layer, lexical semantic and sentiment dictionaries. The method, implemented as ConGraCNet application on different languages and corpora, projects a semantic function onto a particular syntactical de- pendency layer and constructs a seed lexeme graph with collocates of high conceptual similarity. The seed lexeme graph is clustered into subgraphs that reveal the polysemous semantic nature of a lexeme in a corpus. The construction of the WordNet hypernym graph provides a set of synset labels that generalize the senses for each lexical cluster. By integrating sentiment dictionaries, we introduce graph propagation methods for sentiment analysis. Original dictionary sentiment values are integrated into ConGraCNet lexical graph to compute sentiment values of node lexemes and lexical clusters, and identify the sentiment potential of lexemes with respect to a corpus. The method can be used to resolve sparseness of sentiment dictionaries and enrich the sentiment evaluation of Citation: Ban Kirigin, T.; lexical structures in sentiment dictionaries by revealing the relative sentiment potential of polysemous Bujaˇci´cBabi´c,S.; Perak, B. Lexical Sense Labeling and Sentiment lexemes with respect to a specific corpus. -
Lexical Semantics
Lexical Semantics COMP-599 Oct 20, 2015 Outline Semantics Lexical semantics Lexical semantic relations WordNet Word Sense Disambiguation • Lesk algorithm • Yarowsky’s algorithm 2 Semantics The study of meaning in language What does meaning mean? • Relationship of linguistic expression to the real world • Relationship of linguistic expressions to each other Let’s start by focusing on the meaning of words— lexical semantics. Later on: • meaning of phrases and sentences • how to construct that from meanings of words 3 From Language to the World What does telephone mean? • Picks out all of the objects in the world that are telephones (its referents) Its extensional definition not telephones telephones 4 Relationship of Linguistic Expressions How would you define telephone? e.g, to a three-year- old, or to a friendly Martian. 5 Dictionary Definition http://dictionary.reference.com/browse/telephone Its intensional definition • The necessary and sufficient conditions to be a telephone This presupposes you know what “apparatus”, “sound”, “speech”, etc. mean. 6 Sense and Reference (Frege, 1892) Frege was one of the first to distinguish between the sense of a term, and its reference. Same referent, different senses: Venus the morning star the evening star 7 Lexical Semantic Relations How specifically do terms relate to each other? Here are some ways: Hypernymy/hyponymy Synonymy Antonymy Homonymy Polysemy Metonymy Synecdoche Holonymy/meronymy 8 Hypernymy/Hyponymy ISA relationship Hyponym Hypernym monkey mammal Montreal city red wine beverage 9 Synonymy and Antonymy Synonymy (Roughly) same meaning offspring descendent spawn happy joyful merry Antonymy (Roughly) opposite meaning synonym antonym happy sad descendant ancestor 10 Homonymy Same form, different (and unrelated) meaning Homophone – same sound • e.g., son vs. -
Antonymy and Conceptual Vectors
Antonymy and Conceptual Vectors Didier Schwab, Mathieu Lafourcade and Violaine Prince LIRMM Laboratoire d'informatique, de Robotique et de Micro´electronique de Montpellier MONTPELLIER - FRANCE. schwab,lafourca,prince @lirmm.fr http://www.lirmm.fr/f ~ schwab,g lafourca, prince f g Abstract a kernel of manually indexed terms is necessary For meaning representations in NLP, we focus for bootstrapping the analysis. The transver- 1 our attention on thematic aspects and concep- sal relationships , such as synonymy (LP01), tual vectors. The learning strategy of concep- antonymy and hyperonymy, that are more or tual vectors relies on a morphosyntaxic analy- less explicitly mentioned in definitions can be sis of human usage dictionary definitions linked used as a way to globally increase the coher- to vector propagation. This analysis currently ence of vectors. In this paper, we describe a doesn't take into account negation phenomena. vectorial function of antonymy. This can help This work aims at studying the antonymy as- to improve the learning system by dealing with pects of negation, in the larger goal of its inte- negation and antonym tags, as they are often gration into the thematic analysis. We present a present in definition texts. The antonymy func- model based on the idea of symmetry compat- tion can also help to find an opposite thema to ible with conceptual vectors. Then, we define be used in all generative text applications: op- antonymy functions which allows the construc- posite ideas research, paraphrase (by negation tion of an antonymous vector and the enumer- of the antonym), summary, etc. ation of its potentially antinomic lexical items. -
Phonotactic Complexity of Finnish Nouns FRED KARLSSON
7 Phonotactic Complexity of Finnish Nouns FRED KARLSSON 7.1 Introduction In the continuous list of publications on his homepage, Kimmo Koskenniemi gives an item from 1979 as the first one. But this is not strictly speaking his first publication. Here I shall elevate from international oblivion a report of Kimmo’s from 1978 from which the following introductory prophecy is taken: “The computer might be an extremely useful tool for linguistic re- search. It is fast and precise and capable of treating even large materials” (Koskenniemi 1978: 5). This published report is actually a printed version of Kimmo’s Master’s Thesis in general linguistics where he theoretically analyzed the possibili- ties of automatic lemmatization of Finnish texts, including a formalization of Finnish inflectional morphology. On the final pages of the report he esti- mates that the production rules he formulated may be formalized as analytic algorithms in several ways, that the machine lexicon might consist of some 200,000 (more or less truncated) stems, that there are some 4,000 inflectional elements, that all of these stems and elements can be accommodated on one magnetic tape or in direct-access memory, and that real-time computation could be ‘very reasonable’ (varsin kohtuullista) if the data were well orga- nized and a reasonably big computer were available (ibid.: 52-53). I obviously am the happy owner of a bibliographical rarity because Kimmo’s dedication of 1979 tells me that this is the next to the last copy. This was five years before two-level morphology was launched in 1983 when Kimmo substantiated his 1978 exploratory work by presenting a full- blown theory of computational morphology and entered the international computational-linguistic scene where he has been a main character ever since. -
A Reverse Dictionary of Cypriot Greek
Cypriot Greek Lexicography: A Reverse Dictionary of Cypriot Greek Charalambos Themistocleous, Marianna Katsoyannou, Spyros Armosti & Kyriaki Christodoulou Keywords: reverse dictionary, Cypriot Greek, orthographic variation, orthography standardisation, dialectal lexicography. Abstract This article explores the theoretical issues of producing a dialectal reverse dictionary of Cypriot Greek, the collection of data, the principles for selecting the lemmas among various candidates of word types, their orthographic representation, and the choices that were made for writing a variety without a standardized orthography. 1. Introduction Cypriot Greek (henceforth CG) is a variety of Greek spoken by almost a million people in the Republic of Cyprus. CG differs from Standard Modern Greek (henceforth SMG) with regard to its phonetics, phonology, morphology, syntax, and even pragmatics (Goutsos and Karyolemou 2004; Papapavlou and Pavlou 1998; Papapavlou 2005; Tsiplakou 2004; Katsoyannou et al. 2006; Tsiplakou 2007; Arvaniti 2002; Terkourafi 2003). The study of the vocabulary of CG was one of the research goals of ‘Syntychies’, a research project for the production of lexicographic resources undertaken by the Department of Byzantine and Modern Greek Studies of the University of Cyprus between 2006 and 2010. Another research goal of the project has been the study of the written representation of the dialect—since there exists no standardized CG orthography. The applied part of the Syntychies project includes the creation of a lexicographic database suitable for the production of dialectal dictionaries of CG, such as a reverse dictionary of CG (henceforth RDCG), which is currently under publication. The lexicographic database is hosted in a dedicated webpage, which allows online searching of the database. -
Reverse Dictionary
Imperial Journal of Interdisciplinary Research (IJIR) Vol-2, Issue-4, 2016 ISSN: 2454-1362, http://www.onlinejournal.in Reverse Dictionary Veena Gurram1, Sweta Rathod2, Anish Lushte3, Pranay Vaidya4, Prof. Vinod Alone5 & Prof. Mahendra Pawar6 1,2,3,4B.E Student 5,6Assistant Professor 1,2,3,4,5,6Department Of Computer Engineering, PVPPCOE Sion, Mumbai 400022 Abstract— In this paper, we describe the design and dictionary are the reverse dictionaries [3][4] which is implementation of a reverse dictionary. Unlike a tra- built with certain drawbacks. The existing dictionary ditional forward dictionary, which maps from words receives an input phrase and outputs many output to their definitions, a reverse dictionary takes a user words; therefore it can be tedious for the user to input phrase describing the desired concept, and re- search one from it. Whereas, building a ranking based turns a set of candidate words that satisfy the input dictionary, allows the user to choose from a set of phrase. This work has significant application not only words which are closely related to each other. for the general public, particularly those who work Example: In the existing reverse dictionary, the user closely with words, but also in the general field of inputs a phrase “unknown name” outputs over a 100 conceptual search. search results including “anonymous”, “nameless”, “incognito”, “jane doe”, “john doe”, “some”, “sky- Index Terms—Dictionaries, thesauruses, search pro- blue pink”, “dark horse” etc. [4], But the most cess, web-based services. accurate result for “unknown name” shall be “anonymous”, “nameless” but “incognito” also I. INTRODUCTION contributes some primary meaning to the user input phrase. -
What Is Lexicography?
What is Lexicography? Henning Bergenholtz, Department of Afrikaans and Dutch, University of Stellenbosch, Stellenbosch, Republic of South Africa and Centre for Lexicography, Aarhus University, Aarhus, Denmark ([email protected]) and Rufus H. Gouws, Department of Afrikaans and Dutch, University of Stellenbosch, Stellenbosch, Republic of South Africa ([email protected]) Abstract: Within the field of lexicography there are numerous differences when it comes to the interpretation of the term lexicography and differences in determining the nature, extent and scope of this term. Although it is widely accepted that lexicography consist of two components, i.e. theo- retical lexicography and the lexicographic practice, different definitions of lexicography give no unambiguous reflection of this distinction and of the individual components. This paper looks at some prevailing diverse uses and interpretations of the word lexicography. This is followed by pro- posals to ensure a transformative, unified and comprehensive interpretation of this concept. Keywords: INDEPENDENT DISCIPLINE, INFORMATION TOOL, LEXICOGRAPHER, LEXI- COGRAPHIC PRACTICE, LEXICOGRAPHIC THEORY, LEXICOGRAPHIC TOOL, LEXICOG- RAPHY, LEXICOLOGY, METALEXICOGRAPHY, SCIENTIFIC LEXICOGRAPHY, TERMI- NOGRAPHY Opsomming: Wat is leksikografie? In die leksikografie bestaan daar talle verskille met betrekking tot die interpretasie van die term leksikografie asook met betrekking tot 'n vasstelling van die aard, omvang en bestek van hierdie term. Alhoewel dit wyd aanvaar word dat leksikografie uit twee komponente bestaan, naamlik 'n teoretiese en 'n praktiese komponent, bied onderskeie defini- sies van leksikografie geen ondubbelsinnige beskrywing van hierdie verskil en van die onderskeie komponente nie. Hierdie artikel gee aandag aan enkele bestaande maar uiteenlopende gebruike en interpretasies van die woord leksikografie. Daarna volg voorstelle ter versekering van 'n transforma- tiewe verenigde en omvattende interpretasie van hierdie begrip. -
Using an On-Line Dictionary to Find Rhyming Words And
USING AN ON=LINE DICTIONARY TO FIND RHYMING WORDS AND PRONUNCIATIONS FOR UNKNOWN WORDS Roy J Byrd I.B.M. Thomas J. Watson Research Center Yorktown Heights, New York 10598 Martin S. Chodorow Department of Psychology, Hunter College of CUNY and I.B.M. Thomas J. Watson Research Center Yorktown Heights, New York 10598 ABSTRACT word not found in the dictionary. WordSmith also shows the user words that are "close" to a given word Humans know a great deal about relationships among along dimensions such as spelling (as in published dic- words. This paper discusses relationships among word tionaries), meaning (as in thesauruses), and sound (as in pronunciations. We describe a computer system which rhyming dictionaries). models human judgement of rhyme by assigning specific roles to the location of primary stress, the similarity of Figure I shows a sample of the WordSmith user inter- phonetic segments, and other factors. By using the face. The current word, urgency, labels the text box at model as an experimental tool, we expect to improve our the center of the screen. The box contains the output understanding of rhyme. A related computer model will of the PRONUNC application applied to the current attempt to generate pronunciations for unknown words word: it shows the pronunciation of urgency and the by analogy with those for known words. The analogical mapping between the word's spelling and pronunciation. processes involve techniques for segmenting and PRONUNC represents pronunciations in an alphabet matching word spellings, and for mapping spelling to derived from Webster's Seventh Collegiate Dictionary. In sound in known words. -
Lexeme and Flexeme in a Formal Theory of Grammar
Chapter 8 Lexeme and flexeme in a formal theory of grammar Olivier Bonami Laboratoire de linguistique formelle, Université Paris Diderot Berthold Crysmann Laboratoire de linguistique formelle, CNRS This paper deals with the role played by the notion of a lexeme in a constraint-based lexicalist theory of grammar such as Head-driven Phrase Structure Grammar. Adopting a Word and Paradigm view of inflection, we show how the distinction between lexemes, individuated by their lexical semantics, and flexemes, individuated by their inflectional paradigm, can fruitfully be integrated in such a framework. This allows us to present an integrated analysis of stem spaces, inflection classes, heteroclisis and overabundance. It is often observed by morphologists that contemporary work in theoretical mor- phology has little impact on formal theories of grammar, which on average are content with a view of morphology quite close to that of offered by the post-Bloomfieldian mor- phemic toolkit. A notable exception to this situation is the pervasive use in Head-driven Phrase Structure Grammar (henceforth HPSG) of the distinction between words and lexemes familiar from Word and Paradigm approaches to morphology (see among many others Robins 1959, Hockett 1967, Matthews 1972, Zwicky 1985, Anderson 1992, Aronoff 1994, Stump 2001, Blevins 2016). In this paper we reevaluate the role of the lexeme in HPSG in the light of 20 years of research, and in particular of recent attempts to inte- grate a truly realisational theory of inflection within the HPSG framework (Crysmann & Bonami 2016). We conclude that current theorizing conflates two distinct notions of an abstract lexical object: lexemes, which are characterised in terms of their syntax and semantics, and flexemes (Fradin & Kerleroux 2003), which are characterised in terms of their inflectional paradigm.