Words and Alternative Basic Units for Linguistic Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Words and Alternative Basic Units for Linguistic Analysis Words and alternative basic units for linguistic analysis 1 Words and alternative basic units for linguistic analysis Jens Allwood SCCIIL Interdisciplinary Center, University of Gothenburg A. P. Hendrikse, Department of Linguistics, University of South Africa, Pretoria Elisabeth Ahlsén SCCIIL Interdisciplinary Center, University of Gothenburg Abstract The paper deals with words and possible alternative to words as basic units in linguistic theory, especially in interlinguistic comparison and corpus linguistics. A number of ways of defining the word are discussed and related to the analysis of linguistic corpora and to interlinguistic comparisons between corpora of spoken interaction. Problems associated with words as the basic units and alternatives to the traditional notion of word as a basis for corpus analysis and linguistic comparisons are presented and discussed. 1. What is a word? To some extent, there is an unclear view of what counts as a linguistic word, generally, and in different language types. This paper is an attempt to examine various construals of the concept “word”, in order to see how “words” might best be made use of as units of linguistic comparison. Using intuition, we might say that a word is a basic linguistic unit that is constituted by a combination of content (meaning) and expression, where the expression can be phonetic, orthographic or gestural (deaf sign language). On closer examination, however, it turns out that the notion “word” can be analyzed and specified in several different ways. Below we will consider the following three main ways of trying to analyze and define what a word is: (i) Analysis and definitions building on observation and supposed easy discovery (ii) Analysis and definitions building on manipulability (iii) Analysis and definitions building on abstraction 2. Analysis and definitions building on observation and supposed easy discovery We will start by considering analyses and definitions intended to build on observation of linguistic communication. Here the idea is that words are the basic building blocks of linguistic communication, providing combinable units of meaning and external expression that should as such be fairly directly observable and discoverable when inspecting linguistic communication, whether in written, spoken or gestural form. This can especially be seen in the definition of orthographic words given below. (i) Orthographic words According to Trask (2004) “[a]n orthographic word is a written sequence which has a white space at each end but no white space in the middle”. This definition of ”orthographic word” is both too wide and too narrow, in relation to other notions of word that intuitively have precedence. For example, the expression ”rail road” has 2 two orthographic words but is intuitively one word. This means that the notion of orthographic word as defined captures too much, i.e. it is too wide => too many words. But making ”rail road” into two orthographic words is also too narrow => not capturing the word (semantic unit and phonological stress unit, lexeme) that is actually there. (ii) Phonological words Following Trask again, Trask (2004) defines a phonological word as “a piece of speech which behaves as a unit of pronunciation according to criteria which vary from language to language” Unfortunately, there are units other than words that perhaps meet such phonological requirements, for example phonemes, syllables or breath groups. The definition does not tell us how to differentiate these units from each other. The mention of language-specific features does not help, since, these might be different for different languages, for the different units. In addition, when transcribing words, i.e. making them into orthographic words, typical phonetic information that may be used in the identification of phonological words such as stress, tone patterns, pauses (length) are either typically not represented in transcriptions. One reason for this is that such information is not traditionally part of written language, another is that it may reflect that this information is not so easily consciously recognized/observed by phonetically untrained transcribers. If we consider the relation between orthographic words and phonological words, we first may note that given these two definitions of a word, a consequence is that there is no 1- 1 correspondence between orthographic words, phonological and semantic words. Consider the following examples: rail road (2 orthograpic words - 1 phonological word) or I’m, you’re, won’t and ain’t (1 orthograpic word - 1 phonological word but two semantically motivated words). New York (2 orthographic words) vs. Newfoundland (1 orthographic word). New York and Newfoundland, thus, fairly arbitrarily, have different orthographic status while both probably are single phonological words etc. (iii) Gestural words Using Trask’s definition of phonological words as a model, we can now define gestural words analogously as as “a piece of gestural communication which behaves as a unit of gesturing according to criteria which vary from language to language” The relation between orthographic, phonological and semantically motivated and gestural words is more complex, so that 1 – 1 correspondences between the three word forms are not always possible to establish here either. Concerning gestural languages (sign languages), one reason for this is that while written and spoken words can be seen as variants of the same unit in two different expressive modes, gestural words in sign language are units in a new language and not gestural variants of the same word, in the sense that the written and spoken variants of a word are variants. We can also note that only the definition of orthographic word is operational, i.e. lives up to the desiderata of being both directly observable and discoverable and thus, directly usable as an element in automated information retrieval. Since, as we have seen above, the criteria given for what is a unit of pronunciation or gesturing are not sufficient, these concepts thus remain in need of further specification and clarification. 3 3. Analysis and definitions building on manipulability Many linguists have thought that word criteria, based on inherent word features that are supposed to be directly observable are unreliable and need to be supplemented by other criteria. Some widely used such criteria are criteria that in a syntactic mode focus on the unit status of words. Two criteria are often suggested: (i) Moveability (ii) Resistance to intrusion and interruption Both of these criteria have often been used to define the notion ”word”. We will now consider them one by one. 3.1 Moveability According to this criterion, a word is the smallest element of a sentence that can be moved around without destroying the grammaticality of the sentence. Thus, the fact that the word often in the expression often he went to the house can be moved from first to last position as in he went to the house often, shows that often is a word. A problem with this criterion is that several kinds of units that have not traditionally been considered words can be moved around in a similar fashion. Consider the following examples: (i) Movement of morpheme The unfaithful wife was masked -> The faithful wife was unmasked Even if -un is not usually regarded as a word but as a morpheme, it can be moved around without destroying the grammaticality of the embedding sentence. The meaning is changed, but since the criterion wisely does not demand preservation of meaning, -un passes as a moveable unit. If preservation of meaning had been required, even the example given above, using the word often, might not qualify (the information structural aspect of meaning is changed). In fact, very few changes of word order do not have an effect on meaning and it would not be a trivial task to say which aspects of meaning do not change when word order is changed. If it be objected that there is no movement here, -un is just deleted and affixed to a new unit, a reply to this is that movement can always be analyzed as a combination of deletion and addition and since there is no requirement of preservation of meaning there is no way to rule this type of example out. (ii) Movement of phrase By and large you are right -> You are right by and large The phrase by and large is not usually regarded as a word but clearly behaves in a word like fashion, using this criterion. Thus, if we use the criterion of moveability, it seems that morphemes and fixed phrases are somewhat arbitrarily excluded from word status. 4 Since especially what one might call “lexicalized phrases” are important for our argument, we will give some more examples of expressions of this type that arguably have lexicalized status. The classification and examples are taken from Moon (1998) (cf. also Wray 2002). 1. Different types of “anomalous” collocations At all, by and large, of course, stay put, thank you, in retrospect, kith and kin, on behalf of someone/something, short shrift, to and fro at least, a foregone conclusion, in effect, beg the question, in time, curry favour, foot the bill, toe the line in action, into action, out of action, on show, on display, to a …degree, to a …extent 2. Fomulae Simple formulae alive and well, I’m sorry to say, not exactly, pick and choose, you know Sayings an eye for an eye, curiouser and curiouser, don’t let the bastards grind you down, that’s the way the cookie crumbles, home, James, and don’t spare the horses Proverbs you can’t have your cake and eat it, enough is enough, first come first served Similes good as gold, as old as the hills, like lambs to the slaughter, live like a king 3. Metaphors Transparent metaphors alarm bells ring, behind someone’s back, breathe life into something, on (some) one’s doorstep, pack one’s back Semi-transparent metaphors grasp the nettle, on an even keel, the pecking order, throw the towel in, under one’s belt Opaque metaphors bite the bullet, kick the bucket, over the moon, red herring, shoot the breeze 3.2 Resistance to ‘intrusion and interruption’ The second common criterion for word-hood is that words are the largest units which resist ‘intrusion and interruption’ by the insertion of new material between their constituent parts.
Recommended publications
  • ON SOME CATEGORIES for DESCRIBING the SEMOLEXEMIC STRUCTURE by Yoshihiko Ikegami
    ON SOME CATEGORIES FOR DESCRIBING THE SEMOLEXEMIC STRUCTURE by Yoshihiko Ikegami 1. A lexeme is the minimum unit that carries meaning. Thus a lexeme can be a "word" as well as an affix (i.e., something smaller than a word) or an idiom (i.e,, something larger than a word). 2. A sememe is a unit of meaning that can be realized as a single lexeme. It is defined as a structure constituted by those features having distinctive functions (i.e., serving to distinguish the sememe in question from other semernes that contrast with it).' A question that arises at this point is whether or not one lexeme always corresponds to just one serneme and no more. Three theoretical positions are foreseeable: (I) one which holds that one lexeme always corresponds to just one sememe and no more, (2) one which holds that one lexeme corresponds to an indefinitely large number of sememes, and (3) one which holds that one lexeme corresponds to a certain limited number of sememes. These three positions wiIl be referred to as (1) the "Grundbedeutung" theory, (2) the "use" theory, and (3) the "polysemy" theory, respectively. The Grundbedeutung theory, however attractive in itself, is to be rejected as unrealistic. Suppose a preliminary analysis has revealed that a lexeme seems to be used sometimes in an "abstract" sense and sometimes in a "concrete" sense. In order to posit a Grundbedeutung under such circumstances, it is to be assumed that there is a still higher level at which "abstract" and "concrete" are neutralized-this is certainly a theoretical possibility, but it seems highly unlikely and unrealistic from a psychological point of view.
    [Show full text]
  • Greek and Latin Roots, Prefixes, and Suffixes
    GREEK AND LATIN ROOTS, PREFIXES, AND SUFFIXES This is a resource pack that I put together for myself to teach roots, prefixes, and suffixes as part of a separate vocabulary class (short weekly sessions). It is a combination of helpful resources that I have found on the web as well as some tips of my own (such as the simple lesson plan). Lesson Plan Ideas ........................................................................................................... 3 Simple Lesson Plan for Word Study: ........................................................................... 3 Lesson Plan Idea 2 ...................................................................................................... 3 Background Information .................................................................................................. 5 Why Study Word Roots, Prefixes, and Suffixes? ......................................................... 6 Latin and Greek Word Elements .............................................................................. 6 Latin Roots, Prefixes, and Suffixes .......................................................................... 6 Root, Prefix, and Suffix Lists ........................................................................................... 8 List 1: MEGA root list ................................................................................................... 9 List 2: Roots, Prefixes, and Suffixes .......................................................................... 32 List 3: Prefix List ......................................................................................................
    [Show full text]
  • Semantic Differences in Translation Exploring the Field of Inchoativity
    Semantic differences in translation Exploring the field of inchoativity Lore Vandevoorde language Translation and Multilingual Natural science press Language Processing 13 Translation and Multilingual Natural Language Processing Editors: Oliver Czulo (Universität Leipzig), Silvia Hansen-Schirra (Johannes Gutenberg-Universität Mainz), Reinhard Rapp (Johannes Gutenberg-Universität Mainz) In this series: 1. Fantinuoli, Claudio & Federico Zanettin (eds.). New directions in corpus-based translation studies. 2. Hansen-Schirra, Silvia & Sambor Grucza (eds.). Eyetracking and Applied Linguistics. 3. Neumann, Stella, Oliver Čulo & Silvia Hansen-Schirra (eds.). Annotation, exploitation and evaluation of parallel corpora: TC3 I. 4. Czulo, Oliver & Silvia Hansen-Schirra (eds.). Crossroads between Contrastive Linguistics, Translation Studies and Machine Translation: TC3 II. 5. Rehm, Georg, Felix Sasaki, Daniel Stein & Andreas Witt (eds.). Language technologies for a multilingual Europe: TC3 III. 6. Menzel, Katrin, Ekaterina Lapshinova-Koltunski & Kerstin Anna Kunz (eds.). New perspectives on cohesion and coherence: Implications for translation. 7. Hansen-Schirra, Silvia, Oliver Czulo & Sascha Hofmann (eds). Empirical modelling of translation and interpreting. 8. Svoboda, Tomáš, Łucja Biel & Krzysztof Łoboda (eds.). Quality aspects in institutional translation. 9. Fox, Wendy. Can integrated titles improve the viewing experience? Investigating the impact of subtitling on the reception and enjoyment of film using eye tracking and questionnaire data. 10. Moran, Steven & Michael Cysouw. The Unicode cookbook for linguists: Managing writing systems using orthography profiles. 11. Fantinuoli, Claudio (ed.). Interpreting and technology. 12. Nitzke, Jean. Problem solving activities in post-editing and translation from scratch: A multi-method study. 13. Vandevoorde, Lore. Semantic differences in translation. ISSN: 2364-8899 Semantic differences in translation Exploring the field of inchoativity Lore Vandevoorde language science press Vandevoorde, Lore.
    [Show full text]
  • Download Article
    Advances in Social Science, Education and Humanities Research (ASSEHR), volume 312 International Conference "Topical Problems of Philology and Didactics: Interdisciplinary Approach in Humanities and Social Sciences" (TPHD 2018) Methods of Identifying Members of Synonymic Row Juliya A. Litvinova Elena A. Maklakova Chair of Foreign Languages Chair of Foreign Languages Federal State Budget Educational Institution of Higher Federal State Budget Educational Institution of Higher Education Voronezh State University of Forestry and Education Voronezh State University of Forestry and Technologies named after G.F. Morozov Technologies named after G.F. Morozov Voronezh, Russia Voronezh, Russia [email protected] Affiliation): dept. name of organization Abstract— This article is devoted to identifying the criteria of analysis, method of field modeling, method of semantic synonymity of lexical items. The existence of different definitions interpretation, method of generalization of dictionary of synonymy, the selection criteria of items in the synonymic row definitions, methods of quantitative, lexicographical, indicate the insufficient study and incoherence of this contextual, psycholinguistic analysis. Data for the study were phenomenon in linguistics. The study of the semantics of lexical 2 lexical items in the Russian language (gorodok, gorodishko) items allows explaining the most accurately and authentically the integration and differentiation of lexical items close in meaning. obtained from the Russian Explanatory Dictionaries (V. I. The description of the meaning structure (sememe) is possible Dahl, D. N. Ushakov, S. I. Ozhegov, A. P. Evgenieva, S. A. through the description of its seme composition. The methods of Kuznetsov, T. F. Efremova), Russian National Corpus seme semasiology (lexicographic, psycholinguistic, contextual) (ruscorpora.ru). allow revealing various components in the sememe structure.
    [Show full text]
  • Different but Not All Opposite: Contributions to Lexical Relationships Teaching in Primary School
    INTE - ITICAM - IDEC 2018, Paris-FRANCE VOLUME 1 All Different But Not All Opposite: Contributions To Lexical Relationships Teaching In Primary School Adriana BAPTISTA Polytechnic Institute of Porto – School of Media Arts and Design inED – Centre for Research and Innovation in Education Portugal [email protected] Celda CHOUPINA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] José António COSTA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] Joana QUERIDO Polytechnic Institute of Porto – School of Education Portugal [email protected] Inês OLIVEIRA Polytechnic Institute of Porto – School of Education Centre of Linguistics of the University of Porto Portugal [email protected] Abstract The lexicon allows the expression of particular cosmovisions, which is why there are a wide range of lexical relationships, involving different linguistic particularities (Coseriu, 1991; Teixeira , 2005). We find, however, in teaching context, that these variations are often replaced by dichotomous and decontextualized proposals of lexical organization, presented, for instance, in textbooks and other supporting materials (Baptista et al., 2017). Thus, our paper is structured in two parts. First, we will try to account for the diversity of lexical relations (Choupina, Costa & Baptista, 2013), considering phonological, morphological, syntactic, semantic, pragmatic- discursive, cognitive and historical criteria (Lehmann & Martin-Berthet, 2008). Secondly, we present an experimental study that aims at verifying if primary school pupils intuitively organize their mental lexicon in a dichotomous way.
    [Show full text]
  • Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph
    mathematics Article Lexical Sense Labeling and Sentiment Potential Analysis Using Corpus-Based Dependency Graph Tajana Ban Kirigin 1,* , Sanda Bujaˇci´cBabi´c 1 and Benedikt Perak 2 1 Department of Mathematics, University of Rijeka, R. Matejˇci´c2, 51000 Rijeka, Croatia; [email protected] 2 Faculty of Humanities and Social Sciences, University of Rijeka, SveuˇcilišnaAvenija 4, 51000 Rijeka, Croatia; [email protected] * Correspondence: [email protected] Abstract: This paper describes a graph method for labeling word senses and identifying lexical sentiment potential by integrating the corpus-based syntactic-semantic dependency graph layer, lexical semantic and sentiment dictionaries. The method, implemented as ConGraCNet application on different languages and corpora, projects a semantic function onto a particular syntactical de- pendency layer and constructs a seed lexeme graph with collocates of high conceptual similarity. The seed lexeme graph is clustered into subgraphs that reveal the polysemous semantic nature of a lexeme in a corpus. The construction of the WordNet hypernym graph provides a set of synset labels that generalize the senses for each lexical cluster. By integrating sentiment dictionaries, we introduce graph propagation methods for sentiment analysis. Original dictionary sentiment values are integrated into ConGraCNet lexical graph to compute sentiment values of node lexemes and lexical clusters, and identify the sentiment potential of lexemes with respect to a corpus. The method can be used to resolve sparseness of sentiment dictionaries and enrich the sentiment evaluation of Citation: Ban Kirigin, T.; lexical structures in sentiment dictionaries by revealing the relative sentiment potential of polysemous Bujaˇci´cBabi´c,S.; Perak, B. Lexical Sense Labeling and Sentiment lexemes with respect to a specific corpus.
    [Show full text]
  • Social & Behavioural Sciences SCTCMG 2019 International
    The European Proceedings of Social & Behavioural Sciences EpSBS ISSN: 2357-1330 https://doi.org/10.15405/epsbs.2019.12.04.33 SCTCMG 2019 International Scientific Conference «Social and Cultural Transformations in the Context of Modern Globalism» INTERLEXICOGRAPHY IN TEACHING RUSSIAN AS A SECOND LANGUAGE Elena Baryshnikova (a)*, Dmitrii Kazhuro (b) *Corresponding author (a) Peoples’ Friendship University of Russia, 6, Miklukho-Maklaya str., Moscow, Russia, [email protected], +7 (968) 812-64-61 (b) Peoples’ Friendship University of Russia, 6, Miklukho-Maklaya str., Moscow, Russia, [email protected], +7 (977) 946-74-63 Abstract The paper is concerned with the use of dictionaries of international words by foreign students studying the Russian language. It considers the phenomenon of defining international vocabulary in a dictionary in the context of a scientific field. The authors outline the growth of cross-cultural and cross-language contacts in the history of human civilization. They provide the definition of interlexicography as the method to carry out interlexicological study of international words in various linguistic systems. The paper analyses the issues related to the choice of an illustrative part in the interlexicographic source with regard to qualifying a lexeme as international. There is a background of interlexicographic development based on bi- and multilingual dictionaries of some national languages. The authors analyze the target and goal focus of existing dictionaries of international words, the principles of making up a glossary, its organization and representation in dictionary entries. Through descriptive and comparative methods, along with methodological parameters of linguodidactic representation of the material in line with the educational lexicography, the paper identifies some linguistic and methodological advantages and disadvantages of the existing internationalism dictionaries and their linguodidactic value for teaching Russian as a second language to international students.
    [Show full text]
  • PDF Polysemy and Metaphor in the Verbs of Perception
    Manasia: Polysemy and Metaphor 55 Polysemy and Metaphor in the Verbs of Perception Mihaela Georgiana Manasia ABSTRACT: This paper addresses the idea that has been recently that perception verbs have a polysemous structure motivated by ourput forwardexperience by andseveral understanding studies in the of fieldthe world. of cognitive Metaphor linguistics is not only characteristic of poetic language, but on the contrary, it can be found everywhere in everyday language and the polysemous and constructional alternatives makes them a motivating semantic character of perception verbs reflected into a wide range of syntactic KEY WORDS: polysemy, metaphor, perception verbs, prototypical meaning,field to approach metaphorical in this meaning. respect. olysemy represents, within semantics, the term used to Pcharacterize the situation in which a word has two or more polysemy has been subject to controversies and continues to remain similar meanings. Despite this very simple definition, the concept of In 1980, the study of polysemy and metaphor expands a debatable field in the linguistic research. book Metaphors We Live By. relationwithin cognitive of meanings. linguistics It is perceivedespecially withas categorization Lakoff and Johnson’s namely related meanings are organised They intodefine categories polysemy based as a systematic on family resemblance. 55 56 HARVARD SQUARE SYMPOSIUM | THE FUTURE OF KNOWLEDGE to put forward that perception verbs have a polysemous structure, motivatedRecent by studies our experience in the field and of cognitive understanding semantics of thehave world. tried Metaphor represents one of the cognitive instruments structuring the way in which we think, perceive and act. this varietyThe authors of meanings of Metaphors and a part We of Live everyday By criticized language the that classical affects theory of metaphor as a comparison, describing similarities that already exist.
    [Show full text]
  • CS474 Natural Language Processing Semantic Analysis Caveats Introduction to Lexical Semantics
    CS474 Natural Language Processing Semantic analysis Last class Assigning meanings to linguistic utterances – History Compositional semantics: we can derive the – Tiny intro to semantic analysis meaning of the whole sentence from the meanings of the parts. Next lectures – Max ate a green apple. – Word sense disambiguation Relies on knowing: » Background from linguistics – the meaning of individual words Lexical semantics – how the meanings of individual words combine to form » On-line resources the meaning of groups of words » Computational approaches [next class] – how it all fits in with syntactic analysis Caveats Introduction to lexical semantics Problems with a compositional approach Lexical semantics is the study of – the systematic meaning-related connections among – a former congressman words and – a toy elephant – the internal meaning-related structure of each word – kicked the bucket Lexeme – an individual entry in the lexicon – a pairing of a particular orthographic and phonological form with some form of symbolic meaning representation Sense: the lexeme’s meaning component Lexicon: a finite list of lexemes Lexical semantic relations: Dictionary entries homonymy right adj. located nearer the right hand esp. Homonyms: words that have the same form and unrelated being on the right when facing the same direction meanings – Instead, a bank1 can hold the investments in a custodial account as the observer. in the client’s name. – But as agriculture burgeons on the east bank2, the river will shrink left adj. located nearer to this side of the body even more. than the right. Homophones: distinct lexemes with a shared red n. the color of blood or a ruby. pronunciation – E.g.
    [Show full text]
  • Verbs of 'Preparing Something for Eating by Heating It in a Particular
    DEPARTAMENTO DE FILOLOGÍA INGLESA Y ALEMANA Verbs of ‘preparing something for eating by heating it in a particular way’: a lexicological analysis Grado en Estudios Ingleses Fabián García Díaz Tutora: Mª del Carmen Fumero Pérez San Cristóbal de La Laguna, Tenerife 8 de septiembre de 2015 INDEX 1. Abstract ................................................................................................................................. 3 2. Introduction .......................................................................................................................... 4 3. Theoretical perspective ........................................................................................................ 6 4. Analysis: verbs of to prepare something for eating by heating it in a particular way: cook, fry and roast. ................................................................................................................... 9 4.1. Corpus selection .............................................................................................................. 9 4.2. Verb selection ................................................................................................................ 11 5. Paradigmatic relations ....................................................................................................... 13 5.1. Semantic components and lexematic analysis ............................................................... 13 5.2. Lexical relations ...........................................................................................................
    [Show full text]
  • COMMON and DIFFERENT ASPECTS in a SET of COMMENTARIES in DICTIONARIES and SEMANTIC TAGS Akhmedova Dildora Bakhodirovna a Teacher
    International Scientific Forum on language, literature, translation, literary criticism: international scientific-practical conference on modern approaches and perspectives. Web: https://iejrd.com/ COMMON AND DIFFERENT ASPECTS IN A SET OF COMMENTARIES IN DICTIONARIES AND SEMANTIC TAGS Akhmedova Dildora Bakhodirovna A teacher, BSU Hamidova Iroda Olimovna A student, BSU Annotation: This article discusses dictionaries that can be a source of information for the language corpus, the structure of the dictionary commentary, and the possibilities of this information in tagging corpus units. Key words:corpus linguistics, lexicography, lexical units, semantic tagging, annotated dictionary, general vocabulary, limited vocabulary, illustrative example. I.Introduction Corpus linguistics is closely related to lexicography because the linguistic unit in the dictionary, whose interpretation serves as a linguo-lexicographic supply for tagging corpus units. Digitization of the text of existing dictionaries in the Uzbek language is the first step in linking the dictionary with the corpus. Uzbek lexicography has come a long way, has a rich treasure that serves as a material for the corps. According to Professor E. Begmatov, systematicity in the lexicon is not as obvious as in other levels of language. Lexical units are more numerous than phonemes, morphemes in terms of quantity, and have the property of periodic instability. Therefore, it is not possible to identify and study the lexicon on a scale. Today, world linguistics has begun to solve this problem with the help of language corpora and has already achieved results. Qualitative and quantitative inventory of vocabulary, the possibility of comprehensive research has expanded. The main source for semantic tagging of language units is the explanatory dictionary of the Uzbek language.
    [Show full text]
  • Lexical Semantics
    Lexical Semantics COMP-599 Oct 20, 2015 Outline Semantics Lexical semantics Lexical semantic relations WordNet Word Sense Disambiguation • Lesk algorithm • Yarowsky’s algorithm 2 Semantics The study of meaning in language What does meaning mean? • Relationship of linguistic expression to the real world • Relationship of linguistic expressions to each other Let’s start by focusing on the meaning of words— lexical semantics. Later on: • meaning of phrases and sentences • how to construct that from meanings of words 3 From Language to the World What does telephone mean? • Picks out all of the objects in the world that are telephones (its referents) Its extensional definition not telephones telephones 4 Relationship of Linguistic Expressions How would you define telephone? e.g, to a three-year- old, or to a friendly Martian. 5 Dictionary Definition http://dictionary.reference.com/browse/telephone Its intensional definition • The necessary and sufficient conditions to be a telephone This presupposes you know what “apparatus”, “sound”, “speech”, etc. mean. 6 Sense and Reference (Frege, 1892) Frege was one of the first to distinguish between the sense of a term, and its reference. Same referent, different senses: Venus the morning star the evening star 7 Lexical Semantic Relations How specifically do terms relate to each other? Here are some ways: Hypernymy/hyponymy Synonymy Antonymy Homonymy Polysemy Metonymy Synecdoche Holonymy/meronymy 8 Hypernymy/Hyponymy ISA relationship Hyponym Hypernym monkey mammal Montreal city red wine beverage 9 Synonymy and Antonymy Synonymy (Roughly) same meaning offspring descendent spawn happy joyful merry Antonymy (Roughly) opposite meaning synonym antonym happy sad descendant ancestor 10 Homonymy Same form, different (and unrelated) meaning Homophone – same sound • e.g., son vs.
    [Show full text]