A Graph Model for Words and Their Meanings

Dissertation approved by the Faculty of Philosophy and History of the Universität Stuttgart for the degree of Doctor of Philosophy (Dr. phil.)

Submitted by Beate Dorow, from Albstadt

First reviewer: PD Dr. phil. habil. Ulrich Heid
Second reviewer: Prof. Hinrich Schütze, Ph.D.
Date of oral examination: 03.03.2006

Institut für Maschinelle Sprachverarbeitung, Universität Stuttgart, 2006

Contents

1 Introduction
  1.1 Motivation
  1.2 Semantic similarity and ambiguity
    1.2.1 Ambiguity
    1.2.2 Semantic similarity
  1.3 Idioms
  1.4 Synonyms
  1.5 Contributions of the thesis
  1.6 Outline of the thesis
2 Graphs
  2.1 Terminology and notation
  2.2 Real-world graphs
    2.2.1 Social networks
    2.2.2 Technological networks
    2.2.3 Biological networks
    2.2.4 Graphs arising in linguistics
  2.3 Curvature as a measure of local graph topology
    2.3.1 Clustering in social networks
    2.3.2 Graph-theoretic definition
    2.3.3 Geometric interpretation
3 Data acquisition
  3.1 Acquisition assumption
  3.2 Text material
  3.3 Building the graph
  3.4 Preprocessing of the word graph
    3.4.1 Conflating variants of the same word
    3.4.2 Eliminating weak links
  3.5 Weighting links
  3.6 Spurious edges
    3.6.1 Errors in data acquisition
    3.6.2 Violation of the symmetry assumption
    3.6.3 Semantically heterogeneous lists
    3.6.4 Ambiguity due to conflation of variants
    3.6.5 Statistical properties of the word graph
4 Discovering word senses
  4.1 Intuition
  4.2 Markov Clustering
  4.3 Word Sense Clustering Algorithm
  4.4 Experimental Results
5 Using curvature for ambiguity detection and semantic class acquisition
  5.1 A linguistic interpretation of curvature
  5.2 Curvature as a measure of ambiguity
  5.3 Curvature as a clustering tool
    5.3.1 Determining the curvature threshold
    5.3.2 Allowing for overlapping clusters
  5.4 Evaluation against WordNet
    5.4.1 Precision and Recall
    5.4.2 Mapping clusters to WordNet senses
    5.4.3 Results and Conclusions
  5.5 Uncovering the meanings of novel words
6 From clustering words to clustering word associations
  6.1 Intuition
  6.2 Link graph transformation
  6.3 Some properties of the link graph
  6.4 Clustering the link graph
  6.5 Evaluation
  6.6 Pairs of contrasting clusters
  6.7 Conclusion
7 Idiomaticity in the word graph
  7.1 Introduction
  7.2 Asymmetric links
    7.2.1 Testing the reversibility (symmetry) of links
    7.2.2 Semantic constraints
  7.3 Idiomatic constraints
  7.4 Link analysis
  7.5 Statistical analysis
    7.5.1 Noncompositionality of meaning
    7.5.2 Semantic anomaly
    7.5.3 Limited syntactic variability
    7.5.4 Variability of context
  7.6 Conclusions
8 Synonyms in the word graph
  8.1 Hypothesis
  8.2 Methodology
  8.3 Evaluation experiment
    8.3.1 Test set
    8.3.2 Experimental set-up
  8.4 Revised Hypothesis
  8.5 Conclusion
9 Related Work
  9.1 Extraction of semantic relations
  9.2 Semantic similarity and ambiguity
    9.2.1 Bootstrapping semantic categories
    9.2.2 Hard clustering of words into classes of similar words
    9.2.3 Soft clustering of words into classes of similar words
    9.2.4 Word sense induction by clustering word contexts
    9.2.5 Identifying and assessing ambiguity
  9.3 Idiomaticity
  9.4 Synonymy
10 Summary, Conclusion and Future Research
  10.1 Summary
  10.2 Conclusion and Future Research
11 Zusammenfassung
  11.1 Semantische Ambiguität und Ähnlichkeit
  11.2 Idiome
  11.3 Synonyme
  11.4 Einordnung und zukünftige Forschungsarbeit
Acknowledgements

List of Figures

1.1 Local graph around rock
1.2 Local graph around carrot and stick
1.3 Local graph around writer and author
2.1 A simple graph
2.2 Two subgraphs
2.3 A directed graph
2.4 A weighted graph
2.5 Complete graphs
2.6 A hierarchy
2.7 A parse tree
2.8 WordNet snippet
2.9 A social network
2.10 Curvature
2.11 Surfaces
2.12 Boundary strips
2.13 A planar triangle
2.14 A triangle on a sphere
2.15 A triangle on a saddle
2.16 The Poincaré disk
3.1 Building the word graph
3.2 Local graph around body
3.3 Conflating the variants of a word
3.4 The triangle filter
4.1 Local graph around mouse
4.2 Local graph around wing
5.1 Local graph around head (3D)
5.2 Curvature vs. Frequency
5.3 Local graph around monaco
5.4 Evolution of the giant component
5.5 Evolution of small components
5.6 Curvature clustering
5.7 The clusters of mouse
5.8 The clusters of oil
5.9 The clusters of recorder
5.10 Precision, recall and F-score of the curvature clustering
5.11 Paths in WordNet
5.12 Length of the paths connecting a test word with its label
5.13 Christmas tree
6.1 Link graph transformation
6.2 Two triangles involving squash
6.3 Local graph around the word organ
6.4 Link graph associated with organ
6.5 A clique under the link graph transformation
6.6 Curvature of nodes in the link graph
6.7 Precision and recall of link graph clustering
6.8 F-score of link graph clustering
6.9 Two clusters of humour
6.10 Subgraphs corresponding to three clusters of plant
6.11 Contrasts between clusters
7.1 Directed graph of family relationships
7.2 Hierarchical relationships between aristocrats
7.3 Directed graph of life-events
7.4 Chronology of the seasons
7.5 Lexical substitution
7.6 Local graph around rhyme and reason
7.7 Local graph around rhyme and song
7.8 Local graph around man, woman and child
7.9 Local graph around lock, stock and barrel
7.10 Circular dispersion
8.1 Local graph around pedestrian and walker
8.2 First- and second-order Markov graph
8.3 Local graph around gull and mug
8.4 Target ranks under Hypothesis 1
8.5 Target ranks under Hypothesis 2
9.1 Bootstrapping semantic categories
9.2 Agglomerative clustering
9.3 A cooccurrence matrix
9.4 Word Space
9.5 Vectors in Word Space
9.6 Shared Nearest Neighbors Clustering
9.7 The neighborhood of chocolate in a dictionary
11.1 Die Nachbarschaft von Gericht
11.2 Krümmung
11.3 Graphtransformation

List of Tables

3.1 Statistical properties of the word graph
4.1 Sample of test words and their discovered senses
4.2 German nominalizations and their discovered senses
5.1 Correlation between curvature and WordNet polysemy count
5.2 Properties of curvature clustering for different curvature thresholds