Lexical Semantics Chapter 19 Three Perspectives on Meaning Outline

Lexical Semantics Chapter 19 Three Perspectives on Meaning Outline

Three Perspectives on Meaning 1. Lexical Semantics Lexical Semantics – The meanings of individual words 2. Formal Semantics (or Compositional Semantics or Chapter 19 Sentential Semantics) – HthHow those mean ings com bitbine to ma ke mean ings for individual sentences or utterances Lecture #13 3. Discourse or Pragmatics – How those meanings combine with each other and with November 2012 other facts about various kinds of context to make meanings for a text or discourse. Largely based on Kathy Mckeown’s lecture – Dialog or Conversation is often lumped together with which was based on Dan Jurafsky, Jim Discourse. Martin, and Chris Manning 1 2 Outline: Computational Lexical Preliminaries Semantics • What’s a word? • Introduction to Lexical Semantics – Definitions we’ve used over the class: Types, tokens, stems, roots, uninflected forms, etc… – Word relations such as Homonymy, Polysemy, Synonymy – Online resources: WordNet • Lexeme: An entry in a lexicon consisting of a pairing • Computational Lexical Semantics of a form with a single meaning representation – WdSWord Sense Disam bitibiguation • Supervised • Lexicon: A collection of lexemes • Semi-supervised – Word Similarity • Lemma – citation form – uninflected form (used to • Thesaurus-based represent a lexeme). Need to do morphological • Distributional parsing to get from wordform to lemma (lemmatization) • Lemma is part-of-speech specific (e.g., table N and 3 V) 4 Relationships between word Homonymy meanings • Homonymy • Lexemes that share a form • Polysemy – Phonological, orthographic or both • Synonymy • But have unrelated, distinct meanings • Antonymy • Clear example: • Hypernomy – Bat (wooden stick-like thing) vs – Bat (flying scary mammal thing) • Hyponomy – Or bank (financial institution) versus bank (riverside) • Meronomy • Can be homophones, homographs, or both – Homophones: • Write and right • Piece and peace 5 6 1 Homonymy causes problems for Polysemy NLP applications • The bank is constructed from red brick • Text-to-Speech I withdrew the money from the bank – Same orthographic form but different phonological form • Bass vs bass • Bow vs bow • Are those the same sense? • Record vs record • What about river bank? • Information retrieval – Different meanings same orthographic form • What about: The food bank is having a donation drive • QUERY: bat care next week. • Machine Translation • Different senses but some more related than • Speech recognition others… • When two senses are related semantically we call it polysemy (rather than homonymy) 7 8 Polysemy Metaphor and Metonymy • A single lexeme with multiple related meanings (bank • Specific types of polysemy the building, bank the financial institution) • Metaphor: • Most non-rare words have multiple meanings – Germany will pull Slovenia out of its economic slump. – The number of meanings is related to its frequency – I spent 2 hours on that homework. – VbtVerbs ten d more to po lysemy – ItI put money itGltkinto Google stock. – Distinguishing polysemy from homonymy isn’t always easy • Metonymy (use of one aspect of a concept or entity (or necessary) to refer to other aspects of the entity or to the entity itself) – The White House announced yesterday… • White House refers to the administration whose office is in the White House – This chapter talks about part-of-speech tagging 9 – Bank (building) and bank (financial institution) 10 How do we know when a word Synonyms has more than one sense? • ATIS examples • Words that have the same meaning in some or all – Which flights serve breakfast? contexts – Does America West serve Philadelphia? – Filbert / hazelnut – Couch / sofa • The “zeugma” test: – Big / large – Automobile / car – Vomit / throw up – ?Does United serve breakfast and San Jose? – Water / H2O • Two lexemes are synonyms if they can be successfully substituted for each other in all situations – If so they have the same propositional meaning 11 12 2 Synonyms Some more terminology • Lemmas and word forms • But there are few (or no) examples of perfect –A lexeme is an abstract pairing of meaning and form synonym –A lemma or citation form is the grammatical form that is – Why should that be? used to represent a lexeme – Even if many aspects of meaning are identical • Carpet is the lemma for carpets – Still may not preserve the acceptability acceptability based on notions of • Corpus is the lemma for corpora politeness, slang, register, genre, etc… – Specific surface forms carpets, sung, corpora are called •Example wordforms – Water and H2O • The lemma bank has two senses: – Large coke versus *big coke – Instead, a bank can hold the investments in… – But as agriculture burgeons on the east bank, the river will shrink even more •A sense is a discrete representation of one aspect of 13 the meaning of a word 14 Synonymy is a relation between Antonyms senses rather than words • Senses that are opposites with respect to one feature • Consider the words big and large of their meaning • Are they synonyms? • Otherwise, they are very similar! –How big is that plane? – Dark / light – Would I be flying on a large or small plane? – Short / long • HbthHow about here: – Hot / cold – Miss Nelson, for instance, became a kind of big sister to – Up / down Benjamin. – In / out – ?Miss Nelson, for instance, became a kind of large sister to • More formally: antonyms can Benjamin. – Define a binary opposition or are at opposite ends of a scale •Why? (long/short, fast/slow) – Big has a sense that means being older, or grown up – Be reversives (describe a change of movement in opposite – Large lacks this sense directions): rise/fall, up/down 15 16 Hyponym Hyponymy more formally • One sense is a hyponym of another if the first sense is more specific, denoting a subclass of the other • Extensional: – Car is a hyponym of vehicle – The class denoted by the superordinate extensionally – Dog is a hyponym of animal includes the class denoted by the hyponym – Mango is a hyponym of fruit • ClConversely • Entailment – Vehicle is a hypernym/superordinate of car – A sense A is a hyponym of sense B if being an A entails being a B – Animal is a hypernym of dog – Fruit is a hypernym of mango • Hyponymy is usually transitive – (A hypo B and B hypo C entails A hypo C) Superordinate Vehicle Fruit Furniture mammal Hyponym Car Mango Chair Dog 17 18 3 II. Wordnet Wordnet • A hierarchically organized lexical database • Where it is: • On-line thesaurus + aspects of a dictionary Categgyory Unique Forms • http://wordnetweb.princeton.edu/perl/webwn Noun 117,097 Verb 11,488 Adjective 22,141 Adverb 4,601 19 20 Format of WordNet Entries WordNet Noun Relations • The noun bass has 8 senses in wordnet: • S: (n) bass (the lowest part of the musical range) • S: (n) bass, bass part (the lowest part in polyphonic music) • S: (n) bass, basso (an adult male singer with the lowest voice) • S: (n) sea bass, bass (the lean flesh of a saltwater fish of the family Serranidae) • S: (n) freshwater bass, bass (any of various North American freshwater fish with lean flesh (especially of the genus Micropterus)) • S: (n) bass, bass voice, basso (the lowest adult male singing voice) • S: (n) bass (the member with the lowest range of a family of musical instruments) • S: (n) bass (nontechnical name for any of numerous edible marine and freshwater spiny-finned fishes) • And 1 Adjective Sense: • S: (adj) bass, deep (having or denoting a low vocal or instrumental range) "a deep voice"; "a bass voice is lower than a baritone voice"; "a bass clarinet" 21 22 WordNet Verb Relations 23 24 4 WordNet Hierarchies How is “sense” defined in WordNet? • The set of near-synonyms for a WordNet sense is called a synset (synonym set); it’s their version of a sense or a concept. • Example: chump as a noun to mean – ‘a person w ho is gu llible an d easy to ta ke a dvan tage o f’ – chump#1, fool#2, gull#1, mark#9, patsy#1, fall guy#1, sucker#1, soft touch#1, mug#2 (a person who is gullible and easy to take advantage of) • Each of these senses share this same gloss • Thus, for WordNet, the meaning of this sense of chump is this list. 25 26 Word Sense Disambiguation (WSD) Two variants of WSD task •Given • Lexical Sample task – A word in context, – Small pre-selected set of target words – A fixed inventory of potential word senses – And inventory of senses for each word • Decide which sense of the word this is • All-words task – ElihEnglish-to-SihMTSpanish MT – EditittEvery word in an entire text • Inventory is the set of Spanish translations – A lexicon with senses for each word – Speech Synthesis – Sort-of like part-of-speech tagging • Inventory is homographs with different pronunciations like bass • Except each lemma has its own tagset and bow – Automatic indexing of medical articles – MeSH (Medical Subject Headings) thesaurus entries 27 28 Supervised Machine Learning Approaches Approaches • Supervised • Supervised machine learning approach: – A training corpus of ? – Used to train a classifier that can tag words in text – Just as in part-of-speech tagging, statistical MT. • Semi-supervised – Unsupervised • Dictionary-based techniques • Summary of what we need: • Selectional association – The tag set (“sense inventory”) – Lightly supervised – The training corpus • Bootstrapping – A set of features extracted from the training corpus • Preferred Selectional Association – A classifier 29 30 5 WordNet Bass Supervised WSD 1: WSD Tag • The noun “bass” has 8 senses in WordNet • S: (n) bass#1 (the lowest part of the musical range) • What’s a tag? • S: (n) bass#2, bass part#1 (the lowest part in polyphonic music)

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    9 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us