With an Application to Ontologies and Semantics

Total Page:16

File Type:pdf, Size:1020Kb

With an Application to Ontologies and Semantics On Link Predictions in Complex Networks with an Application to Ontologies and Semantics Inaugural-Dissertation zur Erlangung des Doktorgrades der Philosophie des Fachbereiches Sprache, Literatur, Kultur (05) der Justus-Liebig-Universit¨atGießen vorgelegt von Bastian Entrup aus Hardegsen 2016 Dekan: Prof. Dr. Magnus Huber 1.Berichterstatter: Prof. Dr. Henning Lobin 2.Berichterstatter: Prof. Dr. Thomas Gloning Tag der Disputation: 21.06.2017 2 For my wife. 3 Contents List of Figures 7 List of Tables 9 1 Introduction 11 2 Graph Theory and Complex, Natural Networks 19 2.1 Properties of Graphs and Networks . 19 2.1.1 Vertices, Edges, In- and Out-Degree . 19 2.1.2 Components, Cliques, Clusters, and Communities . 22 2.1.3 Multigraphs, Trees, and Bipartite Graphs . 23 2.1.4 Measure of Centrality and Importance of Vertices . 24 2.1.5 Small-World Networks and Scale-Free Distribution . 28 2.1.6 Terminology . 31 2.2 Language Networks and Computational Models of Language . 31 2.3 Graphs of Language . 34 2.3.1 Hierarchical Structures of the Mind . 34 2.3.2 Complex Network Growth and Models of Language Network Evo- lution . 36 2.4 Conclusion . 42 3 Lexical Semantics and Ontologies 45 3.1 Frame Semantics and Encyclopedic Knowledge . 47 3.2 Field Theories . 49 3.3 Distributional Semantics . 51 3.4 Semantic Relations . 56 3.4.1 Synonymy . 56 3.4.2 Hyponymy . 56 3.4.3 Meronymy . 57 3.4.4 Antonymy and Opposition . 57 3.5 Ontologies . 59 3.5.1 Conception . 59 3.5.2 Ontology Formalizations . 60 3.6 Conclusion . 65 4 4 Relation Extraction and Prediction 68 4.1 Extending Ontologies: Relation Extraction from Text . 68 4.2 Predicting Missing Links . 73 4.3 Possible Machine Learning Algorithms . 80 4.3.1 Machine Learning . 80 4.3.2 Machine Learning Algorithms . 83 4.4 Conclusion . 92 5 WordNet: Analyses and Predictions 95 5.1 Ambiguity: Polysemy and Homonymy . 96 5.1.1 Homonymy . 96 5.1.2 Polysemy . 97 5.2 Architecture of WordNet . 106 5.2.1 Nouns . 108 5.2.2 Adjectives . 111 5.2.3 Adverbs . 114 5.2.4 Verbs . 114 5.2.5 Overview and Shortcomings . 117 5.3 Network Analysis . 120 5.3.1 WordNet as a Graph . 120 5.3.2 Graphs of Single POS . 127 5.3.3 Network Analysis: Overview . 131 5.4 Link Prediction: Polysemy . 134 5.4.1 State of the Art: Finding Polysemy in WordNet . 134 5.4.2 Feature Selection: the Network Approach . 138 5.4.3 Data Preparation . 143 5.5 Evaluation and Results . 144 5.5.1 Homograph Nouns . 144 5.5.2 Homograph Adjectives . 152 5.5.3 Homograph Adverbs . 156 5.5.4 Homograph Verbs . 160 5.6 Results . 162 6 DBpedia: Analyses and Predictions 166 6.1 DBpedia: an Ontology Extracted from Wikipedia . 166 6.1.1 Knowledge Bases . 166 6.1.2 What Is DBpedia? . 166 5 6.1.3 Extraction of Data from Wikipedia . 167 6.1.4 DBpedia Ontology . 168 6.1.5 Similar Approaches . 169 6.1.6 Shortcomings of DBpedia . 171 6.2 Network Analysis . 173 6.3 Approach to Link Identification in DBpedia . 176 6.3.1 Related Work . 176 6.3.2 Identifying Missing Links and the Network Structure of DBpedia . 178 6.4 The Data Sets . 181 6.5 Identification of Missing Links: Evaluation and Results . 185 6.5.1 Setup: Apriori . 185 6.5.2 Quantitative Evaluation: Apriori . 186 6.5.3 Setup: Clustering . 188 6.5.4 Quantitative Evaluation: Clustering . 189 6.6 Unsupervised Classification of Missing Links . 190 6.6.1 Setup: Link Classification Using Apriori Algorithm . 190 6.6.2 Evaluation . 191 6.7 Conclusion . 198 7 Discussion, Conclusion, and Future Work 202 8 References 212 A Appendix 232 6 List of Figures 1 Simple example graphs. 21 2 Closed triangle assumption. 22 3 A small, simple tree. 24 4 A simple bipartite graph with two different kinds of vertices: fa,c,eg and fb,dg....................................... 25 5 Weighted graphs: (a) weighted by degree, (b) weighted by betweenness centrality, and (c) weighted by closeness centrality. 27 6 Plot of frequency and rank of words based on the Brown Corpus: (a) the data following a power law and (b) the same data plotted on logarithmic scale. 29 7 The same data as in Fig. 6(b) but scaled by a factor of 2. 30 8 An Erd¨osand R´enyi random graph with 100 vertices and a connectivity probability of 0:2................................. 37 9 Watts and Strogatz graphs with n = 20 and k = 4. 38 10 A Barab´asiand R´eka graph with 100 vertices. 39 11 Degree distributions of Erd¨osand R´enyi, Watts and Strogatz, and Barab´asi and R´eka models. 40 12 An exemplary RDF graph. 63 13 Sample of hierarchically structured text. 71 14 Sample of hierarchically structured text with marked terms. 71 15 HRG: hierarchical random graph model. 79 16 Word senses, synsets, and word forms in WordNet. 107 17 Exemplary organization of nouns in WordNet. 111 18 Exemplary organization of adjectives in WordNet. 114 19 Exemplary organization of adverbs in WordNet. 115 20 Exemplary organization of verbs in WordNet. 116 21 WordNet degree distribution (directed network). 122 22 WordNet degree distribution (ignoring direction) with fitted power law. 123 23 Schematic plot of synset flaw, jurisprudenceg and its first- and second- degree neighbors. 124 24 NCC plot of WordNet. 126 25 Degree distribution, in- and out-degree combined, of the four WordNet subsets with the appropriate power law fitting. Note the different scaling of the plots. 129 26 NCP plots of the WordNet POS subsets. 132 7 27 Classes of instances yes/blue and no/red plotted according to the geodesic path. 146 28 Instances containing word senses from an adverb and the other given POS and their membership of class yes (blue) or no (red). 155 29 Correlation of geodesic path and class (yes/blue and no/red) in the adjec- tive test and training set. 155 30 Classes in adverb set relative to geodesic path. 158 31 Correlation of degree (word form 1) and class in the adverb test and training set......................................... 158 32 Closeness of the two word sense vertices involved in the instances. 159 33 Changes in the WordNet graph: connected and unconnected word forms of bank...................................... 165 34 Exemplary graph: DBpedia classes and instances and their interrelations. 169 35 Connectivity between the four Beatles in DBpedia. 171 36 The Beatles and their neighbors in DBpedia. 172 37 DBpedia degree distribution. 174 38 Unconnected triad with assumed connection (dotted edge) between two vertices A and C................................. 180 39 Precision and recall in relation to the similarity. 187 40 Classification results with all selected classes: predicted relations in %. Green: correct classifications, red: incorrect classifications. 192 41 Screenshot of Wikipedia page: Barenaked Ladies............... 194 42 Classification results with only well-performing classes: predicted relations in %. Green: correct classifications, red: incorrect classifications. 196 43 Existing relations (grey, dotted) and newly added relations (black) between the four Beatles in DBpedia. 201 8 List of Tables 1 Lexical field w^ısheit. Left: ca. 1200; right: ca. 1300. ..
Recommended publications
  • Cohesion and Coherence in Indonesian Textbooks for the Tenth Grader of Senior High School
    Advances in Social Science, Education and Humanities Research, volume 301 Seventh International Conference on Languages and Arts (ICLA 2018) COHESION AND COHERENCE IN INDONESIAN TEXTBOOKS FOR THE TENTH GRADER OF SENIOR HIGH SCHOOL Syamsudduha1, Johar Amir2, and Wahida3 1Universitas Negeri Makasar, Makassar, Indonesia, [email protected] 2Universitas Negeri Makasar, Makassar, Indonesia, [email protected] 3 Universitas Negeri Makasar, Makassar, Indonesia, [email protected] Abstract This study aims to describe: (1) cohesion and (2) coherence in Indonesian Textbooks for XI Grade Senior High School. This type of research is qualitative analysis research. The data are the statements or disclosures that contain cohesion and coherence in the Indonesian textbooks. The sources of data are the texts in The Indonesian Textbooks for XI Grade Senior High Schoolpublished by the Ministry of Education and Culture of Republic of Indonesia. The data collection techniques are carried out with documenting, reading,and noting. Data analysis techniques are carried out through reduction, presentation, and inference/verification of data. The results of this study showed that (1) the use of two cohesion markers, grammatical cohesionand lexical cohesion. The markers of grammatical cohesion were reference, substitution, ellipsis, and conjunction. The markers of lexical cohesion were repetition, synonym,antonym, hyponym, and collocation. The later result was (2) the use of coherence markers which are addition, sequencing, resistance, more, cause-consequences, time, terms, ways, uses, and explanations. Those findings showed that the textsof Indonesian Textbooks for XI Grade Senior High School published by the Ministry of Education and Culture of Republic of Indonesia were texts that had cohesion and unity of meaning so that Indonesian Textbooks for XI Grade Senior High Schoolpublished by the Ministry of Education and Cultureof The Republic of Indonesia was declared good to be used as a teaching material.
    [Show full text]
  • Metalanguage and Encoding Scheme Design for Digital Lexicography
    MONDILEX: I УШ ИИ а Conceptual Modelling of Networking of '•ШАШЛ ЩЛ Centres for High-Quality Research in Slavic Lexicography and Their Digital Resources L. Stur Institute of Linguistics Slovak Academy of Sciences Metalanguage and Encoding Scheme Design for Digital Lexicography MONDILEX Third Open Workshop Bratislava, Slovakia, 15-16 April, 2009 Proceedings Bratislava 2009 MONDILEX: Conceptual Modelling of Networking of Centres for High- Quality Research in Slavic Lexicography and Their Digital Resources Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences Metalanguage and Encoding Scheme Design for Digital Lexicography Innovative Solutions for Lexical Entry Design in Slavic Lexicography MONDILEX Third Open Workshop Bratislava, Slovakia, 15–16 April, 2009 Proceedings Radovan Garabík (Ed.) The workshop is organized by the project GA 211938 MONDILEX Conceptual Modelling of Networking of Centres for High-Quality Research in Slavic Lexicography and Their Digital Resources supported by EU FP7 programme Capacities – Research Infrastructures Design studies for research infrastructures in all S&T fields Metalanguage and Encoding Scheme Design for Digital Lexicography Bratislava, Ľ. Štúr Institute of Linguistics, 2009. The volume contains contributions presented at the Third open workshop “Metalanguage and encoding scheme design for digital lexicography”, held in Bratislava, Slovakia, on 15–16 April 2009. The workshop is organized by the international project GA 211938 MONDILEX Conceptual Modelling of Networking of Centres for High- Quality
    [Show full text]
  • Exploring the Relationship Between Word-Association and Learners’ Lexical Development
    Exploring the Relationship between Word-Association and Learners’ Lexical Development Jason Peppard An assignment for Master of Arts in Applied Linguistics Module 2 - Lexis July 2007 Centre for English Language Studies Department of English UNIVERSITY OF BIRMINGHAM Birmingham B15 2TT United Kingdom LX/06/02 Follow task 123 outlined on page 152 of McCarthy (1990 Vocabulary OUP). You do not have to use students: anyone who has an L2 but has not been brought up as a bilingual will do. Use at least four subjects and test them on their L2 (or L3/ L4 etc.). Report your findings, giving the level(s) of subjects' L2 (L3, etc.) and including the prompt words and responses. Follow McCarthy's list of evaluation points, adding other points to this if you wish. Approx. 4530 words 1.0 Introduction Learning or acquiring your first language was a piece of cake, right? So why then, is it so difficult for some people to learn a second language? For starters, many learners might be wondering why I referred to a dessert in the opening sentence of a paper concerning vocabulary. The point here is summed up neatly by McCarthy (1990): No matter how well the student learns grammar, no matter how successfully the sounds of L2 are mastered, without words to express a wide range of meanings, communication in an L2 just cannot happen in any meaningful way (p. viii). This paper, based on Task 123 of McCarthy’s Vocabulary (ibid: 152), aims to explore the L2 mental lexicon. A simple word association test consisting of eight stimulus words was administered to both low-level and high-level Japanese EFL students as well as a group of native English speakers for comparative reasons.
    [Show full text]
  • Monosemy and the Dictionary Henri Béjoint
    Monosemy and the Dictionary Henri Béjoint I. The Notion of "Monosemy" in Linguistics The notion of "monosemy" is often mentioned by linguists, though not always under that name—Cruse (1986), for example, uses "univocality", Catford (1983:24) discusses the use of terms such as "oligosemy" "eurysemy" and "stenosemy" — but it is hardly ever defined or exemplified. Also, few linguists have tried to evaluate the quantitative importance of monosemy: how many words can be considered monosemous in English and in other languages? When evaluations are attempted, the results are surprisingly divergent, the discrepancies probably being due to the indeterminacy ofthe definition of "monosemy". The situation is all the more surprising as "polysemy" is discussed in every single book about semantics. Lexical polysemy has been considered as an unfortunate imperfection by many linguists in the past (dialectologists, after Gilliéron, and structuralists), but nowadays it is often presented as an indispensable feature of language: without polysemy, language could not cope with the diversity and the variability of the notions to be expressed. If every single "referent" had a different name, the lexical code would impose an extraordinary burden on the memory of the language user (see Hagège 1985:126).1 Whichever attitude is adopted, polysemy is important for the semanticist: indeed, for some, it is "the very object of semantics" (Rey-Debove 1971:256). If monosemy is inseparable from polysemy, it must be an equally fundamental concept. Its study is particularly important in terminology, since it is one of the most often quoted characteristics of the term as opposed to the word, but it is also important in lexicology and lexicography.
    [Show full text]
  • Online Dictionaries As Emergent Archives of Contemporary Usage and Collaborative Codification
    Online Dictionaries as Emergent Archives of Contemporary Usage and Collaborative Codification Colleen Cotter Queen Mary, University of London John Damaso Arizona State University February 2007 Abstract Within the history of modern English lexicography, individual dictionary editors have had ultimate control over the selection, meaning, and illustration of words; extensive collaboration with contributors has been limited. However, Internet technologies that easily permit exchanges between a user and a database have allowed a new type of dictionary online, one that is built by the collaboration of contributing end-users, allowing ordinary users of dictionaries who are not trained lexicographers to engage in dictionary-making. We discuss a popular online slang dictionary called UrbanDictionary.com (UD) to illustrate how lexicographic principles are joined with Web-only communication technologies to provide a context for collaborative engagement and meaning-making; and to note the many characteristics and functions shared with the traditional print dictionary. Significantly, UD captures what most traditional English dictionaries fall short of: recording ephemeral quotidian spoken language and representing popular views of meaning. By relying on the users of language to select and define words for a dictionary, UD, which defines more than 1 million words, has in effect influenced access to and formulation of the lexis. Keywords computer-mediated communication, lexicography, slang, youth language; English Queen Mary’s OPAL #9 Occasional Papers Advancing Linguistics 1 Introduction English lexicography stems from a tradition of relatively limited functional collaboration, beginning with Samuel Johnson’s dictionary in 1755, in which editors overseeing numerous contributors held the ultimate authority over the selection, meaning, and illustration of words.
    [Show full text]
  • Unit 2 Structures Handout.Pdf
    2. The definition of a language as a structure of structures 2.1. Phonetics and phonology Relevance for studying language in its natural or primary medium: oral sounds rather than written symbols. Phonic medium: the range of sounds produced by the speech organs insofar as the play a role in language Speech sounds: Individual sounds within that range Phonetics is the study of the phonic medium: The study of the production, transmission, and reception of human sound-making used in speech. e.g. classification of sounds as voiced vs voiceless: /b/ vs /p/ Phonology is the study of the phonic medium not in itself but in relation with language. e.g. application of voice to the explanation of differences within the system of language: housen vs housev usen vs usev 2.1.1. Phonetics It is usually divided into three branches which study the phonic medium from three points of view: Articulatory phonetics: speech sounds according to the way in which they are produced by the speech organs. Acoustic phonetics: speech sounds according to the physical properties of their sound-waves. Auditory phonetics: speech sounds according to their perception and identification. Articulatory phonetics has the longest tradition, and its progress in the 19th century contributed a standardize and internationally accepted system of phonetic transcription: the origins of the International Phonetic Alphabet used today and relying on sound symbols and diacritics. It studies production in relation with the vocal tract, i.e., organs such as: lungs trachea or windpipe, containing: larynx vocal folds glottis pharyngeal cavity nose mouth, containing fixed organs: teeth and teeth ridge hard palate pharyngeal wall mobile organs: lips tongue soft palate jaw According to their function and participation, sounds may take several features: Voice: voiced vs voiceless sounds, according to the participation of the vocal folds e.g.
    [Show full text]
  • Different but Not All Opposite: Contributions to Lexical Relationships Teaching in Primary School
    INTE - ITICAM - IDEC 2018, Paris-FRANCE VOLUME 1 All Different But Not All Opposite: Contributions To Lexical Relationships Teaching In Primary School Adriana BAPTISTA Polytechnic Institute of Porto – School of Media Arts and Design inED – Centre for Research and Innovation in Education Portugal [email protected] Celda CHOUPINA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] José António COSTA Polytechnic Institute of Porto – School of Education inED – Centre for Research and Innovation in Education Centre of Linguistics of the University of Porto Portugal [email protected] Joana QUERIDO Polytechnic Institute of Porto – School of Education Portugal [email protected] Inês OLIVEIRA Polytechnic Institute of Porto – School of Education Centre of Linguistics of the University of Porto Portugal [email protected] Abstract The lexicon allows the expression of particular cosmovisions, which is why there are a wide range of lexical relationships, involving different linguistic particularities (Coseriu, 1991; Teixeira , 2005). We find, however, in teaching context, that these variations are often replaced by dichotomous and decontextualized proposals of lexical organization, presented, for instance, in textbooks and other supporting materials (Baptista et al., 2017). Thus, our paper is structured in two parts. First, we will try to account for the diversity of lexical relations (Choupina, Costa & Baptista, 2013), considering phonological, morphological, syntactic, semantic, pragmatic- discursive, cognitive and historical criteria (Lehmann & Martin-Berthet, 2008). Secondly, we present an experimental study that aims at verifying if primary school pupils intuitively organize their mental lexicon in a dichotomous way.
    [Show full text]
  • Translating Lexical Semantic Relations: the First Step Towards Multilingual Wordnets*
    Translating Lexical Semantic Relations: The First Step Towards Multilingual Wordnets* Chu-Ren Huang, I-Ju E. Tseng, Dylan B.S. Tsai Institute of Linguistics, Preparatory Office, Academia Sinica 128 Sec.2 Academy Rd., Nangkang, Taipei, 115, Taiwan, R.O.C. [email protected], {elanna, dylan}@hp.iis.sinica.edu.tw Abstract Wordnets express ontology via a network of Establishing correspondences between words linked by lexical semantic relations. Since wordnets of different languages is essential these words are by definition the lexicon of each to both multilingual knowledge processing language, the wordnet design feature ensures and for bootstrapping wordnets of versatility in faithfully and comprehensively low-density languages. We claim that such representing the semantic content of each correspondences must be based on lexical semantic relations, rather than top ontology language. Hence, on one hand, these conceptual or word translations. In particular, we define atoms reflect linguistic idiosyncrasies; on the a translation equivalence relation as a other hand, the lexical semantic relations (LSR’s) bilingual lexical semantic relation. Such receive universal interpretation across different relations can then be part of a logical languages. For example, the definition of entailment predicting whether source relations such as synonymy or hypernymy is language semantic relations will hold in a universal. The universality of the LSR’s is the target language or not. Our claim is tested foundation that allows wordnet to serve as a with a study of 210 Chinese lexical lemmas potential common semantic network and their possible semantic relations links representation for all languages. The premise is bootstrapped from the Princeton WordNet.
    [Show full text]
  • CS460/626 : Natural Language Processing/Speech, NLP and the Web
    CS460/626 : Natural Language Processing/Speech, NLP and the Web Lecture 24, 25, 26 Wordnet Pushpak Bhattacharyya CSE Dept., IIT Bombay 17th and 19th (morning and night), 2013 NLP Trinity Problem Semantics NLP Trinity Parsing Part of Speech Tagging Morph Analysis Marathi French HMM Hindi English Language CRF MEMM Algorithm NLP Layer Discourse and Corefernce Increased Semantics Extraction Complexity Of Processing Parsing Chunking POS tagging Morphology Background Classification of Words Word Content Function Word Word Verb Noun Adjective Adverb Prepo Conjun Pronoun Interjection sition ction NLP: Thy Name is Disambiguation A word can have multiple meanings and A meaning can have multiple words Word with multiple meanings Where there is a will, Where there is a will, There are hundreds of relatives Where there is a will There is a way There are hundreds of relatives A meaning can have multiple words Proverb “A cheat never prospers” Proverb: “A cheat never prospers but can get rich faster” WSD should be distinguished from structural ambiguity Correct groupings a must … Iran quake kills 87, 400 injured When it rains cats and dogs run for cover Should be distinguished from structural ambiguity Correct groupings a must … Iran quake kills 87, 400 injured When it rains, cats and dogs runs for cover When it rains cats and dogs, run for cover Groups of words (Multiwords) and names can be ambiguous Broken guitar for sale, no strings attached (Pun) Washington voted Washington to power pujaa ne pujaa ke liye phul todaa (Pujaa plucked
    [Show full text]
  • Machine Reading = Text Understanding! David Israel! What Is It to Understand a Text? !
    Machine Reading = Text Understanding! David Israel! What is it to Understand a Text? ! • To understand a text — starting with a single sentence — is to:! • determine its truth (perhaps along with the evidence for its truth)! ! • determine its truth-conditions (roughly: what must the world be like for it to be true)! • So one doesn’t have to check how the world actually is — that is whether the sentence is true! • calculate its entailments! • take — or at least determine — appropriate action in light of it (and some given preferences/goals/desires)! • translate it accurately into another language! • and what does accuracy come to?! • ground it in a cognitively plausible conceptual space! • …….! • These are not necessarily competing alternatives! ! For Contrast: What is it to Engage Intelligently in a Dialogue? ! • Respond appropriately to what your dialogue partner has just said (and done)! • Typically, taking into account the overall purpose(s) of the interaction and the current state of that interaction (the dialogue state)! • Compare and contrast dialogue between “equals” and between a human and a computer, trying to assist the human ! • Siri is a very simple example of the latter! A Little Ancient History of Dialogue Systems! STUDENT (Bobrow 1964) Question-Answering: Word problems in simple algebra! “If the number of customers Tom gets is twice the square of 20% of the number of advertisements he runs, and the number of advertisements he runs is 45, what is the number of customers Tom gets?” ! Overview of the method ! . 1 Map referential expressions to variables. ! ! . 2 Use regular expression templates to identify and transform mathematical expressions.
    [Show full text]
  • Semantic Shifts in the Sphere of Evaluative Units
    Center for Open Access in Science ▪ Belgrade - SERBIA 3rd International e-Conference on Studies in Humanities and Social Sciences http://centerprode.com/conferences/3IeCSHSS.html ISBN (Online) 978-86-81294-02-4 ▪ 2019: 201-210 _________________________________________________________________________ Semantic Shifts in the Sphere of Evaluative Units Tatiana Sallier St. Petersburg State University, RUSSIAN FEDERATION Department of Philology, St. Petersburg Abstract The purpose of the research is to trace some semantic processes occurring in the sphere of evaluative units – lexemes with “good” or “bad” element of meaning. The article is aimed at proving that evaluative units display a semantic shift from more concrete to more abstract meaning. Evaluative lexemes are known to include a denotative (more concrete) seme and an evaluative (abstract) seme. In the process of usage, the denotative seme is suppressed and the lexeme acquires purely evaluative meaning. In the process of “name calling”, “bad” words lose their denotative element and become pure invectives. The denotative seme may not even be known to the speaker. The loss of denotative meaning may occur in the process of word borrowing. Latin “paganus” – a rural dweller – came to mean “pagan” in European languages and the word “поганый” in Russian means just “bad”. Keywords: evaluative, pejorative, semantic shifts, denotative. 1. Introduction The article is aimed at tracing semantic tendencies operating in the sphere of pejorative evaluative lexemes – that is lexemes including an evaluative element of meaning. An evaluative lexeme may be purely evaluative – that is, bear no other meaning except evaluation. Such adjectives as English “good” and “bad” or Russian “хороший”, “плохой” may serve as examples.
    [Show full text]
  • Hindi Wordnet
    Description Hindi WordNet The Hindi WordNet is a system for bringing together different lexical and semantic relations between the Hindi words. It organizes the lexical information in terms of word meanings and can be termed as a lexicon based on psycholinguistic principles. The design of the Hindi WordNet is inspired by the famous English WordNet. In the Hindi WordNet the words are grouped together according to their similarity of meanings. Two words that can be interchanged in a context are synonymous in that context. For each word there is a synonym set, or synset, in the Hindi WordNet, representing one lexical concept. This is done to remove ambiguity in cases where a single word has multiple meanings. Synsets are the basic building blocks of WordNet. The Hindi WordNet deals with the content words, or open class category of words. Thus, the Hindi WordNet contains the following category of words- Noun, Verb, Adjective and Adverb. Each entry in the Hindi WordNet consists of following elements 1. Synset: It is a set of synonymous words. For example, “वि饍यालय, पाठशाला, कू ल” (vidyaalay, paaThshaalaa, skuul) represents the concept of school as an educational institution. The words in the synset are arranged according to the frequency of usage. 2. Gloss: It describes the concept. It consists of two parts: Text definition: It explains the concept denoted by the synset. For example, “िह थान जहााँ प्राथमिक या िाध्यमिक तर की औपचाररक मशक्षा दी जाती है” (vah sthaan jahaan praathamik yaa maadhyamik star kii aupachaarik sikshaa dii jaatii hai) explains the concept of school as an educational institution.
    [Show full text]