Comparative Study on Currently Available Wordnets
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
An Efficient Database Design for Indowordnet Development Using
An Efficient Database Design for IndoWordNet Development Using Hybrid Approach Venkatesh Prabhu2 Shilpa Desai1 Hanumant Redkar1 N eha Prabhugaonkar1 Apurva N agvenkar1 Ramdas Karmali1 (1) GOA UNIVERSITY, Taleigao - Goa (2) THYWAY CREATIONS, Mapusa - Goa [email protected], [email protected], [email protected], [email protected], [email protected], [email protected] ABSTRACT WordNet is a crucial resource that aids in Natural Language Processing (NLP) tasks such as Machine Translation, Information Retrieval, Word Sense Disambiguation, Multi-lingual Dictionary creation, etc. The IndoWordNet is a multilingual WordNet which links WordNets of different Indian languages on a common identification number given to each concept. WordNet is designed to capture the vocabulary of a language and can be considered as a dictionary cum thesaurus and much more. WordNets for some Indian Languages are being developed using expansion approach. In this paper we have discussed the details and our experiences during the evolution of this database design while working on the Indradhanush WordNet Project. The Indradhanush WordNet Project is working on the development of WordNets for seven Indian languages. Our database design gives an efficient plan for storage of WordNet data for all languages. In addition it extends the design to hold specific concepts for a language. KEYWORDS: WordNet, IndoWordNet, synset, database design, expansion approach, semantic relation, lexical relation. Proceedings of the 3rd Workshop on South and Southeast Asian Natural Language Processing (SANLP), pages 229–236, COLING 2012, Mumbai, December 2012. 229 1 Introduction 1.1 WordNet and its storage methods WordNet (Miller, 1993) maintains the concepts in a language, relations between concepts and their ontological details. -
Hittite Etymologies and Notes*
Studia Linguistica Universitatis Iagellonicae Cracoviensis 129 (2012) DOI 10.4467/20834624SL.12.015.0604 ROBERT WOODHOUSE The University oo Queensland, Brisbane [email protected] HITTITE ETYMOLOGIES AND NOTES* Keywords: Hittite, etymology, Proto-Indo-European, historical phonology, semantics Abstract Discussed are the etymologies of twelve Hittite words and word groups (alpa- ‘cloud’, aku- ‘seashell’, ariye/a-zi ‘determine by or consult an oracle’, heu- / he(y)aw- ‘rain’, hāli- ‘pen, corral’, kalmara- ‘ray’ etc., māhla- ‘grapevine branch’, sūu, sūwaw- ‘full’, tarra-tta(ri) ‘be able’ and tarhu-zi ‘id.; conquer’, idālu- ‘evil’, tara-i / tari- ‘become weary, henkan ‘death, doom’) and some points of Hittite historical phonology, such as the fate of medial *-h2n- (sub §7) and final *-i (§13), all of which appear to receive somewhat inadequate treatment in Kloekhorst’s 2008 Hittite etymological dictionary. Several old etymologies are defended and some new ones suggested. The following notes were compiled while writing a response (in press b) to that part of the (2006) paper, recently kindly brought to my attention by its author, Professor Witold Mańczak, that purports to unseat the laryngeal theory on the basis of al- legedly incompatible Hittite material collected over three decades ago by Tischler (1980). The massive debate on the laryngeal theory that essentially followed Tisch- ler’s paper was no doubt in part a response to it and produced solutions to most if not all of the problems raised by Tischler, a position I attempt to summarize in my own paper noted above with reference to the superb Hittite etymological diction- ary recently published by Kloekhorst (2008, hereinafter referred to as K:) with its several innovations in the areas of Hittite and Anatolian historical phonology and morphology. -
The Effectiveness of Monolingual and Bilingual Dictionaries Features Toward the Students Narrative Writing of Uin Malang: Comparative Study
THE EFFECTIVENESS OF MONOLINGUAL AND BILINGUAL DICTIONARIES FEATURES TOWARD THE STUDENTS NARRATIVE WRITING OF UIN MALANG: COMPARATIVE STUDY THESIS WINDA FIFI PUSPITA SARI 10320118 ENGLISH LANGUAGE AND LETTERS DEPARTMENT FACULTY OF HUMANITIES MAULANA MALIK IBRAHIM STATE ISLAMIC UNIVERSITY MALANG 2014 THE EFFECTIVENESS OF MONOLINGUAL AND BILINGUAL DICTIONARIES FEATURES TOWARD THE STUDENTS NARRATIVE WRITING OF UIN MALANG: COMPARATIVE STUDY THESIS Presented to Maulana Malik Ibrahim State Islamic University of Malang In a Partial of the Requirements for the Degree of Sarjana Sastra (S.S.) By: WindaFifiPuspita Sari 10320118 Advisor: Drs. H.DjokoSusanto, M.Ed., Ph.D. NIP 19670529 2000031 001 ENGLISH LANGUAGE AND LETTERS DEPARTMENT FACULTY OF HUMANITIES MAULANA MALIK IBRAHIM STATE ISLAMIC UNIVERSITY MALANG 2014 i ii iii iv MOTTO ُ ……يَ ْزفَ ِع هَّللاُ اله ِذ َين َآمنُوا ِم ْن ُك ْم َواله ِذ َين أوتُوا ْال ِع ْل َم َد َر َج ٍات َو هَّللا ُ مِ َاا َت ْع َالُو َ َبيِ ٌز ….. Allah will exalt those of you who believe, and those who are given knowledge, in high degrees; and Allah is aware of what you do.(Surah Al Mujadilah 58:11) v DEDICATION I dedicate my thesis to my beloved family: I would like to say thank you very much to my beloved parents, brother and sister who always support me during my thesis progress. And thank you very much to my family who never stop bring me in their prayer, cares, and loves that make me always spirit. vi ACKNOWLEDGMENT First of all, I would like to thank Allah SWT who always gives me a blessing and strength to finish my thesis. -
Methods in Lexicography and Dictionary Research* Stefan J
Methods in Lexicography and Dictionary Research* Stefan J. Schierholz, Department Germanistik und Komparatistik, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany ([email protected]) Abstract: Methods are used in every stage of dictionary-making and in every scientific analysis which is carried out in the field of dictionary research. This article presents some general consid- erations on methods in philosophy of science, gives an overview of many methods used in linguis- tics, in lexicography, dictionary research as well as of the areas these methods are applied in. Keywords: SCIENTIFIC METHODS, LEXICOGRAPHICAL METHODS, THEORY, META- LEXICOGRAPHY, DICTIONARY RESEARCH, PRACTICAL LEXICOGRAPHY, LEXICO- GRAPHICAL PROCESS, SYSTEMATIC DICTIONARY RESEARCH, CRITICAL DICTIONARY RESEARCH, HISTORICAL DICTIONARY RESEARCH, RESEARCH ON DICTIONARY USE Opsomming: Metodes in leksikografie en woordeboeknavorsing. Metodes word gebruik in elke fase van woordeboekmaak en in elke wetenskaplike analise wat in die woor- deboeknavorsingsveld uitgevoer word. In hierdie artikel word algemene oorwegings vir metodes in wetenskapfilosofie voorgelê, 'n oorsig word gegee van baie metodes wat in die taalkunde, leksi- kografie en woordeboeknavorsing gebruik word asook van die areas waarin hierdie metodes toe- gepas word. Sleutelwoorde: WETENSKAPLIKE METODES, LEKSIKOGRAFIESE METODES, TEORIE, METALEKSIKOGRAFIE, WOORDEBOEKNAVORSING, PRAKTIESE LEKSIKOGRAFIE, LEKSI- KOGRAFIESE PROSES, SISTEMATIESE WOORDEBOEKNAVORSING, KRITIESE WOORDE- BOEKNAVORSING, HISTORIESE WOORDEBOEKNAVORSING, NAVORSING OP WOORDE- BOEKGEBRUIK 1. Introduction In dictionary production and in scientific work which is carried out in the field of dictionary research, methods are used to reach certain results. Currently there is no comprehensive and up-to-date documentation of these particular methods in English. The article of Mann and Schierholz published in Lexico- * This article is based on the article from Mann and Schierholz published in Lexicographica 30 (2014). -
Determining the Characteristic Vocabulary for a Specialized Dictionary Using Word2vec and a Directed Crawler Gregory Grefenstette, Lawrence Muchemi
Determining the Characteristic Vocabulary for a Specialized Dictionary using Word2vec and a Directed Crawler Gregory Grefenstette, Lawrence Muchemi To cite this version: Gregory Grefenstette, Lawrence Muchemi. Determining the Characteristic Vocabulary for a Special- ized Dictionary using Word2vec and a Directed Crawler. GLOBALEX 2016: Lexicographic Resources for Human Language Technology, May 2016, Portoroz, Slovenia. hal-01323591 HAL Id: hal-01323591 https://hal.inria.fr/hal-01323591 Submitted on 30 May 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Determining the Characteristic Vocabulary for a Specialized Dictionary using Word2vec and a Directed Crawler Gregory Grefenstette Lawrence Muchemi Inria Saclay/TAO, Rue Noetzlin - Bât 660 Inria Saclay/TAO, , Rue Noetzlin - Bât 660 91190 Gif sur Yvette, France 91190 Gif sur Yvette, France [email protected] [email protected] ABSTRACT crawler works. This is followed by a description of one Specialized dictionaries are used to understand concepts in distributional semantics tool, word2vec. Then we show how specific domains, especially where those concepts are not these two tools can be used together to extract the basis of a part of the general vocabulary, or having meanings that specialized vocabulary for a domain. -
An Amerind Etymological Dictionary
An Amerind Etymological Dictionary c 2007 by Merritt Ruhlen ! Printed in the United States of America Library of Congress Cataloging-in-Publication Data Greenberg, Joseph H. Ruhlen, Merritt An Amerind Etymological Dictionary Bibliography: p. Includes indexes. 1. Amerind Languages—Etymology—Classification. I. Title. P000.G0 2007 000!.012 00-00000 ISBN 0-0000-0000-0 (alk. paper) This book is dedicated to the Amerind people, the first Americans Preface The present volume is a revison, extension, and refinement of the ev- idence for the Amerind linguistic family that was initially offered in Greenberg (1987). This revision entails (1) the correction of a num- ber of forms, and the elimination of others, on the basis of criticism by specialists in various Amerind languages; (2) the consolidation of certain Amerind subgroup etymologies (given in Greenberg 1987) into Amerind etymologies; (3) the addition of many reconstructions from different levels of Amerind, based on a comprehensive database of all known reconstructions for Amerind subfamilies; and, finally, (4) the addition of a number of new Amerind etymologies presented here for the first time. I believe the present work represents an advance over the original, but it is at the same time simply one step forward on a project that will never be finished. M. R. September 2007 Contents Introduction 1 Dictionary 11 Maps 272 Classification of Amerind Languages 274 References 283 Semantic Index 296 Introduction This volume presents the lexical and grammatical evidence that defines the Amerind linguistic family. The evidence is presented in terms of 913 etymolo- gies, arranged alphabetically according to the English gloss. -
Exploring Resources in Word Sense Disambiguation for Marathi Language Amit Patil1, Chhaya Patil2, Dr
www.rspsciencehub.com Volume 02 Issue 10S October 2020 Special Issue of First International Conference on Advancements in Management, Engineering and Technology (ICAMET 2020) Exploring Resources in Word Sense Disambiguation for Marathi Language Amit Patil1, Chhaya Patil2, Dr. Rakesh Ramteke3, Dr. R. P. Bhavsar4, Dr. Hemant Darbari5 1,2 Assistant Professor, Department of Computer Application, RCPET’s IMRD, Maharashtra, India 3,4Professor, School of Computer Sciences, KBC North Maharashtra University, Maharashtra, India 5Director General, Centre for Development of Advanced Computing (C-DAC), Maharashtra, India Abstract Word Sense Disambiguation (WSD) is one of the most challenging problems in the research area of natural language processing. To find the correct sense of the word in a particular context is called Word Sense Disambiguation. As a human, we can get a correct sense of the word given in the sentence because of word knowledge of that particular natural language, but it is not an easy task for the machine to disambiguate the word. Developing any WSD system, it required sense repository and sense dictionary. It is very costly and time-consuming to build these resources. Many foreign languages have available these resources, that is why most of the foreign languages like English, German, Spanish etc lot of work is done in these Natural languages. When we look for Indian languages like Hindi, Marathi, Bengali etc. very less work is done. The reason behind this is resource-scarcity. In this paper, we majorly focus on Marathi Language Word Sense Disambiguation because of very less work is done in the Marathi Language as compared to Hindi and other Indian Languages. -
Multimedia Data Mining a Systematic Introduction to Concepts and Theory
Multimedia Data Mining A Systematic Introduction to Concepts and Theory © 2009 by Taylor & Francis Group, LLC C9667_FM.indd 1 10/8/08 10:06:11 AM Chapman & Hall/CRC Data Mining and Knowledge Discovery Series SERIES EDITOR Vipin Kumar University of Minnesota Department of Computer Science and Engineering Minneapolis, Minnesota, U.S.A AIMS AND SCOPE This series aims to capture new developments and applications in data mining and knowledge discovery, while summarizing the computational tools and techniques useful in data analysis. This series encourages the integration of mathematical, statistical, and computational methods and techniques through the publication of a broad range of textbooks, reference works, and hand- books. The inclusion of concrete examples and applications is highly encouraged. The scope of the series includes, but is not limited to, titles in the areas of data mining and knowledge discovery methods and applications, modeling, algorithms, theory and foundations, data and knowledge visualization, data mining systems and tools, and privacy and security issues. PUBLISHED TITLES UNDERSTANDING COMPLEX DATASETS: Data Mining with Matrix Decompositions David Skillicorn COMPUTATIONAL METHODS OF FEATURE SELECTION Huan Liu and Hiroshi Motoda CONSTRAINED CLUSTERING: Advances in Algorithms, Theory, and Applications Sugato Basu, Ian Davidson, and Kiri L. Wagsta KNOWLEDGE DISCOVERY FOR COUNTERTERRORISM AND LAW ENFORCEMENT David Skillicorn MULTIMEDIA DATA MINING: A Systematic Introduction to Concepts and Theory Zhongfei Zhang and Ruofei Zhang © 2009 by Taylor & Francis Group, LLC C9667_FM.indd 2 10/8/08 10:06:11 AM Chapman & Hall/CRC Data Mining and Knowledge Discovery Series Multimedia Data Mining A Systematic Introduction to Concepts and Theory Zhongfei Zhang Ruofei Zhang © 2009 by Taylor & Francis Group, LLC C9667_FM.indd 3 10/8/08 10:06:11 AM The cover images were provided by Yu He, who also participated in the design of the cover page. -
How You Can Help Your Child in French Immersion
Created by the Staff of St. Angela School French Language Arts HOW YOU CAN HELP YOUR CHILD IN FRENCH IMMERSION Here are some suggestions that will support your child in a second language program. Be positive. Just a little work and encouragement on your part can make a significant difference to your child’s attitude towards and achievement in French. • Help your child make connections in language (for example: banane - banana). • Point out French in your community, for example, signs, labels, brochures, neighbours, street names. • Provide some out-of school language and cultural experiences - . • Support your child's learning by providing the necessary tools: English/French dictionary, a dictionary of synonyms (like a thesaurus). For more information on how to support your child in French Immersion view: parentGuideFrench.pdf Yes - You Can Help Your Child in French Alberta Education Website. Homework Tool box from Ontario Created by the Staff of St. Angela School Bring French into Your Home • Use labels from can and food packages to make a collage or collect them in a scrapbook. They won’t realize they are learning vocabulary, spelling, and sorting. • Play I Spy in French. Prepare for the game by printing on cards the French names for objects in a particular room of the house. Don’t know the name of an object? Let your child know it’s OK not to know something. Make a point of finding out the word you didn't know from an older sibling, a teacher, a visual dictionary or an English/French dictionary. • Have your child label as many objects in your house as possible. -
Lexicographic Interchange Between a Specialized and a General Language Dictionary1
Lexicographic interchange between a specialized and a general language dictionary1 Marie-Claude Demers, Ilan Kernerman & Marie-Claude L’Homme Keywords: general language dictionary, terminological database, specialized meaning, term, wordlist. Abstract One of the important issues lexicographers need to address concerns the desired coverage of a dictionary’s wordlist. This paper addresses the issue from a practical angle. We propose a method for comparing the contents of two resources and evaluating to what extent each can contribute to increase and improve the coverage of the other. Concretely, the project consists of comparing the contents of the English version of DiCoInfo (a dictionary of computing and Internet terms) with the appropriate entries of the Random House Webster’s College Dictionary (RHWCD). The entries missing in one resource are considered for inclusion in the other, and vice versa. The approach proves beneficial for both resources. Approximately 100 entries were added to DiCoInfo and over 500 lexical items or meanings are being included in the RHWCD. 1. Introduction An important issue lexicographers need to address concerns the coverage of a dictionary’s wordlist. The question is relevant from the points of view of general as well as specialized lexicography, although it leads to different answers in each area. Specialized dictionaries should include all items that are related to the field they aim to cover. General language dictionaries include many specialized lexical items but attempt to cover fields and terms of more general interest (Alonso Campo 2008; Béjoint 1988; Boulanger 1996; Josselin-Leray 2005; Svensén 2009; Wiegand 1999). This paper addresses the issue of coverage from a practical angle. -
DICTIONARY News Kernerman
Number 20 y July 2012 Kernerman kdictionaries.com/kdn DICTIONARY News What’s in a name? This is the twentieth issue of this newsletter, and our company name replacement seemed to be Kernerman, which by that time is now in its twentieth year. The first issue of the newsletter was had become well known in the dictionary world, but which published by Kernerman Publishing and Password Publishers I personally avoided — because that would be too close to in July 1994. It was entitled Password News, and had the goal the name of Kernerman Publishing, and because I considered of serving as a “forum for discussion about the semi-bilingual that carrying my surname was somewhat vain. On the other English dictionary.” The title was changed in the next issue hand, I liked the short form K as an abbreviation of Kernerman to Kernerman Dictionary News. Issues No. 2 and 3 appeared (and for 1,000), which was already in our logo, its anonymity, in 1995, and since then the newsletter has been published and the nice counterbalance it produced against the long word regularly in July each year. Over the years the scope of topics Dictionaries. Thus, K Dictionaries emerged — and may a has expanded, and the look and size have changed as well. thousand dictionaries bloom! Kernerman Publishing was established in 1969, as an The irony of fate, however, is that on numerous occasions independent ELT publishing house. Since the 1980s it has been we are referred to as Kernerman Dictionaries, most notably by conducted by my father, Ari (Lionel) Kernerman, who initiated dictionary professionals… Accepting that there is no escape the semi-bilingual dictionary, and became the from your name, and assuming the weight it leading English dictionary publisher in Israel implies, we began to introduce Kernerman to and renowned globally. -
The Art of Lexicography - Niladri Sekhar Dash
LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash THE ART OF LEXICOGRAPHY Niladri Sekhar Dash Linguistic Research Unit, Indian Statistical Institute, Kolkata, India Keywords: Lexicology, linguistics, grammar, encyclopedia, normative, reference, history, etymology, learner’s dictionary, electronic dictionary, planning, data collection, lexical extraction, lexical item, lexical selection, typology, headword, spelling, pronunciation, etymology, morphology, meaning, illustration, example, citation Contents 1. Introduction 2. Definition 3. The History of Lexicography 4. Lexicography and Allied Fields 4.1. Lexicology and Lexicography 4.2. Linguistics and Lexicography 4.3. Grammar and Lexicography 4.4. Encyclopedia and lexicography 5. Typological Classification of Dictionary 5.1. General Dictionary 5.2. Normative Dictionary 5.3. Referential or Descriptive Dictionary 5.4. Historical Dictionary 5.5. Etymological Dictionary 5.6. Dictionary of Loanwords 5.7. Encyclopedic Dictionary 5.8. Learner's Dictionary 5.9. Monolingual Dictionary 5.10. Special Dictionaries 6. Electronic Dictionary 7. Tasks for Dictionary Making 7.1. Panning 7.2. Data Collection 7.3. Extraction of lexical items 7.4. SelectionUNESCO of Lexical Items – EOLSS 7.5. Mode of Lexical Selection 8. Dictionary Making: General Dictionary 8.1. HeadwordsSAMPLE CHAPTERS 8.2. Spelling 8.3. Pronunciation 8.4. Etymology 8.5. Morphology and Grammar 8.6. Meaning 8.7. Illustrative Examples and Citations 9. Conclusion Acknowledgements ©Encyclopedia of Life Support Systems (EOLSS) LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash Glossary Bibliography Biographical Sketch Summary The art of dictionary making is as old as the field of linguistics. People started to cultivate this field from the very early age of our civilization, probably seven to eight hundred years before the Christian era.