81 REDEFINING the DICTIONARY.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Etytree: a Graphical and Interactive Etymology Dictionary Based on Wiktionary
Etytree: A Graphical and Interactive Etymology Dictionary Based on Wiktionary Ester Pantaleo Vito Walter Anelli Wikimedia Foundation grantee Politecnico di Bari Italy Italy [email protected] [email protected] Tommaso Di Noia Gilles Sérasset Politecnico di Bari Univ. Grenoble Alpes, CNRS Italy Grenoble INP, LIG, F-38000 Grenoble, France [email protected] [email protected] ABSTRACT a new method1 that parses Etymology, Derived terms, De- We present etytree (from etymology + family tree): a scendants sections, the namespace for Reconstructed Terms, new on-line multilingual tool to extract and visualize et- and the etymtree template in Wiktionary. ymological relationships between words from the English With etytree, a RDF (Resource Description Framework) Wiktionary. A first version of etytree is available at http: lexical database of etymological relationships collecting all //tools.wmflabs.org/etytree/. the extracted relationships and lexical data attached to lex- With etytree users can search a word and interactively emes has also been released. The database consists of triples explore etymologically related words (ancestors, descendants, or data entities composed of subject-predicate-object where cognates) in many languages using a graphical interface. a possible statement can be (for example) a triple with a lex- The data is synchronised with the English Wiktionary dump eme as subject, a lexeme as object, and\derivesFrom"or\et- at every new release, and can be queried via SPARQL from a ymologicallyEquivalentTo" as predicate. The RDF database Virtuoso endpoint. has been exposed via a SPARQL endpoint and can be queried Etytree is the first graphical etymology dictionary, which at http://etytree-virtuoso.wmflabs.org/sparql. -
Multilingual Ontology Acquisition from Multiple Mrds
Multilingual Ontology Acquisition from Multiple MRDs Eric Nichols♭, Francis Bond♮, Takaaki Tanaka♮, Sanae Fujita♮, Dan Flickinger ♯ ♭ Nara Inst. of Science and Technology ♮ NTT Communication Science Labs ♯ Stanford University Grad. School of Information Science Natural Language ResearchGroup CSLI Nara, Japan Keihanna, Japan Stanford, CA [email protected] {bond,takaaki,sanae}@cslab.kecl.ntt.co.jp [email protected] Abstract words of a language, let alone those words occur- ring in useful patterns (Amano and Kondo, 1999). In this paper, we outline the develop- Therefore it makes sense to also extract data from ment of a system that automatically con- machine readable dictionaries (MRDs). structs ontologies by extracting knowledge There is a great deal of work on the creation from dictionary definition sentences us- of ontologies from machine readable dictionaries ing Robust Minimal Recursion Semantics (a good summary is (Wilkes et al., 1996)), mainly (RMRS). Combining deep and shallow for English. Recently, there has also been inter- parsing resource through the common for- est in Japanese (Tokunaga et al., 2001; Nichols malism of RMRS allows us to extract on- et al., 2005). Most approaches use either a special- tological relations in greater quantity and ized parser or a set of regular expressions tuned quality than possible with any of the meth- to a particular dictionary, often with hundreds of ods independently. Using this method, rules. Agirre et al. (2000) extracted taxonomic we construct ontologies from two differ- relations from a Basque dictionary with high ac- ent Japanese lexicons and one English lex- curacy using Constraint Grammar together with icon. -
(STAAR®) Dictionary Policy
STAAR® State of Texas Assessments of Academic Readiness State of Texas Assessments of Academic Readiness (STAAR®) Dictionary Policy Dictionaries must be available to all students taking: STAAR grades 3–8 reading tests STAAR grades 4 and 7 writing tests, including revising and editing STAAR Spanish grades 3–5 reading tests STAAR Spanish grade 4 writing test, including revising and editing STAAR English I, English II, and English III tests The following types of dictionaries are allowable: standard monolingual dictionaries in English or the language most appropriate for the student dictionary/thesaurus combinations bilingual dictionaries* (word-to-word translations; no definitions or examples) E S L dictionaries* (definition of an English word using simplified English) sign language dictionaries picture dictionary Both paper and electronic dictionaries are permitted. However, electronic dictionaries that provide access to the Internet or have photographic capabilities are NOT allowed. For electronic dictionaries that are hand-held devices, test administrators must ensure that any features that allow note taking or uploading of files have been cleared of their contents both before and after the test administration. While students are working through the tests listed above, they must have access to a dictionary. Students should use the same type of dictionary they routinely use during classroom instruction and classroom testing to the extent allowable. The school may provide dictionaries, or students may bring them from home. Dictionaries may be provided in the language that is most appropriate for the student. However, the dictionary must be commercially produced. Teacher-made or student-made dictionaries are not allowed. The minimum schools need is one dictionary for every five students testing, but the state’s recommendation is one for every three students or, optimally, one for each student. -
Methods in Lexicography and Dictionary Research* Stefan J
Methods in Lexicography and Dictionary Research* Stefan J. Schierholz, Department Germanistik und Komparatistik, Friedrich-Alexander-Universität Erlangen-Nürnberg, Germany ([email protected]) Abstract: Methods are used in every stage of dictionary-making and in every scientific analysis which is carried out in the field of dictionary research. This article presents some general consid- erations on methods in philosophy of science, gives an overview of many methods used in linguis- tics, in lexicography, dictionary research as well as of the areas these methods are applied in. Keywords: SCIENTIFIC METHODS, LEXICOGRAPHICAL METHODS, THEORY, META- LEXICOGRAPHY, DICTIONARY RESEARCH, PRACTICAL LEXICOGRAPHY, LEXICO- GRAPHICAL PROCESS, SYSTEMATIC DICTIONARY RESEARCH, CRITICAL DICTIONARY RESEARCH, HISTORICAL DICTIONARY RESEARCH, RESEARCH ON DICTIONARY USE Opsomming: Metodes in leksikografie en woordeboeknavorsing. Metodes word gebruik in elke fase van woordeboekmaak en in elke wetenskaplike analise wat in die woor- deboeknavorsingsveld uitgevoer word. In hierdie artikel word algemene oorwegings vir metodes in wetenskapfilosofie voorgelê, 'n oorsig word gegee van baie metodes wat in die taalkunde, leksi- kografie en woordeboeknavorsing gebruik word asook van die areas waarin hierdie metodes toe- gepas word. Sleutelwoorde: WETENSKAPLIKE METODES, LEKSIKOGRAFIESE METODES, TEORIE, METALEKSIKOGRAFIE, WOORDEBOEKNAVORSING, PRAKTIESE LEKSIKOGRAFIE, LEKSI- KOGRAFIESE PROSES, SISTEMATIESE WOORDEBOEKNAVORSING, KRITIESE WOORDE- BOEKNAVORSING, HISTORIESE WOORDEBOEKNAVORSING, NAVORSING OP WOORDE- BOEKGEBRUIK 1. Introduction In dictionary production and in scientific work which is carried out in the field of dictionary research, methods are used to reach certain results. Currently there is no comprehensive and up-to-date documentation of these particular methods in English. The article of Mann and Schierholz published in Lexico- * This article is based on the article from Mann and Schierholz published in Lexicographica 30 (2014). -
User Contributions to Online Dictionaries Andrea Abel and Christian M
The dynamics outside the paper: User Contributions to Online Dictionaries Andrea Abel and Christian M. Meyer Electronic lexicography in the 21st century (eLex): Thinking outside the paper, Tallinn, Estonia, October 17–19, 2013. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 1 Introduction Online dictionaries rely increasingly on their users and leverage methods for facilitating user contributions at basically any step of the lexicographic process. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 2 Previous work . Mostly focused on specific type/dimension of user contribution . e.g., Carr (1997), Storrer (1998, 2010), Køhler Simonsen (2005), Fuertes-Olivera (2009), Melchior (2012), Lew (2011, 2013) . Ambiguous, partly overlapping terms: www.wordle.net/show/wrdl/7124863/User_contributions (02.10.2013) www.wordle.net/show/wrdl/7124863/User_contributions http:// 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 3 Research goals and contribution How to effectively plan the user contributions for new and established dictionaries? . Comprehensive and systematic classification is still missing . Mann (2010): analysis of 88 online dictionaries . User contributions roughly categorized as direct / indirect contributions and exchange with other dictionary users . Little detail given, since outside the scope of the analysis . We use this as a basis for our work We propose a novel classification of the different types of user contributions based on many practical examples 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. -
Online Dictionaries As Emergent Archives of Contemporary Usage and Collaborative Codification
Online Dictionaries as Emergent Archives of Contemporary Usage and Collaborative Codification Colleen Cotter Queen Mary, University of London John Damaso Arizona State University February 2007 Abstract Within the history of modern English lexicography, individual dictionary editors have had ultimate control over the selection, meaning, and illustration of words; extensive collaboration with contributors has been limited. However, Internet technologies that easily permit exchanges between a user and a database have allowed a new type of dictionary online, one that is built by the collaboration of contributing end-users, allowing ordinary users of dictionaries who are not trained lexicographers to engage in dictionary-making. We discuss a popular online slang dictionary called UrbanDictionary.com (UD) to illustrate how lexicographic principles are joined with Web-only communication technologies to provide a context for collaborative engagement and meaning-making; and to note the many characteristics and functions shared with the traditional print dictionary. Significantly, UD captures what most traditional English dictionaries fall short of: recording ephemeral quotidian spoken language and representing popular views of meaning. By relying on the users of language to select and define words for a dictionary, UD, which defines more than 1 million words, has in effect influenced access to and formulation of the lexis. Keywords computer-mediated communication, lexicography, slang, youth language; English Queen Mary’s OPAL #9 Occasional Papers Advancing Linguistics 1 Introduction English lexicography stems from a tradition of relatively limited functional collaboration, beginning with Samuel Johnson’s dictionary in 1755, in which editors overseeing numerous contributors held the ultimate authority over the selection, meaning, and illustration of words. -
Wiktionary Matcher
Wiktionary Matcher Jan Portisch1;2[0000−0001−5420−0663], Michael Hladik2[0000−0002−2204−3138], and Heiko Paulheim1[0000−0003−4386−8195] 1 Data and Web Science Group, University of Mannheim, Germany fjan, [email protected] 2 SAP SE Product Engineering Financial Services, Walldorf, Germany fjan.portisch, [email protected] Abstract. In this paper, we introduce Wiktionary Matcher, an ontology matching tool that exploits Wiktionary as external background knowl- edge source. Wiktionary is a large lexical knowledge resource that is collaboratively built online. Multiple current language versions of Wik- tionary are merged and used for monolingual ontology matching by ex- ploiting synonymy relations and for multilingual matching by exploiting the translations given in the resource. We show that Wiktionary can be used as external background knowledge source for the task of ontology matching with reasonable matching and runtime performance.3 Keywords: Ontology Matching · Ontology Alignment · External Re- sources · Background Knowledge · Wiktionary 1 Presentation of the System 1.1 State, Purpose, General Statement The Wiktionary Matcher is an element-level, label-based matcher which uses an online lexical resource, namely Wiktionary. The latter is "[a] collaborative project run by the Wikimedia Foundation to produce a free and complete dic- tionary in every language"4. The dictionary is organized similarly to Wikipedia: Everybody can contribute to the project and the content is reviewed in a com- munity process. Compared to WordNet [4], Wiktionary is significantly larger and also available in other languages than English. This matcher uses DBnary [15], an RDF version of Wiktionary that is publicly available5. The DBnary data set makes use of an extended LEMON model [11] to describe the data. -
A Study of Idiom Translation Strategies Between English and Chinese
ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 3, No. 9, pp. 1691-1697, September 2013 © 2013 ACADEMY PUBLISHER Manufactured in Finland. doi:10.4304/tpls.3.9.1691-1697 A Study of Idiom Translation Strategies between English and Chinese Lanchun Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Shuo Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Abstract—This paper, focusing on idiom translation methods and principles between English and Chinese, with the statement of different idiom definitions and the analysis of idiom characteristics and culture differences, studies the strategies on idiom translation, what kind of method should be used and what kind of principle should be followed as to get better idiom translations. Index Terms— idiom, translation, strategy, principle I. DEFINITIONS OF IDIOMS AND THEIR FUNCTIONS Idiom is a language in the formation of the unique and fixed expressions in the using process. As a language form, idioms has its own characteristic and patterns and used in high frequency whether in written language or oral language because idioms can convey a host of language and cultural information when people chat to each other. In some senses, idioms are the reflection of the environment, life, historical culture of the native speakers and are closely associated with their inner most spirit and feelings. They are commonly used in all types of languages, informal and formal. That is why the extent to which a person familiarizes himself with idioms is a mark of his or her command of language. Both English and Chinese are abundant in idioms. -
Introduction to Wordnet: an On-Line Lexical Database
Introduction to WordNet: An On-line Lexical Database George A. Miller, Richard Beckwith, Christiane Fellbaum, Derek Gross, and Katherine Miller (Revised August 1993) WordNet is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, and adjectives are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets. Standard alphabetical procedures for organizing lexical information put together words that are spelled alike and scatter words with similar or related meanings haphazardly through the list. Unfortunately, there is no obvious alternative, no other simple way for lexicographers to keep track of what has been done or for readers to ®nd the word they are looking for. But a frequent objection to this solution is that ®nding things on an alphabetical list can be tedious and time-consuming. Many people who would like to refer to a dictionary decide not to bother with it because ®nding the information would interrupt their work and break their train of thought. In this age of computers, however, there is an answer to that complaint. One obvious reason to resort to on-line dictionariesÐlexical databases that can be read by computersÐis that computers can search such alphabetical lists much faster than people can. A dictionary entry can be available as soon as the target word is selected or typed into the keyboard. Moreover, since dictionaries are printed from tapes that are read by computers, it is a relatively simple matter to convert those tapes into the appropriate kind of lexical database. -
Resources for Freelancers
Resources for Freelancers Organizations, websites and a list citation guide; online subscription available) ACES (blog, news, resources, jobs, and the Quick Check Editorial Reference Cards ACES forums): www.copydesk.org (from Copyediting.com) Copyediting Newsletter (Links, blog, jobs, Purdue’s Online Writing Lab (guides to resources, training): www.copyediting.com Chicago, MLA and APA style) Editorial Freelancers Association (Training, Garner’s Modern American Usage resources, jobs): www.the-efa.org Merriam-Webster’s Dictionary of English Usage Journalists’ Toolbox (Compendium of links The Gregg Reference Manual to useful sites) www.journaliststoolbox.org Grammar Girl’s Quick & Dirty Tips for Better Copyeditor’s Knowledge Base (Links to Writing resources for freelance copyeditors): http://www.kokedit.com/ckb.php Elephants of Style and Lapsing into a Comma (Bill Walsh) Common Errors in English Usage (Big list of quick-hit usage tips): OnlineStylebooks.com www.wsu.edu/~brians/errors/errors.html Grammar Girl’s Quick and Dirty Tips Dictionaries (transcripts of all the podcasts, more): http://www.quickanddirtytips.com Merriam-Webster Collegiate 11th (Chicago’s preferred; online free, on CD-ROM, and as a Grant Barrett’s searches (word-related sites, free app; Unabridged by subscription) Google books): http://www.copyediting.com/two-copy- Webster’s New World College (AP’s desk-power-searches preferred; also as an app) CE-L (mailing list for copyeditors): Sign up Oxford dictionaries (New Oxford American at http://www.copyediting-l.info/ free -
The Demands of Users and the Publishing World: Printed Or Online, Free Or Paid For?
The demands of users and the publishing world: printed or online, free or paid for? Hilary Nesi Accepted manuscript PDF deposited in Coventry University’s Repository Original citation: ‘The demands of users and the publishing world: printed or online, free or paid for?’, in The Oxford Handbook of Lexicography, ed. by Philip Durkin, pub 2015 (ISBN 9780199691630) Publisher: Oxford University Press Reproduced by permission of Oxford University Press - https://global.oup.com/academic/product/the-oxford-handbook-of-lexicography- 9780199691630?cc=gb&lang=en& Copyright © and Moral Rights are retained by the author(s) and/ or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This item cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder(s). The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders. 1 Nesi, H. (2015) The demands of users and the publishing world: printed or online, free or paid for? In: Durkin. P. (ed) The Oxford Handbook of Lexicography. Oxford: Oxford University Press pp. 579-589 Pre-publication version Abstract This chapter examines the extent of the digital migration of reference works from print to screen, and the effect this is having on dictionary publishers and dictionary users. It discusses the place of the human lexicographer, and possible new sources of e-dictionary revenue in the new ‘give-away’ internet environment. It also considers the automatic and collaborative generation of dictionary content, quality issues, and the needs and preferences of dictionary users around the world. -
Humping a Grouse Cobber Dymphna Lonergan
Humping a grouse cobber Dymphna Lonergan When I arrived in Adelaide in the 1970s, they were called cobbers. Those small, chocolate covered squares of hard caramel now go by the name mates. The word mate has nudged ahead of cobber in popularity, probably since the republican debate in the 1990s when mateship was widely discussed as a proposed term for inclusion in the Constitution of Australia. By the time of the republican debate, the word cobber had been in print for just over one hundred years. Appearing first in the Worker in Sydney in 1893, it points to a rural origin. The Australian National Dictionary suggests the British dialectal word cob, ‘to take a liking to’ as a probable origin of cobber, and the use of the word seems to support the need for friendship in the harsh environment of the Australian bush. If we no longer hear cobber used much, it may simply have outlived its usefulness because the social conditions that created it have changed (and because mate is shorter, and more suited to these downsized and core business times). On the other hand, the word cobber appears in a current online urbandictionary website with the claim that it is not an archaic word, but simply little understood as an equivalent to mate. Therefore, while cobber seems to have been superseded by mate as a more commercially viable term for chocolate-coated caramel squares, cobber still has currency for technology-focused young Australians The English Dialect Dictionary cautions that cob ‘to like’ is not widely known. It certainly seems curious that this apparently rare verb could evolve quickly into an Australian noun widely used in the 1890s.