A New Approach for Idiom Identification Using Meanings And

Total Page:16

File Type:pdf, Size:1020Kb

A New Approach for Idiom Identification Using Meanings And A New Approach for Idiom Identification Using Meanings and the Web∗ Rakesh Verma Vasanthi Vuppuluri Computer Science Dept. Computer Science Dept. University of Houston University of Houston Houston, TX, 77204, USA Houston, TX, 77204, USA [email protected] [email protected] Abstract Idioms play an important role in Natural Lan- There is a great deal of knowledge avail- guage Processing (NLP). They exist in almost all able on the Web, which represents a great languages and are hard to extract as there is no al- opportunity for automatic, intelligent text gorithm that can precisely outline the structure of processing and understanding, but the ma- an idiom. Idioms are important for natural lan- jor problems are finding the legitimate guage generation, parsing, and significantly influ- sources of information and the fact that ence machine translation and semantic tagging. search engines provide page statistics not Idioms could be also useful in document index- occurrences. This paper presents a new, ing, information retrieval, and in text summariza- domain independent, general-purpose id- tion or question-answering approaches that rely on iom identification approach. Our approach extracting key words or phrases from the docu- combines the knowledge of the Web with ment to be summarized, e.g., (Barrera and Verma, the knowledge extracted from dictionaries. 2011; Barrera and Verma, 2012; Barrera et al., This method can overcome the limitations 2011). Efficiently extracting idioms significantly of current techniques that rely on linguis- improves many areas of NLP. But most of the tic knowledge or statistics. It can recog- idiom extraction techniques are biased in a way nize idioms even when the complete sen- that they focus on a specific domain or make use tence is not present, and without the need of statistical techniques alone, which results in for domain knowledge. It is currently de- poor performance. The technique in this paper signed to work with text in English but can makes use of knowledge from the Web combined be extended to other languages. with knowledge from dictionaries in deciding if a phrase is a idiom rather than solely depending on 1 Introduction frequency measures or following rules of a spe- Automatically extracting phrases from the doc- cific domain. The Web has been attractive to NLP uments, be they structured, un-structured or researchers because it can solve the sparsity is- semistructured has always been an important yet sue and also its update latency is lower than for challenging task. The overall goal is to create a dictionaries, but its disadvantages are noise, lack easily machine-readable text to process the sen- of a good method for finding reliable sources and tences. In this paper we focus on identifying id- the coarseness of page statistics. Dictionaries are ioms from text. An idiom is a phrase made up of more reliable but they have higher update latency. a sequence of two or more words that has prop- Our work tries to minimize the disadvantages and erties that are not predictable from the properties maximize the advantages when combining these of the individual words or their normal mode of resources. combination. Recognition of idioms is a challeng- 1.1 Contribution ing problem with wide applications. Some exam- ples of idioms are ‘yellow journalism,’ ‘kick the This paper proposes a new idiom identification bucket,’ and ‘quick fix’. For example, the mean- technique, which is general, domain independent ing of ‘yellow journalism’ cannot be derived from and unsupervised in the sense that it requires no the meanings of ‘yellow’ and ‘journalism.’ labeled datasets of idioms. The major problem Research∗ supported in part by NSF grants CNS with existing approaches is that most of them 1319212, DUE 1241772 and DGE1433817 are supervised, requiring manually annotated data, and many of them impose syntactic restrictions, of verb-noun constructions, prepositional phrases, e.g., verb-particle, noun-verb, etc. Our tech- and subordinate clauses in (Laura et al., 2010). nique makes use of carefully extracted reliable knowledge from the Web and dictionaries. More- To our knowledge, there are only a few gen- over, our technique can be extended to languages eral approaches for idiom identification in the other than English, provided similar resources are phrase classification stream (Muzny and Zettle- available. Although our approach uses meanings, moyer, 2013); (Feldman and Peng, 2013) and with the advancement of the web, more and more most of the techniques are supervised. A super- phrase definitions are becoming available on the vised technique for automatically identifying id- web and thus the reliance on dictionaries can be iomatic dictionary entries with the help of online reduced or even eliminated. However, in many resources like Wiktionary is discussed in (Muzny cases, even though the definition of a phrase may and Zettlemoyer, 2013). There are three lexical be available, the phrase itself is not necessarily la- features and five graph-based features in this tech- beled as an idiom so we cannot just do a simple nique, which model whether phrase meanings are lookup of a phrase and mark it as an idiom. constructed compositionally. The dataset consists The rest of the paper is organized as follows. of phrases, definitions, and example sentences Section 2 presents previous work on idiom extrac- from the English-language Wiktionary dump from tion and classification. In Section 3 we present our November 13th, 2012. The lexical and graph- approach in detail. Section 4 presents the datasets based features when used together yield F-scores and in Section 5 we present the experiments and of 40.1% and 62.0% when tested on the same comparisons. We conclude in Section 6. dataset, once without annotating the idiom la- bels and once after providing the annotated labels. 2 Related Work This approach when combined with the Lesk word sense disambiguation algorithm and a Wiktionary There is considerable work on extracting multi- label default rule, yields an F-score of 83.8%. word expressions (MWEs), a superclass of idioms, e.g., (Zhang et al., 2006); (Villavicencio et al., An unsupervised idiom extraction technique us- 2007); (Li et al., 2008); (Spence et al., 2013); ing Principal Component Analysis (PCA) treat- (Ramisch, 2014); (Marie and Constant., 2014); ing idioms as semantic outliers and a supervised (Schneider et al., 2014); (Kordoni and Simova, technique based on Linear Discriminant Analy- 2014); (Yulia and Wintner, 2014). We do not cover sis (LDA) was described by (Feldman and Peng, this work here since our focus is on idioms. 2013). The idea of treating idioms as outliers Because of its importance, several researchers was tested on 99 sentences extracted from the have investigated idiom identification. As men- British National Corpus (BNC) social science tioned in (Muzny and Zettlemoyer, 2013), prior (non-fiction) section, containing 12 idioms, 22 work on this topic can be categorized into two dead metaphors and 2 living metaphors. The idea streams: phrase classification in which a phrase of idiom detection based on LDA was tested on is always idiomatic or literal, e.g., (Gedigian et 2,984 Verb-Noun Combination (VNC) tokens ex- al., 2006); (Shutova et al., 2010), or token clas- tracted from BNC described in (Fazly et al., 2009). sification in which each occurrence of a phrase is These 2,984 tokens are translated into 2,550 sen- classified as either idiomatic or literal, e.g., (Birke tences of which 2,013 are idiomatic sentences and et al., 2006); (Katz and Eugenie, 2006); (Li and 537 are literal sentences. A variety of results were Sporleder, 2009); (Fabienne et al., 2010); (Caro- presented for PCA for different false positive rates line et al., 2010); (Peng et al., 2014). Most work ranging from 1 to 10% (one Table with rates of 16- on the phrase classification stream imposes syn- 20%). For idioms only, the detection rates range tactic restrictions. Verb/Noun restriction is im- from 44% at 1% false positive rate to 89% at 10% posed in (Fazly et al., 2009) and (Diab and Pravin, false positive rate. 2009); subject/verb and verb/direct-object restric- tions are imposed in (Shutova et al., 2010) and Some of the work in the token classification verb-particle restriction is imposed in (Ramisch stream, e.g., (Peng et al., 2014), relies on a list of et al., 2008). Portions of the American Na- potentially idiomatic expressions. Such a list can tional Corpus were tagged for idioms composed be generated using our technique. 3 Idiom Extraction Model RDW 2 = fRD21; RD22; RD23; :::; RD2mg, and so on. We now present the details of our approach Now, each of the word in the original phrase is for extracting idioms, which is implemented in replaced with its definitions which results in a set Python and called IdiomExtractor. We focus on of new phrases P as follows: the meaning of the word idiom, i.e., “properties P = fRD RD :::RD ; RD RD :::RD of individual words in a phrase differ from the 11 12 j1 12 21 j1 ; RD1nRD2m:::RD g properties of the phrase in itself.” Hence, we jl To avoid any confusion regarding how the proce- look at what individual words in a phrase mean dure is implemented an example is provided be- and what the phrase means as a whole. If the low. meaning of phrase is different from what the individual words in the phrase try to convey then 3.3 Subtraction by definition of the word idiom, that phrase is a idiom. Each of the phrases present in P is subtracted from each of the recreated definition in RDp and the Steps involved in the process of idiom extraction result is stored in set S. are as follows: 3.4 Idiom Result 3.1 Definition Extraction There are two options the user can choose in de- This step is the most important step in determin- ciding if a phrase is a idiom.
Recommended publications
  • Dictionary Users Do Look up Frequent Words. a Logfile Analysis
    Erschienen in: Müller-Spitzer, Carolin (Hrsg.): Using Online Dictionaries. Berlin/ Boston: de Gruyter, 2014. (Lexicographica: Series Maior 145), S. 229-249. Alexander Koplenig, Peter Meyer, Carolin Müller-Spitzer Dictionary users do look up frequent words. A logfile analysis Abstract: In this paper, we use the 2012 log files of two German online dictionaries (Digital Dictionary of the German Language1 and the German Version of Wiktionary) and the 100,000 most frequent words in the Mannheim German Reference Corpus from 2009 to answer the question of whether dictionary users really do look up fre- quent words, first asked by de Schryver et al. (2006). By using an approach to the comparison of log files and corpus data which is completely different from that of the aforementioned authors, we provide empirical evidence that indicates - contra - ry to the results of de Schryver et al. and Verlinde/Binon (2010) - that the corpus frequency of a word can indeed be an important factor in determining what online dictionary users look up. Finally, we incorporate word dass Information readily available in Wiktionary into our analysis to improve our results considerably. Keywords: log file, frequency, corpus, headword list, monolingual dictionary, multi- lingual dictionary Alexander Koplenig: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 435, [email protected] Peter Meyer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581-427, [email protected] Carolin Müller-Spitzer: Institut für Deutsche Sprache, R 5, 6-13, 68161 Mannheim, +49-(0)621-1581- 429, [email protected] Introduction We would like to Start this chapter by asking one of the most fundamental questions for any general lexicographical endeavour to describe the words of one (or more) language(s): which words should be included in a dictionary? At first glance, the answer seems rather simple (especially when the primary objective is to describe a language as completely as possible): it would be best to include every word in the dictionary.
    [Show full text]
  • Etytree: a Graphical and Interactive Etymology Dictionary Based on Wiktionary
    Etytree: A Graphical and Interactive Etymology Dictionary Based on Wiktionary Ester Pantaleo Vito Walter Anelli Wikimedia Foundation grantee Politecnico di Bari Italy Italy [email protected] [email protected] Tommaso Di Noia Gilles Sérasset Politecnico di Bari Univ. Grenoble Alpes, CNRS Italy Grenoble INP, LIG, F-38000 Grenoble, France [email protected] [email protected] ABSTRACT a new method1 that parses Etymology, Derived terms, De- We present etytree (from etymology + family tree): a scendants sections, the namespace for Reconstructed Terms, new on-line multilingual tool to extract and visualize et- and the etymtree template in Wiktionary. ymological relationships between words from the English With etytree, a RDF (Resource Description Framework) Wiktionary. A first version of etytree is available at http: lexical database of etymological relationships collecting all //tools.wmflabs.org/etytree/. the extracted relationships and lexical data attached to lex- With etytree users can search a word and interactively emes has also been released. The database consists of triples explore etymologically related words (ancestors, descendants, or data entities composed of subject-predicate-object where cognates) in many languages using a graphical interface. a possible statement can be (for example) a triple with a lex- The data is synchronised with the English Wiktionary dump eme as subject, a lexeme as object, and\derivesFrom"or\et- at every new release, and can be queried via SPARQL from a ymologicallyEquivalentTo" as predicate. The RDF database Virtuoso endpoint. has been exposed via a SPARQL endpoint and can be queried Etytree is the first graphical etymology dictionary, which at http://etytree-virtuoso.wmflabs.org/sparql.
    [Show full text]
  • User Contributions to Online Dictionaries Andrea Abel and Christian M
    The dynamics outside the paper: User Contributions to Online Dictionaries Andrea Abel and Christian M. Meyer Electronic lexicography in the 21st century (eLex): Thinking outside the paper, Tallinn, Estonia, October 17–19, 2013. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 1 Introduction Online dictionaries rely increasingly on their users and leverage methods for facilitating user contributions at basically any step of the lexicographic process. 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 2 Previous work . Mostly focused on specific type/dimension of user contribution . e.g., Carr (1997), Storrer (1998, 2010), Køhler Simonsen (2005), Fuertes-Olivera (2009), Melchior (2012), Lew (2011, 2013) . Ambiguous, partly overlapping terms: www.wordle.net/show/wrdl/7124863/User_contributions (02.10.2013) www.wordle.net/show/wrdl/7124863/User_contributions http:// 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M. Meyer | 3 Research goals and contribution How to effectively plan the user contributions for new and established dictionaries? . Comprehensive and systematic classification is still missing . Mann (2010): analysis of 88 online dictionaries . User contributions roughly categorized as direct / indirect contributions and exchange with other dictionary users . Little detail given, since outside the scope of the analysis . We use this as a basis for our work We propose a novel classification of the different types of user contributions based on many practical examples 18.10.2013 | European Academy of Bozen/Bolzano and Technische Universität Darmstadt | Andrea Abel and Christian M.
    [Show full text]
  • Wiktionary Matcher
    Wiktionary Matcher Jan Portisch1;2[0000−0001−5420−0663], Michael Hladik2[0000−0002−2204−3138], and Heiko Paulheim1[0000−0003−4386−8195] 1 Data and Web Science Group, University of Mannheim, Germany fjan, [email protected] 2 SAP SE Product Engineering Financial Services, Walldorf, Germany fjan.portisch, [email protected] Abstract. In this paper, we introduce Wiktionary Matcher, an ontology matching tool that exploits Wiktionary as external background knowl- edge source. Wiktionary is a large lexical knowledge resource that is collaboratively built online. Multiple current language versions of Wik- tionary are merged and used for monolingual ontology matching by ex- ploiting synonymy relations and for multilingual matching by exploiting the translations given in the resource. We show that Wiktionary can be used as external background knowledge source for the task of ontology matching with reasonable matching and runtime performance.3 Keywords: Ontology Matching · Ontology Alignment · External Re- sources · Background Knowledge · Wiktionary 1 Presentation of the System 1.1 State, Purpose, General Statement The Wiktionary Matcher is an element-level, label-based matcher which uses an online lexical resource, namely Wiktionary. The latter is "[a] collaborative project run by the Wikimedia Foundation to produce a free and complete dic- tionary in every language"4. The dictionary is organized similarly to Wikipedia: Everybody can contribute to the project and the content is reviewed in a com- munity process. Compared to WordNet [4], Wiktionary is significantly larger and also available in other languages than English. This matcher uses DBnary [15], an RDF version of Wiktionary that is publicly available5. The DBnary data set makes use of an extended LEMON model [11] to describe the data.
    [Show full text]
  • A Study of Idiom Translation Strategies Between English and Chinese
    ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 3, No. 9, pp. 1691-1697, September 2013 © 2013 ACADEMY PUBLISHER Manufactured in Finland. doi:10.4304/tpls.3.9.1691-1697 A Study of Idiom Translation Strategies between English and Chinese Lanchun Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Shuo Wang School of Foreign Languages, Qiongzhou University, Sanya 572022, China Abstract—This paper, focusing on idiom translation methods and principles between English and Chinese, with the statement of different idiom definitions and the analysis of idiom characteristics and culture differences, studies the strategies on idiom translation, what kind of method should be used and what kind of principle should be followed as to get better idiom translations. Index Terms— idiom, translation, strategy, principle I. DEFINITIONS OF IDIOMS AND THEIR FUNCTIONS Idiom is a language in the formation of the unique and fixed expressions in the using process. As a language form, idioms has its own characteristic and patterns and used in high frequency whether in written language or oral language because idioms can convey a host of language and cultural information when people chat to each other. In some senses, idioms are the reflection of the environment, life, historical culture of the native speakers and are closely associated with their inner most spirit and feelings. They are commonly used in all types of languages, informal and formal. That is why the extent to which a person familiarizes himself with idioms is a mark of his or her command of language. Both English and Chinese are abundant in idioms.
    [Show full text]
  • Experiments in Idiom Recognition
    Experiments in Idiom Recognition Jing Peng and Anna Feldman Department of Computer Science Department of Linguistics Montclair State University Montclair, New Jersey, USA 07043 Abstract Some expressions can be ambiguous between idiomatic and literal interpretations depending on the context they occur in, e.g., sales hit the roof vs. hit the roof of the car. We present a novel method of classifying whether a given instance is literal or idiomatic, focusing on verb-noun constructions. We report state-of-the-art results on this task using an approach based on the hypothesis that the distributions of the contexts of the idiomatic phrases will be different from the contexts of the literal usages. We measure contexts by using projections of the words into vector space. For comparison, we implement Fazly et al. (2009)’s, Sporleder and Li (2009)’s, and Li and Sporleder (2010b)’s methods and apply them to our data. We provide experimental results validating the proposed techniques. 1 Introduction Researchers have been investigating idioms and their properties for many years. According to traditional approaches, an idiom is — in its simplest form— a string of two or more words for which meaning is not derived from the meanings of the individual words comprising that string (Swinney and Cutler, 1979). As such, the meaning of kick the bucket (‘die’) cannot be obtained by breaking down the idiom and an- alyzing the meanings of its constituent parts, to kick and the bucket. In addition to being influenced by the principle of compositionality, the traditional approaches are also influenced by theories of generative grammar (Flores, 1993; Langlotz, 2006) The properties that traditional approaches attribute to idiomatic expressions are also the properties that make them difficult for generative grammars to describe.
    [Show full text]
  • Extracting an Etymological Database from Wiktionary Benoît Sagot
    Extracting an Etymological Database from Wiktionary Benoît Sagot To cite this version: Benoît Sagot. Extracting an Etymological Database from Wiktionary. Electronic Lexicography in the 21st century (eLex 2017), Sep 2017, Leiden, Netherlands. pp.716-728. hal-01592061 HAL Id: hal-01592061 https://hal.inria.fr/hal-01592061 Submitted on 22 Sep 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Extracting an Etymological Database from Wiktionary Benoît Sagot Inria 2 rue Simone Iff, 75012 Paris, France E-mail: [email protected] Abstract Electronic lexical resources almost never contain etymological information. The availability of such information, if properly formalised, could open up the possibility of developing automatic tools targeted towards historical and comparative linguistics, as well as significantly improving the automatic processing of ancient languages. We describe here the process we implemented for extracting etymological data from the etymological notices found in Wiktionary. We have produced a multilingual database of nearly one million lexemes and a database of more than half a million etymological relations between lexemes. Keywords: Lexical resource development; etymology; Wiktionary 1. Introduction Electronic lexical resources used in the fields of natural language processing and com- putational linguistics are almost exclusively synchronic resources; they mostly include information about inflectional, derivational, syntactic, semantic or even pragmatic prop- erties of their entries.
    [Show full text]
  • Idioms-And-Expressions.Pdf
    Idioms and Expressions by David Holmes A method for learning and remembering idioms and expressions I wrote this model as a teaching device during the time I was working in Bangkok, Thai- land, as a legal editor and language consultant, with one of the Big Four Legal and Tax companies, KPMG (during my afternoon job) after teaching at the university. When I had no legal documents to edit and no individual advising to do (which was quite frequently) I would sit at my desk, (like some old character out of a Charles Dickens’ novel) and prepare language materials to be used for helping professionals who had learned English as a second language—for even up to fifteen years in school—but who were still unable to follow a movie in English, understand the World News on TV, or converse in a colloquial style, because they’d never had a chance to hear and learn com- mon, everyday expressions such as, “It’s a done deal!” or “Drop whatever you’re doing.” Because misunderstandings of such idioms and expressions frequently caused miscom- munication between our management teams and foreign clients, I was asked to try to as- sist. I am happy to be able to share the materials that follow, such as they are, in the hope that they may be of some use and benefit to others. The simple teaching device I used was three-fold: 1. Make a note of an idiom/expression 2. Define and explain it in understandable words (including synonyms.) 3. Give at least three sample sentences to illustrate how the expression is used in context.
    [Show full text]
  • Wiktionary Matcher Results for OAEI 2020
    Wiktionary Matcher Results for OAEI 2020 Jan Portisch1;2[0000−0001−5420−0663] and Heiko Paulheim1[0000−0003−4386−8195] 1 Data and Web Science Group, University of Mannheim, Germany fjan, [email protected] 2 SAP SE Product Engineering Financial Services, Walldorf, Germany [email protected] Abstract. This paper presents the results of the Wiktionary Matcher in the Ontology Alignment Evaluation Initiative (OAEI) 2020. Wiktionary Matcher is an ontology matching tool that exploits Wiktionary as exter- nal background knowledge source. Wiktionary is a large lexical knowl- edge resource that is collaboratively built online. Multiple current lan- guage versions of Wiktionary are merged and used for monolingual on- tology matching by exploiting synonymy relations and for multilingual matching by exploiting the translations given in the resource. This is the second OAEI participation of the matching system. Wiktionary Matcher has been improved and is the best performing system on the knowledge graph track this year.3 Keywords: Ontology Matching · Ontology Alignment · External Re- sources · Background Knowledge · Wiktionary 1 Presentation of the System 1.1 State, Purpose, General Statement The Wiktionary Matcher is an element-level, label-based matcher which uses an online lexical resource, namely Wiktionary. The latter is "[a] collaborative project run by the Wikimedia Foundation to produce a free and complete dic- tionary in every language"4. The dictionary is organized similarly to Wikipedia: Everybody can contribute to the project and the content is reviewed in a com- munity process. Compared to WordNet [2], Wiktionary is significantly larger and also available in other languages than English. This matcher uses DBnary [13], an RDF version of Wiktionary that is publicly available5.
    [Show full text]
  • Draft 5 for Printing
    Jan Milí č of Kroměř íž and Emperor Charles IV: Preaching, Power, and the Church of Prague Eleanor Janega UCL Thesis Submitted for the degree of PhD in History 1 I, Eleanor Janega, confirm that the work presented in this thesis is my own. Where information has been derived from other sources, I confirm that this has been indicated in my thesis. 2 Abstract During the second half of the fourteenth century Jan Milí č of Krom ěř íž became an active and popular preacher in Prague. The sermons which he delivered focused primarily on themes of reform, and called for a renewal within the church. Despite a sustained popularity with the lay populace of Prague, Milí č faced opposition to his practice from many individual members of the city’s clergy. Eventually he was the subject of twelve articles of accusation sent to the papal court of Avignon. Because of the hostility which Milí č faced, historians have most often written of him as a precursor to the Hussites. As a result he has been identified as an anti-establishment rabble-rouser and it has been assumed that he conducted his career in opposition to the court of the Emperor Charles IV. This thesis, over four body chapters, examines the careers of both Milí č and Charles and argues that instead of being enemies, the two men shared an amicable relationship. The first chapter examines Milí č’s career and will prove that he was well-connected to Charles and several members of his court. It will also examine the most common reasons given to argue that Charles and Milí č were at odds, and disprove them.
    [Show full text]
  • S Wiktionary Wikisource Wikibooks Wikiquote Wikimedia Commons
    SCHWESTERPROJEKTE Wiktionary S Das Wiktionary ist der lexikalische Partner der freien Enzyklopädie Wikipedia: ein Projekt zur Erstellung freier Wörterbücher und Thesau- ri. Während die Wikipedia inhaltliche Konzepte beschreibt, geht es in ihrem ältesten Schwester- projekt, dem 2002 gegründeten Wiktionary um Wörter, ihre Grammatik und Etymologie, Homo- nyme und Synonyme und Übersetzungen. Wikisource Wikisource ist eine Sammlung von Texten, die entweder urheberrechtsfrei sind oder unter ei- ner freien Lizenz stehen. Das Projekt wurde am 24. November 2003 gestartet. Der Wiktionary-EIntrag zum Wort Schnee: Das Wörterbuch präsen- Zunächst mehrsprachig auf einer gemeinsamen tiert Bedeutung, Deklination, Synonyme und Übersetzungen. Plattform angelegt, wurde es später in einzel- ne Sprachversionen aufgesplittet. Das deutsche Teilprojekt zählte im März 2006 über 2000 Texte Wikisource-Mitarbeiter arbeiten an einer digitalen, korrekturge- und über 100 registrierte Benutzer. lesenen und annotierten Ausgabe der Zimmerischen Chronik. Wikibooks Das im Juli 2003 aus der Taufe gehobene Projekt Wikibooks dient der gemeinschaftlichen Schaf- fung freier Lehrmaterialien – vom Schulbuch über den Sprachkurs bis zum praktischen Klet- terhandbuch oder der Go-Spielanleitung Wikiquote Wikiquote zielt darauf ab, auf Wiki-Basis ein freies Kompendium von Zitaten und Das Wikibooks-Handbuch Go enthält eine ausführliche Spielanleitung Sprichwörtern in jeder Sprache zu schaffen. Die des japanischen Strategiespiels. Artikel über Zitate bieten (soweit bekannt) eine Quellenangabe und werden gegebenenfalls in die deutsche Sprache übersetzt. Für zusätzliche Das Wikimedia-Projekt Wikiquote sammelt Sprichwörter und Informationen sorgen Links in die Wikipedia. Zitate, hier die Seite zum Schauspieler Woody Allen Wikimedia Commons Wikimedia Commons wurde im September 2004 zur zentralen Aufbewahrung von Multime- dia-Material – Bilder, Videos, Musik – für alle Wi- kimedia-Projekte gegründet.
    [Show full text]
  • Thank You - Wiktionary 4/24/11 9:56 PM Thank You
    thank you - Wiktionary 4/24/11 9:56 PM thank you Definition from Wiktionary, the free dictionary See also thankyou and thank-you Contents 1 English 1.1 Etymology 1.2 Pronunciation 1.3 Interjection 1.3.1 Synonyms 1.3.2 Translations 1.4 Noun 1.5 References 1.6 See also English Etymology Thank you is a shortened expression for I thank you; it is attested since c. 1400.[1] Pronunciation Audio (UK) (file) Interjection thank you 1. An expression of gratitude or politeness, in response to something done or given. http://en.wiktionary.org/wiki/thank_you Page 1 of 5 thank you - Wiktionary 4/24/11 9:56 PM Synonyms cheers (informal), thanks, thanks very much, thank you very much, thanks a lot, ta (UK, Australia), thanks a bunch (informal), thanks a million (informal), much obliged Translations an expression of gratitude [hide ▲] Select targeted languages ﺳﻮﭘﺎﺱ ,Afar: gadda ge Kurdish: spas Afrikaans: dankie, baie dankie Ladin: giulan Albanian: faleminderit, ju falem nderit Lao: (kööp cai) Gheg: falimineres Latin: benignē dīcis (la), tibi gratiās agō (la) Latvian: paldies (lv) Aleut: qaĝaasakuq (Atkan), qaĝaalakux̂ Lithuanian: ačiū (lt) (Eastern) Luo: erokamano Alutiiq: quyanaa Macedonian: благодарам (mk) (blagódaram) American Sign Language: OpenB@Chin- (formal), фала (mk) (fála) (informal) PalmBack OpenB@FromChin-PalmUp Malagasy: misaotra (mg) Amharic: (am) (amesegenallo) Malay: terima kasih (ms) (ar) (mersii) Malayalam: (ml) (nandhi) ﻣﻴﺮﺳﻲ ,(ar) (shúkran) ﺷﻜﺮًﺍ :Arabic (informal) Maltese: grazzi (mt) Armenian: Maori: kia ora (mi) շնորհակալություն
    [Show full text]