Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging

Total Page:16

File Type:pdf, Size:1020Kb

Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging Persian Ezafe Recognition Using Transformers and Its Role in Part-Of-Speech Tagging Ehsan Doostmohammadi|, Minoo Nassajian|, Adel Rahimi♠ |Sharif University of Technology, Tehran, Iran ♠Dathena Science Pte. Ltd., Singapore fe.doostm72,[email protected], [email protected] Abstract This affix is always pronounced but almost al- ways not written, which results in a high degree Ezafe is a grammatical particle in some Iranian of ambiguity in reading and understanding Persian languages that links two words together. Re- texts. It is hence considered as one of the most gardless of the important information it con- veys, it is almost always not indicated in Per- interesting issues in Persian linguistics, and it has sian script, resulting in mistakes in reading been discussed in details from phonological as- complex sentences and errors in natural lan- pects (Ghomeshi, 1997), morphological aspects guage processing tasks. In this paper, we (Samvelian, 2006, 2007) and (Karimi and Brame, experiment with different machine learning 2012), and syntactic aspects (Samiian, 1994; Lar- methods to achieve state-of-the-art results in son and Yamakido, 2008; Kahnemuyipour, 2006, the task of ezafe recognition. Transformer- 2014, 2016). based methods, BERT and XLMRoBERTa, achieve the best results, the latter achieving Nearly 22% of the Persian words have ezafe (Bi- 2.68% F1-score more than the previous state- jankhan et al., 2011), which shows the prevalence of-the-art. We, moreover, use ezafe informa- of this marker. Moreover, this construction also ap- tion to improve Persian part-of-speech tagging pears in other languages such as Hawramani (Holm- results and show that such information will not berg and Odden, 2005), Zazaki (Larson and Ya- be useful to transformer-based methods and ex- makido, 2006; Toosarvandani and van Urk, 2014), plain why that might be the case. Kurdish (Karimi, 2007) etc. Ezafe construction 1 Introduction is also similar to idafa construction in Arabic and construct state in Hebrew (Habash, 2010; Karimi Persian ezafe is an unstressed morpheme that ap- and Brame, 2012) and Zulu (Jones, 2018). pears on the end of the words, as -e after conso- Ezafe recognition is the task of automatically la- nants and as -ye1 after vowels. This syntactic phe- beling the words ending with ezafe, which is crucial nomenon links a head noun, head pronoun, head for some tasks such as speech synthesis (Sheikhan adjective, head preposition, or head adverb to their et al., 1997; Bahaadini et al., 2011), as ezafe is al- modifiers in a constituent called ‘ezafe construc- ways pronounced, but rarely written. Furthermore, tion’ (Nassajian et al., 2019). Whether a word in a as recognizing the positions of this marker in sen- sentence receives or does not receive ezafe might tences helps determine phrase boundaries, it highly affect that sentence’s semantic and syntactic struc- facilitates other natural language processing (NLP) tures, as demonstrated in Examples 1a and 1b in tasks, such as tokenization (Ghayoomi and Mom- Figure1. There are some constructions in English tazi, 2009), syntactic parsing (Sagot and Walther, that can be translated by ezafe construction in Per- 2010; Nourian et al., 2015), part-of-speech (POS) sian. For instance, English ‘of’ has the same role tagging (Hosseini Pozveh et al., 2016), and ma- as Persian ezafe to show the part-whole relation, chine translation (Amtrup et al., 2000). the relationship of possession, or ‘’s’ construction, In this paper, we experiment with different meth- and possessive pronouns followed by nouns show- ods to achieve state-of-the-art results in the task of ing genitive cases are mirrored by Persian ezafe ezafe recognition. We then use the best of these (Karimi and Brame, 2012). methods to improve the results for the task of POS 1The y is called an intrusive y and is an excrescence be- tagging. After establishing a baseline for this task, tween two vowels for the ease of pronunciation. we provide the ezafe information to the POS tag- 961 Findings of the Association for Computational Linguistics: EMNLP 2020, pages 961–971 November 16 - 20, 2020. c 2020 Association for Computational Linguistics prep nsubj pobj advmod dobj-lvc root (1) a. Darpey-e ettefaq¯ at-e¯ diruz ’u ’este’fa¯ dad¯ . following-ez.2 events-ez. yesterday he/she resign did . P N ADV PRO N V DELM “Following the yesterday events, he/she resigned.” prep advmod nsubj pobj dobj-lvc root b. Darpey-e ettefaq¯ at¯ diruz ’u ’este’fa¯ dad¯ . following-ez. events yesterday he/she resign did . P N ADV PRO N V DELM “Following the events, he/she resigned yesterday.” Figure 1: An example of the role of ezafe in the syntactic and semantic structures. ging model once in the input text and the other time Megerdoomian et al.(2000) use a rule-based as an auxiliary task in a multi-task setting, to see method to design a Persian morphological analyzer. the difference in the results. The contributions of They define an ezafe feature to indicate the pres- this paper are (1) improving the state-of-the-art re- ence or absence of ezafe for each word based on sults in both of ezafe recognition and POS tagging the following words in a sentence. Another work is tasks, (2) analyzing the results of ezafe recognition Muller¨ and Ghayoomi(2010) that considers ezafe task to pave the way for further enhancement in the as a part of implemented head-driven phrase struc- future work, (3) improving POS tagging results in ture grammar (HPSG) to formalize Persian syntax some of the methods by providing ezafe informa- and determine phrase boundaries. In addition, No- tion and explaining why transformer-based models joumian(2011) designs a Persian lexical diacritizer might not benefit from such information. The code to insert short vowels within words in sentences us- for our experiments is available on this project’s ing finite-state transducers (FST) to disambiguate GitHub repository 3. words phonologically and semantically. They use After reviewing the previous work of both tasks a rule-based method to insert ezafe based on the in Section2, we introduce our methodology in context and the POS tags of the previous words. Section3 and data in Section4. We then discuss As for the statistical approach, Koochari et al. ezafe recognition and POS tagging tasks and their (2006) employ classification and regression trees results in Sections5 and6, respectively. (CART) to predict the absence or presence of ezafe marker. They use features such as Persian mor- 2 Previous Work phosyntactic characteristics, the POS tags of the current word, two words before, and three words af- 2.1 Ezafe Recognition ter the current word to train the model. Their train In the field of NLP, a few studies have been car- set contains approximately 70,000 words, and the ried out on Persian ezafe recognition, including test corpus consists of 30,382 words. To evaluate rule-based methods, statistical methods, and hybrid the performance of the model, they use Kappa fac- methods. Most of the previous work on the task tor, and they report 98.25% accuracy in the case of ezafe rely on long lists of hand-crafted rules and fail to non- words and 88.85% in the case of words ezafe achieve high performance on the task. with . As another research, we can mention Asghari et al.(2014) that employs maximum en- 2Ezafe. tropy (ME) and conditional random fields (CRF) 3https://github.com/edoost/pert methods. They use the 10 million word Bijankhan 962 corpus (Bijankhan et al., 2011) and report an ac- lion word Bijankhan corpus, ME and CRF with curacy of 97.21% for the ME tagger and 97.83% different window sizes, the best results of which for the CRF model with a window of size 5. They are 95% for both models with a window size of 5. also utilize five Persian specific features in a hy- brid setting with the models to achieve the highest 3 Methodology accuracy of 98.04% with CRF. We see both ezafe recognition and POS tagging Isapour et al.(2008) propose a hybrid method as sequence labeling problems, i.e., mapping each to determine ezafe positions using probabilistic input word to the corresponding class space of the context-free grammar (PCFG) and then consider task. For the ezafe recognition task, the class space the relations between the heads and their modi- size is two, 0 for words without and 1 for words fiers. The obtained accuracy is 93.29%, reportedly. with ezafe. The class space size for POS tagging Another work is Noferesti and Shamsfard(2014) task is 14, consisting of the coarse-grained POS that uses both a rule-based method and a genetic tags in the 10 million word Bijankhan corpus. The algorithm. At first, they apply 53 syntactic, mor- results in Section 2.1 are unfortunately reported on phological, and lexical rules to texts to determine different, and in most cases irreproducible, test sets, words with ezafe. Then, the genetic algorithm is using accuracy as the performance measure (which employed to recognize words with ezafe, which is insufficient and unsuitable for the task), making have not been recognized at the previous step. To the comparison difficult. We hence re-implemented train and test the model, they use the 2.5 million the model that reports the highest accuracy on the word Bijankhan corpus (Bijankhan, 2004) and ob- largest test set and compare its results with ours. tain an accuracy of 95.26%. 3.1 Models 2.2 POS Tagging We experiment with three types of models: condi- Azimizadeh et al.(2008) use a trigram hidden tional random fields (CRF) (Lafferty et al., 2001), Markov model trained on the 2.5 million word Bi- recurrent neural networks (RNN) (Rumelhart et al., jankhan corpus.
Recommended publications
  • Origins of Grammatical Forms and Evidence from Latin1
    Origins of Grammatical Forms and Evidence from Latin1 Brigitte L. M. Bauer The University of Texas at Austin, U.S.A. [email protected] Analyses of grammaticalization in Indo-European languages tend to focus on Germanic and Romance forms, for which we generally are able to identify the original autonomous composing elements. By contrast, in early Indo- European languages we find many inflected formations or other grammatical elements that are opaque. The question then is: are these forms opaque (1) because of the length of their existence; or (2) because the composing elements in origin were not autonomous lexical forms, but rather of a different type? We may never be able to fully answer that question. Yet comparison of phenomena of grammaticalization in different stages of Latin will help clarify certain aspects of it. This article compares manifestations of grammaticalization in the transition from Latin to Romance with earlier formation processes in Latin and Indo-European. I will submit that the instances of grammaticalization that we find in the later stages of Latin and that resulted in a number of new grammatical forms in Romance, reflect a major linguistic innovation. While the new grammatical forms are created out of lexical or mildly grammatical autonomous elements, earlier processes seem to primarily involve particles with a certain semantic value, and freezing. This fundamental difference explains why the attempts of early Indo-Europeanists such as Franz Bopp at tracing the lexical origins of Indo-European inflected forms were unsuccessful and strongly criticized by the Neo-Grammarians. 1. Introduction Over the last thirty or so years the study of grammaticalization has been one of the hotbeds of historical linguistics.
    [Show full text]
  • Information to Users
    INFORMATION TO USERS The most advanced technology has been used to photograph and reproduce this manuscript from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct prim, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand corner and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. University Microfilms International A Bell & Howeil Information Company 300 North Zeeb Road Ann Arbor Ml 48106-1346 USA 313 761-4700 800 521-0600 Order Number 0105209 Topics on a categorlal theory of Chinese syntax Sheu, Ying-yu, Ph.D.
    [Show full text]
  • An Introduction to Syntactic Analysis and Theory
    An Introduction to Syntactic Analysis and Theory Hilda Koopman Dominique Sportiche Edward Stabler 1 Morphology: Starting with words 1 2 Syntactic analysis introduced 37 3 Clauses 87 4 Many other phrases: first glance 101 5 X-bar theory and a first glimpse of discontinuities 121 6 The model of syntax 141 7 Binding and the hierarchical nature of phrase structure 163 8 Apparent violations of Locality of Selection 187 9 Raising and Control 203 10 Summary and review 223 iii 1 Morphology: Starting with words Our informal characterization defined syntax as the set of rules or princi- ples that govern how words are put together to form phrases, well formed sequences of words. Almost all of the words in it have some common sense meaning independent of the study of language. We more or less understand what a rule or principle is. A rule or principle describes a regularity in what happens. (For example: “if the temperature drops suddenly, water vapor will condense”). This notion of rule that we will be interested in should be distinguished from the notion of a rule that is an instruction or a statement about what should happen, such as “If the light is green, do not cross the street.” As linguists, our primary interest is not in how anyone says you should talk. Rather, we are interested in how people really talk. In common usage, “word” refers to some kind of linguistic unit. We have a rough, common sense idea of what a word is, but it is surprisingly difficult to characterize this precisely.
    [Show full text]
  • A Semantic Study of the Preposition “På” in Norwegian
    Life of på A semantic study of the preposition “på” in Norwegian Vilde Reichelt MA thesis in Linguistics LING4190 Department of Linguistics and Scandinavian Studies University of Oslo November 2018 II Life of på A semantic study of the preposition “på” in Norwegian III © Vilde Reichelt 2017/18 Life of på Vilde Reichelt http://www.duo.uio.no/ Print: Reprosentralen, University of Oslo IV Abstract In this thesis I examine the lexical semantics of the Norwegian word på (eng. on) using a written translation corpus from Norwegian to English. My empirical data was collected from the written corpus OMC and an excerpt of the novel Sophie’s World in The English- Norwegian Parallel Corpus. The analysis comprised of two substudies. To test for my hypotheses, I studied noun phrases of the type ‘fur of the rabbit’ in translation from Norwegian, and the Norwegian prepositions av ‘of’, i ‘in’ and til ‘to’ in relation to på and the English translation of constructions that originally have på ‘on’ in Norwegian. The other substudy consists of both particle verbs of the type ‘believe in’ and verbs with preposition phrases ‘was on her way’. I explored nominalization of verbs and whether it give rise to a requirement for på, and what patterns there are for the semantic selection of prepositional elements. Occasionally the lexeme på functions as a complement to the verb, making it more specific, or else it appears in particle verbs as a grammatical function. Focusing on translation data from both Norwegian and English, the thesis investigates the different combinations of på and of, and the interchangeability within the Norwegian prepositions.
    [Show full text]
  • Studies in Historical Linguistics and Language Change Grammaticalization, Refunctionalization and Beyond
    Studies in Historical Linguistics and Language Change Grammaticalization, Refunctionalization and Beyond Edited by Dorien Nieuwenhuijsen and Mar Garachana Printed Edition of the Special Issue Published in Languages www.mdpi.com/journal/languages Studies in Historical Linguistics and Language Change Studies in Historical Linguistics and Language Change. Grammaticalization, Refunctionalization and Beyond Special Issue Editors Dorien Nieuwenhuijsen Mar Garachana MDPI • Basel • Beijing • Wuhan • Barcelona • Belgrade Special Issue Editors Dorien Nieuwenhuijsen Mar Garachana Utrecht University Barcelona University The Netherlands Spain Editorial Office MDPI St. Alban-Anlage 66 4052 Basel, Switzerland This is a reprint of articles from the Special Issue published online in the open access journal Languages (ISSN 2226-471X) from 2018 to 2019 (available at: https://www.mdpi.com/journal/languages/ special issues/Lingustics LanguageChange) For citation purposes, cite each article independently as indicated on the article page online and as indicated below: LastName, A.A.; LastName, B.B.; LastName, C.C. Article Title. Journal Name Year, Article Number, Page Range. ISBN 978-3-03921-576-8 (Pbk) ISBN 978-3-03921-577-5 (PDF) Cover image courtesy of Bob de Jonge. c 2019 by the authors. Articles in this book are Open Access and distributed under the Creative Commons Attribution (CC BY) license, which allows users to download, copy and build upon published articles, as long as the author and publisher are properly credited, which ensures maximum dissemination and a wider impact of our publications. The book as a whole is distributed by MDPI under the terms and conditions of the Creative Commons license CC BY-NC-ND. Contents About the Special Issue Editors ....................................
    [Show full text]
  • Processing Prescriptively Incorrect Comparative Particles: Evidence from Sentence-Matching and Eye-Tracking
    fpsyg-11-00186 February 13, 2020 Time: 18:19 # 1 ORIGINAL RESEARCH published: 14 February 2020 doi: 10.3389/fpsyg.2020.00186 Processing Prescriptively Incorrect Comparative Particles: Evidence From Sentence-Matching and Eye-Tracking Ferdy Hubers1†, Theresa Redl1,2*†, Hugo de Vos3, Lukas Reinarz4 and Helen de Hoop1 1 Centre for Language Studies, Radboud University Nijmegen, Nijmegen, Netherlands, 2 Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands, 3 Institute of Public Administration, Leiden University, Leiden, Netherlands, 4 Department of Physics, University of Bonn, Bonn, Germany Speakers of a language sometimes use particular constructions which violate prescriptive grammar rules. Despite their prescriptive ungrammaticality, they can occur rather frequently. One such example is the comparative construction in Dutch and similarly in German, where the equative particle is used in comparative constructions instead of the prescriptively correct comparative particle (Dutch beter als Jan and German besser wie Jan “lit. better as John”). In a series of three experiments Edited by: using sentence-matching and eye-tracking methodology, we investigated whether M. Teresa Espinal, Autonomous University of Barcelona, this grammatical norm violation is processed as grammatical, as ungrammatical, or Spain whether it falls in between these two. We hypothesized that the latter would be the Reviewed by: case. We analyzed our data using linear mixed effects models in order to capture Gosse Bouma, University of Groningen, Netherlands possible individual differences. The results of the sentence-matching experiments, which Katharina Spalek, were conducted in both Dutch and German, showed that the grammatical norm Humboldt University of Berlin, violation patterns with ungrammatical sentences in both languages. Our hypothesis Germany was therefore not borne out.
    [Show full text]
  • Greek and Mandarin Utterance Particles David Stifler 2007-2008
    Greek and Mandarin Utterance Particles David Stifler 2007-2008 Greek and Mandarin Utterance Particles: A Comparative Analysis David Stifler1 1. Introduction 1.1 What are particles? The word “particle” is frequently used to describe various kinds of morphemes in numerous languages. It often appears as a catch-all term serving to label any class of morpheme that does not seem to fit into what are traditionally understood to be the major categories—noun, verb, adjective, and so on. (Dobson 1974:4)2 Many interjections (among other classes) in English could be considered particles, such as the sentence-final Canadian eh or American huh, the word now or well at the beginning of a declarative sentence (usually followed by a comma in written transcriptions), or the so-called “filler” like. One would be hard-pressed to come up with a satisfactory gloss for the first word in the sentence Now, I think you’ve got the wrong idea, or the final word in the sentence Wife on your case again, eh?, but no speaker (at least, no native 1* I wish to thank my numerous professors, including those of Chinese (Jiang Laoshi, Li Laoshi, Kong Laoshi, Shi Laoshi, Zhou Laoshi and all of my Beijing instructors) and Greek (William Turpin, Rosaria Munson, Deborah Beck, John Bauschatz, and Grace Ledbetter) and Linguistics (especially my advisor, Jason Kandybowicz). I also owe a lot to the moral support of my friends, especially Sally O’Brien and Lauren Rile Smith, as well as the productive discourse I have had with many of them, especially James “Kit” Digges La Touche, Katie Bates, and Dan “Venger” Jamison.
    [Show full text]
  • Aspects of Change in the Syntax of Māori - a Corpus-Based Study
    Aspects of change in the syntax of Māori - a corpus-based study BY KARENA GWEN KELLY A thesis submitted to the Victoria University of Wellington in fulfillment of the requirements for the degree of Doctor of Philosophy Victoria University of Wellington (2015) i ABSTRACT In the current climate of Māori language revitalisation, there is ample anecdotal evidence to suggest that not only the vocabulary, but also the syntax of modern Māori is markedly different from its traditional roots, and that it shows significant influence from English syntax. However, syntactic change in Māori has not hitherto been rigorously studied. This thesis aims to provide material evidence of change in Māori syntax, through a corpus-based study of grammatical change in te reo Māori over the period of contact with English. My methodology involved the compilation and comparison of two synchronic corpora representing the two ends of the contact period to provide a diachronic perspective on the language. Each corpus consists of approximately 102,000 running words of material written originally in Māori. The early corpus contains items published pre-1900. The modern material was written post-1990. The thesis is not only an exploration of the possibility of documenting syntactic change through the use of such corpora, but also tests whether it is possible to do this using corpora significantly smaller than the multi-million word corpora typical in corpus linguistics. The scope of this methodology is tested by examining three distinct types of grammatical features: a grammatical particle (the preposition mō), a pair of semantically related lexemes that appear to be undergoing a process of grammaticalisation (the verbs taea and āhei), and a widespread grammatical construction (certain types of relative clauses).
    [Show full text]
  • A Description of Preverb and Particle Usage In
    A DESCRIPTION OF PREVERB AND PARTICLE USAGE IN INNU-AIMÛN NARRATIVE by ©Jane Bannister Bachelor of Arts Memorial University of Newfoundland (2000) A thesis submitted to the School of Graduate Studies in partial fulfillment of the requirements for the degree of Master of Arts Department of Linguistics Memorial University of Newfoundland April 2004 St. John’s Newfoundland and Labrador, Canada ABSTRACT Sentences with multiple preverbs and/or particles are examined in this thesis. The data sentences were collected from the first 18 stories of the Labrador Innu Text Project. Chapter 1 is an introduction to Innu-aimun grammar, with sections on previous research into word ordering, especially preverb ordering. Chapter 2 describes the patterning, use and co-occurrence of the ten most common preverbs in the data sentences. Preverbs are subdivided into modal preverbs, temporal preverbs, aspectual preverbs and other preverbs. Chapter 3 discusses 28 common particles in the data. These particles are also divided into smaller groups, including complementizers, focus particles, negative particles, adverbs, temporal and aspectual particles, particles of speaker opinion and particles with changed forms. Both chapters 2 and 3 include discussion of regular patterns of ordering of preverbs or particles. Chapter 4 is an analysis of the use of the independent or conjunct orders following negative particles. Optimality Theory is used to explain Innu data, and sentences are analyzed based on Brittain (2001, 1997). A general thesis conclusion ends chapter 4. ii ACKNOWLEDGMENTS Thanks to the many people who have helped me create and finish this thesis. Thanks to my supervisor Phil Branigan, for sensible suggestions and a calming demeanor.
    [Show full text]
  • 51 on the Periphery of the Parts of Speech System Rakhim Z. Muryasov
    On the periphery of the parts of speech system Rakhim Z. Muryasov DOI: 10.18355/XL.2019.12.04.05 Abstract The article is devoted to the consideration of the periphery of the parts of speech system. The theory of parts of speech, constituting the core of the theory of grammatical structure and serving as the basis for describing the linguistic system as a whole, is an area of controversial debates in the context of various linguistic paradigms. The internal contradictions inherent in the language system do not allow developing integrated, logically consistent criteria for the classification of lexis, covering all categories of words. The functional-semantic and formal heterogeneity of lexical categories does not make it possible to include them into “the Procrustean bed,” a scheme recommended by various linguistic schools. The most contradictory classifications are the ones of the so-called synsemantic, functional words which are characterized by a high degree of categorical interpenetration. The determination of the categorical status of this kind of words is possible only on the basis of semantic and pragmatic features of discourse, as well as when taking into account presupposition factors. Key words: auto-and synsemantic parts of speech, prepositions, conjunctions, particles, interjections, onomatopoeic words Introduction 1.1. The Problem Statement Parts of speech, making the fundamental category of the language system, have been the object of contradictory discussions for centuries and remain one of the controversial issues at present (Simsek, Elciyar, Kizilhan, 2019). The classification of parts of speech is based on a number of criteria: semantic (the generalized lexical meaning), morphological (the combination of grammatical categories with the corresponding paradigms), and syntactic (functions of words in a sentence) (Valeeva, Aitov, Bulatbayeva, 2016; Sum, Kwon, 2018).
    [Show full text]
  • Towards a Correlation of Form, Use and Meaning of German Ge-Prefixed
    a journal of Pross, Tillmann and Antje Roßdeutscher. 2019. Towards a correlation of form, general linguistics Glossa use and meaning of German ge-prefixed predicative participles. Glossa: a journal of general linguistics 4(1): 93. 1–31. DOI: https://doi.org/10.5334/gjgl.753 RESEARCH Towards a correlation of form, use and meaning of German ge-prefixed predicative participles Tillmann Pross and Antje Roßdeutscher University of Stuttgart, DE Corresponding author: Tillmann Pross ([email protected]) We argue for a split semantics of German predicative participle constructions, depending on whether or not the formation of the participle involves prefixation with the dedicated morpheme ge-. Against the background of the analysis of participles of German be-prefixed verbs proposed in Pross (2019), and using the licensing of superlative constructions and ung-nominalizations as tests, we show that ge-prefixed participles denote a result relation between a property of an event and an individual. In contrast, be-prefixed participles, like adjectives, denote properties of individuals. We cast the distinction between event properties and individual properties in a compositional semantics of ge-and be-prefixed participles and show how the resulting semantic distinction allows to predict the distinction between target and resultant state participles drawn in Kratzer (2000) without using the questionable immer noch ‘still’ test. Keywords: German; Lexical Semantics; Adjectival Passive; Predicative Participle; Stativity; Target State; Resultant State 1 Introduction 1.1 Goals of the paper This paper is about German participles. More specifically, its focus is the analysis of predicative uses of German participles, the formation of which requires prefixation of the base verb with a dedicated morpheme ge- as in (1-b).
    [Show full text]
  • Scope of Linguistic Influence: Does a Classifier System Alter Object Concepts?
    Journal of Experimental Psychology: General Copyright 2007 by the American Psychological Association 2007, Vol. 136, No. 3, 485–501 0096-3445/07/$12.00 DOI: 10.1037/0096-3445.136.3.485 Scope of Linguistic Influence: Does a Classifier System Alter Object Concepts? Henrik Saalbach Mutsumi Imai Max Planck Institute for Human Development Keio University at Shonan-Fujisawa Whether and to what extent conceptual structure is universal is of great importance for understanding the nature of human concepts. Two major factors that might affect concepts are language and culture. The authors investigated whether these 2 factors affect concepts of everyday objects in any significant ways. Specifically, they tested (a) whether the system of grammatical categorization by classifiers influenced the conceptual structure of speakers of classifier languages, and (b) whether Westerners organized object concepts around taxonomic relations whereas Easterners organized them around thematic relations, as proposed by R. E. Nisbett (2003). The relative importance of 3 types of relations—taxonomic, thematic, and classifier—for Chinese and German speakers was tested using a range of tasks, including categorization, similarity judgment, property induction, and fast-speed word–picture matching. Some support for linguistic relativity as well as for the cultural-specific cognition proposal was found in some tasks, but these effects were miniscule compared with the importance of taxonomic and thematic relations for both language–culture groups. The authors conclude that the global structure of everyday object concepts is strikingly similar across different cultures and languages. Keywords: concepts, linguistic relativity, culture-specific cognition, classifiers, thematic relations Language labels and refers to categories. Object categories are classifier grammar system categorizes nouns into more than 100 referred to by nouns, but in many languages, nouns are further semantic categories (Craig, 1986; Grinevald, 2000).
    [Show full text]