Squibs Telicity Vs. Perfectivity: a Case Study of Odia1 Complex Predicates
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Semi-Supervised Learning of HMM to Build a POS Tagger for a Low Resourced Language
J. lnf. Commun. Converg. Eng. 18(4): 207-215, Dec. 2020 Regular paper A Semi-supervised Learning of HMM to Build a POS Tagger for a Low Resourced Language 1* 2 3 Sagarika Pattnaik , Ajit Kumar Nayak , and Srikanta Patnaik 1Department of Computer Science and Engineering, SOA, Deemed to be University, Bhubaneswar, 751030, India 2Department of Computer Science and Information Technology, SOA, Deemed to be University, Bhubaneswar, 751030, India 3Department of Computer Science and Engineering, SOA, Deemed to be University, Bhubaneswar, 751030, India Abstract Part of speech (POS) tagging is an indispensable part of major NLP models. Its progress can be perceived on number of languages around the globe especially with respect to European languages. But considering Indian Languages, it has not got a major breakthrough due lack of supporting tools and resources. Particularly for Odia language it has not marked its dominancy yet. With a motive to make the language Odia fit into different NLP operations, this paper makes an attempt to develop a POS tagger for the said language on a HMM (Hidden Markov Model) platform. The tagger judiciously considers bigram HMM with dynamic Viterbi algorithm to give an output annotated text with maximum accuracy. The model is experimented on a corpus belonging to tourism domain accounting to a size of approximately 0.2 million tokens. With the proportion of training and testing as 3:1, the proposed model exhibits satisfactory result irrespective of limited training size. Index Terms: HMM, NLP, Odia, POS tagger I. INTRODUCTION the morphological complexities of various languages has been a major issue. The process is also limited by the availability of Part-of-speech (POS) tagging or grammatical tagging, an required resources, such as a sufficient training corpus and lin- integral part of significant natural language processing (NLP) guistic information related to the language. -
Nominal and Verbal Parameters in the Diachrony of Differential Object Marking in Spanish Marco García García University of Cologne
Chapter 8 Nominal and verbal parameters in the diachrony of differential object marking in Spanish Marco García García University of Cologne This paper deals with the influence that nominal and verbal parameters have on DOMinthe diachrony of Spanish. Comparing selected corpus studies, I will focus first on the different nominal parameters that build up the animacy and referentiality scales, in particular on an- imacy and definiteness. In order to clarify how far DOM has diachronically evolved, special attention will be paid to inanimate objects, which can be viewed as the alleged endpointin the development of DOM in Spanish. Secondly, I will provide a systematic overview of the relevant verbal parameters, which include aspect, affectedness and agentivity. The study will show a complex interaction of nominal and verbal parameters, revealing some unex- pected correlations: Obligatory object marking is not only found with human and strongly affected objects involved in a telic event, but also with inanimate, non-affected andagen- tive objects embedded in a stative event. In other words, in Spanish DOM patterns with both extremely high and extremely low transitivity. These findings sharply contrast with traditional accounts concerning the development as well as the explanation of DOM. 1 Introduction Differential object marking (DOM) is a well-attested phenomenon within Romance lan- guages (for an overview see Bossong 1998: 218–230). While in some Romance languages, such as Catalan or Modern Portuguese, DOM is confined to a reduced number of con- texts, in others, such as Sardinian or Spanish, it is found in many more contexts. This paper will focus on Spanish, where DOM seems to have reached a greater stage of de- velopment than in any other Romance language. -
30. Tense Aspect Mood 615
30. Tense Aspect Mood 615 Richards, Ivor Armstrong 1936 The Philosophy of Rhetoric. Oxford: Oxford University Press. Rockwell, Patricia 2007 Vocal features of conversational sarcasm: A comparison of methods. Journal of Psycho- linguistic Research 36: 361−369. Rosenblum, Doron 5. March 2004 Smart he is not. http://www.haaretz.com/print-edition/opinion/smart-he-is-not- 1.115908. Searle, John 1979 Expression and Meaning. Cambridge: Cambridge University Press. Seddiq, Mirriam N. A. Why I don’t want to talk to you. http://notguiltynoway.com/2004/09/why-i-dont-want- to-talk-to-you.html. Singh, Onkar 17. December 2002 Parliament attack convicts fight in court. http://www.rediff.com/news/ 2002/dec/17parl2.htm [Accessed 24 July 2013]. Sperber, Dan and Deirdre Wilson 1986/1995 Relevance: Communication and Cognition. Oxford: Blackwell. Voegele, Jason N. A. http://www.jvoegele.com/literarysf/cyberpunk.html Voyer, Daniel and Cheryl Techentin 2010 Subjective acoustic features of sarcasm: Lower, slower, and more. Metaphor and Symbol 25: 1−16. Ward, Gregory 1983 A pragmatic analysis of epitomization. Papers in Linguistics 17: 145−161. Ward, Gregory and Betty J. Birner 2006 Information structure. In: B. Aarts and A. McMahon (eds.), Handbook of English Lin- guistics, 291−317. Oxford: Basil Blackwell. Rachel Giora, Tel Aviv, (Israel) 30. Tense Aspect Mood 1. Introduction 2. Metaphor: EVENTS ARE (PHYSICAL) OBJECTS 3. Polysemy, construal, profiling, and coercion 4. Interactions of tense, aspect, and mood 5. Conclusion 6. References 1. Introduction In the framework of cognitive linguistics we approach the grammatical categories of tense, aspect, and mood from the perspective of general cognitive strategies. -
Hundred Sentences and XFST Assignments
Hundred Sentences and XFST Assignments 2-19-14 Your assignment • Make an elicitation checklist of at least 100 sentences covering these things: – Transitive and intransitive verbs. – Semantic verb classes: stativity, dynamicity, and telicity. – Tense/mood/aspect – Special sentence types: • Existential • Copula • Possession • Questions – Speech acts: • Statement • Command/prohibition • Question – Negation Your assignment continued • Noun types – Pronoun – Common – Proper – Count/Mass – Concrete/Abstract • Noun phrases – Definiteness – Possession – Proximity – Diminutive/Augmentative – Quantity • Cardinal numbers • Quantifiers: all, each, both, some – Ordinal (first, second) – Partitive • The top of the tree • The bottom of the tree Your assignment continued • Adjectives – Comparative – Superlative – Intensive (very) – Unintensive (a little) • Adpositions and/or case marking • Typical modiers of nouns and verbs. – Location, time, manner, etc. Example: Belele (Bwele) (Inspiration from Warlpiri and Iñupiaq) Bele-le word-generic Intransitive sentences. SV word order. “Wordkind” Fwe-le bar -la -to Verb morphology: root-aspect-tense Bird-gener sing-hab-pres “Birds sing” Aspects: habitual -la Fwe-n bar -la -to Punctual Ø Bird-def sing-hab-pres Inceptive -go “The bird sings” Fwe-n bar-na Tenses: Bird-def sing-past Present -to “The bird sang” Past -na Fwe-n bar- go- na Bird-def sing-incep-past “The bird started to sing” Example continued • Compound noun: First element not marked for number and definiteness Nar -bi ndo –ki -n There are adjective stems, but they happy -one day –pl –def have to be turned into nouns in order “the happy days” to use them. -bi turns an adjective into a noun. • Two noun phrases (either order, or can be discontinuous): Nar -bi -ki -n ndo –ki -n happy -one –pl-def day –pl –def Nouns and their modifiers can be in either order of can be “the happy ones the days” separated if the modifier is marked for number and ndo –ki -n nar -bi -ki -n definiteness. -
Early Sensitivity to Telicity: the Role of the Count/Mass Distinction in Event Individuation
W O R K I N G P A P E R S I N L I N G U I S T I C S The notes and articles in this series are progress reports on work being carried on by students and faculty in the Department. Because these papers are not finished products, readers are asked not to cite from them without noting their preliminary nature. The authors welcome any comments and suggestions that readers might offer. Volume 41(4) 2010 (April) DEPARTMENT OF LINGUISTICS UNIVERSITY OF HAWAI‘I AT MĀNOA HONOLULU 96822 An Equal Opportunity/Affirmative Action Institution WORKING PAPERS IN LINGUISTICS: UNIVERSITY OF HAWAI‘I AT MĀNOA, VOL. 41(4) DEPARTMENT OF LINGUISTICS FACULTY 2010 Victoria B. Anderson Byron W. Bender (Emeritus) Benjamin Bergen Derek Bickerton (Emeritus) Robert A. Blust Robert L. Cheng (Adjunct) Kenneth W. Cook (Adjunct) Kamil Deen Patricia J. Donegan (Co-Graduate Chair) Katie K. Drager Emanuel J. Drechsel (Adjunct) Michael L. Forman (Emeritus) George W. Grace (Emeritus) John H. Haig (Adjunct) Roderick A. Jacobs (Emeritus) Paul Lassettre P. Gregory Lee Patricia A. Lee Howard P. McKaughan (Emeritus) William O’Grady (Chair) Yuko Otsuka Ann Marie Peters (Emeritus, Co-Graduate Chair) Kenneth L. Rehg Lawrence A. Reid (Emeritus) Amy J. Schafer Albert J. Schütz, (Emeritus, Editor) Ho Min Sohn (Adjunct) Nicholas Thieberger Laurence C. Thompson (Emeritus) ii EARLY SENSITIVITY TO TELICITY: THE ROLE OF THE COUNT/MASS DISTINCTION IN EVENT INDIVIDUATION YUKIE HARA1 This paper presents evidence that English-speaking children are sensitive to telicity based on the count/mass distinction of the object noun in verb phrases such as eat an apple (telic) vs. -
A Survey on Odia Computational Morphology
International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Volume 3 Issue 3, March 2014 A Survey on Odia Computational Morphology Dhabal Prasad Sethi ABSTRACT In natural language processing morphological analyzer, The rest part is organized as section II describes the literature morphological generator and morphological parser is the survey, section III describes the different methods of essential common tool used in developing language morphological analyzer and generator, section IV describes software. Morphological analyzer is a computer program the applications and section V describes the conclusion. which provides the grammatical information of a word. II.LITERATURE SURVEY Morphological generator, by giving/providing the root word Itisree Jena, Sriram Chaudhury, Himani Chaudhry, Dipti M. and grammatical information, it generates all word forms of Sharma [3] presented a paper named “Developing Oriya that word. On the other hand morphological parsing is the Morphological Analyzer Using Lt-toolbox “.They have process of determining the morphemes from given word? In developed the morphological analyzer for Odia using this article I surveys the different work have done about paradigm approach. The paradigm approach is a method Odia morphology. which defines the all the word form of a given stem and its Keyword: Morphology, Morphological Analyzer, associated feature structure. Their system handles the Morphological Generator, Morphological Parser, Information inflectional morphology of noun, verb and adjectives. Retrieval, Stemmer, IE, Spell Checker, Machine Translation, R.C Balabantray, M.K.Jena S.Mohanty[4] presented a paper Question Answering System. named “Shallow Morphology based complex predicates extraction in Oriya” The aim of their paper is to extracting the I. -
Corpus Study of Tense, Aspect, and Modality in Diglossic Speech in Cairene Arabic
CORPUS STUDY OF TENSE, ASPECT, AND MODALITY IN DIGLOSSIC SPEECH IN CAIRENE ARABIC BY OLA AHMED MOSHREF DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate College of the University of Illinois at Urbana-Champaign, 2012 Urbana, Illinois Doctoral Committee: Professor Elabbas Benmamoun, Chair Professor Eyamba Bokamba Professor Rakesh M. Bhatt Assistant Professor Marina Terkourafi ABSTRACT Morpho-syntactic features of Modern Standard Arabic mix intricately with those of Egyptian Colloquial Arabic in ordinary speech. I study the lexical, phonological and syntactic features of verb phrase morphemes and constituents in different tenses, aspects, moods. A corpus of over 3000 phrases was collected from religious, political/economic and sports interviews on four Egyptian satellite TV channels. The computational analysis of the data shows that systematic and content morphemes from both varieties of Arabic combine in principled ways. Syntactic considerations play a critical role with regard to the frequency and direction of code-switching between the negative marker, subject, or complement on one hand and the verb on the other. Morph-syntactic constraints regulate different types of discourse but more formal topics may exhibit more mixing between Colloquial aspect or future markers and Standard verbs. ii To the One Arab Dream that will come true inshaa’ Allah! عربية أنا.. أميت دمها خري الدماء.. كما يقول أيب الشاعر العراقي: بدر شاكر السياب Arab I am.. My nation’s blood is the finest.. As my father says Iraqi Poet: Badr Shaker Elsayyab iii ACKNOWLEDGMENTS I’m sincerely thankful to my advisor Prof. Elabbas Benmamoun, who during the six years of my study at UIUC was always kind, caring and supportive on the personal and academic levels. -
The Interaction Between Telicity and Agentivity: Experimental Evidence from Intransitive Verbs in German and Chinese
Originally published in: Lingua vol. 200 (2017), pp. 84-106. DOI: https://doi.org/10.1016/j.lingua.2017.08.006 POSTPRINT a,b,* a a a,b Tim Graf , Markus Philipp , Xiaonan Xu , Franziska Kretzschmar , a Beatrice Primus a Institute of German Language and Literature I, University of Cologne, Albertus-Magnus-Platz, 50923 Cologne, Germany b CRC 1252 Prominence in Language, University of Cologne, Germany The interaction between telicity and agentivity: Experimental evidence from intransitive verbs in German and Chinese Abstract: Telicity and agentivity are semantic factors that split intransitive verbs into (at least two) different classes. Clear-cut unergative verbs, which select the auxiliary HAVE, are assumed to be atelic and agent-selecting; unequivocally unaccusative verbs, which select the auxiliary BE, are analyzed as telic and patient-selecting. Thus, agentivity and telicity are assumed to be inversely correlated in split intransitivity. We will present semantic and experimental evidence from German and Mandarin Chinese that casts doubts on this widely held assumption. The focus of our experimental investigation lies on variation with respect to agentivity (specifically motion control, manipulated via animacy), telicity (tested via a locative vs. goal adverbial), and BE/HAVE-selection with semantically flexible intransitive verbs of motion. Our experimental methods are acceptability ratings for German and Chinese (Experiments 1 and 2) and event-related potential (ERP) measures for German (Experiment 3). Our findings contradict the above- mentioned assumption that agentivity and telicity are generally inversely correlated and suggest that for the verbs under study, agentivity and telicity harmonize with each other. Furthermore, the ERP measures reveal that the impact of the interaction under discussion is more pronounced on the verb lexeme than on the auxiliary. -
Classicism of Odia Language
Orissa Review Classicism of Odia Language Dr. G.K. Srichandan Man expresses his feelings and emotions through neighbouring languages are Hindi, Bengali, Telgu, sentences of any language-spoken or written. Tamil etc. In addition to this 62 denominations Thus, human beings use many words in sentences (communities) of tribals live within the State of of language which have come to us by oral use Odisha. Their spoken languages or dialects had for hundreds of thousand years. Words are have impact on Odia language for her evolution evolved and use for expression of man¶s inner and enrichment. feelings. We borrow such words from other languages also to supplement our own mother Except Surya dynasty rulers all other tongue. preceding dynasty rulers have come from outside and brought with them soldiers and officers The vocabulary of a language depends speaking their native language and after settling in on the richness or treasure of words accumulated Udra Desha had embraced the then prevailing and stored in that language by passage of time Odia language. That had impact on development from the time immemorial. Odia is the mother of the vocabulary of Odia language. language of the majority of the people of Odisha at present and also in the past. The present In 1568 Odisha empire lost her Odisha is the political entity with distinct boundary. sovereignty and independence on sudden and sad But outside Odisha there exists Odia speaking demise of the last independent and sovereign tracts which was the result or resultant of the Hindu king-emperor Gajapati Mukunda Deva in conquests made by the Ganga Vansi and Surya Gohiri Tikira battle field. -
The Telicity Parameter Revisited Hana Filip SRI in Ternational- NL P Group Discern & Stanfo Rd University
The Telicity Parameter Revisited Hana Filip SRI In ternational- NL P Group Discern & Stanfo rd University 1. Introduction The goals of this paper are threefold: First, to review some recent syntactic accounts of cross-linguistic differences in the expression of telicity in Slavic vs. Germanic languages. Second, I will argue that the parametric variation in the encoding of telicity cannot be based on a unidirectional specifier head agreement between the verbal functional head linked to the telicity of the VP and the DO-DP in its specifier position, with languages exhibiting two clearly distinct modes of assigning telicity to the functional head. In the simplest terms, in Germanic languages, it is assIgned by the DO-DP and in Slavic by the perfective/imperfective aspect of the lexical VP head. Rather, in a given telicity structure in both Slavic and Germanic languages, we actually observe mutual constraints and interactions between the head verb and one of its semantic arguments, namely, the incremental argument. Third, the variation in the encoding of telicity cannot be limited just to syntactic factors. Instead, it is semantic (and also pragmatic) factors that ultimately motivate (i) the phenomena that the syntactic parametric approach tries to capture, and also (ii) telicity phenomena that are a priori precluded by it, leftout or unnoticed. In this connection, I will defend the familiar (though often forgotten) insights of Krifka's (1986 and elsewhere) and Dowty's (1991) mereological theory oftelicity. 2. The syntactic telicity parameter One dominant strand of research locates all variation in the encoding of telicity in syntax, with much of the work being motivated by assumptions of Minimalism and/or Distributed Morphology. -
The Telicity Parameter Revisited Hana Filip SRI International – NLP Group Discern & Stanford University
The Telicity Parameter Revisited Hana Filip SRI International – NLP Group Discern & Stanford University 1. Introduction The goals of this paper are threefold: First, to review some recent syntactic accounts of cross-linguistic differences in the expression of telicity in Slavic vs. Germanic languages. Second, I will argue that the parametric variation in the encoding of telicity cannot be based on a unidirectional specifier head agreement between the verbal functional head linked to the telicity of the VP and the DO-DP in its specifier position, with languages exhibiting two clearly distinct modes of assigning telicity to the functional head. In the simplest terms, in Germanic languages, it is assigned by the DO-DP and in Slavic by the perfective/imperfective aspect of the lexical VP head. Rather, in a given telicity structure in both Slavic and Germanic languages, we actually observe mutual constraints and interactions between the head verb and one of its semantic arguments, namely, the incremental argument. Third, the variation in the encoding of telicity cannot be limited just to syntactic factors. Instead, it is semantic (and also pragmatic) factors that ultimately motivate (i) the phenomena that the syntactic parametric approach tries to capture, and also (ii) telicity phenomena that are a priori precluded by it, left out or unnoticed. In this connection, I will defend the familiar (though often forgotten) insights of Krifka’s (1986 and elsewhere) and Dowty’s (1991) mereological theory of telicity. 2. The syntactic telicity parameter One dominant strand of research locates all variation in the encoding of telicity in syntax, with much of the work being motivated by assumptions of Minimalism and/or Distributed Morphology. -
Telicity and Differential Object Marking in the History of Spanish
Diego Romero Heredero Telicity and Differential Object Marking in the History of Spanish Abstract: This paper addresses the influence of telicity on DOM (Differential Object Marking) from a diachronic perspective. Previous literature has frequently pointed to telicity as one of the verbal factors that trigger DOM in Spanish. However, this as- sumption has not been tested empirically. The present study provides a corpus analy- sis examining the question of whether the assumed impact of telicity on DOM in Modern Spanish, which has been noted in synchronic studies, also occurs in previous stages of Spanish. The corpus-based study covers the GHth, GIth, and JKth centuries, and is the first empirical study that addresses the relationship between telicity and DOM with human direct objects. The results point in the opposite direction to what has been proposed by previous literature, suggesting that telicity had no significant impact on DOM in any of the periods studied. These findings lead both to discard telicity as a factor that influences DOM independently, and to raise the question of whether it is possible that the effect of telicity observed in previous studies could be the conse- quence of the interaction of telicity with other verbal factors such as affectedness or agentivity. Keywords: aspectual class, corpus analysis, definiteness, Differential Object Mark- ing, language change, lexical aspect, telicity, verbal factors. ── Diego Romero Heredero: SFB %&'& ‘Prominence in Language’, Universität zu Köln, Luxemburger Str. &CC, D-'FCGC Köln, (+JC) &&% JLF-MCCFG, [email protected] 2 │ Diego Romero Heredero 1 Introduction Differential Object Marking (DOM) is a grammatical phenomenon conditioned by the interaction of various factors (see García García, JKGN for an overview).