Hungarian Copula Constructions in Dependency Syntax and Parsing
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Transfer Learning for Singlish Universal Dependencies Parsing and POS Tagging
From Genesis to Creole language: Transfer Learning for Singlish Universal Dependencies Parsing and POS Tagging HONGMIN WANG, University of California Santa Barbara, USA JIE YANG, Singapore University of Technology and Design, Singapore YUE ZHANG, West Lake University, Institute for Advanced Study, China Singlish can be interesting to the computational linguistics community both linguistically as a major low- resource creole based on English, and computationally for information extraction and sentiment analysis of regional social media. In our conference paper, Wang et al. [2017], we investigated part-of-speech (POS) tagging and dependency parsing for Singlish by constructing a treebank under the Universal Dependencies scheme, and successfully used neural stacking models to integrate English syntactic knowledge for boosting Singlish POS tagging and dependency parsing, achieving the state-of-the-art accuracies of 89.50% and 84.47% for Singlish POS tagging and dependency respectively. In this work, we substantially extend Wang et al. [2017] by enlarging the Singlish treebank to more than triple the size and with much more diversity in topics, as well as further exploring neural multi-task models for integrating English syntactic knowledge. Results show that the enlarged treebank has achieved significant relative error reduction of 45.8% and 15.5% on the base model, 27% and 10% on the neural multi-task model, and 21% and 15% on the neural stacking model for POS tagging and dependency parsing respectively. Moreover, the state-of-the-art Singlish POS tagging and dependency parsing accuracies have been improved to 91.16% and 85.57% respectively. We make our treebanks and models available for further research. -
A Syntactic Analysis of Copular Sentences*
A SYNTACTIC ANALYSIS OF COPULAR SENTENCES* Masato Niimura Nanzan University 1. Introduction Copular is a verb whose main function is to link subjects with predicate complements. In a limited sense, the copular refers to a verb that does not have any semantic content, but links subjects and predicate complements. In a broad sense, the copular contains a verb that has its own meaning and bears the syntactic function of “the copular.” Higgins (1979) analyses English copular sentences and classifies them into three types. (1) a. predicational sentence b. specificational sentence c. identificational sentence Examples are as follows: (2) a. John is a philosopher. b. The bank robber is John Smith. c. That man is Mary’s brother. (2a) is an example of predicational sentence. This type takes a reference as subject, and states its property in predicate. In (2a), the reference is John, and its property is a philosopher. (2b) is an example of specificational sentence, which expresses what meets a kind of condition. In (2b), what meets the condition of the bank robber is John Smith. (2c) is an example of identificational sentence, which identifies two references. This sentence identifies that man and Mary’s brother. * This paper is part of my MA thesis submitted to Nanzan University in March, 2007. This paper was also presented at the Connecticut-Nanzan-Siena Workshop on Linguistic Theory and Language Acquisition, held at Nanzan University on February 21, 2007. I would like to thank the participants for the discussions and comments on this paper. My thanks go to Keiko Murasugi, Mamoru Saito, Tomohiro Fujii, Masatake Arimoto, Jonah Lin, Yasuaki Abe, William McClure, Keiko Yano and Nobuko Mizushima for the invaluable comments and discussions on the research presented in this paper. -
An Analysis of the Content Words Used in a School Textbook, Team up English 3, Used for Grade 9 Students
[Pijarnsarid et. al., Vol.5 (Iss.3): March, 2017] ISSN- 2350-0530(O), ISSN- 2394-3629(P) ICV (Index Copernicus Value) 2015: 71.21 IF: 4.321 (CosmosImpactFactor), 2.532 (I2OR) InfoBase Index IBI Factor 3.86 Social AN ANALYSIS OF THE CONTENT WORDS USED IN A SCHOOL TEXTBOOK, TEAM UP ENGLISH 3, USED FOR GRADE 9 STUDENTS Sukontip Pijarnsarid*1, Prommintra Kongkaew2 *1, 2English Department, Graduate School, Ubon Ratchathani Rajabhat University, Thailand DOI: https://doi.org/10.29121/granthaalayah.v5.i3.2017.1761 Abstract The purpose of this study were to study the content words used in a school textbook, Team Up in English 3, used for Grade 9 students and to study the frequency of content words used in a school textbook, Team Up in English 3, used for Grade 9 students. The study found that nouns is used with the highest frequency (79), followed by verb (58), adjective (46), and adverb (24).With the nouns analyzed, it was found that the Modifiers + N used with the highest frequency (92.40%), the compound nouns were ranked in second (7.59 %). Considering the verbs used in the text, it was found that transitive verbs were most commonly used (77.58%), followed by intransitive verbs (12.06%), linking verbs (10.34%). As regards the adjectives used in the text, there were 46 adjectives in total, 30 adjectives were used as attributive (65.21 %) and 16 adjectives were used as predicative (34.78%). As for the adverbs, it was found that adverbs of times were used with the highest frequency (37.5 % ), followed by the adverbs of purpose and degree (33.33%) , the adverbs of frequency (12.5 %) , the adverbs of place ( 8.33% ) and the adverbs of manner ( 8.33 % ). -
Universidaddesonora
U N I V E R S I D A D D E S O N O R A División de Humanidades y Bellas Artes Maestría en Lingüística Nominal and Adjectival Predication in Yoreme/Mayo of Sonora and Sinaloa TESIS Que para optar por el grado de Maestra en Lingüística presenta Rosario Melina Rodríguez Villanueva 2012 Universidad de Sonora Repositorio Institucional UNISON Excepto si se seala otra cosa, la licencia del tem se describe como openAccess CONTENTS DEDICATION……………………………………………………………………….. 4 ACKNOWLEDGEMENTS………………………………………………………….. 5 ABBREVIATIONS………………………………………………………………….. 7 INTRODUCTION…………………………………………………………………… 12 CHAPTER 1: The Yoreme/Mayo and their language……………………………….. 16 1.1 Ethnographic and Sociolinguistic Context………………………………………. 16 1.1.1 Geographic Location of the Yoreme/Mayo…………………………………… 16 1.1.2 Social Organization of the Yoreme/Mayo……………………………………... 20 1.1.3 Economy and Working Trades………………………………………………… 21 1.1.4 Religion and Cosmogony……………………………………………………… 22 1.2 The Yoreme/Mayo language…………………………………………………….. 23 1.2.1 Geographical location and genetic affiliation………………………………….. 23 1.2.2 Phonology……………………………………………………………………… 27 1.2.2.1 Consonants………………………………………………………………….. 27 1.2.2.2 Vowels……………………………………………………………………….. 30 1.2.3 Typological Characteristics……………………………………………………. 33 1.2.3.1 Classification………………………………………………………………… 33 1.2.3.2 Marking: head or dependent?........................................................................... 35 1.2.3.3 Word Order…………………………………………………………………. 40 1.2.3.4 Case-Marking………………………………………………………………. 43 1.3 Previous Documentation and Description of Yoreme/Mayo……………………. 48 1 CHAPTER 2: Theoretical Preliminaries…………………………………………….. 52 2.0 Introduction……………………………………………………………………… 52 2.1 Predication: Verbal and Non-verbal 53 2.1.1 Definition………………………………………………………………………. 53 2.1.2 Verbal Predication……………………………………………………………. 56 2.1.3 Non-verbal Predication………………………………………………………… 61 2.2. The Syntax of Non-verbal Predication………………………………………… 71 2.2.1 Nominal Predication…………………………………………………………. -
ASYMMETRIES in the PROSODIC PHRASING of FUNCTION WORDS: ANOTHER LOOK at the SUFFIXING PREFERENCE Nikolaus P
ASYMMETRIES IN THE PROSODIC PHRASING OF FUNCTION WORDS: ANOTHER LOOK AT THE SUFFIXING PREFERENCE Nikolaus P. Himmelmann Universität zu Köln It is a well-known fact that across the world’s languages there is a fairly strong asymmetry in the affixation of grammatical material, in that suffixes considerably outnumber prefixes in typo - logical databases. This article argues that prosody, specifically prosodic phrasing, plays an impor - tant part in bringing about this asymmetry. Prosodic word and phrase boundaries may occur after a clitic function word preceding its lexical host with sufficient frequency so as to impede the fu - sion required for affixhood. Conversely, prosodic boundaries rarely, if ever, occur between a lexi - cal host and a clitic function word following it. Hence, prosody does not impede the fusion process between lexical hosts and postposed function words, which therefore become affixes more easily. Evidence for the asymmetry in prosodic phrasing is provided from two sources: disfluencies, and ditropic cliticization, that is, the fact that grammatical pro clitics may be phonological en clit- ics (i.e. phrased with a preceding host), but grammatical enclitics are never phonological proclit- ics. Earlier explanations for the suffixing preference have neglected prosody almost completely and thus also missed the related asymmetry in ditropic cliticization. More importantly, the evi - dence from prosodic phrasing suggests a new venue for explaining the suffixing preference. The asymmetry in prosodic phrasing, which, according to the hypothesis proposed here, is a major fac - tor underlying the suffixing preference, has a natural basis in the mechanics of turn-taking as well as in the mechanics of speech production.* Keywords : affixes, clitics, language processing, turn-taking, grammaticization, explanation in ty - pology, Tagalog 1. -
A Grammar of Gyeli
A Grammar of Gyeli Dissertation zur Erlangung des akademischen Grades doctor philosophiae (Dr. phil.) eingereicht an der Kultur-, Sozial- und Bildungswissenschaftlichen Fakultät der Humboldt-Universität zu Berlin von M.A. Nadine Grimm, geb. Borchardt geboren am 28.01.1982 in Rheda-Wiedenbrück Präsident der Humboldt-Universität zu Berlin Prof. Dr. Jan-Hendrik Olbertz Dekanin der Kultur-, Sozial- und Bildungswissenschaftlichen Fakultät Prof. Dr. Julia von Blumenthal Gutachter: 1. 2. Tag der mündlichen Prüfung: Table of Contents List of Tables xi List of Figures xii Abbreviations xiii Acknowledgments xv 1 Introduction 1 1.1 The Gyeli Language . 1 1.1.1 The Language’s Name . 2 1.1.2 Classification . 4 1.1.3 Language Contact . 9 1.1.4 Dialects . 14 1.1.5 Language Endangerment . 16 1.1.6 Special Features of Gyeli . 18 1.1.7 Previous Literature . 19 1.2 The Gyeli Speakers . 21 1.2.1 Environment . 21 1.2.2 Subsistence and Culture . 23 1.3 Methodology . 26 1.3.1 The Project . 27 1.3.2 The Construction of a Speech Community . 27 1.3.3 Data . 28 1.4 Structure of the Grammar . 30 2 Phonology 32 2.1 Consonants . 33 2.1.1 Phonemic Inventory . 34 i Nadine Grimm A Grammar of Gyeli 2.1.2 Realization Rules . 42 2.1.2.1 Labial Velars . 43 2.1.2.2 Allophones . 44 2.1.2.3 Pre-glottalization of Labial and Alveolar Stops and the Issue of Implosives . 47 2.1.2.4 Voicing and Devoicing of Stops . 51 2.1.3 Consonant Clusters . -
Copula-Less Nominal Sentences and Matrix-C Clauses: a Planar View Of
Copula-less Nominal Sentences and Matrix-C0 Clauses: A Planar View of Clause Structure * Tanmoy Bhattacharya Abstract This paper is an attempt to unify the apparently unrelated sentence types in Bangla, namely, copula-less nominal sentences and matrix-C clauses. In particular, the paper claims that a “Planar” view of clause structure affords us, besides the above unification, a better account of the Kaynean Algorithm (KA, as in Kayne (1998a, b; 1999)) in terms of an interface driven motivation to break symmetry. In order to understand this unification, the invocation of the KA, that proclaims most significantly the ‘non-constituency’ of the C and its complement, is relevant if we agree with the basic assumption of this paper that views both Cs and Classifiers as disjoint from the main body (plane) of the clause. Such a disjunction is exploited further to introduce a new perspective of the structure of clauses, namely, the ‘planar’ view. The unmistakable non-linearity of the KA is seen here as a multi-planar structure creation; the introduction of the C/ CLA thus implies a new plane introduction. KA very strongly implies a planar view of clause structure. The identification of a plane is considered to be as either required by the C-I or the SM interface and it is shown that this view matches up with the duality of semantics, as in Chomsky (2005). In particular, EM (External Merge) is required to introduce or identify a new plane, whereas IM (Internal Merge) is inter- planar. Thus it is shown that inter-planar movements are discourse related, whereas intra-planar movements are not discourse related. -
Identification of Zero Copulas in Hungarian Using
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 4802–4810 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Much Ado About Nothing Identification of Zero Copulas in Hungarian Using an NMT Model Andrea Dömötör1;2, Zijian Gyoz˝ o˝ Yang1, Attila Novák1 1MTA-PPKE Hungarian Language Technology Research Group, Pázmány Péter Catholic University, Faculty of Information Technology and Bionics Práter u. 50/a, 1083 Budapest, Hungary 2Pázmány Péter Catholic University, Faculty of Humanities and Social Sciences Egyetem u. 1, 2087 Piliscsaba, Hungary {surname.firstname}@itk.ppke.hu Abstract The research presented in this paper concerns zero copulas in Hungarian, i.e. the phenomenon that nominal predicates lack an explicit verbal copula in the default present tense 3rd person indicative case. We created a tool based on the state-of-the-art transformer architecture implemented in Marian NMT framework that can identify and mark the location of zero copulas, i.e. the position where an overt copula would appear in the non-default cases. Our primary aim was to support quantitative corpus-based linguistic research by creating a tool that can be used to compile a corpus of significant size containing examples of nominal predicates including the location of the zero copulas. We created the training corpus for our system transforming sentences containing overt copulas into ones containing zero copula labels. However, we first needed to disambiguate occurrences of the massively ambiguous verb van ‘exist/be/have’. We performed this using a rule-base classifier relying on English translations in the English-Hungarian parallel subcorpus of the OpenSubtitles corpus. -
NWAV 46 Booklet-Oct29
1 PROGRAM BOOKLET October 29, 2017 CONTENTS • The venue and the town • The program • Welcome to NWAV 46 • The team and the reviewers • Sponsors and Book Exhibitors • Student Travel Awards https://english.wisc.edu/nwav46/ • Abstracts o Plenaries Workshops o nwav46 o Panels o Posters and oral presentations • Best student paper and poster @nwav46 • NWAV sexual harassment policy • Participant email addresses Look, folks, this is an electronic booklet. This Table of Contents gives you clues for what to search for and we trust that’s all you need. 2 We’ll have buttons with sets of pronouns … and some with a blank space to write in your own set. 3 The venue and the town We’re assuming you’ll navigate using electronic devices, but here’s some basic info. Here’s a good campus map: http://map.wisc.edu/. The conference will be in Union South, in red below, except for Saturday talks, which will be in the Brogden Psychology Building, just across Johnson Street to the northeast on the map. There are a few places to grab a bite or a drink near Union South and the big concentration of places is on and near State Street, a pedestrian zone that runs east from Memorial Library (top right). 4 The program 5 NWAV 46 2017 Madison, WI Thursday, November 2nd, 2017 12:00 Registration – 5th Quarter Room, Union South pm-6:00 pm Industry Landmark Northwoods Agriculture 1:00- Progress in regression: Discourse analysis for Sociolinguistics and Texts as data 3:00 Statistical and practical variationists forensic speech sources for improvements to Rbrul science: Knowledge- -
Hyman Lusoga Noun Phrase Tonology PLAR
UC Berkeley Phonetics and Phonology Lab Annual Report (2017) Lusoga Noun Phrase Tonology Larry M. Hyman Department of Linguistics University of California, Berkeley 1. Introduction Bantu tone systems have long been known for their syntagmatic properties, including the ability of a tone to assimilate or shift over long distances. Most systems have a surface binary contrast between H(igh) and L(ow) tone, some also a downstepped H which produces a contrast between H-H and H-ꜜH.1 Given their considerable complexity, much research has focused on the tonal alternations that are produced both lexically and post-lexically. Lusoga, the language under examination in this study, is no exception. Although the closest relative to Luganda, whose tone system has been widely studied (see references in Hyman & Katamba 2010), the only two discussions of Lusoga tonology that I am aware of are Yukawa (2000) and van der Wal (2004:20-30), who outline the surface tone patterns of words in isolation, including certain verb tenses, and illustrate some of the alternations. The latter also points out certain resemblances with Luganda: “Two similarities between Luganda and Lusoga are the clear restriction against LH syllables and the maximum of one H to L pitch drop per word” (van der Wal 2004:29). In this paper I extend the tonal description, with particular attention on the relation between underlying and surface tonal representations. As I have pointed out in a number of studies, two-height tone systems are subject to more than one interpretation: First, the contrast may be either equipolent, H vs. -
6 the Major Parts of Speech
6 The Major Parts of Speech KEY CONCEPTS Parts of Speech Major Parts of Speech Nouns Verbs Adjectives Adverbs Appendix: prototypes INTRODUCTION In every language we find groups of words that share grammatical charac- teristics. These groups are called “parts of speech,” and we examine them in this chapter and the next. Though many writers onlanguage refer to “the eight parts of speech” (e.g., Weaver 1996: 254), the actual number of parts of speech we need to recognize in a language is determined by how fine- grained our analysis of the language is—the more fine-grained, the greater the number of parts of speech that will be distinguished. In this book we distinguish nouns, verbs, adjectives, and adverbs (the major parts of speech), and pronouns, wh-words, articles, auxiliary verbs, prepositions, intensifiers, conjunctions, and particles (the minor parts of speech). Every literate person needs at least a minimal understanding of parts of speech in order to be able to use such commonplace items as diction- aries and thesauruses, which classify words according to their parts (and sub-parts) of speech. For example, the American Heritage Dictionary (4th edition, p. xxxi) distinguishes adjectives, adverbs, conjunctions, definite ar- ticles, indefinite articles, interjections, nouns, prepositions, pronouns, and verbs. It also distinguishes transitive, intransitive, and auxiliary verbs. Writ- ers and writing teachers need to know about parts of speech in order to be able to use and teach about style manuals and school grammars. Regardless of their discipline, teachers need this information to be able to help students expand the contexts in which they can effectively communicate. -
The Impact of Function Words on the Processing and Acquisition of Syntax
NORTHWESTERN UNIVERSITY The Impact of Function Words on the Processing and Acquisition of Syntax A DISSERTATION SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS for the degree DOCTOR OF PHILOSOPHY Field of Linguistics By Jessica Peterson Hicks EVANSTON, ILLINOIS December 2006 2 © Copyright by Jessica Peterson Hicks 2006 All Rights Reserved 3 ABSTRACT The Impact of Function Words on the Processing and Acquisition of Syntax Jessica Peterson Hicks This dissertation investigates the role of function words in syntactic processing by studying lexical retrieval in adults and novel word categorization in infants. Christophe and colleagues (1997, in press) found that function words help listeners quickly recognize a word and infer its syntactic category. Here, we show that function words also help listeners make strong on-line predictions about syntactic categories, speeding lexical access. Moreover, we show that infants use this predictive nature of function words to segment and categorize novel words. Two experiments tested whether determiners and auxiliaries could cause category- specific slowdowns in an adult word-spotting task. Adults identified targets faster in grammatical contexts, suggesting that a functor helps the listener construct a syntactic parse that affects the speed of word identification; also, a large prosodic break facilitated target access more than a smaller break. A third experiment measured independent semantic ratings of the stimuli used in Experiments 1 and 2, confirming that the observed grammaticality effect mainly reflects syntactic, and not semantic, processing. Next, two preferential-listening experiments show that by 15 months, infants use function words to infer the category of novel words and to better recognize those words in continuous speech.