Nouns and Individuation
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Why Is Language Typology Possible?
Why is language typology possible? Martin Haspelmath 1 Languages are incomparable Each language has its own system. Each language has its own categories. Each language is a world of its own. 2 Or are all languages like Latin? nominative the book genitive of the book dative to the book accusative the book ablative from the book 3 Or are all languages like English? 4 How could languages be compared? If languages are so different: What could be possible tertia comparationis (= entities that are identical across comparanda and thus permit comparison)? 5 Three approaches • Indeed, language typology is impossible (non- aprioristic structuralism) • Typology is possible based on cross-linguistic categories (aprioristic generativism) • Typology is possible without cross-linguistic categories (non-aprioristic typology) 6 Non-aprioristic structuralism: Franz Boas (1858-1942) The categories chosen for description in the Handbook “depend entirely on the inner form of each language...” Boas, Franz. 1911. Introduction to The Handbook of American Indian Languages. 7 Non-aprioristic structuralism: Ferdinand de Saussure (1857-1913) “dans la langue il n’y a que des différences...” (In a language there are only differences) i.e. all categories are determined by the ways in which they differ from other categories, and each language has different ways of cutting up the sound space and the meaning space de Saussure, Ferdinand. 1915. Cours de linguistique générale. 8 Example: Datives across languages cf. Haspelmath, Martin. 2003. The geometry of grammatical meaning: semantic maps and cross-linguistic comparison 9 Example: Datives across languages 10 Example: Datives across languages 11 Non-aprioristic structuralism: Peter H. Matthews (University of Cambridge) Matthews 1997:199: "To ask whether a language 'has' some category is...to ask a fairly sophisticated question.. -
Modeling Language Variation and Universals: a Survey on Typological Linguistics for Natural Language Processing
Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing Edoardo Ponti, Helen O ’Horan, Yevgeni Berzak, Ivan Vulic, Roi Reichart, Thierry Poibeau, Ekaterina Shutova, Anna Korhonen To cite this version: Edoardo Ponti, Helen O ’Horan, Yevgeni Berzak, Ivan Vulic, Roi Reichart, et al.. Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing. 2018. hal-01856176 HAL Id: hal-01856176 https://hal.archives-ouvertes.fr/hal-01856176 Preprint submitted on 9 Aug 2018 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Modeling Language Variation and Universals: A Survey on Typological Linguistics for Natural Language Processing Edoardo Maria Ponti∗ Helen O’Horan∗∗ LTL, University of Cambridge LTL, University of Cambridge Yevgeni Berzaky Ivan Vuli´cz Department of Brain and Cognitive LTL, University of Cambridge Sciences, MIT Roi Reichart§ Thierry Poibeau# Faculty of Industrial Engineering and LATTICE Lab, CNRS and ENS/PSL and Management, Technion - IIT Univ. Sorbonne nouvelle/USPC Ekaterina Shutova** Anna Korhonenyy ILLC, University of Amsterdam LTL, University of Cambridge Understanding cross-lingual variation is essential for the development of effective multilingual natural language processing (NLP) applications. -
Urdu Treebank
Sci.Int.(Lahore),28(4),3581-3585, 2016 ISSN 1013-5316;CODEN: SINTE 8 3581 URDU TREEBANK Muddassira Arshad, Aasim Ali Punjab University College of Information Technology (PUCIT), University of the Punjab, Lahore [email protected], [email protected] (Presented at the 5th International. Multidisciplinary Conference, 29-31 Oct., at, ICBS, Lahore) ABSTRACT: Treebank is a parsed corpus of text annotated with the syntactic information in the form of tags that yield the phrasal information within the corpus. The first outcome of this study is to design a phrasal and functional tag set for Urdu by minimal changes in the tag set of Penn Treebank, that has become a de-facto standard. Our tag set comprises of 22 phrasal tags and 20 functional tags. Second outcome of this work is the development of initial Treebank for Urdu, which remains publically available to the research and development community. There are 500 sentences of Urdu translation of religious text with an average length of 12 words. Context free grammar containing 109 rules and 313 lexical entries, for Urdu has been extracted from this Treebank. An online parser is used to verify the coverage of grammar on 50 test sentences with 80% recall. However, multiple parse trees are generated against each test sentence. 1 INTRODUCTION grammars for parsing[5], where statistical models are used The term "Treebank" was introduced in 2003. It is defined as for broad-coverage parsing [6]. "linguistically annotated corpus that includes some In literature, various techniques have been applied for grammatical analysis beyond the part of speech level" [1]. -
Linguistic Profiles: a Quantitative Approach to Theoretical Questions
Laura A. Janda USA – Norway, Th e University of Tromsø – Th e Arctic University of Norway Linguistic profi les: A quantitative approach to theoretical questions Key words: cognitive linguistics, linguistic profi les, statistical analysis, theoretical linguistics Ключевые слова: когнитивная лингвистика, лингвистические профили, статисти- ческий анализ, теоретическая лингвистика Abstract A major challenge in linguistics today is to take advantage of the data and sophisticated analytical tools available to us in the service of questions that are both theoretically inter- esting and practically useful. I offer linguistic profi les as one way to join theoretical insight to empirical research. Linguistic profi les are a more narrowly targeted type of behavioral profi ling, focusing on a specifi c factor and how it is distributed across language forms. I present case studies using Russian data and illustrating three types of linguistic profi ling analyses: grammatical profi les, semantic profi les, and constructional profi les. In connection with each case study I identify theoretical issues and show how they can be operationalized by means of profi les. The fi ndings represent real gains in our understanding of Russian grammar that can also be utilized in both pedagogical and computational applications. 1. Th e quantitative turn in linguistics and the theoretical challenge I recently conducted a study of articles published since the inauguration of the journal Cognitive Linguistics in 1990. At the year 2008, Cognitive Linguistics took a quanti- tative turn; since that point over 50% of the scholarly articles that journal publishes have involved quantitative analysis of linguistic data [Janda 2013: 4–6]. Though my sample was limited to only one journal, this trend is endemic in linguistics, and it is motivated by a confl uence of historic factors. -
Morphological Processing in the Brain: the Good (Inflection), the Bad (Derivation) and the Ugly (Compounding)
Morphological processing in the brain: the good (inflection), the bad (derivation) and the ugly (compounding) Article Published Version Creative Commons: Attribution 4.0 (CC-BY) Open Access Leminen, A., Smolka, E., Duñabeitia, J. A. and Pliatsikas, C. (2019) Morphological processing in the brain: the good (inflection), the bad (derivation) and the ugly (compounding). Cortex, 116. pp. 4-44. ISSN 0010-9452 doi: https://doi.org/10.1016/j.cortex.2018.08.016 Available at http://centaur.reading.ac.uk/78769/ It is advisable to refer to the publisher’s version if you intend to cite from the work. See Guidance on citing . To link to this article DOI: http://dx.doi.org/10.1016/j.cortex.2018.08.016 Publisher: Elsevier All outputs in CentAUR are protected by Intellectual Property Rights law, including copyright law. Copyright and IPR is retained by the creators or other copyright holders. Terms and conditions for use of this material are defined in the End User Agreement . www.reading.ac.uk/centaur CentAUR Central Archive at the University of Reading Reading’s research outputs online cortex 116 (2019) 4e44 Available online at www.sciencedirect.com ScienceDirect Journal homepage: www.elsevier.com/locate/cortex Special issue: Review Morphological processing in the brain: The good (inflection), the bad (derivation) and the ugly (compounding) Alina Leminen a,b,*,1, Eva Smolka c,1, Jon A. Dunabeitia~ d,e and Christos Pliatsikas f a Cognitive Science, Department of Digital Humanities, Faculty of Arts, University of Helsinki, Finland b Cognitive Brain Research -
Syntactic Variation in English Quantified Noun Phrases with All, Whole, Both and Half
Syntactic variation in English quantified noun phrases with all, whole, both and half Acta Wexionensia Nr 38/2004 Humaniora Syntactic variation in English quantified noun phrases with all, whole, both and half Maria Estling Vannestål Växjö University Press Abstract Estling Vannestål, Maria, 2004. Syntactic variation in English quantified noun phrases with all, whole, both and half, Acta Wexionensia nr 38/2004. ISSN: 1404-4307, ISBN: 91-7636-406-2. Written in English. The overall aim of the present study is to investigate syntactic variation in certain Present-day English noun phrase types including the quantifiers all, whole, both and half (e.g. a half hour vs. half an hour). More specific research questions concerns the overall frequency distribution of the variants, how they are distrib- uted across regions and media and what linguistic factors influence the choice of variant. The study is based on corpus material comprising three newspapers from 1995 (The Independent, The New York Times and The Sydney Morning Herald) and two spoken corpora (the dialogue component of the BNC and the Longman Spoken American Corpus). The book presents a number of previously not discussed issues with respect to all, whole, both and half. The study of distribution shows that one form often predominated greatly over the other(s) and that there were several cases of re- gional variation. A number of linguistic factors further seem to be involved for each of the variables analysed, such as the syntactic function of the noun phrase and the presence of certain elements in the NP or its near co-text. -
Formal Semantics
HG2002 Semantics and Pragmatics Formal Semantics Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/ [email protected] Lecture 10 Location: HSS Auditorium Creative Commons Attribution License: you are free to share and adapt as long as you give appropriate credit and add no additional restrictions: https://creativecommons.org/licenses/by/4.0/. HG2002 (2018) Overview ã Revision: Components ã Quantifiers and Higher Order Logic ã Modality ã (Dynamic Approaches to Discourse) ã Next Lecture: Chapter 11 — Cognitive Semantics HG2002 (2018) 1 Revision: Componential Analysis HG2002 (2018) 2 Break word meaning into its components ã components allow a compact description ã interact with morphology/syntax ã form part of our cognitive architecture ã For example: woman [female] [adult] [human] spinster [female] [adult] [human] [unmarried] bachelor [male] [adult] [human] [unmarried] wife [female] [adult] [human] [married] ã We can make things more economical (fewer components): woman [+female] [+adult] [+human] spinster [+female] [+adult] [+human] [–married] bachelor [–female] [+adult] [+human] [–married] wife [+female] [+adult] [+human] [+married] HG2002 (2018) 3 Defining Relations using Components ã hyponymy: P is a hyponym of Q if all the components of Q are also in P. spinster ⊂ woman; wife ⊂ woman ã incompatibility: P is incompatible with Q if they share some components but differ in one or more contrasting components spinster ̸≈ wife ã Redundancy Rules [+human] ! [+animate] [+animate] ! [+concrete] [+married] -
Inflection), the Bad (Derivation) and the Ugly (Compounding)
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Central Archive at the University of Reading Morphological processing in the brain: the good (inflection), the bad (derivation) and the ugly (compounding) Article Published Version Creative Commons: Attribution 4.0 (CC-BY) Open Access Leminen, A., Smolka, E., Duñabeitia, J. A. and Pliatsikas, C. (2019) Morphological processing in the brain: the good (inflection), the bad (derivation) and the ugly (compounding). Cortex, 116. pp. 4-44. ISSN 0010-9452 doi: https://doi.org/10.1016/j.cortex.2018.08.016 Available at http://centaur.reading.ac.uk/78769/ It is advisable to refer to the publisher's version if you intend to cite from the work. See Guidance on citing . To link to this article DOI: http://dx.doi.org/10.1016/j.cortex.2018.08.016 Publisher: Elsevier All outputs in CentAUR are protected by Intellectual Property Rights law, including copyright law. Copyright and IPR is retained by the creators or other copyright holders. Terms and conditions for use of this material are defined in the End User Agreement . www.reading.ac.uk/centaur CentAUR Central Archive at the University of Reading Reading's research outputs online cortex 116 (2019) 4e44 Available online at www.sciencedirect.com ScienceDirect Journal homepage: www.elsevier.com/locate/cortex Special issue: Review Morphological processing in the brain: The good (inflection), the bad (derivation) and the ugly (compounding) Alina Leminen a,b,*,1, Eva Smolka c,1, Jon A. Dunabeitia~ d,e and Christos Pliatsikas -
Number and Adjectives : the Case of Activity and Quality Nominals Delphine Beauseroy, Marie-Laurence Knittel
Number and adjectives : the case of activity and quality nominals Delphine Beauseroy, Marie-Laurence Knittel To cite this version: Delphine Beauseroy, Marie-Laurence Knittel. Number and adjectives : the case of activity and quality nominals. 2012. hal-00418040v2 HAL Id: hal-00418040 https://hal.archives-ouvertes.fr/hal-00418040v2 Preprint submitted on 11 Jun 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 1 NUMBER AND ADJECTIVES: THE CASE OF FRENCH ACTIVITY AND QUALITY NOMINALS 1. INTRODUCTION This article is dedicated to the examination of the role of Number with regards to adjective distribution in French. We focus on two kinds of abstract nouns: activity nominals and quality nominals. Both display particular behaviours with regards to adjective distribution: activity nominals need to appear as count nouns to be modified by qualifying adjectives; concerning quality nominals, they are frequently introduced by the indefinite un(e) instead of the partitive article (du / de la) when modified. Our analysis of adjectives is based on the idea that they can have two uses, which correlate with syntactic and semantic restrictions and are distinguishable on semantic grounds. Adjectives are understood either as taxonomic, i.e. -
Cross-Linguistic Evidence for Semantic Countability1
Eun-Joo Kwak 55 Journal of Universal Language 15-2 September 2014, 55-76 Cross-Linguistic Evidence for Semantic Countability1 2 Eun-Joo Kwak Sejong University, Korea Abstract Countability and plurality (or singularity) are basically marked in syntax or morphology, and languages adopt different strategies in the mass-count distinction and number marking: plural marking, unmarked number marking, singularization, and different uses of classifiers. Diverse patterns of grammatical strategies are observed with cross-linguistic data in this study. Based on this, it is concluded that although countability is not solely determined by the semantic properties of nouns, it is much more affected by semantics than it appears. Moreover, semantic features of nouns are useful to account for apparent idiosyncratic behaviors of nouns and sentences. Keywords: countability, plurality, countability shift, individuation, animacy, classifier * This work is supported by the Sejong University Research Grant of 2013. Eun-Joo Kwak Department of English Language and Literature, Sejong University, Seoul, Korea Phone: +82-2-3408-3633; Email: [email protected] Received August 14, 2014; Revised September 3, 2014; Accepted September 10, 2014. 56 Cross-Linguistic Evidence for Semantic Countability 1. Introduction The state of affairs in the real world may be delivered in a different way depending on the grammatical properties of languages. Nominal countability makes part of grammatical differences cross-linguistically, marked in various ways: plural (or singular) morphemes for nouns or verbs, distinct uses of determiners, and the occurrences of classifiers. Apparently, countability and plurality are mainly marked in syntax and morphology, so they may be understood as having less connection to the semantic features of nouns. -
A Case Study in Language Change
Western Michigan University ScholarWorks at WMU Honors Theses Lee Honors College 4-17-2013 Glottopoeia: A Case Study in Language Change Ian Hollenbaugh Western Michigan University, [email protected] Follow this and additional works at: https://scholarworks.wmich.edu/honors_theses Part of the Other English Language and Literature Commons Recommended Citation Hollenbaugh, Ian, "Glottopoeia: A Case Study in Language Change" (2013). Honors Theses. 2243. https://scholarworks.wmich.edu/honors_theses/2243 This Honors Thesis-Open Access is brought to you for free and open access by the Lee Honors College at ScholarWorks at WMU. It has been accepted for inclusion in Honors Theses by an authorized administrator of ScholarWorks at WMU. For more information, please contact [email protected]. An Elementary Ghau Aethauic Grammar By Ian Hollenbaugh 1 i. Foreword This is an essential grammar for any serious student of Ghau Aethau. Mr. Hollenbaugh has done an excellent job in cataloguing and explaining the many grammatical features of one of the most complex language systems ever spoken. Now published for the first time with an introduction by my former colleague and premier Ghau Aethauic scholar, Philip Logos, who has worked closely with young Hollenbaugh as both mentor and editor, this is sure to be the definitive grammar for students and teachers alike in the field of New Classics for many years to come. John Townsend, Ph.D Professor Emeritus University of Nunavut 2 ii. Author’s Preface This grammar, though as yet incomplete, serves as my confession to what J.R.R. Tolkien once called “a secret vice.” History has proven Professor Tolkien right in thinking that this is not a bizarre or freak occurrence, undergone by only the very whimsical, but rather a common “hobby,” one which many partake in, and have partaken in since at least the time of Hildegard of Bingen in the twelfth century C.E. -
Parsing the SYNTAGRUS Treebank of Russian
Parsing the SYNTAGRUS Treebank of Russian Joakim Nivre Igor M. Boguslavsky Leonid L. Iomdin Vaxj¨ o¨ University and Universidad Politecnica´ Russian Academy Uppsala University de Madrid of Sciences [email protected] Departamento de Institute for Information Inteligencia Artificial Transmission Problems [email protected] [email protected] Abstract example, Nivre et al. (2007a) observe that the lan- guages included in the 2007 CoNLL shared task We present the first results on parsing the can be divided into three distinct groups with re- SYNTAGRUS treebank of Russian with a spect to top accuracy scores, with relatively low data-driven dependency parser, achieving accuracy for richly inflected languages like Arabic a labeled attachment score of over 82% and Basque, medium accuracy for agglutinating and an unlabeled attachment score of 89%. languages like Hungarian and Turkish, and high A feature analysis shows that high parsing accuracy for more configurational languages like accuracy is crucially dependent on the use English and Chinese. A complicating factor in this of both lexical and morphological features. kind of comparison is the fact that the syntactic an- We conjecture that the latter result can be notation in treebanks varies across languages, in generalized to richly inflected languages in such a way that it is very difficult to tease apart the general, provided that sufficient amounts impact on parsing accuracy of linguistic structure, of training data are available. on the one hand, and linguistic annotation, on the other. It is also worth noting that the majority of 1 Introduction the data sets used in the CoNLL shared tasks are Dependency-based syntactic parsing has become not derived from treebanks with genuine depen- increasingly popular in computational linguistics dency annotation, but have been obtained through in recent years.