Notes for Contributors Policy: SKY Journal of Linguistics Welcomes Unpublished Original Works from Authors of All Nationalities and Theoretical Persuasions
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Semi-Supervised Learning of HMM to Build a POS Tagger for a Low Resourced Language
J. lnf. Commun. Converg. Eng. 18(4): 207-215, Dec. 2020 Regular paper A Semi-supervised Learning of HMM to Build a POS Tagger for a Low Resourced Language 1* 2 3 Sagarika Pattnaik , Ajit Kumar Nayak , and Srikanta Patnaik 1Department of Computer Science and Engineering, SOA, Deemed to be University, Bhubaneswar, 751030, India 2Department of Computer Science and Information Technology, SOA, Deemed to be University, Bhubaneswar, 751030, India 3Department of Computer Science and Engineering, SOA, Deemed to be University, Bhubaneswar, 751030, India Abstract Part of speech (POS) tagging is an indispensable part of major NLP models. Its progress can be perceived on number of languages around the globe especially with respect to European languages. But considering Indian Languages, it has not got a major breakthrough due lack of supporting tools and resources. Particularly for Odia language it has not marked its dominancy yet. With a motive to make the language Odia fit into different NLP operations, this paper makes an attempt to develop a POS tagger for the said language on a HMM (Hidden Markov Model) platform. The tagger judiciously considers bigram HMM with dynamic Viterbi algorithm to give an output annotated text with maximum accuracy. The model is experimented on a corpus belonging to tourism domain accounting to a size of approximately 0.2 million tokens. With the proportion of training and testing as 3:1, the proposed model exhibits satisfactory result irrespective of limited training size. Index Terms: HMM, NLP, Odia, POS tagger I. INTRODUCTION the morphological complexities of various languages has been a major issue. The process is also limited by the availability of Part-of-speech (POS) tagging or grammatical tagging, an required resources, such as a sufficient training corpus and lin- integral part of significant natural language processing (NLP) guistic information related to the language. -
The Assignment of Grammatical Gender in German: Testing Optimal Gender Assignment Theory
The Assignment of Grammatical Gender in German: Testing Optimal Gender Assignment Theory Emma Charlotte Corteen Trinity Hall September 2018 This dissertation is submitted for the degree of Doctor of Philosophy The Assignment of Grammatical Gender in German: Testing Optimal Gender Assignment Theory Emma Charlotte Corteen Abstract The assignment of grammatical gender in German is a notoriously problematic phenomenon due to the apparent opacity of the gender assignment system (e.g. Comrie 1999: 461). Various models of German gender assignment have been proposed (e.g. Spitz 1965, Köpcke 1982, Corbett 1991, Wegener 1995), but none of these is able to account for all of the German data. This thesis investigates a relatively under-explored, recent approach to German gender assignment in the form of Optimal Gender Assignment Theory (OGAT), proposed by Rice (2006). Using the framework of Optimality Theory, OGAT claims that the form and meaning of a noun are of equal importance with respect to its gender. This is formally represented by the crucial equal ranking of all gender assignment constraints in a block of GENDER FEATURES, which is in turn ranked above a default markedness hierarchy *NEUTER » *FEMININE » *MASCULINE, which is based on category size. A key weakness of OGAT is that it does not specify what constitutes a valid GENDER FEATURES constraint. This means that, in theory, any constraint can be proposed ad hoc to ensure that an OGAT analysis yields the correct result. In order to prevent any constraints based on ‘postfactum rationalisations’ (Comrie 1999: 461) from being included in the investigation, the GENDER FEATURES constraints which have been proposed in the literature for German are assessed according to six criteria suggested by Enger (2009), which seek to determine whether there is independent evidence for a GENDER FEATURES constraint. -
Lara Mantovan Nominal Modification in Italian Sign Language Sign Languages and Deaf Communities
Lara Mantovan Nominal Modification in Italian Sign Language Sign Languages and Deaf Communities Editors Annika Herrmann, Markus Steinbach, Ulrike Zeshan Editorial board Carlo Geraci, Rachel McKee, Victoria Nyst, Sibaji Panda, Marianne Rossi Stumpf, Felix Sze, Sandra Wood Volume 8 Lara Mantovan Nominal Modification in Italian Sign Language ISHARA PRESS ISBN 978-1-5015-1343-5 e-ISBN (PDF) 978-1-5015-0485-3 e-ISBN (EPUB) 978-1-5015-0481-5 ISSN 2192-516X e-ISSN 2192-5178 Library of Congress Cataloging-in-Publication Data A CIP catalog record for this book has been applied for at the Library of Congress. Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de. © 2017 Walter de Gruyter Inc., Boston/Berlin and Ishara Press, Preston, UK Printing and binding: CPI books GmbH, Leck ♾ Printed on acid-free paper Printed in Germany www.degruyter.com E Pluribus Unum (uncertain origin, attributed to Virgilio, Moretum, v. 103) Acknowledgements This book is a revised version of my 2015 dissertation which was approved for the PhD degree in Linguistics at Ca’ Foscari University of Venice. When I first plunged into the world of academic research, almost five years ago, I would never have imagined it was possible to achieve such an important milestone. Being so close to finalizing this book, I would like to look back briefly and remember and thank all the people who showed me the way, supported me, and encouraged me to grow both academically and personally. -
5 Nouns and Determiners
5 Nouns and determiners 5.1 Types of noun phrase Noun classes: count, noncount, and proper nouns Concrete and abstract nouns Nouns with dual class membership Reclassification Partitive constructions Partition in respect of quality Partition in respect of quantity Measure partitive nouns Noncount nouns and their count equivalents Determinatives Central determiners The articles Other central determiners Central determiners and noun classes Predeterminers , All, both, half All and whole The multipliers double, twice, etc The fractions one-third, two-fifihs, etc Postdeterminers (a) Cardinal numerals (b) Ordinal numerals and 'general ordinals' (c) Closed-class quantifiers (d) Open-class quantifiers The use of articles with common nouns Specific and generic reference Specific reference : definite and indefinite Uses of the definite article (a) Immediate situation (b) Larger situation (general knowledge) (c) Anaphoric reference: direct (d) Anaphoric reference: indirect (e) Cataphoric reference (f) Sporadic reference (g) The 'logical' use of the (h) The use of the with reference to body parts Uses of the indefinite article Nonreferring uses of the indefinite article (Bb) Pluralia tantum ending in -S The indefinite article and the numeral one (BC) Unmarked plural nouns: people, police, etc Uses of the zero article (C) Regular plurals The zero article compared with unstressedsome The pronunciation of the regular plural The zero article with definite meaning The spelling of the regular plural Noun phrases in a copular relation (D) Irregular plurals -
Proceedings of the LFG13 Conference
Proceedings of LFG13 Miriam Butt and Tracy Holloway King (Editors) 2013 CSLI Publications http://csli-publications.stanford.edu/ Contents 1 Dedication 4 2 Editors’ Note 5 3 Yasir Alotaibi, Muhammad Alzaidi, Maris Camilleri, Shaimaa ElSadek and Louisa Sadler: Psychological Predicates and Verbal Complemen- tation in Arabic 6 4 I Wayan Arka: Nominal Aspect in Marori 27 5 Doug Arnold and Louisa Sadler: Displaced Dependent Constructions 48 6 Daniele Artoni and Marco Magnani: LFG Contributions in Second Language Acquisition Research: The Development of Case in Russian L2 69 7 Oleg Belyaev: Optimal Agreement at M-structure: Person in Dargwa 90 8 Ansu Berg, Rigardt Pretorius and Laurette Pretorius: The Represen- tation of Setswana Double Objects in LFG 111 9 Tina Bögel: A Prosodic Resolution of German Case Ambiguities 131 10 Kersti Börjars and John Payne: Dimensions of Variation in the Ex- pression of Functional Features: Modelling Definiteness in LFG 152 11 George Aaron Broadwell: An Emphatic Auxiliary Construction for Emotions in Copala Triqui 171 12 Özlem Çetinoglu,˘ Sina Zarrieß and Jonas Kuhn: Dependency-based Sentence Simplification for Increasing Deep LFG Parsing Coverage 191 13 Elizabeth Christie: Result XPs and the Argument-Adjunct Distinction212 14 Cheikh Bamba Dione: Valency Change and Complex Predicates in Wolof: An LFG Account 232 15 Lachlan Duncan: Non-verbal Predicates in K’ichee’ Mayan: An LFG Approach 253 16 Dag Haug: Partial Control and the Semantics of Anaphoric Control in LFG 274 17 Annette Hautli-Janisz: Moving Right Along: -
A Survey on Odia Computational Morphology
International Journal of Advanced Research in Computer Engineering & Technology (IJARCET) Volume 3 Issue 3, March 2014 A Survey on Odia Computational Morphology Dhabal Prasad Sethi ABSTRACT In natural language processing morphological analyzer, The rest part is organized as section II describes the literature morphological generator and morphological parser is the survey, section III describes the different methods of essential common tool used in developing language morphological analyzer and generator, section IV describes software. Morphological analyzer is a computer program the applications and section V describes the conclusion. which provides the grammatical information of a word. II.LITERATURE SURVEY Morphological generator, by giving/providing the root word Itisree Jena, Sriram Chaudhury, Himani Chaudhry, Dipti M. and grammatical information, it generates all word forms of Sharma [3] presented a paper named “Developing Oriya that word. On the other hand morphological parsing is the Morphological Analyzer Using Lt-toolbox “.They have process of determining the morphemes from given word? In developed the morphological analyzer for Odia using this article I surveys the different work have done about paradigm approach. The paradigm approach is a method Odia morphology. which defines the all the word form of a given stem and its Keyword: Morphology, Morphological Analyzer, associated feature structure. Their system handles the Morphological Generator, Morphological Parser, Information inflectional morphology of noun, verb and adjectives. Retrieval, Stemmer, IE, Spell Checker, Machine Translation, R.C Balabantray, M.K.Jena S.Mohanty[4] presented a paper Question Answering System. named “Shallow Morphology based complex predicates extraction in Oriya” The aim of their paper is to extracting the I. -
Chapter 7 Gender in Walman Matthew S
Chapter 7 Gender in Walman Matthew S. Dryer University at Buffalo In this paper, I describe gender and gender-like phenomena in Walman, a language of the Torricelli family spoken on the north coast of Papua New Guinea. I discuss three topics. One of these is the two clear instances of gender in Walman, mascu- line and feminine. I discuss the formal realization of gender in Walman and the factors governing the choice of masculine versus feminine gender. There are also two gender-like phenomena in Walman, namely pluralia tantum nouns and a diminutive category. Pluralia tantum nouns in Walman are different from pluralia tantum nouns in European languages in that what makes them gram- matically plural is not their form, but the fact that they control plural agreement. What makes pluralia tantum gender-like is that there are twice as many pluralia tantum nouns in our data as there are nouns that are lexically masculine. The second gender-like phenomenon in Walman is a diminutive category, whichis coded in the same way as feminine singular, masculine singular, and plural. What makes it unlike phenomena that are normally considered instances of gender in other languages is the fact that there are no lexically diminutive nouns and any noun can be associated with diminutive agreement. Keywords: gender, masculine, feminine, diminutive, pluralia tantum, Walman, Tor- ricelli. 1 Introduction The goal of this paper is to give a description of gender in Walman, a languagein the Torricelli family spoken in Papua New Guinea. I understand gender to denote a morphosyntactic category in a language based on a division among nouns in the language and on agreement phenomena related to this division. -
Pluralia Tantum Nouns and the Theory of Features: a Typology of Nouns with Non-Canonical Number Properties
Pluralia tantum nouns and the theory of features: a typology of nouns with non- canonical number properties Greville G. Corbett Morphology ISSN 1871-5621 Volume 29 Number 1 Morphology (2019) 29:51-108 DOI 10.1007/s11525-018-9336-0 1 23 Your article is published under the Creative Commons Attribution license which allows users to read, copy, distribute and make derivative works, as long as the author of the original work is cited. You may self- archive this article on your own website, an institutional repository or funder’s repository and make it publicly available immediately. 1 23 Morphology (2019) 29:51–108 https://doi.org/10.1007/s11525-018-9336-0 Pluralia tantum nouns and the theory of features: a typology of nouns with non-canonical number properties Greville G. Corbett1 Received: 27 April 2018 / Accepted: 9 November 2018 / Published online: 5 December 2018 © The Author(s) 2018 Abstract Many languages have an interesting class of nouns, the pluralia tantum, which have restricted number possibilities when, in some sense, they should not. Thus English binoculars has no singular, which is worth noting (that is, it is not predictable). True, there are other nouns denoting items consisting of two signifi- cant parts which behave similarly (spectacles, trousers...);indeed they are subject to ‘middle-size generalizations’ (Koenig 1999). But there are two reasons to note such nouns. First there are many English nouns equally denoting items consisting of two significant parts which are unremarkable in this respect: bicycle, bigraph, Bactrian camel, couple, duo. And second, there are languages with number systems roughly comparable to that of English in which the equivalents of binoculars and trousers are normal count nouns: Russian binokl’, French pantalon. -
Sociolinguistic Investigation in Language Death and Revitalization PJAEE, 17 (5) (2020)
The Destiny of Human Languages: Will they all Die? Sociolinguistic Investigation in Language Death and Revitalization PJAEE, 17 (5) (2020) The Destiny of Human Languages: Will they all Die? Sociolinguistic Investigation in Language Death and Revitalization Amir Ibrahim Khudair Al-Omraniyah Sc. School, Directorate General of Education in Diyala [email protected] Amir Ibrahim Khudair. The Destiny of Human Languages: Will they all Die? Sociolinguistic Investigation in Language Death and Revitalization-- Palarch’s Journal of Archaeology of Egypt/Egyptology 17(5), 1354-1370. ISSN 1567-214x Keywords: Language death, endangered language, language suicide language resurrection. Abstract: In this essay the researcher focuses on language death, its internal and external causes in addition to its consequences on the speakers, cultures and knowledge as well. Then, he displays examples of American and European indigenous languages accompanied with the economic and political conditions within which these languages began retreating, declining and then dying out. The sorrowful point in this essay culminates when it deals with language suicide. This linguistic tragedy occurs when the speakers wittingly decide to leave their language under an illusory impression that their own native language is but an inferior one and they have to adopt another superior language due to economic and political interests. Finally, the mechanisms of revitalization of the dying and endangered languages are also investigated to create hope for the next generations to preserve our identity and our universal language and culture diversity. Introduction: Since the dawn of the industrial epoch the tropical rainforests have begun the process of erosion and the rate of their atrophy was very slow and unnoticeable. -
Grammatical Gender and Linguistic Complexity
Grammatical gender and linguistic complexity Volume I: General issues and specific studies Edited by Francesca Di Garbo Bruno Olsson Bernhard Wälchli language Studies in Diversity Linguistics 26 science press Studies in Diversity Linguistics Editor: Martin Haspelmath In this series: 1. Handschuh, Corinna. A typology of marked-S languages. 2. Rießler, Michael. Adjective attribution. 3. Klamer, Marian (ed.). The Alor-Pantar languages: History and typology. 4. Berghäll, Liisa. A grammar of Mauwake (Papua New Guinea). 5. Wilbur, Joshua. A grammar of Pite Saami. 6. Dahl, Östen. Grammaticalization in the North: Noun phrase morphosyntax in Scandinavian vernaculars. 7. Schackow, Diana. A grammar of Yakkha. 8. Liljegren, Henrik. A grammar of Palula. 9. Shimelman, Aviva. A grammar of Yauyos Quechua. 10. Rudin, Catherine & Bryan James Gordon (eds.). Advances in the study of Siouan languages and linguistics. 11. Kluge, Angela. A grammar of Papuan Malay. 12. Kieviet, Paulus. A grammar of Rapa Nui. 13. Michaud, Alexis. Tone in Yongning Na: Lexical tones and morphotonology. 14. Enfield, N. J. (ed.). Dependencies in language: On the causal ontology of linguistic systems. 15. Gutman, Ariel. Attributive constructions in North-Eastern Neo-Aramaic. 16. Bisang, Walter & Andrej Malchukov (eds.). Unity and diversity in grammaticalization scenarios. 17. Stenzel, Kristine & Bruna Franchetto (eds.). On this and other worlds: Voices from Amazonia. 18. Paggio, Patrizia and Albert Gatt (eds.). The languages of Malta. 19. Seržant, Ilja A. & Alena Witzlack-Makarevich (eds.). Diachrony of differential argument marking. 20. Hölzl, Andreas. A typology of questions in Northeast Asia and beyond: An ecological perspective. 21. Riesberg, Sonja, Asako Shiohara & Atsuko Utsumi (eds.). Perspectives on information structure in Austronesian languages. -
Crimean Roma
V. TOROPOV Crimean Roma Language and folklore 2009 V. Toropov Crimean Roma Language and Folklore 2009 Toropov V.G. Crimean Roma: Language and Folklore. — Ivanovo: ―Unona‖ Publishing House, 2009. — 340 pages. (8 pages of illustrations) The book is a collection of all scholarly-known records of Crimean Roma language belonging to the period of the second half of the XIX — the beginning of the XXI century. The book contains lists of words and texts of various contents. Each record is published in the original language, accompanied by a translation/retelling and commentaries. The texts, together with reviews, will help readers to better understand different aspects of philology, culture and ethnography belonging to Crimean Roma. The edition was supported in the frame work of VORBA (Viable Opportunities for Romani Books Access) project, with the kind assistance of Next Page Foundation, funded by the Open Society Institute — Budapest. Izdanija adale knižkakiri esas podd΄eržani proekt΄esa ―VORBA‖ (Pal dila te oven o knižkes e Romane čhibakere) E lošame Nekst Pejğ΄ Faundejšenakere jardym΄esa, Oupen Sosajeti Institut΄eske – Budap΄ešti – lovenge Acknowledgements: L.N. Cherenkov (scientific editor) Prof. A.V. Stepanov (translator) Angela Tropea (editor) Prof. G.M. Vishnevskaya (editor) Prof. A.V. Vishnevski (editor) M.V. Diomina (editor) Prof. V.A. Godlevski (musical notes) ISBN 978-5-89729-118-2 © V.G. Toropov, 2009 (author‘s text) © A.V. Stepanov, 2009 (translation) © V. B. Volchenkov, 2009 (pictures) 2 ON THE REASONS FOR WRITING THIS BOOK The author of this book devoted almost thirty years to the studies of such fragile and volatile phenomenon of the human culture as the language and the folklore of a small ethnic group – Crimean Roma – that emerged in the Crimea out of the people who had come here from what are now Moldova and Romania. -
Classicism of Odia Language
Orissa Review Classicism of Odia Language Dr. G.K. Srichandan Man expresses his feelings and emotions through neighbouring languages are Hindi, Bengali, Telgu, sentences of any language-spoken or written. Tamil etc. In addition to this 62 denominations Thus, human beings use many words in sentences (communities) of tribals live within the State of of language which have come to us by oral use Odisha. Their spoken languages or dialects had for hundreds of thousand years. Words are have impact on Odia language for her evolution evolved and use for expression of man¶s inner and enrichment. feelings. We borrow such words from other languages also to supplement our own mother Except Surya dynasty rulers all other tongue. preceding dynasty rulers have come from outside and brought with them soldiers and officers The vocabulary of a language depends speaking their native language and after settling in on the richness or treasure of words accumulated Udra Desha had embraced the then prevailing and stored in that language by passage of time Odia language. That had impact on development from the time immemorial. Odia is the mother of the vocabulary of Odia language. language of the majority of the people of Odisha at present and also in the past. The present In 1568 Odisha empire lost her Odisha is the political entity with distinct boundary. sovereignty and independence on sudden and sad But outside Odisha there exists Odia speaking demise of the last independent and sovereign tracts which was the result or resultant of the Hindu king-emperor Gajapati Mukunda Deva in conquests made by the Ganga Vansi and Surya Gohiri Tikira battle field.