Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Linguapax Review 2010 Linguapax Review 2010
LINGUAPAX REVIEW 2010 MATERIALS / 6 / MATERIALS Col·lecció Materials, 6 Linguapax Review 2010 Linguapax Review 2010 Col·lecció Materials, 6 Primera edició: febrer de 2011 Editat per: Amb el suport de : Coordinació editorial: Josep Cru i Lachman Khubchandani Traduccions a l’anglès: Kari Friedenson i Victoria Pounce Revisió dels textos originals en anglès: Kari Friedenson Revisió dels textos originals en francès: Alain Hidoine Disseny i maquetació: Monflorit Eddicions i Assessoraments, sl. ISBN: 978-84-15057-12-3 Els continguts d’aquesta publicació estan subjectes a una llicència de Reconeixe- ment-No comercial-Compartir 2.5 de Creative Commons. Se’n permet còpia, dis- tribució i comunicació pública sense ús comercial, sempre que se’n citi l’autoria i la distribució de les possibles obres derivades es faci amb una llicència igual a la que regula l’obra original. La llicència completa es pot consultar a: «http://creativecom- mons.org/licenses/by-nc-sa/2.5/es/deed.ca» LINGUAPAX REVIEW 2010 Centre UNESCO de Catalunya Barcelona, 2011 4 CONTENTS PRESENTATION Miquel Àngel Essomba 6 FOREWORD Josep Cru 8 1. THE HISTORY OF LINGUAPAX 1.1 Materials for a history of Linguapax 11 Fèlix Martí 1.2 The beginnings of Linguapax 14 Miquel Siguan 1.3 Les débuts du projet Linguapax et sa mise en place 17 au siège de l’UNESCO Joseph Poth 1.4 FIPLV and Linguapax: A Quasi-autobiographical 23 Account Denis Cunningham 1.5 Defending linguistic and cultural diversity 36 1.5 La defensa de la diversitat lingüística i cultural Fèlix Martí 2. GLIMPSES INTO THE WORLD’S LANGUAGES TODAY 2.1 Living together in a multilingual world. -
Program-UPDATED-2019-06-18.Pdf
1 Victoria University - Wellington Māori and Indigenous Analysis Ltd Te Pouhere Kōrero Tū Tama Wahine o Taranaki 2 PROGRAMME INFORMATION DURING THE CONFERENCE Abbreviations Transport to the conference venue GAPA: Gallagher Academy of Performing Arts Complimentary buses will circulate between Hamilton LHC: Local Host Committee City and the University of Waikato throughout the day, Thursday to Saturday. City stops will be Victoria Street, near the Ibis and Novotel hotels, with a Concurrent Session Types morning pick-up at Distinction Hotel, Te Rapa. Film: Film screening and discussion. Hamilton Taxis is offering a special conference Individual Paper Session: Individual papers organized rate for delegates, just mention NAISA when booking. into sessions by NAISA Council with the chair of the Free call 0800 477 477. Local Host Committee. Panel: A chaired panel of scholars who present Karakia | Prayer developed papers on a defined topic. Some panels Every morning, 8:15 am, S Block G.01. have commentators who will discuss the papers. Roundtable: A group discussion of a defined topic that Kaumātua (Elder) Space opens up to the audience. Somewhere for elders to gather and take time out: A Block tearoom. Presentation Room Technology All presentation rooms have: Tā Moko and Healing Space • a PC connected to an audio system and Tā moko and healing services will be available daily in projector; A Block. For appointments, please contact the artists • a DVD player; and and practitioners directly via the conference website. • speakers with an external audio jack. All PCs have a USB port for you to access, and HDMI Registration and VGA input to connect devices such as laptops, Find the registration desk on Thursday in L Block, and including Macs. -
Arxiv:1908.07448V1
Evaluating Contextualized Embeddings on 54 Languages in POS Tagging, Lemmatization and Dependency Parsing Milan Straka and Jana Strakova´ and Jan Hajicˇ Charles University Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics {strakova,straka,hajic}@ufal.mff.cuni.cz Abstract Shared Task (Zeman et al., 2018). • We report our best results on UD 2.3. The We present an extensive evaluation of three addition of contextualized embeddings im- recently proposed methods for contextualized 25% embeddings on 89 corpora in 54 languages provements range from relative error re- of the Universal Dependencies 2.3 in three duction for English treebanks, through 20% tasks: POS tagging, lemmatization, and de- relative error reduction for high resource lan- pendency parsing. Employing the BERT, guages, to 10% relative error reduction for all Flair and ELMo as pretrained embedding in- UD 2.3 languages which have a training set. puts in a strong baseline of UDPipe 2.0, one of the best-performing systems of the 2 Related Work CoNLL 2018 Shared Task and an overall win- ner of the EPE 2018, we present a one-to- A new type of deep contextualized word repre- one comparison of the three contextualized sentation was introduced by Peters et al. (2018). word embedding methods, as well as a com- The proposed embeddings, called ELMo, were ob- parison with word2vec-like pretrained em- tained from internal states of deep bidirectional beddings and with end-to-end character-level word embeddings. We report state-of-the-art language model, pretrained on a large text corpus. results in all three tasks as compared to results Akbik et al. -
Indigenous People of Western New York
FACT SHEET / FEBRUARY 2018 Indigenous People of Western New York Kristin Szczepaniec Territorial Acknowledgement In keeping with regional protocol, I would like to start by acknowledging the traditional territory of the Haudenosaunee and by honoring the sovereignty of the Six Nations–the Mohawk, Cayuga, Onondaga, Oneida, Seneca and Tuscarora–and their land where we are situated and where the majority of this work took place. In this acknowledgement, we hope to demonstrate respect for the treaties that were made on these territories and remorse for the harms and mistakes of the far and recent past; and we pledge to work toward partnership with a spirit of reconciliation and collaboration. Introduction This fact sheet summarizes some of the available history of Indigenous people of North America date their history on the land as “since Indigenous people in what is time immemorial”; some archeologists say that a 12,000 year-old history on now known as Western New this continent is a close estimate.1 Today, the U.S. federal government York and provides information recognizes over 567 American Indian and Alaskan Native tribes and villages on the contemporary state of with 6.7 million people who identify as American Indian or Alaskan, alone Haudenosaunee communities. or combined.2 Intended to shed light on an often overlooked history, it The land that is now known as New York State has a rich history of First includes demographic, Nations people, many of whom continue to influence and play key roles in economic, and health data on shaping the region. This fact sheet offers information about Native people in Indigenous people in Western Western New York from the far and recent past through 2018. -
Extended and Enhanced Polish Dependency Bank in Universal Dependencies Format
Extended and Enhanced Polish Dependency Bank in Universal Dependencies Format Alina Wróblewska Institute of Computer Science Polish Academy of Sciences ul. Jana Kazimierza 5 01-248 Warsaw, Poland [email protected] Abstract even for languages with rich morphology and rel- atively free word order, such as Polish. The paper presents the largest Polish Depen- The supervised learning methods require gold- dency Bank in Universal Dependencies for- mat – PDBUD – with 22K trees and 352K standard training data, whose creation is a time- tokens. PDBUD builds on its previous ver- consuming and expensive process. Nevertheless, sion, i.e. the Polish UD treebank (PL-SZ), dependency treebanks have been created for many and contains all 8K PL-SZ trees. The PL- languages, in particular within the Universal De- SZ trees are checked and possibly corrected pendencies initiative (UD, Nivre et al., 2016). in the current edition of PDBUD. Further The UD leaders aim at developing a cross- 14K trees are automatically converted from linguistically consistent tree annotation schema a new version of Polish Dependency Bank. and at building a large multilingual collection of The PDBUD trees are expanded with the en- hanced edges encoding the shared dependents dependency treebanks annotated according to this and the shared governors of the coordinated schema. conjuncts and with the semantic roles of some Polish is also represented in the Universal dependents. The conducted evaluation exper- Dependencies collection. There are two Polish iments show that PDBUD is large enough treebanks in UD: the Polish UD treebank (PL- for training a high-quality graph-based depen- SZ) converted from Składnica zalezno˙ sciowa´ 1 and dency parser for Polish. -
Universal Dependencies According to BERT: Both More Specific and More General
Universal Dependencies According to BERT: Both More Specific and More General Tomasz Limisiewicz and David Marecekˇ and Rudolf Rosa Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics Charles University, Prague, Czech Republic {limisiewicz,rosa,marecek}@ufal.mff.cuni.cz Abstract former based systems particular heads tend to cap- ture specific dependency relation types (e.g. in one This work focuses on analyzing the form and extent of syntactic abstraction captured by head the attention at the predicate is usually focused BERT by extracting labeled dependency trees on the nominal subject). from self-attentions. We extend understanding of syntax in BERT by Previous work showed that individual BERT examining the ways in which it systematically di- heads tend to encode particular dependency verges from standard annotation (UD). We attempt relation types. We extend these findings by to bridge the gap between them in three ways: explicitly comparing BERT relations to Uni- • We modify the UD annotation of three lin- versal Dependencies (UD) annotations, show- ing that they often do not match one-to-one. guistic phenomena to better match the BERT We suggest a method for relation identification syntax (x3) and syntactic tree construction. Our approach • We introduce a head ensemble method, com- produces significantly more consistent depen- dency trees than previous work, showing that bining multiple heads which capture the same it better explains the syntactic abstractions in dependency relation label (x4) BERT. • We observe and analyze multipurpose heads, At the same time, it can be successfully ap- containing multiple syntactic functions (x7) plied with only a minimal amount of supervi- sion and generalizes well across languages. -
Options for a National Culture Symbol of Cameroon: Can the Bamenda Grassfields Traditional Dress Fit?
EAS Journal of Humanities and Cultural Studies Abbreviated Key Title: EAS J Humanit Cult Stud ISSN: 2663-0958 (Print) & ISSN: 2663-6743 (Online) Published By East African Scholars Publisher, Kenya Volume-2 | Issue-1| Jan-Feb-2020 | DOI: 10.36349/easjhcs.2020.v02i01.003 Research Article Options for a National Culture Symbol of Cameroon: Can the Bamenda Grassfields Traditional Dress Fit? Venantius Kum NGWOH Ph.D* Department of History Faculty of Arts University of Buea, Cameroon Abstract: The national symbols of Cameroon like flag, anthem, coat of arms and seal do not Article History in any way reveal her cultural background because of the political inclination of these signs. Received: 14.01.2020 In global sporting events and gatherings like World Cup and international conferences Accepted: 28.12.2020 respectively, participants who appear in traditional costume usually easily reveal their Published: 17.02.2020 nationalities. The Ghanaian Kente, Kenyan Kitenge, Nigerian Yoruba outfit, Moroccan Journal homepage: Djellaba or Indian Dhoti serve as national cultural insignia of their respective countries. The https://www.easpublisher.com/easjhcs reason why Cameroon is referred in tourist circles as a cultural mosaic is that she harbours numerous strands of culture including indigenous, Gaullist or Francophone and Anglo- Quick Response Code Saxon or Anglophone. Although aspects of indigenous culture, which have been grouped into four spheres, namely Fang-Beti, Grassfields, Sawa and Sudano-Sahelian, are dotted all over the country in multiple ways, Cameroon cannot still boast of a national culture emblem. The purpose of this article is to define the major components of a Cameroonian national culture and further identify which of them can be used as an acceptable domestic cultural device. -
INFORME Situación De Los Derechos Humanos En GUATEMALA En 2011 Y 2012
INFORME SITUACIÓN DE LOS DERECHOS HUMANOS EN GUATEMALA EN 2011 Y 2012 Conflictos agrarios, energéticos y extractivos Departamentos de Alta Verapaz, El Quiché y San Marcos Situación de los derechos humanos en Guatemala en 2011 y 2012 ELABORACIÓN CONTACTO Educación para la Acción Crítica Càtedra Unesco en Sostenibilitat Cátedra UNESCO de Sostenibilidad de la Universitat Politècnica de Catalunya Universitat Politècnica de Catalunya Colom, 1 08222 Terrassa (España) Grupo de Investigación en Derechos Humanos y Sostenibilidad Coordinación del proyecto: Xavier Álvarez EdPAC del Castillo Sant Pere més alt, 31, 2º3ª 08003 Barcelona (España) Coordinación de la investigación: Arturo Landeros [email protected] Redacción: Marco Aparicio, Platera Bart- http://investigaccionddhh.wordpress.com lett, Nuria Cortada, David Espinola, Ieva Giedraitytė, Mónica Gironés, Clara Martínez, Sara Mingorría, Laia Haurie y Arturo Lande- ros MISIÓN DE OBSERVACIÓN EN GUATEMALA EDICIÓN Y COMPAGINACIÓN 2011 Y 2012 EdPAC Alicia Sánchez, Andrea Eguren, Arturo Landeros, Carolina Novella, Clara Martínez, David Espinola, Delphine Ortega, Laia Haurie, Marco Aparicio, Mónica Gironés, Platera Bartlett, Raimon Crespi, Santiago Bonmati, Sara Mingorría y Xavier Álvarez. ILUSTRACIÓN PORTADA Paula Laverde http://paulalaverde.wordpress.com/ TRANSCRIPCIONES ENTREVISTAS Alfredo Rodríguez, Carles Sedó, Delphine Ortega, IMPRESIÓN Elisa Figueroa, Estrella Pineda, Emmanuel Bran, Ieva El Tinter SL Giedraitytė, Iñaki Samaniego, Jordi Farrés, Marina Impreso en papel 100% reciclado Barroso, Marina Domènech, Marta Roda, Mireia Gallego, Montse Artés, Montse Torrents, Neus Riga, Núria Sánchez, Patricia Messa, Samanta Fernández, Santiago Bonmatí, Santos Batzin, Sara Mingorría, Vi- DEPÓSITO LEGAL tor Taveira, Xavi Zulet, Xavier Montanyès y Yamilé Miró. LICENCIA FECHA DE PUBLICACIÓN Creative Commons 31 de enero de 2013 http://creativecommons.org/licenses/by-nd/3.0/ es/ 2 ÍNDICE PRÓLOGO 5 1. -
The Rulers of Palenque a Beginner’S Guide
The Rulers of Palenque A Beginner’s Guide By Joel Skidmore With illustrations by Merle Greene Robertson Citation: 2008 The Rulers of Palenque: A Beginner’s Guide. Third edition. Mesoweb: www. mesoweb.com/palenque/resources/rulers/PalenqueRulers-03.pdf. Publication history: The first edition of this work, in html format, was published in 2000. The second was published in 2007, when the revised edition of Martin and Grube’s Chronicle of the Maya Kings and Queens was still in press, and this third conforms to the final publica- tion (Martin and Grube 2008). To check for a more recent edition, see: www.mesoweb.com/palenque/resources/rulers/rulers.html. Copyright notice: All drawings by Merle Greene Robertson unless otherwise noted. Mesoweb Publications The Rulers of Palenque INTRODUCTION The unsung pioneer in the study of Palenque’s dynastic history is Heinrich Berlin, who in three seminal studies (Berlin 1959, 1965, 1968) provided the essential outline of the dynasty and explicitly identified the name glyphs and likely accession dates of the major Early and Late Classic rulers (Stuart 2005:148-149). More prominent and well deserved credit has gone to Linda Schele and Peter Mathews (1974), who summarized the rulers of Palenque’s Late Classic and gave them working names in Ch’ol Mayan (Stuart 2005:149). The present work is partly based on the transcript by Phil Wanyerka of a hieroglyphic workshop presented by Schele and Mathews at the 1993 Maya Meet- ings at Texas (Schele and Mathews 1993). Essential recourse has also been made to the insights and decipherments of David Stuart, who made his first Palenque Round Table presentation in 1978 at the age of twelve (Stuart 1979) and has recently advanced our understanding of Palenque and its rulers immeasurably (Stuart 2005). -
Assessing the Chitimacha-Totozoquean Hypothesis1
ASSESSING THE CHITIMACHA-TOTOZOQUEAN HYPOTHESIS1 DANIEL W. HIEBER UNIVERSITY OF CALIFORNIA, SANTA BARBARA 1. Introduction2 Scholars have attempted to genetically classify the Chitimacha language of Louisiana ever since the first vocabulary of the language was collected by Martin Duralde in 1802. Since then, there have been numerous attempts to relate Chitimacha to other isolates of the region (Swanton 1919; Swadesh 1946a; Gursky 1969), Muskogean as part of a broader Proto-Gulf hypothesis (Haas 1951; Haas 1952), and even languages as far afield as Yuki in California (Munro 1994). The most recent attempt at classification, however, looks in a new direction, and links Chitimacha with the recently-advanced Totozoquean language family of Mesoamerica (Brown, Wichmann & Beck 2014; Brown et al. 2011), providing 90 cognate sets and a number of 1 [Acknowledgements] 2 Abbreviations used in this paper are as follows: * reconstructed form ** hypothetical form intr. intransitive post. postposition tr. transitive AZR adjectivizer CAUS causative NZR nominalizer PLACT pluractional TRZR transitivizer VZR verbalizer morphological parallels as evidence. Now, recent internal reconstructions in Chitimacha made available in Hieber (2013), as well as a growing understanding of Chitimacha grammar (e.g. Hieber forthcoming), make it possible to assess the Chitimacha- Totozoquean hypothesis in light of more robust data. This paper shows that a more detailed understanding of Chitimacha grammar and lexicon casts doubt on the possibility of a genetic connection between Chitimacha and Mesoamerica. Systematic sound correspondences prove to be unattainable for the data provided in Brown, Wichmann & Beck (2014). However, groups of correspondences do appear in the data, suggestive of diffusion through contact rather than genetic inheritance. -
UCLA Electronic Theses and Dissertations
UCLA UCLA Electronic Theses and Dissertations Title A History of Guelaguetza in Zapotec Communities of the Central Valleys of Oaxaca, 16th Century to the Present Permalink https://escholarship.org/uc/item/7tv1p1rr Author Flores-Marcial, Xochitl Marina Publication Date 2015 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA Los Angeles A History of Guelaguetza in Zapotec Communities of the Central Valleys of Oaxaca, 16th Century to the Present A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in History by Xóchitl Marina Flores-Marcial 2015 © Copyright by Xóchitl Marina Flores-Marcial 2015 ABSTRACT OF THE DISSERTATION A History of Guelaguetza in Zapotec Communities of the Central Valleys of Oaxaca, 16th Century to the Present by Xóchitl Marina Flores-Marcial Doctor of Philosophy in History University of California, Los Angeles, 2015 Professor Kevin B. Terraciano, Chair My project traces the evolution of the Zapotec cultural practice of guelaguetza, an indigenous sharing system of collaboration and exchange in Mexico, from pre-Columbian and colonial times to the present. Ironically, the term "guelaguetza" was appropriated by the Mexican government in the twentieth century to promote an annual dance festival in the city of Oaxaca that has little to do with the actual meaning of the indigenous tradition. My analysis of Zapotec-language alphabetic sources from the Central Valley of Oaxaca, written from the sixteenth to the eighteenth centuries, reveals that Zapotecs actively participated in the sharing system during this long period of transformation. My project demonstrates that the Zapotec sharing economy functioned to build and reinforce social networks among households in Zapotec communities. -
A Critique of the Separation Base Method for Genealogical Subgrouping, with Data from Mixe-Zoquean*
Journal of Quantitative Linguistics 2006, Volume 13, Numbers 2 – 3, pp. 225 – 264 DOI: 10.1080/09296170600850759 A Critique of the Separation Base Method for Genealogical Subgrouping, with Data from Mixe-Zoquean* Michael Cysouw, Søren Wichmann and David Kamholz Max Planck Institute for Evolutionary Anthropology, Leipzig ABSTRACT Holm (2000) proposes the ‘‘separation base’’ method for determining subgroup relationships in a language family. The method is claimed to be superior to most approaches to lexicostatistics because the latter falls victim to the ‘‘proportionality trap’’, that is, the assumption that similarity is proportional to closeness of relationship. The principles underlying Holm’s method are innovative and not obviously incorrect. However, his only demonstration of the method is with Indo-European. This makes it difficult to interpret the results, because higher-order Indo-European subgrouping remains controversial. In order to have some basis for verification, we have tested the method on Mixe-Zoquean, a well-studied family of Mesoamerica whose subgrouping has been established by two scholars working independently and using the traditional comparative method. The results of our application of Holm’s method are significantly different from the currently accepted family tree of Mixe-Zoquean. We identify two basic sources of problems that arise when Holm’s approach is applied to our data. The first is reliance on an etymological dictionary of the proto-language in question, which creates problems of circularity that cannot be overcome. The second is that the method is sensitive to the amount of documentation available for the daughter languages, which has a distorting effect on the computed relationships.