Laurie Bauer T I C S
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies. -
MUFI Character Recommendation V. 3.0: Code Chart Order
MUFI character recommendation Characters in the official Unicode Standard and in the Private Use Area for Medieval texts written in the Latin alphabet ⁋ ※ ð ƿ ᵹ ᴆ ※ ¶ ※ Part 2: Code chart order ※ Version 3.0 (5 July 2009) ※ Compliant with the Unicode Standard version 5.1 ____________________________________________________________________________________________________________________ ※ Medieval Unicode Font Initiative (MUFI) ※ www.mufi.info ISBN 978-82-8088-403-9 MUFI character recommendation ※ Part 2: code chart order version 3.0 p. 2 / 245 Editor Odd Einar Haugen, University of Bergen, Norway. Background Version 1.0 of the MUFI recommendation was published electronically and in hard copy on 8 December 2003. It was the result of an almost two-year-long electronic discussion within the Medieval Unicode Font Initiative (http://www.mufi.info), which was established in July 2001 at the International Medi- eval Congress in Leeds. Version 1.0 contained a total of 828 characters, of which 473 characters were selected from various charts in the official part of the Unicode Standard and 355 were located in the Private Use Area. Version 1.0 of the recommendation is compliant with the Unicode Standard version 4.0. Version 2.0 is a major update, published electronically on 22 December 2006. It contains a few corrections of misprints in version 1.0 and 516 additional char- acters (of which 123 are from charts in the official part of the Unicode Standard and 393 are additions to the Private Use Area). There are also 18 characters which have been decommissioned from the Private Use Area due to the fact that they have been included in later versions of the Unicode Standard (and, in one case, because a character has been withdrawn). -
A Sketch Grammar of Pondi
A SKETCH GRAMMAR OF PONDI A SKETCH GRAMMAR OF PONDI RUSSELL BARLOW ASIA-PACIFIC LINGUISTICS Published by ANU Press The Australian National University Acton ACT 2601, Australia Email: [email protected] Available to download for free at press.anu.edu.au ISBN (print): 9781760463830 ISBN (online): 9781760463847 WorldCat (print): 1175268620 WorldCat (online): 1175268668 DOI: 10.22459/SGP.2020 This title is published under a Creative Commons Attribution-NonCommercial- NoDerivatives 4.0 International (CC BY-NC-ND 4.0). The full licence terms are available at creativecommons.org/licenses/by-nc-nd/4.0/legalcode Cover design and layout by ANU Press. Cover photograph by Russell Barlow. This edition © 2020 ANU Press CONTENTS Acknowledgements . ix List of abbreviations . xi List of maps, figures, and tables . xiii 1 . Introduction . 1 1 .1 Organisation . 1 1.2 Previous research on the language . 2 1.3 Methodology . 2 1.4 Orthography and presentation . 3 1.5 Pondi: The language and its speakers . 3 1.5.1 The name of the language . 4 1.5.2 The environment . 4 1 .5 .3 Langam village . 5 1.5.4 The people . 6 1.5.5 Relationships with neighbouring villages and borrowing . 8 1 .5 .6 Variation . 11 1.6 Language vitality . .11 1 .6 .1 UNESCO’s nine factors . 12 1 .6 .2 EGIDS . 12 1 .6 .3 LEI . 13 1.7 Classification . 14 1.8 Typological overview . 18 2 . Phonetics and phonology . 21 2 .1 Consonants . 21 2 .1 .1 Voiceless stops . 22 2 .1 .2 Prenasalised voiced stops . 23 2.1.3 Prenasalised voiced affricate . -
Using the Unicode Standard for Linguistic Data: Preliminary Guidelines∗
Using the Unicode Standard for Linguistic Data: Preliminary Guidelines∗ Deborah Anderson UC Berkeley 1 Introduction A core concern for E-MELD is the need for a common standard for the digitalization of linguistic data, for the different archiving practices and representation of languages will inhibit accessing data, searching, and scientific investigation.1 While the context is expressed in terms of documenting endangered languages, E- MELD will serve generally as a “showroom of best practices” for linguistic data from all languages. The problem posed by the lack of a single, common standard for text data in various scripts was already evident in the business world in the 1980s, when competing industry and governmental body character encoding standards made the exchange of text data chaotic. This situation led to the creation of a single, universal character encoding standard, Unicode.2 That standard was synchronized with the ISO/IEC 10646, the parallel International Standard maintained by the International Organization for Standardization (ISO). Unicode is now the default character encoding standard for XML. Fortunately, Unicode provides a single standard that can be used by linguists in their work, either when working with transcription, languages with existent writing systems, or those languages that have no established orthography. Indeed, its use is advocated in “Seven Dimensions of Portability for Language Documentation and Description” (Bird and Simons 2002). But specifying “Unicode” as the character encoding does not provide enough guidance for many linguists, who may be puzzled by the structure of the Unicode Standard and how to use it, are unsure which Unicode characters ought to be used in a particular context and whom to ask about this, are uncertain how to report a missing character, etc. -
IPA Braille: an Updated Tactile Representation of the International Phonetic Alphabet
IPA Braille: An Updated Tactile Representation of the International Phonetic Alphabet Print Edition Overview, Tables, and Sample Texts Edited by Robert Englebretson, Ph.D. Produced by CNIB For the International Council on English Braille 2008 List of ICEB Member Countries As Of April, 2008 Australia Canada New Zealand Nigeria South Africa United Kingdom United States ii Table of Contents ACKNOWLEDGEMENTS BY ROBERT ENGLEBRETSON.............................................. V FOREWORD by Fredric K. Schroeder .................................................................. VII ACCLAIM FOR IPA BRAILLE BY MARTHA PAMPERIN.............................................IX 1. INTRODUCTION ............................................................................................. 1 2. THE SYMBOLS OF THE IPA ............................................................................. 5 2.1. CONSONANTS (PULMONIC) .................................................................. 6 2.2. CONSONANTS (NON-PULMONIC) ....................................................... 11 2.3. OTHER SYMBOLS ................................................................................ 13 2.4. VOWELS ............................................................................................. 15 2.5. DIACRITICS ....................................................................................... 17 2.6. SUPRASEGMENTALS ........................................................................... 23 2.7. TONES & WORD ACCENTS.................................................................. -
Language Endangerment in the Sepik Area of Papua New Guinea Alexandra Y. Aikhenvald Research Centre for Linguistic Typology La T
1 Language endangerment in the Sepik area of Papua New Guinea Alexandra Y. Aikhenvald Research Centre for Linguistic Typology La Trobe University, Melbourne 1. Language diversity and language endangerment in Papua New Guinea The island of New Guinea is probably the most linguistically diverse and complex linguistic area in the world, with over 1,000 languages spoken over an area of 900,000 km2 (that is, one language every 900 km2: Foley 1986: 8). Seventy-five percent of these languages belong to Non- Austronesian families often referred to as 'Papuan' (see Foley 1986: 1-3; Dixon 1991a: 245)1. The state of Papua New Guinea (independent since 1975) features about 830 languages (Nekitel 1998; Ford ms; Landweer 2000), with the number of Papuan languages exceeding 600 (see Foley 1986: 1-3; Dixon 1991a: 245). Its official languages are English, Tok Pisin and Hiri Motu (also called Police Motu). Tok Pisin is currently the most important language spoken in most provinces. English is less dominant, but is widely gaining ground, especially in West Sepik (Sandaun) province and a number of other Coastal provinces (see Sankoff 1980: 126-70). Hiri Motu is even more restricted (for instance, it is not known at all in either of the two Sepik provinces). Bi- and trilingualism in Tok Pisin and English is quickly expanding. According to materials in Sankoff (1980: 129-30), in 1971 the percentage of Papua New Guineans age ten and over who are unable to speak any of the official languages was 17.6% in East Sepik and 35.9% in Sandaun. -
The Unicode Cookbook for Linguists
The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran Michael Cysouw language Translation and Multilingual Natural science press Language Processing 10 Translation and Multilingual Natural Language Processing Editors: Oliver Czulo (Universität Leipzig), Silvia Hansen-Schirra (Johannes Gutenberg-Universität Mainz), Reinhard Rapp (Johannes Gutenberg-Universität Mainz) In this series: 1. Fantinuoli, Claudio & Federico Zanettin (eds.). New directions in corpus-based translation studies. 2. Hansen-Schirra, Silvia & Sambor Grucza (eds.). Eyetracking and Applied Linguistics. 3. Neumann, Stella, Oliver Čulo & Silvia Hansen-Schirra (eds.). Annotation, exploitation and evaluation of parallel corpora: TC3 I. 4. Czulo, Oliver & Silvia Hansen-Schirra (eds.). Crossroads between Contrastive Linguistics, Translation Studies and Machine Translation: TC3 II. 5. Rehm, Georg, Felix Sasaki, Daniel Stein & Andreas Witt (eds.). Language technologies for a multilingual Europe: TC3 III. 6. Menzel, Katrin, Ekaterina Lapshinova-Koltunski & Kerstin Anna Kunz (eds.). New perspectives on cohesion and coherence: Implications for translation. 7. Hansen-Schirra, Silvia, Oliver Czulo & Sascha Hofmann (eds). Empirical modelling of translation and interpreting. 8. Svoboda, Tomáš, Łucja Biel & Krzysztof Łoboda (eds.). Quality aspects in institutional translation. 9. Fox, Wendy. Can integrated titles improve the viewing experience? Investigating the impact of subtitling on the reception and enjoyment of film using eye tracking and questionnaire data. 10. Moran, Steven & Michael Cysouw. The Unicode cookbook for linguists: Managing writing systems using orthography profiles ISSN: 2364-8899 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran Michael Cysouw language science press Steven Moran & Michael Cysouw. 2018. The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles (Translation and Multilingual Natural Language Processing 10). -
3.5 Plurals Ending in -Ate
A SKETCH GRAMMAR OF PONDI A SKETCH GRAMMAR OF PONDI RUSSELL BARLOW ASIA-PACIFIC LINGUISTICS Published by ANU Press The Australian National University Acton ACT 2601, Australia Email: [email protected] Available to download for free at press.anu.edu.au ISBN (print): 9781760463830 ISBN (online): 9781760463847 WorldCat (print): 1175268620 WorldCat (online): 1175268668 DOI: 10.22459/SGP.2020 This title is published under a Creative Commons Attribution-NonCommercial- NoDerivatives 4.0 International (CC BY-NC-ND 4.0). The full licence terms are available at creativecommons.org/licenses/by-nc-nd/4.0/legalcode Cover design and layout by ANU Press. Cover photograph by Russell Barlow. This edition © 2020 ANU Press CONTENTS Acknowledgements . ix List of abbreviations . xi List of maps, figures, and tables . xiii 1 . Introduction . 1 1 .1 Organisation . 1 1.2 Previous research on the language . 2 1.3 Methodology . 2 1.4 Orthography and presentation . 3 1.5 Pondi: The language and its speakers . 3 1.5.1 The name of the language . 4 1.5.2 The environment . 4 1 .5 .3 Langam village . 5 1.5.4 The people . 6 1.5.5 Relationships with neighbouring villages and borrowing . 8 1 .5 .6 Variation . 11 1.6 Language vitality . .11 1 .6 .1 UNESCO’s nine factors . 12 1 .6 .2 EGIDS . 12 1 .6 .3 LEI . 13 1.7 Classification . 14 1.8 Typological overview . 18 2 . Phonetics and phonology . 21 2 .1 Consonants . 21 2 .1 .1 Voiceless stops . 22 2 .1 .2 Prenasalised voiced stops . 23 2.1.3 Prenasalised voiced affricate . -
A Sociolinguistic Survey of the Lower Ramu Languages of Papua New Guinea (Awar [Aya], Bosmun [Bqs], Kayan [Kct], and Marangis [Wax])
DigitalResources Electronic Survey Report 2015-009 A Sociolinguistic Survey of the Lower Ramu Languages of Papua New Guinea (Awar [aya], Bosmun [bqs], Kayan [kct], and Marangis [wax]) Brian Paris A Sociolinguistic Survey of the Lower Ramu Languages of Papua New Guinea (Awar [aya], Bosmun [bqs], Kayan [kct], and Marangis [wax]) Brian Paris SIL International® 2015 SIL Electronic Survey Report 2015-009, September 2015 © 2015 SIL International® All rights reserved Abstract This sociolinguistic survey of the Lower Ramu language groups was undertaken to assess the feasibility of the four target language groups—Awar [aya], Bosmun [bqs], Kayan [kct], and Marangis [wax]— joining the Papua New Guinea branch of Pioneer Bible Translators’ Lower Ramu Initial Goals Project (LRIG), a proposed multi-language development project. The goals of the survey were to 1) assess the vitality of the vernacular for each target group, 2) assess the willingness of the target groups to work with the other potential language groups in the LRIG project, and 3) assess the interest of the target groups in vernacular language development. The survey found that vitality among the Awar, with an Expanded Graded Intergenerational Disruption Scale (EGIDS) score of 7 “Shifting,” was too low to be involved in LRIG, high enough among the Kayan and Marangis (both EGIDS 6a “Vigorous”), and (EGIDS 6b “Threatened”) among the Bosmun. All of the groups expressed willingness to work together in the LRIG project, and no current or past behavior contradicts this. All of the groups expressed a desire for vernacular development, though the motivations expressed by the Awar do not match the goals of the LRIG project. -
A Sketch Grammar of Pondi
A SKETCH GRAMMAR OF PONDI A SKETCH GRAMMAR OF PONDI RUSSELL BARLOW ASIA-PACIFIC LINGUISTICS Published by ANU Press The Australian National University Acton ACT 2601, Australia Email: [email protected] Available to download for free at press.anu.edu.au ISBN (print): 9781760463830 ISBN (online): 9781760463847 WorldCat (print): 1175268620 WorldCat (online): 1175268668 DOI: 10.22459/SGP.2020 This title is published under a Creative Commons Attribution-NonCommercial- NoDerivatives 4.0 International (CC BY-NC-ND 4.0). The full licence terms are available at creativecommons.org/licenses/by-nc-nd/4.0/legalcode Cover design and layout by ANU Press. Cover photograph by Russell Barlow. This edition © 2020 ANU Press CONTENTS Acknowledgements . ix List of abbreviations . xi List of maps, figures, and tables . xiii 1 . Introduction . 1 1 .1 Organisation . 1 1.2 Previous research on the language . 2 1.3 Methodology . 2 1.4 Orthography and presentation . 3 1.5 Pondi: The language and its speakers . 3 1.5.1 The name of the language . 4 1.5.2 The environment . 4 1 .5 .3 Langam village . 5 1.5.4 The people . 6 1.5.5 Relationships with neighbouring villages and borrowing . 8 1 .5 .6 Variation . 11 1.6 Language vitality . .11 1 .6 .1 UNESCO’s nine factors . 12 1 .6 .2 EGIDS . 12 1 .6 .3 LEI . 13 1.7 Classification . 14 1.8 Typological overview . 18 2 . Phonetics and phonology . 21 2 .1 Consonants . 21 2 .1 .1 Voiceless stops . 22 2 .1 .2 Prenasalised voiced stops . 23 2.1.3 Prenasalised voiced affricate . -
Ipa-Braille-Final.Pdf
IPA Braille: An Updated Tactile Representation of the International Phonetic Alphabet Print Edition Overview, Tables, and Sample Texts Edited by Robert Englebretson, Ph.D. Produced by CNIB For the International Council on English Braille 2008 List of ICEB Member Countries As Of April, 2008 Australia Canada New Zealand Nigeria South Africa United Kingdom United States ii Table of Contents ACKNOWLEDGEMENTS BY ROBERT ENGLEBRETSON.............................................. V FOREWORD by Fredric K. Schroeder .................................................................. VII ACCLAIM FOR IPA BRAILLE BY MARTHA PAMPERIN.............................................IX 1. INTRODUCTION ............................................................................................. 1 2. THE SYMBOLS OF THE IPA ............................................................................. 5 2.1. CONSONANTS (PULMONIC) .................................................................. 6 2.2. CONSONANTS (NON-PULMONIC) ....................................................... 11 2.3. OTHER SYMBOLS ................................................................................ 13 2.4. VOWELS ............................................................................................. 15 2.5. DIACRITICS ....................................................................................... 17 2.6. SUPRASEGMENTALS ........................................................................... 23 2.7. TONES & WORD ACCENTS.................................................................. -
Laurie Bauer T I C S
The T h Linguistics Student's e L i n Handbook g u The i s Laurie Bauer t i c s ‘This book is a good idea. It’s not a textbook but a linguistics manual, full of information about concepts and practices and written in clear, straightforward language. It contains, for S t example, a mini language-file (for the many languages referred to in textbooks without u further explanation); brief notes on central concepts in linguistics; the conventions of writing d e Linguistics essays and assignments; technical terms that are have two or more uses and a list of online resources for linguists. There is something here for everyone engaged in Linguistics, from first- n t ' year undergraduates through final-year undergraduates and postgraduates to teaching staff. s I foresee it being a constant companion.’ H Professor Jim Miller, University of Auckland a n The book that tells you all the things you felt you were expected to know about linguistics, but d b Student's were afraid to ask about. o • What do you know about Burushaski and Miwok? o k • What’s the difference between paradigmatic and syntagmatic? • What is E-language? • What is a language? Handbook • What does the symbol ‘*’ mean in linguistics? • What do you call the letter ? L • Do parenthetical and non-restrictive mean the same thing? a • How do you write a bibiliographic entry for a work you have not seen? u r i Every student who has asked these questions needs this book. A compendium of useful things e for linguistics students to know, from the IPA chart to the Saussurean dichotomies, this book will be the constant companion of anyone undertaking studies of linguistics.