Phonological Competence in the Design of a New Orthography
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The Lexicographical Contribution to Welsh
Coping with an expanding vocabulary: the lexicographical contribution to Welsh Abstract The Welsh language, as a lesser-used language with English as an immediate neighbour, has inevitably borrowed much of its vocabulary from that language (or its precursors) as well as inheriting a considerable vocabulary from Latin via Brythonic. Welsh lexicography dates back to at least the 15th-century bardic glossaries and the first Welsh dictionary was printed in the mid-16th century. Most Welsh dictionaries are bilingual with either Latin or English, and Welsh lexicographers have been surprisingly influential in the development of the lexicon with many of their neologisms being adopted into the modern lexicon. Modern prescriptive lexicography and terminology have tended to promote what some linguists deride as a ‘purist’ approach but which has in fact proved to be a practical solution which has succeeded in expanding the vocabulary in a way acceptable to many speakers. Inevitably, since all are fully bilingual, Welsh speakers in Wales have continued to borrow vocabulary extensively from English. 1. Introduction Yr ydym wedi bod yn rhy selog o lawer, yn y dyddiau a fu, yn dyfeisio geiriau newydd am bob dim …, gan dybied mai iaith bur yw iaith a’i geiriau’n perthyn dim i eiriau’r un iaith arall. Ffolineb yw hynny: y mae gan bob iaith allu i fabwysiadu geiriau estronol yn eiriau iddi ei hun. Ac y mae yr iaith Gymraeg wedi mabwysiadu cannoedd o eiriau Lladin a geiriau Saesneg erioed, ac y maent yn eitha-geiriau Cymraeg erbyn hyn. Y prawf o air Cymraeg yw, a arferir ef gan y bobl ai peidio? os y gwneir, y mae yn air Cymraeg; os na wneir, nid yw. -
Some Principles of the Use of Macro-Areas Language Dynamics &A
Online Appendix for Harald Hammarstr¨om& Mark Donohue (2014) Some Principles of the Use of Macro-Areas Language Dynamics & Change Harald Hammarstr¨om& Mark Donohue The following document lists the languages of the world and their as- signment to the macro-areas described in the main body of the paper as well as the WALS macro-area for languages featured in the WALS 2005 edi- tion. 7160 languages are included, which represent all languages for which we had coordinates available1. Every language is given with its ISO-639-3 code (if it has one) for proper identification. The mapping between WALS languages and ISO-codes was done by using the mapping downloadable from the 2011 online WALS edition2 (because a number of errors in the mapping were corrected for the 2011 edition). 38 WALS languages are not given an ISO-code in the 2011 mapping, 36 of these have been assigned their appropri- ate iso-code based on the sources the WALS lists for the respective language. This was not possible for Tasmanian (WALS-code: tsm) because the WALS mixes data from very different Tasmanian languages and for Kualan (WALS- code: kua) because no source is given. 17 WALS-languages were assigned ISO-codes which have subsequently been retired { these have been assigned their appropriate updated ISO-code. In many cases, a WALS-language is mapped to several ISO-codes. As this has no bearing for the assignment to macro-areas, multiple mappings have been retained. 1There are another couple of hundred languages which are attested but for which our database currently lacks coordinates. -
Proposal for a Korean Script Root Zone LGR 1 General Information
(internal doc. #: klgp220_101f_proposal_korean_lgr-25jan18-en_v103.doc) Proposal for a Korean Script Root Zone LGR LGR Version 1.0 Date: 2018-01-25 Document version: 1.03 Authors: Korean Script Generation Panel 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Korean Script LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document below: • proposal-korean-lgr-25jan18-en.xml Labels for testing can be found in the accompanying text document below: • korean-test-labels-25jan18-en.txt In Section 3, we will see the background on Korean script (Hangul + Hanja) and principal language using it, i.e., Korean language. The overall development process and methodology will be reviewed in Section 4. The repertoire and variant groups in K-LGR will be discussed in Sections 5 and 6, respectively. In Section 7, Whole Label Evaluation Rules (WLE) will be described and then contributors for K-LGR are shown in Section 8. Several appendices are included with separate files. proposal-korean-lgr-25jan18-en 1 / 73 1/17 2 Script for which the LGR is proposed ISO 15924 Code: Kore ISO 15924 Key Number: 287 (= 286 + 500) ISO 15924 English Name: Korean (alias for Hangul + Han) Native name of the script: 한글 + 한자 Maximal Starting Repertoire (MSR) version: MSR-2 [241] Note. -
Running Head: ORTHOGRAPHIC DEPTH and ORTHOGRAPHIC PROCESSING 1
Running head: ORTHOGRAPHIC DEPTH AND ORTHOGRAPHIC PROCESSING 1 This is a post-peer-review, pre-copyedit version of an article accepted in "Reading and Writing". The final authenticated version will be available at link.springer.com. The Effect of Orthographic Depth on Letter String Processing: The Case of Visual Attention Span and Rapid Automatized Naming Alexia Antzaka (ORCID iD: 0000-0002-3975-1122) Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Departamento de Lengua Vasca y Comunicación, UPV/EHU, 48940, Leioa, Spain Clara Martin (ORCID iD: 0000-0003-2701-5045) Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Ikerbasque, Basque Foundation for Science, 48013, Bilbao, Spain Sendy Caffarra (ORCID iD: 0000-0003-3667-5061) Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Sophie Schlöffel Running head: ORTHOGRAPHIC DEPTH AND ORTHOGRAPHIC PROCESSING 1 Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Departamento de Lengua Vasca y Comunicación, UPV/EHU, 48940, Leioa, Spain Manuel Carreiras (ORCID iD: 0000-0001-6726-7613) Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Departamento de Lengua Vasca y Comunicación, UPV/EHU, 48940, Leioa, Spain Ikerbasque, Basque Foundation for Science, 48013, Bilbao, Spain Marie Lallier (ORCID iD: 0000-0003-4340-1296) Basque Center on Cognition, Brain and Language, 20009, San Sebastián, Spain Author note The authors acknowledge financial support from the Basque Government (PRE_2015_2_0049 to A.A, PI_2015_1_25 to C.M, PRE_2015_2_0247 to S.S), the European Research Council (ERC-2011-ADG-295362 to M.C.), the Spanish Ministry of Economy and Competitiveness (PSI20153653383P to M.L., PSI20153673533R to M. -
Old and Middle Welsh David Willis ([email protected]) Department of Linguistics, University of Cambridge
Old and Middle Welsh David Willis ([email protected]) Department of Linguistics, University of Cambridge 1 INTRODUCTION The Welsh language emerged from the increasing dialect differentiation of the ancestral Brythonic language (also known as British or Brittonic) in the wake of the withdrawal of the Roman administration from Britain and the subsequent migration of Germanic speakers to Britain from the fifth century. Conventionally, Welsh is treated as a separate language from the mid sixth century. By this time, Brythonic speakers, who once occupied the whole of Britain apart from the north of Scotland, had been driven out of most of what is now England. Some Brythonic-speakers had migrated to Brittany from the late fifth century. Others had been pushed westwards and northwards into Wales, western and southwestern England, Cumbria and other parts of northern England and southern Scotland. With the defeat of the Romano-British forces at Dyrham in 577, the Britons in Wales were cut off by land from those in the west and southwest of England. Linguistically more important, final unstressed syllables were lost (apocope) in all varieties of Brythonic at about this time, a change intimately connected to the loss of morphological case. These changes are traditionally seen as having had such a drastic effect on the structure of the language as to mark a watershed in the development of Brythonic. From this period on, linguists refer to the Brythonic varieties spoken in Wales as Welsh; those in the west and southwest of England as Cornish; and those in Brittany as Breton. A fourth Brythonic language, Cumbric, emerged in the north of England, but died out, without leaving written records, in perhaps the eleventh century. -
Harmonizing the Orthography of Gĩkũyũ and Kĩkamba.Pdf
NAME, DATE HERE GĨKŨYŨ AND KĨKAMBA 39 CHAPTER THREE HARMONIZING THE ORTHOGRAPHY OF GĨKŨYŨ AND KĨKAMBA Angelina Nduku Kioko, Martin C. Njoroge and Peter Mburu Kuria INTRODUCTION The term orthography is derived from the Greek word ‘orthos’ which means ‘correct’, and ‘graphein’, which stands for ‘to write’ (Sampson, 1985). The orthography of a language describes or defines the set of symbols (graphemes and diacritics) used to represent the phonemic inventory of that language in the writing and the rules on how to write these symbols. According to Massamba (1986), a language takes a limited number of sounds from the central pool of speech sounds to form its phonetic inventory. In this chapter, orthography is used to refer to the system of symbols used in the writing system of Gĩkũyũ and Kĩkamba. There are three types of orthographies (Read, 1983: 143-152). The first is the ‘phonemic orthography’. In a ‘phonemic’ orthography there is a one-to-one correspondence between phonemes and graphemes. This type of orthography has a dedicated sequence of symbol or symbols for each phoneme. Examples of languages that have phonemic ortho graphies are Korean and Kiswahili. The second type is the ‘morpho-phonemic orthography’ which considers both the phonemic features and the underlying structure of words. In this case, words may be written in the same way despite differences in pronunciation. For example, the pronunciation of the plural marker in English {s} is conditioned by the phonetic environment in which it occurs, yet it is written with the same grapheme <s>. The plural forms ‘cats’ and ‘dogs’ are pronounced as [kӕts] and [dɒɡz] respectively although the two final sounds are written 40 THE HARMONIZATION AND STANDARDIZATION OF KENYAN LANGUAGES with the grapheme <s>. -
1 English Spelling and Pronunciation
ISSN: 2456-8104 http://www.jrspelt.com Issue 5, Vol. 2, 2018 English Spelling and Pronunciation - A Brief Study Prof. V. Chandra Sekhar Rao ([email protected] ) Professor in English, SITECH, Hyderabad Abstract The present paper aims at the correlation between spelling and pronunciation of English words. English spelling is almost divorced from its pronunciation and there is no perfect guide how to 1 learn the pronunciation of the words. The letters of alphabet used are always inadequate to represent the sounds. English alphabet contains only 26 letters but the sounds 44. IPA symbols are needed to understand the intelligibility of the pronunciation and the spelling-designed. Learners of English language have to understand that words from other languages may be adopted without being adapted to the spelling system. Most of the letters of English alphabet produce multiple pronunciations. English Pronouncing Dictionary is needed for better understanding of the spelling and pronunciation. Keywords: Spelling and Pronunciation, Orthography, Intelligibility, Phonetic Symbols Introduction "If we know the sounds of a word (in English) we can't know how to spell it; if we know the ` spelling, we can't know how to pronounce it." (Otto Jespersen, philologist, Essentials of English Grammar, 1905, page 11). "English spelling is almost divorced from its pronunciation and forms hardly any guide as to how words should be pronounced." (Mont Follick, The Case for Spelling Reform, 1964, page 87). English, as a global language of communication, is spoken, written and used widely for many different purposes - international diplomatic relations, business, science and technology. It is also called the library language and medium of instructions in higher education - science and technology, computer and software engineering, medicine and law, pharmacy and nursing, commerce and management, fashion technology and so on. -
Y in Medieval Welsh Orthography Sims-Williams, Patrick
Aberystwyth University ‘Dark’ and ‘Clear’ Y in Medieval Welsh Orthography Sims-Williams, Patrick Published in: Transactions of the Philological Society DOI: 10.1111/1467-968X.12205 Publication date: 2021 Citation for published version (APA): Sims-Williams, P. (2021). ‘Dark’ and ‘Clear’ Y in Medieval Welsh Orthography: Caligula versus Teilo. Transactions of the Philological Society, 1191, 9-47. https://doi.org/10.1111/1467-968X.12205 Document License CC BY-NC-ND General rights Copyright and moral rights for the publications made accessible in the Aberystwyth Research Portal (the Institutional Repository) are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the Aberystwyth Research Portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the Aberystwyth Research Portal Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. tel: +44 1970 62 2400 email: [email protected] Download date: 28. Sep. 2021 Transactions of the Philological Society Volume 1191 (2021) 9–47 doi: 10.1111/1467-968X.12205 ‘DARK’ AND ‘CLEAR’ Y IN MEDIEVAL WELSH ORTHOGRAPHY: CALIGULA VERSUS TEILO By PATRICK SIMS-WILLIAMS Aberystwyth University (Submitted: 26 May, 2020; Accepted: 20 January, 2021) ABSTRACT A famous exception to the ‘phonetic spelling system’ of Welsh is the use of <y> for both /ǝ/ and the retracted high vowel /ɨ(:)/. -
Why Learning to Read Is Easier in Welsh Than in English: Orthographic Transparency Effects Evinced with Frequency-Matched Tests
Applied Psycholinguistics 22 (2001), 571–599 Printed in the United States of America Why learning to read is easier in Welsh than in English: Orthographic transparency effects evinced with frequency-matched tests NICK C. ELLIS and A. MARI HOOPER University of Wales, Bangor ADDRESS FOR CORRESPONDENCE Nick C. Ellis, School of Psychology, University of Wales, Bangor, Gwynedd LL57 2DG, United Kingdom. E-mail: [email protected] ABSTRACT This study compared the rate of literacy acquisition in orthographically transparent Welsh and ortho- graphically opaque English using reading tests that were equated for frequency of written exposure. Year 2 English-educated monolingual children were compared with Welsh-educated bilingual chil- dren, matched for reading instruction, background, locale, and math ability. Welsh children were able to read aloud accurately significantly more of their language (61% of tokens, 1821 types) than were English children (52% tokens, 716 types), allowing them to read aloud beyond their compre- hension levels (168 vs. 116%, respectively). Various observations suggested that Welsh readers were more reliant on an alphabetic decoding strategy: word length determined 70% of reading latency in Welsh but only 22% in English, and Welsh reading errors tended to be nonword mispro- nunciations, whereas English children made more real word substitutions and null attempts. These findings demonstrate that the orthographic transparency of a language can have a profound effect on the rate of acquisition and style of reading adopted by its speakers. Systems that are noisy and inconsistent are harder to sort out than systems that are reliable and categorical. There is now a large body of research demonstrating that greater ambiguity in the mappings between the forms and functions of a particular language causes less successful learning because of a larger degree of competition among the cues in the learning set (Bates & MacWhinney, 1987; MacWhinney, 1987). -
Welsh on The
Technical Report No: 2003/16 Issues in Creating HTML Pages with Welsh or bilingual Content John Dyke 26 November 2003 Department of Computing Faculty of Mathematics and Computing The Open University Walton Hall, Milton Keynes MK7 6AA United Kingdom http://computing.open.ac.uk l Issues in Creating HTML Pages with Welsh or bilingual Content John Dyke Summary Using utf-8 encoding web pages in HTML, XHTML or XML can contain all the letters used in Welsh including all of the diacritical marks used. These pages are rendered and display correctly on Microsoft's Internet Explorer from version 4 onwards and Netscape's browser from version 4. Opera supports all of the characters correctly from version 6 albeit with font substitution occurring for some of the less frequently used diacritical marks on w and y: the acen grom, the most frequently used diacritical mark, is rendered correctly. Opera's version 5.12 provides a reasonably good coverage. It displays the vowels a,e,i,o and u correctly with all diacritical marks but renders w, W, y and Y without an acen grom but produces blanks for the other less frequently used diacritical marks used over these characters. For wider support on older browsers, named character entities should be used for the characters a, e, i, o and u with diacritical marks. The remaining vowels w and y should coded either without diacritical marks or by some other representation e.g. the character followed by the diacritical mark (w^ in place of ŵ) The language used on a page should be denoted by using the lang attribute in the HTML mark up. -
Handouts for Advanced Phonology: a Course Packet Steve Parker GIAL
Handouts for Advanced Phonology: A Course Packet Steve Parker GIAL and SIL International Dallas, 2016 Copyright © 2016 by Steve Parker and other contributors Page 1 of 304 Preface This set of materials is designed to be used as handouts accompanying an advanced course in phonology, particularly at the graduate level. It is specifically intended to be used in conjunction with two textbooks: Phonology in generative grammar (Kenstowicz 1994), and Optimality theory (Kager 1999). However, this course packet could potentially also be adapted for use with other phonology textbooks. The materials included here have been developed by myself and others over many years, in conjunction with courses in phonology taught at SIL programs in North Dakota, Oregon, Dallas, and Norman, OK. Most recently I have used them at GIAL. Many of the special phonetic characters appearing in these materials use IPA fonts available as freeware from the SIL International website. Unless indicated to the contrary on specific individual handouts, all materials used in this packet are the copyright of Steve Parker. These documents are intended primarily for educational use. You may make copies of these works for research or instructional purposes (under fair use guidelines) free of charge and without further permission. However, republication or commercial use of these materials is expressly prohibited without my prior written consent. Steve Parker Graduate Institute of Applied Linguistics Dallas, 2016 Page 2 of 304 1 Table of contents: list of handouts included in this packet Day 1: Distinctive features — their definitions and uses -Pike’s premises for phonological analysis ......................................................................... 7 -Phonemics analysis flow chart .......................................................................................... -
The Impact of Orthography on the Acquisition of L2 Phonology:1
Coutsougera, The Impact of Orthography on the Acquisition of L2 Phonology:1 The impact of orthography on the acquisition of L2 phonology: inferring the wrong phonology from print Photini Coutsougera, University of Cyprus 1 The orthographic systems of English and Greek The aim of this study is to investigate how the deep orthography of English influences the acquisition of L2 English phonetics/phonology by L1 Greek learners, given that Greek has a shallow orthography. Greek and English deploy two fundamentally different orthographies. The Greek orthography, despite violating one-letter-to-one-phoneme correspondence, is shallow or transparent. This is because although the Greek orthographic system has a surplus of letters/digraphs for vowel sounds (e.g. sound /i/ is represented in six different ways in the orthography); each letter/digraph has one reading. There are very few other discrepancies between letters and sounds, which are nevertheless handled by specific, straightforward rules. As a result, there is only one possible way of reading a written form. The opposite, however, does not hold, i.e. a speaker of Greek cannot predict the spelling of a word when provided with the pronunciation. In contrast, as often cited in the literature, English has a deep or non transparent orthography since it allows for the same letter to represent more than one sound or for the same sound to be represented by more than one letter. Other discrepancies between letters and sounds - also well reported or even overstated in the literature - are of rather lesser importance (e.g. silent letters existing mainly for historical reasons etc).