The Bangla-Asamiya Script and Its Representation in Unicode

Total Page:16

File Type:pdf, Size:1020Kb

The Bangla-Asamiya Script and Its Representation in Unicode THE BANGLA-ASAMIY A SCRIPT AND ITS REPRESENTATION IN UNICODE Probal Dasgupta University of Hyderabad Gautam Sengupta Jadavpur University. Ko/katal o. Introductory Remarks The script describ~d here is one that the Eastem Indo-Aryan languages BangIa (or Bengali) and Asamiya (or Assamese), the establlshed and traditional loci of this script, share with Santali )-&'#& is also written ' other scripts), Manipuri an" several other languages! As the linguistics and graphology of those languages come to be better understood, the behav'our of this shared script ' those settings -ill playa crucial role, as all newly opened up "omains do, ' redrawi g the overall picture of this script. Until that development takes place, we are confined to the study of the major languages that have traditionally used this and only this script. Furthermore, the background of the aut&ors compels an emphasis on the way the script functions ' the BangIa setting, treating the Asamiya material as a brief supplement to a overall BangIa-focused story. The present authors regret but cannot avoid this bias, -&'#& should be easy ,or Asamiya scholars to correct. For some #urrent Asamiya-base" remarks on this share" script, see Goswami a " Tamuli )1223*! The present paper seeks to cover the synchronic realities at the levels of linguistic a " graphological description, including the computational dimension that the study o, scripts ' our era is compelled to take on board for languages that have made the jump! Some rudimentary remarks about historical origins also appear, but will need to be supplemented by specialists of the history of script an" language. While eschewing pho ological description for its own sake, we are compelled to provide a reasonably full picture of the segmental phonology of standar" BangIa as the basic material that the graphology hooks on to. For convenience o, reading, we use a basic phonemic transcription as our standard metho" of mentioning BangIa or Asamiya words, except where the specifics of a written fonn are under scrutiny. For the same reason, our "'scussion of the script is built around a romanization whose conventions are consistent with the phonemic transcription we employ. 1 We should ideally distinguish between the properties of the script proper, -&'#& can underlie several different. orthographies for one or more languages, and those of an orthograp&$ that applies it to a particular language. But common parlan#e does not make this distinction. For immediate practical purposes, given the relatively marginal differences between written BangIa and written Asamiya at the structural level, -e will therefore not make that distinction, a " will speak of script somewhat loosely. The distinctions we do need to make for our practical work are expressed ' the transcription systems used! In our practice here+ the graphological transcriptions are enclosed ' 7 8 to distinguish them from transcriptions at the graphic level enclosed ' single angular brackets < 9! We employ a ' u ,or short vowel graphemes, a I ii for their long counterparts, r for the functionally .ocalic Iril originating historically from syllabic Irl, ai aU for the diphthong graphemes, tilde f()r vowel nasalizatio (as ' a), it for the velar nasal, s for the palato- alveolar sibilant, nand s for the dentals, t : for retroflex plosives, r for the retroflex flap, ( for the retroflex sibilant, ,' for the palatal nasal, H for the anusvara and b for the visarga to the extent that these may eed separate representation, y for the palatal grapheme with semivowel value (marked ' the actual script by placing a dot under the independent allograp& of the palatal grapheme), $ for the palatal grapheme with non-semivowel value (this covers both the undotted independent allograp& of the palatal grapheme and the postconsonantal depe dent palatal marking known as ya-phala), and backslash for the silencer that cancels the default vowel «a» on a consonant. The phonem'# II and phonetic ; < transcription systems use re = for low vowels, # j for palato-alveolar affricates, and other symbols (including Ia! for the low back unrounded vowel) as ' IPA and other standard tra scriptions. Itis possible that this stock needs to be eked out by drawing on IPA itself an" other resources ', one expands the scope of such an investigation. When Salomon )1223> ?1*+ ' a direct reference to BangIa, claims that 'the written vowel e actually serves to represent three separate sounds', he lists 1r./, ' addition to our lei and lrel, -&'#& corresponds to nothing known ' standard BangIa. The reference is perhaps to dialects of BangIa ' -&'#& standard lrel does not appear, for we are not aware of any variant of Bangia ' -&'#& lower-mid fro t and low front vowels appear to contrast with mid front, and Salomo does not clarify -&at he has ' mind. We "oubt that graphological studies are prepared to take on all the dialects at this stage of the enterprise. The present expositio is a working draft. Section I provides a general overview of the script and some considerations involv'ng phonology and orthography. Section 1 takes up more concrete issues on the computational front requiring decisions. 1. Considerations of Script, Phonology, and Orthography 1.1. Diachrony and Description We distinguish the content of descriptive linguistics, -&'#& needs to take on board whatever realities govern a particular state of a language, from the principles of linguistic synchrony, -&'#& interact with those of diachrony and other systems to yield these realities. T&# serious study of linguistic synchrony deploys scientific idealizations modelling the most transparent operations ' + say+ phonology or syntax. In contrast, the descriptive analysis of the phenomena of a language has to hug these phenomena closely and to suspend judgment about just ho- various factors interact to give rise to the appearances, ', they are later scientifically judged to be as deceptive as all appearances. The description of morphology, for instance+ has to take into account both synchron'# and diachronic principles. In the other direction, a study of "'achrony must divide its attention between the tracking of phenomena through time, a historical matter, and the effects o, "'achronic phenomena as they show up within a particular state of a given language+ where the archaic and the innovative sectors of the lexicon are layere" "ifferently. In the study of language ' recent decades, there has been some neglect both of diachron'# issues ( " of the interface between spoken and written language. 4e face a certain problem of equipment and of preparedness of the "iscursive surroundings, therefore, when we approach the issue of the diachrony of the Bangla-Asamiya writing system. For our purposes, let the following points suffice, inviting supplementation by specialists. To begin with a familiar first approximation )-&'#& -e radically revise ' section 1*+ the Bangla- Asamiya script is alpha-syllabic, or an ak~ara script ' the sense of Salomon )1223>BC*! Its remote ancestor was the Brahm) script. Its classic shape was finalized over several decades when the sociology of print encountered the handwriting of Ishwarchandra Vidyasagar (1820-1891, the initiator of BangIa print litera#$ and the author of primers that have stoo" the test of time) a " his associates ' nineteenth century Bengal. There was one major revision of these shapes beginning ' the nineteen fifties, with the rise of linotype printing and the need to reduce the number of "'stinct allographs. The current rise of computer technology has been occasioning much activity behind the scenes as far as the replication of older printing standards is concerned, but little by way of script reform ' the sense of the earlier impact of print technology on this script. At the level of orthography, Bangla-Asamiya writing is considerably more conservative than the phonologies of these languages. Many innovations ' their phonology are not matched ' the writing of words taken directly ,rom Sanskrit and preserving the orthography of Sanskrit, standardly called tatsama words. As a result, there is a deep cleavage between speech and writing ' these languages. C!1! Generalities and Dowels The basic graphological molecule of an ak~ara script such as the Bangla-Asamiya script is an ak~ara. It consists of a series of graphic atoms. The series must begin -'th zero or more consonants and end with either a marked. vowel or the unmarke" default vowel «a» (BangIa and Asamiya pronounce this as I::>/) or the silencer element transcribed by us as the backs lash E and called 7hasanta» Ih::>Sonto/ ' BangIa (the Nagari term is «halanta», Hind' Ih~l~nt/). Thus, ' «ud\bigna» ludbigno/ 'anxious', there are four ak~aras. The vocalic ak~ara <ill» consists of zero consonant plus graphological <ill». The zero consonant triggers the graphic l~vel choice of the independent vowel allograph of <ill»!The next ak~ara 7"E8 #onsists of graphic F"9 plus the silencer, without -&'#& graphic F"9 -ould be understood to carry a default «a» (modem BangIa strongly tends to omit all silencers at the graphic level, leaving it an open question whether the$ are even graphologically absent)! A consonant «b» 'nitiates the following ak~ara ' the word under consideration and triggers, for graphological 7'8+ the graph level #&oice of the dependent allograph F'9+ -&'#& appears to the left of the graph <b> and is called the «hrasba-i-kara» /hr::>§oikar/, the 'short 7'8 supplement'. All vowels distinguish the dependent (or supplementary) from the independent allograph! The last ak~ara of <illd\bigna» begins with the conjunct consonant «gn» and leaves the default vowel «a» understood. Some conjunct consonants like «gn» combine the two graphs, ' this case vertically (while 7mph» combinesm with ph horizontally). One consonant, «t», ' a conjunct-initial position+ chooses a prefixal allograph called «khanc;l8 ta» /khonet.otto/ 'broken t', whose use preempts that of the unattested «t\» combination throughout the language. In the other "irection, several consonants ' conjunct-final position choose a dependent allograph known as a <<phala8Iph::>la/.
Recommended publications
  • A Ruse Secluded Character Set for the Source
    Mukt Shabd Journal Issn No : 2347-3150 A Ruse Secluded character set for the Source Mr. J Purna Prakash1, Assistant Professor Mr. M. Rama Raju 2, Assistant Professor Christu Jyothi Institute of Technology & Science Abstract We are rich in data, but information is poor, typically world wide web and data streams. The effective and efficient analysis of data in which is different forms becomes a challenging task. Searching for knowledge to match the exact keyword is big task in Internet such as search engine. Now a days using Unicode Transform Format (UTF) is extended to UTF-16 and UTF-32. With helps to create more special characters how we want. China has GB 18030-character set. Less number of website are using ASCII format in china, recently. While searching some keyword we are unable get the exact webpage in search engine in top place. Issues in certain we face this problem in results announcement, notifications, latest news, latest products released. Mainly on government websites are not shown in the front page. To avoid this trap from common people, we require special character set to match the exact unique keyword. Most of the keywords are encoded with the ASCII format. While searching keyword called cbse net results thousands of websites will have the common keyword as cbse net results. Matching the keyword, it is already encoded in all website as ASCII format. Most of the government websites will not offer search engine optimization. Match a unique keyword in government, banking, Institutes, Online exam purpose. Proposals is to create a character set from A to Z and a to z, for the purpose of data cleaning.
    [Show full text]
  • Kiraz 2019 a Functional Approach to Garshunography
    Intellectual History of the Islamicate World 7 (2019) 264–277 brill.com/ihiw A Functional Approach to Garshunography A Case Study of Syro-X and X-Syriac Writing Systems George A. Kiraz Institute for Advanced Study, Princeton and Beth Mardutho: The Syriac Institute, Piscataway [email protected] Abstract It is argued here that functionalism lies at the heart of garshunographic writing systems (where one language is written in a script that is sociolinguistically associated with another language). Giving historical accounts of such systems that began as early as the eighth century, it will be demonstrated that garshunographic systems grew organ- ically because of necessity and that they offered a certain degree of simplicity rather than complexity.While the paper discusses mostly Syriac-based systems, its arguments can probably be expanded to other garshunographic systems. Keywords Garshuni – garshunography – allography – writing systems It has long been suggested that cultural identity may have been the cause for the emergence of Garshuni systems. (In the strictest sense of the term, ‘Garshuni’ refers to Arabic texts written in the Syriac script but the term’s semantics were drastically extended to other systems, sometimes ones that have little to do with Syriac—for which see below.) This paper argues for an alterna- tive origin, one that is rooted in functional theory. At its most fundamental level, Garshuni—as a system—is nothing but a tool and as such it ought to be understood with respect to the function it performs. To achieve this, one must take into consideration the social contexts—plural, as there are many—under which each Garshuni system appeared.
    [Show full text]
  • The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
    Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies.
    [Show full text]
  • A Ruse Secluded Character Set for the Source
    JOURNAL OF ARCHITECTURE & TECHNOLOGY Issn No : 1006-7930 A Ruse Secluded character set for the Source Mr. J Purna Prakash1, Assistant Professor Mr. M. Rama Raju 2, Assistant Professor Christu Jyothi Institute of Technology & Science Abstract We are rich in data, but information is poor, typically world wide web and data streams. The effective and efficient analysis of data in which is different forms becomes a challenging task. Searching for knowledge to match the exact keyword is big task in Internet such as search engine. Now a days using Unicode Transform Format (UTF) is extended to UTF-16 and UTF-32. With helps to create more special characters how we want. China has GB 18030-character set. Less number of website are using ASCII format in china, recently. While searching some keyword we are unable get the exact webpage in search engine in top place. Issues in certain we face this problem in results announcement, notifications, latest news, latest products released. Mainly on government websites are not shown in the front page. To avoid this trap from common people, we require special character set to match the exact unique keyword. Most of the keywords are encoded with the ASCII format. While searching keyword called cbse net results thousands of websites will have the common keyword as cbse net results. Matching the keyword, it is already encoded in all website as ASCII format. Most of the government websites will not offer search engine optimization. Match a unique keyword in government, banking, Institutes, Online exam purpose. Proposals is to create a character set from A to Z and a to z, for the purpose of data cleaning.
    [Show full text]
  • Rongorongo Script: Carving Techniques and Scribal Corrections
    Journal de la Société des Océanistes 129 | juillet-décembre 2009 Varia Rongorongo Script: Carving Techniques and Scribal Corrections Paul Horley Electronic version URL: http://journals.openedition.org/jso/5813 DOI: 10.4000/jso.5813 ISSN: 1760-7256 Publisher Société des océanistes Printed version Date of publication: 15 December 2009 Number of pages: 249-261 ISBN: 978-2-85430-026-0 ISSN: 0300-953x Electronic reference Paul Horley, « Rongorongo Script: Carving Techniques and Scribal Corrections », Journal de la Société des Océanistes [Online], 129 | juillet-décembre 2009, Online since 30 December 2012, connection on 30 April 2019. URL : http://journals.openedition.org/jso/5813 ; DOI : 10.4000/jso.5813 © Tous droits réservés Rongorongo Script: Carving Techniques and Scribal Corrections by Paul HORLEY* ABSTRACT RÉSUMÉ Studies of three original rongorongo tablets (Tahua, L’étude menée sur trois tablettes rongorongo origina- Aruku Kurenga and Mamari) revealed clear traces of les (Tahua, Aruku Kurenga et Mamari) montre claire- two-stage carving (pre-incising with an obsidian flake ment que les signes ont été tracés au cours de deux and contour enhancement with a shark tooth). Most étapes successives : pré-incision avec un éclat d’obsi- probably, the texts were written in short fragments with dienne puis gravure des contours avec une dent de requin. shark-tooth engraving applied before passing to the next Il est probable que de courts fragments de textes étaient fragment. Additional multiple engraving sessions might écrits et gravés à l’aide d’une dent de requin avant de been performed for finished inscription, aiming to tracer le fragment de texte suivant.
    [Show full text]
  • Of Writing Systems in Terms of Typological and Other Criteria: Cross-Linguistic Observations from the German and Japanese Writing Systems
    The evolution of writing systems: Empirical and cross-linguistic approaches workshop (AG5) @ DGfS2020, Universität Hamburg, Germany; 4-6 March, 2020 ‘Evolution’ of writing systems in terms of typological and other criteria: Cross-linguistic observations from the German and Japanese writing systems Terry Joyce Dimitrios Meletis Tama University, Japan University of Graz, Austria [email protected] [email protected] Overview Opening remarks Selective sample of writing system (WS) typologies Alternative criteria for evaluating WSs Observations from German (GWS) + Japanese (JWS) Closing remarks Opening remarks 1: Chaos over basic terminology! Erring towards understatement, Gnanadesikan (2017: 15) notes, [t]here is, in general, significant variation in the basic terminology used in the study of writing systems. Indeed, as Meletis (2018: 73) observes regarding the differences between the concepts of WS and orthography, [t]hese terms are often shockingly misused as synonyms, or writing system is not used at all and orthography is employed instead. Similarly, Joyce and Masuda (in press) seek to differentiate between the elusive trinity of terms at heart of WS research; namely, script, WS, and orthography, with particular reference to the JWS. Opening remarks 2: Our working definitions WS1 [Schrifttyp]: Abstract relations (i.e., morphographic, syllabographic, + phonemic), as focus of typologies. WS2 [Schriftsystem]: Common usage for signs + conventions of given language, such as GWS + JWS. Script [Schrift]: Set of material signs for specific language. Orthography [Orthographie]: Mediation between script + WS, but often with prescriptive connotations of correct writing. Graphematic representation: Emerging from grapholinguistic approach, a neutral (ego preferable) alternative to orthography. GWS: Use of extended alphabetic set, as used to represent written German language.
    [Show full text]
  • Research Report on Phonetics and Phonology of Sinhala
    Research Report on Phonetics and Phonology of Sinhala Asanka Wasala and Kumudu Gamage Language Technology Research Laboratory, University of Colombo School of Computing, Sri Lanka. [email protected],[email protected] Abstract phonetics and phonology for improving the naturalness and intelligibility for our Sinhala TTS system. This report examines the major characteristics of Many researches have been carried out in the linguistic Sinhala language related to Phonetics and Phonology. domain regarding Sinhala phonetics and phonology. The main topics under study are Segmental and These researches cover both aspects of segmental and Supra-segmental sounds in Spoken Sinhala. The first suprasegmental features of spoken Sinhala. Some of part presents Sinhala Phonemic Inventory, which the themes elaborated in these researches are; Sinhala describes phonemes with their associated features and sound change-rules, Sinhala Prosody, Duration and phonotactics of Sinhala. Supra-segmental features like Stress, and Syllabification. Syllabification, Stress, Pitch and Intonation, followed In the domain of Sinhala Speech Processing, a by the procedure for letter to sound conversion are very little number of researches were reported and described in the latter part. most of them were carried out by undergraduates for The research on Syllabification leads to the their final year projects. Some restricted domain identification of set of rules for syllabifying a given Sinhala TTS systems developed by undergraduates are word. The syllabication algorithm achieved an already working in a number of domains. The study of accuracy of 99.95%. Due to the phonetic nature of the such projects laid the foundation for our project. Sinhala alphabet, the letter-to-phoneme mapping was However, an extensive research was done in all- easily done, however to produce the correct aspects of Sinhala Phonetics and Phonology; in order phonetized version of a word, some modifications identify the areas to be deeply concerned in developing were needed to be done.
    [Show full text]
  • Poorman's Hangul Jamo Input Method
    Poorman’s Hangul Jamo Input Method pmhanguljamo.sty Kangsoo Kim 20 Sep 2021 version 0.3.6 Contents 1 Introduction 1 2 Usage 2 2.1 Loading the package .......................... 2 2.2 Commands and Environment Provided ................. 2 2.3 Setting up in your Preamble ....................... 3 3 Transliteration Rule of This Package 4 3.1 Tone Marks and Syllable Serapator ................... 4 3.2 Consonants ............................... 4 3.3 Vowels .................................. 5 3.4 Compatibility Jamos .......................... 6 4 Proper Fonts 6 5 Examples 7 5.1 Modern Hangul ............................. 7 5.2 pre-1933 Hangul ............................ 8 6 The RRK Input Method: An Alternative Way 8 6.1 Transliteration Rule of RRK ...................... 9 6.2 Example of RRK method ........................ 10 7 Further Information 11 8 Acknowledgement 11 1 Introduction 1 This LATEX package provides Hangul transliteration input method, which allows to typeset Korean Letters (Hangul) with the help of proper fonts. The name comes from “Poorman’s Hangul Jamo Input Method.” It is mainly for the people who have a system without Korean keyboard IM, but want to typeset Hangul in their document. Not only 1Hangul is the Korean alphabet to write the Korean language. In both South and North Korea, the standard writing system uses Hangul. 1 modern Hangul, but so-colled “Old Hangul” characters that uses the lost letters such as ‘Arae-A’(ㆍ), ‘Yet Ieung’(ㆁ) or ‘Pan-Sios’(ㅿ) etc. can also be typeset. X LE ATEX or LuaLATEX is required. The legacy pdfTEX is not supported. The Korean Language supporting packages such as xetexko and luatexko (in the ko.TEX bundle) or polyglossia package with Korean support are recommended, but without them typeset- ting Hangul is of no problem with this package pmhanguljamo.
    [Show full text]
  • The Writing Revolution
    9781405154062_1_pre.qxd 8/8/08 4:42 PM Page iii The Writing Revolution Cuneiform to the Internet Amalia E. Gnanadesikan A John Wiley & Sons, Ltd., Publication 9781405154062_1_pre.qxd 8/8/08 4:42 PM Page iv This edition first published 2009 © 2009 Amalia E. Gnanadesikan Blackwell Publishing was acquired by John Wiley & Sons in February 2007. Blackwell’s publishing program has been merged with Wiley’s global Scientific, Technical, and Medical business to form Wiley-Blackwell. Registered Office John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom Editorial Offices 350 Main Street, Malden, MA 02148-5020, USA 9600 Garsington Road, Oxford, OX4 2DQ, UK The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK For details of our global editorial offices, for customer services, and for information about how to apply for permission to reuse the copyright material in this book please see our website at www.wiley.com/wiley-blackwell. The right of Amalia E. Gnanadesikan to be identified as the author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher. Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books. Designations used by companies to distinguish their products are often claimed as trademarks.
    [Show full text]
  • Simplified Lambda in an English Novel
    The Quest for Information Retrieval THE QUEST FOR INFORMATION RETRIEVAL: AN INSCRIBED RELIC FROM ANCIENT RAPA NUI (EASTER ISLAND) – Part I. Tomi S. MELKA (Las Palmas de G.C., Spain) [email protected] Robert M. SCHOCH (Boston, Massachusetts, USA) [email protected] This article focuses on an antique wooden fragment from Easter Island engraved with a single rongorongo (RR) glyph. Based on our analysis, we conclude that the most likely hypothesis regarding this fragment is that it dates to pre-missionary times (prior to 1864) and was part of some type of weapon or fighting instrument, either of a functional or ritual nature. It may represent a portion of a handle for an obsidian-tipped spear or knife. The “bird”-like RR glyph on the fragment belongs in all probability to a category that has previously been connected in the scholarly literature on RR with “slumber”, and hence “waning”, “demise”, and “death”. Its presence on a hand-weapon or ceremonial warlike artifact suggests a meaningful real-world, “non-literary”, context for this glyph. Keywords: “bird”-like motif, glyph of the /600/-class, “Lutz–Terasaki fragment”, mako‘i (Thespesia populnea), mata‘a, patina, pre-missionary era, Rapa Nui (Easter Island), rongorongo script “El desorden y el esparcimiento en que se encuentran los más importantes documentos originales de la cultura pascuense, permite sospechar que en algunos museos y colecciones particulares se encuentren algunas otras piezas auténticas” [The chaos and dispersal in which the most important original documents (= rongorongo artifacts; our note) of Rapanui culture are found, allows us to suspect that a few other authentic pieces may be encountered in some museums and particular collections].
    [Show full text]
  • Can a Logographic Script Be Simplified? Lessons from the 20Th Century Chinese Writing Reform Informed by Recent Psycholinguistic Research*
    Can a Logographic Script be Simplified? Lessons from the 20th Century Chinese Writing Reform Informed by Recent Psycholinguistic Research* Zev Handel University of Washington, Seattle In the 1950s and 1960s, the government of the People’s Republic of China undertook, in two stages, a carefully planned “simplification” of the logographic Chinese script. Drawing on a variety of historical precedents, over 2,000 individual graphs were modified in an attempt to make the script easier to learn and use. This was the first significant change in the official form of the Chinese script in nearly two millennia, and resulted in the script variety that is widely used today in mainland China, commonly termed “simplified Chinese characters.” Drawing on recent psycholinguistic experiments that attempt to characterize the cognitive functions involved in Chinese script processing, this study revisits long-standing questions about the efficacy of character simplification and provides some additional theoretical insights into the nature of logographic writing. The central conclusion of this study is that meaningful simplification of a logographic script is possible, but that today’s simplified character script cannot be characterized as an effective reform by any reasonable metric—it is only “simpler” in the crudest of senses. After evaluating the results of recent studies on the cognitive processing of Chinese characters, I introduce the concept of semantic orthographic depth and argue that a true simplification of a logographic script should be based on regularization of semantic and phonetic components, rather than on reduction of the number of graphs or the reduction of the number of strokes per graph.
    [Show full text]
  • Transmission of the First Alphabets
    Scripts in Contact: Transmission of the First Alphabets Sveva Elti di Rodeano Abstract. The alphabet is a type of notation which breaks language into small units, attempting to represent its phonological and/or phonetic repertoire. Given that writing is a way of thinking, based on a cognitive activity that re- quires mental procedures, the aim of this paper is to propose the architecture of the spelling process, described in cognitive and psycholinguistic studies, as a model for the transmission of the alphabet(s). For this purpose, the reflection is focused on the so-called first alphabets in writing systems’ history, i.e., Greek, Anatolian, Italic, and Iberian alphabets, in- troducing the new linguistic and archeological approaches which allow an earlier date and a less decisive role for the Greeks in the alphabets’ introduction. Ancient literatures’ evidence about the teaching of writing and reading are found to match with the current results of cognitive studies about the dynamics of oral reproduction, word recognition, and written reproduction. The pecu- liarities of Asia minor alphabets, which are still unsolved, will be addressed and framed within the spelling process model. 1. Introduction Currently, the alphabet is more widespread than any other system of written language. In company with Chinese characters, the alphabet provides the forms by which all living languages are written: whether Arabic, Bengali, Cyrillic, Devanagari, Greek, Hebrew, Javanese, Latin, or Tibetan or any This paper has been written as contribution to the GC21 Conference, held online due to COVID-19 pandemic. I must thanks all the participants for their direct and undirect contributions for my research, with special regards to Daniel Harbour, Hany Rashwan and Amalia Gnanadesikan.
    [Show full text]