Typology of Writing and the Origin of Alphabetic Script

Total Page:16

File Type:pdf, Size:1020Kb

Typology of Writing and the Origin of Alphabetic Script saqarTvelos mecnierebaTa erovnuli akademiis moambe, 175, #2, 2007 BULLETIN OF THE GEORGIAN NATIONAL ACADEMY OF SCIENCES, 175, #2, 2007 Semiotics & Linguistics Typology of Writing and the Origin of Alphabetic Script Thomas V. Gamkrelidze* * Academy Member, G. Tsereteli Institute of Oriental Studies, Tbilisi; Georgian National Academy of Sciences ABSTRACT. The conceptualization of writing as a sign system affords a better insight into its ontological nature. This conceptualization enables one to develop a general typology of writing and to determine its place in the development of human culture. Comparative analysis of various types of scripts is presented in order to develop criteria for their assessment necessary for the clarification of the question of the origin of writing and determina- tion of the principal stages of its phylogenetic development. © 2007 Bull. Georg. Natl. Acad. Sci. Key words: typology of writing, alphabetic script, semiotic system. 1. Writing as a semiotic system. Writing systems sion and content. The expression of a graphic symbol, or may be conceptualized in what follows as a set of inter- its signifier (signans) is that physical substance by related signs of a special nature, forming a single integral means of which a visual representation of a sign is real- structure. The conceptualization of writing as a semiotic ized. This representation may be a drawing, a geometric system places it on a par with other analogous human sign, or a figure. The content of a graphic sign, or its systems. This defines the theory of writing, proposed in signified (signatum) is all that is expressed by such a modern linguistic science to be named grammatology, as written symbol, all to which it is correlated. This may be a division of the general theory of sign systems, viz. a definite concept, idea, number, word, syllable, or an semiotics or semiology. individual sound. The writing system, when seen as a Thus we may apply to the writing system a number sign system, is characterized by two planes: those of of operational concepts developed in other semiotic dis- expression and content, to which, because of their dual ciplines, primarily in linguistics. This is facilitated not nature, the graphic signs of a particular writing system only by the close historical relationship that exists be- are correlated. tween language and writing - the latter in a sense being Such a dual nature of the writing system gives superimposed on language - but also by the very na- grounds for a typological classification of writing accord- ture of writing, which displays many of the common ing to the character of its “plane of expression”. This structural features of a language system. The classification in turn permits a comparative analysis of conceptualization of writing as a sign system affords a various types of scripts in order to develop criteria for better insight into its ontological nature. This their assessment, necessary for the clarification of the conceptualization enables one to develop a general ty- question of the origin of writing and determination of the pology of writing and to determine its place in the de- principal stages of its phylogenetic development. The velopment of human culture [1]. phylogenetic development (or phylogeny) of writing re- 2. The “Plane of content” and the “Plane of expres- fers to the evolution of writing in general, with account sion” of writing. As a semiotic system, writing consists of the consecutive stages of its development, beginning of visual symbols of sign structure. This means that ev- with semiography up to the alphabetic stage of phonog- ery written sign-symbol constitutes a two-sided entity, raphy. The ontogenetic development (or ontogeny) of i.e., a systemic unit characterized by two sides: expres- writing is the origin and development of the consecutive © 2007 Bull. Georg. Natl. Acad. Sci. Typology of Writing and the Origin of Alphabetic Scripts 153 stages of an individual writing system, beginning with universal conceptual categories of the language - essen- its creation and continuing up to the time of its study. tially available to all language communities at a definite Two principal typological classes may be identified level of cultural development - but to the concrete pho- with regard to the “plane of content”: (a) semasiography netics of a particular language. In such systems it is not or ideography, and (b) phonography. concepts that play the role of the signified of particular Semasiography is characteristic of the class of writ- signs, but concrete words characterized by concrete ing systems in which the graphic signs designate not sounding, or language units of a lower order - individual the phonetic side of a particular language (individual syllables and/or sounds. In the case of phonographic words, syllables, or sounds of the given language), but systems, knowledge of the writing system presupposes instead denote concrete concepts or even whole situa- the preliminary knowledge of the relationship - often con- tions. They correlate directly with the “plane of content” ventional - between the sign and the concrete phonetic of language. In other words, in such writing systems the word of the given language, or the sound segment of a plane of content, expressed by the words and phrases of lower order (syllable, sound). a particular language is directly reflected in the signs, Writing proper is believed - apparently without suf- the latter performing the role of units which - along with ficient grounds - to be represented by phonographic the words and word combinations of a concrete language systems, while ideography constitutes in its phyloge- – designate universal conceptual categories of various netic development a sort of precursor of writing. levels of abstraction. Such signs in ideographic Thus, in phonographic systems writing is already (semasiographic) systems, being correlated to definite correlated to speech, and the phonetic form of a lan- concepts but devoid of the phonetic envelope of the guage serves as the plane of content of such systems - words of concrete languages, are understood and read definite sound segments (phonetic word, syllables, or correctly by representatives of diverse languages who individual sounds) turn into the signified of the graphic possess knowledge of these signs, i.e., knowledge of the signs of a writing system. correlation of these signs with concepts. Such knowl- A writing system related to phonography and pos- edge of the content of the signs and symbols of an ideo- sessing characters expressing individual lexemes of a graphic system is based either on the identification of concrete language, is defined as logographic. A sepa- their signifiers with objects of the real world, whose rate sign of such a system is a logogram. A phonographic iconic reflection they are, or on a conventionally adopted system with characters expressing separate syllables is relation of concrete signs of a writing system to corre- defined as syllabography. The characters of such a sys- sponding semantic conceptual signifiers. Elements of tem are syllabograms. A phonographic system with signs ideography are present in many ancient and modern writ- expressing individual sounds (sound units) is described ing systems (cf. for example, the numerical designations as an alphabet. in most writing systems, symbols of the type of &, e.g., In the typology of writing, the alphabetic system cf. in English, the system of mathematical signs, and so occupies the highest rank. It is the most economical in on). Uninterrupted ideographic writing systems are char- terms of the number of signs needed for a complete re- acteristic of the ancient stages of the phylogenetic de- cording of phonetic speech and for the transmission of velopment of writing. information over a distance. Thereby the invention of In this connection the question arises regarding the plane of expression of a writing system. The characters alphabetic writing signified an outstanding achievement of an ideographic system may graphically resemble the in the cultural development of mankind. In the form of objects of the real world to which they are correlated alphabetic writing mankind acquired a simple and effec- through their connection with corresponding concepts. tive means of graphic recording of speech and of its This iconic resemblance of the signifiers of the signs of transmission over space and time. Elements of ideography the system to the objects they reflect characterizes the may be present in any alphabetic writing. Hence, one writing system as pictographic, i.e., an iconic system of can speak only conventionally about the greater perfec- writing. tion of the alphabetic system of writing in comparison When there is no external resemblance between the with ideography. The “perfection” of alphabetic writing symbols of the ideographic system and the objects of should be taken in the sense that it is chronologically a the real world to which they are correlated by virtue of further stage in the phylogenetic development of writ- their link with corresponding conceptual categories (cf. ing, consecutively passing the stages of ideography, for example, numerical designations) the writing system logography, and syllabography, up to the emergence of may be described as conventional. the alphabetic system proper, although in the develop- Phonography refers to a class of writing systems in ment of alphabetic writing cases may be observable of a which the signifiers of the signs are correlated not to the
Recommended publications
  • Armenian Secret and Invented Languages and Argots
    Armenian Secret and Invented Languages and Argots The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Russell, James R. Forthcoming. Armenian secret and invented languages and argots. Proceedings of the Institute of Linguistics of the Russian Academy of Sciences. Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:9938150 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP 1 ARMENIAN SECRET AND INVENTED LANGUAGES AND ARGOTS. By James R. Russell, Harvard University. Светлой памяти Карена Никитича Юзбашяна посвящается это исследование. CONTENTS: Preface 1. Secret languages and argots 2. Philosophical and hypothetical languages 3. The St. Petersburg Manuscript 4. The Argot of the Felt-Beaters 5. Appendices: 1. Description of St. Petersburg MS A 29 2. Glossary of the Ṙuštuni language 3. Glossary of the argot of the Felt-Beaters of Moks 4. Texts in the “Third Script” of MS A 29 List of Plates Bibliography PREFACE Much of the research for this article was undertaken in Armenia and Russia in June and July 2011 and was funded by a generous O’Neill grant through the Davis Center for Russian and Eurasian Studies at Harvard. For their eager assistance and boundless hospitality I am grateful to numerous friends and colleagues who made my visit pleasant and successful. For their generous assistance in Erevan and St.
    [Show full text]
  • Petit Manuel Unix®
    Août 2010 Petit Manuel Unix® Jacques MADELAINE Département d’informatique Université de CAEN 14032 CAEN CEDEX La première édition de ce manuel décrivait SMX un Unix développé à l’INRIA pour la machine française SM90, la deuxième édition une adaptation pour SPIX, un Unix pour SM90 basé sur System V et développé par Bull. Il a été ensuite modifié et corrigé pour SunOS l’Unix de Sun Microsystems, puis pour Solaris. La cinquième version a été adaptée pour tenir compte des particularités du système GNU-Linux. Un chapitre supplémentaire dédié aux accès réseau a été ensuite ajouté. Rappelons que presque toutes les commandes décrites vont fonctionner comme indiqué sur tout système Unix commercial (Solaris, HP-UX, AIX, ...) ou libre (Linux, OpenBSD, FreeBSD, NetBSD, ...). Mes remerciements à Sara Aubry pour sa relecture attentive etàFrançois Girault pour avoir fourni la mise en tableau des commandes d’emacs. Mes remerciements à Davy Gigan pour m’avoir poussé à publier la version html en octobre 2003. 1 INTRODUCTION() INTRODUCTION() 2Petit manuel Unix 2002 INTRODUCTION NOM intro − introduction to the mini manual − introduction au petit manuel DESCRIPTION Ce manuel donne les principales commandes de Unix. Unix est une famille de systèmes d’exploitation ; les commandes décrites existent, sauf précision contraire, sous Linux et Solaris, les deux systèmes disponibles au département. Seules les principales options sont données, reportez-vous au manuel en ligne pour une liste exhaustive.Chaque commande est décrite par trois sections : NOM qui donne le nom de la commande, son nom en anglais (le nom Unix étant un mnémonique anglais ne correspondant pas toujours bien aveclefrançais) et en français.
    [Show full text]
  • Optimal Interleaving: Serial Phonology-Morphology Interaction in a Constraint-Based Model
    OPTIMAL INTERLEAVING: SERIAL PHONOLOGY-MORPHOLOGY INTERACTION IN A CONSTRAINT-BASED MODEL A Dissertation Presented by MATTHEW ADAM WOLF Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY September 2008 Department of Linguistics © Copyright by Matthew Adam Wolf 2008 All Rights Reserved OPTIMAL INTERLEAVING: SERIAL PHONOLOGY-MORPHOLOGY INTERACTION IN A CONSTRAINT-BASED MODEL A Dissertation Presented by MATTHEW A. WOLF Approved as to style and content by: ____________________________________ John J. McCarthy, Chair ____________________________________ Joseph V. Pater, Member ____________________________________ Elisabeth O. Selkirk, Member ____________________________________ Mark H. Feinstein, Member ___________________________________ Elisabeth O. Selkirk, Head Department of Linguistics ACKNOWLEDGEMENTS I’ve learned from my own habits over the years that the acknowledgements are likely to be the most-read part of any dissertation. It is therefore with a degree of trepidation that I set down these words of thanks, knowing that any omissions or infelicities I might commit will be a source of amusement for who-knows-how-many future generations of first-year graduate students. So while I’ll make an effort to avoid cliché, falling into it will sometimes be inevitable—for example, when I say (as I must, for it is true) that this work could never have been completed without the help of my advisor, John McCarthy. John’s willingness to patiently hear out half-baked ideas, his encyclopedic knowledge of the phonology literature, his almost unbelievably thorough critical eye, and his dogged insistence on making the vague explicit have made this dissertation far better, and far better presented, than I could have hoped to achieve on my own.
    [Show full text]
  • Guide to the Use of Character Set Standards in Europe
    CEN TECHNICAL REPORT Draft 3 for CEN Trnnnn:1999 1999-07-23 Descriptors: Data processing, information interchange, text processing, text communication, graphic characters, character sets, representation of characters, coded character sets, architecture Information Technology - Guide to the use of character set standards in Europe This CEN Technical Report has been drawn up by CEN/TC 304 This CEN Technical Report was established by TC 304 in one official version (English). A version in any other language made by translation under the responsibility of a CEN member into its own lan- guage and notified to the Central Secretariat has the same status as the official version. CEN members are the national bodies of Austria, Belgium, the Czech Republic, Denmark, Finland, France, Germany, Greece, Iceland, Ireland, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, and the United Kingdom. CEN European Committee for Standardization Comité Européen de Normalisation Europäisches Komitee für Normung Central Secretariat: rue de Stassart 36, B-1050 Brussels © CEN 1999 Copyright reserved to all CEN members Ref.No. TR xxxx:1999 E CEN TR nnnn : Draft 2 Guide to the use of character set standards in Europe ii Guide to the use of character set standards in Europe CEN TR nnnn : Draft 2 FOREWORD This report was produced by a CEN/TC 304 Project Team, set up in June, 1998, as one of several to carry out the funded work program of TC 304 (documented in CEN/TC 304 N 666 R2). A first draft was discussed at the TC meeting in Brussels in November, 1998. A revised draft was circulated for comments within the TC and thereafter discussed at the TC plenary meeting in April, 1999.
    [Show full text]
  • Choosing Inscriptions Making Font for 'Armazuli' Aramaic Objectives Mark up of the Texts and Linked Data New Photo Document
    @EAGLE 2016, Rome Epigraphic Corpus of Georgia Inscriptions found in Georgia are diverse in their typology, content and language. From among this range perhaps the most compelling examples are those inscriptions The Institute of Linguistic Research, in Aramaic and Old Greek (numbering more than 1000), and dating from V AC to XIX (T. Kaukhchishvili 2009.) None of these have been published online according to the EpiDoc guidelines. Thanks to a year’s funding from The Institute of Linguistic Research of the Ilia State University (ISU), the “Epigraphic Corpus of Georgia Project”, Ilia State University led by Prof. Nino Doborjginidze, began on March 1st 2015. It aims to make the first, key 30 inscriptions available to both a scholarly audience and to the general public. It presents an opportunity to question the strict practice of the “print-only” publishing of epigraphic materials among Georgian epigraphists. Objectives New photo documentation and web page - epigraphy.iliauni.edu.ge A desired outcome of the digital publishing of the inscriptions of Georgia is, on the one hand, to preserve those inscriptions and likewise to preserve editions of these inscriptions that were made by Georgian experts from 1930s onwards, although only a few of these were published in international scientific journals because of the So- viet restrictions. Thus the aims of the project can be summarized thus: • to protect these inscriptions as an element of our cultural heritage • to document the printed critical editions of the inscriptions, which likewise consti- tute a part of that cultural heritage • to demonstrate and illustrate common, historical-cultural contexts(e.g.
    [Show full text]
  • Proposal for the Georgian Script Root Zone LGR
    Proposal for the Georgian Script Root Zone LGR LGR Version 2 Date: 2016-11-24 Document version: 1.1 Authors: GEORGIAN SCRIPT GENERATION PANEL 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Georgian LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: • Proposed-LGR-Georgian-20160915.xml Labels for testing can be found in the accompanying text document: • Labels-GeorgianScript-20160915.txt 2 Script for which the LGR is proposed ISO 15924 Code: Geor ISO 15924 Key N°: 240 ISO 15924 English Name: Georgian Mkhedruli Latin transliteration of native script name: Mkhedruli Native name of the script: მხედრული Maximal Starting Repertoire (MSR) version: MSR-2 Proposal for a Georgian Script Root Zone LGR Georgian Script GP 3 Background on Script and Principal Languages Using It The Georgian scripts are the three writing systems used to write the Georgian language: Asomtavruli, Nuskhuri and Mkhedruli. Mkhedruli (Georgian: მხედრული) is the current Georgian script and is therefore the standard script for modern Georgian and its related Kartvelian languages, whereas Asomtavruli and Nuskhuri are used only in ceremonial religious texts and iconography. In the following, the term Georgian script is used synonymously with Mkhedruli. Like the two other scripts, Mkhedruli is purely unicameral. Mkhedruli first appears in the 10th century - the oldest Mkhedruli inscription found is dated back to 982 AD.
    [Show full text]
  • How Children Learn to Write Words
    How Children Learn to Write Words How Children Learn to Write Words Rebecca TReiman and bReTT KessleR 1 3 Oxford University Press is a department of the University of Oxford. It furthers the University’s objective of excellence in research, scholarship, and education by publishing worldwide. Oxford New York Auckland Cape Town Dar es Salaam Hong Kong Karachi Kuala Lumpur Madrid Melbourne Mexico City Nairobi New Delhi Shanghai Taipei Toronto With offices in Argentina Austria Brazil Chile Czech Republic France Greece Guatemala Hungary Italy Japan Poland Portugal Singapore South Korea Switzerland Thailand Turkey Ukraine Vietnam Oxford is a registered trademark of Oxford University Press in the UK and certain other countries. Published in the United States of America by Oxford University Press 198 Madison Avenue, New York, NY 10016 © Oxford University Press 2014 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, by license, or under terms agreed with the appropriate reproduction rights organization. Inquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above. You must not circulate this work in any other form and you must impose this same condition on any acquirer. A copy of this book’s Catalog-in-Publication Data is on file with the Library of Congress ISBN 978–0–19–990797–7
    [Show full text]
  • Armenian Range: 0530–058F
    Armenian Range: 0530–058F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation.
    [Show full text]
  • Sylfaen : Foundations of Multiscript Typography
    Sylfaen : Foundations of Multiscript Typography John Hudson, 2000 © INTRODUCTION In the autumn of 1997, I was hired by the Microsoft Typography Group to consult For a general introduction to OpenType, see David Lemon’s on the production of support materials for OpenType font developers working with article in the first issue of Type. For more detailed information, multiple scripts and languages. The project was given the working title Web Resource including the current version of the OpenType specification, for International Typography (WRIT). A year later, I had the pleasure of showing this see the Microsoft Typography work in progress at the 1998 ATypI Conference in Lyon, where it was well received website at: www.microsoft.com/typography by my colleagues in the type industry. Shortly after my return to Canada, Microsoft decided to put further development of this work on hold, so this paper, inevitably, lacks some of the enthusiasm I had for the project in 1998. It is difficult to remain excited about something which has been effectively cancelled, but at the same time I am happy to respond to the editors’ invitation to write something about the project for the Type journal, because I think the small team at Microsoft who worked on this project achieved something important. Perhaps I still hope that some day the project may be revived, even if in a different or more limited form. Certainly, I still believe that the ever increasing emphasis on internationalisation in software development, business and, of course, the Internet, requires a better level of support from font developers, and this in turn requires something like WRIT.
    [Show full text]
  • N5020 2019-01-11
    ISO/IEC JTC 1/SC 2 N____ ISO/IEC JTC 1/SC 2/WG 2 N5020 2019-01-11 ISO/IEC JTC 1/SC 2/WG 2 Universal Coded Character Set (UCS) - ISO/IEC 10646 Secretariat: ANSI DOC TYPE: Meeting minutes TITLE: Unconfirmed minutes of WG 2 meeting 67 SOAS University of London, London, UK; 2018-06-16/20 SOURCE: V.S. Umamaheswaran ([email protected]), Recording Secretary Michel Suignard ([email protected]), Convener PROJECT: JTC 1.02.18 - ISO/IEC 10646 STATUS: SC 2/WG 2 participants are requested to review the attached unconfirmed minutes, act on appropriate noted action items, and to send any comments or corrections to the convener as soon as possible but no later than the due date below. ACTION ID: ACT DUE DATE: 2019-05-01 DISTRIBUTION: SC 2/WG 2 members and Liaison organizations MEDIUM: Acrobat PDF file NO. OF PAGES: 46 (including cover sheet) ISO International Organization for Standardization Organisation Internationale de Normalisation ISO/IEC JTC 1/SC 2/WG 2 Universal Coded Character Set (UCS) ISO/IEC JTC 1/SC 2 N____ ISO/IEC JTC 1/SC 2/WG 2 N5020 2019-01-11 Title: Unconfirmed minutes of WG 2 meeting 67 SOAS University of London, London, UK; 2018-06-16/20 Source: V.S. Umamaheswaran ([email protected]), Recording Secretary Michel Suignard ([email protected]), Convener Action: WG 2 members and Liaison organizations Distribution: ISO/IEC JTC 1/SC 2/WG 2 members and liaison organizations 1 Opening Input document: 4950 Agenda for the 67th Meeting of ISO/IEC JTC 1/SC2/WG2, SOAS, London, U.K.
    [Show full text]
  • Armenian Phonetic Characters in Unicode∗
    Armenian Phonetic Characters in Unicode∗ Luc V. Baronian, PhD Visiting Scholar, Stanford Associate Professor, UQAC January 19, 2017 1 Introduction Armenian has a relatively small number of speakers, yet it has a sesquimillennial written tradition and several contemporary academic publications. The numeric disadvantage, combined with the existence of two standard versions of the modern language (Eastern and Western), a unique alphabet and the evolution of Armenia's universities in the sphere of Russia, rather than that of Western Europe and North America, have all contributed to a poorer knowledge of its language and its academic achievements in the West than it deserves. For example, there exists an Armenian linguistic tradition, which, like the American, French and other linguistic traditions, has used a phonetic alphabet of its own for about a century, but based on the Armenian alphabet. The works of Hrach'ya Acharyan1 were crucial in disseminating the use of this phonetic alphabet still in use today among Armenian scholars. 2 The characters proposed for inclusion in Unicode While most of the Armenian phonetic characters consist of Armenian letters with or without the use of diacritics such as the umlaut or a combining ring, I have identified two characters that cannot be represented in the current state of Unicode. The first character consists of ARMENIAN SMALL LETTER AYB (U+0561, ) rotated 180 degrees. The letter could sometimes be confused for a LATIN SMALL LETTER M (U+006D); however, in those Armenian fonts that use square angles for Ayb, the difference is noticeable. Even a rotated curvy Ayb may look different from an m, especially in the rendition of the initial m-stroke (final Ayb-stroke).
    [Show full text]
  • MSR-4: Annotated Repertoire Tables, Non-CJK
    Maximal Starting Repertoire - MSR-4 Annotated Repertoire Tables, Non-CJK Integration Panel Date: 2019-01-25 How to read this file: This file shows all non-CJK characters that are included in the MSR-4 with a yellow background. The set of these code points matches the repertoire specified in the XML format of the MSR. Where present, annotations on individual code points indicate some or all of the languages a code point is used for. This file lists only those Unicode blocks containing non-CJK code points included in the MSR. Code points listed in this document, which are PVALID in IDNA2008 but excluded from the MSR for various reasons are shown with pinkish annotations indicating the primary rationale for excluding the code points, together with other information about usage background, where present. Code points shown with a white background are not PVALID in IDNA2008. Repertoire corresponding to the CJK Unified Ideographs: Main (4E00-9FFF), Extension-A (3400-4DBF), Extension B (20000- 2A6DF), and Hangul Syllables (AC00-D7A3) are included in separate files. For links to these files see "Maximal Starting Repertoire - MSR-4: Overview and Rationale". How the repertoire was chosen: This file only provides a brief categorization of code points that are PVALID in IDNA2008 but excluded from the MSR. For a complete discussion of the principles and guidelines followed by the Integration Panel in creating the MSR, as well as links to the other files, please see “Maximal Starting Repertoire - MSR-4: Overview and Rationale”. Brief description of exclusion
    [Show full text]