The Mongolian Script

Total Page:16

File Type:pdf, Size:1020Kb

The Mongolian Script &'()'* !"#$% The Mongolian script: What’s going on?! $%&"' !"# 梁 海 · Liang Hai · लांग हाइ · [email protected] 11 September 2018, IUC #42 * Get the latest revision from ↗ lianghai.github.io/mongolian * Note The views expressed by the speaker in this talk are his own and are NOT meant to reflect those of the Unicode Consortium or the Unicode Technical Committee. Agenda I. A crash course on the script II. The Unicode Mongolian, an encoding model from hell III. What exactly are not working? IV. Tough lessons learned V. Ongoing efforts, and how to participate · Part I · A crash course on the script What is the Mongolian script? Writing systems and features. I. Crash course: Origin Aramaic Sogdian Old Uyghur Mongolian, initially “Uyghur Mongolian” …………………………… early 13th century ..………………………………………………………………………… I. Crash course: Writing systems & languages Mongolian/Hudum (Mongolian) ……….. early 13th ……….. late 16th . early 17th . mid-17th … mid-18th …… mid-20th .. Mongolian/Hudum Ali Gali (Sanskrit–Tibetan) Manchu (Manchu) Manchu Ali Gali (Sanskrit–Tibetan) Sibe (Sibe Manchu) Todo (Oirat–Kalmyk Mongolian & Sanskrit–Tibetan) I. Crash course: Writing systems & languages [cont.] moŋol ()*+), moŋol mnju !-.-/ s [→] Hudum, Manchu, Sibe, and Todo, in some typical styles: moŋgol hudum | … manju | … sibe | … todo ] . cont [ 0.1 BC:DE 9:;< 0.2 0F2 0;2 032 0G2 0=2 ende ada ata | 45.6 HIDJ >5;? ordo urtu urdu ordo urtu Writing systems & languages systems & Writing 4536 HIKJ >5=? 75.8 7IF8 @5;A Hudum, Manchu–Sibe, and Todo, normalized to the same style: normalized Todo, Hudum, Manchu–Sibe, and ] ↓ [ I. Crash course: ] . cont [ 0.1 BC:DE 9:;< 0.2 0F2 0;2 , normalized to the same style: , normalized 032 0G2 0=2 ende ada ata | Todo 45.6 HIDJ >5;? , and ordo urtu urdu ordo urtu Writing systems & languages systems & Writing 4536 HIKJ >5=? 75.8 7IF8 @5;A Manchu–Sibe Hudum, ] ↓ [ I. Crash course: I. Crash course: Writing systems & languages [cont.] Writing systems: Served languages: • Hudum and Hudum Ali Gali • Mongolian, incl. Oirat–Kalmyk • Manchu–Sibe and Manchu Ali Gali • Manchu–Sibe • Todo • Sanskrit–Tibetan Also, note some marginal caes: Manchu–Sibe for Daur, Hudum for Evenki, and Vagindra for Buryat Mongolian. I. Crash course: General features Inherited fom Aramaic ~ Sogdian: + , • Cursive - , 567$894 ./0 • Largely dual-joining. , 0 , ﻳﻮﻧﻴﻜﻮد cf. Arabic 1/1 • • Bowed consonants ﻛـﻜـﻚ ك ـﻮ و ـﺪ د 2 , 3 , 4 ] . cont [ @A,BA,BC @C !:,;:,;< !< =0,70,71 =1 =>,7>,7? =? General features General om Aramaic ~ Sogdian: Aramaic om f Cursive Cursive consonants Bowed • • Inherited Inherited I. Crash course: ] . *D HF *D I *D EF *D ? cont [ GHF =1 GEF =? @A,BA,BC @C !:,;:,;< !< =0,70,71 =1 =>,7>,7? =? General features General om Aramaic ~ Sogdian: Aramaic om f Cursive Cursive consonants Bowed • • Inherited Inherited I. Crash course: MNF MO JKF JL ] . *D HF *D I *D EF *D ? cont [ GHF =1 GEF =? @A,BA,BC @C !:,;:,;< !< =0,70,71 =1 =>,7>,7? =? as a as a and General features General tail f is not a is not a tooth le + tail tail f tooth bare le . ] A A ] om Aramaic ~ Sogdian: Aramaic om f fun fact fun positional allographs positional allographs tail right usages), while usages), of allograph contextual whole is a [ Ali Gali in (unless grapheme • Cursive Cursive consonants Bowed • • Inherited Inherited I. Crash course: I. Crash course: General features [cont.] Inherited fom Sogdian ~ Old Uyghur: • Vertical writing ↓·→ • Originated from ←·↓ being rotated 90° counterclockwise. • ↓·→-in-narrow-column or →·↓ as fallback in horizontal writing. • True alphabet • (+ dual-joining =) Consonant leters seldom appear on their own, but are usually writen in internally joined syllables. I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: => R> , 7> , S> • Syllable onset placeholder , , (aleph) 7T , Q R? =? ﻧﯩـﻨﯩـﻨﻰ ﻧﻰ ﺋـ cf. Uyghur • ﺋﯩـﺌﯩـ ﺋﻰ $= (…Syllable coda forms (n, g, d • R$ , 7$ /U , • Vowel harmony class–specific S$ , , 7W , ﯨـ ى V/ consonants RW • Phonetic letters/syllables =W I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: => R , > 7> • Syllable onset placeholder , (aleph) S , > , 7T , Q R ﺋـ cf. Uyghur • =? ? ﻧﯩـﻨﯩـﻨﻰ ﻧﻰ • [fun fact] The crown and the tooth are ﺋﯩـﺌﯩـ ﺋ ﻰ $= positional allographs to each other. R , $ 7$ /U , Syllable coda forms S • , (n, g, d…) $ , 7W , ﯨـ ى V/ R • Vowel harmony class–specific =W consonants W • Phonetic letters/syllables I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: = R> > , 7> • Syllable onset placeholder , (aleph) S> , , 7T , Q R? ﺋـ cf. Uyghur • = ﻧﯩـﻨﯩـﻨﻰ ﻧ ﻰ ? • [fun fact] The crown and the tooth are ﺋﯩـﺌﯩـ ﺋﻰ = positional allographs to each other. R$ $ , 7$ /U , Syllable coda forms S$ • (n, g, d…) , , 7W , ﯨـ ى / V RW • Vowel harmony class–specific = consonants W • Phonetic letters/syllables I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: = R> > , 7> • Syllable onset placeholder , (aleph) S> , , 7T , Q R? ﺋـ cf. Uyghur • = ﻧﯩـﻨﯩـﻨﻰ ﻧ ﻰ ? • [fun fact] The crown and the tooth are ﺋﯩـﺌﯩـ ﺋﻰ = positional allographs to each other. R$ $ , 7$ /U , Syllable coda forms S$ • (n, g, d…) , , 7W , ﯨـ ى / V RW • Vowel harmony class–specific = consonants W • Phonetic letters/syllables ] . cont X>Y, F3>Z [ [>\, F)>] =>S, F7>^ ) aleph ( …) d General features General , g , n ( –specific Phonetic letters/syllables Phonetic Vowel harmony class harmony Vowel consonants Syllable onset placeholder placeholder onset Syllable coda forms Syllable • • • • Invented during Old Uyghur ~ Mongolian: ~ Uyghur Old during Invented I. Crash course: ] . cont X>Y, F3>Z [ [>\, F)>] =>S, F7>^ ) aleph ( …) d General features General , g , n ( –specific Phonetic letters/syllables Phonetic Vowel harmony class harmony Vowel consonants Syllable onset placeholder placeholder onset Syllable coda forms Syllable • • • • Invented during Old Uyghur ~ Mongolian: ~ Uyghur Old during Invented I. Crash course: ] . cont X>Y, F3>Z [ [>\, F)>] F_` =>S, F7>^ Fa` ) aleph ( …) d General features General , g , n ( –specific Phonetic letters/syllables Phonetic Vowel harmony class harmony Vowel consonants Syllable onset placeholder placeholder onset Syllable coda forms Syllable • • • • Invented during Old Uyghur ~ Mongolian: ~ Uyghur Old during Invented I. Crash course: I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: =? =T • Syllable onset placeholder (aleph) • Syllable coda forms (n, g, d…) !< !b • Vowel harmony class–specific f? cd consonants ed [? • Complementary distribution of two gutural series: gimel and kaph. • Phonetic letters/syllables I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: =? =T • Syllable onset placeholder (aleph) • Syllable coda forms (n, g, d…) !< !b • Vowel harmony class–specific f c d consonants ? e [ • Complementary distribution of d ? two gutural series: gimel and kaph. • Phonetic letters/syllables I. Crash course: General features [cont.] Invented during Old Uyghur ~ Mongolian: Graphs … Letters … Phones • Syllable onset placeholder (aleph) Spanih G L … P • Syllable coda forms (n, g, d…) Englih G L … … … P • Vowel harmony class–specific consonants Arabic G … L … … … P • Phonetic letters/syllables Tibetan G … … L … … P • Reanalyzed leters on the basis of Mongolian G … … … L* … … … P phonemes instead of graphemes. g h, =i =6,7j,7k V n l m, =n =6,7j,7k ü V_ ö n V_ o p, =q =0,70,71 u n o V n r s, =t =0,70,71 i e V_ a u v, =W =$,7$,7W V → w x, =T =y,7y,7T ↓ z {, =? =y,7y,7T g h, =i =6,7j,7k V n l m, =n =6,7j,7k ü V_ ö n V_ o p, =q =0,70,71 u n o V n r s, =t =0,70,71 i e V_ a u v, =W =$,7$,7W V → w x, =T =y,7y,7T ↓ z {, =? =y,7y,7T I. Crash course: Writing system analyses • Hudum and Todo are analyzable as either phonetic sylables or phonetic leters. • Hudum is largely unpredictable (one-to-multi, involving grammatical/ lexical information) and highly confuable (multi-to-one). • Todo is highly predictable and minimally confusable. • Manchu–Sibe is analyzable as phonetic sylables. • Also highly predictable and minimally confusable. • Can be very weird if must be analyzed as phonetic leters. 0.1 BC:DE 9:;< ende 0.2 0F2 0;2 ada 032 0G2 0=2 ata | 45.6 HIDJ >5;? urdu 4536 HIKJ >5=? Todo: Hudum, Manchu–Sibe, and ] urtu ↓ [ 75.8 7IF8 @5;A ordo de 0.1 BC:DE 9:;< : en Todo 0.2 0F2 0;2 da a , and ta 032 0G2 0=2 a | du Manchu–Sibe 45.6 HIDJ >5;? , ur tu Hudum 4536 HIKJ >5=? ] ur ↓ [ 75.8 7IF8 @5;A do or I. Crash course: Hudum-specific features • Disjointed tail (ćaćulga, detached a/e) • First-vowel forms (o, u, ö, ü) • Controversial diphthongs • Complex scopes • One scope per word-stem (note compound words) • Word-stem boundaries affect syllable boundaries • Suffixes (including enclitics, which is disconnected) extend scopes • Purely lexical variants I. Crash course: Hudum-specific features • Disjointed tail (ćaćulga, detached a/e) • First-vowel forms (o, u, ö, ü) • Controversial diphthongs • Complex scopes • One scope per word-stem (note compound words) • Word-stem boundaries affect syllable boundaries • Suffixes (including enclitics, which is disconnected) extend scopes • Purely lexical variants F F a` F 7? ^ _n _na _n[a] F F a F 7 ^ ? ` _n _na _n[a] I. Crash course: Hudum-specific features [cont.] • Disjointed tail (ćaćulga, detached a/e) • First-vowel forms (o, u, ö, ü) • Controversial diphthongs • Complex scopes • One scope per word-stem (note compound words) • Word-stem boundaries affect syllable boundaries • Suffixes (including enclitics, which is disconnected) extend scopes • Purely lexical variants /0, /1 R6,S6, Ri =6,76,7j,7k =i /0, /1 ü u R0,S0, Rq i =0,70,70,71 =q /U, /V R$,S$, RW =$,7$,7$,7W =W I. Crash course: Hudum-specific features [cont.] • Disjointed tail (ćaćulga, detached a/e) • First-vowel forms (o, u, ö, ü) • Controversial diphthongs • Complex scopes • One scope per word-stem (note compound words) • Word-stem boundaries affect syllable boundaries • Suffixes (including enclitics, which is disconnected) extend scopes • Purely lexical variants | | = >}$ >$$ >$ ~>^ •? ^ naima sain/sayin/sayn sayihan I.
Recommended publications
  • Section 14.4, Phags-Pa
    The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1.
    [Show full text]
  • Technical Reference Manual for the Standardization of Geographical Names United Nations Group of Experts on Geographical Names
    ST/ESA/STAT/SER.M/87 Department of Economic and Social Affairs Statistics Division Technical reference manual for the standardization of geographical names United Nations Group of Experts on Geographical Names United Nations New York, 2007 The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No.
    [Show full text]
  • Writing Systems: Their Properties and Implications for Reading
    Writing Systems: Their Properties and Implications for Reading Brett Kessler and Rebecca Treiman doi:10.1093/oxfordhb/9780199324576.013.1 Draft of a chapter to appear in: The Oxford Handbook of Reading, ed. by Alexander Pollatsek and Rebecca Treiman. ISBN 9780199324576. Abstract An understanding of the nature of writing is an important foundation for studies of how people read and how they learn to read. This chapter discusses the characteristics of modern writing systems with a view toward providing that foundation. We consider both the appearance of writing systems and how they function. All writing represents the words of a language according to a set of rules. However, important properties of a language often go unrepresented in writing. Change and variation in the spoken language result in complex links to speech. Redundancies in language and writing mean that readers can often get by without taking in all of the visual information. These redundancies also mean that readers must often supplement the visual information that they do take in with knowledge about the language and about the world. Keywords: writing systems, script, alphabet, syllabary, logography, semasiography, glottography, underrepresentation, conservatism, graphotactics The goal of this chapter is to examine the characteristics of writing systems that are in use today and to consider the implications of these characteristics for how people read. As we will see, a broad understanding of writing systems and how they work can place some important constraints on our conceptualization of the nature of the reading process. It can also constrain our theories about how children learn to read and about how they should be taught to do so.
    [Show full text]
  • A New Database for Online Handwritten Mongolian Word Recognition
    2016 23rd International Conference on Pattern Recognition (ICPR) Cancún Center, Cancún, México, December 4-8, 2016 A New Database for Online Handwritten Mongolian Word Recognition Long-Long Ma, Ji Liu, Jian Wu National Engineering Research Center of Fundamental Software Institute of Software, Chinese Academy of Sciences Beijing, P. R. China {longlong, wujian}@iscas.ac.cn Abstract—A new online handwritten Mongolian word studied lately field. Researchers from only several institutes are database, MRG-OHMW, is introduced in this paper. This devoted to related work. Gao et al. [1] presented a statistical database contains 946 Mongolian words produced by 300 persons and structural recognition method for printed Mongolian from Mongolian ethnic minority. These Mongolian words are character recognition. Batsaikhan et al. [2] used multilayer composed of one to fourteen Mongolian characters, and selected perceptron classifier to recognize noisy Mongolian characters from large-scale Mongolian text corpus according to the with single font. Two-stage classification method based on frequencies of usage. The current version of this database is glyph segmentation was used to enhance Mongolian character collected using Anoto pen on paper. The database is further recognition performance [3]. A segmentation-based approach annotated using Mongolian word-level string alignment strategy. segments classical Mongolian words into several GUs (Glyph We partition the samples into training and test sets, and evaluate Unites) and then recognizes the GUs [4]. These GUs are the database using the CNN-based recognizer as a baseline. Experimental results reveal a big challenge to higher recognition combined to form word recognition results. Multi-font printed performance. To our knowledge, MRG-OHMW is the first Mongolian documents are recognized by integrating character publicly available database for online handwritten Mongolian segmentation and character recognition with support mixed research.
    [Show full text]
  • Unicode Alphabets for L ATEX
    Unicode Alphabets for LATEX Specimen Mikkel Eide Eriksen March 11, 2020 2 Contents MUFI 5 SIL 21 TITUS 29 UNZ 117 3 4 CONTENTS MUFI Using the font PalemonasMUFI(0) from http://mufi.info/. Code MUFI Point Glyph Entity Name Unicode Name E262 � OEligogon LATIN CAPITAL LIGATURE OE WITH OGONEK E268 � Pdblac LATIN CAPITAL LETTER P WITH DOUBLE ACUTE E34E � Vvertline LATIN CAPITAL LETTER V WITH VERTICAL LINE ABOVE E662 � oeligogon LATIN SMALL LIGATURE OE WITH OGONEK E668 � pdblac LATIN SMALL LETTER P WITH DOUBLE ACUTE E74F � vvertline LATIN SMALL LETTER V WITH VERTICAL LINE ABOVE E8A1 � idblstrok LATIN SMALL LETTER I WITH TWO STROKES E8A2 � jdblstrok LATIN SMALL LETTER J WITH TWO STROKES E8A3 � autem LATIN ABBREVIATION SIGN AUTEM E8BB � vslashura LATIN SMALL LETTER V WITH SHORT SLASH ABOVE RIGHT E8BC � vslashuradbl LATIN SMALL LETTER V WITH TWO SHORT SLASHES ABOVE RIGHT E8C1 � thornrarmlig LATIN SMALL LETTER THORN LIGATED WITH ARM OF LATIN SMALL LETTER R E8C2 � Hrarmlig LATIN CAPITAL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C3 � hrarmlig LATIN SMALL LETTER H LIGATED WITH ARM OF LATIN SMALL LETTER R E8C5 � krarmlig LATIN SMALL LETTER K LIGATED WITH ARM OF LATIN SMALL LETTER R E8C6 UU UUlig LATIN CAPITAL LIGATURE UU E8C7 uu uulig LATIN SMALL LIGATURE UU E8C8 UE UElig LATIN CAPITAL LIGATURE UE E8C9 ue uelig LATIN SMALL LIGATURE UE E8CE � xslashlradbl LATIN SMALL LETTER X WITH TWO SHORT SLASHES BELOW RIGHT E8D1 æ̊ aeligring LATIN SMALL LETTER AE WITH RING ABOVE E8D3 ǽ̨ aeligogonacute LATIN SMALL LETTER AE WITH OGONEK AND ACUTE 5 6 CONTENTS
    [Show full text]
  • Orthographic Transparency and the Ottoman Abjad Maithili Jais
    Orthographic Transparency and the Ottoman Abjad Maithili Jais University of Florida Spring 2018 I. Introduction In 2014, the debate over whether Ottoman Turkish was to be taught in schools or not was once again brought to the forefront of Turkish society and the Turkish conscience, as Erdogan began to push for Ottoman Turkish to be taught in all high schools across the country (Yeginsu, 2014). This became an obsession of a news topic for media in the West as well as in Turkey. Turkey’s tumultuous history with politics inevitably led this proposal of teaching Ottoman Turkish in all high schools to become a hotbed of controversy and debate. For all those who are perfectly contented to let bygones be bygones, there are many who assert that the Ottoman Turkish alphabet is still relevant and important. In fact, though this may be a personal anecdote, there are still certainly people who believe that the Ottoman script is, or was, superior to the Latin alphabet with which modern Turkish is written. This thesis does not aim to undertake a task so grand as sussing out which of the two was more appropriate for Turkish. No, such a task would be a behemoth for this paper. Instead, it aims to answer the question, “How?” Rather, “How was the Arabic script moulded to fit Turkish and to what consequence?” Often the claim that one script it superior to another suggests inherent judgement of value, but of the few claims seen circulating Facebook on the efficacy of the Ottoman script, it seems some believe that it represented Turkish more accurately and efficiently.
    [Show full text]
  • 1 Introduction 6
    Report No 170, August 1999 UNU/IIST, P.O. Box 3058, Macau 2 Report No 170, August 1999 UNU/IIST, P.O. Box 3058, Macau 3 Report No 170, August 1999 UNU/IIST, P.O. Box 3058, Macau 4 Report No 170, August 1999 UNU/IIST, P.O. Box 3058, Macau 5 Contents 1 Introduction 6 2 The Basic Character set 8 2.1 Other basic Mongolian characters 15 3 The Variant Forms 17 3.1 Overriding the Defaults 21 3.2 The Mongolian Reference Table 23 4 The Ligatures 25 5 Implementing Software for Mongolian 28 A The Mongolian Reference Table B Mongolian Ligatures Report No 170, August 1999 UNU/IIST, P.O. Box 3058, Macau 6 1 Introduction Although most countries in the world have had national standard encoding schemes for the characters of their own language or languages for some time, these could differ wildly even between countries sharing the same written language. As a result, an electronic document written using a piece of software based on a particular encoding scheme could only be read by someone possessing either software based on the same encoding scheme or software for translating between the two different encoding schemes. As the volume of international communications increased, especially the international exchange of electronic data, not least via the internet, it became clear that this situation was completely impractical and that some internationally accepted universal encoding scheme, which could form the basis for multi-lingual software, was needed. A joint technical committee (ISO/IEC JTC1) was therefore set up by the International Organization for Standardisation (ISO) and the International Electrotechnical Committee (IEC) to work on this, and, initially independently though later in collaboration with ISO/IEC, the Unicode Consortium embarked on a similar project.
    [Show full text]
  • Korean Hankul and the Þp'ags-Pa Script
    Writine in the Altaic World studia orientalia 87, Helsinki 1999, pp. 79-t00 Korean Hankul and the þP'ags-pa script Roger Finch From the annals of the reign period of King Sejong of Korea, the fourth king of the Yi Dynasty, who ruled from 1418 to 1450, we learn that in the l2th month of the 25th year of his reign (1443), the king invented the Korean alphabet. This alphabet, now known as Hankul, was called Hunmin Ceng'im,'Right Sounds for Instructing the People'l at the time it was introduced, and later simply En-mun'Vernacular Script'. According to popular tradition, the shapes of the letters, most of which are rectilinear or angular, were inspired by the shapes in the fretwork on Korean windows. This writing system is a true alphabet, and not a syllabary, though signs for discrete sounds are organized into single blocks of signs, each representing a syllable in the way that Chinese characters are complexes made up of various strokes organized into single blocks, each of which represent.s a syllable; a syllabary is a writing system in which each combination of consonants with a syllabic core is repre- sented by a unique sign as, for example, the Japanese Kana (Hiragana and Katakana). Almost a century earlier, in 1260, in the year of his election to the position of Great Khan of the Mongols and Emperor of China,2 Khubilai Khan appointed National Preceptor a Tibetan monk named hP'ags-pa he had met seven years earlier, when hP'ags-pa was only nineteen, and commissioned him to create a new Mongolian script.
    [Show full text]
  • (RSEP) Request October 16, 2017 Registry Operator INFIBEAM INCORPORATION LIMITED 9Th Floor
    Registry Services Evaluation Policy (RSEP) Request October 16, 2017 Registry Operator INFIBEAM INCORPORATION LIMITED 9th Floor, A-Wing Gopal Palace, NehruNagar Ahmedabad, Gujarat 380015 Request Details Case Number: 00874461 This service request should be used to submit a Registry Services Evaluation Policy (RSEP) request. An RSEP is required to add, modify or remove Registry Services for a TLD. More information about the process is available at https://www.icann.org/resources/pages/rsep-2014- 02-19-en Complete the information requested below. All answers marked with a red asterisk are required. Click the Save button to save your work and click the Submit button to submit to ICANN. PROPOSED SERVICE 1. Name of Proposed Service Removal of IDN Languages for .OOO 2. Technical description of Proposed Service. If additional information needs to be considered, attach one PDF file Infibeam Incorporation Limited (“infibeam”) the Registry Operator for the .OOO TLD, intends to change its Registry Service Provider for the .OOO TLD to CentralNic Limited. Accordingly, Infibeam seeks to remove the following IDN languages from Exhibit A of the .OOO New gTLD Registry Agreement: - Armenian script - Avestan script - Azerbaijani language - Balinese script - Bamum script - Batak script - Belarusian language - Bengali script - Bopomofo script - Brahmi script - Buginese script - Buhid script - Bulgarian language - Canadian Aboriginal script - Carian script - Cham script - Cherokee script - Coptic script - Croatian language - Cuneiform script - Devanagari script
    [Show full text]
  • Classifying Arabic Fonts Based on Design Characteristics: PANOSE-A
    Classifying Arabic Fonts Based on Design Characteristics: PANOSE-A Jehan Janbi A Thesis In the Department of Computer Science & Software Engineering Presented in Partial Fulfillment of the Requirements For the Degree of Doctor of Philosophy (Computer Science) at Concordia University Montreal, Quebec, Canada July 2016 © Jehan Janbi, 2016 Abstract In desktop publishing, fonts are essential components in each document design. With the development of font design software and tools, there are thousands of digital fonts. Increasing the number of available fonts makes selecting an appropriate font, which best serves the objective of a design, not an intuitive issue. Designers can search for a font like any other file types by using general information such as name and file format. But for document design purposes, the design features or visual characteristics of fonts are more meaningful for designers than font file information. Therefore, representing fonts’ design features by searchable and comparable data would facilitate searching and selecting a desirable font. One solution is to represent a font’s design features by a code composed of several digits. This solution has been implemented as a computerized system called PANOSE-1 for Latin script fonts. PANOSE-1 is a system for classifying and matching typefaces based on design features. It is composed of 10 digits, where each digit represents a specific design feature. It is used within several font management tools as an option for ordering and searching fonts based on their design features. It is also used in font replacement processes when an application or an operating system detects a missing font in an immigrant document or website.
    [Show full text]
  • Mongolian-Chinese Dictionary: Online Version"
    Instructions for the "Mongolian-Chinese Dictionary: Online Version" 2020.10.20 Table of Contents 1. Outline 1.1. The prologue 1.2. Main features 1.3. System requirements 1.4. Usage requirements 2. Components of entries in the dictionary 3. Usage method 3.1. Start-up 3.2. Components of the window 3.2.1. Search area 3.2.2. Buttons 3.2.3. Search result display area 3.2.4. The image of the original dictionary 3.2.5. Listen to the pronunciation 3.2.6. Cyrillic word 3.2.7. The links to the Cyrillic Mongolian dictionaries 3.3. Search method 3.3.1. Search object 3.3.1.1. “Mongolian Words” (Searching by traditional Mongolian) 3.3.1.2. “Roman Transcript” (Search by Romanized transcription) 3.3.1.3. “Search All” (Searching all letter strings) ■The benefits of using “Search All” mode. 3.3.2. Types of search methods 3.3.3. Search options 3.3.3.1. Fuzzy Search ■The advantage of “Fuzzy Search” 3.3.3.2. Case-insensitive 3.3.3.3. Exclude sub headwords (search only main headwords) Supplemental Explanations 1. About "Mongolian-Chinese dictionary" 2. Some examples for explaining the relations between traditional Mongolian letters, Roman transcript and the keyboard keys 3. Things to be careful of when searching by traditional Mongolian letters 1 1.Outline 1.1. The prologue This "Mongolian-Chinese Dictionary: Online Version" is an electronic version of the "Mongolian-Chinese Dictionary: Revised and Enlarged Edition" (published by Inner Mongolia University Press, 1999) which has been compiled by the Institute of Mongolian Language Study, the School of Mongolian Studies of Inner Mongolia University.
    [Show full text]
  • Scripts and Politics in the Ussr
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Firenze University Press: E-Journals © 2017 Author(s). Open Access. This article is Studi Slavistici xiv (2017): 9-19 distributed under the terms of the cc by-nc-nd 4.0 doi: 10.13128/Studi_Slavis-21936 Submitted on 2017, March 11th issn 1824-761x (print) Accepted on 2017, June 26th Blocco tematico issn 1824-7601 (online) Vladimir Mikhajlovich Alpatov Scripts and Politics in the ussr 1. Four Aspects of the Alphabet Problem The Soviet Union existed for 69 years, from 1922 to 1991, as a multinational and multi- lingual entity. It is difficult to calculate the number of the languages in the ussr (due in part to the problematic differentiation of languages and dialects), but it was upwards of 150 (for further details on the linguistic map of the former Soviet Union see Matthews 1951, Creis- sels 1977, Isaev 1977, Comrie 1981, as well as the monumental scholarly work in five volumes edited by Vinogradov 1966-1968). Almost half of these did not have written alphabets even during the Soviet period, but more than 80 have or had alphabets. Moreover, the majority of these languages changed their alphabets two or three times (for some examples see Glück 1994: 747-751). No country in the world has changed its language policy – including the choice of alphabet – so frequently (Jachnow 1994). There were three main periods of alpha- bet change: the 1920’s; the second half of the 1930’s; and the last years of the ussr.
    [Show full text]