A Study of Traditional Mongolian Script Encodings and Rendering: Use of Unicode in OpenType Fonts
										Recommended publications
									
								- 
  Section 14.4, Phags-Pa: The Unicode® Standard Version 13.0 – Core Specification. To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trademark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1.
- 
  Technical Reference Manual for the Standardization of Geographical Names: United Nations Group of Experts on Geographical Names. ST/ESA/STAT/SER.M/87. Department of Economic and Social Affairs, Statistics Division. United Nations, New York, 2007. The Department of Economic and Social Affairs of the United Nations Secretariat is a vital interface between global policies in the economic, social and environmental spheres and national action. The Department works in three main interlinked areas: (i) it compiles, generates and analyses a wide range of economic, social and environmental data and information on which Member States of the United Nations draw to review common problems and to take stock of policy options; (ii) it facilitates the negotiations of Member States in many intergovernmental bodies on joint courses of action to address ongoing or emerging global challenges; and (iii) it advises interested Governments on the ways and means of translating policy frameworks developed in United Nations conferences and summits into programmes at the country level and, through technical assistance, helps build national capacities. NOTE: The designations employed and the presentation of material in the present publication do not imply the expression of any opinion whatsoever on the part of the Secretariat of the United Nations concerning the legal status of any country, territory, city or area or of its authorities, or concerning the delimitation of its frontiers or boundaries. The term “country” as used in the text of this publication also refers, as appropriate, to territories or areas. Symbols of United Nations documents are composed of capital letters combined with figures. ST/ESA/STAT/SER.M/87 UNITED NATIONS PUBLICATION Sales No.
- 
  The Vocabulary of Inanimate Nature as a Part of Turkic-Mongolian Language Commonness: Mediterranean Journal of Social Sciences, Vol 6 No 6 S2, November 2015. ISSN 2039-2117 (online), ISSN 2039-9340 (print). MCSER Publishing, Rome-Italy. Valentin Ivanovich Rassadin, Doctor of Philological Sciences, Professor, Department of the Kalmyk language and Mongolian studies, Director of the Mongolian and Altaistic research Scientific centre, Kalmyk State University, 358000, Republic of Kalmykia, Elista, Pushkin street, 11. Doi:10.5901/mjss.2015.v6n6s2p126. Abstract: The article deals with the problem of commonness of Turkic and Mongolian languages in the area of vocabulary; a layer of vocabulary, reflecting the inanimate nature, is subject to thorough analysis. This thematic group studies the rubrics, devoted to landscape vocabulary, different soil types, water bodies, atmospheric phenomena, celestial sphere. The material, mainly from Khalkha-Mongolian and Old Written Mongolian languages is subject to the analysis; the data from Buryat and Kalmyk languages were also included, as they were presented in these languages. The Buryat material was mainly closer to the Khalkha-Mongolian one. For comparison, the material, mainly from the Old Turkic language, showing the presence of similar words, was included; it testified about the so-called Turkic-Mongolian lexical commonness. The analysis of inner forms of these revealed common lexemes in the majority of cases allowed determining their Turkic origin, proved by wide occurrence of these lexemes in Turkic languages and Turkologists' acknowledgement of their Turkic origin. The presence of great quantity of common vocabulary, which origin is determined as Turkic, testifies about repeated ancient contacts of Mongolian and Turkic languages, taking place in historical retrospective, resulting in hybridization of Mongolian vocabulary.
- 
  Old Cyrillic in Unicode: Ivan A Derzhanski, Institute for Mathematics and Computer Science, Bulgarian Academy of Sciences. The current version of the Unicode Standard acknowledges the existence of a pre-modern version of the Cyrillic script, but its support thereof is limited to assigning code points to several obsolete letters. Meanwhile mediæval Cyrillic manuscripts and some early printed books feature a plethora of letter shapes, ligatures, diacritic and punctuation marks that want proper representation. (In addition, contemporary editions of mediæval texts employ a variety of annotation signs.) As generally with scripts that predate printing, an obvious problem is the abundance of functional, chronological, regional and decorative variant shapes, the precise details of whose distribution are often unknown. The present contents of the block will need to be interpreted with Old Cyrillic in mind, and decisions to be made as to which remaining characters should be implemented via Unicode’s mechanism of variation selection, as ligatures in the typeface, or as code points in the Private space or the standard Cyrillic block. I discuss the initial stage of this work. The Unicode Standard (Unicode 4.0.1) makes a controversial statement: The historical form of the Cyrillic alphabet is treated as a font style variation of modern Cyrillic because the historical forms are relatively close to the modern appearance, and because some of them are still in modern use in languages other than Russian (for example, U+0406 “I” CYRILLIC CAPITAL LETTER I is used in modern Ukrainian and Byelorussian). Some of the letters in this range were used in modern typefaces in Russian and Bulgarian.
- 
  The fontspec package: Font selection for XeLaTeX and LuaLaTeX. Will Robertson and Khaled Hosny. 2013/05/12 v2.3b. Contents: 1 History; 2 Introduction (2.1 About this manual; 2.2 Acknowledgements); 3 Package loading and options (3.1 Maths fonts adjustments; 3.2 Configuration; 3.3 Warnings); I General font selection; 4 Font selection (4.1 By font name; 4.2 By file name); 5 Default font families; 6 New commands to select font families (6.1 More control over font shape selection; 6.2 Math(s) fonts; 6.3 Miscellaneous font selecting details); 7 Selecting font features (7.1 Default settings; 7.2 Changing the currently selected features; 7.5 Different features for different font sizes); 8 Font independent options (8.1 Colour; 8.2 Scale; 8.3 Interword space; 8.4 Post-punctuation space; 8.5 The hyphenation character; 8.6 Optical font sizes); II OpenType; 9 Introduction (9.1 How to select font features); 10 Complete listing of OpenType font features (10.1 Ligatures; 10.2 Letters; 10.3 Numbers; 10.4 Contextuals; 10.5 Vertical Position; 10.6 Fractions; 10.7 Stylistic Set variations; 10.8 Character Variants; 10.9 Alternates; 10.10 Style; 10.11 Diacritics; 10.12 Kerning; 10.13 Font transformations).
- 
  Writing Systems: Their Properties and Implications for Reading. Brett Kessler and Rebecca Treiman. doi:10.1093/oxfordhb/9780199324576.013.1. Draft of a chapter to appear in: The Oxford Handbook of Reading, ed. by Alexander Pollatsek and Rebecca Treiman. ISBN 9780199324576. Abstract: An understanding of the nature of writing is an important foundation for studies of how people read and how they learn to read. This chapter discusses the characteristics of modern writing systems with a view toward providing that foundation. We consider both the appearance of writing systems and how they function. All writing represents the words of a language according to a set of rules. However, important properties of a language often go unrepresented in writing. Change and variation in the spoken language result in complex links to speech. Redundancies in language and writing mean that readers can often get by without taking in all of the visual information. These redundancies also mean that readers must often supplement the visual information that they do take in with knowledge about the language and about the world. Keywords: writing systems, script, alphabet, syllabary, logography, semasiography, glottography, underrepresentation, conservatism, graphotactics. The goal of this chapter is to examine the characteristics of writing systems that are in use today and to consider the implications of these characteristics for how people read. As we will see, a broad understanding of writing systems and how they work can place some important constraints on our conceptualization of the nature of the reading process. It can also constrain our theories about how children learn to read and about how they should be taught to do so.
- 
  A New Database for Online Handwritten Mongolian Word Recognition: 2016 23rd International Conference on Pattern Recognition (ICPR), Cancún Center, Cancún, México, December 4-8, 2016. Long-Long Ma, Ji Liu, Jian Wu, National Engineering Research Center of Fundamental Software, Institute of Software, Chinese Academy of Sciences, Beijing, P. R. China. {longlong, wujian}@iscas.ac.cn. Abstract: A new online handwritten Mongolian word database, MRG-OHMW, is introduced in this paper. This database contains 946 Mongolian words produced by 300 persons from Mongolian ethnic minority. These Mongolian words are composed of one to fourteen Mongolian characters, and selected from large-scale Mongolian text corpus according to the frequencies of usage. The current version of this database is collected using Anoto pen on paper. The database is further annotated using Mongolian word-level string alignment strategy. We partition the samples into training and test sets, and evaluate the database using the CNN-based recognizer as a baseline. Experimental results reveal a big challenge to higher recognition performance. To our knowledge, MRG-OHMW is the first publicly available database for online handwritten Mongolian research. […] studied lately field. Researchers from only several institutes are devoted to related work. Gao et al. [1] presented a statistical and structural recognition method for printed Mongolian character recognition. Batsaikhan et al. [2] used multilayer perceptron classifier to recognize noisy Mongolian characters with single font. Two-stage classification method based on glyph segmentation was used to enhance Mongolian character recognition performance [3]. A segmentation-based approach segments classical Mongolian words into several GUs (Glyph Unites) and then recognizes the GUs [4]. These GUs are combined to form word recognition results. Multi-font printed Mongolian documents are recognized by integrating character segmentation and character recognition with support mixed
- 
  On the Acoustic Properties of Mongolian Language: AN INTRODUCTION TO MONGOLIAN PHONETIC-LINGUISTIC MODEL: ON THE ACOUSTIC PROPERTIES OF MONGOLIAN LANGUAGE. Idomuso Dawa, Shigeki Okawa and Katsuhiko Shirai. Dept. of Information and Computer Science, Waseda University, Japan; Dept. of Network Science, Chiba Institute of Technology, Japan. ABSTRACT: This paper proposes a multi-lingual phonetic-linguistic model for Mongolian language based on the study of the classification and the postulates of such models. This model is commonly applicable for various fields as linguistic/phonetic/dialectal/language educational studies as well as speech recognition. Its application to major dialects of Mongolian has marked 91 % on word level recognition. In this paper, therefore, we first introduce the structure of language and classification of Mongolian dialectal speech. Next, we analyze and classify fundamental characteristics in order to define phoneme labels. Lastly, we apply the common models to a simple word recognition experiment. 1 INTRODUCTION: Mongolian language, one of the most spoken Asian languages, is mainly used by Mongolian people resided in the Central Asia and spoken by about 7.5 million people. […] Pandita in 1648. (c) Cyrillic scripts, created from Russian orthographic characters in 1940’s, have been widely used in Mongolia, Kalmyk A. R. in Russia. In the script (a), allophones sometimes correspond to the same characters as well as the contrary case. In many cases, a word form is quite different from the pronunciations in these days. On the contrary, (b) Todo scripts were created based on a rule that one character corresponds to only one pronunciation. Additionally, the letters have special marks for the long vowels etc. Todo scripts avoid the vagueness in writing and speaking in initial Mongolian, as “Todo” means the clearness by Mongolian. [Table: written and spoken forms of a word in scripts (a) and (b)]
- 
  International Language Environments Guide: Sun Microsystems, Inc., 4150 Network Circle, Santa Clara, CA 95054 U.S.A. Part No: 806–6642–10, May, 2002. Copyright 2002 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Java, XView, ToolTalk, Solstice AdminTools, SunVideo and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib. The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
- 
  Development Production Line: The Short Story. Jene Jasper. Copyright © 2007-2018 freedumbytes.dev.net (Free Dumb Bytes). Published 3 July 2018, 4.0-beta Edition. While every precaution has been taken in the preparation of this installation manual, the publisher and author assume no responsibility for errors or omissions, or for damages resulting from the use of the information contained herein. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. To get an idea of the Development Production Line take a look at the following Application Integration overview and Maven vs SonarQube Quality Assurance reports comparison. 1. Operating System; 1.1. Windows; 1.1.1. Resources; 1.1.2. Desktop; 1.1.3. Explorer; 1.1.4. Windows 7 Start Menu; 1.1.5. Task Manager replacement; 1.1.6. Resource Monitor
- 
  ISO/IEC JTC1/SC2/WG2 N 4823, Date: 2017-05-24. ISO/IEC JTC1/SC2/WG2 Coded Character Set. Secretariat: Japan (JISC). Doc. Type: Disposition of comments. Title: Disposition of comments on PDAM1.2 to ISO/IEC 10646 5th edition. Source: Michel Suignard (project editor). Project: JTC1 02.10646.00.01.00.05. Status: For review by WG2. Distribution: WG2. Reference: SC2 N4518. Medium: Paper, PDF file. Comments were received from the following members: China, Ireland, Japan, Mongolia, UK, and USA. The following document is the disposition of those comments. The disposition is organized per country. Note – With some minor exceptions, the full content of the ballot comments has been included in this document to facilitate the reading. The dispositions are inserted in between these comments and are marked in Underlined Bold Serif text, with explanatory text in italicized serif. As a result of this disposition, a new PDAM1.3 ballot will be initiated. It is expected to be the last PDAM ballot for Amendment 1 before a DAM ballot is initiated. Following these dispositions, the following changes were done to the Amendment repertoire. Xiangqi game symbols: 30 characters removed (U+1F270..U+1F28D) from the Enclosed Ideographic Supplement block (U+1F200..U+1F2FF) and replaced by 14 characters (U+1FA60..U+1FA6D) in a new block, Chess Symbols (U+1FA00..U+1FA6F), with names and code points as follows: 1FA60 RED XIANGQI GENERAL; 1FA61 RED XIANGQI MANDARIN; 1FA62 RED XIANGQI ELEPHANT; 1FA63 RED XIANGQI HORSE; 1FA64 RED XIANGQI CHARIOT; 1FA65 RED XIANGQI CANNON; 1FA66 RED XIANGQI SOLDIER; 1FA67 BLACK XIANGQI GENERAL; 1FA68 BLACK XIANGQI MANDARIN; 1FA69 BLACK XIANGQI ELEPHANT; 1FA6A BLACK XIANGQI HORSE; 1FA6B BLACK XIANGQI CHARIOT; 1FA6C BLACK XIANGQI CANNON; 1FA6D BLACK XIANGQI SOLDIER. Small Historic Kana: The characters proposed at 1B127..1B12F are removed from this amendment.
- 
  PTOCA Reference (Presentation Text Object Content Architecture): Advanced Function Presentation Consortium Data Stream and Object Architectures. Presentation Text Object Content Architecture Reference, AFPC-0009-03. Note: Before using this information, read the information in “Notices” on page 171. Fourth Edition (March 2016). This edition applies to the Presentation Text Object Content Architecture (PTOCA). It is the first edition produced by the AFP Consortium™ (AFPC™) and replaces and makes obsolete the previous edition, SC31-6803-02, published by the IBM® Corporation. This edition remains current until a new edition is published. Specific changes are indicated by a vertical bar to the left of the change. For a detailed list of the changes, see “Summary of Changes” on page ix. Internet: Visit our home page: www.afpcinc.org. Copyright © AFP Consortium 1997, 2016. Preface: This book describes the functions and services associated with the Presentation Text Object Content Architecture (PTOCA) architecture. This book is a reference, not a tutorial. It complements individual product publications, but does not describe product implementations of the architecture. Who Should Read This Book: This book is for systems programmers and other developers who need such information to develop or adapt a product or program to interoperate with other presentation products. AFP Consortium (AFPC): The Advanced Function Presentation™ (AFP™) architectures began as the strategic, general purpose document and information presentation architecture for the IBM Corporation. The first specifications and products go back to 1984. Although all of the components of the architecture have grown over the years, the major concepts of object-driven structures, print integrity, resource management, and support for high print speeds were built in from the start.