The Bangla-Asamiya Script and Its Representation in Unicode
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Ruse Secluded Character Set for the Source
Mukt Shabd Journal Issn No : 2347-3150 A Ruse Secluded character set for the Source Mr. J Purna Prakash1, Assistant Professor Mr. M. Rama Raju 2, Assistant Professor Christu Jyothi Institute of Technology & Science Abstract We are rich in data, but information is poor, typically world wide web and data streams. The effective and efficient analysis of data in which is different forms becomes a challenging task. Searching for knowledge to match the exact keyword is big task in Internet such as search engine. Now a days using Unicode Transform Format (UTF) is extended to UTF-16 and UTF-32. With helps to create more special characters how we want. China has GB 18030-character set. Less number of website are using ASCII format in china, recently. While searching some keyword we are unable get the exact webpage in search engine in top place. Issues in certain we face this problem in results announcement, notifications, latest news, latest products released. Mainly on government websites are not shown in the front page. To avoid this trap from common people, we require special character set to match the exact unique keyword. Most of the keywords are encoded with the ASCII format. While searching keyword called cbse net results thousands of websites will have the common keyword as cbse net results. Matching the keyword, it is already encoded in all website as ASCII format. Most of the government websites will not offer search engine optimization. Match a unique keyword in government, banking, Institutes, Online exam purpose. Proposals is to create a character set from A to Z and a to z, for the purpose of data cleaning. -
Kiraz 2019 a Functional Approach to Garshunography
Intellectual History of the Islamicate World 7 (2019) 264–277 brill.com/ihiw A Functional Approach to Garshunography A Case Study of Syro-X and X-Syriac Writing Systems George A. Kiraz Institute for Advanced Study, Princeton and Beth Mardutho: The Syriac Institute, Piscataway [email protected] Abstract It is argued here that functionalism lies at the heart of garshunographic writing systems (where one language is written in a script that is sociolinguistically associated with another language). Giving historical accounts of such systems that began as early as the eighth century, it will be demonstrated that garshunographic systems grew organ- ically because of necessity and that they offered a certain degree of simplicity rather than complexity.While the paper discusses mostly Syriac-based systems, its arguments can probably be expanded to other garshunographic systems. Keywords Garshuni – garshunography – allography – writing systems It has long been suggested that cultural identity may have been the cause for the emergence of Garshuni systems. (In the strictest sense of the term, ‘Garshuni’ refers to Arabic texts written in the Syriac script but the term’s semantics were drastically extended to other systems, sometimes ones that have little to do with Syriac—for which see below.) This paper argues for an alterna- tive origin, one that is rooted in functional theory. At its most fundamental level, Garshuni—as a system—is nothing but a tool and as such it ought to be understood with respect to the function it performs. To achieve this, one must take into consideration the social contexts—plural, as there are many—under which each Garshuni system appeared. -
The Unicode Cookbook for Linguists: Managing Writing Systems Using Orthography Profiles
Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2017 The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles Moran, Steven ; Cysouw, Michael DOI: https://doi.org/10.5281/zenodo.290662 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-135400 Monograph The following work is licensed under a Creative Commons: Attribution 4.0 International (CC BY 4.0) License. Originally published at: Moran, Steven; Cysouw, Michael (2017). The Unicode Cookbook for Linguists: Managing writing systems using orthography profiles. CERN Data Centre: Zenodo. DOI: https://doi.org/10.5281/zenodo.290662 The Unicode Cookbook for Linguists Managing writing systems using orthography profiles Steven Moran & Michael Cysouw Change dedication in localmetadata.tex Preface This text is meant as a practical guide for linguists, and programmers, whowork with data in multilingual computational environments. We introduce the basic concepts needed to understand how writing systems and character encodings function, and how they work together. The intersection of the Unicode Standard and the International Phonetic Al- phabet is often not met without frustration by users. Nevertheless, thetwo standards have provided language researchers with a consistent computational architecture needed to process, publish and analyze data from many different languages. We bring to light common, but not always transparent, pitfalls that researchers face when working with Unicode and IPA. Our research uses quantitative methods to compare languages and uncover and clarify their phylogenetic relations. However, the majority of lexical data available from the world’s languages is in author- or document-specific orthogra- phies. -
A Ruse Secluded Character Set for the Source
JOURNAL OF ARCHITECTURE & TECHNOLOGY Issn No : 1006-7930 A Ruse Secluded character set for the Source Mr. J Purna Prakash1, Assistant Professor Mr. M. Rama Raju 2, Assistant Professor Christu Jyothi Institute of Technology & Science Abstract We are rich in data, but information is poor, typically world wide web and data streams. The effective and efficient analysis of data in which is different forms becomes a challenging task. Searching for knowledge to match the exact keyword is big task in Internet such as search engine. Now a days using Unicode Transform Format (UTF) is extended to UTF-16 and UTF-32. With helps to create more special characters how we want. China has GB 18030-character set. Less number of website are using ASCII format in china, recently. While searching some keyword we are unable get the exact webpage in search engine in top place. Issues in certain we face this problem in results announcement, notifications, latest news, latest products released. Mainly on government websites are not shown in the front page. To avoid this trap from common people, we require special character set to match the exact unique keyword. Most of the keywords are encoded with the ASCII format. While searching keyword called cbse net results thousands of websites will have the common keyword as cbse net results. Matching the keyword, it is already encoded in all website as ASCII format. Most of the government websites will not offer search engine optimization. Match a unique keyword in government, banking, Institutes, Online exam purpose. Proposals is to create a character set from A to Z and a to z, for the purpose of data cleaning. -
Rongorongo Script: Carving Techniques and Scribal Corrections
Journal de la Société des Océanistes 129 | juillet-décembre 2009 Varia Rongorongo Script: Carving Techniques and Scribal Corrections Paul Horley Electronic version URL: http://journals.openedition.org/jso/5813 DOI: 10.4000/jso.5813 ISSN: 1760-7256 Publisher Société des océanistes Printed version Date of publication: 15 December 2009 Number of pages: 249-261 ISBN: 978-2-85430-026-0 ISSN: 0300-953x Electronic reference Paul Horley, « Rongorongo Script: Carving Techniques and Scribal Corrections », Journal de la Société des Océanistes [Online], 129 | juillet-décembre 2009, Online since 30 December 2012, connection on 30 April 2019. URL : http://journals.openedition.org/jso/5813 ; DOI : 10.4000/jso.5813 © Tous droits réservés Rongorongo Script: Carving Techniques and Scribal Corrections by Paul HORLEY* ABSTRACT RÉSUMÉ Studies of three original rongorongo tablets (Tahua, L’étude menée sur trois tablettes rongorongo origina- Aruku Kurenga and Mamari) revealed clear traces of les (Tahua, Aruku Kurenga et Mamari) montre claire- two-stage carving (pre-incising with an obsidian flake ment que les signes ont été tracés au cours de deux and contour enhancement with a shark tooth). Most étapes successives : pré-incision avec un éclat d’obsi- probably, the texts were written in short fragments with dienne puis gravure des contours avec une dent de requin. shark-tooth engraving applied before passing to the next Il est probable que de courts fragments de textes étaient fragment. Additional multiple engraving sessions might écrits et gravés à l’aide d’une dent de requin avant de been performed for finished inscription, aiming to tracer le fragment de texte suivant. -
Of Writing Systems in Terms of Typological and Other Criteria: Cross-Linguistic Observations from the German and Japanese Writing Systems
The evolution of writing systems: Empirical and cross-linguistic approaches workshop (AG5) @ DGfS2020, Universität Hamburg, Germany; 4-6 March, 2020 ‘Evolution’ of writing systems in terms of typological and other criteria: Cross-linguistic observations from the German and Japanese writing systems Terry Joyce Dimitrios Meletis Tama University, Japan University of Graz, Austria [email protected] [email protected] Overview Opening remarks Selective sample of writing system (WS) typologies Alternative criteria for evaluating WSs Observations from German (GWS) + Japanese (JWS) Closing remarks Opening remarks 1: Chaos over basic terminology! Erring towards understatement, Gnanadesikan (2017: 15) notes, [t]here is, in general, significant variation in the basic terminology used in the study of writing systems. Indeed, as Meletis (2018: 73) observes regarding the differences between the concepts of WS and orthography, [t]hese terms are often shockingly misused as synonyms, or writing system is not used at all and orthography is employed instead. Similarly, Joyce and Masuda (in press) seek to differentiate between the elusive trinity of terms at heart of WS research; namely, script, WS, and orthography, with particular reference to the JWS. Opening remarks 2: Our working definitions WS1 [Schrifttyp]: Abstract relations (i.e., morphographic, syllabographic, + phonemic), as focus of typologies. WS2 [Schriftsystem]: Common usage for signs + conventions of given language, such as GWS + JWS. Script [Schrift]: Set of material signs for specific language. Orthography [Orthographie]: Mediation between script + WS, but often with prescriptive connotations of correct writing. Graphematic representation: Emerging from grapholinguistic approach, a neutral (ego preferable) alternative to orthography. GWS: Use of extended alphabetic set, as used to represent written German language. -
Research Report on Phonetics and Phonology of Sinhala
Research Report on Phonetics and Phonology of Sinhala Asanka Wasala and Kumudu Gamage Language Technology Research Laboratory, University of Colombo School of Computing, Sri Lanka. [email protected],[email protected] Abstract phonetics and phonology for improving the naturalness and intelligibility for our Sinhala TTS system. This report examines the major characteristics of Many researches have been carried out in the linguistic Sinhala language related to Phonetics and Phonology. domain regarding Sinhala phonetics and phonology. The main topics under study are Segmental and These researches cover both aspects of segmental and Supra-segmental sounds in Spoken Sinhala. The first suprasegmental features of spoken Sinhala. Some of part presents Sinhala Phonemic Inventory, which the themes elaborated in these researches are; Sinhala describes phonemes with their associated features and sound change-rules, Sinhala Prosody, Duration and phonotactics of Sinhala. Supra-segmental features like Stress, and Syllabification. Syllabification, Stress, Pitch and Intonation, followed In the domain of Sinhala Speech Processing, a by the procedure for letter to sound conversion are very little number of researches were reported and described in the latter part. most of them were carried out by undergraduates for The research on Syllabification leads to the their final year projects. Some restricted domain identification of set of rules for syllabifying a given Sinhala TTS systems developed by undergraduates are word. The syllabication algorithm achieved an already working in a number of domains. The study of accuracy of 99.95%. Due to the phonetic nature of the such projects laid the foundation for our project. Sinhala alphabet, the letter-to-phoneme mapping was However, an extensive research was done in all- easily done, however to produce the correct aspects of Sinhala Phonetics and Phonology; in order phonetized version of a word, some modifications identify the areas to be deeply concerned in developing were needed to be done. -
Poorman's Hangul Jamo Input Method
Poorman’s Hangul Jamo Input Method pmhanguljamo.sty Kangsoo Kim 20 Sep 2021 version 0.3.6 Contents 1 Introduction 1 2 Usage 2 2.1 Loading the package .......................... 2 2.2 Commands and Environment Provided ................. 2 2.3 Setting up in your Preamble ....................... 3 3 Transliteration Rule of This Package 4 3.1 Tone Marks and Syllable Serapator ................... 4 3.2 Consonants ............................... 4 3.3 Vowels .................................. 5 3.4 Compatibility Jamos .......................... 6 4 Proper Fonts 6 5 Examples 7 5.1 Modern Hangul ............................. 7 5.2 pre-1933 Hangul ............................ 8 6 The RRK Input Method: An Alternative Way 8 6.1 Transliteration Rule of RRK ...................... 9 6.2 Example of RRK method ........................ 10 7 Further Information 11 8 Acknowledgement 11 1 Introduction 1 This LATEX package provides Hangul transliteration input method, which allows to typeset Korean Letters (Hangul) with the help of proper fonts. The name comes from “Poorman’s Hangul Jamo Input Method.” It is mainly for the people who have a system without Korean keyboard IM, but want to typeset Hangul in their document. Not only 1Hangul is the Korean alphabet to write the Korean language. In both South and North Korea, the standard writing system uses Hangul. 1 modern Hangul, but so-colled “Old Hangul” characters that uses the lost letters such as ‘Arae-A’(ㆍ), ‘Yet Ieung’(ㆁ) or ‘Pan-Sios’(ㅿ) etc. can also be typeset. X LE ATEX or LuaLATEX is required. The legacy pdfTEX is not supported. The Korean Language supporting packages such as xetexko and luatexko (in the ko.TEX bundle) or polyglossia package with Korean support are recommended, but without them typeset- ting Hangul is of no problem with this package pmhanguljamo. -
The Writing Revolution
9781405154062_1_pre.qxd 8/8/08 4:42 PM Page iii The Writing Revolution Cuneiform to the Internet Amalia E. Gnanadesikan A John Wiley & Sons, Ltd., Publication 9781405154062_1_pre.qxd 8/8/08 4:42 PM Page iv This edition first published 2009 © 2009 Amalia E. Gnanadesikan Blackwell Publishing was acquired by John Wiley & Sons in February 2007. Blackwell’s publishing program has been merged with Wiley’s global Scientific, Technical, and Medical business to form Wiley-Blackwell. Registered Office John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom Editorial Offices 350 Main Street, Malden, MA 02148-5020, USA 9600 Garsington Road, Oxford, OX4 2DQ, UK The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, UK For details of our global editorial offices, for customer services, and for information about how to apply for permission to reuse the copyright material in this book please see our website at www.wiley.com/wiley-blackwell. The right of Amalia E. Gnanadesikan to be identified as the author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher. Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books. Designations used by companies to distinguish their products are often claimed as trademarks. -
Simplified Lambda in an English Novel
The Quest for Information Retrieval THE QUEST FOR INFORMATION RETRIEVAL: AN INSCRIBED RELIC FROM ANCIENT RAPA NUI (EASTER ISLAND) – Part I. Tomi S. MELKA (Las Palmas de G.C., Spain) [email protected] Robert M. SCHOCH (Boston, Massachusetts, USA) [email protected] This article focuses on an antique wooden fragment from Easter Island engraved with a single rongorongo (RR) glyph. Based on our analysis, we conclude that the most likely hypothesis regarding this fragment is that it dates to pre-missionary times (prior to 1864) and was part of some type of weapon or fighting instrument, either of a functional or ritual nature. It may represent a portion of a handle for an obsidian-tipped spear or knife. The “bird”-like RR glyph on the fragment belongs in all probability to a category that has previously been connected in the scholarly literature on RR with “slumber”, and hence “waning”, “demise”, and “death”. Its presence on a hand-weapon or ceremonial warlike artifact suggests a meaningful real-world, “non-literary”, context for this glyph. Keywords: “bird”-like motif, glyph of the /600/-class, “Lutz–Terasaki fragment”, mako‘i (Thespesia populnea), mata‘a, patina, pre-missionary era, Rapa Nui (Easter Island), rongorongo script “El desorden y el esparcimiento en que se encuentran los más importantes documentos originales de la cultura pascuense, permite sospechar que en algunos museos y colecciones particulares se encuentren algunas otras piezas auténticas” [The chaos and dispersal in which the most important original documents (= rongorongo artifacts; our note) of Rapanui culture are found, allows us to suspect that a few other authentic pieces may be encountered in some museums and particular collections]. -
Can a Logographic Script Be Simplified? Lessons from the 20Th Century Chinese Writing Reform Informed by Recent Psycholinguistic Research*
Can a Logographic Script be Simplified? Lessons from the 20th Century Chinese Writing Reform Informed by Recent Psycholinguistic Research* Zev Handel University of Washington, Seattle In the 1950s and 1960s, the government of the People’s Republic of China undertook, in two stages, a carefully planned “simplification” of the logographic Chinese script. Drawing on a variety of historical precedents, over 2,000 individual graphs were modified in an attempt to make the script easier to learn and use. This was the first significant change in the official form of the Chinese script in nearly two millennia, and resulted in the script variety that is widely used today in mainland China, commonly termed “simplified Chinese characters.” Drawing on recent psycholinguistic experiments that attempt to characterize the cognitive functions involved in Chinese script processing, this study revisits long-standing questions about the efficacy of character simplification and provides some additional theoretical insights into the nature of logographic writing. The central conclusion of this study is that meaningful simplification of a logographic script is possible, but that today’s simplified character script cannot be characterized as an effective reform by any reasonable metric—it is only “simpler” in the crudest of senses. After evaluating the results of recent studies on the cognitive processing of Chinese characters, I introduce the concept of semantic orthographic depth and argue that a true simplification of a logographic script should be based on regularization of semantic and phonetic components, rather than on reduction of the number of graphs or the reduction of the number of strokes per graph. -
Transmission of the First Alphabets
Scripts in Contact: Transmission of the First Alphabets Sveva Elti di Rodeano Abstract. The alphabet is a type of notation which breaks language into small units, attempting to represent its phonological and/or phonetic repertoire. Given that writing is a way of thinking, based on a cognitive activity that re- quires mental procedures, the aim of this paper is to propose the architecture of the spelling process, described in cognitive and psycholinguistic studies, as a model for the transmission of the alphabet(s). For this purpose, the reflection is focused on the so-called first alphabets in writing systems’ history, i.e., Greek, Anatolian, Italic, and Iberian alphabets, in- troducing the new linguistic and archeological approaches which allow an earlier date and a less decisive role for the Greeks in the alphabets’ introduction. Ancient literatures’ evidence about the teaching of writing and reading are found to match with the current results of cognitive studies about the dynamics of oral reproduction, word recognition, and written reproduction. The pecu- liarities of Asia minor alphabets, which are still unsolved, will be addressed and framed within the spelling process model. 1. Introduction Currently, the alphabet is more widespread than any other system of written language. In company with Chinese characters, the alphabet provides the forms by which all living languages are written: whether Arabic, Bengali, Cyrillic, Devanagari, Greek, Hebrew, Javanese, Latin, or Tibetan or any This paper has been written as contribution to the GC21 Conference, held online due to COVID-19 pandemic. I must thanks all the participants for their direct and undirect contributions for my research, with special regards to Daniel Harbour, Hany Rashwan and Amalia Gnanadesikan.