• Preliminary Proposal to Encode Nandinagari (Pandey) [N4389] • Preliminary Code Chart for the Pau Cin Hau Syllabary (Pande

Total Page:16

File Type:pdf, Size:1020Kb

• Preliminary Proposal to Encode Nandinagari (Pandey) [N4389] • Preliminary Code Chart for the Pau Cin Hau Syllabary (Pande ISO/IEC JTC1/SC2 WG2 N4440 DATE: 31 May 2013 DOC TYPE: Working Group Document TITLE: SEI Liaison Report SOURCE: Deborah Anderson, Script Encoding Initiative, UC Berkeley STATUS: Liaison contribution ACTION: For consideration by WG2 DISTRIBUTION: ISO/IEC JTC1/SC2/WG2 This document serves as a summary of the UC Berkeley Script Encoding Initiative’s recent activities. Proposals or documents currently submitted to the WG2 that have involved SEI assistance include: Preliminary proposal to encode Nandinagari (Pandey) [N4389] Preliminary Code Chart for the Pau Cin Hau Syllabary (Pandey) [N4412] Proposal to encode the Mongolian Square script (Pandey)[N4413] Revised proposal to encode the Soyombo script (Pandey) [N4414] Sharada o Proposal to encode Bakhshali minus sign (Pandey) [N4416] Devanagari o Proposal to encode JAIN OM for Devanagari (Pandey) [N4408] The following script are in the preliminary stage or are still undergoing research and are not yet ready for approval: Afáka (Everson) [N4292] Bagam (Everson) [N4293] Balti ‘A’ and Balti ʹBʹ (Pandey) [N4016, N3842] Bhaiksuki (Pandey) Book Pahlavi Dhives Akuru script (Pandey) [N3848] Coorgi‐Cox (Pandey) [N4287] Garay (Everson) [N4261] Gondi (Pandey) [N4291] Jenticha (Pandey) [N4028] Kawi (Pandey) [N4266] Khambu Rai (Pandey) [N4018] Khema Tamu Phri [Gurung] (Pandey) [N4019] Kirat Rai (Pandey) [N4037] Kpelle (Everson and Riley) [N3762] Landa (Pandey) [N3768] 1 Loma (Everson) Magar Akkha (Pandey) [N4036] Mongolian Square Script (Pandey) Mwangwego (Everson) [N4323] Nandinagari (Pandey) [N4389] Newar (Pandey) [N4184] Pau Cin Hau Logographs (Pandey) Pyu (Pandey) [N3874] Rañjana (Pandey) Rohingya (Pandey) [N4283] Siyaq (4 blocks) (Pandey) Tani Lipi (Pandey) Tikamuli (Pandey) [N3963] Tolong Siki (Pandey) [N3811] Unifon (Everson) [N4262] Woleai (Everson) [N4146] Zou (Pandey) [N4044] Other proposal topics are being investigated. Deborah Anderson currently is encouraging participation from Egyptologists on a project to encode Ptolemaic signs. Other ongoing work includes assistance on the Nushu script proposal and research into other scripts that might be eligible. 2 .
Recommended publications
  • ISO/IEC JTC1/SC2/WG2 N 4823 Date: 2017-05-24
    ISO/IEC JTC1/SC2/WG2 N 4823 Date: 2017-05-24 ISO/IEC JTC1/SC2/WG2 Coded Character Set Secretariat: Japan (JISC) Doc. Type: Disposition of comments Title: Disposition of comments on PDAM1.2 to ISO/IEC 10646 5th edition Source: Michel Suignard (project editor) Project: JTC1 02.10646.00.01.00.05 Status: For review by WG2 Date: 2017-05-24 Distribution: WG2 Reference: SC2 N4518 Medium: Paper, PDF file Comments were received from the following members: China, Ireland, Japan, Mongolia, UK, and USA. The following document is the disposition of those comments. The disposition is organized per country. Note – With some minor exceptions, the full content of the ballot comments has been included in this document to facilitate the reading. The dispositions are inserted in between these comments and are marked in Underlined Bold Serif text, with explanatory text in italicized serif. As a result of this disposition, a new PDAM1.3 ballot will be initiated. It is expected to be the last PDAM ballot for Amendment 1 before a DAM ballot is initiated. Page 1 Following these dispositions, the following changes were done to the Amendment repertoire: Xiangqi game symbols 30 characters removed (U+1F270..U+1F28D) from the Enclose Ideographic Supplement block (U+1F200..U+1F2FF) and replaced by 14 characters (U+1FA60..U+1FA6D) in a new block: Chess Symbols (U+1FA00..U+1FA6F) with names and code points as follows: 1FA60 RED XIANGQI GENERAL 1FA61 RED XIANGQI MANDARIN 1FA62 RED XIANGQI ELEPHANT 1FA63 RED XIANGQI HORSE 1FA64 RED XIANGQI CHARIOT 1FA65 RED XIANGQI CANNON 1FA66 RED XIANGQI SOLDIER 1FA67 BLACK XIANGQI GENERAL 1FA68 BLACK XIANGQI MANDARIN 1FA69 BLACK XIANGQI ELEPHANT 1FA6A BLACK XIANGQI HORSE 1FA6B BLACK XIANGQI CHARIOT 1FA6C BLACK XIANGQI CANNON 1FA6D BLACK XIANGQI SOLDIER Small Historic Kana The characters proposed at 1B127..1B12F are removed from this amendment.
    [Show full text]
  • UTC L2/16‐037 FROM: Deborah Anderson, Ken Whistler
    TO: UTC L2/16‐037 FROM: Deborah Anderson, Ken Whistler, Rick McGowan, Roozbeh Pournader, Andrew Glass, and Laurentiu Iancu SUBJECT: Recommendations to UTC #146 January 2016 on Script Proposals DATE: 22 January 2016 The recommendations below are based on documents available to the members of this group at the time they met, January 19, 2016. EUROPE 1. Latin Document: L2/15‐327 Proposal to add Medievalist punctuation characters – Everson Discussion: We reviewed this document, which requested 21 characters. Many of the proposed characters require more detailed analysis, specifically providing examples that show contrasts in manuscripts, in old transcriptions, and how the marks are represented in text today. Specific comments raised in the discussion: • §1 Introduction. In the list of the proposed characters on pages 1 and 2, include dotted guide‐ lines, which show the placement of the characters in relation to the baseline, mid‐line, and top line, and solid lines separating individual table cells. • §2.2.3. Punctus versus. The text suggests that two glyphs for the same character are being proposed: PUNCTUS VERSUS MARK and LOW PUNCTUS VERSUS MARK. • §2.4 Distinctiones. “Note too that ჻ is the Georgian paragraph separator; no ‘generic’ punctuation mark for that has been encoded.” Is this a request to unify the Latin ჻ with U+10FB Georgian Paragraph Separator? If so, it can be added to ScriptExtensions.txt. • §4 Linebreaking. The assignment of SY as the LB property for DOTTED SOLIDUS should be reviewed by the UTC, since the SY class currently has only one member and it would be prudent to be cautious about adding another member to SY.
    [Show full text]
  • Optimal Clustering Technique for Handwritten Nandinagari Character Recognition
    International Journal of Computer Applications Technology and Research Volume 6–Issue 5, 213-223, 2017, ISSN:-2319–8656 Optimal Clustering Technique for Handwritten Nandinagari Character Recognition Prathima Guruprasad Prof. Dr. Jharna Majumdar Research Scholar, UOM, Sc. G DRDO (Retd.), Dean, Dept. of CSE, NMIT, R&D, Prof. and Head, Dept. of CSE and Center for Gollahalli, Yelahanka, Robotics Research, NMIT, Bangalore, India Bangalore, India Abstract: In this paper, an optimal clustering technique for handwritten Nandinagari character recognition is proposed. We compare two different corner detector mechanisms and compare and contrast various clustering approaches for handwritten Nandinagari characters. In this model, the key interest points on the images which are invariant to Scale, rotation, translation, illumination and occlusion are identified by choosing robust Scale Invariant Feature Transform method(SIFT) and Speeded Up Robust Feature (SURF) transform techniques. We then generate a dissimilarity matrix, which is in turn fed as an input for a set of clustering techniques like K Means, PAM (Partition Around Medoids) and Hierarchical Agglomerative clustering. Various cluster validity measures are used to assess the quality of clustering techniques with an intent to find a technique suitable for these rare characters. On a varied data set of over 1040 Handwritten Nandinagari characters, a careful analysis indicate this combinatorial approach used in a collaborative manner will aid in achieving good recognition accuracy. We find that Hierarchical clustering technique is most suitable for SIFT and SURF features as compared to K Means and PAM techniques. Keywords: Invariant Features, Scale Invariant Feature Transform, Speeded Up Robust Feature technique, Nandinagari Handwritten Character Recognition, Dissimilarity Matrix, Cluster measures, K Means, PAM, Hierarchical Agglomerative Clustering 1.
    [Show full text]
  • Elements of South-Indian Palaeography, from the Fourth To
    This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com ELEMENTS SOUTH-INDIAN PALfi3&BAPBY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEIN1 AN INTRODUCTION TO ?TIK STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. C. BURNELL HON'. PH. O. OF TUE UNIVERSITY M. K. A, ri'VORE PIS I. A SOClfcTE MANGALORE \ BASEL MISSION BOOK & TRACT DEPOSITORY ft !<3 1874 19 Vi? TRUBNER & Co. 57 & 69 LUDOATE HILL' . ' \jj *£=ggs3|fg r DISTRIBUTION of S INDIAN alphabets up to 1550 a d. ELEMENTS OF SOUTH-INDIAN PALEOGRAPHY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEING AN INTRODUCTION TO THE STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. p. j^URNELL HON. PH. D. OF THE UNIVERSITY OF STRASSBUB.G; M. R. A. S.; MEMBKE DE LA S0CIETE ASIATIQUE, ETC. ETC. MANGALORE PRINTED BY STOLZ & HIRNER, BASEL MISSION PRESS 1874 LONDON TRtlBNER & Co. 57 & 59 LUDGATE HILL 3« w i d m « t als ^'ctdjcn kr §anltekcit fiir Mc i|jm bdic<jcnc JJoctorMvk ttcsc fetlings^kit auf rincm fejjcr mtfrckntcn Jfclk bet 1®4 INTRODUCTION. I trust that this elementary Sketch of South-Indian Palaeography may supply a want long felt by those who are desirous of investigating the real history of the peninsula of India. Trom the beginning of this century (when Buchanan executed the only archaeological survey that has ever been done in even a part of the South of India) up to the present time, a number of well meaning persons have gone about with much simplicity and faith collecting a mass of rubbish which they term traditions and accept as history.
    [Show full text]
  • Ori & Mss Library
    ORI & MSS LIBRARY SCHEME AND SYLLABUS M. PHIL MANUSCRIPTOLOGY ORI & MSS LIBRARY, KARIAVATTOM Syllabus - M.Phil Manuscriptology in Malayalam, Tamil & Sanskrit M. Phil Manuscriptology Scheme and Syllabus Semester – I Sl.No. Course code Course Name Credit Marks 1 MSS 711 Paper I - Research Methodology 4 100 2. MSS 712 Paper II- Textual Criticism 4 100 3. MSS 713 Paper III - Writing and Writing materials 4 100 Semester – II 1 MSS 721 Paper IV-Dissertation + Viva voce 20 300 Total marks 600 Total credits 32 (12+20) Semester I Paper I MSS 711 Research Methodology Credit – 4 Marks - 100 Unit – I – Introduction Meaning and Definition of Research Need of Research (26 hrs) Unit – II – Types of Research (50 hrs) Unit - III – Research Process Formulation of Research Problem Hypothis Research Design Data Collection Analysis of Data Centralisation (50 hrs) Unit – IV – Structure of Research Report Preliminary Section The Text End Matter (50 hrs) Suggested Readings : 1. Research in Education - Best W John 2. Research Methodology - Kothari.C.R. 3. Gaveshana Pravacika - Dr.M.V.Vishnu Nambothiri 4. Sakithya Gaveshanathinte Reethi Sastram - Dr.D.Benjamin 5. Methodology of Research - Kulbir Singh Sidhu 6. Research Methods in Social Sciences - Sharma.R.D 7. Thesis and Assignment Writing - Anderson J Durston 8. The Elements of Research in Education - Whitmeu.F.C. 9. Arivum Anubuthiyum - P.V.Velayudhan Pillai 10. Methodology for Research - Joseph A Antony 11. Sakithya Ghaveshanam - Chattnathu Achuthanunni Paper II - MSS 712 - Textual Criticism Credit
    [Show full text]
  • Töwkhön, the Retreat of Öndör Gegeen Zanabazar As a Pilgrimage Site Zsuzsa Majer Budapest
    Töwkhön, The ReTReaT of öndöR GeGeen ZanabaZaR as a PilGRimaGe siTe Zsuzsa Majer Budapest he present article describes one of the revived up to the site is not always passable even by jeep, T Mongolian monasteries, having special especially in winter or after rain. Visitors can reach the significance because it was once the retreat and site on horseback or on foot even when it is not possible workshop of Öndör Gegeen Zanabazar, the main to drive up to the monastery. In 2004 Töwkhön was figure and first monastic head of Mongolian included on the list of the World’s Cultural Heritage Buddhism. Situated in an enchanted place, it is one Sites thanks to its cultural importance and the natural of the most frequented pilgrimage sites in Mongolia beauties of the Orkhon River Valley area. today. During the purges in 1937–38, there were mass Information on the monastery is to be found mainly executions of lamas, the 1000 Mongolian monasteries in books on Mongolian architecture and historical which then existed were closed and most of them sites, although there are also some scattered data totally destroyed. Religion was revived only after on the history of its foundation in publications on 1990, with the very few remaining temple buildings Öndör Gegeen’s life. In his atlas which shows 941 restored and new temples erected at the former sites monasteries and temples that existed in the past in of the ruined monasteries or at the new province and Mongolia, Rinchen marked the site on his map of the subprovince centers. Öwörkhangai monasteries as Töwkhön khiid (No.
    [Show full text]
  • Final Proposal to Encode Nandinagari in Unicode
    L2/17-162 2017-05-05 Final proposal to encode Nandinagari in Unicode Anshuman Pandey [email protected] May 5, 2017 1 Introduction This is a proposal to encode the Nandinagari script in Unicode. It supersedes the following documents: • L2/13-002 “Preliminary Proposal to Encode Nandinagari in ISO/IEC 10646” • L2/16-002 “Proposal to encode the Nandinagari script in Unicode” • L2/16-310 “Proposal to encode the Nandinagari script in Unicode” • L2/17-119 “Towards an encoding model for Nandinagari conjuncts” It incorporates comments regarding previous proposals made in: • L2/16-037 “Recommendations to UTC #146 January 2016 on Script Proposals” • L2/16-057 “Comments on L2/16-002 Proposal to encode Nandinagari” • L2/16-216 “Recommendations to UTC #148 August 2016 on Script Proposals” • L2/17-153 “Recommendations to UTC #151 May 2017 on Script Proposals” • L2/17-117 “Proposal to encode a nasal character in Vedic Extensions” Major changes since L2/16-310 include: • Expanded description of the headstroke and its behavior (see section 3.2). • Clarification of encoding model for consonant conjuncts (see section 5.4). • Removal of digits and proposed unification with Kannada digits (see section 4.9). • Re-analysis of ‘touching’ conjuncts as variant forms that may be controlled using fonts. • Removal of ardhavisarga and other characters that require additional research (see section 4.10). • Identification of a pr̥ ṣṭhamātrā, which is not included in the proposed repertoire (see section 4.10). • Proposed reallocation of a Vedic nasal letter to the ‘Vedic Extensions’ block (see L2/17-117). • Revision of Indic position category for vowel signs.
    [Show full text]
  • Preliminary Proposal to Encode Nandinagari in ISO/IEC 10646
    ISO/IEC JTC1/SC2/WG2 N4389 L2/13-002 2013-01-14 Title: Preliminary Proposal to Encode Nandinagari in ISO/IEC 10646 Source: Script Encoding Initiative (SEI) Author: Anshuman Pandey ([email protected]) Status: Liaison Contribution Action: For consideration by WG2 and UTC Date: 2013-01-14 1 Introduction This is a preliminary proposal to encode Nandinagari in the Universal Character Set (ISO/IEC 10646). It provides a draft character repertoire, names list, and some specimens. Research on the script is ongoing and a formal proposal is forthcoming. Nandinagari is a Brahmi-based script that was used in southern India between the 8th and 19th centuries for producing manuscripts and inscriptions in Sanskrit in south Maharashtra, Karnataka and Andhra Pradesh. It derives from the central group of Nagari scripts and is related to Devanagari. There are several similarities between Nandinagari and Devanagari in terms of character repertoire, glyphic representation, and structure (see the comparison in table 1). However, Nandinagari differs from Devanagari in the shapes of character glyphs, the lack of a connecting headline, and, particularly, in the rendering of con- sonant conjuncts (see figures 14–18; note the shapes of kṣa and jña, and the form of ya as C2). There are also several styles of Nandinagari, which are to be treated as variant forms of the script. As such, Nandinagari cannot be considered a stylistic variant of Devanagari and the various styles of Nandinagari cannot be prop- erly classified as variants of Devanagari. The independent status of Nandinagari is perhaps best articulated by Saraju Rath, who writes: From statements in various early and recent secondary literature [...] one could infer that Nandināgarī, Nāgarī and Devanāgarī are very close and show only minor distinctions.
    [Show full text]
  • Unicode Character Properties
    Unicode character properties Document #: P1628R0 Date: 2019-06-17 Project: Programming Language C++ Audience: SG-16, LEWG Reply-to: Corentin Jabot <[email protected]> 1 Abstract We propose an API to query the properties of Unicode characters as specified by the Unicode Standard and several Unicode Technical Reports. 2 Motivation This API can be used as a foundation for various Unicode algorithms and Unicode facilities such as Unicode-aware regular expressions. Being able to query the properties of Unicode characters is important for any application hoping to correctly handle any textual content, including compilers and parsers, text editors, graphical applications, databases, messaging applications, etc. static_assert(uni::cp_script('C') == uni::script::latin); static_assert(uni::cp_block(U'[ ') == uni::block::misc_pictographs); static_assert(!uni::cp_is<uni::property::xid_start>('1')); static_assert(uni::cp_is<uni::property::xid_continue>('1')); static_assert(uni::cp_age(U'[ ') == uni::version::v10_0); static_assert(uni::cp_is<uni::property::alphabetic>(U'ß')); static_assert(uni::cp_category(U'∩') == uni::category::sm); static_assert(uni::cp_is<uni::category::lowercase_letter>('a')); static_assert(uni::cp_is<uni::category::letter>('a')); 3 Design Consideration 3.1 constexpr An important design decision of this proposal is that it is fully constexpr. Notably, the presented design allows an implementation to only link the Unicode tables that are actually used by a program. This can reduce considerably the size requirements of an Unicode-aware executable as most applications often depend on a small subset of the Unicode properties. While the complete 1 Unicode database has a substantial memory footprint, developers should not pay for the table they don’t use. It also ensures that developers can enforce a specific version of the Unicode Database at compile time and get a consistent and predictable run-time behavior.
    [Show full text]
  • General Historical and Analytical / Writing Systems: Recent Script
    9 Writing systems Edited by Elena Bashir 9,1. Introduction By Elena Bashir The relations between spoken language and the visual symbols (graphemes) used to represent it are complex. Orthographies can be thought of as situated on a con- tinuum from “deep” — systems in which there is not a one-to-one correspondence between the sounds of the language and its graphemes — to “shallow” — systems in which the relationship between sounds and graphemes is regular and trans- parent (see Roberts & Joyce 2012 for a recent discussion). In orthographies for Indo-Aryan and Iranian languages based on the Arabic script and writing system, the retention of historical spellings for words of Arabic or Persian origin increases the orthographic depth of these systems. Decisions on how to write a language always carry historical, cultural, and political meaning. Debates about orthography usually focus on such issues rather than on linguistic analysis; this can be seen in Pakistan, for example, in discussions regarding orthography for Kalasha, Wakhi, or Balti, and in Afghanistan regarding Wakhi or Pashai. Questions of orthography are intertwined with language ideology, language planning activities, and goals like literacy or standardization. Woolard 1998, Brandt 2014, and Sebba 2007 are valuable treatments of such issues. In Section 9.2, Stefan Baums discusses the historical development and general characteristics of the (non Perso-Arabic) writing systems used for South Asian languages, and his Section 9.3 deals with recent research on alphasyllabic writing systems, script-related literacy and language-learning studies, representation of South Asian languages in Unicode, and recent debates about the Indus Valley inscriptions.
    [Show full text]
  • India & Mongolia in the Middle Ages – More Than Just a Connection
    Ancient History of Asian Countries India & Mongolia in the Middle Ages – More Than Just a Connection By Mohan Gopal Author Mohan Gopal The Taj Mahal area and traced his lineage to a line of Turkic-Mongol warlords who alternately plagued, plundered, ruled and governed in greater or If there is one monument which conjures up an image of India, it lesser components a vast region which roughly spans the areas of is the Taj Mahal. It was constructed c. 1631 at the behest of Emperor present-day Turkey, southern Russia, the northern Middle East, Shah Jahan as the ultimate memorial and place of eternal rest for his central Asia, northern Iran, Afghanistan, Pakistan and northern India. beloved wife, Mumtaj. The Taj, as it is commonly referred to, is for The most commonly known name in this lineage was Tamerlane, romanticists across the globe, the ultimate architectural poem of love born in 1336 in central Asia, also known as Timur the Lame, Timur etched in white marble with floral motifs inlaid with precious stones; the Great or Timur the Horrible, depending on which perspective one a monument of perfect geometrical balance and symmetry. In this took – that of his huge territorial conquests or the countless and mausoleum lie the tombs of Emperor Shah Jahan and his beloved endless massacres and lootings which formed the basis of them. queen. Babur chose the former perspective and with pride considered It may come as a surprise to many that this widely admired, loved himself a Timurid, a descendant of the mighty Timur. and romanticized world artefact which has come to be known as a Babur’s mother was Qutlugh Nigar Khanum.
    [Show full text]
  • Preliminary Agenda
    ISO/IEC JTC 1/SC 2/WG 2 N4505-A DATE: 2014-02-23 ISO/IEC JTC 1/SC 2/WG 2 Universal Multiple-Octet Coded Character Set (UCS) - ISO/IEC 10646 Secretariat: ANSI DOC TYPE: Preliminary Agenda TITLE: Preliminary Agenda Meeting # 62 SOURCE: Mike Ksar, Convener PROJECT: JTC 1.02.18 – ISO/IEC 10646 STATUS: ACTION ID: ACT – Review preliminary agenda in document N4505-A and provide feedback to Convener DUE DATE: 2014-02-18 DISTRIBUTION: SC2/WG2 members and Liaison organizations MEDIUM: Electronic NO. OF PAGES: 4 Below is a copy of the preliminary agenda for the upcoming meeting 62 of JTC1/SC2/WG2. Please review and provide feedback. Mike Ksar ISO/IEC/JTC 1/SC 2/WG 2 Convener Phone: +1 408 255-1217 22680 Alcalde Rd. e-mail: [email protected] Cupertino, CA 95014 – U. S. A. ISO/IEC JTC 1/SC 2/WG 2 N4505-A DATE: 2014-02-23 Preliminary Agenda – Meeting # 62 Topic (Document No.) Proposed Outcome 1. Opening and roll call (N4401) Update Distribution List 2. Approval of the agenda (N4505-A) Approved agenda 3. Approval of minutes of meeting 61 (N4403) Approved Minutes 4. Review action items from previous meeting (N4403-AI) Updated Action Item List 5. JTC1 and ITTF matters 6. SC2 matters: 6.1. SC2 Program of Work FYI 6.2. FDAM2 Results of 3rd edition – 100% approved (N4532) FYI 6.3. Results of PDAM1 subdivision proposal (N4531) FYI 6.4. Summary of Voting DIS – 4th edition (N4524 & N4524-A) Consider and progress 6.5. Draft additional Repertoire DIS – 4th edition (N4459) Consider and Progress 6.6.
    [Show full text]