On the History of Thai Scripts Hans Penth
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
SC22/WG20 N896 L2/01-476 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale De Normalisation
SC22/WG20 N896 L2/01-476 Universal multiple-octet coded character set International organization for standardization Organisation internationale de normalisation Title: Ordering rules for Khmer Source: Kent Karlsson Date: 2001-12-19 Status: Expert contribution Document type: Working group document Action: For consideration by the UTC and JTC 1/SC 22/WG 20 1 Introduction The Khmer script in Unicode/10646 uses conjoining characters, just like the Indic scripts and Hangul. An alternative that is suitable (and much more elegant) for Khmer, but not for Indic scripts, would have been to have combining- below (and sometimes a bit to the side) consonants and combining-below independent vowels, similar to the combining-above Latin letters recently encoded, as well as how Tibetan is handled. However that is not the chosen solution, which instead uses a combining character (COENG) that makes characters conjoin (glue) like it is done for Indic (Brahmic) scripts and like Hangul Jamo, though the latter does not have a separate gluer character. In the Khmer script, using the COENG based approach, the words are formed from orthographic syllables, where an orthographic syllable has the following structure [add ligation control?]: Khmer-syllable ::= (K H)* K M* where K is a Khmer consonant (most with an inherent vowel that is pronounced only if there is no consonant, independent vowel, or dependent vowel following it in the orthographic syllable) or a Khmer independent vowel, H is the invisible Khmer conjoint former COENG, M is a combining character (including COENG, though that would be a misspelling), in particular a combining Khmer vowel (noted A below) or modifier sign. -
418338 1 En Bookbackmatter 205..225
Glossary Abhayamudra A style of keeping hands while sitting Abhidhama The Abhidhamma Pitaka is a detailed scholastic reworking of material appearing in the Suttas, according to schematic classifications. It does not contain systematic philosophical treatises, but summaries or enumerated lists. The other two collections are the Sutta Pitaka and the Vinaya Pitaka Abhog It is the fourth part of a composition. The last movement gradually goes back to the sthayi after completion of the paraphrasing and improvisation of the composition, which can cover even three octaves in the recital of a master performer Acharya A teacher or a tutor who is the symbol of wisdom Addhayoga One of seven kinds of lodgings where monks are allowed to live. Addhayoga is a building with a roof sloping on either one side or both. It is shaped like wings of the Garuda Agganna-sutta AggannaSutta is the 27th Sutta of the Digha Nikaya collection. The sutta describes a discourse imparted by the Buddha to two Brahmins, Bharadvaja, and Vasettha, who left their family and caste to become monks Ahankar Haughtiness, self-importance A-hlu-khan mandap Burmese term, a temporary pavilion to receive donation Akshamala A japa mala or mala (meaning garland) which is a string of prayer beads commonly used by Hindus, Buddhists, and some Sikhs for the spiritual practice known in Sanskrit as japa. It is usually made from 108 beads, though other numbers may also be used Amulets An ornament or small piece of jewellery thought to give protection against evil, danger, or disease. Clay tablets have also been used as amulets. -
Computers and the Thai Language
[3B2-6] man2009010046.3d 12/2/09 13:47 Page 46 Computers and the Thai Language Hugh Thaweesak Koanantakool National Science and Technology Development Agency Theppitak Karoonboonyanan Thai Linux Working Group Chai Wutiwiwatchai National Electronics and Computer Technology Center This article explains the history of Thai language development for computers, examining such factors as the language, script, and writing system, among others. The article also analyzes characteristics of Thai characters and I/O methods, and addresses key issues involved in Thai text processing. Finally, the article reports on language processing research and provides detailed information on Thai language resources. Thai is the official language of Thailand. Certain vowels, all tone marks, and diacritics The Thai script system has been used are written above and below the main for Thai, Pali, and Sanskrit languages in Bud- character. dhist texts all over the country. Standard Pronunciation of Thai words does not Thai is used in all schools in Thailand, and change with their usage, as each word has a most dialects of Thai use the same script. fixed tone. Changing the tone of a syllable Thai is the language of 65 million people, may lead to a totally different meaning. and has a number of regional dialects, such Thai verbs do not change their forms as as Northeastern Thai (or Isan; 15 million with tense, gender, and singular or plural people), Northern Thai (or Kam Meuang or form,asisthecaseinEuropeanlanguages.In- Lanna; 6 million people), Southern Thai stead, there are other additional words to help (5 million people), Khorat Thai (400,000 with the meaning for tense, gender, and sin- people), and many more variations (http:// gular or plural. -
Khmer Phonetics & Phonology: Theoretical Implications for ESL Instruction
Running Head: KHMER PHONETICS AND PHONOLOGY 1 Khmer Phonetics & Phonology: Theoretical Implications for ESL Instruction Alex Donley A Senior Thesis submitted in partial fulfillment of the requirements for graduation in the Honors Program Liberty University Spring 2020 KHMER PHONETICS AND PHONOLOGY 2 Acceptance of Senior Honors Thesis This Senior Honors Thesis is accepted in partial fulfillment of the requirements for graduation from the Honors Program of Liberty University. ______________________________ Jaeshil Kim, Ph.D. Thesis Chair ______________________________ Stephanie Blankenship, Ed.D. Committee Member ______________________________ David Schweitzer, Ph.D. Assistant Honors Director ______________________________ Date KHMER PHONETICS AND PHONOLOGY 3 Abstract This thesis develops an approach to English teaching for Khmer-speaking students that centers on Khmer phonetics and phonology. Cambodia has a strong demand for English instruction, but consistently underperforms next to other nations in terms of proficiency. A significant reason for Cambodia’s skill gap is the lack of research into linguistic hurdles Khmer speakers face when learning English. This paper aims to bridge Khmer and English with an understanding of the speech systems that both languages use before turning to the unique challenges Khmer speakers must overcome based on the tenets of L1 Transfer Theory. It closes by outlining strategies for English teachers to build the comprehensibility and confidence of their Khmer-speaking students. Keywords: Khmer, English, phonetics, phonology, transfer, ESL KHMER PHONETICS AND PHONOLOGY 4 Khmer Phonetics and Phonology: Theoretical Implications for ESL Instruction Introduction This thesis develops an approach to English teaching for Khmer-speaking students that is grounded in a thorough understanding of Khmer phonetics and phonology. -
Khmer Romanization Table
KHMER Consonants ʻAksar Mul script ʻAksar Mul script Full Full Full Full Form Subscript form Subscript Romanization Form Subscript form Subscript Romanization ក ◌ក ក ◌្ក k ទ ◌ទ ទ ◌្ទ d ខ ◌ខ ខ ◌្ខ kh ធ ◌ធ ធ ◌្ធ dh គ ◌គ គ ◌្គ g ន ◌ន ន ◌្ន n ឃ ◌ឃ ឃ ◌ឃ gh ប ◌ប ប ◌ប p ង ◌ង ង ◌្ង ng ផ ◌ផ ផ ◌្ផ ph ច ◌ច ច ◌្ច c ព ◌ព ព ◌្ព b ឆ ◌ឆ ឆ ◌្ឆ ch ភ ◌ភ ភ ◌្ភ bh ជ ◌ជ ជ ◌្ជ j ម ◌ម ម ◌្ម m ឈ ◌ឈ ឈ ◌ឈ jh យ ◌យ យ ◌យ y ញ ◌ញ or ញ ញ ◌្ញ or ញ ñ រ ្រ◌ រ ្រ◌ r ដ ◌្ត ដ ◌្ត ṭ ល ◌្ល ល ◌្ល l ឋ ◌្ឋ ឋ ◌្ឋ ṭh វ ◌្វ វ ◌្វ v ឌ ◌្ឌ ឌ ◌្ឌ ḍ ឝ ◌គ ឝ ◌្គ ś * ឍ ◌ ឍ ◌ឍ ḍh ឞ ◌ ឞ ◌្ឞ s*̣ ណ ◌្ណ ណ ◌្ណ ṇ ស ◌ ស ◌ស s ត ◌្ត ត ◌្ត t ហ ◌្ហ ហ ◌្ហ h ḷ (l with ថ ◌ថ ថ ◌្ថ th ឡ - ឡ - dot below) ‛ ʹ (ayn + អ ◌្អ អ ◌្អ soft sign) * Not used since the mid-17th century and is mainly used for Pali and Sanskrit transliteration. Vowels Independent Romanization Independent Romanization ឥ i ឭ ḷ ឦ ī ឮ ḹ ឧ u ឯ ae ឩ ū ឰ ai ឪ ýu ឱ o ឫ ṛ ឳ au ឬ ṝ Dependent Romanization Dependent Romanization ◌◌ ʹaʹ ែ◌ ʹae ◌ា ʹā ៃ◌ ʹai ◌ិ ʹi េ◌ ʹo ◌ី ʹī េ◌ ʹau ◌ឹ ʹẏ ◌ុ◌ំ ʹuṃ ȳ ṃ ◌ឺ ʹ ◌ំ ʹa ◌ុ ʹu ◌ា◌ំ ʹāṃ ◌ូ ʹū ◌ះ ʹaḥ ◌ួ ʹua ◌ិ◌ះ ʹiḥ ʹẏ េ◌ើ ʹoe ◌ឹ◌ះ ḥ ẏ ʹu េ◌ឿ ʹ a ◌ុ◌ះ ḥ េ◌ៀ ʹia េ◌◌ះ ʹeḥ េ◌ ʹe េ◌◌ះ ʹoaḥ Diacritical marks Vernacular Alternative Romanization ◌៉ ◌ុ ″ (hard sign) ◌៊ ◌ុ ′ (soft sign (prime)) ◌៌ r ◌៍ ̊ (circle above) (see Note 5) ◌៎ ’ (alif) ◌៏ ʻ (ayn) ◌៱ ˙ (dot above) (see Note 5) ◌័◌ ă (breve) ◌ៈ à (combining grave accent) ◌◌់ á (combining acute accent) ◌ា◌់ â (modified letter circumflex) Notes 1. -
Representing Myanmar in Unicode Details and Examples Version 3
Representing Myanmar in Unicode Details and Examples Version 3 Martin Hosken1 Table of Contents Introduction................................................................................................................................................2 Unicode 5.1 Model.....................................................................................................................................4 Advanced Issues.......................................................................................................................................11 Languages.................................................................................................................................................14 Burmese....................................................................................................................................................15 Old Burmese.............................................................................................................................................18 Sanskrit/Pali..............................................................................................................................................20 Mon...........................................................................................................................................................22 Sgaw Karen...............................................................................................................................................24 Western Pwo Karen..................................................................................................................................26 -
(RSEP) Request October 16, 2017 Registry Operator INFIBEAM INCORPORATION LIMITED 9Th Floor
Registry Services Evaluation Policy (RSEP) Request October 16, 2017 Registry Operator INFIBEAM INCORPORATION LIMITED 9th Floor, A-Wing Gopal Palace, NehruNagar Ahmedabad, Gujarat 380015 Request Details Case Number: 00874461 This service request should be used to submit a Registry Services Evaluation Policy (RSEP) request. An RSEP is required to add, modify or remove Registry Services for a TLD. More information about the process is available at https://www.icann.org/resources/pages/rsep-2014- 02-19-en Complete the information requested below. All answers marked with a red asterisk are required. Click the Save button to save your work and click the Submit button to submit to ICANN. PROPOSED SERVICE 1. Name of Proposed Service Removal of IDN Languages for .OOO 2. Technical description of Proposed Service. If additional information needs to be considered, attach one PDF file Infibeam Incorporation Limited (“infibeam”) the Registry Operator for the .OOO TLD, intends to change its Registry Service Provider for the .OOO TLD to CentralNic Limited. Accordingly, Infibeam seeks to remove the following IDN languages from Exhibit A of the .OOO New gTLD Registry Agreement: - Armenian script - Avestan script - Azerbaijani language - Balinese script - Bamum script - Batak script - Belarusian language - Bengali script - Bopomofo script - Brahmi script - Buginese script - Buhid script - Bulgarian language - Canadian Aboriginal script - Carian script - Cham script - Cherokee script - Coptic script - Croatian language - Cuneiform script - Devanagari script -
Benchmarking of Document Image Analysis Tasks for Palm Leaf Manuscripts from Southeast Asia
Journal of Imaging Article Benchmarking of Document Image Analysis Tasks for Palm Leaf Manuscripts from Southeast Asia Made Windu Antara Kesiman 1,2,* ID , Dona Valy 3,4, Jean-Christophe Burie 1, Erick Paulus 5, Mira Suryani 5, Setiawan Hadi 5, Michel Verleysen 2, Sophea Chhun 4 and Jean-Marc Ogier 1 1 Laboratoire Informatique Image Interaction (L3i), Université de La Rochelle, 17042 La Rochelle, France; [email protected] (J.-C.B.); [email protected] (J.-M.O.) 2 Laboratory of Cultural Informatics (LCI), Universitas Pendidikan Ganesha, Singaraja, Bali 81116, Indonesia; [email protected] 3 Institute of Information and Communication Technologies, Electronic, and Applied Mathematics (ICTEAM), Université Catholique de Louvain, 1348 Louvain-la-Neuve, Belgium; [email protected] 4 Department of Information and Communication Engineering, Institute of Technology of Cambodia, Phnom Penh, Cambodia; [email protected] 5 Department of Computer Science, Universitas Padjadjaran, Bandung 45363, Indonesia; [email protected] (E.P.); [email protected] (M.S.); [email protected] (S.H.) * Correspondence: [email protected] Received: 15 December 2017; Accepted: 18 February 2018; Published: 22 February 2018 Abstract: This paper presents a comprehensive test of the principal tasks in document image analysis (DIA), starting with binarization, text line segmentation, and isolated character/glyph recognition, and continuing on to word recognition and transliteration for a new and challenging collection of palm leaf manuscripts from Southeast Asia. This research presents and is performed on a complete dataset collection of Southeast Asian palm leaf manuscripts. It contains three different scripts: Khmer script from Cambodia, and Balinese script and Sundanese script from Indonesia. -
3. Pronunciation & Phonetic Letters Guide
หน้า ๒๙ : Nâa 29 PRONUNCIATION GUIDE It is important to read this section thoroughly if your aim is to learn to speak Thai using phonetic letters or transliteration. Skip this only if you can read Thai script. What are Phonetic Letters? There are many different practices for learning to speak a language without learning the scripts of the language. Romanisation is the conversion of writing from a different writing system to the Roman (Latin) script, or a system for doing so. Methods of romanisation include transliteration, for representing written text, and transcription, for representing the spoken word, and combinations of both. Phonetic transcription (also known as Phonetic Script or Phonetic Notation) is the visual representation of speech sounds (or phones). The most common type of phonetic transcription use a phonetic alphabet such as the International Phonetic Alphabet (IPA). Transliteration is the practice of converting a text from one script into another by writing or printing (a letter or word) using the closest corresponding letters of a different alphabet or language. Royal Thai Phonetic Alphabet (RPA) is a system used as a standard for transcribing Thai words, names of a person, place, thing, into English. Romanisation In Thai is called ทับศัพท์ : Túb~Sùb or ทับเสียง : Túb~Sĕarng and colloquially called ภาษาคาราโอเกะ : Paa-săa Kaa-raa-o-ge’ = Karaoke language. (ทับ : Túb = place on top of, ศัพท์ : Sùb = vocabulary, เสียง : Sĕang = sound, คาราโอเกะ : Kaa- raa-o-ge’ = karaoke) At Thai Style, we have developed our own system using the English alphabet as phonetic letters. This is because we would like to compare the sounds in English and Thai by using the English alphabet, like if you are a native English speaker learning French which uses the same alphabet but some letters are pronounced with completely different sounds. -
The Tai-Kadai Languages Resources for Thai Language Research
This article was downloaded by: 10.3.98.104 On: 26 Sep 2021 Access details: subscription number Publisher: Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: 5 Howick Place, London SW1P 1WG, UK The Tai-Kadai Languages Anthony V. N. Diller, Jerold A. Edmondson, Yongxian Luo Resources for Thai Language Research Publication details https://www.routledgehandbooks.com/doi/10.4324/9780203641873.ch3 Anthony Diller Published online on: 11 Jun 2008 How to cite :- Anthony Diller. 11 Jun 2008, Resources for Thai Language Research from: The Tai- Kadai Languages Routledge Accessed on: 26 Sep 2021 https://www.routledgehandbooks.com/doi/10.4324/9780203641873.ch3 PLEASE SCROLL DOWN FOR DOCUMENT Full terms and conditions of use: https://www.routledgehandbooks.com/legal-notices/terms This Document PDF may be used for research, teaching and private study purposes. Any substantial or systematic reproductions, re-distribution, re-selling, loan or sub-licensing, systematic supply or distribution in any form to anyone is expressly forbidden. The publisher does not give any warranty express or implied or make any representation that the contents will be complete or accurate or up to date. The publisher shall not be liable for an loss, actions, claims, proceedings, demand or costs or damages whatsoever or howsoever caused arising directly or indirectly in connection with or arising out of the use of this material. PART 2 TAI LANGUAGES: OVERVIEWS AND RESOURCES Downloaded By: 10.3.98.104 At: 19:02 26 Sep 2021; For: 9780203641873, chapter3, 10.4324/9780203641873.ch3 29 This page intentionally left blank Downloaded By: 10.3.98.104 At: 19:02 26 Sep 2021; For: 9780203641873, chapter3, 10.4324/9780203641873.ch3 30 CHAPTER THREE 3. -
The Unicode Standard, Version 3.0, Issued by the Unicode Consor- Tium and Published by Addison-Wesley
The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or other- wise, without the prior written permission of the publisher or Unicode, Inc. -
First Revised Proposal Transliteration of Akson1-Tham-Isan and Akson
1 First Revised Proposal Transliteration of Akson1-Tham-Isan and Akson-Thai-Noi 1 Scope The transliteration system presented here describes the orthographic system of the Akson- Tham-Isan and Akson-Thai-Noi scripts using Romanized characters consistent with the provisions of the International Standards Organization. The Akson-Tham-Isan and Akson-Thai-Noi characters of this standard were developed as modem representations of ancient scripts found in inscriptions and palm leaf religious texts and adapted by the Royal Institute of Thailand2. The selection of Romanized characters follows, to the extent possible, the phonemic/phonetic representations used in the transliteration of Standard Thai as described in the International Standard of the ISO 11940, thus enabling consistency of system and economy of codes. In this system transliteration principles are applied stringently to enable complete unambiguous reversibility in the conversion of characters. Although accurate pronunciation may not always result in the application of this system, because the original Akson-Tham-Isan and Akson-Thai-Noi characters can be regenerated automatically from the Romanized representation, those with knowledge of the languages will be able to correctly pronounce the Romanized graphemes. 2 Normative reference The following standard contains provisions which, through reference in this text, constitute provisions of this International Standard. At the time of publication, the edition indicated was valid. All standards are subject to revision, and parties to agreements based on this International Standard are encouraged to investigate the possibility of applying the most recent edition of the standard indicated below. Members of IEC and ISO maintain registers of currently valid International Standards.