The Tonal Comparative Method: Tai Tone in Historical Perspective
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Cross-Language Framework for Word Recognition and Spotting of Indic Scripts
Cross-language Framework for Word Recognition and Spotting of Indic Scripts aAyan Kumar Bhunia, bPartha Pratim Roy*, aAkash Mohta, cUmapada Pal aDept. of ECE, Institute of Engineering & Management, Kolkata, India bDept. of CSE, Indian Institute of Technology Roorkee India cCVPR Unit, Indian Statistical Institute, Kolkata, India bemail: [email protected], TEL: +91-1332-284816 Abstract Handwritten word recognition and spotting of low-resource scripts are difficult as sufficient training data is not available and it is often expensive for collecting data of such scripts. This paper presents a novel cross language platform for handwritten word recognition and spotting for such low-resource scripts where training is performed with a sufficiently large dataset of an available script (considered as source script) and testing is done on other scripts (considered as target script). Training with one source script and testing with another script to have a reasonable result is not easy in handwriting domain due to the complex nature of handwriting variability among scripts. Also it is difficult in mapping between source and target characters when they appear in cursive word images. The proposed Indic cross language framework exploits a large resource of dataset for training and uses it for recognizing and spotting text of other target scripts where sufficient amount of training data is not available. Since, Indic scripts are mostly written in 3 zones, namely, upper, middle and lower, we employ zone-wise character (or component) mapping for efficient learning purpose. The performance of our cross- language framework depends on the extent of similarity between the source and target scripts. -
LCSH Section K
K., Rupert (Fictitious character) Motion of K stars in line of sight Ka-đai language USE Rupert (Fictitious character : Laporte) Radial velocity of K stars USE Kadai languages K-4 PRR 1361 (Steam locomotive) — Orbits Ka’do Herdé language USE 1361 K4 (Steam locomotive) UF Galactic orbits of K stars USE Herdé language K-9 (Fictitious character) (Not Subd Geog) K stars—Galactic orbits Ka’do Pévé language UF K-Nine (Fictitious character) BT Orbits USE Pévé language K9 (Fictitious character) — Radial velocity Ka Dwo (Asian people) K 37 (Military aircraft) USE K stars—Motion in line of sight USE Kadu (Asian people) USE Junkers K 37 (Military aircraft) — Spectra Ka-Ga-Nga script (May Subd Geog) K 98 k (Rifle) K Street (Sacramento, Calif.) UF Script, Ka-Ga-Nga USE Mauser K98k rifle This heading is not valid for use as a geographic BT Inscriptions, Malayan K.A.L. Flight 007 Incident, 1983 subdivision. Ka-houk (Wash.) USE Korean Air Lines Incident, 1983 BT Streets—California USE Ozette Lake (Wash.) K.A. Lind Honorary Award K-T boundary Ka Iwi National Scenic Shoreline (Hawaii) USE Moderna museets vänners skulpturpris USE Cretaceous-Paleogene boundary UF Ka Iwi Scenic Shoreline Park (Hawaii) K.A. Linds hederspris K-T Extinction Ka Iwi Shoreline (Hawaii) USE Moderna museets vänners skulpturpris USE Cretaceous-Paleogene Extinction BT National parks and reserves—Hawaii K-ABC (Intelligence test) K-T Mass Extinction Ka Iwi Scenic Shoreline Park (Hawaii) USE Kaufman Assessment Battery for Children USE Cretaceous-Paleogene Extinction USE Ka Iwi National Scenic Shoreline (Hawaii) K-B Bridge (Palau) K-TEA (Achievement test) Ka Iwi Shoreline (Hawaii) USE Koro-Babeldaod Bridge (Palau) USE Kaufman Test of Educational Achievement USE Ka Iwi National Scenic Shoreline (Hawaii) K-BIT (Intelligence test) K-theory Ka-ju-ken-bo USE Kaufman Brief Intelligence Test [QA612.33] USE Kajukenbo K. -
SC22/WG20 N896 L2/01-476 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale De Normalisation
SC22/WG20 N896 L2/01-476 Universal multiple-octet coded character set International organization for standardization Organisation internationale de normalisation Title: Ordering rules for Khmer Source: Kent Karlsson Date: 2001-12-19 Status: Expert contribution Document type: Working group document Action: For consideration by the UTC and JTC 1/SC 22/WG 20 1 Introduction The Khmer script in Unicode/10646 uses conjoining characters, just like the Indic scripts and Hangul. An alternative that is suitable (and much more elegant) for Khmer, but not for Indic scripts, would have been to have combining- below (and sometimes a bit to the side) consonants and combining-below independent vowels, similar to the combining-above Latin letters recently encoded, as well as how Tibetan is handled. However that is not the chosen solution, which instead uses a combining character (COENG) that makes characters conjoin (glue) like it is done for Indic (Brahmic) scripts and like Hangul Jamo, though the latter does not have a separate gluer character. In the Khmer script, using the COENG based approach, the words are formed from orthographic syllables, where an orthographic syllable has the following structure [add ligation control?]: Khmer-syllable ::= (K H)* K M* where K is a Khmer consonant (most with an inherent vowel that is pronounced only if there is no consonant, independent vowel, or dependent vowel following it in the orthographic syllable) or a Khmer independent vowel, H is the invisible Khmer conjoint former COENG, M is a combining character (including COENG, though that would be a misspelling), in particular a combining Khmer vowel (noted A below) or modifier sign. -
Schwa Deletion: Investigating Improved Approach for Text-To-IPA System for Shiri Guru Granth Sahib
ISSN (Online) 2278-1021 ISSN (Print) 2319-5940 International Journal of Advanced Research in Computer and Communication Engineering Vol. 4, Issue 4, April 2015 Schwa Deletion: Investigating Improved Approach for Text-to-IPA System for Shiri Guru Granth Sahib Sandeep Kaur1, Dr. Amitoj Singh2 Pursuing M.E, CSE, Chitkara University, India 1 Associate Director, CSE, Chitkara University , India 2 Abstract: Punjabi (Omniglot) is an interesting language for more than one reasons. This is the only living Indo- Europen language which is a fully tonal language. Punjabi language is an abugida writing system, with each consonant having an inherent vowel, SCHWA sound. This sound is modifiable using vowel symbols attached to consonant bearing the vowel. Shri Guru Granth Sahib is a voluminous text of 1430 pages with 511,874 words, 1,720,345 characters, and 28,534 lines and contains hymns of 36 composers written in twenty-two languages in Gurmukhi script (Lal). In addition to text being in form of hymns and coming from so many composers belonging to different languages, what makes the language of Shri Guru Granth Sahib even more different from contemporary Punjabi. The task of developing an accurate Letter-to-Sound system is made difficult due to two further reasons: 1. Punjabi being the only tonal language 2. Historical and Cultural circumstance/period of writings in terms of historical and religious nature of text and use of words from multiple languages and non-native phonemes. The handling of schwa deletion is of great concern for development of accurate/ near perfect system, the presented work intend to report the state-of-the-art in terms of schwa deletion for Indian languages, in general and for Gurmukhi Punjabi, in particular. -
Volume 4-2:2011
JSEALS Journal of the Southeast Asian Linguistics Society Managing Editor: Paul Sidwell (Pacific Linguistics, Canberra) Editorial Advisory Board: Mark Alves (USA) George Bedell (Thailand) Marc Brunelle (Canada) Gerard Diffloth (Cambodia) Marlys Macken (USA) Brian Migliazza (USA) Keralapura Nagaraja (India) Peter Norquest (USA) Amara Prasithrathsint (Thailand) Martha Ratliff (USA) Sophana Srichampa (Thailand) Justin Watkins (UK) JSEALS is the peer-reviewed journal of the Southeast Asian Linguistics Society, and is devoted to publishing research on the languages of mainland and insular Southeast Asia. It is an electronic journal, distributed freely by Pacific Linguistics (www.pacling.com) and the JSEALS website (jseals.org). JSEALS was formally established by decision of the SEALS 17 meeting, held at the University of Maryland in September 2007. It supersedes the Conference Proceedings, previously published by Arizona State University and later by Pacific Linguistics. JSEALS welcomes articles that are topical, focused on linguistic (as opposed to cultural or anthropological) issues, and which further the lively debate that characterizes the annual SEALS conferences. Although we expect in practice that most JSEALS articles will have been presented and discussed at the SEALS conference, submission is open to all regardless of their participation in SEALS meetings. Papers are expected to be written in English. Each paper is reviewed by at least two scholars, usually a member of the Advisory Board and one or more independent readers. Reviewers are volunteers, and we are grateful for their assistance in ensuring the quality of this publication. As an additional service we also admit data papers, reports and notes, subject to an internal review process. -
Thai Song Dam, Wang Nam Village, Wang Yang Subdistrict, Khlong Khlung District, Kamphaeng Phet Province
th The 10 International Conference on Art and Culture Network 11-13 march 2020, Kamphaeng Phet Rajabhat University, Thailand Thai Song Dam, Wang Nam Village, Wang Yang Subdistrict, Khlong Khlung District, Kamphaeng Phet Province Weerawan Jangmo1, Suphachokchai Nanthasri2, Phathooramphai Praphatsorn3, Pimnara Banjong4 and Wanassanan Nutchanart5 Office of Arts and Culture Rajabhat University Kamphaeng Phet 1E-mail : [email protected], Telephone Number : 080-9829596 Introduction The Tai Dam or Thai Song Damis a group of Tai people who originally settle in Sip Song Chu Tai or the Black and Red River Basin in Northern Vietnam, which is the original residence of the Tai Dam and Tai Khao which means White Tai.At present, Sip Song Chu Taiis Vietnam's DienBien Phu province, which has a border with Laos, Phongsali. Nowadays, the name people in Dien Bien Phu province call themselves is "Tai Dam" during the period when France came to rule Vietnam and Laos.They called tribes in the Black River basin Tai Dam. It’s not becausethe Tai Damlived in the Black River Basin, but they preferred to their unique black clothing dyed with the Hom (Baphicacanthus cusia) or Indigo (Indigofera tinctoria). The so-called "Lao Song", in fact, their race is not Laos. The reason for the name is due to the migration from Laos down to Siam. The term "Song people" or "Tai Song Dam" is more acceptable. For instance, a group of people in Phetchaburi is called "Song" or "Thai Song Dam" Picture 1 The province of Vietnam or Laos is the residence to Tai Dam people with Thaeng as a capital. -
A Comparative Study of Shan and Standard Thai Morphology
A COMPARATIVE STUDY OF SHAN AND STANDARD THAI MORPHOLOGY Kittisara A Thesis Submitted in Partial Fulfilment of the Requirements for the Degree of Master of Arts (Linguistics) Graduate School Mahachulalongkornrajavidayalaya University C.E. 2018 A Comparative Study of Shan and Standard Thai Morphology Kittisara A Thesis Submitted in Partial Fulfilment of the Requirements for the Degree of Master of Arts (Linguistics) Graduate School Mahachulalongkornrajavidayalaya University C.E. 2018 (Copyright by Mahachulalongkornrajavidyalaya University) i Thesis Title : A Comparative Study of Shan and Standard Thai Morphology Researcher : Kittisara Degree : Master of Arts in Linguistics Thesis Supervisory Committee : Assoc. Prof. Nilratana Klinchan B.A. (English), M.A. (Political Science) : Asst. Prof. Dr. Phramaha Suriya Varamedhi B.A. (Philosophy), M.A. (Linguistics), Ph.D. (Linguistics) Date of Graduation : March 19, 2019 Abstract The purpose of this research is to explore the comparative study of Shan and standard Thai Morphology. The objectives of the study are classified into three parts as the following; (1) To study morpheme of Shan and standard Thai, (2) To study the word-formation of Shan and standard Thai and (3) To compare the morpheme and word-classes of Shan and standard Thai. This research is the qualitative research. The population referred to this research, researcher selects Shan people who were born at Tachileik in Shan state consisting of 6 persons. Area of research is Shan people at Tachileik in Shan state union of Myanmar. Research method, the tool used in the research, the researcher makes interview and document research. The main important parts in this study based on content analysis as documentary research by selecting primary sources from the books, academic books, Shan dictionary, Thai dictionary, library, online research and the research studied from informants' native speakers for 6 persons. -
University of California Santa Cruz NO SOMOS ANIMALES
University of California Santa Cruz NO SOMOS ANIMALES: INDIGENOUS SURVIVAL AND PERSEVERANCE IN 19TH CENTURY SANTA CRUZ, CALIFORNIA A dissertation submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in HISTORY with emphases in AMERICAN STUDIES and LATIN AMERICAN & LATINO STUDIES by Martin Adam Rizzo September 2016 The Dissertation of Martin Adam Rizzo is approved: ________________________________ Professor Lisbeth Haas, Chair _________________________________ Professor Amy Lonetree _________________________________ Professor Matthew D. O’Hara ________________________________ Tyrus Miller Vice Provost and Dean of Graduate Studies Copyright ©by Martin Adam Rizzo 2016 Table of Contents List of Figures iv Abstract vii Acknowledgments ix Introduction 1 Chapter 1: “First were taken the children, and then the parents followed” 24 Chapter 2: “The diverse nations within the mission” 98 Chapter 3: “We are not animals” 165 Chapter 4: Captain Coleto and the Rise of the Yokuts 215 Chapter 5: ”Not finding anything else to appropriate...” 261 Chapter 6: “They won’t try to kill you if they think you’re already dead” 310 Conclusion 370 Appendix A: Indigenous Names 388 Bibliography 398 iii List of Figures 1.1: Indigenous tribal territories 33 1.2: Contemporary satellite view 36 1.3: Total number baptized by tribe 46 1.4: Approximation of Santa Cruz mountain tribal territories 48 1.5: Livestock reported near Mission Santa Cruz 75 1.6: Agricultural yields at Mission Santa Cruz by year 76 1.7: Baptisms by month, through -
Finite-State Script Normalization and Processing Utilities: the Nisaba Brahmic Library
Finite-state script normalization and processing utilities: The Nisaba Brahmic library Cibu Johny† Lawrence Wolf-Sonkin‡ Alexander Gutkin† Brian Roark‡ Google Research †United Kingdom and ‡United States {cibu,wolfsonkin,agutkin,roark}@google.com Abstract In addition to such normalization issues, some scripts also have well-formedness constraints, i.e., This paper presents an open-source library for efficient low-level processing of ten ma- not all strings of Unicode characters from a single jor South Asian Brahmic scripts. The library script correspond to a valid (i.e., legible) grapheme provides a flexible and extensible framework sequence in the script. Such constraints do not ap- for supporting crucial operations on Brahmic ply in the basic Latin alphabet, where any permuta- scripts, such as NFC, visual normalization, tion of letters can be rendered as a valid string (e.g., reversible transliteration, and validity checks, for use as an acronym). The Brahmic family of implemented in Python within a finite-state scripts, however, including the Devanagari script transducer formalism. We survey some com- mon Brahmic script issues that may adversely used to write Hindi, Marathi and many other South affect the performance of downstream NLP Asian languages, do have such constraints. These tasks, and provide the rationale for finite-state scripts are alphasyllabaries, meaning that they are design and system implementation details. structured around orthographic syllables (aksara)̣ as the basic unit.1 One or more Unicode characters 1 Introduction combine when rendering one of thousands of leg- The Unicode Standard separates the representation ible aksara,̣ but many combinations do not corre- of text from its specific graphical rendering: text spond to any aksara.̣ Given a token in these scripts, is encoded as a sequence of characters, which, at one may want to (a) normalize it to a canonical presentation time are then collectively rendered form; and (b) check whether it is a well-formed into the appropriate sequence of glyphs for display. -
Department of Social Development and Welfare Ministry of Social
OCT SEP NOV AUG DEC JUL JAN JUN FEB MAY MAR APR Department of Social Development and Welfare Ministry of Social Development and Human Security ISBN 978-616-331-053-8 Annual Report 2015 y t M i r i u n c is e t S ry n o a f m So Hu ci d al D an evelopment Department of Social Development and Welfare Annual Report 2015 Department of Social Development and Welfare Ministry of Social Development and Human Security Annual Report 2015 2015 Preface The Annual Report for the fiscal year 2015 was prepared with the aim to disseminate information and keep the general public informed about the achievements the Department of Social Development and Welfare, Ministry of Social Development and Human Security had made. The department has an important mission which is to render services relating to social welfare, social work and the promotion and support given to local communities/authorities to encourage them to be involved in the social welfare service providing.The aim was to ensure that the target groups could develop the capacity to lead their life and become self-reliant. In addition to capacity building of the target groups, services or activities by the department were also geared towards reducing social inequality within society. The implementation of activities or rendering of services proceeded under the policy which was stemmed from the key concept of participation by all concerned parties in brainstorming, implementing and sharing of responsibility. Social development was carried out in accordance with the 4 strategic issues: upgrading the system of providing quality social development and welfare services, enhancing the capacity of the target population to be well-prepared for emerging changes, promoting an integrated approach and enhancing the capacity of quality networks, and developing the organization management towards becoming a learning organization. -
THAI-YUNNAN PROJECT BULLETIN NUMBER 7 MARCH 2005 Editorial
THAI-YUNNAN PROJECT BULLETIN NUMBER 7 MARCH 2005 Editorial In the seventh issue of this online Bulletin we carry a fascinating continuation of Wasan Panyagaew’s travels (see Bulletin no. 5, 2003) among the diasporic Lue of the upper-Mekong borderlands. Nicholas Farrelly takes us even further afi eld with a stimulating and critical account of the way in which Thai scholars have approached the Tai groups of northeastern India. This builds on his previous critique of Thai writing on the Shan, which was submitted as an Honours Thesis at the Australian National University in 2003. And Runako Samata, in an extract from her Master’s Thesis at Chiang Mai University, gives a detailed account of some aspects of cabbage production by the Karen of Thailand, pointing to close relations with neighbouring groups and complexifying the often trite identifi cations of commercial production with non-Karen peoples. For recent discussions of this important issue see, for example, Yos Santasombat’s “Karen Cultural Capital and the political ecology of symbolic power” in Asian Ethnicity 5:1, 2004 and Andrew Walker’s comment Phra Upakhut, Wat Upakhut, Chiang Mai. Andrew Walker in the subsequent issue; Pinkaew Laungaramsri’s It is a belief that Phra Uppakut walks the streets of Chiangmai important critique of Thai forest policy, Redefi ning on the full moon of the ninth month as a monk seeking alms. The fi rst one to make an offering is blessed with good fortune. Nature (Earthworm Books 2001); Claudio Delang’s There is a story in Chiangmai that the future Luang Anusarn edited collection, Living at the edge of Thai society had walked overland to Chiangmai from China and was earn- (RoutledgeCurzon, 2003) and Yoko Hayami’s ing his living as a pedlar on the streets of Chiangmai. -
Sino-Tibetan Languages 393
Sino-Tibetan Languages 393 Gair J W (1998). Studies in South Asian linguistics: Sinhala Government Press. [Reprinted Sri Lanka Sahitya and other South Asian languages. Oxford: Oxford Uni- Mandalaya, Colombo: 1962.] versity Press. Karunatillake W S (1992). An introduction to spoken Sin- Gair J W & Karunatillake W S (1974). Literary Sinhala. hala. Colombo: Gunasena. Ithaca, NY: Cornell University South Asia Program. Karunatillake W S (2001). Historical phonology of Sinha- Gair J W & Karunatillake W S (1976). Literary Sinhala lese: from old Indo-Aryan to the 14th century AD. inflected forms: a synopsis with a transliteration guide to Colombo: S. Godage and Brothers. Sinhala script. Ithaca, NY: Cornell University South Asia Macdougall B G (1979). Sinhala: basic course. Program. Washington D.C.: Foreign Service Institute, Department Gair J W & Paolillo J C (1997). Sinhala (Languages of the of State. world/materials 34). Mu¨ nchen: Lincom. Matzel K & Jayawardena-Moser P (2001). Singhalesisch: Gair J W, Karunatillake W S & Paolillo J C (1987). Read- Eine Einfu¨ hrung. Wiesbaden: Harrassowitz. ings in colloquial Sinhala. Ithaca, NY: Cornell University Reynolds C H B (ed.) (1970). An anthology of Sinhalese South Asia Program. literature up to 1815. London: George Allen and Unwin Geiger W (1938). A grammar of the Sinhalese language. (English translations). Colombo: Royal Asiatic Society. Reynolds C H B (ed.) (1987). An anthology of Sinhalese Godakumbura C E (1955). Sinhalese literature. Colombo: literature of the twentieth century. Woodchurch, Kent: Colombo Apothecaries Ltd. Paul Norbury/Unesco (English translations). Gunasekara A M (1891). A grammar of the Sinhalese Reynolds C H B (1995). Sinhalese: an introductory course language.