UNIVERSITY of CALIFORNIA Los Angeles Word Prosody And
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The Longest Words Using the Fewest Letters
The Longest Words Using The Fewest Letters David M. Rheingold / Barnstable, Massachusetts Austin Byl, Evan Crouch, Elizabeth Krodel, Celia Schiffman, Nico Toutenhoofd / Boulder, Colorado Claude Shannon may be remembered as the father of information theory for the formula engraved on his tombstone, but it’s not the only measure of entropy he helped devise. Metric entropy gauges the amount of information proportionate to the number of unique characters in a message. Within the English language, the maximum metric entropy individual words can attain is 0.5283 binary digits (or bits). This is the value of any three-letter word composed of three different letters (e.g., WHO). Multiplying 0.5283 times the number of distinct letters equals 1.58, and 2 raised to the power of 1.58 yields the same number of letters. These values hold true for any trigram with three different letters in a 26-letter alphabet. Measurements differ for written text involving other characters including numbers, punctuation, even spaces. Distinguishing between uppercase and lowercase would double the number of letters to 52. Adjusting for word frequency would further alter values as trigrams like WHO occur far more often in written English than, say, KEY. (How much more often? We found 3,598,284 instances of WHO in Project Gutenberg versus 58,863 of KEY, or 61 times as many.) Our lexicon derives from three primary sources: Merriam-Webster, Webster’s, and the Moby II wordlist, with a combined 382,843 words. Of these we find some 2,000 trigrams made up of three different letters, including WHO and KEY, tied for highest metric entropy. -
De Sousa Sinitic MSEA
THE FAR SOUTHERN SINITIC LANGUAGES AS PART OF MAINLAND SOUTHEAST ASIA (DRAFT: for MPI MSEA workshop. 21st November 2012 version.) Hilário de Sousa ERC project SINOTYPE — École des hautes études en sciences sociales [email protected]; [email protected] Within the Mainland Southeast Asian (MSEA) linguistic area (e.g. Matisoff 2003; Bisang 2006; Enfield 2005, 2011), some languages are said to be in the core of the language area, while others are said to be periphery. In the core are Mon-Khmer languages like Vietnamese and Khmer, and Kra-Dai languages like Lao and Thai. The core languages generally have: – Lexical tonal and/or phonational contrasts (except that most Khmer dialects lost their phonational contrasts; languages which are primarily tonal often have five or more tonemes); – Analytic morphological profile with many sesquisyllabic or monosyllabic words; – Strong left-headedness, including prepositions and SVO word order. The Sino-Tibetan languages, like Burmese and Mandarin, are said to be periphery to the MSEA linguistic area. The periphery languages have fewer traits that are typical to MSEA. For instance, Burmese is SOV and right-headed in general, but it has some left-headed traits like post-nominal adjectives (‘stative verbs’) and numerals. Mandarin is SVO and has prepositions, but it is otherwise strongly right-headed. These two languages also have fewer lexical tones. This paper aims at discussing some of the phonological and word order typological traits amongst the Sinitic languages, and comparing them with the MSEA typological canon. While none of the Sinitic languages could be considered to be in the core of the MSEA language area, the Far Southern Sinitic languages, namely Yuè, Pínghuà, the Sinitic dialects of Hǎinán and Léizhōu, and perhaps also Hakka in Guǎngdōng (largely corresponding to Chappell (2012, in press)’s ‘Southern Zone’) are less ‘fringe’ than the other Sinitic languages from the point of view of the MSEA linguistic area. -
Set Reading Benchmarks in South Africa
SETTING READING BENCHMARKS IN SOUTH AFRICA October 2020 PHOTO: SCHOOL LIBRARY, CREDIT: PIXABAY Contract Number: 72067418D00001, Order Number: 72067419F00007 THIS PUBLICATION WAS PRODUCED AT THE REQUEST OF THE UNITED STATES AGENCY FOR INTERNATIONAL DEVELOPMENT. IT WAS PREPARED INDEPENDENTLY BY KHULISA MANAGEMENT SERVICES, (PTY) LTD. THE AUTHOR’S VIEWS EXPRESSED IN THIS PUBLICATION DO NOT NECESSARILY REFLECT THE VIEWS OF THE UNITED STATES AGENCY FOR INTERNATIONAL DEVELOPMENT OR THE UNITED STATES GOVERNMENT. ISBN: 978-1-4315-3411-1 All rights reserved. You may copy material from this publication for use in non-profit education programmes if you acknowledge the source. Contract Number: 72067418D00001, Order Number: 72067419F00007 SETTING READING BENCHMARKS IN SOUTH AFRICA October 2020 AUTHORS Matthew Jukes (Research Consultant) Elizabeth Pretorius (Reading Consultant) Maxine Schaefer (Reading Consultant) Katharine Tjasink (Project Manager, Senior) Margaret Roper (Deputy Director, Khulisa) Jennifer Bisgard (Director, Khulisa) Nokuthula Mabhena (Data Visualisation, Khulisa) CONTACT DETAILS Margaret Roper 26 7th Avenue Parktown North Johannesburg, 2196 Telephone: 011-447-6464 Email: [email protected] Web Address: www.khulisa.com CONTRACTUAL INFORMATION Khulisa Management Services Pty Ltd, (Khulisa) produced this Report for the United States Agency for International Development (USAID) under its Practical Education Research for Optimal Reading and Management: Analyze, Collaborate, Evaluate (PERFORMANCE) Indefinite Delivery Indefinite Quantity -
Cross-Language Information Retrieval
Cross-Language Information Retrieval MC_Labosky_FM.indd i Achorn International 03/11/2010 10:17AM ii SynthesisOne liner Lectures Chapter in TitleHuman Language Technologies Editor Graeme Hirst, University of Toronto Synthesis Lectures on Human Language Technologies publishes monographs on topics relat- ing to natural language processing, computational linguistics, information retrieval, and spoken language understanding. Emphasis is placed on important new techniques, on new applica- tions, and on topics that combine two or more HLT subfields. Cross-Language Information Retrieval Jian-Yun Nie 2010 Data-Intensive Text Processing with MapReduce Jimmy Lin, Chris Dyer 2010 Semantic Role Labeling Martha Palmer, Daniel Gildea, Nianwen Xue 2010 Spoken Dialogue Systems Kristiina Jokinen, Michael McTear 2010 Introduction to Chinese Natural Language Processing Kam-Fai Wong, Wenji Li, Ruifeng Xu, Zheng-sheng Zhang 2009 Introduction to Linguistic Annotation and Text Analytics Graham Wilcock 2009 MC_Labosky_FM.indd ii Achorn International 03/11/2010 10:17AM MC_Labosky_FM.indd iii Achorn International 03/11/2010 10:17AM SYNTHESIS LESCTURES IN HUMAN LANGUAGE TECHNOLOGIES iii Dependency Parsing Sandra Kübler, Ryan McDonald, Joakim Nivre 2009 Statistical Language Models for Information Retrieval ChengXiang Zhai 2008 MC_Labosky_FM.indd ii Achorn International 03/11/2010 10:17AM MC_Labosky_FM.indd iii Achorn International 03/11/2010 10:17AM Copyright © 2010 by Morgan & Claypool All rights reserved. No part of this publication may be reproduced, stored in -
World Directory of Minorities
World Directory of Minorities Asia and Oceania MRG Directory –> Myanmar/Burma –> Karen Print Page Close Window Karen Profile The term ‘Karen’ actually refers to a number of ethnic groups with Tibetan-Central Asian origins who speak 12 related but mutually unintelligible languages (‘Karenic languages’) that are part of the Tibeto- Burman group of the Sino-Tibetan family. Around 85 per cent of Karen belong either to the S’ghaw language branch, and are mostly Christian and animist living in the hills, or the Pwo section and are mostly Buddhists. The vast majority of Karen are Buddhists (probably over two-thirds), although large numbers converted to Christianity during British rule and are thought to constitute about 30 per cent among the Karen. The group encompasses a great variety of ethnic groups, such as the Karenni, Padaung (also known by some as ‘long-necks’ because of the brass coils worn by women that appear to result in the elongation of their necks), Bghai, Brek, etc. There are no reliable population figures available regarding their total numbers in Burma: a US State Department estimate for 2007 suggests there may be over 3.2 million living in the eastern border region of the country, especially in Karen State, Tenasserim Division, eastern Pegu Division, Mon State and the Irrawaddy Division. Historical context While there is some uncertainty as to the origins of the various Karen peoples, it is thought that their migration may have occurred after the arrival of the Mon – who are thought to have established themselves in the region before the start of the first millennium BCE – but before that of the Burmans sometime before the ninth century. -
Sumi Tone: a Phonological and Phonetic Description of a Tibeto-Burman Language of Nagaland
Sumi tone: a phonological and phonetic description of a Tibeto-Burman language of Nagaland Amos Benjamin Teo Submitted in total fulfilment of the requirements of the degree of Masters by Research (by Thesis Only) December 2009 School of Languages and Linguistics The University of Melbourne Abstract Previous research on Sumi, a Tibeto-Burman language spoken in the extreme northeast of India, has found it to have three lexical tones. However, the few phonological studies of Sumi have focused mainly on its segmental phonology and have failed to provide any substantial account of the tone system. This thesis addresses the issue by providing the first comprehensive description of tone in this language. In addition to confirming three contrastive tones, this study also presents the first acoustic phonetic analysis of Sumi, looking at the phonetic realisation of these tones and the effects of segmental perturbations on tone realisation. The first autosegmental representation of Sumi tone is offered, allowing us to account for tonal phenomena such as the assignment of surface tones to prefixes that appear to be lexically unspecified for tone. Finally, this investigation presents the first account of morphologically conditioned tone variation in Sumi, finding regular paradigmatic shifts in the tone on verb roots that undergo nominalisation. The thesis also offers a cross-linguistic comparison of the tone system of Sumi with that of other closely related Kuki-Chin-Naga languages and some preliminary observations of the historical origin and development of tone in these languages are made. This is accompanied by a typological comparison of these languages with other Tibeto-Burman languages, which shows that although these languages are spoken in what has been termed the ‘Indosphere’, their tone systems are similar to those of languages spoken further to the east in the ‘Sinosphere’. -
Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems
Sino-Tibetan numeral systems: prefixes, protoforms and problems Matisoff, J.A. Sino-Tibetan Numeral Systems: Prefixes, Protoforms and Problems. B-114, xii + 147 pages. Pacific Linguistics, The Australian National University, 1997. DOI:10.15144/PL-B114.cover ©1997 Pacific Linguistics and/or the author(s). Online edition licensed 2015 CC BY-SA 4.0, with permission of PL. A sealang.net/CRCL initiative. PACIFIC LINGUISTICS FOUNDING EDITOR: Stephen A. Wunn EDITORIAL BOARD: Malcolm D. Ross and Darrell T. Tryon (Managing Editors), Thomas E. Dutton, Nikolaus P. Himmelmann, Andrew K. Pawley Pacific Linguistics is a publisher specialising in linguistic descriptions, dictionaries, atlases and other material on languages of the Pacific, the Philippines, Indonesia and southeast Asia. The authors and editors of Pacific Linguistics publications are drawn from a wide range of institutions around the world. Pacific Linguistics is associated with the Research School of Pacific and Asian Studies at the Australian National University. Pacific Linguistics was established in 1963 through an initial grant from the Hunter Douglas Fund. It is a non-profit-making body financed largely from the sales of its books to libraries and individuals throughout the world, with some assistance from the School. The Editorial Board of Pacific Linguistics is made up of the academic staff of the School's Department of Linguistics. The Board also appoints a body of editorial advisors drawn from the international community of linguists. Publications in Series A, B and C and textbooks in Series D are refereed by scholars with re levant expertise who are normally not members of the editorial board. -
A Supplementary Topical Index
179 A SUPPLEMENTARY TOPICAL INDEX The following index has been designed to· help the reader locate ~. Sa ntiago specific types of wordplay published in 26 issues of Word Ways ;aint Louis from February 1978 through May 1984; it updates a similar index 12. Lagos, for 40 issues of Word Ways from February 1968 through November' Bucharest 1977 appearing in the February 1978 issue. Both indices use the 19. Lenin same format: a logological core consisting of (1) letter-patterns. :>an Antonio in words, (2) operations upon letters in words, and (3) relation-. 27. Wash ships between letters and sounds, and a periphery (the intersection r, Cremona of logology with other branches of wordplay) consisting of (1) lit Copenhagen erary wordplay and games, (2) academic language studies, and ~. spelling (3) word games and puzzles, Wordplay involving special sets of ::rlin, West words (presidents, statenames, -cide words, etc.) is separately indexed. Each article is cited by year and page: thus, 79-123 directs the reader to page 123 of the 1979 volume, For 'articles on a com up 6. tart mon topic pu blished the same yea r, the year is omitted but the l1. lap up page retained, as 80-23, 141, 212. Citations in parenthesis denote ). ham up corrections or follow-up material. The letters q, f, P or b follow-· wrap up ing a citation identifies it as a special format: a quiz, a fictional 26. butter or humorous article, a poem, or a book or journal review. divvy up 36. stuck I. DEFINITIONS, SOURCES OF WORDS ed up 41. -
Phonetics of Sgaw Karen in Thailand
PHONETICS OF SGAW KAREN IN THAILAND: AN ACOUSTIC DESCRIPTION PONGPRAPUNT RATTANAPORN MASTER OF ARTS IN ENGLISH THE GRADUATE SCHOOL CHIANG MAI UNIVERSITY SEPTEMBER 2012 PHONETICS OF SGAW KAREN IN THAILAND: AN ACOUSTIC DESCRIPTION PONGPRAPUNT RATTANAPORN AN INDEPENDENT STUDY SUBMITTED TO THE GRADUATE SCHOOL IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ENGLISH THE GRADUATE SCHOOL CHIANG MAI UNIVERSITY SEPTEMBER 2012 PHONETICS OF SGAW KAREN IN THAILAND: AN ACOUSTIC DESCRIPTION PONGPRAPUNT RATTANAPORN THIS INDEPENDENT STUDY HAS BEEN APPROVED TO BE A PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ENGLISH 26 September 2012 © Copyright by Chiang Mai University iii ACKNOWLEDGEMENT I would like to express my gratitude to everybody who contributed to this research. Firstly, my deepest thanks go to both of my supervisors, Dr. Preeya Nokaew (Chiang Mai University) and Dr. Paul Sidwell (Australian National University), for their excellent support from when I first started until I finished this research. Secondly, my thanks go to the examining committee, Dr. Peter Freeouf and Dr. Sudarat Hatfield, for their useful comments. Thirdly, I would like to express my thanks to Assoc. Prof. Suphanee Jancamai and Asst. Prof. Panit Boonyavatana and all the teachers in the graduate program. Also a special thanks to Dr. Phinnarat Akharawatthanakun (Payap University) for her valuable advice. This research would not have been successful without the assistance of the Dujeto’s, Mr. Thaworn Kamponkun and all the informants and my friends Ey, Sprite, Mike, Chris, Darwin, Oh, Jay, Rose and Yrrah. Last but not least, I appreciate everybody in my family for their love, encouragement and financial support. -
Refugees from Burma Acknowledgments
Culture Profile No. 21 June 2007 Their Backgrounds and Refugee Experiences Writers: Sandy Barron, John Okell, Saw Myat Yin, Kenneth VanBik, Arthur Swain, Emma Larkin, Anna J. Allott, and Kirsten Ewers RefugeesEditors: Donald A. Ranard and Sandy Barron From Burma Published by the Center for Applied Linguistics Cultural Orientation Resource Center Center for Applied Linguistics 4646 40th Street, NW Washington, DC 20016-1859 Tel. (202) 362-0700 Fax (202) 363-7204 http://www.culturalorientation.net http://www.cal.org The contents of this profile were developed with funding from the Bureau of Population, Refugees, and Migration, United States Department of State, but do not necessarily rep- resent the policy of that agency and the reader should not assume endorsement by the federal government. This profile was published by the Center for Applied Linguistics (CAL), but the opinions expressed herein do not necessarily reflect positions or policies of CAL. Production supervision: Sanja Bebic Editing: Donald A. Ranard Copyediting: Jeannie Rennie Cover: Burmese Pagoda. Oil painting. Private collection, Bangkok. Design, illustration, production: SAGARTdesign, 2007 ©2007 by the Center for Applied Linguistics The U.S. Department of State reserves a royalty-free, nonexclusive, and irrevocable right to reproduce, publish, or otherwise use, and to authorize others to use, the work for Government purposes. All other rights reserved. No part of this book may be reproduced, in any form or by any means, without permission in writing from the publisher. All inquiries should be addressed to the Cultural Orientation Resource Center, Center for Applied Linguistics, 4646 40th Street, N.W., Washington, D.C. 20016. -
Languages of Myanmar
Ethnologue report for Myanmar Page 1 of 20 Languages of Myanmar [See also SIL publications on the languages of Myanmar.] Union of Myanmar, Pyeidaungzu Myanma Naingngandaw. Formerly Burma. 42,720,196. Speakers of Tibeto-Burman languages: 28,877,000 or 78% of the population, Daic languages 2,778,900 or 9.6%, Austro- Asiatic languages 1,934,900 or 6.7%, Hmong-Mien languages 6,000 (1991 J. Matisoff). National or official language: Burmese. Literacy rate: 66% to 78%; 78.5% over 15 years old (1991). Also includes Eastern Tamang, Geman Deng, Iu Mien, Malay (21,000), Sylheti, Chinese (1,015,000), people from Bangladesh and India (500,000). Information mainly from F. Lebar, G. Hickey, J. Musgrave 1964; A. Hale 1982; B. Comrie 1987; R. B. Jones 1988; J. Matisoff et al. 1996; D. Bradley 1997; R. Burling ms. (1998). Blind population: 214,440. Deaf population: 2,684,514. Deaf institutions: 1. The number of languages listed for Myanmar is 109. Of those, 108 are living languages and 1 is extinct. Living languages Achang [acn] 1,700 in Myanmar (1983). West of the Irrawaddy River in Katha District, near Banmauk, scattered among the Lashi. Along the China border. Alternate names: Anchan, Chung, Atsang, Acang, Ngac'ang, Ngachang, Ngochang, Mönghsa, Tai Sa'. Dialects: Maingtha. Classification: Sino-Tibetan, Tibeto- Burman, Lolo-Burmese, Burmish, Northern More information. Akha [ahk] 200,000 in Myanmar (1991 UBS). Population total all countries: 449,261. Eastern part of Kengtung Shan State. Also spoken in China, Laos, Thailand, Viet Nam. Alternate names: Kaw, Ekaw, Ko, Aka, Ikaw, Ak'a, Ahka, Khako, Kha Ko, Khao Kha Ko, Ikor, Aini, Yani. -
A Basic Vocabulary of Htoklibang Pwo Karen with Hpa-An, Kyonbyaw, and Proto-Pwo Karen Forms
Asian and African Languages and Linguistics No.4, 2009 A Basic Vocabulary of Htoklibang Pwo Karen with Hpa-an, Kyonbyaw, and Proto-Pwo Karen Forms Kato, Atsuhiko (Osaka University) Htoklibang Pwo Karen is one of the dialects of Pwo Karen. In this dialect, the two Proto- Pwo tones that come from Tone1 of Proto-Karen have merged. Such a dialect has not been reported so far. This paper presents the phonemic system and a basic vocabulary of this dialect. For each vocabulary item, its corresponding forms in the Hpa-an dialect, Kyonbyaw dialect, and Proto-Pwo Karen are also shown. Keywords: Pwo Karen, Karenic languages, Tibeto-Burman 1. Introduction 2. Htoklibang Pwo Karen 3. Notes to the basic vocabulary 4. Basic vocabulary of Htoklibang Pwo Karen 5. Summary 1. Introduction I read the following from Man Lin Myat Kyaw (1970:22-23) and developed an inter- est in the language spoken by the people called the Htoklibang: There are people who speak mixing Pwo Karen and Sgaw Karen in the villages of Thahton prefecture such as Winpa, Kyaikkaw, Danu, Kawyin and in the villages of Bilin township such as Wabyugon, Kyibin, Inh- palaung, Alugyi, Alule, Nyaungthaya, Shin-in, Shatthwa, Winbyan, Phowathein, Thaybyugyaun. They are called Htoklibang by the other eastern Karens. [Man Lin Myat Kyaw 1970:22-23; in Burmese] (underlined by Kato) In December 2007, I visited the village of Alule (/Pa˘l´ul´e/ in Burmese), located in the Bilin Township, Mon State, Myanmar (Burma), and researched the language used there. The exact location of the Alule village is 17◦15’N, 97◦09’E (see Map 1).