Design of a Cloud Service for Learning Chinese Pronunciation
Recommended publications
-
Background I. Names
Background I. Names 1. China It used to be thought that the name ‘China’ derived from the name of China’s early Qin dynasty (Chin or Ch’in in older transcriptions), whose rulers conquered all rivals and initiated the dynasty in 221 BC. But, as Wilkinson notes (Chinese History: A Manual: 753, and fn 7), the original pronunciation of the name ‘Qin’ was rather different, and would make it an unlikely source for the name China. Instead, China is thought to derive from a Persian root, and was, apparently, first used for porcelain, and only later applied to the country from which the finest examples of that material came. Another name, Cathay, now rather poetic in English, but surviving as the regular name for the country in languages such as Russian (Kitai), is said to derive from the name of the Khitan Tartars, who formed the Liao dynasty in north China in the 10th century. The Khitan dynasty was the first to make a capital on the site of Beijing. The Chinese now call their country Zhōngguó, often translated as ‘Middle Kingdom’. Originally, this name meant the central – or royal – state of the many that occupied the region prior to the unification of Qin. Other names were used before Zhōngguó became current. One of the earliest was Huá (or Huáxià, combining Huá with the name of the earliest dynasty, the Xià). Huá, in fact, combined with the Zhōng of Zhōngguó, appears in the modern official names of the country, as we see below. 2. Chinese places a) The People’s Republic of China (PRC) [Zhōnghuá Rénmín Gònghéguó] This is the political entity proclaimed by Máo Zédōng when he gave his speech (‘China has risen again’) at the Gate of Heavenly Peace [Tiān’ān Mén] in Beijing on October 1, 1949. -
Building a Collation Element Table for a Large Chinese Character Set in YES
Building a Collation Element Table for a Large Chinese Character Set in YES Xiaoheng Zhang1 and Xiaotong Li2 1 Department of Chinese and Bilingual Studies, Hong Kong Polytechnic University [email protected] 2 College of International Exchange, Shenzhen University [email protected] Abstract. YES is a simplified stroke-based method for sorting Chinese characters. It is free from stroke counting and grouping, and thus much faster and more accurate than the traditional method. This paper presents a collation element table built in YES for a large joint Chinese character set covering (a) all 20,902 characters of Unicode CJK Unified Ideographs, (b) all 11,408 characters in the Complete List of Chinese Characters Used by the Media in 2013, (c) all 13,000-plus characters in the latest versions of Xinhua Dictionary (v11) and Contemporary Chinese Dictionary (v6). Of the 20,902 Chinese characters in Unicode, 97.23% have a one-to-one relationship with their stroke order codes in YES, compared with 90.69% under the traditional method. Enhanced with the secondary and tertiary sorting levels of stroke layout and Unicode value, there is a guarantee of a one-to-one relationship between the characters and collation elements. The collation element table has been successfully applied to sorting CC-CEDICT, a Chinese-English dictionary of over 112,000 word entries. Keywords: Chinese characters, collation, Unicode, YES 1 Introduction Collation, or determining the sorting order of strings of characters, is a key function in computer systems and a basic need of human society. Whenever a list of texts—such as the records of a database and the entries of a dictionary—is presented to users, they are likely to want it in a sorted order so that they can easily and reliably find their target words. -
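The three sorting levels described in the abstract above (stroke order code, then stroke layout, then Unicode value) can be sketched as a tuple-valued sort key. This is only a minimal illustration, not the authors' implementation: `TOY_TABLE`, `collation_key` and `sort_chars` are hypothetical names, and the primary and secondary values are invented placeholders rather than real YES codes. The point it shows is that appending the Unicode code point as the last level makes every key unique, which is the one-to-one guarantee the abstract mentions.

```python
# Hypothetical toy table: the primary and secondary values are invented
# placeholders, NOT real YES stroke-order or stroke-layout codes.
TOY_TABLE = {
    "甲": ("b", 1),
    "乙": ("a", 2),
    "丙": ("a", 1),
    "丁": ("a", 1),  # deliberately ties with 丙 on the first two levels
}


def collation_key(ch):
    """Return a three-level key: (primary, secondary, Unicode code point)."""
    primary, secondary = TOY_TABLE[ch]
    return (primary, secondary, ord(ch))


def sort_chars(chars):
    """Sort characters by comparing their three-level keys left to right."""
    return sorted(chars, key=collation_key)


if __name__ == "__main__":
    print(sort_chars("甲乙丙丁"))
```

Because Python compares tuples element by element, the code point is consulted only when the first two levels tie, as happens here for 丙 and 丁.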
Localizing Into Chinese: the Two Most Common Questions Answered (White Paper)
Localizing into Chinese: the two most common questions answered (White Paper) ALPHA CRC LTD, St Andrew’s House, St Andrew’s Road, Cambridge CB4 1DL, United Kingdom. @alpha_crc alphacrc.com Different writing systems, a variety of languages and dialects, political and cultural sensitivities and, of course, the ever-evolving nature of language itself. It’s no wonder that localizing in Chinese can seem complicated to the uninitiated. For a start, there is no single “Chinese” language to localize into. Most Westerners referring to the Chinese language probably mean Mandarin; but you should definitely not assume this as the de facto language for all audiences both within and outside mainland China. To clear up any confusion, we talked to our regional language experts to find out the most definitive and useful answers to two of the most commonly asked questions when localizing into Chinese. 1. What’s the difference between Simplified Chinese and Traditional Chinese? 2. Does localizing into “Chinese” mean localizing into Mandarin, Cantonese or both? Actually, these are really pertinent questions because they get to the heart of some of the linguistic, political and cultural complexities that need to be taken into account when localizing for this region. Because of the important nature of these issues, we’ve gone a little more in depth than some of the articles on related themes elsewhere on the internet. We think you’ll find the answers a useful starting point for any considerations about localizing for the Chinese-language market. And, taking in linguistic nuances and cultural history, we hope you’ll find them an interesting read too. -
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding scheme. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
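The ASCII example in the excerpt above (“A”, “B” and “C” encoded as hexadecimal 41, 42 and 43) can be checked with a short sketch. The helper names `ascii_hex` and `encoded_length` are hypothetical and introduced only for illustration, and UTF-8 is assumed here as one common Unicode encoding form; the excerpt itself does not name one. The second helper illustrates the single-byte versus multi-byte contrast the excerpt draws.

```python
def ascii_hex(ch):
    """Return the code of a single character as two uppercase hex digits."""
    return format(ord(ch), "02X")


def encoded_length(ch, encoding):
    """Return how many bytes one character occupies in the given encoding."""
    return len(ch.encode(encoding))


if __name__ == "__main__":
    for ch in "ABC":
        print(ch, ascii_hex(ch))
    # An ASCII letter takes one byte; a CJK ideograph takes several in UTF-8.
    print(encoded_length("A", "ascii"), encoded_length("中", "utf-8"))
```

The sketch uses only Python's built-in codecs; it does not model MARC8 or EACC, for which no standard-library codec is assumed to exist.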
The Creative Artist: a Journal of Theatre and Media Studies, Vol 13, 2, 2017
The Creative Artist: A Journal of Theatre and Media Studies, Vol 13, 2, 2017 MANDARIN CHINESE PINYIN: PRONUNCIATION, ORTHOGRAPHY AND TONE Sunny Ifeanyi Odinye, PhD Department of Igbo, African and Asian Studies Nnamdi Azikiwe University, Awka Email: [email protected] ABSTRACT Pinyin is the most essential and fundamental knowledge for all serious learners of Mandarin Chinese. Students learning Mandarin Chinese, whether they are beginners or advanced learners, should be aware of the paramount importance of Pinyin. A good knowledge of Pinyin can greatly help students express themselves more clearly; it can also help them achieve better listening and reading comprehension. Before learning how to read, write, or pronounce Chinese characters, students must learn Pinyin first. This study aims to highlight the importance of Pinyin and its usage in Mandarin for any student learning Mandarin Chinese. The research work adopts the descriptive research method. The work is divided into introduction, body and conclusion. 1. INTRODUCTION Pinyin (spelled sounds) has vastly increased literacy throughout China and beyond; eased the classroom agonies of foreigners studying Mandarin Chinese; afforded the blind a way to read the language in Braille; and facilitated the rapid entry of Chinese on computer keyboards and cell phones. "About one billion Chinese citizens have mastered pinyin, which plays an important role in both Chinese language education and international communication," said Wang Dengfeng, vice-chairman of the National Language Committee and director of language department of the Education Ministry. "Hanyu Pinyin not only belongs to China but also belongs to the world. It is now everywhere in our daily lives," Wang said, "Pinyin will continue playing an important role in the modernization of China." (Xinhua News, 2008). -
Sinitic Language and Script in East Asia: Past and Present
SINO-PLATONIC PAPERS Number 264 December, 2016 Sinitic Language and Script in East Asia: Past and Present edited by Victor H. Mair Victor H. Mair, Editor Sino-Platonic Papers Department of East Asian Languages and Civilizations University of Pennsylvania Philadelphia, PA 19104-6305 USA [email protected] www.sino-platonic.org SINO-PLATONIC PAPERS FOUNDED 1986 Editor-in-Chief VICTOR H. MAIR Associate Editors PAULA ROBERTS MARK SWOFFORD ISSN 2157-9679 (print) 2157-9687 (online) SINO-PLATONIC PAPERS is an occasional series dedicated to making available to specialists and the interested public the results of research that, because of its unconventional or controversial nature, might otherwise go unpublished. The editor-in-chief actively encourages younger, not yet well established, scholars and independent authors to submit manuscripts for consideration. Contributions in any of the major scholarly languages of the world, including romanized modern standard Mandarin (MSM) and Japanese, are acceptable. In special circumstances, papers written in one of the Sinitic topolects (fangyan) may be considered for publication. Although the chief focus of Sino-Platonic Papers is on the intercultural relations of China with other peoples, challenging and creative studies on a wide variety of philological subjects will be entertained. This series is not the place for safe, sober, and stodgy presentations. Sino-Platonic Papers prefers lively work that, while taking reasonable risks to advance the field, capitalizes on brilliant new insights into the development of civilization. Submissions are regularly sent out to be refereed, and extensive editorial suggestions for revision may be offered. Sino-Platonic Papers emphasizes substance over form. -
The Twentieth-Century Secularization of the Sinograph in Vietnam, and Its Demotion from the Cosmological to the Aesthetic
Journal of World Literature 1 (2016) 275–293 brill.com/jwl The Twentieth-Century Secularization of the Sinograph in Vietnam, and its Demotion from the Cosmological to the Aesthetic John Duong Phan* Rutgers University [email protected] Abstract This article examines David Damrosch’s notion of “scriptworlds”—spheres of cultural and intellectual transfusion, defined by a shared script—as it pertains to early modern Vietnam’s abandonment of sinographic writing in favor of a latinized alphabet. The Vietnamese case demonstrates a surprisingly rapid readjustment of deeply held attitudes concerning the nature of writing, in the wake of the alphabet’s meteoric successes. The fluidity of “language ethics” in early modern Vietnam (a society that had long since developed vernacular writing out of an earlier experience of diglossic literacy) suggests that the durability of a “scriptworld” depends on the nature and history of literacy in the societies under question. Keywords Vietnamese literature – Vietnamese philology – Chinese philology – script reform – language reform – early modern East Asia/Southeast Asia * I would like to thank Wynn Wilcox, Nguyễn Tuấn Cường, Tuna Artun, and Jack Chia for their valuable help and input. All errors are my own. © Koninklijke Brill NV, Leiden, 2016 | doi: 10.1163/24056480-00102010 Introduction In 1919, the French-installed emperor Khải Định permanently dismantled Vietnam’s civil service examinations, and with them, an entire system of political selection based on proficiency in Literary Sinitic. That same year, using a controversial latinized alphabet called Quốc Ngữ (lit. -
A Comparative Analysis of the Simplification of Chinese Characters in Japan and China
CONTRASTING APPROACHES TO CHINESE CHARACTER REFORM: A COMPARATIVE ANALYSIS OF THE SIMPLIFICATION OF CHINESE CHARACTERS IN JAPAN AND CHINA A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ASIAN STUDIES AUGUST 2012 By Kei Imafuku Thesis Committee: Alexander Vovin, Chairperson Robert Huey Dina Rudolph Yoshimi ACKNOWLEDGEMENTS I would like to express deep gratitude to Alexander Vovin, Robert Huey, and Dina R. Yoshimi for their Japanese and Chinese expertise and kind encouragement throughout the writing of this thesis. Their guidance, as well as the support of the Center for Japanese Studies, School of Pacific and Asian Studies, and the East-West Center, has been invaluable. ABSTRACT Due to the complexity and number of Chinese characters used in Chinese and Japanese, some characters were the target of simplification reforms. However, Japanese and Chinese simplifications frequently differed, resulting in the existence of multiple forms of the same character being used in different places. This study investigates the differences between the Japanese and Chinese simplifications and the effects of the simplification techniques implemented by each side. The more conservative Japanese simplifications were achieved by instating simpler historical character variants while the more radical Chinese simplifications were achieved primarily through the use of whole cursive script forms and phonetic simplification techniques. These techniques, however, have been criticized for their detrimental effects on character recognition, semantic and phonetic clarity, and consistency – issues less present with the Japanese approach. By comparing the Japanese and Chinese simplification techniques, this study seeks to determine the characteristics of more effective, less controversial Chinese character simplifications. -
The Japanese Writing Systems, Script Reforms and the Eradication of the Kanji Writing System: Native Speakers’ Views Lovisa Österman
The Japanese writing systems, script reforms and the eradication of the Kanji writing system: native speakers’ views Lovisa Österman Lund University, Centre for Languages and Literature Bachelor’s Thesis Japanese B.A. Course (JAPK11 Spring term 2018) Supervisor: Shinichiro Ishihara Abstract This study aims to deduce what Japanese native speakers think of the Japanese writing systems, and in particular what native speakers’ opinions are concerning Kanji, the logographic writing system which consists of Chinese characters. The Japanese written language has something that most languages do not; namely a total of three writing systems. First, there is the Kana writing system, which consists of the two syllabaries: Hiragana and Katakana. The two syllabaries essentially function the same way, but are used for different purposes. Secondly, there is the Rōmaji writing system, which is Japanese written using Latin letters. And finally, there is the Kanji writing system. Learning this is often at first an exhausting task, because not only must one learn the two phonematic writing systems (Hiragana and Katakana), but to be able to properly read and write in Japanese, one should also learn how to read and write a great number of logographic signs; namely the Kanji. For example, to be able to read and understand books or newspapers without using any aiding tools such as dictionaries, one would need to have learned the 2136 Jōyō Kanji (regular-use Chinese characters). With the twentieth century’s progress in technology, compared with twenty years ago, in this day and age one could probably theoretically get by alright without knowing how to write Kanji by hand, seeing as we are writing less and less by hand and more by technological devices. -
In This Issue
译风 The ATA Chinese Language Division Newsletter | Winter 2019 | 美国翻译协会中文翻译分会通讯 IN THIS ISSUE: From the Editors 01; Farewell from Our Outgoing Layout Editor 03; A Little Story About Our New Layout Editor 04; Letter from the CLD Administrator 05; My First ATA Conference - More than Just an Escape from the Kids 07; Finding My Niche for a Better Conference Experience. From the Editors This issue of Yifeng witnesses yet more changes as we bid farewell to our layout editor, Tianlu Redmon (贾天璐) and welcome new layout editor Pearl Zheng (郑绍娴) on page 4. Many thanks to both of them for their hard work ensuring that Yifeng is not just interesting to read but also pleasant to look at. ATA annual conferences share much in common, but each offers a unique experience. Read about first-time attendee Jessie Liu’s experience in New Orleans (pg. 7) and how Sean Song (宋俊杰) has improved his conference experience through repeat attendance (pg. 9). If you missed her at this year’s conference, you can also meet CLD member Gigi Yau (尤建之) in our new feature 15 Minutes with June | 译者访谈录, by Junqiao Chen (陈俊巧), on page 12. Since our last issue on the new Chinese-to-English ATA certification exam, CLD member Rony Gao (高嵘) not only obtained the certification but figured out how to do so "in ten hours"! See how he achieved this feat on page 16 and learn some tips for preparing for the certification exam. Finally, in the Resources Roundup on page 19, Yifeng editor Trevor Cook reviews the indispensable Pleco app, and, in Bird's Corner, CLD Administrator Pency Tsai (蔡晓萍) shares a fresh take on adopting translation and interpretation technology on page 22. -
Basis Technology Unicode-Compatible Library Spec Sheet: Character Encoding, Other Names, Adobe-Standard-Encoding A
Basis Technology Unicode-Compatible Library Spec Sheet. Columns: character encoding, other names. Adobe-Standard-Encoding Adobe-Symbol-Encoding csHPPSMath Adobe-Zapf-Dingbats-Encoding csZapfDingbats Arabic ISO-8859-6, csISOLatinArabic, iso-ir-127, ECMA-114, ASMO-708 ASCII US-ASCII, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO646-US, us, IBM367, csASCI big-endian ISO-10646-UCS-2, BigEndian, 68k, PowerPC, Mac, Macintosh Big5 csBig5, cn-big5, x-x-big5 Big5Plus Big5+, csBig5Plus BMP ISO-10646-UCS-2, BMPstring CCSID-1027 csCCSID1027, IBM1027 CCSID-1047 csCCSID1047, IBM1047 CCSID-290 csCCSID290, CCSID290, IBM290 CCSID-300 csCCSID300, CCSID300, IBM300 CCSID-930 csCCSID930, CCSID930, IBM930 CCSID-935 csCCSID935, CCSID935, IBM935 CCSID-937 csCCSID937, CCSID937, IBM937 CCSID-939 csCCSID939, CCSID939, IBM939 CCSID-942 csCCSID942, CCSID942, IBM942 ChineseAutoDetect csChineseAutoDetect: Candidate encodings: GB2312, Big5, GB18030, UTF32:UTF8, UCS2, UTF32 EUC-H, csCNS11643EUC, EUC-TW, TW-EUC, H-EUC, CNS-11643-1992, EUC-H-1992, csCNS11643-1992-EUC, EUC-TW-1992, CNS-11643 TW-EUC-1992, H-EUC-1992 CNS-11643-1986 EUC-H-1986, csCNS11643_1986_EUC, EUC-TW-1986, TW-EUC-1986, H-EUC-1986 CP10000 csCP10000, windows-10000 CP10001 csCP10001, windows-10001 CP10002 csCP10002, windows-10002 CP10003 csCP10003, windows-10003 CP10004 csCP10004, windows-10004 CP10005 csCP10005, windows-10005 CP10006 csCP10006, windows-10006 CP10007 csCP10007, windows-10007 CP10008 csCP10008, windows-10008 CP10010 csCP10010, windows-10010 CP10017 csCP10017, windows-10017 CP10029 csCP10029, windows-10029 CP10079 csCP10079, windows-10079 -
The Chinese Script
Norman, Jerry, Chinese, Cambridge: Cambridge University Press, 1988. 3 The Chinese script 3.1 The beginnings of Chinese writing The Chinese script appears as a fully developed writing system in the late Shang dynasty (fourteenth to eleventh centuries BC). From this period we have copious examples of the script inscribed or written on bones and tortoise shells, for the most part in the form of short divinatory texts. From the same period there also exist a number of inscriptions on bronze vessels of various sorts. The former type of graphic record is referred to as the oracle bone script while the latter is commonly known as the bronze script. The script of this period is already a fully developed writing system, capable of recording the contemporary Chinese language in a complete and unambiguous manner. The maturity of this early script has suggested to many scholars that it must have passed through a fairly long period of development before reaching this stage, but the few examples of writing […] of this sort of graph are shown in Figure 3.1. The more truly representational a graph is, the more difficult and time-consuming it is to depict. There is a natural tendency for such graphs to become progressively simplified and stylized as a writing system matures and becomes more widely used. As a result, pictographs gradually tend to lose their obvious pictorial quality. The graph for quǎn 'dog' shown in Figure 3.2 can serve as a good illustration of this sort of development. [Figure 3.1. Pictographs in early Chinese writing: FISH (yú), HORSE (mǎ), ELEPHANT (xiàng), COW (niú)] [Figure 3.2. The graph for quǎn 'dog']