Instructions for Use. with These Macros You May Enter Any Double-Byte Asian Characters and Make Simple Conversion of Texts for East Asian Languages

Total Page:16

File Type:pdf, Size:1020Kb

Instructions for Use. with These Macros You May Enter Any Double-Byte Asian Characters and Make Simple Conversion of Texts for East Asian Languages CJKV VBA IME for MS Word v 3.2 user’s guide information .. CJKV VBA IME for MS Word version 3.2 by WERTA© 2000–2014 Instructions for use. With these macros you may enter any double-byte Asian characters and make simple conversion of texts for East Asian languages. These macros are includes: Chinese, Japanese, Korean, and Vietnamese units in the dot- format files (MS Word templates). Technical requirements For properly work of these macros you need the MS Word software installed on your PC with MS Office version not 2000 earlier. These macros have no digital signature, so after installation, you need change to low security of your MS Word for work with VBA-macros, for example: (Tools => Macro => Security => Low). After run of MS Word or after open any of the dot-files, you will see a new toolbars in MS Word application. The binary files and conversion tables program path is fixed and determined as C:\ CJKV_VBA. Created by: WERTA©, 2000-2014 1 CJKV VBA IME for MS Word v 3.2 user’s guide information .. 1. Chinese Input Methods for MS Word. 1.1 List of IME Format Description: A set of characters (Ex. / Trad.) Input method. Input methods (character set) for Simplified Chinese (GB2312): Phonetic (Mandarin) GuoBiao (JianTi) PinYin - standard Pinyin with using digitals for tones and input letter «v» instead of «ü». Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is Latin letters. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it). For the Pinyin IMEs (GB and Big5) few variants of Latinized Chinese transcription systems were added as a button control for a switching. Five transcription systems variants were included for Standard Chinese phonemes text input. By the cyclic switching of control button at the right side of text edit field you can choice appropriate transcription system what you need. After each clicking the color of button will be change accordingly with follows systems list: Standard Pinyin (gray) Wade-Giles Romanization (green) MPS II (red) Yale Romanization (yellow) Tongyong PY (purple) Gwoyeu Romatzyh (blue), after phoneme input, you need to press space bar. The without tones words input is available only in Standard Pinyin mode (gray button color). GuoBiao (JianTi) ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first key press is corresponds to PinYin initial. The second key press is corresponds to PinYin medial+final. Character set is GB2312. Simplified Chinese characters are Created by: WERTA©, 2000-2014 2 CJKV VBA IME for MS Word v 3.2 user’s guide information .. available. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it). GuoBiao (JianTi) ZhuYin - input based on Zhuyin Zimu alphabet. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is single ideographic characters only. GuoBiao (FanTi) PinYin - standard Pinyin, with using digitals for tones and input letter «v» instead of «ü». Character set is GB2312. Traditional Chinese characters are available. Keyboard layout is Latin letters. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it). GuoBiao (FanTi) ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first key press is corresponds to PinYin initial. The second key press is corresponds to PinYin medial+final. Character set is GB2312. Traditional Chinese characters are available. Text input mode is single ideographic characters and you can also input some Chinese words (tone digits is not use for it). GuoBiao (FanTi) ZhuYin - input based on Zhuyin Zimu alphabet. Character set is GB2312. Traditional Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is single ideographic characters only. Structural methods GuoBiao (JianTi) CangJie - input with CangJie system. The maximum length of the input string is 5 characters. Character set - GB2312. Simplified Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is single input of ideographic character only. GuoBiao (JianTi) CangJie (exact) - input with CangJie system. The maximum length of the input string is 5 characters. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only. Created by: WERTA©, 2000-2014 3 CJKV VBA IME for MS Word v 3.2 user’s guide information .. GuoBiao (JianTi) WuBi - input with Wubi system. The maximum length of the input string is 4 characters. Character set is GB2312. Simplified Chinese characters are available. Keyboard layout is standard WuBi. Input modes is type of single ideographic character only. Phonetic (Dialects) GuoBiao (JT/FT) Shanghai IME – this is an input with standard Latin transcription system for Shanghainese with the numerical tones designation (0-5). Character set is GB2312. In the ideographs choice list you can see both simplified and traditional hanzi. For this input method the keyboard layout is standard English. The input mode is type of single ideographic character only. GuoBiao (JT/FT) Wu IME – this is an input with generalized Latin transcription system for Wu dialects without any numerical tones designation. Character set is GB2312. In the ideographs choice list you can see both simplified and traditional hanzi. For this input method the keyboard layout is standard English. The input mode is type of single ideographic character only. For activation the list of choice you need just press space bar after input of Wu phoneme. Input methods (character set) for Traditional Chinese: Phonetic (Mandarin) Big5 PinYin - standard Pinyin, with using digitals for tones and input letter «v» instead of «ü». Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Latin letters. Text input mode is typing of single ideographic characters and you can also input some Chinese words (tone digits is not use for it). Big5 ShuangPin - input phoneme is available always with two keyboards clicks (called ShuangPin). The first keypress is corresponds to PinYin initial. The second keypress is corresponds to PinYin medial+final. Character set is BIG5. Traditional Chinese characters are available. Text input mode is typing of single ideographic characters and you can also input some Chinese words (tone digits is not use for it). Big5 ZhuYin - input based on Zhuyin Zimu alphabet. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Bopomofo. Text input mode is type of single ideographic character only. Created by: WERTA©, 2000-2014 4 CJKV VBA IME for MS Word v 3.2 user’s guide information .. Structural methods Big5 CangJie - input with CangJie system. The maximum length of the input string is 5 characters. Character set - BIG5. Traditional Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by matching of the entered part of the coding CangJie stroke. The input mode is type of single ideographic character only. Big5 CangJie (exact) - input with CangJie system. The maximum length of the input string is 5 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard CangJie. The selection of characters for the input list is performs by both matching of the entered part of the encoding CangJie stroke and the total length of encoding CangJie stroke. The input mode is single input of ideographic character only. Big5 Array30 - input with Array30 system. The maximum length of the input string is 4 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Array30. The selection of characters for the input list is performs by matching of the entered part of the coding Array30 stroke. The input mode is type of single ideographic character only. Big5 Boshiamy - input with Boshiamy system. The maximum length of the input string is 4 characters. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is Boshiamy. The selection of characters for the input list is performs by matching of the entered part of the coding Boshiamy stroke. The input mode is type of single ideographic character only. Big5 DaYi (4) - input with DaYi system. The maximum length of the input string is 4 characters. Character set - BIG5. Traditional Chinese characters are available. Keyboard layout is DaYi. The selection of characters for the input list is performs by matching of the entered part of the coding DaYi stroke. The input mode is type of single ideographic character only. Phonetic methods (Chinese dialects) Big5 Cantonese PY - input with Cantonese transcription without tone digits. Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Latin. The input mode is type of single ideographic character only. Created by: WERTA©, 2000-2014 5 CJKV VBA IME for MS Word v 3.2 user’s guide information .. Big5 Hakka PY - input with transcription for Hakka dialects, with digital marking of tones (1-6). Character set is BIG5. Traditional Chinese characters are available. Keyboard layout is standard Latin. The input mode is type of single ideographic character only. Input methods for UNICODE CJK extensions PinYin IME (CJK Unified + ExtA) - input with Standard Pinyin with digital Chinese tones designation (1-5) and with letter «v» instead of «ü». A set of characters for enter is Unicode: CJK Unified + CJK ExtA.
Recommended publications
  • Pan-Sinitic Object Marking: Morphology and Syntax*
    Breaking Down the Barriers, 785-816 2013-1-050-037-000234-1 Pan-Sinitic Object Marking: * Morphology and Syntax Hilary Chappell (曹茜蕾) EHESS In Chinese languages, when a direct object occurs in a non-canonical position preceding the main verb, this SOV structure can be morphologically marked by a preposition whose source comes largely from verbs or deverbal prepositions. For example, markers such as kā 共 in Southern Min are ultimately derived from the verb ‘to accompany’, pau11 幫 in many Huizhou and Wu dialects is derived from the verb ‘to help’ and bǎ 把 from the verb ‘to hold’ in standard Mandarin and the Jin dialects. In general, these markers are used to highlight an explicit change of state affecting a referential object, located in this preverbal position. This analysis sets out to address the issue of diversity in such object-marking constructions in order to examine the question of whether areal patterns exist within Sinitic languages on the basis of the main lexical fields of the object markers, if not the construction types. The possibility of establishing four major linguistic zones in China is thus explored with respect to grammaticalization pathways. Key words: typology, grammaticalization, object marking, disposal constructions, linguistic zones 1. Background to the issue In the case of transitive verbs, it is uncontroversial to state that a common word order in Sinitic languages is for direct objects to follow the main verb without any overt morphological marking: * This is a “cross-straits” paper as earlier versions were presented in turn at both the Institute of Linguistics, Academia Sinica, during the joint 14th Annual Conference of the International Association of Chinese Linguistics and 10th International Symposium on Chinese Languages and Linguistics, held in Taipei in May 25-29, 2006 and also at an invited seminar at the Institute of Linguistics, Chinese Academy of Social Sciences in Beijing on 23rd October 2006.
    [Show full text]
  • In Search of an Effective Romanized Transcription System for Learning Chinese Pronunciation Background
    In Search of an Effective Romanized Transcription System for Learning Chinese Pronunciation Background. Learning to pronounce Chinese characters has always been a challenge for native English speakers for two reasons: 1) the Chinese language contains certain sounds that are not present in the English language and 2) due to the logographic nature of Chinese, romanized transcription systems have been developed to aid in the learning of Chinese pronunciation, however, most romanized transcription systems do not accurately represent Chinese sounds when read by native English speakers. In our present global community foreign language must be taught in methods that most efficiently reflect the needs of the target language learner. To adequately address these needs, Chinese language pedagogy in America and possibly other English speaking countries must shift to accommodate the target’s learner characteristics which includes developing a curriculum of learning Chinese pronunciation not by associating Chinese sounds with misrepresented Latin letters, but by allowing the transcription system flow naturally from the learner’s language. Research Procedure. Noticing this hindrance from first hand experience and from other native English speakers in several elementary Chinese language classes, I chose to examine the five most popular romanized transcription systems for Chinese and test them on native English speakers with no prior knowledge of Chinese to see which one(s) produced the most standard pronunciation. The Scripts chosen were: Hanyu Pinyin,
    [Show full text]
  • Last Name First Name/Middle Name Course Award Course 2 Award 2 Graduation
    Last Name First Name/Middle Name Course Award Course 2 Award 2 Graduation A/L Krishnan Thiinash Bachelor of Information Technology March 2015 A/L Selvaraju Theeban Raju Bachelor of Commerce January 2015 A/P Balan Durgarani Bachelor of Commerce with Distinction March 2015 A/P Rajaram Koushalya Priya Bachelor of Commerce March 2015 Hiba Mohsin Mohammed Master of Health Leadership and Aal-Yaseen Hussein Management July 2015 Aamer Muhammad Master of Quality Management September 2015 Abbas Hanaa Safy Seyam Master of Business Administration with Distinction March 2015 Abbasi Muhammad Hamza Master of International Business March 2015 Abdallah AlMustafa Hussein Saad Elsayed Bachelor of Commerce March 2015 Abdallah Asma Samir Lutfi Master of Strategic Marketing September 2015 Abdallah Moh'd Jawdat Abdel Rahman Master of International Business July 2015 AbdelAaty Mosa Amany Abdelkader Saad Master of Media and Communications with Distinction March 2015 Abdel-Karim Mervat Graduate Diploma in TESOL July 2015 Abdelmalik Mark Maher Abdelmesseh Bachelor of Commerce March 2015 Master of Strategic Human Resource Abdelrahman Abdo Mohammed Talat Abdelziz Management September 2015 Graduate Certificate in Health and Abdel-Sayed Mario Physical Education July 2015 Sherif Ahmed Fathy AbdRabou Abdelmohsen Master of Strategic Marketing September 2015 Abdul Hakeem Siti Fatimah Binte Bachelor of Science January 2015 Abdul Haq Shaddad Yousef Ibrahim Master of Strategic Marketing March 2015 Abdul Rahman Al Jabier Bachelor of Engineering Honours Class II, Division 1
    [Show full text]
  • From Eurocentrism to Sinocentrism: the Case of Disposal Constructions in Sinitic Languages
    From Eurocentrism to Sinocentrism: The case of disposal constructions in Sinitic languages Hilary Chappell 1. Introduction 1.1. The issue Although China has a long tradition in the compilation of rhyme dictionar- ies and lexica, it did not develop its own tradition for writing grammars until relatively late.1 In fact, the majority of early grammars on Chinese dialects, which begin to appear in the 17th century, were written by Europe- ans in collaboration with native speakers. For example, the Arte de la len- gua Chiõ Chiu [Grammar of the Chiõ Chiu language] (1620) appears to be one of the earliest grammars of any Sinitic language, representing a koine of urban Southern Min dialects, as spoken at that time (Chappell 2000).2 It was composed by Melchior de Mançano in Manila to assist the Domini- cans’ work of proselytizing to the community of Chinese Sangley traders from southern Fujian. Another major grammar, similarly written by a Do- minican scholar, Francisco Varo, is the Arte de le lengua mandarina [Grammar of the Mandarin language], completed in 1682 while he was living in Funing, and later posthumously published in 1703 in Canton.3 Spanish missionaries, particularly the Dominicans, played a signifi- cant role in Chinese linguistic history as the first to record the grammar and lexicon of vernaculars, create romanization systems and promote the use of the demotic or specially created dialect characters. This is discussed in more detail in van der Loon (1966, 1967). The model they used was the (at that time) famous Latin grammar of Elio Antonio de Nebrija (1444–1522), Introductiones Latinae (1481), and possibly the earliest grammar of a Ro- mance language, Grammatica de la Lengua Castellana (1492) by the same scholar, although according to Peyraube (2001), the reprinted version was not available prior to the 18th century.
    [Show full text]
  • Ocelot Mandarin Oral Recognition Test Roderick Gammon Kapi'olani
    Ocelot Mandarin Oral Recognition Test Roderick Gammon Kapi’olani Community College February 16, 2004ß Abstract The Ocelot Mandarin Oral Recognition Test is a diagnostic test that measures a student’s ability to match spoken Chinese syllables to a romanization. Analogous to using a Romanized dictionary index, test items are multimedia multiple-choice with an audio prompt and a selection of textual romanizations. The test is organized into subunits focused on different features in a formal linguistic construct of a Mandarin syllable (Yao et al., 1997). The test is computer adaptive and allows for criterion referenced (Brown and Hudson, 2002) item banking (Brown, 1997). After a literature review, this paper describes the structure and piloting of the test. Introduction The Ocelot Mandarin Oral Recognition Test (MORT) measures recognition of Mandarin Chinese oral syllables by interpreting subject performance relevant to a linguistic feature system. Implemented as a Flash (2004 MX) applet, the test is computer adaptive and algorithmically generates items at runtime. The test was created as a member of a set of computer assisted language-learning (CALL) materials modeled around a particular course of study using the Integrated Chinese (Yao et al., 1997) textbooks. This introduction reviews literature regarding Mandarin romanization, language testing, and CALL design. Following that review the introduction recasts the MORT and its pilot study in terms for research questions raised by the literature review. The MORT instrument and its piloting are then described. The paper concludes with a consideration of further avenues of study. Mandarin Mandarin Chinese, with increasing regularity since the mid-20th century, been transcribed using the Pinyin Romanization method (Ramsey, 1987).
    [Show full text]
  • An Investigation of Chinese Learners' Acquisition and Understanding of Bushou and Their Attitude on Formal In-Class Bushou Instruction
    University of Massachusetts Amherst ScholarWorks@UMass Amherst Masters Theses Dissertations and Theses November 2014 An Investigation of Chinese Learners' Acquisition and Understanding of Bushou and Their Attitude on Formal In-Class Bushou Instruction Yan P. Liu University of Massachusetts Amherst Follow this and additional works at: https://scholarworks.umass.edu/masters_theses_2 Recommended Citation Liu, Yan P., "An Investigation of Chinese Learners' Acquisition and Understanding of Bushou and Their Attitude on Formal In-Class Bushou Instruction" (2014). Masters Theses. 98. https://doi.org/10.7275/6054895 https://scholarworks.umass.edu/masters_theses_2/98 This Campus-Only Access for Five (5) Years is brought to you for free and open access by the Dissertations and Theses at ScholarWorks@UMass Amherst. It has been accepted for inclusion in Masters Theses by an authorized administrator of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. AN INVESTIGATION OF CHINESE LEARNERS’ ACQUISITION AND UNDERSTANDING OF BUSHOU AND THEIR ATTITUDE ON FORMAL IN- CLASS BUSHOU INSTRUCTION A CASE STUDY A Thesis Presented By YANPING LIU Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of MASTER OF ARTS September 2014 Department of Languages, Literatures, and Cultures Asian Languages and Literatures AN INVESTIGATION OF CHINESE LEARNERS’ ACQUISITION AND UNDERSTANDING OF BUSHOU AND THEIR ATTITUDE ON FORMAL IN- CLASS BUSHOU INSTRUCTION
    [Show full text]
  • Pinyin and Chinese Children's Phonological Awareness
    Pinyin and Chinese Children’s Phonological Awareness by Xintian Du A thesis submitted in conformity with the requirements For the degree of Master of Arts Department of Curriculum, Teaching and Learning Ontario Institute of Studies in Education University of Toronto @Copyright by Xintian Du 2010 ABSTRACT Pinyin and Chinese Children’s Phonological Awareness Master of Arts 2010 Xintian Du Department of Curriculum, Teaching and Learning University of Toronto This paper critically reviewed the literature on the relationships between Pinyin and Chinese bilingual and monolingual children’s phonological awareness (PA) and identified areas of research worth of further investigation. As the Chinese Phonetic Alphabet providing pronunciation of the universal Chinese characters, Pinyin facilitates children’s early reading development. What research has found in English is that PA is a reliable indicator of later reading success and meta-linguistic training improves PA. In Chinese, a non-alphabetic language, there is also evidence that PA predicts reading in Chinese, which confirms the universality of PA’s role. However, research shows the uniqueness of each language: tonal awareness is stronger indicator in Chinese while phonemic awareness is stronger indicator in English. Moreover, Pinyin, the meta-linguistic training, has been found to improve PA in Chinese and reading in Chinese and possibly facilitate the cross-language transfer of PA from Chinese to English and vice versa. ii ACKNOWLEDGEMENTS I am heartily thankful to my supervisors Becky Chen and Normand Labrie, whose guidance and support from the initial to the final level enabled me to develop a thorough understanding of the subject and eventually complete the thesis paper.
    [Show full text]
  • Translating Chinese Romanized Name Into Chinese Idiographic Characters Via Corpus and Web Validation
    Translating Chinese Romanized Name into Chinese Idiographic Characters via Corpus and Web Validation Yiping Li — Gregory Grefenstette Laboratoire d'Ingénierie de la Connaissance Multimédia Multilingue (LIC2M) Commissariat à l'Energie Atomique Bat. 38-1; 18, rue du Panorama; BP 6; 92265 Fontenay aux Roses Cedex; France [email protected] [email protected] ABSTRACT. Cross-language information retrieval performance depends on the quality of the translation resources used to pass from a user’s source language query to target language documents. Translation lists of proper names are rare but vital resources for cross-language retrieval between languages using different character sets. Named entities translation dictionaries can be extracted from bilingual corpus with some degree of success, but the problem of the coverage of these scarce bilingual corpora remains. In this article, we present a technique for finding Chinese transliterations for any Chinese name written in English script. Our system performs transliteration of Pinyin (the standard Romanization for Chinese) to Chinese characters via corpus and web validation. Though Chinese family names form a small set, the number and variety of multisyllabic first names is great, and treatment is complicated by the fact that one Pinyin transliteration can correspond to hundred of different Chinese characters. Our method finds the best translations of a Chinese name written in Pinyin by filtering out unlikely translations using a bigram model derived from a very large monolingual Chinese corpus, and then vetting remaining candidate transliterations using Web statistics. We experimentally validate our method using an independent gold standard. RESUME. La performance en recherche d'information translingue dépend de la qualité des ressources de traduction utilisées pour passer de la langue source (requête d'utilisateur) vers la langue cible des documents.
    [Show full text]
  • The Grammaticalization of Hakka, Mandarin and Southern Min
    The Grammaticalization of Hakka, Mandarin and Southern Min The Interaction of Negatives with Modality, Aspect, and Interrogatives by Hui-Ling Yang A Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Approved April 2012 by the Graduate Supervisory Committee: Elly van Gelderen, Chair Karen Adams Carrie Gillon Chaofen Sun ARIZONA STATE UNIVERSITY May 2012 ABSTRACT The primary topic of this dissertation is the grammaticalization of negation in three Sinitic language varieties: Hakka, Mandarin, and Southern Min. I discuss negative morphemes that are used under different modality or aspect contexts, including ability, volition, necessity, and perfectivity. Not only does this study examine Southern Min affirmative and negative pairs, but it also highlights the grammaticalization of negation and parametric differences in negation among the languages under investigation. This dissertation also covers the reanalysis of negatives into interrogatives. I approach the investigation of Southern Min negation from both synchronic and diachronic perspectives. I analyze corpus data in addition to data collected from fieldwork for the contemporary linguistic data. For my diachronic research of Chinese negation, I use historical texts and etymological dictionaries. Diachronically, many of the negative morphemes originate from full-fledged verbs and undergo an analogous grammaticalization process that consists of multiple stages of reanalysis from V to T (aspect; modality), and then T to C (interrogative; discourse). I explain this reanalysis, which involves head-to-head movement, using generative frameworks that combine a modified cartographic approach and the Minimalist Economy Principles. Synchronic data show that Southern Min affirmative modals are characterized by a certain morphological doubling.
    [Show full text]
  • Romanization and Language Planning in Taiwan
    1 Romanization and Language Planning in Taiwan The University of Texas at Arlington P.O. Box 2485 Arlington, TX 76004, USA Email: [email protected] Abstract Chiung, Wi-vun Taiffalo. 2001. Romanization and Language Planning in Taiwan. The Linguistic Association of Korea Journal 9(1). Although Taiwan is currently a Hancha (Han characters)-dominated society, romanization was in fact the first writing system used in Taiwan. The first romanized orthography is the Sinkang manuscripts introduced by the Dutch missionaries in the first half of the seventeenth century. Thereafter, Han characters were imposed to Taiwan by the Sinitic Koxinga regime that followed in the second half of the seventeenth century. As the number of Han immigrants from China dramatically increased, Han characters gradually became the dominant writing system. At present, romanization for Mandarin Chinese is an auxiliary script simply used for transliteration purpose. As for Taiwanese romanization, it is mainly used by particular groups, such as church followers and the Taiwanese writing circle. This paper provides readers an overall introduction to the history and current development of romanization in Taiwan from the perspectives of literacy and sociolinguistics. The University of Texas at Arlington. Chiung, Wi-vun Taiffalo. 2001. Romanization and language planning in Taiwan. The Linguistic Association of Korea Journal 9(1), 15-43. 2 Romanization and Language Planning in Taiwan1 1. Introduction Although Taiwan is currently a Hancha (Han characters)-dominated society, romanization once was the unique and first writing system used in Taiwan. This system of romanization was introduced by the Dutch missionaries in the first half of the seventeenth century.
    [Show full text]
  • THE POLITICS of PINYIN: Reflections on the Differences of Mandarin Chinese in Taiwan and China by a Novice
    d THE POLITICS OF PINYIN: Reflections on the differences of Mandarin Chinese in Taiwan and China by a novice Peter D. Coyl In efforts to boost literacy, the Chinese language and its romanized transcription have been modified over the past 100 years. The changes in Chinese on the mainland and Chinese in Taiwan have been motivated by literacy and by politics. Transcription of Mandarin Chinese Until 1979 the standard for transcribing Chinese characters into readable English was a system known as the Wade-Giles system. Wade-Giles assigns some sounds in Chinese with non­ equivalent letters in English. For example, the “T” represents the “D” sound as in “day.” So thousands of people eat “tofu” in­ stead of “dofu” and practice “Taoism” instead of “Daoism.” As this can lead to confusion among those not familiar with the intricacies of the varying sounds, the government of the Peo­ ple’s Republic of China adopted hanyu pinyin (literally trans­ lated as Chinese language spell sound) in 1979, replacing Wade- Giles and zhuyin fuhao (a ruby character system used in Taiwan 63 to teach pronunciation to children and beginning learners) as the way to transcribe and teach Chinese. The hanyu pinyin sys­ tem is more true to proper pronunciation compared to Wade- Giles, but is still imperfect. Taiwan continued to use Wade-Giles as the romanization system. However, as noted earlier, Wade-Giles assigns some let­ ters to sounds that are not intuitive, so Chinese language learn­ ers with no training do not pronounce some of the words cor­ rectly. Tongyong pinyin was developed to correct this problem.
    [Show full text]
  • Modeling Taiwanese Southern-Min Tone Sandhi Using Rule-Based Methods
    Computational Linguistics and Chinese Language Processing Vol. 12, No. 4, December 2007, pp. 349-370 The Association for Computational Linguistics and Chinese Language Processing Modeling Taiwanese Southern-Min Tone Sandhi Using Rule-Based Methods Un-Gian Iunn*, Kiat-Gak Lau+, Hong-Giau Tan-Tenn, Sheng-An Lee*, and Cheng-Yan Kao* Abstract A sizable corpus of Taiwanese text in Latin script has been accumulated over the past two hundred or so years. However, due to the special status of Taiwan, few people can read these materials at present. It is regrettable that the utilization of these plentiful materials is very low. This paper addresses problems raised in the Taiwanese Southern-Min tone sandhi system by describing a set of computational rules to approximate this system, as well as the results obtained from its implementation. Using the romanized Taiwanese Southern-Min text as source, we take the sentence as the unit, translate every word into Chinese via an online Taiwanese-Chinese dictionary (OTCD), and obtain the part-of-speech (POS) information from the Chinese Electronic Dictionary (CED) made by the Chinese Knowledge and Information Processing (CKIP) group of Academia Sinica. By using the POS data and tone sandhi rules based on linguistics, we then tag each syllable with its post-sandhi tone marker. Finally, we implement a Taiwanese Southern-Min tone sandhi processing system which takes a romanized sentence as an input and then outputs the tone markers. Our system achieves 97.39% and 88.98% accuracy rates with training and test data, respectively. Finally, we analyze the factors influencing error for the purpose of future improvement.
    [Show full text]