马马虎虎) Xiè Xiè Nǐ Bú Kè Qì Duì Bu Qǐ Méi Guàn Xì 谢谢 你 , 不客气 , 对不起 , 没关系

Total Page:16

File Type:pdf, Size:1020Kb

马马虎虎) Xiè Xiè Nǐ Bú Kè Qì Duì Bu Qǐ Méi Guàn Xì 谢谢 你 , 不客气 , 对不起 , 没关系 Assignment/ Unit - I: Hello – Greeting and Pinyin 8/20 Quiz/ Test: 1. Practice pinyin & greeting 1. 拼音pinyin – Final 2. Writing voca. 10X each – Due 8/27 3. Quiz – Greeting/ Voca – 8/30 4. WS-Number – 9/4 2. 问候 wèn hòu - greeting Nǐ hǎo ma Textbook Ref.: Page: Q: 你 好 吗? – how are you? Vocabulary: wǒ hěn hǎo 1. míngzì 你nǐ - you 11. 名字 - Name A-1: 我 很 好! – I’m fine. 2. 好hǎo - good 12. 二 èr - 2 3. 吗ma – question 13. 三 sān -3 bù tài hǎo 4. 我Wǒ – I, me, my 14. 四 sì -4 5. 们men – plural 1 15. 五 wǔ -5 A-2: 不 太 好 – not too good 6. 是shì- 16. 六 liù-6 7. 他Tā – he,him,his 17. 七 qī-7 mǎ mǎ hǔ hǔ 8. 叫 jiào – to call, yell 18. 八 bā -8 9. 岁Suì –years old 19. 九 jiǔ -9 A-3: 马 马 虎 虎- OK. 10.什么 shénme - what20. 十 shí-10 Culture Notes: #1 Video – Hello China – Learning one Chinese Culture word a Day 1. Hello- 你好 2. Chinese Lunar Calendar 3. China 4. Beijing 5. Yellow river Video – Hello China – Learning one Chinese culture word a Day https://www.youtube.com/watch?v=k-EuZn5JUvM&list=PLCFDD3F76245D00A9 1. China – 中国 35. Mid-Autumn Festival- 中秋节 68 Ethnic Group-民族 84. Wax Printing- 蜡染 2. Hello- 你好 36. Water- Splashing Festival-拨水节 69. Name- 姓名 85. Brocade- 云锦 3. Confucius-孔子 37. Animal Signs of the Chinese Zodiac 70. Family-家 86. Cheongsam- 旗袍 4. SunTzu-孙子 38. Feng Shui- 风水 71 Courtyard- 四合院 87. Tang Suit-唐装 5. Lao Tzu- 老子 39. Lion Dance – 舞狮 72 Alleyway-胡同 88. Fan- 扇子 6. Sun-太阳 40. Door God – 门神 73 Tulou Building-土楼 89. Abacus- 算盘 7. Moon-月亮 41. Wedding- 婚礼 74 Tea-茶 90. Panda-熊猫 8. Chinese Lunar Calendar-中国农历年 42. Matchmaker- 红娘 75 Chopsticks- 筷子 91. Magpie-喜鹊 9. Compass –指南针 43. Chrysanthemum- 菊花 76 Paper cutting-剪纸 92. Chinese Characters 10. Gun Powder-火药 44. Etiquette- 礼 77 Silk-丝 93. Contradiction- 矛盾 11. Paper-纸 45. Number One Scholar- 状元 78 Porcelain Ware-瓷器 94. Careless 12. Printing-印刷 46. Beijing- 北京 79 Jade- 玉 95. Jealous- 吃醋 13. Beijing Opera-北京京剧 47. Great Wall –长城 80. Money – 钱 96. Bosom Buddy – 知音 14. Traditional Chinese Painting-国画 48. Imperial Palace – 故宫 (紫禁城) 81. Lantern-灯笼 97. Things – 东西 15. Fresco – 壁画 49. Temple of Heaven- 天坛 82. Kite- 风筝 98. Baby- 宝贝 16. Dance-舞蹈 50. Xi’an- 西安 (长安) 83. Shadow Puppet- 皮影 99. Numerals – 数字 17. Music-音乐 51. Terracotta Warriors – 兵马俑 100. Ru Yi- 如意 18. Chime Bells-编钟 52. Yellow river – 黄河 19. Gu Qin-古琴 53. West Lake- 西湖 20. Calligraphy-书法 54 Garden – 庭院 21. Writing Brush-毛笔 55 Three Gorges – 三峡 22. Journey to the West-西游记 56 Dujianyan Irrigation Dam – 都江堰 23. Bamboo-竹 57 Taishan Mountain –太山 24. Dragon –龙 58 Dunhuang –敦煌 () 25. Phoenix-凤 59 Shaolin Monastery – 少林寺 26. Kung Fu- 功夫 60 Dumpling –饺子 27. Tai Chi Chuan- 太极拳 61 Roast Duck – 烤鸭 28. Sword-剑 62 Tang hu lu- 糖葫芦 29. Traditional Chinese Medicine-中药 63 Hotpot-火锅 30. Acupuncture and Moxibustion-针灸 64 Tofu- 豆腐 31. Spring Festival-春节 65 Alcoholic Drinks – 酒 32. Qingming Festival – 清明 66. Noodles- 面 33. Dragon Boat Festival-端午 67. Fish- 鱼 34. 34. QiXi Festival – 七夕 Initials bp m fd t nl (bay) (pay) (may) (fair) (day) (take) (nay) (lay) g k h j q x y = i (go) (king) (happy) (jeep) (cheap) (sheep) w= u Chinese for Kids | Song to z c s zh ch sh r Learn Pinyin in 4 minutes! (lads) (cats) (son) (germ) (church) (shirt) (leisure) https://www.youtube.co m/watch?v=ocgsflnEgqY Finals aoe i u ü bpmf Song (bpmf歌) | Basic Songs | Chinese | By Little Fox https://www.youtube.com/watch?v=EReU1BKtAXo Note: tone mark will place on final (vowel), it there are more Initial + Finals than one place the mark according to order of “a o e i u ü” Ex: ue -----tone mark will be on “e” Tone Mark Initial Nǐ hǎo Final 你 好! (Hi, hello) Nǐ hǎo ma wǒ hěn hǎo 你好吗? (How are you?) 我 很 好 bù tài hǎo 不 太 好 马 虎 mǎ mǎ hǔ hǔ 马 马 虎 虎 Nǐ hǎo Nǐ hǎo 你好! 你好! Nǐ hǎo ma bù tài hǎo 你好吗? 不 太 好 (Chinese Basic Strokes) Jiǎgǔwén Jīnwén Dàzhuàn Xiǎozhuàn oracle bronze big seal seal 甲骨文、大篆、金文、籀文、小篆,隶书、草书、楷书(以及衍生的行书) Lìshū Cǎoshū Xíngshū Kǎishū clerical cursive semi-cursive or regular script running script 1. Oracle bone script: (from left) 馬/马 mǎ "horse", 虎 hǔ "tiger", 豕 shĭ "swine", 犬 quǎn "dog", 鼠 shǔ "rat and mouse", 象 xiàng "elephant", 豸 zhì "beasts of prey", 龜/龟 guī "turtle", 爿 qiáng "low table" (now 床 chuáng), 為/为 wèi "to lead" (now "do or for"), and 疾 jí "illness" 2. the Seal Script (篆书 zhuànshū). 3. the “Clerical Script” (隶书 lìshū) -It evolved organically out of the Spring and - evolved in the Warring States period to the Qin Autumn period Zhou script, and was adopted in a dynasty, was dominant in the Han dynasty. Due standardized form under the first Emperor of to its high legibility to modern readers, it is still China, Qin Shi Huang. The seal script, as the name used for artistic flavor in a variety of functional suggests, is now used only in artistic seals. applications such as headlines, signboards, and advertisements. 5. The Cursive script (草书cǎoshū), 4. the “Semi-cursive Script” (行书 xíngshū) -often mistranslated as Grass script, is a style of Chinese calligraphy. Cursive script is faster to write -used mostly for handwriting than other styles, but difficult to read for those -Also referred to in English both as running unfamiliar with it. It functions primarily as a kind of script and by its Mandarin Chinese name, shorthand script or calligraphic style. xíngshū, it is derived from clerical script, -People who can read standard or printed forms of Chinese may not be able to comprehend this script at all. 6. Regular script Regular script (楷书/ 楷書kǎishū), also called 正楷 (zhèngkǎi), 真書 (zhēnshū), 楷体 (kǎitǐ) and 正書 (zhèngshū), is the newest of the Chinese script styles , most common in modern writings and publications (after the Ming and sans-serif styles, used exclusively in print). Simplified Traditional Chinese characters Chinese characters are currently used in Hong Kong, Macau, and The government of the People's Republic of Republic of China (Taiwan). While traditional China in mainland China has promoted them characters can still be read and understood by for use in printing since the 1950s and 1960s in many mainland Chinese and Singaporeans, an attempt to increase literacy. They are these groups generally retain their use of officially used in the People's Republic of China Simplified characters. Overseas Chinese and Singapore. communities generally use traditional characters. Simplification in China The use of traditional Chinese characters versus simplified Chinese characters varies greatly, and can depend on both the local customs and the medium. Before the official reform, character simplifications were not officially sanctioned and generally adopted vulgar variants and idiosyncratic substitutions. Orthodox variants were mandatory in printed works, while the (unofficial) simplified characters would be used in everyday writing or quick notes. Since the 1950s, and especially with the publication of the 1964 list, the People's Republic of China has officially adopted simplified Chinese characters for use in mainland China, while Hong Kong, Macau, and the Republic of China (Taiwan) were not affected by the reform. There is no absolute rule for using either system, and often it is determined by what the target audience understands, as well as the upbringing of the writer. Although most often associated with the People's Republic of China, character simplification predates the 1949 communist victory. Caoshu, cursive written text, almost always includes character simplification, and simplified forms have always existed in print, albeit not for the most formal works. In the 1930s and 1940s, discussions on character simplification took place within the Kuomintang government, and a large number of Chinese intellectuals and writers have long maintained that character simplification would help boost literacy in China. Indeed, this desire by the Kuomintang to simplify the Chinese writing system (inherited and implemented by the Communist Party of China) also nursed aspirations of some for the adoption of a phonetic script based on the Latin script, and spawned such inventions as the Gwoyeu Romatzyh. The People's Republic of China issued its first round of official character simplifications in two documents, the first in 1956 and the second in 1964. A second round of character simplifications (known as erjian, or "second round simplified characters") was promulgated in 1977. It was poorly received, and in 1986 the authorities rescinded the second round completely, while making six revisions to the 1964 list, including the restoration of three traditional characters that had been simplified: 叠 dié, 覆 fù, 像 xiàng. The majority of simplified characters are drawn from conventional abbreviated forms, or ancient standard forms. For example, the orthodox character 來 lái ("come") was written with the structure 来 in the clerical script (隶书 / 隸書, lìshū) of the Han Dynasty. This clerical form uses one fewer stroke, and was thus adopted as a simplified form. The character 雲 yún ("cloud") was written with the structure 云 in the oracle bone script of the Shang Dynasty, and had remained in use later as a phonetic loan in the meaning of "to say" while the 雨 radical was added to differentiate meanings.
Recommended publications
  • Digital Editions of Premodern Chinese Texts: Methods and Problems – Exemplified Using the Daozang Jiyao1道藏輯要
    Chung-Hwa Buddhist Journal (2012, 25:167-194) Taipei: Chung-Hwa Institute of Buddhist Studies 中華佛學學報第二十五期 頁 167-194 (民國一百零一年),臺北:中華佛學研究所 ISSN:1017-7132 Digital Editions of Premodern Chinese Texts: Methods and Problems – Exemplified Using the Daozang Jiyao1道藏輯要 Christian Wittern Kyoto University Abstract Digital editions do have a great potential for new avenues of research, but they also pose vexing research questions that have to be resolved adequately in order to make the resulting edition useful in the long run. One of the many differences between printed editions of texts and digital editions is the open-endedness of the latter, which means that it can be done incrementally and updated without incurring substantial expenses. The medium of digital editions requires the creator to make many assumptions about the texts explicit and record them in a way that can be processed automatically. This is a new concept, which seems foreign to the agenda of a scholar whose ultimate aim is to engage with the text. This article demonstrates that what seems like a detour is actually advancing the understanding of the text and the need objectify a text in this gives access to new dimensions of a text. It then goes on to provide details of a conceptual model for describing a premodern text digitally that has been developed working on a digital edition of the early Qing Daoist collection Daozang jiyao. Keywords: Text Encoding, Digital Editions, Character Encoding, XML, Doaist Studies 1 I would like to thank the anonymous reviewers for this journal for their very helpful suggestions for clarifications and general improvement of the article.
    [Show full text]
  • The Effect of Pinyin in Chinese Vocabulary Acquisition with English-Chinese Bilingual Learners
    St. Cloud State University theRepository at St. Cloud State Culminating Projects in TESL Department of English 12-2019 The Effect of Pinyin in Chinese Vocabulary Acquisition with English-Chinese Bilingual Learners Yahui Shi Follow this and additional works at: https://repository.stcloudstate.edu/tesl_etds Recommended Citation Shi, Yahui, "The Effect of Pinyin in Chinese Vocabulary Acquisition with English-Chinese Bilingual Learners" (2019). Culminating Projects in TESL. 17. https://repository.stcloudstate.edu/tesl_etds/17 This Thesis is brought to you for free and open access by the Department of English at theRepository at St. Cloud State. It has been accepted for inclusion in Culminating Projects in TESL by an authorized administrator of theRepository at St. Cloud State. For more information, please contact [email protected]. The Effect of Pinyin in Chinese Vocabulary Acquisition with English-Chinese Bilingual learners by Yahui Shi A Thesis Submitted to the Graduate Faculty of St. Cloud State University in Partial Fulfillment of the Requirements for the Degree Master of Arts in English: Teaching English as a Second Language December, 2019 Thesis Committee: Choonkyong Kim, Chairperson John Madden Zengjun Peng 2 Abstract This study investigates Chinese vocabulary acquisition of Chinese language learners in English-Chinese bilingual contexts; the 20 participants in this study were English native speakers, who were enrolled in a Chinese immersion program in central Minnesota. The study used a matching test, and the test contains seven sets of test items. In each set, there were six Chinese vocabulary words and the English translations of three of them. The six words are listed in one column on the left, and the three translations were in another column on the right.
    [Show full text]
  • Learning Chinese Characters Through Applied
    Learning(g Chinese Characters (汉字)g) through Applied Pattern Recognition Dr. Ta’id HOLMES (博士·福尔摩斯·大山) TIP SAP Research SE&T Non-Objectives Scientific Talk Relevancy &C& Completeness of fC Content Writing & Spea kin g © 2011 SAP AG. All rights reserved. 2 Objectives Activating your Intrinsic “Research Gene“ Raising your Curiosity Boost your Creativity, Motivation, & Happiness Gain a Basic Understanding of Written Chinese Develop a Profound Appreciation for the Different © 2011 SAP AG. All rights reserved. 3 Agenda (政纲) 1 Facts (事实) 2 History (历史) 3 Examples (实例) 4 Outlook (图景) © 2011 SAP AG. All rights reserved. 4 Facts (事实): Some Metrics 100,000+ characters 2000 simplified characters 3000 99% of characters used in newspapers 4000 you may consider yourself educated 6000 to read literature or Classical Chinese 1,300,000+ speakers © 2011 SAP AG. All rights reserved. 5 Total Number of Chinese Characters (汉字) Year Name of dictionary Number of characters 100 Shuowen Jiezi 9,353 543? Yupian 12,158 601 Qieyun 16,917 997 Longkan Shoujian 26,430 1011 Guangyun 26,194 1039 Jiyun 53, 525 1615 Zihui 33,179 1675 Zhengzitong 33, 440 1716 Kangxi Zidian 47,035 1916 Zhonghua Da Zidian 48,000 1989 Hanyu Da Zidian 54,678 1994 Zhonghua Zihai 85,568 2004 Yitizi Zidian 106,230 © 2011 SAP AG. All rights reserved. 6 1 Facts (事实) 2 History (历史) 3 Examples (实例) 4 Outlook (图景) © 2011 SAP AG. All rights reserved. 7 History (历史) Jiǎhú (賈湖) symbols: 6600-6200 BC © 2011 SAP AG. All rights reserved. 8 Shang Dynasty (商朝): 1600-1046 BC Oracle bone script (甲骨文) © 2011 SAP AG.
    [Show full text]
  • The Linguistic Analysis of Chinese Emoticon
    University of Massachusetts Amherst ScholarWorks@UMass Amherst Masters Theses Dissertations and Theses July 2015 THE LINGUISTIC ANALYSIS OF CHINESE EMOTICON Xiangxi Liu University of Massachusetts Amherst Follow this and additional works at: https://scholarworks.umass.edu/masters_theses_2 Part of the Chinese Studies Commons Recommended Citation Liu, Xiangxi, "THE LINGUISTIC ANALYSIS OF CHINESE EMOTICON" (2015). Masters Theses. 236. https://doi.org/10.7275/6945608 https://scholarworks.umass.edu/masters_theses_2/236 This Open Access Thesis is brought to you for free and open access by the Dissertations and Theses at ScholarWorks@UMass Amherst. It has been accepted for inclusion in Masters Theses by an authorized administrator of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. THE LINGUISTIC ANALYSIS OF CHINESE EMOTICON A Thesis Presented by XIANGXI LIU Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of MASTER OF ARTS May 2015 Asian Languages and Literatures i © Copyright by Xiangxi Liu 2015 All Rights Reserved ii THE LINGUISTIC ANALYSIS OF CHINESE EMOTICON A Thesis Presented by XIANGXI LIU Approved as to style and content by: ______________________________________ Zhongwei Shen, Chair ______________________________________ Enhua Zhang, Member ______________________________________ Zhijun Wang, Member ________________________________________ Stephen Miller Director Asian Languages and Literatures ________________________________________ William Moebius, Chair Department of Languages, Literatures and Cultures iii ACKNOWLEDGMENTS First, I gratefully acknowledge my advisor, Zhongwei Shen, for granting me the opportunity to pursue my master’s degree under his guidance. Since I’m certainly not his typical student type: hard working, always meeting deadlines and attention to details.
    [Show full text]
  • Proposal for a Chinese Script Root Zone LGR
    Proposal for a Chinese Script Root Zone LGR LGR Version 1.7.2 Date: 2020-02-28 Document version:17 Authors: CGP (Chinese Generation Panel) Proposal for a Chinese Root Zone LGR CGP 1 General Information/ Overview/ Abstract The purpose of this document aims to give an overarching view of the label generation rules for the Chinese Script (Hani) including rationale behind the design decisions taken. This includes a discussion of the relevant features of the script, the communities and languages using it, as well as the process and methodology used and information of the contributors. The formal specification of the LGR can be found in the accompanying XML document: ● proposal-chinese-lgr-28feb20-en.xml Labels for testing can be found in the accompanying text document: ● chinese-test-labels-28feb20-en.txt All the appendices to the document can be found in the accompanying EXCEL and XML documents: ● proposal-chinese-lgr-28feb20-en, referred to as [Proposal] below ● Appendix A CGP Repertoire [201902].xlsx, ● Appendix B JGP Repertoire [201703].xlsx, ● Appendix C KGP Hanja Repertoire [201703].xlsx, ● Appendix D CGP Variant Mappings [201909].xlsx, ● Appendix E CGP Internal Review [202002].xlsx ● Appendix F IP External Review [201909].xlsx ● Appendix G.1 CDNC IDN Table 2018 in RFC3743 format.txt ● Appendix G.2 CDNC IDN Table 2018 in XML format.xml ● Appendix H KGP Hanja Variant Mappings [201703].xlsx, ● Appendix I CGP Variant Mappings Matching Existing Practice [201909].xlsx ● Appendix J CGP Variant Mappings Differ from Existing Practice [201909].xlsx
    [Show full text]
  • Hong Kong Supplementary Character Set
    LegCo Panel on Information Technology and Broadcasting Hong Kong Supplementary Character Set Introduction This paper explains the objective of developing the Hong Kong Supplementary Character Set (HKSCS), the criteria for the inclusion of characters in the HKSCS and other related matters. Objective of developing the HKSCS 2. The ISO 10646 and the BIG-5 coding scheme commonly used in Hong Kong do not encompass some of the Chinese characters which the Hong Kong Special Administrative Region (HKSAR) Government and the public need to use in electronic communication in Chinese. Owing to the absence of a common Chinese language interface, conflicts of internal codes arise in electronic communication. (Please refer to Annex 1 Annex 1 for details.) The objective of developing the HKSCS is to collate Chinese characters which computer users need for practical purposes and which are not at present included in any of the commonly used coding schemes. In doing so, we seek to provide a common Chinese language interface so as to facilitate accurate electronic communication in Chinese. 3. In 1995, the Hong Kong Government developed a set of coded Chinese characters called the Government Common Character Set (GCCS) in the User-Defined Area of the Big-5 coding scheme to facilitate electronic communication among government departments. This character set, which included a number of Chinese characters unique to Hong Kong, was supplementary to the Big-5 basic character set. Initially, GCCS was intended for use within Government only. Subsequently, in order to enable the public to view government documents on the Internet, GCCS was made freely available on the Government’s web site.
    [Show full text]
  • The Totality of Chinese Characters – a Digital Perspective
    Journal of Chinese Language and Computing 17 (2): 107-125 107 The Totality of Chinese Characters – A Digital Perspective Shouhui Zhao CRPP, NIE of Nanyang Technological University, Singapore 647616 [email protected] Dongbo Zhang Department of Modern Language, Carnegie Mellon University, PA 15231, USA [email protected] Abstract That the Chinese IT industry has been recurrently plagued by the deficiencies of the Chinese writing system has been well known. In this paper, we provided an examination of the complex factors involved in the process of streamlining the total number (TN) of Chinese characters (hanzi), and critically reviewed the existing and emerging theoretical frameworks advocated by language policy (LP) practitioners over the years in addressing the issue of fixing the TN. We first identified three referencing points when people from different fields talk about the TN of hanzi. Based on this review of the concept of TN, we discussed how the ever-expanding, indefinable and unpredictable TN poses problems to Chinese information processing. From a sociolinguistic perspective, we then pointed out the main external causes that lead to the dilemma of TN restriction. Various proposals, articulated since the early 20th century, were then reviewed and their failures were briefly discussed. Lastly, the most recent visionary model to deal with hanzi’s TN was introduced and how it informs the planning of TN of Chinese characters was also discussed. It is hoped, this study will shed some light on the in-depth understanding towards one difficult aspect of Chinese script modernization and computerization. Keywords Chinese characters, total number, variant forms (VF), rarely used characters (RC), encoding set 1.
    [Show full text]
  • Dos Ideogramas (China) Às Palavras (Brasil) : Elaboração Do Primeiro Dicionário Básico Bilíngüe Português-Chinês
    UNIVERSIDADE DE BRASÍLIA INSTITUTO DE LETRAS DEPARTAMENTO DE LINGÜÍSTICA, PORTUGUÊS E LÍNGUAS CLÁSSICAS PROGRAMA DE PÓS-GRADUAÇÃO EM LINGÜÍSTICA DISSERTAÇÃO DE MESTRADO DOS IDEOGRAMAS (CHINA) ÀS PALAVRAS (BRASIL) : ELABORAÇÃO DO PRIMEIRO DICIONÁRIO BÁSICO BILÍNGÜE PORTUGUÊS-CHINÊS Luo Yan Julho de 2007 Brasília – DF LUO YAN DOS IDEOGRAMAS (CHINA) ÀS PALAVRAS (BRASIL) : ELABORAÇÃO DO PRIMEIRO DICIONÁRIO BÁSICO BILÍNGÜE PORTUGUÊS-CHINÊS Dissertação apresentada ao Programa de Pós-Graduação Stricto Sensu do Departamento de Lingüística, Português e Línguas Clássicas da Universidade de Brasília – UnB, como requisito parcial à obtenção do grau de Mestre em Lingüística. Orientadora: Profa. Doutora Enilde Faulstich Brasília – DF Julho de 2007 BANCA EXAMINADORA Profª. Dra. Enilde Faulstich ( LIP ) - UnB Presidente ________________________________________ Prof ª. Dra Patrícia Vieira Nunes Gomes (Michellangelo) Membro efetivo ________________________________________ Prof. Dr. Li Weigang ( CIC ) - UnB Membro efetivo ________________________________________ Prof. Dr. René Strehler (LET) - UnB Membro suplente ________________________________________ AGRADECIMENTOS Gostaria de registrar o meu reconhecimento àqueles que contribuíram para a realização deste trabalho: Aos meus pais, pelo amor que me dedicam, pela compreensão e pelo suporte emocional e espiritual. À Professora Dra. Enilde Faulstich, pela acolhida de meu projeto de mestrado e por sua precisa orientação. À minha amiga Thaís Lobos, por sua sempre disponibilidade e grande generosidade em transmitir seus conhecimentos em relação à língua portuguesa. À amiga Liu Jia que é formada na Língua e Literatura Chinesa, pela ajuda na coleta dos dados da língua chinesa e sanar muitas dúvidas sobre a língua. Ao Professor Zhang Jianbo do Departamento da Língua Espanhola e Portuguesa da Universidade dos Estudos Estrangeiros de Beijing pela avaliação e sugestões na elaboração do dicionário que aqui apresento.
    [Show full text]
  • Word Segmentation in Chinese Language Processing
    Statistics and Its Interface Volume 10 (2017) 165–173 Word segmentation in Chinese language processing Xinxin Shu, Junhui Wang, Xiaotong Shen, and Annie Qu∗ However, after rapid growth in the last decade, there were over 649 million internet users in 2013 writing text docu- This paper proposes a new statistical learning method for ments in Chinese, consisting of 23.2% of all internet users, word segmentation in Chinese language processing. Word compared to 28.6% in English. The online sales and enter- segmentation is the crucial first step towards natural lan- tainment businesses also promote the popularity of Chinese guage processing. Segmentation, despite progress, remains in the digital world. For example, Amazon.cn data (Zhang under-studied; particularly for the Chinese language, the et al., 2009) consists of 5 × 105 Chinese reviews on various second most popular language among all internet users. products. One major difficulty is that the Chinese language is highly However, Chinese language processing is still an area context-dependent and ambiguous in terms of word repre- which has been severely under-studied. This is likely due to sentations. To overcome this difficulty, we cast the problem specific challenges caused by the characteristics of Chinese of segmentation into a framework of sequence classification, language. Word segmentation is considered a crucial step to- where an instance (observation) is a sequence of characters, wards Chinese language processing tasks, due to the unique and a class label is a sequence determining how each char- characteristics of Chinese language structure. Chinese words acter is segmented. Given the class label, each character generally are composed of multiple characters without any sequence can be segmented into linguistically meaningful delimiter appearing between words.
    [Show full text]
  • SCML: a Structural Representation for Chinese Characters
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Dartmouth Digital Commons (Dartmouth College) Dartmouth College Dartmouth Digital Commons Dartmouth College Undergraduate Theses Theses and Dissertations 5-29-2007 SCML: A Structural Representation for Chinese Characters Daniel G. Peebles Dartmouth College Follow this and additional works at: https://digitalcommons.dartmouth.edu/senior_theses Part of the Computer Sciences Commons Recommended Citation Peebles, Daniel G., "SCML: A Structural Representation for Chinese Characters" (2007). Dartmouth College Undergraduate Theses. 51. https://digitalcommons.dartmouth.edu/senior_theses/51 This Thesis (Undergraduate) is brought to you for free and open access by the Theses and Dissertations at Dartmouth Digital Commons. It has been accepted for inclusion in Dartmouth College Undergraduate Theses by an authorized administrator of Dartmouth Digital Commons. For more information, please contact [email protected]. SCML: A Structural Representation for Chinese Characters Senior Honors Thesis Daniel G. Peebles Advisor: Devin Balkcom May 29, 2007 Dartmouth College Technical Report TR2007-592 Abstract Chinese characters are used daily by well over a billion people. They constitute the main writing system of China and Taiwan, form a major part of written Japanese, and are also used in South Korea. Anything more than a cursory glance at these characters will reveal a high degree of structure to them, but computing systems do not currently have a means to operate on this structure. Existing character databases and dictionaries treat them as numerical code points, and associate with them additional ‘hand-computed’ data, such as stroke count, stroke order, and other information to aid in specific searches.
    [Show full text]
  • Die Chinesische Schrift Fasziniert Durch Ihre Fremdheit, Ihr Alter Und Ihre Schönheit
    Die chinesische Schrift fasziniert durch ihre Fremdheit, ihr Alter und ihre Schönheit. Thomas O. Höllmann erklärt, wie sie ent- standen ist, wie die Zeichen aufgebaut sind und wie man diese liest. Er erläutert, warum sich Chinesen unter Umständen besser schriftlich als mündlich verständigen können und welche Rolle chinesische Zeichen in Japan und Korea spielen. Nicht zuletzt geht es um die unterschiedlichen Schreibtechniken, von der Kal- ligraphie mit Tusche und Pinsel über den frühen Buchdruck bis hin zur modernen Textverarbeitung. Thomas O. Höllmann ist Professor für Sinologie an der Lud- wig-Maximilians-Universität München. Bei C.H.Beck erschie- nen von ihm u. a.«Das alte China» (2008), «Schlafender Lotos, trunkenes Huhn. Kulturgeschichte der chinesischen Küche» (2010), in C.H.Beck Wissen «Die Seidenstraße» (32011) sowie in der Reihe C.H.Beck Textura der Band «Windgeflüster. Chi- nesische Gedichte über die Vergänglichkeit». Thomas O. Höllmann DIE CHINESISCHE SCHRIFT Geschichte, Zeichen, Kalligraphie Verlag C.H.Beck Mit 39 Abbildungen und einer Karte Umschlagabbildung: Das Schriftzeichen shu bedeutet «Schrift, Brief, Buch». Es ist einem Text entnommen, den der berühmte Kalligraph Yan Zhengqing im Jahre 780 auf einer Stele anbringen ließ, um die Geschichte seiner Familie festzuhalten. Sie befindet sich heute im «Stelenwald» in Xi’an. (Nach Yu Weichao, Huaxia zhi lu, Beijing 1997: Bd. 3, S. 166) Originalausgabe © Verlag C.H.Beck oHG, München 2015 Satz: Fotosatz Amann, Memmingen Druck und Bindung: Druckerei C.H.Beck, Nördlingen Umschlaggestaltung: Uwe Göbel, München ISBN 978 3 406 68290 2 www.beck.de Inhalt Vorwort 7 1. Inspiration und Drill 9 Viele Sprachen, eine Schrift . 9 Privilegien und Defizite .......................
    [Show full text]
  • ISO/IEC International Standard 10646-1
    Proposed Draft Amendment (PDAM) 8 ISO/IEC 10646:2003/Amd.8:2009 (E) Information technology — Universal Multiple-Octet Coded Character Set (UCS) — AMENDMENT 8: Additional symbols, Bamum supplement, CJK Unified Ideographs Extension D, and other characters Page 2, Clause 3, Normative references Replace the list describing the fields of the linked con- tent (CJKU_SR.txt) and the following paragraph with Update the reference to the Unicode Bidirectional Algo- the following text: rithm and the Unicode Normalization Forms as follows: st • 1 field: BMP or SIP code point (0hhhh), Unicode Standard Annex, UAX#9, The Unicode Bidi- (2hhhh) rectional Algorithm: • 2nd field: Radical Stroke index http://www.unicode.org/reports/tr9/tr9-21.html. (d{1,3}’.d{1,2}), (Radical is one to three di- gits, optionally followed by an apostrophe for alter- Unicode Standard Annex, UAX#15, Unicode Normali- nate radical, followed by a full stop, and ending by zation Forms: one or two digits for the stroke count). http://www.unicode.org/reports/tr15/tr15-31.html. rd • 3 field: Hanzi G sources(G0-hhhh), (G1- hhhh), (G3-hhhh), (G5-hhhh), (G7- Editor’s Note: The versions for the Unicode Standard hhhh), (GS-hhhh), (G8-hhhh), (G9- Annexes mentioned above will be updated as appropri- hhhh), (GE-hhhh), (G_4K), (G_BK), ate in future phases of this amendment process. (G_BKddddd), (G_CH), (G_CY), (G_CYYddddd), (G_CHddddd), (G_FZ), Page 21, Sub-clause 23.1 Source references (G_FZddddd), (G_GHddddd), for CJK Unified Ideographs (G_GFHZBddd), (G_GJZddddd), (G_HC), (G_HCddddd), (G_HZ),
    [Show full text]