Script As a Factor in Translation
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Towards Chinese Calligraphy Zhuzhong Qian
Macalester International Volume 18 Chinese Worlds: Multiple Temporalities Article 12 and Transformations Spring 2007 Towards Chinese Calligraphy Zhuzhong Qian Desheng Fang Follow this and additional works at: http://digitalcommons.macalester.edu/macintl Recommended Citation Qian, Zhuzhong and Fang, Desheng (2007) "Towards Chinese Calligraphy," Macalester International: Vol. 18, Article 12. Available at: http://digitalcommons.macalester.edu/macintl/vol18/iss1/12 This Article is brought to you for free and open access by the Institute for Global Citizenship at DigitalCommons@Macalester College. It has been accepted for inclusion in Macalester International by an authorized administrator of DigitalCommons@Macalester College. For more information, please contact [email protected]. Towards Chinese Calligraphy Qian Zhuzhong and Fang Desheng I. History of Chinese Calligraphy: A Brief Overview Chinese calligraphy, like script itself, began with hieroglyphs and, over time, has developed various styles and schools, constituting an important part of the national cultural heritage. Chinese scripts are generally divided into five categories: Seal script, Clerical (or Official) script, Regular script, Running script, and Cursive script. What follows is a brief introduction of the evolution of Chinese calligraphy. A. From Prehistory to Xia Dynasty (ca. 16 century B.C.) The art of calligraphy began with the creation of Chinese characters. Without modern technology in ancient times, “Sound couldn’t travel to another place and couldn’t remain, so writings came into being to act as the track of meaning and sound.”1 However, instead of characters, the first calligraphy works were picture-like symbols. These symbols first appeared on ceramic vessels and only showed ambiguous con- cepts without clear meanings. -
Uhm Phd 9506222 R.Pdf
INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UM! films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent UJWD the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adverselyaffect reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-band comer and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. U·M·I University Microfilms tnternauonat A Bell & Howell tntorrnatron Company 300 North Zeeb Road. Ann Arbor. M148106-1346 USA 313/761-4700 800:521·0600 Order Number 9506222 The linguistic and psycholinguistic nature of kanji: Do kanji represent and trigger only meanings? Matsunaga, Sachiko, Ph.D. University of Hawaii, 1994 Copyright @1994 by Matsunaga, Sachiko. -
Glyce: Glyph-Vectors for Chinese Character Representations
Glyce: Glyph-vectors for Chinese Character Representations Yuxian Meng*, Wei Wu*, Fei Wang*, Xiaoya Li*, Ping Nie, Fan Yin Muyu Li, Qinghong Han, Xiaofei Sun and Jiwei Li Shannon.AI fyuxian meng, wei wu, fei wang, xiaoya li, ping nie, fan yin, muyu li, qinghong han, xiaofei sun, jiwei [email protected] Abstract It is intuitive that NLP tasks for logographic languages like Chinese should benefit from the use of the glyph information in those languages. However, due to the lack of rich pictographic evidence in glyphs and the weak generalization ability of standard computer vision models on character data, an effective way to utilize the glyph information remains to be found. In this paper, we address this gap by presenting Glyce, the glyph-vectors for Chinese character representations. We make three major innovations: (1) We use historical Chinese scripts (e.g., bronzeware script, seal script, traditional Chinese, etc) to enrich the pictographic evidence in characters; (2) We design CNN structures (called tianzege-CNN) tailored to Chinese character image processing; and (3) We use image-classification as an auxiliary task in a multi-task learning setup to increase the model’s ability to generalize. We show that glyph-based models are able to consistently outperform word/char ID-based models in a wide range of Chinese NLP tasks. We are able to set new state- of-the-art results for a variety of Chinese NLP tasks, including tagging (NER, CWS, POS), sentence pair classification, single sentence classification tasks, dependency parsing, and semantic role labeling. For example, the proposed model achieves an F1 score of 80.6 on the OntoNotes dataset of NER, +1.5 over BERT; it achieves an almost perfect accuracy of 99.8% on the Fudan corpus for text classification. -
Universal Multiple-Octet Coded Character
Universal Multiple-Octet Coded Character Set (UCS) ISO/IEC JTC 1/SC 2/WG 2 N4701 Date: 2015-10-23 Source: WG 2 meeting 64, Kunibiki Messe, Matsue, Shimane, Japan; 2015-10-19/23 Title: Recommendations from WG 2 meeting 64 Action: For approval by SC 2 and for information to WG 2 Status: Adopted at meeting 64 of WG 2 Distribution: WG2 meeting 64 (ISO/IEC JTC 1/SC 2 and WG 2) Experts accredited by the national bodies or liaison organizations for Canada, China, IRG, Japan, SEI - UC Berkeley (Liaison), TCA (Liaison), the United Kingdom, the Unicode Consortium (Liaison), and the USA were present when the following recommendations were adopted (see attached attendance list). Character count 120585 in 4th edition 101 additions in DAM1 to the 4th edition 11630 in PDAM2.1 (end of meeting 63) Character count 132316 (total allocated till end of meeting 63) Note: Some of the referenced proposals in this document are contributions from the Unicode Consortium and do not have a WG2 document number. Observation M64.01 (Amendment 2 changes since meeting 63): WG2 notes that in addition to the total of 11630 characters listed in recommendation M63.11 (document N4604), progression of Amendment 2 through PDAM 2.2 and to DAM2 stages has resulted in several changes as summarized in document N4665. The net is that total number of characters that went for DAM2 ballot was 7332. count: 7332 additions in Amendment 2 Observation M64.02 (Additions in 5th edition ballot): WG2 also notes that the balloted text for CD of the 5th edition contained the Amendment 1 and DAM2 content, plus several changes. -
Sinitic Language and Script in East Asia: Past and Present
SINO-PLATONIC PAPERS Number 264 December, 2016 Sinitic Language and Script in East Asia: Past and Present edited by Victor H. Mair Victor H. Mair, Editor Sino-Platonic Papers Department of East Asian Languages and Civilizations University of Pennsylvania Philadelphia, PA 19104-6305 USA [email protected] www.sino-platonic.org SINO-PLATONIC PAPERS FOUNDED 1986 Editor-in-Chief VICTOR H. MAIR Associate Editors PAULA ROBERTS MARK SWOFFORD ISSN 2157-9679 (print) 2157-9687 (online) SINO-PLATONIC PAPERS is an occasional series dedicated to making available to specialists and the interested public the results of research that, because of its unconventional or controversial nature, might otherwise go unpublished. The editor-in-chief actively encourages younger, not yet well established, scholars and independent authors to submit manuscripts for consideration. Contributions in any of the major scholarly languages of the world, including romanized modern standard Mandarin (MSM) and Japanese, are acceptable. In special circumstances, papers written in one of the Sinitic topolects (fangyan) may be considered for publication. Although the chief focus of Sino-Platonic Papers is on the intercultural relations of China with other peoples, challenging and creative studies on a wide variety of philological subjects will be entertained. This series is not the place for safe, sober, and stodgy presentations. Sino- Platonic Papers prefers lively work that, while taking reasonable risks to advance the field, capitalizes on brilliant new insights into the development of civilization. Submissions are regularly sent out to be refereed, and extensive editorial suggestions for revision may be offered. Sino-Platonic Papers emphasizes substance over form. -
Lisu and Lahu 1 Abstract Introduction 75
Bulletin suisse de linguislique appliyuee, 691I, 1999,75-93 Publié dans Bulletin VALS-ASLA (Association suisse de linguistique appliquée) 69/1, 75-93, 1999 qui doit être utilisée pour toute référence à ce travail Standardisation of Transnational Minority Languages in Asia: Lisu and Lahu 1 David BRADLEY & Maya BRADLEY Abstract Lisu and Lahu are each spoken by large groups spread over southwestern China, northern Laos and Thailand, and northeastern Burma, with some Lahu in northern Vietnam and some Lisu in northeastern India. The status and corpus planning and educational policy for these minorities differ in each nation where they are spoken, which has led to the selection of more than one "standard" variety and the development of various competing orthographies. Diversity was created by religious and political divisions: Protestant missionaries prepared the earliest and most widely used orthographies, but Catholic and post-1950 Chinese Communist orthographies are also in use for Lahu; while Lisu has two very different Protestant orthographies and a post-1950 Chinese orthography in use, along with others that are no longer used. There are also competing centrifugal tendencies among some subgroups speaking divergent dialects, who wish to create their own orthographies and achieve some recognition for their own speech variety. For example, the Lahu Shi (Yellow Lahu) have created their own writing system; this is used among some refugees from Laos living in California and may soon be introduced in Thailand and elsewhere. Nevertheless, each of the two groups feels an internal unity, which is linguistically reflected in efforts to reunify the orthographies and maintain international contacts among leaders and language scholars. -
The Following Is the List of East Asian Linguistics-Related Publications (As
The following is the list of East Asian linguistics-related publications (as of February 2016) by our seven DEALL “East Asian Linguistics” faculty members’ (Chan, Nakayama, Noda, C. Quinn, Unger (Emeritus), Xie, & Yuasa). It shows not only our faculty members’ expertise, but also which issues have been investigated by them. Please note that the list does not include any linguistic reviews, abstracts, and encyclopedia entries, non-EA linguistics-related articles, and Pedagogy publications. Also it does not include translated books as separate entries. Linguistics in general Nakayama, M. (editor-in-chief) (2007-12) Journal of Japanese Linguistics 22-28. Minami, M. & M. Nakayama. (editors) (2013) Journal of Japanese Linguistics 29. Xie, Z. & T. Tsui. (eds.) (2015) Buckeye East Asian Linguistics (vol. 1). Columbus, OH: The Knowledge Bank. (http://kb.osu.edu/dspace/handle/1811/73648) Yuasa, E, T. Bagchi, & K. Beals. (eds.) (2011) Pragmatics and Autolexical Grammar: In Honors of Jerry Sadock. Amsterdam: John Benjamins. Inagaki, S., S. Arita, M. Hirakawa, Y. Hirakawa, H. Morikawa, M. Nakayama, J. Tsubaki, & H. Sirai (eds.) (2009) Studies in Language Science Vol. 8. Tokyo: Kurosio Publishers. Chan, M. K. M., & H. Kang (eds.) (2008) Proceedings of the 20th North American Conference on Chinese Linguistics (NACCL-20). 2 Volumes. Columbus, OH: East Asian Studies Center, The Ohio State University. Nakayama, M., M. Minami, H. Morikawa, K. Nakamura, & H. Sirai (eds.) (2006) Studies in Language Science Vol. 5. Tokyo: Kurosio Publishers. Minami, M., H. Kobayashi, M. Nakayama, & H. Sirai (eds.) (2005) Studies in Language Science Vol. 4. Tokyo: Kurosio Publishers. Minami, M., H. Kobayashi, M. Nakayama, & H. -
A Comparative Analysis of the Simplification of Chinese Characters in Japan and China
CONTRASTING APPROACHES TO CHINESE CHARACTER REFORM: A COMPARATIVE ANALYSIS OF THE SIMPLIFICATION OF CHINESE CHARACTERS IN JAPAN AND CHINA A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ASIAN STUDIES AUGUST 2012 By Kei Imafuku Thesis Committee: Alexander Vovin, Chairperson Robert Huey Dina Rudolph Yoshimi ACKNOWLEDGEMENTS I would like to express deep gratitude to Alexander Vovin, Robert Huey, and Dina R. Yoshimi for their Japanese and Chinese expertise and kind encouragement throughout the writing of this thesis. Their guidance, as well as the support of the Center for Japanese Studies, School of Pacific and Asian Studies, and the East-West Center, has been invaluable. i ABSTRACT Due to the complexity and number of Chinese characters used in Chinese and Japanese, some characters were the target of simplification reforms. However, Japanese and Chinese simplifications frequently differed, resulting in the existence of multiple forms of the same character being used in different places. This study investigates the differences between the Japanese and Chinese simplifications and the effects of the simplification techniques implemented by each side. The more conservative Japanese simplifications were achieved by instating simpler historical character variants while the more radical Chinese simplifications were achieved primarily through the use of whole cursive script forms and phonetic simplification techniques. These techniques, however, have been criticized for their detrimental effects on character recognition, semantic and phonetic clarity, and consistency – issues less present with the Japanese approach. By comparing the Japanese and Chinese simplification techniques, this study seeks to determine the characteristics of more effective, less controversial Chinese character simplifications. -
Negotiation Philosophy in Chinese Characters
6 Ancient Wisdom for the Modern Negotiator: What Chinese Characters Have to Offer Negotiation Pedagogy Andrew Wei-Min Lee* Editors’ Note: In a project that from its inception has been devoted to second generation updates, it is instructive nonetheless to realize how much we have to learn from the past. We believe Lee’s chapter on Chinese characters and their implications for negotiation is groundbreaking. With luck, it will prove to be a harbinger of a whole variety of new ways of looking at our field that will emerge from our next round of discussion. Introduction To the non-Chinese speaker, Chinese characters can look like a cha- otic mess of dots, lines and circles. It is said that Chinese is the most difficult language in the world to learn, and since there is no alpha- bet, the struggling student has no choice but to learn every single Chinese character by sheer force of memory – and there are tens of thousands! I suggest a different perspective. While Chinese is perhaps not the easiest language to learn, there is a very definite logic and sys- tem to the formation of Chinese characters. Some of these characters date back almost eight thousand years – and embedded in their make-up is an extraordinary amount of cultural history and wisdom. * Andrew Wei-Min Lee is founder and president of the Leading Negotiation Institute, whose mission is to promote negotiation pedagogy in China. He also teaches negotiation at Peking University Law School. His email address is an- [email protected]. This article draws primarily upon the work of Feng Ying Yu, who has spent over three hundred hours poring over ancient Chinese texts to analyze and decipher the make-up of modern Chinese characters. -
The Chinese Script T � * 'L
Norman, Jerry, Chinese, Cambridge: Cambridge University Press, 1988. 1 3.1 Th e beginnings of Chinese writing 59 3 FISH HORSE ELEPHANT cow (yu) (m ii) (xiimg) (niu) " The Chinese script t � * 'l Figure 3.1. Pictographs in early Chinese writing 3.1 The beginnings of Chinese writing1 The Chinese script appears as a fully developed writing system in the late Shang .dynasty (fourteenth to eleventh centuries BC). From this period we have copious examples of the script inscribed or written on bones and tortoise shells, for the most part in the form of short divinatory texts. From the same period there also Figure 3.2. The graph fo r quiin'dog' exist a number of inscriptions on bronze vessels of various sorts. The former type of graphic record is referred to as the oracle bone script while the latter is com of this sort of graph are shown in Figure 3.1. The more truly representational a monly known· as the bronze script. The script of this period is already a fully graph is, the more difficult and time-consuming it is to depict. There is a natural developed writing system, capable of recording the contemporary Chinese lan tendency for such graphs to become progressively simplified and stylized as a guage in a complete and unambiguous manner. The maturity of this early script writing system matures and becomes more widely used. As a result, pictographs has suggested to many scholars that it must have passed through a fairly long gradually tend to lose their obvious pictorial quality. The graph for qui'in 'dog' period of development before reaching this stage, but the few examples of writing shown in Figure 3.2 can serve as a good illustration of this sort of development. -
Writing Arabizi: Orthographic Variation in Romanized
WRITING ARABIZI: ORTHOGRAPHIC VARIATION IN ROMANIZED LEBANESE ARABIC ON TWITTER ! ! ! ! Natalie!Sullivan! ! ! ! TC!660H!! Plan!II!Honors!Program! The!University!of!Texas!at!Austin! ! ! ! ! May!4,!2017! ! ! ! ! ! ! ! _______________________________________________________! Barbara!Bullock,!Ph.D.! Department!of!French!&!Italian! Supervising!Professor! ! ! ! ! _______________________________________________________! John!Huehnergard,!Ph.D.! Department!of!Middle!Eastern!Studies! Second!Reader!! ii ABSTRACT Author: Natalie Sullivan Title: Writing Arabizi: Orthographic Variation in Romanized Lebanese Arabic on Twitter Supervising Professors: Dr. Barbara Bullock, Dr. John Huehnergard How does technology influence the script in which a language is written? Over the past few decades, a new form of writing has emerged across the Arab world. Known as Arabizi, it is a type of Romanized Arabic that uses Latin characters instead of Arabic script. It is mainly used by youth in technology-related contexts such as social media and texting, and has made many older Arabic speakers fear that more standard forms of Arabic may be in danger because of its use. Prior work on Arabizi suggests that although it is used frequently on social media, its orthography is not yet standardized (Palfreyman and Khalil, 2003; Abdel-Ghaffar et al., 2011). Therefore, this thesis aimed to examine orthographic variation in Romanized Lebanese Arabic, which has rarely been studied as a Romanized dialect. It was interested in how often Arabizi is used on Twitter in Lebanon and the extent of its orthographic variation. Using Twitter data collected from Beirut, tweets were analyzed to discover the most common orthographic variants in Arabizi for each Arabic letter, as well as the overall rate of Arabizi use. Results show that Arabizi was not used as frequently as hypothesized on Twitter, probably because of its low prestige and increased globalization. -
Section 18.1, Han
The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1.