Chinese Information Processing

Total Page:16

File Type:pdf, Size:1020Kb

Chinese Information Processing UNLV Retrospective Theses & Dissertations 1-1-1995 Chinese information processing Yucheng Liu University of Nevada, Las Vegas Follow this and additional works at: https://digitalscholarship.unlv.edu/rtds Repository Citation Liu, Yucheng, "Chinese information processing" (1995). UNLV Retrospective Theses & Dissertations. 544. http://dx.doi.org/10.25669/azdz-qsik This Thesis is protected by copyright and/or related rights. It has been brought to you by Digital Scholarship@UNLV with permission from the rights-holder(s). You are free to use this Thesis in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/ or on the work itself. This Thesis has been accepted for inclusion in UNLV Retrospective Theses & Dissertations by an authorized administrator of Digital Scholarship@UNLV. For more information, please contact [email protected]. INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films die text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality o f this reproduction is dependent upon the quality o f the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed through, substandard m argins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand comer and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. A Bell & Howell Information Company 300 North Zeeb Road. Ann Arbor. Ml 48106-1346 USA 313/761-4700 800/521-0600 Chinese Information Processing by Yucheng Liu A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science Department of Computer Science University of Nevada, Las Vegas December 1995 UMI Number: 1377645 UMI Microform 1377645 Copyright 1996, by UMI Company. All rights reserved. This microform edition is protected against unauthorized copying under Title 17, United States Code. UMI 300 North Zeeb Road Ann Arbor, MI 48103 The thesis of Yucheng Liu for the degree of M aster of Science in Computer Science is approved. airpersoii^Tanichi Kanai, Ph.D. Examining Committee Member, Thomas A. Nartker, Ph.D. ExaminingIxamining Committ<Committee Member, Kia Makki, Ph.D. Graduate Faculty Representative, Xin Li, Ph.D. Interim D£an of the Graduate College, Cheryl L. Bowles, Ed.D. University of Nevada, Las Vegas December 1995 © 1995 Yucheng Liu All Rights Reserved ABSTRACT A survey of the field of Chinese information processing is provided. It covers the following areas: the Chinese writing system, several popular Chinese encoding schemes and code conversions, Chinese keyboard entry methods, Chinese fonts, Chi­ nese operating systems, basic Chinese computing techniques and applications. Contents 1 Overview of Chinese Information Processing 1 1.1 Introduction.................................................................................................... 1 1.2 Background of Chinese Language ............................................................. 3 1.3 Basic Concepts and Terminology ............................................................. 3 1.3.1 Software Related Concepts ............................................................... 4 1.3.2 Chinese Computing Based Concepts ........................................... 4 1.3.3 Encoding Standards ........................................................................ 5 2 The Chinese Writing System 7 2.1 Roman C h a ra c te rs ...................................................................................... 7 2.2 Symbols and punctuation .......................................................................... 7 2.3 Hanzi C haracter............................................................................................. 8 2.3.1 The Structure of H an zi ..................................................................... 8 2.3.2 Pronunciation ..................................................................................... 12 2.4 Chinese Typefaces, Type Styles and Type S izes ................................... 17 2.4.1 T y p efaces ........................................................................................... 17 2.4.2 Type S ty le s .................... 19 2.4.3 Type Sizes ........................................................................................... 19 2.5 Chinese Text S t y l e ...................................................................................... 20 3 Chinese Character Set Standards and Encoding Methods 21 3.1 Chinese Character Set Standards ............................................................. 21 3.2 Chinese Encoding M ethods ......................................................................... 23 3.2.1 GB 2312-80 Encoding ..................................................................... 23 3.2.2 HZ E n c o d in g ..................................................................................... 26 3.2.3 Big-5 E n co d in g .................................................................................. 27 3.2.4 CNS 11643 E ncoding ........................................................................ 28 3.2.5 CCCII .................... 30 3.2.6 International Encoding M ethods ..................................................... 32 4 Chinese Input 36 4.1 Big Keyboard input M eth o d ....................................................................... 36 4.2 Small Keyboard Input Method ................................................................... 38 4.2.1 Input by Pronunciation .................................................................... 39 4.2.2 Input by S tru c tu re ........................................................................... 40 4.2.3 Input by Encoding V alue ................................................................. 43 4.2.4 Input by Other C r ite r ia ................................................................. 44 4.3 Optical Character Recognition ................................................................... 44 4.3.1 Online/Offline Handwritten and Handprinted C C R ................. 45 4.3.2 Printed Chinese Character Recognition ........................................ 46 iv 4.4 Audio Input M e th o d ..................................................................................... 47 4.5 Chinese Character Dictionaries .................................................................. 47 5 Chinese Output 49 5.1 Chinese Fonts .................................................................................................. 49 5.1.1 Bitmap F o n t ........................................................................................ 50 5.1.2 Vector F onts ........................................................................................ 51 5.1.3 Outline F o n ts ..................................................................................... 51 5.1.4 E valuation ........................................................................................... 52 5.2 Font Generation Methods ........................................................................... 52 5.2.1 Vector Pattern ................................................................................. 52 5.2.2 Dot Matrix P a tte rn ........................................................................... 54 5.2.3 E valuation ........................................................................................... 55 5.3 Font Storage ..................................................................................................... 55 5.4 Printer O u t p u t .............................................................................................. 56 5.5 Screen O u tp u t ................................................................................................. 57 5.6 Chinese T e rm in al........................................................................................... 57 6 Chinese Information Processing Techniques 58 6.1 Chinese Operating Systems ........................................................................ 58 6.1.1 ' Chinese Operating Systems on M S-DOS .................................... 59 6.1.2 Chinese Operating Systems on MS-Windows ............................. 60 6.1.3 Chinese Operating Systems on U nix ............................................. 60 6.1.4 Chinese Operating Systems on Macintosh ................................. 62 6.2 Code C o n v ersio n ........................................................................................... 62 6.2.1 GB <-> HZ C onversion
Recommended publications
  • Nutrition and the Cancer Survivor
    NUTRITION AND THE CANCER SURVIVOR CANCER SURVIVOR SERIES AICR Research Grants 2015 (partial list) CONTENTS Women’s interventional nutrition study (WINS) long- term survival analysis 1 Introduction . 2 Rowan Chlebowski, MD, PhD, Harbor-UCLA Medical Diet and Cancer . 3 Center Weight and Cancer . 4 . Gene-environment interactions among circulating vitamin D levels, vitamin D pathway gene Physical Activity and Cancer . 4 . polymorphisms, BMI and esophageal adenocarcinoma prognosis 2 Adopting a Healthy Lifestyle . 5 David Christiani, MD, PhD, Harvard University Tips for Healthy Eating . .5 . Targeted disruption of cancer cell metabolism and Handle Food Safely . 8 growth through modification of diet quality Barbara Gower, PhD, The University of Alabama at Watch Your Waist . .9 . Birmingham Be Physically Active . 12 A mail- and video-based weight loss trial in breast cancer survivors 3 Evaluating Nutrition Information . 13 Melinda L . Irwin, PhD, Yale University 4 Common Questions . 16 Effects of fish oil on lipid metabolites in breast cancer Greg Kucera, PhD, Wake Forest University Health Should I take supplements? . .16 . Sciences Will a vegetarian diet protect me? . 17 Impact of physical activity on tumor gene expression What about eating only organic foods? . 17 . in women with newly diagnosed breast cancer Jennifer Ligibel, MD, Dana Farber Cancer Institute Are macrobiotic diets advisable? . 18. Impact of resistance training and protein 5 Need More Help? . 19 supplementation on lean muscle mass among childhood cancer survivors About AICR . .22 . Kirsten Ness, PhD, St . Jude’s Children’s Research Hospital About The Continuous Update Project . 22 Pilot study of a metabolic nutritional therapy for the AICR Recommendations for Cancer management of primary brain tumors Prevention .
    [Show full text]
  • Hieroglyphs for the Information Age: Images As a Replacement for Characters for Languages Not Written in the Latin-1 Alphabet Akira Hasegawa
    Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 5-1-1999 Hieroglyphs for the information age: Images as a replacement for characters for languages not written in the Latin-1 alphabet Akira Hasegawa Follow this and additional works at: http://scholarworks.rit.edu/theses Recommended Citation Hasegawa, Akira, "Hieroglyphs for the information age: Images as a replacement for characters for languages not written in the Latin-1 alphabet" (1999). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by the Thesis/Dissertation Collections at RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. Hieroglyphs for the Information Age: Images as a Replacement for Characters for Languages not Written in the Latin- 1 Alphabet by Akira Hasegawa A thesis project submitted in partial fulfillment of the requirements for the degree of Master of Science in the School of Printing Management and Sciences in the College of Imaging Arts and Sciences of the Rochester Institute ofTechnology May, 1999 Thesis Advisor: Professor Frank Romano School of Printing Management and Sciences Rochester Institute ofTechnology Rochester, New York Certificate ofApproval Master's Thesis This is to certify that the Master's Thesis of Akira Hasegawa With a major in Graphic Arts Publishing has been approved by the Thesis Committee as satisfactory for the thesis requirement for the Master ofScience degree at the convocation of May 1999 Thesis Committee: Frank Romano Thesis Advisor Marie Freckleton Gr:lduate Program Coordinator C.
    [Show full text]
  • Edinburgh Research Explorer
    Edinburgh Research Explorer Omega Becomes a Sign Processor Citation for published version: Haralambous, Y & Bella, G 2005, 'Omega Becomes a Sign Processor', TUGboat, vol. 27, no. 0, pp. 99-110. <https://www.tug.org/TUGboat/tb27-0/haralambous.pdf> Link: Link to publication record in Edinburgh Research Explorer Document Version: Publisher's PDF, also known as Version of record Published In: TUGboat General rights Copyright for the publications made accessible via the Edinburgh Research Explorer is retained by the author(s) and / or other copyright owners and it is a condition of accessing these publications that users recognise and abide by the legal requirements associated with these rights. Take down policy The University of Edinburgh has made every reasonable effort to ensure that Edinburgh Research Explorer content complies with UK legislation. If you believe that the public display of this file breaches copyright please contact [email protected] providing details, and we will remove access to the work immediately and investigate your claim. Download date: 26. Sep. 2021 Proceedings EuroTEX2005 – Pont-à-Mousson, France MOT02 Omega Becomes a Sign Processor Yannis Haralambous ENST Bretagne [email protected] http://omega.enstb.org/yannis G´abor Bella ENST Bretagne [email protected] Characters and Glyphs not one but four equivalence classes of shapes: ara- bic initial letter jeem, arabic medial letter The distinction between “characters” and “glyphs” jeem, and so on. But are these “characters”? is a rather new issue in computing, although the Answering to this question requires a pragmatic problem is as old as humanity: our species turns out approach.
    [Show full text]
  • Consonant Characters and Inherent Vowels
    Global Design: Characters, Language, and More Richard Ishida W3C Internationalization Activity Lead Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 1 Getting more information W3C Internationalization Activity http://www.w3.org/International/ Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 2 Outline Character encoding: What's that all about? Characters: What do I need to do? Characters: Using escapes Language: Two types of declaration Language: The new language tag values Text size Navigating to localized pages Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 3 Character encoding Character encoding: What's that all about? Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 4 Character encoding The Enigma Photo by David Blaikie Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 5 Character encoding Berber 4,000 BC Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 6 Character encoding Tifinagh http://www.dailymotion.com/video/x1rh6m_tifinagh_creation Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 7 Character encoding Character set Character set ⴰ ⴱ ⴲ ⴳ ⴴ ⴵ ⴶ ⴷ ⴸ ⴹ ⴺ ⴻ ⴼ ⴽ ⴾ ⴿ ⵀ ⵁ ⵂ ⵃ ⵄ ⵅ ⵆ ⵇ ⵈ ⵉ ⵊ ⵋ ⵌ ⵍ ⵎ ⵏ ⵐ ⵑ ⵒ ⵓ ⵔ ⵕ ⵖ ⵗ ⵘ ⵙ ⵚ ⵛ ⵜ ⵝ ⵞ ⵟ ⵠ ⵢ ⵣ ⵤ ⵥ ⵯ Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 8 Character encoding Coded character set 0 1 2 3 0 1 Coded character set 2 3 4 5 6 7 8 9 33 (hexadecimal) A B 52 (decimal) C D E F Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 9 Character encoding Code pages ASCII Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 10 Character encoding Code pages ISO 8859-1 (Latin 1) Western Europe ç (E7) Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 11 Character encoding Code pages ISO 8859-7 Greek η (E7) Copyright © 2005 W3C (MIT, ERCIM, Keio) slide 12 Character encoding Double-byte characters Standard Country No.
    [Show full text]
  • SUPPORTING the CHINESE, JAPANESE, and KOREAN LANGUAGES in the OPENVMS OPERATING SYSTEM by Michael M. T. Yau ABSTRACT the Asian L
    SUPPORTING THE CHINESE, JAPANESE, AND KOREAN LANGUAGES IN THE OPENVMS OPERATING SYSTEM By Michael M. T. Yau ABSTRACT The Asian language versions of the OpenVMS operating system allow Asian-speaking users to interact with the OpenVMS system in their native languages and provide a platform for developing Asian applications. Since the OpenVMS variants must be able to handle multibyte character sets, the requirements for the internal representation, input, and output differ considerably from those for the standard English version. A review of the Japanese, Chinese, and Korean writing systems and character set standards provides the context for a discussion of the features of the Asian OpenVMS variants. The localization approach adopted in developing these Asian variants was shaped by business and engineering constraints; issues related to this approach are presented. INTRODUCTION The OpenVMS operating system was designed in an era when English was the only language supported in computer systems. The Digital Command Language (DCL) commands and utilities, system help and message texts, run-time libraries and system services, and names of system objects such as file names and user names all assume English text encoded in the 7-bit American Standard Code for Information Interchange (ASCII) character set. As Digital's business began to expand into markets where common end users are non-English speaking, the requirement for the OpenVMS system to support languages other than English became inevitable. In contrast to the migration to support single-byte, 8-bit European characters, OpenVMS localization efforts to support the Asian languages, namely Japanese, Chinese, and Korean, must deal with a more complex issue, i.e., the handling of multibyte character sets.
    [Show full text]
  • Title the Practice of Basic Informatics 2019 Author(S) Kita, Hajime
    Title The Practice of Basic Informatics 2019 Kita, Hajime; Kitamura, Yumi; Hioki, Hirohisa; Sakai, Author(s) Hiroyuki; Lin, Donghui Citation (2020): 1-196 Issue Date 2020-03-08 URL http://hdl.handle.net/2433/246166 This book is licensed under CC-BY-NC-ND. For detail, access Right the following: https://creativecommons.org/licenses/by-nc- nd/4.0/deed.en Type Learning Material Textversion publisher Kyoto University The Practice of Basic Informatics 2019 Hajime Kita, Institute for Liberal Arts and Sciences, Yumi Kitamura, Kyoto University Library, Hirohisa Hioki, Graduate School of Human and Environmental Studies, Hiroyuki Sakai, Center for the Promotion of Excellence in Higher Education, Donghui Lin, Graduate School of Informatics Kyoto University Version 2020/03/08 0. Foreword Table of Contents 0. Foreword Kyoto University provides courses on ‘The Practice of Basic Informatics’ as part of its Liberal Arts and Sciences Program. The course is taught at many schools and departments, and course contents vary to meet the requirements of these schools and departments. This textbook is made open to the students of all schools that teach these courses. As stated in Chapter 1, this book is written with the aim of building ICT skills for study at university, that is, ICT skills for academic activities. Some topics may not be taught in class. However, the book is written for self-study by students. We include many exercises in this textbook so that instructors can select some of them for their classes, to accompany their teaching plans. The courses are given at the computer laboratories of the university, and the contents of this textbook assume that Windows 10 and Microsoft Office 2016 are available in these laboratories.
    [Show full text]
  • Writing As Aesthetic in Modern and Contemporary Japanese-Language Literature
    At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2021 Reading Committee: Edward Mack, Chair Davinder Bhowmik Zev Handel Jeffrey Todd Knight Program Authorized to Offer Degree: Asian Languages and Literature ©Copyright 2021 Christopher J Lowy University of Washington Abstract At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy Chair of the Supervisory Committee: Edward Mack Department of Asian Languages and Literature This dissertation examines the dynamic relationship between written language and literary fiction in modern and contemporary Japanese-language literature. I analyze how script and narration come together to function as a site of expression, and how they connect to questions of visuality, textuality, and materiality. Informed by work from the field of textual humanities, my project brings together new philological approaches to visual aspects of text in literature written in the Japanese script. Because research in English on the visual textuality of Japanese-language literature is scant, my work serves as a fundamental first-step in creating a new area of critical interest by establishing key terms and a general theoretical framework from which to approach the topic. Chapter One establishes the scope of my project and the vocabulary necessary for an analysis of script relative to narrative content; Chapter Two looks at one author’s relationship with written language; and Chapters Three and Four apply the concepts explored in Chapter One to a variety of modern and contemporary literary texts where script plays a central role.
    [Show full text]
  • Legacy Character Sets & Encodings
    Legacy & Not-So-Legacy Character Sets & Encodings Ken Lunde CJKV Type Development Adobe Systems Incorporated bc ftp://ftp.oreilly.com/pub/examples/nutshell/cjkv/unicode/iuc15-tb1-slides.pdf Tutorial Overview dc • What is a character set? What is an encoding? • How are character sets and encodings different? • Legacy character sets. • Non-legacy character sets. • Legacy encodings. • How does Unicode fit it? • Code conversion issues. • Disclaimer: The focus of this tutorial is primarily on Asian (CJKV) issues, which tend to be complex from a character set and encoding standpoint. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations dc • GB (China) — Stands for “Guo Biao” (国标 guóbiâo ). — Short for “Guojia Biaozhun” (国家标准 guójiâ biâozhün). — Means “National Standard.” • GB/T (China) — “T” stands for “Tui” (推 tuî ). — Short for “Tuijian” (推荐 tuîjiàn ). — “T” means “Recommended.” • CNS (Taiwan) — 中國國家標準 ( zhôngguó guójiâ biâozhün) in Chinese. — Abbreviation for “Chinese National Standard.” 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations (Cont’d) dc • GCCS (Hong Kong) — Abbreviation for “Government Chinese Character Set.” • JIS (Japan) — 日本工業規格 ( nihon kôgyô kikaku) in Japanese. — Abbreviation for “Japanese Industrial Standard.” — 〄 • KS (Korea) — 한국 공업 규격 (韓國工業規格 hangug gongeob gyugyeog) in Korean. — Abbreviation for “Korean Standard.” — ㉿ — Designation change from “C” to “X” on August 20, 1997. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated Terminology & Abbreviations (Cont’d) dc • TCVN (Vietnam) — Tiu Chun Vit Nam in Vietnamese. — Means “Vietnamese Standard.” • CJKV — Chinese, Japanese, Korean, and Vietnamese. 15th International Unicode Conference Copyright © 1999 Adobe Systems Incorporated What Is A Character Set? dc • A collection of characters that are intended to be used together to create meaningful text.
    [Show full text]
  • Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding A
    Basis Technology Unicode対応ライブラリ スペックシート 文字コード その他の名称 Adobe-Standard-Encoding Adobe-Symbol-Encoding csHPPSMath Adobe-Zapf-Dingbats-Encoding csZapfDingbats Arabic ISO-8859-6, csISOLatinArabic, iso-ir-127, ECMA-114, ASMO-708 ASCII US-ASCII, ANSI_X3.4-1968, iso-ir-6, ANSI_X3.4-1986, ISO646-US, us, IBM367, csASCI big-endian ISO-10646-UCS-2, BigEndian, 68k, PowerPC, Mac, Macintosh Big5 csBig5, cn-big5, x-x-big5 Big5Plus Big5+, csBig5Plus BMP ISO-10646-UCS-2, BMPstring CCSID-1027 csCCSID1027, IBM1027 CCSID-1047 csCCSID1047, IBM1047 CCSID-290 csCCSID290, CCSID290, IBM290 CCSID-300 csCCSID300, CCSID300, IBM300 CCSID-930 csCCSID930, CCSID930, IBM930 CCSID-935 csCCSID935, CCSID935, IBM935 CCSID-937 csCCSID937, CCSID937, IBM937 CCSID-939 csCCSID939, CCSID939, IBM939 CCSID-942 csCCSID942, CCSID942, IBM942 ChineseAutoDetect csChineseAutoDetect: Candidate encodings: GB2312, Big5, GB18030, UTF32:UTF8, UCS2, UTF32 EUC-H, csCNS11643EUC, EUC-TW, TW-EUC, H-EUC, CNS-11643-1992, EUC-H-1992, csCNS11643-1992-EUC, EUC-TW-1992, CNS-11643 TW-EUC-1992, H-EUC-1992 CNS-11643-1986 EUC-H-1986, csCNS11643_1986_EUC, EUC-TW-1986, TW-EUC-1986, H-EUC-1986 CP10000 csCP10000, windows-10000 CP10001 csCP10001, windows-10001 CP10002 csCP10002, windows-10002 CP10003 csCP10003, windows-10003 CP10004 csCP10004, windows-10004 CP10005 csCP10005, windows-10005 CP10006 csCP10006, windows-10006 CP10007 csCP10007, windows-10007 CP10008 csCP10008, windows-10008 CP10010 csCP10010, windows-10010 CP10017 csCP10017, windows-10017 CP10029 csCP10029, windows-10029 CP10079 csCP10079, windows-10079
    [Show full text]
  • Japanese Language and Culture
    NIHONGO History of Japanese Language Many linguistic experts have found that there is no specific evidence linking Japanese to a single family of language. The most prominent theory says that it stems from the Altaic family(Korean, Mongolian, Tungusic, Turkish) The transition from old Japanese to Modern Japanese took place from about the 12th century to the 16th century. Sentence Structure Japanese: Tanaka-san ga piza o tabemasu. (Subject) (Object) (Verb) 田中さんが ピザを 食べます。 English: Mr. Tanaka eats a pizza. (Subject) (Verb) (Object) Where is the subject? I go to Tokyo. Japanese translation: (私が)東京に行きます。 [Watashi ga] Toukyou ni ikimasu. (Lit. Going to Tokyo.) “I” or “We” are often omitted. Hiragana, Katakana & Kanji Three types of characters are used in Japanese: Hiragana, Katakana & Kanji(Chinese characters). Mr. Tanaka goes to Canada: 田中さんはカナダに行きます [kanji][hiragana][kataka na][hiragana][kanji] [hiragana]b Two Speech Styles Distal-Style: Semi-Polite style, can be used to anyone other than family members/close friends. Direct-Style: Casual & blunt, can be used among family members and friends. In-Group/Out-Group Semi-Polite Style for Out-Group/Strangers I/We Direct-Style for Me/Us Polite Expressions Distal-Style: 1. Regular Speech 2. Ikimasu(he/I go) Honorific Speech 3. Irasshaimasu(he goes) Humble Speech Mairimasu(I/We go) Siblings: Age Matters Older Brother & Older Sister Ani & Ane 兄 と 姉 Younger Brother & Younger Sister Otooto & Imooto 弟 と 妹 My Family/Your Family My father: chichi父 Your father: otoosan My mother: haha母 お父さん My older brother: ani Your mother: okaasan お母さん Your older brother: oniisanお兄 兄 さん My older sister: ane姉 Your older sister: oneesan My younger brother: お姉さ otooto弟 ん Your younger brother: My younger sister: otootosan弟さん imooto妹 Your younger sister: imootosan 妹さん Boy Speech & Girl Speech blunt polite I/Me = watashi, boku, ore, I/Me = watashi, washi watakushi I am going = Boku iku.僕行 I am going = Watashi iku く。 wa.
    [Show full text]
  • AIX Globalization
    AIX Version 7.1 AIX globalization IBM Note Before using this information and the product it supports, read the information in “Notices” on page 233 . This edition applies to AIX Version 7.1 and to all subsequent releases and modifications until otherwise indicated in new editions. © Copyright International Business Machines Corporation 2010, 2018. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents About this document............................................................................................vii Highlighting.................................................................................................................................................vii Case-sensitivity in AIX................................................................................................................................vii ISO 9000.....................................................................................................................................................vii AIX globalization...................................................................................................1 What's new...................................................................................................................................................1 Separation of messages from programs..................................................................................................... 1 Conversion between code sets.............................................................................................................
    [Show full text]
  • UC Berkeley Dissertations, Department of Linguistics
    UC Berkeley Dissertations, Department of Linguistics Title Relationship between Perceptual Accuracy and Information Measures: A cross-linguistic Study Permalink https://escholarship.org/uc/item/7tx5t8bt Author Kang, Shinae Publication Date 2015 eScholarship.org Powered by the California Digital Library University of California Relationship between perceptual accuracy and information measures: A cross-linguistic study by Shinae Kang A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Keith A. Johnson, Chair Professor Sharon Inkelas Professor Susan S. Lin Professor Robert T. Knight Fall 2015 Relationship between perceptual accuracy and information measures: A cross-linguistic study Copyright 2015 by Shinae Kang 1 Abstract Relationship between perceptual accuracy and information measures: A cross-linguistic study by Shinae Kang Doctor of Philosophy in Linguistics University of California, Berkeley Professor Keith A. Johnson, Chair The current dissertation studies how the information conveyed by different speech el- ements of English, Japanese and Korean correlates with perceptual accuracy. Two well- established information measures are used: weighted negative contextual predictability (in- formativity) of a speech element; and the contribution of a speech element to syllable differ- entiation, or functional load. This dissertation finds that the correlation between information and perceptual accuracy differs depending on both the type of information measure and the language of the listener. To compute the information measures, Chapter 2 introduces a new corpus consisting of all the possible syllables for each of the three languages. The chapter shows that the two information measures are inversely correlated.
    [Show full text]