Multiple Indexing in an Electronic Kanji Dictionary James BREEN Monash University Clayton 3800, Australia [email protected]

Total Page:16

File Type:pdf, Size:1020Kb

Multiple Indexing in an Electronic Kanji Dictionary James BREEN Monash University Clayton 3800, Australia Jwb@Csse.Monash.Edu.Au Multiple Indexing in an Electronic Kanji Dictionary James BREEN Monash University Clayton 3800, Australia [email protected] Abstract and contain such information as the classification of the character according to Kanji dictionaries, which need to present a shape, usage, components, etc., the large number of complex characters in an pronunciation or reading of the character, order that makes them accessible by users, variants of the character, the meaning or traditionally use several indexing techniques semantic application of the character, and that are particularly suited to the printed often a selection of words demonstrating the medium. Electronic dictionary technology use of the character in the language's provides the opportunity of both introducing orthography. These dictionaries are usually new indexing techniques that are not ordered on some visual characteristic of the feasible with printed dictionaries, and also characters. allowing a wide range of index methods with each dictionary. It also allows A typical learner of Japanese needs to have dictionaries to be interfaced at the character both forms of dictionary, and the process of level with documents and applications, thus "looking up" an unknown word often removing much of the requirement for involves initially using the character complex index methods. This paper surveys dictionary to determine the pronunciation of the traditional indexing methods, introduces one or more of the characters, then using some of the new indexing techniques that that pronunciation as an index to a word have become available with electronic kanji dictionary, in a process that can be time- dictionaries, and reports on an analysis of consuming and error-prone. the index usage patterns in a major WWW- The advent of electronic dictionaries has had based electronic kanji dictionary. This is a considerable impact on Japanese believed to be the first such analysis dictionary usage: conducted and reported. a. it has facilitated the integration or 1 Introduction association of character and word Unlike languages written in alphabetic, dictionaries such that a user can index syllabic or similar scripts, languages such as between them in a relatively straightforward Japanese and Chinese, which are written manner. This integration was pioneered by using a large number of characters: hanzi in the author in the early 1990s (Breen, 1995), Chinese, kanji in Japanese, require two and is now a common feature of almost all distinct sets of dictionaries. These are: hand-held electronic Japanese dictionaries and PC-based dictionary packages; a. the traditional "word" dictionaries, as used in most recorded languages. Such b. it has allowed the direct transfer of dictionaries are usually ordered in some words between text documents and recognized phonetic sequence, and typically dictionary software, thus removing the include the pronunciation or reading of the often-laborious character identification; word as well as the usual dictionary c. for kanji dictionaries, it has greatly components: part-of-speech, explanation. increased the number of character indexing etc. methods that can effectively be used, and b. character dictionaries, which has also provided the opportunity for new typically have an entry for each character, indexing methods that are not available to radical (札, 朷, 李, 松, etc.), with the traditional paper dictionaries. grouped kanji ordered by the number of This paper will concentrate on the issues strokes in the remainder of the kanji. Radical associated with Japanese kanji dictionaries. systems have been used in Chinese character Many of these also apply to Chinese. dictionaries for nearly 2,000 years, and the dominant 214-radical system was first used 2 Indexing a Kanji Dictionary in the 康煕字典 (kangxi zidian) published in The general problem confronting the 1716. publication of kanji dictionaries is the large Virtually all major kanji dictionaries number of kanji in use and the absence of an published in Japan use the radical indexing intrinsic and recognized lexical order for method as the primary index, as do a kanji. In the post-war educational reforms in number of dictionaries published elsewhere. Japan, the number of kanji taught in schools Some dictionaries use modified or reduced was restricted to a basic 1,850, which has sets of radicals. The technique is not simple now been increased to 1,945. This set of to use, and some skill and practice is kanji, along with a small set designated for required in correctly identifying the radical use in personal names, accounts for all but a and counting the residual strokes. The small proportion of kanji usage in modern difficulty has been compounded by recent Japanese. Many dictionaries and similar simplifications of the glyphs of the kanji, reference books compiled for students are which in some cases have modified or based on this set (Sakade, 1961; Henshall, eliminated the radical. 1988; Halpern, 1999; etc.). The main computer character-set standard used in There are a number of other techniques used Japan, JIS X 0208 (JIS, 1997), which for indexing kanji in a dictionary: extends to less-common kanji including a. reading. The reading or those used in places-names, has 6,355 kanji. pronunciation of a kanji is a common and This set is the basis for several kanji useful method of identification, and virtually dictionaries (Nelson, 1997; Spahn & all kanji dictionaries have a separate Hadamitzky, 1996), while larger sets of reading/kanji index. The reading cannot be kanji are covered in many dictionaries, e.g. used effectively as the primary index, as in the Kodansha Daijiten (Ueda, 1963) has Japanese each kanji usually has two sets of 14,900 kanji and the 13-volume Morohashi readings, and some kanji have as many as Daikanwajiten (Morohashi, 1989) has over fifteen distinct readings. 45,000 kanji. b. shape/stroke. A number of In this paper, the term "primary index" has techniques have been used to decompose the been used for the method of ordering the shape of a kanji according to coded patterns. kanji entries, and "secondary index" has One, which was popular in China, is the been used for cross-reference lists of kanji Four-Corner code, which allocates a number based on alternative ordering systems. (0-9) to the pattern of strokes at each corner The major traditional indexing technique for of the character, leading to a four-digit kanji and hanzi dictionaries has been the index. Another method, which is quite radical system (bushu in Japanese), based on popular, is the SKIP (System of Kanji 214 elements plus about 150 variants. These Indexing by Patterns) used by Jack Halpern elements are graphic components of the in his kanji dictionaries (Halpern, 1990, character that occur frequently enough to be 1999). In this, a kanji is typically divided used for indexing purposes. For example, into two portions, and a code constructed the kanji 村 (mura: village) is identified by from the division type and the stroke-counts the 木 radical, and in a dictionary would be in the portions. Thus 村 has a SKIP code of grouped with other kanji identified by that 1-4-3, indicating a vertical division into four the emergence of dictionaries with these as and three-stroke portions. the primary index. The Sanseido Unicode Kanji Information Dictionary (Tanaka, c. school grade. In Japan the kanji to 2000) uses the Unicode code-point as the be taught in each grade of elementary school primary index, and the first edition of the are prescribed, and some references either JIS Kanji Dictionary (Shibano, 1997) used organize kanji in those groupings or provide the JIS X 0208 code-point. It is interesting a secondary index of grades. to note that the second edition (Shibano, d. stroke count. The number of pen or 2002) changed to the traditional radical brush strokes making up a kanji, ranging system, with the codepoints being relegated from one to over forty, can be an effective to a secondary index. indexing technique, particularly for the A summary of the indices available in a simpler kanji. Some dictionaries employ a selection of dictionaries and references is in secondary index using the total number of Table 1. The "P" indicates the primary index strokes in a kanji. and an "S" indicates a secondary index. (The e. frequency. The ranking of kanji original Nelson uses a slightly modified according to frequency-of-use can be a version of the traditional radical index, and useful secondary index, especially for the the Spahn & Hadamitzky Kanji Dictionary commonly used kanji. uses a simplified 79-radical system. f. code-point. The standardization of character set code-points for kanji has led to Index Type Dictionary Radical Shape Code-point Grade Reading Frequency Stroke Order Morohashi (1989) P S Ueda (1963) P S Nelson (1974) P* S S S Nelson (1997) P S S S&H (1996) P* S S&H (1997) S S P S Halpern (1990) S P S S S Halpern (1999) S P S S Sakade (1961) P S Henshall (1988) P S S Shibano (1997) P S Shibano (2002) P S S Tanaka (2000) S P Table 1: Index Types in Printed Kanji Dictionaries. 3 Electronic Kanji Dictionaries indexing methods available, and in particular have navigational advantages over As mentioned above, electronic kanji traditional paper dictionaries: dictionaries have an increased number of a. the concept of a "secondary" index c. the above-mentioned capability to no longer applies, as every index is capable index directly to a kanji entry from a kanji of linking directly to the kanji entries; selected from a text or application; b. dictionary users can choose flexibly d. suitable GUIs can enhance the kanji between index methods according to lookup process by providing visual cues and preference, and can select a method a degree of interactivity. appropriate to the characteristics of an Figures 1 and 2 show the GUIs for the bushu individual kanji; and SKIP methods in the kanji dictionary module of the JWPce word-processor (Rosenthal, 2002).
Recommended publications
  • Man'yogana.Pdf (574.0Kb)
    Bulletin of the School of Oriental and African Studies http://journals.cambridge.org/BSO Additional services for Bulletin of the School of Oriental and African Studies: Email alerts: Click here Subscriptions: Click here Commercial reprints: Click here Terms of use : Click here The origin of man'yogana John R. BENTLEY Bulletin of the School of Oriental and African Studies / Volume 64 / Issue 01 / February 2001, pp 59 ­ 73 DOI: 10.1017/S0041977X01000040, Published online: 18 April 2001 Link to this article: http://journals.cambridge.org/abstract_S0041977X01000040 How to cite this article: John R. BENTLEY (2001). The origin of man'yogana. Bulletin of the School of Oriental and African Studies, 64, pp 59­73 doi:10.1017/S0041977X01000040 Request Permissions : Click here Downloaded from http://journals.cambridge.org/BSO, IP address: 131.156.159.213 on 05 Mar 2013 The origin of man'yo:gana1 . Northern Illinois University 1. Introduction2 The origin of man'yo:gana, the phonetic writing system used by the Japanese who originally had no script, is shrouded in mystery and myth. There is even a tradition that prior to the importation of Chinese script, the Japanese had a native script of their own, known as jindai moji ( , age of the gods script). Christopher Seeley (1991: 3) suggests that by the late thirteenth century, Shoku nihongi, a compilation of various earlier commentaries on Nihon shoki (Japan's first official historical record, 720 ..), circulated the idea that Yamato3 had written script from the age of the gods, a mythical period when the deity Susanoo was believed by the Japanese court to have composed Japan's first poem, and the Sun goddess declared her son would rule the land below.
    [Show full text]
  • Introduction to Kanbun: W4019x (Fall 2004) Mondays and Wednesdays 11-12:15, Kress Room, Starr Library (Entrance on 200 Level)
    1 Introduction to Kanbun: W4019x (Fall 2004) Mondays and Wednesdays 11-12:15, Kress Room, Starr Library (entrance on 200 Level) David Lurie (212-854-5034, [email protected]) Office Hours: Tuesdays 2-4, 500A Kent Hall This class is intended to build proficiency in reading the variety of classical Japanese written styles subsumed under the broad term kanbun []. More specifically, it aims to foster familiarity with kundoku [], a collection of techniques for reading and writing classical Japanese in texts largely or entirely composed of Chinese characters. Because it is impossible to achieve fluent reading ability using these techniques in a mere semester, this class is intended as an introduction. Students will gain familiarity with a toolbox of reading strategies as they are exposed to a variety of premodern written styles and genres, laying groundwork for more thorough competency in specific areas relevant to their research. This is not a class in Classical Chinese: those who desire facility with Chinese classical texts are urged to study Classical Chinese itself. (On the other hand, some prior familiarity with Classical Chinese will make much of this class easier). The pre-requisite for this class is Introduction to Classical Japanese (W4007); because our focus is the use of Classical Japanese as a tool to understand character-based texts, it is assumed that students will already have control of basic Classical Japanese grammar. Students with concerns about their competence should discuss them with me immediately. Goals of the course: 1) Acquire basic familiarity with kundoku techniques of reading, with a focus on the classes of special characters (unread characters, twice-read characters, negations, etc.) that form the bulk of traditional Japanese kanbun pedagogy.
    [Show full text]
  • Handy Katakana Workbook.Pdf
    First Edition HANDY KATAKANA WORKBOOK An Introduction to Japanese Writing: KANA THIS IS A SUPPLEMENT FOR BEGINNING LEVEL JAPANESE LANGUAGE INSTRUCTION. \ FrF!' '---~---- , - Y. M. Shimazu, Ed.D. -----~---- TABLE OF CONTENTS Page Introduction vi ACKNOWLEDGEMENlS vii STUDYSHEET#l 1 A,I,U,E, 0, KA,I<I, KU,KE, KO, GA,GI,GU,GE,GO, N WORKSHEET #1 2 PRACTICE: A, I,U, E, 0, KA,KI, KU,KE, KO, GA,GI,GU, GE,GO, N WORKSHEET #2 3 MORE PRACTICE: A, I, U, E,0, KA,KI,KU, KE, KO, GA,GI,GU,GE,GO, N WORKSHEET #~3 4 ADDmONAL PRACTICE: A,I,U, E,0, KA,KI, KU,KE, KO, GA,GI,GU,GE,GO, N STUDYSHEET #2 5 SA,SHI,SU,SE, SO, ZA,JI,ZU,ZE,ZO, TA, CHI, TSU, TE,TO, DA, DE,DO WORI<SHEEI' #4 6 PRACTICE: SA,SHI,SU,SE, SO, ZA,II, ZU,ZE,ZO, TA, CHI, 'lSU,TE,TO, OA, DE,DO WORI<SHEEI' #5 7 MORE PRACTICE: SA,SHI,SU,SE,SO, ZA,II, ZU,ZE, W, TA, CHI, TSU, TE,TO, DA, DE,DO WORKSHEET #6 8 ADDmONAL PRACI'ICE: SA,SHI,SU,SE, SO, ZA,JI, ZU,ZE,ZO, TA, CHI,TSU,TE,TO, DA, DE,DO STUDYSHEET #3 9 NA,NI, NU,NE,NO, HA, HI,FU,HE, HO, BA, BI,BU,BE,BO, PA, PI,PU,PE,PO WORKSHEET #7 10 PRACTICE: NA,NI, NU, NE,NO, HA, HI,FU,HE,HO, BA,BI, BU,BE, BO, PA, PI,PU,PE,PO WORKSHEET #8 11 MORE PRACTICE: NA,NI, NU,NE,NO, HA,HI, FU,HE, HO, BA,BI,BU,BE, BO, PA,PI,PU,PE,PO WORKSHEET #9 12 ADDmONAL PRACTICE: NA,NI, NU, NE,NO, HA, HI, FU,HE, HO, BA,BI,3U, BE, BO, PA, PI,PU,PE,PO STUDYSHEET #4 13 MA, MI,MU, ME, MO, YA, W, YO WORKSHEET#10 14 PRACTICE: MA,MI, MU,ME, MO, YA, W, YO WORKSHEET #11 15 MORE PRACTICE: MA, MI,MU,ME,MO, YA, W, YO WORKSHEET #12 16 ADDmONAL PRACTICE: MA,MI,MU, ME, MO, YA, W, YO STUDYSHEET #5 17
    [Show full text]
  • Writing As Aesthetic in Modern and Contemporary Japanese-Language Literature
    At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2021 Reading Committee: Edward Mack, Chair Davinder Bhowmik Zev Handel Jeffrey Todd Knight Program Authorized to Offer Degree: Asian Languages and Literature ©Copyright 2021 Christopher J Lowy University of Washington Abstract At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy Chair of the Supervisory Committee: Edward Mack Department of Asian Languages and Literature This dissertation examines the dynamic relationship between written language and literary fiction in modern and contemporary Japanese-language literature. I analyze how script and narration come together to function as a site of expression, and how they connect to questions of visuality, textuality, and materiality. Informed by work from the field of textual humanities, my project brings together new philological approaches to visual aspects of text in literature written in the Japanese script. Because research in English on the visual textuality of Japanese-language literature is scant, my work serves as a fundamental first-step in creating a new area of critical interest by establishing key terms and a general theoretical framework from which to approach the topic. Chapter One establishes the scope of my project and the vocabulary necessary for an analysis of script relative to narrative content; Chapter Two looks at one author’s relationship with written language; and Chapters Three and Four apply the concepts explored in Chapter One to a variety of modern and contemporary literary texts where script plays a central role.
    [Show full text]
  • Developmental Reading Disorders in Japan— Prevalence, Profiles, and Possible Mechanisms
    Top Lang Disorders Vol. 34, No. 2, pp. 121–132 Copyright c 2014 Wolters Kluwer Health | Lippincott Williams & Wilkins Developmental Reading Disorders in Japan— Prevalence, Profiles, and Possible Mechanisms Yumiko Tanaka Welty, Lise Menn, and Noriko Oishi Japan has been considered dyslexia-free because of the nature of the orthography, which con- sists of the visually simple kana syllabary and some thousands of visually complex, logographic kanji characters. It is true that few children struggle with learning kana, which provide consistent mappings between symbols and their pronunciation. Indeed, most children can read most of the kana by age 6. However, many Japanese children struggle with reading the kanji, which represent most of the content words in a text; in addition to their visual complexity and impoverished or nonexistent phonological information, kanji are difficult because they typically have several pronunciations and multiple meanings, depending on the context. Because kanji must be learned semantically rather than phonologically, many people believe that Japanese dyslexia is due to visuospatial rather than phonological processing impairments. We sketch the complex psycholin- guistic demands of retrieving the correct pronunciations for kanji, especially in kanji compound words. Some individuals have extreme difficulty in learning the correspondences between these symbols and their sounds; whether these difficulties are visual, phonological, or both is an urgent topic for further research. After introducing Japanese orthography, we present 2 case studies. The first is a profile of a boy we observed from ages 7 to 20 years with difficulties in learning both kana and kanji. The second is a case study of using an interactive reading intervention for a fifth- grade boy with dyslexia.
    [Show full text]
  • AIX Globalization
    AIX Version 7.1 AIX globalization IBM Note Before using this information and the product it supports, read the information in “Notices” on page 233 . This edition applies to AIX Version 7.1 and to all subsequent releases and modifications until otherwise indicated in new editions. © Copyright International Business Machines Corporation 2010, 2018. US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. Contents About this document............................................................................................vii Highlighting.................................................................................................................................................vii Case-sensitivity in AIX................................................................................................................................vii ISO 9000.....................................................................................................................................................vii AIX globalization...................................................................................................1 What's new...................................................................................................................................................1 Separation of messages from programs..................................................................................................... 1 Conversion between code sets.............................................................................................................
    [Show full text]
  • International Language Environments Guide
    International Language Environments Guide Sun Microsystems, Inc. 4150 Network Circle Santa Clara, CA 95054 U.S.A. Part No: 806–6642–10 May, 2002 Copyright 2002 Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, docs.sun.com, AnswerBook, AnswerBook2, Java, XView, ToolTalk, Solstice AdminTools, SunVideo and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. SunOS, Solaris, X11, SPARC, UNIX, PostScript, OpenWindows, AnswerBook, SunExpress, SPARCprinter, JumpStart, Xlib The OPEN LOOK and Sun™ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry.
    [Show full text]
  • Degemination in Japanese Loanwords from Italian
    DEGEMINATION IN JAPANESE LOANWORDS FROM ITALIAN Maho Morimoto University of California, Santa Cruz In Japanese native phonology, geminate consonants are contrastive (as in [kata] ‘shoulder’ vs. [katta] ‘win-PAST’), but geminates in loanwords can have differing sources and motivations (see Kubozono, Itô, Mester 2009, Kawagoe 2015, and references cited therein): we see gemination of singletons in loanwords from English, in which consonant length is not distinctive ([kæt]Eng ‘cat’ → [kjatto]Jp), whereas we see geminate-preservation in loanwords from Italian ([espresso]It ‘espresso’ → [esupuresso]Jp), in which the length of most consonants is contrastive. In loanwords from Italian, however, not all geminates are preserved. This research addresses the cases of degemination, and captures the pattern as stress-based neutralization of consonant length within the framework of Optimality Theory (Prince & Smolensky 1993). 1. THE PUZZLE In loanwords from Italian, as pointed out by Tanaka (2007), the preservation rate for geminates that belong to the penultimate or antepenultimate syllable is higher compared to the other positions within a word. This positional effect on degemination is present in loanwords that include more than one geminate within a word, illustrated below (capital letters indicate the first half of long consonants and the acute accent mark signals stress in Italian and pitch accent in Japanese): (1) Italian source Japanese loan a. zuK.kóT.to → zu.kóT.to zuccotto (a type of cake) b. oreK.kjéT.te → o.re.ki.éT.te orecchiette (a type of pasta) c. kaF.feL.láT.te → ka.fe.ráT.te caffè latte ‘caffè latte’ 2. THE PROPOSAL Drawing attention to the fact that the penultimate and the antepenultimate syllables are the most common locations for Italian noun stress and Japanese pitch accent for loanwords to be assigned to, the current analysis views this positional effect on degemination as stress-based positional neutralization.
    [Show full text]
  • Introduction to the Special Issue on Japanese Geminate Obstruents
    J East Asian Linguist (2013) 22:303-306 DOI 10.1007/s10831-013-9109-z Introduction to the special issue on Japanese geminate obstruents Haruo Kubozono Received: 8 January 2013 / Accepted: 22 January 2013 / Published online: 6 July 2013 © Springer Science+Business Media Dordrecht 2013 Geminate obstruents (GOs) and so-called unaccented words are the two properties most characteristic of Japanese phonology and the two features that are most difficult to learn for foreign learners of Japanese, regardless of their native language. This special issue deals with the first of these features, discussing what makes GOs so difficult to master, what is so special about them, and what makes the research thereon so interesting. GOs are one of the two types of geminate consonant in Japanese1 which roughly corresponds to what is called ‘sokuon’ (促音). ‘Sokon’ is defined as a ‘one-mora- long silence’ (Sanseido Daijirin Dictionary), often symbolized as /Q/ in Japanese linguistics, and is transcribed with a small letter corresponding to /tu/ (っ or ッ)in Japanese orthography. Its presence or absence is distinctive in Japanese phonology as exemplified by many pairs of words, including the following (dots /. / indicate syllable boundaries). (1) sa.ki ‘point’ vs. sak.ki ‘a short time ago’ ka.ko ‘past’ vs. kak.ko ‘paranthesis’ ba.gu ‘bug (in computer)’ vs. bag.gu ‘bag’ ka.ta ‘type’ vs. kat.ta ‘bought (past tense of ‘buy’)’ to.sa ‘Tosa (place name)’ vs. tos.sa ‘in an instant’ More importantly, ‘sokuon’ is an important characteristic of Japanese speech rhythm known as mora-timing. It is one of the four elements that can form a mora 1 The other type of geminate consonant is geminate nasals, which phonologically consist of a coda nasal and the nasal onset of the following syllable, e.g., /am.
    [Show full text]
  • Names for Encodings – Mechanisms for Labeling Text with Encoding
    Web Internationalization Standards and Practice Tex Texin, XenCraft Copyright © 2002-2016 Tex Texin Internationalization and Unicode Conference IUC40 Abstract This is an introduction to internationalization on the World Wide Web. The audience will learn about the standards that provide for global interoperability and come away with an understanding of how to work with multilingual data on the Web. Character representation and the Unicode-based Reference Processing Model are described in detail. HTML, including HTML5, XHTML, XML (eXtensible Markup Language; for general markup), and CSS (Cascading Style Sheets; for styling information) are given particular emphasis. Web Internationalization Slide 2 Objectives • Describe the standards that define the architecture & principles for I18N on the web • Scope limited to markup languages • Provide practical advice for working with international data on the web, including the design and implementation of multilingual web sites and localization considerations • Be introductory level – Condense 3 hours to 75-90 minutes. This presentation and example code are available at: www.xencraft.com/training/webstandards.html Web Internationalization – Standards and Practice Slide 3 Legend For This Presentation Icons used to indicate current product support: Google Internet Firefox Chrome Explorer Supported: Partially supported: Not supported: Caution Highlights a note for users or developers to be careful. Web Internationalization Slide 4 How does the multilingual Web work? • How does the server know – my language?
    [Show full text]
  • Of Writing Systems in Terms of Typological and Other Criteria: Cross-Linguistic Observations from the German and Japanese Writing Systems
    The evolution of writing systems: Empirical and cross-linguistic approaches workshop (AG5) @ DGfS2020, Universität Hamburg, Germany; 4-6 March, 2020 ‘Evolution’ of writing systems in terms of typological and other criteria: Cross-linguistic observations from the German and Japanese writing systems Terry Joyce Dimitrios Meletis Tama University, Japan University of Graz, Austria [email protected] [email protected] Overview Opening remarks Selective sample of writing system (WS) typologies Alternative criteria for evaluating WSs Observations from German (GWS) + Japanese (JWS) Closing remarks Opening remarks 1: Chaos over basic terminology! Erring towards understatement, Gnanadesikan (2017: 15) notes, [t]here is, in general, significant variation in the basic terminology used in the study of writing systems. Indeed, as Meletis (2018: 73) observes regarding the differences between the concepts of WS and orthography, [t]hese terms are often shockingly misused as synonyms, or writing system is not used at all and orthography is employed instead. Similarly, Joyce and Masuda (in press) seek to differentiate between the elusive trinity of terms at heart of WS research; namely, script, WS, and orthography, with particular reference to the JWS. Opening remarks 2: Our working definitions WS1 [Schrifttyp]: Abstract relations (i.e., morphographic, syllabographic, + phonemic), as focus of typologies. WS2 [Schriftsystem]: Common usage for signs + conventions of given language, such as GWS + JWS. Script [Schrift]: Set of material signs for specific language. Orthography [Orthographie]: Mediation between script + WS, but often with prescriptive connotations of correct writing. Graphematic representation: Emerging from grapholinguistic approach, a neutral (ego preferable) alternative to orthography. GWS: Use of extended alphabetic set, as used to represent written German language.
    [Show full text]
  • Sveučilište Josipa Jurja Strossmayera U Osijeku Filozofski Fakultet U Osijeku Odsjek Za Engleski Jezik I Književnost Uroš Ba
    CORE Metadata, citation and similar papers at core.ac.uk Provided by Croatian Digital Thesis Repository Sveučilište Josipa Jurja Strossmayera u Osijeku Filozofski fakultet u Osijeku Odsjek za engleski jezik i književnost Uroš Barjaktarević Japanese-English Language Contact / Japansko-engleski jezični kontakt Diplomski rad Kolegij: Engleski jezik u kontaktu Mentor: doc. dr. sc. Dubravka Vidaković Erdeljić Osijek, 2015. 1 Summary JAPANESE-ENGLISH LANGUAGE CONTACT The paper examines the language contact between Japanese and English. The first section of the paper defines language contact and the most common contact-induced language phenomena with an emphasis on linguistic borrowing as the dominant contact-induced phenomenon. The classification of linguistic borrowing thereby follows Haugen's distinction between morphemic importation and substitution. The second section of the paper presents the features of the Japanese language in terms of origin, phonology, syntax, morphology, and writing. The third section looks at the history of language contact of the Japanese with the Europeans, starting with the Portuguese and Spaniards, followed by the Dutch, and finally the English. The same section examines three different borrowing routes from English, and contact-induced language phenomena other than linguistic borrowing – bilingualism , code alternation, code-switching, negotiation, and language shift – present in Japanese-English language contact to varying degrees. This section also includes a survey of the motivation and reasons for borrowing from English, as well as the attitudes of native Japanese speakers to these borrowings. The fourth and the central section of the paper looks at the phenomenon of linguistic borrowing, its scope and the various adaptations that occur upon morphemic importation on the phonological, morphological, orthographic, semantic and syntactic levels.
    [Show full text]