Robust Segmentation of Japanese Text Into a Lattice for Parsing

Total Page:16

File Type:pdf, Size:1020Kb

Robust Segmentation of Japanese Text Into a Lattice for Parsing Robust Segmentation of Japanese Text into a Lattice for Parsing Gary Kacmarcik, Chris Brockett, Hisami Suzuki Microsoft Research One Microsoft Way Redmond WA, 98052 USA {garykac,chrisbkt,hisamis}@microsoft.com of records for a given input string. This is Abstract typically done using a connective-cost model We describe a segmentation component that ([Hisamitsu90]), which is either maintained utilizes minimal syntactic knowledge to produce a laboriously by hand, or trained on large corpora. lattice of word candidates for a broad coverage Japanese NL parser. The segmenter is a finite Both of these approaches suffer from problems. state morphological analyzer and text normalizer Handcrafted heuristics may become a maintenance designed to handle the orthographic variations quagmire, and as [Kurohashi98] suggests in his characteristic of written Japanese, including discussion of the JUMAN segmenter, statistical alternate spellings, script variation, vowel models may become increasingly fragile as the extensions and word-internal parenthetical system grows and eventually reach a point where material. This architecture differs from con- side effects rule out further improvements. The ventional Japanese wordbreakers in that it does sparse data problem commonly encountered in not attempt to simultaneously attack the problems of identifying segmentation candidates and statistical methods is exacerbated in Japanese by choosing the most probable analysis. To minimize widespread orthographic variation (see §3). duplication of effort between components and to Our system addresses these pitfalls by assigning give the segmenter greater freedom to address completely separate roles to the segmenter and the orthography issues, the task of choosing the best parser to allow each to delve deeper into the analysis is handled by the parser, which has access to a much richer set of linguistic information. By complexities inherent in its tasks. maximizing recall in the segmenter and allowing a Other NL systems ([Kitani93], [Kurohashi98]) precision of 34.7%, our parser currently achieves a have separated the segmentation and parsing breaking accuracy of ~97% over a wide variety of components. However, these dual-level systems corpora. are prone to duplication of effort since many Introduction segmentation ambiguities cannot be resolved without invoking higher-level syntactic or The task of segmenting Japanese text into word semantic knowledge. Our system avoids this units (or other units such as bunsetsu ( phrases)) § duplication by relaxing the requirement that the has been discussed at great length in Japanese NL segmenter identify the best path (or even n-best literature ([Kurohashi98], [Fuchi98], [Nagata94], paths) through the lattice of possible records. The et al.). Japanese does not typically have spaces segmenter is responsible only for ensuring that a between words, which means that a parser must correct set of records is present in its output. It is first have the input string broken into usable units the function of the parsing component to select the before it can analyze a sentence. Moreover, a best analysis from this lattice. With this model, variety of issues complicate this operation, most our system achieves roughly 97% recall/precision notably that potential word candidate records may (see [Suzuki00] for more details). overlap (causing ambiguities for the parser) or there may be gaps where no suitable record is 1 System Overview found (causing a broken span). Figure shows a simple block diagram of our These difficulties are commonly addressed using Natural Language Understanding system for either heuristics or statistical methods to create a Japanese, the goal of which is to robustly produce model for identifying the best (or n-best) sequence syntactic and logical forms that allow automatic Orthography Word Segmentation ϼ syntax (or any downstream) component from Lexicon arriving at a correct analysis (e.g., a missing Ѐ record). A recoverable error is one that does not Derivational Assembly ϼ Syntax interfere with the operation of following Ѐ Lexicon Syntactic Analysis ϼ components. An example of the latter is the Ѐ inclusion of an extra record. This extra record Logical Form does not (theoretically) prevent the parser from doing its job (although in practice it may since it Figure 1: Block diagram of Japanese NL system consumes resources). extraction of semantic relationships (see Using standard definitions of recall (R) and [Richardson98]) and support other linguistic precision (P): projects like information retrieval, NL interfaces Seg Seg and dialog systems, auto-summarization and R = correct P = correct machine translation. Tagtotal Segtotal The segmenter is the first level of processing. This where Segcorrect and Segtotal are the number of “correct” is a finite-state morphological analyzer responsible and total number of segments returned by the segmenter, and Tagtotal is the total number of “correct” segments for generating all possible word candidates into a from a tagged corpus, word lattice. It has a custom lexicon (auto- matically derived from the main lexicon to ensure we can see that recall measures non-recoverable consistency) that is designed to facilitate the errors and precision measures recoverable errors. identification of orthographic variants. Since our goal is to create a robust NL system, it behooves us to maximize recall (i.e., make very Records representing words and morphemes are few non-recoverable errors) in open text while handed off by the segmenter to the derivational keeping precision high enough that the extra assembly component, which uses syntax-like rules records (recoverable errors) do not interfere with to generate additional derived forms that are then the parsing component. used by the parser to create syntax trees and logical forms. Many of the techniques here are similar to Achieving near-00% recall might initially seem to what we use in our Chinese NL system (see be a relatively straightforward task given a [Wu98] for more details). sufficiently large lexicon – simply return every possible record that is found in the input string. In The parser (described extensively in [Jensen93]) practice, the mixture of scripts and flexible generates syntactic representations and logical orthography rules of Japanese (in addition to the forms. This is a bottom-up chart parser with inevitable non-lexicalized words) make the task of binary rules within the Augmented Phrase identifying potential lexical boundaries an Structure Grammar formalism. The grammar rules interesting problem in its own right. are language-specific while the core engine is shared among 7 languages (Chinese, Japanese, 3 Japanese Orthographic Variation Korean, English, French, German, Spanish). The Japanese parser is described in [Suzuki00]. Over the centuries, Japanese has evolved a complex writing system that gives the writer a 2 Recall vs. Precision great deal of flexibility when composing text. Four scripts are in common use (kanji, hiragana, In this architecture, data is fed forward from one katakana and roman), and can co-occur within component to the next; hence, it is crucial that the lexical entries (as shown in Table ). base components (like the segmenter) generate a minimal number of omission errors. Some mixed-script entries could be handled as syntactic compounds, for example, ID ࠞ࡯࠼ [ai Since segmentation errors may affect subsequent dii kaado=“ID card”] could be derived from components, it is convenient to divide these errors IDNOUN + ࠞ࡯࠼ NOUN. However, many such items into two types: recoverable and non-recoverable. are preferably treated as lexical entries because A non-recoverable error is one that prevents the ݱ ᤨᤨೞೞ [jijikokkoku = “every ޘೞޘᤨ [”ᣂߒ޿ [atarashii = “new Kanji-Hiragana ߶੃㘃 [honyuurui = “mammal”] repeat moment”] = ߒ޿ ݱ ↲᢫↲᢫ߒ޿ [kaigaishiiޘޘᱤࡉ࡜ࠪ [haburashi = “toothbrush”] characters ↲᢫ Kanji-Katakana ࡅ⚛ [hiso = “arsenic”] “diligent”] distribution of Kanji-Alpha CGS න૏ [CGS tan’i = “CGS system”] [bideo = “video”] voicing marks ࡅࠐ࠹ࠐࠝ ݱ ࡆ࠺ࠝ 2 ᦬ [juunigatsu = “December”] Kanji-Symbol 㧲㧹᡼ㅍ ݱ FM ᡼ㅍ [FM housou = “FM ǫ✢ [ganma sen = “gamma rays”] halfwidth & broadcast”] ߅࠻ࠗ࡟ [otoire = “toilet”] = Mixed kana fullwidth 㩊㩨㨼㩞㩂㩨㩡㩛 ݱ ࠳ࠗࡗࠣ࡜ࡓ [daiyaguramu ࠳ࡉࠆ [daburu = “to double”] “diagram”] [”ID ࠞ࡯࠼ [aidii kaado = “ID card”] भ ݱ ࡄ࡯࠮ࡦ࠻ [paasento = “percent Kana-Alpha ࡔ࠶࠮ࡦࠫࡖ࡯RNA [messenjaa RNA = [kabushiki gaisha = ␠composite ॽ ݱ ᩣᑼળ “messenger RNA”] “incorporated”] symbols ࠬ࠻ࡠࡦ࠴࠙ࡓ 90 [sutoronchiumu 90 = ৷ ݱ 㧞㧤ᣣ [nijuuhachi nichi = “28th day of the “Strontium 90”] Kana-Symbol month”] ߶߃ࠆ 40q[hoeru yonjuu do = “roaring forties”] Table 2: Character type normalizations ᶖߒࠧࡓ [keshigomu = “eraser”] [arufa kentauri sei = Other mixed ǩࠤࡦ࠲࠙࡝ᤊ “Alpha Centauri”] 4 Segmenter Design ࠻ᦠ߈ [togaki = “stage directions”] Table 1: Mixed-script lexical entries Given the broad long-term goals for the overall system, we address the issues of recall/precision and orthographic variation by narrowly defining they have non-compositional syntactic or semantic the responsibilities of the segmenter as: attributes. () Maximize recall In addition, many Japanese verbs and adjectives (and words derived from them) have a variety of (2) Normalize word variants accepted spellings associated with okurigana, optional characters representing inflectional 4.1 Maximize Recall endings. For example, the present tense of ಾࠅ⪭ Maximal recall is imperative. Any recall mistake ߣߔ (kiriotosu = “to prune”) can be written as any made in the segmenter prevents the
Recommended publications
  • JLPT Explanatory Notes on the Use of the Flashcards.Pdf
    Explanatory Notes on the Use of the Flashcards General Introduction There are four sets of flashcards—Grammar, Kana, Kanji and Vocabulary flashcards. In the current digital age, it might seem superfluous to have printable, paper flashcards as there are now plenty of resources online to learn and review Japanese with. However, they can be very useful for a variety of reasons. First of all, physical flashcards have one primary purpose: to help you learn something. That might sound strange, but haven’t you ever switched on your phone to study some vocabulary on your favorite app only to get distracted by some social media notification. The next thing you know, you’ve spent 30 minutes checking up on your friend’s social life and haven’t learned anything new. That’s why using a single-purpose tool like paper flashcards can be a huge help. It helps you to stay focused and sometimes that is the most difficult part of studying a new language. The second reason is that paper flashcards don’t require any electricity. You can use them anytime, anywhere in any environment without worrying about a system crash or your battery dying. Use these flashcards to review what you’ve learned from the Study Guide so that you are fully equipped with the necessary language skills, before going into the test. There is a big difference between being familiar with something and knowing it well. The JLPT expects you to know the material and recall it quickly. That means frequent revision and practice. 1. Grammar Flashcards – Quick Start Guide The grammar cards contain the key phrases from all the grammar units of the Study Guide.
    [Show full text]
  • Writing As Aesthetic in Modern and Contemporary Japanese-Language Literature
    At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2021 Reading Committee: Edward Mack, Chair Davinder Bhowmik Zev Handel Jeffrey Todd Knight Program Authorized to Offer Degree: Asian Languages and Literature ©Copyright 2021 Christopher J Lowy University of Washington Abstract At the Intersection of Script and Literature: Writing as Aesthetic in Modern and Contemporary Japanese-language Literature Christopher J Lowy Chair of the Supervisory Committee: Edward Mack Department of Asian Languages and Literature This dissertation examines the dynamic relationship between written language and literary fiction in modern and contemporary Japanese-language literature. I analyze how script and narration come together to function as a site of expression, and how they connect to questions of visuality, textuality, and materiality. Informed by work from the field of textual humanities, my project brings together new philological approaches to visual aspects of text in literature written in the Japanese script. Because research in English on the visual textuality of Japanese-language literature is scant, my work serves as a fundamental first-step in creating a new area of critical interest by establishing key terms and a general theoretical framework from which to approach the topic. Chapter One establishes the scope of my project and the vocabulary necessary for an analysis of script relative to narrative content; Chapter Two looks at one author’s relationship with written language; and Chapters Three and Four apply the concepts explored in Chapter One to a variety of modern and contemporary literary texts where script plays a central role.
    [Show full text]
  • Print This Article
    Reviews | 159 Germany, Rewohlt Verlag established a paperback series called “New Women” and, from 1977 to 1995, introduced many women writers’ texts, not only from German-language communities but also other nations. In the 1980s, the largest publisher in Germany, Suhrkamp, asked Hijiya- Kirschnereit to become the editor of a 32-volume series on Japanese literature. This meant rapidly increasing interest in Japanese literature in Germany in the 1980s. In 1990, at the world-famous Frankfurt Book Fair, Japan was selected as the country of theme. Along with Hijiya-Kirschnereit’s translation projects came changes in the book market, general readership, and studies of Japanese women’s literature in the German-language community. As discussed above, Hijiya-Kirschnereit’s focus is on publications and studies on women’s literature in German- and Japanese-language communities. Her 2018 volume reviewed here complements Joan E. Ericson’s careful analysis in her Be a Woman: Hayashi Fumiko and Modern Japanese Women’s Literature (University of Hawai‘i Press, 1997), which pointed out that the literary category joryū bungaku associated sentimentality, lyricism, and impressionism with works of literature written by women, erasing individual differences in women and in each work of literature by the same author. Read in conjunction with Ericson’s analysis, Hijiya-Kirschnereit’s volume helps us understand the personal and literary history of individual authors, the development of the publishing industry, general and academic readership, and studies of modern Japanese women’s literature in the Japanese-, German-, and English-language communities. Japanese Kanji Power: A Workbook for Mastering Japanese Characters By John Millen.
    [Show full text]
  • Title Japanese Kanji Learning Method for Arabic Speakers Sub
    Title Japanese Kanji learning method for Arabic speakers Sub Title Author Alsharif, Afnan(Katō, Akira) 加藤, 朗 Publisher 慶應義塾大学大学院メディアデザイン研究科 Publication year 2017 Jtitle Abstract Notes Genre Thesis or Dissertation URL http://koara.lib.keio.ac.jp/xoonips/modules/xoonips/detail.php?koara_id=KO40001001-00002017 -0564 Powered by TCPDF (www.tcpdf.org) Master's thesis Academic Year 2017 Japanese Kanji Learning Method for Arabic Speakers Keio University Graduate School of Media Design Alsharif Afnan A Master’s Thesis submitted to Keio University Graduate School of Media Design in partial fulfillment of the requirements for the degree of MASTER of Media Design Alsharif Afnan Thesis Committee: Professor Akira Kato (Supervisor) Professor Hideki Sunahara (Co-Supervisor) Senior Assistant Professor Marcos Sadao Maekawa (Reviewer) Abstract of Master’s Thesis of Academic Year 2017 Japanese Kanji Learning Method for Arabic Speakers Category: Design Summary Arabic speakers who wish to learn Japanese deal with a very limited number of Kanji resources. This is because for several centuries the cultural exchange between Japan and the Arab world has been inactive which reflected on the amount of published work between the two nations. Learners mostly rely on English resources but this is not an efficient way to learn because first, not every Arabic speaker knows English and second, during translation meaning lost can happen. We address this challenging problem by proposing a learning method specific to native Arabic speakers. It employs two ideas: The first is Arabic Kanji Equivalent which is a copied version of Kanji in Arabic words. The second is Phrase Based Learning where Japanese text along with Arabic translation is given for active reading practice.
    [Show full text]
  • Rubric for Script (Adapted from VAC
    Rubric for Script (adapted from VAC http://www.carla.umn.edu/assessment/VAC/evaluation/e_1.html) Mechanics (3 points possible) Score 1st Draft 3 Few errors in punctuation and mastery of writing in genkoo yooshi (special format for Japanese composition). Hiragana and Katakana are written carefully with few errors. 2 Frequent errors in punctuation and lack of mastery of writing in genkoo yooshi. Hiragana and Katakana are written with frequent errors and demonstrate poor writing skills. 1 Dominated by errors in punctuation, in writing of Hiragana and Katakana, and no mastery of writing in genkoo yooshi. Content (4 points possible) Score Final 4 Topic clearly and fully developed. Details, descriptions, examples, and anecdotes explain and clarify information. All task requirements more than adequately fulfilled. 3 Topic adequately developed with relevant information and some details. Task requirements adequately fulfilled. 2 Topic partially developed; some relevant information is missing; few or no details. Task requirements partially fulfilled. 1 Topic poorly developed and/or task requirements not fulfilled. Or not enough material to evaluate. Grammar (5 points possible) Score Final 5 Few grammatical mistakes and the meaning is always clear. 4 Occasional grammatical mistakes, but do not interfere with the meaning of the message. 3 Frequent grammatical mistakes and sometimes interfere with the meaning of the message. 2 Very frequent grammatical mistakes and often interfere with the meaning of the message. 1 Incomprehensible or not enough material provided to evaluate. Vocabulary (4 points possible) Score Final 4 Wide range of vocabulary learned in class, used appropriately in all instances and with few spelling mistakes.
    [Show full text]
  • A Content Analysis of Kanji Textbooks Targeted for Second Language Learners of Japanese
    STOCKHOLMS UNIVERSITY Department of Asian, Middle Eastern and Turkish Studies A Content Analysis of Kanji Textbooks Targeted for Second Language Learners of Japanese Master of Art Thesis 2015 Joonas Riekkinen Supervisor: Gunnar Linder Acknowledgements There are several people that in many ways have contributed to make this thesis possible, but I will only acknowledge some of them. First I need to thank my supervisor Gunnar “Jinmei” Linder who in a short notice decided to undertake this project. He has in many ways inspired me over the years. I would also like to thank the staff, teachers and lecturers that I had over the years at Stockholm University. I also wish to thank the kanji teachers that I had at Tohoku University. Many thanks to Dennis “BAMF” Moberg for inspiring me, for introducing me various methods of kanji learning and for making countless of kanji word lists. I am looking forward for our future collaboration. Thanks to Jasmine “Master of the Universe” Öjbro for countless of corrections and proofreading. Thanks to Admir “Deus” Hodzic for proofreading, despite the fact of not knowing any Japanese. I also wish to thank Carlos “Akki” Giotis, for making it easy for me to combine monotonous studies with work. I want to thank my mother Marja Riekkinen for supporting me for many years. Last, I would like to thank Narumi Chiba for the help I received in Japan, as well as in the early stages of my master studies. 1 Introduction 1 1.1 Terminology 2 1.2 Purpose and Research Question 3 1.3 Theoretical Framework 4 1.3.1 Kanji and its
    [Show full text]
  • General Explanations Nihon Kokugo Daijiten Editorial Policy 1
    General Explanations Nihon kokugo daijiten Editorial Policy 1. This dictionary attempts to offer a historical account of the meanings and usages of the Japanese language through reference to various written materials. 2. Entry items include the vocabulary of modern Japanese as well as items from historical texts. Proper nouns, including names of places and people, and technical and specialized terms are also included. 3. Definitions of a given word are generally arranged in historical order; usage citations are accompanied by the name of the text in which they appear. 4. Sources for citations are taken from a broad spectrum of texts including literary, historical, religious, and other works of various periods. 5. Citation sources range from works of antiquity to works of the Meiji, Taishō, and Shōwa periods. Chinese texts are also used for Sino-Japanese words. 6. Citations are taken from the most reliable versions of historical texts; in cases where variant texts are cited, notice is given. 7. Identification of citations is a specific as possible. In order to facilitate comprehension, some citations include the author's name and the field with which the text is associated. 8. Separate subheadings for Dialectal Variants, Etymology, Pronunciation, and Premodern Dictionary Citations are included with commentary where appropriate. 9. Entry headings and definitions are based on modern standards, and are intended to make location and comprehension as easy as possible. Components of the Descriptions The descriptions of words in this dictionary are composed of the following elements: entry heading, historical kana orthography, kanji, part of speech, definitions, examples and sources, supplementary notes, dialectal variants, etymology, pronunciation, and premodern dictionary citations.
    [Show full text]
  • The ALA-LC Japanese Romanization Table
    Japanese The ALA-LC Japanese Romanization Table Revision Proposal March 30, 2018 Created by CTP/CJM Joint Working Group on the ALA-LC Japanese Romanization Table (JRTWG) JRTWG is part of: Association of Asian Studies (AAS) Council on East Asian Libraries (CEAL) Committee on Technical Processing (CTP) and Committee on Japanese Materials (CJM) JRTWG Co-Chairs: Yukari Sugiyama (Yale University) Chiharu Watsky (Princeton University) Members of JRTWG: Rob Britt (University of Washington) Ryuta Komaki (Washington University) Fabiano Rocha (University of Toronto) Chiaki Sakai (The Japan Foundation Japanese-Language Institute, Urawa) Keiko Suzuki (The New School) [Served as Co-Chair from April to September 2016] Koji Takeuchi (Library of Congress) Table of Contents Romanization Table…………………………………………………………………………………….Pages 1-28 1. Introduction……………………………………………………………………………………Page 1 2. Basic Principles for Romanization……………………………………………………Page 1 3. Capitalization………………………………………………………………………………….Page 5 4. Japanese Punctuation and Typographical Marks…………………………….Page 8 5. Diacritic Marks and Other Symbols Used in Romanization………………Page 9 6. Word Division………………………………………………………………………………….Page 11 7. Numerals…………………………………………………………………………………………Page 25 Romanized/Kana Equivalent Charts…………………………………………………..…..…….Pages 29-30 Helpful References…………………………………………………………………………………....…Page 31 Last Updated: 3/30/2018 10:48 AM 1 Introduction: Scope of the Romanization Table Romanization is one type of transliteration. Transliteration is the process of converting text written
    [Show full text]
  • 前書き | Introduction
    前書き | INTRODUCTION Objectives This chapter provides a general overview of the Japanese language, and an introduction to its sound and writing systems. I. Japanese Language (An overview) II. Japanese Writing System III. Speech Sounds IV. Sound Units V. Writing System (A Detailed Introduction) Hiragana, Katakana, Kanji VI. The Sentence Structure of Japanese Introduction 1 I. Japanese language Japanese is a language spoken by people in Japan and some descendants of the Japanese who have immigrated to communities in countries around the world, for example Brazil, Hawaii, Northern California, and North Sydney, Australia. II. Writing System (An Overview) Japanese is written using a combination of hiragana, katakana, and Chinese characters (kanji). All Japanese writing scripts were derived from Chinese characters over 1000 years ago. Hiragana and katakana are characters that have been simplified tremendously and stripped of meaning. They are syllabaries representing the sounds of the Japanese language. Kanji are ideograms, used to write words. The Japanese syllabaries hiragana and katakana consist of 46 characters each, consisting of vowel sounds (-a, i, -u, -e, -o), consonant plus vowel sounds, and a syllabic -n. Diacritical marks (the “ten-ten”, or “small circle”) may be added to make “voiced” sounds. Finally, “glides,” or sounds with -ya, -yu, and -yo, in words such as “Tokyo” or “Kyoto” are indicated in writing with “small -ya, -yu, or -yo” following hiragana in the --i line. See Section V in this introduction for a detailed introduction to hiragana, katakana. There will be also be a more detailed section on kanji later in this introduction, in addition to the kanji sections found in each chapter.
    [Show full text]
  • Homography-Induced Ambiguity in Japanese Kanji and the Lexical Disambiguating Function of Okurigana
    言語学論叢 第 26 号(2007) 17 Homography-induced ambiguity in Japanese kanji and the lexical disambiguating function of okurigana Keisuke HONDA 1 Introduction In the current Japanese writing system, graphemes of the kanji script denote native and Sino-Japanese lexical morphemes. 1 A large number of these graphemes classify as homographic, denoting two or more different morphemes at the same time. This is illustrated in (1), where the grapheme 食 corresponds to a set of five distinct morphemes.2 (1) 食1 {tabe-, kuw-, kuraw-, SYOKU, ZIKI}1 Homographic kanji are potentially ambiguous because they contain multiple possibilities of decoding without making any visible differentiation (Sampson 1985, 1 In the literature, there is no consensus on how to define the term grapheme. This paper adopts the conventional conception that grapheme is the basic unit of writing used to represent a certain unit or units of language (Sproat 2000, Coulmas 2003, Rogers 2005, etc.). As regards kanji, this term will be used to refer to the so-called ‘character’ – not the graphic constituent thereof. Also disputed is the identity of the linguistic entities that are denoted by kanji graphemes. Although this issue deserves a full discussion, it is out of the scope of the present paper. For the sake of convenience, this paper tentatively uses (lexical) morphemes to refer to these entities, although this term is not adequate in some respects. 2 Throughout this paper, examples will be given in the following format. The first and second lines present graphemes and associated morphemes, respectively. The morphemes are transcribed morphophonemically, with Sino-Japanese items shown in uppercase characters.
    [Show full text]
  • Book Review Of, Japanese Kanji Power: a Workbook for Mastering Japanese Characters
    Portland State University PDXScholar World Languages and Literatures Faculty Publications and Presentations World Languages and Literatures 3-30-2020 Book Review of, Japanese Kanji Power: A Workbook for Mastering Japanese Characters Karen Curtin Portland State University, [email protected] Follow this and additional works at: https://pdxscholar.library.pdx.edu/wll_fac Part of the Japanese Studies Commons Let us know how access to this document benefits ou.y Citation Details Curtin, K. (2020). Japanese Kanji Power: A Workbook for Mastering Japanese Characters. Japanese Language and Literature, 54(1), 159–164. https://doi.org/10.5195/jll.2020.147 This Book Review is brought to you for free and open access. It has been accepted for inclusion in World Languages and Literatures Faculty Publications and Presentations by an authorized administrator of PDXScholar. Please contact us if we can make this document more accessible: [email protected]. Reviews | 159 Germany, Rewohlt Verlag established a paperback series called “New Women” and, from 1977 to 1995, introduced many women writers’ texts, not only from German-language communities but also other nations. In the 1980s, the largest publisher in Germany, Suhrkamp, asked Hijiya- Kirschnereit to become the editor of a 32-volume series on Japanese literature. This meant rapidly increasing interest in Japanese literature in Germany in the 1980s. In 1990, at the world-famous Frankfurt Book Fair, Japan was selected as the country of theme. Along with Hijiya-Kirschnereit’s translation projects came changes in the book market, general readership, and studies of Japanese women’s literature in the German-language community. As discussed above, Hijiya-Kirschnereit’s focus is on publications and studies on women’s literature in German- and Japanese-language communities.
    [Show full text]
  • Introduction
    INTRODUCTION Overview of text Description of the Japanese Language I. Japanese Language II. Writing System (overview) III. Speech Sound IV. Sound Unit V. Writing System VI. Sentence Structure 1 I. Japanese language Japanese is a language spoken by people in Japan and some descendants of the Japanese who have immigrated to communities in countries around the world, for example Brazil, Hawaii, Northern California, and North Sydney. II. Writing System (Overview) Japanese is written using a combination of hiragana, katakana, and Chinese characters (kanji). All Japanese writing scripts were derived from Chinese characters over 1000 years ago. Hiragana and katakana are characters that have been simplified tremendously and stripped of meaning. They are syllabaries, which represent the sounds of the Japanese language. Kanji are ideograms, used to write words. There will be a more detailed section on kanji later in this introduction, in addition to the kanji sections found in each paragraph. In general, each of the scripts is used as follows: Category Appearance Usage hiragana “curvy” All sounds in Japanese; typically used for particles, inflectional endings (“conjugations”) of verbs, adjectives and the like, and some words that are typically not written in kanji katakana “angular” While katakana can be used to write all sounds in Japanese, it is typically used to write borrowed words, and onomatopoeia kanji Chinese Characters Kanji are generally used to write words, such as nouns, and the roots of verbs and adjectives The Japanese syllabaries hiragana and katakana consist of 46 characters each, consisting of vowel sounds (-a, i, -u, -e, -o), consonant plus vowel sounds, and a syllabic –n.
    [Show full text]