Reading Hentaigana and Kuzushiji: Summer Workshop

Total Page:16

File Type:pdf, Size:1020Kb

Reading Hentaigana and Kuzushiji: Summer Workshop H-Japan Reading Hentaigana and Kuzushiji: Summer Workshop Discussion published by Linda Chance on Tuesday, March 13, 2018 The Penn Faculty Working Group for Reading Asian Manuscripts will hold a three-day workshop on reading Edo-period hentaigana and kuzushiji June 28-30, 2018 at the University of Pennsylvania, and invites applications from faculty and graduate students from all fields of Japanese studies to participate. The workshop will be led by Dr. Laura Moretti, Emmanuel College, University of Cambridge, and will meet each day from 9 a.m. to 5 p.m. A key feature of this intensive workshop is hands-on experience with a wide range of actual texts, graded by the instructor so that participants can achieve a working knowledge in reading a variety of Edo-period cursive forms. The materials will all be different from those used in previous workshops at Penn and in the Graduate Summer School that Dr. Moretti runs in August in Cambridge www.wakancambridge.com( ). We will include materials from the Kislak Center for Special Collections, Rare Books and Manuscripts. The workshop is suitable for participants with no knowledge ofhentaigana/kuzushiji and for those with elementary or intermediary knowledge. The workshop will be limited to 20 participants and will require a nominal fee of $100 from each participant; travel to Philadelphia, accommodation and related costs are also the responsibility of the participants. Applications should include a two-page curriculum vitae and a brief paragraph describing how the workshop will be of assistance to your research as well as a self-evaluation of reading skill in hentaigana and kuzushiji. The workshop requires knowledge of Classical Japanese. Graduate students should also include the name of their advisor as a reference. All applications are due May 15, 2018, by 5 p.m., and participants will be informed by May 30 of acceptance; we will also keep a waiting list and notify additional participants on a rolling basis. All participants will be required to pay the fee upon acceptance. For more information, please contact Prof. Julie Davis ([email protected]) or Prof. Linda Chance ([email protected]). Citation: Linda Chance. Reading Hentaigana and Kuzushiji: Summer Workshop. H-Japan. 03-13-2018. https://networks.h-net.org/node/20904/discussions/1527393/reading-hentaigana-and-kuzushiji-summer-workshop Licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License. 1.
Recommended publications
  • The Study of Old Documents of Hokkaido and Kuril Ainu
    NINJAL International Symposium 2018 Approaches to Endangered Languages in Japan and Northeast Asia, August 6-8 The study of old documents of Hokkaido and Kuril Ainu: Promise and Challenges Tomomi Sato (Hokkaido U) & Anna Bugaeva (TUS/NINJAL) [email protected] [email protected]) Introduction: Ainu • AINU (isolate, North Japan, moribund) • Is the only non-Japonic lang. of Japan. • Major dialect groups : Hokkaido (moribund), Sakhalin (extinct since 1993), Kuril (extinct since the end of XIX). • Was also spoken in Tōhoku till mid XVIII. • Hokkaido Ainu dialects: Southwestern (well documented) Northeastern (less documented) • Is not used in daily conversation since the 1950s. • Ethnical Ainu: 100,000. 2 Fig. 2 Major language families in Northeast Asia (excluding Sinitic) Amuric Mongolic Tungusic Ainuic Koreanic Japonic • Ainu shares only few features with Northeast Asian languages. • Ainu is typologically “more like a morphologically reduced version of a North American language.” (Johanna Nichols p.c.). • This is due to the strongly head-marking character of Ainu (Bugaeva, to appear). Why is it important to study Ainu? • Ainu culture is widely regarded as a direct descendant of the Jōmon culture which was spread in the Japanese archipelago in the Prehistoric time from about 14,000 BC. • Ainu is the only surviving Jōmon language; there had been other Jōmon lgs too: about 300 lgs (Janhunen 2002), cf. 10 lgs (Whitman, p.c.) . • Ainu is likely to be much more typical of what languages were like in Northeast Asia several millennia ago than the picture we would get from Chinese, Japanese or Korean. • Focusing on Ainu can help us understand a period of northeast Asian history when political, cultural and linguistic units were very different to what they have been since the rise of the great historically-attested states of East Asia.
    [Show full text]
  • Arxiv:1812.01718V1 [Cs.CV] 3 Dec 2018
    Deep Learning for Classical Japanese Literature Tarin Clanuwat∗ Mikel Bober-Irizar Center for Open Data in the Humanities Royal Grammar School, Guildford Asanobu Kitamoto Alex Lamb Center for Open Data in the Humanities MILA, Université de Montréal Kazuaki Yamamoto David Ha National Institute of Japanese Literature Google Brain Abstract Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the perspective of ML researchers, the content of the task itself is largely irrelevant, and thus there have increasingly been calls for benchmark tasks to more heavily focus on problems which are of social or cultural relevance. In this work, we introduce Kuzushiji-MNIST, a dataset which focuses on Kuzushiji (cursive Japanese), as well as two larger, more challenging datasets, Kuzushiji-49 and Kuzushiji-Kanji. Through these datasets, we wish to engage the machine learning community into the world of classical Japanese literature. 1 Introduction Recorded historical documents give us a peek into the past. We are able to glimpse the world before our time; and see its culture, norms, and values to reflect on our own. Japan has very unique historical pathway. Historically, Japan and its culture was relatively isolated from the West, until the Meiji restoration in 1868 where Japanese leaders reformed its education system to modernize its culture. This caused drastic changes in the Japanese language, writing and printing systems. Due to the modernization of Japanese language in this era, cursive Kuzushiji (くずしc) script is no longer taught in the official school curriculum.
    [Show full text]
  • Foreignisation Part II: Writing the Outside out Sean Mahoney
    Article Foreignisation Part II: Writing the Outside Out Sean Mahoney ・Thefore1gnercomeslnwhenthe consciousnessof mydiffe「once arises and he disappears when we all acknowledge Ou「Selves as foreigners,unamenableto bondsand communities. _ a s,pi Abstract: Authorities In Japan well_known for meticulous reCO「ding Of detail and strict adherence to registration procedures haVe.oVe「 the yea「S;been dubbed xenophobic by observers. Such comments resound pa「tiCula「ly Intensely of course in discussions of Japan's plethora Of Policies fo「 dealing with jnternatjonal issues. The requi「ed P「OCeSS of WO「king through the labyrjnth of rules and regulations fo「 Conducting business, engaging In new employment,or even finding living accommodations has daunted and indeed prevented many from both onto「ing and Staying in this country Despite constant governmental proposals and p「Og「ammeS designed to enhance “internationalisation,” the pace Of 「eat Change iS somehow kept in the range of extremely slow to non-existent,With the results of various initiatives almost invariably proving SuPe「fiCia1 in society at large. I offer that at least one habit of foreignisation',that Which involves - 57 -- 1 行政社会論集 第12巻 第1号 the written demarcation of “native'' and ''foreign'' words in modern Japanese, is at least partly responsible for reinforcing xenophobic patterns of communication and conduct. While scholars in early Japan surely regarded the Chinese works they translated as “foreign,''concrete guidelines through which usage of the hiragana and katakana syllabaries prescribe
    [Show full text]
  • Section 18.1, Han
    The Unicode® Standard Version 13.0 – Core Specification To learn about the latest version of the Unicode Standard, see http://www.unicode.org/versions/latest/. Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trade- mark claim, the designations have been printed with initial capital letters or in all capitals. Unicode and the Unicode Logo are registered trademarks of Unicode, Inc., in the United States and other countries. The authors and publisher have taken care in the preparation of this specification, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. © 2020 Unicode, Inc. All rights reserved. This publication is protected by copyright, and permission must be obtained from the publisher prior to any prohibited reproduction. For information regarding permissions, inquire at http://www.unicode.org/reporting.html. For information about the Unicode terms of use, please see http://www.unicode.org/copyright.html. The Unicode Standard / the Unicode Consortium; edited by the Unicode Consortium. — Version 13.0. Includes index. ISBN 978-1-936213-26-9 (http://www.unicode.org/versions/Unicode13.0.0/) 1.
    [Show full text]
  • Hentaigana Proposal
    ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps. A. Administrative 1. Title: A proposal for encoding the hentaigana characters 2. Requester's name: Jeroen Ruigrok van der Werven 3. Requester type (Member body/Liaison/Individual contribution): Individual contribution 4. Submission date: 2009-02-15 5. Requester's reference (if applicable): 6. Choose one of the following: This is a complete proposal: X (or) More information will be provided later: B. Technical – General 1. Choose one of the following: a. This proposal is for a new script (set of characters): Proposed name of script: b. The proposal is for addition of character(s) to an existing block: Name of the existing block: Historic Kana (U+1B000 – U+1B0FF) 2. Number of characters in proposal: TBD Comment [JRW1]: Mojikyo has over 3. Proposed category (select one from below - see section 2.2 of P&P document): 200, apparently. A-Contemporary B.1-Specialized (small collection) X B.2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4.
    [Show full text]
  • Emmanuel College Cambridge Graduate Summer School on Edo
    Emmanuel College Cambridge Graduate Summer School on Edo-period Written Japanese Reading hentaigana and kuzushiji Manual Laura Moretti 04-16 August 2014 First part: some basic knowledge Introduction: The age of wahon literacy 和本リテラシー p. 4 Japanese manuscripts and woodblock printed books: useful terminology p. 11 Kohitsu 古筆 Hanpon 板本 and shahon 写本 Komonjo 古文書 Reading and transcribing Edo-period texts p. 17 1. How can I locate the original? 2. What edition should I create? 3. What text should I choose as my teihon 底本? 4. What tools will help me in reading and transcribing the text? Hentaigana 変体仮名: a helpful chart and some tips Kanji, kuzushiji and much more 5. Produce your hanrei 凡例 Second part: examples of honkoku p.32 First part: some basic knowledge 3 Introduction The age of wahon literacy 和本リテラシー We are living in a very exciting age when it comes to pre-Meiji texts, in particular Edo-period texts. This is due partly to the fact that accessibility to these texts has been steadily enhanced thanks to various projects of digitization developed in recent years and partly to the fact that the community of Edo-period scholars within and outside Japan has been growing. But these are not the only reasons. More than anything else, from 2008 scholars in Japan have began drawing attention to the need to provide younger generations with the necessary tools to access not only their ‘present’ but also their own ‘past’, so as to build a better ‘future’. In particular Nakano Mitsutoshi 中野三敏 has become the advocate of what he has named wahon literacy 和本リテラシー.
    [Show full text]
  • The Wa.Ii Shorau-Sho of Keichu and Its Position in Historical Usance
    The Wa.ii shorau-sho of Keichu and Its Position in Historical Usance Studies by Christopher Seeley Thesis presented for the degree of Doctor of Philosophy University of London ProQuest Number: 10731311 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a com plete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10731311 Published by ProQuest LLC(2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States C ode Microform Edition © ProQuest LLC. ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106- 1346 2 ABSTRACT This thesis is concerned with examination and interpretation from the orthographical viewpoint of the system of historical kana iisage (rekishiteld, kana-gukai) proposed hy the 17th century scholar-priest Keichu, and its relationship to previous and subsequent kana usage and kana usage theory. In the introductory chapter, the meanings and scope of the term kana-sukai are considered, as also the question of how kana-zukai first arose. Chapter Two consists of a description of kana usage Before Keichu, in order to put the historical kana usage of Keichu into perspective. In Chapter Three a brief introduction to ICeichu and his works is given, together with a consideration of the significance of his kana usage studies within his work as a whole*r£ Chapter Four sets out assumptions concerning the sound-system of the language of KeichU as a preliminary to examination of his Icana usage writings.
    [Show full text]
  • Automated Transcription for Pre-Modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Learning
    Automated Transcription for Pre-Modern Japanese Kuzushiji Documents by Random Lines Erasure and Curriculum Learning Anh Duc Le1 1 Center for Open Data in The Humanities, Tokyo, Japan [email protected] Abstract. Recognizing the full-page of Japanese historical documents is a chal- lenging problem due to the complex layout/background and difficulty of writing styles, such as cursive and connected characters. Most of the previous methods divided the recognition process into character segmentation and recognition. However, those methods provide only character bounding boxes and classes without text transcription. In this paper, we enlarge our previous human- inspired recognition system from multiple lines to the full-page of Kuzushiji documents. The human-inspired recognition system simulates human eye movement during the reading process. For the lack of training data, we propose a random text line erasure approach that randomly erases text lines and distorts documents. For the convergence problem of the recognition system for full- page documents, we employ curriculum learning that trains the recognition sys- tem step by step from the easy level (several text lines of documents) to the dif- ficult level (full-page documents). We tested the step training approach and random text line erasure approach on the dataset of the Kuzushiji recognition competition on Kaggle. The results of the experiments demonstrate the effec- tiveness of our proposed approaches. These results are competitive with other participants of the Kuzushiji recognition competition. Keywords: Kuzushiji recognition, full-page document recognition, random text line erasure approach. 1 Introduction Japan had been using Kuzushiji or cursive writing style shortly after Chinese charac- ters got into the country in the 8th century.
    [Show full text]
  • Hentaigana Proposal
    JTC1/SC2/WG2 N3698 ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps. A. Administrative 1. Title: A proposal for encoding the hentaigana characters 2. Requester's name: Jeroen Ruigrok van der Werven 3. Requester type (Member body/Liaison/Individual contribution): Individual contribution 4. Submission date: 2009-02-15 5. Requester's reference (if applicable): 6. Choose one of the following: This is a complete proposal: X (or) More information will be provided later: B. Technical – General 1. Choose one of the following: a. This proposal is for a new script (set of characters): Proposed name of script: b. The proposal is for addition of character(s) to an existing block: Name of the existing block: Historic Kana (U+1B000 – U+1B0FF) 2. Number of characters in proposal: TBD Comment [JRW1]: Mojikyo has over 3. Proposed category (select one from below - see section 2.2 of P&P document): 200, apparently. A-Contemporary B.1-Specialized (small collection) X B.2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4.
    [Show full text]
  • Recognition of Japanese Handwritten Characters with Machine Learning Techniques
    Recognition of Japanese handwritten characters with M achine learning techniques Bachelor’s degree in Multimedia Engineering Bachelor’s Thesis Auth or: José Vicente Tomás Pérez Supervisor: José Manuel Iñesta Quereda July 2020 Recognition of Japanese handwritten characters with Machine learning techniques A comprehensive research on the difficulties for the recognition of Japanese handwritten characters and the application of Machine learning techniques used to approach this problem Author José Vicente Tomás Pérez Supervisor José Manuel Iñesta Quereda Department of Software and Computing Systems (DLSI) Bachelor’s degree in Multimedia Engineering Escuela Politécnica Superior ALICANTE, July 2020 Abstract The recognition of Japanese handwritten characters has always been a challenge for researchers. A large number of classes, their graphic complexity, and the existence of three different writing systems make this problem particularly difficult compared to Western writing. For decades, attempts have been made to address the problem using traditional OCR (Optical Character Recognition) techniques, with mixed results. With the recent popularization of machine learning techniques through neural net- works, this research has been revitalized, bringing new approaches to the problem. These new results achieve performance levels comparable to human recognition. Fur- thermore, these new techniques have allowed collaboration with very different disci- plines, such as the Humanities or East Asian studies, achieving advances in them that would not have been possible without this interdisciplinary work. In this thesis, these techniques are explored until reaching a sufficient level of un- derstanding that allows us to carry out our own experiments, training neural network models with public datasets of Japanese characters. However, the scarcity of public datasets makes the task of researchers remarkably difficult.
    [Show full text]
  • A Human-Inspired Recognition System for Premodern Japanese Historical
    Date of publication xxxx 00, 0000, date of current version xxxx 00, 0000. Digital Object Identifier 10.1109/ACCESS.2017.Doi Number A human-inspired recognition system for pre- modern Japanese historical documents Anh Duc Le,Tarin Clanuwat, and Asanobu Kitamoto ABSTRACT Recognition of historical documents is a challenging problem due to the noised, damaged characters and background. However, in Japanese historical documents, not only contains the mentioned problems, pre-modern Japanese characters were written in cursive and are connected. Therefore, character segmentation based methods do not work well. This leads to the idea of creating a new recognition system. In this paper, we propose a human-inspired document reading system to recognize multiple lines of pre- modern Japanese historical documents. During the reading, people employ eyes movement to determine the start of a text line. Then, they move the eyes from the current character/word to the next character/word. They can also determine the end of a line or skip a figure to move to the next line. The eyes movement integrates with visual processing to operate the reading process in the brain. We employ attention-based encoder- decoder to implement this recognition system. First, the recognition system detects where to start a text line. Second, the system scans and recognize character by character until the text line is completed. Then, the system continues to detect the start of the next text line. This process is repeated until reading the whole document. As results, the system is successful to recognize multiple lines, connected and cursive characters without performing character/line segmentation.
    [Show full text]
  • Hentaigana Proposal
    ISO/IEC JTC 1/SC 2/WG 2 PROPOSAL SUMMARY FORM TO ACCOMPANY SUBMISSIONS 1 FOR ADDITIONS TO THE REPERTOIRE OF ISO/IEC 10646 Please fill all the sections A, B and C below. Please read Principles and Procedures Document (P & P) from http://www.dkuug.dk/JTC1/SC2/WG2/docs/principles.html for guidelines and details before filling this form. Please ensure you are using the latest Form from http://www.dkuug.dk/JTC1/SC2/WG2/docs/summaryform.html. See also http://www.dkuug.dk/JTC1/SC2/WG2/docs/roadmaps.html for latest Roadmaps. A. Administrative 1. Title: A proposal for encoding the hentaigana characters 2. Requester's name: Jeroen Ruigrok van der Werven 3. Requester type (Member body/Liaison/Individual contribution): Individual contribution 4. Submission date: 2009-02-15 5. Requester's reference (if applicable): 6. Choose one of the following: This is a complete proposal: X (or) More information will be provided later: B. Technical – General 1. Choose one of the following: a. This proposal is for a new script (set of characters): Proposed name of script: b. The proposal is for addition of character(s) to an existing block: Name of the existing block: Historic Kana (U+1B000 – U+1B0FF) 2. Number of characters in proposal: TBD Comment [JRW1]: Mojikyo has over 3. Proposed category (select one from below - see section 2.2 of P&P document): 200, apparently. A-Contemporary B.1-Specialized (small collection) X B.2-Specialized (large collection) C-Major extinct D-Attested extinct E-Minor extinct F-Archaic Hieroglyphic or Ideographic G-Obscure or questionable usage symbols 4.
    [Show full text]