Sentence Complexity Estimation for Chinese-Speaking Learners of Japanese

Total Page:16

File Type:pdf, Size:1020Kb

Sentence Complexity Estimation for Chinese-Speaking Learners of Japanese Sentence Complexity Estimation for Chinese-speaking Learners of Japanese Jun Liu Yuji Matsumoto Graduate School of Information Science Graduate School of Information Science Nara Institute of Science and Technology Nara Institute of Science and Technology 8916-5 Takayama, Ikoma, Nara 630-0192, Japan 8916-5 Takayama, Ikoma, Nara 630-0192, Japan [email protected] [email protected] Song, 2011; Horiba, 2012; Gilakjani and Sabouri, Abstract 2016). Reading comprehension is also influenced by the complexity of the reading material. Texts It is fairly challenging for a foreign containing highly demanding vocabularies and language learner to read and understand highly complex sentence structures are likely to Japanese texts containing words of high disturb the learners’ reading comprehension. difficulty level or low frequency and Learners of Japanese language from kanji complicated linguistic structures. Because a background countries benefit substantially from large number of Chinese characters (kanji kanji characters commonly used in both Japanese in Japanese) are commonly used both in and Chinese when they read Japanese sentences or Chinese and Japanese, the more confusing documents. However, it is more challenging for problem for Japanese language learners them to read and learn various Japanese functional from kanji background countries is the expressions with varied meanings and usages. acquisition of various complex Japanese The selection of appropriate reading material functional expressions. In this study, we matching the learners’ individual capabilities is propose a method utilizing Japanese kanji highly likely to enable language learners to read in characters, particularly Japanese–Chinese a more focused and selective manner. To support homographs with identical or similar learners in gathering useful information from texts meanings, as a critical feature of sentence- more effectively, certain online public Japanese complexity estimation for Chinese- reading-assistance systems such as Reading Tutor1, speaking learners of Japanese language. Asunaro 2 , Rikai 3 , and WWWJDIC 4 are highly Experimental results have partially effective. These systems are adequately demonstrated the effectiveness of our constructed for providing an internet learning method in enhancing the accuracy of environment where learners can make complete sentence-complexity estimation. use of information from the internet for their Japanese language study, and a few of them are 1 Introduction specifically designed to enable language learners to Enhancing reading capability is one of the understand Japanese texts by offering words with important purposes in second language teaching their corresponding difficulty level information or and learning. There are various factors that impact translation (Toyoda 2016). However, these systems learners’ reading comprehension. A few of these 1 factors involve the learners’ vocabulary knowledge, http://language.tiu.ac.jp/ 2 https://hinoki-project.org/asunaro/ grammar knowledge, reading strategies, interest, 3 http://www.rikai.com/perl/Home.pl attitude, and motivation (Koda, 2007; Han and 4 http://nihongo.monash.edu/cgi-bin/wwwjdic?9T 296 31st Pacific Asia Conference on Language, Information and Computation (PACLIC 31), pages 296–302 Cebu City, Philippines, November 16-18, 2017 Copyright c 2017 Jun Liu and Yuji Matsumoto do not take the learners’ native language that focuses on grammar and utilizes grammar background into account. Moreover, these systems templates. provide learners with limited information on the In recent years, a few Japanese text difficulty grammatical difficulty of all the various types of evaluation systems have been developed to support Japanese functional expressions, which learners Japanese language learners (Hasebe and Lee, 2015; actually intend to learn as a part of the procedure Lee and Hasebe, 2016). For example, JReadability5 for learning Japanese. can analyze input text and estimate its readability In Section 2 of this paper, we introduce some to categorize it as belonging to one of six difficulty previous works. In Section 3, we describe our levels, on the basis of five characteristics: average method for ranking example sentences of Japanese length of sentence; percentage of kango (words of functional expressions by utilizing Japanese– Chinese origin), percentage of wago (words of Chinese homographs with identical or similar Japanese origin), percentage of verbs, and meanings, as a critical feature. Section 4 describes percentage of particles. the several experiments conducted to examine the However, JReadability too does not sufficiently effectiveness of our method. Finally, in Section 5, consider the various types of Japanese functional we conclude and describe future work. expressions with varying difficulty levels. The prediction value calculated by this system is more 2 Previous Research reliable for long texts (approximately 1000 characters) and not for single sentences. Text difficulty or text readability evaluation is one of the challenges in natural language 3 General Method processing (NLP) owing to the linguistic complexity generated from both vocabulary and Japanese and Chinese share a large quantity of grammar. Researchers have been actively homographs that use identical kanji characters exploring methods to evaluate text difficulty (both in simplified Chinese and traditional (Gonzalez-Dios et al., 2014; Hancke, Vajjala, and Chinese). Table 1 presents a few examples of Meurers, 2012; Vajjala and Meurers, 2012; Xia, Japanese–Chinese homographs. These words play Kochmar and Briscoe, 2016). a significant role while reading Japanese or For English texts, there are numerous popular Chinese texts. According to a report by Wang formulas such as Flesch Reading Ease (Flesch (2001), approximately 80–95% Japanese–Chinese 1948) and Flesch-Kincaid Grade Level, all of homographs are used to express identical or similar which are used for several applications such as meanings in both the languages. Foreign language compilation of reading materials for language learners from kanji background countries can learners. Collins–Thompson and Callan (2004) straightforwardly understand the meaning of these proposed a language modeling method to estimate words according to kanji characters. This is the readability of English and French texts. occasionally more convenient than grammar for For Japanese texts, Tateishi, Ono, and Yamada foreign language learners from kanji background (1988a; 1988b) introduced a formula based on six countries to learn Japanese. surface characteristics: average number of For Japanese language learners, a vital challenge characters per sentence, average number of Roman is to master a large number of complex functional letters and symbols, average number of hiragana expressions. Hence, providing appropriate example characters, average number of kanji characters, sentences for learners based on their individual average number of katakana characters, and ratio Japanese language capabilities are highly likely to of touten (comma) to kuten (period). Formula- aid the enhancement of the efficiency of learning based approaches have also been used or teaching various Japanese functional expressions. Japanese to young native speakers (Shibasaki and In order to achieve this goal, we utilize Sawai, 2007; Sato, Matsuyoshi, and Kondoh, 2008; Japanese–Chinese homographs as a new feature, Shibasaki and Tamaoka, 2010). To evaluate text which is more or less dissimilar from previous difficulty level for foreign language learners of research, to estimate sentence difficulty and select Japanese, Wang and Andersen (2016) introduced an approach for evaluating Japanese text difficulty 5 http://jreadability.net 297 the most appropriate example sentences as learning content for Japanese functional expressions. Japanese vocabulary Difficulty level 山岳(mountains) Japanese Chinese Meaning 養う(to cultivate) N1 社会(society) 社会(society) Identical 忙しない(busy) 技術(technology) 技术(technology) Identical 前提(Presupposition) 東西(east and west) 东西(east and Similar 迫る(to press) N2 west; thing) 勇ましい(brave) 培養(culture) 培养(culture; Similar 愛情(love) train) 含める(to include) N3 手紙 手纸 (letters) (toilet paper) Dissimilar 巨大(huge) 勉強(study) 勉强(reluctantly) Dissimilar 複雑(complex) 捨てる(to throw away) N4 Table 1: Examples of Japanese–Chinese 挨拶(greeting) homographs. 学校(school) 3.1 Difficulty Level Evaluation Standard 明るい(bright) N5 To estimate the difficulties of example sentences, 始まる(begin) we follow the standard of the Japanese Language Japanese grammar Difficulty level Proficiency Test (JLPT). The JLPT consists of five べからざる(must not) levels: N1, N2, N3, N4, and N5. The least difficult がてら(while doing something) SP1 6 level is N5, and the most difficult level is N1 . を顧みず(regardless of) Since 2010, the JLPT official lists of vocabulary からといって(just because) and grammar have not been published in books, we に加えて(in addition to) SP2 referenced a few books (Xu and Reika, 2013a; Xu and Reika, 2013b) and online learning websites7,8, に違いない(without a doubt) all of which provide lists of the JLPT vocabulary にとって(to) and grammar with difficulty levels ranging from に比べて(compare) SP3 N1–N5. Here, we consider levels N3/SP3 and わけがない(it is impossible that) lower as “easy” level, levels N2/SP2 and above as かもしれない(maybe) difficult level. A few examples of vocabulary and ことができる(can) SP4 grammar in JLPT are presented in Table 2. みたいだ(similar to) 3.2 List of Japanese–Chinese Homographs てから(after) 前に Japanese language learners from kanji
Recommended publications
  • The Logical Linguistics
    International Journal of Latest Research in Humanities and Social Science (IJLRHSS) Volume 01 - Issue 11, www.ijlrhss.com || PP. 10-48 The Logical Linguistics: Logic of Phonemes and Morae Contained in Speech Sound Stream and the Logic of Dichotomy and Dualism equipped in Neurons and Mobile Neurons Inside the Cerebrospinal Fluid in Ventricular System Interacting for the Unconscious Grammatical Processing and Constitute Meaning of Concepts. (A Molecular Level Bio-linguistic Approach) Kumon “Kimiaki” TOKUMARU Independent Researcher Abstract: This is an interdisciplinary integration on the origins of modern human language as well as the in- brain linguistic processing mechanism. In accordance with the OSI Reference Model, the author divides the entire linguistic procedure into logical layer activities inside the speaker‟s and listener‟s brains, and the physical layer phenomena, i.e. the speech sound stream or equivalent coded signals such as texts, broadcasting, telecommunications and bit data. The entire study started in 2007, when the author visitedthe Klasies River Mouth (KRM) Caves in South Africafor the first time, the oldest modern human site in the world.The archaeological excavation in 1967- 68 (Singer Wymer 1982) confirmed that (anatomically) modern humanshad lived inside the caves duringthe period120 – 60 Ka (thousand years ago), which overlaps with the timing of language acquisition, 75 Ka (Shima 2004). Following this first visitIstarted an interdisciplinary investigation into linguistic evolution and the in- brain processing mechanism of language through reading various books and articles. The idea that the human language is digital was obtained unexpectedly. The Naked-Mole Rats (Heterocephalus glaber) are eusocial and altruistic. They live underground tunnel in tropical savanna of East Africa.
    [Show full text]
  • Exaptation, Grammaticalization, and Reanalysis
    Heiko Narrog Tohoku University Exaptation, Grammaticalization, and Reanalysis Abstract The goal of this paper is to argue that exaptation, as introduced into the study of language change by Lass (1990, 1997), in specific functional domains, is a limited alternative to grammaticalization. Exaptation, similarly to grammaticalization, leads to the formation of grammatical elements. Like grammaticalization, exaptation is based on the mechanism of reanalysis. It decisively differs from grammaticalization, however, as it implies change in the opposite direction, namely from material absorbed in the lexicon back to grammatical material. Two sets of data are presented as evidence for the replicability of this process. One involves the occurrence of exaptation across languages in a specific semantic domain, namely, the evolution of morphological causatives out of lexical verb patterns. The other data pertain to recurrent processes of exaptation in one language, namely in Japanese, where exaptation figures in the development of various morphological categories. In all cases of exaptation, reanalysis is crucially involved. This serves to show that reanalysis may be more fundamental to grammatical change than both grammaticalization and exaptation. Furthermore, it allows for change both with the usual directionality of grammaticalization and against it. 1. Introduction Even detractors of grammaticalization theory do not seriously challenge the fact that in the majority of cases the morphosyntactic and semantic development of grammatical material follows the paths outlined in the standard literature on grammaticalization. The two following California Linguistic Notes Volume XXXII No. 1 Winter, 2007 2 issues, however, potentially pose a critical challenge to the validity of the theory. First, there is the question of the theoretical status of grammaticalization as a coherent and unique concept.
    [Show full text]
  • <Strong><Em>English Grammar for Students of Japanese
    UCLA Issues in Applied Linguistics Title English Grammar for Students of Japanese (The Study Guide for Those Learning Japanese) by Mutsuko Endo Hudson. Ann Arbor: The Olivia and Hill Press, 1994 vii + 204 pp. Permalink https://escholarship.org/uc/item/29p5w1zb Journal Issues in Applied Linguistics, 5(2) ISSN 1050-4273 Author Mishina, Satomi Publication Date 1994-12-30 DOI 10.5070/L452005194 Peer reviewed eScholarship.org Powered by the California Digital Library University of California English Grammar for Students of Japanese (The Study Guide for Those Learning Japanese) by Mutsuko Endo Hudson. Ann Arbor: The Olivia and Hill Press, 1994 vii + 204 pp. Reviewed by Satomi Mishina University of California, Los Angeles English Grammar for Students ofJapanese provides a concise explanation of the key concepts and terminology of English and Japanese grammar. The title of this book may be somewhat misleading, since Hudson does not necessarily emphasize the grammar of English, but rather affords equal emphasis to the grammars of both English and Japanese. The description of English grammar and the contrastive presentation of the two grammar systems are intended to help students learning Japanese to understand its basic grammatical notions in hght of the grammar of their own native language, which, the author assumes, faciUtates the understanding of a foreign language grammar. The grammar points are addressed in separate chapters in the following basic order: parts of speech (e.g., nouns, verbs), inflections, various sentence level phenomena (e.g., subject, topic), sentence type (e.g., affirmative vs. negative, declarative vs. interrogative), tense (e.g., present tense, past tense), voice (e.g., active, passive), and types of clauses (e.g., conditional clauses, relative clauses).
    [Show full text]
  • Porting Grammars Between Typologically Similar Languages: Japanese to Korean Roger KIM Mary DALRYMPLE Palo Alto Research Center Dept
    Porting Grammars between Typologically Similar Languages: Japanese to Korean Roger KIM Mary DALRYMPLE Palo Alto Research Center Dept. of Computer Science 3333 Coyote Hill Rd. King's College London Palo Alto, CA 94304 USA Strand, London WC2R 2LS UK [email protected] [email protected] Ronald M. KAPLAN Tracy Holloway KING Palo Alto Research Center Palo Alto Research Center 3333 Coyote Hill Rd. 3333 Coyote Hill Rd. Palo Alto, CA 94304 USA Palo Alto, CA 94304 USA [email protected] [email protected] Abstract We report on a preliminary investigation of the dif®culty of converting a grammar of one lan- guage into a grammar of a typologically similar language. In this investigation, we started with the ParGram grammar of Japanese and used that as the basis for a grammar of Korean. The re- sults are encouraging for the use of grammar porting to bootstrap new grammar development. 1 Introduction The Parallel Grammar project (ParGram) is an international collaboration aimed at producing broad-cov- erage computational grammars for a variety of languages (Butt et al., 1999; Butt et al., 2002). The gram- mars (currently of English, French, German, Japanese, Norwegian, and Urdu) are written in the frame- work of Lexical Functional Grammar (LFG) (Kaplan and Bresnan, 1982; Dalrymple, 2001), and they are constructed using a common engineering and high-speed processing platform for LFG grammars, the XLE (Maxwell and Kaplan, 1993). These grammars, as do all LFG grammars, assign two levels of syntac- tic representation to the sentences of a language: a super®cial phrase structure tree (called a constituent structure or c-structure) and an underlying matrix of features and values (the functional structure or f- structure).
    [Show full text]
  • Linguist Expands the Horizons of Language Analysis with New Book 13 March 2017, by Peter Dizikes
    Linguist expands the horizons of language analysis with new book 13 March 2017, by Peter Dizikes Many linguistics scholars regard the world's Because English is the native language of so many languages as being fundamentally similar. Yes, the great linguists, he observes, there is a tendency to characters, words, and rules vary. But underneath regard it as a template for other languages. But it all, enough similar structures exist to form what drawing more heavily on additional languages, MIT scholars call universal grammar, a capacity for Miyagawa thinks, could lead to new insights about language that all humans share. the specific contents of our universal language capacity; he cites the work of MIT linguist Norvin To see how linguists find similariites that can elude Richards as an example of this kind of work. the rest of us, consider a language operation called "allocutive agreement." This is a variation of "Given the prominence of Indo-European standard subject-verb agreement. Normally, a verb languages, especially English, in linguistic theory, ending simply agrees with the subject of a one sometimes gets the impression that if sentence, so that in English we say, "You go," but something happens in English it's due to universal also, "She goes." grammar, but if something happens in Japanese, it's because it's Japanese," Miyagawa says. Allocutive agreement throws a twist into this procedure: Even a third-person verb ending, such Not mere formalities as "she goes," changes depending on the social status of the person being spoken to. This To see why allocutive agreement seems like such a happens in Basque, for one.
    [Show full text]
  • A Mathematical Theory of Language
    International Journal of Contemporary Education Vol. 1, No. 1; April 2018 ISSN 2575-3177 E-ISSN 2575-3185 Published by Redfame Publishing URL: http://ijce.redfame.com A Mathematical Theory of Language Hans Gua Correspondence: Hans Gua, Yeda Village, Xiyu Town, Xihe County, Longnan, Gansu 742100, China. Received: November 8, 2017 Accepted: December 24, 2017 Online Published: December 27, 2017 doi:10.11114/ijce.v1i1.2893 URL: https://doi.org/10.11114/ijce.v1i1.2893 Abstract Modern linguistics cannot define and identify the best or standard pronunciations, writing and grammar. The choice and decision of sole human unified standard official or common language cannot be solved by modern linguistics generally. The current linguistics is no longer met the human developments because it can’t answer what the best language is. The sole unified standard official global English cannot appear because of English linguistic level and shortcoming mainly. English linguistics comes to predominate in the contemporary era. Negating and improving the current linguistics must be negated and improved English linguistics first. The finite numerals are expressed infinite quantities in the mathematics. The finite sounds are represented the infinite meanings in the language. The theory and method are almost same in the mathematics and linguistics generally. The linguistics is a branch or concrete application of information theory (IT). IT is based on the probability theory and statistics generally. A meaning is often certain code, string of codes or mathematical value in the language. Defining or explaining certain meaning of language is measured and calculated a mathematical value actually. A subsystem or subtopic such as language teaching is often based on the general linguistics.
    [Show full text]
  • 5.2 Clause and Noun Host Type (CNH-Type) Constructions
    UC Berkeley Dissertations, Department of Linguistics Title Grammar and Semantics of Adnominal Clauses in Japanese Permalink https://escholarship.org/uc/item/81h604rt Author Matsumoto, Yoshiko Publication Date 1989 eScholarship.org Powered by the California Digital Library University of California Grammar and Semantics of Adnominal Clauses in Japanese By Yoshiko Matsumoto MA. (University of Tsukuba, Japan) 1979 M A (University of California) 1982 CJPhil. (University of California) 1987 DISSERTATION Submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in LINGUISTICS in the GRADUATE DIVISION of the UNIVERSITY OF CALIFORNIA at BERKELEY Approved: DOCTORAL DEGREE CONFERRED ********* illfii **6at**bi»etM*********** Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. Grammar and Semantics of Adnominal Clauses in Japanese Copyright ©1988 Yoshiko Matsumoto Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. Grammar and Semantics of Adnominal Clauses in Japanese by Yoshiko Matsumoto A bstract One crucial but rarely emphasized characteristic of Japanese noun modification by adnominal clause (e.g. relative clauses, noun complement clauses) is that the semantic function of the head noun with respect to the predicate in the modifying clause is not explicitly indicated. Despite the difficulties that this creates for any purely syntactic or structural analysis, the role of semantics and pragmatics has received little attention. This study focuses on the construal of Japanese adnominal clauses, and has as purpose (1) to demonstrate that a purely syntactic analysis modelled on analyses of English relative clauses cannot account for Japanese Noun-Modifying Construc­ tions (NMCs), (2) to show that semantics and pragmatics play a crucial role in the construal of clausal NMCs in Japanese, and (3) to suggest a framework that can account for a wide range of naturally-occurring NMCs.
    [Show full text]
  • A. Overview History of Grammar Study of the Japanese Language Grammar Study in Pre-Meiji Era When a Language Comes in C
    A. Overview History of Grammar Study of the Japanese Language Grammar Study in Pre-Meiji Era When a language comes in contact with another language, it is known that the language goes through a period of reflective self-awareness. Contact with Chinese was a major influence on the grammar study of the Japanese language. One example of what this contact brought was the definition of 詞 (shi) as 実質語 jisshitsu-go (substance word) versus te, ni, wo, ha as functional vocabulary. Te-ni-ha taigai shoo [Annotated Summary of Te-Ni-Ha], which is estimated to have been written at the end of the Kamakura period or as late as the beginning of the Muromachi period, has the following description, “詞 (shi) is like a temple or shrine, and te, ni, ha organize and decorate this temple or shrine beautifully.” This is an example of emerging awareness of word classification. It expresses that 詞 (shi) and te, ni, wo, and ha have separate functions. This awareness influenced and furthered the field for generations to come. The two giants of grammar study in the Edo period are Motoori Norinaga and Fujitani Nariakira. Moto’ori wrote Te-ni-wo-ha himo kagami [Hand-Held Mirror for Te-Ni-Wo-Ha] (1771) and Kotoba no tama no o [Precious String of Words] (1785). In the former he classifies kakari (relation opener) into three 行 (gyō, column in this context), ha, mo, tada, zo, no, ya, nani, and koso, and creates a single chart that lists the musubi (tying, conclusion) in 43 rows. This relatively simple chart succinctly captures the regularity of the rules of kakari-musubi.
    [Show full text]
  • Japanese in a Nutshell
    Chapter 1 Japanese in a Nutshell In This Chapter ▶ Dipping your toes into Japanese writing and grammar ▶ Working with simple expressions and basic numbers ▶ Exploring Japanese at home, at work, at play, and abroad elcome to Japanese! Now that you’ve decided to learn this intriguing Wlanguage, I’m sure you’re eager to find out as much as you can as quickly as you can. After all, you probably have co-workers, friends, neighbors, and others to impress with your newfound language skills. Well, here’s your chance to dive into Japanese. This chapter offers you a mini-preview of what you can find throughout Japanese For Dummies, 2nd Edition. Discovering Basic Japanese Sounds and Script Japanese sounds are pretty easy to pronounce. For one thing, Japanese has only five basic vowels: a (ah), i (ee), u (oo), e (eh), and o (oh). In addition, you don’t see a long string of consonants before or after a vowel, unlike in Russian or Polish. You just have to pay attention to a handful of strange con- sonants, such the Japanese r, which makes a sound somewhere between the English lCOPYRIGHTED and r. You can pronounce basic MATERIAL sounds in Japanese along with the audio recordings in Chapter 2. Japanese writing, on the other hand, can be confusing because Japanese scripts don’t look at all like Western alphabets. The Japanese scripts consist of two sets of kana (kah-nah) — phonetic symbols for Japanese syllables — and about 2,000 kanji (kahn-jee) characters, which are Chinese characters adapted for Japanese.
    [Show full text]
  • The Effects of Travel on the Transfer of Linguistic Data Between Asia and Europe
    Borders and Crossings No. 2 - Year 10 04/2020 - LC.6 [sic] - a journal of literature, culture and literary translation Violeta Moretti, Juraj Dobrila University of Pula, Croatia ([email protected]) Broadening the World of Knowledge: The Effects of Travel on the Transfer of Linguistic Data between Asia and Europe Abstract This paper provides an examination of the effects of contact between Europe and Asia in the early modern period, especially with regard to the exchange of linguistic data and ideas. The contact induced by transcontinental travel − with various motives, including missionary work and increasing colonial expansion − added fuel to the intellectual study of language. Both Europe and Asia gained access to new languages and approaches to language, which enabled an exchange of linguistic data and methodologies, which can now be seen as one of the origins of the development of modern linguistics. In order to illustrate the main developments within modern linguistics, earlier systems are briefly addressed along with some of the roots of their transformation or displacement. Latin treatises, including De antiquitate et affinitate linguae Zendicae, Samscrdamicae et Germanicae dissertatio and De Latini sermonis origine et cum orientalibus linguis connexione dissertatio, serve as examples of the earliest development toward comparative linguistic studies in the West. These texts, written by the Carmelite Ivan Filip Vesdin (alias Paulinus a Sancto Bartholomeo; 1748−1806) are some of the earliest Western texts offering a comparative description of the Sanskrit language. Although basically referring to the history of linguistics, this article provides an insight into the transfer of ideas and concepts between cultures and peoples of these macro-regions of human civilization.
    [Show full text]
  • Japanese to Korean Roger KIM Mary DALRYMPLE Palo Alto Research Center Dept
    Porting Grammars between Typologically Similar Languages: Japanese to Korean Roger KIM Mary DALRYMPLE Palo Alto Research Center Dept. of Computer Science 3333 Coyote Hill Rd. King's College London Palo Alto, CA 94304 USA Strand, London WC2R 2LS UK [email protected] [email protected] Ronald M. KAPLAN Tracy Holloway KING Palo Alto Research Center Palo Alto Research Center 3333 Coyote Hill Rd. 3333 Coyote Hill Rd. Palo Alto, CA 94304 USA Palo Alto, CA 94304 USA [email protected] [email protected] Abstract We report on a preliminary investigation of the difficulty of converting a grammar of one lan- guage into a grammar of a typologically similar language. In this investigation, we started with the ParGram grammar of Japanese and used that as the basis for a grammar of Korean. The re- sults are encouraging for the use of grammar porting to bootstrap new grammar development. 1 Introduction The Parallel Grammar project (ParGram) is an international collaboration aimed at producing broad-cov- erage computational grammars for a variety of languages (Butt et al., 1999; Butt et al., 2002). The gram- mars (currently of English, French, German, Japanese, Norwegian, and Urdu) are written in the frame- work of Lexical Functional Grammar (LFG) (Kaplan and Bresnan, 1982; Dalrymple, 2001), and they are constructed using a common engineering and high-speed processing platform for LFG grammars, the XLE (Maxwell and Kaplan, 1993). These grammars, as do all LFG grammars, assign two levels of syntac- tic representation to the sentences of a language: a superficial phrase structure tree (called a constituent structure or c-structure) and an underlying matrix of features and values (the functional structure or f- structure).
    [Show full text]
  • Variations of Double Nominative in Korean and Japanese
    Variations of double nominative in Korean and Japanese Dieter Wunderlich Preface A personal note. Once Sebastian Löbner and I tried to climb Mount Fuji, the king of mountains. Because of heavy rainfall, we were forced to turn back, and ended in a sauna with a sake bar. The double ascent had became a kind of double passivity; and both of us were disappointed by this failure. The king of cases is the nominative; a double nominative has two kings of diUerent descent, sometimes emerging in the passive voice (Leideform in German). This doubling experience made me ultimately decide to write this article. An observation. The double nominative is a very popular subject for Japanese and Korean linguists. There are, presumably, hundreds of papers discussing how it interacts with various Velds of Japanese and Korean syntax, mostly parallel in these languages. Not wrongly, Japanese and Korean linguists consider the double nominative to be a unique feature of their languages. A prejudice. A double nominative is not spectacular in itself. Some linguists believe that nominative is assigned in a speciVc context, say SpecT. In that case, one has to ask: and what assigns the second nominative? Alternatively one might believe that nominative is the default case (often unmarked), and so a double nominative may be more frequent than was previously believed. In many languages, if (for some reason) accusative is blocked for an object, nominative becomes the automatic case instead by default. A brief abstract. In this paper, various types of alternations bringing about double nominatives are discussed. Nominatives in particular invite focus or topic interpretations, dependent on further circumstances.
    [Show full text]