Behavior Research Methods, Instruments, & Computers 2000, 32 (3), 482-500

A Japanese logographic frequency list for cognitive science research

NOBUKO CHIKAMATSU DePaul University, Chicago, Illinois

SHarCHI YOKOYAMA National Language Research Institute ofJapan, ,

HIRONARI NOZAKI Aichi University ofEducation, Kariya, Japan

ERIC LONG National Language Research Institute ofJapan, Tokyo, Japan and SACHIO FUKUDA National University, Yokohama, Japan

This paper describes Japanese logographic character () frequency list, which is based on an analysis of the largest recently available corpus of Japanese and characters. This corpus com­ prised a full year of morning and evening editions of a major , containing more than 23 mil­ lion kanji characters and more than 4,000different kanji characters. This paper lists the 3,000most fre­ quent kanji characters, as well as an analysis of kanji usage and correlations between the present list and previous Japanese frequency lists. The authors believe that the present list will help researchers more accurately and efficiently control the selection of kanji characters in cognitive science research and interpret related psycholinguistic data.

In many empirical psycholinguistic studies, fre­ In the past, compiling linguistic corpora was an ex­ quency is used as an independent variable select ­ tremely labor-intensive process, plagued by reliability terials having desired frequency characteristics or as a concerns caused by human error. However, as computer control variable to match two or more sets ofmaterials in technology continues to develop, researchers are obtain­ order to minimize performance differences attributable ing more reliable linguistic corpora and compiling word to word frequency effects in word recognition, memory, or character frequency lists on the basis ofthese corpora or retrieval performance. It is sometimes important in for linguistic or cognitive science research. For American psycholinguistic research to focus on the frequency effects English, some widely used word frequency lists are the oflinguistic units smaller than words, such as clus­ Brown corpus (Kucera & Francis, 1967), the American ters (.g., bigrams or trigrams), syllabic-type units (sylla­ Heritage WordFrequency Book(Carroll, Davies, & Rich­ ble vs. nonsyllable), units, as well as position man, 1971), and the Thorndike-Lorge count (Thorndike (.e., word-initial, word-middle, or word-final positions; & Lorge, 1944; see the summary in Solso, Juel, & Rubin, Appleman & Mayzner, 1981; Grainger & Jacobs, 1993; 1982). Many of these corpora and lists are available in Srinivas, Roediger, & Rajaram, 1992). For example, logo­ computer database form and/or through the Internet. Con­ graphic character frequency is a crucial factor to consider sequently, researchers may use these corpora and lists to in word experiments using languages with logographic control word frequency in empirical language research scripts, such as Chinese, Japanese, or Korean, where each more easily, efficiently, and accurately than in the past. logographic character may function as a word (Matsunaga, Corpora for non-English languages, including Japanese, 1996). In short, it is important to carefully control the fre­ remain limited in number or are still under development quency ofprinted characters and/or words when empiri­ (Edwards, 1993; Leech & Fligelstone, 1992). cal psycho linguistic studies are conducted. Word and Character Frequency Lists in Japanese The authors thank the reviewers ofthe paper for their valuable sug­ Over the last two decades, researchers in the area of gestions. Correspondence concerning this article may be sent to experimental psychology, especially word recognition and . Chikamatsu, Department ofModern Languages, DePaul University, 802 West Belden Avenue, Chicago, IL 60614 (e-mail: nchikama@ memory, have increasingly focused on the Japanese lan­ condor.depaul.edu). guage, owing to the uniqueness ofits (Kess

Copyright 2000 Psychonomic Society, Inc. 482 JAPANESE CHARACTER FREQUENCY LIST 483

& Miyamoto, 1994; Paradis, Hagiwara, & Hildebrandt, of83 and 86 forms encoded in JIS and 1985; Yokoyama, 1997). In particular, kanji (characters EUe. Hiragana and katakana share the same syllabic­ in a logographic that is one ofthree scripts used in sound representation and can be transcribed one by the writing Japanese) has been widely used in experimental other (e.g., a syllable I I is transcribed as ~ in hiragana materials in order to examine new aspects of cognitive and '!t in katakana). The third Japanese , processing, relative to alphabetic language systems, in kanji, is a logographic script adopted from the Chinese word recognition and memory and hemispheric involve­ language, in which each symbol represents meaning and ment in the acquisition and usage oflanguage. However, functions as a morpheme. A single kanji character may although many studies have been conducted that use represent an independent word (e.g., * Ihon I, book) or Japanese words, the development ofJapanese word fre­ part ofa word (e.g., * in B* Inihon/,Japan). The mean­ quency lists or kanji character frequency lists has not kept ing ofeach constituent (i.e., a single character) in a kanji up with the demand for such lists. As a result, for exam­ word is sometimes less clear or transparent than that ofan ple, the kanji character or word frequency of selected independent word. Owing to the manner in which kanji kanji items has often not been controlled or mentioned in characters were transferred from the Chinese to the Japa­ Japanese word recognition studies when frequency has not nese language over the centuries, a single kanji charac­ been used as a dependent variable (e.g., Eko & Nakamizo, ter may have obtained more than one and may 1989; Flores d'Arcais & Saito, 1993; Flores d'Arcais, be pronounced in several different ways. For instance, Saito, Kawakami, & Masuda, 1994; Kikuchi, 1996; Mor­ the character , which means head, is read as Ito/, ton, Sasanuma, Patterson, & Sakuma, 1992; Nagae, 1994; Idol, Izul, Iju/, /saki/, latama/, Ikashira/, Ikobe/, Naito & Komatsu, 1989; , 1992; Sekiguchi & Abe, I kaburiI, and /tsumuri/ (Coulmas, 1989). Furthermore, 1992; Wang, 1988; Yokosawa & Shimomura, 1993). In a great number ofhomophones (i.e., kanji characters that many other studies, the frequency of kanji characters or share a common pronunciation but represent different words is controlled on the basis of(1) the researcher's sub­ meanings) occur in Japanese kanji usage. For instance, * jective, intuitive judgment (e.g., Flores d' Arcais, Saito, (tree), ~ (feeling), • (chance), 1tl(toglow), WI (period), & Kawakami, 1995; Hatta, Koike, & Langman, 1994; and many others are all pronounced Ikil. Thus, in con­ Shimomura & Yokosawa, 1991), (2) the examinee'sjudg­ trast to both hiragana and katakana, kanji characters do ment, such as subjects' rating on selected items (e.g., ­ not have a systematic sound representation or a one-to­ mada, Mitarai, & Yoshida, 1991), (3) the categorization one relationship between sound and symbol. The number of kanji characters standardized by the Japanese Min­ ofkanji characters is quite large and practically uncount­ istry of Education (e.g., Kyoiku kanji or Gakushu kanji 1; able (i.e., kanji dictionaries may contain between 12,000 see, e.g., Hayashi, 1988; Hirose, 1992; Nakagawa, 1994; and 50,000 entries; Kindaichi, 1991; Morohashi, 1989). Sakuma, Itoh, & Sasanuma, 1989), (4) lists compiled by Among hiragana, katakana, and kanji, usually only one examiners themselves (e.g., Wydell, Butterworth, & Pat­ is conventionally chosen and used to write a given Japa­ terson, 1995; Wydell, Patterson, & Humphreys, 1993), or nese word. Hiragana is used primarily for words that have (5) the National Language Research Institute's (NLRI's) a grammatical function, such as particles or case-makers, 1962 or 1976 word/character frequency lists (Cabeza, and for some native Japanese content words. Katakana is 1995; Morikawa, 1985; Naito & Komatsu, 1988; Sasa­ used for loan words (i.e., words mainly borrowed from numa, Sakuma, & Kitano, 1992; Tsuzuki, 1993). western languages, such as English, French, and Por­ One of the main impediments to the development of tuguese). Kanji is used for content words, such as , Japanese word frequency lists is that the electronic rep­ , and .? Thus, a single Japanese sentence resentation of Japanese characters is more complicated is usually written with all three scripts combined. than that ofalphabetical languages. At present, there are However, choice ofscript is not always consistent and three primary electronic coding systems for Japanese may vary, depending on a writer's intention or a publish­ characters (i.e., kana and kanji): Japanese Industrial er's guidelines for style. For instance, the Japanese word Standards (JIS), Shift-JIS (SJIS), and meaning egg, pronounced Itamago I, could be written as (EUC). Generally, EUC is used in Unix workstations on 'k"iJ;.Z:: in hiragana in one context, but as j~ in kanji in the Internet, whereas JIS is used for Japanese electronic another context. In their study ofthe subjectivefrequency mail. However, SJIS has been adopted for use with per­ ofscript type, Ukita, Sugishima, Minagawa, Inoue, and sonal computers. Consequently, if different coding sys­ Kashu (1996) studied 750 Japanese words that can be tems are used across tasks or methods, one must transfer written in more than one Japanese script. The study was one character code to another, using a converter such as conducted by asking 836 Japanese college students to the network kanji code conversion filter (NKF). judge whether a given word (written in a given script) is Another factor impeding the development ofJapanese seen often, occasionally, or rarely. The results showed frequency lists is the itself, that more than halfof the tested words were identified as which comprises three types of -hira­ words seen in more than one script. This inconsistency in gana, katakana, and kanji. Hiragana and katakana are orthographic representation makes word counting in Japa­ syllabic scripts in which each symbol represents a sound nese extremely difficult and complicated. In addition, unit (a syllable). These scripts each contain 46 basic forms, word segmentation in Japanese is more complex than that with additional and historical forms giving a total in English, since word boundaries are not separated with 484 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA spaces in written texts. A single kanji character could be through the Internet. In consideration of the word seg­ a morpheme ofa part ofa word or a word by itselfand may mentation and other problems mentioned above, the au­ be pronounced differently, depending on the context. With­ thors decided to start by developing frequency lists ofkanji out clear word boundaries, compound words are easily characters before attempting kanji word frequency lists. formed. Thus, complications in word counting and seg­ Furthermore, the current usage ofkanji in printed mass mentation present nontrivial challenges for those compil­ media is analyzed on the basis ofa comparison with 1966 ing word frequency lists in Japanese. frequencies from the NLRI 1976 list. Owing to these problems, few have attempted to make word and kanji character frequency lists in Japanese. The Source ofData, Method ofAnalysis, and Results NLRI in Japan published a word frequency list in 1962, The present corpus is available on a CD-ROM entitled based on a corpus derived from 90 different journals and CD-HIASK'93 (Nichigai Associates & Asahi, 1994), magazines with five different genres, all published in which contains the text of articles appearing in a major 1956. A total of 140 million tokens, consisting of40,000 newspaper, covering 1 year ofmorning and evening edi­ different words (i.e., types), were analyzed in order to de­ tions published in 1993. The data analysis was conducted velop a frequency list ofwords possessing a frequency of on Unix workstations, with the program written by the at least nine. In 1976, the NLRI also published a kanji authors in Perl and awk. First, headlines were excluded­ (character) frequency list based on a corpus published in from the approximately 110,000 articles> in the data. Sec­ 1966derived from three major , Asahi, ­ ond, all the printable characters were counted and ranked uri, and Mainichi. This corpus provided a total of by frequency from highest to lowest in each category (e.g., 991,375 kanji tokens and a frequency list of 3,213 dif­ kanji, hiragana, or katakana). Frequency ratio (%) and cu­ ferent kanji characters. This was the first attempt to an­ mulative frequency ratio (%) were also calculated. alyze a Japanese corpus with computers, and the results The corpus provided a total of 56,563,595 printable were used to standardize and regulate the use of kanji characters,v making it, at the time, the largest corpus characters for mass media and education in Japan. For used for the compilation ofJapanese word/character fre­ the past three decades, researchers have used these lists as quency lists. Ideally, in addition to newspapers, corpora an informative resource for many language-related ­ would be collected from other printed texts, such as mag­ search projects. In 1997, the NLRI published its 1962 list azines and novels of various genres. However, most of in floppy disk format (NLRI, 1997). these printed materials are not yet available in computer­ However, several problems are associated with the use readable form. As a result, it is not currently feasible to ofthe NLRI lists for empirical studies. First, the lists are gather sufficient amounts of data from these materials. based on dated media samples. Almost three or four Furthermore, copyright issues complicate the process of decades have passed since the corpora of the 1962 and copying and studying printed mass media in Japan and 1976 lists were collected. Consequently, the reliability of may pose an obstacle to making lists based on such ma­ these lists is open to question, since the use ofwords or terials freely available to the public. kanji in mass media and education may have changed. The 56,563,595 characters in the 1993 collection ofthe Second, the 1976 list does not identify low-frequency Asahi newspaper form the basis of the present character kanji. The 1962 word list does not contain words possess­ frequency list and break down into the general categories ing a frequency less than nine, and the 1976 kanji list does shown in Table I. A total of23,408,236 kanji tokens were not provide characters possessing a frequency less than found, making up 4,476 different types. Kanji covered nine. Low-frequency words or characters with the fre­ 41% of all the printable characters in the newspapers. quency ofone are required for many empirical language The central result ofthis survey is a list ofthe kanji char­ studies. Third, the lists are not easily accessed, since acters found, sorted according to their frequencies. The both were available only in hard copy form until recently. first 3,000 characters are listed in Appendix A with their Although the NLRI 1962 list is now available in floppy ranking, raw frequency, frequency ratio (%), and cumu­ disk format' it is not as accessible, especially to re­ lative frequency ratio (%). In addition, frequency lists of searchers outside Japan, as other language lists that are hiragana and katakana characters are also included in Ap­ available over the Internet. It is crucial that these lists be pendices Band C,7 respectively, with the same headings. accessible to researchers in computer database format to Cumulative percentage ofkanji character use. The help make the control and selection ofword and kanji fre­ cumulative frequency ratio of kanji characters, ranked quency simpler, more effective, and more accurate. from high to low frequency, is shown in Table 2. Although In 1994, the situation for Japanese corpus linguistics it has been conventionally said among learners and changed for the better, when CD-ROM databases ofnews­ teachers of Japanese that knowledge of 2,000~3,000 paper articles became available at a relatively low cost. kanji characters is required for one to read Japanese news­ These computerized corpora ofnewspapers made it pos­ papers, this conventional wisdom was not supported by sible to develop an updated kanji frequency list accessible the present data. According to the results of the present on computers and over the Internet. analysis, the top 500 most frequent kanji characters ac­ The purpose ofthe present project is to develop a new counted for approximately 80% oftotal kanj i use. Further­ kanji character frequency list to be made accessible more, the top 1,600 most frequent characters covered 99% JAPANESE CHARACTER FREQUENCY LIST 485

Table 1 developed from a single newspaper. Although the news­ Printable Character Type Totals paper used in the present project is a major Japanese news­ Characters Types Tokens Frequency Ratio (%J paper that currently publishes 10 million copies every Kanji 4,476 23,408,236 41.38 day for 120 million people in Japan (one paper is usually Hiragana 83 20,711,361 36.62 read by more than one person), there may be some bias Katakana 86 3,608,288 6.38 in terms ofthe selection oftopics and style. Goto (1995) and symbols 99 7,406,035 13.09 10 1,169,902 2.07 warned ofthis problem for researchers using a corpus de­ Latin 52 259,773 0.46 rived from newspapers, arguing that newspapers usually focus heavily on politics, economics, and foreign policy. As a result, vocabulary related to those areas may appear oftotal use and the rest-that is, the next 3,000 charac­ in newspapers more often than in other types ofpublished ters-made up only 1% ofthe total kanji use. materials. In addition, each newspaper has its own regu­ The cumulative frequency ratio obtained from the lations or standards for choosing words and expressions. 1966 corpus (i.e., the NLRI 1976 list) is also indicated in Furthermore, the number of newspaper writers, compared Table 2. The results from the two corpora, which are with the number ofreaders, is limited, and consequently, 30 years apart, indicate almost identical ratios for each the effects ofidiosyncratic writing styles and word selec­ frequency level. Furthermore, according to the current tion may be exaggerated, causing a bias in any analysis frequency list, ofthe 500 most frequent characters ranked based exclusively on such data. Since the present char­ in 1966, 445 characters are also ranked among the 500 acter list was made from a corpus ofnewspapers, future most frequent characters in the 1993 corpus, but all of research should be aimed at employing a variety of genres those 55 characters lower than the first 500 fall within the and types ofpublications to develop linguistic corpora. 1,000 most frequent characters in the 1993 corpus. Thus, To examine this issue ofthe bias in the present news­ the high-frequency kanji characters that account for over paper corpus, a correlation analysis ofkanji frequency was 80% of the total kanji use have not changed much in the conducted among the present list developed with 1993 last 30 years. newspapers, the NLRI 1962 list with 1956 magazines, Correlation coefficient of chronological character and the NLRI 1976 list with 1966 newspapers. The re­ use. To examine change in kanji usage over the past 30 sults indicated a significant and high correlation be­ years, a correlation analysis was conducted between the tween the 1956 magazine list and the 1966 newspaper present 1993 list and the NLRI 1976 list based on the 1966 list (r = .92,p < .01), and between the 1956 magazine list newspaper corpus. The raw frequencies of the 3,000 and the 1993 newspaperlist, (r= .89,p < .01; see Table3). highest frequency characters in the present list'' and the The higher correlation between the 1956 and 1966 lists equivalent characters in the NLRI 1976 list were con­ was probably due to the closer timing ofdata collection, verted to their base-1 0 logarithmic equivalents? and sub­ compared with that between the 1956 and 1993 lists. mitted to a Pearson correlation coefficient analysis. The However, these high correlation results may indicate that result indicates a high correlation (r = .95) and was sig­ the data collected from a single newspaper in this present nificant [F(l,3029) = 28,037.67,p < .01]. In Figure 1, analysis may be a reliable resource for predicting kanji although low-frequency characters scatter more than high­ character frequency in printed mass media in general. frequency characters, the distribution is close to linear.t? To examine the reliability ofthe present list, an addi­ Thus, on the basis ofthe analysis, it appears that the over­ tional correlation analysis was conducted with the kanji all pattern of kanji usage has not significantly changed characters grouped according to frequency level. The over the past 30 years. 3,000 ranked characters were divided into three groups: (l) high-frequency characters ranked from 1 through Discussion 1,000, (2) mid-frequency characters ranked from 1,000 In the present project, the authors introduced the first computer-based kanji character frequency lists derived Table 2 Cumulative Frequency Ratio ofKanji Character Use from the largest corpus of Japanese newspaper texts. It is important to control the frequency ofkanji characters Cumulative Frequency Ratio (%J in psycholinguistic studies, although researchers have Frequency Rank 1993 1966 often neglected this variable, owing to the unavailability 10 10.00 10.61 50 27.41 27.64 ofreliable frequency lists. Using the present list, the au­ 100 40.71 40.15 thors have compared kanji usage observed in data from 200 57.02 56.05 1966 and 1993. Although changes in language usage are 500 80.68 79.42 often discussed in Japanese linguistic studies in terms of 1,000 94.56 93.88 1,500 98.63 98.40 syntax, phonology, lexicography, and pragmatics, the 2,000 99.72 99.63 usage of kanji characters did not show any significant 2,500 99.92 99.90 variation over the past 30 years. 3,000 99.97 99.98 However, there are several issues left to discuss in the Note-Frequency rank ~ size of the group of highest frequency char­ present study. First, the corpus ofthe present word list was acters. 486 CHlKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

6

s ....., ,.-.. .--

to c 4 Q) :::; zr + ....Q) + + -.- + + ... ..J. S 3 .+. .,. + (Q + * J +.. * .... +! '-"' 0 * dpl+ *. + ++ 0'1 0 Co 2 ~ III ++:~:+:::+ · (Y) + + (J) (J) !'t>*1tt.,... ~ .-- .~ *i +++ ~+ ~. ..:

O:}----+----+----+----+------I 2 3 4 s 1966 [log10 (raw frequency+1)]

Figure 1. Kanji usage correlation distributions of top 3,000 characters between 1966 and 1993. though 2,000, and (3) low-frequency characters ranked The size ofthe 1993 kanji corpus was much greater than from 2,001 through 3,000. Although the correlations vary, the other two, containing 23,408,236 tokens and 4,476 depending on the levels and lists, all were significant at types. The 1966 list was compiled on the basis of991,375 the p < .01 level (see Table 4). kanji tokens and 3,213 types, and the 1956 list was based The results indicate high correlations at the high­ on 280,094 kanji tokens and 3,328 types. Since the pres­ frequency levels between the present list and the other ent correlation analysis was conducted with the 3,000 two lists. Notably, a high correlation ofthe 1993 news­ highest ranked characters in the 1993 list, some charac­ paper list with the 1966 newspaper list at the high­ ters did not even exits in the 1966 or 1956 list. For in­ frequency level (r = .92) indicates again that the present stance, among the 38 characters ranked at 2,996 in the list can be a reliable resource, especially at the high­ 1993 list, 17 characters were not included in the 1956 frequency levels, although the data were collected from a list, and 23 characters were not included in the 1966 list. single newspaper. Thus, the size difference in the corpus seems to be the In contrast, at the low-frequency levels, the correla­ most probable reason for the low correlation between the tions ofthe present list are low with the 1966 list (r = .43) lists. If that is the case, the largest corpus could be the and with the 1956 list (r = .28). This may imply a drastic most reliable information for low-frequency characters­ change ofkanji usage over the decades or some unrelia­ that is, the present list can be plausibly used as the most bility ofthe present list at the low-frequency level. The first interpretation is easily dismissed, considering the Table 3 overall high correlations with other lists, as indicated in Correlation ofKanji Frequency Among 1993, 1966, Table 3, and the similar low correlation between the 1966 and 1956 Lists With All 3,000 Ranked Characters and 1956 lists at low-frequency level (r = .40), which 1993 N 1966 N 1956 M were developed relatively close in time. Does this mean List r r r that the present list is not reliable at the low-frequency 1993 N 1.00 list? That does not have to be the conclusion. The most 1966 N .95 1.00 likely interpretation is that the low correlations are due 1956M .89 .92 1.00 to the differences in size ofthe corpora ofthe three lists. Note-N, newspapers; M, magazines. JAPANESE CHARACTER FREQUENCY LIST 487

Table 4 sources for language researchers. In 1. A. Edwards & M. D. Lampert Correlation of Kanji Frequency Among 1993, (Eds.), Talking data: Transcription and coding in discourse research 1966, and 1956 Lists With High-, Mid-, (pp. 263-310). Hillsdale, NJ: Erlbaum. and Low-Frequency Ranked Characters EKO,R., & NAKAMIZO, S. (1989). Coded representations ofkanji, kana 1966N 1956M and figures. Japanese Journal ofPsychology, 60, 265-268. FLORES D'ARCAIS, G. B., & SAITO, H. (1993). Lexical decomposition of List Frequency r r complex kanji characters in Japanese readers. Psychological Re­ 1993N high .92 .80 search, 55, 52-63. mid .68 .55 FLORES D'ARCAIS, G. B., SAITO, H., & KAWAKAMI, M. (1995). Phono­ low .43 .28 logical and semantic activation in reading kanji characters. Journal 1966N high 1.00 .86 ofExperimental Psychology: Learning, Memory, & Cognition, 21, mid 1.00 .60 34-42. low 1.00 .40 FLORES D'ARCAIS, G. B., SAITO, H., KAWAKAMI, M., & MASUDA, H.

Note-N, newspapers; M, magazines. High = characters ranked from (1994). Figural and phonological effects in radical migration with 1st through 1,000th. Mid = characters ranked from 1,001st through kanji characters. Advances in the Study ofChinese Language Pro­ cessing, 1, 241-254. 2,000th. Low = characters ranked from 2,001st through 3,000th. GOTO, H. (1995). Gengo kenkyu tame no data toshite no corpus no gainen tsuite [Corpus for linguistic research]. Tohoku University Linguistics Journal, 4, 71-87. reliable resource. Thus, the correlation analysis among GRAINGER, J., & JACOBS, A. M. (1993). Masked partial-word priming in the three lists can support the high reliability ofthe pres­ visual word recognition: Effects ofpositional letter frequency. Jour­ nal ofExperimental Psychology: Human Perception & Performance, ent list. 19,951-964. Another issue relating to the present project is the re­ HATTA, T, KOIKE, M., & LANGMAN, P. (1994). Laterality ofmental im­ liability of a computerized Japanese corpus (i.e., CD­ agery generation and operation: Testing with brain-damaged patients HIASK'93), transformed from the original hard copy and normal adults. Journal ofClinical & Experimental Neuropsy­ chology, 16,577-588. form (i.e., the Asahi newspaper). There are few instances HAYASHI, R. (1988). The role of semantic attributes of the distractor in which kanji characters appearing in the newspaper did word in the script type effect on Stroop color-word interference task. not appear encoded as kanji in the current CD-ROM cor­ Japanese Journal ofPsychology, 59, 1-8. pus. The original locations of those characters in the HIROSE, H. (1992). An investigation of the recognition process for jukugo by use of priming paradigms. Japanese Journal ofPsychol­ newspaper were marked by symbol characters, such as ogy, 63, 303-309. the "geta mark" (=) in the electronic format (Yokoyama, KESS, J., & MIYAMOTO, T. (1994). Japanese psycholinguistics: A clas­ Sasahara, Nozaki, & Long, 1998). This is due to several sified and annotated research bibliography. Amsterdam: Benjamins. revisions made in JIS, 11 which is an electronic character KIKUCHI, T (1996). Detection of kanji words in a rapid serial visual encoding standard and assigns a code to approximately presentation task. Journal ofExperimental Psychology: Human Per­ ception & Performance, 22, 332-341. 6,300 kanji characters. Although only a few hundred of KINDAICHI, K. (1991). Shimeikai kokugojiten [New Japanese word dic­ these cases were observed out of23 million kanji tokens tionary]. Tokyo: Sanseido. in the current CD-ROM corpus and these were among KUCERA, H., & FRANCIS, W.N. (1967). Computational analysis ofpresent­ the lowest frequent characters, this inconsistency between day American English. Providence : Brown University Press. LEECH, G., & FLlGELSTONE, S. (1992). Computers and corpus analysis. printed and electronic data formats should be kept in mind In C. S. Butler (Ed.), Computers and written texts (pp. 115-140). Ox­ when low-frequency characters are used or focused on. ford: Blackwell. In the present project, a new kanji character frequency MATSUNAGA, S. (1996). The linguistic nature ofkanji reexamined: Do list was developed with the largest newspapers corpus in kanji represent only meaning? Journal ofthe Association ofTeach­ Japanese corpus linguistic history. Although there are ing ofJapanese, 30, 1-22. MORIKAWA, Y. (1985). Stroop phenomena in the : still some issues left, the present list provides useful in­ II. Effects of character-usage frequency and number of strokes. In formation on kanji usage in the current printed media. H. S. R. Kao & R. Hoosain (Eds.), Linguistics, psychology & Chinese Furthermore, the present list has become available and language (pp. 73-80). Hong Kong: University of Hong Kong Centre accessible in computer database over the Internet, 12 which ofAsian Studies. MOROHASHI, T (1989). Dai kanwajiten [Japanese kanji character dic­ helps researchers control and interpret logographic psy­ tionary]. Tokyo: Taishuu-. cholinguistic studies more accurately and efficiently. MORTON, J., SASANUMA, S., PATTERSON, K., & SAKUMA, N. (1992). The organization ofthe lexicon in Japanese: Single and compound kanji. REFERENCES British Journal ofPsychology, 83, 517-531. NAGAE, S. (1994). Semantic processing functions of kanji and kana ApPLEMAN, I. B., & MAYZNER, M. S. (1981). The letter-frequency ef­ words in the right hemisphere. Japanese Journal ofPsychology, 65, fect and the generality offamiliarity effects on perception. Perception 144-149. & Psychophysics, 30, 436-446. NAITO, M., & KOMATSU, S. (1988). Attributes ofmemory that mediate CABEZA, R. (1995). Investigating the mixture and subdivision of per­ priming effects in perceptual identification. Japanese Journal ofPsy­ ceptual and conceptual processing in Japanese memory tests. Mem­ chology, 58, 352-358. ory & Cognition, 23, 155-165. NAITO, M., & KOMATSU, S. (1989). Effects ofconceptually driven pro­ CARROLL, J. B., DAVIES, P., & RICHMAN, B. (1971). The American Her­ cessing on perceptual identification. Japanese Psychological Re­ itage wordfrequency book. New York: American Heritage. search, 31, 45-56. COULMAS, F. (1989). The writing systems ofthe world. Oxford: Black­ NAKAGAWA, A. (1994). Visual and semantic processing in reading kanji. well. Journal ofExperimental Psychology: Human Perception & Perfor­ EDWARDS, J. A. (1993). Survey of electronic corpora and related re- mance, 20, 864-875. 488 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1962). Gendai YOKOSAWA, K., & SHlMOMURA, M. (1993). On the role ofstimulus sim­ zasshi 90shi no yogo yoji [The total vocabulary and their written ilarity and segmentation in misprint detection. In D. Brogan, A. Gale, forms in ninety magazines oftoday]. Tokyo: Shuuei-sha. & K. Carr (Eds.), Visual search 2 (pp. 371-378). London: Taylor & NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1976). A study Francis. ofthe use ofChinese characters in modern newspapers (The National YOKOYAMA, S. (1997). Hyoki to kioku [Orthography and free recall] Language Research Institute Report 56). Tokyo: Shuuei Shuppan. (Japanese Psychological Monographs 26). Tokyo: Japanese Psycho­ NATIONAL LANGUAGE RESEARCH INSTITUTE OFJAPAN (1997). Gendai logical Association. zasshi 90shi no yogo yoji: FD format [The total vocabulary and their YOKOYAMA, S., SASAHARA, H., NOZAKI, H., & LONG, E. (1998). Shin­ written forms in ninety magazines oftoday]. Tokyo: Sanseido. bun denshi media no kanji [A study ofkanji in electronic newspaper NICHIGAI ASSOCIATES, & AsAHI [NEWSPAPER] (1994). CD-HIASK'93. media]. Tokyo: Sanseido. Tokyo: Kinokuniya. NOZAKI, H., CHIKAMATSU, N., & YOKOYAMA, S. (1997). Compiling NOTES katakanafrequency listsfrom Japanese newspaper corpus. Unpub­ lished manuscript. Aichi University ofEducation, . I. The set ofKyoiku kanji contains 881 characters to be taught in el­ OSAKA, M. (1992). Effect of memory set-size upon event related po­ ementary schools. These characters were selected by the Ministry of tentials for concrete and abstract kanji stimuli. Perceptual & Motor Education in 1948. This set was revised, and 1,006 characters were ­ Skills, 75, 401-402. lected and defined as Gakushu kanji for the same purpose in 1989. PARADIS, M., HAGIWARA, H., & HILDEBRANDT, N. (1985). Neurolinguis­ 2. Verbs, adjectives, and adverbs are often written in a combination tic aspects ofthe Japanese writing system. New York: Academic Press. of kanji and kana characters. The stem ofthose words, which does not SAKUMA, N., ITOH, M., & SASANUMA, S. (1989). Recognition units of change in conjugation, is written in kanji, and the inflectional endings kanji words: Priming effects on kanji recognition. Japanese Journal in hiragana. For instance, 1t""oQ Ita-berul(to eat), *~L' loo-kiil ofPsychology, 60, 1-8. (big), and !!!< Ihaya-kul (early) are all written with the first character SASANUMA, S., SAKUMA, N., & KITANO, K. (1992). Reading kanji with­ in kanji and the rest in hiragana. out semantics: Evidence from a longitudinal study ofdementia. Cog­ 3. Owing to Japanese copyright ownership issues, it may not be fea­ nitive Neuropsychology, 9, 465-486. sible to provide the 1976 NLRI kanji list in computer database form in SEKIGUCHl, H., & ABE,I. (1992). Functional hemisphere differences in the near future. recognition ofwords expressed in kanji. Japanese Journal ofEduca­ 4. Headlines were excluded from the present analysis because incon­ tional Psychology, 40, 3 I5-322. sistencies were observed in headlines in hard copy versions ofthe news­ SHIMOMURA, M., & YOKOSAWA, K. (1991). Processing ofkanji and kana papers, as compared with headlines downloaded from the CD-ROM. characters within Japanese words. Perception & Psychophysics, 50, 5. In addition to the articles downloaded from CD-ROM, 114 articles 19-27. were input manually into the database, since they were not included in SOLSO, R. L., JUEL,C., & RUBIN, D. C. (1982). The frequency and ver­ the CD-ROM. satility of initial and terminal letters in English words. Journal ofVer­ 6. These printable characters include kanji, hiragana, katakana, Latin bal Learning & Verbal Behavior, 21, 220-235. , Arabic numerals, punctuation, and symbols (commas, peri­ SRINIVAS, K., ROEDIGER, H. L., III, & RAJARAM, S. (1992). The role of ods, quotation markers, etc.). syllabic and orthographic properties of letter cues in solving word 7. Appendix C, katakana frequency list, is cited from Nozaki, Chika­ fragments. Memory & Cognition, 20, 219-230. matsu, and Yokoyama (1997). THORNDIKE, E. L., & LORGE, I. (1944). The teacher 50 word book of 8. The first 3,000 ranks included 3,031 characters in the 1993 list. 30,000 words. New York: Columbia University, Teachers College 9. Logarithmic transformation was used because the data were posi­ Press. tively skewed-that is, there were more low-frequency characters and TSUZUKl, T.(1993). Effects ofcontext-dependent and context-independent fewer high- frequency characters. A value of I was added before the log­ associative strength between prime and target words on the processing of arithmic transformation,-that is, log I0 (raw frequency+ 1) instead of lexical ambiguity. Japanese Journal ofPsychology, 64, 191-198. 10gi0 (raw frequency). This is because many oflowest frequency char­ UKITA, J., SUGISHIMA, I., MINAGAWA, N., INOUE, M., & KASHU, K. acters in the 1993 list were not found at all in the 1966 list (i.e., zero oc­ (1996). Nihongo no hyoki keitai ni kansuru shinrigaku kenkyuu [Re­ currence), which would make the logarithmic transformation impossi­ search on written forms ofJapanese words] (Japanese Psychological ble. Monographs 25). Tokyo: Japanese Psychological Association. 10. No value of logarithmic transformation was plotted between 0 WANG, J. (1988). Do phonological and semantic processing ofkanji fin­ and I in the y-axis because all top 3,000 ranked characters occurred ish at the same time? Japanese Journal ofPsychology, 59, 252-255. more than 13 times in the 1993 list and the value oflogIO(raw frequency WYDELL, T. N., BUTTERWORTH, B., & PATTERSON, K. (1995). The in­ + I) was more than I. consistency of consistency effects in reading: The case of Japanese II. lIS has been revised four times over the last two decades, in 1978, kanji. Journal ofExperimental Psychology: Learning, Memory, & 1983, 1990, and 1997. In each revision, a few new codes were added or Cognition, 21,1155-1168. replaced with old codes for kanji characters. Consequently, although it WYDELL, T. N., PATTERSON, K. E., & HUMPHREYS, G. W. (1993). is very rare, some characters may appear different, depending on the Phonologically mediated access to meaning for kanji: Is a rows still version used for decoding on computers. For instance, four kanji char­ a rose in Japanese kanji? Journal ofExperimental Psychology: acters that are part of the 1983 JIS standard were replaced by symbol Learning, Memory, & Cognition, 19,491-514. characters on the CD-ROM. For further discussion, see Yokoyama et al. YAMADA,J., MITARAI, Y.,& YOSHIDA, T. (1991). Kanji words are easier (1998). to identify than katakana words. Psychological Research, 53, 136­ 12. The presently developed kanji character frequency list is available 141. at http://nozaki-Iab.ics.aichi-edu.ac.jp/on the World-Wide Web. JAPANESE CHARACTER FREQUENCY LIST 489

APPENDIX A Kanji Character Frequency List

R C r R C F RC F RCF R C F R C F 1 B 336465 51 1't 79543 101 1& 50477 151 M 36776 201 [Ii] 29488 251 W 25212 2 285089 52 13 79040 102 ffl 50447 152 3t 36750 202 *li 29372 252 m 24992 3 + 254534 53 iI1li 76965 103 N1 50357 153 f1:J 36728 203 Ili'iJ 29329 253 ~ 24944 4 223075 54 pq 76927 104 !JiI. 49844 154 1JQ 35822 204 ~ 29150 254 m 24852 5 A 218967 55 Ei 76402 105 $ 49730 155 rfij 35672 205 29075 255 ~ 24761 6 218693 56 ~ 75885 106 ~ 49272 156 ~ 35583 206 *uuI=l 28933 256 ~ 24643 7 *4p 216931 57 -t; 75735 107 *£ 49270 157 Jf 35472 207 ffl 28881 257 ~ 24601 8 ~ 214989 58 ;1< 25754 290 ~ 21612 ..i.. 41 1\ 85900 9/ ~ 54048 /4/ DR 38404 191 En 31087 241 ¥ 25640 29/ ~ 21533 42 ~ 85767 92 ~ 54047 /42 ,~, 38272 192 ~ 30621 242 }JIJ 25521 292 11 21530 43 ~ 84934 93 ~ 53796 /43 J;J. 37777 193 T 30617 243 ~ 25465 293 'ff.,. 21472 44 tIT 84073 94 l!!1 52743 {44 5i 37577 /93 ~ 30617 244 If: 25377 294 ~ 21456 45 ~ 83847 95 1f. 52129 /45 3jZ 37547 195 Ii!JI 30581 245 ~ 25330 295 1* 21338 46 A 82282 96 {t 52076 146 t~ 37433 196 it 30563 246 25292 296 :m 21211 47 ~ 81652 97 ltIIJ 51942 147 m 37309 /97 !f!IJ 30441 247 •@: 25281 297 ~ 20973 48 fI3 80909 98 jiJf 51314 148 8b 37052 198 'Ji:. 30439 248 ~ 25279 298 -g 20969 1ljl 49 ='T 80845 99 Ij\ 51107 149 tiS 36971 /99 jl§ 30405 249 *lE 25273 299 20783 50 79738 100 /F 50768 150 ~ 36947 200 m: 30219 250 II 25253 300 A 20690 27.4104 40.7119 CF 49.9366 CF 57.0228 CF 62.8109 CF 67.7131 CF * --- CF 490 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued) R C F R C F R C F R C F R C F RC F 301 L!: 20667 357 l& 17216 407 IT 14694 451 I: 12884 501 ~ 11241 551 1< 9865 302 Ii: 20579 352 & 17108 402 it 14668 452 :Wi 12865 502 fjj 11203 552 ff!!. 9828 303 20557 353 m 17106 403 iiI 14659 453 :Ibi. 12820 503 ~ 11197 553 i'I.tJ 9827 304 $'* 20507 354 ,@ 17040 404 {t: 14601 454 ~ 12773 504 ~ 11174 554 ij( 9805 305 jIfIJ 20281 355 bi!i 16982 405 1'j- 14566 455 #Ii 12765 505 ~ 11130 555 n 9796 306 ~ 20234 356 f;t 16925 406 om 14564 456 I1f 12713 506 ~ 11 I15 556 :w: 9792 ~ llli 307 :P 20100 357 *It 16851 407 Ii~ 14471 457 12706 507 t~ 11066 557 ~~.. 9780 308 ± 20027 358 iE9 16798 408 ~ 14383 458 ~ 12685 508 W. 11047 558 ~2 9768 309 iJii 19987 359 16746 409 14369 459 12631 509 W 11046 559 jj. 9762 310 1f 19917 360 k1<* 16712 410 •IlR 14335 460 *~ 12610 510 WI 11037 560 {~ 9722 311 ~ 19884 361 lm 16709 411 ~ 14264 461 ffej 12585 511 5!l! 11020 561 fpJ 9711 372 :0 19633 362 ~ 16660 412 ~ 14262 462 $ 12552 512 11012 562 t-t 9708 373 ~ 19561 363 ~ 16657 413 :fit 14202 463 {!!; 12544 513 1IIl* 10955 563 .ijj. 9699 314 ~ 19517 364 ~ 16610 414 ~ 14129 464 l!i 12482 514 ltt 10871 564 j! 9684 375 ~ 19363 365 ~ 16602 415 a 14120 465 '¥ 12461 515 tl 10822 565 9642 316 ~\ 19359 366 M 16563 416 11}] 14113 466 )il 12443 516 1$ 10796 566 **jl}J 9634 317 ilii 19320 367 W 16520 417 1m 14053 466 £ 12443 517 1i 10795 567 ~ 9624 378 lit 19254 368 ~j£ 16463 418 ~ 13949 468 1Ii! 12393 518 Vi- 10790 568 ~ 9595 319 jiIff 19246 369 ~ 16324 419 13933 469 fj 12334 519 :k 10752 569 *-: 9546 320 ~ 19104 370 ~ 16308 420 ~* 13911 470 ~ 12259 520 ~ 10695 570 ~~ 9532 321 :l* 18976 371 iR 16280 421 m 13818 471 Jil 12175 527 1i 10644 571 ~ 9444 322 ~ 18815 372 1N 16259 422 ~ 13817 472 $ 12148 522 S 10564 572 ~ 9436 323 m 18796 373 ~ 16201 423 JfIi 13803 473 ~ 12137 523 ~ 10563 573 !A 9377 324 t'l. 18746 374 .fF 16165 424 ~ 13788 474 § 12125 524 IDi 10557 574 ~ 9337 Tr. ±: 325 iiJf 18714 375 1Jl, 16136 425 ~ 13711 475 13 11987 525 ill 10535 575 <=t 9317 326 ~ 18702 376 ~ 16093 426 !i~ 13708 476 )J! 11983 525 im 10535 576 /.iii 9314 327 JfX 18605 377 ~ 16063 427 P 13646 477 'iT 11968 527 n>¥ 10451 577 ~ 9311 328 W 18596 378 ~ 16052 428 1:t 13624 478 it 11946 528 I4l 10447 578 :i1[ 9310 329 Wl 18557 379 :If 16023 429 1'iIJ 13525 479 ~ 11923 529 ~ 10345 579 ~ 9281 330 'fjl: 18521 380 ~ 16017 430 iJt 13490 480 ~ 11863 530 ftiji 10289 580 l3€ 9254 331 1l~ 18513 381 M 16005 431 tt 13480 487 ~ 11808 531 5E 10288 581 ~ 9160 332 It 18385 382 15978 432 ~ 13479 482 ::t: 11804 532 !In: 10249 582 ~ 9104 333 iR 18379 383 1;* 15916 433 J.f! 13458 483 X- 11756 533 ~ 10228 583 9101 334 ffil 18302 384 fftl 15729 434 ~ 13379 483 $ 11756 534 10224 584 •i# 9083 335 ~ 18194 385 {Iffl 15708 435 ~ 13312 485 ~ 11739 535 Wi 10220 585 ~ 9081 336 I/tf 18109 386 11]" 15703 436 t5t 13292 486 M 11721 536 ~ 10197 586 i'& 9009 337 ff 18075 387 Wf 15625 437 :I: 13285 487 11681 537 ~ 10107 587 00 9000 338 m 18056 388 .fJt 15566 437 p~ 13285 488 *m 11648 538 fJiX 10094 588 JpJ: 8998 339 EB 17958 389 15514 439 W, 13241 489 ~ 11624 539 {~ 10093 589 fJ< 8989 340 Ilt 17877 390 •5ll 15401 440 m 13219 490 It 11516 540 ft 10057 590 8975 ".,. 341 ~ 17855 391 F 15245 44/ ~ 13194 491 Jll 11465 541 fti 10029 591 •llJ 8951 342 U; 17850 392 ~ 15241 442 itt 13150 492 ~ 11440 542 ~ 10023 592 )6J 8940 343 'Ff 17845 393 ~ 15190 443 ,. 13125 493 ~ 11433 543 !it 9992 593 ~ 8902 344 Ji 17785 394 1'8. 15101 444 ~ 13069 494 W 11398 544 [lW 9991 594 ti 8892 345 *- 17655 395 9'J 15078 445 ~ 13056 495 :ffi 11388 545 m 9986 595 8867 346 ± 17596 396 3i 14952 446 ~ 13043 496 1l 11306 546 ~ 9967 596 •~ 8856 346 ft 17596 397 ~ 14941 447 1Jf 13009 497 J(f. 11296 547 ... 9944 597 m 8847 348 !? 17527 398 ~ 14846 448 "f 13004 498 .it£: 11293 548 71f; 9908 598 M 8815 349 12 17505 399 IW 14835 449 ~ 13000 499 frJ 11289 549 :IX 9882 599 ~ 8737 350 %" 17362 400 ~ 14780 450 ~ 12958 500 £. 11273 550 ~ 9873 600 -1m 8733 CF 71.7398 CF 75.1727 CF 78.1088 CF 80.6841 CF 82.9314 CF 84.9251 JAPANESE CHARACTER FREQUENCY LIST 491

APPENDIX A (Continued)

R C F R C F R C F R C F R C FR C F 601 t: 8727 65/ ffl 7701 701 ~ 6760 751 ~ 6064 801 f.&, 5313 85/ ii 4810 602 ~ 8705 652 fj 7693 702 fj 6748 752 ~¥ 6063 801 ~ 5313 852 t: 4798 603 7E 8703 653 ~ 7692 703 W 6740 753 W 6062 803 ~ 5300 853 it 4796 604 rI 8700 654 J!! 7691 704 *& 6736 754 ~ 6021 803 ~ 5300 854 m 4787 605 ~ 8656 655 i1t 7654 705 it 6735 755 ~ 6009 805 :l1Ji 5293 855 j'jIJ 4780 605 $) 8656 656 ;n:;-=- 7624 706 m 6728 756 II 6003 806 ~ 5292 856 @; 4760 607 ri!i 8652 657 ~ 7616 707 ~ 6725 757 tb 5980 807 i1IU 5280 857 ~! 4759 608 Jffi 8614 658 ~ 7594 708 ~ 6689 758 m 5949 808 5273 858 llt 4754 609 tw 8547 659 t.:. 7545 709 :f.t 6683 759 f'€f 5939 809 •fjJ 5270 859 ~ 4747 610 14k 8543 660 ~ 7507 710 ~ 6664 760 'W 5920 809 ilJ 5270 860 1# 4724 c±r ~ 611 s: 8533 661 $ 7503 711 :0: 6632 761 IlB 5845 811 A 5254 861 }f 4720 612 tv 8517 662 ~ 7492 712 6627 762 ~ 5831 812 !lEt 5251 862 ~ 4716 6/3 ~ 8505 663 ~ 7454 713 •~ 6625 763 J.'R 5817 813 f'? 5231 863 ~ 4715 6/4 f@ 8477 664 gt 7440 714 ~ 6623 764 i!t 5811 814 ~ 5217 864 m 4710 615 ft\ 8473 665 ][ 7423 715 T 6590 765 1ft 5768 815 ij#' 5208 865 ~ 4701 ~ 8472 666 :j:f 7422 716 6556 766 lilri 5766 $A i:1iil. 616 tit 816 5201 866 ,.., 4675 617 7$ 8468 667 ~ 7402 717 ;pJ 6552 767 :jffI 5747 817 $! 5196 867 fl. 4674 618 ~ 8448 668 ~ 7382 718 L: 6545 768 lfu 5742 8/8 ~ 5172 868 *:g 4672 619 ~ 8444 669 ~ 7376 719 J!g 6536 769 J$ 5738 8/9 1m 5169 869 tlJ 4663 619 J6J 8444 670 : 7357 720 Wt 6529 770 ~ 5727 820 m 5163 870 *f1l 4637 621 if 8443 671 ~ 7351 721 1* 6527 771 JIll 5724 82/ rjfj 5158 87/ jp 4627 622 {df1 8430 672 ~ 7339 722 W 6524 772 Wi! 5703 822 M 5143 872 W 4623 623 1l' 8422 673 ~ 7285 723 M 6507 773 1lfl 5686 823 5141 873 % 4622 624 Mr 8327 674 *C 7224 723 # 6507 774 }tIJ 5650 824 •~U 5120 874 ~ 4612 625 il 8320 675 JriS 7201 725 jff1 6505 775 ~ 5647 825 m 5118 875 ??r 4595 626 m 8298 676 f~ 7188 726 ~ 6501 776 ~ 5635 825 ,!t 5118 876 *~ 4591 627 ~ 8248 677 !i& 7176 727 ~ 6488 777 ~ 5618 827 ~ 5109 877 sfj 4585 628 ~ 8194 678 ~ 7175 728 tL 6480 778 1m 5601 828 jJ 5106 878 JIll 4553 629 @: 8165 679 if 7168 729 th 6472 779 ~ 5590 829 m 5094 879 Ij\ 4532 630 tit 8156 680 llt 7135 730 ;6;.m 6447 780 ~ 5587 830 ~ 5082 880 1m 4521 631 8129 68/ tli 7106 731 £ 6380 780 fflJ 5587 831 [{ili 5080 881 fll; 4513 632 ~* 8125 682 fB 7083 732 rF 6372 782 i1J! 5584 832 f~ 5064 882 m 4512 633 ~ 8106 683 j)!; 7069 733 E§ 6355 783 ~ 5572 833 fl 5056 883 ~ 4499 .",. 634 ,." 8102 684 ;'fi 7061 734 f$ 6332 784 m~ 5568 834 lilt 5037 884 ,~ 4492 635 ~j;\ 8101 685 ~ 7036 735 ~ 6327 785 iM 5547 835 JSL 5024 885 ~ 4466 636 Jl 8098 685 ~ 7036 736 ~ 6324 786 ill 5499 836 ~ 5021 886 *PJ 4452 637 18 8090 687 ,~ 7023 737 6310 787 ji! 5496 837 iE! 5018 887 ~ 4449 638 8069 688 J! 6959 738 m* 6276 788 ~ 5477 838 ~ 4984 888 ~ 4446 639 it 8047 689 x: 6921 739 :E 6251 789 tr 5458 839 it 4981 889 )l 4439 640 *L 8007 690 6910 740 It 6237 790 ~ 5439 840 ~~ 4967 890 Jill 4433 64/ itt 8002 690 'J~ 6910 74/ ::K 6233 79/ JlJ 5433 84/ ~ 4961 891 ~ 4432 642 M 7982 692 6902 742 rtlI 6225 792 ~ 5427 842 ~ 4930 892 ~ 4431 643 ~ 7968 693 *fi 6891 743 ~ 6224 793 IIIIi 5388 843 Ilt1 4918 893 !l 4407 644 ,~ 7943 694 &. 6874 744 !iii 6217 794 X 5346 844 ~ 4904 894 1: 4404 645 'If. 7915 695 ~ 6836 745 ~ 6210 795 W 5344 845 it 4887 895 ~ 4400 646 § 7908 695 1t 6836 746 is 6186 796 m 5333 846 ,~ 4874 896 ~ 4397 647 ~ 7901 697 JliJj 6796 747 m. 6159 797 ~ 5326 847 ~ 4862 897 ili1i 4387 648 m 7881 698 ~ 6791 748 ~p 6099 797 fit 5326 848 Z 4848 898 f'rl 4379 649 =Itf 7851 699 Wi 6780 749 ~ 6094 799 ~ 5324 849 tl 4837 899 ~ 4378 649 #t 7851 700 ~ 6779 750 '$ 6067 800 m 5314 850 Jf!. 4823 900 ~~ 4370 CF 86.6954 CF 88.2397 CF 89.6189 CF 90.8281 CF 91.9183 CF 92.8971 492 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued) R C F R C I' R C F RC F R CF R C F 90/ m 4368 951 :l't 3878 1000 3387 1051 ~ 2893 1/01 ~ 2556 1/51 ~ 2262 902 ttJ 4362 952 1l 3875 1002 #.<* 3376 1052 ID 2886 1/02 f! 2554 1/52 ~ 2260 ~ ~ 4> 903 4346 953 3866 1003 i!I! 3372 1053 JlI 2884 1/03 2552 1/53 '" 2259 904 e: 4345 954 •~ 3858 /004 m: 3362 1054 1Ji 2883 1/04 m 2548 1/54 tx 2244 ~± ~ 905 i2't 4304 955 aD 3854 1n05 m 3358 1055 :j!f 2882 1/05 IJ& 2537 1/55 Ml 2240 .- 906 ~ 4298 956 i1i 3853 1006 li'P 3355 1056 ~ 2876 //06 ~ 2528 1/55 "l! 2240 907 it 4293 957 IlJ 3840 1007 1$ 3344 1057 :J$ 2874 //06 m 2528 1/55 11( 2240 908 1'11 4284 958 7 3835 1007 Ii 3344 1058 ~ 2857 //08 r.t 2519 1/58 w!l 2239 909 ~ 4278 958 ~ 3835 1009 ~ 3329 1059 q: 2849 //09 C. 2515 1/59 )If 2237 909 ~ 4278 960 M; 3831 1010 ~ 3308 1060 ~ 2848 1110 1>] 2513 1/60 =§ 2236 9// i'l] 4271 961 iliIil 3827 1011 trf 3290 1061 ~ 2846 1111 ~ 2509 1/61 *L 2235 9/2 frj1 4262 962 r;h 3820 1012 Ilfi1 3287 1062 nl'l 2814 1/12 ~ 2501 1162 j;j,t 2232 913 fro 4261 963 ~ 3818 1013 m 3260 1063 ~ 2810 1113 ~ 2496 1163 ~ 2230 9/4 11- 4245 964 ~ 3817 1014 ~ 3254 1064 M 2807 11/4 1m 2495 1164 ~ 2229 915 tJj 4239 965 ~ 3802 1015 ~ 3253 1065 ml 2806 11/5 ~ 2480 1165 ~ 2201 916 Wi 4230 966 !jJ 3793 1016 m 3245 /066 ~ 2796 1/15 $ 2480 1/66 ~ 2189 917 :;; 4198 967 ~ 3773 1017 ~ 3238 1067 ~'J 2790 1/15 'flU 2480 1/67 1:!1 2183 918 fir 4190 968 ~ 3749 1018 lJf 3220 /068 ~ 2789 1/18 #f 2472 1168 m 2179 9/9 *,J 4188 969 ;(p 3731 10/9 ~ 3217 1069 lli 2788 11/9 ~ 2468 1/69 J£ 2162 920 #f 4174 970 ~ 3728 1020 AA 3194 1070 ~ 2787 1/20 ~ 2460 1/70 Jitif 2151 92/ ~ 4113 971 ji;j 3714 1021 fll; 3186 /071 ~ 2782 1/21 m 2457 1/7/ E2: 2149 92/ W. 4113 972 ~ 3707 1022 ~ 3185 1072 ~ 2770 1/22 ~ 2452 1/72 m 2148 92/ ~ 4113 973 !7 3704 1023 fa 3183 1073 IlX 2767 1/23 2448 1/73 2143 924 itt 4109 974 ~ 3691 1024 {IE 3149 /074 jill 2764 1/24 *~ 2441 1/74 i** 2141 925 W: 4095 975 fa; 3649 1025 J1 3131 1074 ,~ 2764 1125 {~ 2439 1175 {= 2139 926 ffl' 4077 976 M 3645 /026 ~ 3128 1074 !Iff 2764 1/26 z 2437 1/76 ~ 2135 ~ 927 'f 4069 977 ~ 3637 1027 i'i 3114 1077 2752 1127 ~ 2428 1I77 ~c..... 2133 928 n 4059 978 ~ 3634 1n28 16 3113 1078 *£ 2751 1/28 ~ 2413 1178 ~ 2129 929 Jij 4053 979 ~ 3598 /029 3106 1079 ~ 2744 1/29 ~ 2405 1179 ~ 2117 930 ~ 4036 980 !X 3591 1030 *'WlJ 3079 1080 fr{ 2739 1/30 tpp 2397 1/80 ~ 2108 93/ t«I: 4034 98/ $: 3586 103/ 1ll: 3078 108/ ltf 2737 1/31 ft 2390 118/ :Wl 2089 932 11' 4033 982 i!t 3572 /032 ~ 3077 1082 tfi. 2712 1132 I/l} 2388 1182 1l. 2081 933 m. 4031 983 ~ 3564 1033 fiib 3066 1083 ~ 2687 1133 2381 1183 fg; 2079 934 fJT 4024 984 5lIt 3556 1034 ~ 3055 /084 1ft 2679 //34 Il* 2375 1184 7E 2075 935 lltl 4019 985 ~ 3553 1035 iii 3042 1085 JFjq 2670 1135 ~ 2372 //85 flJ. 2062 936 Wi 4010 986 ~ 3548 1036 3033 1086 13: 2667 1136 ~ 2349 //86 ~ 2061 937 N.iIi 4009 987 t'2 3538 1037 •{JiJ 3032 1087 !Ii 2659 1137 Ji[ 2341 1/87 ~ 2057 938 ~ 3995 988 1W 3531 1038 III 3015 1088 ~ 2655 1138 ME 2335 1188 l:f 2056 939 m 3969 989 tit 3515 1039 ~ 3006 1089 Jijj 2650 1I38 ~ 2335 1189 ~ 2052 940 fl1fl 3963 990 fl 3511 1040 fOC 3004 1090 iii 2645 1140 ~ 2334 1190 ~X; 2043 941 aR 3961 99/ ~(!j 3509 1041 ~ 2993 109/ ~ 2639 /14/ ~J 2325 1/9/ ~ 2029 942 n 3957 992 ~ 3496 1042 2962 1092 ~ 2632 1142 ilii< 2320 119/ ~ 2029 943 nt 3953 993 {tj 3467 1043 •~ 2959 1093 ~ 2631 1142 ~ 2320 1193 llPJ 2028 944 5J, 3923 994 m 3465 1044 ~ 2948 /094 4:i 2622 1144 ~ 2312 1194 ~ 2025 945 M 3919 995 1W 3452 /045 tilQ 2925 1095 5£ 2621 1145 l'i!i 2303 1194 Ml 2025 946 ~~ 3913 996 {l.. 3451 1046 ~ 2923 1096 ~ 2599 ll46 ~ 2294 1196 ~ 2022 947 tR 3900 997 JJ! 3430 1046 itt 2923 1097 7/lf 2595 ll47 m 2281 1197 ~ 2019 948 W 3893 998 ~ 3412 1048 f,Ij 2918 1097 ~ 2595 1148 ~ 2274 ll98 llli 2011 949 W 3887 999 ~ 3393 1049 .Ii 2912 1099 lIT 2580 1148 ~ 2274 ll99 :it 2008 950 ~ 3880 1000 3387 1050 fI3 2902 1100 2559 1150 :till 2272 /200 ~ 2002 CF 93.7751 CF * 94.5564 CF 95.2278 CF * 95.8138 CF 96.3309 CF 96.7869 JAPANESE CHARACTER FREQUENCY LIST 493

APPENDIX A (Continued) R C F R C F R C F R C F R C F R C F 1201 liF 2000 1251 *- 1774 1301 m 1616 1351 ~ 1425 1401 M 1222 1451 ~ 1075 1202 if 1996 1252 ~ 1771 1301 ~ 1616 1352 r¥ 1424 1402 ~ 1221 1452 *,J 1070 1203 l!'P 1991 1253 1767 1303 1615 1353 R1l. 1408 1403 Jli 1212 1453 ifIJ 1067 1204 ~ 1982 1254 •fJt 1758 1304 iJil 1614 1354 ~ 1406 1404 #3 1210 1454 JJL 1063 1205 r:i 1959 1255 rAe 1756 1305 ~ 1612 1354 ~ 1406 1405 ~ 1204 1455 m 1060 1205 ~ 1959 1256 j:j[ 1754 1306 .Ii 1611 1356 ~ 1404 1405 :ill! 1204 1456 ~ 1056 1207 ~ 1957 1257 ~ 1748 1307 lit 1608 1357 it 1395 1407 tt 1201 1456 r\Jl 1056 ". 1208 m 1954 1258 ""1"1 1747 1308 ~ 1607 1357 m 1395 1408 ~ 1200 1458 1J1! 1054 1208 ilP 1954 1259 ~ 1744 1309 :f.!l. 1601 1359 R 1391 1409 1* 1199 1458 Iii 1054 1210 iWl 1952 1260 }il 1743 1310 {~~ 1600 1360 ~ 1389 1410 {'R 1195 1460 ~ 1052 1211 ij! 1950 1261 ~ 1741 1310 tt 1600 1361 *'J 1387 1410 ~ 1195 1461 rM 1046 1212 ~ 1946 1262 ~ 1737 1312 tt 1596 1362 ~ 1380 1410 Ili1 1195 1462 ~ 1045 1213 11 1943 1263 §'\ 1736 1313 m 1584 1363 rll 1377 1413 !IEl 1189 1463 '/[ 1041 1214 ~ 1941 1264 ~ 1734 1314 )jJ 1578 1363 '>l'~ 1377 1413 ~ 1189 1463 u 1041 1215 ;fiil 1940 1264 ~ 1734 1315 f'J, 1577 1365 ~ 1373 1415 ~ 1187 1463 )(IJ 1041 1216 ~ 1936 1266 ~ 1732 1316 .!} 1569 1366 til 1370 1416 iii 1185 1466 11 1040 1217 if! 1933 1267 1731 1317 ~ 1568 1367 M 1360 1417 1.! 1183 1467 r~ 1035 1218 ~ 1926 1268 •~ 1730 1318 r~ 1566 1367 foil 1360 1418 11 1181 1468 jrli 1031 1219 1£ 1923 1269 ~ 1729 1319 ~ 1562 1369 '$ 1354 1419 JlI!i 1179 1469 ~ 1030 1220 ~ 1922 1270 1726 1320 ~ 1560 1369 ~ 1354 1420 iff 1174 1470 ~ 1028 # 1221 $ 1919 1271 •/ffJJ 1706 1321 'Jt 1559 1371 ijj;JJ 1351 1421 j~'" 1172 1471 IE 1023 1221 fR 1919 1271 Jtill 1706 1322 ;tJI[ 1547 1372 rm- 1346 1422 rrr. 1166 1472 ~ 1019 1223 ~ 1915 1273 ~ 1704 1323 fj- 1541 1373 ~ 1345 1423 -5 1160 1473 fE 1017 1224 ~ 1905 1274 it 1688 1324 Y: 1538 1374 fA 1338 1424 JfJ 1159 1474 1't 1016 1225 nt 1904 /275 qJ)II; 1687 1325 1J~ 1531 1375 *5J 1332 1425 ~ 1151 1474 L![ 1016 1226 ~ 1897 1276 it 1682 1326 IE 1528 1376 ~ 1329 1425 ~J 1151 1476 ~ 1014 1227 M 1883 1277 tJ§ 1680 1327 1527 1377 ~ 1321 1427 Iii 1142 1476 Hi 1014 1228 M 1881 1277 1680 1328 •~ 1525 1378 ~ 1320 1428 tt 1140 1476 1.iE 1014 1229 ~ 1880 1279 .!ilIJ* 1677 1329 9'.t 1524 1379 ~ 1309 1429 IIJ{ 1139 1479 III 1013 1230 m 1872 1280 ~ 1676 1330 ~ 1521 1380 ~ 1306 1429 1ft 1139 1480 iW; 1008 1231 Mt 1871 1281 ill 1675 1331 fII.! 1517 1381 '11ri 1300 1431 ~ 1136 1481 IlJj. 1000 1232 ~ 1870 1282 r~ 1668 1332 1516 1382 ~ 1298 1432 ~Jl 1129 1482 ~ 997 1233 ""a 1867 1283 Mi 1661 1333 •'fff 1511 1383 m 1297 1433 fllIf 1127 1483 ~ 995 1234 iB 1864 1284 ~ 1658 1334 1\ 1507 1384 ~ 1292 1434 tr ll20 1484 FJi 987 1235 fk 1860 1285 tfi 1656 1335 ~ 1490 1385 III 1289 1434 it 1120 1485 3t 978 1235 5t. 1860 1286 ;Wi 1654 1336 ~ 1489 1386 ;jfjJ 1286 1436 ~ 1119 1486 ~~ 973 XL 1237 ~ 1859 1287 "!!. 1653 1337 g 1487 1387 Il& 1285 1436 I'ifJ 1119 1486 )ffi 973 1238 5If 1857 /288 m 1649 1338 ~ 1484 /388 ~ 1284 1438 ~ 1111 1488 '~ 972 1239 ~ 1849 1289 jW 1641 1339 1:£ 1483 1389 mi 1267 1438 11 1111 1489 ~'.i 970 1240 ~ 1842 1290 * 1640 1340 1482 /390 fJi 1264 1440 # 1110 1490 U~ 968 1241 m; 1841 1290 @ 1640 1341 '*~ 1481 139/ m 1257 1441 1Yj~ 1107 1491 3rji 964 /242 ~ 1829 1290 T 1640 1342 Jj~ 1478 /392 ~ 1256 /442 ~ 1102 1492 fJ& 963 1243 ~ 1825 1293 il 1639 /343 ~ 1476 1392 'tt 1256 1443 3ii 1094 1492 II!!l 963 1244 ltt 1819 /294 ~ 1637 1344 ~ 1475 /394 'Ilr 1255 1444 ifr 1086 1494 It! 959 1245 ~ 1804 1295 1'Jj] 1631 1345 ~ 1471 1395 ~ 1254 1445 .m 1085 1495 ~ 958 1246 It 1802 1296 ~ 1626 1346 )iJi 1456 1396 1* 1253 1446 11 1083 1496 ~ 957 1247 In[ 1799 1296 ~ 1626 1347 Wi 1449 1397 ~ 1246 1446 Ilfi!J 1083 /497 ;it 954 1248 1m 1788 1298 t# 1625 1348 ~ 1447 1398 ~ 1231 1448 MJj 1078 1498 ~ 953 1249 ~ 1785 1299 frl 1623 1349 if! 1437 1399 $X 1228 1449 1* 1077 1498 m 953 1250 M 1779 1300 kW. 1622 1350 im 1435 1400 1l 1223 1450 ~ 1076 1500 ~ 948 CF 97.1912 CF 97.5529 CF 97.8808 CF 98.1645 CF 98.4101 CF 98.6258 494 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued) R C r R C F R C F R CF R C F R CF 1501 Ilt 943 1551 rfj 818 1601 H 729 1650 ~ 649 1701 ~ 559 1750 m 484 1502 ~ 942 1552 lli 815 1602 ;!I 728 1652 m 648 1702 m 558 1750 f\1t 484 1503 f~ 940 1553 :R 814 1603 11 727 1653 ~ 645 1703 555 1753 n~ 483 1504 939 1554 ~ 813 1604 i1t 725 1654 i'.fl 639 1704 '*tM 554 1753 t~ 483 1505 ft** 937 1555 ~ 811 1605 ~ 722 1655 fJ< 636 1704 'I: 554 1755 f(.,'J 480 1506 ~ 936 1556 !Iff 809 1605 1€1 722 1656 :$ 634 1706 I!/ll 553 1756 ~ 475 1507 tff 935 1556 ~ 809 1607 721 1656 w: 634 1707 *~ 552 1757 It 474 1508 ff 932 1558 ~ 807 1608 :$• 719 1656 lb 634 1708 ~ 545 1757 ~~ 474 1508 P~ 932 1559 *1l 796 1609 I~ 716 1659 Z 632 1709 ~t1i 541 1759 :£ 472 1510 llJii 929 1560 .& 793 1610 !9! 715 1659 fliIi 632 1710 $ 540 1760 ~ 469 li 1511 fib 919 1561 '& 792 1611 £iii 714 1659 t=l 632 1711 ~ 539 1761 8 466 1512 ~ 917 1562 ~ 787 1612 ~ 709 1662 jjiiJ 628 1711 tJji 539 1762 ~ 463 1513 jig 915 1563 ~ 786 1613 ~ 708 1662 w.: 628 1711 $ 539 1762 l'k 463 1514 *4 913 1563 ~ 786 1614 if 706 1664 if 625 1714 .. 535 1764 7ifl 462 1515 ~ 912 1565 T 784 1615 m 704 1665 jl 624 1714 ~ 535 1765 00- 461 1516 tr 903 1566 ~ 782 1616 702 1666 1t 622 1716 ~ 534 1766 ~~ 458 /517 902 1567 JJ 781 1617 •i.l 700 1667 ~ 618 1717 ~,f 533 1767 ;i 456 1518 ~* 895 /567 i't 781 1618 If.\" 699 1667 ~ 618 1717 ~ 533 1768 7'J 454 1519 {eb 887 1567 ~ 781 1618 Id: 699 1669 m 615 1719 ~ 528 1768 ~ 454 1520 M 883 1567 l?j 781 1618 it 699 1669 ~ 615 1719 If 528 1770 u± 453 1521 ~ 881 1571 m 779 1621 JftIl 698 1671 .m 614 1719 1j: 528 1771 ~ 451 1522 !'! 876 1571 lft 779 1622 :n 695 1671 Wt' 614 1722 ~ 527 1771 ~ 451 1523 ~ 875 1573 *El 778 1623 ~ 691 1673 ~ 612 1723 f1 523 1773 l!I 449 1524 ill! 874 1574 ItF 777 1623 ~ 691 /673 ~ 612 1724 mr 520 1774 ~ 444 1525 873 1575 m 773 1625 m 689 /675 ~ 6Il 1724 -g9( 520 1774 lifl 444 1525 •~ 873 1576 mi 770 1626 '~ 687 1676 IJJL 608 1726 1/1:1 519 1774 ~ 444 1527 :mt 868 1577 ~ 767 1627 ~ 685 1677 m 606 1727 518 1777 ~ 443 1528 ~ 865 1578 nl:! 766 1628 FK. 682 1678 ~ 605 1727 f.ft* 518 1778 ~ 441 1529 i:ll: 864 1579 *-!c 764 1629 ~ 680 1679 {/II 589 1727 $ 518 1779 ~ 440 1530 nl'i 862 1579 764 1630 ~ 679 1680 #e! 587 1730 .- 514 1780 ~ 439 *""= 1530 ~ 862 1581 F' 762 163/ 1lll 676 1681 ~ 586 1730 ;hi: 514 1781 m 436 1532 ~ 861 1582 *'li 761 1632 ~ 675 1682 t}j 584 1730 ~ 514 1782 P¥ 434 1533 ~ 860 1582 ~ 761 1633 fs 674 1683 ~ 583 1733 IlE 513 1783 ~ 432 1534 1E 858 1582 r11l 761 1634 -'it 673 1684 ~ 582 1734 512 1783 ~ 432 1535 ~ffi 855 1585 lffil 759 1635 ~f. 671 1685 $)( 581 1735 *m 509 1783 'if 432 1535 ~ 855 1586 ~ 758 1635 pq 671 1686 -m 580 1736 -t 508 1783 an. 432 1537 ;W 854 1587 754 /637 :liT 668 1686 ~ 580 1737 ~ 503 1787 IE 431 1538 ~ 852 1587 ~* 754 1638 J1j 667 1686 $G 580 1738 mt 502 1787 431 1539 rr 850 1589 J* 753 1639 ~ 664 1689 *.§. 578 1739 'R' 496 1787 * 431 1540 1ti 847 1589 W1: 753 1640 fJlJ 662 1690 ~ 574 1740 ~ 494 1790 ~* 430 1540 lW1 847 1591 fdf 751 1641 8lt 660 1690 Ii§" 574 1741 IE 491 1791 f.j 429 1542 jffl 846 1592 #: 750 1642 7f 659 1692 573 1742 Jjifi 490 1791 ~ 429 1543 71< 843 1593 lIPJ 748 1643 r,E 656 1693 '*t& 572 1742 #i 490 1793 iii 425 1544 iii 842 1594 ?!t 747 1643 656 1694 it 567 1744 III 489 1793 IJlIi 425 1545 III 838 1594 111 747 1645 •!t. 655 1694 :It 567 1744 -m 489 1795 Ii!J 424 1546 1$ 832 1596 741 1646 It 652 1694 :B 567 1744 00 489 1796 ~Q 423 1547 fi!i 825 1597 •tJjIj 739 1646 Ii'{ 652 1697 ~ 566 1747 n 487 1797 ~ 419 1548 '!l 821 1598 ~II 735 1646 iII 652 1698 IiA 565 1747 ~ 487 1798 ~ 418 1549 ~ 820 1599 lf1,\J 734 1649 it 650 1699 S 563 1747 ijl 487 1799 71. 416 1550 ifJL 819 1600 fI 732 1650 tJi. 649 1700 "1 560 1750 ,. 484 1800 Wi 415 CF 98.8138 CF 98.9790 CF 99.1262 CF 99.2550 CF 99.3665 CF 99.4618 JAPANESE CHARACTER FREQUENCY LIST 495

APPENDIX A (Continued) R C F R C F R C F R C F R CF R C F 1800 f,jj 415 1851 ~ 363 1901 ~ 302 1951 !'KJ 254 2000 ~ 206 2049 f~ 169 /802 411 1852 ~ 360 1902 ?,f 300 1951 j£ 254 2002 ~ 204 2052 ~ 168 /803 *1Jm 410 1853 359 1903 :!jr 299 1951 ~ 254 2002 ~ 204 2053 illi 167 1804 ,fl¥j 408 1854 Ii 358 1904 ~Ill 295 1951 1t 254 2004 ~ 202 2054 .~ 166 1804 frIiX 408 1855 ,j:g 354 /904 ~ 295 1955 z, 252 2004 JM 202 2055 m 165 1806 mE 405 1856 ~ 351 1904 #l! 295 1956 ifir 251 2006 ifl 201 2056 ~ 164 1806 ~ 405 1857 ~ 350 1907 M 293 1957 iii 250 2(J(J6 ~ 201 2(J57 m 163 1808 iJi{ 404 1857 $ 350 1907 1" 293 1957 ~ 250 2008 ID& 199 2057 ~ft 163 1808 ~ 404 1857 *7) 350 1907 ~ 293 /959 Iii 249 2008 199 2059 ~ 162 1808 404 1860 ifJ; 348 1907 ifIIJ 293 1960 t& 248 2010 •~ 198 2059 f~ 162 1808 *~~ 404 1861 tt 346 19J1 *It 292 1960 l!4 248 2011 ~ 197 2059 f!£ 162 1812 ~ 402 1861 ffl! 346 1912 5* 288 1962 ~I'J 247 2011 iii 197 2062 15k 161 /812 i3 402 1861 f$ 346 1912 ~ 288 1963 ~ 246 2013 #l 195 2063 fJJ 160 /8/4 f7e 400 1864 ~ 344 /914 ~ 287 1964 Mi 245 2014 II~ 194 2064 !f 159 1815 ~ 399 1864 ~ 344 1914 :Mi 287 1965 f1I 243 2015 ~ 193 2064 $ 159 1816 i1J. 398 /866 '$: 340 1916 f;lk 286 1966 M< 241 2016 M 191 2066 157 1817 ¥* 397 1867 ~ 334 /9/7 ~ 285 1967 B 239 2016 ~ 191 2066 •jij!j 157 1818 ~ 396 1867 ffi 334 /918 ~ 284 1968 8 237 2016 ff,j 191 2068 m 156 .1b 1818 ~ 396 1869 R 333 /9/9 283 1969 R 236 2016 if[, 191 2068 1"1 156 18/8 396 1869 Ef$ 333 19/9 • 283 1969 ~Ij 236 2016 1i 191 2070 frl!i 155 1821 •~~ 395 1871 ~ 332 1921 •~~ 282 /971 J§ 233 2016 ~ 191 2070 iI 155 1821 ~ 395 1872 X 329 1922 ~ 278 1971 $Jt 233 2022 fji 190 2072 ~ 154 /823 ~ 389 1873 it 327 1922 m 278 1971 ~ 233 2023 PX 189 2073 ~ 153 /824 m 388 1874 f~ 326 1924 Ilij 277 1974 # 230 2023 JI\ 189 2073 :&l 153 1825 ~ 385 1874 326 1925 nfOJ 274 1975 ~ 228 2025 1'5 188 2073 if! 153 1826 fiJi 384 1876 •ill 323 1925 Ijl 274 1976 ~ 225 2U26 J![ 187 2073 :~ 153 1827 ~ 383 1876 f'f 323 1925 R 274 1977 ~ 223 2026 JIt 187 2077 *lE 150 1828 i'Jll 382 1878 ~ 322 /928 272 1978 ~ 221 2028 t* 185 2078 itJ 149 1829 Jlft 380 1878 '!!i 322 1929 •It! 271 1978 tv 221 2028 JO.: 185 2U79 ~ 148 1830 ~ 379 1880 ~ 321 1929 JL 271 1978 1'& 221 2028 M 185 2079 11 148 183/ }L 377 1880 'II 321 /93/ j(: 269 /981 ~ 219 2028 m 185 2081 146 /831 f.m 377 1882 !f:t 320 /93/ RJJ 269 1982 »f. 217 2032 it: 183 2081 il* 146 183/ ~ 377 1883 319 193/ '1 269 1982 ~ 217 2032 ~ 183 208/ ~ 146 1834 ~ 376 1883 *~ 319 1934 lOft 268 1984 216 2032 f!f 183 2081 m 146 1835 tt 374 1885 i~ 316 1934 i'l 268 1984 f8• 216 2032 183 2085 ~ 142 1836 ;i 373 1886 1f. 315 1934 ~ 268 1984 4- 216 2036 * 182 2086 :tit 141 1836 B 373 1886 tu 315 1937 267 1987 III 214 2036 *~ 182 2087 ~ 140 /838 ~ 372 1888 t~ 314 1937 •~ 267 1987 214 2038 181 2087 f's. 140 1!?39 fl 371 1889 ~ 312 1939 m 266 1989 1#• 213 2039 ** 180 2089 JlJR 139 /839 II 371 /890 7iIJ 311 1940 ~ 265 1990 ~ 212 2040 •iilli 179 2089 5Mi 139 /839 ~ 371 /890 ~ 311 1940 ~ 265 1990 1!It 212 2041 i!iJJ 176 2091 )i 138 1842 ill 370 /892 00 310 1942 ~ 264 1990 JlI 212 2041 ~ 176 2091 JiiI 138 1842 1l! 370 1892 Jlt 310 1943 263 1993 210 2043 175 2091 f!l 138 1844 ~ 369 1894 IPl 309 /943 *'u~ 263 1994 •tHI 209 2044 •it 173 2094 at 137 1845 J! 368 1895 ill« 308 /945 lfl 262 1995 ~ 208 2045 .ftf 172 2095 fIfi 136 /846 6 367 1895 fR 308 1945 262 1995 *- 208 2045 it 172 2095 tt 136 /847 *Ji: 366 1897 307 1947 •Itt 258 1995 00 208 2047 ~ 171 2095 tri'i 136 /848 ~ 365 1898 11.1* 305 1948 ~ 256 1998 JIjj 207 2048 '* 170 2095 ~ 136 1848 ~ 365 1899 ~ 303 1949 W. 255 1998 ~ 207 2049 if!, 169 2099 ~ 135 ;DL ~ ~ /850 a 364 1899 is 303 /949 ~ 255 2000 m 206 2049 169 2099 135 CF 99.5445 CF 99.6146 CF 99.6736 CF 99.7226 CF 99.7627 CF 99.7953 496 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued) R C F R C F R C F R CF R C F R C F 2099 ~ 135 2150 m 112 2200 i#. 95 2249 ~ 79 2298 m 67 2351 ~ 59 2099 1L 135 2152 ffi. 111 2200 flUj 95 2249 ~ 79 2298 '1~ 67 2351 JlJit 59 2103 fIX 134 2152 R: 111 2203 ff 94 2253 ~ 78 2298 w: 67 2351 :tt 59 2103 ~ 134 2152 jIJl 111 2203 ~ 94 2253 m 78 2298 ~ 67 2351 itJ 59 2105 }J:t 133 2152 fc 111 2205 ~ 93 2253 7f 78 2305 li; 66 2351 m 59 2105 133 2156 ~ 110 2205 {IE 93 2256 • 77 2305 ijI 66 2351 BrlJ 59 2105 *f1f* 133 2156 JIti' 110 2207 iii 92 2256 te 77 2305 {g 66 2351 i1t 59 2105 fill 133 2156 110 2207 III 92 2256 :M! 77 2305 ~ 66 2351 rlJj 59 2105 ~ 133 2159 *j1( 109 2209 ~ 91 2256 ~ 77 2305 1{ 66 2351 M 59 2110 132 2160 ~ 108 2209 ~ 91 2260 ;ti~ 76 2310 JiI\ 65 2351 ~Dl 59 2110 •m 132 2160 ft 108 2209 ~ 91 2260 ~ 76 2310 ~ 65 2351 ~ 59 2112 ;9t 131 2162 ffii 107 2212 ~ 90 2260 if 76 2310 ~ 65 2351 M 59 2112 :It 131 2162 ~ 107 2213 1f 89 2260 nI 76 2310 ~ 65 2351 ¥- 59 21/-1 !Ill 130 2164 !i!! 106 2213 ~ 89 2260 ill!! 76 2310 1if 65 2364 'If 58 21/-1 H 130 2165 J#! 105 2213 ~ 89 2265 ~ 75 2310 ~ 65 2364 J!l 58 2116 1m 129 2165 ~ 105 2213 ~t 89 2265 • 75 2310 .t1J 65 2364 ti£! 58 2116 j! 129 2167 11$ 104 2213 ~ 89 2265 ~ 75 2317 Ii 64 2364 1i 58 2118 PI'l 128 2167 it 104 2213 89 2265 jt! 75 2317 {bo 64 2364 M 58 2118 ~ 128 2167 ~ 104 2219 *jlJf 88 2265 :Ii 75 2319 1!f 63 2369 ~ 57 2120 'I&- 127 2167 iIi: 104 2219 JE 88 2270 * 74 2319 ?(i 63 2369 itE 57 2121 !llf 126 2167 M 104 2219 88 2271 1!- 73 2319 1:>1 63 2369 ~13 57 2122 ~ 124 2167 if 104 2219 •it 88 2271 fm 73 2319 II! 63 2369 tit 57 2123 iffi 122 2167 m 104 2223 ~ 87 2271 • 73 2319 63 1373 1fI\ 56 2123 I!l!I 122 2174 $ 103 2223 I!!l 87 2274 ~ 72 2319 •~ 63 2373 it 56 2125 !lII1:i 120 2174 n 103 2223 ~ 87 2274 if 72 2319 URP 63 2373 ~8 56 2125 iI 120 2174 1f 103 2226 *t 86 2274 fIA 72 2326 !,it. 62 2373 ~ 56 2127 fit 119 2174 if 103 2226 $1 86 2277 Jiir 71 2326 m 62 2373 U{t 56 2127 ~ 119 2174 !tt 103 2226 1m 86 2277 m 71 2326 tJ 62 2373 t$ 56 2129 ~ 118 2179 ~ 102 2226 iii 86 2277 ~ 71 2326 ~ 62 2379 m 55 2130 ~ 117 2179 ~ 102 2226 ~ 86 2277 ~ 71 2326 fJ> 62 2379 55 ± 2130 fJ 117 2179 n 102 2231 m 85 2281 ~ 70 2326 11 62 2379 •;lit 55 2130 % 117 2182 m 101 2231 85 2281 jJ" 70 2326 4iJf 62 2379 55 2130 M§ 117 2182 it 101 2233 •m 84 2281 Ii 70 2326 I:}'i 62 2379 •ruk 55 2134 it 116 2182 lit 101 2234 ~ 83 2281 ~ 70 2326 ~ 62 2379 a& 55 2134 ~ 116 2185 ~ 100 2235 ~ 82 2281 fiij 70 2326 1M. 62 2385 ~ 54 2134 116 2186 !IE 99 2235 1m 82 2281 §l!j 70 2336 III 61 2385 #Ii 54 2134 •'&£ 116 2186 ~ 99 2235 m 82 2287 III 69 2336 i'lJ 61 2385 flL 54 2134 'E"'" 116 2186 ~ 99 2235 ~ 82 2287 rt 69 2336 W: 61 2385 ~ 54 2139 ~ 115 2186 1& 99 2235 1ft 82 2289 ~ 68 2336 Wlf 61 2385 f# 54 2139 11.ti 115 2186 ~ 99 2235 P)l 82 2289 ~ 68 1336 15 61 2385 11 54 2139 V 1I5 2191 ~ 98 2235 ~ 82 2289 :m 68 2336 ({] 61 2385 ~ 54 2139 ~ 115 2191 ttli 98 2235 iii 82 2289 ~ 68 2336 ~ 61 2385 ~ 54 2143 ft 114 2191 ~ 98 2243 ~ 81 2289 11f 68 2336 m 61 2385 ~ 54 2143 JfiJ 114 2194 ~ 97 2243 81 2289 m 68 2336 ~ 61 2385 ~~ 54 2143 ff. 114 2195 m: 96 2243 •Jfla 81 2289 U 68 2336 ftlt 61 2385 ;jl 54 2143 IIMJi 114 2195 ~ 96 2243 (£ 81 2289 68 2336 'It 61 2396 MIl 53 2147 8 113 2195 Iff 96 2247 IW 80 2289 •.. 68 2347 ~ 60 2396 .. 53 2147 ~~ 113 2195 ~ 96 2247 80 2298 ~ 67 2347 ~ 60 2396 Pm 53 2147 ~ 113 2195 ~ 96 2249 •~ 79 2298 m 67 2347 {Jt 60 2396 @ 53 2150 ft 112 2200 It 95 2249 il¥ 79 2298 ;fib 67 2347 ~ 60 2400 52 CF 99.8220 CF 99.8434 CF 99.8626 CF 99.8776 CF 99.8926 CF • 99.9039 JAPANESE CHARACTER FREQUENCY LIST 497

APPENDIX A (Continued) R C F R e F R CF R CF R C F R C F 2400 B 52 2449 ~ 46 2496 J.Il 40 2543 !Ill 35 2599 ~ 31 2646 ~ 28 2402 ~ 51 2449 !l5e 46 2496 it 40 2543 ~ 35 2599 m 31 2646 ~ 28 2402 J& 51 2449 1ri! 46 2503 ~ 39 2543 :I!l 35 2599 It 31 2646 FoX 28 2402 If?{ 51 2449 ~ 46 2503 !\!\! 39 2543 ~ 35 2599 .m 31 2646 let 28 2402 ~ 51 2449 ~ 46 2503 Ii 39 2543 jlj 35 2599 !!1. 31 2646 28 2402 ~ 51 2449 if:! 46 2503 ~ 39 2543 ~ 35 2599 tit 31 2646 •14' 28 2402 ~ 51 2449 ~ 46 2503 jili)i 39 2543 ~ 35 2599 ifi 31 2646 ~J 28 2402 ffi. 51 2449 J]p 46 2503 ~ 39 2543 :f!lt 35 2599 31 2646 1.t 28 2402 ~ 51 2459 ~ 45 2503 ~ 39 2559 ~ 34 2599 •it 31 2646 ~ 28 2402 fJi 51 2459 i'Jf 45 2503 MIJ 39 2559 nit 34 2599 ~ 31 2646 ~ 28 2411 g 50 2459 ~ 45 2503 39 2559 WJ 34 2599 IlfB 31 2646 ~ 28 2411 fi 50 2459 at 45 25/2 il!I*" 38 2559 ~ 34 2599 Il 31 2646 {lL 28 2411 ~ 50 2459 A 45 25/2 ~ 38 2559 ~ 34 2599 mJ 31 2646 QOO 28 24fl @ 50 2464 'Pf 44 2512 JlIl 38 2559 :It 34 2599 !I'J 31 2664 ~ 27 24fl 50 2464 fIJ 44 2512 ~ 38 2559 ~ 34 2599 Pfi 31 2664 Il!J: 27 2411 *J]lf 50 2464 -;Jf 44 25/2 m 38 2559 lil 34 2616 it: 30 2664 ;1'2 27 2411 {# 50 2464 Wi 44 2512 ~ 38 2559 ~ 34 2616 ~ 30 2664 ~ 27 2411 ~ 50 2468 43 2512 38 2568 ~ 33 2616 30 2664 ~ 27 24JJ ~ 50 2468 *~ 43 2512 •~ 38 2568 /Rl 33 2616 •1m 30 2664 fi 27 24fl ~ 50 2468 iif 43 2512 38 2568 ~ 33 2616 im 30 2664 00 27 24JJ ~ 50 2468 ~ 43 2512 •*BIt 38 2568 iIJJi 33 2616 M 30 2664 ~ 27 2422 ~ 49 2468 ~ 43 25/2 ~ 38 2568 ~J 33 2616 JtE 30 2664 1m 27 2422 ~ 49 2468 ~ 43 25/2 -m 38 2568 tal 33 2616 i;Ili 30 2664 '$ 27 2422 1# 49 2468 ~ 43 2512 IIi* 38 2568 ~ 33 2616 3": 30 2664 ~ 27 2422 g 49 2475 ., 42 25/2 ~ 38 2568 III 33 26/6 !Itt 30 2664 i? 27 2422 5E 49 2475 ~ 42 2512 (4f 38 2568 33 26/6 mt 30 2664 ~ 27 2422 m 49 2475 M 42 2527 37 2568 m* 33 2616 :g£ 30 2664 ~ 27 2422 1M; 49 2475 ia 42 2527 •PI 37 2568 33 2616 ~ 30 2664 ~ 27 2422 ~ 49 2475 ,. 42 2527 M 37 2568 • 33 2629 ~ 29 2664 ~ 27 2422 Fr 49 2475 42 2527 B: 37 2568 •~ 33 2629 29 2664 !Rt 27 2422 i1lJ 49 2475 IIf• 42 2527 it( 37 2581 $ 32 2629 •iAl 29 2681 M 26 2432 W 48 2475 ~ 42 2527 ~ 37 2581 ~ 32 2629 ~ 29 2681 ~ 26 2432 'I1J 48 2475 m 42 2533 i1ill 36 2581 ~ 32 2629 $ 29 2681 26 2432 II 48 2475 .. 42 2533 ,fit! 36 2581 M 32 2629 II 29 2681 •1C. 26 2432 #!!! 48 2475 7.X 42 2533 lit 36 2581 ~ 32 2629 !M 29 2681 ~ 26 2432 m 48 2486 ~ 41 2533 51! 36 258/ ffl'I 32 2629 ~ 29 2681 m 26 2432 P1! 48 2486 :it 41 2533 i'ili 36 258/ ~ 32 2629 ~ 29 2681 ~ 26 2432 l'IJ 48 2486 IBJ 41 2533 4: 36 2581 tt 32 2629 ~ 29 268/ ~ 26 2432 11: 48 2486 iltf 41 2533 IT 36 258/ lit 32 2629 iI 29 268/ :~ 26 2440 ~ 47 2486 ~ 41 2533 {iJi 36 258/ P'f 32 2629 i~ 29 2681 W 26 2440 ~ 47 2486 41 2533 ~ 36 2581 ~ 32 2629 29 2681 tl 26 2440 !ill 47 2486 "~ 41 2533 !i 36 2581 ~ 32 2629 •W 29 2681 ~ 26 2440 1$ 47 2486 Pl! 41 2543 ~ 35 2581 i6!t 32 2629 Jft; 29 268/ fa 26 2440 ~ 47 2486 trJi 41 2543 Ii 35 2581 WI. 32 2629 $ 29 2681 M 26 2440 mi 47 2486 it 41 2543 ~ 35 258/ ~ 32 2629 ~ 29 2681 m 26 2440 #i 47 2496 tlj 40 2543 it 35 258/ ~ 32 2646 ~ 28 268/ 26 2440 ~ 47 2496 ~ 40 2543 m 35 2581 !G 32 2646 # 28 2681 •~ 26 2440 11# 47 2496 ~ 40 2543 35 258/ :Ill: 32 2646 ~ 28 2681 .I;€ 26 2449 :¥Ill 46 2496 TiIJj 40 2543 '*w.Jt 35 2599 ~ 31 2646 1# 28 2681 ~ 26 2449 ~f 46 2496 Em 40 2543 35 2599 31 2646 f! 28 268/ M 26 CF 99.9139 CF 99.9239 CF 99.9331 CF • 99.9381 CF 99.9431 CF 99.9481 498 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIX A (Continued) R C F R C F RC F R C F R C F R CF 268/ IR 26 2725 ~Ji 24 2785 ~ 21 2836 :¥f 19 2886 if! 17 293/ m IS 2702 Il!k 25 2752 J5 23 2785 ~ 21 2836 It 19 2886 ~ 17 2931 Ii~ 15 2702 It 25 2752 ~ 23 2785 21 2836 p~ 19 2886 ?&. 17 293/ jg IS 2702 ~ 25 2752 ~ 23 2785 • 21 2836 *HI 19 2886 ~ 17 2931 IS 2702 ~ 25 2752 fiji 23 2785 ~'* 21 2836 1l 19 2886 ~ 17 2931 •ft IS 2702 &Ii 25 2752 Ilf 23 2806 II 20 2836 1Il~ 19 2886 m 17 2931 JEt 15 2702 ~r 25 2752 ,~ 23 2806 Jjf 20 2836 Sf 19 2907 ft 16 2931 ~i'f 15 2702 t:I 25 2752 III 23 2806 Kl 20 2836 1# 19 2907 IlR 16 293/ frJI 15 2702 flI 25 2752 ~ 23 2806 ~ 20 2836 rtJ 19 2907 3i.} 16 293/ ~ 15 2702 ~ 25 2752 m 23 2806 ~ 20 2836 • 19 2907 fa: 16 2931 ~ 15 2702 i1lt 25 2752 Q 23 2806 20 2861 ~ 18 2907 m 16 2931 10 15 2702 ~ 25 2752 23 2806 ~• 20 2861 ¥J). 18 2907 *L 16 2931 mJ 15 2702 Il% 25 2752 •:!1i 23 2806 lEI 20 2861 a: 18 2907 16 2931 !I* IS 2702 ~ 25 2752 ~ 23 2806 V1f 20 2861 :Ii!i 18 2907 m* 16 2931 !IW 15 2702 ~ 25 2752 ~ 23 2806 II#i 20 286/ ** 18 2907 ~ 16 2931 '!Ii 15 2702 *n 25 2752 .IE 23 2806 ~ 20 2861 ~ 18 2907 :RlJ 16 2966 j$ 14 2702 ,w 25 2752 m 23 2806 ~ 20 286/ ~ 18 2907 WJl 16 2966 11 14 2702 ~ 25 2752 ~ 23 2806 ~ 20 286/ JJJZ 18 2907 7§. 16 2966 1m 14 2702 ~ 25 2752 &if 23 2806 fi 20 2861 tI 18 2907 ~ 16 2966 JI3't 14 2702 Jl1{ 25 2752 ~ 23 2806 ii!i 20 2861 FX 18 2907 '& 16 2966 R 14 2702 ~ 25 2752 #. 23 2806 ffi 20 2861 it 18 2907 M: 16 2966 ~ 14 2702 :fa 25 2772 iii 22 2806 20 286/ If{; 18 2907 tilt 16 2966 1ft 14 2702 ~ 25 2772 1@: 22 2806 •T1J:. 20 2861 !l 18 2907 t-J 16 2966 ~ 14 2702 25 2772 1* 22 2806 ffl 20 2861 m: 18 2907 ~ 16 2966 J:l 14 2725 •!l'I 24 2772 ~ 22 2806 ~p 20 2861 fJ 18 2907 !mi 16 2966 ~ 14 2725 ~ 24 2772 ~ 22 2806 1t 20 286/ %5 18 2907 ,. 16 2966 ~ 14 2725 ~ 24 2772 & 22 2806 ~ 20 2861 fit! 18 2907 il& 16 2966 ~ 14 of± 2725 ~ 24 2772 IT,," 22 2806 Wi 20 2861 ~~ 18 2907 i!!I. 16 2966 ~ 14 2725 £j~ 24 2772 jjfl 22 2806 i$ 20 2861 ~ 18 2907 ~ 16 2966 lth 14 2725 JJJD 24 2772 ~~ 22 2806 ~ 20 2861 ~ 18 2907 fit 16 2966 T 14 2725 Wi 24 2772 ~ 22 2806 20 2861 ~ 18 2931 ~ 15 2966 m 14 2725 ft{ 24 2772 7Mt 22 2806 *~ 20 2861 :Hi 18 2931 ~l 15 2966 vJ; 14 ;!!!;; 2725 :II\! 24 2772 ~ 22 2806 7C 20 2861 Jm 18 2931 00 15 2966 ~ 14 2725 ~ 24 2772 ~ 22 2806 {frt 20 2861 ~ 18 2931 ~ 15 2966 :;~ 14 2725 Pm 24 2785 ,fl 21 2806 1£ 20 2861 ~ 18 2931 ii 15 2966 flIL 14 2725 iii 24 2785 wm 21 2836 it 19 2886 • 17 2931 ~ 15 2966 t1i 14 2725 #i! 24 2785 ~ 21 2836 ~ 19 2886 If* 17 2931 to 15 2966 m 14 2725 m 24 2785 ~ 21 2836 :n 19 2886 m 17 2931 ~ 15 2966 til 14 2725 ~ 24 2785 tJ 21 2836 :tS 19 2886 ~ 17 2931 if IS 2966 :II 14 2725 iOO 24 2785 IIQl: 21 2836 ~ 19 2886 • 17 2931 Ii IS 2966 14 2725 ~ 24 2785 ~ 21 2836 ~ 19 2886 ft 17 293/ EI IS 2966 ~"" 14 2725 24 2785 ~ 21 2836 T* 19 2886 II 17 2931 ~ IS 2966 ~ 14 2725 •e 24 2785 ~ 21 2836 ~ 19 2886 :jp 17 2931 fIij IS 2966 .fti 14 2725 til 24 2785 1$ 21 2836 IJlI¥ 19 2886 ~ 17 2931 tJi IS 2966 W 14 2725 Il'F 24 2785 fllJ 21 2836 ~ 19 2886 Il\I 17 293/ 15 2966 ~ 14 2725 J$ 24 2785 ;l 21 2836 ~ 19 2886 ff1 17 2931 m* 15 2996 J$i 13 2725 .z: 24 2785 l4l 21 2836 ~ 19 2886 rx 17 2931 fA 15 2996 ilH 13 2725 ilk 24 2785 5'1 21 2836 f

APPENDIX A (Continued) APPENDIXB Hiragana Letter Frequency List R CF 2996 ;m 13 R C F R C F 2996 ~ 13 1 (J) 1918313 49 '1 48752 2996 !Ill 13 2 vi: 1108840 50 '" 47013 2996 JlI(f 13 3 t: 1067566 51 rJ 32312 2996 ill 13 4 v\ 1060284 52 tJ 31212 I;): ~. 2996 ~ 13 5 937811 53 '- 26965 2996 *!Ji: 13 6 :a: 936356 54 :tJ. 23490 2996 fi 13 7 .!:: 927938 55 ~ 23280 2996 :JjiiJ 13 8 ~ 916652 56 <" 21549 2996 m 13 9 tJ' 860742 57 ~ 19865 2996 ~ 13 10 L- 848132 58 V- 19148 2996 ~ 13 11 -r: 764834 59 J: 14425 2996 :r-J 13 12 -c 758316 60 .-:5 13125 13 tt. 720156 61 1£ 12402 2996 '" 13 2996 I!DII 13 14 tJ' 537294 62 « 12108 2996 1Jtf 13 15 -o 467350 63 h 11606 2996 fj 13 16 n 450805 64 ~ 11522 2996 11{; 13 17 G 423294 65 e 10047 2996 IiIiI 13 18 b 396142 66 I1l 8486 2996 f7} 13 19 ? 352965 67 -If 6893 2996 ~ 13 20 T 340654 68 ~ 5124 2996 11 13 21 ~ 333999 69 1-;1: 4349 ~ 2996 ii\'f 13 22 ~ 312227 70 ~ 2755 2996 n 13 23 ti. 280911 71 rJ 1608 2996 m 13 24 't 278599 72 It 1315 ~ ~ 2996 !tj 13 25 258960 73 986 2996 ~ 13 26 ~ 233505 74 '" 477 2996 M 13 27 ~ 223806 75 i:> 125 2996 ~ 13 28 < 221960 76 ;t 106 2996 ~ 13 29 .:b 204256 77 "t? 82 2996 ~ 13 30 It 199362 78 2- 75 2996 it'S 13 31 E 196555 79 d;) 48 2996 ~ 13 32 Iv 190068 80 ;to 21 2996 ~ 13 33 ;t 163664 81 v- 21 CF 99.9815 34 J:: 154206 82 n 3 35 -o 153999 83 :; 1 Note-R = Rank offrequency, C = Character ofkanji, F = Frequency ~ occurrence in23,408,236 character corpus. 36 146156 37 -t- 131611 38 b 123077 39 ij 99183 40 7-f.. 89264 41 tt 83444 42 0 73467 43 t!:" 72228 44 j;J 65870 45 t;. 56857 46 A. 56005 47 i' 53256 48 Ij' 49126

Note-R = Rank of frequency, C = Character of hiragana, F = Fre- quency occurrence in20,711,361 character corpus. 500 CHIKAMATSU, YOKOYAMA, NOZAKI, LONG, AND FUKUDA

APPENDIXC Katakana Letter Frequency List R C F R C F 1 :/ 290948 49 22462 2 Iv 189442 50 *jJ 22061 3 A 178214 51 /'\ 21839 4 !- 162802 52 '7 21784 5 7 127845 53 'J 20784 6 .{ 120807 54 -7 20633 7 '7 117203 55 'E 20070 8 !J 106744 56 J 19572 9 17 98209 57 7:' 19240 10 ~'/ 86894 58 I! 18692 J1 7J 82982 59 18204 12 ~ 80626 60 *zr; 17817 13 lj 75319 61 :3 17731 14 0 75301 62 14881 15 F 74257 63 "'"::1' 13931 16 '/ 61171 64 .y 12526 17 7 61115 65 ~ 10732 18 i- 60608 66 ::l 10318 19 j. 60230 67 ~ 10144 20 ::J 58724 68 7" 10121 21 -:(' 56123 69 -tt' 7689 22 7' 54159 70 t 7289 23 T 53404 71 '" 7129 24 s; 50758 72 ::L 6653 25 T 48437 73 If 6481 26 /-\ 44970 74 :t 6245 27 ~' 44462 75 ~ 2897 28 If 40433 76 '/ 2640 29 :\'- 39608 77 ij 1145 30 r7 39323 78 ? 1050 31 -t)- 39202 79 T 149 32 ::::- 38711 80 'Y 127 33 -r 38047 81 =:1 122 34 J:. 36458 82 I 73 35 7' 35920 83 4- 40 36 /-\ 35416 84 ;h 14 37 -e 34883 85 'J 9 38 ;;f" 34718 86 -7" 2 39 -1 33747 40 T 32665 41 ::t. 32616 42 ~ 29262 43 -\' 28144 44 ~ 26651 45 lj' 26396 46 'Y 24541 47 ~ 23742 48 .r( 22755

Note-R = Rank of frequency, C ~ Character of katakana, F = Fre- quency occurrence in3,608,288 character corpus.

(Manuscript received February 2, 1999; revision accepted forpublication February 27,2000.)