Korean-To-Chinese Machine Translation Using Chinese Character As Pivot Clue

Total Page:16

File Type:pdf, Size:1020Kb

Korean-To-Chinese Machine Translation Using Chinese Character As Pivot Clue Korean-to-Chinese Machine Translation using Chinese Character as Pivot Clue Jeonghyeok Park1,2,3 and Hai Zhao1,2,3, ∗ 1Department of Computer Science and Engineering, Shanghai Jiao Tong University 2Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, China 3 MoE Key Lab of Artificial Intelligence AI Institute, Shanghai Jiao Tong University [email protected], [email protected] Abstract et al., 2018; Xiao et al., 2019). Meanwhile, there are few attempts to improve the performance of the Korean-Chinese is a low resource language NMT model using linguistic characteristics for sev- pair, but Korean and Chinese have a lot eral language pairs (Sennrich and Haddow, 2016). in common in terms of vocabulary. Sino- On the other hand, Most of the recently proposed Korean words, which can be converted into corresponding Chinese characters, account for statistical machine translation (SMT) systems have more then fifty of the entire Korean vocabu- attempted to improve translation performance by lary. Motivated by this, we propose a simple using linguistic features including part-of-speech linguistically motivated solution to improve (POS) tags (Ueffing and Ney, 2013), syntax (Zhang the performance of Korean-to-Chinese neural et al., 2007), semantics (Rafael and Marta, 2011), machine translation model by using their com- reordering information (Zang et al., 2015; Zhang et mon vocabulary. We adopt Chinese charac- al., 2016) and so on. ters as a translation pivot by converting Sino- Korean words in Korean sentence to Chinese In this work, we focus on machine translation be- characters and then train machine translation tween Korean and Chinese, which have few parallel model with the converted Korean sentences corpora but share a well-known culture heritage, the as source sentences. The experimental results Sino-Korean words. Chinese loanwords used in Ko- on Korean-to-Chinese translation demonstrate rean are called Sino-Korean words, and can also be that the models with the proposed method written in Chinese characters which are still used by improve translation quality up to 1.5 BLEU points in comparison to the baseline models. modern Chinese people. Such a shared vocabulary makes the two languages closer despite their huge linguistic difference and provides the possibility for 1 Introduction better machine translation. arXiv:1911.11008v1 [cs.CL] 25 Nov 2019 Neural machine translation (NMT) using sequence- Because of its long history of contact with China, to-sequence structure has achieved remarkable per- Koreans have used Chinese characters as their writ- formance for most language pairs (Bahdanau et al., ing system, and even after adopting Hangul(ôÇ/åJ in 2014; Cho et al., 2014; Sutskever et al., 2014; Lu- Korean) as the standard language, Chinese charac- ong and Manning, 2015). Many studies on NMT ters have a considerable influence in Korean vocabu- have tried to improve the translation performance lary. Currently, the writing system adopted by mod- by changing the structure of the network model or ern Korean is Hangul, but Chinese characters con- adding new strategies (Wu and Zhao, 2018; Zhang tinue to be used in Korean and Chinese characters ∗ used in Korean are called ”Hanja”. Korean vocab- Corresponding author. This paper was partially supported ulary can be categorized into native Korean words, by National Key Research and Development Program of China (No. 2017YFB0304100) and Key Projects of National Natural Sino-Korean words, and loanwords from other lan- Science Foundation of China (U1836222 and 61733011). guages. The Sino-Korean vocabulary refers to Ko- Systems Sentences Korean " §î îÉr 아Aü< °úᇀsᅵ ìÍøí÷&%3다. HH-Convert }令Ér 아Aü< °úᇀsᅵ 颁布÷&%3다. Chinese }令颁布如下。 English The command was promulgated as follows. Korean ᆼª²DGÉr Fg #3ôÇ% %iò \"f_ᅴ /BN1lx sᅵe`¦ XSþÙ¡다. HH-Convert $国Ér 广范ôÇ 领域\"f_ᅴ q同 )Ê`¦ n¤þÙ¡다. Chinese $国(广泛的领域n¤了q同)Ê。 English The two countries have confirmed common interests in a wide range of areas. Table 1: The HH-Convert is Korean sentence converted by Hangul-Hanja conversion of the Hanjaro. The underline denotes Sino-Korean word and its corresponding Chinese characters in Korean sentence and HH-Convert sentence, respectively. rean words of Chinese origin and can be converted ties between language pairs to improve MT perfor- into corresponding Chinese characters, and consid- mance. Li et al. (2009) improved the translation erably account for about 57% of Korean vocabu- quality for Chinese-to-Korean SMT by using Chi- lary. Table 1 shows some sentence pairs of Korean nese syntactic reordering for an adequate generation and Chinese with the converted Sino-Korean words. of Korean verbal phrases. In Table 1, some Chinese words are commonly ob- Since Chinese and Korean belong to entirely dif- served between the converted Korean sentence and ferent language families in terms of typology and the Chinese sentence. genealogy, many studies also tried to analyze sen- In this paper, we present a novel yet straightfor- tence structure and word alignment of the two lan- ward method for better Korean-to-Chinese MT by guages and then proposed the specific methods for exploiting the connection of Sino-Korean vocabu- their concern (Huang and Choi, 2000; Kim et al., lary. We convert all Sino-Korean words in Korean 2002; Li et al., 2008). Lu et al. (2015) proposed sentences into Chinese characters and take the con- a method of translating Korean words into Chinese verted Korean sentences as the updated source data using the Chinese character knowledge. for later MT model training. Our method is applied to two types of NMT models, recurrent neural net- There are several attempts to exploit the connec- work (RNN) and the Transformer, and shows signif- tion between the source language and the target lan- icant translation performance improvement. guage in machine translation. Kuang et al. (2018) proposed methods to somewhat shorten the distance 2 Related Work between the source and target words in NMT model, and thus strengthen their association, through a tech- There have been studies of linguistic annotation, nique bridging source and target word embeddings. such as dependency label (Wu et al., 2018; Li et al., For other low-resource language pairs, using pivot 2018a; Li et al., 2018b), semantic role labels (Guan language to overcome the limitation of the insuf- et al., 2019; Li et al., 2019) and so on. Sennrich and ficient parallel corpus has been a choice (Habash Haddow (2016) proved that various linguistic fea- and Hu, 2009; Zahabi et al., 2013; Ahmadnia et tures can be valuable for NMT. In this work, we fo- al., 2017). Chu et al. (2013) bulid a Chinese char- cus on the linguistic connection between Korean and acter mapping table for Japanese, Traditional Chi- Chinese to improve Korean-to-Chinese NMT. nese, and Simplified Chinese and verified the ef- There are several studies on Korean-Chinese fectiveness of shared Chinese characters for Chi- machine translation. For example, Kim et nese–Japanese MT. Zhao et al. (2013) used the Chi- al. (2002) proposed verb-pattern-based Korean-to- nese character, a common form of both languages, as Chinese MT system that uses pattern-based knowl- a translation bridge in the Vietnamese-Chinese SMT edge and consistently manages linguistic peculiari- model, and improved the translation quality by con- 北 B^ “北美a'>ᅨ¸ “南北a'>ᅨ%!3 Chinese, many homophones were created in their @/¨8” vocabulary in the process of translating the Chinese 3.1îr1lx 100ÅÒ¸ ´úᆽ아 ᆼ©# î #QL:\ "é¶Òo( words into their language. Around 35% of the Sino- 原r) IFGlᅵÂÒÃÌ Korean words registered in the Standard Korean Language Dictionary belong to homophones. Thus Table 2: News headlines with Chinese characters. The converting Sino-Korean words into (usually differ- underline denotes Chinese characters. ent) Chinese characters will have a similar impact as semantic disambiguation. For example, the Korean verting Vietnamese syllables into Chinese characters word uisa (_ᅴ사 in Korean) has many homophones with a pre-specified dictionary. Partially motivated and can have several meanings. To clarify the mean- by this work, we turn to Korean in terms of NMT ing of the word uisa in Korean context, these words models by fully exploiting the shared Sino-Korean are occasionally written in Chinese characters as fol- vocabulary between Korean and Chinese. lows: ;师 (doctor), 意思 (mind), Ië (martyr), ® 事 (proceedings). 3 Sino-Korean Words and Chinese In addition, There is a difference between Chinese Characters characters (Hanja) used in Korea and Chinese char- acters used in China. Chinese can be divided into Korea belongs to the Chinese cultural sphere, which two categories: Traditional Chinese and Simplified means that China has historically influenced regions Chinese. Chinese characters used in China and Ko- and countries of East Asia. Before the creation of rea are Simplified Chinese and Traditional Chinese, Hangul (Korean alphabet), all documents were writ- respectively. ten in Chinese characters, and Chinese characters were used continuously even after the creation of 4 The Proposed Approach Hangul. The proposed approach for Korean-to-Chinese MT Today, the standard writing system in Korea is has two phases: Hangul-Hanja conversion and NMT Hangul, and the use of Chinese characters in Korean model training. We first convert the Sino-Korean sentences is rare, but Chinese characters have left words of the Korean input sentences into Chinese a significant influence on Korean vocabulary. About characters, and convert the Traditional Chinese char- 290,000 (57%) out of the 510,000 words in the Stan- acters of the converted Korean input sentences into dard Korean Language Dictionary published by the Simplified Chinese characters to share the common National Institute of Korean Language belongs to units between source and target vocabulary. Then Sino-Korean words, which were originally written we train NMT models with the converted Korean in Chinese characters.
Recommended publications
  • Stochastic Model of Stroke Order Variation
    2009 10th International Conference on Document Analysis and Recognition Stochastic Model of Stroke Order Variation Yoshinori Katayama, Seiichi Uchida, and Hiroaki Sakoe Faculty of Information Science and Electrical Engineering, Kyushu University, 819-0395, Japan fyosinori,[email protected] Abstract “ ¡ ” under an unnatural stroke correspondence which max- imizes their similarity. Note that the correct stroke order of A stochastic model of stroke order variation is proposed “ ” is (“—” ! “j' ! “=” ! “n” ! “–”) and that of “ ¡ ” and applied to the stroke-order free on-line Kanji character is (“—” ! “–” ! “j' ! “=” ! “n” ). Thus if we allow any recognition. The proposed model is a hidden Markov model stroke order variation, those two characters become almost (HMM) with a special topology to represent all stroke order identical. variations. A sequence of state transitions from the initial One possible remedy to suppress the misrecognitions is state to the final state of the model represents one stroke to penalize unnatural i.e., rare stroke order on optimizing order and provides a probability of the stroke order. The the stroke correspondence. In fact, there are popular stroke distribution of the stroke order probability can be trained orders (including the standard stroke order) and there are automatically by using an EM algorithm from a training rare stroke orders. If we penalize the situation that “ ¡ ” is set of on-line character patterns. Experimental results on matched to an input pattern with its very rare stroke order large-scale test patterns showed that the proposed model of (“—” ! “j' ! “=” ! “n” ! “–”), we can avoid the could represent actual stroke order variations appropriately misrecognition of “ ” as “ ¡ .” and improve recognition accuracy by penalizing incorrect For this purpose, a stochastic model of stroke order vari- stroke orders.
    [Show full text]
  • Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging
    ORIGINAL ARTICLE Neuroscience http://dx.doi.org/10.3346/jkms.2014.29.10.1416 • J Korean Med Sci 2014; 29: 1416-1424 Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging Zang-Hee Cho,1 Nambeom Kim,1 The two basic scripts of the Korean writing system, Hanja (the logography of the traditional Sungbong Bae,2 Je-Geun Chi,1 Korean character) and Hangul (the more newer Korean alphabet), have been used together Chan-Woong Park,1 Seiji Ogawa,1,3 since the 14th century. While Hanja character has its own morphemic base, Hangul being and Young-Bo Kim1 purely phonemic without morphemic base. These two, therefore, have substantially different outcomes as a language as well as different neural responses. Based on these 1Neuroscience Research Institute, Gachon University, Incheon, Korea; 2Department of linguistic differences between Hanja and Hangul, we have launched two studies; first was Psychology, Yeungnam University, Kyongsan, Korea; to find differences in cortical activation when it is stimulated by Hanja and Hangul reading 3Kansei Fukushi Research Institute, Tohoku Fukushi to support the much discussed dual-route hypothesis of logographic and phonological University, Sendai, Japan routes in the brain by fMRI (Experiment 1). The second objective was to evaluate how Received: 14 February 2014 Hanja and Hangul affect comprehension, therefore, recognition memory, specifically the Accepted: 5 July 2014 effects of semantic transparency and morphemic clarity on memory consolidation and then related cortical activations, using functional magnetic resonance imaging (fMRI) Address for Correspondence: (Experiment 2). The first fMRI experiment indicated relatively large areas of the brain are Young-Bo Kim, MD Department of Neuroscience and Neurosurgery, Gachon activated by Hanja reading compared to Hangul reading.
    [Show full text]
  • Nushu and the Writing of Religious
    NOSHU AND RELIGIOUS CULTURE IN CHINA "MY MOTHER WATCHED OVER AN EMPTY HOUSE AND',WAS SEPARATED FROM THE HEA VENL Y FEMALE": NUSHU AND THE WRITING OF RELIGIOUS CULTURE IN CHINA By STEPHANIE BALK WILL, B.A. (H.Hons.) A Thesis Submitted to the School of Graduate Studies in Partial FulfiHment ofthe Requirements for the Degree Master of Arts McMaster University © Copyright by Stephanie Balkwill, August 2006 MASTER OF ARTS (2006) McMaster University (Religious Studies) Hamilton, Ontario TITLE: "My Mother Watched Over an Empty House and was Separated From the Heavenly Female": Nushu and the Writing IOf Religious Culture in China. AUTHOR: Stephanie Balkwill, B.A. (H.Hons.) (University of Regina) SUPERVISOR: Dr. James Benn NUMBER OF PAGES: v, 120 ii ]rll[.' ' ABSTRACT Niishu, or "Women's Script" is a system of writing indigenous to a small group of village women in iJiangyong County, Hunan Province, China. Used exclusively by and for these women, the script was developed in order to write down their oral traditions that may have included songs, prayers, stories and biographies. However, since being discovered by Chinese and Western researchers, nushu has been rapidly brought out of this Chinese village locale. At present, the script has become an object of fascination for diverse audiences all over the world. It has been both the topic of popular media presentations and publications as well as the topic of major academic research projects published in Engli$h, German, Chinese and Japanese. Resultantly, niishu has played host to a number of mo~ern explanations and interpretations - all of which attempt to explain I the "how" and the "why" of an exclusively female script developed by supposedly illiterate women.
    [Show full text]
  • Sinitic Language and Script in East Asia: Past and Present
    SINO-PLATONIC PAPERS Number 264 December, 2016 Sinitic Language and Script in East Asia: Past and Present edited by Victor H. Mair Victor H. Mair, Editor Sino-Platonic Papers Department of East Asian Languages and Civilizations University of Pennsylvania Philadelphia, PA 19104-6305 USA [email protected] www.sino-platonic.org SINO-PLATONIC PAPERS FOUNDED 1986 Editor-in-Chief VICTOR H. MAIR Associate Editors PAULA ROBERTS MARK SWOFFORD ISSN 2157-9679 (print) 2157-9687 (online) SINO-PLATONIC PAPERS is an occasional series dedicated to making available to specialists and the interested public the results of research that, because of its unconventional or controversial nature, might otherwise go unpublished. The editor-in-chief actively encourages younger, not yet well established, scholars and independent authors to submit manuscripts for consideration. Contributions in any of the major scholarly languages of the world, including romanized modern standard Mandarin (MSM) and Japanese, are acceptable. In special circumstances, papers written in one of the Sinitic topolects (fangyan) may be considered for publication. Although the chief focus of Sino-Platonic Papers is on the intercultural relations of China with other peoples, challenging and creative studies on a wide variety of philological subjects will be entertained. This series is not the place for safe, sober, and stodgy presentations. Sino- Platonic Papers prefers lively work that, while taking reasonable risks to advance the field, capitalizes on brilliant new insights into the development of civilization. Submissions are regularly sent out to be refereed, and extensive editorial suggestions for revision may be offered. Sino-Platonic Papers emphasizes substance over form.
    [Show full text]
  • The Twentieth-Century Secularization of the Sinograph in Vietnam, and Its Demotion from the Cosmological to the Aesthetic
    Journal of World Literature 1 (2016) 275–293 brill.com/jwl The Twentieth-Century Secularization of the Sinograph in Vietnam, and its Demotion from the Cosmological to the Aesthetic John Duong Phan* Rutgers University [email protected] Abstract This article examines David Damrosch’s notion of “scriptworlds”—spheres of cultural and intellectual transfusion, defined by a shared script—as it pertains to early mod- ern Vietnam’s abandonment of sinographic writing in favor of a latinized alphabet. The Vietnamese case demonstrates a surprisingly rapid readjustment of deeply held attitudes concerning the nature of writing, in the wake of the alphabet’s meteoric suc- cesses. The fluidity of “language ethics” in early modern Vietnam (a society that had long since developed vernacular writing out of an earlier experience of diglossic liter- acy) suggests that the durability of a “scriptworld” depends on the nature and history of literacy in the societies under question. Keywords Vietnamese literature – Vietnamese philology – Chinese philology – script reform – language reform – early modern East Asia/Southeast Asia Introduction In 1919, the French-installed emperor Khải Định permanently dismantled Viet- nam’s civil service examinations, and with them, an entire system of political * I would like to thank Wynn Wilcox, Nguyễn Tuấn Cường, Tuna Artun, and Jack Chia for their valuable help and input. All errors are my own. © koninklijke brill nv, leiden, 2016 | doi: 10.1163/24056480-00102010 Downloaded from Brill.com09/25/2021 07:54:41AM via free access 276 phan selection based on proficiency in Literary Sinitic.1 That same year, using a con- troversial latinized alphabet called Quốc Ngữ (lit.
    [Show full text]
  • Chinese Script Generation Panel Document
    Chinese Script Generation Panel Document Proposal for the Generation Panel for the Chinese Script Label Generation Ruleset for the Root Zone 1. General Information Chinese script is the logograms used in the writing of Chinese and some other Asian languages. They are called Hanzi in Chinese, Kanji in Japanese and Hanja in Korean. Since the Hanzi unification in the Qin dynasty (221-207 B.C.), the most important change in the Chinese Hanzi occurred in the middle of the 20th century when more than two thousand Simplified characters were introduced as official forms in Mainland China. As a result, the Chinese language has two writing systems: Simplified Chinese (SC) and Traditional Chinese (TC). Both systems are expressed using different subsets under the Unicode definition of the same Han script. The two writing systems use SC and TC respectively while sharing a large common “unchanged” Hanzi subset that occupies around 60% in contemporary use. The common “unchanged” Hanzi subset enables a simplified Chinese user to understand texts written in traditional Chinese with little difficulty and vice versa. The Hanzi in SC and TC have the same meaning and the same pronunciation and are typical variants. The Japanese kanji were adopted for recording the Japanese language from the 5th century AD. Chinese words borrowed into Japanese could be written with Chinese characters, while Japanese words could be written using the character for a Chinese word of similar meaning. Finally, in Japanese, all three scripts (kanji, and the hiragana and katakana syllabaries) are used as main scripts. The Chinese script spread to Korea together with Buddhism from the 2nd century BC to the 5th century AD.
    [Show full text]
  • A Comparative Analysis of the Simplification of Chinese Characters in Japan and China
    CONTRASTING APPROACHES TO CHINESE CHARACTER REFORM: A COMPARATIVE ANALYSIS OF THE SIMPLIFICATION OF CHINESE CHARACTERS IN JAPAN AND CHINA A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ASIAN STUDIES AUGUST 2012 By Kei Imafuku Thesis Committee: Alexander Vovin, Chairperson Robert Huey Dina Rudolph Yoshimi ACKNOWLEDGEMENTS I would like to express deep gratitude to Alexander Vovin, Robert Huey, and Dina R. Yoshimi for their Japanese and Chinese expertise and kind encouragement throughout the writing of this thesis. Their guidance, as well as the support of the Center for Japanese Studies, School of Pacific and Asian Studies, and the East-West Center, has been invaluable. i ABSTRACT Due to the complexity and number of Chinese characters used in Chinese and Japanese, some characters were the target of simplification reforms. However, Japanese and Chinese simplifications frequently differed, resulting in the existence of multiple forms of the same character being used in different places. This study investigates the differences between the Japanese and Chinese simplifications and the effects of the simplification techniques implemented by each side. The more conservative Japanese simplifications were achieved by instating simpler historical character variants while the more radical Chinese simplifications were achieved primarily through the use of whole cursive script forms and phonetic simplification techniques. These techniques, however, have been criticized for their detrimental effects on character recognition, semantic and phonetic clarity, and consistency – issues less present with the Japanese approach. By comparing the Japanese and Chinese simplification techniques, this study seeks to determine the characteristics of more effective, less controversial Chinese character simplifications.
    [Show full text]
  • Proposal for a Korean Script Root Zone LGR 1 General Information
    (internal doc. #: klgp220_101f_proposal_korean_lgr-25jan18-en_v103.doc) Proposal for a Korean Script Root Zone LGR LGR Version 1.0 Date: 2018-01-25 Document version: 1.03 Authors: Korean Script Generation Panel 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Korean Script LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document below: • proposal-korean-lgr-25jan18-en.xml Labels for testing can be found in the accompanying text document below: • korean-test-labels-25jan18-en.txt In Section 3, we will see the background on Korean script (Hangul + Hanja) and principal language using it, i.e., Korean language. The overall development process and methodology will be reviewed in Section 4. The repertoire and variant groups in K-LGR will be discussed in Sections 5 and 6, respectively. In Section 7, Whole Label Evaluation Rules (WLE) will be described and then contributors for K-LGR are shown in Section 8. Several appendices are included with separate files. proposal-korean-lgr-25jan18-en 1 / 73 1/17 2 Script for which the LGR is proposed ISO 15924 Code: Kore ISO 15924 Key Number: 287 (= 286 + 500) ISO 15924 English Name: Korean (alias for Hangul + Han) Native name of the script: 한글 + 한자 Maximal Starting Repertoire (MSR) version: MSR-2 [241] Note.
    [Show full text]
  • The Japanese Writing Systems, Script Reforms and the Eradication of the Kanji Writing System: Native Speakers’ Views Lovisa Österman
    The Japanese writing systems, script reforms and the eradication of the Kanji writing system: native speakers’ views Lovisa Österman Lund University, Centre for Languages and Literature Bachelor’s Thesis Japanese B.A. Course (JAPK11 Spring term 2018) Supervisor: Shinichiro Ishihara Abstract This study aims to deduce what Japanese native speakers think of the Japanese writing systems, and in particular what native speakers’ opinions are concerning Kanji, the logographic writing system which consists of Chinese characters. The Japanese written language has something that most languages do not; namely a total of ​ ​ three writing systems. First, there is the Kana writing system, which consists of the two syllabaries: Hiragana and Katakana. The two syllabaries essentially figure the same way, but are used for different purposes. Secondly, there is the Rōmaji writing system, which is Japanese written using latin letters. And finally, there is the Kanji writing system. Learning this is often at first an exhausting task, because not only must one learn the two phonematic writing systems (Hiragana and Katakana), but to be able to properly read and write in Japanese, one should also learn how to read and write a great amount of logographic signs; namely the Kanji. For example, to be able to read and understand books or newspaper without using any aiding tools such as dictionaries, one would need to have learned the 2136 Jōyō Kanji (regular-use Chinese characters). With the twentieth century’s progress in technology, comparing with twenty years ago, in this day and age one could probably theoretically get by alright without knowing how to write Kanji by hand, seeing as we are writing less and less by hand and more by technological devices.
    [Show full text]
  • The Chinese Script T � * 'L
    Norman, Jerry, Chinese, Cambridge: Cambridge University Press, 1988. 1 3.1 Th e beginnings of Chinese writing 59 3 FISH HORSE ELEPHANT cow (yu) (m ii) (xiimg) (niu) " The Chinese script t � * 'l Figure 3.1. Pictographs in early Chinese writing 3.1 The beginnings of Chinese writing1 The Chinese script appears as a fully developed writing system in the late Shang .dynasty (fourteenth to eleventh centuries BC). From this period we have copious examples of the script inscribed or written on bones and tortoise shells, for the most part in the form of short divinatory texts. From the same period there also Figure 3.2. The graph fo r quiin'dog' exist a number of inscriptions on bronze vessels of various sorts. The former type of graphic record is referred to as the oracle bone script while the latter is com­ of this sort of graph are shown in Figure 3.1. The more truly representational a monly known· as the bronze script. The script of this period is already a fully graph is, the more difficult and time-consuming it is to depict. There is a natural developed writing system, capable of recording the contemporary Chinese lan­ tendency for such graphs to become progressively simplified and stylized as a guage in a complete and unambiguous manner. The maturity of this early script writing system matures and becomes more widely used. As a result, pictographs has suggested to many scholars that it must have passed through a fairly long gradually tend to lose their obvious pictorial quality. The graph for qui'in 'dog' period of development before reaching this stage, but the few examples of writing shown in Figure 3.2 can serve as a good illustration of this sort of development.
    [Show full text]
  • Korean Northwest Islands (Working File)” of the NSC East Asian and Pacific Affairs Staff: Files, 1969-1977 at the Gerald R
    The original documents are located in Box 36, folder “Korean Northwest Islands (Working File)” of the NSC East Asian and Pacific Affairs Staff: Files, 1969-1977 at the Gerald R. Ford Presidential Library. Copyright Notice The copyright law of the United States (Title 17, United States Code) governs the making of photocopies or other reproductions of copyrighted material. Gerald R. Ford donated to the United States of America his copyrights in all of his unpublished writings in National Archives collections. Works prepared by U.S. Government employees as part of their official duties are in the public domain. The copyrights to materials written by other individuals or organizations are presumed to remain with them. If you think any of the information displayed in the PDF is subject to a valid copyright claim, please contact the Gerald R. Ford Presidential Library. Digitized from Box 36 of NSC East Asian and Pacific Affairs Staff: Files, 1969-1977 at the Gerald R. Ford Presidential Library 0 .onc:h'On KuUm-ni • I CIIANGAOAE·KKUT Koa6na• ..!!lliitn-nl S~K-TO .Chlha-rl f'i DEMARCATION liNE NOR TI-l KOREA CH'0-00 0 s.riw0n •1ch'&l stnch'~ P"yOnas-n• Solu:h'o • Slnwim-nl• .vanau Hwacb'On,.. YancYana. Haeju -, Q \ PAENGNYi!JIG- 00 0 TAECH'~HG- 'i'6nan 0 Ch'unch'&n Chumun)ln• 00 ROK ARMY '\ SECTOR SOCH'ONG-00 TUNGSAN-GOT ---;-........ Y~NP'YONG-YOLTO ROKMARINE ~ SECTOR ~ SO UTI-I ~ SEOUL Yancp'y6n•QmnH]" YONGYU-00 Ytt«;tt'6N .~6Ju' .o.an TOKCHOK-KUNDO •Chech'bn KOREA DEMILITARIZED ZONE AREA .Ans6na ~i----.-I;,;O;..,..-.....,..l2,;.0_..,..._;30;a:1 1 ---""4,0 Mdes o t'o io Jb I 4b ~llomettrs oslloan .r..
    [Show full text]
  • Most-Common-Surnames-Bmd-Registers-16.Pdf
    Most Common Surnames Surnames occurring most often in Scotland's registers of Births, Marriages and Deaths Counting only the surname of the child for births, the surnames of BOTH PARTIES (for example both BRIDE and GROOM) for marriages, and the surname of the deceased for deaths Note: the surnames from these registers may not be representative of the surnames of the population of Scotland as a whole, as (a) they include the surnames of non-residents who were born / married / died here; (b) they exclude the surnames of residents who were born / married / died elsewhere; and (c) some age-groups have very low birth, marriage and death rates; others account for most births, marriages and deaths.ths Registration Year = 2016 Position Surname Number 1 SMITH 2056 2 BROWN 1435 3 WILSON 1354 4 CAMPBELL 1147 5 STEWART 1139 6 THOMSON 1127 7 ROBERTSON 1088 8 ANDERSON 1001 9 MACDONALD 808 10 TAYLOR 782 11 SCOTT 771 12 REID 755 13 MURRAY 754 14 CLARK 734 15 WATSON 642 16 ROSS 629 17 YOUNG 608 18 MITCHELL 601 19 WALKER 589 20= MORRISON 587 20= PATERSON 587 22 GRAHAM 569 23 HAMILTON 541 24 FRASER 529 25 MARTIN 528 26 GRAY 523 27 HENDERSON 522 28 KERR 521 29 MCDONALD 520 30 FERGUSON 513 31 MILLER 511 32 CAMERON 510 33= DAVIDSON 506 33= JOHNSTON 506 35 BELL 483 36 KELLY 478 37 DUNCAN 473 38 HUNTER 450 39 SIMPSON 438 40 MACLEOD 435 41 MACKENZIE 434 42 ALLAN 432 43 GRANT 429 44 WALLACE 401 45 BLACK 399 © Crown Copyright 2017 46 RUSSELL 394 47 JONES 392 48 MACKAY 372 49= MARSHALL 370 49= SUTHERLAND 370 51 WRIGHT 357 52 GIBSON 356 53 BURNS 353 54= KENNEDY 347
    [Show full text]