Measuring Synonyms in Erya by Probabilistic Semantics

Total Page:16

File Type:pdf, Size:1020Kb

Kam Tang LAU 1, Yan SONG 2, Pang Fei KWOK 3

1 College of Professional and Continuing Education, The Hong Kong Polytechnic University, Hong Kong, China. Email: [email protected]
2 Microsoft Search Technology Center Asia, Beijing, China. Email: [email protected]
3 Department of Chinese and History, City University of Hong Kong, Hong Kong, China. Email: [email protected]

Abstract

Erya is the earliest Chinese dictionary; it was edited and circulated before the Qin Dynasty (221 BC). It was used to annotate the Confucian Classics, so the words collected in Erya come from the pre-Qin Confucian Classics. It is recognized as one of the most important works of Chinese philology. Erya was later listed as one of the Shisan Jing (Thirteen Confucian Classics), which shows that it is regarded as being as important as the other Confucian Classics compiled in the pre-Qin period. Erya resembles a synonym thesaurus: it lists synonyms in parallel and explains them by the last word, followed by Ye. ("Ye" functions as the copula "is" in Classical Chinese and is placed after the explaining word.) Erya uses two main explanation patterns: one is "A, B, C, D Ye"; the other is "E, F Ye, F, G Ye." Many Chinese philology scholars have discussed why the editors did not simply record "E, F, G Ye". They explained this phenomenon with extensive evidence and examples from the Confucian Classics, and held that although E and F are both synonyms of G, F is closer to G than E is. This research uses computational methods to test whether that conclusion is valid, which offers a more scientific and precise way to explain the distance between the words in Erya.
We use the prevailing neural-network-based word-to-vector modeling (Mikolov et al., 2013) to build a vector for every word in Erya and investigate the distance (probabilistic semantics) among the words in question. The continuous bag-of-words (CBOW) framework is used to train the word vectors, so that we can manipulate them in further computations, such as vector summation and concatenation, to investigate word contexts. To this end, the entire Erya is preprocessed by word segmentation before our investigation, following the process we used for Huainanzi (Lau et al., 2013).

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean, 2013. Efficient Estimation of Word Representations in Vector Space. In Proceedings of Workshop at ICLR, 2013.

Kam Tang Lau, Yan Song, and Fei Xia, 2013. The Construction of a Segmented and Part-of-speech Tagged Archaic Chinese Corpus: A Case Study on Huainanzi (in Chinese). In Proceedings of the 12th China National Conference on Computational Linguistics (CNCCL 2013), Suzhou, China, Oct, 2013.
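The comparison described in the abstract, whether F lies closer to G than E does, can be sketched as a cosine-distance test over word vectors. The following is only a minimal illustration, not the authors' implementation: the three-dimensional vectors are hypothetical toy values standing in for trained CBOW embeddings, and the helper names (`cosine_distance`, `closer_to_gloss`) are assumptions introduced here. Cosine distance is one common choice of measure; the abstract does not say which distance the authors use.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cosine_distance(u, v):
    """1 - cosine similarity: 0 for parallel vectors, 1 for orthogonal ones."""
    return 1.0 - cosine_similarity(u, v)


def closer_to_gloss(vectors, near, far, gloss):
    """True if `near` is closer to `gloss` than `far` is."""
    d_near = cosine_distance(vectors[near], vectors[gloss])
    d_far = cosine_distance(vectors[far], vectors[gloss])
    return d_near < d_far


# Hypothetical toy vectors (NOT trained on Erya) for the pattern
# "E, F Ye, F, G Ye": E is glossed by F, and F is glossed by G.
vectors = {
    "E": [0.9, 0.1, 0.3],
    "F": [0.4, 0.8, 0.5],
    "G": [0.3, 0.9, 0.6],
}

# With these toy values F is much closer to G than E is, which is the
# relation the philological account predicts for real embeddings.
print(closer_to_gloss(vectors, near="F", far="E", gloss="G"))
```

With real data, the `vectors` dictionary would be filled from a CBOW model trained on the segmented corpus, and the same test would be run for every "E, F Ye, F, G Ye" chain.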
Recommended publications
  • Orthography of Early Chinese Writing: Evidence from Newly Excavated Manuscripts
    IMRE GALAMBOS. ORTHOGRAPHY OF EARLY CHINESE WRITING: EVIDENCE FROM NEWLY EXCAVATED MANUSCRIPTS. BUDAPEST MONOGRAPHS IN EAST ASIAN STUDIES, SERIES EDITOR: IMRE HAMAR. DEPARTMENT OF EAST ASIAN STUDIES, EÖTVÖS LORÁND UNIVERSITY, BUDAPEST 2006. The present volume was published with the support of the Chiang Ching-kuo Foundation. © Imre Galambos, 2006. ISBN 963 463 811 2. ISSN 1787-7482. Responsible for the edition: Imre Hamar. Published by Balassi Kiadó. Printing by Dabas-Jegyzet Kft.; responsible manager: Marosi Györgyné, managing director. CONTENTS: Acknowledgements; Introduction; Chapter One: Former Understandings; 1.1 Traditional views; 1.1.1 Ganlu Zishu; 1.1.2 Hanjian; 1.2 Modern views; 1.2.1 Noel Barnard
  • Generic Subject Guide
    DiMenna-Nyselius Library. Chinese Language & Literature. Sections: Campus Resources; Directories and Organizations; Dictionaries and Encyclopedias; Almanacs, Handbooks, and Statistics; Library Catalog; Article Indexes and Databases; Primary Documents and Images; Electronic Texts and Journals; Internet Resources; Bibliographies and Reviews; Course Homepage and Course Reserves; Biographical and Geographical Resources; Tips on Using Your Information. Campus Resources: Fairfield University Language Resource Center (LARC) http://www.faculty.fairfield.edu/jgoldfield/LARC030506.htm. Dictionaries & Encyclopedias: Chinese-English Dictionary of Modern Usage http://www.arts.cuhk.edu.hk/Lexis/Lindict/; Chinese Characters Dictionary Web http://zhongwen.com/zi.htm; Online Chinese Dictionaries http://www.chinalanguage.com/Language/Dictionaries/; ABC Chinese-English comprehensive dictionary: alphabetically based computerized, Ref. PL1455.A33 2003; A Dictionary of Chinese mythology, Ref. BL1801.W35 1961; Dictionary of 1000 Chinese idioms, Ref. PL1273.D53 2000; Mathews’ Chinese-English dictionary, Ref. PL1455.M42 1969; A New English-Chinese dictionary, Ref. PL1455.N49 1984; Oxford Chinese dictionary: English-Chinese, Chinese-English, Ref. PL1455.O94 2003, with an accompanying CD-ROM at the Media Desk, Main Level; The Pinyin Chinese-English dictionary, Ref. PL1455.P55 1979; The Cambridge encyclopedia of China, Ref. DS705.C35 1991; Encyclopedia of China: the essential reference to China, its history and culture, Ref. DS705.P47 1999; Modern China: an encyclopedia of history, culture,
  • Singapore Mandarin Chinese : Its Variations and Studies
    This document is downloaded from DR‑NTU (https://dr.ntu.edu.sg) Nanyang Technological University, Singapore. Singapore Mandarin Chinese : its variations and studies Lin, Jingxia; Khoo, Yong Kang 2018 Lin, J., & Khoo, Y. K. (2018). Singapore Mandarin Chinese : its variations and studies. Chinese Language and Discourse, 9(2), 109‑135. doi:10.1075/cld.18007.lin https://hdl.handle.net/10356/136920 https://doi.org/10.1075/cld.18007.lin © 2018 John Benjamins Publishing Company. All rights reserved. This paper was published in Chinese Language and Discourse and is made available with permission of John Benjamins Publishing Company. Downloaded on 26 Sep 2021 00:28:12 SGT To appear in Chinese Language and Discourse (2018) Singapore Mandarin Chinese: Its Variations and Studies* Jingxia Lin and Yong Kang Khoo Nanyang Technological University Abstract Given the historical and linguistic contexts of Singapore, it is both theoretically and practically significant to study Singapore Mandarin (SM), an important member of Global Chinese. This paper aims to present a relatively comprehensive linguistic picture of SM by overviewing current studies, particularly on the variations that distinguish SM from other Mandarin varieties, and to serve as a reference for future studies on SM. This paper notes that (a) current studies have often provided general descriptions of the variations, but less on individual variations that may lead to more theoretical discussions; (b) the studies on SM are primarily based on the comparison with Mainland China Mandarin; (c) language contact has been taken as the major contributor of the variation in SM, whereas other factors are often neglected; and (d) corpora with SM data are comparatively less developed and the evaluation of data has remained largely in descriptive statistics.
  • Images of Women in Chinese Literature. Volume 1
    DOCUMENT RESUME. ED 385 489; SO 025 360. AUTHOR: Yu-ning, Li, Ed. TITLE: Images of Women in Chinese Literature. Volume 1. REPORT NO: ISBN-1-880938-008. PUB DATE: 94. NOTE: 240p. AVAILABLE FROM: Johnson & Associates, 257 East South St., Franklin, IN 46131-2422 (paperback: $25; clothbound: ISBN-1-880938-008, $39; shipping: $3 first copy, $0.50 each additional copy). PUB TYPE: Books (010); Reports, Descriptive (141). EDRS PRICE: MF01/PC10 Plus Postage. DESCRIPTORS: *Chinese Culture; *Cultural Images; Females; Folk Culture; Foreign Countries; Legends; Mythology; Role Perception; Sexism in Language; Sex Role; *Sex Stereotypes; Sexual Identity; *Womens Studies; World History; *World Literature. IDENTIFIERS: *Asian Culture; China; *Chinese Literature. ABSTRACT: This book examines the ways in which Chinese literature offers a vast array of prospects, new interpretations, new fields of study, and new themes for the study of women. As a result of the global movement toward greater recognition of gender equality and human dignity, the study of women as portrayed in Chinese literature has a long and rich history. A single volume cannot cover the enormous field, but this volume offers a starting point for further research. Several renowned Chinese writers and researchers contributed to the book. The volume includes the following: (1) Introduction (Li Yu-ning); (2) Concepts of Redemption and Fall through Woman as Reflected in Chinese Literature (Tsung Su); (3) The Poems of Li Qingzhao (1084-1141) (Kai-yu Hsu); (4) Images of Women in Yuan Drama (Fan Pen Chen); (5) The Vanguards--The Truncated Stage (The Women of Lu Yin, Bing Xin, and Ding Ling) (Liu Nienling); (6) New Woman vs.
  • Chinese Writing: Ancient Autochthonous Perspective
    Chinese writing: ancient autochthonous perspective. Françoise Bottéro, CRLAO-CNRS-EHESS, Paris. In order to compare what could also be compared between ancient writing systems and enrich our problematics, I would like to provide another possible perspective. Instead of sign structure or the origins of writing, I would like to discuss some more theoretical or analytical points such as, for example, the kinds of reflections made by the ancient people on their own writing system. Indeed, it could be worth comparing how and when ancient scribes/scholars started asking questions about their writing system: how they conceived the origins of their writing system, what kind of reflections and theories they elaborated on their script, how they classified and analysed the written signs, what kind of lexical lists they provided, when they started using a special terminology, etc. This could help us understand the constraints imposed by the so-called logographic writing system as opposed to phonetic writing systems, and the ways ancient scribes/lexicographers used to handle them. 1. The origins of writing in ancient China. The oldest mention of the origins of writing in China goes back to the 3rd century BC. The great philosopher Xu2nzi3 荀子 (335-238) and his disciple Ha2n Fe1i 韓非 (280-233) present Ca1ng Jie2 倉頡 as someone specialised in writing and eventually as the inventor of writing. The heroic invention of writing progressively developed, and around the 1st century AD, Ca1ng Jie2 had become the scribe of the mythical “Yellow Emperor” Hua2ngdi4, who drew his inspiration to invent writing from the marks left by the birds and the beasts on the soil.
  • Chinese Literature in the Second Half of a Modern Century: a Critical Survey
    CHINESE LITERATURE IN THE SECOND HALF OF A MODERN CENTURY: A CRITICAL SURVEY. Edited by PANG-YUAN CHI and DAVID DER-WEI WANG. INDIANA UNIVERSITY PRESS • BLOOMINGTON AND INDIANAPOLIS. William Tay’s “Colonialism, the Cold War Era, and Marginal Space: The Existential Condition of Five Decades of Hong Kong Literature,” Li Tuo’s “Resistance to Modernity: Reflections on Mainland Chinese Literary Criticism in the 1980s,” and Michelle Yeh’s “Death of the Poet: Poetry and Society in Contemporary China and Taiwan” first appeared in the special issue “Contemporary Chinese Literature: Crossing the Boundaries” (edited by Yvonne Chang) of Literature East and West (1995). Jeffrey Kinkley’s “A Bibliographic Survey of Publications on Chinese Literature in Translation from 1949 to 1999” first appeared in Choice (April 1994; copyright by the American Library Association). All of the essays have been revised for this volume. This book is a publication of Indiana University Press, 601 North Morton Street, Bloomington, IN 47404-3797 USA. http://www.indiana.edu/~iupress Telephone orders 800-842-6796. Fax orders 812-855-7931. Orders by e-mail [email protected] © 2000 by David D. W. Wang. All rights reserved. No part of this book may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and recording, or by any information storage and retrieval system, without permission in writing from the publisher. The Association of American University Presses’ Resolution on Permissions constitutes the only exception to this prohibition. The paper used in this publication meets the minimum requirements of American National Standard for Information Sciences - Permanence of Paper for Printed Library Materials, ANSI Z39.48-1984.
  • The Art of Lexicography - Niladri Sekhar Dash
    LINGUISTICS - The Art of Lexicography - Niladri Sekhar Dash. THE ART OF LEXICOGRAPHY. Niladri Sekhar Dash, Linguistic Research Unit, Indian Statistical Institute, Kolkata, India. Keywords: Lexicology, linguistics, grammar, encyclopedia, normative, reference, history, etymology, learner’s dictionary, electronic dictionary, planning, data collection, lexical extraction, lexical item, lexical selection, typology, headword, spelling, pronunciation, etymology, morphology, meaning, illustration, example, citation. Contents: 1. Introduction; 2. Definition; 3. The History of Lexicography; 4. Lexicography and Allied Fields; 4.1. Lexicology and Lexicography; 4.2. Linguistics and Lexicography; 4.3. Grammar and Lexicography; 4.4. Encyclopedia and Lexicography; 5. Typological Classification of Dictionary; 5.1. General Dictionary; 5.2. Normative Dictionary; 5.3. Referential or Descriptive Dictionary; 5.4. Historical Dictionary; 5.5. Etymological Dictionary; 5.6. Dictionary of Loanwords; 5.7. Encyclopedic Dictionary; 5.8. Learner's Dictionary; 5.9. Monolingual Dictionary; 5.10. Special Dictionaries; 6. Electronic Dictionary; 7. Tasks for Dictionary Making; 7.1. Planning; 7.2. Data Collection; 7.3. Extraction of Lexical Items; 7.4. Selection of Lexical Items; 7.5. Mode of Lexical Selection; 8. Dictionary Making: General Dictionary; 8.1. Headwords; 8.2. Spelling; 8.3. Pronunciation; 8.4. Etymology; 8.5. Morphology and Grammar; 8.6. Meaning; 8.7. Illustrative Examples and Citations; 9. Conclusion; Acknowledgements; Glossary; Bibliography; Biographical Sketch. ©Encyclopedia of Life Support Systems (EOLSS). Summary: The art of dictionary making is as old as the field of linguistics. People started to cultivate this field from the very early age of our civilization, probably seven to eight hundred years before the Christian era.
  • Concise English-Chinese Chinese-English Dictionary Free
    FREE CONCISE ENGLISH-CHINESE CHINESE-ENGLISH DICTIONARY EBOOK. Manser H. Martin | 696 pages | 01 Jan 2011 | Commercial Press,The,China | 9787100059459 | English, Chinese | China. Excerpt from A Concise Chinese-English Dictionary for Lovers | Penguin Random House Canada. Notable Chinese dictionaries, past and present, include:. From Wikipedia, the free encyclopedia. Wikipedia list article. Dictionaries of Chinese. List of Concise English-Chinese Chinese-English Dictionary dictionaries. First Chinese dictionary collated in single-sort alphabetical order of pinyin, John DeFrancis. A Chinese-English Dictionary. Herbert Allen Giles' bestselling dictionary, 2nd ed. A Dictionary of the Chinese Language. A Syllabic Dictionary of the Chinese Language. Small Seal Script orthographic primer, Li Si's language reform. Chinese Concise English-Chinese Chinese-English Dictionary English Dictionary. Popular modern general-purpose encyclopedic dictionary, 6 editions. Concise Dictionary of Spoken Chinese. Tetsuji Morohashi's Chinese-Japanese character dictionary, 50, entries. Oldest extant Chinese dictionary, semantic field collation, one of the Thirteen Classics. Yang Xiong, first dictionary of Chinese regional varieties. Le Grand Ricci or Grand dictionnaire Ricci de la langue chinoise. First orthography dictionary of the regular script. Grammata Serica Recensa. Great Dictionary of Modern Chinese Dialects. Compendium of dictionaries for 42 local varieties of Chinese. Zhang Yi's supplement to the Erya. Rime dictionary expansion of Qieyun, source for reconstruction of Middle Chinese.
  • Introduce to the Non-Symmetry of Word Derivation Between 'Wenhua' and 'Culture'
    International Journal of Linguistics and Communication June 2015, Vol. 3, No. 1, pp. 145-150 ISSN: 2372-479X (Print) 2372-4803 (Online) Copyright © The Author(s). All Rights Reserved. Published by American Research Institute for Policy Development DOI: 10.15640/ijlc.v3n1a15 URL: http://dx.doi.org/10.15640/ijlc.v3n1a15 Introduce to the Non-symmetry of Word Derivation between ‘Wenhua’ and ‘Culture’ Fei Deng1 & Jianli Tang2 Abstract The western civilizations have brought many new thoughts to Chinese language in the 19th century, and further influenced and changed the words’ meanings of Chinese words. To studying the Chinese-English translated terms and knowing their different derivation can bridge the gap between different cultures. The scholar translated the English word ‘Culture’ into Chinese word ‘Wenhua’. ‘Wenhua’ is derived from two words which are ‘Wen’ and ‘Hua’ in ancient Chinese language. ‘Wen’ means ‘to feel and experience the nature by heart, and then painting and describing things, at last, one can achieve correlating knowledge through this process in ancient Chinese language’. ‘Hua’ means ‘to change’ which stresses persons can improve them based on ‘Wen’. The start-point of meanings emphasizes the harmonious relationship between human beings and outer nature, and stresses the inner harmony of human beings by themselves. Namely, ‘Wenhua’ focuses on the all sorts of social and ethical relationship which comes from the inner human beings society. The start- point of meanings derivation of English word ‘Culture’ emphasizes the interrelation between human being society and the outer nature, and focuses the contrary relationship which embody as the remolding one and the reformed one.
  • The Original Meaning of the Chinese Character for “Beauty”
    Filozofski vestnik, Letnik/Volume XXII • Številka/Number 2 • 2001 • 141-159. THE ORIGINAL MEANING OF THE CHINESE CHARACTER FOR “BEAUTY”. Jianping Gao. “Beauty” is translated into Chinese as 美 (mei) and “Aesthetics” as 美学 (meixue) (literally meaning the study of beauty). The compound meixue is new in Chinese and its origin is due to translation in modern times. But indigenous to China is the word mei (beauty), which appeared as early as more than 3000 years ago. The very first question in aesthetics was probably “what is beauty?” The concept of beauty in the mind of the ancient Chinese is not necessarily identical with that in the mind of modern people, but an investigation of it may be of some interest to today’s aesthetic inquiry, and, as we shall see, it already attracts the attention of some scholars in the fields of both linguistics and aesthetics. “美” (beauty) is traditionally considered to be composed of two characters: 羊 (sheep) and 大 (large). A large sheep will supply plenty of delicious meat. This explanation comes from Shuowen Jiezi (100 A.D.), a pioneering book on the research of Chinese characters: 美 (beauty) means delicious. It is composed of 羊 (sheep) and 大 (large). Among six domestic animals (cow, horse, sheep, pig, hen, and dog), sheep are the major sacrificial offerings. Beauty is identical with goodness. [1] This opinion was accepted by almost all philologists in ancient China, such as Xu Xuan (917 - 992), Xu Kai (920-974), Duan Yucai (1735 - 1815), Wang Yun (1784 - 1854), and Zhu Junsheng (1788 - 1858), who provided [1] Shuowen Jiezi (literally “a description of simple characters and explanation of complex characters”) is a dictionary-like book which was intended to explain Chinese characters on the basis of their forms.
  • A History of Chinese Letters and Epistolary Culture
    A History of Chinese Letters and Epistolary Culture. Edited by Antje Richter. LEIDEN | BOSTON. For use by the Author only | © 2015 Koninklijke Brill NV. Contents: Acknowledgements; List of Illustrations; Abbreviations; About the Contributors; Introduction: The Study of Chinese Letters and Epistolary Culture, by Antje Richter. PART 1, Material Aspects of Chinese Letter Writing Culture: 1 Reconstructing the Postal Relay System of the Han Period, by Y. Edmund Lien; 2 Letters as Calligraphy Exemplars: The Long and Eventful Life of Yan Zhenqing’s (709–785) Imperial Commissioner Liu Letter, by Amy McNair; 3 Chinese Decorated Letter Papers, by Suzanne E. Wright; 4 Material and Symbolic Economies: Letters and Gifts in Early Medieval China, by Xiaofei Tian. PART 2, Contemplating the Genre: 5 Letters in the Wen xuan, by David R. Knechtges; 6 Between Letter and Testament: Letters of Familial Admonition in Han and Six Dynasties China, by Antje Richter; 7 The Space of Separation: The Early Medieval Tradition of Four-Syllable “Presentation and Response” Poetry, by Zeb Raft; 8 Letters and Memorials in the Early Third Century: The Case of Cao Zhi, by Robert Joe Cutter; 9 Liu Xie’s Institutional Mind: Letters, Administrative Documents, and Political Imagination in Fifth- and Sixth-Century China, by Pablo Ariel Blitstein; 10 Bureaucratic Influences on Letters in Middle Period China: Observations from Manuscript Letters and Literati Discourse, by Lik Hang Tsui. PART 3, Diversity of Content and Style. Section 1, Informal Letters: 11 Private Letter Manuscripts from Early Imperial China, by Enno Giele; 12 Su Shi’s Informal Letters in Literature and Life, by Ronald Egan; 13 The Letter as Artifact of Sentiment and Legal Evidence, by Janet Theiss; 14 Infinite Variations of Writing and Desire: Love Letters in China and Europe, by Bonnie S.
  • Chinese-English Dictionary. By John DeFrancis. Honolulu: University of Hawai’i Press, 1996
    Language (Journal of the Linguistic Society of America), 74.3 (1998), p. 660-661. ABC (Alphabetically Based Computerized) Chinese-English dictionary. By John DeFrancis. Honolulu: University of Hawai’i Press, 1996. Pp. xix, 897 + radical charts. $28.00. Comprehensive, practical and reliable Chinese-English dictionaries are few and far between, so John DeFrancis’s ABC Chinese-English dictionary is definitely a welcome addition to current offerings. Unfortunately, the alphabetic arrangement of this work is, in the reviewer’s opinion, also its biggest drawback. While compounds sharing the same character are usually grouped under the same entry, D arranges all entries, including compounds, strictly alphabetically, disregarding morpheme boundaries. And while this may reflect an ideal of treating Chinese primarily as a spoken rather than written language, it also goes against native habit and intuition. To find all the compounds beginning with shí, ‘to realize’, for example, you must plow through numerous unrelated compounds with first elements also pronounced shí, in all the four tones, i.e. all the 70-some homophones. This makes using the work burdensome and time-consuming if you are doing anything but looking up a very specific compound you already know how to pronounce. D draws on other dictionaries, apparently most heavily on the standard-setter for the whole field, Wu Jingrong’s A Chinese-English dictionary, of which a revised edition has recently come out (Wei Dongya (ed.) 1995. Beijing: Foreign Language and Research Press). Yet, in the interest of producing a conveniently-sized volume, D’s lexicon omits quite a bit of information and detail included in Wu’s work.