Korean-Chinese Person Name Translation for Cross Language Information Retrieval
Total pages: 16
File type: PDF, size: 1020 KB
Recommended publications
-
A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2373–2380, Marseille, 11–16 May 2020. © European Language Resources Association (ELRA), licensed under CC-BY-NC. A Topic-Aligned Multilingual Corpus of Wikipedia Articles for Studying Information Asymmetry in Low Resource Languages. Dwaipayan Roy, Sumit Bhatia, Prateek Jain. GESIS - Cologne, IBM Research - Delhi, IIIT - Delhi. Abstract: Wikipedia is the largest web-based open encyclopedia covering more than three hundred languages. However, different language editions of Wikipedia differ significantly in terms of their information coverage. We present a systematic comparison of information coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian, Spanish and Turkish). We analyze the content present in the respective Wikipedias in terms of the coverage of topics as well as the depth of coverage of topics included in these Wikipedias. Our analysis quantifies and provides useful insights about the information gap that exists between different language editions of Wikipedia and offers a roadmap for the Information Retrieval (IR) community to bridge this gap. Keywords: Wikipedia, Knowledge base, Information gap. 1. Introduction: Wikipedia is the largest web-based encyclopedia covering … other with respect to the coverage of topics as well as the amount of information about overlapping topics. -
PARK JIN HYOK, Also Known As ("aka") "Jin Hyok Park," aka "Pak Jin Hek," Case 18-1479
AO 91 (Rev. 11/11) Criminal Complaint. UNITED STATES DISTRICT COURT for the Central District of California. Filed: Clerk, U.S. District Court, June 8, 2018, Central District of California. United States of America v. PARK JIN HYOK, also known as ("aka") "Jin Hyok Park," aka "Pak Jin Hek," Defendant. Case 18-1479. CRIMINAL COMPLAINT. I, the complainant in this case, state that the following is true to the best of my knowledge and belief. Beginning no later than September 2, 2014 and continuing through at least August 3, 2017, in the county of Los Angeles in the Central District of California, the defendant violated: Code Section and Offense Description: 18 U.S.C. § 371, Conspiracy; 18 U.S.C. § 1349, Conspiracy to Commit Wire Fraud. This criminal complaint is based on these facts: Please see attached affidavit. Continued on the attached sheet. /s/ Complainant's signature. Nathan P. Shields, Special Agent, FBI (printed name and title). Sworn to before me and signed in my presence. Date: [blank]. Rozella A. Oliver (Judge's signature). City and state: Los Angeles, California. Hon. Rozella A. Oliver, U.S. Magistrate Judge (printed name and title). AUSAs: Stephanie S. Christensen, x3756; Anthony J. Lewis, x1786; & Anil J. Antony, x6579. REC: Detention. Contents: I. INTRODUCTION; II. PURPOSE OF AFFIDAVIT; III. SUMMARY -
Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging
ORIGINAL ARTICLE, Neuroscience. http://dx.doi.org/10.3346/jkms.2014.29.10.1416 • J Korean Med Sci 2014; 29: 1416-1424. Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging. Zang-Hee Cho,1 Nambeom Kim,1 Sungbong Bae,2 Je-Geun Chi,1 Chan-Woong Park,1 Seiji Ogawa,1,3 and Young-Bo Kim1. 1Neuroscience Research Institute, Gachon University, Incheon, Korea; 2Department of Psychology, Yeungnam University, Kyongsan, Korea; 3Kansei Fukushi Research Institute, Tohoku Fukushi University, Sendai, Japan. Received: 14 February 2014. Accepted: 5 July 2014. Address for Correspondence: Young-Bo Kim, MD, Department of Neuroscience and Neurosurgery, Gachon … Abstract: The two basic scripts of the Korean writing system, Hanja (the logography of the traditional Korean character) and Hangul (the newer Korean alphabet), have been used together since the 14th century. While the Hanja character has its own morphemic base, Hangul is purely phonemic without a morphemic base. These two, therefore, have substantially different outcomes as a language as well as different neural responses. Based on these linguistic differences between Hanja and Hangul, we have launched two studies; the first was to find differences in cortical activation when it is stimulated by Hanja and Hangul reading to support the much discussed dual-route hypothesis of logographic and phonological routes in the brain by fMRI (Experiment 1). The second objective was to evaluate how Hanja and Hangul affect comprehension, and therefore recognition memory, specifically the effects of semantic transparency and morphemic clarity on memory consolidation and then related cortical activations, using functional magnetic resonance imaging (fMRI) (Experiment 2). The first fMRI experiment indicated relatively large areas of the brain are activated by Hanja reading compared to Hangul reading. -
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
Assessment of Options for Handling Full Unicode Character Encodings in MARC21: A Study for the Library of Congress. Part 1: New Scripts. Jack Cain, Senior Consultant, Trylus Computing, Toronto. 1 Purpose: This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire”: An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding scheme. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, and “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42, and 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8: "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode, which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one-byte tables in which each character is encoded using a single 8-bit byte. (It also includes the EACC set, which actually uses a fixed length of 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
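The ASCII example in the excerpt above, and its contrast between one-byte and multi-byte encodings, can be checked with a minimal Python sketch. This is only an illustration under stated assumptions: the helper names `ascii_hex` and `utf8_length` are hypothetical, UTF-8 is used as one common multi-byte encoding of Unicode, and MARC8 itself is not modeled because the Python standard library ships no MARC8 codec.

```python
def ascii_hex(text: str) -> list[str]:
    """Return the ASCII code of each character as two-digit hexadecimal."""
    # str.encode("ascii") raises UnicodeEncodeError for any character
    # outside the ASCII repertoire.
    return [format(byte, "02X") for byte in text.encode("ascii")]


def utf8_length(char: str) -> int:
    """Return how many bytes UTF-8 needs to encode a single character."""
    return len(char.encode("utf-8"))


if __name__ == "__main__":
    # "A", "B", "C" are in the ASCII repertoire with codes 41, 42, 43 (hex).
    print(ascii_hex("ABC"))   # ['41', '42', '43']
    # The same letter needs one byte in UTF-8; a Hangul syllable needs three.
    print(utf8_length("A"))   # 1
    print(utf8_length("한"))  # 3
```

The repertoire is the set of characters an encoding can represent at all; the encoding scheme is the mapping from those characters to byte values, which is why "한" fails under `ascii_hex` but succeeds under `utf8_length`.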
Sinitic Language and Script in East Asia: Past and Present
SINO-PLATONIC PAPERS, Number 264, December 2016. Sinitic Language and Script in East Asia: Past and Present, edited by Victor H. Mair. Victor H. Mair, Editor, Sino-Platonic Papers, Department of East Asian Languages and Civilizations, University of Pennsylvania, Philadelphia, PA 19104-6305 USA. www.sino-platonic.org. SINO-PLATONIC PAPERS, FOUNDED 1986. Editor-in-Chief: VICTOR H. MAIR. Associate Editors: PAULA ROBERTS, MARK SWOFFORD. ISSN 2157-9679 (print), 2157-9687 (online). SINO-PLATONIC PAPERS is an occasional series dedicated to making available to specialists and the interested public the results of research that, because of its unconventional or controversial nature, might otherwise go unpublished. The editor-in-chief actively encourages younger, not yet well established, scholars and independent authors to submit manuscripts for consideration. Contributions in any of the major scholarly languages of the world, including romanized modern standard Mandarin (MSM) and Japanese, are acceptable. In special circumstances, papers written in one of the Sinitic topolects (fangyan) may be considered for publication. Although the chief focus of Sino-Platonic Papers is on the intercultural relations of China with other peoples, challenging and creative studies on a wide variety of philological subjects will be entertained. This series is not the place for safe, sober, and stodgy presentations. Sino-Platonic Papers prefers lively work that, while taking reasonable risks to advance the field, capitalizes on brilliant new insights into the development of civilization. Submissions are regularly sent out to be refereed, and extensive editorial suggestions for revision may be offered. Sino-Platonic Papers emphasizes substance over form. -
Towards a Korean Dbpedia and an Approach for Complementing the Korean Wikipedia Based on Dbpedia
Towards a Korean DBpedia and an Approach for Complementing the Korean Wikipedia based on DBpedia. Eun-kyung Kim1, Matthias Weidl2, Key-Sun Choi1, Sören Auer2. 1 Semantic Web Research Center, CS Department, KAIST, Korea, 305-701; 2 Universität Leipzig, Department of Computer Science, Johannisgasse 26, D-04103 Leipzig, Germany. Abstract. In the first part of this paper we report about experiences when applying the DBpedia extraction framework to the Korean Wikipedia. We improved the extraction of non-Latin characters and extended the framework with pluggable internationalization components in order to facilitate the extraction of localized information. With these improvements we almost doubled the amount of extracted triples. We also will present the results of the extraction for Korean. In the second part, we present a conceptual study aimed at understanding the impact of international resource synchronization in DBpedia. In the absence of any information synchronization, each country would construct its own datasets and manage it from its users. Moreover the cooperation across the various countries is adversely affected. Keywords: Synchronization, Wikipedia, DBpedia, Multi-lingual. 1 Introduction: Wikipedia is the largest encyclopedia of mankind and is written collaboratively by people all around the world. Everybody can access this knowledge as well as add and edit articles. Right now Wikipedia is available in 260 languages and the quality of the articles reached a high level [1]. However, Wikipedia only offers full-text search for this textual information. For that reason, different projects have been started to convert this information into structured knowledge, which can be used by Semantic Web technologies to ask sophisticated queries against Wikipedia. -
Women's Life During the Chosŏn Dynasty
International Journal of Korean History (Vol. 6, Dec. 2004). Women’s Life during the Chosŏn Dynasty. Han Hee-sook* (* Professor, Dept. of Korean History, Sookmyung Women’s University). Introduction: The Chosŏn society was one in which the yangban (aristocracy) wielded tremendous power. The role of women in this society was influenced greatly by the yangban class’ attempts to establish a patriarchal family order and a Confucian-based society. For example, women were forced, in accordance with neo-Confucian ideology, to remain chaste before marriage and barred from remarrying once their husbands had passed away. As far as the marriage system was concerned, the Chosŏn era saw a move away from the old tradition of the man moving into his in-laws’ house following the wedding (男歸女家婚 namgwiyŏgahon), with the woman now expected to move in with her husband’s family following the marriage (親迎制度 ch’inyŏng jedo). Moreover, wives were rigidly divided into two categories: legitimate wife (ch’ŏ) and concubines (ch’ŏp). This period also saw a change in the legal standing of women with regards to inheritance, as the system was altered from the practice of equal, from a gender standpoint, rights to inheritance, to one in which the eldest son became the sole inheritor. These neo-Confucianist inspired changes contributed to the strengthening of the patriarchal system during the Chosŏn era. As a result of these changes, Chosŏn women’s rights and activities became increasingly restricted. During the Chosŏn dynasty women fell into one of the following classifications: female members of the royal family, such as the queen and the king’s concubines; members of the yangban class, the wives of the landed gentry; commoners, the majority of whom were engaged in agriculture; women in special professions, such as palace women, entertainers, shamans and physicians; and women from the lowborn class (ch’ŏnin), which usually referred to the yangban’s female slaves. -
The Twentieth-Century Secularization of the Sinograph in Vietnam, and Its Demotion from the Cosmological to the Aesthetic
Journal of World Literature 1 (2016) 275–293, brill.com/jwl. The Twentieth-Century Secularization of the Sinograph in Vietnam, and its Demotion from the Cosmological to the Aesthetic. John Duong Phan*, Rutgers University. Abstract: This article examines David Damrosch’s notion of “scriptworlds”—spheres of cultural and intellectual transfusion, defined by a shared script—as it pertains to early modern Vietnam’s abandonment of sinographic writing in favor of a latinized alphabet. The Vietnamese case demonstrates a surprisingly rapid readjustment of deeply held attitudes concerning the nature of writing, in the wake of the alphabet’s meteoric successes. The fluidity of “language ethics” in early modern Vietnam (a society that had long since developed vernacular writing out of an earlier experience of diglossic literacy) suggests that the durability of a “scriptworld” depends on the nature and history of literacy in the societies under question. Keywords: Vietnamese literature – Vietnamese philology – Chinese philology – script reform – language reform – early modern East Asia/Southeast Asia. * I would like to thank Wynn Wilcox, Nguyễn Tuấn Cường, Tuna Artun, and Jack Chia for their valuable help and input. All errors are my own. © koninklijke brill nv, leiden, 2016 | doi: 10.1163/24056480-00102010. Introduction: In 1919, the French-installed emperor Khải Định permanently dismantled Vietnam’s civil service examinations, and with them, an entire system of political selection based on proficiency in Literary Sinitic.1 That same year, using a controversial latinized alphabet called Quốc Ngữ (lit. -
International Post Adoption Services | Korea Service Descriptions
1605 Eustis Street, Saint Paul, MN 55108. 800-952-9302, 651-646-7771. chlss.org/post-adoption. International Post Adoption Services | Korea Service Descriptions. Important: CH/LSS and our Korean partnership agencies receive heightened service requests over the summer months, and you may experience a longer wait time for your case to be assigned. Thank you for your understanding. A $35 Registration Fee is due at time of service request. Additional forms specific to your Korean agency will be required. Your post adoption worker will provide them to you. Birth Family Search - Korean and U.S. File Review are included (*see age restrictions below). If you are unsure about proceeding with a search, consider starting with a file review. An adopted adult, age 19+, or parents of adopted minors (13+) can initiate a search for birth parents. Prior to starting your search, your post adoption worker will speak with you in detail about your motivations to search, the range of possible outcomes, and access to support systems during the search and outreach journey. Correspondence (through email) will be exchanged at no extra cost up to 1 year from the time of first contact with birth mother or father (translation available through volunteer translators). Correspondence fees (page 2) apply after the 1st year of letter exchange. Fee: $350. Use the “Domestic” Search Service Request when all parties live in the US and the Korean agency is not involved; Domestic Service fees apply. US and/or Korean file review and other special requests (*see age restrictions below): An adopted adult, age 19+, or parents of adopted minors (age 13+) can request that their US and/or Korean agency adoption file be checked for updates. -
Girl Power: Feminine Motifs in Japanese Popular Culture, David Endresak
Eastern Michigan University, DigitalCommons@EMU. Senior Honors Theses, Honors College, 2006. Girl Power: Feminine Motifs in Japanese Popular Culture. David Endresak. Follow this and additional works at: http://commons.emich.edu/honors. Recommended Citation: Endresak, David, "Girl Power: Feminine Motifs in Japanese Popular Culture" (2006). Senior Honors Theses. 322. http://commons.emich.edu/honors/322. This Open Access Senior Honors Thesis is brought to you for free and open access by the Honors College at DigitalCommons@EMU. It has been accepted for inclusion in Senior Honors Theses by an authorized administrator of DigitalCommons@EMU. Degree Type: Open Access Senior Honors Thesis. Department: Women's and Gender Studies. First Advisor: Dr. Gary Evans. Second Advisor: Dr. Kate Mehuron. Third Advisor: Dr. Linda Schott. GIRL POWER: FEMININE MOTIFS IN JAPANESE POPULAR CULTURE. By David Endresak. A Senior Thesis Submitted to the Eastern Michigan University Honors Program in Partial Fulfillment of the Requirements for Graduation with Honors in Women's and Gender Studies. Approved at Ypsilanti, Michigan, on this date [blank]. Dr. Gary Evans, Supervising Instructor. Dr. Kate Mehuron, Honors Advisor. Dr. Linda Schott, Department Head. Dennis Beagan, Department Head. Dr. Heather L. S. Holmes, Honors Director. Table of Contents: Chapter 1: Printed Media -
Identity Under Japanese Occupation
“BECOMING JAPANESE:” IDENTITY UNDER JAPANESE OCCUPATION. GRADES: 9-12. AUTHOR: Katherine Murphy. TOPIC/THEME: Japanese Occupation, World War II, Korean Culture, Identity. TIME REQUIRED: Two 60-minute periods. BACKGROUND: The lesson is based on the impact of the Japanese occupation of Korea during World War II on Korean culture and identity. In particular, the lesson focuses on the Japanese campaign in 1940 to encourage Koreans to abandon their Korean names and adopt Japanese names. This campaign was known as “sōshi-kaimei.” The purpose of this campaign, along with campaigns requiring Koreans to recite an oath to the Japanese Emperor and bow at Shinto shrines, was to make the Korean people “Japanese” and, it was hoped, loyal subjects of the Japanese Empire by abandoning their Korean identity and loyalties. These cultural policies and campaigns were key to the Japanese war effort during World War II. The lesson draws from the students’ lives as well as two books: Lost Names: Scenes from a Korean Boyhood by Richard E. Kim and Under the Black Umbrella: Voices from Colonial Korea 1910-1945 by Hildi Kang. CURRICULUM CONNECTION: The lesson is intended to use the major themes from the summer reading book Lost Names: Scenes from a Korean Boyhood to introduce students to one of the five essential questions of the World History II course: How is identity constructed? How does identity impact human experience? In first investigating the origin of their own names and the meaning of Korean names, students can begin to explore the question “How is identity constructed?” In examining how and why the Japanese sought to change the Korean people’s names, religion, etc. during World War II, students will understand how global events such as World War II can impact an individual. -
A Comparative Analysis of the Simplification of Chinese Characters in Japan and China
CONTRASTING APPROACHES TO CHINESE CHARACTER REFORM: A COMPARATIVE ANALYSIS OF THE SIMPLIFICATION OF CHINESE CHARACTERS IN JAPAN AND CHINA. A THESIS SUBMITTED TO THE GRADUATE DIVISION OF THE UNIVERSITY OF HAWAI‘I AT MĀNOA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF ARTS IN ASIAN STUDIES, AUGUST 2012. By Kei Imafuku. Thesis Committee: Alexander Vovin, Chairperson; Robert Huey; Dina Rudolph Yoshimi. ACKNOWLEDGEMENTS: I would like to express deep gratitude to Alexander Vovin, Robert Huey, and Dina R. Yoshimi for their Japanese and Chinese expertise and kind encouragement throughout the writing of this thesis. Their guidance, as well as the support of the Center for Japanese Studies, School of Pacific and Asian Studies, and the East-West Center, has been invaluable. ABSTRACT: Due to the complexity and number of Chinese characters used in Chinese and Japanese, some characters were the target of simplification reforms. However, Japanese and Chinese simplifications frequently differed, resulting in the existence of multiple forms of the same character being used in different places. This study investigates the differences between the Japanese and Chinese simplifications and the effects of the simplification techniques implemented by each side. The more conservative Japanese simplifications were achieved by instating simpler historical character variants while the more radical Chinese simplifications were achieved primarily through the use of whole cursive script forms and phonetic simplification techniques. These techniques, however, have been criticized for their detrimental effects on character recognition, semantic and phonetic clarity, and consistency, issues less present with the Japanese approach. By comparing the Japanese and Chinese simplification techniques, this study seeks to determine the characteristics of more effective, less controversial Chinese character simplifications.