A Hybrid Machine Translation Framework for an Improved Translation Workflow

Total Page:16

File Type:pdf, Size:1020Kb

A Hybrid Machine Translation Framework for an Improved Translation Workflow A Hybrid Machine Translation Framework for an Improved Translation Workflow Dissertation zur Erlangung des akademischen Grades eines Doktors der Philosophie der Philosophischen Fakultäten der Universität des Saarlandes vorgelegt von Santanu Pal aus Sonamukhi, West Bengal, Indien Saarbrücken, 2018 Der Dekan: Prof. Dr. Roland Marti Erstberichterstatter: Prof. Dr. Josef van Genabith Zweitberichterstatter: Prof. Dr. Dietrich Klakow Tag der letzten Prüfungsleistung: 27.11.2017 “A scientist in his laboratory is not a mere technician: he is also a child confronting natural phenomena that impress him as though they were fairy tales ....” Marie Curie Abstract A Hybrid Machine Translation Framework for an Improved Translation Workflow by Santanu Pal Doctor of Philosophy Computerlinguistik, Sprachwissenschaft und Sprachtechnologie Universität des Saarlandes Over the past few decades, due to a continuing surge in the amount of content being translated and ever increasing pressure to deliver high quality and high throughput trans- lation, translation industries are focusing their interest on adopting advanced technologies such as machine translation (MT), and automatic post-editing (APE) in their translation workflows. Despite the progress of the technology, the roles of humans and machines essentially remain intact as MT/APE are moving from the peripheries of the translation field closer towards collaborative human-machine based MT/APE in modern transla- tion workflows. Professional translators increasingly become post-editors correcting raw MT/APE output instead of translating from scratch which in turn increases productivity in terms of translation speed. The last decade has seen substantial growth in research and development activities on improving MT; usually concentrating on selected aspects of workflows starting from training data pre-processing techniques to core MT processes to post-editing methods. To date, however, complete MT workflows are less investigated than the core MT processes. In the research presented in this thesis, we investigate av- enues towards achieving improved MT workflows. We study how different MT paradigms can be utilized and integrated to best effect. We also investigate how different upstream and downstream component technologies can be hybridized to achieve overall improved MT. Finally we include an investigation into human-machine collaborative MT by taking humans in the loop. In many of (but not all) the experiments presented in this thesis we focus on data scenarios provided by low resource language settings. German Summary (Zusammenfassung) Aufgrund des stetig ansteigenden Übersetzungsvolumens in den letzten Jahrzehnten und gleichzeitig wachsendem Druck hohe Qualität innerhalb von kürzester Zeit liefern zu müssen sind Übersetzungsdienstleister darauf angewiesen, moderne Technologien wie Maschinelle Übersetzung (MT) und automatisches Post-Editing (APE) in den Überset- zungsworkflow einzubinden. Trotz erheblicher Fortschritte dieser Technologien haben sich die Rollen von Mensch und Maschine kaum verändert. MT/APE ist jedoch nunmehr nicht mehr nur eine Randerscheinung, sondern wird im modernen Übersetzungsworkflow zunehmend in Zusammenarbeit von Mensch und Maschine eingesetzt. Fachübersetzer werden immer mehr zu Post-Editoren und korrigieren den MT/APE-Output, statt wie bisher Übersetzungen komplett neu anzufertigen. So kann die Produktivität bezüglich der Übersetzungsgeschwindigkeit gesteigert werden. Im letzten Jahrzehnt hat sich in den Bereichen Forschung und Entwicklung zur Verbesserung von MT sehr viel getan: Ein- bindung des vollständigen Übersetzungsworkflows von der Vorbereitung der Trainings- daten über den eigentlichen MT-Prozess bis hin zu Post-Editing-Methoden. Der voll- ständige Übersetzungsworkflow wird jedoch aus Datenperspektive weit weniger berück- sichtigt als der eigentliche MT-Prozess. In dieser Dissertation werden Wege hin zum idealen oder zumindest verbesserten MT-Workflow untersucht. In den Experimenten wird dabei besondere Aufmertsamfit auf die speziellen Belange von sprachen mit gerin- gen ressourcen gelegt. Es wird untersucht wie unterschiedliche MT-Paradigmen verwendet und optimal integriert werden können. Des Weiteren wird dargestellt wie unterschiedliche vor- und nachgelagerte Technologiekomponenten angepasst werden können, um insgesamt einen besseren MT-Output zu generieren. Abschließend wird gezeigt wie der Mensch in den MT-Workflow intergriert werden kann. Das Ziel dieser Arbeit ist es verschiedene Technologiekomponenten in den MT-Workflow zu integrieren um so einen verbesserten Gesamtworkflow zu schaffen. Hierfür werden hauptsächlich Hybridisierungsansätze ver- wendet. In dieser Arbeit werden außerdem Möglichkeiten untersucht, Menschen effek- tiv als Post-Editoren einzubinden. Die hierbei gewonnenen Übersetzungsprozessdaten German Summary werden automatisch gesammelt und stehen für künftige Forschung zur Verfügung (z.B. zur Unterstützung von inkrementellen Updates einzelner Workflow- und Post-Editing- Komponenten). Das Hauptziel dieser Dissertation ist es, die echten Bedürfnisse und Probleme der Anwender von Übersetzungstechnologie - einschließlich von professionellen übersetzern - zu erfassen. Es soll ein kollaborativer Rahmen für hybride maschinelle Übersetzung geschaffen werden, der den Übersetzungsworkflow verbessert und den Post- Editing-Effort für den Übersetzer reduziert. Auf dieser Grundlage soll die Funktionalität von Übersetzungssystemen in Hinblick auf die Anforderungen der Anwender optimiert werden, statt die Anwender dazu zu zwingen, ihre Arbeitsweise an die Technologie anzu- passen. Des Weiteren untersucht diese Arbeit ob und wie bestehende Technologien wie neuronale MT (NMT), statistische MT (SMT), beispielbasierte MT (Example Based MT, EBMT) und Translation- Memory-Systeme (TM) die Anforderungen der Anwender un- terstützen können. Hierfür wird der in der arbeit verfolgte Ansatz auf zwei Arten beschrieben. Normalerweise werden verschiedene Technologiekomponenten im Übersetzungsworkflow kombiniert; im ersten Teil dieser Arbeit liegt der Schwerpunkt auf den (i) Komponenten, im zweiten Teil auf den (ii) Workflows. Im ersten Teil dieser Arbeit liegt der Schwerpunkt insbesondere auf Design und Implementierung von leistungsstarken und benutzerfreundlichen Tech- nologiekomponenten (beschrieben in Kapitel 3, Kapitel 4 und Kapitel 5). In diesem Teil werden diverse Hybridisierungsansätze angewandt. Im zweiten Teil der Arbeit liegt der Schwerpunkt auf der Identifizierung optimierter Workflows durch die Kombination ver- schiedener Technologiekomponenten. Hier wird eine Plug&Play-Methodologie angewen- det, bei der der Optimierungsgrad in Bezug auf besseren Übersetzungsoutput gemessen wird. Des Weiteren wird in diesem Teil der Arbeit auch der Mensch in den Über- setzungsprozess zur Bewertung der Übersetzungsqualität sowie zur langfristigen schrit- tweisen Verbesserung der Technologiekomponenten durch Feedback eingebunden. Zusät- zlich werden gleichzeitig wertvolle Ressourcen für Übersetzungsprozessforschung geschaf- fen (vgl. Kapitel 6). In Kapitel 6 liegt der Fokus auf Technologien der die Sammlung von Ressourcen für den vorgeschlagenen Rahmen zur schrittweisen Verbesserung von MT/APE-Komponenten für der Workflow sowie für die Übersetzungsprozessforschung viii German Summary unterstützen. Nachdem Ressourcen in beträchtlichem Umfang durch den kollabora- tiven Rahmen für hybride human-maschinelle Übersetzung geschaffen wurden, soll diese Forschungsarbeit künftig durch die Einbeziehung weiterer Komponenten fortgeführt wer- den. APE stellt jedoch auch eine beträchtliche Verbesserung unseres übersetzungsrahmen dar. MT ist per Definition ein computergestützter Prozess, der Text einer menschlichen Sprache in eine andere umwandelt. Dies geschieht entweder voll- oder halbautomatisch (dies ist der Fall bei von Menschen unterstützen MT, bei der der Mensch in den Überset- zungsprozess eingebunden wird). In den letzten Jahren wurden verschiedene Ansätze zur MT untersucht, z.B. regelbasierte, beispielbasierte, wissensbasierte, statistische und neu- ronale Ansätze. Hinsichtlich weitreichender Entwicklungen und Anwendungen war SMT und insbesondere phrasenbasierte SMT (phrase-based SMT, PB-SMT) aus all diesen An- sätzen bis vor kurzem weitgehende dominierend1. Die Qualität von PB-SMT hängt sehr stark von vorgelagerten Prozessen wie Wordalignment und Bewertung von Phrasenpaaren ab. Beides kann durch die Verwendung großer satzalignierter Parallelkorpora erreicht wer- den. Jedoch kann die Verfügbarkeit von Daten eine Herausforderung sein. PB-SMT für Sprachpaare mit knappen Datenressourcen liefert schlechtere Übersetzungsqualität, da nicht genügend Trainingsdaten aus Parallelkorpora verfübar sind. Hieraus resultiert die erste Forschungsfrage (RQ). RQ1: Wie kann MT für Sprachen mit geringen Ressourcen verbessert werden? In Kapitel 3 wird eine mögliche Lösung für das Problem der Datenknappheit dargestellt. In diesem Kapitel wird eine Methodologie zur Extraktion paralleler Textfragmente aus vergleichbaren Korpora beschrieben, die als zusätzliche Trainingsdaten in der SMT für das Sprachenpaar Englisch-Bengali genutzt werden können. Für dieses Sprachenpaar stehen nur geringe Ressourcen zur Verfügung. Zur Extraktion von Paralleltexten aus vergleichbaren Korpora wird im Rahmen dieser Arbeit Textual Entailment Techniken (TE) angewendet. Der wichtigste Teil dieser Forschung, der in diesem Kapitel vorgestellt wird, wurde auch in (Pal et al., 2014b, 2015b) veröffentlicht. 1WMT 2016 war die erste
Recommended publications
  • Improving Pre-Trained Multilingual Model with Vocabulary Expansion
    Improving Pre-Trained Multilingual Models with Vocabulary Expansion Hai Wang1* Dian Yu2 Kai Sun3* Jianshu Chen2 Dong Yu2 1Toyota Technological Institute at Chicago, Chicago, IL, USA 2Tencent AI Lab, Bellevue, WA, USA 3Cornell, Ithaca, NY, USA [email protected], [email protected], fyudian,jianshuchen,[email protected] i.e., out-of-vocabulary (OOV) words (Søgaard and Abstract Johannsen, 2012; Madhyastha et al., 2016). A higher OOV rate (i.e., the percentage of the unseen Recently, pre-trained language models have words in the held-out data) may lead to a more achieved remarkable success in a broad range severe performance drop (Kaljahi et al., 2015). of natural language processing tasks. How- ever, in multilingual setting, it is extremely OOV problems have been addressed in previous resource-consuming to pre-train a deep lan- works under monolingual settings, through replac- guage model over large-scale corpora for each ing OOV words with their semantically similar in- language. Instead of exhaustively pre-training vocabulary words (Madhyastha et al., 2016; Ko- monolingual language models independently, lachina et al., 2017) or using character/word infor- an alternative solution is to pre-train a pow- mation (Kim et al., 2016, 2018; Chen et al., 2018) erful multilingual deep language model over or subword information like byte pair encoding large-scale corpora in hundreds of languages. (BPE) (Sennrich et al., 2016; Stratos, 2017). However, the vocabulary size for each lan- guage in such a model is relatively small, es- Recently, fine-tuning a pre-trained deep lan- pecially for low-resource languages. This lim- guage model, such as Generative Pre-Training itation inevitably hinders the performance of (GPT) (Radford et al., 2018) and Bidirec- these multilingual models on tasks such as se- tional Encoder Representations from Transform- quence labeling, wherein in-depth token-level ers (BERT) (Devlin et al., 2018), has achieved re- or sentence-level understanding is essential.
    [Show full text]
  • Ministry of Education and Youth of the Republic Of
    MINISTRY OF EDUCATION OF THE REPUBLIC OF MOLDOVA MOLDOVA STATE UNIVERSITY FACULTY OF FOREIGN LANGUAGES AND LITERATURES DEPARTMENT OF TRANSLATION, INTERPRETATION AND APPLIED LINGUISTICS MASTER’S DEGREE PAPER A CONTRASTIVE ANALYSIS OF LEGAL TERMINOLOGY IN ENGLISH, ROMANIAN AND POLISH. LEGAL TERMINOLOGY INNOVATIONS IN THE REPUBLIC OF MOLDOVA Submitted by: Buşila Alina, group 262 Research Adviser: Şaganean Gabriela, Doctor of Philology Chişinău 2011 TABLE OF CONTENTS Annotation ……………………………………………………………………………...…3 Adnotare ………………………………………………………………………………..…4 List of abbreviations …………………………………………………………………..….5 INTRODUCTION ……………………………………………………………………………..….6 CHAPTER I. LINGUISTIC APPROACH ON LEGAL TERMINOLOGY …………….…..12 1.1 Introduction in Terminology. Definitions and Controversy upon Terminology 1.2 Term versus Word versus Concept …………………………………………….19 1.3 Semantics of Legal Terminology ………………………………………………20 1.4 Recent Trends in Terminology ………………………………………………...24 1.5 Multilingual Term Creation within the EU Conceptual System ………………26 1.6 Characteristics of Terms ……………………………………………………….28 1.6.1 Different Classifications of Terms …………………………………...33 1.6.2 The Process of Terminologization, Determinologization and Reterminologization ……………………………………………………….46 1.7 Legal Terminology …………………………………………………………….48 1.7.1 English, Romanian and Polish Legal Terminology ………………….51 1.7.2 Incongruity of Legal Terms ………………………………………….53 1.7.3 Translation of Legal Terms ………………………………………….55 1.7.4 Legal Terminology Innovations in the Republic of Moldova ……….59 CHAPTER
    [Show full text]
  • Queering Polyamory: Configurations, Public Policy, and Lived Experiences
    Queering Polyamory: Configurations, Public Policy, and Lived Experiences By Timothy Joseph Gardner A Thesis Submitted in Partial Fulfillment of the Requirements for Master of Science in Ethnic and Multi-cultural Studies College of Behavioral and Social Sciences Minnesota State University, Mankato Mankato, Minnesota May 2005 ii iii This work is licensed under the Creative Commons Attribution-NonCommercial- NoDerivs License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/2.0/ or send a letter to Creative Commons, 559 Nathan Abbott Way, Stanford, California 94305, USA. iv ABSTRACT Gardner J. Timothy 2005 Queering Polyamory: Configurations, Public Policy, and Lived Experiences. M.S. thesis, Ethnic and Multi-cultural Studies, Minnesota State University, Mankato, 100 leaves. “Queering Polyamory: Configurations, Public Policy, and Lived Experiences” explores polyamory, a relationship “lovestyle” that involves more then one loving partner, while taking a close look at the social construction of modern day queer polyamory including marriage and sex law. The author states that queer polyamory is socially constructed due to its inclusion of self-identifying gay men, lesbian, bisexual, transgender, transsexual, gender-variant, omnisexual, pansexual, and queer individuals. This thesis includes a study of participants involved in queer polyamorous relationships. The study concludes that a population that engages in queer polyamorous relationships is diverse in regards to demographics; this is to say state of residence, age, gender, ethnicity, religious/spiritual affiliation, sexual identity and/or orientation, and relationship identity and/or orientation and ways individuals come to be part of queer polyamorous relationships. The study looks at how “out” the participants are and how public policy is affecting the lives of those who engage in queer polyamorous relationships.
    [Show full text]
  • Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models
    Achieving Open Vocabulary Neural Machine Translation with Hybrid Word-Character Models Minh-Thang Luong and Christopher D. Manning Computer Science Department, Stanford University, Stanford, CA 94305 {lmthang,manning}@stanford.edu Abstract Nearly all previous work on neural ma- chine translation (NMT) has used quite restricted vocabularies, perhaps with a subsequent method to patch in unknown words. This paper presents a novel word- character solution to achieving open vo- cabulary NMT. We build hybrid systems that translate mostly at the word level and consult the character components for rare words. Our character-level recur- rent neural networks compute source word representations and recover unknown tar- get words when needed. The twofold advantage of such a hybrid approach is that it is much faster and easier to train than character-based ones; at the same time, it never produces unknown words Figure 1: Hybrid NMT – example of a word- as in the case of word-based models. On character model for translating “a cute cat” into the WMT’15 English to Czech translation “un joli chat”. Hybrid NMT translates at the word task, this hybrid approach offers an ad- level. For rare tokens, the character-level compo- dition boost of +2.1 11.4 BLEU points − nents build source representations and recover tar- over models that already handle unknown get <unk>. “_” marks sequence boundaries. words. Our best system achieves a new state-of-the-art result with 20.7 BLEU score. We demonstrate that our character state-of-the-art translation results for several lan- models can successfully learn to not only guage pairs such as English-French (Luong et al., generate well-formed words for Czech, 2015b), English-German (Jean et al., 2015a; Lu- a highly-inflected language with a very ong et al., 2015a; Luong and Manning, 2015), and complex vocabulary, but also build correct English-Czech (Jean et al., 2015b).
    [Show full text]
  • Masteroppgave Audrey Stark
    Polyamory – A Labor of Love Boundary Work and Legitimization of Non-Normative Intimate Relationships Audrey Stark Master’s Thesis in Sociology Department of Sociology and Human Geography UNIVERSITY OF OSLO June 2015 I II Polyamory – A Labor of Love Boundary Work and Legitimization of Non-Normative Intimate Relationships Audrey Stark III © Audrey Stark 2015 Polyamory – A Labor of Love Boundary Work and Legitimization of Non-Normative Intimate Relationships Audrey Stark http://www.duo.uio.no Print: The University Print Centre, University of Oslo IV Abstract Polyamory is a relatively new concept that is receiving an increasing amount of attention in research, popular culture and in society at large. Although polyamory has been, and continues to be, conceptualized in a number of different ways, there is agreement that it involves intimate relationships that are not limited to two people. As such, polyamory stands in contrast to the mononormative notion, in contemporary Western culture, that such relationships should be restricted to the realm of monogamous coupledom. Those who identify as polyamorous or engage in polyamorous practice contest monogamy in their daily lives through the way they understand love and the way they form affective relationships. This thesis examines the culture of polyamory in a Scandinavian context, where there, thus far, has been limited research on the subject. My exploration is based on the following research questions: 1) What are the most important narratives that emerge in the accounts of those who are affiliated with the culture of polyamory? 2) What elements characterize those narratives and what meanings do these individuals attach to these elements? 3) And finally, how do they use these narratives to create symbolic boundaries in an effort to present polyamory as a legitimate way of doing relationships? To answer these questions, I use a combination of naturally occurring and researcher-provoked data, drawing on online conversation threads, interviews and participant observation.
    [Show full text]
  • Variation and Exceptionality in Modern Hebrew Spirantization (Dissertation) Michal Temkin Martinez Boise State University SOURCES of NON-CONFORMITY in PHONOLOGY
    Boise State University ScholarWorks Mary Ellen Ryder Linguistics Lab Department of English 5-1-2010 Sources of Non-conformity in Phonology: Variation and Exceptionality in Modern Hebrew Spirantization (Dissertation) Michal Temkin Martinez Boise State University SOURCES OF NON-CONFORMITY IN PHONOLOGY: VARIATION AND EXCEPTIONALITY IN MODERN HEBREW SPIRANTIZATION by Michal Temkin Martínez A Dissertation Presented to the FACULTY OF THE USC GRADUATE SCHOOL UNIVERSITY OF SOUTHERN CALIFORNIA In Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY (LINGUISTICS) May 2010 Copyright 2010 Michal Temkin Martínez Dedication To my grandparents for instilling in me a love for language ii Acknowledgements Although I realize that writing a dissertation and earning a Ph.D. are not as monumental as, say, winning a Nobel Prize, this is my biggest accomplishment in life thus far, and there are many people without whom it would have been impossible. Therefore, I feel it is only right to take the time now to thank them for their support. My advisor and dissertation chair, Rachel Walker, has been an amazing mentor and role model. Rachel embodies the well-balanced academic, her teaching and advising styles are enviable, and as I now realize more than ever before, her seemingly effortless manner stems from constant dedication and preparation. I am extremely grateful for her support in all aspects of my education, including the many service projects I found myself involved in. I could not have asked for a more understanding and compassionate advisor. My committee members – Louis Goldstein, Elsi Kaiser, Anna Lubowicz, and Mario Saltarelli – have all contributed to my work in many ways.
    [Show full text]
  • Glossary of Words COVID-19 DICTIONARY for DUMMIES Terminology to Understand the Pandemic by Luis B
    © 2021 PostScripts Publications Glossary of Words COVID-19 DICTIONARY FOR DUMMIES Terminology to Understand the Pandemic by Luis B. Vega Glossary of Words COVID-19 DICTIONARY For Dummies Terminology to Understand the Pandemic INCLUDING BIBLICAL TERMS AND CONCEPTS HACKING INTO THE GENOME OF ADAM _______________________________________ LUIS BERNARDINO VEGA 2 Dedicated to Those Choosing to receive the COVID-19 Shots. 3 Table of Contents Prologue………………………………….................... 5 Glossary of Words……………………….………..….. 8 Home Treatments……………..……...………………. 53 Scientific Diagrams…………………...………………. 58 ABC’s of Salvation……………………………………. 82 References…………………………………………….. 88 4 Prologue ‘By controlling the language, Big Brother controls the way that the People think. With a Limited Vocabulary, the People are limited in how much they can think, as well as, what they think about.’ - Orwell's 1984 The purpose of this booklet is to provide the Reader with a comprehensive array of the major Scientific, Medical, Political and some Theological terms concerning the COVID-19 pandemic. Why? The objective is to have a better sense of what the words, terms, and concepts mean. It is to be better educated in understanding such words when coming across either Scientists, Doctors, Virologists, Politicians and even Theologians who use them. Why? To make a more informed decision as to whether to take the new mRNA shots for example. Realize that the current narrative of what is going on with the COVID-19 pandemic and the new mRNA shots being administered needs to have a balanced approach in terms of a healthy, open and democratic debate. This is not occurring. On one hand, there is the ‘Official State Narrative’ and of the Facts and Figures posted on such websites of the UN’s World Health Organization WHO and the U.S.
    [Show full text]
  • For Download Click Here
    ISSN-L 0537-1988 54 THE INDIAN JOURNAL OF ENGLISH STUDIES An Annual Refereed Journal VOL. LIV 2017 Editor-in-Chief Dr. Binod Mishra Associate Professor of English Department of Humanities and Social Sciences IIT Roorkee, Uttarakhand. The responsibility for facts stated, opinions expressed or conclusions reached and plagiarism, if any in this journal, is entirely that of the author. The editor/publisher bears no responsibility for them whatsoever. THE OFFICIAL PUBLICATION OF ASSOCIATION FOR ENGLISH STUDIES OF INDIA 54 2017 INDIAN JOURNAL OF ENGLISH STUDIES Editor-in-Chief: Dr. Binod Mishra, Department of HSS, IIT Roorkee, Uttarakhand The Indian Journal of English Studies (IJES), published since 1940, accepts scholarly papers presented by the members at the annual conferences of the Association for English Studies of India (AESI). Orders for the copy of the journal for home, college, departmental/university library may be sent to the Editor-in-Chief, Dr. Binod Mishra, by sending an e-mail on mishra. [email protected]. Teachers and research scholars are requested to place orders on behalf of their institutions for one or more copies. Orders can be sent to the Dr. Binod Mishra, Editor-in-Chief, IJES, C-201 Shivalik Apartments, IIT Roorkee, District- Haridwar, Uttarakhand-247667, India. ASSOCIATION FOR ENGLISH STUDIES OF INDIA Price: 350 (for individuals) 600 (for institutions) £ 10 (for overseas) CONTENTS Editorial vii Binod Mishra English Literature of the Twentieth Century : Trends, Challenges and Achievements A. A. Mutalik Desai 1 Creativity and Criticism Rajnath 16 Polar Intensity of the Creative Process Charu Sheel Singh 21 Dystopia, Waste and the Decadent Sublime in Contemporary Culture Pramod K Nayar 32 The Poetic Talent of T.Vasudeva Reddy : A Critique Pashupati Jha 43 Bhisham Sahni’s Hanush : Questioning Power Structures Mukesh Ranjan Verma 51 From Common to Classic : A Proposition for Popular Literature R.P.
    [Show full text]
  • Machine Translation Summit XVI
    Machine Translation Summit XVI http://www.mtsummit2017.org Proceedings of MT Summit XVI Vol.1 Research Track Sadao Kurohashi, Pascale Fung, Editors MT Summit XVI September 18 – 22, 2017 -- Nagoya, Aichi, Japan Proceedings of MT Summit XVI, Vol. 1: MT Research Track Sadao Kurohashi (Kyoto University) & Pascale Fung (Hong Kong University of Science and Technology), Eds. Co-hosted by International Asia-Pacific Association for Association for Graduate School of Machine Translation Machine Translation Informatics, Nagoya University http://www.eamt.org/iamt.php http://www.aamt.info http://www.is.nagoya-u.ac.jp/index_en.html ©2017 The Authors. These articles are licensed under a Creative Commons 3.0 license, no derivative works, attribution, CC-BY-ND. Research Program Co-chairs Sadao Kurohashi Kyoto University Pascale Fung Hong Kong University of Science and Technology Research Program Committee Yuki Arase Osaka University Laurent Besacier LIG Houda Bouamor Carnegie Mellon University Hailong Cao Harbin Institute of Technology Michael Carl Copenhagen Business School Daniel Cer Google Boxing Chen NRC Colin Cherry NRC Chenhui Chu Osaka University Marta R. Costa-jussa Universitat Politècnica de Catalunya Steve DeNeefe SDL Language Weaver Markus Dreyer Amazon Kevin Duh Johns Hopkins University Andreas Eisele DGT, European Commission Christian Federmann Microsoft Research Minwei Feng IBM Watson Group Mikel L. Forcada Universitat d'Alacant George Foster National Research Council Isao Goto NHK Spence Green Lilt Inc Eva Hasler SDL Yifan He Bosch Research
    [Show full text]
  • Trust, Jealousy and Communication in Long-Term Couples Practicing Consensual Non-Monogamy
    Smith ScholarWorks Theses, Dissertations, and Projects 2016 Trust, jealousy and communication in long-term couples practicing consensual non-monogamy Christina Wang Smith College Follow this and additional works at: https://scholarworks.smith.edu/theses Part of the Social Work Commons Recommended Citation Wang, Christina, "Trust, jealousy and communication in long-term couples practicing consensual non- monogamy" (2016). Masters Thesis, Smith College, Northampton, MA. https://scholarworks.smith.edu/theses/1727 This Masters Thesis has been accepted for inclusion in Theses, Dissertations, and Projects by an authorized administrator of Smith ScholarWorks. For more information, please contact [email protected]. Christina Wang Trust, Jealousy, and Communication in Long-Term Couples Practicing Consensual Non-Monogamy ABSTRACT This cross-sectional exploratory quantitative study examined trust and communication patterns among adults practicing consensual non-monogamy (CNM). Previous relationship research in this area has primarily examined couples in demographically homogenous relationships (heterosexual, white, and married). A review of the literature suggested that adults engaging in consensual non-monogamy are more diverse. Our recruitment strategies and study design was successful in recruiting a more diverse group of participants. Twenty-eight adults in long-term CNM relationships completed a 56-item online questionnaire. Responses were examined to explore the relationship between communication patterns and relationship tenure. The relationship tenure of couples and the gender preference of participants were allowed to vary in an effort to explore the possibility that relationship tenure is associated with positive self- reports on trust and communication dimensions. The findings were more thematically relevant than statistically significant; however emotional jealousy was found to have decreased as length of relationship increased.
    [Show full text]
  • Alternatywne Metody Rozwiązywania Sporów W Przekładzi E Chińsko
    Volume 34/2018 Comparative Legilinguistics International Journal for Legal Communication Faculty of Modern Languages and Literatures Adam Mickiewicz University Poznań, Poland FACULTY OF MODERN LANGUAGES AND LITERATURES EDITORIAL BOARD Editor-in-chief: Aleksandra Matulewska Co-editor: Joanna Nowak-Michalska Editorial Secretaries and Assistants: Karolina Gortych-Michalak, Paulina Kozanecka, Joanna Nowak-Michalska, Paula Trzaskawka, Emilia Wojtasik-Dziekan, Daria Zozula External Members of the Editorial Board Łucja Biel (Poland), Christos Bintoudis (Italy), Susan Blackwell (UK), Onorina Botezat, (Romania), Olga Burukina (Russia), Sheng-Jie Chen (Taiwan), Laura Ervo (Sweden), Ewa Kościałkowska-Okońska (Poland), Larisa Krjukova (Russia), Artur Kubacki (Poland), Lavinia Nadrag (Romania), Natalja Netšunajeva (Estonia), Kiriakos Papoulidis (Greece/Poland), Fernando Prieto Ramos (Switzerland), Judith Rosenhouse (Israel), Juliette Scott (UK), Halina Sierocka (Poland), Ning Ye (China), Cheng Le (China), Anne Wagner (France) Section and technical editors: Karolina Gortych-Michalak, Paulina Kozanecka, Paulina Nowak-Korcz, Joanna Nowak-Michalska, Paula Trzaskawka, Emilia Wojtasik-Dziekan, Daria Zozula Linguistic editors: Jamila Oueslati for Arabic and French, Ya Nan Zhang for Chinese, Colin Phillips for English, Karsten Dahlmanns for German, Swietłana Gaś for Russian, Eva María Rufo Sánchez-Román for Spanish Editorial Office Wydział Neofilologii Zakład Legilingwistyki i Języków Specjalistycznych al. Niepodległości 4, pok. 218B 61-874 Poznań, Poland [email protected] Czasopismo znajduje się na liście ministerialnej czasopism punktowanych z 2016 roku z liczbą 7 punktów. Od roku 2018 znajduje się na liście ERIH PLUS The journal has been indexed on ERIH PLUS since 2018 The electronic version serves referential purposes. Wersja elektroniczna jest wersją referencyjną czasopisma Copyright by Faculty of Modern Languages and Literatures Printed in Poland ISSN 2080-5926 e-ISSN 2391-4491 (http://pressto.amu.edu.pl/index.php/cl/issue/archive) Nakład 60 Egz.
    [Show full text]
  • Hybrids with Final Neoclassical Combining Forms in Baltic
    studies about language / kalbų studijos no. 37 / 2020 92 SAL 37/2020 Hybrids with Final Hybrids with Final Neoclassical Combining Forms in Neoclassical Combining Baltic Languages Forms in Baltic Languages Received 04/2020 Accepted 10/2020 Baltų kalbų hibridai su baigiamaisiais neoklasikiniais dėmenimis LINGUISTICS / KALBOTYRA Jurgita Mikelionienė Kaunas University of Technology, Lithuania http://dx.doi.org/10.5755/j01.sal.1.37.24325 The article presents some ideas about the hybrids that are made by combining the neoclassical elements Abstract or are mixed of Lithuanian or Latvian and foreign origin combining forms. The examples of hybrids found in English are also provided. This corpus-based investigation reveals that the formation of hybrids with final neoclassical combining forms in modern Baltic languages is a sufficiently active, productive and unobtrusive process of creativity and wit. The identified structural types of hybrids are the following: 1) the hybrids with the international elements of different classical or neoclassical origin; 2) the hybrids with the initial element of non-classical origin. They are almost identical in the analysed languages, suggesting that they are often bor- rowed and translated rather than created, but if they are created, then this is done by obvious analogy. Some Lithuanian and Latvian hybrids have long been used and included in various dictionaries, and their frequency in the linguistic corpus is quite high. However, most examples of hybrid origin do not exist and probably do not appear in dictionaries due to their rare use, uniqueness, distinctive form or irregularity. The knowledge of hybrid formation and usage can be relevant to the learners of Lithuanian, Latvian, and English, as well as translators, terminologists, and lexicographers.
    [Show full text]