Workshop Proceeding
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
The Impact of Crowdsourcing Post-Editing with the Collaborative Translation Framework
The Impact of Crowdsourcing Post-editing with the Collaborative Translation Framework Takako Aikawa1, Kentaro Yamamoto2, and Hitoshi Isahara2 1 Microsoft Research, Machine Translation Team [email protected] 2 Toyohashi University of Technology [email protected], [email protected] Abstract. This paper presents a preliminary report on the impact of crowdsourcing post-editing through the so-called “Collaborative Translation Framework” (CTF) developed by the Machine Translation team at Microsoft Research. We first provide a high-level overview of CTF and explain the basic functionalities available from CTF. Next, we provide the motivation and design of our crowdsourcing post-editing project using CTF. Last, we present the re- sults from the project and our observations. Crowdsourcing translation is an in- creasingly popular-trend in the MT community, and we hope that our paper can shed new light on the research into crowdsourcing translation. Keywords: Crowdsourcing post-editing, Collaborative Translation Framework. 1 Introduction The output of machine translation (MT) can be used either as-is (i.e., raw-MT) or for post-editing (i.e., MT for post-editing). Although the advancement of MT technology is making raw-MT use more pervasive, reservations about raw-MT still persist; espe- cially among users who need to worry about the accuracy of the translated contents (e.g., government organizations, education institutes, NPO/NGO, enterprises, etc.). Professional human translation from scratch, however, is just too expensive. To re- duce the cost of translation while achieving high translation quality, many places use MT for post-editing; that is, use MT output as an initial draft of translation and let human translators post-edit it. -
How to Use Google Translate
HOW TO USE GOOGLE TRANSLATE For some ASVAB CEP participants (or their parents), English is a second language. Google Translate is an easy way to instantly translate any webpage using these steps. Google Chrome Internet Explorer 1. Open Google Chrome. Google Translate is available on Internet Explorer version 6 and 2. Go to asvabprogram.com. later. To activate it: 3. Right click anywhere on the webpage. 1. Open Internet Explorer. 4. Select Translate from the menu. 2. Go to Google Toolbar’s website (toolbar.google.com), 5. Select Options. and click the “Download Google Toolbar” button. 6. On the Translate Language dropdown, 3. Click on “Accept and Install” and the toolbar will be select the desired language. automatically installed on your Internet Explorer. 4. Click Run or Open in the window that appears. 5. Enable the toolbar. 6. Go to asvabprogram.com. 7. Select More >> 8. Select Translate. 9. Then, the translate button will appear at the top of your webpage. 10. Right click to select the language option. 7. You will see the Google Translate icon in the browser bar, which you can use to manage your translation settings. iphone Android Microsoft Translator is a universal app for 1. On your Android phone or iPhone and iPad, and can be downloaded tablet, open the Chrome app. from the App Store for free. Once you’ve 2. Go to a webpage. got it downloaded, you can set up the action extension for translation web pages. 3. To change the language, tap 4. Tap Translate… To activate the Microsoft Translator extension in Safari: 5. -
Context in Neural Machine Translation: a Review of Models and Evaluations
Context in Neural Machine Translation: A Review of Models and Evaluations Andrei Popescu-Belis University of Applied Sciences of Western Switzerland (HES–SO) School of Management and Engineering Vaud (HEIG–VD) Route de Cheseaux 1, CP 521 1401 Yverdon-les-Bains, Switzerland [email protected] January 29, 2019 Abstract features when translating entire texts, especially when this is vital for correct translation. Taking textual con- This review paper discusses how context has been text into account1 means modeling long-range depen- used in neural machine translation (NMT) in the dencies between words, phrases, or sentences, which past two years (2017–2018). Starting with a brief are typically studied by linguistics under the topics of retrospect on the rapid evolution of NMT models, discourse and pragmatics. When it comes to transla- the paper then reviews studies that evaluate NMT tion, the capacity to model context may improve certain output from various perspectives, with emphasis on those analyzing limitations of the translation of translation decisions, e.g. by favoring a better lexical contextual phenomena. In a subsequent version, choice thanks to document-level topical information, the paper will then present the main methods that or by constraining pronoun choice thanks to knowledge were proposed to leverage context for improving about antecedents. translation quality, and distinguishes methods that This review paper puts into perspective the signif- aim to improve the translation of specific phenom- icant amount of studies devoted in 2017 and 2018 to ena from those that consider a wider unstructured improve the use of context in NMT and measure these context. -
Final Study Report on CEF Automated Translation Value Proposition in the Context of the European LT Market/Ecosystem
Final study report on CEF Automated Translation value proposition in the context of the European LT market/ecosystem FINAL REPORT A study prepared for the European Commission DG Communications Networks, Content & Technology by: Digital Single Market CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report This study was carried out for the European Commission by Luc MEERTENS 2 Khalid CHOUKRI Stefania AGUZZI Andrejs VASILJEVS Internal identification Contract number: 2017/S 108-216374 SMART number: 2016/0103 DISCLAIMER By the European Commission, Directorate-General of Communications Networks, Content & Technology. The information and views set out in this publication are those of the author(s) and do not necessarily reflect the official opinion of the Commission. The Commission does not guarantee the accuracy of the data included in this study. Neither the Commission nor any person acting on the Commission’s behalf may be held responsible for the use which may be made of the information contained therein. ISBN 978-92-76-00783-8 doi: 10.2759/142151 © European Union, 2019. All rights reserved. Certain parts are licensed under conditions to the EU. Reproduction is authorised provided the source is acknowledged. 2 CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report CONTENTS Table of figures ................................................................................................................................................ 7 List of tables .................................................................................................................................................. -
Metia Cloud OS Ss
U.S. Army Europe saves more than $150,000 by automating database translation Customer: U.S. Army Europe Website: www.eur.army.mil “By using the Microsoft Translator API to automate SQL Customer Size: 29,000 soldiers Server data translation into English, we are able to Country or Region: Germany Industry: Military/public sector present senior leaders with universally usable data that Customer Profile supports better informed decisions.” U.S. Army Europe trains and leads Army Mark Hutcheson forces in 51 countries to support U.S. IT Specialist, U.S. Army Europe European Command and Headquarters, Department of the Army. Before migrating to Microsoft Dynamics CRM, U.S. Army Europe Benefits needed to translate portions of a SQL Server database used for ◼ Enhanced force protection ◼ Saved $150,500 in manual translation screening and hiring local nationals. Using the Microsoft costs ◼ Improved usability of data Translator API, Microsoft Visual C#, and the common language runtime (CLR) environment, engineers automated the translation Software and Services ◼ Microsoft Server Product Portfolio of select SQL Server data into English. As a result, the Army saved − Microsoft SQL Server 2012 about $150,500 (about 1,750 hours) in manual translation costs, ◼ Microsoft Dynamics CRM ◼ Microsoft Visual Studio avoided a seven-month delay, and maintained access to all of its − Microsoft Visual C# historical employment screening data. ◼ Technologies − Microsoft Translator API information was typically submitted in a − Transact SQL Business Needs U.S. Army Europe trains, equips, deploys, language other than English. and provides command and control of troops to enhance transatlantic security. To All of the application data was stored in a support that mission, it employs many local SQL Server database to be used for nationals for civilian jobs such as land- screening and hiring employees and scaping, food services, and maintenance. -
Empowering People with Disabilities Through AI
Empowering people with disabilities through AI Microsoft WBCSD Future of Work case study February 2020 Table of Contents Summary ............................................................................................................................................................... 2 Company background ............................................................................................................................................ 2 Future of Work challenge ...................................................................................................................................... 3 Business case ......................................................................................................................................................... 3 Microsoft’s solution ............................................................................................................................................... 3 Seeing AI............................................................................................................................................................... 4 Helpicto ................................................................................................................................................................ 4 Microsoft Translator ............................................................................................................................................ 5 Results .................................................................................................................................................................. -
TRANSLATORS WITHOUT BORDERS a Community Translating to Save Lives
The Voice of Interpreters and Translators THE ATA Nov/Dec 2015 Volume XLIV Number 9 CHRONICLE TRANSLATORS WITHOUT BORDERS A Community Translating To Save Lives PEMT Yourself! Don't Leave Money You're Owed on the Table! Beyond Post-Editing: Advances in Interactive Translation Environments Switching from a Laptop to a Tablet: An Interpreter’s Experience A Publication of the American Translators Association CAREERS at the NATIONAL SECURITY AGENCY inspiredTHINKING When in the office, NSA language analysts develop new perspectives NSA has a critical need for individuals with the on the dialect and nuance of foreign language, on the context and following language capabilities: cultural overtones of language translation. • Arabic • Chinese We draw our inspiration from our work, our colleagues and our lives. • Farsi During downtime we create music and paintings. We run marathons • Korean and climb mountains, read academic journals and top 10 fiction. • Russian • Spanish Each of us expands our horizons in our own unique way and makes • And other less commonly taught languages connections between things never connected before. APPLY TODAY At the National Security Agency, we are inspired to create, inspired to invent, inspired to protect. U.S. citizenship is required for all applicants. NSA is an Equal Opportunity Employer and abides by applicable employment laws and regulations. All applicants for employment are considered without regard to age, color, disability, genetic information, national origin, race, religion, sex, sexual orientation, marital status, or status as a parent. Search NSA to Download WHERE INTELLIGENCE GOES TO WORK® 14CNS-10_8.5x11(live_8x10.5).indd 1 9/16/15 10:44 AM Nov/Dec 2015 Volume XLIV CONTENTS Number 9 FEATURES 19 Beyond Post-Editing: Advances in Interactive 9 Translation Environments Translators without Borders: Post-editing was never meant A Community Translating to be the future of machine to Save Lives translation. -
From the Myth of Babel to Google Translate: Confronting Malicious Use of Artificial Intelligence— Copyright and Algorithmic Biases in Online Translation Systems
Fordham Law School FLASH: The Fordham Law Archive of Scholarship and History Faculty Scholarship 2019 From the Myth of Babel to Google Translate: Confronting Malicious Use of Artificial Intelligence— Copyright and Algorithmic Biases in Online Translation Systems Shlomit Yanisky-Ravid Fordham University School of Law, [email protected] Cynthia Martens Deborah A. Nilson & Associates, PLLC Follow this and additional works at: https://ir.lawnet.fordham.edu/faculty_scholarship Recommended Citation Shlomit Yanisky-Ravid and Cynthia Martens, From the Myth of Babel to Google Translate: Confronting Malicious Use of Artificial Intelligence— Copyright and Algorithmic Biases in Online Translation Systems, 43 Seattle U. L. Rev. 99 (2019) Available at: https://ir.lawnet.fordham.edu/faculty_scholarship/1089 This Article is brought to you for free and open access by FLASH: The Fordham Law Archive of Scholarship and History. It has been accepted for inclusion in Faculty Scholarship by an authorized administrator of FLASH: The Fordham Law Archive of Scholarship and History. For more information, please contact [email protected]. From the Myth of Babel to Google Translate: Confronting Malicious Use of Artificial Intelligence— Copyright and Algorithmic Biases in Online Translation Systems Professor Shlomit Yanisky-Ravid and Cynthia Martens* Many of us rely on Google Translate and other Artificial Intelligence and Machine Learning (AI) online translation daily for personal or commercial use. These AI systems have become ubiquitous and are poised to revolutionize human communication across the globe. Promising increased fluency across cultures by breaking down linguistic barriers and promoting cross-cultural relationships in a way that many civilizations have historically sought and struggled to achieve, AI translation affords users the means to turn any text—from phrases to books—into cognizable expression. -
A Comparison of Free Online Machine Language Translators Mahesh Vanjani1,* and Milam Aiken2 1Jesse H
Journal of Management Science and Business Intelligence, 2020, 5–1 July. 2020, pages 26-31 doi: 10.5281/zenodo.3961085 http://www.ibii-us.org/Journals/JMSBI/ ISBN 2472-9264 (Online), 2472-9256 (Print) A Comparison of Free Online Machine Language Translators Mahesh Vanjani1,* and Milam Aiken2 1Jesse H. Jones School of Business, Texas Southern University, 3100 Cleburne Street, Houston, TX 77004 2School of Business Administration, University of Mississippi, University, MS 38677 *Email: [email protected] Received on 5/15/2020; revised on 7/25/2020; published on 7/26/2020 Abstract Automatic Language Translators also referred to as machine translation software automate the process of language translation without the intervention of humans While several automated language translators are available online at no cost there are large variations in their capabilities. This article reviews prior tests of some of these systems, and, provides a new and current comprehensive evaluation of the following eight: Google Translate, Bing Translator, Systran, PROMT, Babylon, WorldLingo, Yandex, and Reverso. This re- search could be helpful for users attempting to explore and decide which automated language translator best suits their needs. Keywords: Automated Language Translator, Multilingual, Communication, Machine Translation published results of prior tests and experiments we present a new and cur- 1 Introduction rent comprehensive evaluation of the following eight electronic automatic language translators: Google Translate, Bing Translator, Systran, Automatic Language Translators also referred to as machine translation PROMT, Babylon, WorldLingo, Yandex, and Reverso. This research software automate the process of language translation without the inter- could be helpful for users attempting to explore and decide which auto- vention of humans. -
Machine Translation in the Field of Law: a Study of the Translation of Italian Legal Texts Into German
Comparative Legilinguistics vol. 37/2019 DOI: http://dx.doi.org/10.14746/cl.2019.37.4 MACHINE TRANSLATION IN THE FIELD OF LAW: A STUDY OF THE TRANSLATION OF ITALIAN LEGAL TEXTS INTO GERMAN EVA WIESMANN, Prof., PhD Department of Interpreting and Translation, University of Bologna Corso della Repubblica 136, 47121 Forlì, Italy [email protected] ORCID: https://orcid.org/0000-0001-9414-8038 Abstract: With the advent of the neural paradigm, machine translation has made another leap in quality. As a result, its use by trainee translators has increased considerably, which cannot be disregarded in translation pedagogy. However, since legal texts have features that pose major challenges to machine translation, the question arises as to what extent machine translation is now capable of translating legal texts or at least certain types of legal text into another legal language well enough so that the post- editing effort is limited, and, consequently, whether a targeted use in translation pedagogy can be considered. In order to answer this question, DeepL Translator, a machine translation system, and MateCat, a CAT system that integrates machine translation, were tested. The test, undertaken at different times and without specific translation memories, provided Eva Wiesmann: Machine Translation in… for the translation of several legal texts of different types utilising both systems, and was followed by systematisation of errors and evaluation of translation results. The evaluation was carried out according to the following criteria: 1) comprehensibility and meaningfulness of the target text; and 2) correspondence between source and target text in consideration of the specific translation situation. -
Automated Translation in Typo3
AUTOMATED TRANSLATION IN TYPO3 Whitepaper on Translating contents automatically using DeepL and Google Translate in TYPO3 backend by Georgy GE What is TYPO3? TYPO3 CMS is an Open Source Enterprise Content Management System with a large global community. TYPO3 is written in PHP scripting language and was initially authored by Dane Kasper Skårhøj in 1997. The standout features of TYPO3 is its diversity and modularity. It can keep running on multiple web servers like Apache, Nginx or IIS and over numerous operating systems such as Linux, Microsoft Windows, FreeBSD, Mac OS X and OS/2. TYPO3 provides the basis for modern content management that can be adapted by small business websites to large multi-lingual global enterprises portals. TYPO3 always keeps a check on the updated requirements of businesses and public institutions. TYPO3 is opted by small and medium enterprises and municipalities because of its license- cost-free open source approach. In addition to the basic set of interfaces, functions and modules, TYPO3's functionality spectrum can be implemented using extensions. TYPO3 allows users or website operators to upgrade the corresponding websites using a solid extensions framework, and helps to publish and deliver any form of content to multiple devices. During the course of development of a TYPO3 website, a developer can rely on a strong community and about 6,000 extensions that offer unlimited possibilities. Ph : +41 43 558 4360 E-mail: [email protected] Visit: www.pitsolutions.com 2 Why TYPO3? 1. No license cost TYPO3 is an open source software under the GNU General Public License with no license fee. -
Statistical Machine Translation from English to Tuvan*
Statistical Machine Translation from English to Tuvan* Rachel Killackey, Swarthmore College rkillac [email protected] Linguistics Senior Thesis 2013 Abstract This thesis aims to describe and analyze findings of the Tuvan Machine Translation Project, which attempts to create a functional statistical machine translation (SMT) model between English and Tuvan, a minority language spoken in southern Siberia. Though most Tuvan speakers are also fluent in Russian, easily accessible SMT technology would allow for simpler English translation without the use of Russian as an intermediary language. The English to Tuvan half of the system that I examine makes consistent morphological errors, particularly involving the absence of the accusative suffix with the basic form -ni. Along with a typological analysis of these errors, I show that the introduction of novel data that corrects for the missing accusative suffix can improve the performance of an SMT system. This result leads me to conclude that SMT can be a useful avenue for efficient translation. However, I also argue that SMT may benefit from the incorporation of some linguistic knowledge such as morphological rules in the early steps of creating a system. 1. Introduction This thesis explores the field of machine translation (MT), the use of computers in rendering one natural language into another, with a specific focus on MT between English and Tuvan, a Turkic language spoken in south central Siberia. While MT is a growing force in the translation of major languages with millions of speakers such as French, Spanish, and Russian, minority and non-dominant languages with relatively few numbers of speakers have been largely ignored.