A Comparison of Free Online Machine Language Translators Mahesh Vanjani1,* and Milam Aiken2 1Jesse H
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Context in Neural Machine Translation: a Review of Models and Evaluations
Context in Neural Machine Translation: A Review of Models and Evaluations Andrei Popescu-Belis University of Applied Sciences of Western Switzerland (HES–SO) School of Management and Engineering Vaud (HEIG–VD) Route de Cheseaux 1, CP 521 1401 Yverdon-les-Bains, Switzerland [email protected] January 29, 2019 Abstract features when translating entire texts, especially when this is vital for correct translation. Taking textual con- This review paper discusses how context has been text into account1 means modeling long-range depen- used in neural machine translation (NMT) in the dencies between words, phrases, or sentences, which past two years (2017–2018). Starting with a brief are typically studied by linguistics under the topics of retrospect on the rapid evolution of NMT models, discourse and pragmatics. When it comes to transla- the paper then reviews studies that evaluate NMT tion, the capacity to model context may improve certain output from various perspectives, with emphasis on those analyzing limitations of the translation of translation decisions, e.g. by favoring a better lexical contextual phenomena. In a subsequent version, choice thanks to document-level topical information, the paper will then present the main methods that or by constraining pronoun choice thanks to knowledge were proposed to leverage context for improving about antecedents. translation quality, and distinguishes methods that This review paper puts into perspective the signif- aim to improve the translation of specific phenom- icant amount of studies devoted in 2017 and 2018 to ena from those that consider a wider unstructured improve the use of context in NMT and measure these context. -
Final Study Report on CEF Automated Translation Value Proposition in the Context of the European LT Market/Ecosystem
Final study report on CEF Automated Translation value proposition in the context of the European LT market/ecosystem FINAL REPORT A study prepared for the European Commission DG Communications Networks, Content & Technology by: Digital Single Market CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report This study was carried out for the European Commission by Luc MEERTENS 2 Khalid CHOUKRI Stefania AGUZZI Andrejs VASILJEVS Internal identification Contract number: 2017/S 108-216374 SMART number: 2016/0103 DISCLAIMER By the European Commission, Directorate-General of Communications Networks, Content & Technology. The information and views set out in this publication are those of the author(s) and do not necessarily reflect the official opinion of the Commission. The Commission does not guarantee the accuracy of the data included in this study. Neither the Commission nor any person acting on the Commission’s behalf may be held responsible for the use which may be made of the information contained therein. ISBN 978-92-76-00783-8 doi: 10.2759/142151 © European Union, 2019. All rights reserved. Certain parts are licensed under conditions to the EU. Reproduction is authorised provided the source is acknowledged. 2 CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report CONTENTS Table of figures ................................................................................................................................................ 7 List of tables .................................................................................................................................................. -
Translation and the Internet: Evaluating the Quality of Free Online Machine Translators
Quaderns. Rev. trad. 17, 2010 197-209 Translation and the Internet: Evaluating the Quality of Free Online Machine Translators Stephen Hampshire Universitat Autònoma de Barcelona Facultat de Traducció i d’Interpretació 08193 Bellaterra (Barcelona). Spain [email protected] Carmen Porta Salvia Universitat de Barcelona Facultat de Belles Arts [email protected] Abstract The late 1990s saw the advent of free online machine translators such as Babelfish, Google Translate and Transtext. Professional opinion regarding the quality of the translations provided by them, oscillates wildly from the «laughably bad» (Ali, 2007) to «a tremendous success» (Yang and Lange, 1998). While the literature on commercial machine translators is vast, there are only a handful of studies, mostly in blog format, that evaluate and rank free online machine translators. This paper offers a review of the most significant contributions in that field with an emphasis on two key issues: (i) the need for a ranking system; (ii) the results of a ranking system devised by the authors of this paper. Our small-scale evaluation of the performance of ten free machine trans- lators (FMTs) in «league table» format shows what a user can expect from an individual FMT in terms of translation quality. Our rankings are a first tentative step towards allowing the user to make an informed choice as to the most appropriate FMT for his/her source text and thus pro- duce higher FMT target text quality. Key words: free online machine translators, evaluation, internet, ranking system. Resum Durant la darrera dècada del segle xx s’introdueixen els traductors online gratuïts (TOG), com poden ser Babelfish, Google Translate o Transtext. -
Machine Translation in the Field of Law: a Study of the Translation of Italian Legal Texts Into German
Comparative Legilinguistics vol. 37/2019 DOI: http://dx.doi.org/10.14746/cl.2019.37.4 MACHINE TRANSLATION IN THE FIELD OF LAW: A STUDY OF THE TRANSLATION OF ITALIAN LEGAL TEXTS INTO GERMAN EVA WIESMANN, Prof., PhD Department of Interpreting and Translation, University of Bologna Corso della Repubblica 136, 47121 Forlì, Italy [email protected] ORCID: https://orcid.org/0000-0001-9414-8038 Abstract: With the advent of the neural paradigm, machine translation has made another leap in quality. As a result, its use by trainee translators has increased considerably, which cannot be disregarded in translation pedagogy. However, since legal texts have features that pose major challenges to machine translation, the question arises as to what extent machine translation is now capable of translating legal texts or at least certain types of legal text into another legal language well enough so that the post- editing effort is limited, and, consequently, whether a targeted use in translation pedagogy can be considered. In order to answer this question, DeepL Translator, a machine translation system, and MateCat, a CAT system that integrates machine translation, were tested. The test, undertaken at different times and without specific translation memories, provided Eva Wiesmann: Machine Translation in… for the translation of several legal texts of different types utilising both systems, and was followed by systematisation of errors and evaluation of translation results. The evaluation was carried out according to the following criteria: 1) comprehensibility and meaningfulness of the target text; and 2) correspondence between source and target text in consideration of the specific translation situation. -
Automated Translation in Typo3
AUTOMATED TRANSLATION IN TYPO3 Whitepaper on Translating contents automatically using DeepL and Google Translate in TYPO3 backend by Georgy GE What is TYPO3? TYPO3 CMS is an Open Source Enterprise Content Management System with a large global community. TYPO3 is written in PHP scripting language and was initially authored by Dane Kasper Skårhøj in 1997. The standout features of TYPO3 is its diversity and modularity. It can keep running on multiple web servers like Apache, Nginx or IIS and over numerous operating systems such as Linux, Microsoft Windows, FreeBSD, Mac OS X and OS/2. TYPO3 provides the basis for modern content management that can be adapted by small business websites to large multi-lingual global enterprises portals. TYPO3 always keeps a check on the updated requirements of businesses and public institutions. TYPO3 is opted by small and medium enterprises and municipalities because of its license- cost-free open source approach. In addition to the basic set of interfaces, functions and modules, TYPO3's functionality spectrum can be implemented using extensions. TYPO3 allows users or website operators to upgrade the corresponding websites using a solid extensions framework, and helps to publish and deliver any form of content to multiple devices. During the course of development of a TYPO3 website, a developer can rely on a strong community and about 6,000 extensions that offer unlimited possibilities. Ph : +41 43 558 4360 E-mail: [email protected] Visit: www.pitsolutions.com 2 Why TYPO3? 1. No license cost TYPO3 is an open source software under the GNU General Public License with no license fee. -
An Evaluation of the Accuracy of Online Translation Systems
Communications of the IIMA Volume 9 Issue 4 Article 6 2009 An Evaluation of the Accuracy of Online Translation Systems Milam Aiken University of Mississippi Kaushik Ghosh University of Mississippi John Wee University of Mississippi Mahesh Vanjani University of Mississippi Follow this and additional works at: https://scholarworks.lib.csusb.edu/ciima Recommended Citation Aiken, Milam; Ghosh, Kaushik; Wee, John; and Vanjani, Mahesh (2009) "An Evaluation of the Accuracy of Online Translation Systems," Communications of the IIMA: Vol. 9 : Iss. 4 , Article 6. Available at: https://scholarworks.lib.csusb.edu/ciima/vol9/iss4/6 This Article is brought to you for free and open access by CSUSB ScholarWorks. It has been accepted for inclusion in Communications of the IIMA by an authorized editor of CSUSB ScholarWorks. For more information, please contact [email protected]. Examination of the Accuracy of Online Translation Systems Aiken, Ghosh, Wee & Vanjani An Evaluation of the Accuracy of Online Translation Systems Milam Aiken University of Mississippi USA [email protected] Kaushik Ghosh University of Mississippi USA [email protected] John Wee University of Mississippi USA [email protected] Mahesh Vanjani Texas Southern University USA [email protected] Abstract Until fairly recently, translation among a large variety of natural languages has been difficult and costly. Now, several free, Web-based machine translation (MT) services can provide support, but relatively little research has been conducted on their accuracies. A study of four of these services using German-to-English and Spanish-to-English translations showed that Google Translate appeared to be superior. Further study using this system alone showed that while translations were not always perfect, their understandability was quite high. -
Report About a Survey-Based Research on Machine Translation Post-Editing
REPORT ABOUT A SURVEY‐BASED RESEARCH ON MACHINE TRANSLATION POST‐EDITING Common ground and gaps between LSCs, linguists and trainers Clara Ginovart Cid Universitat Pompeu Fabra Barcelona 22/12/2020 This report presents the findings of a survey‐based research based on three online questionnaires. It is part of the Industrial Doctorates research n° “2017 DI 010”. One questionnaire is addressed to Language Service Companies (LSCs) that sell or use as a process machine translation post‐editing (MTPE, also abbreviated PEMT). It received 66 submissions. Another questionnaire is addressed to individual linguists (inhouse or freelance) who accept MTPE assignments, and it received 141 submissions. Finally, the third questionnaire is addressed to European master or postgraduate PE educators, and it received 54 submissions. The survey‐based research is completed in between the end of 2018 and the beginning of 2019. In this report, the content of each questionnaire is presented in the first section (1. Content of the Questionnaires). In the second section (2. Results of the Submissions), the data of their findings is displayed, where the percentages have been rounded off. The three related publications where the findings are discussed are cited below: Ginovart Cid C, Colominas C, Oliver A. Language industry views on the profile of the post‐ editor. Translation Spaces. 2020 Jan 14. DOI:10.1075/ts.19010.cid Ginovart Cid C. The Professional Profile of a Post‐editor according to LSCs and Linguists: a Survey‐Based Research. HERMES ‐ Journal of Language and Communication in Business, 60, 171‐190. 2020 Jul 08. DOI:10.7146/hjlcb.v60i0.121318 Ginovart Cid C, Oliver A. -
Assessing the Comprehensibility and Perception of Machine Translations
ASSESSING THE COMPREHENSIBILITY AND PERCEPTION OF MACHINE TRANSLATIONS A PILOT STUDY Aantal woorden: 16 804 Iris Ghyselen Studentennummer: 01400320 Promotor: Prof. dr. Lieve Macken Masterproef voorgelegd voor het behalen van de graad master in het vertalen in de richting Toegepaste Taalkunde Academiejaar: 2017 – 2018 ASSESSING THE COMPREHENSIBILITY AND PERCEPTION OF MACHINE TRANSLATIONS A PILOT STUDY Aantal woorden: 16 804 Iris Ghyselen Studentennummer: 01400320 Promotor: Prof. dr. Lieve Macken Masterproef voorgelegd voor het behalen van de graad master in het vertalen in de richting Toegepaste Taalkunde Academiejaar: 2017 – 2018 i VERKLARING I.V.M. AUTEURSRECHT De auteur en de promotor(en) geven de toelating deze studie als geheel voor consultatie beschikbaar te stellen voor persoonlijk gebruik. Elk ander gebruik valt onder de beperkingen van het auteursrecht, in het bijzonder met betrekking tot de verplichting de bron uitdrukkelijk te vermelden bij het aanhalen van gegevens uit deze studie. ii ACKNOWLEDGMENTS There are several people who deserve my profound gratitude for their help and support while I was writing this dissertation. In the first place I would like to give thanks to all the respondents who filled in my questionnaire. That small act of kindness meant a great deal to me. The questionnaire required some time and attention to fill in and was distributed in a period when people were being overwhelmed on social media with all kinds of questionnaires for other theses. Therefore, I am very grateful to each and every one of them who helped me. Secondly, I would like to express my deep sense of gratitude to my supervisor, Prof. -
Methods Diversity in Communicating Cochrane Evidence to Diverse Russian-Speaking Audiences the Problem Key Results
For Cochrane evidence in Russian we found steady rise in Interest and Confidence of website and social media followers within diverse audiences by profession, country, data acquisition, device use. Google translate performed better than Microsoft or DeepL translation engines. The Problem Machine Translation Quality • We translate Cochrane evidence in various formats (Memsource with machine translation Estimation engines for PLSs), and disseminate them via social networks. We maintain Cochrane Russia (Google Translator vs DeepL Translator website; vs Microsoft Translator) • We evaluated the coverage and impact of communicating evidence in Russian and its Research materials: 90 Cochrane outreach in Russia and beyond; PLSs, published in the last 6 months. On-line platform for evaluation: • We evaluated, which machine translation engine (MTE) works better for Russian translations Memsource translation management of Cochrane PLS. software. Participants: 10 human translators Methods Analyses: Default, Post-editing 1, Post-editing 2 1. Google analytics for the Cochrane Russia website and social networks; 2. Manual summing up of numerical data on followers, page views, likes, reposts in social Results of Default analysis networks Google translator (0,09) DeepL translator (0,03) 3. Comparison of numbers of website users, views and website sessions in 2016 , 2017 to 2018. 4. Comparison of 3 MTE by quality and appropriateness for Cochrane PLSs using the machine Microsoft translator (0,01) translation quality estimation tool (MTQE) and human post-editing -
Full Text (PDF)
Journal of Liberal Arts and Humanities (JLAH) Issue: Vol. 2; No. 3; March 2021 pp. 26-46 ISSN 2690-070X (Print) 2690-0718 (Online) Website: www.jlahnet.com E-mail: [email protected] Doi: 10.48150/jlah.v2no3.2021.a3 MACHINE TRANSLATION AND POSTEDITING: AN OPINION POLL OF THE PROFESSIONAL TRANSLATION MARKET Enrique J. Vercher García Departamento de Filología Alemana y Filología Eslava Universidad Complutense de Madrid 91 394 7758 | 600 33 91 45 E-mail: [email protected] Abstract Over the years, significant advances have been made in the field of machine translation. This type of translation and the different programs and applications which exist seem to be well known and accepted in the academic world specialised in translation. However, the level of knowledge and acceptance of machine translation seems to be very different in the business world for clients who request translation services. In our paper we present the result of a study carried out on the translation market to ascertain the level of knowledge, acceptance and application by TSPs and end clients of different sectors. The opinion poll has been carried out through surveys adapted to the three target groups: translation agencies, professional translators and clients/users of professional translation. In the case of TSPs (agencies and translators), information has been gathered on their awareness and provision of machine translation and postediting (machine translation + proofreading), the percentage of assignments carried out through this service, as well as an evaluation of pure machine translation (without postediting) from their perspective and from the perspective of clients. In the case of clients/users of professional translation, information has been gathered on their knowledge and evaluation of machine translation (software translators). -
Trados Studio Apps/Plugins for Machine Translation Latest Update: 7 April, 2018
Trados Studio apps/plugins for machine translation Latest update: 7 April, 2018. For a long time now, I’ve been intrigued by the very large number of apps/ plugins in the Studio appstore which give access – free or paid – to various types of machine translation services and facilities. Since I have lately found that the use of MT may give surprisingly good results at least for En > Sv (as well as completely useless ones), I was curious to know more about all these various options. Below is a brief overview of what I found trying to explore them to the best of my ability. Some general observations: Several of the plugins offer to build MT engines based on your own TMs (with some of them, that’s the only option) and, in some cases, also termbase data. This requires fairly big TMs – typically at least 80,000 TUs – for reasonable usefulness. This type of service of course also means full confidentiality (you are its only user). However, normally such an engine suffers the same problem as other MT engines: it is not updated while you translate; you have to rebuild the it with the new TM data – and that will take some time. Newer engines, however, gives the possibility of adding fragments as you go (SDL’s Adaptive MT does this automatically, but only for some language combinations), but for full usage of all new TUs, the engine must be rebuilt. Some plugins require you to sign up for an account, free or paid, but details about that is not always provided with the plugin/app. -
Stefanie Geldbach: Lexicon Exchange in MT
Stefanie Geldbach Lexicon Exchange in MT The Long Way to Standardization Abstract in diff erent systems. MT systems typically fol- Th is paper discusses the question to what extent low a lemma-oriented approach for the repre- lexicon exchange in MT has been standardized sentation of homonymy which means that diff e- during the last years. Th e introductory section rent semantic readings of one word are collapsed is followed by a brief description of OLIF2, a into one entry. Th e entry for Maus in the Ger- format specifi cally designed for the exchange of man monolexicon of LexShop 2.2 (see Section terminological and lexicographical data (Section 3.4) illustrates this approach. Th is entry contains 2). Section 3 contains an overview of the import/ (among others) following feature-value pairs: export functionalities of fi ve MT systems (Promt Expert 7.0, Systran 5.0 Professional Premium, CAN “Maus” Translate pro 8.0, LexShop 2.2, OpenLogos). CAT NST Th is evaluation shows that despite the standar- ALO “Maus” dization eff orts of the last years the exchange of TYN (ANI C-POT) lexicographical data between MT systems is still not a straightforward task. Th e feature TYN (type of noun) which indica- tes the semantic type of the given noun has two 1 Introduction values, ANI (animal) and C-POT (concrete-po- Th e creation and maintenance of MT lexicons tent) representing two diff erent concepts, i.e. the is time-consuming and cost-intensive. Th erefore, small rodent and the peripheral device. the development of standardized exchange for- Term bases usually are concept-oriented mats has received considerable attention over the which means that diff erent semantic readings of last years.