Using the Internet for Specialised Translation

Total Page:16

File Type:pdf, Size:1020Kb

Using the Internet for Specialised Translation USING THE INTERNET FOR SPECIALISED TRANSLATION 1 Translation Technology “much translation work is carried out in a computer-assisted translation (CAT) environment, which may vary from a standard desktop equipped with word processing software and a browser to a full-blown translator workstation consisting of a multiplicity of tools specifically created for translators of technical texts and localizers." “Tra ns la tion agencies organize their workflow around project management systems that distribute translation tasks, memories and terminologies to and around individual translators.” (F. Zanettin 2014, “Corpora in Translation”) Translation technologies • electronicdictionaries and terminologicaldatabases, the arrival of the Internet with its numerous possibilities for research, documentation and communication, and the emergence of computer-assisted translationtools. Alcina A. (2008) «Translation technologies - Scope, tools and resources». Targ et 20:1, 79–102 Degrees of Translation automation Degrees of Translation automation • The term traditional human translation is understood to refer to translation without any kind of automation • Fully automatichigh quality translation (FAHQT) means translation that is performed wholly by the computer, without any kind of human involvement, and is of “high quality” • Human-aided machine translation (HAMT) refers to systems in which the translation is essentially carried out by the program itself, but aid required from humans • Machine-aided human translation (MAHT) comprises any process or degree of automation in the translation process, provided that the mechanical intervention provides some kind of linguistic support. Tools vs.Resources • The word tool refers to computer programs that enable translators to carry out a series of functions or tasks with a set of data that they have prepared and, at the same time, allows a particular kind of results to be obtained. • Internet search engines • Word processor • Trados, Wordfast, Déjà Vu, Across, OmegaT, … • Antconc, Wordsmith… • By resources we refer to all sets of previously gathered linguistic data which are organized in a particular manner and made available in some electronic format so that they can be used or looked up or used by translators used in the course of some phase of processing. Te r m i n o l o g i c a l databases (e.g. IATE), glossaries, … • (online) dictionaries • British National Corpus, … why and how can we mine the web? Extended units of meaning • the study of words “by presenting them in thecompany they usually keep - that is to say, an element of their meaning is indicated when their habitual word accompaniments are shown” • “Extended units of meaning” at work in language (Sinclair, 1996) • collocation Words must be studied in • colligation context rather than in • semantic preference isolation • semantic prosody Extended units of meaning • Differences in Italian between (from Taylor, 1998: 61): ◦ “pressione alta” = “high (blood) pressure”[medical] ◦ “alta pressione” = “(banks of) high pressure” [meteorological] • collocation Words must be studied in • colligation context rather than in • semantic preference isolation • semantic prosody Collocation • “Tendency of certain words to co-occur regularly in a given language” (Mona Baker, 1992: 47) • As observed in actual texts (vs. intuition) • Key features of collocations o language-specific (collocations vary from language tolanguage) • Collocations are not stable or fixed o they may change diachronically (over time) in generallanguage o they may change in LSP vs. generallanguage o they may change across LSPdomains 11 Semantic prosody •“A consistent aura of meaning with which a form is imbued by its collcates” (Louw1993) • “Feeling” or “aura” that is evoked by using certain words (reinforced by collocates, due to co-selectional implications and restrictions) •Usually this feeling is “positive or negative” • “Provide” tends to occur with words denoting things which are desirable, necessary or good, such as “information”, “service(s)”, “support”, “help”, “money”, “protection”, “food”,“care” • cf. Italian “fornire” and “elargire” • “Cause” tends to occur with words denotingnegative repercussions/consequences, such as “pain”, “damage”, “harm” • cf. Italian “causare” •Not20necessarily accessible to intuition. Semantic prosody •“A consistent aura of meaning with which a form is imbued by its collcates” (Louw1993) • “Feeling” or “aura” that is evoked by using certain words (reinforced by collocates, due to co-selectional implications and restrictions) •Usually this feeling is “positive or negative” • “Provide” tends to occur with words denoting things which are desirable, necessary or good, such as “information”, “service(s)”, “support”, “help”, “money”, “protection”, “food”,“care” • cf. Italian “fornire” and “elargire” • “Cause” tends to occur with words denotingnegative repercussions/consequences, such as “pain”, “damage”, “harm” • cf. Italian “causare” Semantic preference •Relation between a lemma and a set of semantically related words (Stubbs, 2001: 65) • Lemma: base form (lexeme) or dictionary entry of a word • “Commit” is used with a group of semantically similar words, e.g. “murder”, “crime”, “suicide” (cf. Italian “commettere”) •“Revoke” is used with e.g. “licence”, “permit”,“authorization” •Semantic prosody à positive/negativeevaluation •Semantic preference à relation to words belonging to a particular, definable semantic field 14 Semantic preference •Relation between a lemma and a set of semantically related words (Stubbs, 2001: 65) • Lemma: dictionary entry of a word • “Commit” is used with a group of semantically similar words, e.g. “murder”, “crime”, “suicide” (cf. Italian “commettere”) •“Revoke” is used with e.g. “licence”, “permit”,“authorization” •Semantic prosody à positive/negativeevaluation •Semantic preference à relation to words belonging to a particular, definable semantic field 15 Colligation •Relation between a pair of grammatical categories or a pairing of lexis and grammar (Stubbs, 2001: 65) • hear, notice, see, watchenters into colligation with the sequence of object + either the bare infinitive or the -ing form; e.g.Corr • We heard the visitors leave/leaving. • We noticed him walk away/walkingaway. • We heard Pavarotti sing/singing. • We saw it fall/falling.esponding collocations and colligations in Italian for “break the law”? 16 Colligation •Relation between a pair of grammatical categories or a pairing of • hear, notice, see, watch enters into colligation with the sequence of object + either the bare infinitive or the -ing form; e.g. • We heard the visitors leave/leaving. • We noticed him walk away/walkingaway. • We heard Pavarotti sing/singing. • We saw it fall/falling.esponding collocations and colligations in Italian for •Relation “break betweenthe law”? a pair of grammatical categories or a pairing of lexis and grammar (Stubbs, 2001: 65) 17 Conclusion on using the Web for specialised translation – Main advantages • massive amount of texts and multi-source information can be searched • content is constantly “refreshed” (i.e. updated and extended) • a lot of sources, text types and domains/topics are representedHow to friend and unfriend someone on Facebook - Computer Hope 1.https://www.computerhope.com › ... › Facebook Help • many24 gen languages2018 - Before (Englishyou can is dominant,connect with anothergood presenceperson on of Italian)Facebook and view their full profile, you must first become friends. Below are the steps on how to find new friends on Facebook, add • replicablefriends, and search how to techniquesunfriend any of across your current (yourfriends. How to find working/target)friends on Facebooklanguages; How to friend someone on ... • it is available at any time, at virtuallyno cost! Conclusion on using the Web for specialised translation – Main advantages • massive amount of texts and multi-source information can be searched • content is constantly “refreshed” (i.e. updated and extended) • a lot of sources, text types and domains/topics are represented • many languages (English is dominant, good presence of Italian) • replicable search techniques across (your working/target) languages • it is available at any time, at virtuallyno cost! Main disadvantages andproblems o need to differentiate good/reliable sources from questionableinformation §for facts (limited control over user-generated content like Wikipedia) §for linguistic usage (badly translated, non-native texts, poorauthors) §it may be difficult to identify differences between expert/non-expert use o data/results still need to be interpreted Main disadvantages andproblems o Google focuses on content/information, rather than linguistic forms • the ranking and sorting of results are performed according to criteria like • “popularity” of the websites, or geographic relevance • the same search can yield different numbers of hits, depending on unpredictable and uncontrollable factors as the time of the day, or the location from which the query is made -- word counts are not reliable +it is difficult to compare frequencies to verify translation hypothes • data on which searches are performed isunstable/changes Main disadvantages and problems Particularly relevant to linguists/translators: § no possible/meaningful sorting of hits/results(esp. L/R-hand collocates) - e.g. alphabetical sorting of collocates, from least to most frequent,etc. - think of e.g. the “a * range/array of”, “on the verge of” exercises § punctuation and upper case (capitals) are ignored, e.g. “aids”
Recommended publications
  • Translators' Tool
    The Translator’s Tool Box A Computer Primer for Translators by Jost Zetzsche Version 9, December 2010 Copyright © 2010 International Writers’ Group, LLC. All rights reserved. This document, or any part thereof, may not be reproduced or transmitted electronically or by any other means without the prior written permission of International Writers’ Group, LLC. ABBYY FineReader and PDF Transformer are copyrighted by ABBYY Software House. Acrobat, Acrobat Reader, Dreamweaver, FrameMaker, HomeSite, InDesign, Illustrator, PageMaker, Photoshop, and RoboHelp are registered trademarks of Adobe Systems Inc. Acrocheck is copyrighted by acrolinx GmbH. Acronis True Image is a trademark of Acronis, Inc. Across is a trademark of Nero AG. AllChars is copyrighted by Jeroen Laarhoven. ApSIC Xbench and Comparator are copyrighted by ApSIC S.L. Araxis Merge is copyrighted by Araxis Ltd. ASAP Utilities is copyrighted by eGate Internet Solutions. Authoring Memory Tool is copyrighted by Sajan. Belarc Advisor is a trademark of Belarc, Inc. Catalyst and Publisher are trademarks of Alchemy Software Development Ltd. ClipMate is a trademark of Thornsoft Development. ColourProof, ColourTagger, and QA Solution are copyrighted by Yamagata Europe. Complete Word Count is copyrighted by Shauna Kelly. CopyFlow is a trademark of North Atlantic Publishing Systems, Inc. CrossCheck is copyrighted by Global Databases, Ltd. Déjà Vu is a trademark of ATRIL Language Engineering, S.L. Docucom PDF Driver is copyrighted by Zeon Corporation. dtSearch is a trademark of dtSearch Corp. EasyCleaner is a trademark of ToniArts. ExamDiff Pro is a trademark of Prestosoft. EmEditor is copyrighted by Emura Software inc. Error Spy is copyrighted by D.O.G. GmbH. FileHippo is copyrighted by FileHippo.com.
    [Show full text]
  • Introducing a Cat Tool to Translate: Wordfast
    Indonesian Journal of English Language Studies Vol. 2, Number 1, February 2016 Introducing a Cat Tool to Translate: Wordfast Fika Apriliana, Ardiyarso Kurniawan, Sandy Ferianda, and Fidelis Chosa Kastuhandani ABSTRACT This article aims at introducing CAT tools to those prospective translators who are familiar with with the tools for the first time. Some of the CAT tools must be paid for while some others are free. This article is to inform the readers about the list of free and paid CAT tools, advantages and disadvantages of those tools. One does not need special training for using a free CAT tool while using the paid CAT tools, one needs some special preparation. This article is going to focus more on Wordfast Pro as the second most widely used CAT tools after SDLTrados. Wordfast Pro is a paid software the functioning of which is based on the creation of a Translation Memory which facilitates and speeds up the translator's work. This article is going to briefly explain the advantages of Wordfast Pro and the steps of using it. The translation example is presented to reveal the different translation results of Wordfast Pro as a paid CAT tool and OmegaT as a free CAT tool. Therefore, the article will facilitate those who intend to know more about Wordfast Pro and start using it. Keywords: CAT tools, Wordfast Pro, OmegaT INTRODUCTION When using CAT tools, it is Computerized translation has attracted immediately clear which parts of the text the attention of a large number of people must be translated; the unchanging who work directly or indirectly on portions are transferred accurately and translational issues such as professional directly; the time savings due to repeating translators, teachers, linguists, researchers expressions is huge; and expressions are and future translators.
    [Show full text]
  • Omegat 2.1 Manual De Usuario
    Acerca de OmegaT ― OmegaT 2.1 Manual de Usuario Acerca de OmegaT OmegaT es una herramienta multiplataforma para la Traducción Asistida por Ordenador, es software libre y cuenta con los siguientes atractivos: Memoria de traducción OmegaT recuerda sus traducciones en una memoria de traducción. Al mismo tiempo, puede utilizar referencias a memorias de traducción anteriores. Las memorias de traducción pueden ser muy útiles para una traducción con una variedad de repeticiones o segmentos de texto razonablemente similares. OmegaT utiliza memorias de traducción para recordar sus traducciones anteriores y sugerirle la traducción más probable para el texto en que está trabajando. Las memorias de traducción pueden ser muy útiles cuando necesita actualizar un documento, que ya ha sido traducido. Sin modificación se seguirá traduciendo y las oraciones actualizadas se muestran con una versión anterior de la misma sentencia que la traducción más probable. Las modificaciones al documento original, por lo tanto, serán tratadas con mayor facilidad. Si está utilizando memorias de traducción creadas previamente, por ejemplo, que le ha asignado la agencia de traducción o su cliente, OmegaT será capaz de utilizarlas como memorias de referencia. OmegaT utiliza el formato de archivo TMX estándar para almacenar y acceder a las memorias de traducción, lo cual garantiza que usted puede intercambiar su material con otras aplicaciones de traducción CAT, que soporten este formato de archivo. Gestión de terminología La gestión de la terminología es importante para la consistencia de la traducción. OmegaT utiliza glosarios que contienen la traducción de palabras sueltas o pequeñas frases, una especie de diccionario bilingüe simplificado para un dominio específico.
    [Show full text]
  • Omegat CAT- Aloittelijoille Kirjoittaneet Susan Welsh & Marc Prior Suomentanut Taija Salo 1
    OmegaT CAT- aloittelijoille kirjoittaneet Susan Welsh & Marc Prior Suomentanut Taija Salo 1 Tekijänoikeus Tekijänoikeus © 2009 Susan Welsh ja Marc Prior Tätä dokumenttia saa kopioida, levittää ja/tai muokata Free Software Foundationin julkaiseman GNU Free Documentation Licensen version 1.2 tai minkä hyvänsä myöhemmän version mukaisesti; vakiolukuja (”Invariant Sections”), etukansitekstejä (”Front Cover Texts”) ja takakansitekstejä ("Back- Cover Texts") ei saa käyttää. Kopio lisenssistä sisältyy osaan nimeltä "GNU Free Documentation License". Kannen kuva on peräisin osoitteesta www.freeclipartnow.com ja on asetettu vapaasti yleiseen käyttöön. Viimeisin päivitys tehty tammikuussa 2009. Viittaa OmegaT:n versioon 2.0.0. Kuvakaappaukset OmegaT:n versioista 1.6.0 ja 2.0.0. 2 Sisällysluettelo Tekijänoikeus......................................................................................................1 Esittely................................................................................................................3 Kenelle opas on suunnattu.......................................................................3 Mikä on CAT-työkalu, ja mitä hyötyä niistä on?......................................3 1. OpenOffice.orgin lataaminen..........................................................................3 2. OmegaT:n lataaminen.....................................................................................4 3. OmegaT:n asentaminen..................................................................................4 4. OmegaT:n käyttöliittymä................................................................................4
    [Show full text]
  • Translate's Localization Guide
    Translate’s Localization Guide Release 0.9.0 Translate Jun 26, 2020 Contents 1 Localisation Guide 1 2 Glossary 191 3 Language Information 195 i ii CHAPTER 1 Localisation Guide The general aim of this document is not to replace other well written works but to draw them together. So for instance the section on projects contains information that should help you get started and point you to the documents that are often hard to find. The section of translation should provide a general enough overview of common mistakes and pitfalls. We have found the localisation community very fragmented and hope that through this document we can bring people together and unify information that is out there but in many many different places. The one section that we feel is unique is the guide to developers – they make assumptions about localisation without fully understanding the implications, we complain but honestly there is not one place that can help give a developer and overview of what is needed from them, we hope that the developer section goes a long way to solving that issue. 1.1 Purpose The purpose of this document is to provide one reference for localisers. You will find lots of information on localising and packaging on the web but not a single resource that can guide you. Most of the information is also domain specific ie it addresses KDE, Mozilla, etc. We hope that this is more general. This document also goes beyond the technical aspects of localisation which seems to be the domain of other lo- calisation documents.
    [Show full text]
  • Reciprocal Enrichment Between Basque Wikipedia and Machine Translation
    Reciprocal Enrichment Between Basque Wikipedia and Machine Translation Inaki˜ Alegria, Unai Cabezon, Unai Fernandez de Betono,˜ Gorka Labaka, Aingeru Mayor, Kepa Sarasola and Arkaitz Zubiaga Abstract In this chapter, we define a collaboration framework that enables Wikipe- dia editors to generate new articles while they help development of Machine Trans- lation (MT) systems by providing post-edition logs. This collaboration framework was tested with editors of Basque Wikipedia. Their post-editing of Computer Sci- ence articles has been used to improve the output of a Spanish to Basque MT system called Matxin. For the collaboration between editors and researchers, we selected a set of 100 articles from the Spanish Wikipedia. These articles would then be used as the source texts to be translated into Basque using the MT engine. A group of volunteers from Basque Wikipedia reviewed and corrected the raw MT translations. This collaboration ultimately produced two main benefits: (i) the change logs that would potentially help improve the MT engine by using an automated statistical post-editing system , and (ii) the growth of Basque Wikipedia. The results show that this process can improve the accuracy of an Rule Based MT (RBMT) system in nearly 10% benefiting from the post-edition of 50,000 words in the Computer Inaki˜ Alegria Ixa Group, University of the Basque Country UPV/EHU, e-mail: [email protected] Unai Cabezon Ixa Group, University of the Basque Country, e-mail: [email protected] Unai Fernandez de Betono˜ Basque Wikipedia and University of the Basque Country, e-mail: [email protected] Gorka Labaka Ixa Group, University of the Basque Country, e-mail: [email protected] Aingeru Mayor Ixa Group, University of the Basque Country, e-mail: [email protected] Kepa Sarasola Ixa Group, University of the Basque CountryU, e-mail: [email protected] Arkaitz Zubiaga Basque Wikipedia and Queens College, CUNY, CS Department, Blender Lab, New York, e-mail: [email protected] 1 2 Alegria et al.
    [Show full text]
  • Omegat 4.2 - User's Guide Omegat Team Omegat 4.2 - User's Guide Omegat Team
    OmegaT 4.2 - User's Guide OmegaT team OmegaT 4.2 - User's Guide OmegaT team Publication date Abstract This document is the official user's guide to OmegaT, the free Computer Aided Translation tool. It also contains installation instructions. Table of Contents 1. Installing and running OmegaT ....................................................................................... 1 1. Windows Users ......................................................................................................... 1 2. Linux (Intel) Users .................................................................................................... 2 3. macOS Users ............................................................................................................ 3 4. Other Systems .......................................................................................................... 4 5. Drag and drop .......................................................................................................... 5 6. Using Java Web Start ............................................................................................... 5 7. Starting OmegaT from the command line ............................................................... 5 8. Building OmegaT From Source ................................................................................ 9 2. Menus ............................................................................................................................. 11 1. Project ...................................................................................................................
    [Show full text]
  • Omegat for CAT Beginners by Susan Welsh & Marc Prior 2
    OmegaT for CAT Beginners by Susan Welsh & Marc Prior 2 Copyright Copyright © 2014 Susan Welsh and Marc Prior Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled “GNU Free Documentation License.” Cover illustration is from www.freeclipartnow.com, in the public domain. Last updated: March 2014 Refers to OmegaT version: 3.0.8_2 Screenshots from OmegaT versions: 1.6.0, 3.0.8_2 Please note that owing to the pace at which OmegaT is being developed, the appearance of the screenshots and possibly some other information may have changed slightly. 3 Table of Contents Copyright............................................................................................................2 Introduction........................................................................................................5 Intended readership................................................................................5 What is a CAT tool and why are they useful?..........................................5 1. Downloading OmegaT.....................................................................................5 2. Installing OmegaT...........................................................................................5 3. The OmegaT user interface............................................................................5
    [Show full text]
  • Omegat Für CAT- Anfänger Von Susan Welsh & Marc Prior
    OmegaT für CAT- Anfänger von Susan Welsh & Marc Prior Übersetzung aus dem Englischen: Esther Schwemmlein 1 Copyright Copyright © 2009 Susan Welsh und Marc Prior Unter Einhaltung der GNU Free Documentation License, Version 1.2 oder jeder anderen späteren Version, die durch die Free Software Foundation veröffentlicht wird, können Sie dieses Dokument kopieren, verteilen und/oder verändern. Das Dokument hat keine unveränderlichen Abschnitte und keinen vorderen oder hinteren Umschlagtext. Eine Kopie der Lizenz ist im Abschnitt "GNU Free Documentation License" beigefügt. Die Lizenz steht dort in ihrer englischsprachigen Originalfassung. Verschiedene Übersetzungen der "GNU-Lizenz für freie Dokumentation" ins Deutsche können Sie im Internet finden, unter anderem auf der Website http://www.gnu.de. Die Illustration auf dem Umschlag stammt von www.freeclipartnow.com und ist Gemeingut. Zuletzt aktualisiert: Januar 2010 Bezieht sich auf die OmegaT-Version: 2.0.0 Screenshots der OmegaT-Version: 2.0.5_2 Titel der Originalfassung: OmegaT for CAT Beginners Übersetzung aus dem Englischen: Esther Schwemmlein, März 2011 2 Inhaltsverzeichnis Copyright............................................................................................................1 Einführung..........................................................................................................3 Zielgruppe................................................................................................3 Was ist ein CAT-Tool und warum ist es praktisch?.................................3
    [Show full text]
  • Omegat : JDLL, Lyon
    OmegaT Dublin Computational Linguistic Research Seminars Didier Briel June 2012 Contents • OmegaT workflow • Main features • Plugins • Exchange with other CAT tools • Supported formats • The OmegaT project • Availability • Support Dublin Computational Linguistic Research Seminars June 2012 OmegaT OmegaT workflow Main characteristics Translation of a file Demonstration OmegaT workflow Main characteristics • Completely stand-alone – None of its features depends on the installation of other software (e.g., Microsoft Office) • Available on all platforms compatible with Java 1.5 and later • No intermediate format – No preparation • Import or conversion – No “clean-up” – Instantaneous dynamic modification of projects (adding/changing/removing documents) • No database – All data are processed in memory – Very fast – Data size is limited • Automatic propagation of translations Dublin Computational Linguistic Research Seminars June 2012 OmegaT workflow Translation of a file • Creating a project • If needed, conversion of the source file • Installing glossaries and translation memories • Translation • Generating the target documents • If needed, conversion of the target file Dublin Computational Linguistic Research Seminars June 2012 OmegaT Main features RTL and bidi issues Concepts Main features • Fuzzy matching • Automatic propagation of translations • Glossaries • Search terms in the project, in reference memories and in reference documents • Projects can contain an unlimited number of folders and files, in all supported formats • Right to left
    [Show full text]
  • A Comparative Study of Omegat and Memsource: Advantages and Disadvantages of Their Primary Features
    A comparative study of OmegaT and Memsource: advantages and disadvantages of their primary features Student: Laura Moreno Sorolla Supervisor: Dr. Bartolomé Mesa Lao Master’s degree in Specialised Translation Universitat Oberta de Catalunya February 2018 1 Contents Abstract ......................................................................................................................................... 3 1. Introduction ............................................................................................................................ 4 1.1 Motivation ...................................................................................................................... 4 1.2 Aim ................................................................................................................................ 4 2. Literature review .................................................................................................................... 4 3. Difference between CAT tools and cloud-based translation systems ................................... 5 4. Materials and methods .......................................................................................................... 7 4.1 Tools .............................................................................................................................. 7 4.2 Methods ......................................................................................................................... 7 5. Results ..................................................................................................................................
    [Show full text]
  • FREE CAT TOOLS AS an ALTERNATIVE to COMMERCIAL SOFTWARE: Omegat
    FACULTAD DE TRADUCCIÓN E INTERPRETACIÓN Grado en Traducción e Interpretación TRABAJO FIN DE GRADO FREE CAT TOOLS AS AN ALTERNATIVE TO COMMERCIAL SOFTWARE: OmegaT Presentado por Veronica Nicoleta Anica Tutelado por Ana María Alconchel Soria, 2014 Free CAT tools as an alternative to commercial software: OmegaT Content ACKNOWLEDGEMENT ............................................................................................................... 4 I. INTRODUCTION ....................................................................................................................... 6 1. Connection with competencies ............................................................................................ 7 1.1. General competencies ................................................................................................... 7 1.2. Specific competencies ................................................................................................... 8 1. PURPOSE ............................................................................................................................. 10 2. METHODOLOGY ................................................................................................................... 11 II. THEORETICAL APPROACH ................................................................................................... 13 1. Translation and technology ................................................................................................ 13 1.1. Technological advances and the process of globalization
    [Show full text]