Translate Toolkit Documentation Release 2.0.0

Total Page:16

File Type:pdf, Size:1020Kb

Translate Toolkit Documentation Release 2.0.0 Translate Toolkit Documentation Release 2.0.0 Translate.org.za January 27, 2017 Contents 1 User’s Guide 3 1.1 Features..................................................3 1.2 Installation................................................4 1.3 Converters................................................6 1.4 Tools................................................... 56 1.5 Scripts.................................................. 91 1.6 Use Cases................................................. 103 1.7 Translation Related File Formats..................................... 120 2 Developer’s Guide 149 2.1 Translate Styleguide........................................... 149 2.2 Documentation.............................................. 155 2.3 Building................................................. 159 2.4 Testing.................................................. 159 2.5 Command Line Functional Testing................................... 161 2.6 Contributing............................................... 163 2.7 Translate Toolkit Developers Guide................................... 165 2.8 Making a Translate Toolkit Release................................... 169 2.9 Deprecation of Features......................................... 174 3 Additional Notes 177 3.1 Release Notes.............................................. 177 3.2 Changelog................................................ 240 3.3 History of the Translate Toolkit..................................... 248 3.4 License.................................................. 249 4 API Reference 251 4.1 API.................................................... 251 Python Module Index 535 i ii Translate Toolkit Documentation, Release 2.0.0 Welcome to Translate Toolkit’s documentation. This documentation covers both user’s and programmer’s perspective. Contents 1 Translate Toolkit Documentation, Release 2.0.0 2 Contents CHAPTER 1 User’s Guide This part has the user’s documentation for the tools included in the Translate Toolkit. 1.1 Features • Work with ONE localisation format. You’ll no longer be editing DTD files in one tool, .properties in another, OpenOffice GSI in a third. Simply do all your localisation in a PO or XLIFF editor • Converters for a number of formats – OpenOffice.org SDF/GSI – Mozilla: .properties, DTD, XHTML, .inc, .ini, etc – Others: Comma Separated Value, TMX, XLIFF, TBX, PHP, WordFast TXT, Qt .ts, txt, .ini, Windows .rc, ical, subtitles, Mac OS X strings • File access to localization files through the format API in all the above formats, as well as .qph, .qm, .mo • Output valid target file types. We make sure that your output files (e.g. .properties) contain all comments from the original file and preserves the layout of the original as far as possible. If your PO entry is marked as fuzzy we use the English text, not your half complete translation. The converters for OpenOffice.org and Mozilla formats will also perform simple checks and corrections to make sure you have none of those hard to find localisation bugs. • Our checker has over 42 checks to find errors such as: missing or translated variables, missing accelerator keys, bad escaping, start capitalisation, missing sentences, bad XML and much more. • Language awareness, taking language conventions for capitalisation, quotes and other punctuation into account • Find conflicting translations easily, cases where you have translated a source word differently or used a target word for 2 very different English concepts • Extract messages using simple text or a regular expression allowing you to quickly find and extract words that you need to fix due to glossary changes. • Merge snippets of PO files into your existing translations. • Create word, string and file counts of your files. Making it much easier to budget time as string counts do not give you a good indication of expected work. • Create a set of PO files with debugging entries to allow you to easily locate the source of translations. Very use- ful in OpenOffice.org which provides scant clues as to where the running application has sourced the message. The Translate Toolkit is also a powerful API for writing translation and localisation tools, already used by our own and several other projects. See the base class section for more information. 3 Translate Toolkit Documentation, Release 2.0.0 1.2 Installation This is a guide to installing the Translate Toolkit on your system. If the Translate Toolkit is already packaged for your system, this is probably the easiest way to install it. For several Linux distributions, the package might be available through your package manager. On Windows, we recommend using a virtual environment. If your system already has the toolkit prepackaged, then please let us know what steps are required to install it. 1.2.1 Building For build instructions, see the Building page. 1.2.2 Download Download a stable released version. Or if you have a python environment, run pip install translate-toolkit. For those who need problems fixed, or who want to work on the bleeding edge, get the latest source from Git. If you install through your distribution’s package manager, you should automatically have all the dependencies you need. If you are installing a version from Version Control, or from a source release, you should check the README file for information on the dependencies that are needed. Some of the dependencies are optional. The README file documents this. 1.2.3 Installing packaged versions Get the package for your system: RPM If you want to install easily on an RPM based system .tar.gz for source based installing on Linux .deb for Debian GNU/Linux (etch version) The RPM package can be installed by using the following command: $ rpm -Uvh translate-toolkit-1.0.1.rpm To install a tar.bz2: $ tar xvjf translate-toolkit-1.1.0.tar.bz2 $ cd translate-toolkit-1.1.0 $ su $ ./setup.py install On Debian (if you are on etch), just type the following command: $ aptitude install translate-toolkit If you are using an old Debian stable system, you might want to install the .tar.bz2 version. Be sure to install python and python development first with: $ apt-get install python python-dev Alternatively newer packages might be in testing. 4 Chapter 1. User’s Guide Translate Toolkit Documentation, Release 2.0.0 1.2.4 Installing on Windows On Windows we recommend that you install Translate Toolkit using a virtual environment. This makes installation clean and isolated. Use the latest Python 2.7 (at least Python 2.7.9 or newer as it bundles the pip installer). Install virtualenvwrapper-win to simplify handling of virtualenvs. 1. Install latest Python 2.7 2. Open cmd.exe or similar 3. pip install virtualenvwrapper-win 4. mkvirtualenv ttk where “ttk” is the name for the new virtualenv 5. pip install translate-toolkit to install latest stable or pip install –pre translate-toolkit to try a pre-release 6. pip install “lxml>=3.5” to be able to use XLIFF or other XML formats 7. po2prop –version to double check you have the right version Next times you need to use Translate Toolkit just remember to: 1. Open cmd.exe or similar 2. workon ttk to enable the virtualenv again 3. Run the Translate Toolkit commands you want 1.2.5 Installing from Git If you want to try the bleeding edge, or just want to have the latest fixes from a stabilising branch then you need to use Git to get your sources: $ git clone https://github.com/translate/translate.git This will retrieve the master branch of the Toolkit. Further Git instructions are also available. Once you have the sources you have two options, a full install: $ su $ ./setup.py install or, running the tools from the source directory: $ ./setuppath # Only needed the first time $ . setpath # Do this once for a session 1.2.6 Verify installed version To verify which version of the toolkit you have installed run: $ prop2po --version prop2po 2.0.0 1.2. Installation 5 Translate Toolkit Documentation, Release 2.0.0 1.2.7 Cleaning up existing installation To remove old versions of the toolkit which you might have installed without a virtual environment or without your package manager. The following advice only applies to manual installation from a tarball. 1. Find location of your python packages: $ python -c "from distutils.sysconfig import get_python_lib; print(get_python_lib())" 2. Delete toolkit package from your Python site-packages directory e.g.: $ rm -R /usr/local/lib/python2.7/dist-packages/translate 1.3 Converters 1.3.1 General Usage The tools follow a general usage convention which is helpful to understand. Input & Output The last two arguments of your command are the input and output files/directories: moz2po <input> <output> You can of course still use the -i and -o options which allows you to reorder commands moz2po -o <output> -i <input> Error Reporting All tools accept the option --errorlevel. If you find a bug, add this option and send the traceback to the develop- ers. moz2po<other-options>--errorlevel=traceback Templates If you are working with any file format and you wish to preserve comments and layout then use your source file as a template. po2dtd -t <source-file> <input> <output> This will use the files in <source-file> as a template, merge the PO files in <input>, and create new DTD files in <output> If you ran this without the templates you would get valid DTD files but they would not preserve the layout or all the comments from the source DTD file The same concept of templates is also used when you merge files. 6 Chapter 1. User’s Guide Translate Toolkit Documentation, Release 2.0.0 pomerge -t <old> <fixes> <new> This would take the <old> files merge in the <fixes> and output new PO files, preserving formatting, into <new>. You can use the same directory for <old> and <new> if you want the merges to overwrite files in <old>. source2target The converters all follow this convention: • source = the format from which you are converting e.g. in oo2po we are converting from OpenOffice.org SDF/GSI • target = the format into which you are converting e.g. in oo2po we are converting to Gettext PO Getting Help The --help option will always list the available commands for the tool. moz2po--help 1.3.2 moz2po moz2po converts Mozilla files to PO files.
Recommended publications
  • Translators' Tool
    The Translator’s Tool Box A Computer Primer for Translators by Jost Zetzsche Version 9, December 2010 Copyright © 2010 International Writers’ Group, LLC. All rights reserved. This document, or any part thereof, may not be reproduced or transmitted electronically or by any other means without the prior written permission of International Writers’ Group, LLC. ABBYY FineReader and PDF Transformer are copyrighted by ABBYY Software House. Acrobat, Acrobat Reader, Dreamweaver, FrameMaker, HomeSite, InDesign, Illustrator, PageMaker, Photoshop, and RoboHelp are registered trademarks of Adobe Systems Inc. Acrocheck is copyrighted by acrolinx GmbH. Acronis True Image is a trademark of Acronis, Inc. Across is a trademark of Nero AG. AllChars is copyrighted by Jeroen Laarhoven. ApSIC Xbench and Comparator are copyrighted by ApSIC S.L. Araxis Merge is copyrighted by Araxis Ltd. ASAP Utilities is copyrighted by eGate Internet Solutions. Authoring Memory Tool is copyrighted by Sajan. Belarc Advisor is a trademark of Belarc, Inc. Catalyst and Publisher are trademarks of Alchemy Software Development Ltd. ClipMate is a trademark of Thornsoft Development. ColourProof, ColourTagger, and QA Solution are copyrighted by Yamagata Europe. Complete Word Count is copyrighted by Shauna Kelly. CopyFlow is a trademark of North Atlantic Publishing Systems, Inc. CrossCheck is copyrighted by Global Databases, Ltd. Déjà Vu is a trademark of ATRIL Language Engineering, S.L. Docucom PDF Driver is copyrighted by Zeon Corporation. dtSearch is a trademark of dtSearch Corp. EasyCleaner is a trademark of ToniArts. ExamDiff Pro is a trademark of Prestosoft. EmEditor is copyrighted by Emura Software inc. Error Spy is copyrighted by D.O.G. GmbH. FileHippo is copyrighted by FileHippo.com.
    [Show full text]
  • Structuring Your Content & Projects for Continuous Localization
    GUIDE Structuring Your Content & Projects for Continuous Localization Table of Contents Introduction: Lessons from 10+ Years of Taking Companies Global • Learning Challenges to Overcome Them • A Guide for Fast, Tech-Focused Companies • What Not to Do: Common Software Localization Mistakes Part 1: Internationalization • The Importance of Internationalization • Finding & Fixing Internationalization Bugs • The Software Architecture Checklist Part 2: Structuring Your Files • Organizing Your Projects & Resources • Building a Global Content Repository • Structuring Your Files • Common Localization File Formats Part 3: Workflows & Integrations • Finding the Best Approach for Your Team • How to Integrate with Your TMS Wrap-Up • The Future of Localization: Continuous & Cloud-Based • About Transifex Native Transifex | Structuring Your Content & Projects for Continuous Localization 2 Introduction Lessons from 10+ Years of Taking Companies Global A Guide for Fast, Tech-Focused Companies We put together this guide for fast-moving companies like you — to empower your team to understand and leverage the power of continuous localization to overcome all those challenges that come with the outdated form of localization (a lot of manual files and email chains). In this guide, we break down the ins and outs of effectively structuring your content and projects for continuous localization, so you can finally localize at the speed of your content and product creation. Learning Challenges to Overcome Them In our journey building this platform, we’ve learned firsthand
    [Show full text]
  • Introducing a Cat Tool to Translate: Wordfast
    Indonesian Journal of English Language Studies Vol. 2, Number 1, February 2016 Introducing a Cat Tool to Translate: Wordfast Fika Apriliana, Ardiyarso Kurniawan, Sandy Ferianda, and Fidelis Chosa Kastuhandani ABSTRACT This article aims at introducing CAT tools to those prospective translators who are familiar with with the tools for the first time. Some of the CAT tools must be paid for while some others are free. This article is to inform the readers about the list of free and paid CAT tools, advantages and disadvantages of those tools. One does not need special training for using a free CAT tool while using the paid CAT tools, one needs some special preparation. This article is going to focus more on Wordfast Pro as the second most widely used CAT tools after SDLTrados. Wordfast Pro is a paid software the functioning of which is based on the creation of a Translation Memory which facilitates and speeds up the translator's work. This article is going to briefly explain the advantages of Wordfast Pro and the steps of using it. The translation example is presented to reveal the different translation results of Wordfast Pro as a paid CAT tool and OmegaT as a free CAT tool. Therefore, the article will facilitate those who intend to know more about Wordfast Pro and start using it. Keywords: CAT tools, Wordfast Pro, OmegaT INTRODUCTION When using CAT tools, it is Computerized translation has attracted immediately clear which parts of the text the attention of a large number of people must be translated; the unchanging who work directly or indirectly on portions are transferred accurately and translational issues such as professional directly; the time savings due to repeating translators, teachers, linguists, researchers expressions is huge; and expressions are and future translators.
    [Show full text]
  • Omegat 2.1 Manual De Usuario
    Acerca de OmegaT ― OmegaT 2.1 Manual de Usuario Acerca de OmegaT OmegaT es una herramienta multiplataforma para la Traducción Asistida por Ordenador, es software libre y cuenta con los siguientes atractivos: Memoria de traducción OmegaT recuerda sus traducciones en una memoria de traducción. Al mismo tiempo, puede utilizar referencias a memorias de traducción anteriores. Las memorias de traducción pueden ser muy útiles para una traducción con una variedad de repeticiones o segmentos de texto razonablemente similares. OmegaT utiliza memorias de traducción para recordar sus traducciones anteriores y sugerirle la traducción más probable para el texto en que está trabajando. Las memorias de traducción pueden ser muy útiles cuando necesita actualizar un documento, que ya ha sido traducido. Sin modificación se seguirá traduciendo y las oraciones actualizadas se muestran con una versión anterior de la misma sentencia que la traducción más probable. Las modificaciones al documento original, por lo tanto, serán tratadas con mayor facilidad. Si está utilizando memorias de traducción creadas previamente, por ejemplo, que le ha asignado la agencia de traducción o su cliente, OmegaT será capaz de utilizarlas como memorias de referencia. OmegaT utiliza el formato de archivo TMX estándar para almacenar y acceder a las memorias de traducción, lo cual garantiza que usted puede intercambiar su material con otras aplicaciones de traducción CAT, que soporten este formato de archivo. Gestión de terminología La gestión de la terminología es importante para la consistencia de la traducción. OmegaT utiliza glosarios que contienen la traducción de palabras sueltas o pequeñas frases, una especie de diccionario bilingüe simplificado para un dominio específico.
    [Show full text]
  • Teaching Video Game Translation: First Steps, Systems and Hands-On Experience Ensinando Tradução De Videogame: Primeiros Passos, Sistemas E Experiência Prática
    http://periodicos.letras.ufmg.br/index.php/textolivre Belo Horizonte, v. 11, n. 1, p. 103-120, jan.-abr. 2018 – ISSN 1983-3652 DOI: 10.17851/1983-3652.11.1.103-120 TEACHING VIDEO GAME TRANSLATION: FIRST STEPS, SYSTEMS AND HANDS-ON EXPERIENCE ENSINANDO TRADUÇÃO DE VIDEOGAME: PRIMEIROS PASSOS, SISTEMAS E EXPERIÊNCIA PRÁTICA Marileide Dias Esqueda Universidade Federal de Uberlândia [email protected] Érika Nogueira de Andrade Stupiello Universidade Estadual Paulista “Júlio de Mesquista Filho” [email protected] ABSTRACT: Despite the significant growth of the game localization industry in the past years, translation undergraduate curricula in Brazil still lacks formal training in game localization, often leaving novice translators no alternative but to search for the required skills informally in game translation communities. Designing a video game localization course in translation undergraduate programs in public universities is a complex task in today’s reality, particularly due to limited access to free and authentic materials. This paper describes a game localization teaching experience at the undergraduate level with special focus on how to handle the linguistic assets of the online race game SuperTuxKart, while trying to shed some light on potential translation requirements of entertainment software and its incorporation into translation programs. KEYWORDS: video game localization; video game translation; translator training; translation undergraduate program; SuperTuxKart. RESUMO: A despeito do significativo crescimento da indústria de localização de games nos últimos anos, os currículos dos cursos de graduação em tradução ainda carecem de formação específica na localização de games, geralmente não oferecendo ao tradutor em formação alternativas outras senão a de adquirir informalmente, ou em comunidades on- line de gamers, os conhecimentos sobre a tradução desse tipo de material.
    [Show full text]
  • Omegat CAT- Aloittelijoille Kirjoittaneet Susan Welsh & Marc Prior Suomentanut Taija Salo 1
    OmegaT CAT- aloittelijoille kirjoittaneet Susan Welsh & Marc Prior Suomentanut Taija Salo 1 Tekijänoikeus Tekijänoikeus © 2009 Susan Welsh ja Marc Prior Tätä dokumenttia saa kopioida, levittää ja/tai muokata Free Software Foundationin julkaiseman GNU Free Documentation Licensen version 1.2 tai minkä hyvänsä myöhemmän version mukaisesti; vakiolukuja (”Invariant Sections”), etukansitekstejä (”Front Cover Texts”) ja takakansitekstejä ("Back- Cover Texts") ei saa käyttää. Kopio lisenssistä sisältyy osaan nimeltä "GNU Free Documentation License". Kannen kuva on peräisin osoitteesta www.freeclipartnow.com ja on asetettu vapaasti yleiseen käyttöön. Viimeisin päivitys tehty tammikuussa 2009. Viittaa OmegaT:n versioon 2.0.0. Kuvakaappaukset OmegaT:n versioista 1.6.0 ja 2.0.0. 2 Sisällysluettelo Tekijänoikeus......................................................................................................1 Esittely................................................................................................................3 Kenelle opas on suunnattu.......................................................................3 Mikä on CAT-työkalu, ja mitä hyötyä niistä on?......................................3 1. OpenOffice.orgin lataaminen..........................................................................3 2. OmegaT:n lataaminen.....................................................................................4 3. OmegaT:n asentaminen..................................................................................4 4. OmegaT:n käyttöliittymä................................................................................4
    [Show full text]
  • Translate's Localization Guide
    Translate’s Localization Guide Release 0.9.0 Translate Jun 26, 2020 Contents 1 Localisation Guide 1 2 Glossary 191 3 Language Information 195 i ii CHAPTER 1 Localisation Guide The general aim of this document is not to replace other well written works but to draw them together. So for instance the section on projects contains information that should help you get started and point you to the documents that are often hard to find. The section of translation should provide a general enough overview of common mistakes and pitfalls. We have found the localisation community very fragmented and hope that through this document we can bring people together and unify information that is out there but in many many different places. The one section that we feel is unique is the guide to developers – they make assumptions about localisation without fully understanding the implications, we complain but honestly there is not one place that can help give a developer and overview of what is needed from them, we hope that the developer section goes a long way to solving that issue. 1.1 Purpose The purpose of this document is to provide one reference for localisers. You will find lots of information on localising and packaging on the web but not a single resource that can guide you. Most of the information is also domain specific ie it addresses KDE, Mozilla, etc. We hope that this is more general. This document also goes beyond the technical aspects of localisation which seems to be the domain of other lo- calisation documents.
    [Show full text]
  • Open-Source Tools - Freie Translationstechnologie Für Übersetzer
    Peter Sandrini Universität Innsbruck Open-Source Tools - Freie Translationstechnologie für Übersetzer Mit freier Software kostenlos zum Erfolg 27.9.2014 Überblick I. Was ist freie Software? II. Freie Translationstechnologie III.Freie Translationstechnologie im Set USBTrans und tuxtrans IV.Arbeitsabläufe eines Übersetzers ✔ Übersetzen eines Word-Dokumentes mit TM-Support ✔ Übersetzen einer Website mit TM-Support ✔ Maschinenübersetzung nutzen ✔ Erstellen eines TM anhand vorhandener Übersetzungen ✔ Qualitätsprüfung ✔ Verwalten eigener Terminologiesammlungen ✔ Terminologieextraktion ✔ Referenzkorpus erstellen ✔ Dateiformate konvertieren ... 27.9.2014 Mit Technologie kostenlos zum Erfolg 2 frei ? Open-Source? Freiheit zum – benutzen Lizenzen: – anpassen ● GNU GPL ● – weitergeben Apache License 2.0 ● BSD 2/3 – verbessern ● (L)PGL ● MIT license ● Mozilla Public License 2.0 ● Eclipse Public License ● Creative Commons 27.9.2014 Mit Technologie kostenlos zum Erfolg 3 27.9.2014 Mit Technologie kostenlos zum Erfolg 4 Warum eigentlich? ● Kostenschonenden Karrierebeginn ermöglichen ● erleichterte Kooperation ● Vermeiden von Urheberrechtsverletzungen durch Softwarepiraterie ● volle Kontrolle über eigenen PC ● Einfaches Handling ohne Lizenz oder Aktivierungscode ● Beteiligung an Entwicklercommunities ● macht aus (abhängigen) Konsumenten (eigenverantwortliche) Akteure 27.9.2014 Mit Technologie kostenlos zum Erfolg 5 TEnTs ● Translation Environment Tools (TenT) umfassende Applikation für Übersetzer mit allen nötigen Hilfsmitteln ● Oberbegriff für „Translation-Memory-
    [Show full text]
  • Translate Toolkit Documentation Release 2.0.0
    Translate Toolkit Documentation Release 2.0.0 Translate.org.za Sep 01, 2017 Contents 1 User’s Guide 3 1.1 Features..................................................3 1.2 Installation................................................4 1.3 Converters................................................6 1.4 Tools................................................... 57 1.5 Scripts.................................................. 96 1.6 Use Cases................................................. 107 1.7 Translation Related File Formats..................................... 124 2 Developer’s Guide 155 2.1 Translate Styleguide........................................... 155 2.2 Documentation.............................................. 162 2.3 Building................................................. 165 2.4 Testing.................................................. 166 2.5 Command Line Functional Testing................................... 168 2.6 Contributing............................................... 170 2.7 Translate Toolkit Developers Guide................................... 172 2.8 Making a Translate Toolkit Release................................... 176 2.9 Deprecation of Features......................................... 181 3 Additional Notes 183 3.1 Release Notes.............................................. 183 3.2 Changelog................................................ 246 3.3 History of the Translate Toolkit..................................... 254 3.4 License.................................................. 256 4 API Reference 257 4.1
    [Show full text]
  • FOSS L10n Guide
    Some of the content of this book is based on material from the website of the Translate project: http://translate.sourceforge.net/ ( منال حسن ) Project conception and coordination Manal Hassan Dwayne Bailey Author Friedel Wolff Additional content Samuel Murray – terminology transliteration – ( خالد حسني ) Khaled Hosny Proofreading and feedback Hermien Bos Kenneth Nielsen Illustration and covers Heather Bailey French translation Mohomodou Houssouba La localisation au service d’un changement durable Claude Paroz ( خالد حسني ) Arabic translation Khaled Hosny ( أحمد غربية ) Ahmad Gharbeia إحداث التغيير بتوطين المعلوماتية Spanish translation Lucía Morado Vázquez La localización al servicio de un cambio verdadero Silvia Rodríguez Vázquez Funding International Development Research Centre (IDRC) © 2011 Translate.org.za This book is licensed under the Creative Commons licence called ªAttribution Non- Commercial Share Alikeº: http://creativecommons.org/licenses/by-nc-sa/3.0 Introduction Computers are amazing tools. They can help us in so many ways, yet we are often frustrated when they don't work the way we want them to. We are frequently in interaction with technology where we change the way they work, and they change the way we work. A lot of this change is good, but we are not always aware how it changes us. A lot of technologies we work with often, such as software, web sites and cell phones, contain text which is in some language or another. Today, it is often in one of the major languages of the world, such as English or French. Those of us who understand the language of the technology are able to use it, while other people might be entirely locked out, or unable to fully enjoy the benefits that the technology brings.
    [Show full text]
  • Pipenightdreams Osgcal-Doc Mumudvb Mpg123-Alsa Tbb
    pipenightdreams osgcal-doc mumudvb mpg123-alsa tbb-examples libgammu4-dbg gcc-4.1-doc snort-rules-default davical cutmp3 libevolution5.0-cil aspell-am python-gobject-doc openoffice.org-l10n-mn libc6-xen xserver-xorg trophy-data t38modem pioneers-console libnb-platform10-java libgtkglext1-ruby libboost-wave1.39-dev drgenius bfbtester libchromexvmcpro1 isdnutils-xtools ubuntuone-client openoffice.org2-math openoffice.org-l10n-lt lsb-cxx-ia32 kdeartwork-emoticons-kde4 wmpuzzle trafshow python-plplot lx-gdb link-monitor-applet libscm-dev liblog-agent-logger-perl libccrtp-doc libclass-throwable-perl kde-i18n-csb jack-jconv hamradio-menus coinor-libvol-doc msx-emulator bitbake nabi language-pack-gnome-zh libpaperg popularity-contest xracer-tools xfont-nexus opendrim-lmp-baseserver libvorbisfile-ruby liblinebreak-doc libgfcui-2.0-0c2a-dbg libblacs-mpi-dev dict-freedict-spa-eng blender-ogrexml aspell-da x11-apps openoffice.org-l10n-lv openoffice.org-l10n-nl pnmtopng libodbcinstq1 libhsqldb-java-doc libmono-addins-gui0.2-cil sg3-utils linux-backports-modules-alsa-2.6.31-19-generic yorick-yeti-gsl python-pymssql plasma-widget-cpuload mcpp gpsim-lcd cl-csv libhtml-clean-perl asterisk-dbg apt-dater-dbg libgnome-mag1-dev language-pack-gnome-yo python-crypto svn-autoreleasedeb sugar-terminal-activity mii-diag maria-doc libplexus-component-api-java-doc libhugs-hgl-bundled libchipcard-libgwenhywfar47-plugins libghc6-random-dev freefem3d ezmlm cakephp-scripts aspell-ar ara-byte not+sparc openoffice.org-l10n-nn linux-backports-modules-karmic-generic-pae
    [Show full text]
  • Reciprocal Enrichment Between Basque Wikipedia and Machine Translation
    Reciprocal Enrichment Between Basque Wikipedia and Machine Translation Inaki˜ Alegria, Unai Cabezon, Unai Fernandez de Betono,˜ Gorka Labaka, Aingeru Mayor, Kepa Sarasola and Arkaitz Zubiaga Abstract In this chapter, we define a collaboration framework that enables Wikipe- dia editors to generate new articles while they help development of Machine Trans- lation (MT) systems by providing post-edition logs. This collaboration framework was tested with editors of Basque Wikipedia. Their post-editing of Computer Sci- ence articles has been used to improve the output of a Spanish to Basque MT system called Matxin. For the collaboration between editors and researchers, we selected a set of 100 articles from the Spanish Wikipedia. These articles would then be used as the source texts to be translated into Basque using the MT engine. A group of volunteers from Basque Wikipedia reviewed and corrected the raw MT translations. This collaboration ultimately produced two main benefits: (i) the change logs that would potentially help improve the MT engine by using an automated statistical post-editing system , and (ii) the growth of Basque Wikipedia. The results show that this process can improve the accuracy of an Rule Based MT (RBMT) system in nearly 10% benefiting from the post-edition of 50,000 words in the Computer Inaki˜ Alegria Ixa Group, University of the Basque Country UPV/EHU, e-mail: [email protected] Unai Cabezon Ixa Group, University of the Basque Country, e-mail: [email protected] Unai Fernandez de Betono˜ Basque Wikipedia and University of the Basque Country, e-mail: [email protected] Gorka Labaka Ixa Group, University of the Basque Country, e-mail: [email protected] Aingeru Mayor Ixa Group, University of the Basque Country, e-mail: [email protected] Kepa Sarasola Ixa Group, University of the Basque CountryU, e-mail: [email protected] Arkaitz Zubiaga Basque Wikipedia and Queens College, CUNY, CS Department, Blender Lab, New York, e-mail: [email protected] 1 2 Alegria et al.
    [Show full text]