2009 MATI Annual Conference
2009 MATI Annual Conference Dierk Seeburg IcoText
1 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Computer-Assisted Translation Tools
Dierk Seeburg IcoText
(Presentation available at icotext.com in a few days)
(CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
3 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Agenda
• Intro • PM, TMS, L18n • History • Terminology • ITEs • MT • Questions • Questions • Break
4 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Purpose
•Intro •Overview (incomplete) •Decision Guide
5 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Dierk Seeburg IcoText
6 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Marburg
7 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Dierk Seeburg IcoText
8 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Dierk Seeburg IcoText
9 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Phoenix
10 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Dierk Seeburg IcoText
11 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Dierk Seeburg IcoText
12 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you a freelance translator?
13 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you a freelance interpreter?
14 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you work for an LSP?
15 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you work in-house?
16 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Glossary
•G11n = Globalization •I18n = Internationalization •L10n = Localization •T9n = Translation •ITE = Integrated Translation Environment •TMS = Translation Management System •MTS = Machine Translation System •WCMS = Web Content Management System •GMS = Globalization Management System
17 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translators and Technology • Translators & • 300,000 interpreters worldwide • Translators regularly • 30-35,000 using Internet
• Translators using • 10-15,000 translation memory technology Source: Jeff Allen
18 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference History
"Disdain on the side of the professional translators for the hilarious and stupid MT mistakes gave birth to a new variant of MT called translation memory (TM). TM started off as a lower-level feature of commercial MT systems (...). But the success of TM came with dedicated products such as IBM TM/2 and Trados. The marketing message was tuned in to what the professional translation industry wanted to hear: "Forget about MT; it doesn't work well. Instead, use our TM product because it leaves you in full control of the process." Jaap van der Meer in MultiLingual 71 (2005)
19 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference History
"The message worked well, within a period of 10 to 15 years, TM products have found their way to the workstations of more than 50,000 translators in the world. But the message had also caused a "cognitive disorder" in the translation industry, namely that TM is good and MT is evil, foregoing the fact that TM is just a new variant of MT (...). The damage is done, however, and it will take years to convince the community of business translators that post-editing fuzzy matches from TM databases is, in fact, not different from post-editing fuzzy matches from any other MT system." Jaap van der Meer in MultiLingual 71 (2005)
20 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Development
•Market demands •Higher effciencies •Evolutionary approach: – Letter – FAX – PC – Internet – Translation memory technology •Spiral goes up: what's next?
21 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you have more than 1 year experience?
22 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you have more than 5 years experience?
23 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you have more than 10 years experience?
24 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you have more than 15 years experience?
25 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Do you have more than 20 years experience?
26 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation Process
•Transfer •Conversion •Import •Pretranslation •Translation •Editing •QA •Terminology Management •Back-Conversion/-Transfer
27 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference CAT Tools
•Computer Assisted Translation tools: – All tools that aid in the translation process
28 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference CAT Tools – Tool Cat's • Integrated Translation Environments (ITEs) • Translation Management System (TMS): GlobalSight, LTC Organiser, Translation Offce 3000, Vertimus, ]project-open[ • Term extraction tools: Xerox XTS, LogiTerm, TermExtract, SDLPhraseFinder, Okapi • Aligners: AlignFactory, NoBabel AutoAligner • Term Management tools: Bitext2tmx, Olifant • Concordancers: AntConc, WebCONC, Lucon • Word count tools: PractiCount, AnyCount • Localization tools: Catalyst, Sisulizer, Lingobit • Machine translation programs: Apertium, Moses, OpenLogos • Lookup tools: europeana.eu, perseus.tufts.edu, dict.leo.org, dict.cc
29 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Have you used CAT Tools in the past?
30 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you currently using a CAT Tool?
31 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on using a CAT Tool in the future?
32 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITE Categories
•Tools with MS Word as their main translation interface •Tools with independent translation interface •Tools for translating online
33 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation Memories
The 3 different technologies behind translation memory: •Reference-based matching on segment level (Star Transit) •Database-based matching on segment level (most tools) •Database-based matching on sub-segment level (Multitrans, Logiterm, Find)
34 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: File Types
XLIFF, HTML/XHTML, XML, SGML, SVG, PDF, Plain Text, OpenOffce, StarOffce, AbiWord, Corel WordPerfect, CorelDraw, PO/POT (GNU Gettext), CSV, TSV, ASP, PHP, Adobe FrameMaker (MIF), Adobe InDesign (INX), Adobe PageMaker, Adobe Illustrator, Adobe Photoshop, Interleaf /Quicksilver, XGate for QuarkXPress, AutoCAD, DocBook, DITA, Java Properties, JavaScript, RTF, Trados Tagged RTF, Trados TTX, MS Offce 2003 XML, MS Offce 2007, MS Visio, MS Publisher, ResX (Windows .NET Resources), RC (Windows C/C++ Resources), TMX, TBX, XLIFF, SRX, xml:tm, W3C ITS
35 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: Standards
• XLIFF • Orgs: • TMX – ISO • TBX – LISA – • SRX OASIS – ATA • xml:tm – FIT • ITS – ITI
36 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: Other
Interoperability User Interface Integration
37 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Strategies
•SWOT •Six Questions •See-Think-Draw – Situation – Goals – Route
38 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SWOT
39 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SWOT
•Strengths: attributes of the organization that are helpful to achieving the objective. •Weaknesses: attributes of the organization that are harmful to achieving the objective. •Opportunities: external conditions that are helpful to achieving the objective. •Threats: external conditions which could do damage to the business's performance. Source: Wikipedia
40 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Six Questions
•Who •What •Where •When •Why •How
41 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with MS Word as their main translation interface • Anaphraseus (http://anaphraseus.sourceforge.net/) • Anymem (http://www.anymem.com/) • MetaTexis (http://www.metatexis.com) • MultiTrans (http://www.multicorpora.com) • Similis (http://www.lingua-et-machina.com) • Snowball (http://www.trmem.com) • TinyTM (http://tinytm.sourceforge.net/) • Trados (http://www.trados.com) • Wordfast (http://www.wordfast.net) • WordFisher (http://www.wordfsher.com)
42 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Anaphraseus
43 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Anymem
44 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MetaTexis
45 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MultiTrans
46 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Snowball
47 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference TinyTM
48 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trados
49 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Wordfast
50 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Wordfsher
51 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with independent translation interface • Across (http://www.across.net) • AppleTrans (http://www.apple.com/) • Cafetran (http://www.cafetran.republika.pl/) • Catalyst (http://www.alchemysoftware.ie/) • Déjà Vu (http://www.atril.com/) • Esperantilo (http://www.esperantilo.org/tm/index.html) • ForeignDesk (http://www.foreigndesk.net/) • Fusion (http://www.jivefusiontech.com/) • Heartsome (http://www.heartsome.net/) • Kbabel (http://kbabel.kde.org/) • LogiTerm (http://www.terminotix.com/)
52 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Across
53 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference AppleTrans
54 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Cafetran
55 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Catalyst
56 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Déjà Vu
57 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference EsperantiloTM
58 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ForeignDesk
59 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Fusion
60 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Heartsome
61 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference KBabel
62 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference LogiTerm
63 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with independent translation interface • MemoQ (http://www.kilgray.com/) • OmegaT (http://www.omegat.org/) • Open Language Editor (http://open-language-tools.dev.java.net/) • SDL Trados (http://www.sdlx.com/) • Swordfsh (http://www.maxprograms.com/) • Sisulizer (http://www.sisulizer.com/) • Transit (http://www.star-transit.com/) • Transolution (http://transolution.python-hosting.com/) • Virtaal (http://translate.sourceforge.net/wiki/virtaal/index) • WordForge (http://www.khmeros.info/drupal/? q=en/download/Translation_Editor)
64 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MemoQ
65 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference OmegaT
66 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Open Language Tools
67 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SDL Trados
68 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Swordfsh
69 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Transolution
70 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Virtaal
71 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools for translating online
•Lingotek (http://www.lingotek.com/) •Logoport (http://www.lionbridge.com/) •Pootle (http://translate.sourceforge.net/wiki/po otle/)
72 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Break
73 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITE Features
•Leveraging translated content •Building and maintaining terminology databases •Work in complex fle formats •Methods for analysis •Quality assurance •Project management capabilities
74 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – The Similarities
•Freely confgurable terminology component •Unicode and TMX support (with caveats) •Alignment features, e.g. – AlignFactory – NoBabel AutoAligner •Similar range of supported fle formats (s. Wikipedia) •Concordance searches •Analysis features
75 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Batch vs. individual fle processing •Terminology handling •Support for competing formats / TMs / termbases •Support for co-workers •Quality assurance •Remote connectivity •Code handling •Support and Training
76 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Batch vs. individual fle processing
•Déjà Vu vs. Trados
77 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Terminology handling
•Transit vs. Déjà Vu vs. Wordfast
78 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Quality assurance
•Trados vs. MemoQ
79 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Remote connectivity
•Wordfast vs. Lingotek
80 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Support and Training
•Purchase = frst step only
•Compare to car maintenance
81 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators
•Support and Training –Tool Vendor Contract –Consultant –College classes –ATA seminars –Jost's Translators Toolkit –Websites: translatorstraining.com, proz.com, translatorscafe.com
82 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Terminology – Prejudices
•TM tool = emphasis is on the translation memory ? •More immediate gain: perfect and fuzzy matches •TMs = easily built up by aligning existing translated fle pairs/texts and/or automatically by translating new texts •Construction of terminology databases = tedious
83 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Terminology – The Reality
•The terminology database = investment – defning words and phrases • Grammatically • Contextually • by contrast •"Living dictionaries" for each segment translated •Term entries contain meta data relevance •Consistency throughout workgroups
84 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation and Technology: Machine Translation •Statistically-based machine translation (SBMT) •Rules-based machine translation (RBMT) •Example-based machine translation (EBMT) •Human-assisted machine translation (HAMT) •Hybrid machine translation (HMT)
85 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation and Technology: Machine Translation • Statistically-based machine translation works (sort of) • Rules-based approach does not work on its own • Translation can only be evaluated by its effectiveness, not by some elusive goal of "perfection" • Amount of relevant data can not be translated by humans • Microsoft • Symantec • Systran • AppTek • Promt • Google 86 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
87 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
88 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on using an ITE in the future?
89 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on using a TMS in the future?
90 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on using a GMS in the future?
91 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on using an MTS in the future?
92 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trends •Consolidation •Machine Translation (SMT, RBMT, EBMT, HAMT) •Standards (XLIFF, TMX, TBX, SRX) •Translation Management (CMS, GMS) •Groups (Yahoo, Google, LinkedIn) •Blogs (LTD, ThoughtsOnT9n, Masked T9r) •Translators Sites (ProZ, Translators Cafe, Aquarius) •Social Sites (Ning, Facebook, Twitter) •Instant Messaging (SMS, Skype, GTalk)
93 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trends •Desktop to Server •Server to Web •Web to Mobile •Pretranslation •Sub-sentence Fuzzy Matching •UX = User Experience •Integration •Collaboration •Open Source
94 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Have you attended the ATA Translation Tools Forum in the past?
95 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey
•Are you planning on attending the ATA Translation Tools Forum in the future?
96 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Take-Home
•Don't buy a golden hammer •Subscribe to LTD News
97 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Take-Home
98 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Acknowledgements
Jost Zetzsche, Rosalie Wells Di Wu & MATI ATA Prof'l. Dev. Committee ATA LT Division ATA staff, volunteers You
99 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Thank you! See you in New York in October! Download slides at
(CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference
Q & A
(CC) Creative Commons. Some Rights Reserved September 12, 2009