2009 MATI Annual Conference

2009 MATI Annual Conference Dierk Seeburg IcoText

1 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Computer-Assisted Translation Tools

Dierk Seeburg IcoText

(Presentation available at icotext.com in a few days)

(CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

3 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Agenda

• Intro • PM, TMS, L18n • History • Terminology • ITEs • MT • Questions • Questions • Break

4 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Purpose

•Intro •Overview (incomplete) •Decision Guide

5 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Dierk Seeburg IcoText

6 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Marburg

7 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Dierk Seeburg IcoText

8 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Dierk Seeburg IcoText

9 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Phoenix

10 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Dierk Seeburg IcoText

11 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Dierk Seeburg IcoText

12 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you a freelance translator?

13 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you a freelance interpreter?

14 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you work for an LSP?

15 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you work in-house?

16 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Glossary

•G11n = Globalization •I18n = Internationalization •L10n = Localization •T9n = Translation •ITE = Integrated Translation Environment •TMS = Translation Management System •MTS = Machine Translation System •WCMS = Web Content Management System •GMS = Globalization Management System

17 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translators and Technology • Translators & • 300,000 interpreters worldwide • Translators regularly • 30-35,000 using Internet

• Translators using • 10-15,000 translation memory technology Source: Jeff Allen

18 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference History

"Disdain on the side of the professional translators for the hilarious and stupid MT mistakes gave birth to a new variant of MT called translation memory (TM). TM started off as a lower-level feature of commercial MT systems (...). But the success of TM came with dedicated products such as IBM TM/2 and Trados. The marketing message was tuned in to what the professional translation industry wanted to hear: "Forget about MT; it doesn't work well. Instead, use our TM product because it leaves you in full control of the process." Jaap van der Meer in MultiLingual 71 (2005)

19 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference History

"The message worked well, within a period of 10 to 15 years, TM products have found their way to the workstations of more than 50,000 translators in the world. But the message had also caused a "cognitive disorder" in the translation industry, namely that TM is good and MT is evil, foregoing the fact that TM is just a new variant of MT (...). The damage is done, however, and it will take years to convince the community of business translators that post-editing fuzzy matches from TM databases is, in fact, not different from post-editing fuzzy matches from any other MT system." Jaap van der Meer in MultiLingual 71 (2005)

20 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Development

•Market demands •Higher effciencies •Evolutionary approach: – Letter – FAX – PC – Internet – Translation memory technology •Spiral goes up: what's next?

21 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you have more than 1 year experience?

22 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you have more than 5 years experience?

23 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you have more than 10 years experience?

24 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you have more than 15 years experience?

25 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Do you have more than 20 years experience?

26 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation Process

•Transfer •Conversion •Import •Pretranslation •Translation •Editing •QA •Terminology Management •Back-Conversion/-Transfer

27 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference CAT Tools

•Computer Assisted Translation tools: – All tools that aid in the translation process

28 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference CAT Tools – Tool Cat's • Integrated Translation Environments (ITEs) • Translation Management System (TMS): GlobalSight, LTC Organiser, Translation Offce 3000, Vertimus, ]project-open[ • Term extraction tools: Xerox XTS, LogiTerm, TermExtract, SDLPhraseFinder, Okapi • Aligners: AlignFactory, NoBabel AutoAligner • Term Management tools: Bitext2tmx, Olifant • Concordancers: AntConc, WebCONC, Lucon • Word count tools: PractiCount, AnyCount • Localization tools: Catalyst, Sisulizer, Lingobit • Machine translation programs: Apertium, Moses, OpenLogos • Lookup tools: europeana.eu, perseus.tufts.edu, dict.leo.org, dict.cc

29 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Have you used CAT Tools in the past?

30 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you currently using a CAT Tool?

31 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on using a CAT Tool in the future?

32 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITE Categories

•Tools with MS Word as their main translation interface •Tools with independent translation interface •Tools for translating online

33 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation Memories

The 3 different technologies behind translation memory: •Reference-based matching on segment level (Star Transit) •Database-based matching on segment level (most tools) •Database-based matching on sub-segment level (Multitrans, Logiterm, Find)

34 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: File Types

XLIFF, HTML/XHTML, XML, SGML, SVG, PDF, Plain Text, OpenOffce, StarOffce, AbiWord, Corel WordPerfect, CorelDraw, PO/POT (GNU ), CSV, TSV, ASP, PHP, Adobe FrameMaker (MIF), Adobe InDesign (INX), Adobe PageMaker, Adobe Illustrator, Adobe Photoshop, Interleaf /Quicksilver, XGate for QuarkXPress, AutoCAD, DocBook, DITA, Java Properties, JavaScript, RTF, Trados Tagged RTF, Trados TTX, MS Offce 2003 XML, MS Offce 2007, MS Visio, MS Publisher, ResX (Windows .NET Resources), RC (Windows C/C++ Resources), TMX, TBX, XLIFF, SRX, xml:tm, W3C ITS

35 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: Standards

• XLIFF • Orgs: • TMX – ISO • TBX – LISA – • SRX OASIS – ATA • xml:tm – FIT • ITS – ITI

36 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Obstacles: Other

 Interoperability  User Interface  Integration

37 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Strategies

•SWOT •Six Questions •See-Think-Draw – Situation – Goals – Route

38 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SWOT

39 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SWOT

•Strengths: attributes of the organization that are helpful to achieving the objective. •Weaknesses: attributes of the organization that are harmful to achieving the objective. •Opportunities: external conditions that are helpful to achieving the objective. •Threats: external conditions which could do damage to the business's performance. Source: Wikipedia

40 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Six Questions

•Who •What •Where •When •Why •How

41 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with MS Word as their main translation interface • Anaphraseus (http://anaphraseus.sourceforge.net/) • Anymem (http://www.anymem.com/) • MetaTexis (http://www.metatexis.com) • MultiTrans (http://www.multicorpora.com) • Similis (http://www.lingua-et-machina.com) • Snowball (http://www.trmem.com) • TinyTM (http://tinytm.sourceforge.net/) • Trados (http://www.trados.com) • Wordfast (http://www.wordfast.net) • WordFisher (http://www.wordfsher.com)

42 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Anaphraseus

43 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Anymem

44 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MetaTexis

45 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MultiTrans

46 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Snowball

47 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference TinyTM

48 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trados

49 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Wordfast

50 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Wordfsher

51 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with independent translation interface • Across (http://www.across.net) • AppleTrans (http://www.apple.com/) • Cafetran (http://www.cafetran.republika.pl/) • Catalyst (http://www.alchemysoftware.ie/) • Déjà Vu (http://www.atril.com/) • Esperantilo (http://www.esperantilo.org/tm/index.html) • ForeignDesk (http://www.foreigndesk.net/) • Fusion (http://www.jivefusiontech.com/) • Heartsome (http://www.heartsome.net/) • Kbabel (http://kbabel.kde.org/) • LogiTerm (http://www.terminotix.com/)

52 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Across

53 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference AppleTrans

54 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Cafetran

55 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Catalyst

56 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Déjà Vu

57 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference EsperantiloTM

58 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ForeignDesk

59 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Fusion

60 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Heartsome

61 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference KBabel

62 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference LogiTerm

63 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools with independent translation interface • MemoQ (http://www.kilgray.com/) • OmegaT (http://www.omegat.org/) • Open Language Editor (http://open-language-tools.dev.java.net/) • SDL Trados (http://www.sdlx.com/) • Swordfsh (http://www.maxprograms.com/) • Sisulizer (http://www.sisulizer.com/) • Transit (http://www.star-transit.com/) • Transolution (http://transolution.python-hosting.com/) • Virtaal (http://translate.sourceforge.net/wiki/virtaal/index) • WordForge (http://www.khmeros.info/drupal/? q=en/download/Translation_Editor)

64 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference MemoQ

65 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference OmegaT

66 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Open Language Tools

67 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference SDL Trados

68 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Swordfsh

69 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Transolution

70 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Virtaal

71 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Tools for translating online

•Lingotek (http://www.lingotek.com/) •Logoport (http://www.lionbridge.com/) •Pootle (http://translate.sourceforge.net/wiki/po otle/)

72 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Break

73 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITE Features

•Leveraging translated content •Building and maintaining terminology databases •Work in complex fle formats •Methods for analysis •Quality assurance •Project management capabilities

74 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – The Similarities

•Freely confgurable terminology component •Unicode and TMX support (with caveats) •Alignment features, e.g. – AlignFactory – NoBabel AutoAligner •Similar range of supported fle formats (s. Wikipedia) •Concordance searches •Analysis features

75 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Batch vs. individual fle processing •Terminology handling •Support for competing formats / TMs / termbases •Support for co-workers •Quality assurance •Remote connectivity •Code handling •Support and Training

76 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Batch vs. individual fle processing

•Déjà Vu vs. Trados

77 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Terminology handling

•Transit vs. Déjà Vu vs. Wordfast

78 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Quality assurance

•Trados vs. MemoQ

79 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Remote connectivity

•Wordfast vs. Lingotek

80 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Support and Training

•Purchase = frst step only

•Compare to car maintenance

81 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference ITEs – Differentiators

•Support and Training –Tool Vendor Contract –Consultant –College classes –ATA seminars –Jost's Translators Toolkit –Websites: translatorstraining.com, proz.com, translatorscafe.com

82 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Terminology – Prejudices

•TM tool = emphasis is on the translation memory ? •More immediate gain: perfect and fuzzy matches •TMs = easily built up by aligning existing translated fle pairs/texts and/or automatically by translating new texts •Construction of terminology databases = tedious

83 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Terminology – The Reality

•The terminology database = investment – defning words and phrases • Grammatically • Contextually • by contrast •"Living dictionaries" for each segment translated •Term entries contain meta data relevance •Consistency throughout workgroups

84 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation and Technology: Machine Translation •Statistically-based machine translation (SBMT) •Rules-based machine translation (RBMT) •Example-based machine translation (EBMT) •Human-assisted machine translation (HAMT) •Hybrid machine translation (HMT)

85 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Translation and Technology: Machine Translation • Statistically-based machine translation works (sort of) • Rules-based approach does not work on its own • Translation can only be evaluated by its effectiveness, not by some elusive goal of "perfection" • Amount of relevant data can not be translated by humans • Microsoft • Symantec • Systran • AppTek • Promt • Google 86 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

87 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

88 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on using an ITE in the future?

89 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on using a TMS in the future?

90 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on using a GMS in the future?

91 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on using an MTS in the future?

92 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trends •Consolidation •Machine Translation (SMT, RBMT, EBMT, HAMT) •Standards (XLIFF, TMX, TBX, SRX) •Translation Management (CMS, GMS) •Groups (Yahoo, Google, LinkedIn) •Blogs (LTD, ThoughtsOnT9n, Masked T9r) •Translators Sites (ProZ, Translators Cafe, Aquarius) •Social Sites (Ning, Facebook, Twitter) •Instant Messaging (SMS, Skype, GTalk)

93 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Trends •Desktop to Server •Server to Web •Web to Mobile •Pretranslation •Sub-sentence Fuzzy Matching •UX = User Experience •Integration •Collaboration •Open Source

94 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Have you attended the ATA Translation Tools Forum in the past?

95 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Survey

•Are you planning on attending the ATA Translation Tools Forum in the future?

96 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Take-Home

•Don't buy a golden hammer •Subscribe to LTD News

97 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Take-Home

98 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference Acknowledgements

Jost Zetzsche, Rosalie Wells Di Wu & MATI ATA Prof'l. Dev. Committee ATA LT Division ATA staff, volunteers You

99 (CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Thank you! See you in New York in October! Download slides at .

(CC) Creative Commons. Some Rights Reserved September 12, 2009 2009 MATI Annual Conference

Q & A

(CC) Creative Commons. Some Rights Reserved September 12, 2009