Final Study Report on CEF Automated Translation Value Proposition in the Context of the European LT Market/Ecosystem

Total Page:16

File Type:pdf, Size:1020Kb

Final Study Report on CEF Automated Translation Value Proposition in the Context of the European LT Market/Ecosystem Final study report on CEF Automated Translation value proposition in the context of the European LT market/ecosystem FINAL REPORT A study prepared for the European Commission DG Communications Networks, Content & Technology by: Digital Single Market CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report This study was carried out for the European Commission by Luc MEERTENS 2 Khalid CHOUKRI Stefania AGUZZI Andrejs VASILJEVS Internal identification Contract number: 2017/S 108-216374 SMART number: 2016/0103 DISCLAIMER By the European Commission, Directorate-General of Communications Networks, Content & Technology. The information and views set out in this publication are those of the author(s) and do not necessarily reflect the official opinion of the Commission. The Commission does not guarantee the accuracy of the data included in this study. Neither the Commission nor any person acting on the Commission’s behalf may be held responsible for the use which may be made of the information contained therein. ISBN 978-92-76-00783-8 doi: 10.2759/142151 © European Union, 2019. All rights reserved. Certain parts are licensed under conditions to the EU. Reproduction is authorised provided the source is acknowledged. 2 CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report CONTENTS Table of figures ................................................................................................................................................ 7 List of tables ................................................................................................................................................... 13 Definitions ..................................................................................................................................................... 15 Abbreviations ................................................................................................................................................. 16 Abstract ......................................................................................................................................................... 19 Résumé .......................................................................................................................................................... 19 Executive summary ........................................................................................................................................ 20 Sommaire exécutif ......................................................................................................................................... 27 1. Introduction ........................................................................................................................................... 34 2. Task 1: LT market analysis ...................................................................................................................... 37 2.1. Summary .............................................................................................................................................. 37 2.2. Introduction ......................................................................................................................................... 40 2.3. Overview of the LT Market .................................................................................................................. 43 2.3.1. Main players and innovators ...................................................................................................... 43 2.3.2. Current LT offering ...................................................................................................................... 43 2.3.3. Key market trends ....................................................................................................................... 47 2.3.4. Overview by vertical markets ..................................................................................................... 49 2.3.5. Overview by company size.......................................................................................................... 49 2.3.6. Overview by country ................................................................................................................... 50 2.3.7. Key findings ................................................................................................................................. 50 2.4. The LT market in Europe ...................................................................................................................... 51 2.4.1. Market size and forecast............................................................................................................. 51 2.4.2. Supply-side analysis .................................................................................................................... 54 2.4.3. Demand-side analysis ................................................................................................................. 62 2.4.4. Key findings ................................................................................................................................. 68 2.5. Conclusions .......................................................................................................................................... 70 3. Task 2: Competitiveness analysis ........................................................................................................... 71 3.1. Preface ................................................................................................................................................. 71 3.2. Methodology ....................................................................................................................................... 72 3.2.1. Market dimensions ..................................................................................................................... 72 3.2.2. SWOT analysis ............................................................................................................................. 72 3.3. Comparative position of the European LT market ............................................................................... 74 3.3.1. Comparative position of the European MT market .................................................................... 74 3 CEF AT value proposition in the context of the European LT market/ecosystem Final Study Report 3.3.2. Comparative position of the European speech technology market ........................................... 74 3.3.3. Comparative position of the European search market ............................................................... 75 3.4. Research .............................................................................................................................................. 76 3.4.1. Research in MT ........................................................................................................................... 76 3.4.2. Research in speech technologies ................................................................................................ 88 3.4.3. Research in search technologies ................................................................................................. 95 3.5. Innovation .......................................................................................................................................... 106 3.5.1. Innovation in machine translation ............................................................................................ 106 3.5.2. Innovation in speech technologies ........................................................................................... 109 3.5.3. Innovation in search technologies and services ....................................................................... 113 3.6. Investments ....................................................................................................................................... 121 3.6.1. Investments in machine translation ......................................................................................... 121 3.6.2. Investments in speech technologies ......................................................................................... 127 3.6.3. Investments in search technology and services ....................................................................... 130 3.7. Market dominance ............................................................................................................................ 133 3.7.1. Market dominance in machine translation ............................................................................... 134 3.7.2. Market dominance in speech technologies .............................................................................. 137 3.7.3. Market dominance in search technology and services ............................................................. 140 3.8. Industry .............................................................................................................................................. 146 3.8.1. Machine translation industry .................................................................................................... 146 3.8.2. Speech technology industry ...................................................................................................... 149 3.8.3. Search technology and service industry ................................................................................... 153 3.9. Infrastructure ....................................................................................................................................
Recommended publications
  • Evaluating Exploratory Search Engines : Designing a Set of User-Centered Methods Based on a Modeling of the Exploratory Search Process Emilie Palagi
    Evaluating exploratory search engines : designing a set of user-centered methods based on a modeling of the exploratory search process Emilie Palagi To cite this version: Emilie Palagi. Evaluating exploratory search engines : designing a set of user-centered methods based on a modeling of the exploratory search process. Human-Computer Interaction [cs.HC]. Université Côte d’Azur, 2018. English. NNT : 2018AZUR4116. tel-01976017v3 HAL Id: tel-01976017 https://tel.archives-ouvertes.fr/tel-01976017v3 Submitted on 20 Feb 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THÈSE DE DOCTORAT Evaluation des moteurs de recherche exploratoire : Elaboration d'un corps de méthodes centrées utilisateurs, basées sur une modélisation du processus de recherche exploratoire Emilie PALAGI Wimmics (Inria, I3S) et Data Science (EURECOM) Présentée en vue de l’obtention Devant le jury, composé de : du grade de docteur en Informatique Christian Bastien, Professeur, Université de Loraine d’Université Côte d’Azur Pierre De Loor, Professeur, ENIB Dirigée par : Fabien Gandon
    [Show full text]
  • Awareness and Utilization of Search Engines for Information Retrieval by Students of National Open University of Nigeria in Enugu Study Centre Library
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Library Philosophy and Practice (e-journal) Libraries at University of Nebraska-Lincoln 2020 Awareness and Utilization of Search Engines for Information Retrieval by Students of National Open University of Nigeria in Enugu Study Centre Library SUNDAY JUDE ONUH National Open University of Nigeria, [email protected] OGOEGBUNAM LOVETH EKWUEME National Open University of Nigeria, [email protected] Follow this and additional works at: https://digitalcommons.unl.edu/libphilprac Part of the Library and Information Science Commons ONUH, SUNDAY JUDE and EKWUEME, OGOEGBUNAM LOVETH, "Awareness and Utilization of Search Engines for Information Retrieval by Students of National Open University of Nigeria in Enugu Study Centre Library" (2020). Library Philosophy and Practice (e-journal). 4650. https://digitalcommons.unl.edu/libphilprac/4650 Awareness and Utilization of Search Engines for Information Retrieval by Students of National Open University of Nigeria in Enugu Study Centre Library By Jude Sunday Onuh Enugu Study Centre Library National Open University of Nigeria [email protected] & Loveth Ogoegbunam Ekwueme Department of Library and Information Science National Open University of Nigeria [email protected] Abstract This study dwelt on awareness and utilization of search engines for information retrieval by students of National Open University of Nigeria (NOUN) Enugu Study centre. Descriptive survey research was adopted for the study. Two research questions were drawn from the two purposes that guided the study. The population consists of 5855 undergraduate students of NOUN Enugu Study Centre. A sample size of 293 students was used as 5% of the entire population.
    [Show full text]
  • List of Brands
    Global Consumer 2019 List of Brands Table of Contents 1. Digital music 2 2. Video-on-Demand 4 3. Video game stores 7 4. Digital video games shops 11 5. Video game streaming services 13 6. Book stores 15 7. eBook shops 19 8. Daily newspapers 22 9. Online newspapers 26 10. Magazines & weekly newspapers 30 11. Online magazines 34 12. Smartphones 38 13. Mobile carriers 39 14. Internet providers 42 15. Cable & satellite TV provider 46 16. Refrigerators 49 17. Washing machines 51 18. TVs 53 19. Speakers 55 20. Headphones 57 21. Laptops 59 22. Tablets 61 23. Desktop PC 63 24. Smart home 65 25. Smart speaker 67 26. Wearables 68 27. Fitness and health apps 70 28. Messenger services 73 29. Social networks 75 30. eCommerce 77 31. Search Engines 81 32. Online hotels & accommodation 82 33. Online flight portals 85 34. Airlines 88 35. Online package holiday portals 91 36. Online car rental provider 94 37. Online car sharing 96 38. Online ride sharing 98 39. Grocery stores 100 40. Banks 104 41. Online payment 108 42. Mobile payment 111 43. Liability insurance 114 44. Online dating services 117 45. Online event ticket provider 119 46. Food & restaurant delivery 122 47. Grocery delivery 125 48. Car Makes 129 Statista GmbH Johannes-Brahms-Platz 1 20355 Hamburg Tel. +49 40 2848 41 0 Fax +49 40 2848 41 999 [email protected] www.statista.com Steuernummer: 48/760/00518 Amtsgericht Köln: HRB 87129 Geschäftsführung: Dr. Friedrich Schwandt, Tim Kröger Commerzbank AG IBAN: DE60 2004 0000 0631 5915 00 BIC: COBADEFFXXX Umsatzsteuer-ID: DE 258551386 1.
    [Show full text]
  • Gender and Information Literacy: Evaluation of Gender Differences in a Student Survey of Information Sources
    Gender and Information Literacy: Evaluation of Gender Differences in a Student Survey of Information Sources Arthur Taylor and Heather A. Dalal Information literacy studies have shown that college students use a va- riety of information sources to perform research and commonly choose Internet sources over traditional library sources. Studies have also shown that students do not appear to understand the information quality issues concerning Internet information sources and may lack the information literacy skills to make good choices concerning the use of these sources. No studies currently provide clear guidance on how gender might influ- ence the information literacy skills of students. Such guidance could help improve information literacy instruction. This study used a survey of college-aged students to evaluate a subset of student information literacy skills in relation to Internet information sources. Analysis of the data collected provided strong indications of gender differences in information literacy skills. Female respondents ap- peared to be more discerning than males in evaluating Internet sources. Males appeared to be more confident in the credibility and accuracy of the results returned by search engines. Evaluation of other survey responses strengthened our finding of gender differentiation in informa- tion literacy skills. ollege students today have come of age surrounded by a sea of information delivered from an array of sources available at their fingertips at any time of the day or night. Studies have shown that the most common source of information for college-aged students is the Internet. Information gathered in this environment is most likely found using a commercial search engine that returns sources of dubious quality using an unknown algorithm.
    [Show full text]
  • Context in Neural Machine Translation: a Review of Models and Evaluations
    Context in Neural Machine Translation: A Review of Models and Evaluations Andrei Popescu-Belis University of Applied Sciences of Western Switzerland (HES–SO) School of Management and Engineering Vaud (HEIG–VD) Route de Cheseaux 1, CP 521 1401 Yverdon-les-Bains, Switzerland [email protected] January 29, 2019 Abstract features when translating entire texts, especially when this is vital for correct translation. Taking textual con- This review paper discusses how context has been text into account1 means modeling long-range depen- used in neural machine translation (NMT) in the dencies between words, phrases, or sentences, which past two years (2017–2018). Starting with a brief are typically studied by linguistics under the topics of retrospect on the rapid evolution of NMT models, discourse and pragmatics. When it comes to transla- the paper then reviews studies that evaluate NMT tion, the capacity to model context may improve certain output from various perspectives, with emphasis on those analyzing limitations of the translation of translation decisions, e.g. by favoring a better lexical contextual phenomena. In a subsequent version, choice thanks to document-level topical information, the paper will then present the main methods that or by constraining pronoun choice thanks to knowledge were proposed to leverage context for improving about antecedents. translation quality, and distinguishes methods that This review paper puts into perspective the signif- aim to improve the translation of specific phenom- icant amount of studies devoted in 2017 and 2018 to ena from those that consider a wider unstructured improve the use of context in NMT and measure these context.
    [Show full text]
  • All About Airfares, Airlines & Airports
    Chapter One “Into the Wild Blue Yonder . .” ALL ABOUT AIRFARES, AIRLINES & AIRPORTS COPYRIGHTED MATERIAL 004_418499-ch01.indd4_418499-ch01.indd 1 22/25/09/25/09 110:24:170:24:17 PPMM 2 “Into the Wild Blue Yonder . .” For years, we have all exulted over the low cost of airfares. While every other expense of travel went up, up, and up, airfares remained stable or even declined. No longer. Not simply the high cost of fuel but the poor fi nancial condi- tion of the airlines has caused carriers to cut fl ights, eliminate destinations, end the discounts (or made them harder to get), and hike the average cost of transportation to really serious levels, making it a big and growing part of our total vacation expenses. So it has become utterly necessary to learn how to fi nd the occasional bargains. And that’s what I’ve attempted to do in this chapter. In alphabeti- cal order, from “A” to “F,” I’ve grouped all the sources of those lower fares and all the methods for fi nding and obtaining them. And then, under “P,” I’ve discussed the question of passenger rights, which relates to the quality of the fl ight experience. Aggregators NEXT TIME YOU’RE DETERMINED TO FIND THE LOWEST POSSIBLE AIRFARE, USE AN AGGREGATOR Aggregators are like Google—they search everything in sight. But unlike Google, which “orders” (that is, tampers with) its results, aggregators list the fares they fi nd in an untouched and strictly logical order. Th ey also claim to be totally comprehensive.
    [Show full text]
  • Linkedin Corporation 10-K
    10APR201419231192 April 2016 To Our Stockholders: In 2015, we delivered a strong year of innovation focused on further connecting our members and customers to opportunity. For members, we made significant progress by focusing on two core value propositions: staying connected and informed, and advancing members’ careers. • In December, we launched our re-imagined flagship mobile application, the culmination of a year-long focus to create a dramatically simplified core LinkedIn experience. Since launch, we have seen meaningful increases in feed engagement, messages sent, and content interaction. • With respect to careers, we spent much of the year working on fundamental building blocks including doubling the number of jobs on LinkedIn to more than six million, improving jobs relevance, and re-launching the jobs experience on the desktop. This work resulted in a significant increase in overall engagement with jobs throughout 2015 compared to 2014. For customers, we focused on innovating the core value drivers in each product line: • Within Hiring, we announced the re-launch of Recruiter, the first full refresh of our flagship product since its original launch. The new Recruiter will rollout to customers throughout 2016 alongside the new Referrals product. The goal is that these products will further strengthen and extend our competitive position within the talent acquisition space in the coming years. • Within Marketing Solutions, Sponsored Updates increasingly evolved into the core of our advertising business, contributing approximately half of total ad revenue in 2015. Digital marketing remains a fast-evolving and competitive landscape, evidenced by the significant decline in our display ad revenue in 2015. To that end, we believe our primary focus on Sponsored Updates will continue to make LinkedIn the most effective platform for marketers to engage professionals.
    [Show full text]
  • The Complete Guide to Social Media from the Social Media Guys
    The Complete Guide to Social Media From The Social Media Guys PDF generated using the open source mwlib toolkit. See http://code.pediapress.com/ for more information. PDF generated at: Mon, 08 Nov 2010 19:01:07 UTC Contents Articles Social media 1 Social web 6 Social media measurement 8 Social media marketing 9 Social media optimization 11 Social network service 12 Digg 24 Facebook 33 LinkedIn 48 MySpace 52 Newsvine 70 Reddit 74 StumbleUpon 80 Twitter 84 YouTube 98 XING 112 References Article Sources and Contributors 115 Image Sources, Licenses and Contributors 123 Article Licenses License 125 Social media 1 Social media Social media are media for social interaction, using highly accessible and scalable publishing techniques. Social media uses web-based technologies to turn communication into interactive dialogues. Andreas Kaplan and Michael Haenlein define social media as "a group of Internet-based applications that build on the ideological and technological foundations of Web 2.0, which allows the creation and exchange of user-generated content."[1] Businesses also refer to social media as consumer-generated media (CGM). Social media utilization is believed to be a driving force in defining the current time period as the Attention Age. A common thread running through all definitions of social media is a blending of technology and social interaction for the co-creation of value. Distinction from industrial media People gain information, education, news, etc., by electronic media and print media. Social media are distinct from industrial or traditional media, such as newspapers, television, and film. They are relatively inexpensive and accessible to enable anyone (even private individuals) to publish or access information, compared to industrial media, which generally require significant resources to publish information.
    [Show full text]
  • Translate's Localization Guide
    Translate’s Localization Guide Release 0.9.0 Translate Jun 26, 2020 Contents 1 Localisation Guide 1 2 Glossary 191 3 Language Information 195 i ii CHAPTER 1 Localisation Guide The general aim of this document is not to replace other well written works but to draw them together. So for instance the section on projects contains information that should help you get started and point you to the documents that are often hard to find. The section of translation should provide a general enough overview of common mistakes and pitfalls. We have found the localisation community very fragmented and hope that through this document we can bring people together and unify information that is out there but in many many different places. The one section that we feel is unique is the guide to developers – they make assumptions about localisation without fully understanding the implications, we complain but honestly there is not one place that can help give a developer and overview of what is needed from them, we hope that the developer section goes a long way to solving that issue. 1.1 Purpose The purpose of this document is to provide one reference for localisers. You will find lots of information on localising and packaging on the web but not a single resource that can guide you. Most of the information is also domain specific ie it addresses KDE, Mozilla, etc. We hope that this is more general. This document also goes beyond the technical aspects of localisation which seems to be the domain of other lo- calisation documents.
    [Show full text]
  • New Scientist
    APPY TALK It’s never too late to learn another language WHO’S IN THE MIRROR? How it feels to have someone else’s face COSMIC BRUISE Glimpse of another universe? WEEKLY October 31 - November 6, 2015 COMPUTER SAYS NO Your career is already in the hands of AI How civilization took root all over the world No3045 US$5.95 CAN$5.95 44 0 70989 30690 5 Science and technology news www.newscientist.com US jobs in science TRUST NO ONE Laws of physics keep your secrets safe from hackers Mind. Full. Give a gift that satisfies this holiday Subscribe and save up to 73% Visit newscientist.com/8308 or call 1-888-822-3242 and quote 8308 CONTENTS Volume 228 No 3045 This issue online newscientist.com/issue/3045 News Leader 5 Tech is closing language gaps, 8 with results we can’t predict Cosmic bruise News 6 UPFRONT Evidence we got 23andMe returns with new genetic test. bashed by another TB now world’s leading cause of death. universe? Flying through a Saturn moon plume 8 THIS WEEK Rosetta finds oxygen on comet 67P. Anaesthesia blunts emotional memories. Self-sacrificing cells use DNA as weapon. Bumblebees deliver pesticides. Asthma RUDI SEBASTIAN/PLAINPICTURERUDI drug rejuvenates brains. Dark matter may blow up stars 10 SPECIAL REPORT On the cover The plan to make meat without animals 16 IN BRIEF 40 Appy talk Neighbouring planet could be an illusion. 31 Learn another language King penguins face longer swims for food 28 Who’s in the mirror? Seeds of How it feels to have Technology revolution someone else’s face 8 Cosmic bruise 20 Your career is in the hands of recruitment AI.
    [Show full text]
  • Desarrollo De Una Nueva Versión De Gtranslator: Reactivando Una
    Desarrollo de una nueva versi´onde gtranslator: reactivando una comunidad de Software Libre Pablo Jos´eSangiao Roca Master on Free Software Caixanova [email protected] 2 de diciembre de 2008 c 2008 Pablo Jos´eSangiao Roca. Este documento se distribuye bajo la licencia Creative Commons 2.5 con reco- nocimiento, compartir igual. M´asdetalles sobre esta licencia en: http://creativecommons.org/licenses/by-nc-sa/2.5/es/ Resumen Un apartado muy importante para la difusi´ondel Software Libre es, sin duda, la localizaci´onde los programas. Adem´asde permitir que los usuarios puedan hacer uso de las aplicaciones adaptadas a su lengua y costumbres, facilita tambi´enque personas con perfiles no t´ecnicos, pue- dan colaborar en las comunidades de Software Libre mediante la realiza- ci´onde traducciones. Para facilitar esta labor existen algunas aplicaciones como son Kbabel dentro del proyecto KDE y gtranslator en el proyec- to GNOME. Esta ´ultimase encontraba en un estado de desarrollo muy pobre, abandonada incluso por los traductores de GNOME y con una comunidad a su alrededor pr´acticamente inexistente. En este art´ıculose mostrar´ael trabajo que se realiz´opara desarrollar una nueva versi´onde esta herramienta, que la coloque en un punto de referencia para la co- munidad de traductores. Adem´asse indicar´anlas decisiones y propuestas realizadas para intentar dinamizar la comunidad, as´ıcomo un an´alisisdel estado de la misma a trav´esde informaci´onobtenida a partir del sistema de control de versiones y listas de correo del programa. 1. Introducci´on Los proyectos de Software Libre que tienen ´exito,suelen tener una carac- ter´ısticacom´un:una gran comunidad detr´as.Esto permite que, a pesar de que la mayor´ıason comenzados y mantenidos por personas en su tiempo libre, crez- can r´apidamente gracias a las peque~nascontribuciones de mucha gente.
    [Show full text]
  • Technology 6 November 2017
    INDUSTRY NOTE China | Technology 6 November 2017 Technology EQUITY RESEARCH China Summit Takeaway: AI, Semi, Smartphone Value Chain Key Takeaway We hosted several experts and 24 A/H corporates at our China Summit and Digital Disruption tour last week. Key takeaways: 1) AI shifting from central cloud to the edge (end devices), driving GPU/FPGA demand in near term, 2) AI leaders in China rolling out own ASICs in the next 2~3 years, 3) China's new role in semi, forging ahead in design and catching up in foundry. In smartphone value chain, maintain AAC as top pick for lens opportunity and Xiaomi strength. CHINA Artificial Intelligence - From cloud to edge computing: The rise of AI on the edge (terminal devices, like surveillance cameras, smartphones) from central cloud, can solve one key weakness of AI: the brains are located thousands of miles away from the applications. The benefits of AI on the edge include: 1) better analysis based on non-compressed raw data, which contains more information, 2) lower requirement on bandwidth, as transmitted data has been pre-processed, 3) faster response. This will keep driving the demands for GPUs and FPGAs in near term. Meanwhile, China's leading AI companies including Hikvision (002415 CH) and Unisound (private, leader in voice recognition) also noted they may develop own ASICs in the next 2~3 years, for better efficiency and low power. While machines getting smarter, we notice increasing concerns on data privacy. Governments are not only implementing Big Data laws and policies, also starts investing AI leaders, like Face ++ (computer vision) in China.
    [Show full text]