Dilinet Part B
Total Page:16
File Type:pdf, Size:1020Kb
FP7 – Call 8 Large-scale integrating project (IP) Proposal number 317832 DILINET Large-scale integrating project (IP) project FP7-ICT-2011-8 DILINET Diverse Indicators for Language Influence on the inter NET PART B Objective ICT-2011.4.4 - Intelligent Information Management Target outcome b) Intelligent integrated systems that directly support decision making… Keywords: Horizontal Big Data; Decision Platform; Data Extraction; Language Indicators; Multilinguism; Web Search; Text Mining; Audio mining; Web measurement Date of preparation: 17/01/2012 Version number: V40 Beneficiary Beneficiary name Short name Country no. 1 The European Research Consortium for Informatics and ERCIM FR (coordinator) Mathematics 2 World Network for Linguistic Diversity MAAYA CH 3 Istituto di Scienza e Tecnologie dell’Informazione CNR IT 4 Dialogic Innovation & Interaction DIALOGIC NL 5 Centre National de la Recherche Scientifique CNRS FR 6 Exalead, a branch of Dassault Systèmes EXALEAD FR 7 Universitat Pompeu Fabra UPF ES 8 Fraunhofer-Gesellschaft zur Förderung der angewandten FRAUNHOFER DE Forschung e.V. 9 Dutch National Centre of Mathematics and Computer Science CWI NL 10 Fundación Redes y Desarrollo FUNREDES DO 11 Vocapia Research VOCAPIA FR 12 United Nations Educational, Scientific and Cultural Organization UNESCO FR 13 Nielsen Media Research GmbH NIELSEN DE Proposal Part B Page 1 of 136 FP7 – Call 8 Large-scale integrating project (IP) Proposal number 317832 DILINET Proposal abstract It is natural to see the Web as the best source of easily accessible, up-to-date horizontal Big Data for making decisions, reading the pulse of social situations. But what are we reading? The Web has mutated from a small initial collection of individually crafted pages written in English. No one now knows how to characterize the Web, its contours, its contents. Uniting specialists in Web indexing, mathematical sampling, graph analysis, text and speech processing, as well as international organisations implicated in world language policies, DILINET will create the tools and methods concerning language use that will allow informed decision making in multiple sensitive areas. DILINET will create new methods of intelligent Web sampling, unbiased crawling, page cleaning, and language detection (both text and audio). International organisation partners will define language indicators from these sampling methods. Research partners will develop modelling techniques to characterize language use and web content, both text and video, providing quantification for the metrics. DILINET will also develop pro-active surveying of actual web use. Surveying partners will develop voluntary-use plug-ins to capture language usage in browsing situations and to vehicle short questionnaires to different populations. Through sampling and voluntary monitoring of web usage, DILINET will provide the first reliable measure of Web characteristics. Providing new measures of information on the Web, useful both for science and government policy making, and new search strategies, DILINET applications will also demonstrate practical uses of this new information: measuring diversity in EU government websites, producing country specific Web indexes for underrepresented citizens, teaching materials for language policy makers, and targeted user campaigns for business usage. DILINET will provide accurate sampling methods and trending tools directly exploitable for support decision making and situation awareness. Proposal Part B Page 2 of 136 FP7 – Call 8 Large-scale integrating project (IP) Proposal number 317832 DILINET Table of contents B 1. Concept and objectives, progress beyond state-of-the-art, S/T methodology and work plan........ 5 B 1.1 Concept and project objective(s) ............................................................................................. 5 B 1.1.1 The DILINET vision ............................................................................................................ 5 B 1.1.2 Project Objectives............................................................................................................. 6 B 1.1.3 Project outcome ............................................................................................................... 8 B 1.1.4 Relevance to the topics addressed by the call ................................................................. 8 B 1.1.5 Timeliness of the proposed work ..................................................................................... 9 B 1.2 Progress beyond the state-of-the-art..................................................................................... 10 B 1.3 S/T methodology and associated work plan .......................................................................... 18 B 1.3.1 Overall strategy and general description ....................................................................... 18 B 1.3.2 Timing of work packages and their components ........................................................... 18 Gantt-chart ........................................................................................................................................ 33 B 1.3.3 Work package list and detailed description ................................................................... 34 Table 1.3a: Work package list ....................................................................................................... 34 Table 1.3b: Deliverables List.......................................................................................................... 35 Table 1.3c: List of milestones........................................................................................................ 37 Table 1.3d: Work package description.......................................................................................... 39 Summary of effort.............................................................................................................................. 75 B 1.3.4 Risk analysis .................................................................................................................... 77 B 2. Implementation.............................................................................................................................. 79 B 2.1 Management structure and procedures ................................................................................ 79 B 2.1.1 Management structure................................................................................................... 79 B 2.1.2 Procedures and tools...................................................................................................... 84 B 2.1.3 Conflict resolution .......................................................................................................... 85 B 2.2 Beneficiaries ........................................................................................................................... 85 B 2.3 Consortium as a whole ......................................................................................................... 101 B 2.3.1 Consortium overview and role of the participants....................................................... 101 B 2.3.2 Complementarity of participants ................................................................................. 102 B 2.3.3 Sub-contracting ............................................................................................................ 102 B 2.3.4 Other countries............................................................................................................. 104 B 2.3.5 Additional partners....................................................................................................... 104 B 2.4 Resources to be committed.................................................................................................. 105 B 2.4.1 Overview of resources to be committed...................................................................... 105 B 2.4.2 Other direct costs ......................................................................................................... 106 B 2.4.3 Sub-contracts................................................................................................................ 108 B 3. Impact........................................................................................................................................... 109 B 3.1 Strategic impact.................................................................................................................... 109 B 3.1.1 Other impact factors..................................................................................................... 111 B 3.1.2 European added value.................................................................................................. 114 B 3.2 Plan for the use and dissemination of foreground............................................................... 114 B 3.2.1 Dissemination............................................................................................................... 114 B 3.2.2 Exploitation Plans ......................................................................................................... 116 B 3.2.3 Management of intellectual property.......................................................................... 126 B 4. Ethical Issues................................................................................................................................. 128 B 5. Annexes .......................................................................................................................................