Project Periodic Report: Final
Total Page:16
File Type:pdf, Size:1020Kb
VALCRI FP7-IP-608142 PROJECT PERIODIC REPORT: FINAL COVERING PERIODS 1-3, MAY 2014 – JUNE 2018 (M01-M50) VERSION 1.0 Date submitted: 17 May 2018 Dissemination Level: PU / PP / RE / CO WORK PACKAGE: WP1 MANAGEMENT WORK PACKAGE LEADER: B.L. WILLIAM WONG © 2018 VALCRI All rights reserved The research leading to these results has received funding from the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement no FP7-IP-608142. 1 VALCRI FP7-IP-608142 2 VALCRI FP7-IP-608142 Table of Contents SECTION 1 PUBLISHABLE SUMMARY: 5 A. SUMMARY DESCRIPTION OF PROJECT CONTEXT AND OBJECTIVES 5 B. DESCRIPTION OF WORK PERFORMED AND MAIN RESULTS 6 C. FINAL RESULTS AND IMPACTS 8 D. PROJECT PUBLIC WEBSITE ADDRESS: 9 SECTION 2 CORE OF THE REPORT: 10 2.1 INTRODUCTION 10 2.1.1 Project Objectives 10 2.1.2 Problems faced by criminal intelligence analysts 10 2.1.3 The VALCRI Concept 12 2.1.4 Summary of Achievements 18 2.2 WORK PROGRESS AND ACHIEVEMENTS DURING THE PERIOD M17 – M32 21 2.2.1 Work Package 1 - MANAGEMENT 21 2.2.2 Work Package 2 - REQUIREMENTS AND DESIGN 23 2.2.3 Work Package 3 - HUMAN ISSUES 27 2.2.4 Work Package 4 - ANALYST USER INTERFACE 40 2.2.5 Work Package 5 - DATA SPACE 44 2.2.6 Work Package 6 - ANALYSIS SPACE 51 2.2.7 Work Package 7 - HYPOTHESIS SPACE – USER INTERFACE 55 2.2.8 Work Package 8 - DATA EXTRACTION 58 2.2.9 Work Package 9 - EVENT DETECTION 63 2.2.10 Work Package 10 - ONTOLOGY LIBRARY 66 2.2.11 Work Package 11 - RESEARCH DATA 73 2.2.12 Work Package 12 - SCALABLE & SECURE DISTRIBUTED PROCESSING & INTEGRATION 78 2.2.13 Work Package 13 - TRAINING AND EVALUATION 83 2.2.14 Work Package 14 - DISSEMINATION AND EXPLOITATION 91 2.3. ALIGNMENT VIA SUBGROUPS (SG), ACTIVITIES IN THE REPORTING PERIOD 98 2.3.1 SG1: DEA (data extraction and analysis) 98 2.3.2 SG2: DMO (data management and ontologies) 98 2.3.3 SG3: DAR (design and architecture) 99 3.3.4 SG4: RCC (requirements consolidation and concretization) 99 2.3.5 SG5: SEPL (security, ethics, privacy, legal) 99 2.3.6 SG6: UUC (UX/UI and cognitive aspects) 100 2.3.7 SG7: Training and Evaluation 101 2.4 PROJECT MANAGEMENT 103 2.4.1 Scientific Coordination 103 2.4.2 Administrative and financial stability 103 2.4.3 Exploitation: Phase 4 Plans and Project Extension 104 3 VALCRI FP7-IP-608142 ANNEX A ADVANCES AND GROUND BREAKERS 1 ADVANCES AND GROUNDBREAKING SCIENCE AND TECHNOLOGIES 2 AGB1. The Analyst’s User Interface: How Analysts Think 3 AGB2. The Analyst’s User Interface: The Reasoning Workspace and Thinking Landscape 6 AGB3. Insight into challenges of using sophisticated software in criminal investigations and meeting disclosure obligations with regard to the production of “relevant material” under CPIA 1996 8 AGB4. Shewmaps - Gridded Geographical Summaries Of Multiple Spc Charts 10 AGB5. A Descriptive, Practical, Hybrid Argumentation Model to Assist With the Formulation of Defensible Assessments in Uncertain Sense-Making Environments 11 AGB6. Interactive human-centered extraction of temporal-spatial and behavioral associations for crime analysis 13 AGB7. Aggregated Visualization of Elements outside of a Visualization Viewport (Off-Screen) 15 AGB8. Interactive Machine Learning for Crime Data Analysis 16 AGB9. Interactive Dimensionality-Reduction to foster Data Exploration and Sense Making 17 AGB10. Modelling the visual analytic process 18 AGB11. Provenance models, methods and system architecture to support legal, ethical, and privacy requirements when dealing with police data 20 AGB12. Alignment Cubes – a tool for comparing and evaluating ontology alignments 22 AGB13. Requirements for user involvement for ontology alignment systems 23 AGB14. RepOSE plugin for Protégé – a tool for completing ontologies 25 AGB15. Advances in Ontology Design Pattern (ODP) usage methods (the eXtreme Design methodology) and support tooling. 26 AGB16. RSP-SPIN Service and RDF Stream Processing architecture 28 AGB17. REST-SPIN and Query Template Library 30 AGB18. Advances in the understanding of design and interpretation of data dense interactive graphics in crime analysis. 32 AGB19. Furthering understanding and implementation of legal developments in the field of privacy and data protection and assisting in development of innovative legally compliant police technologies. 34 AGB20. Methods for Discovering and Mitigating Cognitive Biases in a Visual Analytics Environment 38 AGB21. Sense-making in intelligence analysis and development of a framework of recommendations for the design of such systems 41 AGB22. Analysis of the new data protection framework for the Law Enforcement Agencies (LEA) sector in Europe to determine technical and organisational solution approaches for achieving legal compliance. 43 AGB23. The Master Analyst: A Reference Curriculum for Law Enforcement Professionals 46 AGB24. Visual Analytics for Federal Police of Antwerp 48 AGB25. Fuzzy integration tests over changing data 51 AGB26. Generators of faked yet realistic data. 53 AGB27. Enhancing VALCRI's operational efficacy by evaluating a dominant LPA data source - ISLP - for integration.56 AGB28. Development and testing environment 58 ANNEX B PUBLISHED PAPERS 1 ANNEX C VIDEOS 1 ANNEX D IEB Report and IEB Risk Assessment 3 Final Report of the Independent Ethics Board on the VALCRI Project 4 VALCRI Risk Assessment for IEB-related issues 20 ANNEX E EXPLOITATION PRINCIPLES AND COMMERCIAL IP LIST 1 Annex E, Appendix 1 - VALCRI – Commercial IP Exploitation Principles 1 Annex E, Appendix 2 - Commercial IP origination and individual or joint ownerships 1 4 VALCRI FP7-IP-608142 SECTION 1 PUBLISHABLE SUMMARY: Project Objectives, Work Progress, Achievements, and Project Management Period 3, M17 – M50, Sep 2014 – Jun 2018 A. SUMMARY DESCRIPTION OF PROJECT CONTEXT AND OBJECTIVES (4000 CHAR // 3998 CHARS) INTRODUCTION The purpose of VALCRI was to develop the next generation criminal intelligence analysis system for European LEAs. Working closely with three European police forces, the project researched and developed at TRL-5, an integrated system of over 75 software components of advanced data processing, analytic and sense-making tools. It includes multiple applications spanning strategic intelligence analysis to tactical intelligence and individual case management. The VALCRI system was routinely evaluated with project end-users. In the final nine months, it has been evaluated with 214 LEA officers in 50 agencies in 16 countries and 2 international LEAs (Europol and NATO Intelligence Fusion Centre). It is undergoing trials with actual data at the London Metropolitan Police, and the Pasco County Sheriff’s Department in Florida. Negotiations are also underway to purchase or licence various VALCRI technologies and non-software outcomes. VALCRI used a cognitive engineering approach to create a human-technology team that combined advanced concepts of human reasoning and analytic discourse with machine learning and database technologies. The result has been a semi-automated human-mediated semantic knowledge extraction capability that can facilitate and improve investigative sense making and problem solving in crime analysis and criminal investigation in a high ambiguity and constantly evolving environment. KEY DISTINGUISHING FEATURES 1. SUPPORT HOW ANALYSTS THINK, RATHER THAN WHAT ANALYSTS DO If VALCRI were designed to mainly support what analysts do, then the system would primarily automate current tasks and workflows. Instead, by designing for how analysts think, the VALCRI system is better able to respond to the variety of sense making, reasoning and inference making and problem solving strategies presented by human analysts. 2. FACILITATE EXPERT INTUITION TO SCIENTIFIC METHOD In many investigations, analysts are often only presented with fragments of data from which to create an understanding of the situation and to anticipate what might happen. Expert intuition is very useful in generating “hunches”, or early, plausible and tentative hypotheses. However, hunches can be error prone and subject to cognitive biases. VALCRI has designed quick ways for analysts to use the scientific method to test their hunches so that they may easily discard it if proven wrong. 5 VALCRI FP7-IP-608142 3. HUMANS DECIDE, MACHINES DO THE HEAVY LIFTING VALCRI has been designed so that humans and machines do what each is good at: Humans make decisions under ambiguity; machines are fast at tedious and repetitive task. So, when an analyst instructs VALCRI to “find me more reports like this …”, the machine learning-based automation will trawl through large volumes of structured and un-structured data (e.g. free text) to retrieve, triage, collate, thematically analyse the data, and then combines and presents the reports in context of the crime problem being investigated, e.g. Comparative Case Analysis. 4. ETHICS, LEGAL AND PRIVACY BY DESIGN In many LEA data analytics systems, once a person’s data is enmeshed in the system-data networks, that person will continue to be linked to those criminal profiles. Such profiles will be used by the system to predict membership characteristics and to set up alerts for “persons of interest”. This can lead to further stops and searches of the person, even though he may be innocent. This interferes with his private life. VALCRI advocates the need for ‘computational transparency’ as a mitigating approach: make visible the inner workings of ‘black box’ automated algorithms. A lower TRL prototype has been implemented in VALCRI to investigate how fine grain data access controls may be combined with computational transparency so that analysts and investigators are aware of the provenance of algorithm’s computed results and protect the rights of individuals. 5. UP-SKILLING OF ANALYTIC ABILITIES VALCRI has also identified and addressed varying deficiencies in the abilities of the intelligence analysis community. Some of this have been formalised in a new Master degree level analytics training course at Aston University in Birmingham, in partnership with the West Midlands Police; and some have been formalised into commercial intelligence analysis training packages focusing on analytic reasoning.