Capability Based Planning and Information Visualization Development of Visualizations
. L . L . . . S. CAE Professional Services (Canada) Inc. 1135 Innovation Dr. Ottawa, Ontario K2G 3G7, Canada Contract Project Manager: Richard Percival PWGSC Contract Number: W7714-11-5174 CSA: Mark Rempel, Chad Young, Strategic Planning Operations Research Team
"!" "" 0 ! " 0 0 ! "
" " 0 ! )!" <) .
DRDC CORA CR 2012-065 March 2012
Defence R&D Canada Centre for Operational Research and Analysis
Strategic Planning Operational Research
Capability Based Planning and Information Visualization Development of Visualizations
! " #$ %" &&'( % ) *+ * ,-, '.
/01 ! " #$ %" &&'( % ) *+ * ,-, '. " 2" 3 1 " " 4&'-5.6'5- 7 " 8 1 7..&5&&(&.5 1 9 :3 )!" " 4&';5.'(.;
"!" "" 0 ! " 0 0 ! " " " 0 ! )!" <) .
Defence R&D Canada – CORA " )) * -6&-64( " -6&-
Principal Author
Original signed by Catherine Campbell C. Campbell CAE Professional Services (Canada), Inc.
Approved by
Original signed by Robert Burton Robert Burton Section Head, Joint Systems Analysis
Approved for release by
Original signed by Paul Comeau Paul Comeau Chief Scientist
© Her Majesty the Queen in Right of Canada, as represented by the Minister of National Defence, 2012 © Sa Majesté la Reine (en droit du Canada), telle que représentée par le ministre de la Défense nationale, 2012
Abstract ……..
The Capability Based Planning (CBP) process in Canada is based on an extensive data model that integrates scenario information, effects information, capabilities, force elements, a strategic cost model, and a Program Activity Architecture (PAA) model. Each element is a dense and highly inter-related set of data. Understanding the breadth and depth of considerations that must be made in CBP is difficult for senior decision-makers and their staffs. This contract investigates the potential for information visualization to support these people in understanding and making effective use of CBP model data to support decision-making. To this end, this report describes the preliminary use of two information visualization toolkits (Prefuse and Flare, selected during phase I of this work) to develop visualizations. This work had two purposes: familiarization with the toolkits, and evaluation of the relative strengths and weaknesses of the two toolkits. The output from this preliminary work is presented here, as are the results of an evaluation session involving the contract Scientific Authority (SA). Based on this preliminary work, Prefuse is recommended for further development work, and a number of features of a prototype information visualization application are introduced that may be helpful for senior decision makers.
Résumé ….....
Au Canada, le processus de planification fondée sur les capacités (PFC) s’appuie sur un vaste modèle de données qui intègre de l’information sur le scénario, de l’information sur les effets, des capacités, des éléments de la force, un modèle de coût stratégique, et un modèle d’Architecture des activités de programme (AAP). Chaque élément contient un ensemble de données denses et hautement interreliées. La compréhension de l’étendue et de la portée des considérations qui peuvent être utilisées dans la PFC est difficile pour les principaux décideurs et leur effectif. Le présent travail permet d’examiner la possibilité d’utiliser la visualisation de l’information afin d’aider ces gens à comprendre et à utiliser efficacement les données du modèle de PFC pour appuyer la prise de décisions. À cette fin, le rapport décrit l’utilisation préliminaire de deux trousses d’outils de visualisation de l’information (Prefuse et Flare, qui ont été sélectionnés pendant la phase I de ce travail) afin de développer des outils de visualisation. Ce travail avait deux objectifs : la familiarisation avec les trousses d’outils et l’évaluation des forces et faiblesses relatives des deux trousses. Le résultat de ce travail préliminaire est présenté dans ce rapport, ainsi que les résultats de la séance d’évaluation faisant intervenir l’autorité scientifique des marchés (ASM). À partir des travaux préliminaires, Prefuse a été recommandé pour le travail de développement à venir. De plus, certaines caractéristiques d’un prototype d’application de visualisation de l’information ont été mises en œuvre et pourraient être utiles aux principaux décideurs.
DRDC CORA CR 2012-065 i
This page intentionally left blank.
ii DRDC CORA CR 2012-065
Executive summary
Capability Based Planning and Information Visualization: Development of Visualizations Campbell, C.; Labrie, M-A.; Lamoureux, T.; Guo, R.; Smith, G.; and Pronovost, S.; DRDC CORA CR 2012-065; Defence R&D Canada – CORA; March 2012. Introduction or background: The Capability Based Planning (CBP) process in Canada is based on an extensive data model that integrates scenario information, effects information, capabilities, force elements, a strategic cost model, and a Program Activity Architecture (PAA) model. Each element is a dense and highly inter-related set of data. Understanding the breadth and depth of considerations that must be made in CBP is difficult for senior decision-makers and their staffs. This contract investigates the potential for information visualization to support these people in understanding and making effective use of CBP model data to support decision-making. To this end, this report describes the preliminary use of two information visualization toolkits (Prefuse and Flare, selected during phase I of this work) to develop simple visualizations. This work had two purposes: familiarization with the toolkits, and evaluation of the relative strengths and weaknesses of the two toolkits.
Results: The output from this preliminary work is presented here, as are the results of an evaluation session involving the contract Scientific Authority (SA). Flare was found to be more attractive for users, but it was harder to use because of the need to learn ActionScript. A number of different visualizations were developed in each toolkit, as follows:
Dependency graph (Prefuse and Flare); Hyperbolic tree (Flare); Tree view (Prefuse); Tree map (Prefuse); and, Term/word cloud (Prefuse).
Also, a number of demonstrations of different interactions were developed for each toolkit.
Given the degree of familiarity with Prefuse, in part because it is Java based, it was decided to focus on Prefuse for the development of an information visualization prototype.
Significance: The preliminary work with the two information visualization toolkits allowed the team to select a toolkit on which to focus effort, and led to the development of plans for a prototype information visualization application, to be described in a report for phase III of this contract.
DRDC CORA CR 2012-065 iii
Future plans: Based on this preliminary work, Prefuse is recommended for further development work, and a number of features of a prototype information visualization application are introduced that may be helpful for CBP decision makers.
iv DRDC CORA CR 2012-065
Sommaire .....
Capability Based Planning and Information Visualization: Development of Visualizations Campbell, C.; Labrie, M-A.; Lamoureux, T.; Guo, R.; Smith, G.; and Pronovost, S.; DRDC CORA CR 2012-065; R & D pour la défense Canada – CORA; Mars 2012.
Introduction ou contexte: Au Canada, le processus de planification fondée sur les capacités (PFC) s’appuie sur un vaste modèle de données qui intègre de l’information sur le scénario, de l’information sur les effets, des capacités, des éléments de la force, un modèle de coût stratégique, et un modèle d’Architecture des activités de programme (AAP). Chaque élément contient un ensemble de données denses et hautement interreliées. La compréhension de l’étendue et de la portée des considérations qui peuvent être utilisées dans la PFC est difficile pour les principaux décideurs et leur effectif. Le présent travail permet d’examiner la possibilité d’utiliser la visualisation de l’information afin d’aider ces gens à comprendre et à utiliser efficacement les données du modèle de PFC pour appuyer la prise de décisions. À cette fin, le rapport décrit l’utilisation préliminaire de deux trousses d’outils de visualisation de l’information (Prefuse et Flare, qui ont été sélectionnés pendant la phase I de ce travail) afin de développer des outils de visualisation. Ce travail avait deux objectifs : la familiarisation avec les trousses d’outils et l’évaluation des forces et faiblesses relatives des deux trousses. Résultats: Le résultat du travail préliminaire est présenté ci-dessous ainsi que la séance d’évaluation faisant intervenir l’autorité scientifique de marchés (ASM). Aux yeux des utilisateurs, Flare était plus attrayant, mais il était plus difficile à utiliser étant donné que les utilisateurs devaient apprendre le programme ActionScript. Diverses visualisations ont été développées pour chaque trousse d’outils, les voici :
Diagramme de dépendances (Prefuse et Flare); Arbre hyperbolique (Flare); Vue d’arborescence (Prefuse); Carte d’arborescence (Prefuse); Nuage de termes/mots (Prefuse).
De plus, un certain nombre de démonstrations de diverses interactions ont été menées pour chaque trousse d’outils.
En raison du degré de familiarisation avec Prefuse, en partie parce qu’il est basé sur le langage de Java, on a décidé de mettre l’accent sur Prefuse en vue du développement du prototype de visualisation de l’information.
Importance: Le travail préliminaire effectué à l’aide de deux trousses d’outils de visualisation de l’information a permis à l’équipe de sélectionner une trousse d’outils afin d’y consacrer ses efforts. Cela a mené à l’élaboration de plans pour le prototype d’application de visualisation de l’information qui seront décrits dans le rapport de la phase III de ce travail.
DRDC CORA CR 2012-065 v
Perspectives: À partir des travaux préliminaires, Prefuse a été recommandé pour le travail de développement à venir. De plus, certaines caractéristiques d’un prototype d’application de visualisation de l’information ont été mises en œuvre et pourraient être utiles pour les décideurs de la PFC.
vi DRDC CORA CR 2012-065
Table of contents
Abstract ……...... i Résumé …...... i Executive summary ...... iii Sommaire ...... v Table of contents ...... vii List of figures ...... ix List of tables ...... x 1 Introduction...... 1 1.1 Purpose and Scope...... 1 1.2 Objectives...... 1 1.3 Scope ...... 2 1.4 This Document ...... 2 1.5 Method...... 3 2 Initial implementation...... 4 2.1 Prefuse...... 4 2.1.1 Dependency graph...... 5 2.1.2 Tree view...... 6 2.1.3 Tree map ...... 8 2.1.4 Term/Word cloud...... 9 2.1.5 Interaction features...... 10 2.1.6 Visualization types that were not implemented ...... 14 2.1.7 Approach and lessons learned...... 16 2.1.7.1 General ...... 16 2.1.7.2 Adding label data in the graphml file ...... 16 2.1.7.3 Trees in TreeML...... 16 2.1.7.4 Multiple edges ...... 16 2.1.7.5 Interconnecting different visualizations ...... 16 2.2 Flare...... 17 2.2.1 Dependency graph...... 17 2.2.2 Hyperbolic Tree ...... 19 2.2.3 Interaction features...... 20 2.2.4 Visualization types that were not implemented ...... 21 2.2.5 Approach and lessons learned...... 21 2.3 Graphical User Interface...... 21 2.3.1 GUI Concept Evolution...... 21 2.3.2 GUI Prototype...... 27 3 Assessment of initial implementations ...... 29
DRDC CORA CR 2012-065 vii
4 Creative development ...... 31 5 Feasibility assessment...... 33 6 Enhanced implementation plan...... 36 Annex A .. Minutes of the Implementation Assessment and Creative Development Meeting...... 39 List of symbols/abbreviations/acronyms/initialisms ...... 43
viii DRDC CORA CR 2012-065
List of figures
Figure 1: A Dependency graph view between force elements and bottom level capabilities...... 5 Figure 2: Dependencies for a selected force element with edge values displayed ...... 6 Figure 3: Tree view (orientation 1)...... 7 Figure 4: Tree map representation of the capability framework and PAA...... 8 Figure 5: Tree Map visualization of Capability Framework and PAA with “paa or capability_framework::3” entered as a search criteria...... 9 Figure 6: Force elements in a word cloud view...... 10 Figure 7: Tree view (orientation 1)...... 11 Figure 8: Tree view (orientation 2 - detail)...... 11 Figure 9: Tree view (orientation 3 - detail)...... 12 Figure 10: Tree view (orientation 4)...... 13 Figure 11: Result of double-clicking on a node...... 14 Figure 12: Flow map example available with the Stanford library...... 15 Figure 13: Flare Dependency Graph ...... 18 Figure 14: Flare Dependency Graph showing selected node and related edges...... 19 Figure 15: Hyperbolic Tree in Flare...... 20 Figure 16: Inspiration for overview development: yEd – Graph Editor (http://www.yworks.com/en/products_yed_about.html)...... 22 Figure 17 Example of yEd-Graph editor interface including an overview pane, Successors pane, Structure view, Main window, Palette pane for editing, and Properties view... 23 Figure 18: Initial concepts for information visualization interface layout and functionality ...... 24 Figure 19: Initial concepts for information visualization interface design and specific capabilities...... 25 Figure 22: First prototype: initial screen - dependency graph...... 28 Figure 23: First prototype: node selection and zoom in on main window and tree view...... 28
DRDC CORA CR 2012-065 ix
List of tables
Table 1: Summary of outcomes of research conducted during the first phase of this project...... 1 Table 2 Proposed visualization development activities...... 31 Table 3 Feasibility Analysis for Additional Development of Visualizations in Prefuse...... 33
x DRDC CORA CR 2012-065
1 Introduction
1.1 Purpose and Scope
The purpose of this contract is to investigate the field of information visualization and to develop visualizations that will assist strategic decision makers in the Department of National Defence (DND) and the Canadian Forces (CF) to employ the Capability-Based Planning (CBP) approach to guide the Department’s strategic activities. To successfully carry out this contract, datasets contributing to the CBP process were studied, the means to create visualizations were investigated, and the various strengths and weakness of different visualization types were mapped to the needs of the datasets and users. This led to selection of a finite number of visualization types to be implemented in the selected toolkit as presented in Table 1 (justification for these selections is presented in the phase 1 report). The development of these visualizations is the focus of this report.
Table 1: Summary of outcomes of research conducted during the first phase of this project Selected visualization types Selected interaction Selected toolkits techniques Dependency graph Zoom +/- Prefuse Tree view Fisheye Flare Flow map Pan Hyperbolic tree (radial space- Selection filling tree graphs) Self-organizing map Search/query Tree map Filtering Term/Word cloud Gauges Starfield Basic visualizations (bar graph, pie chart, scatter plot, etc.)
1.2 Objectives The contract objectives most relevant to this second phase of the work are:
1. Develop and demonstrate interactive information visualization and related functionality;
2. Assess the ultimate benefit of interactive visualizations that have been built to explore and exploit the relationships between existing data, models and frameworks within the domain of strategic planning and enterprise risk management as it currently exists with DND/CF.
DRDC CORA CR 2012-065 1
1.3 Scope
A subset of the dataset used within the CBP process is represented by five ‘dummy’ datasets that are richly interconnected. These datasets include variables (i.e. ‘nodes’) and relationships between the variables (i.e. ‘edges’). The datasets represent: force planning scenario and desired effects; the capability framework; force elements; the Program Activity Architecture (PAA); and, the Strategic Cost Model (SCM). These datasets are discussed in detail in the phase 1 report and represent the scope of data to be considered for selection and development of visualizations using the selected toolkits.
Although upwards of 80 visualization development toolkits were identified, as discussed in the phase 1 report, the focus of this contract is on two in particular: Prefuse (www.prefuse.org) and Flare (flare.prefuse.org). The Scientific Authority (SA) for this contract provided a shortlist of 10 visualization types for implementation. These, or modified versions of these, were implemented in the visualization development toolkits. Implementation of additional visualization types and variations of these are discussed as part of the creative development activity in this phase of the project (outlined below).
1.4 This Document This document reports on the second phase of work in this project and is divided into six sections:
1. Introduction (this section): describes the background to the contract, the approach taken to the overall contract, and a review of the process and outcomes from the first phase of this contract;
2. Documentation of the implemented visualizations;
3. Results of the Assessment of initial visualizations;
4. Results of Creative Development;
5. Feasibility assessment of recommended changes; and,
6. Enhanced Implementation Plan: describes the plan for further development of selected visualizations.
Computer code developed as part of this phase of work has been provided to the SA electronically.
There are two other reports in this contract:
1. The phase 1 report describes the CBP process and datasets used in this work; the process for identification and outcomes from the evaluation of various classes of visualization types and their relative strengths and weaknesses; frequently referenced open-source visualization toolkits and their relative strengths and weaknesses; the selection criteria developed to support evaluation of visualization types and toolkits in the context of the CBP dataset; and an Implementation plan outlining the steps required to build the selected visualizations using the selected toolkit.
2. The phase 3 report describes the enhanced implementation, discusses outstanding issues and provides a list, with brief descriptions, of potential ways of improving the visualizations to facilitate the CBP process.
2 DRDC CORA CR 2012-065
1.5 Method
This phase of the project is predominantly concerned with initial development of visualizations, assessment of initial implementations, and development of plans for future implementations. The process was conducted as follows:
1. Initial implementation of visualizations using subsets of the CBP dataset provided by the Scientific Authority;
2. Assessment of the initial visualizations in collaboration with the SA;
3. Creative development workshop targeting possible elaborations on the developed visualizations, selection of additional visualizations and interaction techniques to be implemented in the second development effort;
4. Feasibility assessment for desired elaborations in the context of allocated resources and project timeline; and
5. Development of an implementation plan for the development of enhanced visualizations.
DRDC CORA CR 2012-065 3
2 Initial implementation
The software development team focussed on implementation of the selected visualizations in Prefuse and Flare. The toolkits were investigated in parallel such that a comparison between them could be made with respect to:
1. level of effort to produce visualizations, and
2. visual output produced.
The following sub-sections present the visualization types and interaction techniques that were achieved in Prefuse and Flare.
2.1 Prefuse
Initial implementations using Prefuse focussed on the list of preferred visualizations provided by the SA in the SOW. From that list the following visualizations were implemented, with varying degrees of success, using a subset of the CBP dataset:
Dependency graph Tree view Tree map Term/Word cloud
As part of the familiarization phase of the project labelling nodes and edges, colour coding and edge thickness were explored in isolation. Many of these features were integrated into the visualizations presented above. Basic interaction techniques were explored, including drag and drop moving nodes, mouse over labelling, search fields, and selection of edges and nodes.
The visualization types that were not implemented include:
Flow map Hyperbolic tree Self-organizing map Gauges Starfield Basic visualizations (as demonstrated on flex.org)
4 DRDC CORA CR 2012-065
The remainder of this section presents the implemented visualizations, the rationale for not implementing the remaining visualizations and a discussion of lessons learned from the successes, challenges and anticipated capabilities of the Prefuse program with respect to the implementation of these visualizations for the CBP dataset.
2.1.1 Dependency graph
The dependency graph implementation includes representation of the dependencies between the force elements and the capability framework. Note that only the lowest level elements in the capability framework are shown in this work, since they are the only ones with a direct relationship with the force elements. A circle layout is used to place the nodes. The force elements are displayed in orange and the capability nodes are displayed in light-blue, illustrating some of the labelling capabilities of the Prefuse toolkit.
When the visualization is first opened by the user, all of the edges are represented as shown in Figure 1. Due to the selected subset of data, only data for cost_horizon_0 is represented and displayed directly on the edge.
If a user double-clicks on a node, the color of that node will change to red and only the edges related to that node will be visible. The result of this action is presented in Figure 2.
Figure 1: A Dependency graph view between force elements and bottom level capabilities.
DRDC CORA CR 2012-065 5
Figure 2: Dependencies for a selected force element with edge values displayed
2.1.2 Tree view
The tree view implementation was developed using a subset of the original dataset: the capability framework hierarchy and associated information for each node. It was transformed to TreeML format using available open source example code1. Using this data subset, tree values are defined for each node, corresponding to the score of the capability element for the three scenarios. As the file was generated manually, all the values are 0.0.
1 See http://prefuse.org/doc/api/prefuse/data/io/TreeMLReader.html and http://www.nomencurator.org/InfoVis2003/download/treeml.dtd 6 DRDC CORA CR 2012-065
A global view of the Tree view visualization examples is presented in Figure 3. Additional examples are used to illustrate possible interactions and manipulations by the user in section 2.1.5.
Figure 3: Tree view (orientation 1).
DRDC CORA CR 2012-065 7
2.1.3 Tree map
This implementation displays the capability framework and PAA hierarchy in a tree map view (Figure 4). It does not allow the user to see node data such as the score of a capability vs. a scenario, but this is a possibility. The names of the lowest level of the hierarchy (leaves) are displayed along the bottom of the window when the mouse cursor is over a node. Thus, this implementation gives an overview of the size of a sub-tree i.e. the more leaves a sub-tree has, the larger the region occupied by this sub-tree. The tree map can also provide graphical indications of other properties of nodes, for instance the importance or some other specified value, if the user chose to arrange the tree map according to particular data fields.
Figure 4: Tree map representation of the capability framework and PAA
The tree map also shows the ability of Prefuse to support searching. The example in Figure 5 presents a varied search for nodes beginning with ‘PAA’ or ‘Capability_Framework::3’. Although not necessarily possible in a tree map, such a search could also be performed for edges, which are also named.
8 DRDC CORA CR 2012-065
Figure 5: Tree Map visualization of Capability Framework and PAA with “paa or capability_framework::3” entered as a search criteria.
2.1.4 Term/Word cloud
In the term/word cloud implementation, force elements are represented in a word cloud view. Notice that the node parameter cost_horizon_0 is used to determine the size of each element (see Figure 6). Placement of the labels (words) is arbitrary and the user can move them around by selecting and dragging elements across the screen. If appropriate data was provided for the nodes, the distance between words could be meaningful, denoting, for example, similarity between words or the importance of a relationship between one word and another. This would lead to clustering, providing a quick and effective means for the user to understand the structure of the dataset.
DRDC CORA CR 2012-065 9
Figure 6: Force elements in a word cloud view.
2.1.5 Interaction features
Many controls are available to modify the view of visualizations displayed in Prefuse. The following interactions were achieved using the Tree view:
1. Ctrl-1, Ctrl-2, Ctrl-3 and Ctrl-4 allow changing the orientation of the tree (see Figure 7, Figure 8, Figure 9 and Figure 10).
2. Left-mouse button: drag the view.
3. Right-mouse button: zoom.
4. Mouse-over node: display node’s name in the bottom of the window
5. Double-click on a node: display node information (values for each scenario) in a contextual view (see Figure 11).
6. Search box at lower right.
It was discovered that the actual mechanism to see the scores of each capability against each scenario does not allow for a quick overview of the data. The values can only be viewed for one node at a time. Since these data are related to nodes, it would be possible to adapt the node’s shape, size or color depending on the scenario the user wishes to analyze. For example, radio buttons could be added to allow the user to select the scenario of interest. 10 DRDC CORA CR 2012-065