Uncertainty Treemaps
Total Page:16
File Type:pdf, Size:1020Kb
Uncertainty Treemaps Citation for published version (APA): Sondag, M., Meulemans, W., Schulz, C., Verbeek, K., Weiskopf, D., & Speckmann, B. (2020). Uncertainty Treemaps. In F. Beck, J. Seo, & C. Wang (Eds.), 2020 IEEE Pacific Visualization Symposium, PacificVis 2020 - Proceedings (pp. 111-120). [9086235] IEEE Computer Society. https://doi.org/10.1109/PacificVis48177.2020.7614 DOI: 10.1109/PacificVis48177.2020.7614 Document status and date: Published: 01/06/2020 Document Version: Accepted manuscript including changes made at the peer-review stage Please check the document version of this publication: • A submitted manuscript is the version of the article upon submission and before peer-review. There can be important differences between the submitted version and the official published version of record. People interested in the research are advised to contact the author for the final version of the publication, or visit the DOI to the publisher's website. • The final author version and the galley proof are versions of the publication after peer review. • The final published version features the final layout of the paper including the volume, issue and page numbers. Link to publication General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal. If the publication is distributed under the terms of Article 25fa of the Dutch Copyright Act, indicated by the “Taverne” license above, please follow below link for the End User Agreement: www.tue.nl/taverne Take down policy If you believe that this document breaches copyright please contact us at: [email protected] providing details and we will investigate your claim. Download date: 25. Sep. 2021 Uncertainty Treemaps Max Sondag* Wouter Meulemans* Christoph Schulz† TU Eindhoven TU Eindhoven University of Stuttgart Kevin Verbeek* Daniel Weiskopf† Bettina Speckmann* TU Eindhoven University of Stuttgart TU Eindhoven Value encoded by area Low High Uncertainty encoded by hatched area Low High Hatching type matches hierarchy Leaf Root Superimposed hierarchy levels Figure 1: Uncertainty Treemap of coffee imports from 1994 to 2014, decomposed into continents, subregions, and countries [39]. Our visualization encodes uncertainty using nested hatched lines in the areas of the corresponding values. Of the North American countries ( ), the United States (USA) import much more coffee than Canada (CAN). While Canada has much less uncertainty in its value (area of ), both countries share a similar relative fluctuation (height of relative to containing rectangles). Similarly, going up two levels ( ) reveals that Europe ( , , , ) has lower relative fluctuation than the Americas ( , ). ABSTRACT are a variety of treemaps using different shapes, such as Voronoi Rectangular treemaps visualize hierarchical numerical data by recur- Treemaps [3], Orthoconvex and L-shaped Treemaps [13], and Jig- sively partitioning an input rectangle into smaller rectangles whose saw Treemaps [41]. Here, we focus on rectangular treemapping R areas match the data. Numerical data often has uncertainty asso- algorithms that partition an input rectangle into a set of smaller ciated with it. To visualize uncertainty in a rectangular treemap, rectangles, such that the area of each such rectangle corresponds to we identify two conflicting key requirements: (i) to assess the data the data value associated with the corresponding leaf in the input value of a node in the hierarchy, the area of its rectangle should hierarchy. Furthermore, the rectangles associated with the children directly match its data value, and (ii) to facilitate comparison be- of an interior node of the hierarchy form again a rectangle, that is tween data and uncertainty, uncertainty should be encoded using the associated with their parent and has the area corresponding to the same visual variable as the data, that is, area. We present Uncer- node value. It has been shown that rectangular treemaps satisfy tainty Treemaps, which meet both requirements simultaneously by important design consideration for the visualization of hierarchi- introducing the concept of hierarchical uncertainty masks. First, we cal data: they present the input data in a distortion-free manner, define a new cost function that measures the quality of Uncertainty they clearly visualize the input hierarchy, and they are maximally Treemaps. Then, we show how to adapt existing treemapping algo- space-efficient [25, 29]. rithms to support uncertainty masks. Finally, we demonstrate the Numerical data often has uncertainty (measurement, complete- usefulness and quality of our technique through an expert review ness, inference) associated with each data value. When plotting and a computational experiment on real-world datasets. singular data values, it is straightforward to indicate the error, for example, using error bars. Such visualizations anchor the uncer- Index Terms: Human-centered computing—Visualization—Visu- tainty at the data value. While this strategy works well for low- alization techniques—Treemaps dimensional data, it does not lend itself directly to more complex or higher-dimensional data. In the context of treemaps, an additional 1 INTRODUCTION challenge is the fact that uncertainty does not aggregate in the same way as data values do: relative uncertainty tends to become smaller, Treemaps are a well-established method to visualize hierarchical the higher a node is in the hierarchy. Despite these challenges, any numerical data. A treemap recursively partitions a two-dimensional visualization that aims to be trustworthy and faithful to the data it shape into regions whose areas correspond to the input data. There represents has to visualize uncertainty together with the data. Never- theless, techniques that support plotting certain and uncertain data * e-mail: fm.f.m.sondag,w.meulemans,k.a.b.verbeek,[email protected] simultaneously for more complex data types are rather scarce [20]. † f g e-mail: christoph.schulz,daniel.weiskopf @visus.uni-stuttgart.de In this paper we show how to visualize both hierarchical data and their associated uncertainty in rectangular treemaps. To do so, we identify two conflicting key requirements: Visual Aggregation. To assess the value of an interior node, the Figure 2: Uncertainty Treemaps and Bubble Treemaps [15]. Left: the S&P dataset [15]. The Uncertainty Treemap shows that this dataset has only very few data points with high uncertainty, whereas this is harder to establish for the Bubble Treemap. Right: the Coffee dataset [35]. The Bubble Treemap produces thick and folded boundaries that prohibit value comparison; in contrast, the Uncertainty Treemap indicates the aggregated uncertainty for each level in the hierarchy. area of its rectangle (or generally, its visual size) should directly and summarize the result of an expert review. Furthermore, we match its value. introduce a new quality metric that measures the quality of a mask. While our hierarchical uncertainty masks can be applied to any Uncertainty Encoding by Area. To facilitate comparison between treemap layout computed with any treemapping algorithm, cer- data and uncertainty, uncertainty should be encoded using the tain layouts result in better mask quality. To compute Uncertainty same visual variable as the data, that is, area. Treemaps with high mask quality, we could focus on this optimiza- tion criterion, while ignoring the aspect ratio of the rectangles—the Either requirement is straightforward to satisfy in isolation. For most common quality criterion for rectangular treemaps. However, example, visual aggregation can be maintained if uncertainty is the (summed) mean values still need to remain legible and compara- encoded using a sequential colormap or any other visual variable ble. Thus, in Section 4, we show how to adapt existing treemapping that does not hamper the perception of area. Similarly, uncertainty algorithms to take the uncertainty masks into account. Here, we dis- can be encoded using additional area adjacent to each rectangle. tinguish two types of algorithms: mask-friendly algorithms that use Doing so, however, will make visual aggregation next to impossible, only the fact that a mask will be placed and mask-aware algorithms as an inner node will have additional area inside its rectangle. In that use the uncertainty values to compute the layout. contrast to these decoupled approaches, our technique meets both In Section 5, we experimentally compare variants of our mask- requirements simultaneously. As an integral part of our uncertainty quality metric to establish how well these measures are able to Treemaps, we introduce hierarchical uncertainty masks as a tool to capture different aspects of quality. Furthermore, we investigate the visualize data and uncertainty in the same visual space. effectiveness of mask-friendly and mask-aware algorithms on several Definitions and Notation. Let T be a rooted tree. Each node v 2 T real-world datasets. Finally, in Section 6, we discuss various design has a set of children C(v). A node without children is a leaf; non-leaf alternatives for the shape, placement, and rendering of the masks, as nodes are interior nodes. We denote the value of each node v in well as the effects of uncertainty and unbalanced hierarchies. T by m(v) and we have m(v) = ∑u2C(v) m(u) for any interior node 2 RELATED WORK v. We assume that this value m corresponds to the mean and the sum of means, respectively. We denote the uncertainty of a node There are many different methods to visualize hierarchical data, v 2 T by s(v). Generally, the uncertainty of an interior node can Schulz et al. [33] provide an extensive survey of the entire range of be derived from its children.