Generalized Pythagoras Trees for Visualizing Hierarchies

Generalized Pythagoras Trees for Visualizing Hierarchies Fabian Beck1, Michael Burch1, Tanja Munz1, Lorenzo Di Silvestro2 and Daniel Weiskopf1 1VISUS, University of Stuttgart, Stuttgart, Germany 2Dipartimento di Matematica e Informatica, Universita` di Catania, Catania, Italy Keywords: Hierarchy Visualization. Abstract: Pythagoras Trees are fractals that can be used to depict binary hierarchies. But this binary encoding is an obstacle for visualizing hierarchical data such as file systems or phylogenetic trees, which branch into n sub- hierarchies. Although any hierarchy can be modeled as a binary one by subsequently dividing n-ary branches into a sequence of n − 1 binary branches, we follow a different strategy. In our approach extending Pythago- ras Trees to arbitrarily branching trees, we only need a single visual element for an n-ary branch instead of spreading the binary branches along a strand. Each vertex in the hierarchy is visualized as a rectangle sized according to a metric. We analyze several visual parameters such as length, width, order, and color of the nodes against the use of different metrics. The usefulness of our technique is illustrated by two case studies visualizing directory structures and a large phylogenetic tree. We compare our approach with existing tree diagrams and discuss questions of geometry, perception, readability, and aesthetics. 1 INTRODUCTION duces space-efficient diagrams but complicates inter- preting the hierarchical structure. Hierarchical data (i.e., trees) occurs in many applica- In this paper, we introduce Generalized Pythago- tion domains, for instance, as results of a hierarchical ras Trees as an alternative to the above hierarchy visu- clustering algorithm, as files organized in directory alization techniques. It is based on Pythagoras Trees structures, or as species classified in a phylogenetic (Bosman, 1957), a fractal technique showing a binary tree. Providing an overview of possibly large and hierarchy as branching squares (Figure 1, a); the frac- deeply nested tree structures is one of the challenges tal approach is named after Pythagoras because every in information visualization. An appropriate visual- branch creates a right triangle and the Pythagorean ization technique should produce compact, readable, theorem is applicable to the areas of the squares. We and comprehensive diagrams, which ideally also look extend this approach to n-arily branching structures aesthetically appealing and natural to the human eye. and use it for depicting information hierarchies (Fig- A prominent visualization method are node-link ure 1, b). Instead of triangles, each recursive render- diagrams, which are often simply denoted as tree di- ing step produces a convex polygonal shape where the agrams; layout and aesthetic criteria have been dis- corners are placed on a semi circle. The size of the cussed (Reingold and Tilford, 1981; Wetherell and created rectangles can be modified for encoding nu- Shannon, 1979). Although node-link diagrams are meric information such as the number of leaf nodes intuitive and easy to draw, visual scalability and la- of the respective subhierarchy (Figure 1, c). beling often is an issue. An alternative, in particular We implemented the approach as an interactive easing the labeling problem, are indented trees (Burch tool and demonstrate its usefulness by applying it to et al., 2010) depicting the hierarchical structure by in- large and deeply structured abstract hierarchy data dentation. Further, layered icicle plots (Kruskal and from two application domains: a file system orga- Landwehr, 1983) stack boxes on top of each other for nized in directories and the NCBI taxonomy, a phy- encoding a hierarchy, but waste space by assigning logentic tree that structures the living organisms on large areas to inner nodes on higher levels of the hier- earth in a tree consisting of more than 300,000 ver- archy. The Treemap approach (Shneiderman, 1992), tices. Furthermore, a comparison to existing hier- which is applying the concept of nested boxes, pro- archy visualization approaches provides first insights Beck F., Burch M., Munz T., Di Silvestro L. and Weiskopf D.. 17 Generalized Pythagoras Trees for Visualizing Hierarchies. DOI: 10.5220/0004654500170028 In Proceedings of the 5th International Conference on Information Visualization Theory and Applications (IVAPP-2014), pages 17-28 ISBN: 978-989-758-005-5 Copyright c 2014 SCITEPRESS (Science and Technology Publications, Lda.) IVAPP2014-InternationalConferenceonInformationVisualizationTheoryandApplications (a) (b) (c) Figure 1: Extending Pythagoras Trees for encoding information hierarchies: (a) traditional fractal approach; (b) Generalized Pythagoras Tree applied to an n-ary information hierarchy; (c) additionally visualizing the number of leaves by the size of the inner nodes. into the unique characteristic of Generalized Pythago- strategies whereas McGuffin and Robert (2009) ana- ras Trees: a higher visual variety leads to more dis- lyzed space-efficiency. tinguishable visualizations, the fractal origin of the Indented representations of hierarchies are well- method supports identifying self-similar structures, known from explorable lists of files in file browsers. and the specific layout seems to be particularly suit- Recently, Burch et al. (2010) investigated a variant able for visualizing deep hierarchies. Finally, the as a technique for representing large hierarchies as created images are visually appealing as they show an overview representation. Such a diagram scales analogies to natural tree and branching structures. to very large and deep hierarchies and still shows the hierarchical organization but not as clear as in node-link diagrams. Layered icicle plots (Kruskal and Landwehr, 1983), in contrast, use the concept of 2 RELATED WORK stacking: the root vertex is placed on top and, analo- gous to node-link diagrams, consumes much horizon- The visualization of hierarchical data is a central in- tal space that is as large as all child nodes together. formation visualization problem that has been stud- Treemaps (Shneiderman, 1992), a space-filling ied for many years. Typical respresentations in- approach, are a prominent representative of nesting clude node-link, stacking, nesting, indentation, or techniques for encoding hierarchies. While proper- fractal concepts as surveyed by Jurgensmann¨ and ties of leaf nodes can be easily observed, a limitation Schulz (2010); Schulz (2011). Many variants of the becomes apparent when one tries to explore the hi- general concepts exist, for instance, radial (Battista erarchical structure because it is difficult to retrieve et al., 1999; Eades, 1992) and bubble layouts (Grivet the exact hierarchical information from deeply nested et al., 2004; Lin and Yen, 2007) of node-link dia- boxes: representatives of inner vertices are (nearly) grams, circular approaches for stacking techniques completely covered by descendants. Treemaps have (Andrews and Heidegger, 1998; Stasko and Zhang, been extended to other layout techniques such as 2000; Yang et al., 2003), or nested visualizations Voronoi diagrams (Balzer et al., 2005; Nocaj and based on Voronoi diagrams (Balzer et al., 2005; Nocaj Brandes, 2012) producing aesthetic diagrams that, and Brandes, 2012). however, suffer from high runtime complexity. Although many tree visualizations were proposed Also, 3D approaches have been investigated, for in the past, none provides a generally applicable solu- instance, in Cone Trees (Carriere` and Kazman, 1995), tion and solves all related issues. For example, node- each hierarchy vertex is visually encoded as a cone link diagrams clearly show the hierarchical structure with the apex placed on the circle circumference of by using explicit links in a crossing-free layout. How- the parent. Occlusion problems occur that are solved ever, by showing the node-link diagram in the tradi- by interactive features such as rotation. Botanical tional fashion with the root vertex on top and leaves Trees (Kleiberg et al., 2001), a further 3D approach, at the bottom, much screen space stays unused at the imitate the aesthetics of natural trees but are restricted top while leaves densely agglomerate at the bottom. to binary hierarchies, that is, n-ary hierarchies are Transforming the layout into a radial one distributes modeled as binary trees by the strand model of Holton the nodes more evenly, but makes comparisons of (1994); it becomes harder to detect the parent of a subtrees more difficult. Node-link layouts of hierar- node. chies have been studied in greater detail, for instance, The term fractal was coined by Mandelbrot (1982) Burch et al. (2011) investigated visual task solution and the class of those approaches has also been used 18 GeneralizedPythagorasTreesforVisualizingHierarchies for hierarchy visualization due to their self-similarity property (Koike, 1995; Koike and Yoshihara, 1993). With OneZoom (Rosindell and Harmon, 2012), the authors propose a fractal-based technique for visualizing phylogenetic trees; however, n-ary branches need to be visually translated into binary splits. De- vroye and Kruszewski (1995) visualize random binary hierarchies with a fractal approach as botanical trees; no additional metric value for the vertices is taken into account; instead, they investigate (a) (b) the Horton-Strahler number for computing the branch thicknesses. Figure 2: Illustration of the traditional Pythagoras Tree ap- The goal of our work is to extend a fractal approach: (a) a single binary branch; (b)

Generalized Pythagoras Trees for Visualizing Hierarchies

Using Fractal Dimension for Target Detection in Clutter

Fractal Curves and Complexity

Fractal Dimension of Self-Affine Sets: Some Examples

Fractal Geometry and Applications in Forest Science

An Investigation of Fractals and Fractal Dimension

Fractals Lindenmayer Systems

Math Morphing Proximate and Evolutionary Mechanisms

Fractal Modeling and Fractal Dimension Description of Urban Morphology

New Methods for Estimating Fractal Dimensions of Coastlines

Fractals and Multifractals

Fractals Dalton Allan and Brian Dalke College of Science, Engineering & Technology Nominated by Amy Hlavacek, Associate Professor of Mathematics

Dimensions of Self-Similar Fractals