Matrix Reordering Methods for Table and Network Visualization

EuroVis 2016 Volume 35 (2016), Number 3 R. Maciejewski, T. Ropinski, and A. Vilanova STAR – State of The Art Report (Guest Editors) Matrix Reordering Methods for Table and Network Visualization Michael Behrisch1, Benjamin Bach2, Nathalie Henry Riche3, Tobias Schreck4, Jean-Daniel Fekete5 1 2 3 4 5 6 7 8 9 10 1 1 Universität Konstanz, Germany 2 2Microsoft Research-Inria Joint Centre, France 3 3 4 Microsoft Research, USA 5 4 6 University of Technology Graz, Austria 7 5 8 Inria, France 9 10 (a) (b) 1 2 3 4 5 6 7 8 9 10 1 4 8 7 3 5 2 10 6 9 7 1 3 4 5 8 9 2 6 10 2 1 4 8 3 5 7 6 10 9 1 1 7 2 2 4 1 1 3 8 3 4 4 7 4 8 5 3 5 3 6 5 8 5 7 2 9 7 8 10 2 6 9 6 6 10 10 9 10 9 (a) (b) (c) (d) (e) 1 4 8 7 3 5 2 10 6 9 7 1 3 4 5 8 9 2 6 10 2 1 4 8 3 5 7 6 10 9 1 Figure 1: Visual matrix7 of numerical data (a)2 ordered randomly (b) and with three algorithms (c-e) revealing different patterns. 4 1 1 8 3 4 7 Abstract 4 8 3 5 3 5 This survey provides8 a description of algorithms5 to reorder visual matrices of tabular data and adjacency matrix of networks. 2 9 7 10 The goal of this survey2 is to provide a comprehensive6 list of reordering algorithms published in different fields such as statistics, 6 6 10 \begin{bmatrix} bioinformatics, or graph theory. While several of these algorithms are describedc_{1,1} in& c_{1, publications2} & \dots & \dots & \dots & c_{1 and,n} \\ others are available in 9 10 9 c_{2,1} & c_{2,2} & \dots & \dots & \dots & c_{2,n} \\ software libraries and programs, there is little awareness of what is done across\ alldots & \ fields.dots & \ddots &Our \dots & \dots survey & \dots \\ aims at describing these (c) (d) (e) \dots & \dots & \dots & c_{i,j}& \dots & \dots \\ \dots & \dots & \dots & \dots & \ddots & \dots \\ reordering algorithms in a unified manner to enable a wide audience to understandc_{m,1} their& c_{m,2} & \ differencesdots & \dots & \dots & c_{m and,n}\\ subtleties. We organize this corpus in a consistent manner, independently of the application or research field. We\end{bmatrix also} provide practical guidance on how to select appropriate algorithms depending on the structure and size of the matrix to reorder, and point to implementations when available. Categories and Subject Descriptors (according to ACM CCS): Visualization [Human-centered computing]: Visualization Techniques— \begin{bmatrix} c_{1,1} & c_{1,2} & \dots & \dots & \dots & c_{1,n} \\ c_{2,1} & c_{2,2} & \dots & \dots & \dots & c_{2,n} \\ \dots & \dots & \ddots & \dots & \dots & \dots \\ \dots & \dots & \dots & c_{i,j}& \dots & \dots \\ \dots & \dots & \dots & \dots & \ddots & \dots \\ c_{m,1} & c_{m,2} & \dots & \dots & \dots & c_{m,n}\\ \end{bmatrix} 1. Introduction node-link diagrams do. While low-level tasks do not necessarily require to reorder rows or columns [Ber73, GFC04], higher-level Visual Matrix Matrix Plot A or is a visual representation of tabular tasks, such as identifying groups or highly-connected vertices, do data also used to depict graphs and networks by encoding visually require a reordering of rows, respectively columns, to reveal higher an adjacency matrix. This representation has been used for almost a order patterns. Figure 1 illustrates how a good ordering can reveal century in many domains: biology, neurology, social science, sup- blocks along the diagonal of the matrix, indicating densely con- ply management, transportation and artificial intelligence. While nected groups in the network. The choice of the ordering highly most network exploration tools today leverage node-link diagrams, influences which visual patterns are visible, thus proving a critical several studies demonstrated that matrix representations are more factor for exploration and analysis. effective for several low-level tasks [GFC04] (e.g., finding if two vertices are connected or estimating the network density) and graph manually "the comparison [ABHR∗13], especially when networks become larger Historically, matrix reordering was done , with eye of the artist" and denser. [Ber73] and prove to be an extremely tedious task [Lii10]. The exploration, as well as evaluation of the re- Since matrices represent connections between vertices by a cell sults, was implicitly done in the process [PDF14]. Today, nu- at the intersection of a row (source vertex) and a column (target merous automatic reordering algorithms are described in the lit- vertex), they do not suffer from occlusion and link crossings as erature [Lii10, MML07a]. Such automated matrix reordering ap- c 2016 The Author(s) Computer Graphics Forum c 2016 The Eurographics Association and John Wiley & Sons Ltd. Published by John Wiley & Sons Ltd. M. Behrisch et al. / Matrix Reordering Methods for Table and Network Visualization proaches provide much faster results, the ability to order large ma- For our empirical comparison, we chose graphs from three dif- trices, and sometimes can even seek to reveal specific patterns in the ferent categories: (i) a set of graphs commonly used to benchmark data [WTC08]. In most cases, respective papers report on bench- matrix reordering algorithms [Pet03], (ii) a set of systematically marks of calculation complexity and run-time. Some of them in- generated graphs in order to test for specific graph characteris- cluding pictures of resulting matrices for individual algorithms or tics, such as density or number of clusters, and (iii) a set of real subsets. However the problem of assessing the quality of an order- world graphs (social networks, brain connectivity networks). For ing remains the same; similar to layout algorithms for node-link any of our 150 graphs, we generated an ordered matrix plot which representations of graphs, knowing which patterns are artifacts of we used as input for our performance measures. This let us with the algorithms and which patterns represent structure in the data is a total of 4348 matrix plots as basis for our comparison. Graphs crucial [MBK97]. To the best of our knowledge, there is no coher- and matrices shown in this survey represent a purposeful selec- ent survey of reordering algorithms for matrices available today. tion. The complete browsable collection can be found online at Moreover, there exist no visual comparison or methodology that http://matrixreordering.dbvis.de. allows for evaluating the patterns and artifacts each reordering algorithm produces. 1.3. Outline In this survey, we describe the main algorithms published for the In the reminder of this article, we start with a background on matrix purpose of finding a proper order for the rows and columns com- reordering methods and challenges. We then provide an extensive posing a visual matrix. For each of the six groups of algorithms survey on existing algorithms. We review and divide 35 exemplary we could establish, we illustrate the underlying ordering principles matrix reordering implementations into six major categories, as dis- and discuss pictures of matrices for a set of different graphs. The cussed in Section 3, such as Robinsonian-, Spectral-, Dimension problem of finding a good ordering for the vertices of an adjacency Reduction-, Heuristic-, Graph-Theoretic-, and Bi-Clustering. Then, matrix is known with multiple names: seriation, reordering, linear we report on our comparative analysis and discuss the results. We layout, layout, linear ordering, linear arrangement, numbering, la- conclude with a list of guidelines, which algorithm to choose for beling, and ordering. In this document, we simply call it reorder- specific data sets and characteristics, as well as suggestions for fu- ing, but these terms can be used interchangeably when the context ture studies. is not ambiguous since they are mostly an artifact of their history. However, the multiplicity of terms testifies of the wide variety of 2. Background domains that have studied the problem, and this survey tries to sum- This section introduces definitions and concepts that we rely upon marize and organize this wide diversity. to describe matrix reordering algorithms. The third subsection also While we focus entirely to network data (symmetric matrices), presents 4 visual patterns and 2 anti-patterns one can identify in we found that several ordering methods can similarly be applied visual matrices. We use these patterns to illustrate and compare the to table data. However, table data can produce visual patterns with algorithms. different interpretations and meaning; thus their description and visual analysis remains future work. 2.1. Definitions A graph G is a couple (V;E) where V is a set of vertices, and E is 1.1. Related Surveys a set of edges where: There has been several prior surveys describing matrix reorder- V = fv0;··· ;vng; ing approaches. Liiv’s overview [Lii10] summarizes the most com- E = fe ;··· ;e g;e 2 V 2 (1) mon reordering methods, highlighting the historical developments 0 m in various affected domains. Similarly, Wilkinson and Friendly A directed graph is a graph where the two vertices associated present an abstract of the history of cluster heatmaps [WF09]. Also, with an edge are considered ordered. An undirected graph is a Wilkinson describes in “Grammar of Graphics” [Wil05] the pro- graph where the vertices associated with an edge are not ordered. cess of creating a range of visual base patterns (cf. Section 2.3) and compares four reordering approaches with regards to their re- We use the term network to describe the graph topology as well trieval performance. Influential for our evaluation approach is the as attributes associated with vertices (e.g., labels), and attributes work of Mueller et al., which focuses on interpreting and describing associated with edges (e.g., weights).

Matrix Reordering Methods for Table and Network Visualization

Arxiv:1711.06300V1

"Distance Measures for Graph Theory"

Chapter 4 Introduction to Spectral Graph Theory

MINIMUM DEGREE ENERGY of GRAPHS Dedicated to the Platinum

MATH7502 Topic 6 - Graphs and Networks

Two One-Parameter Special Geometries

Seidel Signless Laplacian Energy of Graphs 1. Introduction

A Short Tutorial on Graph Laplacians, Laplacian Embedding, and Spectral Clustering

The Distance Matrix and Its Variants for Graphs and Digraphs by Carolyn

Eigenvector Localization

35. Finding Clusters in Graphs

Laplacian Matrix and Applications