Simple, Fast, and Scalable Reachability Oracle
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Efficiently Mining Frequent Closed Partial Orders
Efficiently Mining Frequent Closed Partial Orders (Extended Abstract) Jian Pei1 Jian Liu2 Haixun Wang3 Ke Wang1 Philip S. Yu3 Jianyong Wang4 1 Simon Fraser Univ., Canada, fjpei, [email protected] 2 State Univ. of New York at Buffalo, USA, [email protected] 3 IBM T.J. Watson Research Center, USA, fhaixun, [email protected] 4 Tsinghua Univ., China, [email protected] 1 Introduction Account codes and explanation Account code Account type CHK Checking account Mining ordering information from sequence data is an MMK Money market important data mining task. Sequential pattern mining [1] RRSP Retirement Savings Plan can be regarded as mining frequent segments of total orders MORT Mortgage from sequence data. However, sequential patterns are often RESP Registered Education Savings Plan insufficient to concisely capture the general ordering infor- BROK Brokerage mation. Customer Records Example 1 (Motivation) Suppose MapleBank in Canada Cid Sequence of account opening wants to investigate whether there is some orders which cus- 1 CHK ! MMK ! RRSP ! MORT ! RESP ! BROK tomers often follow to open their accounts. A database DB 2 CHK ! RRSP ! MMK ! MORT ! RESP ! BROK in Table 1 about four customers’ sequences of opening ac- 3 MMK ! CHK ! BROK ! RESP ! RRSP counts in MapleBank is analyzed. 4 CHK ! MMK ! RRSP ! MORT ! BROK ! RESP Given a support threshold min sup, a sequential pattern is a sequence s which appears as subsequences of at least Table 1. A database DB of sequences of ac- min sup sequences. For example, let min sup = 3. The count opening. following four sequences are sequential patterns since they are subsequences of three sequences, 1, 2 and 4, in DB. -
Approximating Transitive Reductions for Directed Networks
Approximating Transitive Reductions for Directed Networks Piotr Berman1, Bhaskar DasGupta2, and Marek Karpinski3 1 Pennsylvania State University, University Park, PA 16802, USA [email protected] Research partially done while visiting Dept. of Computer Science, University of Bonn and supported by DFG grant Bo 56/174-1 2 University of Illinois at Chicago, Chicago, IL 60607-7053, USA [email protected] Supported by NSF grants DBI-0543365, IIS-0612044 and IIS-0346973 3 University of Bonn, 53117 Bonn, Germany [email protected] Supported in part by DFG grants, Procope grant 31022, and Hausdorff Center research grant EXC59-1 Abstract. We consider minimum equivalent digraph problem, its max- imum optimization variant and some non-trivial extensions of these two types of problems motivated by biological and social network appli- 3 cations. We provide 2 -approximation algorithms for all the minimiza- tion problems and 2-approximation algorithms for all the maximization problems using appropriate primal-dual polytopes. We also show lower bounds on the integrality gap of the polytope to provide some intuition on the final limit of such approaches. Furthermore, we provide APX- hardness result for all those problems even if the length of all simple cycles is bounded by 5. 1 Introduction Finding an equivalent digraph is a classical computational problem (cf. [13]). The statement of the basic problem is simple. For a digraph G = (V, E), we E use the notation u → v to indicate that E contains a path from u to v and E the transitive closure of E is the relation u → v over all pairs of vertices of V . -
Copyright © 1980, by the Author(S). All Rights Reserved
Copyright © 1980, by the author(s). All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, to republish, to post on servers or to redistribute to lists, requires prior specific permission. ON A CLASS OF ACYCLIC DIRECTED GRAPHS by J. L. Szwarcfiter Memorandum No. UCB/ERL M80/6 February 1980 ELECTRONICS RESEARCH LABORATORY College of Engineering University of California, Berkeley 94720 ON A CLASS OF ACYCLIC DIRECTED GRAPHS* Jayme L. Szwarcfiter** Universidade Federal do Rio de Janeiro COPPE, I. Mat. e NCE Caixa Postal 2324, CEP 20000 Rio de Janeiro, RJ Brasil • February 1980 Key Words: algorithm, depth first search, directed graphs, graphs, isomorphism, minimal chain decomposition, partially ordered sets, reducible graphs, series parallel graphs, transitive closure, transitive reduction, trees. CR Categories: 5.32 *This work has been supported by the Conselho Nacional de Desenvolvimento Cientifico e Tecnologico (CNPq), Brasil, processo 574/78. The preparation ot the manuscript has been supported by the National Science Foundation, grant MCS78-20054. **Present Address: University of California, Computer Science Division-EECS, Berkeley, CA 94720, USA. ABSTRACT A special class of acyclic digraphs has been considered. It contains those acyclic digraphs whose transitive reduction is a directed rooted tree. Alternative characterizations have also been given, including one by forbidden subgraph containment of its transitive closure. For digraphs belonging to the mentioned class, linear time algorithms have been described for the following problems: recognition, transitive reduction and closure, isomorphism, minimal chain decomposition, dimension of the induced poset. -
95-106 Parallel Algorithms for Transitive Reduction for Weighted Graphs
Math. Maced. Vol. 8 (2010) 95-106 PARALLEL ALGORITHMS FOR TRANSITIVE REDUCTION FOR WEIGHTED GRAPHS DRAGAN BOSNAˇ CKI,ˇ WILLEM LIGTENBERG, MAXIMILIAN ODENBRETT∗, ANTON WIJS∗, AND PETER HILBERS Dedicated to Academician Gor´gi´ Cuponaˇ Abstract. We present a generalization of transitive reduction for weighted graphs and give scalable polynomial algorithms for computing it based on the Floyd-Warshall algorithm for finding shortest paths in graphs. We also show how the algorithms can be optimized for memory efficiency and effectively parallelized to improve the run time. As a consequence, the algorithms can be tuned for modern general purpose graphics processors. Our prototype imple- mentations exhibit significant speedups of more than one order of magnitude compared to their sequential counterparts. Transitive reduction for weighted graphs was instigated by problems in reconstruction of genetic networks. The first experiments in that domain show also encouraging results both regarding run time and the quality of the reconstruction. 1. Introduction 0 The concept of transitive reduction for graphs was introduced in [1] and a similar concept was given previously in [8]. Transitive reduction is in a sense the opposite of transitive closure of a graph. In transitive closure a direct edge is added between two nodes i and j, if an indirect path, i.e., not including edge (i; j), exists between i and j. In contrast, the main intuition behind transitive reduction is that edges between nodes are removed if there are also indirect paths between i and j. In this paper we present an extension of the notion of transitive reduction to weighted graphs. -
Masters Thesis: an Approach to the Automatic Synthesis of Controllers with Mixed Qualitative/Quantitative Specifications
An approach to the automatic synthesis of controllers with mixed qualitative/quantitative specifications. Athanasios Tasoglou Master of Science Thesis Delft Center for Systems and Control An approach to the automatic synthesis of controllers with mixed qualitative/quantitative specifications. Master of Science Thesis For the degree of Master of Science in Embedded Systems at Delft University of Technology Athanasios Tasoglou October 11, 2013 Faculty of Electrical Engineering, Mathematics and Computer Science (EWI) · Delft University of Technology *Cover by Orestis Gartaganis Copyright c Delft Center for Systems and Control (DCSC) All rights reserved. Abstract The world of systems and control guides more of our lives than most of us realize. Most of the products we rely on today are actually systems comprised of mechanical, electrical or electronic components. Engineering these complex systems is a challenge, as their ever growing complexity has made the analysis and the design of such systems an ambitious task. This urged the need to explore new methods to mitigate the complexity and to create sim- plified models. The answer to these new challenges? Abstractions. An abstraction of the the continuous dynamics is a symbolic model, where each “symbol” corresponds to an “aggregate” of states in the continuous model. Symbolic models enable the correct-by-design synthesis of controllers and the synthesis of controllers for classes of specifications that traditionally have not been considered in the context of continuous control systems. These include qualitative specifications formalized using temporal logics, such as Linear Temporal Logic (LTL). Be- sides addressing qualitative specifications, we are also interested in synthesizing controllers with quantitative specifications, in order to solve optimal control problems. -
Efficient Graph Reachability Query Answering Using Tree Decomposition
Efficient Graph Reachability Query Answering using Tree Decomposition Fang Wei Computer Science Department, University of Freiburg, Germany Abstract. Efficient reachability query answering in large directed graphs has been intensively investigated because of its fundamental importance in many application fields such as XML data processing, ontology rea- soning and bioinformatics. In this paper, we present a novel indexing method based on the concept of tree decomposition. We show analytically that this intuitive approach is both time and space efficient. We demonstrate empirically the efficiency and the effectiveness of our method. 1 Introduction Querying and manipulating large scale graph-like data has attracted much atten- tion in the database community, due to the wide application areas of graph data, such as GIS, XML databases, bioinformatics, social network, and ontologies. The problem of reachability test in a directed graph is among the fundamental operations on the graph data. Given a digraph G = (V; E) and u; v 2 V , a reachability query, denoted as u ! v, ask: is there a path from u to v? One of the fundamental queries on biological networks is for instance, to find all genes whose expressions are directly or indirectly influenced by a given molecule [15]. Given the graph representation of the genes and regulation events, the question can also be reduced to the reachability query in a directed graph. Recently, tree decomposition methodologies have been successfully applied to solving shortest path query answering over undirected graphs [17]. Briefly stated, the vertices in a graph G are decomposed into a tree in which each node contains a set of vertices in G. -
LNCS 7034, Pp
Confluent Hasse Diagrams DavidEppsteinandJosephA.Simons Department of Computer Science, University of California, Irvine, USA Abstract. We show that a transitively reduced digraph has a confluent upward drawing if and only if its reachability relation has order dimen- sion at most two. In this case, we construct a confluent upward drawing with O(n2)features,inanO(n) × O(n)gridinO(n2)time.Forthe digraphs representing series-parallel partial orders we show how to con- struct a drawing with O(n)featuresinanO(n)×O(n)gridinO(n)time from a series-parallel decomposition of the partial order. Our drawings are optimal in the number of confluent junctions they use. 1 Introduction One of the most important aspects of a graph drawing is that it should be readable: it should convey the structure of the graph in a clear and concise way. Ease of understanding is difficult to quantify, so various proxies for it have been proposed, including the number of crossings and the total amount of ink required by the drawing [1,18]. Thus given two different ways to present information, we should choose the more succinct and crossing-free presentation. Confluent drawing [7,8,9,15,16] is a style of graph drawing in which multiple edges are combined into shared tracks, and two vertices are considered to be adjacent if a smooth path connects them in these tracks (Figure 1). This style was introduced to re- duce crossings, and in many cases it will also Fig. 1. Conventional and confluent improve the ink requirement by represent- drawings of K5,5 ing dense subgraphs concisely. -
Depth-First Search & Directed Graphs
Depth-first Search and Directed Graphs Story So Far • Breadth-first search • Using breadth-first search for connectivity • Using bread-first search for testing bipartiteness BFS (G, s): Put s in the queue Q While Q is not empty Extract v from Q If v is unmarked Mark v For each edge (v, w): Put w into the queue Q The BFS Tree • Can remember parent nodes (the node at level i that lead us to a given node at level i + 1) BFS-Tree(G, s): Put (∅, s) in the queue Q While Q is not empty Extract (p, v) from Q If v is unmarked Mark v parent(v) = p For each edge (v, w): Put (v, w) into the queue Q Spanning Trees • Definition. A spanning tree of an undirected graph G is a connected acyclic subgraph of G that contains every node of G . • The tree produced by the BFS algorithm (with (( u, parent(u)) as edges) is a spanning tree of the component containing s . • The BFS spanning tree gives the shortest path from s to every other vertex in its component (we will revisit shortest path in a couple of lectures) • BFS trees in general are short and bushy Spanning Trees • Definition. A spanning tree of an undirected graph G is a connected acyclic subgraph of G that contains every node of G . • The tree produced by the BFS algorithm (with (( u, parent(u)) as edges) is a spanning tree of the component containing s . • The BFS spanning tree gives the shortest path from s to every other vertex in its component (we will revisit shortest path in a couple of lectures) • BFS trees in general are short and bushy Generalizing BFS: Whatever-First If we change how we store -
An Improved Algorithm for Transitive Closure on Acyclic Digraphs
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Theoretical Computer Science 58 (1988) 325-346 325 North-Holland AN IMPROVED ALGORITHM FOR TRANSITIVE CLOSURE ON ACYCLIC DIGRAPHS Klaus SIMON Fachbereich IO, Angewandte Mathematik und Informatik, CJniversitCt des Saarlandes, 6600 Saarbriicken, Fed. Rep. Germany Abstract. In [6] Goralcikova and Koubek describe an algorithm for finding the transitive closure of an acyclic digraph G with worst-case runtime O(n. e,,,), where n is the number of nodes and ered is the number of edges in the transitive reduction of G. We present an improvement on their algorithm which runs in worst-case time O(k. crud) and space O(n. k), where k is the width of a chain decomposition. For the expected values in the G,,.,, model of a random acyclic digraph with O<p<l we have F(k)=O(y), E(e,,,)=O(n,logn), O(n’) forlog’n/n~p<l. E(k, ercd) = 0( n2 log log n) otherwise, where “log” means the natural logarithm. 1. Introduction A directed graph G = ( V, E) consists of a vertex set V = {1,2,3,. , n} and an edge set E c VX V. Each element (u, w) of E is an edge and joins 2, to w. If G, = (V,, E,) and G, = (V,, E2) are directed graphs, G, is a subgraph of Gz if V, G V, and E, c EZ. The subgraph of G, induced by the subset V, of V, is the graph G, = (V,, E,), where E, is the set of all elements of E, which join pairs of elements of V, . -
Some NP-Complete Problems
Appendix A Some NP-Complete Problems To ask the hard question is simple. But what does it mean? What are we going to do? W.H. Auden In this appendix we present a brief list of NP-complete problems; we restrict ourselves to problems which either were mentioned before or are closely re- lated to subjects treated in the book. A much more extensive list can be found in Garey and Johnson [GarJo79]. Chinese postman (cf. Sect. 14.5) Let G =(V,A,E) be a mixed graph, where A is the set of directed edges and E the set of undirected edges of G. Moreover, let w be a nonnegative length function on A ∪ E,andc be a positive number. Does there exist a cycle of length at most c in G which contains each edge at least once and which uses the edges in A according to their given orientation? This problem was shown to be NP-complete by Papadimitriou [Pap76], even when G is a planar graph with maximal degree 3 and w(e) = 1 for all edges e. However, it is polynomial for graphs and digraphs; that is, if either A = ∅ or E = ∅. See Theorem 14.5.4 and Exercise 14.5.6. Chromatic index (cf. Sect. 9.3) Let G be a graph. Is it possible to color the edges of G with k colors, that is, does χ(G) ≤ k hold? Holyer [Hol81] proved that this problem is NP-complete for each k ≥ 3; this holds even for the special case where k =3 and G is 3-regular. -
An Efficient Reachability Indexing Scheme for Large Directed Graphs
1 Path-Tree: An Efficient Reachability Indexing Scheme for Large Directed Graphs RUOMING JIN, Kent State University NING RUAN, Kent State University YANG XIANG, The Ohio State University HAIXUN WANG, Microsoft Research Asia Reachability query is one of the fundamental queries in graph database. The main idea behind answering reachability queries is to assign vertices with certain labels such that the reachability between any two vertices can be determined by the labeling information. Though several approaches have been proposed for building these reachability labels, it remains open issues on how to handle increasingly large number of vertices in real world graphs, and how to find the best tradeoff among the labeling size, the query answering time, and the construction time. In this paper, we introduce a novel graph structure, referred to as path- tree, to help labeling very large graphs. The path-tree cover is a spanning subgraph of G in a tree shape. We show path-tree can be generalized to chain-tree which theoretically can has smaller labeling cost. On top of path-tree and chain-tree index, we also introduce a new compression scheme which groups vertices with similar labels together to further reduce the labeling size. In addition, we also propose an efficient incremental update algorithm for dynamic index maintenance. Finally, we demonstrate both analytically and empirically the effectiveness and efficiency of our new approaches. Categories and Subject Descriptors: H.2.8 [Database management]: Database Applications—graph index- ing and querying General Terms: Performance Additional Key Words and Phrases: Graph indexing, reachability queries, transitive closure, path-tree cover, maximal directed spanning tree ACM Reference Format: Jin, R., Ruan, N., Xiang, Y., and Wang, H. -
The Surprizing Complexity of Generalized Reachability Games Nathanaël Fijalkow, Florian Horn
The surprizing complexity of generalized reachability games Nathanaël Fijalkow, Florian Horn To cite this version: Nathanaël Fijalkow, Florian Horn. The surprizing complexity of generalized reachability games. 2010. hal-00525762v2 HAL Id: hal-00525762 https://hal.archives-ouvertes.fr/hal-00525762v2 Preprint submitted on 2 Feb 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The surprising complexity of generalized reachability games Nathanaël Fijalkow1,2 and Florian Horn1 1 LIAFA CNRS & Université Denis Diderot - Paris 7, France {nath,florian.horn}@liafa.jussieu.fr 2 ÉNS Cachan École Normale Supérieure de Cachan, France Abstract. Games on graphs provide a natural and powerful model for reactive systems. In this paper, we consider generalized reachability objectives, defined as conjunctions of reachability objectives. We first prove that deciding the winner in such games is PSPACE-complete, although it is fixed-parameter tractable with the number of reachability objectives as parameter. Moreover, we consider the memory requirements for both players and give matching upper and lower bounds on the size of winning strategies. In order to allow more efficient algorithms, we consider subclasses of generalized reachability games. We show that bounding the size of the reachability sets gives two natural subclasses where deciding the winner can be done efficiently.