Theory of Graph Traversal Edit Distance, Extensions, and Applications
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Error-Tolerant Graph Matching: a Formal Framework and Algorithms
Error-Tolerant Graph Matching: A Formal Framework and Algorithms H. Bunke Department of Computer Science,University of Bern, Neubr/ickstr. 10, CH-3012 Bern, Switzerland [email protected] Abstract. This paper first reviews some theoretical results in error- tolerant graph matching that were obtained recently. The results in- clude a new metric for error-tolerant graph matching based on maximum common subgraph, a relation between maximum common subgraph and graph edit distance, and the existence of classes of cost functions for error-tolerant graph matching. Then some new optimal algorithms for error-tolerant graph matching are discussed. Under specific conditions, the new algorithms may be significantly more efficient than traditional methods. Keywords: structural pattern recognition, graphs, graph matching, error-tolerant matching, edit distance, maximum common subgraph, cost function 1 Introduction In structural pattern recognition, graphs are widely used for object representa- tion. Typically, parts of a complex object, or pattern, are represented by nodes, and relations between the various parts by edges. Labels and/or attributes for nodes and edges are used to further distinguish between different classes of nodes and edges, or to incorporate numerical information in the symbolic representa- tion. Using graphs to represent both known models from a database and un- known input patterns, the recognition task turns into a graph matching problem. That is, the database is searched for models that are similar to the unknown input graph. Standard algorithms for graph matching include graph isomorphism, sub- graph isomorphism, and maximum common subgraph search [1, 2]. However, in real world applications we can't always expect a perfect match between the input and one of the graphs in the database. -
Typical Snapshots Selection for Shortest Path Query in Dynamic Road Networks
Typical Snapshots Selection for Shortest Path Query in Dynamic Road Networks Mengxuan Zhang, Lei Li, Wen Hua, Xiaofang Zhou School of Information Technology and Electrical Engineering The University of Queensland, Brisbane, Australia [email protected], fl.li3, [email protected],[email protected] Abstract. Finding the shortest paths in road network is an important query in our life nowadays, and various index structures are constructed to speed up the query answering. However, these indexes can hardly work in real-life scenario because the traffic condition changes dynamically, which makes the pathfinding slower than in the static environment. In order to speed up path query answering in the dynamic road network, we propose a framework to support these indexes. Firstly, we view the dynamic graph as a series of static snapshots. After that, we propose two kinds of methods to select the typical snapshots. The first kind is time-based and it only considers the temporal information. The second category is the graph representation-based, which considers more insights: edge-based that captures the road continuity, and vertex-based that reflects the region traffic fluctuation. Finally, we propose the snapshot matching to find the most similar typical snapshot for the current traffic condition and use its index to answer the query directly. Extensive experiments on real-life road network and traffic conditions validate the effectiveness of our approach. Keywords: Shortest Path, Snapshot Selection, Dynamic Road Network 1 Introduction Shortest path query is a fundamental operation in road network routing and navigation. A road network can be denoted as a directed graph G(V; E) where V is the set of road intersections, and E ⊆ V × V is the set of road segments. -
On the Unification of the Graph Edit Distance and Graph Matching Problems Romain Raveaux
On the unification of the graph edit distance and graph matching problems Romain Raveaux To cite this version: Romain Raveaux. On the unification of the graph edit distance and graph matching problems. Pattern Recognition Letters, Elsevier, 2021, 145, 10.1016/j.patrec.2021.02.014. hal-03163084v2 HAL Id: hal-03163084 https://hal.archives-ouvertes.fr/hal-03163084v2 Submitted on 19 Mar 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. On the unification of the graph edit distance and graph matching problems Romain Raveaux1 1Universit´ede Tours, Laboratoire d'Informatique Fondamentale et Appliqu´eede Tours (LIFAT - EA 6300), 64 Avenue Jean Portalis, 37000 Tours, France 7th March 2020 Abstract Error-tolerant graph matching gathers an important family of problems. These problems aim at finding correspondences between two graphs while integrating an error model. In the Graph Edit Distance (GED) problem, the insertion/deletion of edges/nodes from one graph to another is explicitly expressed by the error model. At the opposite, the problem commonly referred to as \graph matching" does not explicitly express such operations. For decades, these two problems have split the research community in two separated parts. -
Eindhoven University of Technology BACHELOR the Chinese Postman Problem in Undirected and Directed Graphs Verberk, Lucy P.A
Eindhoven University of Technology BACHELOR The Chinese postman problem in undirected and directed graphs Verberk, Lucy P.A. Award date: 2019 Link to publication Disclaimer This document contains a student thesis (bachelor's or master's), as authored by a student at Eindhoven University of Technology. Student theses are made available in the TU/e repository upon obtaining the required degree. The grade received is not published on the document as presented in the repository. The required complexity or quality of research of student theses may vary by program, and the required minimum study period may vary in duration. General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. • Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain Eindhoven University of Technology Applied Mathematics Combinitorial Optimization The Chinese Postman Problem in undirected and directed graphs Bachelor Final project Author: Supervisor: Lucy Verberk Dr. Judith Keijsper July 11, 2019 Abstract In this report the Chinese Postman Problem (CPP) for undirected, directed and mixed graphs will be considered. Solution methods for the undirected and directed CPP will be discussed. For the mixed CPP, some suggestions for further research are mentioned. An application of routing gritters and snow-shovel trucks in the city of Eindhoven in the Netherlands will be con- sidered. -
A Binary Linear Programming Formulation of the Graph Edit Distance
1200 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 28, NO. 8, AUGUST 2006 A Binary Linear Programming Formulation of the Graph Edit Distance Derek Justice and Alfred Hero, Fellow, IEEE Abstract—A binary linear programming formulation of the graph edit distance for unweighted, undirected graphs with vertex attributes is derived and applied to a graph recognition problem. A general formulation for editing graphs is used to derive a graph edit distance that is proven to be a metric, provided the cost function for individual edit operations is a metric. Then, a binary linear program is developed for computing this graph edit distance, and polynomial time methods for determining upper and lower bounds on the solution of the binary program are derived by applying solution methods for standard linear programming and the assignment problem. A recognition problem of comparing a sample input graph to a database of known prototype graphs in the context of a chemical information system is presented as an application of the new method. The costs associated with various edit operations are chosen by using a minimum normalized variance criterion applied to pairwise distances between nearest neighbors in the database of prototypes. The new metric is shown to perform quite well in comparison to existing metrics when applied to a database of chemical graphs. Index Terms—Graph algorithms, similarity measures, structural pattern recognition, graphs and networks, linear programming, continuation (homotopy) methods. æ 1INTRODUCTION TTRIBUTED graphs provide convenient structures for chemicals or medicines [6]. Various techniques have been Arepresenting objects when relational properties are of designed for processing the graphs for structural features and interest. -
Bayesian Graph Edit Distance
This is a repository copy of Bayesian graph edit distance. White Rose Research Online URL for this paper: https://eprints.whiterose.ac.uk/1988/ Article: Myers, R., Hancock, E.R. orcid.org/0000-0003-4496-2028 and Wilson, R.C. (2000) Bayesian graph edit distance. IEEE Transactions on Pattern Analysis and Machine Intelligence. pp. 628-635. ISSN 0162-8828 https://doi.org/10.1109/34.862201 Reuse Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of the full text version. This is indicated by the licence information on the White Rose Research Online record for the item. Takedown If you consider content in White Rose Research Online to be in breach of UK law, please notify us by emailing [email protected] including the URL of the record and the reason for the withdrawal request. [email protected] https://eprints.whiterose.ac.uk/ 628 IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 22, NO. 6, JUNE 2000 Bayesian Graph Edit Distance strings [10]. This idea has been extended to form a basis for comparing trees and graphs on a global level [11], [12], [1], [7]. Richard Myers, Richard C. Wilson, and More recently, the idea of actively editing graphs during the matching process to eliminate relational clutter has proved very Edwin R. Hancock successful [13], [8], [14]. -
Delivery Optimization in Logistics Using Advanced Analytics and Interactive Visualization*
Proceedings_____________________________________________________________________________________________________ of the Central European Conference on Information and Intelligent Systems 35 Delivery Optimization in Logistics Using Advanced Analytics and Interactive Visualization* Leo Mršić Algebra University College Zagreb, Croatia [email protected] Abstract. Paper is based on research and PoC with called a Eulerian Cycle. Finding such a cycle is more goal to unlock business value in delivery optimization, “feasible” and can be used efficiently to create business using advanced analytics and interactive visualization. value. This problem was first solved Mei-Ko Kuan, a There are several common approaches including Chinese mathematician, in 1962 [Kuan, 1962]. He was travelling salesman problem (TSP) that has various first one who consider this problem from the practical applications even in its purest formulation, such as perspective of a postman picking up mail at the post planning or logistics. Knowledge base powered by office, delivering it along a set of streets, and returning large data set is described through development and to the post office. Since he must follow route at least final, interactive, form. Using common principles, once, the problem is referred to as "Chinese" postman paper describe business value that can be extracted problem to investigate how to cover every street along using advanced analytics and interactive visualization, path and return to the post office under the least cost. how to structure steps as best practice to achieve In practice, there are a lot of applications of this success and what additional benefits can be expected technique such as road maintenance, waste collection, by supporting this approach. As part of research, bus scheduling etc. -
A Survey of Graph Edit Distance
Pattern Anal Applic (2010) 13:113–129 DOI 10.1007/s10044-008-0141-y THEORETICAL ADVANCES A survey of graph edit distance Xinbo Gao Æ Bing Xiao Æ Dacheng Tao Æ Xuelong Li Received: 15 November 2007 / Accepted: 16 October 2008 / Published online: 13 January 2009 Ó Springer-Verlag London Limited 2009 Abstract Inexact graph matching has been one of the 1 Originality and contribution significant research foci in the area of pattern analysis. As an important way to measure the similarity between pair- Graph edit distance is an important way to measure the wise graphs error-tolerantly, graph edit distance (GED) is similarity between pairwise graphs error-tolerantly in the base of inexact graph matching. The research advance inexact graph matching and has been widely applied to of GED is surveyed in order to provide a review of the pattern analysis and recognition. However, there is scarcely existing literatures and offer some insights into the studies any survey of GED algorithms up to now, and therefore of GED. Since graphs may be attributed or non-attributed this paper is novel to a certain extent. The contribution of and the definition of costs for edit operations is various, the this paper focuses on the following aspects: existing GED algorithms are categorized according to these On the basis of authors’ effort for studying almost all two factors and described in detail. After these algorithms GED algorithms, authors firstly expatiate on the idea of are analyzed and their limitations are identified, several GED with illustrations in order to make the explanation promising directions for further research are proposed. -
A Metric Learning Approach to Graph Edit Costs for Regression Linlin Jia, Benoit Gaüzère, Florian Yger, Paul Honeine
A Metric Learning Approach to Graph Edit Costs for Regression Linlin Jia, Benoit Gaüzère, Florian Yger, Paul Honeine To cite this version: Linlin Jia, Benoit Gaüzère, Florian Yger, Paul Honeine. A Metric Learning Approach to Graph Edit Costs for Regression. Proceedings of IAPR Joint International Workshops on Statistical techniques in Pattern Recognition (SPR 2020) and Structural and Syntactic Pattern Recognition (SSPR 2020), Jan 2021, Venise, Italy. hal-03128664 HAL Id: hal-03128664 https://hal-normandie-univ.archives-ouvertes.fr/hal-03128664 Submitted on 2 Feb 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. A Metric Learning Approach to Graph Edit Costs for Regression Linlin Jia1;4, Benoit Ga¨uz`ere1;4, Florian Yger2?, and Paul Honeine3;4 1 LITIS Lab, INSA Rouen Normandie, France 2 LAMSADE, Universit´eParis Dauphine-PSL, France 3 LITIS Lab, Universit´ede Rouen Normandie, France 4 Normandie Universit´e,France Abstract. Graph edit distance (GED) is a widely used dissimilarity measure between graphs. It is a natural metric for comparing graphs and respects the nature of the underlying space, and provides interpretability for operations on graphs. As a key ingredient of the GED, the choice of edit cost functions has a dramatic effect on the GED and therefore the classification or regression performances. -
Graph Edit Networks
Published as a conference paper at ICLR 2021 GRAPH EDIT NETWORKS Benjamin Paassen Daniele Grattarola The University of Sydney Università della Svizzera italiana [email protected] [email protected] Daniele Zambon Cesare Alippi Università della Svizzera italiana Università della Svizzera italiana [email protected] Politecnico di Milano [email protected] Barbara Hammer Bielefeld University [email protected] ABSTRACT While graph neural networks have made impressive progress in classification and regression, few approaches to date perform time series prediction on graphs, and those that do are mostly limited to edge changes. We suggest that graph edits are a more natural interface for graph-to-graph learning. In particular, graph edits are general enough to describe any graph-to-graph change, not only edge changes; they are sparse, making them easier to understand for humans and more efficient computationally; and they are local, avoiding the need for pooling layers in graph neural networks. In this paper, we propose a novel output layer - the graph edit network - which takes node embeddings as input and generates a sequence of graph edits that transform the input graph to the output graph. We prove that a mapping between the node sets of two graphs is sufficient to construct training data for a graph edit network and that an optimal mapping yields edit scripts that are almost as short as the graph edit distance between the graphs. We further provide a proof-of-concept empirical evaluation on several graph dynamical systems, which are difficult to learn for baselines from the literature. -
Everystreet Algorithm
#everystreet algorithm Matej Kerekrety [email protected] www.everystreetchallenge.com August 26, 2020 Abstract A simple application of Chinese postman problem on #everystreet run- ning challenge, where a runner attempts to visit every street of given area at least once. In order to run the challenge optimally and avoid unnec- essary detours and running same street twice (or more often) we used an algorithm based on maximal matching which gives us an induced eulerian multigraph where optimal Euler route exists. Keywords: graph theory, Chinese Postman Problem, maximal matching, every street challenge, running 1 Motivation Over the COVID-19 lockdown many runners and cyclist found themselves cap- tured within their cities. Some of them ended up running or spinning endless hours on virtual races such as Zwift, but some as e.g. Mark Beaumont [1] de- cided to join #everystreet challenge. Every street is a challenge originated by Rickey Gates [2, 3] who run every single street in city of San Francisco in fall 2018 which took him 46 days and run 1,303 miles. Inspired by Mark Beaumont who did this challenge over the lockdown in Edinburgh, place where I spend the lockdown, and literally everybody else who managed to accomplish this challenge.1 We tried to find a algorithm finding an optimal route for given street network area starting and ending in the same point. 2 Every street challenge Rules of every street challenge are run or cycle2 every single street of given (metropolitan) area which is usually a city or a neighborhood. Till this point 1I would have never had the patience and motivation to run that much in the city. -
Online Assessment of Graph Theory
ONLINE ASSESSMENT OF GRAPH THEORY A thesis submitted for the degree of Doctor of Philosophy By Justin Dale Hatt College of Engineering, Design and Physical Sciences Brunel University August 2016 Abstract The objective of this thesis is to establish whether or not online, objective questions in elementary graph theory can be written in a way that exploits the medium of computer-aided assessment. This required the identification and resolution of question design and programming issues. The resulting questions were trialled to give an extensive set of answer files which were analysed to identify whether computer delivery affected the questions in any adverse ways and, if so, to identify practical ways round these issues. A library of questions spanning commonly-taught topics in elementary graph theory has been designed, programmed and added to the graph theory topic within an online assessment and learning tool used at Brunel University called Mathletics. Distracters coded into the questions are based on errors students are likely to make, partially evidenced by final examination scripts. Questions were provided to students in Discrete Mathematics modules with an extensive collection of results compiled for analysis. Questions designed for use in practice environments were trialled on students from 2007 – 2008 and then from 2008 to 2014 inclusive under separate testing conditions. Particular focus is made on the relationship of facility and discrimination between comparable questions during this period. Data is grouped between topic and also year group for the 2008 – 2014 tests, namely 2008 to 2011 and 2011 to 2014, so that it may then be determined what factors, if any, had an effect on the overall results for these questions.