Computing Optimal Assignments in Linear Time for Approximate Graph Matching

Total Page:16

File Type:pdf, Size:1020Kb

Computing Optimal Assignments in Linear Time for Approximate Graph Matching Computing Optimal Assignments in Linear Time for Approximate Graph Matching Nils M. Kriege∗, Pierre-Louis Giscardy, Franka Bause∗, Richard C. Wilsonz ∗Department of Computer Science, TU Dortmund University, Dortmund, Germany fnils.kriege, [email protected] yLMPA Joseph Liouville, Universite´ Littoral Coteˆ d’Opale, Calais, France [email protected] zDepartment of Computer Science, University of York, York, United Kingdom [email protected] Abstract—Finding an optimal assignment between two sets of features. However, in both cases, the method is not ideal, objects is a fundamental problem arising in many applications, since each feature corresponds to a specific element in the data including the matching of ‘bag-of-words’ representations in object, and so can correspond to no more than one element natural language processing and computer vision. Solving the assignment problem typically requires cubic time and its pairwise in a second data object. The co-occurrence counting method computation is expensive on large datasets. In this paper, we allows each feature to match to multiple features in the other develop an algorithm which can find an optimal assignment in dataset. linear time when the cost function between objects is represented A different approach is to explicitly find the best correspon- by a tree distance. We employ the method to approximate the edit dence between the features of two bags. This can be achieved distance between two graphs by matching their vertices in linear 3 time. To this end, we propose two tree distances, the first of which by solving the (linear) assignment problem in O(n ) time us- reflects discrete and structural differences between vertices, and ing Hungarian-type algorithms [4]. When we assume weights the second of which can be used to compare continuous labels. to be integers within the range of [0;N], scaling algorithm We verify the effectiveness and efficiency of our methods using become applicable such as [5], which requires O(n2:5 log N) synthetic and real-world datasets. time. Several authors studied a geometric version of the Index Terms—assignment problem, graph matching, graph edit d distance, tree distance problem, where the objects are points in R and the total distance is to be minimised. No subquadratic exact algorithm for this task is known, but efficient approximation algorithms I. INTRODUCTION exist [6]. This is also the case for various other problem Vast amounts of data are now available for machine learn- variants, see [7] and references therein. These algorithms are ing, including text documents, images, graphs and many typically involved and focus on theoretical guarantees. For more. Learning from such data typically involves computing practical applications often standard algorithms with cubic a similarity or distance function between the data objects. running time, simple greedy strategies or methods specifically Since many of these datasets are large, the efficiency of the designed for one task are used. comparison methods is critical. This is particularly challenging We briefly summarise the use of assignment methods for since taking the structure adequately into account often is a comparing structured data in machine learning with a focus hard problem. For example, no polynomial-time algorithms on graphs. While most kernels for such data are based on are known even for the basic task to decide whether two co-occurrence counting, there has been growing interest in graphs have the same structure [1]. Therefore, the pragmatic deriving kernels from optimal assignments in recent years. arXiv:1901.10356v2 [cs.LG] 10 Sep 2019 approach of describing such data with a ‘bag-of-words’ or The pyramid match kernel was proposed to approximate ‘bag-of-features’ is commonly used. In this representation, a correspondences between bags of features in Rd by employing series of objects are identified in the data and each object is a space-partitioning tree structure and counting how often described by a label or feature. The labels are placed in a bag points fall into the same bin [8]. For graphs, the optimal where the order in which they appear does not matter. assignment kernel was proposed, which establishes a cor- In the most basic form, such bags can be represented by respondence between the vertices of two graphs using the histograms or feature vectors, two of which are compared Hungarian algorithm [9]. However, it was soon realised that by counting the number of co-occurrences of a label in the this approach does not lead to valid kernels in the general bags. This is a common approach not only for images and case [2]. Therefore, Johansson and Dubhashi [10] derived text, but also for graph comparison, where a number of graph kernels from optimal assignments by first sampling a fixed kernels have been proposed which use different substructures set of so-called landmarks and representing graphs by their as elements of the bag [2, 3]. In the more general case, two optimal assignment similarities to landmarks. Kriege et al. bags of features are compared by summing over all pairs [11] demonstrated that a specific choice of a weight function of features weighted by a similarity function between the (derived from a hierarchy) does in fact generate a valid kernel © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. for the optimal assignment method and allows computation in representing cost functions: (i) based on Weisfeiler-Lehman linear time. The approach is not designed to actually construct refinement to quantify discrete and structural differences, the assignment. (ii) based on hierarchical clustering for continuous vertex The term graph matching refers to a diverse set of tech- attributes. We show experimentally that our linear time as- niques which typically establish correspondences between the signment algorithm scales to large graphs and datasets. Our vertices (and edges) of two graphs [12, 13]. This task is closely approach outperforms both exact and approximate methods related to the classical NP-hard maximum common subgraph for computing the graph edit distance in terms of running problem, which asks for the largest graph that is contained as time and provides state-of-the-art classification accuracy. For subgraph in two given graphs [1]. Exact algorithms for this some datasets with discrete labels our method even beats these problem haven been studied extensively, e.g., for applications approaches in terms of accuracy. in cheminformatics [14], where small molecular graphs with about 20 vertices are compared. The term network alignment II. FUNDAMENTALS is commonly used in bioinformatics, where large networks We summarise basic concepts and results on graphs, tree with thousands of vertices are compared such as protein- distances, the assignment problem and the graph edit distance. protein interaction networks. Methods applicable to such graphs typically cannot guarantee that an optimal solution is A. Graph theory found and often solve the assignment problem as a subroutine, An undirected graph G = (V; E) consists of a finite set e.g., [15, 16]. In the following we will focus on the graph V (G) = V of vertices and a finite set E(G) = E of edges, edit distance, which is one of the most widely accepted where each edge connects two distinct vertices. We denote approaches to graph matching with applications ranging from an edge connecting a vertex u and a vertex v by uv or vu, cheminformatics to computer vision [17]. It is defined as the where both refers to the same edge. Two vertices u and v are minimum cost of edit operations required to transform one said to be adjacent if uv 2 E and referred to as endpoints graph into another graph. The concept has been proposed of the edge uv. The vertices adjacent to a vertex v in G are for pattern recognition tasks more than 30 years ago [18]. denoted by NG(v) = fu 2 V (G) j uv 2 E(G)g and referred However, its computation is NP-hard, since it generalises the to as neighbours of v.A path of length n is a sequence of maximum common subgraph problem [19]. The graph edit vertices (v0; : : : ; vn) such that vivi+1 2 E for 0 ≤ i < n.A distance is closely related to the notoriously hard quadratic weighted graph is a graph G endowed with a weight function assignment problem [20]. Recently several elaborated exact al- w : E(G) ! R. The length of a path in a weighted graph gorithms for computing the graph edit distance have been pro- refers to the sum of weights of the edges contained in the path. posed [21, 22, 23]. Binary linear programming formulations A graph G0 = (V 0;E0) is a subgraph of a graph G = (V; E), in combination with highly-optimised general purpose solvers written G0 ⊆ G, if V 0 ⊆ V and E0 ⊆ E. Let V 0 ⊆ V , are among the most efficient approaches, but are still limited to then G0 = (V 0;E0) with E0 = fuv 2 E j u; v 2 V 0g is small graphs [22]. Even a restricted special case of the graph said to be the subgraph induced by V 0 in G and is denoted edit distance is APX-hard [24], i.e., there is a constant c > 1, by G[V 0]. An isomorphism between two graphs G and H is such that no polynomial-time algorithm can approximate it a bijection : V (G) ! V (H) such that uv 2 E(G) , within the factor c, unless P=NP.
Recommended publications
  • Linear Sum Assignment with Edition Sébastien Bougleux, Luc Brun
    Linear Sum Assignment with Edition Sébastien Bougleux, Luc Brun To cite this version: Sébastien Bougleux, Luc Brun. Linear Sum Assignment with Edition. [Research Report] Normandie Université; GREYC CNRS UMR 6072. 2016. hal-01288288v3 HAL Id: hal-01288288 https://hal.archives-ouvertes.fr/hal-01288288v3 Submitted on 21 Mar 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution - NoDerivatives| 4.0 International License Linear Sum Assignment with Edition S´ebastienBougleux and Luc Brun Normandie Universit´e GREYC UMR 6072 CNRS - Universit´ede Caen Normandie - ENSICAEN Caen, France March 21, 2016 Abstract We consider the problem of transforming a set of elements into another by a sequence of elementary edit operations, namely substitutions, removals and insertions of elements. Each possible edit operation is penalized by a non-negative cost and the cost of a transformation is measured by summing the costs of its operations. A solution to this problem consists in defining a transformation having a minimal cost, among all possible transformations. To compute such a solution, the classical approach consists in representing removal and insertion operations by augmenting the two sets so that they get the same size.
    [Show full text]
  • Asymptotic Moments of the Bottleneck Assignment Problem
    Asymptotic Moments of the Bottleneck Assignment Problem Michael Z. Spivey Department of Mathematics and Computer Science, University of Puget Sound, Tacoma, WA 98416-1043 email: [email protected] http://math.pugetsound.edu/~mspivey/ One of the most important variants of the standard linear assignment problem is the bottleneck assignment problem. In this paper we give a method by which one can find all of the asymptotic moments of a random bottleneck assignment problem in which costs (independent and identically distributed) are chosen from a wide variety of continuous distributions. Our method is obtained by determining the asymptotic moments of the time to first complete matching in a random bipartite graph process and then transforming those, via a Maclaurin series expansion for the inverse cumulative distribution function, into the desired moments for the bottleneck assignment problem. Our results improve on the previous best-known expression for the expected value of a random bottleneck assignment problem, yield the first results on moments other than the expected value, and produce the first results on the moments for the time to first complete matching in a random bipartite graph process. Key words: bottleneck assignment problem; random assignment problem; asymptotic analysis; probabilistic anal- ysis; complete matching; degree 1; random bipartite graph MSC2000 Subject Classification: Primary: 90C47, 90C27, 41A60; Secondary: 60C05, 41A58 OR/MS subject classification: Primary: networks/graphs { theory, programming { integer { theory; Secondary: mathematics { combinatorics, mathematics { matrices 1. Introduction. The behavior of random assignment problems has been the subject of much study ∗ in the last few years. One of the most well-known results, due to Aldous [2], is that if Zn is the opti- mal value of an n × n random linear assignment problem with independent and identically distributed ∗ 2 (iid) exponential(1) costs then limn!1 E[Zn] = (2) = =6.
    [Show full text]
  • The Optimal Assignment Problem: an Investigation Into Current Solutions, New Approaches and the Doubly Stochastic Polytope
    The Optimal Assignment Problem: An Investigation Into Current Solutions, New Approaches and the Doubly Stochastic Polytope Frans-Willem Vermaak A Dissertation submitted to the Faculty of Engineering, University of the Witwatersand, Johannesburg, in fulfilment of the requirements for the degree of Master of Science in Engineering Johannesburg 2010 Declaration I declare that this research dissertation is my own, unaided work. It is being submitted for the Degree of Master of Science in Engineering in the University of the Witwatersrand, Johannesburg. It has not been submitted before for any degree or examination in any other University. signed this day of 2010 ii Abstract This dissertation presents two important results: a novel algorithm that approximately solves the optimal assignment problem as well as a novel method of projecting matrices into the doubly stochastic polytope while preserving the optimal assignment. The optimal assignment problem is a classical combinatorial optimisation problem that has fuelled extensive research in the last century. The problem is concerned with a matching or assignment of elements in one set to those in another set in an optimal manner. It finds typical application in logistical optimisation such as the matching of operators and machines but there are numerous other applications. In this document a process of iterative weighted normalization applied to the benefit matrix associated with the Assignment problem is considered. This process is derived from the application of the Computational Ecology Model to the assignment problem and referred to as the OACE (Optimal Assignment by Computational Ecology) algorithm. This simple process of iterative weighted normalisation converges towards a matrix that is easily converted to a permutation matrix corresponding to the optimal assignment or an assignment close to optimality.
    [Show full text]
  • He Stable Marriage Problem, and LATEX Dark Magic
    UCLA UCLA Electronic Theses and Dissertations Title Distributed Almost Stable Matchings Permalink https://escholarship.org/uc/item/6pr8d66m Author Rosenbaum, William Bailey Publication Date 2016 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California UNIVERSITY OF CALIFORNIA Los Angeles Distributed Almost Stable Matchings A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Mathematics by William Bailey Rosenbaum óþÕä © Copyright by William Bailey Rosenbaum óþÕä ABSTRACT OF THE DISSERTATION Distributed Almost Stable Matchings by William Bailey Rosenbaum Doctor of Philosophy in Mathematics University of California, Los Angeles, óþÕä Professor Rafail Ostrovsky, Chair e Stable Marriage Problem (SMP) is concerned with the follow scenario: suppose we have two disjoint sets of agents—for example prospective students and colleges, medical residents and hospitals, or potential romantic partners—who wish to be matched into pairs. Each agent has preferences in the form of a ranking of her potential matches. How should we match agents based on their preferences? We say that a matching is stable if no unmatched pair of agents mutually prefer each other to their assigned partners. In their seminal work on the SMP, Gale and Shapley [ÕÕ] prove that a stable matching exists for any preferences. ey further describe an ecient algorithm for nding a stable matching. In this dissertation, we consider the computational complexity of the SMP in the distributed setting, and the complexity of nding “almost sta- ble” matchings. Highlights include (Õ) communication lower bounds for nding stable and almost stable matchings, (ó) a distributed algorithm which nds an almost stable matching in polylog time, and (ì) hardness of approximation results for three dimensional analogues of the SMP.
    [Show full text]
  • The Assignment Problem and Totally Unimodular Matrices
    Methods and Models for Combinatorial Optimization The assignment problem and totally unimodular matrices L.DeGiovanni M.DiSumma G.Zambelli 1 The assignment problem Let G = (V,E) be a bipartite graph, where V is the vertex set and E is the edge set. Recall that “bipartite” means that V can be partitioned into two disjoint subsets V1, V2 such that, for every edge uv ∈ E, one of u and v is in V1 and the other is in V2. In the assignment problem we have |V1| = |V2| and there is a cost cuv for every edge uv ∈ E. We want to select a subset of edges such that every node is the end of exactly one selected edge, and the sum of the costs of the selected edges is minimized. This problem is called “assignment problem” because selecting edges with the above property can be interpreted as assigning each node in V1 to exactly one adjacent node in V2 and vice versa. To model this problem, we define a binary variable xuv for every u ∈ V1 and v ∈ V2 such that uv ∈ E, where 1 if u is assigned to v (i.e., edge uv is selected), x = uv 0 otherwise. The total cost of the assignment is given by cuvxuv. uvX∈E The condition that for every node u in V1 exactly one adjacent node v ∈ V2 is assigned to v1 can be modeled with the constraint xuv =1, u ∈ V1, v∈VX2:uv∈E while the condition that for every node v in V2 exactly one adjacent node u ∈ V1 is assigned to v2 can be modeled with the constraint xuv =1, v ∈ V2.
    [Show full text]
  • Linear Sum Assignment with Edition
    Linear Sum Assignment with Edition S´ebastienBougleux and Luc Brun Normandie Universit´e GREYC UMR 6072 CNRS - Universit´ede Caen Normandie - ENSICAEN Caen, France October 10, 2018 Abstract We consider the problem of transforming a set of elements into another by a sequence of elementary edit operations, namely substitutions, removals and insertions of elements. Each possible edit operation is penalized by a non-negative cost and the cost of a transformation is measured by summing the costs of its operations. A solution to this problem consists in defining a transformation having a minimal cost, among all possible transformations. To compute such a solution, the classical approach consists in representing removal and insertion operations by augmenting the two sets so that they get the same size. This allows to express the problem as a linear sum assignment problem (LSAP), which thus finds an optimal bijection (or permutation, perfect matching) between the two augmented sets. While the LSAP is known to be efficiently solvable in polynomial time complexity, for instance with the Hungarian algorithm, useless time and memory are spent to treat the elements which have been added to the initial sets. In this report, we show that the problem can be formalized as an extension of the LSAP which considers only one additional element in each set to represent removal and insertion operations. A solution to the problem is no longer represented as a bijection between the two augmented sets. We show that the considered problem is a binary linear program (BLP) very close to the LSAP. While it can be solved by any BLP solver, we propose an adaptation of the Hungarian algorithm which improves the time and memory complexities previously obtained by the approach based on the LSAP.
    [Show full text]
  • Assignment Problems
    Assignment Problems y Eranda C ela Abstract Assignment problems arise in dierent situations where we have to nd an optimal way to assign n ob jects to m other ob jects in an injective fashion Dep ending on the ob jective we want to optimize we obtain dierent problems ranging from linear assignment problems to quadratic and higher dimensional assignment problems The assignment problems are a well studied topic in combinatorial opti mization These problems nd numerous application in pro duction planning telecommunication VLSI design economics etc We intro duce the basic problems classied into three groups linear as signment problems three and higher dimensional assignment problems and quadratic assignment problems and problems related to it For each group of problems we mention some applications show some basic prop erties and de scrib e briey some of the most successful algorithms used to solve these prob lems Intro duction Assignment problems deal with the question how to assign n ob jects to m other ob jects in an injective fashion in the b est p ossible way An assignment problem is completely sp ecied by its two comp onents the assignments which represent the underlying combinatorial structure and the ob jective function to b e optimized which mo dels the b est p ossible way In the classical assignment problem one has m n and most of the problems with m n can b e transformed or are strongly related to analogous problems with m n Therefore we will consider m n through the rest of this chapter unless otherwise sp ecied This research
    [Show full text]
  • Revised Note on Learning Algorithms for Quadratic Assignment With
    REVISED NOTE ON LEARNING QUADRATIC ASSIGNMENT WITH GRAPH NEURAL NETWORKS Alex Nowak†,§, Soledad Villar‡,∗,§, Afonso S Bandeira‡,∗ and Joan Bruna‡,∗ † Sierra Team, INRIA and Ecole Normale Superieure, Paris ‡ Courant Institute, New York University, New York ∗ Center for Data Science, New York University, New York ABSTRACT Instead of investigating a designed algorithm for the problem in question, we consider a data-driven approach to learn algorithms Inverse problems correspond to a certain type of optimization from solved instances of the problem. In other words, given a collec- problems formulated over appropriate input distributions. Recently, tion (xi,yi)i≤L of problem instances drawn from a certain distribu- there has been a growing interest in understanding the computational tion, we ask whether one can learn an algorithm that achieves good hardness of these optimization problems, not only in the worst case, accuracy at solving new instances of the same problem – also being but in an average-complexity sense under this same input distribu- drawn from the same distribution, and to what extent the resulting tion. algorithm can reach those statistical/computational thresholds. In this revised note, we are interested in studying another aspect The general approach is to cast an ensemble of algorithms as of hardness, related to the ability to learn how to solve a problem by neural networks yˆ = Φ(x; θ) with specific architectures that en- simply observing a collection of previously solved instances. These code prior knowledge on the algorithmic class, parameterized by ‘planted solutions’ are used to supervise the training of an appropri- θ ∈ RS . The network is trained to minimize the empirical loss ate predictive model that parametrizes a broad class of algorithms, −1 L(θ)= L i ℓ(yi, Φ(xi; θ)), for a given measure of error ℓ, us- with the hope that the resulting model will provide good accuracy- ing stochastic gradient descent.
    [Show full text]
  • Integer Linear Programming Modeling
    DM545 Linear and Integer Programming Lecture 8 Integer Linear Programming Modeling Marco Chiarandini Department of Mathematics & Computer Science University of Southern Denmark Integer Programming Outline Modeling 1. Integer Programming 2. Modeling Assignment Problem Knapsack Problem Set Covering Graph Problems 2 Integer Programming LP: Rational Solutions Modeling I A precise analysis of running time for an algorithm includes the number of bit operations together with number of arithmetic operations I Strongly polynomial algorithms: the running time of the algorithm is independent on the number of bit operations. Eg: same running time for input numbers with 10 bits as for inputs with a million bits. I No strongly polynomial-time algorithm for LP is known. I Running time depends on the sizes of numbers. We have to restrict attention to rational instances when analyzing the running time of algorithms and assume they are coded in binary. Theorem Optimal feasible solutions to LP problems are always rational as long as all coefficient and constants are rational. Proof: derives from the fact that in the simplex we only perform multiplications, divisions and sums of rational numbers 3 Integer Programming Outline Modeling 1. Integer Programming 2. Modeling Assignment Problem Knapsack Problem Set Covering Graph Problems 4 Integer Programming Discrete Optimization Modeling I Often need to deal with integral inseparable quantities. I Sometimes rounding can go. I Other times rounding not feasible: eg, presence of a bus on a line is 0.3... 5 Integer
    [Show full text]
  • Global Sensitivity Analysis for the Linear Assignment Problem
    Global Sensitivity Analysis for the Linear Assignment Problem Elad Michael, Tony A. Wood, Chris Manzie, and Iman Shames Abstract— In this paper, the following question is addressed: finding maximum flow within a network [6], lower bound- given a linear assignment problem, how much can the all of the ing solutions to the quadratic assignment problem [7], and individual assignment weights be perturbed without changing maximum a-posteriori data association for object tracking the optimal assignment? The extension of results involving perturbations in just one edge or one row/column are presented. [8]. Algorithms for the derivation of these bounds are provided. We In this paper we focus on understanding which perturba- also show how these bounds may be used to prevent assignment tions the optimal assignment is robust to. However, in some churning in a multi-vehicle guidance scenario. applications, it is preferable to find a potentially sub-optimal assignment which is less sensitive to perturbation. For linear I. INTRODUCTION assignment, [10] provides an algorithm with guaranteed Task assignment is a fundamental part of multi-agent robustness to uncertainty in a subset of the parameters. control. Here we represent task assignment optimization Similarly, but for the bottleneck assignment problem, [11] over a weighted bipartite graph, with agents and tasks as provides robustness to weights within predefined intervals, vertices and assignments as edges. The objective function as well as complexity improvements on similar bottleneck may be minimum completion time of all tasks, minimum assignment algorithms. However, these papers necessarily total fuel usage, balanced completion times, or any of a sacrifice optimality for robustness.
    [Show full text]
  • Lecture Notes on Bipartite Matching 1 Maximum Cardinality Matching
    Massachusetts Institute of Technology Handout 3 18.433: Combinatorial Optimization February 8th, 2007 Michel X. Goemans Lecture notes on bipartite matching We start by introducing some basic graph terminology. A graph G = (V; E) consists of a set V of vertices and a set E of pairs of vertices called edges. For an edge e = (u; v), we say that the endpoints of e are u and v; we also say that e is incident to u and v. A graph G = (V; E) is bipartite if the vertex set V can be partitioned into two sets A and B (the bipartition) such that no edge in E has both endpoints in the same set of the bipartition. A matching M E is a collection of edges such that every vertex of V is incident to at most one edge of M⊆. If a vertex v has no edge of M incident to it then v is said to be exposed (or unmatched). A matching is perfect if no vertex is exposed; in other words, a matching is perfect if its cardinality is equal to A = B . j j j j 1 6 2 7 exposed 3 8 matching 4 9 5 10 Figure 1: Example. The edges (1; 6), (2; 7) and (3; 8) form a matching. Vertices 4, 5, 9 and 10 are exposed. We are interested in the following two problems: Maximum cardinality matching problem: Find a matching M of maximum size. Minimum weight perfect matching problem: Given a cost cij for all (i; j) E, find a perfect matching of minimum cost where the cost of a matching M is given by2c(M) = (i;j)2M cij.
    [Show full text]
  • The Assignment Problem and Primal-Dual Algorithms 1
    Summer 2011 Optimization I Lecture 11 The Assignment Problem and Primal-Dual Algorithms 1 Assignment Problem Suppose we want to solve the following problem: We are given • a set of people I, and a set of jobs J, with jIj = jJj = n • a cost cij ≥ 0 for assigning job j to person i. We would like to assign jobs to people, so that each job is assigned to one person and each person is assigned one job, and we minimize the cost of the assignment. This is called the assignment problem. Example input: Jobs 90 75 75 80 People 35 85 55 65 125 95 90 105 45 110 95 115 The assignment problem is related to another problem, the maximum cardinality bipartite matching problem. In the maximum cardinality bipartite matching problem, you are given a bipartite graph G = (V; E), and you want to find a matching, i.e., a subset of the edges F such that each node is incident on at most one edge in F , that maximizes jF j. Suppose you have an algorithm for finding a maximum cardinality bipartite matching. If all of the costs in the assignment problem were either 0 or 1, then we could solve the assignment problem using the algorithm for the maximum cardinality bipartite matching problem: We construct a bipartite graph which has a node for every person i and every job j, and an edge from i to j if cij = 0. A matching in this graph of size k corresponds to a solution to the assignment problem of cost at most n − k and vice versa.
    [Show full text]