Exact Bound for a TSP Cost in Two Dimensions

Exact value for the average optimal cost of bipartite traveling-salesman and 2-factor problems in two dimensions Riccardo Capelli∗ Dipartimento di Fisica, University of Milan and INFN, via Celoria 16, 20133 Milan, Italy and Computational Biomedicine Section, Institute of Advanced Simulation IAS-5 and Institute of Neuroscience and Medicine INM-9, Forschungszentrum Jülich, Wilhelm-Johnen-Straße, 52425 Jülich, Germany Sergio Caracciolo,y Andrea Di Gioacchino,z and Enrico M. Malatestax Dipartimento di Fisica, University of Milan and INFN, via Celoria 16, 20133 Milan, Italy (Dated: October 1, 2018) We show that the average optimal cost for the traveling-salesman problem in two dimensions, which is the archetypal problem in combinatorial optimization, in the bipartite case, is simply related to the average optimal cost of the assignment problem with the same Euclidean, increasing, convex weights. In this way we extend a result already known in one dimension where exact solutions are avalaible. The recently determined average optimal cost for the assignment when the cost function is the square of the distance between the points provides therefore an exact prediction 1 E = log N N π for large number of points 2N. As a byproduct of our analysis also the loop covering problem has the same optimal average cost. We also explain why this result cannot be extended at higher dimensions. We numerically check the exact predictions. INTRODUCTION in the other subset, in the monopartite version all the points can be reached from any other point. Interestingly enough in The Traveling Salesman Problem (TSP) [1,2] can be for- d = 1, when p > 1, that is when the cost function is convex and mulated in few words: what is the shortest tour which goes increasing, the search for the optimal tour can be exactly solved through N given points? But this is well known to be a com- both in the bipartite [16], as well as in the monopartite [17], putationally intractable problem. The number of possible so- version of the problem. lutions increases exponentially with the number of points and There have been, recently, what we consider three relevant there is not a known algorithm able to find the solution in a progresses in the field: time that increases less than exponentially with N. i) for other optimization problems similar to the TSP, the When the emphasis is shifted from the research of the so- monopartite and bipartite versions have different op- lution (in the worst case) to the typical properties of the so- timal cost properties. For example for the matching, lution in a class of possible instances, the statistical prop- 1-factor and 2-factor (or loop-covering) problems, the erties of the optimal solutions can be described by a zero- optimal cost is expected to be a self-averaging quantity temperature statistical model. This approach has been tremen- whose average scales according to dously fruitul [3–8]. The random model in which the distances ¹p;dº − p between the cities are independent and equally distributed ran- 1 d EN ∼ N (2) dom variables has been deeply studied [9–12]. Much less is known for the Euclidean version of the random problem, (see [13] for a proof in the case p = 1). On the other where the position of the points are chosen at random in a hand, in the bipartite version [18–20] it is expected that d finite domain of R , so that the distances of the points are now 1− p >8N 2 for d = 1 > p p correlated [13–15]. Of course, for large d, the effects of these ¹p;dº >< 1− E ∼ N 2 ¹log Nº 2 for d = 2 (3) correlations are smaller and smaller and the methods used to N > 1− p >N d for d > 2 deal with the problem in absence of correlations becomes more : and more effective. that is a larger average cost with respect to the monopar- In the Euclidean version of the problem, we associate to the tite case when d ≤ 2. Moreover, in the bipartite case step in the tour from the i-th point with coordinate xi to the the optimal cost is expected to be not self-averaging; j-th point with coordinate xj a cost ii) in the bipartite case it is always true [21] that the total arXiv:1807.03559v2 [cond-mat.dis-nn] 27 Sep 2018 c ¹x ; x º c ¹x − x º kx − x kp ; ∗ p i j = p i j := i j (1) optimal cost of the TSP EH is larger than the total optimal cost of the 2-factor problem E∗ , which is larger M2 with p 2 R and kxi − xj k the Euclidean distance between the than twice the total optimal cost of the corresponding two points. In the bipartite version of the problem the set of matching problem (assignment) E∗ 2N points is partitioned in two subsets each with N points M1 and steps are allowed only from points in one subset to points E∗ ≥ E∗ ≥ 2E∗ : (4) H M2 M1 2 In [16] it has been shown that in d = 1, in the asymptotic limit of an infinitely large number points, this bound is saturated, that is the total optimal cost of the TSP, 1− p rescaled with N 2 , is exactly twice the total rescaled optimal cost of the assignment problem, and, therefore, all the three quantities coincide. iii) in the bipartite case, in d = 2 and p = 2, thanks to a deep connection with the continuum version of the problem, that is the well known transport problem, it has been possible to compute, exactly, the total optimal cost of the assignment problem in the asymptotic limit of an infinitely large number points [19, 22–24]: ¹2;2º 1 EN = log N : (5) 2π Figure 1: On the same instance with N = 16 blue (squared) and red (disk) points (top left panel), we draw an We considered, therefore, the possibility that also in d = 2, arbitrarily-chosen example of each class of spanning and p > 1, exactly, thanks to the logarithmic violation present subgraph we are considering: a 1-factor (top right), an in the bipartite case, the asymptotic total cost of the TSP can Hamiltonian cycle (bottom left) and a 2-factor (bottom right). be exactly twice the one of the assignment, that for p = 2 is also exactly known. Indeed, this is the case! The paper is organized as follows. In Section we present with M respectively M1; M2; H. In the Euclidean version of the model. In Section we present an argument to justify our bipartite optimization problems, we consider the immer- our strategy. In Section we provide evidence by numerical d sion of KN; N in an open subset Ω ⊂ R . V1, respectively V2, simulations how our result is established for large number of will be identified by the set of N points with coordinates ri’s, points. We also examined the case p = 1, which is the most that we shall call the red points, respectively by the set of N largely considered in the literature. points with coordinates bj ’s, that we shall call the blue points. Let ¹i; jº be the edge connecting the i−th red vertex with the j−th blue vertex. We give a weight wij = cp¹ri − bj º, with THE MODEL p ≥ 1 as in Eq. (1). Of course, in the monopartite Euclidean version there is Consider a generic graph G = ¹V; Eº where V is its set of only one set of points. 2 vertices and E ⊂ V its set of edges. Let we > 0 be a weight In the random Euclidean version of the problem, each pos- associated to the edge e 2 E. We shall consider the complete sible instance is obtained by choosing at random, with a given (monopartite) graph KN , whose vertex set has cardinality N law, the position of the points in Ω. For example we shall 2 d and E = V and the complete bipartite graph KN; N , whose consider Ω = »0; 1¼ and the flat distribution. We denote by ∗ ∗ vertex set is V = V1 [ V2 with V1 and V2 disjoint sets of E the average, over all instances, of the optimal total cost E . cardinality N, and E = V1 × V2. Let µ = ¹V; E 0º be a spanning subgraph of G, that is E 0 ⊂ E. We can define a total cost associated to µ according to SCALING ARGUMENT Õ E»µ¼ = we : (6) In this section we will provide a scaling argument to support e2µ our claim, that is, also in two dimensions, for any given choice of the positions of the points, in the asymptotic limit of large We shall consider three different classes of spanning sub- N, the cost of the bipartite TSP converges to twice the cost of graphs M. The set M1 of 1-factor (matching), where each the assignment. vertex belongs to one and only one edge, the set M2 of 2-factor Given an instance, let us consider the optimal assignment (2-matching or loop covering), where each vertex belongs to µ∗ on them. Let us now consider N points which are taken two edges, and the set H of Hamiltonian cycles, that is 2- between the red an blue point of each edge in µ∗ and call T ∗ factor formed by only one cycle (see Fig.1). The assignment, the optimal monopartite TSP solution on these points. For respectively 2-factor, TSP, problems amounts to the search of simplicity, as these N points we take the blue points. ∗ M M H the subgraph µ in 1, respectively 2, , which is optimal, We shall use T ∗ to provide an ordering among the red and in the sense of minimal total cost blue points.

Exact Bound for a TSP Cost in Two Dimensions

Extended Branch Decomposition Graphs: Structural Comparison of Scalar Data

Exploiting C-Closure in Kernelization Algorithms for Graph Problems

3.1 Matchings and Factors: Matchings and Covers

Approximation Algorithms

Multi-Budgeted Matchings and Matroid Intersection Via Dependent Rounding

The Complexity of Multivariate Matching Polynomials

1 Bipartite Matching and Vertex Covers

Exponential Family Bipartite Matching

Map-Matching Using Shortest Paths

Lecture 3: Approximation Algorithms 1 1 Set Cover, Vertex Cover, And

Graph Theory Matchings

Matching Algorithms