Polynomials for Directed Graphs

Total Page:16

File Type:pdf, Size:1020Kb

Polynomials for Directed Graphs Polynomials for Directed Graphs Gary Gordon and Lorenzo Traldi Departmen t of Mathematics Lafayette College Easton, PA 18042 Abstract Several polynomials are defined on directed graphs and rooted directed graphs which are all analogous to the Tutte polynomial of an undirected graph. Various interrelations between these polynomials are explored. Many of them have the number of maximal di rected trees (arbores cences) as an evaluation, and are related to the one- variable greedoid polynomial ~G(Y). Applications to reliability are also examined. 1. Introduction. The central theme of this study is the attempt to generalize the Tutte polynomial of an undirected graph to a polynomial invariant of a directed graph, especially one having a distinguished vertex (a rooted digraph ). It does not seem that there is any one right" such generalization; rather, different definitions of the Tutte polynomial suggest different directed polynomials. One definition of the Tutte polynomial of an undirected graph G depends on a rank function on subsets of E ( G), the edge-set of G. For A ~ E (G) the (circuit ) rank of A, r (A), is the cardinality of the largest acyclic subset of A_ The corank-nullity polynomial f( Gj t, z) (also called the Whitney polynomiaf) is defin ed by (T1) f(Gjt,z ) == L tr(E(G» - r(A) z IAI-r(A). A~E (G) In Section 2, we !lese three differen t rank functions to define polynomials on rooted digraphs via (T1). (By a rank function" we simply mean any function mapping sets of edges of directed graphs to non-negative integers. ) One of these three is the rank fun ction of the directed branchi ng greedoid (1] associated to a rooled digraph D, while the second is the basis rank fu nction of the same greedoid. Thus , these first two rank functions yield two-variable polynomial invariants for arbitrary greedoids. The third rank fu nction considered is not associated with any greedoid, though it is still associated with the notion of reachability of vertices of D fr om its root. All of these polynomials generalize the one-variable greedoid polynomial of [1] when the greedoid is the di rected branching greedoid of a rooted digraph. CONGRESSUS NUMERANTIUM 94(1993), pp.187-201 The definition of the corank-nullity polynomial of an undirected graph can be rephrased without using a rank function. For this formulation, define a subgraph A of G to be vertex-spanning if it contains the entire vertex-set V(C) of C, and let 5 denote the collection of all such subgraphs; we will often use the same letter A to refer to both a vertex-spanning subgraph and its edge-set. Then if c(A ) denotes the number of connected components of A, (T2) f( c; t, z) == E tc(A)-c(G) zIAI+c(A)-IV(G)I. A C:;; E (G) Since r(A) W(C)I - etA), (T2) is a direct translation of (Tl). We mention it here because it is natural to investigate the notions of "number of components" associated to various rank functions. For instance, the notion associated to the rank function of the directed branching greedoid is that the number of components of a rooted digraph is one more than the number of vertices not reachable from the root, wh il e the notion associated to the third rank function considered in Section 2 is that the number of components of a rooted digraph is the minimum number of edges that can be inserted into the digraph so that all its vertices will be reachable from its root. Also, it often h ppens that a certain notion of "number of components" can be more natural than the associated rank function. For instance, in Section 3 we menti n two polynomials t hat are associated wi th strong connectedness rather than reachability, and for these polynomials there are no "natural" descriptions of the associa ted rank func tions. The recursive definition of t he Tutte polynomial is very useful, since it allows for natural inductive proofs. If e is an edge of C then the deletion C - e is simply the graph C wi th e erased (deleted). The cont raction G / e is the graph obtained from C - e by identifyin g the endpoints of e. The Tutte polynomial can be defined by (T3) (a) ftC) = 1 ifC has no edges, (b ) I ( G) = f ( C - e) + f( G / e) if e is ne ither an isthmus nor a loop, (c) ft C ) = xf(C /e) if e is an isthmu,s, and (d) f tC) = yf(C - e) ife is a loop. Note that (T3) (b), (c), and (d) can be replaced by the following single condition. (e) For any e, ftC ) = (y - l )IV(G /e)I+1-I V (G )lf(C/e) + (x - ly(G)-r(G-e) f (C - e). (T 3) is equivalent to (Tl) and (T2) under the substitution x = t +1, II = z + 1. It seems di ffi cult to defi ne a "directed isthmus" that can be u.sed in (T 3) to recursively define a polynomial that is independent of the order in which edges are deleted and contracted. In Section 4 we give two exa.mples of recursively defined polynomials whi ch depend on a. given ordering of t he edges or vertices. One can also define the ordinary Tutte polynomial in terms of activi ties (see [2] and [9]) or generalized activities (see [7]) . Although several of the polynomials we defi ne for rooted digraphs can be described in term.s of analogous activities, these 188 descriptions are complicated and do not seem to contribute much to understanding the polynomials. In Section 5 we extend the polynomials to weighted digraphs; several can then be evaluated to yield the determinant of the Kirch hoff matri x. We also discuss the application of these pol ynomials to some reliability problems . T he polynomials yield no t only information about the likelihood that a directed network with imperfect edges will remain fully operational, but also illformation about the expected number of edges that will fai l, the expected number of repairs that will have to be made to restore a ne twork to operational status, and the expected cos t of making those repaIrs. Before considering this Introduction cnmplete we should take the opportunity to thank Lafayette College for its support : during part of the time this research was being done Gary Gordon was supported by a research fellowship, and Lorenzo Traldi was on a sabbatical leave. 2. Polynomials for rooted digraphs. In this section, we defi ne three polynomials for rooted digraphs via (Tl). We fi rst define the branching greedoid independence rank rl (A ) and basis rank r~ ( A ) , and also the rooted forest rank r3( A ), of a subset A of the edge-set of a rooted digraph (D, *). (Here .. represen ts t he root vertex. ) A subgrapb of D is a '" rooted arborescence if for every vertex v of T, there is a uni que directed path in T from * to v. T hus T is a di rected tree, rooted at • . (See Figure la.) A spanning. rooted arborescfnCf is a • rooted arborescence which is incident with every vertex of D. In general, a digraph need not have any spanning . rooted arborescence; even if it doesn't , it is still true tbat all maximal * rooted arborescences have the same number of edges. A • rooted forest of arborescences is a vertex-disjoi nt union of arborescences rooted at ., VI, V2t ••• for some vertices VI, V2, .... (In Figure 1b, the heavy lines give a * rooted forest of arborescenc s. ) We will often fail to distinguish be ween a • rooted forest of arborescences and its edge-set. We now defi ne rank fun ctions r l, r2, and r3 by (2.1) rl(A) == max{IT II T ~ A is a * rooted arbo rescence}, (2.2) r2(A) == 'Dax {IT nAil T ~ E(D) is a * roo ted arborescence}, and (2.3) r3 (A) == max{IFI I F ~ A is a. root ed forest of arborescences }. Proposition 2.1. For every A ~ E(D), rt{A) ~ r2 (A) ~ r3 (A); also, rdD) rd D). Proof. Since any • rooted arboresc nce can be extended to a maximal . rooted arborescence, rl (A ) ~ r2( A). Furthermore, since any subset of a maximal * rooted arborescence is a • rooted forest of arborescences, r2 (A ) :5 r3 (A). 189 .... The above inequali ties are not generally reversible. For instance, in Figure 2a if A = {a, c, e} then rl( A) = 1, r2( A ) = 2, and r3( A) = 3. It is a routine veri fi cation to demonstrate that rl is a greedoid rank function. In general , it is not a matroid rank fun ction. For examp le, let A = { b, c d} and x = a in the digraph of Figure 2a. Then rdA) = 0, but rl (A U {x}) = 4. It is also easy to verify that each of r2 and rJ satis fi es the defini tion of a matroid rank function except for semimodulari ty. For example, in Figure 2b for i = 2 or 3 we have r;( {c}) = r;( {c,d}) = r.( {a, c}) = 1, but r;({a,c,d}) = 2. Imitating defi nitions of matroid theory (see [10J for an account), for 1 Si S 3 we define A ~ E( D ) to he i-spanning if ri tA) = r.(E (D)), i-independent if r;(A ) = IAI. and an i-basis if it is both i-spanning and i-independent. The following proposition is an immediate consequence of Definitions (2.1), (2.2), and (2 .3 ).
Recommended publications
  • Applications of the Quantum Algorithm for St-Connectivity
    Applications of the quantum algorithm for st-connectivity KAI DELORENZO1 , SHELBY KIMMEL1 , AND R. TEAL WITTER1 1Middlebury College, Computer Science Department, Middlebury, VT Abstract We present quantum algorithms for various problems related to graph connectivity. We give simple and query-optimal algorithms for cycle detection and odd-length cycle detection (bipartiteness) using a reduction to st-connectivity. Furthermore, we show that our algorithm for cycle detection has improved performance under the promise of large circuit rank or a small number of edges. We also provide algo- rithms for detecting even-length cycles and for estimating the circuit rank of a graph. All of our algorithms have logarithmic space complexity. 1 Introduction Quantum query algorithms are remarkably described by span programs [15, 16], a linear algebraic object originally created to study classical logspace complexity [12]. However, finding optimal span program algorithms can be challenging; while they can be obtained using a semidefinite program, the size of the program grows exponentially with the size of the input to the algorithm. Moreover, span programs are designed to characterize the query complexity of an algorithm, while in practice we also care about the time and space complexity. One of the nicest span programs is for the problem of undirected st-connectivity, in which one must decide whether two vertices s and t are connected in a given graph. It is “nice” for several reasons: It is easy to describe and understand why it is correct. • It corresponds to a quantum algorithm that uses logarithmic (in the number of vertices and edges of • the graph) space [4, 11].
    [Show full text]
  • The Normalized Laplacian Spectrum of Subdivisions of a Graph
    The normalized Laplacian spectrum of subdivisions of a graph Pinchen Xiea,b, Zhongzhi Zhanga,c, Francesc Comellasd aShanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433, China bDepartment of Physics, Fudan University, Shanghai 200433, China cSchool of Computer Science, Fudan University, Shanghai 200433, China dDepartment of Applied Mathematics IV, Universitat Polit`ecnica de Catalunya, 08034 Barcelona Catalonia, Spain Abstract Determining and analyzing the spectra of graphs is an important and excit- ing research topic in mathematics science and theoretical computer science. The eigenvalues of the normalized Laplacian of a graph provide information on its structural properties and also on some relevant dynamical aspects, in particular those related to random walks. In this paper, we give the spec- tra of the normalized Laplacian of iterated subdivisions of simple connected graphs. As an example of application of these results we find the exact val- ues of their multiplicative degree-Kirchhoff index, Kemeny's constant and number of spanning trees. Keywords: Normalized Laplacian spectrum, Subdivision graph, Degree-Kirchhoff index, Kemeny's constant, Spanning trees 1. Introduction Spectral analysis of graphs has been the subject of considerable research effort in mathematics and computer science [1, 2, 3], due to its wide appli- cations in this area and in general [4, 5]. In the last few decades a large body of scientific literature has established that important structural and dynamical properties
    [Show full text]
  • Matroidal Structure of Rough Sets from the Viewpoint of Graph Theory
    Hindawi Publishing Corporation Journal of Applied Mathematics Volume 2012, Article ID 973920, 27 pages doi:10.1155/2012/973920 Research Article Matroidal Structure of Rough Sets from the Viewpoint of Graph Theory Jianguo Tang,1, 2 Kun She,1 and William Zhu3 1 School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China 2 School of Computer Science and Engineering, XinJiang University of Finance and Economics, Urumqi 830012, China 3 Lab of Granular Computing, Zhangzhou Normal University, Zhangzhou 363000, China Correspondence should be addressed to William Zhu, [email protected] Received 4 February 2012; Revised 30 April 2012; Accepted 18 May 2012 Academic Editor: Mehmet Sezer Copyright q 2012 Jianguo Tang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Constructing structures with other mathematical theories is an important research field of rough sets. As one mathematical theory on sets, matroids possess a sophisticated structure. This paper builds a bridge between rough sets and matroids and establishes the matroidal structure of rough sets. In order to understand intuitively the relationships between these two theories, we study this problem from the viewpoint of graph theory. Therefore, any partition of the universe can be represented by a family of complete graphs or cycles. Then two different kinds of matroids are constructed and some matroidal characteristics of them are discussed, respectively. The lower and the upper approximations are formulated with these matroidal characteristics.
    [Show full text]
  • Permutation of Elements in Double Semigroups
    PERMUTATION OF ELEMENTS IN DOUBLE SEMIGROUPS MURRAY BREMNER AND SARA MADARIAGA Abstract. Double semigroups have two associative operations ◦, • related by the interchange relation: (a • b) ◦ (c • d) ≡ (a ◦ c) • (b ◦ d). Kock [13] (2007) discovered a commutativity property in degree 16 for double semigroups: as- sociativity and the interchange relation combine to produce permutations of elements. We show that such properties can be expressed in terms of cycles in directed graphs with edges labelled by permutations. We use computer alge- bra to show that 9 is the lowest degree for which commutativity occurs, and we give self-contained proofs of the commutativity properties in degree 9. 1. Introduction Definition 1.1. A double semigroup is a set S with two associative binary operations •, ◦ satisfying the interchange relation for all a,b,c,d ∈ S: (⊞) (a • b) ◦ (c • d) ≡ (a ◦ c) • (b ◦ d). The symbol ≡ indicates that the equation holds for all values of the variables. We interpret ◦ and • as horizontal and vertical compositions, so that (⊞) ex- presses the equivalence of two decompositions of a square array: a b a b a b (a ◦ b) • (c ◦ d) ≡ ≡ ≡ ≡ (a • c) ◦ (b • d). c d c d c d This interpretation of the operations extends to any double semigroup monomial, producing what we call the geometric realization of the monomial. The interchange relation originated in homotopy theory and higher categories; see Mac Lane [15, (2.3)] and [16, §XII.3]. It is also called the Godement relation by arXiv:1405.2889v2 [math.RA] 26 Mar 2015 some authors; see Simpson [21, §2.1].
    [Show full text]
  • G-Parking Functions and Tree Inversions
    G-PARKING FUNCTIONS AND TREE INVERSIONS DAVID PERKINSON, QIAOYU YANG, AND KUAI YU Abstract. A depth-first search version of Dhar’s burning algorithm is used to give a bijection between the parking functions of a graph and labeled spanning trees, relating the degree of the parking function with the number of inversions of the spanning tree. Specializing to the complete graph solves a problem posed by R. Stanley. 1. Introduction Let G = (V, E) be a connected simple graph with vertex set V = {0,...,n} and edge set E. Fix a root vertex r ∈ V and let SPT(G) denote the set of spanning trees of G rooted at r. We think of each element of SPT(G) as a directed graph in which all paths lead away from the root. If i, j ∈ V and i lies on the unique path from r to j in the rooted spanning tree T , then i is an ancestor of j and j is a descendant of i in T . If, in addition, there are no vertices between i and j on the path from the root, then i is the parent of its child j, and (i, j) is a directed edge of T . Definition 1. An inversion of T ∈ SPT(G) is a pair of vertices (i, j), such that i is an ancestor of j in T and i > j. It is a κ-inversion if, in addition, i is not the root and i’s parent is adjacent to j in G. The number of κ-inversions of T is the tree’s κ-number, denoted κ(G, T ).
    [Show full text]
  • The Normalized Laplacian Spectrum of Subdivisions of a Graph
    The normalized Laplacian spectrum of subdivisions of a graph Pinchen Xiea,c, Zhongzhi Zhangb,c, Francesc Comellasd aDepartment of Physics, Fudan University, Shanghai 200433, China bSchool of Computer Science, Fudan University, Shanghai 200433, China cShanghai Key Laboratory of Intelligent Information Processing, Fudan University, Shanghai 200433, China dDepartment of Applied Mathematics IV, Universitat Polit`ecnica de Catalunya, 08034 Barcelona Catalonia, Spain Abstract Determining and analyzing the spectra of graphs is an important and exciting research topic in theoretical computer science. The eigenvalues of the normalized Laplacian of a graph provide information on its structural properties and also on some relevant dynamical aspects, in particular those related to random walks. In this paper, we give the spectra of the nor- malized Laplacian of iterated subdivisions of simple connected graphs. As an example of application of these results we find the exact values of their multiplicative degree-Kirchhoff index, Kemeny’s constant and number of spanning trees. Keywords: Normalized Laplacian spectrum, Subdivision graph, Degree-Kirchhoff index, Kemeny’s constant, Spanning trees 1. Introduction arXiv:1510.02394v1 [math.CO] 7 Oct 2015 Spectral analysis of graphs has been the subject of considerable research effort in theoretical computer science [1, 2, 3], due to its wide applications in this area and in general [4, 5]. In the last few decades a large body of scientific literature has established that important structural and dynamical properties
    [Show full text]
  • Classes of Graphs Embeddable in Order-Dependent Surfaces
    Classes of graphs embeddable in order-dependent surfaces Colin McDiarmid Sophia Saller Department of Statistics Department of Mathematics University of Oxford University of Oxford [email protected] and DFKI [email protected] 12 June 2021 Abstract Given a function g = g(n) we let Eg be the class of all graphs G such that if G has order n (that is, has n vertices) then it is embeddable in some surface of Euler genus at most g(n), and let eEg be the corresponding class of unlabelled graphs. We give estimates of the sizes of these classes. For example we 3 g show that if g(n) = o(n= log n) then the class E has growth constant γP, the (labelled) planar graph growth constant; and when g(n) = O(n) we estimate the number of n-vertex graphs in Eg and eEg up g to a factor exponential in n. From these estimates we see that, if E has growth constant γP then we must have g(n) = o(n= log n), and the generating functions for Eg and eEg have strictly positive radius of convergence if and only if g(n) = O(n= log n). Such results also hold when we consider orientable and non-orientable surfaces separately. We also investigate related classes of graphs where we insist that, as well as the graph itself, each subgraph is appropriately embeddable (according to its number of vertices); and classes of graphs where we insist that each minor is appropriately embeddable. In a companion paper [43], these results are used to investigate random n-vertex graphs sampled uniformly from Eg or from similar classes.
    [Show full text]
  • Cheminformatics for Genome-Scale Metabolic Reconstructions
    CHEMINFORMATICS FOR GENOME-SCALE METABOLIC RECONSTRUCTIONS John W. May European Molecular Biology Laboratory European Bioinformatics Institute University of Cambridge Homerton College A thesis submitted for the degree of Doctor of Philosophy June 2014 Declaration This thesis is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indicated in the text. This dissertation is not substantially the same as any I have submitted for a degree, diploma or other qualification at any other university, and no part has already been, or is currently being submitted for any degree, diploma or other qualification. This dissertation does not exceed the specified length limit of 60,000 words as defined by the Biology Degree Committee. This dissertation has been typeset using LATEX in 11 pt Palatino, one and half spaced, according to the specifications defined by the Board of Graduate Studies and the Biology Degree Committee. June 2014 John W. May to Róisín Acknowledgements This work was carried out in the Cheminformatics and Metabolism Group at the European Bioinformatics Institute (EMBL-EBI). The project was fund- ed by Unilever, the Biotechnology and Biological Sciences Research Coun- cil [BB/I532153/1], and the European Molecular Biology Laboratory. I would like to thank my supervisor, Christoph Steinbeck for his guidance and providing intellectual freedom. I am also thankful to each member of my thesis advisory committee: Gordon James, Julio Saez-Rodriguez, Kiran Patil, and Gos Micklem who gave their time, advice, and guidance. I am thankful to all members of the Cheminformatics and Metabolism Group.
    [Show full text]
  • COLORING GRAPHS USING TOPOLOGY 11 Realized by Tutte Who Called an Example of a Disc G for Which the Boundary Has Chromatic Number 4 a Chromatic Obstacle
    COLORING GRAPHS USING TOPOLOGY OLIVER KNILL Abstract. Higher dimensional graphs can be used to color 2- dimensional geometric graphs G. If G the boundary of a three dimensional graph H for example, we can refine the interior until it is colorable with 4 colors. The later goal is achieved if all inte- rior edge degrees are even. Using a refinement process which cuts the interior along surfaces S we can adapt the degrees along the boundary S. More efficient is a self-cobordism of G with itself with a host graph discretizing the product of G with an interval. It fol- lows from the fact that Euler curvature is zero everywhere for three dimensional geometric graphs, that the odd degree edge set O is a cycle and so a boundary if H is simply connected. A reduction to minimal coloring would imply the four color theorem. The method is expected to give a reason \why 4 colors suffice” and suggests that every two dimensional geometric graph of arbitrary degree and ori- entation can be colored by 5 colors: since the projective plane can not be a boundary of a 3-dimensional graph and because for higher genus surfaces, the interior H is not simply connected, we need in general to embed a surface into a 4-dimensional simply connected graph in order to color it. This explains the appearance of the chromatic number 5 for higher degree or non-orientable situations, a number we believe to be the upper limit. For every surface type, we construct examples with chromatic number 3; 4 or 5, where the construction of surfaces with chromatic number 5 is based on a method of Fisk.
    [Show full text]
  • Graph Theory Trees
    GGRRAAPPHH TTHHEEOORRYY -- TTRREEEESS http://www.tutorialspoint.com/graph_theory/graph_theory_trees.htm Copyright © tutorialspoint.com Trees are graphs that do not contain even a single cycle. They represent hierarchical structure in a graphical form. Trees belong to the simplest class of graphs. Despite their simplicity, they have a rich structure. Trees provide a range of useful applications as simple as a family tree to as complex as trees in data structures of computer science. Tree A connected acyclic graph is called a tree. In other words, a connected graph with no cycles is called a tree. The edges of a tree are known as branches. Elements of trees are called their nodes. The nodes without child nodes are called leaf nodes. A tree with ‘n’ vertices has ‘n-1’ edges. If it has one more edge extra than ‘n-1’, then the extra edge should obviously has to pair up with two vertices which leads to form a cycle. Then, it becomes a cyclic graph which is a violation for the tree graph. Example 1 The graph shown here is a tree because it has no cycles and it is connected. It has four vertices and three edges, i.e., for ‘n’ vertices ‘n-1’ edges as mentioned in the definition. Note − Every tree has at least two vertices of degree one. Example 2 In the above example, the vertices ‘a’ and ‘d’ has degree one. And the other two vertices ‘b’ and ‘c’ has degree two. This is possible because for not forming a cycle, there should be at least two single edges anywhere in the graph.
    [Show full text]
  • Hamiltonian Cycle and Related Problems: Vertex-Breaking, Grid
    Hamiltonian Cycle and Related Problems: Vertex-Breaking, Grid Graphs, and Rubik’s Cubes by Mikhail Rudoy Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Masters of Engineering in Electrical Engineering and Computer Science at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY May 2017 ○c Mikhail Rudoy, MMXVII. All rights reserved. The author hereby grants to MIT permission to reproduce and to distribute publicly paper and electronic copies of this thesis document in whole or in part in any medium now known or hereafter created. Author............................................................................ Department of Electrical Engineering and Computer Science May 26, 2017 Certified by. Erik Demaine Professor Thesis Supervisor Accepted by....................................................................... Christopher J. Terman Chairman, Masters of Engineering Thesis Committee THIS PAGE INTENTIONALLY LEFT BLANK 2 Hamiltonian Cycle and Related Problems: Vertex-Breaking, Grid Graphs, and Rubik’s Cubes by Mikhail Rudoy Submitted to the Department of Electrical Engineering and Computer Science on May 26, 2017, in partial fulfillment of the requirements for the degree of Masters of Engineering in Electrical Engineering and Computer Science Abstract In this thesis, we analyze the computational complexity of several problems related to the Hamiltonian Cycle problem. We begin by introducing a new problem, which we call Tree-Residue Vertex-Breaking (TRVB). Given a multigraph 퐺 some of whose vertices are marked “breakable,” TRVB asks whether it is possible to convert 퐺 into a tree via a sequence of applications of the vertex- breaking operation: disconnecting the edges at a degree-푘 breakable vertex by replacing that vertex with 푘 degree-1 vertices.
    [Show full text]
  • Learning Max-Margin Tree Predictors
    Learning Max-Margin Tree Predictors Ofer Meshiy∗ Elad Ebany∗ Gal Elidanz Amir Globersony y School of Computer Science and Engineering z Department of Statistics The Hebrew University of Jerusalem, Israel Abstract years using structured output prediction has resulted in state-of-the-art results in many real-worlds problems from computer vision, natural language processing, Structured prediction is a powerful frame- computational biology, and other fields [Bakir et al., work for coping with joint prediction of 2007]. Such predictors can be learned from data using interacting outputs. A central difficulty in formulations such as Max-Margin Markov Networks using this framework is that often the correct (M3N) [Taskar et al., 2003, Tsochantaridis et al., 2006], label dependence structure is unknown. At or conditional random fields (CRF) [Lafferty et al., the same time, we would like to avoid an 2001]. overly complex structure that will lead to intractable prediction. In this work we ad- While the prediction and the learning tasks are dress the challenge of learning tree structured generally computationally intractable [Shimony, 1994, predictive models that achieve high accuracy Sontag et al., 2010], for some models they can be while at the same time facilitate efficient carried out efficiently. For example, when the model (linear time) inference. We start by proving consists of pairwise dependencies between output vari- that this task is in general NP-hard, and then ables, and these form a tree structure, prediction can suggest an approximate alternative. Our be computed efficiently using dynamic programming CRANK approach relies on a novel Circuit- at a linear cost in the number of output variables RANK regularizer that penalizes non-tree [Pearl, 1988].
    [Show full text]