Graphical Models, Message-Passing Algorithms, and Variational Methods: Part I
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Inference Algorithm for Finite-Dimensional Spin
PHYSICAL REVIEW E 84, 046706 (2011) Inference algorithm for finite-dimensional spin glasses: Belief propagation on the dual lattice Alejandro Lage-Castellanos and Roberto Mulet Department of Theoretical Physics and Henri-Poincare´ Group of Complex Systems, Physics Faculty, University of Havana, La Habana, Codigo Postal 10400, Cuba Federico Ricci-Tersenghi Dipartimento di Fisica, INFN–Sezione di Roma 1, and CNR–IPCF, UOS di Roma, Universita` La Sapienza, Piazzale Aldo Moro 5, I-00185 Roma, Italy Tommaso Rizzo Dipartimento di Fisica and CNR–IPCF, UOS di Roma, Universita` La Sapienza, Piazzale Aldo Moro 5, I-00185 Roma, Italy (Received 18 February 2011; revised manuscript received 15 September 2011; published 24 October 2011) Starting from a cluster variational method, and inspired by the correctness of the paramagnetic ansatz [at high temperatures in general, and at any temperature in the two-dimensional (2D) Edwards-Anderson (EA) model] we propose a message-passing algorithm—the dual algorithm—to estimate the marginal probabilities of spin glasses on finite-dimensional lattices. We use the EA models in 2D and 3D as benchmarks. The dual algorithm improves the Bethe approximation, and we show that in a wide range of temperatures (compared to the Bethe critical temperature) our algorithm compares very well with Monte Carlo simulations, with the double-loop algorithm, and with exact calculation of the ground state of 2D systems with bimodal and Gaussian interactions. Moreover, it is usually 100 times faster than other provably convergent methods, as the double-loop algorithm. In 2D and 3D the quality of the inference deteriorates only where the correlation length becomes very large, i.e., at low temperatures in 2D and close to the critical temperature in 3D. -
Many-Pairs Mutual Information for Adding Structure to Belief Propagation Approximations
Proceedings of the Twenty-Third AAAI Conference on Artificial Intelligence (2008) Many-Pairs Mutual Information for Adding Structure to Belief Propagation Approximations Arthur Choi and Adnan Darwiche Computer Science Department University of California, Los Angeles Los Angeles, CA 90095 {aychoi,darwiche}@cs.ucla.edu Abstract instances of the satisfiability problem (Braunstein, Mezard,´ and Zecchina 2005). IBP has also been applied towards a We consider the problem of computing mutual informa- variety of computer vision tasks (Szeliski et al. 2006), par- tion between many pairs of variables in a Bayesian net- work. This task is relevant to a new class of Generalized ticularly stereo vision, where methods based on belief prop- Belief Propagation (GBP) algorithms that characterizes agation have been particularly successful. Iterative Belief Propagation (IBP) as a polytree approx- The successes of IBP as an approximate inference algo- imation found by deleting edges in a Bayesian network. rithm spawned a number of generalizations, in the hopes of By computing, in the simplified network, the mutual in- understanding the nature of IBP, and further to improve on formation between variables across a deleted edge, we the quality of its approximations. Among the most notable can estimate the impact that recovering the edge might are the family of Generalized Belief Propagation (GBP) al- have on the approximation. Unfortunately, it is compu- gorithms (Yedidia, Freeman, and Weiss 2005), where the tationally impractical to compute such scores for net- works over many variables having large state spaces. “structure” of an approximation is given by an auxiliary So that edge recovery can scale to such networks, we graph called a region graph, which specifies (roughly) which propose in this paper an approximation of mutual in- regions of the original network should be handled exactly, formation which is based on a soft extension of d- and to what extent different regions should be mutually con- separation (a graphical test of independence in Bayesian sistent. -
Structural Parameterizations of Clique Coloring
Structural Parameterizations of Clique Coloring Lars Jaffke University of Bergen, Norway lars.jaff[email protected] Paloma T. Lima University of Bergen, Norway [email protected] Geevarghese Philip Chennai Mathematical Institute, India UMI ReLaX, Chennai, India [email protected] Abstract A clique coloring of a graph is an assignment of colors to its vertices such that no maximal clique is monochromatic. We initiate the study of structural parameterizations of the Clique Coloring problem which asks whether a given graph has a clique coloring with q colors. For fixed q ≥ 2, we give an O?(qtw)-time algorithm when the input graph is given together with one of its tree decompositions of width tw. We complement this result with a matching lower bound under the Strong Exponential Time Hypothesis. We furthermore show that (when the number of colors is unbounded) Clique Coloring is XP parameterized by clique-width. 2012 ACM Subject Classification Mathematics of computing → Graph coloring Keywords and phrases clique coloring, treewidth, clique-width, structural parameterization, Strong Exponential Time Hypothesis Digital Object Identifier 10.4230/LIPIcs.MFCS.2020.49 Related Version A full version of this paper is available at https://arxiv.org/abs/2005.04733. Funding Lars Jaffke: Supported by the Trond Mohn Foundation (TMS). Acknowledgements The work was partially done while L. J. and P. T. L. were visiting Chennai Mathematical Institute. 1 Introduction Vertex coloring problems are central in algorithmic graph theory, and appear in many variants. One of these is Clique Coloring, which given a graph G and an integer k asks whether G has a clique coloring with k colors, i.e. -
Clique-Width and Tree-Width of Sparse Graphs
Clique-width and tree-width of sparse graphs Bruno Courcelle Labri, CNRS and Bordeaux University∗ 33405 Talence, France email: [email protected] June 10, 2015 Abstract Tree-width and clique-width are two important graph complexity mea- sures that serve as parameters in many fixed-parameter tractable (FPT) algorithms. The same classes of sparse graphs, in particular of planar graphs and of graphs of bounded degree have bounded tree-width and bounded clique-width. We prove that, for sparse graphs, clique-width is polynomially bounded in terms of tree-width. For planar and incidence graphs, we establish linear bounds. Our motivation is the construction of FPT algorithms based on automata that take as input the algebraic terms denoting graphs of bounded clique-width. These algorithms can check properties of graphs of bounded tree-width expressed by monadic second- order formulas written with edge set quantifications. We give an algorithm that transforms tree-decompositions into clique-width terms that match the proved upper-bounds. keywords: tree-width; clique-width; graph de- composition; sparse graph; incidence graph Introduction Tree-width and clique-width are important graph complexity measures that oc- cur as parameters in many fixed-parameter tractable (FPT) algorithms [7, 9, 11, 12, 14]. They are also important for the theory of graph structuring and for the description of graph classes by forbidden subgraphs, minors and vertex-minors. Both notions are based on certain hierarchical graph decompositions, and the associated FPT algorithms use dynamic programming on these decompositions. To be usable, they need input graphs of "small" tree-width or clique-width, that furthermore are given with the relevant decompositions. -
Variational Message Passing
Journal of Machine Learning Research 6 (2005) 661–694 Submitted 2/04; Revised 11/04; Published 4/05 Variational Message Passing John Winn [email protected] Christopher M. Bishop [email protected] Microsoft Research Cambridge Roger Needham Building 7 J. J. Thomson Avenue Cambridge CB3 0FB, U.K. Editor: Tommi Jaakkola Abstract Bayesian inference is now widely established as one of the principal foundations for machine learn- ing. In practice, exact inference is rarely possible, and so a variety of approximation techniques have been developed, one of the most widely used being a deterministic framework called varia- tional inference. In this paper we introduce Variational Message Passing (VMP), a general purpose algorithm for applying variational inference to Bayesian Networks. Like belief propagation, VMP proceeds by sending messages between nodes in the network and updating posterior beliefs us- ing local operations at each node. Each such update increases a lower bound on the log evidence (unless already at a local maximum). In contrast to belief propagation, VMP can be applied to a very general class of conjugate-exponential models because it uses a factorised variational approx- imation. Furthermore, by introducing additional variational parameters, VMP can be applied to models containing non-conjugate distributions. The VMP framework also allows the lower bound to be evaluated, and this can be used both for model comparison and for detection of convergence. Variational message passing has been implemented in the form of a general purpose inference en- gine called VIBES (‘Variational Inference for BayEsian networkS’) which allows models to be specified graphically and then solved variationally without recourse to coding. -
Parameterized Algorithms for Recognizing Monopolar and 2
Parameterized Algorithms for Recognizing Monopolar and 2-Subcolorable Graphs∗ Iyad Kanj1, Christian Komusiewicz2, Manuel Sorge3, and Erik Jan van Leeuwen4 1School of Computing, DePaul University, Chicago, USA, [email protected] 2Institut für Informatik, Friedrich-Schiller-Universität Jena, Germany, [email protected] 3Institut für Softwaretechnik und Theoretische Informatik, TU Berlin, Germany, [email protected] 4Max-Planck-Institut für Informatik, Saarland Informatics Campus, Saarbrücken, Germany, [email protected] October 16, 2018 Abstract A graph G is a (ΠA, ΠB)-graph if V (G) can be bipartitioned into A and B such that G[A] satisfies property ΠA and G[B] satisfies property ΠB. The (ΠA, ΠB)-Recognition problem is to recognize whether a given graph is a (ΠA, ΠB )-graph. There are many (ΠA, ΠB)-Recognition problems, in- cluding the recognition problems for bipartite, split, and unipolar graphs. We present efficient algorithms for many cases of (ΠA, ΠB)-Recognition based on a technique which we dub inductive recognition. In particular, we give fixed-parameter algorithms for two NP-hard (ΠA, ΠB)-Recognition problems, Monopolar Recognition and 2-Subcoloring, parameter- ized by the number of maximal cliques in G[A]. We complement our algo- rithmic results with several hardness results for (ΠA, ΠB )-Recognition. arXiv:1702.04322v2 [cs.CC] 4 Jan 2018 1 Introduction A (ΠA, ΠB)-graph, for graph properties ΠA, ΠB , is a graph G = (V, E) for which V admits a partition into two sets A, B such that G[A] satisfies ΠA and G[B] satisfies ΠB. There is an abundance of (ΠA, ΠB )-graph classes, and important ones include bipartite graphs (which admit a partition into two independent ∗A preliminary version of this paper appeared in SWAT 2016, volume 53 of LIPIcs, pages 14:1–14:14. -
Linearized and Single-Pass Belief Propagation
Linearized and Single-Pass Belief Propagation Wolfgang Gatterbauer Stephan Gunnemann¨ Danai Koutra Christos Faloutsos Carnegie Mellon University Carnegie Mellon University Carnegie Mellon University Carnegie Mellon University [email protected] [email protected] [email protected] [email protected] ABSTRACT DR TS HAF D 0.8 0.2 T 0.3 0.7 H 0.6 0.3 0.1 How can we tell when accounts are fake or real in a social R 0.2 0.8 S 0.7 0.3 A 0.3 0.0 0.7 network? And how can we tell which accounts belong to F 0.1 0.7 0.2 liberal, conservative or centrist users? Often, we can answer (a) homophily (b) heterophily (c) general case such questions and label nodes in a network based on the labels of their neighbors and appropriate assumptions of ho- Figure 1: Three types of network effects with example coupling mophily (\birds of a feather flock together") or heterophily matrices. Shading intensity corresponds to the affinities or cou- (\opposites attract"). One of the most widely used methods pling strengths between classes of neighboring nodes. (a): D: for this kind of inference is Belief Propagation (BP) which Democrats, R: Republicans. (b): T: Talkative, S: Silent. (c): H: iteratively propagates the information from a few nodes with Honest, A: Accomplice, F: Fraudster. explicit labels throughout a network until convergence. A well-known problem with BP, however, is that there are no or heterophily applies in a given scenario, we can usually give known exact guarantees of convergence in graphs with loops. -
Α Belief Propagation for Approximate Inference
α Belief Propagation for Approximate Inference Dong Liu, Minh Thanh` Vu, Zuxing Li, and Lars K. Rasmussen KTH Royal Institute of Technology, Stockholm, Sweden. e-mail: fdoli, mtvu, zuxing, [email protected] Abstract Belief propagation (BP) algorithm is a widely used message-passing method for inference in graphical models. BP on loop-free graphs converges in linear time. But for graphs with loops, BP’s performance is uncertain, and the understanding of its solution is limited. To gain a better understanding of BP in general graphs, we derive an interpretable belief propagation algorithm that is motivated by minimization of a localized α-divergence. We term this algorithm as α belief propagation (α-BP). It turns out that α-BP generalizes standard BP. In addition, this work studies the convergence properties of α-BP. We prove and offer the convergence conditions for α-BP. Experimental simulations on random graphs validate our theoretical results. The application of α-BP to practical problems is also demonstrated. I. INTRODUCTION Bayesian inference provides a general mathematical framework for many learning tasks such as classification, denoising, object detection, and signal detection. The wide applications include but not limited to imaging processing [34], multi-input-multi-output (MIMO) signal detection in digital communication [2], [10], inference on structured lattice [5], machine learning [19], [14], [33]. Generally speaking, a core problem to these applications is the inference on statistical properties of a (hidden) variable x = (x1; : : : ; xN ) that usually can not be observed. Specifically, practical interests usually include computing the most probable state x given joint probability p(x), or marginal probability p(xc), where xc is subset of x. -
Sub-Coloring and Hypo-Coloring Interval Graphs⋆
Sub-coloring and Hypo-coloring Interval Graphs? Rajiv Gandhi1, Bradford Greening, Jr.1, Sriram Pemmaraju2, and Rajiv Raman3 1 Department of Computer Science, Rutgers University-Camden, Camden, NJ 08102. E-mail: [email protected]. 2 Department of Computer Science, University of Iowa, Iowa City, Iowa 52242. E-mail: [email protected]. 3 Max-Planck Institute for Informatik, Saarbr¨ucken, Germany. E-mail: [email protected]. Abstract. In this paper, we study the sub-coloring and hypo-coloring problems on interval graphs. These problems have applications in job scheduling and distributed computing and can be used as “subroutines” for other combinatorial optimization problems. In the sub-coloring problem, given a graph G, we want to partition the vertices of G into minimum number of sub-color classes, where each sub-color class induces a union of disjoint cliques in G. In the hypo-coloring problem, given a graph G, and integral weights on vertices, we want to find a partition of the vertices of G into sub-color classes such that the sum of the weights of the heaviest cliques in each sub-color class is minimized. We present a “forbidden subgraph” characterization of graphs with sub-chromatic number k and use this to derive a a 3-approximation algorithm for sub-coloring interval graphs. For the hypo-coloring problem on interval graphs, we first show that it is NP-complete and then via reduction to the max-coloring problem, show how to obtain an O(log n)-approximation algorithm for it. 1 Introduction Given a graph G = (V, E), a k-sub-coloring of G is a partition of V into sub-color classes V1,V2,...,Vk; a subset Vi ⊆ V is called a sub-color class if it induces a union of disjoint cliques in G. -
Shrub-Depth a Successful Depth Measure for Dense Graphs Graphs Petr Hlinˇen´Y Faculty of Informatics, Masaryk University Brno, Czech Republic
page.19 Shrub-Depth a successful depth measure for dense graphs graphs Petr Hlinˇen´y Faculty of Informatics, Masaryk University Brno, Czech Republic Petr Hlinˇen´y, Sparsity, Logic . , Warwick, 2018 1 / 19 Shrub-depth measure for dense graphs page.19 Shrub-Depth a successful depth measure for dense graphs graphs Petr Hlinˇen´y Faculty of Informatics, Masaryk University Brno, Czech Republic Ingredients: joint results with J. Gajarsk´y,R. Ganian, O. Kwon, J. Neˇsetˇril,J. Obdrˇz´alek, S. Ordyniak, P. Ossona de Mendez Petr Hlinˇen´y, Sparsity, Logic . , Warwick, 2018 1 / 19 Shrub-depth measure for dense graphs page.19 Measuring Width or Depth? • Being close to a TREE { \•-width" sparse dense tree-width / branch-width { showing a structure clique-width / rank-width { showing a construction Petr Hlinˇen´y, Sparsity, Logic . , Warwick, 2018 2 / 19 Shrub-depth measure for dense graphs page.19 Measuring Width or Depth? • Being close to a TREE { \•-width" sparse dense tree-width / branch-width { showing a structure clique-width / rank-width { showing a construction • Being close to a STAR { \•-depth" sparse dense tree-depth { containment in a structure ??? (will show) Petr Hlinˇen´y, Sparsity, Logic . , Warwick, 2018 2 / 19 Shrub-depth measure for dense graphs page.19 1 Recall: Width Measures Tree-width tw(G) ≤ k if whole G can be covered by bags of size ≤ k + 1, arranged in a \tree-like fashion". Petr Hlinˇen´y, Sparsity, Logic . , Warwick, 2018 3 / 19 Shrub-depth measure for dense graphs page.19 1 Recall: Width Measures Tree-width tw(G) ≤ k if whole G can be covered by bags of size ≤ k + 1, arranged in a \tree-like fashion". -