On Mathematical Description of Probabilistic Conditional

Total Page:16

File Type:pdf, Size:1020Kb

On Mathematical Description of Probabilistic Conditional Institute of Information Theory and Automation Academy of Sciences of the Czech Republic On mathematical description of probabilistic conditional indep endence structures Milan Studen y Prague May The thesis for the degree Do ctor of Science in the eld mathematical informatics and theoretical cyb ernetics The thesis was written in the Department of DecisionMaking Theory of the Institute of Information Theory and Automation Academy of Sciences of the Czech Republic and was supp orted by the pro jects K and GACR n It is a result of a longterm research p erformed also within the framework of ESF research program Highly Structured Sto chastic Systems for and the program Causal Interpretation and Identication of Conditional Indep endence Structures of the Fields Institute for Research in Mathematical Sciences University of Toronto Canada September November Address of the author Milan Studen y Institute of Information Theory and Automation Academy of Sciences of the Czech Republic Pod vodarenskou vez Prague Czech Republic email studenyutiacascz web page httpwwwutiacasczuse r datastudenystudeny homehtml Contents Introduction Motivation account Goals of the work Structure of the work Basic concepts Conditional indep endence Semigraphoid prop erties Formal indep endence mo dels Semigraphoids Elementary indep endence statements Problem of axiomatic characterization Classes of probability measures Marginally continuous measures Factorizable measures Multiinformation and conditional pro duct Prop erties of multiinformation function Positive measures Gaussian measures Basic construction Imsets Graphical metho ds Undirected graphs Acyclic directed graphs Classic chain graphs Within classic graphical mo dels Decomp osable mo dels Recursive causal graphs Lattice conditional indep endence mo dels Bubble graphs Advanced graphical mo dels General directed graphs Recipro cal graphs Jointresponse chain graphs Covariance graphs Alternative chain graphs Annotated graphs Hidden variables Ancestral graphs MC graphs Incompleteness of graphical approaches Structural imsets fundamentals Basic class of distributions Discrete measures Nondegenerate Gaussian measures Nondegenerate conditional Gaussian measures Classes of structural imsets Elementary imsets Semielementary and combinatorial imsets Structural imsets Pro duct formula induced by a structural imset Examples of reference systems of measures Topological assumptions Markov condition Semigraphoid induced by a structural imset Markovian measures Equivalence result Description of probabilistic mo dels Sup ermo dular set functions Semigraphoid pro duced by a sup ermo dular function Strong equivalence of sup ermo dular functions Skeletal sup ermo dular functions Skeleton Signicance of skeletal imsets Description of mo dels by structural imsets Galois connection Formal concept analysis Lattice of structural mo dels Markov equivalence Two concepts of equivalence Facial and Markov equivalence Facial implication Direct characterization of facial implication Skeletal characterization of facial implication Adaptation to a distribution framework Testing facial implication Testing structural imsets Grade Invariants of facial equivalence The problem of representative choice Baricentral imsets Standard imsets Translation of DAG mo dels Translation of decomp osable mo dels Imsets of the least degree Strong facial implication Minimal generators Width Determining and unimarginal classes Imsets with the least lower class Exclusivity of standard imsets Other ways of representation Pattern Dual description Op en problems Unsolved theoretical problems Miscellaneous topics Classication of skeletal imsets Op erations with structural mo dels Reductive op erations Expansive op erations Accumulative op erations Implementation tasks Interpretation and learning tasks Meaningful description of structural mo dels Distribution frameworks and learning Conclusions App endix Classes of sets Posets and lattices Graphs .
Recommended publications
  • Hyper and Structural Markov Laws for Graphical Models
    Hyper and Structural Markov Laws for Graphical Models Simon Byrne Clare College and Statistical Laboratory, Department of Pure Mathematics and Mathematical Statistics This dissertation is submitted for the degree of Doctor of Philosophy University of Cambridge September 2011 Declaration This dissertation is the result of my own work and includes nothing which is the outcome of work done in collaboration except where specifically indi- cated in the text. iii Acknowledgements I would like to thank everyone who helped in the writing of this thesis, especially my supervisor Philip Dawid for the thoughtful discussions and advice over the past three years. I would also like to thank my colleagues in the Statistical Laboratory for their support and friendship. Finally I would like to thank my family, especially Rebecca, without whom I would never have been able to complete this work. v Summary My thesis focuses on the parameterisation and estimation of graphical mod- els, based on the concept of hyper and meta Markov properties. These state that the parameters should exhibit conditional independencies, similar to those on the sample space. When these properties are satisfied, parameter estima- tion may be performed locally, i.e. the estimators for certain subsets of the graph are determined entirely by the data corresponding to the subset. Firstly, I discuss the applications of these properties to the analysis of case-control studies. It has long been established that the maximum like- lihood estimates for the odds-ratio may be found by logistic regression, in other words, the “incorrect” prospective model is equivalent to the correct retrospective model.
    [Show full text]
  • Extended Conditional Independence and Applications in Causal Inference
    The Annals of Statistics 2017, Vol. 45, No. 6, 1–36 https://doi.org/10.1214/16-AOS1537 © Institute of Mathematical Statistics, 2017 EXTENDED CONDITIONAL INDEPENDENCE AND APPLICATIONS IN CAUSAL INFERENCE BY PANAYIOTA CONSTANTINOU AND A. PHILIP DAWID University of Warwick and University of Cambridge The goal of this paper is to integrate the notions of stochastic conditional independence and variation conditional independence under a more general notion of extended conditional independence. We show that under appropri- ate assumptions the calculus that applies for the two cases separately (axioms of a separoid) still applies for the extended case. These results provide a rig- orous basis for a wide range of statistical concepts, including ancillarity and sufficiency, and, in particular, the Decision Theoretic framework for statistical causality, which uses the language and calculus of conditional independence in order to express causal properties and make causal inferences. 1. Introduction. Conditional independence is a concept that has been widely studied and used in Probability and Statistics. The idea of treating conditional in- dependence as an abstract concept with its own calculus was introduced by Dawid [12], who showed that many results and theorems concerning statistical concepts such as ancillarity, sufficiency, causality, etc., are just applications of general prop- erties of conditional independence—extended to encompass stochastic and non- stochastic variables together. Properties of conditional independence have also been investigated by Spohn [50] in connection with causality, and Pearl and Paz [45], Pearl [44], Geiger et al. [34], Lauritzen et al. [42] in connection with graphi- cal models. For further related theory, see [15–19, 27, 30].
    [Show full text]
  • Extended Conditional Independence and Applications in Causal Inference
    Extended Conditional Independence and Applications in Causal Inference Panayiota Constantinou and A. Philip Dawid December 2, 2015 Abstract The goal of this paper is to integrate the notions of stochastic conditional independence and variation conditional independence under a more general no- tion of extended conditional independence. We show that under appropriate assumptions the calculus that applies for the two cases separately (axioms of a separoid) still applies for the extended case. These results provide a rigorous ba- sis for a wide range of statistical concepts, including ancillarity and sufficiency, and, in particular, the Decision Theoretic framework for statistical causality, which uses the language and calculus of conditional independence in order to express causal properties and make causal inferences. 1 Introduction Conditional independence is a concept that has been widely studied and used in Prob- ability and Statistics. The idea of treating conditional independence as an abstract concept with its own calculus was introduced by Dawid [1979a], who showed that many results and theorems concerning statistical concepts such as ancillarity, sufficiency, arXiv:1512.00245v1 [math.ST] 1 Dec 2015 causality etc., are just applications of general properties of conditional independence— extended to encompass stochastic and non-stochastic variables together. Properties of conditional independence have also been investigated by Spohn [1994] in con- nection with causality, and Pearl and Paz [1986], Pearl [2000], Geiger et al. [1990], Lauritzen et al. [1990] in connection with graphical models. In this paper, we consider two separate concepts of conditional independence: stochastic conditional independence, which involves solely stochastic variables, and variation conditional independence, which involves solely non-stochastic variables.
    [Show full text]
  • A Charatheodory Theorem for Closed Semispaces
    A Charatheodory Theorem for Closed Semispaces. J. Arocha, J. Bracho and L. Montejano Dedicated to Tudor Zamfirescu June 13, 2005 Abstract Following the spirit of Hadwiger´s transversal theorem we stablish a Caratheodory type theorem for closed halfspaces in which a combina- torial structure is required. Key words. Caratheodory Theorem, matroid, order type. AMS Subject classification 52A35 52C35 1 Introduction n Let F = x1, ..., xm be a finite collection of points in S . The classic Caratheodory{ Theorem} (see [2]) asserts that F iscontainedinanopensemi- sphere of Sn if and only if every subset of n +2points of F is contained in an open semisphere of Sn. The purpose of this paper is to study the same problem but for closed semispheres. First of all note that a square inscribed in S1 has the property that any three of its vertices is contained in a closed semicircle of S1 although the whole square is not contained in a closed semicircle of S1. As in Hadwiger´s Transversal Theorem [4], it is required an extra combinatorial hypothesis, in this case, the order. That is, suppose F = x1, ..., xm is an ordered collection of points in S1. Then, F is contained in a{ closed semicircle} of S1 if and only if every subset A of three points of F is contained in an open semicircle C of S1 in such a way that the order of A is consistent with the order of the 1 semicircle C. Note that this hypothesis is not satisfied for F,whenF consists of the four vertices of an inscribed square.
    [Show full text]
  • On the Acyclic Macphersonian
    On the homotopy type of the Acyclic MacPhersonian Ricardo Strausz [email protected] September 24, 2019 In 2003 Daniel K. Biss published, in the Annals of Mathematics [1], what he thought was a solution of a long standing problem, culminating a discovery due to Gel'fand and MacPher- son [4]; namely, he claimed that the MacPhersonian |a.k.a. the matroid Grassmannian| has the same homotoy type of the Grassmannian. Six years later he was encouraged to publish an \erratum" to his proof [2], observed by Nikolai Mn¨ev;up to now, the homotopy type of the MacPhersonian remains a mystery... d The Acyclic MacPhersonian AMn is to the MacPhersonian, as it is the class of all acyclic oriented matroids, of a given dimension d and order n, to the class of all oriented matroids, of the same dimension and order. The aim of this note is to study the homotopy type d of AMn in comparison with the Grassmannian G(n − d − 1; n − 1), which consists of all n − d − 1 linear subspaces of IRn−1, with the usual topology | recall that, due to the duality given by the \orthogonal complement", G(n − d − 1; n − 1) and G(d; n − 1) are homeomorphic. For, we will define a series of spaces, and maps between them, which we summarise in the following diagram: d { d gGn / AMgn π F S η { { Ö m:c:f: ~ d t d π { Gn o Sn τ arXiv:1504.06592v2 [math.AT] 21 Sep 2019 & d { d Gcn / AMn In this diagram, the maps labeled { represents inclusions and those labeled π are natural projections; the compositions of these will prove homotopical equivalences.
    [Show full text]
  • Lecture Notes the Algebraic Theory of Information
    Lecture Notes The Algebraic Theory of Information Prof. Dr. J¨urgKohlas Institute for Informatics IIUF University of Fribourg Rue P.-A. Faucigny 2 CH { 1700 Fribourg (Switzerland) E-mail: [email protected] http://diuf.unifr.ch/tcs Version: November 24, 2010 2 Copyright c 1998 by J. Kohlas Bezugsquelle: B. Anrig, IIUF Preface In today's scientific language information theory is confined to Shannon's mathematical theory of communication, where the information to be trans- mitted is a sequence of symbols from a finite alphabet. An impressive mathe- matical theory has been developed around this problem, based on Shannon's seminal work. Whereas it has often been remarked that this theory , albeit its importance, cannot seize all the aspect of the concept of \information", little progress has been made so far in enlarging Shannon's theory. In this study, it is attempted to develop a new mathematical theory cov- ering or deepening other aspects of information than Shannon's theory. The theory is centered around basic operations with information like aggregation of information and extracting of information. Further the theory takes into account that information is an answer, possibly a partial answer, to ques- tions. This gives the theory its algebraic flavor, and justifies to speak of an algebraic theory of information. The mathematical theory presented her is rooted in two important de- velopments of recent years, although these developments were not originally seen as part of a theory of information. The first development is local compu- tation, a topic starting with the quest for efficient computation methods in probabilistic networks.
    [Show full text]
  • UNIVERSALITY of SEPAROIDS 1. Preliminaries in Order to Be Self
    ARCHIVUM MATHEMATICUM (BRNO) Tomus 42 (2006), 85 – 101 UNIVERSALITY OF SEPAROIDS JAROSLAV NESETˇ RILˇ AND RICARDO STRAUSZ 2S Abstract. A separoid is a symmetric relation † ⊂ ` 2 ´ defined on disjoint pairs of subsets of a given set S such that it is closed as a filter in the canonical partial order induced by the inclusion (i.e., A † B A′ † B′ ⇐⇒ A ⊆ A′ and B ⊆ B′). We introduce the notion of homomorphism as a map which preserve the so-called “minimal Radon partitions” and show that separoids, endowed with these maps, admits an embedding from the category of all finite graphs. This proves that separoids constitute a countable universal partial order. Furthermore, by embedding also all hypergraphs (all set systems) into such a category, we prove a “stronger” universality property. We further study some structural aspects of the category of separoids. We completely solve the density problem for (all) separoids as well as for separoids of points. We also generalise the classic Radon’s theorem in a categorical setting as well as Hedetniemi’s product conjecture (which can be proved for oriented matroids). 1. Preliminaries In order to be self-contained, we start with some basic definitions and examples. 1.1. The objects. A separoid [1, 3, 8, 9, 10, 14, 15, 16, 17] is a (finite) set S S 2 endowed with a symmetric relation †⊂ 2 defined on its family of subsets with the following properties: if A, B ⊆ S then • A † B =⇒ A ∩ B = ∅ , •• A † B and B ⊂ B′ ⊆ S \ A =⇒ A † B′ . A related pair A † B is called a Radon partition (and we often say “A is not separated from B”).
    [Show full text]
  • Algebraic Structure of Information
    Algebraic Structure of Information J¨urgKohlas ∗ Departement of Informatics University of Fribourg CH { 1700 Fribourg (Switzerland) E-mail: [email protected] http://diuf.unifr.ch/drupal/tns/juerg kohlas March 23, 2016 Abstract Information comes in pieces which can be aggregated or combined and from each piece of information the part relating to a given question can be extracted. This consideration leads to an algebraic structure of information, called an information algebra. It is a generalisation of valuation algebras formerly introduced for the purpose of generic local computation in certain inference problems similar to Bayesian networks, but for more general uncertainty formalisms or representa- tions of information. The new issue is that the algebra is based on a mathematical framework for describing conditional independence and irrelevance. The older valuation algebras are special cases of the new generalised information algebras. It is shown that these algebras allow for generic local computation in Markov trees. The algebraic theory of generalised information algebras is elaborated to some extend. The duality theory between labeled and domain-free versions is presented. A new issue is the development of information order, not only for idempotent algebras, as has been done formerly, but more generally also for non-idempotent algebras. Further, for the case of idempotent information algebras, issues relating to finiteness of information and approximation are discussed, generalizing results known for the more special case of idempotent valuation algebras. ∗Research supported by grant No. 2100{042927.95 of the Swiss National Foundation for Research. 1 CONTENTS 2 Contents 1 Introduction 3 2 Conditional Independence 8 2.1 Quasi-Separoids .
    [Show full text]
  • Open Problems of Separoids
    Open problems of separoids Ricardo Strausz [email protected] October 31, 2004 2S A separoid is a set S endowed with a symmetric relation † ⊂ 2 defined on its pairs of disjoint subsets which is closed as a filter in the natural partial order induced by the inclusion. That is, if A, B ⊆ S then • A † B =⇒ A ∩ B = ∅, •• A † B & B ⊂ B0 (⊆ S \ A) =⇒ A † B0. The pair A † B is a Radon partition, each part (A and B) is a component and the union A ∪ B is the support of the partition. The separoid is acyclic if A † B =⇒ |A||B| > 0. Theorem 1 (Arocha et al. [1] and Strausz [7, 8]) Every finite separoid can be represented by a family of convex sets in some Euclidean space; that is, for every separoid S there exists a family d of convex sets F = {Ci ⊆ IE : i ∈ S} such that, for A, B ⊆ S (∗) A † B ⇐⇒ hCa ∈ F : a ∈ Ai ∩ hCb ∈ F : b ∈ Bi 6= ∅ (and A ∩ B = ∅), where h·i denotes the convex hull. Furthermore, if the separoid is acyclic, then such a representation can be done in the (|S| − 1)-dimensional Euclidean space. For the acyclic case, let S be identified with the set {1, . , n}. For each element i ∈ S and each i n pair A † B such that i ∈ A, let ρA†B be the point of IR defined by " # 1 1 X 1 X ρi = e + e − e , A†B i 2 |B| b |A| a b∈B a∈A n where {ei} is the canonical basis of IR .
    [Show full text]
  • On Separoids
    =3pc 0 On Separoids On Separoids Ricardo Strausz UNAM I want to thank to the Alfr´ed R´enyi Institute of Mathematics, the Hungarian Academy of Sciences, the Central European University and the Soros Foundation for their support during the time I wrote this Ph.D. thesis. 2001-2002 Nagyszuleim¨ emlek´ ere´ Y a la memoria de mi abuelo huasteco Victor Neumann-Lara Pr´ologo El material que se desarrolla en las p´aginasde esta tesis, es la culminaci´onde la investigaci´onque comenc´e,junto con mi tutor y otros colaboradores, en el ´ultimolustro del siglo pasado. Si bien parte de este material ya ha aparecido en otros textos, la recopilaci´onfinal fue hecha durante una estancia en Budapest, Hungr´ıa. Dicha estancia fu´efinanciada por la Fundaci´onSoros (a trav´esde la Universidad de Europa Central) que, a manera de intercambio, me comprometi´o a terminar mi tesis y dejar ah´ı una copia de ´esta;es por esto que el cuerpo principal de esta tesis se presenta en ingl´es. Sin embargo, incluye tambi´en un amplio resumen en espa˜nol(con referencias a los resultados principales) para facilitar su lectura. El texto, adem´asde ser un tratado de la teor´ıade los separoides, pretende ser autocontenido y explicativo; la teor´ıaes muy nueva asi que no supuse ning´un conocimiento previo —salvo, por supuesto, una formaci´ons´olidaen las ramas m´ascomunes de la Matem´atica. Esto me llev´oa organizar el material en un orden “que se llevara bien” con la l´ogicaque surge detr´asde los resultados, m´asque con un orden hist´orico o, quiz´as, did´actico.
    [Show full text]
  • Dependence Logic in Pregeometries and Omega-Stable Theories
    DEPENDENCE LOGIC IN PREGEOMETRIES AND !-STABLE THEORIES GIANLUCA PAOLINI AND JOUKO VA¨ AN¨ ANEN¨ Abstract. We present a framework for studying the concept of independence in a general context covering database theory, algebra and model theory as special cases. We show that well-known axioms and rules of independence for making inferences concerning basic atomic independence statements are complete with respect to a variety of semantics. Our results show that the uses of independence concepts in as different areas as database theory, algebra and model theory, can be completely characterized by the same axioms. We also consider concepts related to independence, such as dependence. 1. Introduction The concepts of dependence and independence are ubiquitous in science. They appear e.g. in biology, physics, economics, statistics, game theory, database theory, and last but not least, in algebra. Dependence and independence concepts of al- gebra have natural analogues in model theory, more exactly in geometric stability theory, arising from pregeometries and non-forking. In this paper we show that all these dependence and independence concepts have a common core that permits a complete axiomatization. For a succinct presentation of our results we introduce the following auxiliary concept. Definition 1.1. An independence structure is a pair S = (I; ?) where I is a non- empty set and ? is a binary relation in the set of finite subsets of I, satisfying the following axioms (we use boldface symbols x; y, etc for finite set, and shorten x [y to xy): I1: x ?; I2: If x ? y, then y ? x I3: If x ? yz , then x ? y I4: If x ? y and xy ? z , then x ? yz .
    [Show full text]
  • Uva-DARE (Digital Academic Repository)
    UvA-DARE (Digital Academic Repository) Causal Calculus in the Presence of Cycles, Latent Confounders and Selection Bias Forré, P.; Mooij, J.M. Publication date 2019 Document Version Author accepted manuscript Published in Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence Link to publication Citation for published version (APA): Forré, P., & Mooij, J. M. (2019). Causal Calculus in the Presence of Cycles, Latent Confounders and Selection Bias. In A. Globerson, & R. Silva (Eds.), Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence: UAI 2019, Tel Aviv, Israel, July 22-25, 2019 [15] AUAI Press. http://auai.org/uai2019/proceedings/papers/15.pdf General rights It is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), other than for strictly personal, individual use, unless the work is under an open content license (like Creative Commons). Disclaimer/Complaints regulations If you believe that digital publication of certain material infringes any of your rights or (privacy) interests, please let the Library know, stating your reasons. In case of a legitimate complaint, the Library will make the material inaccessible and/or remove it from the website. Please Ask the Library: https://uba.uva.nl/en/contact, or a letter to: Library of the University of Amsterdam, Secretariat, Singel 425, 1012 WP Amsterdam, The Netherlands. You will be contacted as soon as possible. UvA-DARE is a service provided by the library of the University of Amsterdam (https://dare.uva.nl) Download date:01 Oct 2021 Causal Calculus in the Presence of Cycles, Latent Confounders and Selection Bias Patrick Forré Joris M.
    [Show full text]