Moment-Entropy Inequalities for a Random Vector Erwin Lutwak, Deane Yang, and Gaoyong Zhang

Total Page:16

File Type:pdf, Size:1020Kb

Moment-Entropy Inequalities for a Random Vector Erwin Lutwak, Deane Yang, and Gaoyong Zhang 1 Moment-entropy inequalities for a random vector Erwin Lutwak, Deane Yang, and Gaoyong Zhang Abstract— The p-th moment matrix is defined for a real Throughout this paper, X denotes a random vector in Rn. random vector, generalizing the classical covariance matrix. The probability measure on Rn associated with a random Sharp inequalities relating the p-th moment and Renyi entropy vector X is denoted mX . are established, generalizing the classical inequality relating n the second moment and the Shannon entropy. The extremal We will denote the standard Lebesgue density on R by distributions for these inequalities are completely characterized. dx. By the density function fX of a random vector X, we mean the Radon-Nikodym derivative of probability measure mX with respect to Lebesgue measure. If V is a vector space and Φ: Rn → V is a continuous I. INTRODUCTION function, then the expected value of Φ(X) is given by In [9] the authors demonstrated how the classical informa- Z tion theoretic inequality for the Shannon entropy and second E[Φ(X)] = Φ(x) dmX (x). n moment of a real random variable could be extended to R inequalities for Renyi entropy and the p-th moment. The We call a random vector X nondegenerate, if E[|v·X|] > 0 extremals of these inequalities were also completely charac- for each nonzero v ∈ Rn. terized. Moment-entropy inequalities, using Renyi entropy, for discrete random variables have also been obtained by Arikan [2]. B. The p-th moment of a random vector We describe how to extend the definition of the second mo- For p ∈ (0, ∞), the standard p-th moment of a random ment matrix of a real random vector to that of the p-th moment vector X is given by matrix. Using this, we extend the moment-entropy inequalities Z p p and the characterization of the extremal distributions proved E[|X| ] = |x| dmX (x). (1) n in [9] to higher dimensions. R Variants and generalizations of the theorems presented can More generally, the p-th moment with respect to the inner be found in work of the authors [8], [10], [11] and Bastero- product h·, ·iA is Romance [3]. Z p p The authors would like to thank Christoph Haberl for his E[|X|A] = |x|A dmX (x). n careful reading of this paper and valuable suggestions for R improving it. C. The p-th moment matrix II. THE p-TH MOMENT MATRIX OF A RANDOM VECTOR The second moment matrix of a random vector X is defined A. Basic notation to be M2[X] = E[X ⊗ X], Throughout this paper we denote: n n where for v ∈ R , v ⊗ v is the linear transformation given R = n-dimensional Euclidean space by x 7→ (x · v)v. Recall that M2[X − E[X]] is the covariance x · y = standard Euclidean inner product of x, y ∈ Rn √ matrix. An important observation is that the definition of the |x| = x · x moment matrix does not use the inner product on Rn. S = positive definite symmetric n-by-n matrices A unique characterization of the second moment matrix is the following: Let M = M [X]. The inner product h·, ·i −1/2 |A| = determinant of A ∈ S 2 M is the unique one whose unit ball has maximal volume among n |K| = Lebesgue measure of K ⊂ R . all inner products h·, ·iA that are normalized so that the second 2 n moment satisfies E[|AX| ] = n. The standard Euclidean ball in R will be denoted by B, and We extend this characterization to a definition of the p-th its volume by ωn. n moment matrix M [X] for all p ∈ (0, ∞). Each inner product on R can be written uniquely as p Theorem 1: If p ∈ (0, ∞) and X is a nondegenerate n n n (x, y) ∈ R × R 7→ hx, yiA = Ax · Ay, random vector in R with finite p-th moment, then there exists a unique matrix A ∈ S such that for A ∈ S. The associated norm will be denoted by | · |A. E[|X|p ] = n E. Lutwak ([email protected]), D. Yang ([email protected]), and A G. Zhang ([email protected]) are with the Department of Mathematics, Polytechnic University, Brooklyn, New York. and were supported in part by and NSF Grant DMS-0405707. |A| ≥ |A0|, 2 0 p for each A ∈ S such that E[|X|A0 ] = n. Moreover, the matrix Observe that hλ[X, Y ] is continuous in λ. A is the unique matrix in S satisfying Lemma 2: If X and Y are random vectors such that hλ[X], h [Y ], and h [X, Y ] are finite, then I = E[AX ⊗ AX|AX|p−2]. λ λ We define the p-th moment matrix of a random vector X to hλ[X, Y ] ≥ 0. −p be Mp[X] = A , where A is given by the theorem above. The proof of the theorem is given in §IV Equality holds if and only if X = Y . Proof: If λ > 1, then by the Holder¨ inequality, λ−1 1 III. MOMENT-ENTROPY INEQUALITIES Z Z λ Z λ λ−1 λ λ fY fX dx ≤ fY dx fX dx , A. Entropy n n n R R R The Shannon entropy of a random vector X is defined to and if λ < 1, then we have be Z Z Z λ λ−1 λ λ(1−λ) h[X] = − fX log fX dx, fX = (fY fX ) fY n n n R R R Z λ Z 1−λ provided that the integral above exists. For λ > 0 the λ-Renyi λ−1 λ ≤ fY fX fY . entropy power of a density function is defined to be n n R R 1 Z 1−λ The inequality for λ = 1 follows by taking the limit λ → 1. λ fX if λ 6= 1, The equality conditions for λ 6= 1 follow from the equality Nλ[X] = n R conditions of the Holder¨ inequality. The inequality for λ = h[f] e if λ = 1, 1, including the equality condition, follows from the Jensen provided that the integral above exists. Observe that inequality (details may be found, for example, page 234 in [4]). lim Nλ[X] = N1[X]. λ→1 The λ–Renyi entropy of a random vector X is defined to be C. Generalized Gaussians We call the extremal random vectors for the moment- h [X] = log N [X]. λ λ entropy inequalities generalized Gaussians and recall their The entropy hλ[X] is continuous in λ and, by the Holder¨ definition here. inequality, decreasing in λ. It is strictly decreasing, unless X Given t ∈ R, let is a uniform random vector. t = max(t, 0). It follows by the chain rule that + Let Nλ[AX] = |A|Nλ[X], (2) Z ∞ Γ(t) = xt−1e−x dx for each A ∈ S. 0 denote the Gamma function, and let B. Relative entropy Γ(a)Γ(b) β(a, b) = Given two random vectors X, Y in Rn, their relative Shan- Γ(a + b) non entropy or Kullback–Leibler distance [6], [5], [1] (also, denote the Beta function. see page 231 in [4]) is defined by For each p ∈ (0, ∞) and λ ∈ (n/(n + p), ∞), define the Z fX standard generalized Gaussian to be the random vector Z in h1[X, Y ] = fX log dx, (3) n n n f f : → [0, ∞) R Y R whose density function Z R is given by provided that the integral above exists. Given λ > 0, we define p 1/(λ−1) ap,λ(1 + (1 − λ)|x| )+ if λ 6= 1 the relative λ–Renyi entropy power of X and Y as follows. If fZ (x) = (5) λ 6= 1, then −|x|p ap,1e if λ = 1, 1 1 Z 1−λ Z λ λ−1 λ fY fX dx fY dx where n n R R n Nλ[X, Y ] = 1 , (4) p(1 − λ) p Z λ(1−λ) if λ < 1, λ n 1 n fX dx nωnβ( p , 1−λ − p ) n R p if λ = 1, and, if λ = 1, then ap,λ = nω Γ( n ) n p h1[X,Y ] n N1[X, Y ] = e , p(λ − 1) p if λ > 1. n λ provided in both cases that the righthand side exists. Define nωnβ( p , λ−1 ) the λ–Renyi relative entropy of random vectors X and Y by Any random vector Y in Rn that can be written as Y = AZ, hλ[X, Y ] = log Nλ[X, Y ]. for some A ∈ S is called a generalized Gaussian. 3 D. Information measures of generalized Gaussians If λ 6= 1, then by (9) and (5), (1), (11), and (7), Z If 0 < p < ∞ and λ > n/(n + p), the λ-Renyi entropy λ−1 fY fX power of the standard generalized Gaussian random vector Z n is given by R Z ≥ aλ−1tn(1−λ) + (1 − λ)aλ−1tn(1−λ)−p |x|pf (x) dx 1 X n n(λ − 1) λ−1 R −1 λ−1 n(1−λ) −p p 1 + ap,λ if λ 6= 1 = a t (1 + (1 − λ)t E[|X| ]) Nλ[Z] = pλ λ−1 n(1−λ) p n −1 = a t (1 + (1 − λ)E[|Z|] ]) p e ap,1 if λ = 1 Z n(1−λ) λ = t fZ , (12) If 0 < p < ∞ and λ > n/(n + p), then the p-th moment n R of Z is given by where equality holds if λ < 1. It follows that if λ 6= 1, then h p i−1 by Lemma 2, (4), (10) and (12), and (11), we have E[|Z|p] = λ 1 + − 1 .
Recommended publications
  • Matrix-Valued Moment Problems
    Matrix-valued moment problems A Thesis Submitted to the Faculty of Drexel University by David P. Kimsey in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Mathematics June 2011 c Copyright 2011 David P. Kimsey. All Rights Reserved. ii Acknowledgments I wish to thank my advisor Professor Hugo J. Woerdeman for being a source of mathematical and professional inspiration. Woerdeman's encouragement and support have been extremely beneficial during my undergraduate and graduate career. Next, I would like to thank Professor Robert P. Boyer for providing a very important men- toring role which guided me toward a career in mathematics. I owe many thanks to L. Richard Duffy for providing encouragement and discussions on fundamental topics. Professor Dmitry Kaliuzhnyi-Verbovetskyi participated in many helpful discussions which helped shape my understanding of several mathematical topics of interest to me. In addition, Professor Anatolii Grinshpan participated in many useful discussions gave lots of encouragement. I wish to thank my Ph.D. defense committee members: Professors Boyer, Lawrence A. Fialkow, Grinshpan, Pawel Hitczenko, and Woerde- man for pointing out any gaps in my understanding as well as typographical mistakes in my thesis. Finally, I wish to acknowledge funding for my research assistantship which was provided by the National Science Foundation, via the grant DMS-0901628. iii Table of Contents Abstract ................................................................................ iv 1. INTRODUCTION ................................................................
    [Show full text]
  • Geometry of the Pfaff Lattices 3
    GEOMETRY OF THE PFAFF LATTICES YUJI KODAMA∗ AND VIRGIL U. PIERCE∗∗ Abstract. The (semi-infinite) Pfaff lattice was introduced by Adler and van Moerbeke [2] to describe the partition functions for the random matrix models of GOE and GSE type. The partition functions of those matrix models are given by the Pfaffians of certain skew-symmetric matrices called the moment matrices, and they are the τ-functions of the Pfaff lattice. In this paper, we study a finite version of the Pfaff lattice equation as a Hamiltonian system. In particular, we prove the complete integrability in the sense of Arnold-Liouville, and using a moment map, we describe the real isospectral varieties of the Pfaff lattice. The image of the moment map is a convex polytope whose vertices are identified as the fixed points of the flow generated by the Pfaff lattice. Contents 1. Introduction 2 1.1. Lie algebra splitting related to SR-factorization 2 1.2. Hamiltonian structure 3 1.3. Outline of the paper 5 2. Integrability 6 2.1. The integrals Fr,k(L) 6 2.2. The angle variables conjugate to Fr,k(L) 10 2.3. Extended Jacobian 14 3. Matrix factorization and the τ-functions 16 3.1. Moment matrix and the τ-functions 17 3.2. Foliation of the phase space by Fr,k(L) 21 3.3. Examples from the matrix models 22 4. Real solutions 23 arXiv:0705.0510v1 [nlin.SI] 3 May 2007 4.1. Skew-orthogonal polynomials 23 4.2. Fixed points of the Pfaff flows 26 4.3.
    [Show full text]
  • Pre-Publication Accepted Manuscript
    Peter Forrester, Shi-Hao Li Classical discrete symplectic ensembles on the linear and exponential lattice: skew orthogonal polynomials and correlation functions Transactions of the American Mathematical Society DOI: 10.1090/tran/7957 Accepted Manuscript This is a preliminary PDF of the author-produced manuscript that has been peer-reviewed and accepted for publication. It has not been copyedited, proofread, or finalized by AMS Production staff. Once the accepted manuscript has been copyedited, proofread, and finalized by AMS Production staff, the article will be published in electronic form as a \Recently Published Article" before being placed in an issue. That electronically published article will become the Version of Record. This preliminary version is available to AMS members prior to publication of the Version of Record, and in limited cases it is also made accessible to everyone one year after the publication date of the Version of Record. The Version of Record is accessible to everyone five years after publication in an issue. CLASSICAL DISCRETE SYMPLECTIC ENSEMBLES ON THE LINEAR AND EXPONENTIAL LATTICE: SKEW ORTHOGONAL POLYNOMIALS AND CORRELATION FUNCTIONS PETER J. FORRESTER AND SHI-HAO LI Abstract. The eigenvalue probability density function for symplectic invariant random matrix ensembles can be generalised to discrete settings involving either a linear or exponential lattice. The corresponding correlation functions can be expressed in terms of certain discrete, and q, skew orthogonal polynomials respectively. We give a theory of both of these classes of polynomials, and the correlation kernels determining the correlation functions, in the cases that the weights for the corresponding discrete unitary ensembles are classical.
    [Show full text]
  • Representational Models: a Common Framework for Understanding Encoding, Pattern-Component, and Representational-Similarity Analysis
    Representational models: A common framework for understanding encoding, pattern-component, and representational-similarity analysis Jörn Diedrichsen1 & Nikolaus Kriegeskorte2 1. Brain and Mind Institute, Western University, Canada 2. Cognitive and Brain Sciences Unit, Cambridge University, UK Abstract Representational models explain how activity patterns in populations of neurons (or, more generally, in multivariate brain activity measurements) relate to sensory stimuli, motor actions, or cognitive processes. In an experimental context, representational models can be defined as probabilistic hypotheses about what profiles of activations across conditions are likely to be observed. We describe three methods to test such models – encoding models, pattern component modeling (PCM), and representational similarity analysis (RSA). We show that these methods are closely related in that they all evaluate the statistical second moment of the activity profile distribution. Using simulated data from three different fMRI experiments, we compare the power of the approaches to adjudicate between competing representational models. PCM implements a likelihood-ratio test and therefore constitutes the most powerful test if its assumptions hold. However, the other two approaches – when conducted appropriately – can perform similarly. In encoding approaches, the linear model needs to be appropriately regularized, which imposes a prior on the activity profiles. Without such a prior, encoding approaches do not test well-defined representational models. In RSA, the unequal variances and dependencies of the distance measures can be taken into account using a multi-normal approximation to the sampling distribution of the distance measures, so as to enable optimal inference. Each of the three techniques renders different information explicit (e.g. single response tuning in encoding models and population representational dissimilarity in RSA) and has specific advantages in terms of computational demands, ease of use, and extensibility.
    [Show full text]
  • A Panorama of Positivity
    A PANORAMA OF POSITIVITY ALEXANDER BELTON, DOMINIQUE GUILLOT, APOORVA KHARE, AND MIHAI PUTINAR Abstract. This survey contains a selection of topics unified by the concept of positive semi-definiteness (of matrices or kernels), reflecting natural constraints imposed on discrete data (graphs or networks) or continuous objects (probability or mass distributions). We put empha- sis on entrywise operations which preserve positivity, in a variety of guises. Techniques from harmonic analysis, function theory, operator theory, statistics, combinatorics, and group representations are invoked. Some partially forgotten classical roots in metric geometry and distance transforms are presented with comments and full bibliographical refer- ences. Modern applications to high-dimensional covariance estimation and regularization are included. Contents 1. Introduction 2 2. From metric geometry to matrix positivity 4 2.1. Distance geometry 4 2.2. Spherical distance geometry 6 2.3. Distance transforms 6 2.4. Altering Euclidean distance 8 2.5. Positive definite functions on homogeneous spaces 11 2.6. Connections to harmonic analysis 15 3. Entrywise functions preserving positivity in all dimensions 18 Date: November 13, 2019. 2010 Mathematics Subject Classification. 15-02, 26-02, 15B48, 51F99, 15B05, 05E05, arXiv:1812.05482v3 [math.CA] 12 Nov 2019 44A60, 15A24, 15A15, 15A45, 15A83, 47B35, 05C50, 30E05, 62J10. Key words and phrases. metric geometry, positive semidefinite matrix, Toeplitz ma- trix, Hankel matrix, positive definite function, completely monotone functions, absolutely monotonic functions, entrywise calculus, generalized Vandermonde matrix, Schur polyno- mials, symmetric function identities, totally positive matrices, totally non-negative matri- ces, totally positive completion problem, sample covariance, covariance estimation, hard / soft thresholding, sparsity pattern, critical exponent of a graph, chordal graph, Loewner monotonicity, convexity, and super-additivity.
    [Show full text]
  • New Publications Offered by The
    New Publications Offered by the AMS To subscribe to email notification of new AMS publications, please go to http://www.ams.org/bookstore-email. Algebra and Algebraic Introduction Geometry to Orthogonal, Symplectic and Unitary The Schrödinger Representations of Model for the Minimal Finite Groups Representation of the Carl R. Riehm, McMaster Indefinite Orthogonal University, Hamilton, ON, Group O(p; q) Canada, and The Fields Institute, Toronto, ON, Canada Toshiyuki Kobayashi, University of Tokyo, Japan, and Gen Mano, Orthogonal, symplectic and unitary representations of finite groups lie at the crossroads of two more traditional subjects of PricewaterhouseCoopers Aarata, mathematics—linear representations of finite groups, and the Tokyo, Japan theory of quadratic, skew symmetric and Hermitian forms—and thus inherit some of the characteristics of both. Contents: Introduction; Two models of the minimal representation of O(p; q); K-finite eigenvectors in the Schrödinger model L2(C); This book is written as an introduction to the subject and not as an Radial part of the inversion; Main theorem; Bessel distributions; encyclopaedic reference text. The principal goal is an exposition of Appendix: special functions; Bibliography; List of Symbols; Index. the known results on the equivalence theory, and related matters such as the Witt and Witt-Grothendieck groups, over the “classical” Memoirs of the American Mathematical Society, Volume 213, fields—algebraically closed, real closed, finite, local and global. A Number 1000 detailed exposition of the background material needed is given in August 2011, 132 pages, Softcover, ISBN: 978-0-8218-4757-2, the first chapter. 2010 Mathematics Subject Classification: 22E30; 22E46, 43A80, It was A.
    [Show full text]
  • RANK SHIFT CONDITIONS and REDUCTIONS of 2D-TODA THEORY 3 Expressed in Terms of the Cauchy Two-Matrix Model
    RANK SHIFT CONDITIONS AND REDUCTIONS OF 2D-TODA THEORY SHI-HAO LI AND GUO-FU YU Abstract. This paper focuses on different reductions of 2-dimensional (2d-)Toda hierarchy. Symmetric and skew symmetric moment matrices are firstly considered, resulting in the differ- ential relations between symmetric/skew symmetric tau functions and 2d-Toda’s tau functions respectively. Furthermore, motivated by the Cauchy two-matrix model and Bures ensemble from random matrix theory, we study the rank one shift condition in symmetric case and rank two shift condition in skew symmetric case, from which the C-Toda hierarchy and B-Toda hierarchy are found, together with their special Lax matrices and integrable structures. 1. Introduction The studies in random matrix theory and classic integrable systems promote the development of both communities, and the orthogonal polynomials play an important role to connect these two totally different subjects. For example, in [17, §2], it is shown that these polynomials can lead to an iso-spectral flow of the Toda lattice, while the normalisation factors of the orthogonal polynomials can be expressed in terms of the partition function of the unitary invariant Hermitian matrix model. Later on, several novel random matrix models have been found in the course of doing analysis of the classical integrable systems. An example is the appearance of the Cauchy two-matrix model. This model was proposed in the studies of the peakon solutions of the Degasperis-Procesi equation and its related Hermite-Padé approximation problem [34]. To some extent, the relation between random matrix and integrable system can be described in terms of the partition function and tau functions.
    [Show full text]
  • Sums of Squares, Moment Matrices and Optimization Over Polynomials
    SUMS OF SQUARES, MOMENT MATRICES AND OPTIMIZATION OVER POLYNOMIALS MONIQUE LAURENT∗ May 8, 2008 Abstract. We consider the problem of minimizing a polynomial over a semialge- braic set defined by polynomial equations and inequalities, which is NP-hard in general. Hierarchies of semidefinite relaxations have been proposed in the literature, involving positive semidefinite moment matrices and the dual theory of sums of squares of poly- nomials. We present these hierarchies of approximations and their main properties: asymptotic/finite convergence, optimality certificate, and extraction of global optimum solutions. We review the mathematical tools underlying these properties, in particular, some sums of squares representation results for positive polynomials, some results about moment matrices (in particular, of Curto and Fialkow), and the algebraic eigenvalue method for solving zero-dimensional systems of polynomial equations. We try whenever possible to provide detailed proofs and background. Key words. positive polynomial, sum of squares of polynomials, moment problem, polynomial optimization, semidefinite programming. AMS(MOS) subject classifications. 13P10, 13J25, 13J30, 14P10, 15A99, 44A60, 90C22, 90C30. Contents 1 Introduction...............................158 1.1 The polynomial optimization problem . 159 1.2 Thescopeofthispaper . .161 1.3 Preliminaries on polynomials and semidefinite programs . 162 1.4 Contentsofthepaper . .166 2 Algebraic preliminaries . 166 2.1 Polynomial ideals and varieties . 166 2.2 The quotient algebra R[x]/I .................169 2.3 Gr¨obner bases and standard monomial bases . 171 2.4 Solving systems of polynomial equations . 173 3 Positive polynomials and sums of squares . 178 3.1 Somebasicfacts ........................178 3.2 Sums of squares and positive polynomials: Hilbert’s result . 178 3.3 Recognizing sums of squares of polynomials .
    [Show full text]
  • Arxiv:2006.16213V4 [Math.FA] 9 Jan 2021 41,4B4(Secondary)
    TOTALLY POSITIVE KERNELS, POLYA´ FREQUENCY FUNCTIONS, AND THEIR TRANSFORMS ALEXANDER BELTON, DOMINIQUE GUILLOT, APOORVA KHARE, AND MIHAI PUTINAR Abstract. The composition operators preserving total non-negativity and total pos- itivity for various classes of kernels are classified, following three themes. Letting a function act by post composition on kernels with arbitrary domains, it is shown that such a composition operator maps the set of totally non-negative kernels to itself if and only if the function is constant or linear, or just linear if it preserves total positiv- ity. Symmetric kernels are also discussed, with a similar outcome. These classification results are a byproduct of two matrix-completion results and the second theme: an extension of A.M. Whitney’s density theorem from finite domains to subsets of the real line. This extension is derived via a discrete convolution with modulated Gauss- ian kernels. The third theme consists of analyzing, with tools from harmonic analysis, the preservers of several families of totally non-negative and totally positive kernels with additional structure: continuous Hankel kernels on an interval, P´olya frequency functions, and P´olya frequency sequences. The rigid structure of post-composition transforms of totally positive kernels acting on infinite sets is obtained by combining several specialized situations settled in our present and earlier works. Contents 1. Introduction and main results 2 2. Preliminaries and overview 7 3. Total non-negativity preservers 9 4. Total-positivity preservers. I. Semi-finite domains 14 5. Total-positivity preservers are continuous 22 6. Extensions of Whitney’s approximation theorem 24 7. Totally non-negative and totally positive Hankel kernels 30 arXiv:2006.16213v4 [math.FA] 9 Jan 2021 8.
    [Show full text]
  • Lecture 1: Bivariate Linear Model
    Covariance Structure Analysis (LISREL) Professor Ross L. Matsueda Lecture Notes Do not copy, quote, or cite without permission LECTURE 1: MOMENTS AND PARAMETERS IN A BIVARIATE LINEAR STRUCTURAL MODEL I PRELIMINARIES: STRUCTURAL EQUATION MODELS. II. POPULATION MOMENTS AND PARAMETERS. III. CORRELATIONS AND STANDARDIZED COEFFICIENTS. IV. ESTIMATION AND TESTING. I. PRELIMINARIES: STRUCTURAL EQUATION MODELS This course is about the use of structural equation models for examining social phenomena. There are four basic principles involved: 1. Assume we can characterize a social phenomenon by a set of random variables. A random variable is one whose values or outcomes are governed by a probability distribution. A set of random variables have joint outcomes, which are governed by a joint probability distribution. We characterize this joint distribution by observable moments (means, variances, and covariances). 2. We assume that some underlying causal structure or model generates observed moments (covariances) The underlying structural model represents our (parsimonious) theory of the phenomenon. The model generates the data (joint distribution of random variables), which we characterize by observed moments. The parameters of the model are assumed to be invariant: they are somewhat stable over time and only change when there is true social change. The causal structure can be expressed in linear structural equations and path diagrams. Hence the terms, "covariance structure analysis," "path analysis" and "structural equation models." 3. Given observed population moments, we could compute population parameters of our structural model (assuming the model is identified). 4. However, we lack access to population moments, and therefore, we cannot compute population parameters. Instead, we have access only to sample moments.
    [Show full text]
  • A Geometric and Topological Viewpoint
    IOP Journal of Physics A: Mathematical and Theoretical J. Phys. A: Math. Theor. Journal of Physics A: Mathematical and Theoretical J. Phys. A: Math. Theor. 51 (2018) 353001 (39pp) https://doi.org/10.1088/1751-8121/aacecf 51 Topical Review 2018 © 2018 IOP Publishing Ltd Fifty years of the fnite nonperiodic Toda lattice: a geometric and topological JPHAC5 viewpoint 353001 Yuji Kodama1,3 and Barbara A Shipman2 Y Kodama and B A Shipman 1 Department of Mathematics, Ohio State University, Columbus, OH 43210, United States of America 2 Department of Mathematics, The University of Texas at Arlington, Arlington, TX Fifty years of the fnite nonperiodic Toda lattice TX 76010, United States of America E-mail: [email protected] and [email protected] Printed in the UK Received 26 January 2018, revised 20 June 2018 JPA Accepted for publication 25 June 2018 Published 23 July 2018 10.1088/1751-8121/aacecf Abstract In 1967, Japanese physicist Morikazu Toda published a pair of seminal papers in the Journal of the Physical Society of Japan that exhibited soliton solutions to a chain of particles with nonlinear interactions between nearest neighbors. In the ffty years that followed, Toda’s system of particles has been generalized in different directions, each with its own analytic, geometric, and topological 1751-8121 characteristics. These are known collectively as the Toda lattice. This survey recounts and compares the various versions of the fnite nonperiodic Toda lattice from the perspective of their geometry and topology. In particular, we highlight the polytope structure of the solution spaces as viewed through the moment map, and we explain the connection between the real indefnite Toda fows and the integral cohomology of real fag varieties.
    [Show full text]
  • Sums of Squares, Moment Matrices and Optimization Over Polynomials
    SUMS OF SQUARES, MOMENT MATRICES AND OPTIMIZATION OVER POLYNOMIALS MONIQUE LAURENT∗ Updated version: February 6, 2010 Abstract. We consider the problem of minimizing a polynomial over a semialge- braic set defined by polynomial equations and inequalities, which is NP-hard in general. Hierarchies of semidefinite relaxations have been proposed in the literature, involving positive semidefinite moment matrices and the dual theory of sums of squares of poly- nomials. We present these hierarchies of approximations and their main properties: asymptotic/finite convergence, optimality certificate, and extraction of global optimum solutions. We review the mathematical tools underlying these properties, in particular, some sums of squares representation results for positive polynomials, some results about moment matrices (in particular, of Curto and Fialkow), and the algebraic eigenvalue method for solving zero-dimensional systems of polynomial equations. We try whenever possible to provide detailed proofs and background. Key words. positive polynomial, sum of squares of polynomials, moment problem, polynomial optimization, semidefinite programming AMS(MOS) subject classifications. 13P10, 13J25, 13J30, 14P10, 15A99, 44A60, 90C22, 90C30 Contents 1 Introduction............................... 3 1.1 The polynomial optimization problem . 4 1.2 Thescopeofthispaper .................... 6 1.3 Preliminaries on polynomials and semidefinite programs . 7 1.3.1 Polynomials ..................... 7 1.3.2 Positive semidefinite matrices . 8 1.3.3 Flat extensions of matrices . 9 1.3.4 Semidefinite programs . 9 1.4 Contentsofthepaper ..................... 11 2 Algebraic preliminaries . 11 2.1 Polynomial ideals and varieties . 11 2.2 The quotient algebra R[x]/I ................. 14 2.3 Gr¨obner bases and standard monomial bases . 16 2.4 Solving systems of polynomial equations .
    [Show full text]