DEFINITE DETERMINANTAL REPRESENTATIONS VIA ORTHOSTOCHASTIC MATRICES
PAPRI DEY
Abstract. Determinantal polynomials play a crucial role in semidefinite programming problems. Helton-Vinnikov proved that real zero (RZ) bivariate polynomials are deter- minantal. However, it leads to a challenging problem to compute such a determinantal representation. We provide a necessary and sufficient condition for the existence of defi- nite determinantal representation of a bivariate polynomial by identifying its coefficients as scalar products of two vectors where the scalar products are defined by orthostochas- tic matrices. This alternative condition enables us to develop a method to compute a monic symmetric/Hermitian determinantal representations for a bivariate polynomial of degree d. In addition, we propose a computational relaxation to the determinantal problem which turns into a problem of expressing the vector of coefficients of the given polynomial as convex combinations of some specified points. We also characterize the range set of vector coefficients of a certain type of determinantal bivariate polynomials. AMS Classification (2010). 15A75, 15B10, 15B51, 90C22. Keywords. Semidefinite Programming, LMI Representable sets, Determinantal Polyno- mials, RZ Polynomials, Orthostochastic matrices, Exterior algebra.
1. Introduction One of the objectives in convex algebraic geometry is to characterize convex semi- algebraic sets which are definite LMI representable sets. A set S ⊆ Rn is said to be LMI representable if n (1) S = {x ∈ R : A0 + x1A1 + x2A2 + ··· + xnAn 0} T for some real symmetric matrices Ai, i = 0, . . . , n and x = (x1, . . . , xn) . If A0 = I( 0), the set S is called a monic (definite) LMI representable set. By A 0( 0) we mean that the matrix A is positive (semi)-definite. A spectrahedron which is the feasible set of a semidefinite programming (SDP) problem is an another term used for a LMI representable set. An approximate optimal solution of a SDP can be found by applying interior point arXiv:1708.09559v3 [math.OC] 30 Jan 2019 methods [NN94], [BGFB] when the SDP is strictly feasible. The assumption of strict feasibility of a SDP problem is equivalent to the assumption that its feasible set has nonempty interior. It is proved that if the set S has non-empty interior, the constant
coefficient matrix A0 can be chosen to be positive definite [[Ram95],section 1.4], [Nt12]. So, if the feasible set of an optimization problem is a definite linear matrix inequality (LMI) representable set, the optimization problem can be transformed into a SDP problem [Ram95], [HV07]. The technique of converting optimization problems into semidefinite programming (SDP) problems arise in control theory, signal processing and many other areas in engineering. Now we briefly talk about the connection between LMI representable sets and determinantal polynomials. 1 2 PAPRI DEY
A polynomial f(x) ∈ R[x] is said to be a determinantal polynomial if it can be written as
(2) f(x) = det(A0 + x1A1 + x2A2 + ··· + xnAn), where coefficient matrices Ai of linear matrix polynomial are symmetric/Hermitian of some order greater than the degree of the polynomial and the constant coefficient ma- trix A0 is positive definite. Then the algebraic interior associated with f(x) i.e., the closure of a (arcwise) connected component of {x ∈ Rn : f(x) > 0} is a spectrahedron [HV07]. Thus one of the successful techniques to deal with characterizing definite LMI representable sets is to characterize determinantal polynomials. So, we focus on defi- nite (monic) symmetric/Hermitian determinantal representation in order to accomplish the connection between determinantal polynomials and semidefinite programming (SDP) problems. The determinantal polynomials are of special kind of polynomial, called real zero (RZ) polynomials [HV07]. A multivariate polynomial f(x) ∈ R[x] is said to be a real zero (RZ) polynomial if the polynomial has only real zeros when it is restricted to any line passing through origin i.e., for any x ∈ Rn, all the roots of the univariate polynomial fx(t) := f(t · x) are real and f(0) 6= 0. The polynomial f(x) is called strictly RZ if all these roots are distinct, for all x ∈ Rn, x 6= 0. The homogenization of a RZ polynomial is known as hyperbolic polynomial which has a vast area of research on its own. For example, one can see [Nui69], [Hen10],[Br¨a10],[PSV12], [KPV15], [LP17], [JT18], [DP18] from the literature.
Helton-Vinnikov have proved that a RZ bivariate polynomial f(x1, x2) always admits monic Hermitian as well as symmetric determinantal representations of size d [HV07]. The homogenized version of this result is known as Lax conjecture [LPR05]. However, it is computationally difficult task to test whether a given polynomial is hyperbolic (with respect to a fixed point e). The precise complexity is only known in some special cases (see [RRS]). The authors in [HV07] have provided explicit expressions of the coefficient matrices of a symmetric determinantal representation in terms of theta functions, and the period matrix of the curve f(x1, x2) = 0 when the curve is defined by a strictly RZ bivariate polynomial f(x1, x2). Indeed, it is not easy to compute determinantal representation numerically or symbolically using this method. Later, the problem of computing monic symmetric/Hermitian determinantal representation for a strictly RZ bivariate polynomial has been widely studied, for example one can see [Dix02], [PSV12], [Hen10], [GKVVW14].
The results of this paper. In this paper, we provide a representation of the vector coefficient of mixed monomials of a determinantal bivariate polynomial as scalar product of two known vectors with different defining matrices. It is shown that defining matrices are orthostochastic matrices. This provides us a necessary and sufficient condition for the existence of a monic symmetric/Hermitian determinantal representation, see the Theorem 2.8. This necessary and sufficient condition can be treated as an alternative condition for a bivariate polynomial to be a determinantal polynomial, although it’s different from RZ property of a polynomial in the sense of computation. An important fact about DEFINITE DETERMINANTAL REPRESENTATIONS VIA ORTHOSTOCHASTIC MATRICES 3 this alternative condition is that it provides a mean to compute such a determinantal representation as opposed to RZ condition. The proposed method can compute the eigenvalues and diagonal entries of coefficient matrices by solving roots of certain univariate polynomials which are restrictions of the given polynomial along coordinates and solving systems of linear equations respectively, see Lemma 2.2 and Corollary 2.5. As we need the coefficient matrices to be symmet- ric/Hermitian, so all the eigenvalues and diagonal entries of coefficient matrices must be real. Thus these results provide a few necessary conditions for the given polynomial of degree d to be a determinantal polynomial of size d. More precisely, if such a representation exists, these two vectors in scalar product are uniquely (up to ordering) constructed from the eigenvalues of the symmetric/Hermitian coefficient matrices of a determinantal representation of the given polynomial and the defining matrices are obtained as (complex) Hadamard product of exterior powers of an orthogonal (unitary) matrix V with themselves. In fact, these matrices are orthostochastic (resp. unistochastic) matrices corresponding to a monic symmetric (resp. Hermitian) determinantal representation. Moreover, we translate the determinantal representation problem into finding a suitable orthostochastic/unistochastic matrix using theory of majorization, see the Theorem 2.14. This enables us to compute such a determinantal representation for a lower degree bivari- ate polynomial if it exists. The another advantage of this method is that it works even though the polynomial is not strictly RZ or the plane curve defined by the polynomial is not smooth. An interesting fact about this method is that it can completely characterize cubic determinantal polynomials (Section-2.3). Consequently, this alternative representation characterize determinantal bivariate polynomials as well as RZ bivariate polynomials. However, we also propose a computational relaxation to determinantal problem which works well for higher degree polynomials (Section-3). The necessary and sufficient con- dition for the relaxation problem turns into a polytope membership problem which is computationally easier to handle. Explicitly, in the relaxation method one needs to check whether a point which is the vector coefficient of a bivariate polynomial can be written as convex combinations of some specified points. It’s also shown how this method can help us to compute such a determinantal representation for bivariate polynomial by an example of quartic case in details. In Section 4, we study the range set of vector coefficients of mixed monomials of certain class of determinantal bivariate polynomials and have shown that it lies inside the convex hull of some specified points. All these methods have been implemented in Macaulay2 in DeterminantalRepresentations package. Monic symmetric determinantal representation is abbreviated as MSDR and monic Hermitian determinantal representation is abbreviated as MHDR in this paper. Acknowledgements. I would like to thank my PhD supervisor Prof. Harish K. Pillai for helpful discussions on the subject of this paper. I am grateful to Prof. J.W. Helton and Prof. Cynthia Vinzant for useful suggestions to express the ideas of this paper properly. I would like to thank Dr. Justin Chen for helping me to implement these methods in Macaulay2. Much of the work on this paper has been supported by Council of 4 PAPRI DEY
Scientific and Industrial Research (CSIR), India while the author was doing her doctoral work in IIT Bombay. The author also gratefully acknowledges support through the Max Planck Institute for Mathematics in the Sciences in Leipzig, Germany and the Institute for Computational and Experimental Research in Mathematics, Brown University, USA.
2. Determinantal Polynomials In this section, we convert the determinantal representations of bivariate polynomials into another problem which is concerned to find a suitable orthostochastic or unistochastic matrix corresponding to MSDR or MHDR respectively.
2.1. Determining the Eigenvalues of Coefficient Matrices: First we notice some facts about determinantal multivariate polynomials. Since the coefficient matrices of a determinantal polynomial are symmetric (Hermitian), therefore by the spectral theorem of a symmetric (Hermitian) matrix there exist a suitable orthogonal (unitary) matrix U such that one of the coefficient matrices becomes diagonal. Without loss of generality, it is enough to consider coefficient matrix associated to x1 as a diagonal matrix D1 and obtain an MSDR (MHDR) of the following form
(3) f(x) = det(I + x1D1 + x2A2 ··· + xnAn)
We explain a technique to determine the eigenvalues of the coefficient matrices D1 and Ai, 2 ≤ i ≤ n. We take restrictions of the given multivariate polynomial f(x) along each xi, i = 1, . . . , n that means we restrict the polynomial along one variable at a time by making the rest of the variables zero and generate n univariate polynomials fxi = f(0, . . . , xi,..., 0). It is known that if a multivariate polynomial f(x) admits an MSDR (MHDR), it is a RZ polynomial. By recalling the definition of RZ polynomial, we know that for any x ∈ Rn, RZ polynomial f(x) when restricted along any line passing through origin, i.e., the univariate polynomial fx(t) has only real zeros. So when a RZ polynomial f(x) restricted along xi, i = 1, . . . , n, each of them has only real zeros, i.e., all univariate polynomials fxi in xi have only real zeros. As a consequence of this result we have a necessary condition for the existence of an MSDR (MHDR) of size equal to the degree of the polynomial for a multivariate polynomial of any degree.
Lemma 2.1. If a multivariate polynomial f(x) ∈ R[x] of degree d has an MSDR (MHDR) of size d, then all the roots of fxi are real for all i = 1, . . . , n.
Moreover, the eigenvalues of matrices Ai can be found from the roots of fxi for all i = 1, . . . , n by the following Lemma which is proved in [Dey]. For the sake of completeness we include the proof here.
Lemma 2.2. The eigenvalues of coefficient matrices Ai are the negative reciprocal of the roots of univariate polynomials fxi for all i = 1, . . . , n.
Proof: The eigenvalues of coefficient matrices Ai are all real as the coefficient matrices are either symmetric or Hermitian. By Lemma 2.1, all the roots of fxi are real for all i = 1, . . . , n. If a univariate polynomial f(x) = det(xI + A) of degree d has only real zeros, so is the reversed polynomial f˜(x) := xdf(1/x). DEFINITE DETERMINANTAL REPRESENTATIONS VIA ORTHOSTOCHASTIC MATRICES 5
d ˜ On the other hand, det(tI + Ai) = t fei (ei/t) at x = ei, where ei denotes the standard basis vector in Rn. Therefore, there is a one to one correspondence between the roots of fxi and the non-zero eigenvalues of Ai and here the map is t 7→ −1/t.
Remark 2.3. Eigenvalues of Ai, i = 1, 2 are unique up to ordering. Without loss of generality arrange them in descending order.
Now we state the following theorem which talks about the relations between the co- efficients of a determinantal polynomial and the coefficient matrices of its determinantal representation by using the notion of generalized mixed discriminant of coefficient matri- ces [Dey].
Theorem 2.4. (Generalized Mixed Discriminant Theorem) The coefficients of a deter- minantal polynomial f(x) of degree d are uniquely determined by the generalized mixed discriminants of the coefficient matrices Ai as follows. If the degree of a monomial k1 k2 kn k1 kn x1 x2 . . . xn is k (i.e., k1 + k2 + ··· + kn = k) ≤ d, then the coefficient of (x1 . . . xn ) is given by Db(A1,...,A1,A2,...,A2,...,An,...,An). | {z } | {z } | {z } k1 k2 kn Then using the generalized mixed discriminant Theorem we compute the diagonal en- tries of Ai, i = 1, 2. Let T T eig(A1) := u1 := d1 d2 . . . dd , eig(A2) := w1 := r1 r2 . . . rd ,
yi := Diag(Ai), i = 1, 2
By the Theorem 2.4 the diagonal entries of coefficient matrices Ai, i = 1, 2 can be deter- mined by solving systems of linear equations of the form Giyi = zi, where (4) 1 1 ... 1 coeff of xi Pd Pd Pd−1 ri ri ... ri coeff of x x i=2 i=1,i6=2 i=1 i j P r r ...... P r r zi = . ,G1 = ik,il6=1,ik Note that G1 involves only the eigenvalues of A2. Similarly, matrix G2 which is defined by replacing ri with di in equation (4) involves only the eigenvalues of A1. α2 As z1 is the vector of coefficients of monomials of the form x1x2 , 0 ≤ α2 ≤ d − 1, so the relations are linear in terms of the entries of diagonal entries of A1 by the Theorem 2.4. In fact, this makes it possible to compute diagonal entries of A1 by solving a system of linear equations. Similarly, one can compute the diagonal entries of A2 by considering α1 the vector z2 associated with monomials x1 x2, 0 ≤ α1 ≤ d − 1 . Q Q Note that det(G1) = i Corollary 2.5. The diagonal entries of coefficient matrices (A1,A2) of a determinantal polynomial f(x) = det(I + x1A1 + x2A2) can be determined (uniquely up to ordering) by solving the systems of linear equations defined in equation (4) (provided Gi is invertible). 6 PAPRI DEY 2.2. Connections With Orthostochastic (Unistochastic) Matrices. In this sec- tion, we provide a necessary and sufficient condition for the existence of an MSDR (MHDR) of size d of a bivariate polynomial of degree d by representing its coefficients as scalar products of two vectors defined by different matrices. Note that this technique works well even if the polynomial is not a strictly RZ polynomial that means repeated eigenvalues of coefficient matrices are allowed. At first, we briefly recall some basic definitions and facts that will be used in sequel. A doubly stochastic matrix is a square matrix whose entries are nonnegative and the sum of the elements in each row and each column is unity. An othostochastic matrix is a doubly stochastic matrix whose entries are the squares of the entries of some orthogonal matrix. A unistochastic matrix is a doubly stochastic matrix whose entries are the squares of the absolute values of the entries of some unitary matrix. n n n T A bilinear form on R is a map from R ×R to R defined by (x, y) 7→ hx, yiQ = x Qy where the matrix Q ∈ Rn×n is associated with the bilinear form for all x, y ∈ Rn. The matrix Q is known as the defining matrix of the bilinear form. The form is said to be non degenerate when Q is nonsingular. This is also known as scalar product. Constructions of Vectors in Scalar Products: Let Nk = {δ := (i1, . . . , ij, . . . , ik) ∈ k N : i1 < ··· < ik, 1 ≤ k ≤ d} denotes the k ordered index set. The δ-th components of uk and wk are the the product of k ordered (with i1 < ··· < ik) eigenvalues of A1 and A2 respectively, i.e., Y Y (5) uk(δ) = di, wk(δ) = ri, δ ∈ Nk. i∈δ i∈δ Observe that the multi-indexed vectors uk and wk can be uniquely determined up to ordering of eigenvalues of Ai, i = 1, 2. Now we define another set of multi-indexed vectors. 0 c The δ -th component of uk,k0 is the sum of the product of all possible combinations of k ordered eigenvalues of A1 except those combinations which involve at least one of the 0 0 c eigenvalues of δ -th component of uk0 , and the δ -th component of wk,k0 is the sum of the product of all possible combinations of k ordered eigenvalues of A2 except those 0 combinations which involve at least one of the eigenvalues of δ -th component of wk0 , i.e., c 0 X Y c 0 X Y (6) uk,k0 (δ ) = di, wk,k0 (δ ) = ri 0 0 0 0 (δ,δ )∈Nk×Nk0 ,δ∩δ =∅ i∈δ (δ,δ )∈Nk×Nk0 ,δ∩δ =∅ i∈δ c Note that the degree of each component of uk,k0 depends on the degree of each component c of uk, the support and the order (i.e., number of components) of uk,k0 depend on the vector uk0 . Construction of Defining Matrices in Scalar Products: We use the notion of k -th exterior power of an orthogonal matrix. In order to explain what is meant by k -th exterior power of an orthogonal matrix which will be used in sequel we need to discuss some preliminaries [Win10], [TM01] d If {e1, e2, . . . , ed} is the standard basis of the vector space E over the field K(R or C), then the set of k-vectors eI := ei1 ∧ · · · ∧ eik where I = {(i1, i2, . . . , ik) : 1 ≤ i1 < ··· < d k d ik ≤ d} is a basis of the k-th exterior power of E , denoted by ∧ E for k = 1, . . . , d and thus any element in ∧kEd which is nothing but a k vector (sum of k blades) can be DEFINITE DETERMINANTAL REPRESENTATIONS VIA ORTHOSTOCHASTIC MATRICES 7 uniquely written as X X a = aI eI = ai1...ik (ei1 ∧ · · · ∧ eik ). I⊆{1,...,d},|I|=k 1≤i1<···