Norms on Complex Matrices Induced by Complete
Total Page:16
File Type:pdf, Size:1020Kb
NORMS ON COMPLEX MATRICES INDUCED BY COMPLETE HOMOGENEOUS SYMMETRIC POLYNOMIALS KONRAD AGUILAR, ÁNGEL CHÁVEZ, STEPHAN RAMON GARCIA, AND JURIJ VOLCIˇ Cˇ Abstract. We introduce a remarkable new family of norms on the space of n × n complex matrices. These norms arise from the combinatorial properties of sym- metric functions, and their construction and validation involve probability theory, partition combinatorics, and trace polynomials in noncommuting variables. Our norms enjoy many desirable analytic and algebraic properties, such as an ele- gant determinantal interpretation and the ability to distinguish certain graphs that other matrix norms cannot. Furthermore, they give rise to new dimension- independent tracial inequalities. Their potential merits further investigation. 1. Introduction In this note we introduce a family of norms on complex matrices. These are ini- tially defined in terms of certain symmetric functions of eigenvalues of complex Hermitian matrices. The fact that we deal with eigenvalues, as opposed to their absolute values, is notable. First, it prevents standard machinery, such as the the- ory of symmetric gauge functions, from applying. Second, the techniques used to establish that we indeed have norms are more complicated than one might ex- pect. For example, combinatorics, probability theory, and Lewis’ framework for group invariance in convex matrix analysis each play key roles. These norms on the Hermitian matrices are of independent interest. They can be computed recursively or directly read from the characteristic polyno- mial. Moreover, our norms distinguish certain pairs of graphs which the standard norms (operator, Frobenius, Schatten-von Neumann, Ky Fan) cannot distinguish. Our norms extend in a natural and nontrivial manner to all complex matrices. These extensions of our original norms involve partition combinatorics and trace polynomials in noncommuting variables. A Schur convexity argument permits our norms to be bounded below in terms of the mean eigenvalue of a matrix. arXiv:2106.01976v2 [math.CO] 20 Jul 2021 These norms, their unusual construction, and their potential applications sug- gest a host of open problems. We pose several at the end of the paper. 1.1. Notation. Denote by N, R, and C, respectively, the set of natural numbers, real numbers, and complex numbers. Let Hn(C) denote the set of n × n complex Hermitian matrices and Mn(C) the set of n × n complex matrices. Denote the eigenvalues of A ∈ Hn(C) by λ1(A) ≥ λ2(A) ≥···≥ λn(A) and define n λ(A) = λ1(A), λ2(A),..., λn(A) ∈ R . Key words and phrases. norm, complete homogeneous symmetric polynomial, partition, trace, pos- itivity, convexity, expectation, complexification, trace polynomial, symmetric tensor power. Third named author supported by NSF grants DMS-1800123 and DMS-2054002. Fourth named author supported by NSF grant DMS-1954709. 1 2 K. AGUILAR, Á. CHÁVEZ, S.R. GARCIA, AND J. VOLCIˇ Cˇ We may use λ and λ1, λ2,..., λn if the matrix A is clear from context. Let diag(x1, x2,..., xn) ∈ Mn(C) denote the n × n diagonal matrix with diagonal en- tries x1, x2,..., xn, in that order. If x =(x1, x2,..., xn) is understood from context, we may write diag(x) for brevity. 1.2. Complete homogeneous symmetric polynomials. The complete homogeneous symmetric (CHS) polynomial of degree d in the n variables x1, x2,..., xn is hd(x1, x2,..., xn) = ∑ xi1 xi2 ··· xid , (1) 1≤i1≤···≤id≤n the sum of all degree d monomials in x1, x2,..., xn [28, Sec. 7.5]. For example, h0(x1, x2) = 1, h1(x1, x2) = x1 + x2, 2 2 h2(x1, x2) = x1 + x1x2 + x2, and 3 2 2 3 h3(x1, x2) = x1 + x1x2 + x1x2 + x2. n+d−1 Elementary combinatorics confirms that there are precisely ( d ) summands in Rn the definition (1). We often write hd(x), in which x =(x1, x2,..., xn) ∈ , when the number of variables is clear from context. Rn For d even and x ∈ , Hunter proved that hd(x) ≥ 0, with equality if and only if x = 0 [10]. This is not obvious because some of the summands that comprise hd(x) (for d even) may be negative. Hunter’s theorem has been reproved many times; see [2], [4, p. 69 & Thm. 3], [6, Cor. 17], [25, Thm. 2.3], and [29, Thm. 1]. 1.3. Partitions and traces. A partition of d ∈ N is an r-tuple π =(π1, π2,..., πr) ∈ r N such that π1 ≥ π2 ≥ ··· ≥ πr and π1 + π2 + ··· + πr = d; the number of terms r depends on the partition π. We write π ⊢ d if π is a partition of d. For π ⊢ d, define pπ(x1, x2,..., xn) = pπ1 pπ1 ··· pπr , k k k in which pk(x1, x2,..., xn) = x1 + x2 + ··· + xn. If the length of x =(x1, x2,..., xn) is clear from context, we often write pπ(x) and pk(x), respectively. Another expression for (1) is pπ(x1, x2,..., xn) hd(x1, x2,..., xn) = ∑ , (2) π ⊢ d zπ in which the sum runs over all partitions π =(π1, π2,..., πr) of d and mi zπ = ∏ i mi!, (3) i≥1 where mi is the multiplicity of i in π [28, Prop. 7.7.6]. For example, if π = 3 1 2 (4,4,2,1,1,1), then zπ =(1 3!)(2 1!)(4 2!) = 384 [28, (7.17)]. C If A ∈ Hn( ) has eigenvalues λ =(λ1, λ2,..., λn), then π1 π2 πr pπ(λ) = pπ1 (λ)pπ2(λ) ··· pπr (λ)=(tr A )(tr A ) ··· (tr A ). (4) This connects eigenvalues, traces, and partitions to symmetric polynomials. NORMS ON COMPLEX MATRICES 3 1.4. Main results. The following theorem provides a family of novel norms on the space Hn(C) of n × n Hermitian matrices. Some special properties of these norms are discussed in Section 4. Theorem 1. For even d ≥ 2, the following is a norm on Hn(C): 1/d |||A|||d = hd λ1(A), λ2(A),..., λn(A) . For example, equations (2) and (4) yield trace-polynomial representations 1 |||A|||2 = tr(A2) + (tr A)2 , 2 2 1 |||A|||4 = (tr A)4 + 6(tr A)2 tr(A2)+ 3(tr(A2))2 + 8(tr A) tr(A3)+ 6 tr(A4) . (5) 4 24 Theorem 1 is nontrivial for several reasons. (a) The sums (1) and (2) that characterize hd(λ(A)) may contain negative sum- mands. For example, (tr A) tr(A3) in (5) can be negative for Hermitian A. (b) The relationship between the spectra of (Hermitian) A, B, and A + B, conjec- tured by A. Horn in 1962 [9], was only established in 1998-9 by Klyachko [13] and Knutson–Tao [14]. Therefore, the triangle inequality is difficult to estab- lish. Even if A and B are diagonal, the result is not obvious; see (11). (c) The sums that define these norms do not involve the absolute values of the eigenvalues of A. Theorem 1 does not follow from standard considerations, but rather from delicate properties of multivariate symmetric polynomials. Example 2. Because CHS norms do not rely upon the absolute values of the eigenvalues of a Hermitian matrix (that is, its singular values), they can some- times distinguish singularly cospectral graphs (graphs with the same singular values) that are not cospectral. This feature is not enjoyed by many standard norms (e.g., operator, Frobenius, Schatten–von Neumann, Ky Fan). For example, 0 1 1 K = 1 0 1 , 1 1 0 which has eigenvalues 2, −1, −1, is the adjacency matrix for the complete graph on three vertices. The graphs with adjacency matrices K 0 0 K A = and B = 0 K K 0 are singularly cospectral but not cospectral: their eigenvalues are −1, −1, −1, −1, 6 6 2,2 and −2, −1, −1, 1, 1, 2, respectively. However, |||A|||6 = 120 6= 112 = |||B|||6. The norms of Theorem 1 extend in a natural and nontrivial fashion to the set Mn(C) of all n × n complex matrices. Theorem 3. Let d ≥ 2 be even and let π =(π1, π2,..., πr) be a partition of d. Define C R d d Tπ :Mn( ) → by setting Tπ(A) to be 1/(d/2) times the sum over the (d/2) possible locations to place d/2 adjoints ∗ among the d copies of A in (tr AA ··· A)(tr AA ··· A) ··· (tr AA ··· A). π1 π2 πr | {z } | {z } | {z } 4 K. AGUILAR, Á. CHÁVEZ, S.R. GARCIA, AND J. VOLCIˇ Cˇ Then T (A) 1/d |||A||| = ∑ π , (6) d z π π in which the sum runs over all partitions π of d and zπ is defined in (3), is a norm on Mn(C) that restricts to the norm on Hn(C) given by Theorem 1. Example 4. The two partitions of d = 2 satisfy z(2) = 2 and z(1,1) = 2. There are 2 ∗ (1) = 2 ways to place two adjoints in a string of two As. Therefore, 1 T (A) = tr(A∗ A) + tr(AA∗) = tr(A∗ A), and (2) 2 1 T (A) = (tr A∗)(tr A)+(tr A)(tr A∗) =(tr A∗)(tr A), (1,1) 2 so 1 1 |||A|||2 = tr(A∗ A) + (tr A∗)(tr A). (7) 2 2 2 Example 5. The five partitions of d = 4 satisfy z(4) = 4, z(3,1) = 3, z(2,2) = 8, 4 ∗ z(2,1,1) = 4, and z(1,1,1,1) = 24. There are (2) = 6 ways to place two adjoints in a string of four As. For example, ∗ ∗ ∗ ∗ ∗ ∗ 6T(3,1)(A)=(tr A A A)(tr A)+(tr A AA )(tr A)+(tr A AA)(tr A ) +(tr AA∗ A∗)(tr A)+(tr AA∗ A)(tr A∗)+(tr AAA∗)(tr A∗) = 3 tr(A∗2A)(tr A) + 3(tr A2 A∗)(tr A∗).