<<

PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 138, Number 2, February 2010, Pages 495–504 S 0002-9939(09)10103-X Article electronically published on September 11, 2009

A MATRIX SUBADDITIVITY FOR SYMMETRIC NORMS

JEAN-CHRISTOPHE BOURIN

(Communicated by Marius Junge)

Dedicated to Fran¸coise Lust-Piquard, with affection

Abstract. Let f(t) be a non-negative concave on [0, ∞). We prove that  f(|A + B|) ≤f(|A|)+f(|B|)  for all normal n-by-n matrices A, B and all symmetric norms. This result has several applications. For instance, for a Hermitian A =[Ai, j ] partitioned in blocks of the same size,  f(|A|) ≤ f(|Ai, j |) . We also prove, in a similar way, that given Z expansive and A normal of the same size, ∗ ∗  f(|Z AZ|) ≤Z f(|A|)Z .

1. Some recent results for positive operators Several nice inequalities for concave functions of operators have been recently established in a series of papers [5], [8], [7] and [6]. Most of these results are matrix versions of the obvious inequality (1) f(a + b) ≤ f(a)+f(b) for non-negative concave functions f on [0, ∞) and scalars a, b ≥ 0. By matrix version we mean a suitable extension where scalars are replaced by n-by-n matrices, i.e., operators on an n-dimensional Hilbert space H. For instance, we have [8]: Theorem 1.1. Let A, B ≥ 0 and let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms,  f(A + B) ≤f(A)+f(B) . As usual, capital letters A,B,... stand for operators, A ≥ 0 refers to positive semi-definite, and a symmetric (or unitarily invariant) satisfies A = UAV for all A and all unitaries U, V . Thus, up to symmetric norms, the basic inequality (1) still holds on the cone of positive operators. This subadditivity result for norms cannot be extended to the determinant, even in the case of an operator concave

Received by the editors November 5, 2008, and, in revised form, June 8, 2009. 2000 Mathematics Subject Classification. Primary 15A60, 47A30, 47A60. Key words and phrases. Matrix inequalities, symmetric norms, normal operators, concave functions.

c 2009 American Mathematical Society Reverts to public domain 28 years from publication 495

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use 496 JEAN-CHRISTOPHE BOURIN √ function such as f(t)= t. The most elementary case in the above theorem is for the trace norm. Then, the result can be restated as a famous trace inequality [11]:

Rotfel’d inequality. Let f be a on [0, ∞) such that f(0) ≥ 0. Then, for all A, B ≥ 0, Tr f(A + B) ≤ Tr (f(A)+f(B)). In the matrix setting, the concavity assumption is quite crucial as shown in the following simple remark [13]: Let f :[0, ∞) → [0, ∞) be continuous with f(0) = 0.If Tr f(A + B) ≤ Tr (f(A)+f(B)) holds for all two-by-two positive matrices A, B,thenf is concave. To prove this statement, take for s, t > 0, √ √ 1 s st 1 s − st A = √ ,B= √ 2 st t 2 − st t and observe that the trace inequality means that f is concave. Theorem 1.1 closed a list of papers of several authors, including Ando-Zhan [1] and Kosem [10]. However, it remained natural to ask whether this result could be extended to the set of all Hermitian, or even all normal, operators. We noticed a partial answer in [6]: Theorem 1.2. Let A, B ≥ 0 and let f :[0, ∞) → [0, ∞) be concave and e-convex. Then, for all symmetric norms,  f(|A + B|) ≤f(|A|)+f(|B|) . Here the e-convexity property of f means that f(et)isconvexon(−∞, ∞). In particular, the theorem holds for the power functions f(t)=tp,1≥ p ≥ 0. This result for normal operators entails several estimates for block matrices. A special case involving an operator partitioned into four normal blocks A, B, C, D of the same size is p AB ≤|A|p + |B|p + |C|p + |D|p  CD for all symmetric norms and 0 ≤ p ≤ 1. These estimates, comparing an operator on H⊕H with a related operator on H, differ from the usual ones in the literature where the norm of the full matrix is evaluated with the norms of its blocks, for instance, [9] and [3]. In the subsequent sections we solve the conjectures in [6] by showing that the assumption of e-convexity is not necessary in Theorem 1.2, in its application to block matrices and in some related inequalities. The proof of Theorem 1.2 given in [7] reduced to the positive case by using the fact that for any normal A, B and any non-negative e-convex functions f(t), we have  f(|A + B|) ≤f(|A| + |B|)  for all symmetric norms. This is no longer true if the e-convexity assumption is dropped. In fact, one can easily find two-by-two positive semi-definite matrices A, B and a non-negative concave function f(t)on[0, ∞) such that  f(|A − B|)  >  f(|A| + |B|) 

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use A MATRIX SUBADDITIVITY INEQUALITY 497

for all symmetric norms which are√ not a multiple scalar of the usual operator norm. For instance, take f(t)=min{t, 2/2} and 10 1/21/2 (2) A = ,B= . 00 1/21/2 The main point of the forthcoming proof is to overcome this difficulty. This proof can be adapted in order to obtain a version for normal operators of the following companion result to Theorem 1.1: Theorem 1.3. Let A ≥ 0 and let Z be expansive. If f :[0, ∞) → [0, ∞) is concave, then, for all symmetric norms,  f(Z∗AZ) ≤Z∗f(A)Z .

2. Subadditivity results for normal operators We have the following norm inequalities: Theorem 2.1. Let A, B be normal and let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms,  f(|A + B|) ≤f(|A|)+f(|B|) . Corollary 2.2. Let Z = A + iB be a decomposition into real and imaginary parts, and let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms,  f(|Z|) ≤f(|A|)+f(|B|)  . This is a matrix version of the obvious inequality f(|z|) ≤ f(|a|)+f(|b|)for complex numbers z = a + ib. Since non-negative concave functions on [0, ∞)are non-decreasing, we actually have f(|z|) ≤ f(|a| + |b|) ≤ f(|a|)+f(|b|). But the left inequality cannot be extended to matrices. Indeed it is easy to find two-by-two matrices Z = A + iB (a simple example is given with A, B defined in (2)) with the eigenvalue relation

λ2(|A| + |B|) <λ2(|Z|) <λ1(|Z|) <λ1(|A| + |B|).

Thus, there are non-negative concave functions such as f(t)=min{t, λ2(|Z|)} such that  f(|Z|)  >  f(|A| + |B|)  for all symmetric norms which are not a multiple scalar of the usual operator norm. Let A, B be general operators. Applying Theorem 2.1 to the Hermitian operators 0 A∗ 0 B∗ and A 0 B 0

we obtain | | | | | | f( A + B )0 ≤ f( A )+f( B )0 0 f(|A∗ + B∗|) 0 f(|A∗|)+f(|B∗|) so that letting B = A∗ yields: Corollary 2.3. If f :[0, ∞) → [0, ∞) is concave, then, for all Z and all symmetric norms,  f(|Z + Z∗|) ≤f(|Z|)+f(|Z∗|) .

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use 498 JEAN-CHRISTOPHE BOURIN

Note that equality occurs in Corollary 2.3 whenever f(0) = 0 and 00 Z = , X 0 where X is arbitrary. Note also that it may happen that  f(|Z + Z∗|)  >  f(|Z| + |Z∗|)  for some concave functions and some symmetric norms, for instance when ⎛ ⎞ 010 Z = ⎝001⎠ 000 and the norm is the sum of the two largest singular values. At the end of this section, we will see some application of Theorem 2.1 to parti- tioned operators. Now, we turn to the proof of Theorem 2.1. We start by recalling the Ky Fan Principle. The Ky Fan k-norms of A, k =1, 2,...,n, are defined as the sum of its k largest singular values, k A(k) = λj (|A|). j=1

Let A, B be such that A(k) ≤B(k) for all k =1, 2,...,n. Then, the vector of the singular values of A lies in the convex hull of the permuted singular values of B multiplied by ±1,

(λ1(|A|),...,λn(|A|)) ∈ convσ(±λσ(1)(|B|),...,±λσ(n)(|B|)). This can be proved by using the hyperplane separation process to reach a con- tradiction; see [12] for details and [2] for alternative proofs. From this convexity statement follows a useful fact:

Ky Fan principle. Suppose that A(k) ≤B(k) for all Ky Fan k-norms. Then, we have A≤B for all symmetric norms. We also need two elementary, well-known lemmas. For A, B ≥ 0itissometimes convenient to write A ≺w B to mean that A≤B for all symmetric norms.

Lemma 1. Let A, B, X, Y ≥ 0 such that B ≺w Y and A ≺w X.Then, A 0 X 0 ≺ . 0 B w 0 Y Proof. We have ⎧ ⎫ k ⎨s t ⎬ λj (A ⊕ B)= max λj (A)+ λj (B) . s+t=k ⎩ ⎭ j=1 j=1 j=1 Combining this with s t s t k λj (A)+ λj(B) ≤ λj (X)+ λj (Y ) ≤ λj (X ⊕ Y ) j=1 j=1 j=1 j=1 j=1 ends the proof. 

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use A MATRIX SUBADDITIVITY INEQUALITY 499

Lemma 2. Let A, B ≥ 0.Then, A 0 A + B 0 ≺ . 0 B w 00 Proof. Note that A + B 0 A1/2 B1/2 A1/2 0 = 00 00B1/2 0 so that A + B 0 AA1/2B1/2 A −A1/2B1/2 , 00 B1/2A1/2 B −B1/2A1/2 B where means unitarily congruent. Combining with A 0 1 AA1/2B1/2 1 A −A1/2B1/2 = + 0 B 2 B1/2A1/2 B 2 −B1/2A1/2 B gives the lemma.  Proof of Theorem 2.1. It suffices to prove the result when A and B are Hermitian. The general case then follows by replacing A, B by 0 A 0 A A˜ = , B˜ = A∗ 0 A∗ 0 and by using normality of A and B. Therefore assume that A, B are Hermitian with decomposition into positive and negative parts,

A = A+ − A− and B = B+ − B−. Let g(t)=f(t) − f(0) and note that, for each Ky Fan k-norm,

 f(|A + B|) (k) = kf(0) +  g(|A + B|) (k) and

 f(|A|)+f(|B|) (k) =2kf(0) +  g(|A|)+g(|B|) (k). Hence, it suffices to prove the result for g(t), or equivalently when f(0) = 0. This assumption implies that

(3) f(|A|)=f(A+)+f(A−)andf(|B|)=f(B+)+f(B−). Now, given two positive n-by-n matrices X and Y with direct sum X 0 X ⊕ Y = , 0 Y we have

(4) λj (|X − Y |) ≤ λj (X ⊕ Y ) for all j =1, 2,...,n. Indeed, for some subspace S⊂Hwe have

|X − Y | =(X − Y )+ +(X − Y )−

=(X − Y )S ⊕ (Y − X)S⊥

≤ XS ⊕ YS⊥ ; hence λj (|X − Y |) ≤ λj (XS ⊕ YS⊥ ) ≤ λj (X ⊕ Y )

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use 500 JEAN-CHRISTOPHE BOURIN

for all j =1, 2,...,n. In (4) replacing X by A+ + B+ and Y by A− + B− we then get λj (|A + B|) ≤ λj ((A+ + B+) ⊕ (A− + B−)) for all j =1, 2,...,n.Sincef is non-decreasing, it follows that

λj (f(|A + B|)) ≤ λj (f(A+ + B+)) ⊕ (f(A− + B−)) for all j =1, 2,...,n,sothat f(A+ + B+)0  f(|A + B|) ≤ 0 f(A− + B−) for all symmetric norms. By Theorem 1.1 combined with Lemma 1, followed by an application of Lemma 2, we then obtain

 f(|A + B|) ≤f(A+)+f(B+)+f(A−)+f(B−) , and making use of relations (3) ends the proof.  Let us now give some applications for block matrices. The most obvious one is for a Hermitian matrix AB B∗ C partitioned into four blocks of the same size. Then by using Theorem 2.1 for the decomposition into two Hermitian matrices, AB A 0 0 B = + , B∗ C 0 C B∗ 0 and then using Lemma 1, we have AB ∗ (5) f ≤f(|A|)+f(|B|)+f(|B |)+f(|C|)  B∗ C for all concave functions f :[0, ∞) → [0, ∞) and all symmetric norms. To obtain similar statements for more general partitions, note that the proof of { }m Theorem 2.1 is valid for any finite family of normal operators. Thus: Let Ai i=1 be normal and let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms,

 f(|A1 + ···+ Am|) ≤f(|A1|)+···+ f(|Am|) . We may then obtain results for some matrices partitioned into m2 blocks of the same size.

Corollary 2.4. Let A =[Ai, j] be a block matrix with normal entries and let f be a non-negative concave function on [0, ∞). Then, for all symmetric norms,  f(|A|) ≤ f(|Ai, j|) .

Proof. We prove this corollary via Theorem 2.1 for a partition into four blocks SR A = . TQ The proof for a partition into m2 blocks is similar by using the version of Theo- rem 2.1 for m operators. Let 0 A A˜ = A∗ 0

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use A MATRIX SUBADDITIVITY INEQUALITY 501

and note that |A∗| 0 |A˜| = 0 |A| so that |A| 0 (6) |A˜| , 0 |A| where the symbol stands for unitarily equivalent. On the other hand, A˜ = S˜ + T˜, where ⎛ ⎞ ⎛ ⎞ 00S 0 000R ⎜ 000Q⎟ ⎜ 00T 0 ⎟ S˜ = ⎜ ⎟ , T˜ = ⎜ ⎟ ⎝S∗ 000⎠ ⎝ 0 T ∗ 00⎠ 0 Q∗ 00 R∗ 000 are Hermitian. Therefore, Theorem 2.1 yields f(|A˜|) ≤ f(|S˜|)+f(|T˜|) for all symmetric norms; that is, using the shorthand symbol ≺ , ⎛ w ⎞ f(|S∗|)+f(|R∗|)0 0 0 ⎜ 0 f(|T ∗|)+f(|Q∗|)0 0⎟ f(|A˜|)≺ ⎜ ⎟. w ⎝ 00f(|S|)+f(|T |)0⎠ 000f(|R|)+f(|Q|) Gathering the first two lines and the last two lines, we have via Lemmas 2 and 1, f(|S∗|)+f(|T ∗|)+f(|R∗|)+f(|Q∗|)0 f(|A˜|)≺ . w 0 f(|S|)+f(|T |)+f(|R|)+f(|Q|) By using (6) we then obtain, using normality of S, T, R, Q,

f(|A|) ≺w f(|S|)+f(|T |)+f(|R|)+f(|Q|), which is equivalent to inequalities for symmetric norms.  Let us point out a variation of Corollary 2.4 in which some operators are not necessarily normal. Corollary 2.5. Let T be a triangular block matrix, AN T = , 0 B in which N is normal. Let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms, f(|T|)≤f(|A∗|)+f(|N|)+f(|B|). Proof. Consider the polar decompositions A = |A∗|U and B = V |B|,notethat AN I 0 AN U ∗ 0 |A∗| N = 0 B 0 V ∗ 0 B 0 I 0 |B| and apply Theorem 2.1.  The assumption in Corollary 2.4 requiring normality of each block is rather special. The next corollary generalizes (5) and meets the simple requirement that the full matrix is Hermitian.

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use 502 JEAN-CHRISTOPHE BOURIN

Corollary 2.6. Let A =[Ai, j] be a Hermitian matrix partitioned into blocks of the same size and let f :[0, ∞) → [0, ∞) be concave. Then, for all symmetric norms,  f(|A|) ≤ f(|Ai, j|) .

Proof. The proof of Corollary 2.4 actually shows that for a general block matrix A =(Ai, j) partitioned into blocks of the same size, we have |A| | ∗ | f( )0 f( Ai, j )0 ≺w 0 f(|A|) 0 f(|Ai, j|) A ∗ for all non-negative concave functions f. Assuming Hermitian, we have Ai, j = Aj, i and Corollary 2.6 follows. 

3. Related results for expansive congruences Let A be normal and let Z be expansive, i.e., Z∗Z ≥ I. The following extension of Theorem 1.3 holds. Theorem 3.1. Let A be normal and let Z be expansive. If f :[0, ∞) → [0, ∞) is concave, then, for all symmetric norms,  f(|Z∗AZ|) ≤Z∗f(|A|)Z . Indeed, we can derive Theorem 3.1 from Theorem 1.3 in a quite similar way to that in which we derive Theorem 2.1 from Theorem 1.1. The proof of Theorem 3.1 starts by noticing that we can assume that A is Hermitian. Then, using the decomposition into positive and negative parts,

A = A+ − A−, we have, as in the proof of Theorem 2.1, ∗ ∗ ∗ λj (|Z AZ|) ≤ λj (|Z A+Z|⊕|Z A−Z|) and we may proceed as previously. When we deal with the trace norm, the fact that f is positive on the whole half-line is not essential, as in the Rotfel’d inequality. Hence we have the following corollary, extending to normal operators a result from [4]. Corollary 3.2. Let A be normal and let Z be expansive. If f(t) is a concave function on the positive half-line with f(0) ≥ 0,then Tr f(|Z∗AZ|) ≤ Tr Z∗f(|A|)Z. Corollary 3.2 follows from Theorem 3.1 by approaching f(t) with g(t)+at for some scalar a and some non-negative concave function g(t). Theorems 2.1 and 3.1 can be combined in a unique statement, extending the main result in [7]: { }m { }m Theorem 3.3. Let Ai i=1 be normal, let Zi i=1 be expansive and let f be a non-negative concave function on [0, ∞). Then, for all symmetric norms, ∗ ≤ ∗ | | f Zi AiZi Zi f( Ai )Zi .

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use A MATRIX SUBADDITIVITY INEQUALITY 503

It would be elegant and interesting to state this theorem in the more general framework of positive linear maps Φ between matrix algebras. This leads to the problem of characterizing the positive linear maps Φ such that f(|Φ(N)|)≤Φ(f(|N|)) for all normal operators N, all non-negative concave functions and all symmetric norms. Some further questions are considered in [6]. For the sake of completeness, we mention that when f(t) is a non-negative convex function vanishing at 0, then the inequalities of Theorems 1.1-1.3 are reversed. For instance we have [7] { }m { }m Theorem 3.4. Let Ai i=1 be positive and let Zi i=1 be expansive. Then, for all symmetric norms and all p>1, p ∗ p ≤ ∗ Zi Ai Zi Zi AiZi .

If Zi = I for all i, it is a famous result of Ando-Zhan [1] and of Bhatia- p p ≤ Kittaneh [3] in the case of integer exponents. The very special case Tr(A1 + A2) p Tr(A1 +A2) is McCarthy’s inequality [13, p. 20]. Note that the positivity assump- tion in Theorem 3.4 cannot be replaced by a normality one. When we consider contractive congruences and positive operators, there exist several Jensen-type inequalities, not only for norms but also for eigenvalues (cf. [4], [5]). The proofs are much simpler than in the expansive case, where some unexpected counterexamples may occur (see discussion and counterexamples in [4], [5]). We give an example of such results: { }m { }m ∗ ≤ Let Ai i=1 be positive and Zi i=1 such that Zi Zi I.Iff is a monotone concave function on [0, ∞), f(0) ≥ 0,then ∗ ≥ ∗ ∗ f Zi AiZi V Zi f(Ai)Zi V for some unitary V .

References

[1] T. Ando and X. Zhan, Norm inequalities related to operator monotone functions, Math. Ann. 315 (1999) 771-780. MR1727183 (2000m:47008) [2] R. Bhatia, Matrix Analysis, Grad. Texts in Math., 169, Springer-Verlag, New York, 1997. MR1477662 (98i:15003) [3] R. Bhatia and F. Kittaneh, Norm inequalities for partitioned operators and an application, Math. Ann. 287 (1990) 719-726. MR1066826 (91f:47010) [4] J.-C. Bourin, Convexity or concavity inequalities for Hermitian operators, Math. Ineq. Appl. 7 (4) (2004) 607-620. MR2097515 (2005h:47034) [5] J.-C. Bourin, A concavity inequality for symmetric norms, Linear Alg. Appl. 413 (2006) 212-217. MR2202104 (2007d:47023) [6] J.-C. Bourin, Matrix subadditivity inequalities and block-matrices, Internat. J. Math. 20, no. 6 (2009), 679-691. [7] J.-C. Bourin and E.-Y. Lee, Concave functions of positive operators, sums and congruences, to appear in J. Operator Theory. [8] J.-C. Bourin and M. Uchiyama, A matrix subadditivity inequality for f(A + B)andf(A)+ f(B), Linear Alg. Appl. 423 (2007) 512-518. MR2312422 (2007m:15023) [9] C. King, M. Nathanson, New trace norm inequalities for 2 × 2 blocks of diagonal matrices, Linear Alg. Appl. 389 (2004) 77-93. MR2080396 (2005h:15058) [10] T. Kosem, Inequalities between f(A + B) and f(A)+f(B), Linear Alg. Appl. 418 (2006) 153-160. MR2257586 (2007g:47026) [11] S. Ju. Rotfel’d, The singular values of the sum of completely continuous operators, Problems of Mathematical Physics, Vol. 3, Consultants Bureau, 1969, 73-78. MR0339968

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use 504 JEAN-CHRISTOPHE BOURIN

[12] B. Simon, Trace ideals and their applications, Cambridge Univ. Press, Cambridge-New York, 1979. MR541149 (80k:47048) [13] O. Tychonov, Subadditivity inequalities in von Neumann algebras and characterization of tracial functionals, Positivity 9 (2005) 259-264. MR2189748 (2006f:46056)

Laboratoire de Mathematiques,´ Universite´ de Franche-Comte,´ 25030 Besanc¸on, France E-mail address: [email protected]

License or copyright restrictions may apply to redistribution; see https://www.ams.org/journal-terms-of-use