On Symplectic Eigenvalues of Positive Definite Matrices
arXiv:1803.04647v1 [math-ph] 13 Mar 2018 lci arcs i.e., matrices; plectic Let Introduction 1 mean. Riemannian majorisation, theorem, Williamson’s values, : Keywords : Classifications Subject AMS ossigo oiiedfiiemtie,and matrices, definite positive of consisting nsmlci ievle fpstv definite positive of eigenvalues symplectic On M ssasmlci matrix symplectic a ists D A oevrainlpicpe,adsm nqaiisbetwee eigenvalues. inequalities ordinary some and plectic and principles, variational some a nre,wihaecle h ypetcegnausof eigenvalues symplectic the called are which entries, nal es mn hmaerltosbtentesmlci eigen symplectic the between the relations about are of them inequalities Among fundamental several bers. derive we paper (2 1 A diag( = A , . . . , n If etesaeo 2 of space the be ) n hs of those and 1 2 A ninSaitclIsiue e eh 106 India 110016, Delhi New Institute, Statistical Indian ninSaitclIsiue e eh 106 India 110016, Delhi New Institute, Statistical Indian sa2 a is m ypetcmti,pstv ent arx ypetceigen- symplectic matrix, definite positive matrix, Symplectic d aedaBhatia Rajendra 1 n fterReana en etraintheorem, perturbation a mean, Riemannian their of and ( A Sp ) n d , . . . , (2 A × n t , = ) 2 ewe h ypetcegnausof eigenvalues symplectic the between n n ( elpstv ent arx hnteeex- there then matrix, definite positive real n A M M [email protected] )i ignlmti ihpstv diago- positive with matrix diagonal a is )) × matrices [email protected] 2 ∈ uhthat such Abstract n M 59,8P5 81S10. 81P45, 15A90, elmatrices, real (2 1 n 1 av Jain Tanvi , : ) M M Sp T T AM JM (2 P n (2 h ru fra sym- real of group the ) = = n J h ustof subset the )
O D D O 2 . h sym- the n m A. matrices enum- se nthis In values where M (2 n ) O I Here J = , and J itself is a symplectic matrix. IO If A is an− element of P(2n), then there exists a symplectic matrix M such that DO M T AM = , (1) OD where D is a diagonal matrix with positive entries
d (A) d (A) d (A). (2) 1 ≤ 2 ≤···≤ n This is often called Williamson’s theorem [1], [7], [17]. In [11] it is pointed out that this was known to Weierstrass. The numbers dj(A) are uniquely determined by A and characterise the orbits of P(2n) under the action of the group Sp(2n). We call them the symplectic eigenvalues of A. They play an important role in classical Hamiltonian dynamics [1], in quantum mechanics [2], in symplectic topology [11], and in the more recent subject of quantum information; see e.g., [8], [10], [13], [17]. The goal of this paper is to present some fundamental inequalities for symplectic eigenvalues. It is clear from the definition that if the symplectic eigenvalues of A are enumerated as in (2), then those of A−1 are 1 1 1 . (3) dn(A) ≤ dn−1(A) ≤···≤ d1(A)
No relation between the symplectic eigenvalues of A and those of At is readily apparent. Our first theorem unveils such relationships. Given x Rm, we denote by x↓ = x↓,...,x↓ the vector whose coordi- ∈ + 1 m ↓ ↓ nates are the coordinates of x rearranged in decreasing order x1 xm. If x and y are two m-vectors with positive coordinates, then we≥···≥ say that x is log majorised by y, in symbols x y, if ≺log k k x↓ y↓, 1 k m (4) j ≤ j ≤ ≤ j=1 j=1 Y Y and m m ↓ ↓ xj = yj . (5) j=1 j=1 Y Y 2 By classical theorems of Weyl and Polya, log majorisation implies the usual weak majorisation relation x y characterised by the inequalities ≺w k k x↓ y↓, 1 k m. (6) j ≤ j ≤ ≤ j=1 j=1 X X See Chapter II of [3]. It is convenient to introduce a 2n-vector d(A) whose coordinates are
d1(A) d2(A) bd2n(A), (7) ≥ ≥···≥ which are the symplecticb eigenvaluesb of A, eachb counted twice and rearranged in decreasing order. (Thus d1(A)= d2(A)= dn(A) and d2n−1(A)= d2n(A)= d1(A).) With these notations we have the following. b b b b Theorem 1. Let A be any element of P(2n). Then
t d(At) d (A) for 0 t 1, (8) ≺log ≤ ≤ and bt b d (A) d(At) for 1 t< . (9) ≺log ≤ ∞ Corollary 2. The symplecticb eigenvaluesb of A have the properties: (i) If 0 t 1, then for all 1 k n ≤ ≤ ≤ ≤ k k d (At) dt (A). (10) j ≥ j j=1 j=1 Y Y (ii) If t 1, then for all 1 k n ≥ ≤ ≤ k k d (At) dt (A). (11) j ≤ j j=1 j=1 Y Y Given two n n positive definite matrices A and B, their geometric mean G(A, B), also denoted× as A#B, is defined as
G(A, B)= A#B = A1/2(A−1/2BA−1/2)1/2A1/2. (12)
3 This was introduced by Pusz and Woronowicz [18], and has been much stud- ied in connection with problems in physics, electrical networks, and matrix analysis. Recently there has been renewed interest in it because of its inter- pretation as the midpoint of the geodesic joining A and B in the Riemannian manifold P(n). The Riemannian distance between A and B is defined as
n 1/2 2 −1 δ(A, B)= log λi (A B) , (13) i=1 ! X where λi(X), 1 i n, are the eigenvalues of X. With this metric P(n) is a nonpositively curved≤ ≤ space. Any two points A and B in P(n) can be joined by a unique geodesic. A natural parametrisation for this geodesic is
t A# B = A1/2 A−1/2BA−1/2 A1/2, 0 t 1. (14) t ≤ ≤ G(A, B) is evidently the midpoint of this geodesic. Our next theorem links the symplectic eigenvalues of A#tB with those of A and B. Theorem 3. Let A, B be any two elements of P(2n). Then for 0 t 1, ≤ ≤ 1−t t d (A# B) d (A)d (B). (15) t ≺log In particular b b b 1/2 d(G(A, B)) d(A)d(B) . (16) ≺log Next let A1, A2,...,Ab m be m pointsb in P(bn). Their geomeric mean, vari- ously called the Riemannian mean, the Cartan mean, the Karcher mean, the Riemannian barycentre, is defined as
m 1 G(A ,...,A ) = argmin δ2(A ,X). (17) 1 m m j j=1 X This object of classical differential geometry has received much attention from operator theorists and matrix analysts in the past ten years, and many new properties of it have been established. It has also found applications in diverse areas such as statistics, machine learning, image processing, brain- computer interface, etc. We refer the reader to [4] for a basic introduction to this area and to [5] for an update.
4 A little more generally, a weighted geometric mean of A1,...,Am can be defined as follows. Given positive numbers w1,...,wm with wj =1, let m P 2 G(w, A1,...,Am) = argmin wjδ (Aj,X). (18) j=1 X 1 The object in (17) is the special case when wj = m for all j. In case m =2, we have (w ,w )=(1 t, t) for some 0 t 1, and then G(w, A, B) reduces 1 2 − ≤ ≤ to the matrix in (14). With t =1/2 it reduces further to (12). Our next theorem is a several-variables version of Theorem 3.
Theorem 4. Let A1,...,Am be elements of P(2n) and let w =(w1,...,wm) be a positive vector with wj =1. Then
m P wj d(G(w, A ,...,A )) d (A ). (19) 1 m ≺log j j=1 Y b b In particular m 1/m d(G(A ,...,A )) d(A ) (20) 1 m ≺log j j=1 ! Y b b In the study of eigenvalues of Hermitian matrices, a very important role is played by variational principles, such as the Courant-Fischer-Weyl minmax principle, Cauchy’s interlacing theorem and Ky Fan’s theorems on extremal characterisations of sums and products of eigenvalues. It will be valuable to assemble a similar arsenal of techniques for symplectic eigenvalues. In Section 4 of this paper we give an exposition of some of these ideas. We provide an outline of proofs of a minmax principle and an interlacing theorem (both of which are known results). Then we use this to provide a unified simple proof of the following theorem. To emphasize the dependence on n we use the O I notation J for the 2n 2n matrix . The minimum in Theorem 2n × IO − 5 below is taken over 2n 2k matrices M satisfying M T J M = J . × 2n 2k Theorem 5. Let A P(2n). Then for all 1 k n ∈ ≤ ≤ k T (i) 2 dj(A) = min tr M AM, (21) T M:M J2nM=J2 j=1 k X 5 k (ii) d2(A) = min det M T AM. (22) j T M:M J2nM=J2 j=1 k Y Part (i) of this theorem has been proved by Hiroshima [10], and was an inspiration for our work. Our proof might be simpler and more conceptual. An interesting property of symplectic matrices crops up as a byproduct of our analysis. Every element M of Sp(2n) has a block decomposition A B M = , (23) C G in which A,B,C,G are n n matrices satisfying the conditions × AGT BCT = I, ABT BAT =0, CGT GCT =0. (24) − − − We associate with M an n n matrix M whose entries are given by × 1 m = a2 + b2 + c2 + g2 . (25) ij 2 ij ijf ij ij This matrix has some nice properties and can be put to good use in the study of symplectic matrices.e In the course of our proof of Theorem 5 we will see that for every M Sp(2n) the matrix M has the properties ∈ n m 1, 1 f i n, and ij ≥ ≤ ≤ j=1 Xn e m 1, 1 j n. (26) ij ≥ ≤ ≤ i=1 X It turns out that more is true.e An n n matrix A is said to be doubly stochastic if a 0 for all i, j, × ij ≥ n a = 1 for all 1 i n ij ≤ ≤ j=1 X and n a = 1 for all 1 j n. ij ≤ ≤ i=1 X A matrix B with nonnegative entries is called doubly superstochastic if there exists a doubly stochastic matrix A such that b a for all i, j. Our next ij ≥ ij theorem shows that M is a doubly superstochastic matrix.
f 6 Theorem 6. Let M Sp(2n), and let M be the n n matrix associated with ∈ × M according to the rule (25). Then M is doubly superstochastic. Further M is doubly stochastic if and only if M isf orthogonal. f f Doubly stochastic, superstochastic and substochastic matrices play an important role in the theory of inequalities; see the monograph [16]. Theo- rem 6 is thus likely to be very useful in deriving inequalities for symplectic matrices. For the usual eigenvalues of Hermitian matrices there are several pertur- bation bounds available. See [3]. Our next theorem gives such inequalities for symplectic eigenvalues. The continuity implied by these bounds will be used in our proofs of Theorems 1, 3, 4. But they are of independent interest. We use the symbol to denote any unitarily invariant norm on the |||·||| space of matrices [3]. Particular examples are the operator norm
T 1/2 A = λ1(A A) = sup Ax , (27) k k kxk=1 k k and the Frobenius norm
1/2 1/2 A = tr AT A = a 2 . (28) k k2 | ij| X Here λ1 stands for the maximum eigenvalue. Theorem 7. Let A, B be two elements of P(2n), and let D(A), D(B) be the diagonal matrices whose diagonals are d(A) and d(B). Then for every unitarily invariant norm we have b b b b D(A) D(B) A 1/2 + B 1/2 A B 1/2 . (29) − ≤ k k k k | − |