Entropy methods for CMB analysis of anisotropy and non-Gaussianity
Momchil Minkov,1, ∗ Marvin Pinkwart,2, † and Peter Schupp2, ‡ 1Department of Electrical Engineering, Stanford University, 350 Serra Mall, Stanford, CA 94305-9505, USA 2Department of Physics and Earth Sciences, Jacobs University Bremen, Campus Ring 1, 28759 Bremen, Germany (Dated: June 8, 2021) In recent years, high-resolution cosmic microwave background (CMB) measurements have opened up the possibility to explore statistical features of the temperature fluctuations down to very small angular scales. One method that has been used is the Wehrl entropy, which is, however, extremely costly in terms of computational time. Here, we propose several different pseudoentropy measures (projection, angular, and quadratic) that agree well with the Wehrl entropy, but are significantly faster to compute. All of the presented alternatives are rotationally invariant measures of entangle- ment after identifying each multipole l of temperature fluctuations with a spin-l quantum state and are very sensitive to non-Gaussianity, anisotropy, and statistical dependence of spherical harmonic coefficients in the data. We provide a simple proof that the projection pseudoentropy converges to the Wehrl entropy with increasing dimensionality of the ancilla projection space. Furthermore, for l = 2, we show that both the Wehrl entropy and the angular pseudoentropy can be expressed as one-dimensional functions of the squared chordal distance of multipole vectors, giving a tight connection between the two measures. We also show that the angular pseudoentropy can clearly distinguish between Gaussian and non-Gaussian temperature fluctuations at large multipoles and henceforth provides a non-brute-force method for identifying non-Gaussianities. This allows us to study possible hints of statistical anisotropy and non-Gaussianity in the CMB up to multipole l = 1000 using Planck 2015, Planck 2018, and WMAP 7-yr full sky data. We find that l = 5 and l = 28 have a large entropy at 2–3σ significance and a slight hint towards a connection of this with the cosmic dipole. On a wider range of large angular scales we do not find indications of violation of isotropy or Gaussianity. We also find a small-scale range, l ∈ [895, 905], that is incompatible with the assumptions at about 3σ level, although how much this significance can be reduced by taking into account the selection effect, i.e.,, how likely it is to find ranges of a certain size with the observed features, and inhomogeneous noise is left as an open question. Furthermore, we find overall similar results in our analysis of the 2015 and the 2018 data. Finally, we also demonstrate how a range of angular momenta can be studied with the range angular pseudoentropy, which measures averages and correlations of different multipoles. Our main purpose in this work is to introduce the methods, analyze their mathematical background, and demonstrate their usage for providing researchers in this field with an additional tool. We believe that the formalism developed here can underpin future studies of the Gaussianity and isotropy of the CMB and help to identify deviations, especially at small angular scales. Keywords: CMB – data analysis – coherent states – pseudoentropy
I. INTRODUCTION of order 10−3 K, which is assumed to be of pure kine- matic origin. The anisotropy of the CMB temperature Since its discovery in 1964 by Penzias and Wilson [1], has been measured first by the Cosmic Background Ex- the cosmic microwave background (CMB) has served as plorer satellite from 1989 to 1993, followed up by the the main source of information about the current and Wilkinson Microwave Anisotropy Probe (WMAP) mis- past Universe. Originating in the process of recombina- sion from 2001 to 2010. The most recent CMB investiga- tion at about 380 000 years after the Big Bang at red- tion satellite, Planck, was launched in 2009 and shutdown shift of about 1100, the CMB temperature distribution in 2013. Its 2015 results from the second data release pro- arXiv:1809.08779v2 [astro-ph.CO] 3 May 2019 on the celestial sphere displays the energy density distri- vide the most precise values of cosmological parameters bution on the Last Scattering Surface when the Universe [3, 4] measured up to this point. Recently, they have became transparent to electromagnetic radiation. The been refined in the 2018 data release [5, 6]. CMB intensity follows a nearly perfect Planckian distri- It is commonly assumed that the tiny temperature fluc- bution and its average temperature has been measured tuations follow a Gaussian and statistically isotropic dis- to be T0 = 2.72548±0.00057 K[2] with minor fluctuations −5 −4 tribution. This assumption has been confirmed by the of order 10 –10 K and one larger dipole modulation Planck mission to a large extent[7, 8], but nevertheless the search for possible non-Gaussianities[9] and statisti- cal anisotropies [10–19] has been rich and certain anoma- ∗ [email protected] lies have been found, as, for example, unusual (anti- † [email protected] )correlation of the lowest multipoles with the Cosmic ‡ [email protected] Dipole as well as with each other, a sign of parity asym- 2 metry and a lack of large-angle correlation (see e.g. the it to the statistics used before. Eventually, in Sec. V we review[20]). summarize and discuss our findings. Common tools in these analyses are multipole vectors (MPVs) which were introduced for cosmological data analysis in [21] and whose properties have been elab- II. CMB STATISTICS orated in [11, 22–25]. For the most recent results on possible CMB anomalies using multipole vectors and an As a function on S2 the CMB temperature fluctuations overview over the mathematical approaches see [26–28]. ∆T := δT/T0 can be decomposed uniquely according to MPVs are closely related to Bloch coherent states (see irreducible representations of SO(3), i.e.,, into spherical [11]) which were also used in the past to prove special harmonics cases of Lieb’s conjecture[29] for the Wehrl entropy. ∞ l In this work, we develop and compare several rota- X X tionally invariant measures of randomness on functions ∆T (θ, φ) = almYlm(θ, φ) ∈ R, (1) on the two-sphere, namely, the angular, projection, and l=1 m=−l quadratic pseudoentropies. We show that for l = 2 the where the l = 0-summand is omitted because the fluctua- Wehrl and angular entropy can be expressed as a func- tions average to zero, and θ, φ denote the usual spherical tion of the squared chordal distance of MPVs. We find coordinates. The fact that ∆T is real together with the that all these measures except the quadratic one show property Y = (−1)mY ∗ impose the constraints the same features, making the quadratic pseudoentropy l,−m lm the least preferred measure. Because of the shared fea- a = (−1)ma∗ (2) tures, we then restrict ourselves to the numerically fastest l,−m lm method, and use it to analyze Planck 2015 and 2018 full on the spherical harmonic coefficients, leaving for each sky as well as WMAP 7-year Internal Linear Combina- multipole number l exactly 2l+1 real d.o.f. The multipole tion (ILC) maps. The angular pseudoentropy allows for number l corresponds to angular scales of ≈ 180/l deg. comparing the data to many ensembles of Gaussian and The orthonormality of {Y } allows to compute the co- isotropic random maps up to l = 1000 in short computing lm efficients from the temperature map via time. With a better theoretical understanding of confi- dence levels, also the Wehrl entropy could be used easily Z ∗ since the computing time for a single map is still rea- alm = dΩ ∆T (Ω)Ylm(Ω). (3) sonable. In general, it is especially nice to have a single S2 number for each multipole even in the case that the data Simple inflationary models together with linear pertur- would not be Gaussian and isotropic. In this case, the bation theory predict nearly Gaussian temperature fluc- CMB would be described by more than one degree of free- tuations and a further common assumption is statistical dom (d.o.f.) per multipole. Non-Gaussian distributions isotropy. The spherical harmonic coefficients inherit both need higher correlation functions and anisotropic distri- properties from the temperature map, meaning that butions yield an m-dependent two-point function. In the tradition of thermodynamics, with these pseudoentropies 1 − 1 ~a†D ~a p(~a ) = e 2 l l l (Gaussianity), (4) one can approximately reduce a possibly large set of data l N again to one number for each multipole. Since all con- sidered types of entropies show a similar behavior the where p denotes the joint probability distribution, ~al = T l −1 ∗ information does not depend on the definition of the en- (al0, . . . , all) , Cmn = Dl mn = halmalni and N de- tropy. Eventually there exists also an extension of the notes a normalization constant, and angular entropy to ranges and collections of multipoles, 2 which we call range angular entropy. ∀R ∈ SO(3),~e1, . . . ,~en ∈ S , n ∈ N : This paper is organized as follows: In Sec. II, we briefly Gn({R~ei}) = Gn({~ei}) (isotropy), (5) recapitulate the basic ingredients of CMB spherical har- Qn monic statistics. Afterwards, in Sec. III, we introduce where Gn({~ei}) = h i=1 ∆T (~ei)i denotes the n-point our methods mathematically, clarify their properties, and function of temperature fluctuations. The isotropy con- show the connection to multipole vectors. We also pro- dition is equivalent to rotationally invariance of the joint vide a simple proof of the convergence of the projection alm-probability distribution. The averaging h.i is meant entropy to the Wehrl entropy up to a term which is in- to be performed over all possible universes, which of dependent of the input density matrix. Section IV is course is not possible, wherefore we are left with a natu- dedicated to the application of our methods to real data. ral and inevitable variance in all quantities, called cosmic We compare the different pseudoentropy methods, then variance. If we impose both Gaussianity and isotropy we apply the angular pseudoentropy to 2015 Planck and then the two-point correlation of spherical harmonic co- 7-year WMAP full sky foreground-cleaned maps before efficients is diagonal comparing the 2015 results to those obtained with 2018 l data and also applying the range entropy and comparing Cmn = Clδmn. (6) 3
In practice, the power spectrum Cl is calculated using where ρmixed is obtained from the pure state ρ by ap- the unbiased estimator plying a rotationally symmetric quantum channel Φ, i.e., a completely positive map between Hilbert spaces with 1 X Cˆ = |a2 | , hCˆ i = C , (7) possibly different dimensions, or by computing its lower l 2l + 1 lm l l m symbol, i.e., its expectation value in spin coherent states. The latter choice leads to the Wehrl entropy [29–31] with cosmological variance Z dΩ 2 2 ˆ 2 2 SW = −(2l + 1) |hΩ|ψi| ln |hΩ|ψi| , (10) var(Cl) = C . (8) 4π 2l + 1 l where |Ωi is a spin-l coherent state. The Wehrl entropy was first proposed in [32] as a useful tool for CMB anal- III. PSEUDOENTROPIES AND THEIR PROPERTIES ysis. See Fig. 9d for a showing the Wehrl entropy for CMB data. Closely related is the “quadratic entropy” that is obtained by replacing −x ln x in the formula for In this section, we shall discuss the mathematical prop- the Wehrl entropy by the concave function x(1 − x), erties and interrelation of various macroscopic entropy measures that can be used as powerful tools to analyze 2l + 1 S = 1 − |P |ψ ⊗ ψi|2 , (11) Gaussianity and isotropy of the CMB and can also be quad 4l + 1 2l useful in other contexts. This section contains a review of the mathematical background as well as new defini- where P2l is the projector onto the spin-2l part, i.e., the tions, results and insights. The motivation to look for highest spin component of the tensor product. Other ex- macroscopic entropy measures is the same as in statistical amples using the choice −x ln x are what we call angular physics: A microscopic description of a physical system, entropy e.g., the positions and momenta of a fluid, is useful for " 3 # simulation purposes, but not when comparing to a real X Li|ψihψ|Li S = Tr φ (12) fluid. Instead, one would resort to the study of macro- ang l(l + 1) scopic quantities and parameters like internal energy, i=1 temperature, entropy, pressure etc. that are well-defined with φ(x) := − x ln x , (13) because of symmetries. In the analysis of the CMB, the where the Li are angular momentum generators in the alm coefficients are an analog of the microscopic quanti- spin-l representation, and j-projection entropy ties. For low l these and derived quantities like multipole vectors can be studied individually, but for high l this 2l + 1 S(j) = Tr φ P |ψihψ| ⊗ 1P quickly becomes impractical: The Planck mission data proj 2(l + j) + 1 l+j l+j easily comprises several million reliable data points. For (14) the CMB the obvious underlying spacetime symmetry is where 1 is the unit operator on a spin-j ancilla [j] and φ rotation invariance. The representations of the rotation is as in Eq. (13). An overview of these entropies applied group decompose into irreducible components labeled by to CMB data can be found in Fig. 9. In the following the angular momentum quantum number l (multipole ex- section we shall explain the mathematics in detail, de- pansion) and we can focus on fixed-l subspaces. A loose rive relations between the various entropies and point out analog of internal energy is the angular power spectrum, interesting side results including a fast way to compute i.e., the Cl coefficients [see Eqs. (6) and (7)]. They have multipole vectors. Readers that are mostly interested in proven immensely useful in the analysis of the CMB and results and numerics can skip to the algorithm (50)-(54) its cosmological implications, but when it comes to ques- at the end of Sec. III A. tions of isotropy and preferred directions, individual m matter and an analog of entropy would be useful. A natural idea is to consider the abstract quantum state A. Coherent states, multipole vectors and entropy P |ψi := alm|l, mi that can be formally computed from the alm and associate an entropy S to it. We will usu- Coherent states were originally introduced by ally focus on one l at a time and normalize the states by Schr¨odinger[33] and are well known in the context of rescaling the alm appropriately. Since the states are by the quantum harmonic oscillator, where they can be construction pure, the von Neumann entropy will be triv- defined either as eigenstates of the lowering operator ially zero, but there are also non-trivial pseudoentropies or, equivalently, as elements of the orbit of the ground that can distinguish pure states and turn out to be sen- state under the Heisenberg group. Perelomov [34] has sitive to non-Gaussianity and anisotropy. The general generalized the latter notion to orbits of a fiducial vector strategy is as follows: in some representation of a Lie group under the action of X that group. The choice of the fiducial vector is essential T (θ, φ) → alm → |ψi := alm|l, mi (9) for the properties of the resulting coherent states. Spin → ρ = |ψihψ| → ρmixed → S, coherent states – also called Bloch coherent states – in 4 a spin-l irreducible representation [l] ≡ C2l+1 of SU(2) with 2l + 1 ∈ N are defined as orbits of the highest weight vector |l, li. The stability group of that vector is U(1) and spin coherent states can thus be labeled by ∼ points Ω = (θ, φ) on the sphere S2 = SU(2)/U(1),
|Ωli = R(Ω)|l, li , (15) where R(Ω) denotes a rotation that takes the north pole to the point Ω and l labels the representation of SU(2). For the CMB data l will be an integer, but everything 1 we discuss here is also valid for half-integer l. For l = 2 this gives for example
−iφ/2 θ 1 1 +iφ/2 θ 1 1 |Ω 1 i = e cos | , i + e sin | , − i . (16) 2 2 2 2 2 2 2 Coherent states inherit nice properties from the under- lying fiducial vector. A particular important one is that the tensor product of coherent states is again a coherent FIG. 1: Dependence of the angular pseudoentropy state and lies in the highest spin component: Sang() (solid) and the Wehrl entropy SW() (dashed) on the squared chordal distance between multipole 1 |Ωli ⊗ |Ωji = |Ωl+ji (17) vectors on the sphere with radius r = 2 for l = 2. The maximum is obtained when the multipole vectors are Using this property repeatedly yields an explicit formula orthogonal to each other and the minimum is obtained for any spin from (16): when both multipole vectors are the same.
|Ωli = |Ω 1 i ⊗ ... ⊗ |Ω 1 i 2 2 l 1 X 2l 2 = e−imφ/2 cosl+m( θ ) sinl−m( θ ) |l, mi see [32]). In [29] this method was introduced to deter- l + m 2 2 m=−l mine explicit formulas for the Wehrl entropy and to prove (18) Lieb’s conjecture. Applying those explicit formulas to the case l = 2 with two pairs of anti-aligned multipole Interestingly, such a product representation in terms of vectors of length 1/2 gives the following formula for the 1 spin- 2 states exists for any state |ψi ∈ [l], but except Wehrl entropy as a function of the squared chordal dis- for coherent states, a projection onto the highest spin 2 α tance = sin ( 2 ) between the vectors, where α is the component and a renormalization are required [29]: angle between them:
l 32 X (1) (2l) SW() = c − ln c + − ln 6 , (21) |ψli = alm|l, mi = cPl |Ω 1 i⊗...⊗|Ω 1 i , (19) 15 2 2 m=−l where where P is the projector onto [l] and c is a normalization l 1 constant. The Ω(i) point into the direction of the 2l mul- c = c() := . (22) tipole vectors that characterize the state |ψi. Contract- 1 − (1 − ) ing (18) with (19) and using the stereographic projection For the angular entropy a similar computation gives to express points on the sphere in terms of complex num- bers z = eiφ cot( θ ), leads to a polynomial c c 2 S () = − (1 − )2 ln(1 − )2 + 2 ln 2−ln (23) ang 2 2 l 1 X 2l 2 zl+ma , (20) with c as above. Plots of the two functions look very l + m lm m=−l similar (see Fig. 1), confirming the observed similarities in behavior of the two entropy measures in the CMB whose n ≤ 2l zeros (roots) correspond to points on the analysis (see Fig. 9). The polynomial method provides sphere that are antipodal to n of the 2l multipole vec- a very fast and convenient way to determine multipole tors. The remaining 2l −n multipole vectors point to the vectors and has been used in [25, 32] and many other south pole of the sphere. For the CMB data l is an inte- publications to analyze the CMB. ∗ m ger, ∆T (Ω) is real and consequently alm = (−) al,−m. Spin coherent states are complete via Schur’s lemma This implies that the zeroes of the polynomial are lo- cated at pairs of antipodal points on the sphere and the Z dΩ (2l + 1) |Ω ihΩ | = P , (24) multipole vectors come in anti-aligned pairs (for details 4π l l l 5 where Pl is the projector onto [l]. They are normalized entropy, which is mathematically better behaved than hΩl|Ωli = 1 but not orthogonal the Boltzmann entropy in classical statistical mechan- ics. The Wehrl entropy is always larger than the von 0 2 4l 0 |hΩl|Ωli| = cos (^(Ω, Ω )) , (25) Neumann entropy and it is positive even for pure states. Furthermore its definition is rotationally symmetric and i.e., they form an overcomplete basis of [l]. In the l → ∞ it is hence perfectly suited for our purposes. The mini- 0 2 0 limit, (2l +1)|hΩl|Ωli| becomes a delta function δ(Ω, Ω ) mum of the Wehrl entropy is attained for coherent states. and in this limit the coherent states form an infinite- This fact is surprisingly difficult to prove. It was first dimensional orthonormal basis labeled by points on the shown for low spin in [29] and then finally in general in sphere. [35, 36]. Computational evidence suggested in fact an A striking property of coherent states is that the diag- analogous but much stronger conjecture for any concave onal matrix elements function φ(x) [29, 37], which has also been settled affir- matively in [36]. For our application, the maximal value A(Ω) = hΩl|A|Ωli (lower symbol) (26) of the Wehrl entropy is more interesting. The exact value is not known, in fact finding the maximizing pure state of an operator A on [l] already determine that operator is another hard problem, but a reasonable upper limit uniquely: Let C = A − B with an arbitrary operator B, can be obtained very simply from a totally mixed state: then C(Ω) = 0 for all Ω implies C = 0, i.e., A = B. S ≤ ln(2l + 1). To summarize: The Wehrl entropy is The proof uses analytic properties of the lower symbol. W the Shannon entropy of the probability density obtained The lower symbol is thus a faithful representation of an from the (faithful) lower symbol representation of a den- operator. Using Eq. (24), the trace of an operator A on sity matrix. It has all the right properties for our pur- [l] can be computed as an integral over its lower symbol poses. The only drawback is that its computation with Z dΩ Z dΩ suitable precision has a high computational complexity. Tr (A) = (2l+1) hΩ |A|Ω i = (2l+1) A(Ω) . We will now introduce and discuss several alternatives [l] 4π l l 4π (27) with similar properties, but better computability. Another interesting property is that any operator A on Instead of −x ln x, one can consider other concave func- [l] can be expanded diagonally in coherent states tions defined on the interval [0, 1]. For φ(x) = x(1 − x) we obtain the quadratic entropy Z dΩ A = (2l + 1) hA(Ω)|ΩlihΩl| , (28) 4π 2l + 1 S (ρ) = 1 − Tr (P (ρ ⊗ ρ)) , (31) quad 4l + 1 2l where hA(Ω) is called an upper symbol of A. These two properties are in fact closely related: Contract- ing Eq. (28) with an operator C gives Tr(C†A) ∝ where P2l is the projector onto spin 2l and we have used R ∗ dΩ hA(Ω) Ω (C), i.e., the operators that can be repre- the following trick [29]: sented by an upper symbol as in Eq. (28), are orthogonal to the operators that are in the kernel of the lower sym- (ρ(Ω))2 = hΩ |ρ|Ω i2 bol map. Hermitean operators have real lower and upper l l symbols. Positive semi definite operators and density ma- = hΩl ⊗ Ωl|ρ ⊗ ρ|Ωl ⊗ Ωli = hΩ2l|ρ ⊗ ρ|Ω2li trices have unique non-negative lower symbols, but the same is in general not true for upper symbols. Follow- and the trace formula (27) adapted to spin-2l. For a ing Wehrl, these properties suggest to interpret the lower pure state ρ = |ψihψ| we obtain formula (11). The symbol of a density matrix ρ, which is by definition posi- quadratic entropy has a large computational advantage tive semi definite and normalized, as a probability density as compared to the Wehrl entropy (see Fig. 8), but its and compute qualitative behavior is a bit different, as can be seen in Fig. 9f. This is not surprising, since many important Z dΩ Z dΩ (2l + 1) φ(ρ(Ω)) = (2l + 1) φ(c) . (29) properties of entropy like additivity and (strong) sub- 4π 4π additivity depend crucially on the choice of the function φ(x) = −x ln x. We shall hence not pursue quadratic en- With φ(x) = x we can verify the normalization and get tropy any further and will try to identify and construct Tr(ρ) = 1. With φ = −x ln x we compute the Shannon other alternatives of the Wehrl entropy that share its entropy of the probability density ρ(Ω), which is precisely characteristic features but are computationally more ac- the Wehrl entropy cessible. Let us start by reconsidering the main ingredi- Z dΩ ent of Wehrl entropy. S (ρ) = −(2l + 1) ρ(Ω) ln ρ(Ω) , (30) 2l+1 W 4π Let ρ be a density matrix on [l] = C and introduce an ancilla Hilbert space [j] = C2j+1. Using the product with the special case (10) for a pure state ρ = |ψihψ|. property (17) and normalization of coherent states, we The Wehrl entropy was introduced as a semi-classical can rewrite the lower symbol ρ(Ω) that enters the formula 6 for the Wehrl entropy as follows: hΩl|ρ|Ωli = hΩl|ρ|ΩlihΩj|Ωji = hΩl ⊗ Ωj|ρ ⊗ 1|Ωl ⊗ Ωji
= hΩl+j|ρ ⊗ 1|Ωl+ji , (32) where 1 is the unit operator on [j]. The values of the lower symbol are thus the diagonal elements of a family of infinite-dimensional matrices
0 0 ρj(Ω, Ω ) = hΩl+j|ρ ⊗ 1|Ωl+ji . (33) By an infinite-dimensional compact analog of the Schur- Horn theorem the diagonal elements ρ(Ω) are majorized 0 by the eigenvalues of the ρj(Ω, Ω ) matrices. This implies that any concave function of the values ρ(Ω) will be larger or equal to the respective function of the eigenvalues of 0 ρj(Ω, Ω ). The Wehrl entropy is therefore larger than or 0 equal to the von Neumann entropy of ρj(Ω, Ω ). For con- FIG. 2: Comparison of the Wehrl entropy SW(l) with vex functions the inequalities are reversed. See e.g. [38] the j = 1-,10-,100-projection pseudoentropies minus a j- for an overview of the mathematical background. In the and l-dependent term, S(j) (l) + ln 2l+1 , for the limit j → ∞ and in view of Eq. (25) the off-diagonal ma- proj 2(l+j)+1 0 trix elements of ρj(Ω, Ω ) become zero and the inequali- NILC 2015 map on the range [1, 30]. For j → ∞ the ties become equalities. Using the property (24) on both latter converges to the former, but not uniformly. sides of Eq. (33) we can recover a finite-dimensional ma- trix
Pl+j (ρ ⊗ 1) Pl+j (34) with φ(x) = x ln(x). From the fact that the mixed den- from ρ (Ω, Ω0), where P is the projector onto the high- sity matrix (37) has at most 2j + 1 non-zero eigenvalues, j l+j we get an upper bound for the projection entropy [39] est spin component [l+j] of the tensor product. The ma- (j) 0 trix (34) has the same eigenvalues as ρj(Ω, Ω ). In fact, Sproj(ρ) ≤ ln(2j + 1). From the [l + j] perspective the if Wehrl entropy should also be computed from Eq. (37) and we get the aforementioned inequalities. The only dif- Pl+j (ρ ⊗ 1) Pl+j|Vλi = λ|Vλi (35) ferences from the original definition of Wehrl entropy (30) is a rescaling of the density matrix and related renormal- then V (Ω) := hΩ |V i satisfies λ l+j λ ization of the integral, which leads to a shift in entropy Z 0 and the following inequality: dΩ 0 0 (2(l + j) + 1) hΩl+j|ρ ⊗ 1|Ωl+jiVλ(Ω ) = λVλ(Ω) 4π (36) (j) 2l + 1 SW (ρ) ≥ Sproj(ρ) + ln . (39) and vice versa if Vλ(Ω) is a solution of Eq. (36), then 2(l + j) + 1 R dΩ |Vλi = (2(l + j) + 1) 4π |Ωl+jiVλ(Ω) satisfies Eq. (35). We have shown that the eigenvalues of the matrix (34) In the limit j → ∞ this inequality becomes an equality, (j) majorize the values of the lower symbol of ρ in the sense see Fig. 2 for the converge of Sproj to SW for Needlet explained above, namely that inequalities are implied for Internal Linear Combination (NILC) Planck data and concave (or convex) functions of these values. It can fur- [40] for an alternative proof. The projector Pl+j :[l] ⊗ thermore be shown that pure states majorize mixed ones [j] → [l+j] can be expressed in terms of Clebsch-Gordan and that among the pure states, projectors |ΩihΩ| onto coefficients: coherent states will lead to matrices (35) that majorize v all other choices. Among the concave functionals we are u 2l 2j u l+m j+M in particular interested in entropy and define an appro- P |l, mi ⊗ |j, Mi = . (40) l+j t 2(l+j) priately normalized mixed density matrix l+j+m+M 2l + 1 (j) For large j the projection method provides a good way ρproj = Pl+j (ρ ⊗ 1) Pl+j , (37) 2(l + j) + 1 to compute the Wehrl entropy with high precision. For whose von Neumann entropy is what we call the “projec- small j we get an entropy measure with all the nice prop- tion entropy” erties of Wehrl entropy, but a pretty large computational advantage. We will now focus on the case where ρ is a (j) 2l + 1 pure state, i.e., ρ = |ψlihψl| with |ψli computed from the Sproj(ρ) = Tr φ Pl+j ρ ⊗ 1 Pl+j , (38) 2(l + j) + 1 alm of the CMB data with l fixed. For a pure state ρ the 7 matrix (34) can be rewritten as the Gram matrix of a set spin-l representation and define a mixed density matrix of vectors V~M ∈ [l + j] that are labeled by a basis of [j]: and entropy via
j X ~ ~ † Pl+j (|ψlihψl| ⊗ 1) Pl+j = VM VM (41) 3 1 X M=−j ρ = L ρL † (48) ang l(l + 1) i i with V~M = Pl+j |ψli ⊗ |j, Mi . (42) i=1 Sang = −Tr (ρang ln (ρang)) . (49) The dual Gram matrix