Santa Fe Institute Working Paper 16-07-015 arxiv.org:1607.06526 [math-ph] Beyond the Spectral Theorem: Spectrally Decomposing Arbitrary Functions of Nondiagonalizable Operators
Paul M. Riechers∗ and James P. Crutchfield† Complexity Sciences Center Department of Physics University of California at Davis One Shields Avenue, Davis, CA 95616 (Dated: September 24, 2017) Nonlinearities in finite dimensions can be linearized by projecting them into infinite dimensions. Unfortunately, the familiar linear operator techniques that one would then use often fail since the operators cannot be diagonalized. This curse is well known. It also occurs for finite-dimensional lin- ear operators. We circumvent it via two tracks. First, using the well-known holomorphic functional calculus, we develop new practical results about spectral projection operators and the relationship between left and right generalized eigenvectors. Second, we generalize the holomorphic calculus to a meromorphic functional calculus that can decompose arbitrary functions of nondiagonalizable lin- ear operators in terms of their eigenvalues and projection operators. This simultaneously simplifies and generalizes functional calculus so that it is readily applicable to analyzing complex physical systems. Together, these results extend the spectral theorem of normal operators to a much wider class, including circumstances in which poles and zeros of the function coincide with the operator spectrum. By allowing the direct manipulation of individual eigenspaces of nonnormal and nondi- agonalizable operators, the new theory avoids spurious divergences. As such, it yields novel insights and closed-form expressions across several areas of physics in which nondiagonalizable dynamics arise, including memoryful stochastic processes, open nonunitary quantum systems, and far-from- equilibrium thermodynamics. The technical contributions include the first full treatment of arbitrary powers of an operator, highlighting the special role of the zero eigenvalue. Furthermore, we show that the Drazin inverse, previously only defined axiomatically, can be derived as the negative-one power of singular operators within the meromorphic functional calculus and we give a new general method to construct it. We provide new formulae for constructing spectral projection operators and delineate the relations among projection operators, eigenvectors, and left and right generalized eigenvectors. By way of illustrating its application, we explore several, rather distinct examples. First, we analyze stochastic transition operators in discrete and continuous time. Second, we show that nondiagonalizability can be a robust feature of a stochastic process, induced even by simple counting. As a result, we directly derive distributions of the time-dependent Poisson process and point out that nondiagonalizability is intrinsic to it and the broad class of hidden semi-Markov processes. Third, we show that the Drazin inverse arises naturally in stochastic thermodynamics and that applying the meromorphic functional calculus provides closed-form solutions for the dynamics of key thermodynamic observables. Fourth, we show that many memoryful processes have power spectra indistinguishable from white noise, despite being highly organized. Nevertheless, whenever the power spectrum is nontrivial, it is a direct signature of the spectrum and projection operators of the process’ hidden linear dynamic, with nondiagonalizable subspaces yielding qualitatively distinct line profiles. Finally, we draw connections to the Ruelle–Frobenius–Perron and Koopman operators for chaotic dynamical systems and propose how to extract eigenvalues from a time-series.
PACS numbers: 02.50.-r 05.45.Tp 02.50.Ey 02.50.Ga
... the supreme goal of all theory is to make I. INTRODUCTION the irreducible basic elements as simple and as few as possible without having to surrender Decomposing a complicated system into its constituent the adequate representation of a single datum parts—reductionism—is one of science’s most power- of experience. A. Einstein [1, p. 165] ful strategies for analysis and understanding. Large- scale systems with linearly coupled components give one paradigm of this success. Each can be decomposed into an equivalent system of independent elements using a similarity transformation calculated from the linear al- ∗ [email protected] gebra of the system’s eigenvalues and eigenvectors. The † [email protected] physics of linear wave phenomena, whether of classical 2 light or quantum mechanical amplitudes, sets the stan- ues. The corresponding theory of spectral decomposition dard of complete reduction rather high. The dynamics established the quantitative foundation of quantum me- is captured by an “operator” whose allowed or exhib- chanics. ited “modes” are the elementary behaviors out of which The applications and discoveries enabled by spectral composite behaviors are constructed by simply weighting decomposition and the corresponding spectral theory each mode’s contribution and adding them up. fill a long list. In application, direct-bandgap semi- However, one should not reduce a composite system conducting materials can be turned into light-emitting more than is necessary nor, as is increasingly appre- diodes (LEDs) or lasers by engineering the spatially- ciated these days, more than one, in fact, can. In- inhomogeneous distribution of energy eigenvalues and the deed, we live in a complex, nonlinear world whose con- occupation of their corresponding states [7]. Before their stituents are strongly interacting. Often their key struc- experimental discovery, anti-particles were anticipated as tures and memoryful behaviors emerge only over space the nonoccupancy of negative-energy eigenstates of the and time. These are the complex systems. Yet, perhaps Dirac Hamiltonian [8]. surprisingly, many complex systems with nonlinear dy- The spectral theory, though, extends far beyond phys- namics correspond to linear operators in abstract high- ical science disciplines. In large measure, this arises dimensional spaces [2–4]. And so, there is a sense in since the evolution of any object corresponds to a lin- which even these complex systems can be reduced to the ear dynamic in a sufficiently high-dimensional state study of independent nonlocal collective modes. space. Even nominally nonlinear dynamics over sev- Reductionism, however, faces its own challenges even eral variables, the canonical mechanism of determin- within its paradigmatic setting of linear systems: lin- istic chaos, appear as linear dynamics in appropriate ear operators may have interdependent modes with ir- infinite-dimensional shift-spaces [4]. A nondynamic ver- reducibly entwined behaviors. These irreducible com- sion of rendering nonlinearities into linearities in a higher- ponents correspond to so-called nondiagonalizable sub- dimensional feature space is exploited with much success spaces. No similarity transformation can reduce them. today in machine learning by support vector machines, In this view, reductionism can only ever be a guide. for example [9]. Spectral decomposition often allows a The actual goal is to achieve a happy medium, as Einstein problem to be simplified by approximations that use only reminds us, of decomposing a system only to that level the dominant contributing modes. Indeed, human-face at which the parts are irreducible. To proceed, though, recognition can be efficiently accomplished using a small begs the original question, What happens when reduc- basis of “eigenfaces” [10]. tionism fails? To answer this requires revisiting one of Certainly, there are many applications that highlight its more successful implementations, spectral decompo- the importance of decomposition and the spectral theory sition of completely reducible operators. of operators. However, a brief reflection on the math- ematical history will give better context to its precise results, associated assumptions, and, more to the point, A. Spectral Decomposition the generalizations we develop here in hopes of advancing the analysis and understanding of complex systems. Spectral decomposition—splitting a linear operator Following on early developments of operator theory into independent modes of simple behavior—has greatly by Hilbert and co-workers [11], the spectral theorem for accelerated progress in the physical sciences. The im- normal operators reached maturity under von Neumann pact stems from the fact that spectral decomposition is by the early 1930s [12, 13]. It became the mathemat- not only a powerful mathematical tool for expressing the ical backbone of much progress in physics since then, organization of large-scale systems, but also yields pre- from classical partial differential equations to quantum dictive theories with directly observable physical conse- physics. Normal operators, by definition, commute with † † quences [5]. Quantum mechanics and statistical mechan- their Hermitian conjugate: A A = AA . Examples in- ics identify the energy eigenvalues of Hamiltonians as the clude symmetric and orthogonal matrices in classical me- basic objects in thermodynamics: transitions among the chanics and Hermitian, skew-Hermitian, and unitary op- energy eigenstates yield heat and work. The eigenvalue erators in quantum mechanics. spectrum reveals itself most directly in other kinds of The spectral theorem itself is often identified as a col- spectra, such as the frequency spectra of light emitted lection of related results about normal operators; see, by the gases that permeate the galactic filaments of our e.g., Ref. [14]. In the case of finite-dimensional vector universe [6]. Quantized transitions, an initially mystify- spaces [15], the spectral theorem asserts that normal op- ing feature of atomic-scale systems, correspond to dis- erators are diagonalizable and can always be diagonalized tinct eigenvectors and discrete spacing between eigenval- by a unitary transformation; that left and right eigenvec- 3 tors (or eigenfunctions) are simply related by complex- stract mathematical theory into a more tractable frame- conjugate transpose; that these eigenvectors form a com- work for analyzing complex physical systems. plete basis; and that functions of a normal operator re- Taken together, the functional calculus, Drazin inverse, duce to the action of the function on each eigenvalue. and methods to manipulate particular eigenspaces, are Most of these qualities survive with only moderate pro- key to a thorough-going analysis of many complex sys- visos in the infinite-dimensional case. In short, the spec- tems, many now accessible for the first time. Indeed, the tral theorem makes physics governed by normal operators framework has already been fruitfully employed in sev- tractable. eral specific applications, including closed-form expres- The spectral theorem, though, appears powerless when sions for signal processing and information measures of faced with nonnormal and nondiagonalizable operators. hidden Markov processes [19–23], compressing stochastic What then are we to do when confronted by, say, com- processes over a quantum channel [24, 25], and stochas- plex interconnected systems with nonunitary time evolu- tic thermodynamics [26, 27]. However, the techniques are tion, by open systems, by structures that emerge on space sufficiently general they will be much more widely useful. and time scales different from the equations of motion, or We envision new opportunities for similar detailed anal- by other novel physics governed by nonnormal and not- yses, ranging from biophysics to quantum field theory, necessarily-diagonalizable operators? Where is the com- wherever restrictions to normal operators and diagonal- parably constructive framework for calculations beyond izability have been roadblocks. the standard spectral theorem? Fortunately, portions of With this broad scope in mind, we develop the math- the necessary generalization have been made within pure ematical theory first without reference to specific appli- mathematics [16], some finding applications in engineer- cations and disciplinary terminology. We later give ped- ing and control [17, 18]. However, what is available is agogical (yet, we hope, interesting) examples, exploring incomplete. And, even that which is available is often several niche, but important applications to finite hid- not in a form adapted to perform calculations that lead den Markov processes, basic stochastic process theory, to quantitative predictions. nonequilibrium thermodynamics, signal processing, and nonlinear dynamical systems. At a minimum, the exam- ples and their breadth serve to better acquaint readers B. Synopsis with the basic methods required to employ the theory. We introduce the meromorphic functional calculus in Here, we build on previous work in functional analy- §III through §IV, after necessary preparation in §II. §VA sis and operator theory to provide both a rigorous and further explores and gives a new formula for eigenprojec- constructive foundation for physically relevant calcula- tors, which we refer to here simply as projection opera- tions involving not-necessarily-diagonalizable operators. tors. §V B makes explicit their general relationship with In effect, we extend the spectral theorem for normal oper- eigenvectors and generalized eigenvectors and clarifies the ators to a broader setting, allowing generalized “modes” orthonormality relationship among left and right gener- of nondiagonalizable systems to be identified and manip- alized eigenvectors. §V B 4 then discusses simplifications ulated. The meromorphic functional calculus we develop of the functional calculus for special cases, while §VI A extends Taylor series expansion and standard holomor- takes up the spectral properties of transition operators. phic functional calculus to analyze arbitrary functions of The examples are discussed at length in §VI before we not-necessarily-diagonalizable operators. It readily han- close in §VII with suggestions on future applications and dles singularities arising when poles (or zeros) of the research directions. function coincide with poles of the operator’s resolvent— poles that appear precisely at the operator’s eigenvalues. Pole–pole and pole–zero interactions substantially mod- II. SPECTRAL PRIMER ify the complex-analytic residues within the functional calculus. A key result is that the negative-one power of The following is relatively self-contained, assuming ba- a singular operator exists in the meromorphic functional sic familiarity with linear algebra at the level of Refs. [15, calculus. It is the Drazin inverse, a powerful tool that is 17]—including eigen-decomposition and knowledge of the receiving increased attention in stochastic thermodynam- Jordan canonical form, partial fraction expansion (see ics and elsewhere. Furthermore, we derive consequences Ref. [28]), and series expansion—and basic knowledge from the more familiar holomorphic functional calculus of complex analysis—including the residue theorem and that readily allow spectral decomposition of nondiago- calculation of residues at the level of Ref. [29]. For those nalizable operators in terms of spectral projections and lacking a working facility with these concepts, a quick re- left and right generalized eigenvectors—decanting the ab- view of §VI’s applications may motivate reviewing them. 4
In this section, we introduce our notation and, in doing is complex differentiable throughout the domain under so, remind the reader of certain basic concepts in linear consideration. A pole of order n at z0 C is a singular- n ∈ algebra and complex analysis that will be used exten- ity that behaves as h(z)/(z z0) as z z0, where h(z) is − → sively in the following. holomorphic within a neighborhood of z0 and h(z0) = 0. 6 To begin, we restrict attention to operators with finite We say that h(z) has a zero of order m at z1 if 1/h(z) has representations and only sometimes do we take the limit a pole of order m at z1.A meromorphic function is one of dimension going to infinity. That is, we do not consider that is holomorphic except possibly at a set of isolated infinite-rank operators outright. While this runs counter poles within the domain under consideration. to previous presentations in mathematical physics that Defined over the continuous complex variable z C, ∈ consider only infinite-dimensional operators, the upshot A’s resolvent: is that they—as limiting operators—can be fully treated −1 with a countable point spectrum. We present examples of R(z; A) (zI A) , ≡ − this later on. Accordingly, we restrict our attention to op- erators with at most a countably infinite spectrum. Such captures all of A’s spectral information through the poles operators share many features with finite-dimensional of R(z; A)’s matrix elements. In fact, the resolvent con- square matrices, and so we recall several elementary but tains more than just A’s spectrum: we later show that the essential facts from matrix theory used repeatedly in the order of each pole gives the index ν of the corresponding main development. eigenvalue. If A is a finite-dimensional square matrix, then its spec- The spectrum ΛA can be expressed in terms of the resolvent. Explicitly, the point spectrum (i.e., the set of trum is simply the set ΛA of its eigenvalues: eigenvalues) is the set of complex values z at which zI A − ΛA = λ C : det(λI A) = 0 , is not a one-to-one mapping, with the implication that ∈ − the inverse of zI A does not exist: − where det( ) is the determinant of its argument and I · is the identity matrix. The algebraic multiplicity a of ΛA = λ C : R(λ; A) = inv(λI A) , λ ∈ 6 − eigenvalue λ is the power of the term (z λ) in the char- − where inv( ) is the inverse of its argument. Later, via acteristic polynomial det(zI A). In contrast, the geo- · − our investigation of the Drazin inverse, it should become metric multiplicity gλ is the dimension of the kernel of the transformation A λI or, equivalently, the number clear that the resolvent operator can be self-consistently − of linearly independent eigenvectors associated with the defined at the spectrum, despite the lack of an inverse. eigenvalue. The algebraic and geometric multiplicities For infinite-rank operators, the spectrum becomes are all equal when the matrix is diagonalizable. more complicated. In that case, the right point spec- Since there can be multiple subspaces associated with trum (the point spectrum of A) need not be the same as a single eigenvalue, corresponding to different Jordan the left point spectrum (the point spectrum of A’s dual > blocks in the Jordan canonical form, it is structurally A ). Moreover, the spectrum may grow to include non- eigenvalues z for which the range of zI A is not dense important to distinguish the index of the eigenvalue as- − in the vector space it transforms or for which zI A has sociated with the largest of these subspaces [30]. − dense range but the inverse of zI A is not bounded. − Definition 1. Eigenvalue λ’s index νλ is the size of the These two settings give rise to the so-called residual spec- largest Jordan block associated with λ. trum and continuous spectrum, respectively [32]. To mit- igate confusion, it should be noted that the point spec- If z / ΛA, then νz = 0. Note that the index of the trum can be continuous, yet never coincides with the con- ∈ operator A itself is sometimes discussed [31]. In such tinuous spectrum just described. Moreover, understand- contexts, the index of A is ν0. Hence, νλ corresponds to ing only countable point spectra is necessary to follow the index of A λI. the developments here. − The index of an eigenvalue gives information beyond Each of A’s eigenvalues λ has an associated projection what the algebraic and geometric multiplicities them- operator Aλ, which is the residue of the resolvent as z → selves yield. Nevertheless, for λ ΛA, it is always true λ [14]. Explicitly: ∈ that νλ 1 aλ gλ aλ 1. In the diagonalizable − ≤ − ≤ − −1 case, aλ = gλ and νλ = 1 for all λ ΛA. Aλ = Res (zI A) , z λ , ∈ − → The following employs basic features of complex analy- sis extensively in conjunction with linear algebra. Let us where Res( , z λ) is the element-wise residue of its · → therefore review several elementary notions in complex first argument as z λ. The projection operators are → analysis. Recall that a holomorphic function is one that 5 orthonormal: convergence for most functions. For example, suppose f(z) has poles and choose a Maclaurin series; i.e., ξ = 0 AλAζ = δλ,ζ Aλ . (1) in Eq. (3). Then the series only converges when A’s spec- tral radius is less than the radius of the innermost pole and sum to the identity: of f(z). Addressing this and related issues leads directly X to alternative functional calculi. I = Aλ . (2)
λ∈ΛA
The following discusses in detail and then derives several B. Holomorphic functional calculus new properties of projection operators. Holomorphic functions are well behaved, smooth func- tions that are complex differentiable. Given a function III. FUNCTIONAL CALCULI f( ) that is holomorphic within a disk enclosed by a coun- · terclockwise contour C, its Cauchy integral formula is In the following, we develop an extended functional given by: calculus that makes sense of arbitrary functions f( ) of · 1 I a linear operator A. Within any functional calculus, one f(a) = f(z)(z a)−1 dz , (4) considers how A’s eigenvalues map to the eigenvalues of 2πi C − f(A); which we call a spectral mapping. For example, it Taking this as inspiration, the holomorphic functional is known that holomorphic functions of bounded linear calculus performs a contour integration of the resolvent operators enjoy an especially simple spectral mapping to extend f( ) to operators: theorem [33]: · 1 I f(A) = f(z)(zI A)−1 dz , (5) Λf(A) = f(ΛA) . 2πi − CΛA To fully appreciate the meromorphic functional calculus, where CΛA is a closed counterclockwise contour that en- we first state and compare the main features and limita- compasses ΛA. Assuming that f(z) is holomorphic at tions of alternative functional calculi. z = λ for all λ ΛA, a nontrivial calculation [30] shows ∈ that Eq. (5) is equivalent to the holomorphic calculus defined by: A. Taylor series
νλ−1 (m) X X f (λ) m Inspired by the Taylor expansion of scalar functions: f(A) = (A λI) Aλ . (6) m! − λ∈ΛA m=0 ∞ (n) X f (ξ) n f(a) = (a ξ) , After some necessary development, we will later derive n! − n=0 Eq. (6) as a special case of our meromorphic functional a calculus for functions of an operator A can be based on calculus, such that Eq. (6) is valid whenever f(z) is holo- morphic at z = λ for all λ ΛA. the series: ∈ The holomorphic functional calculus was first proposed ∞ X f (n)(ξ) in Ref. [30] and is now in wide use; e.g., see Ref. [17, p. f(A) = (A ξI)n , (3) n! 603]. It agrees with the Taylor-series approach whenever n=0 − the infinite series converges, but gives a functional calcu- where f (n)(ξ) is the nth derivative of f(z) evaluated at lus when the series approach fails. For example, using the z = ξ. principal branch of the complex logarithm, the holomor- This is often used, for example, to express the expo- phic functional calculus admits log(A) for any nonsingu- log(A) nential of A as: lar matrix, with the satisfying result that e = A. Whereas, the Taylor series approach fails to converge for ∞ X An the logarithm of most matrices even if the expansion for, eA = . n! say, log(1 z) is used. n=0 − The major shortcoming of the holomorphic functional This particular series-expansion is convergent for any A calculus is that it assumes f(z) is holomorphic at ΛA. z since e is entire, in the sense of complex analysis. Un- Clearly, if f(z) has a pole at some z ΛA, then Eq. (6) ∈ fortunately, even if it exists there is a limited domain of fails. An example of such a failure is the negative-one 6 power of a singular operator, which we take up later on. The major assumption of our meromorphic functional Several efforts have been made to extend the holomor- calculus is that the domain of operators must have a spec- phic functional calculus. For example, Refs. [34] and [35] trum that is at most countably infinite—e.g., A can be define a functional calculus that extends the standard any compact operator. A related limitation is that sin- holomorphic functional calculus to include a certain class gularities of f(z) that coincide with ΛA must be isolated of meromorphic functions that are nevertheless still re- singularities. Nevertheless, we expect that these restric- quired to be holomorphic on the point spectrum (i.e., on tions can be lifted with proper treatment, as discussed in the eigenvalues) of the operator. However, we are not fuller context later. aware of any previous work that introduces and develops the consequences of a functional calculus for functions that are meromorphic on the point spectrum—which we IV. MEROMORPHIC SPECTRAL take up in the next few sections. DECOMPOSITION
The preceding gave an overview of the relationship be- C. Meromorphic functional calculus tween alternative functional calculi and their trade-offs, highlighting the advantages of the meromorphic func- Meromorphic functions are holomorphic except at a tional calculus. This section leverages these advantages set of isolated poles of the function. The resolvent of and employs a partial fraction expansion of the resolvent a finite-dimensional operator is meromorphic, since it is to give a general spectral decomposition of almost any holomorphic everywhere except for poles at the eigenval- function of any operator. Then, since it plays a key role ues of the operator. We will now also allow our function in applications, we apply the functional calculus to inves- f(z) to be meromorphic with possible poles that coincide tigate the negative-one power of singular operators—thus with the poles of the resolvent. deriving, what is otherwise an operator defined axiomat- Inspired again by the Cauchy integral formula of Eq. ically, the Drazin inverse from first principles. (4), but removing the restriction to holomorphic func- tions, our meromorphic functional calculus instead em- ploys a partitioned contour integration of the resolvent: A. Partial fraction expansion of the resolvent
X 1 I f(A) = f(z)R(z; A) dz , The elements of A’s resolvent are proper rational func- 2πi Cλ λ∈ΛA tions that contain all of A’s spectral information. (Recall that a proper rational function r(z) is a ratio of polyno- where Cλ is a small counterclockwise contour around the mials in z whose numerator has degree strictly less than eigenvalue λ. This and a spectral decomposition of the the degree of the denominator.) In particular, the re- resolvent (to be derived later) extends the holomorphic solvent’s poles coincide with A’s eigenvalues since, for calculus to a much wider domain, defining: z / ΛA: ∈ νλ−1 I −1 X X m 1 f(z) R(z; A) = (zI A) f(A) = Aλ A λI dz . − − 2πi (z λ)m+1 > λ∈Λ m=0 Cλ A − = C (7) det(zI A) −> = C , (8) The contour is integrated using knowledge of f(z) since Q aλ λ∈ΛA (z λ) meromorphic f(z) can introduce poles and zeros at ΛA − that interact with the resolvent’s poles. where aλ is the algebraic multiplicity of eigenvalue λ and The meromorphic functional calculus agrees with the is the matrix of cofactors of zI A. That is, ’s trans- C − C Taylor-series approach whenever the series converges and pose > is the adjugate of zI A: agrees with the holomorphic functional calculus when- C − > ever f(z) is holomorphic at ΛA. However, when both the = adj(zI A) , previous functional calculi fail, the meromorphic calculus C − extends the domain of f(A) to yield surprising, yet sen- whose elements will be polynomial functions of z of de- P sible answers. For example, we show that within it, the gree less than λ∈ΛA aλ. negative-one power of a singular operator is the Drazin Recall that the partial fraction expansion of a proper inverse—an operator that effectively inverts everything rational function r(z) with poles in Λ allows a unique that is invertible. decomposition into a sum of constant numerators divided 7
0 by monomials in z λ up to degree aλ, when aλ is the z = 1. Then, Eq. (13) implies: − order of the pole of r(z) at λ Λ [28]. Equation (8) thus ∈ X 1 I makes it clear that the resolvent has the unique partial I = R(z; A)dz 2πi Cλ fraction expansion: λ∈ΛA X aλ−1 = Aλ . X X 1 R(z; A) = Aλ,m , (9) λ∈ΛA (z λ)m+1 λ∈ΛA m=0 − This shows that the projection operators are, in fact, a where Aλ,m is the set of matrices with constant entries decomposition of the identity, as anticipated in Eq. (2). { } (not functions of z) uniquely determined elementwise by the partial fraction expansion. However, R(z; A)’s poles are not necessarily of the same order as the algebraic mul- tiplicity of the corresponding eigenvalues since the entries C. Dunford decomposition, decomposed of , and thus of >, may have zeros at A’s eigenvalues. C C This has the potential to render Aλ,m equal to the zero matrix 0. For f(A) = A, Eqs. (13) and (10) imply that: The Cauchy integral formula indicates that the con- X 1 I stant matrices Aλ,m of Eq. (9) can be obtained as the A = zR(z; A)dz { } 2πi Cλ residues: λ∈ΛA X I I 1 I = λ 1 R(z; A)dz + 1 (z λ)R(z; A)dz A = (z λ)mR(z; A)dz , (10) 2πi 2πi − λ,m λ∈Λ Cλ Cλ 2πi C − A λ X = (λAλ,0 + Aλ,1) . (14) where the residues are calculated elementwise. The pro- λ∈ΛA jection operators Aλ associated with each eigenvalue λ were already referenced in §II, but can now be properly We denote the important set of nilpotent matrices Aλ,1 introduced as the Aλ,0 matrices: that project onto the generalized eigenspaces by relabel- ing them: Aλ = Aλ,0 (11) 1 I Nλ Aλ,1 (15) = R(z; A)dz . (12) ≡ 2πi 1 I Cλ = (z λ)R(z; A)dz . (16) 2πi Cλ − Since R(z; A)’s elements are rational functions, as we just showed, it is analytic except at a finite number of Equation (14) is the unique Dunford decomposi- P isolated singularities—at A’s eigenvalues. In light of the tion [16]: A = D + N, where D λ∈Λ λAλ is di- P ≡ A residue theorem, this motivates the Cauchy-integral-like agonalizable, N Nλ is nilpotent, and D and N ≡ λ∈ΛA formula that serves as the starting point for the mero- commute: [D,N] = 0. This is also known as the Jordan– morphic functional calculus: Chevalley decomposition. The special case where A is diagonalizable implies that X 1 I f(A) = f(z)R(z; A)dz . (13) N = 0. And so, Eq. (14) simplifies to: 2πi Cλ λ∈ΛA X A = λAλ .
Let’s now consider several immediate consequences. λ∈ΛA
D. The resolvent, resolved
B. Decomposing the identity As shown in Ref. [14] and can be derived from Eqs. (12) and (16): Even the simplest applications of Eq. (13) yield insight. A A = δ A and Consider the identity as the operator function f(A) = λ ζ λ,ζ λ A0 = I that corresponds to the scalar function f(z) = AλNζ = δλ,ζ Nλ . 8
Due to these, our spectral decomposition of the Dunford and itself: decomposition implies that: ν −1 X Xλ 1 I f(z) X f(A) = Aλ,m m+1 dz . (23) Nλ = Aλ A ζAζ 2πi Cλ (z λ) − λ∈ΛA m=0 − ζ∈ΛA = Aλ A λAλ In obtaining Eq. (23) we finally derived Eq. (7), as − promised earlier in § III C. Effectively, by modulating the = Aλ A λI . (17) − modes associated with the resolvent’s singularities, the scalar function f( ) is mapped to the operator domain, Moreover: · where its action is expressed in each of A’s independent m Aλ,m = Aλ A λI . (18) subspaces. − m It turns out that for m > 0: Aλ,m = Nλ . (See also Ref. [14, p. 483].) This leads to a generalization of the F. Evaluating the residues projection operator orthonormality relations of Eq. (1). Most generally, the operators of Aλ,m are mutually re- Interpretation aside, how does one use this result? { } lated by: Equation (23) says that the spectral decomposition of f(A) reduces to the evaluation of several residues, where: Aλ,mAζ,n = δλ,ζ Aλ,m+n . (19) 1 I Res g(z), z λ = g(z) dz . Finally, if we recall that the index νλ is the dimension of → 2πi Cλ the largest associated subspace, we find that the index m So, to make progress with Eq. (23), we must evaluate of λ characterizes the nilpotency of Nλ: Nλ = 0 for m νλ. That is: function-dependent residues of the form: ≥ m+1 Aλ,m = 0 for m νλ . (20) Res f(z)/(z λ) , z λ . ≥ − →
Returning to Eq. (9), we see that all Aλ,m with m νλ If f(z) were holomorphic at each λ, then the order of ≥ are zero-matrices and so do not contribute to the sum. the pole would simply be the power of the denomina- Thus, we can rewrite Eq. (9) as: tor. We could then use Cauchy’s differential formula for holomorphic functions: ν −1 X Xλ 1 n! I f(z) R(z; A) = m+1 Aλ,m (21) (n) (z λ) f (a) = n+1 dz , (24) λ∈ΛA m=0 − 2πi C (z a) a − or: for f(z) holomorphic at a. And, the meromorphic cal- culus would reduce to the holomorphic calculus. Often, νλ−1 X X 1 m f(z) will be holomorphic at least at some of A’s eigenval- R(z; A) = Aλ A λI , (22) (z λ)m+1 − ues. And so, Eq. (24) is still locally a useful simplification λ∈ΛA m=0 − in those special cases. for z / ΛA. In general, though, f(z) introduces poles and zeros at ∈ The following sections sometimes use Aλ,m in place of λ ΛA that change their orders. This is exactly the im- m ∈ Aλ A λI . This is helpful both for conciseness and petus for the generalized functional calculus. The residue − when applying Eq. (19). Nonetheless, the equality in of a complex-valued function g(z) around its isolated pole Eq. (18) is a useful one to keep in mind. λ of order n + 1 can be calculated from: n 1 d n+1 Res g(z), z λ = lim n (z λ) g(z) . → n! z→λ dz −
E. Meromorphic functional calculus G. Decomposing AL In light of Eq. (13), Eq. (21) together with Eq. (18) allow us to express any function of an operator simply Equation (23) says that we can explicitly derive the and solely in terms of its spectrum (i.e., its eigenvalues spectral decomposition of powers of the operator A. Of for the finite dimensional case), its projection operators, course, we already did this for the special cases of A0 and 9
A1. The goal, though, is to do this in general. example, at this special value of λ and for integer L > 0, For f(A) = AL f(z) = zL, z = 0 can be either a λ = 0 induces poles that cancel with the zeros of f(z) = → zero or a pole of f(z), depending on the value of L. In zL, since zL has a zero at z = 0 of order L. For integer either case, an eigenvalue of λ = 0 will distinguish itself L < 0, an eigenvalue of λ = 0 increases the order of the in the residue calculation of AL via its unique ability to z = 0 pole of f(z) = zL. For all other eigenvalues, the change the order of the pole (or zero) at z = 0. For residues will be as expected. Hence, from Eq. (23) and inserting f(z) = zL, for any L C: ∈
1 dm L λL−m Qm = m! limz→λ dzm z = m! n=1(L−n+1) z }| { " νλ−1 I L # ν0−1 I L X X m 1 z X m 1 L−m−1 A = Aλ A λI m+1 dz + [0 ΛA] A0A z dz − 2πi C (z λ) ∈ 2πi C λ∈ΛA m=0 λ − m=0 0 λ6=0 | {z } =δL,m
" νλ−1 # ν0−1 X X L L−m m X m = λ Aλ A λI + [0 ΛA] δL,mA0A , (25) m − ∈ λ∈ΛA m=0 m=0 λ6=0