Partial Trace Regression and Low-Rank Kraus Decomposition

Total Page:16

File Type:pdf, Size:1020Kb

Partial Trace Regression and Low-Rank Kraus Decomposition Partial Trace Regression and Low-Rank Kraus Decomposition Hachem Kadri 1 Stéphane Ayache 1 Riikka Huusari 2 Alain Rakotomamonjy 3 4 Liva Ralaivola 4 Abstract The trace regression model, a direct extension of the well-studied linear regression model, al- lows one to map matrices to real-valued outputs. We here introduce an even more general model, namely the partial-trace regression model, a fam- ily of linear mappings from matrix-valued inputs to matrix-valued outputs; this model subsumes the Figure 1. Illustration of the partial trace operation. The partial trace regression model and thus the linear regres- trace operation applied to m × m-blocks of a qm × qm matrix sion model. Borrowing tools from quantum infor- gives a q × q matrix as an output. mation theory, where partial trace operators have been extensively studied, we propose a framework where tr(·) denotes the trace, B∗ is some unknown ma- for learning partial trace regression models from trix of regression coefficients, and is random noise. This data by taking advantage of the so-called low-rank model has been used beyond mere matrix regression for Kraus representation of completely positive maps. problems such as phase retrieval (Candes et al., 2013), quan- We show the relevance of our framework with tum state tomography (Gross et al., 2010), and matrix com- synthetic and real-world experiments conducted pletion (Klopp, 2011). for both i) matrix-to-matrix regression and ii) pos- ` itive semidefinite matrix completion, two tasks Given a sample S = f(Xi; yi)gi=1, where Xi is a p1 × p2 which can be formulated as partial trace regres- matrix and yi 2 R, and each (Xi; yi) is assumed to be sion problems. distributed as (X; y), the training task associated with sta- tistical model (1) is to find a matrix Bb that is a proxy to B∗. To this end, Koltchinskii et al.(2011); Fan et al.(2019) 1. Introduction proposed to compute an estimation Bb of B∗ as the solution of the regularized least squares problem Trace regression model. The trace regression model or, ` in short, trace regression, is a generalization of the well- X 2 B = arg min y − tr B>X + λkBk ; (2) known linear regression model to the case where input data b i i 1 B i=1 are matrices instead of vectors (Rohde & Tsybakov, 2011; Koltchinskii et al., 2011; Slawski et al., 2015), with the where k · k1 is the trace norm (or nuclear norm), which output still being real-valued. This model assumes, for the promotes a low-rank Bb, a key feature for the authors to pair of covariates (X; y), the following relation between establish bounds on the deviation of Bb from B∗. Slawski the matrix-valued random variable X and the real-valued et al.(2015); Koltchinskii & Xia(2015) have considered the : random variable y: particular case where p = p1 = p2 and B∗ is assumed to be + from Sp , the cone of positive semidefinite matrices of order y = tr B>X + , (1) ∗ p, and they showed that guarantees on the deviation of Bb 1Aix-Marseille University, CNRS, LIS, Marseille, France from B∗ continue to hold when Bb is computed as 2 Helsinki Institute for Information Technology HIIT, Department ` 3 of Computer Science, Aalto University, Espoo, Finland Université X 2 4 Bb = arg min (yi − tr (BXi)) : (3) Rouen Normandie, LITIS, Rouen, France Criteo AI Lab, Paris, + France. Correspondence to: Hachem Kadri <hachem.kadri@univ- B2Sp i=1 amu.fr>. Here, the norm regularization of (2) is no longer present Proceedings of the 37 th International Conference on Machine and it is replaced by an explicit restriction for Bb to be in + Learning, Online, PMLR 119, 2020. Copyright 2020 by the au- Sp (as B∗). This setting is tied to the learning of completely thor(s). positive maps developed hereafter. Partial Trace Regression and Low-Rank Kraus Decomposition Partial trace regression model. Here, we propose the Symbol Meaning partial trace regression model, that generalizes the trace i j m n p q regression model to the case when both inputs and outputs , , , , , integers α β γ ::: are matrices, and we go a step farther from works that are , , , real numbers X Y H ::: 1 assuming either matrix-variate inputs (Zhou & Li, 2014; , , , vector spaces x y k ::: Ding & Cook, 2014; Slawski et al., 2015; Luo et al., 2015) , , , vectors (or functions) X Y K ::: or matrix/tensor-variate outputs (Kong et al., 2019; Li & , , , matrices (or operators) X Y K ::: Zhang, 2017; Rabusseau & Kadri, 2016). Key to our work , , , block matrices Φ Λ Γ ::: is the so-called partial trace, explained in the following , , , linear maps on matrices > section and depicted in Figure1. transpose This novel regression model that maps matrix-to-matrix Table 1. Notational conventions used in the paper. is of interest for several application tasks. For instance, in Brain-Computer Interfaces, covariance matrices are fre- quently used as a feature for representing mental state of a learning with the partial trace regression model translates, subject (Barachant et al., 2011; Congedo et al., 2013) and together with a generalization error bound for the associated those covariance matrices need to be transformed (Zanini learned model. In addition, we present how the problem et al., 2018) into other covariance matrices to be discrimi- of (block) positive semidefinite matrix completion can be native for some BCI tasks. Similarly in Computer Vision cast as a partial trace regression problem. and especially in the subfield of 3D Shape retrieval, co- variance matrices are of interest as descriptors (Guo et al., 2.1. Notations, Block Matrices and Partial Trace 2018; Hariri et al., 2017), while there is a surging interest Notations. Our notational conventions are summarized in deep learning methods for defining trainable layers with in Table1. For n; m 2 N, Mn×m = Mn×m(R) denotes covariance matrices as input and output (Huang & Van Gool, the space of all n × m real matrices. If n = m, then we 2017; Brooks et al., 2019). write Mn instead of Mn×n. If M is a matrix, Mij denotes Contributions. We make the following contributions. its (i; j)-th entry. For M 2 Mn, M 0 will be used i) We introduce the partial trace regression model, a family to indicate that M is positive semidefinite (PSD); we may + l of linear predictors from matrix-valued inputs to matrix- equivalently write M 2 Sn . Throughout, f(Xi;Yi)gi=1 valued outputs; this model encompasses previously known denotes a training sample of l examples, with each (Xi;Yi) assumed to be drawn IID from a fixed but unknown dis- regression models, including the trace regression model and : tribution on X × Y where, from now on, X = and thus the linear regression model; ii) borrowing concepts : Mp from quantum information theory, where partial trace opera- Y = Mq. tors have been extensively studied, we propose a framework for learning a partial trace regression model from data; we Block matrices. We will make extensive use of the notion take advantage of the low-rank Kraus representation of com- of block matrices, i.e., matrices that can be partitioned into pletely positive maps to cast learning as an optimization submatrices of the same dimension. If M 2 Mnm, the problem which we are able to handle efficiently; iii) we number of block partitions of M directly depends on the provide statistical guarantees for the model learned under number of factors of n and m; to uniquely identify the our framework, thanks to a provable estimation of pseudodi- partition we are working with, we will always consider a mension of the class of functions that we can envision; iv) n × n block partition, where n will be clear from context finally, we show the relevance of the proposed framework —the number of rows and columns of the matrix at hand will for the tasks of matrix-to-matrix regression and positive thus be multiples of n. The set Mn(Mm) will then denote semidefinite matrix completion, both of them are amenable the space of n × n block matrices M = [[Mij]] whose i; j 2 to a partial trace regression formulation; our empirical re- entry is an element of Mm. sults show that partial trace regression model yields good performance, demonstrating wide applicability and effec- Partial trace operators. Partial trace, extensively studied tiveness. and used in quantum computing (see, e.g,. Rieffel & Polak 2011, Chapter 10), generalizes the trace operation to block 2. Partial Trace Regression matrices. The definition we work with is the following. Here, we introduce the partial trace regression model to en- Definition 1 (Partial trace, see, e.g., Bhatia, 2003.) The code linear mappings from matrix-valued spaces to matrix- partial trace operator, denoted by trm(·), is the linear map valued spaces. We specialize this setting to completely 1 n We also use the standard notations such as R and Mn. positive maps, and show the optimization problem to which 2 The space Mn(Mm) is isomorphic to Mn ⊗ Mm. Partial Trace Regression and Low-Rank Kraus Decomposition from Mq(Mm) to Mq defined by fields of mathematics, physics, and more specifically quan- tum computation and information (Bhatia, 2009; Nielsen & trm(M) = tr(Mij) ; i; j = 1; : : : ; q: Chuang, 2000; Størmer, 2012; Watrous, 2018). Operators from L(Mp; Mq) that have special properties are In other words, given a block matrix M of size qm × qm, the so-called completely positive maps, a family that builds the partial trace is obtained by computing the trace of each upon the notion of positive operators. block of size m × m in the input matrix M, as depicted in Figure1.
Recommended publications
  • A Mini-Introduction to Information Theory
    A Mini-Introduction To Information Theory Edward Witten School of Natural Sciences, Institute for Advanced Study Einstein Drive, Princeton, NJ 08540 USA Abstract This article consists of a very short introduction to classical and quantum information theory. Basic properties of the classical Shannon entropy and the quantum von Neumann entropy are described, along with related concepts such as classical and quantum relative entropy, conditional entropy, and mutual information. A few more detailed topics are considered in the quantum case. arXiv:1805.11965v5 [hep-th] 4 Oct 2019 Contents 1 Introduction 2 2 Classical Information Theory 2 2.1 ShannonEntropy ................................... .... 2 2.2 ConditionalEntropy ................................. .... 4 2.3 RelativeEntropy .................................... ... 6 2.4 Monotonicity of Relative Entropy . ...... 7 3 Quantum Information Theory: Basic Ingredients 10 3.1 DensityMatrices .................................... ... 10 3.2 QuantumEntropy................................... .... 14 3.3 Concavity ......................................... .. 16 3.4 Conditional and Relative Quantum Entropy . ....... 17 3.5 Monotonicity of Relative Entropy . ...... 20 3.6 GeneralizedMeasurements . ...... 22 3.7 QuantumChannels ................................... ... 24 3.8 Thermodynamics And Quantum Channels . ...... 26 4 More On Quantum Information Theory 27 4.1 Quantum Teleportation and Conditional Entropy . ......... 28 4.2 Quantum Relative Entropy And Hypothesis Testing . ......... 32 4.3 Encoding
    [Show full text]
  • Quantum Information Theory
    Lecture Notes Quantum Information Theory Renato Renner with contributions by Matthias Christandl February 6, 2013 Contents 1 Introduction 5 2 Probability Theory 6 2.1 What is probability? . .6 2.2 Definition of probability spaces and random variables . .7 2.2.1 Probability space . .7 2.2.2 Random variables . .7 2.2.3 Notation for events . .8 2.2.4 Conditioning on events . .8 2.3 Probability theory with discrete random variables . .9 2.3.1 Discrete random variables . .9 2.3.2 Marginals and conditional distributions . .9 2.3.3 Special distributions . 10 2.3.4 Independence and Markov chains . 10 2.3.5 Functions of random variables, expectation values, and Jensen's in- equality . 10 2.3.6 Trace distance . 11 2.3.7 I.i.d. distributions and the law of large numbers . 12 2.3.8 Channels . 13 3 Information Theory 15 3.1 Quantifying information . 15 3.1.1 Approaches to define information and entropy . 15 3.1.2 Entropy of events . 16 3.1.3 Entropy of random variables . 17 3.1.4 Conditional entropy . 18 3.1.5 Mutual information . 19 3.1.6 Smooth min- and max- entropies . 20 3.1.7 Shannon entropy as a special case of min- and max-entropy . 20 3.2 An example application: channel coding . 21 3.2.1 Definition of the problem . 21 3.2.2 The general channel coding theorem . 21 3.2.3 Channel coding for i.i.d. channels . 24 3.2.4 The converse . 24 2 4 Quantum States and Operations 26 4.1 Preliminaries .
    [Show full text]
  • Quantum Channels and Operations
    1 Quantum information theory (20110401) Lecturer: Jens Eisert Chapter 4: Quantum channels 2 Contents 4 Quantum channels 5 4.1 General quantum state transformations . .5 4.2 Complete positivity . .5 4.2.1 Choi-Jamiolkowski isomorphism . .7 4.2.2 Kraus’ theorem and Stinespring dilations . .9 4.2.3 Disturbance versus information gain . 10 4.3 Local operations and classical communication . 11 4.3.1 One-local operations . 11 4.3.2 General local quantum operations . 12 3 4 CONTENTS Chapter 4 Quantum channels 4.1 General quantum state transformations What is the most general transformation of a quantum state that one can perform? One might wonder what this question is supposed to mean: We already know of uni- tary Schrodinger¨ evolution generated by some Hamiltonian H. We also know of the measurement postulate that alters a state upon measurement. So what is the question supposed to mean? In fact, this change in mindset we have already encountered above, when we thought of unitary operations. Of course, one can interpret this a-posteriori as being generated by some Hamiltonian, but this is not so much the point. The em- phasis here is on what can be done, what unitary state transformations are possible. It is the purpose of this chapter to bring this mindset to a completion, and to ask what kind of state transformations are generally possible in quantum mechanics. There is an abstract, mathematically minded approach to the question, introducing notions of complete positivity. Contrasting this, one can think of putting ingredients of unitary evolution and measurement together.
    [Show full text]
  • Quantum Mechanics Quick Reference
    Quantum Mechanics Quick Reference Ivan Damg˚ard This note presents a few basic concepts and notation from quantum mechanics in very condensed form. It is not the intention that you should read all of it immediately. The text is meant as a source from where you can quickly recontruct the meaning of some of the basic concepts, without having to trace through too much material. 1 The Postulates Quantum mechanics is a mathematical model of the physical world, which is built on a small number of postulates, each of which makes a particular claim on how the model reflects physical reality. All the rest of the theory can be derived from these postulates, but the postulates themselves cannot be proved in the mathematical sense. One has to make experiments and see whether the predictions of the theory are confirmed. To understand what the postulates say about computation, it is useful to think of how we describe classical computation. One way to do this is to say that a computer can be in one of a number of possible states, represented by all internal registers etc. We then do a number of operations, each of which takes the computer from one state to another, where the sequence of operations is determined by the input. And finally we get a result by observing which state we have arrived in. In this context, the postulates say that a classical computer is a special case of some- thing more general, namely a quantum computer. More concretely they say how one describes the states a quantum computer can be in, which operations we can do, and finally how we can read out the result.
    [Show full text]
  • Partial Traces and Entropy Inequalities Rajendra Bhatia Indian Statistical Institute, 7, S.J.S
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Linear Algebra and its Applications 370 (2003) 125–132 www.elsevier.com/locate/laa Partial traces and entropy inequalities Rajendra Bhatia Indian Statistical Institute, 7, S.J.S. Sansanwal Marg, New Delhi 110016, India Received 8 May 2002; accepted 5 January 2003 Submitted by C. Davis Abstract The partial trace operation and the strong subadditivity property of entropy in quantum mechanics are explained in linear algebra terms. © 2003 Elsevier Inc. All rights reserved. Keywords: Quantum entropy; Density matrix; Partial trace; Operator convexity; Matrix inequalities 1. Introduction There is an interesting matrix operation called partial trace in the physics lit- erature. The first goal of this short expository paper is to connect this operation to others more familiar to linear algebraists. The second goal is to use this connection to present a slightly simpler proof of an important theorem of Lieb and Ruskai [11,12] called the strong subadditivity (SSA) of quantum-mechanical entropy. Closely re- lated to SSA, and a crucial ingredient in one of its proofs, is another theorem of Lieb [9] called Lieb’s concavity theorem (solution of the Wigner–Yanase–Dyson conjecture). An interesting alternate formulation and proof of this latter theorem appeared in a paper of Ando [1] well-known to linear algebraists. Discussion of the Lieb–Ruskai theorem, however, seems to have remained confined to the physics literature. We believe there is much of interest here for others as well, particularly for those interested in linear algebra.
    [Show full text]
  • Quantum Computation Lecture Notes
    Quantum Computation Lecture Notes Jyh Ying Peng (based on lecture notes by John Preskill 1998) Summer 2003 2 Chapter 1 Quantum States and Ensembles 1.1 Axioms of Quantum Mechanics Quantum theory (as all physical theories) can be characterized by how it represents (physical) states, observables, measurements, and dynamics (evo- lution in time). 1. States. A state is a complete description of a physical system. In quantum mechanics, a state is a ray in a Hilbert space. A Hilbert space is a vector space over the field of complex numbers C, with vectors denoted by |ψi (Dirac’s ket notation). A ket |ψi is represented as a n × 1 matrix (or a n element vector), where n is the dimension of the Hilbert space, and its corresponding bra hψ| is the transpose conjugate of the ket. It has an inner product hψ|ϕi (may be seen as matrix multiplication) that maps an ordered pair of vectors to C, with the properties: (a) Positivity. hψ|ψi > 0 for |ψi= 6 0 (b) Linearity. hϕ| (a|ψ1i + b|ψ2i) = ahϕ|ψ1i + bhϕ|ψ2i (c) Skew symmetry. hϕ|ψi = hψ|ϕi∗ 1 The Hilbert space is complete in the norm kψk = hψ|ψi 2 . A ray in a Hilbert space is an equivalent class of vectors that differ by multiplication by a nonzero complex scalar. That is, a ray is rep- resented by a given vector and all its (complex) multiples, and two different rays treated as vectors will never be “parallel” (in the Hilbert space). We represent rays (the equivalent classes) by a vector with unit norm hψ|ψi = 1, so eiα|ψi (where α ∈ <) represent the same physical 3 4 CHAPTER 1.
    [Show full text]
  • Algebraic Approach to Quantum Theory: a Finite-Dimensional Guide
    Algebraic approach to quantum theory: a finite-dimensional guide Cédric Bény Florian Richter arXiv:1505.03106v8 [quant-ph] 26 May 2020 Contents 1 Introduction 1 2 States and effects 1 2.1 Basic Quantum Mechanics . .1 2.2 Positive operators . .1 2.3 Generalized States . .3 2.3.1 Ensembles . .3 2.3.2 Tensor Product and Reduced States . .5 2.3.3 Density Operator/Matrix . .7 2.3.4 Example: Entanglement . .8 2.3.5 Example: Bloch Sphere . .9 2.4 Generalised propositions . .9 2.5 Abstract State/Effect Formalism . 10 2.5.1 Example: Quantum Theory . 11 2.5.2 Example: Classical Probability Theory . 11 2.6 Algebraic Formulation . 12 2.6.1 Example: Quantum Theory . 13 2.6.2 Example: Classical Probability Theory . 14 2.6.3 Example: C∗-Algebra . 15 2.7 Structure of matrix algebras . 15 2.8 Hybrid quantum/classical systems . 21 3 Channels 23 3.1 Schrödinger Picture . 24 3.1.1 Example: Partial Transpose . 25 3.1.2 Example: Homomorphisms . 25 3.1.3 Example: Mixture of unitaries . 26 3.1.4 Example: Classical channels . 26 3.2 Heisenberg Picture . 26 3.3 Observables as channels with classical output . 28 3.3.1 Example: Observable associated with an effect . 28 3.3.2 Coarse-graining of observables . 30 3.3.3 Accessible observables . 31 3.4 Stinespring dilation . 32 1 Introduction This document is meant as a pedagogical introduction to the modern language used to talk about quantum theories, especially in the field of quantum information. It assumes that the reader has taken a first traditional course on quantum mechanics, and is familiar with the concept of Hilbert space and elementary linear algebra.
    [Show full text]
  • Lecture Notes
    Lecture Notes Quantum Cryptography Week 1: Quantum tools and a first protocol cbea This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Licence. Contents 1.1 Probability notation3 1.2 Density matrices3 1.2.1 Properties.......................................................6 1.2.2 One qubit mixed states............................................6 1.3 Combining density matrices7 1.4 Classical-quantum states8 1.5 General measurements9 1.6 The partial trace 12 1.7 Secure message transmission 14 1.7.1 Shannon’s secrecy condition and the need for large keys................ 15 1.8 The (quantum) one-time pad 16 1.8.1 The classical one-time pad........................................ 16 1.8.2 The quantum one-time pad....................................... 17 1.1 Probability notation 3 In this course you will learn about the basics of quantum communication and quantum cryp- tography. Unlike large scale quantum computers, both technologies are already implemented today. Yet it remains a grand challenge to do quantum communication and cryptography over long distances. This week we will learn about a very simple quantum protocol: we will encrypt quantum states! Yet, to prepare our entry into quantum communication and cryptography, we first need to learn a little more about quantum information. 1.1 Probability notation Before we start we recall standard notation of classical probability theory which we use throughout the lecture notes. There are many good textbooks and online resources on probability theory available, such as [kelly1994introduction, ross2010first], and we refer you to any of them for additional background. Consider a discrete random variable X taking values in some alphabet X of size n.
    [Show full text]
  • Lecture Notes for Ph219/CS219: Quantum Information Chapter 2
    Lecture Notes for Ph219/CS219: Quantum Information Chapter 2 John Preskill California Institute of Technology Updated July 2015 Contents 2 Foundations I: States and Ensembles 3 2.1 Axioms of quantum mechanics 3 2.2 The Qubit 7 1 2.2.1 Spin- 2 8 2.2.2 Photon polarizations 14 2.3 The density operator 16 2.3.1 The bipartite quantum system 16 2.3.2 Bloch sphere 21 2.4 Schmidt decomposition 23 2.4.1 Entanglement 25 2.5 Ambiguity of the ensemble interpretation 26 2.5.1 Convexity 26 2.5.2 Ensemble preparation 28 2.5.3 Faster than light? 30 2.5.4 Quantum erasure 31 2.5.5 The HJW theorem 34 2.6 How far apart are two quantum states? 36 2.6.1 Fidelity and Uhlmann's theorem 36 2.6.2 Relations among distance measures 38 2.7 Summary 41 2.8 Exercises 43 2 2 Foundations I: States and Ensembles 2.1 Axioms of quantum mechanics In this chapter and the next we develop the theory of open quantum systems. We say a system is open if it is imperfectly isolated, and therefore exchanges energy and information with its unobserved environment. The motivation for studying open systems is that all realistic systems are open. Physicists and engineers may try hard to isolate quantum systems, but they never completely succeed. Though our main interest is in open systems we will begin by recalling the theory of closed quantum systems, which are perfectly isolated. To understand the behavior of an open system S, we will regard S combined with its environment E as a closed system (the whole \universe"), then ask how S behaves when we are able to observe S but not E.
    [Show full text]
  • Mathematical Introduction to Quantum Information Processing
    Mathematical Introduction to Quantum Information Processing (growing lecture notes, SS2019) Michael M. Wolf June 22, 2019 Contents 1 Mathematical framework 5 1.1 Hilbert spaces . .5 1.2 Bounded Operators . .8 Ideals of operators . 10 Convergence of operators . 11 Functional calculus . 12 1.3 Probabilistic structure of Quantum Theory . 14 Preparation . 15 Measurements . 17 Probabilities . 18 Observables and expectation values . 20 1.4 Convexity . 22 Convex sets and extreme points . 22 Mixtures of states . 23 Majorization . 24 Convex functionals . 26 Entropy . 28 1.5 Composite systems and tensor products . 29 Direct sums . 29 Tensor products . 29 Partial trace . 34 Composite and reduced systems . 35 Entropic quantities . 37 1.6 Quantum channels and operations . 38 Schrödinger & Heisenberg picture . 38 Kraus representation and environment . 42 Choi-matrices . 45 Instruments . 47 Commuting dilations . 48 1.7 Unbounded operators and spectral measures . 51 2 Basic trade-offs 53 2.1 Uncertainty relations . 53 Variance-based preparation uncertainty relations . 54 Joint measurability . 55 2 CONTENTS 3 2.2 Information-disturbance . 56 No information without disturbance . 56 2.3 Time-energy . 58 Mandelstam-Tamm inequalities . 58 Evolution to orthogonal states . 59 These are (incomplete but hopefully growing) lecture notes of a course taught in summer 2019 at the department of mathematics at the Technical University of Munich. 4 CONTENTS Chapter 1 Mathematical framework 1.1 Hilbert spaces This section will briefly summarize relevant concepts and properties of Hilbert spaces. A complex Hilbert space is a vector space over the complex numbers, equipped with an inner product h·; ·i : H × H ! C and an induced norm k k := h ; i1=2 w.r.t.
    [Show full text]
  • Geometry of Quantum States
    Geometry of Quantum States Ingemar Bengtsson and Karol Zyczk_ owski An Introduction to Quantum Entanglement 12 Density matrices and entropies A given object of study cannot always be assigned a unique value, its \entropy". It may have many different entropies, each one worthwhile. |Harold Grad In quantum mechanics, the von Neumann entropy S(ρ) = Trρ ln ρ (12.1) − plays a role analogous to that played by the Shannon entropy in classical prob- ability theory. They are both functionals of the state, they are both monotone under a relevant kind of mapping, and they can be singled out uniquely by natural requirements. In section 2.2 we recounted the well known anecdote ac- cording to which von Neumann helped to christen Shannon's entropy. Indeed von Neumann's entropy is older than Shannon's, and it reduces to the Shan- non entropy for diagonal density matrices. But in general the von Neumann entropy is a subtler object than its classical counterpart. So is the quantum relative entropy, that depends on two density matrices that perhaps cannot be diagonalized at the same time. Quantum theory is a non{commutative proba- bility theory. Nevertheless, as a rule of thumb we can pass between the classical discrete, classical continuous, and quantum cases by choosing between sums, integrals, and traces. While this rule of thumb has to be used cautiously, it will give us quantum counterparts of most of the concepts introduced in chapter 2, and conversely we can recover chapter 2 by restricting the matrices of this chapter to be diagonal. 12.1 Ordering operators The study of quantum entropy is to a large extent a study in inequalities, and this is where we begin.
    [Show full text]
  • Computing Partial Traces and Reduced Density Matrices
    Computing partial traces and reduced density matrices Jonas Maziero1,2, ∗ 1Departamento de Física, Centro de Ciências Naturais e Exatas, Universidade Federal de Santa Maria, Avenida Roraima 1000, 97105-900, Santa Maria, RS, Brazil 2Instituto de Física, Facultad de Ingeniería, Universidad de la República, J. Herrera y Reissig 565, 11300, Montevideo, Uruguay Taking partial traces for computing reduced density matrices, or related functions, is a ubiquitous procedure in the quantum mechanics of composite systems. In this article, we present a thorough description of this function and analyze the number of elementary operations (ops) needed, under some possible alternative implementations, to compute it on a classical computer. As we notice, it is worthwhile doing some analytical developments in order to avoid making null multiplications and sums, what can considerably reduce the ops. For instance, for a bipartite system Ha ⊗ Hb with dimensions da = dim Ha and db = dim Hb and for da,db ≫ 1, while a direct use of partial trace 6 6 2 definition applied to Hb requires O(dadb ) ops, its optimized implementation entails O(dadb) ops. In the sequence, we regard the computation of partial traces for general multipartite systems and describe Fortran code provided to implement it numerically. We also consider the calculation of reduced density matrices via Bloch’s parametrization with generalized Gell Mann’s matrices. Keywords: quantum mechanics, composite systems, partial trace, reduced density matrix, Bloch parametriza- tion, Gell Mann matrices I. INTRODUCTION When calculating certain functions of quantum systems, in many instances the running time of classical computers increases exponentially with the number of elementary parts that compose those systems.
    [Show full text]