Chapman University Chapman University Digital Commons Mathematics, Physics, and Computer Science Science and Technology Faculty Articles and Faculty Articles and Research Research
2013 Towards a Formulation of Quantum Theory as a Causally Neutral Theory of Bayesian Inference Matthew .S Leifer Chapman University, [email protected]
Robert W. Spekkens Perimeter Institute for Theoretical Physics
Follow this and additional works at: http://digitalcommons.chapman.edu/scs_articles Part of the Quantum Physics Commons
Recommended Citation Leifer, M.S., Spekkens, R.W., 2013. Towards a formulation of quantum theory as a causally neutral theory of Bayesian inference. Phys. Rev. A 88, 052130. doi:10.1103/PhysRevA.88.052130
This Article is brought to you for free and open access by the Science and Technology Faculty Articles and Research at Chapman University Digital Commons. It has been accepted for inclusion in Mathematics, Physics, and Computer Science Faculty Articles and Research by an authorized administrator of Chapman University Digital Commons. For more information, please contact [email protected]. Towards a Formulation of Quantum Theory as a Causally Neutral Theory of Bayesian Inference
Comments This article was originally published in Physical Review A, volume 88, in 2013. DOI: 10.1103/ PhysRevA.88.052130
Copyright American Physical Society
This article is available at Chapman University Digital Commons: http://digitalcommons.chapman.edu/scs_articles/522 PHYSICAL REVIEW A 88, 052130 (2013)
Towards a formulation of quantum theory as a causally neutral theory of Bayesian inference
M. S. Leifer1,2,* and Robert W. Spekkens2,† 1Department of Physics and Astronomy, University College London, Gower Street, London WC1E 6BT, United Kingdom 2Perimeter Institute for Theoretical Physics, 31 Caroline Street North, Waterloo, Ontario, Canada N2L 2Y5 (Received 19 June 2012; revised manuscript received 30 October 2013; published 27 November 2013) Quantum theory can be viewed as a generalization of classical probability theory, but the analogy as it has been developed so far is not complete. Whereas the manner in which inferences are made in classical probability theory is independent of the causal relation that holds between the conditioned variable and the conditioning variable, in the conventional quantum formalism, there is a significant difference between how one treats experiments involving two systems at a single time and those involving a single system at two times. In this article, we develop the formalism of quantum conditional states, which provides a unified description of these two sorts of experiment. In addition, concepts that are distinct in the conventional formalism become unified: Channels, sets of states, and positive operator valued measures are all seen to be instances of conditional states; the action of a channel on a state, ensemble averaging, the Born rule, the composition of channels, and nonselective state-update rules are all seen to be instances of belief propagation. Using a quantum generalization of Bayes’ theorem and the associated notion of Bayesian conditioning, we also show that the remote steering of quantum states can be described within our formalism as a mere updating of beliefs about one system given new information about another, and retrodictive inferences can be expressed using the same belief propagation rule as is used for predictive inferences. Finally, we show that previous arguments for interpreting the projection postulate as a quantum generalization of Bayesian conditioning are based on a misleading analogy and that it is best understood as a combination of belief propagation (corresponding to the nonselective state-update map) and conditioning on the measurement outcome.
DOI: 10.1103/PhysRevA.88.052130 PACS number(s): 03.65.Ca, 03.65.Ta, 03.67.−a
I. INTRODUCTION define a joint probability distribution over the input and output variables. This joint probability distribution could equally Quantum theory can be understood as a noncommutative well be used to describe two spacelike separated variables. generalization of classical probability theory wherein prob- Therefore, we do not need to know how the variables are ability measures are replaced by density operators. Much embedded in space-time in advance in order to apply classical of quantum information theory, especially quantum Shannon theory, can be viewed as the systematic application of this probability theory. This has the advantage that it cleanly generalization of probability theory to information theory. separates the concept of correlation from that of causation. However, despite the power of this point of view, the The former is the proper subject of probabilistic inference conventional formalism for quantum theory is a poor analog and statistics. Within the subjective Bayesian approach to to classical probability theory because, in quantum theory, probability, independence of inference and causality has been the appropriate mathematical description of an experiment emphasized by de Finetti ([1], Preface pp. x–xi): “Probabilistic reasoning—always to be understood as depends on its causal structure. For example, experiments subjective—merely stems from our being uncertain about involving a pair of systems at spacelike separation are something. It makes no difference whether the uncertainty described differently from those that involve a single system at relates to an unforeseeable future, or to an unnoticed past, or two different times. The former are described by a joint state to a past doubtfully reported or forgotten; it may even relate to on the tensor product of two Hilbert spaces, and the latter are something more or less knowable (by means of a computation, described by an input state and a dynamical map on a single a logical deduction, etc.) but for which we are not willing to Hilbert space. Classical probability works at a more abstract make the effort; and so on.” level than this. It specifies how to represent uncertainty prior Thus, in order to build a quantum theory of Bayesian to, and independently of, causal structure. For example, our inference, we need a formalism that is even handed in uncertainty about two random variables is always described its treatment of different causal scenarios. There are some by a joint probability distribution, regardless of whether the clues that this might be possible. Several authors have noted variables represent two spacelike separated systems or the that there are close connections, and often isomorphisms, input and output of a classical channel. Although channels between the statistics that can be obtained from quantum represent time evolution, they are described mathematically by experiments with distinct causal arrangements [2–8]. Time- conditional probability distributions. The input state specifies reversal symmetry is an example of this, but it is also possible to a marginal distribution, and thus we have the ingredients to relate experiments involving two systems at the same time with those involving a single system at two times. The equivalence [9] of prepare-and-measure [10] and entanglement-based [11] *[email protected] quantum key distribution protocols is an example of this and †[email protected] provides the basis for proofs of the security of the former [12].
1050-2947/2013/88(5)/052130(38)052130-1 ©2013 American Physical Society M. S. LEIFER AND ROBERT W. SPEKKENS PHYSICAL REVIEW A 88, 052130 (2013)
Such equivalences suggest that it may be possible to obtain a HAB = HA ⊗ HB . In contrast to the usual formalism, this causally neutral formalism for quantum theory by describing applies regardless of whether A and B describe independent such isomorphic experiments by similar mathematical objects. systems or the same system at two different times. Because Among the main goals of this work are to provide this of this, if an experiment involves a system that does persist unification for the case of experiments involving two distinct through time, then a different label is given to each region quantum systems at one time and those involving a single it inhabits; e.g., the input and output spaces for a quantum quantum system at two times and to provide a framework channel are assigned different labels. for making probabilistic inferences that is independent of this Although we have motivated our work by the distinction causal structure. Both types of experiment can be described between spatial and temporal separation, in fact it is not the by operators on a tensor product of Hilbert spaces, differing spatiotemporal relation between the regions that is relevant from one another only by a partial transpose. Probabilistic for how they ought to be represented in our quantum inference is achieved using a quantum generalization of generalization of probability theory. Rather, it is the causal Bayesian conditioning applied to quantum conditional states, relation that holds between them which is important. which are the main objects of study of this work. More precisely, what is important is the distinction between Quantum conditional states are a generalization of classical two regions that are causally related, which is to say that one conditional probability distributions. Conditional probability has a causal influence on the other (perhaps via intermediaries), plays a key role in classical probability theory, not least due and two regions that are acausally related, which is to say that to its role in Bayesian inference, and there have been attempts neither has a causal influence on the other (although they may to generalize it to the quantum case. The most relevant to have a common cause or a common effect or be connected via quantum information are perhaps the quantum conditional intermediaries to a common cause or a common effect). expectation [13](see[14,15] for a basic introduction and [16] The causal relation between a pair of regions cannot be for a review) and the Cerf-Adami conditional density operator inferred simply from their spatiotemporal relation. Consider [17–19]. To date, these have not seen widespread application a relativistic quantum theory for instance. Although a pair in quantum information, which casts some doubt on whether of regions that are spacelike separated are always acausally they are really the most useful generalization of conditional related, a pair of regions that are timelike separated can be probability from the point of view of practical applications. related causally, for instance if they constitute the input and Quantum conditional states, which have previously appeared the output of a channel, or they can be related acausally, for in [4,20,21], provide an alternative approach to this problem. instance if they constitute the input of one channel and the We show that they are useful for drawing out the analogies output of another. Although timelike separation implies that a between classical probability and quantum theory, they can causal connection is possible, it is whether such a connection be used to describe both spacelike and timelike correlations, actually holds that is relevant in our formalism. The distinction and they unify concepts that look distinct in the conventional can also be made in nonrelativistic theories and in theories formalism. with exotic causal structure. Indeed, causal structure is a more The remainder of the introduction summarizes the contents primitive notion than spatiotemporal structure, and it is all that of this article. It is meant to provide a broad overview we need here. of the conditional states formalism, its motivations, and Typically, we confine our attention to two paradigmatic its applications, while introducing only a minimum of the examples of causal and acausal separation (which can be technical details found in the rest of the paper. formulated in either a relativistic or a nonrelativistic quantum theory). Two distinct regions at the same time, the correlations between which are conventionally described by a bipartite A. Irrelevance of causal structure to the rules of inference quantum state, are acausally related. The regions at the input Unifying the quantum description of experiments involving and output of a quantum channel, the correlations between two distinct systems at one time with the description of those which are conventionally described by an input state and involving a single system at two distinct times requires some a quantum channel, are causally related (although there are modifications to the way that the Hilbert space formalism of exceptions, such as a channel which erases the state of the quantum theory is usually set up. Conventionally, a Hilbert system and then reprepares it in a fixed state).1 space HA describes a system, labeled A, that persists through time. Given two such systems, A and B, the joint system is described by the tensor product H = H ⊗ H .Inthe AB A B 1Although it is not required here, one can be more precise about this present work, a Hilbert space and its associated label should distinction as follows. A causal structure for a set of quantum regions rather be thought of as representing a localized region of is represented by a directed acyclic graph wherein the nodes are the space-time. Specifically, an elementary region is a small regions and the directed edges are relations of causal dependence space-time region in which an agent might possibly make a (the restriction to acyclic graphs prohibits causal loops). Two regions single intervention in the course of an experiment, for example are said to be causally related if for all paths connecting one to the by making a measurement or by preparing a specific state. other in the graph every edge along the path is directed in the same Each elementary region is associated with a label and a Hilbert sense. Two systems are said to be acausally related if for all paths space, for instance, A and HA. connecting one to the other not every edge along the path is directed Generally, a region refers to a collection of elementary in the same sense. When there exist both sorts of paths between a regions. A region that is composed of a pair of disjoint regions, pair of nodes, the associated regions are neither purely causally nor labeled A and B, is ascribed the tensor product Hilbert space purely acausally related. We do not consider this case in the article.
052130-2 TOWARDS A FORMULATION OF QUANTUM THEORY AS A ... PHYSICAL REVIEW A 88, 052130 (2013)
TABLE I. Analogies between the classical theory of Bayesian inference and the conditional states formalism for quantum theory.
Classical Quantum
State P (R) ρA Joint state P (R,S) σAB = = Marginalization P (S) R P (R,S) ρB TrA(σAB ) | Conditional state P (S R) σB|A | = = S P (S R) 1TrB (σB|A) IA
Relation between joint and P (R,S) = P (S|R)P (R) σAB = σB|A ρA | = = −1 conditional states P (S R) P (R,S)/P (R) σB|A σAB ρA −1 Bayes’ theorem P (R|S) = P (S|R)P (R)/P (S) σ | = σ | (ρ ρ ) A B B A A B = | = Belief propagation P (S) R P (S R)P (R) ρB TrA(σB|AρA)
We unify the description of Bayesian inference in the We refer to this map from ρA to ρB as belief two different causal scenarios in the sense that various propagation.2 formulas are shown to have precisely the same form, in particular the relation between joints and conditionals, the C. Relevance of causal structure to the form of the state formula for Bayesian inversion, and the formula for belief propagation. In the case of acausally related regions, it is the joint state that is easily inferred from the conventional formalism, and the conditional state that is derived from the joint. Specifically, if B. Basic elements of the formalism A and B are acausally related, then their joint state, σAB ,is simply the bipartite state that one would assign to them in Without providing all the details, we summarize the the conventional formalism. Consequently, σ is a positive analogs, within our formalism, of the most basic elements AB operator in this case. The conditional state can be inferred of classical probability theory. These are presented in from the rule relating joints to conditionals, namely, σ | = Table I. B A σ ρ . It follows that σ | is also a positive operator. For an elementary region A, the quantum analog of a AB A B A On the other hand, if A and B are causally related, then normalized probability distribution is a conventional quantum it is the conditional state that is easily inferred from the state ρ , that is, a positive trace-one operator on H .For A A conventional formalism and the joint state that is derivative. aregionAB, composed of two disjoint elementary regions, Specifically, if the regions are related by a quantum operation the analog of a joint probability distribution is a trace-one EB|A, then σB|A is defined as the operator on HA ⊗ HB that is operator σAB on HAB . This operator is not always positive Jamiołkowski isomorphic to E | [22]. The joint state is then (but we nonetheless refer to it as a state). The marginal- B A inferred from the rule relating joints to conditionals. One can ization operation is replaced by the partial trace operation, show that both σ | and σ fail to be positive in general, but Tr , which corresponds to ignoring region A. The role of B A AB A they have positive partial transpose. the marginal distribution is played by the marginal state Because of this, the set of permissible joint and conditional ρ = Tr (σ ). B A AB states for acausally related regions is different from the set for The quantum analog of a conditional probability is a causally related regions. To distinguish the two cases, we use conditional state for region B given region A.Thisisan the notation ρAB and ρB|A for the acausal case and AB and operator on HAB , denoted σB|A, that satisfies TrB (σB|A) = IA. | for the causal case. The relation between a conditional state and a joint state B A −1 It is important to note that in a classical theory of Bayesian is σ | = σ ρ , where the product is a particular B A AB A inference, the rules of inference are independent of the causal noncommutative and nonassociative product, defined by relations that hold among the variables. The causal relations MN≡ N 1/2MN1/2, where we have adopted the conven- can still be relevant, however, for constraining the probability tion of dropping identity operators and tensor products, so − distribution that is assigned to those variables. For instance, σ ρ−1 σ ρ−1 ⊗ I = ρ 1/2 ⊗ that AB A is shorthand for AB ( A B ) ( A the causal relations among a triple of variables are significant −1/2 ⊗ IB )σAB (ρA IB ). for the sort of probability distribution that can be assigned to This relation implies that the quantum analog of Bayes’ them. Specifically, if variable R is a common cause of variables = −1 theorem, relating σB|A and σA|B ,isσA|B σB|A (ρAρB ). S and T , while there is no direct causal connection between S A standard example of inference then proceeds as follows. and T , then S and T should be conditionally independent Suppose a conditional state σB|A represents your beliefs about given R, which is to say that the joint distribution over the relation that holds between a pair of elementary regions. In this case, if you represent your beliefs about A by the quantum state ρA, then you must represent your beliefs about B by the quantum state ρB , where 2Note that the term “belief propagation” has also been used to describe message-passing algorithms for performing inference on ρB = TrA(σB|AρA). (1) Bayesian networks. This is not the intended meaning here.
052130-3 M. S. LEIFER AND ROBERT W. SPEKKENS PHYSICAL REVIEW A 88, 052130 (2013)
TABLE II. Translation of concepts and equations from conventional notation to the conditional states formalism.
Conventional notation Conditional states formalism
Probability distribution of XP(X) ρX Probability that X = xP(X = x) ρ = X x A Set of states on A ρx A|X A Individual state on Aρ x A|X=x A POVM on A Ey Y |A A Individual effect on AEy Y =y|A E Channel from A to B B|A B|A E B|A Instrument y YB|A E B|A Individual operation y Y =y,B|A ∀ = = A = The Born rule y : P (Y y) TrA Ey ρA ρY TrA(Y |AρA) = = A = Ensemble averaging ρA x P (X x)ρx ρA TrX(A|XρX) Action of a channel (Schrodinger)¨ ρB = EB|A(ρA) ρB = TrA(B|AρA) Composition of channels EC|A = EC|B ◦ EB|A C|A = TrB (C|B B|A) A = E † B = Action of a channel (Heisenberg) Ey ( B|A) (Ey ) Y |A TrB (Y |B B|A) ∀ = B = E B|A = Nonselective state-update rule y : P (Y y)ρy y (ρA) ρYB TrA(YB|AρA) these variables is not arbitrary, but has the form P (R,S,T ) = region is also restricted to have this diagonal form. It follows P (S|R)P (T |R)P (R). that for a set of regions that are all classical, the formalism In the quantum case, the situation is similar. The rules reproduces the classical theory of Bayesian inference. of inference, such as the formula for belief propagation, the The formalism also yields a new and unified perspective formula for Bayesian inversion, and the relation between the on many notions in quantum theory. To see this, it is useful joint and the conditional, do not depend on the causal relations to recall that measurements, sets of state preparations, and between the regions under consideration, but causal relations transformations can all be represented by quantum operations, do constrain the set of operators that can describe joint states. that is, as completely positive trace-preserving (CPT) linear In fact, the dependence is stronger in the quantum case maps. Channels are CPT maps wherein the input and output because the set of permissible states depends on the causal spaces are both quantum. A positive operator valued measure relation even for a pair of regions. This is not a feature of a (POVM) is a CPT map from a quantum input to a classical classical theory of inference: If we consider all the possible output (the measurement outcome). A set of states is a CPT joint distributions over a pair of variables, R and S, we find map from a classical input (the state index) to a quantum output that the set of possibilities is the same for the case where R (the associated state). Finally, a quantum instrument, which and S are causally related as it is for the case where R and S is a measurement together with a state-update rule for every are acausally related. outcome, can be represented as a CPT map from a quantum To reiterate, the fact that the set of possible states that can input to a composite output with a quantum part (the updated be assigned to a set of regions is constrained by the causal state) and a classical part (the measurement outcome). Insofar relation between those regions is common to the classical and as every CPT map defines a conditional state, each of these quantum theories of inference. What is particular to the theory notions in quantum theory is an instance of a conditional state of quantum inference is that even in the case of a pair of in our formalism. This is summarized in the top half of Table II. regions, the causal relation between the regions is relevant for It follows that many relations that seem unrelated in the the set of possible states that can be assigned to those regions.3 conventional formalism all become instances of the belief propagation rule in our formalism. This includes the Born D. Recasting conventional quantum notions in terms rule, the formula for calculating the average state for an of conditional states and belief propagation ensemble, the composition of channels, the state-update rule in a measurement, and the action of a channel in both the The conditional states formalism incorporates the possibil- Heisenberg and Schrodinger¨ pictures. This is summarized in ity that a given region is associated to a classical variable rather the bottom half of Table II. than a quantum system. In this case, the classical variable is represented by a Hilbert space with a preferred basis, where the different elements of the basis correspond to different values E. Applications of the formalism of the variable, and any state assigned to that region is diagonal The formalism also accommodates forms of belief prop- in that basis. Any joint state or conditional state involving this agation that do not fit into the standard list of the previous section. One example is the inference made about one system based 3There does exist a classical analog of this dependence on causal on the outcome of a measurement made on another when structure for pairs of regions, but it requires considering the case of a the two are correlated by virtue of a common cause. This classical theory with an epistemic restriction [23]. We do not pursue reproduces the remote collapse postulate of quantum theory, the analogy here. which is sometimes called “remote steering” of a quantum state
052130-4 TOWARDS A FORMULATION OF QUANTUM THEORY AS A ... PHYSICAL REVIEW A 88, 052130 (2013) and was made famous by the thought experiment of Einstein, projection postulate being viewed as an instance of Bayesian Podolsky, and Rosen. It follows that in the conditional states conditioning) [34,35] are based on a misleading analogy. framework, the steering effect is merely belief propagation Within the conditional states formalism, the projection postu- (updating beliefs about one system based on new evidence late is best described as the application of a belief propagation about another) and does not require any causal influence rule (a nonselective update map), followed by conditioning from one to the other. This interpretation has been advocated (the selection). This is broadly in line with the treatment previously by Fuchs [24]. Our formalism also provides an of quantum measurements advocated by Ozawa [36,37]. In elegant derivation of the formula for the set of ensembles to support of the argument that the projection postulate is not a which a remote system may be steered, previously obtained type of conditioning, we provide a conditional state version by conventional methods in [25]. of the argument that all informative measurements must be Another example of an unconventional form of belief disturbing, which may be of independent interest due to its propagation is retrodiction, that is, inferences about a region close relationship to entanglement monogamy. based on beliefs about another region in its future. We develop a retrodictive formalism using our quantum Bayes’ theorem. The latter is a necessary ingredient because the “givens” in F. Structure of the paper a retrodiction problem are typically the descriptions of sets The remainder of this paper is structured as follows. of state preparations, measurements, and channels, each of The relevant aspects of classical conditional probability are which corresponds to a conditional wherein the conditioning reviewed in Sec. II. Section III introduces quantum conditional system is to the past of the conditioned system. We use Bayes’ states and the basic concepts of quantum Bayesian inference theorem to invert each of these conditionals to ones wherein for a pair of regions. The distinction between conditional states the conditioning system is to the future of the conditioned for causally related and acausally related regions is discussed system. Then, one can use these conditionals to propagate here. This section also provides a detailed discussion of the one’s beliefs backwards in time, that is, to update one’s beliefs translations from the conventional formalism to the conditional about the past based on new evidence in the present. This states formalism that are highlighted in Table II. application of our formalism is a good example of how one Section IV introduces our quantum version of Bayes’ theo- can achieve causal neutrality: Belief propagation backward in rem and discusses its applications, in particular the connection time follows the same rules as belief propagation forward in with the update rule proposed by Fuchs, the correspondence time. The retrodictive formalism we devise coincides with the between POVMs and ensemble decompositions of a density one introduced in [26–28] in the case of unbiased sources, operator, the pretty-good measurement, and the Barnum-Knill but differs in the general case, retaining a closer analogy with recovery map. In Sec. IV C, we develop the retrodictive classical Bayesian inference. formalism for quantum theory and describe how it relates to the In the case where a quantum system is passed through one introduced in [26–28]. Finally, in Sec. IV D, the acausal a channel (possibly noisy), the Bayesian inversion of the analog of the symmetry between prediction and retrodiction is conditional associated to this channel, when interpreted as discussed in the context of remote measurement. a quantum operation itself, is the Barnum-Knill approximate Section V discusses quantum Bayesian conditioning. After error correction map [29]. It follows that this error correction a brief discussion of the general problem of conditioning a scheme is the quantum analog of the following classical error quantum region on another quantum region, we focus on correction scheme: Based on a channel’s output, compute a conditioning a quantum region on a classical variable. This posterior distribution over inputs (i.e., classical retrodiction) is the correct way to update quantum states in light of classical and then sample from the latter. data, regardless of the causal relationship between the two. In the case where a quantum system is prepared in one Various examples of this are discussed in Sec. VA, including of a set of states, the Bayesian inversion of the conditional the case of the remote steering phenomenon. Section VB associated to this set of states (a “quantum given classical” concerns how to understand within our formalism the rules conditional) is a conditional associated to a measurement (a for updating quantum states after a nondestructive quantum “classical given quantum” conditional). Indeed, we find that measurement, in particular, how to understand the projection in these contexts, our quantum Bayes’ theorem reproduces the postulate. well-known rule relating sets of states to POVMs [3,4,30]. In Sec. VI, we discuss related work. Quantum conditional The POVM obtained as the Bayesian inversion of an ensemble states are compared to other proposals for quantum generaliza- of states turns out to be the “pretty-good” measurement for tions of conditional probability in Sec. VI A and the conditional distinguishing those states [31–33]. Therefore, the latter, like states formalism is compared to several recently proposed the Barnum-Knill recovery operation, can be understood as a operational reformulations of quantum theory in Sec. VI B. quantum analog of sampling from the posterior. Section VII discusses limitations of the conditional states Similarly, the Bayesian inversion of the conditional asso- framework. These arise because the classical operation of ciated with a measurement is a conditional associated with taking the product of a conditional and a marginal prob- a set of states. For this case, our quantum Bayes’ theorem ability distribution to form a joint distribution is replaced reproduces a rule proposed by Fuchs as a quantum analog of by a noncommutative and nonassociative operation on the Bayes’ theorem [24]. corresponding operators in the quantum case. Because of Finally, we show that our notion of conditioning does this, unlike in classical probability, an equation involving not include the projection postulate as a special case and conditional states does not necessarily remain valid when both that previous arguments to the contrary (i.e., in favor of the sides are conditionalized on an additional variable. This is
052130-5 M. S. LEIFER AND ROBERT W. SPEKKENS PHYSICAL REVIEW A 88, 052130 (2013) discussed in Sec. VII A. Section VII B discusses the reasons Finally, note that the process of marginalizing a distribution why causal joint states are limited to two elementary regions. over a set of variables commutes with the process of condition- Section VII B1 discusses why they cannot be applied to mixed ing on a disjoint set of variables, as illustrated in the following causal scenarios, such as two acausally related regions with commutative diagram: a third region causally related to one of the other two, and R Sec. VII B2 discusses the difficulties with generalizing the P (R,S,T⏐ ) −−−−−→ P (S,T⏐ ) notion to multiple time steps, where we have three or more ⏐ ⏐ ⏐ − ⏐ − causally related regions. ×P (T ) 1 ×P (T ) 1 (7) Section VIII discusses an open question about when | −−−−−R → | assignments of conditional states are compatible with one P (R,S T ) P (S T ). another. That not all conditional assignments are compatible can be shown via the monogamy of entanglement. Indeed, this incompatibility seems to be a more basic notion, of which III. QUANTUM CONDITIONAL STATES monogamy is a consequence. Finally, we conclude in Sec. IX. In this section, the quantum analog of conditional probability—a conditional state—is introduced. We also dis- II. CLASSICAL CONDITIONAL PROBABILITY cuss how the states assigned to disjoint regions are related via a quantum analog of the belief propagation rule P (S) = In this section, the basic definitions and formalism of | R P (S R)P (R). There is a small difference between condi- classical conditional probability are reviewed, with a view tional states for acausally related and causally related regions. to their quantum generalization in Sec. III. = The acausal case is discussed in Secs. III A and III B.Onthe Let R denote a (discrete) random variable, R r the event other hand, Secs. III C–III K mainly concern the causal case, that R takes the value r, P (R = r) the probability of event = wherein we find that quantum dynamics, ensemble averaging, R r, and P (R) the probability that R takes an arbitrary the Born rule, Heisenberg dynamics, and the transition from unspecified value. Finally, R denotes a sum over the possible the initial state to the ensemble of states resulting from values of R. a measurement can all be represented as special cases of A conditional probability distribution is a function of two | quantum belief propagation. Acausal analogies of some of random variables P (S R), such that for each value r of R, these ideas are also developed in these sections. P (S|R = r) is a probability distribution over S. Equivalently, it is a positive function of R and S such that A. Acausal conditional states | = P (S R) 1(2)We begin by defining conditional states for acausally S related regions. This scenario, and its classical analog, are independently of the value of R. depicted in Fig. 1. The definition proceeds in analogy with the Given a probability distribution P (R) and a conditional classical treatment given in Sec. II. The convention of using probability distribution P (S|R), a joint distribution over R A,B,C,... to label quantum regions that are analogous to and S can be defined via classical variables R,S,T,... is adopted throughout. The la- bels X,Y,Z, . . . are reserved for classical variables associated P (R,S) = P (S|R)P (R), (3) with preparations and measurements, which remain classical where the multiplication is defined elementwise, i.e., for all when we pass from probability theory to the quantum analog. values r,s of R and S, P (R = r,S = s) = P (S = s|R = r) The analog of a probability distribution P (R) assigned to P (R = r). a random variable R is a quantum state (density operator) Conversely, given a joint distribution P (R,S), the marginal distribution over R is defined as = P (R) P (R,S), (4) AB S and the conditional probability of S given R is (a) P (R,S) P (S|R) = . (5) P (R) Note that Eq. (5) only defines a conditional probability RS distribution for those values r of R such that P (R = r) = 0. The conditional probability is undefined for other values of R. (b) The chain rule for conditional probabilities states that a joint probability over n random variables R ,R ,...,R can FIG. 1. Acausally related quantum and classical regions. Classi- 1 2 n cal variables are denoted by triangles and quantum regions by circles be written as (this convention is suggested by the shape of the convex set of states in P (R1,R2,...,Rn) each theory). The dotted line represents acausal correlation. (a) Two quantum regions in an arbitrary joint state (possibly correlated). (b) = P (R |R ,R ,...,R − ) n 1 2 n 1 Two classical variables with an arbitrary joint probability distribution × | | P (Rn−1 R1,R2,...,Rn−2) ...P(R2 R1)P (R1). (6) (possibly correlated).
052130-6 TOWARDS A FORMULATION OF QUANTUM THEORY AS A ... PHYSICAL REVIEW A 88, 052130 (2013)
TABLE III. Analogies between classical probability theory for Starting with a joint state ρAB and its reduced state two random variables and quantum theory for two acausally related ρA = TrB (ρAB ), a conditional state can be defined via regions. −1 ρ | = ρ ρ , (12) Classical probability Quantum theory B A AB A
P (R) ρA which is the analog of Eq. (5). P (R,S) ρAB As with Eq. (5) there are problems with this formula if ρA P (S) = P (R,S) ρB = TrA(ρAB ) R is not supported on the entire Hilbert space HA. In that case | = = S P (S R) 1TrB (ρB|A) IA Eq. (12) is to be understood as an equation on the Hilbert = | = P (R,S) P (S R)P (R) ρAB ρB|A ρA space supp(ρA) ⊗ HB , where supp(ρA) denotes the support | = = −1 P (S R) P (R,S)/P (R) ρB|A ρAB ρA of ρA (the span of the eigenvectors of ρA having nonzero eigenvalues). Because of this, the resulting conditional density operator satisfies Tr (ρ | ) = I rather than Eq. (8). ρ acting on a Hilbert space H . When there are two B B A supp(ρA) A A The analogies between the classical and quantum relations disjoint regions with Hilbert spaces HA and HB , the tensor product H = H ⊗ H describes the composite region. between conditionals, marginals, and joints are set out in the AB A B The quantum analog of a joint distribution P (R,S) is a density bottom half of Table III. As these analogies suggest, the - product notation allows equations from classical probability operator ρAB of the composite region, defined on HAB .The analog of marginalization over a variable is the partial trace to be generalized to quantum theory by replacing functions over a region. These analogies are set out in the top half of with operators, products with products, and division with Table III. products with the inverse. However, while this is a useful way of postulating results in the conditional states formalism, one In analogy to the classical case, where P (S|R) is a positive function that satisfies P (S|R) = 1, an acausal conditional has to take care of the nonassociativity and noncommutativity S state for B given A is defined as follows. of the product when making such generalizations. Definition 1. An acausal conditional state for B given A is To provide an analogy with the chain rule of Eq. (6) it is helpful to adopt the convention that, in the absence a positive operator ρ | on H = H ⊗ H that satisfies B A AB A B of parentheses, products are evaluated right to left. Then, = TrB (ρB|A) IA, (8) given n disjoint acausally related regions A1,A2,...,An, with Hilbert space H = n H , the joint state can be where I is the identity operator on H . A1,A2,...,An j=1 Aj A A written as To provide an analogy with Eq. (3), a method of construct- ing a joint state on HAB from a reduced state on HA and a = | | conditional state on HAB is required. This is given by ρA1,A2,...,An ρAn A1,A2,...,An−1 ρAn−1 A1,A2,...,An−2