Information Theoretical Properties of Tsallis Entropies
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Lecture 3: Entropy, Relative Entropy, and Mutual Information 1 Notation 2
EE376A/STATS376A Information Theory Lecture 3 - 01/16/2018 Lecture 3: Entropy, Relative Entropy, and Mutual Information Lecturer: Tsachy Weissman Scribe: Yicheng An, Melody Guan, Jacob Rebec, John Sholar In this lecture, we will introduce certain key measures of information, that play crucial roles in theoretical and operational characterizations throughout the course. These include the entropy, the mutual information, and the relative entropy. We will also exhibit some key properties exhibited by these information measures. 1 Notation A quick summary of the notation 1. Discrete Random Variable: U 2. Alphabet: U = fu1; u2; :::; uM g (An alphabet of size M) 3. Specific Value: u; u1; etc. For discrete random variables, we will write (interchangeably) P (U = u), PU (u) or most often just, p(u) Similarly, for a pair of random variables X; Y we write P (X = x j Y = y), PXjY (x j y) or p(x j y) 2 Entropy Definition 1. \Surprise" Function: 1 S(u) log (1) , p(u) A lower probability of u translates to a greater \surprise" that it occurs. Note here that we use log to mean log2 by default, rather than the natural log ln, as is typical in some other contexts. This is true throughout these notes: log is assumed to be log2 unless otherwise indicated. Definition 2. Entropy: Let U a discrete random variable taking values in alphabet U. The entropy of U is given by: 1 X H(U) [S(U)] = log = − log (p(U)) = − p(u) log p(u) (2) , E E p(U) E u Where U represents all u values possible to the variable. -
An Introduction to Information Theory
An Introduction to Information Theory Vahid Meghdadi reference : Elements of Information Theory by Cover and Thomas September 2007 Contents 1 Entropy 2 2 Joint and conditional entropy 4 3 Mutual information 5 4 Data Compression or Source Coding 6 5 Channel capacity 8 5.1 examples . 9 5.1.1 Noiseless binary channel . 9 5.1.2 Binary symmetric channel . 9 5.1.3 Binary erasure channel . 10 5.1.4 Two fold channel . 11 6 Differential entropy 12 6.1 Relation between differential and discrete entropy . 13 6.2 joint and conditional entropy . 13 6.3 Some properties . 14 7 The Gaussian channel 15 7.1 Capacity of Gaussian channel . 15 7.2 Band limited channel . 16 7.3 Parallel Gaussian channel . 18 8 Capacity of SIMO channel 19 9 Exercise (to be completed) 22 1 1 Entropy Entropy is a measure of uncertainty of a random variable. The uncertainty or the amount of information containing in a message (or in a particular realization of a random variable) is defined as the inverse of the logarithm of its probabil- ity: log(1=PX (x)). So, less likely outcome carries more information. Let X be a discrete random variable with alphabet X and probability mass function PX (x) = PrfX = xg, x 2 X . For convenience PX (x) will be denoted by p(x). The entropy of X is defined as follows: Definition 1. The entropy H(X) of a discrete random variable is defined by 1 H(X) = E log p(x) X 1 = p(x) log (1) p(x) x2X Entropy indicates the average information contained in X. -
Chapter 2: Entropy and Mutual Information
Chapter 2: Entropy and Mutual Information University of Illinois at Chicago ECE 534, Natasha Devroye Chapter 2 outline • Definitions • Entropy • Joint entropy, conditional entropy • Relative entropy, mutual information • Chain rules • Jensen’s inequality • Log-sum inequality • Data processing inequality • Fano’s inequality University of Illinois at Chicago ECE 534, Natasha Devroye Definitions A discrete random variable X takes on values x from the discrete alphabet . X The probability mass function (pmf) is described by p (x)=p(x) = Pr X = x , for x . X { } ∈ X University of Illinois at Chicago ECE 534, Natasha Devroye Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981 You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/ for links. 2 Probability, Entropy, and Inference Definitions Copyright Cambridge University Press 2003. On-screen viewing permitted. Printing not permitted. http://www.cambridge.org/0521642981 This chapter, and its sibling, Chapter 8, devote some time to notation. Just You can buy this book for 30 pounds or $50. See http://www.inference.phy.cam.ac.uk/mackay/itila/as the White Knight fordistinguished links. between the song, the name of the song, and what the name of the song was called (Carroll, 1998), we will sometimes 2.1: Probabilities and ensembles need to be careful to distinguish between a random variable, the v23alue of the i ai pi random variable, and the proposition that asserts that the random variable x has a particular value. In any particular chapter, however, I will use the most 1 a 0.0575 a Figure 2.2. -
Quantum Entanglement Inferred by the Principle of Maximum Tsallis Entropy
Quantum entanglement inferred by the principle of maximum Tsallis entropy Sumiyoshi Abe1 and A. K. Rajagopal2 1College of Science and Technology, Nihon University, Funabashi, Chiba 274-8501, Japan 2Naval Research Laboratory, Washington D.C., 20375-5320, U.S.A. The problem of quantum state inference and the concept of quantum entanglement are studied using a non-additive measure in the form of Tsallis’ entropy indexed by the positive parameter q. The maximum entropy principle associated with this entropy along with its thermodynamic interpretation are discussed in detail for the Einstein- Podolsky-Rosen pair of two spin-1/2 particles. Given the data on the Bell-Clauser- Horne-Shimony-Holt observable, the analytic expression is given for the inferred quantum entangled state. It is shown that for q greater than unity, indicating the sub- additive feature of the Tsallis entropy, the entangled region is small and enlarges as one goes into the super-additive regime where q is less than unity. It is also shown that quantum entanglement can be quantified by the generalized Kullback-Leibler entropy. quant-ph/9904088 26 Apr 1999 PACS numbers: 03.67.-a, 03.65.Bz I. INTRODUCTION Entanglement is a fundamental concept highlighting non-locality of quantum mechanics. It is well known [1] that the Bell inequalities, which any local hidden variable theories should satisfy, can be violated by entangled states of a composite system described by quantum mechanics. Experimental results [2] suggest that naive local realism à la Einstein, Podolsky, and Rosen [3] may not actually hold. Related issues arising out of the concept of quantum entanglement are quantum cryptography, quantum teleportation, and quantum computation. -
Noisy Channel Coding
Noisy Channel Coding: Correlated Random Variables & Communication over a Noisy Channel Toni Hirvonen Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing [email protected] T-61.182 Special Course in Information Science II / Spring 2004 1 Contents • More entropy definitions { joint & conditional entropy { mutual information • Communication over a noisy channel { overview { information conveyed by a channel { noisy channel coding theorem 2 Joint Entropy Joint entropy of X; Y is: 1 H(X; Y ) = P (x; y) log P (x; y) xy X Y 2AXA Entropy is additive for independent random variables: H(X; Y ) = H(X) + H(Y ) iff P (x; y) = P (x)P (y) 3 Conditional Entropy Conditional entropy of X given Y is: 1 1 H(XjY ) = P (y) P (xjy) log = P (x; y) log P (xjy) P (xjy) y2A "x2A # y2A A XY XX XX Y It measures the average uncertainty (i.e. information content) that remains about x when y is known. 4 Mutual Information Mutual information between X and Y is: I(Y ; X) = I(X; Y ) = H(X) − H(XjY ) ≥ 0 It measures the average reduction in uncertainty about x that results from learning the value of y, or vice versa. Conditional mutual information between X and Y given Z is: I(Y ; XjZ) = H(XjZ) − H(XjY; Z) 5 Breakdown of Entropy Entropy relations: Chain rule of entropy: H(X; Y ) = H(X) + H(Y jX) = H(Y ) + H(XjY ) 6 Noisy Channel: Overview • Real-life communication channels are hopelessly noisy i.e. introduce transmission errors • However, a solution can be achieved { the aim of source coding is to remove redundancy from the source data -
Networked Distributed Source Coding
Networked distributed source coding Shizheng Li and Aditya Ramamoorthy Abstract The data sensed by differentsensors in a sensor network is typically corre- lated. A natural question is whether the data correlation can be exploited in innova- tive ways along with network information transfer techniques to design efficient and distributed schemes for the operation of such networks. This necessarily involves a coupling between the issues of compression and networked data transmission, that have usually been considered separately. In this work we review the basics of classi- cal distributed source coding and discuss some practical code design techniques for it. We argue that the network introduces several new dimensions to the problem of distributed source coding. The compression rates and the network information flow constrain each other in intricate ways. In particular, we show that network coding is often required for optimally combining distributed source coding and network information transfer and discuss the associated issues in detail. We also examine the problem of resource allocation in the context of distributed source coding over networks. 1 Introduction There are various instances of problems where correlated sources need to be trans- mitted over networks, e.g., a large scale sensor network deployed for temperature or humidity monitoring over a large field or for habitat monitoring in a jungle. This is an example of a network information transfer problem with correlated sources. A natural question is whether the data correlation can be exploited in innovative ways along with network information transfer techniques to design efficient and distributed schemes for the operation of such networks. This necessarily involves a Shizheng Li · Aditya Ramamoorthy Iowa State University, Ames, IA, U.S.A. -
On Measures of Entropy and Information
On Measures of Entropy and Information Tech. Note 009 v0.7 http://threeplusone.com/info Gavin E. Crooks 2018-09-22 Contents 5 Csiszar´ f-divergences 12 Csiszar´ f-divergence ................ 12 0 Notes on notation and nomenclature 2 Dual f-divergence .................. 12 Symmetric f-divergences .............. 12 1 Entropy 3 K-divergence ..................... 12 Entropy ........................ 3 Fidelity ........................ 12 Joint entropy ..................... 3 Marginal entropy .................. 3 Hellinger discrimination .............. 12 Conditional entropy ................. 3 Pearson divergence ................. 14 Neyman divergence ................. 14 2 Mutual information 3 LeCam discrimination ............... 14 Mutual information ................. 3 Skewed K-divergence ................ 14 Multivariate mutual information ......... 4 Alpha-Jensen-Shannon-entropy .......... 14 Interaction information ............... 5 Conditional mutual information ......... 5 6 Chernoff divergence 14 Binding information ................ 6 Chernoff divergence ................. 14 Residual entropy .................. 6 Chernoff coefficient ................. 14 Total correlation ................... 6 Renyi´ divergence .................. 15 Lautum information ................ 6 Alpha-divergence .................. 15 Uncertainty coefficient ............... 7 Cressie-Read divergence .............. 15 Tsallis divergence .................. 15 3 Relative entropy 7 Sharma-Mittal divergence ............. 15 Relative entropy ................... 7 Cross entropy -
Information Theory and Maximum Entropy 8.1 Fundamentals of Information Theory
NEU 560: Statistical Modeling and Analysis of Neural Data Spring 2018 Lecture 8: Information Theory and Maximum Entropy Lecturer: Mike Morais Scribes: 8.1 Fundamentals of Information theory Information theory started with Claude Shannon's A mathematical theory of communication. The first building block was entropy, which he sought as a functional H(·) of probability densities with two desired properties: 1. Decreasing in P (X), such that if P (X1) < P (X2), then h(P (X1)) > h(P (X2)). 2. Independent variables add, such that if X and Y are independent, then H(P (X; Y )) = H(P (X)) + H(P (Y )). These are only satisfied for − log(·). Think of it as a \surprise" function. Definition 8.1 (Entropy) The entropy of a random variable is the amount of information needed to fully describe it; alternate interpretations: average number of yes/no questions needed to identify X, how uncertain you are about X? X H(X) = − P (X) log P (X) = −EX [log P (X)] (8.1) X Average information, surprise, or uncertainty are all somewhat parsimonious plain English analogies for entropy. There are a few ways to measure entropy for multiple variables; we'll use two, X and Y . Definition 8.2 (Conditional entropy) The conditional entropy of a random variable is the entropy of one random variable conditioned on knowledge of another random variable, on average. Alternative interpretations: the average number of yes/no questions needed to identify X given knowledge of Y , on average; or How uncertain you are about X if you know Y , on average? X X h X i H(X j Y ) = P (Y )[H(P (X j Y ))] = P (Y ) − P (X j Y ) log P (X j Y ) Y Y X X = = − P (X; Y ) log P (X j Y ) X;Y = −EX;Y [log P (X j Y )] (8.2) Definition 8.3 (Joint entropy) X H(X; Y ) = − P (X; Y ) log P (X; Y ) = −EX;Y [log P (X; Y )] (8.3) X;Y 8-1 8-2 Lecture 8: Information Theory and Maximum Entropy • Bayes' rule for entropy H(X1 j X2) = H(X2 j X1) + H(X1) − H(X2) (8.4) • Chain rule of entropies n X H(Xn;Xn−1; :::X1) = H(Xn j Xn−1; :::X1) (8.5) i=1 It can be useful to think about these interrelated concepts with a so-called information diagram. -
On the Exact Variance of Tsallis Entanglement Entropy in a Random Pure State
entropy Article On the Exact Variance of Tsallis Entanglement Entropy in a Random Pure State Lu Wei Department of Electrical and Computer Engineering, University of Michigan, Dearborn, MI 48128, USA; [email protected] Received: 26 April 2019; Accepted: 25 May 2019; Published: 27 May 2019 Abstract: The Tsallis entropy is a useful one-parameter generalization to the standard von Neumann entropy in quantum information theory. In this work, we study the variance of the Tsallis entropy of bipartite quantum systems in a random pure state. The main result is an exact variance formula of the Tsallis entropy that involves finite sums of some terminating hypergeometric functions. In the special cases of quadratic entropy and small subsystem dimensions, the main result is further simplified to explicit variance expressions. As a byproduct, we find an independent proof of the recently proven variance formula of the von Neumann entropy based on the derived moment relation to the Tsallis entropy. Keywords: entanglement entropy; quantum information theory; random matrix theory; variance 1. Introduction Classical information theory is the theory behind the modern development of computing, communication, data compression, and other fields. As its classical counterpart, quantum information theory aims at understanding the theoretical underpinnings of quantum science that will enable future quantum technologies. One of the most fundamental features of quantum science is the phenomenon of quantum entanglement. Quantum states that are highly entangled contain more information about different parts of the composite system. As a step to understand quantum entanglement, we choose to study the entanglement property of quantum bipartite systems. The quantum bipartite model, proposed in the seminal work of Page [1], is a standard model for describing the interaction of a physical object with its environment for various quantum systems. -
Arxiv:1707.03526V1 [Cond-Mat.Stat-Mech] 12 Jul 2017 Eq
Generalized Ensemble Theory with Non-extensive Statistics Ke-Ming Shen,∗ Ben-Wei Zhang,y and En-Ke Wang Key Laboratory of Quark & Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China (Dated: October 17, 2018) The non-extensive canonical ensemble theory is reconsidered with the method of Lagrange multipliers by maximizing Tsallis entropy, with the constraint that the normalized term of P q Tsallis' q−average of physical quantities, the sum pj , is independent of the probability pi for Tsallis parameter q. The self-referential problem in the deduced probability and thermal quantities in non-extensive statistics is thus avoided, and thermodynamical relationships are obtained in a consistent and natural way. We also extend the study to the non-extensive grand canonical ensemble theory and obtain the q-deformed Bose-Einstein distribution as well as the q-deformed Fermi-Dirac distribution. The theory is further applied to the general- ized Planck law to demonstrate the distinct behaviors of the various generalized q-distribution functions discussed in literature. I. INTRODUCTION In the last thirty years the non-extensive statistical mechanics, based on Tsallis entropy [1,2] and the corresponding deformed exponential function, has been developed and attracted a lot of attentions with a large amount of applications in rather diversified fields [3]. Tsallis non- extensive statistical mechanics is a generalization of the common Boltzmann-Gibbs (BG) statistical PW mechanics by postulating a generalized entropy of the classical one, S = −k i=1 pi ln pi: W X q Sq = −k pi lnq pi ; (1) i=1 where k is a positive constant and denotes Boltzmann constant in BG statistical mechanics. -
On the Entropy of Sums
On the Entropy of Sums Mokshay Madiman Department of Statistics Yale University Email: [email protected] Abstract— It is shown that the entropy of a sum of independent II. PRELIMINARIES random vectors is a submodular set function, and upper bounds on the entropy of sums are obtained as a result in both discrete Let X1,X2,...,Xn be a collection of random variables and continuous settings. These inequalities complement the lower taking values in some linear space, so that addition is a bounds provided by the entropy power inequalities of Madiman well-defined operation. We assume that the joint distribu- and Barron (2007). As applications, new inequalities for the tion has a density f with respect to some reference mea- determinants of sums of positive-definite matrices are presented. sure. This allows the definition of the entropy of various random variables depending on X1,...,Xn. For instance, I. INTRODUCTION H(X1,X2,...,Xn) = −E[log f(X1,X2,...,Xn)]. There are the familiar two canonical cases: (a) the random variables Entropies of sums are not as well understood as joint are real-valued and possess a probability density function, or entropies. Indeed, for the joint entropy, there is an elaborate (b) they are discrete. In the former case, H represents the history of entropy inequalities starting with the chain rule of differential entropy, and in the latter case, H represents the Shannon, whose major developments include works of Han, discrete entropy. We simply call H the entropy in all cases, ˇ Shearer, Fujishige, Yeung, Matu´s, and others. The earlier part and where the distinction matters, the relevant assumption will of this work, involving so-called Shannon-type inequalities be made explicit. -
Grand Canonical Ensemble of the Extended Two-Site Hubbard Model Via a Nonextensive Distribution
Grand canonical ensemble of the extended two-site Hubbard model via a nonextensive distribution Felipe Américo Reyes Navarro1;2∗ Email: [email protected] Eusebio Castor Torres-Tapia2 Email: [email protected] Pedro Pacheco Peña3 Email: [email protected] 1Facultad de Ciencias Naturales y Matemática, Universidad Nacional del Callao (UNAC) Av. Juan Pablo II 306, Bellavista, Callao, Peru 2Facultad de Ciencias Físicas, Universidad Nacional Mayor de San Marcos (UNMSM) Av. Venezuela s/n Cdra. 34, Apartado Postal 14-0149, Lima 14, Peru 3Universidad Nacional Tecnológica del Cono Sur (UNTECS), Av. Revolución s/n, Sector 3, Grupo 10, Mz. M Lt. 17, Villa El Salvador, Lima, Peru ∗Corresponding author. Facultad de Ciencias Físicas, Universidad Nacional Mayor de San Marcos (UNMSM), Av. Venezuela s/n Cdra. 34, Apartado Postal 14-0149, Lima 14, Peru Abstract We hereby introduce a research about a grand canonical ensemble for the extended two-site Hubbard model, that is, we consider the intersite interaction term in addition to those of the simple Hubbard model. To calculate the thermodynamical parameters, we utilize the nonextensive statistical mechan- ics; specifically, we perform the simulations of magnetic internal energy, specific heat, susceptibility, and thermal mean value of the particle number operator. We found out that the addition of the inter- site interaction term provokes a shifting in all the simulated curves. Furthermore, for some values of the on-site Coulombian potential, we realize that, near absolute zero, the consideration of a chemical potential varying with temperature causes a nonzero entropy. Keywords Extended Hubbard model,Archive Quantum statistical mechanics, Thermalof properties SID of small particles PACS 75.10.Jm, 05.30.-d,65.80.+n Introduction Currently, several researches exist on the subject of the application of a generalized statistics for mag- netic systems in the literature [1-3].