Arxiv:1905.06216V2 [Cond-Mat.Stat-Mech] 19 Sep 2019 B

Information thermodynamics for interacting stochastic systems without bipartite structure R. Chétrite,1 M.L. Rosinberg,2 T. Sagawa,3 and G. Tarjus2 1Laboratoire J.A. Dieudonné,UMR CNRS 7351, Universitéde Nice Sophia-Antipolis, Parc Valrose, 06108 Nice Cedex 02, France 2Sorbonne Université,CNRS, Laboratoire de Physique Théoriquede la Matière Condensée,LPTMC, F-75005 Paris, France 3Department of Applied Physics, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan Fluctuations in biochemical networks, e.g., in a living cell, have a complex origin that precludes a description of such systems in terms of bipartite or multipartite processes, as is usually done in the framework of stochastic and/or information thermodynamics. This means that fluctuations in each subsystem are not independent: subsystems jump simultaneously if the dynamics is modeled as a Markov jump process, or noises are correlated for diffusion processes. In this paper, we consider information and thermodynamic exchanges between a pair of coupled systems that do not satisfy the bipartite property. The generalization of information-theoretic measures, such as learning rates and transfer entropy rates, to this situation is non-trivial and also involves introducing several additional rates. We describe how this can be achieved in the framework of general continuous- time Markov processes, without restricting the study to the steady-state regime. We illustrate our general formalism on the case of diffusion processes and derive an extension of the second law of information thermodynamics in which the difference of transfer entropy rates in the forward and backward time directions replaces the learning rate. As a side result, we also generalize an important relation linking information theory and estimation theory. To further obtain analytical expressions we treat in detail the case of Ornstein-Uhlenbeck processes, and discuss the ability of the various information measures to detect a directional coupling in the presence of correlated noises. Finally, we apply our formalism to the analysis of the directional influence between cellular processes in a concrete example, which also requires considering the case of a non-bipartite and non-Markovian process. Contents I. Introduction 2 II. Setup and brief reminder of the bipartite case 4 A. Setup 4 B. Definition of information measures 5 C. Inequalities and sufficient statistic 7 D. Entropy production and second law 7 III. Information measures for non-bipartite processes 8 A. Learning rates 8 B. Transfer entropy rates 10 C. Backward transfer entropy rates 11 D. Inequalities 12 IV. Markov diffusion processes and second law of information thermodynamics 14 A. Learning rates 14 arXiv:1905.06216v2 [cond-mat.stat-mech] 19 Sep 2019 B. Single-time-step transfer entropy rates 15 C. Multi-time-step transfer entropy rates 17 D. Second law for non-bipartite processes 17 V. Stationary bi-dimensional Ornstein-Uhlenbeck process 19 A. Expression of the multi-time-step TE rates 20 B. Numerical illustration 21 2 VI. Generalization to a non-Markovian process and application to the study of directional influence between cellular processes 23 A. Extension to a bivariate non-Markovian process 23 B. Application to the study of directional influence between cellular processes 24 VII. Summary 27 Acknowledgements 27 A. Expression of the learning rates for a pure jump Markov process in discrete space 27 B. Proof of various relations of the main text 27 C. Sufficient statistic for a non-bipartite dynamics 30 D. Analytical expressions for a stationary bi-dimensional Ornstein-Uhlenbeck process 30 E. Multi-time-step TE rates for the three-component process of Sec. VI B 35 References 37 I. INTRODUCTION One recent and important field of application of information theory is biological systems, in particular gene regu- lation and signal transduction systems. Cells must sense, process and adapt to their environment or their own phys- iological state, which are noisy processes subjected to fluctuations (see e.g. [1]). In addition, information transfers consume energy, so that there is a competition between the information gain and the energy cost. This fundamental issue is the realm of information thermodynamics, a recent and active field of research, as reviewed in [2]. In this framework, the present work is motivated by the observation that there exist, broadly speaking, two different sources of fluctuations contributing to the stochasticity of biochemical processes, for instance in cell metabolic networks. The first one { commonly called \intrinsic"{ is due the small numbers of biomolecules involved in a given reaction. The second one { the \extrinsic" source{ arises from the heterogeneity in the physical environment of the cell and the occurrence of (many) other biochemical reactions (see, e.g., [3{11]). This implies that the noise in the input biochemical signal { to be detected{ and the noise of the reactions that form the network are correlated. In short, stochastic noises have a nontrivial structure and fluctuations in each subsystem are not independent. In contrast, in the context of stochastic and information thermodynamics, the so-called bipartite assumption is usually made (e.g., for modeling Maxwell's demons): one assumes that subsystems cannot jump simultaneously if the dynamics is modeled as a Markov jump process or that noises are uncorrelated if the dynamics is modeled as diffusion processes.This simplifies the theoretical analysis and allows the contribution of each components of the system to the entropy production to be clearly identified [12{28]. Although the abandon of the bipartite (or multipartite [22]) structure seriously complicates the interpretation of information and thermodynamics exchanges, our objective in the present work is to show that a detailed description is still available. The price to pay is that several information- theoretic measures must be added to those already introduced in the literature (information flow, aka learning rate, and transfer entropy), which characterize how information is exchanged between two interacting systems in the course of their dynamical evolution. In this paper, we will mostly consider non-equilibrium systems that can be modeled by continuous-time Markov processes (diffusions, jump processes, or both). It turns out however that many definitions or relations are also valid beyond the Markovian description and we will therefore provide a general framework. Moreover, in order to offer a sufficiently general perspective, we assume the presence of multiplicative noises (additive noises being regarded as a only special case) and we do not restrict the study to steady-state situations, as is often done. On the other hand, as far as stochastic thermodynamics is concerned, we only consider averaged quantities and do not derive fluctuation relations. We leave this important issue to future investigations. The main results of the paper can be summarized as follows: 1) We introduce a set of information-theoretic measures to consistently characterize information exchanges in a generic non-bipartite stochastic system composed of two interacting subsystems. This includes learning rates and transfer entropy rates. In particular, we define a learning rate that contains the footprints of time-irreversibility. To make the reading of the following sections easier, the definitions of all these information measures are listed in Table I. 3 2) We derive a set of inequalities among these quantities and show under which circumstances some inequalities become equalities, which may signal the presence of a so-called “sufficient statistic" [27, 29]. For Markov processes, we then propose a generalization of the notion of sensory capacity introduced in [23]. 3) We give explicit expressions of the information measures for Markov diffusion processes in terms of probability currents, diffusion tensor, and propagators. In passing, we obtain a generalization to non-bipartite systems of a classical result linking information theory and estimation theory [30, 31]. 5) We derive a generalized version of the so-called \second law of information thermodynamics" [2] that applies to non-bipartite diffusion processes, showing that the entropy production rate in one of the subsystems is lower bounded by the difference in transfer entropy rates in the forward and backward time directions. 6) We illustrate the behavior of the various information-theoretic measures in the case of a stationary bidimensional Ornstein-Uhlenbeck process and show their evolution as one systematically varies the parameter quantifying the correlation between the noises. 7) We apply our formalism to the study of the directional influence between cellular processes in the metabolism of E. coli [8, 11] and exhibit an intriguing feature that may indicate some optimality in the transmission of information. Information measures Name Definition + 1 D P (YtjXt+h) E l (t) Forward learning rate (a.k.a. information flow) ln j + X h P (YtjXt) 0 − 1 D P (Yt+hjXt+h) E l (t) Backward learning rate ln j + X h P (Yt+hjXt) 0 s s 1 + − lX (t) Symmetric learning rate lX (t) = 2 [lX (t) + lX (t)] t t 1 D P (Yt+hjX0;Y0 ) E TX!Y (t) Transfer entropy (TE) rate h ln t j0+ P (Yt+hjY0 ) 1 D P (Yt+hjXt;Yt) E T X!Y (t) Single-time-step TE rate ln j + h P (Yt+hjYt) 0 T T y 1 D P (YtjXt+h;Yt+h) E TX!Y (t) Backward TE rate h ln T j0+ P (YtjYt+h) y 1 D P (YtjXt+h;Yt+h) E T X!Y (t) Single-time-step backward TE rate ln j + h P (YtjYt+h) 0 t 1 D P (Xt+h;Yt+hjY0 ) E TbX!Y (t) Filtered TE rate h ln t t j0+ P (Xt+hjY0 )P (Yt+hjY0 ) 1 D P (Xt+h;Yt+hjYt) E Tb X!Y (t) Single-time-step filtered TE rate ln j + h P (Xt+hjYt)P (Yt+hjYt) 0 TABLE I: Definitions of information measures for a non-bipartite process Zt = (Xt;Yt).

Arxiv:1905.06216V2 [Cond-Mat.Stat-Mech] 19 Sep 2019 B

JIDT: an Information-Theoretic Toolkit for Studying the Dynamics of Complex Systems

Exploratory Causal Analysis in Bivariate Time Series Data

Empirical Analysis Using Symbolic Transfer Entropy

Entropic Measures of Connectivity with an Application to Intracerebral Epileptic Signals Jie Zhu

Fast and Effective Pseudo Transfer Entropy for Bivariate Data-Driven

TRENTOOL 3.3.1 Beta – User Documentation

Direction of Information Flow in Large-Scale Resting-State

Causal Coupling Inference from Multivariate Time Seriesbasedonordinalpartitiontransitionnetworks

Disentangling Causal Webs in the Brain Using Functional Magnetic Resonance Imaging: a Review of Current Approaches

Exploratory Causal Analysis with Time Series Data Synthesis Lectures on Data Mining and Knowledge Discovery

On the Use of Transfer Entropy to Investigate the Time Horizon of Causal Influences Between Signals

Arxiv:1706.01954V2 [Stat.ME] 5 Oct 2017 Models and Predictions by Themselves