Granger Causality and Transfer Entropy for Financial Returns E.M

Total Page:16

File Type:pdf, Size:1020Kb

Granger Causality and Transfer Entropy for Financial Returns E.M Vol. 127 (2015) ACTA PHYSICA POLONICA A No. 3-A Proceedings of the 7th Symposium FENS, Lublin, May 14–17, 2014 Granger Causality and Transfer Entropy for Financial Returns E.M. Syczewskaa;∗ and Z.R. Struzikb;c;d aWarsaw School of Economics, Department of Economic Analyses, Institute of Econometrics, Madalińskiego 6/8, 02-513 Warsaw, Poland bRIKEN Brain Science Institute, 2-1 Hirosawa, Wako-shi 351-0198, Japan cGraduate School of Education, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan dInstitute of Theoretical Physics and Astrophysics, The University of Gdańsk, Wita Stwosza 57, 80-952 Gdańsk, Poland Granger causality in its linear form has been shown by Barnett, Barrett and Seth [Phys. Rev. Lett. 103, 238701 (2009)] to be equivalent to transfer entropy in case of Gaussian distribution. Generalizations by Hlaváčková- Schindler [Appl. Math. Sci. 5, 3637 (2011)] are applied to distributions typical for biomedical applications. The financial returns, which are of great importance in financial econometrics, typically do not have Gaussian distribution. Generalizations leading to the concept of nonlinear Granger causality (e.g. causality in variance, causality in risk), known and applied in econometric literature, seem to be less known outside this field. In the paper an overview of some of the definitions and applications is given. In particular, we indicate some recent econometric results concerning application of the tests in linear multivariate framework. We emphasize importance of other variants of Granger causality, and need of development of methods reflecting features of financial variables. DOI: 10.12693/APhysPolA.127.A-129 PACS: 89.20.–a, 89.70.Cf, 02.50.–r, 05.45.Tp, 02.70.Rr 1. Introduction important generalizations. Second, the question of sta- Granger causality (GC) and transfer entropy (TE) are tionarity and nonstationarity, and of proper choice of two approaches to mutual causation. The transfer en- distribution has been learned by econometricians the tropy, TE, introduced by Schreiber [1], builds on the hard way. This is especially important for financial time concept of (Shannon) entropy, but aims at detecting series — underestimating or overestimating particular dynamic causation links between a pair of variables. values due to a wrong choice of probability distribu- The Granger causality, GC, aside of TE, tends to be tion for the model can lead to losses or to inefficiency. used by researchers in climatology, physiology, neuro- The proper choice of the distribution is especially crucial physiology, multimode laser dynamics, analysis of causal- for risk assessment and volatility forecasting by financial ity in cardio-respiratory interactions, etc. (see [2], p. 5), institutions. partially due to equivalence of the GC and TE con- Hlaváčková-Schindler et al. [2] in addition to entropy cepts, proved by Barnett et al. in [3], and by Hlaváčková- and mutual information measures, describe multivariate Schindler et al. [2]. Let us emphasize that the equiva- GC in vector autoregression model framework, nonlinear lence was shown only for a linear GC test, and originally GC tests, and nonparametric GC measures based on cor- only under assumption of Gaussianity. Later Hlaváčková- relation integral, but still do not cover the GC-TE equiv- Schindler [4] extended the analysis for some probabil- alence in the general sense. On the other hand, econo- ity distributions typical for biomedical phenomena (log- metricians use both linear and nonlinear GC tests, and normal distribution, Gaussian mixtures, generalized nor- tools based on mutual information, entropy etc. It seems mal distribution, Weinman exponential distribution). that absorption by econometricians of methods aimed at Barnett and Bossomaier [5] show TE and GC equivalence detecting causality, developed in the field of neurophysi- for the vector autoregressive model also under assump- ology etc. is stronger than that of methods developed in tion of Gaussianity. This does not cover other concepts of econometrics (especially of nonlinear Granger causality) Granger causality, and does not necessarily cover all dis- in the other direction. tributions used in applied financial research and practice. 2. Typical features of financial variables The concentration on the linear form of the GC (and the Gaussianity assumption) in biomedical appli- Financial variables have specific features, which led to cations seems to be somewhat misleading. First of all, development of particular modeling tools from the field of the Granger causality concept as described in papers financial econometrics. Let Pt denote price of a financial by C.W.J. Granger is richer than that (covers linear instrument at time t. Most financial variables of interest and nonlinear causality, causality in spectral domain, (stock indices, exchange rates, stock and options etc.) causality based on information concepts), and has several are nonstationary. Their returns, defined as difference of natural logarithms of prices, rt = ln(Pt) − ln(Pt−1), are stationary in mean, but typically show changes of volatility in time (termed volatility clustering). Changing ∗corresponding author; e-mail: [email protected] volatility is modeled e.g. with ARCH or GARCH-models, the first equation of which describes (and forecasts) mean (A-129) A-130 E.M. Syczewska, Z.R. Struzik of the variable, and the second equation describes de- The GED probability density function is (see Tsay [15], pendence between changing volatilities with use of condi- p. 122): tional variance of the first equation. The ARCH (“autore- ν exp(−0:5jx/λjν ) f (x) = ; (2) gressive with conditional heteroskedasticity”) model was GED λ2(1+1/ν)Γ(1/ν) introduced by Robert F. Engle [6] in 1982; GARCH (gen- where Γ(·) is the gamma function, and λ = eralized ARCH) — by Timothy Bollerslev [7] in 1986, and [2(−2/ν)Γ(1/ν)=Γ(3/ν)]1=2. This distribution has heavy many more detailed GARCH-type specifications have tails when ν < 2 and reduces to a Gaussian distribution been introduced by various authors to cover specific fea- when ν = 2. tures of financial instruments and particular markets. To illustrate discrepancies, especially in the tails, be- The GARCH family models originally estimated by the tween Gaussian and the empirical distribution of the maximum likelihood method with assumption of Gaus- daily returns of the USDPLN exchange rate and the sianity of error terms, prove to give more accurate results WIG20 returns, see Figure for daily quotations from if non-Gaussian, often skewed, probability distributions stooq.pl (Jan. 4, 2000–Oct. 31, 2014)‡. The results of the are applied, especially for daily and higher frequency data Granger causality test (based on the VAR model) for the (see Alexander [8]). two variables [16] show that during crisis, Oct. 01, 2007– Returns of financial instruments typically have skewed Feb. 27, 2009, the null hypothesis of no causality from and leptokurtic distributions. According to Mandelbrot, USDPLN to WIG20 is weakly rejected, the same for who first noticed volatility clustering in financial returns causality from WIG20 towards USDPLN. The results time series [9], returns should be modelled with use of † change somewhat after the crisis. Neither the returns nor Pareto-Lévy processes . In finance and in econometrics the error terms of the VAR models are Gaussian. returns are most often described with use of the follow- ing distributions: Gaussian mixtures, t-Student, or more complicated: general error distribution (GED in short) or USDPLN generalizations of the last two (see Osińska [12], pp. 170– ● ● ● ●●●●● ●●●● ●●●●●● 0.04 ●●●●●●● 172). The GED (General Error Distribution) has been ●●●●●●●●● ●●●●●●●●● ●●●●●●●●●●●● ●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●● introduced in [13] by Subbotin, and later generalized by ●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●● 0.00 ●●●●●●●●●●●●●●●● ●●●●●●●●●●●●● ●●●●●●●●●● Theodossiou [14] to a Skewed GED. The reason is that ●●●●●●●●● ●●●● ●●●●●●●●● ●●●●● some stable distributions are not suitable for statistical ● ● Sample Quantiles testing of an econometric model, and that more complex ● −0.06 distributions are intended to better reflect actual features −2 0 2 of financial variables. The skewed generalized t distribu- tion can be expressed as (see Osińska [12]): Theoretical Quantiles k fSGDT(x) = C 1 + WIG20 ν − 2 ν+1 k) k ● ● x + µ ●●● −k ●●●●●● ●●●●●●●●●● × [θ(1 + sgn(x − µ))λ] ; (1) ●●●●●●●●● ●●●●●●●●●●● ●●●●●●●●●● ●●●●●●●●●●●●● σ ●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●● 0.00 ●●● ●●●●●●●●●●●●●●●● ●●●●●●●●●●●● where B(·; ·) — the beta function, k; ν; λ, µ, σ — param- ●●●●●●●●●●● ●●●●●●●●●● ●●●●●● ●●●●●●● ●●●●●● eters of the distribution (k > 0; ν > 2; −1 < λ < 1; µ — ●●●● ●●●● z Sample Quantiles ● ● the mean; σ — the standard deviation); sgn(z) = jzj ; −0.10 the coefficients are defined as −2 0 2 1 ν −1:5 3 ν − 20:5 C = 0:5B ; B ; S(λ)σ−1; Theoretical Quantiles k k k k Fig. 1. Comparison of the quantiles of daily returns k 1=k 1 ν 3 ν − 2 θ = B( ; )0:5B( ; )0:5S(λ)−1; with theoretical Gaussian distribution. ν − 2 k k k k " In order to be aplicable in financial econometrics, ques- 2 ν − 12 S(λ) = 1 + 3λ2 − 4λ2B ; tion of TE and GC equivalence should include general- k k ization to the types of distributions typically applied in #0:5 financial econometric research. 3 ν − 2−1 1 ν −1 × B ; B ; : In this paper, examples of causation inference in the k k k k case of realistic data models
Recommended publications
  • JIDT: an Information-Theoretic Toolkit for Studying the Dynamics of Complex Systems
    JIDT: An information-theoretic toolkit for studying the dynamics of complex systems Joseph T. Lizier1, 2, ∗ 1Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, 04103 Leipzig, Germany 2CSIRO Digital Productivity Flagship, Marsfield, NSW 2122, Australia (Dated: December 4, 2014) Complex systems are increasingly being viewed as distributed information processing systems, particularly in the domains of computational neuroscience, bioinformatics and Artificial Life. This trend has resulted in a strong uptake in the use of (Shannon) information-theoretic measures to analyse the dynamics of complex systems in these fields. We introduce the Java Information Dy- namics Toolkit (JIDT): a Google code project which provides a standalone, (GNU GPL v3 licensed) open-source code implementation for empirical estimation of information-theoretic measures from time-series data. While the toolkit provides classic information-theoretic measures (e.g. entropy, mutual information, conditional mutual information), it ultimately focusses on implementing higher- level measures for information dynamics. That is, JIDT focusses on quantifying information storage, transfer and modification, and the dynamics of these operations in space and time. For this purpose, it includes implementations of the transfer entropy and active information storage, their multivariate extensions and local or pointwise variants. JIDT provides implementations for both discrete and continuous-valued data for each measure, including various types of estimator for continuous data (e.g. Gaussian, box-kernel and Kraskov-St¨ogbauer-Grassberger) which can be swapped at run-time due to Java's object-oriented polymorphism. Furthermore, while written in Java, the toolkit can be used directly in MATLAB, GNU Octave, Python and other environments. We present the principles behind the code design, and provide several examples to guide users.
    [Show full text]
  • Exploratory Causal Analysis in Bivariate Time Series Data
    Exploratory Causal Analysis in Bivariate Time Series Data Abstract Many scientific disciplines rely on observational data of systems for which it is difficult (or impossible) to implement controlled experiments and data analysis techniques are required for identifying causal information and relationships directly from observational data. This need has lead to the development of many different time series causality approaches and tools including transfer entropy, convergent cross-mapping (CCM), and Granger causality statistics. A practicing analyst can explore the literature to find many proposals for identifying drivers and causal connections in times series data sets, but little research exists of how these tools compare to each other in practice. This work introduces and defines exploratory causal analysis (ECA) to address this issue. The motivation is to provide a framework for exploring potential causal structures in time series data sets. J. M. McCracken Defense talk for PhD in Physics, Department of Physics and Astronomy 10:00 AM November 20, 2015; Exploratory Hall, 3301 Advisor: Dr. Robert Weigel; Committee: Dr. Paul So, Dr. Tim Sauer J. M. McCracken (GMU) ECA w/ time series causality November 20, 2015 1 / 50 Exploratory Causal Analysis in Bivariate Time Series Data J. M. McCracken Department of Physics and Astronomy George Mason University, Fairfax, VA November 20, 2015 J. M. McCracken (GMU) ECA w/ time series causality November 20, 2015 2 / 50 Outline 1. Motivation 2. Causality studies 3. Data causality 4. Exploratory causal analysis 5. Making an ECA summary Transfer entropy difference Granger causality statistic Pairwise asymmetric inference Weighed mean observed leaning Lagged cross-correlation difference 6.
    [Show full text]
  • Empirical Analysis Using Symbolic Transfer Entropy
    University of South Florida Scholar Commons Graduate Theses and Dissertations Graduate School June 2019 Measuring Influence Across Social Media Platforms: Empirical Analysis Using Symbolic Transfer Entropy Abhishek Bhattacharjee University of South Florida, [email protected] Follow this and additional works at: https://scholarcommons.usf.edu/etd Part of the Computer Sciences Commons Scholar Commons Citation Bhattacharjee, Abhishek, "Measuring Influence Across Social Media Platforms: Empirical Analysis Using Symbolic Transfer Entropy" (2019). Graduate Theses and Dissertations. https://scholarcommons.usf.edu/etd/7745 This Thesis is brought to you for free and open access by the Graduate School at Scholar Commons. It has been accepted for inclusion in Graduate Theses and Dissertations by an authorized administrator of Scholar Commons. For more information, please contact [email protected]. Measuring Influence Across Social Media Platforms: Empirical Analysis Using Symbolic Transfer Entropy by Abhishek Bhattacharjee A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science Department of Computer Science and Engineering College of Engineering University of South Florida Major Professor: Adriana Iamnitchi, Ph.D. John Skvoretz, Ph.D. Giovanni Luca Ciampaglia, Ph.D. Date of Approval: April 29, 2019 Keywords: Social Networks, Cross-platform influence Copyright © 2019, Abhishek Bhattacharjee DEDICATION Dedicated to my parents, brother, girlfriend and those five humans. ACKNOWLEDGMENTS I would like to pay my thanks to my advisor Dr Adriana Iamnitchi for her constant support and feedback throughout this project. The work wouldn’t have been possible without Leidos who provided us the data, and DARPA for funding this research work. I would also like to acknowledge the contributions made by Dr.
    [Show full text]
  • Entropic Measures of Connectivity with an Application to Intracerebral Epileptic Signals Jie Zhu
    Entropic measures of connectivity with an application to intracerebral epileptic signals Jie Zhu To cite this version: Jie Zhu. Entropic measures of connectivity with an application to intracerebral epileptic signals. Signal and Image processing. Université Rennes 1, 2016. English. NNT : 2016REN1S006. tel-01359072 HAL Id: tel-01359072 https://tel.archives-ouvertes.fr/tel-01359072 Submitted on 1 Sep 2016 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. ANNÉE 2016 THÈSE / UNIVERSITÉ DE RENNES 1 sous le sceau de l’Université Bretagne Loire pour le grade de DOCTEUR DE L’UNIVERSITÉ DE RENNES 1 Mention : Traitement du Signal et Télécommunications Ecole doctorale MATISSE Ecole Doctorale Mathématiques, Télécommunications, Informatique, Signal, Systèmes, Electronique Jie ZHU Préparée à l’unité de recherche LTSI - INSERM UMR 1099 Laboratoire Traitement du Signal et de l’Image ISTIC UFR Informatique et Électronique Thèse soutenue à Rennes Entropic Measures of le 22 juin 2016 Connectivity with an devant le jury composé de : MICHEL Olivier Application to Professeur
    [Show full text]
  • Fast and Effective Pseudo Transfer Entropy for Bivariate Data-Driven
    www.nature.com/scientificreports OPEN Fast and efective pseudo transfer entropy for bivariate data‑driven causal inference Riccardo Silini* & Cristina Masoller Identifying, from time series analysis, reliable indicators of causal relationships is essential for many disciplines. Main challenges are distinguishing correlation from causality and discriminating between direct and indirect interactions. Over the years many methods for data‑driven causal inference have been proposed; however, their success largely depends on the characteristics of the system under investigation. Often, their data requirements, computational cost or number of parameters limit their applicability. Here we propose a computationally efcient measure for causality testing, which we refer to as pseudo transfer entropy (pTE), that we derive from the standard defnition of transfer entropy (TE) by using a Gaussian approximation. We demonstrate the power of the pTE measure on simulated and on real‑world data. In all cases we fnd that pTE returns results that are very similar to those returned by Granger causality (GC). Importantly, for short time series, pTE combined with time‑shifted (T‑S) surrogates for signifcance testing strongly reduces the computational cost with respect to the widely used iterative amplitude adjusted Fourier transform (IAAFT) surrogate testing. For example, for time series of 100 data points, pTE and T‑S reduce the computational time by 82% with respect to GC and IAAFT. We also show that pTE is robust against observational noise. Therefore, we argue that the causal inference approach proposed here will be extremely valuable when causality networks need to be inferred from the analysis of a large number of short time series.
    [Show full text]
  • TRENTOOL 3.3.1 Beta – User Documentation
    TRENTOOL 3.3.1 beta – User Documentation Patricia Wollstadt∗ Michael Lindner Raul Vicente Michael Wibral Nicu Pampu Mario Martinez-Zarzuela Version 0.93 http://www.trentool.de/ Contents 1 Introduction 4 2 Installation and configuration6 3 Background 7 3.1 Transfer entropy....................................7 3.2 Notation and preliminaries..............................7 3.3 Practical TE estimation in TRENTOOL.......................8 3.4 Additional functionalities implemented in TRENTOOL.............. 11 3.4.1 Reconstruction of interaction delays..................... 11 3.4.2 Estimating TE from an ensemble of time series............... 11 3.4.3 Group analysis................................. 12 3.4.4 Graph correction for multivariate effects................... 12 4 Using TRENTOOL 14 4.1 Overview........................................ 14 4.2 TRENTOOL core functions.............................. 16 4.2.1 TEprepare.m .................................. 18 4.2.2 TEsurrogatestats.m ............................. 18 4.3 Reconstruction of interaction delays......................... 19 4.3.1 InteractionDelayReconstruction_calculate.m ............. 21 4.3.2 InteractionDelayReconstruction_plotting.m .............. 22 4.4 Estimating time-resolved transfer entropy from an ensemble of time series.... 22 4.4.1 TEsurrogatestats_ensemble.m ....................... 24 4.4.2 Combining time-resolved transfer entropy estimation with interaction delay reconstruction................................. 25 4.5 Binomial Test for Single Subject Results......................
    [Show full text]
  • Direction of Information Flow in Large-Scale Resting-State
    Direction of information flow in large-scale resting- state networks is frequency-dependent Arjan Hillebranda,1, Prejaas Tewariea, Edwin van Dellena,b, Meichen Yua, Ellen W. S. Carboc, Linda Douwc,d, Alida A. Gouwa,e, Elisabeth C. W. van Straatena,f, and Cornelis J. Stama aDepartment of Clinical Neurophysiology and Magnetoencephalography Center, VU University Medical Center, 1081 HV Amsterdam, The Netherlands; bDepartment of Psychiatry, Brain Center Rudolf Magnus, University Medical Center Utrecht, 3508 GA Utrecht, The Netherlands; cDepartment of Anatomy and Neurosciences, VU University Medical Center, 1081 HV Amsterdam, The Netherlands; dDepartment of Radiology, Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA 02129; eAlzheimer Center and Department of Neurology, VU University Medical Center, 1081 HV Amsterdam, The Netherlands; and fNutricia Advanced Medical Nutrition, Nutricia Research, 3584 CT Utrecht, The Netherlands Edited by Marcus E. Raichle, Washington University in St. Louis, St. Louis, MO, and approved February 24, 2016 (received for review August 8, 2015) Normal brain function requires interactions between spatially sepa- front pattern has also been observed in EEG (26–28). An important rated, and functionally specialized, macroscopic regions, yet the advantage of MEG over EEG in this context is that it is reference- directionality of these interactions in large-scale functional networks free. Moreover, the large number of sensors (several hundreds) in is unknown. Magnetoencephalography was used to determine the modern whole-head MEG systems allow for sophisticated spatial directionality of these interactions, where directionality was inferred filtering approaches to accurately reconstruct time series of neuronal from time series of beamformer-reconstructed estimates of neuronal activation across the cortex (29, 30).
    [Show full text]
  • Causal Coupling Inference from Multivariate Time Seriesbasedonordinalpartitiontransitionnetworks
    CAUSAL COUPLING INFERENCE FROM MULTIVARIATE TIME SERIES BASED ON ORDINAL PARTITION TRANSITION NETWORKS Narayan Puthanmadam Subramaniyam Faculty of Medicine and Health Technology Tampere University [email protected] Reik V. Donner Magdeburg–Stendal University of Applied Sciences and Potsdam Institute for Climate Impact Research (PIK) – Member of the Leibniz Association Davide Caron Gabriella Pannucio Enhanced Regenerative Medicine Enhanced Regenerative Medicine Istituto Italiano di Tecnologia Istituto Italiano di Tecnologia Jari Hyttinen Faculty of Medicine and Health Technology Tampere University June 3, 2021 ABSTRACT Identifying causal relationships is a challenging yet crucial problem in many fields of science like epidemiology, climatology, ecology, genomics, economics and neuroscience, to mention only a few. Recent studies have demonstrated that ordinal partition transition networks (OPTNs) allow inferring the coupling direction between two dynamical systems. In this work, we generalize this concept to the study of the interactions among multiple dynamical systems and we propose a new method to de- tect causality in multivariate observational data. By applying this method to numerical simulations of coupled linear stochastic processes as well as two examples of interacting nonlinear dynamical systems (coupled Lorenz systems and a network of neural mass models), we demonstrate that our approach can reliably identify the direction of interactions and the associated coupling delays. Fi- arXiv:2010.00948v4 [stat.ME] 1 Jun 2021 nally, we study real-world observational microelectrode array electrophysiology data from rodent brain slices to identify the causal coupling structures underlying epileptiform activity. Our results, both from simulations and real-world data, suggest that OPTNs can provide a complementary and robust approach to infer causal effect networks from multivariate observational data.
    [Show full text]
  • Disentangling Causal Webs in the Brain Using Functional Magnetic Resonance Imaging: a Review of Current Approaches
    REVIEW Disentangling causal webs in the brain using functional magnetic resonance imaging: A review of current approaches Natalia Z. Bielczyk 1,2, Sebo Uithol 1,3, Tim van Mourik 1,2,PaulAnderson 1,4, Jeffrey C. Glennon1,2, and Jan K. Buitelaar 1,2 1Donders Institute for Brain, Cognition and Behavior, Nijmegen, the Netherlands 2Department of Cognitive Neuroscience, Radboud University Nijmegen Medical Centre, Nijmegen, the Netherlands 3Bernstein Centre for Computational Neuroscience, Charité Universitätsmedizin, Berlin, Germany 4Faculty of Science, Radboud University Nijmegen, Nijmegen, the Netherlands an open access journal Keywords: Causal inference, Effective connectivity, Functional Magnetic Resonance Imaging, Dynamic Causal Modeling, Granger Causality, Structural Equation Modeling, Bayesian Nets, Directed Acyclic Graphs, Pairwise inference, Large-scale brain networks ABSTRACT In the past two decades, functional Magnetic Resonance Imaging (fMRI) has been used to relate neuronal network activity to cognitive processing and behavior. Recently this approach has been augmented by algorithms that allow us to infer causal links between component populations of neuronal networks. Multiple inference procedures have been proposed to approach this research question but so far, each method has limitations when it comes to establishing whole-brain connectivity patterns. In this paper, we discuss eight ways to infer causality in fMRI research: Bayesian Nets, Dynamical Causal Modelling, Granger Citation: Bielczyk, N. Z., Uithol, S., Causality, Likelihood Ratios, Linear Non-Gaussian Acyclic Models, Patel’s Tau, Structural van Mourik, T., Anderson, P., Glennon, J. C., & Buitelaar, J. K. (2019). Equation Modelling, and Transfer Entropy. We finish with formulating some recommendations Disentangling causal webs in the brain using functional magnetic resonance for the future directions in this area.
    [Show full text]
  • Exploratory Causal Analysis with Time Series Data Synthesis Lectures on Data Mining and Knowledge Discovery
    Exploratory Causal Analysis with Time Series Data Synthesis Lectures on Data Mining and Knowledge Discovery Editors Jiawei Han, University of Illinois at Urbana-Champaign Lise Getoor, University of Maryland Wei Wang, University of North Carolina, Chapel Hill Johannes Gehrke, Cornell University Robert Grossman, University of Chicago Synthesis Lectures on Data Mining and Knowledge Discovery is edited by Jiawei Han, Lise Getoor, Wei Wang, Johannes Gehrke, and Robert Grossman. e series publishes 50- to 150-page publications on topics pertaining to data mining, web mining, text mining, and knowledge discovery, including tutorials and case studies. Potential topics include: data mining algorithms, innovative data mining applications, data mining systems, mining text, web and semi-structured data, high performance and parallel/distributed data mining, data mining standards, data mining and knowledge discovery framework and process, data mining foundations, mining data streams and sensor data, mining multi-media data, mining social networks and graph data, mining spatial and temporal data, pre-processing and post-processing in data mining, robust and scalable statistical methods, security, privacy, and adversarial data mining, visual data mining, visual analytics, and data visualization. Exploratory Causal Analysis with Time Series Data James M. McCracken 2016 Mining Human Mobility in Location-Based Social Networks Huiji Gao and Huan Liu 2015 Mining Latent Entity Structures Chi Wang and Jiawei Han 2015 Probabilistic Approaches to Recommendations Nicola Barbieri, Giuseppe Manco, and Ettore Ritacco 2014 iii Outlier Detection for Temporal Data Manish Gupta, Jing Gao, Charu Aggarwal, and Jiawei Han 2014 Provenance Data in Social Media Geoffrey Barbier, Zhuo Feng, Pritam Gundecha, and Huan Liu 2013 Graph Mining: Laws, Tools, and Case Studies D.
    [Show full text]
  • On the Use of Transfer Entropy to Investigate the Time Horizon of Causal Influences Between Signals
    entropy Article On the Use of Transfer Entropy to Investigate the Time Horizon of Causal Influences between Signals Andrea Murari 1, Michele Lungaroni 2 ID , Emmanuele Peluso 2,*, Pasquale Gaudio 2 ID , Ernesto Lerche 3,4, Luca Garzotti 3, Michela Gelfusa 2 ID and JET Contributors † 1 Consorzio RFX (CNR, ENEA, INFN, Universita’ di Padova, Acciaierie Venete SpA), I-35127 Padova, Italy; [email protected] 2 Associazione EUROfusion—University of Rome “Tor Vergata”, Via Orazio Raimondo, 18, 00173 Roma, Italy; [email protected] (M.L.); [email protected] (P.G.); [email protected] (M.G.) 3 EUROfusion Programme Management Unit, JET, Culham Science Centre, Abingdon OX14 3DB, UK; [email protected] (E.L.); [email protected] (L.G.) 4 LPP-ERM/KMS, Association EUROFUSION-Belgian State, TEC partner, Brussels 1000, Belgium * Correspondence: [email protected]; Tel.: +39-06-7259-7196 † See the author list of “X. Litaudon et al., 2017 Nucl. Fusion 57 102001. Received: 28 June 2018; Accepted: 15 August 2018; Published: 22 August 2018 Abstract: Understanding the details of the correlation between time series is an essential step on the route to assessing the causal relation between systems. Traditional statistical indicators, such as the Pearson correlation coefficient and the mutual information, have some significant limitations. More recently, transfer entropy has been proposed as a powerful tool to understand the flow of information between signals. In this paper, the comparative advantages of transfer entropy, for determining the time horizon of causal influence, are illustrated with the help of synthetic data. The technique has been specifically revised for the analysis of synchronization experiments.
    [Show full text]
  • Arxiv:1706.01954V2 [Stat.ME] 5 Oct 2017 Models and Predictions by Themselves
    SPARSE CAUSALITY NETWORK RETRIEVAL FROM SHORT TIME SERIES TOMASO ASTE1;2;3 AND T. DI MATTEO1;2;3;4 1 Department of Computer Science, UCL, London, UK 2 UCL Centre for Blockchain Technologies, UCL, London, UK 3 Systemic Risk Centre, London School of Economics and Political Sciences, London, UK 4 Department of Mathematics King's College London, London, UK Abstract. We investigate how efficiently a known underlying sparse causality structure of a sim- ulated multivariate linear process can be retrieved from the analysis of time-series of short lengths. Causality is quantified from conditional transfer entropy and the network is constructed by retaining only the statistically validated contributions. We compare results from three methodologies: two commonly used regularization methods, Glasso and ridge, and a newly introduced technique, LoGo, based on the combination of information filtering network and graphical modelling. For these three methodologies we explore the regions of time series lengths and model-parameters where a signifi- cant fraction of true causality links is retrieved. We conclude that, when time-series are short, with their lengths shorter than the number of variables, sparse models are better suited to uncover true causality links with LoGo retrieving the true causality network more accurately than Glasso and ridge. Keywords: LoGo, Sparse Modelling, Information Filtering Networks, Graphical Modeling, Machine Learning. 1. Introduction Establishing causal relations between variables from observation of their behaviour in time is central to scientific investigation and it is at the core of data-science where these causal relations are the basis for the construction of useful models and tools capable of prediction.
    [Show full text]