The Synaptic Weight Matrix

Total Pages: 16

File Type: PDF, Size: 1020 KB

The Synaptic Weight Matrix: Dynamics, Symmetry Breaking, and Disorder

Francesco Fumarola

Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences, Columbia University, 2018.

© 2018 Francesco Fumarola. All rights reserved.

ABSTRACT

A key role in simplified models of neural circuitry (Wilson and Cowan, 1972) is played by the matrix of synaptic weights, also called the connectivity matrix, whose elements describe the amount of influence the firing of one neuron has on another. Biologically, this matrix evolves over time whether or not sensory inputs are present, and symmetries possessed by the internal dynamics of the network may break spontaneously, as found in the development of the visual cortex (Hubel and Wiesel, 1977).

In this thesis, a full analytical treatment is provided for the simplest case of such a biological phenomenon: a single dendritic arbor driven by correlation-based dynamics (Linsker, 1988). Borrowing methods from the theory of Schrödinger operators, a complete study of the model is performed, analytically describing the breaking of rotational symmetry that leads to the functional specialization of cells. The structure of the eigenfunctions is calculated, lower bounds are derived on the critical value of the correlation length, and explicit expressions are obtained for the long-term profile of receptive fields, i.e. the dependence of neural activity on external inputs.

The emergence of a functional architecture of orientation preferences in the cortex is another crucial feature of visual information processing. This is discussed through a model consisting of large neural layers connected by an infinite number of Hebb-evolved arbors. Ohshiro and Weliky (2006), in their study of developing ferrets, found correlation profiles of neural activity in contradiction with previous theories of the phenomenon (Miller, 1994; Wimbauer, 1998). The theory proposed herein, based upon the type of correlations they measured, leads to the emergence of three different symmetry classes. The contours of a highly structured phase diagram are traced analytically, and observables concerning the various phases are estimated by means of perturbative, asymptotic, and variational methods. The proper modeling of axonal competition proves to be key to reproducing basic phenomenological features.

While these models describe the long-term effect of synaptic plasticity, plasticity itself makes the connectivity matrix highly dependent on particular histories, hence its stochasticity cannot be treated perturbatively. The problem is tackled by carrying out a detailed treatment of the spectral properties of synaptic-weight matrices with an arbitrary distribution of disorder. Results include a proof of the asymptotic compactness of random spectra, calculations of the shape of supports and of the density profiles, a fresh analysis of the problem of spectral outliers, a study of the link between eigenvalue density and the pseudospectrum of the mean connectivity, and applications of these general results to a variety of biologically relevant examples.

The strong non-normality of synaptic-weight matrices (biologically engineered through Dale's law) is believed to play important functional roles in cortical operations (Murphy and Miller, 2009; Goldman, 2009). Accordingly, a comprehensive study is dedicated to its effect on the transient dynamics of large disordered networks. This is done by adapting standard field-theoretical methods (such as the summation of ladder diagrams) to the non-Hermitian case. Transient amplification of activity can be measured from the average squared norm; this is calculated explicitly for a number of cases, showing that transients are typically amplified by disorder. Explicit expressions for the power spectrum of the response are derived and applied to a number of biologically plausible networks, yielding insights into the interplay between disorder and non-normality. The fluctuations of the covariance of noisy neural activity are also briefly discussed.

Recent optogenetic measurements have raised questions on the link between synaptic structure and the response of disordered networks to targeted perturbations. In response to these developments, formulae are derived that establish the operational regime of networks through their response to specific perturbations, and a minimal threshold is found to exist for counterintuitive responses of inhibition-stabilized circuits such as those reported in Ozeki et al. (2016), Adesnik (2016), and Kato et al. (2017). Experimental advances are also bringing to light unsuspected differences between various neuron types, which appear to translate into different roles in network function (Pfeffer et al., 2013; Tremblay et al., 2016). Accordingly, the last part of the thesis focuses on networks with an arbitrary number of neuronal types, and predictions are provided for the response of networks with a multipopulation structure to targeted input perturbations.
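As an illustration of the correlation-based dynamics treated in the first part of the thesis, the sketch below evolves a single arbor's synaptic weights under a Gaussian input-correlation operator with a sum-preserving homeostatic projection and reads off the long-term receptive-field profile from the leading eigenmode. The grid, kernel, and constraint are illustrative assumptions, not the thesis's actual model or parameters.

```python
# Minimal sketch of correlation-based (Hebbian) receptive-field development,
# in the spirit of a Linsker-type single arbor. Grid size, Gaussian correlation
# length, and the projection-style homeostatic constraint are all assumptions
# made for illustration only.
import numpy as np

n = 41                                  # synapses arranged on a 1-D grid
x = np.linspace(-1.0, 1.0, n)
corr_len = 0.3                          # input correlation length (assumed)

# Correlation matrix of presynaptic activity: C_ij = exp(-(x_i - x_j)^2 / 2L^2)
C = np.exp(-(x[:, None] - x[None, :]) ** 2 / (2 * corr_len ** 2))

# Homeostatic constraint: keep only weight changes that leave the total
# synaptic strength unchanged (one common choice of constraint).
P = np.eye(n) - np.ones((n, n)) / n
A = P @ C @ P                           # constrained time-evolution operator

# Long-term receptive field ~ leading eigenvector of the constrained operator.
eigvals, eigvecs = np.linalg.eigh(A)
rf = eigvecs[:, -1]
print("leading growth rate:", round(float(eigvals[-1]), 3))
print("receptive-field profile:")
print(np.round(rf, 2))
```

In this toy setting the leading constrained mode changes sign across the arbor, a crude analogue of the rotational-symmetry breaking and ON/OFF segregation analyzed in the thesis.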
Contents

Acknowledgments

1 Introduction

2 Biological foundations
  2.1 Fundamentals of neural electrodynamics
  2.2 Derivation of rate models
  2.3 Range of validity
  2.4 Equivalence of alternative rate models

3 Hebbian dynamics and the Linsker cell
  3.1 Hebbian dynamics in the visual cortex
    3.1.1 Hebbian dynamics
    3.1.2 Visual cortex development
    3.1.3 Hebbian theories of orientation selectivity
  3.2 Diagonalization of the Linsker operator
    3.2.1 Diagonalization in polar coordinates
    3.2.2 Diagonalization in Cartesian coordinates
    3.2.3 Connecting the polar and Cartesian representations
  3.3 Homeostatic constraints
    3.3.1 Positive semidefiniteness of the constrained operator
    3.3.2 Matrix elements of the constrained operator
  3.4 Diagonalization of the constrained model
    3.4.1 Long-range solution
    3.4.2 Moveable node theory
    3.4.3 A lower bound for the transition point
  3.5 Conclusions

4 A multilayer model of orientation development
  4.1 The Hebbian theory of orientation maps
    4.1.1 Review of the experimental literature
    4.1.2 Unconstrained dynamics of the model
    4.1.3 Homeostatic constraint
    4.1.4 Projection operators
  4.2 Properties of the time-evolution operator
    4.2.1 Matrix elements
    4.2.2 Fourier transform
    4.2.3 Positive semidefiniteness
    4.2.4 Long-term dynamics
    4.2.5 Symmetries of the system: translations and rotations
    4.2.6 Symmetries of the system: parity and CP symmetry
    4.2.7 Diagonalization of the unconstrained dynamics
  4.3 Perturbation theory for short correlations
    4.3.1 Solution with uncorrelated input
    4.3.2 Optimal wavenumber for oriented eigenfunctions
    4.3.3 Optimal wavenumber for non-oriented eigenfunctions
  4.4 Forbidden regions for the N- and R-phases
  4.5 The long-range limit
    4.5.1 The long-range limit: derivation
    4.5.2 The long-range limit: analysis of results
  4.6 Cartesian-basis expansion
    4.6.1 Combining s-waves with longitudinal eigenfunctions (0x theory)
    4.6.2 Short-range limit
    4.6.3 Local stability of the R-phase
    4.6.4 0x eigenfunction
    4.6.5 The uniform phase
  4.7 The N-phase domain
    4.7.1 Global stability of the 0x solution
    4.7.2 Shape of the phase boundary
    4.7.3 Corrections to the variational results
  4.8 Numerics and conclusions
    4.8.1 Numerical results
    4.8.2 Conclusions

5 The Spectral Properties of Weight Matrices
  5.1 The open problem of synaptic stochasticity
    5.1.1 Background and motivation
    5.1.2 Random matrices beyond quantum mechanics
    5.1.3 Outline of the treatment
  5.2 General results on spectral properties
    5.2.1 Preliminary definitions
    5.2.2 The shape of supports and the density profiles
    5.2.3 Formulation in terms of singular values
    5.2.4 Purely random synaptic weight matrices
    5.2.5 Ordering of limits
    5.2.6 Relationship to pseudospectra
  5.3 Calculation of spectral properties
    5.3.1 The shape of supports and density profiles
    5.3.2 Singular value formulation
    5.3.3 Validity of the non-crossing approximation
    5.3.4 Proving the compactness of the supports
    5.3.5 The case of zero mean connectivity
  5.4 Applications to specific networks
    5.4.1 Schur structure of the mean connectivity
    5.4.2 Networks with a single feedforward chain
    5.4.3 Examples motivated by Dale's law
    5.4.4 Linearizations of nonlinear networks
  5.5 Conclusions

6 Disordered neural dynamics
  6.1 Dynamical functions of non-normality
  6.2 General results
    6.2.1 Preliminary remarks
    6.2.2 Result for the average squared norm
    6.2.3 Results for the power spectrum
    6.2.4 The amplifying effects of disorder
    6.2.5 Conditions for applicability
  6.3 Derivation of the general results
    6.3.1 Formulation in terms of diffusive kernel
    6.3.2 Summing up the diffuson
  6.4 Applications to specific networks
    6.4.1 Feedforward chain with alternating eigenvalues
    6.4.2 Feedforward chain with null eigenvalues
    6.4.3 Network with two-state feedforward components
  6.5 Beyond the non-crossing approximation
  6.6 Disorder fluctuations of the covariance

7 Response to targeted input perturbations
  7.1 Measuring the response to input perturbations
  7.2 Modeling paradoxical responses
    7.2.1 Are excitatory subnetworks independently stable?
    7.2.2 A uniform excitatory-inhibitory network
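The table of contents ends with the chapter on paradoxical responses of inhibition-stabilized circuits. As a minimal illustration of that regime, the sketch below solves a linear two-population rate model in which extra excitatory drive to the inhibitory population lowers its steady-state rate; the weights and inputs are illustrative assumptions, not values from the thesis.

```python
# Sketch of the "paradoxical" response of an inhibition-stabilized network
# (ISN): extra excitatory drive to the inhibitory population *lowers* its
# steady-state rate. All weights and inputs are illustrative assumptions.
import numpy as np

# Linear two-population rate model, dr/dt = -r + W r + h (unit time constants).
W = np.array([[2.5, -2.0],    # E<-E, E<-I
              [3.0, -1.5]])   # I<-E, I<-I
h0 = np.array([1.0, 1.0])     # baseline external drive to (E, I)

def steady_state(h):
    # Fixed point of -r + W r + h = 0, i.e. r = (I - W)^{-1} h.
    return np.linalg.solve(np.eye(2) - W, h)

r_base = steady_state(h0)
r_pert = steady_state(h0 + np.array([0.0, 0.2]))   # extra drive to I cells

print("E, I rates before perturbation:", np.round(r_base, 3))
print("E, I rates after perturbation: ", np.round(r_pert, 3))
# Because the excitatory subnetwork is unstable on its own (W[0, 0] > 1),
# the inhibitory rate drops even though inhibitory cells received extra input.
```

The effect hinges on the excitatory subnetwork being unstable in isolation, which is exactly the question raised in Section 7.2.1.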
Recommended publications
  • Synaptic Plasticity in Correlated Balanced Networks
Alan Eric Akil (Department of Mathematics, University of Houston, Houston, Texas, USA), Robert Rosenbaum (Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, Notre Dame, Indiana, USA), and Krešimir Josić (Department of Mathematics, University of Houston, Houston, Texas, USA). bioRxiv preprint, doi: https://doi.org/10.1101/2020.04.26.061515, posted April 28, 2020 (CC-BY-NC-ND 4.0). Abstract: The dynamics of local cortical networks are irregular, but correlated. Dynamic excitatory-inhibitory balance is a plausible mechanism that generates such irregular activity, but it remains unclear how balance is achieved and maintained in plastic neural networks. In particular, it is not fully understood how plasticity-induced changes in the network affect balance, and in turn, how correlated, balanced activity impacts learning. How do the dynamics of balanced networks change under different plasticity rules? How does correlated spiking activity in recurrent networks change the evolution of weights, their eventual magnitude, and structure across the network? To address these questions, we develop a general theory of plasticity in balanced networks.
  • Energy Efficient Synaptic Plasticity
Research article. Ho Ling Li (School of Psychology, University of Nottingham, Nottingham, United Kingdom) and Mark CW van Rossum (School of Psychology and School of Mathematical Sciences, University of Nottingham, Nottingham, United Kingdom). Abstract: Many aspects of the brain's design can be understood as the result of evolutionary drive toward metabolic efficiency. In addition to the energetic costs of neural computation and transmission, experimental evidence indicates that synaptic plasticity is metabolically demanding as well. As synaptic plasticity is crucial for learning, we examine how these metabolic costs enter in learning. We find that when synaptic plasticity rules are naively implemented, training neural networks requires extremely large amounts of energy when storing many patterns. We propose that this is avoided by precisely balancing labile forms of synaptic plasticity with more stable forms. This algorithm, termed synaptic caching, boosts energy efficiency manifold and can be used with any plasticity rule, including back-propagation. Our results yield a novel interpretation of the multiple forms of neural synaptic plasticity observed experimentally, including synaptic tagging and capture phenomena. Furthermore, our results are relevant for energy-efficient neuromorphic designs. Introduction: The human brain only weighs 2% of the total body mass but is responsible for 20% of resting metabolism (Attwell and Laughlin, 2001; Harris et al., 2012). The brain's energy need is believed to have shaped many aspects of its design, such as its sparse coding strategy (Levy and Baxter, 1996; Lennie, 2003), the biophysics of the mammalian action potential (Alle et al., 2009; Fohlmeister, 2009), and synaptic failure (Levy and Baxter, 2002; Harris et al., 2012).
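The synaptic-caching idea can be sketched with a toy cost model in which labile changes are cheap and only consolidation into stable weights pays the full energy price; the stand-in plasticity rule, threshold, and costs below are illustrative assumptions, not the paper's actual algorithm or parameters.

```python
# Toy sketch of "synaptic caching": accumulate weight updates in a cheap,
# labile buffer and only consolidate them into stable weights when the
# buffered change is large. Costs and threshold are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
n_syn = 100
w_stable = np.zeros(n_syn)      # persistent weights (expensive to change)
w_labile = np.zeros(n_syn)      # transient buffer (cheap to change)
theta = 0.5                     # consolidation threshold (assumed)
cost_labile, cost_stable = 0.01, 1.0   # energy per unit weight change (assumed)
energy = 0.0

for step in range(1000):
    dw = 0.05 * rng.standard_normal(n_syn)      # stand-in for any plasticity rule
    w_labile += dw
    energy += cost_labile * np.abs(dw).sum()
    big = np.abs(w_labile) > theta               # consolidate only large changes
    energy += cost_stable * np.abs(w_labile[big]).sum()
    w_stable[big] += w_labile[big]
    w_labile[big] = 0.0

print("effective weights (first 5):", np.round((w_stable + w_labile)[:5], 2))
print("total energy spent:", round(energy, 1))
```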
  • A Synapse-Centric Account of the Free Energy Principle (arXiv:2103.12649v1 [q-bio.NC], 23 Mar 2021)
David Kappel and Christian Tetzlaff, March 24, 2021. Abstract: The free energy principle (FEP) is a mathematical framework that describes how biological systems self-organize and survive in their environment. This principle provides insights on multiple scales, from high-level behavioral and cognitive functions such as attention or foraging, down to the dynamics of specialized cortical microcircuits, suggesting that the FEP manifests on several levels of brain function. Here, we apply the FEP to one of the smallest functional units of the brain: single excitatory synaptic connections. By focusing on an experimentally well understood biological system we are able to derive learning rules from first principles while keeping assumptions minimal. This synapse-centric account of the FEP predicts that synapses interact with the soma of the post-synaptic neuron through stochastic synaptic releases to probe their behavior and use back-propagating action potentials as feedback to update the synaptic weights. The emergent learning rules are regulated triplet STDP rules that depend only on the timing of the pre- and post-synaptic spikes and the internal states of the synapse. The parameters of the learning rules are fully determined by the parameters of the post-synaptic neuron model, suggesting a close interplay between the synaptic and somatic compartment and making precise predictions about the synaptic dynamics. The synapse-level uncertainties automatically lead to representations of uncertainty on the network level that manifest in ambiguous situations. We show that the FEP learning rules can be applied to spiking neural networks for supervised and unsupervised learning and for a closed loop learning task where a behaving agent interacts with an environment.
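For readers unfamiliar with triplet STDP, the snippet below implements a generic (unregulated) triplet rule of the Pfister-Gerstner type with three exponential traces; the parameters are arbitrary, and this is a reference point only, not the regulated rules derived from the free energy principle in the paper.

```python
# Generic triplet STDP (Pfister & Gerstner 2006 style), kept as a minimal
# reference point; parameters are arbitrary and this is NOT the regulated
# rule derived from the free energy principle in the paper above.
import numpy as np

dt = 1.0  # ms
tau_plus, tau_minus, tau_y = 17.0, 34.0, 114.0
A2_plus, A3_plus, A2_minus = 5e-3, 6e-3, 7e-3

def run_triplet_stdp(pre_spikes, post_spikes, T, w=0.5):
    r1 = o1 = o2 = 0.0          # pre trace, fast post trace, slow post trace
    for t in np.arange(0.0, T, dt):
        if t in pre_spikes:      # depression on presynaptic spike
            w -= A2_minus * o1
            r1 += 1.0
        if t in post_spikes:     # potentiation on postsynaptic spike
            w += r1 * (A2_plus + A3_plus * o2)   # o2 taken before its update
            o1 += 1.0
            o2 += 1.0
        r1 *= np.exp(-dt / tau_plus)
        o1 *= np.exp(-dt / tau_minus)
        o2 *= np.exp(-dt / tau_y)
    return w

# Pre-before-post pairing at a 10 ms lag, repeated every 100 ms for 1 s.
pre = set(np.arange(0.0, 1000.0, 100.0))
post = set(np.arange(10.0, 1000.0, 100.0))
print("weight after pairing:", round(run_triplet_stdp(pre, post, 1000.0), 4))
```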
  • Improved Expressivity Through Dendritic Neural Networks
Xundong Wu, Xiangwen Liu, Wei Li, and Qing Wu (School of Computer Science and Technology, Hangzhou Dianzi University, Hangzhou, China). A typical biological neuron, such as a pyramidal neuron of the neocortex, receives thousands of afferent synaptic inputs on its dendrite tree and sends the efferent axonal output downstream. In typical artificial neural networks, dendrite trees are modeled as linear structures that funnel weighted synaptic inputs to the cell bodies. However, numerous experimental and theoretical studies have shown that dendritic arbors are far more than simple linear accumulators. That is, synaptic inputs can actively modulate their neighboring synaptic activities; therefore, the dendritic structures are highly nonlinear. In this study, we model such local nonlinearity of dendritic trees with our dendritic neural network (DENN) structure and apply this structure to typical machine learning tasks. Equipped with localized nonlinearities, DENNs can attain greater model expressivity than regular neural networks while maintaining efficient network inference. Such strength is evidenced by the increased fitting power when we train DENNs with supervised machine learning tasks. We also empirically show that the locality structure of DENNs can improve the generalization performance, as exemplified by DENNs outranking naive deep neural network architectures when tested on classification tasks from the UCI machine learning repository. 1 Introduction: Deep learning algorithms have made remarkable achievements in a vast array of fields over the past few years. Notably, deep convolutional neural networks have revolutionized computer vision research and real-world applications. Inspired by biological neuronal networks in our brains, originally in the form of a perceptron [37], an artificial neural network unit is typically constructed as a simple weighted sum of synaptic inputs followed by feeding the summation result through an activation function.
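The contrast between a linear accumulator and branch-local nonlinearities can be made concrete with a small sketch: inputs are grouped onto branches, each branch applies its own nonlinearity, and the soma sums the branch outputs. The grouping, branch count, and choice of nonlinearity below are illustrative assumptions, not the DENN architecture as specified in the paper.

```python
# Minimal sketch of a dendrite-style unit: synaptic inputs are split across
# branches, each branch applies a local nonlinearity, and the soma sums the
# branch outputs. Grouping and nonlinearities are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)

def point_neuron(x, w):
    # Conventional unit: one global weighted sum, then a nonlinearity.
    return np.tanh(w @ x)

def dendritic_neuron(x, w, n_branches=4):
    # Split the weighted inputs across branches, apply a branch-local
    # nonlinearity, then combine at the soma.
    contributions = np.split(w * x, n_branches)
    branch_out = [np.tanh(c.sum()) for c in contributions]
    return np.tanh(sum(branch_out))

x = rng.standard_normal(16)      # 16 synaptic inputs
w = rng.standard_normal(16)      # synaptic weights
print("point neuron output:    ", round(float(point_neuron(x, w)), 3))
print("dendritic neuron output:", round(float(dendritic_neuron(x, w)), 3))
```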
  • Synaptic Plasticity As Bayesian Inference
Laurence Aitchison, Jannes Jegminat, Jorge Aurelio Menendez, Jean-Pascal Pfister, Alexandre Pouget and Peter E. Latham. Article, https://doi.org/10.1038/s41593-021-00809-5. Abstract: Learning, especially rapid learning, is critical for survival. However, learning is hard; a large number of synaptic weights must be set based on noisy, often ambiguous, sensory information. In such a high-noise regime, keeping track of probability distributions over weights is the optimal strategy. Here we hypothesize that synapses take that strategy; in essence, when they estimate weights, they include error bars. They then use that uncertainty to adjust their learning rates, with more uncertain weights having higher learning rates. We also make a second, independent, hypothesis: synapses communicate their uncertainty by linking it to variability in postsynaptic potential size, with more uncertainty leading to more variability. These two hypotheses cast synaptic plasticity as a problem of Bayesian inference, and thus provide a normative view of learning. They generalize known learning rules, offer an explanation for the large variability in the size of postsynaptic potentials and make falsifiable experimental predictions. From the introduction: To survive, animals must accurately estimate the state of the world. This estimation problem is plagued by uncertainty: not only is information often extremely limited (for example, because it is dark) or ambiguous (for example, a rustle in the bushes could be the wind, or it could be a predator), but sensory receptors [...]. Bayesian plasticity is a hypothesis about what synapses compute. It does not, however, tell synapses how to set their weights; for that, a second hypothesis is needed.
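A scalar caricature of the two hypotheses: a synapse keeps a Gaussian belief over its weight, updates it with a Kalman-like gain (so the learning rate grows with uncertainty), and transmits with variability set by that same uncertainty. The generative model and constants below are assumptions for illustration, not the learning rule analyzed in the paper.

```python
# Scalar sketch of "Bayesian plasticity": a synapse tracks a Gaussian belief
# (mu, var) over its weight. The Kalman-like update makes the learning rate
# proportional to the current variance, and the variance also sets the
# trial-to-trial variability of the transmitted weight. Illustrative only.
import numpy as np

rng = np.random.default_rng(2)
w_true = 0.8                # target weight implied by the environment (assumed)
noise_var = 0.5             # observation-noise variance (assumed)
mu, var = 0.0, 1.0          # prior belief over the weight

for trial in range(50):
    obs = w_true + rng.normal(0.0, np.sqrt(noise_var))   # noisy feedback signal
    gain = var / (var + noise_var)        # learning rate grows with uncertainty
    mu = mu + gain * (obs - mu)
    var = (1.0 - gain) * var + 0.001      # small drift keeps some uncertainty
    w_sample = rng.normal(mu, np.sqrt(var))  # PSP size varies with uncertainty

print("posterior mean:", round(mu, 3), " posterior sd:", round(float(np.sqrt(var)), 3))
```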
  • Mathematical Models of Dynamic Behavior of Individual Neural Networks of Central Nervous System
Dimitra-Despoina Pagania (1), Adam Adamopoulos (1,2) and Spiridon D. Likothanassis (1). (1) Pattern Recognition Laboratory, Dept. of Computer Engineering & Informatics, University of Patras, 26500, Patras, Greece; (2) Dept. of Medicine, Democritus University of Thrace, Alexandroupolis, Greece. In: 12th Engineering Applications of Neural Networks (EANN 2011) and 7th Artificial Intelligence Applications and Innovations (AIAI), Sep 2011, Corfu, Greece, pp. 433-442, doi: 10.1007/978-3-642-23957-1_48. HAL Id: hal-01571328, https://hal.inria.fr/hal-01571328, submitted on 2 Aug 2017, distributed under a Creative Commons Attribution 4.0 International License. Abstract: We present mathematical models that describe individual neural networks of the Central Nervous System. Three cases are examined, varying in each case the values of the refractory period and the synaptic delay of a neuron.
  • Synaptic Mechanisms of Memory Consolidation During Sleep Slow Oscillations
The Journal of Neuroscience, April 13, 2016, 36(15):4231-4247. Cellular/Molecular. Yina Wei, Giri P. Krishnan, and Maxim Bazhenov, Department of Cell Biology and Neuroscience, University of California at Riverside, Riverside, California 92521. Abstract: Sleep is critical for regulation of synaptic efficacy, memories, and learning. However, the underlying mechanisms of how sleep rhythms contribute to consolidating memories acquired during wakefulness remain unclear. Here we studied the role of slow oscillations, 0.2–1 Hz rhythmic transitions between Up and Down states during stage 3/4 sleep, on dynamics of synaptic connectivity in the thalamocortical network model implementing spike-timing-dependent synaptic plasticity. We found that the spatiotemporal pattern of Up-state propagation determines the changes of synaptic strengths between neurons. Furthermore, an external input, mimicking hippocampal ripples, delivered to the cortical network results in input-specific changes of synaptic weights, which persisted after stimulation was removed. These synaptic changes promoted replay of specific firing sequences of the cortical neurons. Our study proposes a neuronal mechanism on how an interaction between hippocampal input, such as mediated by sharp wave-ripple events, cortical slow oscillations, and synaptic plasticity may lead to consolidation of memories through preferential replay of cortical cell spike sequences during slow-wave sleep. Key words: cortex; hippocampus; learning and memory; memory consolidation; sleep slow oscillations; thalamus. Significance Statement: Sleep is critical for memory and learning. Replay during sleep of temporally ordered spike sequences related to a recent experience was proposed to be a neuronal substrate of memory consolidation.
  • Synaptic Weight Modulation and Adaptation Part I: Introduction and Presynaptic Mechanisms
A Technical Strategy Brief for the Neurofuzzy Soft Computing Program. R. Wells, May 15, 2003. The Biological Basis of Synaptic Weight: A "weight" in an artificial neuron model is meant to be a representation of the amount of effect that a synaptic input has on the overall response of the neuron. The neuron that generates the synaptic input (the "action potential" or AP) is called the "presynaptic neuron"; the neuron receiving the effect of the action potential is called the "postsynaptic neuron". The synapse is the biological structure that connects these two neurons. As such, "the" synapse belongs to neither the presynaptic nor the postsynaptic neuron. The part of the synaptic structure contained within the cell of the presynaptic neuron is called the "presynaptic terminal". The part of the synaptic structure contained within the cell of the postsynaptic neuron is called the "postsynaptic compartment". Between these two is a small, liquid-filled gap called the "synaptic cleft." Transmission from the presynaptic terminal to the postsynaptic compartment takes place by chemical signaling. The presynaptic terminal secretes particular molecules, called neurotransmitters (NTXs), into the cleft. These NTXs bind to receptor proteins on the surface of the postsynaptic compartment, and this chemical binding causes electro-chemical reactions to take place in the postsynaptic cell. Figure 1 illustrates the generic form of a synapse. The "weight" of a synapse can therefore be regarded as a measure of the amount of postsynaptic response that occurs when an action potential invades the presynaptic terminal.
  • A New Learning Method for Inference Accuracy, Core Occupation, and Performance Co-Optimization on TrueNorth Chip
Wei Wen, Chunpeng Wu, Yandan Wang, Kent Nixon, Hai Li, and Yiran Chen (University of Pittsburgh, Pittsburgh, PA, USA); Qing Wu and Mark Barnell (Air Force Research Laboratory, Rome, NY, USA). Abstract: IBM TrueNorth chip uses digital spikes to perform neuromorphic computing and achieves ultrahigh execution parallelism and power efficiency. However, in TrueNorth chip, low quantization resolution of the synaptic weights and spikes significantly limits the inference (e.g., classification) accuracy of the deployed neural network model. Existing workaround, i.e., averaging the results over multiple copies instantiated in spatial and temporal domains, rapidly exhausts the hardware resources and slows down the computation. In this work, we propose a novel learning method on TrueNorth platform that constrains the random variance of each computation copy and reduces the number of needed copies. From the introduction: Besides the stochastic scheme, TrueNorth also supports many deterministic neural coding schemes (i.e., rate code, population code, time-to-spike code and rank code [9]) in temporal domain to represent non-binary message using multiple binary spikes. Rate code, for example, uses the number of the spikes occurring within a predefined number of time steps to indicate the amplitude of a signal [7]. However, this "official" workaround rapidly exhausts hardware resources when the network scale increases and a large number of spatial samples (copies) are instantiated. It also slows down the computation when many spike samples are generated. In this work, we first formulate the standard TrueNorth learning [...]
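The rate code mentioned in the excerpt is easy to illustrate: a value in [0, 1] is encoded by the number of binary spikes emitted within a fixed window and decoded by counting. The window length and the stochastic variant below are illustrative assumptions, not TrueNorth's exact configuration.

```python
# Sketch of rate coding with binary spikes: encode a value in [0, 1] as the
# number of spikes within a fixed window of time steps, decode by counting.
# Window length and the stochastic variant are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(3)
T = 16                                   # time steps per window (assumed)

def encode_deterministic(value, T=T):
    k = int(round(value * T))            # emit k spikes out of T steps
    return np.array([1] * k + [0] * (T - k))

def encode_stochastic(value, T=T):
    return (rng.random(T) < value).astype(int)   # Bernoulli spikes

def decode(spikes):
    return spikes.sum() / len(spikes)

value = 0.63
det = encode_deterministic(value)
sto = encode_stochastic(value)
print("deterministic:", det, "-> decoded", decode(det))
print("stochastic:   ", sto, "-> decoded", decode(sto))
```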
  • Drawing Inspiration from Biological Dendrites to Empower Artificial Neural Networks
Opinion article. Spyridon Chavlis and Panayiota Poirazi, Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Plastira Avenue 100, Heraklion, 70013, Greece. arXiv:2106.07490v1 [q-bio.NC], 14 Jun 2021. Abstract: This article highlights specific features of biological neurons and their dendritic trees, whose adoption may help advance artificial neural networks used in various machine learning applications. Advancements could take the form of increased computational capabilities and/or reduced power consumption. Proposed features include dendritic anatomy, dendritic nonlinearities, and compartmentalized plasticity rules, all of which shape learning and information processing in biological networks. We discuss the computational benefits provided by these features in biological neurons and suggest ways to adopt them in artificial neurons in order to exploit the respective benefits in machine learning. Keywords: Artificial Neural Networks, Biological dendrites, Plasticity. 1 Introduction: Artificial Neural Networks (ANN) implemented in Deep Learning (DL) architectures have been extremely successful in solving challenging machine learning (ML) problems such as image [1], speech [2], and face [3] recognition, playing online games [4], autonomous driving [5], etc. However, despite their huge successes, state-of-the-art DL architectures suffer from problems that seem rudimentary to real brains. For example, while becoming true experts in solving a specific task, they typically fail to transfer these skills to new problems without extensive retraining – a property known as "transfer learning".
  • Effects of Synaptic Weight Diffusion on Learning in Decision Making Networks
Kentaro Katahira (1,2,3), Kazuo Okanoya (1,3) and Masato Okada (1,2,3). (1) ERATO Okanoya Emotional Information Project, Japan Science and Technology Agency; (2) Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8561, Japan; (3) RIKEN Brain Science Institute, Wako, Saitama 351-0198, Japan. Abstract: When animals repeatedly choose actions from multiple alternatives, they can allocate their choices stochastically depending on past actions and outcomes. It is commonly assumed that this ability is achieved by modifications in synaptic weights related to decision making. Choice behavior has been empirically found to follow Herrnstein's matching law. Loewenstein & Seung (2006) demonstrated that matching behavior is a steady state of learning in neural networks if the synaptic weights change proportionally to the covariance between reward and neural activities. However, their proof did not take into account the change in entire synaptic distributions. In this study, we show that matching behavior is not necessarily a steady state of the covariance-based learning rule when the synaptic strength is sufficiently strong so that the fluctuations in input from individual sensory neurons influence the net input to output neurons. This is caused by the increasing variance in the input potential due to the diffusion of synaptic weights. This effect causes an undermatching phenomenon, which has been observed in many behavioral experiments. We suggest that the synaptic diffusion effects provide a robust neural mechanism for stochastic choice behavior.
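The covariance-based rule can be sketched in a two-alternative foraging task with baited rewards, the setting in which Herrnstein's matching law is usually stated (choice fractions match income fractions). The reward schedule, softmax readout, and learning rate below are illustrative assumptions, not the networks analyzed in the paper.

```python
# Sketch of covariance-driven learning in a two-alternative foraging task
# with baited ("concurrent VI"-style) rewards. Weights of the chosen option
# change in proportion to reward minus its running mean, an estimator of the
# covariance between reward and action-related activity. Illustrative only.
import numpy as np

rng = np.random.default_rng(4)
bait_prob = np.array([0.15, 0.05])   # per-trial baiting probability of each option
w = np.zeros(2)
eta, r_bar = 0.05, 0.0
baited = np.array([False, False])
choices, incomes = [], []

for trial in range(20000):
    baited = baited | (rng.random(2) < bait_prob)   # rewards persist until collected
    p_choose = np.exp(w) / np.exp(w).sum()          # softmax readout of the weights
    a = rng.choice(2, p=p_choose)
    reward = float(baited[a]); baited[a] = False
    r_bar += 0.01 * (reward - r_bar)
    activity = np.eye(2)[a]                         # activity of the chosen pool
    w += eta * (reward - r_bar) * activity          # covariance-like update
    choices.append(a); incomes.append(reward * activity)

choice_frac = 1.0 - np.mean(choices[-5000:])
income = np.sum(incomes[-5000:], axis=0)
print("choice fraction for option 0:", round(choice_frac, 2))
print("income fraction for option 0:", round(float(income[0] / income.sum()), 2))
print("matching law predicts these two fractions are approximately equal")
```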
  • Postsynaptic Potential Energy As Determinant of Synaptic Plasticity
bioRxiv preprint, doi: https://doi.org/10.1101/2020.08.26.269290, posted August 27, 2020. Huan-Wen Chen (School of Automation, Central South University, Changsha, China), Li-Juan Xie (Institute of Physiology and Psychology, Changsha University of Science and Technology, Changsha, China), Yi-Jun Wang and Hang Zhang (School of Automation, Central South University, Changsha, China). Abstract: Metabolic energy can be used as a unified principle to control neuronal activity. However, whether and how metabolic energy alone can determine the outcome of synaptic plasticity remains unclear. In this study, a computational model of synaptic plasticity that is completely determined by energy is proposed. A simple quantitative relationship between synaptic plasticity and postsynaptic potential energy is established. Synaptic weight is directly proportional to the difference between the baseline potential energy and the suprathreshold potential energy and is constrained by the maximum energy supply. Results show that the energy constraint improves the performance of synaptic plasticity and avoids setting the hard boundary of synaptic weights. With the same set of model parameters, our model can reproduce several classical experiments in homo- and heterosynaptic plasticity. The proposed model can explain the interaction mechanism of Hebbian and homeostatic plasticity at the cellular level, thereby providing a new way to deeply understand the characteristics of learning and memory. Introduction: Although the brain accounts for only 2% of the body mass, it consumes 20% of the resting metabolic energy produced by the whole body (Attwell and Laughlin, 2001; Harris et al., 2012).
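Taking the stated relationship literally, a toy calculation might look like the following: the weight scales with the difference between baseline and suprathreshold postsynaptic potential energy and is capped by a maximum energy supply. The energy functional used here (time-integral of the squared PSP) and all constants are assumptions for illustration; the paper's actual definitions may differ.

```python
# Toy reading of the stated relationship: weight ~ k * (E_base - E_supra),
# with the magnitude limited by a maximum energy supply. The PSP shape, the
# energy functional, and all constants are illustrative assumptions only.
import numpy as np

dt = 0.1                                   # ms
t = np.arange(0.0, 100.0, dt)

def psp(amplitude, tau_rise=2.0, tau_decay=20.0):
    # Double-exponential postsynaptic potential (illustrative shape).
    v = np.exp(-t / tau_decay) - np.exp(-t / tau_rise)
    return v / v.max() * amplitude

def potential_energy(v):
    return np.sum(v ** 2) * dt             # assumed energy functional

E_base = potential_energy(psp(amplitude=1.0))   # baseline (subthreshold) response
E_supra = potential_energy(psp(amplitude=3.0))  # suprathreshold response

k, E_max = 0.01, 2.0                       # gain and maximum energy supply (assumed)
w = float(np.clip(k * (E_base - E_supra), -E_max, E_max))
print("E_base =", round(E_base, 2), " E_supra =", round(E_supra, 2), " weight =", round(w, 3))
```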