An Investigation of the Cortical Learning Algorithm

Total Page:16

File Type:pdf, Size:1020Kb

An Investigation of the Cortical Learning Algorithm Rowan University Rowan Digital Works Theses and Dissertations 5-24-2018 An investigation of the cortical learning algorithm Anthony C. Samaritano Rowan University Follow this and additional works at: https://rdw.rowan.edu/etd Part of the Electrical and Computer Engineering Commons, and the Neuroscience and Neurobiology Commons Recommended Citation Samaritano, Anthony C., "An investigation of the cortical learning algorithm" (2018). Theses and Dissertations. 2572. https://rdw.rowan.edu/etd/2572 This Thesis is brought to you for free and open access by Rowan Digital Works. It has been accepted for inclusion in Theses and Dissertations by an authorized administrator of Rowan Digital Works. For more information, please contact [email protected]. AN INVESTIGATION OF THE CORTICAL LEARNING ALGORITHM by Anthony C. Samaritano A Thesis Submitted to the Department of Electrical and Computer Engineering College of Engineering In partial fulfillment of the requirement For the degree of Master of Science in Electrical and Computer Engineering at Rowan University November 2, 2016 Thesis Advisor: Robi Polikar, Ph.D. © 2018 Anthony C. Samaritano Acknowledgments I would like to express my sincerest gratitude and appreciation to Dr. Robi Polikar for his help, instruction, and patience throughout the development of this thesis and my research into cortical learning algorithms. Dr. Polikar took a risk by allowing me to follow my passion for neurobiologically inspired algorithms and explore this emerging category of cortical learning algorithms in the machine learning field. The skills and knowledge I acquired throughout my research have definitively molded me into a more diligent and thorough engineer. I have, and will continue to, take these characteristics and skills I have gained into my current and future professional endeavors. I would like to thank my mother, Dawn Samaritano, and fiancé, Lauren Anderson, for the continued and unwavering love and support through this journey. Without your support, I would not have been about to finish this research. Thank you for propelling me forward at every turn and the countless hours of reading and feedback throughout the writing of the thesis. iii Abstract Anthony C. Samaritano AN INVESTIGATION OF THE CORTICAL LEARNING ALGORITHM 2017-2018 Robi Polikar, Ph.D. Master of Science in Electrical and Computer Engineering Pattern recognition and machine learning fields have revolutionized countless industries and applications from biometric security to modern industrial assembly lines. The fields continue to accelerate as faster, more efficient processing hardware becomes commercially available. Despite the accelerated growth of the pattern recognition and machine learning fields, computers still are unable to learn, reason, and perform rudimentary tasks that humans and animals find routine. Animals are able to move fluidly, understand their environment, and maximize their chances of survival through adaptation – animals demonstrate intelligence. A primary argument in this thesis that we have not yet achieved a level of intelligence similar to humans and animals in the pattern recognition and machine learning fields, not due to a lack of computational power but, rather, due to lack of understanding of how the cortical structures of mammalian brain interact and operate. This thesis describes a cortical learning algorithm (CLA) that models how the cortical structures in the mammalian neocortex operate. Furthermore, a high level understanding of how the cortical structures in the mammalian brain interact, store semantic patterns, and auto-recall these patterns for future predictions are discussed. Finally, we demonstrate that the algorithm can build and maintain a model of its environment and provide feedback for actions and/or classification in a similar fashion to our understanding of cortical operation. iv Table of Contents Abstract .......................................................................................................................... iv List of Figures ................................................................................................................ ix List of Tables............................................................................................................... xiii Introduction .................................................................................................... 1 Why Study the Brain? ................................................................................................ 2 Related Work ............................................................................................................. 4 Motivation ................................................................................................................. 5 Contributions ............................................................................................................. 6 Broader Impacts......................................................................................................... 7 Thesis Structure ......................................................................................................... 7 Understanding the Brain ................................................................................. 9 Cortical Structures ..................................................................................................... 9 Neuron ................................................................................................................. 9 Neocortex .......................................................................................................... 13 Neocortical Structure ......................................................................................... 14 Neocortical Regions ........................................................................................... 18 Thalamus ........................................................................................................... 19 Hippocampus ..................................................................................................... 22 Machine Learning......................................................................................... 25 Supervised Learning ................................................................................................ 25 Reinforcement Learning .......................................................................................... 27 Unsupervised Learning ............................................................................................ 28 v Table of Contents (continued) Applicable Machine Learning Algorithms ............................................................... 29 Shallow Architectures ........................................................................................ 30 Deep Architectures ............................................................................................. 35 Relationship between the Brain and Machine Learning ................................. 40 Properties of the Brain ............................................................................................. 40 Online Learning ................................................................................................. 41 Hierarchy ........................................................................................................... 42 Time Sequences ................................................................................................. 45 Sparsity .............................................................................................................. 47 Attention ............................................................................................................ 49 Invariant Representation .................................................................................... 52 Signal Agnostics and Plasticity. ......................................................................... 54 Pattern Recognition Properties of an Intelligent System ........................................... 56 Resistance to Catastrophic Forgetting ................................................................. 56 Data Fusion ........................................................................................................ 57 Noise Resistance ................................................................................................ 58 Anomaly Detection ............................................................................................ 60 Non-Stationary Environments ............................................................................ 61 New Class .......................................................................................................... 62 Cortical Learning Algorithm ......................................................................... 64 Neocortical Principles .............................................................................................. 65 Online Learning and Time Sequences ................................................................ 65 vi Table of Contents (continued) Cell Regions and Hierarchy................................................................................ 65 Sparsity .............................................................................................................. 66 Prediction and Invariant Representations ............................................................ 67 Cortical Learning Algorithm Design ........................................................................ 68 Modularity ........................................................................................................
Recommended publications
  • Cortical Layers: What Are They Good For? Neocortex
    Cortical Layers: What are they good for? Neocortex L1 L2 L3 network input L4 computations L5 L6 Brodmann Map of Cortical Areas lateral medial 44 areas, numbered in order of samples taken from monkey brain Brodmann, 1908. Primary visual cortex lamination across species Balaram & Kaas 2014 Front Neuroanat Cortical lamination: not always a six-layered structure (archicortex) e.g. Piriform, entorhinal Larriva-Sahd 2010 Front Neuroanat Other layered structures in the brain Cerebellum Retina Complexity of connectivity in a cortical column • Paired-recordings and anatomical reconstructions generate statistics of cortical connectivity Lefort et al. 2009 Information flow in neocortical microcircuits Simplified version “computational Layer 2/3 layer” Layer 4 main output Layer 5 main input Layer 6 Thalamus e - excitatory, i - inhibitory Grillner et al TINS 2005 The canonical cortical circuit MAYBE …. DaCosta & Martin, 2010 Excitatory cell types across layers (rat S1 cortex) Canonical models do not capture the diversity of excitatory cell classes Oberlaender et al., Cereb Cortex. Oct 2012; 22(10): 2375–2391. Coding strategies of different cortical layers Sakata & Harris, Neuron 2010 Canonical models do not capture the diversity of firing rates and selectivities Why is the cortex layered? Do different layers have distinct functions? Is this the right question? Alternative view: • When thinking about layers, we should really be thinking about cell classes • A cells class may be defined by its input connectome and output projectome (and some other properties) • The job of different classes is to (i) make associations between different types of information available in each cortical column and/or (ii) route/gate different streams of information • Layers are convenient way of organising inputs and outputs of distinct cell classes Excitatory cell types across layers (rat S1 cortex) INTRATELENCEPHALIC (IT) | PYRAMIDAL TRACT (PT) | CORTICOTHALAMIC (CT) From Cereb Cortex.
    [Show full text]
  • Scaling the Htm Spatial Pooler
    International Journal of Artificial Intelligence and Applications (IJAIA), Vol.11, No.4, July 2020 SCALING THE HTM SPATIAL POOLER 1 2 1 1 Damir Dobric , Andreas Pech , Bogdan Ghita and Thomas Wennekers 1University of Plymouth, Faculty of Science and Engineering, UK 2Frankfurt University of Applied Sciences, Dept. of Computer Science and Engineering, Germany ABSTRACT The Hierarchical Temporal Memory Cortical Learning Algorithm (HTM CLA) is a theory and machine learning technology that aims to capture cortical algorithm of the neocortex. Inspired by the biological functioning of the neocortex, it provides a theoretical framework, which helps to better understand how the cortical algorithm inside of the brain might work. It organizes populations of neurons in column-like units, crossing several layers such that the units are connected into structures called regions (areas). Areas and columns are hierarchically organized and can further be connected into more complex networks, which implement higher cognitive capabilities like invariant representations. Columns inside of layers are specialized on learning of spatial patterns and sequences. This work targets specifically spatial pattern learning algorithm called Spatial Pooler. A complex topology and high number of neurons used in this algorithm, require more computing power than even a single machine with multiple cores or a GPUs could provide. This work aims to improve the HTM CLA Spatial Pooler by enabling it to run in the distributed environment on multiple physical machines by using the Actor Programming Model. The proposed model is based on a mathematical theory and computation model, which targets massive concurrency. Using this model drives different reasoning about concurrent execution and enables flexible distribution of parallel cortical computation logic across multiple physical nodes.
    [Show full text]
  • Universal Transition from Unstructured to Structured Neural Maps
    Universal transition from unstructured to structured PNAS PLUS neural maps Marvin Weiganda,b,1, Fabio Sartoria,b,c, and Hermann Cuntza,b,d,1 aErnst Strüngmann Institute for Neuroscience in Cooperation with Max Planck Society, Frankfurt/Main D-60528, Germany; bFrankfurt Institute for Advanced Studies, Frankfurt/Main D-60438, Germany; cMax Planck Institute for Brain Research, Frankfurt/Main D-60438, Germany; and dFaculty of Biological Sciences, Goethe University, Frankfurt/Main D-60438, Germany Edited by Terrence J. Sejnowski, Salk Institute for Biological Studies, La Jolla, CA, and approved April 5, 2017 (received for review September 28, 2016) Neurons sharing similar features are often selectively connected contradictory to experimental observations (24, 25). Structured with a higher probability and should be located in close vicinity to maps in the visual cortex have been described in primates, carni- save wiring. Selective connectivity has, therefore, been proposed vores, and ungulates but are reported to be absent in rodents, to be the cause for spatial organization in cortical maps. In- which instead exhibit unstructured, seemingly random arrange- terestingly, orientation preference (OP) maps in the visual cortex ments commonly referred to as salt-and-pepper configurations are found in carnivores, ungulates, and primates but are not found (26–28). Phase transitions between an unstructured and a struc- in rodents, indicating fundamental differences in selective connec- tured map have been described in a variety of models as a function tivity that seem unexpected for closely related species. Here, we of various model parameters (12, 13). Still, the biological correlate investigate this finding by using multidimensional scaling to of the phase transition and therefore, the reason for the existence of predict the locations of neurons based on minimizing wiring costs structured and unstructured neural maps in closely related species for any given connectivity.
    [Show full text]
  • Cortical Columns: Building Blocks for Intelligent Systems
    Cortical Columns: Building Blocks for Intelligent Systems Atif G. Hashmi and Mikko H. Lipasti Department of Electrical and Computer Engineering University of Wisconsin - Madison Email: [email protected], [email protected] Abstract— The neocortex appears to be a very efficient, uni- to be completely understood. A neuron, the basic structural formly structured, and hierarchical computational system [25], unit of the neocortex, is orders of magnitude slower than [23], [24]. Researchers have made significant efforts to model a transistor, the basic structural unit of modern computing intelligent systems that mimic these neocortical properties to perform a broad variety of pattern recognition and learning systems. The average firing interval for a neuron is around tasks. Unfortunately, many of these systems have drifted away 150ms to 200ms [21] while transistors can operate in less from their cortical origins and incorporate or rely on attributes than a nanosecond. Still, the neocortex performs much better and algorithms that are not biologically plausible. In contrast, on pattern recognition and other learning based tasks than this paper describes a model for an intelligent system that contemporary high speed computers. One of the main reasons is motivated by the properties of cortical columns, which can be viewed as the basic functional unit of the neocortex for this seemingly unusual behavior is that certain properties [35], [16]. Our model extends predictability minimization [30] of the neocortex–like independent feature detection, atten- to mimic the behavior of cortical columns and incorporates tion, feedback, prediction, and training data independence– neocortical properties such as hierarchy, structural uniformity, make it a quite flexible and powerful parallel processing and plasticity, and enables adaptive, hierarchical independent system.
    [Show full text]
  • 2 the Cerebral Cortex of Mammals
    Abstract This thesis presents an abstract model of the mammalian neocortex. The model was constructed by taking a top-down view on the cortex, where it is assumed that cortex to a first approximation works as a system with attractor dynamics. The model deals with the processing of static inputs from the perspectives of biological mapping, algorithmic, and physical implementation, but it does not consider the temporal aspects of these inputs. The purpose of the model is twofold: Firstly, it is an abstract model of the cortex and as such it can be used to evaluate hypotheses about cortical function and structure. Secondly, it forms the basis of a general information processing system that may be implemented in computers. The characteristics of this model are studied both analytically and by simulation experiments, and we also discuss its parallel implementation on cluster computers as well as in digital hardware. The basic design of the model is based on a thorough literature study of the mammalian cortex’s anatomy and physiology. We review both the layered and columnar structure of cortex and also the long- and short-range connectivity between neurons. Characteristics of cortex that defines its computational complexity such as the time-scales of cellular processes that transport ions in and out of neurons and give rise to electric signals are also investigated. In particular we study the size of cortex in terms of neuron and synapse numbers in five mammals; mouse, rat, cat, macaque, and human. The cortical model is implemented with a connectionist type of network where the functional units correspond to cortical minicolumns and these are in turn grouped into hypercolumn modules.
    [Show full text]
  • Toward a Mathematics of Brain Function
    PERSPECTIVE Columnar connectome: toward a mathematics of brain function Anna Wang Roe Institute of Interdisciplinary Neuroscience and Technology, Zhejiang University, Hangzhou, China Keywords: Primate, Cerebral cortex, Functional networks, Functional tract tracing, Matrix mapping, Brain theory, Artificial intelligence Downloaded from http://direct.mit.edu/netn/article-pdf/3/3/779/1092449/netn_a_00088.pdf by guest on 01 October 2021 ABSTRACT an open access journal Understanding brain networks is important for many fields, including neuroscience, psychology, medicine, and artificial intelligence. To address this fundamental need, there are multiple ongoing connectome projects in the United States, Europe, and Asia producing brain connection maps with resolutions at macro- and microscales. However, still lacking is a mesoscale connectome. This viewpoint (1) explains the need for a mesoscale connectome in the primate brain (the columnar connectome), (2) presents a new method for acquiring such data rapidly on a large scale, and (3) proposes how one might use such data to achieve a mathematics of brain function. THE COLUMNAR CONNECTOME The Cerebral Cortex Is Composed of Modular Processing Units Termed “Columns” In humans and nonhuman primates, the cerebral cortex occupies a large proportion of brain volume. This remarkable structure is highly organized. Anatomically, it is a two-dimensional Citation: Roe, A. W. (2019). Columnar (2D) sheet, roughly 2mm in thickness, and divided into different cortical areas, each specializ- connectome: toward a mathematics of brain function. Network Neuroscience, ing in some aspect of sensory, motor, cognitive, and limbic function. There is a large literature, 3(3), 779–791. https://doi.org/10.1162/ especially from studies of the nonhuman primate visual cortex, to support the view that the netn_a_00088 cerebral cortex is composed of submillimeter modular functional units, termed “columns” DOI: (Mountcastle, 1997).
    [Show full text]
  • Effects of Experimental Strabismus on the Architecture of Macaque Monkey Striate Cortex
    THE JOURNAL OF COMPARATIVE NEUROLOGY 438:300–317 (2001) Effects of Experimental Strabismus on the Architecture of Macaque Monkey Striate Cortex SUZANNE B. FENSTEMAKER,1,2* LYNNE KIORPES,2 AND J. ANTHONY MOVSHON1,2 1Howard Hughes Medical Institute, New York University, New York, New York 10003 2Center for Neural Science, New York University, New York, New York 10003 ABSTRACT Strabismus, a misalignment of the eyes, results in a loss of binocular visual function in humans. The effects are similar in monkeys, where a loss of binocular convergence onto single cortical neurons is always found. Changes in the anatomical organization of primary visual cortex (V1) may be associated with these physiological deficits, yet few have been reported. We examined the distributions of several anatomical markers in V1 of two experimentally strabismic Macaca nemestrina monkeys. Staining patterns in tangential sections were re- lated to the ocular dominance (OD) column structure as deduced from cytochrome oxidase (CO) staining. CO staining appears roughly normal in the superficial layers, but in layer 4C, one eye’s columns were pale. Thin, dark stripes falling near OD column borders are evident in Nissl-stained sections in all layers and in immunoreactivity for calbindin, especially in layers 3 and 4B. The monoclonal antibody SMI32, which labels a neurofilament protein found in pyramidal cells, is reduced in one eye’s columns and absent at OD column borders. The pale SMI32 columns are those that are dark with CO in layer 4. Gallyas staining for myelin reveals thin stripes through layers 2–5; the dark stripes fall at OD column centers.
    [Show full text]
  • Poster: P292.Pdf
    Reconstructing the connectome of a cortical column with biologically-constrained associative learning Danke Zhang, Chi Zhang, and Armen Stepanyants Department of Physics and Center for Interdisciplinary Research on Complex Systems, Northeastern University, Boston, MA ➢ The structural column was loaded with associative sequences of network ➢ The average number of synapses between potentially connected excitatory ➢ The model connectome shows that L5 excitatory neurons receive inputs from all 1. Introduction states, 푋1 → 푋2 →. 푋푚+1, by training individual neurons (e.g. neuron i) to neurons matches well with experimental data. layers, including a strong excitatory projection from L2/3. These and many other independently associate a given network state, vector 푋휇, with the state at the ➢ The average number of synapses for E → I, I → E, and I → I connections obtained features of the model connectome are ubiquitously present in many cortical The cortical connectome develops in an experience-dependent manner under following time step, 푋휇+1. in the model are about 4 times smaller than that reported in experimental studies. areas (see e.g., [11,12]). the constraints imposed by the morphologies of axonal and dendritic arbors of 푖 We think that this may be due to a bias in the identification of synapses based on ➢ Several biologically-inspired constraints were imposed on the learning process. numerous classes of neurons. In this study, we describe a theoretical framework light microscopy images. which makes it possible to construct the connectome of a cortical column by These include sign constraints on excitatory and inhibitory connection weights, 8. Two- and three-neuron motifs loading associative memory sequences into its structurally (potentially) connected hemostatic l1 norm constraints on presynaptic inputs to each neuron, and noise network.
    [Show full text]
  • Discovering Cortical Algorithms
    DISCOVERING CORTICAL ALGORITHMS Atif G. Hashmi and Mikko H. Lipasti Department of Electrical and Computer Engineering, University of Wisconsin - Madison 1415 Engineering Drive, Madison, WI - 53706, USA. [email protected], [email protected] Keywords: Cortical Columns, Unsupervised Learning, Invariant Representation, Supervised Feedback, Inherent Fault Tolerance Abstract: We describe a cortical architecture inspired by the structural and functional properties of the cortical columns distributed and hierarchically organized throughout the mammalian neocortex. This results in a model which is both computationally efficient and biologically plausible. The strength and robustness of our cortical ar- chitecture is ascribed to its distributed and uniformly structured processing units and their local update rules. Since our architecture avoids complexities involved in modelling individual neurons and their synaptic con- nections, we can study other interesting neocortical properties like independent feature detection, feedback, plasticity, invariant representation, etc. with ease. Using feedback, plasticity, object permanence, and temporal associations, our architecture creates invariant representations for various similar patterns occurring within its receptive field. We trained and tested our cortical architecture using a subset of handwritten digit images ob- tained from the MNIST database. Our initial results show that our architecture uses unsupervised feedforward processing as well as supervised feedback processing to differentiate handwritten
    [Show full text]
  • Bachelor Degree Project Hierarchical Temporal Memory Software Agent
    Bachelor Degree Project Hierarchical Temporal Memory Software Agent - in the light of general artificial intelligence criteria Author: Jakob Heyder Supervisor: Johan Hagelbäck Semester: VT/HT 2017 Subject: Computer Science Abstract Artificial general intelligence is not well defined, but attempts such as the recent list of “Ingredients for building machines that think and learn like humans” are a starting point for building a system considered as such [1]. Numenta is attempting to lead the new era of machine intelligence with their research to re-engineer principles of the neocortex. It is to be explored how the ingredients are in line with the design princi- ples of their algorithms. Inspired by Deep Minds commentary about an autonomy- ingredient, this project created a combination of Numentas Hierarchical Temporal Memory theory and Temporal Difference learning to solve simple tasks defined in a browser environment. An open source software, based on Numentas intelligent com- puting platform NUPIC and Open AIs framework Universe, was developed to allow further research of HTM based agents on customized browser tasks. The analysis and evaluation of the results show that the agent is capable of learning simple tasks and there is potential for generalization inherent to sparse representations. However, they also reveal the infancy of the algorithms, not capable of learning dynamic com- plex problems, and that much future research is needed to explore if they can create scalable solutions towards a more general intelligent system. Keywords: General
    [Show full text]
  • Cortical Columns
    In: Encyclopedia of Cognitive Science, Macmillan Publishers Ltd., 2002. URL: http://cns.georgetown.edu/~miguel/papers/ecs02.html Cortical Columns Geoffrey J. Goodhill Miguel A.´ Carreira-Perpin˜an´ Dept. of Neuroscience, Georgetown University Medical Center, Washington, DC fmiguel,[email protected] April 8, 2002 Definition In many regions of the cortex, neuronal response properties remain relatively constant as one moves perpendic- ular to the surface of the cortex, while they vary in a direction parallel to the cortex. Such columnar organization is particularly evident in the visual system, in the form of ocular dominance and orientation columns. Introduction The most prominent feature of the architecture of the cortex is its horizontal organization into layers. Each layer contains different cell types, and forms different types of connections with other neurons. However, a strong vertical organization is often also apparent: neurons stacked on top of each other through the depth of the cortex tend to be connected and have similar response properties despite residing in different layers. This type of vertical structure is called a cortical column, and has been hypothesized to represent a basic functional unit for sensory processing or motor output. Columnar organization has been most extensively studied in the somatosensory and visual systems. Discovery of columnar organization Cortical columns were first discovered electrophysiologically by Mountcastle (1957). When he moved an elec- trode obliquely to the surface of somatosensory cortex, he encountered neurons which responded to different sensory submodalities (e.g. deep vs light touch). However, when the electrode was moved perpendicular to the cortical surface, all neurons had similar response properties.
    [Show full text]
  • Confusing Cortical Columns,’’ by GENETICS
    COMMENTARY. For the article ‘‘Confusing cortical columns,’’ by GENETICS. For the article ‘‘Fine structure mapping of a gene-rich Pasko Rakic, which appeared in issue 34, August 26, 2008, of region of wheat carrying Ph1, a suppressor of crossing over Proc Natl Acad Sci USA (105:12099–12100; first published between homoeologous chromosomes,’’ by Gaganpreet K. August 20, 2008; 10.1073͞pnas.0807271105), the authors note Sidhu, Sachin Rustgi, Mustafa N. Shafqat, Diter von Wettstein, that, due to printer’s errors, three references were omitted and and Kulvinder S. Gill, which appeared in issue 15, April 15, 2008, a phrase appeared incorrectly. On page 12099, left column, of Proc Natl Acad Sci USA (105:5815–5820; first published April second paragraph, line 11, ‘‘(e.g., ref. 2)’’ should appear as ‘‘(e.g., 8, 2008; 10.1073͞pnas.0800931105), the authors note that ‘‘a part refs. 2, 22, and 23).’’ Also on page 12099, right column, line 6, of our Fig. 1 was reproduced from figure 1 of the paper by Simon ‘‘single receptive field’’ should instead read: ‘‘single modalities in Griffiths et al. [Griffiths S, et al. (2006) Nature 439:749–752]. In the receptive field.’’ Finally, on page 12100, center column, line our paper we neglected to reference this figure as adapted from 21, ‘‘(15, 19)’’ should appear as ‘‘(15, 24)’’; and in line 25, the original work.’’ ‘‘(19–21)’’ should appear as ‘‘(21–24).’’ In addition to these ͞ ͞ ͞ ͞ errors, the authors note that in ref. 3, the author names www.pnas.org cgi doi 10.1073 pnas.0808507105 ‘‘Herculano-Housel S, Collins CE, Wang P, Kaas J’’ should have appeared as ‘‘Herculano-Houzel S, Collins CE, Wong P, Kaas JH, Lent R.’’ The corrected and omitted references appear below.
    [Show full text]