Normative Theory of Visual Receptive Fields
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  MITOCW | 9: Receptive Fields - Intro to Neural ComputationMITOCW | 9: Receptive Fields - Intro to Neural Computation MICHALE FEE: So today, we're going to introduce a new topic, which is related to the idea of fine- tuning curves, and that is the notion of receptive fields. So most of you have probably been, at least those of you who've taken 9.01 or 9.00 maybe, have been exposed to the idea of what a receptive field is. The idea is basically that in sensory systems neurons receive input from the sensory periphery, and neurons generally have some kind of sensory stimulus that causes them to spike. And so one of the classic examples of how to find receptive fields comes from the work of Huble and Wiesel. So I'll show you some movies made from early experiments of Huble-Wiesel where they are recording in the visual cortex of the cat. So they place a fine metal electrode into a primary visual cortex, and they present. So then they anesthetize the cat so the cat can't move. They open the eye, and the cat's now looking at a screen that looks like this, where they play a visual stimulus. And they actually did this with essentially a slide projector that they could put a card in front of that had a little hole in it, for example, that allowed a spot of light to project onto the screen. And then they can move that spot of light around while they record from neurons in visual cortex and present different visual stimuli to the retina.
- 
												  Visual PhysiologyVisual Physiology Perception and Attention Graham Hole Problems confronting the visual system: Retinal image contains a huge amount of information - which must be processed quickly. Retinal image is dim, blurry and distorted. Light levels vary enormously. Solutions: Enhance the important information (edges and contours). Process relative intensities, not absolute light levels. Use two systems, one for bright light and another for twilight (humans). The primary visual pathways: The right visual field is represented in the left hemisphere. The left visual field is represented in the right hemisphere The eye: 50% of light entering the eye is lost by absorption & scattering (diffraction). The rest strikes retinal photoreceptors. Photoreceptors: Rod Outer Segment Cone The retina contains two kinds of photoreceptors, rods and cones. Inner Segment Pedicle Why are two kinds of photoreceptor needed? 2 S Surface Luminance (cd/m ) c o t o White paper in starlight 0.05 p i c White paper in moonlight 0.5 White paper in artificial light 250 P ho t Computer monitor, TV 100 o p i White paper in sunlight 30,000 c Ambient light levels can vary hugely, by a factor of 10,000,000. Pupil diameter can vary retinal illumination by only a factor of 16. Rods respond at low (scotopic) light levels. Cones respond at high (photopic) light levels. Differences between rods and cones: Sensitivity Number Retinal distribution Visual Pigment Number and distribution of receptors: There are about 120 million rods, and 8 million cones. Cones are concentrated in central vision, around the fovea. Rods cover the entire retina, with the exception of the fovea.
- 
												  The Roles and Functions of Cutaneous Mechanoreceptors Kenneth O Johnson455 The roles and functions of cutaneous mechanoreceptors Kenneth O Johnson Combined psychophysical and neurophysiological research has nerve ending that is sensitive to deformation in the resulted in a relatively complete picture of the neural mechanisms nanometer range. The layers function as a series of of tactile perception. The results support the idea that each of the mechanical filters to protect the extremely sensitive recep- four mechanoreceptive afferent systems innervating the hand tor from the very large, low-frequency stresses and strains serves a distinctly different perceptual function, and that tactile of ordinary manual labor. The Ruffini corpuscle, which is perception can be understood as the sum of these functions. located in the connective tissue of the dermis, is a rela- Furthermore, the receptors in each of those systems seem to be tively large spindle shaped structure tied into the local specialized for their assigned perceptual function. collagen matrix. It is, in this way, similar to the Golgi ten- don organ in muscle. Its association with connective tissue Addresses makes it selectively sensitive to skin stretch. Each of these Zanvyl Krieger Mind/Brain Institute, 338 Krieger Hall, receptor types and its role in perception is discussed below. The Johns Hopkins University, 3400 North Charles Street, Baltimore, MD 21218-2689, USA; e-mail: [email protected] During three decades of neurophysiological and combined Current Opinion in Neurobiology 2001, 11:455–461 psychophysical and neurophysiological studies, evidence has accumulated that links each of these afferent types to 0959-4388/01/$ — see front matter © 2001 Elsevier Science Ltd. All rights reserved. a distinctly different perceptual function and, furthermore, that shows that the receptors innervated by these afferents Abbreviations are specialized for their assigned functions.
- 
												  Center Surround Receptive Field Structure of Cone Bipolar Cells in Primate RetinaVision Research 40 (2000) 1801–1811 www.elsevier.com/locate/visres Center surround receptive field structure of cone bipolar cells in primate retina Dennis Dacey a,*, Orin S. Packer a, Lisa Diller a, David Brainard b, Beth Peterson a, Barry Lee c a Department of Biological Structure, Uni6ersity of Washington, Box 357420, Seattle, WA 98195-7420, USA b Department of Psychology, Uni6ersity of California Santa Barbara, Santa Barbara, CA, USA c Max Planck Institute for Biophysical Chemistry, Gottingen, Germany Received 28 September 1999; received in revised form 5 January 2000 Abstract In non-mammalian vertebrates, retinal bipolar cells show center-surround receptive field organization. In mammals, recordings from bipolar cells are rare and have not revealed a clear surround. Here we report center-surround receptive fields of identified cone bipolar cells in the macaque monkey retina. In the peripheral retina, cone bipolar cell nuclei were labeled in vitro with diamidino-phenylindole (DAPI), targeted for recording under microscopic control, and anatomically identified by intracellular staining. Identified cells included ‘diffuse’ bipolar cells, which contact multiple cones, and ‘midget’ bipolar cells, which contact a single cone. Responses to flickering spots and annuli revealed a clear surround: both hyperpolarizing (OFF) and depolarizing (ON) cells responded with reversed polarity to annular stimuli. Center and surround dimensions were calculated for 12 bipolar cells from the spatial frequency response to drifting, sinusoidal luminance modulated gratings. The frequency response was bandpass and well fit by a difference of Gaussians receptive field model. Center diameters were all two to three times larger than known dendritic tree diameters for both diffuse and midget bipolar cells in the retinal periphery.
- 
												  The Visual System: Higher Visual ProcessingThe Visual System: Higher Visual Processing Primary visual cortex The primary visual cortex is located in the occipital cortex. It receives visual information exclusively from the contralateral hemifield, which is topographically represented and wherein the fovea is granted an extended representation. Like most cortical areas, primary visual cortex consists of six layers. It also contains, however, a prominent stripe of white matter in its layer 4 - the stripe of Gennari - consisting of the myelinated axons of the lateral geniculate nucleus neurons. For this reason, the primary visual cortex is also referred to as the striate cortex. The LGN projections to the primary visual cortex are segregated. The axons of the cells from the magnocellular layers terminate principally within sublamina 4Ca, and those from the parvocellular layers terminate principally within sublamina 4Cb. Ocular dominance columns The inputs from the two eyes also are segregated within layer 4 of primary visual cortex and form alternating ocular dominance columns. Alternating ocular dominance columns can be visualized with autoradiography after injecting radiolabeled amino acids into one eye that are transported transynaptically from the retina. Although the neurons in layer 4 are monocular, neurons in the other layers of the same column combine signals from the two eyes, but their activation has the same ocular preference. Bringing together the inputs from the two eyes at the level of the striate cortex provide a basis for stereopsis, the sensation of depth perception provided by binocular disparity, i.e., when an image falls on non-corresponding parts of the two retinas. Some neurons respond to disparities beyond the plane of fixation (far cells), while others respond to disparities in front of the plane of the fixation (near cells).
- 
												  A Linear Systems View to the Concept of Strfs Mounya Elhilali(1)A linear systems view to the concept of STRFs Mounya Elhilali (1) , Shihab Shamma (2,3) , Jonathan Z Simon (2,3,4) , Jonathan B. Fritz (3) (1) Department of Electrical & Computer Engineering, Johns Hopkins University (2) Department of Electrical & Computer Engineering, University of Maryland, College Park (3)Institute for Systems Research, University of Maryland, College Park (4)Department of Biology, University of Maryland, College Park 1. Introduction A key requirement in the study of sensory nervous systems is the ability to characterize effectively neuronal response selectivity. In the visual system, striving for this objective yielded remarkable progress in understanding the functional organization of the visual cortex and its implications to perception. For instance, the discovery of ordered feature representations of retinal space, color, ocular dominance, orientation, and motion direction in various cortical fields has catalyzed extensive studies into the anatomical bases of this organization, its developmental course, and the changes it undergoes during learning or following injury. In the primary auditory cortex (A1), response selectivity of the neurons (also called their receptive fields ) is ordered topographically according to the frequency they are most tuned to, an organization inherited from the cochlea. Beyond their tuning, however, A1 responses and receptive fields exhibit a bewildering variety of dynamics, frequency bandwidths, response thresholds, and patterns of excitatory and inhibitory regions. All this has been learned over decades of testing with a large variety of acoustic stimuli (tones, clicks, noise, and natural sounds) and response measures (tuning curves, rate-level functions, and binaural maps). A response measure that has proven particularly useful beyond the feature- specific measures enumerated earlier is the notion of a generalized spatiotemporal , or equivalently for the auditory system, the spectro-temporal receptive field (STRF).
- 
												  Seeing in Three Dimensions: the Neurophysiology of Stereopsis Gregory CReview DeAngelis – Neurophysiology of stereopsis Seeing in three dimensions: the neurophysiology of stereopsis Gregory C. DeAngelis From the pair of 2-D images formed on the retinas, the brain is capable of synthesizing a rich 3-D representation of our visual surroundings. The horizontal separation of the two eyes gives rise to small positional differences, called binocular disparities, between corresponding features in the two retinal images. These disparities provide a powerful source of information about 3-D scene structure, and alone are sufficient for depth perception. How do visual cortical areas of the brain extract and process these small retinal disparities, and how is this information transformed into non-retinal coordinates useful for guiding action? Although neurons selective for binocular disparity have been found in several visual areas, the brain circuits that give rise to stereoscopic vision are not very well understood. I review recent electrophysiological studies that address four issues: the encoding of disparity at the first stages of binocular processing, the organization of disparity-selective neurons into topographic maps, the contributions of specific visual areas to different stereoscopic tasks, and the integration of binocular disparity and viewing-distance information to yield egocentric distance. Some of these studies combine traditional electrophysiology with psychophysical and computational approaches, and this convergence promises substantial future gains in our understanding of stereoscopic vision. We perceive our surroundings vividly in three dimen- lished the first reports of disparity-selective neurons in the sions, even though the image formed on the retina of each primary visual cortex (V1, or area 17) of anesthetized cats5,6.
- 
												  COGS 101A: Sensation and Perception 1COGS 101A: Sensation and Perception 1 Virginia R. de Sa Department of Cognitive Science UCSD Lecture 4: Coding Concepts – Chapter 2 Course Information 2 • Class web page: http://cogsci.ucsd.edu/ desa/101a/index.html • Professor: Virginia de Sa ? I’m usually in Chemistry Research Building (CRB) 214 (also office in CSB 164) ? Office Hours: Monday 5-6pm ? email: desa at ucsd ? Research: Perception and Learning in Humans and Machines For your Assistance 3 TAS: • Jelena Jovanovic OH: Wed 2-3pm CSB 225 • Katherine DeLong OH: Thurs noon-1pm CSB 131 IAS: • Jennifer Becker OH: Fri 10-11am CSB 114 • Lydia Wood OH: Mon 12-1pm CSB 114 Course Goals 4 • To appreciate the difficulty of sensory perception • To learn about sensory perception at several levels of analysis • To see similarities across the sensory modalities • To become more attuned to multi-sensory interactions Grading Information 5 • 25% each for 2 midterms • 32% comprehensive final • 3% each for 6 lab reports - due at the end of the lab • Bonus for participating in a psych or cogsci experiment AND writing a paragraph description of the study You are responsible for knowing the lecture material and the assigned readings. Read the readings before class and ask questions in class. Academic Dishonesty 6 The University policy is linked off the course web page. You will all have to sign a form in section For this class: • Labs are done in small groups but writeups must be in your own words • There is no collaboration on midterms and final exam Last Class 7 We learned about the cells in the Retina This Class 8 Coding Concepts – Every thing after transduction (following Chris Johnson’s notes) Remember:The cells in the retina are neurons 9 neurons are specialized cells that transmit electrical/chemical information to other cells neurons generally receive a signal at their dendrites and transmit it electrically to their soma and axon.
- 
												  Understanding the Effective Receptive Field in Deep Convolutional Neural NetworksUnderstanding the Effective Receptive Field in Deep Convolutional Neural Networks Wenjie Luo∗ Yujia Li∗ Raquel Urtasun Richard Zemel Department of Computer Science University of Toronto {wenjie, yujiali, urtasun, zemel}@cs.toronto.edu Abstract We study characteristics of receptive fields of units in deep convolutional networks. The receptive field size is a crucial issue in many visual tasks, as the output must respond to large enough areas in the image to capture information about large objects. We introduce the notion of an effective receptive field, and show that it both has a Gaussian distribution and only occupies a fraction of the full theoretical receptive field. We analyze the effective receptive field in several architecture designs, and the effect of nonlinear activations, dropout, sub-sampling and skip connections on it. This leads to suggestions for ways to address its tendency to be too small. 1 Introduction Deep convolutional neural networks (CNNs) have achieved great success in a wide range of problems in the last few years. In this paper we focus on their application to computer vision: where they are the driving force behind the significant improvement of the state-of-the-art for many tasks recently, including image recognition [10, 8], object detection [17, 2], semantic segmentation [12, 1], image captioning [20], and many more. One of the basic concepts in deep CNNs is the receptive field, or field of view, of a unit in a certain layer in the network. Unlike in fully connected networks, where the value of each unit depends on the entire input to the network, a unit in convolutional networks only depends on a region of the input.
- 
												  Do We Know What the Early Visual System Does?The Journal of Neuroscience, November 16, 2005 • 25(46):10577–10597 • 10577 Mini-Symposium Do We Know What the Early Visual System Does? Matteo Carandini,1 Jonathan B. Demb,2 Valerio Mante,1 David J. Tolhurst,3 Yang Dan,4 Bruno A. Olshausen,6 Jack L. Gallant,5,6 and Nicole C. Rust7 1Smith-Kettlewell Eye Research Institute, San Francisco, California 94115, 2Departments of Ophthalmology and Visual Sciences, and Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, Michigan 48105, 3Department of Physiology, University of Cambridge, Cambridge CB2 1TN, United Kingdom, Departments of 4Molecular and Cellular Biology and 5Psychology and 6Helen Wills Neuroscience Institute and School of Optometry, University of California, Berkeley, Berkeley, California 94720, and 7Center for Neural Science, New York University, New York, New York 10003 We can claim that we know what the visual system does once we can predict neural responses to arbitrary stimuli, including those seen in nature. In the early visual system, models based on one or more linear receptive fields hold promise to achieve this goal as long as the models include nonlinear mechanisms that control responsiveness, based on stimulus context and history, and take into account the nonlinearity of spike generation. These linear and nonlinear mechanisms might be the only essential determinants of the response, or alternatively, there may be additional fundamental determinants yet to be identified. Research is progressing with the goals of defining a single “standard model” for each stage of the visual pathway and testing the predictive power of these models on the responses to movies of natural scenes.
- 
												  The Neocognitron As a System for Handavritten Character Recognition: Limitations and ImprovementsThe Neocognitron as a System for HandAvritten Character Recognition: Limitations and Improvements David R. Lovell A thesis submitted for the degree of Doctor of Philosophy Department of Electrical and Computer Engineering University of Queensland March 14, 1994 THEUliW^^ This document was prepared using T^X and WT^^. Figures were prepared using tgif which is copyright © 1992 William Chia-Wei Cheng (william(Dcs .UCLA. edu). Graphs were produced with gnuplot which is copyright © 1991 Thomas Williams and Colin Kelley. T^ is a trademark of the American Mathematical Society. Statement of Originality The work presented in this thesis is, to the best of my knowledge and belief, original, except as acknowledged in the text, and the material has not been subnaitted, either in whole or in part, for a degree at this or any other university. David R. Lovell, March 14, 1994 Abstract This thesis is about the neocognitron, a neural network that was proposed by Fuku- shima in 1979. Inspired by Hubel and Wiesel's serial model of processing in the visual cortex, the neocognitron was initially intended as a self-organizing model of vision, however, we are concerned with the supervised version of the network, put forward by Fukushima in 1983. Through "training with a teacher", Fukushima hoped to obtain a character recognition system that was tolerant of shifts and deformations in input images. Until now though, it has not been clear whether Fukushima's ap- proach has resulted in a network that can rival the performance of other recognition systems. In the first three chapters of this thesis, the biological basis, operational principles and mathematical implementation of the supervised neocognitron are presented in detail.
- 
												  Different Roles for Simple-Cell and Complex-Cell Inhibition in V1The Journal of Neuroscience, November 12, 2003 • 23(32):10201–10213 • 10201 Behavioral/Systems/Cognitive Different Roles for Simple-Cell and Complex-Cell Inhibition in V1 Thomas Z. Lauritzen1,4 and Kenneth D. Miller1,2,3,4 1Graduate Group in Biophysics, 2Departments of Physiology and Otolaryngology, 3Sloan-Swartz Center for Theoretical Neurobiology, and 4W. M. Keck Center for Integrative Neuroscience, University of California, San Francisco, California 94143-0444 Previously, we proposed a model of the circuitry underlying simple-cell responses in cat primary visual cortex (V1) layer 4. We argued that the ordered arrangement of lateral geniculate nucleus inputs to a simple cell must be supplemented by a component of feedforward inhibition that is untuned for orientation and responds to high temporal frequencies to explain the sharp contrast-invariant orientation tuning and low-pass temporal frequency tuning of simple cells. The temporal tuning also requires a significant NMDA component in geniculocortical synapses. Recent experiments have revealed cat V1 layer 4 inhibitory neurons with two distinct types of receptive fields (RFs): complex RFs with mixed ON/OFF responses lacking in orientation tuning, and simple RFs with normal, sharp-orientation tuning (although, some respond to all orientations). We show that complex inhibitory neurons can provide the inhibition needed to explain simple-cell response properties. Given this complex cell inhibition, antiphase or “push-pull” inhibition from tuned simple inhibitory neurons acts to sharpen