Event-Based Action Recognition Using Motion Information and Spiking Neural Networks

Total Page:16

File Type:pdf, Size:1020Kb

Event-Based Action Recognition Using Motion Information and Spiking Neural Networks Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21) Event-based Action Recognition Using Motion Information and Spiking Neural Networks Qianhui Liu1;2 , Dong Xing1;2 , Huajin Tang1;2 , De Ma1∗ and Gang Pan1;2∗ 1College of Computer Science and Technology, Zhejiang University, Hangzhou, China 2Zhejiang Lab, Hangzhou, China fqianhuiliu, dongxing, htang, made, [email protected] Abstract pixel, forming an asynchronous and sparse representation of the scene. Event-based cameras have attracted increasing at- This event-based representation is inherently suitable to tention due to their advantages of biologically cooperate with the spiking neural network (SNN) since SNN inspired paradigm and low power consumption. also has the event-based property [Hu et al., 2018]. SNN uses Since event-based cameras record the visual input discrete spikes to transmit information between units which as asynchronous discrete events, they are inherently mimics the behavior of biological neural systems. Benefit- suitable to cooperate with the spiking neural net- ing from this event-driven processing paradigm, SNN is en- work (SNN). Existing works of SNNs for process- ergy efficient on neuromorphic hardware and has a power- ing events mainly focus on the task of object recog- ful ability in processing spatio-temporal information. Recent nition. However, events from the event-based cam- years, SNN has been increasingly applied to the task related era are triggered by dynamic changes, which makes to event-based cameras. it an ideal choice to capture actions in the visual Existing works of SNNs cooperating with event-based scene. Inspired by the dorsal stream in visual cor- cameras mainly focus on the object recognition tasks [Or- tex, we propose a hierarchical SNN architecture for chard et al., 2015; Xiao et al., 2019; Liu et al., 2020b]. How- event-based action recognition using motion infor- ever, since the event-based camera naturally captures move- mation. Motion features are extracted and utilized ments in the visual scene, it is a good fit for the action recog- from events to local and finally to global perception nition task. Nevertheless, works of SNN on event-based ac- for action recognition. To the best of the authors’ tion recognition are still limited. knowledge, it is the first attempt of SNN to apply Humans can recognize actions accurately, which motivates motion information to event-based action recog- us to explore the biological visual cortex to gain experience nition. We evaluate our proposed SNN on three for event-based recognition. The visual cortex is organized in event-based action recognition datasets, including two different pathways [Jhuang et al., 2007]. One is a ven- our newly published DailyAction-DVS dataset com- tral stream dealing with shape information, which has been prising 12 actions collected under diverse recording widely used in the existing spiking object recognition models conditions. Extensive experimental results show [Orchard et al., 2015; Liu et al., 2020b]. The other is a dor- the effectiveness of motion information and our sal stream involved with the analysis of motion information. proposed SNN architecture for event-based action Since the event streams representing the actions contain rich recognition. motion information, motion features of event stream may be an ideal choice for action recognition tasks. Further, the orga- nization of dorsal stream is hierarchical. Neurons gradually 1 Introduction increase their receptive field along the hierarchy, as well as Event-based cameras are a novel class of vision devices im- their selectivity and invariance to features. Inspired by the itating the mechanism of human retina. Contrary to conven- current theory of the visual cortex, we will make steps to- tional cameras, which record the visual input from all pix- wards the solution of event-based action recognition. els as images at a fixed rate, with event-based cameras, each We propose a hierarchical SNN architecture for event- pixel individually emits events when it observes sufficient based action recognition using motion information. Motion changes of light intensity in its receptive field. Thus, event- features are extracted and utilized from events to local and based cameras naturally respond to moving objects and ig- finally to global perception for action recognition. Specif- nore static redundant information, resulting in significant re- ically, we first adopt motion-sensitive neurons to estimate duction of memory usage and energy consumption. The final optical flow for the purpose of local motion (direction and output of the camera is a stream of events collected from each speed) perception. Then, we perform a motion pooling and a spatial pooling to mitigate the effect of the aperture prob- ∗Corresponding author lem [Orchard et al., 2013] and increase the spatial invariance 1743 Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21) respectively. As the final stage of the architecture, a SNN 2020b] and the proposed features become one of the most classifier in which the spiking neurons are fully connected to commonly used features when using SNN to process the the previous pooling layer, is adopted as global perception for event-based recognition. recognition results. To the best of the authors’ knowledge, it We are inspired by the function of dorsal stream also in vi- is the first attempt of SNN to apply the motion information to sual cortex and make steps towards using motion information action recognition tasks. for action recognition tasks. Since event-based cameras pro- Besides, due to the lack of event-based action datasets vide an efficient way for encoding light and its temporal vari- and their importance for the algorithm development, we ations [Benosman et al., 2012], we introduce the optical flow present a new event-based action recognition dataset called estimation for motion (direction and speed) perception. Ex- DailyAction-DVS. The dataset comprises 15 subjects per- isting works on SNN-based optical flow estimation adopted forming 12 daily actions under 2 lighting conditions and 2 motion-sensitive neurons with synaptic delays [Orchard et camera positions (with different distances and angles to the al., 2013; Paredes-Valles´ et al., 2019]. We here adopt neu- subjects). This setting increases the challenge of dataset rons in [Orchard et al., 2013] due to their effectiveness and while gaining more practical significance. simplicity. We evaluate the proposed SNN on the new event-based ac- tion recognition dataset and two other challenging ones. Ex- 2.3 Event-based Datasets perimental results show the effectiveness of motion informa- Existing datasets on event-based action recognition can be tion and our proposed SNN architecture for event-based ac- divided into two categories: one is recorded with a static tion recognition. event-based camera facing a monitor on which video-based datasets were set to play automatically [Hu et al., 2016]. 2 Related Work However, this recording way will lose real dynamics of mov- ing objects between two frames. There is no guarantee that 2.1 Event-based Action Recognition a method tested on this kind of artificial data will behave Action recognition task has drawn a significant amount of similarly in real-world conditions [Sironi et al., 2018]. The attention from the academic community, owing to its appli- other is to record directly by event-based cameras in the real cations in many areas like security and behavior analysis. scene. Among them, several datasets are proposed for ges- With the popularity of event-based cameras, they have been tures. [Amir et al., 2017] proposed an event-based hand ges- found to be ideal choices to capture human actions since they ture dataset captured by a fixed DVS camera [Lichtsteiner et only record the activity in the field of view and automatically al., 2008]. [Maro et al., 2020] also proposed a gesture dataset partition the foreground and background. Recently, research but was recorded by an ATIS camera [Posch et al., 2011] con- on event-based action recognition has emerged progressively. nected to the smartphone. As for human action, [Miao et al., One approach is to convert the output of an event camera into 2019] proposed an event-based action recognition dataset us- frames and use standard computer vision methods, such as ing a DAVIS camera [Brandli et al., 2014] with 3 different [Innocenti et al., 2020]. However, these works mainly focus positions. However, this dataset is recorded under single light on how to aggregate events and deal with frames. Another condition and is relatively small (291 recordings released). approach is to directly deal with events. [Maro et al., 2020] Our proposed event-based action recognition DailyAction- introduced a framework for dynamic gesture recognition rely- DVS dataset has 1440 recordings of 12 daily actions. The ing on the concept of time-surfaces introduced in [Lagorce et dataset is captured by a DVS camera with 2 different light- al., 2017]. In addition, SNN is trying to solve the event-based ing conditions and 2 different camera positions (with differ- gesture recognition. [George et al., 2020] presented a SNN ent distances and angles), which brings more challenges to which uses the idea of convolution and reservoir computing the dataset and is also more in line with the realistic situation. in order to classify human hand gestures. Since SNN has the event-based property and the ability in processing spatio- temporal information, it has great potential to solve the event- 3 Method based action recognition, but related works are still limited. In this section, we introduce the proposed SNN for event- 2.2 Event-based Features based action recognition, which extracts and utilizes motion [Lagorce et al., 2017] proposed the spatio-temporal features information from events to local and finally to global percep- based on recent temporal activity of events within a local spa- tion. The architecture of the proposed SNN is shown in Fig- tial neighborhood called time-surfaces.
Recommended publications
  • The Hodgkin-Huxley Neuron Model for Motion Detection in Image Sequences Hayat Yedjour, Boudjelal Meftah, Dounia Yedjour, Olivier Lézoray
    The Hodgkin-Huxley neuron model for motion detection in image sequences Hayat Yedjour, Boudjelal Meftah, Dounia Yedjour, Olivier Lézoray To cite this version: Hayat Yedjour, Boudjelal Meftah, Dounia Yedjour, Olivier Lézoray. The Hodgkin-Huxley neuron model for motion detection in image sequences. Neural Computing and Applications, Springer Verlag, In press, 10.1007/s00521-021-06446-0. hal-03331961 HAL Id: hal-03331961 https://hal.archives-ouvertes.fr/hal-03331961 Submitted on 2 Sep 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Noname manuscript No. (will be inserted by the editor) The Hodgkin{Huxley neuron model for motion detection in image sequences Hayat Yedjour · Boudjelal Meftah · Dounia Yedjour · Olivier L´ezoray Received: date / Accepted: date Abstract In this paper, we consider a biologically-inspired spiking neural network model for motion detection. The proposed model simulates the neu- rons' behavior in the cortical area MT to detect different kinds of motion in image sequences. We choose the conductance-based neuron model of the Hodgkin-Huxley to define MT cell responses. Based on the center-surround antagonism of MT receptive fields, we model the area MT by its great pro- portion of cells with directional selective responses.
    [Show full text]
  • Capture of Stereopsis and Apparent Motion by Illusory Contours
    Perception & Psychophysics 1986, 39 (5), 361-373 Capture of stereopsis and apparent motion by illusory contours V. S. RAMACHANDRAN University of California, San Diego, La Jolla, California The removal of right-angle sectors from four circular black disks created the impression of a white "subjective" square with its four corners occluding the disks ("illusory contours"). The dis­ play in one eye was identical to that in the other except that for one eye a horizontal disparity was introduced between the edges of the cut sectors so that an illusory white square floated out in front of the paper. When a "template" ofthis stereogram was superimposed on a vertical square­ wave grating (6 cycles/deg) the illusory square pulled the corresponding lines of grating forward even though the grating was at zero disparity. (Capture was not observed if the template was superimposed on horizontal gratings; the grating remained flush with the background). Analo­ gous effects were observed in apparent motion ("motion capture"). Removal of sectors from disks in an appropriately timed sequence generated apparent motion of an illusory square. When a template of this display was superimposed on a grating, the lines appeared to move vividly with the square even though they were physically stationary. It was concluded that segmentation of the visual scene into surfaces and objects can profoundly influence the early visual processing of stereopsis and apparent motion. However, there are interesting differences between stereo cap­ ture and motion capture, which probably reflect differences in natural constraints. The implica­ tions of these findings for computational models of vision are discussed.
    [Show full text]
  • Unsupervised Learning of a Hierarchical Spiking Neural Network for Optical Flow Estimation: from Events to Global Motion Perception
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019 1 Unsupervised Learning of a Hierarchical Spiking Neural Network for Optical Flow Estimation: From Events to Global Motion Perception Federico Paredes-Valles´ , Kirk Y. W. Scheper , and Guido C. H. E. de Croon , Member, IEEE Abstract—The combination of spiking neural networks and event-based vision sensors holds the potential of highly efficient and high-bandwidth optical flow estimation. This paper presents the first hierarchical spiking architecture in which motion (direction and speed) selectivity emerges in an unsupervised fashion from the raw stimuli generated with an event-based camera. A novel adaptive neuron model and stable spike-timing-dependent plasticity formulation are at the core of this neural network governing its spike-based processing and learning, respectively. After convergence, the neural architecture exhibits the main properties of biological visual motion systems, namely feature extraction and local and global motion perception. Convolutional layers with input synapses characterized by single and multiple transmission delays are employed for feature and local motion perception, respectively; while global motion selectivity emerges in a final fully-connected layer. The proposed solution is validated using synthetic and real event sequences. Along with this paper, we provide the cuSNN library, a framework that enables GPU-accelerated simulations of large-scale spiking neural networks. Source code and samples are available at https://github.com/tudelft/cuSNN. Index Terms—Event-based vision, feature extraction, motion detection, neural nets, neuromorphic computing, unsupervised learning F 1 INTRODUCTION HENEVER an animal endowed with a visual system interconnected cells for motion estimation, among other W navigates through an environment, turns its gaze, or tasks.
    [Show full text]
  • Human Visual Motion Perception Shows Hallmarks of Bayesian Structural Inference Sichao Yang1,2,6*, Johannes Bill3,4,6*, Jan Drugowitsch3,5,7 & Samuel J
    www.nature.com/scientificreports OPEN Human visual motion perception shows hallmarks of Bayesian structural inference Sichao Yang1,2,6*, Johannes Bill3,4,6*, Jan Drugowitsch3,5,7 & Samuel J. Gershman4,5,7 Motion relations in visual scenes carry an abundance of behaviorally relevant information, but little is known about how humans identify the structure underlying a scene’s motion in the frst place. We studied the computations governing human motion structure identifcation in two psychophysics experiments and found that perception of motion relations showed hallmarks of Bayesian structural inference. At the heart of our research lies a tractable task design that enabled us to reveal the signatures of probabilistic reasoning about latent structure. We found that a choice model based on the task’s Bayesian ideal observer accurately matched many facets of human structural inference, including task performance, perceptual error patterns, single-trial responses, participant-specifc diferences, and subjective decision confdence—especially, when motion scenes were ambiguous and when object motion was hierarchically nested within other moving reference frames. Our work can guide future neuroscience experiments to reveal the neural mechanisms underlying higher-level visual motion perception. Motion relations in visual scenes are a rich source of information for our brains to make sense of our environ- ment. We group coherently fying birds together to form a fock, predict the future trajectory of cars from the trafc fow, and infer our own velocity from a high-dimensional stream of retinal input by decomposing self- motion and object motion in the scene. We refer to these relations between velocities as motion structure.
    [Show full text]
  • Motion-In-Depth Perception and Prey Capture in the Praying Mantis Sphodromantis Lineola
    © 2019. Published by The Company of Biologists Ltd | Journal of Experimental Biology (2019) 222, jeb198614. doi:10.1242/jeb.198614 RESEARCH ARTICLE Motion-in-depth perception and prey capture in the praying mantis Sphodromantis lineola Vivek Nityananda1,*, Coline Joubier1,2, Jerry Tan1, Ghaith Tarawneh1 and Jenny C. A. Read1 ABSTRACT prey as they come near. Just as with depth perception, several cues Perceiving motion-in-depth is essential to detecting approaching could contribute to the perception of motion-in-depth. or receding objects, predators and prey. This can be achieved Two of the motion-in-depth cues that have received the most using several cues, including binocular stereoscopic cues such attention in humans are binocular: changing disparity and as changing disparity and interocular velocity differences, and interocular velocity differences (IOVDs) (Cormack et al., 2017). monocular cues such as looming. Although these have been Stereoscopic disparity refers to the difference in the position of an studied in detail in humans, only looming responses have been well object as seen by the two eyes. This disparity reflects the distance to characterized in insects and we know nothing about the role of an object. Thus as an object approaches, the disparity between the stereoscopic cues and how they might interact with looming cues. We two views changes. This changing disparity cue suffices to create a used our 3D insect cinema in a series of experiments to investigate perception of motion-in-depth for human observers, even in the the role of the stereoscopic cues mentioned above, as well as absence of other cues (Cumming and Parker, 1994).
    [Show full text]
  • Motion Processing in Human Visual Cortex
    1 Motion Processing in Human Visual Cortex Randolph Blake*, Robert Sekuler# and Emily Grossman* *Vanderbilt Vision Research Center, Nashville TN 37203 #Volen Center for Complex Systems, Brandeis University, Waltham MA 02454 Acknowledgments. The authors thank David Heeger, Alex Huk, Nestor Matthews and Mark Nawrot for comments on sections of this chapter. David Bloom helped greatly with manuscript formatting and references. During preparation of this chapter RB and EG were supported by research grants from the National Institutes of Health (EY07760) and the National Science Foundation (BCS0121962) and by a Core Grant from the National Institutes of Health (EY01826) awarded to the Vanderbilt Vision Research Center. [email protected] 2 During the past few decades our understanding of the neural bases of visual perception and visually guided actions has advanced greatly; the chapters in this volume amply document these advances. It is fair to say, however, that no area of visual neuroscience has enjoyed greater progress than the physiology of visual motion perception. Publications dealing with visual motion perception and its neural bases would fill volumes, and the topic remains central within the visual neuroscience literature even today. This chapter aims to provide an overview of brain mechanisms underlying human motion perception. The chapter comprises four sections. We begin with a short review of the motion pathways in the primate visual system, emphasizing work from single-cell recordings in Old World monkeys; more extensive reviews of this literature appear elsewhere.1,2 With this backdrop in place, the next three sections concentrate on striate and extra-striate areas in the human brain that are implicated in the perception of motion.
    [Show full text]
  • Abnormal Cortical Processing of Pattern Motion in Amblyopia: Evidence from Fmri
    NeuroImage 60 (2012) 1307–1315 Contents lists available at SciVerse ScienceDirect NeuroImage journal homepage: www.elsevier.com/locate/ynimg Abnormal cortical processing of pattern motion in amblyopia: Evidence from fMRI B. Thompson a,b,⁎, M.Y. Villeneuve c,d, C. Casanova e, R.F. Hess b a Department of Optometry and Vision Science, University of Auckland, Private Bag 92019, Auckland, New Zealand b McGill Vision Research, Department of Ophthalmology, McGill University, Royal Victoria Hospital, 687 Pine Avenue West, Montreal, Canada c NMR Martinos Center, Massachusetts General Hospital, Harvard Medical School, Harvard University, Boston, USA d Montreal Neurological Institute, Departments of Neurology and Neurosurgery, McGill University, Montreal, Canada e Laboratoire des Neurosciences de la Vision, École d'optométrie, Université de Montréal, CP 6128, succ. Centre-Ville, Montreal, Canada article info abstract Article history: Converging evidence from human psychophysics and animal neurophysiology indicates that amblyopia is as- Received 12 November 2011 sociated with abnormal function of area MT, a motion sensitive region of the extrastriate visual cortex. In this Revised 29 December 2011 context, the recent finding that amblyopic eyes mediate normal perception of dynamic plaid stimuli was sur- Accepted 14 January 2012 prising, as neural processing and perception of plaids has been closely linked to MT function. One intriguing Available online 24 January 2012 potential explanation for this discrepancy is that the amblyopic eye recruits alternative visual brain areas to support plaid perception. This is the hypothesis that we tested. We used functional magnetic resonance im- Keywords: Amblyopia aging (fMRI) to measure the response of the amblyopic visual cortex and thalamus to incoherent and coher- Motion ent motion of plaid stimuli that were perceived normally by the amblyopic eye.
    [Show full text]
  • Binocular Eye Movement Control and Motion Perception: What Is Being Tracked?
    Eye Movements, Strabismus, Amblyopia, and Neuro-Ophthalmology Binocular Eye Movement Control and Motion Perception: What Is Being Tracked? Johannes van der Steen and Joyce Dits PURPOSE. We investigated under what conditions humans can representation of the visual world is constructed based on make independent slow phase eye movements. The ability to binocular retinal correspondence.4 To achieve this, the brain make independent movements of the two eyes generally is must use the visual information from each eye to stabilize the attributed to few specialized lateral eyed animal species, for two retinal images relative to each other. Imagine one is example chameleons. In our study, we showed that humans holding two laser pointers, one in each hand, and one must also can move the eyes in different directions. To maintain project the two beams precisely on top of each other on a wall. binocular retinal correspondence independent slow phase Eye movements contribute to binocular vision using visual movements of each eye are produced. feedback from image motion of each eye and from binocular correspondence. It generally is believed that the problem of METHODS. We used the scleral search coil method to measure binocular eye movements in response to dichoptically viewed binocular correspondence is solved in the primary visual 5,6 visual stimuli oscillating in orthogonal direction. cortex (V1). Neurons in V1 process monocular and binocular visual information.7 At subsequent levels of process- RESULTS. Correlated stimuli led to orthogonal slow eye ing eye-of-origin information is lost, and the perception of the movements, while the binocularly perceived motion was the binocular world is invariant for eye movements and self- vector sum of the motion presented to each eye.
    [Show full text]
  • Motion Perception and Prediction: a Subsymbolic Approach
    Motion Perception and Prediction: A Subsymbolic Approach Volker Baier Institut für Informatik, Lehrstuhl für Theoretische Informatik und Grundlagen der Künstliche Intelligenz Motion Perception and Prediction: A Subsymbolic Approach Volker Baier Vollständiger Abdruck der von der Fakultät für Informatik der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigten Dissertation. Vorsitzender: Univ.-Prof. Dr. A. Knoll Prüfer der Dissertation: 1. Univ.-Prof. Dr. Dr. h.c. mult. W. Brauer 2. Univ.-Prof. Dr. K. Schill Universität Bremen 3. Univ.-Prof. M. Beetz, PhD Die Dissertation wurde am 28.06.2006 bei der Technischen Universität München eingereicht und durch die Fakultät für Informatik am 13.11.2006 angenommen. Contents 1 Introduction 3 2 Overview of the Model 9 3 Paradigms of Perception 13 3.1 Philosophy . 13 3.2 Behaviorism . 14 3.3 Cognitivism . 15 3.4 Cybernetics and Constructivism . 17 3.5 Connectionism . 18 3.6 Computational Neuroscience . 19 3.7 Guiding Concepts . 20 4 Neuro-Anatomy 21 4.1 Visual Streams . 21 4.1.1 The Dorsal Pathway . 22 4.1.2 The Ventral Pathway . 23 4.2 Visual Cortical Areas . 23 4.2.1 Retina and Lateral Geniculate Nucleus . 24 4.2.2 Primary Visual Cortex V1 . 24 4.3 Motion Processing Related Areas . 28 4.4 What we Derive from Neuro-Anatomy . 30 5 Cognitive Psychology 33 5.1 Visual Spatio-Temporal Memory . 34 5.2 Dynamic Spatio-Temporal Memory . 36 5.3 Psychophysic Experimental Results . 39 5.4 What we Derive for our Model . 44 6 Neural Networks for Static Patterns 47 6.1 Multilayer Perceptron .
    [Show full text]
  • Visual Perception Survey Project -- Perceiving Motion and Events
    Visual Perception Survey Project -- Perceiving Motion and Events Chienchih Chen Yutian Chen 1. Image Motion When a moving object is viewed by an observer with eyes, head, and body fixed, its projected image moves across the retina. In this instance it is natural to equate object motion with image motion. But much of the time, our eyes, head, and body are in motion, and this fact invalidates any attempt to equate object motion directly with image motion. Image motion depends on both eye movements and object motion. As the eye moves Adaptation and Aftereffects in one direction, the image sweeps across the visual field in the opposite direction. But, there When an observer stares for a prolonged is image motion when there is no object motion period at a field of image elements moving at a at all. The opposite can also occur, as when a constant direction and speed, the visual system moving object is tracked by the eye: It's image undergoes motion adaptation; that is, its on the retina will actually be stationary even response to the motion diminishes in intensity though the object itself is in motion. For this over time. It is related to motion aftereffects: by reason, it is important to distinguish clearly viewing of constant motion for a prolonged between image motion and object always period, after the motion has stopped, subsequent depends on eye, head, and body movements as motion perception is significantly altered. well as image motion. Apparent Motion Continuous Motion Modern technology provides a host of An environmental object is considered to situations in which realistic motion perception be moving if its position changes over time.
    [Show full text]
  • Brain Areas Involved in Perception of Biological Motion
    Brain Areas Involved in Perception of Biological Motion E. Grossman, M. Donnelly, R. Price, D. Pickens, V. Morgan, G. Neighbor, and R. Blake Vanderbilt University Abstract & These experiments use functional magnetic resonance motion was located within a small region on the ventral bank imaging (fMRI) to reveal neural activity uniquely associated of the occipital extent of the superior-temporal sulcus (STS). with perception of biological motion. We isolated brain areas This region is located lateral and anterior to human MT/MST, activated during the viewing of point-light figures, then and anterior to KO. Among our observers, we localized this compared those areas to regions known to be involved in region more frequently in the right hemisphere than in the coherent-motion perception and kinetic-boundary perception. left. This was true regardless of whether the point-light figures Coherent motion activated a region matching previous reports were presented in the right or left hemifield. A small region of human MT/MST complex located on the temporo-parieto- in the medial cerebellum was also active when observers occipital junction. Kinetic boundaries activated a region viewed biological-motion sequences. Consistent with earlier posterior and adjacent to human MT previously identified as neuroimaging and single-unit studies, this pattern of results the kinetic-occipital (KO) region or the lateral-occipital (LO) points to the existence of neural mechanisms specialized complex. The pattern of activation during viewing of biological for analysis of the kinematics defining biological motion. & INTRODUCTION Of course, motion information is important for more We live in a dynamic visual world: Objects often move than just identification of biological motion.
    [Show full text]
  • Chapter 4: Perception, Motion and Action
    Chapter 4: Perception, motion and action The first major theme addressed in this chapter is perception for action, or how we manage to act appropriately on the environment and the objects within it. Of some relevance here are theories (e.g., the perception–action model and the dual-process approach) that distinguish between processes and systems involved in perception-for-recognition and those involved in perception-for-action. There is convincing evidence that attention plays an important role in determining which aspects of the environment are consciously perceived. Direct perception In the past it was assumed that the function of visual perception was object identification. Gibson’s radical idea (e.g., 1950) was that perceptual information is used primarily in the organisation of action, facilitating interactions between the individual and his/her environment. According to Gibson, perception influences our actions with minimal involvement of conscious awareness. WEBLINK: Gibson’s theory Gibson regarded his theoretical approach as “ecological”. He proposed that perception involves picking up information directly from the optic array – the structured pattern of light striking the eye, containing unambiguous information about objects in the environment. The optic flow consists of changes in the pattern of light reaching an observer created when he/she moves, or parts of the visual environment move. According to Gibson, optic flow provides pilots with unambiguous information about their direction, speed and altitude: The point towards which the pilot is moving (the focus of expansion) appears motionless, while the visual environment appears to be moving away. Texture gradients – the gradient of texture density from slanting objects – also provides very useful information.
    [Show full text]