LABORATORY NOTES

The iLab Neuromorphic Vision C++ Toolkit: Free tools for the next generation of vision algorithms

Because of its truly interdisciplinary nature—benefiting from the latest advances in experimental and computational neuroscience, electrical engineering, control theory, and signal and image processing—neuromorphic engineering is a very complex field. This has been one of the leading motivations for the development of a Neuromorphic Vision Toolkit at the iLab of the University of Southern California, to provide a set of basic tools that can assist newcomers in the field with the development of new models and systems. More generally, the iLab Neuromorphic Vision C++ Toolkit project aims at developing the next generation of vision algorithms, whose architecture will closely mimic the neurobiology of the primate brain rather than being specifically developed for a given set of environmental conditions or tasks. To this end, it provides a software foundation that is specifically geared towards the development of neuromorphic models and systems.

At the core of the toolkit are a number of neuroscience models, initially developed to provide greater understanding of biological vision processing, but here made ready to be applied to engineering challenges such as visually-guided robotics in outdoor environments. Taken together, these models provide general-purpose vision modules that can be easily reconfigured for, and tuned to, specific tasks. The gross driving architecture for a general vision system, the basis of many of the modules available in the toolkit, is shown in Figure 1.

Figure 1. The iLab Neuromorphic Vision C++ Toolkit provides software modules for the implementation of some of the major components of primate vision, as depicted here. These include low-level visual processing algorithms, bottom-up visual attention and a saliency map, and object recognition, as well as algorithms for the rapid computation of scene-level gist and rough layout. When used on a robot, the toolkit additionally provides modules for image and video digitization, short- and long-term symbolic and spatial memories, decision processes, and control of actuators. The source code for the toolkit is freely available.2

Input video, whether captured by camera or from other sources, is first processed by a bank of low-level visual feature detectors. These are sensitive to image properties such as local contrast, orientation, or motion energy, and mimic the known response properties of early visual neurons in the retina, lateral geniculate nucleus of the thalamus, and primary visual cortex. Subsequent visual processing is then split into two cooperating streams. The first is concerned with the rapid computation of the 'gist' and layout of the scene; it provides coarse clues by which the system obtains a sense of the environmental conditions (e.g., indoors vs. outdoors, on a track vs. off-road) and of its position within the environment (e.g., the path is turning left, the scene is highly cluttered). The second stream is concerned with directing both attention and the eyes towards the few most visually-conspicuous objects in the scene. This stage relies on a neural saliency map, which gives a graded measure of 'attractiveness' to every location in the scene, and is modeled on the neural architecture of the posterior parietal cortex in the monkey. At any given point in time, the system uses the gist for basic orientation in the scene and sequentially attends to interesting objects (which could be obstacles, landmarks to aid navigation, or target objects being looked for).

Several neural models are available in the toolkit for the implementation of the next processing stage—concerned with identifying the object that has drawn the attention and the eyes—and most of these models are inspired by the visual-response properties of neurons in the infero-temporal cortex. Finally, additional modules are available for short-term and long-term memory, cognitive knowledge representation, and modulatory feedback from a high-level task definition (e.g., look for the stop sign) to the low-level visual processing (e.g., emphasize the contribution of red to the saliency map, prime the object-recognition module for the 'traffic sign' object class).

Not all of the components shown in the figure have been fully implemented, and many are at a very preliminary stage of development; some do not yet exist at all. The interesting point to note already, however, is how different the biologically-inspired visual-system architecture proposed here is from typical robotic-vision and computer-vision systems, which are usually defined to solve a specific problem (e.g., find a stop sign by looking for its specific shape, using an algorithm matched to its exact geometrical properties). This promises to make the systems developed around this architecture particularly capable when dealing with novel, complex outdoor scenes and unexpected situations, as has been widely demonstrated by, for example, our model of bottom-up attention.1

The entire source code for the iLab Neuromorphic Vision C++ Toolkit is distributed freely upon request, in an effort to encourage more researchers to explore the potential of neuromorphic vision for real-world applications.

For more information about the iLab Neuromorphic Vision C++ Toolkit, see our web page.2

Laurent Itti
University of Southern California

References
1. http://iLab.usc.edu/bu/
2. http://iLab.usc.edu/toolkit/

The Neuromorphic Engineer, Volume 1, Issue 1, 2004