Sound Localization
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Representation of Sound Localization Cues in the Auditory Thalamus of the Barn Owl
Proc. Natl. Acad. Sci. USA Vol. 94, pp. 10421–10425, September 1997 Neurobiology Representation of sound localization cues in the auditory thalamus of the barn owl LARRY PROCTOR* AND MASAKAZU KONISHI Division of Biology, MC 216–76, California Institute of Technology, Pasadena, CA 91125 Contributed by Masakazu Konishi, July 14, 1997 ABSTRACT Barn owls can localize a sound source using responses of neurons throughout the N.Ov to binaural sound either the map of auditory space contained in the optic tectum localization cues to determine what, if any, transformations or the auditory forebrain. The auditory thalamus, nucleus occur in the representation of auditory information. ovoidalis (N.Ov), is situated between these two auditory areas, and its inactivation precludes the use of the auditory forebrain METHODS for sound localization. We examined the sources of inputs to the N.Ov as well as their patterns of termination within the Adult barn owls (Tyto alba), while under ketamineydiazepam nucleus. We also examined the response of single neurons anesthesia (Ketaset, Aveco, Chicago; 25 mgykg diazepam; 1 within the N.Ov to tonal stimuli and sound localization cues. mgykg intramuscular injection), were placed in a stereotaxic Afferents to the N.Ov originated with a diffuse population of device that held the head tilted downward at an angle of 30° neurons located bilaterally within the lateral shell, core, and from the horizontal plane. A stainless steel plate was attached medial shell subdivisions of the central nucleus of the inferior to the rostral cranium with dental cement, and a reference pin colliculus. Additional afferent input originated from the ip- was glued to the cranium on the midline in the plane between silateral ventral nucleus of the lateral lemniscus. -
Master Project Interactive Media Design Master Program Södertörn Högskola
Master Project Interactive Media Design Master Program Södertörn Högskola Supervisor: Minna Räsänen External: Annika Waern (MobileLife Centre) Student: Syed Naseh Title: Mobile SoundAR: Your Phone on Your Head Table of Contents Abstract....................................................................................................................................... 3 Keywords.................................................................................................................................... 3 ACM Classification Keywords................................................................................................... 3 General Terms............................................................................................................................. 3 Introduction.................................................................................................................................4 Immersion issues without head-tracking.....................................................................................5 Front-back confusion.................................................................................................... 5 Interaural Time Difference ITD ................................................................................... 5 Interaural Intensity Difference IID .............................................................................. 6 Design Problem...........................................................................................................................7 Design Methodology...................................................................................................................7 -
Discrimination of Sound Source Positions Following Auditory Spatial
Discrimination of sound source positions following auditory spatial adaptation Stephan Getzmann Ruhr-Universität Bochum, Kognitions- & Umweltpsychologie; Email: [email protected] Abstract auditory contrast effect in sound localization increases with in- Based on current models of auditory spatial adaptation in human creasing adapter duration, indicating a time dependence in the sound localization, the present study evaluated whether auditory strength of adaptation. Analogously, improvements in spatial dis- discrimination of two sound sources in the median plane (MP) crimination should be stronger following a long adapter, and improves after presentation of an appropriate adapter sound. Fif- weaker following a short adapter. In order to test this assumption, teen subjects heard two successively presented target sounds (200- in a third condition a short adapter sound preceded the target ms white noise, 500 ms interstimulus interval), which differed sounds. Best performance was expected in the strong-adapter con- slightly in elevation. Subjects had to decide whether the first dition, followed by the weak- and the no-adapter condition. sound was emitted above or below the second. When a long adap- ter sound (3-s white noise emitted at the elevation of the first tar- 2. Method get sound) preceded the stimulus pair, performance improved in Fifteen normal hearing subjects (age range 19-52 years) volun- speed and accuracy. When a short adapter sound (500 ms) pre- teered as listeners in the experiment which was conducted in a ceded, performance improved only insignificantly in comparison dark, sound-proof, anechoic room. In the subjects' median plane, with a no-adapter control condition. These results may be linked 31 broad-band loudspeakers (Visaton SC 5.9, 5 x 9 cm) were to previous findings concerning sound localization after the pres- mounted ranging from -31° to +31° elevation. -
Of Interaural Time Difference by Spiking Neural Networks Zihan Pan, Malu Zhang, Jibin Wu, and Haizhou Li, Fellow, IEEE
JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2015 1 Multi-Tones’ Phase Coding (MTPC) of Interaural Time Difference by Spiking Neural Networks Zihan Pan, Malu Zhang, Jibin Wu, and Haizhou Li, Fellow, IEEE Abstract—Inspired by the mammal’s auditory localization (in humans 20Hz to 2kHz) for MSO and high frequencies (in pathway, in this paper we propose a pure spiking neural humans >2kHz) for LSO [7]. From the information flow point network (SNN) based computational model for precised sound of view, the location-dependent information from the sounds, localization in the noisy real-world environment, and implement this algorithm in a real-time robotic system with microphone or aforementioned time or intensity differences, are encoded array. The key of this model relies on the MTPC scheme, which into spiking pulses in the MSO and LSO. After that, they are encodes the interaural time difference (ITD) cues into spike projected to the inferior colliculus (IC) in the mid-brain, where patterns. This scheme naturally follows the functional structures the IC integrates both cues for estimating the sound source of human auditory localization system, rather than artificially direction [8]. The most successful model for ITD encoding computing of time difference of arrival. Besides, it highlights the advantages of SNN, such as event-driven and power efficiency. in MSO and cortex decoding might be the “place theory” by The MTPC is pipeliend with two different SNN architectures, an American psychologist named Lloyd Jeffress [9], which is the convolutional SNN and recurrent SNN, by which it shows the also referred to as Jeffress model hereafter. -
The Sonification Handbook Chapter 4 Perception, Cognition and Action In
The Sonification Handbook Edited by Thomas Hermann, Andy Hunt, John G. Neuhoff Logos Publishing House, Berlin, Germany ISBN 978-3-8325-2819-5 2011, 586 pages Online: http://sonification.de/handbook Order: http://www.logos-verlag.com Reference: Hermann, T., Hunt, A., Neuhoff, J. G., editors (2011). The Sonification Handbook. Logos Publishing House, Berlin, Germany. Chapter 4 Perception, Cognition and Action in Auditory Display John G. Neuhoff This chapter covers auditory perception, cognition, and action in the context of auditory display and sonification. Perceptual dimensions such as pitch and loudness can have complex interactions, and cognitive processes such as memory and expectation can influence user interactions with auditory displays. These topics, as well as auditory imagery, embodied cognition, and the effects of musical expertise will be reviewed. Reference: Neuhoff, J. G. (2011). Perception, cognition and action in auditory display. In Hermann, T., Hunt, A., Neuhoff, J. G., editors, The Sonification Handbook, chapter 4, pages 63–85. Logos Publishing House, Berlin, Germany. Media examples: http://sonification.de/handbook/chapters/chapter4 8 Chapter 4 Perception, Cognition and Action in Auditory Displays John G. Neuhoff 4.1 Introduction Perception is almost always an automatic and effortless process. Light and sound in the environment seem to be almost magically transformed into a complex array of neural impulses that are interpreted by the brain as the subjective experience of the auditory and visual scenes that surround us. This transformation of physical energy into “meaning” is completed within a fraction of a second. However, the ease and speed with which the perceptual system accomplishes this Herculean task greatly masks the complexity of the underlying processes and often times leads us to greatly underestimate the importance of considering the study of perception and cognition, particularly in applied environments such as auditory display. -
Acoustic Structure and Musical Function: Musical Notes Informing Auditory Research
OUP UNCORRECTED PROOF – REVISES, 06/29/2019, SPi chapter 7 Acoustic Structure and Musical Function: Musical Notes Informing Auditory Research Michael Schutz Introduction and Overview Beethoven’s Fifth Symphony has intrigued audiences for generations. In opening with a succinct statement of its four-note motive, Beethoven deftly lays the groundwork for hundreds of measures of musical development, manipulation, and exploration. Analyses of this symphony are legion (Schenker, 1971; Tovey, 1971), informing our understanding of the piece’s structure and historical context, not to mention the human mind’s fascination with repetition. In his intriguing book The first four notes, Matthew Guerrieri decon- structs the implications of this brief motive (2012), illustrating that great insight can be derived from an ostensibly limited grouping of just four notes. Extending that approach, this chapter takes an even more targeted focus, exploring how groupings related to the harmonic structure of individual notes lend insight into the acoustical and perceptual basis of music listening. Extensive overviews of auditory perception and basic acoustical principles are readily available (Moore, 1997; Rossing, Moore, & Wheeler, 2013; Warren, 2013) discussing the structure of many sounds, including those important to music. Additionally, several texts now focus specifically on music perception and cognition (Dowling & Harwood, 1986; Tan, Pfordresher, & Harré, 2007; Thompson, 2009). Therefore this chapter focuses 0004314570.INDD 145 Dictionary: NOAD 6/29/2019 3:18:38 PM OUP UNCORRECTED PROOF – REVISES, 06/29/2019, SPi 146 michael schutz on a previously under-discussed topic within the subject of musical sounds—the importance of temporal changes in their perception. This aspect is easy to overlook, as the perceptual fusion of overtones makes it difficult to consciously recognize their indi- vidual contributions. -
Thinking of Sounds As Materials and a Sketch of Auditory Affordances
Send Orders for Reprints to [email protected] 174 The Open Psychology Journal, 2015, 8, (Suppl 3: M2) 174-182 Open Access Bringing Sounds into Use: Thinking of Sounds as Materials and a Sketch of Auditory Affordances Christopher J. Steenson* and Matthew W. M. Rodger School of Psychology, Queen’s University Belfast, United Kingdom Abstract: We live in a richly structured auditory environment. From the sounds of cars charging towards us on the street to the sounds of music filling a dancehall, sounds like these are generally seen as being instances of things we hear but can also be understood as opportunities for action. In some circumstances, the sound of a car approaching towards us can pro- vide critical information for the avoidance of harm. In the context of a concert venue, sociocultural practices like music can equally afford coordinated activities of movement, such as dancing or music making. Despite how evident the behav- ioral effects of sound are in our everyday experience, they have been sparsely accounted for within the field of psychol- ogy. Instead, most theories of auditory perception have been more concerned with understanding how sounds are pas- sively processed and represented and how they convey information of the world, neglecting than how this information can be used for anything. Here, we argue against these previous rationalizations, suggesting instead that information is instan- tiated through use and, therefore, is an emergent effect of a perceiver’s interaction with their environment. Drawing on theory from psychology, philosophy and anthropology, we contend that by thinking of sounds as materials, theorists and researchers alike can get to grips with the vast array of auditory affordances that we purposefully bring into use when in- teracting with the environment. -
Binaural Sound Localization
CHAPTER 5 BINAURAL SOUND LOCALIZATION 5.1 INTRODUCTION We listen to speech (as well as to other sounds) with two ears, and it is quite remarkable how well we can separate and selectively attend to individual sound sources in a cluttered acoustical environment. In fact, the familiar term ‘cocktail party processing’ was coined in an early study of how the binaural system enables us to selectively attend to individual con- versations when many are present, as in, of course, a cocktail party [23]. This phenomenon illustrates the important contributionthat binaural hearing makes to auditory scene analysis, byenablingusto localizeandseparatesoundsources. Inaddition, the binaural system plays a major role in improving speech intelligibility in noisy and reverberant environments. The primary goal of this chapter is to provide an understanding of the basic mechanisms underlying binaural localization of sound, along with an appreciation of how binaural pro- cessing by the auditory system enhances the intelligibility of speech in noisy acoustical environments, and in the presence of competing talkers. Like so many aspects of sensory processing, the binaural system offers an existence proof of the possibility of extraordinary performance in sound localization, and signal separation, but it does not yet providea very complete picture of how this level of performance can be achieved with the contemporary tools of computational auditory scene analysis (CASA). We first summarize in Sec. 5.2 the major physical factors that underly binaural percep- tion, and we briefly summarize some of the classical physiological findings that describe mechanisms that could support some aspects of binaural processing. In Sec. 5.3 we briefly review some of the basic psychoacoustical results which have motivated the development Computational Auditory Scene Analysis. -
Neuroethology in Neuroscience Why Study an Exotic Animal
Neuroethology in Neuroscience or Why study an exotic animal Nobel prize in Physiology and Medicine 1973 Karl von Frisch Konrad Lorenz Nikolaas Tinbergen for their discoveries concerning "organization and elicitation of individual and social behaviour patterns". Behaviour patterns become explicable when interpreted as the result of natural selection, analogous with anatomical and physiological characteristics. This year's prize winners hold a unique position in this field. They are the most eminent founders of a new science, called "the comparative study of behaviour" or "ethology" (from ethos = habit, manner). Their first discoveries were made on insects, fishes and birds, but the basal principles have proved to be applicable also on mammals, including man. Nobel prize in Physiology and Medicine 1973 Karl von Frisch Konrad Lorenz Nikolaas Tinbergen Ammophila the sand wasp Black headed gull Niko Tinbergen’s four questions? 1. How the behavior of the animal affects its survival and reproduction (function)? 2. By how the behavior is similar or different from behaviors in other species (phylogeny)? 3. How the behavior is shaped by the animal’s own experiences (ontogeny)? 4. How the behavior is manifested at the physiological level (mechanism)? Neuroethology as a sub-field in brain research • A large variety of research models and a tendency to focus on esoteric animals and systems (specialized behaviors). • Studying animals while ignoring the relevancy to humans. • Studying the brain in the context of the animal’s natural behavior. • Top-down approach. Archer fish Prof. Ronen Segev Ben-Gurion University Neuroethology as a sub-field in brain research • A large variety of research models and a tendency to focus on esoteric animals and systems (specialized behaviors). -
Neural Development of Binaural Tuning Through Hebbian Learning Predicts Frequency-Dependent Best Delays
11692 • The Journal of Neuroscience, August 10, 2011 • 31(32):11692–11696 Behavioral/Systems/Cognitive Neural Development of Binaural Tuning through Hebbian Learning Predicts Frequency-Dependent Best Delays Bertrand Fontaine1,2 and Romain Brette1,2 1Laboratoire Psychologie de la Perception, CNRS and Universite´ Paris Descartes, 75006 Paris, France, and 2Equipe Audition, De´partement d’Etudes Cognitives, Ecole Normale Supe´rieure, 75005 Paris, France Birds use microsecond differences in the arrival times of the sounds at the two ears to infer the location of a sound source in the horizontal plane. These interaural time differences (ITDs) are encoded by binaural neurons which fire more when the ITD matches their “best delay.” In the textbook model of sound localization, the best delays of binaural neurons reflect the differences in axonal delays of their monaural inputs, but recent observations have cast doubts on this classical view because best delays were found to depend on preferred frequency.Here,weshowthattheseobservationsareinfactconsistentwiththenotionthatbestdelaysarecreatedbydifferencesinaxonal delays, provided ITD tuning is created during development through spike-timing-dependent plasticity: basilar membrane filtering results in correlations between inputs to binaural neurons, which impact the selection of synapses during development, leading to the observed distribution of best delays. Introduction BD Ͻ 1/(2CF), solid curves). From a functional point of view, In many species, interaural time difference (ITD) is the main cue -
Sound Localization in Birds - Physical and Neurophysiological Mechanisms of Sound Localization in the Chicken
Technische Universität München Lehrstuhl für Zoologie Sound Localization in Birds - Physical and Neurophysiological Mechanisms of Sound Localization in the Chicken Hans A. Schnyder Vollständiger Abdruck der von der Fakultät Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt der Technischen Universität München zur Erlangung des akademischen Grades eines Doktors der Naturwissenschaften genehmigten Dissertation. Vorsitzender: Prof. Dr. Rudolf Fries Prüfer der Dissertation: 1. Prof. Dr. Harald Luksch 2. Prof. Dr. Bernhard Seeber Die Dissertation wurde am 09.08.2017 bei der Technischen Universität München eingereicht und durch die Fakultät Wissenschaftszentrum Weihenstephan für Ernährung, Landnutzung und Umwelt am 22.01.2018 angenommen. Summary Accurate sound source localization in three-dimensional space is essential for an animal’s orientation and survival. While the horizontal position can be determined by interaural time and intensity differences, localization in elevation was thought to require external structures that modify sound before it reaches the tympanum. Here I show that in birds even without external structures such as pinnae or feather ruffs, the simple shape of their head induces sound modifications that depend on the elevation of the source. A model of localization errors shows that these cues are sufficient to locate sounds in the vertical plane. These results suggest that the head of all birds induces acoustic cues for sound localization in the vertical plane, even in the absence of external ears. Based on these findings, I investigated the neuronal processing of spatial acoustic cues in the midbrain of birds. The avian midbrain is a computational hub for the integration of auditory and visual information. While at early stages of the auditory pathway time and intensity information is processed separately, information from both streams converges at the inferior colliculus (IC). -
The Perceptual Representation of Timbre
Chapter 2 The Perceptual Representation of Timbre Stephen McAdams Abstract Timbre is a complex auditory attribute that is extracted from a fused auditory event. Its perceptual representation has been explored as a multidimen- sional attribute whose different dimensions can be related to abstract spectral, tem- poral, and spectrotemporal properties of the audio signal, although previous knowledge of the sound source itself also plays a role. Perceptual dimensions can also be related to acoustic properties that directly carry information about the mechanical processes of a sound source, including its geometry (size, shape), its material composition, and the way it is set into vibration. Another conception of timbre is as a spectromorphology encompassing time-varying frequency and ampli- tude behaviors, as well as spectral and temporal modulations. In all musical sound sources, timbre covaries with fundamental frequency (pitch) and playing effort (loudness, dynamic level) and displays strong interactions with these parameters. Keywords Acoustic damping · Acoustic scale · Audio descriptors · Auditory event · Multidimensional scaling · Musical dynamics · Musical instrument · Pitch · Playing effort · Psychomechanics · Sound source geometry · Sounding object 2.1 Introduction Timbre may be considered as a complex auditory attribute, or as a set of attributes, of a perceptually fused sound event in addition to those of pitch, loudness, per- ceived duration, and spatial position. It can be derived from an event produced by a single sound source or from the perceptual blending of several sound sources. Timbre is a perceptual property, not a physical one. It depends very strongly on the acoustic properties of sound events, which in turn depend on the mechanical nature of vibrating objects and the transformation of the waves created as they propagate S.