3-D Audio Using Loudspeakers ~: ~

Total Page:16

File Type:pdf, Size:1020Kb

3-D Audio Using Loudspeakers ~: ~ 3-D Audio Using Loudspeakers William G. Gardner B. S., Computer Science and Engineering, Massachusetts Institute of Technology, 1982 M. S., Media Arts and Sciences, Massachusetts Institute of Technology, 1992 Submitted to the Program in Media Arts and Sciences, School of Architecture and Planning in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy at the Massachusetts Institute of Technology September, 1997 ©Massachusetts Institute of Technology, 1997. All Rights Reserved. Author Program in Media Arts and Sciences August 8, 1997 Certified by i bBarry L. Vercoe Professor of Media Arts and Sciences 7 vlassachusetts Institute of Tecgwlogy Accepted by V V Stephen A. Benton Chair, Departmental Committee on Graduate Students Program in Media Arts and Sciences Massachusetts Institute of Technology ~: ~ 2 3-D Audio Using Loudspeakers William G. Gardner Submitted to the Program in Media Arts and Sciences, School of Architecture and Planning on August 8, 1997, in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy. Abstract 3-D audio systems, which can surround a listener with sounds at arbitrary locations, are an important part of immersive interfaces. A new approach is presented for implementing 3-D audio using a pair of conventional loudspeakers. The new idea is to use the tracked position of the listener's head to optimize the acoustical presentation, and thus produce a much more realistic illusion over a larger listening area than existing loudspeaker 3-D audio systems. By using a remote head tracker, for instance based on computer vision, an immersive audio environment can be created without donning headphones or other equipment. The general approach to a 3-D audio system is to reconstruct the acoustic pressures at the listener's ears that would result from the natural listening situation to be simulated. To accomplish this using loudspeakers requires that first, the ear signals corresponding to the target scene are synthesized by appropriately encoding directional cues, a process known as "binaural synthesis," and second, these signals are delivered to the listener by inverting the transmission paths that exist from the speakers to the listener, a process known as "crosstalk cancellation." Existing crosstalk cancellation systems only function at a fixed listening location; when the listener moves away from the equalization zone, the 3-D illusion is lost. Steering the equalization zone to the tracked listener preserves the 3-D illu- sion over a large listening volume, thus simulating a reconstructed soundfield, and also provides dynamic localization cues by maintaining stationary external sound sources during head motion. This dissertation will discuss the theory, implementation, and testing of a head-tracked loudspeaker 3-D audio system. Crosstalk cancellers that can be steered to the location of a tracked listener will be described. The objective performance of these systems has been evaluated using simulations and acoustical measurements made at the ears of human subjects. Many sound localization experiments were also conducted; the results show that head-tracking both significantly improves localization when the listener is displaced from the ideal listening location, and also enables dynamic localiza- tion cues. Thesis Supervisor: Barry L. Vercoe Professor of Media Arts and Sciences This work was performed at the MIT Media Laboratory. Support for this work was provided in part by Motorola. The views expressed within do not necessarily reflect the views of the supporting sponsors. Doctoral Dissertation Committee Thesis Advisor Barry L. Vercoe Professor of Media Arts and Sciences Massachusetts Institute of Technology Thesis Reader William M. Rabinowitz Principal Research Scientist MIT Research Laboratory for Electronics Thesig§Reader David Griesinger Lexicon, Inc. Waltham, MA Thesis Reader Jean-Marc Jot Charg6 de Recherche IRCAM Paris, France Acknowledgments First and foremost, I would like to thank my advisor, Barry Vercoe, for providing constant encouragement and support through this long process. Barry's vision has resulted in the creation of the Machine Listening Group (formerly the Music and Cognition Group) at the MIT Media Lab, where researchers such as myself can freely pursue topics in the under- standing and synthesis of music and audio. Barry has always encouraged me to "look at the big picture," a task which I have not mastered. I can't imagine having a better doctoral committee than Bill Rabinowitz, David Griesinger, and Jean-Marc Jot. Each has a particular perspective that complements the others. Bill has been cheerful and supportive throughout; he has been particularly helpful with the psycho- acoustic validation portion of this work. Bill also arranged for the use of the KEMAR and MIT's anechoic chamber, and he assisted me in building miniature microphones for mak- ing ear recordings. David has worked closely with me on previous projects, most notably a study of reverberation perception. We've met numerous times to discuss room reverbera- tion, loudspeaker audio systems, and spatial hearing. He strives to find simple solutions to difficult problems; I hope I have emulated that ideal here. Jean-Marc's work in spatial audio synthesis, strongly grounded in theory, has been inspirational. Some of the key ideas in this work are the result of many illuminating email discussions with him. My parents are no doubt responsible for my interest in science and for my choice of schooling at MIT. At a young age, they took me to MIT to see an exhibit of moon dust recently brought back from the lunar surface. From that time on I was certain that I would attend MIT; after a total of twelve years here (both undergraduate and graduate) it seems hard to believe that I'm going to get out. Mr. Aloian at Belmont Hill School deserves par- ticular credit for getting me into MIT as an undergrad. I would like to thank my colleagues in the Machine Listening Group for providing a stim- ulating and fun place to work. Thanks are extended to current group members Keith Mar- tin, Michael Casey, Eric Scheirer, Paris Smaragdis, and Jonathan Feldman. Keith Martin, my current officemate, was closely involved with the early stages of this work, assisting me with the measurement of the KEMAR HRTF data. Keith is also to be thanked for pro- viding a large music library for my late night listening pleasure. Eric Scheirer provided valuable proofreading assistance. Mike Casey, by using my systems in a number of audio productions, has both advertised my work and reminded me of the need for production- friendly designs. Thanks are also extending to former members of our group, including Dan Ellis, Nicolas Saint-Arnaud, Jeff Bilmes, Tom Maglione, and Mary Ann Norris. Dan Ellis, my former officemate, deserves particular accolades. In addition to being a dear friend and providing emotional support during the initial stages of this work, he helped me solve countless prob- lems ranging from computer glitches to theoretical issues. Dan is also credited with sug- gesting that I use the Tcl/Tk Toolkit to build a graphical interface for the sound localization experiment software; this was a huge time saver. Connie Van Rheenen, Betty Lou McClanahan, Greg Tucker, and Molly Bancroft provided essential support at the Media Lab. Thanks are also extended to Bob Chidlaw at Kurzweil Music Systems, who first introduced me to the wonderful world of digital signal process- ing. Finally, none of this would be possible without Felice, who has been an unfaltering source of confidence, support, and love in this long and difficult process. This thesis is dedicated to her. Contents 1 Introduction .......................................................................................... 11 1.1 M otivation: spatial auditory displays ............................................................................. 11 1.2 Ideas to be investigated .................................................................................................. 13 1.3 Applications ...................................................................................................................... 14 1.4 Organization of this docum ent ....................................................................................... 15 2 B ackground .......................................................................................... 17 2.1 Sound localization in spatial auditory displays.............................................................. 17 2.1.1 Interaural cues................................................................................................... 17 2.1.2 Spectral cues ..................................................................................................... 18 2.1.3 Individualized HRTFs..................................................................................... 19 2.1.4 Externalization................................................................................................ 20 2.1.5 Dynam ic localization ....................................................................................... 20 2.2 Loudspeaker displays ..................................................................................................... 21 2.2.1 Stereo .................................................................................................................. 21 2.2.2 Crosstalk cancellation..................................................................................... 22 2.2.3 Inverse filtering of room acoustics.................................................................
Recommended publications
  • Optimal Crosstalk Cancellation for Binaural Audio with Two Loudspeakers
    Optimal Crosstalk Cancellation for Binaural Audio with Two Loudspeakers Edgar Y. Choueiri Princeton University [email protected] Crosstalk cancellation (XTC) yields high-spatial-fidelity reproduction of binaural audio through loudspeakers allowing a listener to perceive an accurate 3-D image of a recorded soundfield. Such accurate 3-D sound reproduction is useful in a wide range of applications in the medical, military and commercial audio sectors. However, XTC is known to add a severe spectral coloration to the sound and that has been an impediment to the wide adoption of loudspeaker-based binaural audio. The nature of this coloration in two-loudspeaker XTC systems, and the fundamental aspects of the regularization methods that can be used to optimally control it, were studied analytically using a free-field two-point-source model. It was shown that constant-parameter regularization, while effective at decreasing coloration peaks, does not yield optimal XTC filters, and can lead to the formation of roll-offs and doublet peaks in the filter’s frequency response. Frequency-dependent regularization was shown to be significantly better for XTC optimization, and was used to derive a prescription for designing optimal two-loudspeaker XTC filters, whereby the audio spectrum is divided into adjacent bands, each of is which associated with one of three XTC impulse responses, which were derived analytically. Aside from the sought fundamental insight, the analysis led to the formulation of band-assembled XTC filters, whose optimal properties favor their practical use for enhancing the spatial realism of two-loudspeaker playback of standard stereo recordings containing binaural cues. I.
    [Show full text]
  • Practises in Listener Envelopment with Controllable Spatial Audio
    DIGITAL AUDIO SYSTEMS BRUNO MARION WRITTEN REVIEW 2 440566533 Practises in Listener Envelopment with Controllable Spatial Audio Introduction The envelopment of the listener in the sound field can be created in a number of ways. The two most import factors to Binaural Quality Index in acoustic space, be it real or virtual; is the Auditory Source Width and the Listener Envelopment Factor (Cardenas, et al., 2012). Multiple solutions exist for the increase in ASW and LEV, those discussed here will be Natural Sounding Artificial Reverberation, Recirculating Delays, Higher-Order Ambisonics and finally Higher-Order Speakers. Reverberation A computationally effective artificial reverberation can be easily achieved by using an exponentially decaying impulse and convolving this with the original audio signal. Inversing a duplication of this signal can also give a stereo effect but when compared to reality, this single-impulse method does not come close to creating the complex interactions of plane waves in a given space as each time a wave reflects from a surface or another wave, it is properties are affected as a function of frequency, phase and amplitude. The two main problems cited by Manfred Schroeder with artificial reverberation are: 1. A non-flat Amplitude Frequency Response 2. Low Echo Density (Schroeder, 1961) Shroeder offers solutions to these problems by use of an all-pass filter. An allpass filter is a method of filtering an audio signal without manipulation of other properties and can be constructed by combining a Finite Impuse Respose (FIR) feed forward and feedback. Figure 1 The allpass filter where fc is equal to the cut off frequency, fcl is equal to the lower cu toff frequency and fch is equal to the higher cut off frequency.
    [Show full text]
  • Franz Zotter Matthias Frank a Practical 3D Audio Theory for Recording
    Springer Topics in Signal Processing Franz Zotter Matthias Frank Ambisonics A Practical 3D Audio Theory for Recording, Studio Production, Sound Reinforcement, and Virtual Reality Springer Topics in Signal Processing Volume 19 Series Editors Jacob Benesty, INRS-EMT, University of Quebec, Montreal, QC, Canada Walter Kellermann, Erlangen-Nürnberg, Friedrich-Alexander-Universität, Erlangen, Germany The aim of the Springer Topics in Signal Processing series is to publish very high quality theoretical works, new developments, and advances in the field of signal processing research. Important applications of signal processing will be covered as well. Within the scope of the series are textbooks, monographs, and edited books. More information about this series at http://www.springer.com/series/8109 Franz Zotter • Matthias Frank Ambisonics A Practical 3D Audio Theory for Recording, Studio Production, Sound Reinforcement, and Virtual Reality Franz Zotter Matthias Frank Institute of Electronic Music and Acoustics Institute of Electronic Music and Acoustics University of Music and Performing Arts University of Music and Performing Arts Graz, Austria Graz, Austria ISSN 1866-2609 ISSN 1866-2617 (electronic) Springer Topics in Signal Processing ISBN 978-3-030-17206-0 ISBN 978-3-030-17207-7 (eBook) https://doi.org/10.1007/978-3-030-17207-7 © The Editor(s) (if applicable) and The Author(s) 2019. This book is an open access publication. Open Access This book is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adap- tation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
    [Show full text]
  • Accurate Reproduction of Binaural Recordings Through Individual Headphone Equalization and Time Domain Crosstalk Cancellation
    PROCEEDINGS of the 23rd International Congress on Acoustics 9 to 13 September 2019 in Aachen, Germany Accurate reproduction of binaural recordings through individual headphone equalization and time domain crosstalk cancellation David GRIESINGER1 1 David Griesinger Acoustics, Cambridge, MA, USA ABSTRACT Accessing the acoustic quality of spaces of all sizes depends on methods that can instantly and precisely compare the sound of different seats and spaces. Complex systems using many loudspeakers have been developed that hopefully achieve this goal. Binaural technology offers a simpler solution. The sound pressure at a listener’s eardrums can be measured with probe microphones, and reproduced with headphones or speakers calibrated at the eardrums. When carefully done the scene is precisely reproduced. But due to the variability of ear canal resonances such recordings and playbacks are highly individual. In this paper we present methods that are non-individual. Our recordings from a dummy head or from the eardrums are equalized to be essentially frequency linear to sound sources in front, giving the recording head the frontal frequency response of studio microphones. The recordings are then played back either through headphones equalized at the eardrum to match the response of a frontal source, or with a simple, non-individual crosstalk cancelling system. We equalize headphones at an individual’s eardrums using equal loudness measurements, and generate crosstalk cancellation with a non-individual algorithm in the time domain. Software apps and plug-ins that enable headphone equalization and crosstalk cancellation on computers and cellphones are now available. Keywords: Binaural, Headphones, Crosstalk 1. INTRODUCTION This paper concerns methods for binaurally recording and later precisely reproducing the sound in a hall or room.
    [Show full text]
  • Scene-Based Audio and Higher Order Ambisonics
    Scene -Based Audio and Higher Order Ambisonics: A technology overview and application to Next-Generation Audio, VR and 360° Video NOVEMBER 2019 Ferdinando Olivieri, Nils Peters, Deep Sen, Qualcomm Technologies Inc., San Diego, California, USA The main purpose of an EBU Technical Review is to critically examine new technologies or developments in media production or distribution. All Technical Reviews are reviewed by 1 (or more) technical experts at the EBU or externally and by the EBU Technical Editions Manager. Responsibility for the views expressed in this article rests solely with the author(s). To access the full collection of our Technical Reviews, please see: https://tech.ebu.ch/publications . If you are interested in submitting a topic for an EBU Technical Review, please contact: [email protected] EBU Technology & Innovation | Technical Review | NOVEMBER 2019 2 1. Introduction Scene Based Audio is a set of technologies for 3D audio that is based on Higher Order Ambisonics. HOA is a technology that allows for accurate capturing, efficient delivery, and compelling reproduction of 3D audio sound fields on any device, such as headphones, arbitrary loudspeaker configurations, or soundbars. We introduce SBA and we describe the workflows for production, transport and reproduction of 3D audio using HOA. The efficient transport of HOA is made possible by state-of-the-art compression technologies contained in the MPEG-H Audio standard. We discuss how SBA and HOA can be used to successfully implement Next Generation Audio systems, and to deliver any combination of TV, VR, and 360° video experiences using a single audio workflow. 1.1 List of abbreviations & acronyms CBA Channel-Based Audio SBA Scene-Based Audio HOA Higher Order Ambisonics OBA Object-Based Audio HMD Head-Mounted Display MPEG Motion Picture Experts Group (also the name of various compression formats) ITU International Telecommunications Union ETSI European Telecommunications Standards Institute 2.
    [Show full text]
  • Spatialized Audio Rendering for Immersive Virtual Environments
    Spatialized Audio Rendering for Immersive Virtual Environments Martin Naef1 Oliver Staadt2 Markus Gross1 1Computer Graphics Laboratory 2Computer Science Department ETH Zurich, Switzerland University of California, Davis, USA +41-1-632-7114 +1-530-752-4821 {naef, grossm}@inf.ethz.ch [email protected] ABSTRACT We are currently developing a novel networked SID, the blue-c, which combines immersive projection with multi-stream video We present a spatialized audio rendering system for the use in acquisition and advanced multimedia communication. Multiple immersive virtual environments. The system is optimized for ren- interconnected portals will allow remotely located users to meet, to dering a sufficient number of dynamically moving sound sources communicate, and to collaborate in a shared virtual environment. in multi-speaker environments using off-the-shelf audio hardware. We have identified spatial audio as an important component for Based on simplified physics-based models, we achieve a good this system. trade-off between audio quality, spatial precision, and perfor- In this paper, we present an audio rendering pipeline and system mance. Convincing acoustic room simulation is accomplished by suitable for spatially immersive displays. The pipeline is targeted integrating standard hardware reverberation devices as used in the at systems which require a high degree of flexibility while using professional audio and broadcast community. We elaborate on inexpensive consumer-level audio hardware. important design principles for audio rendering as well as on prac- For successful deployment in immersive virtual environments, tical implementation issues. Moreover, we describe the integration the audio rendering pipeline has to meet several important require- of the audio rendering pipeline into a scene graph-based virtual ments: reality toolkit.
    [Show full text]
  • Localization of 3D Ambisonic Recordings and Ambisonic Virtual
    Localization of 3D Ambisonic Recordings and Ambisonic Virtual Sources Sebastian Braun and Matthias Frank UniversitÄatfÄurMusik und darstellende Kunst Graz, Austria Institut fÄurElektronische Musik und Akustik, Email: [email protected] Abstract The accurate recording and reproduction of real 3D sound environments is still a challenging task. In this paper, the two most established 3D microphones are evaluated: the Sound¯eld SPS200 and MH Acoustics' Eigenmike EM32, according to 1st and 4th order Ambisonics. They are compared to virtual encoded sound sources of the same orders in a localization test. For the reproduction, an Ambisonics system with 24 (12+8+4) hemispherically arranged loudspeakers is used. In order to compare to existing results from the literature, this paper focuses on sound sources in the horizontal plane. As expected, the 4th order sources yield better localization as the 1st order sources. Within each order, the real recordings and the virtual encoded sources show a good correspondence. Introduction Coordinate system. Fig. 1 shows the coordinate sys- tem used in this paper. The directions θ are vectors Surround sound systems have entered movie theaters and of unit length that depend on the azimuth angle ' and our living rooms during the last decades. But these zenith angle # (# = 90± in the horizontal plane) systems are still limited to horizontal sound reproduc- 0 1 tion. Recently, further developments, such as 22.2 [1] cos(') sin(#) and AURO 3D [2], expanded to the third dimension by θ('; #) = @sin(') sin(#)A : (1) adding height channels. cos(#) The third dimension envolves new challenges in panning and recording [3]. VBAP [4] is the simplest method for z panning and at the same time the most flexible one, as it works with arbitrary loudspeaker arrangements.
    [Show full text]
  • The Impact of Multichannel Game Audio on the Quality of Player Experience and In-Game Performance
    The Impact of Multichannel Game Audio on the Quality of Player Experience and In-game Performance Joseph David Rees-Jones PhD UNIVERSITY OF YORK Electronic Engineering July 2018 2 Abstract Multichannel audio is a term used in reference to a collection of techniques designed to present sound to a listener from all directions. This can be done either over a collection of loudspeakers surrounding the listener, or over a pair of headphones by virtualising sound sources at specific positions. The most popular commercial example is surround-sound, a technique whereby sounds that make up an auditory scene are divided among a defined group of audio channels and played back over an array of loudspeakers. Interactive video games are well suited to this kind of audio presentation, due to the way in which in-game sounds react dynamically to player actions. Employing multichannel game audio gives the potential of immersive and enveloping soundscapes whilst also adding possible tactical advantages. However, it is unclear as to whether these factors actually impact a player’s overall experience. There is a general consensus in the wider gaming community that surround-sound audio is beneficial for gameplay but there is very little academic work to back this up. It is therefore important to investigate empirically how players react to multichannel game audio, and hence the main motivation for this thesis. The aim was to find if a surround-sound system can outperform other systems with fewer audio channels (like mono and stereo). This was done by performing listening tests that assessed the perceived spatial sound quality and preferences towards some commonly used multichannel systems for game audio playback over both loudspeakers and headphones.
    [Show full text]
  • Binaural Basics.Pdf
    INAURAL Binaural recordings will add a new dimension to your audio world. JOHN SUNlER mikes feed two channels which are kept entirely separated from the source all the way to the final listener, whether live, a record- ing, or a broadcast. The listener wears stereo headphones and the between the two. That's because original left ear signal must be most source material isn't de- routed properly to the left ear and ALTHOUGH MOST AUDIOPHJLES ARE signed for headphone listening. the right to the right or the effect familiar with the term binal~ral, An unnaturally exaggerated is compromised. The final result there's still quite a bit of con- effect is created with head- is for the listener to be sonically fusion about it. Early in stereo phones, as though half an or- transported to where the sounds history the terms binaural and chestra is on one side of your originated, rather than attempt- stereo were used interchangea- head and the other half on the ing to bring the sounds into the bly, even though the two record- other side, with a hole in the mid- listener's room as with speakers. ing methods are totally different. dle. Also, the music sounds as if The left speaker signal is pre- Recording pioneer Emory Cook it's happening inside your head vented from feeding into the lis- caused some of that confusion by rather than out in the room. No tener's right ear, and vice versa, calling his early 50's twin- serious record producer would with binaural playback on stereo grooved stereo LP's binaural ever monitor a recording session headphones.
    [Show full text]
  • What's New in Pro Tools 12.8.2
    What’s New in Pro Tools® and Pro Tools | HD version 12.8.2 Legal Notices © 2017 Avid Technology, Inc., (“Avid”), all rights reserved. This guide may not be duplicated in whole or in part without the written consent of Avid. 003, 192 Digital I/O, 192 I/O, 96 I/O, 96i I/O, Adrenaline, AirSpeed, ALEX, Alienbrain, AME, AniMatte, Archive, Archive II, Assistant Station, Audiotabs, AudioStation, AutoLoop, AutoSync, Avid, Avid Active, Avid Advanced Response, Avid DNA, Avid DNxcel, Avid DNxHD, Avid DS Assist Station, Avid Ignite, Avid Liquid, Avid Media Engine, Avid Media Processor, Avid MEDIArray, Avid Mojo, Avid Remote Response, Avid Unity, Avid Unity ISIS, Avid VideoRAID, AvidRAID, AvidShare, AVIDstripe, AVX, Beat Detective, Beauty Without The Bandwidth, Beyond Reality, BF Essentials, Bomb Factory, Bruno, C|24, CaptureManager, ChromaCurve, ChromaWheel, Cineractive Engine, Cineractive Player, Cineractive Viewer, Color Conductor, Command|8, Control|24, Cosmonaut Voice, CountDown, d2, d3, DAE, D-Command, D-Control, Deko, DekoCast, D-Fi, D-fx, Digi 002, Digi 003, DigiBase, Digidesign, Digidesign Audio Engine, Digidesign Development Partners, Digidesign Intelligent Noise Reduction, Digidesign TDM Bus, DigiLink, DigiMeter, DigiPanner, DigiProNet, DigiRack, DigiSerial, DigiSnake, DigiSystem, Digital Choreography, Digital Nonlinear Accelerator, DigiTest, DigiTranslator, DigiWear, DINR, DNxchange, Do More, DPP-1, D-Show, DSP Manager, DS-StorageCalc, DV Toolkit, DVD Complete, D-Verb, Eleven, EM, Euphonix, EUCON, EveryPhase, Expander, ExpertRender, Fairchild,
    [Show full text]
  • Binaural Reproduction of Higher Order Ambisonics
    Binaural Reproduction of Higher Order Ambisonics A Real-Time Implementation and Perceptual Improvements Jakob Vennerød Electronics System Design and Innovation Submission date: June 2014 Supervisor: Peter Svensson, IET Norwegian University of Science and Technology Department of Electronics and Telecommunications Problem description A spherical microphone array is a scalable array with a certain number of microphones, where the number of microphones determines how directive the array can be implemented. The microphone signals can be processed by means of spherical harmonic (SH) modal functions, e.g. in the Higher Order Ambisonics format. This format is easily scalable between different array sizes, and sound fields can easy be rotated in this format. One can also convert between Higher Order Ambisonics and the binaural format for headphone reproduction. In this project, the student should study how signals from a spherical microphone array, in the SH format, can be used to create rotatable sound fields that can be repro- duced through headphones, with a head-tracking device. A real-time system shall, if possible, be implemented. i ii Preface This thesis marks the end of six years of studying in Trondheim, and my completion of the MSc. degree in Electronics at the Norwegian University of Science and Technology (NTNU). The thesis work was done from January to June, 2014. Initially, the topic of this thesis developed from a wish to study and work with microphone arrays, which is a rather popular, but also broad, field of acoustics today. I soon discovered that spatial audio and binaural sound would be the main ingredients of the thesis, which was not my primary field of interest prior to this work.
    [Show full text]
  • AMBEO® for Binaural AMBEO How It Works: for Binaural Normal Stereo Audio Is Limited to Two Dimensions: Only Left and Right Inside the Head of the Listener
    AMBEO® for Binaural AMBEO How it works: for Binaural Normal stereo audio is limited to two dimensions: only left and right inside the head of the listener. Binaural audio breaks this barrier by allowing sounds to be placed anywhere in front, behind, above, or below the listener in three dimensions, while still using only a stereo signal to carry the audio. Technically speaking, binaural audio is a stereo audio signal that has been treated with the same temporal and spatial acoustic properties that, in the 3D audio can be created in diverse ways. real world, allow us to hear sounds all around us, in three dimensions. These The right recording technique is defined acoustic properties are simulated with Head Related Transfer Function Filters, by the desired playback device. Of the or HRTFs, which render a virtual 3D surround experience over headphones. several immersive audio techniques that Binaural recording is a method to create binaural audio by recording sounds Sennheiser is offering solution for, AMBEO in real life. It uses two microphones, to create a natural sensation as if you for binaural is the most appropriate and were actually in the room where the sound is being produced. It is an easy convenient solution to deliver 3D content and effective technique for creating immersive audio content for playback to mobile platforms and headphones. over headphones. Introducing AMBEO, Immersive Audio by Sennheiser AMBEO is Sennheiser’s program and sub-brand for immersive audio, which covers immersive audio products and technologies for the entire audio signal chain, from capture to mixing and processing to reproduction.
    [Show full text]