Concert Hall Acoustics, Springer-Verlag, It Is Considered That the Scale Value of Apparent Heidelberg

Total Page:16

File Type:pdf, Size:1020Kb

Concert Hall Acoustics, Springer-Verlag, It Is Considered That the Scale Value of Apparent Heidelberg Orthogonal Factors Describing Primary and Spatial Sensations of the Sound Field in a Concert Hall Yoichi Ando Graduate School of Science and Technology, Kobe University Rokkodai, Nada, Kobe 657-8501 Japan Subjective preference of the sound field in a concert hall is described based on the model of human auditory-brain system. The model consists of the autocorrelation function (ACF) mechanism and the interaural crosscorrelation function (IACF) mechanism for signals arriving at two ear entrances, and the specialization of human cerebral hemispheres [Ando, Architectural Acoustics, AIP/Springer, 1998]. From this view point, primary sensations such as pitch or missing fundamental, loudness, timbre, and in addition duration sensation which is introduce here as a fourth are well described by the temporal factors extracted from the ACF associated with left hemisphere. And, spatial sensations such as apparent source width (ASW) and subjective diffuseness are described by the spatial factors extracted from the IACF associated with the right hemisphere ORTHOGONAL FACTORS peak. Usually, there are certain correlation between τ τ φ φ n and n +1, and between n and n+1; Primary sensations and spatial sensations as well as (3) Effective duration of the envelope of the subjective preference for sound fields are well normalized ACF, τe, which is defined by the ten- described by a model of the auditory-brain system. The percentile delay and which represents a repetitive model includes autocorrelation function (ACF) and feature or reverberation containing the sound interaural crosscorrelation function (IACF) source it. mechanisms [1,2]. Important evidences supporting this (a) 1 model were discovered in relation to the auditory-brain activity [2]. This article reviews that primary φ sensations and spatial sensations are mainly described 1 by temporal and spatial factors extracted from the ACF ) τ ( 0 and the IACF, respectively. p φ τ Factors extracted from the ACF 1 -1 The ACF is defined by 0 Delay time τ [ms] +T (b) Φ τ 1 τ (1) 0 τ p( ) = p'(t)p'(t+ )dt e 2T -T [dB] -5 where p’(t) = p(t)*s(t), s(t) being the ear sensitivity, ) | which is essentially formed by the transfer function of τ ( p physical system to oval of cochlea. For convenience, φ -10 s(t) may be chosen as the impulse response of an A- weighted network [1,2]. The ACF and the power log | density spectrum contain the same information. There -15 are four factors, which can be extracted from the ACF: 020100 0 Delay time τ [ms] (1) Energy represented at the origin of the delay, Φ (0); p FIGURE 1. Definition of independent factors other than (2) Fine structure, including peaks and delays (Figure Φ (0) extracted from the normalized ACF. (a) Values of 1a). For instance, τ and φ are the delay time and τ φ 1 1 1 and 1 for the first peak; (b) The effective duration of τ φ τ the amplitude of the first peak of ACF, n and n the ACF e is obtained practically by the extrapolation of the envelope of the normalized ACF during the decay, being the delay time and the amplitude of the n-th 5 dB initial (b). The normalized ACF is defined by φp(τ) = Φp(τ) /Φp(0) (2) As a manner shown in Figure 1b, the value of τe is obtained by fitting a straight line for extrapolation of delay time at –10 dB, if the initial envelope of ACF decays exponentially. Therefore, four orthogonal and temporal factors that can be extracted from the ACF are Φp(0), τ1, φ1, and τe . Auditory-Temporal Window In analysis of the running ACF, of particular interest FIGURE 2. Definition of independent factors IACC, is so called an “auditory-temporal window”, 2T in τIACC and WIACC extracted from the normalized IACF. Equation (1), that must be determined. Since the initial τ part of ACF within the effective duration e of the ACF contains the most important information of the signal, PRIMARY SENSATIONS thus the recommended signal duration (2T)r is given by Loudness (2T)r ≈ K1(τe)min [s] (3) Let us now consider primary sensations. Loudness sL is given by where (τe)min is the minimum value of τe obtained by analyzing the running ACF, K1 being the constant sL = f[Φp(0), τ1, φ1, τe, D] (6) around 30 [7]. The running step (Rs) is selected as K2(2T)r, K2 being selected, say, in the range of 1/4 – 3/4. where D is the duration of sound signal as is represented by musical notes. It is worth noticing that τ Factors extracted from the IACF the value of 1 corresponds to pitch of sound and/or the missing fundamental as discussed below. Since the The IACF is given by sampling frequency of the sound wave is more than the twice of the maximum audio frequency, the value +T 10logΦ (0)/Φ (0)ref is far more accurate than the Leq 1 Φlr(τ) = p'l(t)p'r(t+τ)dt (4) which is measured by the sound level meter. 2T -T Scale values of loudness within the critical band were obtained in paired-comparison tests (with filters where p’ (t) = p(t) *s(t), p(t) being the sound with the slope of 1080 or 2068 dB/octave) under the l ,r l,r l,r Φ pressure at the left- and right-ear entrances. The condition of a constant p(0) [2,4]. Obviously, when normalized IACF is given by sound signal has the similar repetitive feature, τe becomes a great value, as like a pure tone, then the 1/2 φlr(τ) = Φlr(τ)/[Φll(0)Φrr(0)] (5) greater loudness results. Thus a plot of loudness versus bandwidth is not flat in the critical band. This where Φll(0) and Φrr(0) are autocorrelation functions contradicts previous results of the frequency range (τ = 0) or sound energies arriving at the left- and right- centered on 1 kHz [5]. ear entrance, respectively. Spatial factors extracted from the IACF are defined in Figure 2 [2]. Pitch In analyzing the running IACF, 2T is selected by Equation (3) also. For the purpose of spatial design for The second primary sensation applying the ACF is sound fields, however, longer values of (2T)r may be the pitch or the missing fundamental of the noise. It is useful, because it is essentially time independent. given by sP = f[Φp(0), τ1, φ1, τe , D] (7) When a sound signal contains only a number of Table 1. Primary sensations in relation to factors extracted harmonics without the fundamental frequency, we hear from the autocorrelation function and the interaural crosscorrelation function. the fundamental as a pitch. This phenomenon is well explained by the delay time of the first peak in the ACF fine structure, τ1 [6,7]. According to experimental Factors Primitive Sensations results on the pitch perceived when listening to a) bandpass noises without any fundamental frequency, Loudness Pitch Timbre Duration the pitch sp is expressed by equation (7) as well, under the condition of a constant Φπ(0). The strength of the ACF Φp(o) X x X X pitch sensation is described by the magnitude of the τ φ 1 X X X X first peak of the ACF, 1. For a signal of short duration, φ x X X X factor D must be taken into account. 1 τe X x X x D xb) xb) Xb) X Timbre The third primary sensation, timbre that includes X and x : Major and minor factors influencing the corresponding response, respectively. pitch, loudness, and duration, might be expressed by a). Timbre in relation to all of temporal and spatial factors is under investigation. b). It is suggested that loudness, pitch and timbre should be sT = f[Φp(0),τe, τ1, φ1, D] (8) examined in relation to the signal duration. It is worth noticing that the intelligibility of single τ syllables as a function of the delay time of single s = f[LL, IACC, IACC, WIACC] (10) reflection is well be calculated by the four orthogonal factors extracted from the running ACF analyzed for where the piece between consonant and vowel sounds [7]. A Φ Φ recent investigation, clearly show that timbre or LL = 10 log [ p(0)/ (0)ref] (11) dissimilarity judgment is an overall subjective response Φ Φ Φ 1/2 Φ Φ similar for the subjective preference of sound fields in And p(0) = [ ll(0) rr(0)] , and ll(0) and rr(0) concert hall. being ACFs at τ = 0 (sound energies), of the signals arriving at the left and right ear-entrances. In four orthogonal factors in Equation (10), the interaural Duration delay time, τ , is a significant factor in determining IACC the perceived horizontal direction of the source. A The forth-primitive sensation, which is introduced well-defined direction is perceived when the here, is the perception of signal duration, which is normalized interaural crosscorrelation function has one given by [12,13] sharp maximum, a high value of the IACC and a narrow value of the W , due to high frequency s = f[Φ (0), τ , φ , τ , D] (9) IACC D p 1 1 e components. On the other hand, subjective diffuseness or no spatial directional impression corresponds to a One of experimental results has been expressed in τ φ low value of IACC (< 0.15) [9]. relation to 1, 1, and D [8]. Table 1 indicates Of particular interest is that, for the perception of a summarization of primary sensations in relation to sound source located in the median plane, the temporal factors extracted from the ACF and physical signal factors extracted from the ACF of sound signal duration D.
Recommended publications
  • フラン•パルレ Franc-Parler
    無料 フ ラ ン ・ パ ル レ gratuit Nº 109 FRANC-PARLER 10月号 octobre http://franc-parler.main.jp 2007 Le mensuel de la culture et de l’actualité francophones 月刊フランス語圏情報誌 ROBERT MÉNARD, REPORTERS SANS FRONTIÈRES: POUR UNE PRESSE LIBRE ロベール・メナール:国境なき記者団−自由な報道のために L’ONG internationale 国際的NGOである Reporters sans fron- 国境なき記者団は tières a été créée en 1985年に現事務局 1985 par le journaliste 長であるジャーナリ Robert Ménard, son スト、ロベール・メ ナールによって、ジ secrétaire général, ャーナリストと報道 dans le but de défen- の基本的権利を護 dre les journalistes et る目的で設 立され le droit fondamental à た。その活動は、各 l’information. Son ac- 国の支部、事務所と tion s’étend aux cinq 120を超える通信員 continents au moyen によって、五大陸に de ses sections na- 及んでいる。このス tionales, ses bureaux ローガンとともに: et son réseau de plus 報道を禁止されるこ de cent vingt corre- とに抗うために手を こまぬいていてはい spondants. Avec pour けない。 slogan: N’attendez pas qu’on vous prive フラン・パルレ:国境 de l’information pour なき記者団によって la défendre. 用いられる方法はど んなものですか? Franc-Parler: ロベール・メナール: Quelles sont そうですね、いいで les méthodes すか、国境なき記者 employées par 団によって果たされ Reporters sans ている活動には、3 frontières? つのタイプがあるの です。何よりまず、メ Robert Ménard: ディアを通じて一般 Bien écoutez, il y a 大衆に情報を与える trois types d’actions 仕事があります。そ qui sont menées の目的は自らに問う par Reporters sans ということです。ひ frontières. D’abord, どい態度をとる国家 t o u t u n t r a v a i l は、一般的に、これ d’information du らの国家が必要とし grand public via les ている、彼らがもの médias. L’objectif すごく必要としてい c’est de se dire: les るのは国際社会であ り、彼らのブランドイ États qui se com- メージは重要なので por tent mal, en す。彼らのブランドイ général , ces États, メージが重要である ils ont besoin, ils ont から、もし私達が、メ énormément besoin ディアを通じて彼ら de la communauté の良くない宣伝をす internationale et leur ると、それは彼らの image de marque est 態度に影響を与える importante.
    [Show full text]
  • Estimation of Direction of Arrival of Acoustic Signals Using Microphone
    Time-Delay-Estimate Based Direction-of-Arrival Estimation for Speech in Reverberant Environments by Krishnaraj Varma Thesis submitted to the Faculty of The Bradley Department of Electrical and Computer Engineering Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Master of Science in Electrical Engineering APPROVED Dr. A. A. (Louis) Beex, Chairman Dr. Ira Jacobs Dr. Douglas K. Lindner October 2002 Blacksburg, VA KEYWORDS: Microphone array processing, Beamformer, MUSIC, GCC, PHAT, SRP-PHAT, TDE, Least squares estimate © 2002 by Krishnaraj Varma Time-Delay-Estimate Based Direction-of-Arrival Estimation for Speech in Reverberant Environments by Krishnaraj Varma Dr. A. A. (Louis) Beex, Chairman The Bradley Department of Electrical and Computer Engineering (Abstract) Time delay estimation (TDE)-based algorithms for estimation of direction of arrival (DOA) have been most popular for use with speech signals. This is due to their simplicity and low computational requirements. Though other algorithms, like the steered response power with phase transform (SRP-PHAT), are available that perform better than TDE based algorithms, the huge computational load required for this algorithm makes it unsuitable for applications that require fast refresh rates using short frames. In addition, the estimation errors that do occur with SRP-PHAT tend to be large. This kind of performance is unsuitable for an application such as video camera steering, which is much less tolerant to large errors than it is to small errors. We propose an improved TDE-based DOA estimation algorithm called time delay selection (TIDES) based on either minimizing the weighted least squares error (MWLSE) or minimizing the time delay separation (MWTDS).
    [Show full text]
  • TA-1VP Vocal Processor
    D01141720C TA-1VP Vocal Processor OWNER'S MANUAL IMPORTANT SAFETY PRECAUTIONS ªª For European Customers CE Marking Information a) Applicable electromagnetic environment: E4 b) Peak inrush current: 5 A CAUTION: TO REDUCE THE RISK OF ELECTRIC SHOCK, DO NOT REMOVE COVER (OR BACK). NO USER- Disposal of electrical and electronic equipment SERVICEABLE PARTS INSIDE. REFER SERVICING TO (a) All electrical and electronic equipment should be QUALIFIED SERVICE PERSONNEL. disposed of separately from the municipal waste stream via collection facilities designated by the government or local authorities. The lightning flash with arrowhead symbol, within equilateral triangle, is intended to (b) By disposing of electrical and electronic equipment alert the user to the presence of uninsulated correctly, you will help save valuable resources and “dangerous voltage” within the product’s prevent any potential negative effects on human enclosure that may be of sufficient health and the environment. magnitude to constitute a risk of electric (c) Improper disposal of waste electrical and electronic shock to persons. equipment can have serious effects on the The exclamation point within an equilateral environment and human health because of the triangle is intended to alert the user to presence of hazardous substances in the equipment. the presence of important operating and (d) The Waste Electrical and Electronic Equipment (WEEE) maintenance (servicing) instructions in the literature accompanying the appliance. symbol, which shows a wheeled bin that has been crossed out, indicates that electrical and electronic equipment must be collected and disposed of WARNING: TO PREVENT FIRE OR SHOCK separately from household waste. HAZARD, DO NOT EXPOSE THIS APPLIANCE TO RAIN OR MOISTURE.
    [Show full text]
  • Re-20 Om.Pdf
    RE-20_e.book 1 ページ 2007年6月8日 金曜日 午後4時32分 Thank you, and congratulations on your choice of the BOSS RE-20 Space Echo. Before using this unit, carefully read the sections entitled: “USING THE UNIT SAFELY” and “IMPORTANT NOTES” (separate sheet). These sections provide important information concerning the proper operation of the unit. Additionally, in order to feel assured that you have gained a good understanding of every feature provided by your new unit, this manual should be read in its entirety. The manual should be saved and kept on hand as a convenient reference. Main Features ● The RE-20 uses COSM technology to faithfully simulate the characteristics of the famed Roland SPACE ECHO RE-201. ● Faithfully reproduces the characteristics of the RE-201, including the echo’s distinctive wow- and flutter-induced wavering and the compressed sound obtained with magnetic saturation. ● The Mode Selector carries on the tradition of the RE-201, offering twelve different reverberation effects through various combinations of the three playback heads and reverb. ● You can set delay times with the TAP input pedal and use an expression pedal (sold separately) for controlling parameters. ● Equipped with a “Virtual Tape Display,” which produces a visual image of a running tape. About COSM (Composite Object Sound Modeling) Composite Object Sound Modeling—or “COSM” for short—is BOSS/Roland’s innovative and powerful technology that’s used to digitally recreate the sound of classic musical instruments and effects. COSM analyzes the many factors that make up the original sound—including its electrical and physical characteristics—and creates a digital model that accurately reproduces the original.
    [Show full text]
  • Implementing an M-Fold Wah-Wah Filter in Matlab
    “What if we had, not one Wah Wah Filter, not two, but 20?”: Implementing an M-Fold Wah-Wah Filter in Matlab Digital Audio Systems, DESC9115, 2020 Master of Interaction Design & Electronic Arts (Audio and Acoustics) Sydney School of Architecture, Design and Planning, The University of Sydney ABSTRACT An M-fold Wah-Wah filter can be described as an effect where multiple Wah-Wah filters are applied to a signal, each at a certain frequency range. This report describes the implementation of such a filter in Matlab. By using preexisting code on a single state-variable bandpass filter, multiple bandpass filters are implemented across a defined frequency spectrum. The filter is adjustable through a number of variables, these being: the number of bandpass filters (M), the damping factor of each filter, the spectrum for which the filters are applied, as well as the Wah Frequency, i.e. the number of cycles through each bandpass. 1. INTRODUCTION The Wah-Wah filter is commonly used by guitarist to alter the shape and tone of the note(s) they are playing. The effect can be described as the combination of ‘u’ and ‘ah’ sounds created by human voice. The mouth’s shape here going from a small O to a big O. The center frequencies are called “formants”. The Wah-Wah pedal works in a similar manner, the formants shifts creating a “wah” sound. The Wah-Wah filter is a time-varying delay line filter. Each filter has a set of unique characteristics such as the range of frequencies the effect is applied to and its Wah- Frequency, i.e.
    [Show full text]
  • Architectural Acoustics
    A COMPARISON OF SOURCE TYPES AND THEIR IMPACTS ON ACOUSTICAL METRICS By KEELY SIEBEIN A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE IN ARCHITECTURAL STUDIES UNIVERSITY OF FLORIDA 2012 1 © 2012 Keely Siebein 2 To my parents 3 ACKNOWLEDGMENTS I would like to thank my mother, father, sisters, brothers, nephews and the rest of my family for all your love and support throughout my life. I would like to thank Professors Gold and Siebein for your continuous support, guidance and encouragement throughout the process of my thesis. I’d also like to thank my fellow acoustics students, Sang Bong, and Sang Bum, Cory, Adam B, Adam G., Jose, Jorge and Jenn and any others who helped me with the data collection and reviewing of the material and for keeping me on track with the process. I’d also like to thank Dave for your continuous support throughout this entire process, for believing in me and doing all that you could to help me throughout the course of my studies. I’d also like to thank my wonderful friends, for believing in me and encouraging me throughout my studies. Thanks to my Dad, for being my father, professor, boss, but most importantly, my mentor. It’s an honor to have studied with you and learned from you in this capacity and I will treasure these years and the time I spent learning under your guidance forever. Thank you all for everything. 4 TABLE OF CONTENTS page ACKNOWLEDGMENTS .................................................................................................
    [Show full text]
  • Subwoofer Arrays: a Practical Guide
    Subwoofer Arrays A Practical Guide VVVeVeeerrrrssssiiiioooonnnn 111 EEElEllleeeeccccttttrrrroooo----VVVVooooiiiicccceeee,,,, BBBuBuuurrrrnnnnssssvvvviiiilllllleeee,,,, MMMiMiiinnnnnneeeessssoooottttaaaa,,,, UUUSUSSSAAAA AAApAppprrrriiiillll,,,, 22202000009999 © Bosch Security Systems Inc. Subwoofer Arrays A Practical Guide Jeff Berryman Rev. 1 / June 7, 2010 TABLE OF CONTENTS 1. Introduction .......................................................................................................................................................1 2. Acoustical Concepts.......................................................................................................................................2 2.1. Wavelength ..........................................................................................................................................2 2.2. Basic Directivity Rule .........................................................................................................................2 2.3. Horizontal-Vertical Independence...................................................................................................3 2.4. Multiple Sources and Lobing ...........................................................................................................3 2.5. Beamforming........................................................................................................................................5 3. Gain Shading....................................................................................................................................................6
    [Show full text]
  • Effect Types and Parameters
    Effect Types and Parameters © 2017 ZOOM CORPORATION Manufacturer names and product names are trademarks or registered trademarks of their respective owners. The names are used only to illustrate sonic characteristics and do not indicate any affiliation with the Zoom Corporation. Effect explanation overview Pedal control possible icon Effect type Effect explanation Parameter range PDL Vol The volume curve of the volume pedal can be set. VOL Adjusts the volume. 0 – 100 P Min Adjusts the volume when the pedal is at minimum position. 0 – 100 Max Adjusts the volume when the pedal is at maximum position. 0 – 100 Curve Sets the volume curve. A , B Effect Screen Parameter Parameter explanation Tempo synchronization possible icon Contents DYNAMICS 3 FILTER 4 DRIVE 6 AMP 8 CABINET 9 MODULATION 10 SFX 12 DELAY 13 REVERB 14 PEDAL 15 Additional tables 16 2 3 Effect Types and Parameters [ DYNAMICS ] SlowATTCK This effect slows the attack of each note, resulting in a violin-like performance. Time Adjusts the attack time. 1 – 50 Curve Set the curve of volume change during attack. 0 – 10 Tone Adjusts the tone. 0 – 100 VOL Adjusts the volume. 0 – 100 ZOOM's unique noise reduction cuts noise during pauses in playing without affecting ZNR the tone. GTRIN , DETCT Sets control signal detection level. EFXIN Depth Sets the depth of noise reduction. 0 – 100 THRSH Adjusts the effect sensitivity. 0 – 100 Decay Adjust the envelope release. 0 – 100 This is a simulation of the Demeter COMP-1 Compulator. BlackOpt Added parameters allow you to adjust the tone. Comp Adjusts the depth of the compression.
    [Show full text]
  • Digital Guitar Effects Pedal
    Digital Guitar Effects Pedal 01001000100000110000001000001100 010010001000 Jonathan Fong John Shefchik Advisor: Dr. Brian Nutter Texas Tech University SPRP499 [email protected] Presentation Outline Block Diagram Input Signal DSP Configuration (Audio Processing) Audio Daughter Card • Codec MSP Configuration (User Peripheral Control) Pin Connections User Interfaces DSP Connection MSP and DSP Connections Simulink Modeling MSP and DSP Software Flowcharts 2 Objective \ Statements To create a digital guitar effects pedal All audio processing will be done with a DSP 6711 DSK board With an audio daughter card All user peripherals will be controlled using a MSP430 evaluation board Using the F149 model chip User Peripherals include… Floor board switches LCD on main unit Most guitar effect hardware that is available on the market is analog. Having a digital system would allow the user to update the system with new features without having to buy new hardware. 3 Block Diagram DSP 6711 DSK Board Audio Daughter Card Input from Output to Guitar ADC / DAC Amplifier Serial 2 RX Serial 2 [16 bit] Data Path Clock Frame Sync Serial 2 TX MSP 430 Evaluation Board [16 bit] GPIO (3) Outlet MSP430F149 TMS320C6711 Power Supply GND +5V Data Bus (8) RS LEDs (4) EN Voltage +3.3V to MSP R/W Switch Select (4) +5V to LCD LCD Footboard Regulator +0.7V to LCD Drive Display Switch 4 Guitar Input Signal Voltage Range of Input Signal Nominal ~ 300 mV peak-to-peak Maximum ~ 3 V Frequency Range Standard Tuning 500 Hz – 1500 Hz 5 Hardware: DSP
    [Show full text]
  • Object-Based Reverberation for Spatial Audio Coleman, P, Franck, A, Jackson, P, Hughes, RJ, Remaggi, L and Melchior, F 10.17743/Jaes.2016.0059
    Object-based reverberation for spatial audio Coleman, P, Franck, A, Jackson, P, Hughes, RJ, Remaggi, L and Melchior, F 10.17743/jaes.2016.0059 Title Object-based reverberation for spatial audio Authors Coleman, P, Franck, A, Jackson, P, Hughes, RJ, Remaggi, L and Melchior, F Type Article URL This version is available at: http://usir.salford.ac.uk/id/eprint/53438/ Published Date 2017 USIR is a digital collection of the research output of the University of Salford. Where copyright permits, full text material held in the repository is made freely available online and can be read, downloaded and copied for non-commercial private study or research purposes. Please check the manuscript for any further copyright restrictions. For more information, including our policy and submission procedure, please contact the Repository Team at: [email protected]. PAPERS Journal of the Audio Engineering Society Vol. 65, No. 1/2, January/February 2017 (C 2017) DOI: https://doi.org/10.17743/jaes.2016.0059 Object-Based Reverberation for Spatial Audio* PHILIP COLEMAN,1 AES Member, ANDREAS FRANCK,2 AES Member, ([email protected]) PHILIP J. B. JACKSON,1 AES Member, RICHARD J. HUGHES,3 AES Associate Member, LUCA REMAGGI1, AND FRANK MELCHIOR,4 AES Member 1Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, GU2 7XH, UK 2Institute of Sound and Vibration Research, University of Southampton, Southampton, Hampshire, SO17 1BJ, UK 3Acoustics Research Centre, University of Salford, Salford, M5 4WT, UK 4BBC Research and Development, Dock House, MediaCityUK, Salford, M50 2LH, UK Object-based audio is gaining momentum as a means for future audio content to be more immersive, interactive, and accessible.
    [Show full text]
  • Contribution of Precisely Apparent Source Width to Auditory Spaciousness
    Chapter 8 Contribution of Precisely Apparent Source Width to Auditory Spaciousness Chiung Yao Chen Additional information is available at the end of the chapter http://dx.doi.org/10.5772/56616 1. Introduction It has been shown that the apparent source width (ASW) for one-third-octave band pass noises signal offers a satisfactory explanation for functions of the inter-aural cross-correlation (IACC) and WIACC, which is defined as the time interval of the inter-aural cross-correlation function within ten percent of the maximum (Sato and Ando, [18]). In this chapter, the binaural criteria of spatial impression in halls will be investigated by comparing with ASW for the auditory purpose assistant to visual attention, which is called source localization. It was proposed that the ASW could properly define directional impression corresponding to the inter-aural time delay (τIACC) perceived when listening to sound with a sharp peak in the inter-aural cross- correlation function (ICF) with a small value of WIACC. We supposed that the ASW would be sensed not only with regard to the relative amplitudes between reflections in a hall, but the total arrived energies at two ears through the A-weighting network in the brain, termed as listening level (LL) and the temporal characteristics of sound sources. This hypothesis is based on the fact that the spatial experience in a room will be varied by changing the center frequency of one-third-octave band pass noise signal, and the ASW decreases as the main frequency goes up. For the purpose of this chapter, we shall discuss the relationship among some factors, the geometric mean of sound energies at two ears, the reverberation, IACC, τIACC, and WIACC, and whether they are independently related to the sound source on a horizontal plane.
    [Show full text]
  • Characterizing Apparent Source Width Perception
    CONTRIBUTIONS TO HEARING RESEARCH Volume 25 Johannes Käsbach Characterizing apparent source width perception Hearing Systems Department of Electrical Engineering Characterizing apparent source width perception PhD thesis by Johannes Käsbach Preliminary version: August 2, 2016 Technical University of Denmark 2016 © Johannes Käsbach, 2016 Cover illustration by Emma Ekstam. Preprint version for the assessment committee. Pagination will differ in the final published version. This PhD dissertation is the result of a research project carried out at the Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark. The project was partly financed by the CAHR consortium (2/3) and by the Technical University of Denmark (1/3). Supervisors Prof. Torsten Dau Phd Tobias May Hearing Systems Group Department of Electrical Engineering Technical University of Denmark Kgs. Lyngby, Denmark Abstract Our hearing system helps us in forming a spatial impression of our surrounding, especially for sound sources that lie outside our visual field. We notice birds chirping in a tree, hear an airplane in the sky or a distant train passing by. The localization of sound sources is an intuitive concept to us, but have we ever thought about how large a sound source appears to us? We are indeed capable of associating a particular size with an acoustical object. Imagine an orchestra that is playing in a concert hall. The orchestra appears with a certain acoustical size, sometimes even larger than the orchestra’s visual dimensions. This sensation is referred to as apparent source width. It is caused by room reflections and one can say that more reflections generate a larger apparent source width.
    [Show full text]