Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank Frank Baumgarte

Total Page:16

File Type:pdf, Size:1020Kb

Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank Frank Baumgarte IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 7, OCTOBER 2002 495 Improved Audio Coding Using a Psychoacoustic Model Based on a Cochlear Filter Bank Frank Baumgarte Abstract—Perceptual audio coders use an estimated masked a band and the range of spectral components that can interact threshold for the determination of the maximum permissible within a band, e.g., two sinusoids creating a beating effect. This just-inaudible noise level introduced by quantization. This es- interaction plays a crucial role in the perception of whether a timate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding sound is noise-like which in turn corresponds to a significantly applications use a uniform (equal bandwidth) spectral decomposi- more efficient masking compared with a tone-like signal [2]. tion as a first step to approximate the frequency selectivity of the The noise or tone-like character is basically determined by human auditory system. However, the equal filter properties of the the amount of envelope fluctuations at the cochlear filter uniform subbands do not match the nonuniform characteristics of outputs which widely depend on the interaction of the spectral cochlear filters and reduce the precision of psychoacoustic mod- eling. Even so, uniform filter banks are applied because they are components in the pass-band of the filter. computationally efficient. This paper presents a psychoacoustic Many existing psychoacoustic models, e.g., [1], [3], and [4], model based on an efficient nonuniform cochlear filter bank employ an FFT-based transform to derive a spectral decom- and a simple masked threshold estimation. The novel filter-bank position of the audio signal into uniform subbands with equal structure employs cascaded low-order IIR filters and appropriate bandwidths. The nonuniform spectral resolution of the auditory down-sampling to increase efficiency. The filter responses are optimized for the modeling of auditory masking effects. Results system is taken into account by summing up the energies of the of the new psychoacoustic model applied to audio coding show appropriate number of neighboring FFT frequency subbands. better performance in terms of bit rate and/or quality of the new Consequently, the phase relation between the spectral compo- model in comparison with other state-of-the-art models using a nents of the different subbands within a cochlear filter band is uniform spectral decomposition. The low delay of the new model not taken into account. Since the cochlear filter slopes are less is particularly suitable for low-delay coders. steep than the subband slopes, they must be approximated by Index Terms—Audio coding, filter bank, masked threshold, spreading the subband energies across several bands. This way model of masking, perceptual model. of mapping the uniform subbands to cochlear filter bands pro- duces envelopes of the output signal that are different from those I. INTRODUCTION measured at the output of the cochlea. The temporal resolution of the spectral decomposition is determined by the transform N PERCEPTUAL audio coding [1], the audio signal is size, i.e., FFT length, and thus, is constant across all center fre- treated as a masker for distortions introduced by lossy I quencies. For high center frequencies this results in a signif- data compression. For this purpose, the masked threshold for icantly lower temporal resolution in comparison with that of the distortions is approximated by a psychoacoustic model. the corresponding cochlear filters. All the described mismatches The masked threshold is the time and frequency-dependent contribute to an inaccurate modeling of masking that causes sub- maximum level that marks the boundary for distortions being optimal coder compression performance. inaudible if superimposed to the audio signal. The initial audio To overcome the mismatch between uniform filter banks and signal processing within the psychoacoustic model consists of a the spectral decomposition of the cochlea, a linear nonuniform spectral decomposition to account for the frequency selectivity cochlear filter bank was developed. A linear filter bank was of the auditory system. However, the auditory system performs chosen because it is computationally less complex than a non- a nonuniform (nonequal bandwidths) spectral decomposition linear one [5], [6]. Furthermore, a psychoacoustic model based of the acoustic signal in the cochlea. This first stage of cochlear on a nonlinear filter bank generally approximates the masked sound processing already determines basic properties of threshold in an iteration process. Applied to audio coding, masking, e.g., the frequency spread of masking which is related this involves encoding, decoding, and threshold computation to the frequency response of the human cochlear filters. Above for each iteration step, which can considerably increase the 1 kHz, the cochlear filter bandwidths increase almost propor- encoder complexity. The linear filter bank does not account for tionally to the center frequency. These bandwidths determine sound level-dependent effects. However, since the playback both, the spectral width of energy integration associated with level of the decoded audio signal is usually unknown, this is considered a minor restriction only. Manuscript received June 20, 2001; revised July 18, 2002. The associate ed- The cochlear filter bank is based on a novel structure that itor coordinating the review of this manuscript and approving it for publication supports the time- and frequency resolution necessary to sim- was Dr. Peter Vary. ulate psychophysical data closely related to cochlear spectral The author is with the Media Signal Processing Research Department, Agere Systems, Berkeley Heights, NJ 07922 USA (e-mail: [email protected]). decomposition properties. It will be shown that this filter bank Digital Object Identifier 10.1109/TSA.2002.804536 is able to closely mimic the spectral and temporal properties 1063-6676/02$17.00 © 2002 IEEE 496 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 10, NO. 7, OCTOBER 2002 Fig. 1. Block diagram of the cochlear filter-bank structure. Fig. 2. Downsampling scheme of the cochlear filter bank. of frequency decomposition of the human peripheral auditory plexity. A simple and efficient way to implement a “stage-wise” system. The benefits of using this filter bank in a new psy- sampling rate reduction is shown in Fig. 2, where a stage com- choacoustic model are explained and evaluated for two different prises a group of those cascaded filter-bank sections having audio coders. An informal subjective quality assessment was equal sampling rate. The rate reduction by a factor of two is carried out for both state-of-the-art coders. For this compar- achieved by leaving out every other sample at the stage input. It ison the coders were used with their individual reference psy- is applied when the cutoff frequency of the LPF cascade output choacoustic model based on a uniform filter bank and with the is below a given ratio with respect to the sampling rate in that new psychoacoustic model. Results show improved coder per- stage to reduce aliasing. The number of sections covering the formance for the new psychoacoustic model. auditory frequency range is usually in the order of 100. It can The paper is organized as follows. The filter-bank structure be adapted to the desired frequency resolution for a specific ap- is described in Section II. In Section III the filter-bank imple- plication. The number of stages is typically chosen between five mentation using low-order IIR filters is presented. The filter re- and nine. sponses are optimized for modeling of masked thresholds. A All the high-pass filters have the same order. Also, all the novel psychoacoustic model based on that filter bank is the sub- low-pass filters have the same order. However, the LPF and HPF ject of Section IV. The experimental setup of the coders used orders can be chosen independently and should be large enough and of the subjective listening tests is outlined in Section V. Re- to accurately model the spectral decomposition features found sults are given in Section VI in terms of the subjective quality in relevant psychophysical data. After the orders are fixed, the and data rate. Conclusions are drawn in Section VII. filter coefficients can be determined by an optimization algo- rithm to minimize the difference between the responses of the II. FILTER-BANK STRUCTURE desired and the proposed filter banks. The responses of the de- sired filters are generally derived from psychophysical measure- The peripheral auditory system performs spectral analysis of ments. the input acoustic signal in the cochlea with spectrally highly overlapping band-pass filters. The nonuniform frequency res- III. FILTER BANK IMPLEMENTATION olution and bandwidth of these filters is approximated in the proposed structure by cascaded IIR filters. Fig. 1 shows the In this section, the cochlear filter bank parameters are given proposed filter-bank structure with low-pass filters (LPF) and and the derivation of the filter coefficients is described for the high-pass filters (HPF). The LPFs in the cascade have a de- application in a psychoacoustic model. It turns out that an LPF creasing cutoff frequency from left to right (see Fig. 1). Each order of and an HPF order of is sufficient LPF output is connected to an HPF. The HPF cutoff frequency to achieve a reasonable approximation of the desired frequency is equal to the cutoff frequency of the LPF cascade segment be- responses. The slopes of the desired magnitude frequency re- tween the filter-bank input and the HPF input of the next section. sponses are chosen according to simple masking models that Thus, the output of each HPF has a bandpass characteristic with assume a constant slope steepness on a Bark [7] or an equiva- respect to the filter-bank input signal. The basic block of an LPF lent-rectangular-bandwidth (ERB) [8] scale. For center frequen- connected to an HPF, as shown in Fig.
Recommended publications
  • Linearisation of Analogue to Digital and Digital to Analogue Converters Can Be Achieved by Threshold Tracking
    LINEARISATION OF ANALOGUE TO DIGITAL AND DIGITAL TO ANALOGUE CONVERTERS ALAN CHRISTOPHER DENT, B.Sc, AMIEE, MIEEE A thesis submitted for the degree of Doctor of Philosophy, to the Faculty of Science, at the University of Edinburgh, May 1990. 7. DECLARATION OF ORIGINALITY I hereby declare that this thesis and the work reported herein was composed and originated entirely by myself in the Department of Electrical Engineering, at the University of Edinburgh, between October 1986 and May 1990. Iffl ABSTRACT Monolithic high resolution Analogue to Digital and Digital to Analogue Converters (ADC's and DAC's), cannot currently be manufactured with as much accuracy as is desirable, due to the limitations of the various fabrication technologies. The tolerance errors introduced in this way cause the converter transfer functions to be nonlinear. This nonlinearity can be quantified in static tests measuring integral nonlinearity (INL) and differential nonlinearity (DNL). In the dynamic testing of the converters, the transfer function nonlinearity is manifested as harmonic distortion, and intermodulation products. In general, regardless of the conversion technique being used, the effects of the transfer function nonlinearities get worse as converter resolution and speed increase. The result is that these nonlinearities can cause severe problems for the system designer who needs to accurately convert an input signal. A review is made of the performance of modern converters, and the existing methods of eliminating the nonlinearity, and of some of the schemes which have been proposed more recently. A new method is presented whereby code density testing techniques are exploited so that a sufficiently detailed characterisation of the converter can be made.
    [Show full text]
  • Active Non-Linear Filter Design Utilizing Parameter Plane Analysis Techniques
    1 ACTIVE NONLINEAR FILTER DESIGN UTILIZING PARAMETER PLANE ANALYSIS TECHNIQUES by Wi 1 ?am August Tate It a? .1 L4 ULUfl United States Naval Postgraduate School ACTIVE NONLINEAR FILTER DESIGN UTILIZING PARAMETER PLANE ANALYSIS TECHNIQUES by William August Tate December 1970 ThU document ka6 bzzn appnjovtd ^oh. pubtic. K.Z.- tza&c and 6 ate; Ltb dUtAtbuutlon -U untunitzd. 13781 ILU I I . Active Nonlinear Filter Design Utilizing Parameter Plane Analysis Techniques by William August Tate Lieutenant, United States Navy B.S., Leland Stanford Junior University, 1963 Submitted in partial fulfillment of the requirement for the degrees of ELECTRICAL ENGINEER and MASTER OF SCIENCE IN ELECTRICAL ENGINEERING from the NAVAL POSTGRADUATE SCHOOL December 1970 . SCHOSf NAVAL*POSTGSi&UATE MOMTEREY, CALIF. 93940 ABSTRACT A technique for designing nonlinear active networks having a specified frequency response is presented. The technique, which can be applied to monolithic/hybrid integrated-circuit devices, utilizes the parameter plane method to obtain a graphical solution for the frequency response specification in terms of a frequency-dependent resistance. A limitation to this design technique is that the nonlinear parameter must be a slowly-varying quantity - the quasi-frozen assumption. This limitation manifests itself in the inability of the nonlinear networks to filter a complex signal according to the frequency response specification. Within the constraints of this limitation, excellent correlation is attained between the specification and measurements of the frequency response of physical networks Nonlinear devices, with emphasis on voltage-controlled nonlinear FET resistances and nonlinear device frequency controllers, are realized physically and investigated. Three active linear filters, modified with the nonlinear parameter frequency are constructed, and comparisons made to the required specification.
    [Show full text]
  • Chapter 3 FILTERS
    Chapter 3 FILTERS Most images are a®ected to some extent by noise, that is unexplained variation in data: disturbances in image intensity which are either uninterpretable or not of interest. Image analysis is often simpli¯ed if this noise can be ¯ltered out. In an analogous way ¯lters are used in chemistry to free liquids from suspended impurities by passing them through a layer of sand or charcoal. Engineers working in signal processing have extended the meaning of the term ¯lter to include operations which accentuate features of interest in data. Employing this broader de¯nition, image ¯lters may be used to emphasise edges | that is, boundaries between objects or parts of objects in images. Filters provide an aid to visual interpretation of images, and can also be used as a precursor to further digital processing, such as segmentation (chapter 4). Most of the methods considered in chapter 2 operated on each pixel separately. Filters change a pixel's value taking into account the values of neighbouring pixels too. They may either be applied directly to recorded images, such as those in chapter 1, or after transformation of pixel values as discussed in chapter 2. To take a simple example, Figs 3.1(b){(d) show the results of applying three ¯lters to the cashmere ¯bres image, which has been redisplayed in Fig 3.1(a). ² Fig 3.1(b) is a display of the output from a 5 £ 5 moving average ¯lter. Each pixel has been replaced by the average of pixel values in a 5 £ 5 square, or window centred on that pixel.
    [Show full text]
  • The Unscented Kalman Filter for Nonlinear Estimation
    The Unscented Kalman Filter for Nonlinear Estimation Eric A. Wan and Rudolph van der Merwe Oregon Graduate Institute of Science & Technology 20000 NW Walker Rd, Beaverton, Oregon 97006 [email protected], [email protected] Abstract 1. Introduction The Extended Kalman Filter (EKF) has become a standard The EKF has been applied extensively to the field of non- technique used in a number of nonlinear estimation and ma- linear estimation. General application areas may be divided chine learning applications. These include estimating the into state-estimation and machine learning. We further di- state of a nonlinear dynamic system, estimating parame- vide machine learning into parameter estimation and dual ters for nonlinear system identification (e.g., learning the estimation. The framework for these areas are briefly re- weights of a neural network), and dual estimation (e.g., the viewed next. Expectation Maximization (EM) algorithm) where both states and parameters are estimated simultaneously. State-estimation This paper points out the flaws in using the EKF, and The basic framework for the EKF involves estimation of the introduces an improvement, the Unscented Kalman Filter state of a discrete-time nonlinear dynamic system, (UKF), proposed by Julier and Uhlman [5]. A central and (1) vital operation performed in the Kalman Filter is the prop- (2) agation of a Gaussian random variable (GRV) through the system dynamics. In the EKF, the state distribution is ap- where represent the unobserved state of the system and proximated by a GRV, which is then propagated analyti- is the only observed signal. The process noise drives cally through the first-order linearization of the nonlinear the dynamic system, and the observation noise is given by system.
    [Show full text]
  • Median Filter Performance Based on Different Window Sizes for Salt and Pepper Noise Removal in Gray and RGB Images
    International Journal of Signal Processing, Image Processing and Pattern Recognition Vol.8, No.10 (2015), pp.343-352 http://dx.doi.org/10.14257/ijsip.2015.8.10.34 Median Filter Performance Based on Different Window Sizes for Salt and Pepper Noise Removal in Gray and RGB Images Elmustafa S.Ali Ahmed1, Rasha E. A.Elatif2 and Zahra T.Alser3 Department of Electrical and Electronics Engineering, Red Sea University, Port Sudan, Sudan [email protected], [email protected], [email protected] Abstract Noised image is a major problem of digital image systems when images transferred between most electronic communications devices. Noise occurs in digital images due to transmit the image through the internet or maybe due to error generated by noisy sensors. Other types of errors are related to the communication system itself, since image needed to be transferred from analogue to digital and vice versa, also to be transmitted in most of the communication systems. Impulse noise added to image signal due to process of converting image signal or error from communication channel. The noise added to the original image by changes the intensity of some pixels while other remain unchanged. Salt-and-pepper noise is one of the impulse noises, to remove it a simplest way used by windowing the noisy image with a conventional median filter. Median filters are the most popular nonlinear filters extensively applied to eliminate salt- and-pepper noise. This paper evaluates the performance of median filter based on the effective median per window by using different window sizes and cascaded median filters. The performance of the proposed idea has been evaluated in MATLAB simulations on a gray and RGB images.
    [Show full text]
  • Using Nonlinear Kalman Filtering to Estimate Signals
    Using Nonlinear Kalman Filtering to Estimate Signals Dan Simon Revised September 10, 2013 It appears that no particular approximate [nonlinear] filter is consistently better than any other, though ... any nonlinear filter is better than a strictly linear one.1 The Kalman filter is a tool that can estimate the variables of a wide range of processes. In mathematical terms we would say that a Kalman filter estimates the states of a linear system. There are two reasons that you might want to know the states of a system: • First, you might need to estimate states in order to control the system. For example, the electrical engineer needs to estimate the winding currents of a motor in order to control its position. The aerospace engineer needs to estimate the velocity of a satellite in order to control its orbit. The biomedical engineer needs to estimate blood sugar levels in order to regulate insulin injection rates. • Second, you might need to estimate system states because they are interesting in their own right. For example, the electrical engineer needs to estimate power system parameters in order to predict failure probabilities. The aerospace engineer needs to estimate satellite position in order to intelligently schedule future satellite activities. The biomedical engineer needs to estimate blood protein levels in order to evaluate the health of a patient. The standard Kalman filter is an effective tool for estimation, but it is limited to linear systems. Most real-world systems are nonlinear, in which case Kalman filters do not directly apply. In the real world, nonlinear filters are used more often than linear filters, because in the real world, systems are nonlinear.
    [Show full text]
  • Nonlinear Matched Filter
    A MUTUAL INFORMATION EXTENSION TO THE MATCHED FILTER Deniz Erdogmus1, Rati Agrawal2, Jose C. Principe2 1 CSEE Department, Oregon Graduate Institute, OHSU, Portland, OR 97006, USA 2 CNEL, ECE Department, University of Florida, Gainesville, FL 32611, USA Abstract. Matched filters are the optimal linear filters demonstrated the superior performance of information for signal detection under linear channel and white noise theoretic measures over second order statistics in signal conditions. Their optimality is guaranteed in the additive processing [4]. In this general framework, we assume an white Gaussian noise channel due to the sufficiency of arbitrary instantaneous channel and additive noise with an second order statistics. In this paper, we introduce a nonlinear arbitrary distribution. Specifically in the case of linear filter for signal detection based on the Cauchy-Schwartz channels, we will consider Cauchy noise, which is a member quadratic mutual information (CS-QMI) criterion. This filter of the family of symmetric α-stable distributions, and is an is still implementing correlation but now in a high impulsive noise source. These types of noise distributions are dimensional transformed space defined by the kernel utilized known to plague signal detection [5, 6]. in estimating the CS-QMI. Simulations show that the Specifically, the proposed nonlinear signal detection nonlinear filter significantly outperforms the traditional filter is based on the Cauchy-Schwartz Quadratic Mutual matched filter in nonlinear channels, as expected. In the Information (CS-QMI) measure that has been proposed by linear channel case, the proposed filter outperforms the Principe et al. [7]. This definition of mutual information is matched filter when the received signal is corrupted by preferred because of the existence of a simple nonparametric impulsive noise such as Cauchy-distributed noise, but estimator for the CS-QMI, which in turn forms the basis of performs at the same level in Gaussian noise.
    [Show full text]
  • ECG Filtering
    12/1/13 ECG Filtering Willem Einthoven’s EKG machine, 1903 ECG Filtering n Three common noise sources q Baseline wander q Power line interference q Muscle noise n When filtering any biomedical signal care should be taken not to alter the desired information in any way n A major concern is how the QRS complex influences the output of the filter; to the filter they often pose a large unwanted impulse n Possible distortion caused by the filter should be carefully quantified 1 12/1/13 Baseline Wander Baseline Wander n Baseline wander, or extragenoeous low- frequency high-bandwidth components, can be caused by: q Perspiration (effects electrode impedance) q Respiration q Body movements n Can cause problems to analysis, especially when exmining the low-frequency ST-T segment n Two main approaches used are linear filtering and polynomial fitting 2 12/1/13 BW – Linear, time-invariant filtering n Basically make a highpass filter to cut of the lower- frequency components (the baseline wander) n The cut-off frequency should be selected so as to ECG signal information remains undistorted while as much as possible of the baseline wander is removed; hence the lowest-frequency component of the ECG should be saught. n This is generally thought to be definded by the slowest heart rate. The heart rate can drop to 40 bpm, implying the lowest frequency to be 0.67 Hz. Again as it is not percise, a sufficiently lower cutoff frequency of about 0.5 Hz should be used. n A filter with linear phase is desirable in order to avoid phase distortion that can alter various temporal realtionships in the cardiac cycle n Linear phase response can be obtained with finite impulse response, but the order needed will easily grow very high (approximately 2000) q Figure shows filters of 400 (dashdot) and 2000 (dashed) and a 5th order forward-backward filter (solid) n The complexity can be reduced by for example forward- backward IIR filtering.
    [Show full text]
  • Neural Networks for Signal Processing
    Downloaded from orbit.dtu.dk on: Oct 05, 2021 A neural architecture for nonlinear adaptive filtering of time series Hoffmann, Nils; Larsen, Jan Published in: Proceedings of the IEEE Workshop Neural Networks for Signal Processing Link to article, DOI: 10.1109/NNSP.1991.239488 Publication date: 1991 Document Version Publisher's PDF, also known as Version of record Link back to DTU Orbit Citation (APA): Hoffmann, N., & Larsen, J. (1991). A neural architecture for nonlinear adaptive filtering of time series. In Proceedings of the IEEE Workshop Neural Networks for Signal Processing (pp. 533-542). IEEE. https://doi.org/10.1109/NNSP.1991.239488 General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights. Users may download and print one copy of any publication from the public portal for the purpose of private study or research. You may not further distribute the material or use it for any profit-making activity or commercial gain You may freely distribute the URL identifying the publication in the public portal If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim. A NEURAL ARCHITECTURE FOR NONLINEAR ADAPTIVE FILTERING OF TIME SERIES Nils Hoffmann and Jan Larsen The Computational Neural Network Center Electronics Institute, Building 349 Technical University of Denmark DK-2800 Lyngby, Denmark INTRODUCTION The need for nonlinear adaptive filtering may arise in different types of fil- tering tasks such as prediction, system identification and inverse modeling [17].
    [Show full text]
  • Adaptive Method for 1-D Signal Processing Based on Nonlinear Filter Bank and Z-Parameter
    ADAPTIVE METHOD FOR 1-D SIGNAL PROCESSING BASED ON NONLINEAR FILTER BANK AND Z-PARAMETER V. V. Lukin 1, A. A. Zelensky 1, N. O. Tulyakova 1, V. P. Melnik 2 1 State Aerospace University (Kharkov Aviation Institute), Kharkov, Ukraine E-mail: [email protected] 2 Digital Media Institute, Tampere University of Technology, Tampere, Finland E-mail: [email protected] ABSTRACT nonlinear filter and vise versa [6]. One way out is to apply an adaptive nonlinear smoother. A basic idea put behind the locally adaptive aproach is to process the noisy signal fragment by An approach to synthesis of adaptive 1-D filters based on means of nonlinear filter suited in the best (optimal) manner for nonlinear filter bank and the use of Z-parameter is put forward. situation at hand. In other words, the local signal and noise The nonlinearity of elementary filters ensures the predetermined properties and the priority of requirements of fragmentary data robustness of adaptive procedure with respect to impulsive noise filtering should be taken into consideration. By minimizing the and outliers. In turn, a local adaptation principle enables to output local errors one obtains the minimization of the total error minimize the total output error being a sum of the residual in MSE or MAE sense. fluctuation component and dynamic errors. The use of Z- There already exist many locally-adaptive (data dependent) parameter as a local activity indicator permits to “recognize ” the nonlinear 1-D filters. As examples, let us mention ones proposed signal and noise properties for given fragment quite surely and, by R.
    [Show full text]
  • A New Nonlinear Filtering Technique for Source Localization Frederik Beutler, Uwe D
    A New Nonlinear Filtering Technique for Source Localization Frederik Beutler, Uwe D. Hanebeck Intelligent Sensor–Actuator–Systems Institute of Computer Design and Fault Tolerance Universitat¨ Karslruhe Kaiserstrasse 12, 76128 Karlsruhe, Germany [email protected], [email protected] Abstract A method for source direction estimation based on hemi- A new model-based approach for estimating the pa- sphere sampling is presented in [2], where a nonlinear trans- rameters of an arbitrary transformation between two formation is used to convert the correlation vectors of the discrete-time sequences will be introduced. One se- GCC for all TDoA measurements to a common coordinate quence is interpreted as part of a nonlinear measure- system. ment equation, the other sequence is typically measured In [3] a fast Bayesian acoustic localization method is pre- sequentially. Based on every measured value, the prob- sented, a unifying framework for localization can be found ability density function of the parameters is updated in [1]. These approaches are similar to the nonlinear filtering using a Bayesian approach. For the evolution of the technique presented here, but they use a window for source system over time, a system equation is included. The estimation from received samples. Furthermore, a motion new approach provides a high update rate for the de- model is not included. sired parameters up to the sampling rate with high ac- This paper introduces a new model-based approach for esti- curacy. It will be demonstrated for source localization mating the state, e.g. the position of a source, directly from of a speaker, where the parameters describe the position time sequences.
    [Show full text]
  • Kalman Filters for Nonlinear Systems and Heavy-Tailed Noise
    Linköping studies in science and technology. Licentiate Thesis. No. 1613 Kalman Filters for Nonlinear Systems and Heavy-Tailed Noise Michael Roth LERTEKN REG IK AU L TO RO MATIC CONT LINKÖPING Division of Automatic Control Department of Electrical Engineering Linköping University, SE-581 83 Linköping, Sweden http://www.control.isy.liu.se [email protected] Linköping 2013 This is a Swedish Licentiate’s Thesis. Swedish postgraduate education leads to a Doctor’s degree and/or a Licentiate’s degree. A Doctor’s Degree comprises 240 ECTS credits (4 years of full-time studies). A Licentiate’s degree comprises 120 ECTS credits, of which at least 60 ECTS credits constitute a Licentiate’s thesis. Linköping studies in science and technology. Licentiate Thesis. No. 1613 Kalman Filters for Nonlinear Systems and Heavy-Tailed Noise Michael Roth [email protected] www.control.isy.liu.se Department of Electrical Engineering Linköping University SE-581 83 Linköping Sweden ISBN 978-91-7519-535-3 ISSN 0280-7971 LIU-TEK-LIC-2013:47 Copyright © 2013 Michael Roth Printed by LiU-Tryck, Linköping, Sweden 2013 Dedicated to all vegetarians. Abstract This thesis is on filtering in state space models. First, we examine approximate Kalman filters for nonlinear systems, where the optimal Bayesian filtering recur- sions cannot be solved exactly. These algorithms rely on the computation of cer- tain expected values. Second, the problem of filtering in linear systems that are subject to heavy-tailed process and measurement noise is addressed. Expected values of nonlinearly transformed random vectors are an essential in- gredient in any Kalman filter for nonlinear systems, because of the required joint mean vector and joint covariance of the predicted state and measurement.
    [Show full text]