Perceptual model for assessment of coded audio

David J M Robinson

A thesis submitted for the degree of Doctor of Philosophy

Department of Electronic Systems Engineering

University of Essex

March 2002

Summary

In this thesis, a time-domain binaural auditory model is developed for the assessment of high quality coded audio signals.

Traditional objective measurements of sound quality yield misleading results when applied to psychoacoustic based codecs. Whilst human subjective opinion must be the ultimate arbiter of perceived sound quality, an objective measurement is sought that accurately predicts human perception.

The auditory model described in this thesis is designed to match the processing within the human auditory system as closely as possible. The outer and middle ear transforms are simulated by linear filters; the basilar membrane response is simulated by a bank of amplitude dependent gammachirp filters, and the compressive action of the inner hair cells is simulated by an adaptation circuit. The whole model is calibrated using known human performance data.

This monophonic model is used as the basis for a model of binaural hearing. Signals from the inner hair cell simulation are compared with those from the opposing ear, and an internal binaural image is formed. Changes in this image are shown to reflect changes in the binaural properties of the sound field. The binaural model predicts binaural masking and localisation data using a single detection criterion, which suggests that both phenomena may be due to the same internal process.

Finally, the model is used for audio quality assessment. The monophonic model correctly predicts human perception for some coded audio extracts, but not all. A specific problem in the hair cell simulation is identified, and solutions to this problem are suggested. The binaural model accurately predicts human perception of spatial qualities. In particular, the model can detect changes to the stereo soundstage within real music stimuli.

Table of Contents

1 INTRODUCTION

2 BACKGROUND
   2.1 Overview
   2.2 Audio coding
      2.2.1 Why reduce the data rate?
      2.2.2 Data reduction by quality reduction
      2.2.3 Lossless and lossy audio codecs
      2.2.4 General psychoacoustic coding principles
      2.2.5 MPEG audio codecs
   2.3 Audio quality measurements
      2.3.1 Frequency response
      2.3.2 Signal to Noise Ratio
      2.3.3 Total Harmonic Distortion (plus noise)
      2.3.4 Input Output difference analysis
   2.4 Subjective assessment
      2.4.1 Beyond subjective assessment
   2.5 Conclusion

3 THE HUMAN AUDITORY SYSTEM
   3.1 Overview
   3.2 A walk through the human auditory system
      3.2.1 The Pinna
      3.2.2 The ear canal
      3.2.3 Middle Ear
      3.2.4 The Cochlea
      3.2.5 Neural signal processing
   3.3 Conclusion
   3.4 Acknowledgements

4 AUDITORY PERCEPTUAL MODELS
   4.1 Overview
   4.2 Auditory modelling
   4.3 The Johnston model
      4.3.1 Algorithm
      4.3.2 Verification: Psychoacoustic test
      4.3.3 Assessing Audio Quality using the Johnston model
      4.3.4 Criticism of the Johnston model
   4.4 Other models
      4.4.1 Comparison of models
   4.5 Conclusion

5 MONOPHONIC AUDITORY MODEL
   5.1 Overview
   5.2 Introduction
   5.3 Structure of the model
      5.3.1 Signal Preparation and Implementation
      5.3.2 Pre-filtering
      5.3.3 Middle Ear filtering
      5.3.4 Basilar membrane filtering
      5.3.5 Hair cell transduction
   5.4 Perceiving a difference
      5.4.1 Possible strategies
      5.4.2 Chosen Strategy
   5.5 Implementation Speed
   5.6 Demonstration
      5.6.1 Threshold of audibility
      5.6.2 Spectral Masking
      5.6.3 Amplitude dependent filter response
   5.7 Conclusion

6 SPATIAL MASKING
   6.1 Overview
   6.2 Introduction
   6.3 Spatial Masking Experiment
      6.3.1 Aims and parameters of experiment
      6.3.2 Details of experiment
      6.3.3 Results
      6.3.4 Discussion
      6.3.5 Knowledge gained
   6.4 Conclusion
   6.5 Acknowledgements

7 BINAURAL AUDITORY MODEL
   7.1 Overview
   7.2 Introduction
      7.2.1 Webster’s Hypothesis
      7.2.2 Data
      7.2.3 Physiology
   7.3 Existing models
      7.3.1 Jeffress model
      7.3.2 Cross correlation and other models
      7.3.3 Colburn’s neural model
   7.4 Binaural model
      7.4.1 Pre-processor
      7.4.2 Element by Element Vector Multiplication
      7.4.3 Windowed summation
      7.4.4 Time domain windowing of binaural processor output
      7.4.5 Subtraction of the mean
      7.4.6 Oversampling
      7.4.7 Peak Determination and confidence value
      7.4.8 Difference detection
      7.4.9 Directional accumulators
   7.5 Conclusion

8 AUDIO QUALITY ASSESSMENT
   8.1 Overview
   8.2 Generating ear signals
      8.2.1 Anechoic HRTF crosstalk processing
      8.2.2 Measured speaker/room response
      8.2.3 Simulated room response
   8.3 Interpreting the difference surfaces
      8.3.1 Error Entropy
   8.4 Audio Quality Assessment
      8.4.1 High Quality Audio Test
      8.4.2 Medium Quality Audio Test
   8.5 Difference detection error
      8.5.1 Development of the new detector
      8.5.2 Possible solution
      8.5.3 Discussion
   8.6 Binaural quality assessment
      8.6.1 Inter channel time delay
      8.6.2 Inter channel intensity difference
      8.6.3 Stereo image width
   8.7 Conclusion

9 CONCLUSION
   9.1 Overview
   9.2 Critical assessment
      9.2.1 Monophonic model
      9.2.2 Binaural model
   9.3 Further work
   9.4 Original work

ACKNOWLEDGEMENTS

APPENDICES:

A PSYCHOACOUSTIC MODELS AND NON-LINEAR HUMAN HEARING
   A.1 Introduction
   A.2 The Cubic Distortion Tone
      A.2.1 Distortion-product otoacoustic emissions
      A.2.2 Loudness matching
      A.2.3 Cancellation Tone
   A.3 Modelling the CDT
   A.4 Psychoacoustic Codecs and the Cubic Distortion Tone
      A.4.1 Masked primary tone
      A.4.2 Masking due to CDT
   A.5 Conclusion

B MASKING NOISE

C MONOPHONIC MODEL CALIBRATION DATA

D SPATIAL MASKING REFERENCE DATA
   D.1 Experiment 1
   D.2 Experiment 2
   D.3 Experiment 3
   D.4 Experiment 4
   D.5 Experiment 5
   D.6 Conclusion

E LATERALISATION EXPERIMENTS

F BINAURAL CHANNELS

G CONFIDENCE VALUE

H HIGH QUALITY LISTENING TEST
   H.1 Overview
   H.2 Procedure
   H.3 Results
   H.4 Discussion

I MEDIUM QUALITY LISTENING TEST
   I.1 Overview
   I.2 Procedure
   I.3 Results
   I.4 Discussion

J ORIGINAL DIFFERENCE PERCEPTION UNIT
   J.1 Overview
   J.2 Perceiving a difference
      J.2.1 Calculating the perceived difference by simple subtraction
      J.2.2 Calculating the perceived difference by integration
      J.2.3 Adjusting the perceived difference according to the variance of the signal
   J.3 Validation of the model
      J.3.1 Psychoacoustic tests
      J.3.2 Codec assessment
   J.4 Conclusion

K REPLAY GAIN - A PROPOSED STANDARD
   K.1 Introduction
      K.1.1 Perceived Loudness
      K.1.2 Definitions
      K.1.3 Basic Concept
   K.2 “Radio” and “Audiophile” gain adjustments
      K.2.1 “Radio” Replay Gain Adjustment
      K.2.2 “Audiophile” Replay Gain Adjustment
   K.3 Replay Gain Adjustment Calculation
      K.3.1 Equal Loudness Filter
      K.3.2 RMS Energy Calculation
      K.3.3 Statistical Processing
      K.3.4 Calibration and Reference Level
      K.3.5 Overall Implementation
   K.4 Replay Gain Data Format
      K.4.1 Bit format
      K.4.2 Default Value
      K.4.3 Illegal Values
   K.5 Peak Amplitude Data Format
      K.5.1 Data Format
      K.5.2 Uncompressed Files
      K.5.3 Compressed files
      K.5.4 Implementation
   K.6 Replay Gain
      K.6.1 “” file format
      K.6.2 “” file format
   K.7 Player Requirements
      K.7.1 Scale audio by Replay Gain Adjustment value
      K.7.2 Player Pre-amp
      K.7.3 Clipping Prevention
      K.7.4 Hardware Solution
   K.8 Typical Results
      K.8.1 Discussion
   K.9 Further work
   K.10 Conclusion
   K.11 Acknowledgements

REFERENCES

Abbreviations

Abbreviation Description

AAC      Advanced Audio Coding
ABX      Blind testing methodology
AC-3     Audio Coding 3 – perceptual codec from Dolby Laboratories
ADC      Analogue to Digital Converter
ADPCM    Adaptive Differential Pulse Code Modulation
AM       Arithmetic Mean
B+K      Brüel and Kjær
BBC      British Broadcasting Corporation
BM       Basilar Membrane
BMLD     Binaural Masking Level Difference
BT       British Telecommunications
CB       Critical Band
CD       Compact Disc
CD-ROM   Compact Disc Read Only Memory
CDT      Cubic Distortion Tone
CPD      Calculated Perceived Difference
DAC      Digital to Analogue Converter
dB       Decibels
DPOAE    Distortion Product Otoacoustic Emission
DVD      Digital Versatile Disc
EBU      European Broadcasting Union
EEVM     Element by Element Vector Multiplication
ERB      Equivalent Rectangular Bandwidth
FFT      Fast Fourier Transform
FIR      Finite Impulse Response
FM       Frequency Modulation
fs       Sampling Frequency
GM       Geometric Mean
GSM      Groupe Spécial Mobile or Global System for Mobile Communications
HAS      Human Auditory System
HRTF     Head Related Transfer Function
ID3      Identification Tag for mp3 audio files
IID      Interaural Intensity Difference
IIR      Infinite Impulse Response
ILD      Interaural Level Difference
ina      Inaudible level
ISO      International Organisation for Standardisation
ITD      Interaural Time Difference (or Delay)
ITU      International Telecommunication Union
kB       Kilobytes
kbps     Kilobits per second
KEMAR    Knowles Electronic Manikin for Acoustic Research
LAME     LAME Ain’t an MP3 Encoder
LCDT     Level of the Cubic Distortion Tone
LFD      British audio equipment manufacturer
LPAC     Lossless Predictive Audio Compression
LSO      Lateral Superior Olive
MAF      Minimum Audible Field
MATLAB   Matrix Laboratory – mathematical programming environment
MB       Megabytes
MDCT     Modified Discrete Cosine Transform
MHz      Megahertz
MLS      Maximum Length Sequence
mp2      MPEG layer II audio coding scheme
mp3      MPEG layer III audio coding scheme
MPC      Musepack – independent audio codec, formerly known as MPEGplus
MPEG     Moving Pictures Expert Group
ms       maximum SPL
MSO      Medial Superior Olive
MUSHRA   Multi-Stimulus test with Hidden Reference and Anchor
ODG      Objective Difference Grade
PCM      Pulse Code Modulation
PEAQ     Perceptual Evaluation of Audio Quality
RGAD     Replay Gain Adjustment
RMS      Root Mean Square
SDG      Subjective Difference Grade
SFM      Spectral Flatness Measure
SL       Spectrum Level
SMPTE    Society of Motion Picture and Television Engineers
SMR      Signal to Mask Ratio
SNR      Signal to Noise Ratio
SOC      Superior Olivary Complex
SPL      Sound Pressure Level
THD      Total Harmonic Distortion
THD+N    Total Harmonic Distortion plus Noise
TV       Television
UK       United Kingdom

1

Introduction

For over a century, scientists and engineers have refined the art of sound reproduction. Throughout this period, the goal has been to increase the fidelity of the reproduced sound to that of the original event. To this end, each component within the audio reproduction chain has been extensively perfected. Subjectively, the ideal audio device should have no audible character of its own, and the audio signal should pass through the device without the addition of colouration, distortion, or noise.

The success or otherwise in approaching this ideal can be judged by listening to the resulting sound. However, a rigorous engineering approach requires that the performance of any audio device must be quantifiable in objective terms. A variety of objective measurements have been developed, and these quantify the deviation of the device from the stated ideal. Many state-of-the-art audio devices are so close to perfection that most measurable performance deviations are below the threshold of audibility, whilst any perceived audible character of the device is due to some effect which is not yet quantifiable. However, where measured performance deviates significantly from the ideal, such measurable anomalies are audible, and the quantitative measurement correlates well with human perception.

This correlation between objective and subjective performance is broken by audio codecs. An audio codec aims to reduce the amount of data that is required to store or transmit an audio signal. High quality psychoacoustic-based audio codecs achieve this goal by discarding information that is inaudible to a human listener. This contradicts the historical notion of an ideal audio device, which should not alter the audio signal in any manner. Instead, the ideal audio codec should change an audio signal as much as possible (by discarding data) without changing the perception of the audio signal by a human listener.

It is inappropriate to apply existing objective performance measures to audio codecs, as these measurements merely confirm that the codec has significantly altered the audio signal. Rather than measuring all the changes introduced by the device, it would be more appropriate to quantify only the audible changes, whilst ignoring any changes that are inaudible to a human listener. This may be termed an objective perceptual measurement, and several such measurements have recently been defined.

The present work focuses on some of the areas of human auditory perception that are neglected within current perceptual measurement algorithms. These areas are temporal masking, spatial masking, and binaural hearing. By necessity, auditory phenomena that are adequately defined in existing models are also discussed. Rather than duplicating previous work, a new approach is tested in this area, whereby the mechanisms within the human auditory system are simulated directly. This allows the appropriateness of this approach for the assessment of coded audio to be investigated.

In Chapter 2, the principles of psychoacoustic-based audio codecs are introduced, and the MPEG-1 series of codecs is described. It is shown that traditional objective audio quality measurements are inappropriate for the quality assessment of coded audio. It is also demonstrated that certain other techniques that are often mistakenly used to predict perceived audio quality will yield misleading results. The definitive arbiter of perceived audio quality must be a human listener, but subjective opinions are notoriously unreliable. Stringent subjective test procedures are described that yield reliable, repeatable qualitative judgements. The expense of implementing these test procedures demonstrates the need for an objective measure that can match the perceptions of a human listener.

In Chapter 3, the mechanisms within the human auditory system are described in detail. The path of an audio signal is followed through the human ear, from airborne pressure wave to internal neural signal. Particular attention is paid to the processes within the human auditory system that give rise to the measurable performance limits of human hearing.


In Chapter 4, existing models of human hearing are described and compared. It is shown that such models can simulate human hearing in both psychoacoustic tests, where the limits of human hearing are probed, and in the assessment of coded audio signals. The important features of those models that give accurate performance are identified for use in the present work. The areas in which existing models do not match human perception are demonstrated, and possible solutions are proposed.

In Chapter 5, a time-domain monophonic auditory model is developed. This model is based upon the processing present within the human auditory system. Some of the features are drawn from existing psychoacoustic models, whilst others are unique to the present model. The model is calibrated to match human performance in a wide range of psychoacoustic tests.

In Chapter 6, a psychoacoustic experiment to investigate spatial masking is described. This experiment investigates the audibility of one sound in the presence of another, where the two sounds are spatially separate. Existing psychoacoustic data describes this phenomenon for headphone presented stimuli, and the present experiment extends this knowledge to free-field stimuli, delivered over loudspeakers. This data is required to calibrate a binaural model capable of auditioning free-field sounds.

In Chapter 7, a time-domain binaural auditory model is developed. Existing binaural models are reviewed, and the most successful features of these models are incorporated into the present model. Novel features are included to facilitate the processing and comparison of an arbitrary time-domain input stimulus. The two binaural processing tasks (localisation and detection) are successfully accounted for by a single detection criterion. This model is calibrated to match human perception using the data from the spatial masking experiment.

In Chapter 8, the model is used to assess the quality of coded audio signals. The monophonic model is shown to operate correctly with transient signals, where many other models fail. However, problems are revealed during passages of real music, where the sensitivity of the model is too low. This is traced to an error within the difference detector. A previous version of the difference detector is shown to predict human performance of real music to a reasonable degree, but is not able to handle transient signals. A solution to this problem is proposed, which involves incorporating knowledge gained during the development of the binaural model into the monophonic model. Finally, the binaural model is used to detect changes in the spatial characteristics of a stereo audio signal. The correlation between the prediction of the binaural model and the perception of a human listener is excellent.

In Chapter 9, the present work is reviewed. The success of the monophonic and binaural models is critically assessed, and possible future additions to the combined model are discussed.

2

Background

2.1 Overview

In this chapter, psychoacoustic-based audio codecs are introduced. The motivation for the development of these devices is discussed, and several codecs are examined in detail. It is shown that conventional objective measurements of audio quality are inappropriate for the assessment of coded audio. A commonly employed alternative is the subjective test, the procedure of which is described herein. It is shown that these tests are expensive and time-consuming, and an accurate objective alternative is sought.

2.2 Audio coding

An audio codec is a device that reduces the amount of data required to represent an audio signal. In this section, the uses and operations of audio codecs are discussed.

2.2.1 Why reduce the data rate?

The compact disc is now so much a part of everyday life that its technological properties are taken for granted. Indeed, the 750 MB of audio data contained upon a typical CD seems small compared to the capacity of current storage devices. Moore’s law [Moore, 1965] predicts that computational processing power will double every 18 months. Data storage capacity is increasing at a similar rate. The capacity of the humble CD will seem minuscule compared to next year’s hard disk drives and future optical disc formats.

As the storage capacity of a CD is dwarfed, it is easy to forget that the data requirements of CD quality audio are immense compared to textual media. For example, 30 seconds of CD quality digital audio requires the same storage space as the complete works of Shakespeare¹. Though the cost of digital storage falls year on year, the data rate of CD quality audio is still too high for certain applications. Two pertinent examples are discussed below.

Firstly, audio broadcasters wish to transmit CD quality radio services. However, the radio spectrum is very crowded, and the proliferation of devices such as mobile phones has made radio bandwidth an expensive commodity. If CD quality audio were transmitted on existing analogue FM frequencies, then the frequency range from 88 MHz to 108 MHz would accommodate just 12 radio stations. However, analogue transmissions must continue during the transition to digital broadcasting, so additional bandwidth has been allocated for the digital services². The bandwidth allocated for the five BBC national radio stations is 1.54 MHz. After channel coding, this yields a broadcast data rate of 1.2 Mbps. The data rate of CD is 1.4 Mbps. Thus, a single CD-quality audio service requires more bandwidth than is available for five radio stations.

Secondly, data networks, especially home internet connections, have failed to increase in capacity in accordance with Moore’s law. The most common internet connection at home in the UK is currently the 56k modem. Data transfer rates of approximately 3-4 kB per second (32000 bits per second) are typical. Thus, for every one second of download time, the user can transfer 0.0227 seconds of CD quality audio. Real-time delivery of audio in this manner is impossible. Distributing albums of music over the internet for off-line listening is similarly impractical, since a 3 minute pop song requires over two hours of download time.
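These figures follow directly from the bitrates involved; a minimal back-of-envelope check in Python (the 32000 bps figure is the sustained modem throughput quoted above):

    # Back-of-envelope check of the download-time figures quoted above.
    cd_bitrate = 44100 * 16 * 2           # CD audio: 44.1 kHz, 16 bit, stereo = 1,411,200 bps
    modem_rate = 32000                    # sustained 56k-modem throughput, bps

    # Seconds of CD audio transferred per second of download time
    print(modem_rate / cd_bitrate)        # ~0.0227

    # Download time for a 3-minute pop song
    song_bits = cd_bitrate * 180
    print(song_bits / modem_rate / 3600)  # ~2.2 hours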

The data rate of CD quality digital audio is too high for both these applications. The data rate must be reduced in order to make either application practical. In addition, there are other applications where the data rate of CD quality audio is not prohibitive, but reducing this data rate would provide economic or functional benefits. For these reasons, it is desirable to reduce the data rate of the audio signal, without compromising the audio quality. However, without sophisticated audio codecs, the data rate and audio quality are inextricably linked.

¹ The complete works of Shakespeare in ASCII plain text format [Farrow, WEB] occupy 5219 kB, or 44153344 bits. 30 seconds of CD quality audio occupy 44100 × 16 × 2 × 30 = 42336000 bits. Thus this edition of the complete works of Shakespeare requires the same binary storage as 31.3 seconds of CD quality digital audio.

² In the United Kingdom, 12.5 MHz of Band III spectrum from 217.5 - 230 MHz has been allocated to digital audio broadcasting. This will accommodate seven data channels. The BBC has been allocated one of these channels for its national services [Bower, 1998].

2.2.2 Data reduction by quality reduction

The simplest method of reducing bitrate³ is to reduce the audio quality. Three bitrate reduction strategies are listed below, together with the quality implications for each strategy.

1. Reduce the sampling rate. This will reduce the frequency range (bandwidth) of the audio signal.
2. Reduce the bit-depth. This will increase the noise floor of the audio signal.
3. Convert a stereo (2-channel) signal to a mono (1-channel) signal. This will remove all spatial information from the audio signal.

Table 2.1 lists some common audio formats. These illustrate various combinations of the above strategies.

name     samples/second   bits/sample   channels   PCM frequency range   SNR / dB   PCM bitrate / kbps
DVD      96000            24            6          48 kHz                144        13824
DAT      48000            16            2          24 kHz                96         1536
CD       44100            16            2          22 kHz                96         1411
“FM”     32000            12            2          16 kHz                72         768
“FM”     32000            12            1          16 kHz                72         384
“PC”     22050            8             1          11 kHz                48         176
Phone    8000             8             1          3.4 kHz               48         64

Table 2.1: Linear PCM Bitrates
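Every bitrate in Table 2.1 is simply the product of sampling rate, bit depth, and channel count; a minimal sketch in Python:

    def pcm_bitrate_kbps(sample_rate, bits_per_sample, channels):
        """Linear PCM bitrate in kbps (k = 1000, per the SI usage in this text)."""
        return sample_rate * bits_per_sample * channels / 1000

    print(pcm_bitrate_kbps(96000, 24, 6))   # DVD   -> 13824.0
    print(pcm_bitrate_kbps(44100, 16, 2))   # CD    -> 1411.2
    print(pcm_bitrate_kbps(8000, 8, 1))     # Phone -> 64.0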

The lowest bitrate in Table 2.1 is still too high to transmit in real time over a 56k modem. The stereo “FM” parameters define a digital channel with comparable quality to existing analogue FM broadcasts. This quality is acceptable to most consumers, but quality reductions below this level are perceived and disliked by many listeners.

³ Throughout this discussion, the data rate of an audio signal will be referred to as the “bitrate”. The bitrate is specified in bits per second (bps), kilobits per second (kbps), or Megabits per second (Mbps). The “k” and “M” prefixes are used to represent 10³ and 10⁶ respectively (SI units) rather than 2¹⁰ and 2²⁰ (commonly used in PC specifications - see [IEC 60027-2, 2000] for clarification of this issue).

To reduce the bitrate further, a more sophisticated approach is required.

2.2.3 Lossless and lossy audio codecs

There are two distinct types of audio codec: lossless and lossy. A lossless codec will return an exact copy of the original digital audio signal following the encode and decode process. A similar approach is often used within the computer world to reduce the size of documents or program files, without changing the data. Algorithms suitable for data include “Zip” [PKWARE, WEB] and “Sit” [Aladdin Systems, WEB]. Algorithms suitable for audio include “LPAC” [Liebchen, WEB], “Meridian Lossless Packing” (MLP) [Gerzon et al, 1999], and “Monkey’s Audio” [Ashland, WEB]. Both types of algorithm exploit redundancies within the data. For example, the waveforms of musical signals are often repetitive in nature. Storing the difference between each cycle of the waveform, rather than the waveform itself, often requires fewer bits. In a lossless codec, the difference between the predicted values and the actual waveform is also stored, so that the waveform can be reconstructed exactly.
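The prediction idea can be illustrated with a toy first-order predictor; this is a deliberately simplified sketch, not the actual algorithm used by LPAC or MLP. The residuals are typically much smaller than the samples themselves, so an entropy coder can represent them in fewer bits, while a cumulative sum restores the waveform exactly:

    import numpy as np

    def encode(samples):
        """Store the first sample plus successive differences (a first-order
        predictor). A simplified stand-in for the prediction stage of a
        lossless codec."""
        return samples[0], np.diff(samples)

    def decode(first, residual):
        """Exact reconstruction: the cumulative sum undoes the differencing."""
        return np.concatenate(([first], first + np.cumsum(residual)))

    x = np.array([100, 102, 105, 107, 106], dtype=np.int64)
    first, residual = encode(x)             # residual = [2, 3, 2, -1]: small values
    assert np.array_equal(decode(first, residual), x)   # lossless round trip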

A lossless audio codec by definition cannot reduce the audio quality. However, lossless audio codecs rarely reduce the bitrate to below 50% of the original value. Also, the exact bitrate reduction is highly signal dependent, so the bitrate of the audio data cannot be guaranteed to match that of the transmission channel. A burst of white noise (which is random and hence difficult to predict or compress) may cause the encoded bitrate to match or exceed that of the original signal.

To reduce the bitrate still further, lossy audio codecs discard audio data. This means that the decoded waveform is not an exact copy of the original. However, unlike the measures described in 2.2.2, lossy audio codecs aim to discard data in a manner that is inaudible, or at least not objectionable to a human listener. This is possible due to the complex nature of human hearing. This topic will be discussed in depth in Chapter 3, but here it is sufficient to note that the presence of one sound can prevent a human listener from hearing a second (quieter) sound. This phenomenon is illustrated in Figure 2.1 [Rimell, 1996].


Figure 2.1: Spectral masking

The Minimum Audible Field (MAF) curve represents the threshold of audibility at a given frequency. Thus, the 40 Hz tone (shown by the dashed arrow on the left of Figure 2.1) is inaudible, because it lies below the MAF curve.

The presence of an audible tone raises the threshold in the spectral region around the tone, and any additional sound falling below the modified MAF curve will be inaudible. For example, the dashed arrow in the centre of Figure 2.1 represents a tone of 600 Hz at 20 dB SPL. This tone would be audible in isolation, but is rendered inaudible (or masked) by the 400 Hz tone at 60 dB. The modified MAF curve is often referred to as the masked threshold.

This concept of masking is used in audio coding. A masked sound can be removed or distorted by the audio codec without changing the perceived quality of the audio signal. Lossy codecs which operate in this manner are often referred to as psychoacoustic based codecs, since they require knowledge of the properties of the human auditory system.

By combining this approach with lossless data reduction, the bitrate may be reduced by 90% without significantly reducing the perceived audio quality. The result is that a 128 kbps data stream, which provides little better than telephone quality without data reduction, can yield near CD quality with data reduction.

Psychoacoustic based codecs are the most recent generation of lossy audio codecs. Two other types or families of lossy audio codec exist, and these are mentioned in passing. The first type aims to discard data without significantly reducing the perceived quality of the audio signal, but does so without sophisticated knowledge of the human auditory system. The oldest such codecs are the A-law and µ-law coding schemes, where non-linear quantisation steps are used to increase the perceived signal to noise ratio of an 8-bit quantiser.

Another lossy coding mechanism is Adaptive Differential Pulse Code Modulation. In ADPCM, each sample is predicted from the previous samples, and only the difference between the prediction and the actual value is stored. The decoder follows the same predictive rules as the encoder, and adds the stored difference to each predicted sample value. Typically, the input samples are of 8 or 16 bit resolution, and the encoded differences are stored in four bit resolution, giving 50% or 75% data reduction. This codec is lossless, except where the difference between the predicted and actual values cannot be represented in four bits. In practice, this situation is common, but the error is sometimes inaudible, and rarely annoying.
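A toy ADPCM coder, as a minimal sketch (a fixed step size and a trivial previous-sample predictor are assumed for clarity; real ADPCM adapts the step size to the signal):

    import numpy as np

    def adpcm_encode(samples, step=16):
        """Toy ADPCM encoder: predict each sample as the previous decoded
        value and quantise the prediction error to a 4-bit signed code."""
        codes, predicted = [], 0
        for s in samples:
            error = int(s) - predicted
            code = int(np.clip(round(error / step), -8, 7))  # 16 levels = 4 bits
            codes.append(code)
            predicted += code * step      # encoder tracks the decoder's state
        return codes

    def adpcm_decode(codes, step=16):
        out, predicted = [], 0
        for code in codes:
            predicted += code * step
            out.append(predicted)
        return out

    original = [0, 30, 70, 100, 90, 60]
    decoded = adpcm_decode(adpcm_encode(original))
    # decoded = [0, 32, 64, 96, 96, 64]: close to, but not exactly, the input

The round trip is exact only when each prediction error is a representable multiple of the step size; otherwise a small error is introduced, which is the lossy case described above.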

Both the above lossy codecs are designed for use with telephone quality speech signals, though they can be used with some success to code CD quality music signals. There is a further type of lossy codec which is designed for speech coding only. Code excited linear prediction employs a code book of excitation signals followed by a linear predictive filter. The output of the code book and filter is compared with the incoming speech signal, and the code book index which gives the best match is transmitted. Typically, a single 10-bit index into the code book can represent 40 incoming samples. This mechanism of lossy coding is used on digital mobile telephone networks, and the code book is designed to represent speech-like sounds. This approach is not suitable for high quality music coding, as anyone who has heard music via a GSM mobile phone can testify. These speech only lossy codecs are not relevant to the present work, and will not be discussed further.

Psychoacoustic based lossy codecs are most relevant to the present work. The general principle of operation, and the details of the popular MPEG-1 family of codecs will now be discussed.


Figure 2.2: General structure of a psychoacoustic codec

2.2.4 General psychoacoustic coding principles

A generalised psychoacoustic codec may operate as shown in Figure 2.2. In the first stage of the encoder, the incoming signal is split into several frequency bands by a bank of bandpass filters. A psychoacoustic model calculates the masked threshold for each frequency band, and this is converted into a Signal to Mask Ratio (SMR) for each band. Spectral components that lie above the masked threshold are judged to be audible, and yield a positive Signal to Mask Ratio. Spectral components that lie below the masked threshold are judged to be inaudible, and yield a negative Signal to Mask Ratio.

The Signal to Mask Ratio directs a bit allocation algorithm. The number of bits allocated to each frequency band determines the accuracy of the quantiser, which in turn determines the amount of noise that will be added within each band. The intention is to add noise within masked spectral regions of the audio signal, but not to change or distort audible spectral components.

The amplitude of the signal in each band is normalised to unity before quantisation, and the scale factor required to revert the signal to its original level is stored, along with the output of the quantiser. The scale factor and/or quantiser output for a given band may be omitted if the signal within the frequency band lies well below the masked threshold. The resulting bitrate is much less than that of the original audio signal.
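The interaction between the Signal to Mask Ratio and the bit allocator can be sketched as a greedy loop; this is a minimal illustration under the common rule of thumb that each quantiser bit lowers the noise floor by about 6 dB, not the allocation algorithm of any particular standard:

    import numpy as np

    def allocate_bits(signal_db, mask_db, bit_pool, max_bits=15):
        """Greedy bit allocation driven by Signal-to-Mask Ratio: bits go to
        the band whose quantisation noise currently exceeds its masked
        threshold by the largest margin."""
        smr = signal_db - mask_db                 # positive SMR -> audible component
        nmr = smr.copy()                          # noise-to-mask ratio with 0 bits
        bits = np.zeros_like(smr, dtype=int)
        for _ in range(bit_pool):
            worst = int(np.argmax(nmr))           # band where noise is most audible
            if nmr[worst] <= 0:                   # all coding noise already masked
                break
            if bits[worst] < max_bits:
                bits[worst] += 1
                nmr[worst] -= 6.0                 # one bit ~ 6 dB less noise
        return bits

    print(allocate_bits(np.array([60., 40., 20.]),
                        np.array([30., 35., 45.]), bit_pool=12))  # -> [5 1 0]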

The decoder reverses this process by generating the signal in each band from the quantised values, multiplying each signal by the appropriate scale factor, and bandpass filtering the contents of each band. Finally, outputs of all the frequency bands are summed to yield the final decoded audio signal. Hopefully, the decoded signal will sound almost identical to the original signal.

The accuracy of the psychoacoustic model will affect the perceived sound quality of the coded audio. If the model incorrectly predicts that a spectral component is inaudible, when in reality it is above the masked threshold, then a human listener will perceive the noise added by the codec within this frequency region. However, even if the psychoacoustic model perfectly predicts human perception, the resulting coded audio signal will still contain audible noise if the bitrate is too low. In a constant bitrate compressed audio signal, only a certain number of bits are available per second. If the psychoacoustic model calculates a high Signal to Mask Ratio for many frequency bands, this may instruct the bit allocation model to use more bits than are available. In this case, the bit allocation model must choose the best compromise to minimise the audible coding noise, whilst remaining within the allocated bitrate. Variable bitrate coding overcomes this problem, by allocating the correct number of bits to ensure that the quantisation noise within each frequency band is below the masked threshold. This will reduce the bitrate during quiet or easy to encode passages, whilst increasing the bitrate during loud or complex passages. Variable bitrate encoding is only available within some audio codecs.

There are two sub-types of psychoacoustic codec: subband codecs and transform codecs. Subband codecs store the waveform present in each frequency band in a sub-sampled, quantised form. Transform codecs perform a time to frequency transformation (e.g. the Fast Fourier Transform) upon the original audio signal, or the signal within each frequency band. The resulting transform coefficients are stored, after quantisation, according to the SMR prediction of the psychoacoustic model. Transform codecs typically offer greater bitrate reduction than subband codecs. This is partly due to the higher frequency resolution offered by the transform, which allows the coding noise to be distributed more accurately according to the masked threshold. The major disadvantage of transform coding is that all current time to frequency transformations process the audio in discrete time domain blocks, and this blocking can cause audible problems. These problems will be discussed in Section 2.2.5.3, with respect to the MPEG-1 layer III codec.

2.2.5 MPEG audio codecs

These general principles of audio coding are seen at work in the MPEG-1 family of audio codecs. The MPEG-1 standard consists of three “layers” of coding, where each layer offers an increase in complexity, delay, and subjective performance with respect to the previous layer. The higher layers build on the technology of the lower layers, and a layer n decoder is required to decode all lower layers. The MPEG-1 standard [ISO/IEC 11172-3, 1993] supports sampling rates of 32 kHz, 44.1 kHz and 48 kHz, and bitrates between 32 kbps (mono) and 448 kbps (Layer I stereo). The MPEG-2 standard [ISO/IEC 13818-3, 1998] contains a backwards compatible multi-channel codec, and extends the range of allowed bitrates and sampling rates⁴. A proprietary extension called MPEG-2.5 [Dietz et al, 1997] is in common use for layer III. The sampling rates and bitrates are summarised in the following table.

codec      sampling rates / kHz   layer       allowed bitrates / kbps
MPEG-1     32, 44.1, 48           layer I     32, 64, 96, 128, 160, 192, 224, 256, 288, 320, 352, 384, 416, 448
                                  layer II    32, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320, 384
                                  layer III   32, 40, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320
MPEG-2     16, 22.05, 24          layer I     32, 48, 56, 64, 80, 96, 112, 128, 144, 160, 176, 192, 224, 256
                                  layer II    8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160
                                  layer III   8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160
MPEG-2.5   8, 11.025, 12          layer III   8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160

Table 2.2: Allowed bitrates in the MPEG audio coding standards

⁴ The MPEG-2 standard also defines a non-backwards compatible codec known as MPEG-2 AAC (Advanced Audio Coding). This section of the standard was finalised some years after layers I, II, and III. It includes several refinements that improve coding efficiency (most notably temporal noise shaping), but the general coding principles are very similar to MPEG-1 layer III. Further details can be found in the standards document and an excellent description appears in [Bosi et al, 1997].


A review of the MPEG standards for audio coding is found in [Brandenburg and Bosi, 1997], and a clear description of layer III and AAC coding is contained in [Brandenburg, 1999]. Parts of the following explanation are drawn from [Hollier, 1996].

2.2.5.1 MPEG-1 layer I audio coding

Figure 2.3: Structure of MPEG-1 audio encoder and decoder, Layers I and II

The structure of the MPEG-1 layers I and II encoder is shown in Figure 2.3.

The operation of the layer I encoder is as follows. All references to time and frequency assume 48 kHz sampling.

1. The analysis filterbank splits the incoming audio signal into 32 spectral bands. The filters are linearly spaced, each having a bandwidth of 750 Hz.
2. The samples in each band are critically decimated, and split into blocks of 12 decimated samples. Scalefactors are calculated which normalise the amplitude of the maximum sample in each band to unity.
3. In a parallel process, the signal is windowed, and a 512-point FFT is performed, to calculate the spectrum of the current audio block.
4. The psychoacoustic model calculates the masked threshold from the spectrum of the current block. This is transformed into a Signal to Mask Ratio for each band.
5. The dynamic bit and scalefactor allocator selects one of 15 possible quantisers for each band, based upon the available bitrate, the scalefactor, and the masking information. The aim is to meet the bitrate requirements whilst masking the coding noise as much as possible.
6. The scaler and quantiser acts as instructed by the allocator, to scale and quantise each block of 12 samples (a sketch of steps 2 and 6 appears after this list).
7. Finally, the quantised samples, scalefactors, and control information are multiplexed together for transmission or storage.
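Steps 2 and 6 amount to block floating point coding of each 12-sample block. A minimal sketch follows (a uniform quantiser is assumed for clarity; the standard defines 15 specific quantisers):

    import numpy as np

    def scale_and_quantise(block, bits):
        """Normalise a block of 12 subband samples by its peak (the
        scalefactor) and quantise to the allocated bit depth."""
        scalefactor = float(np.max(np.abs(block))) or 1.0
        normalised = block / scalefactor                  # peak amplitude -> unity
        levels = 2 ** bits - 1
        codes = np.round((normalised + 1) / 2 * levels).astype(int)
        return scalefactor, codes

    def dequantise(scalefactor, codes, bits):
        levels = 2 ** bits - 1
        return (codes / levels * 2 - 1) * scalefactor

    block = np.sin(np.linspace(0, np.pi, 12)) * 0.3
    sf, codes = scale_and_quantise(block, bits=4)
    recovered = dequantise(sf, codes, bits=4)   # = block plus quantisation noise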

The decoder unpacks this information, scales and interpolates the quantised samples as instructed via the control information, and passes the 32 bands through a synthesis filter to generate PCM audio samples. The decoder does not require a psychoacoustic model, so decoder complexity is reduced compared to the encoder. This is useful for broadcast applications, where a single (expensive) encoder must transmit to thousands of (inexpensive) decoders.

The decoder is specified exactly by the MPEG standard, but the encoder can use any coding strategy that yields a valid bitstream. For example, the psychoacoustic model may be arbitrarily complex (or non-existent if encoding speed is the only concern). In theory, this allows future developments in psychoacoustic knowledge to be incorporated into the encoder, without breaking compatibility with existing decoders. In practice, the fixed choice of filterbank parameters limits the fine-tuning that may be carried out.

2.2.5.2 MPEG-1 layer II

The layer II codec operates in a similar manner to layer I, but achieves higher audio quality at a given bitrate via the following modifications.

1. The 512-point FFT is replaced by a 1024-point FFT. This increases the frequency resolution of the masking calculation, at the expense of increasing the encoder delay.
2. The similarity between scalefactors in adjacent blocks is exploited, thus reducing the amount of control information that must be transmitted.
3. More accurate (smaller stepped) quantisers are made available.

MPEG-1 layer II coding is used by Digital Audio Broadcasting within the UK and much of the world (apart from America). It achieves near CD-quality at around 256 kbps stereo.


2.2.5.3 MPEG-1 layer III

Figure 2.4: Structure of MPEG-1 layer III audio encoder and decoder

The layer III codec is significantly more complex than the lower layers. It uses both subband and transform coding, and is the only layer with mandatory support for variable bitrate coding. The layer III encoder is shown in Figure 2.4.

Each of the 32 frequency bands is sub-divided by a 6-point or 18-point Modified Discrete Cosine Transform. This gives a possible frequency resolution of up to 42 Hz, compared to 750 Hz for layers I and II. The layer III codec switches between the two possible MDCT lengths (often referred to as short and long blocks) depending on the input signal. This strategy is useful because, after quantisation of the coefficients, the temporal structure of the audio information within the MDCT block is often distorted. Hence, short blocks are used for encoding transient information to minimise audible temporal smearing, while long blocks are used for near steady-state signals to give increased spectral accuracy.
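The block-switching decision can be illustrated with a crude energy-based transient detector; this is a hypothetical sketch, not the detector specified in the standard (real encoders typically use a psychoacoustic criterion):

    import numpy as np

    def choose_block_length(frame, threshold=4.0):
        """Compare the energy of the two halves of a frame and switch to
        short MDCT blocks when energy rises sharply (a transient), since
        quantisation noise would otherwise smear backwards in time across
        a long block."""
        half = len(frame) // 2
        e1 = np.sum(frame[:half] ** 2) + 1e-12   # guard against silence
        e2 = np.sum(frame[half:] ** 2)
        return 'short' if e2 / e1 > threshold else 'long'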

Three other significant improvements are included in the layer III encoder. A non-uniform quantiser is used to increase the effective dynamic range (in a similar manner to A-law or µ-law encoding, but operating upon a single frequency band). The quantised samples are losslessly packed using Huffman coding. Finally, a bit reservoir is included in the layer III specification. This allows the encoder to increase the bitrate during brief “hard to encode” sections, so long as it can reduce the bitrate during a nearby “easy to encode” section. The overall bitrate is held constant, so the scheme is still referred to as “constant bitrate”. In this manner, the reservoir provides some of the advantages of variable bitrate coding, whilst maintaining compatibility with fixed bitrate transmission channels.
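The reservoir bookkeeping can be sketched in a few lines (hypothetical function and limits; the real cap is set by the layer III bitstream syntax):

    def frame_bit_budget(mean_bits, demand, reservoir, reservoir_max=4000):
        """Bit-reservoir accounting, a simplified sketch. A frame may borrow
        bits banked by earlier easy frames; unused bits are saved, so the
        long-term average stays at the constant target rate."""
        budget = min(demand, mean_bits + reservoir)            # borrow if needed
        reservoir = min(reservoir + mean_bits - budget, reservoir_max)
        return budget, reservoir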

The layer III decoder is more complex than that required for layers I or II. However, the popularity of MPEG-1 and -2 layer III has led to low-cost single chip layer III decoders becoming available. Layer III is said to offer near CD quality at 128 kbps.

Many of the intricacies of the MPEG-1 layers are not covered here. Example encoders and decoders are described in the appropriate standards documents ([ISO/IEC 11172-3, 1993] and [ISO/IEC 13818-3, 1998]). One important feature is relevant to the present work, and is discussed in the next section.

2.2.5.4 Joint stereo coding

The redundancy sometimes found within two channel (stereo) signals allows for a significant bitrate reduction without a corresponding reduction in audio quality. MPEG-1 defines four modes:

1. Mono
2. Stereo
3. Dual (two separate channels)
4. Joint Stereo

In the first three modes, one or two separate channels are coded individually. In the fourth mode, the information in the two stereo channels is combined in one of two possible ways to reduce the bitrate.

Intensity stereo coding takes advantage of the human ear’s insensitivity to interaural phase differences at higher frequencies.

For each frequency band, the data from the two stereo channels is combined, and the resulting single channel of audio data is coded. Two coefficients are also stored to define the level at which this single channel should appear in each of the stereo channels upon decoding. This procedure is only appropriate at higher frequencies, but it can offer a 20% bitrate saving compared to normal stereo. Unfortunately, the use of intensity stereo can be audible. Though the ear cannot detect the interaural phase of high frequency tones, the ear can detect interaural time delays in the envelope of high frequency signals. These time delays are destroyed by intensity stereo coding, and the stereo image appears to partially collapse. However, this effect is less objectionable than highly audible coding noise, so intensity stereo is useful at low bitrates, where it effectively frees some bits to reduce the coding noise.

Matrix stereo coding exploits the similarity between two stereo channels. Rather than coding the Left and Right Channels, the Sum (or “Middle”) and Difference (or “Side”) signals are coded instead, thus:

M = \frac{L + R}{\sqrt{2}} \qquad (2\text{-}1)

S = \frac{L - R}{\sqrt{2}} \qquad (2\text{-}2)

L = \frac{M + S}{\sqrt{2}} \qquad (2\text{-}3)

R = \frac{M - S}{\sqrt{2}} \qquad (2\text{-}4)
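A short numerical check of these equations, sketched in Python, confirms both of the properties discussed below: the transform is exactly invertible, and an "almost mono" signal concentrates its energy in the mid channel:

    import numpy as np

    def ms_encode(left, right):
        """Matrix (mid/side) transform of equations (2-1) and (2-2)."""
        mid = (left + right) / np.sqrt(2)
        side = (left - right) / np.sqrt(2)
        return mid, side

    def ms_decode(mid, side):
        """Inverse transform, equations (2-3) and (2-4)."""
        return (mid + side) / np.sqrt(2), (mid - side) / np.sqrt(2)

    rng = np.random.default_rng(0)
    left = rng.standard_normal(1000)
    right = left + 0.01 * rng.standard_normal(1000)   # an "almost mono" pair

    mid, side = ms_encode(left, right)
    l2, r2 = ms_decode(mid, side)
    assert np.allclose(l2, left) and np.allclose(r2, right)  # lossless round trip
    print(np.var(side) / np.var(mid))   # tiny: the side channel needs few bits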

The transformation from L/R to M/S is entirely lossless and reversible via equations (2-3) and (2-4), though quantisation of the M/S signals will prevent perfect reconstruction in practice. For a signal with very little difference between the two stereo channels (i.e. an “almost” mono signal) the energy within the S channel is minimal, and the bitrate required for this channel is comparatively low. Thus, for a mono or 100% out of phase signal, the bitrate reduction is nearly 50%. For most audio signals, some bitrate reduction may be achieved by the use of joint stereo. It offers no benefit where the two stereo channels are completely uncorrelated. In some circumstances, it may cause problems.

For example, consider a stereo signal consisting of audio on the left channel only, with an almost silent right channel. The right channel may contain a hiss, or a quiet echo. The M and S channels will be almost identical. However, the difference between the two channels is enough to ensure that the coding noise introduced into each channel is not identical. This coding noise is masked in both channels of the M/S representation. When the left and right channels are restored in the decoder, the right channel consists of the difference between the M and S signals. Hence, the right channel will contain very little signal information, but lots of coding noise. This occurs because the signal that masked the coding noise in the M/S representation is spatially separated from the coding noise in the decoded L/R output.

MPEG-1 layer III can use a combination of stereo techniques, in which the encoder switches dynamically between independent stereo, matrix stereo, and/or intensity stereo, depending on the incoming audio signal and the desired bitrate. This is yet another reason why layer III can achieve higher quality at a specified bitrate, or a lower bitrate at a given quality than layers I and II.

It is interesting to note the target bitrates of the three layers. The specifications suggest that layers I and II achieve CD quality at 256 kbps stereo; layer II at 192 kbps joint stereo, and layer III at 112-128 kbps joint stereo. Experience suggests that these recommendations are less than exact. Some audio signals are audibly degraded by some or all of the layers at any bitrate. Further, the suggested bitrate for layer III is especially optimistic; nearly twice this bitrate is often required to ensure CD quality over a wide range of material. The majority of layer III encoders deliver a bandwidth of 15-16 kHz at 128 kbps, which is by definition not CD quality. Whilst many audio extracts do sound acceptable at 128 kbps, a significant minority do not.

It is necessary to objectively measure the sound quality of audio codecs in order to verify manufacturers’ claims, to monitor broadcast sound quality, and to improve encoder performance. In the next section, some common audio quality measurements are described, and their application to psychoacoustic audio codecs is discussed.

2.3 Audio quality measurements

If a human listener auditions an audio device, and expresses an opinion that the device sounds “good” or “bad”, then this opinion represents a subjective judgement. Subjective assessment of perceived sound quality is very important, since an audio device that sounds subjectively “bad” is undesirable. However, subjective judgements are notoriously unreliable. The placebo effect often causes human listeners to perceive “audible” differences, even where there are none.


Two different listeners may not share the same opinion. In addition, subjective judgements carried out by the same listener on different days, or even in different moods, may contradict each other. Careful listening requires controlled conditions, and expert listeners, both of which are expensive to obtain. In summary, though the subjective audio quality of a device is of utmost importance, it is exceedingly difficult to quantify. For this reason, objective audio quality assessment is often preferred.

The audio industry has developed a variety of measurements over its hundred-year history. These measurements are objective and repeatable. They also give an indication of the perceived or subjective sound quality of the device under test. However, the relationship between the measured value and the perceived sound quality can be obscure, indirect, or even hidden due to differing measurement methods. Nevertheless, objective measurements such as the frequency response, signal to noise ratio, and total harmonic distortion, represent widely understood methods of quantifying the performance of an audio device.

Three audio quality measurements will be considered, and their application to the assessment of psychoacoustic based codecs will be discussed.

2.3.1 Frequency response

The frequency response of a device is defined as the gain or attenuation of the device as a function of frequency. Some measurement methods also produce the phase response as a function of frequency, though this is less often quoted. A graph of ideal frequency response is a straight horizontal line, indicating that all frequencies are passed equally by the device. Often the frequency response is quoted as a range of frequencies. This indicates that the response does not deviate from the mean by more than the specified amount (typically ±0.5 dB or ±3 dB) over this range. This is a useful and compact method of representing the frequency response for many audio components (e.g. amplifiers) which often have a flat response over the audible band, but attenuate very low and very high frequencies.

Several methods of measuring the frequency response exist, which rely on various signals being passed through the device. Possible signals include a swept frequency sinusoid, an impulse, or a maximum length sequence. The swept sinusoid will give the amplitude response directly as a function of the input frequency. The latter two methods require a Fourier transform to be carried out upon the output of the device in order to yield the amplitude and phase responses.
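The impulse method reduces to a few lines of numerical code; a minimal sketch, with a simple FIR averaging filter standing in for the device under test:

    import numpy as np
    from scipy.signal import lfilter

    # Pass a unit impulse through the "device" and Fourier-transform the
    # output to obtain the amplitude and phase responses.
    b, a = [0.2, 0.2, 0.2, 0.2, 0.2], [1.0]   # stand-in device: 5-tap averager

    impulse = np.zeros(1024)
    impulse[0] = 1.0
    response = lfilter(b, a, impulse)          # the device's impulse response

    spectrum = np.fft.rfft(response)
    amplitude_db = 20 * np.log10(np.abs(spectrum) + 1e-12)
    phase = np.angle(spectrum)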

When measuring the frequency response of a conventional audio device, all methods yield similar results. One possible exception is the measurement of a loudspeaker’s response within a real listening room, where standing waves can cause problems at low frequencies with tonal test signals. However, in general, the frequency response measurement acts as intended.

Ideally, a psychoacoustic audio codec should have a flat frequency response, though a low pass filter may be included at some high frequency. [Brandenburg, 1999] states that this is a positive design feature, since encoding high frequency inaudible signals wastes bits which could be used on lower frequency components. In addition, if the bitrate is constrained, reducing the bandwidth is preferable to adding large amounts of audible coding noise.

The frequency response measurement should provide this information about the audio codec. A tone sweep will reveal any fixed low pass filter, but may not reveal any dynamic low pass filtering that may come into play if the encoder “runs out of bits”. The maximum length sequence stimulus consists of white noise, which is difficult to compress efficiently. Hence, this method of frequency response measurement may cause the encoder to activate any dynamic low pass filter, and this will be reflected in the frequency response measured by this method. Alternatively, a true random white noise signal may be fed into the encoder, and the spectrum may be calculated from the output of the decoder.

To evaluate the frequency response of an audio codec, both methods of frequency response measurement should be used. If the frequency response is flat up to a cut-off frequency, then the low pass frequency determined via each measurement is the only data that is required. If the frequency response is more complex, then a plot of amplitude against frequency obtained via each measurement may be appropriate.

To characterise an audio codec fully, further measurements are required. The most important aspect of the codec is the coding noise, which could be viewed as a type of non-linear signal dependent distortion. Three measurements which are appropriate for noise or distortion are now examined in turn.

2.3.2 Signal to Noise Ratio

If the RMS voltage of the maximum signal is $V_S$, and the RMS voltage of the background noise is $V_N$, then the signal to noise ratio, in dB, is given by:

\mathrm{SNR} = 20 \log_{10}\!\left(\frac{V_S}{V_N}\right) \qquad (2\text{-}5)
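For example, taking the noise as one quantisation step of a 16-bit quantiser relative to full scale (a common approximation) reproduces the 96 dB figure quoted in Table 2.1:

    import numpy as np

    def snr_db(v_signal, v_noise):
        """Equation (2-5): signal-to-noise ratio from RMS voltages."""
        return 20 * np.log10(v_signal / v_noise)

    print(snr_db(1.0, 1.0 / 2**16))   # ~96 dB, as for the 16-bit formats above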

The RMS noise voltage is measured in the absence of an input signal. A digital audio codec may easily have an infinite SNR, since a silent (digital zero) input signal will cause a silent decoded output signal. For this reason, the SNR measurement of a psychoacoustic digital audio codec is almost worthless. Where the codec does add constant noise, the SNR measurement will reflect this. However, almost all wide-band audio codecs can reproduce silent signals perfectly.

2.3.3 Total Harmonic Distortion (plus noise)

Where the device adds signal dependent distortion, this can be quantified by a THD+N measurement, as shown in Figure 2.5. A signal (usually a 1 kHz tone) is passed through the device. A notch filter centred on the signal frequency removes the test signal from the output of the device. The residue consists of the harmonic distortion plus noise. The THD may be specified in dB relative to the test signal, or as a percentage. If a pure THD measurement is required, the noise is measured in isolation, and subtracted from the THD+N value.

Figure 2.5: THD measurement technique
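The technique of Figure 2.5 can be sketched numerically; a minimal illustration with a synthetic "device output" containing a little second-harmonic distortion (the notch parameters are illustrative and, as noted below, the result depends on the notch characteristics):

    import numpy as np
    from scipy.signal import iirnotch, filtfilt

    fs = 48000
    t = np.arange(fs) / fs
    tone = np.sin(2 * np.pi * 1000 * t)                     # 1 kHz test signal
    output = tone + 0.001 * np.sin(2 * np.pi * 2000 * t)    # stand-in device output
                                                            # with 2nd-harmonic distortion

    b, a = iirnotch(1000, Q=30, fs=fs)    # notch removes the 1 kHz test tone
    residue = filtfilt(b, a, output)      # what remains: harmonics plus noise

    thd_n_db = 20 * np.log10(np.sqrt(np.mean(residue**2)) /
                             np.sqrt(np.mean(output**2)))
    # ~ -60 dB for this synthetic example, plus whatever the notch lets through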

Most audio codecs do not exhibit significant harmonic distortion, though they do add much inharmonic distortion in the form of coding noise. Whereas the harmonic distortion components are found at integer multiples of 1 kHz above the test tone, the distortion added by the codec is centred on the 1 kHz tone. By definition, the THD includes only harmonic components. However, real world THD+N measurements of an audio codec will measure the coding noise, but the measured value will depend upon the characteristics of the notch filter. Hence, THD+N is not an accurate measurement of the coding noise.


Intermodulation distortion is another phenomenon that is often measured. However, like THD, it is less relevant to audio codecs because the signal components and the resulting distortion are at opposite ends of the audible spectrum, whereas the coding noise resides around the signal frequency.

2.3.4 Input Output difference analysis

In a digital system, providing any delay due to the device is known and corrected for, the input signal can be subtracted exactly from the output signal, as shown in Figure 2.6. The residue consists of any noise and distortion added by the device. This technique may be used to determine the noise that is added by an audio codec in the presence of an input signal. If a test signal is applied, standard noise measuring techniques (e.g. [ITU-R BS.468-4, 1986] weighting followed by RMS averaging) may be used to calculate a single noise measurement. Alternatively, a Signal to Noise like Ratio may be computed, where the noise level is measured in the presence of the signal, rather than with the signal absent. This noise measurement may be used in equation (2-5), in place of $V_N$. The measurement is objective and repeatable.

Figure 2.6: Input Output difference analysis
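A minimal sketch of the technique (hypothetical helper; the codec delay is assumed known, as stated above):

    import numpy as np

    def coding_noise_rms(original, decoded, delay):
        """Input/output difference analysis: correct for the known codec
        delay, subtract, and measure the RMS of the residue. The residue
        contains all the added noise and distortion, audible or not."""
        aligned = decoded[delay:delay + len(original)]
        residue = aligned - original
        return np.sqrt(np.mean(residue ** 2))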

Unfortunately, this measurement is almost useless for audio quality assessment. It is useless because the measured value does not correlate with the perceived sound quality of the audio codec. In fact, the noise measurement gives no indication of the perceived noise level.

The problem is that the noise measurement is quantifying inaudible noise. An audio codec is designed to add noise. The intention is to add noise within spectral and temporal regions of the signal where it cannot be perceived by a human listener. Subtracting the input signal from the output of the codec will expose this noise, and the noise measurement will quantify it. If the inaudible noise could somehow be removed from the measurement, then the resulting quantity would match human perception more accurately, since it would reflect what is audible. This task is complex, and many other approaches have been suggested which avoid this task. Some of these approaches, and the reasons why they are inappropriate, are discussed below.


A measurement of coding noise will include both audible and inaudible noise. Many analyses assume that all codecs will add equal amounts of inaudible noise. If this is true, then the codec that adds the most noise will sound worst, since it must add the most audible noise. However, a good codec may add a lot of noise, but all the noise may be masked. This codec will cause no audible degradation of the signal. Conversely, a poor codec may add only a little noise, but if the noise is above the masking threshold, then the codec will sound poor to a human listener. Hence, this approach is flawed, because the basic assumption is incorrect.

Many codec analyses published on the World Wide Web include plots of the long-term spectrum of the signal and coding noise. This approach assumes that where the coding noise lies below the signal spectrum, it will be inaudible, and where the noise is above the signal spectrum, it will be audible. Unfortunately, these assumptions are false. Noise above the signal spectrum may be masked, because masking extends upwards in the frequency domain. Noise below the signal spectrum may be audible, because the spectrum must be calculated over a finite time (ranges from 1024 samples to three minutes have been encountered). Hence, the signal that apparently masks the codec noise may not occur at the same time as the noise itself. This is especially true for sharp attacks, where many encoders generate audible pre-echo before the attack. This pre-echo is below the spectral level of the attack, so appears “masked” using this mistaken analysis method.

The problem with all these techniques is that they side-step the basic problem: it is necessary to determine which noise components are audible, and which are inaudible, before the audible effect of the codec upon the signal may be quantified.

In essence, the historical measurements that are discussed above are useful where an audio device is designed to change the signal as little as possible. However, audio codecs are designed to alter the signal significantly, but in a manner that is inaudible to a human listener. For this reason, a human listener must be the ultimate judge of the quality of an audio codec.

Subjective human opinion is notoriously unreliable. If it is to act as the ultimate judge, and provide a reliable quantitative indication of perceived audio quality, then some rigorous procedure must be employed. Such a procedure is described in the next section.

2.4 Subjective assessment

An international standard exists for “the subjective assessment of small impairments in audio systems” [ITU-R BS.1116-1, 1997]. The audible noise added by a psychoacoustic codec usually falls within the definition of a “small impairment”. This includes all codecs that aim to be “transparent”, where the difference between the original and coded audio signals may or may not be audible. If the perceived degradation due to the codec is large enough to be obvious to untrained listeners, then another assessment standard is appropriate. The MUSHRA standard (proposed by the EBU [EBU, 2000], and now undergoing ratification by the ITU as [ITU-R draft BS.6/106, 2001]) addresses the assessment of medium quality audio codecs, where the audible impairment is obvious.

There are two reasons for the existence of two separate standards. Firstly, BS.1116 is very time consuming, mainly because it is necessary to prove that an audible impairment actually exists, before any quantification of the impairment can be considered valid. Within the MUSHRA testing procedure, it is assumed that the impairment is audible, which reduces the testing time considerably. Secondly, at some bitrates, it is impossible to achieve near CD quality, and codecs operating at these bitrates cannot be expected to sound transparent. Nevertheless, where the bitrate is severely limited it is useful to know which audio codec offers the best audio quality. Testing low bitrate codecs using BS.1116 will yield a set of results that are clustered within the worst quarter of the impairment scale, and the differences between codecs will be inaccurately represented. MUSHRA allows listeners to compare codecs directly, thus giving a more accurate prediction of relative quality of each codec.

There is a region of overlap between the two testing methodologies. However, all medium-to-high quality audio codecs are suitable for assessment via BS.1116, and this testing procedure is examined in detail.

In order to make all tests as “equal” as possible, BS.1116 contains strict guidelines on the following issues: choice of listeners, testing method, impairment scale, statistical analysis of results, listener training, choice of test material, specifications of loudspeakers and headphones, characteristics of the listening environment (including size, loudspeaker positioning, background noise, reverberation time etc.), and listening level.


The choice of listeners is very important. All the listeners involved in the test should have experience in assessing audio quality, and must have demonstrated the ability to discern the types of audio impairment that will be auditioned in the test. All listeners must be trained extensively before the test is carried out. The results from a listener may be rejected after the test, if their results demonstrate that they were unable to reliably detect impairments. Approximately 20 listeners are desired, though fewer are often used in practice.

The choice of test signal is critical. It is known that the noise added by audio codecs is significantly more audible within certain audio signals. It is suggested that appropriate material should be selected via an initial listening test. Only the most challenging audio extracts are suitable for use within a listening test. Extracts should be no longer than 1 minute. A duration between 15 and 30 seconds is ideal.

The test method is “double-blind triple-stimulus with hidden reference”, as follows. A listener is presented with three audio extracts, identified as A, B, and C. The playback of these extracts is under the listener’s control. They may listen to each extract as many times as they wish, and switch between extracts at will. Extract A is the original version, and extracts B and C are the original and coded versions, presented in a random order. The listener must identify whether B or C is the coded signal, and grade that extract on a scale from 1.0 to 4.9, shown in Figure 2.7. The listener must select either B or C as the coded version. This is to prevent conservative listeners from grading all extracts at 5.0, and is found to improve the accuracy of the test.

Where B or C is audibly different from A, but the coded version is preferred, this difference should still be assessed, and graded between 4.9 and 4.0 (perceptible but not annoying).

Figure 2.7: Scale used within BS.1116 (from [ITU-R BS.1284, 1997])


If the listener believes that B is the coded version, then by inference, C must be the original. Hence, C is implicitly given a score of 5.0. Some tests insist that the listener should grade both B and C explicitly, but since the listener is aware that one of B or C is the original signal, this is not strictly necessary. The raw score is not used as an indication of codec quality, but is transformed into a diffgrade, thus:

$\mathrm{diffgrade} = \mathrm{score}_{\mathrm{coded}} - \mathrm{score}_{\mathrm{original}}$   (2-6)

where $\mathrm{score}_{\mathrm{coded}}$ is the score that the listener gave to the coded extract – not the score that the listener gave to the extract that they believed was the coded one. Thus, where the listener is mistaken in their choice, a positive diffgrade is generated, and where the listener correctly identifies the coded audio, a negative diffgrade is generated. The mean of all the diffgrades assigned to a given extract by all the listeners is a good indication of the perceived sound quality⁵. The transference of the scale in Figure 2.7 to the diffgrade scale (by subtracting 5 from each of the numbers) adds some descriptive information to the numerical diffgrade.
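The bookkeeping of equation (2-6) is easily misread, so a minimal sketch may help. The scores below are hypothetical; note how the one mistaken listener contributes a positive diffgrade:

```python
import numpy as np

def diffgrade(given_score, identified_correctly):
    """Equation (2-6): diffgrade = score(coded) - score(original).
    The extract that was not graded is implicitly scored 5.0, so a
    mistaken listener yields a positive diffgrade."""
    return (given_score - 5.0) if identified_correctly else (5.0 - given_score)

# Hypothetical results for one extract: (score given, correct identification?)
results = [(4.2, True), (3.8, True), (4.9, False), (4.5, True)]
grades = [diffgrade(score, ok) for score, ok in results]
print(f"mean diffgrade = {np.mean(grades):+.2f}")  # -0.60 for these values
```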

An excellent example of a BS.1116 test is reported in [Meares et al, 1998], and the first large-scale MUSHRA test is reported in [Stoll and Kozamernik, 2000].

2.4.1 Beyond subjective assessment

The BS.1116 procedure is very time consuming, due to listener selection and training, and extract selection. The results are accurate, and surprisingly consistent, though some “anchor” extracts are sometimes required to ensure that the score-space is used correctly and consistently. However, the time and money involved prevents true BS.1116 compliant tests from being used in all situations where codec audio quality must be assessed.

In particular, it is very difficult to assess the audio quality of a codec “in service” within a broadcast chain using a subjective test. It is also painfully slow and expensive to use full scale BS.1116 subjective tests during the development cycle of an audio codec. For these reasons, it

5 BS.1116 strongly advises statistical processing to pre-filter rogue results, and ANOVA to determine the relationship between listener, codec, and audio extract. However, after the results have been filtered, the mean of the diffgrades across listeners does indicate the perceived quality of that codec/extract combination.

is desirable to develop an objective measurement that can mimic the performance of a human listener within a BS.1116 subjective test.

The objective measurement equipment must “listen” in a comparable manner to a human listener, comparing original and coded audio signals, and quantifying the audible difference between them. An ideal objective measurement would match the diffgrade, though more detailed information about the character of the audible differences would be useful in codec development.

An objective measurement tool which achieves these goals has been developed [Thiede et al, 2000]. At the heart of this tool is a psychoacoustic model that simulates human hearing. This tool, and many other psychoacoustic models, are discussed in Chapter 4.

Psychoacoustic models are based upon human hearing. Hence, before examining existing psychoacoustic models, the human auditory system will be discussed in detail in Chapter 3.

2.5 Conclusion

Psychoacoustic codecs reduce the amount of data required to represent an audio signal, by degrading masked regions of the spectrum. Conventional objective measurements of audio quality cannot predict the perceived quality of these codecs. Human listeners must be the final arbiters of quality judgements, and rigorous subjective test procedures have been prescribed which allow consistent qualitative assessments of audio quality to be carried out by expert listeners. However, it has been shown that this procedure is time consuming and expensive, and an objective alternative is required for many applications.

3

The Human Auditory System

3.1 Overview

In order to develop a model capable of simulating human hearing, it is desirable to understand the mechanisms present within the human auditory system (HAS). In this chapter the path of an audio signal is followed from free space, through the HAS, to its representation by neural impulses within the brain. At each stage along this path, it will be noted whether present knowledge is quantitative, qualitative, or theoretical. The sources of quantitative data and processing theories will be presented. Particular attention is given to the processes that give rise to the limits of human hearing, such as masking effects.

3.2 A walk through the human auditory system

Figure 3.1 (split across two pages) shows the main components of the human auditory system. The upper illustrations represent the physiology – the actual physical components that are present and identifiable within the human anatomy. The lower graphs indicate the functionality of each section. All frequency domain plots show amplitude in dB against log frequency. All time domain plots are linear on both scales. The illustration of the organ of Corti at the top of Figure 3.1 (c) is taken from [Yates, 1995]; Figure 3.1 (d) is after [Patterson, WEB-1].

Figure 3.1 (a)–(d): Signal path through the Human Auditory System. Upper half: physiology; lower half: function.

Referring to Figure 3.1, the function of each section is as follows.

• The pinna is the flap of skin and cartilage found on each side of the human head. Some researchers use the term concha. Together with the ear canal, it forms the outer ear. The function of the pinna is to direct sound into the ear canal. In doing so, the shape of the pinna causes the incoming signal to be filtered in a direction dependent manner.

• The ear canal forms a tube, linking the pinna to the tympanic membrane. The dimensions of the ear canal are such that it forms a resonance at around 2-5 kHz, boosting frequencies within this range relative to those outside it.

• The tympanic membrane (also known as the eardrum) lies at the base of the ear canal. The small ossicles, known as the malleus, incus, and stapes, act as an impedance-matching device, converting airborne sound pressure waves into fluid-borne sound pressure waves within the cochlea.

• The fluid-filled cochlea is a coil within the ear, partially protected by bone. It contains the basilar membrane, and hair cells, responsible for the transduction of the sound pressure wave into neural signals.

• The basilar membrane (BM) semi-partitions the cochlea, and acts as a spectrum analyser, spatially decomposing the signal into frequency components. Each point on the BM resonates at a different frequency, and the spacing of resonant frequencies along the BM is nearly logarithmic. The effective frequency selectivity is governed by the width of the filter characteristic at each point.

• The outer hair cells are distributed along the length of the BM. They react to feedback from the brainstem, altering their length to change the resonant properties of the BM. This causes the frequency response of the BM to be amplitude dependent.

• The inner hair cells fire when the BM moves upwards, so transducing the sound wave at each point into a signal on the auditory nerve. In this way the signal is effectively half wave rectified. Each cell needs a certain time to recover between firings, so the average response during a steady tone is lower than that at its onset. Thus, the inner hair cells act as an automatic gain control. The firing of any individual cell is pseudo-random, modulated by the movement of the BM. However, in combination, signals from large groups of cells can give an accurate indication as to the motion of the BM.


These processes will be considered in detail, before moving on to a discussion of the subsequent neural processing.

3.2.1 The Pinna

Until the late 19th century, the pinna was thought to be a simple funnel that directed sound into the ear. It is now known that the pinna is responsible for the ability to locate sounds in 3-D space. This ability is worthy of study in the present context, since all commercial systems which aim to create a virtual auditory environment rely on this property of human hearing to some extent. These include Dolby virtual surround, Lake theatre-phones, and any binaural [Robinson and Greenwood, 1998] or transaural [Foo et al, 1998] processing.

Humans can locate sounds in three dimensions. This is evident from the fact that sounds coming from outside our field of vision can be readily located. There are six mechanisms, or cues, by which humans locate sounds [Robinson and Greenwood, 1998], [Grantham, 1995]. These mechanisms will be examined by considering the sound that would reach a listener’s ears from a source in front of them, slightly to their right.

1. If the source is nearer the right ear than the left, the sound will reach the closer ear first, yielding a time delay between the signals received at each ear. This is called the inter-aural time (or phase) difference, or ITD. Figure 3.2 [Green, 1976] shows a graph of inter-aural time difference against source angle in the horizontal plane. Note that the graph is symmetrical, so for each angular position in front of the listener, there is a corresponding position behind the listener that yields the same ITD.

Figure 3.2: Variation of Interaural time difference with source angle

At around 1.5 kHz the wavelength of a pure tone is equivalent to the distance between the two ears. Hence, above half this frequency, the phase relationship between the signals arriving at each ear will be ambiguous, and it will be impossible for the auditory system to calculate a unique ITD. It is sometimes erroneously stated that the ITD provides no useful


localisation information for signals above 700 Hz. However, this restriction only applies to pure tones. For complex signals containing only frequencies above 700 Hz, the signal envelope may vary at a rate well below 700 Hz, and hence the inter-aural time delay may be deduced with ease [Boerger, 1965]. (A simple geometric sketch of the ITD cue is given after this list.)

2. The level of sound reduces with distance from the source (the intensity falls as 1/r²). If the source is slightly nearer the right ear, the sound reaching that ear will be louder than that reaching the left. However, given the distance between the two ears, this is a small effect. More significantly, at higher frequencies, the head acts as a barrier to sound waves, so signals originating from one side of the listener will be attenuated at the opposite ear. This gives rise to the inter-aural level (or intensity) difference, or ILD. Figure 3.3 [Green, 1976] shows how the interaural level difference varies with source angle. Note that the effect is more pronounced for higher frequencies, where the head shadow effect is greatest.

Figure 3.3: Variation of Interaural level difference with source angle

3. The action of the pinna is third in order of importance. It will be discussed in detail after the other cues.

4. If a listener is able to move their head, all the cues change dynamically, depending on the exact location of the source. This gives humans a more accurate bearing on the source loca- tion, and itself acts as a fourth localisation cue. It is often used subconsciously to resolve ambiguity in source location when all other cues are inconclusive.

The fifth and sixth cues are significantly weaker than the others.

5. In a reverberant environment (e.g. any normal room), the loudness of the direct sound from the source compared to the level of reverberation will give an indication of the source distance. Early reflections may also strengthen the ability of the HAS to perceive the distance of the source.


6. Finally, if the source is a great distance away, the large volume of air through which the sound must pass will attenuate high frequencies, yielding another distance cue.
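To give a feel for the magnitude of the ITD cue (cue 1 above), the spherical-head approximation usually attributed to Woodworth may be sketched as follows. This formula is not taken from this thesis, and the 8.75 cm head radius is a conventional assumed value:

```python
import numpy as np

def itd_woodworth(azimuth_deg, head_radius=0.0875, c=343.0):
    """Woodworth's spherical-head approximation to the interaural time
    difference (seconds) for a distant source at the given azimuth."""
    theta = np.radians(azimuth_deg)
    return (head_radius / c) * (theta + np.sin(theta))

for azimuth in (0, 30, 60, 90):
    print(f"{azimuth:3d} deg -> ITD = {itd_woodworth(azimuth) * 1e6:4.0f} us")
# 90 deg gives roughly 660 us, consistent with the maxima of Figure 3.2.
```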

If the listener keeps their head stationary, thus excluding cue four, the remaining cues give no way of knowing whether the source is in front or behind, or above or below the listener. Psychoacousticians talk of the “cone of confusion”, which is the cone-shaped locus of possible locations inferred by the ITD cue alone [Mills, 1972]. Without further information, the auditory system has no way of accurately locating the sound source in real space. However, it is evident from everyday life, and has been proven experimentally, that humans are able to locate sounds without head movement, albeit with reduced accuracy.

This is possible because of the pinna. The pinna filters the incoming sound wave in a directionally dependent manner. Figure 3.4 shows the frequency response of the sound reaching the ear canal (via the pinna), from sources at various angles around a listener. The measurements in Figure 3.4 were made using a dummy head (a similar set of measurements are available from [Gardner and Martin, 1994]), but real human data are similar. These responses are called Head Related Transfer Functions (HRTFs) (see [Moller et al, 1995] for an extensive study). However, humans do not perceive this pinna filtering as changing the spectrum of the sound – the auditory system decodes the spectrum shape into spatial information.

For example, consider a sound source to the right of a listener, at a bearing of 45°. There is a dip around 10 kHz in the HRTF for this angle, as shown in Figure 3.4. The auditory system detects this spectral signature, and decodes it into an estimate of source location. This process happens unconsciously, so the listener perceives a source at 45°, and is unaware of the spectral coloration added by the pinna.

The surprising result is that humans do not need two ears to localise sounds. Blocking one ear proves that even without the ITD and ILD information, the spectral cues due to the pinna are sufficient to allow a human listener to localise sounds in 3-D space [Batteau, 1967].


Figure 3.4: Head Related Transfer Functions of a dummy head


An interesting feature of human auditory localisation is that visual cues can be stronger than audio cues. Where the two contradict, the visual cue usually takes precedence. Though experiments in darkened rooms have proven that listeners can accurately locate sound sources to within ± 1°, a voice will often appear to originate from the lips of a person displayed on a T.V. or cinema screen, even though the sound originates from a loudspeaker several centimetres or even metres away. Multi-modal phenomena such as this are beyond the scope of the current research.

3.2.2 The ear canal

The resonant cavity between the outer and middle ear is called the ear canal (see Figure 3.1). It resonates in a similar manner to an organ pipe, since it is open at one end, and closed at the other, as shown in Figure 3.5 (after [Young, 2000]). The ear canal has a diameter of less than 1 cm, and is approximately 2.3 cm in length. Using this information, the resonant frequencies corresponding to the fundamental and 1st overtone can be calculated as follows. The antinodes and nodes are displacement maxima and minima respectively.

Figure 3.5: Resonance of a closed pipe. (a) Fundamental; (b) 1st overtone

An approximation to the speed of sound in air, $v$, is given by

$v = 331 + 0.6T$   (3-1)

where $T$ is the temperature in degrees centigrade. The core temperature of the human body is typically 37 degrees centigrade, and the air within the ear canal is approximately 30 degrees centigrade. Hence, from equation (3-1) the speed of sound within the ear canal is approximately 350 ms⁻¹. If the length of the ear canal is given by $l$, then it can be seen from Figure 3.5 that the wavelengths of the fundamental and first overtone, $\lambda_f$ and $\lambda_o$, are related to $l$ thus:

$l = \dfrac{\lambda_f}{4}$   (3-2)


$l = \dfrac{3\lambda_o}{4}$   (3-3)

However, for any waveform

$v = f\lambda$   (3-4)

So the frequencies of the fundamental and 1st overtone are given by

$f_f = \dfrac{v}{4l}$   (3-5)

$f_o = \dfrac{3v}{4l}$   (3-6)

From this calculation, the frequency of the fundamental resonance is approximately 3.8 kHz, and the frequency of the first overtone is approximately 11.4 kHz. The measured frequency response of the ear canal matches this calculation, exhibiting peaks around 4 kHz and 12 kHz. See [Mehrgardt and Mellert, 1977] and [Moller et al, 1995a] for ear canal response measurements.
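For checking purposes, the worked calculation of equations (3-1) to (3-6) may be written out directly; the 2.3 cm canal length and 30 degree air temperature are the values assumed in the text:

```python
def ear_canal_resonances(length_m=0.023, temperature_c=30.0):
    """Closed-pipe resonances of the ear canal, per equations (3-1)-(3-6)."""
    v = 331.0 + 0.6 * temperature_c          # speed of sound, equation (3-1)
    f_fundamental = v / (4.0 * length_m)     # equation (3-5)
    f_overtone = 3.0 * v / (4.0 * length_m)  # equation (3-6)
    return f_fundamental, f_overtone

ff, fo = ear_canal_resonances()
print(f"fundamental ~ {ff:.0f} Hz, first overtone ~ {fo:.0f} Hz")
# ~3793 Hz and ~11380 Hz: the 3.8 kHz and 11.4 kHz quoted in the text.
```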

In summary, signal components at or near these resonant frequencies are boosted within the ear canal. This response contributes to the ear's increased sensitivity to speech (mid-frequency) sounds. Figure 3.6 (from [Plack and Carlyon, 1995]) shows the level of sound at each frequency which humans perceive as being the same loudness. Examination of these curves shows the importance of the fundamental ear canal resonance, clearly visible as an increased sensitivity to sounds in the 2-6 kHz frequency region. The lower half of the resonance due to the 1st overtone is also visible at the extreme right of the graph.

Figure 3.6: Equal loudness contours and minimum audible field. The contours join sounds of equal perceived loudness. The reference for each contour is the perceived loudness of a 1 kHz tone at the SPL indicated on the graph. At all other frequencies, the contour indicates the SPL required for a sound of that frequency to match the perceived loudness of the reference tone.

3.2.3 Middle Ear

The tympanic membrane (eardrum), malleus (hammer), incus (anvil) and stapes (stirrup) transmit the sound pressure wave from the ear canal into the cochlea. The three ossicles act as a leverage system, converting the weak, high-amplitude oscillatory movement of the eardrum into a stronger, lower-amplitude force at the cochlea. This is transmitted into the cochlea by the pistonic action of the stapes against the oval window.

The frequency response and active non-linear properties of the middle ear are still open to debate. Historically, some researchers have assumed that the inner ear is equally sensitive to all frequencies. If this is true, then the changes in sensitivity with frequency within the HAS shown in Figure 3.6 are entirely due to the frequency response of the outer and middle ear. Other researchers (e.g. [Zwicker and Fastl, 1990]) have suggested that internal noise at lower frequencies is entirely responsible for the ear's decreased sensitivity to low frequency sounds. Recently, direct measurements of the middle ear transfer function in human cadavers [Puria et al, 1997] suggest that the middle ear transfer function gives rise to most of the variation shown in Figure 3.6. However, the outer ear, and (to a lesser extent) internal noise, both play a role. Thus, the averaged inverse of the Equal Loudness curves provides a good estimate of the outer and middle ear transfer function, especially at higher levels, where the effects of internal noise are less significant.

The recent measurements carried out upon dead subjects give no insight into active mechanisms which may or may not be present within living ears. Many of the non-linearities which have previously been attributed to the middle ear are now believed to be generated by processes within the cochlea. However, the leverage system within the middle ear is believed to stiffen and/or retract from the oval window in the presence of exceptionally loud sounds, so offering some protection to the delicate mechanisms found within the cochlea.

3.2.4 The Cochlea¹

The fluid-filled cochlea is a coil within the ear, partially protected by bone. The sea water-like fluid vibrates with the incoming sound wave. The cochlea is semi-partitioned along its length by a thin flap called the basilar membrane.

3.2.4.1 The basilar membrane

The basilar membrane (BM) vibrates with the incoming sound, and acts as a spectrum analyser, spatially decomposing the signal into frequency components.

The movement of the BM is in the form of a travelling wave, as demonstrated by [von Békésy, 1960]. The wavefront travels from the oval window, down the BM towards the apex. The amplitude envelope of this travelling wave varies along the BM, as shown in Figure 3.7. The location of the maximum amplitude of oscillation marks a point of resonance, and the location of this resonance is dependent on the frequency of the incoming signal, as follows.

The stiffness of the BM decreases along its length. The resonant frequency of a point on the BM varies with stiffness, thus sounds of different frequencies cause different parts of the BM to vibrate.

At the base (near the stapes and oval window) the BM is narrow and stiff. The BM becomes broader and more flexible further into the cochlea (i.e. further along the BM).

Figure 3.7: Properties of the basilar membrane

1 A recent review of the structure and function of the cochlea can be found in [Yates, 1995].

The difference from base to apex is a factor of about 3-4 times broader and 100 times more flexible. Higher frequencies cause oscillations near the base, where the BM is at its stiffest, and vice versa. This yields two interesting consequences.

3.2.4.1.1 Non-uniform frequency selectivity

The spacing of frequency resonances along the BM is not linear with frequency. For instance, if we take the point on the BM that resonates at 2 kHz, and measure the distance to the point that resonates at 4 kHz, we may find that it is 1 cm. If we now move a further 1 cm along the BM, we might expect to reach the 6 kHz resonant point. In fact we find a resonance of around 8 kHz. The resonant frequencies of various points along the BM are shown in Figure 3.8, from [Carterette, 1978]. These are known from probe experiments on mammalian cochlea, though some interpolation is necessary. The scale that relates the resonant frequency to position on the BM is called the Bark scale, or critical band scale, as shown in Figure 3.9. A formula for converting linear frequency to Bark scale is given in [Traunmüller, 1990]. Above 500 Hz, it is a reasonable approximation to a log scale [Zwicker and Terhardt, 1980].

Figure 3.8: Position of resonant frequencies along the coiled length of the basilar membrane
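The conversion in question is short enough to quote as code; the expression below follows [Traunmüller, 1990], and the printed values are for orientation only:

```python
def hz_to_bark(f_hz):
    """Critical band rate in Bark, after [Traunmüller, 1990]."""
    return 26.81 * f_hz / (1960.0 + f_hz) - 0.53

for f in (500, 1000, 2000, 4000, 8000):
    print(f"{f:5d} Hz -> {hz_to_bark(f):5.2f} Bark")
# Above ~500 Hz each doubling of frequency spans a roughly similar
# number of Barks, reflecting the near-logarithmic resonance spacing.
```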

At higher frequencies, this matches human pitch perception, which is logarithmic. Musical instruments are also tuned logarithmically. The reader is invited to listen to the demonstration of linearly spaced and logarithmically spaced tones included on the accompanying CD-ROM. Linear frequency steps sound very large at low frequencies, and very small at higher frequencies. Logarithmic frequency steps sound approximately equal across the audible frequency range.

Figure 3.9: Relationship between frequency and critical band (bark) scale
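The two tone sequences of that demonstration can be recreated with a few lines of NumPy. The frequency range and step count here are arbitrary choices, and playback (or writing to a WAV file) is omitted:

```python
import numpy as np

fs = 44100

def tone_sequence(freqs, duration=0.3):
    """Concatenate short sine tones at the given frequencies."""
    t = np.arange(int(fs * duration)) / fs
    return np.concatenate([np.sin(2 * np.pi * f * t) for f in freqs])

linear_steps = tone_sequence(np.linspace(200, 6400, 12))  # equal Hz steps
log_steps = tone_sequence(np.geomspace(200, 6400, 12))    # equal ratio steps
# The logarithmic sequence sounds evenly spaced; the linear one does not.
```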


Whilst there is a good correlation between frequency spacing on the BM and perception of pitch, this does not prove that pitch perception is entirely due to resonance location on the BM. This simplified view is correct for higher frequencies, but for lower frequencies, the firing rates of the hair cells upon the BM are more important. This will be discussed in Section 3.2.4.2.

3.2.4.1.2 Spectral masking

In the previous section it was shown that the mechanical structure of the BM causes it to act as a spectrum analyser, spatially decomposing the signal into frequency components. The auditory system can listen to the oscillation at any single point on the BM to the exclusion of all others. This gives the human auditory system the ability to filter out or ignore sounds away from the frequency of interest. The frequency resolution of such an analyser is limited by the width of the filter characteristic at each detection point. If the filters were infinitely narrow, then the presence of one spectral component would never impair the ability to detect another². In reality, this is not the case, and the filters have a finite width.

The width of the filters varies with frequency. The relationship between resonant frequency and filter width is similar to the relationship between resonant frequency and filter spacing. The widest filters (giving the best time resolution, but poorest frequency selectivity) occur at high frequencies, whilst narrower filters (giving better frequency resolution) occur at lower frequencies.

2 An infinitely narrow filter would also ring indefinitely, and introduce infinite delay.


Figure 3.10 shows the response at a particular point on the BM [Irino and Patterson, 1997]. The point on the BM graphed in black resonates at around 1 kHz, and it responds somewhat to all frequencies below the resonance, but rejects higher frequencies. Each point on the BM has a similar response, shifted up or down in frequency (shown in grey). If the BM is stimulated with a pure tone, the response is the excitation pattern shown in Figure 3.11. The excitation pattern is the reversal of the frequency response. Mathematically, the frequency response (Figure 3.10) is convolved with a delta function (the pure tone), hence the reversal.

To show this empirically, examine the responses of the resonances either side of the pure tone (i.e. the grey curves in Figure 3.10, either side of the vertical black line). The height at which they intersect the vertical black line corresponds to the amplitude of oscillation at that point on the BM. Note that all points above the tone are excited by it somewhat, but points significantly below it are not, which corresponds to the shape of the excitation curve. (This intuitive way of explaining the convolution was taken from [Hollier, 1996].)

Figure 3.10: Frequency response of points on basilar membrane. The black curve shows, for a range of input frequencies (x axis), the amplitude of vibration of one particular point on the BM (y axis). The vertical black line indicates a 1 kHz pure tone. The excitation due to this tone is shown in the next figure.

Figure 3.11: Excitation of basilar membrane due to a 1 kHz pure tone. This graph shows, for each point on the BM (x axis), the amplitude of vibration of each point (y axis) due to a pure tone.

Thus, a single pure tone excites a wide region of the BM. This is unsurprising; the BM is a continuous membrane, so it would be impossible for one point on it to move, whilst an adjacent point remained stationary, as that would cause it to rip or tear. Thus, even a pure tone (which is spectrally very narrow) must excite a finite region. This region extends upwards in frequency, because the sound wave enters the cochlea at the high frequency end of the BM, and travels down to the appropriate resonant place. Beyond this resonant place, the oscillation is rapidly damped.
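This “frequency response convolved with the stimulus” picture can be sketched numerically. The spreading slopes used below (25 dB/Bark below a component, 10 dB/Bark above it) are illustrative values only, and maximum-combination is used in place of true addition – a simplification, since (as discussed in Section 3.2.4.1.3) excitations do not add linearly:

```python
import numpy as np

def excitation_pattern(spectrum_db, bark_axis):
    """Spread each spectral component along the Bark axis with an
    asymmetric spreading function: steep below the component, shallow
    above it.  The shallow upper slope reproduces the upward spread of
    masking seen in Figure 3.11."""
    excitation = np.full(bark_axis.shape, -np.inf)
    for z0, level in zip(bark_axis, spectrum_db):
        dz = bark_axis - z0
        spread = np.where(dz < 0, 25.0 * dz, -10.0 * dz)  # dB per Bark
        excitation = np.maximum(excitation, level + spread)
    return excitation

bark = np.linspace(0.0, 24.0, 241)
spectrum = np.full(bark.shape, -120.0)  # near-silent background
spectrum[85] = 90.0                     # a pure tone at 8.5 Bark (~1 kHz)
exc = excitation_pattern(spectrum, bark)
```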


Figure 3.12: Left: Spectrogram of a demonstration of spectral masking. Right: Indication of masking due to 800 Hz tone

The oscillating region covers the resonant peaks of several frequencies either side of the pure tone. Consider what would happen if a second pure tone is applied. This tone is of lower amplitude and slightly higher frequency than the first tone, and the region that it should excite on the BM is already excited by the original tone. If this new tone were played in isolation, the BM would vibrate, and the tone would be heard by the listener. However, the BM is already vibrating, and the presence of the new tone may not cause an increase in vibration. This means that the new tone will be inaudible. This effect is called spectral (or frequency) masking. No matter how hard or carefully the listener strains to hear the second tone, the transducing apparatus in the human ear does not have the capability to pass information about the second tone on to the brain. It can never be heard in the presence of the first (louder) tone [Moore et al, 1998].

This tone masking tone phenomenon can be demonstrated by generating a series of tones ascending in pitch, and then adding a louder tone, at an intermediate pitch, which persists through the entire sequence. A spectrogram of such a sequence is shown in Figure 3.12, and this demonstration is included on the accompanying CD-ROM. As the pitch of the stepped tones passes through the louder tone, the stepped tones become inaudible, demonstrating the

spectral masking due to the louder tone. Another feature is that the masking extends further above the tone than below it, due to the upwardly extended excitation pattern on the BM, as shown in Figure 3.11 – this is usually referred to as the upwards spread of masking.

In addition to the masking of one tone by another, there are three other basic masking phenomena: tone masking noise, noise masking tone, and noise masking noise. In each of these, the first signal is the masker, and the second is the maskee or target. The masked threshold is defined as the level at which the target signal is just audible in the presence of the masker. The amount of masking is defined as the difference between the masked threshold and the threshold in silence. The amount of masking depends on the nature of both the masker and the target, since the two can interact in complex ways within the ear. However, in all cases it is the action of the basilar membrane which is responsible for the masking.

Tone masking tone is the most complex combination, since the two tones can interact, generating beats and combination tones which undo the masking to a certain extent. A thorough study of this phenomenon, and much useful psychoacoustic data, can be found in [Moore and Alcántara, 1998] and [Alcántara et al, 2000].

Tone masking noise: noise stimuli can be either narrow- or wideband. For a pure tone to mask noise, the noise must fall below the excitation due to the tone, and hence must be narrowband. The threshold is a function of the level, bandwidth, and centre frequency of the noise. Again, much useful data can be found in [Moore and Alcántara, 1998].

Noise masking tone: historically, experiments involving narrow band noise masking a pure tone have been used to discover the shape of the auditory filter, and much of the available data is in this form. The noise is centred on the frequency of the tone, and the bandwidth of the noise is increased. The tone becomes harder to detect, up until the point where the noise bandwidth matches that of the auditory filter. This is referred to as the critical bandwidth. Increasing the bandwidth of the noise beyond this causes no further reduction in the threshold of the tone. This is because the excess noise falls outside the filter characteristic of the point on the BM which detects the tone. This experiment was first performed by Fletcher in 1940, and has been repeated many times since (see [Bernstein and Raab, 1990] for a recent version).
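An analytic approximation to the critical bandwidth, after [Zwicker and Terhardt, 1980] (cited earlier in connection with the Bark scale), may be sketched as follows; the printed values are indicative:

```python
def critical_bandwidth_hz(f_hz):
    """Critical bandwidth approximation after [Zwicker and Terhardt, 1980]."""
    f_khz = f_hz / 1000.0
    return 25.0 + 75.0 * (1.0 + 1.4 * f_khz ** 2) ** 0.69

for f in (250, 1000, 4000):
    print(f"{f:4d} Hz -> critical bandwidth ~ {critical_bandwidth_hz(f):4.0f} Hz")
# ~100 Hz at low frequencies, widening to ~700 Hz at 4 kHz.
```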


Wideband noise masking a pure tone is the simplest of these synthetic stimuli for which to define the amount of masking, which is a function of target frequency, and (to a lesser extent) the level of the masker. A large amount of useful data is found in [Reed and Bilger, 1972].

Beyond these simple cases, almost all audio signals cause spectral masking. Spectral masking is a very important concept in the design of audio codecs. There are masked spectral regions around the loudest spectral components of every audio signal. These regions may be ignored, or filled with quantisation noise, without audibly altering the signal. For this reason, any model aiming to assess the quality of coded audio must take account of spectral masking. However, calculating the masked threshold for an arbitrary signal is a challenging task due to the non-linear nature of the ear.

Figure 3.13: Variation of excitation on BM with stimulus amplitude

3.2.4.1.3 Non-linearities

The response of the basilar membrane to an incoming audio signal is amplitude dependent. The excitation pattern shown in Figure 3.11 is for a 90 dB SPL stimulus. Figure 3.13 shows how this excitation varies with stimulus amplitude, while Figure 3.14 [Zwicker and Zwicker, 1991] shows how this affects the measurable masked threshold. Though the two graphs are drawn to different scales, the correlation between BM excitation and masked threshold (which has been heavily inferred throughout this discussion) is evident by comparing the two graphs.

Figure 3.14: Variation of masked threshold with stimulus amplitude


Figure 3.14 shows that the shape of the masking curve is a function of stimulus amplitude, with a loud stimulus giving greater upwards spread of masking. Figure 3.13 confirms that this effect can be traced to the excitation upon the BM, which varies in a similar manner. Also, close examination of Figure 3.13 reveals that the position of the resonant peak changes slightly with stimulus amplitude. This latter phenomenon does not have a direct effect upon masked thresholds, but it does affect pitch perception. For higher frequencies, for which the dominant pitch cue is the place of maximum vibration on the BM (see Section 3.2.4.1.1), the perceived pitch of a given frequency increases with stimulus amplitude, as first shown in [Stevens, 1935].

The non-linearity of the BM response has an important effect. The masking due to various simple stimuli was discussed in the previous section. Any arbitrary signal can be approximated as a series of tonal and noise-like components. The masking due to this arbitrary signal may be predicted from the linear superposition of the masking due to each component. However, excitations on the BM, and hence the masking due to them, do not add in a linear manner. This is very significant: many auditory models are introduced by noting that masking does not add in a linear manner, but this fact is subsequently ignored for the sake of computational efficiency.

The cause of the non-linear response of the BM is discussed in Section 3.2.4.3.

3.2.4.2 The Inner Hair cells

We now return to our journey through the human auditory system. The basilar membrane is moving in response to the incoming sound wave. This movement is detected by thousands of tiny hair cells, running along the length of the BM. There are many different types of cells on the BM, but the two most important are the inner and outer hair cells, labelled in Figure 3.1(c).

There are approximately 4000 inner hair cells along the length of the BM. They transduce the movement of the BM into neural impulses, which are carried on to the brain via the auditory nerve. At medium and high amplitudes, the inner hair cells only fire when the BM moves upwards, so the signal is effectively half wave rectified. This does not occur at lower amplitudes, so the process is sometimes referred to as “soft rectification”. Each cell needs a certain time to recover between firings, and the firing of any individual cell is pseudo-random, modulated by the movement of the BM. However, in combination, signals from groups of cells can give an accurate indication as to the motion of the BM.
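The phrase “soft rectification” can be illustrated with a smooth rectifier such as the softplus function below. This is purely illustrative – it is not the Meddis hair-cell simulation cited later in this chapter – and the knee parameter is an arbitrary choice:

```python
import numpy as np

def soft_halfwave(bm_motion, knee=0.05):
    """'Soft' half-wave rectification: approximately linear for very small
    displacements, but passing mainly the upward (positive) movement of
    the BM at medium and high amplitudes."""
    return knee * np.log1p(np.exp(bm_motion / knee))

t = np.arange(0, 0.005, 1.0 / 48000.0)
drive = soft_halfwave(np.sin(2 * np.pi * 1000 * t))  # firing-rate drive
```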


At lower frequencies the firing of individual inner hair cells may phase lock to the incoming signal, thus the phase of the signal is preserved and transmitted along the auditory nerve. However, above approximately 1.5 kHz, the hair cells cannot lock onto individual cycles, and only the amplitude envelope is transmitted. The hair cells may phase lock onto sub-multiples of higher frequency stimuli (i.e. every 2nd or 3rd cycle), but the resulting phase information is weaker.

In the lower phase-locked frequency region, the repetition rate of the inner hair cells gives the auditory system a strong cue as to the frequency of the stimulus. This accounts for the ability to accurately discriminate pitch at lower frequencies, which would not be possible if the ear relied solely on the resonant place upon the BM.

The firing of the inner hair cells also transmits information about the amplitude of BM motion. The greater the amplitude of motion, the greater the probability of a given hair cell firing. However, for a single cell, this process rapidly saturates. To overcome this, three sets of cells of differing sensitivity are arranged across the BM. This extends the dynamic range to around 10³. The shortfall between this, and the measured dynamic range of 10⁶, is discussed in Section 3.2.4.3, after discussing the other features of the inner hair cells.

3.2.4.2.1 Temporal masking

The response of a group of inner hair cells to short and long tone bursts is shown in Figure 3.15 (calculated from [Meddis, 1986], [Meddis, 1988], [Meddis et al, 1990]). Three important features are apparent. Firstly, the hair cells are most sensitive to the onset of sounds. Secondly, the hair cells take a finite time to recover their full sensitivity after the end of a sound.

Thirdly, the magnitude of this recovery depends on the duration of the sound. This gives rise to a process known as temporal masking³.

Figure 3.15: Overall response of hair cells to two 1 kHz tone bursts (5 ms then 200 ms)

If a sound (of a similar frequency) occurs during the period of recovery, the hair cells may be unable to register it, hence it may be inaudible.
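A crude sketch of this adaptation behaviour is an automatic-gain-control loop: divide the rectified drive by a slowly updated running level. The time constant and floor value below are arbitrary illustrative choices, not fitted to the hair-cell data:

```python
import numpy as np

def adaptation(drive, fs=48000, tau=0.05, floor=1e-3):
    """One-pole adaptation loop.  Onsets are emphasised, and after a sound
    ends the running level decays only gradually, so a quiet sound arriving
    during the recovery period is suppressed - a crude analogue of the
    post-masking behaviour implied by Figure 3.15."""
    alpha = np.exp(-1.0 / (tau * fs))
    state = floor
    out = np.empty_like(drive)
    for i, x in enumerate(drive):
        out[i] = x / state                   # instantaneous gain
        state = alpha * state + (1.0 - alpha) * max(x, floor)
    return out

fs = 48000
t = np.arange(int(0.3 * fs)) / fs
burst = (t < 0.2) * np.abs(np.sin(2 * np.pi * 1000 * t))  # rectified burst
response = adaptation(burst, fs)  # large at onset, reduced in steady state
```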

This has been demonstrated experimentally [Zwicker, 1984]. A burst of white noise is followed by a short tone. The graph in Figure 3.16 shows the masked threshold of the tone at various times after the burst of noise. Note that the time scale on the graph is logarithmic. The two plots are for 200 ms and 5 ms bursts of noise. Tones below the level indicated are inaudible.

Figure 3.16: Temporal masking. Masked threshold due to a 200 ms (solid line) and 5 ms (dashed line) burst of white noise, for a 5 ms burst of a 2 kHz tone. [Zwicker, 1984]

This phenomenon is called post-masking, since the masking effect occurs after the masker. Pre-masking is also a measurable phenomenon, whereby a sound preceding a masker may be masked. Pre-masking is a small effect; untrained subjects sometimes exhibit pre-masking thresholds comparable to those found for post-masking. However, trained subjects typically exhibit pre-masking durations which can be accounted for by the ringing of the auditory filters (i.e. the motion of the BM, discussed in Section 3.2.4.1), hence the hair cells are thought to play no part in pre-masking. Despite this, the exact cause of pre-masking is still under debate.

Temporal masking is an important concept in the design of audio codecs. The masked regions following loud sounds are rarely used for the concealment of quantisation noise. However, the filterbanks found in audio codecs spread the audio energy in the time domain, and if this spreading exceeds the limits of temporal masking, it will cause audible damage to the signal. For this reason, any model aiming to assess the quality of coded audio must take account of temporal masking.

3 At the time of experimentation, latest psychoacoustic knowledge suggested that temporal masking was due to the adaptation present within the auditory system, substantially the action of the inner and outer hair cells. Recent work [Oxenham, 2001] suggests that this may not be the case, and that a temporal integration process within the brain may be responsible. Without wishing to spoil the plot of the thesis, it is interesting to note that a temporal integration stage was necessary in the final model in place of higher processing.


3.2.4.2.2 Absolute Threshold

The inner hair cells fire at random, even in the absence of any incoming sound. In silence, the blood flowing around the regions of the inner ear becomes audible. These two factors combine to set an absolute minimum threshold of hearing. The shape of the threshold of audibility (Figure 3.17) is set by the resonance of the ear canal, the fall off in mechanical sensitivity of the BM at the frequency extremes, and the death of hair cells as humans age. The absolute minimum value, due to random hair cell activity, is 10⁶ times less than that of the loudest sound humans can hear.

Figure 3.17: Minimum audible field for a range of frequencies.

3.2.4.3 The outer hair cells

There are approximately 12000 outer hair cells distributed along the length of the BM. They react to feedback from the brainstem, altering their length to change the resonant properties of the BM. This causes the frequency response of the BM to be amplitude dependent, and extends the dynamic range of the human auditory system.

Together, the inner and outer hair cells interact in an active feedback system, referred to as the cochlea amplifier, which increases the movement of the BM for quiet sounds, and suppresses it for loud ones. This mechanism is widely hypothesised, but unproven⁴. It is difficult to measure

4 The stereocilia are directly above the outer hair cells, and are triggered by shearing forces between the outer hair cells and the tectorial membrane. One hypothesis [Yates, 1995] suggests that information from the stereocilia (rather than the inner hair cells) determines the length of the outer hair cells. Both hypotheses may be partially true, but either hypothesis yields a similar result; the outer hair cells change their length (and hence the resonant properties of the BM) in response to the motion of the BM. A third hypothesis suggests that the stereocilia, rather than the outer hair cells, may act to adjust the resonant properties of the BM (see [Yates,

directly, as it only operates in a live, undamaged cochlea. It is known that the inner hair cells are connected to the auditory nerve mainly by afferent fibres, which transmit neural signals to the brain. Conversely, the outer hair cells are predominantly connected to the auditory nerve by efferent nerve fibres, which receive neural signals from the brain. It is also known that the outer hair cells change their length by up to 10% due to feedback from the brain. Finally, the sharp tuning and amplitude dependence of resonances on the BM, and the large dynamic range of the human auditory system cannot be explained without some external feedback loop which returns energy into the system. Hence, though it is as yet unproven, the existence of the cochlea amplifier will be accepted as a probable mechanism of auditory perception throughout this project.

The cochlea amplifier acts as a sophisticated automatic gain control mechanism within the hu- man ear, and causes human listeners to have a logarithmic perception of loudness. This should come as no surprise to audio engineers, who usually measure levels and responses in decibels

(dB), which is a logarithmic scale ($20\log_{10}$ amplitude). It is the AGC function of human hearing that causes humans to hear this way. It also extends the dynamic range of human hearing (the ratio of smallest to largest detectable pressure difference) from the 10³ that would be achieved with the inner hair cells alone, to 10⁶.

The net result so far is to take an audio signal, which has a relatively wide-bandwidth, and large dynamic range, and to encode it for transmission along nerves which each offer a much narrower bandwidth, and limited dynamic range.

Any information lost due to the transduction process within the cochlea is not available to the brain; the cochlea is effectively a lossy coder. The vast majority of what humans cannot hear is attributable to this transduction process.

1995] for a review). As this hypothesis is the least likely of the three, it is not considered further within this research.

3.2.5 Neural signal processing

The function of each individual stage of the subsequent neural processing is less well understood (see Figure 3.1(d)). Some features can be predicted by comparison between the auditory nerve signal, and the final perception of sounds common to all listeners. Other processes can be inferred from the neural connections present between auditory processing centres. The action of individual cells within each processing centre can be recorded.

3.2.5.1 Cochlea Nucleus

The Cochlea Nucleus contains many cells, and has neural connections to the auditory nerve, superior olivary complex and the inferior colliculus. The cochlea nucleus is thought to sharpen and enhance features of the (now highly compressed) signal from the auditory nerve, and distribute it to the superior olivary complex and the inferior colliculus.

3.2.5.2 The Superior Olivary Complex

The Superior Olivary complex is believed to be responsible for the lateralisation of sound sources. As with all the processing discussed thus far, two SOCs are present within the auditory system, one corresponding to each ear. However, nerve fibres from both ears feed into each SOC. Within the SOC there is a region called the Medial Superior Olive, where nerve fibres from similar frequency regions in each ear meet. It is believed that the MSO calculates interaural time differences via a series of delays and coincidence detectors. This concept was first proposed in [Jeffress, 1948], and measurements carried out by [Patterson, WEB-3] appear to confirm the presence of this processing.

Figure 3.18: Neural connections of the binaural pathway

It is sometimes stated that the MSO is most effective at lower frequencies (below about 1.5 kHz), where interaural time difference is the dominant lateralisation cue. However, this assumption arises from the results of lateralisation measurements carried out using pure tones. If a signal contains no spectral components below 1.5 kHz, it can still be lateralised using the ITD cue alone if the envelope of the signal is modulated at a rate below 1.5 kHz. This measured response suggests that nerve fibres corresponding to frequencies higher than 1.5 kHz must be processed within the MSO.

Another region within the SOC, called the Lateral Superior Olive, calculates the interaural intensity difference. This is the dominant lateralisation cue for pure tones above 1.5 kHz, and is a useful cue for all sounds containing high frequency components. Within the right-hand SOC, inputs from the right-hand ear act to excite the LSO, and inputs from the left-hand ear act to inhibit the LSO. The opposite is true within the left-hand SOC. Thus, the ILD is reflected by the relative activity within the two LSO regions.

The density of detectors is not equal for all values of ITD and ILD. Rather, both sets of detectors are most plentiful for the region around zero interaural difference, and the number of detectors falls as the interaural difference increases. The result is that the human auditory system is less sensitive to large interaural differences, and sounds at source locations which yield these differences are localised with decreased accuracy. The best localisation performance occurs for sources directly in front of the listener, which yield approximately zero interaural time and level differences.

Whilst the physiology is unknown, experimental data proves that the outputs of the SOC are post-processed in a special manner. The binaural system is known to be “sluggish” – that is, it requires a finite time in order to detect any change in the binaural aspect of an input stimulus. This is true in both binaural localisation and detection tasks. For example, the minimum audible angle separating two sequential sound sources is approximately 1° where the two sounds are temporally separated by several hundred milliseconds [Mills, 1958]. However, [Perrott and Pacheco, 1989] show that by reducing the time between the two sounds to 40 ms, the minimum detectable angle is increased to 3°. Similar effects have been measured with respect to the Binaural Masking Level difference. For example, modulating the interaural correlation of the masking noise at a frequency higher than 2 Hz almost destroys the BMLD [Grantham and Wightman, 1979].

This binaural sluggishness is not a defect of the auditory system. On the contrary, forming immediate localisation determinations based upon the instantaneous output of the SOC would give rise to error. Whilst textbook determination of the interaural time delay of white noise yields a single sharp peak, correlation of real world signals gives rise to many brief false peaks. These may be due to the periodic nature of the input signal, the happenstance coincidence of

two unrelated auditory events, or the interference from early reflections. (It is worth noting in passing that the precedence effect is chiefly accounted for by pre-processing, especially the response of the inner hair cells [Theis, 1999], rather than the subsequent integration.) Integration of the correlator output removes most of this anomalous information, whilst strengthening the lateralisation cue.
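The delay-and-coincidence idea, combined with the integration just described, may be sketched as follows. The lag range, time constant, and test delay are hypothetical values chosen for illustration:

```python
import numpy as np

def running_itd(left, right, fs=48000, max_itd=0.8e-3, tau=0.1):
    """Delay-and-coincidence sketch in the spirit of [Jeffress, 1948]:
    multiply delayed versions of one ear signal with the other, and
    leaky-integrate each lag channel so that brief false peaks are
    smoothed away ('binaural sluggishness')."""
    max_lag = int(max_itd * fs)
    lags = np.arange(-max_lag, max_lag + 1)
    alpha = np.exp(-1.0 / (tau * fs))
    acc = np.zeros(len(lags))
    for n in range(max_lag, len(left) - max_lag):
        acc = alpha * acc + (1.0 - alpha) * left[n - lags] * right[n]
    return lags[np.argmax(acc)] / fs  # dominant ITD in seconds

fs = 48000
noise = np.random.randn(fs // 4)
delay = 16                                  # ~330 us: source off to one side
left, right = noise[delay:], noise[:-delay]
print(f"estimated ITD = {running_itd(left, right, fs) * 1e6:+.0f} us")
```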

3.2.5.3 Subsequent neural processing

Little is known of the following stages of neural processing, other than their existence, and the fact that they give rise to our human "understanding" of the sounds around us as speech, music, and noise.

The neural processing which cannot be related to known auditory performance will not be discussed here. For more information about the auditory neural pathway, and the structure of the processing centres, the reader is directed to [Ehret and Romand, 1997] and [Anon, web].

3.3 Conclusion

In this chapter, the path of a sound wave has been followed from outside a listener's head, past the pinna and ear canal, into the cochlea, onto the basilar membrane, through the inner hair cells, along the auditory nerve, and into the higher processing centres.

The function of each stage of the human auditory system has been qualitatively discussed. Where available, experimentally measured human auditory performance data has been referenced. The frequency selectivity of the basilar membrane has been identified as the primary source of spectral masking. The adaptation of the hair cells has been suggested as the source of temporal masking. A neural contra-lateral coincidence detector has been suggested as the primary source of localisation cues.

3.4 Acknowledgements

This chapter draws on the following reviews of the human auditory system: [Yates, 1995], [Patterson, WEB], [Schubert, 1978], [Neely, WEB], [Allen and Neely, 1992], [Rosen and Howell, 1991], [Dallos, 1978], [Fantini, 1996].

4 Auditory Perceptual Models

4.1 Overview

An auditory model simulates human hearing. Such a model can be used to calculate the perceived difference between two audio signals. This chapter commences with a discussion of the various approaches to auditory modelling. As an example, the classic Johnston model is examined in detail, and the strengths and weaknesses of this model are assessed. Several more recent models are then compared, and important areas for development in the present work are identified and discussed.

4.2 Auditory modelling

In order to explain approaches to auditory modelling, it is necessary to start with a generalisation, the limits of which will soon become apparent. The generalisation is this: there are two distinct approaches to modelling human hearing, which have been pursued by researchers in different areas.

Psychoacousticians use the physiology of the human auditory system as a reference, and simulate the processing found therein. The resulting auditory models are often complex and computationally burdensome. The accurate simulation of auditory processing gives rise to measurable effects, such as spectral masking. For this reason, models of this type will be referred to as models of cause. The success of such models is judged by their ability to mimic human performance in psychoacoustic tests.


Audio engineers use the measured performance of human hearing as a reference, and directly model the known limitations. The resulting models are comparatively simplistic, and the effects of the human auditory system (e.g. spectral masking) are explicitly calculated within the model. For this reason, models of this type will be referred to as models of effect. These models are often computationally efficient. Their success is judged by their ability to determine the perceived quality of an audio signal.

In reality, most models lie somewhere between these two exaggerated extremes. Psychoacousticians often produce "signal processing" versions of complex models that yield similar results via simplified methods. For example, rather than modelling the response of each individual hair cell, the firing probability for large groups of cells may be computed. In this way, the "cause based" model becomes less like the system it aims to simulate, but the overall effect of the model is barely changed.

In contrast, audio engineers often refine models that simulate the "effects" of human hearing such that the model comes to match the processes present in the auditory system more closely. For example, all auditory models must transform data from the time domain into the frequency domain. An FFT may be used to transform blocks of data, but such an approach can often hide audible problems in the signal that the model aims to detect. One common solution is to use a filterbank, which gives a time to frequency transformation without the block processing problems inherent with an FFT. The use of a filterbank mimics the processing which occurs within the auditory system, and moves the operation of the effect model much closer to that of a cause model.

Thus, when attempting to model human hearing, there are two different, but parallel approaches: model the cause or process that gives rise to some effect, or model the effect directly. In practice, there are many possibilities lying between these two extremes for each process (and associated effect) within the auditory system.

One useful approach that lies on a tangent somewhere between the two extremes is to map the signal into the perceptual domain. This approach is discussed in [Hollier, et al, 1993+1995]. As shown in Chapter 3, the human auditory system operates in a logarithmic domain in both frequency and amplitude perception. By transforming the signal into a domain that corresponds to the human auditory system's internal processing, a clearer idea of the human perception of the signal can be gained than by examining the linear time-domain signal. This is true even if no attempt is made to model the masking due to the auditory system, or the processes that give rise to it.

Figure 4.1: Methods of determining perceived difference. (a) prediction: model of effect calculates masking threshold; (b) comparison: model of cause calculates internal representation.
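As an illustration of this mapping, the following MATLAB sketch (assumed names throughout; this is not code from the accompanying CD-ROM) transforms a linear power spectrum onto a Bark frequency scale and a dB amplitude scale, using the Zwicker and Terhardt Bark approximation that appears later in this chapter.

% A sketch of mapping a spectrum into the perceptual domain.
fs  = 44100; window = 1024;
x   = sin(2*pi*1000*(0:window-1)'/fs);          % any test signal
X   = fft(hanning(window).*x);                  % windowed spectrum
P   = abs(X(2:window/2+1)).^2;                  % linear power spectrum
f   = (1:window/2)'*fs/window/1000;             % frequency of each line, kHz
z   = 13*atan(0.76*f) + 3.5*atan((f/7.5).^2);   % near logarithmic Bark scale
PdB = 10*log10(P + eps);                        % logarithmic amplitude scale
plot(z, PdB); xlabel('Bark'); ylabel('dB');     % perceptual domain view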

The aim of the present research is audio quality assessment. The task is to calculate the perceived difference between two signals: the original signal, and a coded version of the same signal. The arithmetic difference between the two signals is coding noise, which may or may not be audible.

There are two ways to use an auditory model to calculate the perceived difference. The first method is to calculate the masking threshold due to the original signal. Any noise below the threshold is inaudible, whilst any noise above it is audible (Figure 4.1 a). This method is used by some models of effect.

The second method is to pass both signals through the auditory model in parallel (Figure 4.1 b). The auditory model transforms each signal into a representation of what is heard by a human listener, i.e. an internal representation. A comparison of the two internal representations yields a prediction of the audible difference between the two signals. The comparison may be made via a simple subtraction, or by complex analysis. If the result is zero, then the model predicts that there is no audible difference between the two signals. A numerical result other than zero indicates how large the perceived audible difference is likely to be.


Some models of cause use a detector (optimum or sub-optimum) to compare the original signal to the degraded signal after both have passed through the auditory model in parallel. Depending on the accuracy of the auditory model, and the nature of the difference between the two signals, the detector may need to be crippled in some manner to match human performance. The process is not usually deterministic, but stochastic. This matches human perception, and is often achieved by adding noise to the detection process itself.

The perceptual domain analysis gives an indication of the perceived difference between two signals, but since it does not include calculations of masking, it cannot determine whether any difference is audible or inaudible. Adding masking calculations to the perceptual domain analysis may turn it into a model of effect, allowing the masking threshold to be computed directly.

In conclusion, an auditory model may be based upon the processes found within the auditory system, or the effects to which these processes give rise. Most models use a combination of these two approaches. To predict the audible effect of a coding scheme, the coding noise may be compared with a prediction of the masked threshold due to the original signal. More commonly, both the original and coded signals are transformed into internal representations, and these are compared.

It is hoped that this overview has set the scene for the dissection of auditory models that follows. This will commence by examining a classic model of effect, the Johnston auditory model.

4.3 The Johnston model

In 1988, James Johnston published details of a model for calculating the "perceptual entropy" of audio signals [Johnston, 1988a]. This model calculates the masked threshold due to an audio signal in order to predict which components of the signal are inaudible. In this way, the model can be used to predict how much data is needed to transparently code the signal. The model is included in an audio coder [Johnston, 1988b], where the prediction of inaudible components is used to feed a bit allocation algorithm which reduces the data rate to 128 kbps for a mono signal.

The Johnston model is used in an audio coder, but the present research goal is a model to assess the quality of coded audio. Using the Johnston model for this purpose may be unwise –

any audible problems introduced by the model within the coder are unlikely to be detected by using the same model as an assessor. However, there are three reasons why the Johnston model is an excellent starting point for the current research. Firstly, examining the operation of a coding model will give valuable insight into areas where coded audio may be non-ideal, and hence where the quality assessment model must be particularly vigilant. Secondly, the "basic" quality assessment model in the current ITU PEAQ standard [Thiede et al, 2000] is similar to the Johnston model, indicating that for some applications, this model does an acceptable job of predicting perceived audio quality. Thirdly, most current auditory models are similar to the Johnston model, and some of the principles involved are universally accepted.

The Johnston model calculates the spectral masking due to an audio signal, but temporal masking is not addressed. Though the human auditory system is continuous in both time and frequency, the spectral masking estimate calculated by the Johnston model is discrete in time and frequency. The signal is split into short (64 ms) frames, and the spectral masking for each frame is computed as if the signal were steady state. The masking is computed for 25 frequency bins, spaced equally on the critical band scale. Thus one masked threshold is calculated for each frequency bin every 64 ms. This threshold is calculated by taking the FFT of a 64 ms frame, summing the energy in each frequency bin, spreading the energy to simulate spectral masking, adjusting for the nature of the signal, and normalising the result.

The following detailed walk through the Johnston auditory model is drawn from [Johnston, 1988a], [Johnston, 1988b], and [Rimell, 1996]. A MATLAB implementation of the model is included on the accompanying CD-ROM. This implementation is taken as a baseline for the current research; differences from the published sources are noted in italics. The plots show the progress of a synthetic signal (consisting of a 500 Hz tone and a 5 kHz tone) through the model.

4.3.1 Algorithm

4.3.1.1 Window and FFT

The audio signal is split into frames. In the original model, a frame length of 64 ms is employed (2048 samples at a sampling frequency of 32 kHz). In the present implementation, a default length of 23 ms is used since this is a convenient and perceptually appropriate time-span, as discussed in [Paillard et al, 1992]. This is equivalent to a window length of 1024 samples at a sampling frequency ($f_s$) of 44.1 kHz, though the MATLAB variable window is easily changed to accommodate any window length.

Figure 4.2: $f(n)$ Signal Spectrum

The current frame is windowed with a Hanning (raised cosine) window and an FFT (Fast Fourier Transform) is performed. Each line ($n$) in the complex FFT refers to a spectral component of frequency $f$ (in kHz), given by

$$f(n) = \frac{n \cdot f_s}{1000 \times window} \qquad (4\text{-}1)$$

which is valid for the first ($window/2$) complex lines.
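A minimal MATLAB sketch of this stage follows; the variable names (x, window, fs) are assumptions, in the spirit of the implementation described above, and are not necessarily those used on the CD-ROM.

fs     = 44100;
window = 1024;                             % 23 ms frame, as discussed above
t      = (0:window-1)'/fs;
x      = sin(2*pi*500*t) + sin(2*pi*5000*t);   % the synthetic test signal
X      = fft(hanning(window) .* x);        % Hanning window, then FFT
n      = (1:window/2)';                    % the valid complex lines
f      = n*fs/(1000*window);               % frequency of line n in kHz, (4-1)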

4.3.1.2 Critical Band Analysis

The real and imaginary components of the spectrum, $\mathrm{Re}(n)$ and $\mathrm{Im}(n)$, from the FFT are converted to the power spectrum, $P(n)$, thus:

$$P(n) = \mathrm{Re}^2(n) + \mathrm{Im}^2(n) \qquad (4\text{-}2)$$

This power spectrum is segregated in linear frequency, but the auditory system processes frequency on a near logarithmic scale, called the critical band scale, as discussed in the previous chapter.


The relationship between linear frequency, $f$ in kHz, and the critical band, or Bark, frequency, $z_c$ in Bark, is given by

$$z_c = 1 + 13\arctan(0.76 f) + 3.5\arctan\!\left[\left(\frac{f}{7.5}\right)^2\right], \qquad (4\text{-}3)$$

adapted from [Zwicker and Terhardt, 1980] to number critical bands from 1 to 25. If the lowest frequency component falling in critical band $i$, where $i = \mathrm{int}(z_c)$, is given by $bl_i$, and the highest frequency component falling in critical band $i$ is given by $bh_i$, then the summation of the energy in band $i$ is given by:

$$B_i = \sum_{n=bl_i}^{bh_i} P(n) \qquad (4\text{-}4)$$

Figure 4.3: $B_i$ Critical Band Energy

The energy in each critical band is summed in this manner. The D.C. component of the spectrum is not included in this summation.
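Continuing the sketch above, the critical band analysis of equations (4-2) to (4-4) might be implemented as follows; the rounding of Bark values into 25 integer band numbers is one reading of $i = \mathrm{int}(z_c)$.

P  = real(X(2:window/2+1)).^2 + imag(X(2:window/2+1)).^2; % (4-2), D.C. excluded
zc = 1 + 13*atan(0.76*f) + 3.5*atan((f/7.5).^2);          % (4-3), f in kHz
i  = min(floor(zc), 25);                                  % i = int(zc), 1..25
B  = zeros(25,1);
for band = 1:25
    B(band) = sum(P(i == band));                          % (4-4)
end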

4.3.1.3 Spreading function

The following spreading function (taken from [Schroeder et al, 1979]) is used to estimate the effects of masking across critical bands.

Figure 4.4: $S_{i,j}$ Spreading Function

$$S_{i,j}(\mathrm{dB}) = 15.81 + 7.5\,(y + 0.474) - 17.5\sqrt{1 + (y + 0.474)^2} \;\;\mathrm{dB} \qquad (4\text{-}5)$$

$$S_{i,j} = 10^{S_{i,j}(\mathrm{dB})/10} \qquad (4\text{-}6)$$

where:

$y = i - j$ (not the modulus, as stated in some other papers)
$i$ = Bark frequency of masked signal
$j$ = Bark frequency of masker signal

The spread critical band spectrum is calculated by convolving the spreading function with the critical band spectrum. This can be achieved by matrix multiplication, thus:

$$\begin{bmatrix} C_1 \\ C_2 \\ C_3 \\ \vdots \\ C_{25} \end{bmatrix} = \begin{bmatrix} S_{1,1} & S_{1,2} & S_{1,3} & \cdots & S_{1,25} \\ S_{2,1} & S_{2,2} & S_{2,3} & \cdots & S_{2,25} \\ S_{3,1} & S_{3,2} & S_{3,3} & \cdots & S_{3,25} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ S_{25,1} & S_{25,2} & S_{25,3} & \cdots & S_{25,25} \end{bmatrix} \times \begin{bmatrix} B_1 \\ B_2 \\ B_3 \\ \vdots \\ B_{25} \end{bmatrix} \qquad (4\text{-}7)$$

Figure 4.5: $C_i$ Spread CB Spectrum

4.3.1.4 Coefficient of tonality

The masking threshold for noise masked by a tone is taken to be $14.5 + i$ dB below $C_i$, but the masking threshold for a tone masked by noise is taken to be 5.5 dB below $C_i$ [Johnston, 1988b]. Johnston uses the Spectral Flatness Measure (SFM), calculated from the geometric and arithmetic means of the power spectrum, to determine how tone-like or noise-like the signal is. The SFM is given by

$$SFM_{dB} = 10\log_{10}\!\left(\frac{GM}{AM}\right) \qquad (4\text{-}8)$$

However, this calculation cannot be performed correctly using 32-bit floating point arithmetic: the geometric mean requires a product over many hundreds of power spectrum values, which underflows (or overflows) the available range. In the current implementation, the SFM is calculated using logarithms, thus:

$$SFM_{dB} = 10\left[\log_{10}(GM) - \log_{10}(AM)\right] \qquad (4\text{-}9)$$

where

$$\log_{10}(GM) = \frac{1}{N}\sum_{n=1}^{N}\log_{10}\bigl(P(n)\bigr), \qquad (4\text{-}10)$$

$$\log_{10}(AM) = \log_{10}\!\left(\frac{1}{N}\sum_{n=1}^{N} P(n)\right) \qquad (4\text{-}11)$$

and $N = window/2$. An SFM of zero dB would indicate that the signal is entirely noise-like, while an $SFM_{dB} \le SFM_{dB\,max}$, where $SFM_{dB\,max} = -60$ dB, would indicate that the signal is entirely tone-like. Most tone-like signals, such as organ, sine waves, or flute, have an SFM that is close to or beyond this limit. A coefficient of tonality is calculated as follows:

$$\alpha = \min\!\left(\frac{SFM_{dB}}{SFM_{dB\,max}},\; 1\right) \qquad (4\text{-}12)$$

This coefficient is used to geometrically weight the two thresholds, yielding $O_i$, the threshold offset, thus:

$$O_i = \alpha(14.5 + i) + (1 - \alpha)\,5.5 \qquad (4\text{-}13)$$

Though this final equation is linear, equations (4-9) to (4-13) yield the required geometric weighting.
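Continuing the sketch, the log-domain tonality calculation of equations (4-9) to (4-13) might read:

N       = window/2;
log10GM = mean(log10(P + eps));            % (4-10); eps guards empty lines
log10AM = log10(mean(P));                  % (4-11)
SFMdB   = 10*(log10GM - log10AM);          % (4-9), always <= 0 dB
alpha   = min(SFMdB/-60, 1);               % (4-12), SFMdBmax = -60 dB
O       = alpha*(14.5 + (1:25)') + (1-alpha)*5.5;   % (4-13), offset per band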

4.3.1.5 Spread threshold estimate

The offset $O_i$ is subtracted from the spread critical band spectrum $C_i$ to give the spread threshold estimate $T_i$, thus:

$$T_i = 10^{\log_{10}(C_i) - O_i/10} \qquad (4\text{-}14)$$

Figure 4.6: $T_i$ Spread Threshold Estimate


4.3.1.6 Re-normalisation of the threshold estimate

This process is suggested in [Johnston, 1988b], though the details in [Johnston, 1988a] seem incomplete. The overall effect is comparatively small (2.04 dB by this calculation). The implementation here is as follows.

The spreading function described in Section 4.3.1.3 increases the overall energy, whereas the psychophysical process that we are attempting to model spreads the energy by dispersing it. For example, examine the behaviour with a hypothetical stimulus with unity energy in each critical band. The actual spreading function of the ear will result in no overall change to the level of energy in any critical band¹. However, the spreading function presented here will cause the energy in each band to increase, due to the additive contributions of energy spread from adjacent critical bands.

Figure 4.7: $T_i'$ Normalised threshold estimate

The solution presented here is to normalise the threshold estimate at this stage. A hypothetical stimulus, with unity energy in each critical band, is used as the $B_i$ in equation (4-7), to give the spread spectrum error, $CE_i$, thus:

$$\begin{bmatrix} CE_1 \\ CE_2 \\ CE_3 \\ \vdots \\ CE_{25} \end{bmatrix} = \begin{bmatrix} S_{1,1} & S_{1,2} & S_{1,3} & \cdots & S_{1,25} \\ S_{2,1} & S_{2,2} & S_{2,3} & \cdots & S_{2,25} \\ S_{3,1} & S_{3,2} & S_{3,3} & \cdots & S_{3,25} \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ S_{25,1} & S_{25,2} & S_{25,3} & \cdots & S_{25,25} \end{bmatrix} \times \begin{bmatrix} 1 \\ 1 \\ 1 \\ \vdots \\ 1 \end{bmatrix} \qquad (4\text{-}15)$$

¹ In reality the lowest and highest bands will lose energy by this process, but all other bands will lose and gain equal amounts of energy by dispersion, hence the total level of energy in each band will remain unchanged.


The normalised threshold estimate $T_i'$ is calculated by converting $CE_i$ into dB, and subtracting it from the threshold estimate (in dB), thus:

$$T_i' = T_i - 10\log_{10}(CE_i) \qquad (4\text{-}16)$$
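Continuing the sketch, equations (4-14) to (4-16) reduce to three lines; the subtraction in equation (4-16) is carried out in the dB domain, as described above.

T   = 10.^(log10(C) - O/10);               % (4-14), spread threshold estimate
CE  = S * ones(25,1);                      % (4-15), spread spectrum error
TdB = 10*log10(T) - 10*log10(CE);          % (4-16), normalised estimate in dB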

4.3.1.7 Conversion to dB SPL

In the original model, Johnston set the absolute level such that a signal of 4 kHz, with peak magnitude of ±1 least significant bit in a 16-bit integer, is at the absolute threshold of hearing. A more sophisticated approach is taken in the present implementation, which takes into account the replay level of the audio signal.

To include the absolute threshold of hearing (the Minimum Audible Field, or MAF) in the masking threshold estimate, it is necessary to relate the digital audio signal to a real listening level. It is suggested in [Stuart, 1990] that a 105 dB sound pressure level (SPL) at 1 m from the speaker is a typical maximum level for listening in the home. The author pities Bob Stuart's neighbours! A quieter 90 dB SPL reaching the listener is chosen as the default value here.

Figure 4.8: $T_i'$ Normalised Threshold Estimate in dB SPL

The threshold estimate is corrected relative to this level as follows. A full scale 1 kHz sine wave undergoes the windowing, FFT, and conversion to power spectrum as described in Sections 4.3.1.1 and 4.3.1.2. This yields the calculated power of the sine wave, $P_{ref(dB)}$. The difference between this calculated power and the specified loudness level of 90 dB is added to the normalised threshold estimate (in dB), thus:

$$T_{i(dB\,SPL)}' = T_i' + (90\,\mathrm{dB} - P_{ref(dB)}) \qquad (4\text{-}17)$$


Figure 4.9: $M_i$ Minimum Audible Field in dB SPL
Figure 4.10: $T_i''$ Final Threshold Estimate in dB SPL, including MAF

4.3.1.8 Inclusion of Minimum Audible Field threshold information

The minimum audible field information is taken from [Robinson and Dadson, 1956]. The conversion to dB SPL is carried out using the reference of the MAF threshold at 2 kHz being equivalent to 0 dB SPL [ISO 389-7, 1996]. The minimum threshold in each critical band, $M_i$, is taken to be the lowest value of the MAF curve falling within that band.

Johnston uses the median MAF value in each band. His approach may predict a sound as being masked when in reality it is audible, whereas the approach chosen here may predict that a sound is audible when in reality it is masked. This is preferable for detecting the noise added by an audio coder. However, these differences occur only at very low (and very high) frequencies, where the slope of the MAF curve gives rise to a large discrepancy between the median and the minimum value.

Hence, the final threshold estimate is given by

$$T_i'' = \max\bigl(T_{i(dB\,SPL)}',\; M_i\bigr). \qquad (4\text{-}18)$$
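Continuing the sketch, the calibration and MAF floor of equations (4-17) and (4-18) might be implemented as below. The MAF vector here is a zero placeholder only; the real implementation interpolates the [Robinson and Dadson, 1956] data onto the critical bands.

ref    = sin(2*pi*1000*(0:window-1)'/fs);          % full scale 1 kHz sine
PrefdB = 10*log10(max(abs(fft(hanning(window).*ref)).^2)); % calculated power
TdBspl = TdB + (90 - PrefdB);                      % (4-17), 90 dB SPL replay
MAF    = zeros(25,1);             % placeholder only: real MAF data goes here
TTdB   = max(TdBspl, MAF);                         % (4-18), final estimate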


4.3.1.9 Threshold interpretation

The Johnston model was designed to identify the least important spectral regions of an audio signal in order to control the bit allocation algorithm of an audio codec. The present task is to judge perceived audio quality. Both tasks require the identification of inaudible signal components, but whereas the former task requires prediction of such from a single signal, the latter requires a comparison between original and processed signals to determine the presence of an audible difference. In this section, the use of the Johnston model threshold estimate in these tasks is discussed.

There are at least three ways in which to interpret the threshold estimate, as follows.

1) All spectral components below the masked threshold are inaudible, and may be removed at will, without audibly changing the signal.

2) Any spectral components which, on their own, would fall below the masked threshold, may be added to the signal without causing an audible change.

3) Two signals will sound identical if the spectral components lying above the masked threshold are identical.

Interpretations 1 and 2 are the ones for which this model is designed, and are used by audio coders, including [Johnston, 1988b]. In reality, the coder does not remove all spectral components below the threshold estimate, nor does it fill the entire spectral space below the threshold estimate with noise. However, the coder does treat all spectral components below the threshold estimate as being less important, as discussed in Chapter 2. In extreme circumstances, it may remove some of them entirely, but it is unlikely to strip all masked components neatly away – that is not its function. This point is made here to dispel the common misconception that it is; such a view is a gross simplification. The usual mode of operation is to represent the spectral components below the threshold estimate with decreased accuracy by using coarser quantisation steps. This has the effect of adding noise within the spectral regions below the threshold estimate. However, the aim is not to add as much noise as possible, but rather to incur as much noise as is necessary to meet the bitrate requirements, whilst constraining the noise to be below the threshold estimate.


In an audio coder, interpretations 1 and 2 are not pushed to their limits. If the bitrate is sufficiently high, then coding noise will be significantly below the threshold estimate. If the bitrate is too low, then noise may be added above the threshold estimate. However, since the coder will breach the threshold estimate to meet bitrate requirements, the absolute value of the threshold estimate is less important than its shape (in the time and frequency domains), which should cause the coding noise to be minimally audible. Conversely, in the assessment of perceived audio quality, the absolute value of the threshold is vitally important, since the aim is to judge whether coding noise is audible or inaudible.

When assessing the perceived quality of coded audio, we assume access to the original signal, the coding noise, and the resulting coded signal², thus:

original + noise = coded (4-19)

Interpretations 1 and 2 only require knowledge of the original signal and the noise, be it additive or subtractive. From this knowledge, the audible difference between the original and coded signals may be inferred.

The third interpretation is different. It says, "two signals will sound identical if the spectral components lying above the masked threshold are identical." This requires knowledge of the original signal and the altered signal, and implies a comparison of the two. Though this interpretation appears similar to the first two, it is a fundamentally different concept. Consider, for example, the case where the original signal and the coding noise are highly correlated. The noise may be below the masked threshold (hence judged inaudible by 2), but when summed with the original signal, some components may add constructively, thus yielding new or altered spectral components above the masked threshold (hence judged audible by 3). Interpretations 1 and 2 rely on rules to predict what will be audible. Interpretation 3 relies on a comparison between two signals to measure the difference.

In conclusion, the masked threshold estimate can be used to determine whether there is an audible difference between two signals: either by examining the original signal and predicting the allowable coding noise, or by examining the difference between the original signal and the coded signal. These two possibilities will be referred to as the predictive measure and the comparative measure.

² Only the original and coded signals are usually available. For high quality coded audio, the synchronisation of these two is trivial, hence the noise may be calculated via subtraction.

4.3.2 Verification: Psychoacoustic test.

To verify the performance of the model, a simple, repeatable test of audibility is required, in which the performance of the model can be compared to that of a real human listener. A suitable psychoacoustic test is the determination of masked threshold. In such a test, a human listener is required to detect one sound (the target) in the presence of another (the masker). The level at which the target is just audible is called the threshold of masking, or masked threshold. To verify the performance of the Johnston model, two simple tests are used: noise masking tone, and tone masking noise. These two properties are incorporated in the model, as described in Section 4.3.1.4, so it is reasonable to expect the model to match human performance in these tasks.

For the predictive measure, the masked threshold estimate calculated by the Johnston model is used directly to determine whether a target is audible; any target above the threshold estimate is judged to be audible. For the comparative measure, the masker and masker + target are compared, and any differences above the threshold estimate are judged to be audible.
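In code, the two decision rules are one line each. The following MATLAB sketch uses dummy per-band spectra in dB SPL; all the names and values below are assumptions for illustration only.

Tfinal          = 40*ones(25,1);     % final threshold estimate, dB SPL (dummy)
masker_dB       = 50*ones(25,1);     % masker alone, per band (dummy)
target_dB       = 30*ones(25,1);     % target alone, per band (dummy)
maskertarget_dB = masker_dB; maskertarget_dB(9) = 51;   % masker + target
% predictive measure: is any part of the target above the threshold estimate?
audible_pred = any(target_dB > Tfinal)
% comparative measure: do the two signals differ above the threshold estimate?
above        = (masker_dB > Tfinal) | (maskertarget_dB > Tfinal);
audible_comp = any(above & abs(maskertarget_dB - masker_dB) > 0)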

4.3.2.1 Noise Masking Tone

This experiment was carried out using human listeners in [Reed and Bilger, 1973] and the results presented in that paper are used as a reference. The masking noise is at a spectrum level of 35 dB/Hz. The target, a 1 kHz tone, is just audible at a level of 45 dB. Figure 4.11 shows the threshold estimate calculated by the Johnston model, and also shows the 45 dB tone. The tone is nearly 10 dB below the threshold estimate. Using the predictive measure, the model incorrectly judges the tone to be inaudible.


Figure 4.11: Noise Masking Tone – target below threshold estimate; model prediction = inaudible (incorrect)
Figure 4.12: Noise Masking Tone – masker vs masker + target; difference above threshold estimate; model prediction = audible (correct)

The results of the comparative measure are more promising. Comparing the spectrum of the masker with that of the masker + target (Figure 4.12) reveals that there is a small difference above the threshold estimate, indicating that this difference is just audible. The audible difference plot (in black) is calculated by subtracting the spectra of the two signals where they lie above the threshold estimate. The blip visible at the centre of the audible difference plot shows that there is a 1 dB difference between the masker and the target + masker – this is enough to be detected, and matches the performance of a human listener.

For this example of noise masking tone, the predictive measure did not give a direct prediction of masked threshold. However, the comparative measure correctly delineated the audible difference between the masker and masker + target signals.

The model is designed from the results of simple masking experiments, yet it apparently cannot reproduce the results of such experiments when the threshold estimate is interpreted as originally intended (as a predictive measure). Before rejecting the predictive measure, other possible sources of error will be examined.

The short-term windowing of a noise signal gives a false indication of its spectrum, a fact which may be interfering with the masking calculation. The true long term spectrum of the noise is flat from 20 Hz to 20 kHz, but the spectrum shown in Figure 4.11 is far from flat. Subsequent snapshots give differing spectra. However, at no time does the threshold estimate fall enough to cause the 45 dB tone to be judged audible, so this problem is not causing the discrepancy in the predictive measure.

Another explanation for the failure of the model is the windowing and FFT process, because it disperses the energy of the signal in the spectral domain. The frequency domain representation of a pure sine wave is infinitely narrow, but the FFT of a windowed pure sine wave has a finite spectral width, determined by the shape and length of the window function. As the length of a sine wave decreases, it becomes less well defined, since fewer wavelengths fall within the window. Hence, a shorter window will cause greater spectral dispersion. Narrow- and wide-band noise signals are also dispersed, but with one important difference. The spectrum of white noise is distributed evenly, so dispersion does not alter its peak or average value. Unfortunately, the peak value of the sine wave spectrum is significantly reduced by the dispersion. The result is that white noise with a spectrum level of 35 dB/Hz appears at the same level after the windowing and FFT. A pure tone of 45 dB appears to have a much lower peak power when passed through the same process.

In the model, the conversion to dB SPL is carried out with reference to a sine wave. Consequently, the conversion corrects for the reduction in peak sine wave level, but in doing so, it over-amplifies noise-like signals. This is clearly visible in Figure 4.11, where a 35 dB/Hz band of noise appears at 50 dB/Hz.

Increasing the window length may help to solve the problem. The original Johnston model uses a 64 ms window, whereas the present implementation uses a 23 ms (1024 sample) window. In the current test, increasing the window length to 4096 samples (93 ms) reduces the error from 8.2 dB to 6.1 dB. However, the variance from one snapshot to another is slightly greater than this value, so the improvement is not significant. This problem will be discussed further after the next test.

In conclusion, for this psychoacoustic experiment, the Johnston model does not predict human perception if the predictive measure is used, but does so if the comparative measure is employed.


4.3.2.2 Tone Masking Noise

This experiment was carried out using human listeners in [Moore and Alcántara, 1998] and the results from this paper are used as a reference. The masking tone is at a level of 85 dB SPL and a frequency of 1 kHz. The target noise band ranges from 960 Hz to 1040 Hz, and is just audible at a level of 45 dB/Hz. Figure 4.13 shows the Johnston model threshold estimate for the masking tone. Using the predictive measure, the model predicts that the target noise is audible, since the spectrum of the target (green) goes above the threshold estimate (blue), as shown in Figure 4.13. If the level of the noise is reduced by 1 dB, the model still predicts that it is audible. A reduction of 2 dB is required for the model to predict that the noise is inaudible. Hence, the model over estimates the masked threshold by 1.5 dB, but such a margin of error may be acceptable in a masking threshold determination. If the window length is increased to 4096 samples, the model under estimates the masked threshold by 0.5 dB. This is a trivial error, but again indicates that the choice of window length can be important.

Figure 4.13: Tone Masking Noise – target above threshold estimate; model prediction = audible (correct)

The comparative measure works less well in this case. With a window length of 1024 samples, the model predicts a just audible difference, but in entirely the wrong frequency region (Figure 4.14). Switching to a window length of 4096 samples, the model predicts an audible difference of 0.5 dB with the noise 5 dB below threshold (i.e. inaudible to a human listener). At the true masked threshold, the model still predicts an audible difference of 0.5 dB (Figure 4.15). As the level of the noise is increased beyond this, for every dB above masked threshold, the predicted audible difference also increases by 1 dB.


Figure 4.14: Tone Masking Noise – masker vs masker + target; difference above threshold estimate, but at wrong frequency! Model prediction = audible (correct)
Figure 4.15: Tone Masking Noise – masker vs masker + target; difference above threshold estimate; model prediction = audible (correct)

In this experiment, a predicted audible difference of 0.5 dB is inaudible to human subjects. Having an inaudible "audible difference" may sound like nonsense, but it may indicate that a different mechanism is at work here. The threshold estimate is a calculation of masking. However, the comparative measure requires that, within audible spectral regions, a sound must be identical to a second sound in order to be perceived as identical. Studies have shown that this is not the case. If two tones of the same frequency are presented sequentially, it is sometimes impossible to detect an amplitude difference of less than 1 dB between them. For this reason, it may be reasonable to expect "audible differences" (i.e. differences above the masked threshold) to be inaudible if they are less than 1 dB. In this experiment, adding a 1 kHz tone and a 1 kHz band of noise may simply increase the amplitude of the tone, making the task similar to the one outlined above.

For this example of tone masking noise, the threshold estimate gave a direct prediction of masked threshold, showing that the predictive measure gives a correct result for this experiment. Comparing the masker with the masker + target, the threshold estimate correctly delineated the audible difference between the two, if a difference threshold of 1 dB was introduced. This demonstrates that a modified version of the comparative measure gives a correct result for this experiment.
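In terms of the decision-rule sketch given in Section 4.3.2, the modification amounts to a 1 dB floor on the comparative rule (the names continue from that sketch, and remain assumptions):

floor_dB     = 1;                   % differences smaller than this are ignored
audible_comp = any(above & abs(maskertarget_dB - masker_dB) > floor_dB)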


4.3.2.3 Discussion

Three significant issues are raised by these tests.

Firstly, the predictive measure does not correctly predict the threshold of the noise masking tone experiment. As discussed in Section 4.3.2.1, this may be due to the dispersion of energy in the spectral domain inherent in the windowing and FFT process. In this test, one signal is infinitely spectrally narrow, while the other occupies the maximum spectral bandwidth of the system. This difference in signal bandwidth maximises the discrepancy due to dispersion. Such a large discrepancy would not occur in the intended use of the model, where the codec noise is designed to occupy a similar spectral space to the original signal.

Secondly, the comparative measure requires a threshold value in order to predict the masked threshold correctly.

Thirdly, the Johnston model commences with a windowing and FFT operation. This test has shown that, even in a simultaneous steady state masking experiment, where temporal effects are relatively unimportant, the window length is important.

In conclusion, the comparative measure, with a threshold value around 1 dB, may be used with some confidence. The effect of window length is discussed further in Section 4.3.4.3.

4.3.3 Assessing Audio Quality using the Johnston model

In this section, the Johnston model is used to predict perceived audio quality. An existing subjective test employing a panel of human listeners is used as a reference. The test is repeated here with the human listeners replaced by the Johnston model. The quality assessments of the Johnston model are compared to those of the listening panel to evaluate how accurately the model predicts human perception.

The human perception of the quality of various audio codecs is assessed in [Soulodre et al, 1998]. The results are presented as diffgrades, which indicate the audible difference between the original and coded signals. The scale ranges from 0 to –4, where 0 indicates that the original signal and the coded signal sound identical, and –4 indicates that the differences between the original and coded signals are "very annoying".


Figure 4.16: Human subjects' assessment of audio quality (rank and diffgrade for MPEG-1 layer II at 128, 160, and 192 kbps; MPEG-1 layer III at 128 kbps; MPEG-2 AAC at 96 and 128 kbps; and Dolby AC-3 at 128, 160, and 192 kbps)

A variety of audio codecs are tested in the paper, using eight different audio samples. One of the most challenging audio samples is the Harpsichord (Track 40 from [EBU, 1988]). This audio sample is used in the present test.

Unfortunately, the reference versions of the audio codecs employed in the published test are not generally available. However, MPEG-1 layer III, MPEG-2 AAC, and AC-3 codecs have been obtained from the same companies who provided the reference codecs used in the published test. An MPEG-1 layer II software codec has been obtained from the only company currently offering such a codec. Thus, the coded audio that will be judged by the Johnston model is similar, but not identical, to that auditioned by the human listeners in the original test.

The human perception of audio quality for each codec is shown in Figure 4.16. The diamond points show the diffgrades from [Soulodre et al, 1998], using the reference codecs.

The author auditioned the decoded audio from each codec, and ranked the results in the order of preference indicated by the shaded bars in Figure 4.16. This ranking does not correspond to the diffgrades. This indicates that either the author has a very different perception of sound quality from the listening panel, or the codecs used here are very different from those used in the original test. Whilst no two people perceive sound in an identical manner, the author believes that the codecs are more likely to be at fault. This is borne out not only by over four years' subjective assessment experience, but also by the fact that the most mis-ranked codec (MPEG-1 layer II) is the one which was not obtained from the same source as the reference version. The sound of the MPEG-1 layer II codec is an order of magnitude worse than that of the others on this test. Since the diffgrades do not correspond to the audio samples that will be presented to the model, the ranking must be used as a reference instead.

Figure 4.17: Instantaneous audible difference
Figure 4.18: Audible difference surface

The Johnston model will be used with the predictive measure and the comparative measure, as outlined in Section 4.3.1.9. The audible difference calculated for a single FFT frame is shown in Figure 4.17. Calculating this over successive frames gives the audible difference surface, which varies over time and frequency, as shown in Figure 4.18. This rendering of the 3-D audible difference surface onto a 2-D medium is difficult to interpret. As an alternative, Figure 4.19 shows the same audible difference plotted as a colour image, where the colour at each point represents the audible difference for that time and frequency, as indicated on the colour scale to the right. The spectrum of the original signal can also be represented in this manner, as shown in Figure 4.20. This is similar to a conventional spectrogram, except that the lowest frequencies are shown at the top of the graph, and the frequency scale is linear, rather than logarithmic.

Figure 4.20 shows the first second of the original signal. This section of the signal consists of the first two notes of an arpeggio. The onset of the first note occurs at t = 0 seconds, whilst the onset of the second note occurs at t = 0.75 seconds. The reader should note that both these onsets are apparent in Figure 4.20, as is the decaying high frequency spectrum as each note dies away. The features of the audible difference colour image plot (Figure 4.19) can also be interpreted. The horizontal blue bands represent spectral components that have been removed by the codec, whilst any red areas represent noise that has been added by the codec. If this audible difference plot matches human perception, then the codec shown in Figure 4.19 is removing many audible spectral components. There is no predicted audible difference at high frequency (the bottom of the plot). The reason is that, whilst there are differences between the original and coded signals within this frequency region, the Johnston model predicts that these will all lie below the minimum audible threshold, and hence will be inaudible.

Figure 4.19: Audible difference colour image plot
Figure 4.20: Original signal colour image plot

The following audible difference plots represent the Johnston model's assessment of the coded audio from the subjective test. The codecs are presented in the order in which a human listener ranked them, with the best first. If the Johnston model does match human perception, then some trend should become apparent. The left-hand plots are calculated by the predictive measure, the right-hand plots by the comparative measure.

Figure 4.21 (a): Johnston model audible difference predictions for the codecs under test (predictive measure, left; comparative measure, right): AC-3 192 kbps (–0.6), AAC 128 kbps (–0.7), AC-3 160 kbps (–1.6), AC-3 128 kbps (–2.6)

Figure 4.21 (b): Johnston model audible difference predictions for the codecs under test (predictive measure, left; comparative measure, right): AAC 96 kbps (–1.0), layer III 128 kbps (–2.1), layer II 192 kbps (–1.2), layer II 160 kbps (–1.9)

Figure 4.21 (c): Johnston model audible difference predictions for the codecs under test (predictive measure, left; comparative measure, right): layer II 128 kbps (–2.4)

Meaningful analysis of the audible difference surface is a complex task. Some numerical solutions are discussed in [Hollier et al, 1995]. However, for our present purpose of judging the ability of the Johnston model to predict human perception, visual examination will suffice.

If the model correctly predicts human perception of audio quality, then some aspect of the audible difference will become more pronounced as the audio quality is reduced. Unfortunately, this is not universally true. This expectation does hold within each codec family; as the bitrate is reduced, the sound quality deteriorates, and the audible difference surface becomes more active. However, across codecs, no relationship between perceived sound quality and the activity of the audible difference surface is apparent.

A notable case where the Johnston model fails to predict human perception of audio quality is the AAC 96 kbps codec. Both the listening panel and the author ranked this codec as significantly better than any of the layer II codecs. However, the Johnston model predicts a large audible difference for the AAC codec, whereas the audible difference surface for the 192 kbps layer II codec is quieter overall, with less obvious peaks and fewer spectral components.

This reveals a problem with the Johnston model in particular, and with auditory models in general. The auditory model built into the layer II codec is similar to the Johnston model itself. When the Johnston model assesses the quality of layer II coded audio, it uses the same criteria as the original codec. Hence, any shortcomings of the layer II codec are missed by the Johnston model. For this reason, the auditory model used within a perceptual measurement system must always be more advanced and accurate than those used within the codecs it is designed to assess.

Interestingly, the AAC codec contains a more advanced auditory model than that of Johnston, and in this instance the Johnston model "hears" problems that are not apparent to human listeners. This demonstrates that, where the assessment model is inferior to a given codec, the model may judge inaudible noise to be audible, since the codec is aware of a type of masking which the model does not calculate.

In conclusion, the Johnston model fails to correctly assess perceived audio quality because it is less advanced than the models found within the codecs it has assessed.

4.3.4 Criticism of the Johnston model

The following features are absent from, or inadequately addressed in, the Johnston model. The aim is to address all of these issues in the work that follows, in order to develop a model that is capable of assessing state-of-the-art audio codecs in accordance with human perception.

4.3.4.1 Spectral Masking determination

The Johnston model estimates the spectral masking in each critical band, yielding 25 individ- ual masking estimates over the entire audio bandwidth. Thus, the masking estimate surface is constant over each critical band, but discontinuous at the critical band boundaries.

Whilst it is true that 25 critical bandwidth filters would cover the entire audible spectrum, there are not 25 such filters within the auditory system. Rather, there are many overlapping filters. The filter characteristic is defined by the response of the basilar membrane at any given point, as discussed in the previous chapter. The basilar membrane is continuous, though its move- ment is transduced by individual (discrete) hair cells, which act as detectors. The Johnston model uses one detector per critical bandwidth, whereas the human auditory system contains over one hundred. Though the human auditory system is a discrete system, the resolution is so fine that it is usually assumed to be continuous, since the spectral spread of movement along the BM, rather than the individual hair cells, is the limiting factor. For this reason, any accurate estimate of masking should be free from the discontinuities that are inevitable in the Johnston model.


Figure 4.22: Temporal Masking – threshold of detecting 2 kHz tone after white noise. (a): masker = 5 ms noise burst; (b): masker = 200 ms noise burst

4.3.4.2 Temporal Masking determination

The Johnston model does not include a calculation of temporal masking. The window length sets the time resolution of the model, and this is chosen to be "perceptually appropriate". However, temporal masking is highly signal dependent, and is not accounted for by windowing alone.

The largest temporal masking effect is that of post-masking, where the target signal occurs after the masker. Figure 4.22 shows the threshold of detecting a 5 ms, 2 kHz tone after an 80 dB burst of white noise. Figure 4.22 (a) shows the threshold after a 5 ms burst of noise, whilst Figure 4.22 (b) shows the threshold after a 200 ms burst of noise (from [Zwicker, 1984]). The time scale is logarithmic, and the noise burst ceases at t = 0 ms. Comparison of the two figures reveals that the duration of the masker has a significant effect on the decay of masking.

The Johnston model may fail to predict any masking in this situation, a fact that can be demonstrated by example. The Johnston model calculates the masking estimate on a frame by frame basis. The default frame-length gives an inter-frame time of about 20 ms. If the noise burst ceases in the middle of one frame, and the target falls within the subsequent frame, the Johnston model does not attempt to calculate the masking of one sound by the other. Figure 4.22 (b) shows that, for a human listener, the target tone is masked below a level of 40 dB, 20 ms after the end of the noise burst. This masking is ignored by the Johnston model, which illustrates why temporal masking should be included in the present model.
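For illustration, post-masking can be grafted onto a frame-based threshold estimate by smearing each band across frames with a first order low pass recursion, in the manner of the PEAQ models summarised in Table 4.1. The sketch below is not part of the Johnston model; the 30 ms time constant and all variable names are assumptions.

nFrames = 50;
Tframes = 40 + 10*randn(25, nFrames);     % per-frame thresholds, dB (dummy)
tau = 0.030;                              % assumed 30 ms post-masking decay
hop = 0.020;                              % 20 ms between frames
a   = exp(-hop/tau);                      % first order recursion coefficient
Tsmear = zeros(25,1);
for k = 1:nFrames
    Tlin   = 10.^(Tframes(:,k)/10);       % this frame's threshold, linear
    Tsmear = max(Tlin, a*Tsmear);         % decaying memory of louder frames
    Tframes(:,k) = 10*log10(Tsmear);      % smeared threshold, back to dB
end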


4.3.4.3 Pre-echo detection

A masker may mask a signal that precedes it. This is known as pre-masking. The effect is much smaller than that of post-masking (5 ms compared to 200 ms). Whereas the Johnston model underestimates (or more correctly, does not estimate) post-masking, the effect of windowing is to overestimate pre-masking. This is because a 20 ms time window is used by default. Whilst any temporal smearing of the signal up to 20 ms will remain undetected by the Johnston model, such smearing is easily detected by a human listener, due to the 5 ms limit on pre-masking.

Temporal smearing is a common problem with the MPEG-1 layer III audio codec, most readily demonstrated with the Castanets track from [EBU, 1988]. Typical listener impressions of this codec include “a softening of transients” and reports that “the sounds of castanets are spread out in time, sounding more like a whoosh than a sharp crack”. In a subjective test ([Meares et al, 1998]), it is reported that the diffgrade for this item is –1.6. The audible artefacts include “temporal distortion, high frequency loss, high frequency distortion”.

The results of the Johnston model's assessment of this item, via the predictive measure, are shown in Figure 4.23 and Figure 4.24.

Using a 2048-sample window length (the original Johnston frame size) and the predictive measure, the model calculates that there is no audible difference between the original and coded audio signals. This is shown in Figure 4.23, where the coding noise (shown in green) falls below the threshold estimate (shown in blue) for every frame. Halving the window length, to the value suggested in [Paillard et al, 1992], fails to reveal any audible difference. Switching to a 512-sample window length, the model finally detects an audible difference between the original and coded signals. Figure 4.24 shows that a large audible difference is detected in the 3rd and 4th frames, where the coding noise (green) is above the threshold estimate (blue).

Figure 4.23: 2048-sample window length Johnston analysis of Castanets signal. Mid right: time domain waveform (coded audio), running vertically downwards, with Hanning window functions shown in blue. Far right: 512-sample analysis (Figure 4.24) shown for comparison.

Figure 4.24: 512-sample window length Johnston analysis of Castanets signal. Left: time domain waveform (coded audio), running vertically downwards, with Hanning window functions shown in blue.

Unfortunately, a window length of 512 samples gives a frequency resolution of 86 Hz, and yields unreliable information about spectral content below 86 Hz. This would be a serious restriction for any high-quality audio assessment tool, so the use of a 512-sample window in isolation can be ruled out.


One solution to the many windowing problems may be to perform the calculations of the Johnston model in parallel using several different window lengths. However, a less cumbersome solution may be to use a basic transformation that matches both the time and frequency resolution of the human auditory system. This approach will be considered in the present model.

4.3.4.4 Non-linearity

The human auditory system is non-linear. In engineering terms, this means that the internal representation of signal A plus the internal representation of signal B is not equal to the internal representation of signal A+B. Whilst the masking effects discussed so far may be termed non-linear, there are two specific areas where measurable non-linear behaviour may be relevant to an audio quality assessment model.

Firstly, the amount of masking varies with the amplitude, frequency, and duration of the masker. None of these relationships are monotonic, but the variation with amplitude is the most non-linear. The shape of the spectral masking curve is dependent on the level of the masker, such that a louder masker produces more masking than a linear masking model would suggest.

Secondly, intermodulation distortion occurs within the cochlea. Two pure tones presented to a listener give rise to distortion components that can be heard by the subject, and detected with a probe microphone. The frequencies of the distortion components are easily calculated, since they are integer difference frequencies (e.g. $2[f_1 - f_2]$). However, the amplitude of distortion components does not follow any simple rule. This is discussed extensively in Appendix A.

4.3.4.5 Binaural hearing

Human beings listen via two ears, and most current audio material is stereophonic (2-channel). If this material is replayed via headphones, then a simple channel to ear mapping occurs. Where the material is auditioned using two loudspeakers, a mix of sound from each speaker reaches each ear, the exact ear signals being dependent on the position of the listener and the geometry of the speakers and listening room.

The Johnston model is explicitly a monophonic, or single channel, model. The near universal practice when using single channel models with stereophonic signals is to employ two identical, independent models in parallel, one for each channel. This approach does not match human auditory perception. The neural connections present in the auditory system indicate that information from both ears is compared centrally, as discussed in Section 3.2.5.2. Also, listening tests with human subjects prove that our hearing ability is better than the independent performance of two ears would suggest.

Figure 4.25: Binaural Masking Level Difference

For example, consider the experiment illustrated in Figure 4.25. A tonal signal and a noise masker are presented at one or both ears; the smiling face indicates the condition in which the listener can hear the tone. With both the tone and noise presented to the left ear only, the tone is below threshold, and is inaudible. If, in addition, the noise is presented to the right ear, the listener can now detect the tone in the left ear. At first, this seems counter-intuitive: the total noise power has increased and yet the tone becomes audible. The reason is that the human auditory system compares the signal at each ear, and whereas the noise is common to both ears, the tone is present at one ear only. Therefore, the tone is easier to detect.

This is verified by presenting both signals to both ears; now the listener fails to detect the tone, since there is no advantage gained in comparing the signal at both ears. Finally, the tonal signal to one ear is inverted, and the listener can hear the tone again. In this experiment, the threshold of detecting the tone when it is inverted at one ear is 15 dB lower than that of detecting the tone when it is identical at each ear [Moore, 1997].

Note that if the comparison between ears was perfect, then the threshold for detecting differences between the signals at each ear would be equivalent to the absolute threshold. This is not the case. However, the comparison does yield a significant change in masked thresholds that cannot be predicted by two monophonic models operating independently. This concept will be addressed in the present model.

4.4 Other models

The Johnston model is a single, relatively primitive masking model. Many other models exist, designed for psychoacoustic research, audio coding, and audio quality assessment. Some of the issues raised in the previous section are addressed in some of these models. The purpose of reviewing so many models is to determine which features can be usefully incorporated into the present model, and which features have not been incorporated into any model to date.

Table 4.1 below summarises the features of the most important auditory models.

General notes to Table 4.1

All of the models are purely monophonic, apart from PEAQ advanced [Thiede et al, 2000], which is designed to find the loudest error out of the two channels.

The models are compared as published. In reality, some of the computational models may contain details and refinements that are not fully described in the short space permitted in an international publication.

In many of the models, frequency dependent weighting and/or pre-filtering accounts for the shape of the absolute threshold, whereas internal noise and/or a fixed internal offset accounts for the level of the absolute threshold. None of the models account for impaired hearing, or a raised absolute threshold.

Table 4.1: Comparison of auditory perceptual models

[Colomes et al, 1995]
   Purpose: audio codec + assessing perceived audio quality³
   Outer + middle ear: pre-filtering: ear canal and LPF
   Frequency transform: Fourier transform, 2048 samples, grouped into 600 bands equally spaced on the Bark scale according to [Zwicker and Feldtkeller, 1967]
   Spectral masking: convolution with excitation curve from linear formula⁴
   Amplitude transform / hair cell response: conversion to dB SPL, fixed replay level
   Temporal masking: -
   Detector: comparative; psychometric function

[Dau et al, 1996a+b]
   Purpose: psychoacoustic tool
   Outer + middle ear: -
   Frequency transform: filter bank; linear basilar membrane filter bank model, after [Strube, 1985]
   Spectral masking: arises from filter bank response and internal noise
   Amplitude transform / hair cell response: rectification; 5 feedback loops control gain
   Temporal masking: arises from 5 feedback loops and internal noise
   Detector: comparative; optimum detection unit

[Hollier et al, 1993+1995]
   Purpose: objective perceptual meter
   Outer + middle ear: -
   Frequency transform: filter bank; one-third-octave filter bank, energy averaged over 4 ms
   Spectral masking: computed directly via model of frequency discrimination
   Amplitude transform / hair cell response: mapping via equal loudness curves, inclusion of MAF
   Temporal masking: computed directly via model of temporal masking decay
   Detector: comparative; advanced error surface analysis

[Johnston, 1988a+b]
   Purpose: audio codec
   Outer + middle ear: -
   Frequency transform: Fourier transform, 2048 samples, grouped into 24 bands equally spaced on the Bark scale according to [Zwicker and Terhardt, 1980]
   Spectral masking: convolution with triangular excitation curve + offset due to tonality of signal
   Amplitude transform / hair cell response: conversion to dB
   Temporal masking: -
   Detector: predictive; direct threshold calculation

[Thiede et al, 2000] basic – "PEAQ"
   Purpose: measuring perceived audio quality
   Outer + middle ear: frequency dependent pre-filtering and weighting function
   Frequency transform: Fourier transform, 2048 samples, grouped into 0.25 Bark bands according to [Schroeder et al, 1979]
   Spectral masking: computed directly; frequency dependent smearing of excitation pattern (plus fixed offset simulating internal noise)
   Amplitude transform / hair cell response: conversion to dB, adjusted for replay level
   Temporal masking: temporal smearing; frequency dependent IIR filter smears excitation pattern; unfiltered value used if larger
   Detector: comparative; statistical detection in each band and across all bands

[Thiede et al, 2000] advanced – "PEAQ"
   Purpose: measuring perceived audio quality
   Outer + middle ear: frequency dependent weighting
   Frequency transform: filter bank; 40 filters equally spaced on the Bark scale according to [Schroeder et al, 1979]; upper slope level dependent after [Terhardt, 1979], with temporally smoothed dependency
   Spectral masking: arises from non-linear filter bank response and effective rectification, plus frequency dependent offset simulating internal noise
   Amplitude transform / hair cell response: conversion to dB, adjusted for replay level
   Temporal masking: temporal smearing; 8 ms FIR low pass and 4 ms first order IIR low pass filters smear the temporal signal
   Detector: comparative; statistical detection in each band and across all bands

[Paillard et al, 1992] – "PERCEVAL"
   Purpose: Perceptual Evaluation of audio quality
   Outer + middle ear: multiplication of energy spectrum by frequency dependent weighting
   Frequency transform: modulated lapped transform, 1024 samples @ 44.1 kHz; sliding average taken for each frequency component over 3 frames; transformed to 2500 point Bark scale
   Spectral masking: computed directly; non-linear superposition of local level; Bark scale representation filtered along frequency bands to spread energy; internal noise added
   Amplitude transform / hair cell response: logarithm of basilar spectrum
   Temporal masking: -
   Detector: comparative; detection probability calculated in each band, smoothed over time, and the maximum taken

[Patterson et al, 1992a,b] – "Auditory Image Model"
   Purpose: psychoacoustic tool
   Outer + middle ear: middle ear filtering
   Frequency transform: filter bank; gammatone filterbank or transmission line filterbank, with spectral sharpening and compression
   Spectral masking: may arise from the filter bank and the random nature of inner hair cell firing
   Amplitude transform / hair cell response: inner hair cell simulation, or compression and 2-D adaptive thresholding
   Temporal masking: strobed temporal integration or correlogram, which may give rise to some masking
   Detector: -

[Rimell and Hawksford, 1996]
   Purpose: perceptual audio quality assessment
   Outer + middle ear: full room simulation and HRTF filtering
   Frequency transform: Fourier transform, 40 ms, Hanning windowed; frequency scale warped to Bark scale
   Spectral masking: may arise from the frequency transform if some internal noise or uncertainty is added
   Amplitude transform / hair cell response: compressed loudness, calculated via an equation which simulates equal loudness curves
   Temporal masking: frequency dependent temporal smearing between frames
   Detector: comparative

[Robert and Eriksson, 1999]⁵
   Purpose: psychoacoustic tool
   Outer + middle ear: first order butterworth bandpass filter, 1.4 kHz – 20 kHz
   Frequency transform: filter bank; active gamma-tone filters, the filter shape varying with complex feedback from adjacent frequency channels
   Spectral masking: arises from the filter bank and the random nature of inner hair cell firing
   Amplitude transform / hair cell response: inner hair cell simulation, including half wave rectification, low pass filtering, and adaptation
   Temporal masking: arises from adaptation and the random nature of inner hair cell firing
   Detector: -

[Zwicker and Zwicker, 1984]
   Purpose: general perceptual audio transform⁶
   Outer + middle ear: -
   Frequency transform: frequency to Bark transform; conversion to power
   Spectral masking: computed directly; excitation level implies it
   Amplitude transform / hair cell response: specific loudness
   Temporal masking: specific loudness smoothed over time
   Detector: -

[Beerends and Stemerdink, 1992]
   Purpose: perceptual audio quality assessment
   Outer + middle ear: -
   Frequency transform: Fourier transform, 40 ms, Hanning windowed; frequency scale warped to Bark scale
   Spectral masking: convolution with a unique (to this model) spreading function
   Amplitude transform / hair cell response: compressed loudness
   Temporal masking: temporal smearing
   Detector: comparative; perceived noise = log (difference between internal representations of inputs, averaged over time)

³ This model assumes an audible error (i.e. low quality coded audio) and attempts to judge the subjective impairment due to the error.
⁴ In this paper, an excitation curve with a level dependent upper slope is tested, but this approach is rejected, as it decreases performance.
⁵ This model post-dates the monophonic section of the model developed in the present thesis [Robinson and Hawksford, 1999].
⁶ This paper does not present a model per se; however, it contains principles for transforming audio into a perceptually relevant internal representation.

4.4.1 Comparison of models

Whilst the models vary in complexity and intended purpose, Table 4.1 shows that many have common features. Most pre-filter the audio signal; all use some kind of time to frequency transform, with a near logarithmic internal frequency scale; all convert the signal amplitude into some kind of logarithmic representation. These processes correspond to stages within the auditory system, discussed in the previous chapter.

Some of the models calculate spectral masking explicitly, and most simulate temporal masking with a smearing, spreading, or low pass function. However, some of the models simulate neither spectral nor temporal masking directly. Rather, they simulate the processes within the HAS which give rise to masking, and let the masking arise as a consequence (see Section 4.2).

The number of frequency bands or bins varies dramatically, from nine [Hollier, et al, 1993+1995] to 600 [Colomes et al, 1995]. The models employing hundreds of bands do not appear to exhibit better performance than those using tens of bands7. Both PEAQ versions [Thiede et al, 2000] predict human perception admirably. The filter bank version of PEAQ yields the more accurate prediction of human perception, though the filter bank version uses only 40 bands, while the inferior FFT version uses 100. However, the amount of information generated by 40 filters is much greater than that generated by a 1024 sample FFT summed into 100 frequency bins. In particular, the filter bank generates continuous time-domain signals, whereas the FFT generates discrete snapshots. Post-processing may recover further frequency information from the output of a filter bank, whereas FFT components, once summed into coarser bands, cannot be decoded to yield further time or frequency domain information. For this reason, if a filter bank model is chosen, fewer frequency bands will be employed than if an FFT model is developed.

All the models in Table 4.1 except Johnston use a comparative measure to calculate the audible difference between two signals. This approach will be used in the present model, since the predictive measure is unreliable, as demonstrated in Section 4.3.2.

7 This conclusion carries the following caveat: all authors conclude their papers with positive reports of their model's performance, but few include a critical comparison with other models.

[Rimell and Hawksford, 1996] (from [Beerends and Stemerdink, 1992]) and [Thiede et al, 2000] (basic) represent the most advanced attempts to simulate the effects of the human auditory system, without simulating any components of the auditory system directly. This is one possible approach for the present work. However, [Thiede et al, 2000] (advanced) move away from this approach, and closer to the actual workings of the auditory system, by replacing the FFT with a filter bank. This development reduces computational efficiency, but improves performance. There is extensive data in [Thiede et al, 2000] demonstrating that the advanced model matches human perception more closely than the basic model. For this reason, the present model will be designed to closely match the processes found within the human auditory system.

The use of a filter bank removes all the problems associated with windowing, which were amply demonstrated throughout Sections 4.3.2 and 4.3.3. However, the level-dependent non-linearity of spectral masking is easier to simulate in an FFT based model, where the spectral masking is calculated directly. To incorporate this non-linearity into a filter bank based model, the filter bank itself must be non-linear, as in [Thiede et al, 2000].

Most of the models in Table 4.1 include a filter that smears the signal envelope in order to simulate temporal masking. Whilst this yields a first-order approximation, the aim is to predict temporal masking more accurately in the present model.

Finally, none of the models in Table 4.1 address spatial masking or binaural hearing. This is a significant area in which the present model aims to improve upon the performance of existing models.

4.5 Conclusion

In this chapter, the concept of auditory modelling has been introduced. A simple model has been tested, and shown to give satisfactory performance in simple psychoacoustic tests. However, the model was unable to reliably assess the audio quality of codecs that contain a more advanced auditory model than its own. Current state-of-the-art models have been compared, and suggestions have been made for areas where the present work may exceed the performance of existing models. These include spatial masking, temporal masking, and non-linearity.

5 Monophonic Auditory Model

5.1 Overview

In this chapter, an auditory perceptual model for assessing the perceived quality of coded audio signals is described. This model simulates the functionality of the physiology found within the human ear. The passage and transformation of sounds from the free field to their neural representation is simulated. The neural signal is processed by a novel detection network. Finally, the model is calibrated using known psychoacoustic data.

5.2 Introduction

A perceptual model aims to simulate human perception. The task of the perceptual model within any audio assessment algorithm is to simulate the human auditory system. There are many approaches to this task, as discussed in detail in the previous chapter. They range from modelling the coarse effects of the auditory system, to modelling the actual processing that occurs at a neural level. The former gives a poor approximation of human perception, while the latter is computationally burdensome, and yields such vast quantities of data that any further processing (e.g. the prediction of perceived sound quality) is very complex.

If the processing within the model is similar to that present within the human auditory system, then the model may accurately predict the perception of a human listener. For this reason, the present model will simulate the processes present within the human auditory system, but on a macro, rather than micro-scale. The effects of the auditory system (e.g. spectral and temporal masking) which are so important in any measurement algorithm, arise as a consequence of these processes.


Figure 5.1: Overall structure of the monophonic model

5.3 Structure of the model

The model presented here is based upon the processing present within the human auditory system, as described in Chapter 3. The structure of the auditory model is shown in Figure 5.1. Each individual component is described in the following sections.

5.3.1 Signal Preparation and Implementation

The model is programmed using the MATLAB environment, and all code is included on the accompanying CD-ROM. All input signals are fed to the model via .wav format files, and where appropriate, intermediate results are stored in 32-bit accurate .wav files. The MATLAB functions for reading and writing .wav files have been modified by the author to allow the use of 32-bit .wav files, and to enable truly random read/write access to the files.

Any D.C. offset is removed from the input signal before further processing. Where two signals are presented for comparison, the signals are time aligned to the nearest sample. This process is carried out either by manipulating the two waveforms within a sample-accurate audio editor [Syntrillium, 2000], or automatically by cross correlation, using the MATLAB routine waveta, which is included on the accompanying CD-ROM.
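For illustration, the automatic alignment step can be sketched as follows. This is a minimal reconstruction of the idea, not the waveta routine itself; the variable names are assumptions, and xcorr is the standard MATLAB Signal Processing Toolbox cross-correlation function.

    % Align two mono signals to the nearest sample by cross-correlation.
    % x is the reference (original) signal; y is the signal to be aligned.
    [r, lags] = xcorr(y, x);           % correlation at every possible lag
    [mx, k]   = max(abs(r));           % strongest correlation
    shift     = lags(k);               % positive => y lags behind x
    if shift > 0
        y = y(shift+1:end);            % discard leading samples of y
    else
        x = x(1-shift:end);            % discard leading samples of x
    end
    n = min(length(x), length(y));     % trim both to a common length
    x = x(1:n);
    y = y(1:n);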


Source Material    ms        Gain relative to SMPTE RP 200
Default setting    90 dB     -13 dB
Chart CD single    89 dB     -14 dB
Pop CD album       91 dB     -12 dB
Classical CD       99 dB     -4 dB
Audiophile CD      101 dB    -2 dB
SMPTE RP 200       103 dB    0 dB

Table 5.1: Suggested values of ms for different musical genres, and corresponding gain adjustments relative to SMPTE RP 200 reference

The relationship between the digital representation of the waveform and the real world sound pressure is controlled by the variable ms (maximum SPL). The ear's response to a signal presented at 40 dB is very different to the same signal presented at 90 dB, so this information is required by the model. ms is defined as the sound pressure created by a digital full scale sine wave at the centre of the listener's head, given in units of dB SPL. The default value is 90 dB. This is chosen because it is a typical listening level for popular music, and the theoretical noise-floor of CD quality audio is inaudible at this level.

The value of ms is effectively a measurement of the monitor gain of the audio system through which the signal is (hypothetically) being auditioned. In simple terms, it communicates the position of the listener's volume control to the model. Surprisingly, there is no accepted standard within the audio industry which relates digital signal amplitude to real world SPL. However, the film industry does have such a standard [SMPTE RP 200, 1999]. Table 5.1 lists approximate values of ms, and the appropriate gain adjustments relative to the SMPTE standard, for various styles of music. The different values are not due to musical style per se, but due to the manner in which different musical genres are typically mastered at different levels on CD.
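As a minimal illustration of this convention (the component amplitude A below is an arbitrary example value, not thesis data):

    % ms is the SPL (dB) produced by a digital full-scale sine wave.
    ms = 90;                        % default listening level (Table 5.1)
    gain_re_smpte = ms - 103;       % gain relative to SMPTE RP 200 (103 dB)
    A   = 0.1;                      % example digital amplitude, 0 < A <= 1
    spl = 20*log10(A) + ms;         % corresponding level: 70 dB SPL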

It is very important to set ms correctly, especially when testing classical music. If the default value of 90 dB is employed, the lowest 10 dB of dynamic range will fall below absolute threshold. All signals below absolute threshold are ignored by the model, so any "audible" problems in this range will be missed. However, a real human listener would increase the volume to listen to the quieter classical CD, thus raising any problems within the lowest 10 dB up into the audible range. Hence ms must be increased correspondingly, or the model will give misleading results.

In yet another scenario, ms may be lowered to simulate quiet (late night) listening, where lower quality audio may be acceptable. Thus, the specification and inclusion of ms within the model yields new applications and flexibility.

5.3.2 Pre-filtering

Figure 5.2: HRTF 30°; amplitude, phase, and impulse response; fs = 44.1 kHz

The filtering of the pinna and ear canal is simulated by an FIR filter, derived from measurements carried out via a KEMAR dummy head. An arbitrary angle of incidence is chosen, in this case 30°, yielding the response shown in Figure 5.2. The KEMAR measurements are used because they are readily available for this research. Measurements from human subjects could be used as a more accurate and realistic alternative (e.g. [Hammershøi et al, 1992] and [Møller et al, 1995b]).
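In code, applying the pinna and ear canal response reduces to FIR filtering of the input. A minimal sketch follows; the file names are hypothetical, and wavread is the MATLAB .wav reader of that era.

    % Simulate 30 degree free-field incidence by filtering the input with
    % the measured HRTF impulse response.
    hrtf = wavread('kemar_30deg_left.wav');   % FIR coefficients (hypothetical file)
    x    = wavread('input.wav');              % test signal (hypothetical file)
    y    = filter(hrtf, 1, x);                % FIR filtering of the input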

5.3.3 Middle Ear filtering

Figure 5.3: Equal Loudness filter to simulate outer- and middle-ear frequency response

A filter based upon inversion of the Equal Loudness curves [Robinson and Dadson, 1956] provides a good approximation to the response of the outer and middle ear (see [Moore et al, 1997] for further discussion). The equal loudness curves were discussed in Chapter 3, and a plot of these curves is shown in Figure 3.5.

An IIR filter was designed to match the target response using the MATLAB yulewalk function. The yulewalk function only considers the amplitude response when optimising the filter, hence the phase response is erratic. A stable and computationally efficient filter was produced by cascading a 10-point yulewalk-designed IIR filter with a 3-point 2nd order butterworth high-pass IIR filter. The latter was employed to produce the desired low frequency attenuation, which would have required an excessive number of taps if implemented within the first filter. This dual-filter method has the advantage that, should the low frequency response be in error (due to internal noise, as discussed in Chapter 3), then it can easily be altered without re-designing the entire filter. The response of the combined filter is shown in Figure 5.3.
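The design procedure may be sketched as below. The target response points f and m are placeholders standing in for the inverted equal loudness data, and the filter orders and cutoff are illustrative choices, not the calibrated values used in the thesis.

    % Outer/middle-ear filter: yulewalk IIR fit, cascaded with a
    % butterworth high-pass for the low frequency attenuation.
    x = wavread('input.wav');                % input signal (hypothetical file)
    f = [0 0.01 0.05 0.1 0.25 0.5 0.8 1];    % normalised frequencies (placeholder)
    m = [0 0.1  0.7  1.0 1.0  0.6 0.3 0.1];  % desired magnitudes (placeholder)
    [b1, a1] = yulewalk(10, f, m);           % IIR fit to the amplitude response
    [b2, a2] = butter(2, 0.002, 'high');     % 2nd order high-pass, cutoff ~44 Hz
    y = filter(b2, a2, filter(b1, a1, x));   % apply the cascade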

This filter simulates the response of the ear canal and middle ear. However, the HRTF filter also includes the ear canal response, and comparison of Figure 5.2 and Figure 5.3 confirms that both filters contain a common band-pass component. If the pinna response is not required (for example, when simulating headphone listening) then the HRTF filter should be dispensed with. If the pinna response is required, then a diffuse normalised HRTF may be used1, since diffuse normalisation removes the ear canal response. (See [Robinson and Greenfield, 1996] for further details.) Both raw and diffuse normalised HRTFs are included on the accompanying CD-ROM.

5.3.4 Basilar membrane filtering

A bank of amplitude dependent filters simulates the response of the BM. Each filter is an FIR implementation of the gammachirp, described in [Irino and Patterson, 1997], and simulates the response of the BM at a given point.

The formula for the time domain (impulse) response of the gammachirp filter is

g_c(t) = a t^(n−1) exp(−2πb·ERB(f_r)·t) cos(2πf_r·t + c·ln(t) + φ)    (5-1)

where ERB is the equivalent rectangular bandwidth of the filter, given by

ERB(f_r) = 24.7 + 0.108 f_r    (5-2)

1 The MAF measurements were carried out using a single loudspeaker in front of the listener. For this reason, a frontal HRTF response should be embedded within these measurements. Whilst there are some traces of the 0° HRTF visible in the MAF curve, it is not an accurate representation of the response. Either the MAF data has been smoothed, or the listener subconsciously corrected for the HRTF. The latter is probable. In everyday life, the varying frequency response of sounds from different directions is hidden from us by the auditory system. The direction dependent frequency response (HRTF) is decoded, such that we perceive the correct direction, and a flat frequency response. For this reason, it is assumed that the MAF does not contain an HRTF response, although some trace of the HRTF may be present. Hence, a diffuse normalised HRTF is employed in the simulation of free-field listening. If a future MAF measurement clearly contains an accurate frontal HRTF response, then it may be used in isolation, or with a front-normalised 30° HRTF. The reader is referred to [Robinson and Greenfield, 1996] for an extensive discussion of HRTF normalisation.

Figure 5.4: Gammachirp filter; amplitude, phase, and impulse response; fs = 44.1 kHz

In [Irino and Patterson, 1997], data from four separate studies on the shape of the human auditory filter is used to calculate values for a, b, c and n. The chosen values for b and n are 1.14 and 4 respectively. The term a determines the overall amplitude of the filter. In the model, a is set individually for each filter, such that the centre frequency is not attenuated (though an implementation issue of this approach will be discussed in Section 5.3.4.2). The term c ln(t) causes the amplitude dependency, where c is proportional to the amplitude of the signal in each band. This will be discussed in Section 5.3.4.3. A single gammachirp filter is shown in Figure 5.4.
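A single impulse response can be generated directly from equations (5-1) and (5-2). In this sketch the centre frequency, filter length and amplitude term (a = 1) are illustrative choices; with c = 0 the filter reduces to the linear gammatone case.

    % Gammachirp impulse response, equations (5-1) and (5-2).
    fs  = 44100;                % sampling frequency (Hz)
    fr  = 1000;                 % filter centre frequency (Hz)
    b   = 1.14;  n = 4;         % values from [Irino and Patterson, 1997]
    c   = 0;     phi = 0;       % c varies with level (Section 5.3.4.3)
    t   = (1:2048)'/fs;         % start one sample in, avoiding ln(0)
    erb = 24.7 + 0.108*fr;      % equation (5-2)
    g   = t.^(n-1) .* exp(-2*pi*b*erb*t) .* cos(2*pi*fr*t + c*log(t) + phi);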


5.3.4.1 Spacing of the filters

The filters are spaced linearly on the Bark frequency scale, or critical-band rate scale. This scale correlates with the spacing of resonant frequencies along the BM. The critical band number z (in Bark) is related to the linear frequency f, thus

z = 26.81 f / (1960 + f) − 0.53    (5-3)

with the correction that, for calculated z > 20.1, the actual Bark value z′ is given by

z′ = z + 0.22 (z − 20.1)    (5-4)

This definition of the Bark scale is taken from [Traunmüller, 1990]. Comparison with measured data [Zwicker, 1961] shows that this formula is accurate to within ±0.1 Bark. This is more accurate than any other available formula (cf. the auditory models reviewed in the previous chapter). Formulae (5-3) and (5-4) have two further advantages. Firstly, they are computationally simple, and secondly, they are easily inverted to give a transformation from Bark to linear frequency, ensuring that a conversion from frequency to Bark and back again will yield the original frequency.
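These conversions may be written directly as a pair of functions; the function names are illustrative, and the inverse follows algebraically from (5-3) and (5-4).

    % Frequency <-> Bark conversion after [Traunmüller, 1990].
    function z = hz2bark(f)
        z  = 26.81*f ./ (1960 + f) - 0.53;     % equation (5-3)
        hi = z > 20.1;
        z(hi) = z(hi) + 0.22*(z(hi) - 20.1);   % correction (5-4)
    end

    function f = bark2hz(z)
        hi = z > 20.1;
        z(hi) = (z(hi) + 0.22*20.1) / 1.22;    % undo correction (5-4)
        f = 1960*(z + 0.53) ./ (26.28 - z);    % inverse of equation (5-3)
    end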

It is debatable what spacing (and hence number) of filters is needed to match the frequency resolution of the human ear. It has been suggested that humans can differentiate about 620 frequencies, equally spaced in the Bark domain ([Colomes et al, 1995], from [Zwicker and Feldtkeller, 1967]). However, it does not follow that there are 620 differentiable points along the BM. At lower frequencies, the firing rate of the inner hair cells will represent the frequency (or pitch, since this is what humans perceive), irrespective of the resolution of the BM. In addition, a pure tone of any audible frequency will excite a region of the BM wider than one Bark. Thus, if the spacing of the filters is equal to one Bark, it will be possible to estimate the frequency of a single tone by comparing the energy levels in adjacent filter-bands.


Hence, it is unnecessary to simulate 620 points on the BM via a bank of 620 gammachirp filters in order to match human perception. Instead, the number of filters is chosen such that all frequencies are transduced without significant inter-filter gaps. Choosing a spacing of ½ Bark causes all frequencies between 100 Hz and 16 kHz to be within 3 dB of the resonant peak of a filter (see Figure 5.5 (b)). Decreasing the filter spacing to ¼ Bark causes all audible frequencies to be within 1 dB of the resonant peak of a filter (see Figure 5.5 (c)). At this spacing, 96 filters are needed to cover the full audible range.

Figure 5.5: Spacing of gammachirp filters. (a) 1 Bark spacing (24 filters); (b) ½ Bark spacing (48 filters); (c) ¼ Bark spacing (96 filters)

5.3.4.2 Implementation issues

During the testing and calibration of the model, only a single gammachirp filter was used. For each task, this filter was centred on the particular frequency of interest.

Figure 5.6: Effect of Nyquist sampling on gammachirp filter bank. (a) ideal gammachirp filter bank; (b) sampled filter bank, fs = 44.1 kHz


Two problems arise when a gammachirp filter bank is generated in a digitally sampled domain. Firstly, filters acting at or near the Nyquist frequency (half the sampling rate) can behave unexpectedly. Figure 5.6 compares the response of an ideal filter bank with that of a filter bank sampled at 44.1 kHz. The response of all the filters increases slightly near the Nyquist limit (right hand side of the graph), but this deviation is trivial. More significantly, the lower stop-band of the highest frequency filter is raised by 7 dB compared to the ideal filter. When auditioning a typical coded audio signal (which is down at the noise floor by 20 kHz), this will cause the level in this band (due to leakage from lower frequency components) to be twice as loud as that in the adjacent band. Examining the impulse response of the filters reveals that the highest frequency filter lies closest to t = 0, and has a relatively high amplitude first sample. Windowing the start of the filter does not help, because any reasonable length window function will obliterate the majority of the non-zero impulse response. Using a sample rate of 96 kHz solves the problem, as shown in Figure 5.6 (a). This is one possible solution, though it is more convenient to operate the model at the same sample-rate as the input signal (usually 44.1 kHz).

However, a filter centred over 20 kHz is irrelevant for most audio signals and listeners, so may be ignored unless specifically required (e.g. when simulating the response of young listeners2, or a high sample-rate system). In addition, the hearing of older listeners can be crudely simulated by discarding several of the highest frequency filters, as appropriate.

2 Many researchers have argued that there are no auditory filters centred above 15 kHz, and that the ability of some listeners to hear higher frequency sounds can be attributed to the output of the 15 kHz centred filter (e.g. [Moore et al, 1997] and [Zhou, 1995]). Whilst this may be true for the majority of young listeners, it is difficult to believe that the exceptional individuals who can hear up to 25 kHz are detecting such frequencies through a 15 kHz centred filter, since such a filter attenuates 25 kHz by 90 dB! A 20 kHz centred filter would attenuate 25 kHz by 40 dB, so the presence of this filter in some gifted individuals is a more plausible explanation.


Figure 5.7: Ripple in filterbank due to finite frequency resolution of FFT

The second problem in the sampled implementation of the gammachirp filter bank arises from the normalisation of the filters to 0 dB at their centre frequency. This is carried out by FFTing each impulse response, and normalising to the maximum value. However, the FFT yields a discrete frequency domain representation. As with any discrete representation, the peak value of the FFT does not necessarily coincide with the true peak value of the filter response, which may lie between discrete points. The effect is most pronounced at low frequencies, where fewer frequency domain values define each filter. This results in errors in the normalised peak values, as shown in Figure 5.7. These errors are small, and could be ignored. However, they may be avoided by padding the impulse response with zeros before the FFT, which has the effect of oversampling the response in the frequency domain, and hence causes a value closer to the true peak value to be present.
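Continuing the earlier impulse response sketch, the normalisation with zero-padding might look like this (the padding factor of 8 is an arbitrary illustrative choice):

    % Normalise the filter to 0 dB at its resonant peak. Zero-padding the
    % impulse response oversamples the spectrum, so that a frequency-domain
    % sample falls much closer to the true peak of the response.
    N = length(g);
    G = fft(g, 8*N);            % zero-padded FFT (8x oversampled spectrum)
    g = g / max(abs(G));        % scale the peak of the response to 0 dB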

5.3.4.3 Amplitude dependence

The frequency response of each point on the basilar membrane is amplitude dependent. In the model, this is simulated by adjusting the shape of each gammachirp filter continuously in response to the input stimulus.


Figure 5.8: Dependence of excitation upon stimulus level. Upper: linear scale; lower: logarithmic scale. Left: filter shape varied according to output; right: filter shape varied according to input.

5.3.4.3.1 Physiological mechanism

In [Moore and Glasberg, 1987], it is shown that the response of each filter varies according to the level of the input to the filter, rather than its output. This is demonstrated in Figure 5.8 [Moore, 1995], where the left-hand panels show excitation patterns calculated on the assumption that the level at the output of each filter determines its shape, and the right-hand panels show excitation patterns calculated on the assumption that the level at the input of each filter determines its shape. Only the right-hand panels show the well-known upward spread of excitation in response to the louder signals.

This would be acceptable proof of a real physical mechanism, were it not for the fact that the assumption underlying the right-hand panels is physiologically impossible! To demonstrate this, consider a pure tone stimulus. The assumption suggests that a filter at one point on the basilar membrane changes its response as the level of the input to that filter increases. This behaviour is required even if the input signal is rejected by the filter in question. Recall from Chapter 3 that the filter response is partially due to the mechanical properties of the BM, and that the detection of the movement of the BM is entirely due to the hair cells upon it. Thus, it is impossible for the filter at any given point to "see" its input if that input is rejected by the filter, since a rejected frequency component will cause no oscillation of the BM at that point. The auditory system has no knowledge of the signal before the filter, since the detection mechanism lies after the filter3.

This suggests that the auditory system cannot change the shape of each filter in response to the input level. Unfortunately, [Moore and Glasberg, 1987] show that the output level of a given filter cannot explain its change in shape either.

The answer to this apparent contradiction is intuitively simple. The feedback mechanism within the auditory system responsible for changing the filter shape with stimulus level is (in all probability) driven from the outputs of all the filters. It seems likely that the mechanism determines the maximum output level in this manner, and then adjusts all the filters accordingly. This is physiologically sensible; the maximum displacement of the BM is detected, and the movement of the BM as a whole is damped in response. This explanation gives the required (known) excitation patterns, without suggesting that the auditory system must have some pre-filter knowledge of the input signal – knowledge which cannot be explained from known physiology.

5.3.4.3.2 Implementation

The output of each filter corresponds to the instantaneous displacement of the basilar membrane at that point. Hence, the largest output from the bank of filters is equivalent to the maximum instantaneous displacement of the basilar membrane. However, changing the filter shape in response to this instantaneous value would cause the filter shape to oscillate with the cycles of a sine wave input signal. This behaviour is undesirable, and causes the model to be unstable. For this reason, it is necessary to apply a peak detect and hold algorithm to the output of each filter before determining the maximum displacement value. Thus, the model tracks the envelope of each band, rather than the instantaneous amplitude.

3 It is likely that the active feedback mechanism within the cochlea (which causes the excitation pattern to be amplitude dependent) is driven by information from the stereocilia, rather than the inner hair cells themselves. However, the stereocilia are reacting to BM movement, so they are still "post-filter". Thus the argument remains valid whichever detector is employed.

The approximate frequency at the output of each filter is known (due to the gammachirp filter having an approximately band-pass response), so the waveform peaks in each band are separated by a known distance. Thus, the envelope can be tracked by "riding the peaks" of the waveform. This is implemented as follows. hw is defined as half the wavelength (in samples), given by

hw = fs / (2 fc)    (5-5)

where fs is the sampling frequency, and fc is the centre frequency of the gammachirp filter. Then the time-domain envelope in each band is given by

fb_pks(band, t) = max[ abs( fb_output(band, t−hw : t) ) ]    (5-6)

where fb_output is the time-domain output of the filter, and band is an index to each filter (band). A value of one half wavelength is used because the rectified (absolute) waveform contains two peaks per wavelength (cycle), and the algorithm only needs to look back to the previous peak to ride the envelope correctly. This gives a rapid response, whilst maintaining model stability.

Finally, the maximum envelope across all bands is calculated, thus

fb_pks_mx(t) = max[ fb_pks(1:nbands, t) ]    (5-7)

This gives the value of the maximum displacement of the basilar membrane, which is used to change the shape of the gammachirp filters.
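A direct realisation of equations (5-5) to (5-7) is sketched below; fb_output (a bands-by-samples matrix), the centre frequency vector fc, and fs are assumed inputs, and the variable names follow the text.

    % Track the envelope of each band by "riding the peaks" of the
    % rectified waveform, then take the maximum across bands.
    [nbands, T] = size(fb_output);         % filter outputs: bands x samples
    hw = round(fs ./ (2*fc));              % half wavelength per band (5-5)
    fb_pks = zeros(nbands, T);
    for band = 1:nbands
        for t = 1:T
            lo = max(1, t - hw(band));     % look back one half-cycle
            fb_pks(band, t) = max(abs(fb_output(band, lo:t)));   % (5-6)
        end
    end
    fb_pks_mx = max(fb_pks, [], 1);        % maximum displacement (5-7)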

The value of c in equation (5-1) is calculated from the peak value, fb_pks_mx(t), thus,

c(t) = 3.29 − 0.059 [ 20 log10( fb_pks_mx(t) + ina ) + ms ]    (5-8)


The numerical values in this equation were obtained by fitting the filter shapes generated by equation (5-1) to those measured using human subjects (shown in Figure 5.9), for the range of c and fb_pks_mx(t) covered by the human performance data. ina is included to prevent the calculation of log(0) during a silent input signal. The value of ina is just below the minimum audible threshold (i.e. inaudible), and has a negligible effect on supra-absolute-threshold calculations. ms refers to the playback level, as discussed in Section 5.3.1.

Figure 5.9: Amplitude dependent response of basilar membrane [Moore, 1995]

5.3.5 Hair cell transduction

At each point along the basilar membrane, its movement is transduced by a number of hair cells, as discussed in Chapter 3. In the model, this will be simulated by appropriate processing of the output of each gammachirp filter. Subsequent processing will take place in parallel, in as many bands as there are gammachirp filters (see Section 5.3.4.1).

5.3.5.1 Meddis Hair Cell model

Each individual hair cell yields little information about the incoming sound wave – it is only by examining the firing rates of hundreds of hair cells that a complete picture may be constructed. The hair cell model detailed in [Meddis, 1986], [Meddis, 1988], and [Meddis et al, 1990] simulates the random firing of individual cells, and the probability of each cell firing. A MATLAB implementation of this model is included on the accompanying CD-ROM.

A diagram of the model is shown in Figure 5.10. The model tracks transmitters, which are released from the hair cell into the Post Synaptic Afferent Fibre (which leads on to the auditory nerve). The probability of a transmitter being released from the hair cell is proportional to the number of transmitters available within the hair cell, and its instantaneous displacement. In the gap (Synaptic Cleft) between the cell and the fibre, some transmitters will be lost, and others will return to the hair cell, where they will be reprocessed and made available for subsequent re-release. The number of transmitters within the Synaptic Cleft determines the firing probability of the fibre4.

This model accounts for the hair cell response discussed in Chapter 3, as illustrated in Figure 5.11. At the onset of a sound, there are many transmitters available within the hair cell for release. These transmitters immediately flood the Synaptic Cleft, giving a large peak in the probability of firing. The number of transmitters in the Cleft decreases as the sound persists, and eventually decays to a steady state, where the number of transmitters released into the Cleft is balanced by the number lost or returned to the hair cell. With the correct parameters, the model provides a good match to measured mammalian individual hair cell performance [Meddis et al, 1990].

Figure 5.10: Schematic representation of the Meddis Hair Cell model

A single hair cell at each point along the BM could be simulated by processing the output of each gammachirp filter with the Meddis hair cell model. Unfortunately, the matter is complicated by two issues. Firstly, it is hypothesised that the range of motion of the BM may be compressed by the action of the outer hair cells. As discussed in Chapter 3, the exact compression mechanism is currently unknown. However, it is certain that a single hair cell (and hence the Meddis hair cell model) will be driven into saturation by the dynamic range of everyday sounds, unless the movement of the BM is compressed.

Figure 5.11: Response of Meddis Hair Cell Model

4 This apparently complex system is much simpler than previous models. For example, [Smith and Brachman, 1982] split each hair cell into 512 separate sites, each with its own transmitter factory. However, the simpler Meddis model accurately simulates a larger number of measured phenomena than previous models.

Secondly, there are at least three categories of inner hair cells, each operating over a different dynamic range, and each present at every point along the BM. An accurate functional simulation would require hundreds of simulated hair cells at the output of each gammachirp filter. The information from all these hair cells must then be re-combined into a single detection probability. This is a computationally burdensome task, and a less complex model of hair cell action is sought. Ideally, the model should include the compression of the BM, and yield the same hair cell firing probabilities for a given input as the Meddis model, but without the complex internal processing.

5.3.5.2 Adaptation

[Dau et al, 1996a] suggests a model which yields similar results to that of [Meddis, 1986] via a specially designed filter mechanism, taken from [Püschel, 1988]. The output of each gammachirp filter is half wave rectified, then low pass filtered at 1 kHz (see Figure 5.12). This partially simulates the response of the inner hair cells, but without taking account of their increased sensitivity to the onset of sounds. This change in sensitivity can be viewed as a type of adaptation. The next stage is to simulate this adaptation by a cascade of 5 feedback loops, each with a different time constant. For a given stimulus, this circuit simulates the response of the inner hair cells remarkably well. The output is the probability of firing, rather than the neural firing signal itself. However, the latter can be generated by driving a random process from the output of the circuit, if so desired. In the present model, the firing probability is used directly.

Figure 5.12: Circuit to simulate Hair Cell response

Whereas an individual inner hair cell can exhibit absolute saturation, this is not simulated by the adaptation model. An individual hair cell is said to have reached saturation if an increase in stimulus loudness fails to cause an increase in the firing rate of the cell. However, within normal listening conditions, the movement of the BM is restricted by the action of the outer hair cells such that the inner hair cells are not driven into saturation. It is only in a dead cochlea, where the active feedback mechanism is absent, that the saturation of the inner hair cells is relevant5. Hence, by simulating the inner hair cell response without saturation, this model is simulating normal hearing, and the compression of the BM is implicitly taken into account.

It has been shown that the overall measured adaptation of the human auditory system is well matched by the adaptation model shown in Figure 5.12 (from [Dau et al, 1996a]). The time constants used are 5, 50, 129, 253, and 500 ms.
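One possible realisation of this cascade is sketched below, assuming the divisive feedback structure of [Dau et al, 1996a]: each stage divides its input by a low-pass filtered copy of its own output, so onsets pass with high gain while steady-state inputs are compressed. The initial state and the floor value of 1e-5 are illustrative assumptions, not calibrated values.

    % Five-stage adaptation cascade (sketch).
    function out = adaptation_loops(in, fs)
        tau   = [0.005 0.05 0.129 0.253 0.5];   % time constants (s)
        a     = exp(-1 ./ (tau * fs));          % one-pole smoothing coefficients
        state = 1e-5 * ones(1, 5);              % assumed initial divisor values
        out   = zeros(size(in));
        for n = 1:length(in)
            x = max(in(n), 1e-5);               % keep the divisor chain positive
            for i = 1:5
                x = x / state(i);               % divisive feedback
                state(i) = a(i)*state(i) + (1 - a(i))*x;   % low-pass the output
            end
            out(n) = x;
        end
    end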

5.3.5.3 Internal Noise

The random firing of the inner hair cells, combined with the blood flow within the ear, gives rise to an internal noise that limits our absolute hearing threshold. The model as it stands contains no such internal noise. To account for this, an absolute threshold is fixed into the model after the rectification. Signals with a level below the threshold of hearing are replaced by the threshold value. This internal value is calculated from the MAF figures in [Robinson and Dadson, 1956], processed through the stages of the model.

[Dau et al, 1996a] state that a fixed internal threshold does not accurately model the HAS response to signals near the absolute threshold of hearing. However, it will be shown that the novel detection mechanism used in the current model yields accurate near-threshold behaviour. Two types of noise are simulated by fixed thresholds within the model; the internal noise discussed here, and the perceptual noise discussed in Section 5.4.2. For this reason, the internal absolute threshold discussed here is 3 dB lower than the figures of [Robinson and Dadson, 1956] would indicate, since the perceptual noise will raise the threshold by 3 dB.

The advantage of using a fixed threshold instead of internal noise is that the model will always yield the same result with a given input stimulus. If random noise were introduced, many repetitions of the same measurement would be required before a human performance estimate could be calculated. The aim of the model is not to simulate individual responses, which must then be subjected to time consuming repetition and averaging. Rather, the model should simulate averaged human behaviour.

5 The inner hair cells may also be driven into absolute saturation in the presence of very loud stimuli, or in elderly subjects where the active feedback mechanism has begun to break down. In the former, hearing loss is the likely result, whereas in the latter it has already occurred. This is not accounted for by the present model.

The internal noise does not have a uniform spectrum. Rather, the power of the noise increases at low frequencies (see [Soderquist and Lindsey, 1972]). There are two possible methods of simulating this feature using the current fixed internal threshold approach.

1. The internal threshold values in the lower critical bands may be increased, thus raising the thresholds in line with the internal noise.

2. The incoming audio signal may be filtered, thus simulating the reduced sensitivity to lower frequencies which arises from the internal noise.

The first method takes no account of the upwards spread of masking due to the internal noise, whilst the second method ignores the decreasing effect of internal noise with increasing stimulus level. The second method is implemented within the current model, but the first method can be implemented by setting appropriate fixed internal thresholds, and removing the butterworth high-pass filter described in Section 5.3.3. The second method was chosen because it can be readily implemented in combination with the middle ear response, simply by creating a filter which is the inverse of the MAF response. In addition, near-threshold low frequency artefacts are relatively uncommon in coded audio, so accuracy in this range is less important.

5.3.5.4 Outer Hair Cells

The outer hair cells are believed to respond to signals originating from higher processing centres. Together with the inner hair cells, they are thought to form an active feedback loop, often referred to as the Cochlea Amplifier, as discussed in Chapter 3. This hypothesised mechanism is thought to be responsible for three measurable effects: amplitude dependent spectral masking, compression of dynamic range, and in-ear generation of sounds.

The amplitude dependence of spectral masking within the HAS arises directly from the amplitude dependent nature of the excitation pattern upon the basilar membrane. This has been simulated directly by the amplitude dependent gammachirp filter bank, as discussed in Section 5.3.4.3. The mechanism employed is reminiscent of the proposed Cochlea Amplifier [Yates, 1995], indicating that the model may match the physiology.


The compression of dynamic range is simulated by the adaptation circuit discussed in Section 5.3.5.2.

The in-ear generation of sounds is not directly accounted for in the present model. This topic is extensively discussed in Appendix A, where audible in-ear generated sounds are shown to be unimportant when assessing coded audio. It is assumed that inaudible in-ear generated sounds, which can be detected using very sensitive measuring equipment, are irrelevant.

5.4 Perceiving a difference

Two performance metrics are important when assessing high quality coded audio. Firstly, given the original audio signal, and the coded version, is there any audible difference between the two? This question can be answered by calculating the probability of perceiving a difference between the two signals. The result gives an indication of the transparency of the coding process. Secondly, if there is an audible difference between the two signals, how annoying is this difference? If a human listener were asked to choose between two coded audio signals, both audibly inferior to the original, which would they prefer? This is a more complex question, which human listeners do not answer consistently. There have been some successful attempts to answer this question (e.g. [Hollier et al, 1995]), and these methods can be employed with the present model. This second question is addressed in Chapter 8. The first question is the subject of this section.

So far, we have simulated the passage of sound through the ear canal and cochlea – see Figure 3.1 (a-c). The output of the auditory model is a signal analogous to that transmitted along the auditory nerve. If two signals are processed independently by the auditory model, the difference between the two resulting outputs will be related to the perceived difference between the two signals. This concept is illustrated in Figure 5.13. The next task is to determine the processing that is necessary to yield an accurate indication of perceived difference.

Figure 5.13: General method for calculating the perceived difference between two audio signals

The outputs of the model are n time varying signals, where n is the number of gammachirp filters, as discussed in Section 5.3.4.1. Henceforth, n will be referred to as the number of bands. Calculating the perceived difference will involve comparing corresponding signals in each band, as shown in Figure 5.13.

This process will be calibrated using known human performance data, in the form of masked thresholds. This same criterion was used to test the performance of the Johnston model in Chapter 4. Masked thresholds are relevant to the assessment of coded audio. If the model can correctly predict the level at which one sound becomes audible in the presence of another, then it should also correctly predict the audibility of coding noise in the presence of the original signal. The similarity between determining the just noticeable difference, and determining the masked threshold is discussed in [Zwicker and Fastl, 1990], where the two situations are shown to yield similar internal differences. The possible problems of using masked threshold data in this manner are discussed in Appendix B.

Full details of the psychoacoustic experiments referred to throughout this section can be found in Appendix C. In a previous version of this model ([Robinson and Hawksford, 1999] and Appendix E), a smaller set of psychoacoustic experiments was used to calibrate the model. The absence of temporal pre-masking data resulted in a model that exhibited no pre-masking. This caused the model to be more sensitive to the onset of sounds than human listeners. For this reason, a new difference detection circuit is outlined below, which has been calibrated using all the psychoacoustic experiments listed in Appendix C.

5.4.1 Possible strategies

5.4.1.1 Calculating the perceived difference by simple subtraction

If the difference calculation in Figure 5.13 is a simple subtraction, then inaudible signal differences lead to large calculated perceived differences (CPDs). Further, if several "just audible" signal differences are processed in this manner, the peak calculated perceived difference for each signal-pair is inconsistent. For example, the at-threshold temporal post-masking 0 ms signal-pair gives a peak CPD of 0.2915, whereas the at-threshold tone masking noise signal-pair gives a peak CPD of 0.0288.

It is apparent that the peak difference between the internal representations is a poor predictor of perceived difference. A possible intuitive solution to this problem is to integrate the difference in some manner, but no time constant can reconcile the differences between the different psychoacoustic experiments.


Comparing the two quoted CPD values, the reader may doubt that any single detection criterion can be appropriate in all circumstances. This may be partially true, but a single detection criterion has been successfully used in several models, and one such approach will now be considered.

5.4.1.2 Calculating the perceived difference using signal detection theory

The model in [Dau et al, 1996a] correctly predicts the temporal pre- and post-masking thresholds, and the noise masking tone threshold, in addition to other psychoacoustic phenomena.

This model, in common with many others, assumes that some internal “perceptual noise” must be present, which is responsible for masking differences that would otherwise be audible. To detect the difference signal within this noise, an optimum detector is used. Signal detection theory states that there is no single optimum detector; rather, the choice of optimum detector depends upon the task [Green and Swets, 1966]. For example, if the target is a 1 kHz tone, and the masker is gaussian noise, then the optimum detector would average the signal every 1 ms, such that the target signal would reinforce itself, if present. A suitable decision criterion, based upon the output of the averaging process, would decide if the target was present, or absent. The optimum criterion would depend on the amplitudes of the signal and the noise. The mechanism that yields the greatest percentage of correct decisions is defined as the optimum detector.

In order to specify the optimum detector for a given task, the nature of the task must be known. The more complete the a-priori knowledge of the target and masker, the more successful the optimum detector will be. This is because the ability to detect a signal not only depends on the nature of the masking noise, but also on how completely the signal is defined within the detector. This is a fundamental limit of the theory of optimal detection.

Within Dau's model, optimal detection works well. The noise is added by the model itself, so the statistics of the noise are known exactly. The model is given access to a supra-threshold target + masker signal, and to the masker in isolation. Comparison of these two signals yields an internal representation of the target. Where a time-domain representation of the target is known, the optimum detector is convolution. Thus, each stimulus is convolved with the stored target to determine if the target is present within the stimulus. With the correct amount of internal noise, the target ceases to be detected at the measured masking threshold, thus matching human performance. Hence, an optimal detector is used to simulate a sub-optimal detector (the HAS) by the addition of noise.


Figure 5.14: Detection thresholds. (a) stochastic (b) deterministic

In this configuration, the model simulates a human listener exactly – that is, repeated threshold determinations are required before calculating the average. However, the internal perceptual noise can be replaced by a fixed internal threshold. This changes the model’s performance from stochastic (Figure 5.14(a)) to deterministic (Figure 5.14(b)), whilst maintaining one correct threshold (70.07% in Figure 5.14(a)). The optimal detection process is still applicable, because the target is known.

Unfortunately, this optimal detector is not suitable for audio quality assessment. In this application, the target is an unknown audio coding artefact, whereas the optimal detector can only search for a known target. Human listeners often undergo training, where they are sensitised to coding artefacts by listening to some highly audible examples. It may be possible to carry out the same process with an optimal detector, thus building an internal library of possible targets (artefacts). However, there is no guarantee that the same artefacts will be present in all subsequently auditioned material. Most importantly, new audio codecs often produce new and unfamiliar coding artefacts. It is not desirable to re-train the model for every new audio codec. Rather, the model should simulate the response of a trained expert human listener.

The difference between the original and coded signals could be anything. Where the target is unknown, it is unclear what form the optimal detector should take. For this reason, another approach is used in the current model.

5.4.2 Chosen Strategy

In the current model it is assumed that “perceptual noise” is present within the HAS. This noise limits the accuracy of the detection process, such that some of the information carried upon the auditory nerve is lost in the subsequent neural processing. However, adding noise within the model would create a stochastic detector. Though human responses are stochastic, the model is required to simulate averaged human behaviour. For this reason, the internal noise is replaced by a fixed detection threshold of 0.1 model units. Any differences below this threshold would be lost in internal noise, and hence inaudible.

All the psychoacoustic experiments in Appendix C quote 70.07% detection probabilities. Since this data is used to calibrate the model, the deterministic 0.1 model unit threshold is tied to a stochastic 70.07% probability of detection. The difference detector operates upon the output from the adaptation stage (5.3.5.2). Calibration of the detector is carried out from the data in Appendix C by comparing the masker + target at threshold with the masker in isolation, as processed by the model.

In the present model, no a-priori knowledge of the difference between the two signals is required. Instead, the model switches between two opposite methods that aim to remove the internal noise, in order to reveal any difference between the two signals. Though the current model does not add internal noise, the use of strategies that would be appropriate for removing internal noise yields a close agreement with measured data. This indicates that this approach may be used within the human auditory system.

The two noise reduction methods are as follows. The first assumes that there is a degree of temporal uncertainty within the HAS, since the two signals are presented sequentially, rather than simultaneously. For an input signal x, internal noise of amplitude ∆a will be equivalent to a temporal uncertainty of ∆t if ∆t·dx/dt ≈ ∆a. From the temporal masking data in Appendix C, it is calculated that, for ∆a = 0.1 model units, ∆t = ±5 ms. This means that any signal difference which can be cancelled out by time-shifting one signal by up to ±5 ms is inaudible due to internal noise.

The second noise reduction method assumes that the internal noise has a zero mean. If this is true, then integrating the signal will cause the noise to cancel, whilst any persistent signal difference will reinforce. This matches the measured phenomenon whereby the threshold of detection for a pure tone decreases as the duration of the pure tone increases.

Neither noise reduction method is appropriate in all situations. For example, the first method will not detect a quiet, but persistent signal difference, whereas the second method will not detect amplitude modulation. However, in combination, these two methods will detect the following “just audible” signal differences:

1. noise in the presence of a tone

2. a tone in the presence of noise

3. a short gap within noise

4. a short tone after noise

5. a short tone before noise

6. a 1 dB level increment in a tone

All sounds are either entirely noise-like, entirely tone-like, or a combination of the two. All artefacts consist of the addition of noise, the addition of tones, the subtraction of noise, the subtraction of tones, or a change in the amplitude of a signal component. Hence, this list covers most possibilities. All of these detection tasks are correctly undertaken by the two detection methods described above. Other (untested) tasks may also be handled correctly by these two detection methods.


Figure 5.15: Difference detection circuit

The circuits for implementing the two detection methods are shown in Figure 5.15, and are described below.

The first method is appropriate for complex, or time-varying, input stimuli, and is denoted "noise" in Figure 5.15. The two signals are compared in the following manner. For each incoming sample, the current sample of the first input is subtracted from every sample of the second input within a range of ± 5 ms. The minimum absolute difference yields the instantaneous calculated perceived difference (CPD) value. A CPD greater than 0.1 model units indicates that an audible difference has been detected.
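As a minimal sketch, this comparison might be implemented as follows in MATLAB, assuming a and b are the two adaptation outputs (equal-length vectors sampled at fs); the variable names are illustrative and are not taken from the thesis code.

    fs = 44100;                     % sampling frequency / Hz
    w  = round(0.005 * fs);         % +/- 5 ms search window, in samples
    threshold = 0.1;                % detection threshold / model units
    cpd = zeros(size(a));
    for n = 1:length(a)
        lo = max(1, n - w);
        hi = min(length(b), n + w);
        % smallest difference obtainable by time-shifting within +/- 5 ms
        cpd(n) = min(abs(a(n) - b(lo:hi)));
    end
    audible = any(cpd > threshold);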

The second method is appropriate for simple, or time-invariant, input stimuli, and is denoted "tone" in Figure 5.15. The two signals are compared in the following manner. The simple difference is summed over ± 5 ms. The result is integrated with a time constant of 200 ms, and the absolute value is calculated. In a parallel process, the order of these two steps is reversed. The two parallel results are scaled and summed. Finally, the calculated perceived difference is given by this value, or by the instantaneous simple difference, if that is larger.
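Under the same assumptions as the previous sketch, the "tone" method might be realised as below. The scaling constants k1 and k2 are placeholders: the thesis calibrates the real circuit against the data in Appendix C.

    fs = 44100;                                 % sampling frequency / Hz
    d  = a - b;                                 % instantaneous simple difference
    w  = 2*round(0.005*fs) + 1;                 % +/- 5 ms summation window
    dsum = filter(ones(w,1), 1, d);             % running sum over the window
    alpha = 1 - exp(-1/(0.2*fs));               % 200 ms leaky integrator coefficient
    intThenAbs = abs(filter(alpha, [1, alpha-1], dsum));  % integrate, then rectify
    absThenInt = filter(alpha, [1, alpha-1], abs(dsum));  % rectify, then integrate
    k1 = 0.5; k2 = 0.5;                         % placeholder scalings
    cpd = max(k1*intThenAbs + k2*absThenInt, abs(d));     % larger of tone CPD and
                                                          % instantaneous difference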

119 MONOPHONIC AUDITORY MODEL

The model determines which strategy is most appropriate by examining the two input signals, using the circuit denoted “index” in Figure 5.15. The variance of each signal is computed, and the smaller value is used to calculate an index. As the index tends to zero, the “noise” circuit is used, and as the index tends to one, the “tone” circuit is used. Most real-world signals are somewhere between the two. The input signals to the detection circuit are the outputs from the adaptation section. These will be stationary (i.e. the variance will tend to zero) for pure tone inputs above 100 Hz.
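A sketch of the selection is given below, following on from the two sketches above (cpd_noise and cpd_tone denote their outputs). The mapping from the smaller variance to a 0–1 index, and the constant v0, are assumptions for illustration; the text only fixes the two limiting cases.

    v0 = 0.01;                          % placeholder variance scale / model units^2
    v  = min(var(a), var(b));           % smaller of the two signal variances
    index = 1 ./ (1 + v/v0);            % stationary (tone-like) inputs: v -> 0, index -> 1
    cpd = index .* cpd_tone + (1 - index) .* cpd_noise;   % blend the two detectors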

In summary, the detection circuit switches between two possible detection strategies, depend- ing on the nature of the input. With a steady-state input, the difference detector integrates the difference over time, thus averaging the internal noise to reveal any persistent signal differ- ences. With a time-varying input, temporal uncertainty within the HAS becomes significant, and only instantaneous differences which cannot be accounted for by temporal uncertainty or internal noise are detected.

5.5 Implementation Speed

The model described in this chapter processes a monophonic audio signal at 0.0001x real time on a P-II 300 MHz PC under Windows 95. This means that the model processes one second of audio in two hours. To compare two audio extracts, each of one second in length, would take over four hours. This rate of processing, whilst typical amongst auditory models6, is not very useful for the assessment of coded audio.

Most of the computational burden is due to the interpreted nature of the MATLAB environment. If the code were translated into C++ and compiled, significant speed increases would certainly be achieved. Within the MATLAB environment, some of the standard functions are written in C, whilst others are themselves interpreted code. Re-writing the code to make maximum use of compiled functions in place of interpreted code offers significant speed benefits, though the computation speed remains prohibitively slow.

6 In experimental psychoacoustic models, it is usual to process a single critical band. This approach is not suitable for the assessment of coded audio, since the entire audio bandwidth must be assessed.


The following changes were made in order to produce a “fast” version of the model, which would give similar results to the full version, but in a fraction of the time.

1. The code was transferred to a DEC Alpha server, running MATLAB 5.1. This offered a 5.5x speed increase over the 300 MHz PC, without necessitating any changes to the code.

2. The amplitude dependent (gammachirp) filter bank is replaced by an amplitude independent (gammatone) filter bank. This filterbank is implemented by the MATLAB "filter" function, which is written in C. The length of each filter within the filterbank is adjusted to be appropriate for the frequency in question (a sketch of this calculation follows the list), such that the length of the filter fl in samples is given by

fl = min(2048, round((fs / fc) * 20))    (5-9)

where fs is the sampling frequency, and fc is the centre frequency of the filter.

3. The number of bands is reduced from 96 to 48. A further reduction to 24 bands would be possible, but would cause 80 Hz to be attenuated by 15 dB, due to its position halfway between two filters. 48 bands is a good compromise, as shown in Figure 5.5.

4. All constants are pre-calculated once, and stored on disk.

5. The final stage of the adaptation circuit is implemented via MATLAB’s “filter” function.

6. The Calculated Perceived Difference is determined once every 25 samples, rather than every sample. The adaptation output is integrated with a time constant of 20 ms, so significant sub-sampling of the signal is possible without losing information. However, to prevent the true peak CPD from being significantly larger than the maximum sampled value, it is unwise to increase the sub-sampling ratio beyond 25:1 for fs = 44.1 kHz.
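As promised after item 2, the filter-length calculation of equation (5-9) is sketched below. The centre frequencies used here are illustrative only; the thesis spaces the 48 bands across the audible range.

    fs = 44100;                                   % sampling frequency / Hz
    fc = logspace(log10(50), log10(18000), 48);   % illustrative centre frequencies / Hz
    fl = min(2048, round((fs ./ fc) * 20));       % filter length in samples, per band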

These modifications increase the execution speed from 0.0001x real time to 0.02x real time. This means that the fast version of the model processes one second of audio in 45 seconds, as opposed to two hours. This fast version of the model has been validated to ensure that it still matches human performance in many areas.

Amplitude dependent filtering is the most significant feature absent from the fast version of the model. However, the amplitude dependence, number of bands, and CPD sub-sampling can all be specified or overridden by the user via input arguments to the model – see the MATLAB code on the accompanying CD-ROM for further details.


Figure 5.16: Model prediction of the absolute threshold of hearing
The measured responses of human listeners are compared with the response of the model.

5.6 Demonstration

In order to demonstrate the performance of the model, the graphical results from a small selection of psychoacoustic tests are included in this section.

5.6.1 Threshold of audibility

The minimum audible field, or absolute threshold, represents the quietest sound of a given frequency that is audible to a human listener. This has been measured many times. The measurements from [Robinson and Dadson, 1956] and [ISO 389-7, 1996] are compared with the performance of the model in Figure 5.16. The model matches human performance very well above 1 kHz. Below this frequency, the model exhibits a lower threshold than a human listener. This discrepancy is due to the absence of low frequency internal noise within the model, as discussed in Section 5.3.5.3.


The ripple within the model prediction is due to the use of 48 discrete gammachirp filters. Where the test frequency matches the centre of one of the filters, the threshold is at its lowest, and where the test frequency lies halfway between two filters, the threshold is at its highest. The ripple can be reduced by increasing the number of gammachirp filters. The choice of 48 filters in the fast version of the model is a compromise between execution speed and accuracy, which the user can easily override.

5.6.2 Spectral Masking

If a narrow band of noise is introduced at 1 kHz, the threshold is increased around this region, as shown in Figure 5.17, which can be found on the next page. This is a demonstration of spectral masking, where any sound below the indicated threshold will be inaudible in the presence of the noise. This graph was produced by comparing the noise in isolation to the noise plus a test tone. The curve indicates the just audible level of the test tone in the presence of the noise.


Figure 5.17: Noise masking tone threshold exhibited by the model

To determine this threshold, the model takes the place of a human listener in a simulated psychoacoustic experiment. The model compares the noise in isolation with the noise plus a test tone. If the model judges that the test tone is audible, then the level of the test tone is decreased. If the model judges that the test tone is inaudible, then the level of the test tone is increased. The comparison is then repeated with the test tone at the new level.

The model is deterministic, so there is a constant threshold, below which the signal is always judged to be inaudible, and above which the signal is always judged to be audible. When the level of the test tone converges on this value (to within 0.1 dB), the value is stored, and the frequency of the test tone is increased. Thus, the threshold is determined across the audible frequency range, as shown on this graph.
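A minimal sketch of this simulated experiment is given below. The function model_detects is a hypothetical wrapper that runs both signals through the auditory model and applies the 0.1 model unit criterion; a halving-step search is one simple way to realise the convergence, since the model is deterministic.

    level = 60;                                   % starting tone level / dB
    step  = 8;                                    % initial step size / dB
    while step > 0.1                              % converge to within 0.1 dB
        tone = target * 10^(level/20);            % scale a unit-level test tone
        if model_detects(masker, masker + tone)
            level = level - step;                 % audible: reduce the tone
        else
            level = level + step;                 % inaudible: raise the tone
        end
        step = step / 2;
    end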

This simulated psychoacoustic test procedure was developed to demonstrate the model, and to plot Figure 5.16 and Figure 5.17; it is not used in the assessment of coded audio.

5.6.3 Amplitude dependent filter response

A novel feature of this auditory model is the amplitude dependent nature of the filterbank. The response of the filters varies as shown in Figure 5.18.

It can be seen that, as the loudness of the stimulus increases, the attenuation below the centre frequency is reduced.

Figure 5.18: Amplitude dependent response of monophonic model

A full list of the psychoacoustic experiments that the model has correctly simulated can be found in Appendix C.

5.7 Conclusion

An auditory perceptual model for the assessment of coded audio signals has been developed. The model simulates the functionality of the physiology found within the human ear. The passage and transformation of sounds from the free field to their neural representation has been accounted for in the model. Finally, the performance of a novel detection network was calibrated using known psychoacoustic data.

6 Spatial Masking

6.1 Overview

The majority of masking experiments concentrate on monaural listening; that is, the test stimulus is applied to one ear only, or an identical stimulus is applied to both ears. Human hearing is binaural, and in real listening environments we rarely experience identical signals at both ears. Where the target signal and masker are separated in space, unmasking of up to 20 dB can occur, compared to the monaural case. In this chapter, we review existing binaural masking data, and describe a psychoacoustic test in which the spatial masking properties of the human auditory system are investigated.1

6.2 Introduction

A large amount of quantitative data is available which describes the masking of signals within the human auditory system. The audibility of tones in the presence of broadband noise (e.g. [Bernstein and Raab, 1990]), narrowband noise in the presence of tones (e.g. [Moore et al, 1998]), and tones following bursts of noise (e.g. [Zwicker, 1984]) have all been extensively studied. Through such studies, the spectral and temporal masking properties of the human auditory system have been rigorously probed and quantified. However, almost all of these experiments have employed monophonic stimuli in which the target and masker are co-located. Historically, both stimuli were transduced by a single loudspeaker. More recently, headphones

1 The experiment described herein was carried out jointly with Prof. Woon Seng Gan, Associate Professor of Electrical and Electronic Engineering, Nanyang Technological University, Singapore.

have found favour in psychoacoustic experiments, and all stimuli are presented to one ear in isolation, or to both ears simultaneously (diotic presentation). Whatever the method of presentation, the masker and target are perceived as originating from the same location, whether that is the real location of an external transducer or a perceived location within the listener's head.

In real listening situations, the signal we wish to hear (e.g. someone talking) rarely originates from the same location as the background noise (e.g. air conditioning). Likewise, in a stereophonic or multi-channel audio system, there is no guarantee that the unwanted noise and distortion will be co-located with the desired signal at any given time. Hence, existing monophonic masking knowledge is insufficient to predict the audibility of coding errors under such circumstances. For this reason, it is necessary to examine the audibility of one signal in the presence of another signal at a separate location.

There have been many experiments to investigate the masking of lateralized sounds (for a review, see [Durlach and Colburn, 1977], section IV, part C). In these experiments, the masker and target are presented via headphones, and are perceived as being located at two separate points within the listener's head. The perceived interaural location of the stimuli is controlled by manipulating the interaural time delay (ITD) and interaural level difference (ILD) for each stimulus. At the extreme, the masker may be applied to one ear, while the target is applied to the other. Alternatively, the target and masker may be applied to both ears at equal loudness, and a delay added to the target signal at one ear only, yielding a phase difference between the two ears. Usually, some combination of ITD and/or ILD is used to lateralise the target and masker at different locations within the listener's head, and the resulting masked threshold is compared to the monophonic case.

To concisely summarise all available data, it can be stated that the target is more audible when it is spatially separated from the masker. Hence, compared to the presentation of the masker and target at a single perceived location (e.g. diotic presentation), the amount of masking is reduced. This is referred to as the Binaural Masking Level Difference (BMLD).

Such data indicates that processing non-co-located stimuli using a monaural auditory model may overestimate the amount of masking. To put it another way, a monaural model may predict that something is inaudible, when, due to its perceived location away from the signal, it may be highly audible.


Unfortunately, the masking due to stimuli presented dichotically over headphones tells us little about the masking due to sounds in the free field, i.e. presented via loudspeakers. This is the more usual situation when listening to reproduced audio material, and is the target use for our model. To extend our auditory model to incorporate human binaural hearing, we require data that extends the usual noise masking tone, tone masking noise, and temporal masking experiments to three dimensions. With such data, the model can be extended, calibrated, and verified to correctly predict the masking due to real sound sources in real listening rooms.

Data from experiments carried out via loudspeakers is scarce. A summary of the most useful references is given in Table 6.1, which can be found on the next page.

This existing data does not provide enough information on which to base a model of free-field spatial masking. Most importantly, all the experiments study noise masking tone, or noise masking impulses. There is no tone masking noise, or temporal masking data available for spatially separated stimuli. In addition, individual experiments have the following problems.

• [Colburn and Durlach, 1965] and [Carlile and Wardman, 1996] are typical of the many sets of headphone-obtained data available, though the latter is interesting in that the stimuli were placed at "virtual locations" via Head Related Transfer Function filtering. Neither gives true spatial masking data.

• [Doll and Hana, 1995] and [Santon, 1987] use only 3 different masker locations, and 1 target location. The latter uses a variety of target tone frequencies.

• [Gilkey and Good, 1995] and [Saberi et al, 1991] give the most thorough spatial data, testing various target and masker locations. However, the target is a series of band-limited clicks, rather than a pure tone. Such data is useful for validating a working model, but pure tone stimuli are more useful for gaining an understanding of the underlying auditory process at the most basic level.

(this list is continued after Table 6.1.)

Carlile + Wardman, 1996
  Target: tone; 0.6 kHz, 4 kHz; diotic.
  Masker: narrowband / broadband noise; virtual 40°, 90°.
  Notes: headphone presentation. HRTF processed masker, sometimes filtered to 1 critical band.

Colburn + Durlach, 1965
  Target: tone; 500 Hz.
  Masker: noise; broadband; diotic.
  Notes: headphone presentation. ILD and ITD varied; lateralised stimuli.

Doll + Hana, 1995
  Target: tone; 0.5 kHz, 4 kHz; 0°.
  Masker: notched noise, at least 1 octave around target; 0°, 20°, 40°.
  Notes: speaker presentation. Notch width = 0 Hz condition yields free-field masking data.

Gilkey + Good, 1995
  Target: click train; 0.387–0.938 kHz, 1.72–4.70 kHz, 5.20–14.0 kHz.
  Masker: noise; 0.238–1.33 kHz, 1.14–7.10 kHz, 3.53–19.1 kHz.
  Notes: speaker presentation. Low, medium, and high-frequency data; target and masker locations both varied (angles tested include 0°, 8°, 45°, 80°, 90°, 100°, 135°, 180°).

Kidd et al, 1998
  Target: 8-part tonal sequence using 1 of 16 frequency ranges within 200–6500 Hz; 0°, ±30°, ±60°, ±90°.
  Masker: synchronised noise bursts; 215–6112 Hz; 0°, ±30°, ±60°, ±90°.
  Notes: speaker presentation. 1 of 6 possible tonal sequences must be identified; target–masker separation reported, rather than actual locations.

Saberi et al, 1991
  Target: click train; 0.9–9.0 kHz.
  Masker: noise; 0.7–11 kHz.
  Positions: varied in 10° and 30° increments horizontally; 30° increments and 0° only vertically.
  Notes: speaker presentation. Horizontal and vertical plane measurements.

Santon, 1987
  Target: tone; 0.25, 0.5, 0.75, 1, 1.5, 2, 2.5, 3, 4 kHz; 0°.
  Masker: noise; broadband; 0°, 45°, 90°.
  Notes: speaker presentation. Only masker location varied.

Table 6.1: Summary of spatial masking experiments from the literature

• [Kidd et al, 1998] set an identification task, where the listener is required to identify the masked signal as one of 6 possible tonal sequences. This may or may not yield similar masking thresholds to a pure detection task. A greater problem is that the paper quotes the separation between masker and target, rather than their individual locations. The author believes that averaging the data for target left masker front, target front masker right, and target half-left masker half-right into a single 90° measurement destroys the usefulness of the data for the present work.

Overall, some of these experiments ([Santon, 1987], [Doll and Hana, 1995]) provide useful reference points, and others ([Gilkey and Good, 1995], [Saberi et al, 1991]) may be used to verify the performance of the completed model. However, the lack of clear, complete information on which to base and calibrate a model means that an investigation into spatial masking must be carried out.

6.3 Spatial Masking Experiment

6.3.1 Aims and parameters of experiment

The aim of the current experiment is to investigate the spatial masking properties of the human auditory system. This knowledge will be incorporated into an auditory model, to predict the audibility of one sound in the presence of another, where the two sounds may be located at separate points in space. This auditory model will then be used to predict the perceived sound quality of multi-channel coded audio, replayed in a real 3-dimensional listening environment. The exact details of the spatial masking experiment are set out in the next section. In this section, we define the parameters of the experiment, based on the above-stated aim.

The number of spatial locations considered must be large enough to yield adequate psychoacoustic data for the training of an auditory model, but small enough to be practical. Without moving the speakers during the test, the number of stimulus locations is limited by the available number of identical speakers, which is 8. Due to the equipment and time available for the test, all stimuli are located within a horizontal plane encompassing the listener's ears. This allows normal speaker stands to be used in the experimental set-up. Hence, vertical target/masker separations are not considered. As human hearing is assumed (without impairment) to be left/right symmetrical, arranging the speakers in a semicircle to one side of the listener yields the best angular spatial coverage in the chosen horizontal plane. The exact locations are chosen to match an existing measured set of HRTFs. After the experiment, the at-ear signals caused by each stimulus presentation can be generated by processing the stimuli via the appropriate HRTFs. This capability will be used in the training of the auditory model.

The psychoacoustic tests should extend existing knowledge in the area of masking. The total number of possible tests is constrained by the time allowed for the actual listening (2 weeks in an anechoic chamber), and the duration of each test. The most important tests are those of spectral masking, namely noise masking tone, and tone masking noise. There is time to investigate both phenomena at two different frequencies. For broad-band noise masking a tone, the tone frequencies considered are 1 kHz, where the interaural level difference is the prime localisation cue, and 8 kHz, where the Head Related Transfer Function (HRTF) is the prime localisation cue. For tone masking narrow-band 1 kHz centred noise, the tone frequencies considered are 1 kHz and 1.5 kHz, the latter giving an insight into masking of spectrally separated stimuli.

Temporal masking is also considered, since the correlation between the masked threshold and the spatial and temporal separation of sounds has never been studied. [Zwicker, 1984] studied temporal masking for masker levels of 80 dB, 60 dB and 40 dB; tone frequencies of 2 kHz and 8 kHz; masker durations of 5 ms, 10 ms, 30 ms, and 200 ms; and masker/target temporal separations of 0 ms, 2.5 ms, 5 ms, 7.5 ms, 10 ms, 20 ms, 50 ms, and 100 ms. That is, 192 separate masking experiments. To extend this knowledge to 7 different target locations would require 1344 separate masking experiments, each carried out on at least two subjects, at least twice. Each determination of masked threshold takes around 1 minute. Hence, each subject would need to be tested for 45 hours (plus breaks between testing), and the testers would be present for at least 90 hours. This is probably a severe underestimate of the required time, and is well beyond the 2 weeks available. Hence, we will take a single point from the tests of [Zwicker, 1984] and investigate how the masking due to a particular frequency and temporal delay is dependent on the location of the target tone.


(a) discrete coding of signals (b) matrix coding of signals

Figure 6.1: Coding of surround sound signals

In the discrete case, the front and rear channels are coded via the left and right channels of a stereo codec. In the matrixed case, the front and rear channels are matrixed to the sum and difference of the stereo channels (as in Dolby surround, though the commercial system includes several refinements not implemented here). These stereo channels are coded, decoded, dematrixed and scaled (not shown) to give the front and rear channels.

In addition to these basic psychoacoustic tests, two more experiments are carried out; one to explore the use of audio codecs in multi-channel reproduction systems, and the other to provide human performance data with which to verify the completed model. A major concern when using psychoacoustic based codecs with multi-channel reproduction systems is that programme material and coding noise may become spatially separated due to matrixing operations (e.g. Dolby Stereo encoded material) or coding errors. This scenario is discussed extensively in [Gerzon, 1991], and is relevant when transmitting a film with a Dolby surround soundtrack over a digital TV system employing MPEG-1 layer II audio compression. In one experiment, a surround signal is matrixed, coded, decoded, and then de-matrixed, as shown in Figure 6.1. The audibility of coding errors in the de-matrixed signal is investigated, and compared to those present in a signal transmitted discretely via the same codec. In another experiment, the coding noise and programme material are intentionally reproduced at separate locations (Figure 6.2), and the relationship between the location and audibility of the coding noise is investigated.

Figure 6.2: Spatially separated codec noise
The original signal is fed to the front loudspeaker. The noise introduced by the codec is isolated by subtracting the original signal from the time-aligned coded version. This codec noise is fed to each of the speakers in turn, and the threshold of audibility in the presence of the original signal is determined. The pre-processing is carried out off-line before the test.

6.3.2 Details of experiment

6.3.2.1 Psychoacoustic phenomena investigated

As discussed in the previous section, the following psychoacoustic tests were performed:

Exp. No. | Psychoacoustic test             | Masker                              | Target
1        | Noise masking tone              | Broadband (white) noise, 35 dB/Hz   | 1 kHz tone, 80 dB
2        | Noise masking tone              | Broadband (white) noise, 35 dB/Hz   | 8 kHz tone, 80 dB
3        | Tone masking noise              | 1 kHz tone, 80 dB                   | 1 kHz centred, 80 Hz wide noise, 80 dB
4        | Tone masking noise              | 1.5 kHz tone, 80 dB                 | 1 kHz centred, 80 Hz wide noise, 80 dB
5        | Temporal masking                | 200 ms white noise, 80 dB           | 2 kHz tone, 5 ms, 80 dB (10 ms after noise)
6        | De-matrixed coded audio         | Surround sound signal               | Codec noise
7        | Spatially separated codec noise | Music                               | Codec noise

6.3.2.2 Generation of test stimuli

All stimuli were sampled at the CD-quality standard, with a sampling frequency of 44.1 kHz. All synthetic stimuli were generated using an internal floating-point representation of 32 bits, and dithered to 16 bits for output. The start and end of all stimuli were Hanning windowed with a rise time of 10 ms, except for the temporal masking experiment, where the masking noise was rectangular windowed, and the target tone was Hanning windowed with a rise time of 1 ms. The audio codecs used in experiments six and seven were [Syntrillium, 2000] (mp2) and [FhG, 1997] (mp3). The audio samples for experiment six were taken from [EBU, 1988]. The "Drum" sample consists of 1 bass drum hit from track 29 placed in the rear channel, followed by 1 cymbal hit from track 31-04 placed in the front channel. The "Speech" sample consists of two words from track 50: the word "Christmas" placed in the rear channel, followed by the word "Dinner" placed in the front channel. The audio sample for experiment seven was taken from [Tampera – Feel it], the initial drum hit at 00:01, faded to 70 ms length, and reduced in volume by 10 dB to match the other stimuli.
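A minimal sketch of the synthetic stimulus generation is given below. The 44.1 kHz rate, 10 ms raised-cosine ramps and 16-bit output follow the text above; the TPDF dither implementation is an assumption, as is the choice of a 1 kHz tone as the example stimulus.

    fs = 44100;                                  % sampling frequency / Hz
    t  = (0:fs-1)'/fs;                           % a 1 second stimulus
    x  = sin(2*pi*1000*t);                       % e.g. a 1 kHz tone
    nr = round(0.010*fs);                        % 10 ms rise/fall time
    ramp = 0.5*(1 - cos(pi*(0:nr-1)'/nr));       % half-Hanning onset ramp
    x(1:nr)         = x(1:nr)         .* ramp;         % fade in
    x(end-nr+1:end) = x(end-nr+1:end) .* flipud(ramp); % fade out
    q = 2^-15;                                   % 16-bit quantisation step
    dither = q*(rand(size(x)) - rand(size(x)));  % triangular (TPDF) dither
    x16 = round((x + dither)/q) * q;             % dithered 16-bit version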


Historically, wherever noise stimuli were employed in psychoacoustic experiments, analogue random noise generators were employed, with appropriate gating and filtering. A consequence of this practice was that each presentation of masker or target noise was different. With the advent of digitally sampled stimuli, the same effect can only be achieved by generating many different samples of noise. In the present experiment, a single noise sample is used throughout each experiment. This changes the signal detection task slightly, as the ear can learn the characteristics of a repeated noise burst in a way that is impossible with a sequence of different noise samples (see Appendix B). However, this disadvantage is offset by two advantages. Firstly, it reduces the complexity of the experiment, and the requirements for storage on the audiotest PC. Secondly, it facilitates the subsequent reconstruction of the exact signals heard by the human listener during the test. This will aid the auditory modelling process, but more importantly, will speed up the model testing and verification process significantly. If different noise samples are used, then the statistical spread of the data points is a function of the random nature of the noise, as well as the stochastic process of human hearing and decision. To simulate this, the auditory model must also "listen" to many different samples of noise, and examine the spread to give a probability of detection. However, with a single noise sample, the model only needs to listen to that noise once – it has no need to repeat the process to account for human error, since it can account for the stochastic nature of human hearing internally. Hence, by using a single noise sample, we greatly simplify the validation of the model.

All the targets and maskers used throughout the spatial masking experiment are included on the accompanying CD-ROM.

6.3.2.3 Experimental set-up

The experiment was carried out at BT Laboratories, Martlesham Heath, Ipswich. The experimental set-up is illustrated in Figure 6.3 (overview) and Figure 6.4 (detail). The listener was seated in an approximately 6 m x 5 m x 4 m anechoic chamber. An anechoic environment is essential for the temporal masking tests, and desirable for all others, so that the stimuli arriving at the listener's ears are uncontaminated by early reflections and reverberation. The walls, ceiling, and floor of the anechoic chamber are lined with foam wedges, as is the inside edge of the motorised door. With the test equipment in place, and the door partially open to allow emergency access, the background noise at the position of the test subject was measured as 20 dB A-weighted, using a B+K SPL meter fitted with an omni-directional microphone capsule.


Figure 6.3: Apparatus used throughout the spatial masking experiment - overview

A listener sits within the anechoic chamber, surrounded by loudspeakers. All internal faces of the anechoic chamber are covered by sound absorbing green foam wedges (only the back walls and floor are shown as such in this diagram). A wire mesh acts as a supporting floor within the anechoic chamber. During testing, the square aperture in the front right wall is blocked by a motorised door (not shown). Outside the chamber, the control PC and audio equipment are supported upon a table. Audio connections are shown, as follows: Red = digital, Blue = line-level, Green = speaker.

The experiment was administered via a Pentium II based 300 MHz PC system, running Windows 95 (SR-2.5). Many aspects of the test, including stimulus presentation, the collection and processing of user response data, and the automated training of subjects, were controlled by a custom designed software suite, called "audiotest", which is included on the accompanying CD-ROM. The only items located within the anechoic chamber were a seat for the listener, the eight loudspeakers and stands, a PC monitor display, and a cordless PC mouse. All equipment stands within the chamber were covered with sound absorbing material.

Figure 6.4: Apparatus used throughout the spatial masking experiment – detail

The audio data was sent via a CardD+ digital soundcard to an Akai DR-8 hard disc recorder, which was used for Digital to Analogue conversion and channel switching. The PC soundcard offered two synchronised digital audio output channels (stereo left and right), and these were mapped to any two of eight channels using the DR-8. The eight channels of analogue output from the DR-8 were fed, via passive pre-amps, to a series of LFD PA-1 power amplifiers, and finally to the eight Audio Physic Step loudspeakers within the anechoic chamber. The eighth loudspeaker (behind, and to the right of the listener) was only used in the matrixed audio test – all other tests used the semicircle of seven speakers to the listener’s left.

6.3.2.4 Calibration

The playback level was calibrated such that a digital full scale 1 kHz sine wave replayed via one loudspeaker produced a sound pressure level of 90 dB at the position of the listener's head within the anechoic chamber. This calibration was carried out for each channel individually by measuring the SPL with a B+K sound pressure meter, and adjusting the passive stepped attenuator as appropriate. Having calibrated every channel in this manner, all eight attenuators were found to be set at identical positions. This confirmed that the eight amplifiers and speakers were matched to better than 1 dB.

6.3.2.5 Subjects

A total of six subjects were used for the listening test. The hearing of each subject was tested using a B+K audiometer, calibrated to ISO R 389 (1964). Four subjects were found to have perfect hearing up to 8 kHz (the upper limit used during these tests), whilst the other two showed some hearing loss above 4 kHz. The two hearing impaired subjects were only used in tests employing 1 kHz targets2; however, their results will be treated with caution.

2 Each subject's hearing was measured at a convenient time between tests. Unfortunately, one hearing impaired subject had taken part in experiment two before their hearing had been tested. Their data for this experiment was rejected, as noted in Section 6.3.4.


Figure 6.5: Transformed up/down tracking procedure

6.3.2.6 Threshold determination

Each masking threshold was determined by a 3-interval, forced choice task, using a one-up, two-down transformed staircase tracking method. This procedure yields the threshold at which the listener will detect the target 70.7% of the time [Levitt, 1971]. The process is as follows.

For each individual measurement, the subject is played three stimuli, denoted A, B, and C. Two presentations consist of the masker only, whilst the third consists of the masker and target. The order of presentation is randomised, and the subject is required to identify the odd-one-out, thus determining whether A, B, or C contains the target. The subject is required to choose one of the three presentations in order to continue with the test, even if this choice is pure guesswork, hence the title "forced choice task". If the subject fails to identify the target signal, the amplitude of the target is raised by 1 dB for the next presentation. If the subject correctly identifies the target signal twice in succession, then the amplitude of the target is reduced by 1 dB for the next presentation. Hence the amplitude of the target should oscillate about the threshold of detection, as shown in Figure 6.5. In practice, mistakes and lucky guesses by the listener typically cause the amplitude of the target to vary over a greater range than that shown. A reversal (denoted by an asterisk in Figure 6.5) indicates the first incorrect identification following a series of successes (upper asterisks), or the first pair of correct identifications following a series of failures (lower asterisks). The amplitudes at which these reversals occur are averaged to give the final masked threshold. An even number of reversals must be averaged, since an odd number would cause a positive or negative bias. Throughout these tests, the final six (out of eight) reversals were averaged to calculate each masked threshold.

The initial amplitude of the target is set such that it should be easily audible. Before the first reversal, whenever the subject correctly identifies the target twice, the amplitude is reduced by 6 dB. After the first reversal, whenever the subject fails to identify the target, the amplitude is increased by 4 dB. After the second reversal, whenever the subject correctly identifies the target twice, the amplitude is reduced by 2 dB. After the third reversal, the amplitude is always changed by 1 dB, and the following six reversals are averaged to calculate each masked threshold. This procedure allows the target amplitude to rapidly approach the masked threshold, and then finely track it. If the target amplitude were changed in 1 dB steps initially, then the descent to the masked threshold would take considerably longer, and add greatly to listener fatigue. In the case where the listener fails to identify the target initially, the target amplitude is increased by 6 dB for each failed identification, up to the maximum allowed by the replay system (90 dB peak SPL at the listener's head).
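A minimal sketch of this staircase is given below. subject_correct is a placeholder for one complete 3-interval forced-choice trial at the given target level, and the step-size schedule is simplified so that the step depends only on the number of reversals recorded so far.

    level = 60; correct_run = 0; reversals = []; trend = 0;
    steps = [6 4 2 1];                         % dB steps after 0, 1, 2, 3+ reversals
    while numel(reversals) < 8
        step = steps(min(numel(reversals)+1, 4));
        if subject_correct(level)
            correct_run = correct_run + 1;
            if correct_run == 2                % two correct in a row: go down
                if trend > 0, reversals(end+1) = level; end   % up -> down reversal
                trend = -1; level = level - step; correct_run = 0;
            end
        else                                   % one incorrect: go up
            if trend < 0, reversals(end+1) = level; end       % down -> up reversal
            trend = 1; level = level + step; correct_run = 0;
        end
    end
    threshold = mean(reversals(end-5:end));    % average the final six reversals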

Figure 6.6 gives a top-down overview of the entire spatial masking experiment, from the seven psychoacoustic phenomena under investigation, to the 10,000+ stimulus presentations. Due to the reversals and repetitions, each masking threshold is determined 48 times. Figure 6.6 can be found on the next page.

Figure 6.6: Overview of the spatial masking experiment structure
Only the first one or two branches are shown, whilst the arrows to the right indicate further branches.

6.3.2.7 Experimental Procedure

The test procedure for each listener (subject) was as follows. With the subject seated on the chair within the anechoic chamber, the chair was adjusted such that the subject's ears were in line with the centre of the loudspeaker to their left (speaker 4, at 90°). The subject was given the cordless PC mouse, and instructed to look at the PC monitor at all times during the test. Other than this, the subject's head was not restrained in any way. Before the first test, audiotest played the subject the following pre-recorded instructions, replayed via the front loudspeaker:

“In the following series of listening tests, the task is to select the odd-one-out. Each test features three audio samples, “A”, “B”, and “C”. You should select the one that sounds different from the other two. If you cannot tell any difference between the three samples, please try and guess. If you answer correctly, you will hear this sound [bell]. If you choose incorrectly, you will hear this sound [beep]. Please try the following example.”

There followed a simple, interactive example, consisting of two beeps and a beep plus a burst of noise. The subject was not allowed to continue until they had correctly identified the odd-one-out (the presentation containing the burst of noise). If they failed to do so, audiotest played the following message:

“That was the wrong answer. You can now see the correct answer highlighted in red. Please try again in this new test, in which the correct answer may have changed.”

In practice, no subjects failed this test3. When the subject correctly identified the odd-one-out, audiotest played the following message:

“Well done, you selected the right answer. You are now ready to start the listening test. Alternatively, you may listen to these instructions again.”

3 During the interactive example, two subjects who became very familiar with the instructions (i.e. bored) intentionally chose the wrong answer. This demonstrated that audiotest performed as specified.


If the subject opted to commence the test, the test administrator checked that they had no further questions, then closed the anechoic chamber door.

Unbeknown to the candidates, the first threshold measurement in each experiment (consisting of many stimulus presentations and eight reversals, as illustrated in Figure 6.6) was for training purposes only. After this, a threshold measurement was taken at each target location. Following each threshold measurement, the subject was given 20 seconds rest. Between the fourth and fifth measurements (i.e. at the midpoint of the experiment), the subject was allowed an unlimited amount of time to rest and stretch their legs within the anechoic chamber. After each rest period, audiotest informed the subjects verbally of their current position within the test, e.g. "This is test number three out of eight." The candidates were released from the anechoic chamber immediately after the final measurement. Each set of eight measurements (one training set, plus seven locations) lasted 15–20 minutes.

After a break of at least 20 minutes, the subject repeated the experiment. The two sets of threshold measurements were compared, and if any pair of measurements differed by more than 3 dB, the subject was asked to repeat the experiment for a second time, after at least 30 minutes rest. If, after three attempts, no two measurements of the threshold at a particular target location agreed to within 3 dB, then the subject's results were marked for possible rejection.


(a) raw results (b) normalised results

Figure 6.7: Results of experiment seven, showing the effect of normalising each subject's thresholds to the value at 0°

6.3.3 Results

6.3.3.1 Normalisation of threshold measurements

The raw data from each psychoacoustic experiment consists of two or three threshold determinations for each spatial location from each subject. The raw data from experiment seven (codec noise) is shown in Figure 6.7(a). A general trend across subjects is apparent, though the data points from different subjects are widely spaced. One method commonly employed to clarify data from spatial masking experiments is to normalise each subject's results to their threshold at 0°. Thus the variations in threshold with angle are maintained, but the difference in absolute threshold between subjects is removed. Figure 6.7(b) shows the results of this normalisation on the data from experiment seven.
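As a sketch, the normalisation amounts to subtracting each subject's 0° threshold from all of their measurements; the matrix layout assumed below is illustrative.

    % thr is an [nSubjects x nAzimuths] matrix of masked thresholds / dB,
    % with the 0 degree condition assumed to be in column 1.
    normalised = thr - repmat(thr(:,1), 1, size(thr,2));
    meanNormalised = mean(normalised, 1);      % trend across subjects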

6.3.3.2 Graphical results

The numerical results from each experiment are included on the accompanying CD-ROM in both MATLAB 5.1 and Excel 7 format. The results from each experiment are reproduced graphically below. The six reversals which are averaged to give each single threshold measurement are not included in the error analysis. Each graph includes error bars of ± 1 standard deviation, calculated from all threshold measurements for a single location by one candidate, or all candidates (as noted on each individual graph). Graph (c) in each Figure includes the data for the most reliable subject in each test, as discussed in Section 6.3.4.2.


Figure 6.8: Results of experiment 1: Broadband noise masking 1 kHz tone
(a) raw thresholds averaged across all subjects; (b) normalised thresholds averaged across all subjects; (c) raw threshold averages for subject DR; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).

Figure 6.9: Results of experiment 2: Broadband noise masking 8 kHz tone
(a) raw thresholds averaged across all subjects; (b) normalised thresholds averaged across N subjects (legend: Average (all), Average (DR + KF)); (c) raw threshold averages for subject DR; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).


Figure 6.10: Results of experiment 3: 1 kHz tone masking 80 Hz wide 1 kHz noise
(a) raw thresholds averaged across all subjects; (b) normalised thresholds averaged across all subjects; (c) raw threshold averages for subject DR; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).

Figure 6.11: Results of experiment 4: 1.5 kHz tone masking 80 Hz wide 1 kHz noise
(a) raw thresholds averaged across all subjects; (b) normalised thresholds over all subjects; (c) raw threshold averages for subject SH; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).


Figure 6.12: Results of experiment 5: Temporal masking
(a) raw thresholds averaged across all subjects; (b) normalised thresholds averaged across all subjects; (c) raw threshold averages for subject WS; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).

Figure 6.13: Results of experiment 6: De-matrixed coded audio
(a) raw thresholds for each subject: "Drum hit" sample; (b) raw thresholds for each subject: "Speech" sample; (c) thresholds averaged across all subjects: "Drum hit" sample; (d) thresholds averaged across all subjects: "Speech" sample. All panels plot threshold / dB against codec name and matrix method (MP3 discrete, MP3 dematrixed, MP2 discrete, MP2 dematrixed).


Figure 6.14: Results of experiment 7: Spatially separated codec noise
(a) raw thresholds averaged across all subjects; (b) normalised thresholds averaged across all subjects; (c) raw threshold averages for subject DR; (d) normalised threshold averages for each subject. All panels plot threshold / dB (or normalised threshold / dB) against azimuth / degrees (0–180°).

6.3.4 Discussion

For each experiment, the plot of masked threshold against target azimuth reveals a general trend which is different for each task. In some experiments, the errors are so small that accurate, useable quantitative data may be extracted from the data. In all experiments, the trends are clear enough that qualitative conclusions can be drawn about the variation of masked threshold with target azimuth.

The final experiment yields the greatest variation in threshold with azimuth, possibly because this is the only experiment to employ broadband stimuli as both target and masker. This may explain why previous researchers have concentrated on broadband stimuli; the larger variation in masked threshold with target azimuth gives a clearer picture of the effect of spatial masking. Also, the 3–6 dB statistical variation found within much psychoacoustic data is less significant where the measured effect is of the order of 20 dB. Compare this with the results of experiment five: the inter-subject variation is again of the order of 5 dB, but here the variation in threshold with azimuth is only 6 dB. In this instance, though a trend emerges from the data, the quantitative data is less useful because the variation between subjects is nearly as large as the variation with target azimuth.

In addition to some of the quantitative data being swamped by statistical errors, there are two specific inconsistencies in the data which require investigation, as follows.

Experiment two: subject WS. Referring to Figure 6.9, the results from this subject are significantly different from the others who participated in this test. This experiment tests the subject's ability to hear an 8 kHz tone in the presence of broadband masking noise. The audiogram of subject WS shows a 10 dB impairment at 8 kHz in the left ear. For this reason, the results of subject WS in experiment two will be rejected.

Experiment six: subject DR: "speech" sample: mp3 dematrixed. The masked threshold for subject DR is 27 dB lower than that achieved by the other candidates. Comparison with the other experiments shows that this is drastically greater than can be expected due to inter-subject performance differences. A close examination of this particular sample shows the parameters of the experiment to be unsuitable for this test sample, as described below.

This particular matrixed audio sample ("Speech") was handled very badly by the mp3 encoder. The codec low pass filters the out-of-phase signal component at 1.2 kHz. In the "Speech" sample, this caused most of the out-of-phase component (which is dematrixed to the rear speakers) to be lost, though the in-phase component (which is dematrixed to the front speaker) was reproduced faithfully. In the test, the listener compares the masker only (the original signal) with the masker plus target (the original signal plus the "noise due to the codec"). As the codec removes most of the signal, the "noise due to the codec" is the inverse of the signal, such that, when added to the masker (the original signal), the result is almost silence (see Figure 6.15). This occurs when the target signal is presented at 0 dB relative to the masker (i.e. at the same level as coded), and is clearly audible.


Figure 6.15: Problem signals in experiment six

A poor codec has removed most of the signal, as shown in the first pair of graphs, "Source signals". In the test, the masker is the original signal. The target is the coding noise, which is isolated by subtracting the original signal from the coded signal. In this instance, the target is approximately the inverse of the masker (as shown in the second pair of graphs, "Test signals"). The listener compares the masker to the masker plus target. With the target at the original level, the masker and target almost cancel (as shown in the fourth pair of graphs, "Comparison at 0 dB"). If the target is increased by 6 dB, then the masker plus target is approximately the inverse of the masker signal (as shown in the third pair of graphs, "Comparison at +6 dB"). If the target is decreased by 6 dB, then the masker plus target signal is a scaled version of the masker itself (as shown in the final pair of graphs, "Comparison at –6 dB").


A problem arises if the target signal is presented at +6 dB relative to the masker. As Figure 6.15 illustrates, this is the same as taking the original signal, and adding twice the inverse: the result is an inverted signal. Few people can detect phase inversion, so this signal is indistinguishable from the original. This yields a local minimum in audibility around +6 dB, as shown in Figure 6.16. In experiment six, subjects WS and SH converged on this local minimum, whilst subject DR "jumped over" this minimum by successive correct answers before the first reversal (during which the step size is 6 dB, as discussed in Section 6.3.2.6). Having avoided the false minimum, subject DR reached a realistic audible threshold in the usual manner. If subjects WS and SH had not encountered the local minimum, it is likely they would have achieved similar thresholds to subject DR.

Figure 6.16: False minimum in audibility for experiment six at +6 dB
(drawn for illustrative purposes only – not from experimental data)

In conclusion, if the masker and target may be correlated, it is important to test for false minima in audibility before carrying out a full psychoacoustic test. If any false minima are found, then the target amplitude range must be bounded to exclude them. In experiment six, the target amplitude of the sample "Speech" processed via mp3 dematrix should be prevented from exceeding 0 dB.

6.3.4.1 Subject ability

The ability to perform well in psychoacoustic tests is not perfectly correlated to hearing ability as measured by an audiogram. Training improves acuity in any given task, though in this test there was no evidence that thresholds improved with repetition (after the initial training period). This is surprising, since other tests (e.g. [Moore et al, 1998]) have indicated that a longer training period may be necessary than was possible in this current series of tests. Adequate training is defined such that further training would not produce an improvement in performance. The conclusion must be either that the current experiment consisted of adequate training, or that so little training was employed that candidates never began to improve. The latter is unlikely, since some performance improvement would be expected during 512 attempts at the same task (4 guesses x 8 reversals [inc. 2 not used for data] x 8 locations [inc. 1 used for training] x 2 repetitions) if it was going to occur at all!

Though the training period was short, most candidates had the opportunity to overhear another candidate carrying out the same task before them, so this may have helped. Another feature of the experiment which may have compensated for the relatively short training is the requirement that the candidate repeats each measurement to within 3 dB, as discussed in 6.3.2.7. Had the candidate improved significantly (more than 3 dB) between the first and second repetition (possibly due to insufficient training before the first measurement), a further repetition would be required, and the first measurement would be rejected. In practice there is no evidence of this happening: candidates do not improve between the first and second repetition; rather, the differences are random in nature. However, the safeguard was there.

Even after subjects are trained adequately in a given task, there remain large differences between the masked thresholds exhibited by different subjects. The "best" subjects can detect masked sounds that are up to 15 dB quieter than those just audible to the "worst" subjects, as shown in Figure 6.17, which can be found on the next page. It is interesting to compare a subject's ability to detect a signal in quiet (as measured by an audiogram) and their ability to detect the same signal in the presence of a masker. The presence of a masker may raise the threshold by tens of dB, e.g. for the 1 kHz tone used in experiment one, the threshold is raised from 5 dB SPL in quiet to 45 dB SPL in the presence of the masker. A listener with poorer hearing may have a threshold in quiet of 20 dB or more. This means that a 45 dB SPL tone in quiet would be clearly audible. However, a 45 dB SPL tone in the presence of a masker is inaudible to our impaired listener – not only has their absolute threshold been shifted upwards by hearing damage, but the masked threshold has also been raised, in this case by 7 dB to 53 dB.


Figure 6.17: Average masked threshold estimate for each candidate, sorted by experiment
Data from each experiment are normalised to the overall averaged threshold for that experiment, thus only inter-subject differences remain. (One point per subject – WS, DR, SS, SH, KF, AR – for each experiment number.)

(a) subject SH – excellent hearing (b) subject SS – impaired hearing

Figure 6.18: Audiograms of two subjects from spatial masking experiment

The audiograms show hearing performance relative to typical human hearing thresholds (normalised to 0 dB). The apparent impairment at 500 Hz is present on all candidates' audiograms, and is almost certainly due to the low frequency noise from air conditioning present in the testing room. The anechoic chamber used for the psychoacoustic experiments was quiet, but all other available rooms contained air conditioning, hence there was no way to avoid this problem. As no experiments use stimuli under 1 kHz, having inaccurate 500 Hz audiogram data was not thought to be a significant drawback.


It is tempting to correlate threshold in quiet with masked threshold, and to explain all differences between listeners as arising from hearing damage as shown by an audiogram. Reference to each subject's audiogram shows that those with poorer hearing sometimes have higher thresholds, as may be expected. In particular, ranking the subjects by threshold for detecting a 1 kHz tone in quiet (from the audiogram) gives the same order as ranking the subjects by thresholds for detecting a 1 kHz tone in the presence of broadband noise (from experiment one). However, for subjects with similar (good) thresholds in quiet, there can still be differences in masked thresholds – e.g. 3-4 dB between DR and SH in experiment four, and 4 dB between DR and SH in experiment seven. Strangely, SS has the worst audiogram overall (Figure 6.18), but his performance matches that of DR (the subject with the lowest thresholds throughout) in experiment three. Finally, subject WS has a fair audiogram, but his thresholds are even higher than his audiogram would suggest. As subject WS was involved in the creation of the experiment, this shows that familiarity with the audio samples and test procedure is not as great an advantage as might be expected.

Figure 6.19 shows the relationship between threshold in quiet and masked threshold for all experiments. The data from each experiment have been normalised to leave only inter-subject differences. After normalisation, the correlation coefficient is +0.4672, which indicates a weak positive correlation.
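As an aside, this normalisation and correlation step can be sketched in MATLAB as follows; the matrix thr (subjects x experiments) and the vector quiet are random placeholders standing in for the measured thresholds, not the experimental data itself:

    % Sketch of the normalisation and correlation analysis.
    thr   = randn(6, 8);              % masked thresholds: 6 subjects x 8 experiments (placeholder)
    quiet = randn(6, 1);              % audiogram threshold in quiet per subject (placeholder)

    norm_thr = thr - mean(thr, 1);    % remove each experiment's mean, leaving
                                      % only the inter-subject differences
    r = corrcoef(repmat(quiet, 8, 1), norm_thr(:));
    disp(r(1, 2));                    % correlation between quiet and masked
                                      % thresholds (weak positive expected)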

This shows that poor performance in psychoacoustic tests is not entirely due to measurable hearing impairment. If hearing-impaired subjects were the only ones to exhibit a poor correlation between threshold in quiet and masked threshold, then the poor correlation could be attributed to non-linear hearing impairment. However, the masked thresholds of DR and SH are consistently 4 dB apart, even though both subjects exhibit no hearing impairment. This indicates that, beyond the mechanical condition of the cochlea, there are higher auditory processes which vary between subjects. This makes the task of modelling an idealised listener even more challenging, and is yet another reason why two listeners can have very different opinions about the perceived quality of a given audio system.

Figure 6.19: Relationship between audiometric threshold in quiet and masked threshold. This graph demonstrates the poor correlation between these two quantities.

Figure 6.20: Standard deviation of masked threshold estimate for each subject, sorted by experiment

In conclusion, a candidate's performance in masking tasks may be inferred in part from an audiogram, but there seems to be another separate talent or process involved in detecting masked sounds that may be present to a varying degree in any individual. Some people are simply better at hearing masked sounds than others, just as some people are more musical than others – an audiogram cannot tell the whole story.

6.3.4.2 Subject reliability

In the previous section, it was suggested that the subject with the lowest thresholds may be considered the “best” subject, since they can hear sounds that others cannot. However, another measure of subject quality is the consistency of their masked threshold estimates. Figure 6.20 shows how the standard deviation of the masked thresholds varies between subjects for each experiment. Subject WS is the least consistent subject in all but one experiment, whilst subject DR is the most consistent subject in all but two experiments. Comparison with Figure 6.17 shows that assessing subject quality in terms of lowest average masked threshold yields the same ranking as assessing subject quality in terms of smallest variance in masked threshold.


Subject WS is judged the poorest subject by both methods, and subject DR is judged the best. The reader will note that in Figure 6.8 to Figure 6.14 (c), the data from the most reliable subject in each experiment is plotted separately, and it is this data that will be used to train the auditory model.

Finally, it is important to remember that our directional hearing ability is due largely to the filtering effect of the pinna upon the incoming wavefront. Each human pinna is unique, and so the exact spectral features reaching each individual listener from a given point in space will be different [Moller et al, 1995b]. This will give rise to variations between candidates in any experiment that examines spatial masking, such that candidates may not even agree upon which spatial location is the most or least audible in any given task. This is shown in some of the results herein, especially those from experiment three.

6.3.4.3 Role of head movement

A lower threshold value indicates that the candidate found the stimulus to be more audible. Examining the results, it is surprising that so many of the 180° sources are easier to hear than those at 0°. The HRTFs for these directions are almost identical at 1 kHz, and in a perfectly symmetrical anechoic set-up there is no other reason for a source at 180° to be more audible than one at 0°. However, if the candidate moves their head, even slightly, then the two source positions cease to be equivalent, and the source at 180° is easier to hear.

The lower thresholds at 180° in the experiments employing 1 kHz stimuli may be due to head movements, a slight asymmetry in the experimental set-up, or the candidates' HRTFs at 1 kHz not matching at 0° and 180°. The former is most likely; therefore these results must be taken to represent the response of a candidate who is nominally still, but not necessarily motionless. "Head clamped" responses may have been different. Note that the large difference between thresholds at 0° and 180° in the spatially separated codec noise test is almost certainly due to the HRTF, since a narrow spectral region around 9 kHz is 10 dB higher in the 180° HRTF compared to the same region within the 0° HRTF.

6.3.4.4 Comparison with existing data

Experiments one to five take standard psychoacoustic tests, and extend them by separating the masker and target. However, the first threshold measurement in each experiment was determined with the masker and target co-incident. These co-incident thresholds can be compared with those obtained by previous researchers. This will help us to judge the accuracy and validity of the current test.

Figure 6.21: Comparison of experimental results with existing data

Appendix D gives full details of the previous research data which is used as a reference, and also the transformation of all the data into a common format of dB SPL for tonal signals, and dB SPL/Hz for noise signals. Figure 6.21 shows up to three results from the literature for each experiment (E1-3), as well as the data from each subject in the present experiment. Both the threshold values, and the variability between subjects, match those from previous experiments very well.

6.3.5 Knowledge gained

The experiments described herein have given us data with which to develop a theory of binaural masking, and to train and verify an auditory model. Beyond this, the results give some interesting new insights into human spatial hearing acuity.

The most startling results were obtained in the temporal masking experiment. Unlike every other task, thresholds are lowest if the target and masker are co-located. This may be because the masker, which occurs first, dominates our auditory attention. If the target is moved away from it, it may be difficult for us to re-focus on the target location in the time allowed (10 ms), hence the threshold increases with target/masker separation.

The experiments with codec noise showed that separating the noise from the audio could have audible consequences. In experiment seven, when the codec noise was moved 90° away from the audio signal, it became 15 dB more audible. In experiment six, in the comparison of discrete vs matrix transmission of surround signals via psychoacoustic codecs, errors in the matrixed version were more audible in three out of four cases. It is difficult to draw general conclusions from the results of this experiment, since it used specific codec implementations and audio samples. However, it shows that concern over the psychoacoustic coding of matrix audio signals may be valid, though the errors are nowhere near as audible as when the codec noise and audio signal are intentionally separated.

6.4 Conclusion

In this chapter, the concept of spatial masking has been introduced. Existing data have been reviewed, and an extensive series of spatial masking experiments has been undertaken. The results from these experiments give new knowledge of human spatial hearing, which will be used to develop a binaural auditory model.

6.5 Acknowledgements

The spatial masking experiment described in this chapter was a joint project between the author and Prof. Woon Seng Gan, Associate Professor of Electrical and Electronic Engineering, Nanyang Technological University, Singapore. Many thanks to Woon Seng for his hard work and enthusiasm, and for travelling up and down the A12 to Ipswich at twice the legal speed limit in his home country.

The author wishes to thank BT labs for the loan of their facilities and equipment during the experiment. Thanks also to the staff at Martlesham Heath, who were very helpful during our time there.

Finally, the author would like to thank the subjects who kindly gave their time, and spent many hours in the anechoic chamber. They were Woon Seng Gan, Steven Sumpter, Steven Huxford, Kelvin Foo and Andrew Rimell. This experiment would not have been possible without their patience and co-operation.

7

Binaural Auditory Model

7.1 Overview

In this chapter, the binaural processing within the human auditory system is considered. Existing models of binaural interaction are discussed. The monophonic model discussed in Chapter 5 is extended by the addition of a binaural processor. The resulting binaural auditory model is calibrated with measured data, and is shown to correctly predict human performance in free-field masking experiments and horizontal-plane localisation discrimination tasks. Finally, directional accumulators are defined, to judge the overall change in the stereophonic sound stage.

7.2 Introduction

It is often said that two ears are better than one. The presence of two ears allows the auditory system to compare the signals at each ear, thereby deriving two benefits. Firstly, the interaural time and level differences for a given sound source provide strong directional cues. Secondly, if two sources are spatially separate, our two point "sampling" of the sound field allows us to concentrate on one sound in the presence of the other. This ability can be measured in two dimensions as the Binaural Masking Level Difference, or in three dimensions as Spatial Masking.

The principal aim of the present model is to simulate binaural and spatial masking. The localisation of sounds is not a primary requirement of the present model. However, the two mechanisms (binaural release from masking, and localisation) are closely linked, and the present model may be adapted to localisation tasks using techniques described in the literature.


Within this chapter, the term "binaural detection" will be used as a convenient description of any task where binaural processing allows the listener to detect a change which is not audible using monophonic (single-eared) listening alone. The Binaural Masking Level Difference is one such phenomenon. The spatial masking experiments described in Chapter 6 also count as binaural detection tasks where the masker and target are separate, but not where the masker and target are co-located.

The detection of a change in source location can also be referred to as a binaural detection task. This is only true where the change in location is not detectable from the signals reaching either ear in isolation. Though the present model is not required to correctly predict the location of a sound source, it should detect any change in perceived location. This is relevant in the field of audio quality assessment. For example, the stereo image may be changed in some manner by the coding process, and the model should detect this change.

7.2.1 Webster’s Hypothesis

[Webster, 1951] suggested that all categories of binaural detection may be equivalent at some stage of internal binaural processing. It is assumed that the auditory system attempts to calculate the interaural time delay by some form of correlation, convolution, or a functionally equivalent process. It is suggested that the just detectable change in the output of this process will correspond to the threshold condition in any binaural detection task. At the simplest level, the just detectable change in the calculated ITD corresponds to the just detectable change in the true ITD. Hence, this mechanism sets the limit on our spatial acuity. However, Webster suggests that spatial masking may be attributable to the same mechanism. Thus, at threshold, the presence of the target signal changes the calculated ITD by the same "just detectable" amount.

Webster apparently disproved his own hypothesis. However, in this chapter the hypothesis is revisited using current auditory modelling techniques, with interesting results.

7.2.2 Data

The measurements described in the previous chapter will be the chief source of calibration data for the binaural model. To test Webster's hypothesis, the just noticeable change in incident source angle is also tested. This is measured in [Mills, 1958] as equating to an ITD of 11 µs at a frequency of 1 kHz.

7.2.3 Physiology

The monophonic model described in Chapter 5 was based upon the known physiology of the human auditory system, and a similar approach will be attempted here. Unfortunately, the binaural processing within the HAS is carried out within the neural domain. As such, it is less well understood than the processes that transform sound information from an air-borne pressure wave into a neural signal. However, some knowledge has been gained from the mapping of nerve fibre interconnections, and the probing of individual cells.

The role of the Superior Olivary Complex in the lateralisation of sounds was discussed in Chapter 3. In summary, it is known that signals from both ears are compared within this region, and it is highly likely that interaural time and level differences are determined therein.

Whilst these are essentially lateralisation processes, they are the only known processes within the auditory system where signals from both ears are compared. Thus, the known physiology gives some weight to Webster's hypothesis, suggesting that these same mechanisms may be responsible for the binaural masking level difference, and the spatial masking phenomena measured in the previous chapter. In addition, models considering only interaural time difference have correctly predicted binaural masking level difference data (e.g. [Colburn, 1977]).

For these reasons, both lateralisation and detection models will be discussed here in relation to the present task.

7.3 Existing models

An excellent review of many models of binaural hearing is presented in [Colburn and Durlach, 1978]. A slightly more recent discussion is found in [Blauert, 1983]. The present review concentrates on models that have contributed in some way to the development of the present model.

7.3.1 Jeffress model

[Jeffress, 1948] proposed a model of binaural interaction which has formed the basis of much subsequent research. His hypothetical neural network is shown in Figure 7.1. The neural signals from the left and right ears are assumed to travel at a finite rate along the nerve fibres. In this way, each nerve fibre can be thought of as a delay line. The "tertiary fibres" are more likely to fire when signals from the left and right ears reach them "almost simultaneously". Thus, it can be seen that the matrix of fibres in Jeffress' model acts as a network of time delays and co-incidence detectors, the purpose of which is to measure the interaural time delay. Interaural intensity differences are incorporated into the model by assuming that louder signals propagate along the nerve fibres at a higher velocity. This is termed the "latency hypothesis", and has received less universal acceptance than the rest of the Jeffress model.

Figure 7.1: Jeffress binaural interaction model [Jeffress, 1948]

This model predicts the lateralisation of sound sources; the tertiary fibre exhibiting the largest output is assumed to correspond to the perceived lateral location of the sound source. The transformation from sound wave to neural signal is poorly specified in this model, as such data was unavailable at the time of the model’s conception. However, with certain assumptions, this model can predict much lateralisation data. These specific assumptions are common to most models, and will be discussed in detail later, with reference to a more advanced model.

There are several inconsistencies between the predictions of the Jeffress model and the performance of the human auditory system. The most notable problem is the model's performance near threshold. Since no internal noise is specified, the model exhibits accurate performance below threshold, whereas the lateralisation accuracy of human subjects is impaired as the intensity of a sound source approaches threshold. Another problem arises from the complete trading of time and intensity information at the neural level. This suggests that a single lateralisation judgement is passed to the higher auditory centres. However, as discussed in Appendix E, this is unlikely, as the presentation of a single source with contrary ITD and ILD cues can lead to the perception of two sound sources. This is not possible with any model that combines these two cues before they can be "perceived".

Despite these problems, it may be possible to apply this lateralisation model to the task of detection. This hypothesis is tested in [Webster, 1951]. Webster assumed that the lateralisation model may yield extra information which is not apparent from the individual ear signals, and that this extra information may cause the Binaural Masking Level Difference. His hypothesis states that, at threshold, a signal will cause a just detectable change in the output of the lateralisation model. Webster did not have access to models of inner hair cell responses, or to the time domain analysis used throughout the present model. To test his hypothesis, he assumes that the input stimulus passes through a bank of narrow band-pass filters (equivalent to the Basilar Membrane response), and that the output of each filter approximates to a pure tone. The frequency of this tone is equal to the centre frequency of the band-pass filter, and the phase of this tone is determined by the input stimulus. Thus, after processing via the filter, a noise masker and tonal target are both converted to tone-like signals. Where both are presented simultaneously, the resulting phase of the combined tone is calculated via simple algebra. The interaural network detects the phase (time) difference between the two ear signals, and this difference is dependent upon the amplitude of the target. Webster suggests that any stimuli which change the interaural time difference by more than a fixed amount will be audible. The fixed threshold is assumed to be due to internal noise.

Webster demonstrates that this model correctly predicts some binaural detection data. [Jeffress et al, 1956] apply Webster's hypothesis to further monophonic and binaural phenomena, with some success. To account for a wide range of data, the value of the just detectable ITD must be of the order of 100 µs. That is, a signal at threshold (where that threshold is due to binaural unmasking) causes a change in the ITD of 100 µs. This is a major problem for the model, since the just detectable ITD in experiments where this quantity is measured directly is around 10 µs [Mills, 1958]. If Webster's hypothesis is correct, it seems strange that the auditory system should be an order of magnitude less accurate when using ITD information to detect the presence of a sound compared to the use of ITD information to locate a sound. However, this apparent contradiction may be a consequence of Webster's deterministic model; a noisy or stochastic model may solve this problem.

A significant fact emerges from these discussions, and this fact has a parallel in all binaural models. In order to account for the reduced sensitivity of the HAS to larger ITDs, it is necessary to specify the density of tertiary fibres. It is suggested that there are a relatively large number of fibres corresponding to ITDs around zero, and that this number decreases as the ITD increases. This simulates the reduced lateralisation accuracy exhibited for sound sources outside the median plane. It also allows the model to correctly predict the threshold of detecting sounds in the presence of maskers located away from the median plane.

7.3.2 Cross correlation and other models

The neural matrix at the heart of [Jeffress, 1948] is functionally equivalent to a cross correlation process, providing that the correlation is only computed for a range of samples ±t, where t represents the maximum ITD which can be detected by the binaural processor. [Sayers and Cherry, 1957] pioneered this approach for lateralisation, whilst [McFadden, 1968] adapted a cross correlation model for use in detection tasks. Many others have proposed variations on the cross correlation model, but the details of these models are often incomplete, rendering any quantitative analysis impossible (e.g. [Osman, 1971], where the value of the permitted time and/or phase shifts are not clearly specified).

A complete consideration of all the known models of binaural interaction is not appropriate here. However, given the obvious similarities between many of these models, a review of three areas where there are large variations is presented.

Firstly, the newer models employ increasingly sophisticated pre-processing of the input stimulus, before the cross correlation itself. If the pre-processing within the model matches that within the HAS, then the model is found to simulate human performance in a wider range of tasks [Theiss, 1999]. This is encouraging in respect of the present model, since the peripheral processing matches that within the HAS to a significant degree of accuracy.

Secondly, the outputs of the cross correlation are often combined into a single "correlation coefficient" (an unfortunate name, since this sometimes has little to do with the statistical device of the same name). This can be used in detection experiments, together with two monophonic variables. Alternatively, all the "rightwards" coefficients may be summed, and compared with the sum of all the "leftwards" coefficients. This comparison leads to a prediction of lateralisation. There are yet further alternatives. Most significantly, the outputs of the correlation can be combined in any optimum or sub-optimum manner, in order to match human perception. Many models make bold claims to predict almost all binaural data, but require a different weighting of the correlation outputs to accomplish each individual task correctly. This may or may not represent the real processes present within the HAS, but such an approach cannot be applied to the present time-domain model, which must correctly detect just noticeable differences for a wide variety of binaural tasks, without prior knowledge of the task itself.

Thirdly, the inclusion of interaural intensity differences varies from model to model. No recent model incorporates the latency hypothesis of Jeffress, but some still trade time and intensity differences completely. Many lateralisation models incorporate a separate IID detector, and feed both ITD and IID information to a decision device or weighting function (e.g. [Macpherson, 1991]). Some detector models (e.g. [Colburn, 1977]) do not include an IID detector, yet correctly predict most binaural masking data. However, the efficacy of such models at higher frequencies, where the ILD cue is most important, has not been extensively tested.

7.3.3 Colburn’s neural model

The most successful detection model is that of [Colburn, 1977], which is based upon hypothesised neural processing. At the heart of the model lies a "binaural displayer" that is similar in concept to the Jeffress model, but includes many physiologically relevant features. The entire Colburn model is very well specified, allowing direct quantitative prediction. For these reasons, the model will be discussed in detail. A diagram of the model is shown in Figure 7.2.

The signals for each ear are processed via a peripheral transducer. The output of this transducer consists of a set of neural signals. Rather than specifying the firing probabilities upon the nerve fibres, the actual neural spikes are transmitted. These spikes are caused by the firing of the inner hair cells. Several fibres carry independent signals from each point upon the Basilar membrane; hence, a true realisation of this model would incorporate many thousands of neural signals. These signals are fed to both monaural and binaural processors. The monaural processor is not specified in detail, but it is assumed to account for signal detection where binaural effects are unimportant. As such, it sets an upper threshold limit, and is assumed to predict the correct masked threshold in all tasks where the BMLD is zero.


Figure 7.2: Colburn’s binaural interaction model [Colburn, 1977]

Auditory nerve signals from both ears are fed into the Binaural Displayer, where they are compared. Initial research [Colburn, 1973] suggested that an ideal binaural processor operating on these neural signals would exceed the performance of the HAS. In order to match human performance, the following two restrictions are imposed. Firstly, the signals on a given fibre can only be compared with those upon one other fibre, having the same frequency selectivity, originating from the opposite ear. Secondly, for a given pair of fibres, only firings that occur "almost simultaneously" (within 100 µs) after an internal, interaural delay that is fixed for the pair can be compared.

The illustration in Figure 7.2 is slightly misleading, since the output of the binaural displayer is a two-dimensional matrix L(f,τ), where one co-ordinate represents frequency (f), and the other represents interaural time delay (τ). Thus, for a noise signal presented to both ears without interaural delay, the largest values within the matrix would lie along the column L(f,0); for a pure tone of frequency f1, the largest values would lie along the row L(f1,τ). The value at each point of the matrix is the total count of the coincidences for that frequency and delay over the stimulus interval. This represents an important restriction of many binaural models. Rather than producing a time varying output, the model commences a count or integration at the start of the stimulus, and only post-processes the total value that has accrued at the end of the stimulus interval.

As in the Jeffress model, the distribution of the coincidence detectors is non-uniform. [Colburn, 1977] explicitly states the distribution p(τ), thus:

p(τ) = C                                  for |τ| ≤ 0.15
p(τ) = C exp(−(|τ| − 0.15) / 0.6)         for 0.15 < |τ| ≤ 2.2
p(τ) = 0.0333 C exp(−(|τ| − 2.2) / 2.3)   for |τ| > 2.2          (7-1)

where τ is the interaural time delay corresponding to the fibre pair, C is chosen to give p(τ) unit area, and all time units are in ms. A plot of p(τ) is shown in Figure 7.3. ITDs above 1-2 ms are of little importance within the HAS because larger ITDs do not occur for real-world sound sources. This limit is due to the distance between the ears, which is equivalent to a delay of around 1 ms.

Figure 7.3: p(τ) distribution of coincidence counters with ITD in Colburn's model

The output matrix of the Binaural Displayer is fed into an Ideal Linear Binaural Analyser. This calculates a weighted average of all the matrix components, which is then judged by an ideal decision maker. The weighting coefficients within the analyser are chosen independently for each task. For example, where the task is the detection of a 1 kHz tone within broadband noise, the fibres corresponding to a frequency around 1 kHz are likely to contain the most pertinent information. The ideal decision maker will determine whether the binaural decision variable is above or below a pre-set threshold, and flag the presence or absence of the target appropriately.
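As an aside, the distribution of equation (7-1) translates directly into MATLAB, and this transcription is reused by later stages of the model description. In the sketch below, C is left at 1 rather than being normalised to give p(τ) unit area:

    % p(tau): density of coincidence counters versus interaural delay,
    % following equation (7-1). tau is in ms; C = 1 here, whereas the
    % thesis normalises C so that p(tau) has unit area.
    C = 1;
    p_tau = @(tau) C * ( (abs(tau) <= 0.15) ...
          + (abs(tau) > 0.15 & abs(tau) <= 2.2) .* exp(-(abs(tau)-0.15)/0.6) ...
          + (abs(tau) > 2.2) .* 0.0333 .* exp(-(abs(tau)-2.2)/2.3) );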


The model is stochastic, due to the peripheral transducer, which simulates hair cells firing in a pseudo-random manner. Stochastic or deterministic stimuli may be used. Within a masking experiment, the threshold is defined as the amplitude of the target (or phase of the target, depending on the nature of the experiment) at which the target is correctly detected in 75% of the stimulus presentations. Rather than attempting a time-domain simulation (which would have been impractical in 1977), Colburn creates analytical solutions. This limits the model, since it can only be applied to sources for which formulaic representations of the output of the peripheral transducer are known. There is no fundamental reason why a time-domain implementation of the model could not process any arbitrary signal, by simulating the signal paths through the model. However, the required number of neural fibres makes this task computationally burdensome, even at the present time.

A model suitable for time-domain implementation may be formulated as follows. The model will be based upon the model of Colburn, with some significant alterations. These alterations may or may not cause the performance of the model to disintegrate beyond use, and this is one important reason for testing and evaluating this approach.

Firstly, the peripheral transducer may be replaced by the monophonic model described in Chapter 5. The decision device of the monophonic model is not needed, nor is the final 20 ms low pass filter of the hair cell response, since this removes the fine structure of the hair cell response which is required by the interaural time delay calculation.

Secondly, the model can be converted from stochastic to deterministic. This may worsen performance near threshold, but it will allow the use of one pair, rather than many pairs, of neural fibres per frequency. As discussed with respect to the monophonic model, this compromise removes the repetition inherent with a time-domain stochastic model.

Thirdly, the process of summing the coincidence occurrences over the entire stimulus presentation is not appropriate if the stimulus is a long, time varying signal (e.g. music). It is suggested that a temporal integration window may be appropriate.

Finally, the manner in which the integrated signal is post-processed cannot be task dependent. Whilst it is reasonable to weight the contributors to the decision variable in a task dependent manner, this approach is not applicable to an audio quality assessment model, where the nature of the possible defect is not always known. It is suggested that the premise of Webster, namely that any target at threshold produces a just noticeable change in ITD, may be used to develop a simple universal detector.

A new binaural interaction model will now be described, which is based upon the Jeffress and Colburn models, modified in these four respects.

7.4 Binaural model

In the following sections, each stage of the model is described in detail.

7.4.1 Pre-processor

Figure 7.4: Adaptation circuit from monophonic model showing input from gammachirp filter bank, and outputs to monophonic difference detector and binaural processor

The monophonic model described in Chapter 5 is used as a pre-processor for the binaural model. All procedures and comments pertinent to the monophonic model are also relevant here. Two monophonic models are employed, one for each ear. In addition to acting as pre-processors to the binaural model, the monophonic models are also used exactly as before in order to provide detection of monophonic differences. The physiologically based sections of the monophonic model are analogous to the peripheral transducer within Colburn's model. The difference perception module performs the same role as the Monaural Processor, decision variable, and decision maker within Colburn's model.

The final low pass filter within the physiological section of the monophonic model forms part of the monophonic detection process. As such, it represents a physiological component within the higher processing centres, rather than a process that affects the signal upon the auditory nerve. The input to the binaural model is taken from immediately before this filter, as shown in Figure 7.4. This signal represents the inner hair cell firing probability, which is a deterministic variable. This is in contrast to Colburn's model, which uses stochastic hair cell firing events. Since the monophonic model is deterministic, the binaural model will also be deterministic, so long as no noise sources are added within the binaural processor.

7.4.2 Element by Element Vector Multiplication

Figure 7.5: Element by Element Vector Multiplier

The signals from each monophonic processor are fed into the network shown in Figure 7.5, known as the Element by Element Vector Multiplier (EEVM). This provides a finite length partial correlation, carried out on a sample by sample basis. The process is carried out in MATLAB by calculating the vector dot product between the two rows of samples. If the operation and function of this network are clear, the reader may wish to pass over the following paragraph.

An appropriate way to visualise the EEVM is to imagine the time versus amplitude signals from each ear travelling towards each other from opposite directions. As the signals overlap, the instantaneous amplitudes of the left and right signals at each point of overlap are multiplied. The result of this multiplication is a time varying series, or two dimensional matrix, where one dimension is time. The other dimension, which corresponds to left/right location in this imaginary view, is an indication of interaural time delay. Signals that reach the right-hand ear first give rise to an internal peak on the left-hand side, and the opposite is also true. This mirroring occurs because the signal from the right-hand ear is travelling from right to left in this picture, and will travel a long way leftwards before meeting the delayed signal coming from the left-hand ear, thus giving rise to a peak.


In the symmetrical network shown, both ear signals slide past each other on a sample by sample basis, i.e. there are delays in both signal paths. One unwanted result is that a given sample from the left ear will be multiplied by every second sample from the right ear. For example, the first sample from the left ear will be multiplied with samples two, four, six, eight, and so on from the right ear. This approach is less complex to implement than the multiplication of every sample, since this latter approach would require half sample delays, or else would generate two output samples for every input sample. Skipping every other sample in this manner does not cause significant problems (e.g. aliasing), because the signal is low-pass filtered within the preceding monophonic model.

The total EEVM length is chosen to be 5 ms, so contra-lateral signals separated by more than 5 ms are never compared. If the sampling frequency is given by fs, then the EEVM is carried out over N samples, where

N = round(0.005 fs) (7-2)
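As an illustration, the following MATLAB sketch implements one possible realisation of the EEVM for a single frequency channel. The vectors left and right are hypothetical placeholders for the hair cell outputs of each ear, and the exact tap indexing is an interpretation of the delay-line network in Figure 7.5, not a transcription of the original implementation:

    % Sketch of the EEVM for one channel. eevm(k, t) holds the product of
    % the left and right ear signals at a relative lag of (N + 1 - 2k)
    % samples; the lag steps by two samples per tap, so a given left
    % sample meets every second right sample, as described in the text.
    fs = 44100;
    N  = round(0.005 * fs);           % EEVM length: +/-5 ms of lag (7-2)

    left  = randn(1, 4*N);            % placeholder hair cell signals
    right = randn(1, 4*N);

    T = length(left);
    eevm = zeros(N, T);               % rows: interaural lag; columns: time
    for t = N+1:T
        for k = 1:N
            eevm(k, t) = left(t - k + 1) * right(t - N + k);
        end
    end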

7.4.3 Windowed summation

In the models of both Colburn and Jeffress, the coincidence detectors only respond to signals that occur "almost simultaneously". The response of the coincidence detector could be characterised by a near-rectangular time-domain window function, centred on the time delay in question. The width of this window is approximately 100 µs [Colburn, 1977]. Values of 50-100 µs have been used to represent "almost simultaneously" in most quantitative binaural models.

A similar effect can be achieved via the EEVM by grouping the outputs into 100 µs blocks, and summing the outputs in each block. This process is illustrated in Figure 7.6.

Figure 7.6: Rectangular windowing of EEVM output by summation


There are three problems with this approach. They occur because Colburn describes a continuous time stochastic model, whereas the EEVM forms part of a sampled time deterministic model.

Firstly, at typical audio sampling frequencies (e.g. 44.1 kHz), blocks of 100 µs do not correspond to a whole number of samples. In Figure 7.6, each block length must be an integer number of samples; at 44.1 kHz, the block lengths must therefore be multiples of 22.676 µs. Such a restriction severely limits the possibility of tuning the model to match human perception.

Secondly, the coincidence detectors are not spaced at regular 100 µs intervals: although each coincidence detector has a 100 µs range, this does not mean that the detectors are spaced 100 µs apart. There will almost certainly be some overlap between the detectors within the HAS. Not only is the neat division implied by Figure 7.6 unlikely to arise in neural processing, but also the stochastic nature of neural processing requires redundancy of information, so that the signal can be detected within the internal noise. Overlapping detectors would yield the required redundancy.

Finally, the distribution of coincidence detectors is not uniform for all values of interaural time delay. A function describing the distribution was introduced in Section 7.3.3. If this distribution is to be matched by non-overlapping constant width rectangular window functions, this implies there will be gaps between the window functions in the regions corresponding to larger interaural delays. However, these gaps would give rise to regions where the HAS is unable to locate a sound source. Such arbitrary gaps do not exist within the HAS – rather, the decrease in density of coincident detectors, combined with the internal noise, reduces the information available at higher ITDs. In effect, the signal to noise ratio of the ITD determination process becomes poorer as the ITD increases, so the spatial accuracy decreases.

This loss of resolution for high ITDs can be simulated within a deterministic model by varying the window width (rather than detector density) in accordance with the distribution function described by equation (7-1). However, if a rectangular window is employed, then the window transitions must occur at integer sample locations. After rounding to integer sample boundaries, the specified distribution function would be severely compromised, and the performance of the HAS would be poorly matched.


In effect, the spacing of the coincidence detectors (and the statistics of the internal noise) limit the accuracy of the internal ITD calculation. Using the same technique discussed in Chapter 5, the stochastic detection of a signal in noise can be replaced by the deterministic imposition of a fixed threshold. Where the signal exceeds the threshold, this is functionally equivalent to being detected within the internal noise. In the binaural detector, it is not the presence or absence of the signal that must be detected, but the location of its peak. Hence, the threshold value defines the minimum detectable change in internal peak location.

Using this criterion with a rectangular EEVM window would lead to undesirable results. The window function is effectively sampling in the time delay domain, hence a rectangular window would lead to aliasing within this domain. As the peak moves within the window, no change is detectable, but as the peak moves from one window to the next, the change is detected. This would cause the minimum audible angle to oscillate in the time delay domain, dipping at window boundaries where the process is most sensitive. Such an effect is not found within the HAS. A rectangular EEVM window within a deterministic detector causes problems at window transitions, and the position and spacing of these transitions is vital. A rectangular window within a stochastic model is effectively smoothed by the internal noise, and the problems at window transitions are also smoothed by this noise.

All these factors suggest that a different window shape is required within a deterministic model to yield the same effect as the rectangular window within the stochastic model. The solution is to use a series of overlapping gaussian windows. A gaussian distribution is given by the following equation:

f(x) = (1 / (2πσ²)^(1/2)) exp(−(x − µ)² / (2σ²))          (7-3)

where µ is the mean, and σ is the standard deviation.


Figure 7.7: Gaussian windowing of the EEVM output to generate one channel for further processing

The manner in which a single gaussian window acts upon the output of the EEVM is illustrated in Figure 7.7. A group of time-varying EEVM outputs are multiplied by the gaussian window function, and summed into a single time-varying signal.

A series of gaussian windows are required. n is defined as an index into the time-varying EEVM outputs, such that n can be any value between 1 and N (defined in (7-2)). m is defined as an index into the time-varying outputs of the gaussian window summation, such that m can be any value between 1 and the number of gaussian windows, M. Both M and N are chosen to be odd, so that the central window and EEVM output can be easily identified. The series of window functions can be described by the following equations:

gaus_window(n, m) = (1 / (2πσ(m)²)^(1/2)) exp(−(t(n) − τ(m))² / (2σ(m)²))          (7-4)

σ(m) = 0.05 / p(τ(m))          (7-5)

where p(τ) is defined in equation (7-1). Since the outputs of the EEVM are spaced according to the sample rate fs,

t(n) = n / fs          (7-6)

The mean of each gaussian function is given by τ(m), which represents the interaural time delay at the centre of the gaussian window. τ(m) cannot be defined via a single formula. In the MATLAB implementation of the model, it is calculated via the following criteria:

τ(M/2) = t(N/2)          (7-7)

This equation specifies that the central gaussian window is centred on the central EEVM output. Whilst this is not essential, it acts as a reference point, and aids visual inspection of the output from the binaural processor. Since M and N are both odd, and MATLAB only accepts positive integer indexing into arrays, this equation is correct if both indices are rounded upwards. The means of the other gaussian windows are specified iteratively, working outwards from the central window, thus:

τ(m) = τ(m−1) + σ(m−1)   for M ≥ m > M/2

τ(m) = τ(m+1) − σ(m+1)   for 1 ≤ m < M/2          (7-8)

A set of gaussian window functions generated using these criteria is illustrated in Figure 7.8. The height of each window is normalised to give each window unity area.
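The construction of the window bank can be sketched in MATLAB as follows. The value M = 31 matches the number of binaural channels quoted later in this chapter; p_tau is the transcription of equation (7-1) given earlier, and evaluating it at the delay relative to the central window is an interpretation, since equation (7-5) does not state the reference point explicitly:

    % Sketch of the gaussian window bank of equations (7-4) to (7-8).
    fs = 44100;
    N  = round(0.005 * fs);             % number of EEVM outputs (7-2)
    M  = 31;                            % number of binaural channels (odd)
    p_tau = @(tau) (abs(tau) <= 0.15) ...
          + (abs(tau) > 0.15 & abs(tau) <= 2.2) .* exp(-(abs(tau)-0.15)/0.6) ...
          + (abs(tau) > 2.2) .* 0.0333 .* exp(-(abs(tau)-2.2)/2.3);

    t = (1:N) / fs * 1000;              % EEVM output times in ms (7-6)
    c = ceil(M/2);                      % central window index
    tau = zeros(1, M); sigma = zeros(1, M);
    tau(c)   = t(ceil(N/2));            % centre window on centre output (7-7)
    sigma(c) = 0.05 / p_tau(0);         % delay is zero at the centre (7-5)
    for m = c+1:M                       % upper half, working outwards (7-8)
        tau(m)   = tau(m-1) + sigma(m-1);
        sigma(m) = 0.05 / p_tau(tau(m) - tau(c));
    end
    for m = c-1:-1:1                    % lower half, working outwards (7-8)
        tau(m)   = tau(m+1) - sigma(m+1);
        sigma(m) = 0.05 / p_tau(tau(m) - tau(c));
    end

    gaus_window = zeros(N, M);          % one window per binaural channel
    for m = 1:M                         % unit-area gaussian shapes (7-4)
        gaus_window(:, m) = (exp(-(t - tau(m)).^2 / (2*sigma(m)^2)) ...
                             / sqrt(2*pi*sigma(m)^2)).';
    end

With C = 1 in p_tau, the central window width is 0.05 ms, consistent with the channel spacing of approximately 50 µs quoted later in this chapter.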

This set of gaussian windows acts in the time-delay domain, but the information within this domain is itself changing on a sample by sample basis. The auditory system does not react instantaneously to this information, and a suitable process is required to simulate this fact.

Figure 7.8: Gaussian window bank to sum outputs of EEVM into channels

7.4.4 Time domain windowing of binaural processor output

As discussed in 7.3.3, Colburn’s model counts the number of coincidences that occur upon each fibre pair over the entire stimulus interval. This approach is not appropriate for time varying stimuli, and another is sought.

Perceived lateralisation does not depend on the instantaneous output of the EEVM. As discussed in Chapter 3, the HAS exhibits binaural sluggishness, and does not respond immediately to a change in the binaural aspect of a stimulus. Most models simulate this property via a leaky integrator, though the choice of time constant varies. A leaky integrator implies a single sided exponential window function, as shown in Figure 7.9. This is an unfortunate choice, since it causes the model to respond immediately to changes in the binaural aspect of the stimulus, though it does cause the lateralisation perception to persevere beyond the offset of the stimulus.

Figure 7.9: Single sided exponential response given by a leaky integrator

A leaky integrator was tested with the present model. It was found that erroneous lateralisation judgements occurred at the onset of sounds. Though a leaky integrator is computationally efficient, a more accurate solution is sought.

Recently, [Culling and Summerfield, 1998] have tried to define the shape and duration of the binaural temporal integration window. From experimental data, it appears that a 100 ms asymmetrical gaussian window is appropriate¹.

¹ Culling and Summerfield warn against the blind adoption of this window for binaural auditory models such as the one developed herein, because there is evidence that the auditory system can occasionally reset the binaural temporal integration process during brief periods of silence [Yost, 1985]. However, the criteria which cause the reset during some periods of silence, but not others, have yet to be determined. As the temporal window fits almost all known data, it is the best option currently available, and is used within the present model.


Figure 7.10: Possible binaural temporal windows [Culling and Summerfield, 1998]

Figure 7.11: Goodness-of-fit for each of the window functions in Figure 7.10

A computationally efficient implementation of an asymmetric gaussian window function is not immediately obvious. An FIR filter would require many thousands of taps. For example, at a sampling frequency fs = 44.1 kHz, the suggested window function would require 20,000 taps to achieve accuracy down to –25 dB. Such a filter is required for each output channel of the previous window function (there being 31 binaural channels in the present implementation). Also, recall that the EEVM will be performed on each of the 48 channels of the monophonic model output. Thus, the temporal window will be applied 1488 times. With 20,000 taps, this equates to 29,760,000 multiply and accumulate operations per input sample. The exact computation time for this operation has not been calculated, but it would certainly require several hours of processing for 1 second of input using current hardware.

The computationally efficient leaky integrator is too inaccurate, while the accurate gaussian window is too computationally burdensome. For this reason, a compromise is sought. Figure 7.10 shows the window functions considered by Culling and Summerfield, and Figure 7.11 shows the mean goodness-of-fit [Snedecor and Cochran, 1989] for each of these window functions. A lower χ² value indicates a better fit. The simple window functions out-perform those with more parameters (called floor and skirt), and the gaussian window functions provide the best overall fit. However, the most striking feature is that all the windows are so similar in their goodness-of-fit. Whilst a single sided exponential response (the leaky integrator) is a poor choice, the double sided exponential window function is almost as good as the gaussian. If an efficient double sided exponential can be implemented, then it would represent a suitable compromise.


A simple IIR filter can only generate a single sided exponential response. However, by using two such filters, and operating one of the filters in the reverse time domain, it is possible to generate a two-sided exponential filter. Moreover, since each side of the response is dependent upon a separate filter, an asymmetric window can be generated, and the time constants of each slope can be adjusted independently.

From the data of Culling and Summerfield, it appears that the ideal window may be slightly longer at lower intensities, and possibly becomes shorter at higher frequencies. However, current data does not make either dependence clear, and the best approach is to generate a single window function that can be modified to be frequency and intensity dependent if reliable data becomes available. The appropriate window shape is calculated from the data in [Culling and Summerfield, 1998] by averaging the best-fit asymmetric exponential window responses, as shown in Table 7.1.

level / dB   freq. / Hz   Pr / ms   Pf / ms
    40          1000        37.7      10.6
    40           500        39.9      27.8
    40           250        59.7      34.3
    40           125        28.8      39.8
    50           500        33.4      16.4
    30           500        42.9      29.5
    20           500        47.4      25.9
Average:                    41.4      26.3

Table 7.1: Time constants of exponential window from [Culling and Summerfield, 1998]

Thus, Pr = 41.4 ms and Pf = 26.3 ms. P is the time constant of the exponential, such that

f(t) = exp(−t / P)          (7-9)


In psychoacoustic parlance, "forward" refers to the response before the stimulus, and "backward" or "reverse" refers to the response after the stimulus. Hence, Pr is the time constant of the IIR filter acting in the forwards time domain, which gives the exponential decay after the stimulus. Conversely, Pf is the time constant of the IIR filter acting in the reverse time domain, which gives the exponential response before the stimulus. To compute the IIR filter response in the reverse time domain, the input stimulus is reversed, processed via the filter, reversed a second time, and added to the output from the forward-time filter. The overall response is shown in Figure 7.12.

Figure 7.12: Asymmetric exponential window function
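A MATLAB sketch of this forward/reverse filtering follows; the single-pole filter realisation and the variable names are assumptions, with the averaged time constants taken from Table 7.1:

    % Two-sided asymmetric exponential window via two first-order IIR
    % filters, one run forwards and one run on the time-reversed signal.
    fs = 44100;
    Pr = 41.4e-3;  Pf = 26.3e-3;        % time constants / s (Table 7.1)
    ar = exp(-1/(Pr*fs));               % pole of the forward-time filter
    af = exp(-1/(Pf*fs));               % pole of the reverse-time filter

    x = [zeros(1, 4410) ones(1, 4410) zeros(1, 4410)];   % test stimulus

    yr = filter(1-ar, [1 -ar], x);          % decay after the stimulus
    yf = filter(1-af, [1 -af], fliplr(x));  % filter the reversed input...
    yf = fliplr(yf);                        % ...reverse again: pre-response
    y  = yr + yf;                           % two-sided exponential output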

Having smoothed the output of the EEVM in both the time-delay and time domains, three further stages are required to complete the calculation of the interaural time delay.

7.4.5 Subtraction of the mean

At this stage in the binaural processor, the signal peak in the time-delay domain is an indication of the ITD, but it is swamped by changes in the overall amplitude. The amplitude is irrelevant to the ITD, so this information should be removed. This step is not essential to the model, but it does simplify subsequent processing: before this stage, the maximum binaural signal amplitude is a function of the input signal amplitude. After this stage, the maximum binaural signal amplitude is an indication of the accuracy of the ITD estimate.

This transformation is achieved by subtracting the instantaneous mean of the binaural channels from each binaural channel.
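In MATLAB this is a one-line matrix operation; in the sketch below, the variable binaural (a channels-by-time matrix) is an assumed name, filled with placeholder data:

    % Remove the instantaneous mean across binaural channels at each time
    % sample, so overall amplitude no longer swamps the time-delay pattern.
    binaural = randn(31, 1000);                  % placeholder channel data
    binaural = binaural - mean(binaural, 1);     % subtract each column's mean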


7.4.6 Oversampling

The contents of the channels of binaural processing at this stage represent the information that is available to the auditory system. However, the channel containing the largest value does not represent the ITD with sufficient accuracy, since the channels are spaced approximately 50 µs apart, but the auditory system can detect a 10 µs change in ITD. At first sight, this is a severe mismatch between the model and human perception. However, by comparing the relative signal amplitudes in the channels either side of the maximum value, an ITD estimate may be calculated that is not rounded to a single channel. From sampling theory, it is known that a linear interpolation is not the best way to determine the true location of an inter-sample peak. Greater accuracy may be achieved via optimum oversampling. This approach has been suggested by [Theiss, 1999], where the data is oversampled, and smoothed using a gaussian window.

In the MATLAB implementation, the in-built resample routine provides a fast and efficient solution, and is used in the present model. The channels are oversampled by a factor of one hundred, which yields a resolution of 1 µs. This is ten times greater than the measured accuracy of the human auditory system, since matching the accuracy at this stage would prevent the model being tuned in the detection process that follows². The use of oversampling in this manner is discussed further in Appendix F.
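In sketch form (the variable channels, a snapshot of the 31 binaural channel values at one instant, is a placeholder):

    % Oversample the binaural channels by 100 to refine the peak location
    % from the ~50 us channel spacing down to a ~1 us grid.
    channels = randn(31, 1);            % placeholder channel snapshot
    R = 100;                            % oversampling ratio
    fine = resample(channels, R, 1);    % in-built MATLAB routine
    [pa, idx] = max(fine);              % peak amplitude and ~1 us position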

7.4.7 Peak Determination and confidence value

After oversampling, the channel containing the peak value represents the calculated ITD. From this ITD, the lateral location of the sound source may be inferred.

There are three problems with this approach. Firstly, the binaural model will always calculate an ITD value, even during silent intervals. This ITD value is meaningless, and should be ignored. Secondly, the accuracy of the ITD calculation is independent of the stimulus intensity. This is in contrast to the performance of the HAS, where the minimum audible angle reduces with stimulus intensity. The just noticeable phase difference measurements from [Zwislocki and Feldman, 1956] are re-plotted in Figure 7.13 to show just detectable ITD as a function of stimulus intensity.

² A 10 µs change in ITD can only be detected under controlled listening conditions. If less critical listening is simulated, an oversampling ratio of ten may be adequate, thus increasing computational efficiency.

As the stimulus intensity is reduced from 70 dB to 30 dB, the just detectable ITD change is doubled. The model should simulate this. Finally, the ITD value calculated for a diffuse sound source is of questionable value, and it should be treated accordingly. Unfortunately, no quantitative data is available to define this phenomenon, but some appropriate mechanism must be incorporated into the model to prevent undesirable results from the analysis of diffuse sound fields.

Figure 7.13: Variation in minimum detectable ITD with source intensity

To solve these three problems, a confidence value is attached to each calculated ITD. This confidence value is used to weight the difference detection process, as described in the next section. A confidence value of zero is attached to ITD estimates generated during silent intervals, and a confidence value of one is attached to ITD estimates for sounds over 70 dB. The confidence value is calculated from the height of the oversampled peak in the time-delay domain. Thus, low confidence values are also attached to ITD estimates for diffuse sources, which yield lower peaks.

Defining a transform from peak amplitude to confidence value is not trivial, since the confidence value is simulating three separate psychoacoustic phenomena. To simulate all three effects correctly, three separate mechanisms would be required, but without appropriate data this is impossible at the present time. Hence, a single confidence value is used as a compromise.

The data in Figure 7.13 from [Zwislocki and Feldman, 1956] is used to calibrate the confidence value above absolute threshold, as explained in Appendix G. The transform from peak amplitude to confidence value consists of two psychometric functions. Psychometric functions define the probability of detection near threshold in a variety of psychoacoustic tasks. Two such functions are required because two break-points are found in the data: a slow increase in threshold with decreasing intensity, and an abrupt cut-off at the absolute threshold of hearing. This is discussed further in Appendix G.


The confidence value is given by the following equation:

cv(t) = (1 / (1 + exp(−m2 (pa − T2)))) · (1/4 + (3/4) · 1 / (1 + exp(−m1 (pa − T1))))          (7-10)

where pa is the oversampled peak amplitude, T1 = 1, m1 = 5 (to reduce accuracy at lower intensities) and T2 = 0.05, m2 = 100 (to reduce the confidence value to zero at absolute threshold).

In the binaural processor, a confidence factor associated with each lateralisation judgement is calculated using the above equation.
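Equation (7-10) translates directly into MATLAB; the sketch below simply evaluates the transform over a hypothetical range of peak amplitudes:

    % Confidence value transform of equation (7-10).
    T1 = 1;    m1 = 5;      % reduces accuracy at lower intensities
    T2 = 0.05; m2 = 100;    % forces the confidence to zero at threshold

    cv = @(pa) 1 ./ (1 + exp(-m2*(pa - T2))) ...
            .* (1/4 + (3/4) ./ (1 + exp(-m1*(pa - T1))));

    pa = linspace(0, 3, 301);           % hypothetical peak amplitude range
    plot(pa, cv(pa));                   % rises from ~0 at threshold towards 1
    xlabel('oversampled peak amplitude'); ylabel('confidence value');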

7.4.8 Difference detection

The final stage in the binaural model is the difference detection unit. As discussed earlier, this should correctly detect the just noticeable change in ITD, and the just noticeable presence of the target in a binaural or spatial masking experiment. Webster suggests that the presence of a target sound at threshold may cause the same shift in the internal binaural correlation pattern as a minimum audible change in angle. However, his analysis shows that the former causes ten times the shift of the latter (100 µs compared to 10 µs), so the unification of these two phenomena seems unlikely.

However, Webster did not have access to time-domain analysis, or a sophisticated model of basilar membrane movement and hair cell response. To study his hypothesis further, a lateralisation task and a detection task are simulated via the model. The lateralisation task is the detection of the minimum audible angle. This is measured in [Mills, 1958] as equating to an ITD of 11 µs at a frequency of 1 kHz. The detection task is the detection of a 1 kHz tone in the presence of broadband noise. The measurement of this phenomenon is described in the previous chapter. The data for subject 2, and a target angle of 30° is used here.


Figure 7.14: Internal signals at threshold for ITD task and masking task (a) ITD=0; (b) ITD=11 µs; (c) masker only; (d) masker plus target

The results of processing the threshold stimuli for these two tasks through the auditory model are shown in Figure 7.14. For the lateralisation task, the calculated ITD is shown for a real ITD of 0 µs and 11 µs. For the detection task, the calculated ITD is shown for the masker only, and the masker plus target. Thus, the difference in calculated ITD due to the just noticeable difference in these two tasks can be calculated.

Two things are apparent: firstly, the disparity between the two difference signals is not as large as suggested by Webster: ITD0 = 16.6, ITD11 = 16.7; masker only = 16.6; masker plus target = 16.1. Hence the difference due to the masker is five times larger than that due to the ITD.


Secondly, the larger difference occurs over a shorter period of time: for the noise masking tone stimulus, the difference is only apparent near the start of the stimulus. Hence, integration of the difference signal may bring the two results into line.

Unfortunately, an integration time constant of around 500 ms is required to harmonise the two experiments, and such a large time constant will impair the performance of the model in many other tasks. However, a time constant of 100 ms will set the ITD threshold at 22 µs, and a time constant of 200 ms will reduce it to 18 µs. When the experimental error and the variance between individual subjects are considered, the difference between this value and the experimental value of 11 µs does not seem so large. It is much closer than Webster's own value of 100 µs, and suggests that there may be a significant degree of truth in his hypothesis.

Most importantly, the difference between an 11 µs ITD threshold and an 18 µs ITD threshold is small in the intended use of this model. It represents a lateralisation accuracy error of one half of one degree. Considering that no other time-domain model has successfully unified these two tasks, this is a considerable achievement. For this reason, a single difference detector will be used to detect changes in lateralisation and the presence of binaurally unmasked stimuli.

The difference detector simply subtracts one ITD estimate from the other, and multiplies the difference by the confidence values. The resulting signal is integrated using a 100 ms time constant. A binaural difference is perceived whenever this integrated value is larger than 0.3.
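A minimal sketch of this detection rule follows, assuming a simple one-pole leaky integrator and a product combination of the two confidence values; neither of these details is specified above, so both are assumptions.

```python
import numpy as np

def binaural_difference_detected(itd_ref, itd_test, cv_ref, cv_test,
                                 fs, tau=0.1, criterion=0.3):
    """Confidence-weighted ITD difference, leaky-integrated with time
    constant tau (seconds), thresholded at the detection criterion."""
    d = (itd_ref - itd_test) * cv_ref * cv_test   # weighted difference (product assumed)
    alpha = 1.0 - np.exp(-1.0 / (fs * tau))       # one-pole integrator coefficient
    integrated = np.empty_like(d)
    acc = 0.0
    for n, x in enumerate(d):
        acc += alpha * (x - acc)                  # leaky integration
        integrated[n] = acc
    return np.abs(integrated) > criterion
```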

7.4.9 Directional accumulators

Thus, the model is calibrated to detect any just noticeable change in the location of a sound source. This change will not be detected by the monophonic model, but will be audible to a human listener. Where such a change occurs, it is visible on the binaural difference surface, pin-pointed in both time and frequency.

This accurate information is useful for identifying and solving problems in audio codecs. However, it would be useful to generate an indication of the overall effect of the codec upon the “stereo image”. Such information, if it can be reliably calculated, would also be useful in other areas of audio quality assessment. The model should detect changes in the stereo image, such as a decrease (or increase) in the width of the stereo sound image, or a general shift of the image to the left or to the right.


Previous models (e.g. [Macpherson, 1991] and [Theiss and Hawksford, 1999]) can detect these phenomena, but only for a small class of artificial signals (e.g. an impulse stimulus, or white noise). These models are not suitable for assessing the stereo image degradation due to an audio codec, because the codec will perform differently with the artificial test signals compared to real music signals. In order to judge the effect of an audio codec upon a music signal, the model must reliably detect changes in the sound stage of such a signal.

The present binaural model can carry out this task. The sign of the binaural difference represents the direction in which the perceived source location has moved, thus any leftwards or rightwards shift in the stereo image can be detected in this way.

A running count is kept of the direction of movement, as judged before the integration of the difference signal. Thus, to compare the stereo image of two signals, the oversampled peak position (Section 7.4.7) should be compared on a sample-by-sample and band-by-band basis. For every leftwards shift of the peak, the "left" counter is incremented. For every rightwards shift of the peak, the "right" counter is incremented. This will yield an indication of any overall shift in the image position.

For every shift of the peak towards the centre location, the "in" counter is incremented. Finally, for every shift of the peak away from the centre, the "out" counter is incremented. These final two measurements will determine if the width of the stereo image has been narrowed, or if the central image has been dispersed. These are useful measurements, because the stereo image often collapses toward the centre when intensity stereo coding is used, even though the monophonic signal energy is maintained.
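A sketch of the four accumulators follows; the signed-displacement convention (negative values to the left of centre, zero at the centre) and the [band][sample] array layout are assumptions for illustration, not taken from the thesis.

```python
def directional_accumulators(peak_ref, peak_test):
    """Compare oversampled peak positions sample by sample and band by
    band, and count the four categories of movement described above.
    Inputs are equal-shaped nested sequences indexed [band][sample],
    holding signed lateral displacements (centre = 0, left < 0)."""
    left = right = inward = outward = 0
    for ref_band, test_band in zip(peak_ref, peak_test):
        for r, t in zip(ref_band, test_band):
            if t < r:                 # peak moved leftwards
                left += 1
            elif t > r:               # peak moved rightwards
                right += 1
            if abs(t) < abs(r):       # peak moved towards the centre
                inward += 1
            elif abs(t) > abs(r):     # peak moved away from the centre
                outward += 1
    return left, right, inward, outward
```

Note that the left/right and in/out counts are kept independently: a single shift can be both leftwards and inwards, for example.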

Note that, even if a sound source moves rightwards, some "leftwards" movement will be detected, due to the nature of correlation-like processes (the EEVM) upon real-world periodic signals. However, the dominant movement should still be rightwards, and it is likely that the auditory system uses the dominant cue to judge source direction and movement, having learnt to ignore the minority of signals pointing in the opposite direction. If there is no dominant directional cue, then this indicates that a human listener would be unable to judge the true direction of source movement. This sometimes occurs when auditioning a continuous, high frequency pure tone stimulus in a reverberant environment, and it is appropriate if the model is also confused by such stimuli.


In summary, the binaural difference surface will indicate the time and frequency of any audible difference, and the value of the directional accumulators left, right, in, and out, will indicate any general change in the perceived sound stage. The use of these variables in the assessment of a real music signal is demonstrated in Chapter 8.

7.5 Conclusion

In this chapter, existing models of binaural interaction have been reviewed, and a novel time-domain binaural model has been developed. The model has been calibrated with measured data, and, using Webster's hypothesis, is shown to correctly predict human performance in free-field masking experiments and horizontal-plane localisation discrimination tasks. Finally, directional accumulators have been defined which allow the model to judge overall changes in the stereophonic sound stage.

8 Audio quality assessment

8.1 Overview

In this chapter, the inputs and outputs of the auditory model are discussed, and two codec assessments are performed. The model inputs consist of two "ear signals". Three methods of generating these ear signals are discussed. The model outputs consist of three time-varying frequency dependent difference signals: left, right, and binaural. Methods of processing these difference signals to reflect human perception are described. Two codec assessment tests are performed via the model, and the perception of human listeners is compared with the model's predictions. Finally, the change in the stereophonic sound stage of a real music signal is assessed by the binaural model.

8.2 Generating ear signals

The input to the monophonic model is passed through an HRTF filter and a middle ear filter, as described in Chapter 5. These filters simulate the auditory pathway from the free field to the cochlea. This simple approach is only suitable for a single monophonic source, located at the angular position matching the HRTF measurement. In order to simulate a listener auditioning a multi-channel audio system in a real listening room, a more sophisticated transformation is required.

Three transformation methods generate ear signals compatible with the binaural model. Each has its own advantages and disadvantages, which are discussed in the following sections.

8.2.1 Anechoic HRTF crosstalk processing

Consider the typical stereo listening arrangement illustrated in Figure 8.1. Two loudspeakers are placed in front of the listener, at ±30°. The direct sound pathways from each speaker to each ear are shown. All effects of the listening room are neglected.

Figure 8.1: Typical stereo layout, showing paths from both speakers to both ears

Due to the (almost) symmetrical nature of the human head, the pathway from the left speaker to the left ear is identical to that from the right speaker to the right ear. Conversely, the pathway from the left speaker to the right ear is identical to that from the right speaker to the left ear. Thus, it can be shown that only two monophonic HRTF measurements are required to simulate this listening situation; one measured at 30° relative to due front, and the other measured at 330° (or –30°, depending on the nomenclature). Both measurements are taken at the entrance to the same ear for a source at the respective angular positions. If the signals emanating from the speakers are SL and SR, then the signals reaching each ear, EL and ER, are:

E_L = S_L \otimes H_{30} + S_R \otimes H_{330}    (8-1)

E_R = S_R \otimes H_{30} + S_L \otimes H_{330}    (8-2)

where H30 is the 30° HRTF impulse response, H330 is the 330° HRTF impulse response, and ⊗ denotes convolution.

This transformation can be extended for use in other speaker layouts. The approach is used in the calibration of the binaural model in Chapter 7, to simulate the ear signals from the spatial masking experiments described in Chapter 6. Appropriate HRTFs are employed for each speaker position. This transformation is ideal for this specific case, since the listening environment is anechoic. It is also appropriate wherever a room independent measurement is required.


This approach can be improved by including the impulse response of the loudspeaker within the calculation. The above equations assume an ideal speaker, where the output equals the input. The response of the speaker can be included by redefining SL and SR thus:

S_L = x_L \otimes T    (8-3)

S_R = x_R \otimes T    (8-4)

where xL and xR are the left and right channels of the input signal, and T is the transfer function of the loudspeakers. Separate left and right speaker transfer functions can be included if the speakers are not identical.
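Equations (8-1) to (8-4) translate directly into code. The sketch below assumes one-dimensional sample arrays and full linear convolution; the names are illustrative.

```python
import numpy as np

def ear_signals(xL, xR, h30, h330, T=None):
    """Generate ear signals per equations (8-1) to (8-4). h30 and h330 are
    the HRTF impulse responses at 30 and 330 degrees; T is an optional
    loudspeaker impulse response (an ideal speaker if omitted)."""
    sL = np.convolve(xL, T) if T is not None else xL    # equation (8-3)
    sR = np.convolve(xR, T) if T is not None else xR    # equation (8-4)
    eL = np.convolve(sL, h30) + np.convolve(sR, h330)   # equation (8-1)
    eR = np.convolve(sR, h30) + np.convolve(sL, h330)   # equation (8-2)
    return eL, eR
```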

This approach is appropriate where the listening room is anechoic, or the properties and dimensions of the listening room are not known. However, if the model is required to assess the perceived audio quality of signals replayed within a real listening room, then a more accurate transformation is possible.

8.2.2 Measured speaker/room response

It is possible to account for the response of the speakers, listening room, and listener HRTFs via a single pair of measurements. This requires the use of a dummy head containing microphone capsules within the ear canals, or a real human listener with miniature microphones inserted into their ear canals. The dummy head or real listener is placed at the normal listening location, and the impulse response from each speaker to each microphone capsule is determined.

The impulse responses should be measured in such a way that the entire room response is captured. This is the opposite of normal practice when measuring speaker responses, where the impulse response is windowed to remove all indirect sound. Capturing the entire room response represents a challenge, due to both the duration of the response, and the poor signal to noise ratio as the response decays. A typical maximum length sequence will repeat before the impulse response has decayed sufficiently, so it will be necessary to use a longer MLS, or to use a click stimulus, averaged over many measurements. This is possible with a dummy head, but the slight head movements of a human listener may render the averaging process useless, unless a head clamp is employed.
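The click-averaging step might look like the following sketch, which assumes the individual recordings are already time-aligned:

```python
import numpy as np

def averaged_click_response(recordings):
    """Average repeated click recordings to raise the SNR of the decaying
    tail: with N recordings containing uncorrelated noise, the noise floor
    falls by a factor of sqrt(N) relative to the coherent impulse response."""
    stacked = np.stack(recordings)   # shape (N, samples); assumes aligned takes
    return stacked.mean(axis=0)
```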


After the impulse responses have been captured, the ear signals for any audio stimulus replayed over the speakers can be generated by convolution. This technique is implemented in [Theiss, 1999].

The disadvantage of this method is that only measured speaker/room/listener combinations can be simulated. If the desired room or speaker is unavailable, then it is impossible to carry out a measurement. Further, in order to change the simulated placement of the speakers within the room, it is necessary to take further impulse measurements. If the exact parameters of the listening test are known, then this method is ideal. However, if the choice of speaker, or location of speaker and listener are variable, then this method may be too time consuming and cumbersome.

8.2.3 Simulated room response

Where the parameters of the listening environment are variable, or where the listening environment is hypothetical, simulation of the sound field within the listening room represents a useful alternative to direct measurement. The response of the speaker, room, and listener are still required, but each may be specified, measured, or simulated individually, and then combined as appropriate. The path of the audio signal from the speakers to the listener's ear may be calculated by the following method.

The response of the speaker may be measured or simulated. The speaker is usually approximated as a point source, but a more accurate simulation would include the directional characteristic of the speaker. This property can be quantified by measuring the on and off axis response for a variety of frequencies.

As the sound wave propagates within the listening room, it will be absorbed by the air. Higher frequencies are absorbed most rapidly, whilst frequencies below 1 kHz are not appreciably attenuated over the distances encountered within normal listening environments. A 1 kHz tone, propagating through air at room temperature is attenuated by 3 dB over a distance of 100 m. The attenuation of other frequencies over any distance can be calculated from the atmospheric absorption coefficients in [Putland, 1994]. This attenuation is in addition to the inverse square law, which states that the intensity of sound from any point source reduces as the square of the distance from the source.
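A worked sketch of the propagation loss just described, assuming a 1 m reference distance for the inverse square law and treating the absorption coefficient as a per-frequency parameter (3 dB per 100 m approximates the 1 kHz case quoted above):

```python
import numpy as np

def propagation_gain(distance_m, alpha_db_per_100m=3.0):
    """Linear amplitude gain after propagating distance_m metres:
    inverse square law (20.log10(r) dB in amplitude, referenced to 1 m)
    plus atmospheric absorption at alpha_db_per_100m dB per 100 m.
    Coefficients for other frequencies can be taken from [Putland, 1994]."""
    spreading_db = 20.0 * np.log10(distance_m)              # inverse square law
    absorption_db = alpha_db_per_100m * distance_m / 100.0  # air absorption
    return 10.0 ** (-(spreading_db + absorption_db) / 20.0)
```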


Figure 8.2: Simulation of room reflections using the image model

The boundaries of the listening room will reflect the audio signal. The image model [Allen and Berkley, 1979] uses the laws of geometric optics to model the sound paths within a shoebox shaped room. Thus, the reflections of the room boundary effectively create an image as far behind the boundary as the source is in front, as illustrated in Figure 8.2 [Rimell, 1996]. An extension of the theory [Borish, 1984] facilitates the calculation of the sound paths in a hypothetical room of any shape.
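A first-pass sketch of the shoebox image model: along each axis, the images of a source at coordinate x between parallel walls at 0 and L lie at 2nL ± x, and the full image set is the Cartesian product of the per-axis images. Amplitude weighting by wall absorption and distance is omitted here, so this is an illustration of the geometry only.

```python
import itertools

def axis_images(x, L, order):
    """Image positions of coordinate x between walls at 0 and L,
    for |n| <= order periods: 2nL + x and 2nL - x."""
    out = []
    for n in range(-order, order + 1):
        out.append(2 * n * L + x)
        out.append(2 * n * L - x)
    return out

def image_sources(src, room, order):
    """Shoebox image model [Allen and Berkley, 1979]: the set of image
    source positions is the Cartesian product of the per-axis images.
    src = (x, y, z) source position; room = (Lx, Ly, Lz) dimensions."""
    per_axis = [axis_images(s, L, order) for s, L in zip(src, room)]
    return list(itertools.product(*per_axis))
```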

The image model assumes that the reflective surfaces in a room transform only the amplitude and direction of the signal, but not its spectrum. This is substantially untrue, and a frequency dependent extension of the model described in [Rimell, 1996] is more accurate. This model accounts for the different absorption properties of glass, plaster, curtains and other materials found within the listening environment.

Finally, the direct and reflected sound waves reach the listener. The delay and frequency dependent amplitude of the sound waves are accounted for by the above models, as are the angles of incidence. An appropriate pair of HRTF impulse responses are convolved with each incoming sound wave, and the signals for each ear are summed. Thus, the impulse responses that were measured directly in the previous section may be calculated for any arbitrary listening environment using this model.


The disadvantage of this approach is that, theoretically, all possible HRTFs must be measured, and made available for the final convolution, since reflected sound can come from any direction. In practice, a limited set of HRTF measurements is used. Each incident sound wave is processed by the closest measured HRTF response, or an interpolated HRTF generated from the three nearest responses1. The error between the desired and measured HRTF is insignificant if a sufficiently dense set of HRTF measurements is employed. This technique is implemented in [Rimell, 1996].

8.3 Interpreting the difference surfaces

The outputs of the binaural model are three time-varying frequency dependent difference surfaces. The monophonic left and right difference surfaces indicate noticeable changes in the sound which are detectable at each ear in isolation. The binaural difference surface indicates noticeable changes in the sound which are detectable due to comparison of the signals at each ear. Both monophonic and binaural signals include information about additional signal components (e.g. coding noise or distortion) and subtractive differences (e.g. drop outs). The binaural signal also contains information about any change in the perceived location of a source.

Any value below 0.1 represents an inaudible difference; a value of 0.1 represents a "just noticeable difference", and values above this are increasingly audible. The difference surfaces are very useful in that they pinpoint audible problems in both time and frequency. However, the surfaces are too complex to yield a simple indication of overall perceived sound quality.

There is another layer of processing within the human auditory system, the mechanisms of which are unknown at present. This processing, which is probably a conscious process, receives all the detectable differences between the two signals as an input, and calculates a single value to quantify the extent to which these differences are "objectionable" or "annoying". If the reader has ever experienced a subjective test (or for that matter, if the reader has ever been required to give an opinion upon anything) they will be aware that the process is not mechanistic, or automatic. Where a numerical value must be assigned to a subjective property, there are elements of instinct, judgement, thought, and learned response.

1 Some researchers have used direct interpolation of the raw impulse responses. This is to be discouraged, since differing time delays are confused, rather than interpolated, by this technique. Interpolation of minimum phase HRTFs is more successful, provided the correct sub-sample delays are added to the resulting impulse response.


In the PEAQ model [Theide et al, 2000], a neural network is employed to transform several measures of distortion into a single value. About half of these measures of distortion are drawn from linear measurements that are not considered by the present model, while the other half arise from an auditory model comparable to the monophonic model of Chapter 5. The neural net is trained using a large corpus of subjective test results, and achieves a respectable level of accuracy, as shown in Figure 8.3. It is apparent that even this internationally recognised perceptual measurement system has trouble predicting the human perception of poor quality audio samples. This is shown by the spread of values below –3.0, where the correlation between predicted and actual diffgrade breaks down.

Figure 8.3: Correlation between "objective diffgrade" calculated by PEAQ, and "subjective diffgrade" judged by human listeners

It is not possible to employ a neural network with the present model at this time, because insufficient training data is available2. For this reason, a more direct approach is sought, which can be demonstrated to yield promising results without extensive calibration. Such an approach is described in the next section.

2 In order to gather suitable calibration data, a subjective test was planned, and audio extracts were prepared. However, the BS.1116 recommendations [ITU-R BS.1116, 1997] suggest that inexperienced and untrained listeners are undesirable. A previous test [Goh, 2000] showed that inexperienced listeners can generate random results. The approximate time scale required to train a listener, and perform a complete set of tests is 2-4 days. It was not possible to find listeners who were willing to commit this amount of time.

8.3.1 Error Entropy

If the total area under the absolute audible difference surface is summed, this gives an indication of the magnitude of the audible difference over the entire audio extract. The main problem with this approach is that a very loud difference of short duration will have the same contribution to the total difference as a very quiet difference that persists throughout the extract. The quiet difference may be barely noticeable, whereas the loud difference may be so annoying that the listener's enjoyment is destroyed. It is inappropriate for both these coding errors to yield a similar difference score.

Another issue when summing the audible difference surface is that 5 seconds of perfect quality audio will not cancel (or even halve the effect of) 5 seconds of degraded audio. This indicates that any mean difference measure will be misleading.

It is suggested in [Hollier et al, 1994] and [Hollier et al, 1995] that a measure of the distribution of the errors over the audible difference surface can be used to give an indication of perceived quality degradation. This measure is taken from video coding, where the entropy (or activity) within a video frame is used to predict the bitrate required to code that frame. For the audible difference, a low entropy value indicates that the difference is concentrated into small regions, and a high entropy value indicates that the difference is spread evenly throughout the audible difference surface. If the audible difference surface is e(i,j), then the audible difference area is given by:

E_a = \sum_{i=1}^{n} \sum_{j=1}^{m} |e(i,j)|    (8-5)

where there are n bands and m samples. The entropy is given by

E_e = \sum_{i=1}^{n} \sum_{j=1}^{m} a(i,j) \ln a(i,j)    (8-6)

where

a(i,j) = \frac{|e(i,j)|}{E_a}    (8-7)


Three possible uses of the entropy measure are given in [Hollier, 1994], though only one is demonstrated. In this solution, the error area and the error entropy are used to calculate a subjective score, thus:

Y = a + b \log(E_a) + c E_e    (8-8)

where the constants a, b, and c are chosen to match any desired subjective scale. The chosen scale is usually the mean opinion score, or diffgrade (discussed in Chapter 2). The examples in [Hollier, 1994] suggest that b should be approximately 1/10 the value of c.
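The entropy measure and score translate into a few lines of code. In this sketch the base of the logarithm in (8-8) is assumed to be 10, which is not stated in the text:

```python
import numpy as np

def error_entropy_score(e, a_const, b_const, c_const):
    """Error area Ea (8-5), error entropy Ee (8-6)/(8-7), and subjective
    score Y (8-8) for an audible difference surface e[band, sample]."""
    abs_e = np.abs(np.asarray(e, dtype=float))
    Ea = abs_e.sum()                       # equation (8-5)
    a = abs_e / Ea                         # equation (8-7)
    nz = a > 0                             # a.ln(a) -> 0 as a -> 0
    Ee = np.sum(a[nz] * np.log(a[nz]))     # equation (8-6)
    Y = a_const + b_const * np.log10(Ea) + c_const * Ee   # equation (8-8)
    return Ea, Ee, Y
```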

8.4 Audio Quality Assessment

In Chapter 4, the results of published subjective listening tests were compared against the predictions of the Johnston model. Several inconsistencies between human perception and the predictions of the Johnston model were discovered. However, the comparison was thwarted because the codecs employed in the published listening tests were different from the codecs used to generate the samples tested by the Johnston model.

A fair performance comparison is only possible if the model "hears" the same audio as the human listeners. To achieve this goal, the exact same audio extracts must be used in both the subjective test and the model's assessment.3 It is difficult to gain access to the audio extracts used in official listening tests, and a full-scale subjective test was not carried out as part of the present work for the reasons given previously. However, the author has taken part in two subjective tests carried out over the internet. This has the disadvantage that the test conditions are difficult to control, but the advantage that many interested and experienced volunteers are available to take part. As a participant, the audio extracts and full results are available to the author.

3 If the same CD track and same audio codecs are used for both tests, but the coded audio extracts are generated independently, then a fair comparison may not be possible. This is due to the inaccuracies of transferring digital audio from audio CD to hard disk. Though the numerical sample values can be transferred exactly, an indeterminate offset of several samples is usually introduced at the start of any track. Where an audio codec operates upon discrete blocks of data, a temporal offset will move the block boundaries, and may affect the quality of the coded audio. If the original audio is exchanged as a data file on a CD-ROM format disc, this problem is avoided, since the start of the file is known exactly.

The two tests focused on high quality (200 kbps and over) and medium quality (128 kbps) audio coding, as follows.

The audio extract employed in the high quality listening test consists of a solo hi-hat. The sharp transients of this signal make it very difficult to encode, especially by transform codecs such as MPEG-1 layer III, which temporally smear transient signals. The subjective results of this test are shown in Figure 8.4. For further details, see Appendix H. This extract is particularly testing for the model, because a previous version of the monophonic detector (Appendix J) performed poorly at the onset of sounds, where the temporal smearing is greatest.

The audio extract in the medium quality listening test is less demanding of the audio codecs. The subjective results of this test are shown in Figure 8.5. For further details, see Appendix I. Predicting the results of this test is a difficult task for the model, because different audio formats add noise in different ways, using different psychoacoustic models. It is a challenge for the model to predict the audibility of six different types of noise in a manner that matches human perception.



Figure 8.4: Subjective results of the high quality audio test (total subjective score for each codec: cbr192, abr224, r3mix, dm-xtr, mpc, dm-ins, cbr256, dm-std)

Figure 8.5: Subjective results of the medium quality audio test (average subjective score for each codec: WMA, AAC, MPC, XING, LAME)


Figure 8.6: Audible difference surfaces for extracts MPC (top left), DM-X (top right) and CBR192 (main). MPC is the only transparent extract on test.

For an explanation, please see Section 8.4.1.


Figure 8.7: Model predictions plotted against subjective scores for the high quality audio test (model prediction against total subjective test score)

8.4.1 High Quality Audio Test

Each audio sample from this test is processed via the model. Difference surfaces representing the audible difference between each coded extract and the original are generated. Three of the difference surfaces are illustrated in Figure 8.6. Each audible difference surface is processed as described in Section 8.3.1. The constants a, b, and c in equation (8-8) are chosen to match the model output to the scores from the subjective test. This is achieved by setting b = c / 10 and performing a linear regression. The resulting value represents the model's prediction of the subjective quality of the codec.
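With b constrained to c/10, the regression collapses to a two-parameter least-squares fit, as the following sketch shows; the function name and array handling are illustrative.

```python
import numpy as np

def fit_constants(Ea, Ee, subjective_scores):
    """Least-squares fit of equation (8-8) with the constraint b = c/10:
    Y = a + (c/10).log10(Ea) + c.Ee reduces to a two-parameter fit
    against the combined regressor x = log10(Ea)/10 + Ee."""
    Ea = np.asarray(Ea, dtype=float)
    Ee = np.asarray(Ee, dtype=float)
    x = np.log10(Ea) / 10.0 + Ee
    A = np.vstack([np.ones_like(x), x]).T
    coeffs, *_ = np.linalg.lstsq(A, np.asarray(subjective_scores, dtype=float),
                                 rcond=None)
    a, c = coeffs
    return a, c / 10.0, c        # a, b, c
```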

The relationship between the model predictions and the results of the subjective test is shown in Figure 8.7. Judging by this graph, it is possible that the correlation is not linear. However, the range of data in this high quality test is so small that the apparent curvature may be due to random differences. For a linear fit, R² = 0.8. If the worst data point is removed, R² = 0.9. This indicates that the model is predicting the human perception of audio quality within this test to a high degree of accuracy.


Figure 8.8: Model predictions plotted against subjective scores for the medium quality audio test (linear fit: y = –29.676x + 174.37, R² = 0.6104)

The success of the model in predicting human perception of this transient stimulus demonstrates that the problems encountered in the previous monophonic difference detector (Appendix J) have been solved.

8.4.2 Medium Quality Audio Test

The same procedure was carried out as for the previous test. Unfortunately, this test revealed a severe problem with the model. The present monophonic difference detector, which has been shown to perform well in a series of psychoacoustic tests (Chapter 5) and in the assessment of transient information (previous section), fails to correctly predict the perceived differences within real music signals.

This failure was discovered too late to engineer a solution, though the reasons for the failure and possible remedies are discussed in Section 8.5.

The previous version of the monophonic difference detector, described in Appendix J, is known to be over sensitive at the onset of sounds. However, this detector has predicted human perception with some success in previous tests (also in Appendix J). For this reason, the audio extracts from the medium quality audio test are compared via the old monophonic difference detector, yielding the correlation shown in Figure 8.8. This detector integrates the difference, thus a higher peak difference indicates poorer subjective quality. This negative correlation between peak difference and subjective quality is visible in Figure 8.8.

A linear fit provides an R² value of 0.6, which indicates that the model prediction and the subjective score show a "moderate" strength of correlation. It would be misleading to state that the model predicts human perception accurately, since the statistical analysis of the results in Appendix I suggests that the scores of 4.12 and 4.31 (right hand points in the graph) are significantly better than the others, whereas the model prediction does not show this.

However, the performance of the old monophonic difference detector is still promising. For example, the original calibration (Appendix J) set the threshold of audibility at a value of 25 model units. A linear regression of the data from the medium quality audio test is shown in Figure 8.8. The line of best fit intercepts the MOS value of 5.0 (transparent) at a value of 26.0 model units. This match is encouraging.

The new monophonic difference detector does not correctly predict human perception in this task. The reasons for this failure and possible remedies are discussed in the next section.

8.5 Difference detection error

Two separate monophonic difference detectors have been developed for use with the present model. The first is described in [Robinson and Hawksford, 1999] and Appendix J. The second is described in Chapter 5. The first detector correctly matched human perception in four psychoacoustic tests, and achieved some success in predicting the perceived quality of audio signals. However, the detector is overly sensitive to the onset of signals. This is a severe problem, because any (inaudible) softening or smearing at the onset of a sound generates a large calculated perceived difference. This restricts the applicability of this model to codec assessment, since some audio codecs smear the sound in the temporal domain by design (for example, WMA, as described in Appendix J).

8.5.1 Development of the new detector

To solve this problem, a new detector was developed, as described in Chapter 5. This detector is tuned to match human performance in seven psychoacoustic tests, including pre-masking.


The adaptation model [Dau et al, 1996a] that precedes the monophonic difference detection circuit responds vigorously at the onset of sounds. However, this feature in itself is not a fault, since the inner hair cells are known to respond in this manner. Thus, it is apparent that some of the information from the inner hair cells must be lost in internal processing. This is accounted for by internal perceptual noise.

During the development of the difference detector, it became apparent that no amount of internal noise could account for the measured pre-masking response, if the output of the adaptation model was correct. It was considered that the adaptation circuit might be in error. The adaptation model is a signal processing approximation to the hair cell response. However, accurate simulations of hair cell response (e.g. [Meddis, 1986a]) show the same vigorous activity at the onset of sounds. The design principle of the model is to match the function of the human physiology as closely as possible, and any reasonable reading of current physiological knowledge suggests that the signal at this point in the auditory system contains vigorous activity at the onset of sounds. Hence, the adaptation circuit was maintained, and another solution sought.

Internal amplitude noise, or internal uncertainty within the amplitude domain, could not account for pre-masking. However, internal uncertainty within the temporal domain could account for pre-masking. Further, a form of internal time-domain noise (jitter) could account for the difference in internal difference between tone masking noise and noise masking tone thresholds. This hypothesis is discussed in Chapter 5. Briefly, the "just detectable" internal difference is proportional to the time-domain variation of the hair cell firing probability. This was accounted for in the old monophonic detector by a variance measure, but it can be accounted for equally well by internal time-domain noise.

Time domain noise is physiologically possible. The signals upon the nerves are represented by pulse-density modulation, so time-domain noise is more likely than amplitude domain noise. The thresholds of three measured masking phenomena (temporal pre-masking, temporal post-masking, and noise masking tone) are accounted for by time domain noise. For these reasons, the mechanism is incorporated into the monophonic difference detector described in Chapter 5.

This difference detector matches human performance in seven psychoacoustic tests, and predicts human performance in an audio quality assessment task that includes a transient stimulus. Unfortunately, this detector fails in the audio quality assessment of a typical music signal. The variation within the hair cell firing probability is such that temporal jitter can be used to account for almost any difference between two signals. In other words, the jitter causes the model to be deaf to many differences between two music signals.

8.5.2 Possible solution

This problem can be solved by applying some of the knowledge gained during the development of the binaural model. The solution has not been tested, because it involves calibration of a new monophonic detector, which is not possible in the remaining time-scale of the project. However, the root of the problem does not lie within the monophonic detector, but in the module which directly precedes it.

The adaptation model of [Dau et al, 1996a] simulates temporal post-masking well, due to the hair cell response, but also due to the final 20 ms smoothing filter. This smoothing filter is an integral part of the model, and it probably simulates the monophonic temporal window, though this is not stated explicitly. There has been constant debate over recent years (e.g. [Oxenham, 2001]) as to whether temporal masking is due to the response of the hair cells, or a higher level temporal smearing process. [Dau et al, 1996a] avoid this debate by including the smoothing filter without explanation. Dau states that temporal pre-masking is due mainly to the Basilar Membrane filter response. It is shown in [Dau et al, 1996b] that the adaptation model underestimates temporal pre-masking thresholds, and this has been confirmed in the present work. Dau states that this may be due in part to the extremely high sensitivity of the adaptation model at the onset of signals, and this too has been confirmed by the present work. Finally, Dau states that this may be solved by limiting the strong onset response, but makes no suggestions as to how this may be achieved without changing the performance of the model in all other scenarios.

The jitter proposed in Chapter 5 solves this problem for simple psychoacoustic experiments, but not for audio quality assessment. However, the problem may be solved in an entirely different manner. Rather than ascribing temporal masking to the hair cells, let us assume that the hair cells only account for the non-linear change in temporal masking threshold with stimulus intensity and duration. Thus, the actual mechanism responsible for temporal masking must lie elsewhere. If [Oxenham, 2001] is correct, then there may be an internal temporal integration window. However, the experience gained in the development of the binaural model suggests that temporal windows within the auditory system are not single sided integrators, but double sided windows. If this is true for the monophonic temporal window, then the integrator at the output of the adaptation model from [Dau et al, 1996a] is inappropriate. Rather, the output of the 5-stage AGC mechanism should be smoothed by a double sided window. If this window is chosen to be asymmetric (c.f. the binaural temporal integration window), then it would correctly account for pre- and post-masking.

With this approach, no internal jitter is required, so the pre-masking can be accounted for without compromising the sensitivity of the model to differences within real music signals. Reverse time domain filtering may be employed in order to implement this temporal window, as is used within the binaural processor of Chapter 7.
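A sketch of this proposal follows: a causal one-pole filter provides the trailing (post-masking) skirt, and the same filter run over the time-reversed signal provides the leading (pre-masking) skirt. The example time constants and the equal-weight combination are placeholders that would require the calibration described above; they are not values from the thesis.

```python
import numpy as np

def one_pole(x, fs, tau):
    """Causal one-pole smoother (exponential decay, time constant tau)."""
    a = np.exp(-1.0 / (fs * tau))
    y = np.empty_like(x)
    acc = 0.0
    for n, v in enumerate(x):
        acc = a * acc + (1.0 - a) * v
        y[n] = acc
    return y

def double_sided_smooth(x, fs, tau_pre=0.005, tau_post=0.050):
    """Asymmetric double-sided exponential window: the causal pass spreads
    energy later in time (post-masking skirt), and the time-reversed pass
    spreads it earlier (pre-masking skirt)."""
    post = one_pole(x, fs, tau_post)
    pre = one_pole(x[::-1], fs, tau_pre)[::-1]   # reverse time domain filtering
    return 0.5 * (post + pre)                    # combination rule is an assumption
```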

8.5.3 Discussion

It is disappointing that the model cannot assess the perceived audio quality of a real music extract using the monophonic detector described in Chapter 5. To overcome this shortcoming, it is necessary to switch to the older detector, which offers reasonable results.

However, the "fault" lies in a section of the model which was developed elsewhere. A novel solution has been suggested which arises from another part of the present work, and is supported by the latest research [Oxenham, 2001].

Finally, it is worth noting that, just as the binaural temporal window may be a simplification of a more complex process, so the monophonic temporal integration may also approximate the underlying mechanism. This possibility has not been probed at the present time. It is reminiscent of the knowledge that was available at the turn of the last century about the nature of light. Some researchers argued that light must be a wave, others that light must be a particle. Without knowing the true answer, it was possible to explain the behaviour of light in most circumstances simply by assuming that it was a wave. Likewise, though there may or may not be simple monophonic and binaural temporal integration windows within the auditory system, assuming that such windows exist accounts for the vast majority of current knowledge, and will lead to a model that is capable of predicting human perception in a wide variety of tasks. The task of testing this hypothesis with the assessment of coded audio falls under the heading of further work.

8.6 Binaural quality assessment

The binaural accuracy of an audio system will have a relatively small effect on the overall perceived quality. This assumption will fail if the perceived location of the source is changed dramatically, but such a dramatic change will also register as a monophonic difference. Hence, the main application of the binaural difference measure is in detecting small problems in otherwise transparently coded audio extracts.

As described in Chapter 7, the binaural model can judge the general effect of an audio codec upon the stereo image. This property will be demonstrated.

8.6.1 Inter channel time delay

Two mechanisms may cause the stereo image to be shifted left or right. In the next section, inter channel amplitude difference is investigated. In this section, the effects of inter channel time delay are discussed.

Where a positive time delay is introduced into one of the stereo channels, the stereo image will shift towards the opposite channel. This also occurs for a person listening off-axis, such that the signal from one loudspeaker reaches them before that from the other. In this situation, the stereo image is “pulled towards” the nearer loudspeaker.

In order to test the binaural model's ability to detect such a change, integer sample time delays are added to one channel of a digital stereo recording. The signal consists of a live, acoustic recording that has a wide sound stage and an expansive acoustic. The backing instruments sound distant and diffuse, while the lead solo woodwind instrument is located just left of centre. This is exactly the kind of signal which previous models have been unable to localise, due to its highly tonal nature, and the lack of abrupt signal onsets.


Figure 8.9: Qualitative prediction of image shift as a function of inter-channel delay (number of leftwards and rightwards lateralisation changes detected, against right channel delay in samples)

The original signal, and copies of the signal with 2, 4, 6, and 8 sample inter-channel delays, are processed via the model. The anechoic HRTF method (Section 8.2.1) is used to generate ear signals from the channels. The monophonic model is used to generate the hair cell input to the binaural model, which in turn calculates the oversampled peak. The position of the oversampled peak for each time/frequency point of each signal is compared with the corresponding time/frequency point of the original signal. The directional accumulator values are computed for each pair. The counts of "leftwards" and "rightwards" changes in lateralisation are shown in Figure 8.9 for each version of the signal.

It can be seen that the relative number of leftwards and rightwards changes correctly predicts the direction of the image shift for all the test signals, and that the number of "leftwards" changes increases as the inter-channel delay increases.


8.6.2 Inter channel intensity difference

In most modern studio recordings, the location of the sound source is determined by the inter channel intensity difference. Any change in the overall level of one of the channels will similarly shift the image location in one direction. This phenomenon is not directly incorporated into the binaural model, which only processes interaural time delay. However, when a signal containing an inter channel intensity difference is replayed over stereo speakers, the interaction of the signals from both speakers at each ear gives rise to interaural time differences. The theory behind this statement can be found in [Bauer, 1961] and [Bernfield, 1973]. The result is that processing a signal containing an inter channel intensity difference via the anechoic HRTF method generates ear signals that contain the correct interaural time delay.

Figure 8.10: Qualitative prediction of image shift as a function of left channel attenuation (number of leftwards and rightwards lateralisation changes detected, against attenuation in dB)


This conversion from inter channel intensity difference to ITD is expected from theory, but will the model detect it in practice? To test this, one channel of a stereo music recording is attenuated by various amounts. A 0.5 dB inter-channel intensity difference is enough to cause a detectable change in source location4. The musical extract used in the previous test is also used here. The level changes applied to the right-hand channel are 0 dB, –0.5 dB, and –1.0 dB. The test methodology is identical to that used in the previous section. The number of leftwards and rightwards lateralisation shifts for each stimulus are shown in Figure 8.10.

Yet again, the larger directional accumulator variable for each signal indicates the perceived change in location. Also, the value of the variable increases as the amplitude difference is increased.

The change in location detected in this test appears more significant than that detected in the previous section. This matches human perception, where the change in location due to an inter channel intensity difference reproduced over loudspeakers is clearer than that generated by an inter channel time delay.

Thus, the model correctly predicts perceived changes in overall image location for a real music signal. It may be possible to calibrate the qualitative values to some quantitative indication of the change in perceived location, though this has not been attempted. Further study is required to judge the stability of the qualitative variables over different types of signal.

8.6.3 Stereo image width

Some audio codecs change the perceived width of the stereo image. Quantifying the perceived width of an image is a difficult task for a human listener. It is easy to hear differences, but quantifying those differences is more challenging.

4 The minimum detectable monophonic change in level is 1 dB. This is yet another phenomenon that is not detected by the monophonic model, but is audible to a human listener.


Figure 8.11: Qualitative prediction of image width as a function of channel mixing (number of inwards and outwards lateralisation changes detected, against channel mix percentage)

To test the ability of the model to judge a change in image width, a stereo signal with extremely wide separation is narrowed by mixing the two channels together slightly. The signal is a twin-track recording from the 1960s which has half the instruments on one channel, the other half on the other, and only ambience in the centre. Several copies of this signal are generated with the stereo channels mixed into each other to varying degrees, and all signals are processed and compared as above. In this test, the qualitative variables "in" and "out" are compared in Figure 8.11.

This is a very difficult test for an auditory model. In order to judge that the image has collapsed inwards, it must first have detected that there are sound sources at two spatially separated positions, and then detected that both sources move inwards. However, the qualitative variables correctly predict the perceived narrowing of the stereo image as the two channels are mixed.

8.7 Conclusion

Three methods for generating suitable “ear signals” for the model have been described, and the simplest of these was used throughout this chapter with some success. A statistical analysis has been introduced that aims to predict the perception of a human listener from the complex time varying outputs of the model. Two codec assessment tests were performed via the model, and assessed using this analysis. The model correctly predicted human performance in one test, but failed in the other. The reasons for this failure were determined, and a possible solution was proposed. Finally, the binaural model was used to assess the change in the stereophonic sound stage of a real music signal. The model was shown to accurately predict human perception of interaural time and level differences, and to correctly detect a change in the width of the stereo image.

9 Conclusion

9.1 Overview

In this chapter, the present work is reviewed. The performance of the model is critically assessed, and areas for further work are outlined. Finally, the original work within this thesis is listed.

In Chapter 2, the principles of psychoacoustic-based audio codecs were introduced, and traditional objective audio quality measurements were demonstrated to be inappropriate for the quality assessment of coded audio. The expensive and time-consuming nature of subjective tests necessitates the development of objective perceptual measurements, such as the one described in this thesis.

In Chapter 3, the processes within the human auditory system were described in detail, and sources of measurable performance limits were identified.

In Chapter 4, existing models of human hearing were critically assessed. The three areas where existing models are inadequate were identified as temporal masking, spatial masking, and binaural hearing.

In Chapter 5, a time-domain monophonic auditory model was developed, based upon the processing present within the human auditory system. This model was calibrated to match human performance in a wide range of psychoacoustic tests.


In Chapter 6, the procedures and results of an investigation into spatial masking were described.

In Chapter 7, a time-domain binaural auditory model was developed. This model used the monophonic model as a pre-processor, and together the two models form a combined predictor of human perception. The model was shown to predict binaural masking and localisation data, which suggested that both phenomena were due to the same internal process. The model was calibrated to match human performance using data from the spatial masking experiment.

In Chapter 8, the model was used for audio quality assessment. The monophonic model was found to correctly predict the human perception of some coded audio extracts, but not all. Solutions to this problem were suggested. The binaural model was shown to correctly predict human perception of spatial artefacts.

9.2 Critical assessment

The monophonic and binaural sections of the model have met with differing levels of success. Each section will be discussed in turn.

9.2.1 Monophonic model

The performance of the monophonic model is disappointing. Possible solutions to the explicit problems demonstrated in Chapter 8 were described therein. It should be noted that the solution to the problem arises from the binaural work, and is supported by recently published psychoacoustic research.

The decision was taken at the start of the project to develop a novel monophonic model based upon the processing within the human auditory system. This approach is often used in models designed to predict human performance in psychoacoustic experiments, but not in the field of audio quality assessment. Had the author chosen to implement an existing monophonic quality assessment model, then its performance would have been guaranteed. However, by employing a new technique in this field, several lessons were learned.

Firstly, the computational burden due to a complex processing model should not be underestimated. Whilst increased complexity may be acceptable if it delivers a corresponding increase in performance, the increases are not proportionate.


Secondly, the accurate simulation of the processes within the human auditory system may impede the accurate prediction of human perception. This statement is superficially contradictory, but it arises because our knowledge of the human auditory system is incomplete. It is possible to generate an accurate representation of the signal at one point within the human auditory system, but to have no knowledge of the subsequent processing. Hence, at the point where this signal must be interpreted, present knowledge of the human auditory system ends. A self-training neural network is one solution to this problem, but not transforming the audio into this representation in the first place is another attractive solution.

Comparing the neural signals to detect the just audible difference between two stimuli is successful. This was demonstrated in Chapter 5 by the performance of the monophonic model in psychoacoustic tests. However, processing the neural signals in order to quantify a large perceived difference is a more challenging task. One possible solution is to use the model presented herein to determine if there is any difference between two audio signals, and to use a conventional model of audio quality assessment to quantify that difference. This may yield increased performance in the assessment of near transparent codecs. In truth, such a method may add little to the prediction of diffgrades, but it may detect the small differences that some listeners claim to hear, which may not be audible within repetitive subjective tests due to listener fatigue.

Thirdly, a system cannot perform more accurately than its weakest link. This was demonstrated during the development of a perceptual loudness meter, based upon the monophonic model. This loudness meter is used in the Replay Gain Proposal, reproduced in Appendix K. By combining the processing of the monophonic model with the model of [Moore, 1997], a very accurate prediction of instantaneous perceived loudness may be computed. However, for a real music signal, the perceived loudness changes dynamically over time. The manner in which the auditory system integrates instantaneous perceived loudness to yield an overall loudness measure is not well understood. [Zwicker, 1984] suggests that a high percentile value of the ordered instantaneous loudness estimates may yield a good prediction of overall perceived loudness, and this approach works well in practice. However, over the entire length of a typical piece of music, this calculation makes the accuracy of the preceding processing redundant. This final stage is so approximate that the overall performance is only slightly reduced if most of the preceding processing is removed. This reduces the computational burden dramatically.
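The percentile statistic itself is a one-line computation; the 95th percentile used in this sketch is an illustrative choice, not a value taken from the thesis:

```python
import numpy as np

def overall_loudness(instantaneous_loudness, percentile=95.0):
    """Overall loudness as a high percentile of the ordered instantaneous
    loudness estimates, after [Zwicker, 1984]."""
    return np.percentile(np.asarray(instantaneous_loudness), percentile)
```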


A similar situation arises within the monophonic model. The adaptation circuit calculates the response of the inner hair cells to a high degree of accuracy, and this signal can be used to predict human performance within psychoacoustic experiments. However, the predictions of the human perception of real music are less accurate. Hence, if this is the intended use of the model, it may be appropriate to use a simpler hair cell simulation in order to increase computational efficiency, without compromising performance. If this suggestion is added to the previous one, it is possible that removing the hair cell simulation would actually improve the model's performance in audio quality assessment, although its performance in psychoacoustic tests would be degraded.

Finally, though the monophonic model does not perform as well as intended in audio quality assessment tasks, it does form a vital pre-processor for the binaural model. The accuracy of the monophonic model in simulating the auditory nerve signal is crucial to the excellent performance of the binaural model. The success of the binaural model demonstrates that, in some respects, the monophonic model is performing acceptably.

9.2.2 Binaural model

The performance of the binaural model is very encouraging. The model successfully predicts the perceived lateralisation of a sound source in a time-domain manner. Further, the model correctly detects just noticeable changes in source location, and just detectable binaurally unmasked sounds, using the same detection mechanism.

Previous models either did not operate in the time domain, or were only applicable to a small number of artificial stimuli. The model described in this thesis can accurately judge the perceived spatial qualities of real music stimuli, as demonstrated in Chapter 8.

9.3 Further work

Appropriate solutions to the problems encountered in the monophonic model were discussed in Chapter 8, and these may be implemented to improve the functionality of this part of the model.


The binaural model can detect and judge a change in source location. This is all that is required in order to detect any degradation in the spatial properties of an audio signal due to an audio codec. However, the binaural model could form the basis of a localisation model, which not only detects changes in location, but can also predict the absolute location of a sound source.

The mapping of the lateralisation estimate into a perceived source angle is trivial (a simple illustration is sketched below), but two further modifications are recommended in order to accurately predict the perceived location of a sound source.

Firstly, the role of head movements in localisation is vital. [Theiss and Hawksford, 1999] suggest that localisation estimates should be made for three positions of the head: facing straight forward, and at ±5°. This is appropriate, even for a listener who is apparently motionless, since even small head movements yield important binaural clues. This was demonstrated in Chapter 6. Forming a localisation judgement based upon the three head orientations will remove much of the front/back confusion that arises from “head clamped” listening.

Secondly, though the model operates well without an interaural intensity difference detector, this quantity is important in determining the perceived location. It is suggested in [Brungart et al, 1999] that comparison of the ITD and IID cues may yield a prediction of perceived source distance. Thus, by combining head movement, ITD, and IID in an appropriate manner, the model may predict the true 3-D location of the sound source, not just its angular bearing.

The mapping of ITD to perceived location must be frequency dependent. This will account for the dominant components of the measured HRTF response. If a more accurate method of processing the HRTF cues is required, then a neural network may be employed, as suggested in [Nandy and BenArie, 1996].
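For illustration, the simplest form of the mapping described at the start of this section can be sketched as follows, using the classical spherical-head relation ITD = (a/c)(θ + sin θ) and inverting it by bisection. The relation, the head radius, and all names are assumptions introduced here for demonstration; the frequency dependent mapping recommended above would refine this considerably.

    import math

    HEAD_RADIUS = 0.0875    # metres; assumed average head radius
    SPEED_OF_SOUND = 343.0  # metres per second

    def itd_to_angle(itd_seconds):
        # Invert ITD = (a/c) * (theta + sin(theta)) by bisection, returning
        # the bearing in degrees (0 = straight ahead, +/-90 = fully lateral).
        # A sketch only; the model's own mapping may differ.
        target = abs(itd_seconds) * SPEED_OF_SOUND / HEAD_RADIUS
        lo, hi = 0.0, math.pi / 2.0
        for _ in range(50):
            mid = 0.5 * (lo + hi)
            if mid + math.sin(mid) < target:
                lo = mid
            else:
                hi = mid
        angle = math.degrees(0.5 * (lo + hi))
        return math.copysign(angle, itd_seconds)

    print(itd_to_angle(0.0))       # 0 degrees: centred
    print(itd_to_angle(0.00065))   # close to fully lateral for this head radius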

Finally, the binaural model’s directional accumulators give an indication of the overall change in perceived source location, as demonstrated in Chapter 8. The relationship between source movement and the values of the variables is somewhat signal dependent, but it would be interesting to attempt to calibrate these variables to real world positions. Alternatively, it may be possible to transform the directional accumulator values into a single MOS-type indicator of perceived binaural impairment.

9.4 Original work

To conclude, the original work within this thesis is as follows:

Chapter 5:
The implementation of an amplitude dependent filter bank.
The use of an accurate inner hair cell simulation within an audio quality assessment model.

Chapter 6:
The development of the psychoacoustic experimental tool audiotest.
The investigation into spatial masking.

Chapter 7:
The entire binaural model, especially:
The use of an Element by Element Vector Multiplication as an approximation to internal binaural processing.
The use of a double-sided exponential binaural temporal window.
The use of a time-reversed double IIR filter as an efficient implementation of the double-sided exponential window (sketched below).
The unification of detection and lateralisation in accordance with Webster’s hypothesis.

Chapter 8:
The ability of the binaural model to correctly perceive changes in the stereo soundstage of real music signals.
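As an aside on the final Chapter 7 item above: a double-sided exponential window can be applied by running a one-pole IIR smoother forwards over the signal, and then running the same filter over the time-reversed result, since the cascade has an impulse response proportional to exp(−|n|/τ). The Python sketch below is a minimal illustration of this idea only; the function name, normalisation, and time constant are assumptions, not the implementation used in the thesis.

    import numpy as np

    def double_sided_exponential_smooth(x, tau_samples):
        # Forward one-pole IIR pass followed by the same pass run backwards.
        # The cascade's impulse response is a two-sided exponential (up to gain).
        a = np.exp(-1.0 / tau_samples)  # pole of the one-pole smoother
        y = np.array(x, dtype=float)

        acc = 0.0
        for n in range(len(y)):                 # forward pass
            acc = (1.0 - a) * y[n] + a * acc
            y[n] = acc

        acc = 0.0
        for n in range(len(y) - 1, -1, -1):     # time-reversed pass
            acc = (1.0 - a) * y[n] + a * acc
            y[n] = acc
        return y

    # Smoothing a unit impulse reveals the symmetric double-sided window shape.
    impulse = np.zeros(21)
    impulse[10] = 1.0
    print(np.round(double_sided_exponential_smooth(impulse, tau_samples=3.0), 4))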

Acknowledgements

Many thanks to all of the people who have given advice, guidance, knowledge, support and friendship during this research. Special thanks to Woon Seng Gang, Martin Reed, Malcolm Hawksford, and my loving and supportive wife, Rebecca Robinson.

Thanks also to Steven Huxford, Steven Sumpter, Kelvin Foo, and Andrew Rimell, who subjected themselves to aural torture in the name of research.

The people who have worked in the Audio Research Lab during my time here deserve a special mention. Some helped, some distracted, but all were good company. With apologies to anyone that I have missed out, thanks to Richard Greenfield, Ian Speake, Tan Chew Keong, Goh Yi Wui, Beny, So Ching, Christian Kanakis (aka C. Linton-Frost), Raoul Glatt, Vincent Borel, Antonis Gougoulis, Daisuke Koya, Matthew Williams, Paul French, and Godwin Bainbridge.

Finally, I must give credit to those friends who could only distract me from working. Stuart Wilkinson, Simon James, Katie Shore, Emma Taylor, Becky Skinner, Richard Potter, Stephen Lucas, Karen Miller, Simon Lucas, Caz Jackson, and Fozzie. Life was always work and play, and both have been great.

A

PSYCHOACOUSTIC MODELS AND NON-LINEAR HUMAN HEARING

This appendix contains an investigation of in-ear distortion, published by the author during the present research.

In-ear distortion is a special instance of in-ear generated sound. The production of extra signal components within the ear may theoretically alter the perceived performance of psychoacoustic-based codecs. In this appendix, the loudest in-ear distortion product, known as the cubic distortion tone, is investigated in detail. It is shown that this measurable in-ear distortion has little impact upon the perceived quality of audio codecs.

The original reference for this paper is

Robinson, D. J. M., and Hawksford, M. O. J. (2000). “Psychoacoustic models and non-linear human hearing.” preprint 5228, presented at the 109th Convention of the Audio Engineering Society, Los Angeles, September 2000.

The paper is reproduced here in its entirety.


PSYCHOACOUSTIC MODELS AND NON-LINEAR HUMAN HEARING

David J M Robinson, Malcolm O J Hawksford

Centre for Audio Research and Engineering
Department of Electronic Systems Engineering
The University of Essex
Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
PHONE: +44 (0)1206 872929  FAX: +44 (0)1206 872900
e-mail: [email protected], [email protected]

Abstract - Non-linearity in the human ear can cause audible distortion not present in the original signal. Such distortion is generated within the ear by inter-modulation of a spectral complex, itself containing possible masked components. When psychoacoustic codecs remove these supposedly masked components, the in-ear-generated distortion is also removed, and so our listening experience is modified. In this paper, the in-ear distortion is quantified and a method suggested for predicting the in-ear distortion arising from an audio signal. The potential performance gains due to incorporating this knowledge into an audio codec are assessed.

A.1 Introduction

Perceptual audio codecs aim to discard signal components that are inaudible to human listeners. Typical codecs (e.g. [ISO/IEC 11172-3, 1993]) calculate theoretical masking thresholds from quasi-linear models of the human auditory system. It is assumed that any signal components below the masking threshold may be disregarded, without causing any audible degradation of the signal. However, the ear is a non-linear device, and linear models of masking do not account fully for its behaviour. In particular, signal components that are predicted to be inaudible by a linear analysis are found to be very audible to a real, non-linear, human ear.


Figure A.1: Masking of stepped tones due to 800Hz tone, and resulting cubic difference tones

This concept first came to the authors’ attention through the following example. The signal illustrated in Figure A.1 is intended to demonstrate spectral masking. An 800 Hz tone and a series of tones, ascending from 400 Hz to 1600 Hz, are presented simultaneously. If the amplitude of the stepped tones is smaller than that of the 800 Hz tone (e.g. 20-40 dB down), then in the region around 800 Hz, they will be inaudible, as they are masked by the louder tone. However, as the inaudible stepped tones pass above 800 Hz, listeners often perceive a second series of tones, descending in frequency, below 800 Hz. These are illustrated by the shaded blocks in Figure A.1. They are not present in the actual signal, but are generated by distortion within the human auditory system.

As the ascending stepped tones are supposedly masked at this point, this raises an interesting question: If an audio codec removes inaudible sounds, what will be the effect if it removes these masked tones? Surely, the (audible) descending distortion tones will also be removed, thus changing what a human listener hears. This is precisely what a good audio codec should not do. The audible effect, especially for more complex audio signals, may be slight. However, transparent audio coding claims to make no audible change to the signal whatsoever, so this effect merits investigation.

This paper will focus on the most audible distortion component generated by the human ear: the cubic distortion tone (CDT). Possible methods of determining the amplitude and frequency of this internal distortion tone will be discussed, and an equation that accurately predicts these properties will be presented. The CDTs generated by two tones will be examined for the case where one tone is below the masking threshold predicted by a psychoacoustic model, and the audibility of the resulting CDT will be determined. The true nature of the masking threshold will be discussed, and the extent to which the CDT can mask other spectral components will be examined. Finally, the relevance of these theoretical calculations to real world applications will be assessed. The study commences by examining the properties of the cubic distortion tone.

A.2 The Cubic Distortion Tone

The frequency of the cubic distortion tone, arising from two primary frequency components, f1 and f2 (f1 < f2), is given by

f_{CDT} = 2 f_1 - f_2 \qquad (A-1)

The cubic distortion tone (CDT) is so called because a difference tone at this frequency is generated by a 3rd order polynomial transfer function. An early hypothesis [Helmholtz, 1870] suggested that the bones in the middle ear were responsible for such a transfer function, thus giving rise to the distortion component given by equation (A-1). However, it is now widely believed that the cubic distortion tone is generated within the cochlea, by the action of the outer hair cells [Probst et al, 1991]. These hair cells are part of the cochlea amplifier – a mechanism whereby the Basilar membrane motion due to incoming sound waves is varied by an active process, which is beyond the scope of this paper. Further details may be found in [Robinson and Hawksford, 1999] and [Yates, 1995] but here it suffices to understand that this gain control function of the ear generates, as a by-product, the cubic distortion tone.

Though calculating the frequency of the CDT is trivial, determining the amplitude or perceived loudness of this tone is a more difficult task. There are three possible methods of gathering this data from human listeners, which will now be discussed in turn.

A.2.1 Distortion-product otoacoustic emissions

An otoacoustic emission is a sound generated by the ear, which can be detected by objective rather than subjective means. In our present study, the otoacoustic emission is due to two external tones yielding a distortion product within the ear, hence the name.


In 1979 it was discovered that the cubic distortion tone can be detected by a probe microphone inserted into the ear canal of a listener who is presented with two appropriate primary tones (see [Kemp, 1979]). The fact that the cochlea-generated tone propagates back through the auditory system, into the ear canal, allows the amplitude and phase of the CDT to be recorded, without relying on subjective feedback from the listener.

Figure A.2 shows a diagram of the apparatus that may be used. The two primary tones, f1 and f2, are generated by two separate loudspeakers. Two loudspeakers are used to prevent any distortions that may be created by the speaker itself, if two tones were generated by a single device. The two signals are fed via rubber tubes into an earpiece containing a miniature microphone. The signals first mix acoustically in the ear canal, and the levels of the primaries are calibrated from the microphone in-situ.

Figure A.2: Apparatus used for the measurement of DPOAEs

By varying the amplitude and frequency of the two primaries, it is theoretically possible to map the complete CDT response of the human auditory system (e.g. [Popelka et al, 1993]). However, comparing the reported subjective level of the CDT with the measured DPOAE level reveals a large, frequency dependent difference.

The problem lies in the transmission of the CDT from the site of origin within the cochlea, back through the auditory system via the middle ear, into the ear canal. It has been suggested [Kemp, 1980] that this reverse path accounts for a 12 dB loss for frequencies around 1–1.5 kHz, and that the loss increases at around 12 dB/octave either side of this frequency region. Unfortunately it is not possible to measure the transfer function of this reverse path in any direct manner, so it can only be inferred by comparing measured and subjective data.

Thus the DPOAE fails to yield an objective, absolute measure of the amplitude of the CDT within the human cochlea. As calibration of the DPOAE relies on subjective data, it seems sensible to turn to that subjective (psychoacoustic) data as an indication of the amplitude of the CDT. Two main psychoacoustic methods have been used to determine the CDT level caused by given stimulus conditions, as follows.

A.2.2 Loudness matching

In this method, a listener is instructed to match the loudness of the CDT with that of a probe tone of the same frequency presented externally, but non-simultaneously. This subjective judgement is made more reliable by pulsing the primary tones (and hence the CDT) and the probe tone alternately, as shown in Figure A.3. Thus, when the level of the internal CDT and the external tone are matched, the listener will hear a continuous tone, whereas if the probe level is too high or too low, then the pulsing will be clearly audible. Data from such an experiment [Smoorenburg, 1972] will be referred to later in this paper.

Figure A.3: Measurement of CDT level by pulsing primary tones and probe tone alternately

A.2.3 Cancellation Tone

The second subjective method of determining the level of the CDT is to attempt to cancel the difference tone using an external tone. In addition to the two primary tones, the listener is presented with a third tone - the amplitude and phase of which are completely under their control. A highly trained listener can adjust these two parameters until the internal CDT is completely cancelled out by the external tone. At this point, the amplitude of the external tone is assumed to match that of the internal CDT. One advantage of this method is that the phase of the internal tone can also be calculated, as being 180º out of phase from the external tone.

Many experimenters have employed this method, e.g. [Goldstein, 1967], [Hall, 1972] and [Smoorenburg, 1972]. Several important features are:

1. The CDT level determined via the cancellation tone method can be used to predict the masking due to the CDT to within 2 dB [Zwicker and Fastl, 1973].

2. The phase prediction via this method is an “equivalent external phase” and its relationship to the actual internal phase of the CDT is not known.


3. A complex formula has been produced to calculate the level of CDT for any pair of primary tones [Zwicker, 1981]. Such a formulaic prediction is vital if this phenomenon is to be usefully incorporated into a masking model.

4. Discrepancies exist between the CDT level as measured by this method, and that measured by the loudness matching method. The cancellation method can produce CDT predictions up to 15 dB higher than the loudness-equivalent method.

To explain this final point briefly, the primary tone f1 is thought to suppress the cancellation tone, such that the required cancellation tone level is larger than the perceived CDT. A long discussion of this phenomenon is given in [Giguère et al, 1997].

Thus, a wide range of data is available that quantitatively describes this phenomenon. Until the reverse transfer function from the cochlea to the ear canal (in the presence of auditory stimulation) has been determined, the DPOAE data, though numerous, and objective, are not suitable for the present study. This leaves the two subjective measures. The first is believed to correlate well with what we perceive, the second predicts the masking due to the combination tone well. The difference between the two can be up to 15 dB. Due to the larger amount of data available to the authors from the second type of study, the cancellation method is chosen to provide the reference CDT levels throughout the rest of this paper, with the proviso that the real level may be slightly lower.

A.3 Modelling the CDT

As mentioned previously, there exists a formula [Zwicker, 1981] for calculating the level of the 2f1-f2 cubic distortion tone LCDT for any given f1, f2, L1, and L2, where L1 and L2 are the levels, in dB, of f1 and f2 respectively. However, this formula is rather complex, and includes several logarithms. To improve the computational efficiency of the formula, and to gain a clearer insight into how the CDT varies with the various parameters, the authors developed their own formulae, which are presented here. The level dependent data used to tune this formula were the same as those presented in [Zwicker, 1981]. The frequency dependent data were taken from [Goldstein, 1967], cancellation-tone results only.


The level, in external dB SPL equivalent, of the cubic distortion tone is given by

L_{CDT} = \frac{L_2}{2} - (0.07 - 0.00055 L_2)\left(L_1 - L_2 - \frac{400}{L_2}\right)^{2} - \Delta z^{2} - \left(0.19 z_1^{3/2} - 3.5 z_1 + 22\right)\Delta z + 19.6 \qquad (A-2)

Where

\Delta z = z_2 - z_1 \qquad (A-3)

And z_n represents frequency f_n in the bark domain, thus:

z_n = 13\tan^{-1}\!\left(0.76\,\frac{f_n}{\mathrm{kHz}}\right) + 3.5\tan^{-1}\!\left(\left(\frac{f_n}{7.5\,\mathrm{kHz}}\right)^{2}\right) \qquad (A-4)

This equation is taken from [Zwicker and Terhardt, 1980], and was used for the following graphs, for consistency with [Zwicker, 1981]. A more accurate (and more computationally efficient) formula can be found in [Traunmüller, 1990], which has the advantage of being invertible (i.e. it yields f from z as well as z from f). The formulae include frequencies in the bark domain because most psychoacoustic audio codecs process the frequency information in the bark domain when considering the masked threshold.

Equation (A-2) matches the measurements from human subjects for stimulus levels from 30–90 dB, and for frequencies of 1 kHz or above. This amplitude range corresponds to that for which human response data were available; however, the equation behaves well outside this range, and gives realistic values (though for levels which would destroy human hearing, the predicted L_{CDT} is doubtful!). Any calculated L_{CDT} below the threshold of hearing will be inaudible. Also, any L_{CDT} masked by the primary tone f1 may be inaudible, though beats between the CDT and f1 may themselves be audible.


The just audible L_{CDT}, derived from the minima of [Zwicker, 1979], is given by

L_{MIN} = \frac{L_1}{2} - 15 \qquad (A-5)

Below 1 kHz the equation still follows the same trend as human subjects, but in this region the human response varies dramatically, especially for lower frequencies. If a more accurate prediction of human perception is required, the frequency dependent term in equation (A-2) may be replaced, thus:

L_{CDT} = \frac{L_2}{2} - (0.07 - 0.00055 L_2)\left(L_1 - L_2 - \frac{400}{L_2}\right)^{2} - \Delta z^{2} - \left(14 - 2.5\sin^{3}\!\left(\frac{\pi f_1}{77}\right) z_1^{4/3}\right)\Delta z + 19.6 \qquad (A-6)

The one limit to both equations (A-2) and (A-6) is that for ∆z < 0.45 no CDT is audible, as it merges into the lower primary tone. This is not indicated in LCDT as calculated, and must be checked separately via equation (A-3).
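The formulae of this section translate directly into code. The following Python sketch is an illustrative transcription of equations (A-1)-(A-5) as printed above; the function names are assumptions, and the sketch should be checked against the printed equations before serious use.

    import math

    def bark(f_hz):
        # Hz to bark, equation (A-4) [Zwicker and Terhardt, 1980].
        f_khz = f_hz / 1000.0
        return 13.0 * math.atan(0.76 * f_khz) + 3.5 * math.atan((f_khz / 7.5) ** 2)

    def cdt_frequency(f1, f2):
        # Equation (A-1): frequency of the cubic distortion tone.
        return 2.0 * f1 - f2

    def cdt_level(f1, f2, l1, l2):
        # Equation (A-2): CDT level in dB SPL equivalent. Returns None when
        # delta-z < 0.45 bark, where the CDT merges into the lower primary
        # and the formula does not apply.
        z1 = bark(f1)
        dz = bark(f2) - z1  # equation (A-3)
        if dz < 0.45:
            return None
        return (l2 / 2.0
                - (0.07 - 0.00055 * l2) * (l1 - l2 - 400.0 / l2) ** 2
                - dz ** 2
                - (0.19 * z1 ** 1.5 - 3.5 * z1 + 22.0) * dz
                + 19.6)

    def cdt_audible(f1, f2, l1, l2):
        # Compare against the just audible level of equation (A-5).
        l_cdt = cdt_level(f1, f2, l1, l2)
        return l_cdt is not None and l_cdt > (l1 / 2.0 - 15.0)

    # Example: a 1 kHz primary at 85 dB with a 1.1 kHz primary at 60 dB.
    print(cdt_frequency(1000.0, 1100.0))            # 900 Hz
    print(cdt_audible(1000.0, 1100.0, 85.0, 60.0))  # True: the CDT is audible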


Figure A.4: Variation in LCDT with L1 and L2 primary tone levels. Upper (a): levels measured via the cancellation method from human subjects [Zwicker, 1979]; lower (b): levels calculated using equation (A-2).


Figure A.5: Variation in LCDT with f1 and ∆z. Plot (a): levels measured via the cancellation method from human subjects [Goldstein, 1967]; plot (b): levels calculated using equations (A-2)-(A-6).


Figure A.4 and Figure A.5 show a comparison of the CDT levels measured from human subjects and the CDT levels predicted by equations (A-2)-(A-6). Thus the formulae are shown to be excellent predictors of the cancellation-tone measured cubic distortion tone level over a wide variety of stimulus conditions.

A.4 Psychoacoustic Codecs and the Cubic Distortion Tone

The task of a psychoacoustic-based codec is to reduce the amount of data required to represent an audio signal, whilst minimising the audible difference caused by this data reduction, by exploiting the properties of the human auditory system.

A typical psychoacoustic codec will calculate the theoretical masking threshold of the incoming audio signal on an instant by instant basis. Any frequency components below this masked threshold are assumed to be inaudible. Thus a signal to mask ratio can be derived by comparing the masked threshold with the actual signal level at each frequency. Then, depending on the number of bits available to code the signal, the codec can determine which frequency bands are most audible, and require accurate coding; and which contain no audible information, and can be filled up to the masking threshold with quantisation noise, or ignored.
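As a generic illustration of this allocation loop (and not the MPEG-1 algorithm itself), the Python sketch below gives bits greedily to whichever band’s quantisation noise lies furthest above its masking threshold, assuming each allocated bit buys roughly 6 dB of noise reduction. All names and figures are assumptions made for demonstration.

    def allocate_bits(signal_db, mask_db, total_bits, db_per_bit=6.0):
        # Greedy bit allocation from per-band signal levels and masking
        # thresholds (all in dB). Quantisation noise is modelled as starting
        # at the signal level and dropping db_per_bit for every bit spent.
        n = len(signal_db)
        bits = [0] * n
        for _ in range(total_bits):
            nmr = [signal_db[b] - bits[b] * db_per_bit - mask_db[b]
                   for b in range(n)]                 # noise-to-mask ratios
            worst = max(range(n), key=lambda b: nmr[b])
            if nmr[worst] <= 0.0:
                break  # every band's quantisation noise is already masked
            bits[worst] += 1
        return bits

    # Example: three bands; the well-masked middle band needs almost no bits.
    print(allocate_bits(signal_db=[80.0, 60.0, 75.0],
                        mask_db=[40.0, 55.0, 50.0],
                        total_bits=16))  # e.g. [7, 1, 5]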

There are two situations where knowledge of the cubic distortion tone may improve the accuracy of this masking calculation. In the first, the codec may incorrectly remove a supposedly inaudible frequency component that creates an audible CDT within the auditory system. This mistake can be prevented by calculating the CDT level due to a dominant frequency and the closest masked spectral component, and retaining the masked component if the CDT is audible. Secondly, if the CDT is large, it may itself create masking, and so yield a higher masking threshold than traditional masking threshold measures. Here, knowing the presence of the CDT may save some bits, or free some bits to encode an audible part of the signal spectrum. Each situation will be considered in turn.


Figure A.6: Masking threshold predictions for 1 kHz tone (MPEG-1 layer 1 model, Johnston model, 1 kHz tone @ 85 dB, and MAF).

A.4.1 Masked primary tone

Consider a single 1 kHz tone @ 85 dB. This is an uninteresting (and unchallenging) signal to code; however, it serves as a good example of how, even in a simple situation, a codec may remove an audible frequency component. Figure A.6 shows the masking threshold of the 1 kHz tone as predicted by two psychoacoustic models: the classic Johnston model [Johnston, 1988b] and the MPEG-1 Psychoacoustic model I [ISO/IEC 11172-3, 1993]-D. Though these are two of the simplest psychoacoustic models, the methods employed in these codecs are widely used. The lower masking thresholds predicted by the Johnston model are due to that model’s level of masking reduction for a pure tone – the MPEG-1 model reduces the masking prediction by around 5 dB, the Johnston model by around 25 dB, relative to the masking produced by a similar amplitude noise.


Figure A.7: Cubic Distortion Tone at masked threshold for L1 > L2 (1 kHz tone @ 85 dB, 1.1 kHz tone @ 65 dB, CDT @ 43 dB).

A masked tone may lie in the frequency region above or below the masker, as long as it falls under the masking threshold curve. First, consider a masked tone of 1.1 kHz, i.e. one that is higher in frequency than the masker. If the level is set at the threshold of masking predicted by the MPEG-1 model (see Figure A.7), then the resulting CDT is also below the predicted masking threshold. So our MPEG-1 model predicts that the tone at 1.1 kHz has no audible effect, either due to itself, or due to the resulting CDT. However, equation (A-5) suggests that the CDT will be audible at this level, and human listeners confirm this. The Johnston model predicts the masking threshold at 1.1 kHz to be 56 dB, and this matches the f2 level at which the 900 Hz CDT is just audible. In this instance it would seem calculating the CDT merely confirms the masking prediction of the Johnston model, but shows that the MPEG-1 model is incorrect.

At a slightly lower f2 frequency of 1.08 kHz @ 60 dB, the CDT is just audible (at 36 dB¹), whereas both models predict that the CDT and the f2 primary tone are masked. Thus both psychoacoustic models fail to predict an audible component in this case.

¹ ∆z = 0.496 (equation (A-3)); if f1 and f2 were any closer, the CDT would be indistinguishable – see Section A.3.


Figure A.8: Cubic Distortion Tone at masked threshold for L1 < L2 (1 kHz tone @ 85 dB, 0.9 kHz tone @ 52 dB, CDT @ 26 dB).

Now, consider the condition where the “masked” tone is at a lower frequency than the masker. Taking a 1 kHz tone @ 85 dB, a tone is added that the psychoacoustic models predict to be masked – a 900 Hz tone @ 52 dB. The resulting difference tone, 800 Hz @ 26 dB, also lies under the predicted masking curve, as illustrated in Figure A.8. However, it is known from the formulae outlined in Section A.3, and from actual experimental data, that this CDT is audible.

These examples prove that there are possible 2-tone combinations where the quieter tone, though the psychoacoustic models predict that it is inaudible, does make an audible contribution to the sound in the form of a cubic distortion tone at 2f1-f2. Figure A.9 shows the regions over which such a (theoretically masked) second frequency component will yield an audible CDT. In effect, the shaded area under the masking curve indicates the region over which the non-linearity of the ear will unmask sounds. At 8 kHz (Figure A.9(b)) this region is smaller, but still present for the MPEG-1 psychoacoustic model.


Figure A.9: Region where CDT unmasks f2: (a) for 1 kHz tone; (b) for 8 kHz tone.


Figure A.10: Spectrum of recorder note: (a) original, (b) coded mp3 @ 96kbps

A.4.1.1 Real World Applications

Though it has been shown that the CDT generated by non-linear properties of the human ear may cause unmasking in certain theoretical conditions, there are a number of issues to consider with respect to using this knowledge in a real-world audio codec.

Firstly, the CDT is generated by two tones. There is also a similar effect generated by two bands of noise, but it is at a much lower level [Probst et al, 1991]. Thus, the CDT is only relevant for highly tonal signals. Such signals are of a relatively low complexity, and in many instances, require fewer bits to code transparently than a more noise-like signal. If a fixed bit-rate codec finds that there are bits to spare after encoding the most prominent spectral peaks, it may allocate some bits to “just masked” spectral peaks. As our unmasked f2 is always within 15 dB of the masking threshold, it is likely that the codec may allocate some bits to it, even though it is “inaudible”, since there are bits to spare and a tonal component near threshold represents a sensible allocation of those bits.

There may be very few highly tonal signals where spectral peaks are close enough to generate a CDT. Figure A.10 shows the spectrum of a recorder note, taken from a commercial CD, and also the spectrum of a poor quality coded version of it. Note the spectral peaks just below the fundamental tone, caused by reverberation of the previous notes. These are closely spaced, and may cause a CDT, though the harmonic structure of a single note (without the echo of previous notes) does not have such closely spaced frequencies. In this example, the three largest spectral peaks below the fundamental are all just above the masking threshold (as predicted by the Johnston and MPEG-1 models), so it is not surprising that the MPEG-1 layer 3 codec retains them. The authors are unaware of any recordings containing spectral peaks such as these that fall just below the predicted masking threshold, but are unmasked by CDT, though they may exist. An automated search for such situations can only be achieved by incorporating CDT detection into a psychoacoustic codec. This task has not been attempted, and is hampered by the fact that the tonal/noise-like discrimination in the two codecs discussed herein does not correctly identify the f2 components of Section A.4 as tones, but incorrectly classes them as noise-like signals. Without any automatic system for detecting signals that may benefit from CDT additions to the masking threshold calculation, all that can be stated is that it seems likely that the CDT phenomenon will only be relevant for a very small percentage of audio signals.

Secondly, the temporal response of the distortion tone has not been studied here, but as with all auditory phenomena, the steady state response can only yield an approximate indication of the instantaneous response to a sound.

Thirdly, it should be noted that third order distortion is not confined to the ear. Even high-quality transducers are non-linear devices, and can add a considerable amount of distortion, especially at high amplitude levels. In our tests, the authors found that the CDT produced by sending both primary tones through one loudspeaker could often be greater than the CDT due to the human auditory system. Especially at levels in excess of 80 dB, when L2 was 10-30 dB lower than L1, considerable amounts of CDT were audible, well outside the range of audibility predicted by our formulae. It is possible that non-linear equipment, in addition to the non-linear human ear, may unmask certain spectral components, and account for differences that we hear between original and coded audio extracts.

Finally, it may have occurred to the reader that the effect of the CDT on the masking threshold may be more simply modelled by lowering the masking threshold slope to match that implied by the CDT (i.e. the lower boundary of the shaded area in Figure A.9). This raises the question: what exactly does the predicted masked threshold measure, and what is the definition of the true masking threshold?

The masking threshold predicted by most audio codecs matches the internal excitation due to the masker, shown in Figure A.11. [Robinson and Hawksford, 1999] provides a full description of the auditory process that gives rise to this excitation pattern, but in simplified terms, the filter-bank within the human auditory system has a sharp cut off above the target frequency, but a shallower cut off below it, which tends to –40 dB rather than –infinity. Thus lower frequencies leak into the higher frequency bands, and the excitation due to any spectral component will extend to higher frequencies, causing the well-known upward spread of masking. For noise-like signals, this excitation pattern matches the masking threshold, but for tone-like signals, as has been shown, there is a discrepancy between the known excitation pattern and the known threshold of masking.

Figure A.11: Excitation within human auditory system due to 1 kHz tone

Another problem in calculating the masking threshold is one of definition. Is a tone masked when the tone itself is inaudible, or when all effects due to the tone are inaudible? There is a difference between these two thresholds, since the CDT (and beats between the masker and the possibly masked tone) is audible even when the masked tone is not. These two thresholds are determined by different experimental conditions: the first by instructing the listener to concentrate on the possibly masked tone; the second by instructing the listener to listen for any difference in the sound produced by the presence of the possibly masked tone. Surely the second type of test is relevant to psychoacoustic codec design, since the aim is to make no audible difference to the signal. However, codecs are often designed using data from the first type of test.


Figure A.12: Masking threshold of human subject [Moore et al, 1998], compared with the MPEG-1 layer 1 and Johnston model predictions, the region of audible CDTs, and the MAF, for a 1 kHz tone @ 85 dB.

A.4.1.2 Effect of Beats

Figure A.12 shows the human masking thresholds measured using a test of the second type [Moore et al, 1998], overlaid on the threshold predictions of our codecs, and the region of audible CDTs. It is evident that the codecs are inaccurate by up to 20 dB, and also that the CDT does not account for the whole discrepancy. It must be stressed that, as explained in Section A.2.3, our calculated CDT may be up to 15 dB different from the internal perceived CDT level, and this may account for some of the discrepancy (though this is unlikely, as the discrepancy is largely in the opposing direction – see [Giguère et al, 1997]). However, [Moore et al, 1998] suggests that the lower threshold is due to beating between the second (masked) tone and the 1 kHz tone, and also to beating between the CDT and the 1 kHz tone. These two sets of beats fall at the same frequency since, with f_{CDT} = 2f_1 - f_2,

|f_1 - f_2| = |f_{CDT} - f_1| \qquad (A-7)


Hence they reinforce each other, lowering the threshold to the measured values. Methods and equations to predict such interactions are beyond the scope of this paper, but it is suggested that, rather than attempting to fit curves to steady state data, a complete auditory model may be more effective. By modelling the processes within the auditory system that give rise to measurable masking, a more accurate threshold of masking can be calculated than by extrapolating from simple steady-state tone or noise masking measurements. If this is the distant future of transparent audio coding at ever lower bit-rates, then adapting the masking curves that are built in to existing audio codecs to more closely match the measured data can only be a short-term solution. However, an accurate auditory processing model will be prohibitively computationally burdensome in comparison to existing codecs, and the quality/bit-rate gains may or may not be significant.

A.4.2 Masking due to CDT

Figure A.13: Extra region of masking (shaded) provided by the CDT


If two tones create a third (distortion) tone in the human auditory system, this tone will also have its own region of masking. Any spectral components falling below this CDT masking threshold will be inaudible, and hence may be ignored.

An audible CDT will be generated if the two primary spectral components have the same amplitude. However, much of the masking due to this extra distortion tone will coincide with the masking due to f1. For 1 kHz and 1.2 kHz tones at 85 dB, this will alter the lower masking threshold slope by 2 dB for the Johnston model, and 3 dB for the MPEG-1 psychoacoustic model.

A more significant effect occurs if the amplitude of f2 is larger than that of f1. Consider a 1 kHz tone at 70 dB, and a 1.2 kHz tone at 90 dB. Equations (A-1)-(A-5) indicate that these two tones will give a distortion tone of 42 dB at 800 Hz. The MPEG-1 psychoacoustic model is used to predict the masking due to the two primaries, and also the masking due to the two primaries plus the CDT. The difference between the two masking threshold curves indicates that the CDT increases the masking threshold around 800 Hz by 15 dB (see Figure A.13). As the cancellation-tone measured CDT level accounts for the masking due to the CDT to within 2 dB (see Section A.2.3), this is a significant result.

A.4.2.1 Effect of Beats

In the previous section, it was noted that beats also have a role to play in the unmasking of spectral components. However, the beats themselves cannot be used to mask other spectral components. Whereas the CDT causes an excitation within the auditory system at the frequency associated with it, there is no such excitation at the beat frequency, hence no masking will occur². We perceive the CDT by detecting the resulting frequency component in the same manner as we do any external frequency. However, beats are detected by the amplitude modulation that occurs within the auditory filter mid-way between the two primary frequencies, which is generally not tuned to the actual beat frequency. For example, a 1 kHz and 1.1 kHz tone will beat at 100 Hz, but this beating will be detected by the auditory filter centred at 1.05 kHz; the auditory filter at 100 Hz will not be excited, hence there will be no resultant masking³.

² The amplitude modulation of the beats may hinder the detection of another simultaneous amplitude modulation, but this is beyond the scope of this paper.

³ If the two frequencies are sufficiently separated, such that no auditory filter detects both, then we fail to perceive any beats.

A.4.2.2 Real World Applications

In Section A.4.1.1, the rarity of audio signals that may be more accurately processed by taking the CDT into account was discussed. These comments are also true here. However, as the region over which the CDT affects the masking threshold is larger for this second phenomenon, it should find slightly wider use.

One important feature is that the CDT depends on the levels of both f1 and f2 in a complex manner, as shown by equation (A-2). For the CDT to provide a useful masked area in which to hide quantisation distortion, the level of the CDT must scale with the levels of f1 and f2 in a roughly linear manner. Otherwise, decreasing the gain of the replay system may cause the CDT level to decrease more rapidly than the quantisation noise that it is hiding, hence unmasking it.

Figure A.14 shows that, as the gain is reduced (i.e. L1 and L2 are reduced by equal amounts), the level of the CDT is reduced correspondingly. Thus any quantisation distortion masked by the CDT will not be unmasked as the replay level is altered. Hence, the extra masking produced by the CDT is shown to be a useful region in which to hide quantisation noise.
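This scaling can be checked numerically with the cdt_level sketch given in Section A.3 (assuming that function is in scope); the 20 dB offset between the primaries follows Figure A.14, and the exact values depend on the formulae as printed.

    # Sweep overall gain with L2 held 20 dB above L1, as in Figure A.14,
    # using cdt_level() from the Section A.3 sketch. Illustrative only.
    for l1 in range(30, 90, 10):
        print(l1, cdt_level(1000.0, 1200.0, float(l1), float(l1) + 20.0))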



Figure A.14: Showing how LCDT changes linearly as L1 and L2 are varied together (L1 = L2 − 20 dB).

A.5 Conclusion

In this paper, it has been shown that distortion tones generated by non-linearities within the human auditory system may affect the masking thresholds of pure tones. The cubic distortion tone (CDT) was identified as the most audible distortion product, and formulae were presented to calculate its frequency and amplitude.

The masking threshold predictions of two auditory models were shown to be misleading. Over a small frequency range, tones that were up to 15 dB below the predicted masking threshold were found to generate audible CDTs. It was suggested that a true definition of a masked tone is that its presence makes no audible difference to the signal; thus the CDT can be said to unmask tones by up to 15 dB, though the level of CDTs is dependent on measurement method. However, the presence of beats may unmask tones by even larger amounts.

It has also been shown that the presence of the CDT can raise masked thresholds around the CDT frequency by up to 15 dB. This masking is independent of replay level, so may be utilised by an audio codec to conceal quantisation distortion.


The overall effect of the cubic distortion tone on masking thresholds is found to be small. Both effects discussed here are most prominent for tonal signals, and so may find a limited application in audio coding. It is suggested that, where perfectly transparent coding is required at the minimum possible bitrate, a model of the non-linear processing within the human auditory system may be used to accurately predict masking thresholds. Such an approach will be computationally burdensome, and may or may not yield an improved quality/bit-rate ratio. In the interim, prediction of the CDT using the methods outlined in this paper may be applied to existing audio codecs.

B

Masking noise

Throughout this work, it is assumed that the threshold condition within a masking experiment gives rise to some just detectable internal difference.

This implies that if a subject auditions the masker in isolation, and then the masker plus target, the only audible difference between the two signals is, logically, the target. Further, the presence of the target must cause some difference within the internal representation of the signals, and this difference corresponds to a “just noticeable difference”.

It is suggested that the internal difference between the masker only and the masker plus target must be comparable to the internal difference between no sound and a sound at the absolute threshold of hearing. This is discussed extensively in [Zwicker, 1990], where it is demonstrated that this is largely true.

Unfortunately, this ignores one common feature of psychoacoustic experiments. Where a noise masker or target is employed, this noise is usually produced by an analogue noise generator. As such, the masking noise “sample” will be different for each presentation. This means that the difference between the masker, and the masker plus target, will not only be due to the target, but also the variability of the random noise. Thus, for the target to be audible, it must not only exceed some internal threshold (assumed to be due to internal noise) but it must also exceed the variation from noise sample to noise sample.


Where the same noise sample is used for each presentation, it is said to be “static” or “frozen”. Such stimuli are rarely used in psychoacoustic tests, though they are becoming more common with the widespread use of PCs for signal generation. Within the present work, all noise stimuli are frozen.

The difference in threshold due to frozen noise maskers and random noise maskers is usually no more than 3 dB, due to the power addition of noise. This 3 dB offset is assumed to be unimportant within the present work, since it can easily be corrected for. Further, the variation of threshold between two listeners is usually greater than 3 dB, so it can be said that this factor is insignificant.
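As a one-line illustration of this power addition: combining two uncorrelated noise signals of equal power P yields a total power of 2P, a level increase of 10 log10(2) ≈ 3 dB, which is the origin of the 3 dB figure quoted here.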

In one test, the use of frozen noise was found to have a greater impact on the threshold. The experimental determination of the minimum audible gap in a burst of noise is used to calibrate the monophonic model. The quoted threshold value is a gap of 3 ms [Penner, 1977]. The task is to compare two bursts of noise, and to determine which burst contains the gap. For random noise, the threshold of 3 ms is correct. However, for frozen noise, the threshold is closer to 2 ms, and may be slightly less. Thus, because the noise does not change from presentation to presentation, it is significantly easier to detect the gap.

The reason that frozen maskers are used throughout the present work is as follows. The original version of an audio extract is always the same. It does not change from one listening session to another. Thus, a listener may learn the features of this extract, even if it is “random” noise, since the noise will sound the same every time the extract is auditioned. If the model were made insensitive to changes in random noise, then the audio codec could generate a new different extract of noise, and the model would not detect this. However, if this noise formed an important part of the audio extract, the listener would detect the change, so it is inappropriate to configure the model to ignore it.

C

Monophonic model calibration data

The table on the following page contains the calibration data for the monophonic model, and the references from which this data is taken.

Reference | Task | Masker | Target | Delay | Target Threshold

[Penner, 1977] | Gap detection | White noise, 20 Hz – 20 kHz, 87 dB SPL (44 dB SL) | Silence (gap) | - | 3 ms¹

[Reed and Bilger, 1973] and [Green et al, 1959] | Noise masking tone | White noise, 0 Hz – fs/2, 78 dB SPL (35 dB SL) | 1 kHz tone | - | 45 dB

[Gehr and Sommers, 1999] | Temporal pre-masking (backwards masking) | White noise, LPF @ 8.5 kHz, 80 dB SPL (41 dB SL), 50 ms duration | 500 Hz tone, 10 ms duration | 1 ms / 2 ms / 4 ms / 6 ms / 8 ms / 10 ms / 20 ms | 39 dB / 36 dB / 33 dB / 31 dB / 27 dB / 27 dB / 24 dB

[Zwicker, 1984] | Temporal post-masking (forwards masking) | White noise, 20 Hz – 20 kHz, 80 dB SPL, 200 ms duration | 2 kHz tone, 5 ms duration | 0 ms / 5 ms / 10 ms / 20 ms / 50 ms / 100 ms | 75 dB / 66 dB / 56 dB / 43 dB / 31 dB / 23 dB

[Moore and Alcántara, 1998] | Tone masking noise | 1 kHz tone, 85 dB SPL | Noise band, 1 kHz centred, 80 Hz wide | - | 45 dB SL (64 dB SPL)

- | Intensity discrimination | 1 kHz tone, 60 dB SPL | 1 kHz tone, 61 dB SPL | presented sequentially | 1 dB intensity difference

¹ Penner uses gated continuous random noise, such that the presentation without the gap uses a different noise sample to that with the gap. In the current scenario, the two presentations use the same noise sample. This reduces the threshold to 2 ms. For further discussion of this issue, see the previous appendix.

D

Spatial masking reference data

Tonal signals shall be referred to in dB SPL. This is a measure of the energy in the signal. For a pure sine wave, this is equivalent to the peak excursion divided by root two. For a single sine wave it is also equivalent to the spectrum level.

Noise signals shall be referred to in dB SPL / Hz, known as the spectrum level, SL or N0. This is not the total energy of the signal, but a measure of energy per unit bandwidth. If the bandwidth of the noise is known, then energy and spectrum level are related by the following formulae:

\mathrm{energy} = 10 \log_{10}\!\left(10^{\,\mathrm{spectrum\ level}/10} \times \mathrm{bandwidth}\right) \qquad (D-1)

\mathrm{spectrum\ level} = 10 \log_{10}\!\left(10^{\,\mathrm{energy}/10} \div \mathrm{bandwidth}\right) \qquad (D-2)

If the signal is spectrally white, then it is assumed to have a bandwidth of 20 kHz, unless otherwise stated in the literature.
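Equations (D-1) and (D-2) amount to one-line conversions. The following Python sketch is illustrative (the function names are assumptions), using the 20 kHz default bandwidth for spectrally white signals mentioned above; the example reproduces the 80 dB SPL to 61 dB SPL / Hz reference figure used later in Experiment 3.

    import math

    def spectrum_level_to_energy(sl_db, bandwidth_hz=20000.0):
        # Equation (D-1): spectrum level in dB SPL / Hz to energy in dB SPL.
        return 10.0 * math.log10((10.0 ** (sl_db / 10.0)) * bandwidth_hz)

    def energy_to_spectrum_level(energy_db, bandwidth_hz=20000.0):
        # Equation (D-2): energy in dB SPL to spectrum level in dB SPL / Hz.
        return 10.0 * math.log10((10.0 ** (energy_db / 10.0)) / bandwidth_hz)

    # An 80 dB SPL (energy) signal confined to an 80 Hz wide band has a
    # spectrum level of about 61 dB SPL / Hz.
    print(round(energy_to_spectrum_level(80.0, bandwidth_hz=80.0), 1))  # 61.0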


A time averaged spectral view of a tonal signal, frequency f Hz, energy x dB SPL, would show a single peak at f Hz of x dB. A time averaged spectral view of a white noise signal band-limited 0-20 kHz, energy x dB SPL / Hz, would show a flat line from 0-20 kHz at x dB. This may seem obvious, but by referring to spectrally flat band limited noise signals in dB SPL / Hz, an insight is gained into the relative amplitudes of target and masker which is not apparent when quoting the noise power in dB SPL energy (V²rms).

To compare data from different experiments, the masking will be quantified as target level minus masker level, where the target and masker levels have units of dB SPL for tone or dB SPL / Hz for noise. In the literature, this measure is sometimes referred to as S/N0 for noise masking noise, and E/N0 for tone masking noise. A positive value indicates that the target must be louder than the masker to be audible (typical in noise masking tone tasks); a negative value indicates that the target is audible when quieter than the masker (typical in tone masking noise tasks).

In the following tables, results from the spatial masking experiment in Chapter 6 are quoted first; results from the literature are quoted second. The comparable results are highlighted in bold type. Where the results from the literature may not be directly comparable with those from the present experiment, the differences are highlighted, and their importance is discussed.


D.1 Experiment 1

Masker: 35 dB / Hz band-limited (20 Hz – 20 kHz) white noise

Target: 1 kHz tone

Results from the spatial masking experiment in Chapter 6:

thresholds for 1 kHz tone in dB SPL = WS:52.6, DR:45.7, SS:53.4

diff = SPL (target tone) - Spectrum Level (noise masker) = WS:17.6, DR:10.7, SS:18.4

Results from the literature:

diff (quoted as E/N0 in the literature) = [Reed & Bilger, 1973]: 10.5, [Green et al, 1959]: 10

D.2 Experiment 2

Masker: 35 dB / Hz band-limited (20 Hz – 20 kHz) white noise

Target: 8 kHz tone

Results from the spatial masking experiment in Chapter 6:

thresholds for 8 kHz tone in dB SPL = WS:59.9, DR:54.3, KF:58.3

diff = SPL (target tone) - Spectrum Level (noise masker) = WS:24.9, DR:19.3, KF:23.3

Results from the literature:

diff (quoted as E/N0 in the literature) = [Reed & Bilger, 1973]: 18.2


D.3 Experiment 3

Masker: 80 dB SPL 1 kHz tone

Target: 80 Hz wide, 1 kHz centred noise.

reference level: 80 dB SPL (energy) = 61 dB SPL / Hz (SL)

Results from the spatial masking experiment in Chapter 6:

just audible noise in dB SPL (energy) = WS:61.5, DR:55.8, SS:54.5

just audible noise in dB SPL / Hz (SL) = WS:42.5, DR:36.8, SS:35.5

diff = SL (noise target) – SPL (tone masker) = WS:-37.5, DR:-43.5, SS:-44.5

Results from the literature [Moore et al, 1998]:

masking = JA:54, BM:60, HF:56

threshold of 1 kHz tone in silence = JA:11, BM:3, HF1:8

just audible noise in dB SPL (energy) = masking + threshold = JA:65, BM:63, HF:64

just audible noise in dB SPL / Hz (SL) = JA:46, BM:44, HF:45

Masker: 85 dB SPL 1 kHz tone (5 dB louder than above)

diff = SL (noise target) – SPL (tone masker) = JA: -39; BM:-41; HF: -40

This table is discussed below.

1 There is a typing error in the original document. Throughout the text, and in the caption to Figure 3, one listener is referred to as “HF”. However, Table 1 (absolute thresholds) contains no data for listener HF, but does contain data for listener “HV”. Since the text directly refers to listener HF as being listed in Table 1, and since the letters F and V are adjacent on a QWERTY keyboard, the author has assumed the data in Table 1 listed under HV correspond to listener HF.


In this comparison, the two experiments are not entirely equivalent. [Moore et al, 1998] is different from the present work in the following aspects:

♦ All levels were measured at the ear drum.
¾ This difference will cancel when using narrow band stimuli.
♦ An 85 dB masker was employed (5 dB louder than in the present experiment).
¾ Since the difference between the masker and target is the figure under scrutiny, this will cancel if the amount of masking scales linearly with amplitude. This is not true over a very wide amplitude range. However, over a range of 5 dB, the relationship between masker and target is quasi-linear, and the non-linearity may be neglected. This is demonstrated in [Reed and Bilger, 1973].
♦ Different noise stimuli were employed in each presentation (in the present experiment, a frozen noise masker was employed).
¾ This may be significant, and is probably the cause of the lower thresholds obtained in the present experiment. See Appendix B for further details.

Finally, the use of the threshold tone figure as a basis for calculating the level of the target is suspect, since the target is a noise band, not a pure tone.


D.4 Experiment 4

Masker: 80 dB SPL 1.5 kHz tone

Target: 80 Hz wide, 1 kHz centred noise.

Reference level: 80 dB SPL (energy) = 61 dB SPL / Hz (SL)

Results from the spatial masking experiment in Chapter 6:

just audible noise in dB SPL (energy) = WS:21.7, DR:10.9, SH:14.7

just audible noise in dB SPL / Hz (SL) = WS:2.7, DR:-8.1, SH:-4.3

diff = SL (noise target) – SPL (tone masker) = WS:-77.3, DR:-88.1, SH:-84.3

Results from the literature [Moore et al, 1998]:

masking = JA:10, BM:10, HF:15

threshold of 1 kHz tone in silence = JA:11, BM:3, HF:8

just audible noise in dB SPL (energy) = JA:24, BM:14, HF:22

just audible noise in dB SPL / Hz (SL) = JA:5, BM:-5, HF:3

masker = 85 dB SPL 1 kHz tone (5 dB louder than above)

diff = SL (noise target) – SPL (tone masker) = JA:-80, BM:-90, HF:-82

In this comparison, the two experiments are not entirely equivalent. [Moore et al, 1998] is different from the present work, as discussed in the previous section. In addition, [Moore et al, 1998] use a 600 Hz noise target with 1 kHz tone masker, as opposed to the 1 kHz noise target with 1.5 kHz tone masker employed in the present work. This is the closest experiment in the literature, but does not match the present work perfectly. This is a problem, because the amount of masking changes with frequency in a non-linear manner. All that can be said is that the two sets of figures are somewhat comparable. The similarity of the results indicates that there was no major error in the present experiment.


D.5 Experiment 5

Masker: White noise

80 dB SPL (energy) = 37 dB SPL / Hz (SL)

Target: 2 kHz tone burst

Results from the spatial masking experiment in Chapter 6:

just audible tone in dB SPL = WS:53.8, DR:51.2, AR:57.5

diff = SPL (tone target) – SL (noise masker) = WS:16.8, DR:14.2, AR:20.5

Results from the literature [Zwicker, 1984]:

Just audible tone in dB SPL (median, ± one quartile) = 56, 60, 52

diff = SPL (tone target) – SL (noise masker) = 19, 23, 15

D.6 Conclusion

The threshold values for coincident targets and maskers from the spatial masking experiment match data from the literature.

E

Why lateralisation experiments are not predicted by the model

Possibly the most common binaural hearing experiment involves the comparison of ITD and IID cues, in what is called a “trading experiment”.

When listening over headphones, if an identical stimulus is fed to each ear, then the sound source will be perceived as originating from the centre of the listener’s head. Introducing an interaural time delay will displace the perceived location of the sound towards the ear that first receives the sound. Also, an interaural level difference will displace the perceived location of the sound towards the ear which receives the loudest signal.

Psychoacousticians have been fascinated by the relationship between these two phenomena. In particular, they have tried to measure the effective equivalence of these two interaural quantities in the form of trading experiments. In such an experiment, a fixed interaural time delay is added to the stimulus, and the listener must determine the correct, opposing interaural level difference which will bring the perceived location of the sound source back to the centre of their head. The ILD is adjusted by the candidate, and when this has been achieved, the opposing ITD and ILD are said to be equivalent.

It was first hypothesised that the ITD and ILD were traded within the auditory system in the manner suggested by this experiment. That is, the ITD and ILD were combined, and a single localisation position was fed to the brain. Many models include just such a mechanism to combine the two cues into a single lateral position. (See [Colburn and Durlach, 1978] for a review, especially section II: Count comparison models.)

There are several strong indicators that the auditory system does not work in such a simplistic manner. Firstly, in trading experiments, many candidates cannot “bring the source back to the centre of their heads” with an opposing ILD – rather, the source splits into two, or becomes diffuse. If two opposing cues can cause the perception of two separate sources, it is unlikely that the two cues are unified into a single lateral position before they are “perceived”. Secondly, even moderate ITDs cancelled by small ILDs do not sound the same as the original monophonic signal. Finally, it is quite likely that the comparison of ITD and ILD acts as a distance cue for sound sources near the head. This is possible because a source travelling along a constant bearing gives rise to a constant ITD, whereas the ILD increases dramatically as the source nears the listener’s head. It would be sensible for the auditory system to use this property as a distance cue, and there is evidence that this is exactly what occurs [Brungart et al, 1999]. If this is the case, then at least some part of the auditory system must have access to the locations implied by both the ITD and the ILD, and not just a combined version of them. It seems that the traded ITD and ILD values which give rise to lateralisation are the exception, rather than the norm, and may represent a “collapsed” special case of the distance detection mechanism.

The purpose of this discussion is to justify the lack of an ITD / ILD trade mechanism within the model, and the inability of the model to predict the performance of human subjects in trading experiments. As mentioned in the main text, the model as it stands does not predict the perceived location of a sound source, but it can correctly detect changes in the perceived location. The mechanism that detects the just noticeable change in source bearing is the same mechanism that detects a spatially separate target in the presence of a masker. Both these events can be thought of as generating blips on the “binaural radar”. In reality, they generate slight interaural phase shifts that are detected by the internal correlation array.

The model predicts that a traded ITD and ILD sounds different from a monophonic signal because these two cues are not equivalent within the auditory system. Though a human listener will perceive both signals as being located at the centre of their head, the two signals will not “sound” the same. The traded sound will have an entirely different character from the monophonic sound – one or the other will sound distinctly more “natural”. Whilst it would be possible to extend the model to trade ITD and ILD to make the two sounds equivalent, this would make the model deaf to any similar unnatural localisation characteristics which may be introduced by an audio codec. An accurate perception would be given by the addition of a full localisation device to the model, so that the existing binaural processor would indicate that the sound had changed in some way, while the localisation processor would indicate that the sound was still located at the centre of the listener’s head. This is discussed in Chapter 9.

F

Required number of binaural channels

It can be argued that the auditory system uses hundreds of overlapping channels of correlation, rather than the 31 channels of the binaural model described in Chapter 7. Consequently, the auditory system contains no interpolating process comparable to the oversampling stage of the binaural model, since sufficient data is already present within the HAS. It is reasonable to question why this approach is not used within the present model, since the design principle of the model is to match known physiology as closely as possible.

The answer is that this approach could be used within the model, but it brings no functional advantage, and one severe disadvantage. The disadvantage is one of computational burden: processing hundreds of channels rather than 31 will increase the computation time by an order of magnitude.

To prove that this approach would bring no advantage, it is necessary to examine sampling theory. It is known that a continuous function may be represented by a series of discrete samples. The continuous function may be exactly recreated from these samples (i.e. the samples contain all the information that is present in the original function) if there are no frequency components higher than fs/2 within the original continuous function, where fs is the sampling frequency, and 1/fs is the spacing between sample points.

A signal with a period of 10 µs would have a frequency of 100000 Hz, or 100 kHz. A sampling frequency in excess of 200 kHz would be required to represent such a signal. However, a 10 µs delay between two 1 kHz signals is preserved within the sampled representation of the two signals, even if the sampling frequency is only 5 kHz. In the time domain, the 10 µs delay can be verified, even though the sample points are separated by 200 µs.
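This can be demonstrated numerically. The following MATLAB fragment is an illustration only (the phase-projection method and all variable names are assumptions, not part of the binaural model code): it recovers a 10 µs delay between two 1 kHz tones from samples taken at only 5 kHz.

% Illustration: a 10 us delay between two 1 kHz tones is recovered from
% 5 kHz samples by comparing the phase of each tone at the stimulus frequency.
fs    = 5000;                   % sampling rate / Hz
f     = 1000;                   % tone frequency / Hz
itd   = 10e-6;                  % delay to be recovered / s
t     = (0:fs-1)'/fs;           % one second of sample instants
left  = sin(2*pi*f*t);
right = sin(2*pi*f*(t - itd));

ref       = exp(-1i*2*pi*f*t);  % project each tone onto a 1 kHz phasor
phi_left  = angle(sum(left  .* ref));
phi_right = angle(sum(right .* ref));

itd_est = (phi_left - phi_right) / (2*pi*f);
fprintf('estimated delay = %.2f microseconds\n', itd_est*1e6);   % ~10.00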

Likewise, within the binaural model, the ITD is represented by the position of the peak within the time-delay domain. This peak will often fall between sample points, but it is correctly represented by those sample points, so long as no attempt is made to store frequency components in excess of half the sampling rate.

If a rectangular correlation window were used in the model, then frequency components higher than half the sampling rate in the time-delay domain could have been sampled. However, the gaussian windowing of the time-delay domain effectively low passes the data in this domain before it is (re)sampled. From sampling theory, this ensures that the channels contain all the information possible after the low pass filter. The addition of any further channels would not increase the accuracy of the information; instead, the extra channels would provide redundant information, the presence of which would simply slow the subsequent processing.

This raises a further question: is it appropriate to use a gaussian window function to low pass filter the time-delay domain signals? This is discussed in Section 7.4.3. The conclusion is that the rectangular correlation window of Colburn’s model is effectively smoothed by the internal noise. The equivalent performance can be achieved in a model without internal noise by gaussian windowing of the EEVM output.

For these reasons, 31 gaussian windowed channels, subsequently oversampled, are equivalent to many hundreds of channels without oversampling.

The economies of storage and computation achieved by the use of oversampling should not be underestimated. The oversampled data is immediately cleared after the peak has been determined, before the next sample is processed. This saves an immense amount of storage space. Gaussian windowing 31 channels, rather than many hundreds, reduces the computational burden considerably.

G

Confidence value

The following figures (from [Zwislocki and Feldman, 1956]) demonstrate the manner in which the just detectable change in interaural time delay increases as the level of the stimulus decreases.

level / dB    phase difference / °    ITD / µs
10            12.5                    34.7
30            7.5                     20.8
50            5.5                     15.3
70            4.0                     11.1
90            4.5                     12.5
110           5.5                     15.3

Table G.1: level and minimum audible phase difference, converted to ITD.
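The ITD column follows directly from the phase difference: ITD = (phase / 360) × (1 / f). The figures are consistent with a 1 kHz stimulus (an assumption here, since the tone frequency is not restated in this appendix); for example, 12.5° gives (12.5 / 360) × 1000 µs ≈ 34.7 µs. In MATLAB:

% Conversion behind Table G.1, assuming a 1 kHz stimulus.
f         = 1000;                          % stimulus frequency / Hz
phase_deg = [12.5 7.5 5.5 4.0 4.5 5.5];    % minimum audible phase difference
itd_us    = (phase_deg/360) * (1e6/f);     % 34.7 20.8 15.3 11.1 12.5 15.3
disp(itd_us)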

Stimuli reflecting the above conditions were processed through the model. The confidence values required to match these thresholds were determined. A psychometric function was adjusted to match these values.

A psychometric function is usually used to describe the probability of detecting the target signal in a masking experiment. The probability of detection p(x) is given by


p(x) = 1 / (1 + e^(-m(x-T)))     (G-1)

where x is the target amplitude, T is the amplitude at which the target has a 50% chance of detection, and m is the slope parameter defining the spread of the psychometric function. This equation is calculated by integrating the gaussian distribution function. See [King-Smith and Rose, 1997] for further discussion of the psychometric function equation.
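As a minimal MATLAB sketch of equation (G-1), with hypothetical values for the threshold T and slope m:

% Psychometric function of equation (G-1); the T and m values are hypothetical.
p = @(x, T, m) 1 ./ (1 + exp(-m.*(x - T)));

x = 0:0.1:20;                  % target amplitude (arbitrary units)
plot(x, p(x, 10, 0.8));        % p(10) = 0.5 at the assumed threshold T = 10
xlabel('target amplitude x');
ylabel('probability of detection p(x)');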

A second psychometric function is used to reduce the confidence value to zero as the threshold falls to zero.

H

High quality listening test

H.1 Overview

This test is described in [Van den Berghe, WEB-1]. The results of this test are included here by permission of Roel Van den Berghe (r3mix). 42 listeners took part; 14 failed to differentiate the codecs, and a further 13 scored the hidden reference lower than one of the other codecs. The remaining 15 listeners are categorised as “reliable”. Removing the unreliable listeners does not change the ranking of the codecs, but only the range of the results. For this reason, the results of all listeners are included in the data used in Chapter 8, which is calculated by summing all the scores on a codec by codec basis.

H.2 Procedure

Eight different high-quality mp3 codec settings are compared. The purpose of the test is to determine which combination of psychoacoustic model, bit allocation scheme, and bitrate offers the highest quality in an open source mp3 encoder [Taylor, WEB]. The large number of listeners who failed to differentiate between the various settings and the hidden reference is an indication of the high quality of the codec. The lowest quality sample is encoded at 192 kbps constant bitrate. The hidden reference is coded by the MPEGplus encoder [Buschmann, WEB], tuned to provide near lossless coding at 740 kbps. This near-lossless codec is employed as a hidden reference (instead of the original signal) to prevent listeners from discovering the hidden reference by waveform subtraction. The scores of listeners who mis-scored this codec are not discarded, so the hidden reference has indicative status only. Thus, any arguments about the applicability of an (ultra high quality) coded extract being used as the hidden reference are void.


The stimulus consists of a selection of audio extracts. All listeners who submitted reliable results commented that the solo hi-hat extract suffers significantly greater audible degradation than any of the other samples. For this reason, only the hi-hat extract is auditioned by the model. Many listeners report audible pre-echo problems, or temporal smearing. This is a critical test for the model, as a previous version of the monophonic detector (Appendix J) performed poorly at signal onsets, where such distortion occurs.

Each listener graded each extract using the 5-point continuous grading scale described in Chapter 2. Most listeners used a single decimal place, and most of the reliable listeners commented in their feedback that they had used ABX testing to ensure that any differences they perceived were not imagined.

The decoded audio extracts employed in the test are included on the accompanying CD-ROM. Thus, the test is verifiable and repeatable. The reader may wish to audition the audio extracts to gauge the magnitude of the audible errors that are presented within this subjective test.

H.3 Results

clip ref:      0a     24     35     41     61     a1     b1     c3
settings:      MPEG+ cbr192 abr224 insane cbr256 3.90a7 MS MJ -nmt99 --r3mix dm-xtr dm-ins nspsytune dm-std mode: GPSY- nspsytune -tns99 nssafejoint CHO nssafejoint lowpass 19.5
bitrate/kbps:  192    221.3  199.6  231.8  740    272.8  256    221.9
result 01:     4.0    4.5    4.1    4.5    5.0    4.0    4.0    4.3
result 02:     3.7    3.9    4.0    4.2    3.8    3.5    3.2    3.7
result 03:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 04:     4.0    4.0    4.0    4.0    5.0    5.0    5.0    4.0
result 05:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 06:     4.5    4.7    4.7    4.7    5.0    4.7    4.7    4.7
result 07:     3.5    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 08:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 09:     3.3    4.0    3.3    3.0    5.0    3.2    3.8    3.5
result 10:     4.0    4.4    4.5    4.2    4.7    4.3    4.6    4.1
result 11:     3.7    4.0    4.5    4.5    4.5    3.7    3.7    5.0
result 12:     4.0    4.0    4.0    4.0    4.0    4.0    4.0    4.0
result 13:     4.4    5.0    4.5    5.0    4.9    5.0    4.6    5.0
result 14:     4.0    3.5    3.0    3.5    4.0    3.5    3.5    3.5
result 15:     2.5    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 16:     3.0    2.0    2.5    3.0    4.0    4.0    3.5    5.0


result 17:     3.5    4.0    4.7    5.0    5.0    5.0    4.9    5.0
result 18:     3.5    4.0    3.5    5.0    4.0    3.0    3.0    4.0
result 19:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 20:     4.0    4.2    4.3    4.7    4.6    4.1    4.4    4.5
result 21:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 22:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 23:     4.2    3.8    3.8    4.0    4.0    4.5    4.3    4.2
result 24:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 25:     4.8    4.8    4.8    4.8    4.8    5.0    4.8    4.8
result 26:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 27:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 28:     5.0    4.9    5.0    5.0    4.9    5.0    5.0    4.9
result 29:     4.9    5.0    4.9    4.9    4.9    5.0    5.0    4.9
result 30:     4.0    4.0    4.0    4.0    5.0    5.0    4.0    5.0
result 31:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 32:     4.6    4.8    4.6    4.6    4.4    4.6    4.6    4.6
result 33:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 34:     4.5    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 35:     4.5    4.2    5.0    5.0    5.0    4.4    4.2    5.0
result 36:     5.0    4.0    4.0    4.0    5.0    4.0    4.0    4.0
result 37:     4.3    5.0    4.8    4.8    5.0    4.7    5.0    4.7
result 38:     4.9    4.5    4.6    5.0    4.9    4.9    4.9    4.5
result 39:     5.0    4.8    4.6    5.0    5.0    4.5    4.5    4.7
result 40:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0
result 41:     3.4    4.2    3.5    4.4    3.9    3.7    4.0    4.5
result 42:     5.0    5.0    5.0    5.0    5.0    5.0    5.0    5.0

total (/210):  183.7  190.4  189.2  194.6  200.3  192.3  191.2  196.1

Table H-1: Results from the high quality audio test

H.4 Discussion

Post-screening of results is desirable (as discussed in [ITU-R BS.1116, 1997]). In this test the hidden reference reveals that some listeners are simply guessing. Interestingly, removing these listeners from the test does not change the ranking of the codecs, or even the spacing significantly. This suggests that, over the number of listeners who took part (42), the random guesses from listeners who cannot accurately detect any difference between the original and coded samples have cancelled. For this reason, and because the hidden reference is not identical to the original, no filtering of results is carried out. The totals at the foot of the above table are used in Chapter 8.

Several statistical analyses have been attempted using the above data, to try to define the significance of the differences between the eight codec settings. At 95% confidence, cbr192 is worse than the others and MPC is better, but no other results are significant at the 95% confidence level. Reducing the confidence level to 68% reveals further trends, but statistically these differences between codecs are less significant.

I

Medium quality listening test

I.1 Overview

This test is described in [Miyaguchi, WEB]. The results of this test are included here by permission of Darryl Miyaguchi (ff123). 16 listeners took part, and one failed to differentiate between the codecs. No hidden reference is included in this test, but a low quality anchor is included that is significantly worse than the other codecs. One listener failed to score this codec as the worst on test. Thus, 14 listeners are classified as “reliable”. The overall codec ranking is unchanged by excluding the “unreliable” listeners, so data from all 16 listeners is averaged to produce the Mean Opinion Scores used in Chapter 8.

I.2 Procedure

Six different codecs are compared at a nominal bitrate of 128 kbps. The codecs are as follows:

The format type, encoder name and version, settings (all at 128 kbit/s), and decoder name and version for each codec are as follows.

MP3: encoded with Lame 3.89beta; settings: --abr 134 -h --nspsytune --athtype 2 --lowpass 16 --ns-bass -8; decoded with in_mp3.dll (version 2.75i), the default mp3 decoder within Winamp 2.76.

MP3: encoded with Xing, within AudioCatalyst 2.1; settings: 128 kbit/s, high frequency mode disabled, simple stereo disabled; decoded with in_mp3.dll (version 2.75i), the default mp3 decoder within Winamp 2.76.

AAC: encoded with Liquifier Pro 5.0.0 Beta 2, Build 24; settings: streaming 128, audio bandwidth set at 17995 Hz; decoded with in_lqt.dll (v. 1.055).


MPC: encoded with mppenc.exe version 1.7.9c; settings: -radio -ltq_gain 10 -tmn 12 -nmt 4.8; decoded with mppdec.exe 1.7.8c.

WMA8: encoded with Windows Media Player 7.1 (version 7.01.00.3055); settings: 128 kbit/s; decoded with Windows Media Player 7.1 (version 7.01.00.3055), wmadmoe.dll; output captured by Total Recorder version 8.0.0.0371.

Ogg Vorbis: encoded with Oggdrop RC2 for Windows 32; settings: 128 kbit/s; decoded with in_vorbis.dll: Nullsoft Vorbis Decoder v1.13c (RC1).

The Xing MPEG-1 layer III codec is the low quality anchor. This mp3 encoder is optimised for speed, rather than quality, and gives significantly poorer quality than other mp3 encoders at this bitrate.

The stimulus consists of a single audio extract from “Git Along Little Dogies” by Dave Grusin from the album Discovered Again. This extract is chosen because it is not especially difficult to code, and may give an indication of the “typical” performance of these audio codecs at 128 kbps.

Each listener graded each extract using the 5-point continuous grading scale described in Chapter 2. Most listeners used a single decimal place, and many commented in their feedback that they had used ABX testing to ensure that any differences they perceived were not imagined.

The decoded audio extracts employed in the test are included on the accompanying CD-ROM. Thus, the test is verifiable and repeatable. The reader may wish to audition the audio extracts to gauge the magnitude of the audible errors that are presented within this subjective test.

I.3 Results

Table I-1: Results from the medium quality audio test

I.4 Discussion

Due to the lack of a hidden reference in this test, a sophisticated analysis is suggested in [Miyaguchi, WEB] to determine the significance of the results. The analysis takes the form of a Friedman Non-Parametric Analysis [Meilgaard et al, 1999]. This analysis takes the implied rankings of each codec by each listener, and determines the statistical probability that each codec is audibly different from each other codec. A confidence level of 95% is chosen. The result shows that there are three groups of codecs, which are ranked in the following order:

mpc = 75.0
aac = 71.5
wma8 = 50.5
ogg = 49.5
lame = 44.5
xing = 24.0

Thus, it is not possible to state that WMA8 is better than lame at the 95% confidence level.

This analysis is interesting, but does little more than add a statistical basis to the differences that are apparent in the mean opinion scores. The averages in table I-1 are used in Chapter 8.

J

Original difference perception unit

J.1 Overview

The difference perception unit of the monophonic model described in Chapter 5 is the second version of this device. The original version was published in [Robinson and Hawksford, 1999]. This original version was oversensitive to the onset of sounds, hence the new version was developed. However, since the new version is insensitive to differences in real music signals, the old version was re-examined in Chapter 8. The following details of the original difference perception unit are taken directly from [Robinson and Hawksford, 1999]. “The model” became the monophonic model described in Chapter 5. The calibration, validation, and testing of the model with the older detector are also included in this appendix.

J.2 Perceiving a difference

So far, we have simulated the passage of sound through the ear canal and cochlea – see Figure 3.1(a-c). The output of the auditory model is a signal analogous to that transmitted along the auditory nerve. If two signals are processed independently by the auditory model, the difference between the two resulting outputs will be related to the perceived difference between the two signals. This concept is illustrated in Figure J.1. The next task is to determine the processing that is necessary to yield an accurate indication of perceived difference.

Figure J.1: General method for calculating the perceived difference between two audio signals

The outputs of the model are n time varying signals, where n is the number of gammachirp filters. Henceforth, n will be referred to as the number of bands. Calculating the perceived difference will involve comparing the signals in each band, and determining at what level these differences would become audible to a human listener. In the following sections, the value of this internal threshold level will be set, such that the “perception” of the model matches that of a real human listener.

[It is important to note the difference between the internal threshold level, set in the model, and the threshold condition of a psychoacoustic test. The former is a single number, which will determine when the model will flag an “audible” difference. The latter is the measured level, usually in dB, at which a real human listener is just able to perceive some sound in a particular situation (e.g., in noise, in the presence of another similar sound, etc.). The latter is different depending on the task we are discussing. For example, the threshold condition is 0 dB SPL for detecting a 2 kHz tone in silence, but 80 dB SPL for detecting a 2 kHz tone in the presence of 65 dB SL noise. However, if we compare two signals which are on the threshold of being audibly different from each other (e.g. the 65 dB noise in isolation, with the 65 dB noise + 80 dB tone), then the perceived difference calculated by the model should be the internal threshold level.]

The first step is to determine how the perceived difference should be calculated. If calculated incorrectly (i.e. in a manner that does not reflect the workings of the auditory system), then it will be impossible to set a single internal threshold value across a variety of tests.

J.2.1 Calculating the perceived difference by simple subtraction

One possible method of calculating the perceived difference is to simply take the difference between the two sets of outputs from the auditory model (one set for each signal under test). This will yield a time-varying calculated perceived difference (CPD) signal for each auditory band.


This attractively simple method of determining the CPD fails to match human perception. This fact can be demonstrated by using the model to simulate some tests which have previously been carried out by human listeners, and comparing the results.

J.2.1.1 Testing the validity of this approach

The threshold of perceiving a difference between two signals is known for a wide variety of possible signals, from various psychoacoustic experiments using human subjects. If the difference between two signals is greater than this threshold value, then a human listener will perceive a difference. If it is less, then the difference, though present, will be imperceptible.

The simplest differentiation experiment is that of discriminating between the level of two tones. A human listener can detect a 1 dB level difference between two tones of the same frequency, but any smaller change in level is undetectable. If, in Figure J.1, signal A is a 60 dB sine wave, and signal B is a 61 dB sine wave, then the calculated perceived difference (CPD) represents a just detectable difference, and allows us to set the internal threshold CPD. If the CPD fails to reach this value during any subsequent test, then the difference can be said to be imperceptible, whilst if the CPD exceeds this value, the model has “perceived” a difference.

As our model claims to represent part of the human auditory system, it might be expected that this threshold CPD value would be obtained at the threshold condition of any difference detection task. However, this is not the case.

Simulating the tone masking noise experiment detailed in [Moore et al, 1998], signal A is the tone in isolation, and signal B is the tone plus noise at the threshold (just audible) level. The resulting CPD peaks at twice that obtained in the previous experiment. However, the mean CPD over 1 second in both experiments is almost identical. This indicates that some form of temporal averaging is needed.

Simulating the temporal masking experiment detailed in [Zwicker, 1984], signal A is a burst of noise, and signal B is the same burst of noise followed by a short tone at the threshold level. The resulting CPD peaks at a level similar to the first experiment, but the mean CPD over 1 second is much lower than the previous two, because the difference is isolated to within a few milliseconds.


In a fourth test condition, signal A and signal B are two random noise sequences. These are perceptually identical, but have different waveforms. The resulting CPD is much greater (by an order of magnitude) than the CPD at threshold in any other experiment.

Thus, we see that the CPD calculated by simple subtraction is not a good indicator of human perception. Modelling the auditory periphery alone is insufficient to account for what humans can and cannot hear, and a further stage of processing is needed.

J.2.2 Calculating the perceived difference by integration

In our auditory model, we have modelled the functionality of the actual physiology found within the human ear. Modelling the subsequent cognitive process in this way is beyond the scope of this research (though others have taken this path; see [Thiede et al, 1999]). However, the discrepancies described in the previous section can be dealt with without a complex cognitive model.

Rather than examining the CPD on a sample by sample basis, and declaring an audible difference whenever the CPD peaks above a threshold value, the CPD can be summed over time. Then, a threshold area can be defined, and whenever this area is exceeded within a certain time interval, a difference will be perceived. This will make the CPD threshold consistent between the first two experiments detailed in the previous section.

If this mechanism alone were to compensate for the misleadingly high CPD resulting from two perceptually identical noise signals, the time constant would need to be in excess of 200 ms. This would preclude the detection of any temporal masking effects, and a time constant of 20 ms is found to be more realistic. Therefore, a second mechanism is needed to account for our inability to differentiate one noise signal from another.

J.2.3 Adjusting the perceived difference according to the variance of the signal

The discrepancy in the CPD figure obtained from two sequences of random noise is due to the signal in any given band varying dramatically from moment to moment. The listener would need a perfect auditory memory to record each moment, and compare it with the corresponding moment in the second noise signal. This is exactly what the model is doing, hence it predicts perceived differences, but this is beyond the capabilities of the human auditory system. In effect, the more complex the signal, and the more the signal is varying in any given band, the less sensitive we will be to any differences.

Figure J.2: Actual method for calculating the perceived difference between two audio signals

To simulate this, the CPD is scaled by the inverse of the variance of the signal in each band. The actual process is shown in Figure J.2, as follows:

1. The variance of the output of the auditory model is calculated for each signal over a 20 ms period
2. The lower of the two values is chosen
3. Brief peaks in the variance are suppressed
4. The variance signal is low pass filtered with a time constant of 1.4 ms
5. This signal is summed over 100 ms
6. The CPD is scaled by 1/(1 + 30*the resulting figure)
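A MATLAB sketch of these six steps for a single band follows. It is an illustration under stated assumptions, not the thesis code: the model output rate, the 5 ms running median used to suppress brief peaks, and the variable names (outA and outB for the two auditory model outputs, cpd for the raw difference signal) are all assumed here.

% Sketch of the six step variance scaling for one auditory band.
fs   = 44100;                          % assumed model output rate / Hz
outA = randn(fs,1);                    % stand-ins for the two model outputs
outB = randn(fs,1);
cpd  = abs(outA - outB);               % stand-in for the raw CPD signal

n20  = round(0.020*fs);                % 20 ms window
box  = ones(n20,1)/n20;                % 20 ms averaging kernel

% 1. running variance of each model output over 20 ms
varA = filter(box,1,outA.^2) - filter(box,1,outA).^2;
varB = filter(box,1,outB.^2) - filter(box,1,outB).^2;

% 2. take the lower of the two values
v = max(min(varA, varB), 0);           % clamp tiny negative rounding errors

% 3. suppress brief peaks (assumed here: a 5 ms running median)
v = medfilt1(v, round(0.005*fs));      % Signal Processing Toolbox

% 4. low pass filter with a 1.4 ms time constant (one pole smoother)
a = exp(-1/(0.0014*fs));
v = filter(1-a, [1 -a], v);

% 5. sum over 100 ms
n100 = round(0.100*fs);
s = filter(ones(n100,1), 1, v);

% 6. scale the CPD by 1/(1 + 30*the resulting figure)
cpd_scaled = cpd ./ (1 + 30*s);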

The new CPD should indicate whether a human listener would perceive any difference between the two signals, by reference to a single known CPD threshold value, which represents human threshold in any task. Also, larger CPD values should indicate that a human listener would perceive a greater difference between the two signals. We will test the first hypothesis in the following section.


J.3 Validation of the model

J.3.1 Psychoacoustic tests

The following series of psychoacoustic tests were simulated via the model.

Experiment                          Source   Threshold condition                                        Fig.   CPD    Below   Above
Tone masking noise (simultaneous)   [19]     1 kHz tone @ 81 dB; 80 Hz wide 1 kHz noise @ 85 dB         8      25.2   18.5    31.5
Noise masking tone (post-masking)   [20]     200 ms, 80 dB white noise; 50 ms delay, 31 dB 2 kHz tone   9      26.3   20.1    33.8
Level differentiation               [16]     1 dB level difference                                      10     25.7   5.1     78.0
Random Noise                        --       --                                                         11     --     20.5    28.2

Table J.1: details of psychoacoustic tests simulated via the model at threshold

Experiment: name of the psychoacoustic test simulated
Source: reference to the results of that experiment on human subjects
Threshold condition: actual threshold simulated via the model
Fig.: figure number showing the time varying CPD value for the threshold condition
CPD: peak calculated perceived difference at the threshold condition in the target band
Below: peak CPD obtained at 3 dB below threshold (for masking expts.)
Above: peak CPD obtained at 3 dB above threshold (for masking expts.)

In the level differentiation test, the below threshold condition consisted of a level difference of 0.2 dB, whilst the above threshold condition consisted of a level difference of 3 dB.

In the random noise test, the below threshold condition consisted of two sequences of perceptually identical random noise, with non-identical waveforms. For the above threshold condition, the level of one signal was raised by 3 dB.

Examining the results in Table J.1, we see that for each of the simulated tests, the “CPD at threshold” is in close agreement. This shows that, in determining a “just detectable” difference, the CPD correlates well with human perception. The CPD between the two noise signals is also in accordance with human perception.


Figure J.3: CPD at threshold (a) tone masking noise; (b) temporal masking; (c) level differentiation; (d) noise

Thus, the model is shown to accurately predict human perception in a range of psychoacoustic tests.

J.3.2 Codec assessment

The motivation behind developing a model of human perception is to create an objective method of assessing audio codecs. It has been shown that the model can predict the threshold level in a range of psychoacoustic listening tests, but can it perform equally well with real world audio signals? In particular:

1. Can the model accurately determine if the difference between two complex audio signals is perceptible?
2. Can the model quantify how perceptible this difference will be?


In assessing an audio codec, the first issue is analogous to determining the transparent bit-rate – the level of data-reduction which, for a given coding scheme, produces an audio signal that is perceptually identical to the original. The second issue relates to determining how “bad” a codec sounds when its performance is worse than “transparent”.

J.3.2.1 MPEG-1 layer II codec assessment

To test how accurately the model addresses these issues, we will use the model to assess the quality of some audio streams generated by an MPEG-1 layer II codec, operating at a variety of bit-rates. It is known from previous subjective tests that the performance of this codec is “good” (1 on the diff-grade scale [Brandenburg and Bosi, 1997]) when operating at 160 kbps, and near “transparent” (0 on the diff-grade scale) at 192 kbps. Below 160 kbps, the performance of this codec becomes audibly worse, whilst above 192 kbps the codec is perceptually transparent.

The audio sample is taken from LINN CD AKD 028, Track 4, 0:27-0:29, a piece of vocal jazz. This extract was chosen because it was quite difficult to code. A more complete “listening” test, whether with real human subjects, or a perceptual model, would include many diverse extracts. However, for our purposes, one (difficult to code) example will be sufficient.

The extract was coded at a variety of bit rates from 112 kbps to 256 kbps. The MPEG streams were decoded, then time aligned with the original by cross-correlation.
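A minimal MATLAB sketch of this alignment step is given below; the variable names are assumed, and xcorr is taken from the Signal Processing Toolbox. For illustration, the decoded stream is mimicked by delaying the original by 500 samples.

% Time align two signals by the peak of their cross-correlation.
original = randn(44100,1);             % stand-in for the original extract
decoded  = [zeros(500,1); original];   % stand-in: decoded starts 500 samples late

[c, lags] = xcorr(decoded, original);
[cmax, k] = max(abs(c));
lag = lags(k);                         % here lag = 500
if lag > 0
    decoded = decoded(1+lag:end);      % decoded starts late: drop its leading samples
elseif lag < 0
    original = original(1-lag:end);    % decoded starts early: trim the original
end
n = min(length(original), length(decoded));
original = original(1:n);
decoded  = decoded(1:n);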


Figure J.4: Calculated perceived difference for an audio signal coded using MPEG-1 layer II

Only 15 of the 25 bands are shown. The dashed lines show the threshold of audibility in each band (where CPD > 25).

Each coded stream was analysed by the model (Figure J.2), and compared with the original. The result is a time varying CPD output from each band (see Figure J.4). Table J.2 lists the largest CPD (across all bands) during each extract, and also the number of bands in which the CPD exceeded the “perceptible” threshold value.

Bit-rate   Peak CPD (band)   Number of bands in which peak CPD > 25   Can a human hear any difference?

256 13.8 (14) 0 None

192 16.0 (9) 0 None

160 26.6 (19) 2 Slight

128 46.6 (19) 9 Much

112 63.0 (19) 11 More

Table J.2: MPEG 1 layer 2 assessment test

Bit-rate: the rate, in kilobits per second, at which the extract was coded
Peak CPD: the highest CPD during the extract in any band
(Band): the band number in which the above occurred
No. bands CPD > 25: how many bands a difference was perceived in
Hear any difference: in a real listening test, how great a difference a human subject can perceive between the particular bit-rate and the original extract

The results in Table J.2 indicate that, in this particular test, the model predicted human perception very well. At the two bit-rates where a human listener perceives no difference between the original and coded signals, the model does likewise. At 160 kbps, where the human perception is of a signal that is perceivably different from the original, but that difference is not annoying, the model predicts that a difference is just perceptible (a peak CPD of 26.6 compared to a threshold of 25). At the lower bit-rates, the model predicts that the difference between the original and coded signals will be well above threshold. The human perception is that the difference is very audible, hence again the model accurately predicts human perception.

J.3.2.2 Other codec assessments

Similar tests were performed to assess the performance of other codecs via the model, and to compare the results given by the model, with those given by human listeners.


MPEG-1 layer III coded audio was successfully assessed by the model in accordance with human perception.

The model incorrectly assessed the Microsoft audio codec, which maintains a better frequency response, at the expense of poorer temporal resolution. The model predicted an audible difference where human listeners could hear none. Also, where there was an audible difference, the severity of the difference was greatly overestimated by the model in comparison to the human perception of the difference. This affected comparisons with the MPEG codecs, which were themselves correctly assessed. The model indicated that MPEG-1 layer II compression at 128 kbps sounded better than the Microsoft audio codec at 64 kbps, but human listeners perceived the reverse. Though both are audibly far from transparent, the temporal smearing of the Microsoft audio codec was preferred by all listeners compared to the frequency “drop outs” associated with the MPEG-1 layer II codec.

The problem seems to lie in the model’s emphasis on the onset of sounds. This onset emphasis is present within the human auditory system, and to the same extent. However, it seems that some later process must suppress these onsets in certain situations, to account for our tolerance of the temporal-smearing distortion encountered in the final example. This will be the subject of further research.

J.4 Conclusion

An auditory model has been described that simulates the processes found within the human auditory system. The output of this model is analysed to detect perceptible differences between two audio signals. The model was found to correctly predict human perception in a range of psychoacoustic tests. The perception of a range of coded audio extracts was also correctly predicted by the model. Finally, the model was shown to be over-sensitive to temporal errors in the input signal, which are inaudible to human listeners due to pre-masking.

K

Replay Gain – A Proposed Standard

One possible application of the model is as a perceptual loudness meter. The outer and middle ear processing, and the amplitude dependent gammachirp filter bank are required for this task. The peak amplitude of the signal within each band is used as the excitation in the model of [Moore et al, 1997]. This transforms Moore’s analytical model into a time-domain model.

The output is a time-domain loudness surface. The instantaneous area under this surface represents the perceived loudness, but for a real music signal, the integration of this area does not correctly predict the overall loudness of the whole track. Instead, the 95th percentile value of the complete set of instantaneous perceived loudness measures gives a good indication of the overall perceived loudness of the whole track.

If this calculation is carried out for a real music signal, the accuracy of the preceding model is redundant, since the statistical processing at the end is significantly more important than the auditory model itself. This raises the possibility of vastly simplifying the whole process, so that it can run at faster than real time on a typical PC, whilst still yielding a reasonably accurate prediction of overall perceived loudness.

This approach forms the basis of the replay gain standard, which is described in this appendix. This is currently published on the world wide web at http://www.replaygain.org/



K.1 Introduction

Not all CDs sound equally loud. The perceived loudness of mp3 files is even more variable. Whilst different musical moods require that some tracks should sound louder than others, the loudness of a given CD has more to do with the year of issue or the whim of the producer than the intended emotional effect. If we add to this chaos the inconsistent quality of mp3 encoding, it is no wonder that a random play through your music collection can have you leaping for the volume control every other track.

There is a remarkably simple solution to this annoyance, and that is to store the required replay gain for each track within the track. This concept is called "MetaData" – data about data. It is already possible to store the title, artist, and CD track number within an mp3 file using the ID3 standard. The later ID3v2 standard also incorporates the ability to store a track relative volume adjustment, which can be used to "fix" quiet or loud sounding mp3s.

However, there is no consistent standard method to define the appropriate replay gain which mp3 encoders and players agree on, and no automatic way to set the volume adjustment for each track – until now.

The Replay Gain proposal sets out a simple way of calculating and representing the ideal replay gain for every track and album.

K.1.1 Perceived Loudness

The perceived loudness of an audio track is a combination of several factors. These include the audio data stored on the CD, the position of the listener’s volume control, the characteristics of the listener’s CD player, amplifier and speakers, and the listener him or herself.

The position of the volume control, and characteristics of the CD player, amplifier and speakers combine to yield an overall system gain. This gain can be calibrated, so that a known digital signal level on a CD will yield a known real world sound pressure level. However, this calibration is rarely carried out in the home listening environment.

Audio engineering experience suggests that the individual listener can be the most variable and unpredictable component of a typical audio system. However, “expert” listeners are often in remarkable agreement as to how loud a given audio track should be reproduced. For most acoustic music, this corresponds to the loudness of the original performance. For other types of music, it is difficult to define the origin of this preference, but a consistent preference (to within one or two dB) is often observed across many listeners. For this reason, the listener can be regarded as the most predictable component in this chain.

Surprisingly, the least predictable factor is the audio data on each individual CD. Compare a pop single to a classical album, and the audio data would suggest that the former should be 15 dB louder than the latter. This is clearly nonsense, unless Britney Spears really does sing louder than a full symphony orchestra.

Armed with a volume control, good taste, and common sense, the listener must correct for these unwanted differences in loudness between different discs.

This paper suggests an alternative, which will leave the listener free to enjoy the music. The basic concept is outlined, and an algorithm for calculating the perceived loudness of an audio track is described in detail. The proposed data format is specified, and the requirements for audio players are explained in depth. This entire proposal focuses on the world of PC audio, since everything described in this paper can be accomplished in software. However, the relevance of this proposal to new audio formats should not be underestimated. The author strongly suggests that the meta data defined herein would be a valuable addition to the DVD-audio and SACD specifications.

K.1.2 Definitions

Before continuing, it is necessary to define some terms. These details will be expanded in the following sections.

The Replay Gain is the gain of the entire system, which links the digital signal on a CD to the real world sound pressure level.

All Replay Gain Adjustments are specified relative to SMPTE RP 200. Hence, a Replay Gain Adjustment of -10 dB suggests that the system gain should be reduced by 10 dB relative to SMPTE RP 200. In software, this may be achieved by scaling the digital data, at the expense of some SNR.

The ideal loudness is usually equal to the loudness of the original acoustic event. As discussed above, this can often be judged by experienced listeners to within one or two dB.

The Audiophile Replay Gain Adjustment is the gain adjustment, relative to SMPTE RP 200, which will yield the ideal loudness for a given track. If the track is mastered via a system calibrated to SMPTE RP 200, then the Audiophile Replay Gain Adjustment will be 0 dB.

The Radio Replay Gain Adjustment is the gain adjustment, relative to SMPTE RP 200, which will make the overall perceived loudness for a given track equal to 83 dB SPL. Thus, the Radio Replay Gain Adjustment will cause all tracks to sound equally loud.

ReplayGain is the name of the algorithm described within this paper. It calculates the overall perceived loudness of an audio track, and from this calculates a Radio Replay Gain Adjustment. It can also guess a value for the Audiophile Replay Gain Adjustment, but this value is only approximate.

The Replay Gain Standard is defined in this paper, and on the website www.replaygain.org, which together form the Replay Gain Proposal.

Finally, the signal level usually refers to the magnitude of the digital audio data. Without specifying the Replay Gain, this says nothing about the perceived loudness.

K.1.3 Basic Concept

The following basic method is suggested to equalise the loudness of all audio tracks.

1. Calculate the perceived loudness of each track
2. Convert this to a suggested Replay Gain Adjustment
3. Store this value within the header of the audio file
4. Adjust the volume of each track accordingly on replay

The Replay Gain Adjustment is a suggested gain adjustment for each track. Players can scale the audio data by this adjustment in order to achieve a consistent perceived loudness across all tracks.

The reference gain yields a real world loudness of 83 dB SPL, as defined in the SMPTE RP 200 standard. If the Replay Gain Adjustment for a given track is -12 dB, this means that the track is relatively loud, and the gain should be reduced by 12 dB, ideally to 71 dB. Players that understand the Replay Gain Standard will do this automatically.
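In software this scaling is a one-liner. A minimal MATLAB sketch (assuming sample values in the range -1 to +1):

% Apply a Replay Gain Adjustment read from the file header.
gain_dB = -12;                            % example: a relatively loud track
scale   = 10^(gain_dB/20);                % dB to linear, ~0.251
audio   = 2*rand(1000,1) - 1;             % stand-in for decoded audio
out     = max(-1, min(1, audio*scale));   % scale, then clip to the valid range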

Steps one and two can be carried out automatically, using the ReplayGain algorithm described in this paper. Alternatively, a human listener (e.g. the record producer or mastering engineer) may specify the ideal Replay Gain Adjustment relative to SMPTE RP 200, and this value can be stored instead.

K.2 “Radio” and “Audiophile” gain adjustments

Under some listening conditions, it is useful to adjust every track to sound equally loud. However, it is often desirable to leave the intentional loudness differences between tracks in place, whilst still correcting for unmusical and annoying changes in loudness between discs.

To account for these two separate listening conditions, the Replay Gain Proposal suggests that two different gain adjustments should be stored in the file header, as follows.

K.2.1 "Radio" Replay Gain Adjustment

This will cause all tracks to sound equally loud (as they typically do on the radio, hence the name). If the Replay Gain is calculated on a track-by-track basis (i.e. an individual ReplayGain calculation is carried out for each track), this will be the result. This is something that the ReplayGain algorithm described in this paper does very well. An audio demonstration is included on the accompanying CD-ROM.

K.2.2 "Audiophile" Replay Gain Adjustment

The problem with the "Radio" setting is that tracks which should be quiet will be brought up to the level of all the rest. For casual listening, or in a noisy background, this can be a good thing. For serious listening, it would be a nuisance. For example, it would be inappropriate to raise the volume of a flute solo to match that of a symphony orchestra.

To solve this problem, the "Audiophile" setting represents the ideal listening gain for each track. ReplayGain can have a good guess at this too, by reading the entire CD, and calculating a single gain adjustment for the whole disc. This works because quiet tracks will remain quieter than the rest of the disc, since the gain won't be changed for each track. It still solves the basic problem of annoying, unwanted level differences between discs because quiet or loud discs are still adjusted overall. This means that a typical pop CD that's 15 dB louder than a classical CD will be brought into line.

ReplayGain will fail to calculate a reasonable “audiophile” setting for an entire CD of quiet music. Instead, it will raise it to an average loudness. This is why the "Audiophile" Replay Gain Adjustment must be user adjustable. The ReplayGain whole disc value represents a good guess, and should be stored in the file. Later, the user can tweak this value if required.

If the file has originated from the artist (e.g. download from mp3.com), then the "Audiophile" setting can be specified by the artist. Naturally, the user is free to change this value if they desire.

K.3 Replay Gain Adjustment Calculation

The following ReplayGain algorithm is suggested to calculate the perceived loudness, and hence the appropriate Radio Replay Gain Adjustment value for a given audio track. The stages of the calculation are outlined below, and discussed in detail in the following sections.


EQUAL LOUDNESS FILTER

The human ear does not perceive sounds of all frequencies as having equal loudness. For example, a full scale sine wave at 1 kHz sounds much louder than a full scale sine wave at 10 kHz, even though the two contain identical energy. To account for this, the signal is filtered by an inverted approximation to the equal loudness curves (sometimes referred to as Fletcher-Munson curves).

RMS ENERGY CALCULATION

Next, the energy during each moment of the signal is determined by calculating the Root Mean Square of the waveform every 50 ms.

STATISTICAL PROCESSING

Where the average energy level of a signal varies with time, the louder moments contribute most to our perception of overall loudness. For example, in human speech, over half the time is silence, but this does not affect the perceived loudness of the talker at all. For this reason, the 95th percentile is chosen to represent the overall perceived loudness of the signal.

CALIBRATION AND REFERENCE LEVEL

A suitable average replay level is 83 dB SPL. A calibration relating the energy of a digital signal to the real world replay level has been defined by the SMPTE. Using this calibration, the current signal level is subtracted from the desired (calibrated) level to give the Replay Gain Adjustment.

Each stage of this calculation will now be described in detail.

K.3.1 Equal Loudness Filter

Figure K.1 shows the Equal Loudness Contours, taken from [Robinson and Dadson, 1956]. The original measurements were carried out by Fletcher and Munson in 1933, and the curve often carries their name.

The lines represent the sound pressure required for a test tone of any frequency to sound as loud as a test tone of 1 kHz. Examine the line marked "60" - at 1 kHz, the line marked "60" is at 60 dB. Following the "60" line down to 0.5 kHz, the value is about 55 dB. This shows that a 500 Hz tone at 55 dB SPL sounds as loud to a human listener as a 1 kHz tone at 60 dB SPL.

Figure K.1: Fletcher-Munson curves

If every frequency sounded equally loud, then this graph would consist of a series of horizontal lines. Since it is very far from this ideal, a filter is required to simulate this characteristic of the human auditory system.

Where the lines curve upwards, humans are less sensitive to sounds of that frequency. Hence, the filter must attenuate (reduce) sounds of that frequency in order to simulate the response of the human auditory system. The ideal filter will be the inverse of the equal loudness response. Unfortunately, this response varies with loudness. It would significantly complicate the calculation to use a different filter for different amplitude audio signals, and the performance gains would be questionable. For this reason, a weighted average of the responses is chosen as the target filter response, as shown in Figure K.2.

Figure K.2: Target Response – The Average Inverse Equal Loudness Curve


Figure K.3: Possible filter response: A-weighting curve (red)

Figure K.4: Other possible filters: A, B, and C-weighting curves

Figure K.5: IIR filter generated by yulewalk.m

Figure K.6: Composition of final filter. RED: Overall response

A common approximation to the frequency response of the human ear is the "A" Weighting curve [IEC 1672, 1996]. Figure K.3 shows the response of a 2x7 coefficient IIR filter designed to match the A weighting curve. As can be seen from the graph, the A-weighting curve is only an approximation to human hearing sensitivity, and is not sufficiently accurate for our purposes. For example, at 15 kHz, the response passes 30 dB more energy than the ears of a human listener.

The "B" and "C" weighting curves are even less appropriate, as illustrated in Figure K.4 [Singleton, WEB]. Since none of these standard weighting curves gives a suitable frequency response, a custom filter is designed, as follows.


MATLAB offers several functions to design FIR and IIR filters to match arbitrary amplitude responses. Feeding the target response into yulewalk.m, and requesting a 2x10 coefficient IIR filter, gives the response shown in Figure K.5.

At higher frequencies, this filter is an excellent approximation to the target response. However, at lower frequencies, it is inadequate. Increasing the number of coefficients does not cause the yulewalk function to perform significantly better.

One solution is to cascade the yulewalk filter with a 2nd order Butterworth high pass filter having a high pass frequency of 150 Hz. The resulting combined response is close to the target response, as shown in Figure K.6. This filter is used in ReplayGain.
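A MATLAB sketch of this two stage design follows. The breakpoints and magnitudes below are placeholders only; the true target curve is produced by equalloudfilt.m, as noted in the implementation section below.

% Sketch: fit an IIR filter to a target response, then cascade a high pass.
fs   = 44100;
freq = [0 150 1000 3000 15000 fs/2] / (fs/2);   % normalised frequencies, 0 to 1
mag  = [0.1 0.5 1.0 1.2 0.1 0.05];              % placeholder target gains

[by, ay] = yulewalk(10, freq, mag);             % IIR fit to the target curve
[bb, ab] = butter(2, 150/(fs/2), 'high');       % 2nd order 150 Hz high pass

x = randn(fs,1);                                % stand-in for an audio signal
y = filter(bb, ab, filter(by, ay, x));          % the cascaded equal loudness filter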

K.3.1.1 Implementation

The MATLAB script equalloud.m generated the above graphs. It requires adsgn.m [Couvreur, 1997] to generate the "A" weighting filter for comparison. The yulewalk and butter functions are built in to MATLAB. The filter design function called by the MATLAB implementation of ReplayGain is equalloudfilt.m. For readers without access to MATLAB, a text document containing all the filter coefficients is provided. All these files are included on the accompanying CD-ROM.

K.3.2 RMS Energy Calculation

It is trivial to calculate the RMS energy over an entire audio file. For example, Cool Edit Pro [Syntrillium, 2000] provides this information via its Analyze: Statistics box. Unfortunately, this value does not give a good indication of the perceived loudness of a signal. Whilst it is closer than that given by the peak amplitude, it is still less accurate than is required for the current task. For this reason, the RMS energy is calculated on a moment by moment basis (as described in this section), and this information is subsequently processed to yield the overall perceived loudness (as described in Section K.3.3).

The audio data is partitioned into blocks, and the RMS value of the samples within each block is calculated. The block length of 50 ms was chosen after studying the effect of values between 25 ms and 1 second. 25 ms is too short to accurately reflect the perceived loudness of some sounds. Beyond 50 ms there is little change in performance (after statistical processing). For this reason, 50 ms was chosen. Alternatively, the mp3 frame length of 52 ms may be used for convenience without significantly compromising the accuracy of the calculation.

K.3.2.1 Stereo Files

A suitable method to process stereo files was investigated, as the correct calculation was not immediately obvious. It is possible to sum the two channels into a single channel before calculating the RMS energy, but then any out-of-phase components cancel out to zero. This is not the manner in which humans perceive such sounds, so it is not an appropriate solution.

One alternative consists of calculating two RMS values (one for each channel) and then adding them. Unfortunately, a linear addition does not yield the same perceived loudness as listening to the two signals via loudspeakers. To demonstrate this, consider a mono (single channel) audio track. Replay this signal using one loudspeaker, and remember how loud it sounds. Now replay the same track using two loudspeakers, and determine how large a signal is required to each speaker such that, overall, the perceived loudness is equal to the previous presentation. A linear addition would suggest that the signal should be half as large, but the correct value is approximately three-quarters, as determined experimentally.

The correct answer can be obtained if the means of the channel-signals are averaged before calculating the square root. Possibly, this could be referred to as “Root mean mean square”. In mixing desk terminology, this equates to "equal power" rather than "equal voltage" pan-pots. If it is also assumed that any mono (single channel) signal will always be replayed over two loudspeakers, a mono signal may be treated as a pair of identical stereo signals. Hence a mono signal gives (a+a)/2 (i.e. a), while a stereo signal gives (a+b)/2, where a and b are the mean squared values for each channel. The result is square rooted and converted to dB.

K.3.2.2 Implementation

In the MATLAB implementation, the RMS calculation is carried out by the following lines

(modified here for clarity) from ReplayGain.m:

% Mono signal: just process the one channel
if channels==1,
    Vrms_all(this_block)=mean(inaudio(start_block:end_block).^2);
% Stereo signal: calculate average Vrms of both channels
elseif channels==2,
    Vrms_left=mean(inaudio(start_block:end_block,1).^2);
    Vrms_right=mean(inaudio(start_block:end_block,2).^2);
    Vrms_all(this_block)=(Vrms_left+Vrms_right)/2;
end

% Convert to dB
Vrms_all=10*log10(Vrms_all+10^-10);


Figure K.7: Histograms of RMS energy values in 3 audio tracks: (a) speech; (b) pop music; (c) classical music

In the last line, the addition of ten to the power minus ten (10^-10) prevents the calculation of log(0) (which would give an error) during periods of digital silence. A level of approximately -100 dB is calculated instead, which (on this scale) is below the noise floor of a 24-bit recording.

10*log10(signal) is the same as 20*log10(sqrt(signal)). Thus, the square root and the conversion to dB are carried out in one simple step, without the use of a square root function.
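That is, the code relies on the logarithmic identity:

\[ 10\log_{10}(x) = 20\log_{10}\left(\sqrt{x}\right) \]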

K.3.3 Statistical Processing

Having calculated RMS signal levels for every 50 ms of the file, a single value must be calculated to represent the perceived loudness of the entire file. The histograms in Figure K.7 show how many times each RMS value occurred in each file, for three different types of music.

The most common RMS value in the speech track was -45 dB (background noise), so the mode is not a good indicator of perceived loudness. The average RMS value is similarly misleading for the speech sample, and for classical music.

A possible method to determine the overall perceived loudness is to sort the RMS energy values into numerical order, and then pick a value near the top of the list. This suggestion is analogous to one published in [Zwicker and Zwicker, 1991]. After computing the fluctuation of perceived loudness with time, "Comparisons of the loudness perceived by many subjects have indicated that the average loudness corresponding to N50 (the loudness exceeded in 50% of the time) gives an inadequate number, whereas N5 to N10 give adequate readings of what the subjects really perceive."

In the present case, the correct choice of percentile is one of the most critical aspects of ReplayGain. Values from 70% to 95% were tested using a wide range of audio material. For highly compressed pop music (e.g. Figure K.7 (b), where there are many values near the top), the choice makes little difference. For speech and classical music, the choice makes a significant difference. The value which most accurately matches human perception of loudness is found to be around 95%, so this value is used by ReplayGain.

K.3.3.1 Implementation

Since MATLAB has a dedicated sort function, and simple indexing into arrays, this task is carried out in two lines of code (from ReplayGain.m):

% Sort the Vrms values into numerical order
Vrms_all=sort(Vrms_all);
% Pick the 95% value
Vrms=Vrms_all(round(length(Vrms_all)*0.95));

...where length(Vrms_all)*0.95 is just an index 95% of the way into the sorted array. It must be rounded because MATLAB will not accept non-integer array indices.

K.3.4 Calibration and Reference Level

It is necessary to reference the RMS energy value to a real world sound pressure level. This is essential for the Audiophile Replay Gain Adjustment, but it is also desirable for the Radio Replay Gain Adjustment.

The audio industry doesn't have a fixed standard for replay gain, but the movie industry has worked to an 83 dB standard for years, called SMPTE RP 200 (see [SMPTE, 1999] for full details). This standard states that a single channel pink noise signal, with an RMS energy level of -20 dB relative to a full scale sinusoid, should be reproduced at 83 dB SPL (measured using a C-weighted, slow averaging SPL meter). The result is that every listener (or cinema) can set their volume control to the same (known, calibrated) gain, and all film soundtracks are mixed to sound best when reproduced at this gain.


As an aside, it is worth noting that the 83 dB SPL reference wasn't picked at random. It was chosen because it represents a comfortable average listening loudness, as determined by professionals from years of listening. The reference level of -20 dB pink noise causes the calibrated average to be 20 dB less than the peak signal level. This yields 20 dB of headroom for louder than average signals. If CDs were mastered to this standard, the average level would be around -20 dB FS, which would leave lots of room for the dramatic peaks which make music exciting. Such a standard has been proposed [Katz, 2000], but this has yet to gain wide acceptance within the audio industry. That proposal defines three different calibrated monitor gains, so it does not entirely solve the problem of different CDs requiring different gain settings.

The consequence of adopting a single calibrated monitor gain would be that all CDs should be reproduced using the same gain setting, and they would all "sound right" at that setting. If you (as a listener) didn't want to listen at that particular gain setting, you could always turn it down, but all CDs would still sound equally "turned down" at your preferred setting. You wouldn't have to change the volume setting between discs.

In reality, CDs are not mixed or mastered using a single calibrated monitor gain, hence the huge difference in perceived loudness between discs. However, ReplayGain aims to calculate how much louder or quieter a CD is compared to one that has been mastered at the calibrated gain, and adjust appropriately. This process involves a few assumptions, but can give good results. The process is as follows.

For the Radio Replay Gain Adjustment, it is assumed that the perceived loudness should average around 83 dB SPL. SMPTE RP 200 states that a -20 dB pink noise signal will yield 83 dB SPL when replayed via a calibrated system. Hence, the average level of all recordings should be around -20 dB. To achieve this, the reference pink noise signal is processed by ReplayGain as outlined thus far, and the result is stored as ref_Vrms. For every audio track, the difference between the calculated ReplayGain value for the track and ref_Vrms indicates how much the signal should be scaled in order to match 83 dB perceived loudness. ReplayGain calculates overall perceived loudness, so this adjustment will cause all audio tracks to have the same overall perceived loudness as the reference pink noise signal.

There is one difference between the SMPTE calibration and the ReplayGain calibration. The SMPTE system calibration uses a single channel of pink noise (reproduced through a single loudspeaker). However, music is typically played via two loudspeakers. Thus, though one channel of pink noise is required to calibrate the system gain, the ideal loudness of the music is actually the loudness when two speakers are in use. For this reason, ReplayGain is calibrated to two channels of pink noise, since this is the average loudness that the music should match. In the implementation, a monophonic pink noise wavefile is employed, and ReplayGain automatically assumes that it will be replayed over two speakers, as it does for any monophonic file.

K.3.4.1 Implementation

ReplayGainScript.m loads a .wav file containing -20 dB FS pink noise from disk, and processes this via ReplayGain.m, storing the result as a reference. The reference wavefile is included on the accompanying CD-ROM. The relevant lines of code are:

% Calculate perceived loudness of -20dB FS RMS pink noise
% This is the SMPTE RP 200 reference signal. It calibrates to:
%   0dB on a K-20 studio meter / mixing desk
%   83dB SPL in a listening environment
[ref_Vrms]=replaylevel('ref_pink.wav',a1,b1,a2,b2);
...
% Calculate the perceived loudness of the file using "replaygain" function
% Subtract this from reference loudness to give Replay Gain Adjustment
% relative to 83 dB reference
Vrms=ref_Vrms-replaygain(current_file,a1,b1,a2,b2);

This value is stored within the audio file header. It represents an adjustment relative to the SMPTE RP 200 standard.

K.3.5 Overall Implementation

The entire ReplayGain algorithm has been implemented in MATLAB. The required files are ReplayGainScript.m, which batch processes .wav files, and ReplayGain.m, which carries out most of the calculation. EqualLoudFilt.m is required to generate the equal loudness filter coefficients. All files are included on the accompanying CD-ROM.

K.4 Replay Gain Data Format

Three values must be stored in the Replay Gain header.

1. Peak signal amplitude
2. "Radio" = Radio Replay Gain Adjustment required to make all tracks equal loudness
3. "Audiophile" = Audiophile Replay Gain Adjustment required to give ideal listening loudness


If calculated on a track-by-track basis, ReplayGain yields (2). If calculated on a disc-by-disc basis, ReplayGain will usually yield (3), though this value may be more accurately determined by a human listener if required.

To allow for future expansion, if more than three values are stored, players should ignore those they do not recognise, but process those that they do. If additional Replay Gain Adjustments other than "Radio" and "Audiophile" are stored, they should come after "Radio" and "Audiophile". The Peak Amplitude must always occupy the first four bytes of the Replay Gain header frame. The three values listed above (or at least fields to hold the three values, should the values themselves be unknown) are required in all Replay Gain headers.

The Replay Gain Adjustment must be between -51.0dB and +51.0dB. Values outside this range must be limited to be within the range, though they are certainly in error, and should probably be re-calculated, or stored as "not set". For example, trying to cause a silent 24-bit file to play at 83 dB perceived loudness will yield a Replay Gain Adjustment of +57dB.

In practice, adjustment values from -23dB to +17dB are the likely extremes, and values from -18dB to +2dB are more usual.

K.4.1 Bit format

Each Replay Gain value should be stored in a Replay Gain Adjustment field consisting of two bytes (16 bits). Here are two example Replay Gain Adjustment fields:

Each 16-bit Replay Gain Adjustment field is divided into a 3-bit name code, a 3-bit originator code, a sign bit, and a 9-bit Replay Gain Adjustment value:

                             name  originator  sign  Replay Gain
                             code  code        bit   Adjustment
Radio Gain Adjustment:       001   011         1     001111101
Audiophile Gain Adjustment:  010   010         0     000010100

In the above example, the Radio Gain Adjustment is -12.5dB, and was calculated automatically. The Audiophile Gain Adjustment is +2.0dB, and was set by the user. The binary codes are defined in the following sections.
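As a worked check of the Radio field above: the name code 001 denotes Radio; the originator code 011 denotes an automatic calculation; the sign bit 1 denotes minus; and the remaining nine bits, 001111101, are 125 in binary, i.e. 12.5 dB. The field therefore decodes to a Radio Gain Adjustment of -12.5dB, calculated automatically.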


K.4.1.1 Name code

000 = not set
001 = Radio Gain Adjustment
010 = Audiophile Gain Adjustment
other = reserved for future use

If space has been reserved for the Replay Gain in the file header, but no replay gain calculation has been carried out, then all bits (including the Name code) may be zero.

For each Replay Gain Adjustment field, if the name code = 000 (not set), then players should ignore the rest of that individual field.

For each Replay Gain Adjustment field, if the name code is an unrecognised value (i.e. not 001-Radio or 010-Audiophile), then players should ignore the rest of that individual field.

If no valid Replay Gain Adjustment fields are found (i.e. all name codes are either 000 or unknown), then the player should proceed as if the file contained no Replay Gain Adjustment information (see Section K.7).

K.4.1.2 Originator code

000 = Replay Gain unspecified
001 = Replay Gain pre-set by artist/producer/mastering engineer
010 = Replay Gain set by user
011 = Replay Gain determined automatically, as described in this paper
other = reserved for future use

For each Replay Gain Adjustment field, if the name code is valid, but the Originator code is 000 (Replay Gain unspecified), then the player should ignore that Replay Gain Adjustment field.

For each Replay Gain Adjustment field, if the name code is valid, but the Originator code is unknown, then the player should still use the information within that Replay Gain Adjustment field. This is because, even if there is uncertainty as to how the adjustment was determined, any valid Replay Gain Adjustment is more useful than none at all.


If no valid Replay Gain Adjustment fields are found (i.e. all originator codes are 000), then the player should proceed as if the file contained no Replay Gain Adjustment information (see Section K.7).

K.4.1.3 Sign bit

0 = +
1 = -

K.4.1.4 Replay Gain Adjustment

The value, multiplied by ten, stripped of its sign (since the + or - is stored in the "sign" bit), is represented in 9 bits. e.g. -3.1dB becomes 31 = 000011111.
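As an illustration, a complete Replay Gain Adjustment field could be assembled in MATLAB as follows. This sketch is not part of the thesis code, and the variable names are hypothetical; only the bit layout is taken from the sections above:

% Sketch: pack one Replay Gain Adjustment field into a 16-bit value.
% Layout (most significant bit first): 3-bit name code, 3-bit
% originator code, 1 sign bit, 9-bit adjustment in 0.1 dB steps.
name_code  = 1;                        % 001 = Radio Gain Adjustment
originator = 3;                        % 011 = determined automatically
adjust_dB  = -12.5;                    % the adjustment to encode
sign_bit   = double(adjust_dB < 0);    % 0 = +, 1 = -
magnitude  = round(abs(adjust_dB)*10); % 125 = 001111101
field = bitshift(name_code,13) + bitshift(originator,10) + ...
        bitshift(sign_bit,9) + magnitude;
% field = 11901 = 0010111001111101 binary, matching the Radio Gain
% Adjustment example in Section K.4.1.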

K.4.2 Default Value

$00 $00 (0000000000000000) should be used where no Replay Gain has been calculated or set. This value will be interpreted by players in the same manner as a file without a Replay Gain field in the header (see Section K.7).

The values of xxxyyy0000000000 (where xxx is any name code, and yyy is any originator code) are all valid, but indicate that the Replay Gain is to be left at 83 dB (0 dB Replay Gain Adjustment). These are not default values, and should only be used where appropriate (e.g. where the user, producer, or ReplayGain calculation has indicated that the correct Replay Gain is 83 dB).

K.4.3 Illegal Values

The values xxxyyy1000000000 are all illegal, since they represent negative zero. These values may be used to convey other information in the future. They must not be used at present. If encountered, players should treat them in the same manner as $00 $00 (the default value).

The value $xx $ff is not illegal, but it would give a false synch value within an mp3 file. If this value occurs, it is suggested the Replay Gain should be increased or decreased by 0.1 dB. This change will be inaudible, and it will avoid any possible problems due to false synchronisation.

K.5 Peak Amplitude Data Format

Scanning the file for the peak amplitude can be a time-consuming process. Therefore, it will be useful to store this single value within the file header. This can be used to determine if the required Replay Gain Adjustment will cause the file to clip.

K.5.1 Data Format

The maximum peak amplitude (a single value) should be stored as a 32-bit floating point number, where 1 = digital full scale. This typically yields 22-bit accuracy, which is more than sufficient for the purpose.

K.5.2 Uncompressed Files

The maximum absolute sample value held in the file (on any channel) should be stored. The single sample value must be converted to a 32-bit float, such that digital full scale is equivalent to a value of 1.
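For example (a minimal sketch, not taken from the thesis code; the filename and wavread-style loading are assumptions):

% Sketch: find the peak amplitude of an uncompressed file,
% expressed so that 1 = digital full scale.
inaudio = wavread('track.wav');    % samples scaled to the range +/-1
peak = max(abs(inaudio(:)));       % maximum absolute value on any channel
% peak would then be stored in the header as a 32-bit float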

K.5.3 Compressed files

Compressed audio does not exist as a waveform until it is decoded. Unfortunately, psychoacoustic coding of a heavily limited file can lead to sample values larger than digital full scale upon decoding. However, it is likely that such values will be brought back within range after scaling by the Replay Gain Adjustment. Even so, it is necessary to store the peak value of a compressed file as a 32-bit floating-point representation, where +/-1 represent digital full scale, and values outside this range would usually clip.

K.5.4 Implementation

For uncompressed files, the maximum values must be found and stored. For compressed files, the files must be decoded using a fully compliant decoder that allows peak overflows (i.e. has headroom), and the maximum value stored.

K.6 Replay Gain File Format

Each file format represents a unique situation. All audio files would benefit from the inclusion of Replay Gain information. Proposals for additional header information within two popular audio file formats are given below.

K.6.1 "mp3" file format

A suitable location for the Replay Gain header is within an ID3v2 tag, at the start of the mp3 file [Nilsson, WEB].1 Note that all ID3v2 tag frames are written in Big Endian Byte order.

The ID3v2 standard defines a "tag" which is situated before the data in an mp3 file. The original ID3 (v1) tags reside at the end of the file, and contain six fields of information. The ID3v2 tags can contain virtually limitless amounts of information, and new "frames" within the tags may be defined.

K.6.1.1 Replay Gain Adjustment frame

Following the format of the ID3v2 standard document, a new frame is suggested, thus:

Peak Amplitude                     $xx $xx $xx $xx
Radio Replay Gain Adjustment       $xx $xx
Audiophile Replay Gain Adjustment  $xx $xx

Header consists of:
Frame ID  $52 $47 $41 $44 = "RGAD"
Size      $00 $00 $00 $08
Flags     $40 $00 (%01000000 %00000000)

In the RGAD frame, the flags state that the frame should be preserved if the ID3v2 tag is altered, but discarded if the audio data is altered.

K.6.2 “wav” file format

The RIFF WAV format is defined by [IBM and Microsoft, 1991]. NOTE: .wav files use little endian byte order.

The RIFF format is based around the concept of data chunks. Each chunk has the following basic format: NAME, SIZE, DATA. Wave files always contain a WAVEfmt chunk and a data chunk. The WAVEfmt chunk contains information such as the number of channels, sampling rate, bits per sample, etc. The data chunk contains the actual audio data. The entire file is contained within a RIFF chunk.

1 A new tag format which will include the Replay Gain Adjustment information is proposed in [Van den Berghe, WEB], but is not discussed here.


There are several other optional chunks, which may be placed between the WAVEfmt chunk and the data chunk. These include Cue and Playlist chunks.

The standard requires that software can successfully read .wav files containing unknown chunks. Should an unknown chunk NAME be encountered, then the accompanying SIZE field should be read, and the DATA field should be skipped. This means that new chunks can be defined without breaking compatibility with legacy software.

K.6.2.1 Replay Gain Adjustment chunk

Following the format of the standard document, the format is extended thus:

<WAVE-form> -> RIFF( 'WAVE'
                     <fmt-ck>             // Format
                     [<rgad-ck>]          // Replay Gain Adjustment
                     [<fact-ck>]          // Fact chunk
                     [<cue-ck>]           // Cue points
                     [<playlist-ck>]      // Playlist
                     [<assoc-data-list>]  // Associated data list
                     <wave-data> )        // Wave data

<rgad-ck> -> rgad( <fPeakAmplitude:FLOAT32>
                   <nRadioRgAdjust:UINT16>
                   <nAudiophileRgAdjust:UINT16> )

Here is the definition in plain English. A new chunk is defined, named "rgad" (Replay Gain Adjustment). The size is eight bytes. The first value stored is the Peak Amplitude (4 bytes); the second value stored is the Radio Replay Gain Adjustment (2 bytes); the third and final value stored is the Audiophile Replay Gain Adjustment (2 bytes).
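To illustrate how a player might locate this chunk, here is a minimal sketch of a chunk walker (the filename is hypothetical, and no such reader appears in the thesis code; unknown chunks are skipped as the RIFF standard requires):

% Sketch: walk the chunks of a .wav file, reading the "rgad" chunk
% if present and skipping any chunk that is not recognised.
fid = fopen('track.wav','r','l');          % 'l' = little-endian
fread(fid,4,'uchar');                      % "RIFF"
fread(fid,1,'uint32');                     % RIFF size
fread(fid,4,'uchar');                      % "WAVE"
while true
    name = fread(fid,4,'uchar');           % chunk NAME
    if numel(name)<4, break; end           % end of file
    csize = fread(fid,1,'uint32');         % chunk SIZE
    if strcmp(char(name'),'rgad')
        peak       = fread(fid,1,'float32');  % fPeakAmplitude
        radio      = fread(fid,1,'uint16');   % nRadioRgAdjust
        audiophile = fread(fid,1,'uint16');   % nAudiophileRgAdjust
    else
        fseek(fid,csize+mod(csize,2),'cof');  % skip the DATA field
    end
end
fclose(fid);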


EXAMPLE .WAV HEADER

For clarification, there follows an example .wav header, containing the new rgad chunk. This demonstrates the manner in which the "RIFF" chunk wraps around all the other chunks, and also how each chunk starts with its name, followed by its size. The new "rgad" chunk occupies bytes 36 to 51.

Start byte  Chunk  Chunk title          Contents  Contents (hex)  Bytes   Format
0           RIFF   name                 "RIFF"    52 49 46 46     4       ASCII
4           RIFF   size                 176444    3C B1 02 00     4       uInt32
8           RIFF   WAVE name            "WAVE"    57 41 56 45     4       ASCII
12          fmt    name                 "fmt "    66 6D 74 20     4       ASCII
16          fmt    size                 16        10 00 00 00     4       uInt32
20          fmt    wFormatTag           1         01 00           2       uInt16
22          fmt    nChannels            2         02 00           2       uInt16
24          fmt    nSamplesPerSec       44100     44 AC 00 00     4       uInt32
28          fmt    nAvgBytesPerSec      176400    10 B1 02 00     4       uInt32
32          fmt    nBlockAlign          4         04 00           2       uInt16
34          fmt    nBitsPerSample       16        10 00           2       uInt16
36          rgad   name                 "rgad"    72 67 61 64     4       ASCII
40          rgad   size                 8         08 00 00 00     4       uInt32
44          rgad   fPeakAmplitude       1         00 00 80 3F     4       float32
48          rgad   nRadioRgAdjust       10822     46 2A           2       uInt16
50          rgad   nAudiophileRgAdjust  18999     37 4A           2       uInt16
52          data   name                 "data"    64 61 74 61     4       ASCII
56          data   size                 176400    10 B1 02 00     4       uInt32
60          data   waveform data        ......                    176400  Int16

In this example, the "rgad" chunk contains the following values:

• The "fPeakamplitude" value of 1 means that the digital data in the .wav file peaks at digital full scale (equivalent to -32768 for this 16-bit file). • The "nRadioRgAdjust" value of 110822 (0010101001000110) represents a -7.0dB Ra- dio Replay Gain Adjustment, user set. • The "nAudiophileRgAdjust" value of 18999 (0100101000110111) represents a -5.5dB Audiophile Replay Gain Adjustment, user set.

See Section K.4 for an explanation of the calculation of each binary code.

The rest of the header information shows that this file is a standard 44.1kHz, 16bit, 2 channel wavefile (i.e. CD audio quality). It runs for 1 second, and is 176458 bytes long.


This extension to the .wav format has been tested for backwards compatibility using Winamp [Nullsoft, 2001] and Cool Edit Pro [Syntrillium, 2000]. Both pieces of software correctly play the .wav file, without incorrectly reading the new chunk as part of the audio data. Neither program understands the "rgad" chunk, and both simply ignore it.

K.7 Player Requirements

The player requirements are outlined below, and discussed in detail in the following sections.

SCALE AUDIO BY REPLAY GAIN ADJUSTMENT VALUE

The Player reads the Replay Gain value, and scales the audio data as appropriate.

PRE-AMP

Most users who only play pop music will find that the level has been reduced too far for their liking. An optional boost of 6dB-12dB should be included by default, otherwise users will reject the entire concept. Knowledgeable users, or those playing classical music, will disable this. Some may even choose to decrease the level. For user friendliness, this part should be referred to as the "pre-amp".

Figure K.8: Artist's impression of a possible Replay Gain compatible player and control panel (adapted from [Nullsoft, 2001])

CLIPPING PREVENTION

The player should, by default, apply hard limiting (NOT CLIPPING) to any signal peaks that would go over full scale after the above two operations. This should be user defeatable, so that audiophile users can choose to decrease the overall level to avoid clipping, rather than limiting the signal.


In practice, the first two adjustments can be carried out in a single step, where each sample is multiplied by a fixed amount. The clipping prevention is only required if the peak signal amplitude is above digital full scale after the first two steps.

Each stage of this processing will now be described in detail.

K.7.1 Scale audio by Replay Gain Adjustment value

K.7.1.1 Reading the Replay Gain

First, the player should determine whether the user requires "Radio" style level equalisation (all tracks the same loudness), or "Audiophile" style level equalisation (all tracks at "ideal" loudness). This option should be selectable in the Replay Gain control panel, and should default to "Radio".

Then the player should read the appropriate Replay Gain Adjustment value from the file header, and convert it back to its original dB value (see Section K.4, especially K.4.1.4, noting the factor of ten).

The player should also read (or calculate) the Peak amplitude. This is required for Clipping prevention, as described in Section K.7.3.

K.7.1.2 Scaling by the Replay Gain Adjustment

Changing the level of an audio signal simply means multiplying each sample value by a constant. This constant is given by:

scale = 10^(replay_gain_adjustment/20)

Or, in words: ten to the power of (the Replay Gain Adjustment divided by 20).

After any such operation, it is good practice to dither the result. If this calculation and the pre-amp are implemented separately, then dither should only be added to the final result, just before the result is truncated back to 16 bits. The bit-depth of the output should be equal to that of the sound card, not the original file. For example, after Replay Gain Adjustment, an 8-bit file should be sent to a 16-bit soundcard at 16-bit resolution.
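A minimal sketch of this step (again not from the thesis code; the variable names and the choice of TPDF dither are assumptions):

% Sketch: scale by the Replay Gain Adjustment, add dither at the
% 16-bit output level, then truncate to 16-bit integer values.
replay_gain_adjustment = -7.0;                     % dB, from the header
scale = 10^(replay_gain_adjustment/20);
out = inaudio*scale;                               % inaudio in the range +/-1
dither = (rand(size(out))-rand(size(out)))/32768;  % TPDF, +/-1 LSB at 16 bits
out16 = floor((out+dither)*32768);                 % quantise to 16-bit integers
out16 = max(min(out16,32767),-32768);              % keep within the 16-bit range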


K.7.1.3 No Replay Gain information

Disabling Replay Gain control for tracks without Replay Gain information would cause these tracks to be louder than the others, thus bringing back the original problem. If neither the "Radio" nor "Audiophile" Gain Adjustment is set, or if the track does not contain Replay Gain information, then the player should use an average of the previous ten Replay Gain Adjustments. This represents the typical loudness of tracks in the user's music collection, and is a much better estimate of the likely Replay Gain than 0 dB, or no adjustment at all.

If the file only contains one of the Replay Gain Adjustments (e.g. Audiophile) but the user has requested the other (Radio), then the player should use the one that is available (in this case, Audiophile).

K.7.2 Player Pre-amp

The SMPTE calibration level suggests that the average level of an audio track should be 20 dB below full scale. Some pop music is dynamically compressed to peak at 0 dB and average around -3 dB. Hence, when the Replay Gain is correctly set, the level of such tracks will be reduced by 17 dB. If the user is listening to a mixture of highly compressed and uncompressed tracks, then Replay Gain Adjustment will make the listening experience more pleasurable, by bringing the level of the compressed tracks down into line with the others. However, if the user is only listening to highly compressed music, then they are likely to complain that all their files are now too quiet.

To solve this problem, a Pre-amp should be incorporated into the player. This should take the form of an adjustment to the scale factor calculated in the previous section. The default value should be +6 dB, though some manufacturers may choose +9, +12 or +15 dB. A positive value is chosen so that casual users will find little change to the loudness of their compressed pop music (except that the occasional "problem" quiet track will now be as loud as the rest), while power users and audiophiles can reduce the Pre-amp gain to enjoy all their music without dynamic compression.

If the Pre-amp gain is set to a high value whilst listening to classical music (or nicely produced pop music), then the peaks will be compressed. However, this is the consistent policy of most radio stations, and many listeners like this sound.


K.7.2.1 Implementation

If Replay Gain Adjustment is enabled, the player should read the user selected pre-amp gain, and scale the audio signal by the appropriate amount. For example, a +6 dB gain requires a scale of 10^(6/20), which is approximately 2. The Replay Gain and Pre-amp scale factors can be multiplied together for simplicity and ease of processing.

K.7.3 Clipping Prevention

The signal may clip for three reasons.

1. In coded audio (e.g. mp3 files) a file that was hard-limited to digital full scale before encoding will often be pushed over the limit by the psychoacoustic compression. A decoder with headroom can recover the signal above full scale by reducing the gain (for example [Leslie, WEB]). Typical decoders simply allow the audio to clip.

2. Replay Gain will reduce the level of loud dynamically compressed tracks, and increase the level of quiet dynamically uncompressed tracks. The average levels will then be similar, but the quiet tracks will have higher peaks. If the user pushes the pre-amp gain to maximum (which would take highly compressed pop music back to its original level), then the peaks of the (originally) quieter material will be pushed well over full scale.

3. If a track has a very wide dynamic range, then the Replay Gain Adjustment may instruct the player to increase the level of the track such that it will clip. This will occur for tracks with very low average energy, but very high peak amplitude. The author would be interested to hear of any recordings which cause clipping following Replay Gain Adjustment, with the pre-amp gain set at zero.

There are two solutions to the problem of clipping. In situation 2 above, the user apparently wants all the music to sound very loud. To give them their wish, any signal which would peak above digital full scale should be limited at just below digital full scale. This is also useful at lower pre-amp gains, where it allows the average level of classical music to be raised to that of pop music, without introducing clipping distortion. This could be useful when making tapes for use in an automobile. The exact type of limiting/compression is not defined here, but something similar to the Hard Limiter found in Cool Edit Pro [Syntrillium, 2000] may be appropriate, especially for popular music.


The audiophile user will not want any compression or limiting to be applied to the signal. In this case, the only option is to reduce the pre-amp gain (so that the scaling of the digital signal is lower than that suggested by the Replay Gain Adjustment). In order to maintain the consistency of level between tracks, the pre-amp gain should remain at this reduced level for subsequent tracks.

K.7.3.1 Implementation

If the Peak Level is stored in the header of the file, it is trivial to calculate whether the signal will clip following the Replay Gain Adjustment and Pre-amp gain. If the signal will not clip, then no further action is necessary. If the signal will clip, then either the hard limiter should be enabled, or the pre-amp gain should be reduced accordingly before playing the track.
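A sketch of this check (hypothetical variable names; the thesis does not prescribe this exact code):

% Sketch: use the stored peak value to decide whether the track
% will clip after Replay Gain and pre-amp scaling.
total_scale = replaygain_scale*preamp_scale;
if peak*total_scale > 1
    if limiter_enabled
        % keep total_scale; hard-limit the scaled output instead
    else
        total_scale = 1/peak;  % reduce gain so the peak just reaches full scale
    end
end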

K.7.4 Hardware Solution

The above steps are appropriate for software players operating on the digital signal in order to scale it. However, it is possible to send the digital signal to the DAC without level correction, and to place an attenuator in the analogue signal path. The attenuator should be driven by the Replay Gain Adjustment value. This implementation would maintain maximum signal to noise ratio in the digital to analogue conversion stage.

K.8 Typical Results

The result of the ReplayGain Radio adjustment is demonstrated by an audio example included on the accompanying CD-ROM. Replay Gain processing was applied to 25 audio tracks, and 5-10 seconds from each track were compiled into two files. The first file contains excerpts from all tracks at their original level, whilst the second file contains the same excerpts after Replay Gain correction, with 6 dB Pre-amp gain and Hard Limiting (i.e. suggested default player settings).

The titles of the tracks, and the calculated Radio Replay Gains and Radio Replay Gain Adjustments, are shown in the following table. "Time" refers to the time at which an extract from that track appears in the audio example.


Time  Artist/CD – Track                             Replay Gain   Adjustment
0.00  The Tampera – Feel It (CD single)             67.0849 dB    -15.9151 dB
0.04  Alanis Morissette - Head Over Feet            71.0121 dB    -11.9879 dB
0.10  Celine Dion – Falling into you                70.5880 dB    -12.4120 dB
0.19  Oasis - Wonderwall                            68.0897 dB    -14.9103 dB
0.25  Telarc CD80251 - Goldfinger                   75.3096 dB    -07.6904 dB
0.37  Telarc CD80183 - Moon River                   87.9934 dB    +04.9934 dB
0.53  Telarc CD80221 - Brief Encounter              77.4630 dB    -05.5370 dB
1.15  Elvis - Fever                                 80.7907 dB    -02.2593 dB
1.23  Oleta Adams - Get Here                        76.0916 dB    -06.9084 dB
1.32  HFN+RR test CD 2 - Track 01 (quiet speech)    92.8158 dB    +09.8158 dB
1.38  HFN+RR test CD 2 - Track 03 (pop music)       80.7753 dB    -02.2247 dB
1.46  HFN+RR test CD 2 - Track 19 (classical)       77.6900 dB    -05.3100 dB
1.54  HFN+RR test CD 2 - Track 20 (classical)       80.7789 dB    -02.2211 dB
2.01  HFN+RR test CD 2 - Track 22 (cannon)          101.1475 dB   +18.1475 dB
2.05  Faure - Pavane                                77.8484 dB    -05.1516 dB
2.17  1812 Overture - Mercury Recording             70.5373 dB    -12.4627 dB
2.25  Radio show (music - Basement Jaxx)            70.7456 dB    -12.2544 dB
2.33  Radio show (DJ speech over music)             69.4228 dB    -13.5772 dB
2.41  SQAM CD - Track 50 (Male Speech)              75.0974 dB    -07.9026 dB
2.45  SQAM CD - Track 55 (Haydn Trumpet extract)    77.2643 dB    -05.7357 dB
2.53  SQAM CD - Track 56 (Handel Organ extract)     75.0690 dB    -07.9310 dB
3.04  SQAM CD - Track 57 (Bach Organ Extract)       91.5487 dB    +08.5487 dB
3.11  SQAM CD - Track 58 (Sarasate Guitar extract)  81.7892 dB    -01.2108 dB
3.19  SQAM CD - Track 59 (Ravel Violin extract)     80.8753 dB    -02.1247 dB
3.24  SQAM CD - Track 60 (Schubert Piano extract)   80.1139 dB    -02.8861 dB

Table K.1: Replay Gain values for a selection of audio tracks

K.8.1 Discussion

These tracks were processed by ReplayGain on an individual track-by-track basis, giving the "Radio" level equalisation. The intention is to make all the tracks sound equally loud, and this is achieved. There are only two problem tracks; the 1812 overture sounds too quiet when brought into line with the other tracks, and the Bach organ extract sounds ridiculous when this small organ is raised to the same loudness as the large pipe organ preceding it. However, these problems arise because the radio equalisation is operating exactly as intended, rather than because of any fault in the algorithm itself. If the user wishes to hear the tracks at ideal, rather than equalised loudness, they should select the Audiophile Replay Gain Adjustment instead.

The hard limiting on the cannon shot (which was the only track on test which required hard limiting after Replay Gain processing and +6 dB Pre-amp gain) changes the sound significantly. Some listeners would dislike this effect, though others may be impressed by it. If the listener wishes to play tracks with such a large dynamic range without compression, their only option is to disable the hard limiting and pre-amp, as suggested for audiophile listeners.

Overall, the ReplayGain adjustment has corrected for the large differences in perceived loudness between the original audio tracks. This demonstrates that the algorithm operates correctly. User feedback from an initial release of this software has indicated that this process is very useful, and urgently sought by many users who are irritated by the problems described in the introduction.

K.9 Further work

The major obstacle facing this proposal is the lack of hardware and software support. At the time of writing, this proposal has been in the public domain for less than three months. However, even at this early stage, two programs support the Replay Gain standard: the latest release of the LAME open source mp3 encoder [Taylor, WEB] calculates the Radio and Audiophile Replay Gain Adjustments, and stores these values within a custom header added to every mp3 file it produces. In addition, a command line tool has been written to calculate and apply the Radio Replay Gain Adjustment directly to an mp3 file. This allows mp3 files to be replayed at equalised loudness using existing players without Replay Gain support. It is hoped that player support will follow shortly.

It is sincerely hoped that the acceptance of this proposal within the PC audio world will embarrass the audio industry into embracing a similar concept for consumer audio formats. Whilst the ReplayGain algorithm itself may be updated or superseded, the inclusion of meta data specifying the Replay Gain is long overdue, and deserves urgent consideration.

K.10 Conclusion

In this paper, it has been suggested that the ideal replay gain for every audio track should be stored as meta data within the track itself. The SMPTE RP 200 standard is suggested as the calibration reference. Two Replay Gain Adjustments have been defined relative to this reference; "Radio", which causes all tracks to be reproduced at the same loudness, and "Audiophile", which causes all tracks to be reproduced at an ideal loudness. An algorithm has been proposed for calculating the perceived loudness of an audio track, and hence the Radio Replay Gain Adjustment for that track. New header components have been suggested for two file formats in order to store the Replay Gain Adjustment values alongside the audio data. The required behaviour of players wishing to implement the Replay Gain standard has been defined. Finally, the ReplayGain algorithm has been demonstrated to work as designed, successfully correcting unwanted loudness changes between discs.

K.11 Acknowledgements

Sincere thanks to Bob Katz, for inspiring this proposal with his papers on mastering, and for his suggestions regarding the two Replay Gain Adjustments. Many thanks to Glen Sawyer, who coded the first compiled version of ReplayGain. In doing so, he allowed others to use the algorithm, bringing it to a wider audience than the author's MATLAB scripts would have allowed.

References

This list of references contains some URL pointers to information available via the World Wide Web. The author is aware that information on the web can be of a transitory nature. For this reason, web references are backed up with appropriate written references, where such material is available. If a web reference is given first in the text, the web page presents material that is more recent or more detailed than is available in the printed literature at the time of writing. Where a web reference is given second in the text, the web page presents material that is readily available in the printed literature, but in a new or concise manner. For example, [Neely, WEB] includes animations of processes within the cochlea. Whilst these processes are discussed in several printed sources, the animation may aid the reader's understanding.

Aladdin Systems (WEB). StuffIt - The complete zip and sit compression solution. http://www.stuffit.com/
Alcántara, J. I.; Moore, B. C. J.; and Vickers, D. A. (2000). The relative role of beats and combination tones in determining the shapes of masking patterns at 2 kHz: I. Normal-hearing listeners. Hearing Research, vol. 148, Oct., pp. 63-73.
Allen, J. B.; and Neely, S. T. (1992). Micromechanical models of the cochlea. Physics Today, vol. 45, July, pp. 40-47.
Anon. (WEB). The Auditory Central Nervous System. http://serous.med.buffalo.edu/hearing/
Ashland, M. T. (WEB). Monkey's Audio - a fast and powerful lossless audio compressor. http://www.monkeysaudio.com/
Battaeu, D. W. (1967). The role of the pinna in human localization. Proceedings of the Royal Society of London, Series B, vol. 168, pp. 158-180.
Bauer, B. B. (1961). Phasor analysis of some stereophonic phenomena. Journal of the Acoustical Society of America, vol. 33, no. 11, Dec., pp. 1536-1539.


Beerends, J. G.; and Stemerdink, J. A. (1992). A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation. Journal of the Audio Engineering Society, vol. 40, no. 12, Dec., pp. 963-978.
Bernfield, B. (1973). Attempts for better understanding of the directional stereophonic listening mechanism. preprint C-4, presented at the 44th convention of the Audio Engineering Society in Rotterdam, Feb.
Bernstein, R. S.; and Raab, D. H. (1990). The effect of bandwidth on the detectability of narrow- and wideband signals. Journal of the Acoustical Society of America, vol. 88, no. 5, Nov., pp. 2115-2125.
Boerger, G. (1965). Die Lokalisation von Gausstönen. Dissertation, Technische Universität, Berlin.
Bosi, M.; Brandenburg, K.; Quackenbush, S.; Fielder, L.; Akagiri, K.; Fuchs, H.; and Dietz, M. (1997). ISO/IEC MPEG-2 Advanced Audio Coding. Journal of the Audio Engineering Society, vol. 45, no. 10, Oct., pp. 789-814.
Bower, A. J. (1998). DIGITAL RADIO – The Eureka 147 DAB System. Electronic Engineering, April, pp. 55-56.
Brandenburg, K. (1999). MP3 and AAC Explained. paper 17-009, presented at the 17th International Conference of the Audio Engineering Society: High-Quality Audio Coding, Sept.
Brandenburg, K.; and Bosi, M. (1997). Overview of MPEG Audio: Current and Future Standards for Low Bit-Rate Audio Coding. Journal of the Audio Engineering Society, vol. 45, no. 1/2, pp. 4-21.
Brungart, D. S.; Durlach, N. I.; and Rabinowitz, W. M. (1999). Auditory localization of nearby sources. II. Localization of a broadband source. Journal of the Acoustical Society of America, vol. 106, no. 4, pt. 1, Oct., pp. 1956-1968.
Buschmann, A. (WEB). Information on lossy audiocoding and the MPEGplus-project. http://www.stud.uni-hannover.de/~andbusch/audiocoder_eng.html
Carlile, S.; and Wardman, D. (1996). Masking produced by broadband noise presented in virtual auditory space. Journal of the Acoustical Society of America, vol. 100, no. 6, Dec., pp. 3761-3768.
Carterette, E. C. (1978). Some Historical Notes on Research in Hearing. in Carterette, E. C.; and Friedman, M. P., Eds., Handbook of Perception, Volume IV, Hearing (Academic Press, New York). Chapter 1, pp. 3-34.


Colburn, H. S. (1973). Theory of binaural interaction based on auditory-nerve data. I. General strategy and preliminary results on interaural discrimination. Journal of the Acoustical Society of America, vol. 54, no. 6, pp. 1458-1470.
Colburn, H. S. (1977). Theory of binaural interaction based on auditory-nerve data. II. Detection of tones in noise. Journal of the Acoustical Society of America, vol. 61, no. 2, Feb., pp. 525-533.
Colburn, H. S.; and Durlach, N. I. (1965). Time-intensity relations in binaural unmasking. Journal of the Acoustical Society of America, vol. 38, pp. 93-103.
Colburn, H. S.; and Durlach, N. I. (1978). Models of Binaural Interaction. in Carterette, E. C.; and Friedman, M. P., Eds., Handbook of Perception, Volume IV, Hearing (Academic Press, New York). Chapter 11, pp. 467-518.
Colomes, C.; Lever, M.; Rault, J. B.; and Dehery, Y. F. (1995). A Perceptual Model Applied to Audio Bit-Rate Reduction. Journal of the Audio Engineering Society, vol. 43, no. 4, April, pp. 233-240.
Couvreur, C. (1997). Environmental Sound Recognition: A Statistical Approach. PhD Thesis, Faculte Polytechnique de Mons, Mons, Belgium, June 1997.
Culling, J. F.; and Summerfield, Q. (1998). Measurements of the Binaural temporal window using a detection task. Journal of the Acoustical Society of America, vol. 103, no. 6, June, pp. 3540-3553.
Dallos, P. (1978). Biophysics of the Cochlea. in Carterette, E. C.; and Friedman, M. P., Eds., Handbook of Perception, Volume IV, Hearing (Academic Press, New York). Chapter 4, pp. 125-162.
Dau, T.; Püschel, D.; and Kohlrausch, A. (1996a). A quantitative model of the "effective" signal processing in the auditory system. I. Model structure. Journal of the Acoustical Society of America, vol. 99, no. 6, June, pp. 3615-3622.
Dau, T.; Püschel, D.; and Kohlrausch, A. (1996b). A quantitative model of the "effective" signal processing in the auditory system. II. Simulations and measurements. Journal of the Acoustical Society of America, vol. 99, no. 6, June, pp. 3623-3631.
Dietz, M.; Herre, J.; Teichmann, B.; and Brandenburg, K. (1997). Bridging the Gap: Extending MPEG Audio Down to 8 kbit/s. preprint 4508, presented at the 102nd convention of the Audio Engineering Society, March 1997.
Doll, T. J.; and Hanna, T. E. (1995). Spatial and Spectral Release from Masking in Three-Dimensional Auditory Displays. Human Factors, vol. 37, Feb., pp. 341-355.


Durlach, N. I.; and Colburn, H. S. (1978). Binaural Phenomena. in Carterette, E. C.; and Friedman, M. P., Eds., Handbook of Perception, Volume IV, Hearing (Academic Press, New York). Chapter 10, pp. 365-466.
EBU (1988). SQAM: Sound quality assessment material – Recordings for subjective tests. Compact Disc, Cat No 422 204-2. Geneva: European Broadcasting Union.
EBU (2000). MUSHRA – EBU Method for the Subjective Listening Tests of Intermediate Audio Quality. Draft EBU Recommendation B/AIM 022 (Rev.8)/BMC 607rev, January. Geneva: European Broadcasting Union.
Ehret, G.; and Romand, R. (1997). The Central Auditory System. (Oxford University Press, New York).
Fantini, D. (1996). Perception of Complex Sounds, Speech and Music. PS454 course notes, Department of Psychology, University of Essex.
Farrow, J. M. (WEB). The Complete Works of Shakespeare. http://www.cs.usyd.edu.au/~matty/Shakespeare/
Foo, K. C. K.; Hawksford, M. O. J.; and Hollier, M. P. (1998). Three-Dimensional Sound Localization with Multiple Loudspeakers using a Pair-Wise Association Paradigm and Embedded HRTFs. preprint 4745, presented at the 104th convention of the Audio Engineering Society in Amsterdam, May.
Gardner, B.; and Martin, K. D. (1994). HRTF Measurements of a KEMAR Dummy-Head Microphone. Perceptual Computing Technical Report #280, MIT Media Lab Machine Listening Group. Also http://sound.media.mit.edu/KEMAR.html
Gehr, S. E.; and Sommers, M. S. (1999). Age difference in backward masking. Journal of the Acoustical Society of America, vol. 106, no. 5, Nov., pp. 2793-2799.
Gerzon, M. A. (1991). Problems of Error-Masking in Audio Systems. preprint 3013, presented at the 90th convention of the Audio Engineering Society in Paris, Feb.
Gerzon, M. A.; Craven, P. G.; Stuart, J. R.; Law, M. J.; and Wilson, R. J. (1999). The MLP System. paper 17-006, presented at the Audio Engineering Society 17th International Conference: High-Quality Audio Coding, September 1999.
Giguère, C.; Smoorenburg, G. F.; and Kunov, H. (1997). The generation of psychoacoustic combination tones in relation to two-tone suppression effects in a computational model. Journal of the Acoustical Society of America, vol. 102, no. 5, pt. 1, Nov., pp. 2821-2830.


Gilkey, R. H.; and Good, M. D. (1995). Effects of Frequency on Free-Field Masking. Human Factors, vol. 37, April, pp. 835-843.
Goldstein, J. L. (1967). Auditory Nonlinearity. Journal of the Acoustical Society of America, vol. 41, no. 3, pp. 676-689.
Grantham, D. W. (1995). Spatial Hearing and Related Phenomena. in Moore, B. C. J., Ed., Hearing (Academic Press, London).
Grantham, D. W.; and Wightman, F. L. (1979). Detectability of a pulsed tone in the presence of a masker with time-varying interaural correlation. Journal of the Acoustical Society of America, vol. 65, pp. 1509-1517.
Green, D. M. (1976). An Introduction to Hearing. (Lawrence Erlbaum, Hillsdale, NJ).
Green, D. M.; McKey, M. J.; and Licklider, J. C. R. (1959). Detection of a Pulsed Sinusoid in Noise as a Function of Frequency. Journal of the Acoustical Society of America, vol. 31, pp. 1446-1452.
Green, D. M.; and Swets, J. A. (1966). Signal Detection Theory and Psychophysics. (Wiley, New York).
Hall, J. L. (1972). Auditory distortion products f2-f1 and 2f1-f2. Journal of the Acoustical Society of America, vol. 51, pp. 1863-1871.
Hammershøi, D.; Møller, H.; and Sørensen, M. F. (1992). Head-Related Transfer Functions: Measurements on 40 Human Subjects. preprint 3289, presented at the 92nd Convention of the Audio Engineering Society in Vienna, March.
Helmholtz, J. E. (1870). Die Lehre von den Tonempfindungen, als physiologische Grundlage für die Theorie der Musik. Dritte Ausgabe. (von Friedrich Vieweg und Sohn, Braunschweig, 1870).
Hollier, M. P. (1996). Data Reduction – A series of 3 lectures. Course notes, M.Sc. Audio Systems Engineering, University of Essex.
Hollier, M. P.; Hawksford, M. O. J.; and Guard, D. R. (1993). Characterization of Communications Systems Using a Speechlike Test Stimulus. Journal of the Audio Engineering Society, vol. 41, no. 12, Dec., pp. 1008-1021.
Hollier, M. P.; Hawksford, M. O. J.; and Guard, D. R. (1995). Algorithms for Assessing the Subjectivity of Perceptually Weighted Audible Errors. Journal of the Audio Engineering Society, vol. 43, no. 12, Dec., pp. 1041-1045.
Hollier, M. P.; and Cosier, G. (1996). Assessing human perception. BT Technology Journal, vol. 14, Jan., pp. 206-215.


IBM and Microsoft (1991). Multimedia Programming Interface and Data Specifications 1.0. Issued as a joint design by IBM Corporation and Microsoft Corporation, August 1991. http://www.seanet.com/Users/matts/riffmci/riffmci.htm
Irino, T.; and Patterson, R. D. (1997). A time-domain, level-dependent auditory filter: The gammachirp. Journal of the Acoustical Society of America, vol. 101, no. 1, Jan., pp. 412-419.
IEC/CD 1672 (1996). Electroacoustics - Sound Level Meters. International Electrotechnical Commission, Geneva, Nov. 1996.
IEC 60027-2 (2000). Letter symbols to be used in electrical technology – Part 2: Telecommunications and electronics. Second Edition. International Electrotechnical Commission, Geneva, Nov. 2000. See also http://physics.nist.gov/cuu/Units/binary.html
ISO/IEC 11172-3 (1993). Information technology – Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 3: Audio. Geneva: International Organisation for Standardization.
ISO/IEC 13818-3 (1998). Information technology – Generic coding of moving pictures and associated audio information – Part 3: Audio. Geneva: International Organisation for Standardization.
ISO/IEC 13818-7 (1997). Information technology – Generic coding of moving pictures and associated audio information – Part 7: Advanced Audio Coding (AAC). Geneva: International Organisation for Standardization.
ISO 389-7 (1996). Acoustics - Reference zero for the calibration of audiometric equipment. Part 7: Reference threshold of hearing under free-field and diffuse-field listening conditions. Geneva: International Organisation for Standardization, 1996.
ITU-R BS.468-4 (1986). Measurement of audio-frequency noise voltage level in sound broadcasting. Geneva: International Telecommunication Union.
ITU-R BS.1116-1 (1997). Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems. Geneva: International Telecommunication Union.
ITU-R BS.1284 (1997). Methods for the subjective assessment of sound quality - General requirements. Geneva: International Telecommunication Union.
ITU-R BS.[DOC. 6/106] (2001). Method for the subjective assessment of intermediate audio quality. Draft new recommendation from Radiocommunication Study Group 6. Geneva: International Telecommunication Union.


Jeffress, L. A. (1948). A place theory of sound localization. Journal of Comparative and Physiological Psychology, vol. 41, pp. 35-39.
Johnson, D. H. (WEB). Auditory Neuroscience: How is Sound Processed by the Brain? http://www-ece.rice.edu/~dhj/neuro.html
Johnston, J. D. (1988a). Estimation of Perceptual Entropy Using Noise Masking Criteria. ICASSP, A1.9, pp. 2524-2527.
Johnston, J. D. (1988b). Transform Coding of Audio Signals Using Perceptual Noise Criteria. IEEE Journal on Selected Areas in Communications, vol. 6, Feb., pp. 314-323.
Katz, B. (2000). Integrated Approach to Metering, Monitoring, and Leveling Practices, Part 1: Two-Channel Metering. Journal of the Audio Engineering Society, vol. 48, no. 9, Sept., pp. 800-809. See also http://www.digido.com/integrated.html
Kemp, D. T. (1979). Evidence of mechanical nonlinearity and frequency selective wave amplification in the cochlea. Arch. Otorhinolaryngol., vol. 224, pp. 37-45.
Kemp, D. T. (1980). Towards a model for the origin of Cochlea echoes. Hearing Research, vol. 2, pp. 533-548.
Kidd, G.; Mason, C. R.; Rohtla, T. L.; and Deliwala, P. S. (1998). Release from masking due to spatial separation of sources in the identification of non-speech auditory patterns. Journal of the Acoustical Society of America, vol. 104, no. 1, July, pp. 422-431.
Klumpp, R. G.; and Eady, H. R. (1956). Some Measurements of Interaural Time Difference Thresholds. Journal of the Acoustical Society of America, vol. 28, no. 5, Sept., pp. 859-860.
Leslie, R. (WEB). MAD: MPEG Audio Decoder. http://www.mars.org/home/rob/proj/mpeg/
Levitt, H. (1971). Transformed Up-Down Methods in Psychoacoustics. Journal of the Acoustical Society of America, vol. 49, no. 2, pt. 2, pp. 467-477.
Liebchen, T. (WEB). LPAC - Lossless Predictive Audio Compression. http://www-ft.ee.tu-berlin.de/~liebchen/lpac.html
Macpherson, E. A. (1989). A Computer Model of Binaural Localization for Stereo Imaging Measurement. preprint 2866, presented at the 87th Convention of the Audio Engineering Society.
Macpherson, E. A. (1991). A Computer Model of Binaural Localization for Stereo Imaging Measurement. Journal of the Audio Engineering Society, vol. 39, no. 9, Sept., pp. 604-622.


McFadden, D. (1968). Masking-level differences determined with and without interaural disparities in masker intensity. Journal of the Acoustical Society of America, vol. 44, pp. 212-223.
Mears, D.; Watanabe, K.; and Scheirer, E. (1998). Report on the MPEG-2 AAC Stereo Verification Tests. ISO/IEC JTC1/SC29/WG11, N2006, Feb.
Meddis, R. (1986). Simulation of mechanical to neural transduction in the auditory receptor. Journal of the Acoustical Society of America, vol. 79, no. 3, March, pp. 702-711.
Meddis, R. (1988). Simulation of auditory-neural transduction: Further studies. Journal of the Acoustical Society of America, vol. 83, no. 3, March, pp. 1056-1063.
Meddis, R.; Hewitt, M. J.; and Shackleton, T. M. (1990). Implementation details of a computational model of the inner hair-cell/auditory-nerve synapse. Journal of the Acoustical Society of America, vol. 87, no. 4, April, pp. 1813-1816.
Mehrgardt, S.; and Mellert, V. (1977). Transformation characteristics of the external human ear. Journal of the Acoustical Society of America, vol. 61, no. 6, pp. 1567-1576.
Meilgaard, M. C.; Civille, G. V.; and Carr, T. B. (1999). Sensory Evaluation Techniques, third edition. (CRC Press, Boca Raton).
Mills, A. W. (1958). On the Minimum Audible Angle. Journal of the Acoustical Society of America, vol. 30, no. 4, April, pp. 237-246.
Mills, A. W. (1972). Auditory localization. in Tobias, J. V., Ed., Foundations of Modern Auditory Theory, Vol. II (Academic Press, New York).
Miyaguchi, D. (WEB). Group Listening Tests of Various Formats at 128 kbit/s. http://fastforward.iwarp.com/128tests.html
Møller, H.; Hammershøi, D.; Jensen, C. B.; and Sørensen, M. F. (1995a). Transfer Characteristics of Headphones Measured on Human Ears. Journal of the Audio Engineering Society, vol. 43, no. 4, April, pp. 203-217.
Møller, H.; Sørensen, M. F.; Hammershøi, D.; and Jensen, C. B. (1995b). Head-Related Transfer Functions of Human Subjects. Journal of the Audio Engineering Society, vol. 43, no. 5, May, pp. 300-321.
Moore, B. C. J. (1995). Frequency Analysis and Masking. in B. C. J. Moore, Ed., Hearing (Academic Press, San Diego, California), pp. 161-205.
Moore, B. C. J.; Alcántara, J. I.; and Dau, T. (1998). Masking patterns for sinusoidal and narrow-band noise maskers. Journal of the Acoustical Society of America, vol. 104, no. 2, pt. 1, Aug., pp. 1023-1038.


Moore, B. C. J.; and Glasberg, B. R. (1987). Formulae describing frequency selectivity as a function of frequency and level and their use in calculating excitation patterns. Hearing Research, vol. 28, pp. 209-225.
Moore, B. C. J.; Glasberg, B. R.; and Baer, T. (1997). A Model for the Prediction of Thresholds, Loudness, and Partial Loudness. Journal of the Audio Engineering Society, vol. 45, no. 4, April, pp. 224-240.
Moore, G. E. (1965). Cramming More Components onto Integrated Circuits. Electronics, vol. 38, April, pp. 114-117.
Nandy, D.; and BenArie, J. (1996). An auditory localization model based on high-frequency spectral cues. Annals of Biomedical Engineering, vol. 24, no. 6, pp. 621-638.
Neely, S. T. (WEB). Cochlea Mechanics. Communication Engineering Laboratory, Boys Town National Research Hospital. http://www.boystown.org/Btnrh/cel/cochmech.htm
Nilsson, M. (WEB). ID3v2 informal standard – The audience is informed. http://www.id3v2.org/
Nullsoft (2001). Winamp audio player v2.73. Nullsoft Inc., 375 Alabama St. Ste. 350, San Francisco, CA, USA. http://www.winamp.com/
Osman, E. (1971). A correlation model of binaural masking level differences. Journal of the Acoustical Society of America, vol. 50, pp. 1494-1511.
Paillard, B.; Mabilleau, P.; Morissette, S.; and Soumagne, J. R. (1992). PERCEVAL: Perceptual Evaluation of the Quality of Audio Signals. Journal of the Audio Engineering Society, vol. 40, no. 1/2, Jan./Feb., pp. 21-31.
Patterson, R. D.; Allerhand, M. H.; and Giguère, C. (1995). Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform. Journal of the Acoustical Society of America, vol. 98, no. 4, Oct., pp. 1890-1894.
Patterson, R. D.; Holdsworth, J.; and Allerhand, M. (1992a). Auditory models as preprocessors for speech recognition. in Schouten, M. E. H., Ed., The Auditory Processing of Speech: From the Auditory Periphery to Words (Mouton de Gruyter, Berlin), pp. 67-83.
Patterson, R. D.; Robinson, K.; Holdsworth, J.; McKeown, D.; Zhang, C.; and Allerhand, M. (1992b). Complex sounds and auditory images. in Cazals, Y.; Demany, L.; and Horner, K., Eds., Auditory Physiology and Perception (Pergamon, Oxford), pp. 429-446.


Patterson, R. D. (WEB). Description of Research. Centre for Neural Basis of Hearing, Physiology Department, University of Cambridge.
Patterson, R. D. (WEB-1). The auditory pathway from the cochlea to cortex. Centre for Neural Basis of Hearing, Physiology Department, University of Cambridge.
Patterson, R. D. (WEB-2). From the cochlea to the IC: The Cochlea. Centre for Neural Basis of Hearing, Physiology Department, University of Cambridge.
Patterson, R. D. (WEB-3). From the cochlea to the IC: Cochlear Nucleus & Superior Olivary Complex. Centre for Neural Basis of Hearing, Physiology Department, University of Cambridge.
Penner, M. J. (1977). Detection of temporal gaps in noise as a measure of the decay of auditory sensation. Journal of the Acoustical Society of America, vol. 61, no. 2, Feb., pp. 552-557.
Perrott, D. R.; and Pacheco, S. (1989). Minimum audible angle thresholds for broadband noise as a function of delay between the onset of the lead and lag signals. Journal of the Acoustical Society of America, vol. 85, no. 6, June, pp. 2669-2672.
PKWARE (WEB). Genuine PKZIP Products. http://www.pkware.com/
Plack, C. J.; and Carlyon, R. P. (1995). Loudness Perception and Intensity Coding. in B. C. J. Moore, Ed., Hearing (Academic Press, San Diego, California), pp. 123-160.
Popelka, G. R.; Osterhammel, P. A.; Nielsen, L. H.; and Rasmussen, A. N. (1993). Growth of distortion product otoacoustic emission with primary-tone level in humans. Hearing Research, vol. 71, pp. 12-22.
Probst, R.; Lonsbury-Martin, B. L.; and Martin, G. K. (1991). A review of otoacoustic emissions. Journal of the Acoustical Society of America, vol. 89, no. 5, May, pp. 2027-2067.
Puria, S.; Peake, W. T.; and Rosowski, J. J. (1997). Sound-pressure measurements in the cochlear vestibule of human-cadaver ears. Journal of the Acoustical Society of America, vol. 101, no. 5, pt. 1, May, pp. 2754-2770.
Püschel, D. (1988). Prinzipien der zeitlichen Analyse beim Hören. Ph.D. thesis, University of Göttingen.
Putland, G. (1994). Acoustical properties of air versus temperature and pressure. Journal of the Audio Engineering Society, vol. 42, no. 11, Nov., pp. 927-933.

Reed, C. M.; and Bilger, R. C. (1973). A comparative study of S/N0 and E/N0. Journal of the Acoustical Society of America, vol. 53, no. 4, April, pp. 1039-1044.

Rimell, A.; and Hawksford, M. O. (1996). From the Cone to the Cochlea: Modelling the Complete Acoustical Path. preprint 4240, presented at the 100th Convention of the Audio Engineering Society in Copenhagen, May.

Rimell, A. (1996). Psychoacoustic foundations. in Reduction of loudspeaker polar response aberrations through the application of psychoacoustic error concealment, Ph.D. thesis, Department of Electronic Systems Engineering, University of Essex.

Robert, A.; and Eriksson, J. L. (1999). A composite model of the auditory periphery for simulating responses to complex sounds. Journal of the Acoustical Society of America, vol. 106, no. 4, pt. 1, Oct., pp. 1852-1864.

Robinson, D. W.; and Dadson, R. S. (1956). A re-determination of the equal-loudness relations for pure tones. British Journal of Applied Physics, vol. 7, May, pp. 166-177.

Robinson, D. J. M.; and Greenwood, R. G. (1998). A Binaural simulation which renders out of head localisation with low cost digital signal processing of Head Related Transfer Functions and pseudo reverberation. preprint 4723, presented at the 104th Convention of the Audio Engineering Society in Amsterdam, May.

Robinson, D. J. M.; and Hawksford, M. O. J. (1999). Time-domain auditory model for the assessment of high-quality coded audio. preprint 5017, presented at the 107th Convention of the Audio Engineering Society, New York, Sept.

Robinson, D. J. M.; and Hawksford, M. O. J. (2000). Psychoacoustic models and non-linear human hearing. preprint 5228, presented at the 109th Convention of the Audio Engineering Society, Los Angeles, Sept. Reproduced in Appendix x.

Rosen, S.; and Howell, P. (1991). Signals and Systems for Speech and Hearing. (Academic Press, London and San Diego).

Saberi, K.; Dostal, L.; Sadralodabai, T.; Bull, V.; and Perrott, D. R. (1991). Free-field release from masking. Journal of the Acoustical Society of America, vol. 90, no. 3, Sept., pp. 1355-1370.

Santon, F. (1987). “Détection d’un son pur dans un bruit masquant suivant l’angle d’incidence du bruit. Relation avec le seuil de réception de la parole.” (Detection of a Pure Sound in the Presence of Masking Noise, and its Dependence on the Angle of Incidence of the Noise). Acustica, vol. 63, pp. 222-228.

Sayers, B. M.; and Cherry, E. C. (1957). Mechanism of binaural fusion in the hearing of speech. Journal of the Acoustical Society of America, vol. 29, pp. 973-987.

Schroeder, M. R.; Atal, B. S.; and Hall, J. L. (1979). Optimizing digital speech coders by exploiting masking properties of the human ear. Journal of the Acoustical Society of America, vol. 66, pp. 1647-1652.

Schubert, E. D. (1978). History of Research on Hearing. in Carterette, E. C.; and Friedman, M. P., Eds., Handbook of Perception, Volume IV, Hearing (Academic Press, New York), pp. 41-80.

Singleton, H. (WEB). A-, B-, and C-Weighting functions. http://www.cross-spectrum.com/audio/newgifs/weighting_1.gif

Smoorenburg, G. F. (1972). Combination tones and their origin. Journal of the Acoustical Society of America, vol. 52, no. 2, pt. 2, pp. 615-632.

SMPTE RP 200 (1999). Relative and Absolute Sound Pressure Levels for Motion-Picture Multichannel Sound Systems. Society of Motion Picture and Television Engineers, Recommended Practices document.

Snedecor, G. W.; and Cochran, W. G. (1989). Statistical Methods, Eighth Edition. (Iowa State University Press).

Soderquist, D. R.; and Lindsey, W. L. (1972). Physiological noise as a masker of low frequencies: The cardiac cycle. Journal of the Acoustical Society of America, vol. 52, no. 4, pp. 1216-1220.

Soulodre, G. A.; Grusec, T.; Lavoie, M.; and Thibault, L. (1998). Subjective Evaluation of State-of-the-Art Two-Channel Audio Codecs. Journal of the Audio Engineering Society, vol. 46, no. 3, Mar., pp. 164-177.

Stevens, S. S. (1935). The relation of pitch to intensity. Journal of the Acoustical Society of America, vol. 6, pp. 150-154.

Stoll, G.; and Kozamernik, F. (2000). EBU Listening tests on internet audio codecs. EBU Technical Review, no. 283, June.

Strube, H. W. (1985). A computationally efficient basilar-membrane model. Acustica, vol. 58, pp. 207-214.

Stuart, J. R. (1990). High Quality Digital Audio. Proceedings of the Institute of Acoustics, vol. 12, pp. 1-15.

Syntrillium (2000). Cool Edit Pro version 1.2a. Syntrillium Software Corporation, P.O. Box 62255, Phoenix, AZ 85082-2255, USA. http://www.syntrillium.com/

Taylor, M. (WEB). LAME – LAME Ain’t an MP3 Encoder. http://www.mp3dev.org/mp3/

Terhardt, E. (1979). Calculating Virtual Pitch. Hearing Research, vol. 1, pp. 318-362.

Thiede, T.; Treurniet, W. C.; Bitto, R.; Schmidmer, C.; Sporer, T.; Beerends, J. G.; Colomes, C.; Keyhl, M.; Stoll, G.; Brandenburg, K.; and Feiten, B. (2000). PEAQ – The ITU Standard for Objective Measurement of Perceived Audio Quality. Journal of the Audio Engineering Society, vol. 48, no. 1/2, Jan./Feb., pp. 3-29.

Theiss, B.; and Hawksford, M. O. J. (1999). Auditory Model for Lateralization and the Precedence Effect. preprint 5048, presented at the 107th Convention of the Audio Engineering Society, New York, Sept.

Traunmüller, H. (1990). Analytical expressions for the tonotopic sensory scale. Journal of the Acoustical Society of America, vol. 88, no. 1, July, pp. 97-100.

Van den Berghe, R. (WEB-1). Archive Quality Listening test. http://users.belgacom.net/gc247244/extra/AQ.htm (see also appendix x)

Van den Berghe, R. (WEB-2). LAME Tag rev 0 specification - draft 10. http://users.belgacom.net/gc247244/extra/tag.html

von Békésy, G. (1960). Experiments in Hearing. (McGraw-Hill, New York).

WinZip Computing (WEB). WinZip Home Page. http://www.winzip.com/

Yates, G. K. (1995). Cochlear Structure and Function. in Moore, B. C. J., Ed., Hearing (Academic Press, San Diego, California), pp. 41-74.

Young, H. D. (2000). University Physics – tenth edition. (Addison-Wesley).

Zhou, B. (1995). Auditory Filter Shapes at High Frequencies. Journal of the Acoustical Society of America, vol. 98, no. 4, Oct., pp. 1935-1942.

Zwicker, E. (1961). Subdivision of the audible frequency range into critical bands (Frequenzgruppen). Journal of the Acoustical Society of America, vol. 33, no. 3, p. 248.

Zwicker, E. (1979). Different behaviour of quadratic and cubic difference tones. Hearing Research, vol. 1, pp. 283-292.

Zwicker, E. (1981). Formulae for calculating the psychoacoustical excitation level of aural difference tones measured by the cancellation method. Journal of the Acoustical Society of America, vol. 69, no. 5, May, pp. 1410-1413.

Zwicker, E. (1984). Dependence of post-masking on masker duration and its relation to temporal effects in loudness. Journal of the Acoustical Society of America, vol. 75, no. 1, Jan., pp. 219-223.

Zwicker, E.; and Fastl, H. (1973). Cubic difference sounds measured by threshold- and compensation-method. Acustica, vol. 29, pp. 336-343.

Zwicker, E.; and Fastl, H. (1990). Psychoacoustics – Facts and Models. (Springer, Berlin).

Zwicker, E.; and Feldtkeller, R. (1967). Das Ohr als Nachrichtenempfänger (The Ear as an Information Receiver). (Hirzel Verlag, Stuttgart).

Zwicker, E.; and Terhardt, E. (1980). Analytical expressions for critical-band rate and critical bandwidth as a function of frequency. Journal of the Acoustical Society of America, vol. 68, no. 5, Nov., pp. 1523-1525.

Zwicker, E.; and Zwicker, U. T. (1991). Audio Engineering and Psychoacoustics: Matching Signals to the Final Receiver, the Human Auditory System. Journal of the Audio Engineering Society, vol. 39, no. 3, March, pp. 115-126.

Zwislocki, J.; and Feldman, R. S. (1956). Just Noticeable Differences in Dichotic Phase. Journal of the Acoustical Society of America, vol. 28, no. 5, Sept., pp. 860-864.
