Simplified Binaural Measurement Systems for Interior Noise Evaluation of Truck Vehicle Compartments

Master’s Thesis in the International Master’s programme in Sound and Vibration MADELENE PERSSON AND PETER TORSTENSSON

Department of Civil and Environmental Engineering Division of Applied Acoustics Chalmers Room Acoustics Group - MSA CHALMERS UNIVERSITY OF TECHNOLOGY Göteborg, Sweden 2007 Master’s Thesis 2007:32

MASTER’STHESIS 2007:32

SimplifiedBinauralMeasurementSystemsfor InteriorNoiseEvaluationofTruckVehicle Compartments

Master’sThesisintheInternationalMaster’sprogrammeinSoundand Vibration

MADELENEPERSSONANDPETERTORSTENSSON

DepartmentofCivilandEnvironmentalEngineering DivisionofAppliedAcoustics ChalmersRoomAcousticsGroupMSA CHALMERSUNIVERSITYOFTECHNOLOGY Göteborg,Sweden2007

Simplified Binaural Measurement Systems for Interior Noise Evaluation of TruckVehicleCompartments Master’s Thesis in the International Master’s programme in Sound and Vibration MADELENEPERSSONANDPETERTORSTENSSON

©MADELENEPERSSONANDPETERTORSTENSSON,2007

Master’s Thesis 2007:32 DepartmentofCivilandEnvironmentalEngineering Division of Applied Acoustics Chalmers Room Acoustics Group - MSA ChalmersUniversityofTechnology SE41296Göteborg Sweden Telephone:+46(0)317721000 Cover; Plot of interaural level difference calculated from head related transfer functionmeasurements, ChalmersReproservice Gothenburg,Sweden 2007

Simplified Binaural Measurement Systems for Interior Noise Evaluation of TruckVehicleCompartments Master’s Thesis in the International Master’s programme in Sound and Vibration MADELENEPERSSONANDPETERTORSTENSSON DepartmentofCivilandEnvironmentalEngineering Division of Applied Acoustics Chalmers Room Acoustics Group - MSA ChalmersUniversityofTechnology

ABSTRACT

ThisMasterThesiswascarriedoutincooperationwithVolvo3Ptodesigna simple system capable to record, playback and measure levels of noise in truck compartments. The desire was to achieve good enough quality of the output data to replace the currently used binaural recording system. It was requested that the recording system should be able to measure correct absolute levels, be portable as well as provide a binaural recording fidelity comparabletothatoftheequipmentusedatthecompanytoday.Thebinaural recording fidelity of five different binauralsimilar recording systems has been studied, with commercially produced artificial head and headset as reference. For level measurements, free hanging were used as reference.

The binaural recording quality was investigated using spectral analysis, measurementofheadrelatedtransferfunctions,andlisteningtests.Fromthe head related transfer function we obtained knowledge of the directional propertiesoftherecordingsystems,whilethelisteningtestsprovideduswith theperceiveddifferencesoftherecordings.Tojudgethebinauralqualityof therecordingsintheirintendedacousticalenvironment,thefinalevaluation wasperformedwithincabnoisemeasurements.Thetwosystemscomposed of a spherical smooth head, “Sphere”, and the display head, “Friggo” performed well in the listening test regarding reality of incab noise recordings.ThesetwoheadsweremanufacturedattheVolvoTrucks.

Becauseofitsgoodbinauralrecordingqualitycomparedtotheartificialhead and reliable level measurements, Friggo is recommended to be used for recordingsofincabnoise.Itperformedwellinalllisteningtests.Moreover, Friggomeasuredincomparisontotheartificialhead,incabnoiselevelsmore similar to those of measurements done with free hanging microphones, in comparisontotheothersystems.Thus,furtherinvestigationisrecommended toobtainloudnesslevelsinplaybackcorrespondingtotherealsituation.

I

Contents

ABSTRACT I CONTENTS 1 PREFACE 3

1 INTRODUCTION 4 1.1 Goal 4

2 BACKGROUND 5 2.1 Humanlocalizationofsound 5 2.1.1 Interauralleveldifference,ILD 5 2.1.2 Interauraltimedifference,ITD 6 2.1.3 Confusioninlocalization 8 2.1.4 Binauralloudness 8 2.1.5 Binauralmaskingleveldifference 9 2.1.6 Cocktailpartyeffect 10 2.2 Headrelatedtransferfunctions 10 2.3 Stereoreproductionandrecordingsystems 11 2.3.1 Recordingsystems 11 2.3.2 Reproduction 11 2.4 Soundfields 12 2.4.1 Freefield 12 2.4.2 Diffusefield 12 2.4.3 Thesoundfieldinsideatruckcompartment 13 2.4.4 Interiornoise 14

3 MEASUREMENTSYSTEMS 16 3.1 ORTFconfiguration 16 3.2 OptimumstereosignalJecklinDisc 17 3.3 Angularsphericalpatent–Wedge 18 3.4 Sphericalshapewithbuiltinmicrophones–Sphere 19 3.5 Headwithbuiltinmicrophones–Friggo 19 3.6 Artificialhead–Manikin 20 3.7 Binauralheadset–Headset 21 3.8 Microphones 22

4 PHYSICALMEASUREMENTS 23

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 1 4.1 Referencesource 23 4.1.1 Resultsfrommeasurementswithreferencesource 24 4.2 MeasurementofHRTFs 27 4.2.1 Measurementresults 29 4.3 Measurementsofincabnoise 35 4.3.1 Comparisonstofreehangingmicrophones 35 4.3.2 Spectralanalysis 38

5 PERCEPTUALSENSATIONOFTHEMEASUREMENTSYSTEMS 43 5.1 Localization 43 5.1.1 ResultsoftheLocalizationtest 45 5.2 Sourceseparationability 48 5.2.1 ResultsoftheSourceseparationabilitytest 49 5.3 Cocktailpartyeffect 51 5.3.1 Resultsfromtheperceptionofnumbers 52 5.3.2 Resultsfromtheintelligibilityofnumbers 54 5.3.3 Results,disregardingdeviantanswers 56 5.3.4 Spectralanalysisofthesoundsinthenumberstest 60 5.4 Recordedtrucksound 65 5.4.1 Resultsfromincabtest 66

6 DISCUSSION 67

7 CONCLUSIONS 70 REFERENCES 72 APPENDIXA–MEASUREMENTSYSTEMSSPECIFICATIONS 74 APPENDIXBABSORPTIONMATERIAL 75 APPENDIXCCALIBRATION 76 APPENDIXDADDITIONALMEASUREMENTEQUIPMENT: 77 APPENDIXE–REFERENCEFAN 78 APPENDIXF–RESULTSFROMHRTFMEASUREMENTS 79 APPENDIXG–LISTENINGTESTS 85 APPENDIXH–BARKSCALE 89 APPENDIXI–STATISTICALANALYSIS 90

2 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Preface

This project has been carried out from September 2006 to February 2007 in cooperation with Volvo 3P and the Chalmers Room Acoustic Group at Applied Acoustics (CRAGMSA), Chalmers University of Technology. The mainpartofthethesishasbeendoneattheNoiseandVibrationLaboratoryat VolvoTrucks,Gothenburg.

Throughout the master thesis we received an immense support from our supervisorsMScChristinaKeulemans,Volvo3PandTechLicAndersGenell, CRAGwhoalwayscouldfindtimeforourthoughtsandquestions.Wealso like to give special thanks Eugen Joss and Dr. Kaj Bodlund for their commitmentandhelpwithproofreadingthereport.

SpecialregardstothestaffbothatVolvo3PandChalmersthatdidtheirbest ofansweringourquestionsandhelpuswithsmallbitsandpiecesalongthe way.

FinallywealsowouldliketothankDr.MendelKleiner,examineratApplied Acoustics,fortheguidanceandcriticalreviewofthethesiswork.

GöteborgMarch2007

MadelenePerssonandPeterTorstensson

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 3 1 Introduction

Recordingequipmentrenderinggoodbinauralfidelityisneededtoenablean evaluation of a recorded sound field. Sound waves travelling inside an enclosurearereflectedandshattered.Hencetheresultingreverberantsound field will be a superposition of waves travelling in different directions. All sounds together create the acoustical environment and contribute to the listener’sperceptionofthesound.

Measurements of sound pressure levels are preferably performed using a single omnidirectional . This diminishes the influence from the recordingequipmentontherecordedsoundfield.Measurementsofabsolute levels with a twochannel binaural recording system are associated with a number of difficulties. One obvious difficulty is to find a strategy for combining the two channels, and establishing a correct resulting level comparabletothatobtainedwithsinglemicrophonemeasurement.Also,the presence of the recording equipment in the recorded sound field will influenceonthesoundlevelmeasurements.

ThisMasterThesiswascarriedoutincooperationwithVolvo3Pinorderto help designing a simple system capable to record, playback and measure levelsofnoiseintruckcompartments.Thedesirewastoachievegoodenough quality of the output data to replace the currently used binaural recording system(HMS.II.1).Itwasrequestedthattherecordingsystemshouldbeable tomeasurecorrectabsolutelevels,beportableaswellasprovideabinaural recordingfidelitycomparabletotheequipmentusedatthecompanytoday.

1.1 Goal

Thegoalofthisthesisworkwastoinvestigatethepossibilityofdevelopinga microphone system having a binaural recording quality comparable to that obtainedusingtherecordingequipmentavailableatVolvotoday.Therefore aninvestigationregardingifsimplerbinauralsimilarmeasurementsystems, including stereo techniques, can be used instead of the binaural recording system is of interest. The recording system should also be able to provide correctabsolutelevelsinaccordancewiththepresentequipment.Moreover, astherecordingsystemisintendedtobeusedforonroadmeasurementsat thepassengersideinthetruckcompartments,itisdesiredthattherecording systemiseasilyportable.

4 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 2 Background 2.1 Humanlocalizationofsound

Sound localization is the process by which humans perceive the apparent spatialpositionofanacousticalsource.Weareabletodifferentiatebetween soundsourcesthatareinfrontandbehind,behindandbelow,totherightand totheleft.Humanhearingisprovidedwiththeabilitytoevaluateavarietyof cueswhichenablesthisperceptioninwaysyetnotentirelyunderstood.The auditory system processes and correlates the sounds coming to both ears continuously.Theconceptoftheprocess,fromthemomentwhenthesound arrivesattheearstotheperceivedsensationiscalledbinauralhearing.

Listening to a binaural recording will ideally give the same auditory experienceasifthelistenerexperiencedthesituationinreallife.Inabinaural recording it is possible to capture the acoustical room environment and the different source locations in the recorded sound field. This is not possible whenlisteningtorecordingsdonewithasinglemicrophone.

There are many acoustical cues and properties which enable humans to localize sound. Some of them have been selected to be more thoroughly presentedinthissectionofthereport.

2.1.1 Interauralleveldifference,ILD

Using both ears when listening to a sound will give the ability to acquire a sensationoffromwhichdirectionthesoundarrives.Oneimportantproperty ofhearingistosensetheleveldifferencebetweentheears.Thisisknownas theInterauralleveldifference(ILD)andisacomplexquantitydependingon headandearshadowingeffects,frequencyandtheangleofincidence.Figure 1showsaprincipalsketchoftheILDconcept.

Figure1:Interauralleveldifferenceisthedifferenceinlevelofasoundatbothears.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 5 Forsoundsourcesatrelativelylongdistancefromthehead(>1m),thepath lengthdifferencefromthesourcetotheearsissmallcomparedtothedistance fromthesourcetothelistener’shead.Thedifferenceinpathlengthbetween the two ears is negligible, and the ILD is caused by the diffraction and reflectionofthehead.Thisisoftenreferredtoasacousticalshadowing,which reducesthesoundpressurelevelatthefarear.Whenthesourceiswithin1m ofthehead,however,thepathlengthdifferenceisnolongernegligible,and theILDisalsocreatedbytheadditionalattenuationincurredalongthepath to the ear farther from the source. At frequencies with a wavelength longer thantheshortestdistancebetweentheearstheILDwillbesmall.Forhigher frequenciesthescatteringduetotheheadwillincreasesignificantlyandthe ILDwillincreaseasaconsequence.

2.1.2 Interauraltimedifference,ITD

Thedifferenceintimeofarrivalofsoundatthetwoearsfromasoundsource displaced from the median plane informs the auditory system about the position of the source. This time is referred to as interaural time difference,ITD,andconstitutesaveryimportantbinauralcueforlocalization ability.ThefunctionofITDispresentedinFigure2.

Figure2:Theinterauraltimedifferencedescribesthedifferenceintimeofarrivalat bothears.

TheITDisexploitedbytheauditorysystemintwodifferentwaysdepending on the frequency of the sound. In the frequency rangeuptoapproximately 800Hzthephasedelaybetweentherightandleftsignal is the dominating attribute whereas for frequencies higher than approximately 1.5 kHz, the groupdelaybetweentheleftandrightsignalbecomesofgreaterimportance. Atfrequenciesabove800Hztheabilitytousethephasedelayinlocalization decreasesanditisnotusedatfrequencieshigherthanapproximately1.5kHz. Inthefrequencyrange0.81.5kHzbothphaseandgroupdelayareusedas cuesforlocalization.Basedontheinterauraltransferfunction,ITF,definedas the ration between the right and left HRTFs (explained later in this section)

6 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 the ITD based on the phase and group delay can be computed using the followingexpressions:

arg( ITF ) ITD = f <~ 5.1 kHz (1) ω

d(arg( ITF )) ITD = f >~800 Hz (2) dω

AsmentionedbeforeITDcalculatedwithphasedelay,(eq.1)isonlyrelevant at low frequencies and ambiguity occurs at higher frequencies, where the phasedelayisgreaterthanthehalfperiodT/2ofatone.

For higher frequencies where the ITD becomes ambiguous, ITD can be estimatedwiththegroupdelay(eq.2).TheITDisevaluatedbythetimeshift between envelopes. ITD based on the group delay has been proven in psychoacousticalexperimentstobeanimportantcueforlocalization[16]f> 800Hz.

The ITD has been studied by evaluating the perceived lateral displacement from the median plane as a function of ITD. The result for impulse content (noise,speech)wasfoundtobelinearuntilapproximatelyITD=630s[16]. Inextensionthisimpliesthatforfrequencieswithahalfperiodgreaterthan 630 s will attain a full lateral displacement. Another reason for the upper frequencylimitoflateraldisplacementoftonesisduetorefractoryperiod:the restperiodforwhichtheneuronscannotsendsignals.Thisperiodis12ms, whichindicatesalateraldisplacementinducedbyphasedelaynotnoticeable at1.51.6kHz[16].

HumanscandetectachangeintheITDassmallas10 µsfromanoriginalITD of400 µs[25].ThevariationofITDcausedbyahumanheadisintherangeof the phase delays that are perceptible. Experiments have shown that prominentchangesofabout80 µs(overafrequencyrangeof500Hzto1kHz) inITDcanbefoundforasingleazimuthbetween30ºand60ºofthearriving sound.Inthefrequencyrangeuptoapproximately1kHzithasbeenshown in previous studies that ITD dominates the cues of localization. In this frequency range good resemblance of ITD estimated using spherical heads andrealorartificialheadshasbeenobserved[26],[27].

ToproduceajustnoticeabledifferenceofITDforfrontalincidenttherelative timedelaybetweentheearshastobeabout50sat frequencies below 1.5 kHz [2], which corresponds to an angle of 5º. It varies though a lot from subjecttosubject;researchhasmeasuredvaluesofITDbetween30andupto 200sforjustnoticeableinterauraldelay.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 7 2.1.3 Confusioninlocalization

ITDandILDarepowerfulcuesforthelocalizationofasource,buttheyhave important limitations. For a source moving in the median plane (the perpendicular bisector of a line drawn trough both ears), the binaural differencesILDandITDarezero.Thiscancausefront/backconfusionwhich arisesasthelistenercannotdifferentiatebetweensourcesinthefrontandin the back. The cone of confusion also describe a situation where the sound arriving from different angles still give rise to equal ILD and ITD. As an example all sounds arriving from the angles along the circumference of the sphereconeinFigure3willhavethesamerelationofITDandILDatboth ears.

b a

Figure3:Soundarrivingfromallpositionsalongthecircumferenceoftheconewill haveequalILDandITDiftheheadisseenasspherical.

Evidently,thereareotherpropertiesofthehumanhearingwhichenableusto localizeasoundsource.Theseadditionalpropertiesmightberelatedtohow theindividuallistener’souterears,head,shoulderanduppertorsoaffectthe sound [6]. The body of the listener leads to scattering of the sound, an acousticalfilteringofthesignalappearsattheleftandrightear.

2.1.4 Binauralloudness

Binaural loudness is a property of hearing which describes the perceived loudnesswhenlisteningwithbothears.Youmighthavepersonalexperience of the difference in perceived loudness between listening to one and two earphones.Changingfromoneearphonetotwoearphonesgiveanincreased perceived loudness level. The increased level depends on the present sensationlevel(SL)andloudness.Binauralloudnessisnotlineardependent of either frequency or sound pressure level. Switching from monaural to binaural listening of a soft sound (20dB SL) give rise to an increase of the

8 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 perceived loudness with a factor 2. This corresponds to a level increase in (monaural)soundpressurelevelof8dB.Athighersensationlevels80dB(SL), theincreasefrommonauraltobinauralloudnessisonlyafactorofabout1.4 (SPLincreaseof6dB)[2].

2.1.5 Binauralmaskingleveldifference

Binaural Masking Level Difference (BMLD) can be described as an improvement of signal detection in noise when either the phase or level differencesofthesignalatthetwoearsarenotthesameasforthemasking signal,[2].BMLDcanbeunderstoodasareductionofthemaskingeffectifthe signalandmaskeroriginatefromdifferentlocationsinspace.

BMLDcanarrivefromvariousconditionsandareinvestigatedinmanyways. Usuallyonestudiesamaskerwithsamephaseatbothearstowhichasignal isaddedatbothearsbutwithaphaseshiftof180o.Oppositeconditionsisalso used;thesignalhasconstantphaseatbothearsandthemaskerhasaphase shift.GenerallytheeffectofBMLDishigherinthelowfrequencyrange[3]. Figure4showsoneexampleofBMLDwherethecomparablemonauraland binauralthresholds[2]areinvestigated.

Figure4:Theupperpanelshowsthemediantesttonethresholdsasafunctionofthe testtonefrequency.Themaskinglevelis60dBSPL.Thedotsshowthe monaural results while the open circles are the binaural result. The dotted line shows the expected values according to justnoticeable difference in level. The lower panel shows the corresponding BMLD (open circles), as a function of test frequency and predicted values (dashdottedline),[2].

Usingasinusoidaltoneasmaskerandthedetectabletonephaseshifted105 o thethresholdhasbeenfoundbothinthemonauralcase(masker+toneinone

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 9 ear,level60dB)andbinauralcase(maskeratoneearandmasker+toneinthe other ear both at level 60dB). There is a noticeable level difference between monaural and binaural threshold. For binaural hearing at lower frequencies below400Hz,thethresholdis15dBlowerthaninthemonauralcasewhileno improvementisfoundabove1kHzincomparisontothemonauralthreshold. Thephaseshiftof105 oiscomparabletoatimedelayof80 µsandthedashed lineshowstheexpectedlevel.

2.1.6 Cocktailpartyeffect

Thecocktailpartyeffectreferstotheabilitytofocusone'slisteningattention onasingletalkeramongacacophonyofconversationsandbackgroundnoise. It is taken for granted and is used on a daily basis by most people. The cocktail party effect is a consequence of the binaural hearing and is mainly derived from that sound origin from different locations will not be as effectivelymaskedassoundcomingfromthesamelocation.

Thecocktailpartyeffecthasbeenresearchedsincethemid1960satMIT.Their studieshaveforexampleconcernedintelligibilityofcertainmessagesreadout simultaneous with other disturbing messages. The results showed that generallythecontextisunderstoodandonlyshortphrasesof23wordsare incorrect[19].

2.2 Headrelatedtransferfunctions

Theshapeandsizeofthehead,torso,andinparticularofthepinnaehavea greatinfluenceontheperceptionofsound.Theheadrelatedtransferfunction (HRTF)somehowgivesananalyticalmeasureofthoseinfluences.Itaccounts for diffraction around the head, reflections from the shoulders and most significant,reflectionsinsidethepinnae.AstheHRTFaremeasuredformany anglesofsoundincidentstheycanbeusedtostudythedirectiondependent perception of sound of recording system or individual. The anatomical features vary greatly across individuals and hence the HRTF is highly personalized. To achieve the best possible binaural playback one should thereforemeasureapersonalHRTFtofiltertherecordedsoundwith.Ifone thenusesanappropriateplaybackconfiguration,theexperienceatplayback isascloseaspossibletotheexperienceduringrecording.Thereareofcourse stilllimitationsifahighnumberofsourcesarepresentwithsoundarriving fromseveraldirectionsintherecordedenvironment.PremeasuredHRTFcan alsobeusedforbinauralsoundsynthesis,whereanechoicallyrecordedsound isfilteredwithtransferfunctionscorrespondingtothedesiredspatialposition ofthesource[22].

10 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 2.3 Stereoreproductionandrecordingsystems

2.3.1 Recordingsystems

There are a number of basic systems for stereo recordings using different arrangement and directivity characteristics of microphones. Basically all systemsrelyontheuseofvariousdirectivitypatternmicrophonesmounted inananglefromeachotherorspacedsymmetricalfromacenterline.Sound sources on the center line will produce equal signals at both microphones whichproducesaphantomimageinthemiddleofastereoplayback.Sound sources off center will produce a higherlevel signal from the microphone withdirectiontowardsthesourcewhichwillresultinanoffcenterphantom image[5].

In one study two main categories of stereo recording systems were investigated [2]; near coincident pair and baffledomni pair. The near coincidentpairmethodusestwodirectionalmicrophonesangledapartwith the diaphragms some decimetres apart horizontally. The spacing gives the stereospreadandaddsasenseofambientwarmthorairtotherecording.The greatertheangleorspacingbetweenthemicrophones,thegreaterthestereo spread [5]. The directional pattern of the microphones produces level differencesbetweenthechannelsandthespacingproducetimedifferences.A recording system in this category is the ORTF (Office de Radiodiffusion Television Francaise) configuration, which consists of two angled microphones with cardioid pattern (further explained in section 3.1), and is amongtherecordingsystemsstudiedhere. TheBaffledOmniPairusestwo omnidirectional microphones, usually spaced at a distance equal to the distancebetweentheears,andalsoseparatedbyahardorpaddedbaffle.The spacingbetweenthemicrophonescreatestimedifferences(correspondingto ITD) and the baffle creates level differences (corresponding to ILD), most prominentforhighfrequencies.Amongtherecordingsystemsconcernedin this report, the artificial head as well as the recording systems using microphonesmountedonbafflesbelongstotheBaffledOmniPaircategory.

2.3.2 Reproduction

Itisabasicpurposeofstereophonicmicrophonestopickupthedirectional information of the sound scene. This is in orderto achieve a corresponding directional distribution and dimension of the sound sources during reproduction[4].Duringplaybackofastereorecordingthelistenerperceives virtual sound sources contained in the sound. The image location ideally correspondstothesoundsourcelocationduringtherecordingsession[5].

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 11 Twomethodscommonlyusedtoreproducebinauralrecordingsareplayback using with crosstalk cancellation and direct headphone playback.Crosstalkcancellationconsistsoftwoloudspeakerswhichtogether formthebinauralsoundfieldbyILD,ITDandincludealsointerferencesuch ascancellation.Itonlyreproducestherecordingatacorrectlyspecificpoint andhashighdemandstobeadjustedproperly.Thecrosstalkcancellationhas aclearadvantagetogeneratefrontallylocatedauditoryimagesincomparison to.Arefinedversionofcrosstalkcancellationisknownasstereo dipole crosstalk cancellation [28]. Playback with stereo dipole crosstalk cancellationinasetuphastheadvantageofallowingsmallhead movementsinordertolocalizesounds.Instereodipolecrosstalkcancellation theloudspeakerareplacedatanangleof10 owhichislargerthantheangle generally used in crosstalk cancellation setups. The reason to use playback through headphones is naturally its simplicity. The fact that it is portable makesitverysuitabletouseduringmeasurementsandlisteningtests.

2.4 Soundfields

Sound waves emitted by a sound source inside a room, enclosed space, or outdoors,arereflected,diffracted,scattered,andmightbepartlyabsorbedby obstaclesastheytravelinspace.Thesoundfieldthenreferstotheacoustical environment which is created from all this contributions. Different sound fields will be perceived differently by the listener. The perception of sound dependsonthespectral,temporalandspatialpropertiesofthesoundfield.

2.4.1 Freefield

Thefreefieldconditioniseasiestdescribedasanopenspacesoundfield.The soundtravelswithoutanyinterruptioninspaceandisthereforenotaffected or influenced of the surrounding. The anechoic chamber is a room used to simulate free field condition. It is designed to absorb most of the power in soundwaveshittingtheboundariesoftheroom.Measurementsmadeinan anechoic environment present the sound pressure level of a sound source withlittleinterferenceoftheenvironment.

2.4.2 Diffusefield

Aperfectlydiffusesoundfieldhasauniformlydistributedenergydensityat anypointinanenclosureindependentofthedirection.Anotherdefinitionof adiffusesoundfieldcanbeformulatedthateachpointontheboundaryofthe enclosurewillbehitbythesameamountofenergyperunitarea.Inreality such a field does not exist but it will normally be simulated in reverberant

12 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 rooms. According to ISO standard (EN ISO354:2003), it is recommended to have a volume of at least 200 m 3, but not larger than 500 m 3 (then air absorptionstarttoinfluence)toenableacceptablemeasuringconditions.The sound field is then seen to be diffuse at positions more than 1 m from the walls,floor,andceiling,atfrequenciesabovetheshröderfrequency, fs,ofthe room.

T f = 2000 (3), s V

V=volumeoftheroom(m 3) T=reverberationtime(s)

2.4.3 Thesoundfieldinsideatruckcompartment

Inavehicleoraroomthesoundfieldisneitherafreenoradiffusefield.The soundfieldinanenclosureiscomposedbymodeswhichdependonthesize oftheenclosureandthespeedofsound.

If a truck compartment is approximated by a rectangular space, the mode frequenciescanbecalculatedas[20]:

2  π  2  π   π  2 = c a +  b  + c f a,b,c       (4), 2π  x   y   z  wherecisthespeedofsoundinair(ms 1) a,b,c=integermodenumber1,2, x,y,z=dimensionsofspace(m)

Calculationwithwidthx=2.27,lengthy=2andheightz=1.52mgivethe firstmodesatapproximately75,85,112,113,135and140Hz.

These numbers coincide well with those obtained in simulations of truck compartments. A study has been made of the modes in the truck. The strongest modes in the empty truck compartment are close to 80 Hz when excitedatthedriversandpassengerheadspositionrespectively.Twomodes are dominant; one extending over the width (maximum at doors and minimumnodeinbetweentheseats)andonestretchingdiagonallyfromthe roofatthebackwalldowntothefront(maximumattheroofandthefront floor).Inthestudysimulationsofaseveraltruckcompartmentsallresultedin with the previously mentioned dominating 80 Hz mode. Higher up in frequency there appeared larger shifts of the modes for the different compartments

The simulations were done assuming empty compartments, therefore additional obstacles will of course help to achieve higher diffuseness of the compartment. According to Head Acoustics, the sound field in a passenger

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 13 carcanbedescribedwithasoundfieldhavingpropertiedinbetweenfreeand diffusefield.ThesoundfieldisthencalledIDbyHeadAcousticsandstands forindependentofdirection[23].

2.4.4 Interiornoise

Noiseinavehicleisacombinationof:

• enginenoise

• roadnoise

• intakenoise

• exhaustnoise

• aerodynamicnoise

• noisefromcomponentsandancillaries

• brakenoise

• squeaksandrattles

Noisecaneitherbeairborneorstructureborn[21].Asanexample,airborne soundcanbereducedbysoundinsulation,whilethestructurebornesound canbecontrolledbyresilientmountsandradiationlimiting.

To avoid high levels of noise in the truck compartment coming from the engineitisimportanttolimittheacousticalcouplingbetweentheengineand themodesofthecompartment.GenerallyinVolvotruckstheengineisasix cylinderenginewhichisfiredinpairsoftwo,i.e.threefiringseachrotation. Thefiringofthecylinders,anenginespeedof1000rpmor1700rpmwould generate strong radiation of sound at a frequency of 50 Hz and 85 Hz respectively(correspondingtothe3rdengineorder).Theharmonicsofthose frequencies that coincide with the modes of the compartment will therefore result in a standing wave pattern and consequently high noise levels. The figurebelowshowsresultsfromanincabnoiselevelmeasurement(interior noisemeasurementinthetruckcompartment)atthepassengerside.Onecan clearlyseetheresonanceat50Hzanditsharmonicat100Hz.

14 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 60

50

40

30

Noise Noise level, dB(A) 20

10 1000rpm

0 10 20 40 80 160 315 630 1250 2500 5000 1 0 0 0 0 2 0 0 0 0

1/3-octave band, Hz Figure5:Noiselevelmeasurementintruckcompartmentfortherotationalspeedof 1000rpm.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 15 3 Measurementsystems

Five binauralsimilar recording systems were designed with a complexity rangingfromasimplestereoconfigurationtoamanikinheadmoresimilarto that used in conventional binaural recording. In addition, commercial equipment such as an artificial head and binaural headset was part of the studytoo.

Simple stereo recording systems are frequently used in auditoria, concert halls etc, in order to record . The requirement on the equipment in such measurements is more of an emotional matter than a correct reproduction. In our case the realism of the playback is of high importancealongwithmeasuringcorrectsoundpressurelevels.

Themeasurementsystemsarepresentedinthenextsectionsandadditional description of the measurement systems and microphones can be found in AppendixA. 3.1 ORTFconfiguration

The ORTF configuration (Office de Radiodiffusion Television Francaise) representsastandardmethodforrecordingstereosignals.Twomicrophones, withcardioiddirectivity,areusedspaced165mmanddiverginginanangle of112.5 o.Thissystemwillenablereproductionoflevelandphasedifference betweenrightandleftchannelwithminimumphasecancellation[9,10].Since the cardioid directivity rejects sounds from certain directions, less of the ambient room characteristics are recorded in comparison to a binaural recording[10].TheORTFconfigurationisoftenconsideredtobetheclosest simulation of how human ears perceive sound if one only uses two microphones [7]. Therefore it was chosen to be one of our measurement systems. Figure 6 shows a picture of the ORTF configuration used in the study.

16 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure6:ORTFconfiguration,seenfromabove.

3.2 OptimumstereosignalJecklinDisc

TheJecklindiscwasdevelopedbyJürgJeckling,Switzerland[11]inorderto both record and reproduce a relative satisfying stereo sound with a simple setup.TheJecklindisc,seeFigure7hasanincreasedstereophonicseparation incomparisontoORTF.Itismadeof10mmplywoodandhasadiameterof 300mm.Afoamsoundabsorberof30mmthicknesswasaddedonbothsides (Material data in Appendix C). The distance between the omnidirectional microphonesis170mm.Themicrophonesaresymmetricallyplacedonboth sidesofthedisc.Windshieldswereused.

Figure7:Jecklindisc,frontview.

Thedisc,insertedbetweenthemicrophones,ensuresadifferenceinfrequency responseatthetwomicrophonesduetotheangleofincidence.Asideeffectis theangledependentdisturbanceduetotheedgeofthedisc.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 17 3.3 Angularsphericalpatent–Wedge

In order to achieve a closer correspondence to the human head than the Jecklin disc, a simple three dimensional shape with microphones placed parallelwiththeshapewasinvestigated.Theideaofthemethodwastaken fromthepatent“Apparatusforpickingupsoundwaves”[12].Inthepatentit wasdescribed:“Itisaprincipalobjectofthepresentinventiontoprovidean apparatuswhichallowsapickupandrecordingofsoundwaveswhichisas truetonatureaspossible.”Itcanbedescribedastwocirculardiscscombined byacylindricalshelltoawedgeshapedarrangement.

Figure8:Viewformtheside:M bisthecenterofthedisc,thedistanceh=52mm from M btothemicrophoneposition.2a+2b=210mmandthe nose width 60 mm [12].Bothsideswere

Figure9:Viewfromabove:6aand6barepositionsofthemicrophones[12].

ThedimensionsoftheshapecanbeseeninFigure8Figure9withtheangle α = 17 o. To avoid the lateral reflections and interference in the interesting frequencyrangethemicrophonesareplacedatadistanceof5mmfromthe shapeafoamsoundabsorptionwasalsoattachedonbothsidesoftheshape; seeFigure10(MaterialdatainAppendixC).

18 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure10:Wedge,frontview.

3.4 Sphericalshapewithbuiltinmicrophones–Sphere

Thismeasurementsystemusesafishingfloat,withbuiltinmicrophones,see Figure 11. The Sphere is symmetric and measures the sound pressure level wheretheouterearnormallyisplaced,usingomnidirectionalmicrophones. Thematerialofthefloatisa~20mmthickplasticshellwhichwassawedinto twohalves.InordertoavoidresonancesinsidetheSphereitwasfilledwith foamabsorptionmaterialandtheupperpartwasfixedwiththicktape.There alreadyexistsacommercialproductknownasthe“Sphere“.Thecommercial “Sphere” microphone was developed to achieve convincing results for loudspeaker playback in the same way as artificial heads are used for playbackinheadphones[13].

Figure11:Sphere,frontview.

3.5 Headwithbuiltinmicrophones–Friggo

A further refinement in accordance to the human head is to use a display head made by expanded polystyrene, see Figure 12. Two microphones are

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 19 placed at the positions of the right and left ear by drilling holes from the oppositesideofthehead.

Figure12:Friggo,sideview.

3.6 Artificialhead–Manikin

Head Acoustics artificial head (referred to as Manikin in the report) is the current measurement system used by Volvo. It is placed at the passenger positionformeasurementsofinteriornoiseintruckcompartments.Therefore comparisonsofManikinanddesignedmeasurementsdevicewereofinterest. TheManikinusedwasHMSII.1,seeFigure13whichisaquiteoldversion andconsequentlydifferencesinqualitytonewerversionscanoccur.Inorder to reproduce the signal in headphones it has to be equalized for the aural cavity and ear canal for which the Manikin has an analogue filter. The usedforplaybackofmeasurementsinvehiclecompartmentsisa filterwhichiscalledID(independentofdirection),inbetweenfreefieldand diffusefield.HeadacousticsrecommendusingtheIDfilterwhenmeasuring invehicles,becausethefilteris“designed”accordingtothesoundfieldina car.However,asthisworkconcernstruckcompartments,onemustconsider possible influences on the results from the differences in the sound field betweenacarandatruck.WhenmeasuringwiththeManikintwofiltersare used;firstthebuiltin(analogue)filterandthenthedigitalfilteraddedinthe software.Thissecondfilterisusedtoachievebetterresemblancewithsingle microphonerecordings.

20 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure13:Manikin,frontview. 3.7 Binauralheadset–Headset

The Headset equipment from Head Acoustics, (BMH III.3) when worn correctlycanreplaceaManikinforbinauralrecording,insituationswherethe Manikincannotbeused,forexamplewhenmeasuringatthedriver’sposition inatruck.

Figure14:HeadsetplacedonManikin.

Onedifficultywhenmeasuringwithaheadsetisthattheresultswilldifferfor differentpersonswearingtheheadset.Asalways,whendoingmeasurements the positioning of the microphones is very important. This is even more importantwhenusingaheadset,sincethemeasurementwillalsoincorporate influencefromthetorso[14].Allmeasurementsinthisreportweredonewith theHeadsetplacedontheManikintoensurecomparability,seeFigure14.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 21 The Headset, with additional earplugs, can record the sound pressure level with characteristics corresponding to the ID equalization. The ID filter was createdusingresultsobtainedfrommanymeasurementsandwill,according to Head Acoustics, give results comparable to those from measurements made with Head’s artificial head. Still, the influence on binaural recording duetotheperson’sanatomicpropertiesi.e.headandearshapeshouldnotbe underestimated,whenevaluatingandcomparingresults.

3.8 Microphones

The noise levels were also measured with single microphones. Such microphonescanbeusedasacomplementtobinauralrecordingsystemsin ordertomeasuretheabsolutenoiselevels.Inthisstudy,asetupcalled“free hanging microphones” has been used, as a complement to the other measurement systems presented earlier, in order to compare the absolute noise levels measured. This setup is used within Volvo to do comparable noise level measurements. The “Free hanging microphones” consists of two single omnidirectional ¼” microphones, with windshields, placed at 0.1 m distancefromtheearsoftheManikin,seeFigure15.

Figure15:FreehangingmicrophoneswithManikin.

TheomnidirectionalmicrophonesusedinthesemeasurementswereG.R.A.S. prepolarizedcondensermicrophonetype40AE.Boththemicrophonesinthe headsetandtheomnidirectionalmicrophoneshadthatoperate usingaconstantcurrentpowersupply.

Microphoneswithacardioidpatternwhereusedinthemeasurementsforthe ORTF configuration. Further information regarding the calibration can be foundinAppendixB.

22 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 4 Physicalmeasurements

Theperformanceofthedifferentrecordingsystemswasevaluatedusingboth physical and perceptual measurements. The perceptual characteristics were studiedinanumberoflisteningtests.Thepurpose,designandresultsfrom thesearefoundinnextsectionofthereport.Anyrecordingsystemmustbe able to measure correct absolute levels. Partly for this reason, and also to analytically support conclusions drawn from the listening tests, physical measurementsweremadeaswell.Spectralandlimiteddirectionalproperties in the various recording systems were investigated by measurements in an anechoic chamber. The directional properties of the recording systems were investigated in detail by measuring both conventional and head related transfer functions as appropriate. It can be expected that the level measurement results will vary with the properties of the measured sound field. Therefore it was also important to do measurements inside truck compartments.

4.1 Referencesource

Measurementswithareferencefanwereperformedtogetafirstimpression of the measurement system’s directional properties as well as the possible differences in measuring absolute noise levels. The setup can be seen in Figure 16. From the reference measurement we could also verify the symmetries of the measurement systems, ensuring that both channels measured equal levels during a 360˚ sweep. The results were compared to thosefrommeasurementsusingfreehangingmicrophones.

Measurementsof360˚sweepweredonewithareferencefanplacedonthenet floorat3mdistancefromthemeasurementsystem,forfurtherspecifications seeAppendixE.

Figure16:Measurementsinanechoicchamberwithreferencefan,Wedge.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 23 4.1.1 Resultsfrommeasurementswithreferencesource

The signals were analysed spectrally in comparison with free hanging microphones. The measurement results areplotted in smoothed 12thoctave bands.Thepeakat50Hzisduetoelectricalnoise.BothManikinandHeadset signalswerefilteredwithrecommendedfilterfromHeadAcoustics.InFigure 17 and Figure 18 the results from the measurements with all measurement systemsforfrontalincidencearepresented.

Frontalincidence

Figure17:Frontalincidencewithreferencesource:fan,rightchannel.

Figure18:Frontalincidencewithreferencesource:fan,rightchannel.

Above Figure 17 and Figure 18 shows results obtained by the different measurement systems in comparison to the results using free hanging microphones(blackdashedcurve).TheManikin,Sphere,WedgeandJecklin system all show quite similar result to those using the free hanging microphones, except for the Wedge’s lack of the peak at 6 kHz. Note the

24 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 resonancesabove2.5kHzforHeadset.Thepeaksanddips,whicharedueto theconstructionoftheHeadsetdesign,areevenmoreprominentbeforethe filteringwithrecommendedfilter.

Thefiguresbelowshowthedifferencebetweenthefreehangingmicrophones and the measurement systems. Average of two channels with the reference fanatfrontalincidenceisshown.

Figure 19: Comparisons between Friggo, Jecklin and Sphere vs. free hanging microphones.

Figure20:ComparisonbetweenWedge,ORTF,ManikinandHeadset.

Figure19andFigure20showthedifferencesbetweenWedge,ORTF,Manikin and Headset versus free hanging microphones. The low frequency noise in Figure19fortheORTFisaresultofproblemswiththemeasurementsetup. Generallyallsystemsobtainsimilarspectraincomparisontothoseobtained usingfreehangingmicrophonesinthelowerfrequencyrange,upto500Hz. Athighfrequencies,between35kHz,asimilarbehaviourcanbeseenforthe

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 25 measurement systems but with individual differences. The Manikin, Jecklin andSpherehavethebestsimilaritytothefreehangingmicrophones.

Incidentangleof180°

Figure21:Frequencydependencywith180°incidence,rightear.

ThelevelsshowninFigure21areaverageovera10sintervalfromthesweep of 360°. The approximate angle of incidence is 180° ±5°.Above700Hzthe measurementsystemsstarttoseparatemoreclearly.Becauseoftheshapeof Wedge, with the widest part at the back, its measured sound level (A weighted)decreasesfasterabove2kHzthantheothermeasurementsystems, which are closer to the Manikin and Headset. All systems measure lower averageSPLthanManikinandHeadset.

TheconclusionofthemeasurementswiththereferencefanisthatSphereand Jecklinarethetwomeasurementsystemsthathaveclosestresemblancetothe free hanging microphones. But the differences inbetween the measurement systemsaresmallcomparedtofreehangingmicrophones.

Inaddition,theaveragesoundpressurelevelsfromthe360°sweepwiththe turntable,wereanalysed.Mostmeasurementsystemsmeasuredequallevels for the both channels. The 0.1 dB difference for Friggo can probably be explained by two noise peaks that can be seen in the measurement. Additional results from the measurements with the reference fan can be foundinAppendixE.

Table1:AverageSPLover360 orotationintheanechoicchamber.

[dB] Headset Manikin Friggo Sphere Wedge Jecklin ORTF

Leftchannel 75.3 75.8 74.2 73.3 74.0 71.8 76.3

Rightchannel 75.3 75.6 74.3 73.3 74.0 71.8 74.3

26 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 4.2 MeasurementofHRTFs

Because of the expected different directivity properties of the recording systems,headrelatedtransferfunctions(HRTF)wasmeasured.Theintention was to compare the directivity properties of the recording systems and possibly explain tendencies seen in listening test results. We could also investigatethesymmetryofthetwochannels.

MeasurementsofHRTFwereperformedinasetupthatconsistedofanarray of32loudspeakers,placedinasectorofasphere,asshowninFigure22.The radius of the sphere sector of loudspeakers was 1.25 m which reasonably correspondstofarfieldmeasurements.Therigenabledaresolutionofabout 8°inthehorizontalplaneand10°intheverticalplane.Measurementswere doneforbothchannelseach18˚rotation[33].

Figure22:HRTFmeasurementrig.

HRTFmeasurementswereperformedforallthedifferentrecordingsystems. The HRTF for a particular sound source location wascreatedbytakingthe ratioofthemeasuredcomplexsoundpressurewiththerecordingsystemsto the same measurement using a single omnidirectional microphone. The results are presented in terms of ILD and ITD for the two channels of the recordingsystems.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 27 Thebinauralcues,ILDandITD,werecomputedfromamplitudeandphaseof theinterauraltransferfunction,ITF.TheITFwascomputedas:

HRTF ITF = ch 1 (5) HRTF ch 2

ILDwascalculatedfromthefollowingexpression:

2 ILD = 10 log (ITF ) (6)

TheITDwascomputedwiththephasedelay(seeeq.1)inthefrequencyrange below800Hzandthegroupdelay(seeeq.2)forhigherfrequencies.Special care was taken to “unwrap” the phase, i.e. to avoid errors caused by the abruptphasechangesgoingfrom–π to π.

Bothquantitieswereanalysedbyaveragingintocriticalbandsandpresented inmollweideprojectionplots,whicharemapprojections.Duetolimitations in the measurements we did not present results below 1.6 kHz. The coordinate system in the figures has its origin in the cross section of the horizontal,frontalandmedianplane.

Figure23:ThecoordinatesystemusedinthepresentationoftheHRTF.

Theazimuthangleθistheanglebetweenavectortothesoundsourceandthe median plane, it varies between –180° and 180°. An azimuth of 90° correspondstotheleftsideofthesubjectwhile–90°correspondstotheright side. The elevation angle δ varies from –64° (below the horizontal plane) to 86°(abovethehorizontalplane).

Toreducetheinfluenceofnoiseinthemeasurements,allmeasuredimpulse responses were windowed using a Hanning window function. Moreover, possible influence from the measurement rig, such as nonflat frequency responseoftheloudspeaker,wasreducedbynormalisingthemeasurements

28 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 obtainedusingthedifferentrecordingsystemstothoseofmeasurementswith asinglemicrophone,inaccordancewiththedefinitionofHRTF.

4.2.1 Measurementresults

TheManikinistheonlyrecordingsystemthatincorporatestheinfluenceon the localization due to the torso. Reflections from the torso can appear as interferencebetweenthedirectandthereflectedsoundatthepinnae.These effectsarelikelytoappearinthefrequencyrangebelow3kHz[30].Influence fromthepinnaeitselfcanbeexpectedatfrequenciesaboveapproximately5 kHz.Theonlyrecordingsystemsthat,toanyextend,resemblesthepinnaeare the Manikin and Friggo, possible this can be seen as diverging results comparedtotheotherrecordingsystemsatfrequenciesabove5kHz[30].

AsfortheSphere,onecannotpresumeittobeagoodapproximationofthe humancharacteristicsoverthewholeaudiblefrequencyrange.Atfrequencies wherethewavelengthiscomparativelyshortincomparisontothestructure, suchasthetorso,head,andthepinnae,thecorrespondencetohumanhearing willbereduced[29].InearlierstudiesaSpherewasfoundtobeareasonable approximation to a real head up to a frequency of around 2.5 kHz [29] in termsoffrequencyresponsecharacteristics.Intermsoflocalizationcues,ITD dominateILDandspectralcuesatlowfrequenciesbelowapproximately1.5 kHz. Higher in frequency, ILD dominates the localization cues. Good correspondence to measurements with artificial heads was obtained for Sphereupto2.5kHz.Higherupinfrequency,aninfluenceonthelocalization fromthepinnaecanbeexpectedandtheILDproducedforsphereresultsina largeconeofconfusion.

ResultsfromtheHRTFmeasurementsweredecidedtobepresentedforthe recordingsystemsManikin,FriggoandSphere.Thescalesinthefigureshave individual ranges for each recording system. All results from the HRTF measurementshowaspecificzigzagpattern.Thispatternhasbeenfoundto correspondtothearrangementofloudspeakersatdifferentelevationsinthe measurementrig(seeFigure22).ThemeasurementsforFriggocanbeseento haveashiftduetoincorrectadjustmentalongthemedianplane.

Interaural level difference

According to earlier studies, presented above, reasonably goodresemblance couldbeexpectedbetweenthethreerecordingsystemsforfrequenciesbelow ~4kHz(wherethepinnaestarttohaveasignificantinfluence).InFigure24 theILDispresentedatthe2.5kHzcriticalfrequencyband.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 29

30 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure24:ILD[dB]forManikin,FriggoandSphereatthecriticalfrequencyband withcenterfrequencyof2.5kHz.

TheILDhasastrongdependencyoffrequencyandangleofincidenceasseen in Figure 24 and the figures in Appendix F. Compared to measurements of ILDfoundinliterature,ourmeasurementfallintothesamerange[31].Itcan be noticed that the range of ILD corresponds better for Sphere and Friggo, while the values for Manikin are higher. The Sphere and Friggo reach approximately 15dB, while for Manikin values reach 20dB. The ILD for Manikin is not symmetrical over elevation angles. The explanation for the increasedILDabovethemedianplanemaybeduetotorsoreflections,which shouldbemostprominentforpositiveelevations.Thetorsomightalsolower thelevelsofILDfornegativeelevationsbecauseofscatteringandinterference betweenthedirectsoundandreflectedsoundfromthetorso.

In Figure 24 the ILD for Manikin appear to be more directional, i.e. more concentratedtotheangleofincidenceclosetotheears.Thesameappearance canbeseenuptoafrequencyrangeof13.5kHz(SeeAppendixF).Sincethe frequencyrange2.5kHzinFigure24isbelowwherethepinnaestartstohave aninfluence,theexplanationisratherconnectedtotheshapeoftheheadand torso.

In the frequency range below 1.6 kHz, the principal diameter of the head shape of the measurement systems are smaller than the wavelength, which should indicate a relatively good correspondence between them. However, thesamerelationsareshownasfor2.5kHz(seeAppendixF),thepresumed similarities could not be found. Thus in this frequency range the ITD is dominatingandILDislessimportantasabinauralcue.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 31 The ILD measured for Friggo (Figure 24) can be seen to be shifted in the direction of negative azimuth as mentioned earlier. Moreover, it can be noticed that Friggo measures higher ILD levels for angle of incidence with negativeazimuthcomparedtotheManikinandSphere.

Negative values of ILD can be seen in the figures of ILD. This is counter intuitive,asitimpliesahigherpressureatthefarsideoftheheadthanonthe sideclosertothesoundsource.NegativeILDcan bemeasuredonhumans, andareoftenexplainedbypinnaenotches.Thus,asthepinnaeisquitesimply modelled in all recording systems concerned in this study, including the Manikin, the main reason probably is lack of precision in the measurement setup.

Interaural time difference

Inthefrequencyrangebelow1.6kHztheITDisthemostimportantbinaural cueforlocalization.Therefore,inFigure25ITDispresentedforthe1.6kHz criticalfrequencyband.

32 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32

Figure25:ITD[s]forManikin,FriggoandSphereatthecriticalfrequencybandwith centerfrequencyof1.6kHz.

TherangeofITDusedinFigure25isdeterminedforeachrecordingsystem individuallyovertheentiremeasuredfrequencyrange.Basedonthat,itcan benoticedthattherangeofITDislargerforManikin(upto1.2ms).AnITD

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 33 of1.2mscorrespondstoasoundpathlengthofapproximately30cm,which seems as rather long in comparison to the circumference of Manikin. These unrealisticresultsmayhavebeencausedbyfrequencydependentphaseshifts introducedbythemeasurementsystem.Moreover,negativevaluesofITDcan be seen in the plots. This does not agree with reality as it implies that the sound wave reaches the ear at greater distance from the source before it reaches the closer ear. Most likely this also is due to imprecision in the measurement of phase delay. Compared to studies of ITD in literature, one can expect ITD extending up to approximately 0.8 ms. This corresponds ratherwelltotheresultsofITDforFriggoandSphere.

Judging from the appearance of the plots in Figure 25, Sphere shows ITD values remarkably levelled compared toManikin andFriggo. This indicates that concerning ITD, the Sphere is less sensitive to changes in angle of incidence than the other two recording systems. Thewavelengthat1.6kHz correspondtothediametersoftheheadshapesoftherecordingsystems.This clearly viewable characteristic of Sphere is rather surprising. Moreover, the ITDforSphereinFigure25,aswellasintheentiremeasuredfrequencyrange (see Appendix F), shows ILDvalues most focused tothe angle of incidence closetotheears.MostlikelythishastodowiththesmoothdesignofSphere. Obstaclesandcavitiescausesphaseshiftsdependentonthefrequencyandthe angleofincidence,thisbehaviourappearsinthefiguresofITDforFriggoand Manikin.

Therelationbetweenthewavelengthofsoundandthesizeofthemeasured headshapeseemstohaveaconsiderableinfluenceontheITD.InFigure25, theITDat1.6kHzforManikincanbeseentobequite varying. However, alreadyatthe2.5kHzcriticalfrequencyband,theManikinshowsvaluesof ITDconsiderablemorefocusedtotheareasclosetotheears.Itcanbenoticed, in correspondence with the figures of ILD, that the Manikin shows larger valuesofITDforpositiveanglesofelevationcomparedtonegative(appear more clearly for frequencies higher in range, see Appendix F). This may be duetotherelativelylargerdimensionsoftheupperhalfoftheManikin.

TheITDresultsforFriggodeviateslightlyoverthemeasuredfrequencyrange comparedtotheotherrecordingsystems.Firstlyitisnotuntilatthe8.5kHz critical frequency band where Friggo shows the levelled out ITD characteristicscenteredattheearswhichisrecognizedatconsiderablelower frequenciesforManikinandSphere.Thiscanpartlybeexplainedbythesize of Friggo which is smaller than the other recording systems. Thus, at frequenciesabove3.4kHzthewavelengthapproachesthesizeoftheradiusof Friggoandaccordinglyabovethisfrequencythesizeshouldnotserveasan explanation. Another deviation regarding Friggo is that negative values of ITD can be seen at 8.5 kHz critical frequency band for angles of incidence close to the ears, i.e. for angle of incidence where large ITD could be presumed.Oneexplanationcouldbethatsoundwavesareabletoexciteand travelwithinthepolystyreneplasticandbecauseofthehighspeedofsound

34 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 in polystyrene, cause small values of ITD. This in combination with the possibleinfluencefromthemeasurementsystemdiscussedearliercouldserve asanexplanation.However,whatcausesthiseffecttobemostprominentat specifically the 8.5 kHz critical frequency band and for certain angel of incidenceisnotexplainedfromthis.Theexplanationforthiscouldratherbe duetointerferenceofsoundwavesattheearonthefarside.

Graphs of ILD and ITD can be found in Appendix F for a frequency range reaching from the 1.6 – 13.5 kHz critical frequency band. For frequencies above4kHz,wherethecavitiesandobstaclesofManikinandFriggowould influenceonthecharacteristics,someobservationscanbemade.Therangein which the ILD varies has a better correspondence between the different recording systems when the frequency is increased. It can be noticed from studying the figures that the maximum ILD is found in the 8.5 kHz critical band for all three recording systems. One could expect the ILD to increase with frequency for the entire measured frequency range and an obvious explanationforthisishardtofind.However,theexplanationmaybefoundin thepropertiesofmeasurementrig.ThemeasuredITDforFriggoandSphere increasesforincreasingfrequencies.Thisagreeswithwhatcouldbeexpected and implies that the phase shift caused by the recording systems increases withfrequencyandthetraveldistance(numberofwavelengths). 4.3 Measurementsofincabnoise

Measurements were performed at Hällered proving ground to decide how well the measurement systems finally could be used for the purpose of recordingincabnoiseatthepassengerseat.

ThemeasurementweredoneaccordingtoGDI96424butincludedonlythe followingdrivingcases:

• Fullloadacceleration

• Constantspeed85km/h

• Lowidlespeed,withoutcompressorworking

• Lowidlespeed,withcompressorworking

The measurements were done during a day with stable weather conditions andmediumwind.Thetemperaturewas4 °C.

4.3.1 Comparisonstofreehangingmicrophones

The results from the Incab noise measurements are presented in L(A) eq according to Volvo standard (method I). The L(A) eq is a time averaged A weighted SPL. Narrow band spectra are studied to detect differences in the characteristicsofthemeasurementsystems.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 35 Table 2 and Table 3 show the difference in averaged dBA level difference between the measurements performed with the measurement systems and those with free hanging microphones for all concerned driving cases (measurement system – free hanging microphones). The noise levels presentedareenergyaveragesofthetwochannels.

Table2:Differencesinoveralllevelsbetweenthemeasurementsystemsandthefree hangingmicrophones.Recordingsdoneatthepassengerside.

Full load acceleration Constant speed 85km/h 990-1620 rpm (level difference in dBA) (level difference in dBA) Friggo – Free mic. 0.2 0.3 Jecklin – Free mic. 0.6 1.1 Sphere – Free mic. 0.6 0.5 Wedge – Free mic. 0 0.4 Manikin – Free mic. 1.4 1.4

Table3:Differencesinoveralllevelsbetweenthemeasurementsystemsandthefree hangingmicrophones.Recordingsdoneatthepassengerside.

Low Idle 600 rpm Low Idle 600 rpm without compressor with compressor working working (level difference in dBA) (level difference in dBA) Friggo – Free mic. 0.1 0.2 Jecklin – Free mic. 0.8 0.8 Sphere – Free mic. 0.6 0.6 Wedge – Free mic. 0.6 0.5 Manikin – Free mic. 0.9 0.3

As can be seen in Table 2 and Table 3 the overall minimum differences between free hanging microphones and the measurement systems are obtained for Friggo (maximum of 0.3 dBA). Furthermore it can be noticed thatJecklinandtheManikinmeasurehigherlevelscomparedtofreehanging microphones, while Friggo and Sphere measure lowerlevels. It can be seen thatthemaximumdifferencesisobtainedforManikin(1.4dBA).

36 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Additional, a study of the repeatability of the measurements done with Headset was made. Table 4 and Table 5 show the average and maximum differencesbetweenthemeasurementsperformedwiththeHeadsetandthose withfreehangingmicrophones.Thevaluesinthetablesarecalculatedfrom seriesoffiverecordings.

Table 4: Average level differences between Headset and free hanging microphones. Calculationsbasedonfiverecordingsdoneatthedriverside.

Full load acceleration Constant speed 85km/h 960-1620 rpm (level difference in dBA) (level difference in dBA) Average difference 1.2±0.2 0.6±0.1 (Headset – Free mic.) Max difference 1.4 0.8 (Headset – Free mic.)

Table 5: Average level differences between Headset and free hanging microphones. Calculationsbasedonfiverecordingsdoneatthedriverside.

Low idle without Low idle with compressor working compressor working 600 rpm 600 rpm (level difference in (level difference in dBA) dBA) Average difference 0.2±0.1 0.2±0.1 (Headset – Free mic.) Max difference 0.3 0.3 (Headset – Free mic.)

AlargedifferencebetweenmeasurementsperformedwithHeadsetandfree hangingmicrophonesforthedrivingcasefullloadaccelerationcanbenoticed inTable4.TheHeadsetmeasureslevelslowerthanfreehangingmicrophones inboththefullloadaccelerationandconstantspeeddrivingcases.

LoweraverageandmaximumdifferencescanbenoticedinTable5forboth driving cases compared with values in Table 4. It can be noticed that the averagedifferencesarepositiveforlowidlewithoutcompressorworkingbut negativeforthecasewithcompressorworking.Howeverforalldrivingcases, thevariationinthemeasurementsof±0.10.2dBAissmallandaccordingly therepeatabilitywhenmeasuringwithHeadsetisrelativelygood.Moreover, thevariationinmeasuredlevelmightbecausedbydifferencesinthedriving conditionsratherthanbythemeasurementequipment.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 37

4.3.2 Spectralanalysis

Constantspeed85km/h:

InFigure26Aweightedspectrafromtheconstantspeed85km/hdrivingcase are presented for all measurement systems and free hanging microphones. Energyaveragesofthetwochannelsareobserved.

Binaural/Similar-binaural recording Free hanging microphones System

Driver side – Headset Driver side – Free hanging microphones

Passenger side – Measurement systems Passenger side – Free hanging microphones

Figure26:Aweightedspectrafromtheconstantspeed85km/hdrivingcase.Average ofthetwochannelsispresented.

Figure26showsthatJecklin’sresponsedecreasesmorequicklyabove1kHz thantheothermeasurementsystems.Wedgeishigherinlevelbetween0.81.2

38 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 kHzanddecreasesprominentaboveabout3kHzincomparisontotheother systems. Furthermore, one can see how well the shape of both Sphere and Friggo curves coincide with Manikin, especially the peaks at approximately 1.2kHzand3kHz.Moreover,shiftingthecurvesofSphereandFriggoafew decibelsupwouldgenerateaverygoodoverallcorrespondencewithManikin bothinshapeandlevel.

Inaccordancewithwhatcouldbeconcludedinthecomparisonsoflevelsin Table 4 and Table 5, the variations of the measurements with Headset are small.

Fullloadacceleration:

Aweighted spectra from the full load acceleration are presented for all measurement systems and free hanging microphones in Figure 27. The respective left and right channels of the measurement systems and the free hangingmicrophonesareaveragedtogether.

Binaural/ similar-binaural recording System Free hanging microphones

Driver side - Headset Driver side – Free hanging microphones

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 39

Passenger side – Measurement systems Passenger side – Free hanging microphones

Figure 27: Aweighted spectra from the full load acceleration. Average of two channelsispresented.

AsshowninFigure27,thespectrafromthefullloadaccelerationcasehavea slightly different shape compared to the constant speed case. However, the mutual relations of the measurement systems within the test group are similar.

Studying the frequency spectra of the measurements at constant speed 85 km/handfullloadaccelerationonefindsthatbothFriggoandSpherehave similar response as Manikin while Wedge and Jecklin show lower levels in thehighfrequencyrange.

Thedifferenceinmeasuredlevelbetweenthemeasurementsystemsandthe free hanging microphones was calculated. In Figure 28 this difference is presented for the driving cases: full load acceleration and constant speed 85km/h.Thereasonforplottingthisistoestimateafilterforprocessingthe data measured with the measurement systems to better correspond to that frommeasurementswithfreehangingmicrophones.

40 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32

Difference in measured level between measurement system and free hanging microphone, average of both channels (L eq, measurement system - Leq, free hanging microphones )

Full load acceleration

Constant speed 85 km/h

Figure 28: Difference in measured L eq between the measurement systems and free hanging microphones for full load acceleration and constant speed 85km/h.Averageoftwochannelsispresented.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 41 A very similar shape of the driving cases for each measurement system showed in Figure 28 is desired. This would mean that a filtering of the recordingsdonewiththemeasurementsystemscouldgivereasonableresults agreementwithmeasurementsusingfreehangingmicrophonesforalldriving cases.Moreover,ifagreementsarefoundforthedifferentdrivingcasesthis also should indicate similarities in the measured sound fields. However, Figure 28 shows quite clear differences in shapes both for the two driving casesandforthemeasurementsystems.

In Figure 29 the difference in measured level between Friggo and the free hangingmicrophonesarepresentedforalldrivingcases.

Figure 29: The level differences between Friggo – Free hanging microphones, all drivingcases.

AsseeninFigure29thedifferencesareapproximatelywithin±2.5dBforboth thefullloadaccelerationandtheconstantspeeddrivingcases.Thisrangeis somewhat larger for the two low idle driving cases, mostly because of the pronouncedpeaksthattheyhaveincommon.Thesepeaksoccurforexample at30,90and180Hz,wherethemeasurementsystemsmeasuredhigherlevels than free hanging microphones. The peaks in the low frequency range are most likely due to the mode shapes of the compartment interior. With a constantsourcesuchasthatatlowidlethemeasurements,themicrophones aremoresensitivetopositioningthaninamorefluctuatingsoundfield.Still, Friggo measured overall levels in good correspondence to free hanging microphonesforalldrivingcases,seeTable3.

42 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 5 Perceptual sensation of the measurementsystems

Therecordingsystemsusedinthisstudywereevaluatedinseverallistening tests.Inthefirsttest,thebinauralqualityoftherecordingswasinvestigated by examining how well the test participants could make use of the cocktail partyeffect.Questionsregardingtheabilityofsoundsourceseparationina recorded sound field consisting of several sources were also asked. In the second test the binaural quality of the recordings were investigated using a localization test. Only the directional location in the horizontal plane was examined.

Throughout the test integer response scales without any mean number was used.Thereasonwhythemeanvaluewasnotwrittenintheansweringscales, wastoavoidoverrepresentationofanswersinthatregion.

The test participants performed the test one by one. The listening test was designed in ArtemiS analysis software and was played back from a laptop throughheadphones,typeSennheiserHD600.Inthelisteningtestregarding incabnoisetheSoundQualitySimulatoratVolvowasusedforreproduction. TheinstructionsandquestionsusedinthetestarefoundinAppendixG.

The statistics used in the analysis of the listening tests can be found in AppendixI.

5.1 Localization

Toinvestigatethepsychoacousticalspatialcharacteristicsofthemeasurement systems, a localization test was used. The purpose of this test was only to examine the directional localization; accordingly the distance to the sound source was constant, and only the horizontal plane was considered. Recordingsweredoneinananechoicenvironmentplacingthemeasurement systemonarotatingtable.Arecordingofamale counting “one two” was usedasbasicstimuliandtheanglesofincidenceusedarepresentedinFigure 30.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 43 0˚ 340˚ 20˚

0º 50˚ 310˚ 330º 30º 70˚ 290˚ 300º 60º

270º 90º

240º 120º 120˚ 210º 150º 180º

210˚

Figure30:Anglesofincidencechosenforthelocalizationlisteningtest.

AscanbeseeninFigure30itwasdecidedtoinvestigatefrontalincidentmore thoroughly than sounds arriving from the back. The reason for this was mainlytoreducethenumberofsamplesinthetest.Still,inthechosendesign itshouldbepossibletoestimatethefrontbackconfusionthatmayariseinthe recordings.Theanglesforfrontalincidentwerechosencloselypositionedto beabletoevaluatethelocalizationperformanceofthemeasurementsystems forsmallangledifferences.Tofurtherdecreasethenumberofsamplesinthe listeningtest,onlyfourmeasurementsystemswerechosenforthelocalization test;Friggo,Sphere,WedgeandManikin.Topossiblyimprovetheabilityto decideifthesoundiscomingfromthefrontorbackuncorrelatedAweighted whitenoisewasaddedtotherecordings.Theplaybackorderwasrandomly chosenforallsystemsandangels.Theorderwasnotchangedamongthetest participants. All 13 participants listened to all measurement systems and angles.Attheendofthelisteningtestsomeadditionalquestionswereasked regardingtherecordingsat0 ˚, frontalincident.Thequestionsconcernedthe experiencedsourcedistanceandifthesoundwasexperiencedascomingfrom inside or outside the head. The sound pressure levels in playback were calibrated manually by tuning the levels of the recordings done at frontal incidencetobeperceivedasequallyloud.

Thestimuluswasreproducingwithaloudspeakerat5.30mdistancefromthe measurementsystems.Theloudspeakerandthemeasurementsystemwereat thesamelaterallevel.

44 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure 31: Measurement setup from the localization measurement in the anechoic chamber.Stimuliwerereproducedfromtheloudspeakeroutlinedinthe figure.

5.1.1 ResultsoftheLocalizationtest

Humanhearingisabletodetectdeviationsofabout2ºinthehorizontalplane from a forward direction with sinusoidal signals. For sound sources facing oneoftheearsthelocalizationabilityisreducedanddeviationsofabout10º are detectable [2]. In the following analysis the localization error, which is calculated as the difference between the actual direction and the answered directionwasregardedasarandomvariable.Thedistributionwasassumed to be a normal distribution. Unbiased estimators for sample mean x , and samplevariances 2,werecalculatedandsamplestandarddeviation,s,ofthe localizationerrorwerecalculatedasthesquarerootoftheestimatedvariance [16]. A 95% confidence interval centered atthe estimated global mean was introducedhavingawidthequalto3.92timesthestandarddeviation.

In Figure 32 all reported answers from the listening test are presented together with actual sound directions for the four studied measurement systems.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 45

Sphere Friggo

360 360

330 330

300 300

270 270

240 240

210 210

180 180

150 150

120 120

90 90

60 60 30 30 Perceived sound directionsPerceived (degrees) 0 Perceived sound directions Perceived (degrees) 0 30 60 90 120 150 180 210 240 270 300 330 360 0 0 30 60 90 120 150 180 210 240 270 300 330 360 Actual sound directions (degrees) Actual sound directions (degrees)

Manikin Wedge

3 6 0 360

3 3 0 330

3 0 0 300

2 7 0 270

2 4 0 240

2 10 210

18 0 180

15 0 150

12 0 120

9 0 90

6 0 60

3 0 30 Perceived sound directions(degrees) Perceived Perceived sound directions (degrees) Perceived

0 0 0 30 60 90 120 150 180 210 240 270 300 330 360 0 30 60 90 120 150 180 210 240 270 300 330 360 Actual sound directions (degrees) Actual sound directions (degrees)

Figure32:Judgeddirectionsinthehorizontalplaneforthefourmeasurementsystems understudy:reportedsounddirectionsvs.actualsoundstimulidirections.

Figure 32 shows that the frontback ambiguity is a considerable problem in the test. A special analysis to uncover this problem has been performed, where the localization error was calculated to be the smallest difference betweentheperceiveddirectionandthestimulusdirection.Table10givesthe resultsfromthisanalysis,wherefront/backlocalizationerrorisapercentage measureofhowmanytimesthemirroredpositionwasusedcomparedtothe total number of positions (also called frontback confusion). The 95% confidence intervals centered at the respective measurement system’s mean valuewerecalculated forthecaseoffrontbackconfusion mirroredandnot mirrored(normal).

46 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32

Table6::95%confidenceintervalscenteredatthemeasurementsystem’smeanvalue fornormalandfront/backcompensatedlocalizationerrors. Front/back reversalsaremirroredbacktopositionsclosertostimulidirections.

Front/back Normal Mirrored localizationerror Wedge [184.2 ˚;144.5 ˚] [53.1 ˚;44.3 ˚] 68% Sphere [168.3 ˚;133.3 ˚] [49.7 ˚;50.4 ˚] 54% Friggo [139.3 ˚;108.2 ˚] [55.8 ˚;45.3 ˚] 47% Manikin [130.7 ˚;123.7 ˚] [45.7 ˚;47.6 ˚] 44%

A highly pronounced bias in the frontback confusion is clearly visible in Table 6. The least percentage of mirrored answers was obtained for the Manikin,thesmallest95%confidenceintervalformirroredanswerswerealso obtained using the Manikin. However, the results regarding the frontback confusionaresopoorandcloselypositionedthatitsuggeststhatthelisteners in the test reported the frontback perception by mere chance. Moreover no conclusionsofwhichmeasurementsystemthatperformedbest,withrespect tofrontbackconfusion,canbedrawn.

Figure34showsthemeanvaluesofthefront/backcompensatedlocalization errors and associated standard deviations for each measurement system. Actual stimuli directions are plotted on the xaxis. In Table 7 statistical characteristicsusedintheFigure33canbefound.Positivelocalizationerror equals an underestimated angle and consequently a negative localization errorequalsanoverestimatedangleofperception.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 47 Average front/back compensated localization errors together with associated standard deviation

70

50

30 Sphere 10 Wedge -10 Friggo

-30 Manikin

Localization (degrees) error -50

-70 0 20 50 70 120 210 290 310 340

Actual direction (degrees) Figure33:Meanvaluesofthelocalizationerrorswithstandarddeviationsforthefour concerned measurement systems plotted over actual directions. The blackhorizontallineindicatestheglobalaveragelocalizationerrorand theredhorizontallinesindicateitsassociated95%confidenceinterval. Valuesarecompensatedforfront/backlocalizationerrors.

Table7:Statisticalcharacteristicsforthelocalizationerrorobtainedforallanswersin thelocalizationlisteningtest.

averageerror( xglobal ) 2.1 ˚

Standarddeviation( s) 24.9 ˚

95%confidenceinterval [51.0 ˚;46.8 ˚]

To conclude if any of the measurement systems could be considered as performingbetterthantheothersanANOVAwasconducted.However,the ANOVA showed that no measurement system could, at any relevant confidencelevel,performbetterthantheothersinthelocalizationtest.

5.2 Sourceseparationability

Recordings were performed in the lecture room of Applied Acoustics at Chalmers to uncover the source separation ability. This was done in a comparison test in which the Manikin was used as reference. The test participantswereaskedtojudgeiftheirabilitytoseparatesoundsources,i.e. toperceivethedifferentlocationsofsoundsourcesinarecordedsoundfield, was better or worse for sound samples recorded with the measurement

48 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 systems compared to a reference. An interactive listening test design was usedwhichallowedthetestparticipantstolistentothesamplesanarbitrary numberoftimes.Theadvantageofthissystemisthatitreducestheinfluence oftheplaybackorderofthemeasurementsystems.Allmeasurementsystems exceptthefreehangingmicrophoneswereconsideredinthetest.

Fiveloudspeakerswereusedtoreproducespeech,placedatvariousdistances intheroom(9x6m)andatvariousdirectionsrelativefromtothemeasuring point.Tocreateamurmuringacousticalenvironmentthestimulichosenwere Danish, American English and British English voices both male and female talking about various subjects. The speech played back in the test where uncorrelated and adjustments in levels where done subjectively in order to haveaseparationabilityinthemeasurement.Thesamesoundwasrecorded forallmeasurementsystems.

Figure34:Measurementsetup,seenfromtherearoftheroom.Friggoisplacedinthe measuringlocation.

Figure35:Measurementsetup,seenfromthefrontoftheroom.Friggoisplacedinthe measuringlocation.

5.2.1 ResultsoftheSourceseparationabilitytest

Figure 34 shows the results regarding the source separation of the measurement systems in comparison with the Manikin. The scale ranged

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 49 from 1 (much lower source separation ability) to 6 (much higher source separation ability) and a rate of 3.5 implied equal source separation ability. Onlyonerecordingpositionwasusedinthelisteningtest.

Source separation test

7

6

5

4

3

2 Ratesourceseparation of

1

0 ORTF Headset Sphere Friggo Jecklin Wedge Figure 36: Ratings of the source separation ability in comparison to the Manikin presentedtogetherwithmeanvalues(dashedlines).

In Figure 36 the highest mean value of the source separation ability can be seenforFriggo.FriggoandORTFweretheonlymeasurement systems that werethetestpersonsreportedasprovidingabettersourceseparationability thantheManikin,howeverthisisnotstatisticallyensured.Thelowestmean valuewasreceivedforWedge.Greatvariancescanbenoticedintheanswers.

ANOVA showed significant difference in source separation ability (F = 2.76, p < 0.05) forthemeasurementsystemsincomparisontoManikin.

FromTukey’smultiplecomparisontest(seeTable10)withrangestatistic,q (6.132)=4.1and Tukey’slimit :0.9nosignificantdifferenceswasfound.

Nostatisticallysignificantdifferencescouldbefoundbetweentherecording systems in the listening test. However, Friggo received a mean rate that differedfromJecklinwiththesameamountastheTukey’slimit.

50 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Table 8: Differences between mean values of the answers in the listening test regarding the source separation ability. Significant differences are

enframed. µtot is the average response and (µi – µj) the difference betweenthecolumn µtot row µtot

ORTF Sphere Jecklin Headset Friggo Wedge

µtot 3.9 3.5 3.2 3.4 4.0 3.3

( µi–µj) * 0.4 0.7 0.6 0.1 0.7 ORTF

* 0.3 0.2 0.5 0.3 Sphere

* 0.2 0.9 0.1 Jecklin

* 0.7 0.1 Headset

* 0.8Friggo

5.3 Cocktailpartyeffect

Thecocktailpartyeffectwasinvestigatedbecauseofthattheabilitytouseit inplayback,togreatextend,reliesonthebinauralrecordingqualitiesofthe recording systems. The cocktail party effect was examined by studying the abilitytoperceiveacertaincombinationofnumbersinthebackgroundnoise of human speech. The reason for using human speech is to achieve a well recognizedacousticalenvironment.

Therecordingsusedforexaminingthecocktailpartyeffectweredoneinthe lunch room at Volvo. In total six loudspeakers at various distances, heights and directions from the recording systems were used. Five loudspeakers reproduced speech, while the sixth reproduced a set of numbers. The same loudspeaker was used in all recordings to reproduce the combination of numbers.Thelocationsandlevelsofallloudspeakerswereheldconstantfor allrecordingsystems.Therecordingsweredonehavingtherecordingsystem in the same position and directed equally in all recordings. The test participantswereaskedtowritedownthenumbersthatwerespokenfroma malevoice.Thecombinationofnumberswasrepeatedonceforeverysystem. Different combinations of the numbers (19) were used for the different recordingsystems.Theorderofwhichthedifferentrecordingsystemswere played back in the test was randomly chosen and arranged into six test groups.Tofurtherreducetheinfluenceoftheplaybackorderoftherecording systems,allthetestparticipantswereabletofamiliarizethemselveswiththe soundbylisteningtoarecordingdonewithfreehangingmicrophoneswhich were not included in the test. A total of 23 subjects participated in the test, aboutfourpersonsineachtestgroup.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 51 Besides writing down the interpreted number combination, the test participantwasalsoaskedtomarktheexperiencedintelligibilityofthemale voiceonascalerangingfromonetosix.

Lsp

Figure37:Measurementsetupofthecocktailpartyeffectrecording,twoloudspeakers isoutlinedinthefigure.

5.3.1 Resultsfromtheperceptionofnumbers

Correctly perceived numbers in the Cocktail party effect test

10

9

8

7

6

5

4

3

2 Correctly perceived numbers perceived Correctly 1

0 ORTF Headset Sphere Manikin Friggo Jecklin Wedge

Figure 38: All listening test answers from the perception of numbers in the cocktail party effect test presented together with the mean value. The maximum numbers able to perceive was nine.

52 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Jecklinreceivedthehighestmeanvalueofcorrectlyperceivednumbers.The resulting mean values for Wedge, Friggo and Wedge were all very close in leveljustbelowJecklin.Spheregotthelowestmeanvalueinthetest.Large spreads can be noticed for all recording systems, particularly for Headset, ManikinandFriggo.

Tobeabletodiscoveriftheplaybackorderhadaninfluence,westudiedthe answersdividedintotestgroups.TheresultcanbefoundinFigure39.

Results from the Cocktail party effect test presented in test groups

10

9 ORTF 8 Headset 7

Sphere 6

5 Manikin

4 Friggo

3 Jecklin Correctly perceived numbers perceived Correctly 2 Wedge 1

0 Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 1. Friggo 1. ORTF 1. Headset 1. Friggo 1. ORTF 1. Sphere 2. ORTF 2. Headset 2. Wedge 2. Manikin 2. Jecklin 2. Wedge 3. Headset 3. Sphere 3. ORTF 3. ORTF 3. Sphere 3. ORTF 4. Sphere 4. Manikin 4. Sphere 4. Wedge 4. Headset 4. Headset 5. Manikin 5. Friggo 5. Jecklin 5. Sphere 5. Wedge 5. Jecklin 6. Jecklin 6. Jecklin 6. Manikin 6. Jecklin 6. Manikin 6. Manikin 7. Wedge 7. Wedge 7. Friggo 7. Headset 7. Friggo 7. Friggo

Figure39:Allanswersfromtheperceptionofnumbersinthecocktailpartyeffecttest presentedintestgroups.

Figure 39 suggests that it can be suspected that some answers are affected negatively by early play order positions. For example unexpected answers canbeseenforFriggointestgroups1and4.Intestgroup5oneoutlierfor ORTFcanbenoticed.TheseoutliersforORTFandFriggocanpresumablybe explained by them being the first measurement system in the playback. Furtherinvestigationneglectingtheoutlierscanbefoundinalatersection.

From ANOVA, significant difference was found for the reported numbers (F = 5.94, p < 0.05) forthemeasurementsystems.

Further, a test according to Tukey’s procedure was done. The differences exceedingtheTukey’slimitaremarkedinred.TocalculatetheTukey’slimit

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 53 therangestatistic qk,v for k=7 meanvaluesand v=154 degreesoffreedomat a significance level of 0.05 were used. The range statistic can be found in tables, q(7.154)=4.2.

Tukey’slimit: 1.3

From Tukey’s multiple comparison (Table 9) it was found with 95% confidencethat:

ORTF,Friggo,JecklinandWedgeperformbetterthanSphere.

JecklinandWedgeperformbetterthanManikin

Table 9: Differences between the mean values of the measurement systems for listening test regarding the perception of numbers. Significant

differences are enframed. µtot is the average response and (µi – µj) the differencebetweenthecolumn µtot row µtot

ORTF Headset Sphere Manikin Friggo Jecklin Wedge

µtot 7.7 6.9 6.0 6.4 7.5 8.0 7.9

(µi–µj) * 0.8 1.7 1.3 0.2 0.3 0.2 ORTF

* 0.9 0.4 0.7 1.1 1.0 Headset

* 0.4 -1.5 -2.0 -1.9 Sphere

* 1.1 -1.5 -1.4 Manikin

* 0.4 0.4Friggo

* 0.1Jecklin

5.3.2 Resultsfromtheintelligibilityofnumbers

Ratings of the intelligibility in the cocktail party effect test are presented in Figure40andFigure41.InFigure41theresultsaredividedintotestgroups. Thescaleusedrangedfrom1to6,aratingof1meantthatthenumberswere hardlyintelligible,6meantthatthenumberswereclearlyintelligible.

54 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Intelligibility results in the Cocktail party effect test

7

6

5

4

3

Rate of intelligibilityRate 2

1

0

ORTF Headset Sphere Manikin Friggo Jecklin Wedge

Figure40:Ratingsofintelligibilityinthecocktailpartyeffecttestpresentedtogether withmeanvalues.

Intelligibility results from the Cocktail party effect test presented in test groups

7

ORTF 6

Headset 5 Sphere

4 Manikin

3 Friggo Rate ofRate intelligibility 2 Jecklin

Wedge 1

0 Group 1 Group 2 Group 3 Group 4 Group 5 Group 6 1. Friggo 1. ORTF 1. Headset 1. Friggo 1. ORTF 1. Sphere 2. ORTF 2. Headset 2. Wedge 2. Manikin 2. Jecklin 2. Wedge 3. Headset 3. Sphere 3. ORTF 3. ORTF 3. Sphere 3. ORTF 4. Sphere 4. Manikin 4. Sphere 4. Wedge 4. Headset 4. Headset 5. Manikin 5. Friggo 5. Jecklin 5. Sphere 5. Wedge 5. Jecklin 6. Jecklin 6. Jecklin 6. Manikin 6. Jecklin 6. Manikin 6. Manikin 7. Wedge 7. Wedge 7. Friggo 7. Headset 7. Friggo 7. Friggo Figure41:Ratingsofintelligibilityinthecocktailpartyeffecttestpresentedintest groups.

When studying the results in Figure 40 and Figure 41, one can note a considerablevariationintheanswers.Itcouldbepresumedthatagoodresult fromtheperceptionofnumberstestwouldimplyagoodperformanceofthe

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 55 recordingsystemintheintelligibilitytestaswell.However,theresultsfrom thetwotestswerefoundnottobeentirelycorrelated.Inagreementwiththe perceptionofnumbers,themeanvaluesforORTF,Friggo,JecklinandWedge are closely levelled. Thus, in contrary to the test of perception of numbers, whereJecklingotthehighestmeanvalue,Wedgereceivedthehighestratings ofintelligibility.ThelowestmeanvalueinthistestwasreceivedforManikin and Headset got a mean value just slightly higher. It can be noticed that Sphereperformedbetterinthistestthaninthetestofperceptionofnumbers. Apparently,inspiteofthepoorresultinprevioustest,theintelligibilitywith Spherewasexperiencedasbetterrelativetotheotherrecordingsystems.

FromANOVA,significantdifferencewasfoundforintelligibilityofnumber combinations (F = 2.158, p < 0.05) forthemeasurementsystems.

FromTukey’smultiplecomparisontest(seeTable10)withrangestatistic:q (7.154)=4.2and Tukey’slimit :0.9,itwasfoundwith95%confidencethat:

WedgeperformsbetterthanManikin.

Table10:Differencesbetweenthemeanvaluesofthemeasurementsystemsforthe listening test regarding the intelligibility of numbers. Significant

differences are enframed. tot = average response and (µi – µj) = the differencebetweenthecolumntot row tot

ORTF Headset Sphere Manikin Friggo Jecklin Wedge

µtot 3.5 2.8 3.2 2.7 3.5 3.5 3.7

(µi–µj) * 0.7 0.3 0.8 0.1 0.0 0.2 ORTF

* 0.4 0.1 0.7 0.7 -0.9 Headset

* 0.5 0.3 0.3 0.5 Sphere

* 0.8 0.8 -1.0 Manikin

* 0.1 0.3Friggo

* 0.2Jecklin

5.3.3 Results,disregardingdeviantanswers

In Figure 42 the answers from the perception of numbers is shown again. Some encircled deviant answers can be found and in many cases these deviants can be concluded to be connected with an early position in the playbackorderofthesamples.Theplaybackorderpositionfortheoutlying

56 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 answers is outlined in the figure. To estimate the influence of outliers these wereremovedfromtheanalysisandanewANOVAwasdone.

Correctly perceived numbers in the Cocktail party effect test

10

9

8

7

6

5 1st sample 4 1st sample 2nd sample 3

2 rd

Correctly perceived numbers perceived Correctly 3 sample 1

0 ORTF Headset Sphere Manikin Friggo Jecklin Wedge

Figure42:Answersfromperceptionofnumbers,cocktailpartyeffectlisteningtest. Deviantanswersarecircledwithredandtheplaybackorderpositionis outlined.

Mean values and standard deviations of the answers in the part of the listeningtestregardingtheperceptionofnumbersarepresentedinFigure43,.

Correctly perceived numbers in the Cocktail party effect test

10 9 8 7 6 5 4 3 2 Correctly perceived numbers 1 0 ORTF Headset Sphere Manikin Friggo Jecklin Wedge Figure43:Numberofperceivednumbersinthecocktailpartyeffecttest,with9as maximumofcorrectanswers.Deviatinganswersdisregarded.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 57 OutliershavebeenremovedfromtheanalysisshowninFigure43andsome changes can be noticed compared to earlier results. The best perception of numbers is received for ORTF instead of Jecklin that got the best result in earlieranalysis.ThestandardvariationassociatedwiththeORTFissmallin comparisontotheotherrecordingsystems.

The ANOVA shows, as suspected, significant differences for perception of numbers for the recording systems even when the deviant answers were disregarded.

From Tukey’s multiple comparison test (see Table 11) with range statistic from table, q (7.149 ) = 4.2 and Tukey’s limit : 1.2 it was found with 95% confidencethatforperceptionofnumbers:

ORTFperformsbetterthanHeadset,SphereandManikin.

Friggo,JecklinandWedgeperformbetterthanSphereandManikin.

Table 11: Differences between mean values, disregarding deviants, in the listening test regarding the perception of numbers. Significant differences are

enframed. tot =averageresponseand (µi – µj) = thedifferencebetween thecolumntot row tot

ORTF Headset Sphere Manikin Friggo Jecklin Wedge

µtot 8.7 7.0 6.2 6.4 8.0 8.0 7.9

(µi–µj) * 1.6 2.5 2.3 0.7 0.7 0.8 ORTF

* 0.8 0.6 0.9 0.9 0.8 Headset

* 0.2 -1.7 -1.7 -1.7 Sphere

* -1.5 -1.5 -1.4 Manikin

* 0.0 0.1 Friggo

* 0.1 Jecklin

In Figure 44 the mean value of the intelligibility ratings and standard deviationseliminatingoutliersforthecocktailpartyeffecttestarepresented.

58 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Intelligibility results in the Cocktail party effect test

6

5

4

3

Rate of intelligibilityRate 2

1

0 ORTF Headset Sphere Manikin Friggo Jecklin Wedge Figure44:Answersfromtheratingsofintelligibilityinthecocktailpartyeffecttest (scale16),eliminatingoutlyinganswers.

No particularly changes can be noticed in Figure 44 compared to the case includingallanswersinFigure40.

The ANOVA shows, as suspected, significant difference between the recordingsystemsconcerningtheintelligibilityofnumbersalsowhendeviant answersaredisregarded.

From Tukey’s multiple comparison test (see Table 12) with range statistic from table, q ( 7.149 ) = 4.2 and Tukey’s limit : 0.9 it was found with 95% confidencethat:

FriggoandWedgeperformbetterthanManikin.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 59

Table12:Differencesbetweenmeanvalues,eliminatingoutliersinthelisteningtest regarding the intelligibility of numbers. Significant differences are

marked in read. tot = average response and (µi – µj) = the difference betweenthecolumntot row tot

ORTF Headset Sphere Manikin Friggo Jecklin Wedge

µtot 3.6 2.8 3.2 2.7 3.7 3.5 3.7

( µi–µj) * 0.8 0.4 0.9 0.1 0.1 0.1 ORTF

* 0.4 0.1 0.9 0.7 0.9 Headset

* 0.5 0.4 0.3 0.5 Sphere

* -1.0 0.8 -1.0 Manikin

* 0.2 0.1 Friggo

* 0.2 Jecklin

5.3.4 Spectralanalysisofthesoundsinthenumberstest

Nearlyallinformationofspeechiscontainedinthefrequencyrangeof200Hz 6000 Hz i.e. 213 Bark. The frequency dependency of the hearing is dependent on the position along the basilar membrane where a wave oscillationisexcitedbyanincomingsoundwave.Observationshaveshown that this frequency selectivity of the basilar membrane can be assumed to followthecriticalbandscale(Barkscale).TheBarkscaleprovidesforexample informationofthemaskingconditionsofarecordedsoundandithasacloser connection to the human perception of sound than other frequencyband analysismethods.AtableoftheBarkscalecanbefoundinAppendixH.

In order to understand the results from the listening test, a study of the spectra of the numbers on a critical band scale was done. The analysis concernedonlythesoundfilesusedforplaybackinthelisteningtest,thusno influencefromloudspeakersorthemeasurementsystemswereincluded.

In Figure 45 and Figure 46 we see the spectrum for each number in comparison to the average background “murmur”. It is important that the background“murmur”istimeaveragedoveralongertimethanthelengthof the spoken number. Generally it can be seen that the numbers have a high spectralcontentinthelowfrequencyrange.

60 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Figure45:Thenumberspectraincomparisontothebackgroundlevel.

Figure46:Thenumberspectraincomparisontothebackgroundlevel.

Acomparisonoftheperceptionofnumbersisshownbelow.Itisusedasbase forfurtherinvestigationofmaskingphenomena.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 61 Perception of numbers used in the Cocktail party effect test (not ORTF)

140

120

100

80

60

40

20 Number of correct perceptions Number of correct

0 1 2 3 4 5 6 7 8 9 Number Figure 47: Number of correctly perceived numbers, 6 measurement systems (not includingORTFandFreehangingmicrophones).

Figure47showthatnumber7isgenerallyeasytohearasare8and4,whileit is more difficult to hear number 3. Figure 48 shows the results for each measurementsystemseparately.

Perception of numbers in the Cocktail party effect test

25

20 Sphere Dummy

15 Friggo Jecklin Wedge 10

5 Number of correct perceptions Number of correct

0 1 2 3 4 5 6 7 8 9 Number Figure48:Perceptionofdifferentnumberspresentedforeachmeasurementsystem separately.

Figure48showsahighnumberofcorrectanswersforbothJecklinandFriggo. Manikinhasexceptionallylownumberofcorrectanswersfornumber3which mightbeexplainedstudyingthespectrafurther.

Spectralinvestigationwasperformedfornumber3and7.Themeasurement systemsJecklin,Manikin,FriggoandWedgewereanalysed.Inthefollowing

62 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 figures loudness levels divided into bark bands for the numbers and the simultaneousbackgroundnoiseareshown.

Figure 49: Number 7 in listening test for Wedge and Manikin together with backgroundnoise.

Since the analysis only concerns the original sound files there will be no differencebetweenthefrequencycontentofthesamenumbers,thereforethe turquoise curve is hidden by the black curve. In Figure 49 the level of backgroundnoiseis6sonehigherforManikinthanWedge,thiscanexplain the better result of Wedge seen previously in the listening test. The 7 has slightlyhigherlevelsforManikinthanWedge,0.2sone,butitcanprobablybe neglected.

Figure50:Number7forFriggoandJecklintogetherwithbackgroundnoise.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 63 ThebackgroundnoiselevelforFriggoandJecklincoincidewellfornumber7. This can be connected to previously good results of number of correct answersinFigure48.

Figure51:Number3forFriggoandJecklintogetherwithbackgroundnoise.

Whenrecordingnumber3thebackgroundnoisedeviatedmore.Friggohad higher levels above 17th bark than Jecklin but lower levels between 710 barks, see Figure 51. Still their result in listening test did not deviate too much,Jecklinhadslightlybetterresults.

Figure52:Number3forWedgeandManikintogetherwithbackgroundnoise.

64 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 The background noise for Manikin is at some Bark bands higher than for Wedge,seeFigure52.TheresultsfromthelisteningtestshowthatWedgeis themeasurementsystemachievingmostcorrectanswersandManikinisthe measurementsystemwithlessnumberofcorrectanswersregardingnumber 3.Thiscannotbeexplainedbydifferencesinthebackgroundnoise. 5.4 Recordedtrucksound

To estimate the performance of the measurement system in their intended acousticalenvironment,recordingsofincabnoiseatthepassengerseatwas doneattheHälleredprovingground.

ThemeasurementweredoneaccordingtoVolvostandards(GDI96424)but onlyincludedthedrivingcases:

 Full load acceleration

 Constant speed 85km/h

 Low idle speed ,withoutcompressorworking

 Low idle speed ,withcompressorworking

The measurements were done during one day with acceptable weather conditionsandlowwind.Thetemperaturewas4 oC.

Basedontheresultsfromthepreviouslisteningtests,thefourbestsystems, Wedge,Jecklin, Sphere and Friggo were selected for measurement of incab noise. Simultaneous measurements were done with the Headset at driver positionandfreehangingmicrophonesatbothdriverandpassengerside.

The constant speed 85 km/h and full load acceleration driving cases were chosenforthelisteningtest.Inthetesttheparticipantswereaskedtoratethe binaural quality of the recordings performed with the recording systems in comparison with recordings done with Manikin. The test participants were alsoaskedtoordertherecordingsdonewiththerecordingsystemsaccording toincreasingbinauralrecordingquality.Theparticipantswerefreetolistento thesoundsamplesas manytimesastheywished.Thesound sampleswere played back with Head Acoustics playback system. In the playback, the recordings done with the developed recording systems were only compensatedforthespectralinfluenceoftheheadphones.TheArtificialhead andHeadsetrecordingswereplayedbackusingtheHeadAcousticIDfilter. ThelisteningtestwascarriedoutatVolvoandninepersonsamongthesound engineeringstaffparticipated.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 65 5.4.1 Resultsfromincabtest

Figure53showstheresultsfortheincabnoiselisteningtest.Thepositioning ofthemeasurementssystemsinrelationtoeachotherislinkedtothenumber oftimestherecordingsystemsweregradedtoacertainmutualposition.

Constant Speed 85 km/h

Jecklin Sphere

Wedge Friggo Poor binaural Good binaural recording quality recording quality Full load acceleration 990-1620 rpm

Jecklin Sphere

Wedge Friggo Poor binaural Good binaural recording quality recording quality Total

Jecklin Sphere

Wedge Friggo Poor binaural Good binaural recording quality recording quality Figure53:Resultsfromtheincabnoiselistening test. The uppermost and middle figures present the result from the constant speed andthe full load acceleration driving cases, respectively. The result for both driving casesweightedequallytogetherarepresentedinthebottomfigure.The positioning of the measurement systems is based on the mutual positioninginthetests.

Theresultsofthelisteningtestshowsacleardifferenceofperceivedbinaural recording quality between the pairs Jecklin, Wedge and Sphere, Friggo. In totalFriggoachievedthebestresults.However,SphereandFriggowerevery closelypositionedandfortheconstantspeeddrivingcasetheyevenreceived equal results. Jecklin and Wedge can be noticed to have received the same ratingsforthefullloadaccelerationdrivingcase.

66 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 6 Discussion

Themeasurementwithreferencefangaveafirstimpressionofdirectivityand symmetryofthemeasuredlevelatbothchannelsforeachrecordingsystem. Thefrequencyspectraforthedifferentsystemscoincidewellbelow500Hz. At higher frequencies the recording systems shows more diverging spectral behaviour. At frequencies where the wavelength is longer than the acoustic distance between the microphones positions, influence is seen for: Jecklin, Friggo~30cm=>f~1130Hz,Wedge~38cm=>f~902Hz.Consequently theshapeofWedgewillinfluencetherecordingslessthantheotherrecording systems.Inthemeasurementfrombehind,theinfluenceoftheshapeofthe recording system was particularly seen for Wedge, for which the sound pressureleveldecreasedclearlyinhigherfrequencies.Thisisprobablydueto ashadowingeffectcausedbythewiderbackofWedge.

IfwecompareourdesignedmeasurementsystemswithManikinandHeadset there are more resonances in the high frequency range for the commercial equipment.Thiscanbeexplainedbyinfluencefromthemoredetailedshape oftheManikinheadandtheexistenceoftorsoandpinnaewhichisnotused intheotherrecordingsystems.

HRTF measurements were performed to discover possible similarities and differences between the binauralsimilar recording systems and Manikin. Ratherbigdifferencescouldbeobservedregardingthedirectionalproperties of the recording systems. Generally, over the entire measured frequency range, the ILD for Manikin achieved its maximum values for angles of incidentclosetotheears.Thisisinagreementwithwhatshouldbeexpected. However,thistendencyalsocouldbenoticedforManikinatlowfrequencies such as 1 kHz. Compared to Manikin, the ILD measured for Friggo and Spherewereconsiderablymorescatteredoverangleofincidents,especiallyin the frequency range below 8 kHz. The ILD could be seen for all recording systems to reach its maximum at 8 kHz and then decrease for higher frequencies. This was not expected as ILD is supposed to increase for increasingfrequency.TheinfluencefromthemoredetaileddesignofManikin wasclearlyvisible,assignificantlylargerchangesinphasewereobservedfor ManikincomparedtoFriggoandSphere.

Designingthelisteningtestregardingthecocktailpartyeffectwasdifficult.It was particularly difficult was to adjust the levels of the loudspeaker in the setup to reach the limit of intelligibility. In the test it was considered as an advantage to use a masker which the test participants could relate to. Thereforethecocktailpartyeffectwasexaminedbystudyingtheintelligibility ofnumbercombinationsinamurmurofspeech.Possiblebiasfromthatthe numbersbeingperceiveddifferentlywasavoidedbyusingallnumbersinall combinations.However,becauseofthenonstaticbackgroundnoisenumbers couldpossiblebemaskeddifferently.Fromtheinvestigationofthemasking

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 67 ofnumbersitseemedlikeitisnotonlythelevelofthebackgroundnoisethat hasaninfluenceontheperception.Fromthatanalysisitcouldbeconcluded that lower background noise does not necessarily give better result. This couldindicatethatthereareotherparametersthanloudnessthatinfluenceon theresult,i.e.thebinauralrecordingqualityoftherecordingsystems.

All results from the listening test regarding subjective ratings have to be analysedcarefully,especiallywhenthevariancesareasconsiderableasseen inourresults.Thiswasalsonoticedinthemultiplecomparisontestswhere we only could observe significant differences between a few measurement systems.

Generally throughout the listening test neither the Manikin nor Headset performed as well as one could expect, in comparison to the more simple recording systems. In many cases they even had worse results. From the statisticalanalysisoftheperceptionofnumbersinthecocktailpartyeffecttest one could say with 95% confidence that Jecklin, Friggo, ORTF and Wedge performed better than Sphere and thatJecklin and Wedge performed better than Manikin. Moreover the results regarding the intelligibility of numbers indicated a significantly better result for Wedge compared to Manikin. The goodresultinthecocktailpartyeffecttestbyORTFcouldpartlybeexplained bythedirectionalityofthemicrophoneswhichwaslargeinthedirectionof theincidenceofthenumbercombination.

Ameasurementsystemgivinggoodsourceseparationabilityisdesirableto enableperceptionofdifferentsoundsourcesinasoundfield.Unfortunately, theresultsfromthesourceseparationtestwerenotasclearasdesired.The reason could be either thatthe questions asked were hardto understand or thatthereactuallywereonlysmalldifferencesamongtherecordingsystems concerning this acoustical property. Noticeable is that Jecklin and Wedge, which obtained god results in the listening test regarding perception of numbers,receivedcomparativelylowratingsinthistest.However,thiscould notbesupportedbytheANOVAtest.

As expected, frontback confusion caused a lot of errors in the localization listeningtest,whichisawelldocumentedprobleminliterature.Answersthat wereinfluencedbyfrontbackconfusionwereseenforallrecordingsystems, but least frequently occurring for the Manikin. According to the results, soundswithfrontalandperpendicularanglesofincidentwerelessdifficultto locatethanintermediateangles.Thismightpartlybebecauseofthatlisteners tendtoperceivesoundascomingfromanglescloseto0 oandmultiplesof90 o. In literature, frontal incidence is reported to have better accuracy than perpendicularincidence[20].However,becauseofthelargevariancesinour testnothingcouldbeconcludedregardingthis.

From the incab listening test it was a clear difference between the pairs Jecklin, Wedge and Sphere, Friggo. Common for Jecklin and Wedge is that

68 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 they include absorption material which might give difficulties in perceiving the binaural properties. When we listened to the recordings of Jecklin and Wedge we experienced them as more damped than the recordings with SphereandFriggo.Asinpreviouslisteningteststhebinauralpropertiesare difficulttoexamine,inthistestwethereforedecidedtocomparethebinaural similarrecordingsystemsandtheManikin.Thissimplifiedthetestprocedure in the way that instead of letting the test participants estimate the binaural quality of the recordings they could judge the similarity between the measurement systems. Also, in this way the test was independent on the participant’spreviousexperienceoftrucks.

AsitwassuspectedthatthepresenceofabsorptionmaterialonWedgemay have an influence on the perceived binaural recording quality a swift investigation was performed. Recordings of music in a truck compartment were made with Wedge having the absorption material added and taken away.Whenlisteningtotherecordingsanaudibledifferencebetweenthetwo casescouldbenoted,howeveritishardtoestimatetheimpactofthisonthe perceivedbinauralquality.

InTable13theresultfromthemeasurementsthatwereexperiencedasmost important,bothinstrumentalandperceptual,forrespectiverecordingsystem canbefound.Theresultsinthetablearebasedonlyontheaverageratefrom the listening tests and hence not statistically proven. From the basis of this tableitwouldbepossibletodesignatethemostappropriaterecordingsystem.

Table 13: The performance of the concerned recording systems in all decisive measurements performed in the thesis. The recording system which performed the best and second best respective worst and second worst are marked with ++ and +, respective – and -. The recording system that was found to perform best is circled in red.

Recording Source separation Cocktail party In-cab noise In-cab noise level system test effect test listening test measurements

ORTF + ++ notconcerned notconcerned

Sphere +

Wedge +

Friggo ++ + ++ ++

Jecklin

In Table 13 it can be seen that Friggo achieves the best result including all performedmeasurements.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 69 7 Conclusions

Byperformingrecordingsovera360ºsweep,allmeasurementsystemscould be concluded to measure symmetrically at both channels. In the reference measurementsWedgeandJecklinobtainedthebestspatialresemblancewith recordingsdonewithfreehangingmicrophones.

In the HRTF measurements rather big differences could be observed regarding the directional properties between the binauralsimilar recording systems and Manikin. A prominent feature of Manikin was that the largest valuesofILDwereachievedforanglesofincidenceclosetotheears.Thisdid notappearasclearforSphereandFriggo.

InthelisteningtestregardingtheperceptionofnumbersWedge,Jecklincould be concluded with 95% statistically certainty to have performed better than theManikin.Intheintelligibilityratingstheresultsweresomewhatdifferent, hereonlyFriggoandWedgewereperceivedasbeingbetterthanManikinat the same level of confidence. The best mean result in the listening test regardingthesourceseparationabilitywas obtainedforFriggo.Therewere nosignificantdifferencesfoundfromthesourceseparationtest.

The listening test regarding localization ability only showed the very pronounced source of error deriving from the frontback confusion. Moreover, the ANOVA results could not state any influence on the localization ability from the choice of recording system. The frontback confusion was least pronounced for Manikin, still 44% of the perceived locationswerefrontbackconfused.

The analysis of the recordings of incab noise showed that the Manikin measured a channel averaged L eq level which differed the most from measurementswithfreehangingmicrophones.Thelevelmeasuredwiththe Manikinwasapproximatelyonedecibelhigherthanmeasurementswithfree microphones. The minimum difference was received for Friggo. The measurement stability of the Headset was also examined. An average deviationfromthemeanof±0.1dBwasconcluded.Someconclusionscould bedrawnfromthelisteningtestregardingthebinauralqualityofincabnoise recordings.Friggoshowedtheoverallbestratingsinthetest,howeververy closelyfollowedbySphere.Forrecordingsofincabtrucknoise,Friggoand SpherewereperceivedassignificantlybetterthanJecklinandWedge.

BecauseofitsgoodbinauralrecordingqualitycomparedtotheManikinand reliablelevelmeasurements,Friggoisrecommendedtobeusedforrecordings of incab noise. It performed well in all listening tests. Moreover Friggo measured, in comparison to the artificial head, incab noise levels less diverging from measurements done with free hanging microphones in

70 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 comparisontotheothersystems.Thus,furtherinvestigationisrecommended toobtainloudnesslevelsinplaybackcorrespondingtotherealsituation.

During the progress of the thesis work difficulties were experienced regarding the estimation of the binaural quality. The binaural quality is a complex acoustical property which especially has appeared during the performance of the listening tests. Another general conclusion is the pronouncedinfluencefromthepropertiesofthesoundfieldwhenmeasuring with binaural recording systems. Therefore, level measurements with binaural recording equipment can not be considered as equivalent to measurementsperformedwithsinglemicrophone.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 71 References

[1]HeadAcoustics(2004):Datasheet,SQuadriga(Code1369).

[2] Zwicker,E,Fastl,H(1999):Psycoacoustics–FactsandModels,Germany

[3]Ossietzky,C. Common principles of across channel processing in binauralandmonauralhearing,DTU,Oldenburg. [4] Pulkki, V, (2002): Microphone Techniques and Directional Quality of Sound Reproduction Audio engineering society; 112 th convention 2002 May1013,No.5500. [5] http://www.tape.com/cgi bin/SoftCart.exe/Bartlett_Articles/stereo_microphone_techniques.html?E +cassette ,(1999),Bartlett,B. [6] Hartmann,W.(1999):Howwelocalizesound,PhysicsToday.

[7]Reproduction of auditorium spatial impression with binaural and stereophonicsoundsystems,AES118 th Convention,Barcelona,SpainMay 2831paper6485 [8] Merimaa, J. (2006): Analysis, synthesis and perception of spatial sound. Helsinki University of Technology, Laboratory of Acoustics and Audio SignalProcessingreport77. [9] FarinaA,Tronchin,L.(2005):Measurementsandreproductionofspatial soundcharacteristicsofauditoia,AcousticScience&Technology26,2

[10]http://www.wikirecording.org/ORTF,(2006)

[11](2004), http://www.macmusic.org/articles/view.php/lang/en/id/71

[12]UnitedStatesPatent,(2002):PatentNo.: US6463158B2

[13]http://www.schoeps.de/E2004/kfm6.html ,SchoepsGmbH

[14]Comparability of Recordnings Made with Measurement Systems from HEADAcoustics,HEAD’sapplicationnotes.

[15] Beranek, L., (1996): Concert and opera halls: How they sound, J. AcousticalsocietyofAmerica,NewYork

[16] Blauert, J, (1995): Spatial Hearing The Psychophysics of Human Sound Localization,reviseded.TheMITPress,Cambridge.

[17] Gerzon, M., (1992):Signal processing for simulating realistic stereo images,93rdAESconvention,NewYork,14October,Preprint3424,

72 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 [18]Kendall,G.S.(1995):TheDecorrelationofAudioSignalsandItsImpact onSpatialImagery,ComputerMusicJournal,vol.19(4),pp7187,

[19]Arons,(1992):AReviewoftheCocktailPartyEffect, http://citeseer.ist.psu.edu/cachedpage/10582/3

[20] Kleiner, M, (2005): Audio Technology & Acoustics, Department of AppliedAcousticsCTH,Gothenburg

[21] Harrisson, M, (2004): Vehicle Refinement Controlling Noise and VibrationinRoadVehicles,CranfielsUniversity,UK,

[22] Väljamäe, A, (2004): Auditory Presence, Individualized HeadRelated Transfer Functions, and Iiiusory EgoMotion in Virtual Enviroments, DepartmentofSignalsandSystems,ChalmersUniversityofTechnology, Sweden

[23]HeadAcoustics,(2006):Binaural_Measurement_Okt_06_e

[24]HeadAcoustics,SystemComparability

[25]ConstanZ,HartmanW(2003):Onthedetectionofdispersioninthehead relatedtransferfunction

[26] Khun, G.F. (1982): Towards a model for sound localization, in Localizationofsound:Theoryandapplications,editedbyR.W.Gatehouse (Amphora,Groton,CT)

[27] Brungart, D.S,, and Rabinowitz, W.M, (1999): Auditory localization of nerbysourcesHRTFs,J.Acoust.Soc.Am.10614651479.

[28] Cabera. D, (2006): Percieved room size and source distance in five simulatedconcertauditoria,ICSV12,2005Lisbon,SchoolofArchitecture, UniversityofSydney,Australia

[29] Kahana. Y, (1998): A multiple microphone recording technique for the generation of virtual acoustic images. Institute of Sound & Vibration Research,SouthamptonUniversity,UnitedKingdom

[30]Raykar.V,(2004):Extractingfrequenciesofthepinnaespectralnotchesin measured head related impulse responses. Department of Computer Science,UniversityofMaryland,USA

[31] Calamia. P.T, (1998): Threedimensional localization of a closerange acousticalsourceusingbinauralcues.UniversityofTexasatAustin,USA

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 73 AppendixA–Measurementsystemsspecifications

MEASUREMENTS SYSTEMS, SPECIFICATIONS

o ORTFangle:112 o 17cmdistancebetweenthemicrophonegrills. MicrophonesAKGwithcardioiddirectivity: Serialno.15253 andSerialno.

o Jecklindisc 17cmdistancebetweenthemicrophonegrillssymmetricaldivided with10mmplywood,diameter300mm. Foamabsorberof30mmthickness. MicrophoneswithomnidirectionaldirectivityG.R.A.S.prepolarized: MicrophoneType40AE Serialno.53557 Serialno.53548

o ManikinHMSII.1 44.1kHzsamplingsfrequency. FilterHP0,levelsettings94dB AppliedfilterinArtimiS;User1_EJ_N

o HeadsetBHMIIIonManikinHMSII.1 FilterID,94dB.AppliedfilterinArtimiS:BMHIII.

o Wedgeshapeddummyhead 20cmdistancebetweenmicrophonegrills. MicrophoneswithomnidirectionaldirectivityG.R.A.S.prepolarized: MicrophoneType40AE Serialno.53557 Serialno.53548

o SimpleManikin,Friggo 13cmdistancebetweenmicrophonediaphragms. MicrophoneswithomnidirectionaldirectivityG.R.A.S.prepolarized: MicrophoneType40AE Serialno.53557 Serialno.53548.

o Sphere 23.5cmdistancebetweenmicrophonegrills. MicrophoneswithomnidirectionaldirectivityG.R.A.S.prepolarized: MicrophoneType40AE Serialno.53557 Serialno.53548

o Singlemicrophones,besideManikin 10cmdistancefromManikinatearheight. MicrophoneswithomnidirectionaldirectivityG.R.A.S.prepolarized: MicrophoneType40AE Serialno.53557 Serialno.53548

Calibrator:B&Ktype4228serialno:1756621,1kHz, Pistonphone type 4228 serialno.1756621,250Hz

74 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 AppendixBAbsorptionmaterial

ABSORPTION MATERIAL

Table14:Materialparametersofabsorptionmaterial

Absorbent Type Thickness

Wedge AntiphonSkumabsorbent LA10S 10mm MaterialNo.2a

Jecklin AntiphonSkumabsorbent LA30S 30mm MaterialNo2c

Further information in “Documented materials of consumption Department 26747folder”L47A6.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 75 AppendixCCalibration

CALIBRATION

Calibrationofthestandardomnidirectionalmicrophoneswasdonewitha1 kHz calibrator, and a 250 Hz pistonphone was used for the Headset microphones. The ORTF configuration was calibrated trough simultaneous measurement with calibrated omnidirectional microphone. All microphones used in the measurement systems were calibrated individually before the performedmeasurements.

Figure 54: ORTF right vs. omnidirectional ORTFleftvs.omnidirectionalcalibrated calibratedmicrophone,calculatedwithhighpass microphone,calculatedwithhighpassfilter filter125Hz. 125Hz.

76 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 AppendixDAdditionalmeasurementequipment:

SQUADRIGA

The SQuadriga is a mobile fourchannel module used for all measurement equipmentfromHeadAcousticsincludingabuiltinmemorycard.Itallows upto20minrecordingsatthemaximumsamplingfrequencyof48kHz[1] without additional computer, which ensure reliable measurements in the audible range of 4 Hz 20 kHz. The RPM or engine speed signals can be recordedbytwoadditionalelectricalseparatedpulsechannelswhichalsocan beusedtotrigstartandstopoftherecording.

REFERENCE SOURCE - FAN

ThereferencesourcewasofB&Ktype4204serialno.680816K.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 77 AppendixE–Referencefan

Measurementswithreferencefanintheanechoicchamberovera360 osweep.

Figure55:Wedge Figure56:Jecklin

Figure 57: Manikin, filtered with Figure58:Friggo recommendedfilter.

Figure 59: Manikin, filtered with Figure60:Sphere recommendedfilter.

78 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 AppendixF–ResultsfromHRTFmeasurements

ILD [dB] for Manikin averaged into critical frequency bands (fc = 1.6 – 13.5 kHz)

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 79 ITD [s] for Manikin averaged into critical frequency bands (fc=2.5–13.5 kHz)

80 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 ILD [dB] for Friggo averaged into critical frequency bands (fc=1.6–13.5 kHz)

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 81 ITD [s] for Friggo averaged into critical frequency bands (fc = 2.5 – 13.5 kHz)

82 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 ILD [dB] for Sphere averaged into critical frequency bands (fc=1.6–13.5 kHz)

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 83 ITD [s] for Sphere averaged into critical frequency bands (fc = 2.5–13.5 kHz)

84 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 AppendixG–Listeningtests

LOCALIZATION TEST

This localization test aims to investigate the spatial qualities of recordings donewithbinauralsimilarrecordingsystems.Inthefirstpartofthetest,only theangulardirectiontoasourceisatquestionandhencethedistancetothe sourceisnotconcerned.Inthesecondpart,additionalquestionsarewrittento investigatetheperceptionofdistancetoasource.

The sound used is a male voice counting “one two”.This is played back to you twice for every test sample direction. We want you to mark the experienceddirectionofthemalevoiceintheangularchartsbelow.Thereis oneangularchartforeverytestsampleandthelisteningtestcontainsatotal of36samples.Anexamplecanbeseenbelow.

Example: Thesourceisexperiencedascomingfromthefront,35ºtotheright.

0º 330º 30º

300º 60º

270º 90º

240º 120º

210º 150º 180º

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 85 Part two:

Soundfiles:Testmodel1–Testmodel4

Questions:

1. Atwhatdistancedoyouperceivethelocationofthesource?

2. Doyouexperiencethesoundasbeinglocalizedinsideoroutsideyour head?

Test sample Answers

Question1 :…… Testmodel1

Question2 :…………………………………..

Question1 :…… Testmodel2

Question2 :…………………………………..

Question1 :…… Testmodel3

Question2 :…………………………………..

Question1 :…… Testmodel4

Question2 :…………………………………..

86 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 LISTENING TEST REGARDING THE COCKTAIL PARTY EFFECT AND SOURCE SEPARATION

Youwillheareightdifferentsoundfiles,recordedwithseveralpeopletalking atthesametime.Eachsamplewillbeplayedbacktoyoutwotimes.

Onepersonwillreadoutnumbers09randomly.

Numbercombination:

Sample 1

First:………………………………

Second:……………………………

Please,tryto write down the number combination onthelinestotheleftand marktheintelligibilitywithacrossonthescaletotheright.

How easy was it to detect the numbers?

Sample 1 Hardly intelligible clearly intelligible intelligible

1 2 3 4 5 6

Comparethesoundsamples1to6withthereferencesample:

Has the sample higher or lower source separation ability than the reference?

Pleasemarkwithacrossontheaxisbelow:

Sample1:

Muchlower Equal Muchhigher

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 87 LISTENING TEST REGARDING THE BINAURAL QUALITY OF RECORDINGS OF IN-CAB NOISE

Preface

Wewantyouto:

• Rate the binaural quality of the recordings in comparison with the referencerecording.

• Position the recording systems themselves in order corresponding to ascendingbinauralrecordingqualityincomparisonwithreference.

Therecordingsystemsarereferredtowithnumbers(14)andthescaleusedis reaching from “poor binaural recording quality compared to reference” to “goodbinauralrecordingqualitycomparedtoreference”.

Positionthesamplesinordercorrespondingtotheperceivedbinauralquality incomparisonwithreference.

Constant speed

Poorbinauralquality Goodbinauralquality comparedtoreference comparedtoreference

Full load

Poorbinauralquality Goodbinauralquality comparedtoreference comparedtoreference

88 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 AppendixH–Barkscale

In the analysis of the listening test regarding the cocktail party effect the loudnessareshownusingtheBarkscale.ThebandoftheBarkscaleisshown inTable15.

Table15:TheBarkscale.

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 89 AppendixI–Statisticalanalysis

STATISTIC ANALYSIS: USED TO LISTENING TESTS

Theanalysisofvariance(ANOVA)procedurewasdecidedtobeusedinthe analysisofthesampledatafromthelisteningtests.InANOVAitispossible tocombinesampledatafromseveralpopulationsintoasingletestanditis capableofdetectingwhenoneormoreof thepopulationmean valuediffer from the rest. In our listening tests only one single factor (the choice of recording system) was investigated and therefore the results were analysed usingsinglefactor(oneway)ANOVA.TheonewayANOVAisexplainedin nextsection.

If an ANOVA test is significant further investigations is needed before drawing conclusions. A significant ANOVA test only states that there is a significant difference between some means within the studied population means. Multiple comparison procedures are then used to determine which mean values that are different from each other. In this study the Tukey’s pairedcomparisonprocedurewaschosentodothisanditisexplainedlater.

Itwasdesiredtoestimatetheinaccuracyassociatedwiththeresultsfromthe listening test. In statistics this is usually done by calculating confidence intervalsindicatingtherangeofdeviationaroundaresult.Howtocalculate confidenceintervalisalsopresented.

ANOVA

Todeterminewhetherpopulationmeanvaluesdiffer,theANOVAapproach compares the variation between the sample means under study to the inherent variability within each sample. The more the sample means differ, the larger the betweensamples variation will be. Figure 61 shows an imagined example of three populations with variances and mean values outlined.Theteststaticthatcomparesthesetwotypesofvariationistheratio ofthebetweensamplesvariationtothewithinsamplesvariation.

Between − samples var iation Test statistic = Within − samples var iation

The test statistic follows a continuous probability distribution called an F distribution.Anullhypothesisassumingnodifferencebetween kpopulation µ = µ = = µ means (H 0 : 1 2 ... k ) and is true when the test statistic ( F ratio) describedpreviouslyfollowstheFdistribution.Thevariationmeasuresused inan Fratioarebasedoncertainsumofsquarescalculatedfromthesample data.Thenumeratoranddenominatorsumofsquares,eachhasanassociated number of degrees of freedom. There is a different F distribution for every different combination of numerator (df 1) and denominator (df 2) degrees of freedomfromthecalculationofthe F ratio.

90 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 Variation within samples

x2 Variation x1 between sample x 3 means Sample number 1 2 3 Figure61:Exampleofbetweensamplesand withinsamplesvariationinANOVA.

Pvalues are used in the hypothesis testing. A Pvalue associated with a calculated F ratio is the area under the F distribution to the right of the calculatedFratio.Figure62showsthePvalueassociatedwithacalculatedF ratioof4.53basedondf 1=4anddf 2=6.

0.7

0.6

0.5 F curvefordf 1=4anddf 2=6

0.4

0.3

0.2

0.1 Pvalue= 0.05

0 0 1 2 3 4 5 6 7 8 9 10

F= 4.53 Figure62:PvalueforanuppertailFtest.

NOTATIONSANDFORMULAS

Samplesizes: n1 ,n2 ,..., nk

Samplemeans: x1 , x2 ,..., xk 2 2 2 Samplevariances: s1 , s2 ,..., sk = + + + Totalsamplesize: n n1 n2 ... nk Grandaverage: x = average of all n responses

The treatment sum of squares (denoted SSTr) and error sum of squares (denotedSSE)aredefinedas

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 91 2 2 2 SSTr = n (x − x ) + n (x − x ) +... + n (x − x ) 1 1 2 2 k k = ()()()− 2 + − 2 + + − 2 SSE n1 1 s1 n2 1 s2 ... nk 1 sk

SSTr and SSE form the basis of the betweensamples variation and within samplesvariation.Together,thesetwosourcesofvariationcomprisethetotal sumofsquares(denotedSST)

SST = SSTr + SSE

HYPOTHESISTEST

Tobeabletoformastatisticalprocedureandtoperformahypothesistesta fewassumptionsaboutthepopulationstudiedneedstobedone.Theseare:

1. All kpopulationvariancesareequal.

2. Eachofthe kpopulationfollowsanormaldistribution.

When sampling from normal populations, each sum of squares has its own unique number of degrees of freedom. For instance, in a oneway classification,thetotaldegreeoffreedom(associatedwithSST)is n 1,which equalsthesumof k 1(associatedwithSSTr)and n –k (associatedwithSSE). Thisenablesustoconvertthesumofsquaresintomeansquaresbydividing each sum of squares by its associated df. To do this the following formulas wereused:

SSTr MSTr = − k 1 SSE MSE = n − k

MSTr and MSE serve as measures of the betweensamples and within samplesvariationdescribedpreviously.

Topresentallthisinformationinthisreportthedatahasbeenorganizedinto ANOVAtables.AnexampleofanANOVAtableforonewayclassificationis shownbelow.

Table16:ANOVAtablefortheonewayclassification.

Source of Variation df SS MS F Betweensamples k1 SSTr MSTr MSTr/MSE Withinsamples nk SSE MSE Totalvariation n1 SST

ThehypothesisH 0isrejectedwheneverthePvalueoftheteststaticFisless thanorequaltothesignificancelevel αusedforthehypothesistest.

92 CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 MULTIPLECOMPARISON:TUKEY’SMETHOD

Tukey’sprocedureallowsustoconductseparateteststodecidewhether i= j for each pair of mean values in an ANOVA study of k population mean values. In correspondence with other multiple comparison procedures, Tukey’smethodisbasedontheselectionofasignificancelevel α.whichsays thatitisatmosta α%chanceofobtainingafalsepositiveamongtheentire set of pair wise tests.According to Tukey’sprocedurethe distance between − any two sample means xi x j can be compared to the threshold value T, Tukey’slimit thatdependson αaswellasontheMSEfromtheANOVAtest. Theformulafor Tis,

MSE T = qα ,where niisthesizeofthesampledrawnfromeachpopulation. ni

Thevalueof qαisfoundtablesofstatistics, q,anditfollowstheStudentized range distribution. The Studentized range distribution is a probability distributionthatdependsontwodifferentdegreesoffreedom( k,m ),where k isthenumberofpopulationmeanvaluestobecomparedand mistheerror degrees of freedom in the ANOVA test. To determine whether two mean − − valuesiand jdiffer,wesimplycompare xi x j with T. If xi x j > T,then µ ≠ µ it is concluded that i j . Otherwise, it is concluded to be no significant differencesbetweenthetwomeanvalues.

CONFIDENCEINTERVALS

Inanalysisofexperimentsitisoftenofinteresttocalculateanintervalwithin which the true value with some probability could be expected. Such an intervaliscalledaconfidenceinterval.Thesizeoftheconfidenceintervalis dependentofthestandarddeviation( s)andtheaccuracythatisrequested.In theanalysisofthelisteningtestitisofinteresttonowanintervalinwhicha certain percentage of the population values are included. Assuming that a populationisnormaldistributed,asinglesampleconfidenceintervalcentered atthesamplemeanvalueincluding95%ofthevaluesinthepopulationcan becalculatedwiththefollowingformula:

confidence interval limits = x ± 96.1 ⋅ s

CHALMERS , CivilandEnvironmentalEngineering ,Master’sThesis2007:32 93