Scalable and Perceptual Audio Compression

Total Page:16

File Type:pdf, Size:1020Kb

Scalable and Perceptual Audio Compression University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2003 Scalable and perceptual audio compression Mohammed Raad University of Wollongong, [email protected] Follow this and additional works at: https://ro.uow.edu.au/theses University of Wollongong Copyright Warning You may print or download ONE copy of this document for the purpose of your own research or study. The University does not authorise you to copy, communicate or otherwise make available electronically to any other person any copyright material contained on this site. You are reminded of the following: This work is copyright. Apart from any use permitted under the Copyright Act 1968, no part of this work may be reproduced by any process, nor may any other exclusive right be exercised, without the permission of the author. Copyright owners are entitled to take legal action against persons who infringe their copyright. A reproduction of material that is protected by copyright may be a copyright infringement. A court may impose penalties and award damages in relation to offences and infringements relating to copyright material. Higher penalties may apply, and higher damages may be awarded, for offences and infringements involving the conversion of material into digital or electronic form. Unless otherwise indicated, the views expressed in this thesis are those of the author and do not necessarily represent the views of the University of Wollongong. Recommended Citation Raad, Mohammed, Scalable and perceptual audio compression, Doctor of Philosophy thesis, School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, 2003. https://ro.uow.edu.au/theses/1957 Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: [email protected] NOTE This online version of the thesis may have different page formatting and pagination from the paper copy held in the University of Wollongong Library. UNIVERSITY OF WOLLONGONG COPYRIGHT WARNING You may print or download ONE copy of this document for the purpose of your own research or study. The University does not authorise you to copy, communicate or otherwise make available electronically to any other person any copyright material contained on this site. You are reminded of the following: Copyright owners are entitled to take legal action against persons who infringe their copyright. A reproduction of material that is protected by copyright may be a copyright infringement. A court may impose penalties and award damages in relation to offences and infringements relating to copyright material. Higher penalties may apply, and higher damages may be awarded, for offences and infringements involving the conversion of material into digital or electronic form. SCALABLE AND PERCEPTUAL AUDIO COMPRESSION By Mohammed Raad SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY AT UNIVERSITY OF WOLLONGONG NORTHFIELDS AVE WOLLONGONG NSW 2522 AUSTRALIA FEBRUARY 2003 © Copyright by Mohammed Raad, 2003 To all those who value, seek and apply knowledge for the true benefit of mankind. 11 Table of Contents Table of Contents iii List of Tables vii List of Figures ix Abstract xiv Statement of Originality xvi Acknowledgements xvii List of Acronyms xviii 1 Introduction 1 1.1 Perceptually scalable audio compression ............................................. 1 1.2 Scalable to lossless audio compression................................................... 2 1.3 Contributions......................................................... 3 1.4 Publications............................................................................................. 4 2 DSP and psychoacoustic concepts in audio compression 6 2.1 Introduction............................................................................................. 6 2.2 Audio signal processing................................................ 8 2.2.1 Transforms and transform theory.............................................. 8 2.2.2 Quantization ............................................................................... 20 2.2.3 Audio signal models .................................................................. 31 2.3 Perception and perceptual entropy ...................................................... 35 2.3.1 The mechanics of sound perception.......................................... 36 2.3.2 Modelling the human ear............................................................ 38 2.3.3 Masking....................................................................................... 43 2.3.4 Calculating the masking threshold.............................................. 48 2.4 Sensory pleasantness and its contributing factors................................. 54 2.5 Audio quality assessment........................................................................ 60 2.5.1 Subjective quality assessment................................................... 62 2.5.2 Objective quality assessment...................................................... 65 2.6 Summary and conclusion........................................................................ 68 3 A Review of Perceptual Audio Compression 70 3.1 Introduction............................................................................................. 70 3.2 Transform and sub-band audio compression basics.............................. 72 3.2.1 Frame selection 72 3.2.2 The transform.............................................................................. 76 3.2.3 The perceptual model and quantization.................................... 77 3.3 Parametric audio compression basics................................................... 79 3.4 Scalable audio compression- why and how ? ....................................... 82 3.5 Lossless audio compression - why and how ? ....................................... 83 3.6 Audio compression standards ............................................................... 85 3.6.1 MPEG-1....................................................................................... 85 3.6.2 MPEG-2....................................................................................... 88 3.6.3 MPEG-4....................................................................................... 90 3.7 The Dolby AC series of coders ............................................................ 93 3.8 AT&T Perceptual Audio C oding (PAC).......................................... 95 3.9 Sony’s ATRAC....................................................................................... 97 3.10 Other transform and sub-band coders................................................... 100 3.11 Parametric audio coders .....'......................................................... 106 3.12 Scalable audio coders.............................................................................. 109 3.13 Lossless audio coding.............................................................................. 112 3.14 Discussion and conclusions..................................................................... 114 4 Perceptually Scalable Sinusoidal Compression of Audio 116 4.1 Introduction............................................................................................. 116 4.2 Sinusoidal Compression of Speech and Audio....................................... 118 4.2.1 The Short Time Fourier Transform (STFT) Approach............ 118 4.2.2 The Analysis By synthesis Approach (A-by-S)........................ 121 4.3 A new scalable sinusoidal coder ............................................................. 125 4.3.1 Motivation.................................................................................... 125 4.3.2 System Overview........................................................................ 127 4.3.3 The Perceptual Model............................................................... 131 4.3.4 Sorting the Parameters............................................................... 132 IV 4.3.5 Frequency Domain Gains............................................................ 139 4.3.6 Quantization .............................................................................. 141 4.4 Results........................ 147 4.4.1 Variable rate coder objective results.......................................... 148 4.4.2 Scalable rate coder objective results.......................................... 148 4.4.3 Subjective scalable results......................................................... 162 4.5 Discussion and conclusion..................................................................... 165 5 Set Partitioning In Hierarchical Trees (SPIHT) For Audio Coding 167 5.1 Introduction............................................................................................. 168 5.2 A study of SPIHT with current audio codingtransforms ................... 169 5.2.1 Set Partitioning In Hierarchical Trees....................................... 169 5.2.2 Audio compression using wavelets and SPIH T....................... 171 5.2.3 The M-Band Transform Based Codec....................................... 177 5.2.4 M-Band Transform Coding Results.......................................... 183 5.2.5 Discussion of the M-Band Transform coding results............... 186 5.3 Improving the compression ratio of the M-Band SPIHT coder .... 188 5.3.1 The use of masking..................................................................... 188 5.3.2
Recommended publications
  • Sound Masking Done Right: Simple Solutions for Complex Problems
    SOUND MASKING DONE RIGHT: SIMPLE SOLUTIONS FOR COMPLEX PROBLEMS Robert Chanaud, Ph.D. Magnum Publishing L.L.C. i Copyright 2008, Robert Chanaud, All Rights Reserved Published with permission by Magnum Publishing, LLC No part of this book may be reproduced, stored in a retrieval system, or transmitted in any other form, or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission of the publishers. While every effort has been made to ensure that the contents of this document are accurate and reliable, Magnum Publishing LLC, Robert Chanaud, or Atlas Sound L.P. cannot assume liability for any damages caused by inaccuracies in the data or documentation, or as a result of the failure of the data, documentation, software, or products described herein to function in a particular manner. The authors and publishers make no warranty, expressed or implied, nor does the fact of distribution constitute a warranty. International Standard Book Number: 978-0-9818166-0-9 Printed in the United States of America ii - Sound Masking Done Right: Simple Solutions for Complex Problems - About Bob Chanaud Dr. Robert C. Chanaud received his BS from the US Coast Guard Academy, his MS from the University of California and his PhD from Purdue University. Active in the field of acoustics since 1958, he taught at Purdue University and the University of Colorado and, in 1975, founded Dynasound, Inc. He has developed software programs to facilitate the design and equalization of masking systems. Acknowledgements The following companies have graciously supplied products for evaluation: Atlas Sound Cambridge Sound Management Dynasound Soft dB Sound Advance Notation Square brackets [ ] are references contained at the end of the manual.
    [Show full text]
  • MARK Serv Grap GLOB 2013 EUROCOM Catalog EN V8 2013
    EUROCOM The New Standard for Installed Systems Summer 2013 EUROCOM The EUROCOM Story Innovative Tools Discover EUROCOM, the new for Modern standard for installed systems: Systems Integrators New ideas, breathtaking technologies, a unique design aesthetic, fl exibility – and tremendous value! I’m often asked why BEHRINGER customers love our made them tick. I quickly realized that companies were Over the past two decades we have with a diff erent sales and distribution Longtime BEHRINGER observers will products so much. The answer is simple… it’s because charging US$1000 or more for a piece of equipment, heard from many customers who model, terms of sale, marketing and recognize our EUROLIVE, EURODESK we are so passionate about everything we do. while the components inside were only worth US$100! have used our products in installed support requirements. We recognized and EUROPOWER product family Our customers tell us what they want, and we design sound systems. After all, system early on that the nature of project-driven names. The origin of these names So I immediately grabbed my soldering iron and went to and build products that sound great and provide integrators and contractors are always sales diff ers considerably from that of stems from our roots as a small work on my fi rst signal processor. Word spread quickly amazing feature sets—at extremely aff ordable prices! looking for products that deliver a retail sales and that our organization startup in Germany, hence the among friends that my products sounded really good. would need to adapt in name—EURO. So, of course when It all started 29 years ago when I was studying classical More importantly, I found out that my friends were very order to fully and properly it came time to name the family piano and sound engineering.
    [Show full text]
  • Sound Masking in Patient Rooms
    Navigate to ... Navigate to ... SnoCQ eguraie: nRtuMedPin oait ontaimskgi s highprofile | May 19, 2014 | 0 Comments by Benjamin Davenny Noise levels in hospitals have become an increasing concern as more noise sources have been added to the hospital environment. These sources range from noisy medical instruments to the layout of patient rooms as they relate to the nurses’ station. Numerous studies have evaluated the impact of noise levels on the hospital environment, but few have considered the type of noise source. The sound of a fan is different from an alarm, even if they measure at the same sound levels. Without clear objectives on the type of noises studied, there is a false impression that a quieter environment is always a better one. Some of these noise sources are necessary in a modern working hospital. The trick is to take a different look at their sources and Ben Davenny develop more efficient methods to reduce disturbance to patients. The often-cited World Health Organization (WHO) and Environmental Protection Agency (EPA) guidelines for hospitals require low noise levels in patient rooms, precluding conversation in corridors with patient doors open. This requirement conflicts with nurses’ needs to see patients and discuss patient care. The WHO and EPA guidelines are also based mainly on transportation noise, whose character is quite distinctive and bothersome to building occupants. Introducing a constant noise source as background sound helps to reduce the impact of impulsive tonal noises such as speech. The typical background noise level can be considered a constant noise with full frequency content. These noise sources include air movement from the building’s HVAC system and cooling fans, as well as electronic sound masking systems.
    [Show full text]
  • Machine Learning Based Cyber Attack Targeting on Controlled Information
    Machine Learning Based Cyber Attacks Targeting on Controlled Information By Yuantian Miao A thesis submitted in fulfillment for the degree of Doctor of Philosophy Faculty of Science, Engineering and Technology (FSET) Swinburne University of Technology May 2021 Abstract Due to the fast development of machine learning (ML) techniques, cyber attacks utilize ML algorithms to achieve a high success rate and cause a lot of damage. Specifically, the attack against ML models, along with the increasing number of ML-based services, has become one of the most emerging cyber security threats in recent years. We review the ML-based stealing attack in terms of targeted controlled information, including controlled user activities, controlled ML model-related information, and controlled authentication information. An overall attack methodology is extracted and summarized from the recently published research. When the ML model is the target, the attacker can steal model information or mislead the model’s behaviours. The model information stealing attacks can steal the model’s structure information or model’s training set information. Targeting at Automated Speech Recognition (ASR) system, the membership inference method is studied to whether the model’s training set can be inferred at user-level, especially under the black-box access. Under the label-only black-box access, we analyse user’s statistical information to improve the user-level membership inference results. When even the label is not provided, google search results are collected instead, while fuzzy string matching techniques would be utilized to improve membership inference performance. Other than inferring training set information, understanding the model’s structure information can launch an effective adversarial ML attack.
    [Show full text]
  • Noise Assessment Activities
    Noise assessment activities Interesting stories in Europe ETC/ACM Technical Paper 2015/6 April 2016 Gabriela Sousa Santos, Núria Blanes, Peter de Smet, Cristina Guerreiro, Colin Nugent The European Topic Centre on Air Pollution and Climate Change Mitigation (ETC/ACM) is a consortium of European institutes under contract of the European Environment Agency RIVM Aether CHMI CSIC EMISIA INERIS NILU ÖKO-Institut ÖKO-Recherche PBL UAB UBA-V VITO 4Sfera Front page picture: Composite that includes: photo of a street in Berlin redesigned with markings on the asphalt (from SSU, 2014); view of a noise barrier in Alverna (The Netherlands)(from http://www.eea.europa.eu/highlights/cutting-noise-with-quiet-asphalt), a page of the website http://rumeur.bruitparif.fr for informing the public about environmental noise in the region of Paris. Author affiliation: Gabriela Sousa Santos, Cristina Guerreiro, Norwegian Institute for Air Research, NILU, NO Núria Blanes, Universitat Autònoma de Barcelona, UAB, ES Peter de Smet, National Institute for Public Health and the Environment, RIVM, NL Colin Nugent, European Environment Agency, EEA, DK DISCLAIMER This ETC/ACM Technical Paper has not been subjected to European Environment Agency (EEA) member country review. It does not represent the formal views of the EEA. © ETC/ACM, 2016. ETC/ACM Technical Paper 2015/6 European Topic Centre on Air Pollution and Climate Change Mitigation PO Box 1 3720 BA Bilthoven The Netherlands Phone +31 30 2748562 Fax +31 30 2744433 Email [email protected] Website http://acm.eionet.europa.eu/ 2 ETC/ACM Technical Paper 2015/6 Contents 1 Introduction ...................................................................................................... 5 2 Noise Action Plans .........................................................................................
    [Show full text]
  • 22Nd International Congress on Acoustics ICA 2016
    Page intentionaly left blank 22nd International Congress on Acoustics ICA 2016 PROCEEDINGS Editors: Federico Miyara Ernesto Accolti Vivian Pasch Nilda Vechiatti X Congreso Iberoamericano de Acústica XIV Congreso Argentino de Acústica XXVI Encontro da Sociedade Brasileira de Acústica 22nd International Congress on Acoustics ICA 2016 : Proceedings / Federico Miyara ... [et al.] ; compilado por Federico Miyara ; Ernesto Accolti. - 1a ed . - Gonnet : Asociación de Acústicos Argentinos, 2016. Libro digital, PDF Archivo Digital: descarga y online ISBN 978-987-24713-6-1 1. Acústica. 2. Acústica Arquitectónica. 3. Electroacústica. I. Miyara, Federico II. Miyara, Federico, comp. III. Accolti, Ernesto, comp. CDD 690.22 ISBN 978-987-24713-6-1 © Asociación de Acústicos Argentinos Hecho el depósito que marca la ley 11.723 Disclaimer: The material, information, results, opinions, and/or views in this publication, as well as the claim for authorship and originality, are the sole responsibility of the respective author(s) of each paper, not the International Commission for Acoustics, the Federación Iberoamaricana de Acústica, the Asociación de Acústicos Argentinos or any of their employees, members, authorities, or editors. Except for the cases in which it is expressly stated, the papers have not been subject to peer review. The editors have attempted to accomplish a uniform presentation for all papers and the authors have been given the opportunity to correct detected formatting non-compliances Hecho en Argentina Made in Argentina Asociación de Acústicos Argentinos, AdAA Camino Centenario y 5006, Gonnet, Buenos Aires, Argentina http://www.adaa.org.ar Proceedings of the 22th International Congress on Acoustics ICA 2016 5-9 September 2016 Catholic University of Argentina, Buenos Aires, Argentina ICA 2016 has been organised by the Ibero-american Federation of Acoustics (FIA) and the Argentinian Acousticians Association (AdAA) on behalf of the International Commission for Acoustics.
    [Show full text]
  • Hybrid Active Noise Barrier with Sound Masking
    Hybrid active noise barrier with sound masking Xun WANG1; Yosuke KOBA2; Satoshi ISHIKAWA2; Shinya KIJIMOTO2 1;2 Kyushu University, Japan ABSTRACT In this paper, a hybrid active noise barrier (ANB) with sound masking capability is considered. To protect the speech privacy, several sound masking techniques have been developed. However, these sound masking techniques based on the superposition of the masker and original sound will lead to increase in the loudness of the sound after masking. Against this problem, this paper proposes a soundproof system which combines an ANB with a sound masking system. The ANB applies a new type of hybrid active noise control (ANC) system, which can reduce the noise diffraction and the noise propagated to the ear positions of the people behind the ANB simultaneously, to attenuate the conversation sound. Consequently, the required volume of the masker sound will decrease because of the smaller loudness of the original conversation sound in the control area. The sound masking system applies a method which can generate masker based on the frequency properties of the original sound. Several simulations are carried out to investigate the sound attenuation and sound masking performance of this system, and the results show that comparing with the traditional sound masking system, the proposed system needs smaller masker sound to achieve the sound masking effect. Keywords: Active Noise Control, Noise Barrier, Sound Masking I-INCE Classification of Subjects Number(s): 38.1, 38.2 1. INTRODUCTION In the open areas such as clinics, pharmacies, banks and offices, leakage of the privacy contained in the conversations has been seen as a very serious problem.
    [Show full text]
  • PROCEEDINGS of the ICA CONGRESS (Onl the ICA PROCEEDINGS OF
    ine) - ISSN 2415-1599 ISSN ine) - PROCEEDINGS OF THE ICA CONGRESS (onl THE ICA PROCEEDINGS OF Page intentionaly left blank 22nd International Congress on Acoustics ICA 2016 PROCEEDINGS Editors: Federico Miyara Ernesto Accolti Vivian Pasch Nilda Vechiatti X Congreso Iberoamericano de Acústica XIV Congreso Argentino de Acústica XXVI Encontro da Sociedade Brasileira de Acústica 22nd International Congress on Acoustics ICA 2016 : Proceedings / Federico Miyara ... [et al.] ; compilado por Federico Miyara ; Ernesto Accolti. - 1a ed . - Gonnet : Asociación de Acústicos Argentinos, 2016. Libro digital, PDF Archivo Digital: descarga y online ISBN 978-987-24713-6-1 1. Acústica. 2. Acústica Arquitectónica. 3. Electroacústica. I. Miyara, Federico II. Miyara, Federico, comp. III. Accolti, Ernesto, comp. CDD 690.22 ISSN 2415-1599 ISBN 978-987-24713-6-1 © Asociación de Acústicos Argentinos Hecho el depósito que marca la ley 11.723 Disclaimer: The material, information, results, opinions, and/or views in this publication, as well as the claim for authorship and originality, are the sole responsibility of the respective author(s) of each paper, not the International Commission for Acoustics, the Federación Iberoamaricana de Acústica, the Asociación de Acústicos Argentinos or any of their employees, members, authorities, or editors. Except for the cases in which it is expressly stated, the papers have not been subject to peer review. The editors have attempted to accomplish a uniform presentation for all papers and the authors have been given the opportunity
    [Show full text]
  • Understanding Sound Masking
    WHY SOUND MASKING? HOW DOES IT WORK? WILL I HEAR IT? HOW EFFECTIVE IS IT? Disruptive noises and conversations make tasks harder to If you’ve ever ran water at your kitchen sink while trying to talk to Sound masking allows easy communication over short someone in the next room, you’ll understand. You can tell your How is the solution complete. Errors happen more often. That adds to stress. conversational partner is speaking, but it’s difficult to comprehend Common misconceptions distances while what they’re saying. That’s because the running water has raised the noise floor in your area. implemented? protecting employees Can’t music provide masking? More than 40 million North Americans work in open plan offices featuring Are acoustics A sound masking system uses loudspeakers The noise floor is the level of constant sound present in a space. If it to distribute a comfortable, engineered from the noises coming from surrounding partial height panels. Granted, these cubicles make better use of space really that important? Music alone does not provide the is too high, you’ll find it irritating. Too low, and you can easily overhear background sound. This makes it difficult to hear frequency spectrum required to offices and workstations. Understanding and improve communication flow, but they’re an acoustical challenge. conversations and noises. incidental noises or conversations. consistently mask conversations and Research conducted over the last decade noises. Music preferences are a matter Traditional walls have given way to modular furniture systems, more by the Center for the Built Environment The LogiSon Acoustic Network’s loudspeakers of personal taste, and because music (CBE) and others shows that poor acoustics are usually installed in a grid-like pattern above contains variations and patterns, it As noise travels, its volume decreases to a level that is covered up by the masking sound, so it follows that this employees use the same space, and everyone is seated closer together.
    [Show full text]
  • Newport Beach, California
    47th International Congress and Exposition on Noise Control Engineering (INTERNOISE 2018) Impact of Noise Control Engineering Chicago, Illinois, USA 26 - 29 August 2018 Volume 1 of 10 ISBN: 978-1-5108-7303-2 Printed from e-media with permission by: Curran Associates, Inc. 57 Morehouse Lane Red Hook, NY 12571 Some format issues inherent in the e-media version may also appear in this print version. Copyright© (2018) by Institute of Noise Control Engineering - USA (INCE-USA) All rights reserved. Printed by Curran Associates, Inc. (2018) For permission requests, please contact Institute of Noise Control Engineering - USA (INCE-USA) at the address below. Institute of Noise Control Engineering - USA (INCE-USA) INCE-USA Business Office 11130 Sunrise Valley Drive Suite 350 Reston, VA 20190 USA Phone: +1 703 234 4124 Fax: +1 703 435 4390 [email protected] Additional copies of this publication are available from: Curran Associates, Inc. 57 Morehouse Lane Red Hook, NY 12571 USA Phone: 845-758-0400 Fax: 845-758-2633 Email: [email protected] Web: www.proceedings.com INTER-NOISE 2018 – Pleanary, Keynote and Technical Sessions Papers are listed by Day, Morning/Afternoon, Session Number and Order of Presentation Select the underlined text (in18_xxxx.pdf) to link to paper Papers without links are available from the American Society of Mechanical Engineers (ASME) - Noise Control and Acoustics Division (NCAD) Sunday – 26 August, 2018 Opening Plenary: Barry Marshall Gibbs Session Chair: Stuart Bolton, Room: Chicago D and E 16:00 in18_4002.pdf Structure-Borne Sound in Buildings: Application of Vibro-Acoustic Methods for Measurement and Prediction .....1 Barry Marshall Gibbs, University of Liverpool Monday Morning – 27 August, 2018 Keynote: Jean-Louis Guyader Session Chair: Steffen Marburg Room: Chicago D 08:00 in18_5001.pdf Building Optimal Vibro-Acoustics Models From Measured Responses .....26 Jean-Louis Guyader, LVA/INSA de Lyon; Michael Ruzek, LAMCOS INSA de Lyon; Charles Pezerat, LAUM Universite du Maine Keynote: Amiya R.
    [Show full text]
  • Convention Paper
    Audio Engineering Society Convention Paper Presented at the 127th Convention 2009 October 9–12 New York, NY, USA The papers at this Convention have been selected on the basis of a submitted abstract and extended precis that have been peer reviewed by at least two qualified anonymous reviewers. This convention paper has been reproduced from the author's advance manuscript, without editing, corrections, or consideration by the Review Board. The AES takes no responsibility for the contents. Additional papers may be obtained by sending request and remittance to Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA; also see www.aes.org. All rights reserved. Reproduction of this paper, or any portion thereof, is not permitted without direct permission from the Journal of the Audio Engineering Society. Practical Measurement of Loudspeaker Distortion Using a Simplified Auditory Perceptual Model Steve Temme 1, Pascal Brunet 2, and D. B. (Don) Keele, Jr. 3 1 Listen, Inc., Boston, MA, 02118, USA [email protected] 2 Listen, Inc., Boston, MA, 02118, [email protected] 3 DBK Associates and Labs, Bloomington, IN 47408, USA [email protected] ABSTRACT Manufacturing defects in loudspeaker production can often be identified by an increase in Rub & Buzz distortion. This type of distortion is quite noticeable because it contributes an edgy sound to the reproduction and is annoying because it often sounds separate or disembodied from the fundamental signal. The annoyance of Rub & Buzz distortion is tied intimately to human perception of sound and psychoacoustics. To properly implement automated production-line testing of loudspeaker Rub & Buzz defects, one has to model or imitate the hearing process using a sufficiently accurate perceptual model.
    [Show full text]
  • Cisp-Bmei 2017)
    2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI 2017) Shanghai, China 14-16 October 2017 Pages 1-754 IEEE Catalog Number: CFP17J14-POD ISBN: 978-1-5386-1938-4 1/3 Copyright © 2017 by the Institute of Electrical and Electronics Engineers, Inc. All Rights Reserved Copyright and Reprint Permissions: Abstracting is permitted with credit to the source. Libraries are permitted to photocopy beyond the limit of U.S. copyright law for private use of patrons those articles in this volume that carry a code at the bottom of the first page, provided the per-copy fee indicated in the code is paid through Copyright Clearance Center, 222 Rosewood Drive, Danvers, MA 01923. For other copying, reprint or republication permission, write to IEEE Copyrights Manager, IEEE Service Center, 445 Hoes Lane, Piscataway, NJ 08854. All rights reserved. *** This is a print representation of what appears in the IEEE Digital Library. Some format issues inherent in the e-media version may also appear in this print version. IEEE Catalog Number: CFP17J14-POD ISBN (Print-On-Demand): 978-1-5386-1938-4 ISBN (Online): 978-1-5386-1937-7 Additional Copies of This Publication Are Available From: Curran Associates, Inc 57 Morehouse Lane Red Hook, NY 12571 USA Phone: (845) 758-0400 Fax: (845) 758-2633 E-mail: [email protected] Web: www.proceedings.com TABLE OF CONTENTS AN EFFICIENT ALGORITHM FOR RECOGNITION OF POINTER SCALE VALUE ON DASHBOARD........................................................................................................................................................................1 Pengfei Li ; Qingxiang Wu ; Yao Xiao ; Yanan Zhang AN IMPROVED SPATIO-TEMPORAL CONTEXT TRACKING ALGORITHM COMBINING LK OPTICAL FLOW..................................................................................................................................................................7 Zhongjian Li ; Yaru Su ; Wenke Ma ANALYSIS ON THE VALIDITY OF EYE MOVEMENT DATA DURING VIDEO VIEWING..............................
    [Show full text]