Automatic Speech Recognition Adaptation for Various Noise Levels

Total Page:16

File Type:pdf, Size:1020Kb

Automatic Speech Recognition Adaptation for Various Noise Levels Automatic Speech Recognition Adaptation for Various Noise Levels by Azhar Sabah Abdulaziz Bachelor of Science Computer Engineering College of Engineering University of Mosul 2002 Master of Science in Communication and Signal Processing College of Agricultual, Science and Engineering Electrical and Electronics Engineering Newcastle University 2009 A dissertation submitted to the College of Engineering Department of Electrical and Computer Engineering at Florida Institute of Technology in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Computer Engineering Melbourne, Florida May 2018 ⃝c Copyright 2018 Azhar Sabah Abdulaziz All Rights Reserved The author grants permission to make single copies We the undersigned committee hereby recommend that the attached document be accepted as fulfilling in part the requirements for the degree of Doctor of Philosophy in Computer Engineering. “Automatic Speech Recognition Adaptation for Various Noise Levels”, a dissertation by Azhar Sabah Abdulaziz Veton Z. Kepuska,¨ Ph.D. Professor, Electrical and Computer Engineering Dissertation Advisor Samuel Kozaitis, Ph.D. Professor, Department Head Electrical and Computer Engineering Josko Zec, Ph.D. Associate Professor, Electrical and Computer Engineering Nezamoddin Nezamoddini-Kachouie, Ph.D. Assistant Professor, Mathematical Sciences Abstract TITLE: Automatic Speech Recognition Adaptation for Various Noise Levels AUTHOR: Azhar Sabah Abdulaziz MAJOR ADVISOR: Veton Z. Kepuska,¨ Ph.D. The automatic speech recognition (ASR) is a set of complicated algorithms that convert the intended spoken utterance into a textual form. Acoustic features, which are extracted from the speech signal, are matched against a trained network of linguistic and acoustic models. The ASR performance is degraded significantly when the ambient noise is different than that of the training data. Many approaches have been introduced to address this problem with various degrees of complexity and improvement rates. The general pattern of solving this issue lies in three categories: empowering features, train a general acoustic model and transform models to match noisy features. The acoustic noise is added to the training speech data after collecting them for two reasons: firstly because the data are usually recorded in a specific environment and secondly to control the environments during the training and testing phases. The speech and noise signals are usually combined in the electrical domain using straightforward linear addition. Although this procedure is commonly used, it is iii investigated in depth in this research. It has been proven that the linear addition is no more than an approximation of the real acoustic combination, and it is valid if the speech and noise are non-coherent signals. The adaptive model switching (AMS) solution is proposed, so that the ASR mea- sures the noise level then picks the model that should produce as minimum errors as possible. This solution is a trade-off between model generalization and transformation properties, so that both error and speed costs are maintained as minimum as possible. The short time of silence (STS), which is a signal-to-noise ratio (SNR) level detector, was designed specifically for the proposed system. The proposed AMS approach is a general recipe that could be applied to any other ASR systems, although it was tested on Gaussian Mixture Model-Hidden Markov Model (GMM-HMM) recognizer. The AMS ASR has outperformed the model generalization and multiple-decoder maximum score voting for both accuracy and decoding speed. The average error rate reduction was around 34.11% , with a decoding speed improvement of about 37.79% relatively, both compared to the baseline ASR. iv Contents Contents v List of Figures ix List of Tables xii Abbreviations xiii Acknowledgments xvi Dedication xvii 1 Introduction 1 1.1 Automatic Speech Recognition . 1 1.2 ASR in Noisy Environment . 3 1.3 Outline . 5 2 Review of Speech Recognition Technologies 6 2.1 Introduction . 6 2.2 The Motivations for the ASR Design . 7 2.3 The Automatic Speech Recognition Design . 9 2.3.1 Feature Extraction . 10 v 2.3.2 Decoder . 12 2.3.3 Knowledge Base . 15 2.4 Knowledge Sources Integration . 16 2.5 Decoder Methodologies . 18 2.5.1 Hidden Markov Model . 20 2.5.2 Neural Networks . 24 2.6 Towards Robust ASR . 27 2.6.1 Feature Enhancement . 27 2.6.2 Transformation . 31 2.6.3 Generalization . 32 2.6.4 Multiple Decoder Combination . 34 3 Acoustic Noise Simulation 35 3.1 Introduction . 35 3.2 The SPL and the Total SPL . 36 3.3 Total Microphone Voltage For N Acoustic Sources . 39 3.4 Experimental Results . 42 3.4.1 Monotonic Acoustic Signal Test . 43 3.4.2 Real Speech Audio Test . 46 3.5 The Divergence between the Two Approaches . 48 3.6 Conclusion . 53 4 Signal-to-Noise Ratio Estimation 54 4.1 Introduction . 54 4.2 Previous SNR Estimation Approaches . 55 4.2.1 The Statical Closed-Form Approach . 56 vi 4.2.2 The Audio Power Histogram Approach . 59 4.2.3 The Supervised Method Approach . 61 4.3 The Proposed SNR Estimation Approach . 63 4.3.1 The Short-Time of Silence SNR Estimator . 63 4.3.2 The Noise-Speech Power Offset . 67 4.4 Experimental Results . 69 4.5 Discussion . 73 5 Experimental Tools and Environments 77 5.1 Introduction . 77 5.2 CMU Sphinx ASR Toolkit . 78 5.3 Feature Extraction . 78 5.4 GMM-HMM Acoustic Model Training . 80 5.5 Sphinxtrain Tool Procedure . 84 5.6 The Data . 86 6 Acoustic Noise and ASR Performance 87 6.1 Introduction . 87 6.2 Performance Metrics . 88 6.3 Analyzing the Noise Effect on Performance . 89 6.3.1 Noise Type Effect . 90 6.3.2 Noise Level Effect . 92 6.4 Conclusion . 94 7 Adaptive Model Switching (AMS) Speech Recognition 95 7.1 Introduction . 95 7.2 System Design . 96 vii 7.3 Adaptive Model Switching ASR Training . 97 7.3.1 Acoustic Models Training . 97 7.3.2 STS-SNR Training . 99 7.4 AMS Decoding Algorithm . 101 7.5 Silence Samples Update . 103 7.6 Conclusion . 104 8 Results and Discussion 105 8.1 Introduction . 105 8.2 The word Error Rate (WER) . 106 8.2.1 White Noise Experiment . 107 8.2.2 Door Slam Noise Experiment . 109 8.2.3 Babble Noise Experiment . 111 8.3 WER Relative Improvement Comparison . 114 8.4 The Recognition Speed . 115 8.5 Conclusion . 118 9 Conclusions 120 Appendices 122 A Noisy TIMIT Corpus 123 A.1 TIMIT Corpus . 123 A.2 Noisy TIMIT Data Structure . 125 A.3 Noise Types and Ranges . 126 viii List of Figures 1.1 General schematic diagram for the LVCSR system . 2 2.1 Human speech production/perception process . 7 2.2 A simple speech production/perception model . 8 2.3 The LVCSR general modules’ functions . 9 2.4 PLP and MFCC analysis comparison . 11 2.5 Knowledge sources hierarchy for ASR . 15 2.6 ASR bottom-up approach for knowledge integration . 17 2.7 Top-down approach for knowledge integration . 18 2.8 Blackboard knowledge sources integration in ASR . 19 2.9 Two states simple Markov model . 21 2.10 HMM word model and sub-phoneme . 23 2.11 Deep neural network (DNN) architecture . 26 2.12 Conventional MFCC, RASTA-PLP and PNCC . 29 2.13 PNCC vs MFCC WER . 30 2.14 Joint training frame work example . 33 3.1 Monotonic acoustic signal synchronization . 44 3.2 Monotonic signal addition experiment . 45 3.3 Synchronization pulse train . 47 ix 3.4 The divergence analysis scenario . 49 3.5 Factors affecting vector-linear divergence . 52 4.1 The closed-form SNR response for frame-based noise . 58 4.2 The closed form SNR response for fixed 10 dB noise . 58 4.3 The NIST-STNR approach . 59 4.4 Comparison WADA and NIST SNR algorithms . 61 4.5 Regression, WADA and NIST SNR error . 62 4.6 The Short-Time-Silence estimator (STS-SNR) Algorithm . 65 4.7 The effect of STS step 4 . 66 4.8 Finding the offset for STS-SNR . 68 4.9 MAE results for NIST, WADA and STS using NOIZEUS corpus . 71 4.10 MAE of NIST, WADA and STS estimators using TIMIT corpus . 72 4.11 Mean estimation of NIST, WADA and STS using NOIZUS corpus . 73 4.12 Comparing different SNR estimators on for different noise types. 74 5.1 The speech features used in experiments . 79 5.2 A simple 3-state HMM . 82 6.1 ASR WER degradation in noisy environment . 90 6.2 Different noise types speech on AWGN AM . 91 6.3 Baeline ASR noise level test . 93 7.1 The proposed AMS speech recognition . 96 7.2 STS-SNR Estmator Block Diagram . 99 8.1 Multiple decoder maximum MAP decoder . 106 8.2 The AMS WER performance of TIMIT on AWGN . 107 x 8.3 The AMS WER performance of AN4 on AWGN . 108 8.4 The AMS WER performance of PDAmWSJ on AWGN . 108 8.5 The AMS WER performance of TIMIT on door slam noise . 109 8.6 The AMS WER performance of AN4 on door slam noise . 110 8.7 The AMS WER performance of PDAmWSJ on door slam noise . 110 8.8 Babble noise TIMIT test . 112 8.9 Babble noise AN4 test . 113 8.10 Babble noise PDAmWSJ test . 113 8.11 The averge decoding speed performance of TIMIT . 116 8.12 The average decoding speed performance for AN4 . 117 8.13 The average decoding performance for PDAmWSJ . 118 A.1 Noisy TIMIT Corpus directory structure . 125 A.2 The spectrum of different colors of noise. ..
Recommended publications
  • Noise Reduction Applied to Asteroseismology
    PERTURBATIONS OF OBSERVATIONS. NOISE REDUCTION APPLIED TO ASTEROSEISMOLOGY. Javier Pascual Granado CCD 21/10/2009 Contents • Noise characterization and time series analysis Introduction The colors of noise Some examples in astrophysics Spectral estimation • Noise reduction applied to asteroseismology Asteroseismology: Origin and Objetives CoRoT A new method for noise reduction: Phase Adding Method (PAM) Aplications: Numerical experiment Star HD181231 Star 102719279 Javier Pascual Granado CCD 21/10/2009 2 Introduction Javier Pascual Granado CCD 21/10/2009 3 The Colors Of Noise Pink noise Javier Pascual Granado CCD 21/10/2009 4 The Colors Of Noise Red noise Javier Pascual Granado CCD 21/10/2009 5 Some Examples In Astrophysics •Thermal noise due to a nonzero temperature is approximately white gaussian noise. • Photon-shot noise due to statistical fluctuations in the measurements has a Poisson distribution and a power increasing with frequency. It is a problem with weak signals. • Also seismic noise affect to sensible ground instruments like LIGO (gravitational wave detector). Nevertheless, for LISA the noise characterization adopted is a gaussian noise. •Observations of the black hole candidate X-ray binary Cyg X-1 by EXOSAT, show a continuum power spectrum with a pink noise. • In asteroseismology the noise is assumed to be of white gaussian type. Javier Pascual Granado CCD 21/10/2009 6 Time Series: Spectral Estimation A light curve (or a photometric time series) is a set of data points ordered In time. ti-1-ti might not be constant.
    [Show full text]
  • Intuilink Waveform Editor
    IntuiLink Waveform Editor The Agilent IntuiLink Arbitrary Waveform Editor application is an ActiveX document server. This application provides a graphical user interface (GUI) that allows you to create, import, modify, and export arbitrary waveforms, and to send these waveforms to the Agilent type 33120A, 33220A or 33250A Arbitrary Waveform (ARB) Generator instrument. The user interface is shown below with its main areas identified by numbers: Let's look at each of these numbered areas in turn. 1. Menu Bar - provides standard Microsoft Windows style menus. 2. Standard Toolbar - provides toolbar buttons for such standard menu functions as Save, Cut, Paste, and Print. 3. Waveform Toolbar - provides toolbar buttons for the arbitrary waveform Creation and Edit functions. In particular, these buttons allow you to Add waveform segments to the active waveform edit window. 4. Waveform Edit Windows - each of these windows allows you to create, edit, and display an arbitrary waveform. (Waveform defaults – see: Appendix A.) The contents of the active waveform edit window can be sent to an Agilent ARB Generator. 5. Status Bar - displays messages about the following: - General status messages. - The current mode. - In Select mode, the start and end positions and number of points selected. - In Marker mode: · When an X marker is being moved, the positions of both X markers and the difference between them. · When a Y marker is being moved, the positions of both Y markers and the difference between them. - In Freehand or Line Draw mode, the cursor position and the length of the waveform. - The address of an arbitrary waveform generator (for example: GPIB0::10::INSTR) if one is connected, or else Disconnected.
    [Show full text]
  • 3A Whatissound Part 2
    What is Sound? Part II Timbre & Noise Prayouandi (2010) - OneOhtrix Point Never 1 PSYCHOACOUSTICS ACOUSTICS LOUDNESS AMPLITUDE PITCH FREQUENCY QUALITY TIMBRE 2 Timbre / Quality everything that is not frequency / pitch or amplitude / loudness envelope - the attack, sustain, and decay portions of a sound spectra - the aggregate of simple waveforms (partials) that make up the frequency space of a sound. noise - the inharmonic and unpredictable fuctuations in the sound / signal 3 envelope 4 envelope ADSR 5 6 Frequency Spectrum 7 Spectral Analysis 8 Additive Synthesis 9 Organ Harmonics 10 Spectral Analysis 11 Cancellation and Reinforcement In-phase, out-of-phase and composite wave forms 12 (max patch) Tone as the sum of partials 13 harmonic / overtone series the fundamental is the lowest partial - perceived pitch A harmonic partial conforms to the overtone series which are whole number multiples of the fundamental frequency(f) (f)1, (f)2, (f)3, (f)4, etc. if f=110 110, 220, 330, 440 doubling = 1 octave An inharmonic partial is outside of the overtone series, it does not have a whole number multiple relationship with the fundamental. 14 15 16 Basic Waveforms fundamental only, no additional harmonics odd partials only (1,3,5,7...) 1 / p2 (3rd partial has 1/9 the energy of the fundamental) all partials 1 / p (3rd partial has 1/3 the energy of the fundamental) only odd-numbered partials 1 / p (3rd partial has 1/3 the energy of the fundamental) 17 (max patch) Spectrogram (snapshot) 18 Identifying Different Instruments 19 audio sonogram of 2 bird trills 20 Spear (software) audio surgery? isolate partials within a complex sound 21 the physics of noise Random additions to a signal By fltering white noise, we get different types (colors) of noise, parallels to visible light White Noise White noise is a random noise that contains an equal amount of energy in all frequency bands.
    [Show full text]
  • Musical Elements in the Discrete-Time Representation of Sound
    0 Musical elements in the discrete-time representation of sound RENATO FABBRI, University of Sao˜ Paulo VILSON VIEIRA DA SILVA JUNIOR, Cod.ai ANTONIOˆ CARLOS SILVANO PESSOTTI, Universidade Metodista de Piracicaba DEBORA´ CRISTINA CORREA,ˆ University of Western Australia OSVALDO N. OLIVEIRA JR., University of Sao˜ Paulo e representation of basic elements of music in terms of discrete audio signals is oen used in soware for musical creation and design. Nevertheless, there is no unied approach that relates these elements to the discrete samples of digitized sound. In this article, each musical element is related by equations and algorithms to the discrete-time samples of sounds, and each of these relations are implemented in scripts within a soware toolbox, referred to as MASS (Music and Audio in Sample Sequences). e fundamental element, the musical note with duration, volume, pitch and timbre, is related quantitatively to characteristics of the digital signal. Internal variations of a note, such as tremolos, vibratos and spectral uctuations, are also considered, which enables the synthesis of notes inspired by real instruments and new sonorities. With this representation of notes, resources are provided for the generation of higher scale musical structures, such as rhythmic meter, pitch intervals and cycles. is framework enables precise and trustful scientic experiments, data sonication and is useful for education and art. e ecacy of MASS is conrmed by the synthesis of small musical pieces using basic notes, elaborated notes and notes in music, which reects the organization of the toolbox and thus of this article. It is possible to synthesize whole albums through collage of the scripts and seings specied by the user.
    [Show full text]
  • White Noise Generator Circuitry and Analysis
    UNIVERSITY OF TEXAS AT SAN ANTONIO White Noise Generator Circuitry and Analysis David Sanchez 8/4/2008 Table of Contents Introduction .................................................................................................................................................. 3 Overview of the Circuit ................................................................................................................................ 3 Circuit Subsets .............................................................................................................................................. 4 Noise Generation Stage ........................................................................................................................ 4 Amplification of the Noise Signal .......................................................................................................... 6 Active Low-Pass Filter ........................................................................................................................... 7 Audio Output Stage............................................................................................................................... 8 Analysis ......................................................................................................................................................... 9 Discussion ................................................................................................................................................... 11 Conclusion .................................................................................................................................................
    [Show full text]
  • Deep Point Correlation Design
    Deep Point Correlation Design THOMAS LEIMKÜHLER, MPI Informatik, Saarland Informatics Campus GURPRIT SINGH, MPI Informatik, Saarland Informatics Campus KAROL MYSZKOWSKI, MPI Informatik, Saarland Informatics Campus HANS-PETER SEIDEL, MPI Informatik, Saarland Informatics Campus TOBIAS RITSCHEL, University College London 10,000 10D points / s l1(radSpec(X)- BNOT)+ l1(spec(x(X))- STEP) 2D 1D 2D 1D Applications Point pattern Point Neural network Neural Design specication Design Result spectra Target spectra MC rendering Object placement Dithering Fig. 1. We suggest a framework for designing correlated point patterns. A design specification (left) is used to train an architecture mapping random points to correlated ones (middle). This architecture produces ten-thousands of points per second with use in rendering, object placement or dithering (right). Designing point patterns with desired properties can require substantial ACM Reference Format: effort, both in hand-crafting coding and mathematical derivation. Retaining Thomas Leimkühler, Gurprit Singh, Karol Myszkowski, Hans-Peter Seidel, these properties in multiple dimensions or for a substantial number of points and Tobias Ritschel. 2019. Deep Point Correlation Design. ACM Trans. Graph. can be challenging and computationally expensive. Tackling those two issues, 38, 6, Article 226 (November 2019), 17 pages. https://doi.org/10.1145/3355089. we suggest to automatically generate scalable point patterns from design 3356562 goals using deep learning. We phrase pattern generation as a deep compo- sition of weighted distance-based unstructured filters. Deep point pattern design means to optimize over the space of all such compositions according to 1 INTRODUCTION a user-provided point correlation loss, a small program which measures a pat- Point patterns have many important uses in computer graphics, tern’s fidelity in respect to its spatial or spectral statistics, linear or non-linear linking apparently disparate topics such as natural placement of (e.
    [Show full text]
  • State Estimation Using Gaussian Process Regression for Colored Noise Systems
    State Estimation using Gaussian Process Regression for Colored Noise Systems Kyuman Lee Eric N. Johnson School of Aerospace Engineering School of Aerospace Engineering Georgia Institute of Technology Georgia Institute of Technology Atlanta, GA 30332 Atlanta, GA 30332 404-422-3697 404-385-2519 [email protected] [email protected] Abstract—The goal of this study is to use Gaussian process (GP) the times of interest in the system; for brevity such errors are regression models to estimate the state of colored noise systems. called “colored” noise [1]. In audio engineering, electronics, The derivation of a Kalman filter assumes that the process noise physics, and many other fields, the color of a noise signal and measurement noise are uncorrelated and both white. In is generally understood to be some broad characteristic of relaxing those assumptions, the Kalman filter equations were its power spectrum. Different colors of noise have signif- modified to deal with the non-whiteness of each noise source. The standard Kalman filter ran on an augmented system that icantly different properties: for example, as audio signals had white noises and other approaches were also introduced they will sound different to human ears, and as images they depending on the forms of the noises. Those existing methods will have a visibly different texture. In image processing, can only work when the characteristics of the colored noise are the quality of images will directly influence the accuracy perfectly known. However, it is usually difficult to model a noise of locating landmarks. During sampling and transmission, without additional knowledge of the noise statistics.
    [Show full text]
  • Characterization and Comparison of Optical Source Relative Intensity Noise and Effects in Optical Coherence Tomography
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Illinois Digital Environment for Access to Learning and Scholarship Repository CHARACTERIZATION AND COMPARISON OF OPTICAL SOURCE RELATIVE INTENSITY NOISE AND EFFECTS IN OPTICAL COHERENCE TOMOGRAPHY BY SUNGHWAN SHIN THESIS Submitted in partial fulfillment of the requirements for the degree of Master of Science in Electrical and Computer Engineering in the Graduate College of the University of Illinois at Urbana-Champaign, 2010 Urbana, Illinois Adviser: Professor Stephen A. Boppart ABSTRACT This thesis research investigates the effects of optical source noise in optical coherence tomography (OCT). While shot noise, thermal noise and relative intensity noise (RIN) (excess photon noise) constitute optical noise, RIN is the least understood and often the most significant factor limiting the sensitivity of an optical system. The existing and prevalent theory being used for estimating RIN for various light sources in OCT is questionable, and cannot be applied uniformly for different types of sources. The origin of noise in various sources varies significantly, owing to the different physical nature of photon generation. In this thesis, RIN of several OCT light sources are characterized and compared. Light sources include a super-luminescent diode (SLD), an erbium-doped fiber amplifier (EDFA), multiplexed SLDs, and a continuous wave (cw) laser. A method for the reduction of RIN by amplifying the SLD light output by using a gain-saturated semiconductor optical amplifier (SOA) is reported. Also, measurements using a time-domain OCT (TD-OCT) system are performed to verify the effects of optical source noise reduction on OCT data.
    [Show full text]
  • Investigation of Improved Masking Noise for the Speech Privacy
    INVESTIGATION OF IMPROVED MASKING NOISE FOR THE SPEECH PRIVACY by Allen Cho APPROVED BY SUPERVISORY COMMITTEE: ___________________________________________ Dr. Issa M. S. Panahi, Chair ___________________________________________ Dr. John H. L. Hansen ___________________________________________ Dr. P. K. Rajasekaran Copyright 2016 Allen Cho All Rights Reserved For the glory of God by the grace of God INVESTIGATION OF IMPROVED MASKING NOISE FOR THE SPEECH PRIVACY by ALLEN CHO, BS THESIS Presented to the Faculty of The University of Texas at Dallas in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE IN ELECTRICAL ENGINEERING THE UNIVERSITY OF TEXAS AT DALLAS December 2016 ACKNOWLEDGMENTS First and above all, I praise God almighty for providing me this opportunity and granting me the capability to proceed successfully. This thesis has come to its present form thanks to the assistance and guidance of several people. I would, therefore, like to offer my sincere thanks to all of them. Foremost, I would like to express my sincere gratitude to my advisor Dr. Issa Panahi for his continuous support for my Master’s degree study and research as well as for his patience, motivation, enthusiasm, and immense knowledge. His guidance helped me throughout my research and writing of this thesis. I could not have imagined having a better advisor and mentor for my graduate study. Besides my advisor, I would like to thank the rest of my thesis committee: Dr. P. K. Rajasekaran, and Prof. John Hansen for their encouragement, insightful comments, and challenging questions. My sincere thanks also goes to Speech Privacy Systems for offering me the valuable opportunity to study and research with their groups and leading me to work on diverse exciting projects.
    [Show full text]
  • Word Boundary Detection
    University of New Hampshire University of New Hampshire Scholars' Repository Master's Theses and Capstones Student Scholarship Spring 2007 Word boundary detection Deepak Jadhav University of New Hampshire, Durham Follow this and additional works at: https://scholars.unh.edu/thesis Recommended Citation Jadhav, Deepak, "Word boundary detection" (2007). Master's Theses and Capstones. 268. https://scholars.unh.edu/thesis/268 This Thesis is brought to you for free and open access by the Student Scholarship at University of New Hampshire Scholars' Repository. It has been accepted for inclusion in Master's Theses and Capstones by an authorized administrator of University of New Hampshire Scholars' Repository. For more information, please contact [email protected]. WORD BOUNDARY DETECTION BY DEEPAK JADHAV B.E. Babasaheb Ambedkar Marathwada University, India, 2000 THESIS Submitted to the University of New Hampshire in Partial Fulfillment of the Requirements for the Degree of Masters of Science In Electrical Engineering May 2007 Reproduced with permission of the copyright owner. Further reproduction prohibited without permission. UMI Number: 1443609 INFORMATION TO USERS The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleed-through, substandard margins, and improper alignment can adversely affect reproduction. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. ® UMI UMI Microform 1443609 Copyright 2007 by ProQuest Information and Learning Company. All rights reserved.
    [Show full text]
  • A Survey of Blue-Noise Sampling and Its Applications
    Yan DM, Guo JW, Wang B et al. A survey of blue-noise sampling and its applications. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 30(3): 439–452 May 2015. DOI 10.1007/s11390-015-1535-0 A Survey of Blue-Noise Sampling and Its Applications 1,2 2 3 ý² Àïå Ê Dong-Ming Yan (î ), Member, CCF, ACM, Jian-Wei Guo ( ), Bin Wang ( ) 2 1,4 ¡· Xiao-Peng Zhang (Ü ), and Peter Wonka 1Visual Computing Center, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia 2National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China 3School of Software, Tsinghua University, Beijing 100084, China 4Department of Computer Science and Engineering, Arizona State University, Tempe, AZ 85287, U.S.A. E-mail: [email protected]; [email protected]; [email protected] E-mail: [email protected]; [email protected] Received March 9, 2015; revised April 2, 2015. Abstract In this paper, we survey recent approaches to blue-noise sampling and discuss their beneficial applications. We discuss the sampling algorithms that use points as sampling primitives and classify the sampling algorithms based on various aspects, e.g., the sampling domain and the type of algorithm. We demonstrate several well-known applications that can be improved by recent blue-noise sampling techniques, as well as some new applications such as dynamic sampling and blue-noise remeshing. Keywords blue-noise sampling, Poisson-disk sampling, Lloyd relaxation, rendering, remeshing 1 Introduction concentrated spikes in energy. Intuitively, blue-noise sampling generates randomized uniform distributions.
    [Show full text]
  • Electronic Circuit Design for Audio Noise Elimination
    DOKUZ EYLÜL UNIVERSITY GRADUATE SCHOOL OF NATURAL AND APPLIED SCIENCES ELECTRONIC CIRCUIT DESIGN FOR AUDIO NOISE ELIMINATION by Şekibe Şebnem TAS July, 2008 İZMİR ELECTRONIC CIRCUIT DESIGN FOR AUDIO NOISE ELIMINATION A Thesis Submitted to the Graduate School of Natural and Applied Sciences of Dokuz Eylül University In Partial Fulfillment of the Requirements for the Degree of Master of Science in Electrical and Electronics Engineering by Şekibe Şebnem TAS July, 2008 İZMİR M.Sc THESIS EXAMINATION RESULT FORM We have read the thesis entitled “ELECTRONIC CIRCUIT DESIGN FOR AUDIO NOISE ELIMINATION” completed by ŞEKİBE ŞEBNEM TAS under supervision of ASSOC. PROF. DR. UĞUR ÇAM and we certify that in our opinion it is fully adequate, in scope and in quality, as a thesis for the degree of Master of Science. Assoc. Prof. Dr. Uğur ÇAM (Jury Member) (Jury Member) Prof.Dr. Cahit HELVACI Director Graduate School of Natural and Applied Sciences ii ACKNOWLEDGEMENTS I would like to thank my advisor Assoc. Prof. Dr. Uğur Çam for his support, encouragement and valuable guidance. I am grateful to him for letting me share his professional knowledge and experience. I would like to thank TÜBİTAK for supporting me financially during my master study. I also would like to thank my colleagues at Vestel R&D department who helped me a lot and gave me the oppurtunity to use R&D laboratories during my research. I am also grateful to Kürşat Sarıarslan for his help and for sharing his experience with me during my study. Finally, I would like to thank my parents for always helping and supporting me throughout my life.
    [Show full text]