Compressive Sensing

Total Page:16

File Type:pdf, Size:1020Kb

Compressive Sensing Compressive Sensing Mike Wakin 1 Introduction: Filling in the blanks 2 3 4 5 6 7 Two Key Ingredients 1. Model: “How is the signal supposed to behave?” 2. Algorithm: “How can I fill in the blanks so that the signal obeys the model?” 8 Data‐Rich Environments • The Big Data Era – Social networks – Scientific instrumentation – Hyperspectral imaging – Video 9 Challenges • Storage, transmission, processing • Conventional solution: collect all of the data then compress it • New solution: collect less data 10 Data‐Poor Environments • Limitations: – size, power, weight • embedded systems – cost • sensing hardware • seismic surveys – time • magnetic resonance imaging – communications bandwidth • wireless sensor networks • Make the most of the data we can collect 11 Compressive Sensing (CS) in a Nutshell Using signal models to fill in the blanks Topics: 1. What are signal models? 2. What do we mean by “blanks”? 3. How can we fill in the blanks? 4. How can we understand this process? 5. What (else) can we do with these ideas? 12 1. What are signal models? 13 Sinusoids • Sinusoids are fundamental to signal processing. • Many real‐world phenomena travel in waves or are generated by harmonic motion. – electromagnetic waves – acoustic waves – seismic waves • Many real‐world systems propagate sinusoids in a natural way: a sinusoidal input produces a sinusoidal output of exactly the same frequency. 14 Fourier Transform (1807) = + + + … + 15 Example Audio Signal 16 Frequency Spectrum 17 Bandlimited Signal Model bandwidth: highest effective frequency 18 Features of the Model • Expressed in terms of a transform: Fourier transform • Variable complexity: higher bandwidths allow for broader classes of signals but at the expense of greater complexity 19 Shannon/Nyquist Sampling Theorem (1920s‐1940s) • Theorem: Any bandlimited signal can be perfectly reconstructed from its samples, provided that the sampling frequency is at least twice the signal’s bandwidth. • Sampling at the information level: – sampling frequency is proportional to bandwidth – as model complexity increases, must take more samples (per second) 20 Example Signal 21 Spectrum bandwidth: 100Hz 22 “Nyquist‐Rate” Samples 1/200 sec 23 Filling in the Blanks 24 Smooth Interpolation 25 Implications • For bandlimited signals it is possible to perfectly fill in the blanks between adjacent samples. • There are many potential ways to “connect the dots” but only one of these will have the correct bandwidth. • All of the others are known as aliases –they will all have higher bandwidths. 26 An Alias 27 Alias Spectrum 28 Digital Signal Processing (DSP) Revolution • Sampling theorem – sample signals without loss of information and process them on a computer • Advances in computing – Moore’s law • Algorithms such as the Fast Fourier Transform – first discovered by Gauss (1805) – rediscovered by Cooley and Tukey (1965) 29 From Bandlimitedness to Sparsity • Time‐frequency analysis frequency content may change over time • Sinusoids go on forever …… • Short‐time Fourier transform • Wigner–Ville distribution (1932) • Gabor transform (1946) 30 Recall: Audio Signal 31 Frequency Spectrum 32 Spectrogram 33 Wavelet Analysis (1980s‐1990s) [Morlet, Grossman, Meyer, Daubechies, Mallat, …] Wavelet analysis –multiscale –local –computationally efficient 34 Wavelet Coefficients • Which parts of a signal have high frequency behavior? N‐pixel image N wavelet coefficients 35 Wavelets as Edge Detectors 36 Wavelets as Building Blocks = ++ ‐ ‐ + + … + + + + + … 37 Sparsity in Wavelet Coefficients few large coefficients (just “K”) = ++ ‐ ‐ + + … + + + + + … many small coefficients (N coefficients in total) 38 Wavelet Advantage • Wavelets capture the energy of many natural signals more economically than sinusoids • Example signal: 39 Compression: Wavelet vs. Fourier 40 Sparsity in General • Sparsity can exist in many domains – time (spike trains) – frequency (sparse spectrum) – wavelets (piecewise smooth signals) – curvelets (piecewise smooth images) • The 1990s and 2000s witnessed many new Xlets – shearlets, bandelets, vaguelettes, contourlets, chirplets, ridgelets, beamlets, brushlets, wedgelets, platelets, surflets, seislets, … 41 The Curvelet Transform (1999) [Candès and Donoho] •Sparse representation for piecewise smooth images with smooth discontinuities 42 Curvelets and the Wave Equation [Candès and Demanet] propagation of a curvelet sparse sum of curvelets at nearby scales, orientations, locations 43 Dictionary Learning • Sparse transforms can also be learned from collections of training data • Example algorithm: K‐SVD (2006) [Aharon et al.] • Demonstration: image patches 44 Learned Dictionary 45 Compression Performance 46 Compression Performance 47 Implications of Sparsity • When we sample a signal (say to obtain N samples or pixels), there may be some residual structure in those samples. • This could permit us to further compress the samples. • Or… we could reduce the number of samples we collect, and just fill in the blanks later. 48 2. What do we mean by “blanks”? 49 New Ways of Sampling • Recall the two ingredients: 1. Model: “How is the signal supposed to behave?” 2. Algorithm: “How can I fill in the blanks so that the signal obeys the model?” • Now, we will suppose that our signal is sparse in some particular transform. • We must be careful about how we “measure” the signal so that we don’t miss anything important. 50 Don’t • Don’t just take uniform samples more slowly. • Aliasing becomes a risk –there will be more than one way to fill in the blanks using a sparse signal. 51 Example Signal with Sparse Spectrum Nyquist‐rate samples 52 Sparse Spectrum 53 Uniform Samples (5x Sub‐Nyquist) uniform sub‐Nyquist samples 54 Uniform Samples (5x Sub‐Nyquist) uniform sub‐Nyquist samples 55 Alias with Sparse Spectrum 56 Original Sparse Spectrum 57 Sparse Spectrum of the Alias 58 Do • Do take measurements that are “incoherent” with the sparse structure (we will explain). • This ensures that there is only one way to fill in the blanks using a signal that is sparse. • Incoherence is often achieved using: (1) non‐uniform samples or (2) generic linear measurements along with some amount of randomness. 59 Non‐uniform Sampling Nyquist‐rate samples non‐uniform samples 60 Common Approach: Sub‐sample Nyquist Grid Sample/retain only a random subset of the Nyquist samples 61 Example: Non‐uniform Sampler [with Northrop Grumman, Caltech, Georgia Tech] • Underclock a standard ADC to capture spectrally sparse signals • Prototype system: ‐ captures 1.2GHz+ bandwidth (800MHz to 2GHz) with 400MHz ADC underclocked to 236MHz (10x sub‐Nyquist) ‐ up to 100MHz total occupied spectrum 62 Generic Linear Measurements Nyquist‐rate samples (don’t record these directly) Generic linear measurements 63 Generic Linear Measurements Nyquist‐rate samples (don’t record these directly) y(1) = 3.2 (pixel 1) 2.7 (pixel 2) + ∗ − ∗ ··· y(2) = 8.1 (pixel 1) + 0.3 (pixel 2) + Generic linear − ∗ ∗ ··· . measurements . y(M)=4.2 (pixel 1) + 1.7 (pixel 2) + ∗ ∗ ··· 64 Then What? • Filling in the blanks means reconstructing the original N Nyquist‐rate samples from the measurements. y(1) = 3.2 (pixel 1) 2.7 (pixel 2) + ∗ − ∗ ··· y(2) = 8.1 (pixel 1) + 0.3 (pixel 2) + − ∗ ∗ ··· . y(M)=4.2 (pixel 1) + 1.7 (pixel 2) + ∗ ∗ ··· 65 How Many Measurements? • N: number of conventional Nyquist‐rate samples • K: sparsity level in a particular transform domain • Sufficient number of incoherent measurements M ≈ K log N • Sampling at the information level: – sampling frequency is proportional to sparsity – as model complexity increases, must take more measurements 66 Random Modulation Pre‐Integrator (RMPI) [with Northrop Grumman, Caltech, Georgia Tech] • RMPI receiver ‐ four parallel “random demodulator” channels ‐ effective instantaneous bandwidth spanning 100MHz—2.5GHz ‐ 385 MHz measurement rate (13x sub‐Nyquist) • Goal: identify radar pulse descriptor words (PDWs) RMPI receiver discrete, low‐rate, radar pulses (analog) information‐carrying measurements 67 RMPI Architecture 68 Single‐Pixel Camera [Baraniuk and Kelly, et al.] single photon detector random pattern on DMD array 69 Single‐Pixel Camera –Results N = 4096 pixels M = 1600 measurements (40%) true color low‐light imaging 256 x 256 image with 10:1 compression 70 Data‐Poor Situations in the Wild • Incomplete measurements arise naturally – missing/corrupted sample streams – missing seismic traces – medical imaging • Can try to use sparse models to fill in the blanks, even if there was nothing random about the sampling/measurement process. 71 Medical Imaging Space domain Fourier coefficients Sampling pattern 256x256 256x256 71% undersampled Min. TV, 34.23dB Backproj., 29.00dB [Candès, Romberg] 72 Contrast‐enhanced 3D angiography [Lustig, Donoho, Pauly] 73 3. How can we fill in the blanks? 74 New Ways of Processing Samples • Recall the two ingredients: 1. Model: “How is the signal supposed to behave?” 2. Algorithm: “How can I fill in the blanks so that the signal obeys the model?” • We suppose that our signal is sparse in some particular transform (e.g., Fourier or wavelets). • We must fill in the blanks so that the signal obeys this sparse model. 75 Don’t • Don’t use smooth interpolation to fill in the blanks between samples. • This will fail to recover high‐frequency features. 76 Recall: Signal with Sparse Spectrum Nyquist‐rate samples 77 Sparse Spectrum 78 Non‐Uniform Samples (5x Sub‐Nyquist) non‐uniform samples 79 Non‐Uniform Samples (5x Sub‐Nyquist) non‐uniform samples 80 Smooth Interpolation 81 Spectrum of Smooth Interpolation
Recommended publications
  • An Overview of Wavelet Transform Concepts and Applications
    An overview of wavelet transform concepts and applications Christopher Liner, University of Houston February 26, 2010 Abstract The continuous wavelet transform utilizing a complex Morlet analyzing wavelet has a close connection to the Fourier transform and is a powerful analysis tool for decomposing broadband wavefield data. A wide range of seismic wavelet applications have been reported over the last three decades, and the free Seismic Unix processing system now contains a code (succwt) based on the work reported here. Introduction The continuous wavelet transform (CWT) is one method of investigating the time-frequency details of data whose spectral content varies with time (non-stationary time series). Moti- vation for the CWT can be found in Goupillaud et al. [12], along with a discussion of its relationship to the Fourier and Gabor transforms. As a brief overview, we note that French geophysicist J. Morlet worked with non- stationary time series in the late 1970's to find an alternative to the short-time Fourier transform (STFT). The STFT was known to have poor localization in both time and fre- quency, although it was a first step beyond the standard Fourier transform in the analysis of such data. Morlet's original wavelet transform idea was developed in collaboration with the- oretical physicist A. Grossmann, whose contributions included an exact inversion formula. A series of fundamental papers flowed from this collaboration [16, 12, 13], and connections were soon recognized between Morlet's wavelet transform and earlier methods, including harmonic analysis, scale-space representations, and conjugated quadrature filters. For fur- ther details, the interested reader is referred to Daubechies' [7] account of the early history of the wavelet transform.
    [Show full text]
  • A Low Bit Rate Audio Codec Using Wavelet Transform
    Navpreet Singh, Mandeep Kaur,Rajveer Kaur / International Journal of Engineering Research and Applications (IJERA) ISSN: 2248-9622 www.ijera.com Vol. 3, Issue 4, Jul-Aug 2013, pp.2222-2228 An Enhanced Low Bit Rate Audio Codec Using Discrete Wavelet Transform Navpreet Singh1, Mandeep Kaur2, Rajveer Kaur3 1,2(M. tech Students, Department of ECE, Guru kashi University, Talwandi Sabo(BTI.), Punjab, INDIA 3(Asst. Prof. Department of ECE, Guru Kashi University, Talwandi Sabo (BTI.), Punjab,INDIA Abstract Audio coding is the technology to represent information and perceptually irrelevant signal audio in digital form with as few bits as possible components can be separated and later removed. This while maintaining the intelligibility and quality class includes techniques such as subband coding; required for particular application. Interest in audio transform coding, critical band analysis, and masking coding is motivated by the evolution to digital effects. The second class takes advantage of the communications and the requirement to minimize statistical redundancy in audio signal and applies bit rate, and hence conserve bandwidth. There is some form of digital encoding. Examples of this class always a tradeoff between lowering the bit rate and include entropy coding in lossless compression and maintaining the delivered audio quality and scalar/vector quantization in lossy compression [1]. intelligibility. The wavelet transform has proven to Digital audio compression allows the be a valuable tool in many application areas for efficient storage and transmission of audio data. The analysis of nonstationary signals such as image and various audio compression techniques offer different audio signals. In this paper a low bit rate audio levels of complexity, compressed audio quality, and codec algorithm using wavelet transform has been amount of data compression.
    [Show full text]
  • Image Compression Techniques by Using Wavelet Transform
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by International Institute for Science, Technology and Education (IISTE): E-Journals Journal of Information Engineering and Applications www.iiste.org ISSN 2224-5782 (print) ISSN 2225-0506 (online) Vol 2, No.5, 2012 Image Compression Techniques by using Wavelet Transform V. V. Sunil Kumar 1* M. Indra Sena Reddy 2 1. Dept. of CSE, PBR Visvodaya Institute of Tech & Science, Kavali, Nellore (Dt), AP, INDIA 2. School of Computer science & Enginering, RGM College of Engineering & Tech. Nandyal, A.P, India * E-mail of the corresponding author: [email protected] Abstract This paper is concerned with a certain type of compression techniques by using wavelet transforms. Wavelets are used to characterize a complex pattern as a series of simple patterns and coefficients that, when multiplied and summed, reproduce the original pattern. The data compression schemes can be divided into lossless and lossy compression. Lossy compression generally provides much higher compression than lossless compression. Wavelets are a class of functions used to localize a given signal in both space and scaling domains. A MinImage was originally created to test one type of wavelet and the additional functionality was added to Image to support other wavelet types, and the EZW coding algorithm was implemented to achieve better compression. Keywords: Wavelet Transforms, Image Compression, Lossless Compression, Lossy Compression 1. Introduction Digital images are widely used in computer applications. Uncompressed digital images require considerable storage capacity and transmission bandwidth. Efficient image compression solutions are becoming more critical with the recent growth of data intensive, multimedia based web applications.
    [Show full text]
  • JPEG2000: Wavelets in Image Compression
    EE678 WAVELETS APPLICATION ASSIGNMENT 1 JPEG2000: Wavelets In Image Compression Group Members: Qutubuddin Saifee [email protected] 01d07009 Ankur Gupta [email protected] 01d070013 Nishant Singh [email protected] 01d07019 Abstract During the past decade, with the birth of wavelet theory and multiresolution analysis, image processing techniques based on wavelet transform have been extensively studied and tremendously improved. JPEG 2000 uses wavelet transform and provides an integrated toolbox to better address increasing needs for compression. In this report, we study the basic concepts of JPEG2000, the LeGall 5/3 and Daubechies 9/7 wavelets used in it and finally Embedded zerotree wavelet coding and Set Partitioning in Hierarchical Trees. Index Terms Wavelets, JPEG2000, Image Compression, LeGall, Daubechies, EZW, SPIHT. I. INTRODUCTION I NCE the mid 1980s, members from both the International Telecommunications Union (ITU) and the International SOrganization for Standardization (ISO) have been working together to establish a joint international standard for the compression of grayscale and color still images. This effort has been known as JPEG, the Joint Photographic Experts Group. The process was such that, after evaluating a number of coding schemes, the JPEG members selected a discrete cosine transform( DCT)-based method in 1988. From 1988 to 1990, the JPEG group continued its work by simulating, testing and documenting the algorithm. JPEG became Draft International Standard (DIS) in 1991 and International Standard (IS) in 1992. With the continual expansion of multimedia and Internet applications, the needs and requirements of the technologies used grew and evolved. In March 1997, a new call for contributions was launched for the development of a new standard for the compression of still images, the JPEG2000 standard.
    [Show full text]
  • Comparison of Image Compressions: Analog Transformations P
    Proceedings Comparison of Image Compressions: Analog † Transformations P Jose Balsa P CITIC Research Center, Universidade da Coruña (University of A Coruña), 15071 A Coruña, Spain; [email protected] † Presented at the 3rd XoveTIC Conference, A Coruña, Spain, 8–9 October 2020. Published: 21 August 2020 Abstract: A comparison between the four most used transforms, the discrete Fourier transform (DFT), discrete cosine transform (DCT), the Walsh–Hadamard transform (WHT) and the Haar- wavelet transform (DWT), for the transmission of analog images, varying their compression and comparing their quality, is presented. Additionally, performance tests are done for different levels of white Gaussian additive noise. Keywords: analog image transformation; analog image compression; analog image quality 1. Introduction Digitized image coding systems employ reversible mathematical transformations. These transformations change values and function domains in order to rearrange information in a way that condenses information important to human vision [1]. In the new domain, it is possible to filter out relevant information and discard information that is irrelevant or of lesser importance for image quality [2]. Both digital and analog systems use the same transformations in source coding. Some examples of digital systems that employ these transformations are JPEG, M-JPEG, JPEG2000, MPEG- 1, 2, 3 and 4, DV and HDV, among others. Although digital systems after transformation and filtering make use of digital lossless compression techniques, such as Huffman. In this work, we aim to make a comparison of the most commonly used transformations in state- of-the-art image compression systems. Typically, the transformations used to compress analog images work either on the entire image or on regions of the image.
    [Show full text]
  • Wavelet Transform for JPG, BMP & TIFF
    Wavelet Transform for JPG, BMP & TIFF Samir H. Abdul-Jauwad Mansour I. AlJaroudi Electrical Engineering Department, Electrical Engineering Department, King Fahd University of Petroleum & King Fahd University of Petroleum & Minerals Minerals Dhahran 31261, Saudi Arabia Dhahran 31261, Saudi Arabia ABSTRACT This paper presents briefly both wavelet transform and inverse wavelet transform for three different images format; JPG, BMP and TIFF. After brief historical view of wavelet, an introduction will define wavelets transform. By narrowing the scope, it emphasizes the discrete wavelet transform (DWT). A practical example of DWT is shown, by choosing three images and applying MATLAB code for 1st level and 2nd level wavelet decomposition then getting the inverse wavelet transform for the three images. KEYWORDS: Wavelet Transform, JPG, BMP, TIFF, MATLAB. __________________________________________________________________ Correspondence: Dr. Samir H. Abdul-Jauwad, Electrical Engineering Department, King Fahd University of Petroleum & Mineral, Dhahran 1261, Saudi Arabia. Email: [email protected] Phone: +96638602500 Fax: +9663860245 INTRODUCTION In 1807, theories of frequency analysis by Joseph Fourier were the main lead to wavelet. However, wavelet was first appeared in an appendix to the thesis of A. Haar in 1909 [1]. Compact support was one property of the Haar wavelet which means that it vanishes outside of a finite interval. Unfortunately, Haar wavelets have some limits, because they are not continuously differentiable. In 1930, work done separately by scientists for representing the functions by using scale-varying basis functions was the key to understanding wavelets. In 1980, Grossman and Morlet, a physicist and an engineer, provided a way of thinking for wavelets based on physical intuition through defining wavelets in the context of quantum physics [3].
    [Show full text]
  • The Wavelet Tutorial Second Edition Part I by Robi Polikar
    THE WAVELET TUTORIAL SECOND EDITION PART I BY ROBI POLIKAR FUNDAMENTAL CONCEPTS & AN OVERVIEW OF THE WAVELET THEORY Welcome to this introductory tutorial on wavelet transforms. The wavelet transform is a relatively new concept (about 10 years old), but yet there are quite a few articles and books written on them. However, most of these books and articles are written by math people, for the other math people; still most of the math people don't know what the other math people are 1 talking about (a math professor of mine made this confession). In other words, majority of the literature available on wavelet transforms are of little help, if any, to those who are new to this subject (this is my personal opinion). When I first started working on wavelet transforms I have struggled for many hours and days to figure out what was going on in this mysterious world of wavelet transforms, due to the lack of introductory level text(s) in this subject. Therefore, I have decided to write this tutorial for the ones who are new to the topic. I consider myself quite new to the subject too, and I have to confess that I have not figured out all the theoretical details yet. However, as far as the engineering applications are concerned, I think all the theoretical details are not necessarily necessary (!). In this tutorial I will try to give basic principles underlying the wavelet theory. The proofs of the theorems and related equations will not be given in this tutorial due to the simple assumption that the intended readers of this tutorial do not need them at this time.
    [Show full text]
  • Discrete Generalizations of the Nyquist-Shannon Sampling Theorem
    DISCRETE SAMPLING: DISCRETE GENERALIZATIONS OF THE NYQUIST-SHANNON SAMPLING THEOREM A DISSERTATION SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY William Wu June 2010 © 2010 by William David Wu. All Rights Reserved. Re-distributed by Stanford University under license with the author. This work is licensed under a Creative Commons Attribution- Noncommercial 3.0 United States License. http://creativecommons.org/licenses/by-nc/3.0/us/ This dissertation is online at: http://purl.stanford.edu/rj968hd9244 ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Brad Osgood, Primary Adviser I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Thomas Cover I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. John Gill, III Approved for the Stanford University Committee on Graduate Studies. Patricia J. Gumport, Vice Provost Graduate Education This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file in University Archives. iii Abstract 80, 0< 11 17 æ 80, 1< 5 81, 0< 81, 1< 12 83, 1< 84, 0< 82, 1< 83, 4< 82, 3< 15 7 6 83, 2< 83, 3< 84, 8< 83, 6< 83, 7< 13 3 æ æ æ æ æ æ æ æ æ æ 9 -5 -4 -3 -2 -1 1 2 3 4 5 84, 4< 84, 13< 84, 14< 1 HIS DISSERTATION lays a foundation for the theory of reconstructing signals from T finite-dimensional vector spaces using a minimal number of samples.
    [Show full text]
  • A Novel Robust Audio Watermarking Algorithm by Modifying the Average Amplitude in Transform Domain
    applied sciences Article A Novel Robust Audio Watermarking Algorithm by Modifying the Average Amplitude in Transform Domain Qiuling Wu 1,2,* and Meng Wu 1 1 College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003, China; [email protected] 2 College of Zijin, Nanjing University of Science and Technology, Nanjing 210046, China * Correspondence: [email protected]; Tel.: +86-139-5103-2810 Received: 18 April 2018; Accepted: 1 May 2018; Published: 4 May 2018 Featured Application: This algorithm embeds a binary image into an audio signal as a marker to prove the ownership of this audio signal. With large payload capacity and strong robustness against common signal processing attacks, it can be used for copyright protection, broadcast monitoring, fingerprinting, data authentication, and medical safety. Abstract: In order to improve the robustness and imperceptibility in practical application, a novel audio watermarking algorithm with strong robustness is proposed by exploring the multi-resolution characteristic of discrete wavelet transform (DWT) and the energy compaction capability of discrete cosine transform (DCT). The human auditory system is insensitive to the minor changes in the frequency components of the audio signal, so the watermarks can be embedded by slightly modifying the frequency components of the audio signal. The audio fragments segmented from the cover audio signal are decomposed by DWT to obtain several groups of wavelet coefficients with different frequency bands, and then the fourth level detail coefficient is selected to be divided into the former packet and the latter packet, which are executed for DCT to get two sets of transform domain coefficients (TDC) respectively.
    [Show full text]
  • A Very Short Introduction to Sound Analysis for Those Who Like Elephant Trumpet Calls Or Other Wildlife Sound
    A very short introduction to sound analysis for those who like elephant trumpet calls or other wildlife sound J´er^ome Sueur Mus´eum national d'Histoire naturelle CNRS UMR 7205 ISYEB, Paris, France July 14, 2021 This document is a very brief introduction to sound analysis principles. It is mainly written for students starting with bioacoustics. The content should be updated regularly. Demonstrations are based on the package seewave. For more details see the book https://www.springer.com/us/book/9783319776453. Contents 1 Digitization 3 1.1 Sampling........................................3 1.2 Quantisation......................................3 1.3 File format.......................................4 2 Amplitude envelope4 3 Discrete Fourier Transform (DFT)6 3.1 Definitions and principle................................6 3.2 Complete sound.....................................7 3.3 Sound section......................................7 3.3.1 Window shape................................. 10 4 Short-time Fourier Transform (STFT) 10 4.1 Principle......................................... 10 4.2 Spectrogram...................................... 11 4.2.1 3D in a 2D plot................................. 11 4.2.2 Overlap..................................... 12 4.2.3 Values...................................... 13 4.3 Mean spectrum..................................... 13 5 Instantaneous frequency 15 5.1 Zero-crossing...................................... 15 5.2 Hilbert transform.................................... 15 5.3 Cepstral transform..................................
    [Show full text]
  • Image Compression Using the Haar Wavelet Transform
    Image compression using the Haar wavelet transform Colm Mulcahy, Ph.D. ABSTRACT The wavelet transform is a relatively new arrival on the mathematical scene. Wegive a brief intro duction to the sub ject byshowing how the Haar wavelet transform allows information to b e enco ded according to \levels of detail." In one sense, this parallels the way in whichwe often pro cess information in our everydaylives. Our investigations enable us to givetwo interesting applications of wavelet metho ds to digital images: compression and progressive transmission. The mathematical prerequesites will b e kept to a minimum; indeed, the main concepts can b e understo o d in terms of addition, subtraction and division bytwo. We also present a linear algebra implementation of the Haar wavelet transform, and mention imp ortantrecent generalizations. This material is suitable for undergraduate researchby b oth mathematics and computer science ma jors, and has b een used successfully in this way at Sp elman College since 1995. 1 So much information In the real world, we are constantly faced with having to decide howmuch information to absorb or impart. Example 1: An election is imp ending, and you and your b est friend haveyet to decide for whom you will cast your votes. Yourfriendiscontent to base her decision solely on the p olitical party aliations of the available candidates, whereas you take the trouble to nd out how the candidates stand on a variety of issues, and to investigate their records for integrity, e ectiveness, and vision. Example 2: Your younger brother is across the ro om, working on a pro ject for Black History Month.
    [Show full text]
  • A Tutorial of the Wavelet Transform
    A Tutorial of the Wavelet Transform Chun-Lin, Liu February 23, 2010 Chapter 1 Overview 1.1 Introduction The Fourier transform is an useful tool to analyze the frequency components of the signal. However, if we take the Fourier transform over the whole time axis, we cannot tell at what instant a particular frequency rises. Short-time Fourier transform (STFT) uses a sliding window to find spectrogram, which gives the information of both time and frequency. But still another problem exists: The length of window limits the resolution in frequency. Wavelet transform seems to be a solution to the problem above. Wavelet transforms are based on small wavelets with limited duration. The translated-version wavelets locate where we concern. Whereas the scaled-version wavelets allow us to analyze the signal in different scale. 1.2 History The first literature that relates to the wavelet transform is Haar wavelet. It was proposed by the mathematician Alfrd Haar in 1909. However, the con- cept of the wavelet did not exist at that time. Until 1981, the concept was proposed by the geophysicist Jean Morlet. Afterward, Morlet and the physi- cist Alex Grossman invented the term wavelet in 1984. Before 1985, Haar wavelet was the only orthogonal wavelet people know. A lot of researchers even thought that there was no orthogonal wavelet except Haar wavelet. For- tunately, the mathematician Yves Meyer constructed the second orthogonal wavelet called Meyer wavelet in 1985. As more and more scholars joined in this field, the 1st international conference was held in France in 1987. 1 In 1988, Stephane Mallat and Meyer proposed the concept of multireso- lution.
    [Show full text]