Introduction to Audio Signal Processing Human-Computer Interaction Angelo Antonio Salatino
[email protected] http://infernusweb.altervista.org License This work is licensed under the Creative Commons Attribution-Noncommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA. Overview • Audio Signal Processing; • Waveform Audio File Format; • FFmpeg; • Audio Processing with Matlab; • Doing phonetics with Praat; • Last but not least: Homework. Audio Signal Processing • Audio signal processing is an engineering field that focuses on the computational methods for intentionally altering auditory signals or sounds, in order to achieve a particular goal. Output Signal Input Signal Audio Signal Processing Data with meaning Audio Processing in HCI Some HCI applications involving audio signal processing are: • Speech Emotion Recognition • Speaker Recognition ▫ Speaker Verification ▫ Speaker Identification • Voice Commands • Speech to Text • Etc. Audio Signals You can find audio signals represented in either digital or analog format. • Digital – the pressure wave-form is a sequence of symbols, usually binary numbers. • Analog – is a smooth wave of energy represented by a continuous stream of data. Analog to Digital Converter (ADC) • Don’t worry, it’s only a fast review!!! Sampling Frequency # bits per sample must be defined must be defined Analog Signal Sample Digital Signal Quantization Encoding Continuous in Time & Hold Discrete in Time Discrete in Time Discrete in Time Continuous in Continuous in Discrete in Discrete in Amplitude Amplitude Amplitude Amplitude • For each measurement a number is assigned according to its amplitude.