<<

History and Practice of Digital Sound Synthesis

Julius Smith CCRMA, Stanford University

AES-2006 Heyser Lecture

October 6, 2006

Julius Smith AES-2006 Heyser Lecture – 1 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary

Overview

Julius Smith AES-2006 Heyser Lecture – 2 / 84 Outline

Overview Digital sound synthesis approaches in approximate historical order: • Outline • CCRMA Perspective • Wavetable (one period) Early Digital Synthesis • Subtractive Spectral Modeling • Additive Physical Modeling • (FM) Summary • Sampling • Spectral Modeling • Physical Modeling Some connections with audio coding will be noted

Emphasis: • Sound examples • Block diagrams • Historical notes

Julius Smith AES-2006 Heyser Lecture – 3 / 84 CCRMA Perspective

Overview • Outline • CCRMA Perspective

Early Digital Synthesis

Spectral Modeling

Physical Modeling

Summary

The Knoll, Stanford University

Julius Smith AES-2006 Heyser Lecture – 4 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary

Early Digital Sound Synthesis

Julius Smith AES-2006 Heyser Lecture – 5 / 84 in Music I-V (1957-1969)

Overview

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • • FM Synthesis • FM Formula • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 6 / 84 Music V Scripting Language (“Note Cards”)

Overview

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis • FM Formula • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary • Essentially Supported in MPEG-4 Structured Audio Orchestra Language (SAOL) (Music V → → SAOL) • “Encoding sounds” as “instruments” is hard, in general

Julius Smith AES-2006 Heyser Lecture – 7 / 84 Kelly-Lochbaum Vocal Tract Model

Overview

Early Digital Synthesis • Music V … y(n) • KL Music e(n) • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis Glottal Pulse • FM Formula Train or Noise 1 + k 1 1 + k M • FM Patch − − Speech e(n) z 1 2 … z 1 2 • FM Spectra Output • FM Examples y(n) k 1 − k 1 R 1 k M − k M R M • FM Voice • Sampling Synthesis (Unused − … − • Modern Example Allpass z 1 2 z 1 2 Output) 1 − k 1 1 − k M Spectral Modeling Kelly-Lochbaum Vocal Tract Model (Piecewise Cylindrical) Physical Modeling

Summary John L. Kelly and Carol Lochbaum (1962)

Julius Smith AES-2006 Heyser Lecture – 8 / 84 Sound Example

Overview “Bicycle Built for Two”: (WAV) (MP3) Early Digital Synthesis • Music V • KL Music • “Daisy” • Vocal part by Kelly and Lochbaum (1961) • Additive Analysis • Musical accompaniment by • Additive Synthesis • FM Synthesis • Computed on an IBM 704 • FM Formula • Based on Russian speech-vowel data from Gunnar Fant’s book • FM Patch • FM Spectra • Probably the first digital physical-modeling synthesis sound • FM Examples • FM Voice example by any method • Sampling Synthesis • Inspired Arthur C. Clarke to adapt it for “2001: A Space Odyssey” • Modern Example — the ’s “first song” Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 9 / 84 Classic Additive-Synthesis Analysis (Heterodyne Comb)

Overview

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis • FM Formula • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary

John Grey 1975 — CCRMA Tech. Reports 1 & 2 (CCRMA “STANM” reports — available online)

Julius Smith AES-2006 Heyser Lecture – 10 / 84 Classic Additive-Synthesis (Sinusoidal Oscillator Envelopes)

Overview

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis • FM Formula • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary

John Grey 1975 — CCRMA Tech. Reports 1 & 2 (CCRMA “STANM” reports — available online)

Julius Smith AES-2006 Heyser Lecture – 11 / 84 Classic Additive Synthesis Diagram

Overview A1(t) f1(t) A2(t) f2(t) A3(t) f3(t) A4(t) f4(t)

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis • FM Formula • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling noise FIR Physical Modeling Σ

Summary

4 t y(t)= Ai(t) sin ωi(t)dt + φi(0) Z0  Xi=1

Julius Smith AES-2006 Heyser Lecture – 12 / 84 Classic Additive-Synthesis Examples

Overview • Bb Clarinet Early Digital Synthesis • Eb Clarinet • Music V • KL Music • Oboe • “Daisy” • • Additive Analysis Bassoon • Additive Synthesis • Tenor Saxophone • FM Synthesis • FM Formula • Trumpet • FM Patch • English Horn • FM Spectra • FM Examples • French Horn • FM Voice • Flute • Sampling Synthesis • Modern Example Spectral Modeling • All of the above Physical Modeling • Independently synthesized set Summary (Synthesized from original John Grey data)

Julius Smith AES-2006 Heyser Lecture – 13 / 84 (FM) Synthesis

Overview FM synthesis is normally used as a spectral modeling technique. Early Digital Synthesis • Music V • Discovered and developed (1970s) by John M. Chowning • KL Music • “Daisy” (CCRMA Founding Director) • Additive Analysis • Key paper: JAES 1973 (vol. 21, no. 7) • Additive Synthesis • FM Synthesis • Commercialized by Yamaha Corporation: • FM Formula • FM Patch ◦ DX-7 (1983) • FM Spectra ◦ OPL chipset (SoundBlaster PC ) • FM Examples • FM Voice ◦ Cell phone ring tones • Sampling Synthesis • Modern Example • On the physical modeling front, synthesis of vibrating-string Spectral Modeling using finite differences started around this time: Physical Modeling Hiller & Ruiz, JAES 1971 (vol. 19, no. 6) Summary

Julius Smith AES-2006 Heyser Lecture – 14 / 84 FM Formula

Overview

Early Digital Synthesis x(t)= Ac sin[ωct + φc + Am sin(ωmt + φm)] • Music V • KL Music where • “Daisy” • Additive Analysis (Ac, ωc, φc) specify the carrier sinusoid • Additive Synthesis • FM Synthesis (Am, ωm, φm) specify the modulator sinusoid • FM Formula • FM Patch Can also be called modulation • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 15 / 84 Simple FM “Brass” Patch (1970–)

Jean-Claude Risset observation (1964–1969): Overview Brass bandwidth ∝ Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis 0 • Additive Synthesis = f • FM Synthesis • FM Formula g • FM Patch A F • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example fc = f0

Spectral Modeling

Physical Modeling A F Summary

Out

Julius Smith AES-2006 Heyser Lecture – 16 / 84 FM (Bessel Function of First Kind)

Overview Harmonic number k, FM index β:

Early Digital Synthesis • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis • FM Synthesis • FM Formula 1 • FM Patch • FM Spectra • FM Examples • FM Voice 0.5

• Sampling Synthesis ) β ( k

• Modern Example J 0 Spectral Modeling 0

Physical Modeling 2 −0.5 Summary 4 0 5 10 6 15 20 8 25 Order k 30 10 Argument β

Julius Smith AES-2006 Heyser Lecture – 17 / 84 Frequency Modulation (FM) Examples

Overview All examples by John Chowning unless otherwise noted: Early Digital Synthesis • Music V • FM brass synthesis • KL Music • “Daisy” ◦ Low Brass example • Additive Analysis • Additive Synthesis ◦ Dexter Morril’s FM Trumpet • FM Synthesis • FM Formula • FM singing voice (1978) • FM Patch Each synthesized using an FM operator pair • FM Spectra • FM Examples (two sinusoidal oscillators) • FM Voice • Sampling Synthesis ◦ Chorus • Modern Example ◦ Voices Spectral Modeling ◦ Basso Profundo Physical Modeling Summary • Other early FM synthesis ◦ Clicks and Drums ◦ Big Bell ◦ String Canon

Julius Smith AES-2006 Heyser Lecture – 18 / 84 FM Voice

Overview FM voice synthesis can be viewed as compressed modeling of Early Digital Synthesis spectral • Music V • KL Music • “Daisy” • Additive Analysis • Additive Synthesis Carrier 2 • FM Synthesis Carrier 1 • FM Formula Carrier 3 • FM Patch • FM Spectra • FM Examples • FM Voice • Sampling Synthesis Magnitude • Modern Example

Spectral Modeling

Physical Modeling

Summary 0 Frequency f Modulation Frequency (all three)

Julius Smith AES-2006 Heyser Lecture – 19 / 84 Sampling Synthesis History

Overview • 1979 - Fairlight Computer Music Instrument - 8-bit Early Digital Synthesis • Music V ◦ First commercial sampler • KL Music ◦ • “Daisy” Eight voices, 8 bits, 64 KB (4 sec) RAM, 16 kHz (mono) • Additive Analysis ◦ Editing, looping, mixing • Additive Synthesis • FM Synthesis ◦ One could draw waveforms and additive-synthesis amplitude • FM Formula envelopes (for each harmonic) with a light pen • FM Patch • FM Spectra ◦ $25,000–$36,000! • FM Examples • FM Voice • 1981 - E-mu Systems Emulator • Sampling Synthesis • Modern Example ◦ First “affordable” sampler ($10,000) Spectral Modeling ◦ Eight voices, 128 K RAM, 8-bit, 80 lb. Physical Modeling

Summary • 1986 - Mirage ◦ Breakthrough price-point ($1695) ◦ Eight voices, 144 K RAM, 8-bit

Julius Smith AES-2006 Heyser Lecture – 20 / 84 Modern Sampled

Overview Example:1 Early Digital Synthesis • Music V • 40 Gigabytes on ten DVDs (three sampled ) • KL Music • “Daisy” • Every key sampled • Additive Analysis • 4–10 “velocity layers” • Additive Synthesis • FM Synthesis • Separate recordings with soft pedal down • FM Formula • • FM Patch Separate “release” recordings, for multiple striking velocities • FM Spectra • FM Examples • FM Voice • Sampling Synthesis • Modern Example

Spectral Modeling

Physical Modeling

Summary

1Synthogy Ivory, $349 (Electronic Musician, October 2006)

Julius Smith AES-2006 Heyser Lecture – 21 / 84 Fundamental Problem with Sampling Synthesis

Overview Piano is determined by Early Digital Synthesis • Music V • key number (1 byte) • KL Music • • “Daisy” key velocity (2 bytes more than enough) • Additive Analysis • pedal state (1 bit [or byte] per pedal) • Additive Synthesis • FM Synthesis Piano control is relatively low-dimensional: • FM Formula • FM Patch • Less than six bytes of information per note played • FM Spectra • FM Examples • No continuous controls (typically) • FM Voice • Sampling Synthesis • Ratio of total sampled data to one note of control data • Modern Example ≈ one billion Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 22 / 84 Now consider bowed strings

Overview Control parameters: Early Digital Synthesis • Music V • Left-hand finger position(s) • KL Music • • “Daisy” Left-hand vibrato • Additive Analysis • Bow velocity • Additive Synthesis • FM Synthesis • Bow force • FM Formula • Bow position • FM Patch • FM Spectra • Bow angle • FM Examples • Shoulder damping • FM Voice • Sampling Synthesis • Instrument orientation • Modern Example • Player motion (within a room) Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 23 / 84 Difficulty of sampling bowed strings

Overview • Bowed-string control is infinite-dimensional in principle Early Digital Synthesis • Many time-varying functions — “gestures” • Music V • KL Music (we counted more than 10) • “Daisy” • • Additive Analysis Complete sampling of bowed strings on the level of pianos has • Additive Synthesis apparently never been done • FM Synthesis • FM Formula • Rule-driven navigation of the most useful recorded playing • FM Patch regimes has worked well (e.g., Synful Orchestra) • FM Spectra • FM Examples • Model-based approaches greatly reduce data requirements: • FM Voice • Sampling Synthesis ◦ Spectral models (inspired by sound perception) • Modern Example ◦ Physical models (model the sound source) Spectral Modeling

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 24 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary

Spectral Modeling Synthesis (Historical Summary)

Julius Smith AES-2006 Heyser Lecture – 25 / 84 Classic Analysis & Resynthesis (Dudley 1939)

magnitude, or Overview magnitude and phase extraction Early Digital Synthesis

Spectral Modeling • A Vocoder x0(t) xˆ0(t) • Vocoder Examples • Sinusoidal Modeling f • Spectral Trajectories Data Compression, • + Noise A Transmission, xˆ(t) • S+N Examples x(t) x1(t) Storage, xˆ1(t) Manipulation, • S+N FX f Noise reduction, ... • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • S+N+T Windows A N−1 • HF Noise Modeling x xˆN−1 • HF Noise Band f • S+N+T Examples • SM Summary • Spec Future

Physical Modeling Analysis Processing Synthesis

Summary

Julius Smith AES-2006 Heyser Lecture – 26 / 84 Channel Model

Overview Analysis Model Synthesis Model Early Digital Synthesis a a (t) ω +∆ω (t) Spectral Modeling Channel Filter k k k k • Vocoder Response • Vocoder Examples A F ∆ω Osc • Sinusoidal Modeling k • Spectral Trajectories • 0 ω ω Sines + Noise k • S+N Examples Out • S+N FX • S+N XSynth • Early “channel vocoder” implementations (hardware) only • Sines + Transients • S + N + Transients measured amplitude ak(t) (Dudley 1939) • S+N+T TSM • The “phase vocoder” (Flanagan and Golden 1966) added phase • S+N+T Freq Map • S+N+T Windows tracking in each channel • HF Noise Modeling • Portnoff (1976) developed the FFT phase vocoder, • HF Noise Band • S+N+T Examples which replaced the heterodyne comb in computer-music • SM Summary additive-synthesis analysis (James A. Moorer) • Spec Future

Physical Modeling • Inverse FFT synthesis (Rodet and Depalle 1992) gave faster

Summary sinusoidal oscillator banks

Julius Smith AES-2006 Heyser Lecture – 27 / 84 Amplitude and Frequency Envelopes

a (t) Overview k

Early Digital Synthesis

Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories • Sines + Noise • S+N Examples • S+N FX t • S+N XSynth ∆ωk(t)= φ˙k(t) • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • S+N+T Windows 0 • HF Noise Modeling • HF Noise Band • S+N+T Examples • SM Summary • Spec Future t Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 28 / 84 Channel Vocoder Sound Examples

Overview • Original Early Digital Synthesis Spectral Modeling • • Vocoder 10 channels, sine carriers • Vocoder Examples • 10 channels, narrowband-noise carriers • Sinusoidal Modeling • Spectral Trajectories • Sines + Noise • 26 channels, sine carriers • S+N Examples • S+N FX • 26 channels, narrowband-noise carriers • S+N XSynth • 26 channels, narrowband-noise carriers, channels reversed • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • Phase Vocoder: Identity system in absence of modifications • S+N+T Windows • HF Noise Modeling • HF Noise Band • The FFT Phase Vocoder next transitioned to the Short-Time • S+N+T Examples Fourier Transform (STFT) (Allen and Rabiner 1977) • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 29 / 84 Tracking Spectral Peaks in the Short-Time Fourier Transform

Overview Phases Early Digital Synthesis atan

Spectral Modeling • Vocoder s(t) • FFT Vocoder Examples • Sinusoidal Modeling Quadratic • Peak Spectral Trajectories Peak dB mag tracking • Sines + Noise Interpolation • S+N Examples window w(n) Amplitudes • S+N FX • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM • STFT peak tracking at CCRMA: mid-1980s (PARSHL program) • S+N+T Freq Map • Motivated by vocoder analysis of piano tones • S+N+T Windows • HF Noise Modeling • Influences: STFT (Allen and Rabiner 1977), • HF Noise Band ADEC (1977), MAPLE (1979) • S+N+T Examples • SM Summary • Independently developed for speech coding by McAulay and • Spec Future Quatieri at Lincoln Labs Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 30 / 84 Example Spectral Trajectories

Overview f Early Digital Synthesis

Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories • Sines + Noise • S+N Examples • S+N FX • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • S+N+T Windows • HF Noise Modeling • HF Noise Band t • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 31 / 84 Sines + Noise Synthesis (1989) PSfrag 1 1 2 2 3 3 4 4 Overview A (t) f (t) A (t) f (t) A (t) f (t) A (t) f (t)

Early Digital Synthesis

Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories • Sines + Noise • S+N Examples • S+N FX • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • S+N+T Windows white noise • HF Noise Modeling u(t) filter(t) • HF Noise Band h (τ) • S+N+T Examples t • SM Summary P • Spec Future 4 Physical Modeling t y(t)= Ai(t) cos 0 ωi(t)dt + φi(0) +(ht ∗ u)(t) =1 Summary iP hR i

Julius Smith AES-2006 Heyser Lecture – 32 / 84 Sines + Noise Sound Examples

Overview Xavier Serra 1989 thesis demos (Sines + Noise signal modeling) Early Digital Synthesis • Spectral Modeling Piano • Vocoder • Vocoder Examples ◦ Original • Sinusoidal Modeling ◦ Sinusoids alone • Spectral Trajectories • Sines + Noise ◦ Residual after sinusoids removed • S+N Examples ◦ Sines + noise model • S+N FX • S+N XSynth • Sines + Transients • Voice • S + N + Transients • S+N+T TSM ◦ Original • S+N+T Freq Map ◦ Sinusoids • S+N+T Windows • HF Noise Modeling ◦ Residual • HF Noise Band ◦ Synthesis • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 33 / 84 Musical Effects with Sines+Noise Models (Serra 1989)

Overview • Piano Effects Early Digital Synthesis ◦ Spectral Modeling Pitch downshift one octave • Vocoder ◦ Pitch flattened • Vocoder Examples • Sinusoidal Modeling ◦ Varying partial stretching • Spectral Trajectories • Sines + Noise • Voice Effects • S+N Examples • S+N FX ◦ Frequency-scale by 0.6 • S+N XSynth • Sines + Transients ◦ Frequency-scale by 0.4 and stretch partials • S + N + Transients ◦ Variable time-scaling, deterministic to stochastic • S+N+T TSM • S+N+T Freq Map • S+N+T Windows • HF Noise Modeling • HF Noise Band • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 34 / 84 Cross-Synthesis with Sines+Noise Models (Serra 1989)

Overview • Voice “modulator” Early Digital Synthesis Spectral Modeling • • Vocoder Creaking ship’s mast “carrier” • Vocoder Examples • Voice-modulated creaking mast • Sinusoidal Modeling • Spectral Trajectories • Same with modified spectral envelopes • Sines + Noise • S+N Examples • S+N FX • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM • S+N+T Freq Map • S+N+T Windows • HF Noise Modeling • HF Noise Band • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 35 / 84 Sines + Transients Sound Examples (Serra 1989)

Overview In this technique, the sinusoidal sum is phase-matched at the Early Digital Synthesis cross-over point only (with no cross-fade). Spectral Modeling • Vocoder • • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories ◦ Original • Sines + Noise ◦ Sinusoidal model • S+N Examples • S+N FX ◦ Original attack, followed by sinusoidal model • S+N XSynth • Sines + Transients • Piano • S + N + Transients • S+N+T TSM ◦ Original • S+N+T Freq Map • S+N+T Windows ◦ Sinusoidal model • HF Noise Modeling ◦ Original attack, followed by sinusoidal model • HF Noise Band • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 36 / 84 Multiresolution Sines + Noise + Transients (Levine 1998)

Overview Why Model Transients Separately? Early Digital Synthesis • Spectral Modeling Sinusoids efficiently model spectral peaks over time • Vocoder • Filtered noise efficiently models spectral residual vs. t • Vocoder Examples • Sinusoidal Modeling • Neither is good for abrupt transients in the • Spectral Trajectories • Phase-matched oscillators are expensive • Sines + Noise • S+N Examples • More efficient to switch to a transient model during transients • S+N FX • Need sinusoidal phase matching at the switching times • S+N XSynth • Sines + Transients • S + N + Transients Transient models: • S+N+T TSM • S+N+T Freq Map • Original waveform slice (1988) • S+N+T Windows • Wavelet expansion (Ali 1996) • HF Noise Modeling • HF Noise Band • MPEG-2 AAC (with short window) (Levine 1998) • S+N+T Examples • Frequency-domain LPC • SM Summary • Spec Future (time-domain amplitude envelope) (Verma 2000) Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 37 / 84 Time Scale Modification of Sines + Noise + Transients Models

Overview transients transients sines+ sines+ sines+ Early Digital Synthesis originalsignal noise noise noise Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling •

Spectral Trajectories transients transients • Sines + Noise time-scaled sines+ sines+ sines+ • S+N Examples signal noise noise noise • S+N FX • S+N XSynth • Sines + Transients • S + N + Transients time • S+N+T TSM • S+N+T Freq Map • S+N+T Windows Time-Scale Modification (TSM) becomes well defined: • HF Noise Modeling • HF Noise Band • Transients are translated in time • S+N+T Examples • SM Summary • Sinusoidal envelopes are scaled in time • Spec Future • Noise-filter envelopes also scaled in time Physical Modeling • Dual of TSM is frequency scaling Summary

Julius Smith AES-2006 Heyser Lecture – 38 / 84 Sines + Noise + Transients Time-Frequency Map

16 Overview 14 Early Digital Synthesis 12 Spectral Modeling • Vocoder 10 • Vocoder Examples • Sinusoidal Modeling 8 • Spectral Trajectories 6

• Sines + Noise frequency [kHz] • S+N Examples 4 • S+N FX 2 • S+N XSynth • Sines + Transients 0 0 50 100 150 200 250 • S + N + Transients 1 • S+N+T TSM • S+N+T Freq Map 0.5 • S+N+T Windows 0 • HF Noise Modeling

• HF Noise Band amplitude −0.5 • S+N+T Examples −1 • SM Summary 0 50 100 150 200 250 • Spec Future time [milliseconds] Physical Modeling (Levine 1998) Summary

Julius Smith AES-2006 Heyser Lecture – 39 / 84 Corresponding Analysis Windows

Overview

Early Digital Synthesis transient Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling

• Spectral Trajectories high octave • Sines + Noise • S+N Examples • S+N FX •

S+N XSynth middle octave • Sines + Transients • S + N + Transients • S+N+T TSM

• S+N+T Freq Map low octave • S+N+T Windows • HF Noise Modeling • HF Noise Band • S+N+T Examples • SM Summary

• Spec Future amplitude

Physical Modeling 0 50 100 150 200 250 Summary time [milliseconds]

Julius Smith AES-2006 Heyser Lecture – 40 / 84 Quasi-Constant-Q (Wavelet) Time-Frequency Map

16 Overview 14 Early Digital Synthesis

Spectral Modeling 12 • Vocoder 10 • Vocoder Examples • Sinusoidal Modeling 8 • Spectral Trajectories 6 •

Sines + Noise frequency [kHz] • S+N Examples 4 • S+N FX 2 • S+N XSynth • Sines + Transients 0 0 4 50 100 150 200 250 • S + N + Transients x 10 • S+N+T TSM 2 • S+N+T Freq Map 1 • S+N+T Windows 0 • HF Noise Modeling

• HF Noise Band amplitude −1 • S+N+T Examples −2 • SM Summary 0 50 100 150 200 250 • Spec Future time [milliseconds] Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 41 / 84 Bark-Band Noise Modeling at High Frequencies (Levine 1998)

Overview 85

Early Digital Synthesis 80 Spectral Modeling • Vocoder 75 • Vocoder Examples • Sinusoidal Modeling 70 • Spectral Trajectories • Sines + Noise 65 • S+N Examples • S+N FX 60 • S+N XSynth • Sines + Transients 55 • S + N + Transients • magnitude [dB] S+N+T TSM 50 • S+N+T Freq Map • S+N+T Windows 45 • HF Noise Modeling • HF Noise Band 40 • S+N+T Examples • SM Summary 35 • Spec Future 30 Physical Modeling 0 500 1000 1500 2000 2500 3000 3500 4000 4500 frequency [Hz] Summary

Julius Smith AES-2006 Heyser Lecture – 42 / 84 Amplitude Envelope for One Noise Band

Overview 80

Early Digital Synthesis 70 Spectral Modeling • Vocoder • Vocoder Examples 60 • Sinusoidal Modeling • Spectral Trajectories 50 Original Mag. [dB] • Sines + Noise • S+N Examples 40 • S+N FX 0 50 100 150 200 250 300 350 • S+N XSynth • Sines + Transients • S + N + Transients 80 • S+N+T TSM 75 • S+N+T Freq Map 70 • S+N+T Windows 65 • HF Noise Modeling • HF Noise Band 60 • S+N+T Examples 55 LSA Mag. [dB] • SM Summary 50 • Spec Future 45 Physical Modeling 0 50 100 150 200 250 300 350 time [milliseconds] Summary

For more information, see Scott Levine’s thesis.2 Julius Smith 2http://ccrma.stanford.edu/~scottl/thesis.htmlAES-2006 Heyser Lecture – 43 / 84 Sines + Noise + Transients Sound Examples

Overview Scott Levine Thesis Demos (Sines + Noise + Transients at 32 kbps) Early Digital Synthesis (http://ccrma.stanford.edu/~scottl/thesis.html) Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories • Sines + Noise Mozart’s Le Nozze di Figaro • S+N Examples • S+N FX • Original • S+N XSynth • Compressed using MPEG-AAC at 32 kbps • Sines + Transients • S + N + Transients • Compressed using sines+transients+noise at 32 kbps • S+N+T TSM • S+N+T Freq Map • S+N+T Windows • Multiresolution sinusoids alone • HF Noise Modeling • HF Noise Band • Residual Bark-band noise • S+N+T Examples • Transform-coded transients (AAC) • SM Summary • Spec Future • Bark-band noise above 5 kHz

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 44 / 84 Rock Example

Overview Scott Levine Thesis Demos (Sines + Noise + Transients at 32 kbps) Early Digital Synthesis (http://ccrma.stanford.edu/~scottl/thesis.html) Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories “It Takes Two” by Rob Base & DJ E-Z Rock • Sines + Noise • S+N Examples • S+N FX • Original • S+N XSynth • MPEG-AAC at 32 kbps • Sines + Transients • S + N + Transients • Sines+transients+noise at 32 kbps • S+N+T TSM • S+N+T Freq Map • S+N+T Windows • Multiresolution sinusoids • HF Noise Modeling • • HF Noise Band Residual Bark-band noise • S+N+T Examples • Transform-coded transients (AAC) • SM Summary • Spec Future • Bark-band noise above 5 kHz

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 45 / 84 Time Scale Modification using Sines + Noise + Transients

Overview Scott Levine Thesis Demos (Sines + Noise + Transients at 32 kbps) Early Digital Synthesis (http://ccrma.stanford.edu/~scottl/thesis.html) Spectral Modeling • Vocoder • Vocoder Examples • Sinusoidal Modeling • Spectral Trajectories Time-Scale Modification (pitch unchanged) • Sines + Noise • S+N Examples • S+N FX • S+N+T time-scale factors [2.0, 1.6, 1.2, 1.0, 0.8, 0.6, 0.5] • S+N XSynth • Sines + Transients • S + N + Transients • S+N+T TSM S+N+T Pitch Shifting (timing unchanged) • S+N+T Freq Map • S+N+T Windows • Pitch-scale factors [0.89, 0.94, 1.00, 1.06, 1.12] • HF Noise Modeling • HF Noise Band • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 46 / 84 Spectral Modeling History Highlights

• Fourier’s theory (1822) Perceptual audio coding: • Teleharmonium (1906) • Princen-Bradley filterbank (1986) • (1930s) • K. Brandenburg thesis (1989) • Channel Vocoder (1939) • Auditory masking usage • Phase Vocoder (1966) • Dolby AC2 • “Additive Synthesis” (1969) • Musicam • FFT Phase Vocoder (1976) • ASPEC • Sinusoidal Modeling • MPEG-I,II,IV (1977,1979,1985) (incl. S+N+T “parametric sounds”) • Sines+Noise (1989) • Sines+Transients (1989) • Sines+Noise+Transients (1998)

Julius Smith AES-2006 Heyser Lecture – 47 / 84 Future Prospects

Overview Observations: Early Digital Synthesis • Spectral Modeling Sinusoidal modeling of sound is “Unreasonably Effective” • Vocoder • Basic “auditory masking” discards ≈ 90% information • Vocoder Examples • Sinusoidal Modeling • Interesting neuroscience observation: • Spectral Trajectories • Sines + Noise “... most neurons in the primary auditory cortex A1 • S+N Examples are silent most of the time ...” • S+N FX • S+N XSynth (from “Sparse Time-Frequency Representations”, Gardner and • Sines + Transients • S + N + Transients Magnesco, PNAS:103(16), April 2006) • S+N+T TSM • S+N+T Freq Map • What is a true and correct “psychospectral model” for sound? • S+N+T Windows • HF Noise Modeling ◦ The cochlea of the ear is a real-time spectrum analyzer • HF Noise Band ◦ How is the “ear’s ” represented at higher levels? • S+N+T Examples • SM Summary • Spec Future

Physical Modeling

Summary

Julius Smith AES-2006 Heyser Lecture – 48 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary

Physical Modeling Synthesis (Historical Summary)

Julius Smith AES-2006 Heyser Lecture – 49 / 84 Kelly-Lochbaum Vocal Tract Model

Overview

Early Digital Synthesis

Spectral Modeling e(n) … y(n) Physical Modeling • KL Music • Digital Waveguide • Signal Scattering • Plucked String Glottal Pulse + + • Train or Noise 1 k 1 1 k M Struck String − − Speech e(n) z 1 2 … z 1 2 • Karplus Strong Output • EKS Algorithm y(n) k − k R k − k R • Clarinet 1 1 1 M M M • Wind Examples (Unused − − • Bowed Strings Allpass z 1 2 … z 1 2 • Distortion Guitar Output) 1 − k 1 1 − k M • Acoustic Strings Kelly-Lochbaum Vocal Tract Model (Piecewise Cylindrical) • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis John L. Kelly and Carol Lochbaum (1962) • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 50 / 84 Digital Waveguide Models (1985)

∆ Overview Lossless digital waveguide = bidirectional delay line Early Digital Synthesis at some wave impedance R Spectral Modeling

Physical Modeling • KL Music • Digital Waveguide − N • Signal Scattering z • Plucked String • Struck String • Karplus Strong R • EKS Algorithm • Clarinet − N • Wind Examples z • Bowed Strings • Distortion Guitar • Acoustic Strings Useful for efficient models of • Sound Examples • Linearized Violin • strings • Commuted Piano • Pulse Synthesis • bores • Complete Piano • plane waves • Sound Examples • Phy Audio Coding • conical waves • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 51 / 84 Signal Scattering

Overview Signal scattering is caused by a change in wave impedance R: Early Digital Synthesis

Spectral Modeling R2 − R1 k 1 = Physical Modeling R2 + R1 • KL Music • Digital Waveguide + • Signal Scattering 1 k 1 • − − Plucked String z N z N • Struck String • Karplus Strong • EKS Algorithm k 1 − k 1 • Clarinet R 1 R 2 • Wind Examples • Bowed Strings − N − N • Distortion Guitar z z • Acoustic Strings 1 − k 1 • Sound Examples • Linearized Violin • Commuted Piano If the wave impedance changes every spatial sample, the • Pulse Synthesis • Complete Piano Kelly-Lochbaum vocal-tract model results. • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 52 / 84 Ideal Plucked String (Displacement Waves)

Overview

Early Digital Synthesis

Spectral Modeling + + y (n) y (n-N/2) Physical Modeling • KL Music • Digital Waveguide ªBridgeº -1 (x = Pluck Position) -1 ªNutº • Signal Scattering • Plucked String • Struck String y- (n) y- (n+N/2) • Karplus Strong • EKS Algorithm • Clarinet (x = 0) (x = L) • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Load each delay line with half of initial string displacement • Linearized Violin • Sum of upper and lower delay lines = string displacement • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 53 / 84 Ideal Struck String (Velocity Waves)

Overview

Early Digital Synthesis c Spectral Modeling + + v (n) v (n-N/2) Physical Modeling • KL Music • Digital Waveguide ªBridgeº -1 (x = Hammer Position) -1 ªNutº • Signal Scattering • Plucked String c • Struck String v- (n) v- (n+N/2) • Karplus Strong • EKS Algorithm (x = 0) (x = L) • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings Hammer strike = momentum transfer = velocity step: • Sound Examples • Linearized Violin • Commuted Piano mhvh(0−)=(mh + ms)vs(0+) • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 54 / 84 Karplus-Strong (KS) Algorithm (1983)

Overview Early Digital Synthesis + + Output y (n) y (n-N) Spectral Modeling N samples delay

Physical Modeling • KL Music 1/2 • Digital Waveguide • Signal Scattering • 1/2 Plucked String z -1 • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Discovered (1978) as “self-modifying wavetable synthesis” • Distortion Guitar • • Acoustic Strings Wavetable is preferably initialized with random numbers • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 55 / 84 Karplus-Strong Sound Examples

Overview • “Vintage” 8-bit sound examples: Early Digital Synthesis • Original Plucked String: (WAV) (MP3) Spectral Modeling

Physical Modeling • Drum: (WAV) (MP3) • KL Music • Stretched Drum: (WAV) (MP3) • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 56 / 84 EKS Algorithm (Jaffe-Smith 1983)

Overview −N Hp(z) Hβ (z) z HL(z) Early Digital Synthesis

Spectral Modeling

Physical Modeling Hρ(z) Hs(z) Hd(z) • KL Music • Digital Waveguide • Signal Scattering N = pitch period (2× string length) in samples • Plucked String • Struck String 1 − p • Karplus Strong Hp(z) = −1 = pick-direction lowpass filter • EKS Algorithm 1 − p z • Clarinet −βN • Wind Examples Hβ(z) = 1 − z = pick-position comb filter, β ∈ (0, 1) • Bowed Strings • Distortion Guitar Hd(z) = string-damping filter (one/two poles/zeros typical) • Acoustic Strings • Sound Examples Hs(z) = string-stiffness allpass filter (several poles and zeros) • Linearized Violin −1 • ρ(N) − z Commuted Piano H (z) = = first-order string-tuning allpass filter • Pulse Synthesis ρ 1 − ρ(N) z−1 • Complete Piano • Sound Examples 1 − RL • Phy Audio Coding HL(z) = −1 = dynamic-level lowpass filter • Phy Audio Coding? 1 − RL z Summary Julius Smith AES-2006 Heyser Lecture – 57 / 84 STK EKS Sound Examples

Overview • Synthesis Tool Kit (STK) by Perry Cook, Gary Scavone, and Early Digital Synthesis others — distributed by CCRMA: Spectral Modeling Google search: STK ToolKit Physical Modeling • KL Music • Digital Waveguide STK Plucked String: (WAV) (MP3) • Signal Scattering • Plucked String • Plucked String 1: (WAV) (MP3) • Struck String • Karplus Strong • Plucked String 2: (WAV) (MP3) • EKS Algorithm • Plucked String 3: (WAV) (MP3) • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 58 / 84 EKS Sound Example (1988)

Overview Bach A-Minor Concerto—Orchestra Part: (WAV) (MP3) Early Digital Synthesis

Spectral Modeling

Physical Modeling • Executed in real time on one Motorola DSP56001 • KL Music (20 MHz clock, 128K SRAM) • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Developed for the NeXT Computer introduction at Davies • Karplus Strong Symphony Hall, San Francisco, 1988 • EKS Algorithm • Clarinet • Wind Examples • Solo violin part was played live by Dan Kobialka of the San • Bowed Strings • Distortion Guitar Francisco Symphony • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 59 / 84 Digital Waveguide Single Reed, Cylindrical Bore Model (1986)

Mouth Overview Pressure − ( ) p b n Early Digital Synthesis p ( n) h m Output m Reed to Bell Delay 2 Filter Spectral Modeling Reed Table - ρ Physical Modeling à Reflection • KL Music * Filter • Digital Waveguide + h ∆ • Signal Scattering - • Plucked String Bell to Reed Delay - + • Struck String Embouchure ( ) p b n • Karplus Strong Offset • EKS Algorithm Reed Bore Bell • Clarinet • Wind Examples Digital waveguide clarinet • Bowed Strings • Distortion Guitar • Acoustic Strings • Control variable = mouth half-pressure • Sound Examples • Linearized Violin • Total reed cost = two subtractions, one multiply, and one table • Commuted Piano lookup per sample • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 60 / 84 Digital Waveguide Wind Instrument Sound Examples

Overview • STK Clarinet: (WAV) (MP3) Early Digital Synthesis Google search: STK clarinet Spectral Modeling

Physical Modeling ◦ Synthesis Tool Kit (STK) by Perry Cook, Gary Scavone, and • KL Music others — distributed by CCRMA: • Digital Waveguide • Signal Scattering Google search: STK ToolKit • Plucked String • Struck String • Staccato Systems Slide Flute • Karplus Strong • EKS Algorithm (based on STK flute, ca. 1995): (WAV) (MP3) • Clarinet • Yamaha VL1 “Virtual Lead” synthesizer demos (1994): • Wind Examples • Bowed Strings • Shakuhachi: (WAV) (MP3) • Distortion Guitar • Acoustic Strings • Oboe and Bassoon: (WAV) (MP3) • Sound Examples • Tenor Saxophone: (WAV) (MP3) • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 61 / 84 Digital Waveguide Bowed Strings (1986)

Bow Force Overview Bow Velocity + − v , ls v s , r Early Digital Synthesis Body Nut to Bow Delay Bow to Bridge Delay Filter Spectral Modeling - + ρ v b v ∆ Ã Reflection Physical Modeling -1 Filter • - * KL Music Bow Table • Digital Waveguide + v s , r • Signal Scattering Bow to Nut Delay Bridge to Bow Delay − • Plucked String v , ls • Struck String Bridge- Nut String Bow String Air • Karplus Strong Body • EKS Algorithm • Clarinet • Wind Examples • Reflection filter summarizes all losses per period • Bowed Strings (due to bridge, bow, finger, etc.) • Distortion Guitar • Acoustic Strings • Bow-string junction = memoryless lookup table • Sound Examples (or segmented polynomial) • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 62 / 84 “Electric Cello” Sound Examples (Peder Larson)

Overview • Staccato Notes: (WAV) (MP3) Early Digital Synthesis (short strokes of high bow pressure, as from a bouncing bow) Spectral Modeling • Bach’s First Suite for Unaccompanied Cello: (WAV) (MP3) Physical Modeling • KL Music • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 63 / 84 Soft Clipper

Overview 2 − 3 , x ≤−1 Early Digital Synthesis  x3 Spectral Modeling f(x)= x − , −1 ≤ x ≤ 1  3 Physical Modeling  2 • KL Music 3 , x ≥ 1 • Digital Waveguide  • Signal Scattering  • Plucked String x=−1:0.01:1; plot([−(2/3)*ones(1,100), x−x. 3 /3, (2/3)*ones(1,100)]) • Struck String 0.8 • Karplus Strong 0.6 • EKS Algorithm • Clarinet 0.4 • Wind Examples • Bowed Strings 0.2 • Distortion Guitar 0 • Acoustic Strings f(x(n)) • Sound Examples −0.2 • Linearized Violin −0.4 • Commuted Piano • Pulse Synthesis −0.6 • Complete Piano • −0.8 Sound Examples −2 −1.5 −1 −0.5 0 0.5 1 1.5 2 • Phy Audio Coding x(n) • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 64 / 84 Amplifier Distortion + Amplifier Feedback

Overview Sullivan 1990 Early Digital Synthesis

Spectral Modeling

Physical Modeling • KL Music Pre-distortion output level • Digital Waveguide String 1 • Signal Scattering Nonlinear Distortion • Plucked String • . Struck String . Output Signal • Karplus Strong • EKS Algorithm Pre-distortion gain Distortion output level • Clarinet String N Amplifier • Wind Examples Feedback • Bowed Strings Gain • Distortion Guitar • Acoustic Strings Amplifier Feedback Delay • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano Distortion output signal often further filtered by an amplifier cabinet • Sound Examples • Phy Audio Coding filter, representing speaker cabinet, driver responses, etc. • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 65 / 84 Distortion Guitar Sound Examples

Overview (Stanford Sondius Project, ca. 1995) Early Digital Synthesis • Spectral Modeling Distortion Guitar: (WAV) (MP3)

Physical Modeling • Amplifier Feedback 1: (WAV) (MP3) • KL Music • Amplifier Feedback 2: (WAV) (MP3) • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 66 / 84 Commuted Synthesis of Acoustic Strings (1993)

Overview e(t) s(t) y(t) Early Digital Synthesis Trigger Excitation String Resonator Output

Spectral Modeling

Physical Modeling • KL Music Schematic diagram of a stringed musical instrument. • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong Trigger Excitation Resonator String Output • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings Equivalent diagram in the linear, time-invariant case. • Distortion Guitar • Acoustic Strings • Sound Examples a(t) x(t) Aggregate • Trigger String Output Linearized Violin Excitation • Commuted Piano • Pulse Synthesis • Complete Piano Use of an aggregate excitation given by the convolution of original • Sound Examples excitation with the resonator impulse response. • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 67 / 84 Commuted Components

Overview

Early Digital Synthesis a(t) x(t) Aggregate Trigger String Output Spectral Modeling Excitation Physical Modeling • KL Music “Plucked Resonator” driving a String. • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet s(t) Bridge Guitar Air Room y(t) • Wind Examples Coupling Body Absorption Response Output • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples Possible components of a guitar resonator. • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 68 / 84 Sound Examples

Overview Electric Guitar (Pick-Ups and/or Body-Model Added) (Stanford Early Digital Synthesis Sondius Project → Staccato Systems, Inc. → ADI, ca. 1995) Spectral Modeling Physical Modeling • Example 1: (WAV) (MP3) • KL Music • Digital Waveguide • Example 2: (WAV) (MP3) • Signal Scattering • Example 3: (WAV) (MP3) • Plucked String • Struck String • Virtual “wah-wah pedal”: (WAV) (MP3) • Karplus Strong • EKS Algorithm • Clarinet STK Mandolin • Wind Examples • Bowed Strings • STK Mandolin 1: (WAV) (MP3) • Distortion Guitar • Acoustic Strings • STK Mandolin 2: (WAV) (MP3) • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 69 / 84 Sound Examples

Overview More Recent Acoustic Guitar Early Digital Synthesis • Spectral Modeling Bach Prelude in E Major: (WAV) (MP3)

Physical Modeling • soundexamplewavBach silenceLoure in E Major: (WAV) (MP3) • KL Music • Digital Waveguide • Signal Scattering Virtual performance by Dr. Mikael Laurson, Sibelius Institute • Plucked String • Struck String 3 • Karplus Strong Virtual guitar by Helsinki Univ. of Tech., Lab • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding 3 • Phy Audio Coding? http://www.acoustics.hut.fi/

Summary Julius Smith AES-2006 Heyser Lecture – 70 / 84 Commuted Synthesis of Linearized Violin

a) Output Overview e(n) s(n) Amplitude(n) Impulse x(n) String Resonator Early Digital Synthesis Frequency(n) Train

Spectral Modeling b) Output Physical Modeling e(n) Amplitude(n) Impulse a(n) x(n) • KL Music Resonator String Frequency(n) Train • Digital Waveguide • Signal Scattering • Plucked String c) Output a(n) • Struck String Amplitude(n) Impulse-Response x(n) String • Karplus Strong Frequency(n) Train • EKS Algorithm • Clarinet • Wind Examples • Assumes ideal Helmholtz motion of string • Bowed Strings • Distortion Guitar • Sound Examples (Stanford Sondius project, ca. 1995): • Acoustic Strings ◦ Bass: (WAV) (MP3) ◦ Violin 1: (WAV) (MP3) • Sound Examples • Linearized Violin ◦ Cello: (WAV) (MP3) ◦ Violin 2: (WAV) (MP3) • Commuted Piano ◦ Viola 1: (WAV) (MP3) ◦ Ensemble: (WAV) (MP3) • Pulse Synthesis • Complete Piano ◦ Viola 2: (WAV) (MP3) • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 71 / 84 Commuted Piano Synthesis (1995)

Overview Hammer-string interaction pulses (force): Early Digital Synthesis

Spectral Modeling Force 0.5 Physical Modeling 0.4 • KL Music 0.3 • Digital Waveguide • Signal Scattering 0.2 • Plucked String 0.1 • Struck String 5 10 15 20 Time • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 72 / 84 Synthesis of Hammer-String Interaction Pulse

Overview Impulse Impulse Response Early Digital Synthesis

Spectral Modeling Lowpass Physical Modeling Filter • KL Music • Digital Waveguide Time Time • Signal Scattering • Plucked String • Struck String • Karplus Strong • Faster collisions correspond to narrower pulses • EKS Algorithm • Clarinet (nonlinear filter) • Wind Examples • Bowed Strings • Distortion Guitar • For a given velocity, filter is linear time-invariant • Acoustic Strings • Sound Examples • Linearized Violin • Piano is “linearized” for each hammer velocity • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 73 / 84 Multiple Hammer-String Interaction Pulses

Overview Superimpose several individual pulses: Early Digital Synthesis

Spectral Modeling

Physical Modeling Force • KL Music • Impulse 1 Digital Waveguide δ LPF1 • Signal Scattering 1 String Impulse 2 • Plucked String δ LPF2 + Input • Struck String 2 Impulse 3 LPF3 • Karplus Strong δ 3 • EKS Algorithm 0 Time • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 74 / 84 Multiple Hammer-String Interaction Pulses

Overview Superimpose several individual pulses: Early Digital Synthesis

Spectral Modeling

Physical Modeling Force • KL Music • Impulse 1 Digital Waveguide δ LPF1 • Signal Scattering 1 String Impulse 2 • Plucked String δ LPF2 + Input • Struck String 2 Impulse 3 LPF3 • Karplus Strong δ 3 • EKS Algorithm 0 Time • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar As impulse amplitude grows (faster hammer strike), output pulses • Acoustic Strings • Sound Examples become taller and thinner, showing less overlap. • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 74 / 84 Complete Piano Model

Natural Ordering: Overview

Early Digital Synthesis v c Spectral Modeling LPF1 Impulse δ Tapped δ Physical Modeling 1 1 Sound Board • Gener- Delay LPF2 String KL Music Trigger δ + & Enclosure • ator Line 2 Output Digital Waveguide LPF3 • δ Signal Scattering 3 • Plucked String • Struck String • Karplus Strong Commuted Ordering: • EKS Algorithm • Clarinet v c • Wind Examples • Bowed Strings LPF1 Sound Board Tapped • Distortion Guitar & Enclosure Delay LPF2 String Output • Acoustic Strings + Trigger Impulse Response Line • Sound Examples LPF3 • Linearized Violin • Commuted Piano • Pulse Synthesis • Soundboard and enclosure are commuted • Complete Piano • Sound Examples • Only need a stored recording of their impulse response • Phy Audio Coding • An enormous digital filter is otherwise required • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 75 / 84 Piano and Harpsichord Sound Examples

Overview (Stanford Sondius Project, ca. 1995) Early Digital Synthesis

Spectral Modeling • Piano: (WAV) (MP3) Physical Modeling • Harpsichord 1: (WAV) (MP3) • KL Music • Digital Waveguide • Harpsichord 2: (WAV) (MP3) • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 76 / 84 More Recent Harpsichord Example

Overview • Harpsichord Soundboard Hammer-Response: (WAV) (MP3) Early Digital Synthesis • Musical Commuted Harpsichord Example: (WAV) (MP3) Spectral Modeling

Physical Modeling Reference: • KL Music • Digital Waveguide • Signal Scattering “Sound Synthesis of the Harpsichord Using a Computationally • Plucked String Efficient Physical Model”, • Struck String • Karplus Strong • EKS Algorithm by Vesa Valim¨ aki,¨ Henri Penttinen, Jonte Knif, Mikael Laurson, • Clarinet • Wind Examples and Cumhur Erkut • Bowed Strings • Distortion Guitar • Acoustic Strings JASP-2004 • Sound Examples • Linearized Violin • Commuted Piano Google search: Harpsichord Sound Synthesis • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 77 / 84 Physical Modeling in Audio Coding

Spectral modeling synthesis is finding application in audio coding. Overview Can physical modeling synthesis be used as well? Early Digital Synthesis

Spectral Modeling • MPEG-4/SAOL already supports essentially all sound synthesis Physical Modeling methods • KL Music • Digital Waveguide • Ability to encode sounds automatically is limited • Signal Scattering • Plucked String ◦ Codebook-Excited Linear Prediction (CELP) is a successful • Struck String source-filter model (not quite physical) • Karplus Strong • EKS Algorithm ◦ There are many isolated examples of model-fitting to • Clarinet • Wind Examples recorded data • Bowed Strings ◦ Good model-based denoising results have been obtained • Distortion Guitar • Acoustic Strings ◦ Coder problem much harder when many sources are mixed • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 78 / 84 Best Known Model-Based Audio Coders

Overview A “cover band” can put together a very convincing facsimile of Early Digital Synthesis popular music performance Spectral Modeling

Physical Modeling • KL Music • Digital Waveguide • Signal Scattering • Plucked String • Struck String • Karplus Strong • EKS Algorithm • Clarinet • Wind Examples • Bowed Strings • Distortion Guitar • Acoustic Strings • Sound Examples • Linearized Violin • Commuted Piano • Pulse Synthesis • Complete Piano • Sound Examples • Phy Audio Coding JOS high-school band “Bittersweet” • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 79 / 84 Future Physical Modeling in Audio Coding?

Overview A “Cover Band” Approach to Model-Based Audio Coding: Early Digital Synthesis

Spectral Modeling 1. Recognize individual “audio streams” in a mix (CASA)

Physical Modeling (“I hear a trap set, electric bass, Fender Rhodes, and a strat”) • KL Music 2. For each stream, calibrate its model heuristically • Digital Waveguide • Signal Scattering (“Here is what I hear the bass part doing: ...”) • Plucked String 3. Fine-tune the synthetic mix to the real mix • Struck String • Karplus Strong (joint “maximum likelihood estimation”) • EKS Algorithm • Clarinet Features of “Cover-Band Coding” (CBC): • Wind Examples • Bowed Strings • The “playing experience” of each “virtual performer” prevents • Distortion Guitar • Acoustic Strings artifacts — “musically unreasonable” parameters are made • Sound Examples unlikely (“Bayesian priors”) • Linearized Violin • Commuted Piano • An incorrect instrument must “imitate” its assigned stream • Pulse Synthesis • Complete Piano • New can be synthesized by deliberately choosing • Sound Examples a new ensemble! • Phy Audio Coding • Phy Audio Coding?

Summary Julius Smith AES-2006 Heyser Lecture – 80 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary

Summary

Julius Smith AES-2006 Heyser Lecture – 81 / 84 Summary

Overview We have reviewed a “CCRMA-centric slice” of the history of digital Early Digital Synthesis sound synthesis (usually starting with results from ): Spectral Modeling

Physical Modeling • Wavetable (one period)

Summary • Subtractive • Additive • FM • Sampling • Spectral Modeling • Physical Modeling (more in tomorrow’s 4:30 PM masterclass) • Connections to audio coding

Julius Smith AES-2006 Heyser Lecture – 82 / 84 Overview Early Digital Synthesis Spectral Modeling Physical Modeling Summary Sound Acknowledgment

Sound Acknowledgment

Julius Smith AES-2006 Heyser Lecture – 83 / 84 Sound Acknowledgment

Overview Thanks to Emu / Creative Labs for providing a superb-quality Early Digital Synthesis external D/A converter for this talk (an E-Mu 0404/USB) Spectral Modeling

Physical Modeling

Summary

Sound Acknowledgment

Julius Smith AES-2006 Heyser Lecture – 84 / 84