Additive Synthesis
Total Page:16
File Type:pdf, Size:1020Kb
Additive synthesis Additive synthesis is a sound synthesis technique that creates timbre by adding sine waves together.[1][2] Additive synthesis example The timbre of musical instruments can be considered in the light of Fourier theory to consist of multiple harmonic or 0:00 inharmonic partials or overtones. Each partial is a sine wave of different frequency and amplitude that swells and decays over A bell-like sound generated by time due to modulation from an ADSR envelope or low frequency oscillator. additive synthesis of 21 inharmonic partials Additive synthesis most directly generates sound by adding the output of multiple sine wave generators. Alternative implementations may use pre-computedwavetables or the inverse Fast Fourier transform. Problems playing this file? See media help. Contents 1 Explanation 2 Definitions 2.1 Harmonic form 2.2 Time-dependent amplitudes 2.3 Inharmonic form 2.4 Time-dependent frequencies 2.5 Broader definitions 3 Implementation methods 3.1 Oscillator bank synthesis 3.2 Wavetable synthesis 3.2.1 Group additive synthesis 3.3 Inverse FFT synthesis 4 Additive analysis/resynthesis 4.1 Products 5 Applications 5.1 Musical instruments 5.2 Speech synthesis 6 History 6.1 Timeline 7 Discrete-time equations 8 See also 9 References 10 External links Explanation The sounds that are heard in everyday life are not characterized by a single frequency. Instead, they consist of a sum of pure sine frequencies, each one at a different amplitude. When humans hear these frequencies simultaneously, we can recognize the sound. This is true for both "non-musical" sounds (e.g. water splashing, leaves rustling, etc) and for "musical sounds" (e.g. a piano note, a bird's tweet, etc). This set of parameters (frequencies, their relative amplitudes, and how the relative amplitudes change over time) are encapsulated by the timbre of the sound. Fourier analysis is the technique that is used to determine these exact timbre parameters from an overall sound signal; conversely, the resulting set of frequencies and amplitudes is called the Fourier series of the original sound signal. In the case of a musical note, the lowest frequency of its timbre is designated as the sound's fundamental frequency. For simplicity, we often say that the note is playing at that fundamental frequency (e.g. "middle C is 261.6 Hz")[3], even though the sound of that note consists of many other frequencies as well. The set of the remaining frequencies is called the overtones (or the harmonics) of the sound[4]. In other words, the fundamental frequency alone is responsible for the pitch of the note, while the overtones define the timbre of the sound. The overtones of a piano playing middle C will be quite different from the overtones of a violin playing the same note; that's what allows us to differentiate the sounds of the two instruments. There are even subtle differences in timbre between different versions of the same instrument (for example, anupright piano vs. a grand piano). Additive synthesis aims to exploit this property of sound in order to construct timbre from the ground up. By adding together pure frequencies (sine waves) of varying frequencies and amplitudes, we can precisely define the timbre of the sound that we want to create. Definitions Harmonic additive synthesis is closely related to the concept of a Fourier series which is a way of expressing a periodic function as the sum of sinusoidal functions with frequencies equal to integer multiples of a common fundamental frequency. These sinusoids are called harmonics, overtones, or generally, partials. In general, a Fourier series contains an infinite number of sinusoidal components, with no upper limit to the frequency of the sinusoidal functions and includes a DC component (one with frequency of 0 Hz). Frequencies outside of the human audible range can be omitted in additive synthesis. As a result, only a finite number of sinusoidal terms with frequencies that lie within the audible range are modeled in additive synthesis. A waveform or function is said to beperiodic if for all and for some period . The Fourier series of a periodic function is mathematically expressed as: where is the fundamental frequency of the waveform and is equal to the reciprocal of the period, Schematic diagram of additive synthesis. The inputs to the oscillators are frequencies and amplitudes . is the amplitude of the th harmonic, is the phase offset of the th harmonic. atan2( ) is the four-quadrant arctangent function, Being inaudible, the DC component, , and all components with frequencies higher than some finite limit, , are omitted in the following expressions of additive synthesis. Harmonic form The simplest harmonic additive synthesis can be mathematically expressed as: , (1) where is the synthesis output, , , and are the amplitude, frequency, and the phase offset, respectively, of the th harmonic partial of a total of harmonic partials, and is the fundamental frequency of the waveform and thefrequency of the musical note. Time-dependent amplitudes More generally, the amplitude of each harmonic can be prescribed as a function of time, , in which case the synthesis output is Example of harmonic additive synthesis in which each harmonic has a time-dependent amplitude. The fundamental frequency is 440 Hz. 0:00 Problems listening to this file? See Media help . (2) Each envelope should vary slowly relative to the frequency spacing between adjacent sinusoids. Thebandwidth of should be significantly less than . Inharmonic form Additive synthesis can also produce inharmonic sounds (which are aperiodic waveforms) in which the individual overtones need not have frequencies that are integer multiples of some common fundamental frequency.[5][6] While many conventional musical instruments have harmonic partials (e.g. an oboe), some have inharmonic partials (e.g. bells). Inharmonic additive synthesis can be described as where is the constant frequency of th partial. Time-dependent frequencies In the general case, the instantaneous frequency of a sinusoid is the derivative (with respect to time) of the argument of the sine or cosine function. If this frequency is represented in hertz, rather than in angular frequency form, then this derivative is divided by . This is the case whether the partial is harmonic or inharmonic and whether its frequency is constant or time- varying. In the most general form, the frequency of each non-harmonic partial is a non-negative function of time, , yielding Example of inharmonic additive synthesis in which both the amplitude and frequency of each partial are time-dependent. 0:00 Problems listening to this file? See Media help (3) Broader definitions Additive synthesis more broadly may mean sound synthesis techniques that sum simple elements to create more complex timbres, even when the elements are not sine waves.[7][8] For example, F. Richard Moore listed additive synthesis as one of the "four basic categories" of sound synthesis alongside subtractive synthesis, nonlinear synthesis, and physical modeling.[8] In this broad sense, pipe organs, which also have pipes producing non-sinusoidal waveforms, can be considered as a variant form of additive synthesizers. Summation of principal components and Walsh functions have also been classified as additive synthesis.[9] Implementation methods Modern-day implementations of additive synthesis are mainly digital. (See sectionDiscr ete-time equations for the underlying discrete-time theory) Oscillator bank synthesis Additive synthesis can be implemented using a bank of sinusoidal oscillators, one for each partial.[1] Wavetable synthesis In the case of harmonic, quasi-periodic musical tones, wavetable synthesis can be as general as time-varying additive synthesis, but requires less computation during synthesis.[10][11] As a result, an efficient implementation of time-varying additive synthesis of harmonic tones can be accomplished by use ofwavetable synthesis. Group additive synthesis Group additive synthesis[12][13][14] is a method to group partials into harmonic groups (having different fundamental frequencies) and synthesize each group separately with wavetable synthesis before mixing the results. Inverse FFT synthesis An inverse Fast Fourier transform can be used to efficiently synthesize frequenciesthat evenly divide the transform period or "frame". By careful consideration of theDFT frequency-domain representation it is also possible to efficiently synthesize sinusoids of arbitrary frequencies using a series of overlapping frames and the inverseFast Fourier transform.[15] Additive analysis/resynthesis It is possible to analyze the frequency components of a recorded sound giving a "sum of sinusoids" representation. This representation can be re-synthesized using additive synthesis. One method of decomposing a sound into time varying sinusoidal partials isshort-time Fourier transform (STFT)-based McAulay-Quatieri Analysis.[17][18] By modifying the sum of sinusoids representation, timbral alterations can be made prior to resynthesis. For example, a harmonic sound could be restructured to sound inharmonic, and vice versa. Sound hybridisation or "morphing" has been implemented by additive resynthesis.[19] Additive analysis/resynthesis has been employed in a number of techniques including Sinusoidal Modelling,[20] Spectral Modelling Synthesis (SMS),[19] and the Reassigned Bandwidth-Enhanced Additive Sound Model.[21] Software that implements additive analysis/resynthesis includes: SPEAR,[22] LEMUR, LORIS,[23] SMSTools,[24] ARSS.[25] Sinusoidal