Current Research in Concatenative Sound Synthesis

Total Page:16

File Type:pdf, Size:1020Kb

Current Research in Concatenative Sound Synthesis Proceedings of the International Computer Music Conference (ICMC),Barcelona, Spain, September 5-9, 2005 CURRENT RESEARCH IN CONCATENATIVE SOUND SYNTHESIS Diemo Schwarz Ircam – Centre Pompidou 1, place Igor-Stravinsky, 75004 Paris, France ABSTRACT is a complex method that needs many different concepts Concatenative synthesis is a promising method of musical working together, thus much work on only one single as- sound synthesis with a steady stream of work and publi- pect fails to relate to the whole. It has seen accelerating cations in recent years. It uses a large database of sound development over the past few years as can be seen in the snippets to assemble a given target phrase. We explain its chronology in section 3. There is now the first commer- principle and components and its main applications, and cial product available (3.13), and, last but not least, ICMC compare existing concatenative synthesis approaches. We 2004 saw the first musical pieces using concatenative syn- then list the most urgent problems for further work on con- thesis (3.12). We try in this article to acknowledge this catenative synthesis. young field of musical sound synthesis that has been iden- tified as such only five years ago. 1. INTRODUCTION Any CSS system must perform the following tasks, Concatenative synthesis methods use a large database of sometimes implicitly. This list of tasks will serve later source sounds, segmented into units, and a unit selec- for a taxonomy of existing systems. tion algorithm that finds the sequence of units that match Analysis The source sound files are segmented into best the sound or phrase to be synthesised, called the tar- units and analysed to express their characteristics with get. The selection is performed according to the descrip- sound descriptors. Segmentation can be by automatic tors of the units, which are characteristics extracted from alignment of music with its score for instrument corpora, the source sounds, or higher level descriptors attributed by blind or arbitrary grain segmentation for free and re- to them. The selected units can then be transformed to synthesis, or can happen on-the-fly. The descriptors can fully match the target specification, and are concatenated. be categorical (class membership), static (constant over a However, if the database is sufficiently large, the proba- unit), or dynamic (varying over the duration of a unit). bility is high that a matching unit will be found, so the need to apply transformations is reduced. The units can Database Source file references, units and unit descrip- be non-uniform, i.e. they can comprise a sound snippet, tors are stored in a database. The subset of the database an instrument note, up to a whole phrase. that is preselected for one particular synthesis is called the Concatenative synthesis can be more or less data- corpus. driven, where, instead of supplying rules constructed by Target The target specification is generated from a sym- careful thinking as in a rule based approach, the rules are bolic score (expressed in notes or descriptors), or anal- induced from the data itself. The advantage of this ap- ysed from an audio score (using the same segmentation proach is that the information contained in the many sound and analysis methods as for the source sounds). examples in the database can be exploited. The current work on concatenative sound synthesis Selection Units are selected from the database that (CSS) focuses on three main applications: match best the given target descriptors according to a dis- High Level Instrument Synthesis Because concaten- tance function and a concatenation quality function. The ative synthesis is aware of the context of the database as selection can be local (the best match for each target unit well as the target units, it can synthesise natural sounding is found individually), or global (the sequence with the transitions by selecting units from matching contexts. In- least total distance if found). formation attributed to the source sounds can be exploited Synthesis is done by concatenation of selected units, pos- for unit selection, which allows high-level control of syn- sibly applying transformations. Depending on the appli- thesis, where the fine details lacking in the target specifi- cation, the selected units are placed at the times given by cation are filled in by the units in the database. the target (musical or rhythmic synthesis), or are concate- Resynthesis of audio A sound or phrase is taken as the nated with their natural duration (free synthesis or speech audio score, which is resynthesized with the same pitch, synthesis). amplitude, and timbre characteristics using units from the database. 2. RELATED WORK Free synthesis from heterogeneous sound databases of- Concatenative synthesis is at the intersection of many fers a sound composer efficient control of the result by fields of research, such as music information retrieval, using perceptually meaningful descriptors, and allows to database technology, real-time and interactive methods, browse and explore a corpus of sounds. sound synthesis models, musical modeling, classification, Concatenative synthesis sprung up independently in mul- perception. Concatenative text-to-speech synthesis shares tiple places and is sometimes referred to as mosaicing. It many concepts and methods with concatenative sound 1 Proceedings of the International Computer Music Conference (ICMC),Barcelona, Spain, September 5-9, 2005 synthesis, but has different goals. Singing voice synthe- by hand. The sound base was manually labeled with mu- sis occupies an intermediate position between speech and sical genre and tempo as descriptors. sound synthesis and often uses concatenative methods. For instance, Meron [13] uses an automatically consti- 3.2. Caterpillar (2000) tuted large unit database of one singer. Caterpillar [19, 20, 21, 22], performs data-driven concat- Content based processing is a new paradigm in digi- enative musical sound synthesis from large heterogeneous tal audio processing that shares the analysis with CSS. It sound databases. Units are segmented by automatic align- performs symbolic manipulations of elements of a sound, ment of music with its score [14], or by blind segmen- rather than using signal processing alone. Lindsay [12] tation. The descriptors are based on the MPEG-7 low- proposes context-sensitive effects by utilising MPEG-7 de- level descriptor set [17], plus descriptors derived from the scriptors. Jehan [7] works on the objet segmentation and score and the sound class. The low-level descriptors are perception-based description of audio material and then condensed to unit descriptors by modeling of their tempo- manipulates the audio in terms of its musical structure. ral evolution over the unit (mean value, slope, range, etc.) The Song Sampler [1] is a system which automatically The database is implemented using a relational SQL data- samples meaningful units of a song, assigns them to the base management system for reliability and flexibility. keys of a Midi-keyboard to be played with by a user. The unit selection algorithm inspired from speech syn- Related to selection based sound synthesis is music se- thesis finds the sequence of database units that best match lection where a sequence of songs is generated accord- the given synthesis target units using two cost functions: ing to their characteristics and a desired evolution over the The target cost expresses the similarity of a target unit to playlist. An innovative solution based on constraint satis- the database units, including a context around the target, faction is proposed in [16], which ultimately inspired the and the concatenation cost predicts the quality of the join use of constraints for CSS in [27] (section 3.3). of two database units. The optimal sequence of units is The Musescape music browser [25] works by speci- found by a Viterbi algorithm as the best path through the fying high-level musical features (tempo, genre, year) on network of database units. sliders. The system then selects in real time musical ex- The Caterpillar framework is also used for expressive cerpts that match the desired features. speech synthesis [2], and first attempts for hybrid synthe- sis combining music and speech are described in [3]. 3. CHRONOLOGY 3.3. Musical Mosaicing (2001) Approaches to musical sound synthesis that are somehow Musical Mosaicing, or Musaicing [27], performs a kind of data-driven and concatenative can be found throughout automated remix of songs. It is aimed at a sound database history. They are usually not identified as such but can of pop music, selecting pre-analysed homogeneous snip- be arguably seen as instances of fixed inventory or man- pets of songs and reassembling them. Its great innovation ual concatenative synthesis. was to formulate the unit selection as a constraint solv- The groundwork for concatenative synthesis was laid ing problem. The set of descriptors used for the selection in 1948 by the Groupe de Recherche Musicale (GRM) of is: mean pitch, loudness, percussivity, timbre. Work on Pierre Schaeffer, using for the first time recorded segments adding more descriptors has picked up again with [28]. of sound to create their pieces of Musique Concrete` . Scha- effer defines the notion of sound object, which is a clearly 3.4. Soundmosaic (2001) delimited segment in a source recording. This is not so far Soundmosaic [5] constructs an approximation of one from what is here called unit. sound out of small units of varying size from other sounds. Concatenative aspects can also be found in sampling. For version 1.0 of Soundmosaic, the selection of the best The sound database consists of a fixed unit inventory anal- source unit uses a direct match of the normalised wave- ysed by instrument, playing style, pitch, and dynamics, form (Manhatten distance). Version 1.1 introduced as and the selection is reduced to a fixed mapping of Midi- distance metric the correlation between normalized units. note and velocity to a sample.
Recommended publications
  • Granular Synthesis and Physical Modeling with This Manual Is Released Under the Creative Commons Attribution 3.0 Unported License
    Granular synthesis and physical modeling with This manual is released under the Creative Commons Attribution 3.0 Unported License. You may copy, distribute, transmit and adapt it, for any purpose, provided you include the following attribution: Kaivo and the Kaivo manual by Madrona Labs. http://madronalabs.com. Version 1.3, December 2016. Written by George Cochrane and Randy Jones. Illustrated by David Chandler. Typeset in Adobe Minion using the TEX document processing system. Any trademarks mentioned are the sole property of their respective owners. Such mention does not imply any endorsement of or associa- tion with Madrona Labs. Introduction “The future evolution of virtual devices is less constrained than that of real devices.” –Julius O. Smith Kaivo is a software instrument combining two powerful synthesis techniques (physical modeling and granular synthesis) in an easy-to- use semi-modular package. It’s laid out a bit like an acoustic instru- For more information on the theory be- ment; the GRANULATOR module acts like the player’s touch, exciting hind Kaivo, see Chapter 1, “Physi-who? Granu-what?” one or more tuned objects (here, the RESONATOR module, based on physical models of resonant objects) that come together in a central resonating body (the BODY module, also physics-based). This allows for a natural (or uncanny) sense of space, pleasing in- teractions between voices, and a ton of expressive potential—all traits in short supply in digital synthesis. The acoustic comparison begins to pale when you realize that your “touch” is really a granular sam- ple player with scads of options, and that the physical properties of the resonating modules are widely variable, and in real time.
    [Show full text]
  • The Sonification Handbook Chapter 9 Sound Synthesis for Auditory Display
    The Sonification Handbook Edited by Thomas Hermann, Andy Hunt, John G. Neuhoff Logos Publishing House, Berlin, Germany ISBN 978-3-8325-2819-5 2011, 586 pages Online: http://sonification.de/handbook Order: http://www.logos-verlag.com Reference: Hermann, T., Hunt, A., Neuhoff, J. G., editors (2011). The Sonification Handbook. Logos Publishing House, Berlin, Germany. Chapter 9 Sound Synthesis for Auditory Display Perry R. Cook This chapter covers most means for synthesizing sounds, with an emphasis on describing the parameters available for each technique, especially as they might be useful for data sonification. The techniques are covered in progression, from the least parametric (the fewest means of modifying the resulting sound from data or controllers), to the most parametric (most flexible for manipulation). Some examples are provided of using various synthesis techniques to sonify body position, desktop GUI interactions, stock data, etc. Reference: Cook, P. R. (2011). Sound synthesis for auditory display. In Hermann, T., Hunt, A., Neuhoff, J. G., editors, The Sonification Handbook, chapter 9, pages 197–235. Logos Publishing House, Berlin, Germany. Media examples: http://sonification.de/handbook/chapters/chapter9 18 Chapter 9 Sound Synthesis for Auditory Display Perry R. Cook 9.1 Introduction and Chapter Overview Applications and research in auditory display require sound synthesis and manipulation algorithms that afford careful control over the sonic results. The long legacy of research in speech, computer music, acoustics, and human audio perception has yielded a wide variety of sound analysis/processing/synthesis algorithms that the auditory display designer may use. This chapter surveys algorithms and techniques for digital sound synthesis as related to auditory display.
    [Show full text]
  • Combining Granular Synthesis with Frequency Modulation
    Combining granular synthesis with frequency modulation. Kim ERVIK Øyvind BRANDSEGG Department of music Department of music University of Science and Technology University of Science and Technology Norway Norway [email protected] [email protected] Abstract acoustical events. Sound as particles has been Both granular synthesis and frequency used in applications like independent time and modulation are well-established synthesis pitch scaling, formant modification, analog synth techniques that are very flexible. This paper modeling, clouds of sound, granular delays and will investigate different ways of combining reverbs, etc [3]. Examples of granular synthesis the two techniques. It will describe the rules parameters are density, grain pitch, grain duration, of spectra that emerge when combining, compare it to similar synthesis techniques and grain envelope, the global arrangement of the suggest some aesthetic perspectives on the grains, and of course the content of the grains matter. (which can be a synthetic waveform or sampled sound). Granular synthesis gives the musician vast Keywords expressive possibilities [10]. Granular synthesis, frequency modulation, 1.3 FM synthesis partikkel, csound, sound synthesis Frequency modulation has been a known method of coding audio into radio signal since the beginning of commercial radio. In 1964 that John Chowning discovered the implication of frequency modulation of audio working on synthesis of brass instrument at Stanford [2]. He discovered that modulating the frequency resulted in sideband emerging from the carrier frequency. Fig 1:Grain Pitch modulation Yamaha later implemented the technique in the hugely successful DX7. 1 Introduction If we consider FM synthesis using sinusoidal carrier and modulation oscillator, the spectral 1.1 Method components present in a FM sound can be mathematically stated as in figure 2.
    [Show full text]
  • Neural Granular Sound Synthesis
    Neural Granular Sound Synthesis Adrien Bitton Philippe Esling Tatsuya Harada IRCAM, CNRS UMR9912 IRCAM, CNRS UMR9912 The University of Tokyo, RIKEN Paris, France Paris, France Tokyo, Japan [email protected] [email protected] [email protected] ABSTRACT Granular sound synthesis is a popular audio generation technique based on rearranging sequences of small wave- resynthesis generated sequence match decode form windows. In order to control the synthesis, all grains grains in a given corpus are analyzed through a set of acoustic continuous + f(↵) descriptors. This provides a representation reflecting some discrete + condition + form of local similarities across the grains. However, the grain space + quality of this grain space is bound by that of the descrip- + + + grain tors. Its traversal is not continuously invertible to signal grain + + latent space sample and does not render any structured temporality. library dz acoustic R We demonstrate that generative neural networks can im- analysis encode plement granular synthesis while alleviating most of its target signal shortcomings. We efficiently replace its audio descriptor input basis by a probabilistic latent space learned with a Vari- signal ational Auto-Encoder. In this setting the learned grain space is invertible, meaning that we can continuously syn- thesize sound when traversing its dimensions. It also im- Figure 1. Left: A grain library is analysed and scattered (+) into the acoustic dimensions. A target is defined, by analysing an other signal (o) plies that original grains are not stored for synthesis. An- or as a free trajectory, and matched to the library through the acoustic other major advantage of our approach is to learn struc- descriptors.
    [Show full text]
  • Concatenative Sound Synthesis: the Early Years
    CONCATENATIVE SOUND SYNTHESIS: THE EARLY YEARS Diemo Schwarz Ircam – Centre Pompidou 1, place Igor-Stravinsky, 75003 Paris, France http://www.ircam.fr/anasyn/schwarz http://concatenative.net [email protected] ABSTRACT Concatenative sound synthesis (CSS) methods use a large database of source sounds, segmented into units, Concatenative sound synthesis is a promising method and a unit selection algorithm that finds the sequence of of musical sound synthesis with a steady stream of work units that match best the sound or phrase to be synthe- and publications for over five years now. This article of- sised, called the target. The selection is performed ac- fers a comparative survey and taxonomy of the many dif- cording to the descriptors of the units, which are charac- ferent approaches to concatenative synthesis throughout teristics extracted from the source sounds, or higher level the history of electronic music, starting in the 1950s, even descriptors attributed to them. The selected units can then if they weren't known as such at their time, up to the recent be transformed to fully match the target specification, and surge of contemporary methods. Concatenative sound are concatenated. However, if the database is sufficiently synthesis methods use a large database of source sounds, large, the probability is high that a matching unit will be segmented into units, and a unit selection algorithm that found, so the need to apply transformations, which always finds the units that match best the sound or musical phrase degrade sound quality, is reduced. The units can be non- to be synthesised, called the target. The selection is per- uniform (heterogeneous), i.e.
    [Show full text]
  • Chroma Palette: Chromatic Maps of Sound As Granular Synthesis
    Proceedings of the 2007 Conference on New Interfaces for Musical Expression (NIME07), New York, NY, USA Chroma Palette: Chromatic Maps of Sound As Granular Synthesis Interface Justin Donaldson Ian Knopke Chris Raphael Indiana University School of Indiana University School of Indiana University School of Informatics Informatics Informatics 1900 E. 10th Street, Room 931 1900 E. 10th Street, Room 932 1900 E. 10th Street, Room 933 Bloomington, IN 47406 Bloomington, IN 47406 Bloomington, IN 47406 [email protected] [email protected] [email protected] ABSTRACT There are many different categories of granular synthesis, Chroma based representations of acoustic phenomenon are such as synchronous, quasi-synchronous, and asynchronous representations of sound as pitched acoustic energy. A frame- forms, referring to the regularity with which grains are wise chroma distribution over an entire musical piece is a useful reassembled. Grains are usually windowed, both to aid resynthesis and straightforward representation of its musical pitch over time. and to avoid audible clicks. However, the choice of window This paper examines a method of condensing the block-wise function can also have a pronounced effect on the resulting timbre chroma information of a musical piece into a two dimensional and is an important component of the synthesis process. Granular embedding. Such an embedding is a representation or map of the synthesis has similarities to other common analysis/resynthesis different pitched energies in a song, and how these energies relate methodologies such as the short-term Fourier transform and to each other in the context of the song. The paper presents an wavelet-based techniques. Figure 1 shows an example of an interactive version of this representation as an exploratory envelope windowing and overlap arrangement for four different analytical tool or instrument for granular synthesis.
    [Show full text]
  • NUMERICAL SOUND SYNTHESIS Ii Numerical Sound Synthesis
    NUMERICAL SOUND SYNTHESIS ii Numerical Sound Synthesis Stefan Bilbao November 27, 2007 iii iv Contents Preface xiii 1 Sound Synthesis and Physical Modeling 1 1.1 AbstractDigitalSoundSynthesis . .......... 2 1.1.1 AdditiveSynthesis . .. .. .. .. .. .. .. .. .. .. .. .. .... 3 1.1.2 SubtractiveSynthesis . ..... 5 1.1.3 WavetableSynthesis . .... 5 1.1.4 AMandFMSynthesis.............................. 7 1.1.5 OtherMethods .................................. 8 1.2 PhysicalModeling ................................ ..... 9 1.2.1 LumpedMass-SpringNetworks . ..... 10 1.2.2 ModalSynthesis ................................ .. 11 1.2.3 DigitalWaveguides. .... 13 1.2.4 HybridMethods ................................. 16 1.2.5 DirectNumericalSimulation . ...... 17 1.3 PhysicalModeling: ALargerView . ........ 20 1.3.1 Physical Models as Descended from Abstract Synthesis ............ 20 1.3.2 Connections: Direct Simulation and Other Methods . ........... 21 1.3.3 ComplexityofMusicalSystems . ...... 22 1.3.4 Why? ........................................ 25 2 Time Series and Difference Operators 27 2.1 TimeSeries ...................................... ... 28 2.2 Shift, Difference and AveragingOperators . ............ 29 2.2.1 TemporalWidth ofDifference Operators. ........ 30 2.2.2 CombiningDifferenceOperators . ...... 31 2.2.3 TaylorSeriesandAccuracy . ..... 32 2.2.4 Identities .................................... .. 33 2.3 FrequencyDomainAnalysis . ....... 34 2.3.1 Laplace and z transforms ............................. 34 2.3.2 Frequency Domain Interpretation
    [Show full text]
  • Physically-Based Parametric Sound Synthesis and Control
    Physically-Based Parametric Sound Synthesis and Control Perry R. Cook Princeton Computer Science (also Music) Course Introduction Parametric Synthesis and Control of Real-World Sounds for virtual reality games production auditory display interactive art interaction design Course #2 Sound - 1 Schedule 0:00 Welcome, Overview 0:05 Views of Sound 0:15 Spectra, Spectral Models 0:30 Subtractive and Modal Models 1:00 Physical Models: Waveguides and variants 1:20 Particle Models 1:40 Friction and Turbulence 1:45 Control Demos, Animation Examples 1:55 Wrap Up Views of Sound • Sound is a recorded waveform PCM playback is all we need for interactions, movies, games, etc. (Not true!!) • Time Domain x( t ) (from physics) • Frequency Domain X( f ) (from math) • Production what caused it • Perception our image of it Course #2 Sound - 2 Views of Sound Time Domain is most closely related to Production Frequency Domain is most closely related to Perception we will see that many hybrids abound Views of Sound: Time Domain Sound is produced/modeled by physics, described by quantities of • Force force = mass * acceleration • Position x(t) actually < x(t), y(t), z(t) > • Velocity Rate of change of position dx/dt • Acceleration Rate of change of velocity dv/dt Examples: Mass+Spring+Damper Wave Equation Course #2 Sound - 3 Mass/Spring/Damper F = ma = - ky - rv - mg F = ma = - ky - rv (if gravity negligible) d 2 y r dy k + + y = 0 dt2 m dt m ( ) D2 + Dr / m + k / m = 0 2nd Order Linear Diff Eq. Solution 1) Underdamped: -t/τ ω y(t) = Y0 e cos( t ) exp.
    [Show full text]
  • Soft Sound: a Guide to the Tools and Techniques of Software Synthesis
    California State University, Monterey Bay Digital Commons @ CSUMB Capstone Projects and Master's Theses Spring 5-20-2016 Soft Sound: A Guide to the Tools and Techniques of Software Synthesis Quinn Powell California State University, Monterey Bay Follow this and additional works at: https://digitalcommons.csumb.edu/caps_thes Part of the Other Music Commons Recommended Citation Powell, Quinn, "Soft Sound: A Guide to the Tools and Techniques of Software Synthesis" (2016). Capstone Projects and Master's Theses. 549. https://digitalcommons.csumb.edu/caps_thes/549 This Capstone Project is brought to you for free and open access by Digital Commons @ CSUMB. It has been accepted for inclusion in Capstone Projects and Master's Theses by an authorized administrator of Digital Commons @ CSUMB. Unless otherwise indicated, this project was conducted as practicum not subject to IRB review but conducted in keeping with applicable regulatory guidance for training purposes. For more information, please contact [email protected]. California State University, Monterey Bay Soft Sound: A Guide to the Tools and Techniques of Software Synthesis Quinn Powell Professor Sammons Senior Capstone Project 23 May 2016 Powell 2 Introduction Edgard Varèse once said: “To stubbornly conditioned ears, anything new in music has always been called noise. But after all, what is music but organized noises?” (18) In today’s musical climate there is one realm that overwhelmingly leads the revolution in new forms of organized noise: the realm of software synthesis. Although Varèse did not live long enough to see the digital revolution, it is likely he would have embraced computers as so many millions have and contributed to these new forms of noise.
    [Show full text]
  • The Evolution of Granular Synthesis: an Overview of Current Research
    International Symposium on The Creative and Scientific Legacies of Iannis Xenakis, 8-10 June 2006, University of Guelph, Toronto, Canada ________________________________________________________ The evolution of granular synthesis: an overview of current research Curtis Roads Media Arts and Technology University of California Santa Barbara CA 93106-6065 USA 9 June 2006 This lecture presents an overview of several projects pursued over the past five years in our laboratories at Santa Barbara. All this research is based on a scientific model of sound initially proposed by Dennis Gabor (1946), and soon afterward extended to music by Iannis Xenakis (1960). Granular analysis (also called atomic decomposition) and granular synthesis have evolved over more than five decades from a paper theory and primitive experiments into a broad range of applied techniques. Specific to the granular model is its focus on the microacoustic time scale (typically 1 to 100 ms). Granular methods treat sound as a stream of acoustic particles in both the time domain and the time-frequency (TF) domain. For more details, see Roads (2002; Kling, et al. 2005). In this lecture, I first very briefly trace the history of the idea of sound particles. Next I will demonstrate PulsarGenerator, an application developed by Alberto de Campo and me in 2001 for a specific type of particle synthesis with links to past analog techniques. I will also demonstrate the SweepingQGranulator, a tool that I wrote in the SuperCollider language for the microfiltration of granulated sound. The latest threads in this line of research go in two directions. The first is a time-frequency analysis method known as matching pursuit decomposition.
    [Show full text]
  • Morphing of Granular Sounds
    Proc. of the 18th Int. Conference on Digital Audio Effects (DAFx-15), Trondheim, Norway, Nov 30 - Dec 3, 2015 MORPHING OF GRANULAR SOUNDS Sadjad Siddiq Advanced Technology Division, Square Enix Co., Ltd. Tokyo, Japan [email protected] ABSTRACT Granular sounds are commonly used in video games but the con- ventional approach of using recorded samples does not allow sound designers to modify these sounds. In this paper we present a tech- nique to synthesize granular sound whose tone color lies at an arbi- trary point between two given granular sound samples. We first ex- tract grains and noise profiles from the recordings, morph between them and finally synthesize sound using the morphed data. Dur- ing sound synthesis a number of parameters, such as the number of grains per second or the loudness distribution of the grains, can be altered to vary the sound. The proposed method does not only al- low to create new sounds in real-time, it also drastically reduces the Figure 1: Summary of granular synthesis. Grains are at- memory footprint of granular sounds by reducing a long recording tenuated and randomly added to the final mix. to a few hundred grains of a few milliseconds length each. simplest case their amplitude is attenuated randomly. Depending 1. INTRODUCTION on the probability distribution of the attenuation factors, this can 1.1. Granular synthesis in video games have a very drastic effect on the granularity and tone color of the produced sound. For a more detailed introduction to granular syn- In previous work we described the morphing between simple im- thesis refer to [4] or [5].
    [Show full text]
  • CM3106 Chapter 5: Digital Audio Synthesis
    CM3106 Chapter 5: Digital Audio Synthesis Prof David Marshall [email protected] and Dr Kirill Sidorov [email protected] www.facebook.com/kirill.sidorov School of Computer Science & Informatics Cardiff University, UK Digital Audio Synthesis Some Practical Multimedia Digital Audio Applications: Having considered the background theory to digital audio processing, let's consider some practical multimedia related examples: Digital Audio Synthesis | making some sounds Digital Audio Effects | changing sounds via some standard effects. MIDI | synthesis and effect control and compression Roadmap for Next Few Weeks of Lectures CM3106 Chapter 5: Audio Synthesis Digital Audio Synthesis 2 Digital Audio Synthesis We have talked a lot about synthesising sounds. Several Approaches: Subtractive synthesis Additive synthesis FM (Frequency Modulation) Synthesis Sample-based synthesis Wavetable synthesis Granular Synthesis Physical Modelling CM3106 Chapter 5: Audio Synthesis Digital Audio Synthesis 3 Subtractive Synthesis Basic Idea: Subtractive synthesis is a method of subtracting overtones from a sound via sound synthesis, characterised by the application of an audio filter to an audio signal. First Example: Vocoder | talking robot (1939). Popularised with Moog Synthesisers 1960-1970s CM3106 Chapter 5: Audio Synthesis Subtractive Synthesis 4 Subtractive synthesis: Simple Example Simulating a bowed string Take the output of a sawtooth generator Use a low-pass filter to dampen its higher partials generates a more natural approximation of a bowed string instrument than using a sawtooth generator alone. 0.5 0.3 0.4 0.2 0.3 0.1 0.2 0.1 0 0 −0.1 −0.1 −0.2 −0.2 −0.3 −0.3 −0.4 −0.5 −0.4 0 2 4 6 8 10 12 14 16 18 20 0 2 4 6 8 10 12 14 16 18 20 subtract synth.m MATLAB Code Example Here.
    [Show full text]