
10th International Society for Music Information Retrieval Conference (ISMIR 2009) 21ST CENTURY ELECTRONICA: MIR TECHNIQUES FOR CLASSIFICATION AND PERFORMANCE Dimitri Diakopoulos, Owen Vallis, Jordan Hochenbaum, Jim Murphy, Ajay Kapur California Institute of the Arts New Zealand School of Music Valencia, CA USA Wellington, NZ {ddiakopoulos,jamesmurphy} {ajay.kapur,vallisowen,hochenjord} @alum.calarts.edu @nzsm.ac.nz ABSTRACT 2. RELATED WORK The performance of electronica by Disc Jockys (DJs) Our work draws on a wide array of related research rang- presents a unique opportunity to develop interactions be- ing from musical descriptors and novel performance inter- tween performer and music. Through recent research in faces to recent applications in tabletop computing. By the MIR field, new tools for expanding DJ performance synthesizing these related but disparate areas of research, are emerging. The use of spectral, loudness, and temporal we enable new performance experiences for individual descriptors for the classification of electronica is ex- and group DJs to create and modify set-lists in real time. plored. Our research also introduces the use of a multi- Genre classification can be accomplished using a range touch interface to drive a performance-oriented DJ appli- of signal features and algorithms. For electronica in par- cation utilizing the feature set. Furthermore, we present ticular, features and patterns such as rhythm, tempo, pe- that a multi-touch surface provides an extensible and col- riodicity, and even use of panning have been explored in laborative interface for browsing and manipulating MIR- the literature [1-3]. related data in real time. For DJs specifically, the use of interfaces to retrieve musically relevant material in performance has included Keywords: Electronica, Electronic Dance Music, Genre query-by-beat-boxing [4], and query-by-humming [5]. Classification, User Interfaces, DJ, Multi-touch. Retrieval using both traditional and non-traditional in- struments and interfaces has been explored by [6]. Other 1. INTRODUCTION research in the academic arena for enabling DJ perform- ance includes AudioPad [7], and Mixxx [8]. Although we Electronic dance music, often referred to as Electronica, is take influence in these interfaces for retrieval, our work an overarching collection of genres that focus predomi- wishes to explore a browsing paradigm using similar nately on rhythmic motifs & repeating loops. A task of the creative interfaces. electronica DJ is to compile a set-list of music for per- 1 In the commercial sector, Stanton’s Final Scratch en- formance. Additionally, DJs are always looking for ways ables DJs to use a physical controller to manipulate and to expand the interactivity of their performances through 2 mix digital music, while Native Instruments’ Traktor is a the use of new tools. The primary goal of this work is to software-only solution for DJ performance. Ableton’s give the modern, digital DJ access to a wider range of 3 flagship software, Live , has been increasingly used to performance options using MIR techniques such as fea- enable DJs to use their own pre-composed music in live ture extraction, genre classification, and clustering. Com- performance through the synchronized playback of dif- bined with advances in tabletop computing, these tech- ferent audio loops, known as clips. niques have made it possible to add a layer of interactivity A multitude of literature on tabletop computing & in- to automatic playlist generation. terfaces exists. The Reactable team was one of the first In the following section we detail related work on mu- groups to directly apply both tangible and multi-touch sic features and electronic performance interfaces includ- interaction to the performance of music [9], followed by ing recent work in tabletop computing. In the remainder others including the earlier referenced AudioPad, which of the paper we discuss our feature extractors, genre clas- is also a tangible interface. More recently, MarGrid, a UI sification results and the interface that we developed to for the browsing of a digital music collection using Self- enable DJs to interact with those results to create set-lists. Organizing Maps has been examined using a tabletop We conclude the paper with a discussion of future work interface [10]. The use of Self Organizing Maps (SOM) in interactive MIR powered DJ applications and tabletop for visualizing feature data has also been previously cov- computing. ered by [11], [12]. In addition, although not performance- oriented, MusicSim presents an interesting combination of audio analysis and music browsing in an interactive com- puter-based interface [13]. Permission to make digital or hard copies of all or part of this work for Our aim here is to expand on these efforts by introduc- personal or classroom use is granted without fee provided that copies ing the use of a multi-touch surface in a way that is both are not made or distributed for profit or commercial advantage and that intuitive and collaborative. The use of Self Organizing copies bear this notice and the full citation on the first page. © 2009 International Society for Music Information Retrieval 1 http://www.stantondj.com/ 2 http://www.native-instruments.com/ 3 http://www.ableton.com/ 465 Poster Session 3 Maps represents a useful way of organizing features for around 140 BPM. DJ Tiesto, Ferry Corsten, and Sasha visualization, on-top of which many real-time interactive are popular artists within the Trance genre. applications are possible. Downtempo employs lush harmonic textures and groove-oriented percussion. Tempos are characteristically 3. DATA COLLECTION low, ranging from 60 to 90 BPM. Boards of Canada, Air, and Bonobo are well-known artists within the genre. For our experiments, six genres across the spectrum of electronic music were selected for their diverse character- 4. AUDIO ANALYSIS AND CLASSIFICATION istics and wide-spread popularity. One hundred 2 to 8 minute prototypical tracks were Audio analysis was performed using the ChucK audio sliced at random into single 30-second chunks for each programming language [14]. Our results are based on a genre. Our dataset contains at least 20 distinct artists in two-second (88200 sample) Hann window, resulting in 15 each genre; tracks were not chosen on the perceived 8-dimensional vectors for each audio clip. In addition to genre of the composing artist, but a human baseline being written to disk for further analysis, the raw data was analysis by the authors. In total, there are 600 30-second also sent over networked protocol (OSC1) into Process- clips, each in a stereo 44.1 kHz PCM-encoded file format. ing2, a visuals-oriented programming language. The proc- All files were normalized before experimentation. ess of visualizing the data using Processing is later pre- sented in Section 5. Before application development could 3.1 Genre Definitions begin, a central concern was to uncover a feature-set that could accurately classify electronica. We follow with a Many subgenres fall beneath the umbrella term of elec- description of the eight features used in our experiments. tronica—this paper examines six of the most broad & popular genres commonly played by DJs: intelligent dance 4.1 Spectral Features music (IDM), house, techno drum and bass (DnB), trance, and downtempo. A brief description of them is as follows: • Centroid, the centre of mass of the spectrum; IDM distinguishes itself by its heavy use of complex • Flux, the change in spectral energy across suc- meter, sophisticated and often sporadic percussive ele- cessive frames; ments, and varying use of syncopation. IDM carries with • Rolloff, the frequency below which resides 85 it a rich harmonic and melodic palate borrowed from percent of a spectrum's energy. many genres. Tempos typically range from 150-180 BPM. Notable artists in the genre are Aphex Twin, 4.2 Loudness Features Squarepusher, and Autechre. IDM may sometimes be referred to as Glitch music. • RMS, the amplitude of a window; House music makes use of the common ‘four-on-the- • Panning, a coefficient used to describe the floor’ rhythm pattern consisting of a steady kick drum on weight of the signal in either the left or right each downbeat in a 4/4 meter. Defining characteristics channels [3]; involve offbeat open hi-hat patterns and snare or claps on • Panning Delta, change in the panning coefficient the two and four of every bar. Harmonic content and in- across successive windows [3]. strumentation is often borrowed from Disco genres. Tem- pos usually range from 115 to 135 BPM. Daft Punk, 4.3 Temporal Features Thomas Bangalter, and Alan Braxe are popular artists in • Number of Bass Onsets, an integer representing the genre. the number of peaks (‘Beats’) detected in a win- Techno uses minimal melodic ornamentation, relying dow; more on bass riffs and polyrhythmic drums layered over a • Average Inter-onset Time, a basic feature to de- common four-on-the-flour kick drum. The rhythmic ele- scribe the periodicity of the beats across a win- ments in techno are often the defining features of the dow. song, with percussive grooves and riffs taking precedence over more traditional melodic and harmonic structure. Significant artists include Derrick May, Richie Hawtin, 4.4 Classification and Robert Hood. Four separate classifiers were run on all six classes, and DnB makes heavy use of “break beat chopping,”—the also on a smaller set of four classes. All experiments were re-sequencing of drum hits from other previously re- performed utilizing a 10-fold cross-validation method in corded material. DnB is often composed above 160 BPM, the Weka machine learning environment [15]. with characteristic bass lines moving at half the tempo. A k-Nearest Neighbour classifier (IBk) gave the best Goldie and Pendulum are both well-known artists. overall result, resting at a 75.2% classification rate across Trance distinguishes itself by employing thick, com- the six classes (16.7% baseline accuracy).
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages5 Page
-
File Size-