Modeling Rhythm Similarity for Electronic Dance Music

Modeling Rhythm Similarity for Electronic Dance Music

15th International Society for Music Information Retrieval Conference (ISMIR 2014) MODELING RHYTHM SIMILARITY FOR ELECTRONIC DANCE MUSIC Maria Panteli Niels Bogaards Aline Honingh University of Amsterdam, Elephantcandy, University of Amsterdam, Amsterdam, Netherlands Amsterdam, Netherlands Amsterdam, Netherlands [email protected] [email protected] [email protected] Rhythm&in&Musical&& Attack&Positions& Most&Common&Instrumental& ABSTRACT Notation& of&Rhythm& Associations& & 1/5/9/13& Bass&drum& 5/13& Snare&drum;&handclaps& A model for rhythm similarity in electronic dance music HiEhat&(open&or&closed);&also& 3/7/11/15& (EDM) is presented in this paper. Rhythm in EDM is built snare&drum&or&synth&“stabs”& All& HiEhat&(closed)& on the concept of a ‘loop’, a repeating sequence typically & associated with a four-measure percussive pattern. The Figure 1: Example of a common (even) EDM rhythm [2]. presented model calculates rhythm similarity between seg- ments of EDM in the following steps. 1) Each segment is split in different perceptual rhythmic streams. 2) Each The model focuses on content-based analysis of audio stream is characterized by a number of attributes, most no- recordings. A large and diverse literature deals with the tably: attack phase of onsets, periodicity of rhythmic el- challenges of audio rhythm similarity. These include, a- ements, and metrical distribution. 3) These attributes are mongst other, approaches to onset detection [1], tempo es- combined into one feature vector for every segment, af- timation [9,25], rhythmic representations [15,24], and fea- ter which the similarity between segments can be calcu- ture extraction for automatic rhythmic pattern description lated. The stages of stream splitting, onset detection and and genre classification [5, 12, 20]. Specific to EDM, [4] downbeat detection have been evaluated individually, and study rhythmic and timbre features for automatic genre a listening experiment was conducted to evaluate the over- classification, and [6] investigate temporal and structural all performance of the model with perceptual ratings of features for music generation. rhythm similarity. In this paper, an algorithm for rhythm similarity based on EDM characteristics and perceptual rhythm attributes is 1. INTRODUCTION presented. The methodology for extracting rhythmic ele- ments from an audio segment and a summary of the fea- Music similarity has attracted research from multidisci- tures extracted is provided. The steps of the algorithm are plinary domains including tasks of music information re- evaluated individually. Similarity predictions of the model trieval and music perception and cognition. Especially for are compared to perceptual ratings and further considera- rhythm, studies exist on identifying and quantifying rhythm tions are discussed. properties [16, 18], as well as establishing rhythm similar- ity metrics [12]. In this paper, rhythm similarity is studied with a focus on Electronic Dance Music (EDM), a genre 2. METHODOLOGY with various and distinct rhythms [2]. Structural changes in an EDM track typically consist of EDM is an umbrella term consisting of the ‘four on an evolution of timbre and rhythm as opposed to a verse- the floor’ genres such as techno, house, trance, and the chorus division. Segmentation is firstly performed to split ‘breakbeat-driven’ genres such as jungle, drum ‘n’ bass, the signal into meaningful excerpts. The algorithm devel- breaks etc. In general, four on the floor genres are charac- oped in [21] is used, which segments the audio signal based terized by a four-beat steady bass-drum pattern whereas on timbre features (since timbre is important in EDM struc- breakbeat-driven exploit irregularity by emphasizing the ture [2]) and musical heuristics. metrically weak locations [2]. However, rhythm in EDM EDM rhythm is expressed via the ‘loop’, a repeating exhibits multiple types of subtle variations and embellish- pattern associated with a particular (often percussive) in- ments. The goal of the present study is to develop a rhythm strument or instruments [2]. Rhythm information can be similarity model that captures these embellishments and al- extracted by evaluating characteristics of the loop: First, lows for a fine inter-song rhythm similarity. the rhythmic pattern is often presented as a combination of instrument sounds (eg. Figure 1), thus exhibiting a certain ‘rhythm polyphony’ [3]. To analyze this, the signal is split c Maria Panteli, Niels Bogaards, Aline Honingh. Licensed under a Creative Commons Attribution 4.0 International Li- into the so-called rhythmic streams. Then, to describe the cense (CC BY 4.0). Attribution: Maria Panteli, Niels Bogaards, Aline underlying rhythm, features are extracted for each stream Honingh. “Modeling rhythm similarity for electronic dance music”, 15th based on three attributes: a) The attack phase of the on- International Society for Music Information Retrieval Conference, 2014. sets is considered to describe if the pattern is performed on 537 15th International Society for Music Information Retrieval Conference (ISMIR 2014) segmentation feature extraction rhythmic onset attack metrical periodicity feature streams detection characterization metricaldistribution similarity distribution vector detection feature extraction stream # 1 stream # 2 stream # 3 Figure 2: Overview of methodology. percussive or non-percussive instruments. Although this locations P of the novelty curve define the number of the is typically viewed as a timbre attribute, the percussive- bark band that marks the beginning of a new stream, i.e., if ness of a sound is expected to influence the perception P = p 1,...,24 i =1,...,I for total number of { i 2{ }| } of rhythm [16]. b) The repetition of rhythmic sequences peaks I, then stream Si consists of bark bands b given by, of the pattern are described by evaluating characteristics b b [pi,pi+1 1] for i =1,...,I 1 of different levels of onsets’ periodicity. c) The metrical Si = { | 2 − } − b b [p , 24] i = I. structure of the pattern is characterized via features ex- I for ⇢ { | 2 } (1) tracted from the metrical profile [24] of onsets. Based on An upper limit of 6 streams is considered based on the ap- the above, a feature vector is extracted for each segment proach of [22] that uses a total of 6 bands for onset detec- and is used to measure rhythm similarity. Inter-segment tion and [14] that suggests a total of three or four bands for similarity is evaluated with perceptual ratings collected via meter analysis. a specifically designed experiment. An overview of the The notion of rhythmic stream here is similar to the no- methodology is shown in Figure 2 and details for each step tion of ‘accent band’ in [14] with the difference that each are provided in the sections below. Part of the algorithm is rhythmic stream is formed on a variable number of adja- implemented using the MIRToolbox [17]. cent bark bands. Detecting a rhythmic stream does not necessarily imply separating the instruments, since if two 2.1 Rhythmic Streams instruments play the same rhythm they should be grouped to the same rhythmic stream. The proposed approach does Several instruments contribute to the rhythmic pattern of not distinguish instruments that lie in the same bark band. an EDM track. Most typical examples include combina- The advantage is that the number of streams and the fre- tions of bass drum, snare and hi-hat (eg. Figure 1). This quency range for each stream do not need to be predeter- is mainly a functional rather than a strictly instrumental di- mined but are rather estimated from the spectral represen- vision, and in EDM one finds various instrument sounds tation of each song. This benefits the analysis of electronic to take the role of bass, snare and hi-hat. In describing dance music by not imposing any constraints on the possi- rhythm, it is essential to distinguish between these sources ble instrument sounds that contribute to the characteristic since each contributes differently to rhythm perception [11]. rhythmic pattern. Following this, [15, 24] describe rhythmic patterns of latin dance music in two prefixed frequency bands (low and 2.1.1 Onset Detection high frequencies), and [9] represents drum patterns as two To extract onset candidates, the loudness envelope per bark components, the bass and snare drum pattern, calculated band and its derivative are normalized and summed with via non-negative matrix factorization of the spectrogram. more weight on loudness than its derivative, i.e., In [20], rhythmic events are split based on their perceived loudness and brightness, where the latter is defined as a Ob(n)=(1 λ)Nb(n)+λNb0(n) (2) function of the spectral centroid. − In the current study, rhythmic streams are extracted with where Nb is the normalized loudness envelope Lb, Nb0 the respect to the frequency domain and loudness pattern. In normalized derivative of Lb, n =1,...,N the frame num- particular, the Short Time Fourier Transform of the sig- ber for a total of N frames, and λ<0.5 the weighting fac- nal is computed and logarithmic magnitude spectra are as- tor. This is similar to the approach described by Equation signed to bark bands, resulting into a total of 24 bands for 3 in [14] with reduced λ, and is computed prior summation a 44.1 kHz sampling rate. Synchronous masking is mod- to the different streams as suggested in [14,22]. Onsets are eled using the spreading function of [23], and temporal detected via peak extraction within each stream, where the masking is modeled with a smoothing window of 50 ms. (rhythmic) content of stream i is defined as This representation is hereafter referred to as loudness en- Ri =⌃b Si Ob (3) velope and denoted by Lb for bark bands b =1,...,24.A 2 self-similarity matrix is computed from this 24-band rep- with Si as in Equation 1 and Ob as in Equation 2. This resentation indicating the bands that exhibit similar loud- onset detection approach incorporates similar methodolog- ness pattern.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    6 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us