

Rhythm of Motion Extraction and Rhythm-Based Cross-Media Alignment for Videos

Wei-Ta Chu, Member, IEEE, and Shang-Yin Tsai

Abstract—We present how to extract rhythm information in dance videos and music, and accordingly correlate them based on rhythmic representation. From the dancer's movement, we construct motion trajectories, detect turnings and stops of trajectories, and then estimate rhythm of motion (ROM). For music, beats are detected to describe rhythm of music. The two modalities are therefore represented as sequences of rhythm information to facilitate finding cross-media correspondence. Two applications, i.e., background music replacement and music video generation, are developed to demonstrate the practicality of cross-media correspondence. We evaluate the performance of ROM extraction, and conduct subjective/objective evaluations to show that a rich browsing experience can be provided by the proposed applications.

Index Terms—Rhythm of motion, motion trajectory, music beat, background music replacement, music video generation.

I. INTRODUCTION

WHEN listening to music, people spontaneously tap their fingers or feet according to the music's periodic structure. Dancing with music is human nature, used to express the meaning of music or to show people's emotions. In recent years, hip-hop culture has driven the development of street dance, and learning to dance has deeply attracted young people. Due to the popularity of street dance and the ease of video capturing, many dancers record their performances and share them on the web. However, the quality of these videos, especially the audio tracks accompanying them, is generally low. Moreover, to promote dance competitions or TV shows, music videos are elaborately produced by experts who have ample knowledge of choreography, music rhythm, and video editing. It is never an easy task for amateur dancers who want to share or preserve their performances for entertainment or education purposes.

In this paper, we investigate how rhythm information can be found and utilized in street dance videos. From the visual track, periodic motion changes of the dancer's movement are extracted, which constitute "rhythm of motion" (ROM). From music, rhythm is constructed based on periodic properties of music beats. After extracting rhythm information from the two modalities, cross-media correspondence is determined to facilitate replacing the background music of a dance video with a high-quality music piece. In addition, music videos can be generated by concatenating multiple dance video clips with similar ROMs.

The concept "rhythm" describes patterns of changes in various disciplines. In music, beat refers to a perceived pulse marking off equal durational units [7], and is the basis with which we compare or measure rhythmic durations [9]. Tempo refers to the rate at which beats strike, and meter describes the accent structure on beats. These parameters jointly determine how we perceive music rhythm. In contrast to the long history of music cognition study, analyzing rhythm of motion in videos is just at its infant stage. We focus on extracting motion beats from videos, which play an essential role in constituting ROM. To simplify description, we interchangeably use "rhythm" and "beats" in this paper.

Contributions of this work are summarized in Figure 1 and are described as follows.
• ROM extraction: By tracking distinctive feature points on the human body, motion trajectories are constructed and transformed into time-varying signals, which are then analyzed to extract ROM. ROM represents periodic motion changes, such as "turning" and "stop" of trajectories.
• Music beat detection and segmentation: By integrating energy dynamics in different frequency bands, music beats are detected. Periodically evolving beats are then used to describe the rhythm of music.
• Rhythm-based cross-media alignment: The two rhythm sequences are compared, and an appropriate correspondence between them is determined.
• Applications: Based on rhythm-based cross-media alignment, background music replacement and music video generation are developed, which demonstrate the practicality of rhythm-based multimodal analysis.

The rest of this paper is organized as follows. Section II provides a survey on rhythm analysis in music and video, and introduces a related area derived from musicology. ROM extraction is described in Section III. Section IV first shows how we find the rhythm of music, and then rhythm-based correspondence is determined to conduct background music replacement. Automatic music video generation is described in Section V. Section VI reports evaluation results and discussions, followed by concluding remarks in Section VII.

Wei-Ta Chu and Shang-Yin Tsai are with National Chung Cheng University, Chiayi, Taiwan (e-mail: [email protected], [email protected]).

Figure 1. Framework of the proposed system. (Block diagram: input videos feed ROM extraction — trajectory construction, motion beat candidate detection, and rhythm estimation and filtering — in parallel with music beat detection and segmentation; the two rhythm sequences feed rhythm-based cross-media alignment, which drives background music replacement and music video generation.)

II. RELATED WORKS

A. Motion Analysis in Videos

Research about motion analysis mainly focuses on three factors: motion magnitude (or moving speed, motion activity), motion direction, and motion trajectory. Shiratori et al. [28] detect changes of moving speed in traditional Japanese dances, and then segment dance videos into a series of basic patterns. Denman et al. [5] detect temporal discontinuities by extracting local minimums of motion magnitudes. Motion analysis is conducted for the same object part in neighboring frames. Based on motion trajectories, Su et al. [29] develop a framework for motion flow (i.e., motion trajectory in our work) construction, which is then adopted to conduct video retrieval. This work only extracts a single motion flow to represent video content. With feature point detection and motion prediction, the work proposed in [21] constructs multiple trajectories in dance videos based on a "color-optical flow" method, which jointly considers motion and color information to facilitate motion tracking. Based on the extracted dance patterns, this system segments dance videos automatically.

Despite rich studies on motion analysis, the information that constitutes ROM is not only motion magnitude or absolute/relative moving direction, but also the periodicity of substantial motion changes. To extract the implicit rhythm derived from human body movement, we need finer motion analysis for body parts with complex dancing steps. For example, for a specific music rhythm, a dancer may move his left hand up and right hand down, followed by jumping at the instant a music beat strikes. For the same music rhythm, a different dancer may squat, followed by twisting his body at the instant the music beat strikes. They have different moving patterns, but we can easily sense that they move according to the same music rhythm.

We have to emphasize that ROM is derived not only from "periodic motion," but also from "periodic changes of motion." According to [4], the motion of a point is periodic if it repeats itself with a constant period, e.g., an object like a pendulum goes back and forth periodically, or an object cyclically moves around a circle. However, ROM in dance mainly comes from "periodic changes of motion," such as periodic characteristics of turning, twisting, jumping, or stopping. A dancer's movement does not necessarily repeat, but we still perceive that he/she follows an implicit periodicity to make movement changes.

Relatively fewer works have been done on periodic motion analysis. Denman et al. [5] explore the use of object-based motion to detect specific events in observational psychology. Specific moving patterns are detected, but rhythm information from motion is not specially studied. Based on videos captured in light-controlled environments, Guedes calculates luminance changes of pixels in consecutive frames [15], which indicate motion magnitude between frames. The evolution of motion magnitude is then transformed into the frequency domain, and the dominant frequency component is detected by a pitch tracking technique. Our system detects periodic changes of motion by a method similar to Guedes's. However, in our case, dance videos were captured in uncontrolled environments, and varied luminance changes hurt Guedes's approach. Cutler and Davis [4] compute an object's self-similarity as it evolves in time, and then apply time-frequency analysis to detect periodic motion. Laptev et al. [18] view periodic motion subsequences as the same sequence captured by multiple cameras. Periodic motion is thus detected and segmented by approximate sequence matching algorithms. Both [4] and [18] assume that the orientation and size of objects do not change significantly, and they analyze how objects repeat themselves. However, in dance videos, ROM does not necessarily come from motion repetition, and different body parts are not guaranteed to have consistent moving orientation and object size.

Kim et al. [17] provide us a hint to extract ROM from motion data. They detect rapid directional changes on joints, and then transform this information into motion signals. The power spectrum density of the signals is then analyzed to estimate the dominant period. This systematic approach is suitable for our case. However, motion data in [17] were explicitly captured from sensors. We focus on ROM from real dance videos. Estimating periodicity from noisy motion data is more challenging.

B. Audio to Video Matching

Associating videos with music has been viewed as a good way to enrich presentation. Foote et al. [8] propose one of the earliest works on automatically generating music videos. Audio clips are segmented based on significant audio changes, and videos are segmented based on camera motion and exposure. Video clips are then adjusted to align with audio to generate a music video. Also for home videos, Hua et al. [16] discover repetitive patterns of music and estimate attention values from video shots, and then combine the two media to generate music videos. Wang et al. [31] extend this idea to generate music videos for sports videos. Events in sports videos are first detected, and two schemes (video-centric and music-centric) can be used to integrate the two media. Yoon et al. [33] transform video and music into feature curves, and then apply a dynamic programming strategy to match these two modalities. To tackle the length difference between music and video, they adopt a music graph to elaborately scale music such that video-music

synchronization can be guaranteed. Recently, Yoon et al. [32] align music with arbitrary videos by using features in a multi-level way.

Generally, these works first segment videos and music into segments, extract features from the segments, and then match the two sequences of segments to generate final results. Videos are segmented based on color [16], events [31], camera motion and brightness [8], or shape [32]. These features characterize global information in video frames, and object-based information, e.g., object motion, may be overlooked. The work in [33] considers object motion and constructs feature curves for videos. However, few discussions were made about integrating local motion from multiple parts, and the idea of periodic motion or periodic changes of motion was not mentioned.

Finding the association between video and audio (music) is a crucial step for audiovisual applications. Recently, Feng et al. [35] propose a probabilistic framework to model the correlation between video and audio, and automatically generate background music for home videos. Lee's group investigates the association between music and animation [36], or between music and video [37]. A directed graph is constructed and traversed to generate background music that fits the targeted animation. In fact, exploiting multimodal association to generate background music has been studied for a long time. An earlier idea can be found in [38].

C. Embodied Music Cognition

Most computer scientists separately detect rhythm information from music and video, and then synchronize them to generate an audiovisual presentation. In fact, a branch of musicology, embodied music cognition [19], which investigates the role of the human body in relation to music activities, has been studied for years. The human body can be seen as a mediator that transfers physical energy to represent musical intentions, meanings, or signification. People move when listening to music, and through movement, people give meaning to music. This is exactly what dancers do in their performances. We provide a brief survey of this field in the following.

Leman's book [19] provides a great introduction to embodied music cognition, and provides a framework for engineers, psychologists, brain scientists, and musicologists to contribute to this field. More specifically, the EyesWeb project focuses on understanding affective and expressive content of human gesture [3]. The developed system analyzes body movement and gesture to facilitate controlling sound, music, and visual media. Similarly, Godøy [10] investigates relationships between musical imagery and gesture imagery. As it is an ongoing research field, Godøy describes ideas, needs, and research challenges to link music cognition with body movement. Currently, researchers in this field have started to use signal processing techniques to demonstrate that different parts of the body often synchronize with music at different metrical levels [30]. The latest results suggest that the metric structure of music is encoded in body movements. For computer scientists, the studies mentioned above open another window to discover the rhythmic relationship between music and motion.

III. RHYTHM OF MOTION

A. Overview of ROM

Objects may move forward and backward periodically, move along the same trajectory periodically, or stop/turn according to some implicit tempo. In dance videos, ROM is a clue about how a dancer interprets a music piece. Figure 2 shows an example of rhythm of motion. The dancer stands up with his hand moving down from frames 0 to 10, squats down with his hand moving up from frames 10 to 20, and repeats the same action (almost) periodically. Note that the human body gives rise to non-rigid motion, with different parts moving in different directions with different magnitudes. However, we can still realize that the dancer has periodic changes of motion. The implicit period thus forms the rhythm of motion.

Different dancers may have different interpretations of the same music, and they may not completely move with the rhythm of music. Fortunately, most dancers have a common consensus about how and when to move their bodies. Therefore, dance videos with the same background music may consist of similar but not completely identical ROMs. Dancers usually divide the music into segments of "eight beats," and then design dancing steps for each segment [39]. Although different dancers have varied styles of poses or body movement, they make emphasized stops or turnings at the boundaries of eight-beat segments. This characteristic makes us capable of estimating the dominant period of emphasized motion stops/turnings.

Figure 2. An example of rhythm of motion. (A sequence of snapshots at frames 0, 5, 10, ..., 55.)

B. Motion Trajectory

To extract motion trajectories, we only consider motion on feature points rather than all pixels in video frames. Motion predicted from feature points effectively represents video content and decreases interference from background noise. Although our work is not limited to any specific feature detection method, we adopt the Shi-Tomasi (ST) corner detector [27], because it is shown to be robust under affine transformation and can be implemented easily. We apply the Pyramid Lucas-Kanade (PLK) optical flow detection method [2] to predict motion at various scales.

The moving direction of a feature point from frame $t$ to frame $t+1$ is estimated by

$$\vec{v}_t = \hat{p}_{t+1} - p_t, \qquad \hat{p}_{t+1} = f(p_t), \tag{1}$$

where $p_t$ denotes the position of the feature point at frame $t$, $\hat{p}_{t+1}$ denotes the estimated position of the feature point at frame $t+1$, and $f(\cdot)$ denotes the estimation function implemented by PLK optical flow.

To construct trajectories, we need to appropriately connect feature points in temporally adjacent frames. Motion and color information of feature points in neighboring frames are checked.
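To make the trajectory construction concrete, the following is a minimal sketch using OpenCV, assuming Shi-Tomasi corners tracked by pyramidal Lucas-Kanade. It relies directly on PLK's own correspondences and omits the HSV-histogram verification of eqns. (2)-(3) described next; all parameter values are illustrative rather than the authors' settings.

```python
import cv2

def track_feature_points(video_path, max_corners=200, min_len=15):
    """Construct feature-based motion trajectories: Shi-Tomasi corners [27]
    tracked frame-to-frame by pyramidal Lucas-Kanade optical flow [2].
    Parameter values are illustrative, not the authors' settings."""
    cap = cv2.VideoCapture(video_path)
    ok, frame = cap.read()
    prev_gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    # Shi-Tomasi corner detection
    pts = cv2.goodFeaturesToTrack(prev_gray, maxCorners=max_corners,
                                  qualityLevel=0.01, minDistance=7)
    trajectories = [[tuple(p.ravel())] for p in pts]
    alive = [True] * len(pts)
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        # PLK prediction of each feature point's next position (eqn. (1))
        nxt, status, _ = cv2.calcOpticalFlowPyrLK(prev_gray, gray, pts, None)
        for i, (p, st) in enumerate(zip(nxt, status)):
            if alive[i] and st:
                trajectories[i].append(tuple(p.ravel()))
            else:
                alive[i] = False  # a trajectory ends once tracking fails
        pts, prev_gray = nxt, gray
    cap.release()
    # drop short trajectory segments caused by noisy feature points
    return [t for t in trajectories if len(t) >= min_len]
```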

For the feature point $p_t$ at frame $t$, we find the most appropriate feature point at frame $t+1$ by

$$p_{t+1} = \arg\min_{q \in N(\hat{p}_{t+1})} d(p_t, q), \tag{2}$$

where $N(\hat{p}_{t+1})$ denotes the neighborhood of the estimated location $\hat{p}_{t+1}$. The neighborhood region is defined as the set of pixels in the circle centered at $\hat{p}_{t+1}$ with radius $r$. The distance is defined as

$$d(a, b) = \| H_a - H_b \|, \tag{3}$$

where $H_a$ and $H_b$ are HSV color histograms of the image patches centered at $a$ and $b$, respectively. The values of hue, saturation, and value are each quantized into 8 bins. By this process, we construct feature-based trajectories. If a feature point at frame $t+1$ can be connected to multiple feature points at frame $t$, only the feature point having the minimum distance is selected. In addition, to filter out short trajectory segments caused by noisy feature points, we eliminate motion trajectories shorter than a predefined threshold.

Figure 3 shows examples of motion trajectories in the same video sequence but constructed based on different feature points. Figures 3(a), 3(b), and 3(c) are correctly extracted motion trajectories, and Figure 3(d) is a falsely extracted motion trajectory. We can roughly see periodic properties of the trajectories in Figures 3(a), 3(b), and 3(c).

Figure 3. Examples of constructed motion trajectories based on different feature points. (Panels (a)-(c): correctly extracted trajectories; panel (d): a falsely extracted trajectory.)

C. Motion Beat Candidate Detection

Based on the extracted trajectories, we detect candidates of motion beats for ROM extraction. A motion trajectory is denoted by $T = \{(x_s, y_s), \ldots, (x_e, y_e)\}$, where $s$ denotes the frame number at which $T$ starts, $e$ the frame number at which it ends, and $(x_t, y_t)$ is the xy-coordinate of the feature point at frame $t$. We detect stops and turns of motion trajectories as motion beat candidates, which can be described by substantial changes of motion magnitude and moving direction.

To alleviate the influence of trajectory extraction noise, motion estimation errors are assumed to be Gaussian distributed [1], and we conduct low-pass filtering by convolving motion trajectories with a Gaussian kernel function:

$$g(u) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\!\left(-\frac{u^2}{2\sigma^2}\right), \tag{4}$$

where $\sigma$ is the standard deviation controlling smoothness, and $u$ denotes the difference (in terms of frame number) from an arbitrary frame to the frame centered by the Gaussian. The horizontal movement data is filtered as

$$\tilde{x}_t = \sum_{u} g(u)\, x_{t+u}, \tag{5}$$

where $\tilde{x}_t$ denotes the filtered horizontal displacement at frame $t$. The vertical displacement is filtered in the same way. After filtering, the motion trajectory is smoother, and we are then able to detect stops and turns more precisely.

A stop action is often a joint of movements. A dancer may move his hand in some direction, stop when a music beat strikes, and later move reversely. A stop action in dance videos represents either that the movement has completely ended, or just a temporary stop which serves as the start of another movement. To detect stops of a motion trajectory, we examine the evolution of the motion magnitude, $\{m_s, m_{s+1}, \ldots, m_{e-1}\}$, where $m_t$ denotes the magnitude of the motion from frame $t$ to frame $t+1$, i.e., $m_t = \|(\tilde{x}_{t+1} - \tilde{x}_t,\ \tilde{y}_{t+1} - \tilde{y}_t)\|$. Magnitude decreases when movement decelerates, and a local minimum occurs at the moment of a stop. In this work, we detect local minimums of the magnitude history based on a modified hill climbing algorithm [11].

There may be many stop points in a motion trajectory. To detect every local minimum, we modify the hill climbing algorithm as in Algorithm 1. If the magnitude of the $j$-th frame in the neighborhood of the current frame (indexed by $i$) is smaller than $m_i$, we replace $i$ by $j$. This procedure repeats until $m_i$ is the smallest within the neighborhood. The neighborhood of the index $i$ is defined as $\{i+1, \ldots, i+K\}$. The value $K$ is set to 7 in our work, i.e., only the seven temporally adjacent frames following the $i$-th frame are checked. After a local minimum is found, we again adopt hill climbing to find the local maximum, which serves as the start for finding the next local minimum. This process repeats until the whole magnitude history is checked. Finally, the set of local minimums is viewed as motion beat candidates.

To find trajectory turnings, we analyze the evolution of motion orientation. The orientation history is denoted as $O = \{\vec{o}_s, \vec{o}_{s+1}, \ldots, \vec{o}_{e-1}\}$, where $\vec{o}_t$ is the motion vector from frame $t$ to frame $t+1$, represented in unit vector form, i.e., $\vec{o}_t = \vec{v}_t / \|\vec{v}_t\|$. Based on this information, we design a method shown in Algorithm 2 to find turnings in a trajectory. When the trajectory keeps moving in the same direction at frames $i$ and $j$, the inner product of $\vec{o}_i$ and $\vec{o}_j$ (denoted as $\langle \vec{o}_i, \vec{o}_j \rangle$) would be close to 1. On the other hand, when the trajectory turns, the value of the inner product decreases or even reverses. Therefore, we accumulate inner products between motion vectors in a sequence of frames, and then find the turning points by checking the average value of the accumulated inner products (lines 7 to 11 in Algorithm 2). If the average value is less than a threshold $\theta$, we find the instant at which the average value of the accumulated inner product changes the most (line 12). This instant is stored and is then updated as the next starting point. This process repeats until the whole orientation history is checked. The set of turning points is also viewed as motion beat candidates.
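A compact sketch of the stop and turning detection is given below, assuming per-trajectory processing with NumPy/SciPy. The simple look-ahead minima and windowed inner-product tests are stand-ins for the full hill-climbing logic of Algorithms 1 and 2, and sigma, theta, and K are illustrative values.

```python
import numpy as np
from scipy.ndimage import gaussian_filter1d

def motion_beat_candidates(traj, sigma=2.0, theta=0.6, K=7):
    """Stop and turning candidates from one trajectory (Section III.C).
    sigma, theta, and K are illustrative values."""
    xy = gaussian_filter1d(np.asarray(traj, dtype=float), sigma, axis=0)  # eqns (4)-(5)
    v = np.diff(xy, axis=0)                      # per-frame motion vectors
    mag = np.linalg.norm(v, axis=1)              # magnitude history m_t
    # stops: frames minimal within a K-frame look-ahead window,
    # a simple stand-in for the modified hill climbing of Algorithm 1
    stops = [t for t in range(len(mag))
             if mag[t] == mag[t:min(t + K + 1, len(mag))].min()
             and (t == 0 or mag[t] < mag[t - 1])]
    # turnings: the windowed average of inner products of unit motion
    # vectors drops below theta (cf. Algorithm 2)
    o = v / np.maximum(np.linalg.norm(v, axis=1, keepdims=True), 1e-9)
    dots = np.einsum('ij,ij->i', o[:-1], o[1:])  # <o_t, o_{t+1}>
    turns = [t + 1 for t in range(len(dots) - K + 1)
             if dots[t:t + K].mean() < theta]
    return sorted(set(stops) | set(turns))
```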

D. Rhythm Estimation and Filtering

In this section, we use the scheme proposed in [17] for motion beat refinement and dominant period estimation. Note that not every detected turning point or stop point is truly a motion beat. Therefore, the scheme first finds the dominant period from the motion beat candidates, and accordingly estimates the reference beats. Guided by the reference beats, we estimate actual motion beats by finding the candidate beats that have small temporal differences to the reference beats.

Single trajectory: To predict the dominant period from motion beat candidates, we estimate the pulse repetition interval (PRI) from a signal generated based on the time instants of beat strikes [24]. This method is computationally tractable and is robust to trajectory extraction errors. From a motion trajectory, a motion beat sequence is denoted as $B = \{b_1, b_2, \ldots, b_N\}$, where $b_k$ is the timestamp (in terms of frame number) of the $k$-th motion beat candidate. We can model the generation of these motion beats as

$$b_k = \phi + kT + \epsilon_k, \tag{6}$$

where $T$ is the unknown period, $\phi$ is a shift ranging in the interval $[0, T)$, $\epsilon_k$ is noise caused by the dancer or the beat detection module and is assumed to lie in a small interval around zero, and $k$ is a positive number indicating the index of the beat. The reference motion beats can be modeled as $\{\phi + kT\}$, which represents the periodic appearance of actual motion beats. With this model, we would find $T$ and $\phi$ for reference beat estimation.

Figure 4 shows how we estimate reference beats based on motion beat candidates. First, we transform the sequence of motion beat candidates into a continuous-time signal as

$$s(t) = \sum_{k=1}^{N} w(t - b_k), \qquad w(u) = \begin{cases} \cos\!\left(\dfrac{\pi u}{2h}\right), & |u| \le h, \\[4pt] 0, & \text{otherwise}, \end{cases} \tag{7}$$

where $h$ is the half-width of the cosine window. This signal is maximized when a motion beat candidate appears, i.e., when $t = b_k$. When $t$ is located between two motion beat candidates, the value of $s(t)$ is determined by a cosine function. For each beat candidate $b_k$, a cosine centered at $b_k$ is applied, and all sinusoids generated from beat candidates are accumulated to generate the signal $s(t)$, as shown in the second row of Figure 4. Based on $s(t)$, we estimate the dominant period by calculating the maximum of the power spectrum density (PSD) [23]. This process calculates the energy of the accumulated sinusoid in different frequency bands. According to the Nyquist sampling theorem, the maximum frequency able to be detected is half of the sampling rate. Fortunately, we can reasonably assume that the frequency of motion beats is lower than half of the frame rate (30 fps), because the human body hardly moves that fast. We calculate the PSD by

$$P(f) = \frac{1}{L} \left| \sum_{t=0}^{L-1} s(t)\, e^{-j 2\pi f t / L} \right|^2, \tag{8}$$

where $L$ is the length of the accumulated sinusoid, and $f$ is the index of a frequency band. The dominant frequency $f^\ast$ is the frequency that gives the maximal $P(f)$:

$$f^\ast = \arg\max_{f} P(f). \tag{9}$$

The dominant period $T = L / f^\ast$ implies that most motion beats periodically appear at multiples of $T$. We then estimate $\phi$ by finding the shift that causes the maximal sum of periodic positive peaks:

$$\phi^\ast = \arg\max_{\phi} \sum_{k} s(\phi + kT), \tag{10}$$

where $\phi$ is in the interval $[0, T)$.

Figure 4. Reference beat estimation based on motion beat candidates. (Top: motion beat candidates on the time axis; middle: the accumulated sinusoid; bottom: the estimated reference beats.)
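The single-trajectory procedure can be sketched as follows, assuming frame-indexed candidates and NumPy. The cosine half-width h and the discrete searches follow the reconstruction of eqns. (7)-(10) above, so treat the details as illustrative.

```python
import numpy as np

def reference_beats(candidates, length, h=3):
    """Estimate the dominant period T and shift phi from motion beat
    candidates, following eqns (7)-(10); h is an illustrative value."""
    t = np.arange(length)
    s = np.zeros(length)
    for b in candidates:                        # eqn (7): a cosine bump per candidate
        u = t - b
        s += np.where(np.abs(u) <= h, np.cos(np.pi * u / (2 * h)), 0.0)
    psd = np.abs(np.fft.rfft(s)) ** 2 / length  # eqn (8)
    f = int(np.argmax(psd[1:])) + 1             # eqn (9), skipping the DC bin
    T = length / f                              # dominant period in frames
    step = max(1, int(round(T)))
    # eqn (10): the shift maximizing the sum of periodic samples of s
    phi = max(range(step), key=lambda p: s[p::step].sum())
    refs = [phi + k * T for k in range(int((length - phi) / T) + 1)]
    return refs, T
```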

Multiple trajectories: The aforementioned process is applied to a motion beat sequence derived from a single motion trajectory. To jointly consider multiple beat sequences derived from multiple motion trajectories, we extend the process as illustrated in Figure 5. The idea of this process is similar to extracting the fundamental frequency, or pitch detection, from a signal that is a superposition of sinusoids. This process is often adopted in pitch detection for speech [20] or music. In our case, because different parts of the dancer's body act according to the same rhythm of music, the sinusoids generated from different body parts are nearly harmonically related. Although motion trajectories may have different durations, this process is able to resist variations across different sequences and robustly finds the dominant period.

Based on this idea, we construct an accumulated sinusoid for each trajectory separately, and then superpose the sinusoidal signals $s_1(t), s_2(t), \ldots, s_M(t)$ as a superposed signal, assuming $M$ different motion trajectories. The PSD of the superposed signal is computed as

$$P(f) = \frac{1}{L} \left| \sum_{t=0}^{L-1} \left( \sum_{m=1}^{M} s_m(t) \right) e^{-j 2\pi f t / L} \right|^2, \tag{11}$$

where $L$ is the length of the superposed signal, and $f$ is the index of a frequency band. The dominant period and the phase can be estimated in the same way as in the single-trajectory case.

Figure 5. Estimation of reference beats with multiple motion trajectories. (Per-trajectory accumulated sinusoids are summed, and reference beats are estimated from the superposed signal.)

After estimating reference motion beats, we detect actual motion beats and filter out outliers. The actual motion beats should appear close to reference beats. A beat candidate $b$ is claimed to be in the neighborhood of (an inlier of) a reference beat $r_k = \phi^\ast + kT$ if

$$|b - r_k| \le \delta. \tag{12}$$

The value $\delta$ is a parameter controlling the range of the neighborhood. If $\delta$ is too large, outliers may be included in the final process. If $\delta$ is too small, we may filter out actual motion beats. We will test this parameter in the evaluation section. After removing outliers, we detect actual motion beats by

$$\hat{b}_k = \arg\min_{b \in N_\delta(r_k)} |b - r_k|, \tag{13}$$

where $\hat{b}_k$ is the detected actual motion beat corresponding to reference beat $r_k$, and $N_\delta(r_k)$ is the neighborhood of $r_k$ defined in eqn. (12). The candidate beat that is in the neighborhood of $r_k$ and is closest to $r_k$ is detected as an actual motion beat. If a reference beat has no neighboring candidate beat, no corresponding actual motion beat exists at this moment.

IV. BACKGROUND MUSIC REPLACEMENT

We would like to replace the original audio track of a dance video, which is captured in an uncontrolled environment and is deteriorated by noise, with a higher-quality music piece, which conveys a similar pulse as the original audio track but comes from a CD recording or a high-quality mp3 file. We conduct background music replacement based on the ROM in dance videos and the music beats in the selected music piece.

A. Music Beat Detection

Music beat detection and tracking has been studied over the last decade. Scheirer [25] divides the spectrum into several frequency bands, analyzes energy dynamics in each, and then fuses information from different bands to detect beats. Dixon [6] develops another classical work to automatically extract tempo and beat from music performances. More recently, Oliveira et al. [22] improve Dixon's approach to achieve real-time performance. Beat tracking becomes more challenging for non-percussive music with soft onsets and time-varying tempo. Grosche and Muller [13] propose a mid-level representation to derive musically meaningful tempo and beat information. They also propose a framework to evaluate the consistency of beat tracking results over multiple performances of the same music piece [14]. Covering a wide range of music, Eyben et al. [34] propose one of the state-of-the-art onset detection approaches based on neural networks. Readers who are interested in the relationship between rhythm and mathematical models are referred to [26]. A complete review of rhythm description systems can be found in [12].

Although a more recent approach such as [34] could be applied to analyze music beats, music accompanying street dance often has strong beats, and the typical Scheirer method [25] is used to detect music beats in our work. Energy evolution in each frequency band is extracted, followed by envelope smoothing with a half-Hanning window. We again conduct hill climbing to find peaks in each envelope, and then integrate results in different frequency bands to estimate music beats. Because there are many detection noises, we refine the result by the process described in Section III.D. A sinusoidal function is constructed based on the detected music beats, and the dominant period and time shift of the sinusoid are estimated to determine reference music beats. The actual music beats are detected by finding the ones that are closest to the reference beats.
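A minimal sketch of this Scheirer-style detector is given below, assuming a mono signal x sampled at sr and SciPy. The band layout, smoothing length, and peak threshold are illustrative stand-ins for the settings in [25]; the periodic refinement of Section III.D would then be applied to the returned peak times.

```python
import numpy as np
from scipy.signal import stft, get_window

def music_beats(x, sr, n_bands=6, frame=1024, hop=512):
    """Scheirer-style beat detection sketch [25]: band-wise energy
    dynamics, half-Hanning smoothing, then peak picking.
    All parameter values are illustrative."""
    f, t, Z = stft(x, fs=sr, nperseg=frame, noverlap=frame - hop)
    power = np.abs(Z) ** 2
    # group frequency bins into roughly log-spaced bands
    edges = np.unique(np.logspace(0, np.log10(len(f) - 1),
                                  n_bands + 1).astype(int))
    onset = np.zeros(power.shape[1] - 1)
    for lo, hi in zip(edges[:-1], edges[1:]):
        env = power[lo:hi].sum(axis=0)
        # half-Hanning smoothing of the band energy envelope
        w = get_window('hann', 31)[15:]
        env = np.convolve(env, w / w.sum(), mode='same')
        onset += np.maximum(np.diff(env), 0)   # keep rising energy only
    # simple peak picking (stand-in for the hill-climbing step)
    peaks = [i for i in range(1, len(onset) - 1)
             if onset[i] > onset[i - 1] and onset[i] >= onset[i + 1]
             and onset[i] > onset.mean() + onset.std()]
    return [p * hop / sr for p in peaks]       # beat times in seconds
```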

B. Rhythm-Based Cross-Media Alignment

Based on rhythm information, we would like to determine an appropriate alignment between the two modalities. Motion beats and music beats are respectively represented by binary vectors, denoted by $V = (V_1, V_2, \ldots)$ and $M = (M_1, M_2, \ldots)$, where $V_i = 1$ ($M_j = 1$) indicates a beat at the $i$-th ($j$-th) millisecond of the video (music). Basically, this is an approximate sequence matching problem, which can be solved by widely-known algorithms such as dynamic time warping (DTW). However, given two binary sequences, the DTW algorithm treats the characters 0 and 1 equally and finds the longest common subsequence between $V$ and $M$. In dance videos, dancers only interpret parts of the music beats, and the priorities of 0 and 1 should be different. Although we could design a variant of DTW to handle this problem, we found that the following simple alternative already achieves satisfactory performance.

To simplify description, we assume the duration of the higher-quality music is longer than that of the video, without loss of generality. We also note that motion beats only correspond to parts of the music beats. With these characteristics, we would like to find a music segment that is appropriate to be aligned with the video. The original background music of the video is then replaced by the newly-aligned music segment.

We try different time shifts for the music beat sequence to find the best match between the two sequences. To measure the degree of matching, we define the temporal distance between the $i$-th motion beat, located at time $t_i$, and its closest music beat in the sequence $M$ with shift $\tau$ by

$$d_i(\tau) = \min_{1 \le j \le L_m,\ M_j(\tau) = 1} |t_i - j|, \tag{14}$$

where $L_m$ is the length of the music beat sequence, and $M_j(\tau)$ denotes the value of the $j$-th sample in the sequence $M$ with shift $\tau$.

The degree of match between the two sequences with shift $\tau$ is defined as the ratio of coherence to distance. The coherence value is defined as

$$C(\tau) = \sum_{i} \mathbb{1}\!\left[ d_i(\tau) \le \epsilon \right], \tag{15}$$

which is larger if the temporal distances between motion beats and their closest music beats are smaller ($\epsilon$ is a small tolerance). The difference value is calculated as

$$D(\tau) = \sum_{i} d_i(\tau). \tag{16}$$

These two factors are integrated as the final degree of matching:

$$S(\tau) = \frac{C(\tau)}{D(\tau)}. \tag{17}$$

Finally, we determine the most appropriate shift by

$$\tau^\ast = \arg\max_{\tau} S(\tau). \tag{18}$$

After finding the best shift $\tau^\ast$, the music segment from $\tau^\ast$ to $\tau^\ast$ plus the video's length is used to replace the original background music. For example, if the best shift is 3.8 seconds, and the video clip's length is 28.1 seconds, then the music segment from 3.8 to 31.9 seconds of the selected music piece is used to replace the original background music.

According to eqn. (18), we have at most $L_m - L_v$ possible shifts, where $L_v$ is the length of the motion beat sequence. Given a shift $\tau$, the complexity of calculating the degree of matching (eqn. (17)) is determined by the instructions needed to calculate $C(\tau)$ and $D(\tau)$, and by the comparisons needed to calculate each $d_i(\tau)$ in the worst case. Because both sequences $V$ and $M$ are temporally sorted, to find the closest music beat to the $i$-th motion beat, we just need to search the neighborhood of the $i$-th point in the sequence $M$. Therefore, the number of comparisons for determining $d_i(\tau)$ is much smaller than $L_m$.
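The shift search can be sketched as follows with NumPy; the coherence and difference forms below are plausible readings of the reconstructed eqns. (14)-(17), not the authors' exact definitions, and the 100 ms tolerance is illustrative. Note how np.searchsorted exploits the temporally sorted sequences, as discussed above.

```python
import numpy as np

def best_shift(motion_beats, music_beats, music_len, video_len):
    """Slide the video's motion beats over the music beats (times in ms)
    and score each shift; tolerance and score forms are illustrative."""
    music = np.asarray(sorted(music_beats), dtype=float)
    motion = np.asarray(sorted(motion_beats), dtype=float)
    best, best_tau = -np.inf, 0
    for tau in range(0, int(music_len - video_len) + 1):
        # eqn (14): distance of each motion beat to its closest music beat,
        # found by binary search thanks to the sorted sequences
        idx = np.searchsorted(music, motion + tau)
        left = music[np.clip(idx - 1, 0, len(music) - 1)]
        right = music[np.clip(idx, 0, len(music) - 1)]
        d = np.minimum(np.abs(motion + tau - left),
                       np.abs(motion + tau - right))
        C = np.sum(d < 100)          # eqn (15): beats matched within 100 ms
        D = d.sum() + 1e-9           # eqn (16): accumulated distance
        S = C / D                    # eqn (17): degree of matching
        if S > best:                 # eqn (18): keep the best shift
            best, best_tau = S, tau
    return best_tau
```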
V. MUSIC VIDEO GENERATION

A. Music Segmentation

To generate music videos, we first segment the music and then select suitable video clips for each music segment. By comparing audio frames, a self-similarity matrix is constructed to describe autocorrelation, and the entries on the main diagonal with locally maximal novelty values indicate boundaries between music segments. To calculate novelty values, we convolve the self-similarity matrix with a radially-symmetric Gaussian-tapered checkerboard kernel [8]. Theoretically, if the size of a music segment is $n$, the most appropriate size of the checkerboard kernel is $2n$. Although we do not know the size of music segments, we know that a reasonable music segment often ends at the end of eight beats. With the dominant period determined by the method in Section III.D, we set the size of the checkerboard kernel as twice the length of eight beats. The novelty value of the $i$-th audio frame is then calculated as

$$n(i) = \sum_{p=-K}^{K} \sum_{q=-K}^{K} G(p, q)\, S(i+p,\, i+q), \tag{19}$$

where $S$ is the self-similarity matrix, $G$ denotes the checkerboard kernel, and $K$ is half of the kernel size.

We adopt the hill climbing algorithm again to detect peaks in the sequence of novelty values. These peaks are denoted as $\{c_1, c_2, \ldots\}$, sorted in descending order according to the corresponding novelty values, where $c_k$ denotes the timestamp of the $k$-th peak. To keep representative peaks and avoid too-short music segments, we design Algorithm 3. To define the threshold used in Algorithm 3, we observe music videos produced by professional editors, and set it as twice the length of eight beats. The length of eight beats can be calculated as eight times the dominant period.

B. Video Clip Selection

For every music segment, we select the video clip from the database that has the best degree of matching to it. Assume that a music segment of length $\ell$ is shorter than the video clip. From the video clip, we would like to find a video segment that best matches the music segment. The method of finding the best shift in Section IV.B is again adopted to find a video segment ranging from $\tau^\ast$ to $\tau^\ast + \ell$.

To generate a music video that includes video segments of similar rhythm but from different dancers' performances, we avoid having the same video segment selected by more than one music segment. Algorithm 4 is designed to accomplish music video generation. Assume that there are $P$ music segments and $Q$ videos in the database. For every music segment, we calculate the $S$ values (eqn. (17)) between it and every video. The value $S_{ij}$ denotes the degree of matching between the $i$-th music segment and its most appropriate video segment derived from the $j$-th video. We use a boolean vector to record whether the videos have been selected by a music segment, and a boolean vector to record whether the music segments have selected videos. Algorithm 4 is designed based on the greedy strategy that maximizes the sum of $S$ values over all music segments, as sketched after this section.
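The novelty computation of eqn. (19) can be sketched as follows, assuming one feature vector per audio frame (e.g., a magnitude spectrum) and NumPy; the cosine similarity and the Gaussian taper width are illustrative choices in the spirit of [8].

```python
import numpy as np

def novelty_curve(features, kernel_size):
    """Foote-style novelty [8] from a cosine self-similarity matrix and a
    Gaussian-tapered checkerboard kernel; kernel_size would be twice the
    length of eight beats, following the text."""
    F = features / np.maximum(
        np.linalg.norm(features, axis=1, keepdims=True), 1e-9)
    S = F @ F.T                                  # self-similarity matrix
    K = kernel_size // 2
    u = np.arange(-K, K + 1)
    gauss = np.exp(-0.5 * (u / (0.4 * K)) ** 2)  # radial Gaussian taper
    # +1 on the diagonal quadrants, -1 on the cross quadrants
    checker = np.outer(np.sign(u + 0.5), np.sign(u + 0.5)) \
        * np.outer(gauss, gauss)
    n = np.zeros(len(S))
    for i in range(K, len(S) - K):               # correlate along the diagonal
        n[i] = np.sum(checker * S[i - K:i + K + 1, i - K:i + K + 1])
    return n
```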

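The greedy strategy behind Algorithm 4 can be sketched as follows; the score matrix S and the one-video-per-segment constraint follow the description in Section V.B, while the sort-then-assign order is an illustrative realization of the greedy maximization.

```python
def assemble_music_video(S, n_segments, n_videos):
    """Greedy sketch of Algorithm 4: S[i][j] is the degree of matching
    between music segment i and its best segment from video j; each
    video may serve at most one music segment."""
    pairs = sorted(((S[i][j], i, j) for i in range(n_segments)
                    for j in range(n_videos)), reverse=True)
    seg_done = [False] * n_segments
    vid_used = [False] * n_videos
    assignment = {}
    for score, i, j in pairs:       # take the highest remaining score first
        if not seg_done[i] and not vid_used[j]:
            assignment[i] = j       # music segment i gets a clip from video j
            seg_done[i] = vid_used[j] = True
    return assignment
```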
VI. EXPERIMENTS

A. Evaluation Dataset

Table 1 lists information about the three datasets used in evaluation. The first dataset is captured from two people's dances to six different music pieces, with a relatively simple background (cf. Figure 6(a)). They just perform simple periodic movements, serving as the reference dataset for evaluating ROM extraction. Videos in the first two datasets were captured from dancers in the street dance club of our university. Each of them has taken at least two years of dancing training. The second dataset includes eleven different dancers' performances, and was captured in a much more cluttered environment, as shown in Figure 6(b). To five music pieces, these dancers perform in their preferred styles (hip-hop, popping, locking, or freestyle) and dance for 30 to 40 seconds. The numbers of different types of dances are listed in Table 2. Different from the first two datasets, the third dataset includes clips downloaded from the web and is much more challenging (Figure 6(c)). Multiple professional dancers dance in cluttered environments, and some of them dance for more than one minute. All videos in the evaluation datasets are coded as MPEG-4 videos. These datasets and the experimental results described in the following are available on our website: http://www.cs.ccu.edu.tw/~wtchu/projects/ROM/index.html.

Extracting rhythm information from these videos is very challenging. We see apparent and time-varying shadows in Figure 6(a). In Figure 6(b), dancers may have different scales of motion, and motion may appear anywhere on the screen. In the third dataset, not all dancers move accurately with the music beats, and different dancers may have different dancing steps. The quality of videos in the third dataset is not as good as that of the others. Moreover, some global motion caused by camera movement can be seen in both the second and the third datasets.

To verify the motivation for background music replacement, we exploit the package developed in [40] to assess the quality of background music in the second dataset, in terms of the average perceptual similarity measure (PSM) [40]. The PSM value ranges from 0 to 1, and a higher value indicates larger correlation between the original signal and the degraded version. In the experiments of [40], six audio signals used for evaluating low bit-rate audio codecs by ITU and MPEG have PSM values ranging from 0.88 to 1. In our case, the average PSM value of the background music is 0.68. Comparing these two cases, we see that the quality of the background music is significantly downgraded, and thus replacing it with higher-quality music would be valuable.

Table 1. Information of evaluation datasets.

| | 1st dataset | 2nd dataset | 3rd dataset |
|---|---|---|---|
| # video clips | 30 | 50 | 13 |
| Average length | 11 sec | 35 sec | 1 min 23 sec |
| Multiple dancers | No | No | Yes |
| Cluttered background | No | Yes | Yes |
| Dancing types | Simple periodic movement | Hip-Hop, Popping, Locking, and Freestyle | Hip-Hop, Popping, Locking, and Freestyle |
| Dancers | Dancers from street dance club | Dancers from street dance club | Professional dancers |
| Source | Capturing | Capturing | Downloaded from the web |

Figure 6. Snapshots of (a) the first, (b) the second, and (c) the third evaluation datasets.

B. Performance of ROM Extraction

A detected motion beat is claimed to be correctly detected if the temporal distance between it and a ground-truth beat is less than two video frames, i.e., about 0.067 seconds in 30-fps videos. Ground truths of motion beats were manually defined frame by frame by the second author, who had taken dancing training for years. We calculate the average accuracy of motion beat detection for the 30

video clips in the first dataset, with various settings of the following parameters: 1) the radius $r$ defining the neighborhood in eqn. (2); 2) the degree of smoothness controlled by $\sigma$ in eqn. (4); 3) the threshold $\theta$ in Algorithm 2 for detecting turning points in trajectories; and 4) the parameter $\delta$ in eqn. (12) for filtering out outliers in motion beat candidates.

Figure 7 shows performance in terms of precision, recall, and F-measure. From Figure 7(a), we see that the detection performance varies only slightly when the radius of the neighborhood is larger than three pixels. Similar effects can be observed in the other sub-plots of Figure 7. This means the proposed method has stable performance once the parameters are set within an appropriate range. In the following experiments, the four parameters are fixed at values within these stable ranges.

Figure 7. Performance of motion beat detection in terms of precision, recall, and F-measure, under different parameter settings. (Sub-plots (a)-(d) vary the neighborhood radius, the smoothing parameter, the turning threshold, and the outlier-filtering parameter, respectively.)

Generally, the proposed method has higher recall than precision. We estimate the fundamental period from the constructed sinusoid, and thus describe the repeated characteristics of the signal. More true beats can be detected if the reference beats are better estimated, and therefore the recall rate increases. In the developed applications, we prefer to detect as many motion beats as possible to provide finer ROM. If the music matches strong motion beats well, humans may be highly satisfied with the manipulated videos. That is why the achieved average F-measure is sufficient for the following applications.

Based on the second dataset, we compare motion beats detected by three different methods: (1) detection based on motion magnitude difference (baseline), (2) detection based on luminance difference [15], and (3) our approach, motion trajectory analysis. Figure 8 shows that the best F-measure values achieved by the three methods are 0.13, 0.18, and 0.58, respectively. Guedes estimated motion magnitude by luminance changes between frames [15], and then estimated the dominant frequency from the motion magnitude evolution. However, in the second dataset, dance videos were captured in uncontrolled environments, and varied luminance changes hurt Guedes's approach. The proposed method analyzes motion trajectories and thus captures motion beats more reliably.

Figure 8. Performance comparison of ROM extraction for the second dataset. (Precision, recall, and F-measure of the baseline, Guedes's approach, and our approach.)

Figures 9(c), (e), and (g) show frames right at the detected motion beats, and Figures 9(b), (d), (f), and (h) show frames in between motion beats. We see that movements at detected motion beats are really stops of movements or ends of postures.

Figure 9. A sequence of video frames and the corresponding motion beats.

We further evaluate the proposed method on videos consisting of multiple dancers and lasting for more than one minute. Similar to the demand for stationarity in digital signal processing, the proposed method only works well for video clips with stationary motion beats. Therefore, videos longer than one minute are appropriately segmented in advance, so that within each segment motion beats are stationary. Figure 10 shows the average precision, recall, and F-measure for the third dataset. Our method has slightly higher precision, but performs significantly better in recall. For the videos in the third dataset, Guedes's approach does not have a clear advantage over the baseline approach. By comparing Figure 8 with Figure 10, we confirm that extracting motion beats in videos with multiple dancers is much harder than with a single dancer.

Figure 10. Performance comparison of ROM extraction for the third dataset. (Precision, recall, and F-measure of the baseline, Guedes's approach, and our approach.)

C. Performance of Background Music Replacement

The performance of background music replacement is hard to measure, because the judgment is subjective and the ground truth is hard to formulate. Moreover, not every music beat is interpreted by dancers, and different dancers may interpret the music differently, which makes quantitative measurement infeasible. Therefore, we conduct subjective tests on the basis of replacement results for the second dataset.

Two sets of subjects were invited to the subjective evaluation: twenty ordinary users who had varied musical knowledge and were not familiar with street dance, and eleven dancers from the street dance club of our university who had taken dancing training for years. The former set of subjects was invited to verify whether the proposed method generally achieves satisfactory performance for ordinary users. Basic musical and choreographic knowledge was introduced to them before the test. The latter set of subjects was invited to examine the finer rhythmic relationship between video and music. We describe the two experiments separately as follows.

Ordinary users' evaluation:

The questionnaire for ordinary users is designed as:

Q1: Do you think the videos with background music replacement provide a better viewing experience than the original videos? (Yes/No)

Q2: According to how the dancer moves with the rhythm of music (caused by drum, cymbal, etc.), evaluate how close the video with background music replacement is to the original video. The score ranges from one to five, and a higher score means the "rhythmic properties between music and motion" are closer to the original video.

Q3: According to how the dancer moves with the emotion of the music content (derived from melody, vocal, lyrics, etc.), evaluate the degree of satisfaction with the video with background music replacement. The score ranges from one to five, and a higher score means higher satisfaction.

Q4: Rank the videos generated by the three methods in Section VI.B. The rank is an integer ranging from one to three, and a smaller value means higher preference.

We conduct background music replacement based on ROM extracted by motion magnitude difference (baseline), Guedes's approach, and our approach, respectively. In the subjective tests, we follow the DSIS (Double Stimulus Impairment Scale) scheme defined in ITU-R Recommendation BT.500-11. The original video was played first, followed by the result generated by one of the three approaches.

For the first question, 87.5% of the videos with background music replacement were thought to provide a better viewing experience. This result confirms that it is worthwhile to conduct background music replacement.

Table 2 shows the results of Q2 and Q3 for different dance styles. The standard deviations of the scores are reported in parentheses. Videos in the second dataset can be divided into four sub-categories: hip-hop, popping, locking, and freestyle. Hip-hop is a dance style focusing on grooving and interpreting the drums in music. Popping consists of pop, wave, and stopping poses, which is able to describe music beats well. Locking is about arm twisting, kicks, points, and elastic movements. Locking is funky, and dancers often pay attention to the moments when music beats appear. Freestyle does not have major movements, but focuses on how to precisely interpret the music emotion represented by melody, vocal, etc. Overall, our method jointly considers the evolutions of motion magnitude and orientation, and more accurately extracts rhythm of motion to facilitate better background music replacement.

Table 2. Subjective performance of BGM replacement evaluated by ordinary users.

| | Hip-Hop (12) | Popping (13) | Locking (13) | Freestyle (12) | Overall (50) |
|---|---|---|---|---|---|
| Q2 (base) | 3.29 (0.43) | 3.13 (0.58) | 3.08 (0.63) | 3.00 (0.84) | 3.12 |
| Q2 ([15]) | 3.37 (0.44) | 3.05 (0.55) | 3.05 (0.65) | 2.89 (0.53) | 3.08 |
| Q2 (Our) | 3.37 (0.35) | 3.26 (0.46) | 3.42 (0.52) | 3.63 (0.48) | 3.42 |
| Q3 (base) | 3.24 (0.54) | 3.30 (0.53) | 3.08 (0.57) | 3.10 (0.74) | 3.17 |
| Q3 ([15]) | 3.32 (0.53) | 3.19 (0.60) | 2.98 (0.63) | 3.01 (0.53) | 3.11 |
| Q3 (Our) | 3.33 (0.45) | 3.41 (0.59) | 3.40 (0.57) | 3.61 (0.52) | 3.44 |

For hip-hop, our approach does not have clear superiority over the other methods. In general, hip-hop movements not only interpret music beats, but also interpret the progression between music beats. Our current method focuses on the time instants of motion beats and music beats, and a further study of the progression between beats is needed in the future. We achieve good performance for popping and locking. Dancers in these styles strike strong motion beats according to the music beats caused by percussion instruments. We have much better performance for freestyle dances, which focus on the artistic conception conveyed in the music content. Generally, different dance styles affect ROM extraction and background music replacement.

Figure 11 shows the results of Q4. We clearly see that our approach is the most preferable except for hip-hop dances, which confirms the trend shown in Table 2.

Figure 11. Ordinary users' preference on BGM replacement results for different dance styles. (Average ranks of the baseline, Guedes's approach, and our approach for each style.)

Dancers' evaluation:

Because dancers have richer musical and choreographic knowledge, a more detailed evaluation can be conducted. To observe the detailed rhythmic relationship between video and music, the second question Q2 was divided into two finer questions:

Q2-1: According to how the dancer moves with the dominant rhythm of music¹, evaluate how close the video with background music replacement is to the original video.

Q2-2: According to how the dancer moves with the characteristic rhythm of music², evaluate how close the video with background music replacement is to the original video.

The question Q1 does not need to be measured again, because this application is intuitive to dancers. Table 3 provides the evaluation results from dancers for Q2-1, Q2-2, and Q3. Our method also has promising performance based on the dancers' evaluation. The performance for Q2-1 is better than that for Q2-2, which confirms that the dominant rhythm is easier to detect than the characteristic rhythm. The results for Q3 are worse than those for Q2-1 and Q2-2. This is reasonable because Q3 is related to music emotion, which has not been considered currently.

¹ The music beats produced by the drum form the dominant rhythm of music. They are strong and repeat with a fixed period. If the speeds of two music pieces are the same, their dominant rhythms are identical.
² The music beats produced by the cymbal and snare drum form the characteristic rhythm of music. They are relatively weaker than the dominant rhythm. Two music pieces that have the same dominant rhythm may have different characteristic rhythms, depending on the arrangement of the music.

Table 3. Subjective performance of BGM replacement evaluated by dancers.

| | Hip-Hop | Popping | Locking | Freestyle | Overall |
|---|---|---|---|---|---|
| Q2-1 (base) | 3.48 (0.06) | 3.30 (0.10) | 3.18 (0.40) | 2.70 (0.26) | 3.16 |
| Q2-1 ([15]) | 3.33 (0.40) | 2.93 (0.23) | 3.09 (0.10) | 2.80 (0.10) | 3.03 |
| Q2-1 (Our) | 3.63 (0.17) | 3.23 (0.12) | 3.36 (0.16) | 3.60 (0.26) | 3.45 |
| Q2-2 (base) | 3.67 (0.29) | 3.50 (0.10) | 3.39 (0.28) | 2.83 (0.29) | 3.34 |
| Q2-2 ([15]) | 3.41 (0.45) | 3.33 (0.31) | 3.09 (0.24) | 2.97 (0.06) | 3.19 |
| Q2-2 (Our) | 3.85 (0.28) | 3.23 (0.61) | 3.61 (0.10) | 3.73 (0.31) | 3.60 |
| Q3 (base) | 3.41 (0.17) | 3.40 (0.26) | 3.09 (0.24) | 2.57 (0.31) | 3.11 |
| Q3 ([15]) | 3.26 (0.55) | 3.17 (0.38) | 3.09 (0.24) | 2.63 (0.06) | 3.03 |
| Q3 (Our) | 3.44 (0.19) | 3.47 (0.31) | 3.48 (0.14) | 3.50 (0.35) | 3.48 |

Figure 12 shows the dancers' preference on replacement results for different dance styles. These results are similar to those in the ordinary users' evaluation. However, for popping our ranking result is worse than the baseline. Popping contains lots of static poses, which facilitate motion beat detection by the baseline approach. In Table 3, the baseline method achieves better performance for Q2-1 in popping, which corresponds to the ranking result in Figure 12. Overall, our method has better performance for all dance styles except popping. The performance variation between ordinary users and dancers reveals their knowledge gaps in music and choreography.

Figure 12. Dancers' preference on BGM replacement results for different dance styles. (Average ranks of the baseline, Guedes's approach, and our approach for each style.)

D. Performance of Music Video Generation

Evaluating music segmentation is subjective, and the performance may differ across music types and applications. In our work, we provide an evaluation guideline, shown in Table 4, to reduce the variation of subjective evaluation. If the difference between the best boundary and a detected boundary is smaller than twice the dominant period, the detected boundary is claimed to be close to the best boundary. For the second dataset, the average score is 3.104, i.e., most boundaries are given scores over three and are located at music beats.

Table 4. The guideline for evaluating music segmentation.

| Score | Description |
|---|---|
| 5 | The boundary is accurately located at the best boundary (music beat) between music segments. |
| 4 | The boundary is located at a music beat, which is not the best but is close to the best boundary. |
| 3 | The boundary is not located at a music beat but is close to the best boundary. |
| 2 | Although the boundary is located at a music beat, it is far from the best boundary. |
| 1 | The boundary is not located at a music beat, and it is far from the best boundary. |

To verify that the proposed rhythm-based music video generation is attractive, we compare music videos generated by Algorithm 4 with those generated by randomly selecting a video segment for each music segment. Ten music videos were generated by each of the two approaches. The observers were asked to evaluate whether the selected video segments are suitable for the background music, and to give a score ranging from one to five (a higher score means higher satisfaction). Overall, our music videos obtain 3.42 on average, while the music videos generated by random selection obtain 2.46 on average. The score is especially high if the ROM in the selected video segment is a multiple of that of the background music.

E. Discussion

We describe the limitations of our current work in the following:
• In videos with substantial lighting changes, to the best of our knowledge there is no robust method to extract motion trajectories. Much more advancement should be made, and this issue cannot be addressed by our current paper.
• Noisy trajectories influence performance, and that is why we do not achieve perfect ROM extraction (Figure 8). Dancers often have violent and non-rigid movements, which pose significant challenges in trajectory extraction.
• In contrast to music rhythm, which has been studied for a century, the currently extracted ROM is poorer. For example, different body parts may synchronize to different levels of music rhythm [30]. A dancer may move the main trunk with the base pulse, but arms or legs may move more drastically at a finer metrical level. In this work we just extract one dominant period from various motions. Extracting motion at different metrical levels may be achieved if motion sensors are attached to the human body.

are attached to human body. [2] J.-Y. Bouguet, “Pyramidal Implementation of the Lucas Kanade Feature Tracker Description of the Algorithm,” Intel Corporation Microprocessor Research Labs , 2000. While the current work is limited by the aspects mentioned [3] A. Camurri, S. Hashimoto, M. Ricchetti, A. Ricci, K. Suzuki, R. Trocca, above, we also point out a few extensions: and G. Volpe, “EyesWeb: Toward Gesture and Affect Recognition in ò The proposed rhythm-based analysis can be extended to more Interactive Dance and Music Systems,” Journal , 24(1), pp. 57-69, 2000. applications. For example, as we have developed a way to [4] R. Cutler and L.S. Davis, “Robust Real-Time Periodic Motion Detection, transform videos and music into rhythm sequences, and have Analysis, and Applications,” IEEE Transactions on Pattern Analysis and designed a metric to evaluate cross-media similarity, we are Machine Intelligence , 22(8), pp. 781-796, 2000. [5] H. Denman, E. Doyle, A. Kokaram, D. Lennon, R. Dahyot, and R. Fuller, able to retrieve videos by giving a musical query or retrieve “Exploiting Temporal Discontinuities for Event Detection and music by giving a video query. Rhythm-based cross-media Manipulation in Video Streams,” In Proceedings of ACM International retrieval would be a new way to retrieve media that have Workshop on Multimedia Information Retrieval , pp. 183-192, 2005. clear periodic or rhythmic content. [6] S. Dixon, “Automatic Extraction of Tempo and Beat from Expressive Performances,” Journal of New Music Research , 30(1), pp. 39-58, 2001. ò Another plausible extension is surveillance video analysis. [7] W.J. Dowling and D.L. Harwood, Music Cognition . Academic Press, By analyzing periodic changes of motion from specific 1985. objects or humans, events such as person walking/running or [8] J. Foote, M. Cooper, and A. Girgensohn, “Creating Music Videos Using Automatic Media Analysis,” In Proceedings of ACM Multimedia , pp. car entering through a gate can be detected. 553-560, 2002. Rhythmic patterns can be found in various media, such as [9] R. Gauldin, Harmonic Practice in Tonal Music . W.W. Norton & motion in videos, beats in music, and emphasized tones in Company, 2nd edition, 2004. [10] R.I. Godoy, “Gestural Imagery in the Service of Musical Imagery,” speech. For a specific domain, rhythm information may be clear Lecture Notes in Computer Science , 2915, pp. 55-62, 2004. and can be explicitly extracted. However, for media that are [11] S.M. Goldfeld, R.E. Quandt, and H.F. Trotter, “Maximization by disordered, the proposed techniques may make no sense. The Quadratic Hill-Climbing,” Econometrica , 34(3), pp. 541-551, 1966. [12] F. Gouyon and S. Dixon, “A Review of Automatic Rhythm Description former perspective shows the feasibility of the proposed idea, Systems,” Computer Music Journal , 29(1), pp. 34-54, 2005. while the later perspective gives the limitation. [13] P. Grosche and M. Muller, “A Mid-Level Representation for Capturing Dominant Tempo and Pulse Information in Music Recordings,” In Proceedings of International Society for Music Information Retrieval , pp. VII. CONCLUSION 189-194, 2009. We have presented associating rhythm of motion with rhythm [14] P. Grosche, M. Muller, and C.S. Sapp, “What Makes Beat Tracking of music to facilitate rhythm-based multimodal analysis. We Difficult? A Case Study on Chopin ,” In Proceedings of International Society for Music Information Retrieval , pp. 649-654, devise a method to reliably extract rhythm of motion from 2010. motion trajectories. 
Rhythmic patterns can be found in various media, such as motion in videos, beats in music, and emphasized tones in speech. For a specific domain, rhythm information may be clear and can be explicitly extracted. However, for media whose content is disordered, the proposed techniques may not apply. The former perspective shows the feasibility of the proposed idea, while the latter indicates its limitation.

VII. CONCLUSION

We have presented how to associate rhythm of motion with rhythm of music to facilitate rhythm-based multimodal analysis. We devise a method to reliably extract rhythm of motion from motion trajectories. This approach well captures fine-grained human motion, especially periodic motion changes in dance videos. Dance videos and music are respectively transformed into motion beat and music beat sequences, which are then compared and aligned. We demonstrate the effects of rhythm-based cross-media alignment with the applications of background music replacement and music video generation. The objective evaluation shows promising performance of rhythm of motion extraction. We also show that videos with replaced background music provide a better viewing experience, although the impact may vary with dance style. Another subjective evaluation verifies that rhythm information provides useful clues for generating rhythmic music videos.

ACKNOWLEDGEMENT

The authors would like to thank Yi-Sheng Chang for conducting parts of the experiments. The authors would also like to thank the anonymous reviewers for their valuable comments. The work was partially supported by the National Science Council of Taiwan, Republic of China, under research contracts NSC 100-2221-E-194-061 and NSC 99-2221-E-194-036.

REFERENCES

[1] T.-J. Borer, "Motion Vector Field Error Estimation," U.S. Patent 6442202B1, 2002.
[2] J.-Y. Bouguet, "Pyramidal Implementation of the Lucas Kanade Feature Tracker: Description of the Algorithm," Intel Corporation Microprocessor Research Labs, 2000.
[3] A. Camurri, S. Hashimoto, M. Ricchetti, A. Ricci, K. Suzuki, R. Trocca, and G. Volpe, "EyesWeb: Toward Gesture and Affect Recognition in Interactive Dance and Music Systems," Computer Music Journal, 24(1), pp. 57-69, 2000.
[4] R. Cutler and L.S. Davis, "Robust Real-Time Periodic Motion Detection, Analysis, and Applications," IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(8), pp. 781-796, 2000.
[5] H. Denman, E. Doyle, A. Kokaram, D. Lennon, R. Dahyot, and R. Fuller, "Exploiting Temporal Discontinuities for Event Detection and Manipulation in Video Streams," in Proceedings of ACM International Workshop on Multimedia Information Retrieval, pp. 183-192, 2005.
[6] S. Dixon, "Automatic Extraction of Tempo and Beat from Expressive Performances," Journal of New Music Research, 30(1), pp. 39-58, 2001.
[7] W.J. Dowling and D.L. Harwood, Music Cognition. Academic Press, 1985.
[8] J. Foote, M. Cooper, and A. Girgensohn, "Creating Music Videos Using Automatic Media Analysis," in Proceedings of ACM Multimedia, pp. 553-560, 2002.
[9] R. Gauldin, Harmonic Practice in Tonal Music, 2nd edition. W.W. Norton & Company, 2004.
[10] R.I. Godoy, "Gestural Imagery in the Service of Musical Imagery," Lecture Notes in Computer Science, 2915, pp. 55-62, 2004.
[11] S.M. Goldfeld, R.E. Quandt, and H.F. Trotter, "Maximization by Quadratic Hill-Climbing," Econometrica, 34(3), pp. 541-551, 1966.
[12] F. Gouyon and S. Dixon, "A Review of Automatic Rhythm Description Systems," Computer Music Journal, 29(1), pp. 34-54, 2005.
[13] P. Grosche and M. Muller, "A Mid-Level Representation for Capturing Dominant Tempo and Pulse Information in Music Recordings," in Proceedings of International Society for Music Information Retrieval, pp. 189-194, 2009.
[14] P. Grosche, M. Muller, and C.S. Sapp, "What Makes Beat Tracking Difficult? A Case Study on Chopin Mazurkas," in Proceedings of International Society for Music Information Retrieval, pp. 649-654, 2010.
[15] C. Guedes, "Extracting Musically-Relevant Rhythmic Information from Dance Movement by Applying Pitch Tracking Techniques to a Video Signal," in Proceedings of Sound and Music Computing Conference, pp. 25-33, 2006.
[16] X.-S. Hua, L. Lu, and H.-J. Zhang, "Automatic Music Video Generation Based on Temporal Pattern Analysis," in Proceedings of ACM Multimedia, pp. 472-475, 2004.
[17] T.-H. Kim, S.-I. Park, and S.Y. Shin, "Rhythmic-Motion Synthesis Based on Motion-Beat Analysis," ACM Transactions on Graphics, 22(3), pp. 392-401, 2003.
[18] I. Laptev, S.J. Belongie, P. Perez, and J. Wills, "Periodic Motion Detection and Segmentation via Approximate Sequence Alignment," in Proceedings of International Conference on Computer Vision, 2005.
[19] M. Leman, Embodied Music Cognition and Mediation Technology. MIT Press, 2007.
[20] J.S. Marques and L.B. Almeida, "Frequency-Varying Sinusoidal Modeling of Speech," IEEE Transactions on Acoustics, Speech, and Signal Processing, 37(5), pp. 763-765, 1989.
[21] J. Min, R. Kasturi, and O. Camps, "Extraction and Temporal Segmentation of Multiple Motion Trajectories in Human Motion," in Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 118-122, 2004.
[22] J.L. Oliveira, F. Gouyon, L.G. Martins, and L.P. Reis, "IBT: A Real-Time Tempo and Beat Tracking System," in Proceedings of International Society for Music Information Retrieval, 2010.
[23] J.G. Proakis and D.G. Manolakis, Digital Signal Processing: Principles, Algorithms, and Applications. Prentice Hall, 1999.
[24] B.M. Sadler and S.D. Casey, "On Periodic Pulse Interval Analysis with Outliers and Missing Observations," IEEE Transactions on Signal Processing, 46(11), pp. 2990-3002, 1998.
[25] E.D. Scheirer, "Tempo and Beat Analysis of Acoustic Musical Signals," The Journal of the Acoustical Society of America, 103(1), pp. 588-601, 1998.
[26] W.A. Sethares, Rhythm and Transforms. Springer, 2007.

[27] J. Shi and C. Tomasi, "Good Features to Track," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 593-600, 1994.
[28] T. Shiratori, A. Nakazawa, and K. Ikeuchi, "Rhythmic Motion Analysis Using Motion Capture and Musical Information," in Proceedings of IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, pp. 89-92, 2003.
[29] C.-W. Su, H.-Y. M. Liao, H.-R. Tyan, C.-W. Lin, D.-Y. Chen, and K.-C. Fan, "Motion Flow-Based Video Retrieval," IEEE Transactions on Multimedia, 9(6), pp. 1193-1201, 2007.
[30] P. Toiviainen, G. Luck, and M. Thompson, "Embodied Meter: Hierarchical Eigenmodes in Music-Induced Movement," Music Perception, 28(1), pp. 59-70, 2010.
[31] J. Wang, C. Xu, E. Chng, L. Duan, K.-W. Wan, and Q. Tian, "Automatic Generation of Personalized Music Sports Video," in Proceedings of ACM Multimedia, pp. 735-744, 2005.
[32] J.-C. Yoon, I.-K. Lee, and S. Byun, "Automated Music Video Generation Using Multi-Level Feature-Based Segmentation," Multimedia Tools and Applications, 41(2), pp. 197-214, 2009.
[33] J.-C. Yoon, I.-K. Lee, and H.-C. Lee, "Feature-Based Synchronization of Video and Background Music," Lecture Notes in Computer Science, 4153, pp. 205-214, 2006.
[34] F. Eyben, S. Bock, B. Schuller, and A. Graves, "Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks," in Proceedings of International Society for Music Information Retrieval Conference, pp. 589-594, 2010.
[35] J. Feng, B. Ni, and S. Yan, "Auto-Generation of Professional Background Music for Home-Made Videos," in Proceedings of International Conference on Internet Multimedia Computing and Service, pp. 15-18, 2010.
[36] H.-C. Lee and I.-K. Lee, "Automatic Synchronization of Background Music and Motion in Computer Animation," in Proceedings of Eurographics, pp. 353-362, 2005.
[37] J.-C. Yoon and I.-K. Lee, "Synchronized Background Music Generation for Video," in Proceedings of International Conference on Advances in Computer Entertainment Technology, pp. 353-362, 2005.
[38] J.-I. Nakamura, T. Kaku, K. Hyun, T. Noma, and S. Yoshida, "Automatic Background Music Generation based on Actors' Mood and Motions," The Journal of Visualization and Computer Animation, 5, pp. 247-264, 1994.
[39] L.G. Ratner, "Eighteenth-Century Theories of Musical Period Structure," The Musical Quarterly, XLII(4), pp. 439-454, 1956.
[40] R. Huber and B. Kollmeier, "PEMO-Q – A New Method for Objective Audio Quality Assessment Using a Model of Auditory Perception," IEEE Transactions on Audio, Speech, and Language Processing, 14(6), pp. 1902-1911, 2006.

Wei-Ta Chu received the B.S. and M.S. degrees in Computer Science from National Chi Nan University, Taiwan, in 2000 and 2002, respectively, and the Ph.D. degree in Computer Science from National Taiwan University, Taiwan, in 2006. Since 2007, he has been an Assistant Professor in the Department of Computer Science and Information Engineering, National Chung Cheng University, Taiwan. His research interests include digital content analysis, multimedia indexing, digital signal processing, and pattern recognition. He won the Best Full Technical Paper Award at ACM Multimedia 2006. He was a visiting scholar at the Digital Video & Multimedia Laboratory, Columbia University, from July to August 2008. He serves as an editorial board member for the Journal of Signal and Information Processing, and as a guest editor for Advances in Multimedia and IEEE Transactions on Multimedia.

Shang-Yin Tsai received the B.S. and M.S. degrees in Computer Science from National Chung Cheng University, Taiwan, in 2008 and 2010, respectively. His research interests include digital content analysis and multimedia systems.