Use Style: Paper Title
Total Page:16
File Type:pdf, Size:1020Kb
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Binus University Repository Indonesian's Dangdut Music Classification Based on Audio Features Ferdinand Mahardhika Harco Leslie Hendric Spits Warnars Yaya Heryadi Computer Science Departement, Computer Science Departement, Computer Science Departement, BINUS Graduate Program - Doctor of BINUS Graduate Program - Doctor of BINUS Graduate Program – Doctor of Computer Science. Computer Science. Computer Science. Bina Nusantara University Bina Nusantara University Bina Nusantara University Jakarta, Indonesia 11480 Jakarta, Indonesia 11480 Jakarta, Indonesia 11480 [email protected] [email protected] [email protected] Lukas Cognitive Engineering Research Group (CERG), Faculty of Engineering, Universitas Katolik Atma Jaya. Jakarta, Indonesia 12930 [email protected] Abstract—The uniqueness of modern dangdut music today dance music, sometimes there are similarities that are at first is in the beat and melody of music that is relatively faster like glance difficult to distinguish. techno dance music, and the progressive melody arrangement from synthesizer keyboard. In this paper, we will extract audio In this paper, we will extract audio features from music, features from music, and then used that features for and then used that features for classification based on classification based on machine learning methods. Support machine learning methods. The goal in this study is to do Vector Machine (SVM) was used in this study. In this paper’s music clustering between two class of genre music, which is experiment we use 32 audio music files, consisting of 16 for dangdut music and techno dance music. dangdut music and 16 for techno dance music. The results of this testing get varying accuracy levels, between 80% to 90% II. RELATED WORK for each music audio file that has been successfully classified. There have been several studies conducted to classify Keywords—dangdut music classification, machine learning, music using audio features. Basically this is part of music Support Vector Machine information retrieval (MIR). In 2014, Konstantin Markov [4] uses several features to classify music, one of these features I. INTRODUCTION is: MFCC (Mel Frequency Cepstral Coefficients) features and TMBR (timbre features). Classification of 8 musical Very extraordinary music in Indonesia today, where genres, including: Blues, Electronic, Rock, Classical, Folk, music listeners in Indonesia vary widely, from a young age Jazz, Country and Pop. The algorithm used as a classifier is to old age. Various types of music exist in this country, from Support Vector Machine (SVM). Rock, Pop, HipHop, RnB, Rap, Traditional, etc., are growing rapidly in Indonesia. Lately there is an extraordinary In 2016, Prafulla et al [5] used 5 features of audio that phenomenon, where the type of music "Dangdut" starts to are used as a basis for classification. The features that they get a very extraordinary response[1]. In the era of the 80s, extract are: rhythm, timbre, pitch, tonality and dynamic this music was typical of "old style" music that was not so features. This research was conducted on Indian music genre. popular, and its listeners were dominated by the lower The feature is extracted using MIR Toolbox in MATLAB. classes and more often heard in the villages. Dangdut music The algorithm that they used as a classifier is the Neural is basically wilderness music from the Arab region with a Network and SVM. combination of "tabla" musical instruments from India, In 2016, Khonglah et al [6] they use audio features, such which then enter the country of Indonesia and experience as zero crossing rate, spectral roll-off, spectral flux, spectral alkulturation with local culture. centroid and energy in their study. In 2017, Aisha et al [7] Time has changed, nowadays dangdut music is music they clustered traditional Indonesian music using 11 features that is quite popular [2] because it has a more modern "look", on the time domain and frequency domain. These features especially with the influence of western music. "Dangdut include energy, entropy energy, zero-crossing rate, spectral Koplo Remix" is so popular today, a modern form of centroid, spectral entropy, spectral flux, spectral rolloff, mel- dangdut music where a blend of dangdut music with "techno frequency cepstral coefficient, harmonic, chroma vector, and dance music" that originating from European countries. The spectral zone. Meanwhile in the terms of success for doing uniqueness of modern dangdut music today is in the beat and the classification, there is few example show us that they melody of music that is relatively faster like techno dance success in their study using SVM [4],[5],[8],[9],[10] as music[3], and the progressive melody arrangement from classifier in audio classification. synthesizer keyboard. Between dangdut music and techno 978-1-5386-9422-0/18/$31.00 ©2018 IEEE The 1st 2018 INAPR International Conference, 7 Sept 2018, Jakarta, Indonesia 99 III. METHODOLOGY 9. Chroma Vector: In this study we used nine features from audio [7] to do music classification. The features include time-domain audio features, which consist of energy, entropy energy and zero (9) crossing rate. And also frequency-domain audio features, which consist of spectral centroid, spectral entropy, spectral flux, spectral roll-off, harmonic, mel-frequency cepstral coefficient, and chroma vector. IV. EXPERIMENT This experiment is generally divided into four major Following is the equation of audio features: steps, starting from data collection, pre processing data, 1. Energy: testing data and experimental results. Each section we will explain one by one. The whole process flow in this (1) experiment is illustrated in the Fig. 1. 2. Entropy Energy: (2) 3. Zero Crossing Rate: (3) 4. Spectral Centroid: (4) 5. Spectral Entropy: (5) 6. Spectral Flux: Fig. 1. Process flow in experiment (6) A. Data Collection 7. SpectralRoll-Off: (7) 8. Harmonic: (8) Fig. 2. Stereo audio waveform 978-1-5386-9422-0/18/$31.00 ©2018 IEEE The 1st 2018 INAPR International Conference, 7 Sept 2018, Jakarta, Indonesia 100 We collect audio files for dangdut music and techno dance music using Internet search engine and Youtube[11]. No Song Title (Artist) No Song Title (Artist) From the results of this collection, 80 audio files for dangdut Sumpah Aku Nggak Sakit 59 Goyang 2 Jari (Sandrina) 70 music and 80 audio files for techno dance music were (Ratu Idola) Goyang Dumang (Cita Susah Move ON (Duo obtained, so that the total audio files collected were 160 60 71 audio files. All audio files collected are in MP3 format with a Citata) Anggrek) Goyang Nasi padang (Duo Tak Sanggup Lagi (Dinda 61 72 frequency sampling of 44.1 kHz, stereo with the duration of Anggrek) Permata) each song about four minutes. Fig. 2 show stereo audio Goyang Pantura (Velline 62 73 Teganya (Susi Legit) waveform. In table I, is a list of dangdut song title used in Ratu Begal) Goyang Pokemon (Varra this study: 63 74 Tercyduk (Desy Ning Nong) Selvarra) Hello Sayang (Baby Terong Dicabein (Siti 64 75 TABLE I. LIST OF DANGDUT SONG TITLE Sexyola) Badriah) Hey Siapa Kamu (2Racun Tua Tua Keladi (Hesty 65 76 Youbi Sister) Klepek Klepek) No Song Title (Artist) No Song Title (Artist) 66 Jagung Bakar (Lynda Moy) 77 Tuh Kan (Ayurasi) Aku Mah Apa Tuh (Cita Jagung Rebus (Maya Undangan Mantan (Siti 01 5 Centi (Via Vallen) 30 67 78 Citata) Jasika) Badriah) Abang Ghoib (Barbie Jaran goyang (Nella 02 31 Aku Tak Biasa (Ria Amelia) 68 79 Ya Nasib (Susi Legit) Felia) Kharisma) Abang Goda (Duo Alamat Palsu (Ayu Ting 03 32 Serigala) Ting) 69 Kamu Pelakor (Ratu Idola) 80 Yank Haus (2TikTok) Baru 6 Bulan (Connie 04 Aisah Jamilah (Sandrina) 33 Nurlita) B. Data Pre Processing Berdiri Bulu Romaku 05 Aku Jijik (Sandrina) 34 Before the data is used, initial preparations are made for (Bebizy) Kau Tercipta Bukan existing data. Audio music data that has been collected, is 06 Cie Cie (Nella Kharisma) 35 Untukku (Nella Kharisma) converted to mono and with a sampling rate of 44.1 kHz then Cikini Gondangdia (Duo changed the format to WAV. The song duration is taken 07 36 Keong Racun (Lissa) Anggrek) only 20 seconds from the beginning, and then audio 08 Cinta Basi (Regina) 37 Konco Mesra (Meggy Diaz) normalization process is carried out to get the same sound Cinta Ganjil Genap (Susie hardness level to -16 dB. All this process is done using Praat 09 38 Kost Kostan (Duo Serigala) Legit) software[12]. Fig. 3 show the wave form after alteration to Cinta Kurang Gizi (Via 10 39 Kuper (Susi Ngapak) 20s and from stereo to mono with format convertion to Vallen) Cinta Pertama (Hesty WAV. 11 40 Lagi Syantik (Siti Badriah) Klepek Klepek) Makan Hati (2Racun Youbi 12 Cinta Putih (Duo Biduan) 41 sister) Cinta Satu Malam Makan Hati ver 2 (Baby 13 42 (Melinda) Shima) Cinta Tak Terbatas Waktu Mama Minta Pulsa (Siti 14 43 (Ucie Sucita) Badriah) Melanggar Hukum (Siti 15 Cinta Tulalit (Bebizy) 44 Badriah) CKC Cuma Kamu Cin 16 45 Pacar Baru (Yuni R) (Camel Petir) Pacar Satu Satunya (Ratu 17 Coba Coba (iMeyMey) 46 Idola) Cowok Ayam Kampung Perawan Atau Janda (Cita 18 47 (Ayu Maharany) Citata) Cuit Cuit Witwiw (Neng Pernikahan Dini (Cita 19 48 Oshin) Citata) Pura Pura Bahagia (Cita 20 Depan Belakang (Salsiah) 49 Citata) Di Tikung Teman Pusing Pala Barbie (Lula 21 50 (Sandrina) Lula) Sakit Sakit Hatiku (Via 22 Dibalas Dusta (Ratu Idola) 51 Vallen) Digenjot Cinta (Ucie Sakitnya Luar Dalam (Ratu 23 52 Sucita) Meta) Duda Anak 2 (Nyimas 24 53 Sambalado (Ayu Ting Ting) Fig. 3. Wave form after alteration Idola) Sebelas Duabelas (Nella 25 Duyeh (Neng Oshin) 54 Kharisma) C. Training 26 Enaknya Dikamu (Salsiah) 55 Selfie (Viola Arsa) For training and testing purposes, we use Python[13] Gak Ada Waktu Beib 27 56 Sianida (Duo Serigala) based programming languages and specific python open (Ghea Youbi) Galau Ting Ting (Ratu Suamiku Kawin Lagi (Siti library for audio extraction features.