Classification of Musical Instruments Using SVM and KNN

International Journal of Innovative Technology and Exploring Engineering (IJITEE), ISSN: 2278-3075, Volume-9 Issue-7, May 2020

S. Prabavathy, V. Rathikarani, P. Dhanalakshmi

Revised Manuscript Received on May 07, 2020.
S. Prabavathy*, Research Scholar, Department of Computer and Information Science, Department of Computer Science and Engineering, Annamalai University, Chidambaram, Tamilnadu, India.
V. Rathikarani, Assistant Professor, Department of Computer Science and Engineering, Annamalai University, Chidambaram, Tamilnadu, India.
P. Dhanalakshmi, Professor, Department of Computer Science and Engineering, Annamalai University, Chidambaram, Tamilnadu, India.

Published By: Blue Eyes Intelligence Engineering & Sciences Publication. Retrieval Number: G5836059720/2020©BEIESP. DOI: 10.35940/ijitee.G5836.059720

Abstract: Automatic classification of musical instruments is a challenging task. Music data classification has become a very popular research area in the digital world. Classifying musical instruments has required a great deal of manual effort. This system classifies musical instruments using several acoustic features: MFCC, Sonogram, and MFCC combined with Sonogram. SVM and kNN are the two modeling techniques used to classify the features. This paper aims to simplify musical instrument classification based on features extracted from various instruments using recent algorithms. The proposed work compares the performance of kNN with that of SVM. The instruments are identified, and the accuracy is computed, with the help of the SVM and kNN classifiers. Using the combination of MFCC and Sonogram with SVM, a high accuracy rate of 98% is achieved in classifying musical instruments. The system was tested on sixteen musical instruments to find the accuracy level of SVM and kNN.

Keywords: Musical Instruments Classification, Mel-frequency cepstral coefficients (MFCC), Feature extraction, k-Nearest Neighbor (kNN), Sonogram, Support Vector Machine (SVM).

I. INTRODUCTION

Music is a means of communication and is an integral part of many human activities. Most ancient cultures used music for celebrations, worship, healing and preserving their stories. The fundamental characteristics of the tones produced are pitch, loudness, duration and timbre. One of the basic functions of a musical instrument is to produce tones of the desired pitch [1]. Classification [2] is the process of assigning a class label to a given observation. In music analysis systems, features are computed from the raw audio signal using signal processing techniques. MFCC is used to construct a feature vector. Researchers have examined the purposes for which MFCCs are adequate and the best way to perform the underlying transformations; this work shows that MFCCs are suitable not only for speech but also for music modeling [3]. SVM offers a simple approach to binary classification. It is a conceptual learning machine that learns from a set of training data and tries to generalize and make correct predictions on fresh data [4]. kNN performs much better when all the data have the same scale. It stores the training dataset, which it uses as its representation. The algorithm makes predictions by calculating the similarity between the input sample and each training instance [5].

In [6], SVM and kNN are used for the classification of musical instruments; MFCC, Sonogram, and MFCC combined with Sonogram are used for feature extraction. Musical instruments are classified according to the method used to make sound. Musical instruments have also been classified by their material: a traditional classification, called the eight sounds, is based on eight different kinds of material, namely stone, metal, thread, gourd, skin, earth, bamboo and wood.

There are many types of musical instruments. The instruments used for classification in this work are banjo, cello, guitar, mandolin and violin from the string instruments; bass clarinet, bassoon, clarinet, contrabassoon, flute and saxophone from the woodwind instruments; French horn, trombone, trumpet and tuba from the brass instruments; and piano from the keyboard instruments. In total, 1284 music samples were collected from various musical instrument databases. The block diagram of the proposed work is shown in Fig. 1.

Fig. 1 The block diagram of Musical Instruments Classification.

II. LITERATURE REVIEW

In [7], features were extracted for each isolated word and trained successfully with SVM; the audio content was characterized using Sonogram features. In [8], Western musical instruments were classified using HOS features combined with MFCC, with a kNN classifier used for hierarchical classification. The methods proposed in [9], namely kNN, GMM, ANN and dropout ANN, were used to classify European orchestral musical instruments. In the algorithm proposed in [10], various types of music genres were classified using the support vector machine (SVM), which can perform binary classification. In [11], audio and video data were classified into one of five classes (advertisement, news, cartoon, movie and songs) using Mel frequency cepstral coefficients (MFCC) together with color histogram features, which were extracted from images in the video clips and used as visual features.

In [12], speech emotion was recognized using k-Nearest Neighbor (kNN) and Gaussian Mixture Model (GMM) models across six emotional categories: happy, neutral, angry, surprised, sad and fearful. The emotion 'happy' gave higher accuracy, whereas 'fear' and 'surprise' gave the lowest rates. The kNN classifier was faster to compute than the GMM classifier. In [13], musical instruments were classified using the Minimum Distance Classifier, and the results were analyzed by calculating the FRR. The algorithm proposed in [14] uses an Auto Associative Neural Network (AANN) with Sonogram-based features to recognize speech.

III. FEATURE EXTRACTION

Mel-frequency cepstral coefficients (MFCC)

MFCC is used to construct a feature vector. Many researchers have investigated the purposes for which MFCCs are adequate and the best way to perform the underlying transformations. This work shows that MFCCs are suitable not only for speech but also for music modeling. MFCCs are widely accepted in the fields of speech recognition and human speech and voice identification [15]. In [16], musical genre is classified based on short speech signals using MFCC as the feature extraction method. MFCC is used to characterize audio content by calculating the features. In [17], AANN is used to segment and classify the audio signal automatically, using LPC, LPCC and MFCC as feature extractors. The extraction of MFCC from the music signal is shown in Fig. 2.

Fig. 2 Extraction of MFCC from music signal

Sonogram

Sonogram is a recent feature set; the music data are sampled at 22 kHz in mono format. In [7], the features of each isolated word were extracted and the model was trained effectively. To characterize the audio content, the Sonogram was calculated as the feature set, and an SVM classifier was used to recognize speech by learning from the training data. In [14], spoken words were converted into text using an AANN model with Sonogram as the acoustic feature. Feature vectors from the Sonogram were given as input, and VAD was used to separate distinct words from continuous speech. The block diagram of Sonogram feature extraction is shown in Fig. 3.

Fig. 3 Sonogram Feature Extraction

IV. CLASSIFIERS

Support vector machine (SVM)

Support vector machines are used for nonlinear regression and pattern classification. An SVM constructs a linear model, with non-linear boundaries based on the support vectors, to approximate the result function. When the data are linearly separable, a linear machine is trained to find the optimal hyperplane, which splits the data without error and maximizes the distance between the hyperplane and the nearest training points. The training points closest to the optimal separating hyperplane are called the support vectors. The SVM architecture is shown in Fig. 4.1.

Fig. 4.1 Architecture of SVM

K-Nearest Neighbor (kNN)

The kNN algorithm is simple and efficient. The output is the class with the highest frequency among the k most similar instances: each of these instances votes for its class, and the class with the most votes is taken as the prediction [5]. Fig. 4.2 shows the process of K-Nearest Neighbor (kNN).

Fig. 4.2 K-Nearest Neighbor

V. PROPOSED WORK

In the proposed work, the music signals are preprocessed to remove noise before feature extraction. From every music sample, 39 MFCC features, 22 Sonogram features, and 61 combined MFCC and Sonogram features were extracted. In Python, the LibROSA package is used to convert the sound to its FFT, and the MFCC and Sonogram plot feature extraction is performed on the audio data. In training, the saved features are loaded as input to the classification algorithms, SVM and kNN. Recall, Precision, F-Score and Accuracy were calculated from the confusion matrix. The trained classifiers are saved as models, and inference on those models is used to predict the class. In SVM, linear kernel is used to classify the class that one class compares the other
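
As an illustration of the kNN voting rule described in Section IV, the following is a minimal pure-Python sketch rather than the authors' implementation. The function name knn_predict, the use of Euclidean distance, k = 3, and the two-dimensional toy vectors are all assumptions made for this example; the paper itself works with 39-, 22- and 61-dimensional feature vectors.

```python
import math
from collections import Counter

def knn_predict(train_X, train_y, query, k=3):
    """Predict a label by majority vote among the k nearest training vectors.

    Hypothetical helper for illustration: Euclidean distance is assumed
    as the (dis)similarity measure.
    """
    # Pair every training vector's distance to the query with its label.
    distances = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    # Each of the k nearest instances votes for its own class.
    votes = Counter(label for _, label in distances[:k])
    # The class with the most votes is taken as the prediction.
    return votes.most_common(1)[0][0]

# Toy 2-D feature vectors (invented values, not data from the paper).
train_X = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1),
           (5.0, 5.0), (5.1, 4.9), (4.8, 5.2)]
train_y = ["flute", "flute", "flute", "tuba", "tuba", "tuba"]

pred_low = knn_predict(train_X, train_y, (0.1, 0.1), k=3)
pred_high = knn_predict(train_X, train_y, (5.0, 5.1), k=3)
print(pred_low, pred_high)
```

Because the vote depends directly on distances, features on very different scales would dominate the result, which is consistent with the remark in the Introduction that kNN performs better when all data have the same scale.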

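The evaluation step described in Section V, in which Recall, Precision, F-Score and Accuracy are calculated from the confusion matrix, can be sketched as follows. This is a minimal sketch under stated assumptions: the function name per_class_metrics is hypothetical, rows are assumed to hold true classes and columns predicted classes, and the three-class counts are invented for illustration and are not results from the paper.

```python
def per_class_metrics(cm, labels):
    """Return ({label: (precision, recall, f_score)}, accuracy).

    Assumes cm[i][j] counts samples of true class i predicted as class j.
    """
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(len(labels)))
    report = {}
    for i, label in enumerate(labels):
        tp = cm[i][i]
        fn = sum(cm[i]) - tp                  # true class i, predicted otherwise
        fp = sum(row[i] for row in cm) - tp   # predicted i, true class differs
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        if precision + recall:
            f_score = 2 * precision * recall / (precision + recall)
        else:
            f_score = 0.0
        report[label] = (precision, recall, f_score)
    return report, correct / total

# Invented 3-class confusion matrix (illustrative counts only).
labels = ["flute", "piano", "violin"]
cm = [
    [9, 1, 0],
    [2, 7, 1],
    [0, 2, 8],
]

report, accuracy = per_class_metrics(cm, labels)
for name, (p, r, f) in report.items():
    print(f"{name}: precision={p:.3f} recall={r:.3f} f_score={f:.3f}")
print(f"accuracy={accuracy:.3f}")
```

The same per-class computation extends unchanged to the sixteen instrument classes used in the paper, since it only requires a square matrix and a matching list of labels.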