
ICMSIT 2017: 4th International Conference on Management Science, Innovation, and Technology 2017
Faculty of Management Science, Suan Sunandha Rajabhat University (http://www.icmsit.ssru.ac.th)

FILE CONVERSION AFTERMATH: ANALYSIS OF AUDIO FILE STRUCTURE FORMAT

JENNIFER L. SANTOS¹
JASMIN D. NIGUIDULA

Abstract

Technological innovation has brought a massive leap in data processing. As information becomes broadly accessible, various tools have been produced to create more types of format from existing data. These formats are then manipulated to fit different processing structures and generate the needed results. In information technology, audio is one of the most flexible types of data and can be manipulated into different forms. In this light, this paper evaluates the conversion of multiple audio files in waveform audio file (.wav) format to .mp3, .wma, and .aac formats using standard parameters. The evaluation demonstrated noteworthy changes compared to the original files and, supported by these observations, this study further explains the structures behind the converted output.

Keywords: Audio Compression, Audio Format, Audio Analysis

1. INTRODUCTION

The sounds that humans hear in nature are analog, and our auditory senses process them in their original form; to be stored, however, these sounds must be converted to digital form. An audio file is created on a PC either from a recording or by a synthesizer. Many applications are now available in which people can select, listen to, and sometimes download the songs they love. Music lovers can download audio files in various formats. Even journal articles can be converted to audio files, particularly in mp3 format, so that people with low vision or no vision at all can access them. Downloading data from the internet may affect the size and quality of the data. For easy uploading and downloading over the internet, data compression is used.
Data compression is a process in which the number of bits used to represent data is reduced for easier storage and transmission. Ordinary music listeners may not be able to distinguish compressed music from uncompressed music, especially when both are digital audio files. Audio files on compact discs (CDs) are uncompressed and are typically converted to compressed formats for use on a computer or portable device. The quality of an audio file depends on the bit rate: for a given format, a higher bit rate generally gives better audio quality. This paper aims to analyze the factors that affect audio quality upon audio conversion. The original and converted data are compared and evaluated in terms of audio file size, audio duration, audio stream size, audio sampling rate, and audio overall bit rate.

¹ Technological Institute of the Philippines (T.I.P.), Manila. Email: [email protected], [email protected]

2. RELATED WORKS

There are many different types of audio files. The most common are Wave files (wav) and MPEG Layer-3 files (mp3). The way the sound is compressed and stored is called the codec, which determines how small the file is. Some file types always use a particular codec. Waveform Audio File Format (wav) is a standard audio file format used mostly on Windows PCs. MPEG-1 Audio Layer-3 (mp3) is a standard format for downloading and storing audio at around one-twelfth of the size of the original audio file while largely maintaining the quality of the sound. Free Lossless Audio Codec (flac), on the other hand, is a lossless compression codec: compressing and decompressing a file of this type does not alter the original audio data.
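The link between bit rate and file size, and the one-twelfth figure quoted for mp3 above, can be checked with simple arithmetic. The sketch below is illustrative only: the CD-quality source (44.1 kHz, 16 bits per sample, two channels) and the 180-second duration are assumptions and not values from this study, and the function names are hypothetical.

```python
def pcm_bit_rate_kbps(sample_rate_hz, bits_per_sample, channels):
    """Bit rate of uncompressed PCM audio in kilobits per second."""
    return sample_rate_hz * bits_per_sample * channels / 1000


def stream_size_mb(bit_rate_kbps, duration_sec):
    """Approximate audio stream size in megabytes (1 MB = 10**6 bytes)."""
    return bit_rate_kbps * 1000 * duration_sec / 8 / 1_000_000


# Assumed source (not from the study): 44.1 kHz, 16-bit, stereo, 180 seconds.
wav_rate = pcm_bit_rate_kbps(44_100, 16, 2)
wav_size = stream_size_mb(wav_rate, 180)

# "Around one-twelfth of the size" at equal duration means
# roughly one-twelfth of the bit rate.
mp3_rate = wav_rate / 12
mp3_size = stream_size_mb(mp3_rate, 180)

print(f"wav: {wav_rate:.1f} kbps, {wav_size:.2f} MB")
print(f"mp3 (1/12): {mp3_rate:.1f} kbps, {mp3_size:.2f} MB")
```

Under these assumptions, one-twelfth of the uncompressed bit rate is a little under 120 kbps, which is in the neighborhood of the 128 kbps setting commonly used for mp3 files. Because duration is unchanged by conversion, size scales directly with bit rate.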
Windows Media Audio (wma) is an audio format developed by Microsoft; it can encode digital audio much as mp3 does, but at a higher rate. Advanced Audio Coding (aac) is a lossy compression format for digital audio. At similar bit rates, AAC produces better sound quality than MP3. [6]

Data compression may be classified as lossless or lossy. In lossy compression, the compressed file is very small compared to the original data because some of the signal is removed. [7] Compression of audio, video, and images commonly uses lossy methods. [8] Additionally, data compression is a process that reduces the size of files. It discards redundant and inaudible data to lessen the file size and the transmission time of the data while maintaining the intended audio quality. The term audio compression is used when the data being compressed are audio signals. The original signal is filtered to remove unwanted components so that only the needed digital signal is retained. MP3 uses variable-length Huffman codes, and this results in more effective data compression. Its sampling rate can be 32, 44.1, or 48 kHz. It is well suited to audio transmission over the internet.

Different algorithms are used in compressing audio data, such as run-length encoding (RLE), the Burrows-Wheeler transform (BWT), the move-to-front transform (MTF), and arithmetic coding (ARI). Other techniques are Shannon-Fano and Huffman coding. The result of data compression depends on the data source, while the characteristics of the output depend on whether the target format is lossy or lossless.

3. METHODOLOGY

a. Audio Conversion and Audio Analyzer Tool

The Waveform Audio File (.wav) format audio files used were extracted using the online website www.onlinevideoconverter.com.
The videos were randomly selected from the nursery rhymes on YouTube.com. The URLs of the videos were copied and pasted into the online converter to convert the videos into audio files. This research utilizes an audio conversion tool known as Format Factory. The tool's features include conversion of video, audio, and picture files. Specifically, it converts audio data to MPEG-1 Audio Layer-3 (.mp3), Windows Media Audio (.wma), and Advanced Audio Coding (.aac) format. Because the research assesses multiple audio files converted to four audio formats, another tool is used to analyze the results: the converted files are evaluated using an open-source application, MediaInfo. MediaInfo provides access to technical information about media files. Its features include reading different audio formats, customizable output, export of information, a graphical user interface, and shell integration.

b. Formulation

The experimental method was used in this research, as it manipulates selected data while controlling the rest. In this case, three nursery songs (audio files) are used: Baa Baa Black Sheep, Head Shoulders Knees Toes, and Hey Diddle Diddle. The three original files are in Waveform Audio File (.wav) format and are converted using Format Factory into MPEG-1 Audio Layer-3 (.mp3), Windows Media Audio (.wma), Free Lossless Audio Codec (.flac), and Advanced Audio Coding (.aac) formats. After the conversion, the results are analyzed in the MediaInfo application using the following parameters:

Table 1.
Parameters for Identifying Audio Conversion Findings

Parameters for Audio Conversion Analysis
    Audio File Size
    Audio Duration
    Overall bit rate
    Sampling rate
    Stream Size

Waveform Audio File (.wav) files contain uncompressed audio in Pulse-Code Modulation (PCM) format. In PCM, the signal is represented in binary, so each bit can have only two values, 1 or 0. Pulse code modulation involves three steps, as displayed in Figure 1. The first step, sampling, converts the continuous-time signal into a discrete-time signal whose amplitude is still continuous. It is followed by quantization, in which each sample amplitude is rounded to one of a finite set of levels, discarding excess precision. Finally, in encoding, the quantized samples are represented as binary code words, which completes the digitization of the analog signal.

Figure 1. Block diagram of Pulse Code Modulation (Sampling -> Quantizing -> Encoding)

The Modified Discrete Cosine Transform (MDCT) is used in most lossy formats, such as MP3, AAC, and WMA. MDCT is a Fourier-related transform based on the type-IV DCT, with the additional property of being "lapped", which makes it very useful in audio compression. The following equation is used in MDCT, given here in its standard form for 2N real inputs x_0, ..., x_{2N-1} and N real outputs X_0, ..., X_{N-1}:

$$X_k = \sum_{n=0}^{2N-1} x_n \cos\left[\frac{\pi}{N}\left(n + \frac{1}{2} + \frac{N}{2}\right)\left(k + \frac{1}{2}\right)\right], \qquad k = 0, 1, \ldots, N-1$$

4. RESULTS AND DISCUSSION

This part of the study shows the results extracted for the different parameters after conversion with the audio conversion tool. The audio file size and the audio stream size are both measured in megabytes (MB), the duration of the audio files is presented in seconds (sec), the overall bit rate of the audio files is presented in kilobits per second (kbps), and the audio sampling rate is measured in kilohertz (kHz).

Figure 2.
Result of Audio Conversion of Baa Baa Black Sheep using Format Factory as analyzed by MediaInfo

Since the audio file has been converted to formats in which the file size is reduced, it can be said that the audio file has undergone compression. Figure 2 shows the data extracted from MediaInfo in analyzing Baa Baa Black Sheep. The .wav format has the largest file size and stream size, and the .aac format has the smallest file size and stream size. Duration and sampling rate did not change after the .wav file was converted to the other audio formats used in the study.

Figure 3.
