AZIP, Audio Compression System: Research on Audio Compression, Comparison of Psychoacoustic Principles and Genetic Algorithms

Total Page:16

File Type:pdf, Size:1020Kb

AZIP, Audio Compression System: Research on Audio Compression, Comparison of Psychoacoustic Principles and Genetic Algorithms California State University, San Bernardino CSUSB ScholarWorks Theses Digitization Project John M. Pfau Library 2005 AZIP, audio compression system: Research on audio compression, comparison of psychoacoustic principles and genetic algorithms Howard Chen Follow this and additional works at: https://scholarworks.lib.csusb.edu/etd-project Part of the Computer Sciences Commons Recommended Citation Chen, Howard, "AZIP, audio compression system: Research on audio compression, comparison of psychoacoustic principles and genetic algorithms" (2005). Theses Digitization Project. 2617. https://scholarworks.lib.csusb.edu/etd-project/2617 This Project is brought to you for free and open access by the John M. Pfau Library at CSUSB ScholarWorks. It has been accepted for inclusion in Theses Digitization Project by an authorized administrator of CSUSB ScholarWorks. For more information, please contact [email protected]. AZIP, AUDIO COMPRESSION SYSTEM: RESEARCH ON AUDIO COMPRESSION, COMPARISON OF PSYCHOACOUSTIC PRINCIPLES AND GENETIC ALGORITHMS A Project Presented to the Faculty of California State University, San Bernardino In Partial Fulfillment of the Requirements for the Degree Master of Science in Computer Science by Howard Chen March 2005 AZIP, AUDIO COMPRESSION SYSTEM: RESEARCH ON AUDIO COMPRESSION, COMPARISON OF PSYCHOACOUSTIC PRINCIPLES AND GENETIC ALGORITHMS A Project Presented to the Faculty of California State University, San Bernardino by Howard Chen March 2005 Approved by: Dr. Keith Schubert ABSTRACT The purpose of this project is to investigate the differences between psychoacoustic principles and genetic algorithms (GA). These two theories will be discussed separately. The review will also compare the compression ratio and the quality of the decompressed files decoded by these two methods. The report will thoroughly discuss the reason that psychoacoustic principles produce small compressed files without affecting the quality of the sound. Two important topics, absolute threshold of hearing and masking effect, will be discussed in the report. Unlike psychoacoustic principles, GA uses a different approach to achieve high compression ratio. In the review, topics related to GA will be discussed. These topics will include the concept, training procedure, and the performance of GA. A computer program called Azip has been developed and is included for the use of researchers who are interested in this topic. In the review, the design and implementation of Azip will be discussed. Several experiments have been done using Azip. The result of experiments is discussed in the report as well. iii The review also compares the performance of Azip with that of Ogg and MPEG Layer3 (MP3), so the readers can have an understanding on their differences. iv ACKNOWLEDGMENTS Firstly, I would like to thank my supervisor, Professor Tong Lai Yu for his supervision and his support. He provided lots of valuable information that extended my understanding in audio compression and eventually improved the quality of the project. I would also like to thank my two-committee members, Dr. Kay Zemoudeh and Dr. Keith Schubert, for their suggestions and help. I would like to thank Dr. Jenq-Shyang Pan and Yi-Chih Hsin for introducing and discussing genetic algorithms. They also gave me lots of help in solving many problems and debugging many programs. I would like to extend my thanks to many other colleagues and friends helped me during my master study. In no particular orders, thanks to: Chun-Hsien Wang, John Ho, Jacob Lee, Chao-Kang Lee, Chuan-Ting Chou, Tang-Li Chen, Steven Chih, and Dennis Hsieh. Finally, I wish to thank to my parent and my sister for their encouragement and support. The support of the National Science Foundation under award 9810708 is gratefully acknowledged. v TABLE OF CONTENTS ABSTRACT.................................................. iii ACKNOWLEDGMENTS .......................................... v LIST OF TABLES........................................... ix LIST OF FIGURES.................... x CHAPTER ONE: INTRODUCTION 1.1 Purpose of the Project........................ 1 1.2 The Scope of the project...................... 2 1.3 Organization of Chapters ...................... 5 CHAPTER TWO: PSYCHOACOUSTIC PRINCIPLES 2.1 Introduction ................................... 7 2.2 Absolute Threshold of Hearing ................ 8 2.3 Masking Effect............ '.................... 10 2.3.1 Simultaneous Masking ................... 10 2.3.2 Non-Simultaneous Masking .............. 11 CHAPTER THREE: GENETIC ALGORITHMS 3.1 Introduction................................... 13 3.2 Training Processes ............................ 13 3.2.1 Initialization......................... 15 3.2.2 Evaluation.............................. 16 3.2.3 Selection............................... 16 3.2.4 Crossover............................... 16 3.2.5 Mutation................................ 18 3.3 Applying Genetic Algorithms on Audio Compression.................................... 19 vi 3.4 Problem of Genetic Algorithm and Conclusion................... ......... 22 CHAPTER FOUR: PROJECT DESIGN AND IMPLEMENTATION 4.1 Overall Description ........................... 24 4.1.1 Project .Functions............ ......... 24 4.1.2 Hardware Interface ..................... 24 4.1.3 User Interface......................... 24 4.2 Project Functions Design and Implementation.............................. 26 4.2.1 Filter Implementation...... 27 4.2.2 Implementation of Genetic Algorithms Training Processes ......... 34 4.2.3 Implementation of Psychoacoustic Principles............................. 3 8 CHAPTER FIVE: PERFORMANCE ANALYSIS 5.1 Filter Analysis................................ 41 5.1.1 Discrete Cosine Transform and Modified Discrete Cosine Transform . .............................. 42 5.1.2 Complexity of Discrete Cosine Transform and Modified Discrete Cosine Transform ....................... 44 5.1.3 Performance of Discrete Cosine Transform and Modified Discrete Cosine Transform ....................... 44 5.2 Comparing Psychoacoustic Principles with Genetic Algorithm ............................. 46 5.2.1 Signal-to-Noise Ratio ................. 46 5.2.2 Subjective QualityMeasure ............. 47 vii 5.3 Comparing Azip with Moving Picture Experts Group-Layer3 and Ogg Vorbis .......... 50 5.3.1 Background.............................. 50 5.3.2 Comparing with Ogg and Moving Picture Experts Group-Layer3 .......... 50 CHAPTER SIX: FUTURE ENHANCEMENT AND CONCLUSION 6.1 Vector Quantization ........................... 52 6.2 Conclusion..................................... 53 APPENDIX A: ABBREVIATION ................................ 56 APPENDIX B: COMPARING PSYCHOACOUSTIC PRINCIPLES WITH GENETIC ALGORITHMS .................... 58 APPENDIX C: COMPARING AZIP WITH MOVING PICTURE EXPERTS GROUP-LAYER3A ND OGG............... 60 REFERENCES............................................... 62 viii LIST OF TABLES Table 1. Legal Commands in Azip Audio Compression System ............................ 25 Table 2. Files in Azip Audio Compression System ...... 27 Table 3. Critical Functions Used in Discrete Cosine Transform and Modified Discrete Cosine Transform ............................... 28 Table 4. Functions Required for Genetic Training Processes . .................................... 34 ix LIST OF FIGURES Figure 1. Basic Audio Compression and Decompression Process ....................... 3 Figure 2. The Absolute Threshold of Hearing Under Quiet Condition ....................... 8 Figure 3. Non-Simultaneous Masking .................... 11 Figure 4. The Training Procedures of Genetic Algorithms................................... 15 Figure 5. Music Handling for Genetic Algorithm ....... 20 Figure 6. Compressing Process of Genetic Algorithm.................................... 21 Figure 7. Compression Ratio versus Signal-to-Noise Ratio in Genetic Algorithms................................... 22 Figure 8. File Codec_config.h.......................... 25 Figure 9. Implementation of Function Init_dct ........ 28 Figure 10. Implementation of Function Discrete Cosine Transform.... ........................ 2 9 Figure 11. Implementation of Function Inverse Discrete Cosine Transform .................. 30 Figure 12. Implementation of Function Init_mdct ...... 31 Figure 13. Implementation of Function Modified Discrete Cosine Transform ................... 32 Figure 14. Implementation of Function Inverse Modified Discrete Cosine Transform ......... 33 Figure 15. Implementation of Function Mutation ....... 35 Figure 16. Implementation of Function Crossover ...... 36 Figure 17. Implementation of Function Crossover (Continue) ........ .......................... 3 7 Figure 18. Implementation of Function Evaluate ....... 38 x Figure 19. Implementation of Function Freqmask_append ............................. 39 Figure 20. Implementation of Function Freqmask_append (Continue) ................. 40 Figure 21. Modified Discrete Cosine Transform ........ 43 Figure 22. Discrete Cosine Transform versus Modified Discrete Cosine Transform ......... 45 Figure 23. Comparing Psychoacoustic Principle with Genetic Algorithm Using Signal-to-Noise Ratio ....................... 47 Figure 24. Subjective Quality Scales .................. 48 Figure 25. Subjective Testing Result of Psychoacoustic Principle and Genetic Algorithms................................... 49 Figure 26. Subjective Testing Result of Moving Picture Experts Group-Layer3 and Ogg ...... 51 Figure 27. Codewords in 2-Dimensional Space ........... 53 xi CHAPTER ONE INTRODUCTION
Recommended publications
  • Audio Coding for Digital Broadcasting
    Recommendation ITU-R BS.1196-7 (01/2019) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-7 Foreword The role of the Radiocommunication Sector is to ensure the rational, equitable, efficient and economical use of the radio- frequency spectrum by all radiocommunication services, including satellite services, and carry out studies without limit of frequency range on the basis of which Recommendations are adopted. The regulatory and policy functions of the Radiocommunication Sector are performed by World and Regional Radiocommunication Conferences and Radiocommunication Assemblies supported by Study Groups. Policy on Intellectual Property Right (IPR) ITU-R policy on IPR is described in the Common Patent Policy for ITU-T/ITU-R/ISO/IEC referenced in Resolution ITU-R 1. Forms to be used for the submission of patent statements and licensing declarations by patent holders are available from http://www.itu.int/ITU-R/go/patents/en where the Guidelines for Implementation of the Common Patent Policy for ITU-T/ITU-R/ISO/IEC and the ITU-R patent information database can also be found. Series of ITU-R Recommendations (Also available online at http://www.itu.int/publ/R-REC/en) Series Title BO Satellite delivery BR Recording for production, archival and play-out; film for television BS Broadcasting service (sound) BT Broadcasting service (television) F Fixed service M Mobile, radiodetermination, amateur and related satellite services P Radiowave propagation RA Radio astronomy RS Remote sensing systems S Fixed-satellite service SA Space applications and meteorology SF Frequency sharing and coordination between fixed-satellite and fixed service systems SM Spectrum management SNG Satellite news gathering TF Time signals and frequency standards emissions V Vocabulary and related subjects Note: This ITU-R Recommendation was approved in English under the procedure detailed in Resolution ITU-R 1.
    [Show full text]
  • Subjective Assessment of Audio Quality – the Means and Methods Within the EBU
    Subjective assessment of audio quality – the means and methods within the EBU W. Hoeg (Deutsch Telekom Berkom) L. Christensen (Danmarks Radio) R. Walker (BBC) This article presents a number of useful means and methods for the subjective quality assessment of audio programme material in radio and television, developed and verified by EBU Project Group, P/LIST. The methods defined in several new 1. Introduction EBU Recommendations and Technical Documents are suitable for The existing EBU Recommendation, R22 [1], both operational and training states that “the amount of sound programme ma- purposes in broadcasting terial which is exchanged between EBU Members, organizations. and between EBU Members and other production organizations, continues to increase” and that “the only sufficient method of assessing the bal- ance of features which contribute to the quality of An essential prerequisite for ensuring a uniform the sound in a programme is by listening to it.” high quality of sound programmes is to standard- ize the means and methods required for their as- Therefore, “listening” is an integral part of all sessment. The subjective assessment of sound sound and television programme-making opera- quality has for a long time been carried out by tions. Despite the very significant advances of international organizations such as the (former) modern sound monitoring and measurement OIRT [2][3][4], the (former) CCIR (now ITU-R) technology, these essentially objective solutions [5] and the Member organizations of the EBU remain unable to tell us what the programme will itself. It became increasingly important that com- really sound like to the listener at home.
    [Show full text]
  • Equalization
    CHAPTER 3 Equalization 35 The most intuitive effect — OK EQ is EZ U need a Hi EQ IQ MIX SMART QUICK START: Equalization GOALS ■ Fix spectral problems such as rumble, hum and buzz, pops and wind, proximity, and hiss. ■ Fit things into the mix by leveraging of any spectral openings available and through complementary cuts and boosts on spectrally competitive tracks. ■ Feature those aspects of each instrument that players and music fans like most. GEAR ■ Master the user controls for parametric, semiparametric, program, graphic, shelving, high-pass and low-pass filters. ■ Choose the equalizer with the capabilities you need, focusing particularly on the parameters, slopes, and number of bands available. Home stereos have tone controls. We in the studio get equalizers. Perhaps bet- ter described as a “spectral modifier” or “frequency-specific amplitude adjuster,” the equalizer allows the mix engineer to increase or decrease the level of specific frequency ranges within a signal. Having an equalizer is like having multiple volume knobs for a single track. Unlike the volume knob that attenuates or boosts an entire signal, the equalizer is the tool used to turn down or turn up specific frequency portions of any audio track . Out of all signal-processing tools that we use while mixing and tracking, EQ is probably the easiest, most intuitive to use—at first. Advanced applications of equalization, however, demand a deep understanding of the effect and all its Mix Smart. © 2011 Elsevier Inc. All rights reserved. 36 Mix Smart possibilities. Don't underestimate the intellectual challenge and creative poten- tial of this essential mix processor.
    [Show full text]
  • 4. MPEG Layer-3 Audio Encoding
    MPEG Layer-3 An introduction to MPEG Layer-3 K. Brandenburg and H. Popp Fraunhofer Institut für Integrierte Schaltungen (IIS) MPEG Layer-3, otherwise known as MP3, has generated a phenomenal interest among Internet users, or at least among those who want to download highly-compressed digital audio files at near-CD quality. This article provides an introduction to the work of the MPEG group which was, and still is, responsible for bringing this open (i.e. non-proprietary) compression standard to the forefront of Internet audio downloads. 1. Introduction The audio coding scheme MPEG Layer-3 will soon celebrate its 10th birthday, having been standardized in 1991. In its first years, the scheme was mainly used within DSP- based codecs for studio applications, allowing professionals to use ISDN phone lines as cost-effective music links with high sound quality. In 1995, MPEG Layer-3 was selected as the audio format for the digital satellite broadcasting system developed by World- Space. This was its first step into the mass market. Its second step soon followed, due to the use of the Internet for the electronic distribution of music. Here, the proliferation of audio material – coded with MPEG Layer-3 (aka MP3) – has shown an exponential growth since 1995. By early 1999, “.mp3” had become the most popular search term on the Web (according to http://www.searchterms.com). In 1998, the “MPMAN” (by Saehan Information Systems, South Korea) was the first portable MP3 player, pioneering the road for numerous other manufacturers of consumer electronics. “MP3” has been featured in many articles, mostly on the business pages of newspapers and periodicals, due to its enormous impact on the recording industry.
    [Show full text]
  • EBU Evaluations of Multichannel Audio Codecs
    EBU – TECH 3324 EBU Evaluations of Multichannel Audio Codecs Status: Report Source: D/MAE Geneva September 2007 1 Page intentionally left blank. This document is paginated for recto-verso printing Tech 3324 EBU evaluations of multichannel audio codecs Contents 1. Introduction ................................................................................................... 5 2. Participating Test Sites ..................................................................................... 6 3. Selected Codecs for Testing ............................................................................... 6 3.1 Phase 1 ....................................................................................................... 9 3.2 Phase 2 ...................................................................................................... 10 4. Codec Parameters...........................................................................................10 5. Test Sequences ..............................................................................................10 5.1 Phase 1 ...................................................................................................... 10 5.2 Phase 2 ...................................................................................................... 11 6. Encoding Process ............................................................................................12 6.1 Codecs....................................................................................................... 12 6.2 Verification of bit-rate
    [Show full text]
  • Methods of Sound Data Compression \226 Comparison of Different Standards
    See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/251996736 Methods of sound data compression — Comparison of different standards Article CITATIONS READS 2 151 2 authors, including: Wojciech Zabierowski Lodz University of Technology 123 PUBLICATIONS 96 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: How to biuld correct web application View project All content following this page was uploaded by Wojciech Zabierowski on 11 June 2014. The user has requested enhancement of the downloaded file. 1 Methods of sound data compression – comparison of different standards Norbert Nowak, Wojciech Zabierowski Abstract - The following article is about the methods of multimedia devices, DVD movies, digital television, data sound data compression. The technological progress has transmission, the Internet, etc. facilitated the process of recording audio on different media such as CD-Audio. The development of audio Modeling and coding data compression has significantly made our lives One's requirements decide what type of compression he easier. In recent years, much has been achieved in the applies. However, the choice between lossy or lossless field of audio and speech compression. Many standards method also depends on other factors. One of the most have been established. They are characterized by more important is the characteristics of data that will be better sound quality at lower bitrate. It allows to record compressed. For instance, the same algorithm, which the same CD-Audio formats using "lossy" or lossless effectively compresses the text may be completely useless compression algorithms in order to reduce the amount in the case of video and sound compression.
    [Show full text]
  • Evaluation of Sound Quality of High Resolution Audio
    Proceedings of the 1st IEEE/IIAE International Conference on Intelligent Systems and Image Processing 2013 Evaluation of Sound Quality of High Resolution Audio Naoto Kanetadaa,*, Ryuta Yamamotob, Mitsunori Mizumachi aKyushu Institute of Technology,1-1 Sensui-cho, Tobata-ku, Kitakyushu 804-8550, Japan bDigifusion Japan Co.,Ltd 1-1-68 Futabanosato Higashi-ku Hirosima,7320057 Japan *Corresponding Author: [email protected] Abstract 1. Introduction High resolution audio (HRA), which is recorded in the digital audio format with high sound quality, appears on the In recent years, high resolution audio (HRA), which is audio market. HRA has the quality equal to or better than sampled at 96 kHz or 192 kHz with 24 bits accuracy, is the standard compact disc (CD), and is distributed as the becoming popular in the audio market. HRA is super audio CD (SACD), DVD-audio, Blue-ray audio, and commercially distributed as the lossless encoded file via the a data file through the internet. In this paper, sound quality internet, and is also available in the Blu-ray audio disc. of HRA is investigated in the view point of auditory Compact disc (CD) and lossy compression such as MPEG perception. Perceptual characteristics of HRA have been Audio Layer-3 (MP3) are the current major audio formats examined by listening tests as compared with the standard as the storage medium and the data file, respectively. As a audio CD and the compressed MPEG audio layer-3 (MP3) memory capacity increases and a wide communication qualities. The listening tests were carried out by the method network spreads out, HRA must be increasingly popular.
    [Show full text]
  • Sound Quality of Audio Systems
    Loudspeaker Data – Reliable, Comprehensive, Interpretable Introduction Biography: 1977-1982 Study Electrical Engineering, TU Dresden 1982-1990 R&D Engineer VEB RFT, Leipzig, 1992-1993 Scholarship at the University Waterloo (Canada) 1993-1995 Harman International, USA 1995-1997 Consultancy 1997 Managing the KLIPPEL GmbH 2007 Professor for Electro-acoustics, TU Dresden My interests and experiences: • electro-acustics, loudspeakers • digital signal processing applied to audio Wolfgang Klippel • psycho-acoustics and measurement techniques Klippel GmbH Agenda Left Right Audio Audio Channel Channel Audio-System Transducer Perception (Transducer, DSP, (woofer, tweeter) Final Audio Application Amplifier) (Room, Speaker, Listening Position, Stimulus) 1. Perceptual and physical evaluation at the listening point perceptive modeling & sound quality assessment auralization techniques & systematic listening tests 2. Output-based evaluation of (active) audio systems holografic near field measurement of 3D sound output prediction of far field and room interaction nonlinear distortion at max. SPL 3. Comprehensive description of the passive transducer parameters (H(f), T/S, nonlinear, thermal) symptoms (THD, IMD, rub&buzz, power handling) 3 Objectives Left Right Audio Audio Channel Channel Audio-System Transducer Perception (Transducer, DSP, (woofer, tweeter) Final Audio Application Amplifier) (Room, Speaker, Listening Position, Stimulus) • clear definition of sound quality in target application • filling the gap between measurement and listening • numerical evaluation of design choices • meaningful transducer data for DSP and system design • selection of optimal components • maximal performance-cost ratio • smooth communication between customer and supplier 4 Objective Methods for Assessing Loudspeakers Room Loudspeaker Parameter-Based Parameters Parameters Method e.g. T/S parameter, amplitude and phase response, nonlinear and thermal parameters Loudspeaker- Psychoacoustical Stimulus Room Model Model Sensations nonlinear nonlinear e.g.
    [Show full text]
  • White Paper: the AAC Audio Coding Family for Broadcast and Cable TV
    WHITE PAPER Fraunhofer Institute for THE AAC AUDIO CODING FAMILY FOR Integrated Circuits IIS BROADCAST AND CABLE TV Management of the institute Prof. Dr.-Ing. Albert Heuberger (executive) Over the last few years, the AAC audio codec family has played an increasingly important Dr.-Ing. Bernhard Grill role as an enabling technology for state-of-the-art multimedia systems. The codecs Am Wolfsmantel 33 combine high audio quality with very low bit-rates, allowing for an impressive audio 91058 Erlangen experience even over channels with limited bandwidth, such as those in broadcasting or www.iis.fraunhofer.de mobile multimedia streaming. The AAC audio coding family also includes special low delay versions, which allow high quality two way communication. Contact Matthias Rose Phone +49 9131 776-6175 [email protected] Contact USA Fraunhofer USA, Inc. Digital Media Technologies* Phone +1 408 573 9900 [email protected] Contact China Toni Fiedler [email protected] Contact Japan Fahim Nawabi Phone: +81 90-4077-7609 [email protected] Contact Korea Youngju Ju Phone: +82 2 948 1291 [email protected] * Fraunhofer USA Digital Media Technologies, a division of Fraunhofer USA, Inc., promotes and supports the products of Fraunhofer IIS in the U. S. Audio and Media Technologies The AAC Audio Coding Family for Broadcast and Cable TV www.iis.fraunhofer.de/audio 1 / 12 A BRIEF HISTORY AND OVERVIEW OF THE MPEG ADVANCED AUDIO CODING FAMILY The first version of Advanced Audio Coding (AAC) was standardized in 1994 as part of the MPEG-2 standard.
    [Show full text]
  • Lecture #2 – Digital Audio Basics
    CSC 170 – Introduction to Computers and Their Applications Lecture #2 – Digital Audio Basics Digital Audio Basics • Digital audio is music, speech, and other sounds represented in binary format for use in digital devices. • Most digital devices have a built-in microphone and audio software, so recording external sounds is easy. 1 Digital Audio Basics • To digitally record sound, samples of a sound wave are collected at periodic intervals and stored as numeric data in an audio file. • Sound waves are sampled many times per second by an analog-to-digital converter . • A digital-to-analog converter transforms the digital bits into analog sound waves. Digital Audio Basics 2 Digital Audio Basics • Sampling rate refers to the number of times per second that a sound is measured during the recording process. • Higher sampling rates increase the quality of the recording but require more storage space. Digital Audio File Formats • A digital file can be identified by its type or its file extension, such as Thriller.mp3 (an audio file). • The most popular digital audio formats are: AAC, MP3, Ogg, Vorbis, WAV, FLAC, and WMA . 3 Digital Audio File Formats AUDIO FORMAT EXTENSION ADVANTAGES DISADVANTAGES AAC (Advanced Audio .aac, .m4p, or .mp4 Very good sound quality Files can be copy protected Coding) based on MPEG-4; lossy so that use is limited to compression; used for approved devices iTunes music MP3 (also called MPEG-1 .mp3 Good sound quality; lossy Might require a standalone Layer 3) compression; can be player or browser plugin streamed over the
    [Show full text]
  • Introduction to Computer Music Week 13 Audio Data Compression Version 1, 13 Nov 2018
    Introduction to Computer Music Week 13 Audio Data Compression Version 1, 13 Nov 2018 Roger B. Dannenberg Topics Discussed: Compression Techniques, Coding Redundancy, Intersample Redundancy, Psycho-Perceptual Redundancy, MP3, LPC, Physical Models/Speech Synthesis, Music Notation 1 Introduction to General Compression Techniques High-quality audio takes a lot of space—about 10M bytes/minute for CD-quality stereo. Now that disk capacity is measured in terabytes, storing uncompressed audio is feasible even for personal libraries, but compression is still useful for smaller devices, downloading audio over the Internet, or sometimes just for convenience. What can we do to reduce the data explosion? Data compression is simply storing the same information in a shorter string of symbols (bits). The goal is to store the most information in the smallest amount of space, without compromising the quality of the signal (or at least, compromising it as little as possible). Compression techniques and research are not limited to digital sound—–data compression plays an essential part in the storage and transmission of all types of digital information, from word- processing documents to digital photographs to full-screen, full-motion videos. As the amount of information in a medium increases, so does the importance of data compression. What is compression exactly, and how does it work? There is no one thing that is “data compression.” Instead, there are many different approaches, each addressing a different aspect of the problem. We’ll take a look at a few ways to compress digital audio information. What’s important about these different ways of compressing data is that they tend to illustrate some basic ideas in the representation of information, particularly sound, in the digital world.
    [Show full text]
  • B I C T Basic Concepts
    Audio BiCBasic Concept s Audio in Multimedia Digital Audio: 9 SdhhbSound that has been capture d or create d electronically by a computer 9 In a multimedia production, sound and music are crucial in helping to establish moods and create environments. History of Digital Audio Some important dates in history include: 91975 – Digital tape recording becomes common in professional recording studios 91980 – Multi-track digital audio recorder is introduced 91981 – Philips introduces the compact disk ()(CD) 91999- MP3 format becomes the accepted digital format for distribution for audio on the Internet History of Digital Audio Important dates (cont.): 9 1999 – Napster (a P2P [peer to peer] – file distribution program) is launched. This technology ushered in an era of illegal file sharing of music on the Internet 9 2001 – Apple iPod is introduced 9 2001 – XM Radio is launched as the first satellite radio service Uses of Digital Audio How digital audio is used: 9Educa tion 9Information 9Entertainment 9Advertising Digital Audio Digital Audio Recording: 9 Dig ita l recor ding dev ices cap ture soun d by sampling the sound waves. 9 Sampling – reproducing a sound or motion by recording many fragments of it. 9 VU meter (Volume units) – displays the audio signal level Factors in Audio File Size The qua lity an d s ize o f dig ita l au dio depen ds on: 9 The sampling rate – number of times per second a device records a sample of the sound wave being created 9 The sample size (audio resolution) – number of bits of data in each sample 9 The number of channels – streams of audio 9 The time span of the recording - length Factors in Audio File Size The sampling rate: 9 The number of times per second a device records a sample of the sound wave being created 9 A sample is a small fragment of a sound wave 9 It takes many samples to accurately reproduce sound 9 Measured in kilohertz (kHz).
    [Show full text]