Comparison of Lossless Video and Image Compression Codecs for Medical Computed Tomography Datasets Vy Bui1,3, Lin-Ching Chang1, Dunling Li2, Li-Yueh Hsu3, Marcus Y

Total Page:16

File Type:pdf, Size:1020Kb

Comparison of Lossless Video and Image Compression Codecs for Medical Computed Tomography Datasets Vy Bui1,3, Lin-Ching Chang1, Dunling Li2, Li-Yueh Hsu3, Marcus Y 2016 IEEE International Conference on Big Data (Big Data) Comparison of Lossless Video and Image Compression Codecs for Medical Computed Tomography Datasets Vy Bui1,3, Lin-Ching Chang1, Dunling Li2, Li-Yueh Hsu3, Marcus Y. Chen3 1Department of Electrical Engineering and Computer Science, The Catholic University of America, Washington, DC, USA 2BTS Software Solutions, Columbia, Maryland, USA 3National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland, USA [email protected], [email protected], [email protected], [email protected], [email protected] Abstract—Modern multidimensional medical imaging efficiency of lossless compression for volumetric CT datasets technology produces very large amount of data especially from using five state-of-the-art video codecs, namely H264/AVC, the computed tomography modality. These volumetric dataset H265/HEVC, Lagarith, MSU, MLC, and compare them with opens new demands for big data storage and high-speed still-image codecs JPEG, JPEG2000, JPEG-LS as specified in communication systems which may be alleviated by image the DICOM standard [2]. compression techniques. Current image compression schemes adopted in the DICOM standard do not exploit the inter-slice II. METHODS correlation within three-dimensional (3D) dataset. Video compression may have a potential to improve the compression A. Data Description ratio by reducing the redundancy in volumetric medical images. Medical CT image datasets were obtained from 20 patients In this paper, we compare the performance of five lossless video underwent routine clinical examinations. All subjects gave codecs (H264, H265, Lagarith, MSU, MLC) and three still-image written consent form approved by the institutional review codecs (JPEG, JPEG2000, JPEG-LS) using 3D medical board of National Heart, Lung, and Blood Institute. Volumetric computed tomography datasets. Performance evaluation shows images were reconstructed at 2.0 mm slice thickness, 1.0 mm that video codecs improve the compression ratio compared to slice overlay, 0.5 mm pixel size, and 512×512 image matrix JPEG and JPEG2000 while being competitive to JPEG-LS. size. The number of frames in each dataset ranges from 591 to Keywords— lossless compression; compression ratio; video 751 slices covering chest, abdomen, and pelvis of the patients. codecs; medical imaging; computed tomography. All studies were anonymized and converted to 8-bit DICOM images from the original 16-bit DICOM 3D images, and then I. INTRODUCTION to 8-bit mono channel video sequence stored in AVI format. Advances in medical imaging technology have facilitated a B. Optimized 16-to-8 bit Data Conversion significant increase of the volume and size of image data. In order to optimize the 16-to-8 bit data conversion and Large volumes of medical datasets as generated by different preserve the original signal and contrast levels, the stored pixel imaging modalities such as computed tomography (CT), values were calibrated to Hounsfield Unit (HU) using modality magnetic resonance imaging, positron emission tomography look-up-table (LUT) transformation. In the DICOM header, and single photon emission tomography, have led to a growing LUT transformation’s attributes are specified in tags demand for big and efficient storage systems. Report from (0028,1052) and (0028,1053) representing intercept and slope EMC Corporation and the International Data Corporation for calculating the calibrated HU using the following equation: anticipates healthcare data to grow at 48% per year, reaching 2.3 zettabytes by year 2020 [1]. Therefore, medical image = × + (1) compression plays an important role in the ability to not only reduce data storage cost but also improve transmission time for Next, window width and window center/level settings as large image studies such as high-resolution three-dimensional specified in tags (0028,1050) and (0028,1051) are adjusted to (3D) CT datasets. improve image contrast for visualization of different Current compression standards for digital imaging and anatomical structures such as lung, heart, vessel, bone, or other communications in medicine (DICOM) images use still-image soft tissues. These two parameters specify a linear conversion codecs which do not exploit inter-frame redundancies as from stored pixel values to a more restricted dynamic range for commonly used in video compression algorithms. As the optimal image display. In this study, a soft-tissue setting of corresponding pixel values between adjacent slices are window width of 380 and window center/ level of 40 are used typically very similar in volumetric medical images, video to optimize the display of anatomical structures in chest, compression techniques that exploit such inter-frame abdomen, and pelvis regions. Under this optimized redundancies may improve the compression ratio than those window/level display range, 16-bit pixel values are rescaled still-image codecs. The objective of this work is to explore the into 8-bit dynamic range (0- 255) for image compression. 978-1-4673-9005-7/16/$31.00 ©2016 IEEE 3960 C. Evaluation Metrics improvement of 30.05% in CR was obtained by the JPEG-LS Three evaluation metrics were computed to compare the codec. When comparing the video and still-image compression performance different codecs for their compression ratio, methods, both JPEG2000 and JPEG have a lower CR than all video codecs tested. However, JPEG-LS delivers a CR encoding cost, and decoding cost. The compression ratio (CR) comparable to MSU and MLC, but slightly better than H264, is defined as the ratio of the total number of bytes in the H265, and Lagarith. original image versus the compressed data. The encoding and decoding costs are defined as the total time required to Figure 2 shows the total computational overhead for both compress or decompress the input data divided by the data encoding and decoding time. In general, all video codecs and size. This cost represents the encoding or decoding time (in still-image compression methods have a lower decoding cost millisecond) required for each megabyte of data. compared to the encoding cost except for JPEG-LS (i.e., its decoding cost is higher than the encoding cost). The JPEG-LS D. Software Implementation has the lowest encoding cost while the JPEG and JPEG2000 We used IDL/Merge DICOM toolkit (Exelis Visual have the lowest decoding cost. The encoding speed of video Information Solutions, Boulder, Colorado) to import the pixel codecs is faster than JPEG2000 except for MSU (i.e., the data from DICOM images and convert them into 8-bit AVI encoding cost of MSU is slightly higher than other video video files. An open-source video processing utility Virtual codecs and JPEG2000). Surprisingly, the popular JPEG Dub [3] was used to process the AVI files for Lagarith, MSU compression has a very high encoding cost than other and MLC compression. For other codecs, we used an open- compression methods. Noted that MSU or MLC have a relative higher decoding cost than all still-image compressions. source multimedia application FFmpeg [4] for H264, H265 and JPEG-LS compression, and the IDL/Merge DICOM TABLE I. SUMMARY STATISTICS FOR COMPRESSION RATIO toolkit for JPEG2000 and JPEG compression. A batch COMPARISON. processing script was implemented in C++ under Microsoft Visual Studio 2015 to process all video files in RGB color Codec mean ± standard deviation space. The processing time was computed with an Intel JPEG (16-bit) 2.45±0.01 CoreTM i5-3337U 1.80 GHz processor. Since most of the JPEG2000 (16-bit) 3.48±0.02 codecs support different parameter configurations, we used the JPEG-LS (16-bit) 3.66±0.02 parameter settings that maximized the compression ratio for JPEG (8-bit) 2.69±0.09 each codecs unless otherwise specified. JPEG2000 (8-bit) 3.81±0.29 JPEG-LS (8-bit) 4.76±0.69 III. RESULTS H264 4.43±0.57 Table 1 summarizes the descriptive statistics of the H265 4.33±0.51 compression ratio for all codecs. Figure 1 shows the results of Lagarith 4.26±0.50 compression ratios using the box-and-whisker plot. For still- MSU 4.73±0.67 image compression codecs, the 16-to-8 bit conversion MLC 4.75±0.69 improves the CR by 9.80% and 9.48% for JPEG and JPEG2000 compression. Furthermore, a significantly large Fig. 1. Comparison of compression ratio 3961 Fig. 2. Compariosn of encoding and decoding costs (millisecond/megabyte). IV. DISCUSSION ACKNOWLEDGMENT We evaluated five lossless video compression methods on Part of this work was supported by the U.S. National volumetric medical computer tomography image datasets and Science Foundation under Grants CNS-1624485. showed their performance is consistently better than JPEG and JPEG2000 still-image compression schemes. In contrast, the REFERENCES compression ratio of JPEG-LS is similar to the video codecs [1] EMC Corporation, and International Data Corporation 2014. “The results. We also compared the CR between original 16-bit and Digital Universe: Driving Data Growth in Healthcare.” optimized 8-bit data. The results show the CR of the optimized [2] DICOM part 5. http://dicom.nema.org 8-bit data is significantly better than the original 16-bit images [3] VirtualDub. http://www.virtualdub.org for all still-image codecs compared. [4] FFmpeg. http://trac.ffmpeg.org Comparisons on lossless compression for 3D medical [5] Clunie DA. "Lossless compression of grayscale medical images: effectiveness of traditional and state-of-the-art approaches." Proc. SPIE images have been studied previously [5-10]. Various lossless 3980, Medical Imaging 2000, pp.74-84. still-image compression methods have been compared using [6] Philips W, Van Assche S, De Rycke D, Denecker K. "State-of-the-art different image modalities. Some studies applied video techniques for lossless compression of 3D medical image sets." compression techniques on medical images [8-10], but limited Computerized Medical Imaging and Graphics, 2001, 25, pp.173-185. their evaluation to H264 and H265 only.
Recommended publications
  • A Survey Paper on Different Speech Compression Techniques
    Vol-2 Issue-5 2016 IJARIIE-ISSN (O)-2395-4396 A Survey Paper on Different Speech Compression Techniques Kanawade Pramila.R1, Prof. Gundal Shital.S2 1 M.E. Electronics, Department of Electronics Engineering, Amrutvahini College of Engineering, Sangamner, Maharashtra, India. 2 HOD in Electronics Department, Department of Electronics Engineering , Amrutvahini College of Engineering, Sangamner, Maharashtra, India. ABSTRACT This paper describes the different types of speech compression techniques. Speech compression can be divided into two main types such as lossless and lossy compression. This survey paper has been written with the help of different types of Waveform-based speech compression, Parametric-based speech compression, Hybrid based speech compression etc. Compression is nothing but reducing size of data with considering memory size. Speech compression means voiced signal compress for different application such as high quality database of speech signals, multimedia applications, music database and internet applications. Today speech compression is very useful in our life. The main purpose or aim of speech compression is to compress any type of audio that is transfer over the communication channel, because of the limited channel bandwidth and data storage capacity and low bit rate. The use of lossless and lossy techniques for speech compression means that reduced the numbers of bits in the original information. By the use of lossless data compression there is no loss in the original information but while using lossy data compression technique some numbers of bits are loss. Keyword: - Bit rate, Compression, Waveform-based speech compression, Parametric-based speech compression, Hybrid based speech compression. 1. INTRODUCTION -1 Speech compression is use in the encoding system.
    [Show full text]
  • Download Media Player Codec Pack Version 4.1 Media Player Codec Pack
    download media player codec pack version 4.1 Media Player Codec Pack. Description: In Microsoft Windows 10 it is not possible to set all file associations using an installer. Microsoft chose to block changes of file associations with the introduction of their Zune players. Third party codecs are also blocked in some instances, preventing some files from playing in the Zune players. A simple workaround for this problem is to switch playback of video and music files to Windows Media Player manually. In start menu click on the "Settings". In the "Windows Settings" window click on "System". On the "System" pane click on "Default apps". On the "Choose default applications" pane click on "Films & TV" under "Video Player". On the "Choose an application" pop up menu click on "Windows Media Player" to set Windows Media Player as the default player for video files. Footnote: The same method can be used to apply file associations for music, by simply clicking on "Groove Music" under "Media Player" instead of changing Video Player in step 4. Media Player Codec Pack Plus. Codec's Explained: A codec is a piece of software on either a device or computer capable of encoding and/or decoding video and/or audio data from files, streams and broadcasts. The word Codec is a portmanteau of ' co mpressor- dec ompressor' Compression types that you will be able to play include: x264 | x265 | h.265 | HEVC | 10bit x265 | 10bit x264 | AVCHD | AVC DivX | XviD | MP4 | MPEG4 | MPEG2 and many more. File types you will be able to play include: .bdmv | .evo | .hevc | .mkv | .avi | .flv | .webm | .mp4 | .m4v | .m4a | .ts | .ogm .ac3 | .dts | .alac | .flac | .ape | .aac | .ogg | .ofr | .mpc | .3gp and many more.
    [Show full text]
  • Arxiv:2004.10531V1 [Cs.OH] 8 Apr 2020
    ROOT I/O compression improvements for HEP analysis Oksana Shadura1;∗ Brian Paul Bockelman2;∗∗ Philippe Canal3;∗∗∗ Danilo Piparo4;∗∗∗∗ and Zhe Zhang1;y 1University of Nebraska-Lincoln, 1400 R St, Lincoln, NE 68588, United States 2Morgridge Institute for Research, 330 N Orchard St, Madison, WI 53715, United States 3Fermilab, Kirk Road and Pine St, Batavia, IL 60510, United States 4CERN, Meyrin 1211, Geneve, Switzerland Abstract. We overview recent changes in the ROOT I/O system, increasing per- formance and enhancing it and improving its interaction with other data analy- sis ecosystems. Both the newly introduced compression algorithms, the much faster bulk I/O data path, and a few additional techniques have the potential to significantly to improve experiment’s software performance. The need for efficient lossless data compression has grown significantly as the amount of HEP data collected, transmitted, and stored has dramatically in- creased during the LHC era. While compression reduces storage space and, potentially, I/O bandwidth usage, it should not be applied blindly: there are sig- nificant trade-offs between the increased CPU cost for reading and writing files and the reduce storage space. 1 Introduction In the past years LHC experiments are commissioned and now manages about an exabyte of storage for analysis purposes, approximately half of which is used for archival purposes, and half is used for traditional disk storage. Meanwhile for HL-LHC storage requirements per year are expected to be increased by factor 10 [1]. arXiv:2004.10531v1 [cs.OH] 8 Apr 2020 Looking at these predictions, we would like to state that storage will remain one of the major cost drivers and at the same time the bottlenecks for HEP computing.
    [Show full text]
  • Lossless Compression of Audio Data
    CHAPTER 12 Lossless Compression of Audio Data ROBERT C. MAHER OVERVIEW Lossless data compression of digital audio signals is useful when it is necessary to minimize the storage space or transmission bandwidth of audio data while still maintaining archival quality. Available techniques for lossless audio compression, or lossless audio packing, generally employ an adaptive waveform predictor with a variable-rate entropy coding of the residual, such as Huffman or Golomb-Rice coding. The amount of data compression can vary considerably from one audio waveform to another, but ratios of less than 3 are typical. Several freeware, shareware, and proprietary commercial lossless audio packing programs are available. 12.1 INTRODUCTION The Internet is increasingly being used as a means to deliver audio content to end-users for en­ tertainment, education, and commerce. It is clearly advantageous to minimize the time required to download an audio data file and the storage capacity required to hold it. Moreover, the expec­ tations of end-users with regard to signal quality, number of audio channels, meta-data such as song lyrics, and similar additional features provide incentives to compress the audio data. 12.1.1 Background In the past decade there have been significant breakthroughs in audio data compression using lossy perceptual coding [1]. These techniques lower the bit rate required to represent the signal by establishing perceptual error criteria, meaning that a model of human hearing perception is Copyright 2003. Elsevier Science (USA). 255 AU rights reserved. 256 PART III / APPLICATIONS used to guide the elimination of excess bits that can be either reconstructed (redundancy in the signal) orignored (inaudible components in the signal).
    [Show full text]
  • Comparison of Image Compressions: Analog Transformations P
    Proceedings Comparison of Image Compressions: Analog † Transformations P Jose Balsa P CITIC Research Center, Universidade da Coruña (University of A Coruña), 15071 A Coruña, Spain; [email protected] † Presented at the 3rd XoveTIC Conference, A Coruña, Spain, 8–9 October 2020. Published: 21 August 2020 Abstract: A comparison between the four most used transforms, the discrete Fourier transform (DFT), discrete cosine transform (DCT), the Walsh–Hadamard transform (WHT) and the Haar- wavelet transform (DWT), for the transmission of analog images, varying their compression and comparing their quality, is presented. Additionally, performance tests are done for different levels of white Gaussian additive noise. Keywords: analog image transformation; analog image compression; analog image quality 1. Introduction Digitized image coding systems employ reversible mathematical transformations. These transformations change values and function domains in order to rearrange information in a way that condenses information important to human vision [1]. In the new domain, it is possible to filter out relevant information and discard information that is irrelevant or of lesser importance for image quality [2]. Both digital and analog systems use the same transformations in source coding. Some examples of digital systems that employ these transformations are JPEG, M-JPEG, JPEG2000, MPEG- 1, 2, 3 and 4, DV and HDV, among others. Although digital systems after transformation and filtering make use of digital lossless compression techniques, such as Huffman. In this work, we aim to make a comparison of the most commonly used transformations in state- of-the-art image compression systems. Typically, the transformations used to compress analog images work either on the entire image or on regions of the image.
    [Show full text]
  • Input Formats & Codecs
    Input Formats & Codecs Pivotshare offers upload support to over 99.9% of codecs and container formats. Please note that video container formats are independent codec support. Input Video Container Formats (Independent of codec) 3GP/3GP2 ASF (Windows Media) AVI DNxHD (SMPTE VC-3) DV video Flash Video Matroska MOV (Quicktime) MP4 MPEG-2 TS, MPEG-2 PS, MPEG-1 Ogg PCM VOB (Video Object) WebM Many more... Unsupported Video Codecs Apple Intermediate ProRes 4444 (ProRes 422 Supported) HDV 720p60 Go2Meeting3 (G2M3) Go2Meeting4 (G2M4) ER AAC LD (Error Resiliant, Low-Delay variant of AAC) REDCODE Supported Video Codecs 3ivx 4X Movie Alaris VideoGramPiX Alparysoft lossless codec American Laser Games MM Video AMV Video Apple QuickDraw ASUS V1 ASUS V2 ATI VCR-2 ATI VCR1 Auravision AURA Auravision Aura 2 Autodesk Animator Flic video Autodesk RLE Avid Meridien Uncompressed AVImszh AVIzlib AVS (Audio Video Standard) video Beam Software VB Bethesda VID video Bink video Blackmagic 10-bit Broadway MPEG Capture Codec Brooktree 411 codec Brute Force & Ignorance CamStudio Camtasia Screen Codec Canopus HQ Codec Canopus Lossless Codec CD Graphics video Chinese AVS video (AVS1-P2, JiZhun profile) Cinepak Cirrus Logic AccuPak Creative Labs Video Blaster Webcam Creative YUV (CYUV) Delphine Software International CIN video Deluxe Paint Animation DivX ;-) (MPEG-4) DNxHD (VC3) DV (Digital Video) Feeble Files/ScummVM DXA FFmpeg video codec #1 Flash Screen Video Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Forward Uncompressed Video Codec fox motion video FRAPS:
    [Show full text]
  • Data Compression
    Data Compression Data Compression Compression reduces the size of a file: ! To save space when storing it. ! To save time when transmitting it. ! Most files have lots of redundancy. Who needs compression? ! Moore's law: # transistors on a chip doubles every 18-24 months. ! Parkinson's law: data expands to fill space available. ! Text, images, sound, video, . All of the books in the world contain no more information than is Reference: Chapter 22, Algorithms in C, 2nd Edition, Robert Sedgewick. broadcast as video in a single large American city in a single year. Reference: Introduction to Data Compression, Guy Blelloch. Not all bits have equal value. -Carl Sagan Basic concepts ancient (1950s), best technology recently developed. Robert Sedgewick and Kevin Wayne • Copyright © 2005 • http://www.Princeton.EDU/~cos226 2 Applications of Data Compression Encoding and Decoding hopefully uses fewer bits Generic file compression. Message. Binary data M we want to compress. ! Files: GZIP, BZIP, BOA. Encode. Generate a "compressed" representation C(M). ! Archivers: PKZIP. Decode. Reconstruct original message or some approximation M'. ! File systems: NTFS. Multimedia. M Encoder C(M) Decoder M' ! Images: GIF, JPEG. ! Sound: MP3. ! Video: MPEG, DivX™, HDTV. Compression ratio. Bits in C(M) / bits in M. Communication. ! ITU-T T4 Group 3 Fax. Lossless. M = M', 50-75% or lower. ! V.42bis modem. Ex. Natural language, source code, executables. Databases. Google. Lossy. M ! M', 10% or lower. Ex. Images, sound, video. 3 4 Ancient Ideas Run-Length Encoding Ancient ideas. Natural encoding. (19 " 51) + 6 = 975 bits. ! Braille. needed to encode number of characters per line ! Morse code.
    [Show full text]
  • The Deep Learning Solutions on Lossless Compression Methods for Alleviating Data Load on Iot Nodes in Smart Cities
    sensors Article The Deep Learning Solutions on Lossless Compression Methods for Alleviating Data Load on IoT Nodes in Smart Cities Ammar Nasif *, Zulaiha Ali Othman and Nor Samsiah Sani Center for Artificial Intelligence Technology (CAIT), Faculty of Information Science & Technology, University Kebangsaan Malaysia, Bangi 43600, Malaysia; [email protected] (Z.A.O.); [email protected] (N.S.S.) * Correspondence: [email protected] Abstract: Networking is crucial for smart city projects nowadays, as it offers an environment where people and things are connected. This paper presents a chronology of factors on the development of smart cities, including IoT technologies as network infrastructure. Increasing IoT nodes leads to increasing data flow, which is a potential source of failure for IoT networks. The biggest challenge of IoT networks is that the IoT may have insufficient memory to handle all transaction data within the IoT network. We aim in this paper to propose a potential compression method for reducing IoT network data traffic. Therefore, we investigate various lossless compression algorithms, such as entropy or dictionary-based algorithms, and general compression methods to determine which algorithm or method adheres to the IoT specifications. Furthermore, this study conducts compression experiments using entropy (Huffman, Adaptive Huffman) and Dictionary (LZ77, LZ78) as well as five different types of datasets of the IoT data traffic. Though the above algorithms can alleviate the IoT data traffic, adaptive Huffman gave the best compression algorithm. Therefore, in this paper, Citation: Nasif, A.; Othman, Z.A.; we aim to propose a conceptual compression method for IoT data traffic by improving an adaptive Sani, N.S.
    [Show full text]
  • Comparison Between (RLE and Huffman) Algorithmsfor Lossless Data Compression Dr
    Amin Mubarak Alamin Ibrahim* et al. (IJITR) INTERNATIONAL JOURNAL OF INNOVATIVE TECHNOLOGY AND RESEARCH Volume No.3, Issue No.1, December – January 2015, 1808 – 1812. Comparison Between (RLE And Huffman) Algorithmsfor Lossless Data Compression Dr. AMIN MUBARK ALAMIN IBRAHIM Dr. MUSTAFA ELGILI MUSTAFA Department of mathematics Computer Science Department Faculty of Education Community College Shagra University Shaqra University Afif, Saudi Arabia Shaqra, Saudi Arabia Home affilation Assistant Professor, Faculty of Computer Science and Information Technology–Neelian University, Khartoum, Sudan Abstract: Multimedia field is distinguished from other areas of the need for massive storage volumes. This caused a lot of problems, particularly the speed of reading files when (transmission and reception) and increase the cost (up capacities petition) was to be the presence of ways we can get rid of these problems resulting from the increase Size was one of the successful solutions innovation algorithms to compress files. This paper aims to compare between (RLE and Huffman) algorithms which are also non- compression algorithms devoid texts, according to the standard file size. Propagated the comparison between the original file size and file size after compression using (RLE & HUFFMAN) algorithms for more than (30) text file. We used c++ program to compress the files and Microsoft excel program in the description analysis so as to calculate the compression ratio and others things. The study pointed to the effectiveness of the algorithm (HUFFMAN) in the process of reducing the size of the files. Keywords: RlE,Huffman I. INTRODUCTION compression), and the computational resources required to compress and uncompressed the In computer science and information theory, data data.[5] However, the most important reason for compression, source coding,[1] or bit-rate compressing data is that more and more we share reduction involves encoding information using data.
    [Show full text]
  • Source Coding Basics and Speech Coding
    Source Coding Basics and Speech Coding Yao Wang Polytechnic University, Brooklyn, NY11201 http://eeweb.poly.edu/~yao Outline • Why do we need to compress speech signals • Basic components in a source coding system • Variable length binary encoding: Huffman coding • Speech coding overview • Predictive coding: DPCM, ADPCM • Vocoder • Hybrid coding • Speech coding standards ©Yao Wang, 2006 EE3414: Speech Coding 2 Why do we need to compress? • Raw PCM speech (sampled at 8 kbps, represented with 8 bit/sample) has data rate of 64 kbps • Speech coding refers to a process that reduces the bit rate of a speech file • Speech coding enables a telephone company to carry more voice calls in a single fiber or cable • Speech coding is necessary for cellular phones, which has limited data rate for each user (<=16 kbps is desired!). • For the same mobile user, the lower the bit rate for a voice call, the more other services (data/image/video) can be accommodated. • Speech coding is also necessary for voice-over-IP, audio-visual teleconferencing, etc, to reduce the bandwidth consumption over the Internet ©Yao Wang, 2006 EE3414: Speech Coding 3 Basic Components in a Source Coding System Input Transformed Quantized Binary Samples parameters parameters bitstreams Transfor- Quanti- Binary mation zation Encoding Lossy Lossless Prediction Scalar Q Fixed length Transforms Vector Q Variable length Model fitting (Huffman, …... arithmetic, LZW) • Motivation for transformation --- To yield a more efficient representation of the original samples. ©Yao Wang, 2006 EE3414:
    [Show full text]
  • Supported Codecs and Formats Codecs
    Supported Codecs and Formats Codecs: D..... = Decoding supported .E.... = Encoding supported ..V... = Video codec ..A... = Audio codec ..S... = Subtitle codec ...I.. = Intra frame-only codec ....L. = Lossy compression .....S = Lossless compression ------- D.VI.. 012v Uncompressed 4:2:2 10-bit D.V.L. 4xm 4X Movie D.VI.S 8bps QuickTime 8BPS video .EVIL. a64_multi Multicolor charset for Commodore 64 (encoders: a64multi ) .EVIL. a64_multi5 Multicolor charset for Commodore 64, extended with 5th color (colram) (encoders: a64multi5 ) D.V..S aasc Autodesk RLE D.VIL. aic Apple Intermediate Codec DEVIL. amv AMV Video D.V.L. anm Deluxe Paint Animation D.V.L. ansi ASCII/ANSI art DEVIL. asv1 ASUS V1 DEVIL. asv2 ASUS V2 D.VIL. aura Auravision AURA D.VIL. aura2 Auravision Aura 2 D.V... avrn Avid AVI Codec DEVI.. avrp Avid 1:1 10-bit RGB Packer D.V.L. avs AVS (Audio Video Standard) video DEVI.. avui Avid Meridien Uncompressed DEVI.. ayuv Uncompressed packed MS 4:4:4:4 D.V.L. bethsoftvid Bethesda VID video D.V.L. bfi Brute Force & Ignorance D.V.L. binkvideo Bink video D.VI.. bintext Binary text DEVI.S bmp BMP (Windows and OS/2 bitmap) D.V..S bmv_video Discworld II BMV video D.VI.S brender_pix BRender PIX image D.V.L. c93 Interplay C93 D.V.L. cavs Chinese AVS (Audio Video Standard) (AVS1-P2, JiZhun profile) D.V.L. cdgraphics CD Graphics video D.VIL. cdxl Commodore CDXL video D.V.L. cinepak Cinepak DEVIL. cljr Cirrus Logic AccuPak D.VI.S cllc Canopus Lossless Codec D.V.L.
    [Show full text]
  • A Scalable High-Bandwidth Architecture for Lossless Compression on Fpgas
    A Scalable High-Bandwidth Architecture for Lossless Compression on FPGAs Jeremy Fowers∗, Joo-Young Kim∗ and Doug Burger∗ Scott Hauck∗y ∗Microsoft Research yDepartment of Electrical Engineering Redmond, WA, USA University of Washington, Seattle, WA, USA Abstract—Data compression techniques have been the subject [10] have suffered from low throughput performance. How- of intense study over the past several decades due to exponential ever, recent efforts from industrial vendors [11]–[14] have increases in the quantity of data stored and transmitted by demonstrated 10 GB/s and 4 GB/s lossless compression on computer systems. Compression algorithms are traditionally forced to make tradeoffs between throughput and compression ASICs and FPGAs, respectively, proving that these devices quality (the ratio of original file size to compressed file are indeed a compelling substrate for compression. These size). FPGAs represent a compelling substrate for streaming accelerators are useful for both higher throughput and for applications such as data compression thanks to their capacity alleviating the computational cost of compression from the for deep pipelines and custom caching solutions. Unfortunately, CPU. data hazards in compression algorithms such as LZ77 inhibit the creation of deep pipelines without sacrificing some amount While these new accelerators provide an order-of- of compression quality. In this work we detail a scalable fully magnitude throughput increase over modern CPUs, they are pipelined FPGA accelerator that performs LZ77 compression resource intensive and will not scale to significantly higher and static Huffman encoding at rates up to 5.6 GB/s. Further- more, we explore tradeoffs between compression quality and bandwidths on existing hardware.
    [Show full text]