The Current Role of Image Compression Standards in Medical Imaging

Total Page:16

File Type:pdf, Size:1020Kb

The Current Role of Image Compression Standards in Medical Imaging information Review The Current Role of Image Compression Standards in Medical Imaging Feng Liu 1 ID , Miguel Hernandez-Cabronero 2, Victor Sanchez 3, Michael W. Marcellin 2 and Ali Bilgin 2,4,5,* 1 College of Electronic Information and Optical Engineering, Nankai University, Haihe Education Park, 38 Tongyan Road, Jinnan District, Tianjin 300350, China; [email protected] 2 Department of Electrical and Computer Engineering, The University of Arizona, 1230 E. Speedway Blvd, Tucson, AZ 85721, USA; [email protected] (M.H.-C.); [email protected] (M.W.M.) 3 Department of Computer Science, University of Warwick, Coventry CV4 7AL, UK; [email protected] 4 Department of Biomedical Engineering, The University of Arizona, 1127 E. James E. Rogers Way, Tucson, AZ 85721, USA 5 Department of Medical Imaging, The University of Arizona, 1501 N. Campbell Ave., Tucson, AZ 85724, USA * Correspondence: [email protected]; Tel.: +1-520-626-9414 Received: 25 September 2017; Accepted: 15 October 2017; Published: 19 October 2017 Abstract: With the increasing utilization of medical imaging in clinical practice and the growing dimensions of data volumes generated by various medical imaging modalities, the distribution, storage, and management of digital medical image data sets requires data compression. Over the past few decades, several image compression standards have been proposed by international standardization organizations. This paper discusses the current status of these image compression standards in medical imaging applications together with some of the legal and regulatory issues surrounding the use of compression in medical settings. Keywords: image compression; medical imaging; standards; JPEG; JPEG-LS; JPEG2000; JPEG-XR; HEVC; DICOM 1. Introduction Medical imaging has become an indispensable tool in clinical practice. Studies have shown links between the use of medical imaging exams and declines in mortality, reduced need for exploratory surgery, fewer hospital admissions, shorter hospital stays, and longer life expectancy [1]. As a result, the utilization of medical imaging has risen sharply during the early part of the last decade. In 2003, the percentage of medical visits in the US by patients aged ≥ 65 years that resulted in medical imaging was estimated to be 12.8% [2]. While earlier medical imaging exams were recorded on radiological film, most exams are now acquired digitally. In addition to increased utilization, there have also been major advances in medical imaging technology that have resulted in significant increases in the quantity of digital medical imaging data during the last few decades. For example, in early 1990s, a typical Computed Tomography (CT) exam of the thorax would have consisted of 25 slices with 10 mm thickness, yielding a data size of roughly 12 megabytes (MBs). Today, a similar exam on a modern CT scanner can yield sub-millimeter slice thickness with increased in-plane resolution resulting in 600 MB to a gigabyte (GB) of data. In a modern hospital, Picture Archiving and Communication Systems (PACS) handle the short- and long-term storage, retrieval, management, distribution, processing, and presentation of these large datasets. Data compression plays an important role in these systems. Since the earliest days of PACS, compression of medical images has been anticipated and novel compression techniques have been proposed before standardized compression Information 2017, 8, 131; doi:10.3390/info8040131 www.mdpi.com/journal/information Information 2017, 8, 131 2 of 26 approaches were available [3]. However, proprietary compression techniques greatly increase the cost and effort required to migrate data between different systems, and interoperability and compatibility of these systems necessitate the use of standards for digital communications [4]. In this paper, we provide a review of the current status of image compression standards used for medical imaging data in these systems (It is also worthwhile to point out that the role of data compression in the medical setting is not limited to images. Modern medical practice utilizes many physiological signals (e.g., electrocardiogram (ECG), electroencephalogram (EEG), and Electromyogram (EMG)) which must be stored and transmitted in clinical practice. Data compression has an important role to play for management of such physiological signals as well. However, in this paper, we limit our discussion to medical images). It is important to note that there have been earlier reviews of medical image compression techniques [5–14]. In this paper, we focus on the image compression standards including the more recent standards that have not been considered in these earlier reviews. We also compare the compression performances of these standards on publicly available datasets. The rest of this paper is organized as follows: In Section2, we discuss characteristics of common medical imaging data sets. In Section3, we provide a brief introduction to current image compression standards. Section4 introduces the standards used for medical image communications. Section5 briefly reviews the current legal and regulatory environment in medical image communications. We present experimental results obtained using the image compression standards on typical medical image datasets in Section6. Finally, Section7 provides a summary and a brief discussion on future trends. 2. Characteristics of Medical Imaging Data Sets The purpose of this section is to briefly describe the unique attributes of the major medical imaging modalities. Medical imaging refers to techniques used to view the human body with the goal of diagnosing, monitoring, and/or treating medical conditions. Different imaging modalities are based on different physical principles and provide different information about structure, morphology, and function of the human body. Medical imaging is an active field with new imaging modalities introduced frequently and existing modalities refined and expanded constantly. Given this depth and diversity of the medical imaging field, it is not realistic to provide a complete description of all medical imaging techniques in this section. Therefore, we limit our discussion to the most common medical imaging methods in clinical practice and their general characteristics as they relate to data compression (For an extended review of the characteristics of medical datasets, the interested reader is referred to [15]). Table1 provides the typical image dimensions and uncompressed file sizes for common medical imaging modalities. Table 1. Typical image dimensions and uncompressed file sizes for common medical imaging modalities. Modality Anatomy Image Dimensions (x, y, z, t) Bit Depth Uncompressed File Size Radiography Chest (2000, 2500, -, -) 10–16 bits 10 MB Abdomen (512, 512, 500, -) 12–16 bits 250 MB CT Brain (512, 512, 300, -) 12-16 bits 150 MB Heart (512, 512, 100, 20) 12-16 bits 1 GB Breast Tomosynthesis Breast (2457, 1890, 50, -) 10–16 bits 0.4 GB Abdomen (512, 512, 100, -) 12–16 bits 50 MB MRI Brain (512, 512, 200, -) 12–16 bits 100 MB Heart (256, 256, 20, 25) 12–16 bits 250 MB Ultrasound Heart (512, 512, -, 50)/s 24 bits (color) 38 MB/s Brain (256, 256, 50, -) 16 bits 6 MB PET Heart (128, 128, 40, 16) 16 bits 1 MB Digital Pathology Cells (30,000, 30,000, -, -) 24 bits (color) 2.5 GB Information 2017, 8, 131 3 of 26 2.1. Digital Radiography and Computed Tomography In X-ray radiography, the subject is penetrated by a collimated beam of X-rays and the properties of the X-ray (intensity, energy spectrum, direction of propagation, etc.) are modified as it travels through the human body. An array of X-ray detectors placed behind the subject is then used to record the modified properties of the X-ray beam and form the X-ray image. The dimensions of the image depend on the dimensions of the detector elements (0.1 mm to 0.2 mm) and the field of view (18 × 20 cm to 35 × 40 cm based on the anatomy of interest). A sample X-ray image of the pelvis is shown in Figure1a. In X-ray radiography, a single projection plane of an anatomical region is imaged onto the detector plane. In contrast, CT imaging produces a 3D image of the anatomy of interest by acquiring several projections at different angles and reconstructing the 3D volume using tomography [16]. Today’s multi-detector row CT scanners can acquire up to 320 simultaneous slices in each rotation of the X-ray tube. A thin-slice CT dataset can contain over 500 slices and dynamic imaging (such as cardiac angiography or perfusion imaging) can be performed on the latest generation of CT scanners although the routine clinical utilization of these studies is limited due to concerns about radiation dose. Another recent X-ray imaging technique developed to enhance characterization of breast lesions is Digital Breast Tomosynthesis (3D Mammography) [17,18]. In this technique, thin sections of the breast tissue are reconstructed through acquisition of multiple projections over a limited arc angle. These use of thin slices covering the entire breast allows better visualization and characterization of masses that may otherwise be superimposed with out-of-plane structures. A sample CT image of an axial slice of the abdomen is shown in Figure1b. The contrast resolution in CT is largely determined by the number of photons per voxel and, therefore, there is a trade-off between X-ray dose and contrast resolution, as well as spatial resolution. Noise in CT images originates from a number of sources including photon noise at the detector, electronic noise of the detector system, and the image reconstruction algorithm used. (a) (b) Figure 1. (a) An X-ray image of the pelvis; (b) A CT image of the abdomen. 2.2. Magnetic Resonance Imaging Magnetic Resonance Imaging (MRI) [19] is based on the principles of nuclear magnetic resonance. In MRI, the contrast of the image is dependent on several tissue-dependent parameters as well as the choice of acquisition parameters selected during the imaging procedure. By varying these parameters, MRI can yield images with drastically different contrast.
Recommended publications
  • Versatile Video Coding – the Next-Generation Video Standard of the Joint Video Experts Team
    31.07.2018 Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team Mile High Video Workshop, Denver July 31, 2018 Gary J. Sullivan, JVET co-chair Acknowledgement: Presentation prepared with Jens-Rainer Ohm and Mathias Wien, Institute of Communication Engineering, RWTH Aachen University 1. Introduction Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team 1 31.07.2018 Video coding standardization organisations • ISO/IEC MPEG = “Moving Picture Experts Group” (ISO/IEC JTC 1/SC 29/WG 11 = International Standardization Organization and International Electrotechnical Commission, Joint Technical Committee 1, Subcommittee 29, Working Group 11) • ITU-T VCEG = “Video Coding Experts Group” (ITU-T SG16/Q6 = International Telecommunications Union – Telecommunications Standardization Sector (ITU-T, a United Nations Organization, formerly CCITT), Study Group 16, Working Party 3, Question 6) • JVT = “Joint Video Team” collaborative team of MPEG & VCEG, responsible for developing AVC (discontinued in 2009) • JCT-VC = “Joint Collaborative Team on Video Coding” team of MPEG & VCEG , responsible for developing HEVC (established January 2010) • JVET = “Joint Video Experts Team” responsible for developing VVC (established Oct. 2015) – previously called “Joint Video Exploration Team” 3 Versatile Video Coding – The Next-Generation Video Standard of the Joint Video Experts Team Gary Sullivan | Jens-Rainer Ohm | Mathias Wien | July 31, 2018 History of international video coding standardization
    [Show full text]
  • 2018-07-11 and for Information to the Iso Member Bodies and to the Tmb Members
    Sergio Mujica Secretary-General TO THE CHAIRS AND SECRETARIES OF ISO COMMITTEES 2018-07-11 AND FOR INFORMATION TO THE ISO MEMBER BODIES AND TO THE TMB MEMBERS ISO/IEC/ITU coordination – New work items Dear Sir or Madam, Please find attached the lists of IEC, ITU and ISO new work items issued in June 2018. If you wish more information about IEC technical committees and subcommittees, please access: http://www.iec.ch/. Click on the last option to the right: Advanced Search and then click on: Documents / Projects / Work Programme. In case of need, a copy of an actual IEC new work item may be obtained by contacting [email protected]. Please note for your information that in the annexed table from IEC the "document reference" 22F/188/NP means a new work item from IEC Committee 22, Subcommittee F. If you wish to look at the ISO new work items, please access: http://isotc.iso.org/pp/. On the ISO Project Portal you can find all information about the ISO projects, by committee, document number or project ID, or choose the option "Stages search" and select "Search" to obtain the annexed list of ISO new work items. Yours sincerely, Sergio Mujica Secretary-General Enclosures ISO New work items 1 of 8 2018-07-11 Alert Detailed alert Timeframe Reference Document title Developing committee VA Registration dCurrent stage Stage date Guidance for multiple organizations implementing a common Warning Warning – NP decision SDT 36 ISO/NP 50009 (ISO50001) EnMS ISO/TC 301 - - 10.60 2018-06-10 Warning Warning – NP decision SDT 36 ISO/NP 31050 Guidance for managing
    [Show full text]
  • (L3) - Audio/Picture Coding
    Committee: (L3) - Audio/Picture Coding National Designation Title (Click here to purchase standards) ISO/IEC Document L3 INCITS/ISO/IEC 9281-1:1990:[R2013] Information technology - Picture Coding Methods - Part 1: Identification IS 9281-1:1990 INCITS/ISO/IEC 9281-2:1990:[R2013] Information technology - Picture Coding Methods - Part 2: Procedure for Registration IS 9281-2:1990 INCITS/ISO/IEC 9282-1:1988:[R2013] Information technology - Coded Representation of Computer Graphics Images - Part IS 9282-1:1988 1: Encoding principles for picture representation in a 7-bit or 8-bit environment :[] Information technology - Coding of Multimedia and Hypermedia Information - Part 7: IS 13522-7:2001 Interoperability and conformance testing for ISO/IEC 13522-5 (MHEG-7) :[] Information technology - Coding of Multimedia and Hypermedia Information - Part 5: IS 13522-5:1997 Support for Base-Level Interactive Applications (MHEG-5) :[] Information technology - Coding of Multimedia and Hypermedia Information - Part 3: IS 13522-3:1997 MHEG script interchange representation (MHEG-3) :[] Information technology - Coding of Multimedia and Hypermedia Information - Part 6: IS 13522-6:1998 Support for enhanced interactive applications (MHEG-6) :[] Information technology - Coding of Multimedia and Hypermedia Information - Part 8: IS 13522-8:2001 XML notation for ISO/IEC 13522-5 (MHEG-8) Created: 11/16/2014 Page 1 of 44 Committee: (L3) - Audio/Picture Coding National Designation Title (Click here to purchase standards) ISO/IEC Document :[] Information technology - Coding
    [Show full text]
  • Lossy Image Compression Based on Prediction Error and Vector Quantisation Mohamed Uvaze Ahamed Ayoobkhan* , Eswaran Chikkannan and Kannan Ramakrishnan
    Ayoobkhan et al. EURASIP Journal on Image and Video Processing (2017) 2017:35 EURASIP Journal on Image DOI 10.1186/s13640-017-0184-3 and Video Processing RESEARCH Open Access Lossy image compression based on prediction error and vector quantisation Mohamed Uvaze Ahamed Ayoobkhan* , Eswaran Chikkannan and Kannan Ramakrishnan Abstract Lossy image compression has been gaining importance in recent years due to the enormous increase in the volume of image data employed for Internet and other applications. In a lossy compression, it is essential to ensure that the compression process does not affect the quality of the image adversely. The performance of a lossy compression algorithm is evaluated based on two conflicting parameters, namely, compression ratio and image quality which is usually measured by PSNR values. In this paper, a new lossy compression method denoted as PE-VQ method is proposed which employs prediction error and vector quantization (VQ) concepts. An optimum codebook is generated by using a combination of two algorithms, namely, artificial bee colony and genetic algorithms. The performance of the proposed PE-VQ method is evaluated in terms of compression ratio (CR) and PSNR values using three different types of databases, namely, CLEF med 2009, Corel 1 k and standard images (Lena, Barbara etc.). Experiments are conducted for different codebook sizes and for different CR values. The results show that for a given CR, the proposed PE-VQ technique yields higher PSNR value compared to the existing algorithms. It is also shown that higher PSNR values can be obtained by applying VQ on prediction errors rather than on the original image pixels.
    [Show full text]
  • JPEG Image Compression.Pdf
    JPEG image compression FAQ, part 1/2 2/18/05 5:03 PM Part1 - Part2 - MultiPage JPEG image compression FAQ, part 1/2 There are reader questions on this topic! Help others by sharing your knowledge Newsgroups: comp.graphics.misc, comp.infosystems.www.authoring.images From: [email protected] (Tom Lane) Subject: JPEG image compression FAQ, part 1/2 Message-ID: <[email protected]> Summary: General questions and answers about JPEG Keywords: JPEG, image compression, FAQ, JPG, JFIF Reply-To: [email protected] Date: Mon, 29 Mar 1999 02:24:27 GMT Sender: [email protected] Archive-name: jpeg-faq/part1 Posting-Frequency: every 14 days Last-modified: 28 March 1999 This article answers Frequently Asked Questions about JPEG image compression. This is part 1, covering general questions and answers about JPEG. Part 2 gives system-specific hints and program recommendations. As always, suggestions for improvement of this FAQ are welcome. New since version of 14 March 1999: * Expanded item 10 to discuss lossless rotation and cropping of JPEGs. This article includes the following sections: Basic questions: [1] What is JPEG? [2] Why use JPEG? [3] When should I use JPEG, and when should I stick with GIF? [4] How well does JPEG compress images? [5] What are good "quality" settings for JPEG? [6] Where can I get JPEG software? [7] How do I view JPEG images posted on Usenet? More advanced questions: [8] What is color quantization? [9] What are some rules of thumb for converting GIF images to JPEG? [10] Does loss accumulate with repeated compression/decompression?
    [Show full text]
  • Vue PACS 12.2 DICOM Conformance Statement
    Clinical Collaboration Platform Vue PACS 12.2 DICOM Conformance Statement Part # AD0536 2017-11-23 Vue PACS 12.2 DICOM Conformance Statement AD0536 A Uncontrolled unless otherwise indicated Confidential Vue PACS 12.2 - DICOM Conformance Statement - AD0536.docx PAGE 1 of 155 Table of Contents 1 Introduction ............................................................................................................................................ 3 1.1 Terms and Definitions ...................................................................................................................... 3 1.2 About This Document ....................................................................................................................... 4 1.3 Important Remarks ........................................................................................................................... 4 2 Implementation Model............................................................................................................................ 5 2.1 Application Data Flow Diagram ........................................................................................................ 5 2.2 Functional Definitions of AEs ......................................................................................................... 11 2.3 Sequencing of Real World Activities .............................................................................................. 12 3 AE Specifications ................................................................................................................................
    [Show full text]
  • Security Solutions Y in MPEG Family of MPEG Family of Standards
    1 Security solutions in MPEG family of standards TdjEbhiiTouradj Ebrahimi [email protected] NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG: Moving Picture Experts Group 2 • MPEG-1 (1992): MP3, Video CD, first generation set-top box, … • MPEG-2 (1994): Digital TV, HDTV, DVD, DVB, Professional, … • MPEG-4 (1998, 99, ongoing): Coding of Audiovisual Objects • MPEG-7 (2001, ongo ing ): DitiDescription of Multimedia Content • MPEG-21 (2002, ongoing): Multimedia Framework NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-1 - ISO/IEC 11172:1992 3 • Coding of moving pictures and associated audio for digital storage media at up to about 1,5 Mbit/s – Part 1 Systems - Program Stream – Part 2 Video – Part 3 Audio – Part 4 Conformance – Part 5 Reference software NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-2 - ISO/IEC 13818:1994 4 • Generic coding of moving pictures and associated audio – Part 1 Systems - joint with ITU – Part 2 Video - joint with ITU – Part 3 Audio – Part 4 Conformance – Part 5 Reference software – Part 6 DSM CC – Par t 7 AAC - Advance d Au dio Co ding – Part 9 RTI - Real Time Interface – Part 10 Conformance extension - DSM-CC – Part 11 IPMP on MPEG-2 Systems NET working? Broadcast Networks and their security 16-17 June 2004, Geneva, Switzerland MPEG-4 - ISO/IEC 14496:1998 5 • Coding of audio-visual objects – Part 1 Systems – Part 2 Visual – Part 3 Audio – Part 4 Conformance – Part 5 Reference
    [Show full text]
  • The H.264 Advanced Video Coding (AVC) Standard
    Whitepaper: The H.264 Advanced Video Coding (AVC) Standard What It Means to Web Camera Performance Introduction A new generation of webcams is hitting the market that makes video conferencing a more lifelike experience for users, thanks to adoption of the breakthrough H.264 standard. This white paper explains some of the key benefits of H.264 encoding and why cameras with this technology should be on the shopping list of every business. The Need for Compression Today, Internet connection rates average in the range of a few megabits per second. While VGA video requires 147 megabits per second (Mbps) of data, full high definition (HD) 1080p video requires almost one gigabit per second of data, as illustrated in Table 1. Table 1. Display Resolution Format Comparison Format Horizontal Pixels Vertical Lines Pixels Megabits per second (Mbps) QVGA 320 240 76,800 37 VGA 640 480 307,200 147 720p 1280 720 921,600 442 1080p 1920 1080 2,073,600 995 Video Compression Techniques Digital video streams, especially at high definition (HD) resolution, represent huge amounts of data. In order to achieve real-time HD resolution over typical Internet connection bandwidths, video compression is required. The amount of compression required to transmit 1080p video over a three megabits per second link is 332:1! Video compression techniques use mathematical algorithms to reduce the amount of data needed to transmit or store video. Lossless Compression Lossless compression changes how data is stored without resulting in any loss of information. Zip files are losslessly compressed so that when they are unzipped, the original files are recovered.
    [Show full text]
  • CP2100 Clarify SMPTE Transfer Syntaxes for DICOM Web Services
    16 CP-2100 - Clarify SMPTE Transfer Syntaxes for DICOM Web Services​ Page 1​ 1 Status​ Final Text​ 2 Date of Last Update​ 2021/09/03​ 3 Person Assigned​ David Clunie​ 4 mailto:[email protected]​ 5 Submitter Name​ Mathieu Malaterre​ 6 mailto:[email protected]​ 7 Submission Date​ 2020/04/16​ 8 Correction Number CP-2100​ 9 Log Summary: Clarify SMPTE Transfer Syntaxes for DICOM Web Services​ 10 Name of Standard​ 11 PS3.18​ 12 Rationale for Correction:​ 13 Clarify that the video Transfer Syntaxes used for RTV, such as SMPTE ST 2110-20, are not applicable.​ 14 Correction Wording:​ 15 - Final Text -​ 48 CP-2100 - Clarify SMPTE Transfer Syntaxes for DICOM Web Services​ Page 2​ 1 Amend DICOM PS3.18 as follows (changes to existing text are bold and underlined for additions and struckthrough for removals):​ 2 8.7.3.1 Instance Media Types​ 3 The application/dicom media type specifies a representation of Instances encoded in the DICOM File Format specified in Section 7​ 4 “DICOM File Format” in PS3.10.​ 5 Note​ 6 The origin server may populate the PS3.10 File Meta Information with the identification of the Source, Sending and Receiving​ 7 AE Titles and Presentation Addresses as described in Section 7.1 in PS3.10, or these Attributes may have been left unaltered​ 8 from when the origin server received the objects. The user agent storing the objects received in the response may populate​ 9 or coerce these Attributes based on its own knowledge of the endpoints involved in the transaction, so that they accurately​ 10 identify the most recent
    [Show full text]
  • ITC Confplanner DVD Pages Itcconfplanner
    Comparative Analysis of H.264 and Motion- JPEG2000 Compression for Video Telemetry Item Type text; Proceedings Authors Hallamasek, Kurt; Hallamasek, Karen; Schwagler, Brad; Oxley, Les Publisher International Foundation for Telemetering Journal International Telemetering Conference Proceedings Rights Copyright © held by the author; distribution rights International Foundation for Telemetering Download date 25/09/2021 09:57:28 Link to Item http://hdl.handle.net/10150/581732 COMPARATIVE ANALYSIS OF H.264 AND MOTION-JPEG2000 COMPRESSION FOR VIDEO TELEMETRY Kurt Hallamasek, Karen Hallamasek, Brad Schwagler, Les Oxley [email protected] Ampex Data Systems Corporation Redwood City, CA USA ABSTRACT The H.264/AVC standard, popular in commercial video recording and distribution, has also been widely adopted for high-definition video compression in Intelligence, Surveillance and Reconnaissance and for Flight Test applications. H.264/AVC is the most modern and bandwidth-efficient compression algorithm specified for video recording in the Digital Recording IRIG Standard 106-11, Chapter 10. This bandwidth efficiency is largely derived from the inter-frame compression component of the standard. Motion JPEG-2000 compression is often considered for cockpit display recording, due to the concern that details in the symbols and graphics suffer excessively from artifacts of inter-frame compression and that critical information might be lost. In this paper, we report on a quantitative comparison of H.264/AVC and Motion JPEG-2000 encoding for HD video telemetry. Actual encoder implementations in video recorder products are used for the comparison. INTRODUCTION The phenomenal advances in video compression over the last two decades have made it possible to compress the bit rate of a video stream of imagery acquired at 24-bits per pixel (8-bits for each of the red, green and blue components) with a rate of a fraction of a bit per pixel.
    [Show full text]
  • The H.264/MPEG4 Advanced Video Coding Standard and Its Applications
    SULLIVAN LAYOUT 7/19/06 10:38 AM Page 134 STANDARDS REPORT The H.264/MPEG4 Advanced Video Coding Standard and its Applications Detlev Marpe and Thomas Wiegand, Heinrich Hertz Institute (HHI), Gary J. Sullivan, Microsoft Corporation ABSTRACT Regarding these challenges, H.264/MPEG4 Advanced Video Coding (AVC) [4], as the latest H.264/MPEG4-AVC is the latest video cod- entry of international video coding standards, ing standard of the ITU-T Video Coding Experts has demonstrated significantly improved coding Group (VCEG) and the ISO/IEC Moving Pic- efficiency, substantially enhanced error robust- ture Experts Group (MPEG). H.264/MPEG4- ness, and increased flexibility and scope of appli- AVC has recently become the most widely cability relative to its predecessors [5]. A recently accepted video coding standard since the deploy- added amendment to H.264/MPEG4-AVC, the ment of MPEG2 at the dawn of digital televi- so-called fidelity range extensions (FRExt) [6], sion, and it may soon overtake MPEG2 in further broaden the application domain of the common use. It covers all common video appli- new standard toward areas like professional con- cations ranging from mobile services and video- tribution, distribution, or studio/post production. conferencing to IPTV, HDTV, and HD video Another set of extensions for scalable video cod- storage. This article discusses the technology ing (SVC) is currently being designed [7, 8], aim- behind the new H.264/MPEG4-AVC standard, ing at a functionality that allows the focusing on the main distinct features of its core reconstruction of video signals with lower spatio- coding technology and its first set of extensions, temporal resolution or lower quality from parts known as the fidelity range extensions (FRExt).
    [Show full text]
  • H.264/MPEG-4 Advanced Video Coding Alexander Hermans
    Seminar Report H.264/MPEG-4 Advanced Video Coding Alexander Hermans Matriculation Number: 284141 RWTH September 11, 2012 Contents 1 Introduction 2 1.1 MPEG-4 AVC/H.264 Overview . .3 1.2 Structure of this paper . .3 2 Codec overview 3 2.1 Coding structure . .3 2.2 Encoder . .4 2.3 Decoder . .6 3 Main differences 7 3.1 Intra frame prediction . .7 3.2 Inter frame prediction . .9 3.3 Transform function . 11 3.4 Quantization . 12 3.5 Entropy encoding . 14 3.6 In-loop deblocking filter . 15 4 Profiles and levels 16 4.1 Profiles . 17 4.2 Levels . 18 5 Evaluation 18 6 Patents 20 7 Summary 20 1 1 Introduction Since digital images have been created, the need for compression has been clear. The amount of data needed to store an uncompressed image is large. But while it still might be feasible to store a full resolution uncompressed image, storing video sequences without compression is not. Assuming a normal frame-rate of about 25 frames per second, the amount of data needed to store one hour of high definition video is about 560GB1. Compressing each image on its own, would reduce this amount, but it would not make the video small enough to be stored on today's typical storage mediums. To overcome this problem, video compression algorithms have exploited the temporal redundancies in the video frames. By using previously encoded, or decoded, frames to predict values for the next frame, the data can be compressed with such a rate that storage becomes feasible.
    [Show full text]