arXiv:2002.01657v1 [eess.IV] 5 Feb 2020


LEARNED LOSSLESS IMAGE COMPRESSION WITH A HYPERPRIOR AND DISCRETIZED GAUSSIAN MIXTURE LIKELIHOODS

Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto

Department of Computer Science and Communications Engineering, Waseda University, Tokyo, Japan.

ABSTRACT

Lossless image compression is an important task in the field of multimedia communication. Traditional image codecs, such as WebP, JPEG2000 and FLIF, typically support a lossless mode. Recently, deep learning based approaches have started to show potential in this area. The hyperprior is an effective technique proposed for lossy image compression. This paper generalizes the hyperprior from a lossy model to lossless compression, and adds an L2-norm term to the loss function to speed up the training procedure. In addition, this paper investigates different parameterized models for the latent codes, and proposes to use Gaussian mixture likelihoods to achieve adaptive and flexible context models. Experimental results validate that our method can outperform existing deep learning based lossless compression, and can outperform JPEG2000 and WebP for JPG images.

Index Terms: Lossless Image Compression, Deep Learning, HyperPrior, Gaussian Mixture Model

1. INTRODUCTION

Image compression has been a fundamental task in the field of signal processing for many decades, enabling efficient transmission and storage. Classical image compression standards, such as JPEG [1], JPEG2000 [2], WebP [3], HEVC/H.265-intra [4] and lossless FLIF [5], usually rely on hand-crafted encoder/decoder (codec) block diagrams. They use a fixed linear transform, quantization and an entropy coder to reduce spatial redundancy in images. Typically they also support a lossless mode to compress images losslessly. However, along with the fast development of new image formats and high-resolution mobile devices, existing image compression algorithms are not expected to be optimal solutions.

Recently, we have seen a great surge of deep learning based image compression approaches. Most of them focus on lossy image compression. For example, some image compression approaches use generative models to learn the distribution of images using adversarial training [6, 7, 8] to achieve impressive subjective quality at extremely low bit rates. Some works use recurrent neural networks to compress the residual information recursively, such as [9, 10, 11], to realize scalable coding. Some approaches propose a hyperprior-based and context-adaptive context model to compress codes effectively [12, 13, 14]. Some methods decorrelate each channel of the latent codes and apply deep residual learning to improve the performance [15, 16, 17]. However, deep learning based lossless compression has rarely been discussed. One related work is L3C [18], which proposes a hierarchical architecture with 3 scales to compress images losslessly.

In this paper, we propose a learned lossless image compression method using a hyperprior and discretized Gaussian mixture likelihoods. Our contributions mainly consist of two aspects. First, we generalize the hyperprior from a lossy model to a lossless compression model, and propose a loss function with an L2-norm term for lossless compression to speed up training. Second, we investigate four parameterized distributions and propose to use Gaussian mixture likelihoods for the context model. Experimental results demonstrate that our method can outperform the recent learned compression approach L3C. In addition, our method outperforms JPEG2000 and WebP for JPG images.

2. LEARNED LOSSLESS IMAGE COMPRESSION

2.1. Formulation of Compression with a Hyperprior

According to the transform coding approach [19], image compression can be formulated as

$$
\begin{aligned}
y &= g_a(x; \phi) \\
\hat{y} &= Q(y) \\
\hat{x} &= g_s(\hat{y}; \theta)
\end{aligned}
\tag{1}
$$

where $x$, $\hat{x}$, $y$, and $\hat{y}$ are raw images, reconstructed images, a latent representation before quantization, and compressed codes, respectively. $Q$ represents the quantization, which is approximated by uniform noise $U(-\frac{1}{2}, \frac{1}{2})$ during training and is denoted as $U|Q$ in Fig. 1. $\phi$ and $\theta$ are the optimized parameters of the analysis and synthesis transforms. The operation diagram is explained in Fig. 1(a).

In the work [12], Ballé proposed a hyperprior for lossy image compression and achieved promising results. The hyperprior introduces side information $z$ to capture spatial dependencies among the elements of $y$, formulated as

$$
\begin{aligned}
z &= h_a(y; \phi_h) \\
\hat{z} &= Q(z) \\
(\mu_y, \sigma_y) &= h_s(\hat{z}; \theta_h)
\end{aligned}
\tag{2}
$$

Fig. 1. Operational diagrams and distributions of learned lossy and lossless image compression: (a) lossy model; (b) lossy model with a hyperprior; (c) proposed lossless model with a hyperprior; (d) distributions of lossy and lossless models.

where $h_a$ and $h_s$ denote the analysis and synthesis functions in the auxiliary autoencoder, and $\phi_h$ and $\theta_h$ are the optimized parameters of the hyperprior, respectively. $\mu_y$ and $\sigma_y$ are generated by the auxiliary autoencoder to model a Gaussian distribution $\mathcal{N}(\mu_y, \sigma_y^2)$. Originally, Ballé assumed a zero-mean scalar Gaussian distribution, but in the enhanced work [13], a mean and scale Gaussian distribution achieves better results. We therefore use the mean and scale form in this paper. The operation diagram is explained in Fig. 1(b).

However, both cases are lossy compression. In this paper, we generalize a lossy model to a lossless model as in Fig. 1(c). Rather than predicting only the reconstructed pixel values, we predict a probability model for raw images, and model $x$ using another parameterized distribution. This is feasible because entropy coding techniques such as arithmetic coding [20] can losslessly compress the signals if a probability model is given. The difference between the distributions of lossy and lossless compression is illustrated in Fig. 1(d): lossy compression can be viewed as predicting a delta distribution at the value $\hat{x}$ for each element, while lossless compression predicts a more general, arbitrary probability model. One intuitive choice is to use a Gaussian distribution $\mathcal{N}(\mu_x, \sigma_x^2)$, as for $y$, as shown in Fig. 1(d).

2.2. Proposed Loss Function with a L2-Norm

Unlike lossy compression, which controls the rate-distortion tradeoff, the loss function of lossless compression only consists of the bits required for $x$, $y$ and $z$, that is,

$$
L = \mathbb{E}[-\log_2 p_{\hat{x}}(\hat{x} \mid \mu_x, \sigma_x)] + \mathbb{E}[-\log_2 p_{\hat{y}}(\hat{y} \mid \mu_y, \sigma_y)] + \mathbb{E}[-\log_2 p_{\hat{z}}(\hat{z} \mid \psi)]
\tag{3}
$$

where the probability model of $\hat{x}$ is estimated by the parameters $\mu_x, \sigma_x$, which are in fact conditioned on $\hat{y}$. Similarly, the probability model of $\hat{y}$ is estimated by parameters conditioned on $\hat{z}$, i.e.

$$
p_{\hat{x}}(\hat{x} \mid \hat{y}) = \prod_i \left( \mathcal{N}(\mu_{x_i}, \sigma_{x_i}^2) * U(-\tfrac{1}{2}, \tfrac{1}{2}) \right)(\hat{x}_i)
\tag{4}
$$

$$
p_{\hat{y}}(\hat{y} \mid \hat{z}) = \prod_i \left( \mathcal{N}(\mu_{y_i}, \sigma_{y_i}^2) * U(-\tfrac{1}{2}, \tfrac{1}{2}) \right)(\hat{y}_i)
\tag{5}
$$

$$
p_{\hat{z}}(\hat{z} \mid \psi) = \prod_i \left( p_{z_i \mid \psi_i}(\psi_i) * U(-\tfrac{1}{2}, \tfrac{1}{2}) \right)(\hat{z}_i)
\tag{6}
$$

where $\psi$ are learnable factorized distributions [12] for $z$, because there is no prior for $z$.

In principle, Eq. (3) is sufficient to train a neural network for lossless compression. However, we have found experimentally that convergence is very slow. By observing the distributions in Fig. 1(d), we can see that the actual marginal distribution of $x$ is typically not identical to the estimated entropy model $p_{\hat{x}}$, especially in the initial training stage of the neural network. It takes quite a long time to find a proper and accurate distribution that is centered at the actual $x$ value. Therefore, we introduce an L2-norm term between the ground-truth value and the predicted mean value into lossless compression, that is,

$$
L' = \|\mu_x - \hat{x}\|^2 + \|\mu_y - \hat{y}\|^2
\tag{7}
$$

Then the updated loss function for lossless image compression is defined as

$$
L = \mathbb{E}[-\log_2 p_{\hat{x}}(\hat{x} \mid \hat{y})] + \mathbb{E}[-\log_2 p_{\hat{y}}(\hat{y} \mid \hat{z})] + \mathbb{E}[-\log_2 p_{\hat{z}}(\hat{z} \mid \psi)] + \lambda \cdot \left( \|\mu_x - \hat{x}\|^2 + \|\mu_y - \hat{y}\|^2 \right)
\tag{8}
$$

where $\lambda$ controls the weight of the L2-norm term.

2.3. Proposed Discretized Gaussian Mixture Likelihoods

Whether the parameterized distribution model fits the marginal distribution of $x$ and $\hat{y}$ is a key factor for the performance. We visualize the marginal distribution of all sub-pixel values $x$ and compressed codes $\hat{y}$ for kodim02 from the Kodak dataset [27] in Fig. 3; they do not actually satisfy the assumption that they follow a Gaussian distribution. Previous works have investigated several parameterized distribution models, but to our knowledge, which distribution model fits the true marginal distribution best is still an open question. For example, standard PixelCNN [21] uses full 256-way softmax likelihoods, which is memory-consuming and impractical for large images. PixelCNN++ [22] proposed discretized logistic mixture likelihoods to achieve faster training. L3C [18] followed PixelCNN++ in using a logistic mixture model. The lossy compression work [12] assumed a univariate Gaussian distribution. The work [23] used a Laplace distribution. The traditional coding work [24] used a Cauchy distribution to model coefficients.

Fig. 2. Network architecture of lossless image compression.

Fig. 3. Marginal distributions for image kodim02: (a) sub-pixel values $x$; (b) compressed codes $\hat{y}$.

Fig. 4. Different distributions with zero mean and unit scale.

We investigate all the above models, including the Gaussian, Cauchy, Laplace and logistic distributions. We substitute each of them for $\hat{y}$ individually. Examples of their distribution models are visualized in Fig. 4, where we clip the ranges and add the tail masses to edge [...]

[...] parameterized probabilities from a univariate Gaussian model to a multivariate Gaussian mixture model as

$$
\begin{aligned}
p_{\hat{y}}(\hat{y} \mid \hat{z}) &= \prod_i p_{\hat{y}}(\hat{y}_i \mid \hat{z}) \\
p_{\hat{y}}(\hat{y}_i \mid \hat{z}) &= \left( \sum_{k=1}^{K} w_{y_i}^{(k)} \mathcal{N}(\mu_{y_i}^{(k)}, \sigma_{y_i}^{2(k)}) * U(-\tfrac{1}{2}, \tfrac{1}{2}) \right)(\hat{y}_i)
\end{aligned}
\tag{9}
$$

where the $k$-th mixture component is a Gaussian distribution characterized by three parameters, i.e. weights $w_i^{(k)}$, means $\mu_i^{(k)}$ and variances $\sigma_i^{2(k)}$ for each $i$-th element. The mixture models for $x$ are constructed in the same way as for $y$.
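As a rough illustration of Eq. (9) and of the rate and L2 terms in Eq. (8), the sketch below evaluates a discretized Gaussian mixture on integer symbols. Convolving a Gaussian with $U(-\frac{1}{2}, \frac{1}{2})$ and evaluating the result at an integer equals the Gaussian mass on the unit interval around that integer, so each component contributes a difference of two CDF values. This is not the authors' implementation: the function names, the `eps` floor, and the use of the weighted mixture mean as the predicted mean in the L2 term are assumptions made only for this example, and in the paper the weights, means and scales would come from the network rather than being fixed by hand.

```python
import math


def gaussian_cdf(v, mu, sigma):
    """CDF of N(mu, sigma^2) evaluated at v."""
    return 0.5 * (1.0 + math.erf((v - mu) / (sigma * math.sqrt(2.0))))


def discretized_gmm_likelihood(value, weights, means, sigmas, eps=1e-9):
    """Mass of an integer `value` under a Gaussian mixture convolved with
    U(-1/2, 1/2), following the form of Eq. (9)."""
    mass = 0.0
    for w, mu, sigma in zip(weights, means, sigmas):
        upper = gaussian_cdf(value + 0.5, mu, sigma)
        lower = gaussian_cdf(value - 0.5, mu, sigma)
        mass += w * (upper - lower)
    # Floor the mass so log2 never sees zero (an assumption of this sketch).
    return max(mass, eps)


def estimated_bits(values, params):
    """Sum of -log2 likelihoods: the rate term for one set of symbols."""
    return sum(
        -math.log2(discretized_gmm_likelihood(v, w, m, s))
        for v, (w, m, s) in zip(values, params)
    )


def l2_term(values, params):
    """Squared distance between each symbol and its predicted mean.
    Assumption: the predicted mean is the weighted mean of the mixture."""
    total = 0.0
    for v, (weights, means, _sigmas) in zip(values, params):
        predicted_mean = sum(w * m for w, m in zip(weights, means))
        total += (predicted_mean - v) ** 2
    return total


def loss(values, params, lam):
    """Rate plus lam times the L2 term, in the spirit of Eq. (8),
    restricted here to a single set of symbols."""
    return estimated_bits(values, params) + lam * l2_term(values, params)


if __name__ == "__main__":
    # Hypothetical parameters for two symbols, K = 2 components each.
    params = [
        ([0.6, 0.4], [0.0, 3.0], [1.0, 0.5]),
        ([0.5, 0.5], [1.0, 3.0], [1.0, 1.0]),
    ]
    symbols = [0, 2]
    print("bits:", estimated_bits(symbols, params))
    print("loss:", loss(symbols, params, lam=0.01))
```

Because the per-integer masses telescope, summing `discretized_gmm_likelihood` over a wide enough integer range gives a total close to 1 whenever the weights sum to 1, which is a convenient sanity check before handing the probabilities to an arithmetic coder.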