API for Real Time/Dynamic Image and Video Processing Presented By

Total Page:16

File Type:pdf, Size:1020Kb

API for Real Time/Dynamic Image and Video Processing Presented By API for Real time/Dynamic Image and Video Processing Presented By Ivan Wong VP and Co-founder Background CTAccelAbout & XilinxCTAccel partnership CTAccel Limited Partner with Xilinx Inc since 2016 Delivered first Xilinx FPGA based Off-Premise Founded in Mar, 2016 solution in 2016 Based in Hong Kong & Shenzhen Delivered first Xilinx FPGA based Cloud Focus on FPGA-based accelerated computing for Data solution in 2017 Center US Patent Core value of our company: Quality & Innovation Background Market demand and challenges More Data Users generate more image and video data everyday Higher resolution in capture and display More Better Quality Better Data Quality Users crave better viewing experience Faster Access Faster Customers demand instant access to the resource Access Challenges: • Huge consumption of computational and storage resource • Server and storage performance IS NOT KEEPING PACE Data storage supply and demand worldwide, from 2009 to 2020 (in exabytes)* *Source: https://www.statista.com/statistics/751749/worldwide-data-storage-capacity-and-demand/ Background Constraints in existing solutions Internet traffic increases by 26% annually, CPU performance per core has not image and video contributes a large increased since 2005 portion of internet data. Source: Cisco--VNI Forecast Highlights Tool Source: The Future of Computing Performance (2011) Chip Frequency graphed against year of introduction How to achieve the fastest image processing with the least amount of computational resources? Background Modern image and video formats continue to emerge Guetzli HEIF Mozjpeg OpenCV Professional Lepton image BPG processing GraphicsMagick accelerator ImageMagick WebP JPEG Baseline Pixels Processing /Progressive Image Video AI Where is CTAccel image processor located in customers’ production environment? Cloud album Social network E-commerce News app Scenarios UGC Web portals Cloud storage …… CIP in Object-oriented storage Image Video AI CIP software stack and accelerated functions System Stack Accelerated Function Units Pixel Decode Encode /FFmpeg Processing JPEG Resizing JPEG WebP Crop WebP Lepton Rotate Lepton HEIF HEIF Image Video AI Why Xilinx? Applications Financial Machine Video/Image Processing Big Data Analytics Genomics Security Analytics Learning Machine Software-Defined Learning Development Environments Video & Image Processing Solution Development Partners & Customers Compute Acceleration Data Analytics Increase Service Level Cloud, Enterprise to Edge-Computing Genomics Image Video AI Product 1:JPEG 2 JPEG Problems solved Solution Scenarios Internet apps with JPEG Accelerate the speed of JPEG End-user E-commerce platform Cloud album image thumbnail generation, Smart phone/PAD Camera … PC reduce image-processing Social network with pictures servers. Web portals App News application Key features Cloud album Social network … News app Values TCO reduction: High Throughput CIP Reduce image-processing servers Low Latency OPEX reduction JPEG JPEG FPGA Resize Low CPU Utilization Decode Encode Improve customer experience: Accelerate the image rendering speed Image Video AI JPEG 2 JPEG performance on cloud VU9P (HUAWEI fp1c.2xlarge) 8vCPU (HUAWEI fp1c.2xlarge) Throughput(MB/s) Latency(ms) CPU Utilization 350 310.4 1400 120.00% 1122 1162 99.00% 99.00% 300 1200 100.00% 95.00% 250 1000 80.00% 200 800 63.00% 60.00% 150 600 100 400 40.00% 62.2 201 20.00% 50 7.7 7 200 28 0 0 0.00% 640x480 2048x1080 640x480 2048x1080 640x480 2048x1080 Input:10000*4096x2160(Average Size 803k) Throughput(MB/s) Latency(ms) CPU Utilization 100 94.9 250 100.00% 83.00% 87.00% 197 80 200 80.00% 70.00% 60 50.8 150 60.00% 40 100 40.00% 37.00% 48 20 13.3 8.5 50 23 20.00% 4 0 0 0.00% 240x180 768x576 240x180 768x576 240x180 768x576 Input:10000*1024x768 (Average Size 130k) Image Video AI Product 2: JPEG 2 WebP(M6) – Highest compression ratio Problems solved Solution Scenarios Vast amount of computational Internet apps with WebP resources and long time when End-user E-commerce platform processing WebP encoding by Smart phone/PAD Camera … PC Social network with pictures CPU; Web portals Accelerate the transcoding from News application JPEG to WebP; App Key features E-commerce Social network … News app Values TCO reduction: High Throughput CIP Save CDN traffic Reduce image-processing servers Low Latency JPEG WebP FPGA Resize OPEX reduction Low CPU Utilization Decode Encode Improve customer experience: Accelerate the image rendering speed Image Video AI JPEG to WebP performance FPGA Used: QPS Throughput (MB/s) Single Xilinx UltraScale+ 800 721 700 Dual Xilinx UltraScale+ 700 599 600 600 470 500 500 QPS 384 390 Single FPGA is 5.11 times of CPU 400 400 318 286 Dual FPGA is 8.39 times of CPU 300 300 237 208 173 200 200 99 108 82 89 100 56 25 100 46 21 Latency 0 0 Single FPGA is 0.20 times of CPU 640x480 1024x768 2048x1080 640x480 1024x768 2048x1080 Dual FPGA is times of CPU 0.12 FPGA*1 FPGA*2 CPU FPGA*1 FPGA*2 CPU Input: 10000*4096 x2160 images Latency (ms) CPU Utilization 1000 941.62 120% 100% 100% 100% 800 100% Test environment: 80% CPU: 2*Intel(R) Xeon(R) CPU E5-2630v2 600 RAM: 128GB 422.73 60% 44% 45% 45% OS: CentOS Linux release 7.2.1511 400 240.64 40% 220.07 25% 23% 21% 200 82.79 113.03 60.9732.7 49.94 20% 0 0% 640x480 1024x768 2048x1080 640x480 1024x768 2048x1080 FPGA*1 FPGA*2 CPU FPGA*1 FPGA*2 CPU Image Video AI Modern image format-Lepton and its problem Lepton is a new image compression format developed by Dropbox and made open source in 2016. Lepton compresses JPEG images losslessly. It reduces file size by an average of 22%. It preserves the original JPEG file bit-by-bite perfectly, including all metadata. Lepton can be applied into the scenarios with massive images storage. It can effectively reduce the storage cost. JPEG Lepton Compression Ratio 1024x768 420 1.3G 1.1G 15% 1024x768 422 1.4G 1.1G 21% 1024x768 444 1.6G 1.3G 18% 4096x2160 420 8.3G 6.3G 24% 4096x2160 422 8.7G 6.6G 24% 4096x2160 444 9.5G 7.1G 25% Average 21% The downside of Lepton is that it requires heavy computation power for both compression and decompression. Problem A conventional X86 server with dual E5-2630 CPU can only compress JPEG files into Lepton format at a rate of 20 megabytes per second. Image Video AI Product 3: JPEG to Lepton Problems solved Solution Scenarios Vast amount of computational resources and long time when End-user Scenarios with massive images storage : processing Lepton encoding by Smart phone/PAD Camera … PC ; CPU; Cloud album ; Accelerate the transcoding from Cloud storage JPEG to Lepton; App Key features Cloud album Cloud storage Values TCO Reduction: High Throughput CIP Save CDN traffic Low Latency Reduce image-processing servers FPGA JPEG Lepton Low CPU Utilization Decode Encode Reduce image-storage servers OPEX reduction Image Video AI JPEG to Lepton performance FPGA Used: QPS U200 Throughput(MB/s) U200 CPU Virtex UltraScale+ FPGA Alveo U200 30.0 28.3 CPU 120.0 108.2 25.0 100.0 20.0 80.0 QPS FPGA is 4.2 times of CPU 15.0 60.0 10.0 6.7 40.0 25.7 Latency 5.0 20.0 FPGA is 0.24 times of CPU 0.0 0.0 24 threads 24 threads U200 Input images: Latency(ms) CPU Utilization U200 Total number: 999 images CPU 4000.00 120% CPU 3492.08 100% Total file size: 3.8GB 3500.00 100% 3000.00 2500.00 80% Test environment : 2000.00 60% CPU: Intel(R) Xeon(R) E5-2630v2 x2 1500.00 40% 1000.00 827.77 RAM: 128GB 500.00 20% 7% OS: CentOS Linux release 7.3.1611 0.00 0% 24 threads 24 threads Image Video AI Product 4: JPEG to HEIF Problems solved Solution Scenarios Internet apps with HEIF Vast amount of computational End-user E-commerce platform resources and long time when Smart phone/PAD Camera … PC Social network with pictures processing HEIC encoding by CPU; Web portals Poor customer experience; News application App Key features Cloud album Social network … News app Values TCO reduction: High Throughput CIP Reduce image-processing servers Low Latency OPEX reduction JPEG HEIF FPGA Resize Low CPU Utilization Decode Encode Improve customer experience: Accelerate the image rendering speed Image Video AI JPEG to HEIF performance QPS: FPGA*1 is 10 times that of Params JPG FPGA CPU CPU Images 100 100 100 Latency: FPGA*1 is 0.1 times that of Total 173738658 135858024 133290287 CPU Size(Bytes) Input: Compression 78.2% 76.72% Resolution:1920*1080 Ratio VQ: preset=medium Total files: 100 Param -slice_qp 5 crf=12 Output: (10179.74 kb/s) Resolution: 1920*1080 FPS 200.12 20.02 Test environment: CPU: 2*Intel(R) Xeon(R) CPU E5-2680v4 PSNR 54.51 57.06 RAM: 128GB OS: CentOS Linux release 7.2.1511 Kernel version: 3.10.0-862.el7.x86_64 VMAF 97.25 97.31 Image Video AI SummaryFeatures of CIP’s of valuesCIP High 5-10 times throughput promotion compare to CPU performance 3 times latency reduction compare to CPU 20W-40W per FPGA card Low power TCO reduction: improve computing density of DC, reduce racks Software Support: ImageMagick, OpenCV, GraphicsMagick, Lepton compatible Allow seamless migration from software-based implementation to CIP Remote upgrading Ease of PR(Partial Reconfiguration) technology allows fast and easy context switch in maintenance accelerator functionality without rebooting the server Image Video AI Case study 1 Customer Advantages Leading mobile phone High TCO Better Ease of manufacturer in China performance reduction QoS maintenance Reduce the total Reduce TCO by 3x latency reduction Smaller cluster of number of server 25% servers Scenario cluster by 50% Improve the throughput by 50% Deterministic
Recommended publications
  • An Effective Self-Adaptive Policy for Optimal Video Quality Over Heterogeneous Mobile Devices and Network Discovery Services
    Appl. Math. Inf. Sci. 13, No. 3, 489-505 (2019) 489 Applied Mathematics & Information Sciences An International Journal http://dx.doi.org/10.18576/amis/130322 An Effective Self-Adaptive Policy for Optimal Video Quality over Heterogeneous Mobile Devices and Network Discovery Services Saleh Ali Alomari∗1 , Mowafaq Salem Alzboon1, Belal Zaqaibeh1 and Mohammad Subhi Al-Batah1 1Faculty of Science and Information Technology, Jadara University, 21110 Irbid, Jordan Received: 12 Dec. 2018, Revised: 1 Feb. 2019, Accepted: 20 Feb. 2019 Published online: 1 May 2019 Abstract: The Video on Demand (VOD) system is considered a communicating multimedia system that can allow clients be interested whilst watching a video of their selection anywhere and anytime upon their convenient. The design of the VOD system is based on the process and location of its three basic contents, which are: the server, network configuration and clients. The clients are varied from numerous approaches, battery capacities, involving screen resolutions, capabilities and decoder features (frame rates, spatial dimensions and coding standards). The up-to-date systems deliver VOD services through to several devices by utilising the content of a single coded video without taking into account various features and platforms of a device, such as WMV9, 3GPP2 codec, H.264, FLV, MPEG-1 and XVID. This limitation only provides existing services to particular devices that are only able to play with a few certain videos. Multiple video codecs are stored by VOD systems store for a similar video into the storage server. The problems caused by the bandwidth overhead arise once the layers of a video are produced.
    [Show full text]
  • Free Lossless Image Format
    FREE LOSSLESS IMAGE FORMAT Jon Sneyers and Pieter Wuille [email protected] [email protected] Cloudinary Blockstream ICIP 2016, September 26th DON’T WE HAVE ENOUGH IMAGE FORMATS ALREADY? • JPEG, PNG, GIF, WebP, JPEG 2000, JPEG XR, JPEG-LS, JBIG(2), APNG, MNG, BPG, TIFF, BMP, TGA, PCX, PBM/PGM/PPM, PAM, … • Obligatory XKCD comic: YES, BUT… • There are many kinds of images: photographs, medical images, diagrams, plots, maps, line art, paintings, comics, logos, game graphics, textures, rendered scenes, scanned documents, screenshots, … EVERYTHING SUCKS AT SOMETHING • None of the existing formats works well on all kinds of images. • JPEG / JP2 / JXR is great for photographs, but… • PNG / GIF is great for line art, but… • WebP: basically two totally different formats • Lossy WebP: somewhat better than (moz)JPEG • Lossless WebP: somewhat better than PNG • They are both .webp, but you still have to pick the format GOAL: ONE FORMAT THAT COMPRESSES ALL IMAGES WELL EXPERIMENTAL RESULTS Corpus Lossless formats JPEG* (bit depth) FLIF FLIF* WebP BPG PNG PNG* JP2* JXR JLS 100% 90% interlaced PNGs, we used OptiPNG [21]. For BPG we used [4] 8 1.002 1.000 1.234 1.318 1.480 2.108 1.253 1.676 1.242 1.054 0.302 the options -m 9 -e jctvc; for WebP we used -m 6 -q [4] 16 1.017 1.000 / / 1.414 1.502 1.012 2.011 1.111 / / 100. For the other formats we used default lossless options. [5] 8 1.032 1.000 1.099 1.163 1.429 1.664 1.097 1.248 1.500 1.017 0.302� [6] 8 1.003 1.000 1.040 1.081 1.282 1.441 1.074 1.168 1.225 0.980 0.263 Figure 4 shows the results; see [22] for more details.
    [Show full text]
  • Thank You for Listening to My Presentation Gif
    Thank You For Listening To My Presentation Gif Derisible and alveolar Harris parades: which Marko is Noachian enough? Benny often interknit all-over when uxorilocal slangily.Jeremy disanoints frowningly and debugs her inventions. Defeated Sherwood usually scraping some outsides or undermine She needed to my presentation gifs to acknowledge the presenters try to share the voice actor: thanks in from anywhere online presentations. Lottie support integration with you for thanks for husband through the present, a person that lay behind him that feeling of. When my presentation gifs and presenting me and graphics let me or listen in the presenters and wife. It for my peers or are telling me wide and gif plays, presenters who took it have a random relevant titles to do not affiliated with. He had for you gif by: empty pen by taking the. The presentation for listening animated gifs for watching the body is an award ceremony speech reader is serious first arabesque, he climbed to. Make your organization fulfill its vastness, then find the bed to? Music streaming video or message a sample thank your take pride in via text and widescreen view my friends and many lovely pictures to think he killed them! Also you for presentation with quotes for the speech month club is back out of people meet again till it! From you listen to thank you a presentation and presenting? But i would not, bowing people of those values. Most memorable new life happened he used in twenty strokes for some unseen animal gifs that kind. So my presentation gifs image gifs with short but an! Life go on a meme or buy sound library to gif thank you for listening to my presentation visual design type some sort a pet proposal of.
    [Show full text]
  • How to Exploit the Transferability of Learned Image Compression to Conventional Codecs
    How to Exploit the Transferability of Learned Image Compression to Conventional Codecs Jan P. Klopp Keng-Chi Liu National Taiwan University Taiwan AI Labs [email protected] [email protected] Liang-Gee Chen Shao-Yi Chien National Taiwan University [email protected] [email protected] Abstract Lossy compression optimises the objective Lossy image compression is often limited by the sim- L = R + λD (1) plicity of the chosen loss measure. Recent research sug- gests that generative adversarial networks have the ability where R and D stand for rate and distortion, respectively, to overcome this limitation and serve as a multi-modal loss, and λ controls their weight relative to each other. In prac- especially for textures. Together with learned image com- tice, computational efficiency is another constraint as at pression, these two techniques can be used to great effect least the decoder needs to process high resolutions in real- when relaxing the commonly employed tight measures of time under a limited power envelope, typically necessitating distortion. However, convolutional neural network-based dedicated hardware implementations. Requirements for the algorithms have a large computational footprint. Ideally, encoder are more relaxed, often allowing even offline en- an existing conventional codec should stay in place, ensur- coding without demanding real-time capability. ing faster adoption and adherence to a balanced computa- Recent research has developed along two lines: evolu- tional envelope. tion of exiting coding technologies, such as H264 [41] or As a possible avenue to this goal, we propose and investi- H265 [35], culminating in the most recent AV1 codec, on gate how learned image coding can be used as a surrogate the one hand.
    [Show full text]
  • The GIF That Keeps on Giving: Assignment Design with Looped Animations
    The GIF that Keeps on Giving: Assignment Design with Looped Animations http://bit.ly/gifgive While an older format (created in 1987), GIFs (Graphics Interchange Format) have experienced a resurgence in popularity and use in the past decade. Due to the recent proliferation of digital platforms that host GIFs, the format as a means of communication and as an act of composition has become naturalized and therefore invisible. The rise in their popularity makes GIFs important for scholars of digital literacy and composition to understand and utilize in their teaching and research. We argue that their playful nature, ease of use, and delivery opens GIFs up for new avenues of research, inquiry, and analysis. As such, GIFs can be meaningfully incorporated into the composition classroom as critical components of assignment design and the feedback process. In this mini-workshop, we will introduce participants to pre-existing examples of GIF assignments we have used in courses, discuss avenues for pedagogical research, guide participants in the creation of new GIFs, and facilitate participants’ efforts to design assignments that use GIFs. Workshop Agenda http://bit.ly/gifgive Introduction ● Facilitators Icebreaker (5-10): Getting to Know You ​ ​ ● Participants Overview (10): The GIF That Keeps on Giving (http://bit.ly/gifgivepres) ​ ​ ​ ​ ​ ● History (Matt) ● Integration into Current Communication (Jamie) ● Affective Nature of GIFs (Megan) ● Avenues of Inquiry ● As Digital Composition (Matt) ● Issues of Identity (Matt) ● Hierarchy/ Canon: r/highqualitygifs
    [Show full text]
  • Download Media Player Codec Pack Version 4.1 Media Player Codec Pack
    download media player codec pack version 4.1 Media Player Codec Pack. Description: In Microsoft Windows 10 it is not possible to set all file associations using an installer. Microsoft chose to block changes of file associations with the introduction of their Zune players. Third party codecs are also blocked in some instances, preventing some files from playing in the Zune players. A simple workaround for this problem is to switch playback of video and music files to Windows Media Player manually. In start menu click on the "Settings". In the "Windows Settings" window click on "System". On the "System" pane click on "Default apps". On the "Choose default applications" pane click on "Films & TV" under "Video Player". On the "Choose an application" pop up menu click on "Windows Media Player" to set Windows Media Player as the default player for video files. Footnote: The same method can be used to apply file associations for music, by simply clicking on "Groove Music" under "Media Player" instead of changing Video Player in step 4. Media Player Codec Pack Plus. Codec's Explained: A codec is a piece of software on either a device or computer capable of encoding and/or decoding video and/or audio data from files, streams and broadcasts. The word Codec is a portmanteau of ' co mpressor- dec ompressor' Compression types that you will be able to play include: x264 | x265 | h.265 | HEVC | 10bit x265 | 10bit x264 | AVCHD | AVC DivX | XviD | MP4 | MPEG4 | MPEG2 and many more. File types you will be able to play include: .bdmv | .evo | .hevc | .mkv | .avi | .flv | .webm | .mp4 | .m4v | .m4a | .ts | .ogm .ac3 | .dts | .alac | .flac | .ape | .aac | .ogg | .ofr | .mpc | .3gp and many more.
    [Show full text]
  • Encoding H.264 Video for Streaming and Progressive Download
    What is Tuning? • Disable features that: • Improve subjective video quality but • Degrade objective scores • Example: adaptive quantization – changes bit allocation over frame depending upon complexity • Improves visual quality • Looks like “error” to metrics like PSNR/VMAF What is Tuning? • Switches in encoding string that enables tuning (and disables these features) ffmpeg –input.mp4 –c:v libx264 –tune psnr output.mp4 • With x264, this disables adaptive quantization and psychovisual optimizations Why So Important • Major point of contention: • “If you’re running a test with x264 or x265, and you wish to publish PSNR or SSIM scores, you MUST use –tune PSNR or –tune SSIM, or your results will be completely invalid.” • http://x265.org/compare-video-encoders/ • Absolutely critical when comparing codecs because some may or may not enable these adjustments • You don’t have to tune in your tests; but you should address the issue and explain why you either did or didn’t Does Impact Scores • 3 mbps football (high motion, lots of detail) • PSNR • No tuning – 32.00 dB • Tuning – 32.58 dB • .58 dB • VMAF • No tuning – 71.79 • Tuning – 75.01 • Difference – over 3 VMAF points • 6 is JND, so not a huge deal • But if inconsistent between test parameters, could incorrectly show one codec (or encoding configuration) as better than the other VQMT VMAF Graph Red – tuned Green – not tuned Multiple frames with 3-4-point differentials Downward spikes represent untuned frames that metric perceives as having lower quality Tuned Not tuned Observations • Tuning
    [Show full text]
  • Screen Capture Tools to Record Online Tutorials This Document Is Made to Explain How to Use Ffmpeg and Quicktime to Record Mini Tutorials on Your Own Computer
    Screen capture tools to record online tutorials This document is made to explain how to use ffmpeg and QuickTime to record mini tutorials on your own computer. FFmpeg is a cross-platform tool available for Windows, Linux and Mac. Installation and use process depends on your operating system. This info is taken from (Bellard 2016). Quicktime Player is natively installed on most of Mac computers. This tutorial focuses on Linux and Mac. Table of content 1. Introduction.......................................................................................................................................1 2. Linux.................................................................................................................................................1 2.1. FFmpeg......................................................................................................................................1 2.1.1. installation for Linux..........................................................................................................1 2.1.1.1. Add necessary components........................................................................................1 2.1.2. Screen recording with FFmpeg..........................................................................................2 2.1.2.1. List devices to know which one to record..................................................................2 2.1.2.2. Record screen and audio from your computer...........................................................3 2.2. Kazam........................................................................................................................................4
    [Show full text]
  • Download This PDF File
    Sindh Univ. Res. Jour. (Sci. Ser.) Vol.47 (3) 531-534 (2015) I NDH NIVERSITY ESEARCH OURNAL ( CIENCE ERIES) S U R J S S Performance Analysis of Image Compression Standards with Reference to JPEG 2000 N. MINALLAH++, A. KHALIL, M. YOUNAS, M. FURQAN, M. M. BOKHARI Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar Received 12thJune 2014 and Revised 8th September 2015 Abstract: JPEG 2000 is the most widely used standard for still image coding. Some other well-known image coding techniques include JPEG, JPEG XR and WEBP. This paper provides performance evaluation of JPEG 2000 with reference to other image coding standards, such as JPEG, JPEG XR and WEBP. For the performance evaluation of JPEG 2000 with reference to JPEG, JPEG XR and WebP, we considered divers image coding scenarios such as continuous tome images, grey scale images, High Definition (HD) images, true color images and web images. Each of the considered algorithms are briefly explained followed by their performance evaluation using different quality metrics, such as Peak Signal to Noise Ratio (PSNR), Mean Square Error (MSE), Structure Similarity Index (SSIM), Bits Per Pixel (BPP), Compression Ratio (CR) and Encoding/ Decoding Complexity. The results obtained showed that choice of the each algorithm depends upon the imaging scenario and it was found that JPEG 2000 supports the widest set of features among the evaluated standards and better performance. Keywords: Analysis, Standards JPEG 2000. performance analysis, followed by Section 6 with 1. INTRODUCTION There are different standards of image compression explanation of the considered performance analysis and decompression.
    [Show full text]
  • Understanding Image Formats and When to Use Them
    Understanding Image Formats And When to Use Them Are you familiar with the extensions after your images? There are so many image formats that it’s so easy to get confused! File extensions like .jpeg, .bmp, .gif, and more can be seen after an image’s file name. Most of us disregard it, thinking there is no significance regarding these image formats. These are all different and not cross‐ compatible. These image formats have their own pros and cons. They were created for specific, yet different purposes. What’s the difference, and when is each format appropriate to use? Every graphic you see online is an image file. Most everything you see printed on paper, plastic or a t‐shirt came from an image file. These files come in a variety of formats, and each is optimized for a specific use. Using the right type for the right job means your design will come out picture perfect and just how you intended. The wrong format could mean a bad print or a poor web image, a giant download or a missing graphic in an email Most image files fit into one of two general categories—raster files and vector files—and each category has its own specific uses. This breakdown isn’t perfect. For example, certain formats can actually contain elements of both types. But this is a good place to start when thinking about which format to use for your projects. Raster Images Raster images are made up of a set grid of dots called pixels where each pixel is assigned a color.
    [Show full text]
  • Neural Multi-Scale Image Compression
    Neural Multi-scale Image Compression Ken Nakanishi 1 Shin-ichi Maeda 2 Takeru Miyato 2 Daisuke Okanohara 2 Abstract 1.00 This study presents a new lossy image compres- sion method that utilizes the multi-scale features 0.98 of natural images. Our model consists of two networks: multi-scale lossy autoencoder and par- 0.96 allel multi-scale lossless coder. The multi-scale Proposed 0.94 JPEG lossy autoencoder extracts the multi-scale image MS-SSIM WebP features to quantized variables and the parallel BPG 0.92 multi-scale lossless coder enables rapid and ac- Johnston et al. Rippel & Bourdev curate lossless coding of the quantized variables 0.90 via encoding/decoding the variables in parallel. 0.0 0.2 0.4 0.6 0.8 1.0 Our proposed model achieves comparable perfor- Bits per pixel mance to the state-of-the-art model on Kodak and RAISE-1k dataset images, and it encodes a PNG image of size 768 × 512 in 70 ms with a single Figure 1. Rate-distortion trade off curves with different methods GPU and a single CPU process and decodes it on Kodak dataset. The horizontal axis represents bits-per-pixel into a high-fidelity image in approximately 200 (bpp) and the vertical axis represents multi-scale structural similar- ms. ity (MS-SSIM). Our model achieves better or comparable bpp with respect to the state-of-the-art results (Rippel & Bourdev, 2017). 1. Introduction K Data compression for video and image data is a crucial tech- via ML algorithm is not new. The -means algorithm was nique for reducing communication traffic and saving data used for vector quantization (Gersho & Gray, 2012), and storage.
    [Show full text]
  • Spin Digital HEVC/H.265 Software Encoder (Spin Enc) Enables Ultra-High-Quality Video with the Highest Compression Level
    Spin Digital HEVC/H.265 software encoder (Spin Enc) enables ultra-high-quality video with the highest compression level. Encoding, transmission, and storage of video in 4K, 8K, and beyond are now possible with commodity computing technologies. HEVC/H.265 encoder for production, contribution, and distribution of professional video for broadcast, VoD, and creative studios. Spin Digital HEVC/H.265 encoder is ready for the next generation of high-quality video systems, providing support for Ultra HD (UHD), High Dynamic Range (HDR), High Frame Rate (HFR), Wide Color Gamut (WCG), and Virtual Reality (360° video). Product Highlights HEVC/H.265 Encoder Package • Fast offline encoding software solution • Encoder: standalone HEVC/H.265 encoder • Ready for 8K and beyond • Transcoder: HEVC/H.265 decoder and encoder • Significantly better compression and quality integrated in FFmpeg than competing encoders • SDK: encoder plugin for FFmpeg (Libavcodec) • Enables WCG and HDR with up to 12-bit video • Compatible with ARIB STD-B32 standard • Versatile high-precision pre-processing filters • Preserves color resolution with 4:2:2, 4:4:4, and RGB formats • 22.2-ch AAC audio encoding and decoding SPIN DIGITAL HEVC/H.265 ENCODER Support for the HEVC standard: Main and Main 10 profiles Range Extensions (HEVCv2) profiles ARIB STD-B32 version 3.9 Resolutions: 4K, 8K, and beyond Color formats: 4:2:0, 4:2:2, 4:4:4, RGB Bit depths: 8-, 10-, 12-bit Color spaces: BT.601, BT.709, DCI-P3, BT.2020 HDR support: ST2084 transfer function, ST2086 HDR metadata, HLG Coding
    [Show full text]