Compressing and Companding High Dynamic Range Images with Subband Architectures
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
A Multidimensional Companding Scheme for Source Coding Witha Perceptually Relevant Distortion Measure
A MULTIDIMENSIONAL COMPANDING SCHEME FOR SOURCE CODING WITHA PERCEPTUALLY RELEVANT DISTORTION MEASURE J. Crespo, P. Aguiar R. Heusdens Department of Electrical and Computer Engineering Department of Mediamatics Instituto Superior Técnico Technische Universiteit Delft Lisboa, Portugal Delft, The Netherlands ABSTRACT coding mechanisms based on quantization, where less percep- tually important information is quantized more roughly and In this paper, we develop a multidimensional companding vice-versa, and lossless coding of the symbols emitted by the scheme for the preceptual distortion measure by S. van de Par quantizer to reduce statistical redundancy. In the quantization [1]. The scheme is asymptotically optimal in the sense that process of the (en)coder, it is desirable to quantize the infor- it has a vanishing rate-loss with increasing vector dimension. mation extracted from the signal in a rate-distortion optimal The compressor operates in the frequency domain: in its sim- sense, i.e., in a way that minimizes the perceptual distortion plest form, it pointwise multiplies the Discrete Fourier Trans- experienced by the user subject to the constraint of a certain form (DFT) of the windowed input signal by the square-root available bitrate. of the inverse of the masking threshold, and then goes back into the time domain with the inverse DFT. The expander is Due to its mathematical tractability, the Mean Square based on numerical methods: we do one iteration in a fixed- Error (MSE) is the elected choice for the distortion measure point equation, and then we fine-tune the result using Broy- in many coding applications. In audio coding, however, the den’s method. -
Digital Speech Processing— Lecture 17
Digital Speech Processing— Lecture 17 Speech Coding Methods Based on Speech Models 1 Waveform Coding versus Block Processing • Waveform coding – sample-by-sample matching of waveforms – coding quality measured using SNR • Source modeling (block processing) – block processing of signal => vector of outputs every block – overlapped blocks Block 1 Block 2 Block 3 2 Model-Based Speech Coding • we’ve carried waveform coding based on optimizing and maximizing SNR about as far as possible – achieved bit rate reductions on the order of 4:1 (i.e., from 128 Kbps PCM to 32 Kbps ADPCM) at the same time achieving toll quality SNR for telephone-bandwidth speech • to lower bit rate further without reducing speech quality, we need to exploit features of the speech production model, including: – source modeling – spectrum modeling – use of codebook methods for coding efficiency • we also need a new way of comparing performance of different waveform and model-based coding methods – an objective measure, like SNR, isn’t an appropriate measure for model- based coders since they operate on blocks of speech and don’t follow the waveform on a sample-by-sample basis – new subjective measures need to be used that measure user-perceived quality, intelligibility, and robustness to multiple factors 3 Topics Covered in this Lecture • Enhancements for ADPCM Coders – pitch prediction – noise shaping • Analysis-by-Synthesis Speech Coders – multipulse linear prediction coder (MPLPC) – code-excited linear prediction (CELP) • Open-Loop Speech Coders – two-state excitation -
Audio Coding for Digital Broadcasting
Recommendation ITU-R BS.1196-7 (01/2019) Audio coding for digital broadcasting BS Series Broadcasting service (sound) ii Rec. ITU-R BS.1196-7 Foreword The role of the Radiocommunication Sector is to ensure the rational, equitable, efficient and economical use of the radio- frequency spectrum by all radiocommunication services, including satellite services, and carry out studies without limit of frequency range on the basis of which Recommendations are adopted. The regulatory and policy functions of the Radiocommunication Sector are performed by World and Regional Radiocommunication Conferences and Radiocommunication Assemblies supported by Study Groups. Policy on Intellectual Property Right (IPR) ITU-R policy on IPR is described in the Common Patent Policy for ITU-T/ITU-R/ISO/IEC referenced in Resolution ITU-R 1. Forms to be used for the submission of patent statements and licensing declarations by patent holders are available from http://www.itu.int/ITU-R/go/patents/en where the Guidelines for Implementation of the Common Patent Policy for ITU-T/ITU-R/ISO/IEC and the ITU-R patent information database can also be found. Series of ITU-R Recommendations (Also available online at http://www.itu.int/publ/R-REC/en) Series Title BO Satellite delivery BR Recording for production, archival and play-out; film for television BS Broadcasting service (sound) BT Broadcasting service (television) F Fixed service M Mobile, radiodetermination, amateur and related satellite services P Radiowave propagation RA Radio astronomy RS Remote sensing systems S Fixed-satellite service SA Space applications and meteorology SF Frequency sharing and coordination between fixed-satellite and fixed service systems SM Spectrum management SNG Satellite news gathering TF Time signals and frequency standards emissions V Vocabulary and related subjects Note: This ITU-R Recommendation was approved in English under the procedure detailed in Resolution ITU-R 1. -
(A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format. -
Ultra HD Playout & Delivery
Ultra HD Playout & Delivery SOLUTION BRIEF The next major advancement in television has arrived: Ultra HD. By 2020 more than 40 million consumers around the world are projected to be watching close to 250 linear UHD channels, a figure that doesn’t include VOD (video-on-demand) or OTT (over-the-top) UHD services. A complete UHD playout and delivery solution from Harmonic will help you to meet that demand. 4K UHD delivers a screen resolution four times that of 1080p60. Not to be confused with the 4K digital cinema format, a professional production and cinema standard with a resolution of 4096 x 2160, UHD is a broadcast and OTT standard with a video resolution of 3840 x 2160 pixels at 24/30 fps and 8-bit color sampling. Second-generation UHD specifications will reach a frame rate of 50/60 fps at 10 bits. When combined with advanced technologies such as high dynamic range (HDR) and wide color gamut (WCG), the home viewing experience will be unlike anything previously available. The expected demand for UHD content will include all types of programming, from VOD movie channels to live global sporting events such as the World Cup and Olympics. UHD-native channel deployments are already on the rise, including the first linear UHD channel in North America, NASA TV UHD, launched in 2015 via a partnership between Harmonic and NASA’s Marshall Space Flight Center. The channel highlights incredible imagery from the U.S. space program using an end-to-end UHD playout, encoding and delivery solution from Harmonic. The Harmonic UHD solution incorporates the latest developments in IP networking and compression technology, including HEVC (High- Efficiency Video Coding) signal transport and HDR enhancement. -
MC14SM5567 PCM Codec-Filter
Product Preview Freescale Semiconductor, Inc. MC14SM5567/D Rev. 0, 4/2002 MC14SM5567 PCM Codec-Filter The MC14SM5567 is a per channel PCM Codec-Filter, designed to operate in both synchronous and asynchronous applications. This device 20 performs the voice digitization and reconstruction as well as the band 1 limiting and smoothing required for (A-Law) PCM systems. DW SUFFIX This device has an input operational amplifier whose output is the input SOG PACKAGE CASE 751D to the encoder section. The encoder section immediately low-pass filters the analog signal with an RC filter to eliminate very-high-frequency noise from being modulated down to the pass band by the switched capacitor filter. From the active R-C filter, the analog signal is converted to a differential signal. From this point, all analog signal processing is done differentially. This allows processing of an analog signal that is twice the amplitude allowed by a single-ended design, which reduces the significance of noise to both the inverted and non-inverted signal paths. Another advantage of this differential design is that noise injected via the power supplies is a common mode signal that is cancelled when the inverted and non-inverted signals are recombined. This dramatically improves the power supply rejection ratio. After the differential converter, a differential switched capacitor filter band passes the analog signal from 200 Hz to 3400 Hz before the signal is digitized by the differential compressing A/D converter. The decoder accepts PCM data and expands it using a differential D/A converter. The output of the D/A is low-pass filtered at 3400 Hz and sinX/X compensated by a differential switched capacitor filter. -
TMS320C6000 U-Law and A-Law Companding with Software Or The
Application Report SPRA634 - April 2000 TMS320C6000t µ-Law and A-Law Companding with Software or the McBSP Mark A. Castellano Digital Signal Processing Solutions Todd Hiers Rebecca Ma ABSTRACT This document describes how to perform data companding with the TMS320C6000 digital signal processors (DSP). Companding refers to the compression and expansion of transfer data before and after transmission, respectively. The multichannel buffered serial port (McBSP) in the TMS320C6000 supports two companding formats: µ-Law and A-Law. Both companding formats are specified in the CCITT G.711 recommendation [1]. This application report discusses how to use the McBSP to perform data companding in both the µ-Law and A-Law formats. This document also discusses how to perform companding of data not coming into the device but instead located in a buffer in processor memory. Sample McBSP setup codes are provided for reference. In addition, this application report discusses µ-Law and A-Law companding in software. The appendix provides software companding codes and a brief discussion on the companding theory. Contents 1 Design Problem . 2 2 Overview . 2 3 Companding With the McBSP. 3 3.1 µ-Law Companding . 3 3.1.1 McBSP Register Configuration for µ-Law Companding. 4 3.2 A-Law Companding. 4 3.2.1 McBSP Register Configuration for A-Law Companding. 4 3.3 Companding Internal Data. 5 3.3.1 Non-DLB Mode. 5 3.3.2 DLB Mode . 6 3.4 Sample C Functions. 6 4 Companding With Software. 6 5 Conclusion . 7 6 References . 7 TMS320C6000 is a trademark of Texas Instruments Incorporated. -
Bit Rate Adaptation Using Linear Quadratic Optimization for Mobile Video Streaming
applied sciences Article Bit Rate Adaptation Using Linear Quadratic Optimization for Mobile Video Streaming Young-myoung Kang 1 and Yeon-sup Lim 2,* 1 Network Business, Samsung Electronics, Suwon 16677, Korea; [email protected] 2 Department of Convergence Security Engineering, Sungshin Women’s University, Seoul 02844, Korea * Correspondence: [email protected]; Tel.: +82-2-920-7144 Abstract: Video streaming application such as Youtube is one of the most popular mobile applications. To adjust the quality of video for available network bandwidth, a streaming server provides multiple representations of video of which bit rate has different bandwidth requirements. A streaming client utilizes an adaptive bit rate (ABR) scheme to select a proper representation that the network can support. However, in mobile environments, incorrect decisions of an ABR scheme often cause playback stalls that significantly degrade the quality of user experience, which can easily happen due to network dynamics. In this work, we propose a control theory (Linear Quadratic Optimization)- based ABR scheme to enhance the quality of experience in mobile video streaming. Our simulation study shows that our proposed ABR scheme successfully mitigates and shortens playback stalls while preserving the similar quality of streaming video compared to the state-of-the-art ABR schemes. Keywords: dynamic adaptive streaming over HTTP; adaptive bit rate scheme; control theory 1. Introduction Video streaming constitutes a large fraction of current Internet traffic. In particular, video streaming accounts for more than 65% of world-wide mobile downstream traffic [1]. Commercial streaming services such as YouTube and Netflix are implemented based on Citation: Kang, Y.-m.; Lim, Y.-s. -
A Predictive H.263 Bit-Rate Control Scheme Based on Scene
A PRJ3DICTIVE H.263 BIT-RATE CONTROL SCHEME BASED ON SCENE INFORMATION Pohsiang Hsu and K. J. Ray Liu Department of Electrical and Computer Engineering University of Maryland at College Park College Park, Maryland, USA ABSTRACT is typically constant bit rate (e.g. PSTN). Under this cir- cumstance, the encoder’s output bit-rate must be regulated For many real-time visual communication applications such to meet the transmission bandwidth in order to guaran- as video telephony, the transmission channel in used is typ- tee a constant frame rate and delay. Given the channel ically constant bit rate (e.g. PSTN). Under this circum- rate and a target frame rate, we derive the target bits per stance, the encoder’s output bit-rate must be regulated to frame by distribute the channel rate equally to each hme. meet the transmission bandwidth in order to guarantee a Ffate control is typically based on varying the quantization constant frame rate and delay. In this work, we propose a paraneter to meet the target bit budget for each frame. new predictive rate control scheme based on the activity of Many rate control scheme for the block-based DCT/motion the scene to be coded for H.263 encoder compensation video encoders have been proposed in the past. These rate control algorithms are typically focused on MPEG video coding standards. Rate-Distortion based rate 1. INTRODUCTION control for MPEG using Lagrangian optimization [4] have been proposed in the literature. These methods require ex- In the past few years, emerging demands for digital video tensive,rate-distortion measurements and are unsuitable for communication applications such as video conferencing, video real time applications. -
Codec Is a Portmanteau of Either
What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format. -
JPEG-HDR: a Backwards-Compatible, High Dynamic Range Extension to JPEG
JPEG-HDR: A Backwards-Compatible, High Dynamic Range Extension to JPEG Greg Ward Maryann Simmons BrightSide Technologies Walt Disney Feature Animation Abstract What we really need for HDR digital imaging is a compact The transition from traditional 24-bit RGB to high dynamic range representation that looks and displays like an output-referred (HDR) images is hindered by excessively large file formats with JPEG, but holds the extra information needed to enable it as a no backwards compatibility. In this paper, we demonstrate a scene-referred standard. The next generation of HDR cameras simple approach to HDR encoding that parallels the evolution of will then be able to write to this format without fear that the color television from its grayscale beginnings. A tone-mapped software on the receiving end won’t know what to do with it. version of each HDR original is accompanied by restorative Conventional image manipulation and display software will see information carried in a subband of a standard output-referred only the tone-mapped version of the image, gaining some benefit image. This subband contains a compressed ratio image, which from the HDR capture due to its better exposure. HDR-enabled when multiplied by the tone-mapped foreground, recovers the software will have full access to the original dynamic range HDR original. The tone-mapped image data is also compressed, recorded by the camera, permitting large exposure shifts and and the composite is delivered in a standard JPEG wrapper. To contrast manipulation during image editing in an extended color naïve software, the image looks like any other, and displays as a gamut. -
Senior Tech Tuesday 11 Iphone Camera App Tour
More Info: Links.SeniorTechClub.com/Tuesdays Welcome to Senior Tech Tuesday Live: 1/19/2021 Photography Series Tour of the Camera App Don Frederiksen Our Tuesday Focus ➢A Tour of the Camera App - Getting Beyond Point & Shoot ➢Selfies ➢Flash ➢Zoom ➢HDR is Good ➢What is a Live Photo ➢Focus & Exposure ➢Filters ➢Better iPhone Photography Tip ➢What’s Next www.SeniorTechClub.com Zoom Setup Zoom Speaker View Computer iPad or laptop Laptop www.SeniorTechClub.com Our Learning Tools ◦ Zoom Video Platform ◦ Slides – Downloadable from class page ◦ Demonstrations ◦ Your Questions ◦ “Hey Don” or Chat ◦ Email: [email protected] ◦ Online Class Page at: Links.SeniorTechClub.com/STT11 ◦ Tuesdays Page for Future Topics Links.SeniorTechClub.com/tuesdays www.SeniorTechClub.com Our Class Page Find our class page at: ◦ Links.SeniorTechClub.com/STT11 ◦ Bottom of the Tuesday page Purpose of the Class Page ◦ Relevant Information ◦ Fill in gaps from the online session ◦ Participate w/o being online www.SeniorTechClub.com Tour of our Class Page Slide Deck Video Archive Links & Resources Recipes & Nuggets www.SeniorTechClub.com A Tour of the Camera App Poll www.SeniorTechClub.com A Tour of the Camera App - Classic www.SeniorTechClub.com A Tour of the Camera App - New www.SeniorTechClub.com Switch Camera - Selfie Reminder: Long Press Shortcut Zoom Two kinds of zoom on iPhones Optical Zoom via a Lens Zoom Digital Zoom via a Pinch Better to zoom with your feet then digital Zoom Digital Zoom – Pinch Screen in or out Optical ◦ If your iPhone has more than one lens, tap: ◦ .5x or 1x or 2x (varies by model) Flash Focus & Exposure HDR Photos High Dynamic Range iPhone takes multiple photos to balance shadows and highlights.