Algorithms and Methods for Video Transcoding. Robert Gordon University [Online], Phd Thesis
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
VOL. E100-C NO. 6 JUNE 2017 the Usage of This PDF File Must Comply
VOL. E100-C NO. 6 JUNE 2017 The usage of this PDF file must comply with the IEICE Provisions on Copyright. The author(s) can distribute this PDF file for research and educational (nonprofit) purposes only. Distribution by anyone other than the author(s) is prohibited. IEICE TRANS. ELECTRON., VOL.E100–C, NO.6 JUNE 2017 643 PAPER A High-Throughput and Compact Hardware Implementation for the Reconstruction Loop in HEVC Intra Encoding Yibo FAN†a), Member, Leilei HUANG†, Zheng XIE†, and Xiaoyang ZENG†, Nonmembers SUMMARY In the newly finalized video coding standard, namely high 4×4, 8×8, 16×16, 32×32 and 64×64 with 35 possible pre- efficiency video coding (HEVC), new notations like coding unit (CU), pre- diction modes in intra prediction. Although several fast diction unit (PU) and transformation unit (TU) are introduced to improve mode decision designs have been proposed, still a consid- the coding performance. As a result, the reconstruction loop in intra en- coding is heavily burdened to choose the best partitions or modes for them. erable amount of candidate PU modes, PU partitions or TU In order to solve the bottleneck problems in cycle and hardware cost, this partitions are needed to be traversed by the reconstruction paper proposed a high-throughput and compact implementation for such a loop. reconstruction loop. By “high-throughput”, it refers to that it has a fixed It can be inferred that the reconstruction loop in intra throughput of 32 pixel/cycle independent of the TU/PU size (except for 4×4 TUs). By “compact”, it refers to that it fully explores the reusability prediction has become a bottleneck in cycle and hardware between discrete cosine transform (DCT) and inverse discrete cosine trans- cost. -
Dynamic Resource Management of Network-On-Chip Platforms for Multi-Stream Video Processing
DYNAMIC RESOURCE MANAGEMENT OF NETWORK-ON-CHIP PLATFORMS FOR MULTI-STREAM VIDEO PROCESSING Hashan Roshantha Mendis Doctor of Engineering University of York Computer Science March 2017 2 Abstract This thesis considers resource management in the context of parallel multiple video stream de- coding, on multicore/many-core platforms. Such platforms have tens or hundreds of on-chip processing elements which are connected via a Network-on-Chip (NoC). Inefficient task allo- cation configurations can negatively affect the communication cost and resource contention in the platform, leading to predictability and performance issues. Efficient resource management for large-scale complex workloads is considered a challenging research problem; especially when applications such as video streaming and decoding have dynamic and unpredictable workload characteristics. For these type of applications, runtime heuristic-based task mapping techniques are required. As the application and platform size increase, decentralised resource management techniques are more desirable to overcome the reliability and performance bot- tlenecks in centralised management. In this work, several heuristic-based runtime resource management techniques, targeting real-time video decoding workloads are proposed. Firstly, two admission control approaches are proposed; one fully deterministic and highly predictable; the other is heuristic-based, which balances predictability and performance. Secondly, a pair of runtime task mapping schemes are presented, which make use of limited known application properties, communication cost and blocking-aware heuristics. Combined with the proposed deterministic admission con- troller, these techniques can provide strict timing guarantees for hard real-time streams whilst improving resource usage. The third contribution in this thesis is a distributed, bio-inspired, low-overhead, task re-allocation technique, which is used to further improve the timeliness and workload distribution of admitted soft real-time streams. -
Parameter Optimization in H.265 Rate-Distortion by Single Frame
https://doi.org/10.2352/ISSN.2470-1173.2019.11.IPAS-262 © 2019, Society for Imaging Science and Technology Parameter optimization in H.265 Rate-Distortion by single- frame semantic scene analysis Ahmed M. Hamza; University of Portsmouth Abdelrahman Abdelazim; Blackpool and the Fylde College Djamel Ait-Boudaoud; University of Portsmouth Abstract with Q being the quantization value for the source. The relation is The H.265/HEVC (High Efficiency Video Coding) codec and derived based on several assumptions about the source probability its 3D extensions have crucial rate-distortion mechanisms that distribution within the quantization intervals, and the nature of help determine coding efficiency. We have introduced in this the rate-distortion relations themselves (constantly differentiable work a new system of Lagrangian parameterization in RDO cost throughout, etc.). The value initially used for c in the literature functions, based on semantic cues in an image, starting with the was 0.85. This was modified and made adaptive in subsequent current HEVC formulation of the Lagrangian hyper-parameter standards including HEVC/H.265. heuristics. Two semantic scenery flag algorithms are presented Our work here investigates further adaptations to the rate- and tested within the Lagrangian formulation as weighted factors. distortion Lagrangian by semantic algorithms, made possible by The investigation of whether the semantic gap between the coder recent frameworks in computer vision. and the image content is holding back the block-coding mecha- nisms as a whole from achieving greater efficiency has yielded a Adaptive Lambda Hyper-parameters in HEVC positive answer. Early versions of the rate-distortion parameters in H.263 were replaced with more sophisticated models in subsequent stan- Introduction dards. -
Efficient Multi-Codec Support for OTT Services: HEVC/H.265 And/Or AV1?
Efficient Multi-Codec Support for OTT Services: HEVC/H.265 and/or AV1? Christian Timmerer†,‡, Martin Smole‡, and Christopher Mueller‡ ‡Bitmovin Inc., †Alpen-Adria-Universität San Francisco, CA, USA and Klagenfurt, Austria, EU ‡{firstname.lastname}@bitmovin.com, †{firstname.lastname}@itec.aau.at Abstract – The success of HTTP adaptive streaming is un- multiple versions (e.g., different resolutions and bitrates) and disputed and technical standards begin to converge to com- each version is divided into predefined pieces of a few sec- mon formats reducing market fragmentation. However, other onds (typically 2-10s). A client first receives a manifest de- obstacles appear in form of multiple video codecs to be sup- scribing the available content on a server, and then, the client ported in the future, which calls for an efficient multi-codec requests pieces based on its context (e.g., observed available support for over-the-top services. In this paper, we review the bandwidth, buffer status, decoding capabilities). Thus, it is state of the art of HTTP adaptive streaming formats with re- able to adapt the media presentation in a dynamic, adaptive spect to new services and video codecs from a deployment way. perspective. Our findings reveal that multi-codec support is The existing different formats use slightly different ter- inevitable for a successful deployment of today's and future minology. Adopting DASH terminology, the versions are re- services and applications. ferred to as representations and pieces are called segments, which we will use henceforth. The major differences between INTRODUCTION these formats are shown in Table 1. We note a strong differ- entiation in the manifest format and it is expected that both Today's over-the-top (OTT) services account for more than MPEG's media presentation description (MPD) and HLS's 70 percent of the internet traffic and this number is expected playlist (m3u8) will coexist at least for some time. -
CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter
CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter Prediction using Binary classification techniques A graduate project submitted in partial fulfillment of the requirements for the degree of Master of Science in Software Engineering by Alex Kit Romero May 2020 The graduate project of Alex Kit Romero is approved: ____________________________________ ____________ Dr. Katya Mkrtchyan Date ____________________________________ ____________ Dr. Kyle Dewey Date ____________________________________ ____________ Dr. John J. Noga, Chair Date California State University, Northridge ii Dedication This project is dedicated to all of the Computer Science professors that I have come in contact with other the years who have inspired and encouraged me to pursue a career in computer science. The words and wisdom of these professors are what pushed me to try harder and accomplish more than I ever thought possible. I would like to give a big thanks to the open source community and my fellow cohort of computer science co-workers for always being there with answers to my numerous questions and inquiries. Without their guidance and expertise, I could not have been successful. Lastly, I would like to thank my friends and family who have supported and uplifted me throughout the years. Thank you for believing in me and always telling me to never give up. iii Table of Contents Signature Page ................................................................................................................................ ii Dedication ..................................................................................................................................... -
Multicom Product Catalog
MULTICOM PRODUCT CATALOG Multicom Products Incorporate Leading Edge USA Technology: • USA Lasers • USA IC Chipsets • USA Amplifiers • USA Fiber Optics • USA Engineering & Designs www.multicominc.comwww.multicominc.com 800-423-2594 800-423-2594 407-331-7779 407-331-7779 ISSUE 5 1 “After spending decades with Fortune 500 companies, I decided to become an entrepreneur. It started in my garage in the fall of 1982. Over the years Multicom has matured into a multi- faceted corporation bringing the latest technology to diversified geographic and vertical markets. Global locations and markets served include the United States, its territories and 34 foreign countries. The future is exciting. The ability to add new communication products from our manufacturing facilities domestically and overseas has received enthusiastic acceptance. Hundreds of new state-of-the-art SKUs have recently been added to our over 19,000 products in stock, and more are in process. We are proud to display our current family of products with this product catalog.” Sherman Miller, Multicom Founder, President and CEO WELCOME 1982 was a significant year for Sherman Miller, Multicom’s founder and president. It was that year that he started Multicom, Inc. – an event marked by the opening of the garage door of his home. Entrepreneurs understand that unless you know your clients’ problems, unless you identify their pain, you can’t provide viable, desirable solutions. Multicom, a Service-Disabled, Veteran-owned, Small Business (SDVOSB), now reaches around the world providing innovative solutions since 1982. Since that time, Multicom has grown to be a cohesive team of experienced technicians, engineers, administrators, and sales people with decades of experience under their belt – many of our staff today are original hires and have worked at Multicom for more than a decade. -
Computational Complexity Reduction of Intra−Frame Prediction in Mpeg−2
COMPUTATIONAL COMPLEXITY REDUCTION OF INTRA-FRAME PREDICTION IN MPEG-2/H.264 VIDEO TRANSCODERS Gerardo Fern ndez-Escribano, Pedro Cuenca, Luis Orozco-Barbosa and Antonio Garrido Instituto de Investigacin en Infor tica Universidad de Castilla-La Mancha, Ca pus Universitario, 02071 Albacete, SPAIN ,gerardo,pcuenca,lorozco,antonio-.info-ab.ucl .es ABSTRACT 1 :ualit1 video at about 1/3 to 1/2 of the bitrates offered b1 the MPEG-2 is the most widely digital video-encoding standard in MPEG-2 for at. 2hese significant band0idth savings open the use nowadays. It is being widely used in the development and ar5et to ne0 products and services, including 6423 services deployment of digital T services, D D and video-on-demand at lo0er bitrates. Further ore, given the relativel1 earl1 stage of services. However, recent developments have given birth to the video services in obile phones, obile phones 0ill be one of H.264/A C, offering better bandwidth to video )uality ratios the first ar5et seg ents to adopt 6.278 video. 6o0ever, these than MPEG2. It is expected that the H.264/A C will take over gains co e 0ith a significant increase in encoding and decoding the digital video market, replacing the use of MPEG-2 in most co pleAit1 ,3-. digital video applications. The complete migration to the new Bhile the 6.278 video standard is eApected to replace video-coding algorithm will take several years given the wide MPEG-2 video over the neAt several 1ears, a significant a ount scale use of MPEG-2 in the market place today. -
Design and Implementation of a Fast HEVC Random Access Video Encoder
Design and Implementation of a Fast HEVC Random Access Video Encoder ALFREDO SCACCIALEPRE Master's Degree Project Stockholm, Sweden March 2014 XR-EE-KT 2014:003 Contents 1 Introduction 11 1.1 Background . 11 1.2 Thesis work . 12 1.2.1 Factors to consider . 12 1.3 The problem . 12 1.3.1 C65 . 12 1.4 Methods and thesis outline . 13 1.4.1 Methods . 13 1.4.2 Objective measurement . 14 1.4.3 Subjective measurement . 14 1.4.4 Test sequences . 14 1.4.5 Thesis outline . 16 1.4.6 Abbreviations . 16 2 General concepts 19 2.1 Color spaces . 19 2.2 Frames, slices and tiles . 19 2.2.1 Frames . 19 2.2.2 Slices and Tiles . 19 2.3 Predictions . 20 2.3.1 Intra . 20 2.3.2 Inter . 20 2.4 Merge mode . 20 2.4.1 Skip mode . 20 2.5 AMVP mode . 20 2.5.1 I, P and B frames . 21 2.6 CTU, CU, CTB, CB, PB, and TB . 21 2.7 Transforms . 23 2.8 Quantization . 24 2.9 Coding . 24 2.10 Reference picture lists . 24 2.11 Gop structure . 24 2.12 Temporal scalability . 25 2.13 Hierarchical B pictures . 25 2.14 Decoded picture buffer (DPB) . 25 2.15 Low delay and random access configurations . 26 2.16 H.264 and its encoders . 26 2.16.1 H.264 . 26 1 2 CONTENTS 3 Preliminary tests 27 3.1 Speed - quality considerations . 27 3.1.1 Interactive applications . 27 3.1.2 Entertainment applications . -
Fast Coding Unit Encoding Mechanism for Low Complexity Video Coding
RESEARCH ARTICLE Fast Coding Unit Encoding Mechanism for Low Complexity Video Coding Yuan Gao1,2,3☯, Pengyu Liu1,2,3☯, Yueying Wu1,2,3, Kebin Jia1,2,3*, Guandong Gao1,2,3 1 Beijing Advanced Innovation Center for Future Internet Technology, Beijing University of Technology, Beijing, China, 2 Beijing Laboratory of Advanced Information Networks, Beijing, China, 3 College of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China ☯ These authors contributed equally to this work. * [email protected] Abstract In high efficiency video coding (HEVC), coding tree contributes to excellent compression performance. However, coding tree brings extremely high computational complexity. Inno- vative works for improving coding tree to further reduce encoding time are stated in this OPEN ACCESS paper. A novel low complexity coding tree mechanism is proposed for HEVC fast coding Citation: Gao Y, Liu P, Wu Y, Jia K, Gao G (2016) unit (CU) encoding. Firstly, this paper makes an in-depth study of the relationship among Fast Coding Unit Encoding Mechanism for Low CU distribution, quantization parameter (QP) and content change (CC). Secondly, a CU Complexity Video Coding. PLoS ONE 11(3): coding tree probability model is proposed for modeling and predicting CU distribution. Even- e0151689. doi:10.1371/journal.pone.0151689 tually, a CU coding tree probability update is proposed, aiming to address probabilistic Editor: You Yang, Huazhong University of Science model distortion problems caused by CC. Experimental results show that the proposed low and Technology, CHINA complexity CU coding tree mechanism significantly reduces encoding time by 27% for lossy Received: September 23, 2015 coding and 42% for visually lossless coding and lossless coding. -
The Essentials of Video Transcoding Enabling Acquisition, Interoperability, and Distribution in Digital Media
Telestream Whitepaper The Essentials of Video Transcoding Enabling Acquisition, Interoperability, and Distribution in Digital Media The intent of this paper is to Contents provide an overview of the Introduction Page 1 transcoding basics you’ll need to Digital media formats Page 2 know in order to make informed The essence of compression Page 3 decisions about implementing The right tool for the job Page 4 transcoding in your workflow. That Bridging the gaps Page 5 involves explaining the elements of (media exchange between diverse systems) the various types of digital media Transcoding workflows Page 6 files used for different purposes, Conclusion Page 7 how transcoding works upon those elements, what challenges might arise in moving from one Introduction format to another, and what It’s been more than three decades now since digital media emerged from the workflows might be most effective lab into real-world applications, igniting a communications revolution that for transcoding in various common continues to play out today. First up was the Compact Disc, featuring situations. compression-free LPCM digital audio that was designed to fully reconstruct the source at playback. With digital video, on the other hand, it was clear from the outset that most recording, storage, and playback situations wouldn’t have the data capacity required to describe a series of images with complete accuracy. And so the race was on to develop compression/decompression algorithms (codecs) that could adequately capture a video source while requiring as little data bandwidth as possible. 1 Telestream Whitepaper The good news is that astonishing progress has been Transcoding bypasses this tedious, inefficient scenario. -
Conversion of MP3 to AAC in the Compressed Domain
Conversion of MP3 to AAC in the Compressed Domain Koichi Takagi Satoshi Miyaji Shigeyuki Sakazawa Yasuhiro Takishima KDDI R&D Labs. Inc. Fujimino-shi, Saitama 356-8502 Japan Abstract— In this paper, we propose an efficient algorithm transcoding, even in transcoding for bit rate conversion where the for conversion of an MP3 stream into AAC. Generally, this kind encoding method is not changed. of conversion, transcoding, requires full decoding and re- Therefore, in this paper, we propose a novel algorithm for encoding. However, the re-encoding based transcoding process conversion of MP3 to AAC in the compressed data domain with may result in degradation of quality and take longer than the least quality degradation. We analyzed the encoding parameters encoding from a PCM signal. This paper proposes a method to of MP3 and AAC, which consists of side information, scale factors, inherit the frame structure and quantization scale from MP3 to and MDCT coefficients, and then tried to figure out the differences AAC. This enables a reduction in the iteration process, which and similarities. Consequently, we realized faster conversion of requires the most time in the AAC encoding process, without MP3 to AAC compared with the simple transcoding technique. incurring degradation in quality. Experimental results show that the proposed method can perform bitstream domain transcoding This paper is organized as follows. Section II provides an at high speed while maintaining a high level of audio quality. overview of the related techniques and outlines the proposed method. In Section III, performance of the proposed algorithm is Keywords—MP3; AAC; Transcoding; Audio conversion; Scale compared with the existing transcoding technique. -
JPEG 2000 for Video Archiving
The Pros and Cons of JPEG 2000 for Video Archiving Katty Van Mele November, 2010 Overview • Introduction – Current situation – Multiple challenges • Archiving challenges for cinema and video content • JPEG 2000 for Video Archiving • intoPIX Solutions • Conclusions INTOPIX PRIVATE & CONFIDENTIAL © 2010 JPEG 2000 SOLUTIONS 2 Current Situation • Most museums, film archiving and broadcast organizations – digitizing available content considered or initiated • Both movie content (reels) and analog video content (tapes) – Digitization process and constraints are very different. – More than 10.000.000 Hours Film (analog = film) • 30 to 40 % will disappear in the next 10 years ( Vinager syndrom) • Digitization process is complex – More than 6.000.000H ? Video ( 90% analog = tape) • x% will disappear because of the magnetic tape (binder) • Natural digitization process taking place due to the technology evolution. • Technical constraints are easier. INTOPIX PRIVATE & CONFIDENTIAL © 2010 JPEG 2000 SOLUTIONS 3 Multiple challenges • The goal of the digitization process : – Ensure the long term preservation of the content – Ensure the sharing and commercialization of the content. • Based on these different viewpoints and needs – Different technical challenges and choices – Different workflows utilized – Different commercial constraints – Different cultural and legal issues INTOPIX PRIVATE & CONFIDENTIAL © 2010 J PEG 2000 SOLUTIONS 4 Overview • Introduction • Archiving challenges for cinema and video content – General archiving concerns – Benefits of