GPU Implementation of SAO Filter
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
(A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format. -
Ffmpeg Documentation Table of Contents
ffmpeg Documentation Table of Contents 1 Synopsis 2 Description 3 Detailed description 3.1 Filtering 3.1.1 Simple filtergraphs 3.1.2 Complex filtergraphs 3.2 Stream copy 4 Stream selection 5 Options 5.1 Stream specifiers 5.2 Generic options 5.3 AVOptions 5.4 Main options 5.5 Video Options 5.6 Advanced Video options 5.7 Audio Options 5.8 Advanced Audio options 5.9 Subtitle options 5.10 Advanced Subtitle options 5.11 Advanced options 5.12 Preset files 5.12.1 ffpreset files 5.12.2 avpreset files 6 Examples 6.1 Video and Audio grabbing 6.2 X11 grabbing 6.3 Video and Audio file format conversion 7 Syntax 7.1 Quoting and escaping 7.1.1 Examples 7.2 Date 7.3 Time duration 7.3.1 Examples 7.4 Video size 7.5 Video rate 7.6 Ratio 7.7 Color 7.8 Channel Layout 8 Expression Evaluation 9 OpenCL Options 10 Codec Options 11 Decoders 12 Video Decoders 12.1 hevc 12.2 rawvideo 12.2.1 Options 13 Audio Decoders 13.1 ac3 13.1.1 AC-3 Decoder Options 13.2 flac 13.2.1 FLAC Decoder options 13.3 ffwavesynth 13.4 libcelt 13.5 libgsm 13.6 libilbc 13.6.1 Options 13.7 libopencore-amrnb 13.8 libopencore-amrwb 13.9 libopus 14 Subtitles Decoders 14.1 dvbsub 14.1.1 Options 14.2 dvdsub 14.2.1 Options 14.3 libzvbi-teletext 14.3.1 Options 15 Encoders 16 Audio Encoders 16.1 aac 16.1.1 Options 16.2 ac3 and ac3_fixed 16.2.1 AC-3 Metadata 16.2.1.1 Metadata Control Options 16.2.1.2 Downmix Levels 16.2.1.3 Audio Production Information 16.2.1.4 Other Metadata Options 16.2.2 Extended Bitstream Information 16.2.2.1 Extended Bitstream Information - Part 1 16.2.2.2 Extended Bitstream -
FOSS Open Standards/Comparison of File Formats 1 FOSS Open Standards/Comparison of File Formats
FOSS Open Standards/Comparison of File Formats 1 FOSS Open Standards/Comparison of File Formats FOSS Open Standards Foreword — Preface — List of Acronyms — Introduction — Importance and Benefits of Open Standards — Standard-Setting and Open Standards — Some Important Open Standards — Comparison of File Formats — Standards and Internationalization/Localization of Software — Patents in Standards — The Linux Standard Base — Government/National Open Standards Policies and Initiatives — Conclusion — Annexure: Comments on RAND, as seen from both sides — Glossary — About the Author — Acknowledgements — About APDIP — About IOSN This section will list, compare and discuss the degrees of openness and/or lack of openness of several popular file formats. These include file formats for the following application areas: 1. office applications 2. graphics 3. audio 4. video Office Applications File Formats Microsoft Office Formats Currently, the most popular office application is Microsoft Office (MS Office). This suite of office software comprises mainly (depending on the type of suite purchased) word processing (MS Word), spreadsheet (MS Excel) and presentation software (MS PowerPoint). Up till version 10 (MS Office 10), the file formats used were binary (i.e. non-plain text) in nature and not publicly published. MS Word, MS Excel and MS PowerPoint use the binary DOC, XLS and PPT formats, respectively, and these are proprietary formats, being owned and controlled entirely by Microsoft. The file formats for these applications are widely used due to the popularity of MS Office. Other software not from Microsoft, e.g. OpenOffice.org or StarOffice, are able to read and write files using these proprietary formats but the compatibility is incomplete. -
Perceptual Video Quality Assessment and Analysis Using Adaptive Content Dynamics
PERCEPTUAL VIDEO QUALITY ASSESSMENT AND ANALYSIS USING ADAPTIVE CONTENT DYNAMICS A Dissertation Presented to The Academic Faculty by Mohammed A. Aabed In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the School of Electrical and Computer Engineering Georgia Institute of Technology May 2017 Copyright c 2017 by Mohammed A. Aabed PERCEPTUAL VIDEO QUALITY ASSESSMENT AND ANALYSIS USING ADAPTIVE CONTENT DYNAMICS Approved by: Professor Ghassan I. AlRegib, Advisor Professor Elliot Moore II School of Electrical and Computer School of Electrical and Computer Engineering Engineering Georgia Institute of Technology Georgia Institute of Technology Professor Anthony J. Yezzi Professor Berdinus A. Bras School of Electrical and Computer The George W. Woodruff School of Engineering Mechanical Engineering Georgia Institute of Technology Georgia Institute of Technology Professor David V. Anderson Date Approved: December 21st, 2016 School of Electrical and Computer Engineering Georgia Institute of Technology In the memory of my mother, to my father, and to my brothers iii ACKNOWLEDGEMENTS I would like to express my utmost gratitude to my advisor, Prof. Ghassan AlRegib, a man whose guidance, patience and generosity have always made him a trusted mentor and a dear close friend. His continuous teaching, friendship, support, and understanding throughout this process have allowed me to learn, innovate, enjoy and made it a rewarding experience. I would like to thank Dr. Anthony J. Yezzi and Prof. David V. Anderson for their time, valuable comments and for agreeing to serve as reading remember of my dissertation committee. I would like to also thank Prof. Elliot Moore II and Prof. Bert Bras for their time, comments, and for agreeing to serve on my defense committee. -
Fast VP9-To-AV1 Transcoding Based on Block Partitioning Inheritance
Fast VP9-to-AV1 Transcoding based on Block Partitioning Inheritance Alex Borges, Daniel Palomino, Bruno Zatt, Marcelo Porto, Guilherme Correa Video Technology Research Group (ViTech) Graduate Program in Computing (PPGC), Federal University of Pelotas (UFPel), Brazil {amborges, dpalomino, zatt, porto, gcorrea}@inf.ufpel.edu.br Abstract— This paper proposes a fast VP9-to-AV1 video compared to VP9 (considering the same image quality). This transcoding algorithm based on block partitioning inheritance. represents an average superiority of 22% for AV1 over VP9, The proposed algorithm relies on the reuse of VP9 block an economy of 1/5 in storage resources and other costs partitioning during the AV1 re-encoding process. This way, the involving video transmission services. exhaustive search for the best block size option is avoided to save encoding time. The reuse of VP9 block partitioning is proposed Video transcoding is the process that converts from one based on a statiscal analysis that shows the relation of block video bitstream format to the same format with different parititioning sizes between VP9 and AV1. The analysis configurations (homogeneous transcoding) or to another demontrates that there is a high probability of the AV1 encoding bitstream format (heterogeneous transcoding), as presented in process to choose block sizes of the same size as in the VP9 Fig. 1. With the advent of AV1, converting legacy content encoding. Experimental results show that the proposed from previous formats, such as VP9, becomes an essential task algorithm is able to accelerate the VP9-to-AV1 transcoding for service providers that intend to benefit from its process by 28% on average at the cost of only 4% increase in the compression efficiency. -
High-Throughput HEVC CABAC Decoding
High-Throughput HEVC CABAC Decoding vorgelegt von M.Sc. Philipp Habermann an der Fakultät IV – Elektrotechnik und Informatik der Technischen Universität Berlin zur Erlangung des akademischen Grades Doktor der Ingenieurwissenschaften - Dr.-Ing. - genehmigte Dissertation Promotionsausschuss: Vorsitzender: Prof. Giuseppe Caire, Ph.D. Gutachter: Prof. Dr.-Ing. Thomas Sikora Gutachter: Prof. Dr.-Ing. Benno Stabernack Gutachter: Dr.-Ing. Jens Brandenburg Tag der wissenschaftlichen Aussprache: 02.11.2020 Berlin 2020 Abstract Video applications have emerged in various felds of our everyday life. They have con- tinuously enhanced the user experience in entertainment and communication services. All this would not have been possible without the evolution of video compression stan- dards and computer architectures over the last decades. Modern video codecs employ sophisticated algorithms to transform raw video data to an intermediate representation consisting of syntax elements, which allows enormous compression rates before recon- structing the video with minimal objective quality losses compared to the original video. Modern computer architectures lay the foundation for these computationally intensive tasks. They provide multiple cores and specialized vector architectures to exploit the massive amount of parallelism that can be found in video applications. Customized hardware solutions follow the same principles. Parallel processing is essential to satisfy real-time performance constraints while optimizing energy efciency, the latter being the most important design goal for mobile devices. One of the main tasks in modern video compression standards implements a highly sequential algorithm and lacks data-level parallelism in contrast to all other compute- intensive tasks: Context-based Adaptive Binary Arithmetic Coding (CABAC). It is the entropy coding module in the state-of-the-art High Efciency Video Coding (HEVC) standard and also its successor Versatile Video Coding. -
Standards and Protocols for Visual Materials
Ref. Ares(2017)5589715 - 15/11/2017 Project Number: 693349 Standard and protocols for visual materials Giuseppe Airò Farulla, Pietro Braione, Marco Indaco, Mauro Pezzè, Paolo Prinetto CINI, Aster SpA Version 1.2 – 15/11/2017 Lead contractor: Universitat Pompeu Fabra Contact person: Josep Quer Departament de Traducció i Ciències del Llenguatge Roc Boronat, 138 08018 Barcelona Spain Tel. +34-93-542-11-36 Fax. +34-93-542-16-17 E-mail: [email protected] Work package: 3 Affected tasks: 3.5 Nature of deliverable1 R P D O Dissemination level2 PU PP RE CO 1 R: Report, P: Prototype, D: Demonstrator, O: Other 2 PU: public, PP: Restricted to other programme participants (including the commission services), RE Restrict- ed to a group specified by the consortium (including the Commission services), CO Confidential, only for members of the consor- tium (Including the Commission services) Deliverable D3.11: Standard and protocols for visual materials Page 2 of 45 COPYRIGHT © COPYRIGHT SIGN-HUB Consortium consisting of: UNIVERSITAT POMPEU FABRA Spain UNIVERSITA' DEGLI STUDI DI MILANO-BICOCCA Italy UNIVERSITEIT VAN AMSTERDAM Netherlands BOGAZICI UNIVERSITESI Turkey CENTRE NATIONAL DE LA RECHERCHE SCIENTIFIQUE France UNIVERSITE PARIS DIDEROT - PARIS 7 France TEL AVIV UNIVERSITY Israel GEORG-AUGUST-UNIVERSITAET GÖTTINGEN Germany UNIVERSITA CA' FOSCARI VENEZIA Italy CONSORZIO INTERUNIVERSITARIO NAZIONALE PER L'INFORMATICA Italy CONFIDENTIALITY NOTE THIS DOCUMENT MAY NOT BE COPIED, REPRODUCED, OR MODIFIED IN WHOLE OR IN PART FOR ANY PURPOSE WITHOUT WRITTEN PERMISSION FROM THE SIGN-HUB CONSORTIUM. IN ADDITION TO SUCH WRITTEN PERMISSION TO COPY, REPRODUCE, OR MODIFY THIS DOCUMENT IN WHOLE OR PART, AN ACKNOWLEDGMENT OF THE AUTHORS OF THE DOCUMENT AND ALL APPLICABLE PORTIONS OF THE COPYRIGHT NOTICE MUST BE CLEARLY REFERENCED ALL RIGHTS RESERVED. -
A Review of Emerging Video Codecs: Challenges and Opportunities
A Review of Emerging Video Codecs: Challenges and Opportunities Amal Punchihewa, Donald Bailey Centre for Research in Image and Signal Processing Massey University Palmerston North, New Zealand [email protected], [email protected] Abstract—This paper presents a review of video codecs that high-efficiency video coding HEVC or H265 and Versatile are in use and currently being developed, the codec development Video Coding (VVC) or H266. Online or streaming service process, current trends, challenges and opportunities for theproviders are also contributing to compression research in the research community. There is a paradigm shift in video coding areas of efficiency and performance evaluation. This paper standards. Concurrently, multiple video standards arebriefly discusses and presents some of the research in standardised by standardising organisations. At the same time,developing next-generation codecs (VVC-Versatile Video royalty free video compression standards are being developed Coding, EVC-Enhancement Video Coding, LCEVC-Low and standardised. Introduction of enhancement-layer-based Complexity Enhancement Video Coding) and needs of the coding standards will extend the lifetime of legacy video codecs industry. However, those multiple codecs where some are finding middle ground in improved coding efficiency, royalty-free have disturbed the ecosystem. This is another computational complexity and power requirements. The video coding landscape is changing that is challenged by emergence disruption of that broadcasters and media industry need to multiple video coding standards for different use cases. Thesemanage in business and digital transformation. may offer some opportunities for coding industry, especially for Compression technologies have been an enabler for the New Zealand researchers serving niche markets in video games, broadcast industry starting with analogue television. -
Reconstructing Compressed Photo and Video Data
UCAM-CL-TR-813 Technical Report ISSN 1476-2986 Number 813 Computer Laboratory Reconstructing compressed photo and video data Andrew B. Lewis February 2012 15 JJ Thomson Avenue Cambridge CB3 0FD United Kingdom phone +44 1223 763500 http://www.cl.cam.ac.uk/ c 2012 Andrew B. Lewis This technical report is based on a dissertation submitted June 2011 by the author for the degree of Doctor of Philosophy to the University of Cambridge, Trinity College. Some figures in this document are best viewed in colour. If you received a black-and-white copy, please consult the online version if necessary. Technical reports published by the University of Cambridge Computer Laboratory are freely available via the Internet: http://www.cl.cam.ac.uk/techreports/ ISSN 1476-2986 Summary Forensic investigators sometimes need to verify the integrity and processing history of digital photos and videos. The multitude of storage formats and devices they need to access also presents a challenge for evidence recovery. This thesis explores how visual data files can be recovered and analysed in scenarios where they have been stored in the JPEG or H.264 (MPEG-4 AVC) compression formats. My techniques make use of low-level details of lossy compression algorithms in order to tell whether a file under consideration might have been tampered with. I also show that limitations of entropy coding sometimes allow us to recover intact files from storage devices, even in the absence of filesystem and container metadata. I first show that it is possible to embed an imperceptible message within a uniform region of a JPEG image such that the message becomes clearly visible when the image is recompressed at a particular quality factor, providing a visual warning that recompression has taken place.