FFV1 Video Codec Specification
Total Page:16
File Type:pdf, Size:1020Kb
Load more
										Recommended publications
									
								- 
												  The Perceptual Impact of Different Quantization Schemes in G.719The perceptual impact of different quantization schemes in G.719 BOTE LIU Master’s Degree Project Stockholm, Sweden May 2013 XR-EE-SIP 2013:001 Abstract In this thesis, three kinds of quantization schemes, Fast Lattice Vector Quantization (FLVQ), Pyramidal Vector Quantization (PVQ) and Scalar Quantization (SQ) are studied in the framework of audio codec G.719. FLVQ is composed of an RE8 -based low-rate lattice vector quantizer and a D8 -based high-rate lattice vector quantizer. PVQ uses pyramidal points in multi-dimensional space and is very suitable for the compression of Laplacian-like sources generated from transform. SQ scheme applies a combination of uniform SQ and entropy coding. Subjective tests of these three versions of audio codecs show that FLVQ and PVQ versions of audio codecs are both better than SQ version for music signals and SQ version of audio codec performs well on speech signals, especially for male speakers. I Acknowledgements I would like to express my sincere gratitude to Ericsson Research, which provides me with such a good thesis work to do. I am indebted to my supervisor, Sebastian Näslund, for sparing time to communicate with me about my work every week and giving me many valuable suggestions. I am also very grateful to Volodya Grancharov and Eric Norvell for their advice and patience as well as consistent encouragement throughout the thesis. My thanks are extended to some other Ericsson researchers for attending the subjective listening evaluation in the thesis. Finally, I want to thank my examiner, Professor Arne Leijon of Royal Institute of Technology (KTH) for reviewing my report very carefully and supporting my work very much.
- 
												  © 2019 Jimi Jones© 2019 Jimi Jones SO MANY STANDARDS, SO LITTLE TIME: A HISTORY AND ANALYSIS OF FOUR DIGITAL VIDEO STANDARDS BY JIMI JONES DISSERTATION Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Library and Information Science in the Graduate College of the University of Illinois at Urbana-Champaign, 2019 Urbana, Illinois Doctoral Committee: Associate Professor Jerome McDonough, Chair Associate Professor Lori Kendall Assistant Professor Peter Darch Professor Howard Besser, New York University ABSTRACT This dissertation focuses on standards for digital video - the social aspects of their design and the sociotechnical forces that drive their development and adoption. This work is a history and analysis of how the MXF, JPEG 2000, FFV1 and Matroska standards have been adopted and/or adapted by libraries and archives of different sizes. Well-funded institutions often have the resources to develop tailor-made specifications for the digitization of their analog video objects. Digital video standards and specifications of this kind are often derived from the needs of the cinema production and television broadcast realms in the United States and may be unsuitable for smaller memory institutions that are resource-poor and/or lack staff with the knowledge to implement these technologies. This research seeks to provide insight into how moving image preservation professionals work with - and sometimes against - broadcast and film production industries in order to produce and/or implement standards governing video formats and encodings. This dissertation describes the transition of four digital video standards from niches to widespread use in libraries and archives. It also examines the effects these standards produce on cultural heritage video preservation by interviewing people who implement the standards as well as people who develop them.
- 
												  Reduced-Complexity End-To-End Variational Autoencoder for on Board Satellite Image Compressionremote sensing Article Reduced-Complexity End-to-End Variational Autoencoder for on Board Satellite Image Compression Vinicius Alves de Oliveira 1,2,* , Marie Chabert 1 , Thomas Oberlin 3 , Charly Poulliat 1, Mickael Bruno 4, Christophe Latry 4, Mikael Carlavan 5, Simon Henrot 5, Frederic Falzon 5 and Roberto Camarero 6 1 IRIT/INP-ENSEEIHT, University of Toulouse, 31071 Toulouse, France; [email protected] (M.C.); [email protected] (C.P.) 2 Telecommunications for Space and Aeronautics (TéSA) Laboratory, 31500 Toulouse, France 3 ISAE-SUPAERO, University of Toulouse, 31055 Toulouse, France; [email protected] 4 CNES, 31400 Toulouse, France; [email protected] (M.B.); [email protected] (C.L.) 5 Thales Alenia Space, 06150 Cannes, France; [email protected] (M.C.); [email protected] (S.H.); [email protected] (F.F.) 6 ESA, 2201 AZ Noordwijk, The Netherlands; [email protected] * Correspondence: [email protected] Abstract: Recently, convolutional neural networks have been successfully applied to lossy image compression. End-to-end optimized autoencoders, possibly variational, are able to dramatically outperform traditional transform coding schemes in terms of rate-distortion trade-off; however, this is at the cost of a higher computational complexity. An intensive training step on huge databases allows autoencoders to learn jointly the image representation and its probability distribution, pos- sibly using a non-parametric density model or a hyperprior auxiliary autoencoder to eliminate the need for prior knowledge. However, in the context of on board satellite compression, time and memory complexities are submitted to strong constraints.
- 
												  (12) Patent Application Publication (10) Pub. No.: US 2016/0248440 A1 Lookup | | | | | | | | | | | | |US 201602484.40A1 (19) United States (12) Patent Application Publication (10) Pub. No.: US 2016/0248440 A1 Greenfield et al. (43) Pub. Date: Aug. 25, 2016 (54) SYSTEMAND METHOD FOR COMPRESSING Publication Classification DATAUSING ASYMMETRIC NUMERAL SYSTEMIS WITH PROBABILITY (51) Int. Cl. DISTRIBUTIONS H03M 7/30 (2006.01) (52) U.S. Cl. (71) Applicants: Daniel Greenfield, Cambridge (GB); CPC ....................................... H03M 730 (2013.01) Alban Rrustemi, Cambridge (GB) (72) Inventors: Daniel Greenfield, Cambridge (GB); (57) ABSTRACT Albanan Rrustemi, Cambridge (GB(GB) A data compression method using the range variant of asym (21) Appl. No.: 15/041,228 metric numeral systems to encode a data stream, where the probability distribution table is constructed using a Markov (22) Filed: Feb. 11, 2016 model. This type of encoding results in output that has higher compression ratios when compared to other compression (30) Foreign Application Priority Data algorithms and it performs particularly well with information that represents gene sequences or information related to gene Feb. 11, 2015 (GB) ................................... 1502286.6 Sequences. 128bit o 500 580 700 7so 4096 20123115 a) 8-entry SIMD lookup Vector minpos (e.g. phminposuw) 's 20 b) 16-entry SMD —- lookup | | | | | | | | | | | | | ||l Vector sub (e.g. psubw) Wector min e.g. pminuw) Vector minpos (e.g. phmirposuw) Patent Application Publication Aug. 25, 2016 Sheet 1 of 2 US 2016/0248440 A1 128bit e (165it o 500 580 700 750 4096 2012 s115 8-entry SIMD Vector sub a) lookup (e.g. pSubw) 770 270 190 70 20 (ufi) (ufi) (uf) Vector minpos (e.g. phminposuw) b) 16-entry SIMD lookup Vector sub Vector sub (e.g.
- 
												  Recommended File Formats for Long-Term Archiving and for Web Dissemination in PhaidraRecommended file formats for long-term archiving and for web dissemination in Phaidra Edited by Gianluca Drago May 2019 License: https://creativecommons.org/licenses/by-nc-sa/4.0/ Premise This document is intended to provide an overview of the file formats to be used depending on two possible destinations of the digital document: long-term archiving uploading to Phaidra and subsequent web dissemination When the document uploaded to Phaidra is also the only saved file, the two destinations end up coinciding, but in general one will probably want to produce two different files, in two different formats, so as to meet the differences in requirements and use in the final destinations. In the following tables, the recommendations for long-term archiving are distinct from those for dissemination in Phaidra. There are no absolute criteria for choosing the file format. The choice is always dependent on different evaluations that the person who is carrying out the archiving will have to make on a case by case basis and will often result in a compromise between the best achievable quality and the limits imposed by the costs of production, processing and storage of files, as well as, for the preceding, by the opportunity of a conversion to a new format. 1 This choice is particularly significant from the perspective of long-term archiving, for which a quality that respects the authenticity and integrity of the original document and a format that guarantees long-term access to data are desirable. This document should be seen more as an aid to the reasoned choice of the person carrying out the archiving than as a list of guidelines to be followed to the letter.
- 
												  Pragmatic Audiovisual Preservationhttp://doi.org/10.7207/twr20-10 Pragmatic Audiovisual Preservation Ashley Blewer DPC Technology Watch Report October 2020 © Digital Preservation Coalition 2020 and Ashley Blewer 2020 ISSN: 2048-7916 DOI: http://doi.org/10.7207/twr20-10 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without prior permission in writing from the publisher. The moral rights of the author have been asserted. First published in Great Britain in 2020 by the Digital Preservation Coalition. Pragmatic Audiovisual Preservation Foreword The Digital Preservation Coalition (DPC) is an advocate and catalyst for digital preservation, ensuring our members can deliver resilient long-term access to digital content and services. It is a not-for- profit membership organization whose primary objective is to raise awareness of the importance of the preservation of digital material and the attendant strategic, cultural and technological issues. It supports its members through knowledge exchange, capacity building, assurance, advocacy and partnership. The DPC’s vision is to make our digital memory accessible tomorrow. The DPC Technology Watch Reports identify, delineate, monitor and address topics that have a major bearing on ensuring our collected digital memory will be for the future. They provide an advanced introduction in order to support those charged with ensuring a robust digital memory, and they are of general interest to a wide and international audience with interests in computing, information management, collections management and technology. The reports are commissioned after consultation among DPC members about shared priorities and challenges; they are commissioned from experts; and they are thoroughly scrutinized by peers before being released.
- 
												  20130218 Technical White Paper.Bw.JpSending, Storing & Sharing Video With latakoo © Copyright latakoo. All rights reserved. Revised 11/12/2012 Table of contents Table of contents ................................................................................................... 1 1. Introduction ...................................................................................................... 2 2. Sending video & files with latakoo ...................................................................... 3 2.1 The latakoo app ............................................................................................ 3 2.2 Compression & upload ................................................................................. 3 2.3 latakoo minutes ............................................................................................ 4 3. The latakoo web interface .................................................................................. 4 3.1 Web interface requirements .......................................................................... 4 3.2 Logging into the dashboard .......................................................................... 4 3.3 Streaming video previews ............................................................................. 4 3.4 Downloading video ...................................................................................... 5 3.5 latakoo Pilot ................................................................................................. 5 3.6 Search and tagging ......................................................................................
- 
												![Asymmetric Numeral Systems. Arxiv:0902.0271V5 [Cs.IT]](https://docslib.b-cdn.net/cover/1387/asymmetric-numeral-systems-arxiv-0902-0271v5-cs-it-1161387.webp)  Asymmetric Numeral Systems. Arxiv:0902.0271V5 [Cs.IT]Asymmetric numeral systems. Jarek Duda Jagiellonian University, Poland, email: [email protected] Abstract In this paper will be presented new approach to entropy coding: family of generalizations of standard numeral systems which are optimal for encoding sequence of equiprobable symbols, into asymmetric numeral systems - optimal for freely chosen probability distributions of symbols. It has some similarities to Range Coding but instead of encoding symbol in choosing a range, we spread these ranges uniformly over the whole interval. This leads to simpler encoder - instead of using two states to define range, we need only one. This approach is very universal - we can obtain from extremely precise encoding (ABS) to extremely fast with possibility to additionally encrypt the data (ANS). This encryption uses the key to initialize random number generator, which is used to calculate the coding tables. Such preinitialized encryption has additional advantage: is resistant to brute force attack - to check a key we have to make whole initialization. There will be also presented application for new approach to error correction: after an error in each step we have chosen probability to observe that something was wrong. We can get near Shannon's limit for any noise level this way with expected linear time of correction. arXiv:0902.0271v5 [cs.IT] 21 May 2009 1 Introduction In practice there are used two approaches for entropy coding nowadays: building binary tree (Huffman coding [1]) and arithmetic/range coding ([2],[3]). The first one approximates probabilities of symbols with powers of 2 - isn't precise. Arithmetic coding is precise. It encodes symbol in choosing one of large ranges of length proportional to assumed probability distribution (q).
- 
												  Ffsake: Further Fluff with Ffmpegffsake: further fluff with ffmpeg Talking Tech, Spring 2017 2/5/17 Ethan Gates What is ffmpeg? ● Leading open-source framework for processing and manipulating audiovisual material ● Aims to decode, encode, transcode, mux, demux, stream, filter and play “anything that humans and machines have created” ● A combination of tools and libraries ○ ffmpeg - media conversion ○ ffprobe - metadata analysis/extraction ○ ffplay - playback ■ libavutil - various utilities (e.g. math equations) ■ libavcodec - codec library ■ libavformat - format (wrapper/container) library ■ libavdevice - device library (drivers for common/open hardware) ■ libavfilter - filter library ■ libswscale - scaling library ■ libswresample - audio sampling library Who’s using it? ● VLC Media Player ● YouTube ● Media Player Classic ● Chromium/Google Chrome ● Handbrake ● Facebook ● Audacity ● DaVinci Resolve ● DCP-o-matic ● Axle But what can it do? ● Make derivative copies ● Re-wrap files ● Edit files ● Generate checksums ● Create test files/streams ● Extract/insert metadata ● Generate informative (or just plain cool) audio/video effects ● Livestream video/screencapture * Thanks for this slide, Andrew Weaver Installation All about that configuration! $ brew info ffmpeg $ brew install ffmpeg --with-sdl2 --with-freetype --with-openjpeg --with-x265 --with-rubberband --with-tesseract Translation: with ffplay (playback), with added text/titles support, with JPEG 2000 codec, with H.265 (HEVC) codec, with extra audio filters, with OCR support Tasks to Cover I. Making a test file/signal
- 
												  Indiana University Media Digitization and Preservation Initiative (MDPI)Indiana University Media Digitization and Preservation Initiative (MDPI) White Paper: Encoding and Wrapper Decisions and Implementation for Video Preservation Master Files Authored by: Mike Casey, Director of Technical Operations, MDPI Reviewed by: Carla Arton, Film Digitization Specialist, IU Libraries Moving Image Archive Jon Cameron, Digital Media Service Manager, IU Libraries Jon Dunn, Assistant Dean for Library Technologies, IU Libraries Heidi Kelly, Digital Preservation Librarian, IU Libraries Brent Moberly (Appendix author), Software Developer, UITS, IU Brian Wheeler, Senior Systems Engineer, IU Libraries Special thanks to Dave Rice for astute comments and suggestions Copyright 2017 Trustees of Indiana University This document is licensed under a Creative Commons Attribution 4.0 International license (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/ Release date: March 27, 2017 Indiana University Media Digitization and Preservation Initiative Encoding and Wrapper Decisions and Implementation for Video Preservation Master Files 1. Overview There is no consensus in the media preservation community on best practice for encoding and wrapping video preservation master files. Institutions engaged in long- term video preservation typically choose from three paths, each of which are currently seen as viable for this purpose: 10-bit, uncompressed, v210 codec, usually with a QuickTime wrapper JPEG 2000, mathematically lossless profile, usually with an MXF wrapper FFV1, a mathematically lossless format, with an AVI or Matroska wrapper No single path can be said to be widely adopted, in part because there are relatively few institutions engaged in digitizing video for long-term preservation, especially at scale. It appears as of this writing that institutions are roughly evenly divided between the three paths listed above.
- 
												  VIDEO DIGITIZATION at the AUSTRIAN MEDIATHEK Chives Obtain a Quality Assurance Instrument for the Recording of Their MediaARTICLE 8LIIQTPS]QIRXSJWYGLERSRPMRIGIVXM½GEXMSRWIVZMGITVSZMHIWFIRI½XWJSVFSXLTEVXMIWEV- VIDEO DIGITIZATION AT THE AUSTRIAN MEDIATHEK chives obtain a quality assurance instrument for the recording of their media. In the past, sound Herman Lewetz, Austian Mediathek9 carrier owners had to implement quality control by means of a cost-intensive strategy: mul- tiple recording of a single sound medium and time-consuming comparative analysis. The avail- In September 2009 the Austrian Mediathek started a project called “Österreich am Wort“. EFMPMX]SJSFNIGXMZIQIEWYVIQIRXVIWYPXWEPPS[WJSVEHVEWXMGEPP]WMQTPM½IHUYEPMX]QSRMXSVMRK Its goal is to digitize and publish via the web about 10,000 full-length recordings within three years. The misfortune for me personally was that in the proposal for this project someone had For companies or organisations dealing with the process of mass migration of analogue media, claimed 2,000 of these to be video recordings. This meant I had to start what we so far suc- long-term evaluation is an important tool for supporting the continuous internal improve- cessfully had postponed: Video digitization. ment process. For example, slow deterioration over time can be made visible. Besides the cost savings due to precise situation assessment, there is another advantage for service providers: Requirements [LIRYWMRKTPE]FEGOHIZMGIWSJHMJJIVIRXUYEPMX]MXQE]LEZIFIIRHMJ½GYPXXSEVKYIXLILMKLIV costs for recording with high-quality machines to their clients. With the help of the automatic, %WE½VWXWXIT[ISYXPMRIHXLIVIUYMVIQIRXW[IXLSYKLXMQTSVXERXJSVEHMKMXM^EXMSRWGLIQE reference-based analysis, quality differences between average and high-class playback devices that best supported long-term preservation. Unlike audio digitization there is still no widely become easily measurable. By documenting the measurement results, the client can easily see accepted archive format for video.
- 
												  Input Formats & CodecsInput Formats & Codecs Pivotshare offers upload support to over 99.9% of codecs and container formats. Please note that video container formats are independent codec support. Input Video Container Formats (Independent of codec) 3GP/3GP2 ASF (Windows Media) AVI DNxHD (SMPTE VC-3) DV video Flash Video Matroska MOV (Quicktime) MP4 MPEG-2 TS, MPEG-2 PS, MPEG-1 Ogg PCM VOB (Video Object) WebM Many more... Unsupported Video Codecs Apple Intermediate ProRes 4444 (ProRes 422 Supported) HDV 720p60 Go2Meeting3 (G2M3) Go2Meeting4 (G2M4) ER AAC LD (Error Resiliant, Low-Delay variant of AAC) REDCODE Supported Video Codecs 3ivx 4X Movie Alaris VideoGramPiX Alparysoft lossless codec American Laser Games MM Video AMV Video Apple QuickDraw ASUS V1 ASUS V2 ATI VCR-2 ATI VCR1 Auravision AURA Auravision Aura 2 Autodesk Animator Flic video Autodesk RLE Avid Meridien Uncompressed AVImszh AVIzlib AVS (Audio Video Standard) video Beam Software VB Bethesda VID video Bink video Blackmagic 10-bit Broadway MPEG Capture Codec Brooktree 411 codec Brute Force & Ignorance CamStudio Camtasia Screen Codec Canopus HQ Codec Canopus Lossless Codec CD Graphics video Chinese AVS video (AVS1-P2, JiZhun profile) Cinepak Cirrus Logic AccuPak Creative Labs Video Blaster Webcam Creative YUV (CYUV) Delphine Software International CIN video Deluxe Paint Animation DivX ;-) (MPEG-4) DNxHD (VC3) DV (Digital Video) Feeble Files/ScummVM DXA FFmpeg video codec #1 Flash Screen Video Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Forward Uncompressed Video Codec fox motion video FRAPS: