1 Introduction

Total Page:16

File Type:pdf, Size:1020Kb

1 Introduction SHORTEN Simple lo s sle s s andne arlo s sle s s waveform compre s s ion Tony Robinson Technical rep ort CUEDFINFENGTR Cambr idge Univers ityEngineer ing Department Trumpington Street Cambr idge CB PZ UK Decemb er Ab stract Thi s rep ort de scr ib e s a program that p erforms compre ss ion of waveform le s such as audio data A s imple pre dictivemodel of thewaveform i s us e d followed by Human co dingofthe pre diction re s iduals Thi s i s b oth f ast andnear optimal for many com monly o ccur ingwaveform s ignals Thi s f rameworkisthen extended to lossy co ding under the conditions of maximisingthe s egmental s ignal to noi s e ratio on a p er f rame bas i s and co dingto a xe d acceptable s ignal to noi s e ratio Intro duction It i s common tostore digitised waveforms on computers andtheresulting le s can often consume s ignicantamountsofstorage space General compre s s ion algor ithms do not p erform very well on the s e le s as they f ail totakeinto accountthestructure of thedataand thenature of the s ignal contained there in Typically a waveform le will cons i st of s igned bit numb ers andthere will b e s ignicantsample to sample correlation A compre s s ion utility for the s e le must b e re asonably f ast p ortable accept datainamostpopular formats and give s ignicant compre s s ion Thi s rep ort de scr ib e s shorten a program for the UNIX and DOS environmentswhich aims tomeet the s e requirements A s ignicantapplication of thi s program i s tothe problem of compre s s ion of sp eech le s for di str ibution on CDROM Thi s rep ort starts withade scr iption of thi s domain then di scus s e s thetwomain problems as so ciate d with general waveform compre s s ion namely pre dictivemodellingandresidual co ding Thi s f rameworkisthen extended to lossy coding Finallytheshorten implementation i s de scr ib e d andanappendix details the command line options Compre s s ion for sp eech corp ora One imp ortant use for lossless waveform compression is to compre s s sp eech corp ora for di str ibution on CDROM Stateofthe art sp eech recognition systems require gigabytes of acoustic data for mo del e stimation whichtakes many CDROMs tostore Us e of compre s s ion software b othreduce s the di str ibution co st andthenumber of CDROM change s require d toreadthecompletedata s et Thekey f actors in thede s ign of compre s s ion software for sp eech corp ora are thatthere must b e no p erceptual degradation in the sp eech s ignal andthatthedecompre s s ion routine must b e f ast and p ortable There has b een much researchinto ecient sp eechcodingtechnique s andmanystand ards have b een e stabli shed However mo st of thi s workhas b een for telephonyapplications where de dicated hardware can used to p erform the co dingandwhere it i s imp ortantthat the re sulting system operates atawell dene d bit rate In suchapplications lo s sy co ding i s acceptable andindee d nece s sary order to guarantee thatthesystem op erates atthe xe d bit rate Similarly there has b een muchworkinde s ign of general purpose lossless compre s sors for workstation us e Suchsystems do not guarantee any compre s s ion for an arbitrary le but in general achieveworthwhile compre s s ion in re asonable timeongeneral purpose computers Sp eech corp ora compre s s ion nee ds somefeature s of b oth systems Lo s sle s s compre s s ion i s an advantage as it guarantee s there i s no p erceptual degradation in the sp eech s ignal However theestabli she d compre s s ion utilitie s do not exploit theknown structure of the sp eech s ignal Hence shorten was wr itten to ll thi s gap andisnow in us e in the di str ibution of CDROMs containing sp eechdatabas e s The recordings us e d as example s in s ection and s ection are f rom the TIMIT corpus which i s di str ibute d as bit kHz line ar PCM sample s Thi s format i s in common us e d for continuous sp eech recognition research corp ora The recordings were collecte d us inga Sennheiser HMD noisecancellinghe admounte d microphoneinlow noise conditions All ten utterance s f rom sp e aker fcjf are used which amounttoatotal of s econds or about sample s Waveform Mo deling Compression is achieved by building a pre dictivemodel of thewaveform a go o d intro duc tion for sp eech i s JayantandNoll An e stabli shed model for a widevar ietyofwaveforms is thatofanautoregre s s ivemodel also known as line ar pre dictive co ding LPC Here the pre dicted waveform i s a line ar combination of past sample s p X st a st i i i Thecode d s ignal et i s the dierence b etween theestimateoftheline ar pre dictors t andthe sp eech s ignal st et st st However manywaveforms of intere st are not stationarythatisthebestvalue s for the co ecientsofthe pre dictor a vary f rom one s ection of thewaveform toanother It i s i often re asonable toassumethatthe s ignal i s p s eudostationary ie there exi stsa timespan over which re asonable value s for the line ar pre dictor can b e found Thus thethree main stage s in the co ding pro ce s s are blo cking pre dictivemodelling andresidual co ding Blo cking Thetime f rameover whichsample s are blo cked dep ends to some extentonthenature of the s ignal It i s inecienttoblo ckontoo short a time scale as thi s incurs an overhead in the computation and transmi s s ion of the pre diction parameters It i s also inecienttouse atime scale over whichthesignal characteristics change appreciably as thi s will re sultin a p o orer mo del of the s ignal However in the implementation described belowthe linear pre dictor parameters typically takemuch le s s information to transmit than the residual s ignal so thechoice of window length i s not cr itical Thedef aultvalue in theshorten implementation i s which re sults in ms f rame s for a s ignal sample d at kHz Sample interle aved signals are handelle d by tre atingeachdata stre am as indep endent Even in cas e s where thereisaknown correlation b etween the stre ams suchasinstereo audio the withinchannel correlations are often s ignicantly gre ater than the cro s schannel correlations so for lo s sle s s or ne arlo s sle s s co dingthe exploitation of thi s additional correl ation only re sultsinsmall additional gains A rectangular window i s us e d in preference toanytap er ingwindowasthe aim i s to mo del just those sample s within theblo ck not the sp ectral characteristics of the s egment surroundingtheblo ck The windowlength i s longer than theblo cksizebythe pre diction order whichistypically three sample s Line ar Pre diction Shorten supp ortstwo forms of line ar pre diction thestandard pth order LPC analys i s of equation andarestricte d form wherebythe co ecients are s elected from one of four xe d p olynomial pre dictors In thecaseofthegeneral LPC algor ithm the pre diction co ecients a are quantised in i accordance withthe same Laplacian di str ibution us e d for theresidual s ignal andde scr ib e d in s ection The exp ected numb er of bits p er co ecient i s as thi s was foundtobe a go o d tradeo b etween mo delling accuracy andmodel storage Thestandard Durbins algor ithm for computingthe LPC co ecients f rom theauto correlation co ecients is used in a incremental wayOneachiteration theme an square d value of the pre diction re s idual i s calculated andthi s i s us e d to computethe exp ected number of bitsnee ded tocode the residual s ignal Thi s i s added tothenumb er of bitsnee ded tocodethe pre diction co ecientsandthe LPC order i s s elected to minimis e thetotal As the computation of theauto correlation co ecientsisthe mo st exp ens ivestep in this process the s e arch for theoptimal mo del order i s terminated when the last twomodels have re sulte d in a higher bit rate Whilst it i s possible to construct s ignals thatdefe atthis search pro ce dure in practice for sp eechsignals it has b een foundthatthe o ccas ional us e of a lower pre diction order re sults in an ins ignicant incre as e in the bit rateandhas the additional s ide eect of requir ing le s s computetodeco de A re str ictiveformofthe line ar pre dictor has b een foundto b e us eful In thi s cas e the pre diction co ecients are tho s e sp ecie d byttinga p order p olynomial tothe last p data p oints eg a linetothe last two p oints s t s t st s t st st s t st st st Writing e tasthe error s ignal f rom the ithpolynomial pre dictor i e t st e t e t e t e t e t e t e t e t e t As can b e s een f rom equations there i s an ecient recurs ivealgorithm for comput ingthe s et of p olynomial pre diction re s iduals Eachresidual term i s forme d f rom the dierence of the previous order pre dictors As e achterm involve s only a few integer addi tionssubtractions it i s p o s s ible to compute all pre dictors and s elect the b e st Moreover as thesumofabsolutevalue s i s line arly related tothevar iance thi s may be used as the bas i s of pre dictor s election andsothewhole process is cheap to computeasitinvolves no multiplications Figure shows b oth forms of pre diction for a range of maximum pre dictor orders The gure shows that rst and s econdorder pre diction provides a substantial incre as e in compre s s ion andthathigher order pre dictors provide relatively little improvement The gure also shows that for thi s example mo st of thetotal compre s s ion can b e obtaine d us ing no pre diction that i s a zerothorder co der achieved about compre s s ion andthebest pre dictor Hence for lo s sle s s compre s s ion it i s imp ortant not towastetoo much computeonthe pre dictor andtoto p erform the residual co ding eciently
Recommended publications
  • Lossless Audio Codec Comparison
    Contents Introduction 3 1 CD-audio test 4 1.1 CD's used . .4 1.2 Results all CD's together . .4 1.3 Interesting quirks . .7 1.3.1 Mono encoded as stereo (Dan Browns Angels and Demons) . .7 1.3.2 Compressibility . .9 1.4 Convergence of the results . 10 2 High-resolution audio 13 2.1 Nine Inch Nails' The Slip . 13 2.2 Howard Shore's soundtrack for The Lord of the Rings: The Return of the King . 16 2.3 Wasted bits . 18 3 Multichannel audio 20 3.1 Howard Shore's soundtrack for The Lord of the Rings: The Return of the King . 20 A Motivation for choosing these CDs 23 B Test setup 27 B.1 Scripting and graphing . 27 B.2 Codecs and parameters used . 27 B.3 MD5 checksumming . 28 C Revision history 30 Bibliography 31 2 Introduction While testing the efficiency of lossy codecs can be quite cumbersome (as results differ for each person), comparing lossless codecs is much easier. As the last well documented and comprehensive test available on the internet has been a few years ago, I thought it would be a good idea to update. Beside comparing with CD-audio (which is often done to assess codec performance) and spitting out a grand total, this comparison also looks at extremes that occurred during the test and takes a look at 'high-resolution audio' and multichannel/surround audio. While the comparison was made to update the comparison-page on the FLAC website, it aims to be fair and unbiased.
    [Show full text]
  • Ardour Export Redesign
    Ardour Export Redesign Thorsten Wilms [email protected] Revision 2 2007-07-17 Table of Contents 1 Introduction 4 4.5 Endianness 8 2 Insights From a Survey 4 4.6 Channel Count 8 2.1 Export When? 4 4.7 Mapping Channels 8 2.2 Channel Count 4 4.8 CD Marker Files 9 2.3 Requested File Types 5 4.9 Trimming 9 2.4 Sample Formats and Rates in Use 5 4.10 Filename Conflicts 9 2.5 Wish List 5 4.11 Peaks 10 2.5.1 More than one format at once 5 4.12 Blocking JACK 10 2.5.2 Files per Track / Bus 5 4.13 Does it have to be a dialog? 10 2.5.3 Optionally store timestamps 5 5 Track Export 11 2.6 General Problems 6 6 MIDI 12 3 Feature Requests 6 7 Steps After Exporting 12 3.1 Multichannel 6 7.1 Normalize 12 3.2 Individual Files 6 7.2 Trim silence 13 3.3 Realtime Export 6 7.3 Encode 13 3.4 Range ad File Export History 7 7.4 Tag 13 3.5 Running a Script 7 7.5 Upload 13 3.6 Export Markers as Text 7 7.6 Burn CD / DVD 13 4 The Current Dialog 7 7.7 Backup / Archiving 14 4.1 Time Span Selection 7 7.8 Authoring 14 4.2 Ranges 7 8 Container Formats 14 4.3 File vs Directory Selection 8 8.1 libsndfile, currently offered for Export 14 4.4 Container Types 8 8.2 libsndfile, also interesting 14 8.3 libsndfile, rather exotic 15 12 Specification 18 8.4 Interesting 15 12.1 Core 18 8.4.1 BWF – Broadcast Wave Format 15 12.2 Layout 18 8.4.2 Matroska 15 12.3 Presets 18 8.5 Problematic 15 12.4 Speed 18 8.6 Not of further interest 15 12.5 Time span 19 8.7 Check (Todo) 15 12.6 CD Marker Files 19 9 Encodings 16 12.7 Mapping 19 9.1 Libsndfile supported 16 12.8 Processing 19 9.2 Interesting 16 12.9 Container and Encodings 19 9.3 Problematic 16 12.10 Target Folder 20 9.4 Not of further interest 16 12.11 Filenames 20 10 Container / Encoding Combinations 17 12.12 Multiplication 20 11 Elements 17 12.13 Left out 21 11.1 Input 17 13 Credits 21 11.2 Output 17 14 Todo 22 1 Introduction 4 1 Introduction 2 Insights From a Survey The basic purpose of Ardour's export functionality is I conducted a quick survey on the Linux Audio Users to create mixdowns of multitrack arrangements.
    [Show full text]
  • Lossless Audio Codec Comparison
    Contents Introduction 3 1 Test setup 4 1.1 Scripting and graphing . .4 1.2 Codecs and parameters used . .5 1.3 WMA, RealAudio and ALAC . .6 2 CD-audio test 8 2.1 CD's used . .8 2.2 Results all CD's together . .9 2.3 Interesting quirks . 12 2.3.1 Mono encoded as stereo (Dan Browns Angels and Demons) 12 2.4 Convergence of the results . 15 3 High-resolution audio 17 3.1 Nine Inch Nails' The Slip . 17 3.2 Howard Shore's soundtrack for The Lord of the Rings: The Re- turn of the King . 20 3.3 Wasted bits . 22 4 Multichannel audio 24 4.1 Howard Shore's soundtrack for The Lord of the Rings: The Re- turn of the King . 24 A Motivation for choosing these CDs 27 Bibliography 31 2 Introduction While testing the efficiency of lossy codecs can be quite cumbersome (as results differ for each person), comparing lossless codecs is much easier. As the last well documented and comprehensive test available on the internet has been a few years ago, I thought it would be a good idea to update. Beside comparing with CD-audio (which is often done to assess codec perfor- mance) and spitting out a grand total, this comparison also looks at extremes that occurred during the test and takes a look at 'high-resolution audio' and multichannel/surround audio. While the comparison was made to update the comparison-page on the FLAC website, it aims to be fair and unbiased. Because of this, you'll probably won't find anything that looks like conclusions: test results are displayed and analysed, but there is no judgement or choice made.
    [Show full text]
  • (A/V Codecs) REDCODE RAW (.R3D) ARRIRAW
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • Name Synopsis Description
    SHNTOOL(1) local SHNTOOL(1) NAME shntool − a multi-purpose WAV Edata processing and reporting utility SYNOPSIS shntool mode ... shntool [CORE OPTION] DESCRIPTION shntool is a command-line utility to viewand/or modify WAV Edata and properties. It runs in several dif- ferent operating modes, and supports various lossless audio formats. shntool is comprised of three parts - its core, mode modules, and format modules. This helps to makethe code easier to maintain, as well as aid other programmers in developing newfunctionality.The distribution archive contains a file named ’modules.howto’ that describes howtocreate a newmode or format module, for those so inclined. Mode modules shntool performs various functions on WAV Edata through the use of mode modules. The core of shntool is simply a wrapper around the mode modules. In fact, when shntool is run with a valid mode as its first argument, it essentially runs the main procedure for the specified mode, and quits. shntool comes with sev- eral built-in modes, described below: len Displays length, size and properties of PCM WAV Edata fix Fixes sector-boundary problems with CD-quality PCM WAV Edata hash Computes the MD5 or SHA1 fingerprint of PCM WAV Edata pad Pads CD(hyquality files not aligned on sector boundaries with silence join Joins PCM WAV Edata from multiple files into one split Splits PCM WAV Edata from one file into multiple files cat Writes PCM WAV Edata from one or more files to the terminal cmp Compares PCM WAV Edata in twofiles cue Generates a CUE sheet or split points from a set of files conv Converts files from one format to another info Displays detailed information about PCM WAV Edata strip Strips extra RIFF chunks and/or writes canonical headers gen Generates CD-quality PCM WAV Edata files containing silence trim Trims PCM WAV Esilence from the ends of files Formore information on the meaning of the various command-line options for each mode, see the MODE- SPECIFIC OPTIONS section below.
    [Show full text]
  • Lossless Compression of Audio Data
    CHAPTER 12 Lossless Compression of Audio Data ROBERT C. MAHER OVERVIEW Lossless data compression of digital audio signals is useful when it is necessary to minimize the storage space or transmission bandwidth of audio data while still maintaining archival quality. Available techniques for lossless audio compression, or lossless audio packing, generally employ an adaptive waveform predictor with a variable-rate entropy coding of the residual, such as Huffman or Golomb-Rice coding. The amount of data compression can vary considerably from one audio waveform to another, but ratios of less than 3 are typical. Several freeware, shareware, and proprietary commercial lossless audio packing programs are available. 12.1 INTRODUCTION The Internet is increasingly being used as a means to deliver audio content to end-users for en­ tertainment, education, and commerce. It is clearly advantageous to minimize the time required to download an audio data file and the storage capacity required to hold it. Moreover, the expec­ tations of end-users with regard to signal quality, number of audio channels, meta-data such as song lyrics, and similar additional features provide incentives to compress the audio data. 12.1.1 Background In the past decade there have been significant breakthroughs in audio data compression using lossy perceptual coding [1]. These techniques lower the bit rate required to represent the signal by establishing perceptual error criteria, meaning that a model of human hearing perception is Copyright 2003. Elsevier Science (USA). 255 AU rights reserved. 256 PART III / APPLICATIONS used to guide the elimination of excess bits that can be either reconstructed (redundancy in the signal) orignored (inaudible components in the signal).
    [Show full text]
  • Codec Is a Portmanteau of Either
    What is a Codec? Codec is a portmanteau of either "Compressor-Decompressor" or "Coder-Decoder," which describes a device or program capable of performing transformations on a data stream or signal. Codecs encode a stream or signal for transmission, storage or encryption and decode it for viewing or editing. Codecs are often used in videoconferencing and streaming media solutions. A video codec converts analog video signals from a video camera into digital signals for transmission. It then converts the digital signals back to analog for display. An audio codec converts analog audio signals from a microphone into digital signals for transmission. It then converts the digital signals back to analog for playing. The raw encoded form of audio and video data is often called essence, to distinguish it from the metadata information that together make up the information content of the stream and any "wrapper" data that is then added to aid access to or improve the robustness of the stream. Most codecs are lossy, in order to get a reasonably small file size. There are lossless codecs as well, but for most purposes the almost imperceptible increase in quality is not worth the considerable increase in data size. The main exception is if the data will undergo more processing in the future, in which case the repeated lossy encoding would damage the eventual quality too much. Many multimedia data streams need to contain both audio and video data, and often some form of metadata that permits synchronization of the audio and video. Each of these three streams may be handled by different programs, processes, or hardware; but for the multimedia data stream to be useful in stored or transmitted form, they must be encapsulated together in a container format.
    [Show full text]
  • How to Play Itunes Purchased and Rental Movies with XBMC
    How to Play iTunes Purchased and Rental Movies with XBMC What are XBMC Player Video Formats? XBMC is an open source media player software developed by XBMC team. With XBMC media player, you can view and watch any videos, music, podcasts on your local computer or from internet. XBMC is developed for Mac, Windows, iOS, Android platform now. So almost all of us can use this powerful media player app without obstacles. XBMC for Mac can be compatible with Mac OS X tiger or later. It supports playing 1080p video on Mac computer via software decoding on the CPU if it is powerful enough. And XBMC for Windows is compatible with Windows 7, Vista and XP. Even though it can run well on 64-bit machine, it is not yet optimized for that architecture so there is no performance gain when running on 64-bit Windows. Let's learn what formats does XBMC support at first. Video formats supported by XBMC: MPEG-1, MPEG-2, H.263, MPEG-4 SP and ASP, MPEG-4 AVC (H.264), HuffYUV, Indeo, MJPEG, RealVideo, RMVB, Sorenson, WMV, Cinepak. Audio formats supported by XBMC: MIDI, AIFF, WAV/WAVE, AIFF, MP2, MP3, AAC, AACplus (AAC+), Vorbis, AC3, DTS, ALAC, AMR, FLAC, Monkey's Audio (APE), RealAudio, SHN, WavPack, MPC/Musepack/Mpeg+, Shorten, Speex, WMA, IT, S3M, MOD (Amiga Module), XM, NSF (NES Sound Format), SPC (SNES), GYM (Genesis), SID (Commodore 64), Adlib, YM (Atari ST), ADPCM (Nintendo GameCube), and CD-DA. Can XBMC Play iTunes Downloaded Videos? The current software limitation on XBMC is that it can't play any DRM-protected music and videos, like audio files purchased from online music stores as iTunes Music Store, MSN Music, Audible.com, Windows Media Player Stores, and video files protected with Windows Media DRM, Fairplay DRM or DivX proprietary DRM.
    [Show full text]
  • File Format Guidelines for Management and Long-Term Retention of Electronic Records
    FILE FORMAT GUIDELINES FOR MANAGEMENT AND LONG-TERM RETENTION OF ELECTRONIC RECORDS 9/10/2012 State Archives of North Carolina File Format Guidelines for Management and Long-Term Retention of Electronic records Table of Contents 1. GUIDELINES AND RECOMMENDATIONS .................................................................................. 3 2. DESCRIPTION OF FORMATS RECOMMENDED FOR LONG-TERM RETENTION ......................... 7 2.1 Word Processing Documents ...................................................................................................................... 7 2.1.1 PDF/A-1a (.pdf) (ISO 19005-1 compliant PDF/A) ........................................................................ 7 2.1.2 OpenDocument Text (.odt) ................................................................................................................... 3 2.1.3 Special Note on Google Docs™ .......................................................................................................... 4 2.2 Plain Text Documents ................................................................................................................................... 5 2.2.1 Plain Text (.txt) US-ASCII or UTF-8 encoding ................................................................................... 6 2.2.2 Comma-separated file (.csv) US-ASCII or UTF-8 encoding ........................................................... 7 2.2.3 Tab-delimited file (.txt) US-ASCII or UTF-8 encoding .................................................................... 8 2.3
    [Show full text]
  • Input Formats & Codecs
    Input Formats & Codecs Pivotshare offers upload support to over 99.9% of codecs and container formats. Please note that video container formats are independent codec support. Input Video Container Formats (Independent of codec) 3GP/3GP2 ASF (Windows Media) AVI DNxHD (SMPTE VC-3) DV video Flash Video Matroska MOV (Quicktime) MP4 MPEG-2 TS, MPEG-2 PS, MPEG-1 Ogg PCM VOB (Video Object) WebM Many more... Unsupported Video Codecs Apple Intermediate ProRes 4444 (ProRes 422 Supported) HDV 720p60 Go2Meeting3 (G2M3) Go2Meeting4 (G2M4) ER AAC LD (Error Resiliant, Low-Delay variant of AAC) REDCODE Supported Video Codecs 3ivx 4X Movie Alaris VideoGramPiX Alparysoft lossless codec American Laser Games MM Video AMV Video Apple QuickDraw ASUS V1 ASUS V2 ATI VCR-2 ATI VCR1 Auravision AURA Auravision Aura 2 Autodesk Animator Flic video Autodesk RLE Avid Meridien Uncompressed AVImszh AVIzlib AVS (Audio Video Standard) video Beam Software VB Bethesda VID video Bink video Blackmagic 10-bit Broadway MPEG Capture Codec Brooktree 411 codec Brute Force & Ignorance CamStudio Camtasia Screen Codec Canopus HQ Codec Canopus Lossless Codec CD Graphics video Chinese AVS video (AVS1-P2, JiZhun profile) Cinepak Cirrus Logic AccuPak Creative Labs Video Blaster Webcam Creative YUV (CYUV) Delphine Software International CIN video Deluxe Paint Animation DivX ;-) (MPEG-4) DNxHD (VC3) DV (Digital Video) Feeble Files/ScummVM DXA FFmpeg video codec #1 Flash Screen Video Flash Video (FLV) / Sorenson Spark / Sorenson H.263 Forward Uncompressed Video Codec fox motion video FRAPS:
    [Show full text]
  • Low Complexity Lossless Compression of Underwater Sound Recordings
    Low complexity lossless compression of underwater sound recordings Mark Johnsona) Scottish Oceans Institute, University of St. Andrews, Fife KY16 8LB, United Kingdom Jim Partan and Tom Hurst Woods Hole Oceanographic Institution, Woods Hole, Massachusetts 02543 (Received 1 February 2012; revised 1 October 2012; accepted 31 December 2012) Autonomous listening devices are increasingly used to study vocal aquatic animals, and there is a constant need to record longer or with greater bandwidth, requiring efficient use of memory and battery power. Real-time compression of sound has the potential to extend recording durations and bandwidths at the expense of increased processing operations and therefore power consumption. Whereas lossy methods such as MP3 introduce undesirable artifacts, lossless compression algo- rithms (e.g., FLAC) guarantee exact data recovery. But these algorithms are relatively complex due to the wide variety of signals they are designed to compress. A simpler lossless algorithm is shown here to provide compression factors of three or more for underwater sound recordings over a range of noise environments. The compressor was evaluated using samples from drifting and animal- borne sound recorders with sampling rates of 16–240 kHz. It achieves >87% of the compression of more-complex methods but requires about 1/10 of the processing operations resulting in less than 1 mW power consumption at a sampling rate of 192 kHz on a low-power microprocessor. The poten- tial to triple recording duration with a minor increase in power consumption and no loss in sound quality may be especially valuable for battery-limited tags and robotic vehicles. VC 2013 Acoustical Society of America.
    [Show full text]
  • Name Description Synopsis Options
    xcfa_cli(1) Manual: 0.0.6 xcfa_cli(1) NAME xcfa_cli −This program is an implementation of xcfaincommand line. DESCRIPTION xcfa_cli is an application for conversion, normalization, reconfiguring wav files and cut audio files ... What xcfa_cli can do: -replaygain on files: flac, mp3, ogg, wavpack -conversions: -from files: wav, flac, ape, wavpack, ogg, m4a, mpc, mp3, wma, shorten, rm, dts, aif, ac3 -tofiles: wav, flac, ape, wavpack, ogg, m4a, mpc, mp3, aac -conversion settings for file management: flac, ape, wavpack, ogg, m4a, aac, mpc, mp3 -management tags -management cue wav file -manipulation of the frequency, track and bit wav files -standardization on files: wav,mp3, ogg -cuts (split) wav files -displaying information on files SYNOPSIS xcfa_cli [ −i "file.*" ][ −d wav,mpc,... ][ OPTIONS ] OPTIONS −−verbose Verbose mode −h −−help Print help mode and quit −i <"file.type"> −−input <"file.type"> Input name file to convert in inverted commas: −−input "*.flac" Type input files: wav,flac, ape, wavpack, ogg, m4a, mpc, mp3, wma, shorten, rm, dts, aif, ac3 −o <path_dest/> −−output <path_dest/> Destination folder.Bydefault in the source file folder. −d <wav,flac,ape,...> −−dest <wav,flac,ape,...> Destination file: wav,flac, ape, wavpack, ogg, m4a, mpc, mp3, aac −r −−recursion Recursive search −e −−ext2src Extract in the source folder.This option is useful with ’−−recursion’ −−nice <priority> Change the priority of running processes in the interval: 0 .. 20 Management options with default parameters: −−op_flac <"−5"> −−op_ape <"c2000"> −−op_wavpack <"−y −j1"> 0.0.6 Thu, 06
    [Show full text]