FRAME RATE PREFERENCES in LOW BIT RATE VIDEO Gayatri

Total Page:16

File Type:pdf, Size:1020Kb

FRAME RATE PREFERENCES in LOW BIT RATE VIDEO Gayatri FRAME RATE PREFERENCES IN LOW BIT RATE VIDEO Gayatri Yadavalli, Mark Masry and Sheila S. Hemami Cornell University, School of Electrical and Computer Engineering, Ithaca, NY email:[email protected], {masry, hemami}@ece.cornell.edu ABSTRACT The paper is organized as follows: the frame rate subjective preference test is described in Section 2. In Section 3, a statistical A double stimulus subjective evaluation was performed to deter- analysis is used to determine viewer preferences across subgroups. mine preferred frame rates at a fixed bit rate for low bit rate video. Section 4 concludes the paper. Stimuli consisted of eight reference color video sequences of size 352 × 240 pixels. These were compressed at rates of 100, 200 2. TEST DESCRIPTION and 300 kbps for low, medium, and high motion sequences, re- spectively, using three encoders and frame rates of 10, 15 and 30 This section presents the design of the frame rate preference eval- frames per second. Twenty-two viewers ranked their frame rate uation. It includes a summary of content chosen, followed by a preferences using an adjectival categorical scale. Their preferences discussion of the coding conditions and a description of the test were analyzed across sequence content, motion type, and encoder. environment. Viewers preferred a frame rate of 15 frames per second across all categories, with several notable content-based exceptions. 2.1. Video Sequences 1. INTRODUCTION To determine the effects of frame rate on different content types, eight reference sequences of approximately eight seconds each were chosen specifically to represent a variety of streaming video As the demand for streaming video rises among Internet users, 352 × 240 providers are faced with new challenges to maximize video qual- content and resized to pixels. All sequences were dig- ity under fixed bandwidth constraints. At a given bit rate, it is itized using the 4:2:2 YUV color space. The reference sequences not known how the frame rate affects the perceived quality of the are described here: video. The majority of digital video available today is coded at 30 frames per second (fps). A lower frame rate improves the qual- Low motion: ity of individual frames at the expense of the smooth motion of • videoconference - a head-and-shoulders view of a woman video coded at high frame rates. Example frames from one of the talking to a stationary camera. video sequences in the Video Quality Experts Group (VQEG) [1] • news - news footage consisting of one scene with a station- database coded at 10 and 30 fps are shown in Figure 1. Given ary camera. this tradeoff, it is useful for video providers to have a set of rules governing the selection of an optimal frame rate for a particular Medium motion: content type, given a fixed bit rate. • crowd - a moving crowd with a high level of detail and some This paper presents the results of a subjective evaluation of camera motion. viewer-preferred frame rates for a variety of low bit rate video • martial arts - a fight scene involving several people and fast content. Low, medium, and high motion sequences were selected; motion. the motion content of each sequence determined the bit rate used, • ranging from 100 to 300 kbps. The reference sequences were se- airport - several people walking through a moving crowd lected from a wide assortment of film and television programming with high camera motion. to span the range of available content types. In order to test frame • animation - several scenes of computer animated characters rate preferences, each reference sequence was compressed with a with some camera motion. given encoder at three different frame rates: 10, 15, and 30 fps. High motion: The resulting set of three compressed video sequences was pre- sented and evaluated together. Since the bitrate was fixed for each • sports - a panning shot of a football game with a large num- reference sequence and each of the eight reference sequences was ber of moving players on a stationary background. processed with three encoders, there were 24 such test sets, all of • car chase - a very high-speed car chase scene with many which were evaluated by each viewer. The evaluation was per- cuts and extremely high motion. formed using the double-stimulus, five-grade adjectival, categor- ical ITU-R quality scale of ITU-R BT.500-11 [2]. A rank-based 2.2. Coding Conditions analysis of variance was performed to analyze the significance of viewer preferences in terms of sequence content, motion level, and Each of the sequences was encoded in color using three differ- encoder. Fifteen fps was generally preferred across categories. ent motion-compensated video coding algorithms: the Sorenson Specific exceptions are noted and discussed. Professional video coder version 2.1 [3], the University of British Columbia’s H.263+ coder version 3.0 [4], and a wavelet-based rate-distortion optimized coder developed at Cornell University [5]. The three coders implement vector quantization, the Discrete Cosine Transform (DCT), and the wavelet transform, respectively, and are referred to as VQ, H.263+ and Wavelet throughout this paper. Different bit rates were chosen for each motion category. The low motion sequences were encoded at 100 Kbps, the medium mo- tion sequences at 200 Kbps, and the high motion at 300 Kbps in order to maintain a similar level of quality across the categories. The encoded sequences exhibited a range of blocking and blur- ring artifacts typical of coded video sequences due to the low bit rates used. These artifacts differed across coders. Blockiness oc- curs when quantization causes the appearance of distinct edges be- tween adjacent blocks. Blurriness is the loss of high frequency de- tail. Sequences coded with the H.263+ and VQ coders exhibited a higher degree of blocking, while those coded with the Wavelet coder showed greater blurriness. The Wavelet coder also exhibited a pulsing effect that was most visible on low motion sequences. (a) H.263+ was the only coder that dropped frames in order to meet bit rate targets. 2.3. Test Environment The test environment was designed to simulate standard viewing conditions as nearly as possible for low bit rate video. Room light- ing was fixed at approximately 230 lux. The video sequences were displayed on a 21” Nokia Multigraph 445XPro monitor at a display resolution of 1024 × 768. Viewing distance was fixed at 6 picture heights. Monitor gamma was 2.3. Maximum and minimum lumi- nances were measured at 98.9 and 1.5 cd/m2, respectively. The test setup consisted of a single screen with two display areas. Each of the twenty-four test sets of sequences with 10, 15 and 30 fps was presented to every viewer in random order to re- move contextual effects. For each of these sets, subjects were first shown the broadcast quality reference video coded at 4 Mbps and 30 fps in the left display area. This video remained available to replay at any time during the test. Subjects then viewed each of (b) the three possible frame rate sequences in the right area. Three buttons labeled A, B, and C located just below the right display Fig. 1. Example sections of frames compressed at 275 kbps using area allowed users to replay the test sequences as desired. Each H.263+ and frame rates of (a) 30 fps and (b) 10 fps of the three encodings in a test set were assigned to these buttons in a pseudorandom order to eliminate the possibility of viewer ac- climatization to a button/fps combination. Viewers were able to examine all three video sequences before rating them. in terms of sequence content, motion category, and encoder to de- Viewers used a five-position slider to rate each of the three se- termine the effects of each on viewer preference. quences in each test set on a five-grade categorical scale; ratings were then converted to ordinal rankings for the purposes of anal- ysis. Ties were allowed in order account for lack of preference. 3.1. Statistical Analysis The test subjects consisted of twenty-two viewers - eleven male Twenty-four test sets were generated as described in section 2. and eleven female - with varying levels of experience viewing and Ratings for the sequences within a test set were then converted into rating video quality. Each viewer performed two full trials of the ordinal rankings using a three rank scale (i.e. ranks of 1, 2 and 3) test, and the results from both trials have been consolidated into for analysis purposes. Tied values were assigned as the average of the overall results. Viewers had normal (20/20) visual acuity or the rankings they would otherwise occupy. corrective lenses. Lower ranking numbers corresponded to higher viewer pref- erence. The rankings for each test set were grouped and summed 3. RESULTS AND ANALYSIS by frame rate to determine overall preferences for each viewer. In order to test preference across a particular subset of sequences, the This section analyzes the results of testing frame rate preferences rankings of each viewer’s ratings of the sequences in that subset for twenty-two viewers. The rank-based Friedman Test used to were summed. The sums for each viewer were then ranked again. perform the statistical analysis is described. Results are discussed The Friedman Test [6] was performed on the resulting set of 22 Sum of Viewer Rankings Preference χ2 Confidence Level by Frame Rate Grouping Motion 10 15 30 10 15 30 By Video Sequence Videoconference L 49.5 31.5 51 2 1 3 10.70 99.5% News L 31.5 36.5 62 1 2 3 24.38 99.9% Crowd M 43 31.5 56.5 2 1 3 14.25 99.9% Martial Arts M 45 33 53 3 1 2 9.23 99.0% Airport M 52 27 53 2 1 2 19.73 99.9% Animation M 39.5 39 51.5 1 1 3 8.07 98.2% Sports H 54 37 41 3 1 2 7.18 97.2% Car chase H 39.5 37.5 55 2 1 3 8.34 98.4% By Motion Low Motion L 39.5 34 58.5 2 1 3 15.02 99.9% Medium Motion M 43 32 57 2 1 3 14.27 99.9% High Motion H 47 35.5 49.5 3 1 2 5.07 92.0% By Encoder Wavelet All 54 35 43 3 1 2 8.27 98.4% H.263+ All 45 32 55 2 1 3 12.09 99.7% VQ All 37.5 36 58.5 2 1 3 14.39 99.9% Table 1.
Recommended publications
  • MPEG Video in Software: Representation, Transmission, and Playback
    High Speed Networking and Multimedia Computing, IS&T/SPIE Symp. on Elec. Imaging Sci. & Tech., San Jose, CA, February 1994. MPEG Video in Software: Representation, Transmission, and Playback Lawrence A. Rowe, Ketan D. Patel, Brian C Smith, and Kim Liu Computer Science Division - EECS University of California Berkeley, CA 94720 ([email protected]) Abstract A software decoder for MPEG-1 video was integrated into a continuous media playback system that supports synchronized playing of audio and video data stored on a file server. The MPEG-1 video playback system supports forward and backward play at variable speeds and random positioning. Sending and receiving side heuristics are described that adapt to frame drops due to network load and the available decoding capacity of the client workstation. A series of experiments show that the playback system adds a small overhead to the stand alone software decoder and that playback is smooth when all frames or very few frames can be decoded. Between these extremes, the system behaves reasonably but can still be improved. 1.0 Introduction As processor speed increases, real-time software decoding of compressed video is possible. We developed a portable software MPEG-1 video decoder that can play small-sized videos (e.g., 160 x 120) in real-time and medium-sized videos within a factor of two of real-time on current workstations [1]. We also developed a system to deliver and play synchronized continuous media streams (e.g., audio, video, images, animation, etc.) on a network [2].Initially, this system supported 8kHz 8-bit audio and hardware-assisted motion JPEG compressed video streams.
    [Show full text]
  • HERO6 Black Manual
    USER MANUAL 1 JOIN THE GOPRO MOVEMENT facebook.com/GoPro youtube.com/GoPro twitter.com/GoPro instagram.com/GoPro TABLE OF CONTENTS TABLE OF CONTENTS Your HERO6 Black 6 Time Lapse Mode: Settings 65 Getting Started 8 Time Lapse Mode: Advanced Settings 69 Navigating Your GoPro 17 Advanced Controls 70 Map of Modes and Settings 22 Connecting to an Audio Accessory 80 Capturing Video and Photos 24 Customizing Your GoPro 81 Settings for Your Activities 26 Important Messages 85 QuikCapture 28 Resetting Your Camera 86 Controlling Your GoPro with Your Voice 30 Mounting 87 Playing Back Your Content 34 Removing the Side Door 5 Using Your Camera with an HDTV 37 Maintenance 93 Connecting to Other Devices 39 Battery Information 94 Offloading Your Content 41 Troubleshooting 97 Video Mode: Capture Modes 45 Customer Support 99 Video Mode: Settings 47 Trademarks 99 Video Mode: Advanced Settings 55 HEVC Advance Notice 100 Photo Mode: Capture Modes 57 Regulatory Information 100 Photo Mode: Settings 59 Photo Mode: Advanced Settings 61 Time Lapse Mode: Capture Modes 63 YOUR HERO6 BLACK YOUR HERO6 BLACK 1 2 4 4 3 11 2 12 5 9 6 13 7 8 4 10 4 14 6 1. Shutter Button [ ] 6. Latch Release Button 10. Speaker 2. Camera Status Light 7. USB-C Port 11. Mode Button [ ] 3. Camera Status Screen 8. Micro HDMI Port 12. Battery 4. Microphone (cable not included) 13. microSD Card Slot 5. Side Door 9. Touch Display 14. Battery Door For information about mounting items that are included in the box, see Mounting (page 87).
    [Show full text]
  • The Evolutionof Premium Vascular Ultrasound
    Ultrasound EPIQ 5 The evolution of premium vascular ultrasound Philips EPIQ 5 ultrasound system The new challenges in global healthcare Unprecedented advances in premium ultrasound performance can help address the strains on overburdened hospitals and healthcare systems, which are continually being challenged to provide a higher quality of care cost-effectively. The goal is quick and accurate diagnosis the first time and in less time. Premium ultrasound users today demand improved clinical information from each scan, faster and more consistent exams that are easier to perform, and allow for a high level of confidence, even for technically difficult patients. 2 Performance More confidence in your diagnoses even for your most difficult cases EPIQ 5 is the new direction for premium vascular ultrasound, featuring an exceptional level of clinical performance to meet the challenges of today’s most demanding practices. Our most powerful architecture ever applied to vascular ultrasound EPIQ performance touches all aspects of acoustic acquisition and processing, allowing you to truly experience the evolution to a more definitive modality. Carotid artery bulb Superficial varicose veins 3 The evolution in premium vascular ultrasound Supported by our family of proprietary PureWave transducers and our leading-edge Anatomical Intelligence, this platform offers our highest level of premium performance. Key trends in global ultrasound • The need for more definitive premium • A demand to automate most operator ultrasound with exceptional image functions
    [Show full text]
  • A Deblocking Filter Hardware Architecture for the High Efficiency
    A Deblocking Filter Hardware Architecture for the High Efficiency Video Coding Standard Cláudio Machado Diniz1, Muhammad Shafique2, Felipe Vogel Dalcin1, Sergio Bampi1, Jörg Henkel2 1Informatics Institute, PPGC, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, Brazil 2Chair for Embedded Systems (CES), Karlsruhe Institute of Technology (KIT), Germany {cmdiniz, fvdalcin, bampi}@inf.ufrgs.br; {muhammad.shafique, henkel}@kit.edu Abstract—The new deblocking filter (DF) tool of the next encoder configuration: (i) Random Access (RA) configuration1 generation High Efficiency Video Coding (HEVC) standard is with Group of Pictures (GOP) equal to 8 (ii) Intra period2 for one of the most time consuming algorithms in video decoding. In each video sequence is defined as in [8] depending upon the order to achieve real-time performance at low-power specific frame rate of the video sequence, e.g. 24, 30, 50 or 60 consumption, we developed a hardware accelerator for this filter. frames per second (fps); (iii) each sequence is encoded with This paper proposes a high throughput hardware architecture four different Quantization Parameter (QP) values for HEVC deblocking filter employing hardware reuse to QP={22,27,32,37} as defined in the HEVC Common Test accelerate filtering decision units with a low area cost. Our Conditions [8]. Fig. 1 shows the accumulated execution time architecture achieves either higher or equivalent throughput (in % of total decoding time) of all functions included in C++ (4096x2048 @ 60 fps) with 5X-6X lower area compared to state- class TComLoopFilter that implement the DF in HEVC of-the-art deblocking filter architectures. decoder software. DF contributes to up to 5%-18% to the total Keywords—HEVC coding; Deblocking Filter; Hardware decoding time, depending on video sequence and QP.
    [Show full text]
  • Analysis of Video Compression Using DCT
    Imperial Journal of Interdisciplinary Research (IJIR) Vol-3, Issue-3, 2017 ISSN: 2454-1362, http://www.onlinejournal.in Analysis of Video Compression using DCT Pranavi Patil1, Sanskruti Patil2 & Harshala Shelke3 & Prof. Anand Sankhe4 1,2,3 Bachelor of Engineering in computer, Mumbai University 4Proffessor, Department of Computer Science & Engineering, Mumbai University Maharashtra, India Abstract: Video is most useful media to represent the where in order to get the best possible compression great information. Videos, images are the most efficiency, it considers certain level of distortion. essential approaches to represent data. Now this year, all the communications are done on such ii. Lossless Compression: media. The central problem for the media is its large The lossless compression technique is generally size. Also this large data contains a lot of redundant focused on the decreasing the compressed output information. The huge usage of digital multimedia video' bit rate without any alteration of the frame. leads to inoperable growth of data flow through The decompressed bit-stream is matching to the various mediums. Now a days, to solve the problem original bit-stream. of bandwidth requirement in communication, multimedia data is a big challenge. The main 2. Video Compression concept in the present paper is about the Discrete The main unit behind the video compression Cosine Transform (DCT) algorithm for compressing approach is the video encoder found at the video's size. Keywords: Compression, DCT, PSNR, transmitter side, which encodes the video that is to be MSE, Redundancy. transmitted in form of bits and also the video decoder positioned at the receiver side, which rebuild the Keywords: Compression, DCT, PSNR, MSE, video in its original form based on the bit sequence Redundancy.
    [Show full text]
  • Contrast-Enhanced High-Frame-Rate Ultrasound Imaging of Flow Patterns in Cardiac Chambers and Deep Vessels
    Delft University of Technology Contrast-Enhanced High-Frame-Rate Ultrasound Imaging of Flow Patterns in Cardiac Chambers and Deep Vessels Vos, Hendrik J.; Voorneveld, Jason D.; Groot Jebbink, Erik; Leow, Chee Hau; Nie, Luzhen; van den Bosch, Annemien E.; Tang, Meng Xing; Freear, Steven; Bosch, Johan G. DOI 10.1016/j.ultrasmedbio.2020.07.022 Publication date 2020 Document Version Final published version Published in Ultrasound in Medicine and Biology Citation (APA) Vos, H. J., Voorneveld, J. D., Groot Jebbink, E., Leow, C. H., Nie, L., van den Bosch, A. E., Tang, M. X., Freear, S., & Bosch, J. G. (2020). Contrast-Enhanced High-Frame-Rate Ultrasound Imaging of Flow Patterns in Cardiac Chambers and Deep Vessels. Ultrasound in Medicine and Biology, 46(11), 2875-2890. https://doi.org/10.1016/j.ultrasmedbio.2020.07.022 Important note To cite this publication, please use the final published version (if applicable). Please check the document version above. Copyright Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons. Takedown policy Please contact us and provide details if you believe this document breaches copyrights. We will remove access to the work immediately and investigate your claim. This work is downloaded from Delft University of Technology. For technical reasons the number of authors shown on this cover page is limited to a maximum of 10. ARTICLE IN PRESS Ultrasound in Med.
    [Show full text]
  • Video Coding Standards
    Module 8 Video Coding Standards Version 2 ECE IIT, Kharagpur Lesson 23 MPEG-1 standards Version 2 ECE IIT, Kharagpur Lesson objectives At the end of this lesson, the students should be able to : 1. Enlist the major video coding standards 2. State the basic objectives of MPEG-1 standard. 3. Enlist the set of constrained parameters in MPEG-1 4. Define the I- P- and B-pictures 5. Present the hierarchical data structure of MPEG-1 6. Define the macroblock modes supported by MPEG-1 23.0 Introduction In lesson 21 and lesson 22, we studied how to perform motion estimation and thereby temporally predict the video frames to exploit significant temporal redundancies present in the video sequence. The error in temporal prediction is encoded by standard transform domain techniques like the DCT, followed by quantization and entropy coding to exploit the spatial and statistical redundancies and achieve significant video compression. The video codecs therefore follow a hybrid coding structure in which DPCM is adopted in temporal domain and DCT or other transform domain techniques in spatial domain. Efforts to standardize video data exchange via storage media or via communication networks are actively in progress since early 1980s. A number of international video and audio standardization activities started within the International Telephone Consultative Committee (CCITT), followed by the International Radio Consultative Committee (CCIR), and the International Standards Organization / International Electrotechnical Commission (ISO/IEC). An experts group, known as the Motion Pictures Expects Group (MPEG) was established in 1988 in the framework of the Joint ISO/IEC Technical Committee with an objective to develop standards for coded representation of moving pictures, associated audio, and their combination for storage and retrieval of digital media.
    [Show full text]
  • Flow Imaging LOGIQ E10 Series
    WHITEPAPER Flow Imaging LOGIQ E10 Series Introduction Ultrasound can be a highly desirable imaging tool to assess flow hemodynamics due to its lack of ionizing radiation, real-time nature, portability, and economy. To address the wide-ranging clinical needs of various specialties, GE Healthcare has made a variety of flow technologies available on the LOGIQ™ E10 Series ultrasound systems, including: • Color Flow • Power Doppler Imaging • Microvascular Imaging • Radiantflow™ • B-Flow™ Imaging This paper will review the technical aspects and clinical benefits of each flow technology. Color Flow Introduction The color flow (CF) mode allows the visualization of flow direction Figure 1 shows CF (light blue, inside the ROI) and background and velocity information within the region of interest (ROI), B-Mode (gray), which are generated by separated transmit (Tx) or color box, defined by the operator. The Doppler shifts waveforms and received (Rx) echoes. The entire CF frame is of returning ultrasound waves within the ROI are color-coded created by overlaying CF information onto the background B-Mode. based on average velocity and direction. A wall motion filter (WMF) is always applied to differentiate true flow and clutter. How CF Works Similar to Pulsed Wave (PW) Doppler, CF utilizes intermittent Tx* & Rx sampling of ultrasound waves, and avoids the range ambiguity of Continuous Wave (CW) Doppler. Flow is depicted in blue when traveling away from the transducer (negative Doppler shift), while flow traveling toward the transducer (positive Doppler shift) is depicted in red. Lighter shades of each color denote higher velocities. The areas of high flow turbulence are depicted in a third color.
    [Show full text]
  • Video Source File Specifications
    Video Source File Specifications Limelight recommends the following specifications for all video source files. Adherence to these specifications will result in optimal playback quality and efficient uploading to your account. Edvance360 Best Practices Videos should be under 1 Gig for best results (download speed and mobile devices), but our limit per file is 2 Gig for Lessons MP4 is the optimum format for uploading videos Compress the video to resolution of 1024 x 768 Limelight does compress the video, but it's best if it's done on the original file A resolution is 1080p or less is recommended Recommended frame rate is 30fps Note: The maximum file size for Introduction Videos in Courses is 50MB. This is located in Courses > Settings > Details > Introduction Video. Ideal Source File In general, a source file that represents the following will produce the best results: MP4 file (H.264/ACC-LC) Fast Start (MOOV atom at the front of file) Progressive scan (no interlacing) Frame rate of 24 (23.98), 25, or 30 (29.97) fps A Bitrate between 5,000 - 8,000 Kbps 720p resolution Detailed Recommendations The table below provides detailed recommendations (CODECs, containers, Bitrates, resolutions, etc.) for all video source material uploaded to a Limelight Account: Source File Element Recommendations Video CODEC Recommended CODEC: H.264 Accepted but not Recommended: MPEG-1, MPEG-2, MPEG-4, VP6, VP5, H.263, Windows Media Video 7 (WMV1), Windows Media Video 8 (WMV2), Windows Media Video 9 (WMV3) Audio CODEC Recommended CODEC: AAC-LC Accepted but not Recommended: MP3, MP2, WMA, WMA Pro, PCM, WAV Container MP4 Source File Element Recommendations Fast-Start Make sure your source file is created with the 'MOOV atom' at the front of the file.
    [Show full text]
  • The Evolutionof
    Ultrasound EPIQ 5 The evolution of premium vascular ultrasound Philips EPIQ 5 ultrasound system The print quality of this copy is not an accurate representation of the original. The new challenges in global healthcare Unprecedented advances in premium ultrasound performance can help address the strains on overburdened hospitals and healthcare systems, which are continually being challenged to provide a higher quality of care cost-effectively. The goal is quick and accurate diagnosis the first time and in less time. Premium ultrasound users today demand improved clinical information from each scan, faster and more consistent exams that are easier to perform, and allow for a high level of confidence, even for technically difficult patients. 2 The print quality of this copy is not an accurate representation of the original. Performance More confidence in your diagnoses even for your most difficult cases EPIQ 5 is the new direction for premium vascular ultrasound, featuring an exceptional level of clinical performance to meet the challenges of today’s most demanding practices. Our most powerful architecture ever applied to vascular ultrasound EPIQ performance touches all aspects of acoustic acquisition and processing, allowing you to truly experience the evolution to a more definitive modality. Carotid artery bulb Superficial varicose veins 3 The print quality of this copy is not an accurate representation of the original. The evolution in premium vascular ultrasound Supported by our family of proprietary PureWave transducers and our leading-edge
    [Show full text]
  • A Deblocking Filter Architecture for High Efficiency Video Coding Standard (HEVC)
    UNIVERSIDADE FEDERAL DO RIO GRANDE DO SUL INSTITUTO DE INFORMÁTICA CURSO DE ENGENHARIA DE COMPUTAÇÃO FELIPE VOGEL DALCIN A Deblocking Filter Architecture for High Efficiency Video Coding Standard (HEVC) Final report presented in partial fulfillment of the Requirements for the degree in Computer Engineering. Advisor: Prof. Dr. Sergio Bampi Co-advisor: MSc. Cláudio Machado Diniz Porto Alegre 2014 UNIVERSIDADE FEDERAL DO RIO GRANDE DO SUL Reitor: Prof. Carlos Alexandre Netto Vice-Reitor: Prof. Rui Vicente Oppermann Pró-Reitor de Graduação: Prof. Sérgio Roberto Kieling Diretor do Instituto de Informática: Prof. Luís da Cunha Lamb Coordenador do Curso de Engenharia de Computação: Prof. Marcelo Götz Bibliotecária-Chefe do Instituto de Informática: Beatriz Regina Bastos Haro “Docendo discimus.” Seneca AKNOWLEDGEMENTS I would like to express my gratitude to my advisor, Prof. Dr. Sergio Bampi, who accepted me as lab assistant in 2010 to work for the research group under his supervision, bringing me the opportunity to stay in touch with a high-level academic environment. A very special thanks goes out to my co-advisor, MSc. Cláudio Machado Diniz, whose expertise, understanding, patience and unconditionall support were vital to the accomplishment of this work, specially taking into considerations the reduced amount of time he had to review this work. Last but not least, I would like to thank my parents and friends for their unconditional support throughout my degree. A special remark goes to my girlfriend, who kept me motivated during the last weeks of work to get this report done. In conclusion, I recognize that this research would not have been possible without the opportunities offered by both the Universidade Federal do Rio Grande do Sul, in Brazil, and the Technische Universität Kaiserslautern, in Germany.
    [Show full text]
  • A Multiplierless Pruned DCT-Like Transformation for Image and Video
    A Multiplierless Pruned DCT-like Transformation for Image and Video Compression that Requires 10 Additions Only Vitor A. Coutinho ∗ Renato J. Cintra F´abio M. Bayer† Sunera Kulasekera Arjuna Madanayake‡ Abstract A multiplierless pruned approximate 8-point discrete cosine transform (DCT) requiring only 10 ad- ditions is introduced. The proposed algorithm was assessed in image and video compression, showing competitive performance with state-of-the-art methods. Digital synthesis in 45 nm CMOS technology up to place-and-route level indicates clock speed of 288 MHz at a 1.1 V supply. The 8 × 8 block rate is 36 MHz. The DCT approximation was embedded into HEVC reference software; resulting video frames, at up to 327 Hz for 8-bit RGB HEVC, presented negligible image degradation. Keywords Approximate discrete cosine transform, pruning, pruned DCT, HEVC 1 Introduction The discrete cosine transform (DCT) plays a fundamental role in signal processing techniques [1] and is part of modern image and video standards, such as JPEG [2], MPEG-1 [3], MPEG-2 [4], H.261 [5], H.263 [6], H.264/AVC [7, 8], and the high efficiency video coding (HEVC) [9, 10]. In particular, the transform coding stage of the H.264 and HEVC standards employs the 8-point DCT of type II [10,11] among other transforms of different blocklenghts, such as 4, 16, and 32 points [12–14]. In [15], the 8-point DCT stage of the HEVC was optimized. Among the above-mentioned standards, the HEVC is capable of achieving high compression performance at approximately half the bit rate required by H.264/AVC with same image quality [10,13,15,16].
    [Show full text]