Arxiv:1602.05975V3 [Cs.MM] 28 Oct 2017 CDEF Works by Identifying the Direction [4] of Each Block Ed =  (Xp − Μd,K) 

Total Page:16

File Type:pdf, Size:1020Kb

Arxiv:1602.05975V3 [Cs.MM] 28 Oct 2017 CDEF Works by Identifying the Direction [4] of Each Block Ed =  (Xp − Μd,K)  THE AV1 CONSTRAINED DIRECTIONAL ENHANCEMENT FILTER (CDEF) Steinar Midtskogen Jean-Marc Valin Cisco Systems Inc. Mozilla Corporation Lysaker, Norway Mountain View, CA, USA [email protected] [email protected] 0 1 2 3 4 5 6 7 0 0 1 1 2 2 3 3 0 0 0 0 0 0 0 0 3 3 2 2 1 1 0 0 ABSTRACT 1 2 3 4 5 6 7 8 1 1 2 2 3 3 4 4 1 1 1 1 1 1 1 1 4 4 3 3 2 2 1 1 2 3 4 5 6 7 8 9 2 2 3 3 4 4 5 5 2 2 2 2 2 2 2 2 5 5 4 4 3 3 2 2 This paper presents the constrained directional enhancement filter 3 4 5 6 7 8 9 10 3 3 4 4 5 5 6 6 3 3 3 3 3 3 3 3 6 6 5 5 4 4 3 3 4 5 6 7 8 9 10 11 4 4 5 5 6 6 7 7 4 4 4 4 4 4 4 4 7 7 6 6 5 5 4 4 designed for the AV1 royalty-free video codec. The in-loop filter 5 6 7 8 9 10 11 12 5 5 6 6 7 7 8 8 5 5 5 5 5 5 5 5 8 8 7 7 6 6 5 5 is based on a non-linear low-pass filter and is designed for vector- 6 7 8 9 10 11 12 13 6 6 7 7 8 8 9 9 6 6 6 6 6 6 6 6 9 9 8 8 7 7 6 6 7 8 9 10 11 12 13 14 7 7 8 8 9 9 10 10 7 7 7 7 7 7 7 7 10 10 9 9 8 8 7 7 ization efficiency. It takes into account the direction of edges and d = 0 d = 1 d = 2 d = 3 patterns being filtered. The filter works by identifying the direc- 7 6 5 4 3 2 1 0 7 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 tion of each block and then adaptively filtering with a high degree 8 7 6 5 4 3 2 1 7 6 5 4 3 2 1 0 0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7 of control over the filter strength along the direction and across it. 9 8 7 6 5 4 3 2 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 1 2 3 4 5 6 7 8 10 9 8 7 6 5 4 3 8 7 6 5 4 3 2 1 0 1 2 3 4 5 6 7 1 2 3 4 5 6 7 8 The proposed enhancement filter is shown to improve the quality of 11 10 9 8 7 6 5 4 9 8 7 6 5 4 3 2 0 1 2 3 4 5 6 7 2 3 4 5 6 7 8 9 the Alliance for Open Media (AOM) AV1 and Thor video codecs in 12 11 10 9 8 7 6 5 9 8 7 6 5 4 3 2 0 1 2 3 4 5 6 7 2 3 4 5 6 7 8 9 13 12 11 10 9 8 7 6 10 9 8 7 6 5 4 3 0 1 2 3 4 5 6 7 3 4 5 6 7 8 9 10 particular in low complexity configurations. 14 13 12 11 10 9 8 7 10 9 8 7 6 5 4 3 0 1 2 3 4 5 6 7 3 4 5 6 7 8 9 10 Index Terms— enhancement filter, directional filter, AV1 d = 4 d = 5 d = 6 d = 7 Fig. 1. Line number k for pixels following direction d = 0 : 7 in 1. INTRODUCTION an 8 × 8 block. The main goal of the in-loop constrained directional enhancement filter (CDEF) is to filter out coding artifacts while retaining the For each block we determine the direction that best matches the details of the image. In HEVC [1], the Sample Adaptive Off- pattern in the block by minimizing the sum of squared differences set (SAO) [2] algorithm achieves a similar goal by defining signal (SSD) between the quantized block and the closest perfectly direc- offsets for different classes of pixels. Unlike SAO, the approach tional block. A perfectly directional block is a block where all of we take in AV1 is that of a non-linear spatial filter. From the very the pixels along a line in one direction have the same value. For beginning, the design of the filter was constrained to be easily vec- each direction, we assign a line number k to each pixel, as shown in torizable (i.e. implementable with SIMD operations), which was Fig.1. not the case for other non-linear filters like the median filter and the For each direction d, the pixel average for line k is bilateral filter [3]. 1 X µd;k = xp ; (1) The CDEF design originates from the following observations. Nd;k The amount of ringing artifacts in a coded image tends to be roughly p2Pd;k proportional to the quantization step size. The amount of detail is a where xp is the value of pixel p, Pd;k is the set of pixels in line k fol- property of the input image, but the smallest detail actually retained lowing direction d and Nd;k is the cardinality of Pd;k (for example, in the quantized image tends to also be proportional to the quanti- in Fig.1, N1;0 = 2 and N1;4 = 8). The SSD is then zation step size. For a given quantization step size, the amplitude of 2 3 the ringing is generally less than the amplitude of the details. 2 X X 2 arXiv:1602.05975v3 [cs.MM] 28 Oct 2017 CDEF works by identifying the direction [4] of each block Ed = 4 (xp − µd;k) 5 : (2) (Sec.2) and then adaptively filtering along the identified direction k p2Pd;k (Sec.3) and to a lesser degree along directions rotated 45 degrees Substituting (1) into (2) and simplifying results in from the identified direction. The filter strengths are signaled ex- 2 plicitly, which allows a high degree of control over the blurring 0 1 2 X 2 X 1 X (Sec.4). Sec.5 demonstrates an efficient encoder search for the Ed = xp − @ xpA : (3) Nd;k filter strengths, with results presented in Sec.6. p k p2Pd;k Note that the simplifications leading to (3) are the same as to those 2. DIRECTION SEARCH 2 P x2 (P x)2 allowing a variance to be computed as σx = N − N2 . Con- sidering that the first term of (3) is constant with respect to d, we The direction search operates on the reconstructed pixels, just after find the optimal direction dopt by maximizing the second term: the deblocking filter. Since those pixels are available to the de- dopt = max sd (4) coder, the directions require no signaling. The search operates on d 8 × 8 blocks, which are small enough to adequately handle non- 0 12 straight edges, while being large enough to reliably estimate direc- X 1 X sd = @ xpA : (5) tions when applied to a quantized image. Having a constant direc- Nd;k tion over an 8×8 region also makes vectorization of the filter easier. k p2Pd;k Initialize all variables to zero for d = 0 to 7 do for i = 0 to 7 do for j = 0 to 7 do L line_table[d][i][j] partial[d][L] partial[d][L] + (pixel[i][j] − 128) count[d][L] count[d][L] + 1 end for end for for L = 0 to 14 do if count[d][L] > 0 then s[d] s[d] + partial[d][L]2 · 840=count[d][L] Fig. 2. Example of direction search for an 8×8 block. The patterns end if shown are based on the µd;k values. In this case, the 45-degree 2 end for direction is selected because it minimizes Ed . The error values Ed shown are never computed in practice (only s is). end for d for d = 0 to 7 do if s[d] > s[best_d] then best_d d We can avoid the division in (5) by multiplying sd by 840, the end if least common multiple of the possible Nd;k values (1 ≤ Nd;k ≤ end for 8). When using 8-bit pixel data, and centering the values such that direction best_d −128 ≤ xp ≤ 127, then 840sd and all calculations needed for sd directional_contrast s[best_d]−s[(best_d+4) mod 8] fit in a 32-bit signed integer. For higher bit depths, we downscale the pixels to 8 bits during the direction search. Algorithm 1: Direction search. The line_table[d][i][j] values are Fig.2 shows an example of a direction search for an 8 × 8 the line numbers shown in Fig.1. The 840=count[d][L] terms can block containing a line. The step-by-step process is described in al- be pre-computed. More functionally equivalent algebraic simplifi- gorithm1. To save on decoder complexity, we assume that luma and cations are possible, but they are not shown here for clarity. chroma directions are correlated, and only search the luma compo- nent.
Recommended publications
  • Network Working Group A
    Network Working Group A. Filippov Internet Draft Huawei Technologies Intended status: Informational A. Norkin Netflix J.R. Alvarez Huawei Technologies Expires: May 17, 2017 November 17, 2016 <Video Codec Requirements and Evaluation Methodology> draft-ietf-netvc-requirements-04.txt Status of this Memo This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet- Drafts. The list of current Internet-Drafts can be accessed at http://datatracker.ietf.org/drafts/current/ Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/1id-abstracts.html The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html This Internet-Draft will expire on May 17, 2017. Copyright Notice Copyright (c) 2016 IETF Trust and the persons identified as the document authors. All rights reserved. This document is subject to BCP 78 and the IETF Trust’s Legal Provisions Relating to IETF Documents (http://trustee.ietf.org/license-info) in effect on the date of publication of this document. Please review these documents carefully, as they describe your rights and restrictions with <Filippov> Expires May 17, 2017 [Page 1] Internet-Draft Video Codec Requirements and Evaluation November 2016 respect to this document.
    [Show full text]
  • Automatic Detection of Perceived Ringing Regions in Compressed Images
    International Journal of Electronics and Communication Engineering. ISSN 0974-2166 Volume 4, Number 5 (2011), pp. 491-516 © International Research Publication House http://www.irphouse.com Automatic Detection of Perceived Ringing Regions in Compressed Images D. Minola Davids Research Scholar, Singhania University, Rajasthan, India Abstract An efficient approach toward a no-reference ringing metric intrinsically exists of two steps: first detecting regions in an image where ringing might occur, and second quantifying the ringing annoyance in these regions. This paper presents a novel approach toward the first step: the automatic detection of regions visually impaired by ringing artifacts in compressed images. It is a no- reference approach, taking into account the specific physical structure of ringing artifacts combined with properties of the human visual system (HVS). To maintain low complexity for real-time applications, the proposed approach adopts a perceptually relevant edge detector to capture regions in the image susceptible to ringing, and a simple yet efficient model of visual masking to determine ringing visibility. Index Terms: Luminance masking, per-ceptual edge, ringing artifact, texture masking. Introduction In current visual communication systems, the most essential task is to fit a large amount of visual information into the narrow bandwidth of transmission channels or into a limited storage space, while maintaining the best possible perceived quality for the viewer. A variety of compression algorithms, for example, such as JPEG[1] and MPEG/H.26xhave been widely adopted in image and video coding trying to achieve high compression efficiency at high quality. These lossy compression techniques, however, inevitably result in various coding artifacts, which by now are known and classified as blockiness, ringing, blur, etc.
    [Show full text]
  • DVP Tutorial
    IP Video Conferencing: A Tutorial Roman Sorokin, Jean-Louis Rougier Abstract Video conferencing is a well-established area of communications, which have been studied for decades. Recently this area has received a new impulse due to significantly increased bandwidth of Local and Wide area networks, appearance of low-priced video equipment and development of web based media technologies. This paper presents the main techniques behind the modern IP-based videoconferencing services, with a particular focus on codecs, network protocols, architectures and standardization efforts. Questions of security and topologies are also tackled. A description of a typical video conference scenario is provided, demonstrating how the technologies, responsible for different conference aspects, are working together. Traditional industrial disposition as well as modern innovative approaches are both addressed. Current industry trends are highlighted in respect to the topics, described in the tutorial. Legacy analog/digital technologies, together with the gateways between the traditional and the IP videoconferencing systems, are not considered. Keywords Video Conferencing, codec, SVC, MCU, SIP, RTP Roman Sorokin ALE International, Colombes, France e-mail: [email protected] Jean-Louis Rougier Télécom ParisTech, Paris, France e-mail: [email protected] 1 1 Introduction Video conferencing is a two-way interactive communication, delivered over networks of different nature, which allows people from several locations to participate in a meeting. Conference participants use video conferencing endpoints of different types. Generally a video conference endpoint has a camera and a microphone. The video stream, generated by the camera, and the audio stream, coming from the microphone, are both compressed and sent to the network interface.
    [Show full text]
  • High Efficiency, Moderate Complexity Video Codec Using Only RF IPR
    Thor High Efficiency, Moderate Complexity Video Codec using only RF IPR draft-fuldseth-netvc-thor-00 Arild Fuldseth, Gisle Bjontegaard (Cisco) IETF 93 – Prague, CZ – July 2015 1 Design principles • Moderate complexity to allow real-time implementation in SW on common HW, as well as new HW designs • Basic building blocks from well-known hybrid approach (motion compensated prediction and transform coding) • Common design elements in modern codecs – Larger block sizes and transforms, up to 64x64 – Quarter pixel interpolation, motion vector prediction, etc. • Cisco RF IPR (note well: declaration filed on draft) – Deblocking, transforms, etc. (some also essential in H.265/4) • Avoid non-RF IPR – If/when others offer RF IPR, design/performance will improve 2 Encoder Architecture Input Transform Quantizer Entropy Output video Coding bitstream - Inverse Transform Intra Frame Prediction Loop filters Inter Frame Prediction Reconstructed Motion Frame Estimation Memory 3 Decoder Architecture Input Entropy Inverse Bitstream Decoding Transform Intra Frame Prediction Loop filters Inter Frame Prediction Output video Reconstructed Frame Memory 4 Block Structure • Super block (SB) 64x64 • Quad-tree split into coding blocks (CB) >= 8x8 • Multiple prediction blocks (PB) per CB • Intra: 1 PB per CB • Inter: 1, 2 (rectangular) or 4 (square) PBs per CB • 1 or 4 transform blocks (TB) per CB 5 Coding-block modes • Intra • Inter0 MV index, no residual information • Inter1 MV index, residual information • Inter2 Explicit motion vector information, residual information
    [Show full text]
  • Hardware for Speech and Audio Coding
    Linköping Studies in Science and Technology Thesis No. 1093 Hardware for Speech and Audio Coding Mikael Olausson LiU-TEK-LIC-2004:22 Department of Electrical Engineering Linköpings universitet, SE-581 83 Linköping, Sweden Linköping 2004 Linköping Studies in Science and Technology Thesis No. 1093 Hardware for Speech and Audio Coding Mikael Olausson LiU-TEK-LIC-2004:22 Department of Electrical Engineering Linköpings universitet, SE-581 83 Linköping, Sweden Linköping 2004 ISBN 91-7373-953-7 ISSN 0280-7971 ii Abstract While the Micro Processors (MPUs) as a general purpose CPU are converging (into Intel Pentium), the DSP processors are diverging. In 1995, approximately 50% of the DSP processors on the market were general purpose processors, but last year only 15% were general purpose DSP processors on the market. The reason general purpose DSP processors fall short to the application specific DSP processors is that most users want to achieve highest performance under mini- mized power consumption and minimized silicon costs. Therefore, a DSP proces- sor must be an Application Specific Instruction set Processor (ASIP) for a group of domain specific applications. An essential feature of the ASIP is its functional acceleration on instruction level, which gives the specific instruction set architecture for a group of appli- cations. Hardware acceleration for digital signal processing in DSP processors is essential to enhance the performance while keeping enough flexibility. In the last 20 years, researchers and DSP semiconductor companies have been working on different kinds of accelerations for digital signal processing. The trade-off be- tween the performance and the flexibility is always an interesting question because all DSP algorithms are "application specific"; the acceleration for audio may not be suitable for the acceleration of baseband signal processing.
    [Show full text]
  • Efficient Multi-Codec Support for OTT Services: HEVC/H.265 And/Or AV1?
    Efficient Multi-Codec Support for OTT Services: HEVC/H.265 and/or AV1? Christian Timmerer†,‡, Martin Smole‡, and Christopher Mueller‡ ‡Bitmovin Inc., †Alpen-Adria-Universität San Francisco, CA, USA and Klagenfurt, Austria, EU ‡{firstname.lastname}@bitmovin.com, †{firstname.lastname}@itec.aau.at Abstract – The success of HTTP adaptive streaming is un- multiple versions (e.g., different resolutions and bitrates) and disputed and technical standards begin to converge to com- each version is divided into predefined pieces of a few sec- mon formats reducing market fragmentation. However, other onds (typically 2-10s). A client first receives a manifest de- obstacles appear in form of multiple video codecs to be sup- scribing the available content on a server, and then, the client ported in the future, which calls for an efficient multi-codec requests pieces based on its context (e.g., observed available support for over-the-top services. In this paper, we review the bandwidth, buffer status, decoding capabilities). Thus, it is state of the art of HTTP adaptive streaming formats with re- able to adapt the media presentation in a dynamic, adaptive spect to new services and video codecs from a deployment way. perspective. Our findings reveal that multi-codec support is The existing different formats use slightly different ter- inevitable for a successful deployment of today's and future minology. Adopting DASH terminology, the versions are re- services and applications. ferred to as representations and pieces are called segments, which we will use henceforth. The major differences between INTRODUCTION these formats are shown in Table 1. We note a strong differ- entiation in the manifest format and it is expected that both Today's over-the-top (OTT) services account for more than MPEG's media presentation description (MPD) and HLS's 70 percent of the internet traffic and this number is expected playlist (m3u8) will coexist at least for some time.
    [Show full text]
  • CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter
    CALIFORNIA STATE UNIVERSITY, NORTHRIDGE Optimized AV1 Inter Prediction using Binary classification techniques A graduate project submitted in partial fulfillment of the requirements for the degree of Master of Science in Software Engineering by Alex Kit Romero May 2020 The graduate project of Alex Kit Romero is approved: ____________________________________ ____________ Dr. Katya Mkrtchyan Date ____________________________________ ____________ Dr. Kyle Dewey Date ____________________________________ ____________ Dr. John J. Noga, Chair Date California State University, Northridge ii Dedication This project is dedicated to all of the Computer Science professors that I have come in contact with other the years who have inspired and encouraged me to pursue a career in computer science. The words and wisdom of these professors are what pushed me to try harder and accomplish more than I ever thought possible. I would like to give a big thanks to the open source community and my fellow cohort of computer science co-workers for always being there with answers to my numerous questions and inquiries. Without their guidance and expertise, I could not have been successful. Lastly, I would like to thank my friends and family who have supported and uplifted me throughout the years. Thank you for believing in me and always telling me to never give up. iii Table of Contents Signature Page ................................................................................................................................ ii Dedication .....................................................................................................................................
    [Show full text]
  • Arxiv:2002.01657V1 [Eess.IV] 5 Feb 2020 Port Lossless Model to Compress Images Lossless
    LEARNED LOSSLESS IMAGE COMPRESSION WITH A HYPERPRIOR AND DISCRETIZED GAUSSIAN MIXTURE LIKELIHOODS Zhengxue Cheng, Heming Sun, Masaru Takeuchi, Jiro Katto Department of Computer Science and Communications Engineering, Waseda University, Tokyo, Japan. ABSTRACT effectively in [12, 13, 14]. Some methods decorrelate each Lossless image compression is an important task in the field channel of latent codes and apply deep residual learning to of multimedia communication. Traditional image codecs improve the performance as [15, 16, 17]. However, deep typically support lossless mode, such as WebP, JPEG2000, learning based lossless compression has rarely discussed. FLIF. Recently, deep learning based approaches have started One related work is L3C [18] to propose a hierarchical archi- to show the potential at this point. HyperPrior is an effective tecture with 3 scales to compress images lossless. technique proposed for lossy image compression. This paper In this paper, we propose a learned lossless image com- generalizes the hyperprior from lossy model to lossless com- pression using a hyperprior and discretized Gaussian mixture pression, and proposes a L2-norm term into the loss function likelihoods. Our contributions mainly consist of two aspects. to speed up training procedure. Besides, this paper also in- First, we generalize the hyperprior from lossy model to loss- vestigated different parameterized models for latent codes, less compression model, and propose a loss function with L2- and propose to use Gaussian mixture likelihoods to achieve norm for lossless compression to speed up training. Second, adaptive and flexible context models. Experimental results we investigate four parameterized distributions and propose validate our method can outperform existing deep learning to use Gaussian mixture likelihoods for the context model.
    [Show full text]
  • Analysis of Ringing Artifact in Image Fusion Using Directional Wavelet
    Special Issue - 2021 International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 NTASU - 2020 Conference Proceedings Analysis of Ringing Artifact in Image Fusion Using Directional Wavelet Transforms y z x Ashish V. Vanmali Tushar Kataria , Samrudhha G. Kelkar , Vikram M. Gadre Dept. of Information Technology Dept. of Electrical Engineering Vidyavardhini’s C.O.E. & Tech. Indian Institute of Technology, Bombay Vasai, Mumbai, India – 401202 Powai, Mumbai, India – 400076 Abstract—In the field of multi-data analysis and fusion, image • Images taken from multiple sensors. Examples include fusion plays a vital role for many applications. With inventions near infrared (NIR) images, IR images, CT, MRI, PET, of new sensors, the demand of high quality image fusion fMRI etc. algorithms has seen tremendous growth. Wavelet based fusion is a popular choice for many image fusion algorithms, because of its We can broadly classify the image fusion techniques into four ability to decouple different features of information. However, categories: it suffers from ringing artifacts generated in the output. This 1) Component substitution based fusion algorithms [1]–[5] paper presents an analysis of ringing artifacts in application of 2) Optimization based fusion algorithms [6]–[10] image fusion using directional wavelets (curvelets, contourlets, non-subsampled contourlets etc.). We compare the performance 3) Multi-resolution (wavelets and others) based fusion algo- of various fusion rules for directional wavelets available in rithms [11]–[15] and literature. The experimental results suggest that the ringing 4) Neural network based fusion algorithms [16]–[19]. artifacts are present in all types of wavelets with the extent of Wavelet based multi-resolution analysis decouples data into artifact varying with type of the wavelet, fusion rule used and levels of decomposition.
    [Show full text]
  • Course Outline & Schedule
    Course Outline & Schedule Call US 408-759-5074 or UK +44 20 7620 0033 Open Media Encoding Techniques Course Code PWL396 Duration 3 Day Course Price $2,815 Course Description Video, TV and Image technology today dominates Internet Services. Whether it be for live TV, Streamed movies or video clips within social media video images are everywhere. The long-term efficiency of services depends upon the methods and mechanisms used to encode these services. We have relied upon the developments from the Digital Video Broadcast industry, ISO, MPEG and the ITU to provide us with standard ways to achieve this. However the patent royalty cost is now considered to be holding back this efficient development. All commercial users of the normal encoding used such as H.264, H.265, HEVC and other standardised codecs are required to pay royalties for using this technology through a firm of US patent Lawyers known as MPEG-LA. Each new ITU- T standard encoding requires new and increased payments. The Alliance for Open Media is founded by leading Internet companies focused on developing next-generation media formats, codecs and technologies in the public interest. The new Alliance is committing its collective technology and expertise to meet growing Internet demand for top-quality video, audio, imagery and streaming across devices of all kinds and for users worldwide. The aim is to develop royalty free standardized encoding based upon the technology contributed by its members. This course provides a technical study of Video Coding and the technologies which the developing Open Media implementations are based upon.
    [Show full text]
  • 2D Signal Processing (III): 2D Filtering
    6.003 Signal Processing Week 11, Lecture A: 2D Signal Processing (III): 2D Filtering 6.003 Fall 2020 Filtering and Convolution Time domain: Frequency domain: 푋[푘푟, 푘푐] ∙ 퐻[푘푟, 푘푐] = 푌[푘푟, 푘푐] Today: More detailed look and further understanding about 2D filtering. 2D Low Pass Filtering Given the following image, what happens if we apply a filter that zeros out all the high frequencies in the image? Where did the ripples come from? 2D Low Pass Filtering The operation we did is equivalent to filtering: 푌 푘푟, 푘푐 = 푋 푘푟, 푘푐 ∙ 퐻퐿 푘푟, 푘푐 , where 2 2 1 푓 푘푟 + 푘푐 ≤ 25 퐻퐿 푘푟, 푘푐 = ቊ 0 표푡ℎ푒푟푤푠푒 2D Low Pass Filtering 2 2 1 푓 푘푟 + 푘푐 ≤ 25 Find the 2D unit-sample response of the 2D LPF. 퐻퐿 푘푟, 푘푐 = ቊ 0 표푡ℎ푒푟푤푠푒 퐻 푘 , 푘 퐿 푟 푐 ℎ퐿 푟, 푐 (Rec 8A): 푠푛 Ω 푛 ℎ 푛 = 푐 퐿 휋푛 Ω푐 Ω푐 The step changes in 퐻퐿 푘푟, 푘푐 generated overshoot: Gibb’s phenomenon. 2D Convolution Multiplying by the LPF is equivalent to circular convolution with its spatial-domain representation. ⊛ = 2D Low Pass Filtering Consider using the following filter, which is a circularly symmetric version of the Hann window. 2 2 1 1 푘푟 + 푘푐 2 2 + 푐표푠 휋 ∙ 푓 푘푟 + 푘푐 ≤ 25 퐻퐿2 푘푟, 푘푐 = 2 2 25 0 표푡ℎ푒푟푤푠푒 2 2 1 푓 푘푟 + 푘푐 ≤ 25 퐻퐿1 푘푟, 푘푐 = ቐ Now the ripples are gone. 0 표푡ℎ푒푟푤푠푒 But image more blurred when compare to the one with LPF1: With the same base-width, Hann window filter cut off more high freq.
    [Show full text]
  • Handout 04 Filtering in the Frequency Domain
    Digital Image Processing (420137), SSE, Tongji University, Spring 2013, given by Lin ZHANG Handout 04 Filtering in the Frequency Domain 1. Basic Concepts (1) Any periodic functions, no matter how complex it is, can be represented as the sum of a series of sine and cosine functions with different frequencies. (2) The signal’s frequency components can be clearly observed in the frequency domain. (3) Fourier transform is invertible. That means after the Fourier transform and the inverse Fourier transform, no information will be lost. (4) Usually, the Fourier transform F(u) is a complex number, from which we can define its magnitude, power spectrum, and the phase angle. (5) The impulse function, or Dirac function, is widely used in theoretical analysis of linear systems. It can be considered as an infinitely high, infinitely thin spike at the origin, with total area one under the spike. (6) Dirac function has a sift property, defined as f ()tttdtft (00 ) ( ). (7) The basic theorem linking the spatial domain to the frequency domain is the convolution theorem, ft()*() ht F ( ) H ( ). (8) The Fourier transform of a Gaussian function is also of a Gaussian shape in the frequency domain. (9) For visualization purpose, we often use fftshift to shift the origin of an image’s Fourier transform to its center position. (10) The basic steps for frequency filtering comprise the following, (11) Edges and other sharp intensity transitions, such as noise, in an image contribute significantly to the high frequency content of its Fourier transform. (12) The drawback of the ideal low-pass filter is that the filtering result has obvious ringing artifacts.
    [Show full text]