Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B

Total Page:16

File Type:pdf, Size:1020Kb

Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B. Zerubia, Senior Member, IEEE, and Marc Berthod Abstract—In this paper, we have derived analytic expressions proach is to relate the temporal derivatives of the sequence to the for the phase correlation of downsampled images. We have shown spatial derivatives of the reference frame using the so called gra- that for downsampled images the signal power in the phase corre- dient constraint equation under intensity conservation assump- lation is not concentrated in a single peak, but rather in several co- herent peaks mostly adjacent to each other. These coherent peaks tion [10], [11], [17]. Due to the intensity conservation assump- correspond to the polyphase transform of a filtered unit impulse tion, these methods often require that the inter-frame displace- centered at the point of registration. The analytic results provide ments to be relatively small compared to the intensity gradi- a closed-form solution to subpixel translation estimation, and are ents of the reference frame. Also, to reduce the sensitivity of used for detailed error analysis. Excellent results have been ob- tained for subpixel translation estimation of images of different na- the derivative operators to the noise process, in general, some ture and across different spectral bands. regularization is required and the gradient constraint is com- bined within some local neighborhood of each pixel. However, Index Terms—Image alignment, phase correlation, subpixel reg- istration. the main pitfall of this approach is probably the so called aper- ture problem [10], where for some patterns such as a gradual curve, the available information in a local neighborhood of a I. INTRODUCTION pixel (small aperture) is not sufficient to disambiguate the true NALYSIS and fusion of data in an image sequence often registration parameters. A require registration of the images. A wide variety of A second possible approach without interpolation is to for- methods and techniques [4] can be found in the literature for mulate subpixel registration as an optimization problem. Ex- solving this fundamental and challenging problem. Although amples of this approach include [14], [27], [29]. In this ap- in many applications pixel-level registration may be adequate, proach the problem is formulated as a cost function to be min- some important problems in remote sensing [13], [22], [25], imized with respect to the registration parameters of the inter- [26] and biomedical imaging [5], [9] have introduced the frame transformation model. As in the gradient based methods requirement for subpixel registration. these methods are highly dependent on image intensity con- In this paper, we are interested in the refinement of coarsely servation assumption and would fail when applied across mul- registered images to subpixel accuracy, where by coarse regis- tiple spectral bands or when considerable amount of luminance tration we imply pixel-level registration. We will assume that variations are present between frames. This shortcoming can the error in registration at this refinement level is a translation probably be alleviated by an adequate normalization of var- within the bounds of the noise and the error of the imaging ious spectral bands. Also note that these methods as well as the system. Therefore we confine our attention only to the problem gradient-based methods implicitly include interpolation by ap- of subpixel translation estimation. We are also interested in sub- plying local smoothing and regularizing operators. pixel registration across different spectral bands. There are also other techniques that are based on the local The most commonly used approach to subpixel registration normalized correlation [1], polynomial regression [32], the dis- is based on interpolation. Examples include correlation inter- crete cosine transform [15], and the use of control points using polation [6], [28], intensity interpolation [28], phase correlation a model-based approach [7]. interpolation [21], [28] and the geometric methods [2], [8]. It is This work is particularly motivated by some important fea- obvious that the accuracy of these methods depends highly on tures of the phase correlation method which make its use at- the quality of the interpolation algorithms. tractive for multispectral registration, and hence analytic results One popular approach for subpixel registration without in- have been derived to demonstrate how the method can be ex- terpolation is based on the differential properties of image se- tended to subpixel accuracy. A condensed version of part of this quences [11]–[13], [17], [28], [30]. The main idea in this ap- work was presented in [23] and [24]. This paper is organized as follows. In the next section, we Manuscript received June 8, 2000; revised November 13, 2001. The associate describe the phase correlation method and outline its impor- editor coordinating the review of this manuscript and approving it for publica- tant properties. In Section III, we present our extension of the tion was Dr. Eric L. Miller. H. Foroosh (Shekarforoush) is with the Department of Electrical Engineering method to subpixel registration. Section IV provides thorough and Computer Science, University of California, Berkeley, CA 94720 USA analysis of various sources of error. Experimental results are (e-mail: [email protected]). then given in Section V for different modalities of images in- J. Zerubia and M. Berthod are with INRIA, 06902 Sophia Antipolis Cedex, France (e-mail: [email protected]; [email protected]). cluding satellite images across different spectral bands. Finally Publisher Item Identifier S 1057-7149(02)00803-5. in Section VI some concluding remarks are provided. 1057–7149/02$17.00 © 2002 IEEE FOROOSH et al.: EXTENSION OF PHASE CORRELATION TO SUBPIXEL REGISTRATION 189 (a) (b) (c) (d) Fig. 1. (a) and (b) Aerial images of Paris with displacements along both axes, (c) standard cross-correlation, and (d) phase correlation. II. PHASE CORRELATION METHOD rather inaccurate since it requires fitting a plane to noisy phase difference data. The idea behind this method [16], [21] is quite simple and The second possible approach which is more practical and is based on the Fourier shift property [20], which states that a also more robust to noise is to first inverse Fourier transform the shift in the coordinate frames of two functions is transformed normalized cross power spectrum. It is then a simple matter to in the Fourier domain as linear phase differences. This can be determine ( described as follows: ), since from (3) the result is which is a Dirac delta function centered at ( ). Let and be two functions that are absolutely integrable over . Let also In practice, when dealing with images, and are speci- fied only in finite size discretized arrays. However, replacing the (1) Fourier transform by the discrete Fourier transform (DFT) and the Dirac delta function by a unit impulse, and also assuming a According to the Fourier shift property periodic extension of the images outside their compact support, these results will still hold and work remarkably well [16]. (2) A. Properties of the Method Hence the normalized cross power spectrum is given by The most remarkable property of the phase correlation method compared to the classical cross correlation method is (3) the accuracy by which the peak of the correlation function can be detected. Fig. 1 shows an example of two displaced aerial where indicates the complex conjugate. images. The phase correlation method provides a distinct sharp The normalized cross power spectrum may also be viewed as peak at the point of registration whereas the standard cross the cross power spectrum of whitened signals. There are two correlation yields several broad peaks and a main peak whose possible ways of solving (3) for ( ). One way is to di- maximum is not always exactly centered at the right point. rectly work in the Fourier domain. For this purpose, consider a A second important property is due to whitening of the signals three-dimensional (3-D) Euclidean space whose canonical ref- by normalization, which makes the phase correlation notably ro- erence frame is given by the two frequency axes and the phase bust to those types of noise that are correlated to the image func- difference between the two images. In this space tion, e.g., uniform variations of illumination, offsets in average defines a plane through the origin perpendicular to the vector intensity, and fixed gain errors due to calibration. This property ( ) whose slopes along the two frequency axes specify the also makes phase correlation suitable for registration across dif- shifts along the two spatial axes [i.e., ( )]. This approach is ferent spectral bands. 190 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 11, NO. 3, MARCH 2002 Using the convolution theorem, it can be shown that the and method can also handle blurred images, provided that the blurring kernel is relatively invariant from one frame to another. One may for instance use this property to register images con- taminated with wide-band additive noise, by taking the phase (5) correlation in the low-frequency portion of the spectrum [12]. In the discrete case, however, (3) is valid only if the shift Therefore, the cross-power spectrum of the downsampled im- vector ( ) is of integer values. Therefore, when applied to ages is given by (6), shown at the bottom of the page, where discrete images, the method would fail to detect noninteger sub- pixel shifts. To our best knowledge the only practical approach proposed in the literature for adapting the method to subpixel (7) estimation is the use of some interpolation methods [21], [28]. Also, as is known [16] the phase correlation always contains a single coherent peak at the point of registration corresponding to signal power, and some incoherent peaks which can be assumed It follows from these results that, the normalized cross power to be distributed normally over a mean value of zero.
Recommended publications
  • WFM7200 Multiformat Multistandard Waveform Monitor
    Multiformat, multistandard waveform monitor WFM7200 Datasheet Comprehensive audio monitoring (Option AD or DPE) Up to 16-channel audio monitoring for embedded audio Multichannel Surround Sound 1 display and flexible Lissajous display with audio level readouts Audio Loudness monitoring to ITU-R BS.1770-3 with audio trigger start/stop functions via GPI (Option AD or DPE) Support for analog, digital, and embedded audio (Option AD) Dolby Digital (AC-3), Dolby Digital Plus, and Dolby E (Option DPE) Comprehensive Dolby metadata decode and display (Option DPE) Dolby E Guard Band meter with user-defined limits (Option DPE) Unmatched display versatility FlexVu™, the most flexible four-tile display, suited for various application needs to increase productivity Patented Diamond and Arrowhead displays for gamut This video/audio/data monitor and analyzer all-in-one platform provides monitoring flexible options and field installable upgrades to monitor a diverse variety of Patented Timing and Lightning displays video and audio formats. Support for video formats includes 3G-SDI, Dual New patented Spearhead display and Luma Qualified Vector Link, HD-SDI and composite analog. Support for audio formats includes (LQV™) display facilitate precise color adjustment for post- Dolby E, Dolby Digital Plus, Dolby Digital, AES/EBU, embedded audio and production applications (Option PROD) analog audio. Stereoscopic 3D video displays for camera alignment and production/ Option GEN provides a simple test signal generator output that creates post-production applications
    [Show full text]
  • Camera Basics, Principles and Techniques –MCD 401 VU Topic No
    Camera Basics, Principles and techniques –MCD 401 VU Topic no. 76 Film & Video Conversion Television standards conversion is the process of changing one type of TV system to another. The most common is from NTSC to PAL or the other way around. This is done so TV programs in one nation may be viewed in a nation with a different standard. The TV video is fed through a video standards converter that changes the video to a different video system. Converting between a different numbers of pixels and different frame rates in video pictures is a complex technical problem. However, the international exchange of TV programming makes standards conversion necessary and in many cases mandatory. Vastly different TV systems emerged for political and technical reasons – and it is only luck that makes video programming from one nation compatible with another. History The first known case of TV systems conversion probably was in Europe a few years after World War II – mainly with the RTF (France) and the BBC (UK) trying to exchange their441 line and 405 line programming. The problem got worse with the introduction of PAL, SECAM (both 625 lines), and the French 819 line service. Until the 1980s, standards conversion was so difficult that 24 frame/s 16 mm or 35 mm film was the preferred medium of programming interchange. Overview Perhaps the most technically challenging conversion to make is the PAL to NTSC. PAL is 625 lines at 50 fields/s NTSC is 525 lines at 59.94 fields/s (60,000/1,001 fields/s) The two TV standards are for all practical purposes, temporally and spatially incompatible with each other.
    [Show full text]
  • Computational Complexity Optimization on H.264 Scalable/Multiview Video Coding
    Computational Complexity Optimization on H.264 Scalable/Multiview Video Coding By Guangyao Zhang A thesis submitted in partial fulfilment for the requirements for the degree of PhD, at the University of Central Lancashire April 2014 Student Declaration Concurrent registration for two or more academic awards I declare that while registered as a candidate for the research degree, I have not been a registered candidate or enrolled student for another award of the University or other academic or professional institution. Material submitted for another award I declare that no material contained in the thesis has been used in any other submission for an academic award and is solely my own work. Collaboration This work presented in this thesis was carried out at the ADSIP (Applied Digital Signal and Image Processing) Research Centre, University of Central Lancashire. The work described in the thesis is entirely the candidate’s own work. Signature of Candidate ________________________________________ Type of Award Doctor of Philosophy School School of Computing, Engineering and Physical Sciences Abstract Abstract The H.264/MPEG-4 Advanced Video Coding (AVC) standard is a high efficiency and flexible video coding standard compared to previous standards. The high efficiency is achieved by utilizing a comprehensive full search motion estimation method. Although the H.264 standard improves the visual quality at low bitrates, it enormously increases the computational complexity. The research described in this thesis focuses on optimization of the computational complexity on H.264 scalable and multiview video coding. Nowadays, video application areas range from multimedia messaging and mobile to high definition television, and they use different type of transmission systems.
    [Show full text]
  • 3. Visual Attention
    Brunel UNIVERSITY WEST LONDON INTELLIGENT IMAGE CROPPING AND SCALING A thesis submitted for the degree of Doctor of Philosophy by Joerg Deigmoeller School of Engineering and Design th 24 January 2011 1 Abstract Nowadays, there exist a huge number of end devices with different screen properties for watching television content, which is either broadcasted or transmitted over the internet. To allow best viewing conditions on each of these devices, different image formats have to be provided by the broadcaster. Producing content for every single format is, however, not applicable by the broadcaster as it is much too laborious and costly. The most obvious solution for providing multiple image formats is to produce one high- resolution format and prepare formats of lower resolution from this. One possibility to do this is to simply scale video images to the resolution of the target image format. Two significant drawbacks are the loss of image details through downscaling and possibly unused image areas due to letter- or pillarboxes. A preferable solution is to find the contextual most important region in the high-resolution format at first and crop this area with an aspect ratio of the target image format afterwards. On the other hand, defining the contextual most important region manually is very time consuming. Trying to apply that to live productions would be nearly impossible. Therefore, some approaches exist that automatically define cropping areas. To do so, they extract visual features, like moving areas in a video, and define regions of interest (ROIs) based on those. ROIs are finally used to define an enclosing cropping area.
    [Show full text]
  • The Art of Sound Reproduction the Art of Sound Reproduction
    The Art of Sound Reproduction The Art of Sound Reproduction John Watkinson Focal Press An imprint of Butterworth-Heinemann Ltd 225 Wildwood Avenue, Woburn, MA 01801-2041 Linacre House, Jordan Hill, Oxford OX2 8DP A member of the Reed Elsevier plc group OXFORD JOHANNESBURG BOSTON MELBOURNE NEW DELHI SINGAPORE First published 1998 John Watkinson 1998 All rights reserved. No part of this publication may be reproduced in any material form (including photocopying or storing in any medium by electronic means and whether or not transiently or incidentally to some other use of this publication) without the written permission of the copyright holder except in accordance with the provisions of the Copyright, Designs and Patents Act 1988 or under the terms of a licence issued by the Copyright Licensing Agency Ltd, 90 Tottenham Court Road, London, England W1P 9HE. Applications for the copyright holder’s written permission to reproduce any part of this publication should be addressed to the publishers British Library Cataloguing in Publication Data A catalogue record for this book is available from the British Library Library of Congress Cataloguing in Publication Data A catalogue record for this book is available from the Library of Congress ISBN 0 240 51512 9 Typeset by Laser Words, Madras, India Printed and bound in Great Britain Contents Preface xiii Chapter 1 Introduction 1 1.1 A short history 1 1.2 Types of reproduction 8 1.3 Sound systems 12 1.4 Portable consumer equipment 14 1.5 Fixed consumer equipment 14 1.6 High-end hi-fi 16 1.7 Public address
    [Show full text]
  • State-Of-The Art Motion Estimation in the Context of 3D TV
    State-of-the Art Motion Estimation in the Context of 3D TV ABSTRACT Progress in image sensors and computation power has fueled studies to improve acquisition, processing, and analysis of 3D streams along with 3D scenes/objects reconstruction. The role of motion compensation/motion estimation (MCME) in 3D TV from end-to-end user is investigated in this chapter. Motion vectors (MVs) are closely related to the concept of disparities and they can help improving dynamic scene acquisition, content creation, 2D to 3D conversion, compression coding, decompression/decoding, scene rendering, error concealment, virtual/augmented reality handling, intelligent content retrieval and displaying. Although there are different 3D shape extraction methods, this text focuses mostly on shape-from-motion (SfM) techniques due to their relevance to 3D TV. SfM extraction can restore 3D shape information from a single camera data. A.1 INTRODUCTION Technological convergence has been prompting changes in 3D image rendering together with communication paradigms. It implies interaction with other areas, such as games, that are designed for both TV and the Internet. Obtaining and creating perspective time varying scenes are essential for 3D TV growth and involve knowledge from multidisciplinary areas such as image processing, computer graphics (CG), physics, computer vision, game design, and behavioral sciences (Javidi & Okano, 2002). 3D video refers to previously recorded sequences. 3D TV, on the other hand, comprises acquirement, coding, transmission, reception, decoding, error concealment (EC), and reproduction of streaming video. This chapter sheds some light on the importance of motion compensation and motion estimation (MCME) for an end-to-end 3D TV system, since motion information can help dealing with the huge amount of data involved in acquiring, handing out, exploring, modifying, and reconstructing 3D entities present in video streams.
    [Show full text]
  • A PHASE BASED DENSE STEREO ALGORITHM IMPLEMENTED in CUDA a Thesis by BRENT DAVID MACOMBER Submitted to the Office of Graduate St
    A PHASE BASED DENSE STEREO ALGORITHM IMPLEMENTED IN CUDA A Thesis by BRENT DAVID MACOMBER Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE May 2011 Major Subject: Aerospace Engineering A PHASE BASED DENSE STEREO ALGORITHM IMPLEMENTED IN CUDA A Thesis by BRENT DAVID MACOMBER Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Approved by: Chair of Committee, John Junkins Committee Members, John Hurtado Jim Ji James Turner Head of Department, Dimitris Lagoudas May 2011 Major Subject: Aerospace Engineering iii ABSTRACT A Phase Based Dense Stereo Algorithm Implemented in CUDA. (May 2011) Brent Macomber, B.A., University of California at Berkeley Chair of Advisory Committee: Dr. John Junkins Stereo imaging is routinely used in Simultaneous Localization and Mapping (SLAM) systems for the navigation and control of autonomous spacecraft proxim- ity operations, advanced robotics, and robotic mapping and surveying applications. A key step (and generally the most computationally expensive step) in the genera- tion of high fidelity geometric environment models from image data is the solution of the dense stereo correspondence problem. A novel method for solving the stereo correspondence problem to sub-pixel accuracy in the Fourier frequency domain by exploiting the Convolution Theorem is developed. The method is tailored to chal- lenging aerospace applications by incorporation of correction factors for common error sources. Error-checking metrics verify correspondence matches to ensure high quality depth reconstructions are generated.
    [Show full text]
  • Discovery Communucations Global Technical Specifications Version
    DCL GLOBAL TECHNICAL SPECIFICATIONS VERSION 4.1 DCL GLOBAL TECHNICAL SPECIFICATIONS. ©2014, DISCOVERY COMMUNICATIONS LLC. CONFIDENTIAL. 1 INTRODUCTION This specification notifies Discovery’s production partners of Discovery’s technical requirements and standards for program masters, graphics masters, and supplemental program audio. Production partners are responsible for reading and understanding these requirements and delivering products that meet them. In this specification, “Discovery” refers to any network owned by Discovery Communications LLC (DCL), as well as any of DCL’s joint ventures. The term “production partner” refers to the production company providing a program to Discovery, whether through commission, co-production, or acquisition. WHAT IS IN THIS SPECIFICATION? This specification includes Discovery’s technical requirements for SD, HD, and stereoscopic 3D programs. It applies to programs delivered in any of the seven television standards that Discovery accepts for program delivery. • 1080i 59.94 • 1080i 50 • 1080p 29.97 • 1080p 25 • 1080p 23.98 • NTSC • PAL Production partners must refer to their contracts to determine which of these television standards Discovery requires for their program. This specification also includes Discovery’s technical requirements for the delivery of program graphics packages on LTO data tape. This specification does not include Discovery’s requirements for file based program master delivery nor for delivery of file based camera footage. Production partners can find those requirements in the “Discovery File Based Program Masters” addendum and the “Discovery File Based Footage Delivery Policy.” NEW IN VERSION 4.1 Version 4.1 of this specification includes clarifications and changes from version 4.0. These are some of the key differences.
    [Show full text]
  • Multichannel HD/SD Waveform Monitors
    PRODUCT CODE: PT0700 SERIES Multichannel HD/SD Waveform Monitors The PT0700 Series has been designed to meet the increasingly complex demands being placed on the broadcasters of today. In order to improve efficiency and to reduce costs we have combined precision audio and video metering in one chassis FEATURES... and full remote access via Ethernet or our own client panel, PT0700R which permits full access with independent control and display. Comprehensive Loudness • Up to 2 independent Auto Sensing measurements to ITU, EBU R128 & ATSC A/85 and logging via the Ethernet port are HD/SD Inputs included in designated versions • 1 Assignable HD/SD SDI Output The PT0700 Series is capable of handling two HD/SD SDI inputs and up to 16 audio channels on the internal Screen. The • Zoom with cursors embedded audio from each of the 2 input channels can be de-embedded (maximum 16 channels). • Vectorscope Using the inbuilt DK-Technologies audio matrix, a stereo downmix can be easily achieved removing the requirement for • Colour Gamut Alarms external matrixes or mixer. The meter can been equipped with Dolby E decoding. The audio elements include 5.1 Surround • STA - Smart Timing Analysis (StarFish™), ITU/EBU/ATSC Loudness measurements & FFT Spectrum Analysis. • SVLC - Smart Video Landscape Compressor for detail highlighting • Independent External Reference Input • Full HD/SD Audio De-embedding (16 Channels) • Dolby E/D (AC3) Decoder • StarFish™ & JellyFish™ • 5.1/6.1/7.1 Metering • EBU, ITU & ATSC Loudness scales & logging • Simultaneous Audio & Video display • Independent Video & Audio controls • Full remote control & display on PC or via optional PT0700R Client Panel • Assignable DVI/VGA Output via DVI connector OPTIONS..
    [Show full text]
  • Analog Audio Recording
    The Art of Sound Reproduction This Page Intentionally Left Blank The Art of Sound Reproduction John Watkinson Focal Press An imprint of Butterworth-Heinemann Ltd 225 Wildwood Avenue, Woburn, MA 01801-2041 Linacre House, Jordan Hill, Oxford OX2 8DP A member of the Reed Elsevier plc group OXFORD JOHANNESBURG BOSTON MELBOURNE NEW DELHI SINGAPORE First published 1998 John Watkinson 1998 All rights reserved. No part of this publication may be reproduced in any material form (including photocopying or storing in any medium by electronic means and whether or not transiently or incidentally to some other use of this publication) without the written permission of the copyright holder except in accordance with the provisions of the Copyright, Designs and Patents Act 1988 or under the terms of a licence issued by the Copyright Licensing Agency Ltd, 90 Tottenham Court Road, London, England W1P 9HE. Applications for the copyright holder’s written permission to reproduce any part of this publication should be addressed to the publishers British Library Cataloguing in Publication Data A catalogue record for this book is available from the British Library Library of Congress Cataloguing in Publication Data A catalogue record for this book is available from the Library of Congress ISBN 0 240 51512 9 Typeset by Laser Words, Madras, India Printed and bound in Great Britain Contents Preface xiii Chapter 1 Introduction 1 1.1 A short history 1 1.2 Types of reproduction 8 1.3 Sound systems 12 1.4 Portable consumer equipment 14 1.5 Fixed consumer equipment 14
    [Show full text]
  • WFM8200 and WFM8300 Waveform Monitors User Manual
    xx WFM8200 and WFM8300 Waveform Monitors ZZZ User Manual *P077025303* 077-0253-03 xx WFM8200 and WFM8300 Waveform Monitors ZZZ User Manual www.tektronix.com 077-0253-03 Copyright © Tektronix. All rights reserved. Licensed software products are owned by Tektronix or its subsidiaries or suppliers, and are protected by national copyright laws and international treaty provisions. Tektronix products are covered by U.S. and foreign patents, issued and pending. Information in this publication supersedes that in all previously published material. Specifications and price change privileges reserved. TEKTRONIX and TEK are registered trademarks of Tektronix, Inc. FlexVu and CaptureVu are registered trademarks of Tektronix, Inc. Contacting Tektronix Tektronix, Inc. 14150 SW Karl Braun Drive P.O. Box 500 Beaverton, OR 97077 USA For product information, sales, service, and technical support: In North America, call 1-800-833-9200. Worldwide, visit www.tektronix.com to find contacts in your area. Warranty Tektronix warrants that this product will be free from defects in materials and workmanship for a period of one (1) year from the date of shipment. If any such product proves defective during this warranty period, Tektronix, at its option, either will repair the defective product without charge for parts and labor, or will provide a replacement in exchange for the defective product. Parts, modules and replacement products used by Tektronix for warranty work may be new or reconditioned to like new performance. All replaced parts, modules and products become the property of Tektronix. In order to obtain service under this warranty, Customer must notify Tektronix of the defect before the expiration of the warranty period and make suitable arrangements for the performance of service.
    [Show full text]
  • Estimating Translation/Deformation Motion Through Phase Correlation *
    Estimating Translation/Deformation Motion through Phase Correlation * Filiberto Pla t and Miroslaw Bober~ t Dept. of Computer Science. University Jaume I, 12071 Castell6, Spain. Dept. of Electrical and Electronic Engineering. University of Surrey, Guilford GU2 5XH, UK. pla@uj i. es m. bober@ee, surrey, ac. uk Abstract. Phase correlation techniques have been used in image reg- istration to estimate image displacements. These techniques have been also used to estimate optical flow by applying it locally. In this work a different phase correlation-based method is proposed to deal with a deformation/translation motion model, instead of the pure translations that the basic phase correlation technique can estimate. Some experi- mentals results are also presented to show the accuracy of the motion paramenters estimated and the use of the phase correlation to estimate optical flow. Key Words : Motion, Optical Flow, Image Registration, Phase Correlation. 1 Introduction Estimating visual motion is a valuable information for many machine vision ap- plications, like traffic monitoring, surveillance, image coding, etc. The work pre- sented here has been aimed at developing some techniques for accurate enough visual motion estimation and optical flow. Optical flow estimation methods mea- sure the velocity and displacement vectors perceived from the time-varying im- age intensity pattern, that is, they measure the apparent 2D image plane motion which is the projected 3D motion of the objects of the scene in the image plane. Most of existing techniques for optical flow estimation can be classified into Gradient-based techniques, that is, methods based on the so-called optical flow equation [7][12]; Block matching-based techniques try to overcome the aperture problem by assuming that all pixels in a block undergo the same motion [6][11]; Spatiotemporal Energy-based techniques which exploits the equivalent optical flow equation in the frequency domain [9]; Bayesian techniques utilize proba- bilistic smoothness constraints, usually in the form of a Gibbs ramdom field [10][81.
    [Show full text]