Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B

Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B. Zerubia, Senior Member, IEEE, and Marc Berthod Abstract—In this paper, we have derived analytic expressions proach is to relate the temporal derivatives of the sequence to the for the phase correlation of downsampled images. We have shown spatial derivatives of the reference frame using the so called gra- that for downsampled images the signal power in the phase corre- dient constraint equation under intensity conservation assump- lation is not concentrated in a single peak, but rather in several coherent peaks mostly adjacent to each other. These coherent peaks tion [10], [11], [17]. Due to the intensity conservation assump- correspond to the polyphase transform of a filtered unit impulse tion, these methods often require that the inter-frame displace- centered at the point of registration. The analytic results provide ments to be relatively small compared to the intensity gradi- a closed-form solution to subpixel translation estimation, and are ents of the reference frame. Also, to reduce the sensitivity of used for detailed error analysis. Excellent results have been ob- tained for subpixel translation estimation of images of different na- the derivative operators to the noise process, in general, some ture and across different spectral bands. regularization is required and the gradient constraint is com- bined within some local neighborhood of each pixel. However, Index Terms—Image alignment, phase correlation, subpixel registration. the main pitfall of this approach is probably the so called aperture problem [10], where for some patterns such as a gradual curve, the available information in a local neighborhood of a I. INTRODUCTION pixel (small aperture) is not sufficient to disambiguate the true NALYSIS and fusion of data in an image sequence often registration parameters. A require registration of the images. A wide variety of A second possible approach without interpolation is to for- methods and techniques [4] can be found in the literature for mulate subpixel registration as an optimization problem. Ex- solving this fundamental and challenging problem. Although amples of this approach include [14], [27], [29]. In this ap- in many applications pixel-level registration may be adequate, proach the problem is formulated as a cost function to be min- some important problems in remote sensing [13], [22], [25], imized with respect to the registration parameters of the inter- [26] and biomedical imaging [5], [9] have introduced the frame transformation model. As in the gradient based methods requirement for subpixel registration. these methods are highly dependent on image intensity con- In this paper, we are interested in the refinement of coarsely servation assumption and would fail when applied across mul- registered images to subpixel accuracy, where by coarse regis- tiple spectral bands or when considerable amount of luminance tration we imply pixel-level registration. We will assume that variations are present between frames. This shortcoming can the error in registration at this refinement level is a translation probably be alleviated by an adequate normalization of var- within the bounds of the noise and the error of the imaging ious spectral bands. Also note that these methods as well as the system. Therefore we confine our attention only to the problem gradient-based methods implicitly include interpolation by ap- of subpixel translation estimation. We are also interested in sub- plying local smoothing and regularizing operators. pixel registration across different spectral bands. There are also other techniques that are based on the local The most commonly used approach to subpixel registration normalized correlation [1], polynomial regression [32], the dis- is based on interpolation. Examples include correlation inter- crete cosine transform [15], and the use of control points using polation [6], [28], intensity interpolation [28], phase correlation a model-based approach [7]. interpolation [21], [28] and the geometric methods [2], [8]. It is This work is particularly motivated by some important fea- obvious that the accuracy of these methods depends highly on tures of the phase correlation method which make its use at- the quality of the interpolation algorithms. tractive for multispectral registration, and hence analytic results One popular approach for subpixel registration without in- have been derived to demonstrate how the method can be ex- terpolation is based on the differential properties of image se- tended to subpixel accuracy. A condensed version of part of this quences [11]–[13], [17], [28], [30]. The main idea in this ap- work was presented in [23] and [24]. This paper is organized as follows. In the next section, we Manuscript received June 8, 2000; revised November 13, 2001. The associate describe the phase correlation method and outline its impor- editor coordinating the review of this manuscript and approving it for publica- tant properties. In Section III, we present our extension of the tion was Dr. Eric L. Miller. H. Foroosh (Shekarforoush) is with the Department of Electrical Engineering method to subpixel registration. Section IV provides thorough and Computer Science, University of California, Berkeley, CA 94720 USA analysis of various sources of error. Experimental results are (e-mail: [email protected]). then given in Section V for different modalities of images in- J. Zerubia and M. Berthod are with INRIA, 06902 Sophia Antipolis Cedex, France (e-mail: [email protected]; [email protected]). cluding satellite images across different spectral bands. Finally Publisher Item Identifier S 1057-7149(02)00803-5. in Section VI some concluding remarks are provided. 1057–7149/02$17.00 © 2002 IEEE FOROOSH et al.: EXTENSION OF PHASE CORRELATION TO SUBPIXEL REGISTRATION 189 (a) (b) (c) (d) Fig. 1. (a) and (b) Aerial images of Paris with displacements along both axes, (c) standard cross-correlation, and (d) phase correlation. II. PHASE CORRELATION METHOD rather inaccurate since it requires fitting a plane to noisy phase difference data. The idea behind this method [16], [21] is quite simple and The second possible approach which is more practical and is based on the Fourier shift property [20], which states that a also more robust to noise is to first inverse Fourier transform the shift in the coordinate frames of two functions is transformed normalized cross power spectrum. It is then a simple matter to in the Fourier domain as linear phase differences. This can be determine ( described as follows: ), since from (3) the result is which is a Dirac delta function centered at ( ). Let and be two functions that are absolutely integrable over . Let also In practice, when dealing with images, and are speci- fied only in finite size discretized arrays. However, replacing the (1) Fourier transform by the discrete Fourier transform (DFT) and the Dirac delta function by a unit impulse, and also assuming a According to the Fourier shift property periodic extension of the images outside their compact support, these results will still hold and work remarkably well [16]. (2) A. Properties of the Method Hence the normalized cross power spectrum is given by The most remarkable property of the phase correlation method compared to the classical cross correlation method is (3) the accuracy by which the peak of the correlation function can be detected. Fig. 1 shows an example of two displaced aerial where indicates the complex conjugate. images. The phase correlation method provides a distinct sharp The normalized cross power spectrum may also be viewed as peak at the point of registration whereas the standard cross the cross power spectrum of whitened signals. There are two correlation yields several broad peaks and a main peak whose possible ways of solving (3) for ( ). One way is to di- maximum is not always exactly centered at the right point. rectly work in the Fourier domain. For this purpose, consider a A second important property is due to whitening of the signals three-dimensional (3-D) Euclidean space whose canonical ref- by normalization, which makes the phase correlation notably ro- erence frame is given by the two frequency axes and the phase bust to those types of noise that are correlated to the image func- difference between the two images. In this space tion, e.g., uniform variations of illumination, offsets in average defines a plane through the origin perpendicular to the vector intensity, and fixed gain errors due to calibration. This property ( ) whose slopes along the two frequency axes specify the also makes phase correlation suitable for registration across dif- shifts along the two spatial axes [i.e., ( )]. This approach is ferent spectral bands. 190 IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 11, NO. 3, MARCH 2002 Using the convolution theorem, it can be shown that the and method can also handle blurred images, provided that the blurring kernel is relatively invariant from one frame to another. One may for instance use this property to register images con- taminated with wide-band additive noise, by taking the phase (5) correlation in the low-frequency portion of the spectrum [12]. In the discrete case, however, (3) is valid only if the shift Therefore, the cross-power spectrum of the downsampled im- vector ( ) is of integer values. Therefore, when applied to ages is given by (6), shown at the bottom of the page, where discrete images, the method would fail to detect noninteger subpixel shifts. To our best knowledge the only practical approach proposed in the literature for adapting the method to subpixel (7) estimation is the use of some interpolation methods [21], [28]. Also, as is known [16] the phase correlation always contains a single coherent peak at the point of registration corresponding to signal power, and some incoherent peaks which can be assumed It follows from these results that, the normalized cross power to be distributed normally over a mean value of zero.

Extension of Phase Correlation to Subpixel Registration Hassan Foroosh (Shekarforoush), Josiane B

WFM7200 Multiformat Multistandard Waveform Monitor

Camera Basics, Principles and Techniques –MCD 401 VU Topic No

Computational Complexity Optimization on H.264 Scalable/Multiview Video Coding

3. Visual Attention

The Art of Sound Reproduction the Art of Sound Reproduction

State-Of-The Art Motion Estimation in the Context of 3D TV

A PHASE BASED DENSE STEREO ALGORITHM IMPLEMENTED in CUDA a Thesis by BRENT DAVID MACOMBER Submitted to the Office of Graduate St

Discovery Communucations Global Technical Specifications Version

Multichannel HD/SD Waveform Monitors

Analog Audio Recording

WFM8200 and WFM8300 Waveform Monitors User Manual

Estimating Translation/Deformation Motion Through Phase Correlation *