The Variable Bandwidth Mean Shift and Data-Driven Scale Selection
Dorin Comaniciu Visvanathan Ramesh Peter Meer
Imaging & Visualization Department Electrical & Computer Engineering Department
Siemens Corp orate Research Rutgers University
755 College Road East, Princeton, NJ 08540 94 Brett Road, Piscataway, NJ 08855
Abstract where the d-dimensional vectors fx g representa
i
i=1:::n
random sample from some unknown density f and the
Wepresenttwo solutions for the scale selection prob-
kernel, K , is taken to be a radially symmetric, non-
lem in computer vision. The rst one is completely non-
negative function centered at zero and integrating to
parametric and is based on the the adaptive estimation
one. The terminology xedbandwidth is due to the fact
of the normalized density gradient. Employing the sam-
d
that h is held constant across x 2 R . As a result, the
ple p oint estimator, we de ne the Variable Bandwidth
xed bandwidth pro cedure (1) estimates the densityat
Mean Shift, prove its convergence, and show its sup eri-
eachpoint x by taking the average of identically scaled
orityover the xed bandwidth pro cedure. The second
kernels centered at each of the data p oints.
technique has a semiparametric nature and imp oses a
For p ointwise estimation, the classical measure of the
lo cal structure on the data to extract reliable scale in-
^
closeness of the estimator f to its target value f is the
formation. The lo cal scale of the underlying densityis
mean squared error (MSE), equal to the sum of the
taken as the bandwidth which maximizes the magni-
variance and squared bias
tude of the normalized mean shift vector. Both estima-
i h
2
tors provide practical to ols for autonomous image and
^
f (x ) f (x) MSE(x ) = E
quasi real-time video analysis and several examples are
i h
2
shown to illustrate their e ectiveness.
^ ^
f (x ) : (2) f (x ) + Bias = Var
1 Motivation for Variable Bandwidth
Using the multivariate form of the Taylor theorem, the
The ecacy of Mean Shift analysis has b een demon-
bias and the variance are approximated by [20, p.97]
strated in computer vision problems such as tracking
1
2
and segmentation in [5, 6]. However, one of the limi-
h (K )f (x ) (3) Bias(x)
2
2
tations of the mean shift pro cedure as de ned in these
and