Particle Filter-Based Fingertip Tracking with Circular Hough

Total Page:16

File Type:pdf, Size:1020Kb

Particle Filter-Based Fingertip Tracking with Circular Hough Particle Filter-Based Fingertip Tracking with Circular Hough Transform Features Martin Do, Tamim Asfour, and R¨udigerDillmann Karlsruhe Institute of Technology (KIT), Adenauerring 2, 76131 Karlsruhe, Germany. martin.do,as f our,dillmann @kit.edu { } Abstract multi-scale color features [7] introduces a hierarchical representation of the hand consisting of blobs of dif- In this work, we present a fingertip tracking frame- ferent sizes with each blob representing a part of the work which allows observation of finger movements hand. The blob features are matched with a number in task space. By applying a multi-scale edge extrac- hierarchical 2D models each incorporating a specific tion technique, an edge map is generated in which low finger pose. Therefore, tracking is accomplished under contrast edges are preserved while noise is suppressed. the assumption that the local finger poses regarding Based on circular image features, determined from the the hand remains fixed. In order to implement a con- map using Hough transform, the fingertips are accu- tinuous fingertip tracking method, we would like to rately tracked by combining a particle filter and a sub- rely on prominent features which can be extracted at sequent mean-shift procedure. To increase the robust- any time of an image sequence. In [8], for detecting ness of the proposed method, dynamical motion mod- a guitarist’s fingertips, circular features are proposed els are trained for the prediction of the finger displace- which are localized by performing a circular Hough ments. Experiments were conducted on various image transform. For the same application, [9] defined semi- sequences from which statements on the performance circular templates which are used to find the fingers’ of the framework can be derived. positions. In our work, we adopt the concept of circular fea- 1 Introduction tures to tackle the more complex problem of track- Towards an intuitive and natural interface to ma- ing fingertips of a freely moving hand, where overlap chines, markerless human observation has become a of finger and palm occur frequently leading to diffi- major research focus during the past years. Regard- culties regarding the robust extraction of these fea- ing coarse granular human tracking incorporating the tures. For tracking, we combined particle filtering with torso, arms, head, and legs, considerable progress has a mean-shift algorithm. In addition, a dynamical mo- been achieved whereas human hand tracking still re- tion model for predicting was trained to enhance the mains an unresolved issue, although the hand is con- robustness of the proposed framework. sidered to be one of the most crucial body parts re- The paper is organized as follows. Section 2 de- garding the interaction with other humans and the en- scribes the feature extraction consisting of an edge vironment. detection step and the Hough transform. Details on Regarding markerless tracking and detection of hu- the tracking procedure performed on the resulting map man body parts, most systems have limited sensor ca- are given in Section 3. Subsequently, first experiments pabilities which in common case are limited to a stereo with the framework are explained in Section 4. In Sec- camera setup. Full hand tracking approaches in joint tion 5, the work is summarized and notes to future angle space have been proposed in [1],[2],[3]. However, works are given. due to the highly complex structure of the hand whose motion involves 27 DoF, tracking can be only achieved 2 Feature Extraction at a low frame rate or on multiple views from different In order to generate the edge image, a skin color perspectives. segmentation is performed for extracting the hand and Using stereo vision, a reasonable solution lies in re- finger regions. Morphological operators are applied on ducing dimensionality of the problem by shifting from the segmented image to eliminate noise and to produce joint angle space into task space. In [4], a finger track- a uniform region. To detect the edges in this prepro- ing approach based on Active contours is presented for cessed image, image gradients are calculated on various air-writing. The target to be tracked consists of a con- scales. tour which is laid around the pointing finger. As a result, since no reliable statement can be made on the 2.1 Multi-Scale Edge Extraction actual fingertip position, one has to assume that finger Considering the problem of fingertip tracking, due to pose is not changing. small intensity variances between different parts of the Hence, based on curvature properties, in [5] finger- hand, e.g. the fingernail and the skin, respectively, the tips are detected within a contour which is extracted finger regions and the palm, it is desired to detect edges from skin blob tracking. A more elaborate approach where contrast can vary over a broad range. Depending is presented in [6] where particles are propagated from on the parameters, applying standard algorithms, such the center of the hand to positions close to the con- as the Canny edge detectors on a wider scale, leads to tour. Intersection of the contour with line segments at an edge image where numerous, false edges occur. To particles and examination of the transitions between preserve low contrast edges in certain areas while re- non-skin and skin-area indicate whether a particle rep- ducing noise close to high-contrast edges, based on the resents a fingertip. However, this method is specifically work of [10], we implemented a filter approach consist- designed to detect tips of stretched fingers. Based on ing of a steerable Gaussian derivative filter on multiple R = m 1.1 r,...,m 1.1 r with m N whereby a range{− of pixels· · along· the· edge} tangent∈ is considered during the voting process. In order to increase the robustness of the tracking algorithm, a density distri- bution is formed in Hough space by convolving IH with r a Gaussian kernel G(u,v; 2 ). Since the hand motion occurs in 3D Euclidean space, a fixation of r is only valid if movement of the fingertip Figure 1. Left: Original input image. Cen- in direction of the z-axis of a camera is excluded during ter/Left: Color segmented image. Center/Right: the tracking. Adaptation of r in each frame, allows to Edge image using Canny detector. Right: Edge track fingers in all directions. Based on the generated image using the method proposed in Section 2.1. density distribution in frame t a radius estimate rˆt is determined by applying an Expectation Maximization algorithm. Further details are given in Section 3.3. scales. The basis filter for x is defined as follows: 3 Tracking Fingertips (x2+y2) − x x 2σ2 3.1 Prediction G (x,y;σ ) = e k . (1) k k πσ− 4 2 k Providing a prediction on the movement of the ob- jects to be tracked increases the robustness of a statis- y σ Gk(x,y; k) is defined analogously. To determine the tical tracking framework. We train dynamical motion scale at which a gradient can be reliably estimated, models in the form of a second-order auto-regresssive x σ the magnitude of the filter response rk(x,y; k) and (AR) process as proposed in [4], which is described as y σ rk(x,y; k), obtained by convolution of the image I with follows: the filters from Eq. 1, is checked against a noise thresh- ω old. While the magnitude can be calculated according qt q¯ = A1(qt 1 q¯) + A2(qt 2 q¯) + b0 k (5) − − − − − to: D where qt R denotes the current configuration, q¯ ∈ ω m σ x σ 2 y σ 2 the mean configuration, and k [0,1]. To learn the rk (x,y; k) = r (x,y; k) + r (x,y; k) , (2) D D ∈ D k k AR parameters A1, A2 R × and b0 R , training q data is provided in the∈ form of a configuration∈ se- the threshold is set by following function: quence Q = q0′ ,...,qM′ whereas the sequence is gener- ated by manual{ labeling} of fingertips in each frame of 2ln(1 (1 α)R) c = − − − s . (3) a recorded image sequence. k σ 2√ π l p 2 k 2 Two AR models are trained to provide predictions for the local fingertip movement concerning a static α with sl representing the standard deviation and the hand pose as well as the movement of the hand itself. significance level for an image with R pixels which de- Based on the assumption that the motion of each finger fines an upper boundary for allowed misclassification is influenced by the motion of the neighbored fingers, of image pixels. To take into account local intensity the first model is trained with training data whose in- and contrast conditions, we focus on local signal noise stances qi′ Q with D = N consists of the length of the in a specific region rather than on global sensor noise. j, j∈+1 j+1 mod N j vector v = q q between the fingertip j Therefore, Eq. 3 depends on the local standard devia- t t − t σ max σ max and j + 1 mod N: tion sl calculated within a 2 k 2 k -neighborhood max × where σ denotes the largest scale being examined. j, j+1 k q′( j) = v j = 1,...,N. (6) Hence, we calculate each gradient at the minimum re- i k t k σ min liable scale k where the likelihood of error due lo- For finger j, this leads to following displacement vector: cal signal noise falls below a standard tolerance. This guarantees that a more accurate gradient map is esti- 1 j mated which is less sensitive to signal noise and errors v¯j = AN(i,i + 1)vi,i+1 + AN(i,i + 1)vi,i+1 +b j ω .
Recommended publications
  • Hough Transform, Descriptors Tammy Riklin Raviv Electrical and Computer Engineering Ben-Gurion University of the Negev Hough Transform
    DIGITAL IMAGE PROCESSING Lecture 7 Hough transform, descriptors Tammy Riklin Raviv Electrical and Computer Engineering Ben-Gurion University of the Negev Hough transform y m x b y m 3 5 3 3 2 2 3 7 11 10 4 3 2 3 1 4 5 2 2 1 0 1 3 3 x b Slide from S. Savarese Hough transform Issues: • Parameter space [m,b] is unbounded. • Vertical lines have infinite gradient. Use a polar representation for the parameter space Hough space r y r q x q x cosq + ysinq = r Slide from S. Savarese Hough Transform Each point votes for a complete family of potential lines: Each pencil of lines sweeps out a sinusoid in Their intersection provides the desired line equation. Hough transform - experiments r q Image features ρ,ϴ model parameter histogram Slide from S. Savarese Hough transform - experiments Noisy data Image features ρ,ϴ model parameter histogram Need to adjust grid size or smooth Slide from S. Savarese Hough transform - experiments Image features ρ,ϴ model parameter histogram Issue: spurious peaks due to uniform noise Slide from S. Savarese Hough Transform Algorithm 1. Image à Canny 2. Canny à Hough votes 3. Hough votes à Edges Find peaks and post-process Hough transform example http://ostatic.com/files/images/ss_hough.jpg Incorporating image gradients • Recall: when we detect an edge point, we also know its gradient direction • But this means that the line is uniquely determined! • Modified Hough transform: for each edge point (x,y) θ = gradient orientation at (x,y) ρ = x cos θ + y sin θ H(θ, ρ) = H(θ, ρ) + 1 end Finding lines using Hough transform
    [Show full text]
  • Scale Invariant Feature Transform (SIFT) Why Do We Care About Matching Features?
    Scale Invariant Feature Transform (SIFT) Why do we care about matching features? • Camera calibration • Stereo • Tracking/SFM • Image moiaicing • Object/activity Recognition • … Objection representation and recognition • Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging parameters • Automatic Mosaicing • http://www.cs.ubc.ca/~mbrown/autostitch/autostitch.html We want invariance!!! • To illumination • To scale • To rotation • To affine • To perspective projection Types of invariance • Illumination Types of invariance • Illumination • Scale Types of invariance • Illumination • Scale • Rotation Types of invariance • Illumination • Scale • Rotation • Affine (view point change) Types of invariance • Illumination • Scale • Rotation • Affine • Full Perspective How to achieve illumination invariance • The easy way (normalized) • Difference based metrics (random tree, Haar, and sift, gradient) How to achieve scale invariance • Pyramids • Scale Space (DOG method) Pyramids – Divide width and height by 2 – Take average of 4 pixels for each pixel (or Gaussian blur with different ) – Repeat until image is tiny – Run filter over each size image and hope its robust How to achieve scale invariance • Scale Space: Difference of Gaussian (DOG) – Take DOG features from differences of these images‐producing the gradient image at different scales. – If the feature is repeatedly present in between Difference of Gaussians, it is Scale Invariant and should be kept. Differences Of Gaussians
    [Show full text]
  • Exploiting Information Theory for Filtering the Kadir Scale-Saliency Detector
    Introduction Method Experiments Conclusions Exploiting Information Theory for Filtering the Kadir Scale-Saliency Detector P. Suau and F. Escolano {pablo,sco}@dccia.ua.es Robot Vision Group University of Alicante, Spain June 7th, 2007 P. Suau and F. Escolano Bayesian filter for the Kadir scale-saliency detector 1 / 21 IBPRIA 2007 Introduction Method Experiments Conclusions Outline 1 Introduction 2 Method Entropy analysis through scale space Bayesian filtering Chernoff Information and threshold estimation Bayesian scale-saliency filtering algorithm Bayesian scale-saliency filtering algorithm 3 Experiments Visual Geometry Group database 4 Conclusions P. Suau and F. Escolano Bayesian filter for the Kadir scale-saliency detector 2 / 21 IBPRIA 2007 Introduction Method Experiments Conclusions Outline 1 Introduction 2 Method Entropy analysis through scale space Bayesian filtering Chernoff Information and threshold estimation Bayesian scale-saliency filtering algorithm Bayesian scale-saliency filtering algorithm 3 Experiments Visual Geometry Group database 4 Conclusions P. Suau and F. Escolano Bayesian filter for the Kadir scale-saliency detector 3 / 21 IBPRIA 2007 Introduction Method Experiments Conclusions Local feature detectors Feature extraction is a basic step in many computer vision tasks Kadir and Brady scale-saliency Salient features over a narrow range of scales Computational bottleneck (all pixels, all scales) Applied to robot global localization → we need real time feature extraction P. Suau and F. Escolano Bayesian filter for the Kadir scale-saliency detector 4 / 21 IBPRIA 2007 Introduction Method Experiments Conclusions Salient features X HD(s, x) = − Pd,s,x log2Pd,s,x d∈D Kadir and Brady algorithm (2001): most salient features between scales smin and smax P.
    [Show full text]
  • Hough Transform 1 Hough Transform
    Hough transform 1 Hough transform The Hough transform ( /ˈhʌf/) is a feature extraction technique used in image analysis, computer vision, and digital image processing.[1] The purpose of the technique is to find imperfect instances of objects within a certain class of shapes by a voting procedure. This voting procedure is carried out in a parameter space, from which object candidates are obtained as local maxima in a so-called accumulator space that is explicitly constructed by the algorithm for computing the Hough transform. The classical Hough transform was concerned with the identification of lines in the image, but later the Hough transform has been extended to identifying positions of arbitrary shapes, most commonly circles or ellipses. The Hough transform as it is universally used today was invented by Richard Duda and Peter Hart in 1972, who called it a "generalized Hough transform"[2] after the related 1962 patent of Paul Hough.[3] The transform was popularized in the computer vision community by Dana H. Ballard through a 1981 journal article titled "Generalizing the Hough transform to detect arbitrary shapes". Theory In automated analysis of digital images, a subproblem often arises of detecting simple shapes, such as straight lines, circles or ellipses. In many cases an edge detector can be used as a pre-processing stage to obtain image points or image pixels that are on the desired curve in the image space. Due to imperfections in either the image data or the edge detector, however, there may be missing points or pixels on the desired curves as well as spatial deviations between the ideal line/circle/ellipse and the noisy edge points as they are obtained from the edge detector.
    [Show full text]
  • Histogram of Directions by the Structure Tensor
    Histogram of Directions by the Structure Tensor Josef Bigun Stefan M. Karlsson Halmstad University Halmstad University IDE SE-30118 IDE SE-30118 Halmstad, Sweden Halmstad, Sweden [email protected] [email protected] ABSTRACT entity). Also, by using the approach of trying to reduce di- Many low-level features, as well as varying methods of ex- rectionality measures to the structure tensor, insights are to traction and interpretation rely on directionality analysis be gained. This is especially true for the study of the his- (for example the Hough transform, Gabor filters, SIFT de- togram of oriented gradient (HOGs) features (the descriptor scriptors and the structure tensor). The theory of the gra- of the SIFT algorithm[12]). We will present both how these dient based structure tensor (a.k.a. the second moment ma- are very similar to the structure tensor, but also detail how trix) is a very well suited theoretical platform in which to they differ, and in the process present a different algorithm analyze and explain the similarities and connections (indeed for computing them without binning. In this paper, we will often equivalence) of supposedly different methods and fea- limit ourselves to the study of 3 kinds of definitions of di- tures that deal with image directionality. Of special inter- rectionality, and their associated features: 1) the structure est to this study is the SIFT descriptors (histogram of ori- tensor, 2) HOGs , and 3) Gabor filters. The results of relat- ented gradients, HOGs). Our analysis of interrelationships ing the Gabor filters to the tensor have been studied earlier of prominent directionality analysis tools offers the possibil- [3], [9], and so for brevity, more attention will be given to ity of computation of HOGs without binning, in an algo- the HOGs.
    [Show full text]
  • The Hough Transform As a Tool for Image Analysis
    THE HOUGH TRANSFORM AS A TOOL FOR IMAGE ANALYSIS Josep Llad´os Computer Vision Center - Dept. Inform`atica. Universitat Aut`onoma de Barcelona. Computer Vision Master March 2003 Abstract The Hough transform is a widespread technique in image analysis. Its main idea is to transform the image to a parameter space where clusters or particular configurations identify instances of a shape under detection. In this chapter we overview some meaningful Hough-based techniques for shape detection, either parametrized or generalized shapes. We also analyze some approaches based on the straight line Hough transform able to detect particular structural properties in images. Some of the ideas of these approaches will be used in the following chapter to solve one of the goals of the present work. 1 Introduction The Hough transform was first introduced by Paul Hough in 1962 [4] with the aim of detecting alignments in T.V. lines. It became later the basis of a great number of image analysis applications. The Hough transform is mainly used to detect parametric shapes in images. It was first used to detect straight lines and later extended to other parametric models such as circumferences or ellipses, being finally generalized to any parametric shape [1]. The key idea of the Hough transform is that spatially extended patterns are transformed into a parameter space where they can be represented in a spatially compact way. Thus, a difficult global detection problem in the image space is reduced to an easier problem of peak detection in a parameter space. Ü; Ý µ A set of collinear image points ´ can be represented by the equation: ÑÜ Ò =¼ Ý (1) Ò where Ñ and are two parameters, the slope and intercept, which characterize the line.
    [Show full text]
  • Line Detection by Hough Transformation
    Line Detection by Hough transformation 09gr820 April 20, 2009 1 Introduction When images are to be used in different areas of image analysis such as object recognition, it is important to reduce the amount of data in the image while preserving the important, characteristic, structural information. Edge detection makes it possible to reduce the amount of data in an image considerably. However the output from an edge detector is still a image described by it’s pixels. If lines, ellipses and so forth could be defined by their characteristic equations, the amount of data would be reduced even more. The Hough transform was originally developed to recognize lines [5], and has later been generalized to cover arbitrary shapes [3] [1]. This worksheet explains how the Hough transform is able to detect (imperfect) straight lines. 2 The Hough Space 2.1 Representation of Lines in the Hough Space Lines can be represented uniquely by two parameters. Often the form in Equation 1 is used with parameters a and b. y = a · x + b (1) This form is, however, not able to represent vertical lines. Therefore, the Hough transform uses the form in Equation 2, which can be rewritten to Equation 3 to be similar to Equation 1. The parameters θ and r is the angle of the line and the distance from the line to the origin respectively. r = x · cos θ + y · sin θ ⇔ (2) cos θ r y = − · x + (3) sin θ sin θ All lines can be represented in this form when θ ∈ [0, 180[ and r ∈ R (or θ ∈ [0, 360[ and r ≥ 0).
    [Show full text]
  • Low-Level Vision Tutorial 1
    Low-level vision tutorial 1 complexity and sophistication – so much so that Feature detection and sensitivity the problems of low-level vision are often felt to Image filters and morphology be unimportant and are forgotten. Yet it remains Robustness of object location the case that information that is lost at low level Validity and accuracy in shape analysis Low-level vision – a tutorial is never regained, while distortions that are Scale and affine invariance. introduced at low level can cause undue trouble at higher levels [1]. Furthermore, image Professor Roy Davies In what follows, references are given for the acquisition is equally important. Thus, simple topics falling under each of these categories, and measures to arrange suitable lighting can help to for other topics that could not be covered in make the input images easier and less ambiguous Overview and further reading depth in the tutorial. to interpret, and can result in much greater reliability and accuracy in applications such as 3. Feature detection and sensitivity automated inspection [1]. Nevertheless, in applications such as surveillance, algorithms General [1]. Abstract should be made as robust as possible so that the Edge detection [2, 3, 4]. vagaries of ambient illumination are rendered Line segment detection [5, 6, 7]. This tutorial aims to help those with some relatively unimportant. Corner and interest point detection [1, 8, 9]. experience of vision to obtain a more in-depth General feature mask design [10, 11]. understanding of the problems of low-level 2. Low-level vision The value of thresholding [1, 12, 13]. vision. As it is not possible to cover everything in the space of 90 minutes, a carefully chosen This tutorial is concerned with low-level vision, 4.
    [Show full text]
  • Distinctive Image Features from Scale-Invariant Keypoints
    Distinctive Image Features from Scale-Invariant Keypoints David G. Lowe Computer Science Department University of British Columbia Vancouver, B.C., Canada [email protected] January 5, 2004 Abstract This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene. The features are invariant to image scale and rotation, and are shown to provide robust matching across a a substantial range of affine dis- tortion, change in 3D viewpoint, addition of noise, and change in illumination. The features are highly distinctive, in the sense that a single feature can be cor- rectly matched with high probability against a large database of features from many images. This paper also describes an approach to using these features for object recognition. The recognition proceeds by matching individual fea- tures to a database of features from known objects using a fast nearest-neighbor algorithm, followed by a Hough transform to identify clusters belonging to a sin- gle object, and finally performing verification through least-squares solution for consistent pose parameters. This approach to recognition can robustly identify objects among clutter and occlusion while achieving near real-time performance. Accepted for publication in the International Journal of Computer Vision, 2004. 1 1 Introduction Image matching is a fundamental aspect of many problems in computer vision, including object or scene recognition, solving for 3D structure from multiple images, stereo correspon- dence, and motion tracking. This paper describes image features that have many properties that make them suitable for matching differing images of an object or scene.
    [Show full text]
  • Lecture #06: Edge Detection
    Lecture #06: Edge Detection Winston Wang, Antonio Tan-Torres, Hesam Hamledari Department of Computer Science Stanford University Stanford, CA 94305 {wwang13, tantonio}@cs.stanford.edu; [email protected] 1 Introduction This lecture covers edge detection, Hough transformations, and RANSAC. The detection of edges provides meaningful semantic information that facilitate the understanding of an image. This can help analyzing the shape of elements, extracting image features, and understanding changes in the properties of depicted scenes such as discontinuity in depth, type of material, and illumination, to name a few. We will explore the application of Sobel and Canny edge detection techniques. The next section introduces the Hough transform, used for the detection of parametric models in images;for example, the detection of linear lines, defined by two parameters, is made possible by the Hough transform. Furthermore, this technique can be generalized to detect other shapes (e.g., circles). However, as we will see, the use of Hough transform is not effective in fitting models with a high number of parameters. To address this model fitting problem, the random sampling consensus (RANSAC) is introduced in the last section; this non-deterministic approach repeatedly samples subsets of data, uses them to fit the model, and classifies the remaining data points as "inliers" or "outliers" depending on how well they can be explained by the fitted model (i.e., their proximity to the model). The result is used for a final selection of data points used in achieving the final model fit. A general comparison of RANSAC and Hough transform is also provided in the last section.
    [Show full text]
  • Model Fitting: the Hough Transform I Guido Gerig, CS6640 Image Processing, Utah
    Model Fitting: The Hough transform I Guido Gerig, CS6640 Image Processing, Utah Credit: Svetlana Lazebnik (Computer Vision UNC Chapel Hill, 2008) Fitting Parametric Models: Beyond Lines • Choose a parametric model to represent a set of features simple model: lines simple model: circles complicated model: car Source: K. Grauman Fitting • Choose a parametric model to represent a set of features • Membership criterion is not local • Can’t tell whether a point belongs to a given model just by looking at that point • Three main questions: • What model represents this set of features best? • Which of several model instances gets which feature? • How many model instances are there? • Computational complexity is important • It is infeasible to examine every possible set of parameters and every possible combination of features Fitting: Issues • Noise in the measured feature locations • Extraneous data: clutter (outliers), multiple lines • Missing data: occlusions Case study: Line detection IMAGE PROCESSING LANE DEPARTURE SYSTEM WITH EDGE DETECTION TECHNIQUE USING HOUGH TRANSFORM – A.RAJYA LAKSHMI & J. MOUNIKA (link) Voting schemes • Let each feature vote for all the models that are compatible with it • Hopefully the noise features will not vote consistently for any single model • Missing data doesn’t matter as long as there are enough features remaining to agree on a good model Hough transform • An early type of voting scheme • General outline: • Discretize parameter space into bins • For each feature point in the image, put a vote in every bin in the parameter space that could have generated this point • Find bins that have the most votes Image space Hough parameter space P.V.C.
    [Show full text]
  • Analysing Arbitrary Curves from the Line Hough Transform
    Journal of Imaging Article Analysing Arbitrary Curves from the Line Hough Transform Donald Bailey ∗ , Yuan Chang and Steven Le Moan Centre for Research in Image and Signal Processing, Massey University, Palmerston North 4442, New Zealand; [email protected] (Y.C.); [email protected] (S.L.M.) * Correspondence: [email protected] Received: 24 March 2020; Accepted: 17 April 2020; Published: 23 April 2020 Abstract: The Hough transform is commonly used for detecting linear features within an image. A line is mapped to a peak within parameter space corresponding to the parameters of the line. By analysing the shape of the peak, or peak locus, within parameter space, it is possible to also use the line Hough transform to detect or analyse arbitrary (non-parametric) curves. It is shown that there is a one-to-one relationship between the curve in image space, and the peak locus in parameter space, enabling the complete curve to be reconstructed from its peak locus. In this paper, we determine the patterns of the peak locus for closed curves (including circles and ellipses), linear segments, inflection points, and corners. It is demonstrated that the curve shape can be simplified by ignoring parts of the peak locus. One such simplification is to derive the convex hull of shapes directly from the representation within the Hough transform. It is also demonstrated that the parameters of elliptical blobs can be measured directly from the Hough transform. Keywords: Hough transform; curve detection; features; edgelets; convex hull; ellipse parameters 1. Introduction to the Hough Transform A common task in many image processing applications is the detection of objects or features (for example lines, circles, etc.).
    [Show full text]