Scale Invariance Without Scale Selection

Total Pages: 16

File Type: pdf, Size: 1020 KB

Scale Invariance without Scale Selection
Iasonas Kokkinos, Alan Yuille
Department of Statistics, UCLA
[email protected], [email protected]
(This work was supported by NSF Grant 0413214.)

Abstract

In this work we construct scale invariant descriptors (SIDs) without requiring the estimation of image scale; we thereby avoid scale selection, which is often unreliable. Our starting point is a combination of Log-Polar sampling and spatially-varying smoothing that converts image scalings and rotations into translations. Scale invariance can then be guaranteed by estimating the Fourier Transform Modulus (FTM) of the formed signal, as the FTM is translation invariant. We build our descriptors using phase, orientation and amplitude features that compactly capture the local image structure. Our results show that the constructed SIDs outperform state-of-the-art descriptors on standard datasets. A main advantage of SIDs is that they are applicable to a broader range of image structures, such as edges, for which scale selection is unreliable. We demonstrate this by combining SIDs with contour segments and show that the performance of a boundary-based model is systematically improved on an object detection task.

Figure 1. Our goal is to extract scale-invariant information around generic image structures where scale selection can be unreliable, e.g. edges. The local image structure that is used by most scale selection mechanisms is often not informative about the scale of the structure, which becomes apparent from the image context. (Panels: Local Structure; Context.)

1. Introduction

Local image descriptors evaluated at interest points have been very successful for many visual tasks related to object detection [21]. An important issue is how to deal with changes in image scale. Typically this is done in a two-stage process which first extracts a local estimate of the scale and then computes the descriptor based on an appropriately sized image patch. This strategy is limited in two respects. First, for most places in the image it is hard to obtain reliable scale estimates, with the exception of symmetric structures, such as blobs or ridges. However, we would not like to limit ourselves to the few structures for which scale estimation is reliable, as other structures, e.g. edges, can be useful for object detection. Second, even if scale estimation is reliable, it does not necessarily indicate the scale where the most useful appearance information resides, as shown in Fig. 1. Context is most informative and can only be incorporated by considering multiple scales.

We propose a method to compute scale invariant descriptors (SIDs) that does not require scale selection. For this we use a combination of log-polar sampling with spatially varying filtering that converts image scalings and rotations into translations. Scale invariance is achieved by taking the Fourier Transform Modulus (FTM) of the transformed signals, as the FTM is translation invariant. Our experiments show that SIDs outperform current descriptors when tested on standard datasets.

By freeing us from the need for scale selection, SIDs can be used in a broader setting, in conjunction with features such as edges and ridges [18]. Such features can be used to construct intuitive object representations as they are related to semantically meaningful structures, namely boundaries and symmetry axes. However, they have been limited since they do not come with scale estimates, so it has been hard to use them for scale-invariant detection. We address this by augmenting contour segments with SIDs, and use them in a flexible object model for scale-invariant object detection.
2. Previous Work

Image descriptors summarize image information around points of interest using low-dimensional feature vectors that are designed to be both distinctive and repeatable. Two seminal contributions have been SIFT descriptors [21] and Shape Contexts [26]; these have been followed by several extensions and refinements, such as Geometric Blur [3, 4], PCA-SIFT [12] and GLOH [23]. Please see [23] for an up-to-date review, comparisons and more extensive references; we will compare in detail our descriptor to related ones in the following sections.

Typically, a two-stage approach is used to deal with changes in image scale. First, a front-end system is used to estimate local image scale, e.g. by using a scale-adapted differential operator [19, 18]. Then, the estimated scale is used to adapt the descriptor. As argued in the introduction, this two-stage approach is often problematic and the only work known to us that addresses this is [6]. There the authors obtain stable measures of scale along edges by extracting descriptors at multiple scales and choosing the most stable one; however this requires an iterative, two-stage front-end procedure for each interest point.

Figure 2. Front-End Analysis. Left: We combine log-polar sampling with a spatially increasing filter scale to guarantee scale invariance (taken from [20]). Right: Features extracted from a band-pass filtered image using the Monogenic Signal. (Panels: Scale Space; Band-pass image; Amplitude, A; Phase, sin(φ); Orientation, θ; Space-Variant Processing.)

3. Front-End Analysis

3.1. Image Sampling with the Log-Polar Transform

In order to design scale-invariant descriptors we exploit the fact that the log-polar transform

\[ \hat{I}(r, u) = I\big(x_0 - \sigma_0^{\,r}\cos(u),\; y_0 - \sigma_0^{\,r}\sin(u)\big) \qquad (1) \]

converts rotations and scalings of the image I around (x_0, y_0) into translations of the transformed image \(\hat{I}\); r and u are log-polar coordinates and σ_0 is a scaling constant. This transform is extensively used in image registration (see e.g. [31] and references therein), while in [27] it is argued that this logarithmic sampling of the image is similar to the sampling pattern of the human visual front-end. We apply this sampling strategy to our problem by setting x_0, y_0 in (1) equal to the location of an interest point, and consider the construction of a scale-invariant descriptor around it.

A practical concern is that directly sampling the image around each point is impractical, because we would need too many samples to avoid aliasing. Therefore we remove high-frequency components by band-pass filtering the image before extracting features from it.

Further, as shown in Fig. 2, we use spatially varying filtering and sample the image (or its features) at a scale that is proportional to the distance from the center of the log-polar sampling grid. As we show in App. A, this guarantees that scaling the image only scales the features and does not distort them in any other way. This allows us to then use a sparse log-polar sampling of the image features and convert scalings/rotations into translations.

Comparing to other image descriptors that use a log-polar sampling strategy, in the GLOH descriptor of [23] the histogram of the image gradient at several orientations is computed by averaging the gradient within each compartment of a log-polar grid. Such descriptors can be redundant, since typically a single orientation is locally dominant. Further, a front-end detector is used to determine the descriptor's scale, which as mentioned can be problematic. The work of the authors in [3] is also closely related, as they increase the smoothing with the distance from the center. However their approach leads to distortions due to scale changes, while in our work we guarantee that apart from being translated, the signal does not get distorted.
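To make Eq. (1) and the FTM argument concrete, here is a minimal NumPy/SciPy sketch, not the authors' implementation: the grid sizes, the base scale sigma0 = 1.1, the smoothing constant 0.5, and the function names (log_polar_samples, sid_descriptor) are illustrative assumptions. It samples Gaussian-smoothed intensities on a log-polar grid whose smoothing scale grows with the sampling radius, and then takes the Fourier Transform Modulus of the sampled array.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, map_coordinates

def log_polar_samples(img, x0, y0, n_r=32, n_u=64, sigma0=1.1, r_min=2.0):
    """Sample img on a log-polar grid centred at (x0, y0), cf. Eq. (1):
    the radius grows as r_min * sigma0**r and the smoothing scale of each
    ring is proportional to that radius (space-variant filtering)."""
    img = np.asarray(img, dtype=float)
    samples = np.zeros((n_r, n_u))
    u = 2 * np.pi * np.arange(n_u) / n_u
    for r in range(n_r):
        radius = r_min * sigma0 ** r
        smooth = gaussian_filter(img, sigma=0.5 * radius)  # scale grows with radius
        xs = x0 - radius * np.cos(u)
        ys = y0 - radius * np.sin(u)
        samples[r] = map_coordinates(smooth, [ys, xs], order=1, mode='nearest')
    return samples  # an image scaling/rotation acts (approximately) as a translation here

def sid_descriptor(samples):
    """Translation invariance via the Fourier Transform Modulus (FTM):
    shifting `samples` along r or u only changes the phase of its 2-D DFT."""
    ftm = np.abs(np.fft.fft2(samples))
    return ftm / (ftm.sum() + 1e-12)  # simple normalisation (an assumption)
```

Under this construction, scaling the image by sigma0**k approximately shifts the sampled array by k rows, and rotating it cyclically shifts its columns, so the modulus of the 2-D DFT is left (approximately) unchanged.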
3.2. Feature Extraction

Having described our sampling strategy, we now describe the features that are being sampled. Specifically, we compute the image orientation, phase and amplitude, as shown in Fig. 2, which largely capture local image structure. The phase of an image provides symmetry-related information, indicating whether the image looks locally like an edge (φ = 0) or a peak/valley (φ = ±π/2). The amplitude A is a measure of feature strength (contrast), and its orientation θ indicates the dominant direction of image variation. To compute these we use the Monogenic signal of [7], as described below.

3.2.1 The Monogenic Signal

In order to estimate the amplitude and phase of a 1-D signal, a well-established method is based on the Analytic Signal, obtained via the Hilbert transform [11]. However, the extension of the Analytic Signal to 2D had only been partial until the introduction of the Monogenic Signal in [7]. Following [7], we obtain the local amplitude A, orientation θ and phase φ measurements of a 2D signal h by:

\[ A = \sqrt{h^2 + h_x^2 + h_y^2}, \qquad \theta = \tan^{-1}\frac{h_y}{h_x}, \qquad \phi = \tan^{-1}\frac{h}{\sqrt{h_y^2 + h_x^2}}, \]
\[ h_{\{x,y\}} = \mathcal{F}^{-1}\!\left(\frac{\sqrt{-1}\,\omega_{\{x,y\}}}{\sqrt{\omega_x^2 + \omega_y^2}}\, H\right), \]

where H = F(h) is the 2D Fourier transform of h and ω_x, ω_y are horizontal/vertical frequencies. A simple implementation can be found at [16].

Apart from being theoretically sound, the Monogenic Signal is also efficient; prior to the generalization of the Hilbert transform to 2D, earlier approaches would first preprocess the image with a set of orientation-selective filters [25, 10, 22, 24, 13] and then essentially treat the output of each filter as a 1-D signal. Instead, the Monogenic Signal only requires filtering with a single band-pass filter with no orientational preference.

(Block diagram: Input Image; Multi-Scale Analysis with a band-pass filter, producing Amplitude A, Phase sin(φ) and Orientation θ at small and large scales; Log-Polar Sampling of the features A_d cos(φ_d), A_d sin(φ_d), A_d cos(2θ*_d), A_d sin(2θ*_d).)

4. Scale Invariant Descriptor Construction

Having laid out the ideas underlying our front-end processing, we now describe our method for computing SIDs, depicted as a block diagram in Fig.
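To illustrate the monogenic-signal formulas of Sec. 3.2.1, the following sketch computes the Riesz components h_x, h_y by multiplying the spectrum with sqrt(-1)*omega/|omega| and then evaluates amplitude, orientation and phase. This is a toy version under stated assumptions, not the reference implementation of [16]; it uses arctan2 (a full-range variant of tan^-1) and assumes h has already been band-pass filtered.

```python
import numpy as np

def monogenic_features(h):
    """Local amplitude A, orientation theta and phase phi of a band-pass
    2-D signal h, via the Riesz transform evaluated in the Fourier domain."""
    h = np.asarray(h, dtype=float)
    H = np.fft.fft2(h)
    wy = np.fft.fftfreq(h.shape[0])[:, None]   # vertical frequencies
    wx = np.fft.fftfreq(h.shape[1])[None, :]   # horizontal frequencies
    norm = np.sqrt(wx ** 2 + wy ** 2)
    norm[0, 0] = 1.0                           # avoid dividing by zero at DC
    hx = np.real(np.fft.ifft2(1j * wx / norm * H))
    hy = np.real(np.fft.ifft2(1j * wy / norm * H))
    A = np.sqrt(h ** 2 + hx ** 2 + hy ** 2)            # feature strength (contrast)
    theta = np.arctan2(hy, hx)                         # dominant direction
    phi = np.arctan2(h, np.sqrt(hx ** 2 + hy ** 2))    # edge vs. peak/valley
    return A, theta, phi
```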
Recommended publications
  • Context-Aware Features and Robust Image Representations
Context-Aware Features and Robust Image Representations
P. Martins (a,*), P. Carvalho (a), C. Gatta (b); (a) Center for Informatics and Systems, University of Coimbra, Coimbra, Portugal; (b) Computer Vision Center, Autonomous University of Barcelona, Barcelona, Spain

Abstract: Local image features are often used to efficiently represent image content. The limited number of types of features that a local feature extractor responds to might be insufficient to provide a robust image representation. To overcome this limitation, we propose a context-aware feature extraction formulated under an information theoretic framework. The algorithm does not respond to a specific type of features; the idea is to retrieve complementary features which are relevant within the image context. We empirically validate the method by investigating the repeatability, the completeness, and the complementarity of context-aware features on standard benchmarks. In a comparison with strictly local features, we show that our context-aware features produce more robust image representations. Furthermore, we study the complementarity between strictly local features and context-aware ones to produce an even more robust representation.

Keywords: Local features, Keypoint extraction, Image content descriptors, Image representation, Visual saliency, Information theory.

1. Introduction: Local feature detection (or extraction, if we want to use a more semantically correct term [1]) is a central and extremely active research topic in the fields of computer vision and image analysis. While it is widely accepted that a good local feature extractor should retrieve distinctive, accurate, and repeatable features against a wide variety of photometric and geometric transformations, it is equally valid to claim that these requirements ...
  • Pavement Crack Detection by Ridge Detection on Fractional Calculus and Dual-Thresholds
International Journal of Multimedia and Ubiquitous Engineering, Vol. 10, No. 4 (2015), pp. 19-30, http://dx.doi.org/10.14257/ijmue.2015.10.4.03

Pavement Crack Detection by Ridge Detection on Fractional Calculus and Dual-thresholds
Song Hongxun, Wang Weixing, Wang Fengping, Wu Linchun and Wang Zhiwei (Shaanxi Road Traffic Intelligent Detection and Equipment Engineering Technology Research Center, Chang'an University, Xi'an, China; School of Information Engineering, Chang'an University, Xi'an, China); songhongx@163.com

Abstract: In this paper, a new road surface crack detection algorithm is proposed; it is based on ridge edge detection using fractional calculus and dual thresholds applied to a binary image. First, multi-scale reduction of the image data is used to shrink the original image and eliminate noise, which not only smooths the image but also enhances cracks. Then, the main cracks are extracted by ridge edge detection based on fractional calculus in a grey-scale image. Subsequently, the resulting binary image is further processed by applying both short and long line thresholds to eliminate short curves and noise and obtain rough crack segments. Finally, the gaps in cracks are connected with a curve connection function, which is an artificial intelligence routine. The experiments show that the algorithm has good noise immunity, accurate positioning, and high accuracy on pavement crack images. It can accurately locate and detect small and thin cracks that are difficult to identify with other traditional algorithms.

Keywords: Crack; Multiscale; Image enhancement; Ridge edge; Threshold

1. Introduction: Over time, any road can be damaged gradually due to traffic loads and other natural or artificial factors, and cracks are the most common road pavement surface defects, which need to be mended as early as possible.
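As a rough illustration of the dual-threshold step described in this abstract, the sketch below keeps long connected components as crack segments and discards short ones. Treating a component's pixel count as its "length", the specific thresholds, and the handling of intermediate components are assumptions; the paper's exact rule may differ.

```python
import numpy as np
from scipy.ndimage import label

def dual_threshold_segments(binary_crack, short_len=20, long_len=80):
    """Discard connected components shorter than `short_len` pixels, keep
    components longer than `long_len` as crack segments, and return the
    in-between ones separately as candidates for a later connection step."""
    labels, n = label(binary_crack)
    cracks = np.zeros_like(binary_crack, dtype=bool)
    candidates = np.zeros_like(binary_crack, dtype=bool)
    for i in range(1, n + 1):
        comp = labels == i
        size = comp.sum()
        if size < short_len:
            continue                 # noise and short curves are removed
        elif size >= long_len:
            cracks |= comp           # confident crack segment
        else:
            candidates |= comp       # kept for the curve-connection stage
    return cracks, candidates
```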
  • Feature Detection Florian Stimberg
Feature Detection, Florian Stimberg.
Outline: ● Introduction ● Types of Image Features (Edges, Corners, Ridges, Blobs) ● SIFT (scale-space extrema detection, keypoint localization, orientation assignment, keypoint descriptor) ● Resources.
Introduction: ● image features are distinct image parts ● feature detection is often the first operation to define image parts to process later ● finding corresponding features in pictures is necessary for object recognition ● multi-image panoramas need to be "stitched" together at corresponding image features ● the description of a feature is as important as the extraction.
Types of image features, Edges: ● sharp changes in brightness ● most algorithms use the first derivative of the intensity ● different methods (one or two thresholds etc.).
Types of image features, Corners: ● a point with two dominant and different edge directions in its neighbourhood ● Moravec detector: the similarity between the patch around a pixel and overlapping patches in the neighbourhood is measured ● low similarity in all directions indicates a corner.
Types of image features, Ridges: ● curves whose points are local maxima or minima in at least one dimension ● quality depends highly on the scale ● used to detect roads in aerial images or veins in 3D magnetic resonance images.
Types of image features, Blobs: ● points or regions brighter or darker than the surrounding ● SIFT uses a method for blob detection.
SIFT: ● SIFT = Scale-invariant feature transform ● first published in 1999 by David Lowe ● a method to extract and describe distinctive image features ● feature
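The Moravec corner test summarised above can be sketched as follows; the window size and the use of sum-of-squared-differences over the eight unit shifts are illustrative choices, and thresholding and non-maximum suppression are omitted.

```python
import numpy as np

def moravec_response(img, win=3):
    """Minimum patch self-dissimilarity over the 8 unit shifts;
    a high minimum indicates a corner-like neighbourhood."""
    img = np.asarray(img, dtype=float)
    h, w = img.shape
    resp = np.zeros((h, w))
    shifts = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]
    r = win // 2
    for y in range(r + 1, h - r - 1):
        for x in range(r + 1, w - r - 1):
            patch = img[y - r:y + r + 1, x - r:x + r + 1]
            ssd = [np.sum((patch - img[y - r + dy:y + r + 1 + dy,
                                       x - r + dx:x + r + 1 + dx]) ** 2)
                   for dy, dx in shifts]
            resp[y, x] = min(ssd)   # low in flat regions and along edges
    return resp
```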
  • Coding Images with Local Features
Int J Comput Vis (2011) 94:154–174, DOI 10.1007/s11263-010-0340-z

Coding Images with Local Features
Timo Dickscheid · Falko Schindler · Wolfgang Förstner
Received: 21 September 2009 / Accepted: 8 April 2010 / Published online: 27 April 2010. © Springer Science+Business Media, LLC 2010

Abstract: We develop a qualitative measure for the completeness and complementarity of sets of local features in terms of covering relevant image information. The idea is to interpret feature detection and description as image coding, and relate it to classical coding schemes like JPEG. Given an image, we derive a feature density from a set of local features, and measure its distance to an entropy density computed from the power spectrum of local image patches over scale. Our measure is meant to be complementary to existing ones: after task usefulness of a set of detectors has been determined regarding robustness and sparseness of the features, the scheme can be used for comparing their completeness and assessing effects of combining multiple detectors. The approach has several advantages over a simple comparison of image coverage: it favors response on structured image parts, penalizes features in purely homogeneous areas, and accounts for features appearing at the same lo- ... The results of our empirical investigations reflect the theoretical concepts of the detectors.

Keywords: Local features · Complementarity · Information theory · Coding · Keypoint detectors · Local entropy

1 Introduction: Local image features play a crucial role in many computer vision applications. The basic idea is to represent the image content by small, possibly overlapping, independent parts. By identifying such parts in different images of the same scene or object, reliable statements about image geometry and scene content are possible. Local feature extraction comprises two steps: (1) detection of stable local image patches at salient positions in the image, and (2) description of these patches.
  • Rotational Features Extraction for Ridge Detection
Rotational Features Extraction for Ridge Detection
Germán González, Member, IEEE, François Fleuret, Member, IEEE, François Aguet, Fethallah Benmansour, Michael Unser, Fellow, IEEE, and P. Fua, Senior Member, IEEE

Abstract: State-of-the-art approaches to detecting ridge-like structures in images rely on filters designed to respond to locally linear intensity features. While these approaches may be optimal for ridges whose appearance is close to being ideal, their performance degrades quickly in the presence of structured noise that corrupts the image signal, potentially to the point where it truly does not conform to the ideal model anymore. In this paper, we address this issue by introducing a learning framework that relies on rich, local, rotationally invariant image descriptors and demonstrate that we can outperform state-of-the-art ridge detectors in many different kinds of imagery. More specifically, our framework yields superior performance for the detection of blood vessels in retinal scans, dendrites in bright-field and confocal microscopy image-stacks, and streets in satellite imagery.

Index Terms: Rotational features. Ridge detection. Steerable filters. Data aggregation. Machine learning.

1 INTRODUCTION: Linear structures of significant width, such as roads in aerial images, blood vessels in retinal scans, or dendrites in 3D-microscopy image stacks, are pervasive. State-of-the-art approaches to detecting them rely on ideal models of their appearance and of the noise. They are usually optimized to find ideal ribbon or tubular structures. However, ridge-like linear structures such as those of Fig. 1 often do not conform to these models, which can drastically impact performance.
  • A Statistical Approach to Feature Detection and Scale Selection in Images
A Statistical Approach to Feature Detection and Scale Selection in Images
Dissertation submitted to obtain the doctoral degree in economic sciences of the Faculty of Economic Sciences, University of Göttingen, presented by Peter Majer from Frankfurt a.M. Göttingen, May 2000. First examiner: Professor Dr. W. Zucchini (Göttingen). Second examiner: Professor Dr. T. Lindeberg (Stockholm). Date of the oral examination: 7 July 2000.

Contents
1 Introduction (p. 1); 1.1 A Feature Detection Recipe (p. 2); 1.2 Some Questions about Scale (p. 5); 1.3 Contributions of the Thesis (p. 6); 1.4 Some Remarks Concerning the Formulation (p. 7)
2 Scale-Space (p. 8); 2.1 Linear Scale-Space (p. 8); 2.2 The Purpose of Scale-Space: Vision (p. 9); 2.3 Useful Properties of Scale-Space (p. 11); 2.3.1 Simplification (p. 12); 2.3.2 Translation and Rotation Invariance (p. 14); 2.3.3 Observational Blur and Scaling (p. 16); 2.3.4 Differentiability (p. 17); 2.4 Stochastic Simplification and Scale-Space (p. 19); 2.4.1 A Derivation of Scale-Space (p. 19); 2.4.2 Stochastic Simplification and Local Entropy (p. 21)
3 Feature Detection (p. 24); 3.1 Pattern matching (p. 24); 3.2 Feature Detection Operators (p. 25); 3.2.1 Design Criteria for Feature Detectors (p. 26); 3.2.2 Derivative of Gaussian Feature Detectors (p. 27); 3.2.3 Interpretations of Derivative of Gaussian Detectors (p. 30); 3.3 Differential Geometry of Scale-Space (p. 31); 3.3.1 Local Coordinates (p. 33); 3.4 Zero-Crossings (p. 35)
4 Scale Selection (p. 38); 4.1 The Need for Scale Selection (p. 38); 4.2 Invariance Requirements and Scale Selection (p. 39); 4.3 Normalized Derivatives (p. 40); 4.4 γ-Normalized Derivatives
  • Edge Detection and Ridge Detection with Automatic Scale Selection
Edge detection and ridge detection with automatic scale selection
Tony Lindeberg
Computational Vision and Active Perception Laboratory (CVAP), Department of Numerical Analysis and Computing Science, KTH (Royal Institute of Technology), S-100 44 Stockholm, Sweden. http://www.nada.kth.se/˜tony Email: [email protected]
Technical report ISRN KTH/NA/P–96/06–SE, May 1996, Revised August 1998. Int. J. of Computer Vision, vol 30, number 2, 1998. (In press). Shortened version in Proc. CVPR'96, San Francisco, June 1996.

Abstract: When computing descriptors of image data, the type of information that can be extracted may be strongly dependent on the scales at which the image operators are applied. This article presents a systematic methodology for addressing this problem. A mechanism is presented for automatic selection of scale levels when detecting one-dimensional image features, such as edges and ridges. A novel concept of a scale-space edge is introduced, defined as a connected set of points in scale-space at which: (i) the gradient magnitude assumes a local maximum in the gradient direction, and (ii) a normalized measure of the strength of the edge response is locally maximal over scales. An important consequence of this definition is that it allows the scale levels to vary along the edge. Two specific measures of edge strength are analysed in detail, the gradient magnitude and a differential expression derived from the third-order derivative in the gradient direction. For a certain way of normalizing these differential descriptors, by expressing them in terms of so-called γ-normalized derivatives, an immediate consequence of this definition is that the edge detector will adapt its scale levels to the local image structure.
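A hedged sketch of the scale-selection idea described in this abstract, not Lindeberg's implementation: for each pixel, a γ-normalised edge-strength measure t^(γ/2)·|∇L| is evaluated over a small set of scales and the scale at which it is maximal is selected. The scale grid and γ = 1/2 are illustrative assumptions, and the full scale-space-edge definition (including localisation along the gradient direction) is not reproduced here.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def select_edge_scales(img, scales=(1, 2, 4, 8, 16), gamma=0.5):
    """For each pixel, return the scale t maximising the gamma-normalised
    gradient magnitude  t**(gamma/2) * |grad L(.; t)|  together with that
    maximal edge strength."""
    img = np.asarray(img, dtype=float)
    strengths = []
    for t in scales:
        sigma = np.sqrt(t)
        Lx = gaussian_filter(img, sigma, order=(0, 1))   # d/dx of the smoothed image
        Ly = gaussian_filter(img, sigma, order=(1, 0))   # d/dy of the smoothed image
        strengths.append(t ** (gamma / 2) * np.hypot(Lx, Ly))
    strengths = np.stack(strengths)                      # shape (n_scales, H, W)
    best = strengths.argmax(axis=0)
    return np.asarray(scales)[best], strengths.max(axis=0)
```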
  • Salient Point and Scale Detection by Minimum Likelihood
JMLR: Workshop and Conference Proceedings 1: 59-72, Gaussian Processes in Practice

Salient Point and Scale Detection by Minimum Likelihood
Kim Steenstrup Pedersen [email protected], Marco Loog [email protected]; Department of Computer Science, University of Copenhagen, Universitetsparken 1, DK-2100 Copenhagen, Denmark. Pieter van Dorst; Department of Biomedical Engineering, Technische Universiteit Eindhoven, P.O. Box 513, 5600 MB Eindhoven, The Netherlands
Editors: Neil Lawrence, Anton Schwaighofer and Joaquin Quiñonero Candela

Abstract: We propose a novel approach for detection of salient image points and estimation of their intrinsic scales based on the fractional Brownian image model. Under this model images are realisations of a Gaussian random process on the plane. We define salient points as points that have a locally unique image structure. Such points are usually sparsely distributed in images and carry important information about the image content. Locality is defined in terms of the measurement scale of the filters used to describe the image structure. Here we use partial derivatives of the image function defined using linear scale space theory. We propose to detect salient points and their intrinsic scale by detecting points in scale-space that locally minimise the likelihood under the model.

Keywords: Image analysis, salient image point detection, scale selection, fractional Brownian images

1. Introduction: Following the widely recognised paradigm by Marr (1982), image feature detection is the basis which many vision and image analysis algorithms build upon. The definition of what constitutes an image feature is debatable, but there is a common agreement that it is either a point or curve at which the image intensities have a visual information carrying geometry.
  • The Harris Corner Detection Method Based on Three Scale Invariance Spaces
IJCSI International Journal of Computer Science Issues, Vol. 9, Issue 6, No 2, November 2012, ISSN (Online): 1694-0814, www.IJCSI.org, p. 18

The Harris Corner Detection Method Based on Three Scale Invariance Spaces
Yutian Wang, Yiqiang Chen, Jing Li, Biming Li (Institute of Electrical Engineering, Yanshan University, Qinhuangdao, Hebei Province, 066004, China)

Abstract: In order to solve the problem that the traditional Harris corner operator does not have the property of variable scales and is sensitive to noise, an improved three-scale Harris corner detection algorithm was proposed. First, three scale spaces with the characteristic of scale invariance were constructed using discrete Gaussian convolution. Then, the Harris scale invariant detector was used to extract corners in each scale image. Finally, supportable and unsupportable sets of points were classified according to whether the corresponding corners in every scale image support those of the original images. After the operations on those unsupportable sets of points, the noised corners and most of the unstable corners could be got rid of.

... given, ignoring the role of the differential scales in the establishing of the scale space image. Aiming at the problem that the traditional Harris detectors do not have the property of variable scales and are sensitive to noise, this paper proposed an improved three-scale Harris corner detection method. Three scale spaces were constructed through reasonably selecting the parameters, S and t, that influence the performance of the Harris scale invariant detector. Harris corners in each scale image were extracted by the Harris scale invariant detector, and the supportable and unsupportable sets of ...
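One possible reading of the multi-scale "support" idea in this excerpt is sketched below; the scales, the threshold, the Harris constant k, and the rule of requiring a strong response at every scale are assumptions rather than the authors' exact procedure.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def harris_response(img, sigma_d, sigma_i, k=0.04):
    """Standard Harris cornerness with differentiation scale sigma_d
    and integration scale sigma_i."""
    img = np.asarray(img, dtype=float)
    Ix = gaussian_filter(img, sigma_d, order=(0, 1))
    Iy = gaussian_filter(img, sigma_d, order=(1, 0))
    Sxx = gaussian_filter(Ix * Ix, sigma_i)
    Syy = gaussian_filter(Iy * Iy, sigma_i)
    Sxy = gaussian_filter(Ix * Iy, sigma_i)
    det = Sxx * Syy - Sxy ** 2
    tr = Sxx + Syy
    return det - k * tr ** 2

def multiscale_supported_corners(img, scales=(1.0, 2.0, 4.0), thresh=1e-4):
    """Keep only points whose Harris response exceeds the threshold at
    every scale, i.e. corners 'supported' across the scale stack."""
    masks = [harris_response(img, s, 2 * s) > thresh for s in scales]
    return np.logical_and.reduce(masks)
```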
  • Edge, Ridge, and Blob Detection with Symmetric Molecules
Edge, Ridge, and Blob Detection with Symmetric Molecules
Rafael Reisenhofer (University of Vienna, Faculty of Mathematics) and Emily J. King (University of Bremen, AG Computational Data Analysis)

Abstract: We present a novel approach to the detection and characterization of edges, ridges, and blobs in two-dimensional images which exploits the symmetry properties of directionally sensitive analyzing functions in multiscale systems that are constructed in the framework of α-molecules. The proposed feature detectors are inspired by the notion of phase congruency, stable in the presence of noise, and by definition invariant to changes in contrast. We also show how the behavior of coefficients corresponding to differently scaled and oriented analyzing functions can be used to obtain a comprehensive characterization of the geometry of features in terms of local tangent directions, widths, and heights. The accuracy and robustness of the proposed measures are validated and compared to various state-of-the-art algorithms in extensive numerical experiments in which we consider sets of clean and distorted synthetic images that are associated with reliable ground truths. To further demonstrate the applicability, we show how the proposed ridge measure can be used to detect and characterize blood vessels in digital retinal images and how the proposed blob measure can be applied to automatically count the number of cell colonies in a Petri dish.

1 Introduction: The correct localization of the significant structures in an image as well as the precise characterization of their geometry are two eminent tasks in digital image processing with an overwhelming number of applications. Since the advent of digital image processing, a large body of research has been devoted to the development and analysis of algorithms for the detection and characterization of features such as edges, ridges, and well-defined two-dimensional shapes.
  • Improving the Effectiveness of Local Feature Detection
Improving the Effectiveness of Local Feature Detection
Shoaib Ehsan
A thesis submitted for the degree of Doctor of Philosophy at the School of Computer Science and Electronic Engineering, University of Essex, May 2012

Abstract: The last few years have seen the emergence of sophisticated computer vision systems that target complex real-world problems. Although several factors have contributed to this success, the ability to solve the image correspondence problem by utilizing local image features in the presence of various image transformations has made a major contribution. The trend of solving the image correspondence problem by a three-stage system that comprises detection, description, and matching is prevalent in most vision systems today. This thesis concentrates on improving the local feature detection part of this three-stage pipeline, generally targeting the image correspondence problem. The thesis presents offline and online performance metrics that reflect real-world performance for local feature detectors and shows how they can be utilized for building more effective vision systems, confirming in a statistically meaningful way that these metrics work. It then shows how knowledge of individual feature detectors' functions allows them to be combined and made into an integral part of a robust vision system. Several improvements to feature detectors' performance in terms of matching accuracy and speed of execution are presented. Finally, the thesis demonstrates how resource-efficient architectures can be designed for local feature detection methods, especially for embedded vision applications.

Copyright © 2012 Shoaib Ehsan. All Rights Reserved.
To Abbu, Ammi, Maria & Guria.
Acknowledgements: First of all, I would like to thank ALLAH Almighty for giving me the strength and determination to complete this work.
  • On Detection of Faint Edges in Noisy Images
On Detection of Faint Edges in Noisy Images
Nati Ofir, Meirav Galun, Sharon Alpert, Achi Brandt, Boaz Nadler, Ronen Basri

Abstract—A fundamental question for edge detection in noisy images is how faint can an edge be and still be detected. In this paper we offer a formalism to study this question and subsequently introduce computationally efficient multiscale edge detection algorithms designed to detect faint edges in noisy images. In our formalism we view edge detection as a search in a discrete, though potentially large, set of feasible curves. First, we derive approximate expressions for the detection threshold as a function of curve length and the complexity of the search space. We then present two edge detection algorithms, one for straight edges, and the second for curved ones. Both algorithms efficiently search for edges in a large set of candidates by hierarchically constructing difference filters that match the curves traced by the sought edges. We demonstrate the utility of our algorithms in both simulations and applications involving challenging real images. Finally, based on these principles, we develop an algorithm for fiber detection and enhancement. We exemplify its utility to reveal and enhance nerve axons in light microscopy images.

Index Terms—Edge detection. Fiber enhancement. Multiscale methods. Low signal-to-noise ratio. Multiple hypothesis tests. Microscopy images.

1 INTRODUCTION: This paper addresses the problem of detecting faint edges in noisy images. Detecting edges is important since edges mark the boundaries of shapes and provide cues to their relief and surface markings. Noisy images are common in a variety of domains in which objects are captured under limited visibility.