Deformable Templates for Face Recognition

Total Page:16

File Type:pdf, Size:1020Kb

Deformable Templates for Face Recognition Deformable Templates for Face Recognition Alan L. Yuille Division of Applied Science Harvard University Abstract Downloaded from http://mitprc.silverchair.com/jocn/article-pdf/3/1/59/1755719/jocn.1991.3.1.59.pdf by guest on 18 May 2021 We describe an approach for extracting facial features from data. Variations in the parameters correspond to allowable de- images and for determining the spatial organization between formations of the object and can be specified by a probabilis- these features using the concept of a deformable template. This tic model. After the extraction stage the parameters of the is a parameterized geometric model of the object to be rec- deformable template can be used for object description and ognized together with a measure of how well it fits the image recognition. LNTRODUCTION geometry will appear in the image and a corresponding measure of fitness, or matching criterion, to determine The sophistication of the human visual system is often how the template interacts with the image. taken for granted until we try to design artificial systems For many objects it is straightforward to specify a plau- with similar capabilities. In particular humans have an sible geometric model. Intuition is often a useful guide amazing ability to recognize faces seen from different and one can draw on the experience of artists (for ex- viewpoints, with various expressions and under a variety ample, Bridgman, 1973). Once the form of the model is of lighting conditions. It is hoped that current attempts specified it can be evaluated and the probability distri- to build computer vision face recognition systems will bution of the parameters determined by statistical tests, shed light on how humans perform this task and the given enough instances of the object. difficulties they overcome. Specifying the imaging model and the matching cri- In this article we describe one promising approach terion is often considerably harder. The way the object toward building a face recognition system. Some alter- reflects light depends both on the reflectance function native approaches are reviewed in Turk and Pentland of the object, which in principle can be modeled, and (1989). on the scene illumination, which is usually unknown. A standard way of describing a face consists of repre- The matching criterion, however, should aim to be rel- senting the features and the spatial relations between atively independent of the lighting conditions. An even them. Indeed two of the earliest face recognition systems more serious problem arises when part of the object is (Goldstein, Harmon, & Lesk, 1972; Kanade, 1977) used invisible, perhaps due to occlusion by another object or features and spatial relations, respectively. Such repre- by lying in deep shadow. Most matching criteria will sentations are independent of lighting conditions, may break down in this situation but recent work, described be able to characterize different spatial expressions, and in the fourth section, shows promise of dealing with such have some limited viewpoint invariance. problems. These representations, however, are extremely difficult The plan of this chapter is as follows. The second to compute reliably from a photograph of a face. It is section gives a simple introduction to deformable tem- very hard to extract features, such as the eyes, by using plates by describing the pioneering work of Fischler and current computer vision techniques. Deformable tem- Elschlager (1973) on extracting global descriptions of plates, however, offer a promising way of using a priori faces. The third section gives a more detailed description knowledge about features, and the spatial relationships of using deformable templates to extract facial features between them, in the detection stage. Once the feature (Yuille, Cohen, Hallinan, 1989). The fourth section shows has been extracted the parameters of the deformable a way to put deformable templates in a more robust template can be used for description and recognition. framework (Hallinan & Mumford, 1990), which promises Designing a deformable template to detect a feature, to make them more reliable and effective. or an object, falls into two parts: (1) providing a geo- Deformable templates, and the closely related elastic metric model for the template, and (2) specifying a models (Burr, 1981a,b;Durbin & Willshaw, 1987; Durbin, model, the imaging model, for how a template of specific Szeliski, & Yuille, 1989) and snakes (Kass, Witlun, & 0 1991 Massachusetts Institute of Technology Journal of Cognitive Neuroscience Volume 3, Number 1 Downloaded from http://www.mitpressjournals.org/doi/pdf/10.1162/jocn.1991.3.1.59 by guest on 24 September 2021 Terzopolous, 1987; Terzopolous, Witkin, & Kass, 1987), The total measure of fitness is the sum of the local have been extensively used in recent years in vision and fitness measures for each feature and the cost of the related areas, Much of this work fits into the general spring terms. Let X7 = {xl,x2, . , x7} be the positions Bayesian statistical framework developed by Grenander of the features. The fitness is and his collaborators (Chow, Grenander, & Keenan, 1989; 7 7c Grenander, 1989; Knoerr, 1989) for synthesizing and rec- ognizing biological shapes. The theory suggests localizing the features in a specific face by adjusting to maximize E(X,). This is a hard GLOBAL TEMPLATES X7 combinatoric problem and Fischler and Elschlager sug- In Fischler and Elschlager’s approach (1973) the face is gest an algorithm, the linear embedding algorithm, to modeled by a set of basic features connected by springs solve it. We will not, however, be concerned here with (see Fig. 1). The set of features includes the eyes, hair, the details of the algorithm, which are related to dynamic mouth, nose, and left and right edges. The individual programming. Downloaded from http://mitprc.silverchair.com/jocn/article-pdf/3/1/59/1755719/jocn.1991.3.1.59.pdf by guest on 18 May 2021 features do not deform and each feature has a local The system was shown to work on a number of 35 by measure of fit to the image. During the matching stage 40 images. The authors reported that the main errors the entire structure is deformed, rather like a rubber occurred in the location of the mouthhose complex, sheet, until all features have a good local fit and the although the other features were correctly located. In spring forces are balanced. The springs help ensure that these situations the spring forces put the mouthhose the spatial relations between the features are reasonable complex in roughly the right position but were unable (i.e.,the nose is not above the hair). to localize it correctly. They also mentioned that in the More specifically, this approach defines a local fitness presence of noise the simple local fitness measures measure Zi(xz)that indicates how strongly the zth feature seemed inadequate to accurately locate the nose and fits at location xi. Fischler and Elschlager define simple mouth. fitness measures for each feature. For example, for the Computer simulations at the EIarvard Robotics Lab (U. left edge of the face the fitness measure at xj is the Wehmeier, personal communication) show that closely difference between the sums of the four intensity values related techniques work well for images of faces with to the left and right of x,.Thus this measure is large for constant size and with little noise. In these simulations a straight vertical line with low intensity on the left and five features were chosen: (1) the eyes, (2) the nostrils high intensity on the right. of the nose, and (3) the sides of the mouth. These fea- The spring joining the ith andjth features are given a tures were attracted to valleys, dark blobs, in the image cost function g&, q),where xi and q are the positions intensity and were connected by springs. A steepest of the features. In most cases gq is assumed to be sym- descent algorithm was used (see Fig. 2). Once the posi- metric and to depend only on the relative positions of tions of the features were located approximately, then the features, hence it can be written as g,(x, - 3).Not more accurate detectors can be used to find them more all features are joined by springs (see Fig. l), so for each precisely. feature i we let N, denote the set of features to which it Fischler and Elschlager’s approach is very attractive is connected but it has several weaknesses in its present form. The main problem is the simplicity of the local fitness mea- sures. At present they are strongly scale dependent and may fail under certain noise conditions. The spring forces might also be improved, by a systematic study of the <I spatial relations, to make sure they give a more accurate bias. Systems of this type seem to work best on small pic- tures with coarse levels of resolution. At coarse scales the local fitness measures may be very simple (Wehmeier was able to locate many features in terms of valleys). At finer scales, however, the local fitness measures must become more complex to take into account the greater variability of the feature. This suggests a coarse to fine, or pyramidal (Burt & Adelson, 1983), approach in which -\ / a global template is used to identify likely positions in the coarse image that can be located and (we hope) verified by more sophisticated techniques at a finer scale. Figure 1. The Eace model consists of the hair, eyes, nose, mouth, We will now describe a system that uses deformable and the left and right edges joined together by springs. templates to locate the features themselves. 60 Journal of Cognitive Neuroscience Volume 3, Number 1 Downloaded from http://www.mitpressjournals.org/doi/pdf/10.1162/jocn.1991.3.1.59 by guest on 24 September 2021 Downloaded from http://mitprc.silverchair.com/jocn/article-pdf/3/1/59/1755719/jocn.1991.3.1.59.pdf by guest on 18 May 2021 Figure 2.
Recommended publications
  • Zhou Ren Email: [email protected] Homepage
    Zhou Ren Email: [email protected] Homepage: http://cs.ucla.edu/~zhou.ren INDUSTRY EXPERIENCE 12/2018 – present, Senior Research Lead, Wormpex AI Research, Bellevue, WA, USA 05/2018 – 12/2018, Senior Research Scientist, Snap Inc., Santa Monica, CA, USA 10/2017 – 05/2018, Research Scientist (III), Snap Inc., Venice, CA, USA 04/2017 – 10/2017, Research Scientist (II), Snap Inc., Venice, CA, USA 07/2016 – 04/2017, Research Scientist (I), Snap Inc., Venice, CA, USA 03/2016 – 06/2016, Research Intern, Snap Inc., Venice, CA, USA 06/2015 – 09/2015, Deep Learning Research Intern, Adobe Research, San Jose, CA, USA 06/2013 – 09/2013, Research Intern, Toyota Technological Institute, Chicago, IL, USA 07/2010 – 07/2012, Project Officer, Media Technology Lab, Nanyang Technological University, Singapore PROFESSIONAL SERVICES Area Chair, CVPR 2021, CVPR 2022, WACV 2022. Senior Program Committee, AAAI 2021. Associate Editor, The Visual Computer Journal (TVCJ), 2018 – present. Director of Industrial Governance Board, Asia-Pacific Signal and Information Processing Association (APSIPA). EDUCATION Doctor of Philosophy 09/2012 – 09/2016, University of California, Los Angeles (UCLA) Major: Vision and Graphics, Computer Science Department Advisor: Prof. Alan Yuille Master of Science 09/2012 – 06/2014, University of California, Los Angeles (UCLA) Major: Vision and Graphics, Computer Science Department Advisor: Prof. Alan Yuille Master of Engineering 08/2010 – 01/2012, Nanyang Technological University (NTU), Singapore Major: Information Engineering, School
    [Show full text]
  • Template Matching Using Fast Normalized Cross Correlation
    Template Matching using Fast Normalized Cross Correlation Kai Briechle and Uwe D Haneb eck Institute of Automatic Control Engineering Technische Universitat Munc hen Munc hen Germany ABSTRACT In this pap er we present an algorithm for fast calculation of the normalized cross correlation NCC and its applica tion to the problem of template matching Given a template t whose p osition is to b e determined in an image f the basic idea of the algorithm is to represent the template for which the normalized cross correlation is calculated as a sum of rectangular basis functions Then the correlation is calculated for each basis function instead of the whole template Theresultofthe correlation of the template t and the image f is obtained as the weighted sum of the correlation functions of the basis functions Dep ending on the approximation the algorithm can by far outp erform Fouriertransform based implementations of the normalized cross correlation algorithm and it is esp ecially suited to problems where many dierent templates are to b e found in the same image f Keywords Normalized cross correlation image pro cessing template matching basis functions INTRODUCTION A basic problem that often o ccurs image pro cessing is to determine the p osition of a given pattern in an image or part of an image the socalled region of interest This problem is closely related to the determination of a received digital signal in signal pro cessing using eg a matched lter Two basic cases can b e dierentiated The p osition of the pattern is unknown estimate
    [Show full text]
  • Human Face and Facial Parts Detection Using Template Matching Technique
    International Journal of Engineering and Advanced Technology (IJEAT) ISSN: 2249 – 8958, Volume-9 Issue-4, April 2020 Human Face and Facial Parts Detection using Template Matching Technique Payal Bose, Samir Kumar Bandyopadhyay Abstract: Face recognition has become relevant in recent years But still, it is one of the most important tasks. Because face because of its potential applications. The aim of this paper is to detection is very much important for security sections in find out the relevant techniques which give not only better different places, crowd analysis and many more. There are accuracy also the efficient speed. There are several techniques several authors describe several techniques for face available for face detection which give much better accuracy but detection. In 2017, the authors, Shrinath Oza, Abdullah the execution speed is not efficient. In this paper, a normalized Khan, Nilabh Swapnil, Aayush Sharma, Jayashree cross-correlation template matching technique is used to solve [1] this problem. According to the proposed algorithm, first different Chaudhari, implemented an algorithm , the Restricted facial parts are detected likes mouth, eyes, and nose. If any of Boltzmann machine with the rules of deep learning and deep the two facial parts are found successfully then the face can be belief network for face recognition. According to this detected. For matching the templates with the target image, the algorithm, it stores the face images as input in a database template rotates at a certain angle interval. and a neural network used to ensure the efficiency and improvement of the recognition system trained the neural Keywords: Face detection, Face Recognition, Template network greedy layer-wise.
    [Show full text]
  • Compositional Convolutional Neural Networks: a Deep Architecture with Innate Robustness to Partial Occlusion
    Compositional Convolutional Neural Networks: A Deep Architecture with Innate Robustness to Partial Occlusion Adam Kortylewski Ju He Qing Liu Alan Yuille Johns Hopkins University Abstract Recent findings show that deep convolutional neural net- works (DCNNs) do not generalize well under partial occlu- sion. Inspired by the success of compositional models at classifying partially occluded objects, we propose to inte- grate compositional models and DCNNs into a unified deep model with innate robustness to partial occlusion. We term this architecture Compositional Convolutional Neural Net- work. In particular, we propose to replace the fully con- nected classification head of a DCNN with a differentiable compositional model. The generative nature of the compo- sitional model enables it to localize occluders and subse- Figure 1: Partially occluded cars from the MS-COCO quently focus on the non-occluded parts of the object. We dataset [20] that are misclassified by a standard DCNN conduct classification experiments on artificially occluded but correctly classified by the proposed CompositionalNet. images as well as real images of partially occluded objects Intuitively, a CompositionalNet can localize the occluders from the MS-COCO dataset. The results show that DC- (occlusion scores on the right) and subsequently focus on NNs do not classify occluded objects robustly, even when the non-occluded parts of the object to classify the image. trained with data that is strongly augmented with partial occlusions. Our proposed model outperforms standard DC- NNs by a large margin at classifying partially occluded ob- One approach to overcome this limitation is to use data jects, even when it has not been exposed to occluded ob- augmentation in terms of partial occlusion [6, 35].
    [Show full text]
  • RESEARCH INVENTY: International Journal of Engineering and Science
    International Refereed Journal of Engineering and Science (IRJES) ISSN (Online) 2319-183X, (Print) 2319-1821 Volume 5, Issue 1 (January 2016), PP.36-40 Introduction of Pattern Recognition with various Approaches 1Tiwary, 2Shiva ramakrishna,3B Sandeep 1,2,3,4( Sreedattha Institute Of Engineering And Science) (Department Of Electronics & Communication Engineering ) ABSTRACT : The objective of this paper is To Familiarize with the fundamental concepts of pattern recognition (PR) & discusses various approaches for same. This paper deals issues: definition of PR (Pattern recognition), PR as System, approaches for PR. Keywords: About five key words in alphabetical order, separated by comma. I. INTRODUCTION In machine learning, PR( pattern recognition) is the assignment of a label to a given input value. An example of pattern recognition is classification, which attempts to assign each input value to one of a given set of classes. Pattern recognition generally aim to provide a reasonable answer for all possible inputs and to perform "most likely" matching of the inputs, taking into account their statistical variation. This is opposed to pattern matching, which look for exact matches in the input with pre-existing patterns. Our goal is to discuss and compare pattern recognition as the best possible Way. II. DEFINITION Before we define PR, we see what is mean by Pattern is a term with pair comprising an observation & a meaning.So PR as per “It is a process of taking raw data & performing action based on the “category” of the pattern.” Or Assignments of inputs to classes. Classification of PR is based on the following two: 1) Supervised classification: Here the input pattern is identified as a member of a pre- defined class.
    [Show full text]
  • OATM: Occlusion Aware Template Matching by Consensus Set Maximization
    OATM: Occlusion Aware Template Matching by Consensus Set Maximization Simon Korman1,3 Mark Milam2 Stefano Soatto1,3 1Weizmann Institute of Science 2Northrop Grumman 3UCLA Vision Lab Abstract We present a novel approach to template matching that is efficient, can handle partial occlusions, and comes with provable performance guarantees. A key component of the method is a reduction that transforms the problem of searching a nearest neighbor among high-dimensional N Figure 1. Instances of the occlusion experiment (Sec. 4.2) A tem- vectors, to searching neighbors among two sets of order plate (overlaid in green) that is 60% occluded by random blocks is √N vectors, which can be found efficiently using range searched for in an image. OATM shows the best results in dealing search techniques. This allows for a quadratic improve- with significant deformation and occlusion (use zoom for detail). ment in search complexity, and makes the method scalable in handling large search spaces. The second contribution increasing the area increases the probability that its corre- is a hashing scheme based on consensus set maximiza- spondent in the target image will be occluded, which causes tion, which allows us to handle occlusions. The resulting the correspondence to fail, unless occlusion phenomena are scheme can be seen as a randomized hypothesize-and-test explicitly taken into account. algorithm, which is equipped with guarantees regarding the In this work we model occlusions explicitly as part of a number of iterations required for obtaining an optimal so- robust template matching process where the co-visible re- lution with high probability. The predicted matching rates gion is assumed to undergo affine deformations of the do- are validated empirically and the algorithm shows a sig- main and range, up to additive noise.
    [Show full text]
  • Matchability Prediction for Full-Search Template Matching Algorithms
    Matchability Prediction for Full-Search Template Matching Algorithms Adrian Penate-Sanchez1, Lorenzo Porzi2,3, and Francesc Moreno-Noguer1 1Institut de Robotica` i Informatica` Industrial, UPC-CSIC, Barcelona, Spain 2Fondazione Bruno Kessler, Trento, Italy 3University of Perugia, Italy Abstract template matching algorithms that perform a full search on all possible transformations of the template guarantee to While recent approaches have shown that it is possible find the global maximum of the distance function, yielding to do template matching by exhaustively scanning the pa- more accurate results at the price of increasing the compu- rameter space, the resulting algorithms are still quite de- tational cost. manding. In this paper we alleviate the computational load Several approaches have focused on improving specific of these algorithms by proposing an efficient approach for characteristics of the template matching framework, such as predicting the matchability of a template, before it is actu- its speed [22, 24], its robustness to partial occlusions [4] or ally performed. This avoids large amounts of unnecessary ambiguities [26] and its ability to select reliable templates computations. We learn the matchability of templates by for rapid visual tracking [2]. Yet, most of these improve- using dense convolutional neural network descriptors that ments have been designed for the algorithms that do not per- do not require ad-hoc criteria to characterize a template. form a full search. In this work we will bring these improve- By using deep learning descriptions of patches we are able ments to algorithms that perform full search, and in particu- to predict matchability over the whole image quite reli- lar we will consider FAst-Match [15], a recent method that ably.
    [Show full text]
  • Brain Mapping Methods: Segmentation, Registration, and Connectivity Analysis
    UCLA UCLA Electronic Theses and Dissertations Title Brain Mapping Methods: Segmentation, Registration, and Connectivity Analysis Permalink https://escholarship.org/uc/item/1jt7m230 Author Prasad, Gautam Publication Date 2013 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California University of California Los Angeles Brain Mapping Methods: Segmentation, Registration, and Connectivity Analysis A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Computer Science by Gautam Prasad 2013 c Copyright by Gautam Prasad 2013 Abstract of the Dissertation Brain Mapping Methods: Segmentation, Registration, and Connectivity Analysis by Gautam Prasad Doctor of Philosophy in Computer Science University of California, Los Angeles, 2013 Professor Demetri Terzopoulos, Chair We present a collection of methods that model and interpret information represented in structural magnetic resonance imaging (MRI) and diffusion MRI images of the living hu- man brain. Our solution to the problem of brain segmentation in structural MRI combines artificial life and deformable models to develop a customizable plan for segmentation real- ized as cooperative deformable organisms. We also present work to represent and register white matter pathways as described in diffusion MRI. Our method represents these path- ways as maximum density paths (MDPs), which compactly represent information and are compared using shape based registration for population studies. In addition, we present a group of methods focused on connectivity in the brain. These include an optimization for a global probabilistic tractography algorithm that computes fibers representing connectivity pathways in tissue, a novel maximum-flow based measure of connectivity, a classification framework identifying Alzheimer's disease based on connectivity measures, and a statisti- cal framework to find the optimal partition of the brain for connectivity analysis.
    [Show full text]
  • Arxiv:2007.15817V3 [Cs.CV] 7 May 2021
    Robust Template Matching via Hierarchical Convolutional Features from a Shape Biased CNN Bo Gao1[0000−0003−3930−1815] and Michael W. Spratling1[0000−0001−9531−2813] Department of Informatics, King's College London, London, UK fbo.gao,[email protected] Abstract. Finding a template in a search image is an important task underlying many computer vision applications. Recent approaches perform template matching in a deep feature-space, produced by a convolutional neural network (CNN), which is found to provide more tolerance to changes in appearance. In this article we investigate if enhancing the CNN's encoding of shape information can produce more distinguishable features that improve the performance of template matching. This investigation results in a new template matching method that produces state-of-the-art results on a standard benchmark. To confirm these results we also create a new benchmark and show that the proposed method also outperforms existing techniques on this new dataset. Our code and dataset is available at: https://github.com/iminfine/Deep-DIM. Keywords: Template match · Convolutional neural networks · VGG19. 1 Introduction Template matching is a technique to find a rectangular region of an image that contains a certain object or image feature. It is widely used in many computer vision applications such as object tracking [1, 2], object detection [3, 4] and 3D reconstruction [5, 6]. A similarity map is typically used to quantify how well a template matches each location in an image. Traditional template matching methods calculate the similarity using a range of metrics such as the normalised cross-correlation (NCC), the sum of squared differences (SSD) or the zero-mean normalised cross correlation (ZNCC) applied to pixel intensity or color values.
    [Show full text]
  • Towards Real-Time Object Recognition and Pose Estimation in Point Clouds
    Towards Real-time Object Recognition and Pose Estimation in Point Clouds Marlon Marcon1 a, Olga Regina Pereira Bellon2 b and Luciano Silva2 c 1Dapartment of Software Engineering, Federal University of Technology - Parana,´ Dois Vizinhos, Brazil 2Department of Computer Science, Federal University of Parana,´ Curitiba, Brazil Keywords: Transfer Learning, 3D Computer Vision, Feature-based Registration, ICP Dense Registration, RGB-D Images. Abstract: Object recognition and 6DoF pose estimation are quite challenging tasks in computer vision applications. De- spite efficiency in such tasks, standard methods deliver far from real-time processing rates. This paper presents a novel pipeline to estimate a fine 6DoF pose of objects, applied to realistic scenarios in real-time. We split our proposal into three main parts. Firstly, a Color feature classification leverages the use of pre-trained CNN color features trained on the ImageNet for object detection. A Feature-based registration module conducts a coarse pose estimation, and finally, a Fine-adjustment step performs an ICP-based dense registration. Our proposal achieves, in the best case, an accuracy performance of almost 83% on the RGB-D Scenes dataset. Regarding processing time, the object detection task is done at a frame processing rate up to 90 FPS, and the pose estimation at almost 14 FPS in a full execution strategy. We discuss that due to the proposal’s modularity, we could let the full execution occurs only when necessary and perform a scheduled execution that unlocks real-time processing, even for multitask situations. 1 INTRODUCTION Template matching approaches rely on RANSAC- based feature matching algorithms, following the Object recognition and 6D pose estimation represent pipeline proposed by (Aldoma et al., 2012b).
    [Show full text]
  • Orientation Detection Using a CNN Designed by Transfer Learning of Alexnet
    Proceedings of the 8th IIAE International Conference on Industrial Application Engineering 2020 Orientation Detection Using a CNN Designed by Transfer Learning of AlexNet Fusaomi Nagataa,*, Kohei Mikia, Yudai Imahashia, Kento Nakashimaa Kenta Tokunoa, Akimasa Otsukaa, Keigo Watanabeb and Maki K. Habibc aDepartment of Mechanical Engineering, Faculty of Engineering, Sanyo-Onoda City University Sanyo-Onoda 756-0884, Japan bGraduate School of Natural Science and Technology, Okayama University Okayama 700-8530, Japan cMechanical Engineering Department, School of Sciences and Engineering, The American University in Cairo AUC Avenue, P.O.Box 74, New Cairo 11835, Egypt *Corresponding Author: [email protected] Abstract classification in spite of only two layers. Nagi et al. designed max-pooling convolutional neural networks (MPCNN) for Artificial neural network (ANN) which has four or vision-based hand gesture recognition(1). The MPCNN could more layers structure is called deep NN (DNN) and is classify six kinds of gestures with 96% accuracy and allow recognized as a promising machine learning technique. mobile robots to perform real-time gesture recognition. Convolutional neural network (CNN) has the most used and Weimer et al. also designed a deep CNN architectures for powerful structure for image recognition. It is also known automated feature extraction in industrial inspection that support vector machine (SVM) has a superior ability for process(2). The CNN automatically generates features from binary classification in spite of only two layers. We have massive amount of training image data and demonstrates developed a CNN&SVM design and training tool for defect excellent defect detection results with low false alarm rates.
    [Show full text]
  • Template Matching Approach for Face Recognition System
    International Journal of Signal Processing Systems Vol. 1, No. 2 December 2013 Template Matching Approach for Face Recognition System Sadhna Sharma National Taiwan Ocean University, Taiwan Email: [email protected] Abstract—Object detection or face recognition is one of the Identification (one-to-many matching): Given an most interesting application in the image processing and it is image of an unknown individual, determining that a classical problem in computer vision, having application to person’s identity by comparing (possibly after surveillance, robotics, multimedia processing. Developing a encoding) that image with a database of (possibly generic object detection system is still an open problem, but there have been important successes over the past several encoded) images of known individuals. years from visual pattern. Among the most influential In general, based on the face representation face system is the face recognition system. Face recognition has recognition techniques can be divided into Appearance- become a popular area of research in computer vision and based technique (uses holistic texture features and is one of the most successful applications of image analysis and applied to either whole face or specific regions in a face understanding. Because of the nature of the problem, not image) and Feature-based technique (uses geometric only computer science researchers are interested in it, but facial features mouth, eyes, brows, cheeks etc and neuroscientists and psychologists also. It is the general geometric relationships between them). opinion that advances in computer vision research will provide useful insights to neuroscientists and psychologists A. Appearance Based into how human brain works, and vice versa.
    [Show full text]