2.5D Elastic Graph Matching Algorithms

2.5D Elastic Graph Matching Algorithms Stefanos Zafeiriou† Maria Petrou† Vasileios Argyriou ∗ † Imperial College London ∗Kingston University {s.zafeiriou,maria.petrou}@imperial.ac.uk {v.argyriou}@kingston.ac.uk Abstract The graph matching process is implemented by a stochas- tic optimization of a cost function which takes into account In this paper, a series of advances in Elastic Graph both jet similarities and grid deformations. One of the basic Matching (EGM) for face recognition assisted by the avail- advantages of EGM algorithms is that they can be applied ability of 3D facial geometry is proposed. More speci- in a fully automatic manner when combined with a face de- cally, we conceptually extend the EGM algorithm in order tector and/or a fully automatic facial feature detection algo- to exploit the 3D nature of human facial geometry for face rithm [14, 11, 13]. recognition/verication. In order to achieve that, rst we A lot of research has been conducted in order to boost extend the matching module of the EGM algorithm in order the performance of EGM for face recognition, face veri- to capitalize on the 2.5D facial data. Moreover, we incor- cation and facial expression recognition [5, 8, 14, 13, 11]. porate the 3D geometry into the multiscale analysis used In [5], EGM has been proposed and tested for frontal face and build a novel geodesic multiscale morphological pyra- verication. A variant of the typical EGM, the so-called mid of dilations/erosions in order to ll the graph jets. We Morphological Elastic Graph Matching (MEGM), has been demonstrate the efciency of the proposed advances in the proposed for frontal face verication and tested under var- face recognition/verication problem. ious recording conditions [8, 14, 13]. EGM with morphological feature vectors for facial expression recognition was proposed in [12]. 1. Introduction In this paper, we present EGM methods that exploit the Dynamic link architecture (DLA) has been proposed 3D facial geometry. Nowadays, the acquisition of 3D faces as an abstract methodology for distortion invariant object is a relatively simple procedure due to the low cost devices recognition [9]. In DLA, an object is represented as a con- that are available. This has constituted 3D face recogni- nected graph. Each graph node is located at a certain spa- tion a very rapidly growing research topic. Another reason, tial image location x. A feature vector, the so called jet, is that resulted in the increase of 3D face recognition popular- attached at each graph node. The jet elements can be local ity as a research topic, is that although 2D face recognition brightness values that represent the image region around the systems can achieve good performance in constrained envi- node. However, it is desirable to have more complex types ronments, they still encounter difculties in handling large of jet that are produced by multiscale image analysis [9]. amounts of facial variations due to head pose, lighting con- The representation of an object in DLA can be summarized ditions and facial expressions. Moreover, the acquisition of in the following steps: 1)Group all the features that corre- 3D face geometry is also quite restricted due to the pres- spond to the same graph node of the object into a jet. 2) ence of noise and the difculty presented in image acquisi- Group all the nodes and jets that belong to the object in tion [3]. For the acquisition of 3D data, different techniques order to form the object graph. 3) Dene neighborhood re- have been used, such as laser and structured light scanners. lationships for each graph node. However, the use of these techniques is not very practical as One of the most well-studied implementations of DLA is it requires the cooperation of the subject. So, we use pho- the Elastic Graph Matching (EGM) algorithm. In [9] EGM tometric stereo [2] for 3D acquisition, that does not depend was initially proposed for arbitrary object recognition from signicantly on the user’s cooperation. The actual model images and has been a very popular topic of research for that we acquire this way is a 2.5D model. 2.5D is a sim- various facial image characterization applications. In EGM, plied 3D (x, y, z) surface representation that contains at a reference object graph is created by overlaying a rectan- most one depth value (z direction) for every point in the gular elastic sparse graph on the object image and then cal- (x, y) plane. culating a Gabor wavelet bank response at each graph node. All EGM based methods proposed so far do not take into 703 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops 978-1-4244-4441-0/09/$25.00 ©2009 IEEE consideration the 3D nature of the objects. In this paper ordinates x, a jet (feature vector) j(x) is formed: we propose methods for exploiting the 3D nature of the ob- j x x x T jects inside the EGM architecture. That is, the proposed ( )=[f1( ),...,fM ( )] , (1) algorithm introduces a new structure for object representa- where f (x) denotes the output of a local operator applied tion combining both image intensity and 3D geometry in- i to image f at the i-thscaleoratthei-th pair (scale, orien- formation. Moreover, we propose algorithms which can be tation) and M is the jet dimensionality. used for robustly matching the object structure in novel in- The next step of EGM is to match the reference graph on stances. The algorithm is general and can be used for arbi- the test facial expression image in order to nd the corre- trary object recognition. In this paper we demonstrate the spondences of the reference graph nodes on the test image. power of the proposed method in face recognition assisted This is accomplished by minimizing a cost function that em- by the available 3D face geometry. Moreover, we exploit ploys node jet similarities while preserving at the same time the 3D face information available in order to advance the the node neighborhood relationships. Let the subscripts t matching procedure. Moreover, we adopt the assumption and r denote a test and a reference facial image (or graph), that facial expressions are isometries upon the facial sur- respectively. The L2 norm between the feature vectors at face and we incorporate this inside the matching algorithm. the l-th graph node of the reference and the test graph is This assumption has been exploited in [4, 10] for creating used as a similarity measure between jets. Another simi- expression invariant face representations. larity measure that is used is the cosine between the two The proposed method in its generalized version can be vectors: applied in a fully automatic manner, combined with a face j(xl )T j(xl ) detection algorithm. The simplied version of the proposed C (j(xl ), j(xl )) = r t . (2) f t r ||j xl ||||j xl || algorithm requires the detection of the nose. The nose is ( r) ( t) the most easily detected facial feature when the 3D facial V geometry is available. On the contrary, most of the 3D face Let be the set of all graph vertices of a certain facial recognition algorithms proposed so far require the robust image. For the rectangular graphs, all nodes apart from the detection [7] of a set of facial features, which is in general boundary nodes have exactly four connected nodes. Let N (l) l a difcult task. Furthermore, we incorporate the 3D facial be the four-connected neighborhood of node .Inor- geometry inside the multiscale analysis and we propose a der to quantify the node neighborhood relationships using a novel Geodesic Morphological Multiscale analysis in order measure, the relative normalized local node deformation is to ll the graph jets. used: || xl − xξ − xl − xξ || l l ( t t ) ( r r) Cd(x , x )= . (3) t r ||xl − xξ|| 2. 2D Elastic Graph Matching Techniques ξ∈N (l) r r {xl }L In the rst step of the EGM algorithm, a suitable for The set of vertices t(r)opt l=1 can be found by mini- face representation sparse graph is selected. An evenly dis- mizing: tributed rectangular graph is one of the most easy to handle l L l {x (r) } =min{xl } C({x }) t opt l=1 t t automatically forms of a graph, since only a face detection l l =min{xl } {−Cf (j(x ), j(x )) algorithm is needed in order to nd an initial approxima- t l∈V t r +λC (xl , xl )}. tion of the rectangular facial region. The facial image re- d t r (4) gion is analyzed and a set of local descriptors is extracted at The choice of λ in (10) controls the rigidity/plasticity of the each graph node (called jets). Analysis is usually performed graph [5], [8]. by building an information pyramid using scale-space tech- In [8], the optimization of (4) was implemented as a Sim- niques. In the standard EGM, a 2D Gabor based lter bank ulated Annealing (SA) procedure, that imposes global trans- has been used for image analysis. The output of multiscale lation at the graph and local node deformations. In [12], in morphological dilation-erosion operations or the morpho- order to deal with face translation, rotation and scaling, the logical signal decomposition at several scales are nonlinear following optimization problem has been formulated: alternatives of the Gabor lters for multiscale analysis and L l both have been both successfully used for facial image anal- {{δl} =1, t, A} =min L C({Ax + t + δl}) l opt {δl}l=1,t,A r ysis [8].

2.5D Elastic Graph Matching Algorithms

Two-Dimensional Rotational Kinematics Rigid Bodies

1.2 Rules for Translations

Multidisciplinary Design Project Engineering Dictionary Version 0.0.2

2-D Drawing Geometry Homogeneous Coordinates

Translation and Reflection; Geometry

Lecture 8 Image Transformations

Affine Transformations and Rotations

Engineering Mechanics

Ting Yip Math 308A 12/3/2001 Ting Yip Math 308A Abstract

The Mathematics Behind Escher's Prints: Form Symmetry to Groups

FRIEZE GROUPS in R2 Contents 1. Metric Spaces, Groups, And

Unit-1 : 2D and 3D Transformation & Viewing