Automatic 2.5D Cartoon Modelling

Total Page:16

File Type:pdf, Size:1020Kb

Automatic 2.5D Cartoon Modelling Automatic 2.5D Cartoon Modelling Fengqi An School of Computer Science and Engineering University of New South Wales A dissertation submitted for the degree of Master of Science 2012 PLEASE TYPE THE UNIVERSITY OF NEW SOUTH WALES T hesis!Dissertation Sheet Surname or Family name. AN First namEY. Fengqi Orner namels: Zane Abbreviatlo(1 for degree as given in the University calendar: MSc School: Computer Science & Engineering Faculty: Engineering Title; Automatic 2.50 Cartoon Modelling Abstract 350 words maximum: (PLEASE TYPE) Declarat ion relating to disposition of project thesis/dissertation I hereby grant to the University of New South Wales or its agents the right to archive and to make available my thesis or dissertation in whole orin part in the University libraries in all forms of media, now or here after known, subject to the provisions of the Copyright Act 1968. I retain all property rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of thts thesis or dissertation. I also authorise University Microfilms to use the 350 word abstract of my thesis in Dissertation· Abstracts International (this is applicable to-doctoral theses only) .. ... .............. ~..... ............... 24 I 09 I 2012 Signature · · ·· ·· ·· ···· · ··· ·· ~ ··· · ·· ··· ···· Date The University recognises that there may be exceptional circumstances requiring restrictions on copying or conditions on use. Requests for restriction for a period of up to 2 years must be made in writi'ng. Requests for a longer period of restriction may be considered in exceptional e<ircumstances and require the approval of the Dean of Graduate Research, FOR OFFICE USE ONLY Date of completion of requirements for Award: THIS SHEET IS TO BE GLUED TO THE INSIDE FRONT COVER OF THE THESIS ii ORIGINALITY STATEMENT 'I hereby declare that this submrssion is my own work and to the best of my knowledge it contains no materials previously published or written by another person, or substantial proportions of material which have been accepted for the award of any other degree or diploma at UNSW or any other educational institution, except where due acknowledgement is made in the thesis. Any contribution made to the research by others, with whom I have worked at UNSW or elsewhere, is explicitly acknowledged in the thesis. I also declare that the intellectual content of this thesis is the product of my own work, except to the extent that assistance from others in the project's aesign and conception or in style, presentation and linguistic expression is acknowledged.' Signed ·················~················ · ·· · ···· Date 24/09 I 2012 COPYRIGHT STATEMENT 'I hereby grant the University of New South Wales or its agents the right to archive and to make available my thesis or dissertation in whole or part in the University libraries in all forms of media, now or here after known, subject to the provisions of the Copyright Act 1968. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertation. I also authorise University Microfilms to use the 350 word abstract of my thesis in Dissertation Abstract International (this is applicable to doctoral theses only). I have either used no substantial portions of copyright material in my thesis or I have obtained permission to use copyright material; where permission has not been granted I have applied/Will apply for a partial restriction of the digital copy of my thesis or dissertation.' Signed ············· · ···~············ · ··················· ·· ··· Date 24109 I 2012 AUTHENTICITY STATEMENT 1 1 certify that the Library deposit digital copy is a direct equivalent of the final officially approved version of my thesis. No emendation of content has occurred and if there are any minor variations in formatting, they are the result of the conversion to digital format.' Sighed · ·············~··············· · ··· ·· · · ·· · ··· · ·· ···· ··· 24 I 0912012 Date iii Abstract Non-photorealistic arts have been an invaluable form of media for over tens of thousands of years, and are widely used in animation and games today, motivating research into this field. Recently, the novel 2.5D Model has emerged, targetting the limitations of both 2D and 3D forms of cartoons. The most recent development is the 2.5D Cartoon Model. The manual building process of such models is labour intensive, and no automatic building method for 2.5D models exists currently. This dissertation proposes a novel approach to the problem of automatic cre- ation of 2.5D Cartoon Models, termed Auto-2CM in this thesis, which is the first attempt of a solution to the problem. The proposed approach aims to build 2.5D models from real world objects. Auto-2CM collects 3D information on the candidate object us- ing 3D reconstruction methods from Computer Vision, then partitions it into meaningful parts using segmentation methods from Computer Graphics. A novel 3D-2.5D conversion method is introduced to create the final 2.5D model, which is the first method for 3D-2.5D conver- sion. The Auto-2CM framework does not mandate specific algorithms of reconstruction or segmentation, therefore different algorithms may be used for different kinds of objects. The effect of different algorithms on the final 2.5D model is currently unknown. A perceptual evaluation of Auto-2CM is performed, which shows that by using different combinations of algorithms within Auto- 2CM for specific kinds of objects, the performance of the system maybe increased significantly. The approach can produce acceptable models for both manual sketches and direct use. It is also the first experimental study of the problem. iv v Acknowledgements Of the many people who deserve thanks, some are particularly prominent: My supervisors, Prof. Arcot Sowmya and Dr. Xiongcai Cai, for their advice, guidance and support. This thesis would not have been possible without their selfless contribution. My parents, Shuli Zhan and Yuli An, for their support and understanding. The thanks and appreciation I owe to them cannot be described by any words. My fellow students in the group, Roger Chen, for his selfless help during my study; Dimitri Semenovich and Anuraag Sridhar, for their guidance and help as seniors. Shy Shalom, co-author of SDF, and Alec Rivers, co-author of 2.5D Cartoon Models, for their patience in answering questions and for helpful discussions. vi Contents 1. Introduction 3 1.1. Cartoon, Model and Modelling ........................ 4 1.2. 2D,3D and 2.5D ................................ 4 1.2.1. 2D ................................... 4 1.2.2. 3D ................................... 5 1.2.3. 2.5D .................................. 5 1.3. Artists at Work ................................ 6 1.3.1. 2D programs and devices ....................... 7 1.3.2. Automatic 2D generation ....................... 7 1.3.3. 3D programs and devices ....................... 8 1.3.4. Automatic 3D modelling ....................... 8 1.3.5. 2.5D programs ............................ 9 1.4. Motivation ................................... 9 1.5. Overview .................................... 10 1.6. Scope ..................................... 11 1.7. Contribution .................................. 11 1.8. Organization ................................. 12 2. Background and Literature Survey 13 2.1. Cartoons .................................... 14 2.1.1. History ................................. 14 2.1.2. Modern Cartoons ........................... 16 2.1.3. The 2D-3D argument ......................... 17 2.2. Early 2.5D Cartoons ............................. 17 2.2.1. Automatic In-betweening for Animation . 18 2.2.2. Drawing for Illustration and Annotation in 3D . 19 vii viii Contents 2.3. The 2.5D Cartoon Models .......................... 20 2.3.1. Advantages .............................. 20 2.3.2. Limitations .............................. 21 2.3.3. The Manual Creation Process .................... 23 2.4. 3D Reconstruction .............................. 24 2.4.1. Feature Detection, Stereo Matching and Visual Hull . 24 2.4.2. Image-Based 3D Reconstruction ................... 26 2.5. 3D Segmentation ............................... 27 2.5.1. Manual 3D Segmentation ....................... 27 2.5.2. Automatic 3D Segmentation ..................... 27 2.5.3. Shape Diameter Function ...................... 28 2.5.4. Fitting Primitives ........................... 29 2.5.5. Protrusion-oriented Segmentation . 29 2.6. 3D Projection ................................. 30 2.7. Summary ................................... 31 3. Automatic 2.5D Cartoon Modelling 33 3.1. 3D Reconstruction .............................. 34 3.2. 3D Segmentation ............................... 35 3.3. 2.5D Stroke Creation ............................. 38 3.3.1. Parts Refinement ........................... 38 3.3.2. Shape extraction ........................... 39 3.3.3. Stroke Control Points Creation ................... 40 3.3.4. Other Attributes ........................... 41 3.4. 2.5D Model Assembly ............................. 42 3.4.1. Key-View Selection .......................... 42 3.4.2. Single Part Assembly ......................... 43 3.4.3. Model Assembly ........................... 43 3.5. Manual Modifications ............................. 44 3.5.1. Stylistic Strokes Modification .................... 45 3.5.2. Colouring ............................... 45 3.6. Algorithm ................................... 47 3.7. Experiments and Results
Recommended publications
  • The Uses of Animation 1
    The Uses of Animation 1 1 The Uses of Animation ANIMATION Animation is the process of making the illusion of motion and change by means of the rapid display of a sequence of static images that minimally differ from each other. The illusion—as in motion pictures in general—is thought to rely on the phi phenomenon. Animators are artists who specialize in the creation of animation. Animation can be recorded with either analogue media, a flip book, motion picture film, video tape,digital media, including formats with animated GIF, Flash animation and digital video. To display animation, a digital camera, computer, or projector are used along with new technologies that are produced. Animation creation methods include the traditional animation creation method and those involving stop motion animation of two and three-dimensional objects, paper cutouts, puppets and clay figures. Images are displayed in a rapid succession, usually 24, 25, 30, or 60 frames per second. THE MOST COMMON USES OF ANIMATION Cartoons The most common use of animation, and perhaps the origin of it, is cartoons. Cartoons appear all the time on television and the cinema and can be used for entertainment, advertising, 2 Aspects of Animation: Steps to Learn Animated Cartoons presentations and many more applications that are only limited by the imagination of the designer. The most important factor about making cartoons on a computer is reusability and flexibility. The system that will actually do the animation needs to be such that all the actions that are going to be performed can be repeated easily, without much fuss from the side of the animator.
    [Show full text]
  • Dual-Forward-Focus
    Scroll Back The Theory and Practice of Cameras in Side-Scrollers Itay Keren Untame [email protected] @itayke Scrolling Big World, Small Screen Scrolling: Neural Background Fovea centralis High cone density Sharp, hi-res central vision Parafovea Lower cone density Perifovea Lowest density, Compressed patterns. Optimized for quick pattern changes: shape, acceleration, direction Fovea centralis High cone density Sharp, hi-res central vision Parafovea Lower cone density Perifovea Lowest density, Compressed patterns. Optimized for quick pattern changes: shape, acceleration, direction Thalamus Relay sensory signals to the cerebral cortex (e.g. vision, motor) Amygdala Emotional reactions of fear and anxiety, memory regulation and conditioning "fight-or-flight" regulation Familiar visual patterns as well as pattern changes may cause anxiety unless regulated Vestibular System Balance, Spatial Orientation Vestibulo-Ocular Reflex Natural image stabilizer Conflicting sensory signals (Visual vs. Vestibular) may lead to discomfort and nausea* * much worse in 3D (especially VR), but still effective in 2D Scrolling with Attention, Interaction and Comfort Attention: Use the camera to provide sufficient game info and feedback Interaction: Make background changes predictable, tightly bound to controls Comfort: Ease and contextualize background changes Attention The Elements of Scrolling Interaction Comfort Scrolling Nostalgia Rally-X © 1980 Namco Scramble © 1981 Jump Bug © 1981 Defender © 1981 Konami Hoei/Coreland (Alpha Denshi) Williams Electronics Vanguard
    [Show full text]
  • Skeletal Animations Part II
    3D Video Games 2021-05-11 09: Computer Animations for games 3/3 Skeletal animations Part II Course Plan lec. 1: Introduction lec. 2: Mathematics for 3D Games lec. 3: Scene Graph lec. 4: Game 3D Physics + lec. 5: Game Particle Systems ◗ lec. 6: Game 3D Models lec. 7: Game Textures ◗ lec. 8: Game 3D Animations lec. 9: Game 3D Audio lec. 10: Networking for 3D Games lec. 11: Artificial Intelligence for 3D Games lec. 12: Game 3D Rendering Techniques 120 Mixing keyframes and entire animations (notes) Poses in a skeletal animations can be easily blended (blending local per-bone transforms) This interpolation is very expressive: very different frames can be blended with good results much more than with blend-shapes! Keyframes can be very far apart E.g.: decent walk-cycles with just 4 key-frames! (2 per step) E.g.: decent attack animations with just 2 key-frames! (but better results are always obtained inserting new key-frames) Entire animations can be mixed. Two ways: Transitions between two animations (or more) Compositing (layering) two animations (or more) 121 Marco Tarini Università degli studi di Milano 1 3D Video Games 2021-05-11 09: Computer Animations for games 3/3 Skeletal animations Part II Pose = keyframe Compress animations animation “walk” t = 0 keyframe A stored pose t = 1 0.75 A ---+ 0.25 B t = 2 0.50 A ---+ 0.50 B Inbetween pose, t = 3 0.25 A ---+ 0.75 B computed on the fly t = 4 keyframe B t = 5 0.50 B ---+ 0.50 C t = 6 keyframe C 122 Interpolation of poses (at runtime): transition between animations Eg:
    [Show full text]
  • Introduction Week 1, Lecture 1
    CS 430 Computer Graphics Introduction Week 1, Lecture 1 David Breen, William Regli and Maxim Peysakhov Department of Computer Science Drexel University 1 Overview • Course Policies/Issues • Brief History of Computer Graphics • The Field of Computer Graphics: A view from 66,000ft • Structure of this course • Homework overview • Introduction and discussion of homework #1 2 Computer Graphics I: Course Goals • Provide introduction to fundamentals of 2D and 3D computer graphics – Representation (lines/curves/surfaces) – Drawing, clipping, transformations and viewing – Implementation of a basic graphics system • draw lines using Postscript • simple frame buffer with PBM & PPM format • ties together 3D projection and 2D drawing 3 Interactive Computer Graphics CS 432 • Learn and program WebGL • Computer Graphics was a pre-requisite – Not anymore • Looks at graphics “one level up” from CS 430 • Useful for Games classes • Part of the HCI and Game Development & Design tracks? 5 Advanced Rendering Techniques (Advanced Computer Graphics) • Might be offered in the Spring term • 3D Computer Graphics • CS 430/536 is a pre-requisite • Implement Ray Tracing algorithm • Lighting, rendering, photorealism • Study Radiosity and Photon Mapping 7 ART Student Images 8 Computer Graphics I: Technical Material • Course coverage – Mathematical preliminaries – 2D lines and curves – Geometric transformations – Line and polygon drawing – 3D viewing, 3D curves and surfaces – Splines, B-Splines and NURBS – Solid Modeling – Color, hidden surface removal, Z-buffering 9
    [Show full text]
  • CS 4204 Computer Graphics 3D Views and Projection
    CS 4204 Computer Graphics 3D views and projection Adapted from notes by Yong Cao 1 Overview of 3D rendering Modeling: * Topic we’ve already discussed • *Define object in local coordinates • *Place object in world coordinates (modeling transformation) Viewing: • Define camera parameters • Find object location in camera coordinates (viewing transformation) Projection: project object to the viewplane Clipping: clip object to the view volume *Viewport transformation *Rasterization: rasterize object Simple teapot demo 3D rendering pipeline Vertices as input Series of operations/transformations to obtain 2D vertices in screen coordinates These can then be rasterized 3D rendering pipeline We’ve already discussed: • Viewport transformation • 3D modeling transformations We’ll talk about remaining topics in reverse order: • 3D clipping (simple extension of 2D clipping) • 3D projection • 3D viewing Clipping: 3D Cohen-Sutherland Use 6-bit outcodes When needed, clip line segment against planes Viewing and Projection Camera Analogy: 1. Set up your tripod and point the camera at the scene (viewing transformation). 2. Arrange the scene to be photographed into the desired composition (modeling transformation). 3. Choose a camera lens or adjust the zoom (projection transformation). 4. Determine how large you want the final photograph to be - for example, you might want it enlarged (viewport transformation). Projection transformations Introduction to Projection Transformations Mapping: f : Rn Rm Projection: n > m Planar Projection: Projection on a plane.
    [Show full text]
  • Polygonal Modeling: the Process of Building a 3D Object by Specifying the Polygons That Make up That Object
    Modeling 3D objects with polygons 1 Why polygons? Simple mathematical description Standard 3D graphics primitive All graphics packages optimized for polygon throughput Most 3D graphics algorithms assume a polygon-based scene Common polygon algorithms implemented in hardware In the end, everything (well, almost everything) is a polygon (C) Doug Bowman, Virginia Tech, 2008 2 2 Terminology Polygon soup: a general set of unstructured polygons used to define a scene Polygonal mesh: a set of connected polygons that together form a surface (C) Doug Bowman, Virginia Tech, 2008 3 3 More terminology 3D polygonal model: a 3D object made up entirely of polygons 3D polygonal modeling: the process of building a 3D object by specifying the polygons that make up that object NOTE: you can build a 3D polygonal model without using 3D polygonal modeling! (C) Doug Bowman, Virginia Tech, 2008 4 4 Methods of creating polygonal meshes Build mesh by hand (C) Doug Bowman, Virginia Tech, 2008 5 5 Methods of creating polygonal meshes Tesselate a theoretical smooth surface Tesselation: the process of creating a polygonal approximation from a smooth surface (C) Doug Bowman, Virginia Tech, 2008 6 6 Methods of creating polygonal meshes Extrude a 2D polygon, curve, etc. Extrusion: the process of moving a 2D cross-section through space to create a 3D solid (C) Doug Bowman, Virginia Tech, 2008 7 7 Methods of creating polygonal meshes Revolve/sweep a 2D polygon or curve Revolution: the process of rotating a 2D cross-section about an axis to create a 3D
    [Show full text]
  • Critical Review of Open Source Tools for 3D Animation
    IJRECE VOL. 6 ISSUE 2 APR.-JUNE 2018 ISSN: 2393-9028 (PRINT) | ISSN: 2348-2281 (ONLINE) Critical Review of Open Source Tools for 3D Animation Shilpa Sharma1, Navjot Singh Kohli2 1PG Department of Computer Science and IT, 2Department of Bollywood Department 1Lyallpur Khalsa College, Jalandhar, India, 2 Senior Video Editor at Punjab Kesari, Jalandhar, India ABSTRACT- the most popular and powerful open source is 3d Blender is a 3D computer graphics software program for and animation tools. blender is not a free software its a developing animated movies, visual effects, 3D games, and professional tool software used in animated shorts, tv adds software It’s a very easy and simple software you can use. It's show, and movies, as well as in production for films like also easy to download. Blender is an open source program, spiderman, beginning blender covers the latest blender 2.5 that's free software anybody can use it. Its offers with many release in depth. we also suggest to improve and possible features included in 3D modeling, texturing, rigging, skinning, additions to better the process. animation is an effective way of smoke simulation, animation, and rendering. Camera videos more suitable for the students. For e.g. litmus augmenting the learning of lab experiments. 3d animation is paper changing color, a video would be more convincing not only continues to have the advantages offered by 2d, like instead of animated clip, On the other hand, camera video is interactivity but also advertisement are new dimension of not adequate in certain work e.g. like separating hydrogen from vision probability.
    [Show full text]
  • Principles of Computer Graphics Animation
    Principles of computer graphics animation Christian Van Brussel – SPS ICTEAM/ELEN – February 2012 Overview 3D animation and 3D deformation tools Generating animations Combining animations Common animation problems Overview 3D animation and 3D deformation tools Generating animations Combining animations Common animation problems 3D Rendering Render one frame: illumination model Set of frames: motion model The Human Eye VS. the Motion 3D Meshes Defined by: Vertices Triangles Textures and texture coordinates Normals Shaders and other material properties Animating a 3D Mesh Three main categories of animation: Frame by frame Key frames + interpolation Procedural Two main tools for the deformation: Morphing Skeletal animation Morphing Pose of a mesh: 3 P= p0 ,... , pn, pi ∈ℝ Morph targets (a.k.a. blend shapes): MT j=P j−Pneutral Applying the morph targets: m P final=Pneutral∑ w j×MT j j=0 Skeletal Animation Tree of bones: B0 ,... , Bn Transform of a given bone (neutral): T Gi =T 0×...×T i−1 Transform of a given bone (posed): T Gi=T 0×P0×...×T i−1×Pi−1 Representation of the rotations Euler angles + understandable by humans - not composable - Gimbal lock Axis angle + understandable by humans - not composable Matrix + composable - not understandable by humans - quite costly in memory and computation Representation of the rotations Quaternion + composable + numerically stable + low memory + low computation costs - really not understandable by humans Skeletal Animation: Skinning Define bone influences
    [Show full text]
  • 3D Modeling and the Role of 3D Modeling in Our Life
    ISSN 2413-1032 COMPUTER SCIENCE 3D MODELING AND THE ROLE OF 3D MODELING IN OUR LIFE 1Beknazarova Saida Safibullaevna 2Maxammadjonov Maxammadjon Alisher o’g’li 2Ibodullayev Sardor Nasriddin o’g’li 1Uzbekistan, Tashkent, Tashkent University of Informational Technologies, Senior Teacher 2Uzbekistan, Tashkent, Tashkent University of Informational Technologies, student Abstract. In 3D computer graphics, 3D modeling is the process of developing a mathematical representation of any three-dimensional surface of an object (either inanimate or living) via specialized software. The product is called a 3D model. It can be displayed as a two-dimensional image through a process called 3D rendering or used in a computer simulation of physical phenomena. The model can also be physically created using 3D printing devices. Models may be created automatically or manually. The manual modeling process of preparing geometric data for 3D computer graphics is similar to plastic arts such as sculpting. 3D modeling software is a class of 3D computer graphics software used to produce 3D models. Individual programs of this class are called modeling applications or modelers. Key words: 3D, modeling, programming, unity, 3D programs. Nowadays 3D modeling impacts in every sphere of: computer programming, architecture and so on. Firstly, we will present basic information about 3D modeling. 3D models represent a physical body using a collection of points in 3D space, connected by various geometric entities such as triangles, lines, curved surfaces, etc. Being a collection of data (points and other information), 3D models can be created by hand, algorithmically (procedural modeling), or scanned. 3D models are widely used anywhere in 3D graphics.
    [Show full text]
  • Automatic Expressive Deformations for Stylizing Motion
    Automatic Expressive Deformations for Stylizing Motion Paul Noble* and Wen Tang† School of Computing, University of Teesside Figure 1: A run cycle. Above: before. Below: after expressive deformations have been applied. Abstract 3D computer animation often struggles to compete with the flexibility and expressiveness commonly found in traditional animation, particularly when rendered non-photorealistically. We present an animation tool that takes skeleton-driven 3D computer animations and generates expressive deformations to the character geometry. The technique is based upon the cartooning and animation concepts of ‘lines of action’ and ‘lines of motion’ and automatically infuses computer animations with some of the expressiveness displayed by traditional animation. Motion and pose-based expressive deformations are generated from the motion data and the character geometry is warped along each limb’s individual line of motion. The effect of this subtle, yet significant, warping is twofold: geometric inter-frame consistency is increased which helps create visually smoother animated sequences, and the warped geometry provides a novel solution to the problem of implied motion in non-photorealistic still images. CR Categories: I.3.7 [Computer Graphics]: Animation; I.3.5 [Computer Graphics]: Curve, surface, solid, and object representations Figure 2: Examples of expressive limb deformations in Keywords: expressive deformations, cartoon animation, non- cartoons. © Hart (top) [Hart 1997]. (Used with permission.) photorealistic rendering, stylizing motion and joints can be broken [Williams 2001] if it makes for a more 1 Introduction appealing image or dynamic motion. As a result, the limbs of hand-crafted animated characters (both pencil and computer- Traditional animators have always had a rather flexible view of generated) are often distorted to accentuate a motion or imply an bone structure.
    [Show full text]
  • Human Skeleton System Animation Stephanie Cheng
    UNIVERSITY OF ZAGREB FACULTY OF ELECTRICAL ENGINEERING AND COMPUTING MASTER THESIS no. 1538 Human Skeleton System Animation Stephanie Cheng Zagreb, June 2017 SVEUČILIŠTE U ZAGREBU FAKULTET ELEKTROTEHNIKE I RAČUNARSTVA DIPLOMSKI RAD br. 1538 Animacija skeletnog modela čovjeka Stephanie Cheng Zagreb, lipanj 2017 Table of Contents 1. Introduction .......................................................................................................... 1 2. Skeletal animation theory .................................................................................... 2 3. Used tools ............................................................................................................. 5 3.1. OpenGL ..................................................................................................................... 5 3.1.1 Libraries ................................................................................................................... 8 3.2. Assimp ...................................................................................................................... 8 3.2.1. Assimp Data Structure ..................................................................................... 8 3.3. Blender ................................................................................................................... 10 3.4. Library Linmath ...................................................................................................... 11 4. Implementation .................................................................................................
    [Show full text]
  • On the Dimensionality of Deformable Face Models
    On the Dimensionality of Deformable Face Models CMU-RI-TR-06-12 Iain Matthews, Jing Xiao, and Simon Baker The Robotics Institute Carnegie Mellon University 5000 Forbes Avenue Pittsburgh, PA 15213 Abstract Model-based face analysis is a general paradigm with applications that include face recognition, expression recognition, lipreading, head pose estimation, and gaze estimation. A face model is first constructed from a collection of training data, either 2D images or 3D range scans. The face model is then fit to the input image(s) and the model parameters used in whatever the application is. Most existing face models can be classified as either 2D (e.g. Active Appearance Models) or 3D (e.g. Morphable Models.) In this paper we compare 2D and 3D face models along four axes: (1) representational power, (2) construction, (3) real-time fitting, and (4) self occlusion reasoning. For each axis in turn, we outline the differences that result from using a 2D or a 3D face model. Keywords: Model-based face analysis, 2D Active Appearance Models, 3D Morphable Models, representational power, model construction, non-rigid structure-from-motion, factorization, real- time fitting, the inverse compositional algorithm, constrained fitting, occlusion reasoning. 1 Introduction Model-based face analysis is a general paradigm with numerous applications. A face model is first constructed from either a set of 2D training images [Cootes et al., 2001] or a set of 3D range scans [Blanz and Vetter, 1999]. The face model is then fit to the input image(s) and the model parameters are used in whatever the application is.
    [Show full text]