Near Optimal Synthesis of Digital Animation from Multiple Motion Capture Clips

Total Page:16

File Type:pdf, Size:1020Kb

Near Optimal Synthesis of Digital Animation from Multiple Motion Capture Clips UNIVERSITA’ DEGLI STUDI DI MILANO Facoltà di Scienze Matematiche, Fisiche e Naturali Near Optimal Synthesis of Digital Animation from Multiple Motion Capture Clips Corso di Laurea Magistrale in: Tecnologie dell’Informazione e della Comunicazione Marco Alamia Matricola n. 736779 Relatore: Prof. Ing. Alberto N. BORGHESE Correlatore: Dott. Iuri FROSIO Anno Accademico 2008/2009 ABSTRACT The present work discusses theory and practice of a powerful animation method designed to generate walk animations for digital characters. Our system permits to interact with the animation, allowing the user to change at run-time several animation’s parameters, such as the motion direction or the gait style, influencing the final animation. Our work starts presenting the skeleton animation system and the motion capture system; then we explain how we can, thanks to these two techniques, generate a database of walk animation clips. The so obtained animations are provided to the animation system which links them together in a sequence. Linking is obtained generating a smooth transition from one clip to the next one through the use of the animation blending technique. Since we want to interact with the animation at run-time, the clip’s sequence is not given a priori, instead it is generated in real-time in respect to the user’s desires. To this aim we create a controller in charge of choosing the next clip in respect to the task that it is given, which can be simulating a walk along a line, or simulating a walk where motion direction and character’s orientation are required by the user. The controller leans on a selection policy in order to choose the next clip; in our work we propose two possible policies, a greedy one and a near-optimal one. The former goes through every animation contained in the database and evaluates the direct cost of adding the clip in exam to the sequence. The latter policy, instead, chooses the next clip evaluating an approximation of the optimal choice, which is obtained through the implementation of a reinforcement learning algorithm. The optimal choice estimates both the direct cost of choosing a clip as well as all the future costs that the system will pay for that choice. Unfortunately we can’t effectively implement the optimal policy, therefore we content ourselves with an approximation that leads to the near- optimal policy. We lastly show how both these policies produce controllers capable of responding, in real- time, to the change of several animation parameters due to the user’s interaction as well as the environmental constraints. iii iv ACKNOWLEDGMENTS I want to thank my thesis supervisor associate professor Alberto N. Borghese for his contribution during the development of this thesis. I am grateful to him for passing on to me appreciation for accurate work and method. This thesis is the fruit of a close collaboration and would certainly not have been the same without his contribution. In addition I thank my co-supervisor dott. Iuri Frosio for his help with the motion capture system. I also wish to thank my family, who always supported me and encouraged me. I am grateful to you because you have always supported me with no hesitation. Thank you. v A mio padre per aver posto le basi di tutto ciò che so A mia madre per avermi sempre cresciuto con infinito amore, nel bene e nel male ed in ogni momento della mia vita A mio fratello, inestimabile compagno di viaggio verso Itaca A Flora per aver saputo recuperare vi vii CONTENTS ABSTRACT ................................................................................................................... III ACKNOWLEDGMENTS .............................................................................................. V CONTENTS................................................................................................................ VIII INTRODUCTION............................................................................................................ 1 1: Introduction ..........................................................................................................................................2 1.1 Introduction ....................................................................................................................................2 1.2 Animation framework ....................................................................................................................2 1.3 Blending system .............................................................................................................................3 1.3.1 Animation blending................................................................................................................3 1.3.2 Linear interpolation ................................................................................................................3 1.3.3 Clip blending system in the motion model .............................................................................4 1.4 Control System...............................................................................................................................5 1.5 Software implementation ...............................................................................................................6 MOTION MODEL .......................................................................................................... 8 2: Animation Framework ........................................................................................................................9 2.1 Introduction ....................................................................................................................................9 2.2 Skeletal Animation .........................................................................................................................9 2.3 The avatar model ............................................................................................................................9 2.3.1 Skeleton ............................................................................................................................... 11 2.3.2 Skinning ............................................................................................................................... 12 2.4 Key frames ................................................................................................................................... 14 2.5 Motion Capture ............................................................................................................................ 15 2.5.1 Motion Capture .................................................................................................................... 15 2.5.2 SMART – Motion Capture System ...................................................................................... 16 2.5.3 Noise and Missing Data ....................................................................................................... 18 2.5.4 Mapping ............................................................................................................................... 20 2.5.5 Skeleton’s Frenet frames ...................................................................................................... 20 2.5.6 Head’s Frenet frame ............................................................................................................. 22 2.6 Software implementation ............................................................................................................. 24 2.7 Conclusion summary ................................................................................................................... 25 3: Blending System ................................................................................................................................. 26 3.1 Introduction .................................................................................................................................. 26 3.2 Animation blending ..................................................................................................................... 26 3.3 Gait cycles .................................................................................................................................... 27 3.4 Motion model clips ...................................................................................................................... 28 3.4.1 Clips definition ..................................................................................................................... 28 3.4.2 Clips Constraints System ..................................................................................................... 29 3.4.3 Mirroring animations ........................................................................................................... 31 viii 3.5 Clip blending ................................................................................................................................ 34 3.5.1 Blending process .................................................................................................................. 34 3.6 Conclusion summary ................................................................................................................... 36 CONTROL SYSTEM .................................................................................................... 37 4: Control system .................................................................................................................................... 38 4.1 Introduction .................................................................................................................................
Recommended publications
  • The Uses of Animation 1
    The Uses of Animation 1 1 The Uses of Animation ANIMATION Animation is the process of making the illusion of motion and change by means of the rapid display of a sequence of static images that minimally differ from each other. The illusion—as in motion pictures in general—is thought to rely on the phi phenomenon. Animators are artists who specialize in the creation of animation. Animation can be recorded with either analogue media, a flip book, motion picture film, video tape,digital media, including formats with animated GIF, Flash animation and digital video. To display animation, a digital camera, computer, or projector are used along with new technologies that are produced. Animation creation methods include the traditional animation creation method and those involving stop motion animation of two and three-dimensional objects, paper cutouts, puppets and clay figures. Images are displayed in a rapid succession, usually 24, 25, 30, or 60 frames per second. THE MOST COMMON USES OF ANIMATION Cartoons The most common use of animation, and perhaps the origin of it, is cartoons. Cartoons appear all the time on television and the cinema and can be used for entertainment, advertising, 2 Aspects of Animation: Steps to Learn Animated Cartoons presentations and many more applications that are only limited by the imagination of the designer. The most important factor about making cartoons on a computer is reusability and flexibility. The system that will actually do the animation needs to be such that all the actions that are going to be performed can be repeated easily, without much fuss from the side of the animator.
    [Show full text]
  • Multimodal Behavior Realization for Embodied Conversational Agents
    Multimed Tools Appl DOI 10.1007/s11042-010-0530-2 Multimodal behavior realization for embodied conversational agents Aleksandra Čereković & Igor S. Pandžić # Springer Science+Business Media, LLC 2010 Abstract Applications with intelligent conversational virtual humans, called Embodied Conversational Agents (ECAs), seek to bring human-like abilities into machines and establish natural human-computer interaction. In this paper we discuss realization of ECA multimodal behaviors which include speech and nonverbal behaviors. We devise RealActor, an open-source, multi-platform animation system for real-time multimodal behavior realization for ECAs. The system employs a novel solution for synchronizing gestures and speech using neural networks. It also employs an adaptive face animation model based on Facial Action Coding System (FACS) to synthesize face expressions. Our aim is to provide a generic animation system which can help researchers create believable and expressive ECAs. Keywords Multimodal behavior realization . Virtual characters . Character animation system 1 Introduction The means by which humans can interact with computers is rapidly improving. From simple graphical interfaces Human-Computer interaction (HCI) has expanded to include different technical devices, multimodal interaction, social computing and accessibility for impaired people. Among solutions which aim to establish natural human-computer interaction the subjects of considerable research are Embodied Conversational Agents (ECAs). Embodied Conversation Agents are graphically embodied virtual characters that can engage in meaningful conversation with human users [5]. Their positive impacts in HCI have been proven in various studies [16] and thus they have become an essential element of A. Čereković (*) : I. S. Pandžić Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia e-mail: [email protected] I.
    [Show full text]
  • Data Driven Auto-Completion for Keyframe Animation
    Data Driven Auto-completion for Keyframe Animation by Xinyi Zhang B.Sc, Massachusetts Institute of Technology, 2014 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science in THE FACULTY OF GRADUATE AND POSTDOCTORAL STUDIES (Computer Science) The University of British Columbia (Vancouver) August 2018 c Xinyi Zhang, 2018 The following individuals certify that they have read, and recommend to the Fac- ulty of Graduate and Postdoctoral Studies for acceptance, the thesis entitled: Data Driven Auto-completion for Keyframe Animation submitted by Xinyi Zhang in partial fulfillment of the requirements for the degree of Master of Science in Computer Science. Examining Committee: Michiel van de Panne, Computer Science Supervisor Leonid Sigal, Computer Science Second Reader ii Abstract Keyframing is the main method used by animators to choreograph appealing mo- tions, but the process is tedious and labor-intensive. In this thesis, we present a data-driven autocompletion method for synthesizing animated motions from input keyframes. Our model uses an autoregressive two-layer recurrent neural network that is conditioned on target keyframes. Given a set of desired keys, the trained model is capable of generating a interpolating motion sequence that follows the style of the examples observed in the training corpus. We apply our approach to the task of animating a hopping lamp character and produce a rich and varied set of novel hopping motions using a diverse set of hops from a physics-based model as training data. We discuss the strengths and weak- nesses of this type of approach in some detail. iii Lay Summary Computer animators today use a tedious process called keyframing to make anima- tions.
    [Show full text]
  • Forsíða Ritgerða
    Hugvísindasvið The Thematic and Stylistic Differences found in Walt Disney and Ozamu Tezuka A Comparative Study of Snow White and Astro Boy Ritgerð til BA í Japönsku Máli og Menningu Jón Rafn Oddsson September 2013 1 Háskóli Íslands Hugvísindasvið Japanskt Mál og Menning The Thematic and Stylistic Differences found in Walt Disney and Ozamu Tezuka A Comparative Study of Snow White and Astro Boy Ritgerð til BA / MA-prófs í Japönsku Máli og Menningu Jón Rafn Oddsson Kt.: 041089-2619 Leiðbeinandi: Gunnella Þorgeirsdóttir September 2013 2 Abstract This thesis will be a comparative study on animators Walt Disney and Osamu Tezuka with a focus on Snow White and the Seven Dwarfs (1937) and Astro Boy (1963) respectively. The focus will be on the difference in story themes and style as well as analyzing the different ideologies those two held in regards to animation. This will be achieved through examining and analyzing not only the history of both men but also the early history of animation in Japan and USA respectively. Finally, a comparison of those two works will be done in order to show how the cultural differences affected the creation of those products. 3 Chapter Outline 1. Introduction ____________________________________________5 1.1 Introduction ___________________________________________________5 1.2 Thesis Outline _________________________________________________5 2. Definitions and perception_________________________________6 3. History of Anime in Japan_________________________________8 3.1 Early Years____________________________________________________8
    [Show full text]
  • Multi-Modal Authoring Tool for Populating a Database of Emotional Reactive Animations
    Multi-modal Authoring Tool for Populating a Database of Emotional Reactive Animations Alejandra Garc´ıa-Rojas, Mario Guti´errez, Daniel Thalmann and Fr´ed´eric Vexo Virtual Reality Laboratory (VRlab) Ecole´ Polytechnique F´ed´erale de Lausanne (EPFL) CH-1015 Lausanne, Switzerland {alejandra.garciarojas,mario.gutierrez,daniel.thalmann,frederic.vexo}@epfl.ch Abstract. We aim to create a model of emotional reactive virtual hu- mans. A large set of pre-recorded animations will be used to obtain such model. We have defined a knowledge-based system to store animations of reflex movements taking into account personality and emotional state. Populating such a database is a complex task. In this paper we describe a multimodal authoring tool that provides a solution to this problem. Our multimodal tool makes use of motion capture equipment, a handheld device and a large projection screen. 1 Introduction Our goal is to create a model to drive the behavior of autonomous Virtual Hu- mans (VH) taking into account their personality and emotional state. We are focused on the generation of reflex movements triggered by events in a Virtual Environment and modulated by inner variables of the VH (personality, emo- tions). We intend to build our animation model on the basis of a large set of animation sequences described in terms of personality and emotions. In order to store, organize and exploit animation data, we need to create a knowledge-based system (animations database). This paper focuses on the authoring tool that we designed for populating such animation database. We observe that the process of animating is inherently multi-modal because it involves many inputs such as motion capture (mocap) sensors and user control on an animation software.
    [Show full text]
  • Realistic Crowd Animation: a Perceptual Approach
    Realistic Crowd Animation: A Perceptual Approach by Rachel McDonnell, B.A. (Mod) Dissertation Presented to the University of Dublin, Trinity College in fulfillment of the requirements for the Degree of Doctor of Philosophy University of Dublin, Trinity College November 2006 Declaration I, the undersigned, declare that this work has not previously been submitted as an exercise for a degree at this, or any other University, and that unless otherwise stated, is my own work. Rachel McDonnell November 14, 2006 Permission to Lend and/or Copy I, the undersigned, agree that Trinity College Library may lend or copy this thesis upon request. Rachel McDonnell November 14, 2006 Acknowledgments First and foremost, I would like to thank my supervisor Carol O’Sullivan, for all her help, guidance, advice, and encouragement. I couldn’t imagine a better supervisor! Secondly, I would like thank the past and present members of the ISG, for making the last four years very enjoyable. In particular, Ann McNamara for her enthusiasm and for advising me to do a PhD. Also, to all my friends and in particular my housemates for taking my mind off my research in the evenings! A special thank you to Mam, Dad and Sarah for the all the encouragement and interest in everything that I do. Finally, thanks to Simon for helping me through the four years, and in particular for the all the support in the last few months leading up to submitting. Rachel McDonnell University of Dublin, Trinity College November 2006 iv Realistic Crowd Animation: A Perceptual Approach Publication No. Rachel McDonnell, Ph.D.
    [Show full text]
  • An Online Database of Animal Motions
    THE CHIMAERA PROJECT: AN ONLINE DATABASE OF ANIMAL MOTIONS A Thesis by JULIE KATHERINE GELÉ Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE December 2007 Major Subject: Visualization Sciences THE CHIMAERA PROJECT: AN ONLINE DATABASE OF ANIMAL MOTIONS A Thesis by JULIE KATHERINE GELÉ Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Approved by: Chair of Committee, Carol LaFayette Committee Members, Ergun Akleman B. Stephen Carpenter, II Head of Department, Mark J. Clayton December 2007 Major Subject: Visualization Sciences iii ABSTRACT The Chimaera Project: An Online Database of Animal Motions. (December 2007) Julie Katherine Gelé, B.S., Texas A&M University Chair of Advisory Committee: Prof. Carol LaFayette Digital animators will save vast amounts of project time by starting with a completed skeleton and some base animations. This result can be accomplished with Web 2.0 technologies by creating a repository of skeletons and animations that any animator may use for free. While free Maya™ skeletons currently exist on the Internet, the websites housing them have only brief features and functions for browsing and interacting with these files. None of these websites contain downloadable animations for the provided skeletons. The Chimaera Project improves the field of Web 2.0 sites offering free rigs by offering many new features and freedoms to the animation community. Users may upload and download Maya™ skeletons, share comments and tips with each other, upload animations associated with the skeletons, and search or browse the skeletons in a variety of ways.
    [Show full text]
  • Spatial Motion Doodles: Sketching Animation in VR Using Hand Gestures and Laban Motion Analysis Maxime Garcia, Rémi Ronfard, Marie-Paule Cani
    Spatial Motion Doodles: Sketching Animation in VR Using Hand Gestures and Laban Motion Analysis Maxime Garcia, Rémi Ronfard, Marie-Paule Cani To cite this version: Maxime Garcia, Rémi Ronfard, Marie-Paule Cani. Spatial Motion Doodles: Sketching Animation in VR Using Hand Gestures and Laban Motion Analysis. MIG 2019 - ACM SIGGRAPH Con- ference on Motion, Interaction and Games, Oct 2019, Newcastle upon Tyne, United Kingdom. 10.1145/3359566.3360061. hal-02303803 HAL Id: hal-02303803 https://hal.archives-ouvertes.fr/hal-02303803 Submitted on 9 Jan 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Spatial Motion Doodles: Sketching Animation in VR Using Hand Gestures and Laban Motion Analysis Maxime Garcia Rémi Ronfard Marie-Paule Cani Univ. Grenoble Alpes, CNRS, Inria, Univ. Grenoble Alpes, CNRS, Inria, Ecole Polytechnique, CNRS, LIX Grenoble INP, LJK Grenoble INP, LJK 9128, Palaiseau, France 38000 Grenoble, France 38000 Grenoble, France Figure 1: Left: A user drawing a spatial motion doodle (SMD) which is the six-dimensional trajectory of a moving frame (position and orientation), here attached to the HTC Vive controller. Right: The SMD is parsed into a string of motion tokens, allowing to recognize actions and extract the associated motion qualities.
    [Show full text]
  • Automatic Hand-Over Animation for Free-Hand Motions from Low Resolution Input
    Automatic hand-over animation for free-hand motions from low resolution input Chris Kang1, Nkenge Wheatland1, Michael Neff2, and Victor Zordan1 1 University of California, Riverside 2 University of California, Davis Abstract. Hand-over animation is the process by which hand animation is added to existing full-body motion. This paper proposes a technique for automatically synthesizing full-resolution, high quality free-hand mo- tion based on the capture of a specific, select small number of markers. Starting from a large full-resolution hand motion corpus, our technique extracts a finite pose database and selects the marker sets offline, based on user-defined inputs. For synthesis, capture sequences that include this marker set drive a reconstruction process that results in a full-resolution of the hand through the aid of the pose database. This effort addresses two distinct issues, first how to objectively select which is the best marker set based on a fixed number of desired markers and, second, how to per- form reconstruction from this data set automatically. Findings on both of these fronts are reported in this paper. Keywords: Character animation; motion capture; hand animation 1 Introduction Hand-over is a term used in the animation industry to refer to the process of adding hand animation to pre-existing full-body motion. Hand motion is a criti- cal part of many animations in which a full-body character is present. However, animation of the hand can be difficult especially where realism is important. While high-quality, full-body motion capture is a popular means for animating realistic characters, hand animation is most often not recorded at the same time as the motion of the full-body for a number of reasons.
    [Show full text]
  • Digital Sparks 03
    digital sparks 03 WETTBEWERB STUDENTISCHER MEDIENPROJEKTE Medienkunst / Mediengestaltung / Medieninformatik / Mediale Inszenierung und Vermittlung >digital sparks< Wettbewerb studentischer Medienprojekte >digital sparks< ist ein Wettbewerb für Studierende der Medienkunst, Mediengestaltung, Medieninformatik sowie für Studierende im Bereich mediale Inszenierung und Vermittlung. Ziel des Wettbewerbes ist es, den medienkulturellen Nach- wuchs zu fördern und zugleich einen Einblick in Forschung und Lehre an deutschsprachigen Hochschulen zu geben. >digital sparks< ist ein Modul der Internetplattform netzspan- nung.org, dem Wissensraum für digitale Kunst und Kultur. Für die Konzeption und Durchführung des Wettbewerbs ist die For- schungsgruppe MARS (Media Arts & Research Studies) am Fraunhofer-Institut für Medienkommunikation unter der Lei- tung von Monika Fleischmann und Wolfgang Strauss verant- wortlich. Das Bundesministerium für Bildung und Forschung fördert die Entwicklung von netzspannung.org. http://netzspannung.org/digital-sparks i M a MARS – Media Arts & Research Studies | Monika Fleischmann | Schloss Birlinghoven | D-53754 Sankt Augustin | [email protected] http://www.imk.fraunhofer.de/mars http://netzspannung.org 4 Inhalt Grußwort Christoph Ehrenberg 7 >digital sparks< 2003 Was war? Was kommt? – 9 Bestandsaufnahme und Ausblick Der Wettbewerb >digital sparks< Konzept und Durchführung 11 Jury/Vorjury 15 Die Preisträger Stellungnahme der Jury 21 how-to-bow.com 23 Loser Raum 27 machines will eat itself 31 Die nominierten
    [Show full text]
  • Data-Driven Approach to Synthesizing Facial Animation Using Motion Capture
    Animation of Natural Virtual Characters Data-Driven Approach to Synthesizing Facial Animation Using Motion Capture Kerstin Ruhland, Mukta Prasad, and Rachel McDonnell ■ Trinity College Dublin n the entertainment industry, cartoon anima- of the refining stage, with easily editable curves for tors have a long tradition of creating expres- final polish by an animator. sive characters that are highly appealing to The proposed approach replaces the acting and Iaudiences. It is particularly important to get right blocking phases by recording the artists’ move- the movement of the face, eyes, and head because ments in front of a real-time video-based motion they are the main communicators of the charac- tracking and retargeting system (see Figure 1). ter’s internal state and emotions and an indication We use a commercially available and affordable of its level of engagement with real-time facial-retargeting system for this stage, others. However, creating these which creates the initial motion curves of the Producing cartoon animations highly appealing animations is character animation. These motion curves con- is a laborious task, and there a labor-intensive task that is not tain realistic human movements that are typically is a distinct lack of automatic supported by automatic tools. Re- dense in keyframes, making them difficult for an tools to help animators, ducing the animator’s effort and animator to edit. particularly with creating facial speeding up this process, while Our approach thus focuses on the next stage animation. To speed up and allowing for artistic style and of the pipeline. The initial motion curves of the ease this process, the proposed creativity, would be invaluable to character animation provide the input to our algo- method uses real-time video- production companies that cre- rithm.
    [Show full text]
  • Intuitive Facial Animation Editing Based on a Generative RNN Framework
    ACM SIGGRAPH / Eurographics Symposium on Computer Animation 2020 Volume 39 (2020), Number 8 J. Bender and T. Popa (Guest Editors) Intuitive Facial Animation Editing Based On A Generative RNN Framework Eloïse Berson,1;2Catherine Soladié2 and Nicolas Stoiber1 1Dynamixyz, France 2CentraleSupélec, CNRS, IETR, UMR 6164, F-35000, France Abstract For the last decades, the concern of producing convincing facial animation has garnered great interest, that has only been accelerating with the recent explosion of 3D content in both entertainment and professional activities. The use of motion capture and retargeting has arguably become the dominant solution to address this demand. Yet, despite high level of quality and automation performance-based animation pipelines still require manual cleaning and editing to refine raw results, which is a time- and skill-demanding process. In this paper, we look to leverage machine learning to make facial animation editing faster and more accessible to non-experts. Inspired by recent image inpainting methods, we design a generative recurrent neural network that generates realistic motion into designated segments of an existing facial animation, optionally following user- provided guiding constraints. Our system handles different supervised or unsupervised editing scenarios such as motion filling during occlusions, expression corrections, semantic content modifications, and noise filtering. We demonstrate the usability of our system on several animation editing use cases. CCS Concepts arXiv:2010.05655v1 [cs.GR] 12 Oct 2020 • Computing methodologies ! Motion processing; Neural networks; 1. Introduction of motion capture (mocap) technologies have opened a new era, where realistic animation generation is more deterministic and re- Creating realistic facial animation has been a long-time challenge peatable.
    [Show full text]