Zoom, Enhance, Synthesize! Magic Upscaling and Material Synthesis using Deep Learning

1 March 2017

Marco Foco, Developer Technology Engineer
Dmitry Korobchenko, Deep Learning R&D Engineer
Andrew Edelsten, Senior Developer Technology Manager

Session Description: Recently, deep learning has revolutionized computer vision and other recognition problems. Everyday applications using such techniques are now commonplace, with more advanced tasks being automated at a growing rate. During 2016, “image synthesis” techniques started to appear that used deep neural networks to apply style-transfer algorithms for image restoration. The speakers review some of these techniques and demonstrate their application in image magnification to enable “super resolution” tools. The speakers also discuss recent discoveries by NVIDIA Research that use AI, machine learning and deep learning based approaches to greatly improve the process of creating game-ready materials. Using these novel techniques, artists can use standard DSLR, or even cell phone cameras, to create full renderable materials in minutes. The session concludes by showing how developers can integrate these methods into their existing art pipelines.

Takeaway: Attendees will gain information about the latest applications of machine and deep learning for content creation and get access to new resources to improve their work.

Intended Audience: Texture artists, art directors, tool programmers, and anyone interested in the latest evolution of deep learning in game development.

1 Overview

• Welcome
• What is Deep Learning?
• “GameWorks: Materials & Textures” [producers and artists rejoice]
• Examine in detail the design of one tool [coders bathe in technical details]
• Wrap up


2 Deep Learning – What is it?

• AI vs ML vs DL – great explanation: https://goo.gl/hkayWG
• Why now?
  • Better algorithms
  • Large GPU compute
  • Large datasets
• Now, huge progress in many fields:
  • Speech (recognition, synthesis)
  • Vision (classification, localization)
  • Language (search, translation)
  • Game AI (Go, Doom, Poker)

Machine Learning at its most basic is the practice of using algorithms to parse data, learn from it, and then make a determination or prediction about something in the world. So rather than hand-coding routines with a specific set of instructions to accomplish a particular task, the machine is “trained” using large amounts of data and algorithms that give it the ability to learn how to perform the task.

One approach to ML was “artificial neural networks”: basically, use “simple” math in a distributed way to try to mimic the way we think neurons in the brain work. For years, ANNs yielded little, until Prof. Hinton at the University of Toronto made the algorithms parallel, the algorithms were put on GPUs, and training sets exploded in size.

We use DL every day, a lot: web search/Google Now, Facebook image/face tagging, language translation, style transfer.

Neural networks are so useful, so why now?
• Better algorithms: academics never stopped researching; they just couldn’t try things out until recently (e.g. the RNN LSTM was invented in 1997: Hochreiter, Sepp and Schmidhuber, Jürgen, “Long Short-Term Memory”, Neural Computation, 9(8):1735–1780, 1997).
• Large datasets: the digital lifestyles we live lead to huge data collection.
• Large compute: it turns out the math for NNs is HIGHLY parallel, just like graphics. Yay GPUs!

3 Deep Learning is Ready For Use

• Already many ways to use deep learning today (just in: Baidu Deep Voice)
  • Chat bots
  • Data science and market analysis (e.g. brand sentiment analysis)
  • Text-to-speech & voice recognition
  • Nival’s new “Boris” AI for Blitzkrieg 3 – see https://goo.gl/Ah4Mov
• Think how to use it in your game
  • Can image classifiers ID NPCs in bug screenshots?
  • Google’s new Perspective API – http://perspectiveapi.com – for “toxic” forums/comments
• Check services from Google, AWS, Azure if you don’t “roll your own”

4 Deep Learning for Art Right Now

• Style transfer
• Generative networks creating images and voxels
• Adversarial networks (DCGAN) – still early but promising
• DL & ML based tools from NVIDIA and partners:
  • NVIDIA
  • Artomatix
  • Allegorithmic
  • Autodesk

5 Style Transfer: Something Fun!

• Doodle a masterpiece!
• Sept 2015: “A Neural Algorithm of Artistic Style” by Gatys et al. (content + style example images)
• Uses a CNN to take the “style” from one image and apply it to another
• Dec 2015: neural-style (GitHub)
• Mar 2016: neural-doodle (GitHub)
• Mar 2016: texture-nets (GitHub)
• Oct 2016: fast-neural-style (GitHub)
• Also numerous services: Vinci, Prisma, Artisto

References:
• “A Neural Algorithm of Artistic Style” by Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge
• https://github.com/jcjohnson/neural-style – GitHub repo by Justin Johnson
• https://github.com/jcjohnson/fast-neural-style – GitHub repo by Justin Johnson
• https://github.com/alexjc/neural-doodle – GitHub repo by @alexjc

Services:
• http://ostagram.ru/static_pages/lenta
• https://www.instapainting.com/ai-painter – iOS app (calls out to a server)
• http://prisma-ai.com/

Run your own web service: https://github.com/hletrd/neural_style

Decent tutorial: http://www.makeuseof.com/tag/create-neural-paintings-deepstyle-ubuntu/

6 http://ostagram.ru/static_pages/lenta

Can generate some pretty amazing artwork very easily.

But in addition to being a great toy, there is great potential: the AI is actually drawing pixels in a meaningful way.

7 Style Transfer: Something Useful

• Game remaster & texture enhancement
  • Try neural-style and use a real-world photo for the “style”
  • For stylized or anime up-res, try https://github.com/nagadomi/waifu2x
  • NVIDIA’s new tool
• Experimenting with art styles
• Dream or power-up sequences
• “Come Swim” by Kristen Stewart – https://arxiv.org/pdf/1701.04928v1.pdf

“Come Swim” paper: https://arxiv.org/pdf/1701.04928v1.pdf
Bhautik J Joshi – Research Engineer, Adobe
Kristen Stewart – Director, Come Swim
David Shapiro – Producer, Starlight Studios
https://www.theguardian.com/film/2017/jan/20/kristen-stewart-research-paper-neural-style-transfer

8 NVIDIA’s Goals for DL in Game Development

• Looking at all the research, clearly there’s scope for tools based on DL
• Goals:
  • Expand the use of deep learning into content creation
  • Remove the mundane and repetitive
  • Promote increased creativity, realism and experimentation

9 “GameWorks: Materials & Textures”

• Set of tools targeting the game industry using machine learning and deep learning
• https://gwmt.nvidia.com
• First release targets textures and materials
• Tools in this initial release:
  • Photo To Material: 2Shot
  • Super-Resolution
  • Texture Multiplier

10 GameWorks: Materials & Textures beta

• Tools run as a web service
• Sign up for the beta at https://gwmt.nvidia.com
• Seeking feedback from artists on usage of tools and quality
• Also interested in feedback from programmers on automation, pipeline and engine integration

11 Photo To Material: 2Shot

• From two photos of a surface, generate a “material”
• Based on a SIGGRAPH 2015 paper by NVIDIA Research and Aalto University (Finland)
  • “Two-Shot SVBRDF Capture for Stationary Materials”
  • https://mediatech.aalto.fi/publications/graphics/TwoShotSVBRDF/
• Input is pixel-aligned “flash” and “guide” photographs
  • Use a tripod and remote shutter or bracket
  • Or align later
• Use for flat surfaces with repeating patterns

12 Material Synthesis from Two Photos

(Figure: flash image and guide image inputs; resulting material maps: diffuse albedo, specular albedo, normals, glossiness, anisotropy.)

13 Material Synthesis Process


SVBRDF – spatially varying bidirectional reflectance distribution function

14 Demo

Photo To Material: 2Shot

15 Photo To Material: 1Shot

• What’s better than two photos? One!
• SIGGRAPH 2016 paper by NVIDIA Research and Aalto University (Finland)
  • “Reflectance Modeling by Neural Texture Synthesis”
  • http://dl.acm.org/citation.cfm?id=2925917&preflayout=flat
  • Includes slides and a video presentation
• Uses advanced deep learning research
• Combines feature detection and style transfer to create materials
• Quality does not (yet) match 2Shot

16 1shot – EARLY Previews


17 Texture Multiplier

• Put simply: texture in, new texture out
• Inspired by Gatys et al.
  • “Texture Synthesis Using Convolutional Neural Networks”
  • https://arxiv.org/pdf/1505.07376.pdf
• Artomatix
  • Similar product: “Texture Mutation”
  • Very cool “Infinity Tile”
  • https://artomatix.com/

Currently in beta. Some artifacts; 256×256 now, with 512 and 1024 coming.

18 Super Resolution

• Final tool in the first roll-out of GameWorks: Materials & Textures
• Introducing Dmitry and Marco
• Deep dive on the tool, explaining some recent DL-based research and techniques

19 Zoom! Enhance!

(Speech-bubble gag: “Can you enhance that?” “Sure!” “Zoom in on the license plate.” “Yes!”)

20 Super-resolution: the task

(Figure: a given low-resolution image of size W × H is upscaled to a constructed high-resolution image of size nW × nH.)

The task is to “generate” a bigger image from a smaller one. If we want to use machine learning to do this, we can create two sets, one of big images and one of their downscaled versions, and train our system on these two sets.
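To make that concrete, here is a minimal sketch of building such paired sets by downscaling a folder of high-resolution images (the folder name, scale factor, and use of Pillow are illustrative assumptions, not the speakers’ pipeline):

```python
# Minimal sketch: build (low-res, high-res) training pairs by downscaling.
from pathlib import Path
from PIL import Image

SCALE = 4  # upscaling factor n

def make_pair(path, scale=SCALE):
    hr = Image.open(path).convert("RGB")
    # Crop so both dimensions divide evenly by the scale factor.
    hr = hr.crop((0, 0, hr.width - hr.width % scale,
                  hr.height - hr.height % scale))
    lr = hr.resize((hr.width // scale, hr.height // scale), Image.BICUBIC)
    return lr, hr

pairs = [make_pair(p) for p in Path("hr_images").glob("*.png")]
```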

21 Super-resolution as a reconstruction task

(Figure: an unknown original high-resolution image is downscaled to the given image; reconstruction attempts to recover it.)

Can we reconstruct the original image?

Another option is to view our task as reconstruction. If we make the downscaling part of the process, we can use just one set, and the expected output of our system is the input itself.

22 Super-resolution: ill-posed task

(Figure: the pixels of the original image pass through downscaling to the given image, then through reconstruction; information is lost at the downscaling step, so the pixels of the reconstructed image are unknown.)

gameworks.nvidia.com 23

But the problem is ill-posed: we first remove some information, and then try to reconstruct the image from less data (1/4 of it in this case; 1/n² for a downscale factor n).

23 Super-resolution: ill-posed task

(Same figure as before.)

Reconstruction of the original image is impossible


So we can’t reconstruct the original image; the information is missing!

24 Super-resolution: ill-posed task

OR DO YOU?


25 Where does the magic come from?

• Let’s consider an 8×8 patch of some 8-bit grayscale image
• How many such patches are there?


Let’s consider a small portion of the original image, say an 8×8 patch, and a single 8-bit channel.

26 Where does the magic come from?

• Let’s consider an 8×8 patch of some 8-bit grayscale image
• How many such patches are there?

$N = 256^{8 \times 8} \approx 10^{154}$

The number of possible values per pixel is 256, and the number of pixels is 8×8 = 64, so the total number of possible patches is quite big.
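Written out, the count is:

```latex
N = 256^{8 \times 8} = \left(2^{8}\right)^{64} = 2^{512} \approx 1.34 \times 10^{154}
```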

27 Where does the magic come from?

• Let’s consider an 8×8 patch of some 8-bit grayscale image
• How many such patches are there?

$N = 256^{8 \times 8} \approx 10^{154}$

• More than the number of atoms in the observable universe

We don’t need that much

That’s more than the number of atoms in the observable universe; maybe an image contains less information than this.

28 Where does the magic come from?

(Figure: nested sets; photos, textures and natural images are tiny subsets of all possible images.)

We want to work in the domain of “natural images”

Indeed, among all possible images, photos and textures are a very small subset.

29 Super-resolution under constraints

• Data from natural images is sparse or compressible in some domain
• To reconstruct such images, some prior information or constraints are required

(Figure: downscaling followed by reconstruction, now aided by prior information and constraints.)

If we constrain our problem to deal with natural images and textures, we can enhance the content without much loss.

30 Hand-crafted constraints and priors

• Interpolation (bicubic, Lanczos, etc.)
• Interpolation + sharpening (and other filtering)

(Figure: interpolation vs. filter-based sharpening examples.)

• Such methods are data-independent
• Very rough estimation of the data behavior → too general

One possible option is to construct an upscaling method that takes some a priori decisions about the resulting image (e.g. sharpness). This will work in some cases, but in general requires a lot of manual work to hand-craft the upscaling logic into our algorithm. We need a better method: something that looks at images from our specific domain and finds which features are interesting. These methods are usually machine learning methods.
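For comparison later, a minimal sketch of this hand-crafted baseline, bicubic interpolation followed by an unsharp mask (filenames and sharpening parameters are illustrative assumptions):

```python
# Hand-crafted baseline: bicubic upscaling followed by an unsharp mask.
from PIL import Image, ImageFilter

lr = Image.open("input_lr.png").convert("RGB")
up = lr.resize((lr.width * 4, lr.height * 4), Image.BICUBIC)

# Unsharp mask: boost the high frequencies blurred away by interpolation.
sharp = up.filter(ImageFilter.UnsharpMask(radius=2, percent=150, threshold=3))
sharp.save("output_baseline.png")
```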

31 Super-resolution: machine learning

Idea: use machine learning to capture prior knowledge and statistics from the data

(Figure: machine learning at the intersection of mathematical optimization, computer science and statistics.)

The idea is to exploit prior knowledge about our image domain, which we can gather using machine learning: the technique of building intelligent systems that are not explicitly programmed, but trained using error minimization to capture and exploit the internal structure and features of the training data automatically.

32 Patch-based mapping

(Figure: a mapping model with parameters transforms a low-resolution patch into a high-resolution patch.)

Let's reduce our task to a simpler one: transformation of an image patch. Consider a mapping function that constructs a high-resolution patch from a given low-resolution patch of the input image. This mapping function depends on a set of parameters, which we want to find using machine learning.

33 Patch-based mapping: training

(Figure: the same mapping model, plus its training data: pairs of LR and HR patches taken from training images.)

We train our model in a supervised fashion, so we need to collect a set of pairs of low-resolution patches and corresponding ground-truth high-resolution patches, which is easy to do if we have a set of high-resolution images.
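A minimal sketch of collecting such patch pairs from one high-resolution image (the scale, patch size, stride, and grayscale conversion are illustrative assumptions):

```python
# Extract aligned LR/HR patch pairs from an HR image and its downscaled copy.
import numpy as np
from PIL import Image

SCALE, LR_PATCH = 4, 8                 # HR patch is SCALE * LR_PATCH = 32

def extract_pairs(hr_path, stride=8):
    hr = Image.open(hr_path).convert("L")
    lr = hr.resize((hr.width // SCALE, hr.height // SCALE), Image.BICUBIC)
    hr, lr = np.asarray(hr), np.asarray(lr)
    pairs = []
    for y in range(0, lr.shape[0] - LR_PATCH + 1, stride):
        for x in range(0, lr.shape[1] - LR_PATCH + 1, stride):
            lr_patch = lr[y:y + LR_PATCH, x:x + LR_PATCH]
            Y, X, S = y * SCALE, x * SCALE, LR_PATCH * SCALE
            hr_patch = hr[Y:Y + S, X:X + S]    # the corresponding HR region
            pairs.append((lr_patch, hr_patch))
    return pairs
```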

34 Patch-based mapping: training

(Figure: the pairs of patches are fed into the training process.)

And we pass that set of pairs into the training process,

35 Patch-based mapping: training

(Same figure: training completes and fixes the model parameters.)

after which we expect that our model will be able to predict the high-resolution patch in the most optimal way.

36 Patch-based mapping

(Figure: the LR patch $x_L$ is encoded into high-level information about the patch, its “features”, and then decoded into the HR patch $x_H$.)

A good way to build the mapping function is to encode the input patch into some intermediate scale-invariant representation that carries semantic information about the patch.

37 Patch-based mapping: sparse coding

(Figure: the same encode/decode pipeline, where the intermediate “features” are a sparse code.)

One way to build such a representation is sparse coding. Here we exploit our prior knowledge that our signal is sparse in some domain.

38 Sparse coding and dictionary learning

• An image patch can be represented as a sparse linear combination of dictionary elements
• The dictionary is learned from the data (in contrast to a hand-crafted dictionary like the DCT)

$x = Dz = d_1 z_1 + \dots + d_K z_K$

where $D$ is the dictionary, $x$ is the patch, and $z$ is the sparse code.

(Figure example: $x = 0.8\,d_{36} + 0.3\,d_{42} + 0.5\,d_{63}$.)


In particular, we assume that the patch can be represented as a linear combination of only a small number of elements from some dictionary. Using that dictionary, we can obtain the corresponding coefficients (also known as the sparse code, carrying the high-level representation) and vice versa.
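A minimal sketch of dictionary learning and sparse coding with scikit-learn (the random stand-in data and all hyperparameters are assumptions):

```python
# Learn a dictionary from patches, then sparse-code a patch against it.
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning, SparseCoder

patches = np.random.rand(10000, 64)        # stand-in for 8x8 image patches

# Learn a dictionary D whose atoms sparsely represent the patches.
dico = MiniBatchDictionaryLearning(n_components=256, alpha=1.0)
D = dico.fit(patches).components_          # shape (256, 64)

# Encode: find a code z with few nonzeros such that x ~ Dz (here via OMP).
coder = SparseCoder(dictionary=D, transform_algorithm="omp",
                    transform_n_nonzero_coefs=5)
z = coder.transform(patches[:1])           # sparse code, shape (1, 256)
x_rec = z @ D                              # reconstruction: x ~ Dz
```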

39 Patch-based mapping via sparse coding

(Figure: the LR patch $x_L$ enters the mapping.)

How can this be used for super-resolution? Given a low-resolution patch...

40 Patch-based mapping via sparse coding

(Figure: the encode step of the mapping.)

$z = \arg\min_z \|D_L z - x_L\|^2 + \gamma \|z\|_0$

where $x_L$ is the LR patch, $D_L$ is the LR dictionary, and $z$ is the sparse code.

and a low-resolution dictionary, we perform the sparse encoding (using some optimization procedure).

41 Patch-based mapping via sparse coding

(Figure: the full mapping: encode, then decode.)

$z = \arg\min_z \|D_L z - x_L\|^2 + \gamma \|z\|_0, \qquad x_H = D_H z$

where $D_H$ is the HR dictionary and $x_H$ is the HR patch.

Then, given the sparse code and the high-resolution dictionary, we perform the decoding by simply calculating the linear combination.
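Putting encode and decode together, a sketch of the LR-to-HR patch mapping, assuming coupled dictionaries $D_L$ and $D_H$ have already been learned jointly so that a patch pair shares one sparse code (the OMP solver and sparsity level are assumptions):

```python
# LR -> HR patch mapping via coupled dictionaries (assumed pre-learned).
from sklearn.decomposition import SparseCoder

def upscale_patch(x_l, D_L, D_H, k=5):
    """x_l: flattened LR patch (NumPy vector); returns flattened HR patch."""
    coder = SparseCoder(dictionary=D_L, transform_algorithm="omp",
                        transform_n_nonzero_coefs=k)
    z = coder.transform(x_l[None, :])      # encode against the LR dictionary
    return (z @ D_H)[0]                    # decode against the HR dictionary
```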

42 Patch-based mapping via sparse coding

(Figure: the LR dictionary $D_L$ and the HR dictionary $D_H$ are learned from training data.)

We train the dictionaries to capture the internal structure of the signal by maximizing the sparsity of the encoding.

43 Generalized patch-based mapping

(Figure: generalized pipeline; the LR patch is mapped to a high-level representation of the LR patch, a mapping in the feature space produces a high-level representation of the HR patch, and a final mapping produces the HR patch.)

We may generalize the idea and build another mapping function with a more complex internal representation: first map the input patch to a corresponding high-level representation, then perform some transformation in that space, and then map the resulting high-level representation back to image space, i.e. to a high-resolution patch.

44 Generalized patch-based mapping

(Figure: the same pipeline, with trainable parameters $W_1$, $W_2$, $W_3$ for the three stages.)

All transformations depend on some parameters, which we adjust during the training. This could be a neural net, for example.

45 Mapping of the whole image: using convolution

(Figure: convolutional operators implement the mapping, the mapping in the feature space, and the inverse mapping across the whole LR image to produce the HR image.)

Now let's recall that we actually want to do super-resolution for the whole image. In this case, we can apply our patch-based transformation to the set of all overlapping patches of the input image, and then assemble the resulting high-resolution patches into the high-resolution output. These operations can be implemented via convolutional operators, and the resulting structure is very similar to one well-known type of neural network: the auto-encoder.

46 Auto-encoder

(Figure: input → network → output ≈ input.)


What’s an auto-encoder? It’s a neural network trained to reconstruct its input. The difficult part is doing this through an internal representation with less information (an hourglass structure).

47 Auto-encoder

(Figure: input → Encode → features → Decode → output ≈ input.)

An auto-encoder network is composed of two parts: an encoder, which takes the input and converts it to the internal representation (the feature space), and a decoder, which tries to regenerate the input.

48 Auto-encoder

Parameters: $W$

Inference: $y = F_W(x)$

Training: $W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(x_i))$, where $\{x_i\}$ is the training set.

When the encoder and decoder are modeled by a DNN, the parameter space is defined by a set of weights ($W$). In training, we try to minimize a specific loss function (or “distance”) between the input and the output. If there is enough information in the middle layer plus the prior knowledge, the reconstruction will be perfect (the distance will be 0); if there isn’t enough information, the network will try to minimize the distance measured on the training set.
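A minimal sketch of such an auto-encoder and one training step in PyTorch (the framework choice, layer sizes, and MSE as the distance are illustrative assumptions):

```python
# Minimal convolutional auto-encoder: encode to features, decode back.
import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.encode = nn.Sequential(           # image -> features (hourglass)
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU())
        self.decode = nn.Sequential(           # features -> image
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1))
    def forward(self, x):
        return self.decode(self.encode(x))

model = AutoEncoder()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.rand(8, 3, 64, 64)                   # stand-in training batch
loss = nn.functional.mse_loss(model(x), x)     # Dist(x, F_W(x))
opt.zero_grad()
loss.backward()
opt.step()
```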

49 Auto-encoder

• Our encoder is LOSSY by definition

(Figure: input → Encode → features, with information loss along the way.)

In our case, we KNOW the internal representation is LOSSY because we explicitly introduced a downscale layer (which removes information) when creating our encoder.

50 Super-resolution auto-encoder

Parameters: $W$

Inference: $y = F_W(x)$

Training: $W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(x_i))$, where $\{x_i\}$ is the training set.

51 Network topology

• Using global information
  • Fixed resolution
  • Better result (?)
• Using only local information
  • Fewer parameters
  • Scalable network
  • Less quality (?)

Using all pixels in the image: does this mean better results? Maybe. Using only local information we have fewer parameters and a scalable network. Does this mean less quality? Not necessarily; we are using LOCAL information.

52 Super-resolution convolutional auto-encoder

Parameters: $W$

• Only use size-independent layers
  • Convolution
  • Downscaling:
    • Pooling
    • Strided convolution
  • Upscaling:
    • Data replication
    • Interpolation
    • Deconvolution
• Local connections only

53 Super-resolution convolutional auto-encoder

• Why downscaling?
  • Collect multi-scale information
  • Augment the receptive field
• What do features at different scales mean?
  • Full-resolution features contain an approximation of the details
  • Deeper features:
    • Contain higher semantic information
    • Provide context for the detail generation
• Downscaling → information loss?
  • Information from all scales is collected into the encoded representation

Definition of the receptive field. Deeper, downscaled layers contain more features (channels).

54 SRCAE: Overview

In → Down → … → Down → Up → … → Up → Out

• In: input translation
• Down: N blocks (2× downscaling)
• Up: N+S blocks (2× upscaling)
• Out: output translation
• Total upscaling for the network: 2^S ×

55 SRCAE: Input translation

In → Down → … → Down → Up → … → Up → Out

• “In” block:
  • Convolution (5×5)
  • Feature expansion (3→32)
  • ReLU

56 SRCAE: Encoder

In → Down → … → Down → Up → … → Up → Out

• “Down” block:
  • 3×3 convolution, ReLU
  • 3×3 convolution, ReLU
  • 3×3 strided (2×) convolution with feature expansion, ReLU

57 SRCAE: Decoder

In → Down → … → Down → Up → … → Up → Out

• “Up” block:
  • 3×3 convolution, ReLU
  • 3×3 convolution, ReLU
  • 3×3 strided (2×) deconvolution with feature reduction, ReLU

58 SRCAE: Output

In → Down → … → Down → Up → … → Up → Out

• “Out” block:
  • Feature reduction (32→3)
  • (Optional) clipping to range (0–1 or 0–255)
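Gathering slides 54–58, a sketch of the SRCAE topology in PyTorch; the block structure follows the slides, while the per-block feature widths (doubling on each Down block) and the defaults for N and S are assumptions:

```python
# SRCAE sketch: "In" (5x5 conv, 3->32, ReLU), N "Down" blocks (2x downscale),
# N+S "Up" blocks (2x upscale), "Out" (32->3, optional clip). Upscaling: 2**S.
import torch.nn as nn

def down_block(c_in, c_out):
    # 3x3 conv, ReLU, 3x3 conv, ReLU, strided (2x) conv w/ expansion, ReLU.
    return nn.Sequential(
        nn.Conv2d(c_in, c_in, 3, padding=1), nn.ReLU(),
        nn.Conv2d(c_in, c_in, 3, padding=1), nn.ReLU(),
        nn.Conv2d(c_in, c_out, 3, stride=2, padding=1), nn.ReLU())

def up_block(c_in, c_out):
    # 3x3 conv, ReLU, 3x3 conv, ReLU, strided (2x) deconv w/ reduction, ReLU.
    return nn.Sequential(
        nn.Conv2d(c_in, c_in, 3, padding=1), nn.ReLU(),
        nn.Conv2d(c_in, c_in, 3, padding=1), nn.ReLU(),
        nn.ConvTranspose2d(c_in, c_out, 3, stride=2, padding=1,
                           output_padding=1), nn.ReLU())

class SRCAE(nn.Module):
    def __init__(self, n=2, s=1):   # n Down blocks; total upscaling 2**s
        super().__init__()
        downs = [down_block(32 << i, 32 << (i + 1)) for i in range(n)]
        ups = ([up_block(32 << (n - i), 32 << (n - i - 1)) for i in range(n)]
               + [up_block(32, 32) for _ in range(s)])
        self.body = nn.Sequential(
            nn.Conv2d(3, 32, 5, padding=2), nn.ReLU(),   # "In": 3 -> 32
            *downs, *ups)
        self.out = nn.Conv2d(32, 3, 3, padding=1)        # "Out": 32 -> 3
    def forward(self, x):
        return self.out(self.body(x)).clamp(0.0, 1.0)    # optional clipping
```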

59 SRCAE: Training

(Figure: a ground-truth HR image $x$ is downscaled by $D$ to the LR image $\hat{x}$, which the SRCAE $F_W$ maps to the reconstructed HR image $y$.)

$W = \arg\min_W \sum_i \mathrm{Dist}(x_i, F_W(D(x_i)))$, where $\{x_i\}$ is the training set.
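A minimal training-loop sketch for this objective; the bicubic downscaler, the MSE distance, the optimizer settings, and the hr_batches iterable are assumptions, and SRCAE is the sketch above:

```python
# Training: W = argmin sum_i Dist(x_i, F_W(D(x_i))).
import torch
import torch.nn.functional as F

model = SRCAE(n=2, s=1)                      # upscaling factor 2**1 = 2
opt = torch.optim.Adam(model.parameters(), lr=1e-4)

def downscale(x, factor=2):                  # D: ground-truth HR -> LR
    return F.interpolate(x, scale_factor=1 / factor, mode="bicubic",
                         align_corners=False)

for hr in hr_batches:                        # assumed iterable of HR batches
    lr = downscale(hr)                       # x_hat = D(x)
    loss = F.mse_loss(model(lr), hr)         # Dist(x, F_W(D(x)))
    opt.zero_grad()
    loss.backward()
    opt.step()
```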

60 SRCAE: Inference

(Figure: the given LR image $\hat{x}$ is passed through the SRCAE $F_W$ to produce the constructed HR image $y$.)

$y = F_W(\hat{x})$

61 Super-resolution: ill-posed task?


62 Distance/Loss function

The distance function is a key element in obtaining good results.

$W = \arg\min_W \sum_i D(x_i, F_W(x_i))$

The choice of the loss function is an important decision.

MSE, L2 and L1 metrics will eventually converge to the results shown before; indeed, when we started we were using MSE, but we obtained better results with another metric.

63 Loss function

MSE (Mean Squared Error): $\frac{1}{N}\|x - F(x)\|^2$


The loss function is important. Generally, people use the MSE loss function, which stands for mean squared error.

64 Loss function

MSE (Mean Squared Error): $\frac{1}{N}\|x - F(x)\|^2$

PSNR (Peak Signal-to-Noise Ratio): $10 \log_{10}\frac{MAX^2}{\mathrm{MSE}}$


Since we are solving an image reconstruction task, it is good to consider the correspondence between the loss we minimize and the image quality metrics we use to evaluate reconstruction quality. It is easy to notice that MSE closely relates to the well-known PSNR metric, which stands for peak signal-to-noise ratio. But MSE is too general, and PSNR poorly represents perceptual image quality. The solution is to find some other metric that is closer to human perception.

65 Loss function: HFEN

MSE (Mean Squared Error): $\frac{1}{N}\|x - F(x)\|^2$

PSNR (Peak Signal-to-Noise Ratio): $10 \log_{10}\frac{MAX^2}{\mathrm{MSE}}$

HFEN* (High Frequency Error Norm): $\|HP(x - F(x))\|^2$, where $HP$ is a high-pass filter (a perceptual loss)

* http://ieeexplore.ieee.org/document/5617283/

Since we want to reconstruct fine details, a metric that considers high frequencies could be useful. One of these is the High Frequency Error Norm, broadly used in medical imaging. It uses a high-pass operator to concentrate only on high-frequency details; applied to an image, the operator highlights the edges. Another advantage of this operator is that it is linear, thus differentiable and easily implementable within an auto-encoder loss function, which can now be considered a perceptual loss.
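A sketch of an HFEN-style loss term in PyTorch; the exact high-pass operator is not specified here, so a fixed 3×3 Laplacian kernel stands in for the LoG filter of the HFEN paper:

```python
# HFEN-style perceptual loss: compare high-pass filtered images.
# By linearity, HP(x - F(x)) = HP(x) - HP(F(x)).
import torch
import torch.nn.functional as F

LAPLACIAN = torch.tensor([[0., 1., 0.],
                          [1., -4., 1.],
                          [0., 1., 0.]]).view(1, 1, 3, 3)

def hfen_loss(pred, target):
    c = pred.shape[1]                        # apply the filter per channel
    k = LAPLACIAN.to(pred.device).repeat(c, 1, 1, 1)
    hp = lambda t: F.conv2d(t, k, padding=1, groups=c)
    return F.mse_loss(hp(pred), hp(target))  # ~ ||HP(x - F(x))||^2 / N
```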

66 Perceptual loss

(Figure: the image $x$ passes through a transformation $G$ to produce perceptual image features $G(x)$.)

We can generalize this idea. Suppose we have some transformation that extracts perceptual features.

67 Perceptual loss

Perceptual features $G(x)$:
• High-frequency information: $G(x) = \frac{1}{N} HP(x)$
• CNN features*: $G(x) = \mathrm{VGG}(x)$
• Other

* https://arxiv.org/abs/1603.08155

It could be the HFEN operator just mentioned, or some other operator extracting important details. Or it could be semantic features extracted by means of some convolutional neural network, for example VGG features.

68 Perceptual loss

Perceptual features $G(x)$:
• High-frequency information: $G(x) = \frac{1}{N} \mathrm{LoG}(x)$
• CNN features*: $G(x) = \mathrm{VGG}(x)$
• Other

Total loss = regular loss + perceptual loss:

$L = \frac{1}{N}\|x - F(x)\|^2 + \alpha \|G(x) - G(F(x))\|^2$

* https://arxiv.org/abs/1603.08155

Then, having a perceptual loss focused on some specific component, we can construct the total loss as a weighted sum of the regular content loss and the perceptual loss.

69 Perceptual loss

Perceptual features $G(x)$:
• High-frequency information: $G(x) = \frac{1}{N} \mathrm{LoG}(x)$
• CNN features*: $G(x) = \mathrm{VGG}(x)$
• Other

Total loss = regular loss + perceptual losses:

$L = \frac{1}{N}\|x - F(x)\|^2 + \alpha \|G_1(x) - G_1(F(x))\|^2 + \beta \|G_2(x) - G_2(F(x))\|^2 + \dots$

* https://arxiv.org/abs/1603.08155

Or, in the more general case, even a combination of different perceptual losses.
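A sketch of the combined loss with two perceptual terms, the high-pass term above as $G_1$ and fixed VGG16 features as $G_2$ (the VGG layer cut and the weights $\alpha$, $\beta$ are assumptions; hfen_loss is from the earlier sketch):

```python
# Total loss = content (MSE) + alpha * high-pass term + beta * VGG term.
import torch.nn.functional as F
from torchvision.models import vgg16

vgg_feats = vgg16(pretrained=True).features[:9].eval()  # up to relu2_2
for p in vgg_feats.parameters():
    p.requires_grad_(False)                  # fixed feature extractor
# (For real use, apply ImageNet normalization before the VGG.)

def total_loss(pred, target, alpha=0.1, beta=0.05):
    content = F.mse_loss(pred, target)                   # regular loss
    g1 = hfen_loss(pred, target)                         # G1: high-pass
    g2 = F.mse_loss(vgg_feats(pred), vgg_feats(target))  # G2: VGG features
    return content + alpha * g1 + beta * g2
```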

70 Regular loss

(Figure: 4× upscaling results with the regular loss.)

Here is the 4x upscaling result using regular MSE loss.

71 Regular loss + Perceptual loss

(Figure: 4× upscaling results with the regular + perceptual loss.)

And here is the upscaling with the perceptual loss: edges have become sharper, and the aliasing effect is reduced.

72 Demo

Super-Resolution

73 Generative Adversarial Networks

(Figure: a Generator produces images, which a Discriminator compares against real images.)

• Generator goal: maximize the error of the Discriminator
• Discriminator goal: distinguish generated images from real images

One of the breakthrough technologies in modern machine learning is Generative Adversarial Networks, or GANs. They are used to improve the quality of a generative model. For example, say we have a generator we want to train to generate human faces, and we want it to be good at it. For this reason, we construct a second network, called the discriminator, whose goal is to distinguish generated images from real images. The goal of the generator is then to generate images indistinguishable from the real ones, or equivalently, to maximize the error of the discriminator. Both are trained in parallel to boost their skills, and ideally we obtain a perfect generator in the end.

74 Super-resolution: GAN-based loss

(Figure: the generator $F(x)$ upscales $x$ to $y$; the discriminator $D(y)$ classifies $y$ as real or fake.)

GAN loss $= -\ln D(F(x))$

Total loss = regular loss + GAN loss

Super-resolution is also a generative task, so let's try to apply GANs to it. As the generator, let's take our super-resolution auto-encoder; as the discriminator, let's train a binary classifier that distinguishes upscaled from real high-resolution images. This alters the loss function of our auto-encoder, and the additional term can be considered a special type of perceptual loss.
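A sketch of adding the adversarial term; the tiny discriminator architecture and the weight $\alpha$ are illustrative assumptions, and the generator producing `pred` would be the super-resolution auto-encoder:

```python
# GAN loss for the generator: -ln D(F(x)), added to the regular loss.
import torch
import torch.nn as nn
import torch.nn.functional as F

disc = nn.Sequential(                        # tiny binary classifier D(y)
    nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(64, 1), nn.Sigmoid())

def generator_loss(pred, target, alpha=1e-3):
    content = F.mse_loss(pred, target)                 # regular loss
    gan = -torch.log(disc(pred) + 1e-8).mean()         # -ln D(F(x))
    return content + alpha * gan

def discriminator_loss(real, fake):
    # Train D separately: real -> 1, generated (detached) -> 0.
    d_real, d_fake = disc(real), disc(fake.detach())
    return (F.binary_cross_entropy(d_real, torch.ones_like(d_real)) +
            F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake)))
```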

75 Questions?

Marco Foco, Developer Technology Engineer
Dmitry Korobchenko, Deep Learning R&D Engineer
Andrew Edelsten, Senior Developer Technology Manager

Other machine learning and art talks and demos:
• GameWorks DL Tools are on display at the NVIDIA booth (South Hall #824)
• Photogrammetry talk NEXT at 12:30pm IN THIS ROOM
