Plenoptic Imaging and Vision Using Angle Sensitive Pixels

Total Page:16

File Type:pdf, Size:1020Kb

Plenoptic Imaging and Vision Using Angle Sensitive Pixels PLENOPTIC IMAGING AND VISION USING ANGLE SENSITIVE PIXELS A Dissertation Presented to the Faculty of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy by Suren Jayasuriya January 2017 c 2017 Suren Jayasuriya ALL RIGHTS RESERVED This document is in the public domain. PLENOPTIC IMAGING AND VISION USING ANGLE SENSITIVE PIXELS Suren Jayasuriya, Ph.D. Cornell University 2017 Computational cameras with sensor hardware co-designed with computer vision and graph- ics algorithms are an exciting recent trend in visual computing. In particular, most of these new cameras capture the plenoptic function of light, a multidimensional function of ra- diance for light rays in a scene. Such plenoptic information can be used for a variety of tasks including depth estimation, novel view synthesis, and inferring physical properties of a scene that the light interacts with. In this thesis, we present multimodal plenoptic imaging, the simultaenous capture of multiple plenoptic dimensions, using Angle Sensitive Pixels (ASP), custom CMOS image sensors with embedded per-pixel diffraction gratings. We extend ASP models for plenoptic image capture, and showcase several computer vision and computational imaging applica- tions. First, we show how high resolution 4D light fields can be recovered from ASP images, using both a dictionary-based machine learning method as well as deep learning. We then extend ASP imaging to include the effects of polarization, and use this new information to image stress-induced birefringence and remove specular highlights from light field depth mapping. We explore the potential for ASPs performing time-of-flight imaging, and in- troduce the depth field, a combined representation of time-of-flight depth with plenoptic spatio-angular coordinates, which is used for applications in robust depth estimation. Fi- nally, we leverage ASP optical edge filtering to be a low power front end for an embedded deep learning imaging system. We also present two technical appendices: a study of using deep learning with energy-efficient binary gradient cameras, and a design flow to enable agile hardware design for computational image sensors in the future. BIOGRAPHICAL SKETCH The author was born on July 19, 1990. From 2008 to 2012 he studied at the University of Pittsburgh, where he received a Bachelor of Science in Mathematics with departmental honors and a Bachelor of Arts in Philosophy. He then moved to Ithaca, New York to study at Cornell University from 2012 to 2017, where he earned his doctorate in Electrical and Computer Engineering in 2017. iii To my parents and brother. iv ACKNOWLEDGEMENTS I am grateful for my advisor Al Molnar for all his mentoring over these past years. It was a risk to accept a student whose background was mathematics and philosophy, and had no experience in ECE or CS, but Al was patient enough to allow me to make mistakes and grow as a researcher. I also want to thank my commitee members Alyssa Apsel and Steve Marschner for their advice throughout my graduate career. Several people made important contributions to this thesis. Sriram Sivaramakrishnan was my go-to expert on ASPs, and co-author on many of my imaging papers. Matthew Hirsch and Gordon Wetzstein, along with the supervision of Ramesh Raskar, collaborated on the dictionary-based learning for recovering 4D light fields in Chapter 3. Arjun Jauhari, Mayank Gupta, and Kuldeep Kulkarni, along with the supervision of Pavan Turaga, per- formed deep learning experiments to speed up light field reconstructions in Chapter 3. Ellen Chuang and Debashree Guruaribam performed the polarization experiments in Chapter 4. Adithya Pediredla and Ryuichi Tadano helped create the depth fields experimental proto- type and worked on applications in Chapter 5. George Chen realized ASP Vision from conception with me, and Jiyue Yang and Judy Stephen collected and analyzed the real digit and face recognition experiments in Chapter 6. Ashok Veeraraghavan helped supervise the projects in Chapters 5 and 6. The work in Appendix A was completed during a summer internship at NVIDIA Re- search under the direction of Orazio Gallo, Jinwei Gu, and Jan Kautz, with the face detec- tion results compiled by Jinwei Gu, the gesture recognition results run by Pavlo Molchanov, and the captured gradient images for intensity reconstruction by Orazio Gallo. The design flow in Appendix B was developed in collaboration with Chris Torng, Moyang Wang, Na- garaj Murali, Bharath Sudheendra, Mark Buckler, Einar Veizaga, Shreesha Srinath, and Taylor Pritchard under the supervision of Chris Batten. Jiyue Yang also helped with the de- v sign and layout of ASP depth field pixels, and Gerd Kiene designed the amplifier presented in the Appendix B. Special thanks go to Achuta Kadambi for many long discussions about computational imaging. I also enjoyed being a fly on the wall of the Cornell Graphics/Vision group es- pecially chatting with Kevin Matzen, Scott Wehrwein, Paul Upchurch, Kyle Wilson, Sean Bell. My heartfelt gratitude goes to my friends and family who supported me while I un- dertook this journey. I couldn’t have done it without Buddy Rieger, Crystal Lee Morgan, Brandon Hang, Chris Hazel, Elly Engle, Steve Miller, Kevin Luke, Ved Gund, Alex Ruy- ack, Jayson Myers, Ravi Patel, and John McKay. Finally, I’d like to thank my brother Gihan and my parents for their love and support for all these years. This thesis is dedicated to them. vi TABLE OF CONTENTS Biographical Sketch................................. iii Dedication...................................... iv Acknowledgements.................................v Table of Contents.................................. vii List of Tables....................................x List of Figures.................................... xi 1 Introduction1 1.1 Dissertation Overview............................2 2 Background5 2.1 Plenoptic Function of Light..........................5 2.2 Applications of Plenoptic Imaging......................6 2.3 Computational Cameras............................ 11 2.4 Angle Sensitive Pixels............................ 12 2.4.1 Background.............................. 13 2.4.2 Applications............................. 15 2.4.3 Our Contribution........................... 15 3 Angle 17 3.1 Introduction.................................. 17 3.2 Related Work................................. 19 3.3 Method.................................... 20 3.3.1 Light Field Acquisition with ASPs.................. 21 3.3.2 Image and Light Field Synthesis................... 24 3.4 Analysis.................................... 26 3.4.1 Frequency Analysis.......................... 26 3.4.2 Depth of Field............................ 27 3.4.3 Resilience to Noise.......................... 31 3.5 Implementation................................ 31 3.5.1 Angle Sensitive Pixel Hardware................... 31 3.5.2 Software Implementation....................... 33 3.6 Results..................................... 35 3.7 Compressive Light Field Reconstructions using Deep Learning....... 38 3.7.1 Related Work............................. 39 3.7.2 Deep Learning for Light Field Reconstruction............ 40 3.7.3 Experimental Results......................... 45 3.7.4 Discussion.............................. 52 3.7.5 Limitations.............................. 53 3.7.6 Future Directions........................... 54 vii 4 Polarization 55 4.1 Polarization Response............................. 55 4.2 Applications.................................. 59 4.3 Conclusions and Future Work......................... 62 5 Time-of-Flight 63 5.1 Motivation for Depth Field Imaging..................... 63 5.2 Related Work................................. 66 5.3 Depth Fields.................................. 67 5.3.1 Light Fields.............................. 69 5.3.2 Time-of-Flight Imaging....................... 70 5.3.3 Depth Fields............................. 70 5.4 Methods to Capture Depth Fields....................... 71 5.4.1 Pixel Designs............................. 71 5.4.2 Angle Sensitive Photogates...................... 73 5.4.3 Experimental Setup.......................... 73 5.5 Experimental Setup.............................. 75 5.6 Applications of Depth Fields......................... 76 5.6.1 Synthetic Aperture Refocusing.................... 76 5.6.2 Phase wrapping ambiguities..................... 77 5.6.3 Refocusing through partial occluders................ 78 5.6.4 Refocusing past scattering media................... 80 5.7 Discussion................................... 81 5.7.1 Limitations.............................. 83 5.8 Conclusions and Future Work......................... 83 6 Visual Recognition 85 6.1 Introduction.................................. 85 6.1.1 Motivation and Challenges...................... 86 6.1.2 Our Proposed Solution........................ 87 6.1.3 Contributions............................. 88 6.1.4 Limitations.............................. 88 6.2 Related Work................................. 89 6.3 ASP Vision.................................. 90 6.3.1 Hardcoding the First Layer of CNNs................. 91 6.3.2 Angle Sensitive Pixels........................ 92 6.4 Analysis.................................... 98 6.4.1 Performance on Visual Recognition Tasks.............. 98 6.4.2 FLOPS savings............................ 101 6.4.3 Noise analysis............................ 101 6.4.4 ASP parameter design space..................... 102 6.5 Hardware Prototype and Experiments.................... 102 viii 6.6
Recommended publications
  • CUDA-Based Global Illumination Aaron Jensen San Jose State
    Running head: CUDA-BASED GLOBAL ILLUMINATION 1 CUDA-Based Global Illumination Aaron Jensen San Jose State University 13 May, 2014 CS 180H: Independent Research for Department Honors Author Note Aaron Jensen, Undergraduate, Department of Computer Science, San Jose State University. Research was conducted under the guidance of Dr. Pollett, Department of Computer Science, San Jose State University. CUDA-BASED GLOBAL ILLUMINATION 2 Abstract This paper summarizes a semester of individual research on NVIDIA CUDA programming and global illumination. What started out as an attempt to update a CPU-based radiosity engine from a previous graphics class evolved into an exploration of other lighting techniques and models (collectively known as global illumination) and other graphics-based languages (OpenCL and GLSL). After several attempts and roadblocks, the final software project is a CUDA-based port of David Bucciarelli's SmallPt GPU, which itself is an OpenCL-based port of Kevin Beason's smallpt. The paper concludes with potential additions and alterations to the project. CUDA-BASED GLOBAL ILLUMINATION 3 CUDA-Based Global Illumination Accurately representing lighting in computer graphics has been a topic that spans many fields and applications: mock-ups for architecture, environments in video games and computer generated images in movies to name a few (Dutré). One of the biggest issues with performing lighting calculations is that they typically take an enormous amount of time and resources to calculate accurately (Teoh). There have been many advances in lighting algorithms to reduce the time spent in calculations. One common approach is to approximate a solution rather than perform an exhaustive calculation of a true solution.
    [Show full text]
  • Vol. 3 Issue 4 July 1998
    Vol.Vol. 33 IssueIssue 44 July 1998 Adult Animation Late Nite With and Comics Space Ghost Anime Porn NYC: Underground Girl Comix Yellow Submarine Turns 30 Frank & Ollie on Pinocchio Reviews: Mulan, Bob & Margaret, Annecy, E3 TABLE OF CONTENTS JULY 1998 VOL.3 NO.4 4 Editor’s Notebook Is it all that upsetting? 5 Letters: [email protected] Dig This! SIGGRAPH is coming with a host of eye-opening films. Here’s a sneak peak. 6 ADULT ANIMATION Late Nite With Space Ghost 10 Who is behind this spandex-clad leader of late night? Heather Kenyon investigates with help from Car- toon Network’s Michael Lazzo, Senior Vice President, Programming and Production. The Beatles’Yellow Submarine Turns 30: John Coates and Norman Kauffman Look Back 15 On the 30th anniversary of The Beatles’ Yellow Submarine, Karl Cohen speaks with the two key TVC pro- duction figures behind the film. The Creators of The Beatles’Yellow Submarine.Where Are They Now? 21 Yellow Submarine was the start of a new era of animation. Robert R. Hieronimus, Ph.D. tells us where some of the creative staff went after they left Pepperland. The Mainstream Business of Adult Animation 25 Sean Maclennan Murch explains why animated shows targeted toward adults are becoming a more popular approach for some networks. The Anime “Porn” Market 1998 The misunderstood world of anime “porn” in the U.S. market is explored by anime expert Fred Patten. Animation Land:Adults Unwelcome 28 Cedric Littardi relates his experiences as he prepares to stand trial in France for his involvement with Ani- meLand, a magazine focused on animation for adults.
    [Show full text]
  • A Practical Analytic Model for the Radiosity of Translucent Scenes
    A Practical Analytic Model for the Radiosity of Translucent Scenes Yu Sheng∗1, Yulong Shi2, Lili Wang2, and Srinivasa G. Narasimhan1 1The Robotics Institute, Carnegie Mellon University 2State Key Laboratory of Virtual Reality Technology and Systems, Beihang University a) b) c) Figure 1: Inter-reflection and subsurface scattering are closely intertwined for scenes with translucent objects. The main contribution of this work is an analytic model of combining diffuse inter-reflection and subsurface scattering (see Figure2). One bounce of specularities are added in a separate pass. a) Two translucent horses (63k polygons) illuminated by a point light source. The three zoomed-in regions show that our method can capture both global illumination effects. b) The missing light transport component if only subsurface scattering is simulated. c) The same mesh rendered with a different lighting and viewing position. Our model supports interactive rendering of moving camera, scene relighting, and changing translucencies. Abstract 1 Introduction Light propagation in scenes with translucent objects is hard to Accurate rendering of translucent materials such as leaves, flowers, model efficiently for interactive applications. The inter-reflections marble, wax, and skin can greatly enhance realism. The interac- between objects and their environments and the subsurface scatter- tions of light within translucent objects and in between the objects ing through the materials intertwine to produce visual effects like and their environments produce pleasing visual effects like color color bleeding, light glows and soft shading. Monte-Carlo based bleeding (Figure1), light glows and soft shading. The two main approaches have demonstrated impressive results but are computa- mechanisms of light transport — (a) scattering beneath the surface tionally expensive, and faster approaches model either only inter- of the materials and (b) inter-reflection between surface locations reflections or only subsurface scattering.
    [Show full text]
  • Photometric Registration of Indoor Real Scenes Using an RGB-D Camera with Application to Mixed Reality Salma Jiddi
    Photometric registration of indoor real scenes using an RGB-D camera with application to mixed reality Salma Jiddi To cite this version: Salma Jiddi. Photometric registration of indoor real scenes using an RGB-D camera with application to mixed reality. Computer Vision and Pattern Recognition [cs.CV]. Université Rennes 1, 2019. English. NNT : 2019REN1S015. tel-02167109 HAL Id: tel-02167109 https://tel.archives-ouvertes.fr/tel-02167109 Submitted on 27 Jun 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THESE DE DOCTORAT DE L'UNIVERSITE DE RENNES 1 COMUE UNIVERSITE BRETAGNE LOIRE ECOLE DOCTORALE N° 601 Mathématiques et Sciences et Technologies de l'Information et de la Communication Spécialité : Informatique Par Salma JIDDI Photometric Registration of Indoor Real Scenes using an RGB-D Camera with Application to Mixed Reality Thèse présentée et soutenue à Rennes, le 11/01/2019 Unité de recherche : IRISA – UMR6074 Thèse N° : Rapporteurs avant soutenance : Vincent Lepetit Professeur à l’Université de Bordeaux Alain Trémeau Professeur à l’Université Jean Monnet Composition du Jury : Rapporteurs : Vincent Lepetit Professeur à l’Université de Bordeaux Alain Trémeau Professeur à l’Université Jean Monnet Examinateurs : Kadi Bouatouch Professeur à l’Université de Rennes 1 Michèle Gouiffès Maître de Conférences à l’Université Paris Sud Co-encadrant : Philippe Robert Docteur Ingénieur à Technicolor Dir.
    [Show full text]
  • 3D Computer Graphics Compiled By: H
    animation Charge-coupled device Charts on SO(3) chemistry chirality chromatic aberration chrominance Cinema 4D cinematography CinePaint Circle circumference ClanLib Class of the Titans clean room design Clifford algebra Clip Mapping Clipping (computer graphics) Clipping_(computer_graphics) Cocoa (API) CODE V collinear collision detection color color buffer comic book Comm. ACM Command & Conquer: Tiberian series Commutative operation Compact disc Comparison of Direct3D and OpenGL compiler Compiz complement (set theory) complex analysis complex number complex polygon Component Object Model composite pattern compositing Compression artifacts computationReverse computational Catmull-Clark fluid dynamics computational geometry subdivision Computational_geometry computed surface axial tomography Cel-shaded Computed tomography computer animation Computer Aided Design computerCg andprogramming video games Computer animation computer cluster computer display computer file computer game computer games computer generated image computer graphics Computer hardware Computer History Museum Computer keyboard Computer mouse computer program Computer programming computer science computer software computer storage Computer-aided design Computer-aided design#Capabilities computer-aided manufacturing computer-generated imagery concave cone (solid)language Cone tracing Conjugacy_class#Conjugacy_as_group_action Clipmap COLLADA consortium constraints Comparison Constructive solid geometry of continuous Direct3D function contrast ratioand conversion OpenGL between
    [Show full text]
  • A Deep Learning Approach to No-Reference Image Quality Assessment for Monte Carlo Rendered Images
    EG UK Computer Graphics & Visual Computing (2018) G. Tam and F. Vidal (Editors) A Deep Learning Approach to No-Reference Image Quality Assessment For Monte Carlo Rendered Images J. Whittle1 and M. W. Jones1 1Swansea University, Department of Computer Science, UK Abstract In Full-Reference Image Quality Assessment (FR-IQA) images are compared with ground truth images that are known to be of high visual quality. These metrics are utilized in order to rank algorithms under test on their image quality performance. Throughout the progress of Monte Carlo rendering processes we often wish to determine whether images being rendered are of sufficient visual quality, without the availability of a ground truth image. In such cases FR-IQA metrics are not applicable and we instead must utilise No-Reference Image Quality Assessment (NR-IQA) measures to make predictions about the perceived quality of unconverged images. In this work we propose a deep learning approach to NR-IQA, trained specifically on noise from Monte Carlo rendering processes, which significantly outperforms existing NR-IQA methods and can produce quality predictions consistent with FR-IQA measures that have access to ground truth images. CCS Concepts •Computing methodologies ! Machine learning; Neural networks; Computer graphics; Image processing; 1. Introduction ground truth would preclude the need to perform additional ren- dering of the image. In such cases NR-IQA measures allow us to Monte Carlo (MC) light transport simulations are capable of mod- predict the perceived quality of images by modelling the distribu- elling the complex interactions of light with a wide range of phys- tion of naturally occurring distortions that are present in images ically based materials, participating media, and camera models to before convergence of the rendering process has been achieved.
    [Show full text]
  • A Deep Learning Approach to No-Reference Image Quality Assessment for Monte Carlo Rendered Images
    EG UK Computer Graphics & Visual Computing (2018) G. Tam and F. Vidal (Editors) A Deep Learning Approach to No-Reference Image Quality Assessment For Monte Carlo Rendered Images J. Whittle1 and M. W. Jones1 1Swansea University, Department of Computer Science, UK Abstract In Full-Reference Image Quality Assessment (FR-IQA) images are compared with ground truth images that are known to be of high visual quality. These metrics are utilized in order to rank algorithms under test on their image quality performance. Throughout the progress of Monte Carlo rendering processes we often wish to determine whether images being rendered are of sufficient visual quality, without the availability of a ground truth image. In such cases FR-IQA metrics are not applicable and we instead must utilise No-Reference Image Quality Assessment (NR-IQA) measures to make predictions about the perceived quality of unconverged images. In this work we propose a deep learning approach to NR-IQA, trained specifically on noise from Monte Carlo rendering processes, which significantly outperforms existing NR-IQA methods and can produce quality predictions consistent with FR-IQA measures that have access to ground truth images. CCS Concepts •Computing methodologies ! Machine learning; Neural networks; Computer graphics; Image processing; 1. Introduction ground truth would preclude the need to perform additional ren- dering of the image. In such cases NR-IQA measures allow us to Monte Carlo (MC) light transport simulations are capable of mod- predict the perceived quality of images by modelling the distribu- elling the complex interactions of light with a wide range of phys- tion of naturally occurring distortions that are present in images ically based materials, participating media, and camera models to before convergence of the rendering process has been achieved.
    [Show full text]
  • Caradoc of the North Wind Free
    FREE CARADOC OF THE NORTH WIND PDF Allan Frewin Jones | 368 pages | 05 Apr 2012 | Hachette Children's Group | 9780340999417 | English | London, United Kingdom CARADOC OF THE NORTH WIND PDF As the war. Disaster strikes, and a valued friend suffers Caradoc of the North Wind devastating injury. Branwen sets off on a heroic journey towards destiny in an epic adventure of lovewar and revenge. Join Charlotte and Mia in this brilliant adventure full of princess sparkle and Christmas excitement! Chi ama i libri sceglie Kobo e inMondadori. The description is beautiful, but at times a bit too much, and sometimes at its worst the writing is hard to comprehend completely clearly. I find myself hoping vehemently for another book. It definitely allows the I read this in Caradoc of the North Wind sitting and could not put it down. Fair Wind to Widdershins. This author has published under several versions of his name, including Allan Jones, Frewin Jones, and A. Write a product review. Now he has stolen the feathers Caradoc of the North Wind Doodle, the weather-vane cockerel in charge of the weather. Jun 29, Katie rated it really liked it. Is the other warrior child, Arthur?? More than I thought I would, even. I really cafadoc want to know more, and off author is one that can really take you places. Join us by creating an account and start getting the best experience from our website! Jules Ember was raised hearing legends of wjnd ancient magic of the wicked Alchemist and the good Sorceress. Delivery and Returns see our delivery rates and policies thinking of returning an item? Mar 24, Valentina rated it really liked it.
    [Show full text]
  • Differentiable Path Tracing by Regularizing Discontinuities
    Differentiable Path Tracing by Regularizing Discontinuities Peter Quinn Jérôme Parent-Lévesque McGill University Mila, Université de Montréal [email protected] [email protected] Cengiz Oztireli Derek Nowrouzezahrai University of Cambridge, Google McGill University [email protected] [email protected] (a) (b) (c) (d) (e) Figure 1: 1b shows the gradients calculated for the icosahedron in 1a for a horizontal translation. 1c shows Cornell box scene with global illumination and Russian roulette termination. 1d has larger σ, resulting in more blur. 1e has smaller γ, resulting in less transparency. Abstract Recently, viewing computer vision as an inverse rendering problem has led to a growing interest in differentiable rendering. We introduce a novel differentiable path tracing algorithm where discontinuities in the rendering process are regularized through blurring of the geometry. Our differentiable renderer implements full global illumination and has parameters for controlling the regularization, which allows for some control over the smoothness of the loss landscape. We successfully apply our system to solve several examples of challenging inverse rendering optimization problems that involve more complex light transport scenarios that cannot be handled by rasterization-based differentiable renderers. 1 Introduction The field of computer vision has seen significant advancements due to the proliferation of machine learning (ML) techniques and specialized neural networks architectures [1][2]. However, many of these modern techniques focus only on the 2D image, and are agnostic to the underlying 3D world and geometry that are behind the image formation process. Inverse graphics is the process of obtaining a detailed description of a 3D scene from an image.
    [Show full text]
  • Hello, World! Free
    FREE HELLO, WORLD! PDF Disney Book Group | 14 pages | 16 Aug 2011 | Disney Press | 9781423141402 | English | New York, NY, United States "Hello, World!" program - Wikipedia Learn Data Science by completing interactive coding challenges and watching videos by expert instructors. Start Now! Python Hello a very simple language, and has a very straightforward syntax. It encourages programmers to program without boilerplate prepared code. The simplest directive in Python is the "print" directive - it simply prints out a line and also includes a newline, unlike in C. There Hello two major Python versions, Python 2 and Python 3. Python 2 and 3 are quite different. This tutorial uses Python 3, because it more semantically correct and supports newer features. For example, World! difference between Python 2 and 3 is the print statement. In Hello 2, World! "print" statement is not a function, and therefore it is invoked without parentheses. However, in Python World!, it World! a function, and must be invoked with parentheses. Python uses indentation for World!, instead of curly braces. Both tabs and spaces are supported, but the standard indentation requires standard Python code to use four spaces. For example:. This site is generously supported by DataCamp. Join over a million other learners and get started learning Python for data science today! Hello, World! To print a string in Python 3, just write: print "This line will be printed. Hello Tutorial. Read our Terms of Use and Privacy Policy. Hello, World! - Learn Python - Free Interactive Python Tutorial A "Hello, World! Such a Hello is very simple in most programming World!and World! often used to illustrate the basic syntax of a programming language.
    [Show full text]
  • Monte Carlo Methods for Improved Rendering
    Monte Carlo methods for improved rendering Tegan Brennan Stephen Merity Taiyo Wilson Abstract Stochastic ray tracing is one of the fundamental algorithms in computer graphics for generating photo-realistic images. Stochastic ray tracing uses Monte Carlo integration to solve the rendering equation. This presents a major drawback how- ever: noise. To decrease the visible noise to an acceptable level is computationally intensive. In this project, we implement an unbiased Monte Carlo renderer as an experi- mental testbed for evaluating improved sampling strategies, both theoretically and practically motivated. We evaluate our results empirically, aiming to improve both the image quality and speed of convergence for our test images. Our results show that the improved sampling methods we use for rendering can give comparable image quality over twenty times faster than naive Monte Carlo methods. 1 Introduction Raytracing is one of the most elegant techniques in computer graphics for producing realistic images of a given scene. The elegance comes from solving the light transport equation using Monte Carlo methods, which approximates the radiance of a given pixel by simulating how light paths could reach it in a physically accurate way. Many physical phenomena that are near impossible to compute with other techniques can be trivially modelled using a raytracer. To accurately render these scenes however is a computationally intensive task. For a high definition 1080p image, as might be rendered by Pixar, the radiance of over 2 million pixels need to be computed, requiring millions or billions of light paths depending on the complexity of the scene. Performing this naively is impractical, even with advanced hardware such as general purpose graphical programming units (GPGPUs) which can cast millions of rays per second.
    [Show full text]
  • A Deep Learning Approach to Color Spill Suppression in Chroma Keying
    University of Bremen Masterarbeit A Deep Learning Approach to Color Spill Suppression in Chroma Keying Pascale Maul Course of Study: Computer Science Examiner: Udo Frese, Hans Meine Supervisor: Udo Frese Commenced: April 04, 2019 Completed: October 18, 2019 Abstract This thesis proposes a deep learning approach to color spill. Color spill is the effect that occurs when lighting is reflected from the background onto the foreground. This phenomenon occurs mainly in chroma key settings, i.e. upon shooting movies with green-screen scenes, which are then replaced by for example an animated background. Here, the main concerns are creating an image matte and suppressing spill affected areas. As of yet, this process requires heavy user interaction and is very time consuming. One particular difficulty is to distinguish between mixed and spilled pixels. Both appear in form of pixels with a green tinge. In contrast to spilled ones, mixed pixels are caused by foreground- background interactions that should be translated to the new image composition. Blurred areas are to remain translucent, while the background should be let through. The same applies to areas where the foreground object is translucent. In contrast, spilling artifacts should be removed and the original color of the foreground object should be restored. Not doing so results in inconsistencies between the lighting conditions of the foreground and new background, which are easily detected by the human eye. Color spill is a phenomenon with a ground truth, that is difficult to acquire. Therefore, no data sets containing relevant information exist. Rendering synthetic images to construct such a data set is the first major contribution of this work.
    [Show full text]