3D Modeling from Wide Baseline Range Scans Using Contour Coherence

Total Page:16

File Type:pdf, Size:1020Kb

3D Modeling from Wide Baseline Range Scans Using Contour Coherence 3D Modeling from Wide Baseline Range Scans using Contour Coherence Ruizhe Wang Jongmoo Choi Gerard´ Medioni Computer Vision Lab, Institute for Robotics and Intelligent Systems University of Southern California fruizhewa, jongmooc, [email protected] Abstract 1 2 Registering 2 or more range scans is a fundamental problem, with application to 3D modeling. While this prob- lem is well addressed by existing techniques such as ICP when the views overlap significantly at a good initializa- tion, no satisfactory solution exists for wide baseline regis- 1 2 tration. We propose here a novel approach which leverages (a) contour coherence and allows us to align two wide base- line range scans with limited overlap from a poor initializa- 1 2 tion. Inspired by ICP, we maximize the contour coherence by building robust corresponding pairs on apparent con- tours and minimizing their distances in an iterative fashion. We use the contour coherence under a multi-view rigid reg- istration framework, and this enables the reconstruction of accurate and complete 3D models from as few as 4 frames. We further extend it to handle articulations, and this allows 1 2 us to model articulated objects such as human body. Exper- (b) imental results on both synthetic and real data demonstrate Figure 1. (a) Two roughly aligned wide baseline 2.5D range scans the effectiveness and robustness of our contour coherence of the Stanford Bunny with the observed and predicted apparent based registration approach to wide baseline range scans, contours extracted. The two meshed points cloud are generated and to 3D modeling. from the two 2.5D range scans respectively (b) Registration result after maximizing the contour coherence 1. Introduction points on the surface where the tangent plane contains the Registering 2 or more range scans is a fundamental prob- line of sight from the camera. This contour has been shown lem with application to 3D modeling. It is well addressed to be a rich source of geometric information for motion es- in the presence of sufficient overlap and good initializa- timation and 3D reconstruction [7,8,9, 12]. tion [3,4,5, 14]. However, registering two wide baseline Inspired by their works, we propose the concept of con- range scans presents a challenging task where two range tour coherence for wide baseline range scan registration. scans barely overlap and the shape coherence no longer pre- Contour coherence is defined as the agreement between the vails. An example of two wide baseline range scans of the observed apparent contour and the predicted apparent con- Stanford Bunny with approximately 40% overlap is given tour. As shown in Fig. 1(a), the observed contours extracted in Fig. 1(a). The traditional shape coherence based meth- from the original 2.5D range scans, i.e., red lines in image ods may fail as most closest-distance correspondences are 1 and blue lines in image 2, do not match the corresponding incorrect. contours extracted from the projected 2.5D range scans, i.e., In computer vision dealing with intensity images, a large blue lines in image 1 and red lines in image 2. We maximize body of work has been devoted to study the apparent con- contour coherence by iteratively building robust correspon- tour, or simply contour. An apparent contour is the projec- dences among apparent contours and minimizing their dis- tion of a contour generator, which is defined as the set of tances. The registration result is shown in Fig. 1(b) with the 1 contour coherence maximized and two wide baseline range (ICP) algorithm [5] and its variants [6, 19] prove to be ef- scans well aligned. The contour coherence is robust in the fective in accurate rigid registration for high-quality 3D in- presence of wide baseline in the sense that only the shape door scene reconstruction [17]. Few efficient attempts have area close to the predicted contour generator is considered been made for solving rigid range scan registration of par- when building correspondences on the contour, thus avoid- tial overlap and poor initialization. Silva et al.[22] solve the ing the search for correspondences over the entire shape. alignment by an expensive search over the entire pose-space The recently released low-cost structured light 3D sen- which is speeded up by the genetic algorithm. Gelfand sors (e.g., Kinect, Primesense) enable handy complete scan- et al.[11] propose a shape descriptor based method and ning of objects in the home environment. While the state- perform the registration by matching descriptors under the of-the-art scanning methods [17, 23] generate excellent 3D RANSAC framework. Similar approaches have been pro- reconstruction results of rigid or articulated objects, they posed [1,2]. These metric-based registration methods re- assume small motion between consecutive views and as a quires extensive computation of similarity values across a result either the user has to move the 3D sensor carefully set of candidate matches and they are only pairwise. Fur- around the object or the subject must turn slowly in a con- thermore, they fail when two range scans overlap on fea- trolled manner. Even with best effort to reduce the drifting tureless region. error, sometimes the gap is still visible when closing the Articulated non-rigid registration. Allen et al.[3] extend loop (Section7). rigid registration to a template-based articulated registration In our work, we explicitly employ contour coherence un- algorithm for aligning several body scans. Their method der a multi-view framework and develop the Multi-view It- utilizes a set of manually selected markers. Pekelney and erative Closest Contour (M-ICC) algorithm which rigidly Gotsman [18] achieve articulated registration by perform- aligns all range scans at the same time. We further extend ing ICP on each segment and their method requires a man- M-ICC to handle small articulations and propose the Multi- ual segmentation of the first frame. Chang and Zwicker [4] view Articulated Iterative Closest Contour (MA-ICC) algo- further remove the assumption of known segmentation by rithm. Using our proposed registration methods, we suc- solving a discrete labeling problem to detect the set of opti- cessfully address the loop-closure problem from as few as mal correspondences and apply graph cuts for optimization. 4 wide baseline views with limited overlap, and reconstruct General non-rigid registration. Li et al.[14] develop a accurate and complete rigid as well as articulated objects, registration framework that simultaneously solves for point hence greatly reducing the data acquisition time and the reg- correspondences, surface deformation, and region of over- istration complexity, while providing accurate results. lap within a single global optimization. More recently, sev- To the best of our knowledge, we are first to introduce eral methods based on the embedded deformation model contour coherence for multi-view wide baseline range scan have been proposed for modeling of non-rigid objects, ei- registration. Our main contributions include: 1) The con- ther by initializing a complete deformation model with the cept of contour coherence for robust wide baseline range first frame [23] or incrementally updating it [27]. scan registration; 2) Contour coherence based multi-view Apparent contour or silhouette for pose estimation. Ap- rigid registration algorithm M-ICC which allows 3D mod- parent contour has been used for camera motion estimation eling from as few as 4 views; 3) Extension to multi-view [8] relying on the notion of epipolar tangency points. In par- articulated registration algorithm MA-ICC and its applica- ticular, the work of [26] uses only the two outermost epipo- tion to 3D body modeling. lar tangents for the camera pose estimation. More recently, Section2 presents the relevant literature. Section3 de- Hernandez et al.[12] propose the notion of silhouette co- scribes how to extract robust correspondences from the ob- herence, which measures the consistency between observed served and predicted contours. Section4 employs the con- silhouettes and predicted silhouettes, and maximize it to es- tour coherence in a multi-view rigid registration framework, timate the camera poses. Cheung et al.[7] calculate camera which is further extended to consider articulation in Sec- poses by locating colored surface points along the bounding tion5. Section6 briefly covers implementation. Section7 edges. presents the experimental evaluation results while section8 ends with a conclusion and future work. 3. Contour Coherence 2. Related Work We perform wide baseline range scan registration by maximizing contour coherence, i.e., the agreement between We roughly classify all relevant work into 4 categories: the observed and predicted apparent contours. From an im- rigid registration, articulated non-rigid registration (assum- plementation point of view, M-ICC and MA-ICC alternate ing an articulated structure), general non-rigid registration, between finding closest contour correspondences and mini- and apparent contour or silhouette for pose estimation. mizing their distances. However an intuitive closest match- Rigid registration. The classic Iterative Closest Point ing scheme on all contour points fails, mainly due to the Robust Closest Contour (RCC) frame 2 belonging to the back part of the Armadillo are not Ri Extracting Pruning Bijective closest visible in view 1 (Fig. 3(b)). Hence we prune Ci based on contour points contour points matching in 3D Mi, j i R ji the visibility of the corresponding contour generator in view j, Figure 2. General pipeline of our Robust Closest Contour (RCC) method p T Ci=j = fu 2 CijNi(u) · (oj!i − Vi(u)) > 0g; (3) presence of self-occlusion, 2D ambiguity, and outliers as where oj!i is the camera location of frame j in camera i. described later. Hence we propose the Robust Closest Con- An example of pruned contour points is shown in Fig. 3(c). Bijective closest matching in 3D.
Recommended publications
  • Lecture 2 3D Modeling
    Lecture 2 3D Modeling Dr. Shuang LIANG School of Software Engineering Tongji University Fall 2012 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Fall 2012 Lecturer Dr. Shuang LIANG • Assisstant professor, SSE, Tongji – Education » B.Sc in Computer Science, Zhejiang University, 1999-2003 » PhD in Computer Science, Nanjing Univerisity, 2003-2008 » Visit in Utrecht University, 2007, 2008 – Research Fellowship » The Chinese University of Hong Kong, 2009 » The Hong Kong Polytechnic University, 2010-2011 » The City University of Hong Kong, 2012 • Contact – Office: Room 442, JiShi Building, JiaDing Campus, TongJi (Temporary) – Email: [email protected] 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Fall 2012 Outline • What is a 3D model? • Usage of 3D models • Classic models in computer graphics • 3D model representations • Raw data • Solids • Surfaces 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Fall 2012 Outline • What is a 3D model? • Usage of 3D models • Classic models in computer graphics • 3D model representations • Raw data • Solids • Surfaces 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Fall 2012 What is a 3D model? 3D object using a collection of points in 3D space, connected by various geometric entities such as triangles, lines, curved surfaces, etc. It is a collection of data (points and other information) 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Fall 2012 What is a 3D modeling? The process of developing a mathematical representation of any three-dimensional
    [Show full text]
  • Classic Models in Computer Graphics • 3D Model Representations • Raw Data • Solids • Surfaces
    Lecture 2 3D Modeling Dr. Shuang LIANG School of Software Engineering Tongji University Spring 2013 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 Today’s Topics • What is a 3D model? • Usage of 3D models • Classic models in computer graphics • 3D model representations • Raw data • Solids • Surfaces 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 Today’s Topics • What is a 3D model? • Usage of 3D models • Classic models in computer graphics • 3D model representations • Raw data • Solids • Surfaces 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 What is a 3D model? 3D object using a collection of points in 3D space, connected by various geometric entities such as triangles, lines, curved surfaces, etc. It is a collection of data (points and other information) 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 What is a 3D modeling? The process of developing a mathematical representation of any three-dimensional surface of object via specialized software. 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 Today’s Topics • What is a 3D model? • Usage of 3D models • Classic models in computer graphics • 3D model representations • Raw data • Solids • Surfaces 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 Usage of a 3D model The medical industry uses detailed models of organs 3D Modeling, Advanced Computer Graphics Shuang LIANG, SSE, Spring 2013 Usage of a 3D model The movie industry uses them as characters
    [Show full text]
  • Caradoc of the North Wind Free
    FREE CARADOC OF THE NORTH WIND PDF Allan Frewin Jones | 368 pages | 05 Apr 2012 | Hachette Children's Group | 9780340999417 | English | London, United Kingdom CARADOC OF THE NORTH WIND PDF As the war. Disaster strikes, and a valued friend suffers Caradoc of the North Wind devastating injury. Branwen sets off on a heroic journey towards destiny in an epic adventure of lovewar and revenge. Join Charlotte and Mia in this brilliant adventure full of princess sparkle and Christmas excitement! Chi ama i libri sceglie Kobo e inMondadori. The description is beautiful, but at times a bit too much, and sometimes at its worst the writing is hard to comprehend completely clearly. I find myself hoping vehemently for another book. It definitely allows the I read this in Caradoc of the North Wind sitting and could not put it down. Fair Wind to Widdershins. This author has published under several versions of his name, including Allan Jones, Frewin Jones, and A. Write a product review. Now he has stolen the feathers Caradoc of the North Wind Doodle, the weather-vane cockerel in charge of the weather. Jun 29, Katie rated it really liked it. Is the other warrior child, Arthur?? More than I thought I would, even. I really cafadoc want to know more, and off author is one that can really take you places. Join us by creating an account and start getting the best experience from our website! Jules Ember was raised hearing legends of wjnd ancient magic of the wicked Alchemist and the good Sorceress. Delivery and Returns see our delivery rates and policies thinking of returning an item? Mar 24, Valentina rated it really liked it.
    [Show full text]
  • Hello, World! Free
    FREE HELLO, WORLD! PDF Disney Book Group | 14 pages | 16 Aug 2011 | Disney Press | 9781423141402 | English | New York, NY, United States "Hello, World!" program - Wikipedia Learn Data Science by completing interactive coding challenges and watching videos by expert instructors. Start Now! Python Hello a very simple language, and has a very straightforward syntax. It encourages programmers to program without boilerplate prepared code. The simplest directive in Python is the "print" directive - it simply prints out a line and also includes a newline, unlike in C. There Hello two major Python versions, Python 2 and Python 3. Python 2 and 3 are quite different. This tutorial uses Python 3, because it more semantically correct and supports newer features. For example, World! difference between Python 2 and 3 is the print statement. In Hello 2, World! "print" statement is not a function, and therefore it is invoked without parentheses. However, in Python World!, it World! a function, and must be invoked with parentheses. Python uses indentation for World!, instead of curly braces. Both tabs and spaces are supported, but the standard indentation requires standard Python code to use four spaces. For example:. This site is generously supported by DataCamp. Join over a million other learners and get started learning Python for data science today! Hello, World! To print a string in Python 3, just write: print "This line will be printed. Hello Tutorial. Read our Terms of Use and Privacy Policy. Hello, World! - Learn Python - Free Interactive Python Tutorial A "Hello, World! Such a Hello is very simple in most programming World!and World! often used to illustrate the basic syntax of a programming language.
    [Show full text]
  • Brake-Based Shape Displays and Automatic Content Authoring
    TOWARDS LOW-COST SPATIAL HAPTICS: BRAKE-BASED SHAPE DISPLAYS AND AUTOMATIC CONTENT AUTHORING A DISSERTATION SUBMITTED TO THE DEPARTMENT OF ELECTRICAL ENGINEERING AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Kai Zhang August 2020 © 2020 by Kai Zhang. All Rights Reserved. Re-distributed by Stanford University under license with the author. This work is licensed under a Creative Commons Attribution- Noncommercial 3.0 United States License. http://creativecommons.org/licenses/by-nc/3.0/us/ This dissertation is online at: http://purl.stanford.edu/kh451yb0256 ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Sean Follmer, Primary Adviser I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Juan Rivas-Davila I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Gordon Wetzstein Approved for the Stanford University Committee on Graduate Studies. Stacey F. Bent, Vice Provost for Graduate Education This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file in University Archives. iii Abstract Haptic technology can significantly enhance the richness and realism of user interaction by providing the experience of physical touch.
    [Show full text]
  • 02562 Rendering - Introduction DTU Compute
    02562 Rendering - Introduction DTU Compute Worksheet 4 One of the key strengths of computer graphics techniques is that they work in general. Ray tracing is, for example, well known in numerous fields of research. The distinguishing feature of ray tracers in computer graphics is their ability to efficiently handle nearly arbitrary scenes. This set of exercises helps you load and efficiently render large triangle meshes. Learning Objectives • Accelerate rendering techniques using spatial data structures. • Implement ray tracing of a triangle mesh (indexed face set). • Use a BSP tree for efficient space subdivision. • Interpolate normals and texture coordinates across a triangle. Triangle Meshes One way to handle arbitrary surfaces is using triangles. In particular, we often work with an indexed face set. This is a collection of vertices, normals, and texture coordinates, each with an index, and a list of index triples each of which defines a triangle and its vertex normals and texture coordinates (if present). One way to store an indexed face set on disk is using a Wavefront OBJ file.1 This is the file format used by the framework. Most 3D modelling software packages can export to this format, and a couple of the classic computer graphics scenes have been included with the framework (the Stanford bunny, the Utah teapot, and the Cornell box, see the models subfolder). • Load the Cornell box with blocks (CornellBox.obj and CornellBlocks.obj) and display it in pre- view. This is a simple triangle mesh often used for testing rendering algorithms. The framework loads one or more OBJ files if you provide them as command line arguments.2 Take a screenshot of the (base colour) preview.
    [Show full text]
  • Ray Tracing on a Stream Processor
    RAY TRACING ON A STREAM PROCESSOR a dissertation submitted to the department of computer science and the committee on graduate studies of stanford university in partial fulfillment of the requirements for the degree of doctor of philosophy Timothy John Purcell March 2004 c Copyright by Timothy John Purcell 2004 All Rights Reserved ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Patrick M. Hanrahan (Principal Adviser) I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. William R. Mark University of Texas, Austin I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Mark Horowitz Approved for the University Committee on Graduate Studies. iii Abstract Ray tracing is an image synthesis technique which simulates the interaction of light with surfaces. Most high-quality, photorealistic renderings are generated by global illumination techniques built on top of ray tracing. Real-time ray tracing has been a goal of the graphics community for many years. Unfortunately, ray tracing is a very expensive operation. VLSI technology has just reached the point where the computational capability of a single chip is sufficient for real-time ray tracing. Super- computers and clusters of PCs have only recently been able to demonstrate interactive ray tracing and global illumination applications.
    [Show full text]
  • Real-Time Generation of Kd-Trees for Ray Tracing Using Directx 11
    Master of Science in Engineering: Game and Software Engineering 2017 Real-time generation of kd-trees for ray tracing using DirectX 11 Martin Säll and Fredrik Cronqvist Dept. Computer Science & Engineering Blekinge Institute of Technology SE–371 79 Karlskrona, Sweden This thesis is submitted to the Department of Computer Science & Engineering at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Master of Science in Computer Science. The thesis is equivalent to 20 weeks of full-time studies. Contact Information: Authors: Martin Säll E-mail: [email protected] Fredrik Cronqvist E-mail: [email protected] University advisor: Prof. Lars Lundberg Dept. Computer Science & Engineering Mail: [email protected] Examiner: Veronica Sundstedt E-mail: [email protected] Dept. Computer Science & Engineering Internet : www.bth.se Blekinge Institute of Technology Phone : +46 455 38 50 00 SE–371 79 Karlskrona, Sweden Fax : +46 455 38 50 57 Abstract Context. Ray tracing has always been a simple but effective way to create a photorealistic scene but at a greater cost when expanding the scene. Recent improvements in GPU and CPU hardware have made ray tracing faster, making more complex scenes possible with the same amount of time needed to process the scene. Despite the improvements in hardware ray tracing is still rarely run at an interactive speed. Objectives. The aim of this experiment was to implement a new kd- tree generation algorithm using DirectX 11 compute shaders. Methods. The implementation created during the experiment was tested using two platforms and five scenarios where the generation time for the kd-tree was measured in milliseconds.
    [Show full text]
  • End-To-End Learning of Local Point Cloud Feature Descriptors
    Iowa State University Capstones, Theses and Graduate Theses and Dissertations Dissertations 2019 End-to-end learning of local point cloud feature descriptors David Ambrose Wehr Iowa State University Follow this and additional works at: https://lib.dr.iastate.edu/etd Part of the Computer Sciences Commons Recommended Citation Wehr, David Ambrose, "End-to-end learning of local point cloud feature descriptors" (2019). Graduate Theses and Dissertations. 17604. https://lib.dr.iastate.edu/etd/17604 This Thesis is brought to you for free and open access by the Iowa State University Capstones, Theses and Dissertations at Iowa State University Digital Repository. It has been accepted for inclusion in Graduate Theses and Dissertations by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. End-to-end learning of local point cloud feature descriptors by David Wehr A thesis submitted to the graduate faculty in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Major: Computer Science Program of Study Committee: Rafael Radkowski, Co-major Professor Yan-Bin Jia, Co-major Professor Forrest Bao The student author, whose presentation of the scholarship herein was approved by the program of study committee, is solely responsible for the content of this thesis. The Graduate College will ensure this thesis is globally accessible and will not permit alterations after a degree is conferred. Iowa State University Ames, Iowa 2019 Copyright c David Wehr, 2019. All rights reserved. ii TABLE OF CONTENTS Page LISTOFTABLES.......................................... iv LISTOFFIGURES ........................................ v ACKNOWLEDGMENTS ...................................... vii ABSTRACT ............................................ viii CHAPTER1. OVERVIEW&MOTIVATION . 1 1.1 Introduction.....................................
    [Show full text]
  • Photography, Mobile, and Immersive Imaging 2018
    https://doi.org/10.2352/ISSN.2470-1173.2018.05.PMII-555 PROCEEDINGS © 2018, Society for Imaging Science and Technology 28 January 2018 – 1 February 2018 • Burlingame, CA, USA Photography, Mobile, and Immersive Imaging 2018 Editors: Zhen He, Intel Corp. (United States) Feng Li, GoPro Inc. (United States) Jon S. McElvain, Dolby Labs., Inc. (United States) Nitin Sampat, Rochester Institute of Technology (United States) These papers represent the program of Electronic Imaging 2018, held 28 January – 1 February 2018, at the Hyatt Regency San Francisco Airport in Burlingame, CA. Copyright 2018 Society for Imaging Science and Technology 7003 Kilworth Lane • Springfield, VA 22151 USA 703/642-9090; 703/642-9094 fax [email protected]; www.imaging.org All rights reserved. These proceedings, or parts thereof, may not be reproduced in any form without the written permission of the Society. ISSN 2470-1173 https://doi.org/10.2352/ISSN.2470-1173.2018.05.PMII-555 Manuscripts are reproduced from PDFs as submitted and approved by authors; no editorial changes have been made. IS&T International Symposium on Electronic Imaging 2018 Photography, Mobile, and Immersive Imaging 2018 555-1 electronicimaging.org Photography, Mobile, and Immersive Imaging 2018 Conference Chairs Photography, Mobile, and Zhen He, Intel Corp. (United States) Immersive Imaging 2018 Feng Li, Intuitive Surgical Inc. (United States) Jon S. McElvain, Dolby Labs., Inc. (United States) Symposium Chairs Nitin Sampat, Rochester Institute of Technology (United States) Joyce Farrell, Stanford University (United States) Andrew Woods, Curtin University (Australia) Conference Committee Ajit Bopardikar, Samsung R&D Institute India Bangalore Pvt. Ltd. Symposium Short Course Chairs (India) Susan Farnand, Rochester Institute of Technology (United States) Peter Catrysse, Stanford Univ.
    [Show full text]
  • Supervised Training of Dense Object Nets Using Optimal Descriptors for Industrial Robotic Applications
    The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21) Supervised Training of Dense Object Nets using Optimal Descriptors for Industrial Robotic Applications Andras Gabor Kupcsik1, Markus Spies1, Alexander Klein2, Marco Todescato1, Nicolai Waniek1, Philipp Schillinger1, Mathias Burger¨ 1 1Bosch Center for Artificial Intelligence, Renningen, Germany 2Technische Universitat¨ Darmstadt, Darmstadt, Germany [email protected] Abstract DONs can be readily applied to learn arbitrary objects with relative ease, including non-rigid objects. The self- Dense Object Nets (DONs) by Florence, Manuelli and supervised training objective of DONs uses contrastive loss Tedrake (2018) introduced dense object descriptors as a novel visual object representation for the robotics community. It is (Hadsell, Chopra, and LeCun 2006) between pixels of image suitable for many applications including object grasping, pol- pairs depicting the object and its environment. The pixel- icy learning, etc. DONs map an RGB image depicting an ob- wise contrastive loss minimizes descriptor space distance ject into a descriptor space image, which implicitly encodes between corresponding pixels (pixels depicting the same key features of an object invariant to the relative camera pose. point on the object surface in an image pair) and pushes Impressively, the self-supervised training of DONs can be ap- away non-correspondences. In essence, minimizing the con- plied to arbitrary objects and can be evaluated and deployed trastive loss defined on descriptors of pixels leads to a view within hours. However, the training approach relies on accu- invariant map of the object surface in descriptor space. rate depth images and faces challenges with small, reflective objects, typical for industrial settings, when using consumer The contrastive loss formulation belongs to the broader grade depth cameras.
    [Show full text]
  • Strategic Latency Unleashed the Role of Technology in a Revisionist Global Order and the Implications for Special Operations Forces Edited by Zachary S
    STRATEGIC LATENCY UNLEASHED THE ROLE OF TECHNOLOGY IN A REVISIONIST GLOBAL ORDER AND THE IMPLICATIONS FOR SPECIAL OPERATIONS FORCES EDITED BY ZACHARY S. DAVIS, FRANK GAC, CHRISTOPHER RAGER, PHILIP REINER, AND JENNIFER SNOW CENTER FOR GLOBAL SECURITY RESEARCH This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory in part under Contract W-7405-Eng-48 and in part under Contract DE-AC52-07NA27344. The views and opinions of the author expressed herein do not necessarily state or reflect those of the United States government or Lawrence Livermore National Security, LLC. ISBN-978-1-952565-07-6 LCCN-2021901137 LLNL-BOOK-818513 TID-59693 To download the ebook: See cgsr.llnl.gov STRATEGIC LATENCY UNLEASHED THE ROLE OF TECHNOLOGY IN A REVISIONIST GLOBAL ORDER AND THE IMPLICATIONS FOR SPECIAL OPERATIONS FORCES EDITED BY ZACHARY S. DAVIS, FRANK GAC, CHRISTOPHER RAGER, PHILIP REINER, AND JENNIFER SNOW Center for Global Security Research Lawrence Livermore National Laboratory January 2021 STRATEGIC LATENCY UNLEASHED | 1 Table of Contents EDITOR'S NOTE . 1 FOREWORD . 2 DEDICATION . 4 ACKNOWLEDGEMENTS . 5 INTRODUCTION Latency Unleashed: What It Means for Special Operations Forces Zachary S. Davis, Lawrence Livermore National Laboratory (LLNL), Research Professor, Naval Postgraduate School (NPS) . 8 SECTION 1 GEOPOLITICS OF STRATEGIC LATENCY FOR SOF: CONTEXT IS EVERYTHING Winning and Losing in Counterproliferation Zachary Davis, LLNL/NPS, and Michael Greene, Naval Special Warfare Command (ret.) . 15 The Role of Special Operations Forces in Countering Weapons of Mass Destruction Brendan Melley, Center for the Study of Weapons of Mass Destruction, National Defense University (NDU) .
    [Show full text]