Using Geometric Primitives to Unify Perception and Action for Object-Based Manipulation

Total Page:16

File Type:pdf, Size:1020Kb

Using Geometric Primitives to Unify Perception and Action for Object-Based Manipulation USING GEOMETRIC PRIMITIVES TO UNIFY PERCEPTION AND ACTION FOR OBJECT-BASED MANIPULATION by Hunter Brown A thesis submitted to the faculty of The University of Utah in partial fulfillment of the requirements for the degree of Master of Science Department of Mechanical Engineering The University of Utah May 2018 Copyright c Hunter Brown 2018 All Rights Reserved The University of Utah Graduate School STATEMENT OF THESIS APPROVAL The thesis of Hunter Brown has been approved by the following supervisory committee members: Tucker Hermans , Chair March 14, 2018 Date Approved Mark Minor , Member March 14, 2018 Date Approved Jake Abbott , Member March 14, 2018 Date Approved and by Tim Ameel , Chair/Dean of the Department/College/School of Mechanical Engineering and by David B. Kieda, Dean of The Graduate School. ABSTRACT In this work we consider task-based planning in uncertainty. To make progress in this problem, we propose an end-to-end method that makes progress toward the unification of perception and manipulation. Critical for this unification is the geometric primitive. A geometric primitive is a 3D geometry that can be fit to a single view from a 3D image. Geometric primitives are a consistent structure in many scenes, and by leveraging this, perceptual tasks such as segmentation, localization, and recognition can be solved. Shar- ing this information between these subroutines also makes the method computationally efficient. Geometric primitives can be used to define a set of actions the robot can use to influ- ence the world. Leveraging the rich 3D information in geometric primitives allows the designer to develop actions with a high chance of success. In this work, we consider a pick-and-place action, parameterized by the object and scene constraints. The design of the perceptual capabilities and actions is independent of the task given to the robot, giving the robot more versatility to complete a range of tasks. With a large number of available actions, the robot needs to select which action the robot performs. We propose a task-specific reward function to determine the next-best action for the robot to complete the task. A key insight into making the action selection tractable is reasoning about the occluded regions of the scene. We propose to not reason about what could be in the occluded regions, but instead to treat the occluded regions as parts of the scene to explore. Defining reward functions that encourage this exploration while balancing trying to solve the given task gives the robot more versatility to perform many different tasks. Reasoning about occlusion in this way also makes actions in the scene more robust to scene uncertainty and increases the computational efficiency of the method overall. In this work, we show results for segmentation of geometric primitives on real data, and discuss problems with fitting their parameters. While positive segmentation results are shown, there are problems with fitting consistent parameters to the geometric prim- itives. We also present simulation results showing the action selection process solving a singulation task. We show that our method is able to perform this task in several scenes with varying levels of complexity. We compare against selecting actions at random, and show our method consistently takes fewer actions to solve the scene. iv For my partner Lucia, who is always supportive, and my Grandmother Linda Rae who has always been there. CONTENTS ABSTRACT ............................................................. iii LIST OF FIGURES ....................................................... vii CHAPTERS 1. INTRODUCTION .................................................... 1 2. RELATED WORKS ................................................... 6 2.1 Segmentation . .6 2.2 Recognition and Localization . .7 2.3 Robotic Approaches . .8 3. METHODS .......................................................... 10 3.1 Problem Definition . 10 3.2 Method Overview . 13 3.3 Geometric Primitive Segmentation . 15 3.4 Action Selection . 18 3.5 Actions . 18 3.5.1 Grasping . 19 3.5.2 Object Placement Locations . 21 3.6 Update State Estimation . 22 3.7 Object Singulation Task Formulation . 24 3.8 Put Away Groceries Task Formulation . 25 4. EXPERIMENTS AND RESULTS ........................................ 28 4.1 Geometric Primitive Segmentation . 28 4.2 Geometric Primitive Manipulation Planner . 29 5. CONCLUSION ...................................................... 37 6. FUTURE WORK ..................................................... 38 APPENDIX: ALGORITHMS ............................................... 39 REFERENCES ........................................................... 41 LIST OF FIGURES 1.1 Examples of common tasks robots might need to complete. (A) An example of a bathroom cabinet with medicine the robot must find, (B) example of a pantry with messy shelves the robot must organize, and (C) An example of a table with groceries the robot must put away. .1 1.2 An example of an over-segmentation created by using the geometric primi- tive. Each color represents a segment of the scene with a different geometric primitive fit to it. .4 3.1 Examples of a box (A), cylinder (B), and sphere (C) geometric primitive. 10 3.2 Given a point cloud of a drill (A) we can fit two cylinders and a box to represent this object (B). 11 3.3 Given an observation (A) possible true states (B). 14 3.4 An example of Geometric Primitive Segmentation results. (A) Initial 3D point cloud data. (B) Segmentation of the scene. Here each color represents all the pixels assigned to a given segment. (C) Geometric primitives fit to the different segments. Blue pixels have a box fit to them, green pixels have a cylinder, and red have a sphere. 17 3.5 An example of generated grasp candidates for (a) box, (b) sphere, and (c) cylinder. 19 3.6 Example of how to generate placement locations. (A) Possible state estima- tion the robot could encounter. The table is colored green, the objects purple and occluded regions are black. If the robot tries to find the placement loca- tions for the cylinder in the middle (B) is the footprint of the scene without the cylinder, (C) is the footprint of the cylinder (scaled to be easily visable), and (D) is the convolution where green indicates placement locations without collision and red placement locations with collision. 21 3.7 An example scene for the put away Groceries taks. (A) Counter full of gro- ceries for the robot to put away. (B) Cabinet the robot must place the groceries into. ............................................................. 25 4.1 Three examples of segmentations generated using Algorithm 2. The first column is an image of the scene. The second column is the segmentation, where each color is a unique segment in the scene. The third column idicates the type of geometric primitive that was assigned to each segment. Green is a cylinder, red a sphere, and blue a box. 29 4.2 Two different fits for the same data: (A) is an ideal fit and (B) is an error. 30 4.3 Examples of each scene type (A) is a large scene and (B) is a small scene. The supporting surface is shown in green and the objects in red. 31 4.4 The mean and CI for the data as a function of the number of objects in the scene. ............................................................ 32 4.5 The mean and CI for the data as a function of the size of the scene. 33 4.6 The mean and CI for the data as a function of the Method. 34 4.7 The mean and CI for the data as a function of the Method and the scene size. 34 4.8 The mean and CI for the data as a function of the Method and number of objects in the scene. 35 viii CHAPTER 1 INTRODUCTION Robots are becoming more relevant in domains beyond their industrial origin, such as in domestic settings and assistive care. A robot in these domains has to handle a wide assortment of challenging tasks. For example, an assistive-care robot could face many different jobs throughout the day. In the morning it might need to search the bathroom cabinet for a medication (Figure 1.1 A), in the afternoon organize a messy shelf (Figure 1.1 B), and in the evening put away groceries (Figure 1.1 C). While the robot might deal with similar objects and use similar actions, planning to solve a given task may vary significantly between different scenes. For example, when finding the medicine, there may be several new objects in the cabinet, because the previous night someone bought a new toothbrush and deodorant, and the objects that were in the cabinet yesterday could be moved to new locations. To complete many of these tasks the robot will need to manipulate the scene, such as moving the lotion to reveal the medicine behind it. This can be difficult (A) (B) (C) Figure 1.1. Examples of common tasks robots might need to complete. (A) An example of a bathroom cabinet with medicine the robot must find, (B) example of a pantry with messy shelves the robot must organize, and (C) An example of a table with groceries the robot must put away. 2 for the robot to do without knocking objects out of the cabinet, running into unseen parts of the scene, and even destroying itself or other objects. For robots to be successful in these new domains, they must be able to safely handle a multitude of tasks, in partially known, and unstructured environments. While there is limited structure in object-based manipulation problems, it is important to leverage the structure that does exist. We can exploit this structure by giving the robot a set of basic perceptual skills. Differentiating objects from each other (i.e, segmenting objects) gives the robot the ability to determine which parts of the scene are objects. Given a segmentation of the scene, the robot should determine the location of the objects in relation to the robot (i.e, localize the object). Segmenting and localizing objects gives the robot critical information about how to manipulate objects in the scene.
Recommended publications
  • Analysis of Geometric Primitives in Quantitative Structure Models of Tree Stems
    Remote Sens. 2015, 7, 4581-4603; doi:10.3390/rs70404581 OPEN ACCESS remote sensing ISSN 2072-4292 www.mdpi.com/journal/remotesensing Article Analysis of Geometric Primitives in Quantitative Structure Models of Tree Stems Markku Åkerblom 1;*, Pasi Raumonen 1, Mikko Kaasalainen 1 and Eric Casella 2 1 Department of Mathematics, Tampere University of Technology, P.O. Box 553, Tampere 33101, Finland; E-Mails: Pasi.Raumonen@tut.fi (P.R.); Mikko.Kaasalainen@tut.fi (M.K.) 2 Centre for Sustainable Forestry and Climate Change, Forest Research, Farnham GU10 4LH, UK; E-Mail: [email protected] * Author to whom correspondence should be addressed; E-Mail: markku.akerblom@tut.fi Academic Editors: Nicolas Baghdadi and Prasad S. Thenkabail Received: 3 February 2015 / Accepted: 3 April 2015 / Published: 16 April 2015 Abstract: One way to model a tree is to use a collection of geometric primitives to represent the surface and topology of the stem and branches of a tree. The circular cylinder is often used as the geometric primitive, but it is not the only possible choice. We investigate various geometric primitives and modelling schemes, discuss their properties and give practical estimates for expected modelling errors associated with the primitives. We find that the circular cylinder is the most robust primitive in the sense of a well-bounded volumetric modelling error, even with noise and gaps in the data. Its use does not cause errors significantly larger than those with more complex primitives, while the latter are much more sensitive to data quality. However, in some cases, a hybrid approach with more complex primitives for the stem is useful.
    [Show full text]
  • The Geometric Primitive Matorus Francisco A
    International Journal of Innovations in Engineering and Technology (IJIET) http://dx.doi.org/10.21172/ijiet.94.04 The Geometric Primitive MatOrus Francisco A. Madera1, Jorge Madera-Valdez2 1Mathematics Faculty, Universidad Autonoma de Yucatan, Merida, Yucatan, Mexico 2School of Architecture, Universidad Marista, Merida, Yucatan, Mexico Abstract- This paper explores the potential of forming hierarchical structures using a geometric primitive, that we called MatOrus, which is a 3D closed surface formed from a Bézier circle. Several MaTorii (a set of MatOrus) can be systematically assembled into variations of interlocking structures, therefore multiple MaTorii can be used as basic elements to build larger and complex geometric structures. We investigate a method that can generate the geometry proposed using a parametric coding so that the several shapes can be created easily. We also propose several forms of integrating the basic form in order to obtain aesthetic geometry that can be used in fields such as sculpture and architectural design. Our results show that the method requires minimum of memory source, a few number of operations and it is effective in procedural parameterizing. Keywords – Computational Geometry, Bézier curves, parametric coding, tridimensional design I. INTRODUCTION One of the most significant dimensions of parametric modeling software is representing variation in design. This works best when variation is continuous and distinctly different alternatives are not part of the model. Patterns are used as constructs to collect, organize, and communicate regularities in designer use of parametric design tools. In this article a geometric primitive is introduced, MatOrus that is created from a Bézier curve handled with four edges.
    [Show full text]
  • Computational Geometry and Computer Graphics by Dobkin
    Computational Geometry and Computer Graphics David P. Dobkin+ Department of Computer Science Princeton University Princeton, NJ 08544 ABSTRACT Computer graphics is a de®ning application for computational geometry. The interaction between these ®elds is explored through two scenarios. Spatial subdivisions studied from the viewpoint of computa- tional geometry are shown to have found application in computer graph- ics. Hidden surface removal problems of computer graphics have led to sweepline and area subdivision algorithms in computational geometry. The paper ends with two promising research areas with practical applica- tions: precise computation and polyhedral decomposition. 1. Introduction Computational geometry and computer graphics both consider geometric phenomena as they relate to computing. Computational geometry provides a theoretical foundation involving the study of algorithms and data structures for doing geometric computations. Computer graphics concerns the practical development of the software, hardware and algorithms necessary to create graphics (i.e. to display geometry) on the computer screen. At the interface lie many common problems which each discipline must solve to reach its goal. The techniques of the ®eld are often similar, sometimes dif- ferent. And, each ®elds has built a set of paradigms from which to operate. Thus, it is natural to ask the question: How do computer graphics and computational geometry interact? It is this question that motivates this paper. I explore here life at the interface between computational geometry and computer graphics. Rather than striving for completeness (see e.g. Yao [Ya] for a broader survey), I consider in detail two scenarios in which the two ®elds have overlapped. The goal is to trace the development of the ideas used in the two areas and to document the movement back and forth.
    [Show full text]
  • Geometric Modeling with Primitives Baptiste Angles
    Geometric modeling with primitives Baptiste Angles To cite this version: Baptiste Angles. Geometric modeling with primitives. Networking and Internet Architecture [cs.NI]. Université Paul Sabatier - Toulouse III, 2019. English. NNT : 2019TOU30060. tel-02896431 HAL Id: tel-02896431 https://tel.archives-ouvertes.fr/tel-02896431 Submitted on 10 Jul 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. THÈSE En vue de l’obtention du DOCTORAT DE L’UNIVERSITÉ DE TOULOUSE Délivré par l'Université Toulouse 3 - Paul Sabatier Cotutelle internationale : Université Victoria Présentée et soutenue par Baptiste ANGLES Le 18 avril 2019 Modélisation géométrique par primitives Ecole doctorale : EDMITT - Ecole Doctorale Mathématiques, Informatique et Télécommunications de Toulouse Spécialité : Informatique et Télécommunications Unité de recherche : IRIT : Institut de Recherche en Informatique de Toulouse Thèse dirigée par Loïc BARTHE et Brian WYVILL Jury M. Andrea TAGLIASACCHI, Examinateur M. Mathias PAULIN, Examinateur M. Alec JACOBSON, Examinateur M. Loïc BARTHE, Co-directeur de thèse M. Brian WYVILL, Co-directeur de thèse GEOMETRIC MODELING WITH PRIMITIVES by Baptiste Angles B.Sc., Universit´ede Bordeaux, France, 2013 M.Sc., Universit´ede Toulouse, France, 2015 In Partial Fulfillment of the Requirements for the Degree of DOCTOR OF PHILOSOPHY in the Department of Computer Science c Baptiste Angles, 2019 University of Victoria All rights reserved.
    [Show full text]
  • Vectorization Using Stochastic Local Search
    Vectorization Using Stochastic Local Search Byron Knoll CPSC303, University of British Columbia March 29, 2009 Abstract: Stochastic local search can be used for the process of vectorization. In this project, local search is used to optimize the parameters of a set of smooth shapes to represent an image of the Mona Lisa. The shapes are made using quadratic piecewise parametric interpolation. The shapes are partially transparent and are defined by three vertices and a color. The method is successfully applied to a different number of shapes to achieve varying levels of accuracy. More shapes resulted in a greater level of accuracy, but required more time to compute and more memory to store. One of the main advantages of performing vectorization is that it reduces the amount of information required to store an image. Introduction: Vectorization is the process of converting raster graphics into vector graphics. A raster graphics image stores a rectangular grid of pixels. Each pixel represents a point of color. A vector graphics image stores geometric primitives such as points, lines, and curves. The two formats each have their advantages and disadvantages depending on the type of image that they are representing. Vector graphics tend to work well on images which do not contain a fine level of detail. Since the image is stored in the form of geometric primitives, it can be magnified to an arbitrary level without any loss of clarity. The file size of vector images also tends to be much smaller than compressed raster images because information only needs to be stored for each geometric primitive instead of each pixel.
    [Show full text]
  • Interactive Computer Graphics: a Top-Down Approach with Shader
    This page intentionally left blank INTERACTIVE COMPUTER GRAPHICS A TOP-DOWN APPROACH WITH SHADER-BASED OPENGL® 6th Edition This page intentionally left blank INTERACTIVE COMPUTER GRAPHICS A TOP-DOWN APPROACH WITH SHADER-BASED OPENGL® 6th Edition EDWARD ANGEL University of New Mexico • DAVE SHREINER ARM, Inc. Editorial Director: Marcia Horton Editor-in-Chief: Michael Hirsch Acquisitions Editor: Matt Goldstein Editorial Assistant: Chelsea Bell Vice President, Marketing: Patrice Jones Marketing Manager: Yezan Alayan Marketing Coordinator: Kathryn Ferranti Vice President, Production: Vince O’Brien Managing Editor: Jeff Holcomb Senior Production Project Manager: Marilyn Lloyd Senior Operations Supervisor: Alan Fischer Operations Specialist: Lisa McDowell Text Designer: Beth Paquin Cover Designer: Central Covers Cover Art: Hue Walker, Fulldome Project, University of New Mexico Media Editor: Daniel Sandin Media Project Manager: Wanda Rockwell Full-Service Project Management: Coventry Composition Composition: Coventry Composition, using ZzTEX Printer/Binder: Edwards Brothers Cover and Insert Printer: Lehigh-Phoenix Color Text Font: Minion Credits and acknowledgments borrowed from other sources and reproduced, with permission, in this textbook appear on appropriate page within text. Copyright © 2012, 2009, 2006, 2003, 2000 Pearson Education, Inc., publishing as Addison- Wesley. All rights reserved. Manufactured in the United States of America. This publication is protected by Copyright, and permission should be obtained from the publisher prior to any prohibited reproduction, storage in a retrieval system, or transmission in any form or by any means, electronic, mechanical, photocopying, recording, or likewise. To obtain permission(s) to use material from this work, please submit a written request to Pearson Education, Inc., Permissions Department, 501 Boylston Street, Suite 900, Boston, Massachusetts 02116.
    [Show full text]
  • What Is Ray Tracing? – Comparison with Rasterization – Why Now? / Timeline – Reasons and Examples for Using Ray Tracing – Open Issues
    Introduction to Realtime Ray Tracing Course 41 Philipp Slusallek Peter Shirley Bill Mark Gordon Stoll Ingo Wald 1 Introduction to Realtime Ray Tracing • Introduction to Ray Tracing – What is Ray Tracing? – Comparison with Rasterization – Why Now? / Timeline – Reasons and Examples for Using Ray Tracing – Open Issues 2 Introduction to Realtime Ray Tracing Rendering in Computer Graphics Rasterization: Ray Tracing: Projection geometry forward Project image samples backwards Computer graphics has only two basic algorithms for rendering 3D scenes to a 2D screen. The dominant algorithms for interactive computer graphics today is the rasterization algorithm implemented in all graphic chips. It conceptually takes a single triangle at a time, projects it to the screen and paints all covered pixels (subject to the Z-buffer and other test and more or less complex shading computations). Because the HW has no knowledge about the scene it must process every triangle leading to a linear complexity with respect to scene size: Twice the number of triangles leads to twice the rendering time. While here are options to optimize this, must be done in the application separate from the HW. The other algorithm – ray tracing – works in fundamentally different ways. It starts by shooting rays for each pixel into the scenes and uses advanced spatial indexes (aka. acceleration structures) to quickly locate the geometric primitive that is being hit. Because these indexes are hierarchical they allow for a logarithmic complexity: Above something like 1 million triangles the rendering time hardly changes any more. 3 Current Technology: Rasterization • Rasterization-Pipeline – Highly successful technology Application – From graphics supercomputers to Vertex Shader an add-on in a PC chip-set Rasterization • Advantages – Simple and proven algorithm Fragment Shader – Getting faster quickly Fragment Tests – Trend towards full programmability Framebuffer Computer graphics knows two different technologies for generating a 2D image from 3D scene description: rasterization and ray tracing.
    [Show full text]
  • Computer Graphics and Applications Lecture 5: Drawing Geometric Objects Point, Lines and Polygons Geometric Object Are Describe
    Computer Graphics and Applications Lecture 5: Drawing Geometric Objects Point, Lines and Polygons Geometric object are described by a set of vertices and the type of primitive to be drawn (e.g. line, triangle, quadrilateral). Note that a vertex is just a point defined in three dimensional space and how these vertices are connected is determined by the primitive type. Every geometric object is ultimately described as an ordered set of vertices. We use the glVertex*() command to specify a vertex. The '*' is used to indicate that there are variations to the base command glVertex(). Points: A point is represented by a single vertex. Vertices specified by the user as two-dimensional (only x- and y-coordinates) are assigned a z-coordinate equal to zero. To control the size of a rendered point, use glPointSize() and supply the desired size in pixels as the argument. The default is as 1 pixel by 1 pixel point. If the width specified is 2.0, the point will be a square of 2 by 2 pixels. glVertex*() is used to describe a point, but it is only effective between a glBegin() and a glEnd() pair. The argument passed to glBegin() determines what sort of geometric primitive is constructed from the vertices. Lines: In OpenGL, the term line refers to a line segment. The easiest way to specify a line is in terms of the vertices at the endpoints. As with the points above, the argument passed to glBegin() tells it what to do with the vertices. The options for lines include: GL_LINES: Draws a series of unconnected line segments drawn between each set of vertices.
    [Show full text]
  • Determining Geometric Primitives for a 3D GIS: Easy As 1D, 2D, 3D?
    Determining geometric primitives for a 3D GIS: easy as 1D, 2D, 3D? Britt Lonneville1, Cornelis Stal1, Berdien De Roo1, Alain De Wulf1 and Philippe De Maeyer1 1Department of Geography, Ghent University, Krijgslaan 281 (building S8), Ghent, Belgium {Britt.Lonneville, Cornelis.Stal, Berdien.DeRoo, Alain.DeWulf, Philippe.DeMaeyer}@Ugent.be Keywords: 3D GIS, Geometric Primitive, Computational Geometry, Point, Surface, Solid. Abstract: Acquisition techniques such as photo modelling, using SfM-MVS algorithms, are being applied increasingly in several fields of research and render highly realistic and accurate 3D models. Nowadays, these 3D models are mainly deployed for documentation purposes. As these data generally encompass spatial data, the development of a 3D GIS would allow researchers to use these 3D models to their full extent. Such a GIS would allow a more elaborate analysis of these 3D models and thus support the comprehension of the objects that the features in the model represent. One of the first issues that has to be tackled in order to make the resulting 3D models compatible for implementation in a 3D GIS is the choice of a certain geometric primitive to spatially represent the input data. The chosen geometric primitive will not only influence the visualisation of the data, but also the way in which the data can be stored, exchanged, manipulated, queried and understood. Geometric primitives can be one-, two- and three-dimensional. By adding an extra dimension, the complexity of the data increases, but the user is allowed to understand the original situation more intuitively. This research paper tries to give an initial analysis of 1D, 2D and 3D primitives in the framework of the integration of SfM-MVS based 3D models in a 3D GIS.
    [Show full text]
  • 52 COMPUTER GRAPHICS David Dobkin and Seth Teller
    52 COMPUTER GRAPHICS David Dobkin and Seth Teller INTRODUCTION Computer graphics has often been cited as a prime application area for the tech- niques of computational geometry. The histories of the two fields have a great deal of overlap, with similar methods (e.g., sweep-line and area subdivision algorithms) arising independently in each. Both fields have often focused on similar problems, although with different computational models. For example, hidden surface removal (visible surface identification) is a fundamental problem in both fields. At the same time, as the fields have matured, they have brought different requirements to simi- lar problems. The body of this chapter contains an updated version of our chapter in an earlier edition of this Handbook [DT04] in which we aimed to highlight both similarities and differences between the fields. Computational geometry is fundamentally concerned with the efficient quanti- tative representation and manipulation of ideal geometric entities to produce exact results. Computer graphics shares these goals, in part. However, graphics practi- tioners also model the interaction of objects with light and with each other, and the media through which these effects propagate. Moreover, graphics researchers and practitioners: (1) typically use finite precision (rather than exact) representa- tions for geometry; (2) rarely formulate closed-form solutions to problems, instead employing sampling strategies and numerical methods; (3) often design into their algorithms explicit tradeoffs between running time and solution quality; (4) often analyze algorithm performance by defining as primitive operations those that have been implemented in particular hardware architectures rather than by measuring asymptotic behavior; and (5) implement most algorithms they propose.
    [Show full text]
  • Describing 3D Geometric Primitives Using the Gaussian Sphere and the Gaussian Accumulator
    Noname manuscript No. (will be inserted by the editor) Describing 3D Geometric Primitives Using the Gaussian Sphere and the Gaussian Accumulator Zahra Toony · Denis Laurendeau · Christian Gagné Received: date / Accepted: date Abstract Most complex object models are composed 1 Introduction of basic parts or primitives. Being able to decompose a complex 3D model into such basic primitives is an The availability of fast and accurate 3D sensors has important step in reverse engineering. Even when an favored the development of different applications in algorithm can segment a complex model into its primi- assembly, inspection, Computer-Aided Design (CAD), tives, a description technique is still needed in order to reverse engineering, mechanical engineering, medicine, identify the type of each primitive. Most feature extrac- and entertainment, to list just a few. While 2D cam- tion methods fail to describe these basic primitives or eras capture 2D images of the surface of objects, either need a trained classifier on a database of prepared data black-and-white or color, 3D cameras provide informa- to perform this identification. In this paper, we pro- tion on the geometry of an object surface. Today, newly pose a method that can describe basic primitives such introduced 3D cameras can acquire the appearance and as planes, cones, cylinders, spheres, and tori as well as geometry of objects concurrently. partial models of the latter four primitives. To achieve One of the most difficult tasks in machine vision this task, we combine the concept of Gaussian sphere to consists in object recognition/description. Over the a new concept introduced in this paper: the Gaussian past four decades, object recognition has been a very accumulator.
    [Show full text]
  • Geometric Primitive Extraction for 3D Model Reconstruction
    Geometric Primitive Extraction for 3D Model Reconstruction Diego VIEJO, Miguel CAZORLA Dept. Ciencia de la Computación e Inteligencia Artificial Universidad de Alicante Apdo. 99, 03080 Alicante, España {dviejo,miguel}@dccia.ua.es Abstract. 3D map building is a complex robotics task which needs mathematical robust models. From a 3D point cloud, we can use the normal vectors to these points to do feature extraction. In this paper, we propose to extract geometric primitives (e.g. planes) from the cloud. We will present a robust method for normal estimation. Keywords. Robotics, Computer Vision, 3D Map Building, Feature Extraction. Introduction In a previous work, [9], we have developed a method for the reconstruction of a map 3D by means of a mobile robot from a sets of points. As 3D sensor we used a commercial stereo camera that yields both 3D coordinates and grey level. The use of a stereo camera for the acquisition of 3D points supposes a handicap both for the reconstruction of a 3D environment and for the extraction of 3D features due to the fact that the data obtained by means of this type of cameras present a high level of error. In addition, we must face the absence of information when the environment presents objects without texture. Related with this task is the well-known SLAM [7] problem which consists of constructing the map of the environment simultaneously while trying to locate ourselves in it. The difficulty of this problem is rooted on the odometry error. Our previous approach, proposed in [9], consists of a modification of the algorithm ICP using the EM methodology who allows us to reduce sensitively the odometry error.
    [Show full text]