Stereo and Structure from Motion
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Seamless Texture Mapping of 3D Point Clouds
Seamless Texture Mapping of 3D Point Clouds Dan Goldberg Mentor: Carl Salvaggio Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology Rochester, NY November 25, 2014 Abstract The two similar, quickly growing fields of computer vision and computer graphics give users the ability to immerse themselves in a realistic computer generated environment by combining the ability create a 3D scene from images and the texture mapping process of computer graphics. The output of a popular computer vision algorithm, structure from motion (obtain a 3D point cloud from images) is incomplete from a computer graphics standpoint. The final product should be a textured mesh. The goal of this project is to make the most aesthetically pleasing output scene. In order to achieve this, auxiliary information from the structure from motion process was used to texture map a meshed 3D structure. 1 Introduction The overall goal of this project is to create a textured 3D computer model from images of an object or scene. This problem combines two different yet similar areas of study. Computer graphics and computer vision are two quickly growing fields that take advantage of the ever-expanding abilities of our computer hardware. Computer vision focuses on a computer capturing and understanding the world. Computer graphics con- centrates on accurately representing and displaying scenes to a human user. In the computer vision field, constructing three-dimensional (3D) data sets from images is becoming more common. Microsoft's Photo- synth (Snavely et al., 2006) is one application which brought attention to the 3D scene reconstruction field. Many structure from motion algorithms are being applied to data sets of images in order to obtain a 3D point cloud (Koenderink and van Doorn, 1991; Mohr et al., 1993; Snavely et al., 2006; Crandall et al., 2011; Weng et al., 2012; Yu and Gallup, 2014; Agisoft, 2014). -
Robust Tracking and Structure from Motion with Sampling Method
ROBUST TRACKING AND STRUCTURE FROM MOTION WITH SAMPLING METHOD A DISSERTATION SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS for the degree DOCTOR OF PHILOSOPHY IN ROBOTICS Robotics Institute Carnegie Mellon University Peng Chang July, 2002 Thesis Committee: Martial Hebert, Chair Robert Collins Jianbo Shi Michael Black This work was supported in part by the DARPA Tactical Mobile Robotics program under JPL contract 1200008 and by the Army Research Labs Robotics Collaborative Technology Alliance Contract DAAD 19- 01-2-0012. °c 2002 Peng Chang Keywords: structure from motion, visual tracking, visual navigation, visual servoing °c 2002 Peng Chang iii Abstract Robust tracking and structure from motion (SFM) are fundamental problems in computer vision that have important applications for robot visual navigation and other computer vision tasks. Al- though the geometry of the SFM problem is well understood and effective optimization algorithms have been proposed, SFM is still difficult to apply in practice. The reason is twofold. First, finding correspondences, or "data association", is still a challenging problem in practice. For visual navi- gation tasks, the correspondences are usually found by tracking, which often assumes constancy in feature appearance and smoothness in camera motion so that the search space for correspondences is much reduced. Therefore tracking itself is intrinsically difficult under degenerate conditions, such as occlusions, or abrupt camera motion which violates the assumptions for tracking to start with. Second, the result of SFM is often observed to be extremely sensitive to the error in correspon- dences, which is often caused by the failure of the tracking. This thesis aims to tackle both problems simultaneously. -
Self-Driving Car Autonomous System Overview
Self-Driving Car Autonomous System Overview - Industrial Electronics Engineering - Bachelors’ Thesis - Author: Daniel Casado Herráez Thesis Director: Javier Díaz Dorronsoro, PhD Thesis Supervisor: Andoni Medina, MSc San Sebastián - Donostia, June 2020 Self-Driving Car Autonomous System Overview Daniel Casado Herráez "When something is important enough, you do it even if the odds are not in your favor." - Elon Musk - 2 Self-Driving Car Autonomous System Overview Daniel Casado Herráez To my Grandfather, Family, Friends & to my supervisor Javier Díaz 3 Self-Driving Car Autonomous System Overview Daniel Casado Herráez 1. Contents 1.1. Index 1. Contents ..................................................................................................................................................... 4 1.1. Index.................................................................................................................................................. 4 1.2. Figures ............................................................................................................................................... 6 1.1. Tables ................................................................................................................................................ 7 1.2. Algorithms ........................................................................................................................................ 7 2. Abstract ..................................................................................................................................................... -
Basic Stereo & Epipolar Geometry
Basic Stereo & Epipolar Geometry EECS 598-08 Fall 2014! Instructor: Jason Corso (jjcorso)! Foundations of Computer Vision! web.eecs.umich.edu/~jjcorso/t/598F14! ! ! Readings: FP 7; SZ 11.1; TV 7! Date: 10/22/14! ! Materials on these slides have come from many sources in addition to myself; individual slides reference specific sources.! 2 Plan •" Basic Stereo! –" Why is more than one view useful?! –" Triangulation! •" Epipolar Geometry! On to Shape •" What cues help us to perceive 3d shape and depth? Shading [Figure from Prados & Faugeras 2006] Focus/Defocus [Figure from H. Jin and P. Favaro, 2002] Texture [From A.M. Loh. The recovery of 3-D structure using visual texture patterns. PhD thesis] Perspective effects Image credit: S. Seitz Motion Figures from L. Zhang http://www.brainconnection.com/teasers/?main=illusion/motion-shape Estimating scene shape •" Shape from X: Shading, Texture, Focus, Motion… •" Stereo: –" shape from “motion” between two views –" infer 3d shape of scene from two (multiple) images from different viewpoints 10 Can Structure Be Recovered from a Single View? P! p! Ow! Scene! C! Calibration rig! Camera K! From calibration rig! → location/pose of the rig, K! From points and lines at infinity ! + orthogonal lines and planes! → structure of the scene, K! Knowledge about scene (point correspondences, geometry of lines & planes, etc…! Slide source: S. Savarese.! 11 Can Structure Be Recovered from a Single View? Pinhole perspective projection! P! p! Ow! Scene! C! Calibration rig! Camera K! Why is it so difficult?! Intrinsic ambiguity of the mapping from 3D to image (2D)! Slide source: S. Savarese.! 12 Can Structure Be Recovered from a Single View? Intrinsic ambiguity of the mapping from 3D to image (2D)! Courtesy slide S. -
Euclidean Reconstruction of Natural Underwater
EUCLIDEAN RECONSTRUCTION OF NATURAL UNDERWATER SCENES USING OPTIC IMAGERY SEQUENCE By Han Hu B.S. in Remote Sensing Science and Technology, Wuhan University, 2005 M.S. in Photogrammetry and Remote Sensing, Wuhan University, 2009 THESIS Submitted to the University of New Hampshire In Partial Fulfillment of The Requirements for the Degree of Master of Science In Ocean Engineering Ocean Mapping September, 2015 This thesis has been examined and approved in partial fulfillment of the requirements for the degree of M.S. in Ocean Engineering Ocean Mapping by: Thesis Director, Yuri Rzhanov, Research Professor, Ocean Engineering Philip J. Hatcher, Professor, Computer Science R. Daniel Bergeron, Professor, Computer Science On May 26, 2015 Original approval signatures are on file with the University of New Hampshire Graduate School. ii ACKNOWLEDGEMENTS I would like to thank all those people who made this thesis possible. My first debt of sincere gratitude and thanks must go to my advisor, Dr. Yuri Rzhanov. Throughout the entire three-year period, he offered his unreserved help, guidance and support to me in both work and life. Without him, it is definite that my thesis could not be completed successfully. I could not have imagined having a better advisor than him. Besides my advisor in Ocean Engineering, I would like to give many thanks to the rest of my thesis committee and my advisors in the Department of Computer Science: Prof. Philip J. Hatcher and Prof. R. Daniel Bergeron as well for their encouragement, insightful comments and all their efforts in making this thesis successful. I also would like to express my thanks to the Center for Coastal and Ocean Mapping and the Department of Computer Science at the University of New Hampshire for offering me the opportunities and environment to work on exciting projects and study in useful courses from which I learned many advanced skills and knowledge. -
Camera Array Image Rectification and Calibration for Stereoscopic and Autostereoscopic Displays Vincent Nozick
Camera array image rectification and calibration for stereoscopic and autostereoscopic displays Vincent Nozick To cite this version: Vincent Nozick. Camera array image rectification and calibration for stereoscopic and autostereoscopic displays. Annals of Telecommunications - annales des télécommunications, Springer, 2013, 68 (11), pp.581-596. hal-00908586 HAL Id: hal-00908586 https://hal-upec-upem.archives-ouvertes.fr/hal-00908586 Submitted on 25 Nov 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Noname manuscript No. (will be inserted by the editor) Camera array image rectification and calibration for stereoscopic and autostereoscopic displays Vincent Nozick Received: date / Accepted: date Abstract This paper presents an image rectification ically, this paper presents an extension of an image rec- method for an arbitrary number of views with aligned tification method [26] to perform a camera calibration camera center. This paper also describes how to extend for an arbitrary number of views with aligned camera this method to easily perform a robust camera cali- center. This technique can be used for stereoscopic ren- bration. These two techniques can be used for stereo- dering to enhance the perception comfort or for depth scopic rendering to enhance the perception comfort or from stereo. -
Motion Estimation for Self-Driving Cars with a Generalized Camera
2013 IEEE Conference on Computer Vision and Pattern Recognition Motion Estimation for Self-Driving Cars With a Generalized Camera Gim Hee Lee1, Friedrich Fraundorfer2, Marc Pollefeys1 1 Department of Computer Science Faculty of Civil Engineering and Surveying2 ETH Zurich,¨ Switzerland Technische Universitat¨ Munchen,¨ Germany {glee@student, pomarc@inf}.ethz.ch [email protected] Abstract In this paper, we present a visual ego-motion estima- tion algorithm for a self-driving car equipped with a close- to-market multi-camera system. By modeling the multi- camera system as a generalized camera and applying the non-holonomic motion constraint of a car, we show that this leads to a novel 2-point minimal solution for the general- ized essential matrix where the full relative motion includ- ing metric scale can be obtained. We provide the analytical solutions for the general case with at least one inter-camera correspondence and a special case with only intra-camera Figure 1. Car with a multi-camera system consisting of 4 cameras correspondences. We show that up to a maximum of 6 so- (front, rear and side cameras in the mirrors). lutions exist for both cases. We identify the existence of degeneracy when the car undergoes straight motion in the ready available in some COTS cars, we focus this paper special case with only intra-camera correspondences where on using a multi-camera setup for ego-motion estimation the scale becomes unobservable and provide a practical al- which is one of the key features for self-driving cars. A ternative solution. Our formulation can be efficiently imple- multi-camera setup can be modeled as a generalized cam- mented within RANSAC for robust estimation. -
Structure from Motion Using a Hybrid Stereo-Vision System François Rameau, Désiré Sidibé, Cédric Demonceaux, David Fofi
Structure from motion using a hybrid stereo-vision system François Rameau, Désiré Sidibé, Cédric Demonceaux, David Fofi To cite this version: François Rameau, Désiré Sidibé, Cédric Demonceaux, David Fofi. Structure from motion using a hybrid stereo-vision system. 12th International Conference on Ubiquitous Robots and Ambient Intel- ligence, Oct 2015, Goyang City, South Korea. hal-01238551 HAL Id: hal-01238551 https://hal.archives-ouvertes.fr/hal-01238551 Submitted on 5 Dec 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The 12th International Conference on Ubiquitous Robots and Ambient Intelligence (URAI 2015) October 28 30, 2015 / KINTEX, Goyang city, Korea ⇠ Structure from motion using a hybrid stereo-vision system Franc¸ois Rameau, Desir´ e´ Sidibe,´ Cedric´ Demonceaux, and David Fofi Universite´ de Bourgogne, Le2i UMR 5158 CNRS, 12 rue de la fonderie, 71200 Le Creusot, France Abstract - This paper is dedicated to robotic navigation vision system and review the previous works in 3D recon- using an original hybrid-vision setup combining the ad- struction using hybrid vision system. In the section 3.we vantages offered by two different types of camera. This describe our SFM framework adapted for heterogeneous couple of cameras is composed of one perspective camera vision system which does not need inter-camera corre- associated with one fisheye camera. -
Camera Models and Fundamental Concepts Used in Geometric Computer Vision Full Text Available At
Full text available at: http://dx.doi.org/10.1561/0600000023 Camera Models and Fundamental Concepts Used in Geometric Computer Vision Full text available at: http://dx.doi.org/10.1561/0600000023 Camera Models and Fundamental Concepts Used in Geometric Computer Vision Peter Sturm [email protected] Srikumar Ramalingam [email protected] Jean-Philippe Tardif [email protected] Simone Gasparini [email protected] Jo~aoBarreto [email protected] Boston { Delft Full text available at: http://dx.doi.org/10.1561/0600000023 Foundations and Trends R in Computer Graphics and Vision Published, sold and distributed by: now Publishers Inc. PO Box 1024 Hanover, MA 02339 USA Tel. +1-781-985-4510 www.nowpublishers.com [email protected] Outside North America: now Publishers Inc. PO Box 179 2600 AD Delft The Netherlands Tel. +31-6-51115274 The preferred citation for this publication is P. Sturm, S. Ramalingam, J.-P. Tardif, S. Gasparini and J. Barreto, Camera Models and Fundamental Concepts Used in Geometric Computer Vision, Foundations and Trends R in Computer Graphics and Vision, vol 6, nos 1{2, pp 1{183, 2010 ISBN: 978-1-60198-410-4 c 2011 P. Sturm, S. Ramalingam, J.-P. Tardif, S. Gasparini and J. Barreto All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, mechanical, photocopying, recording or otherwise, without prior written permission of the publishers. Photocopying. In the USA: This journal is registered at the Copyright Clearance Cen- ter, Inc., 222 Rosewood Drive, Danvers, MA 01923. -
Motion Estimation for Self-Driving Cars with a Generalized Camera
Motion Estimation for Self-Driving Cars With a Generalized Camera Gim Hee Lee1, Friedrich Fraundorfer2, Marc Pollefeys1 1 Department of Computer Science Faculty of Civil Engineering and Surveying2 ETH Zurich,¨ Switzerland Technische Universitat¨ Munchen,¨ Germany {glee@student, pomarc@inf}.ethz.ch [email protected] Abstract In this paper, we present a visual ego-motion estima- tion algorithm for a self-driving car equipped with a close- to-market multi-camera system. By modeling the multi- camera system as a generalized camera and applying the non-holonomic motion constraint of a car, we show that this leads to a novel 2-point minimal solution for the general- ized essential matrix where the full relative motion includ- ing metric scale can be obtained. We provide the analytical solutions for the general case with at least one inter-camera correspondence and a special case with only intra-camera Figure 1. Car with a multi-camera system consisting of 4 cameras correspondences. We show that up to a maximum of 6 so- (front, rear and side cameras in the mirrors). lutions exist for both cases. We identify the existence of degeneracy when the car undergoes straight motion in the ready available in some COTS cars, we focus this paper special case with only intra-camera correspondences where on using a multi-camera setup for ego-motion estimation the scale becomes unobservable and provide a practical al- which is one of the key features for self-driving cars. A ternative solution. Our formulation can be efficiently imple- multi-camera setup can be modeled as a generalized cam- mented within RANSAC for robust estimation. -
Essential Matrix” • Two Uncalibrated Cameras “Fundamental Matrix” • Gives Epipolar Lines
Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices CS 4495 Computer Vision N-Views (2) – Essential and Fundamental Matrices Aaron Bobick School of Interactive Computing Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices Administrivia • Today: Second half of N-Views (n = 2) • PS 3: Will hopefully be out by Thursday • Will be due October 6th. • Will be based upon last week and today’s material • We may revisit the logistics – suggestions? Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices Two views…and two lectures • Projective transforms from image to image • Some more projective geometry • Points and lines and planes • Two arbitrary views of the same scene • Calibrated – “Essential Matrix” • Two uncalibrated cameras “Fundamental Matrix” • Gives epipolar lines Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices Last time • Projective Transforms: Matrices that provide transformations including translations, rotations, similarity, affine and finally general (or perspective) projection. • When 2D matrices are 3x3; for 3D they are 4x4. Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices Last time: Homographies • Provide mapping between images (image planes) taken from same center of projection; also mapping between any images of a planar surface. PP2 PP1 Two Views Part 2: Essential CS 4495 Computer Vision – A. Bobick and Fundamental Matrices Last time: Projective geometry • A line is a plane of rays through origin – all rays (x,y,z) satisfying: ax + by + cz = 0 x in vector notation : 0 = [a b c]y z l p • A line is also represented as a homogeneous 3-vector l Two Views Part 2: Essential CS 4495 Computer Vision – A. -
Journal of Global Research in Computer Science RENEWED
Volume 2, No. 7, July 2011 Journal of Global Research in Computer Science RESEARCH PAPER Available Online at www.jgrcs.info RENEWED APPROACH FOR IMAGE RECTIFICATION Sahil Bansal* and Seema Baghla *Computer Engineering Section, Yadavindra College of Engineering, Talwandi Sabo [email protected] Computer Engineering Section, Yadavindra College of Engineering, Talwandi Sabo [email protected] Abstract: This paper proposed a new method for image rectification ,it is the process by which the pairs of stereo images of same solid scene taken from different viewpoints in order to produce a pair of “matched epipolar projections” and become parallel to the x-axis of image. A stereo rectified images are helpful for matching algorithms.It restricts that each line parallel to x-axis.The stereo rectification is not unique and actually lead to undesirable distortions. To overcome the drawback of the relative distortion between left image and right image an epipolar line rectification technique is used for point detection and reduce the distortion by minimized the camera rotation angle at each step.By comparative experiments show that the algorithm has an accuracy where other methods fail, namely when the epipolar lines are far from horizontal. Keywords: Rectification, stereovision, epipolar, distortion, camera rotation. INTRODUCTION alignment. Image rectification is usually performed Image Rectification is a transformation process used to regardless of camera precision due to two reasons:- project two-or-more images onto a common image plane. It . Impractically or Impossibility of perfectly aligning corrects image distortion by transforming the image into a cameras. standard coordinate system. Image Rectification is used . Perfectly aligned cameras may become misaligned over in computer stereo vision to simplify the problem of finding time.