3D Depth Cameras in Vision: Benefits and Limitations of the Hardware
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Emerging Opportunities in Virtualization
TECHNOLOGY INSIGHTS FOR BUSINESS LEADERS TECHNOLOGY INSIGHTS FOR BUSINESS LEADERS www.futurum.xyz EMERGING OPPORTUNITIES IN VIRTUALIZATION 2016-2020 DANIEL NEWMAN SHELLY KRAMER OLIVIER BLANCHARD Principal Analyst Principal Analyst Senior Analyst Published: November 1, 2016. TABLE OF CONTENTS 3 Executive Summary 4 Introduction 6 Uses for Virtualization – Breakdown by category 6 Virtual Reality (VR) 8 Augmented Reality and Mixed Reality (AR) 11 Mixed Reality (MR) 13 Industrial virtualization and modeling software 14 Key Takeaways 17 Additional Resources TECHNOLOGY INSIGHTS FUTURUM PREMIUM REPORT | 2 FOR BUSINESS LEADERS EXECUTIVE SUMMARY Futurum Research provides research, insights and analysis to the market that help tie leading and emerging technology solutions to strategic business needs. The purpose behind each of our reports is to help business executives and decision-makers gain a better understanding of the technologies driving digital transformation, connect the dots between the practical business requirements of digital transformation and the forces that impact employees, customers, markets and experiences, and take appro- priate action regarding critical digital transformation opportunities. Executive Summary This report is an overview of the state of virtualization-related opportunities busi- Virtualization Technology in 2016, and fo- nesses should be aware of going into H1 cuses on Virtual Reality, Augmented Real- and H2 2017. Among them: Healthcare, ity, and industrial virtualization and mod- education, design engineering, logistics, eling software. It outlines market trends retail, enterprise collaboration, hospitali- and specific categories of uses that plot ty and travel, virtual business travel, busi- virtualization’s adoption into 2020 and ness reporting, and systems manage- beyond, and identifies some of the key ment. -
Utilizing the Google Project Tango Tablet Development Kit and the Unity Engine for Image and Infrared Data-Based Obstacle Detection for the Visually Impaired
Int'l Conf. Health Informatics and Medical Systems | HIMS'16 | 163 Utilizing the Google Project Tango Tablet Development Kit and the Unity Engine for Image and Infrared Data-Based Obstacle Detection for the Visually Impaired Rabia Jafri1, Rodrigo Louzada Campos2, Syed Abid Ali3 and Hamid R. Arabnia2 1Department of Information Technology, King Saud University, Riyadh, Saudi Arabia 2Department of Computer Science, University of Georgia, Athens, Georgia, U.S.A. 3Araware LLC, Wilmington, Delaware, U.S.A Abstract: A novel image and infrared data-based application of within a few centimeters. Its integrated infra-red based to assist visually impaired (VI) users in detecting and depth sensors also allow it to measure the distance from the avoiding obstacles in their path while independently device to objects in the real world providing depth data about navigating indoors is proposed. The application will be the objects in the form of point clouds. The depth sensors and developed for the recently introduced Google Project Tango the visual sensors are synchronized, facilitating the Tablet Development Kit equipped with a powerful graphics integration of the data from these two modalities. processor and several sensors which allow it to track its Our project aims to utilize the Project Tango tablet to motion and orientation in 3D space in real-time. It will develop a system to assist VI users in detecting and avoiding exploit the inbuilt functionalities of the Unity engine in the obstacles in their path during navigation in an indoors Tango SDK to create a 3D reconstruction of the surrounding environment. The system will exploit the inbuilt environment and to detect obstacles. -
Depth Camera Technology Comparison and Performance Evaluation
DEPTH CAMERA TECHNOLOGY COMPARISON AND PERFORMANCE EVALUATION Benjamin Langmann, Klaus Hartmann and Otmar Loffeld ZESS – Center for Sensor Systems, University of Siegen, Paul-Bonatz-Strasse 9-11, Siegen, Germany Keywords: Depth, Range camera, Time-of-Flight, ToF, Photonic Mixer Device, PMD, Comparison. Abstract: How good are cheap depth cameras, namely the Microsoft Kinect, compared to state of the art Time-of- Flight depth cameras? In this paper several depth cameras of different types were put to the test on a variety of tasks in order to judge their respective performance and to find out their weaknesses. We will concentrate on the common area of applications for which both types are specified, i.e. near field indoor scenes. The characteristics and limitations of the different technologies as well as the concrete hardware implementations are discussed and evaluated with a set of experimental setups. Especially, the noise level and the axial and angular resolutions are compared. Additionally, refined formulas to generate depth values based on the raw measurements of the Kinect are presented. 1 INTRODUCTION when used outdoors. Currently, ToF imaging chips reaching resolutions up to 200x200 pixels are on the Depth or range cameras have been developed for market and chips with 352x288 pixels are in several years and are available to researchers as well development and for a few years some ToF chips as on the market for certain applications for about a feature methods to suppress ambient light (e.g. decade. PMDTec, Mesa Imaging, 3DV Systems and Suppression of Background Illumination - SBI). Canesta were the companies driving the Other depth cameras or measuring devices, such development of Time-of-Flight (ToF) depth as laser scanners or structured light approaches, cameras. -
Game Implementation in Real-Time Using the Project Tango
Senior Project Game Implementation in Real-Time using the Project Tango By: Samantha Bhuiyan Senior Advisor: Professor Kurfess June 16, 2017 Figure 1: Project Tango Device Table of Contents Abstract 3 List of Figures 4 List of Tables 5 Project Overview 6 Requirements and Features 7 System Design and Architecture 15 Validation and Evaluation 17 Conclusions 19 Citations 20 2 ABSTRACT The goal of this senior project is to spread awareness of augmented reality, which Google defines as “a technology that superimposes a computer-generated image on a user's view of the real world, thus providing a composite view.” It’s a topic that is rarely known to those outside of a technology related field or one that has vested interest in technology. Games can be ideal tools to help educate the public on any subject matter. The task is to create an augmented reality game using a “learn by doing” method. The game will introduce players to augmented reality, and thus demonstrate how this technology can be combined with the world around them. The Tango, Unity and Vuforia are the tools to be used for development. The game itself will be a coin collecting game that changes dynamically to the world around the player. 3 List of Figures Figure 1: Project Tango Device 1 Figure 2: Ceiling with an array of markers 9 Figure 3: floor with an array of markers point 10 Figure 4: Vuforia stock image used for image recognition 11 Figure 5: Initial System Design 15 Figure 6: Final System design 15 Figure 7: Unity view of game 17 4 List of Tables Table 1: Evaluation Criteria 8-9 Table 2: Example of code for coin collection 11 Table 3: Research Questions 12 5 Project Overview Background Google developed the Tango to enable users to take advantage of the physical world around them. -
Direct Interaction with Large Displays Through Monocular Computer Vision
DIRECT INTERACTION WITH LARGE DISPLAYS THROUGH MONOCULAR COMPUTER VISION A thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy in the School of Information Technologies at The University of Sydney Kelvin Cheng October 2008 © Copyright by Kelvin Cheng 2008 All Rights Reserved Abstract Large displays are everywhere, and have been shown to provide higher productivity gain and user satisfaction compared to traditional desktop monitors. The computer mouse remains the most common input tool for users to interact with these larger displays. Much effort has been made on making this interaction more natural and more intuitive for the user. The use of computer vision for this purpose has been well researched as it provides freedom and mobility to the user and allows them to interact at a distance. Interaction that relies on monocular computer vision, however, has not been well researched, particularly when used for depth information recovery. This thesis aims to investigate the feasibility of using monocular computer vision to allow bare-hand interaction with large display systems from a distance. By taking into account the location of the user and the interaction area available, a dynamic virtual touchscreen can be estimated between the display and the user. In the process, theories and techniques that make interaction with computer display as easy as pointing to real world objects is explored. Studies were conducted to investigate the way human point at objects naturally with their hand and to examine the inadequacy in existing pointing systems. Models that underpin the pointing strategy used in many of the previous interactive systems were formalized. -
Connecticut DEEP's List of Compliant Electronics Manufacturers Notice to Connecticut Retailersi
Connecticut DEEP’s List of Compliant Electronics manufacturers Notice to Connecticut Retailersi: This list below identifies electronics manufacturers that are in compliance with the registration and payment obligations under Connecticut’s State-wide Electronics Program. Retailers must check this list before selling Covered Electronic Devices (“CEDs”) in Connecticut. If there is a brand of a CED that is not listed below including retail over the internet, the retailer must not sell the CED to Connecticut consumers pursuant to section 22a-634 of the Connecticut General Statutes. Manufacturer Brands CED Type Acer America Corp. Acer Computer, Monitor, Television, Printer eMachines Computer, Monitor Gateway Computer, Monitor, Television ALR Computer, Monitor Gateway 2000 Computer, Monitor AG Neovo Technology AG Neovo Monitor Corporation Amazon Fulfillment Service, Inc. Kindle Computers Amazon Kindle Kindle Fire Fire American Future Technology iBuypower Computer Corporation dba iBuypower Apple, Inc. Apple Computer, Monitor, Printer NeXT Computer, Monitor iMac Computer Mac Pro Computer Mac Mini Computer Thunder Bolt Display Monitor Archos, Inc. Archos Computer ASUS Computer International ASUS Computer, Monitor Eee Computer Nexus ASUS Computer EEE PC Computer Atico International USA, Inc. Digital Prism Television ATYME CORPRATION, INC. ATYME Television Bang & Olufsen Operations A/S Bang & Olufsen Television BenQ America Corp. BenQ Monitor Best Buy Insignia Television Dynex Television UB Computer Toshiba Television VPP Matrix Computer, Monitor Blackberry Limited Balckberry PlayBook Computer Bose Corp. Bose Videowave Television Brother International Corp. Brother Monitor, Printer Canon USA, Inc. Canon Computer, Monitor, Printer Oce Printer Imagistics Printer Cellco Partnership Verizon Ellipsis Computer Changhong Trading Corp. USA Changhong Television (Former Guangdong Changhong Electronics Co. LTD) Craig Electronics Craig Computer, Television Creative Labs, Inc. -
Google Unveils 'Project Tango' 3D Smartphone Platform 20 February 2014
Google unveils 'Project Tango' 3D smartphone platform 20 February 2014 "What if directions to a new location didn't stop at the street address? What if you never again found yourself lost in a new building? What if the visually impaired could navigate unassisted in unfamiliar indoor places? What if you could search for a product and see where the exact shelf is located in a super-store?" The technology could also be used for "playing hide- and-seek in your house with your favorite game character, or transforming the hallways into a tree- lined path." Smartphones are equipped with sensors which make over 1.4 million measurements per second, updating the positon and rotation of the phone. A reporter uses a cell phone to take a photograph of Google Android 3.0 Honeycomb OS being used on a Partners in the project include researchers from the Motorola Xoon tablet during a press event at Google headquarters on February 2, 2011 in Mountain View, University of Minnesota, George Washington California University, German tech firm Bosch and the Open Source Robotics Foundation, among others. Another partner is California-based Movidius, which Google announced a new research project makes vision-processor technology for mobile and Thursday aimed at bringing 3D technology to portable devices and will provide the processor smartphones, for potential applications such as platform. indoor mapping, gaming and helping blind people navigate. Movidius said in a statement the goal was "to mirror human vision with a newfound level of depth, clarity The California tech giant said its "Project Tango" and realism on mobile and portable connected would provide prototypes of its new smartphone to devices." outside developers to encourage the writing of new applications. -
Interacting with Autostereograms
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/336204498 Interacting with Autostereograms Conference Paper · October 2019 DOI: 10.1145/3338286.3340141 CITATIONS READS 0 39 5 authors, including: William Delamare Pourang Irani Kochi University of Technology University of Manitoba 14 PUBLICATIONS 55 CITATIONS 184 PUBLICATIONS 2,641 CITATIONS SEE PROFILE SEE PROFILE Xiangshi Ren Kochi University of Technology 182 PUBLICATIONS 1,280 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: Color Perception in Augmented Reality HMDs View project Collaboration Meets Interactive Spaces: A Springer Book View project All content following this page was uploaded by William Delamare on 21 October 2019. The user has requested enhancement of the downloaded file. Interacting with Autostereograms William Delamare∗ Junhyeok Kim Kochi University of Technology University of Waterloo Kochi, Japan Ontario, Canada University of Manitoba University of Manitoba Winnipeg, Canada Winnipeg, Canada [email protected] [email protected] Daichi Harada Pourang Irani Xiangshi Ren Kochi University of Technology University of Manitoba Kochi University of Technology Kochi, Japan Winnipeg, Canada Kochi, Japan [email protected] [email protected] [email protected] Figure 1: Illustrative examples using autostereograms. a) Password input. b) Wearable e-mail notification. c) Private space in collaborative conditions. d) 3D video game. e) Bar gamified special menu. Black elements represent the hidden 3D scene content. ABSTRACT practice. This learning effect transfers across display devices Autostereograms are 2D images that can reveal 3D content (smartphone to desktop screen). when viewed with a specific eye convergence, without using CCS CONCEPTS extra-apparatus. -
Structured Light Based 3D Scanning for Specular Surface by the Combination of Gray Code and Phase Shifting
The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Volume XLI-B3, 2016 XXIII ISPRS Congress, 12–19 July 2016, Prague, Czech Republic STRUCTURED LIGHT BASED 3D SCANNING FOR SPECULAR SURFACE BY THE COMBINATION OF GRAY CODE AND PHASE SHIFTING Yujia Zhanga, Alper Yilmaza,∗ a Dept. of Civil, Environmental and Geodetic Engineering, The Ohio State University 2036 Neil Avenue, Columbus, OH 43210 USA - (zhang.2669, yilmaz.15)@osu.edu Commission III, WG III/1 KEY WORDS: Structured Light, 3D Scanning, Specular Surface, Maximum Min-SW Gray Code, Phase Shifting ABSTRACT: Surface reconstruction using coded structured light is considered one of the most reliable techniques for high-quality 3D scanning. With a calibrated projector-camera stereo system, a light pattern is projected onto the scene and imaged by the camera. Correspondences between projected and recovered patterns are computed in the decoding process, which is used to generate 3D point cloud of the surface. However, the indirect illumination effects on the surface, such as subsurface scattering and interreflections, will raise the difficulties in reconstruction. In this paper, we apply maximum min-SW gray code to reduce the indirect illumination effects of the specular surface. We also analysis the errors when comparing the maximum min-SW gray code and the conventional gray code, which justifies that the maximum min-SW gray code has significant superiority to reduce the indirect illumination effects. To achieve sub-pixel accuracy, we project high frequency sinusoidal patterns onto the scene simultaneously. But for specular surface, the high frequency patterns are susceptible to decoding errors. Incorrect decoding of high frequency patterns will result in a loss of depth resolution. -
A Residual Network and FPGA Based Real-Time Depth Map Enhancement System
entropy Article A Residual Network and FPGA Based Real-Time Depth Map Enhancement System Zhenni Li 1, Haoyi Sun 1, Yuliang Gao 2 and Jiao Wang 1,* 1 College of Information Science and Engineering, Northeastern University, Shenyang 110819, China; [email protected] (Z.L.); [email protected] (H.S.) 2 College of Artificial Intelligence, Nankai University, Tianjin 300071, China; [email protected] * Correspondence: [email protected] Abstract: Depth maps obtained through sensors are often unsatisfactory because of their low- resolution and noise interference. In this paper, we propose a real-time depth map enhancement system based on a residual network which uses dual channels to process depth maps and intensity maps respectively and cancels the preprocessing process, and the algorithm proposed can achieve real- time processing speed at more than 30 fps. Furthermore, the FPGA design and implementation for depth sensing is also introduced. In this FPGA design, intensity image and depth image are captured by the dual-camera synchronous acquisition system as the input of neural network. Experiments on various depth map restoration shows our algorithms has better performance than existing LRMC, DE-CNN and DDTF algorithms on standard datasets and has a better depth map super-resolution, and our FPGA completed the test of the system to ensure that the data throughput of the USB 3.0 interface of the acquisition system is stable at 226 Mbps, and support dual-camera to work at full speed, that is, 54 fps@ (1280 × 960 + 328 × 248 × 3). Citation: Li, Z.; Sun, H.; Gao, Y.; Keywords: depth map enhancement; residual network; FPGA; ToF Wang, J. -
Deep Learning Whole Body Point Cloud Scans from a Single Depth Map
Deep Learning Whole Body Point Cloud Scans from a Single Depth Map Nolan Lunscher John Zelek University of Waterloo University of Waterloo 200 University Ave W. 200 University Ave W. [email protected] [email protected] Abstract ture the complete 3D structure of an object or person with- out the need for the machine or operator to necessarily be Personalized knowledge about body shape has numerous an expert at how to take every measurement. Scanning also applications in fashion and clothing, as well as in health has the benefit that all possible measurements are captured, monitoring. Whole body 3D scanning presents a relatively rather than only measurements at specific points [27]. simple mechanism for individuals to obtain this information Beyond clothing, understanding body measurements and about themselves without needing much knowledge of an- shape can provide details about general health and fitness. thropometry. With current implementations however, scan- In recent years, products such as Naked1 and ShapeScale2 ning devices are large, complex and expensive. In order to have begun to be released with the intention of capturing 3D make such systems as accessible and widespread as pos- information about a person, and calculating various mea- sible, it is necessary to simplify the process and reduce sures such as body fat percentage and muscle gains. The their hardware requirements. Deep learning models have technologies can then be used to monitor how your body emerged as the leading method of tackling visual tasks, in- changes overtime, and offer detailed fitness tracking. cluding various aspects of 3D reconstruction. In this paper Depth map cameras and RGBD cameras, such as the we demonstrate that by leveraging deep learning it is pos- XBox Kinect or Intel Realsense, have become increasingly sible to create very simple whole body scanners that only popular in 3D scanning in recent years, and have even made require a single input depth map to operate. -
Real-Time Depth Imaging
TU Berlin, Fakultät IV, Computer Graphics Real-time depth imaging vorgelegt von Diplom-Mediensystemwissenschaftler Uwe Hahne aus Kirchheim unter Teck, Deutschland Von der Fakultät IV - Elektrotechnik und Informatik der Technischen Universität Berlin zur Erlangung des akademischen Grades Doktor der Ingenieurwissenschaften — Dr.-Ing. — genehmigte Dissertation Promotionsausschuss: Vorsitzender: Prof. Dr.-Ing. Olaf Hellwich Berichter: Prof. Dr.-Ing. Marc Alexa Berichter: Prof. Dr. Andreas Kolb Tag der wissenschaftlichen Aussprache: 3. Mai 2012 Berlin 2012 D83 For my family. Abstract This thesis depicts approaches toward real-time depth sensing. While humans are very good at estimating distances and hence are able to smoothly control vehicles and their own movements, machines often lack the ability to sense their environ- ment in a manner comparable to humans. This discrepancy prevents the automa- tion of certain job steps. We assume that further enhancement of depth sensing technologies might change this fact. We examine to what extend time-of-flight (ToF) cameras are able to provide reliable depth images in real-time. We discuss current issues with existing real-time imaging methods and technologies in detail and present several approaches to enhance real-time depth imaging. We focus on ToF imaging and the utilization of ToF cameras based on the photonic mixer de- vice (PMD) principle. These cameras provide per pixel distance information in real-time. However, the measurement contains several error sources. We present approaches to indicate measurement errors and to determine the reliability of the data from these sensors. If the reliability is known, combining the data with other sensors will become possible. We describe such a combination of ToF and stereo cameras that enables new interactive applications in the field of computer graph- ics.