Geometric Considerations for College of Computing-0280 Graphics, Visualization Stereoscopic Virtual Environments & Usability Center Atlanta, Georgia 30332

Total Page:16

File Type:pdf, Size:1020Kb

Geometric Considerations for College of Computing-0280 Graphics, Visualization Stereoscopic Virtual Environments & Usability Center Atlanta, Georgia 30332 Larry F. Hodges Geometric Considerations for College of Computing-0280 Graphics, Visualization Stereoscopic Virtual Environments & Usability Center Atlanta, Georgia 30332 Elizabeth Thorpe Davis of School Psychology-0170 Abstract Graphics, Visualization & Center Usability We examine the relationship among the different geometries implicit in a stereoscopic Institute of Georgia Technology virtual environment. In particular, we examine in detail the relationship of retinal dispar- Atlanta, 30332 Georgia ity, fixation point, binocular visual direction, and screen parallax. We introduce the con- cept of a volumetric spatial unit called a stereoscopic voxel. Due to the shape of stereo- scopic voxels, apparent depth of points in space may be affected by their horizontal placement. Downloaded from http://direct.mit.edu/pvar/article-pdf/2/1/34/1622602/pres.1993.2.1.34.pdf by guest on 26 September 2021 I Introduction The display component of the most common implementations of virtual environments provides the user with a visual image that incorporates stereopsis as a visual cue. Examples include head-mounted displays (Teitel, 1990), time- multiplexed CRT-based displays (Deering, 1992), and time-multiplexed pro- jection systems (Cruz-Neira, Sandin, DeFanti, Kenyon, & Hart, 1992). It is not always recognized, however, that the characteristics of a stereoscopic image can be very different from that of a monoscopic perspective image. The visual impression given by stereoscopic images are very sensitive to the geometry of the visual system of a user, the geometry ofthe display environment, and the modeling geometry assumed in the computation of the scene. In this paper we present a tutorial whose purpose is to review these basic geometries and to ana- lyze their relationship to each other in a stereoscopic virtual environment. In Section 2 we review the geometry of binocular vision and retinal disparity. In Section 3 we discuss modeling geometry and its relationship to retinal dis- parity. Section 4 describes the effects of the discrete nature of display geometry and the distortions caused by optical and tracking artifacts. In Section 5 we re- visit our approximate model of the visual system geometry and discuss its limi- tations. 2 Visual System Geometry Stereopsis results from the two slightly different views of the external world that our laterally displaced eyes receive (e.g., Schor, 1987; Tyler, 1983). This difference is quantified in terms of retinal disparity. If both eyes are fixated on a point, f1, in space, then an image of f 1 is focused at corresponding points in the center of the fovea1 of each eye. Another point, f2, at a different spatial Presence, Volume 2, Number I, Winter 1993 1. The fovea is the part of the human retina that possesses the best spatial resolution or visual © 1993 The Massachusetts Institute of Technology acuity. 34 PRESENCE: VOLUME 2, NUMBER I Hodges and Davis 35 location, would be imaged at points in each eye that may not be the same distance from the fovea. This difference in distance is the retinal disparity. Normally we measure this distance as the sum of the angles ôi + Ô2 as shown in Figure 1, where angles measured from the center of fo- vea toward the outside of each eye are negative. In the example shown in Figure 1, ôi has a positive value, ô2 has a negative value, and their sum (8i + ô2) is positive. The retinal disparity that results from these two differ- ent views can provide information about the distance or Left Eye Right Eye Downloaded from http://direct.mit.edu/pvar/article-pdf/2/1/34/1622602/pres.1993.2.1.34.pdf by guest on 26 September 2021 depth of an object as well as about the shape of an ob- Retinal = Si + 52 ject. Stereoacuity is the smallest depth that can be de- disparity tected based on retinal disparity. In some humans, under Figure I. Retinal disparity. optimal stimulus conditions, stereoacuities of 5" or less can be obtained (Westheimer & McKee, 1980; McKee, stereoacuities of more than 5" are more 1983); however, where a is the angle of convergence, Di is the distance common 8c Shor 8c (Davis, King, Anoskey, 1992; from the midpoint of the interocular axis to the fixated Wood, 1983). point, f1, and i is the interocular distance (Arditi, 1986; Graham, 1965).2 (See Fig. 2.) Notice that the angle of 2.1 Binocular Visual Direction convergence, a, is inversely related to the distance, D\, of the fixated from the observer; this inverse rela- Visual direction is the location of point perceived spatial tion is nonlinear. an object relative to the observer. Usually, it is measured For another point in space, f2, located at a distance in terms of azimuth (left and of the of fixa- right point D2, the of convergence is ß Note that and of elevation and below the of angle (see Fig. 2). tion) (above point since fixation). Often, the binocular visual direction of an ob- ject is not the same as the monocular visual direction of a + a + c + b + d= 180 either can this at a eye. (You verify yourself by looking and very close object first with one eye, then with the other eye, then with both eyes. Notice how the apparent spa- ß + c + d = 180 tial location of the object changes.) Hering proposed then that binocular visual direction will lie midway between the directions of the monocular others have re- a ß = (-a) + (-b) = (8, + 82) images; - ported that the binocular visual direction will lie some- The difference in vergence (a ß) is angles — equiva- where between the left and right monocular visual direc- lent to the retinal disparity between the two points in tions, but not necessarily midway (e.g., Tyler, 1983). space, measured in units of visual angle. This retinal dis- parity is monotonically related to the depth between the 2.2 Convergence Angles and Retinal Disparities 2. For asymmetric convergence of the two eyes, the formula for the For symmetric convergence of the two eyes on a angle of convergence is basically the same as that shown for symmetric The difference is that D now the fixated in space, f1, the of convergence is convergence. represents perpendicular point angle distance from the interocular axis to the frontoparallel plane that inter- defined as sects the asymmetrically converged point of fixation. This interpreta- tion of the convergence angle formula for asymmetric convergence is a = 2 arctan(¿/2D!) not exact, but it is a good approximation. 36 PRESENCE: VOLUME 2, NUMBER I Dh * Downloaded from http://direct.mit.edu/pvar/article-pdf/2/1/34/1622602/pres.1993.2.1.34.pdf by guest on 26 September 2021 d2 Figure 2. Convergence angles. Figure 3. Vieth-Mueller circle. two points in space (i.e., at a constant distance, D\, a The Vieth-Mueller circle is a theoretical horopter de- larger depth corresponds to a larger retinal disparity); fined only in terms of geometric considerations. This this monotonie relationship is a nonlinear one. horopter is a circle in the horizontal plane that intersects If an object is closer than the fixation point, the retinal each eye at the first nodal point of the eye's lens system disparity will be a negative value. This is known as a (Gulick & Lawson, 1974; Ogle, 1968). (See Fig. 3.) crossed disparity because the two eyes must cross to fixate This circle defines a locus of points with zero disparity. the closer object. Conversely, if an object is farther than However, in devising this theoretical horopter it is as- the fixation point, the retinal disparity will be a positive sumed that the eyes are perfect spheres with perfectly value. This is known as uncrossed disparity because the spherical optics and that the eyes rotate about axes that two eyes must uncross to fixate the farther object. An pass only through their first optical nodal points (e.g., object located at the fixation point or whose image falls Arditi, 1986; Bishop, 1985, Gulick 6k Lawson, 1974). on corresponding points in the two retinae has a zero None of these assumptions is strictly true. Thus, when disparity. one compares the Vieth-Mueller circle to any empiri- cally determined horopter there is a discrepancy between the theoretical and With few 2.3 Horopters empirical horopters. excep- tions (Deering, 1992) the eye geometry used for image Corresponding points on the two retinae are de- calculations for stereoscopic virtual environments as- fined as being the same vertical and horizontal distance sume a visual model consistent with the Veith-Mueller from the center of the fovea in each eye (e.g., Tyler, circle. We will address the results of this assumption in 1983; Arditi, 1986; Davis & Hodges, 1994). When the Section 5. two eyes binocularly fixate on a given point in space, Horopters usually describe a locus of points that there is a locus of points in space that fall on correspond- should result in zero disparity. Stereopsis, however, occurs ing points in the two retinae. This locus of points is the when there is a nonzero disparity that gives rise to the horopter, a term originally used by Aguilonius in 1613. percept of depth. That is, an object or visual stimulus The horopter can be defined either theoretically or em- can appear closer or farther than the horopter for crossed pirically. and uncrossed disparity, respectively. Hodges and Davis 37 Display Screen Left eye position Downloaded from http://direct.mit.edu/pvar/article-pdf/2/1/34/1622602/pres.1993.2.1.34.pdf by guest on 26 September 2021 Right eye position Object with negative parallax Figure 4. Screen parallax. 3 Modeling Geometry The amount of screen parallax in this case may be computed with respect to the geometric model of the For any type of stereoscopic display, we are model- scene as what would be seen each if the were ing by eye image = p i(D d)/D projected onto a screen or window being viewed by an - observer (Hodges, 1992).
Recommended publications
  • The Application of Voxel Size Correction in X-Ray Computed Tomography for Dimensional Metrology
    SINCE2013 Singapore International NDT Conference & Exhibition 2013, 19-20 July 2013 The Application of Voxel Size Correction in X-ray Computed Tomography for Dimensional Metrology Joseph J. LIFTON1,2, Andrew A. MALCOLM2, John W. MCBRIDE1,3, Kevin J. CROSS1 1The University of Southampton, United Kingdom; Email: [email protected]. 2Singapore Institute of Manufacturing Technology, Singapore; Email: [email protected]. 3The University of Southampton Malaysia Campus (USMC), Malaysia; Email: [email protected]. Abstract X-ray computed tomography (CT) is a non-destructive, radiographic scanning technique that enables the visualisation and dimensional evaluation of both internal and external features of a workpiece; it is therefore an attractive alternative for measurement tasks that prove problematic for conventional tactile and optical instruments. The data output of a CT measurement is a volume of grey value integers that describe the material distribution of the scanned workpiece; the relative spacing of volume-elements (voxels) is termed voxel size and influences all dimensional information evaluated from a CT data-set. Voxel size is defined by the position of a workpiece relative to the X-ray source and detector, and is therefore prone to axis position errors, errors in the geometric alignment of the CT system’s hardware, and the positional drift of the X-ray focal spot. In this work a method is presented for calculating a voxel scaling factor that corrects for voxel size errors, and this method is then applied to a general X-ray CT measurement task and demonstrated to reduce measurement errors. Keywords: X-ray computed tomography, dimensional metrology, calibration, uncertainty, voxel size.
    [Show full text]
  • Temporal Voxel Cone Tracing with Interleaved Sample Patterns by Sanghyeok Hong
    c 2015, SangHyeok Hong. All Rights Reserved. The material presented within this document does not necessarily reflect the opinion of the Committee, the Graduate Study Program, or DigiPen Institute of Technology. TEMPORAL VOXEL CONE TRACING WITH INTERLEAVED SAMPLE PATTERNS BY SangHyeok Hong THESIS Submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science awarded by DigiPen Institute of Technology Redmond, Washington United States of America March 2015 Thesis Advisor: Gary Herron DIGIPEN INSTITUTE OF TECHNOLOGY GRADUATE STUDIES PROGRAM DEFENSE OF THESIS THE UNDERSIGNED VERIFY THAT THE FINAL ORAL DEFENSE OF THE MASTER OF SCIENCE THESIS TITLED Temporal Voxel Cone Tracing with Interleaved Sample Patterns BY SangHyeok Hong HAS BEEN SUCCESSFULLY COMPLETED ON March 12th, 2015. MAJOR FIELD OF STUDY: COMPUTER SCIENCE. APPROVED: Dmitri Volper date Xin Li date Graduate Program Director Dean of Faculty Dmitri Volper date Claude Comair date Department Chair, Computer Science President DIGIPEN INSTITUTE OF TECHNOLOGY GRADUATE STUDIES PROGRAM THESIS APPROVAL DATE: March 12th, 2015 BASED ON THE CANDIDATE'S SUCCESSFUL ORAL DEFENSE, IT IS RECOMMENDED THAT THE THESIS PREPARED BY SangHyeok Hong ENTITLED Temporal Voxel Cone Tracing with Interleaved Sample Patterns BE ACCEPTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE AT DIGIPEN INSTITUTE OF TECHNOLOGY. Gary Herron date Xin Li date Thesis Committee Chair Thesis Committee Member Pushpak Karnick date Matt
    [Show full text]
  • Human Body Model Acquisition and Tracking Using Voxel Data
    Submitted to the International Journal of Computer Vision Human Body Model Acquisition and Tracking using Voxel Data Ivana Mikić2, Mohan Trivedi1, Edward Hunter2, Pamela Cosman1 1Department of Electrical and Computer Engineering University of California, San Diego 2Q3DM, Inc. Abstract We present an integrated system for automatic acquisition of the human body model and motion tracking using input from multiple synchronized video streams. The video frames are segmented and the 3D voxel reconstructions of the human body shape in each frame are computed from the foreground silhouettes. These reconstructions are then used as input to the model acquisition and tracking algorithms. The human body model consists of ellipsoids and cylinders and is described using the twists framework resulting in a non-redundant set of model parameters. Model acquisition starts with a simple body part localization procedure based on template fitting and growing, which uses prior knowledge of average body part shapes and dimensions. The initial model is then refined using a Bayesian network that imposes human body proportions onto the body part size estimates. The tracker is an extended Kalman filter that estimates model parameters based on the measurements made on the labeled voxel data. A voxel labeling procedure that handles large frame-to-frame displacements was designed resulting in the very robust tracking performance. Extensive evaluation shows that the system performs very reliably on sequences that include different types of motion such as walking, sitting, dancing, running and jumping and people of very different body sizes, from a nine year old girl to a tall adult male. 1. Introduction Tracking of the human body, also called motion capture or posture estimation, is a problem of estimating the parameters of the human body model (such as joint angles) from the video data as the position and configuration of the tracked body change over time.
    [Show full text]
  • Photorealistic Scene Reconstruction by Voxel Coloring
    In Proc. Computer Vision and Pattern Recognition Conf., pp. 1067-1073, 1997. Photorealistic Scene Reconstruction by Voxel Coloring Steven M. Seitz Charles R. Dyer Department of Computer Sciences University of Wisconsin, Madison, WI 53706 g E-mail: fseitz,dyer @cs.wisc.edu WWW: http://www.cs.wisc.edu/computer-vision Broad Viewpoint Coverage: Reprojections should be Abstract accurate over a large range of target viewpoints. This A novel scene reconstruction technique is presented, requires that the input images are widely distributed different from previous approaches in its ability to cope about the environment with large changes in visibility and its modeling of in- trinsic scene color and texture information. The method The photorealistic scene reconstruction problem, as avoids image correspondence problems by working in a presently formulated, raises a number of unique challenges discretized scene space whose voxels are traversed in a that push the limits of existing reconstruction techniques. fixed visibility ordering. This strategy takes full account Photo integrity requires that the reconstruction be dense of occlusions and allows the input cameras to be far apart and sufficiently accurate to reproduce the original images. and widely distributed about the environment. The algo- This criterion poses a problem for existing feature- and rithm identifies a special set of invariant voxels which to- contour-based techniques that do not provide dense shape gether form a spatial and photometric reconstruction of the estimates. While these techniques can produce texture- scene, fully consistent with the input images. The approach mapped models [1, 3, 4], accuracy is ensured only in places is evaluated with images from both inward- and outward- where features have been detected.
    [Show full text]
  • Online Detector Response Calculations for High-Resolution PET Image Reconstruction
    Home Search Collections Journals About Contact us My IOPscience Online detector response calculations for high-resolution PET image reconstruction This article has been downloaded from IOPscience. Please scroll down to see the full text article. 2011 Phys. Med. Biol. 56 4023 (http://iopscience.iop.org/0031-9155/56/13/018) View the table of contents for this issue, or go to the journal homepage for more Download details: IP Address: 171.65.80.217 The article was downloaded on 15/06/2011 at 20:11 Please note that terms and conditions apply. IOP PUBLISHING PHYSICS IN MEDICINE AND BIOLOGY Phys. Med. Biol. 56 (2011) 4023–4040 doi:10.1088/0031-9155/56/13/018 Online detector response calculations for high-resolution PET image reconstruction Guillem Pratx1 and Craig Levin2 1 Department of Radiation Oncology, Stanford University, Stanford, CA 94305, USA 2 Departments of Radiology, Physics and Electrical Engineering, and Molecular Imaging Program at Stanford, Stanford University, Stanford, CA 94305, USA E-mail: [email protected] Received 3 January 2011, in final form 19 May 2011 Published 15 June 2011 Online at stacks.iop.org/PMB/56/4023 Abstract Positron emission tomography systems are best described by a linear shift- varying model. However, image reconstruction often assumes simplified shift- invariant models to the detriment of image quality and quantitative accuracy. We investigated a shift-varying model of the geometrical system response based on an analytical formulation. The model was incorporated within a list- mode, fully 3D iterative reconstruction process in which the system response coefficients are calculated online on a graphics processing unit (GPU).
    [Show full text]
  • Voxel Game Engine Using Blender Game Engine
    TALLINN UNIVERSITY OF TECHNOLOGY Faculty of Information Technology Department of Computer Science ITV40LT Fred-Eric Kirsi 135216 IAPB VOXEL GAME ENGINE USING BLENDER GAME ENGINE Bachelor's thesis Supervisor: Jaagup Irve Master of Sciences Software Engineer Tallinn 2016 TALLINNA TEHNIKAÜLIKOOL Infotehnoloogia teaduskond Arvutiteaduse instituut ITV40LT Fred-Eric Kirsi 135216 IAPB VOXEL-MÄNGUMOOTOR BLENDER GAME ENGINE ABIL Bakalaureusetöö Juhendaja: Jaagup Irve Magister Tarkvarainsener Tallinn 2016 Author’s declaration of originality I hereby certify that I am the sole author of this thesis. All the used materials, references to the literature and the work of others have been referred to. This thesis has not been presented for examination anywhere else. Author: Fred-Eric Kirsi 23.05.2016 3 Abstract The objective of this thesis is to study various aspects of developing a voxel style game and in the process create a prototype engine to evaluate the possibilities and performance of the algorithms. The algorithms evaluated are Greedy Meshing, A* search, Goal Oriented Action Planning (GOAP), Finite state machine (FSM), and Perlin noise. The used 3d engine is the Bender game engine. The scripting is done in the Python coding language. The result of this work is a working voxel game prototype, where the agents are able to navigate, manipulate, and make decisions in real time. The aim of the prototype is to quantize the performance and feature experience on for the user. The work is divided into four main stages. Firstly evaluation of the work environment and its limitations. Secondly the development and optimization of the voxel implementation. Thirdly the creation and comparison of different pathing implementations.
    [Show full text]
  • Voxel Map for Visual SLAM
    This paper has been accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA), Paris, 2020. c IEEE Voxel Map for Visual SLAM Manasi Muglikar, Zichao Zhang and Davide Scaramuzza Abstract— In modern visual SLAM systems, it is a stan- Geometry-awareness dard practice to retrieve potential candidate map points from [Newcombe11] overlapping keyframes for further feature matching or direct [Whelan15] Optimal tracking. In this work, we argue that keyframes are not the [Engel14] optimal choice for this task, due to several inherent limitations, Dense representation such as weak geometric reasoning and poor scalability. We propose a voxel-map representation to efficiently retrieve map [Kaess15] This paper points for visual SLAM. In particular, we organize the map [Nardi19] [Rosinol19] points in a regular voxel grid. Visible points from a camera pose Geometric primitives are queried by sampling the camera frustum in a raycasting [Forster17] manner, which can be done in constant time using an efficient [Klein07] [Mur-Artal15] voxel hashing method. Compared with keyframes, the retrieved Sparse keyframes points using our method are geometrically guaranteed to Efficiency fall in the camera field-of-view, and occluded points can be identified and removed to a certain extend. This method also Fig. 1: The optimal SLAM systems should be efficient and have geometrical naturally scales up to large scenes and complicated multi- understanding of the map (denoted by the golden star). Direct methods (red camera configurations. Experimental results show that our dots) associate each keyframe with a semi-dense depth map. They have voxel map representation is as efficient as a keyframe map more scene information but are not efficient.
    [Show full text]
  • Future Graphics in Games
    Future graphics in games Anton Kaplanyan Lead researcher Cevat Yerli Crytek CEO Agenda • The history: Crytek GmbH • Current graphics technologies • Stereoscopic rendering • Current graphics challenges • Graphics of the future • Graphics technologies of the future • Server-side rendering • Hardware challenges • Perception-driven graphics The Past - Part 1 • March 2001 till March 2004 • Development of Far Cry • Development of CryEngine 1 • Approach: A naïve, but successful push for contrasts, by insisting on opposites to industry. size, quality, detail, brightness • First right investment into tools - WYSIWYPlay Past - Part 1: CryEngine 1 • Polybump (2001) • NormalMap extraction from High-Res Geometry • First „Per Pixel Shading“ & HDR Engine • For Lights, Shadows & Materials • High Dynamic Range • Long view distances & detailed vistas • Terrain featured unique base-texturing • High quality close ranges • High fidelity physics & AI • It took 3 years, avg 20 R&D Engineers CryENGINE 2 The Past – Part 2 – CryENGINE 2 • April 2004 till November 2007 • Development of Crysis • Development of CryEngine 2 • Approach: Photorealism meets interactivity! • Typically mutual exclusive directions • Realtime productivity with WYSIWYPlay • Extremely challenging, but successful CryEngine 2 - Way to Photorealism The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still appears, you may have to delete the image and then insert it again. The Past - Part 2: CryEngine 2 • CGI Quality Lighting & Shading • Life-like characters • Scaleable architecture in • Both content and pipeline • Technologies and assets allow various configurations to be maxed out! • Crysis shipped Nov 2007, works on PCs of 2004 till today and for future..
    [Show full text]
  • GPU Volume Voxelization Exploration of the Performance Characteristics of Different GPU-Based Implementations
    EXAMENSARBETE INOM TEKNIK, GRUNDNIVÅ, 15 HP STOCKHOLM, SVERIGE 2019 GPU Volume Voxelization Exploration of the performance characteristics of different GPU-based implementations GRIGORY GLUKHOV ALEKSANDRA SOLTAN KTH SKOLAN FÖR ELEKTROTEKNIK OCH DATAVETENSKAP GPU Volume Voxelization Exploration of the performance characteris- tics of different GPU-based implementations Aleksandra Soltan Grigory Glukhov Examiner Johan Montelius Supervisor Thomas Sjöland A thesis presented for the degree of Bachelor of Information and Communication Technology KTH Royal Institute of Technology School of Electrical Engineering and Computer Science SE-100 44 Stockholm, Sweden June 2019 page intentionally left blank 1 Abstract In recent years, voxel-based modelling has seen a reintroduction to computer game development through massive graphics hardware improvements. Never- theless, polygons continue to be the default building block of 3D objects, intro- ducing a need for the transformation of polygon meshes into voxel-based models; this process is known as voxelization. Efficient voxelization algorithms take ad- vantage of the flexibility and control offered by modern, programmable GPU pipelines. However, the variability in possible approaches poses the question of how different GPU-based implementations affect voxelization performance. This thesis explores the impact of GPU-based improvements by comparing four different implementations of a solid voxelization algorithm. The implemen- tations include a naive transition from the CPU to the GPU, a non-branching execution path approach, data pre-processing, and a combination of the two previous approaches. Benchmarking experiments run on four, standard polygo- nal models and three graphics cards (NVIDIA and AMD) provide runtime and memory usage data for each implementation. A comparative analysis is per- formed on the basis of this data to determine the performance impact of the GPU-based adjustments to the voxelization algorithm implementation.
    [Show full text]
  • Modifying Compressed Voxel Representations
    DOI: 10.1111/cgf.13916 EUROGRAPHICS 2020 / U. Assarsson and D. Panozzo Volume 39 (2020), Number 2 (Guest Editors) Interactively Modifying Compressed Sparse Voxel Representations V. Careil1;2 , M. Billeter1 , E. Eisemann1 y 1Delft University of Technology, The Netherlands 2Université de Rennes, France Figure 1: To test and demonstrate our method for editing large sparse voxel geometries, we have implemented an interactive prototype application with support for interactive editing operations. The left image shows a copy operation in the Epic Citadel scene, voxelized at a resolution of (128k)3. The statue inside this building contains about 170k voxels. The middle image shows larger scale edits, copying an entire building (order of 80M voxels). The right images illustrate tools to add and delete voxels, as well as paint voxel color attributes. The bottom right image uses the San Miguel model, voxelized at (64k)3, where we first solidified, then carved a hole in a column. Abstract Voxels are a popular choice to encode complex geometry. Their regularity makes updates easy and enables random retrieval of values. The main limitation lies in the poor scaling with respect to resolution. Sparse voxel DAGs (Directed Acyclic Graphs) overcome this hurdle and offer high-resolution representations for real-time rendering but only handle static data. We introduce a novel data structure to enable interactive modifications of such compressed voxel geometry without requiring de- and recompression. Besides binary data to encode geometry, it also supports compressed attributes (e.g., color). We illustrate the usefulness of our representation via an interactive large-scale voxel editor (supporting carving, filling, copying, and painting).
    [Show full text]
  • Voxel Size Calibration for High-Resolution CT
    10th Conference on Industrial Computed Tomography, Wels, Austria (iCT 2020), www.ict-conference.com/2020 Voxel Size Calibration for High-resolution CT 2 1 Marek Zemek1, Pavel Blažek1, Jan Šrámek , Jakub Šalplachta , Tomáš Zikmund1, Petr Klapetek1,2, Yoshihiro Takeda3, Kazuhiko Omote3, Jozef Kaiser1 1Central European Institute of Technology, Brno University of Technology, Brno, Czech Republic, e-mail: [email protected], [email protected], [email protected], [email protected], [email protected] 2Czech Metrology Institute, Department of Primary Nanometrology and Technical Length, Brno, Czech Republic, e-mail: [email protected], [email protected] 3Rigaku Corporation, 3-9-12, Matsubara-cho, Akishima-shi, Tokyo, 196-8666, Japan, e-mail: [email protected], [email protected] http://www.ndt.net/?id=25109 Abstract In cone-beam X-ray computed tomography (CT), distances between the source, object, and detector influence the visual fidelity and voxel size of a reconstructed volume. Calibration using reference objects is an appropriate tool for preventing errors in the estimates of these distances. There is, however, a lack of such objects for high-resolution systems with a small field of view (FoV). In this work, we propose a method to measure the distances mentioned above, improving the determination of voxel size. We use a custom reference object suitable for a FoV of around one millimeter. Many approaches have been developed for this calibration task and discussed in the literature, but none apply to CT scanners with a small FoV and a cone-beam magnification More info about this article: close to one.
    [Show full text]
  • Evaluation of Super-Voxel Methods for Early Video Processing
    Evaluation of Super-Voxel Methods for Early Video Processing Chenliang Xu and Jason J. Corso Computer Science and Engineering, SUNY at Buffalo fchenlian,[email protected] Abstract Suite of 3D Supervoxel Segmentations SWA GB GBH Mean Shift Nyström Supervoxel segmentation has strong potential to be in- corporated into early video analysis as superpixel segmen- tation has in image analysis. However, there are many plausible supervoxel methods and little understanding as to when and where each is most appropriate. Indeed, we are not aware of a single comparative study on supervoxel seg- mentation. To that end, we study five supervoxel algorithms in the context of what we consider to be a good supervoxel: namely, spatiotemporal uniformity, object/region boundary detection, region compression and parsimony. For the eval- Benchmark Videos 3D Supervoxel Metrics uation we propose a comprehensive suite of 3D volumetric Figure 1. We comparatively evaluate five supervoxel methods on quality metrics to measure these desirable supervoxel char- 3D volumetric metrics that measure various desirable characteris- acteristics. We use three benchmark video data sets with tics of the supervoxels, e.g., boundary detection. a variety of content-types and varying amounts of human annotations. Our findings have led us to conclusive evi- dence that the hierarchical graph-based and segmentation perpixels have for image analysis. To that end, we perform by weighted aggregation methods perform best and almost a thorough comparative evaluation of five supervoxel meth- equally-well on nearly all the metrics and are the methods ods; note that none of these methods had been proposed of choice given our proposed assumptions.
    [Show full text]