Basic Distinctions Definitions Epstein (1965) Familiar Size Experiment

Total Page:16

File Type:pdf, Size:1020Kb

Basic Distinctions Definitions Epstein (1965) Familiar Size Experiment Distance, depth, and 3D shape Basic distinctions cues • Types of depth cues • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ – Monocular vs. binocular atmospheric perspective, linear perspective, height – Pictorial vs. movement within image, texture gradient, contour – Physiological • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Depth cue information • Motion cues: motion parallax, kinetic depth effect, – What is the information? dynamic occlusion – How could one compute depth from it? • Binocular cues: convergence, stereopsis/binocular – Do we compute depth from it? disparity: crossed vs. uncrossed disparity, random- dot stereogram and the correspondence problem, – What is learned: ordinal, relative, absolute depth, fusion and rivalry, neural coding depth ambiguities • Cue combination Definitions Distance, depth, and 3D shape cues • Spatial vision (2D) vs. Space perception (3D) • Pictorial depth cues: familiar size, relative size, • Distance: Egocentric distance, distance from the [brightness], occlusion, shading and shadows, aerial/ observer to the object atmospheric perspective, linear perspective, height • Depth: Relative distance, e.g., distance one within image, texture gradient, contour object is in front of another or in front of a • Other static, monocular cues: accommodation, blur, background [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, • Surface Orientation: Slant (how much) and tilt dynamic occlusion (which way) • Binocular cues: convergence, stereopsis/binocular • Shape: Intrinsic to an object, independent of disparity viewpoint • Cue combination Epstein (1965) familiar size Monocular depth cues experiment Retinal projection depends on size and distance How far away is the coin? 1 Relative size as a cue to depth Relative size as a cue to depth Occlusion as a cue to depth Shading, reflection, and illumination illumination occlusion reflection shading Shading – assumption of light-from-above Shading (flip the photo upside-down) 2 Cast Shadows Dynamic Cast Shadows Shading and contour Aerial/Atmospheric Perspective Geometry of Linear Perspective Linear perspective Retinal projection depends on size and distance: Size in the world (e.g., in meters) is proportional to size in the retinal image (in degrees) times the distance to the object 3 Ames room Size constancy Ames room The Ames Room Texture Height Within the Image 1. Density 2. Foreshortening 3. Size 4 Distance, depth, and 3D shape Monocular Physiological Cues cues • Accommodation – estimate depth based • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ on state of accommodation (lens shape) atmospheric perspective, linear perspective, height required to bring object into focus within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, • Blur – objects that are further or closer [astigmatic blur, chromatic aberration] than the accommodative distance are • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion increasingly blur • Binocular cues: convergence, stereopsis/binocular • Astigmatic blur disparity • Cue combination • Chromatic aberration Distance, depth, and 3D shape Motion Parallax cues • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ atmospheric perspective, linear perspective, height within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion • Binocular cues: convergence, stereopsis/binocular disparity • Cue combination The Kinetic Depth Effect Dynamic (Kinetic) Occlusion 5 Distance, depth, and 3D shape Vergence Angle As One cues Binocular Source • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ atmospheric perspective, linear perspective, height within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion • Binocular cues: convergence, stereopsis/binocular disparity • Cue combination Vergence Angle As One Vergence Angle As One Binocular Source Binocular Source Vergence Angle As One Retinal Disparity as a Source of 3D Binocular Source Information Stereograms (anaglyphs) 6 Sir Charles Wheatstone’s Binocular Famous Invention disparity Disparity Uncrossed disparity Disparity Zero retinal disparity Crossed disparity Stereopsis (literally, “seeing solid”) - 3D vision resulting from slight differences in left and right eye images, arising because the two eyes view the world from slightly different perspectives Wheatstone stereoscope (c. 1838) Disparity - slight differences in positions of “features” in the left and right eyes’ views Fixation point zero disparity uncrossed disparity crossed disparity Red = right eye’s image 7 Red-green anaglyph Dual mirror stereoscope Also: Polarizing filter stereoscope, LCD shutter glasses, … Stereograms (“Magic Eye”) 8 Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface 9 Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface The horopter: the locus of points Magnitude of Disparity Signifies in the world with zero disparity Depth Difference relative to fixation Gaze-normal (fronto-parallel) Geometric horopter plane 10 Disparity Magnitude Also Varies Stereoacuity: The smallest with Viewing Distance resolvable disparity d Stereopsis works only within 10 - 20 ft of the observer; once the visual axes are parallel, objects beyond the point of fixation provide no disparity. Also: disparities don’t imply depth; they first must be scaled for the viewing distance. d Under ideal conditions ≈ 5 arc seconds !!! You should be able to compute what the difference in distance between two objects would need to be in order to give rise to a disparity value this small. Remember: 1 deg = 60 arc min 1 arc min = 60 sec How to make a random-dot stereogram Left eye image Right eye image x A A y B B How Does the Brain “Solve” This “Correspondence” Problem? What dot in one eye's view goes with which dot in the other eye's view? left image right image Derive the solution that maximizes the overall number of matches - i.e., that is most globally consistent. 11 Disparity Distribution of disparity preferences selectivity in V1 Things that can happen with 2 eyes Binocular rivalry • Fusion • Suppression • Diplopia • Rivalry Binocular rivalry Binocular rivalry Right eye V1 L R Binocular L neurons R Left eye Ocular dominance columns 12 Predicted fMRI responses during rivalry Wave of activity in V1 follows with t Percept percept during binocular rivalry Local contrast Neural activity FMRI signal time to peak peak amplitude fMRI response Depth Cue Combination Depth Cue Combination • There are these dozens of depth cues we • The result of many recent studies: Humans are have reviewed typically “optimal” at depth cue combination • Yet, you typically have only a single • That is, they combine all the information from the various cues. percept of depth (slant, shape, distance, • Further, cues are given higher weight if they are …) for each image feature. How are the more reliable. cues combined? • This “reliability” can change from scene to scene, or even from place to place within a single scene. 13 .
Recommended publications
  • Interacting with Autostereograms
    See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/336204498 Interacting with Autostereograms Conference Paper · October 2019 DOI: 10.1145/3338286.3340141 CITATIONS READS 0 39 5 authors, including: William Delamare Pourang Irani Kochi University of Technology University of Manitoba 14 PUBLICATIONS 55 CITATIONS 184 PUBLICATIONS 2,641 CITATIONS SEE PROFILE SEE PROFILE Xiangshi Ren Kochi University of Technology 182 PUBLICATIONS 1,280 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: Color Perception in Augmented Reality HMDs View project Collaboration Meets Interactive Spaces: A Springer Book View project All content following this page was uploaded by William Delamare on 21 October 2019. The user has requested enhancement of the downloaded file. Interacting with Autostereograms William Delamare∗ Junhyeok Kim Kochi University of Technology University of Waterloo Kochi, Japan Ontario, Canada University of Manitoba University of Manitoba Winnipeg, Canada Winnipeg, Canada [email protected] [email protected] Daichi Harada Pourang Irani Xiangshi Ren Kochi University of Technology University of Manitoba Kochi University of Technology Kochi, Japan Winnipeg, Canada Kochi, Japan [email protected] [email protected] [email protected] Figure 1: Illustrative examples using autostereograms. a) Password input. b) Wearable e-mail notification. c) Private space in collaborative conditions. d) 3D video game. e) Bar gamified special menu. Black elements represent the hidden 3D scene content. ABSTRACT practice. This learning effect transfers across display devices Autostereograms are 2D images that can reveal 3D content (smartphone to desktop screen). when viewed with a specific eye convergence, without using CCS CONCEPTS extra-apparatus.
    [Show full text]
  • Stereoscopic Label Placement
    Linköping Studies in Science and Technology Dissertations, No. 1293 Stereoscopic Label Placement Reducing Distraction and Ambiguity in Visually Cluttered Displays Stephen Daniel Peterson Department of Science and Technology Linköping University SE-601 74 Norrköping, Sweden Norrköping, 2009 Stereoscopic Label Placement: Reducing Distraction and Ambiguity in Visually Cluttered Displays Copyright © 2009 Stephen D. Peterson [email protected] Division of Visual Information Technology and Applications (VITA) Department of Science and Technology, Linköping University SE-601 74 Norrköping, Sweden ISBN 978-91-7393-469-5 ISSN 0345-7524 This thesis is available online through Linköping University Electronic Press: www.ep.liu.se Printed by LiU-Tryck, Linköping, Sweden 2009 Abstract With increasing information density and complexity, computer displays may become visually cluttered, adversely affecting overall usability. Text labels can significantly add to visual clutter in graphical user interfaces, but are generally kept legible through specific label placement algorithms that seek visual separation of labels and other ob- jects in the 2D view plane. This work studies an alternative approach: can overlap- ping labels be visually segregated by distributing them in stereoscopic depth? The fact that we have two forward-looking eyes yields stereoscopic disparity: each eye has a slightly different perspective on objects in the visual field. Disparity is used for depth perception by the human visual system, and is therefore also provided by stereoscopic 3D displays to produce a sense of depth. This work has shown that a stereoscopic label placement algorithm yields user per- formance comparable with existing algorithms that separate labels in the view plane. At the same time, such stereoscopic label placement is subjectively rated significantly less disturbing than traditional methods.
    [Show full text]
  • Algorithms for Single Image Random Dot Stereograms
    Displaying 3D Images: Algorithms for Single Image Random Dot Stereograms Harold W. Thimbleby,† Stuart Inglis,‡ and Ian H. Witten§* Abstract This paper describes how to generate a single image which, when viewed in the appropriate way, appears to the brain as a 3D scene. The image is a stereogram composed of seemingly random dots. A new, simple and symmetric algorithm for generating such images from a solid model is given, along with the design parameters and their influence on the display. The algorithm improves on previously-described ones in several ways: it is symmetric and hence free from directional (right-to-left or left-to-right) bias, it corrects a slight distortion in the rendering of depth, it removes hidden parts of surfaces, and it also eliminates a type of artifact that we call an “echo”. Random dot stereograms have one remaining problem: difficulty of initial viewing. If a computer screen rather than paper is used for output, the problem can be ameliorated by shimmering, or time-multiplexing of pixel values. We also describe a simple computational technique for determining what is present in a stereogram so that, if viewing is difficult, one can ascertain what to look for. Keywords: Single image random dot stereograms, SIRDS, autostereograms, stereoscopic pictures, optical illusions † Department of Psychology, University of Stirling, Stirling, Scotland. Phone (+44) 786–467679; fax 786–467641; email [email protected] ‡ Department of Computer Science, University of Waikato, Hamilton, New Zealand. Phone (+64 7) 856–2889; fax 838–4155; email [email protected]. § Department of Computer Science, University of Waikato, Hamilton, New Zealand.
    [Show full text]
  • Visual Secret Sharing Scheme with Autostereogram*
    Visual Secret Sharing Scheme with Autostereogram* Feng Yi, Daoshun Wang** and Yiqi Dai Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China Abstract. Visual secret sharing scheme (VSSS) is a secret sharing method which decodes the secret by using the contrast ability of the human visual system. Autostereogram is a single two dimensional (2D) image which becomes a virtual three dimensional (3D) image when viewed with proper eye convergence or divergence. Combing the two technologies via human vision, this paper presents a new visual secret sharing scheme called (k, n)-VSSS with autostereogram. In the scheme, each of the shares is an autostereogram. Stacking any k shares, the secret image is recovered visually without any equipment, but no secret information is obtained with less than k shares. Keywords: visual secret sharing scheme; visual cryptography; autostereogram 1. Introduction In 1979, Blakely and Shamir[1-2] independently invented a secret sharing scheme to construct robust key management scheme. A secret sharing scheme is a method of sharing a secret among a group of participants. In 1994, Naor and Shamir[3] firstly introduced visual secret sharing * Supported by National Natural Science Foundation of China (No. 90304014) ** E-mail address: [email protected] (D.S.Wang) 1 scheme in Eurocrypt’94’’ and constructed (k, n)-threshold visual secret sharing scheme which conceals the original data in n images called shares. The original data can be recovered from the overlap of any at least k shares through the human vision without any knowledge of cryptography or cryptographic computations. With the development of the field, Droste[4] provided a new (k, n)-VSSS algorithm and introduced a model to construct the (n, n)-combinational threshold scheme.
    [Show full text]
  • A Review and Selective Analysis of 3D Display Technologies for Anatomical Education
    University of Central Florida STARS Electronic Theses and Dissertations, 2004-2019 2018 A Review and Selective Analysis of 3D Display Technologies for Anatomical Education Matthew Hackett University of Central Florida Part of the Anatomy Commons Find similar works at: https://stars.library.ucf.edu/etd University of Central Florida Libraries http://library.ucf.edu This Doctoral Dissertation (Open Access) is brought to you for free and open access by STARS. It has been accepted for inclusion in Electronic Theses and Dissertations, 2004-2019 by an authorized administrator of STARS. For more information, please contact [email protected]. STARS Citation Hackett, Matthew, "A Review and Selective Analysis of 3D Display Technologies for Anatomical Education" (2018). Electronic Theses and Dissertations, 2004-2019. 6408. https://stars.library.ucf.edu/etd/6408 A REVIEW AND SELECTIVE ANALYSIS OF 3D DISPLAY TECHNOLOGIES FOR ANATOMICAL EDUCATION by: MATTHEW G. HACKETT BSE University of Central Florida 2007, MSE University of Florida 2009, MS University of Central Florida 2012 A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Modeling and Simulation program in the College of Engineering and Computer Science at the University of Central Florida Orlando, Florida Summer Term 2018 Major Professor: Michael Proctor ©2018 Matthew Hackett ii ABSTRACT The study of anatomy is complex and difficult for students in both graduate and undergraduate education. Researchers have attempted to improve anatomical education with the inclusion of three-dimensional visualization, with the prevailing finding that 3D is beneficial to students. However, there is limited research on the relative efficacy of different 3D modalities, including monoscopic, stereoscopic, and autostereoscopic displays.
    [Show full text]
  • Relative Importance of Binocular Disparity and Motion Parallax for Depth Estimation: a Computer Vision Approach
    remote sensing Article Relative Importance of Binocular Disparity and Motion Parallax for Depth Estimation: A Computer Vision Approach Mostafa Mansour 1,2 , Pavel Davidson 1,* , Oleg Stepanov 2 and Robert Piché 1 1 Faculty of Information Technology and Communication Sciences, Tampere University, 33720 Tampere, Finland 2 Department of Information and Navigation Systems, ITMO University, 197101 St. Petersburg, Russia * Correspondence: pavel.davidson@tuni.fi Received: 4 July 2019; Accepted: 20 August 2019; Published: 23 August 2019 Abstract: Binocular disparity and motion parallax are the most important cues for depth estimation in human and computer vision. Here, we present an experimental study to evaluate the accuracy of these two cues in depth estimation to stationary objects in a static environment. Depth estimation via binocular disparity is most commonly implemented using stereo vision, which uses images from two or more cameras to triangulate and estimate distances. We use a commercial stereo camera mounted on a wheeled robot to create a depth map of the environment. The sequence of images obtained by one of these two cameras as well as the camera motion parameters serve as the input to our motion parallax-based depth estimation algorithm. The measured camera motion parameters include translational and angular velocities. Reference distance to the tracked features is provided by a LiDAR. Overall, our results show that at short distances stereo vision is more accurate, but at large distances the combination of parallax and camera motion provide better depth estimation. Therefore, by combining the two cues, one obtains depth estimation with greater range than is possible using either cue individually.
    [Show full text]
  • Interactive Visual Analytics for Large-Scale Particle Simulations
    University of New Hampshire University of New Hampshire Scholars' Repository Master's Theses and Capstones Student Scholarship Winter 2016 Interactive Visual Analytics for Large-scale Particle Simulations Erol Aygar University of New Hampshire, Durham Follow this and additional works at: https://scholars.unh.edu/thesis Recommended Citation Aygar, Erol, "Interactive Visual Analytics for Large-scale Particle Simulations" (2016). Master's Theses and Capstones. 896. https://scholars.unh.edu/thesis/896 This Thesis is brought to you for free and open access by the Student Scholarship at University of New Hampshire Scholars' Repository. It has been accepted for inclusion in Master's Theses and Capstones by an authorized administrator of University of New Hampshire Scholars' Repository. For more information, please contact [email protected]. INTERACTIVE VISUAL ANALYTICS FOR LARGE-SCALE PARTICLE SIMULATIONS BY Erol Aygar MS, Bosphorus University, 2006 BS, Mimar Sinan Fine Arts University, 1999 THESIS Submitted to the University of New Hampshire in Partial Fulfillment of the Requirements for the Degree of Master of Science in Computer Science December, 2016 ALL RIGHTS RESERVED c 2016 Erol Aygar This thesis has been examined and approved in partial fulfillment of the requirements for the de- gree of Master of Science in Computer Science by: Thesis Director, Colin Ware Professor of Computer Science University of New Hampshire R. Daniel Bergeron Professor of Computer Science University of New Hampshire Thomas Butkiewicz Research Assistant Professor of Computer Science University of New Hampshire On November 2, 2016 Original approval signatures are on file with the University of New Hampshire Graduate School. Dedication This thesis is dedicated to the memory of my grandfather.
    [Show full text]
  • Construction of Autostereograms Taking Into Account Object Colors and Its Applications for Steganography
    Construction of Autostereograms Taking into Account Object Colors and its Applications for Steganography Yusuke Tsuda Yonghao Yue Tomoyuki Nishita The University of Tokyo ftsuday,yonghao,[email protected] Abstract Information on appearances of three-dimensional ob- jects are transmitted via the Internet, and displaying objects q plays an important role in a lot of areas such as movies and video games. An autostereogram is one of the ways to represent three- dimensional objects taking advantage of binocular paral- (a) SS-relation (b) OS-relation lax, by which depths of objects are perceived. In previous Figure 1. Relations on an autostereogram. methods, the colors of objects were ignored when construct- E and E indicate left and right eyes, re- ing autostereogram images. In this paper, we propose a L R spectively. (a): A pair of points on the screen method to construct autostereogram images taking into ac- and a single point on the object correspond count color variation of objects, such as shading. with each other. (b): A pair of points on the We also propose a technique to embed color informa- object and a single point on the screen cor- tion in autostereogram images. Using our technique, au- respond with each other. tostereogram images shown on a display change, and view- ers can enjoy perceiving three-dimensional object and its colors in two stages. Color information is embedded in we propose an approach to take into account color variation a monochrome autostereogram image, and colors appear of objects, such as shading. Our approach assign slightly when the correct color table is used.
    [Show full text]
  • Durham E-Theses
    Durham E-Theses Stereoscopic 3D Technologies for Accurate Depth Tasks: A Theoretical and Empirical Study FRONER, BARBARA How to cite: FRONER, BARBARA (2011) Stereoscopic 3D Technologies for Accurate Depth Tasks: A Theoretical and Empirical Study, Durham theses, Durham University. Available at Durham E-Theses Online: http://etheses.dur.ac.uk/3324/ Use policy The full-text may be used and/or reproduced, and given to third parties in any format or medium, without prior permission or charge, for personal research or study, educational, or not-for-prot purposes provided that: • a full bibliographic reference is made to the original source • a link is made to the metadata record in Durham E-Theses • the full-text is not changed in any way The full-text must not be sold in any format or medium without the formal permission of the copyright holders. Please consult the full Durham E-Theses policy for further details. Academic Support Oce, Durham University, University Oce, Old Elvet, Durham DH1 3HP e-mail: [email protected] Tel: +44 0191 334 6107 http://etheses.dur.ac.uk 2 Stereoscopic 3D Technologies for Accurate Depth Tasks: A Theoretical and Empirical Study by Barbara Froner A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy School of Engineering and Computing Sciences Durham University United Kingdom Copyright °c 2011 by Barbara Froner Abstract Stereoscopic 3D Technologies for Accurate Depth Tasks: A Theoretical and Empirical Study Barbara Froner In the last decade an increasing number of application ¯elds, including medicine, geoscience and bio-chemistry, have expressed a need to visualise and interact with data that are inherently three-dimensional.
    [Show full text]
  • SIRDS-Paper.Ps
    Displaying 3D Images: Algorithms for Single Image Random Dot Stereograms Harold W. Thimbleby,† Stuart Inglis,‡ and Ian H. Witten§* Abstract This paper describes how to generate a single image which, when viewed in the appropriate way, appears to the brain as a 3D scene. The image is a stereogram composed of seemingly random dots. A new, simple and symmetric algorithm for generating such images from a solid model is given, along with the design parameters and their influence on the display. The algorithm improves on previously-described ones in several ways: it is symmetric and hence free from directional (right-to-left or left-to-right) bias, it corrects a slight distortion in the rendering of depth, it removes hidden parts of surfaces, and it also eliminates a type of artifact that we call an “echo”. Random dot stereograms have one remaining problem: difficulty of initial viewing. If a computer screen rather than paper is used for output, the problem can be ameliorated by shimmering, or time-multiplexing of pixel values. We also describe a simple computational technique for determining what is present in a stereogram so that, if viewing is difficult, one can ascertain what to look for. Keywords: Single image random dot stereograms, SIRDS, autostereograms, stereoscopic pictures, optical illusions † Department of Psychology, University of Stirling, Stirling, Scotland. Phone (+44) 786–467679; fax 786–467641; email [email protected] ‡ Department of Computer Science, University of Waikato, Hamilton, New Zealand. Phone (+64 7) 856–2889; fax 838–4155; email [email protected]. § Department of Computer Science, University of Waikato, Hamilton, New Zealand.
    [Show full text]
  • B.S., Wright State University, 1992
    THE EFFECTS OF DEPTH AND ECCENTRICITY ON VISUAL SEARCH IN A DEPTH DISPLAY A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Engineering By GEORGE ANGELO REIS B.S., Wright State University, 1992 2009 Wright State University WRIGHT STATE UNIVERSITY SCHOOL OF GRADUATE STUDIES January 2, 2009 I HEREBY RECOMMEND THAT THE THESIS PREPARED UNDER MY SUPERVISION BY George Angelo Reis ENTITLED The Effects of Depth and Eccentricity on Visual Search in a Depth Display BE ACCEPTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science in Engineering. ______________________________ Yan Liu, Ph.D. Thesis Director ______________________________ S. Narayanan, Ph.D. Department Chair Committee on Final Examination _________________________ Yan Liu, Ph.D. _________________________ Paul Havig, Ph.D. _________________________ Misty Blue, Ph.D. _________________________ Joseph F. Thomas, Jr., Ph.D. Dean, School of Graduate Studies ABSTRACT Reis, George Angelo. M.S. Egr., Department of Biomedical, Industrial, and Human Factors Engineering. Wright State University, 2009. The Effects of Depth and Eccentricity on Visual Search in a Depth Display. The attribute of depth has been shown to provide saliency or conspicuity for items in a visual search task. Novel displays that present information in real physical depth offer potential benefits. Previous research has studied depth in visual search but depth was mostly realized without real physical separation of display elements. This study aimed to better understand the effects of real physical depth and eccentricity on visual search in a depth display. Through this understanding, we can appropriately utilize this new technology. An experiment was conducted to test four hypotheses regarding how depth, eccentricity, target feature, and screen location would affect target acquisition speed.
    [Show full text]
  • Psychophysics of Autostereogram Videos: Blur, Contrast and Repetition Period
    Psychophysics of Autostereogram Videos: Blur, Contrast and Repetition Period Georgios Papadimitriou I V N E R U S E I T H Y T O H F G E R D I N B U Master of Science in Artificial Intelligence School of Informatics University of Edinburgh 2010 Abstract Autostereograms are single image stereograms that take advantage of the binocular fusion and stereopsis of the human vision system. In this way, through autostere- ograms, we visualise in three dimensions objects or scenes that are embedded in two- dimensional images. In addition to (static) autostereograms there are autostereogram videos which are either videos that are created from animated depth masks of ob- jects/scenes or videos that are composed of sequences (frames) of static autostere- ograms. In the work presented in this thesis, we investigated the psychophysical as- pects of Random Dot Autostereogram (RDA) videos with respect to blur, contrast and repetition period of the random dots that constitute the repetitive patches inside a ran- dom dot autostereogram. The approach we followed focused on human performance data gathering by conducting experiments on human subjects that we tested for stere- opsis achievement and how fast it (stereopsis) was achieved. The stimuli we used were autostereogram videos of basic objects (cubes, tubes, pyramids, disks and pentagons) in which we varied the setting of one of the aforementioned features (blur, contrast etc.) each time while keeping the rest fixed. With respect to blur, our findings showed that there is an upper threshold of uniform blur radius at 33-35 pixels above which subjects were unable to achieve stereopsis.
    [Show full text]