Lecture 6 Depth Perception
Total Page:16
File Type:pdf, Size:1020Kb

Load more
Recommended publications
-
Binocular Disparity - Difference Between Two Retinal Images
Depth II + Motion I Lecture 13 (Chapters 6+8) Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Spring 2015 1 depth from focus: tilt shift photography Keith Loutit : tilt shift + time-lapse photography http://vimeo.com/keithloutit/videos 2 Monocular depth cues: Pictorial Non-Pictorial • occlusion • motion parallax relative size • accommodation shadow • • (“depth from focus”) • texture gradient • height in plane • linear perspective 3 • Binocular depth cue: A depth cue that relies on information from both eyes 4 Two Retinas Capture Different images 5 Finger-Sausage Illusion: 6 Pen Test: Hold a pen out at half arm’s length With the other hand, see how rapidly you can place the cap on the pen. First using two eyes, then with one eye closed 7 Binocular depth cues: 1. Vergence angle - angle between the eyes convergence divergence If you know the angles, you can deduce the distance 8 Binocular depth cues: 2. Binocular Disparity - difference between two retinal images Stereopsis - depth perception that results from binocular disparity information (This is what they’re offering in “3D movies”...) 9 10 Retinal images in left & right eyes Figuring out the depth from these two images is a challenging computational problem. (Can you reason it out?) 11 12 Horopter: circle of points that fall at zero disparity (i.e., they land on corresponding parts of the two retinas) A bit of geometric reasoning will convince you that this surface is a circle containing the fixation point and the two eyes 13 14 point with uncrossed disparity point with crossed -
Qt6hb7321d.Pdf
UC Berkeley UC Berkeley Previously Published Works Title Recovering stereo vision by squashing virtual bugs in a virtual reality environment. Permalink https://escholarship.org/uc/item/6hb7321d Journal Philosophical transactions of the Royal Society of London. Series B, Biological sciences, 371(1697) ISSN 0962-8436 Authors Vedamurthy, Indu Knill, David C Huang, Samuel J et al. Publication Date 2016-06-01 DOI 10.1098/rstb.2015.0264 Peer reviewed eScholarship.org Powered by the California Digital Library University of California Submitted to Phil. Trans. R. Soc. B - Issue Recovering stereo vision by squashing virtual bugs in a virtual reality environment. For ReviewJournal: Philosophical Only Transactions B Manuscript ID RSTB-2015-0264.R2 Article Type: Research Date Submitted by the Author: n/a Complete List of Authors: Vedamurthy, Indu; University of Rochester, Brain & Cognitive Sciences Knill, David; University of Rochester, Brain & Cognitive Sciences Huang, Sam; University of Rochester, Brain & Cognitive Sciences Yung, Amanda; University of Rochester, Brain & Cognitive Sciences Ding, Jian; University of California, Berkeley, Optometry Kwon, Oh-Sang; Ulsan National Institute of Science and Technology, School of Design & Human Engineering Bavelier, Daphne; University of Geneva, Faculty of Psychology and Education Sciences; University of Rochester, Brain & Cognitive Sciences Levi, Dennis; University of California, Berkeley, Optometry; Issue Code: Click <a href=http://rstb.royalsocietypublishing.org/site/misc/issue- 3DVIS codes.xhtml target=_new>here</a> to find the code for your issue.: Subject: Neuroscience < BIOLOGY Stereopsis, Strabismus, Amblyopia, Virtual Keywords: Reality, Perceptual learning, stereoblindness http://mc.manuscriptcentral.com/issue-ptrsb Page 1 of 29 Submitted to Phil. Trans. R. Soc. B - Issue Phil. -
Relationship Between Binocular Disparity and Motion Parallax in Surface Detection
Perception & Psychophysics 1997,59 (3),370-380 Relationship between binocular disparity and motion parallax in surface detection JESSICATURNERand MYRON L, BRAUNSTEIN University ofCalifornia, Irvine, California and GEORGEJ. ANDERSEN University ofColifornia, Riverside, California The ability to detect surfaces was studied in a multiple-cue condition in which binocular disparity and motion parallax could specify independent depth configurations, On trials on which binocular disparity and motion parallax were presented together, either binocular disparity or motion parallax could indicate a surface in one of two intervals; in the other interval, both sources indicated a vol ume of random points. Surface detection when the two sources of information were present and compatible was not betterthan detection in baseline conditions, in which only one source of informa tion was present. When binocular disparity and motion specified incompatible depths, observers' ability to detect a surface was severely impaired ifmotion indicated a surface but binocular disparity did not. Performance was not as severely degraded when binocular disparity indicated a surface and motion did not. This dominance of binocular disparity persisted in the presence of foreknowledge about which source of information would be relevant. The ability to detect three-dimensional (3-D) surfaces action ofdepth cues in determining the perceived shape has been studied previously using static stereo displays of objects has been a subject of considerable attention. (e.g., Uttal, 1985), motion parallax displays (Andersen, Biilthoffand Mallot (1988) discussed four ways in which 1996; Andersen & Wuestefeld, 1993), and structure depth information from different cues may be combined: from-motion (SFM) displays (Turner, Braunstein, & An accumulation, veto, disambiguation, and cooperation. -
3D Computational Modeling and Perceptual Analysis of Kinetic Depth Effects
Computational Visual Media DOI 10.1007/s41095-xxx-xxxx-x Vol. x, No. x, month year, xx–xx Research Article 3D Computational Modeling and Perceptual Analysis of Kinetic Depth Effects Meng-Yao Cui1, Shao-Ping Lu1( ), Miao Wang2, Yong-Liang Yang3, Yu-Kun Lai4, and Paul L. Rosin4 c The Author(s) 2020. This article is published with open access at Springerlink.com Abstract Humans have the ability to perceive 3D shapes from 2D projections of rotating 3D objects, which is called Kinetic Depth Effects. This process is based on a variety of visual cues such as lighting and shading effects. However, when such cues are weakened or missing, perception can become faulty, as demonstrated by the famous silhouette illusion example – the Spinning Dancer. Inspired by this, we establish objective and subjective evaluation models of rotated 3D objects by taking their projected 2D images as input. We investigate five different cues: ambient Fig. 1 The Spinning Dancer. Due to the lack of visual cues, it confuses humans luminance, shading, rotation speed, perspective, and color as to whether rotation is clockwise or counterclockwise. Here we show 3 of 34 difference between the objects and background. In the frames from the original animation [21]. Image courtesy of Nobuyuki Kayahara. objective evaluation model, we first apply 3D reconstruction algorithms to obtain an objective reconstruction quality 1 Introduction metric, and then use a quadratic stepwise regression analysis method to determine the weights among depth cues to The human perception mechanism of the 3D world has represent the reconstruction quality. In the subjective long been studied. -
Daily Mixed Visual Experience That Prevents Amblyopia in Cats Does Not Always Allow the Development of Good Binocular Depth Perception
Journal of Vision (2009) 9(5):22, 1–7 http://journalofvision.org/9/5/22/ 1 Daily mixed visual experience that prevents amblyopia in cats does not always allow the development of good binocular depth perception Department of Psychology, Dalhousie University, Donald E. Mitchell Halifax, NS, Canada Department of Psychology, Dalhousie University, Jan Kennie Halifax, NS, Canada WT Centre for Neuroimaging, UCL Institute of Neurology, D. Samuel Schwarzkopf London, UK School of Biosciences, Cardiff University, Frank Sengpiel Cardiff, Wales, UK Kittens reared with mixed daily visual input that consists of episodes of normal (binocular) exposure followed by abnormal (monocular) exposure can develop normal visual acuity in both eyes if the length of the former exposure exceeds a critical amount. However, later studies of the tuning of cells in primary visual cortex of animals reared in this manner revealed that their responses to interocular differences in phase were not reliable suggesting that their binocular depth perception may not be normal. We examined this possibility in 3 kittens reared with mixed daily visual exposure (2 hrs binocular vision followed by 5 hrs monocular exposure) that allowed development of normal visual acuity in both eyes. Measurements made of the threshold differences in depth that could be perceived under monocular and binocular viewing revealed a 10-fold superiority of binocular over monocular depth thresholds in one animal while the depth thresholds of the other two animals were poor and there was no binocular superiority. Thus, there was evidence that only one animal possessed stereopsis while the other two were likely stereoblind. While 2 hrs of daily binocular vision protected against the development of amblyopia, the poor outcome with respect to stereopsis points to the need for additional measures to promote binocular vision. -
CS 534: Computer Vision Stereo Imaging Outlines
CS 534 A. Elgammal Rutgers University CS 534: Computer Vision Stereo Imaging Ahmed Elgammal Dept of Computer Science Rutgers University CS 534 – Stereo Imaging - 1 Outlines • Depth Cues • Simple Stereo Geometry • Epipolar Geometry • Stereo correspondence problem • Algorithms CS 534 – Stereo Imaging - 2 1 CS 534 A. Elgammal Rutgers University Recovering the World From Images We know: • 2D Images are projections of 3D world. • A given image point is the projection of any world point on the line of sight • So how can we recover depth information CS 534 – Stereo Imaging - 3 Why to recover depth ? • Recover 3D structure, reconstruct 3D scene model, many computer graphics applications • Visual Robot Navigation • Aerial reconnaissance • Medical applications The Stanford Cart, H. Moravec, 1979. The INRIA Mobile Robot, 1990. CS 534 – Stereo Imaging - 4 2 CS 534 A. Elgammal Rutgers University CS 534 – Stereo Imaging - 5 CS 534 – Stereo Imaging - 6 3 CS 534 A. Elgammal Rutgers University Motion parallax CS 534 – Stereo Imaging - 7 Depth Cues • Monocular Cues – Occlusion – Interposition – Relative height: the object closer to the horizon is perceived as farther away, and the object further from the horizon is perceived as closer. – Familiar size: when an object is familiar to us, our brain compares the perceived size of the object to this expected size and thus acquires information about the distance of the object. – Texture Gradient: all surfaces have a texture, and as the surface goes into the distance, it becomes smoother and finer. – Shadows – Perspective – Focus • Motion Parallax (also Monocular) • Binocular Cues • In computer vision: large research on shape-from-X (should be called depth from X) CS 534 – Stereo Imaging - 8 4 CS 534 A. -
Interacting with Autostereograms
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/336204498 Interacting with Autostereograms Conference Paper · October 2019 DOI: 10.1145/3338286.3340141 CITATIONS READS 0 39 5 authors, including: William Delamare Pourang Irani Kochi University of Technology University of Manitoba 14 PUBLICATIONS 55 CITATIONS 184 PUBLICATIONS 2,641 CITATIONS SEE PROFILE SEE PROFILE Xiangshi Ren Kochi University of Technology 182 PUBLICATIONS 1,280 CITATIONS SEE PROFILE Some of the authors of this publication are also working on these related projects: Color Perception in Augmented Reality HMDs View project Collaboration Meets Interactive Spaces: A Springer Book View project All content following this page was uploaded by William Delamare on 21 October 2019. The user has requested enhancement of the downloaded file. Interacting with Autostereograms William Delamare∗ Junhyeok Kim Kochi University of Technology University of Waterloo Kochi, Japan Ontario, Canada University of Manitoba University of Manitoba Winnipeg, Canada Winnipeg, Canada [email protected] [email protected] Daichi Harada Pourang Irani Xiangshi Ren Kochi University of Technology University of Manitoba Kochi University of Technology Kochi, Japan Winnipeg, Canada Kochi, Japan [email protected] [email protected] [email protected] Figure 1: Illustrative examples using autostereograms. a) Password input. b) Wearable e-mail notification. c) Private space in collaborative conditions. d) 3D video game. e) Bar gamified special menu. Black elements represent the hidden 3D scene content. ABSTRACT practice. This learning effect transfers across display devices Autostereograms are 2D images that can reveal 3D content (smartphone to desktop screen). when viewed with a specific eye convergence, without using CCS CONCEPTS extra-apparatus. -
Acquired Monocular Vision Rehabilitation Program
Volume 44, Number 4, 2007 JRRDJRRD Pages 593–598 Journal of Rehabilitation Research & Development Acquired Monocular Vision Rehabilitation program Carolyn Ihrig, OD;1–2* Daniel P. Schaefer, MD, FACS3–4 1Buffalo Department of Veterans Affairs Medical Center, Buffalo, NY; 2Department of Ophthalmology, Division of Low Vision, 3Department of Otolaryngology, and 4Department of Ophthalmology, Division of Oculoplastic, Orbital Plastics and Reconstructive Surgery, Ira G. Ross Eye Institute, University at Buffalo School of Medicine and Biomedical Sciences, The State University of New York, Buffalo, NY Abstract—Existing programs concerning patients with low to help with the adjustment to the loss of depth percep- vision do not readily meet the needs of the patient with tion, training on safety and social concerns, and support- acquired monocular vision. This article illustrates the develop- ive counseling would have been beneficial [1]. ment, need, and benefits of an Acquired Monocular Vision Currently, low-vision rehabilitation programs are Rehabilitation evaluation and training program. This proposed available for the partially sighted, such as those suffering program will facilitate the organization of vision rehabilitation from macular degeneration, but they do not readily meet with eye care professionals and social caseworkers to help the needs of the patient with acquired monocular vision. patients cope with, as well as accept, and recognize obstacles they will face in transitioning suddenly to monocular vision. The survey just noted inspired the development of the Acquired Monocular Vision Rehabilitation (AMVR) evaluation and training program to guide and teach these specific skills to each patient with acquired monocular Key words: acquired monocular vision, acquired monocular vision. -
Recovery of 3-D Shape from Binocular Disparity and Structure from Motion
Perception & Psychophysics /993. 54 (2). /57-J(B Recovery of 3-D shape from binocular disparity and structure from motion JAMES S. TI'ITLE The Ohio State University, Columbus, Ohio and MYRON L. BRAUNSTEIN University of California, Irvine, California Four experiments were conducted to examine the integration of depth information from binocular stereopsis and structure from motion (SFM), using stereograms simulating transparent cylindri cal objects. We found that the judged depth increased when either rotational or translational motion was added to a display, but the increase was greater for rotating (SFM) displays. Judged depth decreased as texture element density increased for static and translating stereo displays, but it stayed relatively constant for rotating displays. This result indicates that SFM may facili tate stereo processing by helping to resolve the stereo correspondence problem. Overall, the re sults from these experiments provide evidence for a cooperative relationship betweel\..SFM and binocular disparity in the recovery of 3-D relationships from 2-D images. These findings indicate that the processing of depth information from SFM and binocular disparity is not strictly modu lar, and thus theories of combining visual information that assume strong modularity or indepen dence cannot accurately characterize all instances of depth perception from multiple sources. Human observers can perceive the 3-D shape and orien tion, both sources of information (along with many others) tation in depth of objects in a natural environment with im occur together in the natural environment. Thus, it is im pressive accuracy. Prior work demonstrates that informa portant to understand what interaction (if any) exists in the tion about shape and orientation can be recovered from visual processing of binocular disparity and SFM. -
The Morphological Basis for Binocular and ON/OFF Convergence in Tree Shrew Striate Cortex
The Journal of Neuroscience, April 1992, 12(4): 1319-I 334 The Morphological Basis for Binocular and ON/OFF Convergence in Tree Shrew Striate Cortex E. Christopher Muly and David Fitzpatrick Department of Neurobiology, Duke University Medical Center, Durham, North Carolina 27710 We used retrograde and anterograde transport methods and sponse),reflecting distinct inputs from corresponding popula- single-cell reconstructions to examine the projection from tions of retinal ganglion cells (Kuffler, 1953; Hubel and Wiesel, layer IV to supragranular layers in the tree shrew’s striate 1961). In contrast, single cortical neurons have both ON and cortex. We found that neurons in the ON and OFF subdivi- OFF responsesdue to the convergenceof information conveyed sions of layer IV (IVa and IVb, respectively) have overlapping by ON and OFF center LGN neurons(Hubel and Wiesel, 1962; terminal fields throughout layers II and Ill. Despite their over- Schiller, 1982; Tanaka, 1983; Sherk and Horton, 1984). lap, these projections are organized in a highly stratified, The generation of binocular, dual-sign (ON/OFF) responses mirror-symmetric fashion that respects the vertical position from monocular, single-sign(ON or OFF) responsesmust reflect of neurons within each sublayer. Neurons in the middle of a simple anatomical fact: the axons of four different classesof layer IV (lower IVa and upper IVb) project to layers Ills/b, II, LGN neurons (i.e., ipsilateral ON, ipsilateral OFF, contralateral and I; neurons located at the edges of layer IV (upper IVa ON, and contralateral OFF), or cortical neuronsthat convey the and lower IVb) project to the lower half of layer Illc; and information from theseLGN neurons, synapseon a singlepost- neurons in the middle of IVa and the middle of IVb project synaptic neuron. -
Stereopsis and Stereoblindness
Exp. Brain Res. 10, 380-388 (1970) Stereopsis and Stereoblindness WHITMAN RICHARDS Department of Psychology, Massachusetts Institute of Technology, Cambridge (USA) Received December 20, 1969 Summary. Psychophysical tests reveal three classes of wide-field disparity detectors in man, responding respectively to crossed (near), uncrossed (far), and zero disparities. The probability of lacking one of these classes of detectors is about 30% which means that 2.7% of the population possess no wide-field stereopsis in one hemisphere. This small percentage corresponds to the probability of squint among adults, suggesting that fusional mechanisms might be disrupted when stereopsis is absent in one hemisphere. Key Words: Stereopsis -- Squint -- Depth perception -- Visual perception Stereopsis was discovered in 1838, when Wheatstone invented the stereoscope. By presenting separate images to each eye, a three dimensional impression could be obtained from two pictures that differed only in the relative horizontal disparities of the components of the picture. Because the impression of depth from the two combined disparate images is unique and clearly not present when viewing each picture alone, the disparity cue to distance is presumably processed and interpreted by the visual system (Ogle, 1962). This conjecture by the psychophysicists has received recent support from microelectrode recordings, which show that a sizeable portion of the binocular units in the visual cortex of the cat are sensitive to horizontal disparities (Barlow et al., 1967; Pettigrew et al., 1968a). Thus, as expected, there in fact appears to be a physiological basis for stereopsis that involves the analysis of spatially disparate retinal signals from each eye. Without such an analysing mechanism, man would be unable to detect horizontal binocular disparities and would be "stereoblind". -
Algorithms for Single Image Random Dot Stereograms
Displaying 3D Images: Algorithms for Single Image Random Dot Stereograms Harold W. Thimbleby,† Stuart Inglis,‡ and Ian H. Witten§* Abstract This paper describes how to generate a single image which, when viewed in the appropriate way, appears to the brain as a 3D scene. The image is a stereogram composed of seemingly random dots. A new, simple and symmetric algorithm for generating such images from a solid model is given, along with the design parameters and their influence on the display. The algorithm improves on previously-described ones in several ways: it is symmetric and hence free from directional (right-to-left or left-to-right) bias, it corrects a slight distortion in the rendering of depth, it removes hidden parts of surfaces, and it also eliminates a type of artifact that we call an “echo”. Random dot stereograms have one remaining problem: difficulty of initial viewing. If a computer screen rather than paper is used for output, the problem can be ameliorated by shimmering, or time-multiplexing of pixel values. We also describe a simple computational technique for determining what is present in a stereogram so that, if viewing is difficult, one can ascertain what to look for. Keywords: Single image random dot stereograms, SIRDS, autostereograms, stereoscopic pictures, optical illusions † Department of Psychology, University of Stirling, Stirling, Scotland. Phone (+44) 786–467679; fax 786–467641; email [email protected] ‡ Department of Computer Science, University of Waikato, Hamilton, New Zealand. Phone (+64 7) 856–2889; fax 838–4155; email [email protected]. § Department of Computer Science, University of Waikato, Hamilton, New Zealand.