Visually Imbalanced Stereo Matching

Total Page:16

File Type:pdf, Size:1020Kb

Visually Imbalanced Stereo Matching Visually Imbalanced Stereo Matching Yicun Liu∗13† Jimmy Ren∗1 Jiawei Zhang1 Jianbo Liu12 Mude Lin1 SenseTime Research1 The Chinese University of Hong Kong2 Columbia University3 {rensijie,zhangjiawei,linmude}@sensetime.com1 [email protected] [email protected] Abstract Understanding of human vision system (HVS) has in- spired many computer vision algorithms. Stereo matching, which borrows the idea from human stereopsis, has been ex- tensively studied in the existing literature. However, scant attention has been drawn on a typical scenario where binoc- ular inputs are qualitatively different (e.g., high-res master camera and low-res slave camera in a dual-lens module). Recent advances in human optometry reveal the capabil- ity of the human visual system to maintain coarse stereop- sis under such visually imbalanced conditions. Bionically aroused, it is natural to question that: do stereo machines share the same capability? In this paper, we carry out a systematic comparison to investigate the effect of various imbalanced conditions on current popular stereo match- Figure 1. Illustration of visually imbalanced scenarios: (a) input left view (b) Input downgraded right view, from top to bottom: ing algorithms. We show that resembling the human vi- monocular blur, monocular blur with rectification error, monocular sual system, those algorithms can handle limited degrees of noise. (c) Disparity predicted from mainstream monocular depth monocular downgrading but also prone to collapses beyond (only left view as input)/stereo matching (stereo views as input) a certain threshold. To avoid such collapse, we propose a algorithms: from top to bottom are DORN [9], PSMNet [7] and solution to recover the stereopsis by a joint guided-view- CRL [31]. (d) Disparity generated from our proposed framework. restoration and stereo-reconstruction framework. We show the superiority of our framework on KITTI dataset and its performance [45]. extension on real-world applications. However, little attention has been drawn on the imbal- 1. Introduction anced condition between the stereo views. In many real- world cases, the visual quality of the left view and the right There have been remarkable signs of progress in under- view are not guaranteed to be matched. It is common for standing and mimicking the human vision system, and lots human stereopsis to suffer from different degrees of ani- of works are focusing on sensing the 3D structures sur- sometropia and astigmatism in binocular vision [6, 21]; or rounding us. In human’s visual brain, depth perception is in- for computer vision, the master and slave camera in a dual- terceded by a set of scale-variant spatial filters, where low- lens module to have different resolution, lens blur, imaging frequency stimuli establish coarse stereopsis, and then high- modality, noise tolerance, and rectification accuracy [43]. frequency visual cues escalate stereo acuity [5]. Early re- Not until recently, discovery in optometry reveals that it searchers in computer vision define this problem as search- is attainable for the human to maintain decent stereo acuity ing for corresponding pixels [3], edges [1], or patches [2] with imbalanced binocular signals. In fact, the monocular among different views. Taxonomy and benchmark were downgrading jeopardizes stereo acuity for high spatial fre- later constructed in [35]. With large datasets becoming quencies components, but low-frequency targets like struc- available, NN-based stereo algorithms exhibited superior tures are merely affected through a natural process called spatial frequency tuning [24]. With this unearthing in mind, ∗Equal contribution. Code will be available at github.com/ DandilionLau/Visually-Imbalanced-Stereo we tend to ask: are stereo machines able to handle qualita- †Work was done during internship at SenseTime Research. tively imbalanced inputs? 43212029 Figure 2. Intuition behind our proposed guided view synthesis framework: predicting the latent view solely based on single view is an i ill-posed problem, as there exists a bunch of plausible novel views IR with different disparities. However, with the geometric information in the downgraded right view Ir as a guide, the task is achievable. Even though high-frequency component of Ir is missing, rough object contour can still be inferred, as shown in our toy example. The contour provides a positional hint for the later displacement prediction. We design a systematic comparison to answer this ques- with real-world imbalanced factors. tion. In a controlled-variant setting, we test several ma- The main contributions of our work are threefold: jor monocular degradation effects on current mainstream stereo matching algorithms, including both NN-based and • We discover that constructing stereopsis from imbal- heuristics-based methods. By selectively increasing the anced views is not only feasible for human visual sys- corrupted levels of monocular downgrading factors, we tems but also achievable for computer vision. It is show that existing stereo matching frameworks are resis- the first work to consider the imbalanced condition for tant to mere degrees of monocular downgrading. Neverthe- stereo matching. less, stereo matching accuracy quickly degenerates as the • We explore the potential of current stereo machines on monocular downgrading increases. Similar to human stere- the task of visually imbalanced stereo matching and opsis, all tested algorithms are observed to ‘collapse’ be- examine the threshold of ‘stereo collapse’ on different yond specific downgrading threshold, leading to unreason- models and various imbalanced conditions. able disparity predictions. • We exploited a guided view synthesis framework to re- Ideally, there exist potential cures to alleviate such col- store the corrupted view and tackle scenarios beyond lapse, but each of them has a certain limit. One intuitive ‘stereo collapse’, which is even out of the capability of approach is to conduct depth estimation only based on the human stereopsis. high-quality monocular view. However, it cannot generalize well for unseen scenarios because it relies on prior knowl- edge of the object size and other physical attributes. An- 2. Related Work other approach is to conduct stereo matching on a lower Depth Perception There are diverse methods proposed resolution as a compromise to the information loss in the to predict depth from single view [26], stereo views [27], downgraded view, but low-res solutions cannot satisfy the and multiple views [19]. Among these variants, stereo demand of sharp disparity for tasks like portrait defocusing. matching is the currently popular way for low-cost depth Taking a detour in thinking, instead of directly predict- perception. For the traditional stereo matching setup, a rep- ing the disparity from imbalanced views, it is easier to first resentative taxonomy was proposed in [35]. Many com- restore the corrupted view using the high-quality textures in prehensive testbeds were later introduced for quantitative the main view and then conduct stereo matching. With the evaluation of different stereo frameworks [10, 35]. Analy- vague object contour observed in the corrupted view, human sis of subpixel calibration error and radiometric differences beings are pretty good at hallucinating the missing textures emerges in [15, 14]. Significant progress in single view by ‘moving’ the objects from the high-quality view to the depth estimation emerged in [9], but estimating depth from corresponding position in the corrupted view. The problem a single view remains hard considering the ill-posed body of predicting the dense disparity map between imbalanced of the problem. binocular inputs can be decomposed into two sub-problems: Human Stereopsis Early theoretical frameworks of view restoration guided by limited structural information in binocular disparity detection mechanisms were proposed in the corrupted view, and reconstruct stereopsis based on the [41, 11]. Neural models of frequency-variant spatial filters restored view. For the first sub-problem, we formulate it as in the human’s visual brain were then proposed and devel- a guided view-synthesis process and designed a structure- oped in [5, 4]. To better analyze the neural network sub- aware displacement prediction network to achieve that. Our serving stereopsis, a large portion of work has been con- approach achieves unprecedented performance and demon- ducted to characterize the functional attributes for these ba- strates impressive generalization capabilities on a dataset sic operators in the visual brain [32, 33, 38]. 43222030 Model All / Est 1X 2X 3X 5X 8X 10X 15X 20X SVS D1-bg 14.13% 15.66% 19.53% 24.20% 62.60% 79.49% 83.98% 89.11% 25.18% SGBM D1-fg 21.99% 22.35% 25.36% 32.32% 58.36% 80.16% 85.48% 90.39% 20.77% D1-all 15.88% 16.76% 20.49% 25.88% 61.89% 79.60% 84.23% 89.32% 24.44% D1-bg 3.09% 3.12% 3.31% 4.69% 11.40% 24.38% 89.51% 98.16% 25.18% DispNetC D1-fg 3.16% 3.21% 3.28% 4.23% 11.08% 24.72% 89.75% 98.35% 20.77% D1-all 3.10% 3.13% 3.30% 4.62% 11.35% 24.44% 89.55% 98.18% 24.44% D1-bg 3.02% 3.05% 3.25% 4.84% 12.24% 29.16% 94.47% 99.42% 25.18% CRL D1-fg 2.89% 3.00% 3.18% 4.41% 12.36% 28.76% 95.27% 99.66% 20.77% D1-all 3.00% 3.04% 3.24% 4.77% 12.26% 29.09% 94.60% 99.64% 24.44% D1-bg 2.36% 2.75% 5.63% 8.23% 20.86% 91.81% 99.32% 99.89% 25.18% PSMNet D1-fg 5.72% 5.78% 8.42% 10.25% 18.85% 100.00% 100.00% 100.00% 20.77% D1-all 2.92% 3.25% 6.21% 10.01% 20.52% 92.93% 99.91% 99.97% 24.44% Table 1. Performance of stereo algorithms under different levels of monocular blur: we mark the ‘turning point’ observed as red. D1- bg/D1-fg/D1-all refers to average percentage of outliers only over background/foreground/full regions.
Recommended publications
  • Binocular Disparity - Difference Between Two Retinal Images
    Depth II + Motion I Lecture 13 (Chapters 6+8) Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Spring 2015 1 depth from focus: tilt shift photography Keith Loutit : tilt shift + time-lapse photography http://vimeo.com/keithloutit/videos 2 Monocular depth cues: Pictorial Non-Pictorial • occlusion • motion parallax relative size • accommodation shadow • • (“depth from focus”) • texture gradient • height in plane • linear perspective 3 • Binocular depth cue: A depth cue that relies on information from both eyes 4 Two Retinas Capture Different images 5 Finger-Sausage Illusion: 6 Pen Test: Hold a pen out at half arm’s length With the other hand, see how rapidly you can place the cap on the pen. First using two eyes, then with one eye closed 7 Binocular depth cues: 1. Vergence angle - angle between the eyes convergence divergence If you know the angles, you can deduce the distance 8 Binocular depth cues: 2. Binocular Disparity - difference between two retinal images Stereopsis - depth perception that results from binocular disparity information (This is what they’re offering in “3D movies”...) 9 10 Retinal images in left & right eyes Figuring out the depth from these two images is a challenging computational problem. (Can you reason it out?) 11 12 Horopter: circle of points that fall at zero disparity (i.e., they land on corresponding parts of the two retinas) A bit of geometric reasoning will convince you that this surface is a circle containing the fixation point and the two eyes 13 14 point with uncrossed disparity point with crossed
    [Show full text]
  • Relationship Between Binocular Disparity and Motion Parallax in Surface Detection
    Perception & Psychophysics 1997,59 (3),370-380 Relationship between binocular disparity and motion parallax in surface detection JESSICATURNERand MYRON L, BRAUNSTEIN University ofCalifornia, Irvine, California and GEORGEJ. ANDERSEN University ofColifornia, Riverside, California The ability to detect surfaces was studied in a multiple-cue condition in which binocular disparity and motion parallax could specify independent depth configurations, On trials on which binocular disparity and motion parallax were presented together, either binocular disparity or motion parallax could indicate a surface in one of two intervals; in the other interval, both sources indicated a vol­ ume of random points. Surface detection when the two sources of information were present and compatible was not betterthan detection in baseline conditions, in which only one source of informa­ tion was present. When binocular disparity and motion specified incompatible depths, observers' ability to detect a surface was severely impaired ifmotion indicated a surface but binocular disparity did not. Performance was not as severely degraded when binocular disparity indicated a surface and motion did not. This dominance of binocular disparity persisted in the presence of foreknowledge about which source of information would be relevant. The ability to detect three-dimensional (3-D) surfaces action ofdepth cues in determining the perceived shape has been studied previously using static stereo displays of objects has been a subject of considerable attention. (e.g., Uttal, 1985), motion parallax displays (Andersen, Biilthoffand Mallot (1988) discussed four ways in which 1996; Andersen & Wuestefeld, 1993), and structure­ depth information from different cues may be combined: from-motion (SFM) displays (Turner, Braunstein, & An­ accumulation, veto, disambiguation, and cooperation.
    [Show full text]
  • Recovery of 3-D Shape from Binocular Disparity and Structure from Motion
    Perception & Psychophysics /993. 54 (2). /57-J(B Recovery of 3-D shape from binocular disparity and structure from motion JAMES S. TI'ITLE The Ohio State University, Columbus, Ohio and MYRON L. BRAUNSTEIN University of California, Irvine, California Four experiments were conducted to examine the integration of depth information from binocular stereopsis and structure from motion (SFM), using stereograms simulating transparent cylindri­ cal objects. We found that the judged depth increased when either rotational or translational motion was added to a display, but the increase was greater for rotating (SFM) displays. Judged depth decreased as texture element density increased for static and translating stereo displays, but it stayed relatively constant for rotating displays. This result indicates that SFM may facili­ tate stereo processing by helping to resolve the stereo correspondence problem. Overall, the re­ sults from these experiments provide evidence for a cooperative relationship betweel\..SFM and binocular disparity in the recovery of 3-D relationships from 2-D images. These findings indicate that the processing of depth information from SFM and binocular disparity is not strictly modu­ lar, and thus theories of combining visual information that assume strong modularity or indepen­ dence cannot accurately characterize all instances of depth perception from multiple sources. Human observers can perceive the 3-D shape and orien­ tion, both sources of information (along with many others) tation in depth of objects in a natural environment with im­ occur together in the natural environment. Thus, it is im­ pressive accuracy. Prior work demonstrates that informa­ portant to understand what interaction (if any) exists in the tion about shape and orientation can be recovered from visual processing of binocular disparity and SFM.
    [Show full text]
  • Binocular Vision
    BINOCULAR VISION Rahul Bhola, MD Pediatric Ophthalmology Fellow The University of Iowa Department of Ophthalmology & Visual Sciences posted Jan. 18, 2006, updated Jan. 23, 2006 Binocular vision is one of the hallmarks of the human race that has bestowed on it the supremacy in the hierarchy of the animal kingdom. It is an asset with normal alignment of the two eyes, but becomes a liability when the alignment is lost. Binocular Single Vision may be defined as the state of simultaneous vision, which is achieved by the coordinated use of both eyes, so that separate and slightly dissimilar images arising in each eye are appreciated as a single image by the process of fusion. Thus binocular vision implies fusion, the blending of sight from the two eyes to form a single percept. Binocular Single Vision can be: 1. Normal – Binocular Single vision can be classified as normal when it is bifoveal and there is no manifest deviation. 2. Anomalous - Binocular Single vision is anomalous when the images of the fixated object are projected from the fovea of one eye and an extrafoveal area of the other eye i.e. when the visual direction of the retinal elements has changed. A small manifest strabismus is therefore always present in anomalous Binocular Single vision. Normal Binocular Single vision requires: 1. Clear Visual Axis leading to a reasonably clear vision in both eyes 2. The ability of the retino-cortical elements to function in association with each other to promote the fusion of two slightly dissimilar images i.e. Sensory fusion. 3. The precise co-ordination of the two eyes for all direction of gazes, so that corresponding retino-cortical element are placed in a position to deal with two images i.e.
    [Show full text]
  • Motion Parallax Is Asymptotic to Binocular Disparity
    Motion Parallax is Asymptotic to Binocular Disparity by Keith Stroyan Mathematics Department University of Iowa Iowa City, Iowa Abstract Researchers especially beginning with (Rogers & Graham, 1982) have noticed important psychophysical and experimental similarities between the neurologically different motion parallax and stereopsis cues. Their quantitative analysis relied primarily on the "disparity equivalence" approximation. In this article we show that retinal motion from lateral translation satisfies a strong ("asymptotic") approximation to binocular disparity. This precise mathematical similarity is also practical in the sense that it applies at normal viewing distances. The approximation is an extension to peripheral vision of (Cormac & Fox's 1985) well-known non-trig central vision approximation for binocular disparity. We hope our simple algebraic formula will be useful in analyzing experiments outside central vision where less precise approximations have led to a number of quantitative errors in the vision literature. Introduction A translating observer viewing a rigid environment experiences motion parallax, the relative movement on the retina of objects in the scene. Previously we (Nawrot & Stroyan, 2009), (Stroyan & Nawrot, 2009) derived mathematical formulas for relative depth in terms of the ratio of the retinal motion rate over the smooth pursuit eye tracking rate called the motion/pursuit law. The need for inclusion of the extra-retinal information from the pursuit eye movement system for depth perception from motion
    [Show full text]
  • Depth Perception, Part II
    Depth Perception, part II Lecture 13 (Chapter 6) Jonathan Pillow Sensation & Perception (PSY 345 / NEU 325) Fall 2017 1 nice illusions video - car ad (2013) (anamorphosis, linear perspective, accidental viewpoints, shadows, depth/size illusions) https://www.youtube.com/watch?v=dNC0X76-QRI 2 Accommodation - “depth from focus” near far 3 Depth and scale estimation from accommodation “tilt shift photography” 4 Depth and scale estimation from accommodation “tilt shift photography” 5 Depth and scale estimation from accommodation “tilt shift photography” 6 Depth and scale estimation from accommodation “tilt shift photography” 7 Depth and scale estimation from accommodation “tilt shift photography” 8 more on tilt shift: Van Gogh http://www.mymodernmet.com/profiles/blogs/van-goghs-paintings-get 9 Tilt shift on Van Gogh paintings http://www.mymodernmet.com/profiles/blogs/van-goghs-paintings-get 10 11 12 countering the depth-from-focus cue 13 Monocular depth cues: Pictorial Non-Pictorial • occlusion • motion parallax relative size • accommodation shadow • • (“depth from focus”) • texture gradient • height in plane • linear perspective 14 • Binocular depth cue: A depth cue that relies on information from both eyes 15 Two Retinas Capture Different images 16 Finger-Sausage Illusion: 17 Pen Test: Hold a pen out at half arm’s length With the other hand, see how rapidly you can place the cap on the pen. First using two eyes, then with one eye closed 18 Binocular depth cues: 1. Vergence angle - angle between the eyes convergence divergence If you know the angles, you can deduce the distance 19 Binocular depth cues: 2. Binocular Disparity - difference between two retinal images Stereopsis - depth perception that results from binocular disparity information (This is what they’re offering in 3D movies…) 20 21 Retinal images in left & right eyes Figuring out the depth from these two images is a challenging computational problem.
    [Show full text]
  • Perceiving Depth and Size Depth
    PERCEIVING DEPTH AND SIZE Visual Perception DEPTH 1 Cue Approach Identifies information on the retina Correlates it with the depth of the scene Different cues Previous knowledge Slide 3 Aditi Majumder, UCI Depth Cues Oculomotor Monocular Binocular Slide 4 Aditi Majumder, UCI 2 Oculomotor Cues Convergence Inward movement for nearby objects Outward movements for farther objects Accommodation Tightening of muscles the change the shape of the lens Slide 5 Aditi Majumder, UCI Monocular Cues (Pictorial) Occlusion Atmospheric perspective Relative height Linear perspective Cast shadows Texture gradient Relative size Movement-based cues Familiar size Motion Parallax Deletion and accretion Slide 6 Aditi Majumder, UCI 3 Occlusion Slide 7 Aditi Majumder, UCI Relative Height Slide 8 Aditi Majumder, UCI 4 Conflicting Cues Slide 9 Aditi Majumder, UCI Cast Shadows Slide 10 Aditi Majumder, UCI 5 Relative Size Slide 11 Aditi Majumder, UCI Familiar Size Slide 12 Aditi Majumder, UCI 6 Atmospheric Perspective Slide 13 Aditi Majumder, UCI Linear Perspective Texture Gradient Slide 14 Aditi Majumder, UCI 7 Presence of Ground It helps in perception Removal of ground causes difficulty in perception Slide 15 Aditi Majumder, UCI Motion Parallax Further points have lesser parallax Closer points move more Slide 16 Aditi Majumder, UCI 8 Deletion and Accretion Slide 17 Aditi Majumder, UCI Binocular Cues Binocular disparity and stereopsis Corresponding points Correspondence problems Disparity information in the brain Slide 18 Aditi Majumder,
    [Show full text]
  • Disparity Channels in Early Vision
    11820 • The Journal of Neuroscience, October 31, 2007 • 27(44):11820–11831 Symposium Disparity Channels in Early Vision Anna W. Roe,1 Andrew J. Parker,2 Richard T. Born,3 and Gregory C. DeAngelis4 1Department of Psychology, Vanderbilt University, Nashville, Tennessee 37203, 2Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford OX1 3PT, United Kingdom, 3Department of Neurobiology, Harvard Medical School, Boston, Massachusetts 02115, and 4Brain and Cognitive Sciences, University of Rochester, Rochester, New York 14627 The past decade has seen a dramatic increase in our knowledge of the neural basis of stereopsis. New cortical areas have been found to represent binocular disparities, new representations of disparity information (e.g., relative disparity signals) have been uncovered, the first topographic maps of disparity have been measured, and the first causal links between neural activity and depth perception have been established. Equally exciting is the finding that training and experience affects how signals are channeled through different brain areas, a flexibility that may be crucial for learning, plasticity, and recovery of function. The collective efforts of several laboratories have established stereo vision as one of the most productive model systems for elucidating the neural basis of perception. Much remains to be learned about how the disparity signals that are initially encoded in primary visual cortex are routed to and processed by extrastriate areas to mediate the diverse capacities of three-dimensional vision that enhance our daily experience of the world. Key words: stereopsis; binocular disparity; MT primate; topography; choice probability Linking psychology and physiology in binocular vision ration for many others.
    [Show full text]
  • Depth Perception 0
    Depth Perception 0 Depth Perception Suggested reading:! ! Stephen E. Palmer (1999) Vision Science. ! Read JCA (2005) Early computational processing in binocular vision and depth perception. Progress in Biophysics and Molecular Biology 87: 77-108 Depth Perception 1 Depth Perception! Contents: Contents: • The problem of depth perception • • WasDepth is lerning cues ? • • ZellulärerPanum’s Mechanismus fusional area der LTP • • UnüberwachtesBinocular disparity Lernen • • ÜberwachtesRandom dot Lernen stereograms • • Delta-LernregelThe correspondence problem • • FehlerfunktionEarly models • • GradientenabstiegsverfahrenDisparity sensitive cells in V1 • • ReinforcementDisparity tuning Lernen curve • Binocular engergy model • The role of higher visual areas Depth Perception 2 The problem of depth perception! Our 3 dimensional environment gets reduced to two-dimensional images on the two retinae. Viewing Retinal geometry image The “lost” dimension is commonly called depth. As depth is crucial for almost all of our interaction with our environment and we perceive the 2D images as three dimensional the brain must try to estimate the depth from the retinal images as good as it can with the help of different cues and mechanisms. Depth Perception 3 Depth cues - monocular! Texture Accretion/Deletion Accommodation (monocular, optical) (monocular, ocular) Motion Parallax Convergence of Parallels (monocular, optical) (monocular, optical) Depth Perception 4 Depth cues - monocular ! Position relative to Horizon Familiar Size (monocular, optical) (monocular, optical) Relative Size Texture Gradients (monocular, optical) (monocular, optical) Depth Perception 5 Depth cues - monocular ! ! Edge Interpretation (monocular, optical) ! Shading and Shadows (monocular, optical) ! Aerial Perspective (monocular, optical) Depth Perception 6 Depth cues – binocular! Convergence Binocular Disparity (binocular, ocular) (binocular, optical) As the eyes are horizontally displaced we see the world from two slightly different viewpoints.
    [Show full text]
  • Binocular Eye Movements Are Adapted to the Natural Environment
    The Journal of Neuroscience, April 10, 2019 • 39(15):2877–2888 • 2877 Systems/Circuits Binocular Eye Movements Are Adapted to the Natural Environment X Agostino Gibaldi and Martin S. Banks Vision Science Program, School of Optometry University of California, Berkeley, Berkeley, California 94720 Humans and many animals make frequent saccades requiring coordinated movements of the eyes. When landing on the new fixation point, the eyes must converge accurately or double images will be perceived. We asked whether the visual system uses statistical regu- larities in the natural environment to aid eye alignment at the end of saccades. We measured the distribution of naturally occurring disparities in different parts of the visual field. The central tendency of the distributions was crossed (nearer than fixation) in the lower field and uncrossed (farther) in the upper field in male and female participants. It was uncrossed in the left and right fields. We also measuredhorizontalvergenceaftercompletionofvertical,horizontal,andobliquesaccades.Whentheeyesfirstlandedneartheeccentric target, vergence was quite consistent with the natural-disparity distribution. For example, when making an upward saccade, the eyes diverged to be aligned with the most probable uncrossed disparity in that part of the visual field. Likewise, when making a downward saccade, the eyes converged to enable alignment with crossed disparity in that part of the field. Our results show that rapid binocular eye movements are adapted to the statistics of the 3D environment, minimizing the need for large corrective vergence movements at the end of saccades. The results are relevant to the debate about whether eye movements are derived from separate saccadic and vergence neural commands that control both eyes or from separate monocular commands that control the eyes independently.
    [Show full text]
  • Depth Interval Estimates from Motion Parallax and Binocular Disparity Beyond Interaction Space Barbara Gillam University of New South Wales
    University of Wollongong Research Online Faculty of Health and Behavioural Sciences - Papers Faculty of Science, Medicine and Health (Archive) 2011 Depth interval estimates from motion parallax and binocular disparity beyond interaction space Barbara Gillam University of New South Wales Stephen Palmisano University of Wollongong, [email protected] Donovan Govan University of New South Wales Publication Details Gillam, B., Palmisano, S. A. & Govan, D. G. (2011). Depth interval estimates from motion parallax and binocular disparity beyond interaction space. Perception, 40 (1), 39-49. Research Online is the open access institutional repository for the University of Wollongong. For further information contact the UOW Library: [email protected] Depth interval estimates from motion parallax and binocular disparity beyond interaction space Abstract Static and dynamic observers provided binocular and monocular estimates of the depths between real objects lying well beyond interaction space. On each trial, pairs of LEDs were presented inside a dark railway tunnel. The nearest LED was always 40 m from the observer, with the depth separation between LED pairs ranging from 0 up to 248 m. Dynamic binocular viewing was found to produce the greatest (ie most veridical) estimates of depth magnitude, followed next by static binocular viewing, and then by dynamic monocular viewing. (No significant depth was seen with static monocular viewing.) We found evidence that both binocular and monocular dynamic estimates of depth were scaled for the observation distance when the ground plane and walls of the tunnel were visible up to the nearest LED. We conclude that both motion parallax and stereopsis provide useful long-distance depth information and that motion parallax information can enhance the degree of stereoscopic depth seen.
    [Show full text]
  • Stability of Binocular Depth Perception with Moving Head and Eyes RAYMOND Van EE,*T CASPER J
    Vision Res., Vol. 36, No. 23, pp. 3827-3842, 1996 Pergamon Copyright © 1996 Elsevier Science Ltd. All rights reserved PH: S0042-6989(96)00103-4 Printed in Great Britain 0042-6989/96 $15.00 + 0.00 Stability of Binocular Depth Perception with Moving Head and Eyes RAYMOND van EE,*t CASPER J. ERKELENS* Received 23 February 1995; in revised form 22 December 1995; in final form 26 March 1996 We systematically analyse the binocular disparity field under various eye, head and stimulus positions and orientations. From the literature we know that certain classes of disparity which involve the entire disparity field (such as those caused by horizontal lateral shift, differential rotation, horizontal scale and horizontal shear between the entire half-images of a stereogram) lead to relatively poor depth perception in the case of limited observation periods. These classes of disparity are found to be similar to the classes of disparities which are brought about by eye and head movements. Our analysis supports the suggestion that binocular depth perception is based primarily (for the first few hundred milliseconds) on classes of disparity that do not change as a result of ego-movement. Copyright © 1996 Elsevier Science Ltd. Binocular disparity Binocular vision Depth perception Stereopsis Robot vision INTRODUCTION movement, the disparity field before and after the The separation of the human eyes causes each eye to see a movements will also change. However, this time the disparity of the object and environment change together. disparate image of the outside world. Generally, it has In short, during eye and head movements, the images of been accepted that positional disparities are sufficient to the entire visual world are continuously changing on both generate a three-dimensional (3D) percept (e.g.
    [Show full text]