Distance, depth, and 3D shape Basic distinctions cues • Types of depth cues • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ – Monocular vs. binocular atmospheric perspective, linear perspective, height – Pictorial vs. movement within image, texture gradient, contour – Physiological • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Depth cue information • Motion cues: motion parallax, kinetic depth effect, – What is the information? dynamic occlusion – How could one compute depth from it? • Binocular cues: convergence, stereopsis/binocular – Do we compute depth from it? disparity: crossed vs. uncrossed disparity, random- dot stereogram and the correspondence problem, – What is learned: ordinal, relative, absolute depth, fusion and rivalry, neural coding depth ambiguities • Cue combination Definitions Distance, depth, and 3D shape cues • Spatial vision (2D) vs. Space perception (3D) • Pictorial depth cues: familiar size, relative size, • Distance: Egocentric distance, distance from the [brightness], occlusion, shading and shadows, aerial/ observer to the object atmospheric perspective, linear perspective, height • Depth: Relative distance, e.g., distance one within image, texture gradient, contour object is in front of another or in front of a • Other static, monocular cues: accommodation, blur, background [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, • Surface Orientation: Slant (how much) and tilt dynamic occlusion (which way) • Binocular cues: convergence, stereopsis/binocular • Shape: Intrinsic to an object, independent of disparity viewpoint • Cue combination Epstein (1965) familiar size Monocular depth cues experiment Retinal projection depends on size and distance How far away is the coin? 1 Relative size as a cue to depth Relative size as a cue to depth Occlusion as a cue to depth Shading, reflection, and illumination illumination occlusion reflection shading Shading – assumption of light-from-above Shading (flip the photo upside-down) 2 Cast Shadows Dynamic Cast Shadows Shading and contour Aerial/Atmospheric Perspective Geometry of Linear Perspective Linear perspective Retinal projection depends on size and distance: Size in the world (e.g., in meters) is proportional to size in the retinal image (in degrees) times the distance to the object 3 Ames room Size constancy Ames room The Ames Room Texture Height Within the Image 1. Density 2. Foreshortening 3. Size 4 Distance, depth, and 3D shape Monocular Physiological Cues cues • Accommodation – estimate depth based • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ on state of accommodation (lens shape) atmospheric perspective, linear perspective, height required to bring object into focus within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, • Blur – objects that are further or closer [astigmatic blur, chromatic aberration] than the accommodative distance are • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion increasingly blur • Binocular cues: convergence, stereopsis/binocular • Astigmatic blur disparity • Cue combination • Chromatic aberration Distance, depth, and 3D shape Motion Parallax cues • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ atmospheric perspective, linear perspective, height within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion • Binocular cues: convergence, stereopsis/binocular disparity • Cue combination The Kinetic Depth Effect Dynamic (Kinetic) Occlusion 5 Distance, depth, and 3D shape Vergence Angle As One cues Binocular Source • Pictorial depth cues: familiar size, relative size, brightness, occlusion, shading and shadows, aerial/ atmospheric perspective, linear perspective, height within image, texture gradient, contour • Other static, monocular cues: accommodation, blur, [astigmatic blur, chromatic aberration] • Motion cues: motion parallax, kinetic depth effect, dynamic occlusion • Binocular cues: convergence, stereopsis/binocular disparity • Cue combination Vergence Angle As One Vergence Angle As One Binocular Source Binocular Source Vergence Angle As One Retinal Disparity as a Source of 3D Binocular Source Information Stereograms (anaglyphs) 6 Sir Charles Wheatstone’s Binocular Famous Invention disparity Disparity Uncrossed disparity Disparity Zero retinal disparity Crossed disparity Stereopsis (literally, “seeing solid”) - 3D vision resulting from slight differences in left and right eye images, arising because the two eyes view the world from slightly different perspectives Wheatstone stereoscope (c. 1838) Disparity - slight differences in positions of “features” in the left and right eyes’ views Fixation point zero disparity uncrossed disparity crossed disparity Red = right eye’s image 7 Red-green anaglyph Dual mirror stereoscope Also: Polarizing filter stereoscope, LCD shutter glasses, … Stereograms (“Magic Eye”) 8 Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface 9 Autostereogram surface Autostereogram surface Autostereogram surface Autostereogram surface The horopter: the locus of points Magnitude of Disparity Signifies in the world with zero disparity Depth Difference relative to fixation Gaze-normal (fronto-parallel) Geometric horopter plane 10 Disparity Magnitude Also Varies Stereoacuity: The smallest with Viewing Distance resolvable disparity d Stereopsis works only within 10 - 20 ft of the observer; once the visual axes are parallel, objects beyond the point of fixation provide no disparity. Also: disparities don’t imply depth; they first must be scaled for the viewing distance. d Under ideal conditions ≈ 5 arc seconds !!! You should be able to compute what the difference in distance between two objects would need to be in order to give rise to a disparity value this small. Remember: 1 deg = 60 arc min 1 arc min = 60 sec How to make a random-dot stereogram Left eye image Right eye image x A A y B B How Does the Brain “Solve” This “Correspondence” Problem? What dot in one eye's view goes with which dot in the other eye's view? left image right image Derive the solution that maximizes the overall number of matches - i.e., that is most globally consistent. 11 Disparity Distribution of disparity preferences selectivity in V1 Things that can happen with 2 eyes Binocular rivalry • Fusion • Suppression • Diplopia • Rivalry Binocular rivalry Binocular rivalry Right eye V1 L R Binocular L neurons R Left eye Ocular dominance columns 12 Predicted fMRI responses during rivalry Wave of activity in V1 follows with t Percept percept during binocular rivalry Local contrast Neural activity FMRI signal time to peak peak amplitude fMRI response Depth Cue Combination Depth Cue Combination • There are these dozens of depth cues we • The result of many recent studies: Humans are have reviewed typically “optimal” at depth cue combination • Yet, you typically have only a single • That is, they combine all the information from the various cues. percept of depth (slant, shape, distance, • Further, cues are given higher weight if they are …) for each image feature. How are the more reliable. cues combined? • This “reliability” can change from scene to scene, or even from place to place within a single scene. 13 .
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages13 Page
-
File Size-