Active Vision (In Humans)

Active Vision (In Humans)

Seeing Yarbus 67 Active Vision (in Humans) Christoph Rasche Justus Liebig Universität Germany • Saccades (rapid shifts in gaze position): 3-4 times/second • Attentional shifts: several/between saccades: fastest: 33ms • Also referred to as overt and covert attentional shifts Cognitive Influence More Famous Examples Yarbus 67 → endless amount of information (‘a picture is worth a 1000 words’) → scene representation allows precise saccades (top-down) Scan Path Reading Noton & Stark 71 Idea: viewer uses a memorized sequence of fixations to obtain object/scene understanding Today: limitedly accepted. Forward saccade: ca. 7-9 letters (adjusted to fontsize!) Reverse saccades Nevertheless: clusters of fixations! Landing point: avg. 3rd, 4th letter (→ precise saccadic target selection) → no explanations so far 1 Eye Trackers Other Types of Eye Movements • Saccades (so far) • Smooth pursuit: following a moving object • for stabilization during self motion: VOR: vestibulo-ocular reflex (sense of balance) OKR: optokinetic reflex (flow field) • modern ones based on video cameras → can combine to Nystagmus: • calibration required saw-tooth pattern (sitting in train and watching the landscape) • typical accuracy: 0.5 – 2.0 degrees • maximum accuracy: ~0.3 deg (head fixated) A Fixation Saccadic Flight Limitedly Ballistic Zoomed Fixational eye-movements: Microsaccades Drift Tremor Scientific debate: purposely or accidental? saccadic flight a trajectory. In rare cases: flight rerouting! → system is highly dynamic Timing Peripheral Acuity Ranges dead time degs acuity latency (50-100) • Fovea 2 eye field (150-250) • Parafovea 10 30% parafovea time • Eye field 40 10% fovea stimulus saccade saccade • Head field 180 onset onset offset 30-40ms • latency and dead time overlap: while this saccadic target is being computed, the subsequent saccade target is already programmed too! • Saccadic amplitudes [degs]: – max: ca. 30 → blazing flexibility 30% – object analyzing: ca. 5-10 10% – scene orienting: ca. 10-20 • Saccades required for survival? Not necessarily. 10 5 0 5 10 2 Recognition in Periphery Brain Areas Involved → isolated objects recognized better → all over! (short words are skipped during reading) The The more Illustrative Elaborate Version Version Berthoz Attentional Shifts Cueing [covert shifts] Posner • Preattentive [= distributed attention] – fast, automatic, massively parallel • Postattentive [= attentive,focused attn] –slow,serial • Definition of attention is difficult your’s is as good as anyone else’s • Metaphors – Spot-light –Zoom lens – Object versus region based → Spotlight 3 Visual Pop-Out Treisman Treisman’s Feature Integration Theory → parallel/serial distinction Criticism: many pop-outs possible. Saliency Map Itti & Koch Goal: to mimic a bottom-up driven visual search of attentional and saccadic shifts Band-Pass Filter: DOG (difference of Gaussians) • popular in psychology/neuroscience due to attempt to connect to visual search studies and neurophysiology • limitedly used in computer science due to time-intensive DOG filtering Information Maximization Zoom Lens Attention Bruce & Tsotsos Zelinsky maybe just that? 4 A Framework Still Unexplained For Saccade Generation Findlay & Gilchrist some still try: e.g. Fourier Transform Saccadic Target Selection Gaze During Driving Richards and Kaufmann vanishing point (tangent point) car following 3 observers: → preferred landing positions (relatively precise!) → also unexplained When reacting: fixation duration increases. But why? To exploit attentional shifts? Optic Flow The Challenge on Ground Gibson 50 optic flow + eye-/head movements → Focus of Expansion [FoE] can serve as a direct → potential masking by retinal flow pointer of where to steer to (may be so in the air) a) how is OF recovered (ER) b) OF used at all? 5 Retinal Flow Patterns… Another Wann & Land Potential Steering Mechanism Donges 78, Land&Horwood 95 • estimation of curvature by observing tangent point (apex of the bend) criticism: humans can not estimate → could be even useful/necessary for steering curvature so accurately • debate: both flows are used in some way visual system may use multiple cues • criticism: too noisy during driving Gaze During Daily Activities Land Action Control → sequential (almost continuous) access of pieces of representation (a stream?) [that’s why AI failed] • gaze ahead by 0.6 secs • action: small saccades • between actions: large saccades → gaze behavior as a window to frame representation Diagnostic Human-Computer Interaction Duchowski [exploiting gaze] • Neuroscience • The study of interaction between people (users) and – Attentional Neuroscience, Brain Imaging computers • Psychology – Reading, Scene Perception (Perception of Art/Film), • = MMI (man-machine interaction), = CHI Visual Search, Natural Tasks, Auditory Language Processing, Speech Production • Industrial Engineering, Human Factors • (Human Factors: emphasizes human behavior) – Aviation, Driving, Visual Inspection (food, X-rays) • (Ergonomics: emphasizes comfort of use) • Marketing/Advertising – Copy Testing, Print Advertising 6 However Gaze-Controllable Interface? Jacob 90 • Traditional input: • How to ‘emulate’ a click? keyboard, mouse → awkward at times → blinks, dwell time: Midas Touch Problem • Toward advanced user interfaces • Gaze (landing) is inaccurate: → include speech, gaze → difficult to select small icons quickly: • Dream: selection by gaze (since mid 80’s) Space Problem (requires an eye-tracker) - speed (gaze is always first) - comfort (reducing repetitive strain inj.) → not suitable as a pure output modality (thus, no wide-spread use so far) Eye-Typing for Disabled People Dasher [see wikipedia] letters flying in from the right side does not require high precision But why difficult? See experiments before…??? Gestures Drewes, Schmidt, 2007 Gaze-Assistive Systems • Gaze not used to steer but just recorded in order to make display adjustments • they are relative – no absolute position necessary, therefore: → no high accuracy of tracker required → no need for calibration → no Midas-Touch problem → still slow 7 MAGIC Pointing Zhai, Morimoto, Ihde 99 Large Screens • Principal: To place mouse cursor near gaze position. Cursor movement to be completed by hand movement. Problem: refresh slow • Goal: To save on duration for hand movement → gaze-contingent refresh: liberal conservative - high-resolution refresh at the center of fixation, - low-resolution refresh in the periphery •Manual and Gaze-Input Cascaded Gaze-Contingent Displays Recommended Reading • Active Vision: The Psychology of Looking and Seeing John M Findlay and Iain D Gilchrist • Vision Science: Photons to Fixating center Phenomenology Stephen E Palmer → research: is it comfortable? 8.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    8 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us