LD5655.V856 1984.M843.Pdf (13.98Mb)

Total Page:16

File Type:pdf, Size:1020Kb

Load more

Analyzing Perspective Line Drawings using Hypothesis Based Reasoning by Prasanna Govind Mulgaonkar Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of DOCTOROF PHILOSOPHY in Computer Science and Applications APPROVED: Linda G. Shapiro Robert M. Haralick J'ohn W. Roach Roger W. Ehrich Ezra A. Brown August, 1984 Blacksburg, Virginia ANALYSIS OF PERSPECTIVE LINE DRAWINGS USING HYPOTHESIS BASED REASONING by Prasanna Govind Mulgaonkar Committee Chairman: Dr. Linda G. Shapiro Computer Science (ABSTRACT) One of the important is~ues in the middle levels of com- puter vision is how world knowledge should be gradually in- serted into the reasoning process. In this dissertation, we develop a technique which uses hypothesis based reasoning to reason about perspective line drawings using only the const- raints supplied by the equations of perspective geometry. We show that the problem is NP complete, and that it can be solved using modular inference engines for propagating constraints over the set of world level entities. We also show that theorem proving techniques, with their attendant complexity, are not necessary because the real valued attri- butes of the world can be computed in closed form based only on the spatial relationships between world entities and mea- surements from the given image. ACKNOWLEDGEMENTS ...... For they were the best of times and they were the worst of times A dissertation is not created in isolation. It is the product of the interaction between the people surrounding the author, and the times and the places involved. I would like to thank everyone who in one way or the other contri- buted to making my time as a graduate student worthwhile and in some direct or indirect way helped make this dissertation what it is. Dr. Linda Shapiro, and Dr. Robert Haralick guided me throughout my graduate career. All that I have learnt about computer vision, is directly attributable to their time and patience. Dr. Shapiro supported my graduate education by continuous funding on her NSF grant which is what enabled me to survive. I also want to thank Dr. Roger Ehrich for serv- ing on both my masters and my doctoral advisory committees. Dr. John Roach, and Dr. Ezra Brown also provided valuable input to this work. I am deeply indebted to my wife Anjali to whom this work is dedicated. Without her constant encouragement and sup- port, I would not have been able to go through all the hear- taches and headaches that a PhD. requires. Amit, our three iii month old son did his share by making sure that I stayed awake on the nights that I needed to. Indeed this is a fa- mily PhD. rather than an individual accomplishment. To all the friends at the SDA Lab, who provided a friend- ly and stimulating working environment, I wish to say "Thank You". Especially, Ting-Chuen (King of Threshold) Pong, Greg (IBM Spy) Bushman, Greg (Mr. GIPSY) Fabregas, Tom Laffey, Trinh Ho, Einar Ostevold, and all the other folks helped make the last five years something I can look back on. Espe- cially Hyonam Joo, who helped write, debug and often fix on very short notice, the PROLOG interpreter which was used for all the experiments described in this dissertation. Life has facets other than just academic, and we owe a great deal to all our other friends in Blacksburg. Jyothi and Ramesh, and Nileema and Rajendra made extra-curricular life enjoyable and fun. Finally I want to thank my parents for instilling in me the spirit of hard work and striving for academic excel- lence. Their preparatory work led up to this moment. I would also like to acknowledge the help that my in-laws gave. They looked after our baby and our house in the crucial months leading up to my defense so that I could be free to concen- trate on my work and my wife could finish her master's de- gree without any distractions. iv TABLE OF CONTENTS ABSTRACT . ii ACKNOWLEDGEMENTS iii Chapter I. INTRODUCTION 1 II. HYPOTHESIS BASED REASONING FOR VISION. 7 Introduction ........... 7 Hypothesis Generation and Testing 9 The Optimal Hypothesis Problem ...... 11 NP Completeness of the Optimal Hypothesis Problem ..... 15 Formal Definition of CLP ....... 16 Reformulation of the CLP ...... 17 Equivalence of the Two Formulations 17 The Optimal Hypothesis Problem and the Hypothesis Decision Problem 18 Transformation of the CLP to HD 19 Determining Consistency of a Hypothesis . 24 Inference Engines ............... 26 Characterization of Inference Engines . 40 Predicate Calculus with Restrictions .... 41 Inference Engines as a Reasoning Path. 44 Searching for the Maximal Hypothesis. 48 Searching the Lattice. 50 The Search Space as a Binary Tree. 59 Forward Checking. 64 Logical Inconsistency and Completeness for Pruning . • • 68 Applications to Computer Vision . 70 III. CONTROL STRUCTURES FOR REASONING 72 Requirements for the Control. ..... 74 Controlling the Inference Engines . 77 Agenda Based Control Strategy .. 80 The Network Model for Module Control .... 83 Control of Inference Engines using Associative Tables . 88 Conclusion . 92 V IV. KNOWLEDGESOURCES FOR ANALYZING PERSPECTIVE IMAGES 94 Geometry of Perspective Projections of Lines . 97 Vanishing Points ..... 100 Vanishing Trace of a Plane 103 Computing Focal Length 107 Center of the Screen Coordinate System 111 Projection of Points .... 112 Projection of a Rectangle .. 115 Perspective Projection of a Circle 120 Perspective Projection of a Circle 121 Plane of the Circle Parallel to the Screen 124 Inverse of the projection ... 125 Circle on a Z = Constant Plane ..... 126 Inverse of the Projection .... 127 Circle on any Plane Perpendicular to the Screen . 128 Two Solutions for the Plane of the Circle 133 Inverse Projection Properties for General Circles . .. 139 Computing the Focal Length 140 Normal to the Plane of the Circle Given the Focal Length ......... 142 Normal to the Circle Given the Projection of the Center ..... 143 Comments on the Approach of Chu 145 Logical Consistency Rules 148 V. IMPLEMENTATIONAND EXPERIMENTALRESULTS 154 Introduction . 154 Input Data Structure ....... 155 Global Tables and Control of Engines 161 Control of Inference Engines 163 Experimental Results ..... 164 Analysis of Timing and Solutions Generated 164 VI. LITERATURE REVIEW. 192 Vision Systems .. 192 Brooks ACRONYMsystem. 192 Generalized Blob technique 195 The Hanson and Riseman VISIONS system 197 The Levine image segmentations system ... 199 Knowledge Based Systems 201 Iv'I.ACSYMA. 201 DENDRAL . 202 MYCIN ..... 203 PIP . 205 HEARSAY-II .... 205 vi Other Relevant Research 206 VII. CONCLUSION 210 BIBLIOGRAPHY 212 Appendix A. DESCRIPTION OF THE PROLOGAND RATFOR CODE. 217 The top level .... 217 The Backtracking Tree Search. 219 State recording .... 225 Application of Inference Engines. 226 Trace information ........ 232 Efficiency of the Search. 233 B. INPUT DATA FOR TEST EXAMPLES 235 Architectural Scene ..... 235 The Table Scene 250 Image of a rectangle 258 Fountain Image ....... 260 Image of a rectangular parallelipiped 264 Complex Image of the Table Scene. 270 Towers Image ...... 272 C. RESULTS FOR THE SAMPLE DATA 274 Results for the Architectural Scene 274 Results for the Table image . 313 Results for the Table image given the focal length . 323 Results for the rectangle in the Z plane. 328 Results for the fountain scene . 330 Results for the Rectangular Parallelipipid 335 Inconsistent Data . 340 Complex Table Image . 344 Results for the Tower Image 357 D. DATA FOR TIMING EXPERIMENTS ... 368 VITA 371 vii LIST OF FIGURES Figure paae 1. Lattice corresponding to the example .. 54 2. The algorithm for searching a lattice. 57 3. Tree representation for the example . 60 4. A subtree rooted at an internal node . 67 5. The camera coordinate system . 99 6. Computing the focal length. 109 7. The perspective projection of a rectangle 116 8. Rotation of the axes ... 129 9. The cone of rays from th,e origin . 135 10. The right elliptical cone 136 11. The two solutions for the ci~cle. 138 12. Constructing the projection of the center 144 13. An architectural scene 165 14. The drawing of a table scene. 166 15. The base of an ornate fountain 168 16. Number of calls without chunking. 170 17. Average CPU time in seconds 171 18. Efficiency of the process 172 19. Number of consistency checks. 173 20. Average CPU time in seconds per solution. 174 21. Efficiency of subset checking .... 175 22. Number of hypotheses vs. size 176 viii 23. Total CPU time vs. size of hypothesis 177 24. Average CPU time vs. size of hypothesis 178 25. Number of calls vs. size of hypothesis 179 26. Total CPU time vs. size of hypothesis 180 27. Average CPU time vs. size of hypothesis 181 28. Number of hypotheses by type. 182 29. Average CPU time by type ... 183 30. Number of hypotheses by type. 184 31. Average CPU time per hypothesis by type 185 32. Effect of chunking ....... 186 33. Effect of chunking on average CPU time .. 187 34. Effect of forward checking ...... 188 35. Average CPU time with forward checking 189 36. Digitized architectural scene 236 37. The Digitized Table Image 251 38. Synthetic image of a rectangle .. 259 39. Digitized image of a fountain 261 40. Image of a Rectangular Parallelipiped 265 41. Complicated line drawing of the table scene 271 42. Artificial image of two towers .. 273 43. Test data for timing experiments .. 369 ix Chapter I INTRODUCTION The central problem in computer vision, is one of produc- ing an automatic system capable of analyzing unconstrained pictorial information of the same complexity and with the same accuracy as the human visual system. This is currently an unsolved problem. However, we do know that the solution of this problem must involve many diverse knowledge sources and applications of techniques from a wide variety of dis- ciplines.
Recommended publications
  • Scheme and Syllabus

    Scheme and Syllabus

    Scheme and Syllabus III – VI Semester Master of Computer Applications (MCA) 2015-2018 P.E.S. COLLEGE OF ENGINEERING Mandya - 571401. Karnataka (An Autonomous Institution under VTU Belagavi) Grant-in-aid Institution Accredited by NBA, New Delhi Approved by AICTE, New Delhi. Ph: 08232 - 220043 Fax: 08232 - 222075 Web: www.pescemandya.org P.E.S COLLEGE OF ENGINEERING, MANDYA – 571 401 (An Autonomous Institution Under VTU Belgaum) MASTER OF COMPUTER APPLICATIONS (MCA) SCHEME OF TEACHING AND EXAMINATION 2015-18 III SEMESTER Examination Sl. Hrs/Week Course Code Course Title Credits Marks No L :T : P : H CIE SEE Total 1 P15MCA31 Computer Networks 4 : 0 : 0 : 4 4 50 50 100 2 P15MCA32 Programming with Java 3 : 2 : 0 : 5 4 50 50 100 3 P15MCA33 Software Engineering 4 : 0 : 0 : 4 4 50 50 100 4 P15MCA34 Computer Graphics 3 : 2 : 0 : 5 4 50 50 100 5 P15MCA35X Elective - I 4 : 0 : 0 : 4 4 50 50 100 6 P15MCA36 Network Lab 0 : 0 : 2 : 2 1 50 50 100 7 P15MCA37 Java Programming Lab 0 : 0 : 3 : 3 1.5 50 50 100 8 P15MCA38 Computer Graphics Lab 0 : 0 : 3 : 3 1.5 50 50 100 Aptitude and Reasoning 9 P15MHU301 Development-BEGINNER 2 : 0 : 0 : 2 1 50 50 100 (ARDB)* Total 32 25 450 450 900 L: Lecture, T: Tutorial, P: Practical, H: Total Hours; CIE: Continuous Internal Evaluation; SEE: Semester End Examinations * Taught by Humanities Science and Management Department ELECTIVES Sl. Course code Elective Group - I No. 1 P15MCA351 Accountancy and Financial Management 2 P15MCA352 System Software 3 P15MCA353 Basics of MIS & E-Commerce 4 P15MCA354 Graph Theory and Combinatorics 5 P15MCA355 Business Data Processing Using COBOL SYLLABUS 2015 – 2018 Page 1 P.E.S COLLEGE OF ENGINEERING, MANDYA – 571 401 (An Autonomous Institution Under VTU Belgaum) MASTER OF COMPUTER APPLICATIONS (MCA) SCHEME OF TEACHING AND EXAMINATION 2015-18 IV SEMESTER Examination Hrs/Week Sl.
  • EECS 490 DIGITAL IMAGE PROCESSING December 2004

    EECS 490 DIGITAL IMAGE PROCESSING December 2004

    EECS 490 DIGITAL IMAGE PROCESSING December 2004 SESSION 1 Document Deskewing Mike Adams Algorithm for Text Document De-Skewing Nattapon Chaimanonart Document Skew Detection Using the Hough Transformation Wen-Teng Chang Skew Correction for Text Document Using Fourier Domain Analysis Dmitriy Goldman Image De-Skewing Using Fourier Transform Issac Hirt Image Skew Detection and Correction using Morphology Ruchi Kothari Detection and Correction of Skew in Document Images Michael K. Lee Optical Character Recognition Preparation Using MATLAB Deng-Hung Liu Skew Correction for Document Images Using Gradient and Fourier Analysis Daniel Pendergast Multi-technique Algorithm for De-skewing and Text Bounding for OCR Applications Iouri Petriaev OCR Preparation: Design of an Algorithm for Document Analysis Using the Global Processing via Hough Transform Algorithm for Text Document De-Skewing Mike Adams Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH, Email: [email protected] ABSTRACT 2) Threshold the Sobel result to create a binary im- This paper presents a method for the skew correction of age. text documents in digital image format, using primarily the 3) Perform morphological ‘closing’ and ‘thickening’ Hough Transform. The Hough Transform provides a good so text is thinned out and resembles many points estimate of line angles in an image and this can be extended on a line. The Hough Transform is more accurate, to lines of text. Positive angles between 0 and 90 degrees if it operates on clear lines in the document im- are estimated correctly in both test images while negative age[1]. A pre processed image is shown below.
  • Cb D0e0f Sut5thgvyx Cthgv V

    Cb D0e0f Sut5thgvyx Cthgv V

    IMPROVEMENT OF ZONE CONTENT CLASSIFICATION BY USING BACKGROUND ANALYSIS ¡ Yalin Wang , Robert Haralick , and Ihsin T. Phillips ¢ Department of Electrical Engineering University of Washington Seattle, WA 98195 U.S.A. £ Department of Computer Science/Software Engineering Seattle University, Seattle, WA 98122 U.S.A. ¤ ylwang, haralick, [email protected] ¥ Abstract. This paper presents an improved zone content classification method. Motivated by our novel background-analysis-based table identification research, we added two new features to the feature vector from one previously published method [7]. The new features are the total area of large horizontal and large ver- tical blank blocks and the number of text glyphs in the zone. A binary decision tree is used to assign a zone class on the basis of its feature vector. The train- ing and testing data sets for the algorithm include ¦¨§ ©¨ images drawn from the UWCDROM-III document image database. The classifier is able to classify each given scientific and technical document zone into one of the nine classes, text ¦¨ classes (of font size ¦ pt and font size pt), math, table, halftone, map/drawing, ruling, logo, and others. The improved zone classification method ¨ ¨© ©¨ raised the accuracy rate to ¨ from and reduced the median false ¨ alarm rate to ¦ from . 1 Problem Statement " Let ! be a set of zone entities. Let be a set of content labels, such as text, table, * math, etc. The function #%$&!(')" associates each element of with a label. The ! / function +,$-!.'0/ specifies measurements
  • AIPR 2014: Democratization of Imagery Cosmos Club, Washington DC October 14-16, 2014

    AIPR 2014: Democratization of Imagery Cosmos Club, Washington DC October 14-16, 2014

    1 Applied Imagery Pattern Recognition Workshop Dedicated to facilitating the interchange of ideas between government, industry and academia in an elegant setting conducive to technical interchange across a broad range of disciplines 2014 IEEE Applied Imagery Pattern Recognition Workshop AIPR 2014: Democratization of Imagery Cosmos Club, Washington DC October 14-16, 2014 AIPR 2014 is financially sponsored by the IEEE Computer Society Technical sponsorship and support is provided by IEEE and the IEEE Computer Society Technical Committee on Pattern Analysis and Machine Intelligence The AIPR Committee thanks Cybernet Systems Corporation, Integrity Applications Incorporated and ObjectVideo Incorporated for their generous support of this year's workshop. 1 2 The AIPR Officers: Chairman: Neelam Gupta, ARL Deputy Chair: John Irvine, Draper Laboratory Program Chairs: Guna Seetharaman, AFRL/RI, Rome, NY Karl Walli, Col USAF Secretary: James Aanstoos, Mississippi State University Treasurer: Al Williams, Self Employed Engineering Consultant Associate Treasurer: Carlos Maraviglia, NRL Local Arrangements: Donald J. Gerson, Gerson Photography Publicity: Peter Costianes, AFRL, Emeritus Web Master: Charles J. Cohen, Cybernet External Support: Pete Doucette, Integrity Applications Incorporated Registration Chair: Jeff Kretsch, Raytheon BBN Technologies Proceedings Chair: Franklin Tanner, Hadron Industries Student Paper Award Murray Loew, GWU Chair: Paul Cerkez, DCS Corp. AIPR Committee Members: Jim Aanstoos, Mississippi State University Guna Seetharaman,
  • GC Bulletin 2020-21

    Bulletin 2020–21 Bulletin The Graduate Center The City University of New York 2020–21 Volume Forty-Three / NUMBER ONE 365 Fifth Avenue, New York, NY 10016-4309 www.gc.cuny.edu General: 1.212.817.7000 Admissions Office: 1.212.817.7470 (TDD users should call the New York Relay Center at 1.800.662.1220.) 2020–21 Bulletin, The Graduate Center, City University of New York NOTICE OF POSSIBLE CHANGES The City University of New York reserves the right, because of changing conditions, to make modifications of any nature in the academic programs and requirements of the University and its constituent colleges without advance notice. Tuition and fees set forth in this publication are similarly subject to change by the Board of Trustees of The City University of New York. The Uni- versity regrets any inconvenience this may cause. ACCREDITATION The City University of New York is registered by the New York State Department of Education: Office of Higher Education and the Professions, Cultural Education Center, Room 5B28, Albany, NY 12230; Telephone: 1.518.474.5851; http://www.nysed.gov/heds/IRPSL1.html. The Graduate Center has been accredited by the Commission on Higher Education of the Middle States As- sociation of Colleges and Schools since 1961, last reaffirmed in 2010. Seehttp://www.gc.cuny. edu/CUNY_GC/media/CUNY-Graduate-Center/PDF/Policies/General/Accreditation.pdf. 2 THE GRADUATE CENTER CALENDAR 2020–21 FALL 2020 July 31 (Fri.) .............................. Financial aid registration deadline. Last day for returning students to register full time in order to ensure fellowship payment during the first week of classes.
  • IAPR Newsletter, Pag. 34, ICPR 2016 Chalearn LAP

    IAPR Newsletter, Pag. 34, ICPR 2016 Chalearn LAP

    THE INTERNATIONAL ASSOCIATION FOR PATTERN RECOGNITION www.iapr.org Volume 39, Number 1 NewsletterFebruary 2017 Special Issue with highlights of the 23rd International Conference on Pattern Recognition IssueIN THIS Letter from the President Letter from the President Simone Marinai CALLS for PAPERS It is an honor for me to have IAPR event, in particular the ICPR Calls from IAPR Committees been elected President of the (International Conference on Pattern IAPR. In the next two years, I Recognition), a unique opportunity. Getting to Know... will have the privilege of working It is non-trivial to recall that the “I” Petia Radeva, IAPR Fellow with an exceptional Executive in IAPR stands for International. Committee: Massimo Tistarelli (1st IAPR...The Next Generation: Within the IAPR, scholars from a Vice President), Edwin Hancock Sungmin Eum wide range of countries freely take (2nd Vice President), Alexandra ICPR Highlights: on different responsibilities: holding Branzan Albu (Secretary), Apostolos From the General Chair offices, serving on committees, Antonacopoulos (Treasurer), and Traditional Dance of Mexico organizing events that take place in the Past President Ingela Nyström. Coffee Break/Lunch for Women nearly every area of the world. This is a great team, and I am sure Plenary Talk Abstracts that we will do our best to serve our With the admission of two new Invited Talk Abstracts association. members (Pakistan and Vietnam) Workshops/Tutorials/Contests there are now 49 member societies 2016 IAPR Fellows The IAPR has been my scientific represented on the Governing ICPR 2016 Awards family since I first participated in an Board. And, in 2018, the IAPR will 2016 Governing Board Meeting IAPR conference in 1995.
  • Imaging and Artificial Intelligence: Intersection and Synergy

    Imaging and Artificial Intelligence: Intersection and Synergy

    IEEE Applied Imagery Pattern Recognition Workshop Dedicated to facilitating the interchange of ideas between government, industry, and academia 45th AIPR Workshop Imaging and Artificial Intelligence: Intersection and Synergy Cosmos Club, Washington, D.C. October 18-20, 2016 AIPR 2016 is sponsored by the IEEE Computer Society Technical Committee on Pattern Analysis and Machine Intelligence with generous support from Cybernet Systems Corporation, Descartes Labs, Inc., and The Charles Stark Draper Laboratory, Inc. AIPR Workshop Committee Chairman Peter Doucette, U.S. Geological Survey Deputy Chair Daniela Moody, Los Alamos National Laboratory Program Chairs Murray H. Loew, George Washington University Kannappan Palaniappan, University of Missouri at Columbia Secretary Carlos Maraviglia, Naval Research Laboratory Treasurer Al Williams, Self-employed Engineer Local Arrangements Donald J. Gerson, Gerson Photography Publicity Peter Costianes, Air Force Research Laboratory, Emeritus Webmaster Charles J. Cohen, Cybernet External Support John Irvine, Draper Laboratory Christoph Borel-Donohue, Army Research Laboratory Registration Chair Jeff Kretsch, Raytheon BBN Technologies Proceedings Chair Franklin Tanner, Hadron Industries Student Paper Award Chair Paul Cerkez, DCS Corp. AIPR Committee Members: Jim Aanstoos, Mississippi State University Robert Mericsko, Booz Allen Hamilton Eamon Barrett, Retired Keith Monson, FBI Bob Bonneau, AFOSR Carsten Oertel, MITRE Filiz Bunyak, University of Missouri- William Oliver, Brody School of Medicine Columbia
  • Computer Vision and Remote Sensing – Lessons Learned

    Computer Vision and Remote Sensing – Lessons Learned

    Photogrammetric Week '09 Dieter Fritsch (Ed.) Wichmann Verlag, Heidelberg, 2009 Förstner 241 Computer Vision and Remote Sensing – Lessons Learned WOLFGANG FÖRSTNER, Bonn ABSTRACT Photogrammetry has significantly been influenced by its two neigbouring fields, namely Computer Vision and Remote Sensing. Today, Photogrammetry has been become a part of Remote Sensing. The paper reflects its growing relations with Computer Vision, based on a more than 25 years experience of the author with the fascinating field between cognitive, natural and engineering science, which stimulated his own research and transferred him into a wanderer between two worlds. “Two roads diverged in a wood, and I – I took the one less traveled by, and that has made all the difference.” Robert Frost 1. INTRODUCTION My first contact with a scientific community not being of geodetic nature was in 1983. Franz Leberl had organized the Specialist Workshop on Pattern Recognition in Photogrammetry in Graz (Leberl 1984) on behalf of the International Society of Photogrammtry and Remote Sensing by its Working Group III/5 Mathematical Pattern Recognition and Image Analysis. Well known photogram- metrists presented papers, such as Ed Mikhail or Ian Dowman. But also researches from a – that time – foreign world, such as Harlyn Baker or Robert Haralick, gave a flavour of techniques, not known in the core photogrammetric community, though adressing the same topics, specifically automatic surface reconstruction. The stimulating discussions with people from Pattern Recognition were the starting point for regular travels into and through a fascinating world around the own origin, geodesy and photogrammetry, which is still ongoing, full in the spirit of the workshop: “ ..
  • Piecewise-Smooth Optical Flow and Motion-Based Detection and Tracking

    Piecewise-Smooth Optical Flow and Motion-Based Detection and Tracking

    Robust Visual Motion Analysis: Piecewise-Smooth Optical Flow and Motion-Based Detection and Tracking Ming Ye A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2002 Program Authorized to Offer Degree: Electrical Engineering University of Washington Abstract Robust Visual Motion Analysis: Piecewise-Smooth Optical Flow and Motion-Based Detection and Tracking by Ming Ye Co-Chairs of Supervisory Committee: Professor Robert M. Haralick Electrical Engineering Professor Linda G. Shapiro Computer Science and Engineering This thesis describes new approaches to optical flow estimation and motion-based detection and tracking. Statistical methods, particularly outlier rejection, error analysis and Bayesian inference, are extensively exploited in our study and are shown to be crucial to the robust analysis of visual motion. To recover optical flow, or 2D velocity fields, from image sequences, certain models of brightness conservation and flow smoothness must be assumed. Thus how to cope with model violations especially motion discontinuities becomes a very challenging issue. We first tackle this problem from a local approach, that is, finding the most representative flow vector for each small image region. We recast the popular gradient-based method as a two-stage regression problem and apply adaptive robust estimators to both stages. The estimators are adaptive in the sense that their complexity increases with the amount of outlier contamination. Due to the limited contextual information, the local approach has spatially varying uncertainty. We evaluate the uncertainty systematically through covari- ance propagation. Pointing out the limitations of local and gradient-based methods, we further propose a matching-based global optimization technique.
  • Graphics Recognition a Historical Perspective and Recent Advances Rangachar Kasturi

    Graphics Recognition a Historical Perspective and Recent Advances Rangachar Kasturi

    Graphics Recognition A Historical Perspective and Recent Advances Rangachar Kasturi Computer Science and Engineering University of South Florida Tampa, Florida, USA The copyright for the illustrations and other objects used in this presentation are owned by the respective copyright holders; they are included here to acknowledge their contributions to the Document Analysis and Recognition research. Computer Vision and Pattern Recognition Laboratory IAPR/ICDAR Award Sargur Srihari, Nominator Lawrence O’Gorman, Endorser Gabriella Sanniti di Baja, Endorser Karl Tombre, Endorser Computer Vision and Pattern Recognition Laboratory IAPR/ICDAR Award Committee Dimosthenis Karatzas (Award Chair) Bart Lamiroy (Award Chair) Dan Lopresti Jean-Marc Ogier Josep Llados Simone Marinai Cheng-Lin Liu Marcus Liwicki Venu Govindaraju ICDAR 2017 Organizing Committee Koichi Kise and Team Computer Vision and Pattern Recognition Laboratory IAPR/ICDAR Award I would like to Dedicate this award to My 70+ Students Who Performed Much of the Research I am Recognized for Over the Past 35 Years Computer Vision and Pattern Recognition Laboratory Thank You Some of those who made it all possible Computer Vision and Pattern Recognition Laboratory Representing them all here is Sameer Antani, Ph.D. Sameer Antani: Video Content Characterization via Robust Recognition of Scene and Caption Text, Ph.D., 2001 Computer Vision and Pattern Recognition Laboratory IAPR Leaders, Freeman, Fu, Sakai, and Pavlidis at First Meeting of the IAPR Governing Board, Kyoto, 1978 Computer Vision and Pattern Recognition Laboratory What is Graphics Recognition? A Typical Document: Three Components Text Regions - Recognized by OCR Images - Processed by Image Processing Graphics- Analyzed by Graphics Recognition Graphics include Block Diagram, Chart, Graph, Line Art, Map, Schematic, Table, etc.