Ethically Speaking #3: Machine Vision and AI Impacts of Patient Care, with Mohammad Mahoor, Phd Transcript

Total Page:16

File Type:pdf, Size:1020Kb

Ethically Speaking #3: Machine Vision and AI Impacts of Patient Care, with Mohammad Mahoor, Phd Transcript Ethically Speaking #3: Machine Vision and AI Impacts of Patient Care, with Mohammad Mahoor, PhD Transcript Dylan Doyle Burke 0:12 Welcome to the Ethically Speaking Podcast. I am your host, Dylan Doyle Burke. Ethically Speaking is a podcast dedicated to asking the fundamental questions of what it means to be ethical in a rapidly evolving world. This episode is part of a series focused on interrogating the ethical questions of artificial intelligence and machine learning. Ethically Speaking, is sponsored by the Institute for Enterprise Ethics housed at the Daniels College of Business at the University of Denver. The Institute's mission is to serve the public good by integrating ethics into professional and corporate cultures. In today's episode, we are delighted to be in conversation with Dr. Mohammed Mahoor. Dr. Mahoor is a professor in the Department of Electrical and Computer Engineering at the University of Denver, and the Director of the Computer Vision and Social Robotics Laboratory. Dr. Mahoor received his PhD from the University of Miami, and his research focuses on visual pattern recognition, social robot design, and bioengineering. Dr. Mahoor is an expert on computer vision and pattern recognition, machine learning and deep learning, algorithm development, effective computing (including facial recognition), socially assistive robotics, social behavior analysis, and so, so much more. By the end of this episode, we hope that folks listening will have a better sense of how machine vision and artificial intelligence robotics can help patients who may have Alzheimer's, dementia, or other neurological concerns. We also hope that folks at the end of this episode have a better sense of the ethical themes surrounding machine vision as it continues to grow and develop out in industry and out in our world. I'm here with Dr. Mohammed Mahoor. Welcome to the show, Dr. Mahoor. Dr. Mohammad Mahoor 2:34 Thank you. Good morning, and thanks for having me. Dylan Doyle Burke 2:37 Absolutely. So, as we get started, could you tell us a little bit more about yourself and how you got into this field of artificial intelligence and machine learning? Dr. Mohammad Mahoor 2:44 Yes, so I've been in faculty at the University of Denver in the Department of Electrical and Computer Engineering for about 12 years now. And my research is about computer vision and machine learning nowadays. They call it AI. Although, you know, AI is a general buzzword term nowadays, but specifically my research is about effective computing, emotion recognition and then using emotion recognition technology in social robotics. So, the idea is to humanize, you know, machines and robots, right? So [to] use social robots to assist and help people with different disabilities such as children with autism, older adults with depression, social isolation, Alzheimer's disease and dementia. So, I would say, you know, effective computing, and social robotics, right now. Dylan Doyle Burke 3:43 And we'll get more into the specifics of your research. But how did you how do you find yourself working on this? Was there like a moment where you decided "this is what I want to focus on?" Dr. Mohammad Mahoor 3:52 Yes. So, I mean, that when I was a graduate student, I was very interested in image processing. And then I got really interested in face recognition. That was my PhD all about biometrics and facial face recognition, face identification. And then I worked as a postdoc in the Psychology Department. So, I learned from psychology [about] right modified emotion. And then the question that you're working on back then, what the research actually was all about, [was] using automated technology or computer algorithms to analyze people's eye gaze, attention, and facial expressions. And particularly, we wanted to see how emotion actually is developed in children over time. So, I worked with developmental psychologists, and my job as postdoc was to come up with a computer vision algorithm. And yeah, to automatically recognize facial expressions and also eye gaze attention. So that's how I got into the field of affective computing and emotion recognition. And then after I joined DU, then... Really, I enjoy what's going on here in the Engineering School at DU. And I saw a lot of robots here. And then, with the help of one of the faculty actually, be focused mostly on social robotics, or humanoid robots. That's the name that they use it in the past. And so, then we thought about applications. Since I worked as a postdoc in the Psychology Department, I'd worked with children with autism, then I thought that we can use technology to make a better life for children with autism. So that's the first research that I started on social robotics. And then after that, we thought about other applications of social robots. And then depression was another application that we worked on. And then we thought "okay, so maybe even people with cognitive impairment" such as older adults with Alzheimer's disease or dementia, that a social robot could help as well. Dylan Doyle Burke 6:24 Yeah, I am curious about the "how" of how machine vision can help and whether that's changed through time as the technology has changed. Dr. Mohammad Mahoor 6:37 Yeah, absolutely. So, machine vision or computer vision... We call it mostly computer vision because we work with computers on just the specific machines to be able to run those algorithms. So, the idea is to basically... I mean, that you use a camera to be able to see the world, and then [use] true AI and machine learning to be able to do object recognition, body tracking, even medical image analysis, image understanding and so on, so forth. So, in, I mean that in early days mostly, computer vision was all about image processing maybe 30 years ago. But then other problems actually -- I mean, AI researchers worked on other problems such as motion, tracking motion estimation, pose estimation, and then object detection, face recognition, face detection, and so on, so forth -- But nowadays with the improvement in machine learning AI and specifically deep learning and convolutional neural networks, you're able to build systems that are end to end; they are more powerful, they can basically model in that huge amount of data, they have more capacity in terms of machine learning. Dylan Doyle Burke 7:58 Can you give an example of something that's possible now with deep learning and neural networks, that wouldn't have been possible? Dr. Mohammad Mahoor 8:06 Yeah, so deep learning, I mean, it's something that we had in the 80s, but we didn't have enough data and also, we didn't have enough computers. You know, powerful computers to be able to train models. And of course, there are advancements in deep learning techniques and neural networks. So, one of the things that [maybe] 10 years ago or 15 years ago, we could do it but not a good accuracy was object detection and object recognition. But using deep learning, we can train a model using millions of images, and samples. Using, for example, imagenet data set that contains millions of images. So, then we can do object recognition with a very high accuracy, very close or even better than human accuracy so we can beat human accuracy. So, what does it mean, "object recognition?" It means that you have an image, and you look into that image, and then you want it to know what objects are in the image. To, for example, to label them saying that if you take an image, a picture of my office, so we can say that those are all books, right? That's a computer. That's, you know, my backpack and so on and so forth. So that sort of thing. Dylan Doyle Burke 9:29 And so then how do you how do you apply that to say a child with autism? How would that help them out in the field? Dr. Mohammad Mahoor 9:36 Yes. So, in the case of autism, for example, it can be used to... I mean to practice social skills with them, to interact with them, and then to the interaction; we basically teach them eye gaze attention, turn taking, and emotional recognition, describing emotions and so on and so forth. So, we use it as a tool to help them and assist them. With... I mean, to practice social skills is not... So, the idea is not to supplant human [work], but it's basically to assist [the] therapies. And it's mostly because children with autism they like objects, robots and technology, and human interaction is very overwhelming for them. So, we take the advantage of technology to connect with them and help them. So, I have seen even some... I mean that people have created technology that are wearable devices and then children with autism can use them to give them suggestions. It's not been commercialized yet. I think this is still research, projects, and research tools, but I see a promise for them in the future to be used. Dylan Doyle Burke 11:00 So, in that rollout of when it does become commercialized, and it moves from academia to industry, how do we do that well? Dr. Mohammad Mahoor 11:13 So, you mean that where we are now in terms of how many companies have been commercializing these kinds of technologies? Or... Dylan Doyle Burke 11:22 And also, with a gaze towards ethics when we're talking about robotics, and then it goes out into the market, maybe to publicly traded companies and kind of those kinds of things? Dr.
Recommended publications
  • Vision-Based Positioning for Internet-Of-Vehicles Kuan-Wen Chen, Chun-Hsin Wang, Xiao Wei, Qiao Liang, Chu-Song Chen, Ming-Hsuan Yang, and Yi-Ping Hung
    http://ieeexplore.ieee.org/Xplore Vision-Based Positioning for Internet-of-Vehicles Kuan-Wen Chen, Chun-Hsin Wang, Xiao Wei, Qiao Liang, Chu-Song Chen, Ming-Hsuan Yang, and Yi-Ping Hung Abstract—This paper presents an algorithm for ego-positioning Structure by using a low-cost monocular camera for systems based on from moon the Internet-of-Vehicles (IoV). To reduce the computational and For local model memory requirements, as well as the communication load, we construc,on tackle the model compression task as a weighted k-cover problem for better preserving the critical structures. For real-world vision-based positioning applications, we consider the issue of large scene changes and introduce a model update algorithm to For image collec,on address this problem. A large positioning dataset containing data collected for more than a month, 106 sessions, and 14,275 images is constructed. Extensive experimental results show that sub- meter accuracy can be achieved by the proposed ego-positioning algorithm, which outperforms existing vision-based approaches. (a) Index Terms—Ego-positioning, model compression, model up- date, long-term positioning dataset. Download local 3D Upload newly scene model acquired images for model update I. INTRODUCTION NTELLIGENT transportation systems have been exten- I sively studied in the last decade to provide innovative and proactive services for traffic management and driving safety issues. Recent advances in driving assistance systems mostly For image matching provide stand-alone solutions to these issues by using sensors and posi1oning limited to the line of sight. However, many vehicle accidents occur because of other vehicles or objects obstructing the (b) view of the driver.
    [Show full text]
  • Artificial Intelligence in Health Care: the Hope, the Hype, the Promise, the Peril
    Artificial Intelligence in Health Care: The Hope, the Hype, the Promise, the Peril Michael Matheny, Sonoo Thadaney Israni, Mahnoor Ahmed, and Danielle Whicher, Editors WASHINGTON, DC NAM.EDU PREPUBLICATION COPY - Uncorrected Proofs NATIONAL ACADEMY OF MEDICINE • 500 Fifth Street, NW • WASHINGTON, DC 20001 NOTICE: This publication has undergone peer review according to procedures established by the National Academy of Medicine (NAM). Publication by the NAM worthy of public attention, but does not constitute endorsement of conclusions and recommendationssignifies that it is the by productthe NAM. of The a carefully views presented considered in processthis publication and is a contributionare those of individual contributors and do not represent formal consensus positions of the authors’ organizations; the NAM; or the National Academies of Sciences, Engineering, and Medicine. Library of Congress Cataloging-in-Publication Data to Come Copyright 2019 by the National Academy of Sciences. All rights reserved. Printed in the United States of America. Suggested citation: Matheny, M., S. Thadaney Israni, M. Ahmed, and D. Whicher, Editors. 2019. Artificial Intelligence in Health Care: The Hope, the Hype, the Promise, the Peril. NAM Special Publication. Washington, DC: National Academy of Medicine. PREPUBLICATION COPY - Uncorrected Proofs “Knowing is not enough; we must apply. Willing is not enough; we must do.” --GOETHE PREPUBLICATION COPY - Uncorrected Proofs ABOUT THE NATIONAL ACADEMY OF MEDICINE The National Academy of Medicine is one of three Academies constituting the Nation- al Academies of Sciences, Engineering, and Medicine (the National Academies). The Na- tional Academies provide independent, objective analysis and advice to the nation and conduct other activities to solve complex problems and inform public policy decisions.
    [Show full text]
  • Visual Prosthetics Wwwwwwwwwwwww Gislin Dagnelie Editor
    Visual Prosthetics wwwwwwwwwwwww Gislin Dagnelie Editor Visual Prosthetics Physiology, Bioengineering, Rehabilitation Editor Gislin Dagnelie Lions Vision Research & Rehabilitation Center Johns Hopkins University School of Medicine 550 N. Broadway, 6th floor Baltimore, MD 21205-2020 USA [email protected] ISBN 978-1-4419-0753-0 e-ISBN 978-1-4419-0754-7 DOI 10.1007/978-1-4419-0754-7 Springer New York Dordrecht Heidelberg London Library of Congress Control Number: 2011921400 © Springer Science+Business Media, LLC 2011 All rights reserved. This work may not be translated or copied in whole or in part without the written permission of the publisher (Springer Science+Business Media, LLC, 233 Spring Street, New York, NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in connection with any form of information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed is forbidden. The use in this publication of trade names, trademarks, service marks, and similar terms, even if they are not identified as such, is not to be taken as an expression of opinion as to whether or not they are subject to proprietary rights. Printed on acid-free paper Springer is part of Springer Science+Business Media (www.springer.com) Preface Visual Prosthetics as a Multidisciplinary Challenge This is a book about the quest to realize a dream: the dream of restoring sight to the blind. A dream that may have been with humanity much longer than the idea that disabilities can be treated through technology – which itself is probably a very old idea.
    [Show full text]
  • Machine Vision for the Iot and I0.4
    International Journal of Management, Technology And Engineering ISSN NO : 2249-7455 MACHINE VISION FOR THE IOT AND I0.4 ROHIT KUMAR1, HARJOT SINGH GILL2 1Student, Department of Mechatronics Engineering Chandigarh University, Gharuan 2Assistant Professor, Department of Mechatronics Engineering Chandigarh University, Gharuan Abstract: Human have six sense and vision is one of the most used sense in the daily life. So here the concept if we have to replace the human by machine .We need the high quality sensor that can provid the real time vision for the machine . It also help to make control system. It can be used in industry very easily at different stages of manufacturing for preventive maintenance and fault diagnosis. Machine vision aslo help in industry to speed up inspection processes and reduces the wastage by preventing production quality. For instance, consider an arrangement of pictures demonstrating a tree influencing in the breeze on a splendid summer's day while a cloud moves over the sun changing the power and range of the enlightening light. For instance, consider an arrangement of pictures demonstrating a tree influencing in the breeze on a brilliant summer's day while a cloud moves over the sun adjusting the power and range of the enlightening light. Key words: Artificial Intelligence, Machine vision, Artificial neural network, , image processing. Introduction: The expanded utilization, consciousness of value, security has made a mindfulness for enhanced quality in customer items. The interest of client particular custom-misation, increment of rivalry has raised the need of cost decrease. This can be accomplished by expanding the nature of items, decreasing the wastage amid the generation, adaptability in customisation and quicker creation.
    [Show full text]
  • An Overview of Computer Vision
    U.S. Department of Commerce National Bureau of Standards NBSIR 82-2582 AN OVERVIEW OF COMPUTER VISION September 1982 f— QC Prepared for iuu National Aeronautics and Space . Ibo Administration Headquarters 61-2562 Washington, D.C. 20546 National Bureau ot Standard! OCT 8 1982 ftsVac.c-'s loo - 2% NBSIR 82-2582 AN OVERVIEW OF COMPUTER VISION William B. Gevarter* U.S. DEPARTMENT OF COMMERCE National Bureau of Standards National Engineering Laboratory Center for Manufacturing Engineering Industrial Systems Division Metrology Building, Room A127 Washington, DC 20234 September 1982 Prepared for: National Aeronautics and Space Administration Headquarters Washington, DC 20546 U.S. DEPARTMENT OF COMMERCE, Malcolm Baldrige, Secretary NATIONAL BUREAU OF STANDARDS, Ernest Ambler, Director * Research Associate at the National Bureau of Standards Sponsored by NASA Headquarters * •' ' J s rou . Preface Computer Vision * Computer Vision — visual perception employing computers — shares with "Expert Systems" the role of being one of the most popular topics in Artificial Intelligence today. Commercial vision systems have already begun to be used in manufacturing and robotic systems for inspection and guidance tasks. Other systems at various stages of development, are beginning to be employed in military, cartograhic and image inter pretat ion applications. This report reviews the basic approaches to such systems, the techniques utilized, applications, the current existing systems, the state-of-the-art of the technology, issues and research requirements, who is doing it and who is funding it, and finally, future trends and expectations. The computer vision field is multifaceted, having many participants with diverse viewpoints, with many papers having been written. However, the field is still in the early stages of development — organizing principles have not yet crystalized, and the associated technology has not yet been rationalized.
    [Show full text]
  • (12) United States Patent (10) Patent No.: US 9,547,804 B2 Nirenberg Et Al
    USO09547804B2 (12) United States Patent (10) Patent No.: US 9,547,804 B2 Nirenberg et al. (45) Date of Patent: Jan. 17, 2017 (54) RETNAL ENCODER FOR MACHINE (58) Field of Classification Search VISION CPC ...... G06K 9/4619; H04N 19760; H04N 19/62; H04N 19/85; G06N 3/049 (75) Inventors: Sheila Nirenberg, New York, NY (US); (Continued) Illya Bomash, Brooklyn, NY (US) (56) References Cited (73) Assignee: Cornell University, Ithaca, NY (US) U.S. PATENT DOCUMENTS (*) Notice: Subject to any disclaimer, the term of this 5,103,306 A * 4/1992 Weiman .................. GOS 5.163 patent is extended or adjusted under 35 348/400.1 U.S.C. 154(b) by 0 days. 5,815,608 A 9/1998 Lange et al. (21) Appl. No.: 14/239,828 (Continued) (22) PCT Filed: Aug. 24, 2012 FOREIGN PATENT DOCUMENTS CN 101.239008 A 8, 2008 (86) PCT No.: PCT/US2O12/052348 CN 101.336856. A 1, 2009 S 371 (c)(1), (Continued) (2), (4) Date: Jul. 15, 2014 OTHER PUBLICATIONS (87) PCT Pub. No.: WO2O13AO29OO8 Piedade, Moises, Gerald, Jose, Sousa, Leonel Augusto, Tavares, PCT Pub. Date: Feb. 28, 2013 Goncalo, Tomas, Pedro. “Visual Neuroprothesis: A Non Invasive System for Stimulating the Cortex'IEEE Transactions on Circuits (65) Prior Publication Data and Systems-I: Regular Papers, vol. 53 No. 12, Dec. 2005.* US 2014/0355861 A1 Dec. 4, 2014 (Continued) Primary Examiner — Kim Vu Related U.S. Application Data Assistant Examiner — Molly Delaney (60) Provisional application No. 61/527.493, filed on Aug. (74) Attorney, Agent, or Firm — Foley & Lardner LLP 25, 2011, provisional application No.
    [Show full text]
  • A Neuromorphic Approach for Tracking Using Dynamic Neural Fields on a Programmable Vision-Chip
    A Neuromorphic Approach for Tracking using Dynamic Neural Fields on a Programmable Vision-chip Julien N.P. Martel Yulia Sandamirskaya Institute of Neuroinformatics Institute of Neuroinformatics University of Zurich and ETH Zurich University of Zurich and ETH Zurich 8057 Zurich, Switzerland 8057 Zurich, Switzerland [email protected] [email protected] ABSTRACT DNF activation Saliency map In artificial vision applications, such as tracking, a large Frame amount of data captured by sensors is transferred to pro- Processor Array cessors to extract information relevant for the task at hand. Smart vision sensors offer a means to reduce the computa- Lens tional burden of visual processing pipelines by placing more processing capabilities next to the sensor. In this work, we use a vision-chip in which a small processor with memory is located next to each photosensitive element. The architec- Scene ture of this device is optimized to perform local operations. To perform a task like tracking, we implement a neuromor- phic approach using a Dynamic Neural Field, which allows to segregate, memorize, and track objects. Our system, con- sisting of the vision-chip running the DNF, outputs only the activity that corresponds to the tracked objects. These out- Cellular Processor Array Vision-chip puts reduce the bandwidth needed to transfer information as well as further post-processing, since computation happens Figure 1: An illustration of the system implemented at the pixel level. in this work: A vision-chip is directed at a scene and captures a stream of images processed on-chip CCS Concepts in a saliency map. This map is fed as an input to a •Computing methodologies ! Object detection; Track- Dynamic Neural Field (DNF), also run on-chip and ing; Massively parallel and high-performance simulations; used for tracking objects.
    [Show full text]
  • AI: What It Is, and Should You Use It
    AI: What it is, and should you use it AI: What it is, and should you use it In this E-Guide: The hype surrounding artificial intelligence (AI) and all its potential applications is seemingly unending—but, could it really as powerful as people say it is? AI (artificial intelligence) Keep reading to discover what AI is by definition, why it’s becoming a hot commodity for contact centers, and the pros and cons of implementing it for yourself. 5 trends driving AI in contact centers The pros and cons of customer service AI Page 1 of 21 SPONSORED BY AI: What it is, and should you use it AI (artificial intelligence) Margaret Rouse, WhatIs.com Artificial intelligence (AI) is the simulation of human intelligence processes by machines, especially computer systems. These processes include learning (the acquisition of AI (artificial information and rules for using the information), reasoning (using rules to reach approximate intelligence) or definite conclusions) and self-correction. Particular applications of AI include expert systems, speech recognition and machine vision. 5 trends driving AI in contact centers AI can be categorized as either weak or strong. Weak AI, also known as narrow AI, is an AI system that is designed and trained for a particular task. Virtual personal assistants, such as The pros and cons of Apple's Siri, are a form of weak AI. Strong AI, also known as artificial general intelligence, is customer service AI an AI system with generalized human cognitive abilities. When presented with an unfamiliar task, a strong AI system is able to find a solution without human intervention.
    [Show full text]
  • Neural Network Based Fault Detection on Painted Surface
    UMEÅ UNIVERSITY MASTER THESIS Neural network based fault detection on painted surface Author: Midhumol Augustian A thesis submitted in partial fulfillment of the requirements for the degree of Masters in Robotics and Control Engineering Department of Applied Physics and Electronics Umeå University 2017 iii Declaration of Authorship I, Midhumol Augustian, declare that this thesis titled, “Neural network based fault detection on painted surface ” and the work presented in it are my own. I confirm that: • This work was done wholly while in candidature for a Masters degree in Robotics and control Engineering at Umeå University. • Where I have consulted the published work of others, this is always clearly attributed. • Where I have quoted from the work of others, the source is always given. With the exception of such quotations, this thesis is entirely my own work. • I have acknowledged all main sources of help. • Where the thesis is based on work done by myself jointly with others, I have made clear exactly what was done by others and what I have contributed myself. Signed: Date: v Abstract Machine vision systems combined with classification algorithms are being in- creasingly used for different applications in the age of automation. One such application would be the quality control of the painted automobile parts. The fundamental elements of the machine vision system include camera, illumi- nation, image acquisition software and computer vision algorithms. Tradi- tional way of thinking puts too much importance on camera systems and ig- nores other elements while designing a machine vision system. In this thesis work, it is shown that selecting an appropriate illumination for illuminating the surface being examined is equally important in case of machine vision system for examining specular surface.
    [Show full text]
  • Vision, Brain and Assistive Technologies Rationale Course
    Vision, Brain and Assistive Technologies Rationale Based on the World Health Organization 2012 Report, there are more than 285 million visually impaired people, of which 39 million are blind. About 65% of all people who are visually impaired are aged 50 and older, while this age group comprises about 20% of the world's population. With an in- creasing elderly population in many countries, more people will be at risk of age-related visual impairment. Research on multimodal and alternative perception will have a long term impact on the health and wellness of soci- ety, not only for the visually challenged, but for people who often work in dangerous environments, such as firefighters, drivers and soldiers. Course Description This course will discuss modern vision science and explore how the brain sees the world, thus including introductory on computational neuroscience, mo- tion, color and several other topics. Then the basics of computer vision will be introduced, for both preprocessing and 3D computer vision. Finally, we will discuss the needs and state-of-art in sensing, processing and stimulation for assisting visually challenged people (blind and visually impaired people) using advanced technologies in machine vision, robotics, displays, materials, portable devices and infrastructure. The course will be offered as an interdisciplinary seminar course, in which a few lectures will be provided from the recommended textbook on human vision as well as the lecture notes of the instructor on computer vision, and then students from mathematics, physics, electrical engineering, computer science and psychology and other social sciences will be assigned to read, present and discuss materials in vision, brain, computing and devices for assisting the visually impaired.
    [Show full text]
  • Machine Vision and Industrial Automation Accelerating Adoption in Industrial Automation Jayavardhana Gubbi Jayavardhana P Balamuralidhar
    Machine Vision and Industrial Automation Accelerating Adoption in Industrial Automation Jayavardhana Gubbi Jayavardhana P Balamuralidhar Machine vision has harnessed deep learning admirably. Disruptive solutions are in the market for security and surveillance, autonomous vehicles, industrial inspection, digital twins, and augmented reality (AR). However, historically, data acquisition within computer vision has been hamstrung by manual interpretation of captured image data. Fortunately though, deep learning has generated data-driven models that could replace handcrafted preprocessing and feature extraction superbly. IN BRIEF The future has already arrived. TCS has, for instance, not only delivered revolutionary deep learning-powered machine vision solutions, but is also using advanced software algorithms to design industry-grade spatial cognition-enabled computer vision applications. A field established in the 1960s, Machine vision today it’s only in the past 30 months or so that the machine vision Key technological developments in industry has harnessed deep miniaturized cameras, embedded learning—with extremely devices, and machine learning good results. Poised where AI, have historically enabled adoption imaging sensors, and image of machine vision in a multitude processing meet, machine vision of applications—satellite image is now making a massive impact in analysis and medical image industrial applications. interpretation, among others— across diverse domains. This has brought disruptive industry-grade solutions to the But the game-changer today is market. Table 11 sets out the key the power and affordability of industry sectors in which computer computing devices, including vision has made inroads. the advent of digital and mobile cameras. This has resulted in image The key application trends processing spreading to other bringing machine vision to the fore areas and making humongous include security and surveillance, volumes of real (as against autonomous vehicles, industrial synthetic) visual data available inspection, digital twins, and AR.
    [Show full text]
  • Improving Perception from Electronic Visual Prostheses
    Improving Perception from Electronic Visual Prostheses Justin Robert Boyle BEng (Mech) Hons1 Image and Video Research Laboratory School of Engineering Systems Queensland University of Technology Submitted as a requirement for the degree of Doctor of Philosophy February 2005 Keywords image processing, visual prostheses, bionic eye, artificial human vision, visual perception, subjective testing, visual information ii Abstract This thesis explores methods for enhancing digital image-like sensations which might be similar to those experienced by blind users of electronic visual prostheses. Visual prostheses, otherwise referred to as artificial vision systems or bionic eyes, may operate at ultra low image quality and information levels as opposed to more common electronic displays such as televisions, for which our expectations of image quality are much higher. The scope of the research is limited to enhancement by digital image processing: that is, by manipulating the content of images presented to the user. The work was undertaken to improve the effectiveness of visual prostheses in representing the visible world. Presently visual prosthesis development is limited to animal models in Australia and prototype human trials overseas. Consequently this thesis deals with simulated vision experiments using normally sighted viewers. The experiments involve an original application of existing image processing techniques to the field of low quality vision anticipated from visual prostheses. Resulting from this work are firstly recommendations for effective image processing methods for enhancing viewer perception when using visual prosthesis prototypes. Although limited to low quality images, recognition of some objects can still be achieved, and it is useful for a viewer to be presented with several variations of the image representing different processing methods.
    [Show full text]