Safeswim: Overhead Vision to Help People See Better
Total Page:16
File Type:pdf, Size:1020Kb
SafeSwim: Overhead vision to help people see better Steve Mann, Christina Mann, Faraz Sadrzadeh-Afsharazar, Zhao Lu, Jiten Saini, Jacqueline Lian MannLab Canada, Toronto, Ontario, Canada, M5T 1G5 Abstract—Traditionally video surveillance consists of overhead Efferent Sensors Surveillance Efferent Sensors cameras to help authorities such as lifeguards or security profes- Efferent Mind Body Mind Afferent Body Human Machine A erent sionals monitor individuals. This monitoring, if in service of the ff Senses Cityscape drawing from Wikimedia Commons Mind monitored individuals, serves them only indirectly. More recently, Body Human Machine Humachine Smart City a new kind of sensing called “sousveillance” has emerged, as a Afferent Senses Sousveillance form of sensing by, and in direct service to, individuals. In this Human Humachine Humachines paper, we explore an overhead camera (e.g. aerial drone-based Fig. 1: Self, technology, and society as closed-loop feed- videography) to help create a form of real-time seeing aid, with specific application to help one or more individual swimmers see back systems (Humanistic Intelligence). Note the fractal (self- better and navigate, e.g. to avoid collisions with other swimmers similar) nature of humanistic intelligence, society, and the (or the walls) in pools, as well as navigating in lakes, rivers, smart city, etc.. and other bodies of open water. Additionally the system can be used to teach swimming (e.g. learn by seeing from a birds-eye- view) as well as to gamify swim-based activities. SafeSwim (and sousveillance in general) is a new form of “seeing” in which time, we adapt to seeing the world in this new way, and the those being watched have direct visual access to the live video technology becomes, in some sense, “part of us”, through the stream. It is a direct departure from traditional surveillance long-term HI feedback loop [2]. where authorities such as police or lifeguards watch individuals In a given space or context, we often have more than one without giving them direct access to a live video stream. humachine, e.g. we might have 2 or more swimmers wearing Index Terms—Wearable computing, Wearables, Sensing, Mo- bile computing, Swimming, Drones, Sousveillance, Machine vi- the “Smart Swim” or other augmented reality eyeglass. Thus sion. we need to think of Minsky et al.’s concept of the “Society I. INTRODUCTION of Intelligent Veillance” in which HI (wearable AI) applies across a community [8]. Wearable computing [1] combined with wearable AI [2] As we design and build smart pools, “Smart Beaches”, smart provides a new way of sensing and undertanding the enviro- buildings, smart streets, and smart cities, we need to also ment around us. Wearable sensing technologies can create new consider the idea of “smart people”, i.e. putting “intelligence” and extended human sensing capabilities which work similarly (sensors and computation) on people, not just buildings and to sensory organs. This concept is explored in Fig 1. The lampposts and the skies above them. In this way, humachines human mind may be regarded as one or more feedback loops. are more than just a nexus of human and machine, they form The human may similarly be regarded as a feedback loop a new society and a new concept of technology-and-society. between the mind and body, i.e. efferent nerves carry signals Therefore, HI feedback loops can be understood and classified from the mind to the body, and afferent nerves carry signals according to the following nested taxonomy: from the body to the mind [3], thus completing a closed loop feedback control system [4], [5], shown in leftmost Fig. 1. • Self and technology (e.g. combined human+machine); Humanistic Intelligence (HI) [6], also known as Wearable • Self and society (interaction between humans, huma- AI [2], is a framework for wearable computers and other forms chines, etc.), which includes self and technology; of closed-loop feedback between a human and machine(s). • Self and the environment (interaction between the aug- Technologies like shoes, wearable computers, bicycles, and mented human and the natural or built environment, e.g. automobiles are technologies that “become part of us.” This cyborg-city interaction), which includes self, technology, symbiosis between human and machine is often referred to as a and society. “humachine” [7], “cyborg” [5], bionic, or “augmented human”. This human-centered taxonomy considers the sensory infor- Humanistic technologies typically consist of sensors in the mation available at three key levels as shown in Fig 1). The machine, and our human senses, forming a feedback loop, as third layer, “Self and the environment”, becomes especially shown in the center part of Fig. 1. The example shown here clear when we finally grant individuals a birds-eye-view of is a Vuzix “Smart Swim” eyeglass or other similar augmented large bodies of open water, where an individual swimmer can swimglass. After wearing an eyeglass for a long period of see not only their stroke (i.e. use our system as a way to learn how to swim), but also the water conditions overall, such Authors thank Vuzix (Paul Travers) for equipment donation, and members of SwimOP™ (the online “Swim at Ontario Place” group we created), as algae blooms, floating garbage in some places, etc.. This including Zhao Lu and Trevor McKee, for help with beach cleanup efforts. insight gives an individual swimmer a new way to navigate. For example, he or she can choose to swim along the path-of- Tresspass Robbery prevention prevention least-pollution, or along the path-of-maximum-safety. Violence prevention Pollution or safety or other disiderata/undisiderata can be overlaid in an augmented reality as a field, e.g. a danger- field or safety-field, such as to guide a swimmer using this Security new birds-eye-view, thus contextualizing sousveillance at its Public (crime safety highest level – “Self and the environment”. prevention) Operational This calls for a new way of thinking, in which smart pools, (organizational) smart beaches, and smart cities provide direct, and immediate efficiency feedback to individual citizens, hence promoting the living Surveillance Safety (accident quality of the end-users in the system. prevention) Individual's Physical health A. Safety and security tree Affective To understand our new way of thinking and contrast it Personal Personal Mental with traditional surveillance, we construct a mental model safety Suicurity efficiency (corruption Sousveillance based on a tree. We can think of surveillance (security, etc.) prevention) VMP (memory aid) as forming the trunk of a tree with various branches such as (1) crime prevention, (2) operational efficiency (e.g. so Seeing aid lifeguards can watch swimmers and see how big crowds are Fig. 2: The “Securitree”(Security Tree) model getting, and thus know where to deploy additional staff), and (3) accident prevention (e.g. automatic computer-assisted drowning detection), etc.. This model matches what one typically finds in smart city planning. We propose, also, that there be roots for the tree, and that the roots form various aspects of sousveillance (inverse surveillance), such as to support personal safety, individual health and well-being, etc. The “Securitree” is composed of two main sections, with the Fig. 3: Underwater virtual reality has been a well-established parts of the tree above the ground representing surveillance and practice over the past 20 years or so. It provides a full the parts of tree below the ground representing sousveillance. immersive/submersive experience in which it is very easy to We regard security as the trunk of a tree having surveillance suspend disbelief while floating in a virtual world. branches like public safety, operational efficiency, and crime prevention. There are surveillance sub-branches, e.g. crime prevention has sub-branches such as robbery prevention, vio- interactive virtual reality) [9] and VR float tanks [10], [11] lence prevention, . The tree is rooted in sousveillance, with using the Mersivity™ underwater VR headset (Fig 3). roots like personal safety, “suicurity” (corruption prevention), personal efficiency, and individual’s health. Sousveillance has D. Background sub-branches, e.g. individual health has sub-branches such as The tracking of swimmers is a well-researched field. One physical, affective, mental, (which itself has sub-roots VMP, of the state-of-the art swimmer tracking algorithms uses non- seeing aid, . ), as shown in Fig 2. linear joint transform and colour histogram analysis to ac- B. EyeTap curately detect the locations of the swimmers. Such a hybrid approach is needed for the reliable detection of the swimmers. The EyeTap is a wearable device worn in front of one In such an environment, the rapid movements of swimmers, or both eyes, acting as if the eye itself is a camera. A bodily rotations, as well as water splashes can hinder the display overlaying computer generated imagery can be viewed reproducibility of the detection [12], [13]. from the right eye. The left eye is unobstructed. Similar to the EyeTap, our SafeSwim™ system allows the user to II. METHODOLOGY be fully immersed in their surroundings. The user is able A. Proposed system architecture to see unobstructed through one eye, while their other eye has a display providing real-time safety-oriented situational In a swimmer’s context, in order to close the HI sensory awareness as demonstrated in Fig 7 from video processed from feedback loop, some form of sensory technology is required. an overhead DJI Mavic Mini drone. We are proposing to use a “sousveillance” (inverse surveil- lance) drone to capture a bird’s eye view of the swimmer and C. Immersive/Submersive (“Mersive”) VR make it available to the swimmer in real-time (this immediate Fully immersive/submersive (underwater) VR has been pre- availability to the end-user is what makes it sousveillance viously explored through “immersive multimedia” (underwater rather than surveillance).