Gesture Recognition on Few Training Data Using Slow Feature Analysis and Parametric Bootstrap

Total Page:16

File Type:pdf, Size:1020Kb

Gesture Recognition on Few Training Data Using Slow Feature Analysis and Parametric Bootstrap Gesture Recognition on Few Training Data using Slow Feature Analysis and Parametric Bootstrap Patrick Koch, Wolfgang Konen, and Kristine Hein Abstract— Slow Feature Analysis (SFA) has been established clearly differs from traditional game controllers. But even as a robust and versatile technique from the neurosciences to though there exist a large number of games for the Nintendo learn slowly varying functions from quickly changing signals. Wii, recognition of complex gestures is still a challenging Recently, the method has been also applied to classification tasks. Here we apply SFA for the first time to a time series task. In spite of the large number of classification approaches classification problem originating from gesture recognition. The using the Wiimote controller with the infrared device, e.g. the gestures used in our experiments are based on acceleration work from Lee [Lee08], we focus here on approaches where signals of the Bluetooth Wiimote controller (Nintendo). We no infrared device is used. Although the task becomes more show that SFA achieves results comparable to the well-known difficult then, we are not so dependent on the Wii structure Random Forest predictor in shorter computation time, given a sufficient number of training patterns. However – and this and can easily transfer our approach to other applications. is a novelty to SFA classification – we discovered that SFA This can be of high importance when the use of infrared requires the number of training patterns to be strictly greater is not possible for any reason and only acceleration based than the dimension of the nonlinear function space. If too few sensors are available. Similar works exist, e.g., a recent study patterns are available, we find that the model constructed by by Schlomer¨ et al. [SPHB08], who present a classification SFA severely overfits and leads to high test set errors. We analyze the reasons for overfitting and present a new solution tool based on a Hidden Markov Chain approach and Liu et + based on parametric bootstrap to overcome this problem. al. [LW 09] who apply personalized gesture recognition to a user authentication problem. Rehm et al. [RBA08] describe I. INTRODUCTION classification of Wii gestures with different methods and Slow Feature Analysis (SFA) is a new learning algorithm expose the influence of cultural behaviours in gestures. Mal- emerging from neuroscience which is capable of learning mestig and Sundberg [MS08] use the Wiimote to recognize unsupervised new features or ’concepts’ from multidimen- gestures for sign language recognition with good results. The sional time signals. SFA has been originally developed in commercial product LiveMove2 [Inc09] integrates features of context of an abstract model of unsupervised learning of the Nintendo Motion Plus addon but no benchmark results invariances in the visual system of vertebrates [Wis98] and are available for publicity. is described in detail in [WS02], [Wis03]. Although SFA is The Slow Feature Analysis and Random Forest algorithms inspired from neuroscience, it does not have the drawbacks are briefly introduced in the next section. We also give of conventional ANNs such as long training times or strong detailed information about the acquisition and preparation of dependencies on initial conditions. Instead, SFA is fast in the gathered data. The experimental results are described in training and it has the potential to find hidden features out of Sec. 3 and discussed in Sec. 4. We finish with a conclusion of multidimensional signals, as has been shown impressively by the results and give an outlook of our future work in Sec. 5. [Ber05] for handwritten-digit recognition. For these reasons II. METHODS we apply SFA for the first time to a complex time series A. Slow Feature Analysis classification problem originating from gesture recognition, namely acceleration signals of the Nintendo Bluetooth Wi- Slow Feature Analysis has been invented by Wiskott imote controller and observe several interesting facts when and Sejnowski [WS02], [Wis03] and later extended by applying this method to small data sets. Berkes [Ber05] for classification. The original SFA approach The inclusion of accelerometers in electronic devices such for time series analysis is defined as follows: For a multivari- as mobile phones or game controllers for gesture recognition ate time series signal ~x(t) where t indicates time, find the set has become more and more popular in consumer electronics. of real-valued output functions g1(~x); g2(~x); :::; gM (~x), such While in the nineties, the recognition of gestures placed a that each output function high demand on hardware and was only present in the labs yj(t) = gj(~x(t)) (1) of research institutes, gesture recognition has now made its 1 way into the homes. With over 50 million units sold [Shi09] minimally changes in time : the reason for the success of the Nintendo Wii console can 2 ∆yj(t) = hy_j it is minimal (2) be connected to the novel type of controller handling, which To exclude trivial solutions we add some constraints: Patrick Koch, Wolfgang Konen, and Kristine Hein are with the Depart- ment of Computer Sciences and Engineering Science, Cologne University hyjit = 0 (zero mean) (3) of Applied Sciences, 51643 Gummersbach, Germany (email: fpatrick.koch, 1 wolfgang.konen, [email protected]) h·it means average over time and y_ indicates the time derivative. TABLE I 2 ABSOLUTE EXECUTION TIMES (IN SECONDS) FOR FIVE EXAMPLE hy it = 1 (unit variance) (4) j GESTURES PERFORMED BY TEN TEST PERSONS.EACH GESTURE WAS RECORDED TEN TIMES. hykyjit = 0 (decorrelation for k > j) (5) Gesture Minimum Median Maximum Std.Dev. The third equation is only relevant from the second slow Circle 0.773 1.607 4.914 0.343 Throw 0.416 0.957 2.306 0.471 signal on and higher ones to prevent higher signals from Frisbee 0.376 0.883 2.036 0.353 learning features already represented by slower signals. Bowling 0.530 1.325 2.303 0.377 For arbitrary functions this problem is difficult to solve, Z 0.980 1.607 4.280 0.453 but SFA finds a solution by expanding the input signal into a nonlinear function space by applying certain basis R-implementation of Random Forest [LW02]. It has been functions, e.g. monomials of degree d. This expanded signal shown in the past that the algorithm is quite robust to is sphered to fulfill the constraints of equations (3), (4) and overfitting and noise, that is why we use it as a comparison (5). Then SFA calculates the time derivative of the sphered to the SFA approach. expanded signal and determines from its covariance matrix the normalized eigenvector with the smallest eigenvalue. Finally the sphered expanded signal is projected onto this eigenvector to obtain the slowest output signal y1(t). Berkes [Ber05] has extended this approach to classify a set of handwritten digits. The main idea of this extension is to create many small time series out of the class patterns. Let us assume that for a K-class problem each class cm 2 fc1; :::; cK g has got Nm patterns. We then reformulate the ∆-objective function (2) for SFA with distinct indices k and l as the mean of the difference over all possible pairs: N K m 2 1 X X (m) (m) Fig. 1. Wiimote Controller ∆(yj) = · gj(pk ) − gj(pl ) (6) npair m=1 k;l=1; k<l C. Data Acquisition and Data Preparation (m) where npair denotes the total count of all pairs and pk Nintendo has invented the Wiimote Controller (Fig. 1) (m) and pl represent the k-th and l-th class pattern of class as a Bluetooth compliant game controller for the Nintendo m. In our case such a pattern can be understood as a Wii console. In its basic version an acceleration sensor, a gesture, consisting of a concatenated signal from the 3 axis so called accelerometer, is implemented inside the device, accelerometer of the Wiimote controller. Constraints (3), (4) as well as one infrared device for position determination and (5) can be reformulated then by substituting the average of the controller during games. Additionally there exists over time with the average over all patterns, such that the an extension to the Wiimote, the Wii Motion Plus which learned functions are going to have zero mean, unit variance contains two gyrometers for calculation of angular velocities. and be decorrelated [Ber05]. In order to keep things as simple as possible with respect As [Ber05] has shown, the (K − 1) slowest SFA output to the sensor and hardware side we neither use the sensor signals are expected to have a low intra-class variation, but information from the Wii infrared sensor (sensor bar) nor usually a high inter-class variation. Therefore it is natural from the Wii Motion Plus sensors. As observed in other to train a standard Gaussian classifier on the slowest (K − studies by Schlomer¨ et al. [SPHB08] and Rehm et al. 1) SFA outputs produced from the training records. The [RBA08] we expect to get comparably good and accurate Gaussian classifier will seek an optimal position and shape of recognition rates with the sensor data solely taken from the a Gauss function for each class in this (K − 1)-dimensional accelerometer. space. We recorded five different gestures from ten test per- All SFA calculations were performed with the extended sons with the Avetana Bluetooth library together with a MATLAB toolkit sfa-tk V2.0 from [Kon09], [Ber05]. modified implementation of the Wiigee framework by Pop- pinga [Pop07b]. Accelerometer data were recorded with B. Random Forest their timestamps at a rate of approximately 100 Hz, not The Random Forest (RF) predictor by Breiman [Bre01] is equidistantly delivered from the Bluetooth interface. In a an extension of the well-known classification and regression preliminary data preparation step we interpolated the ac- trees (CART) [BFSO84].
Recommended publications
  • Gesture Recognition for Human-Robot Collaboration: a Review
    International Journal of Industrial Ergonomics xxx (2017) 1e13 Contents lists available at ScienceDirect International Journal of Industrial Ergonomics journal homepage: www.elsevier.com/locate/ergon Gesture recognition for human-robot collaboration: A review * Hongyi Liu, Lihui Wang KTH Royal Institute of Technology, Department of Production Engineering, Stockholm, Sweden article info abstract Article history: Recently, the concept of human-robot collaboration has raised many research interests. Instead of robots Received 20 July 2016 replacing human workers in workplaces, human-robot collaboration allows human workers and robots Received in revised form working together in a shared manufacturing environment. Human-robot collaboration can release hu- 15 November 2016 man workers from heavy tasks with assistive robots if effective communication channels between Accepted 15 February 2017 humans and robots are established. Although the communication channels between human workers and Available online xxx robots are still limited, gesture recognition has been effectively applied as the interface between humans and computers for long time. Covering some of the most important technologies and algorithms of Keywords: Human-robot collaboration gesture recognition, this paper is intended to provide an overview of the gesture recognition research Gesture and explore the possibility to apply gesture recognition in human-robot collaborative manufacturing. In Gesture recognition this paper, an overall model of gesture recognition for human-robot collaboration is also proposed. There are four essential technical components in the model of gesture recognition for human-robot collabo- ration: sensor technologies, gesture identification, gesture tracking and gesture classification. Reviewed approaches are classified according to the four essential technical components. Statistical analysis is also presented after technical analysis.
    [Show full text]
  • Study of Gesture Recognition Methods and Augmented Reality
    Study of Gesture Recognition methods and augmented reality Amol Palve ,Sandeep Vasave Department of Computer Engineering MIT COLLEGE OF ENGINEERING ( [email protected] ,[email protected]) Abstract: However recognizing the gestures in the noisy background has always been a With the growing technology, we difficult task to achieve. In the proposed humans always need something that stands system, we are going to use one such out from the other thing. Gestures are most technique called Augmentation in Image desirable source to Communicate with the processing to control Media Player. We will Machines. Human Computer Interaction recognize Gestures using which we are going finds its importance when it comes to to control the operations on Media player. working with the Human gestures to control Augmentation usually is one step ahead when the computer applications. Usually we control it comes to virtual reality. It has no the applications using mouse, keyboard, laser restrictions on the background. Moreover it pointers etc. but , with the recent advent in also doesn’t rely on certain things like gloves, the technology it has even left behind their color pointers etc. for recognizing the gesture. usage by introducing more efficient This system mainly appeals to those users techniques to control applications. There are who always looks out for a better option that many Gesture Recognition techniques that makes their interaction with computer more have been implemented using image simpler or easier. processing in the past. Keywords: Gesture Recognition, Human Computer Interaction, Augmentation I.INTRODUCTION: Augmentation in controlling the media player operations. Media player always In Human Computer interaction, the first requires mouse or keyboard to control it.
    [Show full text]
  • Hand Interface Using Deep Learning in Immersive Virtual Reality
    electronics Article DeepHandsVR: Hand Interface Using Deep Learning in Immersive Virtual Reality Taeseok Kang 1, Minsu Chae 1, Eunbin Seo 1, Mingyu Kim 2 and Jinmo Kim 1,* 1 Division of Computer Engineering, Hansung University, Seoul 02876, Korea; [email protected] (T.K.); [email protected] (M.C.); [email protected] (E.S.) 2 Program in Visual Information Processing, Korea University, Seoul 02841, Korea; [email protected] * Correspondence: [email protected]; Tel.: +82-2-760-4046 Received: 28 September 2020; Accepted: 4 November 2020; Published: 6 November 2020 Abstract: This paper proposes a hand interface through a novel deep learning that provides easy and realistic interactions with hands in immersive virtual reality. The proposed interface is designed to provide a real-to-virtual direct hand interface using a controller to map a real hand gesture to a virtual hand in an easy and simple structure. In addition, a gesture-to-action interface that expresses the process of gesture to action in real-time without the necessity of a graphical user interface (GUI) used in existing interactive applications is proposed. This interface uses the method of applying image classification training process of capturing a 3D virtual hand gesture model as a 2D image using a deep learning model, convolutional neural network (CNN). The key objective of this process is to provide users with intuitive and realistic interactions that feature convenient operation in immersive virtual reality. To achieve this, an application that can compare and analyze the proposed interface and the existing GUI was developed. Next, a survey experiment was conducted to statistically analyze and evaluate the positive effects on the sense of presence through user satisfaction with the interface experience.
    [Show full text]
  • Improving Gesture Recognition Accuracy on Touch Screens for Users with Low Vision
    Improving Touch Interfaces CHI 2017, May 6–11, 2017, Denver, CO, USA Improving Gesture Recognition Accuracy on Touch Screens for Users with Low Vision Radu-Daniel Vatavu MintViz Lab MANSiD Research Center University Stefan cel Mare of Suceava Suceava 720229, Romania [email protected] Figure 1. Gesture articulations produced by people with low vision present more variation than gestures produced by people without visual impairments, which negatively affects recognizers’ accuracy rates. Ten (10) superimposed executions of a “star” gesture are shown for six people: a person without visual impairments (a); three people with congenital nystagmus and high myopia (b), (c), (d); and two people with chorioretinal degeneration (e), (f). ABSTRACT INTRODUCTION We contribute in this work on gesture recognition to improve Today’s touch screen devices are little accessible to people the accessibility of touch screens for people with low vision. with visual impairments, who need to employ workaround We examine the accuracy of popular recognizers for gestures strategies to be able to use them effectively and indepen- produced by people with and without visual impairments, and dently [17,18,42]. Because smart devices expose touch screens we show that the user-independent accuracy of $P, the best not adapted to non-visual input, touch and gesture inter- recognizer among those evaluated, is small for people with action pose many challenges to people with visual impair- low vision (83.8%), despite $P being very effective for ges- ments [11,12,20,30], which can only be addressed with a thor- tures produced by people without visual impairments (95.9%).
    [Show full text]
  • Controlling Robots with Wii Gestures
    WiiRobot: Controlling Robots with Wii Gestures Aysun Bas¸c¸etinc¸elik Department of Computer Science Brown University Providence, RI, 02906 [email protected] May 15, 2009 Abstract In this project, we introduce WiiRobot, a sim- ple and reproducable gesture based robot con- troller. Our proposed system uses the Nintendo Wii Remote Controller to interact with a robot. The user first trains the system using Wii Re- mote in order to recognize future gestures. For the recognition process, we implemented two simple algorithms; K-Nearest Neighbour and sum of square differences. Our results show that using only a few number of repetitions per gesture is enough for recognition. 1 Introduction Finding a natural and effective way of communication with robots is important in order to enhance their contribution in our daily lives. For this purpose, there have been various developments in the area of Human- Figure 1: User controlling the robot with Wii Remote Robot Interaction. It is known that using a computer, a gamepad or a keyboard can be teleoperative but it is still unnatural and difficult to control. Hence, it is our fingers or with a mouse. We use gestures while we still one of the major research directions to design ease- are doing work with our laptops and phones. Moreoever, of-use and intuitive modes of communication with robots. after the introduction of Nintendo Wii Console, we were introduced to a new hardware, Wii Remote Controller, Another form of communication is gestures. Currently, which has the capability of creating gestures and using it gestures are not just limited to face, hand and body for a different intuitive control of video games.
    [Show full text]
  • A Wii-Based Gestural Interface for Computer Conducting Systems
    A Wii-based gestural interface for computer conducting systems Lijuan Peng David Gerhard Computer Science Department Computer Science Department University of Regina University of Regina SK, Canada, S4S 0A2 SK, Canada, S4S 0A2 [email protected] [email protected] Abstract This system is developed using Max/MSP/Jitter 1 and Java. With the increase of sales of Wii game consoles, it is be- The user stands in front of the computer and the Wii remote coming commonplace for the Wii remote to be used as an is mounted with the camera pointing toward the user. The alternative input device for other computer systems. In this accelerometer functionality and buttons on the Wii remote paper, we present a system which makes use of the infrared are ignored. Unlike other video-camera-based infrared sys- camera within the Wii remote to capture the gestures of a tems, no positional configuration is required. The infrared conductor using a baton with an infrared LED and battery. baton is held in the user’s right hand. It should be noted that Our system then performs data analysis with gesture classi- as long as the camera in the Wii remote can ”see” the entire fication and following, and finally displays the gestures us- conducting window, the specific location of the Wii remote ing visual baton trajectories and audio feedback. Gesture is irrelevant. The setup should be configured so that the user trajectories are displayed in real time and can be compared has full range of conducting motion and is comfortable dur- to the corresponding diagram shown in a textbook.
    [Show full text]
  • Analysis and Categorization of 2D Multi-Touch Gesture Recognition Techniques THESIS Presented in Partial Fulfillment of the Requ
    Analysis and Categorization of 2D Multi-Touch Gesture Recognition Techniques THESIS Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Aditi Singhal Graduate Program in Computer Science and Engineering The Ohio State University 2013 Master's Examination Committee: Dr. Rajiv Ramnath, Adviser Dr. Jay Ramanathan, Co-Adviser Copyright by Aditi Singhal 2013 Abstract Various techniques have been implemented for wide variety of gesture recognition applications. However, discerning the best technique to adopt for a specific gesture application is still a challenge. Wide variety, complexity in the gesture and the need for complex algorithms make implementation of recognition difficult even for basic gesture applications. In this thesis, different techniques for two dimensional (2D) touch gestures are reviewed, compared and categorized based on user requirements of the gesture applications. This work introduces two main paradigms for gesture applications: a) gestures for direct manipulation and navigation, b) gesture-based languages. These two paradigms have specific and separate roles in 2D touch gesture systems. To provide the clear distinction between two paradigms, three different algorithms are implemented for basic to complex 2D gestures using simple sometimes, as well as complex techniques such as linear regression and Hidden Markov Models. Thereafter, these algorithms are analyzed based on their performance and their fit with application requirements. ii Dedication I lovingly dedicate this thesis to my husband Shashnk Agrawal, who supported me each step of the way, my son Aryan Agrawal who in his 3 year old wisdom knew that mommy should not be disturbed while studying, and my late father Dr.
    [Show full text]
  • Gesture Recognition Using 3D Appearance and Motion Features
    Gesture Recognition Using 3D Appearance and Motion Features Guangqi Ye, Jason J. Corso, Gregory D. Hager Computational Interaction and Robotics Laboratory The Johns Hopkins University [email protected] Abstract Alternatively, methods that attempt to capture the 3D infor- mation of the hand [11] have been proposed. However, it is We present a novel 3D gesture recognition scheme that com- well-known that, in general circumstances, the stereo prob- bines the 3D appearance of the hand and the motion dynam- lem is difficult to solve reliably and efficiently. ics of the gesture to classify manipulative and controlling Human hands and arms are highly articulate and de- gestures. Our method does not directly track the hand. In- formable objects and hand gestures normally consist of 3D stead, we take an object-centered approach that efficiently global and local motion of the hands and the arms. Manip- computes the 3D appearance using a region-based coarse ulative and interaction gestures [14] have a temporal nature stereo matching algorithm in a volume around the hand. that involve complex changes of hand configurations. The The motion cue is captured via differentiating the appear- complex spatial properties and dynamics of such gestures ance feature. An unsupervised learning scheme is carried render the problem too difficult for pure 2D (e.g. template out to capture the cluster structure of these feature-volumes. matching) methods. Ideally we would capture the full 3D Then, the image sequence of a gesture is converted to a se- information of the hands to model the gestures [11]. How- ries of symbols that indicate the cluster identities of each ever, the difficulty and computational complexity of visual image pair.
    [Show full text]
  • Augmented Reality by Using Hand Gesture
    International Journal of Engineering Research & Technology (IJERT) ISSN: 2278-0181 Vol. 3 Issue 4, April - 2014 Augmented Reality by using Hand Gesture Jyoti Gupta Amruta Bankar Mrunmayi Warankar Anisha Shelke Computer Department Computer Department Computer Department Computer Department Dr. D.Y. Patil COE Pune, Dr. D.Y. Patil COE Pune, Dr. D.Y. Patil COE Pune, Dr. D.Y. Patil COE Pune, India India India India Abstract — In this paper we have discussed a human-computer mixed scene can then be displayed on different devices. interaction interface (HCI) based on Hand gesture recognition. It Between the two prominent AR methods, video-based AR has is a challenging problem in its general form. We consider a set of attracted the most attention from researchers. manual commands and a reasonably structured environment, From study of Augmented Reality we are developing an and develop a simple, yet effective, procedure for gesture application in which we can navigate a virtual 3D model using recognition. Our approach contains steps for segmenting the hand region, locating the fingers, and finally classifying the hand gesture. In this the user interact with webcam and gesture. The algorithm is invariant to translation, rotation, and different virtual symbols on screen are provided for navigating scale of the hand. We demonstrate the effectiveness of the 3D object, user will select a particular button from screen by technique on real imagery. placing his hand in front of that particular virtual key. We use However, technology is progressing. In particular, Augmented Image Processing algorithms like Grayscaling [9], Blurring Reality (AR), an emerging Human-Computer Interaction [6], Thresholding [7], HSV model [15], Blob Detection [8] for technology, which aims to mix or overlap computer generated 2D recognition of hand.
    [Show full text]
  • Chapter 10. Gesture Recognition Matthew Turk 1. Introduction
    Chapter 10. Gesture Recognition Matthew Turk 1. Introduction .................................................................................................................................................................. 1 2. The Nature of Gesture ................................................................................................................................................. 3 3. Representations of Gesture ........................................................................................................................................ 6 3.1 Pen-based gesture recognition............................................................................................................................ 8 3.2 Tracker-based gesture recognition ..................................................................................................................... 9 3.2.1 Instrumented gloves ...................................................................................................................................... 9 3.2.2 Body suits ..................................................................................................................................................... 11 3.3 Passive vision-based gesture recognition....................................................................................................... 12 3.3.1 Head and face gestures ............................................................................................................................... 15 3.3.2 Hand and arm gestures...............................................................................................................................
    [Show full text]
  • AUTOMATIC RECOGNITION of CHILDREN's TOUCHSCREEN STROKE GESTURES by ALEX SHAW a DISSERTATION PRESENTED to the GRADUATE SCHOOL O
    AUTOMATIC RECOGNITION OF CHILDREN'S TOUCHSCREEN STROKE GESTURES By ALEX SHAW A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY UNIVERSITY OF FLORIDA 2020 c 2020 Alex Shaw ACKNOWLEDGMENTS I would like to thank my advisor, Dr. Lisa Anthony, for her continued support, encouragement, and advice throughout my PhD. I thank my co-advisor, Dr. Jaime Ruiz, as well as the rest of my committee, for their invaluable support through the process. Finally, I thank my labmates for being excellent coworkers and I thank my parents for their support through the long journey that obtaining a PhD has been. 3 TABLE OF CONTENTS page ACKNOWLEDGMENTS..................................3 LIST OF TABLES......................................7 LIST OF FIGURES.....................................8 ABSTRACT......................................... 13 CHAPTER 1 INTRODUCTION................................... 15 1.1 Contributions................................... 17 2 RELATED WORK................................... 19 2.1 Developmentally Appropriate Prompts and Feedback.............. 20 2.2 Selection of Gestures............................... 21 2.3 Recognition and Classification of Children's Gestures.............. 23 2.3.1 Challenges in Studying Children's Gestures............... 24 2.3.2 Summary................................. 26 3 RECOGNITION METHODS.............................. 27 3.1 Template Matchers............................... 27 3.2 Feature-based Statistical
    [Show full text]
  • Bringing Gesture Recognition to All Devices
    Bringing Gesture Recognition To All Devices Bryce Kellogg†, Vamsi Talla†, and Shyamnath Gollakota University of Washington † Co-primary Student Authors Abstract Existing gesture-recognition systems consume signifi- cant power and computational resources that limit how they may be used in low-end devices. We introduce AllSee, the first gesture-recognition system that can op- erate on a range of computing devices including those with no batteries. AllSee consumes three to four or- ders of magnitude lower power than state-of-the-art sys- tems and can enable always-on gesture recognition for smartphones and tablets. It extracts gesture information from existing wireless signals (e.g., TV transmissions), but does not incur the power and computational over- heads of prior wireless approaches. We build AllSee prototypes that can recognize gestures on RFID tags and power-harvesting sensors. We also integrate our hard- Figure 1: AllSee Prototype. It has two main compo- ware with an off-the-shelf Nexus S phone and demon- nents: a pluggable receiver that extracts the amplitude strate gesture recognition in through-the-pocket scenar- of wireless signals (e.g., TV and RFID transmissions), ios. Our results show that AllSee achieves classification and our classification logic implemented on a microcon- accuracies as high as 97% over a set of eight gestures. troller. It also comes with LEDs and a UART interface. use power-intensive analog components such as oscilla- 1 Introduction tors and high-speed ADCs. Second, they require rich sig- nal processing capabilities — computing optical flows, There is growing interest in using human gestures as a FFTs, frequency-time Doppler profiles, etc., is compu- means of interaction with computing devices.
    [Show full text]