Online Parametric and Non Parametric Drift Detection Techniques for Streaming Data in Machine Learning: a Review P.V.N

Total Page:16

File Type:pdf, Size:1020Kb

Online Parametric and Non Parametric Drift Detection Techniques for Streaming Data in Machine Learning: a Review P.V.N ONLINE PARAMETRIC AND NON PARAMETRIC DRIFT DETECTION TECHNIQUES FOR STREAMING DATA IN MACHINE LEARNING: A REVIEW P.V.N. Rajeswari1, Dr. M. Shashi2 1Assoc. Professor, Dept of Comp. Sc. & Engg., Visvodaya Engineering College, Kavali, AP, India 2Professor, Dept of Comp. Sc. & System Engg, Andhra University, Visakhapatnam, AP, India Abstract applied in the areas like, image processing, data Enormous amount of data streams are mining, medical diagnosis, search engines, etc. generated by large number of real time Machine learning categorized into two types applications with the stupendous namely Batch learning and Online learning based advancement in technology. Incremental and on representation and presentation of training online - offline learning algorithms are more samples. Batch learning systems accept a large relevant to process the data streams in the number of examples and learn to build a model data mining context. Detecting the changes from all of them at once. In contrast, Online and reacting against them is an interesting learning systems are given examples research area in current era in knowledge sequentially as streaming data [1] from which discovery process, in Machine Learning. The learning happens incrementally. target function also is changed frequently, due to the occurrence of changes in data streams. Learning from data streams aims to This is an intrinsic problem of online learning, capture the target concept as it changes over popularly known as Concept Drift. To handle time, and it is a current research area of growing the problem despite of the learning model, interest. For instance, changes can emerge due to novel methods are needed to examine the changing interests of people towards news performance metrics. This examination will topics, shopping preferences, energy be done during the learning process and consumption, etc. (e.g. influenced by seasonal prompt drift signals whenever a significant changes). Spam filtering [2] is another example, variation has been detected. The Concept wherein spammers try to elude filters by Drift problem can be detected using various disguising their emails as genuine with newer techniques. This paper focuses on reviewing tricks challenging spam filters to continuously on existing Parametric and non-Parametric enhance their capability to successfully identify techniques Online for drift detection. spam over time. So it is possible that a learning Keywords: Stream mining; Concept drift; model previously induced may be incompatible Parametric; Non-parametric; Machine with the present data, making an update Learning; mandatory. This type of problem is commonly known as Concept Drift. Many online learning I. INTRODUCTION algorithms[18], which are variants of the base Adaptation of science and technological models such as Rule-based systems, Decision outcomes including computerization in every Trees, Naive Bayes, Support Vector Machines, walk of life leads to accumulation of voluminous Instance based learning, and ensemble of amount of data that need to be processed by classifiers [3] [4], have been implemented for machines for pattern extraction for its effective handling Concept Drift. usage. Machine Learning algorithms are widely ISSN (PRINT): 2393-8374, (ONLINE): 2394-0697, VOLUME-5, ISSUE-3, 2018 45 INTERNATIONAL JOURNAL OF CURRENT ENGINEERING AND SCIENTIFIC RESEARCH (IJCESR) In supervised incremental learning, a Exponentially Weighted Moving Average well-extended approach continuously monitors a For Concept Drift Detection (ECDD) [15]. performance measure such as accuracy, or speed etc., of the learning model to handle Concept The comparative study of all algorithms is Drift. A Concept Drift is assumed if a significant provided in Table 1. fall in this measure is estimated and required actions are defined to update the model II. ONLINE PARAMETRIC DRIFT according to the latest data available. In this DETECTION TECHNIQUES scenario, automatic/ inherent change detectors of A. Shewhart’s Control Charts the learning algorithm play a crucial role [5] [6] To verify the validity of manufacturing or [7]. These detectors often operate over a stream business control state, Shewhart’s control charts of real values corresponding to a given (widely known as Process Behavior charts) are performance measure. Most existing approaches used as statistical tools [17]. In other words, monitor changes in terms of a suitable statistic certain statistical assumptions about the process (such as the mean or median) with standard data it produces are used to test the steadiness of deviation [8], to successfully capture the some of the process properties over time. Mean, distributional changes associated with the stream median, variance or standard deviation, of performance measures. Thus, the problem of distribution shape or fraction of non-conforming concept drift detection is reduced to estimating items are the commonly considered properties in significant changes in the statistic calculated the charts. from the sequence of values that measure a performance characteristic. Often, this stream of Shewhart’s control charts are used to real values is also huge (probably infinite), as the continuously monitor any deviations from the learning model is monitored over longer times. current state. These are aimed to identify events, which are very unlikely when the controlled Therefore, it is very common to enforce process is in stable condition. An alarm is used restrictions on these online change detectors [8], to signal when any change is noticed in the [9]. The computational complexity required to stability of the process, so that possible steps process each performance value must be constant should be taken for investigating the causes of as well as methods should be single-pass, where the change as well as for correction. each performance value is processed once and then discarded. These change detectors must also For instance, in case of the control limits able to deal with common types of change UCL (Upper Control Limit) or LCL (Lower widespread in much real-world data [4]. Under Control Limit)) are exceeded. The control limits these conditions, many traditional statistical are constructed at ±3σ from the mean of the approaches that assume a fixed size of the input performance measure, which can be exceeded data for estimating distributional changes are not with relative frequency of 0.27%. suitable. In addition to the LCL and UCL, warning Some of the most studied parametric limits for LWL and UWL are also constructed at schemes to detect changes online are: ±2σ and as well as ±σ. Having multiple limits Shewhart’s control charts [17] defined at various levels around the mean Exponentially Weighted Moving Average statistic, provides fine-grain control through (EWMA) [11] more sophisticated rules specifying low Control charts and Page’s cumulative sum probability event possessing when the process is (CUSUM) [10] under control. Procedure and Non-parametric approaches Some of the following rules can be are Drift Detection Method (DDM) [5] selected in the Shewhart control chart dialog Early Drift Detection Method (EDDM) [6] panel; by default all available rules are selected Adaptive WINdow (ADWIN) [7] [17] Adaptive WINdow2 (ADWIN2) [7] Detection With a Statistical Test of Equal Proportions STEPD [16] ISSN (PRINT): 2393-8374, (ONLINE): 2394-0697, VOLUME-5, ISSUE-3, 2018 46 INTERNATIONAL JOURNAL OF CURRENT ENGINEERING AND SCIENTIFIC RESEARCH (IJCESR) One point exceeds LCL/UCL B. CUSUM : Cumulative Sum Control Two out of three points exceed LWL or Chart UWL limits CUSUM is commonly used as sequential Four out of five points are above/below the analysis technique in statistical quality control. It central line and exceed ± limits is mainly used for monitoring abrupt or sudden Six consecutive points show change detection. In this, ‘θ’ is used as the increasing/decreasing trend parameter of probability distribution referred to Eight consecutive values are beyond ± as "quality number". It is a technique to verify limits any changes in mean, median, etc., for deciding Nine points above/below the middle line when to take corrective action. When the Difference of consecutive values alternates CUSUM [10] method is applied to changes in in sign for fourteen points mean, it can be used for step detection of a time Fifteen points are within ± limits series. Alternative forms of CUSUM charts are given below: Application of Shewhart Control Charts Direct form or Recursive forms for monitoring a control process involves two One- or Two-sided forms steps namely Chart construction and Chart application. X[n] is defined as a discrete random The purpose of construction step is to signal with independent and identically specify the middle/ central line (CL) and control distributed samples in CUSUM technique. Each limits to describe the interval of real process sample follows a probability density function correctly. When these values are not provided in (PDF) represented as p(x[n], θ) that depends on advance, assuming normally distributed data, the ‘θ’ value known as ‘deterministic parameter’ they are set to mean and interval which contains i.e., it can be considered as either mean ‘µx’ or 2 ’ 99.3% of available training data, with the the variance ‘σ x of X[n]. At the change time ‘n’, statistical characteristics. These are based on the discrete random signal may possess one observations of the process under control, abrupt change occurrence, indicated with ‘c’, and excluding outliers or otherwise
Recommended publications
  • Learning and Estimation of Single Molecule Behavior
    Learning and Estimation of Single Molecule Behavior Sivaraman Rajaganapathy1;a, James Melbourne1;b, Tanuj Aggarwal2;c, Rachit Shrivastava1;d, and Murti V. Salapaka1;e Abstract— Data analysis in single molecule studies often of tension. The unfolding of a domain leads to a step involves estimation of parameters and the detection of abrupt change in the length of the protein. How much time is changes in measured signals. For single molecule studies, tools needed for the domains to unfold, the relationship to the for automated analysis that are crucial for rapid progress, need to be effective under large noise magnitudes, and often must force applied, the changes in the structure under folding assume little or no prior knowledge of parameters being stud- and unfolding events are questions that are studied with ied. This article examines an iterated, dynamic programming AFM based force spectroscopy. Most of the studies involve based step detection algorithm (SDA). It is established that numerous iterations of experiments. Often, data is collected given a prior estimate, an iteration of the SDA necessarily from a large number of experiments, not just to ensure a improves the estimate. The analysis provides an explanation and a confirmation of the effectiveness of the learning and high confidence on statistical information, but also because estimation capabilities of the algorithm observed empirically. experiment success rates are sometimes intentionally kept Further, an alternative application of the SDA is demonstrated, low. For example, in single molecule force spectroscopy, wherein the parameters of a worm-like chain (WLC) model the protein solution is diluted enough to ensure that the are estimated, for the automated analysis of data from single the success rate of an attachment forming between the tip molecule protein pulling experiments.
    [Show full text]
  • Edge Detection with a Preprocessing Approach
    Journal of Signal and Information Processing, 2014, 5, 123-134 Published Online November 2014 in SciRes. http://www.scirp.org/journal/jsip http://dx.doi.org/10.4236/jsip.2014.54015 Edge Detection with a Preprocessing Approach Mohamed Abo-Zahhad1, Reda Ragab Gharieb1, Sabah M. Ahmed1, Ahmed Abd El-Baset Donkol2 1Department of Electrical Engineering Communication and Electronics, Assuit University, Assiut, Egypt 2Department of Electrical Engineering and Computer Sciences, Nahda University, Beni-Suef, Egypt Email: [email protected], [email protected], [email protected], [email protected] Received 18 August 2014; revised 15 September 2014; accepted 7 October 2014 Copyright © 2014 by authors and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/ Abstract Edge detection is the process of determining where boundaries of objects fall within an image. So far, several standard operators-based methods have been widely used for edge detection. Howev- er, due to inherent quality of images, these methods prove ineffective if they are applied without any preprocessing. In this paper, an image preprocessing approach has been adopted in order to get certain parameters that are useful to perform better edge detection with the standard opera- tors-based edge detection methods. The proposed preprocessing approach involves computation of the histogram, finding out the total number of peaks and suppressing irrelevant peaks. From the intensity values corresponding to relevant peaks, threshold values are obtained. From these threshold values, optimal multilevel thresholds are calculated using the Otsu method, then multi- level image segmentation is carried out.
    [Show full text]
  • Modèles De Markov Cachés Récents Pour La Détection Et La Reconnaissance D’Activités À Partir D’Un Capteur IMU Placé Sur Une Jambe Haoyu Li
    Modèles de Markov cachés récents pour la détection et la reconnaissance d’activités à partir d’un capteur IMU placé sur une jambe Haoyu Li To cite this version: Haoyu Li. Modèles de Markov cachés récents pour la détection et la reconnaissance d’activités à partir d’un capteur IMU placé sur une jambe. Autre. Université de Lyon, 2019. Français. NNT : 2019LYSEC041. tel-02537351 HAL Id: tel-02537351 https://tel.archives-ouvertes.fr/tel-02537351 Submitted on 8 Apr 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. No d’ordre NNT : 2019LYSEC41 THESE DOCTEUR DE L’ÉCOLE CENTRALE DE LYON Spécialité: Informatique Recent Hidden Markov Models for Lower Limb Locomotion Activity Detection and Recognition using IMU Sensors dans le cadre de l’École Doctorale InfoMaths présentée et soutenue publiquement par HAOYU LI December 2019 Directeur de thèse: Stéphane Derrode, PR à l’Ecole Centrale de Lyon Co-directeur de thèse: Wojciech Pieczynski, PR à Télécom SudParis JURY François Septier Université Bretagne Sud Rapporteur Faicel Chamroukhi Université de Caen Rapporteur Latifa Oukhellou IFSTTAR Présidente Lamia Benyoussef EPITA Examinatrice Stéphane Derrode É́cole Centrale de Lyon Directeur de thèse Wojciech Pieczynski Télécom SudParis Co-directeur de thèse Contents Abstract v Résumé vii 1 Introduction 1 2 Background 7 2.1 Available sensors for collecting data ..................
    [Show full text]
  • A Method for Wavelet-Based Time Series Analysis of Historical Newspapers
    MSc thesis Master’s Programme in Computer Science A Method for Wavelet-Based Time Series Analysis of Historical Newspapers Jari Avikainen December 3, 2019 Faculty of Science University of Helsinki Supervisor(s) Prof. Hannu Toivonen Examiner(s) Prof. Hannu Toivonen, Dr. Lidia Pivovarova Contact information P. O. Box 68 (Pietari Kalmin katu 5) 00014 University of Helsinki,Finland Email address: [email protected].fi URL: http://www.cs.helsinki.fi/ HELSINGIN YLIOPISTO – HELSINGFORS UNIVERSITET – UNIVERSITY OF HELSINKI Tiedekunta — Fakultet — Faculty Koulutusohjelma — Utbildningsprogram — Study programme Faculty of Science Master’s Programme in Computer Science Tekij¨a— F¨orfattare— Author Jari Avikainen Ty¨onnimi — Arbetets titel — Title A Method for Wavelet-Based Time Series Analysis of Historical Newspapers Ohjaajat — Handledare — Supervisors Prof. Hannu Toivonen Ty¨onlaji — Arbetets art — Level Aika — Datum — Month and year Sivum¨a¨ar¨a— Sidoantal — Number of pages MSc thesis December 3, 2019 43 pages Tiivistelm¨a— Referat — Abstract This thesis presents a wavelet-based method for detecting moments of fast change in the textual contents of historical newspapers. The method works by generating time series of the relative frequencies of different words in the newspaper contents over time, and calculating their wavelet transforms. Wavelet transform is essentially a group of transformations describing the changes happening in the original time series at different time scales, and can therefore be used to pinpoint moments of fast change in the data. The produced wavelet transforms are then used to detect fast changes in word frequencies by examining products of multiple scales of the transform. The properties of the wavelet transform and the related multi-scale product are evaluated in relation to detecting various kinds of steps and spikes in different noise environments.
    [Show full text]
  • On Change Point Detection Using the Fused Lasso Method∗
    Submitted to the Annals of Statistics ON CHANGE POINT DETECTION USING THE FUSED LASSO METHOD∗ By Cristian R. Rojas and Bo Wahlberg KTH Royal Institute of Technology In this paper we analyze the asymptotic properties of `1 penal- ized maximum likelihood estimation of signals with piece-wise con- stant mean values and/or variances. The focus is on segmentation of a non-stationary time series with respect to changes in these model parameters. This change point detection and estimation problem is also referred to as total variation denoising or `1 -mean filtering and has many important applications in most fields of science and engi- neering. We establish the (approximate) sparse consistency proper- ties, including rate of convergence, of the so-called fused lasso signal approximator (FLSA). We show that this only holds if the sign of the corresponding consecutive changes are all different, and that this estimator is otherwise incapable of correctly detecting the underlying sparsity pattern. The key idea is to notice that the optimality con- ditions for this problem can be analyzed using techniques related to brownian bridge theory. 1. Introduction. Methods for estimating the mean, trend or variance of a stochastic process from time series data have applications in almost all areas of science and engineering. For non-stationary data it is also impor- tant to detect abrupt changes in these parameters and to be able to segment the data into the corresponding stationary subsets. Such applications include failure detection and fault diagnosis. An important case is noise removal from a piecewise constant signal, for which there are a wide range of proposed de- noising methods.
    [Show full text]
  • Title: Single-Molecule Techniques in Biophysics: a Review of the Progress in Methods and Applications
    Title: Single-molecule techniques in biophysics: a review of the progress in methods and applications Article now published in Reports on Progress in Physics doi: 10.1088/1361-6633/aa8a02 Authors: Helen Miller1, Zhaokun Zhou1, Jack Shepherd, Adam J. M. Wollman and Mark C. Leake2 Affiliations: Biological Physical Sciences Institute (BPSI), University of York, York,YO10 5DD, U.K. 1 These authors contributed equally 2 Correspondence to be spent to [email protected] Abstract Single-molecule biophysics has transformed our understanding of biology, but also of the physics of life. More exotic than simple soft matter, biomatter lives far from thermal equilibrium, covering multiple lengths from the nanoscale of single molecules to up several orders of magnitude to higher in cells, tissues and organisms. Biomolecules are often characterized by underlying instability: multiple metastable free energy states exist, separated by levels of just a few multiples of the thermal energy scale kBT, where kB is the Boltzmann constant and T absolute temperature, implying complex inter-conversion kinetics in the relatively hot, wet environment of active biological matter. A key benefit of single-molecule biophysics techniques is their ability to probe heterogeneity of free energy states across a molecular population, too challenging in general for conventional ensemble average approaches. Parallel developments in experimental and computational techniques have catalysed the birth of multiplexed, correlative techniques to tackle previously intractable biological questions. Experimentally, progress has been driven by improvements in sensitivity and speed of detectors, and the stability and efficiency of light sources, probes and microfluidics. We discuss the motivation and requirements for these recent experiments, including the underpinning mathematics.
    [Show full text]
  • An Anomalous Noise Events Detector for Dynamic Road Traffic Noise
    sensors Article An Anomalous Noise Events Detector for Dynamic Road Traffic Noise Mapping in Real-Life Urban and Suburban Environments Joan Claudi Socoró *, Francesc Alías and Rosa Ma Alsina-Pagès GTM—Grup de recerca en Tecnologies Mèdia, La Salle, Universitat Ramon Llull, Quatre Camins, 30, 08022 Barcelona, Spain; [email protected] (F.A.); [email protected] (R.M.A.-P.) * Correspondence: [email protected]; Tel.: +34-93-2902-452 Received: 6 August 2017; Accepted: 10 October 2017; Published: 12 October 2017 Abstract: One of the main aspects affecting the quality of life of people living in urban and suburban areas is their continued exposure to high Road Traffic Noise (RTN) levels. Until now, noise measurements in cities have been performed by professionals, recording data in certain locations to build a noise map afterwards. However, the deployment of Wireless Acoustic Sensor Networks (WASN) has enabled automatic noise mapping in smart cities. In order to obtain a reliable picture of the RTN levels affecting citizens, Anomalous Noise Events (ANE) unrelated to road traffic should be removed from the noise map computation. To this aim, this paper introduces an Anomalous Noise Event Detector (ANED) designed to differentiate between RTN and ANE in real time within a predefined interval running on the distributed low-cost acoustic sensors of a WASN. The proposed ANED follows a two-class audio event detection and classification approach, instead of multi-class or one-class classification schemes, taking advantage of the collection of representative acoustic data in real-life environments. The experiments conducted within the DYNAMAP project, implemented on ARM-based acoustic sensors, show the feasibility of the proposal both in terms of computational cost and classification performance using standard Mel cepstral coefficients and Gaussian Mixture Models (GMM).
    [Show full text]
  • An Online Algorithm for Piecewise Curve Estimation Using &Ell;<Sup>
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2015 VIVA: An Online Algorithm for Piecewise Curve Estimation Using &ell;0 Norm Regularization Richard Benjamin Voigt University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Applied Mathematics Commons, Electrical and Electronics Commons, and the Operational Research Commons Recommended Citation Voigt, Richard Benjamin, "VIVA: An Online Algorithm for Piecewise Curve Estimation Using &ell;0 Norm Regularization" (2015). Publicly Accessible Penn Dissertations. 1158. https://repository.upenn.edu/edissertations/1158 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/1158 For more information, please contact [email protected]. VIVA: An Online Algorithm for Piecewise Curve Estimation Using &ell;0 Norm Regularization Abstract Many processes deal with piecewise input functions, which occur naturally as a result of digital commands, user interfaces requiring a confirmation action, or discrete-time sampling. Examples include the assembly of protein polymers and hourly adjustments to the infusion rate of IV fluids during treatment of burn victims. Estimation of the input is straightforward regression when the observer has access to the timing information. More work is needed if the input can change at unknown times. Successful recovery of the change timing is largely dependent on the choice of cost function minimized during parameter estimation. Optimal estimation of a piecewise input will often proceed by minimization of a cost function which includes an estimation error term (most commonly mean square error) and the number (cardinality) of input changes (number of commands). Because the cardinality (ℓ0 norm) is not convex, the ℓ2 norm (quadratic smoothing) and ℓ1 norm (total variation minimization) are often substituted because they permit the use of convex optimization algorithms.
    [Show full text]
  • Hidden Markov Models, Theory and Applications.Indd
    HIDDEN MARKOV MODELS, THEORY AND APPLICATIONS Edited by Przemyslaw Dymarski Hidden Markov Models, Theory and Applications Edited by Przemyslaw Dymarski Published by InTech Janeza Trdine 9, 51000 Rijeka, Croatia Copyright © 2011 InTech All chapters are Open Access articles distributed under the Creative Commons Non Commercial Share Alike Attribution 3.0 license, which permits to copy, distribute, transmit, and adapt the work in any medium, so long as the original work is properly cited. After this work has been published by InTech, authors have the right to republish it, in whole or part, in any publication of which they are the author, and to make other personal use of the work. Any republication, referencing or personal use of the work must explicitly identify the original source. Statements and opinions expressed in the chapters are these of the individual contributors and not necessarily those of the editors or publisher. No responsibility is accepted for the accuracy of information contained in the published articles. The publisher assumes no responsibility for any damage or injury to persons or property arising out of the use of any materials, instructions, methods or ideas contained in the book. Publishing Process Manager Ivana Lorkovic Technical Editor Teodora Smiljanic Cover Designer Martina Sirotic Image Copyright Jenny Solomon, 2010. Used under license from Shutterstock.com First published March, 2011 Printed in India A free online edition of this book is available at www.intechopen.com Additional hard copies can be obtained
    [Show full text]
  • Real-Time Step Detection Using Unconstrained Smartphone
    Gonçalo Miguel Grenho Rodrigues Bachelor Degree in Biomedical Engineering Sciences Real-Time Step Detection Using Unconstrained Smartphone Dissertation submitted in partial fulfillment of the requirements for the degree of Master of Science in Biomedical Engineering Adviser: Hugo Filipe Silveira Gamboa, Professor Auxiliar, Faculdade de Ciências e Tecnologia, Universidade NOVA de Lisboa DRAFT: June 9, 2020 Real-Time Step Detection Using Unconstrained Smartphone Copyright © Gonçalo Miguel Grenho Rodrigues, Faculty of Sciences and Technology, NOVA University Lisbon. The Faculty of Sciences and Technology and the NOVA University Lisbon have the right, perpetual and without geographical boundaries, to file and publish this dissertation through printed copies reproduced on paper or on digital form, or by any other means known or that may be invented, and to disseminate through scientific repositories and admit its copying and distribution for non-commercial, educational or research purposes, as long as credit is given to the author and editor. This document was created using the (pdf)LATEX processor, based in the “novathesis” template[1], developed at the Dep. Informática of FCT-NOVA [2]. [1] https://github.com/joaomlourenco/novathesis [2] http://www.di.fct.unl.pt Acknowledgements Without the support of many important people, I would not have been able to reach this important accomplishment in my academic and personal life. First of all, I would like to thank Professor Hugo Gamboa for accepting me as his master thesis student and for allowing me to develop and work on this project, to develop myself and become a better Biomedical Engineer. To Ricardo Leonardo, who support me through the whole development of my master thesis project, guiding and helping me every step of the way, I would like to express my deepest gratitude.
    [Show full text]
  • A Novel Walking Detection and Step Counting Algorithm Using Unconstrained Smartphones
    sensors Article A Novel Walking Detection and Step Counting Algorithm Using Unconstrained Smartphones Xiaomin Kang, Baoqi Huang * and Guodong Qi † College of Computer Science, Inner Mongolia University, Hohhot 010021, China; [email protected] (X.K.); [email protected] (G.Q.) * Correspondence: [email protected]; Tel.: +86-471-499-3132 † Current Address: State Grid Corporation of China, Beijing 100031, China. Received: 13 December 2017; Accepted: 17 January 2018; Published: 19 January 2018 Abstract: Recently, with the development of artificial intelligence technologies and the popularity of mobile devices, walking detection and step counting have gained much attention since they play an important role in the fields of equipment positioning, saving energy, behavior recognition, etc. In this paper, a novel algorithm is proposed to simultaneously detect walking motion and count steps through unconstrained smartphones in the sense that the smartphone placement is not only arbitrary but also alterable. On account of the periodicity of the walking motion and sensitivity of gyroscopes, the proposed algorithm extracts the frequency domain features from three-dimensional (3D) angular velocities of a smartphone through FFT (fast Fourier transform) and identifies whether its holder is walking or not irrespective of its placement. Furthermore, the corresponding step frequency is recursively updated to evaluate the step count in real time. Extensive experiments are conducted by involving eight subjects and different walking scenarios in a realistic environment. It is shown that the proposed method achieves the precision of 93.76% and recall of 93.65% for walking detection, and its overall performance is significantly better than other well-known methods.
    [Show full text]
  • Low Complexity TOA Estimation for Impulse Radio UWB Systems
    MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Low Complexity TOA Estimation for Impulse Radio UWB Systems I. Guvenc; Zafer Sahinoglu TR2005-025 December 2005 Abstract Due to extremely narrow pulses, an impulse radio signaling has a strong potential for high- precision positioning. Highly dispersive nature of ultra-wideband (UWB) channels makes time of arrival (TOA)estimation extremely challenging, where the leading-edge path is not necessarily the strongest path. Since the bandwidth of a received UWB signal is very large, the Nyquist rate sampling becomes impractical, hence motivating lower complexity and yet accurate ranging techniques at feasible sampling rates. In this paper, we consider TOA estimation baased on symbol rate samples that are obtained after a square-law device. Signal conditioning techniques based on wavelets and a bank of cascaded multi-scale energy collection filters are introduced, where correlations across multiple scales are exploited for edge and peak enhancements towards a more accurate detection. An adaptive threshold selection approach based on the minimum and maximum values of the energy samples is introduced, and optimal values of the thresholds for different signal to noise ratios (SNRs) are investigated via simulations. Theoreticaal closed form expressions are derived for mean absolute TOA estimation error, and compared with simulations. The performances of the discussed algorithms are tested on IEEE 802.15.4a residential line-of- sight (LOS) and non-LOS channels. Simulation results show that the introduced multi-scale energy product technique supported with a search-back step to detect the leading edge performs better than all the other techniques, excluding the pure threshold comparison algorithm at very large SNR values.
    [Show full text]