Eventdtw: an Improved Dynamic Time Warping Algorithm for Aligning Biomedical Signals of Nonuniform Sampling Frequencies

sensors Article EventDTW: An Improved Dynamic Time Warping Algorithm for Aligning Biomedical Signals of Nonuniform Sampling Frequencies 1, 1, 1 1 2 Yihang Jiang y, Yuankai Qi y, Will Ke Wang , Brinnae Bent , Robert Avram , Jeffrey Olgin 2 and Jessilyn Dunn 1,* 1 The Departments of Biomedical Engineering and Biostatistics & Bioinformatics, Duke University, Durham, NC 27708, USA; [email protected] (Y.J.); [email protected] (Y.Q.); [email protected] (W.K.W.); [email protected] (B.B.) 2 The Division of Cardiology and the Cardiovascular Research Institute, University of California San Francisco, San Francisco, CA 94143, USA; [email protected] (R.A.); jeff[email protected] (J.O.) * Correspondence: [email protected] First co-authors. y Received: 19 March 2020; Accepted: 6 May 2020; Published: 9 May 2020 Abstract: The dynamic time warping (DTW) algorithm is widely used in pattern matching and sequence alignment tasks, including speech recognition and time series clustering. However, DTW algorithms perform poorly when aligning sequences of uneven sampling frequencies. This makes it difficult to apply DTW to practical problems, such as aligning signals that are recorded simultaneously by sensors with different, uneven, and dynamic sampling frequencies. As multi-modal sensing technologies become increasingly popular, it is necessary to develop methods for high quality alignment of such signals. Here we propose a DTW algorithm called EventDTW which uses information propagated from defined events as basis for path matching and hence sequence alignment. We have developed two metrics, the error rate (ER) and the singularity score (SS), to define and evaluate alignment quality and to enable comparison of performance across DTW algorithms. We demonstrate the utility of these metrics on 84 publicly-available signals in addition to our own multi-modal biomedical signals. EventDTW outperformed existing DTW algorithms for optimal alignment of signals with different sampling frequencies in 37% of artificial signal alignment tasks and 76% of real-world signal alignment tasks. Keywords: dynamic time warping; signal alignment; nonuniform sampling 1. Introduction Dynamic time warping (DTW) is a class of pattern matching algorithms for aligning temporal sequences. DTW aims to provide an optimal similarity-based match between two sequences regardless of dynamic spatiotemporal differences, searching for the optimal path by minimizing cumulative point-to-point distances. DTW allows for a flexible definition of cumulative point-to-point distances to optimize alignment performance for different datasets. Because of this flexibility, DTW is widely used in speech recognition under varying speaking speeds [1], gesture recognition [2,3], and time series clustering [4]. 1.1. DTW for Different Sampling Frequencies Measurement modalities utilizing multiple simultaneous sensors with different sampling frequencies and/or uneven sampling rates are common in many applications, particularly in biomedicine [5,6]. In practice, we have observed that DTW performs poorly when applied to signals Sensors 2020, 20, 2700; doi:10.3390/s20092700 www.mdpi.com/journal/sensors Sensors 2020, 20, 2700 2 of 13 thatSensors were 2020 simultaneously, 20, x FOR PEER REVIEW measured at different frequencies and/or when each individual signal2 of 14 has a dynamic, uneven sampling frequency. This is due to the singularity problem [7,8], which results that were simultaneously measured at different frequencies and/or when each individual signal has froma dynamic, inaccurate uneven matching sampling of frequency. a single observation This is due to in the one singularity time sequence problem with [7,8], a largewhich numberresults of consecutivefrom inaccurate observations matching in of another, a single resulting observation in an in uneven one time distribution sequence ofwith point-to-point a large number alignments. of Theconsecutive singularity observations problem is in undesirable another, resulting but common in an uneven in the distribution alignment of of biomedical point-to-point signals alignments. of different andThe dynamic singularity sampling problem frequencies. is undesirable but common in the alignment of biomedical signals of differentA real-world and dynamic example sampling of a singularity frequencies. problem using two biomedical devices to measure the same biologicalA real-world phenomenon example (heart of rate)a singularity with two problem different using modalities two biomedical with diff erentdevices sampling to measure frequencies the (ECG,same blackbiological (~1.26) phenomenon and photoplethysmogram (heart rate) with (PPG),two different blue (~0.2 modalities Hz)) is with shown different in Figure sampling1a. Here, wefrequencies explored (ECG, whether black accounting (~1.26) and for photoplethysmogram local structural information (PPG), blue when (~0.2 performing Hz)) is shown DTW-based in Figure 1a. signal alignmentHere, we explored could both whether reduce accounting singularity for occurrences local structural and information improve alignment when performing accuracy DTW-based [8]. signal alignment could both reduce singularity occurrences and improve alignment accuracy [8]. (a) (b) FigureFigure 1.1. RealReal multi-modal biomedical biomedical signals signals underlie underlie the the need need for for both both an animproved improved alignment alignment methodologymethodology and an improved metric metric to to evaluate evaluate the the resulting resulting alignment: alignment: (a) (Hearta) Heart rate rate measured measured withwith ECG ECG (black, (black, ~1.26 ~1.26 Hz) Hz) and and photoplethysmogram photoplethysmogram (PPG) (PPG) (blue, (blue, ~0.2 ~0.2 Hz); Hz); (b) an(b) example an example of warping of andwarping distortion and distortion observed observed when comparing when comparing the heart th ratese heart measured rates measured by ECG by (black, ECG (black, ~1.26 Hz)~1.26 and Hz) PPG (blue,and PPG ~0.2 (blue, Hz). ~0.2 Hz). 1.2.1.2. ExistingExisting DTWDTW Improvement Methods Methods VariousVarious improvedimproved DTW algorithms have have been been de developedveloped and and applied applied to different to different non-temporal non-temporal datasetsdatasets [[9,10].9,10]. KeoghKeogh et al. developed developed derivative derivative DTW DTW (dDTW), (dDTW), which which produces produces intuitively intuitively correct correct “feature-to-feature”“feature-to-feature” alignmentalignment between between two two sequence sequencess by by using using the the first first derivative derivative of time of time series series sequencessequences as as the the basis basis for for DTW DTW alignment. alignment. Zhao Zh etao al. et developed al. developed shape shape DTW DTW (sDTW), (sDTW), which which computes similaritiescomputes similarities between observations between obse basedrvations on based their on local their neighborhoods local neighborhoods instead instead of based of based only only on their single-pointon their single-point amplitudes amplitudes [11]. However, [11]. However, we have we observed have observed that widely that widely used or used current or current state-of-the-art state- DTWof-the-art algorithms, DTW algorithms, including dDTWincluding and dDTW sDTW, and do sDTW, not produce do not optimal produce alignment optimal alignment results with results respect towith the respect singularity to the problem singularity when problem the signal when frequencies the signal frequencies differ. When differ. the When frequency the frequency of the companion of the companion sequence is different than that of the reference sequence, dDTW and sDTW can result in Sensors 2020, 20, 2700 3 of 13 sequence is different than that of the reference sequence, dDTW and sDTW can result in suboptimal alignments because they use information from proximal observations within the companion sequence which may not have corresponding observations in the reference signal. In this paper, we are aiming to improve the performance of the DTW algorithm in aligning unevenly down-sampled signals. A common method employed to reduce instances of singularities involves adding penalties to the cumulative distance minimization function in the DTW algorithm [8]. Early approaches that constrain the warping path can be categorized into three types: Windowing [12], slope weighting [13], and step patterning [14]. However, these common methods are not robust because they apply a hard-coded limit to the warping path which can result in improper alignment. The parameters of these algorithms also have to be tuned empirically for each application through optimization experiments, making these methods less generalizable to new applications and different types of signals. 1.3. Existing DTW Evaluation Metrics Despite extensive research on DTW algorithm development to solve the singularity problem, methods to evaluate signal alignment are currently limited. Since DTW is a matching result calculated from distance measurements but not a metric [15], we need a new metric to evaluate signal alignment quality. Often, signal alignment evaluation is performed qualitatively by visually examining spatiotemporal feature-to-feature matching. A common quantitative evaluation method is the mean deviation metric [16], but this requires evenly sampled signals with same sampling rate, which is uncommon when recording real-world biomedical signals using different types of sensors. Other quantitative metrics that have been used to evaluate signal alignment include the length of the warping path [8] and

Eventdtw: an Improved Dynamic Time Warping Algorithm for Aligning Biomedical Signals of Nonuniform Sampling Frequencies

Logmine: Fast Pattern Recognition for Log Analytics

Role of Similarity Measures in Time Series Analysis

Similarity Processing in Multi-Observation Data

Clustering Human Behaviors with Dynamic Time Warping and Hidden Markov Models for a Video Surveillance System

Diffeomorphic Temporal Alignment Nets

Shape Dynamic Time Warping

Making Time-Series Classification More Accurate Using Learned

Mining and Similarity Search in Temporal Databases

Comparison of HMM and DTW for Isolated Word Recognition System of Punjabi Language

Needleman–Wunsch Algorithm

Neural Time Warping for Multiple Sequence Alignment

Shapedtw: Shape Dynamic Time Warping