Arxiv:1008.1804V1 [Nlin.CD]

A new surrogate data method for nonstationary time series Diego L. Guar´ın Lópeza,1,∗, Alvaro A. Orozco Gutierreza, Edilson Delgado Trejosb aDepartment of Electrical Engineering. Universidad tecnológica of Pereira. Pereira, Colombia. bResearch center at the Instituto Tecnológico Metropolitano. Medell´ın, Colombia. Abstract Hypothesis testing based on surrogate data has emerged as a popular way to test the null hypothesis that a signal is a realization of a linear stochastic process. Typically, this is done by generating surrogates which are made to conform to autocorrelation (power spectra) and amplitude distribution of the data (this is not necessary if data are Gaussian). Recently, a new algorithm was proposed, the null hypothesis addressed by this algorithm is that data are a realization of a non stationary linear stochastic process, surrogates generated by this algorithm preserve the autocorrelation and local mean and variance of data. Unfortunately, the assumption of Gaussian amplitude distribution is not always valid. Here we propose a new algorithm; the hypothesis addressed by our algorithm is that data are a realization of a nonlinear static transformation of a non stationary linear stochastic process. Surrogates generated by our algorithm preserve the autocorrelation, amplitude distribution and local mean and variance of data. We present some numerical examples where the previously proposed surrogate data methods fail, but our algorithm is able to discriminate between linear and nonlinear data, whether they are stationary or not. Using our algorithm we also confirm the presence of nonlinearity in the monthly global average temperature and in a small segment of a signal from a Micro Electrode Recording. Keywords: Computational methods in statistical physics and nonlinear dynamics, Hypothesis testing, Surrogate data, Time series analysis 1. Introduction to the fact that it does not requiere a pivotal statistics [2]. In order to test a null hypothesis at a level of signif- Surrogate data method, initially introduced by icance α, onehas to generate1/α−1(2/α−1) surrogates Theiler et al. [1] is nowadays one of the most popular for a one side (two side) test. Then, one simply evokes tests used in nonlinear time series analysis to investigate whatever statistic is of interest and compares the value the existence of nonlinear dynamics underlying experi- of this statistic computed from data to the distribution of mental data. The approachis to formulatea null hypoth- values elicited from the surrogates. If the statistic value esis for a specific process class and compare the system of the data deviates from that of the surrogates, then the output to this hypothesis. The surrogate data method null hypothesis may be rejected. Otherwise, it may not. can be undertaken in two different ways: Typical real- The classical methods for constrained realizations arXiv:1008.1804v1 [nlin.CD] 10 Aug 2010 izations are Monte Carlo generated surrogates from a named (i) Random shuffle (RS); (ii) Random phase model that provides a good fit to the data; constrained (RP); and, (iii) Amplitude adjusted Fourier transform realizations are surrogates generated from the time se- (AAFT) surrogates [1], were developed to test the null ries to conform to certain properties of the data. The hypothesis that the data came from a (i) i.i.d gaussian latter approach is preferable for hypothesis testing due random process, (ii) linear correlated stochastic process; and (iii) nonlinear static transformation of a linear stochastic process. Surrogates generated with the ∗Corresponding author. Email addresses: [email protected] (Diego L. Guar´ın RS method preserves the amplitude distribution (AD) of López), [email protected] (Alvaro A. Orozco Gutierrez), the original data, while the ones generated with the RP [email protected] (Edilson Delgado Trejos) algorithm preserve the autocorrelation (AC) and surro- 1Diego L. Guar´ın López is supported by the Universidad Tec- gates generated with the AAFT algorithm preserve both nológica de Pereira and Colciencias, Contract No 5304 of 2010. Preprint submitted to Nonlinear Analysis: Real World Applications October 29, 2018 the AD and the AC of the original data (in general this is methods, followed by an introduction to our method, not true, this is why an improved version of the AAFT named Amplitude Adjusted Truncated Fourier Trans- algorithm was presented, referred as iAAFT [3]). form (AATFT). Then we introduce a methodology to Recently, Richard et al. [4] showed that surrogates gen- accept or reject a null hypothesis and proceed to apply erated with the mentioned methods are stationarized the methods to several simulated and real time series, versions of the original data. This imply that while showing the utility of each one. Finally we present some the statistical properties of the data might be time de- concluding remarks. pendent, the statistical properties of the surrogates will not. Because of this, when data becomes from a non- 2. Surrogate data methods stationary process, it is impossible to make a statistical comparison of data with its surrogates. So, the classi- As mentioned, the surrogate data methods, originally cal surrogate data methods are not applicable to non- introduced by Theiler et al. [1], has become a very pop- stationary process. Due the importance of this kind of ular method for hypothesis testing. The original algo- process, many modifications of the classical methods rithms can be stated as follows: have been presented. The first one can be attributed to Schreiber and Schmitz [5], in this approach the surro- 2.1. The existing algorithms gate data preserves the AC and any other desired prop- 2.1.1. Random Phase surrogates (RP) erty of the original time series. To generate a surrogate The surrogate data is generated by the following pro- one starts by random shuffling the data, then measur- cedure: ing (for example) the AC of the surrogates and defin- ing an error function as the square difference of data 1. Start with the original data x[t], t = 1, ··· , N. AC minus the surrogate AC. One has to keep permuting 2. Compute z[n], the Fourier transform of x[t]. pairs until the error function is minimized. To gener- 3. Randomize the phases: z′[n] = z[n]eıφ[n]. ate surrogates for non stationary time series, one has to Where φ[1] = 0 and φ[n] ∈ N(0, 2π), n = 2,..., N. ensure that the surrogates also preserve the local mean 4. Symmetrize z′[n] (to obtain a real inverse Fourier and variance of the data. This procedure can be done Transform): iteratively by means of any optimization algorithm, but z′[n − i + 1] = z′[i + 1], i = 1,..., f loor(n/2), there is no guarantee that one will not be stuck in a lo- if N is even then z′[n/2 + 1] = abs(z′[n/2 + 1]). cal minimum (this issue was overcome in [5] by using z[n] is the complexb conjugate of z[n]. the simulated annealing optimization method). Unfortu- 5. Obtain x′[t], the inverse Fourier transform of z′[n]. nately, this method requires a lot of computational time, b ′ so it is of limited applicability. x [t] is the surrogate data of x[t]. Recently, Nakamura et al. [6] presented a modifica- The surrogates maintain the linear correlation of the tion of the RP method which makes it suitable for non- data, but by means of the phases randomization, any stationary data, they called its method Truncated Fourier nonlinear structure is destroyed. Transform (TFT). Surrogates generated with the TFT algorithm are constrained to preserve the AC and the lo- 2.1.2. Amplitude Adjusted Fourier Transform surro- cal mean and variance of data so, surrogateswill be non- gates (AAFT) stationary if original data are non-stationary. Through The surrogate data is generated by the following pro- this method it is possible to test the null hypothesis cedure: that the data came from a non-stationary linear cor- 1. Start with the original data x[t], t = 1,..., N. related stochastic process. Since surrogates generated 2. Sort the data Sx[k], k = 1,..., N. with this method do not preserve the AD of data, fur- 3. Compute z[n], the Fourier transform of x[t]. ther hypothesis (e.g, data are a realization of a nonlinear statical transformation of a non-stationary linear corre- 4. Make a ranked time series Rx[t] defined to satisfy = lated stochastic process) can not be tested. The aim of Sx[Rx[t]] x[t]. = this paper is to present a new surrogate data method 5. Create a random data set g[t], t 1,..., N. through which is possible to obtain surrogates that are 6. Sort the random gaussian number Sg[k], k = constrained to preserve the AC, AD and local mean and 1,..., N. variance of data, but are otherwise random. 7. Define a new time series y[t] = Sg[Rx[t]]. This document is organized in the following way; ini- 8. Generate a surrogate time series y′[t] from y[t] us- tially we briefly introduce the RP, AAFT and the TFT ing the RP algorithm. 2 9. Make a ranked time series Ry′[t] of y′[t]. 2.2. A new algorithm 10. The surrogate time series of x[t] is given by x′[t] = 2.2.1. Amplitude Adjusted Truncated Fourier Trans- Sx[Ry′[t]]. form surrogates (AATFT) Surrogates generated with the TFT algorithm do not ′ x [t] is the surrogate data of x[t]. preserve AD of data (this is actually not true, if fc is It is evident that this process achieves two aims: First, high enough the surrogates AD will eventually be like just as with RP algorithm, the power spectra (and there- the data AD, but this imply that surrogates are too simi- fore linear correlation) of the data is preserved in the lar to data).

Arxiv:1008.1804V1 [Nlin.CD]

Statistical Analysis for Long Term Correlations in the Stress Time

Acknowledgements

Arxiv:2004.07515V1 [Gr-Qc] 16 Apr 2020 Trailing Heliocentric Orbit Around the Sun at an Angle of 20 Degrees Between the Sun and Earth

Durham E-Theses

On the Characterization of the Deterministic/Stochastic and Linear/Nonlinear Nature of Time Series

Nonlinear Dependencies in and Between Time Series

Measures of Analysis of Time Series (MATS): a MATLAB Toolkit for Computation of Multiple Measures on Time Series Data Bases

Cross-Scale Interactions and Information Transfer

Memory in Network Flows and Its Effects on Spreading Dynamics And

University of Florida Thesis Or Dissertation Formatting

Synchronization, Variability, and Nonlinearity Analysis: Applications to Physiological Time Series

Testing for Nonlinearity in High-Dimensional Time Series from Continuous Dynamics A