Arxiv:1907.04747V1 [Gr-Qc] 10 Jul 2019 Itored Through Laser Links with a Sensitivity Level of to Spectral Leakage, Aﬀecting Both the Gravitational Sig- Pm/√Hz

Gravitational-wave parameter estimation with gaps in LISA: a Bayesian data augmentation method Quentin Baghi1,∗ James Ira Thorpe1, Jacob Slutsky1, John Baker1, Tito Dal Canton1, Natalia Korsakova2, and Nikos Karnesis3 1NASA Goddard Space Flight Center, 8800 Greenbelt Rd, Maryland, USA 2ArtémisUMR 7250, Observatoire de la Côte d'Azur, Boulevard de l'Observatoire, CS 34229-F, 06304 NICE Cedex 4 and 3Laboratoire Astroparticules et Cosmologie, UniversitéParis Diderot, 10 Rue Alice Domon et LéonieDuquet, 75013 Paris, France (Dated: July 11, 2019) By listening to gravity in the low frequency band, between 0.1 mHz and 1 Hz, the future space- based gravitational-wave observatory LISA will be able to detect tens of thousands of astrophysical sources from cosmic dawn to the present. The detection and characterization of all resolvable sources is a challenge in itself, but LISA data analysis will be further complicated by interruptions occurring in the interferometric measurements. These interruptions will be due to various causes occurring at various rates, such as laser frequency switches, high-gain antenna re-pointing, orbit corrections, or even unplanned random events. Extracting long-lasting gravitational-wave signals from gapped data raises problems such as noise leakage and increased computational complexity. We address these issues by using Bayesian data augmentation, a method that reintroduces the missing data as auxiliary variables in the sampling of the posterior distribution of astrophysical parameters. This provides a statistically consistent way to handle gaps while improving the sampling efficiency and mitigating leakage effects. We apply the method to the estimation of galactic binaries parameters with different gap patterns, and we compare the results to the case of complete data. I. INTRODUCTION phasemeters. In addition, the measurement is likely to be affected by transient perturbations in the data. Such The laser interferometer space antenna (LISA) [1], events have been observed in LIGO-Virgo [5] and in LISA the future space-borne gravitational-wave observatory Pathfinder [6] data. In some cases the safest solution to under development by ESA and NASA, will probe avoid their impact on the science performance may be to gravitational-wave radiation in the milihertz regime, with discard the parts of the data where they arise, thereby a peack sensitivity between 0.1 mHz to 1 Hz. Unlike inducing gaps in the exploitable data streams. the ground-based detectors LIGO [2] and Virgo [3], LISA Assessing the impact of data gaps on the observation will be constantly observing a large number of sources, of gravitational wave sources with LISA is important to some of them emitting long-lasting signals on periods of quantify the relative impact of different kinds of inter- months to years. The detection and characterization of ruptions, in order to inform the design of the instrument. all resolvable sources is a challenge for the data analy- Furthermore, reducing this impact to minimum is crucial sis [4]. In addition, over such time scales, the instrument to be able to optimize the scientific return of the mission. is very likely to undergo interruptions in the measure- In the problem under study, we define data gaps as the ment, which will add extra complication in the extraction absence of usable data points during certain time spans of gravitational signals. in time series that are originally evenly sampled. Data The LISA observatory is a constellation of three satel- gaps can be problematic for two reasons. First, the dis- lites forming a triangle whose side pathlengths are mon- crete Fourier transform (DFT) of gapped data is subject arXiv:1907.04747v1 [gr-qc] 10 Jul 2019 itored through laser links with a sensitivity level of to spectral leakage, affecting both the gravitational sig- pm=pHz. Inertial references are provided by free-falling nal and the stochastic noise. This may lead to increased test-masses housed in the satellites, and the gravitational bias and variance when the computation of the likelihood signal is obtained from several interferometric measure- is done in the Fourier domain. Second, in addition to the ments. In such a complex system, measurement inter- leakage effect, another problem is that the diagonal ap- ruptions can be caused by various phenomena. One of proximation of the covariance matrix in Fourier space is them is the re-pointing process of the satellites' high not valid for gapped data. Appropriately weighting the gain antennas, during which the measured data may be data to account for the noise would require to compute saturated or too perturbed to be usable to infer scien- the covariance in the time domain, which is computa- tific information. Another one is the re-locking of the tionally expensive and may not be possible for long data laser frequencies, which may be needed to maintain het- samples. Therefore appropriate analysis methods must erodyne frequencies in the sensitive bandwidth of the be developed. Few works have addressed the problem of gravitational-wave parameter inference in the pres- ∗ [email protected] ence of data gap. Pollack [7] studied the recovery 2 of monochromatic signals in the presence of data servations, in order to demonstrate the performance of disturbances and encountered complications for daily such an approach. gaps. Carréand Porter [8] studied the effect of gaps on To analyse measurements of gravitational-wave detec- the precision of ultra-compact galactic binary (UCB) tors, we usually model the data by a multivariate Gaus- parameter estimation based on the Fisher information sian and stationary distribution [15]. In this case the con- matrix (FIM) approximation. Their approach is to ditional distribution of missing data given the observed apply apodization, i.e. a window in the time domain data is also Gaussian and can be written explicitly. How- which has smooth transitions at the gap edges, allowing ever, it involves the computation of the product of the them to reduce spectral leakage. They provide a first inverse covariance matrix of observed data with the vec- assessment of the impact of hour-long gaps on parameter tor of model residuals. As Fourier diagonalization is not 3 estimation, showing that errors increase by 2% to 9% possible, this inversion would require O(No ) operations, depending on the parameter. Worst cases are obtained where No is the number of observed data points. In the for gap frequencies larger than one per week. case of stationary noise, more efficient ways to perform However, these studies do not entirely address the this computation are possible, by taking advantage of it- problem of noise correlations in the presence of gaps. erative inversion methods and efficient matrix-to-vector Besides the fact that FIM calculations do not always computations using the fast Fourier transform (FFT) properly represent correlations between parameters, the [16{18]. However, we found that this was a too heavy diagonal approximation of the noise covariance matrix bottleneck for the large number of iterations involved in in the Fourier domain is generally not valid for gapped Markov-chain Monte-Carlo (MCMC) algorithms used to data. Likewise, treating the remaining data segments as sample the posterior distribution of gravitational-wave independent measurements may lead to modeling errors. parameters. Instead, we adopt an approximation to per- Indeed, apodization windowing is a sub-optimal weight- form the imputation step, which is done conditionally on ing of the data, the optimal one being provided by the the nearest observations around gaps [19]. inverse covariance of the entire vector of observed data. In this work we develop a blocked-Gibbs data aug- In the present work we tackle the problem of gaps mentation algorithm which estimates the posterior dis- based on statistical inference, by studying its impact tribution of signal and noise parameters. In order to on Bayesian parameter estimation, and by proposing an demonstrate the performance of this method on a sim- adapted method that optimally takes into account noise ple example, we apply it to the characterization of UCBs correlations. in simulated LISA data. In Sec. II we present the gen- eral Gaussian stationary model that we adopt to describe Statistical inference in the presence of missing data is gravitational-wave measurements, both in the complete a well covered problem in the statistical literature [9{ and gapped data cases. Then in Sec. III we describe the 11]. To circumvent the computational issues that can standard method of time-domain windowing that can be arise when directly computing the likelihood with re- used to handle data gaps. We show how to optimize spect to the observed data, a common trick is to intro- it before highlighting its drawbacks. This leads us to duce a step where missing data are attributed a statis- introduce the data augmentation method as an alterna- tically consistent value, a process called \imputation". tive approach in Sec. IV, where we describe its theoret- This allows one to efficiently compute the likelihood from ical basis. In Sec. V we present an application to the the reconstructed data using standard methods for com- case of UCB parameter estimation, where we describe plete data sets. In the framework of maximum likeli- the time-domain model used to simulate LISA data and hood estimation (MLE), this approach corresponds to the the frequency-domain model used for the data analysis. expectation-maximization (EM) algorithm [12]. In the In Sec. VI we detail the simulations used in this study, framework of Bayesian estimation, which is more widely and in Sec. VII we present the results of the gravitational- used in gravitational-wave data analysis, the equivalent wave parameter estimation. We finally draw conclusions algorithm is known as data augmentation (DA) [13]. In in Sec. VIII. this procedure, the missing data are treated as auxiliary variables of the model, and sampled along with the parameters of interest.

Arxiv:1907.04747V1 [Gr-Qc] 10 Jul 2019 Itored Through Laser Links with a Sensitivity Level of to Spectral Leakage, Aﬀecting Both the Gravitational Sig- Pm/√Hz

Windowing Techniques, the Welch Method for Improvement of Power Spectrum Estimation

Windowing Design and Performance Assessment for Mitigation of Spectrum Leakage

Effect of Data Gaps: Comparison of Different Spectral Analysis Methods

Chapter 2: the Fourier Transform Chapter 2

Fourier Analysis

Deconvolution of the Fourier Spectrum

DFT & Digital Signal Processing Experiment

FFT Spectrum Analysis (Fast Fourier Transform) What Is Frequency Analysis?

The Fundamentals of FFT-Based Signal Analysis and Measurement Michael Cerna and Audrey F

Spectral Analysis of Signals

Spectral Leakage and Windowing

Spectral Density Estimation for Nonstationary Data with Nonzero Mean Function Anna E