Extracting cosmological information from small scales in weak gravitational lensing data
José Manuel Zorrilla Matilla
Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the Graduate School of Arts and Sciences
COLUMBIA UNIVERSITY
2020 © 2020
José Manuel Zorrilla Matilla
All Rights Reserved Abstract Extracting cosmological information from small scales in weak gravitational lensing data José Manuel Zorrilla Matilla
This work is concerned with how to extract information encoded in small scales of non-Gaussian fields, with the purpose of learning about cosmology using weak gravitational lensing. We do so by comparing different methods on simulated data sets. The topic is relevant, for upcoming galaxy surveys will map the late evolution of the matter density field, which is non-Gaussian, with an unprecedented level of detail, and any improvement on the analysis techniques will increase the experiments’ scientific return. First, we investigate some non-Gaussian observables used in the weak lensing community. We analyze to what extent they are sensitive to the background expansion of the universe, and to what extent to the evolution of the structures responsible for the lensing. We then focus our attention on one such statistic, lensing peaks, and assess the performance of a simple halo-based model that has been proposed to forecast their abundance. We find some shortcomings of that semi-analytic approach, and proceed to review some minimal requirements for numerical simulations used to forecast non-Gaussian statistics, to reduce their computational cost while fulfilling the accuracy and precision required by future experiments. Second, we propose a novel measurement, that of the temperature dipole induced on the cosmic microwave background induced by the rotation of ionized gas around galaxies, as an additional observation to help constrain the distribution of baryonic matter on the smallest scales probed by WL experiments. The uncertainty in this distribution is a major theoretical systematic for future surveys. Third, we show how deep neural networks can be used to map pixel-level data into the cosmological parameters of interest, by-passing the previous compression step of measuring pre-designed statistics. We provide the first (simulation-based) credible contours based on neural networks applied to weak lensing data, and discuss how to interpret these models. Table of Contents
List of Tables ...... vi
List of Figures ...... x
Acknowledgments ...... xxii
Dedication ...... xxiii
Chapter 1: Introduction ...... 1
1.1 Weak lensing cosmology ...... 1
1.1.1 Weak lensing fundamentals...... 2
1.1.2 Experimental results ...... 4
1.2 Cosmological inference based on WL measurements ...... 5
1.2.1 WL statistics ...... 7
1.3 Deep learning applied to weak lensing cosmology...... 9
1.3.1 Deep learning basics ...... 10
1.3.2 Applications of deep learning to WL...... 17
1.4 Structure of dissertation...... 18
Chapter 2: Geometry and growth contributions to cosmic shear observables ...... 21
2.1 Introduction...... 21
i 2.2 Disentangling geometry from growth in simulations...... 23
2.2.1 Simulating weak lensing maps...... 23
2.2.2 Isolating the effect of geometry vs. growth...... 26
2.3 Sensitivity to Ω< and w ...... 27
2.3.1 Power spectrum...... 28
2.3.2 Equilateral bispectrum ...... 30
2.3.3 Lensing peaks...... 32
2.3.4 Minkowski functionals...... 34
2.4 Impact on parameter inference ...... 36
2.5 Discussion...... 41
2.6 Conclusions...... 44
Chapter 3: Do dark matter halos explain lensing peaks? ...... 48
3.1 Introduction...... 48
3.2 Predicting peak counts ...... 50
3.2.1 N-body simulations...... 51
3.2.2 Camelus...... 52
3.2.3 Parameter inference...... 52
3.3 Results...... 57
3.4 Discussion...... 66
3.5 Conclusions...... 75
Chapter 4: Optimizing simulation parameters for weak lensing analyses involving non- Gaussian observables ...... 77
ii 4.1 Introduction...... 77
4.2 Methods...... 78
4.2.1 Simulating convergence maps ...... 78
4.2.2 Assessing the impact of hyper-parameters...... 81
4.2.3 Hyper-parameter configurations ...... 83
4.2.4 Observables...... 84
4.3 Results and discussion ...... 89
4.3.1 Lens plane thickness ...... 89
4.3.2 Mass resolution...... 96
4.4 Conclusions...... 101
Chapter 5: Probing gaseous galactic halos through the rotational kSZ effect ...... 105
5.1 Introduction...... 105
5.2 Modeling the rotational kSZ (rkSZ) signal from galaxies ...... 106
5.2.1 The rkSZ imprint on the CMB...... 106
5.2.2 Galactic atmospheres: electron density...... 107
5.2.3 Galactic atmosphere: kinematics...... 110
5.3 Characterizing the observed rkSZ signal...... 112
5.3.1 Aperture filter...... 114
5.3.2 Matched filter...... 115
5.4 Measurement signal-to-noise and required number of galaxies ...... 115
5.5 Stacking Planck data at the positions of MaNGA galaxies ...... 122
5.5.1 Galaxy data: MaNGA...... 122
iii 5.5.2 CMB data: Planck ...... 124
5.5.3 Stacking...... 124
5.6 Discussion...... 129
5.6.1 Model and observational uncertainties...... 129
5.6.2 Measurement uncertainties...... 132
5.6.3 Detection feasibility...... 133
5.7 Conclusions...... 136
Chapter 6: Non-Gaussian information from weak lensing data via deep learning ...... 139
6.1 Introduction...... 139
6.2 Data...... 142
6.2.1 Mock convergence maps ...... 142
6.2.2 Neural network training and architecture...... 144
6.2.3 Alternative descriptors ...... 149
6.3 Results...... 152
6.4 Discussion...... 157
6.4.1 Non-Gaussian information extracted by the neural network ...... 157
6.4.2 Effect of the smoothing scale on the results ...... 160
6.4.3 Bias in the CNN predictions ...... 163
6.5 Conclusions...... 165
Chapter 7: Interpreting deep learning models for weak lensing ...... 167
7.1 Introduction...... 167
7.2 Model and data ...... 169
iv 7.3 Network performance relative to alternative statistics ...... 170
7.4 Interpreting DNNs with saliency methods ...... 175
7.4.1 Method comparison and selection ...... 177
7.4.2 Mapping attributions back to physical space ...... 180
7.5 Discussion and conclusions...... 182
Chapter 8: Conclusions and future work ...... 184
8.1 Summary of results...... 184
8.2 Future work...... 186
References ...... 212
Appendix A: Gaussian likelihood approximation ...... 213
Appendix B: Sensitivity of results to interpolation ...... 216
Appendix C: Impact of filter misalignment and centering errors...... 218
Appendix D: Variance of aperture filter dipole measurements ...... 221
Appendix E: Selecting galaxies from MaNGA for stacking...... 223
v List of Tables
2.1 Parameters of the eight models explored around the fiducial model (Ω< = 0.26, F = −1.0). All models are spatially flat with ΩΛ = 1 − Ω< and consider a constant equation of state parameter F for DE...... 23
2.2 Δ j2 for different cosmological models computed for the power spectrum and three non-Gaussian observables (equilateral bispectrum, peak counts and Minkowski functionals) over noisy ^ maps with source galaxies at either I = 1 or I = 2. . . . . 37
2.3 Marginalized errors on Ω< and F, orientation of the Fisher ellipse (measured as the angle between its major axis and the F axis), and figure-of-merit (FOM; defined as c/, with the area of the error ellipse). The errors correspond to a 68% confidence level, scaled to a 1000 deg2 survey. All calculations were done on noisy ^ maps with source galaxies at either I = 1 or I = 2...... 39
3.1 Cosmological parameters for the fiducial model. All other cosmologies share these parameters except Ω< and f8...... 50
3.2 The main tunable parameters of CAMELUS and their values used in this study. Dark matter halos are assumed to have a Navarro-Frenk-White (NFW) density profile, defined by its inner slope (U), and its concentration parameter, the ratio between V the virial and scale radii, determined itself by 2 and V: 2 ≡ 20 " , where 0 NFW 1+I "★ I is the halo’s redshift, " its mass and "★ its pivot mass (see [170] for a detailed description of the halo density profile characterization)...... 53
3.3 Description of the thresholds used in this study to bin the convergence peak counts by their signal-to-noise (S/N) ratio, as well as the mean peak counts from data ob- tained from both the N-body and the CAMELUS models in the fiducial cosmology, in the bins used for inference...... 56
vi 3.4 Comparison of !26 2f (95.4%) credible contours. The figure-of-merit, >", is the inverse of the area of the credibility regions. Also displayed are the percentage changes in the area of the credibility region and its centroid (arithmetic mean) shift. We find looser constraints (≈ 30%) for N-body data, whose predictive power is greatly diminished when low significance peaks are excluded from the analysis. . 63
3.5 Effect of using a cosmology-dependent covariance matrix. 1f (68.3%) and 2f (95.4%) credible contours are computed using the three likelihoods described in § 3.2.3( !26, !BE6 and !E6). The analysis is done twice, using only high significance peaks (S/N > 3) and all the peaks. We report the figure of merit (FoM); defined as the inverse of the area of the credibility region), changes in the credibility regions and shifts in their centroid. Introducing a cosmology-dependent covariance into the j2 term of the Gaussian likelihood has a bigger impact than introducing it in the determinant term. Also, the effect is bigger when only high peaks are included. 64
3.6 Impact on the models’ predictive power of the lowest significance peak bin in- cluded in analysis. Figure of merit (FoM) and change in 2f contour area (Δ%) for constant, semi-varying and variable covariance likelihoods...... 70
4.1 Memory and computational time requirements for main simulation tasks for differ- ent hyper-parameter values. Each CPU time unit is a core hour (representing wall clock time if computed in series). Changes in CPU time are relative to the fiducial run (in bold)...... 84
4.2 Individual plane and snapshot size and the respective total storage requirements for both intermediate data products. In bold, values for the fiducial configuration. . . . 85
4.3 Goodness of fit for different lens plane thickness configurations, based on the re- duced j2 (i.e. j2 per degree of freedom). Configurations that yield good fits (j2 ≤ 1), implying that they are indistinguishable from the fiducial case, are high- lighted in bold. Power spectrum: values for a range of ℓ ∈ [200, 12000] and in parenthesis ℓ ∈ [200, 3532]. PDF: values for a range of ^ ∈ [−3.0, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−3.0, 3.1]. Peak counts: values for a range of ^ ∈ [−0.5, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−0.5, 4.0]. MFs: values for a range of ^ ∈ [−2.0, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−2.0, 3.0]...... 97
vii 4.4 Goodness of fit as in Table 4.3, but for different mass resolution configurations. Configurations with j2 ≤ 1, implying that they are indistinguishable from the fiducial case, are highlighted in bold. Power spectrum: values for a range of ℓ ∈ [200, 12000] and in parenthesis ℓ ∈ [200, 3532]. PDF: values for a range of ^ ∈ [−3.0, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−3.0, 3.1]. Peak counts: values for a range of ^ ∈ [−0.5, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−0.5, 4.0]. MFs: values for a range of ^ ∈ [−2.0, 5.0] in units of the shape noise r.m.s., and in parenthesis ^ ∈ [−2.0, 3.0]...... 103
5.1 Instrumental configurations considered for different existing and planned CMB experiments, defined in each case by the beam’s FWHM and the rms noise level. . . 118
5.2 The number of galaxies required for a 3f detection of a rkSZ signal. For each combination of CMB experiment and galaxy survey type, the number on the left corresponds to the aperture filter (measured at 0.1 'vir) and the one on the right to matched filtering. The combination of Planck and a SAMI-like survey, using the aperture filter and adopting the multi-phase slow-rotator model, yields a required number of galaxies exceeding those available within the stellar-mass and redshift range of the galaxy survey. This detection is therefore impossible, and is marked in italics (see § 5.6.3 for discussion)...... 122
5.3 Minimum footprint size of surveys required for a 3f detection of a rkSZ signal, in deg2. This corresponds to the required number of galaxies, shown in Table 5.2, and converted to sky coverage using the stellar mass function described in § 5.6.3. An "X" indicates that the required sky area exceeds the full sky. The number on the left corresponds to the requirement using an aperture filter (measured at 0.1 'vir) and the one on the right to the use of matched filtering...... 134
6.1 Summary of the neural network’s architecture. Convolutional layers increase the depth of the network by applying different filters (sub-layers) to the same input. The number of neurons in a layer is determined by the dimension of its output. The number of weights for a convolutional layer is given by >DC (8= × 9 + 1), where >DC is the number of feature maps that the layer outputs and 8= the number of feature maps the layer is fed with. A fully connected layer is defined by (#8= + 1) × #>DC weights, where #8= is the number of nodes in the previous layer and #>DC the number of nodes in the fully connected layer...... 147