CERN-THESIS-2020-003 // 2020

Machine Learning for Particle Identification & Deep Generative Models Towards Fast Simulations for the ALICE Transition Radiation Detector at CERN

Christiaan Gerhardus Viljoen

This thesis is submitted in partial fulfilment of the Degree of Master of Science

Supervised by: Dr Thomas Dietel

Department of Statistics || Department of Physics

Faculty of Science, University of Cape Town

Dedicated to my mother, Elizabeth Suzanna Bloem Viljoen, who has always inspired me to follow my higher passions, despite the myriad difficulties that life makes us face; and to search fearlessly and incessantly for the deeper truths underlying our everyday world.

Abstract

This Master's thesis outlines the application of machine learning techniques, predominantly deep learning techniques, to certain aspects of particle physics. Its two main aims, particle identification and high energy physics detector simulations, are pertinent to research avenues pursued by physicists working with the Transition Radiation Detector (TRD) of ALICE (A Large Ion Collider Experiment), within the Large Hadron Collider (LHC) at CERN (the European Organization for Nuclear Research).

Aim 1: Particle Identification

The first aim of this project focused on the application of machine learning techniques to particle identification; in particular, the classification of electrons (푒) versus pions (휋) produced during proton-lead (pPb) collisions in various runs from LHC16q. Various neural network architectures, hyperparameter settings, etc. were assessed by optimising an acceptance cut-off point (푡푐푢푡) in the distribution of the classifying neural network’s 푃(푒푙푒푐푡푟표푛) estimates, which minimises the amount of contamination (i.e. pion efficiency, 휀휋) whilst maintaining a high rate of electron acceptance (i.e. electron efficiency, 휀푒), specifically 휀푒 ≈ 90%.

Summary of Results for Particle Identification

Particle identification in this thesis was performed on uncalibrated TRD digits data, in contrast to previous work in this area. For this reason, the presented results are only comparable, in terms of accuracy, to some of the less useful methods that have been investigated by others. Nonetheless, a much more comprehensive search across the space of possible neural network types and architectures was done in this project, and this exploratory work brought some interesting findings to light, such as the usefulness of the Focal Loss function in mitigating the extreme class imbalances present in this dataset. The main goals of the first aim of this thesis were: (1) to show that the obtained results were comparable to previous studies, with slightly inferior performance (as expected, due to the omission of the calibration phase of data pre-processing); and (2) to get to know the dataset at hand well enough to enable the second aim of this thesis, which is discussed below.

Aim 2: High Energy Physics Detector Simulations

The second aim of this project centred on determining whether detector simulations obtained from Geant4 are as accurate as they are usually assumed to be. Additionally, for the first time, exploratory research was done into the feasibility of using latent variable/deep generative models for fast detector simulations for the ALICE Transition Radiation Detector. To this end, a wide variety of generative models were prototyped, and the results of a few choice models are presented.

Summary of Results for High Energy Physics Detector Simulations

Perhaps the most important result of this thesis is that a convolutional neural network was able to distinguish between simulated data (obtained using Geant4 as the detector simulation component) and true data obtained by the TRD during a specific LHC pPb run. An investigation is presented which delves into some of the major differences between the two types of data.

Additionally, various deep generative models were prototyped for the simulation of the TRD detector response; some of these models produced results indicating that they could be fruitful avenues to explore as part of the detector simulation component of the O2 software currently being developed for LHC Run 3.

TABLE OF CONTENTS

1 INTRODUCTION ...... 7

1.1 BACKGROUND ...... 7

1.2 AIMS ...... 7

1.3 SUMMARY OF WORK DONE & MAJOR FINDINGS ...... 8

2 HIGH ENERGY PHYSICS & CERN ...... 12

2.1 THE STANDARD MODEL OF PARTICLE PHYSICS ...... 12

2.2 THE QUARK GLUON PLASMA (QGP) ...... 16

2.3 CERN ...... 18

3 THEORY: STATISTICAL METHODS & MACHINE LEARNING ...... 35

3.1 STATISTICAL METHODS ...... 35

3.2 BACKGROUND: ARTIFICIAL INTELLIGENCE, MACHINE LEARNING & DEEP LEARNING ...... 39

3.3 MATHEMATICAL BASIS: ARTIFICIAL NEURAL NETWORKS ...... 40

3.4 CONVOLUTIONAL NEURAL NETWORKS ...... 48

3.5 RECURRENT NEURAL NETWORKS ...... 51

4 IMPLEMENTATION: MACHINE LEARNING FOR PARTICLE IDENTIFICATION...... 54

4.2 PARTICLE IDENTIFICATION RESULTS: PREVIOUS THESES ...... 59

4.3 PARTICLE IDENTIFICATION IMPLEMENTATION: THIS THESIS ...... 61

4.4 CHAPTER DISCUSSION AND CONCLUSIONS ...... 73

5 THEORY: HIGH ENERGY PHYSICS DETECTOR SIMULATIONS ...... 76

5.1 INTRODUCTION ...... 76

5.2 MONTE CARLO SIMULATIONS: GEANT4 ...... 76

5.3 DEEP GENERATIVE MODELS ...... 77

5.4 ADVERSARIAL AUTOENCODERS ...... 83

6 IMPLEMENTATION: HIGH ENERGY PHYSICS DETECTOR SIMULATIONS ...... 85

6.1 PREAMBLE: ASSESSING SIMULATION PERFORMANCE...... 85

6.2 IMPLEMENTATION: GEANT4 ...... 85

6.3 IMPLEMENTATION: VARIATIONAL AUTOENCODERS...... 88

6.4 IMPLEMENTATION: GENERATIVE ADVERSARIAL NETWORKS...... 92

6.5 IMPLEMENTATION: ADVERSARIAL AUTOENCODERS ...... 95

6.6 CHAPTER CONCLUSIONS ...... 98

7 CONCLUSIONS ...... 99

7.1 MACHINE LEARNING FOR PARTICLE IDENTIFICATION ...... 99

7.2 HIGH ENERGY PHYSICS DETECTOR SIMULATIONS ...... 100

7.3 GENERAL CONCLUDING REMARKS ...... 101

8 BIBLIOGRAPHY ...... 103

9 APPENDICES ...... 109

APPENDIX I: RUN NUMBERS ANALYZED (FROM LHC16Q) ...... 110

APPENDIX II: DEEP LEARNING ARCHITECTURE DIAGRAMS ...... 111

APPENDIX III: LESS SUCCESSFUL GENERATIVE MODELS ...... 117

APPENDIX IV: EXAMPLE PYTHON DICTIONARY FOR A SINGLE TRACK ...... 123

APPENDIX V: DATA EXTRACTION FROM THE WORLDWIDE LHC COMPUTING GRID (WLCG) ...... 128

ACKNOWLEDGEMENTS ...... 131

Chapter 1: Introduction

1 INTRODUCTION

1.1 Background

Particle physics is a field of study which investigates the fundamental properties of our Universe at the subatomic scale. Modern advancements at the European Organization for Nuclear Research (CERN) have resulted in an unprecedented amount of data being generated during particle collisions at the Large Hadron Collider (LHC), and the field of machine learning has finally entered an era in which enough compute power is cheaply available to deal with such data volumes. Various tools and techniques from the discipline of machine learning (many of which are open source) can therefore now be practically employed on problems in the area of particle physics.

1.2 Aims

This Master's project centres on two main aims:

Aim 1: Particle Identification (푒 vs 휋): an extensive set of neural networks (fully connected, 1D and 2D convolutional, and recurrent (LSTM)), as well as two tree-based methods (random forests and gradient boosting machines), were trained and assessed to determine their ability to discriminate between electrons (푒) and pions (휋) produced during proton-lead (pPb) collisions conducted at the LHC in 2016, based on raw (uncalibrated) TRD digits data. Particle identification performance was defined by the ability of each model to minimise pion efficiency (휀휋, the false positive rate) whilst maintaining an electron efficiency (휀푒, the true positive rate) of 휀푒 = 90%.

A lower bound for the critical region (푡푐푢푡) in the likelihood-ratio distribution of 푃(푒푙푒푐) predictions, made by a Bayesian combination of probability estimates for up to six independent “tracklet” measurements of a single particle track, was defined such that 휀푒 ≈ 90%, in order to determine the 휀휋 for that model. The best set of results obtained, per momentum bin, was as follows: 휀휋 = 1.2% in the 푝 ≤ 2 GeV/푐 range; 휀휋 = 1.14% in the 2 GeV/푐 < 푝 ≤ 3 GeV/푐 range; and 휀휋 = 1.51% in the 3 GeV/푐 < 푝 ≤ 4 GeV/푐 range. These results are compared against previous work done in this area. Using the focal loss function (a parameterised modification of the more traditionally used binary cross-entropy loss, which down-weights the loss magnitude for well-classified examples) to inform gradient descent via backpropagation was a crucial step: it allowed the full dataset (which suffers from extreme class imbalances) to be used without downsampling to prevent the neural network from becoming biased towards predicting the dominant class. Momentum binning and incremental training per momentum bin perhaps also had some influence on the most

successful model achieving a much lower pion efficiency than other models built. An analysis of other distinguishing factors that could determine the relative success of the models developed for this specific use case is presented. Note that the main motivation for performing particle identification in this project was to gain hands-on experience in optimising various types of deep learning models and to develop a better understanding of TRD data; this led into the second (more important) deep generative/latent variable modelling phase of this project.

Aim 2: High Energy Physics Detector Simulations: Geant4, a Monte Carlo toolkit used to simulate particle interactions with matter (used in conjunction with the AliROOT physics analysis software developed and used by the ALICE collaboration at CERN), was assessed in terms of how closely the simulated data it produces resembles true data taken by the TRD during collision events, by training a convolutional neural network to classify whether a specific image comes from the “real” or the “Geant4-simulated” data distribution and analysing the results. Distinguishing Geant4 data from real data was a trivial task compared to particle identification. This is one of the most important results reported in this thesis, given the wide use of, and general trust placed in, the quality of Geant4 simulations. If nothing else, it indicates that the various parameters used to set up a Geant4 simulation can be fine-tuned in future work, in order to minimise the differences between simulations and true data.

Furthermore, as a step towards fast simulation, various deep generative modelling strategies were employed to produce simulated data samples that have high likelihood under the observed (true) TRD data distribution. To this end, the following classes of latent variable models were prototyped: Generative Adversarial Networks (GANs), Variational Autoencoders (VAEs) and Adversarial Autoencoders (AAEs).

Data produced during these deep generative simulations were then compared to real data, via the same methodology used to compare Geant4-simulated data to real data. In order to assess the feasibility of incorporating these types of models into future high energy physics event simulation software, an investigation was made into how realistic their simulated signals are, and a brief exploration of the latent space of one of the more successful VAEs was conducted, in order to illustrate the potential interpretability of that latent space. While generative models are exceptionally hard to train, some very promising results (especially for VAEs and AAEs) indicate that they would indeed be worthwhile to pursue as part of more formal and extended future research; particularly if the latent space of a specific model can be understood well enough to make the deep generative simulation process flexibly manipulable: ideally, to the extent that an exact particle type, travelling at a specific momentum, during a certain LHC run, subject to certain environmental parameters, etc., can be specified in advance of running a deep generative detector simulation. This level of control is currently possible with Geant4, so the feasibility of using deep generative models for detector simulations will depend on their customisability, interpretability and controllability (as well as, more obviously, their accuracy and speed).

1.3 Summary of Work Done & Major Findings

1.3.1 Particle Identification

The input feature set for particle identification was, for each particle, a set of up to six 2D arrays (푋 = (푥푖푗) ∈ ℕ^(17×24)) of ADC data produced by interactions of the particle within the six layers of the Transition Radiation Detector (TRD), which forms part of the ALICE detector at the LHC at CERN.
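Because a track can have fewer than six tracklets, the per-track input has to be brought to a fixed shape before being fed to a neural network. The sketch below illustrates one plausible scheme (zero-padding missing layers into a (6, 17, 24) tensor); the exact padding and ordering convention used in the thesis is not specified in this chapter, so treat this as an illustrative assumption:

```python
import numpy as np

def stack_track(tracklets, n_layers=6, shape=(17, 24)):
    """Stack a track's per-layer ADC arrays into a fixed (6, 17, 24) tensor,
    zero-padding the layers for which no tracklet was recorded (assumed scheme)."""
    x = np.zeros((n_layers,) + shape, dtype=np.float32)
    for i, t in enumerate(tracklets[:n_layers]):
        x[i] = np.asarray(t, dtype=np.float32)
    return x
```

A flattened version of this tensor would serve the fully connected models, while the 2D convolutional models can consume the per-layer arrays directly.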

A large variety of deep learning classifiers were built towards arriving at a test statistic 푡(푥) with sufficient statistical power to discriminate between signals originating from 푒 and signals originating from 휋, and subsequently optimizing a cut-off point 푡푐푢푡 defining a critical region in the calculated 푃(푒푙푒푐) distribution, which minimizes pion efficiency (the false positive rate, 휀휋) at a significance level 훼 = 0.1 (i.e. where the true positive rate, or electron efficiency, is 휀푒 = 90%). 푃(푒푙푒푐) is calculated by combining the likelihoods of six tracklets according to a Bayesian approach and the Neyman–Pearson Lemma.
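The combination and cut-optimisation steps can be sketched numerically as follows. This is an illustrative numpy reconstruction, not the thesis code: it assumes equal priors in the per-tracklet Bayesian combination and chooses 푡푐푢푡 as the empirical quantile that accepts 90% of true electrons.

```python
import numpy as np

def combine_tracklets(p):
    """Bayesian combination of per-tracklet P(electron) estimates, assuming
    equal priors: P(e|track) = prod(p_i) / (prod(p_i) + prod(1 - p_i)).
    p: array of shape (n_tracks, n_tracklets)."""
    prod_e = np.prod(p, axis=1)
    prod_pi = np.prod(1.0 - p, axis=1)
    return prod_e / (prod_e + prod_pi)

def pion_efficiency_at(t_elec, t_pion, electron_eff=0.90):
    """Choose t_cut as the lower bound of the critical region that accepts
    `electron_eff` of true electrons, then report the pion efficiency
    (the fraction of pions whose combined score falls above the cut)."""
    t_cut = np.quantile(t_elec, 1.0 - electron_eff)
    eps_pi = float(np.mean(t_pion >= t_cut))
    return t_cut, eps_pi
```

The 휀휋 values quoted per momentum bin above correspond to `eps_pi` evaluated at 휀푒 ≈ 90% for the best model in each bin.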

Some of these models were simply fully connected feedforward neural networks trained on flat, vectorised versions of the abovementioned input arrays; others were trained on the input arrays summarised in various ways; still other neural networks either made use of convolutional layers (both 2D and 1D convolutions) or recurrent layers of LSTM cells to varying extents, and had access to the raw data 푋 = (푥푖푗) ∈ ℕ^(17×24).

As a sanity check, two non-deep learning methods, Gradient Boosting Machines and Random Forests, were also tested for their usefulness as particle classifiers in this context.

Section 4.3 summarises the methodology followed, as well as results obtained, for each of the various models introduced above, with a slightly more comprehensive focus on results obtained by 2D Convolutional Neural Networks, which outperformed all other models at this task (perhaps, in part, because many more of these models were built, compared to other model types).

In order to compare results to previous work done on particle identification based on TRD data, pion efficiency performance was ultimately evaluated over specific ranges of particle momenta; in summary, the lowest pion efficiencies obtained per momentum bin, at an electron efficiency of 휀푒 ≈ 90%, were as follows:

• 휀휋 = 1.2% in the 푝 ≤ 2 GeV/푐 range

• 휀휋 = 1.14% in the 2 GeV/푐 < 푝 ≤ 3 GeV/푐 range; and

• 휀휋 = 1.51% in the 3 GeV/푐 < 푝 ≤ 4 GeV/푐 range

These specific results were obtained using an incrementally trained convolutional neural network, using Focal Loss as the objective function to be optimized, as described in Section 4.3.1.10.
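The focal loss referred to here down-weights well-classified examples relative to binary cross-entropy. A minimal numpy version of the binary focal loss (following the standard Lin et al. formulation; the 훾 and 훼 values below are common defaults, not necessarily those used in this thesis):

```python
import numpy as np

def binary_focal_loss(y_true, y_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    """FL(p_t) = -alpha_t * (1 - p_t)**gamma * log(p_t), averaged over samples.
    Setting gamma=0, alpha=0.5 recovers 0.5 * binary cross-entropy."""
    y_pred = np.clip(y_pred, eps, 1.0 - eps)
    p_t = np.where(y_true == 1, y_pred, 1.0 - y_pred)       # prob of true class
    alpha_t = np.where(y_true == 1, alpha, 1.0 - alpha)     # class weighting
    return float(np.mean(-alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)))
```

Because (1 − p_t)^훾 is close to zero for confidently correct predictions, the abundant, easy majority-class examples contribute little to the gradient, which is what makes training on the full imbalanced dataset feasible.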

1.3.2 High Energy Physics Detector Simulations

1.3.2.1 Geant4

As a general-purpose Monte Carlo toolkit for simulating the passage of particles through matter, Geant4 is also widely used for particle physics detector simulations. Since Geant4 has a well-defined interface with the ROOT data analysis framework used by particle physicists at CERN, it is often used to simulate expected measurables relevant to high energy physics research, such as detector response functions.

In this project, Geant4 was configured to simulate pion tracklet TRD signals. The simulation was configured to load the environmental and other variable settings for a specific proton-lead (pPb) run (2016/LHC16q/000265343) conducted at the LHC in 2016. The accuracy of the resulting simulated data was subsequently assessed by using a convolutional neural network to discriminate between pion tracklets simulated by Geant4 and actual pion tracklets measured by the TRD during 2016/LHC16q/000265343.

The task of distinguishing Geant4 simulations from true data was trivial compared to the task of particle identification. These results are presented in Section 6.2.2.
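The thesis uses a convolutional neural network for this discrimination; the underlying idea is a classifier two-sample test, which can be illustrated with a much simpler model. The sketch below (plain logistic regression on feature vectors, not the thesis's CNN) returns an AUC: a value near 0.5 means the simulated and real samples are statistically indistinguishable, while a value near 1.0 means, as reported here for Geant4, that separating them is easy.

```python
import numpy as np

def classifier_two_sample_auc(real, sim, lr=0.1, epochs=200, seed=0):
    """Train a logistic regression to distinguish real (label 1) from simulated
    (label 0) samples, then return the AUC of its scores (Mann-Whitney rank form)."""
    rng = np.random.default_rng(seed)
    X = np.vstack([real, sim])
    y = np.concatenate([np.ones(len(real)), np.zeros(len(sim))])
    idx = rng.permutation(len(X))
    X, y = X[idx], y[idx]
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):                      # plain gradient descent
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        g = p - y
        w -= lr * X.T @ g / len(X)
        b -= lr * g.mean()
    s = X @ w + b                                # decision scores
    order = np.argsort(s)
    ranks = np.empty(len(s))
    ranks[order] = np.arange(1, len(s) + 1)
    n1, n0 = y.sum(), (1 - y).sum()
    return (ranks[y == 1].sum() - n1 * (n1 + 1) / 2) / (n1 * n0)
```

The same diagnostic is what allows the deep generative models below to be compared against Geant4 on an equal footing.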

1.3.2.2 Deep Generative Modelling

The practical use of deep generative algorithms for HEP detector simulations is currently an active field of research at CERN (see, for example, [1], [2], [3]). In keeping with the aims of this research avenue, three kinds of deep generative/latent variable models were prototyped in this project towards the task of simulating raw TRD data: Variational Autoencoders, Generative Adversarial Networks and Adversarial Autoencoders.
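Of these three model classes, the VAE is the easiest to summarise mathematically: it maximises an evidence lower bound consisting of a reconstruction term plus a KL divergence between the encoder's Gaussian posterior and a standard normal prior, with gradients flowing through the sampling step via the reparameterisation trick. A numpy sketch of these two ingredients (illustrative only; the thesis models are full neural networks):

```python
import numpy as np

def gaussian_kl(mu, logvar):
    """KL( N(mu, sigma^2) || N(0, I) ), summed over latent dimensions:
    0.5 * sum( sigma^2 + mu^2 - 1 - log sigma^2 )."""
    return 0.5 * np.sum(np.exp(logvar) + mu**2 - 1.0 - logvar, axis=-1)

def reparameterize(mu, logvar, rng):
    """z = mu + sigma * eps with eps ~ N(0, I): sampling remains
    differentiable with respect to the encoder outputs mu and logvar."""
    eps = rng.standard_normal(np.shape(mu))
    return mu + np.exp(0.5 * logvar) * eps
```

After training, new detector signals are generated by sampling z from the prior and passing it through the decoder alone, which is what makes such models candidates for fast simulation.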

Christiaan Gerhardus Viljoen - January 2020 9

Machine Learning for Particle Identification and Deep Generative Models towards Fast Simulations for the ALICE Transition Radiation Detector at CERN

Each type of Latent Variable Model was assessed using the same classification strategy outlined above for Geant4 data: each simulation strategy, Geant4 and the Deep Generative Models alike, was run through the same neural network architecture, trained to distinguish that type of simulation from true TRD raw digits data.

In summary, Variational and Adversarial Autoencoders performed particularly well, but the practical use of any of these deep generative techniques will be contingent on factors such as the customisability of the simulations and how well they can be made to integrate with existing simulation software such as Geant4 and/or the ROOT framework.

1.3.3 The Structure and Organisation of this Thesis

Chapter 2 introduces the field of High Energy Physics with reference to the Standard Model of Particle Physics (SM). The fundamental particles and forces are discussed, followed by a tabular delineation of the major distinguishing features between electrons and pions.

The Quark Gluon Plasma (QGP), a primordial state of deconfined matter understood to have prevailed for a short period after the Big Bang, and which can be reproduced at the LHC on a minuscule scale, is then introduced, along with a brief look at the current understanding of the origins of our universe.

Then, a brief introduction to CERN, the ALICE collaboration, the TRD detector, as well as specific ways in which particles interact with matter (which allows for their detection) is presented. The way in which this data is recorded, processed and collected, as well as some mention of its calibration, is also discussed. This leads into current methods in use at ALICE for 푒 vs. 휋 discrimination as well as an assessment of their accuracy. Finally, Chapter 2 ends with a short overview of a subset of the various software platforms currently being utilised at CERN for HEP research, namely ROOT, AliROOT and Geant4.

Chapter 3 introduces various basic statistical concepts and theorems, which leads into an explanation of the specific statistical tests used for particle identification; this section, in turn, leads into an overview of the field of Deep Learning, starting with the concept of a single neuron and building up to the concepts of Fully Connected Feedforward-, Convolutional- and Recurrent- Neural Networks, which were all employed towards particle identification research in this project.

Chapter 4 is essentially a combined methods and results section. It first outlines the procurement and structure of the dataset used for particle identification and, after giving a brief overview of results from previous theses submitted on this topic, delves into the various stages of modelling undertaken: from extremely simple benchmark models, through an exploration of a large number of different neural network architectures and hyperparameter settings, to the implementation of the convolutional neural network which produced the most successful particle identification results.

Chapter 5 discusses the theory needed to understand the methods used for particle detector simulations, with reference to the currently used Monte Carlo-based simulation framework (Geant4), as well as Deep Generative Modelling strategies which could potentially be employed in this area by accurately modelling the underlying detector data distribution.

Chapter 6 is the methods and results section for the implementation of the concepts discussed in Chapter 5. Both Geant4 simulations and Deep Generative Model “simulations” are discussed in terms of how they were implemented and how closely they resemble true data, assessed by running each type of simulated data through the same convolutional neural network architecture, independently trained to discriminate it from true TRD raw digits data.

Chapter 7 summarises and pulls together the various threads of work done for this Master's project and gives some suggestions for future work.


Lastly, the Bibliography is followed by the Appendices, which contain additional material that did not find its way into the main text, and by the Acknowledgements section.


2 HIGH ENERGY PHYSICS & CERN

2.1 The Standard Model of Particle Physics

2.1.1 Introduction

The Standard Model of Particle Physics is a framework which allows us to understand the fundamental structure and dynamics of our universe in terms of elementary particles, where all interactions between elementary particles are likewise facilitated by an exchange of particles. In summary, based on our current understanding, once we delve into the subatomic realm our entire universe consists of a very small set of fundamental particles [4].

The low energy manifestation of Quantum Electrodynamics (QED, the quantum field theory of the electromagnetic force) describes atoms as bound states with negatively charged electrons (푒−) orbiting in quantized shells around a positively charged nucleus consisting of positively charged protons (푝) and electrically neutral neutrons (푛), constrained by the electrostatic attraction of these opposing electrical charges [4].

Quantum Chromodynamics (QCD) is the fundamental theory of the strong interaction, which binds protons and neutrons together within the nucleus of the atom. The weak force causes nuclear 훽-decays of radioactive isotopes and is involved in the nuclear fusion processes that occur within stars; the nearly massless electron neutrino (휈푒) is produced during both of the abovementioned processes [4].

Almost all physical phenomena that occur under normal circumstances can be explained by the Electromagnetic-, Strong- and Weak Forces, Gravity (which is very weak but explains the large-scale structure of the universe), and just four particles: the electron, proton, neutron and electron neutrino [4].

2.1.2 The Fundamental Particles

At higher energy scales, such as those obtained in experiments conducted using the LHC (where collision energies of ~13 TeV have been achieved), protons and neutrons are understood to be bound states of truly fundamental particles called quarks, in the following manner: protons consist of two up-quarks and a down-quark, p(uud), whereas neutrons consist of two down-quarks and an up-quark, n(ddu) [4].

Table 1 summarises the set of currently known ordinary matter particles (fermions) that are considered to be elementary, since each of these particles has no known substructure. The Standard Model classifies the fundamental fermions into three families or generations. Each generation has an up-type quark (with charge +2⁄3) and a down-type quark (with charge −1⁄3), as well as a charged lepton (with charge −1) and a neutral lepton (no charge).


The Dirac equation describes the state of each of the twelve fundamental fermions and indicates that for each fermion there is an antiparticle which has the same mass but opposite charge, which is indicated by a horizontal bar over the particle’s symbol, or a charge symbol of the opposite sign, e.g. the anti-down quark is indicated by d̅ , whereas the antimuon is indicated by 휇+ [4].

Additionally, quarks and the strong force gauge bosons, gluons, which will be discussed in Section 2.1.3, carry red, green or blue colour charge (a misnomer unrelated to the true meaning of “colour”, used only by analogy with the primary colours which combine to form white). Colour-neutral hadrons can be formed by combining three quarks (with colour charges red, green and blue) or a quark and an antiquark (with colour charges red-antired, blue-antiblue or green-antigreen) [4].

There has been no evidence of generations beyond these three, and so, according to current understanding, all ordinary matter in the universe seems to be circumscribed by the twelve fundamental fermions in Table 1. Note that ordinary matter only accounts for ~4.6% of the energy density of the Universe; the rest of the Universe’s energy density is accounted for by dark matter (~24%) and dark energy (~71.4%) [4].

Table 1: The twelve fundamental fermions.

† While it is accepted that neutrinos are not massless, their masses are so small that they have not been precisely determined; however, the upper bounds for the estimated neutrino masses are around six orders of magnitude smaller than the masses of the other fermions [4].

Interactions between particles are facilitated by the four fundamental forces, although the effect of gravity at this scale is sufficiently negligible that it can be ignored without loss of accuracy. All particles take part in weak interactions and are therefore subject to the weak force. The neutrinos are all electrically neutral and are therefore not involved in electromagnetic interactions: they are, so to speak, invisible to this force. Quarks carry what QCD terms “colour charge” and are therefore the only particles that feel the strong force [4].

The strong force confines quarks to bound states within hadrons; quarks are therefore not freely observed under normal circumstances [4].


2.1.3 The Fundamental Forces

Historically, Newton held that matter could interact with other matter without the mediation of direct contact and, similarly, classical electromagnetism explained the electrostatic interaction between particles using fields [4].

Quantum Field Theory circumvents these non-material explanations and encompasses the description of each of the fundamental forces: electromagnetism is explained by Quantum Electrodynamics (QED), the strong force by Quantum Chromodynamics (QCD) and the weak force by Electroweak Theory (EWT). Gravity is not described by the Standard Model; Einstein’s General Theory of Relativity therefore remains the best explanation of this force.

The search to incorporate gravity, the Standard Model and other unexplained physical phenomena, such as the nature of dark matter and dark energy, into a Grand Unified Theory is an ongoing area of research and has given rise to exciting new theoretical avenues such as string theory and loop quantum gravity; these research streams are collectively known as physics Beyond the Standard Model (BSM) [4].

Looking at electromagnetism, the interaction between charged particles occurs via the exchange of massless virtual photons, which explains momentum transfer via a particle exchange and circumvents the issue of a non-physical potential as the medium of interaction [4].

Similarly, there are virtual particles (gauge bosons) for both the strong force (eight types of gluons, which are massless and confined within bound states, along with quarks) and the weak force (the 푊+ and 푊− bosons, which are around 80 times heavier than the proton, and the 푍 boson, which facilitates the weak neutral-current interaction). The gauge bosons all have spin 1, whereas the fermions all have spin ½ [4].

2.1.4 The Higgs Boson

The Higgs Boson, whose existence was confirmed by the CMS [5] and ATLAS [6] collaborations at CERN in 2012 but proposed in 1964 in three separate theoretical papers ([7], [8], [9]), breaks rank with the other particles outlined by the Standard Model in that it is a scalar particle which endows the other Standard Model particles with mass, a property without which all particles would constantly move at the speed of light, 푐 [4]. On their own, all particles are massless; by interacting with the Higgs Field, which is always non-zero, the Higgs mechanism gives them their distinguishing masses [4].

2.1.5 Other Subatomic Particles: Baryons and Mesons

An in-depth explanation of all the possible subatomic particles that can be formed by combining the fundamental particles outlined in Section 2.1.2 lies outside the scope of this thesis, but it is warranted to mention them briefly, since one of the subatomic particles studied in this thesis, the pion 휋, which manifests in two charged forms (휋+ and 휋−) and one neutral form (휋0), falls in this class.

As mentioned, the nature of the QCD interaction is such that quarks cannot be observed as free particles. Instead, they are found in bound states called hadrons, along with the strong force gauge bosons, gluons. There are only three known hadronic states: baryons, consisting of three quarks (푞푞푞); antibaryons, consisting of three antiquarks (푞̅푞̅푞̅); and mesons, consisting of a quark and an antiquark (푞푞̅).

Figure 1 gives an overview of the complexity of the known mesons, baryons and anti-baryons; the three charge varieties of the pion can be seen in the meson spin-0 nonet (b).


Figure 1: Mesons: (a) spin-1 nonet [10], (b) spin-0 nonet [11]; and Baryons: (c) spin-3/2 uds decuplet [12], (d) spin-1/2 uds octet [13]

This quick overview allows us to outline the distinguishing features of the two subatomic particles studied in this thesis, in Table 2.

Table 2: Physical Characteristics of electrons and pions

Particle          Electron (풆)                Pion (흅)
Symbol            푒−                          휋+, 휋−, 휋0
Antiparticle      푒+                          휋+ ↔ 휋−; 휋0: self
Mass              0.5109989461 MeV [14]       휋±: 139.57061 MeV [14]; 휋0: 134.9766 MeV [15]
Spin              ½                           0
Electric Charge   −1푒                         휋+: +1푒; 휋−: −1푒; 휋0: 0푒
Substructure      Elementary particle         휋+: 푢푑̅; 휋−: 푑푢̅; 휋0: 푢푢̅ and 푑푑̅
                  (no known substructure)


2.2 The Quark Gluon Plasma (QGP)

2.2.1 Introduction to QGP

The currently held view of the early universe, predicted by the Standard Model and supported by over three decades of High Energy Physics experiments and lattice QCD simulations, is that, for up to a microsecond after the Big Bang, the universe was composed of a deconfined state of matter known as the Quark Gluon Plasma (QGP) [16].

Statistical mechanics describes matter as a system in thermal equilibrium. Global observables, such as net charge, temperature and energy density define the average properties of such a system. As these global observables take on different values, radically different average properties can be held by the system, manifesting as different phases of matter. Matter can change phase by traversing across various phase boundaries where certain phase transitions occur [17], see Figure 2 for an illustration of this process.

Figure 2: Simplified diagram of classical states of matter and transitions between them, with the Vacuum added as a fifth element, providing the space in which matter exists, reproduced and modified from [17].

If nucleons (protons and neutrons) were truly fundamental, i.e. if they were not bound states of smaller composite elements (quarks and gluons), a density limit of matter would be reached, when compressing it under ever-higher pressure conditions. If, however, nucleons were truly composite states, increasing density would eventually cause their boundaries to overlap and nuclear matter would transition from a stable state of colour-neutral three-quark or quark-antiquark hadronic matter to a state of deconfinement, consisting mainly of unbound quarks [17].

Hadrons all have the same characteristic radius of around 1 fm; it has been found experimentally that increasing density (through compression or heating), can result in the formation of clusters where there are more quarks within such a hadronic volume than logical partitioning into colour neutral hadrons allows for, thus leading to colour-deconfinement [17].

In Figure 3 (a), a simplified phase diagram of hadronic matter is depicted. Within the hadronic phase, there is a baryonic density/temperature boundary where transitions between mesons (colour-neutral quark-antiquark systems) and nucleons (colour-neutral three-quark systems) occur. The existence of diquarks as localised bound states within the QGP medium allows for yet another state of matter, the colour superconductor, discussion of which is outside of the scope of this dissertation. The phase boundary across which matter transitions from hadronic matter to the QGP is shown in orange. Figure 3 (b) gives a slightly more visual representation of


the nuclear phase transition and Figure 3 (c) is a simplified diagrammatic overview of the current understanding of the evolution of the Universe. The QGP is understood to have been abundant in the early universe up to the quark-hadron transition shown at 1 μs.


Figure 3: (a) Phase diagram of hadronic matter [18], (b) A simplified phase diagram of the nuclear phase transition along the temperature axis at low baryochemical potential (μB) [19], (c) The evolution of the Universe, from the Big Bang to Modern Day [20]

2.2.2 QGP, the Big Bang and the Micro Bang

It is estimated that, during the Planck epoch, which lasted until t < 10⁻⁴³ s after the Big Bang, the prevailing temperature was T > 10¹⁹ GeV, a temperature so high that the principles of general relativity do not apply, and which cannot be understood with present-day physical theory [21].


Shortly after the Planck epoch, following a short exponential inflation phase, quarks and gluons propagated freely in an early deconfined space-time QGP expansion phase of the Universe, down to a temperature of T ≃ 150 MeV, a phenomenon thought to be caused by a change in the vacuum properties of the extremely hot early Universe [16].

To understand how matter was formed in the early Universe, heavy-ion collisions, such as the lead-lead (PbPb) and proton-lead (pPb) collisions performed at ALICE, are used to create a minuscule space-time domain of QGP (which one can refer to as a ‘micro bang’), in which local quark-gluon deconfinement occurs. The subsequent hadronization process, during which various hadrons are formed and leave traces in the ALICE detector material, gives physicists an indication of how matter arose as the early Universe rapidly cooled down [16].

Since the QGP cannot be detected directly, it is studied via its decay products. Accurately distinguishing between electrons and pions produced in the abovementioned high-energy physics experiments conducted at ALICE is an important step in this process and as such is the motivation for the particle identification phase of this master’s project.

2.3 CERN

At the end of 1951, a resolution was agreed upon to establish a European Council for Nuclear Research (CERN: Conseil Européen pour la Recherche Nucléaire) at an intergovernmental UNESCO meeting in Paris. The final draft of the CERN Convention was signed by twelve nations in 1953 [22].

Today, CERN (now the European Organization for Nuclear Research) is a truly international organization, with 23 member states (some of which are non-European), who contribute to operating costs and are involved in major decision making; a few countries with associate member status or observer status; and non-member countries with co-operation agreements, including South Africa [23].

CERN’s research mandate revolves around finding answers to fundamental questions about the structure and evolution of our universe, as well as its origins; it aims to achieve these goals by providing access to its particle accelerator facilities and compute resources to international researchers, who perform research that advances the forefront of human knowledge, for the benefit of humanity as a whole. As such, CERN is politically neutral and advocates for evidence-based reasoning, knowledge transfer from fundamental research to industry and the development of future generations of scientists and engineers [23].

2.3.1 The Large Hadron Collider

The LHC, located under the Franco-Swiss border (see Figure 4), boasts an intricate system of particle accelerators and detectors. The LHC is currently the largest and most powerful particle accelerator in the world [23], with a circumference of ~27 km and a centre-of-mass energy of E_CM = 13 TeV [24].

Located 50-175 m underground, the LHC is the final step in a chain of successive accelerators feeding beams of accelerated particles from one accelerator into the other at increasing energies, as can be seen in Figure 4 (d).

The LHC’s proton source is a bottle of compressed hydrogen, which releases its contents into a Duoplasmatron device; this device subjects the H2 molecules to an electric field, separating the gas into its constituent protons and electrons [25].

A linear accelerator (LinAc2) injects these protons into a booster ring (the PS Booster) at an energy of 50 MeV, where proton beams are accelerated up to 1.4 GeV before being injected into the Proton Synchrotron (PS), which accelerates them up to 25 GeV. The Super Proton Synchrotron (SPS) is the final intermediate step before proton beams enter the LHC: here, proton beams reach an energy of 450 GeV


before they begin their 20-minute acceleration around the LHC, reaching an ultimate energy of 6.5 TeV each [26].


Figure 4: CERN’s LHC facilities (a) in geographical context [27], (b) as a 3D diagram illustrating the depth of the underground system [28], (c) a diagram showing the LHC’s proton injection points and the location of the main experiments’ collision points [29], (d) The CERN accelerator complex [30].

An entirely different protocol is employed to generate the lead ions used in the heavy-ion collisions (pPb, PbPb) studied at ALICE. A highly pure lead (Pb) sample is heated up to a temperature of 800 °C and the resulting Pb vapour is ionized by an electron current, which manages to strip a maximum of 29 electrons from a single Pb atom. Those atoms with higher resulting charge are preferentially selected and accelerated through a carbon foil, which strips most ions to Pb54+. These ions are accelerated through the Low Energy Ion Ring (LEIR) and subsequently through the PS and SPS, where they are passed through a second foil, which strips off the remaining electrons and


passes the fully ionized Pb82+ ions to the LHC, where beams of Pb ions are accelerated up to 2.56 TeV per nucleon; because a single lead ion contains many nucleons, the collision energies reached in PbPb collisions reach a maximum of 1150 TeV [31].

In order to achieve these high collision energies, a precise system of 1232 dipole magnets is required to keep particles in their circular orbits, with 392 quadrupole magnets employed to focus the two collision beams. The dipole magnets use niobium-titanium (NbTi) cables at a temperature of 1.9 K (−271.3 °C). At this temperature, the superconducting cables have zero resistance; this allows the magnetic field to reach the 8.3 T required to bend the beams around the circular LHC ring [24].

The beams themselves are contained within a beam pipe emptier than outer space (P_vac = 10⁻¹³ atm) and are accelerated by electromagnetic resonators and accelerating cavities to 99.9999991% of the speed of light, which means that a beam completes around 11,000 revolutions of the 26.659 km LHC ring per second, resulting in an average bunch crossing frequency of 30 MHz and around a billion collisions per second [24].
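As a quick arithmetic check, the revolution frequency quoted above follows directly from the ring circumference and the beam speed. The short sketch below is purely illustrative, using only the figures given in this section:

```python
# Illustrative check of the beam figures quoted above (numbers from the text).
C = 299_792_458.0              # speed of light, m/s
RING_CIRCUMFERENCE_M = 26_659.0
BETA = 0.999999991             # beam speed as a fraction of c

revs_per_second = BETA * C / RING_CIRCUMFERENCE_M   # ~11,200 revolutions/s
lorentz_gamma = 1.0 / (1.0 - BETA**2) ** 0.5        # Lorentz factor for this beta

print(round(revs_per_second), round(lorentz_gamma))
```

The result is consistent with the "around 11,000 revolutions/second" quoted above, and the Lorentz factor of a few thousand is in line with a multi-TeV proton beam.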

2.3.2 The CERN Experiments Collisions at the LHC result in a multitude of particles being produced. Observing the produced particles from different perspectives produces evidence relevant to different research streams; as such, there are several collaborations at CERN, each of which uses detectors with differing attributes to study specific areas within the broad area of fundamental subatomic Physics.

ATLAS (A Toroidal LHC ApparatuS) and CMS (Compact Muon Solenoid) investigate a very broad range of particle physics [32], [33]. Their independent design specifications allow any new discovery at one of these detectors, such as the discovery of the Higgs boson in 2012, to be corroborated by the other. Other research avenues pursued at these experiments include the search for additional dimensions as well as the constituent elements of dark matter. The ATLAS detector is the largest particle detector ever built, weighing 7000 tonnes with dimensions 46 m × 25 m × 25 m [32].

ALICE (A Large Ion Collider Experiment) and LHCb (LHC beauty) are the other two main experiments at CERN and are tasked with the discovery of specific physical phenomena [34]. ALICE focuses on the extreme energy densities present during heavy-ion collisions, which lead to the production of the QGP [35]. LHCb investigates subtle distinguishing nuances in the matter-antimatter dichotomy, as evidenced by attributes of the beauty quark [36]. Several other smaller experiments are also hosted at the LHC.

2.3.3 The ALICE Detector & the Transition Radiation Detector

2.3.3.1 The ALICE Detector System

Colliding heavy ions, such as the PbPb collisions conducted at the LHC and studied at ALICE, offers the most ideal experimental conditions currently achievable for the reproduction of the primordial QGP matter [37]. A transition from ordinary matter to a state of deconfinement occurs at a critical temperature Tc ≈ 2 × 10¹² K, which is around 100,000 times hotter than the core temperature of our sun [37].

The QGP cannot be probed directly, but is studied via decay particles produced during the hadronization process that occurs as the QGP cools down and quarks and gluons recombine in various ways; the ordinary-matter particles produced in this process interact with various detector elements and leave traces in the detector material that are generally recorded via electronic signals [37].

A simplified diagram of the ALICE detector system is illustrated in Figure 5. The detector weighs 10,000 tonnes and has spatial dimensions 26 m × 16 m × 16 m [35].



Figure 5: (a) A schematic cross-section of the ALICE detector, with the TRD shown in yellow and its 18 sectors in azimuthal angle numbered, as well as the location of other ALICE detector components: ITS, TPC, HMPID, EmCal, TOF and PHOS [38], (b) cross-section of the ALICE detector shown in context of the surrounding magnet structure [39].

A uniform magnetic field is applied over the detector, to allow particles to propagate in curved paths through the detector geometry, with the extent of curvature of the particle’s track through the detector being inversely correlated to the particle’s momentum; additionally, the sign of charge of a particle can also be deduced from its track curvature [37].

The ALICE detector has a total of 18 stacked subdetectors involved in specific particle tracking tasks. These are broadly divided into: Tracking Systems, situated closest to the collision area, which make use of digital track-reconstruction of particle-detector interaction traces to indicate the path of a particle; Electromagnetic Calorimeters, through which particle cascades are generated as particles enter and are absorbed by the calorimetric material, with the magnitude of a particle’s energy deposition acting as the signal in these subdetectors; and the Muon System in the outermost layer (since muons interact very weakly with matter and therefore generally travel much further through the detector system) [37].

High momentum resolution is obtained in all the detector elements over the high multiplicity densities (number of particles produced per unit volume) present in heavy-ion collisions [40]. In addition to heavy-ion collisions, lighter ion- as well as proton-nucleus and proton-proton collisions are also performed at ALICE, and this entire momentum range can be accurately measured by the ALICE detector [40].

2.3.3.1.1 The Transition Radiation Detector

The electron identification and triggering capabilities provided by the ALICE TRD enable the in-depth study of physical phenomena such as jets, the semi-leptonic decay of heavy-flavour hadrons and the di-electron mass spectra of heavy quarkonia; in turn, these phenomena act as probes to study the Quark Gluon Plasma [41].

More specifically, at particle momenta above 1 GeV/c, the pion rejection strategy for electron identification employed by the Time Projection Chamber (TPC) is no longer sufficient. The TRD’s main goal is to expand the range of the ALICE Collaboration’s physics objectives by providing accurate electron identification capabilities at these high momenta, supplementing its own data with data obtained from the Inner Tracking System (ITS) and TPC, as well as operating event triggers that determine whether data from a specific collision should be kept, based on measurements such as collision centrality, amongst others. As an added benefit, the TRD informs the ALICE central barrel’s calibration, and the data it produces is used extensively during track reconstruction and particle identification [42].


2.3.3.1.1.1 TRD Design Synopsis

Pseudorapidity coverage in the TRD is similar to the other detector elements in the central barrel, i.e. |η| ≤ 0.9. The space between the Time of Flight (TOF) and TPC detectors is filled by the 6 layers of the TRD, which are subdivided in azimuthal angle into 18 sectors, with an additional segmentation into 5 stacks occurring along the z-axis (parallel to the beamline). In sectors 13-15 of the TRD, the chambers in the middle stack were omitted from installation, in order to reduce the amount of material in front of the PHOS detector. So, in total, there are (18 × 5 × 6) − (3 × 6) = 522 individual detector elements in the TRD [42], at a radial distance of 2.9–3.7 m from the beam axis [41]. The TRD is shown in the context of the full ALICE detector in Figure 5 (highlighted in yellow), illustrating its 18 subdivisions in the φ-direction, as well as the six-layer architecture of each of these sectors. Figure 6 gives additional detail about the TRD’s design: the hierarchical multi-component structure of the TRD detector is shown in Figure 6 (c); the 5 subdivisions of a TRD supermodule along the z-axis, as well as its six stacked layers, are shown in Figure 6 (a); and the slightly angled design of a supermodule cut in the φ-direction, resulting in slightly wider top layers, is shown in Figure 6 (b).
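The chamber count above can be verified with one line of arithmetic:

```python
# TRD chamber count: 18 sectors x 5 stacks x 6 layers, minus the middle
# stacks (6 layers each) omitted in the 3 sectors in front of PHOS.
sectors, stacks, layers = 18, 5, 6
missing_chambers = 3 * layers
n_chambers = sectors * stacks * layers - missing_chambers
print(n_chambers)  # 522
```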

Each individual detector element consists of the following broad components: 1) a radiator (4.8 cm thick) and 3 cm drift region, 2) a 0.7 cm multiwire proportional readout chamber and 3) front-end electronics to convert from an amplified particle energy-deposition signal to a digital signal, which is eventually stored if deemed interesting by the multi-tiered TRD trigger system [42].

2.3.3.1.1.2 TRD Measurement Mechanism

Interactions of Particles with Matter

In order to study subatomic particles, they need to be detected. Most particles produced during High Energy Physics experiments are unstable and therefore decay within a specific characteristic mean lifetime τ. Those particles with τ > 10⁻¹⁰ s will traverse several meters before decaying and are therefore directly detectable by particle detectors. Particles with shorter lifespans are usually detected indirectly, by the interaction of their decay products with detector material [4].

The Bethe-Bloch Curve

The Bethe-Bloch equation describes the energy lost by a charged particle moving at relativistic speed through a medium, as a result of electromagnetic interactions with atomic electrons. A single charged particle with velocity v = βc, passing through a medium with atomic number Z and density n, will lose energy as a result of ionisation of the medium, as a function of the distance travelled in the medium, according to the Bethe-Bloch formula (Equation 1) [4]:

dE/dx ≈ −4πℏ²c²α² (nZ / (m_e v²)) { ln[ 2β²γ²c²m_e / I_e ] − β² }

Equation 1

In Equation 1, I_e is the effective ionisation potential of the medium. The 1/v² term explains the high energy loss for low-energy particles. For high-energy particles, where v ≈ c, dE/dx depends logarithmically on (βγ)², which is defined by Equation 2. This explains the relativistic rise seen in Figure 7, which illustrates the characteristic energy loss curves for various subatomic particles as measured by the TPC at √s = 7 TeV, including the two subatomic particles studied in this project, the pion π and the electron e.

βγ = (v/c) / √(1 − (v/c)²) = p / (mc)

Equation 2
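The qualitative shape of Equation 1 (the 1/β² fall-off at low momenta followed by the relativistic rise) can be sketched numerically. In the illustrative function below, all physical constants are folded into an arbitrary prefactor K, and the ionisation-potential ratio is an assumed round number, so only the shape of the curve is meaningful, not its absolute scale:

```python
import math

def bethe_bloch_shape(beta_gamma, K=1.0, i_ratio=1e-4):
    """Relative dE/dx as a function of beta*gamma (shape of Equation 1 only).

    K folds together the constant prefactor; i_ratio stands in for
    I_e / (2 m_e c^2). Both are illustrative assumptions, not measured values.
    """
    beta2 = beta_gamma**2 / (1.0 + beta_gamma**2)   # beta^2 from beta*gamma
    gamma2 = 1.0 + beta_gamma**2                    # gamma^2 from beta*gamma
    # 1/beta^2 drives the low-energy rise; the log term drives the
    # relativistic rise at high beta*gamma.
    return (K / beta2) * (math.log(beta2 * gamma2 / i_ratio) - beta2)
```

Evaluating this function shows a minimum-ionising region around βγ ≈ 3–4, with the energy loss rising on both sides, mirroring the measured curves in Figure 7.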



Figure 6: (a) Schematic cross-section (longitudinal view) of a TRD supermodule [38], (b) TRD supermodule cut in the φ-direction [38] and (c) the hierarchical substructure of the TRD, showing how each of the 18 TRD supermodules is composed of 5 stacks of 6 read-out chambers (ROCs), with multi-chip modules (MCMs) that are involved in digitizing and collecting data from multi-wire proportional chambers (MWPCs) [43]

Figure 7: Bethe-Bloch curve for various subatomic particles as measured by the ALICE TPC at √s = 7 TeV [44]


Transition Radiation

Transition radiation is emitted by a charged particle as it traverses the boundary between two media with different optical properties. No significant energy loss occurs in this process, but the resultant radiation is an important aid in detecting charged particles in HEP experiments [45].

For relativistic particles, the photons emitted in this process extend into the X-ray domain and are highly forward-peaked relative to the direction of the particle’s motion; transition radiation yield is increased by stacking multiple radiative boundaries in gas detectors, such as the Transition Radiation Detector (TRD) at ALICE, and placing high atomic number (high-Z) gases within subsequent chambers to absorb the emitted X-ray photons [38].

The drift time of gas particles within the MWPC provides fine-grained positional information about where the particle passed through the radiator. The detected signal takes the form of charged gas molecules, ionized via interaction with charged particles or transition radiation photons. The ionization electrons drift towards the pad plane and into an amplification region, where their accelerated movement towards the positively charged anode wires causes an avalanche, providing gas amplification in the range of 10⁴; this process is shown in Figure 8. The positive ions produced during the avalanche process move toward the surrounding electrodes and induce a positive charge on the pad plane, and it is this signal which is converted by Analog-to-Digital Converters in the multi-chip modules into the TRD digits data, which was studied during this project.

Determining precisely the location of the avalanche in azimuthal angle requires that the induced charge be shared by several readout pads on the pad plane, as shown in Figure 8. The track’s angle in the rφ-plane is determined by measuring the azimuthal position in each of 15 time bins. Ideally, charge should be shared by two or three adjacent pads: a poorer signal-to-noise ratio results if more than three pads fire, which also increases the amount of data produced and limits the separation of individual tracks. A particle’s momentum is determined by the particle track’s deflection angle; because the drift velocity in Xe is known very precisely, the r-coordinate can be determined from the arrival time.
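The arrival-time-to-position mapping described above can be sketched in a few lines. The drift velocity and time-bin width used here are assumed round numbers for illustration, not calibrated TRD values (calibration of the actual drift velocity is discussed in Section 2.3.3.1.1.4):

```python
# Assumed illustrative values -- not calibrated TRD parameters.
DRIFT_VELOCITY_CM_PER_US = 1.5   # assumed drift velocity in the Xe-based gas
TIME_BIN_WIDTH_US = 0.1          # assumed sampling interval per time bin

def depth_from_time_bin(time_bin, time_offset_bins=0):
    """Radial depth (cm) in the drift region inferred from signal arrival time.

    Later time bins correspond to ionization produced further from the
    anode wires, i.e. deeper into the drift region towards the radiator.
    """
    t_us = (time_bin - time_offset_bins) * TIME_BIN_WIDTH_US
    return DRIFT_VELOCITY_CM_PER_US * t_us
```

For example, with these assumed values a signal arriving in time bin 10 maps to a depth of 1.5 cm, i.e. halfway through a 3 cm drift region.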

One of the main aims of this thesis is distinguishing electrons from pions based on this data. This is facilitated by the fact that electrons and pions have different characteristic energy loss curves (particularly at low momenta, electrons have a higher relative energy loss); as well as the fact that electrons emit transition radiation and pions don’t.

2.3.3.1.1.3 TRD Front-End Electronics

In order to convert the analog signal discussed in Section 2.3.3.1.1.2 to a digital signal which can be stored and analysed, a complex system of on-detector front-end electronics (FEE), situated on multi-chip modules (MCMs, shown in the context of the TRD geometry in Figure 6 (c)), assists the integrated ALICE triggering system via tracklet search and electron candidate identification (the TRD trigger generates a level-1 accept (L1A), which has to occur on a timescale of 6 μs) [42]; the FEE also identifies the TR signal, while providing tracking, momentum and mass reconstruction capability [42].

Figure 9 gives an overview of the logical components of a single channel of the TRD’s 1.156 × 10⁶ channel FEE.

The major building blocks of the FEE are:

1. A charge-sensitive PreAmplifier/ShAper (PASA): The signals from the detector pads are amplified by a charge-sensitive preamplifier; this is followed by a pole-zero cancellation circuit, which ensures a more symmetrical output, and two second-order shaper-filters, which ensure a shaped output pulse. Finally, an output


amplifier delivers a 10-bit differential 1 V range output signal to the ADC, depending on the ADC’s driving capabilities and output levels. A total of 18 PASA circuits are located on a single analog chip.

2. A 10 MHz Analog-to-Digital Converter (ADC): converts the analog input it receives from the PASA into a digital signal, which is passed to the tracklet preprocessor.

3. The digital circuitry required to process and store data for subsequent readout: The Tracklet Preprocessor (TPP) processes data during drift time at digitisation rate; this prepares data to be sent to the Tracklet Processor, a micro CPU operating at 120 MHz, which processes data from all time bins to determine candidate tracklets. Finally, the Global Tracking Unit (GTU), which is not part of the FEE but part of the readout electronics, receives data from the FEE for triggering and readout purposes.


Figure 8: (a) A schematic representation of the components in an MWPC module [46], (b) Local tracking in one Multi-Chip Module (MCM, discussed in 2.3.3.1.1.3), which processes the ADC data from 21 channels and fits a straight line through the clusters that it finds (asterisks), overlaid on the raw ADC digits data generated [47].

It should also be noted that the following data filtering steps occur as part of a digital filter chain implemented on the TRD FEE: the pedestal of the signal is equilibrated, local gain variations are corrected for by a gain filter, and ion tails are suppressed by a tail cancellation filter. Additional calibration of TRD data is discussed in the next section.


Figure 9: Diagrammatic representation of the logical components of the TRD front-end electronics [42].

2.3.3.1.1.4 TRD Data Calibration

There are four basic parameters involved in the calibration of TRD data, shown in Figure 10 with reference to the chamber cross-section and the average pulse height plot for pions (the Lorentz angle, a fifth parameter, is not shown in the figure):

The basic calibration parameters are defined as follows:

Time offset: The peak in the average pulse height plot at 0.5 μs (as shown in Figure 10) corresponds to charges from both sides of the anode wires. The position of this anode peak provides the time offset parameter, which is influenced by the distance from the anode wires and other delays, such as the time delay between when a collision occurs and when the trigger signal reaches the TRD.

Drift velocity: At 2.8 μs there is an edge representing the entrance window. Drift velocity is inversely proportional to the difference in time between the entrance window edge and the anode peak.

Gain: Gain is proportional to the integral of the pulse-height over time (at higher drift velocity there would be shorter, but higher pulses, at the same gain)

Noise: Pad noise can be estimated from the standard deviation of fluctuations around 0, cf. the bottom left of the average pulse height diagram in Figure 10, i.e. before the initial peak

Lorentz angle: (Not seen in Figure 10.) The presence of a magnetic field B perpendicular to the electric field E (which attracts ionization electrons to the anode wires), i.e. |E × B| > 0, leads to a Lorentz angle of around 9°, which needs to be determined in order to reconstruct tracklets.


Figure 10: Main TRD signal calibration parameters, indicated as annotations on a plot of the average pulse height as a function of drift time for pions, in addition, the chamber cross-section is shown at the top for better understanding [38]

Routine calibration performed for the TRD is summarised in Table 3. To achieve the highest possible resolution, the time offset, chamber status, gain, Lorentz angle and drift velocity are calibrated offline for each run.

Table 3: Calibration parameters, along with their associated data sources and methods for implementation


Figure 11: Pulse height spectrum before the Kr-based calibration, after one and after two iterations (calibrations performed in consecutive years) for one TRD read-out chamber [38]

Figure 12: Pulse height spectrum accumulated for one pad during the Kr-calibration run. The smooth solid line represents the fit from which the gain is extracted [38]

Figure 13: Relative pad gains for one chamber calibrated with electrons from Kr decays [38]


2.3.3.1.1.5 Particle Identification in the TRD

At momenta p > 1 GeV/c, the TRD provides electron identification via the measurement of transition radiation. At these momenta, the pion rejection achieved in the TPC via specific energy loss, as per the characteristic Bethe-Bloch dE/dx curves for pions vs. electrons, becomes less accurate.

The temporal information contained in the drift time dimension of the TRD signal provides information about the depth in the drift volume where ionization signals were produced; this allows the contribution of the particle-specific ionization energy loss (dE/dx) to the signal to be separated from the contribution made by transition radiation photons and is, therefore, an important factor in distinguishing between electrons and pions [38]. The electron identification capability is also used to trigger at level 1 [41].

Figure 14 shows the time evolution of the TRD signal at p = 2 GeV/c, for both electrons and pions, by plotting the average pulse height for each particle type over time. The initial peak seen in earlier time bins on the graph originates from the amplification region of the detector, and the plateau that follows is caused by particles moving through the 3 cm drift region in the detector.

Also evident from Figure 14 is that, in this momentum region, the average pulse height of electrons is much higher than that for pions, because electrons have a higher characteristic energy loss (dE/dx) in this region. On average, one transition radiation photon in the X-ray domain will be emitted by an electron travelling at a highly relativistic speed (above γ ≈ 800), since it will cross many dielectric boundaries in the radiator portion of a detector element. The absorption of this type of photon is evidenced by an increasing average energy deposition at later times in Figure 14, since it is absorbed preferentially close to the radiator, adding its signal to the ionization energy of the track [41].

Figure 14: Average pulse height as a function of drift time for electrons and pions (both at p = 1 GeV/c) [42].

It should be emphasized that Figure 14 shows the average pulse height over time. In truth, there are large fluctuations around this average, as can be seen in Figure 35.

2.3.4 Methods used in Particle Identification

Currently, the following methods are employed in production for particle identification (specifically, distinguishing between electrons, e, and pions, π) based on TRD data:

1. One-, two-, three- and seven-dimensional likelihood estimations
2. Neural Networks


3. The truncated mean of the signal (this is a specialised method which is used to optimise the identification of particles other than e, i.e. kaons (K), pions (π), protons (p) and muons (μ), and does not perform well at distinguishing e from π)

2.3.4.1 Likelihood Methods

The concepts of Likelihood and Maximum Likelihood Estimation are discussed in Section 3.1.3.

2.3.4.1.1 One-dimensional Likelihood (LQ1D)

One-dimensional likelihood estimation is performed based on the total integrated charge left by a particle in a single chamber of the TRD (i.e. a single tracklet). Figure 15 shows that electrons on average deposit a higher charge, both because they experience a higher characteristic energy loss in this momentum range and because they emit transition radiation, whereas pions do not.

Figure 15: Total integrated charge, normalised to tracklet length, measured in a single read-out chamber for electrons and pions in pPb collisions at √sNN ≈ 5.02 TeV. Test beam measurements were scaled by a common factor to compensate for gain differences [38].

The reference distributions allow maximum likelihood estimations to be carried out for each particle traversing the TRD, i.e. the likelihood of it being a muon, pion, kaon or electron. Pions are rejected using momentum-dependent cuts on the electron likelihood, taking into account an electron efficiency score calculated using clean pion and electron reference samples, which are obtained by keeping tracks originating from the following V0 decays: γ → e⁺e⁻ and K⁰s → π⁺π⁻ [41]. V0 candidates are reconstructed using a secondary vertex finder algorithm [48]. More information about obtaining clean reference data for particle identification can be found in [49].

2.3.4.1.2 Two-, three- and seven-dimensional Likelihood (LQ2D, LQ3D, LQ7D)

Two-, three- and seven-dimensional likelihood methods take the temporal evolution of the signal (Figure 14) into account by splitting the signal into two, three or seven time-bins respectively, summing the charge in each bin and calculating the likelihood based on pure pion and electron samples from collision data. An assumption is made that the signals from different slices are statistically independent.
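The slicing and charge-summing step can be sketched as follows (the per-time-bin ADC values and the helper function name are invented for illustration; real signals come from the TRD read-out):

```python
# Hypothetical sketch of the time-bin charge summing behind LQ2D/LQ3D/LQ7D:
# a time-binned signal is split into n contiguous slices and the charge in
# each slice is summed, giving the inputs to the likelihood estimation.

def sum_charge_in_slices(signal, n_slices):
    """Split a time-binned signal into n_slices contiguous slices; sum each."""
    edges = [round(i * len(signal) / n_slices) for i in range(n_slices + 1)]
    return [sum(signal[edges[i]:edges[i + 1]]) for i in range(n_slices)]

signal = [2, 5, 9, 14, 11, 8, 6, 5, 4, 7, 9, 6, 4, 3]  # made-up ADC counts per time-bin

print(sum_charge_in_slices(signal, 2))  # LQ2D-style: two summed charges
print(sum_charge_in_slices(signal, 7))  # LQ7D-style: seven summed charges
```

No charge is lost in the slicing: the summed slices always total the integrated charge of the signal.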


Chapter 2: High Energy Physics & CERN

2.3.4.2 Neural Networks

A comprehensive overview of the mathematics behind artificial neural networks is given in Section 3.3.

The neural network currently used in production for particle identification by the TRD working group at ALICE was trained using an approach similar to LQ7D, i.e. splitting the TRD signal into seven time-bins and summing the charge over each bin.

Since particle identification using neural networks is a major aim of this thesis, it is worth mentioning previous theses that focused on similar aims. Pertinent results from these theses ( [49], [50], [51]) are summarised in Section 4.2.

2.3.4.3 Truncated Mean

The truncated mean method informs particle identification based on the expected truncated 푑퐸/푑푥 value per particle species (this technique is mainly used for the identification of hadrons, such as pions, kaons and protons, and is not generally used for distinguishing electrons from pions). Calculating the truncated mean of the observed 푑퐸/푑푥 distribution involves cutting a specified percentage off the higher end of the empirically observed 푑퐸/푑푥 distribution.


Figure 16: (a) Truncated mean signal as a function of momentum for p-Pb collisions at √s_NN = 5.02 TeV. The solid lines represent the expected signals for various particle species [38]. (b) The most probable values of TRD signals as a function of 훽훾, clearly showing the distinguishing transition radiation signal from electrons at higher 훽훾, which is mostly lost during the truncation procedure for the Truncated Mean method [52].

Observed 푑퐸/푑푥 is influenced by Landau-distributed ionization fluctuations, Gaussian-distributed detector-resolution fluctuations, fluctuations in gas gain and other effects. Since the distinguishing transition radiation signal produced by electrons will generally be lost during the truncation procedure, this method is less accurate for distinguishing electrons from pions than other methods.

2.3.5 Particle Identification Accuracy

To calculate the accuracy of the abovementioned methods, clean reference samples were used. The separating power of these approaches is often expressed as pion efficiency (the fraction of pions incorrectly classified as electrons, i.e. the false positive rate or fallout rate) at a specific electron efficiency (the fraction of electrons correctly identified, i.e. the true positive rate or sensitivity) [41].

It is important to note that pion suppression (the inverse of pion efficiency) is hampered when a particle passes through fewer than the available six layers of the TRD (Figure 17 (c)), and that electron efficiency is sometimes sacrificed during analysis to obtain a more pure sample (Figure 17 (b)) [41].



Figure 17: (a) Particle identification performance of the TRD, based on various methods discussed. (b) Pion efficiency as a function of electron efficiency. (c) Pion efficiency as a function of the number of tracklets obtained [38].

Figure 17 (a) shows how pion efficiency depends on momentum for the six methods under discussion; data are plotted for samples where an electron efficiency of 90% was obtained. LQ1D and LQ2D are not accurate at very low momenta, but their performance is good at slightly higher momenta, where the emission of transition radiation commences. Their separating power decreases again at higher momenta, as transition radiation production saturates and pions deposit more energy, making the two species harder to tell apart. The truncated mean method performs poorly, especially at high momenta, since transition radiation, with its attendant high charge deposition, is more likely to be removed during the truncation procedure [41].

It is also clear from this plot that the misidentification of pions as electrons (False Positive Rate) is reduced substantially by the LQ7D and Neural Network techniques, compared to truncated mean-, LQ1D-, LQ2D- and LQ3D- methods, and that the temporal evolution of the signal (exploited at higher granularity for the more accurate analysis methods) is, therefore, a highly informative feature for particle identification [41].


2.3.5.1 ROOT

ROOT is an object-oriented data analysis platform developed in C++ for High Energy Physics implementations; in addition to its data analysis capabilities, ROOT is also used to transform the petabytes of raw data from collision events at the LHC into more compact and useful representations [53].

The basic ROOT framework provides default classes for most common use-cases and as the HEP community pushes research into new frontiers, they can use the object-oriented programming (OOP) approach followed by ROOT to make use of sub-classing and inheritance to extend existing classes. Similarly, the concept of encapsulation keeps the number of global variables to a minimum and increases the opportunity for structural reuse of code [53].

ROOT libraries are designed with minimal dependencies and as such are loaded as needed. At runtime, libCore.so (the core library) is always invoked; it is composed of the base-, container-, metadata-, OS specification- and ROOT file compression classes. Additionally, the interactive C++ interpreter library libCling.so is used by all ROOT 6 applications; it features a command line prompt with just-in-time interactive compilation to facilitate rapid application development and testing.

When building executables, libraries containing the needed classes are linked to. Extensive documentation is available online at the ROOT reference guides for ROOT 5 [54], the version of ROOT developed and used for LHC run 1 and run 2; and ROOT 6 [55], the version of ROOT developed for LHC run 3, scheduled to start in 2021 after the second long shut down period (LS2).

2.3.5.1.1 AliROOT

It is common for each experiment at CERN to build software specific to its needs on top of the base ROOT architecture; as such, AliROOT and AliPhysics are built on top of ROOT to provide functionality specific to the ALICE collaboration.

C++ classes define all the code in ROOT, AliPhysics and AliROOT and enable the user to create variables (data) and functions (methods) specific to each class, as its members. A class’s variables are usually accessed via the class’s methods [56].

C++ code is split into header (.h) and implementation (.cxx) files, both having the same name as the class being defined. Header files list all the constants, functions and methods contained in a class. Implementation files use a class’s methods to set and get variables’ values in that class.

The concept of inheritance is frequently utilized to prevent unnecessary repetition of code. Child classes inherit common behaviours and attributes from base/ parent classes and define additional methods and variables that are not common to other classes deriving from the base class.

2.3.5.1.2 푂2 Software for Run 3

LHC run 3, scheduled to start in 2020, will require some upgrades to the ALICE detector to accommodate the much higher interaction rate that is being planned for, in order to more precisely measure attributes of heavy-flavour hadrons, low-mass di-leptons and low-momentum quarkonia. Since these physics probes have a very low signal-to-background ratio, a continuous readout process could result in upwards of 1 TB/s of data being generated by the ALICE detector. This will result in unique challenges, which will need to be met by an upgraded software framework for run 3 and run 4, known as O2 (the Online-Offline Software Framework), which is currently being developed.

2.3.5.2 Geant4

Geant4 is a C++ toolkit for simulating how particles traverse matter. Comprehensive and accurate simulation of particle detectors, using platforms like Geant4, is extremely important since it provides a theoretical reference against which data can be compared. Should there be any statistically significant discrepancies between simulations and data, it could indicate that phenomena


occurred which are not explicable by the Standard Model of Particle Physics and could in rare circumstances lead to the discovery of new fundamental principles of nature [57].

A slightly more in-depth discussion of Geant4 can be found in Section 5.2.1.



3 THEORY: STATISTICAL METHODS & MACHINE LEARNING

3.1 Statistical Methods

3.1.1 Marginal-, Joint- and Conditional Probabilities

Marginal probability denotes the probability of an event occurring, without taking the outcomes of other events into account; the probability that event 퐴 occurs is written as 푃(퐴) and is simply calculated as the number of times 퐴 has occurred divided by the total number of events that have occurred. Joint probability refers to the probability of two or more events occurring simultaneously; the joint probability of events 퐴 and 퐵 occurring is written as 푃(퐴 ∩ 퐵). Conditional probability expresses the probability of an event occurring given that another event is known to have occurred; the probability of 퐴 occurring, given that 퐵 occurs, is expressed as 푃(퐴|퐵). This is usually done when we expect or know that the outcome of 퐵 will have some influence on the outcome of 퐴. The conditional probability 푃(퐴|퐵) can be used to calculate the joint probability 푃(퐴 ∩ 퐵), as follows:

푃(퐴 ∩ 퐵) = 푃(퐴|퐵) × 푃(퐵)

Equation 3

[58].

3.1.2 Bayes’ Theorem

Bayes’ theorem allows one to calculate the conditional probability 푃(퐴|퐵) when the joint probability 푃(퐴 ∩ 퐵) is hard to calculate and the reverse conditional probability 푃(퐵|퐴) is known or easier to calculate.

푃(퐴|퐵) = 푃(퐵|퐴)푃(퐴) / 푃(퐵)

Equation 4

Here, 푃(퐴) is referred to as the prior probability, 푃(퐵|퐴) as the likelihood (explained in slightly more detail in the next section, 3.1.3), 푃(퐵) as the evidence and 푃(퐴|퐵) as the posterior probability; i.e. we can adjust our estimate for 푃(퐴), based on other evidence at our disposal.

[58].
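To make Equation 4 concrete, here is a small numerical sketch in Python (the prior and the flagging rates are invented purely for illustration and do not come from ALICE data):

```python
# Numerical illustration of Bayes' theorem (all numbers are hypothetical).
# Suppose 10% of tracks are electrons, a selector flags electrons with
# probability 0.9, and wrongly flags pions with probability 0.05.

p_e = 0.1               # prior P(A) = P(electron)
p_flag_given_e = 0.9    # P(B|A)
p_flag_given_pi = 0.05  # flag rate for the background class

# Evidence P(B), via the law of total probability:
p_flag = p_flag_given_e * p_e + p_flag_given_pi * (1 - p_e)

# Posterior P(A|B) = P(B|A) P(A) / P(B):
p_e_given_flag = p_flag_given_e * p_e / p_flag
print(round(p_e_given_flag, 3))  # 0.667
```

Even with a 90% efficient selector, the posterior is only about two thirds here, because the background class dominates the prior.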

3.1.3 Likelihood and Maximum Likelihood Estimation

Given a set of observations 푋 = (푋1, … , 푋푛) of random variables sampled from one of a family of distributions 푃휃, let 푓(x|휃), x = (푥1, … , 푥푛), denote the density function of the data when 휃 is true. The likelihood function (Equation 5) is this density, formed from the joint probability of the sample of observed data, but understood as a function of the parameter values 휃 given the observed data distribution (cf. Section 3.1.2, where the corresponding term in Bayes' theorem is the likelihood 푃(퐵|퐴)).

퓛(휃|푥) = 푃(푥|휃), 휃 ∈ Θ

Equation 5

Christiaan Gerhardus Viljoen - January 2020 35

Machine Learning for Particle Identification and Deep Generative Models towards Fast Simulations for the ALICE Transition Radiation Detector at CERN

Maximum likelihood estimation (Equation 6) is a principle which allows one to estimate 휃̂, as the most likely set of parameters given the observed dataset, for a probability distribution parameterised by 휃. This is achieved by maximising the likelihood function; presuming that a unique global maximum exists:

휃̂ = arg max_{휃} 퓛(휃|x)

Equation 6

[58].
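As an illustrative sketch of maximum likelihood estimation (the data are a made-up Bernoulli sample; the grid search is only there to confirm the closed-form result):

```python
# Maximum likelihood estimation for a Bernoulli parameter (illustrative).
# The log-likelihood sum_i [x_i log(t) + (1 - x_i) log(1 - t)] is maximised
# analytically at theta_hat = sample mean; a grid search confirms this.
import math

data = [1, 0, 1, 1, 0, 1, 1, 1]  # hypothetical 0/1 observations

def log_likelihood(theta, xs):
    return sum(x * math.log(theta) + (1 - x) * math.log(1 - theta) for x in xs)

grid = [i / 1000 for i in range(1, 1000)]  # candidate theta values in (0, 1)
theta_hat = max(grid, key=lambda t: log_likelihood(t, data))

print(theta_hat, sum(data) / len(data))  # both 0.75
```

Working with the log-likelihood rather than the likelihood itself is standard practice: the logarithm is monotonic, so it preserves the location of the maximum while avoiding numerical underflow of the product.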

3.1.4 Hypotheses

Statistical tests are mathematical constructs designed to enable a researcher to make a measurable statement concerning the extent to which observed data agree with probabilistic predictions made about them in the form of a hypothesis [59]. When performing a statistical test, a null hypothesis, denoted 퐻0, is put forth, as well as one or more alternative hypotheses (퐻1, 퐻2, …). Given a dataset of n measurements of a random variable 푥 = 푥1, … , 푥푛, a set of hypotheses 퐻0, 퐻1, … is proposed, each specifying a joint probability density function (p.d.f.), i.e. 푓(푥|퐻0), 푓(푥|퐻1), …

In order to assess how well the observed data agree with any given hypothesis, a test statistic 푡(푥), which is a function of the observed data, is constructed. In this thesis, we treat the output of a classifying convolutional neural network (combined using a Bayesian approach for up to 6 tracklets pertaining to a single track, as explained in Section 3.1.7) as the test statistic 푡(푥). A specific p.d.f. for the test statistic, 푡, is implied by each of the hypotheses, i.e. 푔(푡|퐻0), 푔(푡|퐻1), …

While the test statistic can be a multidimensional vector 푡 = 푡1, 푡2, … , 푡푚 (in principle, even the original vector of observed data points 푥 = 푥1, 푥2, … , 푥푛 can be used), constructing a test statistic of lower dimension (where 푚 < 푛) reduces the amount of data being assessed, and should not lose discriminative power if 푡(푥) is well-constructed. Finding such a test statistic is the major motivation behind Machine Learning (discussed in Section 3.2).

If a scalar function 푡(푥) is used as the test statistic, a p.d.f. 푔(푡|퐻0) is given which 푡 will conform to when 퐻0 is true, similarly, 푡 will conform to a different p.d.f. 푔(푡|퐻1) when 퐻1 is true. Figure 18 illustrates how setting a threshold value for the test statistic, i.e. 푡푐푢푡, results in rejection of the null hypothesis when 푡 > 푡푐푢푡.

The support for various hypotheses under the observed data distribution is framed in terms of acceptance or rejection of the null hypothesis by defining a critical region for the test statistic, beyond which the null hypothesis is rejected; i.e. when the observed value of 푡 lies within the critical region, we reject 퐻0. Conversely, when 푡 lies within the complement of the critical region, it is said to be within the acceptance region, which will result in the researcher accepting 퐻0.


Figure 18: An illustration of rejection or acceptance of the null hypothesis, under the assumed distributions of 퐻0 and 퐻1, when 푡 falls in the critical region 푡 > 푡푐푢푡

3.1.5 Errors of the First and Second Kind

In general, there is a chance that one of two errors can be made when performing statistical tests:

Type I Error: Rejecting a True 퐻0

In practice, 퐻0 would not be rejected when 푡 < 푡푐푢푡, but there is a probability 훼 of rejecting 퐻0 when 퐻0 is, in fact, true (called a false positive). The significance level 훼 defined as such is given by:

훼 = ∫_{푡푐푢푡}^{∞} 푔(푡|퐻0) 푑푡

Equation 7

In other words, the critical region for rejection of the null hypothesis is defined by a cut-off point, such that the probability of 푡 being observed there is given by 훼. In contrast, 1 − 훼 gives the probability of accepting 퐻0 when 퐻0 is actually true (called a true negative; this is also known as the specificity of the test).

In the example shown in Figure 18, a critical region is defined by a value: 푡푐푢푡, which defines the lower decision boundary for rejecting the null hypothesis.

Type II Error: Accepting a False 퐻0

There is also a probability 훽 of accepting 퐻0 when 퐻1 was actually true (a false negative). 훽 is given by:

훽 = ∫_{−∞}^{푡푐푢푡} 푔(푡|퐻1) 푑푡

Equation 8

1 − 훽 is called the power of the statistical test to discriminate against 퐻1.
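Equations 7 and 8 can be evaluated directly for two hypothetical Gaussian test-statistic distributions (the means, widths and 푡푐푢푡 below are invented for illustration):

```python
# Type I and type II error rates (Equations 7 and 8) for two hypothetical
# Gaussian test-statistic p.d.f.s, g(t|H0) ~ N(0, 1) and g(t|H1) ~ N(3, 1),
# with the critical region t > t_cut.
import math

def normal_cdf(x, mu=0.0, sigma=1.0):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

t_cut = 1.5

alpha = 1.0 - normal_cdf(t_cut, mu=0.0)  # Equation 7: P(t > t_cut | H0)
beta = normal_cdf(t_cut, mu=3.0)         # Equation 8: P(t < t_cut | H1)
power = 1.0 - beta                       # power of the test against H1

print(round(alpha, 4), round(beta, 4), round(power, 4))
```

Moving 푡푐푢푡 to the right lowers 훼 at the cost of raising 훽; the choice of cut always trades one error type against the other.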

These concepts are summarised in Table 4.


Table 4: Summary of Statistical Errors

              퐻0 true                              퐻1 true
Accept 퐻0    True negative (1 − 훼)                Type II error / false negative (훽)
Reject 퐻0    Type I error / false positive (훼)    True positive (1 − 훽)

3.1.6 Likelihood Ratio Tests & The Neyman-Pearson Lemma

The acceptance of a null hypothesis can be framed slightly differently, as a parameter 휃 lying within a specified subset Θ0 of the parameter space Θ of a given statistical model. The alternative hypothesis is then that 휃 lies in the complement of Θ0, i.e. Θ0ᶜ.

We can then define the likelihood ratio test (a method to assess how two statistical tests compare in terms of their respective goodness of fit to a set of observations), as follows:

Λ(푥) ≔ ℒ(휃0|푥) / ℒ(휃1|푥)

Equation 9

Where ℒ(휃|푥) is the likelihood function. In this case, 퐻0 is rejected at a significance level of 훼 = 푃(Λ(푥) ≤ 푡푐푢푡|퐻0).

Given these concepts, the Neyman-Pearson Lemma states that the likelihood ratio test based on Λ(푥), as defined above, is the most powerful statistical test at significance level 훼 when both hypotheses are simple.

3.1.7 Statistical Tests for Particle Selection

In the case of the electron-pion particle identification dealt with in this dissertation, we consider the class “electron” (푒) as signal and “pion” (휋) as background. As such, we define 퐻0 = 푒, 퐻1 = 휋, and by extension we treat the output of the final unit in the neural network as a test statistic in its own right, distributed according to the p.d.f. 푔(푡|퐻0) when the particle is an electron or 푔(푡|퐻1) when it is a pion. In order to accept or reject 퐻0, we define a critical region via 푡푐푢푡. When 푡 ≥ 푡푐푢푡, we classify the particle as an electron.

In order to give more power to the test statistic (cf. Section 3.1.6), we can combine the probabilities (obtained from each of up to 6 detector layers of the TRD) of each of up to 6 tracklets pertaining to a single track being an electron, using the Bayesian approach outlined in the formula below, to calculate the probability for the full track:

푃(푒푙푒푐) = ∏_{푗=1}^{6} 푃푗(푒푙푒푐) / ∑_{푘∈{푒,휋}} ∏_{푗=1}^{6} 푃푗(푘)

Equation 10

Here, 푃푗(푒푙푒푐) is the probability of a single tracklet being an electron, obtained from layer 푗 of the TRD, and 푃(푒푙푒푐) is the combined probability of the full track being an electron. 푡푐푢푡 is found in the distribution of 푃(푒푙푒푐) in the test dataset. This cut-off point can be chosen so as to accept as many electrons as possible, but the price paid for high electron efficiency is a large amount of pion contamination in the electron sample.
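A minimal sketch of the combination in Equation 10 (the per-tracklet probabilities are invented; in this two-class setting the pion probability per layer is taken as 1 − 푃푗(푒푙푒푐)):

```python
# Sketch of the Bayesian combination in Equation 10 for a two-class (e, pi)
# setting; the per-tracklet probabilities below are invented for illustration.

def combine_track_probability(p_e_per_tracklet):
    """Combine up to 6 per-tracklet P_j(elec) values into a track-level P(elec)."""
    prod_e, prod_pi = 1.0, 1.0
    for p in p_e_per_tracklet:
        prod_e *= p           # numerator product over layers j
        prod_pi *= (1.0 - p)  # pion term of the denominator sum over k
    return prod_e / (prod_e + prod_pi)

# A track with six consistently electron-like tracklets:
print(combine_track_probability([0.7, 0.8, 0.6, 0.9, 0.75, 0.65]))  # close to 1
```

Even moderately electron-like tracklets combine into a very confident track-level probability, which is why using all six layers gives more discriminating power than any single tracklet.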


When looking at the probability of classifying a specific particle as a given type, we define the selection efficiencies, i.e. the electron efficiency 휀푒 and pion efficiency 휀휋 as follows:

휀푒 = ∫_{−∞}^{푡푐푢푡} 푔(푡|푒) 푑푡 = 1 − 훼

Equation 11

휀휋 = ∫_{−∞}^{푡푐푢푡} 푔(푡|휋) 푑푡 = 훽

Equation 12

The goal of training neural networks for particle selection is to find a 푡(푥) which is able to maximise 휀푒 while minimising 휀휋. Then, in order to compare our results to previously obtained results, we will sacrifice some 휀푒: specifically, we will adjust 푡푐푢푡 to the point where we can calculate the obtained 휀휋 at 휀푒 ≈ 90%.
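The procedure of fixing 푡푐푢푡 so that 휀푒 ≈ 90% and then reading off 휀휋 can be sketched as follows (the score distributions are synthetic Gaussian stand-ins for the network's 푃(푒푙푒푐) outputs, not real classifier output):

```python
# Sketch: choose t_cut so that electron efficiency ~90%, then read off the
# pion efficiency. The score samples are synthetic stand-ins for P(elec).
import random

random.seed(42)
n = 10000
electron_scores = [min(1.0, max(0.0, random.gauss(0.7, 0.15))) for _ in range(n)]
pion_scores = [min(1.0, max(0.0, random.gauss(0.3, 0.15))) for _ in range(n)]

# t_cut at the 10th percentile of electron scores: ~90% of electrons lie above it.
t_cut = sorted(electron_scores)[int(0.10 * n)]

electron_eff = sum(s >= t_cut for s in electron_scores) / n
pion_eff = sum(s >= t_cut for s in pion_scores) / n  # pion contamination

print(round(t_cut, 3), round(electron_eff, 3), round(pion_eff, 4))
```

With better-separated score distributions the same 휀푒 ≈ 90% working point yields a smaller 휀휋, which is exactly the quantity used to compare classifiers in this thesis.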

3.2 Background: Artificial Intelligence, Machine Learning & Deep Learning

Artificial Intelligence (AI) is a branch of Computer Science concerned with getting computers to perform tasks that mimic those performed by the human mind (such as recognising faces from images, solving complex problems and learning from experience). The field of AI encompasses both hard-coded rule-based programs (known as the knowledge-based approach to AI, which has largely remained ineffective) and Machine Learning, an approach to AI which aims to get computers to perform these tasks without explicitly coding the solutions for them [60].

The success of Machine Learning algorithms is largely determined by the representation of the data fed to them: a set of pertinent features (x), capturing potentially useful factors of variation from which an algorithm can determine the desired outcome (y) for each observation, needs to be represented in a way that confers useful information to the algorithm. For example, when determining the time from a clock, giving the angles of the clock hands could be an easier representation to hand to an algorithm than raw pixel data from photographs of the clock; arriving at such a representation might involve manual feature engineering (cf. representation learning, described below).

Often, a large amount of an AI practitioner’s time is dedicated to engineering the right feature-set to hand to a simple machine learning algorithm [60]; this could involve tasks such as feature selection (not all variables are necessary or useful) and data preprocessing (scaling and normalising data, dealing with missing values by exclusion or imputation, sensible handling of outliers, etc.).

Representation learning is a solution to feature generation in which ML is applied, not only to map from a feature set to an output, but also towards automatically learning the most useful representation of that input feature set; usually, this representation learning process will involve the algorithm identifying the major factors of variation which effectively explain the observed data and discarding those which are not useful to the algorithm [60].

Deep Learning is an approach to representation learning which constructs useful representations as combinations of simpler representations. In fact, the basic unit of a neural network is the perceptron, which in itself is a very simple function, i.e. 휑(∑_{푖=1}^{푛} 푤푖푥푖 + 푏) (see Figure 19); but once compiled into a Multi-layer Perceptron, the rich texture of the input data distribution can be very accurately captured, because useful features discovered in the first layers of such a neural network can subsequently be combined in various ways to create additional useful features downstream [60]. In other words, an initial set of features with a specific representation is transformed through the various layers of the neural network in order to generate a new representation of the input data which is useful to hidden layers deeper in the network, and finally to the network as a whole to achieve its task. (Supervised ML tasks are usually framed broadly as classification or regression, while Unsupervised ML tasks are concerned with finding hidden patterns in the data without explicitly predicting a label or a real-valued outcome regarding that data.)


Figure 19 Schematic of a single neuron, with inputs multiplied by weights, a bias term is added to the summation in the transfer function (not shown) and this net input is then passed through a non-linear activation function, 흋 [61]

3.3 Mathematical Basis: Artificial Neural Networks

At its most basic level, an artificial neural network (ANN) approximates a mapping function 푓푎, which maps from a set of input features 푥푖; 푖 = {1, 2, … , 푛} to the desired response, 푦. Feedforward neural networks have one-way information flow from input features to output, whereas recurrent neural networks have feedback connections [60].

Also called multilayer perceptrons (MLPs), deep feedforward networks are composed of an arbitrary number of nested approximating mapping functions, of the form:

푓(푥_{1,…,푛}) = 푓푎^{푚}(… 푓푎^{3}(푓푎^{2}(푓푎^{1}(푥_{1,…,푛}))))

Equation 13

The superscript of these functions, 푓푎, indicates the layer index of the function in an ANN, with 푚 indicating the depth of such a neural network. It is this concept of chained functions of arbitrary depth from which the term Deep Learning is derived [62].

The set of nested approximation functions outlined above is composed of an input layer 푓푎^{1}, an arbitrary number of hidden layers 푓푎^{2,…,푚−1} and an output layer 푓푎^{푚}; the dimensionality of the outputs of each layer is known as its width, or the number of neurons in that particular layer [60].

In order to produce subtle derived features from the input feature set, nonlinear transformations are typically applied to the output of each neuron in each layer of the network. Each neuron itself is a simple linear function of the form 푤^푇푥 + 푏, where 푤^푇 is a vector of weights of the same length as the set of input features (essentially a set of coefficients for each neuron in each 푓푎 in the chain of functions), and 푏 is a real-valued bias term (essentially an intercept term for each neuron in each 푓푎) [60].

It is easy to see that by chaining such a set of linear models without applying nonlinear transformations (denoted 휑(푓푎(푥))) to what is essentially an arbitrary number of linear regression functions (푦 = 훽1푥1 + 훽2푥2 + 훽3푥3 + 푐), one would simply arrive at another linear model [60]. Non-linear transformations applied over 푤^푇푥 + 푏 allow deep- (and even shallow-) learning models to more accurately model the multidimensional feature space of the data distribution. Various nonlinear transformations (more commonly known as activation functions) exist, of which the Rectified Linear Unit (ReLU), which activates its input as 휑(푤^푇푥 + 푏) = max(푤^푇푥 + 푏, 0), is often the first introduced and the easiest to understand (see Figure 20). For a more advanced overview of various activation functions and their performance, please see [63]. Also note that non-linear activation functions at every layer of a network are not an absolute requirement, but their use is recommended unless there is a specific reason to omit them.


Figure 20: ReLU activation function
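A single ReLU neuron of the form 휑(푤^푇푥 + 푏) can be written out directly (the weights, inputs and bias are made up for illustration):

```python
# A single neuron with ReLU activation, phi(w^T x + b) = max(w^T x + b, 0);
# weights, inputs and bias are made up for illustration.

def relu(z):
    return max(z, 0.0)

w = [0.5, -0.3, 0.8]  # hypothetical weight vector
x = [1.0, 2.0, 1.0]   # hypothetical input features
b = 0.1               # bias term

pre_activation = sum(wi * xi for wi, xi in zip(w, x)) + b
print(relu(pre_activation), relu(-2.0))  # positive inputs pass through; negatives clamp to 0
```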

Combining the concepts explained above gives us a representation for a single hidden layer in an ANN as follows (Equation 14, where 푊 is a matrix and 풙 and 풃 are vectors):

풉 = 휙(푊푇풙 + 풃)

Equation 14

And, by extension, for a neural network with three hidden layers:

풉(1) = 휙(1)(푊(1)푇풙 + 풃(1))

풉(2) = 휙(2)(푊(2)푇풉(1) + 풃(2))

풉(3) = 휙(3)(푊(3)푇풉(2) + 풃(3))

Equation 15

For each hidden layer, a matrix of trainable weights is multiplied by a vector of input features, which is either the original feature vector fed to 풉^(1), or the outputs of the hidden units in the preceding layer for 풉^(2), …, 풉^(푚). In addition, each hidden layer has an attendant vector of bias terms. All of these parameters, collectively referred to as 휃, need to be optimized to arrive at a reasonable approximation of a theoretically optimal mapping function 푓∗(푥), i.e. such that the neural network's output is 푦̂ ≈ 푓∗(푥), where 푓∗(푥) = 푦 [60].
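The chained layers of Equation 15 can be sketched as a forward pass in plain Python (all shapes and parameter values are tiny and hypothetical; a ReLU is assumed for the hidden layers and a sigmoid for the single output, as used later for classification):

```python
# Forward pass through a small network in the style of Equation 15,
# using plain lists (all parameter values are hypothetical).
import math

def dense(W, x, b, phi):
    """One layer h = phi(W^T x + b); W is given as a list of weight rows."""
    return [phi(sum(w * xj for w, xj in zip(row, x)) + bi)
            for row, bi in zip(W, b)]

def relu(z):
    return max(z, 0.0)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

x = [0.5, 1.2]                                   # input features
W1, b1 = [[0.1, 0.4], [-0.3, 0.2]], [0.0, 0.1]   # hidden layer 1
W2, b2 = [[0.7, -0.5], [0.2, 0.9]], [0.1, -0.1]  # hidden layer 2
W3, b3 = [[1.0, -1.0]], [0.0]                    # single-neuron output layer

h1 = dense(W1, x, b1, relu)
h2 = dense(W2, h1, b2, relu)
y_hat = dense(W3, h2, b3, sigmoid)[0]
print(y_hat)  # a probability-like value in (0, 1)
```

Each `dense` call is exactly one line of Equation 15: a weight matrix applied to the previous layer's output, a bias vector added, and a nonlinearity applied elementwise.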

Figure 21 shows more graphically how many neurons are combined into hidden layers which are in turn combined to form a fully connected feedforward artificial neural network, as discussed above.

3.3.1 Optimization

The essential optimization objective in deep learning is to find the optimal set of parameters 휃 which minimize an objective (loss) function 퐽(휃), by utilising the concept of maximum likelihood [60].

3.3.1.1 Loss Functions

Neural networks are optimised by making use of surrogate loss functions which, when minimised, result in a neural network that is optimised for the task it is set up to accomplish.

A comprehensive overview of loss functions designed for specific use-cases can be found in [60]. The two loss functions that were predominantly used for particle identification in this project are discussed below.


Figure 21: Schematic depiction of a fully-connected feedforward neural network (16:12:10:1), i.e. with an input layer with 16 neurons (nodes), 2 hidden layers with 12 and 10 neurons respectively, and a single-node output layer, with the weights between two neurons depicted as connecting lines between them [64].

3.3.1.1.1 Binary Cross-entropy

Binary cross-entropy is defined as:

BCE(푦, 푦̂) = −(1/푛) ∑_{푖=1}^{푛} (푦^{(푖)} ⋅ log(푦̂^{(푖)}) + (1 − 푦^{(푖)}) ⋅ log(1 − 푦̂^{(푖)}))

Equation 16

Here, 푦̂ is the model’s estimate for the probability of an observation being of a particular class 푦, 푖 references a specific observation in the training dataset and 푛 indicates the total number of observations [60]. Figure 22 shows how, as 푦̂ = 푝푚표푑푒푙(푦|푥) approaches the true 푦, the binary cross entropy loss function approaches 0 (this is only shown for one training example, 푖; to compute the overall loss for the entire training sample, individual losses are summed over and averaged as shown in Equation 16).

Note that for multi-class classification the binary cross-entropy loss function can be extended to N classes, and is then called categorical cross-entropy. In the case of binary classification, such as the particle identification done in this thesis, one can simply one-hot encode the particle ID (one-hot encoding is a technique which assigns a numeric value to a categorical data type by arbitrarily, but exclusively, assigning either a value of 0 or a value of 1 to each class; here we use 푒 = 1, 휋 = 0) and use a sigmoid activation function (휑(푥) = 1/(1 + 푒^{−푥}), which is bounded in the range [0, 1]) in a single-neuron output layer, in order to use this loss function.
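Equation 16 can be computed directly (the labels and predicted probabilities below are invented for illustration):

```python
# Binary cross-entropy (Equation 16) computed directly, with one-hot labels
# e = 1, pi = 0; the predicted probabilities are invented for illustration.
import math

def binary_cross_entropy(y_true, y_pred):
    n = len(y_true)
    return -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for y, p in zip(y_true, y_pred)) / n

y_true = [1, 0, 1, 0]                # 1 = electron, 0 = pion
confident = [0.9, 0.1, 0.8, 0.2]     # mostly correct, confident predictions
uncertain = [0.6, 0.4, 0.5, 0.5]     # hesitant predictions

# The loss rewards confident correct predictions with a smaller value:
print(binary_cross_entropy(y_true, confident) < binary_cross_entropy(y_true, uncertain))  # True
```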


Figure 22: Illustration of the descent towards zero of the Binary Cross-Entropy loss function as 푦̂, or 푝푚표푑푒푙(푦|푥), approaches the true 푦.

3.3.1.1.2 Focal Loss

Focal Loss was proposed by [65] as a modification of the binary cross-entropy loss function. Focal loss down-weights the importance of well-classified examples, effectively making training examples that are more difficult to classify contribute more to the overall loss.

This is important since, when there are extreme imbalances in the number of examples of foreground- (in the case of this thesis 푒) and background- classes (in the case of this thesis 휋) of the order of ~ 1:1000, a neural network will naturally obtain a lower overall loss when it favours the majority class: i.e. even though some examples might be very badly classified (and will have individually high binary cross-entropy loss), their contribution to the overall loss function will be overwhelmed due to the averaging process that occurs when calculating the loss function.

The way in which focal loss modifies binary cross-entropy is explained below:

First, we define the term 푝푡 for convenience:

푝푡 = 푦̂ if 푦 = 1;  푝푡 = 1 − 푦̂ if 푦 = 0

Equation 17

This allows us to express binary cross-entropy as follows:

퐵퐶퐸(푦, 푦̂) = 퐵퐶퐸(푝푡) = −log (푝푡)

Equation 18

Focal loss simply adds a modulating factor (1 − 푝푡)^훾, parameterised by a focusing parameter 훾, to the binary cross-entropy loss function, as well as a weighting factor 훼푡, defined similarly to 푝푡 as:

훼푡 = 훼 if 푦 = 1;  훼푡 = 1 − 훼 if 푦 = 0

Equation 19

훾 퐹퐿(푦, 푝푡) = 훼푡(1 − 푝푡) . log (푝푡)

Equation 20


The authors suggest using 훾 = 2 [65], in which case 푝푡 = 0.9 would contribute a focal loss result which is 100 × lower than that obtained by binary cross-entropy and a 푝푡 ≈ 0.968 would contribute a loss which is 1000× lower than binary cross-entropy.

Figure 23 shows how the Focal loss function decreases steeply as classification probability approaches the true class level (where y=1). The use of this loss function allows for the training of models without having to resort to down-sampling or up-sampling to account for class imbalances and therefore makes it possible to use a much larger training dataset in these cases.

Figure 23: Focal Loss where the true class is 1 [65].

Focal loss is not a default loss function in Keras, but it has been implemented as a custom loss function here [66] for Python. The author of this thesis subsequently adapted this for use in Keras with R here [67], since no equivalent custom focal loss implementation could be found online for R.
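As a concrete sketch, the focal loss of Equations 17–20 can be written directly in NumPy (an illustrative stand-alone version; the function and argument names are not from the thesis code):

```python
import numpy as np

def focal_loss(y_true, y_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    """Focal loss for binary classification, per [65].

    Down-weights well-classified examples via the modulating
    factor (1 - p_t)**gamma; alpha_t re-weights the two classes.
    """
    y_pred = np.clip(y_pred, eps, 1.0 - eps)       # avoid log(0)
    p_t = np.where(y_true == 1, y_pred, 1.0 - y_pred)
    alpha_t = np.where(y_true == 1, alpha, 1.0 - alpha)
    return -alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)
```

With γ = 2 and α_t = 1, a well-classified example with p_t = 0.9 contributes 100× less loss than under binary cross-entropy, matching the figures quoted above.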

3.3.1.2 Minimization of Loss via Backpropagation

3.3.1.2.1 Introduction

The chain rule of calculus is employed via a process called backpropagation (see Section 3.3.1.2.2) to redistribute the gradient of the loss function J (as described in 3.3.1.1) through the network, based on the partial derivative of the loss with respect to each parameter in the set θ, where ŷ is the output of the neural network at iteration k and y is the desired output [60]:

∇ = ∇_ŷ J(θ) = ∇_ŷ L(ŷ, y)

Equation 21

In practice, the process of training a neural network, f, to give the closest approximation to the desired output, y, is iterative: many observations, each with the same feature set x_{i,…,n}, are passed through the MLP; the output ŷ is assessed according to a loss function J; and each of the mapping functions f_a^{j,…,m} is adjusted according to the contribution of its parameters to the gradient of the error at the conclusion of each training step k. In other words, the parameter set θ pertaining to each f_a^j is iteratively adjusted according to ∂J_k/∂f_a^j [60], until a (hopefully global) minimum is reached. Note that the gradient of the loss function will be ∇ = 0 at either a local or a global minimum (shown in Figure 24).

Chapter 3: Theory: Statistical Methods & Machine Learning

Figure 24: Pitfalls faced during optimization

The optimization process is constrained by a learning rate hyperparameter η, usually a small value η ≪ 1, which governs the step size of the weight update (see Figure 24 for an illustration of what can happen if the learning rate is too high). The exact formula for updating an individual weight, bias, or any other trainable parameter θ_i at iteration (or epoch) k is:

θ_i ← θ_i − η · ∂J(θ)/∂θ_i

Equation 22

Adaptive learning rates, utilization of the second derivative of the loss function during training, and various parameter-initialization and other advanced strategies can be employed to make the training/optimization process more effective and to prevent the pitfalls shown in Figure 24, among others [60].

3.3.1.2.2 Backpropagation

Algorithm 1: Backpropagation at the conclusion of a single epoch

Given: input x, target y, a neural network with l layers producing estimate ŷ, and the value of the loss function J(θ) = L(ŷ, y); the gradients with respect to the activations a^(k) in each layer k are calculated as follows.

Compute the gradient of the loss function on the output layer:

g ← ∇_ŷ J = ∇_ŷ L(ŷ, y)

for k = l, l − 1, …, 1, do:

Convert the gradient on the layer's output into a gradient on its pre-activation values (element-wise product with the derivative of the nonlinear activation):

g ← ∇_{a^(k)} J = g ⊙ f′(a^(k))

Compute the gradients on the weight (W^(k)) and bias (b^(k)) terms:

∇_{b^(k)} J = g

∇_{W^(k)} J = g · φ(a^(k−1))^T

Propagate the gradient with respect to the previous layer's activations:

g ← ∇_{φ(a^(k−1))} J = W^{(k)T} g

end for
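To make Algorithm 1 concrete, the following is a minimal NumPy sketch of one forward and backward pass for a two-layer MLP with a ReLU hidden layer and MSE loss (an illustrative example; shapes and names are assumptions, not the thesis implementation):

```python
import numpy as np

def forward_backward(x, y, W1, b1, W2, b2):
    """One forward/backward pass for a 2-layer MLP with MSE loss,
    following Algorithm 1 (illustrative sketch)."""
    # forward pass
    a1 = W1 @ x + b1          # pre-activation, hidden layer
    h1 = np.maximum(0.0, a1)  # ReLU output, phi(a1)
    a2 = W2 @ h1 + b2         # pre-activation, output layer
    yhat = a2                 # linear output
    loss = 0.5 * np.sum((yhat - y) ** 2)
    # backward pass: g starts as the gradient of the loss wrt the output
    g = yhat - y
    dW2 = np.outer(g, h1)     # grad wrt W2: g . phi(a1)^T
    db2 = g.copy()            # grad wrt b2: g
    g = W2.T @ g              # propagate to previous layer's activations
    g = g * (a1 > 0)          # through the ReLU: g elementwise f'(a1)
    dW1 = np.outer(g, x)
    db1 = g.copy()
    return loss, (dW1, db1, dW2, db2)
```

The analytic gradients can be verified against a finite-difference approximation, which is a standard sanity check for backpropagation code.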

3.3.1.2.3 Optimization Algorithms

3.3.1.2.3.1 Stochastic Gradient Descent

Many scientific fields make use of stochastic gradient-based optimization: as long as a parameterised scalar objective function is differentiable with respect to its parameters, gradient descent can be used to adjust those parameters so as to minimise or maximise the objective function [68].

Stochastic gradient descent (SGD) is an optimization algorithm which approximates the true gradient of the dataset by calculating the gradient for a single training example at a time and using that as an unbiased estimate of the true gradient. It is from this sampling procedure that the “stochastic” term is added to SGD’s name.

Since passing single observations through a neural network is computationally expensive (and volatile if the learning rate is not small enough), in practice sub-samples (mini-batches) of data are usually evaluated sequentially. Technically this is an approximation of SGD, more accurately referred to as mini-batch gradient descent (MB-GD), but it is still called SGD in most software implementations. Once the entire dataset has been passed through the neural network once, in batches of size m, an epoch of training is said to be concluded. It is good practice to shuffle mini-batches at each epoch to prevent update cycles from occurring. SGD (MB-GD) can be a highly efficient way to optimise the parameters of a neural network or any other differentiable function. Algorithm 2 shows how MB-GD updates a single parameter at each training step, using the mean squared error as an example loss function.

Algorithm 2: SGD (MB-GD) update of the mean squared error (MSE) loss function

Given: learning rate η; initial parameters θ

while stopping criteria unmet, do:

1. Sample a minibatch {x^(1), …, x^(m)} and corresponding targets y^(i)

2. Compute the gradient estimate according to Backpropagation (Algorithm 1):

∇̂_θ = (1/m) ∇_θ Σ_i MSE(ŷ^(i), y^(i))

= (1/m) Σ_i ∂(f_θ(x^(i)) − y^(i))² / ∂θ

= (2/m) Σ_i (f_θ(x^(i)) − y^(i)) · ∂f_θ(x^(i)) / ∂θ

3. Apply the update:

θ ← θ − η ∇̂_θ

end while
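A minimal NumPy sketch of Algorithm 2 for a linear model under MSE (the linear model and all names are illustrative assumptions, not the thesis code):

```python
import numpy as np

def sgd_mse(X, y, theta, lr=0.01, batch_size=32, epochs=10, seed=0):
    """Mini-batch SGD for a linear model f_theta(x) = x . theta
    under MSE, following Algorithm 2 (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    n = len(y)
    for _ in range(epochs):
        order = rng.permutation(n)                 # shuffle each epoch
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[idx], y[idx]
            resid = Xb @ theta - yb                # f_theta(x) - y
            grad = 2.0 * Xb.T @ resid / len(idx)   # (2/m) sum (f-y) df/dtheta
            theta = theta - lr * grad              # theta <- theta - eta*grad
    return theta
```

On noiseless, realizable data the iterates converge to the generating parameters.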

3.3.1.2.3.2 Variants of the SGD concept and other Optimization Algorithms

Various optimization algorithms modify the basic concept of SGD, for example by making use of the concept of "momentum", which takes an exponentially decaying moving average of past gradients into account when updating weights during backpropagation, resulting in accelerated learning.

Momentum

The concept of Momentum in a deep learning context is inspired by Newtonian laws of motion and represents the negative gradient of the loss function as a force moving parameters in the parameter space. In practice, momentum introduces an additional variable 푣, which represents the speed at which parameters move through the parameter space and is set to an exponential moving average (EMA) of the negative gradient (here we assume unit mass and therefore 푣 is also the momentum of the parameter, according to 푝 = 푚푣, to complete the Physics analogy). An additional hyperparameter 훼 ∈ [0,1) determines the rate of exponential decay; updating 휃 using momentum is done as follows:

v ← αv − η ∇_θ ( (1/m) Σ_i L(f(x^(i)), y^(i)) )

θ ← θ + v

Equation 23

Adam

The Adam optimizer was predominantly used during this project.

Originating as an acronym for “adaptive moments”, the Adam algorithm is generally touted as an optimization strategy robust to various settings of hyperparameters. Adam uses the concept of momentum to estimate the first moment of the gradient and also applies bias corrections to both the first- and second-order moments of the gradient [68].
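A simplified NumPy sketch of a single Adam update, following the description above (bias-corrected first- and second-moment EMAs of the gradient; hyperparameter defaults are the commonly used ones, and all names are illustrative):

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=0.001,
              beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update (sketch of the algorithm in [68]):
    EMAs of the first and second moments of the gradient,
    each divided by a bias-correction term at step t >= 1."""
    m = beta1 * m + (1 - beta1) * grad          # first-moment EMA
    v = beta2 * v + (1 - beta2) * grad ** 2     # second-moment EMA
    m_hat = m / (1 - beta1 ** t)                # bias corrections
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```

Note that at t = 1 the bias corrections make the step size approximately lr times the sign of the gradient, independent of its magnitude.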

3.3.2 Regularization

Regularization strategies are often employed in deep learning to reduce test error, potentially at the cost of accuracy on training-set predictions. Effective regularization reduces overfitting of the model to features only present in the training data and therefore increases accuracy on unseen data [60].

Regularization strategies can entail, for example, constraining parameter values by adding penalty terms to an objective function or by explicitly constraining parameters. Carefully designed regularization processes can improve performance on test data by encoding prior domain knowledge, making an undetermined problem determined, or by simplifying the model so that it generalizes better [60].

While other regularisation strategies such as Gaussian Noise Layers, L1- and L2-regularisation were used for some models in this project, dropout was the predominantly used regularization strategy; since its introduction to a model managed to maintain stability during training and it was found to be easier to control and manipulate successfully than other regularisation strategies.

3.3.2.1 Dropout

Dropout is a computationally inexpensive regularization method which effectively trains an ensemble of subnetworks: by setting the output of a random subset of hidden units to zero, it approximates model-averaging methods such as explicit ensembles of multiple models [60].

Practically, dropout is achieved by a combination of mini-batch training and binary mask generation during each minibatch training round. The binary mask is of the same dimensions as the input- and hidden- units and each element in the mask is multiplied by its corresponding neuron, effectively pruning the neural network by setting the output of a random subset of neurons to zero [60].

The probability of sampling a 1 at each unit of the mask is a hyperparameter set before training. Each unit in the mask is sampled independently [60].
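The mask-based procedure described above can be sketched as follows (an "inverted dropout" variant that also rescales the surviving activations, as is common in practice; names are illustrative, and Keras layers handle this internally):

```python
import numpy as np

def dropout_mask(activations, keep_prob=0.8, rng=None):
    """Apply (inverted) dropout: sample an independent binary mask with
    P(1) = keep_prob, multiply it element-wise with the activations,
    and rescale so the expected activation is unchanged."""
    rng = rng or np.random.default_rng()
    mask = rng.random(activations.shape) < keep_prob   # independent Bernoulli
    return activations * mask / keep_prob, mask
```

Returning the mask as well makes the pruning explicit: zeroed units are exactly those with a 0 in the mask.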

3.4 Convolutional Neural Networks

3.4.1 The Kernel Concept and Motivation for CNNs

Convolutional Neural Networks (CNNs) are an extension of deep learning models that is highly successful at processing data with a grid-like topology, e.g. images. At least one linear mathematical operation, called a convolution, is applied in CNNs, usually in addition to the general matrix multiplication performed in traditional feedforward neural networks [60].

3.4.2 The Convolution Function

In practice, data fed to a CNN usually consists of a grid of vectors, effectively adding a depth (or channel) dimension to the usual width and height dimensions of a grid. In deep learning packages such as Tensorflow, a value in such a tensor is specified by 4 indices: a row, column, channel and observation index [60].

3.4.2.1 The Convolution Operation

Given an element of a 3-D input tensor V_{i,j,k}, i.e. a value in the i-th channel, j-th row and k-th column of a single training observation V, which is convolved with a 4-D kernel tensor K to generate an element of an output tensor Z_{i,j,k}, then K_{i,l,m,n} represents the strength of the connection between an element in channel i of the output tensor (Z_{i,·,·}) and an element in channel l of the input tensor (V_{l,·,·}), offset by m rows and n columns between the input and output elements. Thus Z_{i,j,k} is calculated as follows:

Z_{i,j,k} = Σ_{l,m,n} V_{l, j+m−1, k+n−1} K_{i,l,m,n}

Equation 24

where the summation is over all valid indices in the tensor [60].

An example of a simple 2D convolution (convolving a 3×4 matrix with a 2×2 kernel) is shown below (adapted from [60]).

Equation 25
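Restricted to a single channel, the operation of Equation 24 reduces to the familiar 2-D "valid" cross-correlation, which can be sketched as (illustrative code, not the thesis implementation):

```python
import numpy as np

def conv2d_valid(V, K):
    """Valid 2-D cross-correlation of a single-channel input V with
    kernel K (the operation deep learning libraries call 'convolution').
    Output size is (m - k + 1) per dimension, as discussed in the text."""
    H, W = V.shape
    kh, kw = K.shape
    Z = np.zeros((H - kh + 1, W - kw + 1))
    for r in range(Z.shape[0]):
        for c in range(Z.shape[1]):
            # element-wise product of the kernel with one input patch
            Z[r, c] = np.sum(V[r:r + kh, c:c + kw] * K)
    return Z
```

For a 3×4 input and a 2×2 kernel, the output is 2×3, i.e. m − k + 1 per dimension.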

There are three major mechanisms that improve the accuracy of ML algorithms that motivate the implementation of convolutions in a deep learning architecture, namely parameter sharing, equivariant transformations and sparse interactions [60]. These will be discussed below.

Sparse interactions occur in CNNs because the kernels are smaller than the input matrix, which means that not every input unit has a connection to every output unit (as is the case in fully connected traditional ANNs). This sparsity of weights allows for the detection of meaningful small-scale features, such as edges, which are combined downstream (via indirect interactions of neurons in preceding layers) into progressively larger features, such as textures, shapes and actual visual elements. Reducing the number of weights in this manner also increases the efficiency of the neural network, since fewer operations are required per layer and fewer weights need to be stored and adjusted [60].

Parameter sharing allows certain parameters to be used by more than one function in a CNN, unlike traditional neural networks, which use each weight in just one operation when the network's output is calculated. In a CNN, each element of the kernel is applied at (almost) every position of the input matrix (where dimension differences do not allow for this, edges may be padded with zero-valued matrix elements to enable it). The weights of the kernel are learnt once and applied uniformly, i.e. they are not relearned at each position of the input matrix; again, this benefits computational efficiency [60].

Equivariance to translation is a phenomenon which results from parameter sharing and means that the output of a convolutional layer changes in the same way that its input changes: f(x) is said to be equivariant to a function g if g(f(x)) = f(g(x)). If the function g translates (shifts) the input matrix, then, since the convolution operation is equivariant to g, a feature occurring at different (x, y) coordinates in the input will produce the same response in the output, merely shifted by the same amount [60].
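Equivariance to translation can be checked numerically with a tiny 1-D convolution (circular shifts via np.roll are used so that edge effects do not interfere; the helper name is illustrative):

```python
import numpy as np

def conv1d_circular(x, k):
    """Circular 1-D cross-correlation: out[i] = sum_m k[m] * x[(i+m) % n].
    Illustrative helper for demonstrating equivariance to translation."""
    n = len(x)
    return np.array([sum(k[m] * x[(i + m) % n] for m in range(len(k)))
                     for i in range(n)])
```

Shifting the input and then convolving gives the same result as convolving and then shifting the output by the same amount.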

3.4.3 Pooling

CNN layers are generally composed of three operations, shown in Figure 25:

Figure 25: Simplified diagram of a convolutional neural network [69]

These three operations are executed as follows:

1. The appropriate number of convolution operations, as introduced above, is applied in parallel over the input matrix
2. A non-linear activation function is applied to the output of each convolution operation performed in step one
3. A pooling operation introduces a final modification to the layer output

The pooling function in step 3 above, performs a statistical summary over a window of outputs within a defined range, which could be, for example, the 퐿2-norm, mean or maximum over the series of rectangular ranges thus defined [60].

In this thesis, Max Pooling with a window size of 2 × 2 was generally employed.

Pooling serves the purpose of ensuring invariance to local translation, where the presence of a feature matters more than its exact location. In some cases, the specific orientation and location of a feature do matter, though. Pooling over separate, independently parameterized convolutions can allow the ANN to learn which translations it should be invariant to and which it should not [60].

For computational efficiency, downsampling of the convolution output can be implemented by applying the kernel only at positions separated by a fixed step, specified by a parameter called the stride [60].

Pooling with down-sampling is achieved by reducing the number of pooling operations relative to the number of detector units, by introducing a stride greater than one. See Figure 26 for an illustration of the effect of using different stride widths with the same pool- width. It is straightforward to see that down-sampling applied in this manner will lead to fewer inputs to process, reduced memory requirements and improved statistical efficiency [60]

Figure 26: An illustration of the concept of max pooling, using pool-width of 3 with a stride of one (top panel) vs a stride of two (bottom panel), adapted from [60].
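The pool-width/stride behaviour of Figure 26 can be sketched in one dimension as follows (illustrative code):

```python
import numpy as np

def max_pool_1d(x, pool_width=3, stride=1):
    """1-D max pooling over windows of size pool_width, moving by
    stride, as illustrated in Figure 26 (illustrative sketch)."""
    out = []
    for start in range(0, len(x) - pool_width + 1, stride):
        out.append(np.max(x[start:start + pool_width]))
    return np.array(out)
```

With stride = 1 the output has nearly the same length as the input; with stride = 2 the output is downsampled by roughly a factor of two.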

Zero-padding is often applied to the input in order to prevent the output from shrinking with each convolution: for an input image of width m and kernel width k, the output of a convolution with no zero-padding will have width m − k + 1. If not accounted for, this shrinkage would enforce progressively smaller networks and smaller subsequent kernels, which in turn would limit the capacity of the network to find useful representations of the data [60].

Convolutions applied with no padding of the input image are known as valid convolutions: every pixel in the output is a function of the same number of pixels in the input, and the kernel can only be applied at positions where it is fully contained by the image [60].

When just enough zero-padding is applied to the input image to ensure that the output will be of the same dimensions, the convolution is known as a "same" convolution [60]. Although same convolutions do not limit the size of the network and allow one to build neural networks of arbitrary depth, they still result in pixels close to the edges of the image having fewer connections to the output image, so that their influence on the network as a whole is reduced [60].

3.5 Recurrent Neural Networks

Recurrent neural networks (RNNs) are specifically designed to process sequential values. Parameter sharing is an essential aspect of RNNs and facilitates the detection of patterns that may occur in more than one place in a sequence; this family of ANNs also accommodates sequences of differing length [60].

Parameter sharing in RNNs manifests in the fact that each element of the output is a function of previous elements of the output and is updated using the same rule that was used to update those previous elements [60].

The input to an RNN is a sequence of vectors x^(t), with the timestep index t ranging over 1, …, τ [60].

Recurrent neural networks extend the concept of a computational graph to include cyclical connections, where the present value of a variable is understood to have an influence on its future value [60].

3.5.1 Computational Graphs

Computational graphs are visual depictions which formalize a set of operations applied to an input vector. For example, the computational graph formalizing the ReLU-activated output of a hidden unit, i.e. H = max{0, WX + b}, would look as follows:

Figure 27: ReLU activated hidden unit in a Neural Network depicted as a computational graph, adapted from [60].

In RNNs, computational graphs manifest as repetitive chains of operations which result in parameter sharing across neural network architectures [60].

As an example, a dynamical system is classically expressed as:

푠(푡) = 푓(푠(푡−1); 휃)

Equation 26

Here, the state of the system at time 푡, i.e. 푠(푡), explicitly depends on its state at the previous time step (t-1).

This graph can be unfolded for a finite number of timesteps, 휏, by applying the above expression 휏 − 1 times, e.g. if 휏=3:

푠(3) = 푓(푠(2); 휃)

= 푓(푓(푠(1); 휃); 휃)

Equation 27

The above equation can be represented as an acyclic graph, which does not make use of recurrence, as follows:

Figure 28: Acyclic computational graph of a dynamical system, adapted from [60].

If we extend this to express the dynamical system’s state at any point being informed by all the previous states of the system, the equation becomes:

푠(푡) = 푓(푠(푡−1), 푥(푡); 휃)

Equation 28

This is the basic formula upon which RNNs are built, where the “states” of the system are the neural network’s hidden units, i.e.

ℎ(푡) = 푓(ℎ(푡−1), 푥(푡); 휃)

Equation 29

[60].
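The recurrence of Equation 29 can be unrolled explicitly as follows (tanh is chosen here as an illustrative f; the parameter names are assumptions):

```python
import numpy as np

def rnn_states(x_seq, W_h, W_x, b, h0):
    """Unroll the recurrence h(t) = f(h(t-1), x(t); theta) of Equation 29,
    with f chosen as tanh(W_h h + W_x x + b). The shared parameters
    theta = (W_h, W_x, b) are reused at every timestep."""
    h = h0
    states = []
    for x_t in x_seq:                           # t = 1, ..., tau
        h = np.tanh(W_h @ h + W_x @ x_t + b)    # same update rule each step
        states.append(h)
    return states
```

Because the same (W_h, W_x, b) are applied at every step, the network handles sequences of any length τ with a fixed number of parameters.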

3.5.2 Long Short-Term Memory

Long Short-Term Memory (LSTM) recurrent neural networks are a highly successful class of RNN which deals with the problem of exploding or vanishing gradients encountered in other RNN implementations, by enforcing constant error flow through the internal states of special units, called memory cells, shown in a computational graph in Figure 29.

Multiplicative input and output gates protect other units from irrelevant inputs and from currently irrelevant stored memory states, respectively. Each memory cell, as shown in Figure 29, consists of a central linear unit with a fixed self-connection. In this way, a memory cell can decide whether or not to save information about its current state, based on inputs from other memory cells.

Figure 29: Graph-based representation of special LSTM units, adapted from [60].

4 IMPLEMENTATION: MACHINE LEARNING FOR PARTICLE IDENTIFICATION

4.1.1 Data Extraction

Please see [70] for the code used to extract TRD digits (using the AliROOT/AliPhysics installation on the Hep01 cluster in the Physics Department at UCT) from the Worldwide LHC Computing Grid (WLCG).

TRD Analog-to-Digital (ADC) digits were extracted and filtered for p-Pb runs from 2016, by redirecting the C++ standard output to a text file; this process is described in detail in Appendix V and summarised below. The specific runs from LHC16q that were analysed are listed in Appendix I.

4.1.1.1 AliROOT implementation: Data Extraction

4.1.1.1.1 Files: Please note that the files used for data extraction have been modified by various authors. The original implementation was coded by [71] (this was based on [72]), thereafter it was modified by [73] for his Honours Thesis and again by [74] for his Third Year Project.

For this project, some additional modifications were made to the files, in order to modify the output and extract additional variables. In summary, the uncalibrated TRD digits data, as well as some additional attributes (shown below) were extracted for electrons and pions that passed certain kinematic cuts.

The following information was extracted for each track and appended to the output file, "pythonDict.txt":

• LHC16q run number
• Event number in the run
• V0 track ID
• Track number in the event
• PDG code
• Momentum (p)
• Transverse momentum (p_t)
• Angular coordinates for the track: η, θ, φ
• The TPC's nσ_e and nσ_π estimates, which indicate how many standard deviations the track's dE/dx was away from the expected value at its momentum
• TPC dE/dx
• The TRD digits for 17 pads surrounding the expected coordinates in each of the six layers of the chamber the track passed through (expected central pad ±8 pads in either direction); as well as, for the position where it was calculated to have passed through the TRD:
  o the detector chamber number
  o row number
  o column number

Chapter 4: Implementation: Machine Learning for Particle Identification

4.1.2 Data Structure

An example of the raw text data obtained for a single track is shown in Appendix IV. This data structure consists of a header section with meta-information about the track, as well as the raw TRD digits for up to 6 tracklets, as explained above (regarding the meta-information) and in Section 2.3.3.1.1.2 (regarding the structure of TRD raw digits).

In essence, for each particle a set of up to six 2D arrays of ADC data is obtained: X_k = (x_ij^(k)) ∈ ℕ^(17×24), for k ∈ {1, …, 6}.

Figure 30 depicts some examples of single tracklets (a tracklet refers to the signal a particle produced in a single layer of the TRD, whereas a full track refers to up to 6 tracklets produced when a particle crosses all 6 layers of the TRD).

In each of the 6 example images in Figure 30, the signal for 17 pads in the TRD layer was added (along the rows of the image), centred around the expected position of the tracklet. The 24 columns in each of the example images represent the charge deposited during a specific time bin (each time bin spanning 0.1휇푠) within the pad, giving an indication of the time-evolution of the signal.

Figure 30: Six example TRD tracklets, with time-direction indicated along the x-axis and pad-direction indicated along the y-axis for each image shown

4.1.3 Data Exploration

When reading all data into a single list data structure, the full dataset amounts to ~19.7 GiB.

While data for 1 565 438 tracks were extracted, only 7 735 493 tracklets of the expected 6 layers × 1 565 438 tracks = 9 392 628 tracklets were obtained. This is mainly the result of detector elements in the TRD being switched off or not working. Missing data of this type manifests as either an empty list for that layer in the python dictionary or as a NULL value. There is also a second type of missing data: 1 098 636 tracklets returned images, but these images carried no information to assist in particle identification. Every pixel in this type of image was equal to 0.

These images were removed from some of the particle identification datasets used for training (i.e. subsets of the full dataset were used in some cases) and this resulted in up to an additional 14.5% of all pion tracklets and 12.6% of all electron tracklets being removed from these datasets.

Technically, excluding this data also affects the true electron- and pion efficiencies reported in this thesis, but this data does not add any additional information, other than the insight that pions result in a slightly higher proportion of empty images compared to electrons.
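The handling of the two types of missing data described above can be sketched as follows (the list-of-layers representation and all names are assumptions about the data structure, not the thesis code):

```python
import numpy as np

def usable_tracklets(track):
    """Filter one track's layers (a list of up to six 17x24 ADC arrays,
    with None for missing/switched-off layers) down to informative
    tracklets, dropping all-zero images."""
    kept = []
    for layer in track:
        if layer is None:
            continue          # missing layer (detector element off)
        arr = np.asarray(layer)
        if not arr.any():
            continue          # empty image: every pixel equals 0
        kept.append(arr)
    return kept
```

Applying such a filter per track yields the reduced tracklet datasets described above.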

4.1.3.1 Total Number of Tracklets per Particle ID

Figure 31 illustrates the extreme class imbalance in this dataset; if not appropriately accounted for, such a distorted class distribution can lead to unwanted results when training machine learning models, such as the Accuracy Paradox, where a model seems to achieve high accuracy during training but is simply echoing the unbalanced class distribution in its predictions and favouring the dominant class [75].

Figure 31: Number of Particles, per Particle ID, across all runs

4.1.3.2 Momentum bin counts: number of tracklets per Particle ID

From Figure 32, one can see how this class distribution differs for particles in different momentum ranges. In particular, there is a larger proportion of electrons in the lower-momentum bins, i.e. p ≤ 2 GeV/c and 2 GeV/c < p ≤ 3 GeV/c. The proportionally higher fraction of electrons vs pions seen in the lowest momentum bin is mostly due to a physics cut made on momentum in AliTRDdigitsExtract.cxx, which kept only tracklets from pions in the 2 GeV/c ≤ p ≤ 6 GeV/c range and electrons in the 1.5 GeV/c ≤ p ≤ 6 GeV/c range.

4.1.3.3 Characteristic Energy Loss Curves (Bethe-Bloch)

From Figure 33, the expected increased energy loss of electrons relative to pions in the low-GeV range is apparent. Note also that the "ground truth" particle IDs used in this thesis were estimated from V0 decays; the long tail towards low dE/dx for the electron sample (shown in Figure 33) indicates that some pions may have been incorrectly identified as electrons using this method. This contamination places an additional limit on the obtainable ε_π at ε_e ≈ 90%, and on the reliability of the ε_π and ε_e values reported in this thesis.

4.1.3.4 Average Pulse Height

Figure 34 shows the average pulse height as a function of time, for electrons vs pions, across the entire momentum range; the characteristic Transition Radiation (TR) signal can be seen for electrons in the later time bins of the plot. The average pulse height for electrons is also higher than that for pions across all time bins, but there are significant fluctuations around this average (as can be seen in Figure 35).

(a)

(b)

Figure 32 (a) Histogram of electron and pion counts per momentum bin (b) Histogram of the relative 흅-abundance in each momentum bin in (a).

Figure 33: Energy Loss per Unit Path Length as a function of Momentum, for Electrons and Pions, as measured by the ALICE TPC

Figure 34: Time Evolution of the Average Pulse Height Signal, per Particle ID (for tracklets from the entire momentum range)

Figure 35: Pulse height as a function of time for 1000 randomly sampled electrons and 1000 randomly sampled pions, with the average pulse height indicated as a black line for each particle species, illustrating the considerable fluctuations that occur around the average.

Figure 36 shows the declining number of tracks that reach the furthermost layers of the TRD.

Figure 36: Number of Entries per TRD Layer Number, for all Runs

4.2 Particle Identification Results: Previous Theses

The following theses also applied artificial neural networks to e vs π discrimination based on TRD data. Their results are included here as a point of reference for the results obtained in this thesis. It should be noted that the capacity of the neural networks developed in these previous theses is extremely limited compared to the neural networks developed in this thesis, yet their results are much better. This is mostly explained by the fact that, unlike the data used in the previous theses reported here and in Section 2.3.5, the data used in this thesis was not properly calibrated.

4.2.1 [50] (Masters thesis, 2017): Input data: 7 or 8 features, consisting of the charge deposition obtained by splitting each tracklet into 7 time-bin slices and summing within each slice (in all cases), as well as the particle's momentum (in some cases)

Sample size: 20 000 electron and 20 000 pion tracks

Implementation: Various single hidden-layer neural networks, all with 35 nodes in the hidden layer; some neural networks were trained across the entire momentum spectrum, while others were trained by splitting particles into either four or eleven momentum bins. A learning rate of 휂 = 0.1 was used and networks were trained for 500 epochs.

Results: Pion efficiency results as a function of momentum are summarised in Figure 37.

Figure 37: Pion efficiency results from [50]

4.2.2 [49] (PhD thesis, 2010): Input data: charge deposition, obtained by splitting each tracklet into 8 time-bin slices and summing within each slice

Implementation: various neural networks with varying numbers of nodes in the input layers and two hidden layers with 15 and 7 nodes, respectively. A learning rate of 휂 = 0.001 was used and networks were trained for 10 000 epochs.

Results: Pion efficiency results on ALICE Testbeam data from 2002 (at 푝 = 2GeV/푐) are shown as a function of the number of neurons in the input layer in Figure 38.

Figure 38: Pion efficiency results from [49]

4.2.3 [51] (PhD thesis, 2018): Input data: charge deposition, obtained by splitting each tracklet into 7 time-bin slices and summing within each slice

Implementation: Neural network with a 7 node input layer and 3 hidden layers with 10, 8 and 6 neurons respectively; various convolutional neural networks inspired by the “Inception-v4” network

Results: Results using a neural network are reported in [51] for only a single momentum bin, around p = 1 GeV/c. Particle identification was not the main aim of that thesis, but it is mentioned that the obtained results were generally on par with [49].

Figure 39: Pion efficiency results from [51]


Chapter 4: Implementation: Machine Learning for Particle Identification

4.3 Particle Identification Implementation: This thesis

Various Machine Learning strategies were employed for the task of particle identification; see [76] for the code used to build and train the classifiers discussed below. The implementation and results are presented chronologically, from the very early stages, where models did not perform well, up to the most successful results.

4.3.1 A Journey of Discovery

4.3.1.1 Stage 1 of Model Building: Establishing Naïve Benchmarks & Generating Features by Hand

The initial approach towards particle identification entailed manual feature generation. A first feature set was created from the following variables:

• Column sums of image pixels
• Five-number summaries (Minimum, First Quartile, Median, Third Quartile and Maximum values, as well as Mean values) calculated across all the non-zero pixels in an image
• The number of non-zero rows in an image
• The column means of image pixels
• The row means of image pixels

Column-wise scaling and centring was applied to normalise the dataset as follows:

푥′ = (푥 − mean(푥)) / sd(푥)

Equation 30
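Equation 30 can be sketched in a few lines of numpy; the array name and shape below are illustrative, not taken from the thesis code:

```python
import numpy as np

# Column-wise standardisation (Equation 30): zero mean, unit variance
# per feature column. X is a toy stand-in for the feature matrix.
X = np.random.default_rng(0).uniform(0, 10, size=(100, 41))
X_std = (X - X.mean(axis=0)) / X.std(axis=0)
```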

A linear regression model, fitted to the abovementioned dataset (which contained around 99% pions), was used as a very simple benchmark against which to compare future results. Using a naïve 푡푐푢푡 at the 99th percentile of the linear model’s output distribution, a single-tracklet pion efficiency of 휀휋 = 0.91% at electron efficiency 휀푒 = 3.92% was achieved. Following this, the value of 푡푐푢푡 was optimized by maximizing the area under the ROC curve when assessing the linear regression model’s results, which improved results to 휀휋 = 24.92% at electron efficiency 휀푒 = 66.85%.
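The naive percentile cut described above can be sketched as follows; the score distributions, sample sizes and class proportions here are synthetic, chosen only to mimic a ~99%-pion sample:

```python
import numpy as np

# Toy sketch of the naive benchmark cut: t_cut sits at the 99th percentile
# of the model's output scores; tracks above t_cut are called electrons.
rng = np.random.default_rng(0)
scores = np.concatenate([rng.normal(0.6, 0.15, 100),    # electrons (~1%)
                         rng.normal(0.4, 0.15, 9900)])  # pions (~99%)
labels = np.concatenate([np.ones(100), np.zeros(9900)])

t_cut = np.percentile(scores, 99)
eps_e = (scores[labels == 1] > t_cut).mean()   # electron efficiency
eps_pi = (scores[labels == 0] > t_cut).mean()  # pion efficiency (contamination)
```

With such an imbalanced sample, a fixed-percentile cut keeps contamination low only by sacrificing most electrons, which is exactly the behaviour reported above.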

It should be noted that class imbalances were not accounted for at this very early stage. The next phase of the benchmarking process involved experimenting with different methods to account for class imbalances, as well as hand-designing different feature sets. Among the approaches tried were the following:

Using a dataset with the same feature set described above, but in which electrons were “upsampled” with replacement to arrive at a sample equal in size to the pion sample. A fully connected neural network with architecture 256:128:64:32:16:2, with ReLU activations in the hidden layers and softmax activation in the output layer, was trained for 500 epochs and achieved 휀휋 = 13.15% at 휀푒 = 62%.
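Upsampling with replacement, as described above, amounts to drawing minority-class rows at random until the two classes match in size; the matrix names and sizes below are illustrative:

```python
import numpy as np

# Sketch of upsampling the electron class with replacement to match the
# pion sample size (toy feature matrices, 41 hand-designed features).
rng = np.random.default_rng(0)
X_elec, X_pion = rng.random((50, 41)), rng.random((500, 41))

idx = rng.integers(0, len(X_elec), size=len(X_pion))  # draw with replacement
X_elec_up = X_elec[idx]                               # now as large as the pion sample

X = np.vstack([X_elec_up, X_pion])
y = np.concatenate([np.ones(len(X_elec_up)), np.zeros(len(X_pion))])
```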

Next, a new dataset was created as follows: All electron tracks were included and a pion sample twice the size of the electron sample was added, before partitioning data into a training and test set. The following features were generated:

• The sum of all pixel values in each image
• Column sums of pixel values, scaled by dividing by the mean value of the entire matrix
• The lagged differences of the time evolution of pulse height (i.e. of the scaled column sums for each image), with both lag=1 and lag=2
• Further binning of the column sums, by adding the window sum of three columns at a time as an additional 8 features
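The feature list above can be sketched with numpy; the image array is synthetic, and “scaled by the mean value of the entire matrix” is read here as division by the global mean (one possible reading of the text):

```python
import numpy as np

# Hand-designed features from 17x24 tracklet images (toy data).
imgs = np.random.default_rng(0).random((5, 17, 24))

total = imgs.sum(axis=(1, 2))                  # sum of all pixel values per image
col_sums = imgs.sum(axis=1) / imgs.mean()      # column sums, scaled by global mean
lag1 = np.diff(col_sums, n=1, axis=1)          # lagged differences, lag = 1
lag2 = col_sums[:, 2:] - col_sums[:, :-2]      # lagged differences, lag = 2
win3 = col_sums.reshape(-1, 8, 3).sum(axis=2)  # window sums of 3 columns -> 8 features

features = np.concatenate([total[:, None], col_sums, lag1, lag2, win3], axis=1)
```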



A neural network with architecture 512:512:256:128:2, with ReLU activation functions in the hidden layers and softmax activation in the output layer, managed to increase electron acceptance slightly, but at the cost of very high pion contamination: 휀휋 = 63.1% at 휀푒 = 68%.

The final experiment during this phase took inspiration from the classification neural network currently in production at the TRD: time-bins were compressed by summing across three time-bins at a time, creating 8 new features, which were added to the 24 column sums.
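The time-bin compression above reduces to a reshape-and-sum; the data below is synthetic:

```python
import numpy as np

# Compress 24 time-bin column sums into 8 features by summing three
# consecutive bins at a time, then append them to the original 24.
col_sums = np.random.default_rng(0).random((5, 24))

compressed = col_sums.reshape(-1, 8, 3).sum(axis=2)        # (n, 8)
features = np.concatenate([col_sums, compressed], axis=1)  # (n, 32)
```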

Class imbalances were accounted for by allowing error in electron classification to contribute proportionately more to the loss function, using a class-weights dictionary (computed with the compute_class_weight function from the Scikit-Learn Python package [77] and passed via the class_weight parameter of the keras::fit function call). The neural networks built on this dataset overfit substantially to the training dataset, and it was not possible to calculate pion efficiency scores for this final experiment. With an initial benchmark thus established, we move on to the next stage of model building.
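The “balanced” weighting used by Scikit-Learn’s compute_class_weight follows the formula w_c = n_samples / (n_classes · n_c); it can be reproduced directly, as sketched here with toy label counts:

```python
import numpy as np

# The 'balanced' class-weight heuristic (as in Scikit-Learn's
# compute_class_weight): w_c = n_samples / (n_classes * n_c).
y = np.array([0] * 900 + [1] * 100)  # toy counts: 0 = pion, 1 = electron

classes, counts = np.unique(y, return_counts=True)
weights = len(y) / (len(classes) * counts)
class_weight = dict(zip(classes.tolist(), weights.tolist()))
# class_weight is then passed to Keras via the class_weight fit parameter.
```

Here the rare electron class receives a weight ten times that of the pion class, so each misclassified electron contributes proportionately more to the loss.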

4.3.1.2 Stage 2 of Model Building: High-Throughput Architecture- and Hyperparameter Search

During this stage of model building, class imbalances were accounted for by downsampling the pion sample to be equal in size to the electron sample (keeping only “clean” electron and pion tracks, i.e. those consisting of 6 tracklets). An attempt at chamber gain calibration, using information from Kr-runs, was made by [78]; it is not certain whether this calibration procedure was successful, but it was nonetheless this dataset that was used during the high-throughput stage of modelling. In addition, the data was scaled as follows:

푥′ = (푥 − max(푥)) / max(푥)

Equation 31

The SLURM-managed High-Performance Computing Cluster at UCT was used extensively to test the performance of various deep learning architectures, making it possible to train many deep learning models in parallel. During this stage, Keras was run under Python 3.6, with a TensorFlow CPU backend.

4.3.1.3 2D Convolutional Neural Networks

4.3.1.3.1 Summary

Dataset preparation: one image channel was added to the 푛 × 17 × 24 matrices, producing an input tensor of dimensions 푛 × 17 × 24 × 1.

Best Result: 휀휋 = 2.2% at electron efficiency 휀푒 = 90%.

4.3.1.3.2 Comprehensive Overview of Results and Discussion

Figure 40 compares the entire gamut of 2D Convolutional Neural Networks in terms of the number of layers (total and convolutional), the number of epochs trained, the learning rate and the optimiser used. Learning rate seems to be the most important factor in achieving low pion efficiency: some of the best-performing models used a learning rate of 휂 = 10−5 or 휂 = 10−6; a very high learning rate (휂 = 0.01) results in poorly performing models, whereas a very low learning rate (휂 = 10−7) does not converge within a feasible number of epochs. The models trained with low learning rates were all optimised using Adam, so no outright conclusions can be made about the best optimization algorithm, since the changes in learning rate coincided with the switch to Adam; common practice nonetheless suggests that Adam is the more robust algorithm to use, due to its adaptive nature.



(a) (b)

Figure 40 (a) Pion efficiency as a function of number of dense- (left) and convolutional- (right) layers (the same models are plotted on both sides of this figure) (b) Pion efficiency as a function of number of epochs trained, with additional information regarding model size and the optimizer used, indicated.

Although it is quite difficult to make any absolute statements about which model architecture is best suited to this problem, some important lessons were learnt.

Using the correct learning rate is essential: a very high learning rate will cause the gradient descent algorithm to jump over possible minima, whereas a learning rate which is too low will struggle to converge and might get stuck in local minima. Training for 50-100 epochs seems to be enough to obtain decently performing models, but the more important aspect to monitor is whether training loss and validation loss start to diverge, which indicates that the model is starting to overfit to the training set. This is hard to monitor when training on a server, but can be achieved by piping the output of the Python training script to a text file, setting the Keras training verbosity to level 2 and following the model’s progress with the tail -F out.txt bash command once the model has begun running (this is also useful for debugging). In terms of computational cost, training a convolutional network on the full dataset for 100 epochs can take 2-3 days.
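The monitoring workflow above can be sketched as follows; the script and file names are illustrative, and a short loop stands in for the long-running training job:

```shell
# Real usage would launch training detached, e.g.:
#   python train_cnn.py > out.txt 2>&1 &
# Simulated here with a loop that writes Keras-style (verbosity 2) epoch lines:
for i in 1 2 3; do echo "Epoch $i/100 - loss: 0.50 - val_loss: 0.62"; done > out.txt

# From a second shell, follow the log as it grows:
tail -n 1 out.txt   # swap for `tail -F out.txt` during a real run
```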

It is difficult to establish a clear guideline as to the number of layers (convolutional or dense) that will result in a low pion efficiency at high electron efficiency. Models with a total of 8-9 layers seem to perform well in general, whereas models with 6 or fewer layers perform slightly worse, although the model which gave the second-best performance had a total of only 4 layers, of which two were convolutional. Using more than one convolutional layer also appears to be a more successful strategy than using just one.



Using larger kernel sizes seems to be a good strategy, judging by the two best-performing models, both of which had 8x12 and 5x6 kernels in the first two layers of the network. The best-performing model was one of only two models with 4 convolutional layers; the other model with this much convolutional depth was also among the top 4 highest-performing models. There is no consistent indication of whether using Max Pooling improved performance in this task, but the top two models did not use it.

Most 2D Convolutional models were trained with a batch size of 32; while using a smaller batch size makes gradient updates slightly more volatile and therefore a little less accurate, using very large batch sizes makes this already computationally expensive procedure take even longer. Note that using the appropriate learning rate can account for the attendant volatility of the gradient which will inevitably result from a smaller batch size.

While a lot of time was spent on building different architectures, comparatively little time was spent on optimizing hyperparameters such as learning rate, batch size, dropout rate, weight-initialization strategy and the standard deviation of Gaussian Noise layers. In retrospect, setting up Randomized Grid Searches could have been useful.

4.3.1.4 1D Convolutional Neural Networks

4.3.1.4.1 Summary

Dataset preparation: one channel was added to the 24 column-sums, producing an input matrix of dimensions 푛 × 24 × 1.

Best Result: 휀휋 = 6.55% at electron efficiency 휀푒 = 90%

4.3.1.4.2 Comprehensive Overview and Discussion

Although far fewer models with 1D convolutional architectures were built, a deeper convolutional architecture seems to be a good strategy for particle identification with 1D convolutional layers. The best-performing model had four 1D convolutional layers, but it did not perform much better than the other models, which had fewer convolutional layers. It was also the only model with such a deep convolutional architecture, so it is hard to say whether low pion efficiency truly depends that heavily on the depth of a 1D CNN’s convolutional architecture.

Interestingly, all 1D CNN models used Tanh activation functions in their hidden dense layers, whilst 2D CNNs mostly relied on Sigmoid activations in theirs. The range of outputs from a Tanh activation is [−1,1], whereas the range of a Sigmoid activation is [0,1] (see Figure 42). The Tanh range is therefore symmetrical about zero and maps negative input values to negative output values, which provides for stronger gradients. Since the sigmoid function can only output positive values, it saturates more easily: if the pre-activation input to a sigmoid is a large negative value, its output saturates near zero and the gradient on that neuron might vanish, rendering it effectively untrainable.



(a)

(b)

Figure 41 (a) Pion efficiency results for 1D Convolutional Neural Networks as a function of number of dense- (left) and number of convolutional layers (right), (b) Pion efficiency as a function of number of epochs trained, with additional information about learning rate, number of total and convolutional layers and the optimizer used

It is possible that the high performance of 1D CNNs, despite being fed data with less information (column sums instead of full images), can be partly explained by the choice of activation function. In addition, the fact that the input data to 2D CNNs was extremely sparse, with most pixels being close to zero, could have left many neurons in the early layers saturated at zero and therefore untrainable. Perhaps a better normalisation strategy in the case of 2D CNNs would have been 푥′ = (푥 − mean(푥)) / sd(푥), which would have produced a zero-centred and scaled pixel distribution instead of an arbitrarily distributed one in the range [0,1], especially considering the choice of activation function that was used.

Another thing to consider is that no activation functions were applied to the convolutional layers of any of the 1D CNNs that were built, i.e. 휙(푥) = 푥. It is not possible to discern the effect of this strategy by comparing within the set of 1D CNNs, since all of them were built this way, but it is worth noting that the fourth-worst 2D CNN also employed this strategy, although that result is entangled with the high learning rate used in that particular 2D CNN. While 2D Convolutional Neural Networks were more successful than 1D Convolutional Neural Networks, the 1D CNNs nonetheless achieved similar performance despite being fed lower-dimensional data, which could indicate that reducing the dimensionality of the dataset cancels out some of the noise in the uncalibrated raw data. That said, many more 2D CNNs were built during this project, and one would expect a 2D CNN model with sufficient capacity (size) and exposure to enough training examples to be able to distinguish noise from signal, up to a certain point.

Figure 42: Sigmoid vs Tanh activation [79]

4.3.1.5 Fully Connected Feedforward Neural Networks

4.3.1.5.1 Summary

Dataset Preparation: the original 푛 × 17 × 24 matrices were collapsed down to 17 row-sums and 24 column-sums, arriving at a wide input matrix with dimensions 푛 × 41. Some networks were also trained with additional hand-designed features.

Best Result: 휀휋 = 14.86% at electron efficiency 휀푒 = 89.99%
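The collapse of each tracklet image into 41 summary features can be sketched as follows (toy image array; the names are illustrative):

```python
import numpy as np

# Collapse each 17x24 tracklet image into 17 row sums plus 24 column
# sums, giving the n x 41 input matrix described above.
imgs = np.random.default_rng(0).random((5, 17, 24))

row_sums = imgs.sum(axis=2)  # (n, 17)
col_sums = imgs.sum(axis=1)  # (n, 24)
X = np.concatenate([row_sums, col_sums], axis=1)  # (n, 41)
```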

4.3.1.5.2 Comprehensive Overview and Discussion

These networks gave the lowest performance of all models, although very few of them were built. Using a deeper architecture with a lower learning rate was slightly more successful than other strategies; designing a custom set of features was not particularly successful either. Results for fully connected feedforward neural networks are shown in Figure 43.

4.3.1.6 LSTM Neural Networks

4.3.1.6.1 Summary

Dataset Preparation: the original 푛 × 17 × 24 matrices were transposed to have dimensions 푛 × 24 × 17, i.e. 24 time-bins with 17 features.

Best Result: 휀휋 = 5.3% at electron efficiency 휀푒 = 90%



Figure 43: Pion efficiency as a function of the number of epochs trained for Fully Connected (Dense) Neural Networks, with additional information about learning rate, number of layers, the total number of neurons in the model and the optimizer used

4.3.1.6.2 Comprehensive Overview and Discussion

An interesting feature of this dataset is that it can be framed in multiple ways: as an image for 2D CNNs, as a flattened or summarised array for Dense Fully Connected Neural Networks, and as a time-series for 1D CNNs and LSTM Networks. By transposing each image matrix, an input feature set with dimensions 푛 × 24 timesteps × 17 features is obtained.
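The framings mentioned above are all reshapes of the same array; the names and shapes below are illustrative:

```python
import numpy as np

# One n x 17 x 24 dataset, framed for each architecture family discussed
# in this chapter (toy array; the shapes are the point here).
imgs = np.random.default_rng(0).random((5, 17, 24))

as_image = imgs[..., np.newaxis]           # (n, 17, 24, 1): 2D CNN input
as_sequence = imgs.transpose(0, 2, 1)      # (n, 24, 17): timesteps x features (LSTM)
as_1d = imgs.sum(axis=1)[..., np.newaxis]  # (n, 24, 1): column sums for 1D CNNs
as_flat = imgs.reshape(len(imgs), -1)      # (n, 408): flattened for dense networks
```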

LSTM Networks did give the second-lowest pion efficiency of all models trained, but one might have expected them to perform better than 2D CNNs, since a TRD signal is technically more of a time-series than an image. Perhaps their inferior performance can be explained by noise in the dataset, which might have been reduced by summing 3 columns sequentially while maintaining the number of rows, or by any number of other data preprocessing steps; generally, though, one would hope that a deep learning model performs that kind of feature extraction by itself.

The lowest pion efficiency using LSTM networks was 휀휋 = 8.5%, which is much higher than that achieved by the best 2D Convolutional Neural Networks. It was achieved by having 4 LSTM layers return sequences sequentially, followed by a final LSTM layer feeding into a dense architecture of 5 fully connected layers of 128 nodes each; similar performance (휀휋 = 8.9%) was obtained using only two LSTM layers, with the second layer running backwards, followed by four dense layers of 256 nodes each.

The results for LSTM networks are summarised in Figure 44.

4.3.1.7 Non-Deep Learning (Tree-Based) Models

Non-Deep Learning methods (Gradient Boosting Machines and Random Forests) were implemented locally with H2O.ai, using the default parameter settings for each model type.

This was mainly done as a sanity check, but it produced decent results with minimal effort.

4.3.1.7.1 Random Forests

Dataset preparation: 24 column sums and 17 row sums for each tracklet (normalised and standardised).

Quick theoretical overview and implementation:

A random forest is a tree-based algorithm which grows 푘 decision trees on 푘 bootstrapped samples of the dataset, considering a random sample of 푚 < 푝 features at each split. This decorrelating procedure prevents strong predictors from having an overly large influence on the outcome of each tree; the outputs of all the trees grown are then averaged.
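The two sources of randomness described above can be shown in isolation; the sizes are illustrative (a real forest repeats this for every tree and every split):

```python
import numpy as np

# Random-forest randomness, in isolation: a bootstrap sample of rows for
# one tree, and a random subset of m = sqrt(p) features for one split.
rng = np.random.default_rng(0)
n, p = 1000, 41

bootstrap_rows = rng.integers(0, n, size=n)            # rows drawn with replacement
m = round(np.sqrt(p))                                  # sqrt(41) ~ 6 features per split
split_features = rng.choice(p, size=m, replace=False)  # candidate features at a split
```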



The default h2o.randomForest parameter settings were used (50 trees, maximum tree-depth=20, number of variables sampled at each split=√푝 = √41 ≈ 6, etc.), using 10-fold cross-validation.

Results and Discussion:

휀휋 = 5.8% at electron efficiency 휀푒 = 90%

These results were on par with LSTM and 1D Convolutional neural networks’ results, without any fine-tuning or exploration of different hyper-parameter settings.

(a) (b)

Figure 44 (a) Pion efficiency results for LSTM Neural Networks as a function of number of dense- (left) and number of LSTM layers (right), (b) Pion efficiency as a function of number of epochs trained, with additional information about learning rate, number of total- and LSTM layers and the optimizer used

4.3.1.7.2 Gradient Boosting Machines

Dataset preparation: 24 column sums and 17 row sums for each tracklet (normalised and standardised).



Quick theoretical overview and implementation:

Gradient-boosting machines (GBMs) are another tree-based method, involving the sequential growth of multiple decision trees. Multiple weak learners (shallow trees) are combined, with each subsequent tree fitted to the residuals (errors) of the ensemble so far; i.e. each following tree accounts for additional variation which was not explained by the preceding trees in the chain.
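The residual-fitting idea above can be shown on a toy 1D regression task, with depth-1 stumps as the weak learners; all names, data and settings here are illustrative, not the H2O implementation:

```python
import numpy as np

# Minimal sketch of gradient boosting for squared error: each stump is
# fitted to the residuals left by the ensemble built so far.
rng = np.random.default_rng(1)
x = rng.uniform(0, 1, 200)
y = (x > 0.5).astype(float)  # target: a step function

def fit_stump(x, resid):
    """Find the threshold whose two-sided means best fit the residuals."""
    best = None
    for t in np.linspace(0.05, 0.95, 19):
        mask = x <= t
        if not mask.any() or mask.all():
            continue  # skip degenerate splits
        left, right = resid[mask].mean(), resid[~mask].mean()
        err = ((np.where(mask, left, right) - resid) ** 2).sum()
        if best is None or err < best[0]:
            best = (err, t, left, right)
    return best[1:]

eta, pred = 0.1, np.zeros_like(y)  # learning rate and ensemble prediction
for _ in range(50):
    t, left, right = fit_stump(x, y - pred)       # fit the current residuals
    pred += eta * np.where(x <= t, left, right)   # shrink and add the stump

mse = ((pred - y) ** 2).mean()
```

Because each round removes a fraction 휂 of the remaining residual, the ensemble error shrinks steadily, which is the behaviour the chained trees in a GBM exploit.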

The default h2o.gbm parameters were used (50 trees, maximum tree-depth=5, learning rate 휂 = 0.1, etc.), using 10-fold cross-validation.

Results and Discussion:

휀휋 = 6.59% at electron efficiency 휀푒 = 89.99%

Similar to the random forest experiment, GBMs produced results comparable to those of the LSTM and 1D Convolutional Neural Networks, using the default parameters of the H2O package.

4.3.1.7.3 Discussion of Tree-based Methods

The fact that two algorithms with an entirely different theoretical underpinning from neural networks performed so well with minimal tuning allows us to deduce a few things. Firstly, there appears to be a limit to the amount of information contained in the input dataset, since almost all of these algorithms resulted in very similar 휀휋. Secondly, there might be something to be gained from exploring these tree-based methods further; note, however, that the ten-fold cross-validation implemented for these two algorithms was not used for any of the deep learning algorithms, which might be part of the reason they performed so well.

4.3.1.8 Discussion of Stage 2 of Model Building

At the conclusion of the second stage of model building, the 2D convolutional neural network architecture shown in Appendix II: Figure 78 achieved the lowest pion efficiency, 휀휋 = 2.2% at electron efficiency 휀푒 = 90%, using the Adam optimizer with learning rate 휂 = 10−4, trained for 100 epochs with a batch size of 32, and binary cross-entropy as the loss function to be optimized.

The training and validation accuracy and loss graphs for this model are depicted below.

(a) (b)

Figure 45: (a) Training vs Validation Accuracy. (b) Training vs Validation Loss

Pion efficiency at 90% electron efficiency was calculated by writing a function which, when minimized, finds the cut-off point (푡푐푢푡, critical region) in the distribution of the test statistic obtained by combining all 6 (tracklet) estimates for a track, as outlined in 3.1.7, such that 90% of true electron tracks are classified as electrons.
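The cut-selection above can be sketched directly: choosing 푡푐푢푡 at the 10th percentile of the electron scores leaves 90% of true electrons above the cut, and the pion efficiency is then the fraction of pions above that same cut. The track-level scores below are synthetic:

```python
import numpy as np

# Synthetic track-level P(electron) scores for illustration only.
rng = np.random.default_rng(0)
p_elec = np.concatenate([rng.beta(5, 2, 1000),    # electrons: scores near 1
                         rng.beta(2, 5, 1000)])   # pions: scores near 0
labels = np.concatenate([np.ones(1000), np.zeros(1000)])

t_cut = np.quantile(p_elec[labels == 1], 0.10)  # 90% of electrons above t_cut
eps_e = (p_elec[labels == 1] > t_cut).mean()    # ~0.90 by construction
eps_pi = (p_elec[labels == 0] > t_cut).mean()   # pion efficiency at 90% eps_e
```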



The ultimate result of the minimization process is shown in Figure 46. In this set-up, any track which received a combined probability above 푡푐푢푡 (shown as a vertical red line) was classified as an electron; otherwise it was classified as a pion.

Figure 46: 풕풄풖풕 selection in t(x)/P(elec) allowing for 90% electron efficiency

Figure 47: Distribution of 휺흅 for each model type

Looking at Figure 47, it is starkly apparent that the outperformance of 2D CNNs could simply be a function of the number of such models that were explored. There are also ways of combining LSTM and convolutional neural networks (see for example [80]) which might have been promising, but due to time constraints this was not explored.

4.3.1.9 Some Remarks About Regularization

Figure 48 shows the effect of applying too much Dropout at training time. The left side of the figure shows a model which overfit during training; the right-hand side shows the effect of applying a Dropout rate of 0.5 to each layer of the same network.



Figure 48: No Dropout (left-hand side) vs the Same Model with too much Dropout (right-hand side)

In general, using a Dropout rate of 0.2 on the fully connected layers of the network proved to be a decent strategy, bearing in mind that other hyperparameters also play a role. A Gaussian Noise layer was used as the first layer of various convolutional architectures, but this was unsuccessful, as depicted in Figure 49: all models that incorporated Gaussian noise gave pion efficiencies of 휀휋 > 45%. While there was not much experimentation with different standard deviations for the Gaussian noise layer, this method seems more applicable to less sparse image data: the layer is only active at training time, and since most of the rows in our input data are naturally zero, adding Gaussian noise only serves to confuse the model when it is evaluated at test time.

Figure 49: Training and validation accuracy for an example 2D CNN model making use of a Gaussian Noise layer with σ=0.2

It is important to apply some form of regularization, or to make use of early stopping, to prevent overfitting to the training dataset, which would otherwise severely degrade performance on the test dataset.

4.3.1.10 Stage 3 of Model Building: Hitting the Sweet-Spot

The most successful pion rejection and electron acceptance results were obtained by incrementally training a 2D Convolutional Neural Network, using Focal Loss as the loss function to be optimised and Adam as the optimizer, at a learning rate of 휂 = 1 × 10−4.
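For reference, the binary focal loss (Lin et al.) down-weights easy, well-classified examples by a factor (1 − 푝푡)^훾, so abundant easy pions contribute little to the loss; the 훾 and 훼 values below are the common defaults, not necessarily those used in this thesis:

```python
import numpy as np

# Binary focal loss as a numpy sketch. p_t is the predicted probability of
# the true class; a_t is the class-balancing weight.
def focal_loss(y_true, p_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    p = np.clip(p_pred, eps, 1 - eps)
    p_t = np.where(y_true == 1, p, 1 - p)
    a_t = np.where(y_true == 1, alpha, 1 - alpha)
    return float(np.mean(-a_t * (1 - p_t) ** gamma * np.log(p_t)))
```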

Dataset preparation:

• The full (uncalibrated) dataset, split into the following momentum bins, was used during this stage:
  o 푝 ≤ 2 GeV/푐, 2 GeV/푐 < 푝 ≤ 3 GeV/푐 and 3 GeV/푐 < 푝 ≤ 4 GeV/푐
  o Results in the 4 GeV/푐 < 푝 ≤ 5 GeV/푐 and 5 GeV/푐 < 푝 ≤ 6 GeV/푐 bins were much worse and are not included here
• All tracklets with no signal, i.e. images where all the pixel values were zero, were removed.
• Data was not scaled, normalised or standardised.



• Data was not down-sampled or up-sampled to account for class imbalances.

Implementation and Results:

A Convolutional Neural Network (architecture shown in Appendix II: Figure 79) was trained incrementally, per momentum bin, by saving the weights-configuration of the model after training on the previous momentum bin.

A detailed description of the steps followed in this process is given below, and the final results are summarised in Figure 51, in comparison with the current methods used by the TRD working group of the ALICE collaboration, as discussed in 2.3.4.

Keras allows one to resume training of a model by increasing the epochs argument of the keras::fit() function to the total number of desired epochs (i.e. the number of epochs from the previous training round plus the additional epochs to be run) and specifying an initial_epoch argument, which should be the final epoch number from the previous training round.

Using this functionality, a 2D Convolutional Network was trained on particles in the 푃 ≤ 2퐺푒푉 range for 10 epochs (note that the class imbalance at this momentum range is less pronounced than for other momentum bins, cf. Figure 32). Figure 50 (a) shows the accuracy and loss curves obtained.

When it became clear that the model was not yet overfitting, but had some statistical power (validation accuracy ~80% compared to what a naïve majority class classifier would give, i.e. ~76%), the model was trained for an additional 20 epochs (Figure 50 (b)).

This model performed as follows on an independent test set of particles in the same momentum range that it was trained on: 휀휋 = 1.2% at 휀푒 = 89.99%, which is much better than anything obtained during the second stage of model building. Surprisingly, this model performed almost equally well on particles in the following momentum bin: performance on particles with momenta 2 퐺푒푉 < 푃 ≤ 3 퐺푒푉 was 휀휋 = 1.4% at 휀푒 = 90%, before the model had been trained on particles in this bin.

Another feature of Keras is that the optimizer state of a saved model can be discarded and only the weights of a previously trained model loaded into a new model with the same architecture. Using this functionality, training was continued from the pre-trained model on particles in the 2 퐺푒푉 < 푃 ≤ 3 퐺푒푉 range for 5 epochs.
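The two Keras mechanisms described above (resuming via epochs/initial_epoch, and reloading weights without optimizer state) can be sketched on a toy model; all names, sizes and epoch counts here are illustrative, and TensorFlow is assumed to be installed:

```python
import numpy as np
from tensorflow import keras  # assumes a TensorFlow/Keras installation

def build_model():
    m = keras.Sequential([
        keras.layers.Input(shape=(17, 24, 1)),
        keras.layers.Conv2D(4, 3, activation="relu"),
        keras.layers.Flatten(),
        keras.layers.Dense(2, activation="softmax"),
    ])
    m.compile(optimizer=keras.optimizers.Adam(1e-4),
              loss="sparse_categorical_crossentropy")
    return m

x1, y1 = np.random.rand(8, 17, 24, 1), np.random.randint(0, 2, 8)
x2, y2 = np.random.rand(8, 17, 24, 1), np.random.randint(0, 2, 8)

model = build_model()
model.fit(x1, y1, epochs=2, verbose=0)               # first bin: epochs 0-1
# Resume: total epochs = 2 + 3, with the counter starting at epoch 2.
model.fit(x2, y2, epochs=5, initial_epoch=2, verbose=0)

# Keep only the weights (optimizer state is discarded) for the next bin.
model.save_weights("pretrained.weights.h5")
fresh = build_model()
fresh.load_weights("pretrained.weights.h5")
```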

When evaluated on a test set in this momentum range (2 퐺푒푉 < 푃 ≤ 3 퐺푒푉), performance increased slightly to 휀휋 = 1.14% at 휀푒 = 89.99%; and when tested on particles in the next momentum range (3 퐺푒푉 < 푃 ≤ 4 퐺푒푉), performance remained surprisingly high: 휀휋 = 1.16% at 휀푒 = 89.86%.

This model was then trained for a further 5 epochs on particles in the 3 퐺푒푉 < 푃 ≤ 4 퐺푒푉 range. Performance on this range actually decreased, to 휀휋 = 1.51% at electron efficiency 휀푒 = 89.99%, which might be explained by the much smaller proportion of electrons in this momentum range than in the lower ranges. When tested on particles in the next momentum range (4 퐺푒푉 < 푃 ≤ 5 퐺푒푉), the model’s generalizability broke down and it predicted “pion” everywhere.

The incrementally trained model, which by this stage had become extremely biased towards predicting “pion” with very high certainty, was then trained on particles in the 4 퐺푒푉 < 푃 ≤ 5 퐺푒푉 range for a further 5 epochs. Having by now seen far more pions than electrons, the focal loss function seemed no longer able to mitigate the extreme class imbalance. The model was not trained on particles in the 5 퐺푒푉 < 푃 ≤ 6 퐺푒푉 range, since it had already started overfitting (beyond rescue) to the imbalanced class distribution.

An interesting fact about the results obtained during the final stage of model building is that the data the model was trained on was not scaled in any way. Perhaps this was a contributing factor to the low pion efficiencies obtained, possibly because the input signal arrays are very sparse, with most pixels equal to zero; i.e. perhaps the gradients during backpropagation flow more strongly when such sparse input arrays remain unscaled. This is speculation, however; for the most part, neural networks remain proverbial black boxes.

Finally, these results are compared to previous results in Figure 51.

(a) (b)

Figure 50 Training and loss curves for the most successful model, making use of Focal loss as the objective function to be minimised, trained for particles in the 2 GeV range (a) for 10 epochs, (b) for an additional 20 epochs

4.4 Chapter Discussion and Conclusions

While a considerable amount of time was spent on finding an optimal architecture for particle identification, it is interesting to note that a wide variety of neural network architectures (1D CNNs, 2D CNNs, neural networks making use of LSTM cells) and other algorithms (Gradient Boosting Machines and Random Forests) all arrived at less than 7% pion efficiency at 90% electron efficiency on uncalibrated (or semi-calibrated) data.

To reiterate: when aiming to achieve very low pion efficiencies, it is more important to ensure that properly calibrated input data is used than to search for the optimal neural network architecture and hyperparameter combination, since the ALICE TRD is an extremely sensitive detector with a variety of factors of variation that can influence how the obtained signal manifests. Chamber gain, pad-by-pad variations, environmental and other factors all influence how the recorded signal manifests at a specific point in time. In this project, the uncalibrated raw signal data from separate pPb runs was used. In some cases, the signal was completely empty or could clearly be seen to be the result of, or influenced by, noise. However, each signal was effectively treated as if it originated from the same measurement mechanism; this explains the decreased performance compared to previous work done in this area.

Figure 51: Summary of incrementally trained 2D Convolutional Neural Network: Pink marks indicate pion efficiency at 90% electron efficiency (when the incrementally trained model was evaluated on data from the momentum-bin that the model was last trained on). Purple marks indicate the results when testing the model on data from the next (higher) momentum bin (before training on data in that bin). Previous work done on 흅/풆 particle identification based on TRD data is shown for reference. Raw data points for the other methods were received from [81].

In addition, there is a potential limit to how much information is actually contained in these images and, quite possibly, a hard lower limit on the achievable 휀휋 based on TRD data in isolation (especially when uncalibrated), regardless of how sophisticated and optimised a model one uses. However, during a full physics analysis, TRD data would not be the only data available for particle identification; combining it with data from the TPC detector, for instance, would yield a very low 휀휋.

4.4.1 Future work

There are advanced hyperparameter tuning strategies, such as Bayesian, evolutionary and gradient-based hyperparameter optimisation techniques, that could be employed in future work in this area. However, it is not recommended to expend further effort on particle identification with an uncalibrated dataset, since there seems to be an upper limit to the amount of distinguishing information that this type of data carries about the particle ID. The inferior results compared to previous work are definitely not due to an insufficient amount of exploration of the space of possible models or hyperparameter settings, but to inherent limitations of the input dataset used for training.

Future work on properly calibrated data could explore tree-based methods in more depth, make use of cross-validation for neural networks, or employ ensembles of algorithms, in which multiple well-performing algorithms “vote” towards an outcome: either through a weighted average based on their performance, by using each of their outputs as input features to another algorithm (even something as simple as a linear regression model), or by making use of Bayesian and likelihood-ratio methods based on predictions from multiple algorithms.

The reason ensembling is suggested is that models with different architectures are unlikely to produce similar downstream derived features; they might therefore make different “cognitive errors” during prediction, which could be compensated for by averaging or otherwise combining their outputs.


5 THEORY: HIGH ENERGY PHYSICS DETECTOR SIMULATIONS

5.1 Introduction

This chapter will cover various methods used for the simulation of TRD digits data in this project. The traditional Monte Carlo-based simulation software used for High Energy Physics simulations (Geant4), as well as three types of latent variable models, will be introduced theoretically, following which an assessment of the performance of each method will be shown, according to the methodology described in Section 6.1.

5.2 Monte Carlo Simulations: Geant4

5.2.1 Background

As a general toolkit to simulate the passage of particles through matter, Geant4 is used in a wide array of applications and fields, from space engineering to medical, particle and nuclear physics. Geant4 provides functionality for geometry, tracking, hits and physics models, over a wide range of energies, particles, materials and elements [82].

Geant4 was designed as the detector simulation component of a typical physics simulation setup, which generally contains the following components: an event generator, detector simulation, reconstruction and analysis. Geant4 is usually tied to an event generator such as Pythia or HIJING, with ROOT used for Reconstruction and Analysis. As such, Geant4 has well-defined interfaces to the other components in the simulation set-up [83]. Its physics implementation is transparent to investigation and validation and can be customised and extended [82].

Simulating the passage of particles through matter involves the following key elements:

• Geometry and materials
• Digitisation and hit management
• Particle interactions in matter
• Management of tracks and events
• Management of tracking
• Visualisation and a user interface

Each of these elements is implemented in a class category, with a well-defined interface [82]. Figure 52 shows how these categories are related, with lines indicating a “using” relationship (category with open circle uses the adjoining category). A full discussion of Geant4 lies outside the scope of this thesis, but it is worth recognising the complexity of its implementation.


Figure 52: Top-level category diagram of the Geant4 toolkit [82]

5.3 Deep Generative Models

Generative models are concerned with modelling potentially high-dimensional distributions. Dependencies between various random variables in the multidimensional distribution can also be captured during this modelling process [84].

Generative models are concerned with generating data that is similar to seen data, but not exactly the same; i.e. our training examples 푋 are distributed according to some unknown distribution 푃(휒), and we want to model a distribution 푃 which is as similar as possible to 푃(휒) and which therefore allows us to generate new examples 푋 by sampling from 푃 [84].

Neural networks can be utilised as function approximators towards constructing a modelled distribution 푃 as outlined above [84]. Deep Generative Modeling usually entails making use of the concept of a “latent space” representation of the input data; this will be explained in the following sections.

5.3.1 Background: Latent Variable Models

When there are complex dependencies between the dimensions of the data, generative models become very hard to train. Latent variables are samples drawn from specific latent distributions constructed during training, before the generative process commences; i.e. the model first chooses what it is going to simulate (from the latent representation space) before it starts simulating [84].

In order to deduce that a generative model is representative of the underlying distribution, one needs to find that for each datapoint 푋 in 휒, there are one or more latent variable settings which result in the model generating something sufficiently similar to 푋 [84].


A vector of latent variables 푧 is sampled from a high-dimensional latent space 푍, according to a probability density function (p.d.f.) 푃(푧) defined over 푍. A family of deterministic functions 푓(푧; 휃) is parameterized by a vector 휃 in some space 훩, with 푓: 푍 × 훩 → 휒. While 푓 is deterministic, 푧 is randomly sampled and 휃 is fixed, which makes 푓(푧; 휃) a random variable in the space 휒. 휃 needs to be optimized so that sampling 푧 from 푃(푧) will, with high probability, result in 푓(푧; 휃) outputting data similar to the training data 푋 [84].

More formally, we want to maximize the probability of each 푋, according to:

푃(푋) = ∫ 푃(푋|푧; 휃 )푃(푧)푑푧

Equation 32

In Equation 32, 푓(푧; 휃) has been changed to a distribution 푃(푋|푧; 휃 ), in order to show explicitly that 푋 depends on 푧. Maximum Likelihood underpins the notion that if 푋 is likely to be reproduced, generated examples that are highly similar to 푋 are also likely to be produced, and dissimilar examples are unlikely [84].

Generative models often model the output distribution as a Gaussian, 푃(푋|푧; 휃) = 푁(푋|푓(푧; 휃), 휎²퐼), i.e. the distribution has mean 푓(푧; 휃) and covariance equal to the scalar 휎² multiplied by the identity matrix 퐼, with 휎 being a tuneable hyperparameter [84].

A generative model will in general not produce examples identical to any 푋, especially not during early training, but under the Gaussian assumption, 푃(푋) can be increased via gradient descent by making 푓(푧; 휃) approach 푋 given some 푧 [84].

5.3.2 Variational Autoencoders

Variational Autoencoders (VAEs) aim to maximize 푃(푋) = ∫ 푃(푋|푧; 휃)푃(푧)푑푧 by defining latent variables 푧 and integrating over 푧. Choosing the latent variables 푧 is not trivial, since 푧 is not defined by labelled attributes of the example that needs to be generated, but by other latent features specific to the example [84]. Generally, a researcher would not explicitly specify what the dimensions of 푧 encode, nor how the dimensions of 푧 depend on one another [84].

Figure 53: Simplified diagram of a Variational Autoencoder


In VAEs, 푧 is drawn from a distribution 푁(0, 퐼), where 퐼 is the identity matrix, since any distribution in 푑 dimensions can be generated by sampling from 푑 normally distributed variables and mapping them through a function with high enough capacity to generate 푋. When 푓(푧; 휃) is built from neural networks, the initial (encoding) network is involved in generating 푧, while the later (decoding) network is concerned with mapping 푧 to 푋. 푃(푋) will be maximized by finding a computable formula for it, taking its gradient at each epoch and optimizing it using stochastic gradient descent [84].

푃(푋) can be computed approximately by repeatedly sampling 푛 latent values 푧 = {푧1, 푧2, … , 푧푛} and computing 푃(푋) ≈ (1/푛) ∑푖 푃(푋|푧푖). In high-dimensional spaces, 푛 might have to be very large before 푃(푋) can be accurately approximated [84].
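The impracticality of this naive estimator can be illustrated numerically. The snippet below is a toy sketch (not thesis code): it assumes a Gaussian decoder 푃(푋|푧) = 푁(푋|푓(푧), 휎²퐼) with a hypothetical linear 푓, and estimates 푙표푔 푃(푋) by plain Monte Carlo over 푧 ~ 푁(0, 퐼).

```python
import numpy as np

rng = np.random.default_rng(0)

def log_gaussian(x, mu, sigma):
    """log N(x | mu, sigma^2 * I) for a d-dimensional point x."""
    d = x.size
    return -0.5 * (d * np.log(2.0 * np.pi * sigma**2)
                   + np.sum((x - mu) ** 2) / sigma**2)

def log_p_x_naive(x, f, n_samples, latent_dim, sigma=1.0):
    """Naive Monte Carlo estimate of log P(X), where
    P(X) ~= (1/n) * sum_i P(X | z_i) with z_i ~ N(0, I)."""
    log_terms = np.array([log_gaussian(x, f(rng.standard_normal(latent_dim)), sigma)
                          for _ in range(n_samples)])
    m = log_terms.max()                 # log-sum-exp trick for numerical stability
    return m + np.log(np.mean(np.exp(log_terms - m)))

# hypothetical deterministic decoder f(z; theta): a fixed linear map to data space
W = rng.standard_normal((8, 2))
f = lambda z: W @ z
x = f(np.array([0.3, -0.7]))            # a "data point" the decoder can reach
estimate = log_p_x_naive(x, f, n_samples=5000, latent_dim=2)
```

Even in this 8-dimensional toy case the estimate fluctuates noticeably between runs; for image-sized 푋, almost every sampled 푧 contributes essentially nothing, which is exactly why the importance distribution 푄(푧|푋) introduced next is needed.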

For most 푧, 푃(푋|푧) will be close to zero. For the VAE to be useful, we need to sample only those 푧 values that are likely to have resulted in 푋. A new function 푄(푧|푋) is therefore needed, which takes an existing 푋 value and produces a distribution over 푧 values that could realistically have resulted in 푋 being generated; this narrows the universe of 푧 values down from the larger universe of all 푧′푠 likely under the prior 푃(푧) [84].

How 퐸푍~푄[푃(푋|푧)] and 푃(푋) are related is one of the basic tenets upon which variational Bayesian methods are built. The Kullback-Leibler divergence (풟, also referred to as relative entropy, which essentially measures how different two probability distributions are) between 푄(푧) and 푃(푧|푋), for an arbitrary 푄 which does not necessarily have to depend on 푋, is given by:

풟[푄(푧)||푃(푧|푋)] = 퐸푍~푄[푙표푔 푄(푧) − 푙표푔 푃(푧|푋)]

Equation 33

푃(푋) and 푃(푋|푧) can be introduced into this equation by applying Bayes’ rule to 푃(푧|푋):

풟[푄(푧)||푃(푧|푋)] = 퐸푍~푄[푙표푔 푄(푧) − 푙표푔 푃(푋|푧) − 푙표푔 푃(푧)] + 푙표푔 푃(푋)

Equation 34

Since 푙표푔 푃(푋) does not depend on 푧, it can be moved outside the expectation. Rearranging this formula, negating both sides, and contracting part of 퐸푍~푄 into a KL-divergence term gives:

푙표푔 푃(푋) − 풟[푄(푧)||푃(푧|푋)] = 퐸푍~푄[푙표푔 푃(푋|푧)] − 풟[푄(푧)||푃(푧)]

Equation 35

In the above equation, 푋 is fixed and 푄 can be any distribution, regardless of whether it accurately maps 푋 to 푧′푠 that could have produced 푋, but since the goal is to accurately infer 푃(푋), a 푄 needs to be found which does depend on 푋 and which also keeps 풟[푄(푧)||푃(푧|푋)] as small as possible:

푙표푔 푃(푋) − 풟[푄(푧|푋)||푃(푧|푋)] = 퐸푍~푄[푙표푔 푃(푋|푧)] − 풟[푄(푧|푋)||푃(푧)]

Equation 36

Equation 36 is the central formula of the VAE. The left-hand side contains what needs to be maximized, 푙표푔 푃(푋), penalized by −풟[푄(푧|푋)||푃(푧|푋)] (which will be small if 푄 is a high-capacity distribution that produces 푧 values likely to reproduce 푋); the right-hand side is differentiable and can therefore be optimized using gradient descent.

When looking at the above equation, the right-hand side takes the form of an autoencoder: 푄 encodes 푋 into latent variables 푧, and 푃 decodes these latent variables to reconstruct 푋.

On the left side of the equation, 푙표푔 푃(푋) is being maximized while 풟[푄(푧|푋)||푃(푧|푋)] is being minimized. 푃(푧|푋) is not analytically solvable and simply describes the 푧 values likely to reproduce 푋; the KL-divergence term on the left forces 푄(푧|푋) to be as similar as possible to 푃(푧|푋), and under a model with sufficient capacity, 푄(푧|푋) should be able to match 푃(푧|푋) exactly, which will result in 풟 being zero. This allows 푙표푔 푃(푋) to be optimized directly. In addition, 푃(푧|푋) is no longer intractable in this case, since 푄(푧|푋) can be used to compute it.

In order to optimize the right-hand side of the above equation via gradient descent, 푄(푧|푋) will usually take the form:

푄(푧|푋) = 푁(푧|휇(푋; 휗), 훴(푋; 휗))

Equation 37

Where 휇 and 훴 are deterministic functions with learnt parameters 휗; in practice, 휇 and 훴 are learnt via neural networks and 훴 is constrained to be a diagonal matrix. 풟[푄(푧|푋)||푃(푧)] therefore becomes a KL-divergence between two multivariate Gaussians, which can be computed in closed form as:

풟[푁(휇0, 훴0)||푁(휇1, 훴1)] = ½ (푡푟(훴1⁻¹훴0) + (휇1 − 휇0)⊺ 훴1⁻¹ (휇1 − 휇0) − 푘 + 푙표푔(푑푒푡 훴1 / 푑푒푡 훴0))

Equation 38

With k indicating the number of dimensions of the distribution; this can be simplified to become:

풟[푁(휇(푋), 훴(푋))||푁(0, 퐼)] = ½ (푡푟(훴(푋)) + (휇(푋))⊺(휇(푋)) − 푘 − 푙표푔 푑푒푡(훴(푋)))

Equation 39
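Equation 39 is cheap to evaluate. A minimal NumPy sketch, with hypothetical 휇 and diagonal 훴 values (not taken from the thesis models), is:

```python
import numpy as np

def kl_to_standard_normal(mu, var):
    """Equation 39: D[N(mu(X), Sigma(X)) || N(0, I)] for a diagonal
    Sigma(X) whose diagonal entries are `var`, in k dimensions:
    0.5 * (tr(Sigma) + mu^T mu - k - log det Sigma)."""
    k = mu.size
    return 0.5 * (np.sum(var) + mu @ mu - k - np.sum(np.log(var)))

# hypothetical encoder outputs for a single input X, k = 3 latent dimensions
mu = np.array([0.5, -1.0, 0.2])
var = np.array([0.8, 1.5, 0.3])
kl = kl_to_standard_normal(mu, var)
```

Note that the divergence vanishes exactly when the encoder outputs 휇 = 0 and 훴 = 퐼, i.e. when 푄(푧|푋) matches the prior.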

The other term on the right-hand side of the equation, 퐸푍~푄[푙표푔 푃(푋|푧)], can be estimated by taking a single sample of 푧 and calculating 푃(푋|푧) for that sample to approximate the expectation.

Since stochastic gradient descent is performed in practice over different 푋 values from the dataset 퐷, we want to perform gradient descent on the following formula:

퐸푋~퐷[푙표푔 푃(푋) − 풟[푄(푧|푋)||푃(푧|푋)]] = 퐸푋~퐷[퐸푍~푄[푙표푔 푃(푋|푧)] − 풟[푄(푧|푋)||푃(푧)]]

Equation 40

By sampling a single value of 푋 and a single value of 푧, we can compute the gradient of 푙표푔 푃(푋|푧) − 풟[푄(푧|푋)||푃(푧)], which when averaged over multiple samples, converges to the full equation to be optimized.

The issue here is that 퐸푍~푄[푙표푔 푃(푋|푧)] depends not only on the parameters of 푃, but also on those of 푄; this is not accounted for in the above equation. For VAEs to work properly, 푄 needs to be driven to produce 푧′푠 from 푋 that are likely to be reliably decoded by 푃.

Figure 54 (a) illustrates how this proxy formula can be used by averaging over multiple samples to get to the expected outcome; but, since there is a sampling procedure embedded within the theoretical neural network, gradient descent cannot be performed on it.

Figure 54 (b), on the other hand, shows how a “reparameterization trick”, which removes the sampling procedure from the neural network proper and treats it as an input layer, is implemented in practice. Since we have 휇(푋) and 훴(푋), we can sample 휖 from 푁(0, 퐼) and compute 푧 from 휖 as follows: 푧 = 휇(푋) + 훴^(1/2)(푋) ∗ 휖.

As a result, the gradient of the following equation will actually be taken:

퐸푋~퐷 [퐸휖~푁(0,퐼)[푙표푔 푃(푋|푧 = 휇(푋) + 훴^(1/2)(푋) ∗ 휖)] − 풟[푄(푧|푋)||푃(푧)]]

Equation 41
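A minimal sketch of the reparameterization step in Equation 41, using NumPy and hypothetical encoder outputs (the thesis models were built in a deep learning framework; this is only an illustration of the sampling step):

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """z = mu(X) + Sigma^(1/2)(X) * eps, with eps ~ N(0, I).
    The randomness enters as an input (eps), so gradients can flow
    through mu and log_var during backpropagation."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

rng = np.random.default_rng(0)
# hypothetical encoder outputs for a batch of 4 inputs, 5 latent dimensions
mu = np.array([[0.0, 1.0, -1.0, 0.5, 2.0]] * 4)
log_var = np.full((4, 5), -1.0)     # i.e. each sigma^2 = e^(-1)
z = reparameterize(mu, log_var, rng)
```

Parameterising the standard deviation as exp(½ log 휎²) is a common convention that keeps it positive without constraining the network output.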



Figure 54: Training-time VAE (a) Theoretical implementation (b) Reparameterization trick, which is implemented in practice


Figure 55: Testing time VAE

Once the model is ready to be tested, values 푧 ~ 푁(0, 퐼) are sampled and given as input to the decoder; the encoder, along with the attendant reparameterization trick used during training, is no longer needed.

5.3.3 Generative Adversarial Networks

Figure 56: Simplified Diagram of a Generative Adversarial Network

Generative Adversarial Networks (GANs) are a deep learning framework which pits two neural networks against each other in an adversarial mini-max game: the generative model 퐺 is trained to the point where it accurately captures the distribution of the training data, while the discriminative network 퐷 receives either a real sample or the output of 퐺 and estimates the probability that its input originated from the actual data distribution rather than from a “model” distribution [85].

The mini-max game can be expressed mathematically as:


푚푖푛퐺 푚푎푥퐷 푉(퐷, 퐺) = 퐸푥~푝푑푎푡푎(푥)[푙표푔 퐷(푥)] + 퐸푧~푝푧(푧)[푙표푔(1 − 퐷(퐺(푧)))]

Equation 42

Essentially, the objective is to maximize the probability of 퐷 assigning the correct label to samples (i.e. deciding whether a given observation is from the “data” or the “model” distribution), while training 퐺 to minimize 푙표푔(1 − 퐷(퐺(푧))); i.e. we want 퐺 to produce samples that are hard to distinguish from samples from the true data distribution.

This is done by sampling from a random noise vector 푧, with a defined prior 푝푧(푧) and learning a transformation from the noise vector to a distribution which is highly similar (preferably identical) to the true data distribution; in practice, this transforming function is the generative network 퐺(푧, 휃푔), with 휃푔 being the parameters of a deep neural network which maps 푧 to data space.

In practice, the training algorithm alternately optimizes 퐷 for 푘 steps and 퐺 for a single step, which allows 퐷 to remain close to its optimum provided 퐺 does not change too rapidly; this also allows the algorithm to run more efficiently and helps prevent overfitting. During the early stages of training, it is quite easy for 퐷 to discriminate between data and model samples, since 퐺 will still be learning to output realistic samples; 퐺’s objective function 푙표푔(1 − 퐷(퐺(푧))) will therefore saturate. For this reason, an alternative objective, maximizing 푙표푔 퐷(퐺(푧)), is used in practice for 퐺; this does not change the dynamics of 퐷 and 퐺 much, but provides gradients that are sufficiently large to perform useful stochastic gradient descent.
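The saturation argument can be illustrated numerically. Assuming an early-training discriminator output 퐷(퐺(푧)) = 10⁻⁴ (a made-up value), the gradients of the two generator objectives with respect to 퐷(퐺(푧)) differ by orders of magnitude:

```python
import numpy as np

def v_d_g(d_real, d_fake):
    """The mini-max value V(D, G) of Equation 42, evaluated on
    discriminator outputs for a batch of real and generated samples."""
    return np.mean(np.log(d_real)) + np.mean(np.log(1.0 - d_fake))

# Early in training D confidently rejects G's output: D(G(z)) is tiny.
d_g_z = 1e-4

# d/dy log(1 - y) at y = D(G(z)): the original (saturating) G objective
grad_saturating = -1.0 / (1.0 - d_g_z)   # magnitude ~1: weak learning signal
# d/dy log(y) at y = D(G(z)): the alternative objective maximized in practice
grad_alternative = 1.0 / d_g_z           # magnitude ~10^4: strong learning signal
```

Both objectives push 퐷(퐺(푧)) upward, but only the alternative provides a large gradient precisely when the generator is performing worst.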

Figure 57 illustrates more intuitively what this training process converges to (if successful). Here, the generative model is able to capture the true data distribution perfectly and the discriminative network is unable to discriminate true from generated examples.


Figure 57: (a) GAN densities during training (simplified diagram showing distributions of only a single dimension), close to convergence; P(x) is shown in black, G(z) in blue and D(G(z)) in red. (b) The same GAN densities once the algorithm has converged: G(z) matches P(x) perfectly and D(G(z)) outputs 0.5 everywhere

5.4 Adversarial Autoencoders

Adversarial Autoencoders (AAEs) combine some of the concepts explained in detail above, but were not explored to the same theoretical depth. Essentially, AAEs were designed to match the aggregated posterior of the latent space vector from an autoencoder, 푞(푧) = ∫푥 푞(푧|푥) 푝푑(푥) 푑푥, with an arbitrary prior (usually multivariate Gaussian) distribution 푝(푧), a process which results in meaningful samples being generated from any sample, from any part of the prior (latent) space [86].

Here, the encoder function learns to convert the data distribution to the prior distribution and the decoder function learns a function to map from the imposed prior distribution to the data distribution [86]. In addition, a discriminating neural network assesses whether the encoded representation it receives originates from the prior distribution or from the encoded data distribution.


Figure 58: Simplified diagram of an Adversarial Autoencoder


6 IMPLEMENTATION: HIGH ENERGY PHYSICS DETECTOR SIMULATIONS

6.1 Preamble: Assessing Simulation Performance

In order to assess how well Geant4 and each type of deep generative latent variable algorithm modelled real data obtained from the ALICE TRD during production, each type of simulated data was independently assessed using the same neural network architecture (Appendix II: Figure 80), which was trained independently for each dataset.

Following training, predictions were run on real and simulated data, in order to view the distribution of 푃(푟푒푎푙) estimates for both real and each type of simulated dataset. These results are depicted in histogram form in Figure 59 for Geant4 data, Figure 66 for VAE data, Figure 70 for GAN data and Figure 76 for AAE data.

Lastly, in order to pertinently visualize images from each type of simulation across the distribution of 푃(푟푒푎푙) estimates for that data type, images were sampled in order to show which simulated samples were easy to discriminate from real data (i.e. attained low 푃(푟푒푎푙) estimates) and which simulated samples were more realistic and therefore harder to distinguish from real data (i.e. attained high 푃(푟푒푎푙) estimates). These sampled images are shown in Figure 60 for Geant4 data, Figure 67 for VAE data, Figure 71 for GAN data and Figure 77 for AAE data.

Code used to discriminate Geant4 data from real data, as well as code used to build and similarly assess the various deep generative/latent variable models, can be found in [87].

Figure 80 (Appendix II) shows the 2D CNN classifier used to discriminate each of the simulated datasets from real data. This architecture’s weights were reinitialised before each training run, i.e. the model was recompiled from scratch and trained independently for each individual simulated dataset. The convolutional layers had 16 and 32 filters, respectively; both were implemented with a kernel size of 4 × 4, “valid” padding and ReLU activation functions. Independent max-pooling layers were applied with a pool width of 2 × 2. All dense layers, including the output layer, used sigmoid activation functions. This model was trained using the Adam optimiser at a learning rate of 휂 = 10−4, binary cross-entropy loss and a batch size of 32.
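As a rough illustration of the layer arithmetic described above, the following dependency-free NumPy sketch runs a forward pass through the same stack (two convolutions with 16 and 32 filters, 4 × 4 kernels, “valid” padding and ReLU, each followed by 2 × 2 max-pooling, then sigmoid dense layers) using random weights and no bias terms. The 17 × 24 single-channel tracklet shape is an assumption for this sketch, not taken from the actual input specification; the real classifier is the framework model in [87].

```python
import numpy as np

rng = np.random.default_rng(0)

def conv2d_valid(x, kernels):
    """x: (H, W, C_in); kernels: (kH, kW, C_in, C_out); 'valid' padding."""
    kH, kW, _, c_out = kernels.shape
    H, W, _ = x.shape
    out = np.zeros((H - kH + 1, W - kW + 1, c_out))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            patch = x[i:i + kH, j:j + kW, :]
            out[i, j] = np.tensordot(patch, kernels, axes=([0, 1, 2], [0, 1, 2]))
    return out

def max_pool(x, p=2):
    """Non-overlapping p x p max-pooling (trailing rows/cols dropped)."""
    H, W, C = x.shape
    H2, W2 = H // p, W // p
    return x[:H2 * p, :W2 * p].reshape(H2, p, W2, p, C).max(axis=(1, 3))

relu = lambda x: np.maximum(x, 0.0)
sigmoid = lambda x: 1.0 / (1.0 + np.exp(-x))

def discriminator_forward(img):
    """conv(16) -> relu -> pool -> conv(32) -> relu -> pool ->
    flatten -> dense(sigmoid) -> dense(1, sigmoid) = P(real)."""
    h = max_pool(relu(conv2d_valid(img, k1)))
    h = max_pool(relu(conv2d_valid(h, k2)))
    h = sigmoid(h.reshape(-1) @ w1)
    return sigmoid(h @ w2)

# assumed 17 x 24 single-channel tracklet image, random weights
img = rng.random((17, 24, 1))
k1 = rng.standard_normal((4, 4, 1, 16)) * 0.1
k2 = rng.standard_normal((4, 4, 16, 32)) * 0.1
# shapes: (17,24,1) -> (14,21,16) -> (7,10,16) -> (4,7,32) -> (2,3,32)
w1 = rng.standard_normal((2 * 3 * 32, 64)) * 0.1
w2 = rng.standard_normal(64) * 0.1
p_real = discriminator_forward(img)
```

The single scalar output plays the role of the 푃(푟푒푎푙) estimate whose distributions are histogrammed in Figures 59, 66, 70 and 76.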

6.2 Implementation: Geant4

In order to test whether Geant4 simulations are as accurate as they are assumed to be, a simulation was run, configured to generate pions for the following LHC run: 2016/LHC16q/000265343.

6.2.1 Geant4 Configuration and Simulation

Please see [88] for the code used to configure simulations (Config.C), the code to simulate pions as per this specification (sim.C), the code used to reconstruct hits for the TRD, TPC and other detectors whose reconstruction is depended upon for the reconstruction of TRD digits (rec.C), and the code used to filter TRD digits and deliver data in the same format produced for real data (ana.C).


6.2.2 Distinguishing Geant4-Simulated Data from Real Data

A convolutional neural network was able to distinguish Geant4-simulated pions (configured with environmental parameters from 2016/LHC16q/000265343) from real pions obtained during 2016/LHC16q/000265343 to a high degree of accuracy, which motivated the Deep Generative Modelling section of this thesis.

Figure 59 shows the distribution of 푃(푟푒푎푙) estimates for Geant4 simulations and real data. Figure 60 shows example images from the Geant4 simulation that received increasing 푃(푟푒푎푙) predictions. Combining information from the two plots, one notices that the peak at 푃(푟푒푎푙) ≈ 0.25 corresponds to empty images, in which all pixels are equal to zero; this kind of image is indistinguishable in terms of which distribution it came from. This is, however, perhaps an indication that Geant4 actually models the frequency of occurrence of such empty images in the true data distribution quite well.

Figure 59: Distribution of 푷(풓풆풂풍) estimates for Geant4 vs Real data based on predictions from the discriminative neural network discussed above

6.2.3 An exploration of what might distinguish Geant4 from real data

What follows is an analysis of features of the Geant4-simulated and real datasets, to investigate what possibly differentiates them.

Firstly, the mean and standard deviation of ADC values for the real and simulated datasets are not the same: 휇푠푖푚 = 2.178; 휇푟푒푎푙 = 4.527 and 휎푠푖푚 = 11.989; 휎푟푒푎푙 = 17.995. The distribution of the simulated data’s ADC values is slightly more skewed to the right than that of the real data: 훾1푠푖푚 = 19.170; 훾1푟푒푎푙 = 17.391. However, the maximum ADC value for both datasets is, of course, the same: 푚푎푥푠푖푚 = 푚푎푥푟푒푎푙 = 1023.
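Summary statistics of this kind are straightforward to reproduce for any ADC array. The sketch below uses a toy exponential stand-in rather than the thesis datasets, and also includes the per-image empty-pad count (rows whose sum is zero) used for the comparison in Figure 62:

```python
import numpy as np

def adc_summary(adc):
    """Mean, standard deviation and Fisher-Pearson skewness (gamma_1)
    of a flattened array of ADC values."""
    a = np.asarray(adc, dtype=float).ravel()
    mu, sigma = a.mean(), a.std()
    gamma1 = np.mean(((a - mu) / sigma) ** 3)
    return mu, sigma, gamma1

def n_empty_pads(image):
    """Number of pads (rows) in a (pads x timebins) image whose
    row-sum is zero, i.e. pads that produced no data."""
    return int(np.sum(np.asarray(image).sum(axis=1) == 0))

# toy stand-in for an ADC value array (NOT the thesis data)
rng = np.random.default_rng(0)
toy_adc = rng.exponential(scale=2.0, size=100_000)
mu, sigma, gamma1 = adc_summary(toy_adc)
```

For a fully empty 17-pad image, `n_empty_pads` returns 17, matching the "completely devoid of signal" category discussed below.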

Figure 61 shows the distribution of ADC values for simulated and real data, respectively.

Looking at the number of pads per image that did not produce any data (i.e. the number of row-sums of the image that are equal to 0), Figure 62 shows the distributions for simulated and real data:


Figure 60: Twelve sampled Geant4-simulated pions arranged in order of increasing P(real) estimates


Figure 61: Distribution of ADC values for (a) Geant-simulated data and (b) real data



Figure 62: (a) Distribution of the number of pads with no data per image for simulated data (b) for real data

While these distributions are quite similar, one can see that real data is slightly more skewed towards the left and has far fewer instances of images which are completely devoid of signal, i.e. where all pixels in the image are equal to zero and therefore all 17 pads have no signal.

There are also noticeable differences in the distribution of mean ADC values per pad for real and simulated data (see Figure 63).


Figure 63: Distribution of mean ADC value per image for (a) simulated data, (b) real data

Finally, the mean pulse height as a function of time for Geant4 and real data (Figure 64) indicates that Geant4 data could perhaps be scaled by a determinable factor to make simulations match real data a bit better.

6.3 Implementation: Variational Autoencoders

6.3.1 Setup of the most successful Variational Autoencoder

Most of the Variational Autoencoders in this project that were trained classically, with Mean Squared Error (MSE) as the reconstruction loss, were found to produce images that were quite blurry (cf. Appendix III, and note that most other VAEs prototyped resulted in similar “smeared-out” images).

The VAE discussed below was actually pre-trained, with images scaled to have pixel values in the range [-1, 1], using MSE as the reconstruction loss at a very high learning rate (휂 = 0.01) for just 2 epochs, in order to get the configuration of the model’s weights “in the right ballpark”.


Figure 64: Average Pulse-height as a function of time-bin for Real and Geant4-simulated pion tracklets

After this pre-training stage, the output images looked as shown in Figure 65. Subsequent to pre-training, the same input images were rescaled to be in the range [0, 1] and the model’s weight configuration was saved, while the reconstruction loss function was changed from MSE to Focal Loss and the learning rate was reduced drastically to 휂 = 10−8.

Figure 65: 3 example images from the pre-training stage, using MSE reconstruction loss and a high learning rate.
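The exact focal-loss formulation used belongs to the project code [87]; a standard per-pixel binary focal loss (Lin et al.), with assumed 훼 and 훾 values, can be sketched as follows:

```python
import numpy as np

def binary_focal_loss(y_true, y_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    """Per-pixel binary focal loss, averaged over the image.
    The (1 - p)^gamma and p^gamma factors down-weight pixels that are
    already reconstructed confidently, focusing the loss on hard
    pixels - useful for sparse images where most pixels are zero."""
    p = np.clip(y_pred, eps, 1.0 - eps)
    loss = (-alpha * (1.0 - p) ** gamma * y_true * np.log(p)
            - (1.0 - alpha) * p ** gamma * (1.0 - y_true) * np.log(1.0 - p))
    return float(loss.mean())
```

On a mostly-zero tracklet image, background pixels that are already predicted near zero contribute almost nothing to this loss, which is consistent with the sharper, less "smeared-out" reconstructions reported above.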

Model summary:

• 푁푙푎푡푒푛푡 = 5
• Batch size = 128
• Epochs = 164
• Sample size for each epoch = 500 000

This model’s architecture is shown in Appendix II: Figure 81.

6.3.2 Distinguishing Variational Autoencoder Data from Real Data

The distribution of 푃(푟푒푎푙) estimates and examples of simulated images are shown in Figure 66 and Figure 67.

6.3.3 Exploration of the VAE latent space

A visualization of points plotted along latent dimensions 푧1, 푧2 is shown in Figure 68. Areas of the latent space that result in more realistic simulated images show clear clustering in sub-plot (c). Similarly grouped clusters were seen at 푃(푟푒푎푙) ≥ 0.99 for all other combinations of axes. Latent space projections for other combinations of 푧푖,푗 are not shown, due to the inherent lack of interpretability of the latent space.


Figure 66: Distribution of 푷(풓풆풂풍) estimates for VAE-simulated and real data

Figure 67: Twelve examples of VAE simulated tracklets, arranged in order of ascending 푷(풓풆풂풍) estimates


Similarly, Figure 69 shows a linear interpolation along the latent space dimensions 푧1, 푧2, where individual tracklet images are located at specific coordinates and the other 3 latent dimensions are set to zero. That is, the 푧1, 푧2 coordinates at which an image is shown, together with the 3 remaining z-variables each set to 0, are the input values to the decoder function of the Variational Autoencoder, which maps them to the specific image shown there. Again, various other projections combining other pairs of latent dimensions are possible (not shown here).
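An interpolation grid of this kind can be sketched as follows, assuming a 5-dimensional latent space and any decoder function; the linear toy decoder here is purely illustrative, standing in for the VAE's decoder network:

```python
import numpy as np

def latent_grid(decoder, n_latent=5, dims=(0, 1), span=3.0, steps=15):
    """Decode a regular grid over two latent dimensions, with the
    remaining latent dimensions fixed at zero (as in Figure 69)."""
    ticks = np.linspace(-span, span, steps)
    rows = []
    for zi in ticks:
        row = []
        for zj in ticks:
            z = np.zeros(n_latent)
            z[dims[0]], z[dims[1]] = zi, zj
            row.append(decoder(z))
        rows.append(row)
    return np.array(rows)          # shape: (steps, steps, *image_shape)

# stand-in decoder: any map from a 5-d latent vector to a 17x24 image
rng = np.random.default_rng(0)
W = rng.standard_normal((5, 17 * 24))
toy_decoder = lambda z: (z @ W).reshape(17, 24)
grid = latent_grid(toy_decoder)
```

Tiling the resulting grid of decoded images produces a figure of the same kind as Figure 69; other latent-dimension pairs are obtained by changing `dims`.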


Figure 68: Scatterplot projections of simulated data points along latent space dimensions (풛ퟎ, 풛ퟏ). Associated P(real) estimates for simulated tracklets produced from these latent points are represented on an independent colour scale for each plot. (a) 푷(풓풆풂풍) < ퟏퟎ−ퟓ, (b) ퟎ.ퟒퟗ ≤ 푷(풓풆풂풍) ≤ ퟎ.ퟓퟏ, (c) 푷(풓풆풂풍) > ퟎ.ퟗퟗ.

Figure 69: A linear interpolation of the latent space along latent dimensions (풛ퟎ, 풛ퟏ), where the other three latent dimensions (풛ퟐ, … , 풛ퟒ) are all set at 풛풊 = 0. The approximate boundaries around a single such tracklet image (near the centre of the plot) are highlighted in red to illustrate the concept better.


These visualizations of the latent space are promising investigations into how the latent space of a variational autoencoder can be understood (to some extent). Interestingly, one thing that was not done was to partition the input (true) dataset according to physical measurables such as particle ID, momentum, the detector component it originates from, etc. Should the partitioned dataset then be passed through the encoder function of the VAE, one would be able to determine where, for example, a 2 GeV/푐 electron which results in a specific energy deposit, detected by a specific pad in a specific chamber, maps to in the latent space. Provided one has sufficient data for each combination of possible features of the particles one wants to simulate, one should be able to find the specific areas in the latent space that those combinations of features map to, which would, in turn, allow one to hand those latent encodings to the decoder function of the VAE to specify exactly the characteristics of the particle one wishes to simulate. Since the latent space is continuous, the inherent statistical fluctuations characteristic of physics measurements should also be captured (if the latent space is exploited properly), by generating unseen examples which could plausibly have resulted from two particles with the same characteristics. This could practically be enforced by sampling a multidimensional random “noise” vector (휖) and adding each of its elements to the corresponding element of the encoded vector handed to the decoder of the VAE.

6.3.4 Discussion: Variational Autoencoders The motivation for the pre-training procedure described above was twofold: using MSE as the reconstruction loss resulted in blurry images that appeared very similar to one another, but MSE at a high learning rate was found to bring the model's output images into the “right ballpark” very quickly. In contrast, using Focal loss as the reconstruction loss results in an immensely high loss at the outset and also tends to produce almost “inverted” images, such as those found near 푧0 = 0.5, 푧1 = 2 in the latent space projection (Figure 69).
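The contrast between MSE and Focal loss can be made concrete with a minimal per-pixel binary focal loss sketch; 훾 = 2 is the common default from the original focal loss paper and is an assumption here, not necessarily the setting used in this work:

```python
import numpy as np

def focal_loss(y_true, y_pred, gamma=2.0, eps=1e-7):
    """Per-pixel binary focal loss: the (1 - p_t)^gamma factor down-weights
    pixels that are already reconstructed well, so the sparse bright pixels
    of a tracklet image dominate the gradient rather than the empty ones."""
    y_pred = np.clip(y_pred, eps, 1.0 - eps)
    p_t = np.where(y_true > 0.5, y_pred, 1.0 - y_pred)
    return float(np.mean(-((1.0 - p_t) ** gamma) * np.log(p_t)))

y = np.ones((4, 4))
# A confident, correct reconstruction incurs far less focal loss than an
# uncertain one, unlike MSE, which penalises both more evenly.
loss_good = focal_loss(y, np.full((4, 4), 0.9))
loss_poor = focal_loss(y, np.full((4, 4), 0.5))
```

With 훾 = 0 the expression reduces to ordinary binary cross-entropy, which makes the down-weighting role of 훾 easy to verify.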

Even though the optimizer, learning rate and the (scaled) image pixel range differed between the pre-training stage [-1, 1] and the second stage [0, 1], when Focal loss was used as the reconstruction error after pre-training, the model recovered from these differences rather quickly and eventually produced simulated images that were far more realistic, clear and varied than those of any other VAE built during this project.

Generative modelling is clearly not an exact science, and a number of seemingly strange strategies often had to be employed to obtain realistic simulations.

6.4 Implementation: Generative Adversarial Networks

6.4.1 Setup of the most successful GAN:

• 푛푙푎푡푒푛푡 = 4
• Batch size = 32
• Optimizers:
o Discriminator: SGD at learning rate 휂 = 4 × 10−4
o Generator: Adam at learning rate 휂 = 2 × 10−4
• Epochs = 200 000
• Label smoothing:
o “True” labels were smoothed as follows: 푦푟푒푎푙 ~ 푈(0.71, 1.21)


Chapter 6: Implementation: High Energy Physics Detector Simulations

o “False” labels were smoothed as follows: 푦퐺퐴푁 ~ 푈(0, 0.29)
• Input image pixels were scaled to the range [-1, 1]
• The Generator’s output-layer bias term was initialised from a truncated normal distribution: 푏 ~ 푇푁(휇 = −2, 휎2 = 0.4, 푎 = −2.2, 푏 = −1.8)
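The bias initialisation in the list above can be sketched with a simple rejection sampler; interpreting 휎2 = 0.4 as the variance (so 휎 = √0.4) is an assumption of this sketch, and the vector length of 64 is illustrative:

```python
import numpy as np

def truncated_normal(mu, sigma, a, b, size, rng):
    """Sample N(mu, sigma^2) by rejection, keeping only draws in [a, b]."""
    out = np.empty(size)
    n = 0
    while n < size:
        draw = rng.normal(mu, sigma, size)
        draw = draw[(draw >= a) & (draw <= b)]   # reject out-of-range draws
        take = min(size - n, draw.size)
        out[n:n + take] = draw[:take]
        n += take
    return out

rng = np.random.default_rng(1)
# Generator output-layer bias, as in the setup above:
# TN(mu = -2, sigma^2 = 0.4, a = -2.2, b = -1.8)
bias = truncated_normal(-2.0, np.sqrt(0.4), -2.2, -1.8, 64, rng)
```

Initialising the bias around a negative mean pushes the output layer's pre-activations towards zero, which matches the motivation given later for this choice.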

6.4.2 Distinguishing GAN-Simulated Data from Real Data The distribution of 푃(푟푒푎푙) estimates and examples of simulated images are shown in Figure 70 and Figure 71, respectively.

Figure 70: Distribution of 푷(풓풆풂풍) estimates for GAN and real data

6.4.3 Discussion: Generative Adversarial Networks While various GAN architectures and variations on the GAN concept gave results that appeared vastly different in terms of “style”, they all failed to capture the underlying distribution of the training data sufficiently. A possible reason is that the images used for training are quite small and contain relatively little information compared to, for instance, images of human faces, where GANs have proven quite successful; but this is speculation.

In summary, all of the GAN architectures developed in this project gave disappointing results. This is possibly because GANs are hard to train: two neural networks are pitted against each other in such a setup, and if either of them becomes dominant during the training process, it can force the other into a region of parameter space that is not conducive to training either network properly.

An example of this can be seen in Figure 72: after 21 400 epochs, the Generator still outputs images that appear to be random noise but, upon closer inspection, share similar features at similar coordinates across the different images (these nonsensical features evidently resulted in a low loss and therefore became prevalent). This phenomenon is known as Mode Collapse, which occurs when only a limited number of modes of the true distribution are captured by a GAN [89].

Batch Normalization is a strategy that normalizes a layer’s outputs (to approximately zero mean and unit variance over each mini-batch) and thereby helps prevent the Generative component of the GAN from outputting images that all look very similar. In Figure 73, one can see that the output images of a Bidirectional GAN (which employed this strategy) are highly dissimilar, although there is still a lot of noise around the main signal.
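As a minimal sketch of what Batch Normalization does at the tensor level (the learned scale and shift parameters of a full BatchNorm layer are omitted here):

```python
import numpy as np

def batch_norm(x, eps=1e-5):
    """Normalize each feature over the mini-batch to zero mean and unit
    variance; eps guards against division by zero for constant features."""
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    return (x - mu) / np.sqrt(var + eps)

# A mini-batch of 32 examples with 8 features, far from zero-mean/unit-variance:
x = np.random.default_rng(3).normal(5.0, 3.0, size=(32, 8))
xn = batch_norm(x)
```

In a full layer, learned per-feature scale (훾) and shift (훽) parameters are applied after this normalization, so the network can still represent other distributions when useful.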

It is interesting to note that while one might expect GANs that employ convolutional layers to give better results than those making use of fully connected dense layers, Deep Convolutional GANs gave some of the least realistic simulated images (Figure 74). This could be because convolutional layers introduce a prior that pertinent detected features are translationally invariant, i.e. that they can appear anywhere in the image. While this might be a useful assumption for particle identification, it does not seem to hold when simulating the data used in this project.


Figure 71: Twelve example GAN-simulated images, arranged in order of increasing 푷(풓풆풂풍) estimates

Figure 72: An illustration of Mode Collapse


Figure 73: Illustrating the effect of Batch Normalization in preventing output images from looking highly similar

Figure 74: Illustrating how convolutional architectures in a GAN setup result in features that might exist in the training distribution, but do not appear in the right place

Training beyond a certain number of epochs also does not necessarily yield additional gains in performance. As can be seen in Figure 75, there is not much change in the output of a Least Squares GAN when training for an additional 293 000 epochs after 94 600 epochs, and training for a further 61 400 epochs after that eventually results in images that actually look less realistic than those produced at earlier epochs.

6.5 Implementation: Adversarial Autoencoders

6.5.1 Set-up of most successful Adversarial Autoencoder:

• 푛푙푎푡푒푛푡 = 4
• Discriminator optimizer: SGD with learning rate 휂 = 0.00003
• Generator optimizer: Adam with learning rate 휂 = 0.00001 and parameter 훽1 = 0.5 (using different learning rates for the Discriminator and Generator is commonly suggested in practice to increase the stability of GAN training; it worked quite well for AAEs as well)
• Label smoothing:
o Positive labels: smoothed to be in the range 0.9–1.4
o Negative labels: smoothed to be in the range 0–0.1


Label smoothing is implemented as follows:

푦푟푒푎푙 = 1 − 0.1 + (푢 × 0.5)

푦푠푖푚푢푙푎푡푒푑 = 0 + (푢 × 0.1)

where 푢 ~ 푈(0, 1).

Label smoothing is another method suggested to improve GAN training stability, and it also worked quite well for AAEs; the technique ensures that the Discriminator is never entirely sure of its prediction, giving the Generator an advantage.

• Epochs = 400 000
• Batch size = 32
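The smoothing formulas above translate directly into code; the batch size of 32 matches the setting listed:

```python
import numpy as np

def smoothed_labels(batch_size, rng):
    """Label smoothing as specified above: real targets land in [0.9, 1.4],
    simulated targets in [0.0, 0.1], with u ~ U(0, 1)."""
    u_real = rng.random(batch_size)
    u_sim = rng.random(batch_size)
    y_real = 1.0 - 0.1 + u_real * 0.5
    y_simulated = 0.0 + u_sim * 0.1
    return y_real, y_simulated

rng = np.random.default_rng(42)
y_real, y_sim = smoothed_labels(32, rng)
```

These arrays would replace the hard 1/0 targets in the Discriminator's loss at each training step.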


Figure 75: Least squares GAN (a) after 94 600 epochs, (b) after 387 600 epochs (c) after 449 000 epochs

6.5.2 Distinguishing AAE-Simulated Data from Real Data The distribution of 푃(푟푒푎푙) estimates and examples of simulated images are shown in Figure 76 and Figure 77, respectively.

6.5.3 Discussion: Adversarial Autoencoders The relative success of AAEs probably owes much to the way their training procedure enforces the structure of the latent space, allowing meaningful samples to be generated from anywhere within it. These models were implemented quite late in the project and were experimented with far less than VAEs and GANs, but they do seem much more promising for this use-case, especially compared to GANs.


Figure 76: Distribution of 푷(풓풆풂풍) estimates for AAE and real data

Figure 77: Twelve example AAE-simulated images, arranged in order of increasing 푷(풓풆풂풍) estimates


6.5.4 Chapter Discussion

6.5.4.1 Challenges faced during deep generative modelling During this project it was found to be arguably much more practical to train a generative model locally than on a server. The advantage of training locally lies in being able to qualitatively assess how realistic images appear (by plotting examples at specified stages during training) and in being able to force-stop training when it is clear that the model is not improving.

Model loss and accuracy metrics are less informative when training a generative model than when training a classification model, since realistic images can sometimes be generated even when the loss appears quite high. Similarly, classification accuracy and loss cannot truly be calculated when techniques such as label smoothing are used. It was also found that accuracy and loss metrics can appear to “remain in balance” while the model improves, i.e. even though these metrics are no longer changing, the generated images become more realistic nonetheless.

There are also pitfalls to training locally, as opposed to on a server. Models with differing architectures, learning rates and other hyperparameter settings take different numbers of training rounds/epochs before they start generating realistic images, and models that seem promising during the early epochs sometimes get worse during later epochs. Since only one model can be trained at a time when running locally, and since loading all 9.3 million tracklets into memory and training generative models (sometimes using computationally expensive convolutional operations) is resource-intensive and costly in time, a trade-off often needs to be made and a model might have to be stopped before it reaches its full generative potential. For most of the models developed in this project, the full dataset was not used, due to local hardware limitations.

Local training also forces one to be very careful about changing hyperparameters or architectures, because of the computational and time cost. Far fewer experiments can therefore be conducted than when training many neural network classifiers on a server, where performance can be assessed quite reliably from training metrics and decreasing loss functions alone.

Tweaking a single hyperparameter at a time to isolate its effect would be ideal, but time constraints often force one to change multiple hyperparameters and architectural elements simultaneously. It therefore becomes important to develop an intuition for the combined effect of a set of changes, and to be able to debug a generative set-up from its simulated images alone, knowing how adjusting a certain parameter (or set of parameters) could potentially fix issues in the simulation process.

6.6 Chapter Conclusions Building generative models was found to be extremely valuable as a way to develop an intuitive understanding of how neural networks work in practice. Changing a specific hyperparameter, or trying out a slightly different architecture, and seeing how the model performs allows one to reason about the results (again, qualitatively assessing the images that a generative model produces as training progresses is helpful here).

Using techniques that have been shown to improve the stability of generative models in practice [90] (such as label smoothing and using different optimizers for the generator and the discriminator), which may be hard to motivate mathematically but work quite well in practice, shows that practical experience is often as valuable as theoretical knowledge when working with advanced deep learning algorithms.


Chapter 7: Conclusions

7 CONCLUSIONS

7.1 Machine Learning for Particle Identification The CNNs developed in this thesis (trained on uncalibrated data) produced pion efficiency results comparable to the earlier LQ1D and LQ2D methods (which were applied to properly calibrated input data), but their performance was much worse than that of the LQ7D method and the fully connected neural networks currently in use by ALICE collaborators in the TRD working group.

While convolutional neural networks are generally accepted as being more conducive to the accurate classification of array-like data than fully connected neural networks [91], their design cannot correct for the lack of proper calibration of the input data unless more information (e.g. chamber gain and pad-by-pad calibration factors) is provided to them, or unless the data is simply calibrated properly before being used for neural network training.

When comparing results from this thesis across methods (on an uncalibrated dataset), one does see significant improvements when using 2D CNNs, compared to fully connected neural networks, tree-based methods, networks making use of LSTM cells and 1D CNNs. As mentioned, this could be partly explained by the relative number of 2D CNNs developed, compared to other model types. A general observation is that larger neural networks (models with more trainable parameters, which therefore have a higher capacity to “learn”/”memorise”) were on the whole more successful than smaller models, but that there is a limit to how much the size of the network can improve performance. In addition, larger networks are prone to overfitting and therefore need to be properly regularized.

The use of the focal loss function and incremental training per momentum bin resulted in immense gains over the other methods explored. Because well-classified examples contribute a much lower loss, this loss function prevented the neural network from becoming biased towards the dominant class, and thereby allowed the full dataset to be used (excluding empty images, which possibly also contributed some gains) without any downsampling or upsampling. The number of examples the network could consequently be trained on most probably also had a large influence on the success of the final and most successful particle-identification model. It should be noted that only one model was built using this approach; building many models with the same architecture and hyperparameters on different subsets of the data could indicate the influence of random statistical fluctuations due to random weight initialisation. Looking at the other factors of variation between this model and the others, however, it seems more likely that the focal loss function, exposure to many more training examples and, possibly, the incremental training procedure are what set this model apart from the rest.

Ten-fold cross-validation was performed on the tree-based methods, which was not done for the other models and could potentially have yielded additional gains if it had been. The two tree-based methods’ results were surprisingly good, considering that they were not fine-tuned or explored beyond their default parameters.

There is a potential limit to the amount of information that can possibly be extracted from the TRD in isolation, regarding a specific particle ID, regardless of the sophistication of the classifier used.

7.1.1 Suggestions for Future Work In light of these results, one could posit that CNNs or LSTMs (or a combination) could provide additional decreases in pion efficiency on a properly calibrated input dataset, over and above what is achievable with feedforward neural networks. This might


be an avenue worth exploring, especially since, from the advent of ROOT 6.12, convolutional as well as recurrent layers have been supported in ROOT’s Toolkit for Multivariate Analysis (TMVA) on CPU, and CNNs are now also supported on GPU (since ROOT 6.14) [92].

However, predictions during an LHC run must be made at such a speed that, even though convolutional neural networks might yield slight improvements in pion rejection and electron acceptance rates, they are unlikely to be practically implementable on the available detector computing hardware during a run; the data could, however, always be analysed ex post facto.

Cross-validation and a wider exploration of tree-based methods, as well as more sophisticated hyperparameter searches and ensembling strategies, could potentially yield slight improvements in pion efficiency, but this statement is mainly speculative.

7.2 High Energy Physics Detector Simulations

7.2.1 Major findings This thesis has shown that deep generative models could indeed be an avenue worth pursuing in more formal future research at CERN. In particular, it has shown that Adversarial Autoencoders are able, owing to their training procedure, to produce meaningful samples from anywhere in the latent space they sample from. Results from VAEs were also quite promising, and the lack of success with the GAN approach to simulation could be attributed to the limited time available during a one-year project. All of these models could be fruitful avenues to explore in the future.

A particularly exciting finding from the visualizations of a VAE’s latent space is the guidance such explorations can provide when moving towards a practical implementation as part of an actual physics detector simulation. This would have to involve understanding where specific particles, with specific measurable differences, tend to map to (on average) in the latent space for each specifiable combination of variables. Once that is determined, and there is some confidence that the latent space is properly utilised, i.e. that meaningful samples (samples which could realistically arise in reality) are generated from anywhere in the latent space, one would be able to specify exactly what one wishes to simulate. In addition, one could model statistical fluctuations around the expected signal for a specific combination of features by adding a small random noise vector (휖) to the latent vector 푧 before handing it to the decoder function of the generative model. These concepts should hold for any generative model that samples from a latent space, and various combinations of latent variable models could potentially be combined towards this purpose. Once these conditions are met, interfaces to other components of the detector chain (e.g. to AliROOT) would probably centre around the methodology behind the currently used interfaces to the simulation component. These interfaces would need to be satisfied in a standard way, and this would most likely be one of the guiding principles in designing such deep generative models from the outset. Should all of this be successful, a significant speed-up over Monte Carlo simulations is expected.

7.2.2 Additional Observations In general, generative models were found to be much harder to work with than classifying neural networks: for each type of model prototyped, many experiments failed before something worked reasonably well. That said, the experience gained during this project points towards some simple guidelines: it is best to start with something very simple that produces images that look


even slightly better than random noise, and only then to add additional layers, nodes or strict regularization measures. Starting with a very complex generative modelling strategy from the outset will also make debugging much harder.

There are almost no limits to the ways in which generative models can be modified and made to work differently. The Keras functional API made this process a lot easier than it would have been if one had to implement everything in native Tensorflow.

Intuition becomes an important aid in this process and can only be obtained through experience. Generative modelling was a far more valuable learning experience for truly understanding concepts in deep learning than building classifiers was, although without having built many classifiers first, it would not have been possible to attain much success with the more advanced VAEs, GANs and AAEs built in this project.

The fact that visible output can be qualitatively compared to what real TRD digits data should look like, and that one can assess how changing a set of parameters influences that visual output, are the main features that make generative models a good teaching tool.

It should also be said that frustration with these models drove further experimentation with increasingly creative modifications of the models and their training procedures; cf. using a truncated normal distribution with a negative mean for the bias term of the output layer, to force that layer’s output closer to zero, or pre-training a VAE with different dataset-scaling, learning rate and reconstruction loss settings during two different stages.

A somewhat personal observation is that the idea underpinning VAEs is one of the most beautiful concepts in Statistics; a less subjective statement is that VAEs can be used for much more than generating images, e.g. for outlier detection or for the optimization of factory processes (by finding regions in the latent space that result in fewer errors or in higher efficiency during production of goods). There are also potential applications in drug discovery and development, where VAEs can be used as part of a process to find artificial medicinal ligands that fit into the active sites of receptors.

7.2.3 Suggestions for Future Work Future work should perhaps look into Conditional GAN techniques, which give the deep learning practitioner more control over the samples produced (for example, specifying the total energy deposit by scaling that value and multiplying it with the latent vector for each image, as was done in [1]). While this technique was attempted to some extent during this project, it was largely unsuccessful and would require more time to perfect.

In addition, Auxiliary Classifier GANs (ACGANs) might make it possible to specify the type of particle one wishes to produce. An ACGAN is a GAN variant in which the Generative model is provided with a class label in addition to a randomly sampled latent vector, from which it attempts to produce an image of the specified class; the Discriminative model is, as usual, tasked with discriminating real from fake images, while also predicting the class label of each input image.
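At the input level, the class conditioning of an ACGAN generator amounts to concatenating a one-hot class label with the latent vector. The sizes below (4 latent dimensions, two classes for electron versus pion, batch size 32) are illustrative choices for this sketch, not a specification of an implemented model:

```python
import numpy as np

rng = np.random.default_rng(7)
N_LATENT, N_CLASSES, BATCH = 4, 2, 32   # e.g. class 0 = pion, class 1 = electron

# Generator input: latent vector concatenated with a one-hot class label,
# so the generator can be asked for an image of a chosen particle type.
z = rng.normal(size=(BATCH, N_LATENT))
labels = rng.integers(0, N_CLASSES, size=BATCH)
one_hot = np.eye(N_CLASSES)[labels]
gen_input = np.concatenate([z, one_hot], axis=1)

# The discriminator would then produce both a real/fake score and a predicted
# class distribution per image (the networks themselves are not modelled here).
```

Training then adds an auxiliary classification loss on the discriminator's class output, alongside the usual adversarial loss.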

7.3 General Concluding Remarks Making use of Tensorflow on GPU, via NVIDIA’s CUDA libraries (assuming one has access to a GPU with sufficient specifications), would be a useful way to speed up the training of most of the models discussed, specifically those making use of convolution operations.

Besides particle identification and deep generative modelling for detector simulations, the era of open-source machine learning and more affordable high-performance computing opens up various exciting avenues in particle physics research, including the detection of outliers that could be indicative of Physics Beyond the Standard Model (BSM).

To this end, there is an active community of researchers applying machine learning to various aspects of CERN research [93], and as part of this Masters research the author was fortunate to attend the most recent Inter-experimental Machine Learning Workshop hosted at


CERN [94], a yearly event featuring talks and workshops by CERN physicists involved in machine learning research, as well as various high-profile industry experts.


Chapter 8: Bibliography

8 BIBLIOGRAPHY

[1] M. Paganini, L. de Oliveira and B. Nachman, “CaloGAN: Simulating 3D High Energy Particle Showers in Multi-Layer Electromagnetic Calorimeters with Generative Adversarial Networks,” Physical Review D, vol. 97, 2017.

[2] F. Carminati, A. Gheata, P. Mendez Lorenzo, S. Sharan and S. Vallecorsa, “Three dimensional Generative Adversarial Networks for fast simulation,” Journal of Physics: Conference Series, vol. 1085, p. 032016, 2018.

[3] S. Vallecorsa, “Generative Models for Fast Simulation,” Journal of Physics: Conference Series, vol. 1085, p. 022005, 2018.

[4] M. Thomson, Modern Particle Physics, Cambridge, UK: Cambridge University Press, 2013.

[5] The CMS Collaboration, “Observation of a new boson at a mass of 125 GeV with the CMS experiment at the LHC,” Physics Letters B, vol. 716, p. 30, 2012.

[6] The ATLAS Collaboration, “Observation of a new particle in the search for the Standard Model Higgs boson with the ATLAS detector at the LHC,” Physics Letters B, vol. 716, pp. 1-29, 2012.

[7] F. Englert and R. Brout, “Broken Symmetry and the Mass of Gauge Vector Mesons,” Physical Review Letters, vol. 13, no. 9, pp. 321- 323, 1964.

[8] P. Higgs, “Broken Symmetries and the Masses of Gauge Bosons,” Physical Review Letters, vol. 13, no. 16, pp. 508-509, 1964.

[9] G. Guralnik, C. Hagen and T. Kibble, “Global Conservation Laws and Massless Particles,” Physical Review Letters, vol. 13, no. 20, pp. 585-587, 1964.

[10] Wikimedia Commons, “File:Meson nonet - spin 1.svg --- Wikimedia Commons, the free media repository,” 2014. [Online]. Available: https://commons.wikimedia.org/w/index.php?title=File:Meson_nonet_-_spin_1.svg&oldid=141026885. [Accessed 21 October 2019].

[11] Wikimedia Commons, “File:Meson nonet - spin 0.svg --- Wikimedia Commons, the free media repository,” 2015. [Online]. Available: https://commons.wikimedia.org/w/index.php?title=File:Meson_nonet_-_spin_0.svg&oldid=182154336. [Accessed 21 October 2019].

[12] Wikimedia Commons, “File:Baryon-decuplet-small.svg --- Wikimedia Commons, the free media repository,” 2019. [Online]. Available: https://commons.wikimedia.org/w/index.php?title=File:Baryon-decuplet-small.svg&oldid=378029923. [Accessed 21 October 2019].

[13] Wikimedia Commons, “File:Baryon-octet-small.svg --- Wikimedia Commons, the free media repository,” 2018. [Online]. Available: https://commons.wikimedia.org/w/index.php?title=File:Baryon-octet-small.svg&oldid=324287830. [Accessed 21 October 2019].

[14] M. Tanabashi et al. (Particle Data Group), “Review of Particle Physics,” Phys. Rev. D, vol. 98, p. 030001, 2018.

[15] J. Beringer et al. (Particle Data Group), “Review of Particle Physics,” Phys. Rev. D, vol. 86, p. 010001, 2012.


[16] J. Rafelski, “Connecting QGP-Heavy Ion Physics to the Early Universe,” Nuclear Physics B - Proceedings Supplements, vol. 243, 2013.

[17] H. Satz, “The Quark-Gluon Plasma - A Short Introduction,” Nuclear Physics A - NUCL PHYS A, vol. 862, pp. 4-12, 2011.

[18] A. Maire, “Multi-strange baryon production at the LHC in proton-proton collisions with the ALICE experiment (PhD thesis),” University of Strasbourg, 2011.

[19] A. Maire and J. Charles, “Phase transition to QGP matter: confined vs deconfined matter,” For the benefit of the ALICE Collaboration, 2015.

[20] The Stephen Hawking Center for Theoretical Cosmology, “Universe Chronology,” [Online]. Available: http://www.ctc.cam.ac.uk/images/contentpics/outreach/cp_universe_chronology_large.jpg. [Accessed 5 June 2019].

[21] M. Kamionkowski, Week 3: Thermal History of the Universe (Lecture Notes from AY 127: Cosmology and Galaxy Formation), California Institute of Technology.

[22] F. de Rose, “The birth of CERN,” Nature, vol. 455, pp. 174-175, 2008.

[23] CERN, “CERN Annual Report 2018,” CERN Annual Reports, 2019.

[24] O. Bruning, P. Collier, P. Lebrun, S. Myers, R. Ostojic, J. Poole and P. Proudlock, LHC Design Report, vol. Volume 1: The LHC Main Ring, Geneva: CERN Scientific Information Service, 2004.

[25] M. A. Hone, “The Duoplasmatron Ion Source for the new CERN LinAc preinjector,” CERN/PS/LR, vol. 79, no. 37, 1979.

[26] CERN, “The PS complex as proton pre-injector for the LHC - Design and implementation report,” 2000.

[27] S. Dailler, “Map of the Geneva region and of the LHC.,” 1997. [Online]. Available: https://cds.cern.ch/record/842399?ln=en. [Accessed 20 November 2019].

[28] V. Frigo, “LHC map in 3D,” 1997. [Online]. Available: https://cds.cern.ch/record/842700. [Accessed 20 November 2019].

[29] V. Frigo, “LHC Structure,” 1997. [Online]. Available: https://cds.cern.ch/record/842611. [Accessed 20 November 2019].

[30] E. Mobs, “The CERN accelerator complex - August 2018,” 2018. [Online]. Available: https://cds.cern.ch/record/2636343/files/CCC-v2018-print-v2.jpg?subformat=icon-1440. [Accessed 26 January 2019].

[31] A. Beuret, J. Borburgh, H. Burkgardt, C. C. Carli, A. Fowler, M. Gourber-Pace, S. Hancock and M. Hourican, “The LHC Lead Injector Chain,” 9th European Particle Accelerator Conference, p. 1153, 2004.

[32] ATLAS Collaboration, “The ATLAS Experiment at the CERN Large Hadron Collider,” Journal of Instrumentation, vol. 3, no. S08003, 2008.

[33] CMS Collaboration, “The CMS Experiment at the CERN LHC,” Journal of Instrumentation, vol. 3, no. S08004, 2008.

[34] CERN, “LHC Experiments,” [Online]. Available: https://home.cern/science/experiments. [Accessed 21 February 2019].

[35] CERN, “ALICE Experiment,” [Online]. Available: https://home.cern/science/experiments/alice. [Accessed 21 February 2019].


[36] CERN, “LHCb Experiment,” [Online]. Available: https://home.cern/science/experiments/lhcb. [Accessed 21 February 2019].

[37] ALICE, “ALICE Homepage,” [Online]. Available: http://alice.web.cern.ch/. [Accessed 21 February 2019].

[38] ALICE Collaboration, “The ALICE Transition Radiation Detector: construction, operation, and performance,” Nuclear Instruments and Methods in Physics Research, A, vol. 881, pp. 88-127, 2018.

[39] J. Thäder, “ALICE Figure Repository: General ALICE Cross-Section with L3 Magnet,” 2012. [Online]. Available: https://alice-figure.web.cern.ch/node/3394. [Accessed 5 December 2019].

[40] The ALICE Collaboration, The ALICE Experiment at the CERN LHC, INSTITUTE OF PHYSICS PUBLISHING AND SISSA, 2008.

[41] Y. Pachmayer, “Particle Identification with the ALICE Transition Radiation Detector,” Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, pp. 292-295, 2014.

[42] The ALICE Collaboration, The Technical Design Report of the Transition Radiation Detector, Geneva: CERN, 2001.

[43] University of Heidelberg: Physics Department, “TRD Hardware Components and Glossary,” [Online]. Available: https://alice.physi.uni-heidelberg.de/trd-db/info/trd.html. [Accessed 5 December 2019].

[44] The ALICE Collaboration, “Rapidity and transverse momentum dependence of inclusive J/ψ production in pp collisions at √s = 7 TeV,” Physics Letters B, vol. 704, pp. 442-455, 2011.

[45] Particle Data Group, The Review of Particle Physics, 2018.

[46] ALICE, “More details on the ALICE TRD,” [Online]. Available: http://alice.web.cern.ch/detectors/more-details-alice-trd. [Accessed 5 December 2019].

[47] J. Klein, Jet Physics with A Large Ion Collider Experiment at the Large Hadron Collider (PhD thesis), Heidelberg: Ruprecht-Karls-Universität Heidelberg - Physikalisches Institut, 2014.

[48] A. Aamodt, F. Bock, P. Braun-Munzinger, T. Dietel, P. Gonzalez, M. Heide, M. Ivanov, K. Koch, P. Ladron de Guevara, A. Marin, M. Rammler, K. Reygers, D. Rohrich, E. Serradilla and J. Wessels, “Photon reconstruction with conversions in ALICE,” in GSI Scientific Report, 2009.

[49] A. Wilk, Particle Identification Using Artificial Neural Networks with the ALICE Transition Radiation Detector (PhD Thesis), Münster, Germany: Westfälische Wilhelms-Universität Münster (WWU) - Institut für Kernphysik (KP), 2010.

[50] M. Kroesen, Electron Identification with the ALICE TRD using Multivariate Analysis Techniques (Masters Thesis), Heidelberg, Germany: Physikalisches Institut, Universitaet Heidelberg, 2017.

[51] L. Feldkamp, Reconstruction of Beauty Jets in Proton-Proton Collisions at sqrt(s) = 7 TeV with ALICE (PhD Thesis), Münster, Germany: Westfälische Wilhelms-Universität Münster (WWU) - Institut für Kernphysik (KP), 2018.

[52] X. Lu, “Exploring the performance limits of the ALICE Time Projection Chamber and Transition Radiation Detector for measuring identified hadron production at the LHC (PhD thesis),” Ruprecht-Karls-Universität Heidelberg - Physikalisches Institut, 2013.

[53] CERN, “ROOT Data Analysis Framework: User's Guide,” May 2018. [Online]. Available: https://root.cern.ch/root/htmldoc/guides/users-guide/ROOTUsersGuideA4.pdf.

[54] “ROOT 5 Reference Guide,” [Online]. Available: https://root.cern/root/html534/ClassIndex.html.

Machine Learning for Particle Identification and Deep Generative Models towards Fast Simulations for the ALICE Transition Radiation Detector at CERN

[55] “ROOT 6 Reference Guide,” [Online]. Available: https://root.cern/doc/v616/.

[56] ALICE Collaboration (CERN), [Online]. Available: https://alice-doc.github.io/alice-analysis-tutorial. [Accessed 18 February 2019].

[57] CERN, “High Energy Physics Simulations,” [Online]. Available: http://lhcathome.web.cern.ch/projects/test4theory/high-energy-physics-simulations. [Accessed 26 July 2019].

[58] J. C. Watkins, An Introduction to the Science of Statistics: From Theory to Implementation, Preliminary Edition, University of Arizona, 2016.

[59] G. Cowan, Statistical Data Analysis, Oxford: Oxford University Press, 1998.

[60] I. Goodfellow, Y. Bengio and A. Courville, Deep Learning, Cambridge, Massachusetts: The MIT Press, 2016.

[61] Wikimedia Commons, “File:ArtificialNeuronModel english.png --- Wikimedia Commons, the free media repository,” 2017. [Online]. Available: https://commons.wikimedia.org/w/index.php?title=File:ArtificialNeuronModel_english.png&oldid=233886112. [Accessed 6 September 2019].

[62] F. Rosenblatt, “The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain,” Psychological Review, vol. 65, no. 6, 1958.

[63] P. Sibi, S. A. Jones and P. Siddarth, “Analysis of Different Activation Functions using Back Propagation Neural Networks,” Journal of Theoretical and Applied Information Technology, vol. 47, no. 3, pp. 1264-1268, 2013.

[64] A. LeNail, “NN-SVG,” [Online]. Available: https://alexlenail.me/NN-SVG/.

[65] T.-Y. Lin, P. Goyal, R. Girshick, K. He and P. Dollar, “Focal Loss for Dense Object Detection,” Computer Vision and Pattern Recognition, 2018.

[66] U. Griffo, “Github,” [Online]. Available: https://github.com/umbertogriffo/focal-loss-keras. [Accessed 21 September 2019].

[67] C. G. Viljoen, “Github Gist,” [Online]. Available: https://gist.github.com/PsycheShaman/ea39081d9f549ac410a3a8ea942a072b. [Accessed 30 September 2019].

[68] D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” in Published as a conference paper at the 3rd International Conference for Learning Representations, San Diego, 2015.

[69] S. Saha, “Towards Data Science,” [Online]. Available: https://towardsdatascience.com/a-comprehensive-guide-to-convolutional-neural-networks-the-eli5-way-3bd2b1164a53. [Accessed 26 November 2019].

[70] C. G. Viljoen, “Github,” [Online]. Available: https://github.com/PsycheShaman/MSc-thesis/tree/master/Code/ROOT/trdML-gerhard. [Accessed 4 December 2019].

[71] T. Dietel, 2016. [Online]. Available: https://github.com/tdietel/trdpid.

[72] P. Christakoglou, J. F. Grosse-Oetringhaus and C. Klein-Boesing, 2010. [Online]. Available: https://alice-offline.web.cern.ch/sites/alice-offline.web.cern.ch/files/uploads/AnalysisTrain/AliAnalysisTaskPt.cxx.

[73] C. Finlay, “Electron/Pion Classification for the ALICE TRD (Honours Thesis),” 2017. [Online]. Available: https://github.com/chrisfinlay/trdML [this repository has been archived].

[74] A. Barreiros, “How accurately can a neural network be trained to use data from the TRD detector in ALICE to identify electrons from pions (Third Year Project),” 2018. [Online]. Available: https://github.com/ForeverANoob/trdML [this repository has been archived].

[75] F. J. Valverde-Albacete, “100% Classification Accuracy Considered Harmful: The Normalized Information Transfer Factor Explains the Accuracy Paradox,” PLoS ONE, vol. 9, no. 1, p. e84217, 2014.

[76] C. G. Viljoen, “Github,” [Online]. Available: https://github.com/PsycheShaman/MSc-thesis/tree/master/Code/Particle_Identification. [Accessed 4 December 2019].

[77] “Scikit Learn,” [Online]. Available: https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/utils/class_weight.py. [Accessed 2 November 2019].

[78] J. J. H. Wilkison, “On the Application of Machine Learning Techniques to Particle Identification in the Transition Radiation Detector of the ALICE Experiment (Honours Thesis),” 2019.

[79] S. Sharma, “Towards Data Science,” [Online]. Available: https://towardsdatascience.com/activation-functions-neural-networks-1cbd9f8d91d6. [Accessed 4 December 2019].

[80] Q. Liu, F. Zhou, R. Hang and X. Yuan, “Bidirectional-Convolutional LSTM Based Spectral-Spatial Feature Learning for Hyperspectral Image Classification,” Remote Sensing, vol. 9, no. 12, p. 1330, 2017.

[81] Y. Pachmayer, “[Personal communication via email]”.

[82] Geant4 Collaboration, “Geant4--a simulation toolkit,” Nuclear Instruments and Methods in Physics Research, vol. 506, pp. 250-303, 2003.

[83] S. Agostinelli, J. Allison, J. Apostolakis and P. Arce, “Geant4 - a simulation toolkit,” Nuclear Instruments and Methods in Physics Research, vol. A 506, pp. 250-303, 2003.

[84] C. Doersch, “Tutorial on Variational Autoencoders,” ResearchGate, 2016.

[85] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville and Y. Bengio, “Generative Adversarial Nets,” 2014.

[86] A. Makhzani, I. Goodfellow, B. Frey, J. Shlens and N. Jaitly, “Adversarial Autoencoders,” 2016.

[87] C. G. Viljoen, “Github,” [Online]. Available: https://github.com/PsycheShaman/MSc-thesis/tree/master/Code/Latent_Variable_Models. [Accessed 4 December 2019].

[88] C. G. Viljoen, “Github,” [Online]. Available: https://github.com/PsycheShaman/MSc-thesis/tree/master/Code/ROOT/trdpid/sim. [Accessed 4 December 2019].

[89] A. Srivastava, L. Valkov, C. Russell and M. U. Gutmann, “VEEGAN: Reducing Mode Collapse in GANs using Implicit Variational Learning,” 31st Conference on Neural Information Processing Systems, 2017.

[90] S. Chintala, “How to Train a GAN? Tips and tricks to make GANs work,” [Online]. Available: https://github.com/soumith/ganhacks. [Accessed 4 December 2019].

[91] A. Khan, A. Sohail, U. Zahoora and A. S. Qureshi, “A Survey of the Recent Architectures of Deep Convolutional Neural Networks,” arXiv Preprint, 2019.

[92] K. Albertsson, S. Gleyzer, M. Huwiler, V. Ilievski, L. Moneta, S. Shekar, A. Vashista, S. Wunsch and O. A. Zapata Mesa, “New Machine Learning Developments in ROOT/TMVA,” EPJ Web of Conferences, vol. 214, p. 06014, 2019.

[93] “Interexperimental LHC Machine Learning Working Group,” [Online]. Available: https://iml.web.cern.ch/. [Accessed 4 December 2019].

[94] “3rd IML Machine Learning Workshop,” [Online]. Available: https://indico.cern.ch/event/766872/. [Accessed 4 December 2019].

[95] “MonALISA Repository for ALICE,” [Online]. Available: https://alimonitor.cern.ch. [Accessed 4 December 2019].

[96] C. G. Viljoen, “CERN Gitlab,” 2019. [Online]. Available: https://gitlab.cern.ch/cviljoen/msc-thesis-data. [Accessed 4 December 2019].

Chapter 9: Appendices

9 APPENDICES

APPENDIX I: RUN NUMBERS ANALYZED (FROM LHC16q)

• 000265309 • 000265332 • 000265334 • 000265335 • 000265336 • 000265338 • 000265339 • 000265342 • 000265343 • 000265344 • 000265377 • 000265378 • 000265381 • 000265383 • 000265385 • 000265388 • 000265419 • 000265420 • 000265425 • 000265426 • 000265499

APPENDIX II: DEEP LEARNING ARCHITECTURE DIAGRAMS

Figure 78: Most successful particle Identification Model Architecture (Stage 2 of Model Building)

Figure 79: Most successful particle identification neural network (Stage 3 of Model Building), incrementally trained on increasing momentum ranges, using Focal Loss as the objective function to be optimized; the data used to train this model was neither down-sampled nor up-sampled.

Figure 80: 2D CNN used to individually discriminate each type of simulated data from real data

(a) (b)

Figure 81: Most successful VAE architecture (a) Encoder, (b) Decoder

(a) (b)

Figure 82: Most successful GAN architecture: (a) Generator, and (b) Discriminator

(a) (b) (c)

Figure 83: Most successful Adversarial Autoencoder architecture. (a) Encoder (b) Decoder (c) Discriminator

APPENDIX III: LESS SUCCESSFUL GENERATIVE MODELS

Autoencoders

Figure 84: Original images shown in the top row, with their reconstructed auto-encoder versions beneath them.

A Variational Autoencoder with Convolutional Architecture:

• n_latent = 100
• Batch size = 64
• Optimizer: Adam, learning rate 휂 = 10⁻⁴
• Epochs = 1 000 000
• Sample size at each epoch = 64

Figure 85: VAE Encoder (left) and Decoder (right)

Here, the encoder returns, for each of the 100 latent dimensions: the mean 휇; the term ∑, calculated as 0.5 × the log-variance (so that 푒∑ equals the standard deviation); and 푧, which is the result of multiplying a random normal vector 휀 with 푒∑ and adding 휇, i.e.

푧 = (휀 × 푒∑ ) + 휇

Equation 43

The input to the decoder is a sampled z vector as defined above.
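Equation 43 is the standard VAE reparameterization trick. Below is a minimal NumPy sketch of the sampling step described above (the actual model was implemented in Keras; variable names here are illustrative):

```python
import numpy as np

def sample_z(mu, Sigma, rng=None):
    # Equation 43: z = (epsilon * e^Sigma) + mu, where Sigma is half the
    # log-variance, so e^Sigma is the per-dimension standard deviation.
    rng = np.random.default_rng() if rng is None else rng
    eps = rng.standard_normal(mu.shape)  # random normal vector epsilon
    return eps * np.exp(Sigma) + mu

# 100 latent dimensions and a batch size of 64, matching the settings above
mu = np.zeros((64, 100))
Sigma = np.zeros((64, 100))
z = sample_z(mu, Sigma)
print(z.shape)  # (64, 100)
```

Sampling 푧 this way, rather than drawing from the Gaussian directly, keeps the stochastic node outside the differentiable path, so gradients can flow through 휇 and ∑ during training.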

Below are six examples of simulated tracklet image data, produced by the VAE as explained above.

Figure 86: Six examples of simulated data created using a Variational Autoencoder with Convolutional Architecture

Boundary-Seeking Generative Adversarial Network

100 Latent dimensions

Adam optimizer with learning rate = 0.000001 and a batch size of 32

Generator with 8 hidden layers with 128, 256, 256, 256, 512, 512, 512 and 1024 nodes, using leaky ReLU activation and an output layer using a tanh activation

Convolutional discriminator using two convolutional layers, max-pooling and 5 hidden layers with 1024, 512, 256, 128 and 64 nodes and a single node output layer with sigmoid activation.
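The generator described above can be sketched as a plain feed-forward pass. The NumPy version below uses random (untrained) weights, and the 17 × 24 = 408-pixel output size is an assumption for illustration:

```python
import numpy as np

def leaky_relu(x, alpha=0.2):
    # alpha = 0.2 is an assumed slope for negative inputs
    return np.where(x > 0, x, alpha * x)

def generator_forward(z, rng):
    """Forward pass through the generator layer sizes listed above.
    Random weights and the 408-pixel output are illustrative assumptions;
    the trained model's weights are learned, not sampled."""
    sizes = [100, 128, 256, 256, 256, 512, 512, 512, 1024, 17 * 24]
    h = z
    for i in range(len(sizes) - 1):
        W = rng.standard_normal((sizes[i], sizes[i + 1])) * 0.02
        h = h @ W
        if i < len(sizes) - 2:
            h = leaky_relu(h)   # hidden layers: leaky ReLU
        else:
            h = np.tanh(h)      # output layer: tanh, range (-1, 1)
    return h

rng = np.random.default_rng(0)
fake = generator_forward(rng.standard_normal((32, 100)), rng)
print(fake.shape)  # (32, 408)
```

The tanh output layer matches image data that has been rescaled to the interval [−1, 1], a common convention in GAN training.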

Example output after 29000 epochs:

Other Adversarial Autoencoders

AAE Version 1

10 Latent dimensions

Adam optimizer with learning rate = 0.002 and beta1 = 0.5

Batch size = 32

Encoder with 2 hidden layers with 512 nodes each, using leaky ReLU activation

Decoder with 2 hidden layers with 512 nodes each, using leaky ReLU activation and an output layer with tanh activation

Discriminator with two hidden layers, with 512 and 256 nodes respectively and sigmoid activation in the single-node output layer

Sample result after 19800 epochs:

AAE Version 2

100 Latent dimensions

Adam optimizer with learning rate = 0.00002 and beta1 = 0.5

Batch size = 32

Encoder with 3 hidden layers with 512 nodes each, using leaky ReLU activation

Decoder with 3 hidden layers with 512 nodes each, using leaky ReLU activation and an output layer with tanh activation

Discriminator with two hidden layers, with 512 and 256 nodes respectively and sigmoid activation in the single-node output layer

Sample result after 30800 epochs:

AAE Version 3

8 Latent dimensions

Adam optimizer with learning rate = 0.000002 and beta1 = 0.5

Batch size = 32

Encoder with 7 hidden layers with 1024, 512, 256, 128, 64, 32 and 16 nodes respectively, using leaky ReLU activation

Decoder with 4 hidden layers with 128, 256, 512 and 1024 nodes respectively, using leaky ReLU activation and an output layer with tanh activation

Discriminator with 4 hidden layers, with 1024, 512, 256 and 128 nodes respectively and sigmoid activation in the single-node output layer

Sample result after 23600 epochs:

AAE Version 4

12 Latent dimensions

Adam optimizer with learning rate = 0.000002 and beta1 = 0.5

Batch size = 32

Encoder with 5 hidden layers with 1024, 512, 512, 128 and 128 nodes respectively, using leaky ReLU activation

Decoder with 4 hidden layers with 256, 256, 512 and 1024 nodes respectively, using leaky ReLU activation and an output layer with tanh activation

Discriminator with 8 hidden layers, with 512 nodes each and sigmoid activation in the single-node output layer

Sample result after 73200 epochs:

Bidirectional Generative Adversarial Network

100 Latent dimensions

Encoder with 2 hidden layers with 512 units each, using Leaky ReLU activation and employing Batch Normalization with momentum = 0.8 after each

Generator with the same architecture as the encoder, with an additional dense layer of the same dimensions as the number of pixels in an image, with tanh activation

Discriminator with 3 hidden layers with 1024 nodes each using leaky relu activation and a single node output layer with sigmoid activation function

Trained using Adam Optimizer with Learning Rate = 0.0002 and Beta1=0.9 and a batch size of 32

Example output after 39600 epochs:

Deep Convolutional Generative Adversarial Network

100 Latent dimensions

Generator with Dense layer with 3072 nodes, reshaped to 4 × 6 × 128 with 2D Upsampling, followed by 3 2D convolutional layers, with the first two followed by Batch normalization and 2D Upsampling, using ReLU activation in the first 4 layers and tanh in the final output layer

Discriminator with 4 2D convolutional layers, with Batch Normalization applied after the second, third and fourth layer and zero-padding after the second layer, using leaky ReLU activation in the hidden layer and sigmoid in the output node.

Example output after 57500 epochs:

Generative Adversarial Network

100 Latent dimensions

Two separate Adam Optimizers used for Generator and Discriminator, i.e. with learning rate = 0.00002 and beta1=0.5 for Discriminator and learning rate = 0.00001 and beta1=0.5 for Generator

Generator with 12 hidden layers with 100, 200, 300, 400, 500, 600, 700, 800, 900, 1000, 1100 and 1200 hidden units, using leaky ReLU activation and a dense output layer of the desired output image dimensions with tanh activation, including Batch Normalization with momentum = 0.8 after each hidden layer

Discriminator with 7 hidden layers with 1024, 1024, 512, 512, 256, 256 and 128 units, respectively, using leaky ReLU activation and a single node output layer using sigmoid activation.

Example Output after 66400 epochs:

Note that these are by no means all of the unsuccessful generative models that were built: the ones shown here could be judged unsuccessful by visual inspection alone, without numerical assessment of their results, and many more models performed even worse.

APPENDIX IV: EXAMPLE PYTHON DICTIONARY FOR A SINGLE TRACK

1: {'RunNumber': 265309,

'Event': 197,

'V0TrackID': 13,

'track': 14,

'pdgCode': 211,

'nSigmaElectron': -3.72201,

'nSigmaPion': 0.0709847,

'PT': 3.10114,

'dEdX': 60.5312,

'P': 3.10138,

'Eta': -0.012371,

'Theta': 1.58317,

'Phi': 1.05959,

'det0': 72,

'row0': 6,

'col0': 125,

'layer 0': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[14, 60, 63, 43, 30, 24, 21, 19, 17, 16, 17, 18, 19, 16, 19, 20, 19, 18, 17, 22, 30, 51, 58, 40, ],

[14, 66, 81, 57, 37, 30, 27, 26, 22, 26, 24, 23, 26, 25, 35, 38, 37, 37, 36, 70, 91, 193, 286, 186, ],

[10, 15, 16, 13, 10, 10, 10, 9, 8, 10, 10, 12, 11, 11, 13, 14, 15, 13, 20, 39, 54, 70, 86, 72, ],

[10, 10, 11, 9, 11, 9, 8, 6, 8, 8, 8, 11, 10, 9, 9, 11, 8, 11, 12, 12, 13, 17, 16, 16, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],

'det1': 73,

'row1': 6,

'col1': 123,

'layer 1': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[13, 37, 28, 21, 22, 18, 20, 17, 17, 18, 16, 16, 15, 12, 12, 12, 16, 16, 16, 12, 17, 12, 16, 12, ],

[17, 64, 50, 38, 52, 39, 33, 36, 33, 35, 32, 30, 29, 26, 25, 28, 46, 65, 45, 36, 42, 35, 30, 22, ],

[13, 15, 10, 11, 15, 13, 10, 10, 14, 15, 10, 10, 14, 14, 11, 11, 18, 22, 20, 17, 21, 20, 15, 18, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],

'det2': 74,

'row2': 6,

'col2': 122,

'layer 2': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[12, 22, 19, 13, 14, 14, 12, 10, 15, 14, 13, 15, 13, 12, 11, 12, 12, 13, 14, 14, 13, 14, 11, 12, ],

[14, 68, 48, 31, 25, 31, 32, 29, 50, 41, 39, 49, 45, 37, 32, 28, 30, 32, 38, 48, 35, 32, 32, 30, ],

[11, 16, 13, 10, 11, 14, 12, 11, 18, 15, 16, 19, 19, 17, 13, 14, 17, 16, 21, 26, 23, 20, 20, 20, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],

'det3': 75,

'row3': 6,

'col3': 120,

'layer 3': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[10, 13, 14, 13, 11, 12, 11, 11, 11, 12, 10, 10, 11, 11, 9, 10, 10, 12, 11, 7, 9, 9, 10, 10, ],

[13, 13, 58, 47, 32, 29, 24, 24, 30, 40, 33, 26, 28, 33, 25, 25, 22, 23, 25, 24, 23, 20, 19, 20, ],

[10, 12, 29, 23, 18, 18, 15, 17, 25, 29, 26, 20, 22, 32, 26, 23, 22, 24, 29, 32, 29, 26, 24, 23, ],

[13, 10, 11, 11, 11, 10, 10, 11, 9, 10, 10, 9, 11, 9, 10, 11, 11, 8, 12, 8, 8, 10, 10, 8, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],

'det4': 76,

'row4': 6,

'col4': 118,

'layer 4': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[12, 12, 11, 10, 12, 10, 13, 11, 9, 12, 11, 13, 9, 11, 11, 11, 11, 8, 12, 9, 11, 7, 9, 7, ],

[10, 38, 46, 31, 29, 23, 40, 29, 25, 20, 18, 42, 52, 38, 46, 45, 28, 25, 24, 24, 20, 17, 18, 19, ],

[10, 30, 40, 31, 28, 25, 46, 36, 26, 21, 21, 72, 115, 72, 107, 117, 69, 51, 50, 58, 51, 39, 36, 36, ],

[11, 12, 11, 13, 14, 10, 11, 12, 11, 9, 10, 15, 17, 14, 21, 16, 14, 12, 15, 14, 13, 11, 11, 11, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],

'det5': 77,

'row5': 6,

'col5': 117,

'layer 5': [[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[12, 28, 22, 16, 12, 13, 12, 12, 13, 13, 12, 16, 13, 17, 14, 13, 15, 11, 13, 13, 14, 13, 13, 12, ],

[17, 60, 51, 33, 23, 23, 24, 28, 28, 26, 23, 38, 44, 67, 52, 35, 51, 53, 37, 42, 39, 34, 31, 29, ],

[10, 15, 11, 10, 11, 11, 11, 10, 11, 12, 13, 13, 15, 22, 15, 16, 17, 20, 13, 19, 18, 16, 18, 18, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, ],

],},
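A track dictionary in this format can be inspected directly in Python. The sketch below builds a minimal synthetic track (real tracks are parsed from the pythonDict.txt file produced on the grid, as described in Appendix V) and checks the tracklet geometry of 17 pad rows × 24 time bins shown above:

```python
import numpy as np

# Minimal synthetic track in the format shown above; values are placeholders.
track = {
    'RunNumber': 265309,
    'pdgCode': 211,                           # 211 = pion, 11 = electron
    'P': 3.10138,                             # momentum in GeV/c
    'layer 0': [[0] * 24 for _ in range(17)]  # 17 pad rows x 24 time bins
}

tracklet = np.array(track['layer 0'])
print(tracklet.shape)  # (17, 24)
print(tracklet.sum())  # total ADC signal in this tracklet: 0 here
```

Stacking the six per-layer arrays of a track in this way yields the image-like input used by the convolutional models in this thesis.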

APPENDIX V: DATA EXTRACTION FROM THE WORLDWIDE LHC COMPUTING GRID (WLCG)

Here, the AliROOT files used to extract the TRD digits data for this thesis are discussed in detail, along with the workflow used to run them and to retrieve their output from the Worldwide LHC Computing Grid.

Header File (trdML-gerhard/AliTRDdigitsExtract.h)

This header file defines the AliTRDdigitsExtract class, which inherits from AliTRDdigitsTask.

The following public members are defined:

Methods: Setv0KineCuts()

This allows for the exclusion of 푉0 candidates based on kinematic parameters, such as momentum.

Variables: DigitsDictionary

This is a data structure containing track- and other information.

As well as the following protected members:

Methods: FillV0PIDlist(); AnalyseEvent()

The names of these methods are relatively self-explanatory.

Variables: AliPIDResponse* fPIDResponse; Int_t fEventNoInFile; Int_t universalTracki

The first variable is an object holding object arrays of electron and pion tracks arising from 푉0-decays.

The following two variables are iterators which are incremented during looping.

The Implementation File (trdML-gerhard/AliTRDdigitsExtract.cxx)

An output file stream redirects the C++ stdout (which would normally be printed to the console) to a file: "pythonDict.txt".

This implementation file also creates histograms, which were ignored for the purposes of this project.

Input events are looped through: for non-empty events, the digits for each 푉0 track are read; then, if either track originating from the 푉0 vertex is an electron (푒+/−, PDG code 11) or a pion (휋+/−, PDG code 211), it is added to the DigitsDictionary and to either the fV0pions or fV0electrons object arrays held by fPIDResponse.
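The selection logic above can be sketched as follows; select_v0_daughter is a hypothetical helper (not part of AliTRDdigitsExtract, which is written in C++), and the 1.5 GeV/c momentum threshold is the kinematic cut applied to DigitsDictionary tracks:

```python
# Hypothetical helper mirroring the selection described above: a V0
# daughter track is kept only if its PDG code identifies it as an
# electron (11) or a pion (211); abs() handles both charge signs.
ELECTRON, PION = 11, 211

def select_v0_daughter(pdg_code, p, p_min=1.5):
    """Return 'electron', 'pion', or None; p_min (GeV/c) is the
    kinematic cut applied to DigitsDictionary tracks."""
    if p < p_min:
        return None
    if abs(pdg_code) == ELECTRON:
        return 'electron'
    if abs(pdg_code) == PION:
        return 'pion'
    return None

print(select_v0_daughter(211, 3.1))   # 'pion'
print(select_v0_daughter(-11, 2.0))   # 'electron'
print(select_v0_daughter(211, 0.5))   # None (fails momentum cut)
```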

Tracks in the DigitsDictionary were skipped if they did not pass the kinematic cut 푝 ≥ 1.5 GeV/푐, or if they did not contain any tracklet information.

The Analysis Macro (trdML-gerhard/ana.C)

The analysis macro serves to compile the Analysis Tasks defined in the previously discussed files. It loads and interprets the required libraries, header files and environment variables from $ROOT, $AliROOT and $AliPhysics, as well as the macro to be executed.

It defines the version of AliPhysics to use and, when set to run on the grid instead of locally, it sets the appropriate directory that contains the ESDs.root files that should be analysed.

It is recommended to run ana.C for a single run number at a time, since there are user quotas that will result in your jobs being assigned a lower priority if you use too much virtual memory, etc. ana.C also defines the output directory on the AliEn grid where the results of your analysis will be stored, and then starts the analysis on the distributed architecture of the Worldwide LHC Computing Grid (WLCG).

Running Jobs on the ALICE Grid

Finally, once the AliPhysics environment has been entered and a token initialised to grant access to the AliEn grid, the analysis macro ana.C can be run to submit jobs, terminate jobs and merge sub-jobs at various stages, as explained in [74].

Jobs were submitted using ana.C onto the WLCG and monitored using the ALICE grid monitoring site, MonALISA [95]. This monitoring website is useful, since sub-jobs quite often fail and can then be resubmitted before the job times out. Upon completion of each job, the data produced was copied back onto Hep01 from within the aliensh environment, using alien_cp, the AliEn analogue of the bash command scp.

The data produced during this process was backed up in a semi-private GitLab repository [96], internally accessible by CERN members.

The data extraction process that was employed for this project, along with the attendant Git repositories, is summarised in Figure 87.

Figure 87: Data extraction and data pre-processing environment (the specifics have changed somewhat during the progress of this thesis)

ACKNOWLEDGEMENTS

***

Firstly, I would like to thank my father, Christiaan Gerhardus Viljoen, for all the support – material, emotional and financial – he has selflessly provided to me throughout my life, and particularly towards my higher education journey. You have no idea how much appreciation I have for all the sacrifices you have made for me, and all the advice you have given me.

Secondly, I want to thank my aunt, Professor Emma Ruttkamp-Bloem, for all the mentoring she has provided to me in navigating the world of academia, and for the inspiration that her academic career instils in me.

Thirdly, I want to thank Dr Thomas Dietel for providing me with this immense opportunity to be part of the largest scientific experiment in human history, and for the rigorous scientific guidance that he has, and continues to provide to me. I think I potentially learned a lot more in the final stretch of this project from your comments than I did during most of the rest of the project. Your comments helped to tie all the different concepts in this thesis together into a cohesive whole and I am truly excited about a potential PhD in the not-too-distant future. You have certainly prepared me for it!

Lastly, I would like to thank my larger family, on both my father’s and mother’s side, for providing the loving and stable environment that makes any place we assemble Home. I want to especially mention my grandmothers, Rynet Bloem and Maatjè Catherina Mare Dinkelmann, whom we are lucky to still have in our lives.

***

Computations were performed using facilities provided by the University of Cape Town’s ICTS High-Performance Computing team: hpc.uct.ac.za

The AliROOT/AliPhysics installation on the Hep01 server at UCT, maintained by Dr Thomas Dietel, was used to run AliROOT and Geant4 implementations.

Travel to CERN was paid for by iThemba Labs via the SA-CERN agreement.

***
