Practical Statistics for Particle Physics

Kyle Cranmer, New York University
Center for Cosmology and Particle Physics
CERN School HEP, Romania, Sept. 2011

Lecture 3

Outline

Lecture 1: Building a probability model
‣ preliminaries, the marked Poisson process
‣ incorporating systematics via nuisance parameters
‣ constraint terms
‣ examples of different “narratives” / search strategies
Lecture 2: Hypothesis testing
‣ simple models, Neyman-Pearson lemma, and likelihood ratio
‣ composite models and the profile likelihood ratio
‣ review of ingredients for a hypothesis test
Lecture 3: Limits & Confidence Intervals
‣ the meaning of confidence intervals as inverted hypothesis tests
‣ asymptotic properties of likelihood ratios
‣ Bayesian approach

LEP Higgs

A simple likelihood ratio with no free parameters:

$$Q = \frac{L(x|H_1)}{L(x|H_0)} = \frac{\prod_i^{N_{chan}} \mathrm{Pois}(n_i|s_i+b_i) \prod_j^{n_i} \frac{s_i f_s(x_{ij}) + b_i f_b(x_{ij})}{s_i+b_i}}{\prod_i^{N_{chan}} \mathrm{Pois}(n_i|b_i) \prod_j^{n_i} f_b(x_{ij})}$$

$$q = \ln Q = -s_{tot} + \sum_i^{N_{chan}} \sum_j^{n_i} \ln\left(1 + \frac{s_i f_s(x_{ij})}{b_i f_b(x_{ij})}\right)$$

[Figure (LEP): -2 ln(Q) versus m_H (GeV/c^2), and the probability density of -2 ln(Q) for m_H = 115 GeV/c^2; each panel shows the observed result, the expectation for background, and the expectation for signal plus background.]

The Test Statistic and its distribution

Consider this schematic diagram:

[Figure: probability density of the test statistic for the signal + background and background-only hypotheses, with the observed value marked; the tail areas are labelled 1-CL_b and CL_s+b; the test statistic axis runs from signal-like to background-like.]

The “test statistic” is a single number that quantifies the entire experiment. It could just be the number of events observed, but often it's more sophisticated, like a likelihood ratio.
What test statistic do we choose?
And how do we build the distribution? Usually “toy Monte Carlo”, but what about the uncertainties... what do we do with the nuisance parameters?

An example

Essentially, you need to fit your model to the data twice: once with everything floating, and once with signal fixed to 0.

$$\lambda(\mu=0) = \frac{P(m, a \,|\, \mu=0, \hat{\hat{\nu}}(\mu=0; m, a))}{P(m, a \,|\, \hat{\mu}, \hat{\nu})}$$

[Figure: fits for VBF H(120) → ττ → lh (ATLAS, √s = 14 TeV, 30 fb^-1), Events / (5 GeV) versus M_ττ (GeV); one panel for the fit with everything floating, $P(m, a \,|\, \hat{\mu}, \hat{\nu})$, and one for the fit with signal fixed to 0, $P(m, a \,|\, \mu=0, \hat{\hat{\nu}}(\mu=0; m, a))$.]

where the a_i are the parameters used to parameterize the fake-tau background and ν represents all nuisance parameters of the model: σ_H, m_Z, σ_Z, r_QCD, a_1, a_2, a_3. When using the alternate parameterization of the signal, the exact form of Equation 14 is modified to coincide with parameters of that model.
Figure 14 shows the fit to the signal candidates for a m_H = 120 GeV Higgs with (a,c) and without (b,d) the signal contribution. It can be seen that the background shapes and normalizations are trying to accommodate the excess near m_ττ = 120 GeV, but the control samples are constraining the variation. Table 13 shows the significance calculated from the profile likelihood ratio for the ll-channel, the lh-channel, and the combined fit for various Higgs boson masses.

[Figure 14 panels: (a), (b) VBF H(120) → ττ → lh; (c), (d) VBF H(120) → ττ → ll; ATLAS, √s = 14 TeV, 30 fb^-1; Events / (5 GeV) versus M_ττ (GeV).]

Figure 14: Example fits to a data sample with the signal-plus-background (a,c) and background only (b,d) models for the lh- and ll-channels at m_H = 120 GeV with 30 fb^-1 of data. Not shown are the control samples that were fit simultaneously to constrain the background shape. These samples do not include pileup.

Properties of the Profile Likelihood Ratio

After a close look at the profile likelihood ratio

$$\lambda(\mu) = \frac{P(m, a \,|\, \mu, \hat{\hat{\nu}}(\mu; m, a))}{P(m, a \,|\, \hat{\mu}, \hat{\nu})}$$

one can see the function is independent of true values of ν
‣ though its distribution might depend indirectly
Wilks’s theorem states that under certain conditions the distribution of $-2 \ln \lambda(\mu=\mu_0)$, given that the true value of µ is µ_0, converges to a chi-square distribution
‣ more on this tomorrow, but the important points are:
‣ “asymptotic distribution” is known and it is independent of ν!
  ● more complicated if parameters have boundaries (eg. µ ≥ 0)
Thus, we can calculate the p-value for the background-only hypothesis without having to generate Toy Monte Carlo!

Toy Monte Carlo

Explicitly build the distribution by generating “toys” / pseudo-experiments assuming a specific value of µ and ν.
‣ randomize both main measurement m and auxiliary measurements a
‣ fit the model twice for the numerator and denominator of profile likelihood ratio
‣ evaluate $-2 \ln \lambda(\mu)$ and add to histogram
Choice of µ is straightforward: typically µ=0 and µ=1, but choice of ν is less clear
‣ more on this tomorrow
This can be very time consuming. Plots below use millions of toy pseudo-experiments on a model with ~50 parameters.
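
The toy procedure described above can be sketched on a much smaller model than the ~50-parameter one on the slide. The following is only an illustrative sketch under stated assumptions: a single counting channel n ~ Pois(µ·s + b) with one auxiliary (control-region) measurement a ~ Pois(τ·b), µ left unbounded so that the chi-square limit applies, and the nuisance parameter b held at a chosen true value when generating toys. The function names (`poisson`, `log_pois`, `q0`, `run_toys`, `chi2_1dof_tail`) and all numerical values are hypothetical and do not come from the lecture.

```python
import math
import random


def poisson(rng, lam):
    """Draw one Poisson variate with Knuth's method (adequate for modest means)."""
    limit = math.exp(-lam)
    k, p = 0, rng.random()
    while p > limit:
        k += 1
        p *= rng.random()
    return k


def log_pois(n, lam):
    """ln Pois(n | lam); a zero rate is only compatible with n = 0."""
    if lam <= 0.0:
        return 0.0 if n == 0 else -math.inf
    return n * math.log(lam) - lam - math.lgamma(n + 1)


def q0(n, a, tau):
    """-2 ln lambda(mu=0) for n ~ Pois(mu*s + b), a ~ Pois(tau*b), mu unbounded."""
    # Denominator fit (everything floating): the fitted rates equal the counts.
    lnl_free = log_pois(n, n) + log_pois(a, a)
    # Numerator fit (mu fixed to 0): conditional MLE of the background b.
    b_cond = (n + a) / (1.0 + tau)
    lnl_cond = log_pois(n, b_cond) + log_pois(a, tau * b_cond)
    return max(0.0, 2.0 * (lnl_free - lnl_cond))


def run_toys(n_toys, b_true, tau, seed=1):
    """Background-only toys: randomize both the main count n and the auxiliary a."""
    rng = random.Random(seed)
    return [q0(poisson(rng, b_true), poisson(rng, tau * b_true), tau)
            for _ in range(n_toys)]


def chi2_1dof_tail(q):
    """Asymptotic (Wilks) tail probability P(chi2 with 1 dof > q)."""
    return math.erfc(math.sqrt(q / 2.0))


# Hypothetical numbers: true background 50, control region of equal size.
toys = run_toys(n_toys=5000, b_true=50.0, tau=1.0)
q_obs = q0(n=70, a=50, tau=1.0)
p_toy = sum(q >= q_obs for q in toys) / len(toys)
print(f"q_obs = {q_obs:.3f}, toy p-value = {p_toy:.4f}, "
      f"asymptotic = {chi2_1dof_tail(q_obs):.4f}")
```

In this toy model both fits have closed-form solutions, so no numerical minimizer is needed; a realistic model would replace `q0` with two numerical fits per toy, which is where the time goes. Because µ is unbounded here, the p-value is two-sided in µ.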
[Figure: distributions of the test statistic (profile likelihood ratio) from toy experiments for signal-plus-background and background, with the data value marked; 2-channel and 5-channel panels, annotated 3.35σ and 4.4σ.]

What makes a statistical method

To describe a statistical method, you should clearly specify
‣ choice of a test statistic
  ● simple likelihood ratio (LEP): $Q_{LEP} = L_{s+b}(\mu=1) / L_b(\mu=0)$
  ● ratio of profiled likelihoods (Tevatron): $Q_{TEV} = L_{s+b}(\mu=1, \hat{\hat{\nu}}) / L_b(\mu=0, \hat{\hat{\nu}})$
  ● profile likelihood ratio (LHC): $\lambda(\mu) = L_{s+b}(\mu, \hat{\hat{\nu}}) / L_{s+b}(\hat{\mu}, \hat{\nu})$
‣ how you build the distribution of the test statistic
  ● toy MC randomizing nuisance parameters according to π(ν)
    • aka Bayes-frequentist hybrid, prior-predictive, Cousins-Highland
  ● toy MC with nuisance parameters fixed (Neyman Construction)
  ● assuming asymptotic distribution (Wilks and Wald, more tomorrow)
‣ what condition you use for limit or discovery
  ● more on this tomorrow

Experimentalist Justification

So far this looks a bit like magic. How can you claim that you incorporated your systematic just by fitting the best value of your uncertain parameters and making a ratio?
It won’t work unless the parametrization is sufficiently flexible.
So check by varying the settings of your simulation, and see if the profile likelihood ratio is still distributed as a chi-square.

[Figure: probability (log scale) versus log likelihood ratio for several simulation settings: Nominal (Fast Sim), Smeared P_T^miss, Q^2 scale 1, Q^2 scale 2, Q^2 scale 3, Q^2 scale 4, Leading-order tt, Leading-order WWbb, Full Simulation; ATLAS, ∫L dt = 10 fb^-1.]

Here it is pretty stable, but it’s not perfect (and this is a log plot, so it hides some pretty big discrepancies).
For the distribution to be independent of the nuisance parameters your parametrization must be sufficiently flexible.

A very important point

If we keep pushing this point to the extreme, the physics problem goes beyond what we can handle practically.
The p-values are usually predicated on the assumption that the true distribution is in the family of functions being considered
‣ eg. we have sufficiently flexible models of signal & background to incorporate all systematic effects
‣ but we don’t believe we simulate everything perfectly
‣ ...and when we parametrize our models usually we have further approximated our simulation.
  ● nature -> simulation -> parametrization
At some point these approaches are limited by honest systematic uncertainties (not statistical ones). Statistics can only help us so much after this point. Now we must be physicists!

Confidence Intervals (Limits)

Confidence Interval

What is a “Confidence Interval”?
‣ you see them all the time:

[Figure: 68% CL contours in the (m_t [GeV], m_W [GeV]) plane for LEP1 and SLD and for LEP2 and Tevatron (prel.), with bands labelled m_H [GeV] = 114, 300, 1000 and Δα.]

Want to say there is a 68% chance that the true value of (m_W, m_t) is in this interval
‣ but that’s P(theory|data)!
Correct frequentist statement is that the interval covers the true value 68% of the time
‣ Bayesian “credible interval” does mean probability parameter is in the interval
‣ remember, the contour is a function of the data, which is random.
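
The frequentist coverage statement can be checked numerically. The sketch below is an illustration under simple assumptions, not the construction behind the contour on the slide: a single Gaussian measurement x of a parameter θ with known resolution σ, and the interval [x - σ, x + σ]. The names `interval` and `coverage` and the numerical values are hypothetical.

```python
import random


def interval(x, sigma):
    """68% central interval for a Gaussian measurement x with known sigma."""
    return x - sigma, x + sigma


def coverage(theta_true, sigma, n_experiments, seed=2):
    """Fraction of repeated experiments whose interval contains theta_true."""
    rng = random.Random(seed)
    covered = 0
    for _ in range(n_experiments):
        x = rng.gauss(theta_true, sigma)   # the data are random ...
        lo, hi = interval(x, sigma)        # ... so the interval is random too
        covered += lo <= theta_true <= hi
    return covered / n_experiments


# Hypothetical true value and resolution, chosen only for illustration.
frac = coverage(theta_true=80.4, sigma=0.03, n_experiments=20000)
print(f"fraction of intervals covering the true value: {frac:.3f}")
```

The true value stays fixed throughout; what varies from experiment to experiment is the interval, and the fraction of intervals containing the true value should come out close to 68%.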
