Search for pair-produced vectorlike B quarks in proton-proton collisions at √s = 8 TeV
The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters.
Citation Khachatryan, V. et al. “Search for Pair-Produced Vectorlike B Quarks in Proton-Proton Collisions at √s = 8 TeV.” Physical Review D 93.11 (2016): n. pag. © 2016 CERN for the CMS Collaboration
As Published http://dx.doi.org/10.1103/PhysRevD.93.112009
Publisher American Physical Society
Version Final published version
Citable link http://hdl.handle.net/1721.1/106854
Terms of Use Creative Commons Attribution 3.0 Unported licence
Detailed Terms http://creativecommons.org/licenses/by/3.0/ PHYSICAL REVIEW D 93, 112009 (2016) Search for pair-produced vectorlikepffiffi B quarks in proton-proton collisions at s =8 TeV
V. Khachatryan et al.* (CMS Collaboration) (Received 25 July 2015; published 15 June 2016) A search for the production of a heavy B quark, having electric charge −1=3 and vector couplings to W, Z, and H bosons, is carried out using proton-proton collision data recorded at the CERN LHC by the CMS experiment, corresponding to an integrated luminosity of 19.7 fb−1. The B quark is assumed to be pair produced and to decay in one of three ways: to tW, bZ,orbH. The search is carried out in final states with one, two, and more than two charged leptons, as well as in fully hadronic final states. Each of the channels in the exclusive final-state topologies is designed to be sensitive to specific combinations of the B quark- antiquark pair decays. The observed event yields are found to be consistent with the standard model expectations in all the final states studied. A statistical combination of these results is performed, and upper limits are set on the cross section of the strongly produced B quark-antiquark pairs as a function of the B quark mass. Lower limits on the B quark mass between 740 and 900 GeVare set at a 95% confidence level, depending on the values of the branching fractions of the B quark to tW, bZ, and bH. Overall, these limits are the most stringent to date.
DOI: 10.1103/PhysRevD.93.112009
I. INTRODUCTION This paper describes the search for a vectorlike B quark −1=3 The discovery of a Higgs boson [1–3] has intensified the of electric charge , using data recorded by the CMS search for physics beyond the standard model (SM). experiment from proton-proton collisions at a center-of- Various extensions of the SM predict the existence of mass energy of 8 TeV at the CERN LHC in 2012. It is B new heavy quarks, which arise quite naturally in grand assumed that quark-antiquark pairs are produced strongly unification schemes [4] and in composite Higgs [5,6], little for B quark masses within the range of this search, which Higgs [7–10], and top quark condensate [11] models. The extends to 1 TeV. The B quark may decay either via the couplings to the SM gauge bosons of the left- and right- charged-current process B → tW or via the FCNC proc- handed components of these quarks are symmetric, esses B → bZ and B → bH. Feynman diagrams for the B so they are called vectorlike [12]. Vectorlike quarks may quark pair production and decay processes are shown in be singlets, doublets, or triplets under the electroweak Fig. 1. Searches are performed in several different final SUð2Þ × Uð1Þ transformation [13]. They have bare mass states, including those containing single leptons, lepton terms that are invariant under the electroweak gauge pairs (dileptons) of opposite or identical charge, three or transformation [14]. Moreover, their couplings to the scalar more leptons, or consisting entirely of hadronic activity sector are independent of mass. Thus, the existence of without any identified leptons. We search for an excess of vectorlike quarks is not ruled out by the recent discovery of events over the backgrounds in mutually exclusive final a Higgs boson, in contrast to additional quarks in more states and set limits on the pair-production cross section conventional fourth-generation models [15]. for all values of the B quark branching fractions with the In several beyond the standard model scenarios, vector- constraint BðB → tWÞþBðB → bZÞþBðB → bHÞ¼1. like quarks are considered partners of the top and bottom Experimental searches for a vectorlike B quark have quarks [16]. Both the charged-current [17] and the flavor previously been reported by experiments at the Fermilab changing neutral current (FCNC) [18,19] decay processes Tevatron and the CERN LHC. A 95% C.L. lower limit on are allowed. The ratio of the predicted rates depends on the the mass of the B quark was set at 268 GeV by the CDF model: in some models the FCNC process dominates [20]; Collaboration [21]. Recently the ATLAS Collaboration set in others the two modes are comparable in rate. lower limits on the B quark mass ranging from 575 to 813 GeV, for different combinations of the B quark branching fractions [22]. The present analysis improves *Full author list given at the end of the article. upon these existing results, setting the most stringent limits to date on the mass of the vectorlike B quark. Published by the American Physical Society under the terms of the Creative Commons Attribution 3.0 License. Further distri- The paper is divided into several sections. Section II bution of this work must maintain attribution to the author(s) and gives an overview of the CMS detector. Section III the published article’s title, journal citation, and DOI. describes the details of the simulations used for signal
2470-0010=2016=93(11)=112009(32) 112009-1 © 2016 CERN, for the CMS Collaboration V. KHACHATRYAN et al. PHYSICAL REVIEW D 93, 112009 (2016) the CMS detector, together with a definition of the coordinate system used and the relevant kinematic varia- bles, can be found in Ref. [23].
III. SIGNAL AND BACKGROUND SIMULATION The following section details the simulation methods used to generate events for modeling the signal and background processes. One of the main backgrounds in many of the channels is SM tt¯ production. This process is simulated with the MADGRAPHv5.1.1event generator [24], using the CTEQ6L1 parton distribution function (PDF) [25]. Events are interfaced with PYTHIAv6 [26] for shower modeling and hadronization. These simulation methods are used for the W þ jets and Z þ jets samples, in addition to SM tt¯ production. For W þ jets and Z þ jets events, up to four additional partons are allowed at the matrix element level during generation. WW WZ ZZ B Diboson processes , ,and are generated with FIG. 1. Feynman diagrams for the dominant quark pair- tW s B PYTHIA 6.424, and single top quark processes ( , -channel, production process (top) and for the quark decay modes t – (bottom). and -channel) are generated using POWHEG 1.0 [27 30] and interfaced with PYTHIA for shower modeling and hadroniza- tion. Both the diboson and single top processes are generated and background processes. Section IV describes the with the CTEQ6M PDF set. The rare processes ttW¯ , ttZ¯ ,and ¯ reconstruction of physics objects and the event selections tbZ are simulated withMADGRAPHv5. specific to each individual channel considered in this Normalizations for the background processes are ini- analysis. Section V describes background estimation tech- tially set according to theoretical predictions and are niques for each of the channels, as well as the specific allowed to vary within the corresponding uncertainties B methods used to discriminate the quark signal from the during cross section limit extraction. For W þ jets and background, while Sec. VI describes the systematic uncer- Z þ jets processes, we use the calculations found in tainties evaluated for each channel and their treatment in Refs. [31–33].Fortt¯ and single top samples, we normalize combination. Finally, Sec. VII provides further details on using cross sections calculated in Refs. [34] and [35], the combination of analysis channels, and Sec. VIII respectively. Finally, for diboson and rare processes, we use presents the results obtained from this analysis. A summary cross sections computed in Refs. [36] and [37,38], is presented in Sec. IX. respectively. To model the kinematic properties of the pp → BB¯ II. CMS DETECTOR signal process, we use samples of simulated events pro- The central feature of the CMS apparatus is a super- duced with the MADGRAPHv5 generator and CTEQ6L1 PDF conducting solenoid of 6 m internal diameter, providing a set, allowing for up to two additional partons in the final magnetic field of 3.8 T. Within the solenoid volume are a state of the hard scatter matrix element. The generated silicon pixel and strip tracker, a lead tungstate crystal events are then interfaced with PYTHIAv6 for parton shower electromagnetic calorimeter (ECAL), and a brass and modeling and hadronization. scintillator hadron calorimeter, each composed of a barrel Samples are generated for B quark masses between 500 and two end cap sections. Muons are measured in gas- and 1000 GeV, in steps of 50 GeV, for each of the six ionization detectors embedded in the steel flux-return yoke distinct combinations of decay products: tWtW, tWbZ, outside the solenoid. Extensive forward calorimetry com- tWbH, bZbZ, bHbZ, and bHbH. The standard model plements the coverage provided by the barrel and end cap final states identical to those listed here are not considered, detectors. The first level of the CMS trigger system, as the rates are negligible relative to the other background composed of custom hardware processors, uses informa- processes. By reweighting events from these different tion from the calorimeters and muon detectors to select samples, an arbitrary combination of branching fractions the most interesting events in a fixed time interval of less to tW, bZ, and bH can be probed. To normalize the than 4 μs. The high-level trigger processor farm further simulated samples to expected event yields, we use cross decreases the event rate from around 100 kHz (the sections computed to next-to-next-to-leading order maximum allowed output from the first level) to around (NNLO) using both HATHOR [39] and TOP++2.0[40]. The 400 Hz, before data storage. A more detailed description of numerical values used for the B quark pair-production cross
112009-2 SEARCH FOR PAIR-PRODUCED VECTORLIKE B … PHYSICAL REVIEW D 93, 112009 (2016) TABLE I. Production cross sections for pp → BB¯ , used to hadrons and photons to construct the various tau lepton normalize simulated signal samples to expected event yields. The hadronic decay modes. The HPS PF tau leptons are required cross sections are computed to NNLO with Top++2.0 [40]. p > 20 jηj < 2 3 to have T GeV and p. ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi. Additionally, we τ ΔR¼ ðΔηÞ2 þðΔϕÞ2 >0 1 MðBÞ (GeV) Cross section σ (pb) require h to be separated by . from electron and muon candidates, where ϕ is the azimu- 450 1.153 thal angle. 500 0.590 550 0.315 The isolation of the lepton candidates (including elec- 600 0.174 trons, muons, and decays of tau leptons to electrons or 650 0.0999 muons) is measured by the activity in a cone of aperture ΔR p 700 0.0585 around the lepton direction at the primary vertex. The T of 750 0.0350 charged particles originating at the primary vertex and the p 800 0.0213 T of the neutral particles and photons are summed in this 850 0.0132 cone (excluding the lepton candidate itself) to obtain the 900 0.00828 isolation variable. Contributions to the neutral hadron and 950 0.00525 photon energy components due to pileup interactions are 1000 0.00336 subtracted. For the electron isolation, this contribution is determined using the jet area technique [46], which computes the transverse energy density of neutral particles sections as a function of mass, generated with TOP++2.0 using the median of the neutral energy distribution in a [402.0, are listed in Table I. sample of jets with p > 3 GeV. In the case of the Finally, to reproduce the LHC running conditions, T muons, the pileup energy density from neutral particles simulated events are reweighted to match the observed is estimated to be half of that from charged hadrons, based distribution of the number of reconstructed primary vertices on measurements performed in jets [47]. The difference per bunch crossing in data. between the isolation algorithms arises because electrons and muons are reconstructed using different techniques. IV. EVENT RECONSTRUCTION Electrons, with large energy deposition in the calorimeters, Events from the LHC collision data or from simulation behave similarly to jets in this respect, while the are reconstructed using the particle flow (PF) algorithm reconstruction of muons relies more heavily on tracking [41,42], which collects information from all subdetectors to information. The isolation value, defined as the energy reconstruct all detected particles in an event. Events are reconstructed in a cone of ΔR ¼ 0.3 (0.4) around an required to have at least one reconstructed vertex. The electron (muon) candidate, is required to be less than p interaction vertex with the largest sum of the transverse 0.15 (0.12) times the electron (muon) T for the lepton to p2 momentum squared T of associated tracks is considered be considered isolated. The lepton identification and the primary interaction vertex. Charged particles originat- isolation conditions remove most of the nonprompt lepton ing from other vertices due to additional inelastic proton- backgrounds. proton collisions within the same bunch crossing (pileup) Particles are clustered to form hadronic jets using the k are rejected. anti- T algorithm [48] with a distance parameter of 0.5. Electron candidates are reconstructed from clusters of Throughout this paper such clusters are referred to as AK5 p > 30 jηj < 2 4 energy deposited in the ECAL matched to charged particle jets. The AK5 jets with T GeV and . are trajectories identified in the tracker [43]. Electrons and selected, with further requirements that the jet has at least p jηj < 2 4 muons with T above 30 GeVand pseudorapidity . two associated tracks and that at least 1% of the jet energy are accepted, excluding electrons with 1.44 < jηj < 1.57, fraction is measured in the calorimeters, to remove poorly in the transition region between the ECAL barrel and end reconstructed jets. Jet energy corrections are applied; these cap. The muon candidates are reconstructed using infor- are derived from simulation and are matched to measure- mation from the tracker and the muon spectrometer. Muon ments in data [49]. candidates are required to have only a small amount of Jets arising from the hadronization of b quarks (b jets) energy deposited in the calorimeters. Further quality are identified using the combined secondary vertex (CSV) requirements are imposed on the muon tracks and the fit b-tagging algorithm [50], which uses information from to the matched segments in the muon detectors [44]. Only tracks and secondary vertices associated with jets to tracks originating from the primary interaction vertex are compute a likelihood-based discriminator to distinguish considered. between jets from b quarks and those from charm or light Electron and muon candidates are reconstructed and quarks and gluons. The b-tagging discriminator returns a identified based on quality selection requirements, while value between 0 and 1, with higher values indicating a τ hadronically decaying tau leptons ( h) are identified using higher probability of the jet to originate from a bottom the Hadron Plus Strip (HPS) [45] algorithm, which relies on quark. A discriminator threshold is chosen which gives a
112009-3 V. KHACHATRYAN et al. PHYSICAL REVIEW D 93, 112009 (2016) b-tagging efficiency of about 70%, with a mistagging rate TABLE II. A summary of analysis channels entering the of about 1.5% for jets originating from light-flavor quarks combination, along with the number of selected leptons, the p – variable used for signal discrimination, and the B quark decay or gluons with T in the range of 80 120 GeV. The b-tagging efficiency is measured in data and simulation, mode providing the best sensitivity for the channel. and corrections are applied to simulated events to account Number of Discriminating Best decay p η for any differences, as a function of T and [51]. The leptons variable mode missing transverse momentum vector is defined as þ S tW Lepton jets 1 T the projection on the plane perpendicular to the beams S tW Same-sign 2 T of the negative vector sum of the momenta of all recon- dilepton structed particles in an event. Its magnitude is referred to as Opposite-sign 2 MðllbÞ bZ E S p T. The quantity T is defined as the scalar sum of the T of dilepton p E ≥3 S tW bZ the jets, lepton T, and T in the event. Multilepton T , H bH At very high Lorentz boost, the products of hadronically All-hadronic 0 T decaying bosons may be merged into a single reconstructed jet. In this regime, the W, Z, or Higgs bosons are identified as jets clustered with the Cambridge-Aachen algorithm Table II summarizes these channels in terms of their [52,53] using a larger distance parameter of 0.8 [54].In defining characteristics: the number of selected leptons, this paper, they are referred to as CA8 jets. For bosons with p the discriminating variable used for limit setting, as well as T above approximately 200 GeV, decay products are the decay mode of the B quark for which the channel is expected to be clustered into a single CA8 jet. Each CA8 jet most sensitive. can be decomposed into constituent subjets using a jet pruning algorithm [55] to resolve those decay products. Lepton þ jets The pruning algorithm removes soft and wide-angle com- A. channel ponents of the jet during a reclustering, and the last iteration Charged leptons from the decays of W and Z bosons of the clustering process is reversed to identify two subjet are selected using the criteria described in Sec. IV and are candidates within each pruned jet. Jet properties such as jet required to be isolated from jets. The lepton trajectories mass, N-subjettiness [56] (used to determine the consis- are also required to have a transverse impact parameter of tency of a jet with N hypothesized subjets), and the mass less than 0.02 cm and a longitudinal impact parameter of drop, defined as the ratio of the most massive subjet to the less than 1 cm in magnitude, relative to the primary vertex. mass of the pruned jet, are used to identify these bosons. The final selection requires events to have exactly one p > 200 The trigger selection for each channel entering the isolated lepton and at least four jets with T , 60, combination can be different, depending on the final state 40, 30 GeV, respectively, of which at least one is a bottom p of interest. For the single-lepton channel, two trigger jet. The minimum number of jets and the jet T require- selections are utilized: either a single electron with ments are selected to enhance sensitivity to the BB¯ signal p > 27 p > 40 T GeV or a single muon with T GeV. For with B → tW decays. To further suppress the SM back- both of the lepton pair channels, as well as the multilepton grounds, we use the centrality, C, defined as the scalar p channel, three trigger algorithms are used for final states sum of the T of the jets divided by the scalar sum of the including two electrons, two muons, or one electron and jet energies, requiring C>0.4. We require events to have E > 20 one muon. In each of these dilepton trigger algorithms, T GeV. Corrections due to differing trigger, lepton, p p > events are selected if the highest- T lepton has T and b jet identification efficiencies in data and simulation 17 p GeV and the second-highest T lepton has are applied to simulated events. p > 8 T GeV. No charge requirement is applied in the Events are divided into categories containing 0, 1, or ≥ 2 trigger selection, allowing these trigger algorithms to be tagged hadronically decaying W, Z, or Higgs bosons using used in all three channels with two or more leptons. Finally, the CA8 jets. The identification criteria for these signatures p the all-hadronic channel uses a trigger algorithm requiring require the CA8 jet to have T greater than 200 GeV and to p p > the total scalar T sum of reconstructed jets (with T be matched to an AK5 jet. The AK5 jets matched to CA8 30 GeV and jηj < 3.0) in the detector to be greater than jets are then excluded from b-tagging requirements. The 750 GeV. The offline requirements for each channel of the two subjets identified with the pruning algorithm [55] are analysis are designed to be fully efficient given these trigger required to have an invariant mass between 50 and requirements. Differences in the trigger selections used 150 GeV, to be consistent with a W, Z, or Higgs boson. between analysis channels lead to small differences in the To further reduce SM backgrounds, the mass drop is total amount of integrated luminosity utilized in each required to be less than 0.4. The efficiency of this channel. heavy-boson tagging algorithm is approximately 50% The details of the event selections for each individual [57], and correction factors are applied to compensate analysis channel are given in the following subsections. for efficiency differences between data and simulation. To
112009-4 SEARCH FOR PAIR-PRODUCED VECTORLIKE B … PHYSICAL REVIEW D 93, 112009 (2016) -1 discriminate the B quark signal from the expected back- 19.7 fb (8 TeV) S grounds, the T distribution is used. CMS M(B) = 450 GeV Simulation 102 M(B) = 600 GeV B. Same-sign lepton pair channel M(B) = 700 GeV M(B) = 750 GeV Events enter the same-sign (SS) dilepton channel if they contain two leptons (ee, μμ, and eμ) having the same electric charge. Events containing an additional recon- 10 structed electron, muon, or tau lepton candidate are removed from the final selection. This channel is optimized for B → tW decays but maintains some sensitivity for bZ Events/ 50 GeV and bH decays. In events with a top quark and a W boson, 1 high hadronic activity is expected in addition to the lepton 0 200 400 600 800 1000 pair, and therefore events are included in the signal region only if they contain four or more jets in addition to the M(llb) [GeV] lepton pair. B B FIG. 2. The reconstructed mass of the quark candidate in the In this channel the quarks are not fully reconstructed. opposite-sign lepton pair channel, using the invariant mass of the S Instead, to discriminate signal from background, the T dilepton and b jet in simulated events. E > 30 distribution is used. Events with T GeV are catego- S – – – – rized into five T bins (0.2 0.4, 0.4 0.6, 0.6 0.8, 0.8 1.2, S > 1 2 The p selection criteria are chosen such that the triggers and T . TeV) for each of the dilepton channels. T are fully efficient on these events. We sort multilepton events into exclusive categories C. Opposite-sign lepton pair channel based on the number of leptons, lepton flavor, and relative B Z S In this channel, optimized for the decaying to a charges, as well as the kinematic quantity T. First, we boson and b quark, the Z boson candidates are recon- separate events containing hadronically decaying tau lep- structed from pairs of electrons or muons having opposite tons, as their reconstruction is less efficient and this results electric charge, with the identification and isolation criteria in lower-purity categories. p previously described. The two highest T leptons of the Next, we classify each event in terms of the maximum same flavor but opposite charge are used. The pairwise number of opposite-sign and same-flavor (OSSF) lepton invariant mass of these two objects, MðllÞ, where l pairs that can be made by using each lepton only once. For represents an electron or a muon, is required to be in example, both μþμ−μ− and μþμ−e− contain only one pair the range of 60–120 GeV, consistent with lepton pairs of OSSF leptons and are denoted OSSF1, μþμþe− contains originating from a Z boson decay. Furthermore, the Z → no such OSSF pair and is denoted OSSF0, while μþμ−eþe− lþl− p ðllÞ > 150 candidates are required to have T GeV. contains two such pairs and is denoted OSSF2. Thus Events are further required to have at least one b jet with orthogonal categories of events are defined that contain p > 80 Z T GeV. The requirements are optimized to select 0, 1, or 2 OSSF lepton pairs. These categories are further bosons and b jets originating from the decay of a heavy B divided, depending on whether or not a lepton pair has quark (>500 GeV), where the decay products are MðllÞ in the range 75–105 GeV, consistent with a Z boson p expected to have large T. The kinematic properties of decay. Same-flavor dilepton pairs consistent with low-mass one B quark are reconstructed from the Z boson and the resonances are excluded from the search region with a p b MðllbÞ B highest- T jet, with used to discriminate the requirement of MðllÞ > 12 GeV. quark signal. The invariant mass distributions of the B MðBÞ quark candidates for different are shown in Fig. 2.A E. All-hadronic channel signal peak can be identified over a continuous falling B background. This reconstruction strategy allows the other The final channel contributing to the search for quarks B quark from the BB¯ quark-antiquark pair to decay into includes events reconstructed in an all-hadronic topology, bH B bZ, bH,ortW. to increase sensitivity to the decay mode of the vectorlike quark. The search in this channel is designed for Higgs bosons decaying to a pair of b quarks. Because of D. Multilepton channel the high mass of the B quark, the Higgs boson is expected Events in this channel must have at least three leptons, to be highly Lorentz boosted; consequently the b quarks consisting of electrons, muons, or tau leptons decaying into from the Higgs boson decay have a small angular separa- τ p fully hadronic states ( h). The highest T (leading) electron tion. To reconstruct this signature, jet substructure algo- p > 20 or muon is required to have T GeV, and the rithms are used. The Higgs boson is reconstructed using a p > 10 p > 300 subleading leptons are required to have T GeV. single CA8 jet. This jet is required to have T GeV.
112009-5 V. KHACHATRYAN et al. PHYSICAL REVIEW D 93, 112009 (2016) The pruned jet mass is required to be in the range uncertainties, while the multijet normalization is allowed to 90
112009-6 SEARCH FOR PAIR-PRODUCED VECTORLIKE B … PHYSICAL REVIEW D 93, 112009 (2016) 19.2 fb-1 (8 TeV) 19.8 fb-1 (8 TeV) 105 105 CMS Data tt+jets CMS Data tt+jets μ 4 e+Jets, 0 V-tags W+jets Z+jets 4 +Jets, 0 V-tags W+jets Z+jets 10 Single t tVV+jets 10 Single t tVV+jets Diboson QCD Diboson QCD 3 3 10 Uncertainty B → tW 100% 10 Uncertainty B → tW 100% M(B) = 0.75 TeV M(B) = 0.75 TeV 102 102
Events / bin 10 Events / bin 10
1 1
10-1 10-1
2 ST [TeV] 2 ST [TeV] 1 1 0 0
Pull -1 Pull -1 -2 -2 0.5 1.0 1.5 2.0 2.5 0.5 1.0 1.5 2.0 2.5
ST [TeV] ST [TeV] 19.2 fb-1 (8 TeV) 19.8 fb-1 (8 TeV) 105 105 CMS Data tt+jets CMS Data tt+jets μ 4 e+Jets, 1 V-tag W+jets Z+jets 4 +Jets, 1 V-tag W+jets Z+jets 10 Single t tVV+jets 10 Single t tVV+jets Diboson QCD Diboson QCD 3 3 10 Uncertainty B → tW 100% 10 Uncertainty B → tW 100% M(B) = 0.75 TeV M(B) = 0.75 TeV 102 102
Events / bin 10 Events / bin 10
1 1
10-1 10-1
2 ST [TeV] 2 ST [TeV] 1 1 0 0 Pull -1 Pull -1 -2 -2 0.5 1.0 1.5 2.0 2.5 0.5 1.0 1.5 2.0 2.5
ST [TeV] ST [TeV] 19.2 fb-1 (8 TeV) 19.8 fb-1 (8 TeV) 105 105 CMS Data tt+jets CMS Data tt+jets μ 4 e+Jets, 2 V-tags W+jets Z+jets 4 +Jets, 2 V-tags W+jets Z+jets 10 Single t tVV+jets 10 Single t tVV+jets Diboson QCD Diboson QCD 3 3 10 Uncertainty B → tW 100% 10 Uncertainty B → tW 100% M(B) = 0.75 TeV M(B) = 0.75 TeV 102 102
Events / bin 10 Events / bin 10
1 1
10-1 10-1
2 ST [TeV] 2 ST [TeV] 1 1 0 0 Pull -1 Pull -1 -2 -2 0.5 1.0 1.5 2.0 2.5 0.5 1.0 1.5 2.0 2.5
ST [TeV] ST [TeV] S ≥2 V þ þ FIG. 3. The T distributions in the 0, 1, and -tag categories in the electron jets channel (left) and muon jets channel (right). The uncertainty bands shown include statistical and all systematic uncertainties, added in quadrature for each single bin. The horizontal bars on the data points indicate the bin width. The difference between the observed and expected events divided by the total statistical and systematic uncertainty of the background prediction (pull) is shown for each bin in the lower panels.
112009-7 V. KHACHATRYAN et al. PHYSICAL REVIEW D 93, 112009 (2016)
Npp Npf Nfp Nff
0 1−1 ð1 − p1Þð1 − p2Þ p1ð1 − p2Þð1 − p1Þp2 p1p2 B C B ð1 − p1Þð1 − f2Þ p1ð1 − f2Þð1 − p1Þf2 p1f2 C ¼ NLL NTL NLT NTT B C : ð1Þ @ ð1 − f1Þð1 − p2Þ f1ð1 − p2Þð1 − f1Þp2 f1p2 A
ð1 − f1Þð1 − f2Þ f1ð1 − f2Þð1 − f1Þf2 f1f2
S After using this method to estimate the background from the contributions containing nonprompt leptons, the T S distribution is used to discriminate signal events from background. The T distributions for the dielectron, dimuon, and electron-muon channels are shown in Fig. 4.
19.6 fb-1 (8 TeV) 19.6 fb-1 (8 TeV)
→ → B tW 100% B tW 100% 104 B→ tW 100% B→ tW 100% 104 CMS M(B)=450 GeV M(B)=600 GeV CMS M(B)=450 GeV M(B)=600 GeV B→ tW 100% μμ ee M(B)=750 GeV Data B→ tW 100% M(B)=750 GeV Data 3 3 Prompt, Prompt Prompt, Non-prompt 10 10 Prompt, Prompt Prompt, Non-prompt Non-prompt, Non-prompt Charge Mis ID
Uncertainty on Bkg 2 Non-prompt, Non-prompt Uncertainty on Bkg 102 10
10 10 Events/ bin Events/ bin 1 1
10-1 10-1
2 2 0 0 Pull Pull -2 -2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
ST [TeV] ST [TeV]
19.6 fb-1 (8 TeV)
B→ tW 100% B→ tW 100% M(B)=450 GeV M(B)=600 GeV 4 CMS 10 μ B→ tW 100% e M(B)=750 GeV Data Prompt, Prompt Prompt, Non-prompt 3 10 Non-prompt, Non-prompt Charge Mis ID
Uncertainty on Bkg 102 Events/ bin
10
1
10-1
2 0 Pull -2 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
ST [TeV]
FIG. 4. The ST distributions used for signal discrimination in the same-sign dilepton channel. The distribution is shown for the three dilepton categories used: dielectron (top left), dimuon (top right), and electron-muon (bottom). The horizontal bars on the data points indicate the bin width. The difference between the observed and expected events divided by the total statistical and systematic uncertainty of the background prediction (pull) is shown for each bin in the lower panels.
112009-8 SEARCH FOR PAIR-PRODUCED VECTORLIKE B … PHYSICAL REVIEW D 93, 112009 (2016) -1 C. Opposite-sign lepton pair channel 19.7 fb (8 TeV) 1 Z→ e+e- The main background in the opposite-sign dilepton Data CMS Z þ channel is from the inclusive jets process (93%), with 0.8 the remaining fraction due to tt¯ þ jets and diboson proc- D B esses. Instead of using simulated events, control samples 0.6 C A in data are used to predict the normalization and shape of MðllbÞ the spectrum of the background. The background 0.4 is estimated from data using an ABCD method to predict L J the bZ invariant mass distribution MðllbÞ in the signal CSV discriminator value 0.2 region, labeled B, using control regions A, C, and D. The K I classification of the events into region A, B, C,orD is made 0 using event selection variables that are largely uncorrelated 123456 for the background samples. The two variables chosen are Njets N b -1 the number of jets, jets, and the -tagging discriminator of 19.7 fb (8 TeV) p Z 1 Z→ μ+μ- the highest T jet in the event. With an identified boson CMS decaying leptonically, there will be at least two jets Data 0.8 expected in signal events, providing discrimination power D B against SM background processes. N ¼ 1 N > 1 0.6 C A The selections used are (i) either jets or jets and (ii) events with the leading jet either passing or failing the b-tagging discriminator threshold (>0.679). 0.4 N b L J These selections divide the jets vs -tagging discriminator plane into the four regions shown in Fig. 5. The signal CSV discriminator value 0.2 K I contribution outside the signal region B was found to be 0 negligible using simulated event samples. Under the 123456 N hypothesis of complete noncorrelation between jets and Njets the b-tagging discriminator, the number of background events in the signal region would be given by FIG. 5. The event distribution in the plane of Njets vs the b- NB ¼ NA × ND=NC, where NX is the number of events tagging discriminator value, used to define the regions A, B, C, and D for the opposite-sign dilepton Z → eþe− (left) and Z → in the corresponding region. However, residual correlation þ − between the two variables is present and must be taken into μ μ (right) channels. The region B is the signal region, while the I J K L account in the background estimation procedure. The others constitute the control regions. The regions , , , and are used for estimation of systematic uncertainties. All other correlation is measured from data using an alternative selection criteria used to select the B quark candidates have been set of control regions defined using the following criteria: applied. The area of each bar is proportional to the number of (i) N ¼ 1 or N > 1 and (ii) 0.244