arXiv:0805.3580v2 [astro-ph] 6 Aug 2008 rdcinwudb upiei edo nexplana- an of need of this detection in from a surprise deviation Second, a any tion. and be slowly would chang- very prediction be scale only should with slope inflation ing primordial Moreover, the pre- spectrum. that some blue predicts and slope, some and actual shape red the potential dicting on inflationary differ a models the gives inflationary of spectrum first, measure power effort: primordial direct multi-pronged the a as of done measurement At is inflation. this of moment, models the actu- different might between that distinguish be observables ally to constraining seem is into effort which put of of lot being both a Consequently, 10], observations. 9, ini- by 8, confirmed a of 7, predict spectrum [6, scale-invariant fluctuations generically tial nearly models and Inflationary in universe data-sets successful flat [5]). distinct tremendously very e.g. of been number (see has large which very inflation a 4], of describing cosmol- 3, that current is 2, in paradigm [1, standard topics The debated research. hotly ogy most the of one seprtcmdl 1,1,1,1] ie rminflation- from differ 14], such 13, inflation, 12, to [11, Alternatives models [11]. ekpyrotic occurs scale as inflation energy predict low major a that a at models out rule inflationary and of de- inflation effectively class of will scale energy Universe, the early termine the gravita- from as waves interpreted tional if background, microwave cosmic h rgno tutr omto nteuies is universe the in formation structure of origin The 1 osrit nlclpioda o-asint rmlarg from non-Gaussianity primordial local on Constraints eklyCne o omlgclPyis hsc Departm Physics Physics, Cosmological for Center Berkeley Anˇze Slosar, flresaedt optalmto h au of value the on predictions limit these a use put par we to In Next data halos. scale suppressed. large of is of history effect t and merging the violated as find be we such We may properties, mass. assumption other on this on depends that only argue depen bias to only halo formalism red it assuming first function, that we mass show Here of and scale. formalism with split growing peak-background is amplitude whose bias, cuaindpnsol nhl as egtalmtof limit a get we mass, halo on only depends occupation inlfo h DSpooercqaa ape ftelatte the If to sample. weaken our datasets, limits photometric of the SDSS range wide the a from use signal we While confidence. (99.7%) urn iisfo h MP5ya nlss ihn evidenc no with analysis, year 5 WMAP the from limits current ASnmes 88.k 98.80.Cq an 98.80.Jk, observationally numbers: both PACS pursued further scale th be indicate large should results against and our CMB tested models, to thoroughly formation be and to quasar needs method the eetwr a hw httelclnnGusaiyparame non-Gaussianity local the that shown has work Recent 6 .INTRODUCTION I. arneBree ainlLbrtr,nvriyo Cali of Laboratory,University National Berkeley Lawrence 4 hsc eatet nvriyo aiona ekly C Berkeley, California, of University Department, Physics 3 nttt o hoeia hsc,Uiest fZrc,Z Zurich, of University Physics, Theoretical for Institute 1 hitpe Hirata, Christopher B rneo nvriy rneo,NwJre 84,USA 08544, Jersey New Princeton, University, Princeton md oaiaini the in polarization -mode nvriyo aiona eklyClfri 42,USA 94720, California Berkeley California, of University 2 atc / 3-3 aaea aiona915 USA 91125, California Pasadena, 130-33, M/C Caltech − 5 1( 31 eateto srpyia cecs etnHall, Peyton Sciences, Astrophysical of Department − 96) f < NL 2 Dtd uut6 2008) 6, August (Dated: Uroˇs Seljak, < 7 +6.Teelmt r oprbet h strongest the to comparable are limits These (+96). +70 f NL ny tnadprmtrzto fteprimordial potential the scale-independent the of so-called of parameterization the value is standard local non-Gaussianity the A on mean- type, depends only. local it the that to of ing is accessible non-Gaussianity models not that the of and predict departures Most small constraints. the observational very current Stan- that the are predicts Gaussianity conditions. inflation from initial field single in dard non-Gaussianity is simplest paper, the 18]. out 17, 16, rule of [15, would inflation detected mode of if main models subdom- a and the at level present as inant be out could these ruled formation, structure now while be detected. per- isocurvature can turbations; are generate they could waves models Thus, multifield gravitational Third, primordial negligible. wave always if gravitational is falsified expected CMB the in that signal in predictions ary eto oteptnil[9 20]: [19, cor- potential quadratic the a to includes rection one which in parameterization, inrno edand field random sian orcin yia au of value typical A correction. where naini fteodro lwrl aaee n thus and parameter roll slow 10 of order order of the of is inflation h rmrilfil utain(sue ob Gaussian be to (assumed fluctuation field between primordial transformation the nonlinear from contribution the ,4 3, hncmiigaldt suigta halo that assuming data all combining When . orhdrcin n h n efcso nthis in on focus we one the and direction, fourth A n n arneBree ainlLaboratory, National Berkeley Lawrence and ent hre Ho, Shirley − φ tti sacmeiiemto relative method competitive a is this at 9( 29 stepioda oeta sue ob Gaus- a be to assumed potential primordial the is r oelda eetmresthen mergers recent as modelled are r iua,i h ii frcn mergers recent of limit the in ticular, rv hsrsl ihntecnetof context the within result this erive so h supino universality of assumption the on ds ona eklyC 42,USA 94720, CA Berkeley fornia, obndrsl sdmntdb the by dominated is result combined esaedpnetba ildepend will bias dependent scale he tutr iuain ihrealistic with simulations structure theoretically. d ncnucinwt compendium a with conjunction in − fapstv inlin signal positive a of e − hnueetne Press-Schechter extended use then ter 2 65) 2] u hsi ieyt esapdby swamped be to likely is this but [21], 5 f lfri 42,USA 94720, alifornia NL rc,Switzerland urich, n ihlPadmanabhan Nikhil and f < nue scale-dependent a induces = Φ NL f NL < φ cl structure scale e 7 +3 t95% at (+93) +70 ecie h mltd fthe of amplitude the describes + f f NL NL φ f 2 NL o tnadso roll slow standard for , While . 6 f (1) NL 2 if it started from the pure Bunch-Davies vacuum) and the large scale structure (LSS) of the universe. The main the observable (such as CMB temperature fluctuation), problem is that non-linearities add their own phase cor- which generically gives fNL of order unity (see e.g. [22]). relations between Fourier modes that can very quickly Models where fNL is significantly higher include multi- swamp the primordial signal. Historically, the focus was field inflation [23, 24, 25] as well as models where non- on the mass function of very massive virialised structures Gaussianity arises during reheating [26, 27] or preheating [42, 43, 44, 45, 46]. The motivation was the notion that [28, 29, 30, 31]. Non-slow roll inflation models may also very massive virialised objects correspond to very rare lead to a significant non-Gaussianity, but are constrained peaks in the initial density field and therefore their num- because they may not lead to enough inflation in the first ber density should be an exponentially sensitive probe −5 place. Note that since φ 10 even fNL 100, compa- of those peaks at the high mass end, allowing a unique rable to the present limits,∼ generates non-Gaussian∼ sig- probe of the primordial peak structure. While results natures only at a 10−3 level, so the non-Gaussian signal generally agree with this picture, the observational task one is searching for is very small. Overall, any detection is made very difficult by the low number statistics of such of fNL above unity would be a major surprise in need of objects, uncertainties in the mass-observable relation and an explanation within the inflationary paradigm. its scatter, and selection effects. Very recently, non-Gaussianity in ekpyrotic models has A different method has been recently proposed by also been studied with the results suggesting that non- Dalal et al. [47]. By extending the classical calculation Gaussianity in these models is generically large [32] and for calculating the clustering of rare peaks in a Gaus- often correlated with the spectral slope ns [33, 34], in sian field [48] to the fNL-type non-Gaussianity, they have the sense that the redder the spectrum the higher non- shown that clustering of rare peaks exhibits a very dis- Gaussianity one may expect. Thus, non-Gaussianity is tinct scale-dependent bias on the largest scales. The an- emerging as one of the strongest discriminators among alytical result has been tested using N-body simulations, the models attempting to explain the origins of structure which confirm this basic picture. in the universe. The purpose of this paper is two-fold, first to provide Traditionally, the cleanest method for detecting the a better theoretical understanding of the effect and the non-Gaussianity has been to measure the bispectrum or range of its applicability, and second, to apply it to the 3-point function of the Cosmic Microwave Background real data. We begin in Section II by providing a new, (CMB). The initial 3-year Wilkinson Anisotropy Probe more general, derivation of the nonlinear bias induced by (WMAP) result gave a limit on fNL of 54 0 at 99.5% significance, with modified. 2σ range 27 < fNL < 147 [? ]. This result is surprising We then use this formalism and apply it to a wide se- as, taken at its face value, it implies a very non-standard lection of publicly available large scale structure data. In inflation or something else all together. Interestingly, a Sections III–IV we discuss the data, methodology, main similar result has been obtained by Jeong & Smoot using results and systematic issues, including application of the one-point distribution function of the CMB [38]. The Section II to the derived observational constraints. In WMAP 5-year results also prefer positive fNL from their Section V we discuss the results and present some direc- bispectrum analysis, although zero fNL is within its 2- tions for the future. sigma significance [5], namely 9

B. Application to universal mass functions C. Halo merger bias

We now apply Equation (14) to halo abundance models The above statements apply to biasing of objects whose with a universal mass function. Universal mass functions HOD depends only on the mass of the halo. However this are those that depend only significance ν(M), i.e. may not be true for the ; in particular there are many lines of evidence that suggests that quasar activity d ln ν is triggered by recent mergers [53, 54, 55]. Therefore we n(M)= n(M, ν)= M −2νf(ν) , (15) d ln M should consider the standard bias b and large-scale bias ∆b for recent mergers, which is in general not the same 2 2 as the bias of all haloes of the final mass [56, 57]. where we define ν = δc /σ (M) and f(ν) is the fraction of mass that collapses into haloes of significance between This section considers the simplified case in which ν and ν + dν. Here δc = 1.686 denotes the spherical quasars are triggered by a merger between a halo of mass collapse linear over-density and σ(M) is the variance of M1 and one of mass M2, after which the quasar lives for a −1 time t H in the new halo of mass M0 = M1 + M2. the density field smoothed with a top-hat filter on the Q ≪ scale enclosing mass M. Universality of the halo mass This requires us to understand the dependence of the function has been tested in numerous simulations, with number of recent mergers on amplitude, which we will results generally confirming the assumption even if the again express as σ8. Unlike the case of the mass func- specific functional forms for f(ν) may differ from one tion there are no accurate fitting formulae to the merger another. rate that have been tested against N-body simulations The significance of a halo of mass M depends on for a variety of cosmologies. Therefore we will take two approaches here. The first will be to consider the recent the background density field δl, so one can compute merger probabilities from the extended Press-Schechter ∂n/∂δl(x) and insert it into Equation (4) [51], (ePS) formalism. With ePS, we will find that for haloes of a given mass the probability of being a recent merger 2 d −1 b =1 ν ln[νf(ν)]. (16) is proportional to σ8 . In this picture, the bias of the − δc dν quasars in this case is the same as the halo bias b(M0), but the f -induced bias ∆b is less for recent mergers (This is > 1 for massive haloes since the last derivative NL than for all haloes of mass M . However there is no rig- is negative in this case.) 0 orous error bound on ePS calculations, so it is desirable The derivative ∂ ln n/∂ ln σ8 appearing in Equation to have an independent way to get the dependence of (14) can be obtained under the same universality assump- merger histories on σ8. We therefore consider a second tion. In fact, the calculation is simpler. The definition method of getting the recent merger probability during of the significance implies ν σ−2, so that d ln ν/d ln M ∝ 8 the matter-dominated era by using redshift scaling rela- does not depend on σ8 at fixed M. Therefore n νf(ν) ∝ tions from N-body simulation results. The latter method and −1 confirms the ePS σ8 relation. ∂ ln n ∂ ln ν ∂ ln[νf(ν)] d = = 2ν ln[νf(ν)]. (17) ∂ ln σ8 ∂ ln σ8 ∂ ln ν − dν 1. Extended Press-Schechter calculation Thus by comparison to Equation (16), we find: We will work in the ePS formalism in which the merger 2 history seen by a given particle is con- Ωm H0 ∆b(M, k)=3fNL(b 1)δ (18) trolled by the linear density field δ(M) measured to- − c k2T (k)D(z) c   day, spherically smoothed on a mass scale M in La- grangian space [58]. At time t a particle is inside a This is equivalent to the previously derived expressions halo of mass M if δ(M ′) > ω(t) for any M ′ > M, ≥ [47, 52]. However, it is more general, because it is in- where ω(t) = δcD(t0)/D(t) is the ratio of the threshold dependent of the form of f(ν). It is therefore valid for over-density for collapse δc to the growth function D(t). the Press-Schechter mass function as well as for the more In this picture it is convenient to replace the smooth- accurate Sheth-Tormen function. It is also valid for any ing scale M with the variance of the density field on that object that obeys a halo occupation distribution (HOD) scale, S(M)= δ(M)2 . The smoothed density field δ(S) that depends only on the halo mass, N (M), since in then follows a randomh i walk as a function of S; this ran- h i this case both b and ∆b are linearly averages of their dom walk is usually assumed to be Markovian because values for individual masses: (i) each Fourier mode is independent for Gaussian initial conditions, and (ii) one neglects the difference between b(M)n(M) N (M) dM b = h i , (19) smoothing with a top-hat in Fourier space (in which each n(M) N (M) dM R mode is independent) and the physically motivated top- h i hat in real space. and similarly for ∆bR. In this formalism we would like the probability that a 5 halo of mass M0 at time t was actually of mass M1 at an (14) to recent mergers we need only understand how the earlier time t t and experienced a merger with a halo of number density of recent mergers varies with σ8. Since − Q mass M2 = M0 M1. As argued by Lacey & Cole [58], the number density of recent mergers is the product of the − the probability for a particular dark matter particle in number density of haloes of mass M0 and the probability this halo to have been in an object of mass

ω(t tQ) ω(t) From Equations (16) and (17), the first term evaluates Pparticle(

1 tQ ω˙ M0 dS1 P (M1 M0) dM1 = | | dM1. | √ (S S )3/2 M dM 2. Scaling from N-body simulations 2π 1 0 1 1 − (22)

This formula was first derived by Lacey & Cole [58], The ePS formalism predicts that the probability of a albeit in a slightly different form [they computed halo of mass M0 being a recent merger, P (M1 M0)dM1, −1 | P (M0 M1)]. It has two well-known deficiencies. One is proportional to σ . While the qualitative result that | 8 is that it is not symmetric under exchange of M1 and massive haloes are more likely to be recent mergers in M2, especially for extreme mass ratios [59, 60]. Another low-σ8 than high-σ8 cosmologies is supported by N-body is that it does not contain merger bias in the Gaussian simulations [62, 63], the quantitative validity of the 1 − case, i.e. the bias of mergers is simply b(M0) [56]. This exponent does not appear to be well-tested. Neverthe- is because of the assumption that the trajectory δ(S) is less, in the matter-dominated era it is possible to deter- a Markovian random walk, which is not quite correct. mine the exponent from the redshift dependence of the For example, the analytic explanations for merger bias merger rate. of high-mass haloes [61] are based on non-Markovian be- The key is that there is no preferred timescale in haviour due to the fact that the physically meaningful the Einstein-de Sitter ; scale factor and linear smoothing in real space does not correspond to a sharp growth factor are both proportional to t2/3 and therefore cutoff at some kmax. The corresponding merger history the rescaling of initial amplitude is mathematically iden- bias is based on the correlation coefficient γ between tical to rescaling of time. Hence two N-body simulations δ(S) and dδ(S)/dS; one would have γ = 0 if one used whose initial conditions differ only by the normalization the Fourier-space rather than real-space top-hat filter. of the primordial perturbations will evolve through ex- This subtlety is however not required to understand why actly the same sequence of halo formation and mergers, the merger bias in fNL cosmologies is significant on large except that the scale factor of each merger is re-scaled −1 scales. according to amerger σ8 . For our application we would also need to integrate Note that for ΛCDM∝ cosmologies this correspondence over the range of masses M1 that define a major merger, between scaling time and scaling normalisation breaks as but since the result does not actually depend on this we the amplitude of fluctuations at the onset of cosmic ac- will not explicitly write it. In order to apply Equation celeration will be different for different initial amplitudes. 6

Therefore results of this rescaling do not apply to the low- These results provide an independent calculation of est redshifts, where the becomes important, ∂ ln P (M1 M0)/∂ ln σ8 that is on a more solid footing but since in our analysis the only sample where recent than ePS.| The agreement of the logarithmic derivatives mergers may be relevant is the quasar sample, which has at the 10% level is remarkable, especially given that a redshift distribution peaking at z 1.7, this is a minor ePS does∼ not do so well at predicting the absolute merger deficiency. ∼ rate. We want to test this relation from the merger his- tory statistics [64] in the Millenium Simulation [65]. The recent merger probability, which we have denoted 3. Summary P (M1 M0)dM1, is related to the merger rate B/n de- fined by| Ref. [64] by: We can write a generalised expression for the fNL in- duced scale dependent bias as B(M0, ξ) dz ∂M1 P (M1 M0)dM1 = tQ dξ, (25) | n(M0) dt ∂ξ 2 Ωm H0 ∆b(M, k)=3fNL(b p)δc , (31) where ξ > 1 is the mass ratio of the progenitors. Here − k2T (k)D(z) c   B/n is the merger rate per final halo of mass M0 per unit redshift per unit ξ. The derivative with respect to σ8 is where 1 1, was even weaker if one split on formation redshift instead although in some cases (galaxy mass haloes, 3:1 mergers) of concentration. Ref. [69] found almost no dependence the actual scaling is closer to B/n (1 + z)0.1. These re- on formation redshift in the relevant range δ /σ 2, but ∝ c ≥ sults suggest the scaling ∂ ln P (M1 M0)/∂ ln σ8 is in the their lowest quintile of concentration is 25% more bi- range of 1 (the ePS prediction) to| 1.1. If we plug this ased than the mean b(M) and even larger∼ effects are seen − − into Equation (23) then one derives ∂ ln nmerger/∂ ln σ8 if one splits by substructure. Ref. [57] found that the bias equal to δ (b 1.6) (for 1) or δ (b 1.65) (for 1.1). for recent major mergers was enhanced by 5% relative c − − c − − ∼ 7 to b(M). It is clear that the strength of this effect de- under consideration in this paper these data sets are not pends strongly on the second parameter used, but in the directly sensitive to the fNL parameter. However, they case of the definitions related to the mergers, the devia- are needed to constrain the basic cosmological model and tion in b(M) is significantly less than the corrections to thus the shape and normalisation of the matter power the ∆b, which replaces b 1 with b 1.6. We will there- spectrum. The large scale structure data discussed bel- − − fore assume that the theoretical uncertainties are fully low are thus simultaneously able to fit for fNL and other absorbed by the expression in Equation (31). auxiliary parameters. Finally, while there is ample evidence that quasar ac- Most of our large-scale structure data is drawn from tivity is often triggered by mergers, it is probably not the Sloan Digital Sky Survey (SDSS). The SDSS drift- the case that all quasars live in recently merged halos. scans the sky in five bands (ugriz) [77] under photomet- Therefore the true value of p for quasar population lies ric conditions [78, 79] using a 2.5-meter optical telescope between 1 and 1.6, since the true population of host ha- [80] with 3 degree field of view camera [81] located in New los lies somewhere between randomly selected halos and Mexico, USA [78]. The photometric and astrometric cal- recently merged halos. Therefore, our limits with p 1.6 ibration of the SDSS and the quality assessment pipeline should be viewed as a most conservative reasonable∼ op- are described by Refs. [82, 83, 84, 85, 86]. Bright galax- tion. ies [87], luminous red (LRGs) [88], and quasar candidates [89] are selected from the SDSS imaging data for spectroscopic follow-up [90]. This paper uses imag- III. METHOD AND DATA ing data through the summer of 2005, which was part of SDSS Data Release 6 (DR6) [91], and spectroscopic data through June 2004 (DR4) [92]. After reviewing and extending the theoretical formal- The requirement of large scales and highly biased trac- ism we turn to its application to the real data. We would ers leads us to explore several different large scale data like to use Equation (31) to put constraints on the value sets, in particular SDSS LRGs, both spectroscopic [93] of the f parameter. Since the effect is significant only NL and photometric [94] samples, and photometric quasars on very large scales we need to use the tracers of large (QSOs) from SDSS [95]. In all these cases we use auto- scale structure at the largest scales available. In addi- correlation power spectrum, which is sensitive to f 2 . tion, the effect scales as b p, where p is typically 1 but NL These datasets can be assumed to be statistically inde- can be as large as 1.6 in special− cases, hence we need very pendent. As explained below, when spectroscopic and biased tracers of large scale structure to measure the ef- photometric LRGs are analysed together, we take care fect. We discuss below our choice of observational data. exclude those photometric redshift bins that have sig- Finally, the effect changes the power on large scales and nificant overlap with spectroscopic sample. Our quasars in principle this can also be achieved with a change in have typical redshifts of z 1.5 – 2 and only 5% overlap the primordial power spectrum, although this degener- with LRGs due to photoz∼ errors. Moreover, they are in acy exists only in the presence of one tracer: with two the Poission noise limited regime, so overlap in volume is tracers with different biases one can separate f from NL less relevant. This is discussed in more detail in [95]. In the changes in the initial power spectrum. Here we as- addition, we also use cross-correlation of all these sam- sume that the basic model is one predicted by the sim- ples, as well as nearby galaxies from the 2-micron All-Sky plest models of structure formation and we do not allow Survey (2MASS) [37] and radio sources from the NRAO for sudden changes in the power spectrum beyond what VLA Sky Survey (NVSS) [96] with CMB, as analyzed in is allowed by the standard models, which assume the Ho et al. [95]. Since this is a cross-correlation between power spectrum slope n to be constant. We use Markov s the galaxies and matter (as traced by the ISW effect), Chain Monte Carlo method to sample the available pa- the dependence is linear in f . rameter space using a modified version of the popular NL public package cosmomc[70]. In addition to fNL, we fit for the standard parameters of the minimal concordance 2 2 A. Spectroscopic LRGs from SDSS cosmological model: ωb =Ωbh , ωCDM =ΩCDMh , θ, τ, ns and log A, where θ is the ratio of the sound horizon to the angular diameter distance at decoupling (acting as a We use the spectroscopic LRG power spectrum from proxy for Hubble’s constant), τ is teh optical depth and Tegmark et al. [93], based on a galaxy sample that covers A is the primordial amplitude of the power spectrum. 4000 square degrees of sky over the redshift range 0.16 z 0.47. We include only bins with k 0.2h/Mpc. We≤ All priors are wide enough so that they do not cut the ≤ ≤ posterior at any plane in the parameter space. model the observed data as We always use standard cosmological data as our base- 1+ Qk2 P (k) = [b + ∆b(k,f )]2 P (k) , (32) line model. These include the WMAP 5-year power- observed NL lin 1+ Ak spectra [71, 72] and additional smaller-scale experiments (VSA, CBI, ACBAR) [73, 74, 75], as well as the super- where the last term describes small-scale non-linearities novae measurements of luminosity distance from the Su- [97], with Q treated here as a free parameter (bound be- −1 pernova Legacy Survey (SNLS) [76]. For values of fNL tween zero and 40) and A = 1.4h Mpc. For realistic 8

2 values of fNL and Q, the non-Gaussian bias ∆b is present where ∆ (k) is the linear matter power spectrum today. only at large scales and the Q-term is present only at The window functions are given by small scales; there is no range of scales over which both D(r) dN are important. We explicitly confirmed that there is no W (k) = (b + ∆b) j (kr) dr and ℓ D(0) dr ℓ correlation between Q and fNL present in our MCMC Z chains and we let the data to determine the two param- D(r) dN 2ℓ2 +2ℓ 1 W r(k) = Ω0.6(r) − j (kr) eters. ℓ m D(0) dr (2ℓ 1)(2ℓ + 3) ℓ Z h − As discussed above if the halos in which objects re- ℓ(ℓ 1) − j −2(kr) side have undergone recent mergers and thus depend on −(2ℓ 1)(2ℓ + 1) ℓ − properties other than halo mass then the simple scaling (ℓ + 1)(ℓ + 2) with (b 1) may not be valid. This is unlikely to be rel- jℓ+2(kr) dr, (35) − −(2ℓ + 1)(2ℓ + 3) evant for LRGs, which are old red galaxies sitting at the i center of group and cluster sized halos. Moreover, num- where dN/dr is the redshift distribution normalised to ber density of LRGs is so high that it is reasonable to unity and written in terms of comoving distance r, and D assume that almost every group sized halo contains one, is the growth function. The code automatically switches since otherwise it is difficult to satisfy both the number to the Limber approximation when this becomes accu- density and high bias at the same time [98]. For LRGs rate. Note that for the low multipoles, it is essential to we thus do not expect there is a second variable in addi- include the redshift-space distortion even for a photomet- tion to halo mass and halo occupation models find that ric survey because a significant amount of power comes 13 populating all halos with mass above 10 M⊙/h with an from Fourier modes that are not transverse to the line of LRG is consistent with all the available data [99]. Hence sight. we will only use p = 1 in Equation (31). The same also We use an independent bias parameter for each red- holds for the photometric LRGs discussed below. shift slice. In addition to the bias dependence, we also use the Equation (32) to take into account non-linear cor- rections. While strictly speaking the value of Q should be different for each slice, we use a single free parameter Q for all slices. We have explicitly checked that non-linear B. Photometric LRGs from SDSS corrections are negligible for k < 0.1h/Mpc and there- fore this is not a major issue. Since there is a strong overlap between the spectroscopic and photometric sam- We use data from Padmanabhan et al. [94], who pro- ple for z < 0.45, we use only slices 5–7 when combining vide the LRG angular power spectrum measured in 8 this data with spectroscopic sample. redshift slices (denoted 0–7) covering the range 0.2 < zphoto < 0.6 in slices of width ∆zphoto =0.05. The power spectrum is based on 3500 square degrees of data. We C. Photometric quasars from SDSS use only data-points that correspond to k < 0.1h/Mpc at the mean redshift of each individual slice. We use We use the power spectrum of the high redshift the full Bessel integration to calculate the angular power quasar photometric sample that has recently been con- spectrum on largest scales and account for the redshift- structed for ISW and CMB lensing cross-correlation stud- distortion power as described in Padmanabhan et al. [94]: ies [95, 100]. The sample covers 5800 square degrees of sky and originally had two photometric redshift ranges, 0.65 < zphoto < 1.45 (“QSO0”) and 1.45 < zphoto < gg gv vv Cℓ = Cℓ + Cℓ + Cℓ , (33) 2.00 (“QSO1”). It consists of ultraviolet-excess (UVX; u g < 1.0) point sources, classified photometrically as quasars− [101], and with photometric redshifts [102]. The where superscripts g and v denote galaxies over-density classification and photometric redshifts were at the time and velocity terms respectively. The bias and β depen- of sample construction only available over a subset of the dence has been put back into the Bessel integral as it now survey region; they were extended to the remaining re- depends on the value of k. The three terms are given by gion using a nearest-neighbor algorithm in color space the integrals: [95]. As described below, we only used the QSO1 sam- ple as QSO0 appears to suffer from systematic errors on large scales. dk Cgg = 4π ∆2(k) W (k) 2, The largest angular scales in the quasar data are sub- ℓ k | ℓ | Z ject to at least three major sources of systematic error: dk Cgv = 8π ∆2(k) [W ∗(k)W r(k)] , and stellar contamination, errors in the Galactic extinction ℓ k ℜ ℓ ℓ maps, and calibration errors. All of these are potentially Z dk much worse than for the LRGs: some stars (e.g. M dwarf- Cvv = 4π ∆2(k) W r(k) 2, (34) ℓ k | ℓ | white dwarf binaries) can masquerade as quasars, and we Z 9

are relying on the u band where extinction is most severe QSO0-star power spectrum QSO1-star power spectrum and the photometric calibration is least well understood. 0.004 0.004 These errors were discussed in the context of ISW stud- 0.003 0.003 0.002 0.002 π π ies [95], but if one wishes to use the quasar auto-power 0.001 0.001 /2 /2 qs ℓ 0 qs ℓ 0 spectrum on the largest angular scales the situation is -0.001 -0.001 +1)C +1)C ℓ ℓ more severe. ( -0.002 ( -0.002 ℓ -0.003 ℓ -0.003 We investigated this subject by computing the cross- -0.004 -0.004 power spectrum of each QSO sample with the SDSS 2 3 5 10 20 30 50 2 3 5 10 20 30 50 18.0 1.4). The cross-power QSO0-red star power spectrum QSO1-red star power spectrum − should be zero in the absence of systematics but it could 0.004 0.004 0.003 0.003 be positive if there is stellar contamination in the QSO 0.002 0.002 π π 0.001 0.001 sample. Either positive or negative correlation could re- /2 /2 qs ℓ 0 qs ℓ 0 sult from photometric calibration errors which shift the -0.001 -0.001 +1)C +1)C ℓ ℓ

( -0.002 ( -0.002 quasar and stellar locus in nontrivial ways. The results ℓ ℓ -0.003 -0.003 of this correlation are shown in Figure 1, with error bars -0.004 -0.004 estimated from the usual harmonic space method, 2 3 5 10 20 30 50 2 3 5 10 20 30 50 ℓ ℓ qq −1 ss (C + nq )C qs ℓ ℓ FIG. 1: The correlation of each quasar sample with stars and σ(Cℓ )= 2 2 , (36) [(ℓmaxq + 1) ℓ ]fsky with the red stars. − min qq where Cℓ is the quasar autopower spectrum, nq is the ss 2. We then assume that the functional form for b(z) number of quasars per steradian, and Cℓ is the star au- −1 is either topower spectrum; the ns term is negligible. (Aside from boundary effects, this is the same error that one 1+ z 5 b(z) 1+ (38) would obtain by correlating random realizations of the ∝ 2.5 quasar field with the actual star field.) From the fig-   ure, QSO1 appears clean, but QSO0 appears contami- as measured in [103], or that its form is given by nated: the first bin (2 ℓ < 12) has a correlation of qs ≤−4 b(z) 1/D(z), (39) Cℓ = (2.9 1.0) 10 with the red stars. This is ∝ a 2.9σ−result± and strongly× suggests some type of sys- as would be valid if the clustering amplitude is not tematic− in the QSO0 signal on the largest angular scales. changing with redshift. In both cases the constant We are not sure of the source of this systematic, but the of proportionality is determined from the normali- amount of power in the quasar map that is correlated sation condition with the red stars is dn bdn 1 dz = dz =1. (40) qs 2 dz dz b(z) ℓ(ℓ + 1) qq ℓ(ℓ + 1) Cℓ −4 Z Z Cℓ (corr) = ss 2 10 . (37) 2π 2π Cℓ ∼ × 3. Once both dn/dz and b(z) are known we calculate The variation in QSO0 density that is correlated with the the theoretical angular power spectrum using the red stars is thus at the 1.4% level. This is consistent same code as for photometric LRGs, taking into with the excess auto-power∼ in QSO0 in the largest scale account all available ℓ points. This is the theoreti- qq −4 cal spectrum that is used to calculate the χ2 that bin, which is at the level of [ℓ(ℓ + 1)/2π]Cℓ 3 10 (see figure 10 in [95]) and is comparable to∼ what× one goes into the MCMC procedure. In principle, all might expect from the 1–2% calibration errors in SDSS free parameters that determine the shape of bdn/dz [84], although other explanations such as the extinction should be varied through the MCMC, rather than map are also possible. Because of this evidence for sys- being fixed at the best-fit point. However, in the tematics, we have not used the QSO0 autopower spec- limit of Gaussian likelihood, the two procedures are trum in our analysis; in what follows we only use QSO1. equivalent, while the latter offers significant speed QSO0 may be added in a future analysis if our under- advantages. standing of the systematics improves. 4. We calculate χ2 using two different methods. Our Because the redshift distribution is poorly known and standard method is to use all points and the full needs to be determined internally from the data sample covariance matrix assuming a Gaussian likelihood: itself the data is analyzed in a two step procedure: χ2 = (d t) C−1(d t), (41) 1. Power spectrum points with ℓ = 30 . . . 200 are used − · − to constrain the product (bdn/dz)(z), where b is the where d and t are data and theory vectors of Cℓ’s, linear bias at redshift z and dn/dz is the normalised respectively. This is likely to be a good approxima- radial window function. Although the fNL is taken tion except on the largest scales, where small num- into account at this step, its effect is sub-dominant. ber of modes leads to corrections that may affect 10

D. Cross-correlation between galaxies and dark QSO autopower window functions matter via Integrated Sach-Wolfe effect 0.18 L =6 0.16 ctr Lctr=18 0.14 One can also look for the fNL using the cross- 0.12 correlation between a tracer like galaxies or QSOs and 0.1 dark matter. Since we do not have dark matter maps L 0.08 from large scales we can use cosmic microwave back- W 0.06 ground (CMB) maps as a proxy. If the gravitational 0.04 potential is time dependent, as is the case in a universe 0.02 dominated by dark energy or curvature then this leads 0 to a signature in CMB, the so-called integrated Sach- -0.02 Wolfe (ISW) effect [105]. This signature can easily be 0 10 20 30 40 50 related to the dark matter distribution, but part of the L signal is not coming from ISW but from the primary CMB anisotropies at the last scattering surface. These FIG. 2: The window functions for the ℓ = 6 and ℓ = 18 bins of the quasar power spectrum. act as a noise and lead to a large sampling variance on large scales and as a result the statistical power of this technique is weakened. At the moment ISW is only de- tected at 4σ level [106]. Our procedure closely follows the outer limits (e.g. 3σ). To test this we model that of [95]∼ and is in many respects very similar to that 2 the first QSO bin with an inverse-χ distribution, of the Section III C. We use all 9 samples present in Ho but neglecting covariance of this bin with higher ℓ et al., although the discriminating power is mostly com- bins: ing from the NVSS-CMB cross-correlation because NVSS sample is available over 27 361 deg2 area and the tracers, 2 dℓ +1/n tℓ +1/n radio galaxies, are biased with b 2. First, bdn/dz and χ = N ln + 1 + b(z) are determined for each sample.∼ Here we always use tℓ +1/n dℓ +1/n −     b(z) 1/D(z) for all samples except the quasar sample. Gaussian χ2 for other points, (42) Then∝ we calculate the ISW Limber integral:

where N 21.5 is the effective number of modes 3Ω H2T contributing∼ to the first bin and n is the number of CgT = m 0 CMB dz [b(z) + ∆b(k(z),z)] ℓ c3(ℓ +1/2)2 quasars per steradian. (This was called the “equal Z dn d D(z) variance” case in Ref. [104], and is appropriate if (1 + z) D(z)P (k(z)) , (43) there is equal power in all modes; this is the case × dz dz D(0)   here to a first approximation since we find that the Poisson noise 1/n dominates.) where k(z) = (ℓ +1/2)/χ. Again, we used the full win- dow function for the NVSS-CMB correlation to avoid an unnecessary bias and we assume p = 1 for all the samples. Since the power rises dramatically at low ℓ in the fNL models, we have included the full window function in our calculation of the binned power spectrum, Cbin = IV. RESULTS ℓ WℓCℓ. The window functions for the lowest two QSO bins are shown in Figure 2, and the algorithm for their P computation is presented in Appendix A. We begin by plotting data-points and theoretical pre- As shown in the Section II C 1, if halos have undergone dictions for six of the datasets and values of fNL in Figure recent mergers then we would expect p 1.6 instead of 3. This plot deserves some discussion. The easiest and p 1 in Equation (31). Recent mergers∼ could be a plau- most intuitive to understand is the case of spectroscopic ∼ sible model for QSOs, whose activity could be triggered LRGs. We note that the inclusion of the fNL param- by a merger [53, 54, 55]. For QSOs used in this work we eter modifies the behaviour of the power spectrum on do not find that their number density places a significant the largest measured scales. Naively, one would expect constraint: at z 1.8 the number density of halos with LRGs to be very competitive at constraining fNL. In ∼ b 2.5 3 is one to two orders of magnitude higher than practice, however, fNL is degenerate with matter den- the∼ measured− number density, hence we can easily pick sity, which also affects the shape of the power spectrum and choose the halos with a recent merger and still satisfy and hence the constraints are somewhat weaker. Since the combined number density and bias constraint. This the effect of fNL raises very strongly, just a couple of is in agreement with conclusions of [103], which looked points on largest scales might break this degeneracy, im- at a similar QSO sample from 2dF. Because of this un- proving constraints by a significant factor. The effect certainty we therefore run another separate analysis for in the photometric LRG samples is similar, although we QSOs with p =1.6 (QSO merger case). are now looking the angular space, where the dependence 11

FIG. 3: This figure shows 6 datasets that are most relevant for our constraints on the value of fNL. In the left column show the NVSSxCMB Integrate Sach Wolfe Cross correlation, the QSO1 power spectrum, the spectroscopic LRG power spectrum, while the right column shows the last three slices of the photometric LRG sample. The lines show the best fit fNL = 0 model (black, solid) and two non-Gaussian models: fNL = 100 (blue, dotted), fNL = 100 (red, dashed). The ISW panel additionally − shows the fNL = 800 model as green, dot-dashed line. While changing fNL, other cosmological parameters were kept fixed. See text for further discussion. 12

FIG. 4: This figure shows the median value (red points) and 1,2 and 3-sigma limits on fNL obtained from different probes (vertical lines). The data set used are, from top to bottom: Photometric LRGs, Photometric LRGs with only slices 0–4 used, Spectroscopic LRGs, Integrated Sach Wolfe effect, photometric QSO, photometric QSOs using b(z) 1/D(z) biasing scheme (see Section III C), photometric QSOs using alternative χ2 calculation scheme (see Section III C), using∝ a scale dependent bias formula appropriate for recently merged halos (Section II C), Combined sample, Combined sample using a scale dependent bias formula appropriate for recently merged halos (for QSO), the last two resoluts to which a statistically independent WMAP 5 bispectrum fNL constraint was added. See text for discussion.

has been smeared out. The QSO plot again shows simi- of fNL produce anomalously large power in the angular lar behaviour, with two caveats. First, the changes in the power spectrum if bdn/dz has significant contributution predicted power spectrum on small scales are a result of at high-z tail, which probes large scales. Therefore, the the fact that bdn/dz is perturbed with changing fNL, al- bdn/dz fitting procedure skews the distribution towards though this is a minor effect. Second, the increase in the lower redshifts, leading to a lower bias overall. At very power at smallest ℓ for negative fNL is due to the fact that large values, e.g. fNL = 800, this effect is so severe that for sufficiently negative fNL (or sufficiently large scales), the b 1/D(z) scaling forces b < 1 at the low-redshift δb < 2b and hence power spectrum rises again above end. This∝ implies ∆b < 0, and the large-scale ISW sig- what− is expected in the Gaussian case. The more unex- nal actually goes negative (see top-left panel of Fig. 3). pected is the NVSS-CMB cross-correlation. Naively, one Therefore, the ISW is surprisingly bad at discriminating would expect that the first point of that plot will produce fNL and we were unable to fit the first NVSS ISW data a very strong fNL “detection”. However, the CMB cross- point with a positive fNL. This behavior is however only correlation signal is only linearly dependent on fNL, while of academic interest because the other data sets strongly cross-correlations of NVSS with other tracers of struc- rule out these extreme values of fNL. ture are quadratically dependent on fNL. Large values We ran a series of MCMC chains with base cosmolog- 13

Data set fNL strains fNL in both directions. The second and third

+54+101+143 point have some excess power relative to the best fit Photometric LRG 63− − − 85 331 388 fNL = 0 model and so give rise to a slightly positive value +115+215+300of fNL in the final fits. This is however not a statistically Photometric LRG (0-4) 34−194−375−444 − significant deviation from fNL = 0. +74+139+202 Spectroscopic LRG 70−83−191−371 We also test the robustness of fNL constraints from

+647+755+933 the quasar sample power spectrum by performing the ISW 105−337−1157−1282following tests. First we replace the form of evolution of +26+47+65 bias from that in Eq. (38) to that in Eq. (39). Second we QSO 8−37−77−102 replaced the details of the likelihood shape for the first +28+49+69 QSO(b=1/D) 8−38−81−111 points, we replace Eq. (41) with Eq. (42). In both cases,

2 +27+52+72 the effect on the constraints was minimal, as shown in QSO alternative χ 10−40−74−101 the Figure 4 and the Table I. +30+58+102 The large-scale quasar power spectrum could also be QSO merger 12−44−94−138 affected by spurious power (e.g. calibration fluctuations +23+42+65 Combined 28−24−57−93 or errors in the Galactic extinction map) or 1-halo shot

+16+39+65 noise, either of which would dominate at large scales over Comb. merger 31−27−62−127 the conventional P (k) k autocorrelation. At the level ∝ +18+33+52 of the current data these are not affecting our results: Combined + WMAP5 bispectrum fNL 36−17−36−57 uncorrelated power adds to the power spectrum Cℓ, and +13+29+53 Combined merger + WMAP5 bispectrum fNL 36−17−36−57 if present it would only would only tighten the upper limits on the power spectrum given by our ℓ = 6 and 18 points. The 1-halo shot noise would contribute an added TABLE I: Marginalised constraints on fNL with mean, 1σ constant to C since z 1.7 haloes are unresolved across (68% c.l.), 2σ (95% c.l.) and 3σ (99.7%) errors. Ordering ℓ ∼ exactly matches that of the Figure 4. See text for discussion. our entire range of ℓ; the observed power at ℓ 250 constrains the 1-halo noise to be much less than the∼ error bars at ℓ =6, 18. Nevertheless, if we had detected excess power in the quasars at the largest scales, a much more ical data and one of the four data sets considered above, detailed analysis would have been necessary to show that as well as a run in which all data were combined (with it was in fact due to fNL and not to systematics. the exception of slices 0–4 of photometric LRGs as de- If quasars are triggered by mergers, then they do scribed above). The results are summarized in Table I not reside in randomly picked halos and the use of the and visualized in the Figure 4. Eq. (18) may not be appropriate. As discussed in Sec- We note several interesting observations. When only tion IIC, one can replace the (b 1) factor in Eq. (18) slices 0–4 of the photometric LRG sample are used, the with a modified factor (b 1.6).− We quote the results results are weaker than those of the spectroscopic LRGs. in this case in the Table I− as “QSO merger.” The error The two trace comparable volume, but photometric sam- bars have increased by about 40%, which is somewhat ple does not use radial mode information and as a re- less than naively expected from a population with an av- sult its errors are expectedly larger. On the other hand, erage bias of around 2.5. This is due to various feedback the overall photometric sample performs somewhat bet- related to bdn/dz fitting. Moreover, this is likely to be ter than the spectroscopic LRG sample, due to its larger an extreme case since it assumes that all quasars live in volume: it traces LRGs up to z 0.6 as opposed to recently merged halos with short lifetimes, so we would z 0.45 for spectroscopic sample. ∼ ∼ expect that the true answer is somewhere in between the We find that the ISW is constraining the fNL param- two results. On the other hand, it is also unclear how eter rather weakly. This is somewhat disappointing, but accurate extended Press-Schechter formalism is for this not surprising, since the cross-correlation between LSS application, so there is some uncertainty associated with and CMB is weak and has only been detected at a few this procedure. Recent progress in understanding quasar sigma overall. In addition, as mentioned above, fNL en- formation and its relation to the underlying halo popu- ters only linearly (rather than quadratically) in the ISW lation [54] gives us hope that this can be solved in the expressions and is strongly degenerate with determina- future. tion of bdn/dz. Given that ISW S/N can only be im- We also quote results for the combined analysis. In this proved by another factor of 2 at most even with perfect ∼ case, the error on fNL shrinks somewhat more than one data we do not expect that it will ever provide competi- would expect assuming that error on fNL measurement tive constraints on fNL. from each individual dataset is independent. This is be- We find that the quasar power spectra give the cause other parameters, notably matter density and am- strongest constraints on fNL. This is due to their large plitude of fluctuations get better constrained when data volume and high bias. The lowest ℓ point in this data are combined. Furthermore, we note that when quasars set at ℓ = 6 is a non-detection and therefore highly con- are assumed to be recent mergers the errors expand by 14 an expected amount at the 3σ level, but hardly at a 2σ level. We have carefully investigated this anomaly and it seems to be due a peculiar shape of the likelihood surface and subtle interplay of degeneracies between fNL, matter density and amplitude of fluctuations.

V. DISCUSSION

The topic of this paper is signature of primordial non- Gaussianity of local type (the so called fNL model) in the large scale structure of the universe. Specifically, it was recently shown that this type of non-Gaussianity gives rise to the scale dependence of the highly biased trac- ers [47]. We extend this analysis by presenting a new derivation of the effect that elucidates the underlying as- sumptions and shows that in its simplest form it is based on the universality of the halo mass function only. Our FIG. 5: This figure shows 1 and 2σ contours on the ns-fNL derivation also allows for possible extensions of the sim- plane for our best combined data set with additional WMAP plest model, in which tracers of large scale structure de- 5 year bispectrum constraint (assumed to be independent of pend on properties other than the halo mass. One that ns). Red lines are predictions from the ekpyrotic models and we explore in some detail is the effect of recent merg- correspond to values of fixed γ and varying ǫ. Different line correspond to different values of γ, which varies between γ = ers. By using extended Press-Schechter model we show 1 (flat, constant, negative fNL) to γ = 0.2 in steps of 0.1. that in this case the effect has the same functional form, −The dashed green line corresponds to the theoretically− favored but the predicted scaling with fNL for a given bias has a value of γ = 1/√3 according to [107]. smaller amplitude than in the simplest model. − Second, we apply these results to constrain the value of fNL from the clustering of highly biased tracers of In this combined result, fNL = 0 is at just around 2σ, large scale structure at largest scales. In our analysis we which taken at a face value suggests than the evidence find that the best tracers are highly biased photometric for a significant non-Gaussianity found in 3 year WMAP quasars from SDSS at redshifts between 1.5 and 2, fol- data by [? ] may have been a statistical fluctuation lowed by photometric and spectroscopic LRGs at redshift rather than evidence of a real signal. around 0.5. Our final limits at 95% (99.7%) confidence The results are already sufficiently strong to con- are strain the ekpyrotic models of generating the primor- dial structure: these generically predict much higher 29 ( 65) 0.3 is disfavored at 2σ. If we assume quasars to live in recently merged halos, Our results are very∼ − promising and with this first anal- we get essentially the same result with the upper limit ysis we already obtain constraints comparable to the best relaxed to 89. previous constraints. However, we should be cautious 15 and emphasize that this is only the first application of shows that small values of fNL do not change mass func- this new method to the data and there are several is- tion enough to affect this deduction. On the LRG side, sues that require further investigation. While analyti- the most obvious extension of our work would be to pur- cally the method is well motivated and can be derived sue the luminosity dependent clustering analysis of the on very general grounds, as shown in Section IIC1, the spectroscopic sample. It is well known that LRG cluster- method needs to be verified further in N-body simula- ing amplitude is luminosity dependent [110] and so select- tions using large scale tracers comparable to these used ing only brighter LRGs would lead to a higher bias sam- in our analysis. Equation (18) has been calibrated with ple and could improve our limits, but ideally one would N-body simulations using matter-halo cross-correlation want to perform the analysis with optimal weighting to [47]. Auto-correlation analysis of biased halos, which is minimize the large scale errors as in [93]. Similar type of the basis for the strongest constraints derived here, is luminosity dependent analysis could also be done on the typically noisier and has not been verified at the same LRG photometric sample used here [94]. level of accuracy, although there is no obvious reason Finally, with the future data sets from several planned why it should give any different results. Still, it would be or ongoing surveys, especially those related to the bary- useful to have larger simulations where the scale depen- onic oscillations most of which use highly biased tracers dent bias could be extracted with high significance from of large scale structure, a further increase in sensitivity auto-correlation analysis. In addition, it would be useful should be possible [47]. The ultimate application of the to verify the scaling relations in simulations on samples large-scale clustering method would involve oversampling defined as closely as possible to the real data should be the 3-dimensional density field in several samples with a selected, in our case by choosing halos with mean bias of range of biases, so that excess clustering due to fNL can 2 at z =0.5 for the LRG and b =2.7 at z =1.7 for QSO be cleanly separated from contamination due to errors samples. in calibration or extinction correction, which are major A second uncertainty has to do with the halo bias de- challenges as one probes below the 1% level. The ulti- pendence on parameters other than halo mass. We have mate sensitivity of the method will likely depend on our shown that for QSOs we need to allow for the possibility ability to isolate these systematics and these should be that they are triggered by recent mergers and we have the subject of future work. presented extended Press Schechter predictions for the Overall, the remarkably tight constraints obtained amplitude of the effect in this case, which can affect the from this first analysis on the real data, while still sub- limits. To some extent these predictions have been ver- ject to certain assumptions, is a cause of optimism for ified using simulations [64], but we do not have a re- the future and we expect that the large scale clustering liable model of populating QSOs inside halos to predict of highly biased tracers will emerge as one of the best which of the limits is more appropriate for our QSO sam- methods to search for non-Gaussianity in initial condi- ple. Moreover, just as the overall halo bias has recently tions of our universe. been shown to depend on variables other than the halo mass [66], it is possible that the large scale fNL correction also depends on variables other than the mass and merg- Acknowledgements ing history: while we have shown that extended Press- Schechter already predicts the dependence on the merg- We acknowledge useful discussions with Niayesh Af- ing history, other dependences less amenable to analytic shordi and Dan Babich. calculations may also exist. It is clear that these issues A.S. is supported by the inaugural BCCP Fellowship. deserve more attention and we plan to investigate them C.H. is supported by the U.S. Department of Energy un- in more detail in the future using large scale simulations der contract DE-FG03-02-ER40701. N.P. is supported combined with realistic quasar formation models. by a Hubble Fellowship HST.HF- 01200.01 awarded by On the observational side, there are many possible ex- the Space Telescope Science Institute, which is operated tensions of our analysis that can be pursued. In this by the Association of Universities for Research in As- paper we have pursued mostly quasars and luminous red tronomy, Inc., for NASA, under contract NAS 5-26555. galaxies (LRGs), two well studied and highly biased trac- Part of this work was supported by the Director, Office ers of large scale structure. On quasar side, we have of Science, of the U.S. Department of Energy under Con- only analyzed photometric QSO samples split by redshift, tract No. DE-AC02-05CH11231. U.S. is supported by but we should be able to obtain better constraints if we the Packard Foundation and NSF CAREER-0132953 and also use the luminosity information, specially if brighter by Swiss National Foundation under contract 200021- quasars are more strongly biased [103]. Moreover, it is 116696/1. worthwhile to apply this analysis also to z > 3 spec- Funding for the Sloan Digital Sky Survey (SDSS) and troscopic quasar sample from SDSS [108], which is very SDSS-II has been provided by the Alfred P. Sloan Foun- highly biased. Its modelling appears to require almost dation, the Participating Institutions, the National Sci- every massive halo to host a quasar [109], which would ence Foundation, the U.S. Department of Energy, the reduce the uncertainties related to secondary parameter National Aeronautics and Space Administration, the halo bias dependence. An order of magnitude estimate Japanese Monbukagakusho, and the Max Planck Society, 16 and the Higher Education Funding Council for England. The parameters are then estimated according to The SDSS Web site is http://www.sdss.org/. −1 The SDSS is managed by the Astrophysical Research pi = (F )ij (qj qj noise), (A4) − h i Consortium (ARC) for the Participating Institutions. where q noise is the expectation value of q for Poisson The Participating Institutions are the American Mu- h j i j seum of Natural History, Astrophysical Institute Pots- noise. This is equal to dam, University of Basel, University of Cambridge, Case 1 −1 −1 Western Reserve University, The University of Chicago, qj noise = Tr C CiC N , (A5) Drexel University, Fermilab, the Institute for Advanced h i 2  Study, the Japan Participation Group, The Johns Hop- where N is the Poission noise matrix (i.e. a diagonal kins University, the Joint Institute for Nuclear Astro- matrix with entries equal to the reciprocal of the mean physics, the Kavli Institute for Particle number of galaxies per pixel). This quantity can be com- and Cosmology, the Korean Scientist Group, the Chi- puted using the same machinery as used to compute F. nese Academy of Sciences (LAMOST), Los Alamos Na- The actual expectation values of the binned power tional Laboratory, the Max-Planck-Institute for Astron- spectra pi for a general power spectrum [i.e. not nec- omy (MPIA), the Max-Planck-Institute for Astrophysics essarily Equation (A1)] are given by (MPA), New Mexico State University, Ohio State Univer- sity, University of Pittsburgh, University of Portsmouth, −1 pi = (F )ij ( qj total qj noise); (A6) , the United States Naval Observa- h i h i − h i tory, and the University of Washington. since signal and noise are uncorrelated this reduces to:

−1 1 −1 −1 −1 p = (F ) q signal = (F ) Tr C C C S , APPENDIX A: WINDOW FUNCTIONS h ii ij h j i 2 ij i (A7) S This appendix describes the computation of the win- where is the signal covariance matrix. Its entries are dow functions. We begin with a brief revisit of the prin- ℓ ciples underlying the window function (see the references ∗ Sαβ = Cℓ Yℓm(α)Y (β), (A8) for more details) and then describe our computational ℓm ℓ m=−ℓ method. X X where α and β are pixels: 1 α, β Npix. It could also be written as ≤ ≤ 1. Principles ℓ S = C Y Y† , (A9) The power spectra in this paper were computed using ℓ ℓm ℓm =− the methodology of Padmanabhan et al. [111], imple- Xℓ mXℓ mented on the sphere as described in Refs. [94, 112]. The where Yℓm is a vector of length Npix containing the val- power spectrum is estimated from a vector x of length ues of the spherical harmonic Yℓm at each pixel. With Npix containing the galaxy overdensities in each of the some algebra the expectation value collapses down to: Npix pixels. Following the notation of Ref. [94], we esti- mate the power spectrum in bins, p = W C , (A10) h ii iℓ ℓ ˜i ℓ Cℓ = piCℓ, (A1) X i where the window function W is: X iℓ where C˜i is 1 if multipole l is in the ith bin, and 0 other- ℓ ℓ 1 −1 † −1 −1 wise, and p are the parameters to be estimated. We can Wiℓ = (F )ij Y C CiC Yℓm. (A11) i 2 ℓm C =− then define the template matrices, i, which are the par- mXℓ tial derivatives of the covariance matrix of x with respect to pi. The quadratic estimators are then defined: 2. Computation 1 T −1 −1 qi = x C CiC x, (A2) 2 Like the other matrix operations with million-pixel maps, direct computation of W using Equation (A11) is where C is an estimate of the covariance matrix used to iℓ not feasible; it is O(N 3 ). We have therefore resorted to weight the data (our choice is described in Ho et al. [95]). pix Monte Carlo methods, analogous to those used for trace We also build a Fisher matrix, estimation, to simultaneously solve for all of the ℓ and 1 m terms in Equation (A11). Define a random vector z F = Tr C−1C C−1C . (A3) ij 2 i j of length Npix and with entries consisting of independent  17 random numbers 1 (i.e. probability 1/2 of being 1 and To do a Monte Carlo evaluation of the average, we can 1/2 of being 1).± Then zzT = 1, so we can write: take a random vector z, compute the quantities C−1z and − h i −1 C Ciz. The latter dominates the computation time, as ℓ −1 1 it requires one expensive C operation for each power W = (F −1) Y† C−1zzT C C−1Y , iℓ 2 ij ℓm i ℓm spectrum bin, but it is also needed in the Monte Carlo * =− + mXℓ evaluation of the Fisher matrix Fij and hence comes with (A12) † no added cost. The inner product Y u for any pixel- or ℓm space vector u is the spherical harmonic transform of u, 3 2 ℓ / for which “fast” O(Npix ) algorithms exist. We use the 1 −1 † −1 † −1 Wiℓ = (F )ij (Y C z)(Y C Ciz) . implementation of the spherical harmonic transform of 2 ℓm ℓm *m=−ℓ + Hirata et al. [112]. X (A13)

[1] A. A. Starobinskiˇi, Soviet Journal of Experimental and [27] T. Suyama and M. Yamaguchi, Phys. Rev. D 77, 023505 Theoretical Physics Letters 30, 682 (1979). (2008). [2] A. H. Guth, Phys. Rev. D 23, 347 (1981). [28] K. Enqvist et al., Physical Review Letters 94, 161301 [3] A. D. Linde, Physics Letters B 108, 389 (1982). (2005). [4] A. Albrecht and P. J. Steinhardt, Physical Review Let- [29] K. Enqvist et al., ArXiv High Energy Physics - Phe- ters 48, 1220 (1982). nomenology e-prints (2005). [5] E. Komatsu et al., ArXiv e-prints 803, (2008). [30] A. Chambers and A. Rajantie, Physical Review Letters [6] V. F. Mukhanov and G. V. Chibisov, Soviet Journal of 100, 041302 (2008). Experimental and Theoretical Physics Letters 33, 532 [31] A. Jokinen and A. Mazumdar, Journal of Cosmology (1981). and Astro-Particle Physics 4, 3 (2006). [7] S. W. Hawking, Physics Letters B 115, 295 (1982). [32] P. Creminelli and L. Senatore, Journal of Cosmology [8] A. H. Guth and S.-Y. Pi, Physical Review Letters 49, and Astro-Particle Physics 11, 10 (2007). 1110 (1982). [33] J.-L. Lehners and P. J. Steinhardt, Phys. Rev. D 77, [9] A. A. Starobinskij, Physics Letters B 117, 175 (1982). 063533 (2008). [10] J. M. Bardeen, P. J. Steinhardt, and M. S. Turner, Phys. [34] J.-L. Lehners and P. J. Steinhardt, ArXiv e-prints 804, Rev. D 28, 679 (1983). (2008). [11] J. Khoury, B. A. Ovrut, P. J. Steinhardt, and N. Turok, [35] E. Komatsu et al., Astrophys. J. Supp.148, 119 (2003). Phys. Rev. D 64, 123522 (2001). [36] P. Creminelli, L. Senatore, M. Zaldarriaga, and M. [12] J. Khoury, P. J. Steinhardt, and N. Turok, Physical Tegmark, Journal of Cosmology and Astro-Particle Review Letters 91, 161301 (2003). Physics 3, 5 (2007). [13] P. J. Steinhardt and N. Turok, Science 296, 1436 (2002). [37] T. H. Jarrett et al., Astron. J.119, 2498 (2000). [14] E. I. Buchbinder, J. Khoury, and B. A. Ovrut, Physical [38] E. Jeong and G. F. Smoot, ArXiv e-prints 710, (2007). Review Letters 100, 171302 (2008). [39] C. Hikage et al., ArXiv e-prints 802, (2008). [15] A. R. Liddle and A. Mazumdar, Phys. Rev. D 61, [40] A. Cooray, D. Sarkar, and P. Serra, ArXiv e-prints 803, 123507 (2000). (2008). [16] S. Gupta, K. A. Malik, and D. Wands, Phys. Rev. D [41] A. Cooray, C. Li, and A. Melchiorri, Phys. Rev. D 77, 69, 063513 (2004). 103506 (2008). [17] R. Bean, J. Dunkley, and E. Pierpaoli, Phys. Rev. D [42] R. Scoccimarro, E. Sefusatti, and M. Zaldarriaga, Phys. 74, 063503 (2006). Rev. D 69, 103513 (2004). [18] M. Beltr´an, ArXiv e-prints 804, (2008). [43] F. Lucchin, S. Matarrese, and N. Vittorio, Astron. As- [19] E. Komatsu and D. N. Spergel, Phys. Rev. D 63, 063002 trophys.162, 13 (1986). (2001). [44] S. Matarrese, L. Verde, and R. Jimenez, Astrophys. J. [20] A. Gangui, F. Lucchin, S. Matarrese, and S. Mollerach, 541, 10 (2000). Astrophys. J. 430, 447 (1994). [45] J. Robinson and J. E. Baker, Mon. Not. R. As- [21] J. Maldacena, Journal of High Energy Physics 5, 13 tron. Soc.311, 781 (2000). (2003). [46] A. J. Benson, C. Reichardt, and M. Kamionkowski, [22] N. Bartolo, E. Komatsu, S. Matarrese, and A. Riotto, Mon. Not. R. Astron. Soc.331, 71 (2002). Phys. Rep.402, 103 (2004). [47] N. Dalal, O. Dor´e, D. Huterer, and A. Shirokov, Phys. [23] A. Linde and V. Mukhanov, Phys. Rev. D 56, 535 Rev. D 77, 123514 (2008). (1997). [48] J. M. Bardeen, J. R. Bond, N. Kaiser, and A. S. Szalay, [24] F. Bernardeau and J.-P. Uzan, Phys. Rev. D 66, 103506 Astrophys. J. 304, 15 (1986). (2002). [49] R. K. Sheth and G. Tormen, Mon. Not. R. As- [25] D. H. Lyth, C. Ungarelli, and D. Wands, Phys. Rev. D tron. Soc.308, 119 (1999). 67, 023503 (2003). [50] W. H. Press and P. Schechter, Astrophys. J. 187, 425 [26] G. Dvali, A. Gruzinov, and M. Zaldarriaga, Phys. Rev. (1974). D 69, 023505 (2004). [51] S. Cole and N. Kaiser, Mon. Not. R. Astron. Soc.237, 18

1127 (1989). [82] J. A. Smith et al., Astron. J.123, 2121 (2002). [52] S. Matarrese and L. Verde, Astrophys. J. Lett.677, L77 [83] D. L. Tucker et al., Astronomische Nachrichten 327, 821 (2008). (2006). [53] E. Hatziminaoglou et al., Mon. Not. R. Astron. Soc.343, [84] N. Padmanabhan et al., Astrophys. J. 674, 1217 (2008). 692 (2003). [85] J. R. Pier et al., Astron. J.125, 1559 (2003). [54] P. F. Hopkins, L. Hernquist, T. J. Cox, and D. Kereˇs, [86] Z.ˇ Ivezi´c et al., Astronomische Nachrichten 325, 583 Astrophys. J. Supp.175, 356 (2008). (2004). [55] T. Urrutia, M. Lacy, and R. H. Becker, Astrophys. J. [87] M. A. Strauss et al., Astron. J.124, 1810 (2002). 674, 80 (2008). [88] D. J. Eisenstein et al., Astron. J.122, 2267 (2001). [56] S. R. Furlanetto and M. Kamionkowski, [89] G. T. Richards et al., Astron. J.123, 2945 (2002). Mon. Not. R. Astron. Soc.366, 529 (2006). [90] M. R. Blanton et al., Astron. J.125, 2276 (2003). [57] A. R. Wetzel et al., Astrophys. J. 656, 139 (2007). [91] J. K. Adelman-McCarthy et al., Astro- [58] C. Lacey and S. Cole, Mon. Not. R. Astron. Soc.262, phys. J. Supp.175, 297 (2008). 627 (1993). [92] J. K. Adelman-McCarthy et al., Astro- [59] M. R. Santos, V. Bromm, and M. Kamionkowski, phys. J. Supp.162, 38 (2006). Mon. Not. R. Astron. Soc.336, 1082 (2002). [93] M. Tegmark et al., Phys. Rev. D 74, 123507 (2006). [60] A. J. Benson, M. Kamionkowski, and S. H. Hassani, [94] N. Padmanabhan et al., Mon. Not. R. Astron. Soc.378, Mon. Not. R. Astron. Soc.357, 847 (2005). 852 (2007). [61] N. Dalal, M. White, J. R. Bond, and A. Shirokov, ArXiv [95] S. Ho et al., ArXiv e-prints 801, (2008). e-prints 803, (2008). [96] J. J. Condon et al., Astron. J.115, 1693 (1998). [62] J. D. Cohn, J. S. Bagla, and M. White, Mon. Not. R. As- [97] S. Cole et al., Mon. Not. R. Astron. Soc.362, 505 (2005). tron. Soc.325, 1053 (2001). [98] R. Mandelbaum and U. Seljak, Journal of Cosmology [63] J. D. Cohn and M. White, Astroparticle Physics 24, 316 and Astro-Particle Physics 6, 24 (2007). (2005). [99] N. Padmanabhan, M. White, P. Norberg, and C. Por- [64] O. Fakhouri and C.-P. Ma, Mon. Not. R. As- ciani, ArXiv e-prints 802, (2008). tron. Soc.386, 577 (2008). [100] C. M. Hirata et al., ArXiv e-prints 801, (2008). [65] V. Springel et al., Nature (London) 435, 629 (2005). [101] G. T. Richards et al., Astrophys. J. Supp.155, 257 [66] L. Gao, V. Springel, and S. D. M. White, (2004). Mon. Not. R. Astron. Soc.363, L66 (2005). [102] M. A. Weinstein et al., Astrophys. J. Supp.155, 243 [67] R. H. Wechsler et al., Astrophys. J. 652, 71 (2006). (2004). [68] Y. P. Jing, Y. Suto, and H. J. Mo, Astrophys. J. 657, [103] C. Porciani and P. Norberg, Mon. Not. R. As- 664 (2007). tron. Soc.371, 1824 (2006). [69] L. Gao and S. D. M. White, Mon. Not. R. As- [104] J. R. Bond, A. H. Jaffe, and L. Knox, Astrophys. J. tron. Soc.377, L5 (2007). 533, 19 (2000). [70] A. Lewis and S. Bridle, Phys. Rev. D 66, 103511 (2002). [105] R. K. Sachs and A. M. Wolfe, Astrophys. J. 147, 73 [71] G. Hinshaw et al., Astrophys. J. Supp.170, 288 (2007). (1967). [72] L. Page et al., Astrophys. J. Supp.170, 335 (2007). [106] T. Giannantonio et al., Phys. Rev. D 77, 123520 (2008). [73] K. Grainge et al., Mon. Not. R. Astron. Soc.341, L23 [107] J.-L. Lehners and N. Turok, Phys. Rev. D 77, 023516 (2003). (2008). [74] C. L. Kuo et al., Astrophys. J. 664, 687 (2007). [108] Y. Shen et al., Astron. J.133, 2222 (2007). [75] A. C. S. Readhead et al., Astrophys. J. 609, 498 (2004). [109] M. White, P. Martini, and J. D. Cohn, ArXiv e-prints [76] P. Astier et al., Astron. Astrophys.447, 31 (2006). 711, (2007). [77] M. Fukugita et al., Astron. J.111, 1748 (1996). [110] W. J. Percival et al., Astrophys. J. 657, 645 (2007). [78] D. G. York et al., Astron. J.120, 1579 (2000). [111] N. Padmanabhan, U. Seljak, and U. L. Pen, New As- [79] D. W. Hogg, D. P. Finkbeiner, D. J. Schlegel, and J. E. tronomy 8, 581 (2003). Gunn, Astron. J.122, 2129 (2001). [112] C. M. Hirata et al., Phys. Rev. D 70, 103501 (2004). [80] J. E. Gunn et al., Astron. J.131, 2332 (2006). [81] J. E. Gunn et al., Astron. J.116, 3040 (1998).