arXiv:astro-ph/0606739v2 5 Jan 2007 h asso ula lc oe nLmnu litclGal Elliptical Luminous in Holes Black Nuclear of Masses The C/ikOsraoy or fSuisi srnm n Astro and Astronomy in Studies of Board Observatory, UCO/Lick mlctosfrteSaeDniyo h otMsieBlack Massive Most the of Density Space the for Implications h bevtre fteCrei nttto fWsigo,P Washington, of Institution Carnegie the of Observatories The h bevtre fteCrei nttto fWsigo,P Washington, of Institution Carnegie the of Observatories The pc eecp cec nttt,30 a atnDie Ba Drive, Martin San 3700 Institute, Science Telescope Space eateto srnm,Uiest fClfri,Berkele California, of University Astronomy, of Department ainlOtclAtooyObservatory Astronomy Optical National eateto srnm,Uiest fMcia,AnArbor Ann Michigan, of University Astronomy, of Department Arbor Ann Michigan, of University Astronomy, of Department rneo nvriyOsraoy etnHl,Princeton Hall, Peyton Observatory, University Princeton eateto srnm,Uiest fTxs utn X787 TX Austin, Texas, of University Astronomy, of Department B bevtr,Uiest fAioa usn Z85721 AZ Tucson, Arizona, of University Observatory, LBT aiona at rz A95064 CA Cruz, Santa California, lxiV Filippenko V. Alexei oga Richstone Douglas ct Tremaine Scott alGebhardt Karl acPostman Marc ihr Green Richard o .Lauer R. Tod lnDressler Alan .M Faber M. S. .C Aller C. M. usC Ho C. Luis ..Bx272 usn Z85726 AZ Tucson, 26732, Box P.O. , 2 umte to Submitted ,C 94720-3411 CA y, tmr,M 21218 MD ltimore, sdn,C 91101 CA asadena, sdn,C 91101 CA asadena, hsc,Uiest of University physics, h srpyia Journal Astrophysical The J08544 NJ , I48109 MI , 48109 MI , 12 xe and axies Holes. 1 – 2 –

John Kormendy

Department of Astronomy, University of Texas, Austin, TX 78712

John Magorrian

Department of Physics, University of Durham, Durham, United Kingdom, DH1 3LE

Jason Pinkney

Department of Physics and Astronomy, Ohio Northern University, Ada, OH 45810

ABSTRACT

Black hole masses predicted from the M• σ relationship conflict with those pre- − dicted from the M• L relationship for the most luminous , such as brightest − cluster galaxies (BCGs). This is because stellar velocity dispersion, σ, increases only weakly with luminosity for BCGs and other giant ellipticals. The M• L relationship − predicts that the most luminous BCGs may harbor black holes with M• approaching 10 9 10 M⊙, while the M• σ relationship always predicts M• < 3 10 M⊙. Lacking direct − × determination of M• in a sample of the most luminous galaxies, we advance arguments that the M• L relationship is a plausible or even preferred description for BCGs and − other galaxies of similar luminosity. Under the hypothesis that cores in central stellar density are formed by binary black holes, the inner-core cusp radius, rγ, may be an independent witness of M•. Using central structural parameters derived from a large sample of early-type galaxies observed by HST, we argue that L is superior to σ as an indicator of r in luminous galaxies. Further, the observed r M• relationship for 11 γ γ − core galaxies with measured M• appears to be consistent with the M• L relationship − for BCGs. BCGs have large cores appropriate for their large luminosities that may be difficult to generate with the more modest black hole masses inferred from the M• σ − relationship. M• L may be expected to hold for BCGs, if they were formed in dissipa- ∼ tionless mergers, which should preserve ratio of black hole to stellar mass. This picture appears to be consistent with the slow increase in σ with L and the more rapid increase in effective radii, Re, with L seen in BCGs as compared to less luminous galaxies. If BCGs have large BHs commensurate with their high luminosities, then the local black 9 hole mass function for M• > 3 10 M⊙ may be nearly an order of magnitude richer × than what would be inferred from the M• σ relationship. The volume density of − the most luminous QSOs at earlier epochs may favor the predictions from the M• L − relationship.

Subject headings: galaxies: nuclei — galaxies: structure — black hole physics – 3 –

1. The Most Luminous Galaxies The Most Massive Black Holes ⇐⇒ Nearly every elliptical and spiral bulge has a black hole at its center (Magorrian et al. 1998). The masses of the black holes, M•, are related to the V -band luminosity, L, and average stel- lar velocity dispersion, σ, of their host galaxies (Dressler 1989; Kormendy 1993; Kormendy & Richstone 1995; Magorrian et al. 1998; Ferrarese & Merritt 2000; Gebhardt et al. 2000a; Tremaine et al. 2002; H¨aring & Rix 2004). The M• σ and M• L relationships are powerful tools as they allow the − − prediction of black hole masses — which are difficult to measure directly — from readily available galaxy parameters.

The black hole population in the most massive galaxies has yet to be assayed, however, which means that estimates of M• in these objects are based on extrapolations of relationships defined by smaller galaxies. The current record for largest black hole mass measured directly is M• 9 ∼ 3 10 M⊙ in M87 (Harms et al. 1994), yet M87 is only the second-ranked galaxy in a cluster of × modest richness. Brightest cluster galaxies (BCGs) in nearby Abell clusters are typically 3 ∼ × more luminous (Postman & Lauer 1995) and may host proportionately more massive BHs. Testing this hypothesis through measurements of stellar dynamics requires both high sensitivity and high spatial-resolution, given the low central surface brightnesses and relatively large distances of BCGs. Such observations were not possible with the Hubble Space Telescope (HST) even before the failure of the Space Telescope Imaging Spectrograph; they are only now becoming feasible with the advent of adaptive optics spectroscopy on 10m class telescopes.

9 A number of arguments suggest that black holes with M• > 3 10 M⊙ do exist, even if × this conclusion is not universal (e.g. McLure et al. 2004). Netzer (2003) argues that some QSOs 10 have M• > 10 M⊙ based on an empirical relationship between M•, broad-line width and nuclear luminosity for AGN. Bechtold et al. (2003) and Vestergaard (2004) also argue that some QSOs have black holes approaching this mass. Of particular relevance for BCGs is the hypothesis that cluster cooling flows are inhibited by AGN heating from the central galaxy (Binney & Tabor 1995; Churazov et al. 2002). Recent Chandra observations support a picture in which episodic AGN outbursts in BCGs heat the intra-cluster medium (Voit & Donahue 2005); the energetics required 10 to terminate cooling flows imply M• > 10 M⊙ for many clusters (Fabian et al. 2002).

Arguments for such massive black holes appear to be in conflict, however, with the expectations from the M• σ relationship applied to the local galaxy velocity-dispersion distribution function. −

1Based on observations made with the NASA/ESA Hubble Space Telescope, obtained at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-26555. These observations are associated with GO and GTO proposals # 5236, 5446, 5454, 5512, 5943, 5990, 5999, 6099, 6386, 6554, 6587, 6633, 7468, 8683, and 9107. 2The National Optical Astronomy Observatory is operated by AURA, Inc., under cooperative agreement with the National Science Foundation. – 4 –

Tremaine et al. (2002) find

−1 log(M•/M⊙) = (4.02 0.32) log(σ/200 km s ) + 8.19 0.06, (1) ± ± −1 −1 for H0 =70km s Mpc (which we will use throughout this paper). The Sheth et al. (2003) local velocity dispersion function shows a strong cut-off at σ 400 km s−1, which implies that galaxies 9 ≈ harboring black holes with M• > 3 10 M⊙ would be extremely rare. Bernardi et al. (2006a) have × identified a handful of galaxies with σ> 400 km s−1, but their results do not alter this conclusion.

Extrapolation of the M• σ relationship to galaxies more massive than M87 assumes that − σ (and not galaxy mass) is the fundamental parameter for determining M•. The uncertainty in such an extrapolation is underscored by Wyithe (2006), who argues that the M• σ relationship is − curved rather than linear in log-log space, in the sense that, at the high-σ end, the “log-quadratic” relationship predicts higher M• than does equation (1). The Wyithe M• σ relationship, implies 9 − that the space density of black holes with M• > 5 10 M⊙ may be substantially higher than that × implied by equation (1) (although the exact difference is highly sensitive to both the details of the velocity dispersion distribution function, and the assumed level of cosmic scatter in the M• σ − relationship).

In this paper we point out that the M• L relationship applied to the most luminous galaxies − predicts M• values that are significantly larger than those predicted by either the Tremaine et al. (2002) or Wyithe (2006) M• σ relationships. This difference arises because BCGs do not follow the − Faber & Jackson (1976) relationship between L and σ. The relationship between L and σ “plateaus” at large L in the sense that BCGs have relatively low σ for their high L (Oegerle & Hoessel 1991; see also Boylan-Kolchin et al. 2006, who have seen this effect in simulations.)

Resolution of which of the M• L or M• σ relationships is most representative of the black − − hole population in the most massive galaxies will only be possible when black hole masses can be measured in such galaxies. In advance of such work, however, we can advance a number of arguments that suggest that the M• L is a plausible and perhaps even preferred description for − such systems.

The first set of arguments are based on the central structure of BCGs and other luminous elliptical galaxies that have cores in their central brightness profiles (Lauer et al. 1995; Laine et al. 2002). A core is evident as a radius at which the steep envelope of the galaxy “breaks” and transitions to an inner cusp with a shallow slope in logarithmic coordinates. The favored theory for core formation posits that cores are formed when are ejected from the galaxy’s center by the decay of a binary BH created in a merger (Begelman et al. 1980; Ebisuzaki et al. 1991; Faber et al. 1997; Quinlan & Hernquist 1997; Milosavljevi´c & Merritt 2001). The size of the core then reflects the total mass ejected, which should be a function of M•. The size of the core may thus be an independent witness of M•. In BCGs and other galaxies of similar luminosity, galaxy luminosity is more closely related to the physical scale of the cores than σ, and the observed core size M• relationship for galaxies with cores and directly measured black hole masses appears to be consistent with the M• L relationship. − – 5 –

A second set of arguments come from considering the formation of BCGs. If BCGs are formed in “dry” mergers, then the ratio of black hole to stellar mass should be preserved over mergers, leading to the observed M• L relationship. In contrast, σ may change little over such mergers, − and no longer track black hole mass as well it does for the less luminous galaxies from which the M• σ has been determined. − Lastly, we consider the relative predictions of the M• L and M• σ relationships for the − − volume mass distribution function of black holes and which we compare to the predictions from QSO luminosity functions. A decisive discrimination between the two relationships is not possible without a better understanding of the cosmic scatter in both relationships, but the Tremaine et al. (2002) version of the M• L relationship probably predicts too few high mass black holes to support − the QSO luminosity function.

2. A Large Sample of Early Type Galaxies With Central Structure Characterized by HST

We start by comparing the two predictions M•(σ) (M• predicted from the M• σ relationship) − and M•(L) (M• predicted from the M• L relationship) for a sample of 219 galaxies for which we − have central structural parameters derived from HST imagery (Lauer et al. 2007a). We then present the separate relationships between core structure versus σ and L. This leads in turn to two separate predictions for how core size should be related to M•, which can be compared to the observed relationship between core size and M• for 11 core galaxies that have direct M• determinations.

The galaxy sample combines several different HST imaging programs that all used the Nuker- law parameterization (Lauer et al. 1995) to characterize the central starlight distributions. The properties and definition of this sample are presented in detail in Lauer et al. (2007a), but briefly, we combine surface photometry presented in Lauer et al. (1995), Faber et al. (1997), Laine et al. (2002), Rest et al. (2001), Ravindranath et al. (2001), Quillen et al. (2000), and Lauer et al. (2005). This diverse source material has been transferred to a common photometric system (V band) and −1 −1 − a common distance scale, adopting H0 =70 km s Mpc . The primary source of distances is the SBF survey of Tonry et al. (2001), but when possible we use the group memberships in Faber et al. (1989) and average SBF distances over the group. As the Tonry et al. (2001) SBF scale is consistent with H0 = 74, we scale up their SBF distances by 6%. The treatment of galaxies not in the SBF survey is detailed in Lauer et al. (2007a). The sample is listed in Table 1. It comprises 120 core galaxies, 87 power-law galaxies, and 12 intermediate galaxies.

The most important Nuker-law parameter for the present analysis is the break radius, rb, which is used to calculate the cusp radius, rγ, which in turn is used to represent the physical scale of the core (this parameter is discussed in detail in 4.1 and Appendix C). The average error in r is 30%, § γ based on comparison of Nuker parameters to non-parametric estimates of the same parameters.

Central velocity dispersions are provided by the “Hyperleda” augmentation of the Prugniel & Simien – 6 –

(1996) compendium of published velocity dispersions; no values were available for 30 of the total of 219 galaxies. We adopt a 10% typical error in σ. The M• σ relationship as initially presented by − Gebhardt et al. (2000a) used the average luminosity-weighted velocity dispersion measured in a slit along the major axis interior to the effective radius. Velocity dispersion profiles are unfortunately not available for the bulk of the galaxies; however, Gebhardt et al. (2000a) showed that the central values are likely to be within 5% of the radial averages.

2.1. Galaxy Luminosities

The sources of the present galaxy luminosities are discussed in detail in Lauer et al. (2007a). Most of the magnitudes are derived from VT or BT values drawn from the RC3 (de Vaucouleurs et al. 1991). Bulge luminosities are given for S0 and spiral galaxies based on bulge/disk decompositions in the literature. Absolute luminosities were calculated using the Schlegel et al. (1998) Galactic extinction values; we assume a typical MV error of 10%. The accuracy of the BCG luminosities is of special concern as we will argue that they imply higher M• than would be inferred from the σ values for the same galaxies. The present BCG luminosities are based on fitting r1/4 laws to the inner portions (r < 50 kpc) of the R-band Postman & Lauer (1995) brightness profiles, limiting the fits to radii that are well matched by this function. Graham et al. (1996) show that BCG brightness profiles are better described by S´ersic profiles with S´ersic n > 4, which is also true of giant elliptical galaxies in general (e.q. Ferrarese et al. 2006; Kormendy et al. 2007). However, BCGs with S´ersic n > 4, typically have extremely large effective radii that are factors of several larger than the actual radial limit of the surface photometry; this in turn implies unrealistically large total luminosities. The r1/4 laws give a conservative lower limit for BCG total luminosities. Even so, the derived luminosities are systematically much larger than those provided by the Sloan Digital Sky Survey (SDSS). We resolve this issue in Appendix A with a demonstration that the SDSS BCG luminosities are strongly biased to low values by excessive sky subtraction. The NIR apparent magnitudes provided by the 2MASS Extended Source Catalogue (Jarrett et al. 2000, 2003) have also been used to provide BCG total luminosities (Batcheldor et al. 2006); however in Appendix B we show that the 2MASS apparent magnitudes are also likely to be underestimates.

A separate issue raised by a number of our colleagues is that BCG luminosities may need to be “corrected” for intracluster light (ICL). One such treatment of ICL assumes that the BCG is coincident with the center of the cluster potential, and that the composite BCG+ICL can be modeled as two superimposed r1/4 laws (cf. Gonzalez et al. 2005). The ICL component is then subtracted to yield the “true” BCG luminosity. A key feature of such models is that the ICL profile is assumed to continue to rise in brightness at radii well interior to where it dominates, thus implying a substantial contribution at even small radii. There is little physical justification for a correction of this form, however. As noted above, giant elliptical galaxies in general (not just BCGs) have S´ersic n > 4. Further, the presumption that BCGs sit exactly at the center of their clusters is an – 7 – idealization that is actually realized in only a small fraction of systems. Postman & Lauer (1995) show that BCGs are typically displaced from the geometric cluster center by 90 kpc in projection ∼ and 260 km s−1 in velocity. Patel et al. (2006) showed that BCGs are typically displaced from ∼ the centroid of cluster X-ray emission by 129 kpc, consistent with the Postman & Lauer (1995) analysis. Lastly, the presumption that the ICL follows an r1/4 law into small radii is not uniquely demanded, and is probably inconsistent with the large velocity dispersion of stars truly not bound to the BCG. Again, BCGs are well described over a large radial range by S´ersic laws; in no case in the Graham et al. (1996) sample are there any profiles that have a distinct feature that objectively supports a two component model. This is not to say that ICL is not present, but the surface brightness at which it dominates even in the two component models are well outside the radii at which we measure the r1/4 laws used to estimate total luminosity (typically less than 50 kpc). The Zibetti et al. (2005) models of ICL show that it begins to dominate the BCGs at r 80 kpc from ∼ the BCG centers, corresponding to µ 26. We conclude that a strong correction to our BCG r ∼ luminosities for ICL is poorly justified.

3. A Contradiction Between the M• σ and M• L Relationships − −

The M• L relationship emerged from the first attempts to relate black hole mass to properties − of the host galaxy (Dressler 1989; Kormendy 1993; Kormendy & Richstone 1995). Much of the recent work on this problem, however, has focused on the M• σ relationship due to its apparent − smaller scatter (although see Novak et al. 2006 on the significance of this), as well as arguments that σ, rather than galaxy luminosity is the more fundamental parameter that determines how galaxies were formed (e.g., Wyithe & Loeb 2005). While L and σ are related by the Faber & Jackson (1976) relationship, since the discovery that galaxies lie on a “fundamental-plane” determined by L, σ, and the effective radius, Re, (Djorgovski & Davis 1987; Dressler et al. 1987), we know that neither L nor σ alone is sufficient to codify the full range of galaxy properties. The M• L relationship − thus may contain information that is not a trivial projection of the M• σ relationship. − The relationship between M• and L is shown in Figure 1. Most of the galaxies shown are 3 −1 −1 those presented in Tremaine et al. (2002), transformed to H0 = 70 kms Mpc . Due to the large scatter of the data points in Figure 1, estimating a mean M• L relationship is likely to be − sensitive to the fitting algorithm. We have elected to use the “symmetric” least-squares algorithm of Press et al. (1992) throughout this analysis. This technique allows for errors in both variables being fitted, and finds the best slope and intercept parameters without assigning either parameter as the independent or dependent variable. As a way of bracketing uncertainties in the mean M• L −

3 We augment the Tremaine et al. (2002) sample with M• determinations in NGC 1399 (Houghton et al. 2006), NGC 3031 (Bower et al. 2000), NGC 3998 (Bower et al. 2000), NGC 4374 (Bower et al. 1998), NGC 4486B (Kormendy et al. 1997), NGC 4945 (Greenhill et al. 1997), NGC 5128 (Marconi et al. 2001), NGC 7332 (Nelson et al. 2000), and Cygnus A (Tadhunter et al. 2003). – 8 – relationship, we performed one fit using all the data points, but for a second fit we used only galaxies with M < 19, because they appear to have less scatter. The fit to all data points gives V −

log(M•/M⊙) = (1.40 0.17)( M 21)/2.5 + 8.41 0.11, (2) ± − V − ± which is shown as the dashed line in Figure 1. Just fitting galaxies with M < 19 gives V −

log(M•/M⊙) = (1.70 0.22)( M 21)/2.5 + 8.22 0.08, (3) ± − V − ± which is shown as the dotted line in Figure 1. Both relationships agree well for 23 < M < − V 19; their differences in slope cause them to diverge slightly when extrapolated to more luminous − galaxies. Both relationships also agree well with the H¨aring & Rix (2004) relationship between M• and galaxy mass transformed back to luminosity, which we consider as a third M• L relationship. − Novak et al. (2006) found that the M•-mass relationship was not significantly less tight than the M• σ relationship, given the errors of the various samples. If so, then the reduced scatter in − the M•-mass relationship means that it should serve well as a relationship between M• and L; we −0.092(M +22) transform it by adopting M/L 6 10 V M⊙/L⊙, based on the M/L estimates given V ≈ × in Gebhardt et al. (2003); this gives

log(M•/M⊙) = (1.38 0.07)( M 22)/2.5 + 8.78 0.10. (4) ± − V − ± This is shown in Figure 1 as the solid line; within errors it is essentially identical to equation (2) for 25 < M < 23, the interval over which we will be extrapolating the M• L relationship to − V − − the most luminous galaxies in the sample.

Figure 2 shows M•(L) based on a combination of the three relationships presented in Figure 1 plotted against M•(σ) from equation (1) for the sample. The error bars along the M•(L) axis reflect the minimum and maximum predictions of M• given by the three relationships shown in Figure 1; the central values plotted are the mean of the minimum and maximum predicted M•. The L and 10 σ predictors diverge at large L, with all three M• L relationships predicting M• 10 M⊙ for − ∼ 9 the most luminous galaxies, while equation (1) predicts no values of M• larger than 3 10 M⊙. ∼ × The errors in M•(L) increase somewhat with galaxy luminosity but are much smaller than the differences between M•(L) and M•(σ), which approach an order of magnitude for some of the most luminous galaxies.4

The differences between M•(L) and M•(σ) cannot be reconciled by the Wyithe (2006) log- quadratic M• σ relationship. The asymmetric error bars in the σ-based predictions of M• shown − in Figure 2 reflect the implied change in predicted M• if the Wyithe (2006) relationship is used instead of the Tremaine et al. (2002) log-linear M• σ relationship. The Wyithe (2006) relationship − predicts slightly larger M• only for the largest σ values ( 30%), but still does not match the even ∼

4The error bars in Figure 2 do not include the systematic errors associated with the uncertainties in the individual relationships themselves. – 9 –

larger M•(L) for the same galaxies. As expected, M•(L) and M•(σ) do agree on average for the sample galaxies that actually have direct M• determinations, since it was this subset of galaxies that defined the relationships in the first place.

The disagreement of the two M• predictors for the larger set of galaxies lacking direct M• determinations can be traced to changes in the form of the L σ relationship as a function of − galaxy luminosity. Figure 3 shows this relationship for the sample galaxies. The typical σ value appears to level off for large L; indeed, there appears to be little relationship between σ and L for galaxies with M < 22. While most of the galaxies in this luminosity range are BCGs, other bright V − ellipticals show the same behavior. Put simply, the high luminosities of BCGs and other similarly bright ellipticals are not matched by similarly large velocity dispersions. The M• σ relationship − thus predicts unexceptional black hole masses for these exceptionally luminous galaxies.

This “saturation” in σ at BCG luminosities was noted in the BCG velocity dispersion study of Oegerle & Hoessel (1991), but it appears only weakly in the SDSS study of Bernardi et al. (2003). We suggest that this may be due to the use of different BCG luminosities, based on the analysis of the SDSS magnitudes of BCGs presented in Appendix A. For the core galaxies, we find L σ7, ∼ a much steeper relationship than the classic L σ4. Specifically, a symmetrical least-squares fit ∼ (Press et al. 1992) to the 99 core galaxies with M < 21 and having a σ value produces: V − M = 2.5 (6.5 1.3) log(σ/250 km s−1) 22.45 0.18. (5) V − ± − ± However, since the L σ relationship appears to be nonlinear, even this fit may not be the best − approximation for the most luminous galaxies. This result also contrasts with the relationship measured for power-law galaxies alone,

M = 2.5 (2.6 0.3) log(σ/150 km s−1) 20.30 0.10. (6) V − ± − ±

The distribution of points with M• measurements shows what appears to be a bias in the BH sample: galaxies with M 22.5 with measured M• have a higher-than-average σ than V ∼ − typical galaxies at this luminosity — or conversely have low luminosities for their σ values (see also Bernardi et al. 2006c). The 7 galaxies with measured M• at M 22.5 have average V ∼ − σ = 311 25 km s−1, while equation (5) predicts only 250 km s−1 at M 22.5 in agreement ± ∼ V ∼− with the average σ at this luminosity for the SDSS sample (Bernardi et al. 2003). If σ is the best 4 predictor of M•, then the black holes in these galaxies should be on average (314/250) 2.4 ≈ × more massive than is typical for galaxies with M 22.5. The M• L relationship in turn would V ∼− − be biased at the high luminosity end, and the large black hole masses predicted from L shown in Figure 2 will be over-estimates. Conversely, if L is the better predictor of M•, then then the M• σ − relationship would be biased to predict lower M• than would be correct.

The possibility that the galaxies with measured M• are a biased sampling of the L σ re- 8 − lationship is echoed in Figure 2. For M• > 10 M⊙, M•(L) is on average greater that M•(σ) for galaxies in the present sample. Lowering M•(L) by the bias factor inferred above, or increasing – 10 –

M•(σ) by a similar factor would bring the average predictions into excellent agreement, however. Note the galaxies with measured M• in Figure 2, are presently in excellent agreement, since these are the very systems used to define the M• σ and M• L relationships. − − Figure 2 also shows, however, that the large M•(L) predicted for the most luminous galaxies still deviate from M•(σ) by a much larger factor than this putative bias. The strong curvature in L σ relationship leads to the upward curvature in M•(L) versus M•(σ) well in excess of the − selection biases implied by Figure 3. Any luminosity-based predictor of M• calibrated for M > 22 V − would still predict M• in excess of the M• σ relationship for M < 22, since σ for the brightest − V − galaxies does not increase with luminosity.

4. Core Structure as an Independent Witness of M•

4.1. The Cusp Radius

Resolving whether L or σ is the best predictor for M• for galaxies with M < 23 will only be V − possible when real M• determinations can be made in this luminosity regime. Lacking this, we can attempt to obtain preliminary information by considering whether the central structure of galaxies may provide an independent witness to M•. We characterize the physical scale of the core by the “cusp radius,” rγ, which is the radius at which the negative logarithmic-slope of a galaxy’s surface brightness profile reaches a pre-specified value γ′. This measure of core size was first proposed by Carollo et al. (1997); we will discuss it in detail in Appendix C. The core is also characterized by the cusp brightness, Iγ, the local surface brightness at rγ (µγ is Iγ expressed in magnitude units). In terms of the Nuker-law parameters, for γ γ′ β, ≤ ≤ γ′ γ 1/α r r − ; (7) γ ≡ b β γ′  − Iγ is then found directly from the fitted Nuker-law,

γ α (γ−β)/α (β−γ)/α rb rγ Iγ = 2 Ib 1+ . (8) rγ    rb  

′ Carollo et al. (1997) advocated use of rγ with γ = 1/2 as a core scale-parameter. We show ′ in Appendix C that using rγ with γ = 1/2, indeed gives tighter correlations with other galaxy parameters than the choice of rb as a scale parameter. While the Nuker-law rb is still used to calculate rγ, we no longer use it directly as a measure of core size, in contrast to the analysis presented in Faber et al. (1997). Lastly, we emphasize that since rγ is generally well interior to rb it is not meant to describe the actual complete extent of the core; it is just a convenient representative scale. – 11 –

4.2. Core Structure and Galaxy Parameters

It has long been known that the physical scale of cores in early-type galaxies is correlated with galaxy luminosity (Lauer 1985; Kormendy 1985). This relationship may be due to the action of central black holes on the central distribution of stars (e.g., Faber et al. 1997). Figures 4 and 5 show the relationships between σ, L, and cusp radius, r , for the present sample. The r σ γ γ − relationship is particularly steep, as rγ varies by over two orders of magnitude, while σ changes by only a factor of two. For core galaxies with M < 21, a symmetrical least-squares fit gives V − log(r /pc) = (8.4 1.6) log(σ/250 km s−1) + 1.99 0.09, (9) γ ± ± while the r L relationship is γ − log(r /pc) = (1.32 0.11)( M 23)/2.5 + 2.28 0.04. (10) γ ± − V − ±

Of the two relationships, L is the better predictor of rγ , with only 0.31 rms scatter in log rγ, while the scatter of log rγ versus σ is 0.63. Note that BCGs and non-BCG core galaxies appear to follow the same relationships between rγ and σ or L. For the non-BCG core galaxies,

log(r /pc) = (7.4 1.2) log(σ/250 km s−1) + 2.00 0.07, (11) γ ± ± and log(r /pc) = (1.94 0.29)( M 22)/2.5 + 1.77 0.06, (12) γ ± − V − ± while for the BCGs,

log(r /pc) = (15.2 7.5) log(σ/300 km s−1) + 2.80 0.31, (13) γ ± ± and log(r /pc) = (1.24 0.17)( M 23)/2.5 + 2.28 0.05. (14) γ ± − V − ± While the slopes of the relationships are different from those for the entire sample of core galaxies, there is no significant difference between the relationships within the parameter ranges in which BCGs and non BCGs overlap. A critical result that is evident in Figure 5 is that while BCGs have larger cores than less luminous core galaxies, they are completely consistent with the larger total luminosity of BCGs. In contrast, there is essentially no correlation between rγ and σ for rγ > 300 pc, as is evident in Figure 4; luminosity is a much better predictor of core size in BCGs than σ.

The core is characterized by a surface brightness as well as a physical scale, thus one could also explore the relationships between Iγ and σ or L, but as we show in Figure 6, Iγ and rγ are so closely related that they can be regarded as interchangeable. The fitted relationship between the two parameters for core galaxies with M < 21 is V − µ = 2.5 (1.05 0.07) log(r /100 pc) + 16.23 0.10, (15) γ ± γ ± where µ is I in units of V band magnitudes per square-arcsecond. γ γ − – 12 –

Lastly, Iγ and rγ can be combined to estimate the stellar mass of the core interior to the cusp 2 as Mγ = πIγ rγ(M/LV ), again using the conversion between mass and light given in the context of equation (4). Symmetrical fits give the relationships between Mγ and L or σ as;

log(M /M⊙) = (1.35 0.13)( M 22)/2.5 + 9.17 0.05, (16) γ ± − V − ± and −1 log(M /M⊙) = (8.6 1.0) log(σ/200km s ) + 8.55 0.13. (17) γ ± ±

4.3. Core Scouring and Black Hole Mass

4.3.1. Black Hole Mass and rγ

The existence of the r σ and r L relationships implies empirical relationships between γ − γ − rγ and M•, given the separate M• σ and M• L relationships. By combining equation (1) with 2.1±0.4− − equation (9) we find r M• , or more precisely, γ ∼ 9 M•(σ)= log(r /pc) = (2.1 0.4) log(M•/10 M⊙) + 2.7 0.2. (18) ⇒ γ ± ± 1.0±0.1 At the same time, we can also combine equations (10) and (4) to find r M• , or γ ∼ 9 M•(L)= log(r /pc) = (0.96 0.09) log(M•/10 M⊙) + 1.9 0.1. (19) ⇒ γ ± ± Equations (18) and (19) are inconsistent. The conflict between M•(L) and M•(σ) leads in turn to contradictory predictions for how the physical scale of cores is related to black hole mass.

Comparison of the observed r M• relationship to the two inferred relationships presented γ − above may offer a path to determining which of M•(σ) or M•(L) is more accurate for the most massive galaxies. In the “core-scouring” scenario, cores are created by the orbital decay of a massive binary black hole, which would be formed during the merging of two galaxies. As the merger progresses, black holes in the nuclei of the progenitor galaxies are brought to the center of the merged system by dynamical friction. While the center of the merger may initially be highly concentrated (Milosavljevi´c& Merritt 2001), as is the case for power-law galaxies, central stars interacting with the binary black hole are ejected from the center as the binary hardens. The ejection of stars erodes the steep central stellar density profile, creating a shallow cusp, or break from the steeper profile that still persists at larger radii. A core is the region of the galaxy interior to the break (cf. Lauer et al. 1995).

Under this hypothesis, the relationship between core scale and M• ought to be more funda- mental than either of the r σ or r L relationships alone. The action of the black hole mass γ − γ − on stellar orbits at the galaxy center creates the core structure directly, and the r σ and r L γ − γ − relationships are then merely consequences of the separate M• σ and M• L relationships. Ac- − − cording to this logic, we would conclude that the larger cores of BCGs are evidence of higher BH masses. – 13 –

A major caveat standing in the way of this conclusion is that core scouring may not lead directly to a clean relationship between rγ and M•. The binary BH ejects a total mass of stars, Mej, that is expected to be proportional to the total merged M• (Quinlan 1996; Milosavljevi´c& Merritt 2001; Merritt 2006). However, the resultant rγ would depend on the radii over which stars are ejected from the center. Further, Merritt (2006) presents simulations that show that core formation should be a cumulative process. Cores formed in one merger event will be depleted even further in subsequent mergers, presumably leading to even larger increases in rγ that would reflect not only the total BH mass, but the integrated merger history as well. Under this hypothesis, cores resulting from successive dry mergers would be abnormally large compared to their BH masses, potentially explaining the extra-large cores of BCGs, which are thought to be formed by such multiple mergers. Under scrutiny, however, this explanation seems difficult to support, since the cores of BCGs show no excess compared to the luminosities of their host galaxies, and it is this latter quantity that is probably the best indicator of the amount of dry merging that any massive elliptical has experienced. In other words, the core masses of BCGs galaxies are the same fixed fraction of their total light as in other galaxies, not some amplified value driven by multiple mergers. Thus, we seem to be driven back to the basic explanation that the larger cores of BCGs are due simply to larger BH masses.

Can we use actual core data to identify the correct r M• relationship? Figure 7 tries this γ − by plotting rγ versus M• for the 11 core galaxies for which there are direct determinations of M•. A symmetric fit to rγ and M• for these galaxies has the form

9 log(r /pc) = (0.83 0.25) log(M•/10 M⊙) + 2.20 0.10. (20) γ ± ±

This equation is essentially consistent with equation (19), the relationship inferred from M•(L), rather than equation (18), which inferred from M•(σ). At the same time, the scatter in Figure 7 is large, thus this result is sensitive to how the r M• relationship is fitted. For example, if r is γ − γ treated as the independent variable in an attempt to predict M•, given rγ, then

9 log(r /pc) = (1.5 0.8) log(M•/10 M⊙) + 2.20 0.11 (21) γ ± ±

(although we express rγ as the dependent variable for comparison with the relationships above). The slope of this relationship is intermediate between that in equations (18) and (19). For completeness, if M• is treated as the independent variable, which corresponds to the scouring hypothesis that M• determines rγ , then

9 log(r /pc) = (0.59 0.18) log(M•/10 M⊙) + 2.19 0.10 (22) γ ± ±

These three fits do not in fact suffice to identify the “correct” r M• relation for four reasons: γ − 1) the various slopes differ considerably because the native scatter in the data is large; 2) we are seeking the “true” underlying relationship (i.e., the “theorist’s” question of Novak et al. 2006), but without a knowledge of cosmic scatter and its separate contribution to both M• and rγ, we cannot fit the data properly to find it; 3) the slopes in equations (18) and (19) were likewise meant to – 14 – embody “true” relations, but they were derived from prior fits that themselves suffered a similar ambiguity; and 4) the sample of core galaxies with measured M• is potentially biased in some way that is not understood (cf. Figures 3 to 6), and any new fit based on these galaxies might therefore not be representative. On this last point, we emphasize caution. While the galaxies with measured M• may on average have offsets in the parameter plots shown, this does not mean a priori that the directly observed r M• relationship is biased. The small number of core galaxies with measured γ − M• plus the number of parameters in play means that understanding any biases must await a richer sample.

Likewise, the sample of core galaxies with measured M• will have to be increased considerably before it can be used to convincingly discriminate between the M•(L) and M•(σ) relations. Nev- ertheless, we may be able to obtain some guidance in advance of such observations by comparing M• estimated from rγ to values estimated from L or σ. Figure 8 shows the results of using either equation (20) or (21) to predict M• from rγ , in analogy to Figure 2, which compared predictions of M• based on σ versus L. Both versions of the r M• relationship predict larger M• than would be γ − inferred from M•(σ). The symmetrically-fitted rγ M• in equation (20) appears to be consistent 10 − with M•(L), also predicting M• 10 M⊙ for the most massive galaxies. ∼ Presently, the large scatter in the observed r M• relationship and the attendant uncertainties γ − in any empirical relationship derived from it does not decisively favor M•(L) over M•(σ). Equations (20) and (21) however, on average predict greater M• than would be inferred from M•(σ), while equation (20) is consistent with the larger black hole masses implied by L for the most massive galaxies. At this early stage the r M• relationship thus may favor consistency with the M•(L) γ − relationship. The fact that the scatter of rγ on L is smaller than that on σ (as would be expected if rγ is produced directly by black hole scouring and M• correlates more closely with L) as well as the fact that M•(L) plausibly explains the large core of BCGs as being due to more massive black holes, whereas M•(σ) seems to provide no ready explanation this, may offer additional support that M•(L) is more appropriate for the most massive galaxies.

4.3.2. Black Hole Mass and Core Mass

An alternative approach to explore the relationship between core structure and black hole 2 mass is to compare the core mass, Mγ = πIγ rγ(M/LV ), rather than rγ, to M•. Although, as we noted earlier, Iγ and rγ are closely related, so the relationship between Mγ and M• will contain information similar to the r M• relationship, core mass should be a more direct indicator of the γ − amount of core scouring and its relationship to black hole mass. If cores are created from power- law galaxies by core scouring following a dry merger, one might expect that the core mass would be approximately proportional to the black-hole mass. This conjecture is supported by N-body calculations by Merritt (2006), who argues that the core mass produced by scouring in a single merger is fM•, where M• is the mass of the merged black hole and f 0.5, largely independent ≃ ≃ of the mass ratio of the merging black holes; he also argues that the total core mass after N dry – 15 – mergers should be given by f 0.5N. Direct estimation of the mass ejected from the core by ≃ scouring is much more difficult observationally than theoretically, because we do not know the state of the galaxy before the merger. Thus we will simply use Mγ as an “indicative” core mass, recognizing that the factor f relating indicative core mass to black-hole mass is very uncertain, but should be approximately independent of galaxy luminosity for core galaxies.

Figure 9 shows the relationships between M and M• as derived from the M• σ and M• L γ − − relationships. By combining M•(σ) (equation 1) with the M σ relationship (equation 17), we γ − find 9 M•(σ)= log(M /M⊙) = (2.2 0.3) log(M•/10 M⊙) + 10.29 0.18, (23) ⇒ γ ± ± while the combinations of M•(L) (equation 4) with the M L relationship (equation 16) gives γ − 9 M•(L)= log(M /M⊙) = (1.0 0.1) log(M•/10 M⊙) + 9.39 0.11. (24) ⇒ γ ± ±

The relation between indicative core mass and black-hole mass predicted by the M• L relation − is essentially linear, as expected, while the relation predicted by the M• σ relation is twice as − 9 steep. Moreover the ratio of indicative core mass to black-hole mass is 2.4 at M• = 10 M⊙ ∼ from the M• L relation, not far from the value of order unity that we might expect, while the − corresponding value from the M• σ relation is 19. It is difficult to devise dynamical models in − ∼ which core scouring could be efficient enough to create cores with mass so much bigger than the black-hole mass.

5. The Growth of the Most Massive Galaxies and Why M•(L) Might be Favored Over M•(σ)

Having found suggestive but not conclusive arguments to prefer one relation over the other, we turn now to physical arguments for additional guidance. We stress again that the tension between M•(L) and M•(σ) arises due to the breakdown, or curvature, in the L σ relationship at high − galaxy masses. It is appropriate to inquire at least briefly into physical reasons why this might happen, and whether this offers insight into which of M•(L) and M•(σ) might be preferred for the most massive galaxies.

Curvature in a gravitational scaling relation may signal a breakdown in perfect homology in galaxy formation, which in turn may reflect a change in the relative importance of different physical processes as a function of galaxy size. One such effect, suggested some time ago, is an increase in the importance of dissipationless (i.e., “dry”) merging in forming the most massive ellipticals (Bender et al. 1992; Faber et al. 1997; Naab et al. 2006). A second effect, following logically from the first, is a change in the nature of dissipationless mergers with galaxy mass. As hierarchical clustering proceeds, clusters of galaxies become more massive, and previously formed elliptical galaxies at the centers of these clusters merge. Each new round of merging thus increases the galaxy stellar mass along with the cluster dark halo mass in which it is embedded. The largest – 16 – ellipticals are thus produced by dry mergers at the centers of the largest clusters. These are BCGs. The above scenario is supported by the steep environmental dependence among bright ellipticals, the brightest ones being found in the densest environments (Hogg et al. 2004).

There are at least two trends that might contribute to a breakdown in perfect homology for dry merging to produce the observed curvature in the L σ relationship. The first is a change in the − typical orbital eccentricity of central merging pairs as cluster mass grows. Boylan-Kolchin et al. (2006) have suggested that head-on collisions may become more frequent when massive clusters merge, and their N-body simulations indicate less loss of energy from stars to dark matter in such collisions. The resultant stellar remnants are puffed up in radius and have significantly lower stellar velocity dispersions compared to encounters with normal orbital geometry. A second effect, not considered by them, is the fact that the ratio of cluster velocity dispersion to internal galaxy velocity dispersion also rises as clustering proceeds. This appears to happen because gas cooling is reduced in large dark-matter halos (e.g., Birnboim & Dekel 2003), which means that the baryonic masses of central galaxies grow more slowly than their dark-matter halos. This is why the stellar velocity dispersions within BCG galaxies in large clusters are so much lower than those of their surrounding clusters, whereas the same is not true of ellipticals in small groups (cf. Figure 2 of Blumenthal et al. 1984). The net result is that central merging pairs will approach each other at relatively higher speeds, with the potential of injecting more orbital kinetic energy into the final stellar remnant. This would also cause the remnant to puff up and have smaller final velocity dispersion.

The relative importance of these two effects can only be decided using realistic two-component N-body simulations containing both stars and dark matter that are appropriately embedded in a cosmological clustering scenario. It seems probable that both effects will be found to play a role. The point for now is that there are at least two reasons to expect non-homology in dissipationless mergers, and thus two reasons for curvature in the L σ relationship. − If this logic is correct, it points towards M•(L) as being the proper scaling law for massive galaxies. That is because the major growth in black hole mass during dissipationless merging occurs by merging black holes as the galaxies themselves merge. With little mass accretion directly onto black holes during this stage and no attendant formation, black hole mass should increase in proportion to galaxy mass. The ratio of stellar mass to black hole mass is constant over dry merging, consistent with M• scaling linearly with galaxy mass. Conversely, for M•(σ) to be maintained over dry merging, given the plateau in σ at high galaxy luminosity, would require one of the two merging black holes to be ejected from the galaxy as a common occurrence. These arguments provide yet a another reason to prefer the M•(L) relation over M•(σ).

Regardless of which mechanism is dominant for determining the velocity dispersion of the merged galaxy, “puffing-up” of the remnant does appear to happen in the most luminous galaxies, supporting an overall scenario in which velocity dispersion in the largest galaxies does not increase strongly via mergers. The evidence for this comes from the effective radii of the largest galaxies (see also Bernardi et al. 2006b). Figure 10 shows the R L relationship for the whole sample, where R e − e – 17 – is the effective radius measured from r1/4-law fits to those galaxies that have ground-based surface photometry extending to large radii (Lauer et al. 2007a). For low galaxy luminosity, the the mean R L relationship is relatively shallow. We find e − log(R /pc) = ( 0.50 0.08)(M + 21)/2.5 + 3.62 0.04 (25) e − ± V ± based on fits to just power-law galaxies. This stands in contrast to the steeper relationship defined by core galaxies with M < 21, V − log(R /pc) = ( 1.18 0.06)(M + 23)/2.5 + 4.27 0.02 (26) e − ± V ± The transition between the two forms occurs at M 22, which corresponds to the luminosity V ≈ − at which the average central structure changes from power-law to core (Lauer et al. 2007a). This is also the scale at which σ starts to plateau in Figure 3 — the leveling-off of the L σ relationship − is thus associated with a rapid increase in Re with L not seen in less luminous galaxies. This is as predicted if extra energy is injected into these galaxies by merging: the objects will have lower σ but larger Re.

It should be stressed that the arguments presented in support of the M•(L) relation in this section apply only to bright ellipticals, which are those produced by dissipationless merging. The mass-accretion processes that built black holes when galaxies were younger were drastically different and might have obeyed different scaling laws. The M•(σ) law might be a better fit to such galaxies, which in general will be smaller than the objects considered in this paper. The broader point of this discussion, however, is that “non-homology” processes may have affected the growth of galaxies generally, with the result that a single black hole scaling law with global galaxy properties might not fit all galaxies.

6. The Space Density of the Most Massive Black Holes

The preceding sections have presented suggestive if not conclusive reasons to suspect that the M• L relation might be a better predictor of black hole mass than the M• σ relation for the − − most massive galaxies.

1. The velocity dispersions of the most massive elliptical galaxies rises slowly if at all with galaxy luminosity implying that their black holes are no larger than those of much smaller ellipticals if M• σ is the governing relation — this seems rather surprising. −

2. The core-scouring model says that rγ should correlate directly with M• whereas correlations between rγ and L or σ should be secondary; if so, the smaller scatter of rγ on L validates L as the more accurate predictor of M•.

3. The M• L relation offers a simple explanation for the large cores of BCGs in terms of bigger − black holes whereas the M• σ relation offers no such ready explanation. − – 18 –

4. Either of the two fits of M• on rγ for core galaxies with measured BHs (equations 20 and 21) predicts large M• when extrapolated to luminous galaxies, in better agreement with M• L − than with M• σ. − 5. The largest elliptical galaxies are believed to be formed by dry merging, which predicts that black hole mass should grow in proportion to stellar mass; the observed M• L relation is ∝ thus the simplest relation predicted on these grounds. By contrast, the saturation of black holes mass in the largest galaxies that is predicted by the M• σ relation requires that one − of the two merging BHs be ejected from the galaxy as a common occurrence, which may not by natural.

6. The largest elliptical galaxies are likely formed by dry mergers at the centers of massive clusters; non-homology merger arguments plausibly explain the low velocity dispersions (and large radii) of such galaxies, but the same arguments then imply that σ ought not to be a fundamental parameter for predicting black hole mass in the biggest galaxies.

10 7. As noted in the introduction, there is evidence from AGN physics that M• 10 M⊙ in some ∼ systems.

Although these lines of argument are not conclusive, they motivate us to consider the impli- cations for the local BH mass function should the M• L relation prove correct. The differences − from the M• σ relation are large, as we shall show. In this section we first compute these two − mass functions based on M• σ and M• L, and then we compare the resulting mass functions − − to estimates of the relic black hole mass function based on the space density of QSOs as a function of luminosity and .

6.1. The Black Hole Cumulative Mass Distribution Functions

We first compute the black hole mass function by combining the M•(σ) predictor with the velocity-dispersion function (the space density of galaxies as a function of velocity dispersion), We then repeat the calculation, but then using M•(L) combined with the galaxy luminosity function. Our analysis follows the precepts of Yu & Tremaine (2002), departing in the choice of dispersion functions. Both calculations use the same formalism, thus for the sake of generality we denote the log of either the velocity dispersion, σ, or luminosity of the galaxy, L, by s and assume that the correlations of BH mass M• with either parameter can be formalized through the statement that the probability of finding a galaxy a given BH mass is 2 2 −1/2 [log(M•) F (s)] dP (M•) = (2π∆ ) exp − d log(M•), (27) − 2∆2  where F (s) is the ridge line of either log(M•) s relation. The number of BHs near a given mass − is then ∞ 2 dn(M•) 2 −1/2 dn [log(M•) F (s)] = (2π∆ ) exp −2 ds, (28) d log(M•) Z−∞ ds − 2∆  – 19 – and the cumulative distribution is ∞ dn(M•) n(M•)= d log(M•). (29) ZM• d log(M•)

For the dispersion-based predictor, we start with the Sheth et al. (2003) SDSS-based velocity- dispersion function. Bernardi et al. (2006a) reprocessed the SDSS data and recovered a number of galaxies with larger dispersions than those used in Sheth et al. (2003). We use that set of high dispersion galaxies to compute a cumulative dispersion function above σ = 350km s−1 by

n(>σ)= N(>σ)/V, (30) where N(> σ) is the number of galaxies with dispersions greater than σ and V is the Sloan survey volume given by Bernardi et al. (2006a) as 3.34 10−7Mpc (for H =70km s−1 Mpc−1 and × 0 z < 0.3). This cumulative function is well approximated by a power law. Differentiating it gives an estimate of the dispersion function above σ = 350km s−1 of

−10.27 dn(M•) −6 σ −3 = 6.67 10 −1 Mpc (31) d log(σ) × 350km s  Equation 31 predicts about 10 times as many galaxies with dispersions greater than 400km s−1 as the Schechter function fit given in Sheth et al. (2003). Above σ = 350km s−1 we add it to the Sheth et al. (2003) dispersion function in the analysis of equation (28).

We think this is the best available estimate of dn/dlog(σ) for early-type galaxies at zero redshift. We combine the Sheth/Bernardi dispersion function with the Tremaine et al. (2002) −1 M•(σ) predictor (equation 1) F (s) = 8.19 + 4.02 x where x = s log(200 km s ), in equations × − 28 and 29. As an alternative, we consider the σ – BH mass function using the Wyithe (2006) predictor F (s) = 8.11 + 4.2 x + 1.6 x2. We illustrate both cumulative BH mass functions × × so derived in Figure 11. Choosing the Wyithe predictor only increases the predicted BH number 9 density modestly near M• = 3 10 M⊙. × Inclusion of the cosmic scatter in the M• relationships is crucial (Yu & Tremaine 2002; Yu & Lu 2004; Tundo et al. 2006). The total population of galaxies at any given L or σ will host black holes with a range in M•. The final BH mass functions thus are not a simple “relabeling” of the L or σ distributions with black hole mass, but are rather a convolution of the of these distributions with an assumed distribution of M• at constant L or σ. This convolution is especially critical at a large BH mass, where both the galaxy luminosity and dispersion functions decline rapidly. As a result of cosmic scatter, most of the high mass BHs actually come from “modest” galaxies with unusually large BHs for their luminosities or dispersions, as compared to the expected contribution of massive black holes from the most massive galaxies (Lauer et al. 2007b).

We emphasize that ∆ above is the scatter about the mean relation due to cosmic scatter in the relation and not due to measurement error. Tremaine et al. (2002) notes that the ∆=0.30 scatter about their derivation of M•(σ) might be entirely due to measurement error, leaving no – 20 – room for cosmic scatter. In plots of BH mass functions in Figure 11 we illustrate illustrate 3 values of the cosmic scatter for the Tremaine and Wyithe results: ∆ = 0, 0.15 and 0.30. This scatter is probably not larger than 0.30, but may be considerably smaller.

We compute the the L-based black hole mass function by the same approach, starting with the Blanton et al. (2003) SDSS g′ galaxy luminosity function. We convert the g′ galaxy luminosity function to V band using g′ = V + 0.41 — suitable for E galaxies at z = 0 (Fukugita et al. 1995). − Comparison of the Blanton et al. galaxy luminosity function with the Postman & Lauer (1995) BCG survey suggests that the Blanton work undercounts BCGs. We argue in Appendix A that this may be due to the effects of excessive sky subtraction on the most luminous galaxies. To correct for this, we have added an estimate of the space density of BCGs as a function of V-band luminosity to the Blanton et al. sample. We used the Postman & Lauer (1995) BCG sample, which is volume-limited, to construct an estimate of the luminosity function in RC , transforming it to V using R = V 0.55. Using the combined dn/dM , we determine the number of BHs above a C − V specified mass M• from equations 29 and 28, where we set s = MV and F (s) is the right-hand side of equation (4) — the H¨aring & Rix (2004) predictor. Because Blanton et al. (2003) represented their luminosity function as that observed at z = 0.1, rather than the present epoch, both k-corrections and corrections for evolutionary fading are required. These two terms fortuitously cancel each other: Blanton et al. (2003) show that their sample dims by 0.2 mag in g′ from z = 0.1 to the present, while the filter k-correction to transform to z =0 is 0.20 mag. Lastly, we use ∆ = 0.25 − and 0.50, larger values than were used for M•(σ), given the larger scatter in M•(L).

The cumulative BH mass functions based on the two different methods are shown in Figure 8 11. Near BH masses of 10 M⊙ (luminosities near L∗) and lower, the L-based function overpredicts BH numbers by a factor of two and larger. In part, this is due to the fact that the L-based mass function includes galaxies that are disk-dominated, thereby overestimating the numbers and masses of BH at lower masses, since the L M• relationship is based on bulges and elliptical galaxies. At − higher masses this correction is negligible since most galaxies are ellipticals or bulge dominated S0s. There is also the possibility that the L M• relationship is biased in the sense that the galaxies − with black hole mass determinations have larger velocity dispersions than the average values for galaxies of their luminosities (see Figure 1). If so, this would cause the L-based mass function in Figure 11 to shift to the left, in better agreement with the σ-based mass function.

9 For M• > 10 M⊙ the L and σ mass functions diverge. The L based mass function predicts a local density of the most massive black holes that is about an order of magnitude greater than 9 would be inferred from the M• σ relationship for M• > 2 10 M⊙. This disagreement was − × foretold in Figure 2 — the present Figure 11 simply recasts the disagreement between M• L and − M• σ in terms of the BH mass function. − – 21 –

6.2. The Black Hole Distribution Function Inferred From QSOs

There are two different approaches that can be used to infer the present BH mass function from quasar counts (specifically from the joint distribution of quasar numbers as a function of redshift and luminosity). One line, started by So ltan (1982), relies on energy conservation. Under that argument, the energy emitted by the quasars at any redshift propagates through the universe declining in co- moving density due to the redshift of the photons as (1 + z)−1, and thus behaves exactly like a background. Hence the observed quasar flux translates directly to the total energy emitted given a known redshift distribution of emitters, and further translates into the total mass accreted by black holes, given their radiative efficiency. An alternate approach, started by Small & Blandford (1992), is to assume that all quasars go through a phase in which they accrete at the Eddington limit, followed by a period of slower or intermittent accretion according to a universal model dependent on BH mass and time. This assumption, together with a continuity argument (the number of BHs at a given mass changes only by accretion and merging) permits the recovery, not merely of the local BH density, but also of the local BH mass function. This second approach achieves a more detailed result than the So ltan argument, but at the expense of additional assumptions.

A third more limited approach, which shares some logic with Small-Blanford, is to assume simply that the BHs of known mass accrete near the Eddington limit for some period and to ignore their fainter growth period — the so called “lightbulb model.” In this model quasars are either on or off. Because the number of luminous quasars in the universe varies strongly with time, the model doesn’t count BHs directly, rather it counts those that are accreting. In what follows, we evaluate the lightbulb model at z = 2.5 where the top end of the quasar LF is largest. We assume 9 that no BHs above 10 M⊙ are destroyed by mergers since that time, so the fall-off in the LF is due to a halt in accretion. At earlier times the BH mass function may be evolving. The full-width half maximum of the bright quasar LF is about 109 yrs. So our lightbulb model implicitly assumes that every massive BH accretes for 109f yrs, where f is defined as the duty fraction, and then shuts off. This idealization ignores low mass BHs and low-level accretion. Another significant limitation is that the model provides no procedure to identify the mass below which it fails, although that failure is implicit within the assumptions: some of the lower luminosity quasars must be high mass BHs accreting at less than their Eddington limit. A third limitation is that the model is fundamentally inconsistent: the assignment of mass corresponding to a quasar luminosity gives the instantaneous mass of an accreting BH, while the present day mass function depends on its final mass. If the quasar is “on” for less than the Salpeter time (for BH mass to e-fold in Eddington-limited accretion), then the problem is small, but if it is on for much longer than the Salpeter time it is catastrophic: the present BH mass may be much larger than that assigned to the quasar.

Nonetheless, the lightbulb model permits a comparison of the quasar LF with the number density of the most massive BHs in the local universe, with the duty fraction as a free parameter, under the assumption that the brightest objects in the quasar luminosity function are accreting near the Eddington limit. This approach was used by Richstone et al. (1998) to make a crude estimate of the duty fraction of BH accretion. We perform a similar analysis here. We start with the – 22 –

Richards et al. (2005) luminosity function at z = 2.5, where the bright quasar density is greatest. The Richards et al. (2005) fit reports number densities per magnitude at an AB absolute magnitude at rest-frame λ =1500A.˚ We integrate their fitting function from a given luminosity to infinity to get a cumulative number of objects brighter than that luminosity, we apply a bolometric correction of 5 (Marconi et al. 2004), and then we deduce a mass from the Eddington limit. This procedure identifies the number density of BHs greater than a given mass accreting at a given redshift. We compute this cumulative density at redshift z = 2.5 where the density of bright quasars is greatest. We divide this result by the duty fraction f. We adopt f = 0.03 based on the extensive work by Steidel et al. (2002) and Adelberger & Steidel (2005). This yields the line labeled “lightbulb” in Figure 11.

An improvement on both the lightbulb approach and the Small-Blandford approach is to use a physical model for the accretionary evolution of the BHs. One such model is the merger-induced accretion model that has been explored in detail by Springel et al. (2005) and Robertson et al. (2006). They simulate the merger of disk+bulge+BH galaxies containing gas using the GADGET code, treating the BH growth by computing the Eddington-limited Bondi accretion rate at their smallest resolution element. They compute the luminosity of the accreting BH from the accretion rate under reasonable assumptions about the radiative efficiency. Their simulations permit the development of a model (Hopkins et al. 2006) that predicts the X-ray background and the zero- redshift BH mass function from the quasar LF. We believe the Hopkins model is a profound advance over simpler analyses. While it might turn out that their model does not correspond in detail to the quasar phenomenon, the approach may have broader utility. We summarize the salient points of their model below.

The Hopkins et al. (2006) simulations exhibit very complex behavior of luminosity as a function of time for a given , but the time spent above a given luminosity turns out to be a universal profile over a wide range of galaxy or merger parameters, provided it is scaled appropriately with the peak luminosity and relic BH mass of the merger. For their simulations, the lifetime near a given bolometric luminosity L can be parametrized as

dt/d log L = t∗ exp( L/L∗ ), (32) Q − Q ∗ ∗ where the timescale tQ (a crude quasar lifetime) and luminosity scale LQ are functions of the peak luminosity Lp as follows. ∗ LQ = 0.2Lp, (33) and −0.11 ∗ Lp tQ = 1.37 Gyr. (34)  1010L⊙  The final or relic BH mass has a one-to-one correspondence with the peak luminosity given by

−0.11 Lp M• = 1.24 MEdd(Lp), (35) 1013L⊙  × – 23 –

where MEdd(Lp) is the mass with an Eddington luminosity of Lp. An ensemble of objects with the same Lp should have a luminosity distribution proportional to the dt in equation 32. Hopkins et al. (2006) use the model of quasar lifetimes described above together with a log- normal distribution of quasar birth rate per unit time to match the quasar luminosity function. We use their parameterization

2 n˙ ∗ [log(Lp/L∗)] n˙ = exp 2 , (36) σ∗√2π − 2σ∗  wheren ˙ is the number of quasars born per unit comoving volume per unit time. Hopkins et al. (2006) find a good fit to the X-ray and optical quasar luminosity functions with

L∗(z)= L∗(0) exp(kτ), (37) where τ is the dimensionless lookback time τ = H0 dt and the other parameters are presumed constant. In what follows we use their best fit modelR with (log L∗, k, logn ˙ ∗, σ∗) = (9.94, 5.61, −3 −1 -6.29, 0.91), with L∗ in solar units andn ˙ ∗ in comoving Mpc Myr .

We can compute the present day density of quasar relics by integrating the quasar birthrate over time at any specific mass or Lp. Therefore the cumulative density of BHs above a given mass M• is −∞ zmax dt n(M•)= n˙ dz d log Lp. (38) ZLp(M•) Z0 dz 

Following Hopkins et al. (2006) we set zmax = 3. We plot the result of equation (38) in Figure 11. An important feature of the Hopkins model is that owing to the exponential distribution of time above a given luminosity (equation 32) the quasar spends only a fraction of its lifetime 9 accreting near the Eddington rate. For example, a 10 M⊙ relic BH had a peak luminosity, Lp of 13 ∗ 3.03 10 L⊙ and spent the time te = tQ log e E1(5/e) = 15Myr above a factor of 1/e of Lp, × ∞ × where E (x)= exp( u)/u du is the usual exponential integral. The Hopkins model guarantees 1 x − that the BH willR accrete enough mass, but not too much, over its lifetime.

Figure 11 permits us to compare the lightbulb and Hopkins models with the two relic BH 9 mass functions. The BH mass functions diverge at about 10 M⊙. The dispersion-based predictors 9 predict considerably fewer BH at above 10 M⊙ than the luminosity-based predictors. They are not consistent with the lightbulb model; consistency with the Hopkins model is possible with the Wyithe (2006) form of M•(σ), but with the assumption of more cosmic scatter in the M• σ − relationship than is probably realistic. The luminosity-based mass function is barely consistent with the lightbulb model, but probably overpredicts the AGN density compared to the Hopkins model. We thus cannot make a clear determination between the dispersion-based and luminosity- based BH mass predictors by comparing zero redshift BH demographics to quasar demographics; however, the linear Tremaine et al. (2002) M•(σ) relationship is disfavored in all of the present models to explain the QSO population. – 24 –

An important caveat is that our calculations have neglected the effects of dry merging on the most massive galaxies after the epoch of QSOs. Merging might produce high mass black holes as a relatively recent phenomenon, thereby helping to reconcile the estimates made from M•(L) with the QSO population. Another caveat for both results is the possibility of super-Eddington accretion among the biggest BHs (Begelman 2006). If common, super-Eddington accretion would make it very hard to make any estimates of the mass function of relic BHs from quasar LFs.

7. Conclusions

The M• σ relationship has come to be the “gold standard” for predicting black hole masses − from galaxy properties due to its small scatter and its implications for illuminating the co-formation of galaxies and their nuclear black holes. At first sight, the M• L relationship might be dismissed − 4 4 as a simple consequence of the Faber-Jackson relationship. With M• σ and L σ , one would ∼ ∼ expect something like M• L. The larger scatter in the M• L relationship further suggests that ∼ − the M• σ relationship is really more fundamental. But as galaxy luminosity increases, σ levels − off and the basic Faber-Jackson relationship does not appear to hold. At BCG luminosities there are no direct measurements of M• and M•(σ) versus M•(L) present contradictory extrapolations. The contradiction essentially begs the question, do these exceptionally luminous galaxies have exceptionally massive black holes? The M• L relationship answers this in the affirmative, while − for the M• σ relationship to be correct we must accept the puzzling result that the black holes − in BCGs have relatively modest masses. But this question leads to a broader issue, namely. is is 10 there a significant population of black holes with M• approaching 10 M⊙ in the local universe?

The best way to answer these questions is to attempt to weigh the BHs in BCGs. With the advent of LASER-guided adaptive optics-fed spectrographs on 10m class telescopes, it is now possible to do this. This paper may therefore be premature. However, given the high attention to the M• σ relationship and its implications for galaxy formation, we believe that advancing − the implications of M• L relationship offers an important alternative view that should not be − overlooked. Lacking hard measures of M• in the most massive galaxies, we have marshalled a number of less-direct arguments that this hypothesis may be favored for the most luminous galaxies.

The first set of arguments is based on the hypothesis that cores in the most luminous galaxies are created in a “core scouring” process in which a binary BH created in the merger of two galaxies evacuates stars from the center of the newly-merged product. There presently is little observational support for the creation of binary black holes in mergers, but abundant theoretical work shows that realistic cores can be created by binary black holes, and the prevalence of nuclear black holes in galaxies overall strongly argues that such binaries must be created as a natural consequence of mergers. If so, the physical scale of cores, which we have parameterized as rγ may be an independent witness of M•, and thus use the large cores in BCGs as an indicator of their black hole masses.

Based on central structural parameters derived from HST observations, we find that the large – 25 –

cores in BCGs are commensurate with their high luminosities, while σ is a poor predictor of rγ for r > 300 pc. The scatter in the r L relationship is much smaller than that in the r σ γ γ − γ − relationship, again implying that L and core scale are more closely related. The observed r M• γ − relationship for 11 core galaxies with directly determined black hole masses has large scatter, but appears to be more consistent with the M• L rather than the M• σ relationship. Lastly. the − − core masses in BCGs are over an order of magnitude larger than the black hole masses estimated from M•(σ), but are only a few times larger than those estimated from M•(L); making such large cores with the smaller σ-based black hole masses would be a strong challenge for the core-scouring hypothesis of core formation.

The second set of arguments comes from considering theoretical arguments concerning whether or not L rather than σ should predict M• in BCGs. The favored origin of BCGs is that they are the remnants of dissipationless purely-stellar mergers of less-luminous elliptical galaxies, augmented by ongoing galactic cannibalism of elliptical galaxies in the rich environments at the center of galaxy clusters. The plateau in the L σ relationship plus but the steeper L R relationship at high − − e galaxy luminosity presented here strongly favor this formation scenario. The luminosity of a BCG is the sum of the luminosities of its progenitors. Similarly, setting aside the possible ejection of nuclear black holes in the final stages of a merger, the final nuclear black hole mass should be the sum of the progenitor black holes. Stated more directly, the ratio M•/L should be largely invariant over dissipationless mergers, leading to M• L at the high end of the galaxy LF. In contrast, σ ∼ appears to be nearly constant over these mergers. In effect, even if a relationship between M• and σ were set up in the initial stages of galaxy formation, it might not survive in a dissipationless merging hierarchy.

A final argument comes from attempting to infer the z = 0 space density of the remnant black holes associated with the most luminous QSOs seen at z 2. As noted in the Introduction, the ∼ properties of the broad-line regions in the most luminous QSOs argue that they are powered by 10 black holes with M• 10 M⊙. The heating of the intra-cluster medium in the richest galaxy ∼ clusters may also demand that some black holes in BCGs approach this mass. The critical part of this analysis is understanding how to correct the QSO space density for QSO luminosity evolution. The remnant black holes last forever, but the QSOs represent only those BHs made visible during an epoch of high mass-accretion, which presumably lasts only a small fraction of the age of the universe. We used the Hopkins et al. (2006) simulations to estimate the QSO lifetimes. The resulting shape and space density of the bright end of the QSO LF falls between the higher space density of the most massive black holes implied by M•(L) and those implied by M•(σ), while the simple “lightbulb” model of QSO duty cycles favors the M•(L) relation. This treatment is sensitive to the assumed amount of cosmic scatter in both M• relationships; however, it appears difficult for the log-linear M• σ relationship to explain the the observed space densities of the most luminous − QSOs without assuming that its cosmic scatter is larger than is likely to be the case.

This research was supported in part by several grants provided through STScI associated with – 26 –

GO programs 5512, 6099, 6587, 7388, 8591, 9106, and 9107. Our team meetings were generously hosted by the National Optical Astronomy Observatory, the Observatories of the Carnegie Institu- tion of Washington, the Aspen Center for Physics, the Leiden Observatory, and the University of California, Santa Cruz Center for Adaptive Optics. We thank Mariangela Bernardi, Megan Don- ahue, Tom Jarrett, Michael Strauss, and Mark Voit for useful discussions. We thank Qingjuan Yu for kindly reminding us to include cosmic scatter in calculation of the black hole mass functions. We also thank the referee for many excellent suggestions. – 27 –

A. The Luminosities of BCGs and Comparison to SDSS Magnitudes

Our analysis depends critically on the accuracy of the absolute luminosities of the brightest galaxies in the sample, such as BCGs. This is underscored by the bright-end disagreements of our L σ relationship and galaxy luminosity function with those based on Sloan Digital Sky Survey − (SDSS) data (Bernardi et al. 2003 and Blanton et al. 2003, respectively). We thus describe the derivation of our BCG luminosities, and compare them to magnitudes based on the SDSS for BCGs in common. We conclude that the SDSS BCG magnitudes are strongly affected by sky subtraction errors, causing them to be biassed to significantly fainter values.

The BCGs in the present sample come from the Laine et al. (2002) HST BCG study. This program, in turn, was based on the volume-limited Postman & Lauer (1995) BCG sample, which provides ground-based profiles and aperture photometry.5 As outlined in 2.1, we derive apparent § magnitudes of the BCGs by fitting the classic r1/4 form to the inner portions of the brightness profiles, where the inner limit of the fit was set to avoid seeing and the outer limit was specified to avoid portions of the profile that appeared to rise above the r1/4 fit. Graham et al. (1996) showed that the BCG profiles could be described by single-component S´ersic (1968) forms, but ones that often had index n> 4. The apparent magnitudes, which were derived by integrating the r1/4-law over radius, thus if anything are underestimates of the total BCG fluxes. An alternative to this procedure would be to integrate the S´ersic forms, however, as is shown in Graham et al. (1996), the S´ersic Re and n values are closely coupled, such that large n is matched with large Re, The implied total magnitude strongly diverges as n increases, and must essentially be regarded as unphysical extrapolations because the derived Re is typically well outside the actual radial domain of the surface brightness profile for large n; this is not true for n = 4. A contrasting treatment that occurs in much of the literature is based on the presumption that BCG must be completely well described by r1/4-laws (in contrast to other giant elliptical galaxies, which also have n > 4), and that S´ersic n> 4 is really the signature of an intracluster light component that must be subtracted. We conclude that an unambiguous procedure to derive total BCG luminosity does not presently exist. Our procedure of deriving magnitudes from just the inner portion of the profile that is well described by an r1/4-law again should give a sensible lower limit to the total luminosity.

The high luminosity end of the Blanton et al. (2003) luminosity function falls well below the BCG space densities measured by Postman & Lauer (1995). The Bernardi et al. (2003) L σ − relationship shows no plateau at its bright limit. These discrepancies would both be explained if the total magnitudes for BCGs in the SDSS database are significantly underestimated. We checked this hypothesis by examining the 25 Postman & Lauer (1995) BCGs present in the SDSS DR4 6 database. In detail, we compared the total RC in Laine et al. (2002) against

5Postman & Lauer (1995) did not actually publish their BCG surface brightness profiles, but they were presented graphically in the BCG profile analysis of Graham et al. (1996). 6Two additional BCGs were mis-identified in the SDSS database as stars. – 28 – the SDSS r “model magnitude,” (which in almost all cases is the most luminous total magnitude provided by the SDSS database) transformed by R = r 0.31. The results are shown in Figure C − 12 as a function of effective radius (based on our fits). A strong systematic trend is evident such that larger galaxies have greater offsets between the two total magnitudes. The median r 0.31 R value is 0.54 mag and rises to 1.57 mag for the NGC 6166 (the BCG in A2199). As − − C an additional check, we also compared the SDSS r magnitudes against the maximum RC aperture magnitude published by Postman & Lauer (1995). The maximum aperture radius was not defined in any rigorous way and does not correspond to any fixed fraction of the total galaxy flux, but the magnitude is a model-independent integration of all the flux within the published radius. The median difference between the SDSS model r magnitudes (transformed to RC ) and the maximum aperture magnitude is 0.24 mag and rises to values over a full magnitude for the largest galaxies. This demonstrates directly that the SDSS model magnitudes for the galaxies in question cannot be regarded as total magnitudes.

Conversations with a number of experienced users of the SDSS database for bright galaxies warned us that the SDSS pipeline measured sky levels on angular scales too small to accommodate bright nearby BCGs of the sort observed by Postman & Lauer (1995), and indeed the results shown in Figure 12 strongly suggest that a sky-subtraction error affects the SDSS magnitudes. As a check, we plot the SDSS r surface brightness profiles against the Postman & Lauer (1995) profiles for three of the BCGs with the largest magnitude differences in Figure 13. The SDSS profiles agree well at small radii but all fall below the Postman & Lauer (1995) profiles at large radii, consistent with large sky subtraction errors.

The large SDSS sky-subtraction errors for bright galaxies may have important implications for the Bernardi et al. (2003) and Blanton et al. (2003) studies, but exactly how is not clear. Both SDSS studies are based on galaxy samples with higher mean than the Postman & Lauer (1995) sample. Their BCGs should be angularly smaller and thus be less vulnerable to sky sub- traction errors. Typical BCGs in the SDSS sample are listed in the Miller et al. (2005) sample of galaxy clusters identified from SDSS galaxy catalogues. Figure 14 shows a histogram of SDSS model r magnitudes (converted to MV ) for the BCGs identified by Miller et al. (2005) compared to a histogram of all Postman & Lauer (1995) BCGs with MV based on their total RC magnitudes. There is a clear offset between the two samples, with the Postman & Lauer (1995) BCGs appearing to be typically one magnitude brighter than the Miller et al. (2005) BCGs. However, a histogram of SDSS r magnitudes for the 25 Postman & Lauer (1995) BCGs in common with SDSS agrees well with the Miller et al. (2005) BCG histogram, yet these are the magnitudes shown to in be error. We conclude that the total magnitudes of nearby SDSS BCGs are wrong.

B. The Use of Catalogued 2MASS XSC Apparent Luminosities of BCGs

After the first version of this paper was posted on astro-ph, Batcheldor et al. (2006) presented a M• L relationship for BCGs based on apparent magnitudes extracted from the 2MASS Ex- − – 29 – tended Source Catalogue (XSC) (Jarrett et al. 2000, 2003). The implied NIR luminosity differential between BCGs and other giant elliptical galaxies is greatly reduced from that of the present work. As a result, the plateau in the L σ relationship presented here is greatly reduced in the NIR and − the conflict between M•(σ) and M•(L) is thus resolved. Batcheldor et al. (2006) further suggest that the relatively higher luminosities inferred from the optical photometry may imply that the envelopes of the BCGs are extremely blue.

We have not conducted a complete comparison of the present photometry with that pro- vided by the 2MASS XSC, but a spot check of a few systems makes it clear that the 2MASS imagery from which the catalogue magnitudes were derived is extremely shallow compared to that of Postman & Lauer (1995), which is the source of the R band optical photometry (transformed to V ) used in this paper. The most likely explanation for the difference between the present and Batcheldor et al. (2006) results is that the 2MASS images are simply not deep enough to obtain accurate total luminosities of the BCGs, at least as represented by the automatic reductions used to generate the XSC magnitudes.

Figure 15 shows the J band 2MASS image of NGC 2832, the BCG in A0779 compared to the central portion of the R band image obtained by Postman & Lauer (1995). The J sky level is 15.67 magnitudes per square arcsecond versus the R band sky level of 20.90. Accounting for the R J = 1.68 color in the center of the galaxy implies that the J band has a sky level effectively 26 − × brighter. Further, the 2MASS image is a 7.8s exposure obtained with a 1.3m telescope as compared to the 200s R image obtained with a 2.1m telescope (Postman & Lauer 1995). The J image is thus considerably shallower than the R image as is evident in Figure 15. The galaxy envelope in the J band image disappears into the noise at radii at which it is still clearly present in R. This problem is exacerbated in the H and K bands, which have yet brighter sky levels.

Despite the much brighter J sky level, the J and R band profiles of the A0779 BCG are consistent, as is evident in Figure 16, which compares the R profiles measured by Postman & Lauer (1995) to J profiles derived by us from the 2MASS archive images for the three BCGs shown earlier in Figure 13. The final J band isophote shown for A0779 falls fully 6.8 magnitudes below the sky ∼ level, but is still in agreement with the R band profile within the large error bars, which represent a 0.004 magnitude error in the 2MASS J sky level.

1/4 An r law fitted to the J band profile of A0779 yields mJ = 9.04, only 0.08 magnitudes fainter than m = 8.96, estimated by subtracting the R J = 1.68 color of the central isophotes from m J − R obtained from the Postman & Lauer (1995) photometry. These values are in poor agreement with the XSC isophotal (Jmk20fe ), and extrapolated (Jmext ) mJ values of 9.78 and 9.67, respectively, ′′ however. The isophotal radius, rk20fe, is 53. 3, well interior to the limits of the surface photometry shown in Figure 16. The extrapolated magnitude is based on a S´ersic fit to a surface brightness profile generated by the XSC pipeline. However, even for giant elliptical galaxies the XSC S´ersic index is limited to n < 1.5 (Jarrett et al. 2003; Jarrett, private communication). The XSC cal- culation of total magnitudes thus assumes that the galaxies essentially have exponential profiles, – 30 – rather than the r1/4 form standard for giant elliptical galaxies. The XSC pipeline gives n = 1.17 for A0779, for example, while Graham et al. (1996) find n> 10 based on the Postman & Lauer (1995) photometry. An exponential cutoff explains both the small difference between the XSC isophotal and total magnitudes, as well as the large deficit of both magnitudes compared to a total magnitude estimated from an r1/4 law.

A similar pattern is seen for the two other BCGs shown in Figure 16. For A2052, the BCG ′′ 1/4 has Jmk20fe = 10.92, corresponding to rk20fe = 38. 4, and Jmext = 10.60, while an r fit to the surface photometry recovered from the 2MASS archive image gives mJ = 9.58, a full magnitude brighter and only 0.21 magnitudes dimmer than the mJ = 9.36 inferred from the R band imagery with R J = 1.79. For A2199, the BCG (NGC 6166) has Jmk20fe = 10.51, corresponding to − ′′ rk20fe = 50. 0, and Jmext = 10.41, generated from a S´ersic fit with n = 1.18 (Jarrett, private communication); Graham et al. (1996) find n = 6.9. An r1/4 fit to the surface photometry recovered from the 2MASS archive image gives mJ = 9.66, 0.75 magnitudes brighter, but 0.41 magnitudes dimmer than m = 9.20 inferred from the R band imagery with R J = 1.82. J − To summarize, our fits to the 2MASS J images for the three BCGs shown give mJ values markedly brighter than the XSC apparent magnitudes, but that are much more consistent with the R photometry of Postman & Lauer (1995). As a check, we found that the aperture photometry in

Postman & Lauer (1995) interpolated to rk20fe was in excellent agreement with the Jmk20fe values, assuming the R J values derived by comparing the inner portions of the surface photometry − profiles; however, rk20fe is well interior to the limits of the optical photometry, thus the XSC isophotal magnitudes cannot be regarded as a total apparent magnitudes. The nearly exponential profile form used by the XSC pipeline explains why Jmext is only modestly brighter than Jmk20fe for the three galaxies shown, but is much dimmer than our fits, which assume S´ersic n = 4, the standard value for giant elliptical galaxies.

This analysis thus leads us to conclude that the extremely blue BCG envelopes and the rel- atively modest NIR BCG luminosities advanced by Batcheldor et al. (2006) are artifacts. We do note that the J surface photometry does fall below the R photometry at large radii in all three BCGs, but this occurs for J isophotes well within the noise and well below the even the errors in the sky levels. Any color gradients implied by the profiles presented in Figure 16 are not significant, and in any case are considerably smaller than would be implied by direct comparison of the XSC apparent magnitudes to the parameters given in this paper.

C. The Selection of rγ over rb as the Core Scale

The break-radius parameter in the Nuker-law has been used directly as the indicator of physical core scale in earlier work by our group. Faber et al. (1997) showed that rb is correlated with MV for core galaxies. The scatter about the r M relationship is large, however. With the present b − V much larger sample of galaxies, we were able to conduct extensive searches for other parameters – 31 – that might reduce the scatter, with the goal of better understanding how core structure varies with galaxy properties.

Several plots and parameter fits were conducted to search for correlations between residuals of the rb L and rb σ relationships with the Nuker profile shape parameters α, β, and γ. We − − ′ ′ also tried local values of γ, γ , measured at constant fraction of rb interior to rb (γ (frb) with f < 1). A correlation emerged between the Nuker-law outer profile slope β and the residuals in the r L relationship, as is shown in Figure 17. Larger than average cores correspond to galaxies b − with large β and vice versa. While one might be tempted to use this relationship to yield some sort of “β-corrected” break radii, the form of the correlation suggests a simpler, more useful approach.

In the Nuker law, rb marks the maximum curvature of the profile in logarithmic coordinates, but also the location where the profile slope reaches the average of γ and β. Since the range of γ for core galaxies is greatly restricted compared to that of β, this effectively means that rb will fall at steeper portions of the profile for steeper β. This suggests that some sort of “cusp radius,” rγ , a radius at which the profile reaches a nominal slope value, γ′, would provide a better description of the core scale than rb. The exact definition of rγ we use is given in equation (8). Carollo et al. ′ (1997) had already suggested use of rγ with γ = 1/2, the choice that we adopt here. Figure 18 compares the r L relationship for γ′ = 1/2 (shown earlier in Figure 5) and the γ − more traditional r L relationship. A symmetrical fit to the latter yields b − log(r ) = ( 0.71 0.04) (M + 21.64) /2.5 + (2.05 0.02), (C1) b − ± V ± which can be compared to the r L relation given in equation (10). For most galaxies β > 1, so γ − in general, r < r ; on average r 0.8r for γ′ = 1/2. We measured the rms scatter in the r L γ b γ ∼ b γ − relationship for core galaxies with M < 21 as a function of γ′ over the range 0.4 < γ′ < 0.7. A V − broad minimum in the scatter of 0.31 in log r occurs at γ′ 1/2. This is significantly smaller than γ ∼ the 0.38 scatter in log r for the r L relationship. The reduced scatter in the r L relationship b b − γ − as compared to the r L relationship is clearly evident in Figure 18. We have thus chosen to use b − rγ over rb as the core scale. ′ Evaluation of rγ at γ = 1/2 also leads to a clean separation of core and power-law galaxies. Since power-law galaxies have γ′ > 0.5 at the HST resolution limit, they are naturally excluded from the analysis. The upper limits for rγ for power-law galaxies however have the same physical values as their rb limits; since rγ < rb for core galaxies, this may create a false impression that the upper limits of rγ for power-law galaxies are more in line with the typical rγ values of core galaxies. The issue of whether or not intermediate galaxies can be included within the class of core galaxies, or should be treated separately, unfortunately depends on which relationship is being considered. As is evident in Figure 18, the intermediate galaxies with M 21 appear to be V ≤ − evenly distributed about the the mean r L relationship; their mean residual about the relationship b− is 0.06 0.15 in log(r ). In contrast, the same galaxies fall preferentially to the compact side of the − ± b r L relationship, now having a mean log(r ) residual of 0.60 0.11; given this systematic offset, γ − γ − ± we conclude that the intermediate galaxies should be treated separately from the core galaxies. – 32 –

REFERENCES

Adelberger, K. L & Steidel, C. C. 2005 ApJ, 630, 50

Batcheldor, D., Marconi, A., Merritt, D., & Axon, D. J. 2006, ArXiv Astrophysics e-prints, arXiv:astro-ph/0610264

Bechtold, J., et al. 2003, ApJ, 588, 119

Begelman, M. C. 2006, ApJ, 643, 1065

Begelman, M. C., Blandford, R. D., & Rees, M. J. 1980, Nature, 287, 307

Bender, R., Burstein, D., & Faber, S. M. 1992, ApJ, 399, 462

Bernardi, M., Hyde, J. B., Sheth, R. K., Miller, C. J., & Nichol, R. C. 2006b, ArXiv Astrophysics e-prints, arXiv:astro-ph/0607117

Bernardi, M., Sheth, R. K., Tundo, E., & Hyde, J. B. 2006c, ArXiv Astrophysics e-prints, arXiv:astro-ph/0609300

Bernardi, M., et al. 2003, AJ, 125, 1849

Bernardi, M., et al. 2006a, AJ, 131, 2018

Binney, J., & Tabor, G. 1995, MNRAS, 276, 663

Birnboim, Y., & Dekel, A. 2003, MNRAS, 345, 349

Blanton, M. R., et al. 2003, ApJ, 592, 819

Blumenthal, G. R., Faber, S. M., Primack, J. R., & Rees, M. J. 1984, Nature, 311, 517

Bower, G. A., Wilson, A. S., Heckman, T. M., Magorrian, J., Gebhardt, K., Richstone, D. O., Peterson, B. M., & Green, R. F. 2000, BAAS, 32, 1566

Bower, G. A., et al. 1998, ApJ, 492, L111

Boylan-Kolchin, M., Ma, C.-P., & Quataert, E. 2006, MNRAS, 369, 1081

Cappellari, M., Verolme, E. K., van der Marel, R. P., Kleijn, G. A. V., Illingworth, G. D., Franx, M., Carollo, C. M., & de Zeeuw, P. T. 2002, ApJ, 578, 787

Carollo, C. M., Franx, M., Illingworth, G. D., & Forbes, D. 1997, ApJ, 481, 710

Churazov, E., Sunyaev, R., Forman, W., B¨ohringer, H. 2002, MNRAS, 332, 729 de Vaucouleurs, G., de Vaucouleurs, A., Corwin, H. G., Buta, R. J., Paturel, G., & Fouque, P. 1991, Volume 1-3, XII, 2069 pp. 7 figs.. Springer-Verlag Berlin – 33 –

Djorgovski, S., & Davis, M. 1987, ApJ, 313, 59

Dressler, A. 1989, IAU Symp. 134: Active Galactic Nuclei, 134, 217

Dressler, A., Lynden-Bell, D., Burstein, D., Davies, R. L., Faber, S. M., Terlevich, R., & Wegner, G. 1987, ApJ, 313, 42

Ebisuzaki, T., Makino, J., & Okumura, S. K. 1991, Nature, 354, 212

Faber, S. M., & Jackson, R. E. 1976, ApJ, 204, 668

Faber, S. M., Tremaine, S., Ajhar, E. A., Byun, Y., Dressler, A., Gebhardt, K., Grillmair, C., Kormendy, J., Lauer, T. R., & Richstone, D. 1997, AJ, 114, 1771

Faber, S. M., Wegner, G., Burstein, D., Davies, R. L., Dressler, A., Lynden-Bell, D., & Terlevich, R. J., 1989, ApJS, 69, 763

Fabian, A. C., Voigt, L. M., & Morris, R. G. 2002, MNRAS, 335, L71

Ferrarese, L., Ford, H. C., & Jaffe, W. 1996, ApJ, 470, 444

Ferrarese, L., & Merritt, D. 2000, ApJ, 539, L9

Ferrarese, L., et al. 2006, ApJS, 164, 334

Fukugita, M., Shimasaku, K., & Ichikawa, T. 1995, PASP, 107, 945

Gebhardt, K., et al. 2000a, ApJ, 539, L13

Gebhardt, K., et al. 2000b, AJ, 119, 1157

Gebhardt, K., et al. 2003, ApJ, 583, 92

Gebhardt, K., et al. 2007, in preparation

Gonzalez, A. H., Zabludoff, A. I., & Zaritsky, D. 2005, ApJ, 618, 195

Graham, A., Lauer, T. R., Colless, M., & Postman, M. 1996, ApJ, 465, 534

Greenhill, L. J., Moran, J. M., & Herrnstein, J. R. 1997, ApJ, 481, L23

H¨aring, N. & Rix, H. 2004, ApJ, 604, L89

Harms, R. J., et al. 1994, ApJ, 435, L35

Hogg, D. W., et al. 2004, ApJ, 601, L29

Hopkins, P. F., Hernquist, L., Cox, T. J., Di Matteo, T., Robertson, B., & Springel, V. 2006, ApJS, 163, 1 – 34 –

Hougton, R. C. W., Magorrian, J., Sarzi, M., Thatte, N., Davies, R. L., Krajnovic, D. 2006, MNRAS, in press

Jarrett, T. H., Chester, T., Cutri, R., Schneider, S., Skrutskie, M., & Huchra, J. P. 2000, AJ, 119, 2498

Jarrett, T. H., Chester, T., Cutri, R., Schneider, S. E., & Huchra, J. P. 2003, AJ, 125, 525

Kormendy, J. 1985, ApJ, 295, 73

Kormendy, J. 1993, Coleccion Nuevas Tendencias (The nearest active galaxies), Proceedings of the meeting on The nearest active galaxies, held in Madrid in May 1992, Madrid: Consejo Supe- rior de Investigaciones Cientificas, —c1993, Edited by J. Beckman, L. Colina and H. Netzer, p. 197-218., 197

Kormendy, J., Fisher, D. B., Cornell, M. E., & Bender, R. 2007, ApJ, submitted

Kormendy, J., & Richstone, D. 1995, ARA&A, 33, 581

Kormendy, J., et al. 1997, ApJ, 482, L139

Laine, S., van der Marel, R. P., Lauer, T. R., Postman, M., O’Dea, C. P., & Owen, F. N. 2003, AJ, 125, 478

Lauer, T. R. 1985, ApJ, 292, 104

Lauer, T. R., et al. 1995, AJ, 110, 2622

Lauer, T. R., et al. 2005, AJ, 129, 2138

Lauer, T. R., et al. 2007a, ApJ, submitted

Lauer, T. R., et al. 2007b, in preparation

Macchetto, F., Marconi, A., Axon, D. J., Capetti, A., Sparks, W., & Crane, P. 1997, ApJ, 489, 579

Maciejewski, W., & Binney, J. 2001, MNRAS, 323, 831

Magorrian, J., et al. 1998, AJ, 115, 2285

Marconi, A., Capetti, A., Axon, D. J., Koekemoer, A., Macchetto, D., & Schreier, E. J. 2001, ApJ, 549, 915

Marconi, A., Risaliti, G., Gilli, R., Hunt, L. K., Maiolino, R., & Salvati, M. 2004, MNRAS, 351, 169

McLure, R. J., Willott, C. J., Jarvis, M. J., Rawlings, S., Hill, G. J., Mitchell, E., Dunlop, J. S., & Wold, M. 2004, MNRAS, 351, 347 – 35 –

Merritt, D. 2006, ApJ, 648, 976

Miller, C. J., et al. 2005, AJ, 130, 968

Milosavljevi´c, M., & Merritt, D. 2001, ApJ, 563, 34

Naab, T., Khochfar, S., & Burkert, A. 2006, ApJ, 636, L81

Nelson, C. H., Weistrop, D., Bower, G. A., Green, R. F., & STIS Team 2000, BAAS, 32, 701

Netzer, H. 2003, ApJ, 583, L5

Novak, G. S., Faber, S. M., & Dekel, A. 2006, ApJ, 637, 96

Oegerle, W. R., & Hoessel, J. G. 1991, ApJ, 375, 15

Patel, P., Maddox, S., Pearce, F. R., Arag´on-Salamanca, A., & Conway, E. 2006, MNRAS, 370, 851

Postman, M., & Lauer, T. R. 1995, ApJ, 440, 28

Press, W. H., Teukolsky, S. A., Vetterling, W. T., & Flannery, B. P. 1992, Numerical Recipes (2d ed.; Cambridge: Cambridge Univ. Press)

Prugniel, P., & Simien, F. 1996, A&A, 309, 749

Quillen, A. C., Bower, G. A., & Stritzinger, M. 2000, ApJS, 128, 85

Quinlan, G. D. 1996, New Astronomy, 1, 35

Quinlan, G. D., & Hernquist, L. 1997, New Astronomy, 2, 533

Ravindranath, S., Ho, L. C., Peng, C. Y., Filippenko, A. V., & Sargent, W. L. W. 2001, AJ, 122, 653

Rest, A., van den Bosch, F. C., Jaffe, W., Tran, H., Tsvetanov, Z., Ford, H. C., Davies, J., & Schafer, J. 2001, AJ, 121, 2431

Richards, G. T., et al. 2005, MNRAS, 360, 839

Richstone, D., et al. 1998, Nature, 395, A14

Robertson, B., Hernquist, L., Cox, T. J., Di Matteo, T., Hopkins, P. F., Martini, P., & Springel, V. 2006, ApJ, 641, 90

Salpeter, E. E. 1964, ApJ, 140, 796

Schechter, P. 1976, ApJ, 203, 297

Schlegel, D. J., Finkbeiner, D. P., & Davis, M. 1998, ApJ, 500, 525 – 36 –

S´ersic, J. L. 1968, Cordoba, Argentina: Observatorio Astronomico, 1968

Sheth, R. K., et al. 2003, ApJ, 594, 225

Small, T. A., & Blandford, R. D. 1992, MNRAS, 259, 725

So ltan, A. 1982, MNRAS, 200, 115

Springel, V., Di Matteo, T., & Hernquist, L. 2005, MNRAS, 361, 776

Steidel, C. . et al. 2002, ApJ576, 653

Tadhunter, C., Marconi, A., Axon, D., Wills, K., Robinson, T. G., & Jackson, N. 2003, MNRAS, 342, 861

Tonry, J. L., Dressler, A., Blakeslee, J. P., Ajhar, E. A., Fletcher, A. B., Luppino, G. A., Metzger, M. R., & Moore, C. B. 2001, ApJ, 546, 681

Tremaine, S. et al. 2002, ApJ, 574, 740

Tundo, E., Bernardi, M., Hyde, J. B., Sheth, R. K., & Pizzella, A. 2006, ArXiv Astrophysics e-prints, arXiv:astro-ph/0609297 van der Marel, R. P., & van den Bosch, F. C. 1998, AJ, 116, 2220

Verdoes Kleijn, G. A., van der Marel, R. P., Carollo, C. M., & de Zeeuw, P. T. 2000, AJ, 120, 1221

Vestergaard, M. 2004, ApJ, 601, 676

Voit, G. M., & Donahue, M. 2005, ApJ, 634, 955

Wyithe, J. S. B. 2006, MNRAS, 365, 1082

Wyithe, J. S. B., & Loeb, A. 2005, ApJ, 634, 910

Yu, Q., & Lu, Y. 2004, ApJ, 610, 93

Yu, Q., & Tremaine, S. 2002, MNRAS, 335, 965

Zibetti, S., White, S. D. M., Schneider, D. P., & Brinkmann, J. 2005, MNRAS, 358, 949

This preprint was prepared with the AAS LATEX macros v5.2. – 37 –

Table 1. Galaxy Parameters

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

NGC 0404 S0 17.19 38 0.23 14.62 − − \ NGC 0474 S0 20.12 164 1.15 15.20 − \ NGC 0507 S0 23.02 311 2.22 16.62 − ∩ NGC 0524 S0+ 21.85 253 1.57 15.24 − ∧ NGC 0545 BCG 22.98 242 2.16 16.52 A0194-M1 − ∩ NGC 0584 E 21.38 207 0.95 14.06 − ∩ NGC 0596 E 20.90 152 0.63 13.99 − \ NGC 0720 E 22.20 242 2.54 17.22 − ∩ NGC 0741 E 23.27 291 2.46 17.48 − ∩ NGC 0821 E 21.71 200 0.66 13.75 − ∧ NGC 0910 BCG 22.79 257 2.21 16.96 A0347-M1 − ∩ NGC 1016 E 22.90 294 2.25 17.01 − ∩ NGC 1023 S0 20.53 204 0.36 12.90 − − \ NGC 1052 E 21.17 208 1.46 14.35 − ∩ NGC 1172 E 20.13 112 0.64 14.09 − \ NGC 1316 E 23.32 228 1.54 14.30 − ∩ NGC 1331 E 18.58 58 1.07 17.07 − \ NGC 1351 S0 20.08 137 1.01 14.21 − − \ NGC 1374 E 20.57 185 0.96 14.57 − ∩ NGC 1399 E 22.07 342 2.23 16.76 − ∩ NGC 1400 S0 21.37 256 1.47 15.25 − − ∩ NGC 1426 E 20.78 153 0.71 14.28 − \ NGC 1427 E 20.79 162 0.61 14.11 − \ NGC 1439 E 20.82 154 0.71 13.85 − \ NGC 1500 BCG 22.75 263 1.99 16.34 A3193-M1 − ∩ NGC 1553 S0 22.06 177 1.01 13.54 − \ NGC 1600 E 23.02 337 2.82 18.17 − ∩ NGC 1700 E 21.95 235 1.01 13.50 − ∩ NGC 2300 S0 21.74 261 2.12 16.82 − ∩ NGC 2434 E 21.33 188 0.64 14.60 − \ NGC 2549 S0 19.17 143 0.51 13.98 − \ NGC 2592 E 20.01 265 0.82 13.76 − \ – 38 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

NGC 2634 E 20.83 181 0.93 14.57 − \ NGC 2636 E 19.19 69 0.87 15.22 − \ NGC 2685 S0+ 19.72 94 0.84 14.16 − \ NGC 2699 E 20.25 141 0.84 14.14 − \ NGC 2778 E 18.75 162 0.67 13.97 − \ NGC 2832 BCG 23.76 335 2.52 17.11 A0779-M1 − ∩ NGC 2841 Sb 20.57 206 1.09 14.54 − ∧ NGC 2872 E 21.62 285 1.06 13.65 − \ NGC 2902 S0 20.59 . . . 2.15 16.95 − ∧ NGC 2907 Sa 21.23 . . . 1.22 13.47 − \ NGC 2950 S0 19.73 182 0.58 12.99 − \ NGC 2974 E 21.09 227 0.64 13.77 − \ NGC 2986 E 22.32 262 2.07 16.24 − ∩ NGC 3056 S0+ 18.98 . . . 0.80 14.10 − \ NGC 3065 S0 19.64 160 0.86 13.93 − \ NGC 3078 E 21.95 250 0.95 13.23 − \ NGC 3115 S0 21.11 252 0.30 12.65 − − \ NGC 3193 E 21.98 194 1.38 14.70 − ∩ NGC 3266 S0 20.11 . . . 0.85 14.95 − \ NGC 3348 E 22.18 238 1.96 16.05 − ∩ NGC 3377 E 20.07 139 0.36 12.24 − \ NGC 3379 E 21.14 207 1.72 15.59 − ∩ NGC 3384 S0 19.93 148 0.36 13.03 − − \ NGC 3414 S0 20.25 237 0.81 13.56 − \ NGC 3551 BCG 23.55 268 2.37 17.35 A1177-M1 − ∩ NGC 3585 E 21.93 207 1.28 14.29 − ∧ NGC 3595 E 20.96 . . . 0.93 14.67 − \ NGC 3599 S0 19.93 85 0.65 14.64 − \ NGC 3605 E 19.61 92 0.65 14.96 − \ NGC 3607 S0 19.88 224 1.77 16.26 − ∩ NGC 3608 E 21.12 193 1.31 15.05 − ∩ NGC 3610 E 20.96 162 0.64 12.86 − \ – 39 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

NGC 3613 E 21.59 210 1.65 15.65 − ∩ NGC 3640 E 21.96 182 1.47 15.39 − ∩ NGC 3706 S0 22.31 270 1.60 14.16 − − ∩ NGC 3842 BCG 23.18 314 2.48 17.40 A1367-M1 − ∩ NGC 3900 S0+ 20.80 118 1.16 14.25 − \ NGC 3945 S0+ 20.25 174 0.59 14.19 − \ NGC 4026 S0 19.79 178 0.48 12.96 − \ NGC 4073 E 23.50 278 2.13 16.55 − ∩ NGC 4121 E 18.53 86 0.79 14.55 − \ NGC 4128 S0 20.79 203 0.92 13.62 − \ NGC 4143 S0 19.68 214 0.88 13.98 − \ NGC 4150 S0 18.66 85 0.85 13.87 − \ NGC 4168 E 21.80 184 2.26 17.58 − ∩ NGC 4239 E 18.50 62 1.06 16.82 − ∧ NGC 4261 E 22.26 309 2.31 16.09 − ∩ NGC 4278 E 21.05 238 1.77 15.82 − ∩ NGC 4291 E 20.64 285 1.63 15.29 − ∩ NGC 4365 E 22.18 256 2.15 16.53 − ∩ NGC 4374 E 22.28 282 2.11 15.67 − ∩ NGC 4382 S0+ 21.96 179 1.69 15.34 − ∩ NGC 4387 E 19.25 104 0.54 15.13 − \ NGC 4406 E 22.46 235 1.90 16.03 − ∩ NGC 4417 S0 18.94 131 0.94 13.96 − \ NGC 4434 E 19.19 120 0.54 14.44 − \ NGC 4458 E 19.27 103 0.80 13.57 − ∩ NGC 4464 E 18.82 127 0.54 13.92 − \ NGC 4467 E 17.51 68 0.54 15.07 − \ NGC 4472 E 22.93 291 2.25 16.53 − ∩ NGC 4473 E 21.16 179 1.73 15.40 − ∩ NGC 4474 S0 18.42 87 0.72 14.74 − \ NGC 4478 E 19.89 138 1.32 15.50 − ∩ NGC 4482 E 18.87 26 2.05 19.52 − ∧ – 40 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

NGC 4486 E 22.71 332 2.65 17.25 − ∩ NGC 4486B cE 17.98 170 1.08 14.44 − ∩ NGC 4494 E 21.50 150 0.54 13.40 − \ NGC 4503 S0 19.57 111 0.63 14.42 − − \ NGC 4551 E 19.37 108 0.54 14.86 − \ NGC 4552 E 21.65 253 1.60 15.17 − ∩ NGC 4564 E 20.26 157 0.63 13.43 − \ NGC 4589 E 21.35 224 1.40 15.41 − ∩ NGC 4621 E 21.74 225 0.54 12.43 − \ NGC 4636 E 21.86 203 2.21 16.76 − ∩ NGC 4648 E 20.24 220 0.83 13.34 − \ NGC 4649 E 22.51 336 2.34 16.77 − ∩ NGC 4660 E 20.13 188 0.54 12.53 − \ NGC 4696 BCG 24.33 254 2.44 17.77 A3526-M1 − ∩ NGC 4697 E 21.49 174 0.41 14.13 − \ NGC 4709 E 22.32 242 2.02 16.91 − ∩ NGC 4742 E 19.90 109 0.51 12.43 − \ NGC 4874 E 23.49 278 2.99 18.98 − ∩ NGC 4889 BCG 23.73 401 2.84 17.80 A1656-M1 − ∩ NGC 5017 E 20.67 184 0.99 13.30 − \ NGC 5061 E 22.01 186 1.39 14.06 − ∩ NGC 5077 E 22.07 256 1.96 16.07 − ∩ NGC 5198 E 21.23 196 1.33 15.19 − ∩ NGC 5308 S0 21.26 211 0.90 13.15 − − \ NGC 5370 S0 20.60 133 1.04 15.34 − \ NGC 5419 E 23.37 333 2.65 17.53 − ∩ NGC 5557 E 22.62 254 1.82 15.58 − ∩ NGC 5576 E 21.31 183 1.21 14.38 − ∩ NGC 5796 E 21.98 288 1.02 14.40 − ∧ NGC 5812 E 21.39 200 0.84 14.27 − \ NGC 5813 E 22.01 239 1.89 16.32 − ∩ NGC 5831 E 21.00 164 0.85 14.41 − \ – 41 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

NGC 5838 S0 20.51 266 1.03 13.61 − − \ NGC 5845 E 19.98 234 1.14 13.81 − \ NGC 5898 E 21.65 218 1.43 15.41 − ∧ NGC 5903 E 21.90 198 2.17 17.07 − ∩ NGC 5982 E 21.97 240 1.80 15.62 − ∩ NGC 6086 BCG 23.11 336 2.53 17.26 A2162-M1 − ∩ NGC 6166 BCG 23.80 310 3.17 19.32 A2199-M1 − ∩ NGC 6173 BCG 23.59 278 2.32 16.72 A2197-M1 − ∩ NGC 6278 S0 20.81 150 0.99 13.97 − \ NGC 6340 S0 19.46 144 0.91 14.54 − \ NGC 6849 S0 22.78 216 1.98 16.81 − ∩ NGC 6876 E 23.58 234 2.17 17.02 − ∩ NGC 7014 BCG 22.18 263 1.83 15.54 − ∩ NGC 7052 E 22.35 271 2.29 16.19 − ∩ NGC 7213 Sa 21.71 182 1.83 15.88 − ∩ NGC 7332 S0 19.62 124 0.67 12.78 − \ NGC 7457 S0 18.62 69 0.43 15.86 − − \ NGC 7578B BCG 23.41 214 2.06 16.19 A2572-M1 − ∩ NGC 7619 E 22.94 322 2.03 15.90 − ∩ NGC 7626 E 22.87 276 1.66 14.98 − ∧ NGC 7647 BCG 23.97 282 2.28 17.14 A2589-M1 − ∩ NGC 7727 Sa 21.19 196 0.48 14.11 − ∧ NGC 7743 S0+ 20.18 84 1.03 14.07 − \ NGC 7785 E 22.08 245 1.32 15.28 − ∩ IC 0115 BCG 22.67 . . . 2.45 17.24 A0195-M1 − ∩ IC 0613 BCG 22.27 262 2.05 16.25 A1016-M1 − ∩ IC 0664 BCG 22.86 336 2.07 15.81 A1142-M1 − ∩ IC 0712 BCG 23.29 345 2.69 17.68 A1314-M1 − ∩ IC 0875 S0 20.21 . . . 1.01 13.45 − \ IC 1459 E 22.51 306 1.94 15.39 − ∩ IC 1565 BCG 22.99 303 1.65 16.86 A0076-M1 − ∩ IC 1633 BCG 23.91 355 2.43 16.66 A2877-M1 − ∩ – 42 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

IC 1695 BCG 23.90 364 2.36 16.68 A0193-M1 − ∩ IC 1733 BCG 23.43 301 2.68 17.63 A0260-M1 − ∩ IC 2738 BCG 22.18 275 1.57 16.15 A1228-M1 − \ IC 4329 BCG 23.95 275 2.34 16.26 A3574-M1 − ∩ IC 4931 BCG 23.47 273 2.42 16.86 A3656-M1 − ∩ IC 5353 BCG 22.64 262 2.04 16.37 A4038-M1 − ∩ UGC 4551 S0 19.78 167 0.82 15.00 − \ UGC 4587 S0 20.77 . . . 1.05 14.79 − \ UGC 6062 S0 20.34 . . . 1.01 14.66 − \ VCC 1199 E 20.34 . . . 0.54 15.42 − \ VCC 1440 E 20.34 . . . 0.54 15.62 − \ VCC 1545 E 20.34 . . . 0.54 17.49 − \ VCC 1627 E 20.34 . . . 0.54 15.68 − \ ESO 378-20 S0 20.97 . . . 1.06 13.11 − \ ESO 443-39 S0 20.93 . . . 1.06 14.97 − \ ESO 447-30 S0 21.17 . . . 1.00 13.99 − \ ESO 462-15 E 22.83 305 1.19 14.75 − \ ESO 507-27 S0 20.89 203 1.08 14.25 − \ ESO 507-45 S0 23.28 311 1.81 14.95 − ∧ MCG 11-14-25A E 19.08 . . . 1.38 15.77 − ∩ MCG 08-27-18 S0 20.03 89 1.07 15.09 − \ A0119-M1 BCG 24.01 294 2.81 18.52 − ∩ A0168-M1 BCG 23.12 345 2.00 16.90 − ∩ A0189-M1 BCG 21.89 230 1.51 17.56 − \ A0261-M1 BCG 22.95 . . . 1.64 18.22 − \ A0295-M1 BCG 23.11 . . . 2.63 17.81 − ∩ A0376-M1 BCG 23.60 276 2.64 18.10 − ∩ A0397-M1 BCG 23.42 289 2.70 17.80 − ∩ A0419-M1 BCG 21.79 . . . 1.58 17.08 − \ A0496-M1 BCG 24.28 273 2.61 18.14 − ∩ A0533-M1 BCG 22.68 . . . 2.28 17.08 − ∩ A0548-M1 BCG 22.75 220 2.22 17.12 − ∩ – 43 –

Table 1—Continued

σ rγ µγ Galaxy Morph MV (km/s) P log(pc) (V-Band) Alt-ID

A0634-M1 BCG 22.70 245 2.17 17.16 − ∩ A0912-M1 BCG 22.24 . . . 1.63 16.36 − ∩ A0999-M1 BCG 22.45 272 2.29 17.03 − ∩ A1020-M1 BCG 22.65 345 2.32 16.87 − ∩ A1631-M1 BCG 23.34 249 2.12 16.49 − ∩ A1831-M1 BCG 23.51 . . . 2.84 18.58 − ∩ A1983-M1 BCG 22.35 270 1.73 15.50 − ∧ A2040-M1 BCG 23.46 223 2.28 17.41 − ∩ A2052-M1 BCG 23.04 216 2.46 18.53 − ∩ A2147-M1 BCG 23.16 278 2.90 19.03 − ∩ A2247-M1 BCG 22.66 209 1.60 20.06 − \ A3144-M1 BCG 22.28 . . . 2.28 16.79 − ∩ A3376-M1 BCG 23.29 300 3.11 18.93 − ∩ A3395-M1 BCG 24.23 276 2.52 18.09 − ∩ A3528-M1 BCG 24.30 434 2.61 17.54 − ∩ A3532-M1 BCG 24.58 . . . 2.51 17.49 − ∩ A3554-M1 BCG 23.99 . . . 2.62 18.36 − ∩ A3556-M1 BCG 23.65 . . . 2.48 16.95 − ∩ A3558-M1 BCG 24.92 275 3.12 19.29 − ∩ A3562-M1 BCG 24.32 236 2.84 18.87 − ∩ A3564-M1 BCG 22.68 . . . 2.12 16.63 − ∩ A3570-M1 BCG 22.54 268 2.01 16.19 − ∩ A3571-M1 BCG 25.30 325 3.03 19.39 − ∩ A3677-M1 BCG 22.21 . . . 2.14 16.49 − ∩ A3716-M1 BCG 23.75 247 2.56 17.99 − ∩ A3736-M1 BCG 23.98 . . . 2.70 17.93 − ∩ A3747-M1 BCG 22.65 232 2.00 15.97 − ∩

Note. — Derivation of the parameters listed are presented in Lauer et al. (2007a). Bulge luminosities are given for S0 and spiral galaxies. Velocity dispersion are provided by the “Hyperleda” database. The profile type, P, is – 44 –

= power-law, = intermediate form, and = core. BCG identifications \ ∧ ∩ are referred to their hosting Abell clusters; see Postman & Lauer (1995) for details. – 45 –

Table 2. Core Galaxies with Measured Black Hole Masses

Galaxy MV log(rγ /pc) Iγ (V) log(M•/M⊙) M• Source

NGC 1399 22.07 2.23 16.76 8.95 1, 2 − NGC 3379 21.14 1.72 15.59 8.00 3 − NGC 3608 21.12 1.31 15.05 8.28 4 − NGC 4261 22.26 2.31 16.09 8.72 5 − NGC 4291 20.64 1.63 15.29 8.49 4 − NGC 4374 22.28 2.11 15.67 9.00 6, 7 − NGC 4473 21.16 1.73 15.40 8.04 4 − NGC 4486 22.71 2.65 17.25 9.48 8 − NGC 4649 22.51 2.34 16.77 9.30 4 − NGC 7052 22.35 2.29 16.19 8.52 9 − IC 1459 22.51 1.94 15.39 9.18 10, 11 −

Note. — Black hole mass references are 1) Houghton et al. (2006), 2) Gebhardt et al. (2007), 3) Gebhardt et al. (2000b), 4) Gebhardt et al. (2003), 5) Ferrarese et al. (1996), 6) Bower et al. (1998), 7) Maciejewski & Binney (2001), 8) Macchetto et al. (1997), 9) van der Marel & van den Bosch (1998), 10) Verdoes Kleijn et al. (2000), and 11) Cappellari et al. (2002). For galaxies with two references, the black hole mass is an average value. – 46 –

Haring & Rix

All Points

Fig. 1.— Black hole masses for all black holes with direct mass determinations are plotted as a function of MV . Galaxies are drawn from the Tremaine et al. (2002) sample with augmentations as described in the text. The solid line is the H¨aring & Rix (2004) relationship (equation (4)) between 0.23 M• and galaxy mass transformed to luminosity using M/L L with zeropoint M/L = 6 at V ∼ V V M = 22. A symmetrical least-squares fit to all data points is shown as the dashed line (equation V − 2), and a fit to just the galaxies with M < 19 is shown as the dotted line (equation 3). V − – 47 –

BCG Core Core Intermediate Power Law BH Detected

Fig. 2.— M•(L) versus M•(σ) the sample galaxies that have σ measurements. Power-law galaxies are plotted as red dots, core galaxies are blue dots, “intermediate” galaxies are plotted as small open circles, and BCGs with resolved cores are plotted as green squares. Galaxies with large circles have directly determined black hole masses; however, the predicted rather than observed M• values are still plotted. The M• σ relationship is that of Tremaine et al. (2002). The asymmetric error − bar in the horizontal direction shows the change in predicted M• if the (Wyithe 2006) log-quadratic M• σ relationship is used instead. M•(L) is the average of the minimum and maximum predictions − for a given L from the three M• L relationships in Figure 1, with the error bars showing the range − of the predictions. – 48 –

100

BCG Core Core Intermediate Power Law BH Detected

-18 -20 -22 -24

Fig. 3.— The relationship between central velocity dispersion, σ, and L for the sample is plotted. A fit to just the core galaxies and BCGs (solid line; equation 5) gives L σ7, a much steep relationship ∼ then the standard L σ4 Faber-Jackson relationship, and L σ2 for the power-law galaxies alone ∼ ∼ (dashed line; equation 6). It is this change in slope that leads to conflicting predictions for M• from the M• L and M• σ relation for the most luminous galaxies. Core galaxies with directly − − measured black hole masses are circled. – 49 –

BCG Core 1000 Core Intermediate Power Law BH Detected

100

10

1 100

Fig. 4.— Cusp radius, rγ , is plotted as a function of stellar velocity dispersion. The power-law galaxies are now plotted as triangles to indicate that their cusp radii are only upper limits. The solid line is the fitted relationship (equation 9) between rγ and σ for core galaxies. The figure shows 4 that rγ is a steep function of σ. If M• σ as equation (1), the observed empirical relationship ∼ 2.1±0.4 between r and σ (solid line) implies that r M• . γ γ ∼ – 50 –

BCG Core 1000 Core Intermediate Power Law BH Detected

100

10

1 -18 -20 -22 -24

Fig. 5.— Cusp radius, rγ , is plotted as a function of total galaxy luminosity. Power-law galaxies are plotted as to indicate that their r values are as upper limits. The solid line shows the best-fit ∇ γ relationship between rγ and MV for core galaxies (equation 10). The figure shows that rγ varies 1.4 nearly linearly with L. If M• LV as in equation (4), the observed empirical relationship between ∼ 0.96±0.09 r and L (solid line) implies that r M• . γ γ ∼ Note that for core galaxies the range in rγ at any given L is smaller than it is at a given σ. – 51 –

12

BCG Core Core Intermediate Power Law BH Detected 14

16

18

20 1 10 100 1000

Fig. 6.— Cusp brightness, µγ, is plotted as a function of cusp radius, rγ . The power-law triangles are rotated and shown as arrows to reflect that the points are only upper limits for both Iγ and rγ . The tight relationship between Iγ and rγ (equation 15) means that either can serve for the other in the context of relating core structure to M•, L or σ. – 52 –

Fig. 7.— Black hole mass versus core size, rγ, for the 11 core galaxies that have M• measurements. The red line is the symmetric fit between M• and rγ provided by equation (20), while the blue line gives the fit presented in equation (21), which assumes that rγ is the independent variable. – 53 –

200 300 400 -21 -22 -23 -24 -25

200 300 400 -21 -22 -23 -24 -25

Fig. 8.— The four panels plot M• predicted from rγ as function of σ and MV for core galaxies, where green symbols are BCGs, and blue symbols are the remaining core galaxies. The red lines give M• values predicted either from the M• σ relationship (equation 1) or the M• L relationship − − (equation 4). The upper panels give M• predicted by rγ through equation (20), which was derived by a symmetrical fit to the points in Figure 7. The bottom panels, in contrast, use equation (21), which was derived assuming rγ as the independent variable. Both equations typically predict M• in excess of the predictions from the M• σ relationship. − – 54 –

Fig. 9.— The two panels plot core mass Mγ as function of M•(σ) and M•(L) for core galaxies, where green symbols are BCGs, and blue symbols are the remaining core galaxies. The red lines give the mean M M• relationships inferred by combining either the M• σ relationship (equation 1) with γ − − the M σ (equation 17), or the M• L relationship (equation 4) with the M L relationship γ − − γ − (equation 16). – 55 –

BCG Core 1000 Core Intermediate Power Law

-18 -20 -22 -24

Fig. 10.— Effective radius as a function of luminosity for the galaxy sample. The steepening of the R L relationship sets in at M < 22, similar to the luminosity at which the velocity dispersion e − V − starts to “plateau” in Figure 3. We attribute this to a progressive change in the character of “dry mergers” at higher galaxy masses. The shallow line is defined by a fit to power-law galaxies only (equation 25), while the steep line is a fit to core galaxies with M < 21 (equation 26). Power-law V − galaxies and core galaxies, however, have similar R at M 21 where the transition between e V ∼ − the two forms takes place. – 56 –

Fig. 11.— The log cumulative density of black holes above a given mass versus log M• for different mass functions. The red curve in both panels is derived from a lightbulb model for quasars applied to the Richards et al. (2005) best-fit quasar luminosity function from SDSS, evaluated at z = 2.5. The model assumes they radiate at their Eddington Luminosity with a duty fraction f = 0.03 (see text). The pink curve in both panels is the BH mass function produced by the Hopkins et al. (2006) model for quasar, described in detail in the text. The cyan curve in both panels is the BH mass function obtained by augmenting the Blanton et al. (2003) best-fit Schechter (1976) luminosity function for SDSS galaxies with the Postman & Lauer (1995) brightest cluster galaxies and calibrated by the H¨aring & Rix (2004) relation. The solid and dotted lines above the curve show the effect of cosmic scatter of 0.25 and 0.50 (respectively) about the mean relation (derived by H¨aring & Rix (2004)) between BH mass and galaxy mass. The green curve (in the left panel) is a BH mass function predicted from the SDSS velocity dispersion function (Sheth et al. 2003) augmented by the Bernardi et al. (2006a) high-dispersion sample and calibrated by the Tremaine et al. (2002) M•(σ) predictor for zero cosmic scatter. The solid and dotted lines above the curve show the effect of a cosmic scatter of 0.15 and 0.30 in the decimal log of the BH mass about the mean Tremaine relation. The dark blue curve in the right panel illustrates the same dispersion function calibrated instead by the Wyithe (2006) M•(σ) predictor. – 57 –

2

1.5

1

0.5

0

-0.5 10 100 Effective Radius (arcsec)

Fig. 12.— SDSS r “model magnitudes” are compared to two separate RC magnitudes for the Postman & Lauer (1995) BCGs inc common, where the SDSS values are transformed assum- ing RC = r 0.31. Red points show the difference between SDSS model r magnitudes and the − 1/4 Laine et al. (2002) RC total BCG luminosity versus effective radius, Re, which is derived from r fits to the Postman & Lauer (1995) surface brightness profiles. The blue points are the same exer- cise, but with the Postman & Lauer (1995) maximum-aperture magnitudes used instead. Clearly, the larger a BCG is, the more the present total luminosity disagrees with the SDSS value. The maximum-aperture magnitudes are not intended to be interpreted as a total magnitude, but pro- vide a model-independent lower limit on its value. Since the maximum-aperture magnitudes are brighter than the SDSS magnitudes for most BCGs, this clearly shows that the SDSS values cannot be regarded as total luminosities. – 58 –

18 A0779

20 A2052

22 A2199

24

26

10 100 Mean Radius (arcsec)

Fig. 13.— Surface brightness profiles from Postman & Lauer (1995) (black) are compared to “profmean” SDSS profiles (red) for three BCGs. The SDSS r band magnitudes are transformed assuming R = r 0.31. The SDSS profiles all fall below the Postman & Lauer (1995) profiles at C − large radii, consistent with excessive sky subtraction by the SDSS pipeline. – 59 –

1

All Postman & Lauer BCGs (PL mags)

0.8 PL/SDSS BCGs in common (SDSS mags) Miller et al. C4 SDSS BCGs (SDSS mags)

0.6

0.4

0.2

0 -20 -22 -24

Fig. 14.— Histograms of estimated total MV are shown for three BCG samples. Magnitudes for 1/4 the Postman & Lauer (1995) volume-limited BCG sample (blue) are based on r fits to RC -band surface photometry. The Miller et al. (2005) BCG sample is based on SDSS model r magnitudes, and has typical luminosities one magnitude smaller than the Postman & Lauer (1995) sample. The histogram of the subset of Postman & Lauer (1995) BCGs observed by SDSS (green) agrees well with the Miller et al. (2005) sample when SDSS r model magnitudes are used instead to estimate total MV , yet we argue that these magnitudes are strongly affected by excessive sky subtraction. This concordance implies that the C4 BCGs, are also likely to have had their total luminosities under-estimated. – 60 –

Fig. 15.— The archived 2MASS J band image of NGC 2832, the BCG in A0779, is compared to a portion of the R band image used by Postman & Lauer (1995) to derive surface photometry profile shown in the next figure. The stretch has been set to be the same for both images. The R band image has been binned to a 0.′′91 pixel scale to roughly match the 1.′′0 scale of the J band image. The sky level of the 2MASS image is effectively 26 brighter in J, taking the observed R J color × − of the galaxy into account. The J band image is clearly considerably shallower than the R band image, and the envelope of the galaxy disappears into the noise at radii where it is still clearly present in the R band. – 61 –

A0779 18

20 A2052

A2199 22

24

26

1 1.5 2 2.5 3

Fig. 16.— Major axis surface J photometry profiles (red) derived from 2MASS J band archive images are compared to the R profiles derived by Postman & Lauer (1995) (black) for the three BCGs shown in Figure 13. The 0.4% error in the 2MASS sky levels gives the large error bars. ∼ The last J band isophotes fall 7 magnitudes below the sky, and thus are less significant than ∼ the errors in the sky levels. The R and J profiles agree within the errors, and total luminosities estimated by r1/4-law fits to the even the J band profiles are considerably larger than the 2MASS XSC apparent luminosities. – 62 –

1

0

-1

0.5 1 1.5 2 2.5 3

Fig. 17.— Residuals about the mean r M relationship are plotted for core galaxies as a function b − V of the logarithmic envelope slope β. At any given MV , excessively large cores (positive residuals) correspond to higher β (steeper envelopes) while excessively small cores correspond to small β. – 63 –

BCG Core 1000 Core Intermediate Power Law

100

10

1

1000

100

10

1 -18 -20 -22 -24

Fig. 18.— The r L and r L relationships for core galaxies are compared. The lines are the γ − b − mean relationships for M < 21. The r L relationship has smaller scatter for core galaxies V − γ − with M < 21. Power-law galaxies are plotted the same in both panels, with upper limits on r V − γ to be the same as upper limits on rb.