<<

PHYS 633: Introduction to Stellar Astrophysics Spring Semester 2006 Rich Townsend ([email protected])

Convective Energy Transport From our derivation of the radiative diffusion equation, we find that in regions of a star where energy is transported by radiation alone, the physical satisfies ∇ = ∇rad, (1) where the radiative temperature gradient is defined in terms of the local flux F (r) by 3 κρH ∇ ≡ P F (r), (2) rad 4ac T 4 and HP is the pressure scale height, ∂r P r2 H ≡ −P = . (3) P ∂P Gmρ If we wish to include the energy transported due to conduction (usually unim- portant except in degenerate stars), we can replace the κ term in the definition of ∇rad with κ + κcon, where κcon is the conductive opacity. So, in a radiative region, ∇rad tells us what the physical temperature gradient ∇ will be. But what is its interpretation in a convective region? Energy is still transported by radiative diffusion when is occurring, but the convection itself also transports energy. In such cases, ∇rad provides an indicator of the temperature gradient that would ensue if the convection could artificially be switched off, leaving radiation to carry all of the flux. Hence, the local flux F (r) can be expressed as 4ac T 4 F (r) = FC(r) + FR(r) = ∇rad, (4) 3 κρHP where FC is the convective energy flux, and FR is the radiative energy flux, 4ac T 4 FR(r) = ∇. (5) 3 κρHP

We can write FR in the form, ∇ FR(r) = F (r), (6) ∇rad which allows us to express the physical temperature gradient at each radius in the form  F (r) ∇ = ∇ 1 − C . (7) rad F (r)

1 Clearly, for a positive convective flux, we have ∇ < ∇rad; so the effect of convec- tion is to make the temperature gradient less steep than it would be if energy was transported by radiation/conduction alone. In fact, as the convection becomes more and more efficient (we’ll see what that means further on), the physical temperature gradient approaches the adiabatic gradient ∇ad. Due to the Schwarzschild stability criterion, the above expression only holds in regions where ∇rad itself is greater than the adiabatic gradient  ∂ ln T  ∇ad ≡ . (8) ∂ ln P s Otherwise, radiation is able to transport all of the flux, with a temperature gradient ∇ = ∇rad < ∇ad that gives a stable hydrostatic equilibrium (i.e., no convection, FC(r) = 0). Given eqn. (7), we can calculate the physical gradient ∇ throughout a con- vective region, if we know both the total flux F (r) and the convective flux FC(r). We obtain the former from the local luminosity l(r); but how do we go about deriving the convective flux? Here lies one of the biggest problems in present-day stellar astrophysics: there is no comprehensive, physically-realistic theory for convective transport of energy. None. Nada. This topic has received an inordinately-large amount of attention over the past 50 years or so, but still remains extremely problematical. Is all lost, then? Fortunately not. Although there are no complete models, there do exist very useful parameterizations of convection, which turn out to produce workable results. The most simple of these is the mixing- theory developed originally by Ludwig Prandtl, and first applied in the field of astrophysics by Erika B¨ohm- Vitense.. The theory is based on the assumption that the rising and falling fluid elements responsible for the convective energy transport retain their identity over a typical radial distance `m (the mixing length), before they dissolve into their new surroundings. Let’s focus on an average fluid element at radius r, which we assume has already traveled a distance `m/2 since it began moving (see Fig. 1). Adopting the same notation as for the convective stability analysis, let ∆TE be the change in the temperature of the element during its motion, and let ∆TA be the corresponding change in the ambient fluid surrounding the element. We may write these two quantities as

`m ∆TE = −∇E T (9) 2HP and `m ∆TA = −∇ T (10) 2HP where  ∂ ln T  ∇E ≡ (11) ∂ ln P E is the temperature gradient (w.r.t. pressure) experienced by the fluid element. If during its motion the element remains completely insulated from its surround- ings, then ∇E = ∇ad. But, more realistically, the element will lose some of its

2 Figure 1: The path followed by the average fluid element. excess heat to its surroundings, so the general inequality for the ‘four nablas’ in convective regions (∇ad < ∇rad) is

∇ad < ∇E < ∇ < ∇rad. (12)

In terms of the temperature changes above, the excess heat per unit mass of the element over its surroundings is given by

(T ∆sE − T ∆sA) = cP (∆TE − ∆TA) (13)

`m = T cP (∇ − ∇E) . (14) 2HP Hence, the convective flux — being the amount of excess heat transported by

3 the element through radius r, per unit area, per second — is just

`m FC(r) = ρv(T ∆sE − T ∆sA) = ρvT cP (∇ − ∇E) , (15) 2HP where v is the suitably-averaged velocity of the element. The next step in our derivation is to obtain values for v and ∇E. Let’s focus first on the convective velocity. This quantity is of course non-zero because the element has been accelerated by buoyancy forces acting upon it. As with the stability analysis, the buoyancy force per unit mass (i.e., the acceleration) is given by g g = − (∆ρ − ∆ρ ), (16) b ρ E A where ∆ρE and ∆ρA are just the density changes in the element and its sur- roundings. With a little algebra, that takes advantage of the fact that the element remains in pressure equilibrium with its surroundings, we can write these density changes in terms of the nablas as

`m ∆ρE = −ρ(α − δ∇E) (17) 2HP and `m ∆ρA = −ρ(α − δ∇) , (18) 2HP where α and δ are the usual isothermal compressibility and isobaric thermal expansion coefficients. Hence, the buoyancy force per unit mass is

`m gb = gδ(∇ − ∇E) . (19) 2HP This expression gives the force when the element reaches radius r; assume that the average force between r −`m/2 and r is then just half this value. We further assume that drag forces on the element — due to having to push surrounding fluid out of the way — will mean that only half of the average force is able to ac- celerate the element. Hence, we write the (assumed to be uniform) acceleration of the element as `m a = gb/4 = gδ(∇ − ∇E) . (20) 8HP Given this acceleration, we can evaluate the element’s velocity at r as

2 2 `m `m v = 2a = gδ(∇ − ∇E) . (21) 2 8HP So where have we got to? With reference to eqn. (15), we have eliminated the convective velocity v, so the only outstanding unknown quantity (apart from the mixing length `m) is the element’s temperature gradient ∇E. To get a

4 handle on this quantity, we need to think about how rapidly the element cools off as it rises. We can write the rate-of-change of temperature as ∂T   ∂T  ∂P  ∂T  ∂s = + . (22) ∂t E ∂P s ∂t E ∂s p ∂t E T ∂P T ∂s = ∇ad + (23) P ∂t cP ∂t E The first term on the right-hand side comes from adiabatic expansion, and the second represents heat losses at constant pressure to the element’s surroundings. Since the element moves a distance v dt in time dt, we can rewrite this equation in terms of the spatial temperature gradient experienced by the element, ∂T  1 ∂T T ∂P  T ∂s = = ∇ad + (24) ∂r E v ∂t P ∂r E cP v ∂t E Multiplying through by the pressure scale height, we therefore find after a little algebra     ∂ ln T HP ∂s = ∇E = ∇ad − . (25) ∂ ln P E cP v ∂t E We’re further along now; we have an expression for the elemental tempera- ture gradient ∇E, in terms of ∇ad and the rate at which the element exchanges heat with its surroundings. In a twist of irony, this heat loss is actually caused by radiative transport of energy. If the element has an approximate dimension d, then the radiative flux through its surface can be expressed using the diffusion equation, 4acT 3 (∆T − ∆T ) f ≈ E A , (26) r 3κρ d/2 4acT 4 (∇ − ∇ ) ` ≈ E m . (27) 3κρ d HP Here, we have used the temperature difference between the element and the ambient fluid to write the temperature gradient term. In terms of fr, the rate of heat exchange per unit mass of the element will be ∂s f A T = − r , (28) ∂t E Bρ (note the minus sign, indicating a heat loss), where A is the surface area of the element, and B its volume. Substituting this into the expression (25) for ∇E, and eliminating fr using eqn. (26), we obtain

HP frA ∇E = ∇ad + , (29) T cP vBρ 3 4ac T `mA = ∇ad + 2 (∇ − ∇E) . (30) 3 κρ cP v d B

5 The right-most fraction in this expression contains information about the ge- ometry of the fluid elements. For a spherical element with radius equal to `m, it would equal 6/`m; but most authors (including Kippenhahn & Weigert) take ` A 9/2 m = , (31) d B `m in order to achieve agreement with the original derivations by B¨ohm-Vitense. With this substitution, we have 6acT 3 ∇E = ∇ad + 2 (∇ − ∇E). (32) κρ cP v`m With this final expression, we now have four equations that describe energy transport in a convection zone. To recap:  ∇ = ∇ 1 − FC  ,  rad F  `m FC = ρvT cP (∇ − ∇E) , 2HP 2 2 `m v = gδ(∇ − ∇E) ,  8HP  6acT 3 ∇E = ∇ad + 2 (∇ − ∇E). κρ cP v`m

There are four unknowns in these equations: FC, ∇, v and ∇E (we assume that we know `m somehow). So, we have enough information in principle to calculate the physical temperature gradient ∇. In practice, it is useful to eliminate FC, ∇ and v, to obtain a single expression for the physical temperature gradient:

2 2 ∇ = ∇ad + ξ − U , (33) where ξ is the (single) real root of the cubic equation 8U (ξ − U)3 + (ξ2 − U 2 − W ) = 0, (34) 9 and the parameters U and W are defined by

3 s 3acT 8HP U ≡ 2 2 ,W = ∇rad − ∇ad. (35) κρ cP `m gδ What is the interpretation of these parameters? Well, W simply measures how much the radiative temperature gradient exceeds the adiabatic temperature gradient. Since we can write the convective flux in the form

∇rad − ∇ FC(r) = F (r) (36) ∇rad

(see eqn. 7), and since ∇ > ∇ad in a convection zone, we can see that W gives the upper limit on the total flux that will be transported by convection: W 4ac WT 4 FC(r) < F (r) = . (37) ∇rad 3 κρHP

6 To understand the significance of U, let’s look at the limiting cases where this parameter is very small or very large. In the U → 0 case, eqn. (34) has the solution ξ = 0. Hence, ∇ → ∇ad, and we see that the convection is very efficient; the temperature gradient need only be marginally above the adiabatic gradient for convection to transport the stellar flux. In the opposite case of very large U case, it is less clear how to solve eqn. (34). Let us first introduce the parameter

x ≡ ξ2 − U 2. (38)

Since ∇ is bounded between ∇ad and ∇rad, we can see from eqn (33) that x must remain small even when U becomes large. Hence, we can make the approximation ξ ≈ U + x/2U (39) in the cubic equation (34), to obtain

x3 8U + (x − W ) = 0. (40) 8U 3 9 For U  1, this becomes 8U (x − W ) = 0, (41) 9 giving us the solution x = W , or

ξ2 − U 2 = W. (42)

Substituting this back into eqn. (33), we obtain the result

∇ → ∇rad (43) in the limit U → ∞. Based on the foregoing analysis, we can interpret U as a measure of the efficiency of the convection: the U → 0 case corresponds to efficient convection (∇ → ∇ad), and the U → ∞ case corresponds to inefficient convection (∇ → ∇rad).

7