EPJ manuscript No. (will be inserted by the editor)
Mesoscopy and thermodynamics
L. G. Moretto, J. B. Elliott, and L. Phair
Nuclear Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720
Received: date / Revised version: date
Abstract. The interplay of thermodynamics and statistical mechanics is discussed with special attention
to mesoscopic systems and phase transitions.
PACS. 25.70.Pq – 05.70.Jk – 24.60.Ky – 64.60.Ak
1 Introduction course of theoretical speculation is wisely trimmed by the
correctness of thermodynamical empiricism.
Thermodynamics is an empirical science devoted to the The “recent” extension of experimental [1–3] and the- description of macroscopic systems. It follows formally oretical research to systems that are neither wholly mi- from its three principles and stands firmly independent croscopic nor macroscopic, thus aptly named mesoscopic, of any microscopic description. Statistical mechanics, on should be enlightened by the same wisdom. While names the other hand, is founded on the general principles of and functions (temperature, entropy, free energy...) were mechanics, and attempts to bridge the gap from the mi- borrowed from thermodynamics, the urge to label the field croscopic description of a system to its macroscopic ther- as “novel” led to rash statements such as the primacy of modynamical properties. This attempt has met with un- one ensemble (microcanonical) over the others, or even deniable success through the introduction of ensembles to the affirmation of a “new” thermodynamics or to the such as microcanonical, canonical, grand canonical, etc., “failure” of the “old” thermodynamics and its principles. whereby microstates, constrained to satisfy some specific These rash statements have often been accompanied macroscopic conditions are used to obtain average values by a cavalier disregard for the physical definitions and of macroscopic variables. Its success, however, must be identification of phases, claiming the presence of phase measured in terms of its agreement with thermodynam- transitions on the basis of possibly myopic interpretations ics, which is the final judge. Thus the potentially unbridled of anecdotal numerical results. 2 L. G. Moretto et al.: Mesoscopy and thermodynamics
The resulting rush to calculate properties of a plethora any given energy. The result of these calculations is the of individual mesoscopic systems underscores the essential level density ρ(E) as a function of energy, or, equivalently, limitations of these efforts, which renounce the beautiful the entropy S(E) as a function of energy. Anomalies ap- generality of thermodynamics for results as limited and pear in the dependence of S(E) on E, like regions of con- specific as the systems under consideration become legion. stant slope, negative first derivatives and positive second
This direction leads to as many individual “statistical me- derivatives (concex intruders). The elementary nature of chanics” as there are mesoscopic systems. We shall discuss this procedure seems to ward off any confusion. this point further and suggest a natural solution. Yet the transition from microcanonical to the more
The study of numerical rather than analytical methods commonly used canonical language and variables becomes has given rise to two different problems. On one hand, the dangerous and misleading in correspondence to the very size of the system is finite, on the other, the choice of the anomalies for which one is searching. Thus the resulting algorithm becomes of paramount significance. negative temperature, negative specific heat, etc. tend to create misunderstanding and confusion. The first problem, which leads to the need of extrap- Our own approach uses the reductionistic device of de- olating correctly to the thermodynamic limit, is a nega- scribing the plethora of mesoscopy in terms of the “old” tive aspect of these numerical computations. On the other thermodynamics, which is augmented by the least num- hand, the ever increasing importance of microscopic sys- ber of new quantities necessary to describe the finiteness tems in experimental and applied science, makes the finite of the systems. This idea, of course, is not new. It is em- size of great interest in itself, and the attending differences bodied, for instance, in the “liquid drop” model, so much from the thermodynamic limit are drawing ever increasing successful in nuclear physics. It consists of expanding ther- attention. The second problem is related to the first. Some modynamic functions such as the free energy in terms of algorithms strive to compute the partition function versus a “finiteness” variable A−1/3 where A is the “size - mass” temperature (in the canonical approach), while others try of the finite system. So we write: to compute the entropy versus energy (the microcanoni-
2/3 1/3 cal approach). A lot of discussion has arisen concerning F (A, T ) = av(T )A + as(T )A + acA + ··· (1) the merits of the two approaches, in particular with re- where av(T ), as(T ), ac(T ) are the temperature T depen- gards to the observation of interesting anomalies and to dent volume, surface, curvature, etc. free energy coeffi- the extrapolation to the thermodynamic limit. cients. The leading term is the extensive volume term
In the struggle between the two camps, the upper hand av(T )A, the others (which can be shape dependent, e.g.
0 seems to be taken by the microcanonical algorithms which as(T ) = as(T )f(s/s0) where s is the actual surface s of just generate and count configurations in phase space at the object and s0 is the surface of its equivalent sphere) L. G. Moretto et al.: Mesoscopy and thermodynamics 3 are the terms that account for finiteness and vanish for The lack of particle or energy exchange with the environ-
A → ∞. The shape variables can either be fixed, or let ment makes the calculational aspect particularly transpar- free to assume the value demanded by the free energy ent. Literally one counts the number of configurations of minimization. The fact that in the nuclear case the ex- the system for a narrow energy interval around any given pansion can be safely stopped at the surface term while energy; that is the level density at that energy. Its loga- retaining accuracies of the order of one percent down to rithm is the entropy, now the “thermodynamic” potential sizes as small as A = 20 gives us the hope that just by in terms of V , N and E. That is all there is to do. However, adding a surface term we may obtain similarly adequate measurements typically deal with the canonical world of descriptions for many other mesoscopic systems. Further- temperature T and variables depending on T . more, for short range forces it happens that the surface Consequently one takes the derivative of entropy with coefficient is nearly equal and opposite to the volume co- respect to energy and calls that the inverse temperature efficient, i.e. a (T ) = −a (T ). This leads to a great deal ∂S 1 v s = β = . (2) ∂E T of economy and to an extreme generality. From this equation other quantities like the caloric curve It is important to appreciate that in any calculation ∂E E = E(T ) and specific heat CV = ∂T |V are determined. applied to mesoscopic systems, the presence of the sur- But it is here that problems arise. Equation (2) is a macro- face as a physical boundary, and boundary condition is of scopic thermodynamic relationship. But what is its mi- paramount importance for all calculations. Many of the croscopic origin? And does it apply all the time? In the claimed novel and general features resulting from finite following we consider this in detail. system calculations are either strongly dependent on such boundary conditions, or are artifacts of unwittingly intro- 2.1 The transformation from energy scale to duced boundary conditions which may not have necessar- temperature scale ily any specific physical meaning.
We shall exemplify the above ideas by considering phase We are going to perform this transformation in two equiva- coexistence and phase transitions. lent ways: the first more physical, the second more formal. Let us consider a microcanonical system fully charac-
terized by the relationship 2 Microcanonical and canonical worlds and ρ = ρ(E),S = S(E). (3) their languages Let us put a small system (thermometer) with a few de-
The microcanonical world is conceptually the simplest grees of freedom (e.g. a classical one dimensional harmonic one. It deals with an ensemble of totally isolated systems. oscillator) into contact with the microcanonical system. 4 L. G. Moretto et al.: Mesoscopy and thermodynamics
Any energy state ε of the thermometer is associated The independent variable β is the inverse of the canonical with a probability p (ε) equal to the degeneracy of the temperature T . In the canonical world at constant β (T ), microcanonical system at energy E − ε the microcanonical world at constant E is represented as
a weighted sum, the weight being a Boltzmann factor. p(ε) ' ρ(E − ε)ρT (ε) (4) Equation (9) transforms the microcanonical world of where the level density of the one dimensional harmonic energy E and level density ρ(E) in to the canonical world oscillator ρT (ε) is constant. Given the “smallness” of the 1 of temperature T = β and partition function Z(β). This thermometer, ε is microscopically small compared to the transformation is a Laplace transform. “macroscopic” energy E. So the following Taylor expan- Notice that for any given T or β there is a distribution sion is in order of energies, and not a single value. The distribution in E 2 ∂ ln ρ 1 2 ∂ ln ρ ln p(ε) = ln ρ(E)− ε + ε 2 −· · · . (5) is the integrand of Eq. (9) ∂E ε=0 2 ∂E ε=0
But (E − E )2 p(E) = ρ(E) exp(−βE) = A exp − 0 . (10) ∂ ln ρ ∂S 1 2σ2 = = , (6) ∂E ∂E T ∂2 ln ρ 1 This distribution is usually sharply peaked. The maximum 2 = − 2 (7) ∂E T CV is given by and ∂ ln p ∂ ln ρ = − β = 0 (11) ε 1 ∂E ∂E p(ε) = ρ(E) exp − − 2 − · · · . (8) T T CV or Assuming the second term of the expansion to be small, ∂ ln ρ 1 = (12) the population of the energy state of a thermometer is ∂E T given by a Boltzmann factor, uniquely characterized by which, for any T , can be solved for the most probable the “temperature” T , which, without more ado can be value of E. be attributed to our microcanonical system. The quan- The variance in E is given by tity CV , thermodynamically identifiable with the heat ca- 1 ∂2 ln ρ 1 2 = 2 = 2 (13) pacity of the system, is macroscopically large making the σ ∂E T CV truncation of the expansion to the linear term reasonable. or More formally, let us consider the relationship between p ∆E = T CV . (14) microcanonical and canonical ensembles in terms of the re-
Where CV is again the heat capacity. lationship between partition function Z and level density 1 Since E ∝ N ∝ CV , ∆E ∝ √ and becomes negligible ρ. The partition function is given by N in the thermodynamic limit of N → ∞. In order to go X Z ∞ Z (β) = exp(−βEi) = ρ(E) exp(−βε)dE. (9) i 0 from the canonical to the microcanonical world we need L. G. Moretto et al.: Mesoscopy and thermodynamics 5 to invert Eq. (10). The inverse Laplace transform is 3 Microcanonical anomalies and canonical
1 I ρ(E) = Z(β) exp(βE)dβ. (15) disasters 2πi
Again, the microcanonical world is an integral over a dis- 3.1 Negative temperatures tribution of the canonical worlds, and a given value of the Let us begin with a simple model: a chain of N links. One microcanonical energy corresponds to a distribution in β unit of energy is necessary to break one link. Microcanon- or T given by the integrand of Eq. (15). The inverse trans- ically the energy is form is an integral in the complex plane. Under ordinary E = n (23) circumstances (to be defined), the integrand has a saddle point located at a coordinate β0 defined by where n is the number of broken links. ∂ ln Z The level density parameter is + E = 0 (16) ∂β N! ρ = . (24) or n!(N − n)! ∂ ln Z E = − . (17) ∂β The level density and the entropy have a maximum for
For any given E one can solve Eq. (16) for β0. The right N n = 2 , so the temperature hand side of Eq. (17) can be reduced to a Gaussian integral 1 ∂S = (25) 1 Z +i∞ (β − β )2 T ∂n ρ(E) = Z(β ) exp (β E) exp − 0 idβ 2πi 0 0 2σ2 −i∞ goes from 0 to ∞ as n goes from 0 to N and from −∞ to (18) 2 0 as n goes from N to N. We are faced with a “negative” where 2 temperature! 1 ∂2 ln Z ∂E ∂E ∂T 2 = − 2 = − = − = CV T (19) σβ ∂β ∂β ∂T ∂β Canonically we have or N X N! Z = exp(−βn) = (1 + exp(−β))N . (26) 2 1 n!(N − n)! σβ = 2 . (20) 0 CV T This β-variance can be transformed into a T -variance The energy is
2 2 T ∂ ln Z N exp(−β) σT = (21) E = − = = n. (27) CV ∂β 1 + exp(−β) or Despite our prejudice that temperatures ought to be pos- T ∆T = √ . (22) itive, everything seems to be well behaved by generalizing CV The fluctuations in T decrease as √1 and vanish in the the range of β (and T ) from −∞ to +∞. N thermodynamic limit. Everything appears very normal and Furthermore, it can be seen from Eq. (15) that there stable. Yet something may go terribly wrong in transform- is a saddle point in the complex plane. The Gaussian ap- ing microcanonical results into a canonical world. proximation is legitimate since CV is positive throughout 6 L. G. Moretto et al.: Mesoscopy and thermodynamics the range of β, and the contour integral can be evaluated and the level density is
5 by the method of steepest descents, yielding ∂n n5 1 R 2 ρ ' ' ' − . (33) r ∂E 2R 2R E − R N N n(N − n) ρ = / 2π . (28) (N − n)N−nnn N The entropy is
Not surprisingly this is the Stirling approximation to Eq. (24) 3 5 S ' ln ρ ' ln R − ln 2 − ln(R − E). (34) 2 2 and it is exceedingly precise from 0 < n ≤ N. Therefore, The “temperature” is nothing prevents the translation from microcanonical to 1 ∂S 5 1 canonical world in the entire energy range, even if this β = = = (35) T ∂E 2 R − E costs us the intuitive feeling that there ought not to be or negative temperatures. 2 T = (R − E). (36) Even if everything looks nice and stable, let us try to 5 This strange temperature is maximal at E ∼ 0 (T = 2 R) put a thermometer into contact with our system. Equa- 5 and decreases linearly with E and vanishes at the ioniza- tion (8) is reported below tion limit E = R. ε ε2 ρ(E − ε) = ρ(e) exp − − 2 . (29) T T CV The heat capacity is
If T is negative, clearly there is no convergence. Energy 5 C = − (37) V 2 will be drawn into the thermometer which will probably These results seem (and are!) counter intuitive. Is there explode. We call this first order divergence. anything wrong? The analysis is fine, up to the evaluation
of the entropy. The problems begin when we proceed to 3.2 Negative specific heat calculate the temperature.
The simplest model with negative specific heat that comes In order to see what the problems are, we must go back to mind is a very interesting one: the hydrogen atom. The to our two ways of going from the microcanonical to the energy levels are given by canonical world.
1 E = R 1 − (30) n n2 3.2.1 First method where R is the Rydberg constant. The degeneracy is
n−1 Let us put a thermometer (a classical one dimensional har- X (2l + 1) = n2. (31) monic oscillator) in contact with our system. The resulting l=0
The above level spacing, including the degeneracy is energy distribution of the thermometer is
2 ∂E 2R 1 E ε = (32) p(ε) = p(E − ε) = ρ(E) exp − − 2 − · · · . (38) ∂n n3 n2 T T CV L. G. Moretto et al.: Mesoscopy and thermodynamics 7
1 This expansion must converge to be meaningful. The saddle point located at β = β0 = T exists only if
2 1 σ = 2 is positive. If CV is negative the integrand This requires that β T CV
1 diverges away from β0. Conceptually β0 = is the least E T 0 < T E,CV 1 0 < T ∼ E,CV ∼ N . N probable value of the distribution in β, hardly a meaning- (39) ful choice to characterize the system. Again, if CV < 0 the If these considerations are met, in other words if T is mi- temperature remains undefined. croscopic and CV is macroscopic, then the expansion can In conclusion: systems whose microcanonical entropy be stopped at the first term, the quantity T is well de- ∂S versus energy curve presents a negative slope ∂E < 0 can fined and can be used as an intuitive variable to label the be formally assigned a negative temperature in the canon- system. The average energy of the thermometer is micro- ical language as the most probable value of a peaked dis- scopic and of order T . If CV is negative, the thermometer tribution with finite variance. However, when placed into energy distribution diverges. Its mean energy, controlled contact with a thermometer, the average energy of the by the second order term, diverges, and the thermome- thermometer will diverge. This a first order divergence. ter explodes. We call this second order divergence. Thus 2 Systems for which ∂ S is positive (negative specific if the expansion does not converge, the parameter T be- ∂E2 heat) cannot be assigned a meaningful temperature in the comes physically meaningless. Canonically the system has −1 canonical language. The quantity T = ∂S is the least no temperature, and quantities defined in terms of deriva- ∂E probable value of the distribution. A thermometer in con- tives with respect to temperature are meaningless. tact with such systems will be subject to a runaway in- So, for instance, negative heat capacity is, at the very crease in its mean energy. This a second order divergence. least, a misnomer! The mathematically well defined but