<<

The Inductive Theory of : Summary and Synthesis Steven A. Frank1 Department of Ecology and Evolutionary , University of California, Irvine, CA 92697–2525 USA The theory of natural selection has two forms. Deductive theory describes how populations change over time. One starts with an initial population and some rules for change. From those assumptions, one calculates the future state of the population. Deductive theory predicts how populations adapt to environmental challenge. Inductive theory describes the causes of change in populations. One starts with a given amount of change. One then assigns different parts of the total change to particular causes. Inductive theory analyzes alternative causal models for how populations have adapted to environmental challenge. This chapter emphasizes the inductive analysis of causea,b.

1. Introduction 1 23. Changes in transmission and total change 9 2. Constraints on selection 2 24. Choice of predictors 9 3. The origin of variation 2 25. Sufficiency and invariance 9 4. Selection by itself 3 26. Transmission versus selection 10 5. Goals of selection theory 3 27. Mutation versus selection 10 6. Partitioning causes of change 4 28. Stabilizing selection 11 7. Models of selection: prelude 4 29. Clade selection 11 8. Frequency change 4 30. Kin and group selection 11 9. Change caused by selection 5 31. Competition versus cooperation 12 10. Change during transmission 5 32. ESS by fitness maximization 12 11. Characters and covariance 5 33. Dynamics of phenotypic evolution 13 12. Quantitative and genetic characters 5 34. Other topics 13 13. Variance, distance or information 6 35. Deductive versus inductive perspectives 13 14. Characters and coordinates 6

15. Variance and scale 6 Introduction. Darwin got essentially everything 16. Description and causation 6 right about natural selection, adaptation, and biologi- 17. Partitions of cause 7 cal design. But he was wrong about the processes that determine inheritance. 18. Partitions of fitness 7 Why could Darwin be wrong about heredity and ge- netics, but be right about everything else? Because the 19. Testing causal hypotheses 7 essence of natural selection is trial and error learning. arXiv:1412.1285v2 [q-bio.PE] 12 Nov 2016 20. Multiple characters and nonadditivity 8 Try some different approaches for a problem. Dump the ones that fail and favor the ones that work best. Add 21. Partitions of characters 8 some new approaches. Run another test. Keep doing that. The solutions will improve over time. Almost ev- 22. Heritability and the response to selection 8 erything that Darwin wanted to know about adaptation and biological design depended only on understanding, in a general way, how the traits of individuals evolve by trial and error to fit more closely to the physical and a)homepage: https://stevefrank.org social challenges of reproduction. b)A condensed and simplified version of this manuscript will appear Certainly, understanding the basis of heredity is im- as: Frank, S. A. and Fox, G. A. 2017. The inductive theory of natural selection. Pages 000–000 in The Theory of Evolution, S. M. portant. Darwin missed key problems, such as genomic Scheiner and D. P. Mindell, eds. University of Chicago Press (in conflict. And he was not right about every detail of adap- press). tation. But he did go from the absence of understanding 2 to a nearly complete explanation for biological design. What he missed or got wrong requires only minor ad- justments to his framework. That is a lot to accomplish in one step. How could Darwin achieve so much? His single great- est insight was that a simple explanation could tie ev- erything together. His explanation was natural selection in the context of descent with modification. Of course, not every detail of life can be explained by those sim- ple principles. But Darwin took the stance that, when major patterns of nature could not be explained by selec- tion and descent with modification, it was a failure on his part to see clearly, and he had to work harder. No one else in Darwin’s time dared to think that all of the great FIG. 1. The logarithmic . The grows out at a complexity of life could arise from such simple natural rate that increases with the angle, α, at the leading edge. processes. Not even Wallace. The shell on the right closely follows a logarithmic spiral. The shell drawing on the right is from Thompson Now, more than 150 years after The Origin of Species, (1961, p. 173). we still struggle to understand the varied complexity of natural selection. What is the best way to study the the- ory of natural selection: detailed genetic models or simple diverse shapes of sheep, ram, and goat horns can be re- phenotypic models? Are there general truths about nat- duced to modulation of a few simple rules of growth. In ural selection that apply universally? What is the role of general, a small number of generative processes in devel- natural selection relative to other evolutionary processes? opment set tightly constrained contours on the possible Despite the apparent simplicity of natural selection, range of final form: controversy remains intense. Controversy almost always reflects the different kinds of questions that various peo- The distribution of forces which manifest ple ask and the different kinds of answers that various themselves in the growth and configuration people accept as explanations. Natural selection itself of a horn is no simple nor merely superficial remains as simple as Darwin understood it to be. matter. One thing is co-ordinated with an- other; the direction of the axis of the horn, the form of its sectional boundary, the spe- Constraints on selection. One can look at the di- cific rates of growth in the mean spiral and verse and complicated biological world, and marvel at at various parts of its periphery—all these how much can be understood by the simple process of play their parts, controlled in turn by the natural selection. Or one can look at the same world and supply of nutriment which the character of feel a bit outraged at how much a naively simple-minded the adjacent tissues and the distribution of view misses of the actual complexity. That opposition the blood-vessels combine to determine. To between simplicity and complexity arose early in the his- suppose that this or that size or shape of tory of the subject. horn has been produced or altered, acquired D’Arcy Thompson (1917) emphasized that physical or lost, by Natural Selection, whensoever one processes influence growth and set the contours that bi- type rather than another proved serviceable ological design must follow. Those physically imposed for defence or attack or any other purpose, contours limit natural selection as an explanation for or- is an hypothesis harder to define and to sub- ganismal form. For example, mollusks often grow their stantiate than some imagine it to be (Thomp- shell by uniform addition at the leading edge. That uni- son, 1992, p. 213). form growth produces shells that follow a logarithmic spiral (Fig. 1). The rules of growth determine the range of forms that A smaller angle of deposition at the leading edge causes may occur. Physical processes constrain variation. tighter coiling of the shell. The physical laws of growth set the primary contours of pattern. Natural selection can modulate design only within those strongly con- The origin of variation. The tension between the strained contours. Much of the order in nature arises constraints on variation and the power of natural selec- from the of growth rather than from selection. tion to shape observed pattern recurs throughout the his- Thompson applied that logic to a vast range of natural tory of post-Darwinian biology. If selection is trial and history. He showed that the great variety of shell patterns error, progress depends on the way in which new alter- arises from just a few additional rules of growth. Natu- native trials arise. A trait cannot be selected if it never ral selection apparently modulates only a small number occurs. The origin of variation as a constraint on selec- of angles and rates of deposition. Similarly, the bizarrely tion forms perhaps the greatest criticism against selection by itself as a creative force. Haldane (1932, p 94) framed 3

TABLE I. Natural Selection

Domain: Evolutionary change in response to natural selection.

Propositions:

1. Evolutionary change can be partitioned into natural selection and transmission.

2. Adaptation arises as natural selection accumulates information about the environment.

3. Information is often lost during transmission of characters from ancestors to descendants.

4. The balance between information gain by selection and information loss by transmission often explains the relative roles of different evolutionary forces.

5. Fitness describes the evolutionary change by natural selection.

6. Fitness can be partitioned into distinct causes, such as the amount of change caused by different characters.

7. Characters can be partitioned into distinct causes, such as different genetic, social, or environmental components.

8. The theory of natural selection evaluates alternative causal decompositions of fitness and characters.

9. Key theories of natural selection identify sufficient causes of change. The fundamental theorem identifies the variance in fitness as a sufficient statistic for change by selection. Kin selection decomposes change by selection into sufficient social forces.

the problem more broadly than Thompson: of evolution to the biological sciences partly It is perfectly true, as critics of Darwinism explains why the theory of Natural Selection never tire of pointing out, that ... no new should have been so fully identified with its character appears in the species as the result role as an evolutionary agency, as to have suf- of selection. Novelty is only brought about fered neglect as an independent principle wor- by selection as the result of the combination thy of scientific study. ... The present book, of previously rare characters. with all the limitations of a first attempt, is at least an attempt to consider the theory of How do rare characters first arise? That question is be- Natural Selection on its own merits. yond the scope of a chapter on the theory of natural selec- tion, because the origin of characters mostly depends on Natural selection does not stand alone. But selection forces other than selection. But it is essential to keep this remains the only evolutionary force that could potentially point in mind when considering the history of the theory explain adaptation. With the warnings about the origin of selection and the status of modern debates about how of variation in mind, I now turn to study of selection by one should think about selection in relation to evolution. itself.

Selection by itself. In another great book, Fisher Goals of selection theory. Selection theory must be agreed with Thompson about the complexity of evolu- evaluated with regard to two alternative goals. First, how tionary forces. Yet Fisher (1930, p. vii) began his book can one improve predictions about evolutionary change by saying in traits. Prediction is valuable when using artificial se- Natural Selection is not Evolution. Yet, ever lection to enhance performance. For example, one may since the two words have been in common use, seek greater milk production from cows or reduced an- the theory of Natural Selection has been em- tibiotic resistance or stronger binding of a molecule to ployed as a convenient abbreviation for the a cellular surface receptor. In these cases, the primary theory of Evolution by means of Natural Se- goal is improved prediction of outcomes to achieve great- lection, put forward by Darwin and Wallace. est performance at least cost. Understanding the causes This has had the unfortunate consequence of the outcomes can be helpful and interesting, but causal that the theory of Natural Selection itself has analysis is secondary to performance. scarcely ever, if ever, received separate con- The second goal concerns the causal analysis of traits. sideration. ... The overwhelming importance How have various evolutionary forces shaped traits? Why 4 do particular patterns of traits occur? Understanding Models of selection: prelude. Improvement by cause depends on comparing the predictions of alterna- trial and error is a very simple concept. But applying tive explanatory models. However, prediction to evaluate that simple concept to real problems can be surprisingly cause differs from prediction to optimize costs and ben- subtle and difficult. can help, but it can efits with regard to a desired target. A causal model also hinder. One must be clear about what one wants seeks to isolate the forces ultimately responsible for pat- from the mathematics and the limitations of what math- tern, whereas a model to optimize performance may not ematics can do. By mathematics, I simply mean the steps provide a correct interpretation of cause. In this chap- by which one starts with particular assumptions and then ter, I focus on how to use selection theory to understand derives logical conclusions or empirical predictions. cause. The output of mathematics reflects only what one puts in. If different mathematical approaches lead to differ- ent conclusions, that means that the approaches have Partitioning causes of change. made different assumptions. Strangely, false or appar- ently meaningless assumptions often provide a better de- Since path analysis depends on structure, and scription of the empirical structure of the world than structure in turn depends on the cause-and- precise and apparently true assumptions. From Fisher effect relationship among the variables, we (1930, p. ix) shall first say a few words about the way these terms will be used. ... There are a number The ordinary mathematical procedure in of formal definitions as to what constitutes a dealing with any actual problem is, after ab- cause and what an effect. For instance, one stracting what are believed to be the essen- may think that a cause must be doing some- tial elements of the problem, to consider it thing to lead to something else (effect). While as one of a system of possibilities infinitely this is clearly one type of cause-and-effect re- wider than the actual, the essential relations lationship, we shall not limit ourselves to that of which may be apprehended by generalized type only. Nor shall we enter into philosophi- reasoning, and subsumed in general formu- cal discussions about the nature of cause-and- lae, which may be applied at will to any par- effect. We shall simply use the words ‘cause’ ticular case considered. Even the word pos- and ‘effect’ as statistical terms similar to in- sibilities in this statement unduly limits the dependent and dependent variables, or [pre- scope of the practical procedures in which he dictor variables and response variables] (Li, is trained; for he is early made familiar with 1975, p. 3). the advantages of imaginary solutions, and can most readily think of a , or an alter- One can often partition total evolutionary change into nating current, in terms of the square root of separate components. Those separate components may minus one. sometimes be thought of as the separate causes of evo- lutionary change. The meaning of “cause” is of course The immense power of mathematical insight from false a difficult problem. We are constrained by the fact that or apparently meaningless assumptions shapes nearly ev- we only have access to empirical correlations, and cor- ery aspect of our modern lives. The problem with the relation is not causation. Within that constraint, I fol- intuitively attractive precise and realistic assumptions is low Li’s suggestion to learn what we can about causa- that they typically provide exactness about a reality that tion by studying the possible structural relations between does not exist. One never has a full set of true assump- variables. Those structural relations express hypotheses tions. By contrast, false or apparently meaningless as- about cause. Alternative structural relations may fit the sumptions, properly chosen, can provide profound insight data more or less well. Those alternatives may also sug- into the logical and the empirical structure of nature. gest testable predictions that can differentiate between That truth may not be easy to grasp. But experience the relative likelihood of the different causal hypotheses has shown it to be so over and over again. (Crespi, 1990; Frank, 1997b, 1998; Scheiner, Mitchell, and Callahan, 2000). In evolutionary studies, one typically tries to explain Frequency change. I begin with a basic model of fit- how environmental and biological factors influence char- ness and frequency change. There are n different types acters. Causal analysis separates into two steps. How of individuals. The frequency of each type is qi. Each type has Ri offspring, where R expresses reproductive do alternative character values influence fitness? How ¯ P much of the character values is transmitted to following success. Average reproductive success is R = qiRi, summing over all of the different types indexed by the i generations? These two steps are roughly the causes of ¯ selection and the causes of transmission. subscripts. Fitness is wi = Ri/R, used here as a mea- sure of relative success. The frequency of each type after selection is 0 qi = qiwi. (1) 5

To obtain useful equations of selection, we must consider Characters and covariance. We can express the change. Subtracting qi from both sides of Eq. (1) yields fundamental equation of selection (Eq. 3) in terms of the covariance between fitness and character value. Many of ∆qi = qi (wi − 1) , (2) the classic equations of selection derive from the covari- 0 ance form. Combining Eqs. (2) and (3) leads to in which ∆qi = qi − qi is the change in the frequency of each type. X X ∆sz¯ = ∆qizi = qi (wi − 1) zi. (6)

Change caused by selection. We often want to The right-hand side matches the definition for covariance know about the change caused by selection in the value ∆sz¯ = Cov(w, z). (7) of a character. Suppose that each type, i, has an associ- ated character value, zi. The average character value in We can rewrite a covariance as a product of a regression P the initial population isz ¯ = qizi. The average char- coefficient and a variance term 0 P 0 0 acter value in the descendant population isz ¯ = qizi. For now, assume that descendants have the same aver- ∆sz¯ = Cov(w, z) = βzwVw, (8) age character value as their ancestors, z0 = z . Then i i where βzw is the regression of phenotype, z, on fitness, w, z¯0 = P q0z , and the change in the average value of the i i and Vw is the variance in fitness. The statistical covari- character caused by selection is ance, regression, and variance functions commonly arise X X X in the literature on selection (Robertson, 1966; Price, z¯0 − z¯ = ∆ z¯ = q0z − q z = (q0 − q ) z , s i i i i i i i 1970; Lande and Arnold, 1983; Falconer and Mackay, where ∆s means the change caused by selection when 1996). ignoring all other evolutionary forces (Price, 1972b; Ewens, 1989; Frank and Slatkin, 1992). Using ∆qi = 0 Quantitative and genetic characters. The char- qi − qi for frequency changes yields acter z can be a quantitative trait or a gene frequency X ∆sz¯ = ∆qizi. (3) from the classical equations of population genetics. In a population genetics example, assume that each indi- This equation expresses the fundamental concept of se- vidual carries one allele. For the ith individual, zi = 0 lection (Frank, 2012b). Frequencies change according to when the individual carries the normal allelic type, and differences in fitness (Eq. 2). Thus, selection is the change zi = 1 when the individual carries a variant allele. Then in character value caused by differences in fitness, hold- the frequency of the variant allele in the ith individual is ing constant other evolutionary forces that may alter the pi = zi, the allele frequency in the population isp ¯ =z ¯, character values, zi. and the initial frequencies of each of the N individuals is qi = 1/N. From Eq. (6), the change in allele frequency is Change during transmission. We may consider the 1 X other forces that alter characters as the change during ∆sp¯ = (wi − 1) pi. (9) 0 N transmission. In particular, define ∆zi = zi − zi as the difference between the average value among descendants From the prior section, we can write the population ge- derived from ancestral type i and the average value of netics form in terms of statistical functions P 0 ancestors of type i. Then qi∆zi, is the change dur- ing transmission when measured in the context of the ∆sp¯ = Cov(w, p) = βpwVw. (10) descendant population. Here, q0 is the fraction of the i For analyzing allele frequency change, the population descendant population derived from ancestors of type i. genetics form in Eq. (9) is often easier to understand than Thus, the total change, ∆¯z =z ¯0 −z¯, is exactly the sum Eq. (10), which is given in terms of statistical functions. of the change caused by selection and the change during This advantage for the population genetics expression to transmission study allele frequency emphasizes the value of using spe- X X 0 ∆¯z = ∆qizi + qi∆zi, (4) cialized tools to fit particular problems. By contrast, the more abstract statistical form in a form of the Price equation (Price, 1972a; Frank, Eq. (10) has advantages when studying the conceptual 2012b). We may abbreviate the two components of total structure of natural selection and when trying to par- change as tition the causes of selection into components. Suppose, ∆¯z = ∆ z¯ + ∆ z,¯ (5) for example, that one only wishes to know whether the al- s c lele frequency is increasing or decreasing. Then Eq. (10) which partitions total change into a part ascribed to nat- shows that it is sufficient to know whether βpw is pos- ural selection, ∆s, and a part ascribed to changes in char- itive or negative, because Vw is always positive. That acters during transmission, ∆c. The change in transmis- sufficient condition is difficult to see in Eq. (9), but is sion subsumes all evolutionary forces beyond selection. immediately obvious in Eq. (10). 6

Variance, distance or information. The variance the change in character value caused by selection. We in fitness, Vw, arises in one form or another in every ex- can think of that replacement as altering the coordinates pression of selection. Why is the variance a universal on which we measure change, from the frequency changes 0 metric of selection? Clearly, variation matters, because described by fitness, wi = qi/qi, to the character values selection favors some types over others only when the al- described by zi. ternatives differ. But why does selection depend exactly Although this description in terms of coordinates may on the variance rather than on some other measure of seem a bit abstract, it is essential for thinking about variation? evolutionary change in relation to selection. Selection I will show that natural selection moves the population changes frequencies. The consequences of frequency for a certain distance. That distance is equivalent to the the change in characters depend on the coordinates that variance in fitness. Thus, we may think about the change describe the translation between frequency change and caused by selection equivalently in terms of variance or characters (Frank, 2012c, 2013a). distance. Consider the expression from Eq. (4) for total evolu- Begin by noting from Eq. (2) that ∆qi/qi = wi − 1. tionary change Then, the variance in fitness is X X 0 ∆¯z = ∆qizi + qi∆zi.  2 2 X 2 X ∆qi X (∆qi) Vw = qi(wi − 1) = qi = . qi qi This is an exact expression that includes four aspects of (11) evolutionary change. First, the change in frequencies, The squared distance in Euclidean geometry is the sum ∆qi, causes evolutionary change. Second, the amount of the squared changes in each dimension. On the right is of change depends on the coordinates of characters, zi. the sum of the squares for the change in frequency. Each Third, the change in the coordinates of characters during dimension of squared distance is divided by the original transmission, ∆zi, causes evolutionary change. Fourth, frequency. That normalization makes sense, because a the changed coordinates have their consequences in the small change relative to a large initial frequency means context of the frequencies in the descendant population, 0 less than a small change relative to a small initial fre- qi. quency. The variance in fitness measures the squared dis- tance between the ancestral and descendant population in terms of the frequencies of the types (Ewens, 1992). Variance and scale. In models of selection, one of- When the frequency changes are small, the expres- ten encounters the variance in characters, Vz, rather than sion on the right equals the Fisher information mea- the variance in fitness, Vw. The variance in characters is sure (Frank, 2009). A slightly different measure of infor- simply a change in scale with respect to the variance in mation arises in selection equations when the frequency fitness—another way in which to describe the translation changes are not small (Frank, 2012c), but the idea is the between coordinates for frequency change and the coor- same. Selection acquires information about environmen- dinates for characters. In particular, tal challenge through changes in frequency. Thus, we may think of selection in terms of variance, ∆sz¯ = Cov(w, z) = βwzVw = βzwVz, (12) distance or information. Selection moves the population thus V = γV . Here, γ is based on the regression coeffi- frequencies a distance that equals the variance in fitness. z w cients. The value of γ describes the rescaling between the That distance is equivalent to the gain in information by variance in characters and the variance in fitness. Thus, the population caused by selection. when Vz arises in selection equations, it can be thought of as the rescaling of Vw in a given context. Characters and coordinates. We can think of fit- ness and characters as alternative coordinates in which Description and causation. Eq. (12) describes as- to measure the changes caused by natural selection in sociations between characters and fitness. We may rep- frequency, distance, and information. Using Eq. (2), we resent those associations as z ↔ w. Here, we know only can rewrite the variance in fitness from Eq. (11) as that a character, z, and fitness, w, are correlated, as ex-

X 2 X pressed by Cov(w, z). We do not know anything about Vw = qi(wi − 1) = ∆qiwi. the causes of that correlation. But we may have a model about how variation in characters causes variation in fit- Compare that expression with Eq. (3) for the change in ness. To study that causal model, we must analyze how character value caused by selection the hypothesized causal structure predicts correlations X between characters, fitness and evolutionary change. Al- ∆sz¯ = ∆qizi. ternative causal models provide alternative hypotheses and predictions that can be compared with observation If we start with the right side of the expression for the (Crespi, 1990; Frank, 1997b, 1998; Scheiner, Mitchell, variance in fitness and then replace wi by zi, we obtain and Callahan, 2000). 7

Regression equations provide a simple way in which to y. If we expand Cov(w, z) in Eq. (12) with the full ex- express hypothesized causes (Li, 1975). For example, we pression for fitness in Eq. (14), we obtain may have a hypothesis that the character z is a primary cause of fitness, w, expressed as a directional path dia- ∆sz¯ = (βwz·y + βwy·zβyz) Vz. (15) gram z → w. That path diagram, in which z is a cause of w, is mathematically equivalent to the regression equa- Following Queller (1992), I abbreviate the three regres- tion sion terms. The term, βyz = r, describes the association between the phenotype, z, and the other predictor of fit- wi = φ + βwzzi + i, (13) ness, y. An increase in z by the amount ∆z corresponds to an average increase of y by the amount ∆y = r∆z. in which φ is a constant, and i is the difference between The term, βwy·z = B, describes the direct effect of the the actual value of zi and the value predicted by the other predictor, y, on fitness, holding constant the fo- model, φ + βwzzi. cal phenotype, z. The term, βwz·y = −C, describes the direct effect of the phenotype, z, on fitness, w, holding constant the effect of the other predictor, y. Partitions of cause. To analyze causal models, we The condition for the increase of z by selection is focus on the general relations between variables rather ∆sz¯ > 0. The same condition using the terms on the than on the values of particular types. Thus, we can drop right side of Eq. (15) and the abbreviated notation is the i subscripts in Eq. (13) to simplify the expression, as in the following expanded regression equation rB − C > 0. (16)

w = φ + βwz·yz + βwy·zy + . (14) This condition applies whether the association between z and y arises from some unknown extrinsic cause (Fig. 2a) Here, fitness w depends on the two characters, z and y or by the direct relation of z to y (Fig. 2b). (Lande and Arnold, 1983). The partial regression coeffi- This expression in Eq. (16) describes the condition for cient βwz·y is the average effect of z on w holding y con- selection to increase the character, z, when ignoring any stant, and βwy·z is the average effect of y on w holding z changes in the character that arise during transmission. constant. Regression coefficients minimize the total dis- Thus, when one wants to know whether selection acting tance (sum of squares) between the actual and predicted by this particular causal scheme would increase a charac- values. Minimizing the residual distance maximizes the ter, it is sufficient to know if this simple condition holds. use of the information contained in the predictors about the actual values. This regression equation is exact, in the sense that it is Testing causal hypotheses. If selection favors an an equality under all circumstances. No assumptions are increase in the character z, then the condition in Eq. (16) needed about additivity or linearity of z and y or about will always be true. That condition simply expresses the normal distributions for variation. Those assumptions fact that the slope of fitness on character value, βwz, must arise in statistical tests of significance when comparing be positive when selection favors an increase in z. The the regression coefficients to hypothesized values or when expression rB − C is one way in which to partition βwz predicting how the values of the regression coefficients into components. However, the fact that rB − C > 0 change with context. does not mean that the decomposition into those three Note that the regression coefficients, β, often change components provides a good causal explanation for how as the values of w or z or y change, or if we add another selection acts on the character z. predictor variable. The exact equation is a description of There are many alternative ways in which to partition the relations between the variables as they are given. The the total effect of selection into components. Other char- structure of the relations between the variables forms a acters may be important. Environmental or other extrin- causal hypothesis that leads to predictions (Li, 1975). sic factors may dominate. How can we tell if a particular causal scheme is a good explanation? If we can manipulate the effects r, B or C directly, we Partitions of fitness. We can interpret Eq. (14) as a can run an experiment. If we can find natural compar- hypothesis that partitions fitness into two causes. Sup- isons in which those terms vary, we can test compara- pose, for example, that we are interested in the direct tive hypotheses. If we add other potential causes to our effect of the character z on fitness. To isolate the direct model, and the original terms hold their values in the effect of z, it is useful to consider how a second character, context of the changed model, that stability of effects y, also influences fitness (Fig. 2). under different conditions increases the likelihood that The condition for z to increase by selection can be the effects are true. evaluated with Eq. (12). That equation simply states Three points emerge. First, a partition such as rB −C that z increases when it is positively associated with fit- is sufficient to describe the direction of change, because ness. However, we now have the complication shown in a partition simply splits the total change into parts. Sec- Eq. (14) that fitness also depends on another character, ond, a partition does not necessarily describe causal rela- 8

y B y B r w r –C z –C w z (a) (b)

FIG. 2. Path diagrams for the effects of phenotype, z, and secondary predictor, y, on fitness, w. (a) An unknown cause associates y and z. The arrow connecting those factors points both ways, indicating no particular directionality in the hypothesized causal scheme. (b) The phenotype, z, directly affects the other predictor, y, which in turn affects fitness. The arrow pointing from z to y indicates the hypothesized direction of causality. The choice of notation matches kin selection theory, in which z is an altruistic behavior that reduces the fitness of an actor by the cost C and aids the fitness of a recipient by the benefit, B, and r measures the association between the behaviors of the actor and recipient. Although that notation comes from kin selection theory, the general causal scheme applies to any pair of correlated characters that influences fitness (Lande and Arnold, 1983; Queller, 1992). From Frank (2013a). tions in an accurate or useful way. Third, various meth- Fisher (1918) first presented this regression for pheno- ods can be used to test whether a causal hypothesis is a type in terms of alleles as the predictors. Suppose good explanation. X g = bjxj, (17) j Multiple characters and nonadditivity. Instead of the simple causal schemes in Fig. 2, there may be mul- in which xj is the presence or absence of an allelic type. tiple characters yj correlated with z. Then the condition Then each bj is the partial regression of an allele on phe- P for the increase of z becomes rjBj −C > 0, in which rj notype, which describes the average contribution to phe- is the regression of yj on z, and Bj is the partial regres- notype for adding or subtracting the associated allelic sion of w on yj, holding constant all other characters. type. The coefficient bj is called the average allelic ef- This method also applies to multiplicative interactions fect, and g is called the breeding value (Fisher, 1930; between characters. For example, suppose π12 = y1 × y2, Crow and Kimura, 1970; Falconer and Mackay, 1996). for characters y1 and y2. Then we can use π12 as a char- When g is defined as the sum of the average effects of the acter in the same type of analysis, in which r would be underlying predictors, then βzg = 1, and the regression of π12 on z, and B would be the partial re- z = g + δ, (18) gression of w on π12 holding constant all other variables. where δ = z −g is the difference between the actual value Partitions of characters. We have been studying and the predicted value. the partition of fitness into separate causes, including the Some facts will be useful in the next section. If we take the average of both sides of Eq. (18), we getz ¯ =g ¯, role of individual characters. Each character may itself ¯ be influenced by various causes. Describe the cause of a because δ = 0 by the theory of regression. If we take the character by a regression equation variance of both sides, we obtain Vz = Vg + Vδ, noting that, by the theory of regression, g and δ are uncorre- lated. z = φ + βzgg + δ, in which φ is a constant that is traditionally set to zero in this equation, g is a predictor of phenotype, the regres- Heritability and the response to selection. To sion coefficient βzg is the average effect of g on phenotype study selection, we first need an explicit form for the z, and δ = z − βzgg is the residual between the actual relation between character value and fitness, which we value and the predicted value. This regression expression write here as describes phenotypic value, z, based on any predictor, g. For predictors, we could use temperature, neighbors’ be- w = φ + βwzz + . havior, another phenotype, epistatic interactions given as the product of allelic values, symbiont characters, or an Substitute that expression into the covariance expression individual’s own genes. of selection in Eq. (12), yielding

∆sz¯ = Cov(w, z) = βwzVz = sVz, (19) 9 because φ is a constant and  is uncorrelated with z, caus- component expressed in the coordinates of the average ing those terms to drop out of the covariance. Here, the effects of the predictors, and ∆tz¯ = ∆cg¯, the total change selective coefficient s = βwz is the effect of the character in coordinates with respect to the average effects of the on fitness. Expand sVz by the partition of the character predictors. variance given in the previous section, which leads to

∆sz¯ = sVz = sVg + sVδ = ∆gz¯ + ∆nz.¯ (20) Choice of predictors. If natural selection dominates other evolutionary forces, then we can use the theory of We can think of g as the average effect of the predictors natural selection to analyze evolutionary change. When of phenotype that we have included in our causal model does selection dominate? From Eq. (22), the change in of character values. Then sV = ∆ z¯ is the component g g phenotype caused by selection is ∆g. If the second term of total selective change associated with our predictors, ∆t is relatively small, then we can understand evolution- and ary change primarily through models of selection. A small value of the transmission term, ∆t, arises if ∆gz¯ = ∆sz¯ − ∆nz,¯ (21) the effects of the predictors in our causal model of phe- notype remain relatively stable between ancestors and shows that the component of selection transmitted to de- descendants. Many factors may influence phenotype, in- scendants through the predictors included in our model, cluding alleles and their interactions, maternal effects, ∆ , is the change caused by selection, ∆ , minus the part g s various epigenetic processes, changing environment, and of the selective change that is not transmitted through so on. Finding a good causal model of phenotype in terms the predictors, ∆ . Although it is traditional to use al- n of predictors is an empirical problem that can be studied leles as predictors, we can use any hypothesized causal by testing alternative causal schemes against observation. scheme. Thus, the separation between transmitted and Note that the equations of evolutionary change do not nontransmitted components of selection depends on the distinguish between different kinds of predictors. For hypothesis for the causes of phenotype. example, one can use both alleles and weather as pre- If we choose the predictors for g to be the individual dictors. If weather varies among types and its average alleles that influence phenotype, then V is the traditional g effect on phenotype transmits stably between ancestors measure of genetic variance, and sV is that component g and descendants, then weather provides a useful predic- of selective change that is transmitted from parent to tor. Variance in stably transmitted weather attributes offspring through the effects of the individual alleles. The can lead to changes in characters by selection. fraction of the total change that is transmitted, V /V , g z Calling the association between weather and fitness an is a common measure of heritability. aspect of selection may seem strange or misleading. One can certainly choose to use a different description. But the equations themselves do not distinguish between dif- Changes in transmission and total change. This ferent causes. section describes the total evolutionary change when con- sidered in terms of the parts of phenotype that are trans- mitted to descendants. Here, the transmitted part arises Sufficiency and invariance. To analyze natural se- from the predictors in an explicit causal hypothesis about lection, what do we need to know? Let us compare two phenotype. alternatives. One provides full information about how The expression of characters in terms of predictors the population evolves over time. The other considers from Eq. (18) is z = g + δ. From that equation,z ¯ =g ¯, only how natural selection alters average character val- because the average residuals of a regression, δ¯, are zero. ues at any instant in time. Thus, when studying the change in a character, we have A full analysis begins with the change in frequency ∆¯z = ∆¯g, which means that we can analyze the change given in Eq. (2), as in a character by studying the change in the average ef- fects of the predictors of a character. Thus, from Eq. (4), ∆q = q (w − 1) . we may write the total change in terms of the coordinates i i i of the average effects of the predictors, g, yielding For each type in the population, we must know the initial frequency, qi, and the fitness, wi. From those values, each X X 0 ∆¯z = ∆qigi + qi∆gi = ∆gz¯ + ∆tz,¯ (22) new frequency can be calculated. Then new values of fit- nesses would be needed to calculate the next round of in which ∆tz¯ is the change in the average effects of the updated frequencies. Fitnesses can change with frequen- predictors during transmission (Frank, 1997b, 1998). The cies and with extrinsic conditions. That process provides total change divides into two components: the change a full description of the population dynamics over time. caused by the part of selection that is transmitted to The detailed output concerning dynamics reflects the de- descendants plus the change in the transmitted part of tailed input about all of the initial frequencies and all of phenotype between ancestors and descendants. Alter- the fitnesses over time. natively, we may write ∆gz¯ = ∆sg¯, the total selective 10

A more limited analysis arises from the part of total ing transmission. Selection among groups favors coop- evolutionary change caused by selection. If we focus on eration; selection within groups favors selfishness that the change by selection in the average value of a character decays the transmission of cooperative behavior. at any point in time, we have Total change in terms of selection and transmission X (Eq. 5) is ∆sz¯ = ∆qizi = Cov(w, z) = βzwVw, ∆¯z = ∆sz¯ + ∆cz¯ = ∆S + ∆τ, from Eqs. (6) and (8). To calculate the average change caused by selection, it is sufficient to know the covari- which may alternatively be expressed in terms of pre- ance between the fitnesses and character values over the dictors, as in Eq. (22). An equilibrium balance between population. We do not need to know the individual fre- selection and mutation, or between different levels of se- quencies or the individual fitnesses. A single summary lection, occurs when statistic over the population is sufficient. A single as- ∆S = −∆τ. (23) sumed input corresponds to a single output. We could, of course, make more complicated assumptions and get The strength of selection bias relative to endogenous more complicated outputs. What we get out matches change during transmission is what we put in. ∆S Invariance provides another way to describe sufficiency. R = log , (24) ∆τ The mean change in character value caused by selection is invariant to all aspects of variability except the co- assuming that the forces oppose (Frank, 2012a). The variance. The reason is that the variance in fitness, Vw, provides a natural measure of relative strength, describes the distance the population moves with regard centered at zero when the balance in Eq. (23) holds. to frequencies, and the regression βzw rescales the dis- We may write the selection term as ∆S = sVz from tance along coordinates of frequency into distance along Eq. (12), in which the selective coefficient s = βwz is the coordinates of the character. slope of fitness on character value. Then, if the equilib- The analysis of two characters influencing fitness pro- rium in Eq. (23) exists, the character variance is vides another example of sufficiency and invariance. The −∆τ condition for the average value of the focal character to Vz = . (25) increase by selection is given by rB − C > 0 in Eq. (16). s That condition shows that all other details about vari- ability and correlation between the two characters and fitness do not matter. Thus, the direction of change Mutation versus selection. Suppose each individ- caused by selection is invariant to most details of the ual has one allele. Let a normal allele have character population. Simple invariances of this kind often provide value z = 0 and a mutant allele have value z = 1. Then great insight into otherwise complex problems (Frank, the average character value in the population,z ¯ ≡ q, is 2013b). the frequency of the mutant allele. The variance in the Fisher’s fundamental theorem of natural selection is a character is the binomial variance of the allele frequency, simple invariance (Frank, 2012d). The theorem states Vz = q(1 − q), thus ∆S = sq(1 − q). The selective inten- that, at any instant in time, the change in average fit- sity against the mutant allele, s, is negative, because the ness caused by selection is equal to the variance in fit- mutant decreases fitness. We may uses ˆ = −s to obtain a ness. Fisher was particularly interested in the transmis- positive value for the intensity of selection. The change sible component of fitness based on a causal model of in frequency during transmission is ∆τ = µ(1 − q), in alleles. Thus, his variance in fitness is Vg from Eq. (20) which normal alleles at frequency 1 − q are changed into based on the average effects of alleles on fitness. Fisher’s mutant alleles at a rate µ. theorem shows that the change in mean fitness by selec- Using these expressions in the balance between selec- tion is invariant to all details of variability in the popula- tion bias and transmission bias (Eq. 23) yields the equi- tion except the variance associated with the transmissible librium frequency of the mutant allele predictors. µ q = . (26) sˆ Transmission versus selection. The ratio of selective intensity to transmission bias is

In evolutionary theory, a gene could be de- sqˆ R = log . (27) fined as any hereditary information for which µ there is a ... selection bias equal to several This ratio depends on frequency, q. Whensq ˆ > µ, selec- or many times its rate of endogenous change tion pushes down the mutant frequency. Whensq ˆ < µ, (Williams, 1966). mutation pushes up the mutant frequency. The relative Selection and transmission often oppose each other. strength of selection to transmission is often frequency Selection increases fitness; mutation decays fitness dur- dependent (Fig. 3a). 11

8 (a) (b) log(s/^ ) = 5 R 4 6 r = 4

2 4 r = 2 log(s/^ ) = 2 = 0 0 2 r 2 = – ^ 0 r –2 log(s/) = –1 –4 –2 r = –4 –4 –6 –6

Selection-transmission ratio, –8 –8 –6 –4 –2 0 2 4 6 –4 –2 0 2 4 Mutant frequency, log[q/(1–q)] Competitive intensity, log[z/(1–z)]

FIG. 3. The opposing forces of selection and transmission change with context. Equilibrium occurs when the forces balance at R = 0 in Eq. (24). In these examples, when the ratio R is positive, selection dominates and pushes down the character values, and when R is negative, transmission bias dominates and pushes up character values. (a) The opposition of selection and mutation (Eq. 27). (b) The opposition of selection bias among groups versus transmission bias within groups (Eq. 30), withr ˆ = r/(1 − r). All use base 10. From Frank (2012a).

Stabilizing selection. The previous example con- q of asexuality would be cerned directional selection. In that case, mutation adds µ q ≈ , deleterious mutations and selection removes those mu- sˆ tations. Alternatively, stabilizing selection may favor an matching the simple genetic model of mutation-selection optimal character value, and mutation may tend to move balance in Eq. (26). the character value away from the optimum. Van Valen also applied this approach to mammals. In For example, fitness may drop off with the squared mammals, genera with larger body size survive longer distance from the optimum. Let the underlying character than genera with smaller body size, but the smaller- value be y, and the squared distance from the optimum bodied genera produce new genera at a higher rate. The y∗ be z = (y − y∗)2. The drop off in fitness with squared net reproductive rate of small genera is higher, giving deviation from the optimum is given by s = β = −sˆ. wz a selective advantage to small-bodied genera over large- If mutations occur with probability µ, and a mutation bodied genera. Within genera, there is a bias towards has an equal chance of increasing or decreasing character larger body size. The distribution of mammalian body value by c, then the transmission bias in terms of squared size is influenced by the balance between selection among deviations is ∆τ = µc2 = V , in which we can think of ∆τ µ genera favoring smaller size and selection within genera as the variance added by mutation, V . From Eq. (25), µ favoring larger size. the equilibrium balance between mutation and selection occurs at V Kin and group selection. Van Valen considered V = µ . z sˆ clade selection as roughly similar to the balance of se- lection among clades and mutation within clades. A more accurate description arises by considering the rela- Clade selection. The opposition of selection at differ- tive strength of selection at the higher and lower levels ent levels is similar to the balance between mutation and Frank (2012a). selection. For example, Van Valen (1975) argued that Start with the equality ∆S = −∆τ that describes the sexuality may increase the reproductive rate of clades by fundamental balance between selection bias and trans- enhancing the speciation rate. That advantage in compe- mission bias (Eq. 23). We may rewrite the balance as tition between clades may be offset by the disadvantage ∆SA = −∆SW , the equality of selection among groups of sexuality within clades, because sexual reproduction is and selection within groups. We can express selective less efficient than asexual reproduction. change as ∆S = sV, the product of selective intensity If the selection bias between clades favoring sexuality and character variance (Eq. 19). Thus, the balance be- iss ˆ, and the transmission bias against sexuality within tween selection at a higher level and transmission bias clades is µ, then the approximate equilibrium frequency caused by selection at a lower level is

sA VA = −sW VW . 12

The total variance is the sum of the variances among and declines. In this model, reduced competitiveness may within groups, VT = VA + VW , thus we may write be thought of as increased cooperation. Thus, r deter- mines the balance between cooperation favored by selec- s V = −s (V − V ). A A W T A tion among groups and competitiveness favored by the transmission bias of selection within groups (Fig. 3b). Dividing both sides by the total variance, VT , yields

sA r = −sW (1 − r), (28) ESS by fitness maximization. We could partition in which r = V /V measures the correlation in charac- A T the fitness expression in Eq. (29) into components by us- ter values between individuals within groups or, equiva- ing regression equations. That approach would split fit- lently, the ratio of the variance among groups to the total ness into the part explained by the character for individ- variance. ual competitiveness, z, and the part explained by group In general, the variance V provides a weighting that competitiveness, z . Each part would be weighted by its describes the consequences of selection. Thus, r can be W partial regression of fitness on the character, holding the thought of as the fraction of the total weighting of selec- other character constant. The problem is that the non- tion that happens at the group level, and 1 − r can be linearity in Eq. (29) makes it difficult to calculate the thought of as the fraction of the total weighting of selec- regression coefficients. tion that happens within groups. Here, r may often be If we assume that character variances are small, the fol- interpreted as a form of the regression coefficient of relat- lowing trick often makes the analysis very easy (Frank, edness from kin selection theory (Hamilton, 1970). Thus, 1995, 1997a, 1998; Taylor and Frank, 1996). With small we may think of this balance between levels of selection variances, the partial regression coefficients of w with re- in terms of either kin or group selection (Hamilton, 1975; spect to each character will be close to the partial deriva- Frank, 1986). tive of w with respect to each character. For two char- acters, z and y, the condition for selection to favor an increase in z is Competition versus cooperation. This section il- lustrates the balance between opposing selection at dif- dw ∂w dy ∂w = + > 0. (31) ferent levels. Suppose fitness is dz ∂y dz ∂z z w = (1 − z ). (29) z W Because variances are small, we can relate each derivative W term to a regression coefficient and also to the r, B and The first term describes the competitiveness of a type, C terms of Fig. 2. In particular, dw/dz = βwz, describes z, relative to the average competitiveness of its neigh- the slope of fitness on character value, which determines bors within a group, zW . The second term, 1 − zW , de- the direction of selection. That total change depends scribes the success of the group in competition against on dy/dz = βyz = r, which describes the slope of the other groups (Frank, 1994, 1995). character y relative to the focal character z. The term We can use Eq. (28) to evaluate the balance between ∂w/∂y = βwy·z = B describes the slope of fitness on y, selection at the group level and selection within groups. holding constant z. The term, ∂w/∂z = βwz·y = −C, de-

The selective intensity among groups is sA = −1, the scribes the direct effect of z on fitness, holding constant, slope of group fitness, 1 − zW , relative to group phe- y. notype, zW . The selective intensity within groups is Using those definitions, the condition reduces to rB − sW = (1 − zW )/zW , which is the change in individual C > 0. The method extends to any number of char- fitness, w, with the change in individual character value, acters. The advantage here is that the specific terms z, holding constant group phenotype, zW . follow automatically from differentiation, given any ex-

Substituting these values for sA and sW into Eq. (28) pression for the functional relation between fitness and yields a balance, ∆S = −∆τ, between group and indi- various characters. The method assumes that charac- vidual selection as ter variances are small and that characters vary contin- 1 − z uously. The approach works well in finding a potential − r = − W (1 − r). (30) z equilibrium that is an evolutionary stable strategy (ESS), W which means that the equilibrium character value tends If the population approaches an equilibrium balance be- to outcompete other character values that differ by a tween the opposing components of selection, then indi- small amount (Maynard Smith and Price, 1973; May- vidual and group phenotypes converge such that z = nard Smith, 1982). ∗ zW = z , and we obtain For simple problems, the equilibrium reduces to rB = C, which balances opposing components of selection. If z∗ = 1 − r. we use the fitness expression in Eq. (29), letting y ≡

The coefficient r measures the of individuals zW , and assume that variances are small so that near ∗ within groups or, equivalently, the variance in pheno- a balance we have z = zW = z , then the method in type among groups. As r increases, competitiveness z∗ Eq. (31) allows the use of simple differentiation to obtain 13 z∗ = 1 − r. cess and lower variability in success may be favored rela- tive to a character with higher average success and higher variability. The theory closely matches economic con- Dynamics of phenotypic evolution. The differen- cepts of risk aversion (Gillespie, 1973; Frank, 2011a). tiation method simplifies study of the equilibrium bal- Reproductive valuation arises when individuals in dif- ance between opposing components of selection (Eq. 31). ferent classes contribute differently to future populations. We only need an expression for the causal relations be- Old individuals may have less chance of future reproduc- tween fitness and characters. The main problem concerns tion than young individuals. When evaluating the po- dynamics. Real populations may never follow a path to tential success of a character, one must consider if the the equilibrium balance of forces. To study dynamics, character occurs differently between classes of individu- one could specify all aspects of how genes and other pre- als (Charlesworth, 1994). dictors affect phenotypes, and all aspects of spatial and Nonheritable variation arises when the same set of temporal variability. Then one could calculate the dy- genes or predictors leads to diverse character values. Fit- namical path along which populations evolve. ness of a type is the average over the range of character Equilibrium balance and dynamical analysis are two values expressed, altering the relation between transmis- distinct tools, each with advantages and limitations. sible predictors of characters and fitness. Variability in The equilibrium models require few specific assumptions. expression can itself be a character influenced by selec- They describe simply how changed assumptions alter pre- tion (West-Eberhard, 2003; Frank, 2011b). dicted outcomes. But simple equilibrium models ignore Evolvability arises when a character increases the po- many constraints and other forces that must be present tential for future evolution. A character may raise the in any real case. A mathematician trained in the careful chance of producing novel phenotypes. Future potential analysis of dynamical models often finds such extreme for exploring phenotypic novelty may reduce immediate simplifications abhorrent. So many things are either un- fitness. Selection theory must be extended to evaluate specified or potentially wrong, that almost certainly some conflicting forces at different timescales (Wagner and Al- aspect of the analysis will be misleading or incorrect. tenberg, 1996). The dynamical models require more assumptions and provide more insight into the evolutionary paths that populations may follow. But those models, by requiring Deductive versus inductive perspectives. Some- significant detail as input, may be exact mathematical times it makes sense to think in terms of deductive predic- analyses that apply exactly to nothing. Biologists trying tions. What do particular assumptions about initial con- to understand natural history are often puzzled by the ditions, genetic interactions, fitness of individuals, and use of so many precise assumptions that cannot match spatial complexity predict about evolutionary dynamics? any real aspect of biology. Sometimes it makes sense to think in terms of induc- Each approach, applied judiciously, reflects a different tive analysis. Given certain frequency changes and the aspect of reality. Given the complexities of most biolog- total distance between ancestor and descendant popula- ical problems, the simpler equilibrium method has been tions, how much do different causes explain of that total much more successful in the design and interpretation of distance? empirical studies. The more detailed dynamical models Mathematical theories often analyze deductive models help most when exploring, in theory, potentially complex of dynamics. Practical applications to empirical prob- interactions that would be difficult study by simpler ap- lems often inductively partition causes. In practical ap- proaches. plications, one asks: How well do various alternative causal structures fit with the observed or assumed pat- tern of change? What character values are causally con- Other topics. Evolutionary conflict arises when the sistent with lack of change near an equilibrium? causes of characters transmit in different ways. Differ- The deductive and inductive approaches each have ent transmission pathways lead to opposing relations be- benefits. Deductive approaches often provide the only tween character values and fitness. Simple theories of way to study the consequences of particular assumptions. selection identify such conflicts. The complexity of con- Inductive approaches often provide the only way to an- flict creates challenges in analyzing the causes of selection alyze the causes of particular patterns. This article em- (Burt and Trivers, 2008). phasized the inductive analysis of cause. Drifting frequencies arise in small populations. When Consider random drift and selection. Deductively, one relatively few individuals reproduce, random processes assumes randomness in small populations and differences often influence success more strongly than differences in in expected reproductive success. From those assump- character values. The evolution of characters may de- tions, one calculates the probability that a population pend on chance events rather than adaptive changes fa- ends up in a particular state. vored by selection (Crow and Kimura, 1970). Inductively, one starts with an observed or assumed Variable selection arises when environments change total distance between the initial and final popula- over time or space. A character with lower average suc- tion. Causal hypotheses partition that total change 14 into random and selective components—one component Frank, S. A., “Natural selection. I. Variable environments and un- caused by random sampling processes and one compo- certain returns on investment,” Journal of Evolutionary Biology nent caused by characters that influence reproductive 24, 2299–2309 (2011a). Frank, S. A., “Natural selection. II. Developmental variability and success. Which of the alternative causal partitions of evolutionary rate,” Journal of Evolutionary Biology 24, 2310– total change best fits all of the assumptions or all of the 2320 (2011b). available data? Frank, S. A., “Natural selection. III. Selection versus transmission The nature of selection encourages an inductive per- and the levels of selection,” Journal of Evolutionary Biology 25, spective (Frank, 2009, 2012b). Populations change in 227–243 (2012a). Frank, S. A., “Natural selection. IV. The Price equation,” Journal frequency composition. Those frequency changes—the of Evolutionary Biology 25, 1002–1019 (2012b). actual distance between the ancestral and descendant Frank, S. A., “Natural selection. V. How to read the fundamental populations—cause populations to acquire information equations of evolutionary change in terms of information the- inductively about the environment. The transmissible ory,” Journal of Evolutionary Biology 25, 2377–2396 (2012c). predictors of characters correlated with fitness determine Frank, S. A., “Wright’s adaptive landscape versus Fisher’s funda- mental theorem,” in The Adaptive Landscape in Evolutionary the fraction of the inductively acquired information re- Biology, edited by E. Svensson and R. Calsbeek (Oxford Uni- tained by the population. versity Press, New York, 2012) pp. 41–57. Frank, S. A., “Natural selection. VI. Partitioning the information in fitness and characters by path analysis,” Journal of Evolutionary Biology 26, 457–471 (2013a). Acknowledgments. National Science Foundation Frank, S. A., “Natural selection. VII. history and interpretation of grant DEB–1251035 supports my research. Parts of this kin selection theory,” Journal of Evolutionary Biology 26, 1151– chapter were taken from my series of articles on natural 1184 (2013b). selection published in the Journal of Evolutionary Biol- Frank, S. A. and Slatkin, M., “Fisher’s fundamental theorem of ogy. natural selection,” Trends in Ecology and Evolution 7, 92–95 (1992). Gillespie, J. H., “Natural selection with varying selection coef- ficients - a haploid model,” Genetical Research 21, 115–120 Burt, A. and Trivers, R., Genes in Conflict: The Biology of Selfish (1973). Genetic Elements (Belknap Press, Cambridge, MA, 2008). Hamilton, W. D., “Selfish and spiteful behaviour in an evolutionary Charlesworth, B., Evolution in Age–Structured Populations, 2nd model,” Nature 228, 1218–1220 (1970). ed. (Cambridge University Press, Cambridge, 1994). Hamilton, W. D., “Innate social aptitudes of man: an approach Crespi, B. J., “Measuring the effect of natural selection on phe- from evolutionary genetics,” in Biosocial Anthropology, edited notypic interaction systems,” American Naturalist 135, 32–47 by R. Fox (Wiley, New York, 1975) pp. 133–155. (1990). Lande, R. and Arnold, S. J., “The measurement of selection on Crow, J. F. and Kimura, M., An Introduction to Population Ge- correlated characters,” Evolution 37, 1212–1226 (1983). netics Theory (Burgess, Minneapolis, Minnesota, 1970). Li, C. C., Path Analysis (Boxwood, Pacific Grove, California, Ewens, W. J., “An interpretation and proof of the fundamental 1975). theorem of natural selection,” Theoretical Population Biology Maynard Smith, J., Evolution and the Theory of Games (Cam- 36, 167–180 (1989). bridge University Press, Cambridge, 1982). Ewens, W. J., “An optimizing principle of natural selection in evo- Maynard Smith, J. and Price, G. R., “The logic of animal conflict,” lutionary population genetics,” Theoretical Population Biology Nature 246, 15–18 (1973). 42, 333–346 (1992). Price, G. R., “Selection and covariance,” Nature 227, 520–521 Falconer, D. S. and Mackay, T. F. C., Introduction to Quantitative (1970). Genetics, 4th ed. (Longman, Essex, England, 1996). Price, G. R., “Extension of covariance selection mathematics,” An- Fisher, R. A., “The correlation between relatives on the supposition nals of Human Genetics 35, 485–490 (1972a). of Mendelian inheritance,” Transactions of the Royal Society of Price, G. R., “Fisher’s ‘fundamental theorem’ made clear,” Annals Edinburgh 52, 399–433 (1918). of Human Genetics 36, 129–140 (1972b). Fisher, R. A., The Genetical Theory of Natural Selection (Claren- Queller, D. C., “A general model for kin selection,” Evolution 46, don, Oxford, 1930). 376–380 (1992). Frank, S. A., “Hierarchical selection theory and sex ratios I. Gen- Robertson, A., “A mathematical model of the culling process in eral solutions for structured populations,” Theoretical Popula- dairy cattle,” Animal Production 8, 95–108 (1966). tion Biology 29, 312–342 (1986). Scheiner, S. M., Mitchell, R. J., and Callahan, H. S., “Using path Frank, S. A., “Kin selection and virulence in the evolution of proto- analysis to measure natural selection,” Journal of Evolutionary cells and parasites,” Proceedings of the Royal Society of London Biology 13, 423–433 (2000). B 258, 153–161 (1994). Taylor, P. D. and Frank, S. A., “How to make a kin selection Frank, S. A., “Mutual policing and repression of competition in the model,” Journal of Theoretical Biology 180, 27–37 (1996). evolution of cooperative groups,” Nature 377, 520–522 (1995). Thompson, D. W., (Cambridge University Frank, S. A., “Multivariate analysis of correlated selection and kin Press, Cambridge, 1917). selection, with an ESS maximization method,” Journal of The- Thompson, D. W., On Growth and Form, abridged edition (Cam- oretical Biology 189, 307–316 (1997a). bridge University Press, Cambridge, 1961). Frank, S. A., “The Price equation, Fisher’s fundamental theorem, Thompson, D. W., On Growth and Form, 2nd ed (Canto reprint of kin selection, and causal analysis,” Evolution 51, 1712–1729 1961 revised edition) (Cambridge University Press, Cambridge, (1997b). 1992). Frank, S. A., Foundations of Social Evolution (Princeton Univer- Van Valen, L. M., “Group selection, sex, and fossils,” Evolution sity Press, Princeton, New Jersey, 1998). 29, 87–94 (1975). Frank, S. A., “Natural selection maximizes Fisher information,” Wagner, G. P. and Altenberg, L., “Complex adaptations and the Journal of Evolutionary Biology 22, 231–244 (2009). evolution of evolvability,” Evolution 50, 967–976 (1996). 15

West-Eberhard, M. J., Developmental Plasticity and Evolution (Oxford University Press, New York, 2003). Williams, G. C., Adaptation and Natural Selection (Princeton Uni- versity Press, Princeton, NJ, 1966).