Developmental Science 19:2 (2016), pp 251–274 DOI: 10.1111/desc.12309

PAPER A mathematical model of the evolution of individual differences in developmental plasticity arising through parental bet-hedging Willem E. Frankenhuis,1 Karthik Panchanathan2 and Jay Belsky3

1. Behavioural Science Institute, Radboud University Nijmegen, The Netherlands 2. Department of Anthropology, University of Missouri, USA 3. Human Ecology, University of California, Davis, USA

Abstract

Children vary in the extent to which their development is shaped by particular experiences (e.g. maltreatment, social support). This variation raises a question: Is there no single level of plasticity that maximizes biological fitness? One influential hypothesis states that when different levels of plasticity are optimal in different environmental states and the environment fluctuates unpredictably, may favor parents producing offspring with varyinglevels of plasticity. The current article presents a mathematical model assessing the logic of this hypothesis – specifically, it examines what conditions are required for natural selection to favor parents to bet-hedge by varying their offspring’s plasticity. Consistent with existing theory from biology, results show that between- individual variation in plasticity cannot evolve when the environment only varies across space. If, however, the environment varies across time, selection can favor differential plasticity, provided fitness effects are large (i.e. variation in individuals’ plasticity is correlated with substantial variation in fitness). Our model also generates a novel restriction: Differential plasticity only evolves when the cost of being mismatched to the environment exceeds the benefits of being well matched. Based on mechanistic considerations, we argue that bet-hedging by varying offspring plasticity, if it were to evolve, would be more likely instantiated via epigenetic mechanisms (e.g. pre- or postnatal developmental programming) than genetic ones (e.g. mating with genetically diverse partners). Our model suggests novel avenues for testing the bet-hedging hypothesis of differential plasticity, including empirical predictions and relevant measures. We also discuss several ways in which future work might extend our model.

Research highlights Introduction

• We formalize Jay Belsky’s bet-hedging hypothesis of Developmental plasticity – the ability to adjust develop- differential plasticity. ment based on experience – is ubiquitous in nature • Results support the hypothesis’ logical coherence, but (Schlichting & Pigliucci, 1998), and evolves because it only under restrictive conditions. allows organisms to adaptively tailor their phenotypes to • Our model suggests novel avenues for empirically a range of environmental states (Dall, Giraldeau, Ollson, testing the bet-hedging hypothesis. McNamara & Stephens, 2005; West-Eberhard, 2003). • We suggest multiple theoretical extensions of our Empirical studies of humans show that the degree of model. plasticity itself may vary across individuals (‘differential plasticity’); that is, some individuals are shaped more ... it is advisable to divide goods which are exposed to than others by the same kinds of experiences (Belsky, some danger into several portions rather than to risk them 1997, 2005; Belsky & Pluess, 2009a, 2009b; Boyce & Ellis, all together. (Daniel Bernoulli, 1738 (trans. 1954), p. 30) 2005; Ellis, Boyce, Belsky, Bakermans-Kranenburg & van IJzendoorn, 2011; for studies of non-human ani- Put all your eggs in one basket and then watch that basket. mals, see Dingemanse & Wolf, 2013). In some cases, (Mark Twain, 1894, Pudd’nhead Wilson and Other Tales) highly plastic individuals are also more susceptible to

Address for correspondence: Willem E. Frankenhuis, Behavioural Science Institute; Radboud University Nijmegen; Montessorilaan 3, PO Box 9104; 6500 HE, Nijmegen, The Netherlands; e-mail: [email protected]

© 2015 John Wiley & Sons Ltd 252 Willem E. Frankenhuis et al. experience: that is, they are more adversely affected by offspring: one genetic by producing genetically diverse harmful environments as well as benefit more from offspring, the other experiential through pre- and post- supportive circumstances (‘differential susceptibility’; natal regulation of offspring plasticity. Belsky’s experi- Belsky, Jonassaint, Pluess, Stanton, Brummett et al., ential version of the bet-hedging hypothesis was inspired 2009; Boyce, Chesney, Alkon, Tschann, Adams et al., by the work of Boyce and Ellis (2005), who first 1995; Pluess & Belsky, 2011, 2013; Ellis et al., 2011). proposed that between-individual variation in biological Such for-better-and-for-worse susceptibility (Belsky, Bak- sensitivity to context – i.e. reactivity of neurobiological ermans-Kranenburg & van IJzendoorn, 2007) implies stress systems – can result from differences in experience that children may benefit or suffer differentially from (discussed below). In this paper, however, we are not such experiences as nurturance or abuse (Belsky & primarily concerned with the biological mechanisms that Pluess, 2009a, 2009b; Boyce et al., 1995), as well as from instantiate plasticity, or with the pathways through prevention and intervention efforts (Belsky & van which parents influence their offspring’s levels of plas- IJzendoorn, 2015; van IJzendoorn, Bakermans-Kranen- ticity. We provide references here (Belsky & Pluess, 2013; burg, Belsky, Beach, Brody et al., 2011). It is crucial to Ellis et al., 2011), and elsewhere in the paper, for readers recognize that for better and for worse, in this context, seeking more information about these mechanisms and refers to mental-health outcomes, not fitness payoffs. We processes. note this up front because we will later propose that, in fitness terms, plastic individuals achieve relatively fixed Motivating the model payoffs (rather than variable payoffs) across environ- mental conditions, because plasticity enables the devel- Our main goal is to present a mathematical model that opment of locally adaptive phenotypes (increasing evaluates the logical coherence of Belsky’s (1997, 2005; fitness), but also comes at a cost (reducing fitness). Belsky & Pluess, 2009a) bet-hedging hypothesis of A theoretical question is why between-individual var- differential plasticity. Specifically, we examine what are iation in plasticity exists. Explanations of differential the necessary conditions for natural selection to favor plasticity include differences in genes and epigenetic parents to hedge their bets by varying offspring plastic- regulation (Belsky et al., 2009; Belsky & Pluess, 2009a, ity. The idea that diversification of investments can be 2009b), as well as differences in prior experiences (Boyce & adaptive in an unpredictably fluctuating environment is Ellis, 2005; Ellis et al., 2011; Frankenhuis & Panchana- not novel: In both biology and economics, well-devel- than, 2011a, 2011b; Pluess & Belsky, 2011). This article oped literatures have addressed this issue, even if focuses on Belsky’s proposal (1997, 2005; Belsky & Pluess, psychologists – Belsky’s (1997, 2005; Belsky & Pluess, 2009a) that, when different levels of plasticity are optimal 2009a) intended audience – have not thought in such (in terms of biological fitness) in different environmental terms. Economists have focused on the conditions in states and the environment fluctuates unpredictably, which investors should diversify their portfolio so as to natural selection may favor parents producing offspring maximize their profit in a temporally fluctuating market with varying levels of plasticity. Before proceeding, a note (Bernoulli, 1738, trans. 1954; Markowitz, 1952; Stearns, on terminology: There is no implication that parents 2000). Biologists have focused on strategies that optimize deliberately weigh their options after estimating environ- organisms’ reproductive success in randomly varying mental variability and then decide on their offspring’s environments (reviewed in Childs, Metcalf & Rees, 2010; levels of plasticity. Rather, reproductive decisions result Donaldson-Matasci, Bergstrom & Lachmann, 2013; from mechanistic processes that are the products of Donaldson-Matasci, Lachmann & Bergstrom, 2008; natural selection; organisms need not be consciously Ellis, Figueredo, Brumbach & Schlomer, 2009; Meyers aware of these processes. Accordingly, when we refer to & Bull, 2002; Simons, 2009, 2011). As we will show parental ‘decisions’ or ‘choices’ – for instance, to produce below, Belsky’s (1997, 2005; Belsky & Pluess, 2009a) particular proportions of fixed and plastic offspring – we version of the bet-hedging argument is formally similar are referring to observable outcomes (epiphenomena) that to some classical bet-hedging models from biology (e.g. result from the entire array of causal mechanisms that Moran, 1992; Philippi & Seger, 1989; extended in determine behavior (from molecules to neural networks), Starrfelt & Kokko, 2012). including but not limited to cognitive processes. In biology, bet-hedging is typically discussed in the context of fixed traits, which are non-responsive to local conditions. Biologists have shown, for instance, that Mechanisms of bet-hedging parents might bet-hedge by producing offspring of Belsky (2005; Pluess & Belsky, 2009, 2011) stipulated two variable sizes (Hopper, 1999; Olofsson, Ripa & Jonzen, mechanisms by which parents could diversify their 2009), or variable developmental delay times (i.e.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 253 diapause; Childs et al., 2010; Cohen, 1966). In contrast, Before presenting the model, we first chronicle indi- Belsky applies the logic of bet-hedging to developmental vidual differences in susceptibility to environmental plasticity itself (i.e. to contexts where trait development influences (for more extensive review, see Aron, Aron depends on local conditions). In this sense, Belsky’s & Jagiellowicz, 2012; Bakermans-Kranenburg & van hypothesis is qualitatively distinct from existing bet- IJzendoorn, 2007, 2011; Belsky & Pluess, 2009a; Kim- hedging accounts. If bet-hedging theory can be applied Cohen, Caspi, Taylor, Williams, Newcombe et al., 2006), to developmental plasticity, this is interesting and may be including related theory. Because our main goal is to empirically fruitful, because plasticity is a property of formalize and analyze Belsky’s (1997, 2005; Belsky & many physiological systems in many species (Fischer, Pluess, 2009a) bet-hedging hypothesis of differential Van Doorn, Dieckmann & Taborsky, 2014). Moreover, plasticity, we will not compare theories (see Ellis et al., developmental plasticity is a basic property of many 2011). mechanisms of the human mind and hence a long- standing focus of inquiry in psychology in general (e.g. the effects of early-life experience on later-life outcomes) From risk alleles to plasticity genes and in particular. Our contribution is to capture and analyze Belsky’s The notion that individuals differ in their susceptibility hypothesis within a formal specialist-generalist, bet- to environmental effects has a long history in psychiatry hedging framework. We conceptualize low-plasticity and psychology (Belsky & Pluess, 2009a). In fact, most strategies (which produce the same phenotype despite gene–environment interaction (G9E) research con- variable environmental conditions) as specialists adapted ducted over the past decade has been informed by the to a particular niche, and high-plasticity strategies (which so-called diathesis-stress or dual-risk framework (Zuck- produce different phenotypes in different environmental erman, 1999), which stipulates that some individuals are conditions) as generalists across niches (see Wilson & especially susceptible to negative effects of contextual Yoshimura, 1994). This interpretation roots developmen- adversity. The focus of the differential-susceptibility tal psychologists’ growing interest in differential plastic- model, including G9E, differs from the diathesis-stress ity within a well-developed body of bet-hedging theory framework. Whereas the latter focuses exclusively on from evolutionary biology, and it could facilitate inte- vulnerability, the former stipulates that some individuals gration between models of bet-hedging and models of are not just more vulnerable to contextual adversity, but frequency-dependent selection that include specialists also benefit more from supportive environmental condi- and generalists (Ellis, Jackson & Boyce, 2006; Ellis et al., tions. This appears to be the case for some polymor- 2011; Wilson & Yoshimura, 1994). We regard this as phisms long regarded as ‘vulnerability genes’ or ‘risk important given the general lack of consideration of alleles’, leading to the proposal that they be regarded as evolutionary principles – and of evolutionary modeling in ‘plasticity genes’ instead (Belsky et al., 2009) – with particular – in the field of developmental psychology, ‘plasticity’ referring to phenotypic outcomes, not fitness which hinders much-needed theoretical integration with – payoffs (as noted earlier, we will argue below that plastic and consilience across – the life sciences. Our work might individuals attain moderate, not variable, fitness pay- also inspire biologists to consider between-individual offs). Notably, two recent meta-analyses of G9E differences in plasticity as a product of bet-hedging. research support the differential susceptibility theorizing: Applying the logic of bet-hedging to plastic traits is one focused on dopamine-related genes carried by reasonable when (a) phenotypic development is not fully children 10 years of age or younger (Bakermans-Kran- reversible and (b) individuals use imperfect cues to enburg & van IJzendoorn, 2011), and the other on the 5- estimate the environmental state, resulting in maladap- HTTLPR polymorphism in Caucasian children under tive developmental programming in some individuals. If 18 years of age (van IJzendoorn, Belsky & Bakermans- both conditions hold, then within any generation some Kranenburg, 2012). Both of these meta-analyses indicate individuals develop phenotypes that are well matched to that children carrying putative plasticity alleles are more the environment, and others do not. On Belsky’s (1997, susceptible to both the negative effects of environmental 2005; Belsky & Pluess, 2009a) account, well-matched adversity and the positive effects of social support (in individuals are those whose genetic composition is well terms of mental health). suited to the environment and/or those whose early-life Importantly, recent experimental human intervention cues accurately predicted their adult environment; mis- work that involves randomly assigning individuals to matched ones are those whose genetic composition is alternative contextual conditions (Belsky & Van IJzen- poorly suited to the environment and/or those whose doorn, 2015), thereby overcoming the risk that G9E early-life cues failed to predict their adult environment. findings are an artifact of gene–environment correlation,

© 2015 John Wiley & Sons Ltd 254 Willem E. Frankenhuis et al. provides additional support to the claim that some long- Shirtcliff, 2011; Ellis et al., 2011; Pluess & Belsky, regarded vulnerability genes function as plasticity genes 2011). This view is central to Boyce and Ellis’ (2005) (for review, see Belsky & Pluess, 2013). Indeed, a recent theory of Biological Sensitivity to Context (BSC), which meta-analysis of experimental interventions designed to regards physiological reactivity as a contextually regu- promote well-being while chronicling truly causal envi- lated plasticity factor. ronmental effects reveals that DRD4 and 5-HTTLPR function as plasticity genes, such that carriers of certain Biological sensitivity to context allelic variants of these polymorphisms benefit more – in mental-health terms – from these efforts (e.g. to promote Boyce and Ellis (2005) argue that individuals growing up early literacy, to prevent teen alcohol abuse) than do under extreme environmental conditions may benefit others (van IJzendoorn & Bakermans-Kranenburg, from developing heightened BSC (Ellis & Boyce, 2008; 2015). From an evolutionary perspective, two hypotheses Ellis, Essex & Boyce, 2005; Ellis et al., 2011). Such have been proposed to explain genetic variation in heightened reactivity can augment vigilance to dangers plasticity that undergirds, by hypothesis, individual and threats in stressful environments and enhance the differences in susceptibility. benefits derived from support and care in protective ones (Del Giudice et al., 2011; Ellis et al., 2006). In environ- ments with an intermediate level of stress, individuals Frequency-dependent selection may down-regulate reactivity, thus avoiding the costs One hypothesis posits that frequency-dependent selec- associated with persistently elevated levels of physiolog- tion maintains genetic variation in plasticity (Ellis et al., ical reactivity when the benefits do not outweigh the cost 2006, 2011; Wilson & Yoshimura, 1994; see also Wolf, (Ellis & Boyce, 2008). van Doorn & Weissing, 2008, 2011). Frequency-depen- dent selection refers to conditions in which a pheno- Prenatal programming of plasticity type’s fitness is dependent on its frequency relative to other phenotypes in a population. Individual differences A related view, developed by Belsky (1997, 2005; Belsky & in plasticity can persist due to frequency-dependent Pluess, 2009a; Belsky et al., 2007), stipulates that negative selection when plastic phenotypes attain higher fitness emotionality in infants and young children – a known than less-responsive phenotypes in a population com- correlate of physiological reactivity – also functions as a posed mostly of the latter – and vice versa (Ellis et al., plasticity factor. Children manifesting high levels of 2006, 2011; Wilson & Yoshimura, 1994). negativity (e.g. fear, distress, inhibition) prove not just more susceptible to the adverse effects of negative environments (e.g. harsh parenting, maternal depression), Parental bet-hedging but also benefit more from supportive ones (in terms of The current article focuses on the bet-hedging hypothesis mental health, not fitness). Children’s developmental of differential plasticity (Belsky, 1997, 2005; Belsky & experiences, including exposure to prenatal stress (e.g. Pluess, 2009a). This hypothesis states that it might be Huizink, Bartels, Rose, Pulkkinen, Eriksson et al., 2008; adaptive for parents to produce offspring with varying O’Connor, Ben-Shlomo, Heron, Golding, Adams et al., levels of developmental plasticity because different levels 2005; Pesonen, Raikk€ onen,€ Strandberg & Jarvenp€ a€a,€ of plasticity may be optimal in different environmental 2005), appear to influence both their physiological reac- states. Since the future is uncertain, natural selection tivity (e.g. Claessens, Daskalakis, van der Veen, Oitzl, de might favor parents who spread their investments and Kloet et al., 2011; Ellis & Boyce, 2008; Kaiser & Sachser, associated risk by producing offspring that vary in their 2009), even when measured in adulthood (Heim, New- sensitivity to environmental influences, including par- port, Wagner, Wilcox, Miller et al., 2002), and their enting (Belsky, 1997, 2005; Belsky & Beaver, 2011; Ellis negative emotionality (Belsky, Fish & Isabella, 1991). et al., 2011; Pluess & Belsky, 2010, 2013; see also Figueredo & Wolf, 2009). Consistency in early experiences Frankenhuis and Panchanathan (2011a, 2011b) recently Experiential regulation of plasticity proposed a third experiential process that may contribute to individual differences in plasticity: stochastic sam- As noted, between-individual variation in plasticity may pling. In some developmental domains, organisms may result from between-individual differences in prior expe- face a tradeoff between sampling more cues to the riences (Boyce & Ellis, 2005; Del Giudice, Ellis & environmental state and tailoring their phenotypes to

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 255 local conditions. In these domains, some individuals may This depends on the costs to plasticity, which might receive a homogeneous sample set, resulting in a include (a) phenotypic–environment mismatch resulting confident estimate about the environmental state, leading from prediction error (i.e. during developmental pro- such individuals to specialize early in life, and thereby gramming, cues may imperfectly indicate current or sacrifice plasticity (assuming that phenotypic develop- future environmental states, resulting in a suboptimal ment is not fully reversible). In contrast, others may phenotype; Donaldson-Matasci et al., 2013; Nettle, receive a heterogeneous set of cues, resulting in a less Frankenhuis & Rickard, 2013; Rickard, Frankenhuis & confident estimate, leading them to defer ‘phenotypic Nettle, 2014), (b) constitutive costs (e.g. energy required commitment’ (in order to avoid mismatch), keep sam- for building and maintaining the neural-cognitive pling, and specialize later. As a consequence, individuals machinery required for plasticity), (c) information search may come to differ in their levels of plasticity. costs (e.g. time spent sampling environmental cues), and (d) a lower degree of phenotypic integration (e.g. add- ons may be less effective than the same phenotypic Developing the model element integrated early in development) compared with fixed phenotypes that specialize from birth to fit a In building a model of Belsky’s (1997, 2005; Belsky & particular environmental state (for reviews, see Auld, Pluess, 2009a) bet-hedging argument of differential plas- Agrawal & Relyea, 2010; DeWitt, Sih & Wilson, 1998; ticity, our first challenge is to translate for-better-and-for- Relyea, 2002). worse outcomes, which are typically defined in terms of We will not consider these costs in detail here, but mental health, quality of life, and social desirability, into assume that there is some cost to plasticity. This cost is biological fitness (Belsky, 2008; Ellis, Del Giudice, Dish- such that within any generation the fitness of plastic ion, Figueredo, Gray et al., 2012; Frankenhuis & Del individuals is lower than that of (fixed) specialists Giudice, 2012; Frankenhuis & de Weerth, 2013; Manuck, matching the environmental state. However, the cost of 2010). In biology, traits are considered beneficial to plasticity is low enough for the fitness of plastic individ- organisms to the extent that they increase individuals’ uals to be higher than that of (fixed) specialists not relative survival and reproductive success. In contrast, matching the environmental state. We capture this idea in developmental psychologists tend to view distressing or our model by assuming that plastic individuals accrue a socially undesirable behavior as inherently maladaptive, fitness of 1 in each environment, which is intermediate and behaviors enhancing well-being and social integration between that of specialists who match the environmental as inherently adaptive. These different notions of ‘adap- state – and attain a fitness payoff of 1 +b– and that of tive’ are conceptually orthogonal: desirable behavior may specialists who do not match the environment and thus (but need not) enhance reproduction, and fitness-enhanc- attain a payoff of 1c. We assume that all plastic ing behavior may (but need not) have desirable features. individuals attain a fitness of 1 (instead of some attaining 1 +band others 1c) because we want to ensure that plastic individuals, as a group, are always situated in Fitness payoffs between matched and mismatched specialists. Plas- Individuals who are relatively sensitive to their environ- tic individuals are thus ‘generalists’, sacrificing specific- ment are ‘plastic’. They adaptively match their pheno- ity for breadth (Wilson, 1994; Wilson & Yoshimura, types to local conditions more than their peers, 1994). developing danger-adapted phenotypes (e.g. high stress) In contrast, specialists are less malleable, adapting in dangerous environments and safe-adapted phenotypes their phenotypes less to context, developing relatively (e.g. low stress) in safe environments. Whereas danger- similar phenotypes even in different environments (com- adapted phenotypes are considered maladaptive from a pared with plastic individuals). When their phenotype mental-health perspective, but biologically adaptive in matches the environmental state, they thrive because a dangerous environment, safe-adapted phenotypes in a they achieve a viable phenotype–environment fit without safe environment are considered adaptive from an paying a cost for plasticity; in this case, specialists attain evolutionary as well as a mental-health perspective. higher fitness (1 +b) than plastic individuals. However, From a fitness viewpoint, this raises a question: If plastic when the phenotypes of specialists are not well matched individuals match their phenotypes to local conditions, to the environment, they suffer, attaining lower fitness shouldn’t they attain the same or higher fitness than (1c) than plastic individuals. Specialists thus sacrifice individuals who do not adjust development (i.e. fixed breadth for specificity (Wilson, 1994; Wilson & Yoshim- types), hence outcompete them? ura, 1994).

© 2015 John Wiley & Sons Ltd 256 Willem E. Frankenhuis et al.

Spatial and temporal environmental variation as the arithmetic mean of the fitness of all their offspring2: the fraction of offspring in the safe patch The bet-hedging hypothesis of differential plasticity multiplied by their fitness, plus the fraction of offspring (Belsky, 1997, 2005; Belsky & Pluess, 2009a) is based in the dangerous patch multiplied by their fitness (and so on the idea that parents cannot predict with certainty forth, if there are more patches). This calculation implies what environmental state their offspring will experience. that even if some offspring attain low fitness (e.g. they However, environments can be unpredictable in different die), parents might still do well, depending on the fitness ways, and these may result in different selection pres- attained by their other offspring. sures, hence different adaptations (in our case, different offspring compositions; see below). Here we examine two kinds of environmental variation well known in evolu- Temporal environmental variation tionary biology: spatial and temporal environmental In contrast, in environments that vary only temporally variation (some use the terms individual-level and popu- (and not spatially), within a single generation all individ- lation-level; e.g. Bergstrom & Godfrey-Smith, 1998; uals confront the same environmental state (e.g. danger- Donaldson-Matasci et al., 2008); we do not analyze ous); however, in the next generation, their offspring may their combination (see, e.g. Carja, Furrow & Feldman, face a different state (e.g. safe). Thus, if the environment 2014). The appropriate fitness calculations under spatial fluctuates temporally, the entire population experiences and temporal environmental variation constitute a rich variation across generations. In such environments, long- field of inquiry in biology; such calculations depend on term fitness depends on the fitness of one generation, the intricacies and degree of realism of assumptions multiplied by the fitness of the next generation, and so about organisms and their physical and social environ- forth. The average fitness of this series will not be the ments. In this article, we will discuss just two stylized arithmetic mean, but the geometric (i.e. multiplicative) scenarios. We recommend the following works to readers mean – the n-th root of the product of n fitness values3 seeking more information or interested in developing (Dempster, 1955; King & Masel, 2007; Lewontin & more refined follow-up models (Frank, 2011; Leimar, Cohen, 1969). Unlike with spatial variation, fluctuations 2009; McNamara, Trimmer, Eriksson, Marshall & in success can be catastrophic when the environment Houston, 2011). varies temporally. If an entire set of offspring is mis- matched to its environment in one generation and fails to Spatial environmental variation reproduce, this lineage will be wiped out. In environments that vary exclusively spatially (and not temporally), the world is divided into different patches, Formal definition of bet-hedging each with a particular state (e.g. dangerous or safe). Variance in fitness across environments lowers the Within a given generation, any offspring may develop in geometric mean, but not the arithmetic mean. Indeed, either patch, and the associated probabilities are equal the geometric mean is often approximated by the for all individuals (i.e. juveniles disperse from their natal arithmetic mean minus a variance term – most com- patch and settle on a new patch at random); some monly, the variance divided by two times the arithmetic offspring will develop in one patch (e.g. safe), others in mean4 (Frank, 2011; Starrfelt & Kokko, 2012; Stearns, another patch (e.g. dangerous), and so forth (if there are more than two patches). Across generations, however, individuals always face the same spatial distribution, and 2 We assume discrete, non-overlapping generations, which consist of a so parents attain the same expected fitness within each single selective life stage: organisms are born and reproduce; mature generation – and thus also across generations; that is, individuals die; and the cycle repeats. Parents and offspring do not there is no variance in parents’ fitness across genera- coexist (see also General Discussion section). 3 tions.1 In this scenario, parents’ fitness can be computed This assumes an infinitely large and well-mixed population (Starrfelt & Kokko, 2012). When populations are finite, variance in offspring number reduces fitness in proportion to the inverse of the population size (see Frank & Slatkin, 1990, Gillespie, 1974, and Proulx & Day, 1 This expectation being identical across generations assumes that 2001, for discussions of geometric mean fitness in finite populations, genotypes produce an infinite number of offspring. If genotypes particularly small ones; for a discussion of the evolution of bet-hedging produce a finite number of offspring (as they do in reality), they will in large, structured populations, see Lehmann & Balloux, 2007). experience variance in fitness across generations (Starrfelt & Kokko, 4 This approximation assumes that the fitness of individuals does not 2012). Because this variance becomes very small at even modest deviate much from the average fitness of their genotype within a given population sizes (Hopper, Rosenheim, Prout & Oppenheim, 2003), we generation (Starrfelt & Kokko, 2012). If it does, a structurally similar, will ignore it here. albeit less succinct, approximation may be preferable.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 257

2000; see Young & Trent, 1969, for other approxima- by developing offspring in the model we later present. tions). The long-term fitness of genotypes thus depends Each roulette wheel has a certain number of red pockets not only on their immediate (arithmetic) expectation for and a certain number of black pockets, where black the next generation, but also on variance in their fitness pockets correspond to a safe environment and red across generations (Dempster, 1955). This insight formed pockets to a dangerous environment (we ignore numbers the foundation of bet-hedging theory in biology (Slatkin, associated with different pockets). The fraction of black 1974). A mean–variance tradeoff exists if an increase in a pockets is given by the probability p. For example, if strategy’s short-term (i.e. next-generation) expected p = .75 there are three times as many black pockets as arithmetic mean also increases its long-term (i.e. across red pockets for each and every roulette wheel in the generations) expected fitness variance. Bet-hedging strat- casino. egies are those which, when confronting this tradeoff, Next to each wheel is a felt table with three squares, sacrifice short-term fitness to reduce long-term fitness one black, one red, and one white. Before each spin, variance (Seger & Brockmann, 1987; Philippi & Seger, gamblers place their chips on these squares. A gambler 1989). represents a parent and the set of all gamblers in the casino represents an evolving population of parents. Each chip represents a child. Just as a gambler chooses Diversified vs. conservative bet-hedging to bet on black, red, or white, a parent can have a child We will focus on situations in which parents hedge their that is a safe-specialist, a danger-specialist, or a gener- bets by producing diverse offspring (i.e. a mixture of alist. A chip bet on the white square returns back that types); this is known as ‘diversified’ bet-hedging (cap- same chip regardless of the outcome of the spin (i.e. no tured by the idiom: ‘Don’t put all your eggs in one gain, no loss). A chip bet on the black square returns basket’). Formally, ‘bet-hedging’ applies to any strategy 1 + b chips if the wheel comes up black (where b that increases geometric mean fitness while sacrificing represents a fraction of one chip, varying between 0 and short-term arithmetic mean fitness. If, for instance, 1) and 1c chips if the wheel comes up red (where c parents are selected to produce only plastic offspring in represents a fraction between 0 and 1). Similarly, a chip order to reduce variation in fitness, despite a reduction in bet on red returns 1 + b chips if the outcome is red and short-term arithmetic mean fitness, this is bet-hedging, 1c if the outcome is black. too; it is called ‘conservative’ bet-hedging (captured by Gamblers must bet all of their chips every time the the idiom: ‘A bird in the hand is worth two in the bush’) wheels spin (i.e. they cannot reserve chips, though betting because individuals avoid extreme payoffs. on white results in the same outcome). Each gambler has Diversified bet-hedging can be instantiated in different a strategy she uses in placing her bets. Some strategies ways: by producing genetically diverse offspring, or by will, over time, do better than other strategies. And we producing genetically homogenous offspring each of might expect that most gamblers end up deploying which randomly develops a phenotype (as if drawn from similar strategies. There is, after all, a uniquely best a fixed probability distribution). The latter is called strategy for any combination of values for p, b, and c. ‘adaptive coin flipping’ because, metaphorically, each Note: we chose roulette as a metaphor, rather than individual flips a coin to determine what phenotype he or poker, because our model assumes that the payoff of a she will develop: e.g. specialist or generalist (Cooper & strategy is not dependent on its frequency relative to Kaplan, 1982; Kaplan & Cooper, 1984; Salathe, Van other strategies in the population. We leave an extension Cleve & Feldman, 2009). We defer further discussion of of our model that incorporates frequency-dependent instantiation to the General Discussion section. selection – in which parents are playing games not only against nature, but against each other as well – for a future study. A gambling metaphor On some nights, the casino limits gamblers such that they can only bet one chip on each wheel every time the Before introducing the mathematical model, we intro- wheels spin (e.g. if a gambler has 20 chips, she must place duce a gambling metaphor in order to introduce key 20 bets on 20 different roulette wheels). On other nights, concepts, develop intuitions, and preview results. Imag- the casino forces gamblers to place all of their chips on ine a casino with a large number of identical roulette just one wheel each time the wheels spin (e.g. the gambler wheels. All wheels are spun simultaneously throughout must now place all 20 of her chips on just one of the the night (e.g. every five minutes). Each spin of the roulette wheels). wheels corresponds to a generation and the outcome of a In the first version, in which separate bets are placed spin corresponds to the environmental state experienced on separate wheels, a gambler does best by betting all of

© 2015 John Wiley & Sons Ltd 258 Willem E. Frankenhuis et al. her chips on just one color (i.e. red, black, or white); she should never place any chips on the white square. The should never bet on different colors. To understand why, precise distribution between red and black depends on we need to calculate the expected return in any round the values of p, b,andc. When the cost of being wrong for betting on each of the three colors. If a gambler bets exceeds the benefit of being correct (c>b), a gambler on black, her expected payoff is 1 + pb (1p)c (see sometimes does best by placing some chips on the white Appendix for details). When this expectation is greater square and the remainder on either the black square (if than 1, the gambler will, over time, make more money p > .5) or the red square (if p < .5), but never both. betting on black than betting on white, which is the safe These betting strategies ensure a good payoff without bet. If, instead, a gambler bets on red, her expected suffering catastrophic failure. payoff is 1 + b(1p) pc. Again, if this is greater than This gambling metaphor has fundamental similarities 1, she will make more money, on average, by betting on with the problem faced by parents in choosing the red than on white. If both of these expectations (i.e. developmental strategies of their offspring when the betting on red and betting on black) are less than 1, the environment fluctuates across time and space. The nights gambler does poorly betting on either black or red; her in which gamblers can bet only one chip per roulette best play is to bet on white and preserve her chips. If wheel corresponds to a purely spatially varying ecology, one expectation (e.g. betting on red) is greater than 1 whereas the nights in which gamblers must place all their and the other expectation less than 1 (e.g. black), she chips on just one wheel corresponds to a purely should bet on the color that has an expected rate of temporally varying ecology. return greater than 1. And if both are greater than 1, she makes the most money betting on the color with the higher expected rate of return. In this game, because The mathematical model gamblers bet each chip on a separate wheel and because the outcomes across wheels are independent, losing at In our model, parents can produce plastic offspring any particular wheel is no big deal, especially when (generalists), who can adapt their phenotype to local betting a large number of chips each time. What conditions, attaining a payoff of 1 in each of the two matters, in the long run, is the arithmetic average across environmental states. Parents can also produce fixed wheels. In this case, that average is maximized by offspring (safe- and danger-specialists), who are less betting on just one color, depending on the values of p, able to adapt to local conditions, attaining high payoffs b,andc. (relative to generalists) in environments matching their Things are less simple in the second version of the phenotypes (1 +b), and low payoffs (relative to game, in which a gambler must place all of her chips on generalists) in environments not matching their pheno- just one wheel each time the wheels spin. If, for types (1c). We assume that b and c range between 0 example, she places all of her chips on the red square and 1; this assumption ensures that mismatched and black comes up, she stands to lose money; the specialists do not attain negative fitness, and implies larger the value of c, the more the gambler loses. When that well-matched specialists can at most attain double c = 1, the most it can be, the gambler is wiped out. the fitness of generalists. This assumption is justified Rather than placing all of her chips on a single color, because empirical work shows that fitness effects of she must be more cautious in this version of the game, natural phenotypic variation virtually always fall especially when the value of c is large, hedging her bets within this range in humans (reviewed in Keller & to insure against catastrophe. In this type of game, a Miller, 2006; Nettle & Pollet, 2008; Penke, Denissen & gambler seeks to maximize the geometric mean payoff, Miller, 2007; Stearns, Byars, Govindaraju & Ewbank, not the arithmetic mean (see above section Spatial and 2010), as well as other organisms (reviewed in Endler, Temporal Environmental Variation). To maximize the 1986; Hoekstra, Hoekstra, Berrigan, Vignieri, Hoang geometric mean, the gambler must take into account et al., 2001; Kingsolver, Hoekstra, Hoekstra, Berrigan, variation in her payoffs across gambles. How to Vignieri et al., 2001; Morrissey & Hadfield, 2012; maximize the geometric mean exactly depends on the Siepielski, Gotanda, Morrissey, Diamond, DiBattista values of p, b, and c. et al., 2013). To preview results for this second version of the game (which we describe in more detail below), when the Optimal offspring distribution benefit of a correct bet exceeds the cost of an incorrect bet (b>c), a gambler does best by placing all her chips on Our goal is to compute the fraction of safe-specialists the black square, all her chips on the red square, or (x), danger-specialists (y), and plastic individuals (z) that distributing her chips between black and red squares; she natural selection favors parents to produce. Each of these

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 259 fractions ranges between 0 and 1, and all three must sum these terms represents the arithmetic mean of fitness, to 1. We are particularly interested in the region where averaging across the three types of offspring. 0

© 2015 John Wiley & Sons Ltd 260 Willem E. Frankenhuis et al.

With temporal variation, differential plasticity can is high, there is little point in producing a second type to sometimes be favored. In the Appendix, we provide capture some benefits in the rare environment; individ- analytical results. Figure 1 depicts the regions of param- uals do best by producing only the type that is well eter space in which differential plasticity is uniquely matched to the common environment, despite suffering favored; that is, where it attains higher, not lower or the occasional loss. When the benefit and cost are large equal, fitness than all of the other strategies (see also (b,c >> 0) and p is not that high, selection favors figures in Appendix). producing a mixture of specialists. A mixture of special- Consistent with previous findings from biology (Don- ists does better than only safe-specialists because the aldson-Matasci et al., 2008; Moran, 1992; Starrfelt & mixture experiences much less variation in fitness across Kokko, 2012), natural selection only favors bet hedging time. And when the environment varies temporally, what when fitness effects are large (i.e. the variation in fitness matters is geometric mean fitness, not arithmetic mean associated with different types must be large), in order to fitness. A high variance in payoff reduces the geometric reduce costly (or even catastrophic) variance in fitness. mean. We also report a novel result: Natural selection only Focusing on the region in which selection favors a favors differential plasticity when the cost of being mixture of specialists, we can ask why a mixture of mismatched exceeds the benefit of being well adapted specialists beats differential plasticity (see the Appendix (c>b; Figure 1; see also figures in Appendix). We reflect for a proof). In this region, when p is not that high and b, on these results below. c >> 0, producing only safe-specialists results in too much exposure to risk. When the environment is dangerous, safe-specialists do very poorly. To mitigate When benefits exceed costs this risk, selection favors mixing safe-specialists with When the benefit of being well matched is larger than the another type, either danger-specialists or generalists. cost of being mismatched (b>c), selection never favors When the benefit of being well matched exceeds the cost differential plasticity (x+z = 1). Instead, selection favors of mismatch, this other type should be danger-specialists. producing either all safe-specialists (x = 1) or a mixture To understand why, we can think about the effect of of specialists (x+y = 1). Producing all safe-specialists is the two kinds of mixing on the variance in fitness across favored when the benefit of being well matched and the environments. With a mixture of specialists, there is cost of being mismatched are small (b,c << 1) or the always a mix that results in the same or nearly identical probability of experiencing the safe environment is high. payoffs in each environment (see figures in Appendix); With small fitness effects, fitness variance is also small, this is true even if the environmental state is highly so we are back to a world that is approximately like variable (e.g. p = .6). With differential plasticity, there is spatial variation; selection favors producing only the no way of mixing safe-specialists and generalists to type that has the highest arithmetic mean fitness. When p eliminate variance in payoff across environments; the

p = 0.6 p = 0.75 p = 0.9

3 .3 0 0.67 G D G 0.33 D D M 0.8 0.8 1 1 G 0.67 M 0.6 0 0.9 0.330.7 0 1 S Cost 0.4 M 1 7 S 0.2 0.6 0.9 1 0.8 S 1 S 0 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.8 Benefit

Figure 1 The x-axis represents the additional benefit (b) fixed specialists obtain, compared with plastic generalists, if their phenotype matches the environmental state. The y-axis represents the additional cost (c) fixed specialists incur, compared with plastic generalists, if their phenotype does not match the environmental state. Contour lines indicate the optimal fraction of safe- specialists, when the probability of a safe environment equals 0.6, 0.75, and 0.9 (from left to right). In regions denoted ‘S’ (shaded dark gray), only safe-specialists are favored; if ‘G’ (shaded medium gray), pure generalists; if ‘M’ (shaded light gray), mixtures of specialists (i.e. safe- and danger-specialists); if ‘D’ (not shaded), differential plasticity (i.e. mixtures of generalists and safe-specialists). If b > c, differential plasticity is never favored. If c > b, differential plasticity can be favored, especially if c is large.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 261 only way to eliminate fitness differences in the two differential plasticity. If these three conditions are met, environments is to produce all generalists. Mixtures of our model can account for differential plasticity (i.e. the specialists can always reach lower variance in payoffs coexistence of plastic generalists and fixed specialists). than differential plasticity, which all else being equal (e.g. However, our model cannot account for a coexistence of the arithmetic mean) results in higher geometric mean plastic generalists and multiple fixed specialists (i.e. fitness. different specialists adapted to different environments). Including density-dependence in the model will likely change this result (see Kawecki & Ebert, 2004; Savolai- When costs exceed benefits nen et al., 2013; Wilson & Yoshimura, 1994); however, a When the cost of mismatch exceeds the benefit of being future study should formally examine this. well matched (c>b), selection sometimes favors differen- tial plasticity. When the magnitude of benefit and cost Empirical predictions are small (c,b << 1), selection favors producing only generalists when p is below some threshold (the precise Before discussing the plausibility of large fitness effects, value of this threshold depends on the values of b and c). we will first derive empirical predictions. We will focus When the magnitude of fitness effects is large (c,b >> 0), on two predictions, which apply to contemporary selection often favors differential plasticity over a mix- populations only to the extent that individuals inhabit ture of specialists. environments that share properties with those environ- To understand why, we can think about gains and ments in which parental bet-hedging may have originally losses. With differential plasticity, the safe-specialists do evolved (i.e. no evolutionary disequilibrium). somewhat better than generalists in the more common First, if parental bet-hedging explains differential safe environment and a lot worse in the more rare plasticity, we should expect large fitness consequences dangerous environment. With a mixture of specialists, of variation in plasticity. For example, at a given level of the safe-specialists do reasonably well in the safe environmental harshness, danger-specialists should environment and a lot worse in the dangerous environ- attain much higher fitness than plastic generalists who ment; the danger-specialists do reasonably well in the should attain much higher fitness than safe-specialists. dangerous environment and very poorly in the safe Measuring fitness (even proxies) is difficult in long-lived environment. Because the cost exceeds the benefit, organisms such as humans, but it is not impossible. For differential plasticity is exposed to smaller losses than a instance, fieldwork on Yanomamo€ Indians of Amazonas mixture of specialists. As a result, differential plasticity indicates that 88% of unokai (men who have killed) sired experiences less variance in payoffs across environments at least one offspring, compared with 49% of non-unokai compared to a mixture of specialists, thereby achieving (Chagnon, 1988; see also Walker & Bailey, 2013). higher geometric mean fitness. Accordingly, among the Yanomamo,€ an individual who develops a danger-adapted phenotype (e.g. physical strength, high levels of vigilance) might be more likely General discussion (than conspecifics who develop safe-adapted pheno- types) to become unokai, and less likely to become a We presented a mathematical model in order to examine victim, thus garnering fitness benefits. However, for whether individual differences in plasticity could result selection to potentially favor differential plasticity (as from parental bet-hedging. Our results support the opposed to only danger-specialists), at other times safe- hypothesis’ logical coherence in that there are ecological specialists would need to have a large advantage. In those scenarios in which natural selection might favor differ- periods, selection pressures may resemble those faced by ential plasticity. However, three conditions must simul- the Waorani of Ecuador, among whom more aggressive taneously hold. First, environmental variation must men have fewer children surviving to reproductive age occur temporally, not exclusively spatially. Second, (Beckerman, Erickson, Yost, Regalado, Jaramillo et al., fitness effects must be large (i.e. variation in individuals’ 2009). Moreover, for differential plasticity to beat plasticity is correlated with substantial variation in mixtures of specialists, in times when aggression is fitness). Both of these conditions are consistent with favored, the benefits of aggression must be smaller than other biological models of bet-hedging (Donaldson- its costs in times when it is disfavored. Matasci et al., 2008; Moran, 1992; Starrfelt & Kokko, Second, we expect the extent to which individuals 2012). Third, the costs of being mismatched must exceed differ in their plasticity levels to depend on: (a) environ- the benefits of being well matched. To our knowledge, mental variance and (b) the cost–benefit ratio (i.e. the this result is novel and may be specific to the evolution of extent to which the costs of being mismatched exceed the

© 2015 John Wiley & Sons Ltd 262 Willem E. Frankenhuis et al. benefits of being well matched). Specifically, populations Giudice et al., 2011) and other animals (Dingemanse, experiencing larger environmental variance should Kazem, Reale & Wright, 2010; Sih & Del Giudice, 2012; include more plastic generalists, and fewer specialists, Stamps & Groothuis, 2010). Initial evidence suggests than populations experiencing smaller environmental that plasticity levels are correlated across domains, but it variance (Figure 1; for evidence of bet-hedging plants would be premature to draw firm conclusions (Belsky & producing proportions of offspring types that track Pluess, 2013). Future research is needed to clarify this frequencies of environmental states, see Graham, Smith important issue. & Simons, 2014, and Rajon, Desouhant, Chevalier, Debias & Menu, 2014; for evidence of bet-hedging Limitations and future directions plants producing proportions of offspring types that track both frequencies of environmental states and Models are by design simplified, idealized versions of ontogenetic information, see Sadeh, Guterman, Gersani reality, the goal of which is to capture some essential & Ovadia, 2009, and Simons, 2014). Further, we expect components of a process or system. Moreover, models the proportion of plastic generalists to be larger, and the may serve different kinds of purposes. One distinction is proportion of specialists to be smaller, in those popula- that between general and specific models (Parker & tions in which the cost–benefit ratio is greater (Figure 1), Maynard Smith, 1990): ‘General models promote under- as larger mismatch costs penalize specialists. Having standing of qualitative features. The parameters of such stated predictions, we now turn to the question whether models may not be easy to measure. Specific models are the conditions our results point to plausibly pertain to based on a particular system and have parameters that human evolutionary history. can be measured so that predictions can be made’ (Houston & McNamara, 2005, p. 934). We presented a general model whose goal is to: (a) explicate assump- Plausibility of large fitness effects tions, (b) test their theoretical consequences and thus the Our results indicate that selection can favor differential logical cogency of hypotheses, and (c) understand plasticity, provided fitness effects are large (i.e. variation interactions between variables that are difficult to intuit, in individuals’ plasticity is correlated with substantial if not impossible, without the help of formalizations (see variation in fitness). Larger fitness effects increase fitness Fawcett, Hamblin & Giraldeau, 2013; Frankenhuis, variance (lowering geometric mean fitness), which can be Panchanathan & Barrett, 2013). We have provided one reduced by producing mixtures of types – including, if way of capturing and analyzing the bet-hedging argu- c>b, differential plasticity. Meta-analyses of studies of ment of differential plasticity (Belsky, 1997, 2005; Belsky wild animal populations show that frequencies of fitness & Pluess, 2009a, 2009b). However, our work is imperfect effects are exponentially distributed, with smaller effects and incomplete, and we hope that future research will being much more common than larger ones (reviewed in address these limitations. Endler, 1986; Hoekstra et al., 2001; Kingsolver et al., First, our model does not address how mechanistic 2001; for studies of humans, see Keller & Miller, 2006; instantiation might constrain optimality (McNamara & Nettle & Pollet, 2008; Penke et al., 2007; Stearns et al., Houston, 2009). We assume that parents can produce 2010). Moreover, recent analyses suggest that large optimal proportions of offspring types, unhindered by fitness effects are even less common than earlier surveys genetic, developmental, physiological, or cognitive con- indicated (Morrissey & Hadfield, 2012; Siepielski et al., straints. Such an assumption of unbounded optimality is 2013). The plausibility of large fitness effects in the sometimes called the ‘phenotypic’ or ‘behavioral’ gambit present case depends, among other things, on the extent (Grafen, 1984; Fawcett et al., 2013; Frankenhuis et al., to which ‘plasticity levels’ are correlated across different 2013). In real organisms, however, constraints abound. developmental domains. If an individual’s plasticity level To give one example: we assume that offspring can in one domain (e.g. metabolic adaptation) predicts her produce exactly the same distributions of offspring types plasticity levels in other domains (e.g. reproductive as their parents did (‘like begets like’). This assumption development, stress responsivity), then the effects on is realistic in haploid asexual organisms, where selection overall fitness could be large. In contrast, if plasticity happens among competing clones, but not in humans, levels are narrowly trait-specific, their effects will be who have two sets of chromosomes. Diploid genetic restricted to single traits as well, reducing their impact on systems (a) constrain the offspring distributions that overall fitness. The extent to which plasticity levels are parents can produce (i.e. some types of offspring may not correlated across domains is an open and interesting be producible), and (b) limit the extent to which parents question, which has recently come into focus in studies of can control the distribution of those offspring types that humans (Aron et al., 2012; Belsky & Pluess, 2013; Del they can in fact produce (e.g. due to random shuffling of

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 263 alleles, recombination, and other processes); this is why see Del Giudice, 2012). Although maintenance of genetic heterozygotes, who are less susceptible to sickle-cell variation in fluctuating environments does not normally anemia than homozygotes, are not universal in popula- result from bet-hedging (i.e. from strategies that sacrifice tions exposed to malaria. short-term mean fitness in order to reduce long-term Given the randomness inherent to sexual reproduc- fitness variance), in some cases it might (Svardal et al., tion, it seems implausible that in species producing small 2011). It will thus be interesting to examine how numbers of offspring, like humans, parents will diversify including overlapping generations affects scope for their offsprings’ levels of plasticity using genetic means – differential plasticity to evolve. for instance, by mating with multiple, genetically diverse A third limitation is that we assume discrete types of partners (e.g. serial monogamy, promiscuity). Instead, it offspring (one plastic type, and two fixed types); our is more likely that parental bet-hedging, if it occurs in model does not consider continuous variation in the humans, is instantiated via epigenetic mechanisms, such degree of plasticity. It is uncertain whether between- as pre- or postnatal programming (Belsky, 2005; Belsky individual variation in plasticity would also evolve if & Pluess, 2009a, 2011; Ellis et al., 2011; for evidence in individuals could develop such continuous levels. Con- non-human animals, see Crean & Marshall, 2009). In ceivably, in that case, parents might produce all offspring this scenario, natural selection would favor parents who with the same, intermediate level of plasticity. This is transmit to their offspring, not so much different genetic especially worth exploring because human G9E inter- variants, but rather variation in epigenetic settings that action research suggests that variation in plasticity may determines the extent to which experience shapes phe- be better characterized in terms of a gradient than in notypic development (i.e. epigenetic variation in sensi- typological terms (Belsky & Pluess, 2009a, 2009b; Belsky tivity to developmental programming). An alternative & Beaver, 2011). possibility is that natural selection favors adaptive ‘coin- Fourth, our model makes specific assumptions about flipping’, in which parents produce offspring that rates of environmental change. It assumes that, first, the stochastically vary in their levels of plasticity, irrespective environmental state (e.g. safe or dangerous) is stable of offspring experience (Bull, 1987; Cooper & Kaplan, enough within generations for developmental program- 1982; Kaplan & Cooper, 1984; Salathe et al., 2009). ming to evolve, but not perfectly stable, resulting in Regardless, it would be valuable to formally examine maladaptive developmental programming in some indi- how diploid genetics might influence scope for differen- viduals. Second, the environmental state is variable tial plasticity to evolve. between generations to an extent that parents cannot A second limitation is that our model assumes predict the environment their offspring will experience discrete, non-overlapping generations, which consist of any better than the long-term average probabilities of a single selective life stage, as these assumptions are environmental states. In other words, our model assumes implicit in the way we computed geometric mean fitness: high environmental auto-correlation within generations, organisms are born and reproduce; mature individuals and no environmental auto-correlation between genera- die; and the cycle repeats. Parents and offspring do not tions. An extension of the current model would be to coexist. This assumption does not hold for humans. consider between-generation auto-correlation in the Therefore, future work should explore variants of our environment. Recent evidence suggests that there have model that include overlapping generations. Classical been periods in human history characterized by climatic biological models show that temporally fluctuating fluctuations on the scale of decades to millennia, which is selection – in which the relative fitnesses of different very rapid over evolutionary timescales but rather slow phenotypes vary over time – is ineffective in maintaining over one or even several individual lifetimes (Potts, 1998; genetic variation if generations are non-overlapping (e.g. Richerson, Boyd & Bettinger, 2001). This means that Frank & Slatkin, 1990; but see Svardal, Rueffler & being born at a time of nutritional stress, or abundance, Hermisson, 2011). However, as Del Giudice (2012) notes, would have predicted – albeit imperfectly – a lifetime of this result changes ‘in species with (a) overlapping such conditions for oneself and one’s offspring. There generations in which juveniles and adults coexist, and might have been between-generation autocorrelation in (b) multiple life stages, at least [one] of which is dimensions of the social environment as well. For temporarily “shielded” from the [selective] effects of instance, within-society differences in social status environmental change. When these conditions are met, (determining access to resources) may have been mod- temporally fluctuating selection becomes extremely erately stable across lifetimes in ancestral societies as effective in maintaining genetic variation, as multiple they are in many extant societies (Borgerhoff Mulder, life stages store genetic variation and maintain it as the Bowles, Hertz, Bell, Beise et al., 2009) and in some non- environment changes’ (p. 55; for supporting references, human primates (Cheney, 1977). Formal models show

© 2015 John Wiley & Sons Ltd 264 Willem E. Frankenhuis et al. that moderate degrees of environmental stability across Bakermans-Kranenburg, M.J., & van IJzendoorn, M.H. generations provide favorable conditions for the evolu- (2011). Differential susceptibility to rearing environment tion of systems of epigenetic inheritance (Jablonka, depending on dopamine-related genes: new evidence and a 23 – Oborny, Molnar, Kisdi, Hofbauer et al., 1995; Lach- meta-analysis. Development and Psychopathology, ,39 mann & Jablonka, 1996). With between-generation auto- 52. Beckerman, S., Erickson, P.I., Yost, J., Regalado, J., Jaramillo, correlation, uncertainty is reduced across generations. It L., et al. (2009). Life histories, blood revenge, and reproduc- is not clear what effect this type of uncertainty reduction tive success among the Waorani of Ecuador. Proceedings of will have on the scope for the evolution of bet-hedging. the National Academy of Sciences of the of America, 106, 8134–8139. Conclusions Belsky, J. (1997). Variation in susceptibility to environmental influence: an evolutionary argument. Psychological Inquiry, Belsky’s bet-hedging hypothesis (Belsky, 1997) is widely 8, 182–186. cited and is having interdisciplinary impact, inspiring Belsky, J. (2005). Differential susceptibility to rearing influence: research not only in observational and experimental an evolutionary hypothesis and some evidence. In B. Ellis & studies in developmental psychology, but also in related D. Bjorklund (Eds.), Origins of the social mind: Evolutionary – fields, such as clinical science, pedagogy, and public psychology and child development (pp. 139 163). : Guilford Press. policy (White, Li, Griskevicius, Neuberg & Kenrick, Belsky, J. (2008). War, trauma and children’s development: 2013). Despite its success, however, the bet-hedging observations from a modern evolutionary perspective. Inter- hypothesis has never been formalized, even though national Journal of Behavioral Development, 32, 260–271. related models have long existed in the biological Belsky, J., Bakermans-Kranenburg, M.J., & van IJzendoorn, sciences. Here we have provided such an analysis. Results M.H. (2007). For better and for worse: differential suscep- support the argument’s logical coherence, but only under tibility to environmental influences. Current Directions in restrictive conditions. We hope that future research will Psychological Science, 16, 300–304. extend and modify our work, resulting in a family of Belsky, J., & Beaver, K.M. (2011). Cumulative-genetic plasticity, models, with each examining the consequences of parenting and adolescent self-regulation. Journal of Child 52 – particular assumptions. Psychology and Psychiatry, , 619 626. Belsky, J., Fish, M., & Isabella, R. (1991). Continuity and discontinuity in infant negative and positive emotionality: family antecedents and attachment consequences. Develop- Acknowledgements mental Psychology, 27, 421–431. Belsky, J., Jonassaint, C., Pluess, M., Stanton, M., Brummett, This research was supported by a Veni grant from the B., et al. (2009). Vulnerability genes or plasticity? Molecular Netherlands Organization for Scientific Research Psychiatry, 14, 746–754. (NWO) (016.155.195) awarded to Willem Frankenhuis, Belsky, J., & Pluess, M. (2009a). Beyond diathesis stress: and by the Hungarian Science Foundation Grant OTKA differential susceptibility to environmental influences. Psy- NK83997. We thank Rob Boyd, Bruce Ellis, Tim chological Bulletin, 185, 885–908. Fawcett, Marco Del Giudice, Mate Lengyel, Roberto Belsky, J., & Pluess, M. (2009b). The nature (and nurture?) of Schonmann, the three anonymous reviewers, and the plasticity in early human development. Perspectives on 4 – editor Denis Mareschal for their valuable comments. Psychological Science, , 345 351. Belsky, J., & Pluess, M. (2013). Beyond risk, resilience and dysregulation:-phenotypic plasticity and human develop- ment. Development and Psychopathology, 25, 1243–1261. References Belsky, J., & Van IJzendoorn, M.H. (2015). What works for whom? Genetic moderation and epigenetic mediation of Aron, E.A., Aron, A., & Jagiellowicz, J. (2012). Sensory intervention efficacy. Development and Psychopathology, 27, processing sensitivity: a review in the light of the evolution of Special Issue 1, 1–16. biological responsivity. Personality and Bergstrom, C.T., & Godfrey-Smith, P. (1998). On the evolution 16 – Review, , 262 282. of behavioral heterogeneity in individuals and populations. Auld, J.R., Agrawal, A.A., & Relyea, R.A. (2010). Re-evalu- Biology and Philosophy, 13, 205–231. ating the costs and limits of adaptive phenotypic plasticity. Bernoulli, D. (1954). Exposition of a new theory on the 277 – Proceedings of the Royal Society B, , 503 511. measurement of risk. Econometrica, 22,23–36. Translation of Bakermans-Kranenburg, M.J., & van IJzendoorn, M.H. Bernoulli, D. (1738). Specimen theoriae novae de mensura (2007). Research review: genetic vulnerability or differential sortis. Papers Imp. Acad. Sci. St. Petersburg, 5, 175–192. – susceptibility in child development the case of attach- Borgerhoff Mulder, M., Bowles, S., Hertz, T., Bell, A., Beise, J., 48 ment. Journal of Child Psychology and Psychiatry, , et al. (2009). The intergenerational transmission of wealth 1160–1173.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 265

and the dynamics of inequality in pre-modern societies. Dingemanse, N.J., & Wolf, M. (2013). Between-individual Science, 326, 682–688. differences in behavioural plasticity within populations: Boyce, W.T., Chesney, M., Alkon, A., Tschann, J.M., Adams, causes and consequences. Animal Behaviour, 85, 1031–1039. S., et al. (1995). Psychobiologic reactivity to stress and Donaldson-Matasci, M.C., Bergstrom, C.T., & Lachmann, M. childhood respiratory illnesses: results of two prospective (2013). When unreliable cues are good enough. American studies. Psychosomatic Medicine, 57, 411–422. Naturalist, 182, 313–327. Boyce, W.T., & Ellis, B.J. (2005). Biological sensitivity to Donaldson-Matasci, M.C., Lachmann, M., & Bergstrom, C.T. context: I. An evolutionary-developmental theory of the (2008). Phenotypic diversity as an adaptation to environmen- origins and functions of stress reactivity. Development and tal uncertainty. Evolutionary Ecology Research, 10, 493–515. Psychopathology, 17, 271–301. Ellis, B.J., & Boyce, W.T. (2008). Biological sensitivity to Bull, J.J. (1987). Evolution of phenotypic variance. Evolution, context. Current Directions in Psychological Science, 17, 183– 41, 303–315. 187. Carja, O., Furrow, R.E., & Feldman, M.W. (2014). The role of Ellis, B.J., Boyce, W.T., Belsky, J., Bakermans-Kranenburg, migration in the evolution of phenotypic switching. Proceed- M.J., & van IJzendoorn, M.H. (2011). Differential suscep- ings of the Royal Society B, 281, 1794. tibility to the environment: an evolutionary-neurodevelop- Chagnon, N. (1988). Life histories, blood revenge, and warfare mental theory. Development and Psychopathology, 23,7–28. in a tribal population. Science, 239, 985–992. Ellis, B.J., Del Giudice, M., Dishion, T.J., Figueredo, A.J., Cheney, D.L. (1977). Acquisition of rank and development of Gray, P., et al. (2012). The evolutionary basis of risky reciprocal alliances among free-ranging immature baboons. adolescent behavior: implications for science, policy, and Behavioral Ecology and , 2, 303–318. practice. Developmental Psychology, 48, 598–623. Childs, D.Z., Metcalf, C.J.E., & Rees, M. (2010). Evolutionary Ellis, B.J., Essex, M.J., & Boyce, W.T. (2005). Biological bet-hedging in the real world: empirical evidence and sensitivity to context: II. Empirical explorations of an challenges revealed by plants. Proceedings of the Royal evolutionary-developmental theory. Development and Psy- Society B, 277, 3055–3064. chopathology, 17, 303–328. Claessens, S.E.F., Daskalakis, N.P., van der Veen, R., Oitzl, Ellis, B.J., Figueredo, A.J., Brumbach, B.H., & Schlomer, G.L. M.S., de Kloet, E.R., et al. (2011). Development of individ- (2009). Fundamental dimensions of environmental risk: the ual differences in stress responsiveness: an overview of impact of harsh versus unpredictable environments on the factors mediating the outcome of early life experiences. evolution and development of life history strategies. Human Psychopharmacology, 214, 141–154. Nature, 20, 204–268. Cohen, D. (1966). Optimizing reproduction in a randomly varying Ellis, B.J., Jackson, J.J., & Boyce, W.T. (2006). The stress environment. Journal of Theoretical Biology, 12,119–129. response system: universality and adaptive individual differ- Cooper, W.S., & Kaplan, R.H. (1982). Adaptive ‘coin-flipping’: ences. Developmental Review, 26, 175–212. a decision-theoretic examination of natural selection for Endler, J.A. (1986). Natural selection in the wild. Princeton, NJ: random individual variation. Journal of Theoretical Biology, Princeton University Press. 94, 135–151. Fawcett, T.W., Hamblin, S., & Giraldeau, L.-A. (2013). Crean, A.J., & Marshall, D.J. (2009). Coping with environ- Exposing the behavioral gambit: the evolution of learning mental uncertainty: dynamic bet hedging as a maternal and decision rules. Behavioral Ecology, 24,2–11. effect. Philosophical Transactions of the Royal Society B, 364, Figueredo, A.J., & Wolf, P.S.A. (2009). Assortative pairing and 1087–1096. life history strategy: a cross-cultural study. Human Nature, Dall, S.R.X., Giraldeau, L.-A., Ollson, O., McNamara, J.M., & 20, 317–330. Stephens, D.W. (2005). Information and its use by animals in Fischer, B., Van Doorn, G.S., Dieckmann, U., & Taborsky, B. evolutionary ecology. Trends in Ecology and Evolution, 20, (2014). The evolution of age-dependent plasticity. American 187–193. Naturalist, 183, 108–125. Del Giudice, M. (2012). Sex ratio dynamics and fluctuating Flynn, E.G., Laland, K.N., Kendal, R.L., & Kendal, J.R. selection on personality. Journal of Theoretical Biology, 297, (2013). Developmental niche construction. Developmental 48–60. Science, 16, 296–313. Del Giudice, M., Ellis, B.J., & Shirtcliff, E.A. (2011). The Frank, S.A. (2011). Natural selection. I. Variable environments Adaptive Calibration Model of stress responsivity. Neurosci- and uncertain returns on investment. Journal of Evolutionary ence and Biobehavioral Reviews, 35, 1562–1592. Biology, 24, 2299–2309. Dempster, E.R. (1955). Maintenance of genetic heterogeneity. Frank, S.A., & Slatkin, M. (1990). Evolution in a variable Cold Spring Harbor Symposium, 20,25–32. environment. American Naturalist, 136, 244–260. DeWitt, T.J., Sih, A., & Wilson, D.S. (1998). Costs and limits of Frankenhuis, W.E., & Del Giudice, M. (2012). When do plasticity. Trends in Ecology and Evolution, 13,77–81. adaptive developmental mechanisms yield maladaptive out- Dingemanse, N.J., Kazem, A.J.N., Reale, D., & Wright, J. comes? Developmental Psychology, 48, 628–642. (2010). Behavioural reaction norms: where animal personal- Frankenhuis, W.E., & de Weerth, C. (2013). Does early-life ity meets individual plasticity. Trends in Ecology and Evolu- exposure to stress shape, or impair, cognition? Current tion, 25,81–89. Directions in Psychological Science, 22, 407–412.

© 2015 John Wiley & Sons Ltd 266 Willem E. Frankenhuis et al.

Frankenhuis, W.E., & Panchanathan, K. (2011a). Balancing Keller, M.C., & Miller, G.F. (2006). Resolving the paradox of sampling and specialization: an adaptationist model of common, harmful, heritable mental disorders: which evolu- incremental development. Proceedings of the Royal Society tionary genetic models work best? Behavioral and Brain B, 278, 3558–3565. Sciences, 29, 385–452. Frankenhuis, W.E., & Panchanathan, K. (2011b). Individual Kim-Cohen, J., Caspi, A., Taylor, A., Williams, B., Newcombe, differences in developmental plasticity may result from R., et al. (2006). MAOA, maltreatment, and gene–environ- stochastic sampling. Perspectives on Psychological Science, ment interaction predicting children’s mental health: new 6, 336–347. evidence and a meta-analysis. Molecular Psychiatry, 11, 903– Frankenhuis, W.E., Panchanathan, K., & Barrett, H.C. (2013). 913. Bridging developmental systems theory and evolutionary King, O.D., & Masel, J. (2007). The evolution of bet-hedging psychology using dynamic optimization. Developmental Sci- adaptations to rare scenarios. Theoretical Population Biology, ence, 16, 584–598. 72, 560–575. Gillespie, J.H. (1974). Natural selection for within-generation Kingsolver, J.G., Hoekstra, H.E., Hoekstra, J.M., Berrigan, D., variance in offspring number. Genetics, 76, 601–606. Vignieri, S.N., et al. (2001). The strength of phenotypic Grafen, A. (1984). Natural selection, and in the natural populations. American Naturalist, selection. In J.R. Krebs & N.B. Davies (Eds.), Behavioural 157, 245–261. ecology: An evolutionary approach (2nd edn.) (pp. 62–84). Lachmann, M., & Jablonka, E. (1996). The inheritance of Oxford: Blackwell Scientific Publications. phenotypes: an adaptation to fluctuating environments. Graham, J.K., Smith, M.L., & Simons, A.M. (2014). Experi- Journal of Theoretical Biology, 181,1–9. mental evolution of bet hedging under manipulated envi- Lehmann, L., & Balloux, F. (2007). Natural selection on ronmental uncertainty in Neurospora crassa. Proceedings of fecundity variance in subdivided populations: kin selection the Royal Society B, 281, 20140706. meets bet hedging. Genetics, 176,1–17. Heim, C., Newport, D.J., Wagner, D., Wilcox, M.M., Miller, Leimar, O. (2009). Environmental and genetic cues in the A.H., et al. (2002). The role of early adverse experience and evolution of phenotypic polymorphism. Evolutionary Ecol- adulthood stress in the prediction of neuroendocrine stress ogy, 23, 125–135. reactivity in women: a multiple regression analysis. Depres- Levene, H. (1953). Genetic equilibrium when more than one sion and Anxiety, 15, 117–125. ecological niche is available. American Naturalist, 87, 331– Hoekstra, H.E., Hoekstra, J.M., Berrigan, D., Vignieri, S.N., 333. Hoang, A., et al. (2001). Strength and tempo of directional Lewontin, R.C., & Cohen, D. (1969). On population growth in selection in the wild. Proceedings of the National Academy of a randomly varying environment. Proceedings of the National Sciences, USA, 98, 9157–9160. Academy of Sciences, USA, 62, 1056–1060. Hopper, K.R. (1999). Risk-spreading and bet-hedging in insect McNamara, J.M., & Houston, A.J. (2009). Integrating function population biology. Annual Review of Entomology, 44, 535– and mechanism. Trends in Ecology and Evolution, 24, 670– 560. 675. Hopper, K.R., Rosenheim, J.A., Prout, T., & Oppenheim, S.J. McNamara, J.M., Trimmer, P.C., Eriksson, A., Marshall, J.A., (2003). Within-generation bet hedging: a seductive explana- & Houston, A.I. (2011). Environmental variability can select tion? Oikos, 101, 219–222. for optimism or pessimism. Ecology Letters, 14,58–62. Houston, A.I., & McNamara, J.M. (2005). John Maynard Manuck, S.B. (2010). The reaction norm in gene9environment Smith and the importance of consistency in evolutionary interaction. Molecular Psychiatry, 15, 881–882. game theory. Biology and Philosophy, 20, 933–950. Markowitz, H. (1952). Portfolio selection. Journal of Finance, 7, Huizink, A.C., Bartels, M., Rose, R.J., Pulkkinen, L., Eriksson, 77–91. C.J., et al. (2008). Chernobyl exposure as stressor during Maynard Smith, J. (1982). Evolution and the theory of games. pregnancy and hormone levels in adolescent offspring. Cambridge: Cambridge University Press. Journal of Epidemiology and Community Health, 62, e5. Maynard Smith, J., & Hoekstra, R. (1980). Polymorphism in a Jablonka, E., Oborny, B., Molnar, I., Kisdi, E., Hofbauer, J., varied environment: how robust are the models? Genetical et al. (1995). The adaptive advantage of phenotypic memory. Research, 35,45–57. Philosophical Transactions of the Royal Society B, 350, 133– Meyers, L.A., & Bull, J.J. (2002). Fighting change with change: 141. adaptive variation in an uncertain world. Trends in Ecology Kaiser, S., & Sachser, N. (2009). Effects of prenatal social stress and Evolution, 17, 551–557. on offspring development: pathology or adaptation? Current Moran, N.A. (1992). The evolutionary maintenance of alter- Directions in Psychological Science, 18, 118–121. native phenotypes. American Naturalist, 139, 971–989. Kaplan, R.H., & Cooper, W.S. (1984). The evolution of Morrissey, M.B., & Hadfield, J.D. (2012). Directional selection developmental plasticity in reproductive characteristics: an in temporally replicated studies is remarkably consistent. application of the ‘adaptive coin-flipping’ principle. Ameri- Evolution, 66, 435–442. can Naturalist, 123, 393–410. Nettle, D., Frankenhuis, W.E., & Rickard, I.J. (2013). The Kawecki, T.J., & Ebert, D. (2004). Conceptual issues in local evolution of Predictive Adaptive Responses in human life adaptation. Ecology Letters, 7, 1225–1241. history. Proceedings of the Royal Society B, 280, 20131343.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 267

Nettle, D., & Pollet, T.V. (2008). Natural selection for male Savolainen, O., Lascoux, M., & Merila,€ J. (2013). Ecological wealth. American Naturalist, 172, 658–666. genomics of local adaptation. Nature Reviews Genetics, 14, O’Connor, T.G., Ben-Shlomo, Y., Heron, J., Golding, J., 807–820. Adams, D., et al. (2005). Prenatal anxiety predicts individual Schlichting, C.D., & Pigliucci, M. (1998). Phenotypic evolution: differences in cortisol in pre-adolescent children. Biological A reaction norm perspective. Sunderland, MA: Sinauer. Psychiatry, 58, 211–217. Seger, J., & Brockmann, H.J. (1987). What is bet-hedging? In Olofsson, H., Ripa, J., & Jonzen, N. (2009). Bet-hedging as an P.H. Harvey & L. Partridge (Eds.), Oxford surveys in evolutionary game: the trade-off between egg size and evolutionary biology. Oxford: Oxford University Press. number. Proceedings of the Royal Society B, 276, 2963–2969. Sih, A., & Del Giudice, M. (2012). Linking behavioural syndromes Parker, G.A., & Maynard Smith, J. (1990). Optimality theory in and cognition: a behavioural ecology perspective. Philosophical evolutionary biology. Nature, 348,27–33. Transactions of the Royal Society B, 367, 2762–2772. Penke, L., Denissen, J.J.A., & Miller, G.F. (2007). The Siepielski, A.M., Gotanda, K.M., Morrissey, M.B., Diamond, evolutionary genetics of personality. European Journal of S.E., DiBattista, J.D., et al. (2013). The spatial patterns of Personality, 21, 549–587. directional phenotypic selection. Ecology Letters, 16, 1382– Pesonen, A.-K., Raikk€ onen,€ K., Strandberg, T.E., & 1392. Jarvenp€ a€a,€ A.-L. (2005). Continuity of maternal stress from Simons, A.M. (2009). Fluctuating natural selection accounts the pre- to the postnatal period: associations with infant’s for the evolution of diversification bet hedging. Proceedings positive, negative and overall temperamental reactivity. of the Royal Society B, 276, 1987–1992. Infant Behavior and Development, 28,36–47. Simons, A.M. (2011). Modes of response to environmental Philippi, T., & Seger, J. (1989). Hedging one’s evolution- change and the elusive empirical evidence for bet hedging. ary bets, revisited. Trends in Ecology and Evolution, 4,41–44. Proceedings of the Royal Society B, 278, 1601–1609. Pluess, M., & Belsky, J. (2009). Differential susceptibility to Simons, A.M. (2014). Playing smart vs. playing safe: the joint rearing experience: the case of childcare. Journal of Child expression of phenotypic plasticity and potential bet hedging Psychology and Psychiatry and Allied Disciplines, 50, 396–404. across and within thermal environments. Journal of Evolu- Pluess, M., & Belsky, J. (2010). Children’s differential suscep- tionary Biology, 27, 1047–1056. tibility to effects of parenting. Family Science, 1,14–25. Slatkin, M. (1974). Hedging one’s evolutionary bets. Nature, Pluess, M., & Belsky, J. (2011). Prenatal programming of 250, 704–705. postnatal plasticity? Development and Psychopathology, 23, Stamps, J., & Groothuis, G.G.T. (2010). The development of 29–38. animal personality: relevance, concepts and perspectives. Pluess, M., & Belsky, J. (2013). Vantage sensitivity: individual Biological Reviews, 85, 301–325. differences in response to positive experiences. Psychological Starrfelt, J., & Kokko, H. (2012). Bet-hedging – a triple trade- Bulletin, 139, 901–916. off between means, variances and correlations. Biological Potts, R. (1998). Variability selection in hominid evolution. Reviews, 87, 742–755. Evolutionary Anthropology, 7,81–96. Stearns, S.C. (2000). Daniel Bernoulli (1738): evolution and Proulx, S.R., & Day, T. (2001). What can invasion analyses tell economics under risk. Journal of Biosciences, 25, 221–228. us about evolution under stochasticity in finite populations? Stearns, S.C., Byars, S.G., Govindaraju, D.R., & Ewbank, D. Selection, 2,2–15. (2010). Measuring selection in contemporary human popu- Rajon, E., Desouhant, E., Chevalier, M., Debias, F., & Menu, lations. Nature Reviews Genetics, 11, 611–622. F. (2014). The evolution of bet hedging in response to local Svardal, H., Rueffler, C., & Hermisson, J. (2011). Comparing ecological conditions. American Naturalist, 184,E1–E15. environmental and genetic variance as adaptive response to Relyea, R.A. (2002). Costs of phenotypic plasticity. American fluctuating selection. Evolution, 65, 2492–2513. Naturalist, 159, 272–282. Van IJzendoorn, M.H., & Bakermans-Kranenburg, M. (2015). Rickard, I.J., Frankenhuis, W.E., & Nettle, D. (2014). Why are Genetic differential susceptibility on trial: meta-analytic childhood family factors associated with timing of matura- support from randomized controlled experiments. Develop- tion? A role for internal state. Perspectives on Psychological ment and Psychopathology, 27, Special Issue 1, 151–162. Science, 9,3–15. Van IJzendoorn, M., Bakermans-Kranenburg, M., Belsky, J., Richerson, P.J., Boyd, R., & Bettinger, R.L. (2001). Was Beach, S., Brody, G., et al. (2011). Gene-by-environment agriculture impossible during the Pleistocene but mandatory experiments: a new approach to find missing heritability. during the Holocene? A climate change hypothesis. American Nature Reviews Genetics, 12, 881. Antiquity, 66, 387–411. Van IJzendoorn, M.H., Belsky, J., & Bakermans-Kranenburg, Sadeh, A., Guterman, H., Gersani, M., & Ovadia, O. (2009). M.J. (2012). Serotonin transporter genotype 5HTTLPR as a Plastic bet-hedging in an amphicarpic annual: an integrated marker of differential susceptibility? A meta-analysis of child strategy under variable conditions. Evolutionary Ecology, 23, and adolescent gene-by-environment studies. Translational 373–388. Psychiatry, 2, e147. Salathe, M., Van Cleve, J., & Feldman, M.W. (2009). Evolution Walker, R.S., & Bailey, D.H. (2013). Body counts in lowland of stochastic switching rates in asymmetric fitness land- South American violence. Evolution and Human Behavior, 34, scapes. Genetics, 182, 1159–1164. 29–34.

© 2015 John Wiley & Sons Ltd 268 Willem E. Frankenhuis et al.

West-Eberhard, M.J. (2003). Developmental plasticity and We assume that safe-specialists in a safe environment evolution. New York: Oxford University Press. attain the same fitness as danger-specialists in a danger- White, A.E., Li, Y.J., Griskevicius, V., Neuberg, S.L., & ous environment (namely, 1+b). We also assume that Kenrick, D.T. (2013). Putting all your eggs in one basket: safe-specialists in a dangerous environment attain the life-history strategies, bet hedging, and diversification. Psy- same fitness as danger-specialists in a safe environment chological Science, 24, 715–722. (namely, 1c). This symmetry in payoffs justifies limiting Wilson, D.S. (1994). Adaptive genetic variation and human > < evolutionary psychology. Ethology and Sociobiology, 15, 219– our analyses to cases where p 1 p (with p 1 p, our 235. results would be a mirror image). Note that the higher p Wilson, D.S., & Yoshimura, J. (1994). On the coexistence of is, the more predictable the environment is across specialists and generalists. American Naturalist, 144, 692–707. generations from the parents’ viewpoint; if p=1p, the Wolf, M., van Doorn, G.S., & Weissing, F.J. (2008). Evolu- environmental state is completely unpredictable from tionary emergence of responsive and unresponsive personal- one generation to the next. ities. Proceedings of the National Academy of Sciences, USA, 105, 15825–15830. Wolf, M., van Doorn, G.S., & Weissing, F.J. (2011). On the Parameters of social responsiveness and behavioural consistency. Proceedings of the Royal Society B, 278, 440–448. w parental fitness Young, W.E., & Trent, R.H. (1969). Geometric mean approx- imation of individual security and portfolio performance. x fraction of safe-specialists (fixed type adapted to safe Journal of Financial and Quantitative Analysis, 4, 179–199. environment) Zuckerman, M. (1999). Vulnerability to psychopathology: A y fraction of danger-specialists (fixed type adapted to biosocial model. Washington, DC: American Psychological dangerous environment) Association. z1xy fraction of plastic type p probability of a safe environment (range: 0–1) Received: 5 July 2014 1p probability of a dangerous environment Accepted: 25 February 2015 b benefit to a fixed individual when fitting the local ecology (range: 0–1) c cost to a fixed individual when mismatched to local Appendix ecology (range: 0–1)

Bet-hedging model

Model set up Intragenerational spatial variation In spatially varying environments, the environment Parents can produce plastic offspring, who are adaptable, consists of different patches, each with a particular state achieving a payoff of 1, irrespective of the environmental (e.g. safe or dangerous). Within any generation, some state. Parents can also produce fixed offspring (special- offspring are born in one environmental state and other ists), who are less adaptable, achieving a high payoff in a offspring in another. In such environments, parental safe environment (1+b), and a low payoff in a dangerous fitness is given by the arithmetic mean across patches. environment (1c). Our goal is to compute the fraction of safe-specialists (x), danger-specialists (y), and plastic individuals (z) that Fitness function natural selection favors parents to produce. Each of these w = p(payoff in safe environment) + (1-p)(payoff in fractions ranges between 0 and 1. We are particularly dangerous environment) interested in the region where 0

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 269 of well-matched safe-specialists (i.e. the probability of a Fitness function safe environment multiplied by the benefit that safe- specialists obtain in this environment), and c(1p) represents the expected cost of mismatched safe-special- w ¼ð½1 x yÞð1Þþxð1 þ bÞþyð1 cÞ p ð4Þ ists (i.e. the probability of a dangerous environment ð1pÞ ½ð1 x yÞð1Þþxð1 cÞþyð1 þ bÞ multiplied by the cost that safe-specialists incur in this environment). In the third term, b(1p) represents We ask what values of x and y maximize parents’ fitness. parents’ expected benefit of well-matched danger-spe- Normally, we would find these values where the deriv- cialists (i.e. the probability of a dangerous environment ative of the fitness function w equals zero. However, the multiplied by the benefit that danger-specialists obtain in current case is exceptional, as shown below. Taking the this environment), and pc represents the expected cost of natural log: mismatched danger-specialists (i.e. the probability of a safe environment multiplied by the cost that danger- lnðwÞ¼p ln½þ1 x y þ xð1 þ bÞþyð1 cÞ specialists incur in this environment). ð5Þ ð Þ ½ þ ð Þþ ð þ Þ If both of these expectations are negative, then parents 1 p ln 1 x y x 1 c y 1 b maximize fitness (w) by producing x+y=0 (i.e. all plastic children). Simplifying within brackets: If the former expectation is positive and latter nega- lnðwÞ¼p lnð1 þ xb ycÞþð1 pÞlnð1 xc þ ybÞ tive, then parents maximize fitness by producing x=1 (i.e. ð Þ all safe-specialists). 6 If the former expectation is negative and latter positive, then parents maximize fitness by producing Taking the partial derivative with respect to x: y=1 (i.e. all danger-specialists). @lnðwÞ pb cð1 pÞ ¼ ð7Þ If both expectations are positive, then parents maxi- @x 1 þ xb yc 1 xc þ yb mize fitness by producing all offspring of the type that has the highest expectation (i.e. either x=1ory=1). Taking the partial derivative with respect to y: Thus, if environmental variation is exclusively spatial, @lnðwÞ bð1 pÞ pc parents always maximize their fitness by producing ¼ ð8Þ @y 1 xc þ yb 1 þ xb yc either all safe- or danger-specialists, or all plastic individuals, and never a mixture (of both specialist Both partial derivatives equal zero when: types, or of one or both of the specialist types and the pb cð1 pÞ plastic type); thus, in this scenario, we would expect no ¼ ð Þ þ þ 0 9 variation in plasticity due to bet-hedging. 1 xb yc 1 xc yb bð1 pÞ pc ¼ 0 ð10Þ 1 xc þ yb 1 þ xb yc Intergenerational temporal variation Rewrite as: u=1+xbyc and: v=1xc+yb: In environments that vary temporally between gener- pb cð1 pÞ ations (but which remain stable within generations), ¼ 0 ð11Þ u v the state of the environment changes simultaneously bð1 pÞ pc for all individuals across time. For instance, in one ¼ 0 ð12Þ generation all individuals may develop in a safe patch, v u their offspring in a dangerous patch, and so forth. We Reorganizing: assume that environmental states are independent and pbv ¼ð1 pÞcu ð13Þ identically distributed (IID) across generations. In this case, if generations are discrete and non-overlapping, pcv ¼ð1 pÞbu ð14Þ parental fitness is given by the average of sequential Solve for v: payoffs across generations. The average of n values  1 p c multiplied is the nth root of their product, their v ¼ u ð15Þ geometric mean. p b

© 2015 John Wiley & Sons Ltd 270 Willem E. Frankenhuis et al.

 1 p b monotonically increasing or decreasing, depending on v ¼ u ð16Þ p c the values of p, b and c. These conditions simultaneously hold when:   1 p c 1 p b Exploring the edges u ¼ u ð17Þ p b p c For each combination of parameter values (i.e. p, b, and ð1pÞ c), we want to know the associated fitness of the edge Dividing out p , multiplying both sides by bc, and reorganizing: solutions. The edge solution that yields the highest fitness for a given set will be favored by natural selection. c2u ¼ b2u ð18Þ We find the optimal edge solution for x between points = = = = = uðb2 c2Þ¼0 ð19Þ (x 0, y 0) and (x 1, y 0) by entering y 0 into our fitness equation ln(w). uðb þ cÞðb cÞ¼0 ð20Þ lnðwy¼0Þ¼p lnð1 þ xb 0cÞþð1 pÞlnð1 xc þ 0bÞ Formally, this equation has four solutions: u=0, b=c, ð28Þ b=c,andc=b. However, the last two of these are not biologically meaningful, so we ignore them. The second Taking the partial derivative with respect to x: solution is an implausible knife edge (b=c), and we will @lnðw ¼ Þ pb ð1 pÞc show later that differential plasticity is never uniquely y 0 ¼ ð Þ @ þ 29 favored in this condition. Thus, we will focus on the first x 1 xb 1 xc solution: Setting this partial derivative to zero and solving for x: ¼ þ ¼ ð Þ u 1 xb yc 0 21 ð Þ ^ pb c 1 p ¼ þ ¼ ð Þ xy¼0 ¼ ð30Þ v 1 xc yb 0 22 bc

Solving each for x: When 0\x^y¼0\1, differential plasticity is favored over yc 1 only plastic individuals, and over only one type of xb ¼ yc 1 ¼) x ¼ ð23Þ b specialist. This is when: yb þ 1 c c bc xc ¼ yb þ 1 ¼) x ¼ ð24Þ \p\ þ ð31Þ c b þ c b þ c b þ c Next,we find the optimal edge solution for y between These conditions simultaneously hold when: points (x=0, y=0) and (x=0, y=1) by entering x=0 into ln yc 1 yb þ 1 (w), and taking the partial derivative with respect to y: ¼ ð25Þ b c @lnðw ¼ Þ cp ð1 pÞb x 0 ¼ þ ð32Þ Multiplying both sides by bc, and reorganizing: @y 1 yc 1 þ yb 1 y ¼ ð26Þ Setting this derivative to zero, and solving for y: b c bð1 pÞpc y^ ¼ ¼ ð33Þ We obtain the corresponding x: x 0 bc c 1 ¼ ¼) ¼ ð Þ When 0\y^ \1, a mix of y and z is favored over only xb 1 x 27 x¼0 b c b c plastic individuals, and over only one type of specialist: This equilibrium x ¼ y ¼ 1 cannot be attained. If b>c, b bc b bc \p\ ð34Þ then x<0 and y<0; if c>b, then x>1andy>1. These results b þ c b þ c b þ c show that selection never simultaneously favors the optimal x and y (unless, perhaps, when b=c, discussed Finally, we find the optimal edge solution for x between later). If b6¼c, selection will take x and y to the edges of a points (x=1, y=0) and (x=0, y=1), by entering triangle, bounded by points: (x=0, y=0), (x=1, y=0), and z=1xy=0 (i.e. y=1x) into ln(w), and taking the (x=0, y=1). Within this triangle, the fitness function is partial derivative with respect to x:

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 271

the proof holds for all values of p. If taken into account, @lnðwz¼0Þ pðb þ cÞ ð1 pÞðb þ cÞ ¼ (1) and (2) strengthen DP’s superiority over MS. Barring @x 1 þ xb cð1 xÞ 1 xc þ bð1 xÞ the strategies converging on production of only Special- ð Þ 35 ists, DP beats MS.

Setting this derivative to zero, and solving for x: Differential plasticity is never favored if b >c pð1 þ bÞð1 pÞð1 cÞ x^ ¼ ¼ ð36Þ z 0 b þ c We can show that when b>c, DP never beats MS: ð1 þ xbÞpð1 xcÞ1 p ð1 þ xb ycÞpð1 xc þ ybÞ1 p When 0\x^ ¼ \1, a mix of specialists is favored over z 0 ð Þ either pure specialist: 43 1 c 1 þ b \ \ ð Þ If MS attains equal or higher fitness than DP in either þ p þ 37 2 b c 2 b c environment, its geometric mean fitness across environ- ments will also equal or exceed that of DP (in which case > Differential plasticity can be favored if c >b DP is never uniquely favored if b c): 1 þðx^ Þb 1 þðx Þb ðy Þc ð44Þ We know that natural selection will favor edge solutions; dp ms ms ð^ Þ ð Þ þð Þ ð Þ hence, we will consider differential plasticity (DP) as 1 xdp c 1 xms c yms b 45 producing safe-specialists and plastic individuals only (not danger-specialists). DP beats Mixtures of Specialists Deducting 1 from all sides: (MS) if: ð^ Þ ð Þ ð Þ ð Þ xdp b xms b yms c 46 ð1 þ xbÞpð1 xcÞ1 p [ ð1 þ xb ycÞpð1 xc þ ybÞ1 p ð^ Þ ð Þ þð Þ ð Þ xdp c xms c yms b 47 ð38Þ Solving for xms (note: the sign flips in the bottom Dividing by DP: equation):  p 1p ð^ Þ þð Þ 1 þ xb yc 1 xc þ yb xdp b yms c ð ÞðÞ 1 [ ð39Þ xms 48 1 þ xb 1 xc b ðx^ Þc þðy Þb dp ms ðx Þð49Þ Taking the natural log: c ms   1 þ xb yc 1 xc þ yb 0 [ p ln þð1 pÞln Combined in one line: 1 þ xb 1 xc ð40Þ ðx^ Þb þðy Þc ðx^ Þc þðy Þb dp ms ðx Þ dp ms ð50Þ b ms c ¼ ^ ^ Assume DP produces xdp xms yms (it can, because   ^ [ ^ Þ c b xms yms : ð^ Þþð Þ ð Þð^ Þþð Þ ð Þ  xdp yms xms xdp yms 51 þ þ b c [ 1 xb yc þð Þ 1 xc yb 0 p ln þð Þ 1 p ln ð Þ ð Þð^ Þ 1 x y b 1 x y c c xms xdp b ð Þ ð Þ 52 ð41Þ b yms c ^ ¼ þ Distributing terms in the denominators: Where xms>yms. Consider xdp 0 e, where e is tiny:   ð þ Þ 1 þ xb yc 1 xc þ yb c xms 0 e b ð Þ 0 [ p ln þð1 pÞln 53 1 þ xb yb 1 xc þ yc b yms c ð Þ c x e b 42 ms ð54Þ b yms c The left log term is negative because yc<yb. The right one because ybc, MS can always fulfill this condition (note: since of log terms <1 are steeper than that of log terms >1, and xms>yms, the left side of the equation always holds). Now (2) the left log term is weighted by p, which >1p. Thus, consider x^dp ¼ 1 e:

© 2015 John Wiley & Sons Ltd 272 Willem E. Frankenhuis et al.

  c b Arithmetic mean fitness, variance in fitness, and ð1 eÞþðy Þ ðx Þð1 eÞþðy Þ ð55Þ ms b ms ms c geometric mean fitness

In the article, we present the regions of parameter space Now yms can be taken down to a very small number such in which differential plasticity is uniquely favored in a that xms will be between the two ends. Note that yms must temporally fluctuating environment (Figure 1). Here, we be smaller than 0+e. To see this, consider yms=0+e present the arithmetic mean fitness, variance in fitness, (meaning that xms=1e): and geometric mean fitness, for three strategies (in   c b order): differential plasticity, mixtures of specialists, and ð1 eÞþð0 þ eÞ ð1 eÞð1 eÞþð0 þ eÞ b c only safe-specialists. We do not present a separate figure for pure generalists: its arithmetic and geometric mean ð56Þ   fitnesses are 1, and its fitness variance is 0, in the entire c b parameter space. ðeÞ 0 ðeÞ ð57Þ b c In the following three figures, the x-axis represents the additional benefit (b) fixed specialists obtain, compared

If, however, yms=0+d, where dc, differential term larger than one. Thus, if b>c, MS can always match plasticity is never favored. If c>b, differential plasticity or beat DP. can be favored, especially if c is large.

© 2015 John Wiley & Sons Ltd Bet-hedging by varying offspring plasticity 273

Differential Plasticity p = 0.6 p = 0.75 p = 0.9 1 G D GD.4 1.6 D 1 1 M 1.2 0.8 1 G 8 1 1. 1.4 0.6 2 1. M S Cost 0.4 1.2 M 1.4 S 0.2 1.6 S S 0 1

G D GD0 D 2 M 0 0.8 0. G 0 M 0.6 0.4 0.2 S Cost 0.4 M 0.2 0.4 S 0.2 S S 0 1

1 .2 1 G 1 D GD D 1.4 M 0.8 G M 1.6 0.6 1.2

1 M S

Cost 1.4 0.4 1.2 S

0.2 1.8 .4 S 1 S 1.6 0 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Benefit Benefit Benefit

Mixture of Specialists p = 0.6 p = 0.75 p = 0.9 1 1 0.6 G 1.2 1.4 0.8 D D D .6 M 0.8 G 1 0.8 1

G 1.8

0.6 1.2 1.2 M S 1 1.4 Cost 0.4 1.6 M S 0.2 1.4 S S 0 1 G D D D G 2 M 0.8 0.

G M 0.6 .2 S 0 Cost 0.4 M S 0.2 S S 0 1 1 G D D 2 1.4 0.6 D 1. M 0.8 G 1 0.8 G M 1.6 0.6 1 0.8 M S

Cost 1.2 .4 0.4 1 S 1.2

0.2 1.8 1.4 S S 1.6 0 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Benefit Benefit Benefit

© 2015 John Wiley & Sons Ltd 274 Willem E. Frankenhuis et al.

Pure Specialists p = 0.6 p = 0.75 p = 0.9 1

8

.

1.2 1 G D D GD 0.8 M

4

0.8 . 1 1 G M 1.2 0.8 0.6 4 1.

1 S

Cost 0.4 6 M 1. S 1.6 0.2 1.4 1 S 1.2 S 0

1 0.4 0.8 0. 6 0.2 D D GD M 0.8 0.6 G M 0.2 0.6 0.4 G S Cost 0.4 M S 0.2 0. 2 S S 0 1 0.2 0 0.4 0.2 0 0.8 0.6 0.4 0.2 0 0.4 0.6 D D .8 1 1 4 0.6 0 1.2 1. M 0.8 0.8 G 1.2 G D G 1.6 0.6 1 M M S Cost 1.4 0.4 1.2 S

0.2 1.8

.4 6 1 S S 1. 0 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Benefit Benefit Benefit

© 2015 John Wiley & Sons Ltd