ON PO Opinion is intended to facilitate communication between reader and author and reader and II reader. Comments, viewpoints or suggestions arising from published papers are welcome. N Discussion and debate about important issues in , e.g. theory or terminology, may II also be included. Contributions should be as precise as possible and references should be PO kept to a minimum. A summary is not required. ON

Entropy and diversity

Lou Jost, Ban˜os, Tungurahua, Ecuador ([email protected]).

Entropies such as the ShannonÁ/Wiener and GiniÁ/Simpson determine the identity of a sampled ; it is the indices are not themselves diversities. Conversion of these to mean depth of a maximally-efficient dichotamous key. effective number of species is the key to a unified and intuitive interpretation of diversity. Effective numbers of species derived Tothmeresz (1995), Ricotta (2003) and Keylock (2005) from standard diversity indices share a common set of intuitive have shown that most other nonparametric diversity mathematical properties and behave as one would expect of a indices are also generalized entropies. Entropies are diversity, while raw indices do not. Contrary to Keylock, the lack of concavity of effective numbers of species is irrelevant as long as reasonable indices of diversity, but this is no reason to they are used as transformations of concave alpha, beta, and claim that entropy is diversity. gamma entropies. The practical importance of this transformation In physics, economics, information theory, and other is demonstrated by applying it to a popular community similarity measure based on raw diversity indices or entropies. The standard sciences, the distinction between the entropy of a system similarity measure based on untransformed indices is shown to and the effective number of elements of a system is give misleading results, but transforming the indices or entropies fundamental. It is this latter number, not the entropy, to effective numbers of species produces a stable, easily interpreted, sensitive general similarity measure. General that is at the core of the concept of diversity in biology. overlap measures derived from this transformed similarity Consider the simplest case, a community consisting of S measure yield the Jaccard index, Sørensen index, Horn index of equally-common species. In virtually any biological overlap, and the Morisita /Horn index as special cases. Á context, it is reasonable to say that a community with sixteen equally-common species is twice as diverse as a community with eight equally-common species. Thus, What is diversity? when all species are equally common, diversity should be proportional to the number of species. It is natural to set The plethora of diversity indices and their conflicting the proportionality constant to unity, so that a commu- behavior has led some authors (Hurlbert 1971) to conclude that the concept of diversity is meaningless. nity with eight equally-common species has a diversity of Diversity is not meaningless but has been confounded eight species and a community with sixteen equally- with the indices used to measure it; a diversity index is common species has a diversity of sixteen species. The difference in behavior between an entropy and a diversity not necessarily itself a ‘‘diversity’’. The radius of a sphere S is an index of its volume but is not itself the volume, and is clear here. The Shannon entropy ai1pi logbpi using the radius in place of the volume in engineering (calculated using base b/2 for the logarithm) is 3.0 equations will give dangerously misleading results. This for the first community and 4.0 for the second commu- is what biologists have done with diversity indices. The nity; the entropy of the second community is not twice most common diversity measure, the ShannonÁ/Wiener that of the first. (For any choice of base b, if the entropy index, is an entropy, giving the uncertainty in the of the first community is x, the entropy of the second outcome of a sampling process. When it is calculated community is x/logb 2.) The entropy gives the uncer- using logarithms to the base two, it is the minimum tainty in the species identity of a sample, not the number number of yes/no questions required, on the average, to of species in the community.

Accepted 3 January 2006 Subject Editor: Heikki Seta¨la¨ Copyright # OIKOS 2006 ISSN 0030-1299

OIKOS 113:2 (2006) 363 S q This does not mean that Shannon entropy is a poor biology) are monotonic functions of ai1pi , or limits index of diversity; on the contrary, it is the most of such functions as q approaches unity. These include profound and useful of all diversity indices, but its value , Shannon entropy, all Simpson mea- gives the uncertainty rather than the diversity. If it is sures, all Renyi entropies (Renyi 1961, Pielou 1975), all chosen as a diversity index, then all communities that HCDT or ‘‘Tsallis’’ entropies (Keylock 2005; our termi- share a particular value of Shannon entropy are nology follows Czachor and Naudts 2002), and many equivalent with respect to their diversity (according to others. All such measures yield a single expression for this index). A diversity index thus creates equivalence diversity when the algorithm of the preceding section is classes among communities. In each of these equivalence applied to them: classes there is one community whose species are all S 1=(1q) equally common. The intuitive definition of diversity q qD p (1) just given applies to that community, showing that its i Xi1  diversity is equal to its number of species; all other communities in the equivalence class must also have this [Proof 1] These are often called ‘‘Hill numbers’’, but they same diversity. are more general than Hill’s (1973) derivation suggests. Finding the diversity of a community thus reduces to The exponent and superscript q may be called the the problem of finding an equivalent community (one ‘‘order’’ of the diversity; for all indices that are functions S q that has the same value of the diversity index as the of ai1pi , the true diversity depends only on the value of community in question) composed of equally-common q and the species frequencies, and not on the functional species. This is a matter of simple algebra: calculate the form of the index. This means that when calculating the diversity index for D equally-common species (each diversity of a single community, it does not matter species therefore with a frequency of 1/D), set the whether one uses Simpson concentration, inverse Simp- resulting expression equal to the actual value of the son concentration, the GiniÁ/Simpson index, the second- diversity index, and solve that equation for D. This value order Renyi entropy, or the HurlbertÁ/SmithÁ/Grassle of D is the diversity of the community according to the index with m/2; all give the same diversity: chosen diversity index. Table 1 gives the results of this S algorithm for some common diversity indices. The 2 2 D1= pi (2) number D has been called the ‘‘effective number of Xi1  species’’ by MacArthur (1965); in physics it is the The superscript 2 on the diversity indicates that this is a number of states associated with a given entropy, and diversity of order 2. in economics it is called the ‘‘numbers equivalent’’ of a The order of a diversity indicates its sensitivity to diversity measure (Patil and Taillee 1982). I will refer to common and rare species. The diversity of order zero it simply as the diversity. (q/0) is completely insensitive to species frequencies and is better known as species richness. All values of q less than unity give diversities that disproportionately Diversity of order q favor rare species, while all values of q greater than unity disproportionately favor the most common species Most nonparametric diversity indices used in the (Tsallis 2001, Keylock 2005). The critical point that sciences (including all generalized entropies used in weighs all species by their frequency, without favoring

Table 1. Conversion of common indices to true diversities.

Index x: Diversity in terms of x: Diversity in terms of pi:

S S 0 0 Species richness x pi x pi i1 i1 P S P S Shannon entropy x  pi ln pi exp(x) exp  pi ln pi     iP1 iP1 S S 2 2 Simpson concentration x pi 1/x 1= pi i1 i1 PS PS 2 2 GiniÁ/Simpson index x 1 pi 1/(1/x) 1= pi i1 i1 s P SP 1=(1q) q 1/(1q) q HCDT entropy x 1 pi =(q1) [(1/ (q/1)x)] pi       iP1 iP1 s S 1=(1q) Renyi entropy x ln pq =(q1) exp(x) pq  i   i  iP1 iP1

364 OIKOS 113:2 (2006) either common or rare species, occurs when q/1; Eq. 1 exp(Ha Hb)exp(Hg)(5a) is undefined at q/1 but its limit exists and equals so S (exp(H ))(exp(H ))exp(H )(5b) 1Dexp  p ln p exp(H)(3) a b g  i i Xi1 or This is the exponential of Shannon entropy, but it arises ()()() naturally here without any reference to information theory. The central role this quantity plays in biology, (5c) information theory, physics, and mathematics is not a Shannon or order 1 diversity thus necessarily follows matter of definition, prejudice, or fashion (as some Whittaker’s (1972) multiplicative law. The minimum biologists have claimed) but rather a consequence of its possible beta diversity is unity, which occurs when all unique ability to weigh elements precisely by their communities are identical. The maximum possible beta frequency, without disproportionately favoring either diversity is N, the number of communities; this occurs rare or common elements. Biologists would have dis- when all N communities are completely distinct and covered it and used it as their main diversity index even if equally weighted. Alpha and beta diversity are indepen- information theory did not exist. dent of each other regardless of the community weights. Equation 1 has the properties intuitively expected of a Keylock (2005), following Lande (1996), has criticized diversity. For all values of q it always gives exactly S when the use of diversities (Hill numbers) because they are applied to a community with S equally-common species. often not concave functions, and so alpha diversity For all values of q it also possess the ‘‘doubling’’ property might sometimes be greater than gamma diversity. This introduced by Hill (1973): suppose we have a community would be a valid criticism if we averaged individual of S species with arbitrary species frequencies diversities directly (w1D(H1)/w2D(H2)/...) to obtain q p1, ...pi, ...ps, with diversity D. Suppose we divide the alpha diversity; an alpha diversity calculated this way each species into two equal groups, say males and would indeed sometimes exceed the gamma diversity. females, and we treat each group as a separate ‘‘species’’. However, there is no theoretical justification for aver- Intuitively, we have doubled the diversity of the commu- aging diversities in this way. Diversities are not sub- nity by this reclassification, and indeed the diversity of stitutes for entropies but rather transformations of them the doubled community calculated according to Eq. 1 is after all entropic calculations (such as calculation of the q always 2/ D regardless of the values of the pi. [Proof 2] alpha entropy) have been done. The logic is analogous to working with variances when their mathematical proper- ties (such as additivity) are useful, and then converting Alpha, beta, and gamma diversities the result to a standard deviation at the end for comparison with experiment. Let H stand for any The Shannon entropies of multiple communities can be generalized entropy or diversity index, and let D(H) be averaged to give what is known in information theory as the function that transforms H into a true diversity. If the ‘‘conditional entropy’’, Ha, of the set of commu- the underlying entropies are concave, then Ha will be less nities. Because Shannon entropy is a concave function, than or equal to Hg (Lande 1996). Hence if the Ha is always less than or equal to the gamma entropy Hg, transformation function D(H) is monotonically increas- the entropy of the pooled communities (Shannon 1948, ing, the transformed alpha entropy D(Ha) will be less Lande 1996). Though Ha is often called the ‘‘alpha than or equal to the transformed gamma entropy D(Hg). diversity’’ in biology, it is of course really an entropy. It In the Shannon case, the function that converts entropy can be converted to the true alpha diversity by Eq. 3: H to diversity D is the exponential function, which is 1 Da /exp(Ha). Likewise the amount of information monotonically increasing: if x5/y, then exp(x)5/exp(y). provided by knowledge of the sample location is often Because Shannon entropy is concave, Ha is always less called the beta diversity in biology but is actually an than or equal to Hg, and so it follows that exp(Ha) is entropy. Like the alpha entropy, it can be converted to always less than or equal to exp(Hg). Shannon alpha the true beta diversity by Eq. 3. The same tranformation diversity is always less than or equal to Shannon gamma also converts gamma entropy to true gamma diversity. diversity, and the concavity of D(H) plays no role in this.

The relation between the Shannon alpha, beta, and The other commonly-used concave entropy is the GiniÁ/ gamma entropy follows directly from information the- Simpson index. If one defines the alpha GiniÁ/Simpson ory: index Ha of a set of communities as the average of the GiniÁ/Simpson indices of the individual communities (as H H H a b g is traditional in biology, though see below), then by

By converting both sides of this equation to true concavity Ha is always less than or equal to Hg (Lande diversities via Eq. 3, the relation between alpha, beta, 1996). This index is transformed to a true diversity by the and gamma diversity is obtained: function 1/(1/H) (which is obtained by the algorithm

OIKOS 113:2 (2006) 365 described in the first section). The GiniÁ/Simpson index L alpha diversity index=gama diversity index always lies within the interval [0, 1), and in this domain the Ha=Hg (6) transformation function 1/(1/H) is a monotonically increasing function of H. Hence if x and y are numbers (H here refers to any generalized entropy or diversity index, not necessarily the Shannon entropy.) This in the range [0, 1) and if x is less than or equal to y, 1/(1/x) similarity measure is designed to be used with species must be less than or equal to 1/(1/y). Since Ha is always richness, Shannon entropy, the GiniÁ/Simpson index, or less than or equal to Hg, the alpha diversity 1/(1/Ha) will therefore always be less than or equal to the gamma any other concave measure. It is intended to give the proportion of total regional ‘‘diversity’’ contained in the diversity 1/(1/Hg). The concavity of the transformation function is irrelevant. average single community, but it does not distinguish It is important to note that the alpha (or conditional) entropies from diversities, and this causes problems. When applied to N equally-weighted communities it entropy is not uniquely defined for the GiniÁ/Simpson index or other non-Shannon diversity measures (Taneja gives sensible results when used with species richness 1989, Yamano 2001). In physics, for example, the (which is after all a true diversity, the diversity of order currently accepted definition of the conditional zero). In this case the similarity measure has awell-defined maximum of 1.0 when the communities are all identical, GiniÁ/Simpson index is not w1H1/w2H2/ ...but 2 2 2 2 and awell-defined minimum of 1/N when the communities [(w1)H1/(w2)H2/...]/[w1/w2/...] (Tsallis et al. 1998, Abe and Rajagopal 2001). There are many other are completely dissimilar. However, when this same definitions in the literature (Taneja 1989). Each satisfies similarity measure is used with Shannon entropy or the a different subset of the theorems which apply to their GiniÁ/Simpson index, its value approaches unity as Shannon counterpart, and no definition satisfies them community diversity becomes large, no matter how dissimilar the communities! [Proof 3] Whittaker (1972) all. The traditional biological definition of GiniÁ/Simp- son alpha entropy agrees with the current physics first noted this problem in ratios of gamma and alpha definition only when community weights are equal. Shannon entropies. The inflation of similarity arises These ambiguities apply also to the definition of beta because most entropies, including Shannon entropy and for non-Shannon measures. Until these issues are the Gini-Simpson index, have a nearly flat slope when resolved at a deeper level, one should avoid the use of diversity is high; if a set of communities is very diverse, Ha and H will therefore be nearly equal, and as alpha the GiniÁ/Simpson index to calculate alpha and beta for g unequally-weighted samples or communities. The issues becomes large their ratio will be approach unity regardless involved are explained in more detail in Appendix 2. In of their similarity.This makes the measure uninterpretable the following applications we restrict ourselves to the as a summary statistic; a similarity of 0.999 using the case of equal weights. GiniÁ/Simpson index in Eq. 6 may mean communities are nearly identical, may mean they are moderately similar, or may mean they are completely distinct. This problem can be avoided by converting the Application entropies to true diversities before taking their ratio, obtaining: The distinction between an entropy and a true diversity q q M D(H )= D(H )(7) is not merely semantic; confusion between entropy and a g diversity has serious practical consequences. Entropies where qD(H) means the diversity associated with the and other diversity indices have a wide variety of ranges entropy H. The reciprocal of this formula has been and behaviors; if applied to a system of S equally proposed as a measure of beta diversity for the Shannon common species, some vary as S, some as log S, some case (MacArthur 1965), and for the GiniÁ/Simpson index as 1/S, some as 1Á/1/S, etc. Some have unlimited ranges (Olszewski 2005). The conversion to true diversities puts while others are always less than unity. A general these diversity indices on a common footing, eliminating formula or equation designed to accept raw indices is the erratic behavior of Eq. 6 when used with different likely to give reasonable results for some of these but not indices. Most importantly, because both alpha and for others. By calling all of these indices ‘‘diversities’’ and gamma diversities have the ‘‘doubling’’ property de- treating them as if they were interchangeable in formulas scribed earlier, their ratio is immune to the false inflation or analyses requiring diversities, we will often generate of similarity which is characteristic of a ratio of misleading results. Converting indices to true diversities entropies. Unlike a ratio of entropies, it has a fixed gives them a set of common behaviors and properties, minimum value (indicating complete dissimilarity) of permitting the development of truly general index- 1/N for N equally-weighted communities, whether used independent formulas and analytical techniques. with species richness, Shannon entropy, or the GiniÁ/ Consider for example a popular general similarity or Simpson index, and regardless of the alpha diversity of homogeneity measure (Lande 1996, Veech et al. 2002) the communities. [Proof 4] Thus we can easily tell based on raw diversity indices: whether a set of N equally-weighted communities is

366 OIKOS 113:2 (2006) distinct and we can meaningfully compare the similarity community, and the difference reaches or exceeds a values based on different indices and between different significance level of 0.001 for 63 of the 74 species in the sets of N communities (whose alpha diversities may published data. A reasonable similarity measure should differ). reveal this distinctness. However, the similarity measure The difference between the ratio of entropies and the Eq. 6, the ratio of entropies, gives an inflated similarity ratio of true diversities can be seen by applying each of value of 0.95 for the two communities using the GiniÁ/ them to two imaginary equally-weighted bird commu- Simpson index. Most ecologists looking at this 95% nities with no species in common: a community in Costa similarity value would wrongly assume that the forest is Rica and one in Indonesia, each with 50 equally common homogeneous in the vertical dimension and that the bird species. Since no species are shared between Costa canopy and understory communities are very similar in Rica and Indonesia, the pooled diversity is 100 equally- species composition. In contrast Eq. 7, the ratio of true common species. Equation 6 gives the proportion of total diversities, shows that the average community really pooled ‘‘diversity’’ contained in the average community as shares not 95% but only 59% of the total diversity, using 0.5 using species richness, 0.85 using the Shannon entropy, the GiniÁ/Simpson index. This value is close to the and 0.99 using the GiniÁ/Simpson index. Inflation is theoretical minimum of 50% for two completely dissim- apparent in the latter two values, which are close to unity ilar equally-weighted communities, showing that the (which wouldindicate maximalsimilarity) eventhoughthe canopy and understory communities are really very communities have no similarity at all. Furthermore, the distinct from each other. The authors of the butterfly similarity values vary erratically depending on the diver- study used additional methods to demonstrate the sity index used, even though the species probabilities in the importance of the canopy/understory distinction, con- individual and pooled communities are perfectly uniform, cluding that ‘‘the vertical dimension is a major structural with no dominance or concentration effects that could component in tropical forest butterfly communities’’ account for the differences. Since the value that indicates (Engen et al. 2002). The similarity measure based on complete dissimilarity depends in a complex way on the entropies, Eq. 6, hides this structure while the similarity alpha diversity and on the index used, these similarity measure based on true diversities, Eq. 7, highlights it. values cannot be directly compared between different sets Additive partitioning studies which rely exclusively on of communities and cannot be easily interpreted. Eq. 6 for the evaluation of their results need to be re- In contrast the similarity of these two completely examined using the corrected measure, Eq. 7. distinct communities using true diversities, Eq. 7, is 0.50 Other useful similarity measures can be derived from for all these diversity measures, reflecting the fact that if Eq. 7 for use with equally-weighted samples or commu- two communities are completely distinct and equally- nities of a region. The minimum value of Eq. 7 for N equally-weighted samples or communities, 1/N, ob- weighted the average community must contain half the viously depends on the number of samples or commu- total pooled diversity. When two equally-weighted com- nities in the region. The measure can be transformed munities are completely dissimilar, we would obtain this onto the interval [0,1] to facilitate comparisons between same value of 0.50 even if the communities each had very regions with different numbers of samples or commu- different numbers of species, and even if each species were nities. Since qD(H )/qD(H ) goes from 1/N to 1, the not equally-common, and no matter whether we used a g linear transformation diversities calculated from species richness, Shannon en- tropy, or the GiniÁ/Simpson index. The measure accurately S [ qD(H )= qD(H )1=N]=[11=N](8) and robustly indicates the degree of similarity without a g confounding this with total diversity. ranges from 0 to 1. This new measure equals zero when The ratio of true diversities not only has superior all the samples or communities are completely distinct, mathematical properties but is also more biologically equals unity when all are identical, and is linear in the meaningful than the ratio of entropies. This can be best proportion of total diversity contained in the average demonstrated on a real two-community system, the sample or community. When this measure is applied to canopy and understory fruit-feeding butterfly commu- two equally-weighted samples or communities, it sim- nities of a South American rain forest (DeVries and plifies to the Jaccard index (Jaccard 1902) when used

Walla 2001). The published raw data set (11696 indivi- with species richness, and to the MorisitaÁ/Horn index duals) is large enough to eliminate bias in the estimates of (Horn 1966) when used with the GiniÁ/Simpson index. diversity measures, and the statistical weights of the two [Proof 5] For the butterfly data mentioned above, it gives communities are within 1% of equality, making it an 0.18 with the GiniÁ/Simpson index. ideal test. The distinctness of the canopy versus unders- Sometimes the focus of a study is not on the amount tory butterfly communities is obvious from inspection of of homogeneity or shared diversity in a region but on the the published raw data (rearranged here as Table 2); on amount of species overlap between two communities. the average a species is more than 30 times more common This requires a direct comparison between communities, in its preferred community than in its nonpreferred whose original statistical weights are then irrelevant and

OIKOS 113:2 (2006) 367 Table 2. Canopy and understory butterfly communities in a tropical rain forest (modified from DeVries and Walla 2001). Butterfly species, along with their number of captures in the canopy and understory, are listed in descending order of abundance and include the 74 most common species (those with 8 or more captures). The canopy and understory butterfly communities are not similar to each other; nearly all species are much more abundant in one community than in the other.

Species Canopy Understory Species Canopy Understory

Historis acheronta 1882 26 Callicore hesperis 44 1 Panacea prola 1028 535 Opsiphanes invirae 40 5 Nessaea hewitsoni 19 984 Baeotus deucalion 40 1 Morpho achilles 5 751 Archaeprepona licomedes 1 39 Taygetis sp. 1 8 621 Taygetis mermeria 0 40 Colobura dirce 273 250 Memphis arachne 36 1 Bia actorion 8 426 Mycelia capenas 31 5 Catoblepia berecynthia 3 336 Paulogramma pyracmon 32 0 Historis odius 299 11 Diarthria clymena 29 0 Catoblepia xanthus 4 289 Catoblepia soranus 0 27 Panacea divalis 244 21 Chloreuptychia herseis 0 27 Caligo idomenius 2 242 Catonephele numilia 25 1 Catonephele acontius 176 67 Cithaerias aurorina 0 21 Callicore hystapes 201 1 Chloreuptychia arnaea 0 19 Panacea regina 182 14 Cissia erigone 0 19 Caligo eurilochus 2 184 Pareuptychia binocula 1 17 Memphis florita 136 33 Caligo teucer 0 17 Batesia hypochlora 36 113 Chloreuptychia hewitsonii 0 17 Morpho menelaus 1 131 Memphis polycarmes 15 1 Archaeprepona demophon 63 63 Mageuptychia nr. helle-1 97 Hamadryas arinome 108 14 Hamadryas feronia 15 0 Mageuptychia antonoe 117 3 Archaeprepona demophoon 14 0 Smyrna blomfildia 100 1 Hamadryas chloe 2 12 Tigridia acesta 29 65 Pierella lena 0 14 Opsiphanes cassina 72 18 Cissia myncea 58 Callicore cyllene 81 0 Mageuptychia analis 67 Haetera piera 0 77 Pierella astyoche 0 13 Prepona laertes 74 2 Taygetis virgilia 0 13 Hamadryas amphinome 72 3 Cissia proba 0 12 Hamadryas laodamia 60 0 Pareuptychia ocirrhoe 1 11 Caligo placidianus 0 57 Agrias sardanapolis 10 0 Opsiphanes quiteria 8 49 Baeotus amazonicus 90 Zaretis itys 50 4 Cissia terrestris 36 Temenis laothe 49 3 Prepona pylene 80 Taygetis sp. 3 0 51 Antirrhea avernus 08 Archaeprepona amphimachus 5 44 Chloreuptychia tolumnia 17 Taygetis sp. 2 0 49 Taygetis valentina 08 are taken to be equal in the calculation of alpha, beta, nities using the Gini-Simpson index is 18%. For order 2 and gamma. The most useful kind of overlap measure diversities Eqs. 8 and 9 necessarily give the same values. has been called a ‘‘true overlap measure’’ (Wolda 1981): For q other than 2.0 the values will differ between Eq. 8 when two communities each consist of S equally- and 9. common species, with C shared species, a true overlap measure gives the value C/S, the fraction of one community which overlaps with the other. A true overlap measure for HCDT entropies can be derived from Eq. 7, Conclusions since the (q-1)th power of the ratio of alpha to gamma diversities is linear in C/S. We can transform that ratio The confusion between entropy and diversity fuels much onto the interval [0,1] to obtain the true measure of of the debate about diversity measures, obscures patterns overlap: in field studies, and hinders theoretical progress. Most diversity indices are entropies, not diversities, and their Overlap (of order q) mathematical behavior usually does not correspond to q q q1 q1 q1 biologists’ theoretical or intuitive concept of diversity. [ D(Ha)= D(Hg)] [1=2] = 1[1=2] f g f g All indices can be transformed into true diversities, (9) which possess a uniform set of mathematical properties that accurately capture the diversity concept. [Proof 6] This yields the Sørensen index (Sørensen 1948) Conversion of indices to true diversities facilitates when q/0, the Horn index of overlap (Horn 1966) in the interpretation of results. If researchers find a pre-treat- limit as q approaches 1, and the MorisitaÁ/Horn index ment Shannon entropy of 4.5 and a post-treatment

(Horn 1966) when q/2. [Proof 7] For the butterfly data Shannon entropy of 4.1, they usually go no further than the overlap between canopy and understory commu- to say that the difference is small, and then fall back on

368 OIKOS 113:2 (2006) statistical tests to see if the difference is statistically extensive and nonextensive generalizations. Á/ Phys. Lett. A significant. It is more informative to give the diversities 298: 369Á/374. DeVries, P. and Walla, T. 2001. and community instead of the entropies; in this case, the diversities are 90 structure in neotropical fruit-feeding butterflies. Á/ Biol. J. species and 60 species respectively.It is now easy to see that Linn. Soc. 74: 1Á/15. the difference between pre- and post-treatment diversities Engen, S., Lande, R., Walla, T. et al. 2002. Analyzing spatial structure of communities using the two-dimensional Poisson is not small but enormous. The question of the real lognormal species abundance model. Á/ Am. Nat. 160: 60Á/ magnitude of the drop is important and is separate from its 73. statistical significance. It is essential to have informative, Hill, M. 1973. Diversity and evenness: a unifying notation and interpretable diversity and similarity measures, so we can its consequences. Á/ Ecology 54: 427Á/432. Horn, H. 1966. Measurement of ‘‘overlap’’ in comparative go beyond mere statistical conclusions. ecological studies. Á/ Am. Nat. 100: 419Á/424. It is especially useful to convert to true diversities when Hurlbert, S. 1971. The nonconcept of diversity: a critique and dealing with multiple kinds of indices. If a community has alternative parameters. Á/ Ecology 52: 577Á/586. Jaccard, P. 1902. Lois de distribution florale dans la zone alpine. a species richness of 100, a Shannon entropy of 3.91 and Á/ Bull. Soc. Vaudoise Sci. Nat. 38: 69Á/130. GiniÁ/Simpson index of 0.967, it is not obvious how these Keylock, C. 2005. Simpson diversity and Shannon-Wiener compare. Converting to diversities gives 100, 50 and 30 index as special cases of a generalized entropy. Á/ Oikos 109: 203Á/207. species; the big drops in the diversities as q increases Lande, R. 1996. and partitioning of species diversity, indicate a high degree of dominance in the community. If and similarity among multiple communities. Á/ Oikos 76: 5Á/ the communities had been completely without dominance 13. (all species equally likely) the diversities would have been Lewontin, R. 1972. The apportionment of human diversity. Á/ Evol. Biol. 6: 381Á/398. 100 species for all values of q. MacArthur, R. 1965. Patterns of species diversity. Á/ Biol. Rev. Shannon beta entropy is additive but this forces 40: 510Á/533. Shannon beta diversity to be multiplicative; both sides Olszewski, T. 2004. A unified mathematical framework for the measurement of richness and evenness within and among in the debate over the additivity or multiplicativity of communities. Á/ Oikos 104: 377Á/387. beta are correct, but one side is talking about entropy Patil, G. and Taillie, C. 1982. Diversity as a concept and its while the other is talking about diversity. measurement. Á/ J. Am. Statist. Ass. 77: 548Á/561. Each diversity index has unique properties that are Pielou, E. 1975. Ecological diversity. Á/ Wiley. Renyi, A. 1961. On measures of entropy and information. Á/ In: useful for specific applications (this is the real point of Neymann, J. (ed.), Proc. 4th Berkeley Symp. Math. Statist. Hurlbert’s criticism of diversity as a unified concept). Probabil. (Vol. 1). Univ. of California Press, pp. 547Á/561. The present paper is not intended to argue against the Ricotta, C. 2003. Parametric scaling from species relative abundances to absolute abundances in the computation appropriate use of raw indices in these index-specific of biological diversity: a first proposal using Shannon’s applications. However, since raw diversity indices exhibit entropy. Á/ Acta Biotheor. 51: 181Á/188. a wide variety of mathematical behaviors, they cannot all Shanon, C. 1948. A mathematical theory of communication. Á/ Bell System Tech. J. 27: 379Á/423, 623Á/656. give reasonable results when directly inserted into a Sørensen, T. 1948. A method of establishing groups of equal general diversity equation or formula. Converting raw amplitude in plant sociology based on similarity of species indices to true diversities (Hill numbers) makes possible content. Á/ K. dan Vidensk Selsk Biol. Skr. 5: 1Á/34. the construction of meaningful index-independent gen- Taneja, I. 1989. On generalized information measures and their applications. Á/ Adv. Electron. Electron Phys. 76: 327Á/413. eral equations, measures, or formulas involving diversity. Tsallis, C. 2001. Nonextensive statistical mechanics and thermo- dynamics: historical background and present status. Á/ In: Acknowledgements Á/ I acknowledge helpful talks with Abe, S. and Okamoto, Y. (eds), Nonextensive statistical Dr. Thomas Walla and Harold Greeney. This work was mechanics and its applications, series lecture notes in supported by grants from John and Ruth Moore to the physics. Springer-Verlag, pp. 3Á/98. Population Biology Foundation. Tsallis, C., Mendes, R. and Plastino, A. 1998. The role of constraints within generalized nonextensive statistics. Á/ Phys. A 261: 534Á/554. Tothmeresz, B. 1995. Comparison of different methods for diversity ordering. Á/ J. Veg. Sci. 6: 283Á/290. References Veech, J., Summerville, K., Crist, T. et al. 2002. The additive partitioning of species diversity: recent revival of an old Abe, S. and Rajagopal, A. 2001. Nonadditive conditional idea. Á/ Oikos 99: 3Á/9. entropy and its significance for local realism. Á/ Phys. Lett. Whittaker, R. 1972. Evolution and measurement of species A 289: 157Á/164. diversity. Á/ Taxon 21: 213Á/251. Aczel, J. and Daroczy, Z. 1975. On measures of information and Wolda, H. 1981. Similarity indices, sample size and diversity. their characterization. Á/ Academic Press. Á/ Oecologia 50: 296Á/302. Czachor, M. and Naudts, J. 2002. Thermostatistics based on Yamano, T. 2001. A possible extension of Shannon’s informa- Kolmogorov-Nagumo averages: unifying framework for tion theory. Á/ Entropy 3: 280Á/292.

OIKOS 113:2 (2006) 369 Appendix 1. Proofs is nonzero, so Hg separates into Note: H represents any generalized entropy or diversity S S index.  w1pi1 ln (w1pi1) w2pi2 ln (w2pi2)... Xi1 Xi1 Proof 1 which simplifies to

S S q If /H a (pi) is continuous and monotonic (hence ð i1 Þ w1 (pi1)(ln w1 ln pi1) invertible) and has a value x, then the algorithm in the Xi1 text to find its diversity gives: S w (p )(ln w ln p )... D 2 i2 2 i2 H (1=D)q x Xi1 Xi1  S S w p ln p (w ln w1) p  H(D×(1=D)q)(x) 1 i1 i1 1 i1 Xi1 Xi1 (1=D)(q1) H1(x) S S w2 pi2 ln pi2 (w2 ln w2) pi2 ... D[1=(H1(x))]1=(q1) Xi1 Xi1

S q 1 1 S q w1H1 (w1 ln w1)(1)w2H2 (w2 ln w2)(1) Since xH ai1(pi) ; H (x)H H ai1(pi)  S q ð Þ ð ð ÞÞ H H /ai1(pi) , so D is independent of the function H(. . .) a w and the diversity associated with any such diversity index where H is the entropy of the community weights. Thus can be expressed as: w Ha/Hg for completely distinct communities is Ha/(Ha/ S 1=(q1) S 1=(1q) Hw) or 1/(1/Hw/Ha). Hw is fixed by the weights so as Ha q q D 1 (p )  (p ) becomes large, Hw/Ha approaches zero and Ha/Hg  = i   i  Xi1 Xi1 approaches 1.

GiniÁ/Simpson case. When all communities are distinct Proof 2 (no shared species) only one of the pij in each term of S q Hg, If the original community had ai1(pi) equal to x, doubling each element of the sum while halving each S q 2 probability is equivalent to replacing each element pi by 1 (w1pi1 w2pi2 ...) ; 1q q i1 (2 )(pi ). Thus the sum for the doubled system is X 1q S q (2 )(x). Any monotonic function H/ ai1(pi) gives a is nonzero, so H separates into S q 1=1q ð Þ g diversity of ai1(pi) , by Proof 1. Thus the ð Þ 1q 1/(1q) S S diversity of the doubled system is [(2 )(x)] / 2 2 2 2 1/(1q) 1 w p  w p ... 2[x ]/2x. 1 i1 2 i2 Xi1 Xi1 

Proof 3 and so Ha/Hg can be written

S S As shown by Lande (1996) for concave diversity 2 2 1 w1pi1  w2pi2 ... measures, H / /H ;H/H is maximal and equals   = a B g a g Xi1 Xi1 unity when all communities are identical, and for a S S 2 2 2 2 given value of Ha,Ha/Hg is minimal when all commu- 1 w p  w p ...   1 il 2 i2  nities are distinct. Any Ha/Hg is therefore bounded Xi1 Xi1 between these values.The following proofs show that the lower bound approaches unity as alpha becomes where the numerator is the alpha component of the high; hence all Ha/Hg approach unity when alpha is GiniÁ/Simpson index. As the alpha diversity becomes high. large, each of the sums making up the numerator approach zero. The same sums appear in the denomi- Shannon case. When all communities are distinct, only nator (though with different weight coefficients) so one of the p in each term of H , ij g all the summations in both the numerator and

S denominator approach zero. Then Ha/Hg is (1/o)/  (w1pi1 w2pi2 ...) ln (w1pil w2pi2 ...) (1/d) and as o and d approach zero this approaches Xi1 unity.

370 OIKOS 113:2 (2006) S S 1=N pq  pq ... Proof 4   i1 i2 = Xi1 Xi1 S S 1=(1q) Proof 1 can be used to write the desired inequality as q q q q (1=N) pi1 (1=N) pi2 ... 1=N q q  Xi1 Xi1  1=N5 D(Ha)= D(Hg) 1 S S For q/1, D(H) is exp(H) by the algorithm in the text q q and also by taking the limit of the result of Proof 1 as q (1=N) pi1  pi2 ...   = approaches 1. When all communities are distinct and Xi1 Xi1 equally weighted, Hg separates into S 1=(1q) [(1=N)(p p ...)]q S S i1 i2  Xi1  (1=Npi1) ln (1=Npi1) (1=Npi2) Xi1 Xi1 which can be rewritten ln (1=Np )... Â i2 S S which simplifies to q q 15 pi1  pi2 ...  = S Xi1 Xi1 1=N (pi1)(ln 1=Nln pi1) S 1=(1q) i1 q X [(pi1 pi2 ...)] S Xi1  1=N (pi2)(ln 1=Nln pi2)... Xi1 S S 1=N p ln p (1=N ln 1=N) p  Case q /1. The above inequality can then be rewritten i1 i1 i1 Xi1 Xi1 S S S S S q q q 1=N p ln p (1=N ln 1=N) p ... [(pi1 pi2 ...)] ] pi1  pi2 ... i2 i2 i2 Xi1 Xi1 Xi1  Xi1 Xi1 (1=N)H (1=N ln 1=N)(1) with the inequality reversing because we are taking the 1 q (1=N)H2 (1=N ln 1=N)(1)... reciprocal of both sides. Each term (pi1/pi2/. . .) of the q left-hand summation is greater than or equal to pi1/ Ha N(1=N ln 1=N)Ha ln N q q pi2/ . . . since (x) is a convex function, so the inequality 1 so D(Hg)/ exp (Ha/ln N)/N exp (Ha). is true. 1 1 Hence the ratio D(Ha)/ D(Hg)/ exp (Ha)/N exp (Ha)/ 1/N. Case qB/1. For this case the proof is the same; the inequality can be written Proof 5

S S S q q q q q [ D(H )/ D(H )/1/N]/[1/1/N] for two equally- [(pi1 pi2 ...)] 5 pi1  pi2 ... a g   q q 1=(1q) Xi1 Xi1 Xi1 weighted communities is [( la= lg) 0:5]=[0:5]: For q/0 this becomes and this is true term by term since for this case (x)q is a concave function. S S (0:5) p0 (0:5) p0  i1 i2= Xi1 Xi1 Case q/1. The inequality can be proven by noting that q q S the limit of D(Ha)/ D(Hg) as q approaches 1 exists, so 0 q q [0:5 pi1 0:5 pi2) ]0:5 = 0:5 D(H )/ D(H ) is continuous at q/1; the above inequal- f g a g Xi1  ity holds since it holds for q/1 and qB/1.  [0:5 N 0:5 N ]=[N ]0:5 = 0:5 f 1 2 tot g f g We prove equality when the communities are  [N N ]N = N f 1 2 totg f totg completely distinct by noting that for q not equal to 1 (Nshared)=(N1 N2 Nshared)Nshared=Ntot the gamma diversity for distinct communities separates into which is the Jaccard index.

S S 1=(1q) For q/2 the measure can be written q q q q (1=N) pi1 (1=N) pi2 ...   S Xi1 Xi1 2  (0:5 pi1 0:5 pi2) q q  = so the ratio D(Ha)/ D(Hg) becomes Xi1

OIKOS 113:2 (2006) 371 S S [Ntot (N1 N2)]=[(N1 N2)=2] 2 2 (0:5) pi1 (0:5) pi2 0:5 = 0:5 f g [(N1 N2)Ntot]=[(N1 N2)=2]  Xi1 Xi1   S S S [Nshared]=[(N1 N2)=2]  (p p )2 p2  p2 1  i1 i2 = i1 i2 which is the Sørensen index. Xi1 Xi1 Xi1 For the Shannon case it is necessary to take the limit S S S of the measure as q approaches unity: 2 2 2 2  (pi1 pi2 2 pi1pi2) pi1  pi2  = S S Xi1 Xi1 Xi1 (0:5 p 0:5 p )q 0:5 pq S S  i1 i2 = i1 2 2 Xi1 Xi1 pi1  pi2 S Xi1 Xi1  0:5 pq 21q 121q : S S S i2 =f g Xi1  2 p p p2  p2  i1 i2= i1 i2 Xi1 Xi1 Xi1 The numerator and denominator approach zero as q approaches 1, so L’Hospital’s rule applies and the which is the MorisitaÁ/Horn index for two equally- quotient of derivatives is weighted communities. S q Proof 6 (0:5 pi1 0:5 pi2) ln (0:5 pi1 0:5 pi2) Xi1  If each community has S equally-common species and S S 1 q 0:5 pq 0:5 pq they share C species, Dg equals  i1 i2  Xi1 Xi1  q q 1=(1q) [(2S2C)(1=2S) C(1=S) ] S S 2 (1) 0:5 pq 0:5 pq q i1 i2 and Da equals  Xi1 Xi1 

S S 1=(1q) S S q q q q  (0:5) p (0:5) p 0:5 pi1 ln pi1 0:5 pi2 ln pi2 i1 i2 Â    Xi1 Xi1  Xi1 Xi1 [(0:5) S(1=S)q (0:5) S(1=S)q]1=(1q) S (0:5 p 0:5 p )q q 1=(1q) Â i1 i2 [S(1=S) ] S: Xi1  Then the similarity measures is [21q ln 2] 21q ln 2 =f g q q q1 q1 q1 [( Da= Dg) (1=2) ]=[1(1=2) ] which, in the limit as q approaches 1, equals  [ 2S2C (1=2S))q C(1=S)q]= f ð Þ S [S1q]21q = 121q g f g (0:5 pi1 0:5 pi2) ln (0:5 pi1 0:5 pi2) q 1q 1q   [(2S2C)(1=2) (1=S)C=S)]2 = 12 Xi1 f g f g S S  [(2S(1=2)q(1=S)2C(1=2)q(1=S))C=S]21q f g 0:5 pi1 ln pi1 0:5 pi2 ln pi2 ln 2 [ln 2] 1q  = 12 Xi1 Xi1 = f g  [21q 21q(C=S)C=S)]21q = 121q which is the Horn index of overlap for equally-weighted f g f g communities.  C=S (121q) = 121q f g f g The result for q/2 follows from Proof 5 since, for C=S: q/2, [( qD = qD )q1 (1=2)q1]=[1(1=2)q1] Proof 7 g a q q 1=(1q) [( Da= Dg) 0:5]=[0:5]: For q/0 the measure is

S S S Appendix 2. Definitions of alpha and beta 0 0 0 (0:5 pi1 0:5 pi2) (0:5) pi1 (0:5) pi2  =  The alpha and beta components of Shannon entropy Xi1 Xi1 Xi1 are well-understood; their analogues are widely used in 2 = 12 many disciplines and backed up by an immense amount  f g of mathematical theory. They possess the following  N = [N N ]=[2] 2 ff totg f 1 2 g g important properties:

372 OIKOS 113:2 (2006) 1) Alpha and beta both use the same metric. Beta vs Alpha 2) Alpha can never exceed gamma. 1.0 3) If all the communities in a region have the same Shannon entropy, say H0, then the alpha entropy of Beta component of the region is H . 0 0.75 Shannon entropy 4) Alpha and beta are independent.

The importance of Properties 1 and 2 has been emphasized by Lande (1996) and Lewontin (1972). 0.5

Property 3 expresses, in the most noncommital and Beta “Beta” component of Gini– general way possible, the notion that alpha is in some Simpson index using sense an average of the diversity indices of the individual additive definition communities. Property 4 is implicit in all uses of the 0.25 concepts of alpha and beta; a low value of alpha should not, by itself, force beta to be high, and vice versa. In biology the definitions of alpha and beta for Shannon entropy, 0.25 0.5 0.75 1.0 Ha w1H1 w2H2 ... (A1) Alpha

Ha Hb Hg; (A2) Fig. A1. Beta versus alpha for two equally-weighted commu- nities with no species in common. The definition Hb /Hg /Ha are often carried over directly to the GiniÁ/Simpson yields a beta component which is independent of the alpha index (Lande 1996, Veech et al. 2002). However, there is component when it is applied to the Shannon entropy, but not an essential difference between the Shannon entropy and when it is applied to the GiniÁ/Simpson index. the GiniÁ/Simpson index; the Shannon entropy is not bounded above, while the GiniÁ/Simpson index always Consider any two distinct communities with no species lies between zero and one. This means that both the in common, such as communities A and B in Table A1. alpha and gamma GiniÁ/Simpson indices have values Suppose a region consists of communities A and B with very close to unity when communities are highly diverse. equal statistical weights. The alpha Shannon entropy is Therefore whenever alpha is high, beta defined through 0.9764 and the beta Shannon entropy is 0.6931; the true Eq. A2 must always be close to zero. Alpha and beta beta diversity is therefore exp (0.6931) which is 2.00, defined by this equation are not independent but correctly indicating that there are two distinct commu- inversely related. This violation of property 4 is graphi- nities. One could do the same calculation for commu- cally demonstrated in Fig. A1, which contrasts the nities A and C, obtaining a different alpha Shannon behavior of the Shannon and GiniÁ/Simpson beta entropy, 1.488, but the same beta Shannon entropy, components defined by Eq. A2. 0.6931 since these are also two distinct communities. The Property 1 is also violated when beta is defined by same could be done for communities B and C, or any Eq. A2. To understand what it means for alpha and beta other pair of completely distinct communities; regardless to have the same metric, consider how Shannon entropy of their alpha, their beta Shannon entropy will always be works. When there are two equally likely alternatives, 0.6931. Beta is independent of alpha and characterizes Shannon entropy (using natural logarithms) is always the degree of overlap of the communities, using the same 0.6931; whether this number comes from a calculation of metric as alpha. (Had there been two equally-common alpha or of beta makes no difference in its interpretation. species in each community, alpha would have been If there are two distinct equally-likely species in each 0.6931; the same metric applies to both.) community, alpha will be 0.6931, and if there are two Doing the same calculations with the GiniÁ/Simpson completely distinct equally likely comunities in the index gives a ‘‘beta’’ (as defined by Eq. A2) of 0.2050 for region, beta will be 0.6931. Alpha and beta share the communities A and B, but 0.1550 for communities A same metric and so their contributions to total diversity and C, and 0.1000 for communities B and C. The degree can be reasonably compared (by converting them to true of overlap is confounded with the alpha diversity. diversities as shown in the main text). When the GiniÁ/ Converting the ‘‘beta’’ values to true diversities yields Simpson index is applied to a system with two equally- 1.258, 1.183, and 1.111, which do not have obvious likely alternatives, it gives a value of 0.5. Yet if beta is interpretations. defined through Eq. A2, when there are two equally- It can be proven that the proper way to combine two likely completely distinct communities beta does not independent components of a GiniÁ/Simpson index is usually equal 0.5. In fact beta has no fixed value in this not through Eq. A2 but through situation; its value depends on alpha and cannot be interpreted without knowledge of alpha. Ha HaHb Hb Hg (A3)

OIKOS 113:2 (2006) 373 Table A1. Alpha, beta, and gamma indices for various hypothetical regions each consisting of two equally-weighted completely distinct communities. Alpha differs between pairs of communities, but beta Shannon entropy is independent of alpha and is identical for any pair of equally-weighted completely distinct communities; Shannon beta diversity always equals 2.00 for such pairs. The GiniÁ/Simpson index behaves irregularly when beta is defined by Eq. A2 but behaves like Shannon entropy when beta is properly defined by Eq. A3.

Species: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

Community A: 0.6 0.4 00000000000000 Community B: 0 0 0.3 0.2 0.4 0.1 0000000000 Community C: 0 0 0 0 0 0 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1 0.1

A/BA/CB/C

Shannon Ha 0.9764 1.487 1.791 Shannon Hg 1.6696 2.181 2.484 GiniÁ/Simpson Ha 0.5900 0.6900 0.8000 GiniÁ/Simpson Hg 0.7950 0.8450 0.9000 Shannon H /H /H 0.6931 0.6931 0.6931 1 b g a D(Hb)/exp(Hb) 2.000 2.000 2.000 GiniÁ/Simpson ‘‘H ’’ from H /H 0.2050 0.1550 0.1000 2 b g a D(‘‘Hb’’)/1/(1-‘‘Hb’’) 1.258 1.183 1.111 GiniÁ/Simpson H from Eq. A3 0.5000 0.5000 0.5000 2 b D(Hb)/1/(1/Hb) 2.000 2.000 2.000

(Aczel and Daroczy 1975, Tsallis 2001, Keylock 2005). requirement that the alpha component is some kind of This expression not only allows alpha and beta to be ‘‘average’’ of the diversities of the individual commu- independent, but also ensures that alpha and beta use nities). Therefore for this case the same metric, as demonstrated below. Comparison of 2 2 l(w1lw2l...)=lb distinct (A9) Eq. A2 and A3 shows that the ‘‘Hb’’ calculated by Eq. A2 is really not H but H /H H , explaining the shape b b b a and by factoring out l, the value of lb distinct is deter- of the graph of GiniÁ/Simpson ‘‘Hb’’ in Fig. A1 (its mined: y-intercept is Hb and its slope is /Hb). 2 2 It remains to find the correct expression for GiniÁ/ lb distinct (w1 w2 ...)(A10) Simpson alpha, given its independence from beta. The Equation A10 can be substituted into Eq. A8 to yield an alpha Gini-Simpson index can be written 1/la, beta q expression for la: can be written 1/lb, and gamma can be written 1/lg. Then Eq. A3 can be written: 2 2 2 2 la(w1l1 w2l2 ...)=(w1 w2 ...): (A11) (1 )(1 )(1 )(1 )(1 )(A4) la la lb lb lg Because la is by assumption independent of the beta component (which measures the distinctness of the which simplifies to communities), this result cannot depend on the beta la lg=lb: (A5) component and must therefore apply even when the communities are not distinct. The alpha component of The gamma component Hg is by definition the GiniÁ/ the GiniÁ/Simpson index is therefore: Simpson index applied to the pooled communities, so that H 1[(w2l w2l ...)=(w2 w2 ...)]: (A12) S a 1 1 2 2 1 2 2 lg  [(w1pi1 w2pi2 ...) ]: (A6) or, in terms of Hj, Xi1 H (w2H w2H ...)=(w2 w2 ...): (A13) When all communities are completely distinct (no shared a 1 1 2 2 1 2 species), this reduces to This turns out to be the same definition of alpha (conditional) entropy recently proposed for this index in N 2 physics (Abe and Rajagopal 2001). Note that when lg distinct  wj lj (A7) Xj1 community weights are all equal, this reduces to the normal biological definition of the alpha GiniÁ/Simpson 2 where lj is pij for the jth community. Inserting this result index. into Eq. A5 gives an expression for la when all commu- These new results can be applied to the communities nities are distinct: in Table A1. For all pairs of distinct communities, 2 2 regardless of the value of H ,H is now 0.500, as l (w l w l ...)=l : (A8) a b a distinct 1 1 2 2 b distinct expected. Converting this to a true diversity gives 2.00 in When all communities have identical values of l, so that each case, just as with the Shannon entropy, correctly lj /l for all j, la distinct must also equal l by property 3 (the indicating that there are two distinct communities in

374 OIKOS 113:2 (2006) each region. Alpha and beta are now truly measured in example, let community A, with weight 0.7, contain the same metric, and alpha is independent of beta, so three species with frequencies {0.5, 0.2, 0.3} and com- beta can be interpreted on its own. In addition, the munity B, with weight 0.3, contain the same species with numbers equivalents of Ha,Hb, and Hg now follow frequencies {0.9, 0.05, 0.05}. Alpha is 0.55 while gamma Whittaker’s law, alpha diversity times beta diversity is 0.54. This violates property 2. Olszewski (2004), using equals gamma diversity, just as in the Shannon case. a very different approach, has also noted counterintui- (This follows at once from Eq. A5 and the transforma- tive behavior in this index when community weights are tion function D(HGini Á Simpson)/1/l.) unequal (see his Fig. 6). The unavoidable conclusion of

When weights are equal, therefore, Properties 1Á/4 are the present derivation is that the GiniÁ/Simpson index all satisfied by the components of the GiniÁ/Simpson cannot be decomposed into meaningful alpha and beta index, as long as its Hb is defined by Eq. A3. In this case components (i.e. components that possess Properties 1Á/ the alpha component is the same as that traditionally 4) when the community weights are unequal. A paper in used in biology. However, when community weights are preparation will generalize the above considerations to unequal, the alpha component defined by Eq. A12 (the most non-Shannon diversity indices. only definition that makes alpha and beta independent) For additional explanation and worked examples, see is sometimes greater than the gamma component. For the author’s website, www.loujust.com.

OIKOS 113:2 (2006) 375