<<

Journal of Systematics and Evolution 46 (3): 349–374 (2008) doi: 10.3724/SP.J.1002.2008.08056 (formerly Acta Phytotaxonomica Sinica) http://www.plantsystematics.com

Tracking character evolution and biogeographic history through time in —Does choice of methods matter? Qiu-Yun (Jenny) XIANG* David T. THOMAS (Department of Biology, State University, Raleigh, NC 27695, USA)

Abstract This study compares results on reconstructing the ancestral state of characters and ancestral areas of distribution in Cornaceae to gain insights into the impact of using different analytical methods. Ancestral charac- ter state reconstructions were compared among three methods (parsimony, maximum likelihood, and stochastic character mapping) using MESQUITE and a full Bayesian method in BAYESTRAITS and inferences of ancestral area distribution were compared between the parsimony-based dispersal-vicariance analysis (DIVA) and a newly developed maximum likelihood (ML) method. Results indicated that among the six and char- acters examined, “perfect” binary characters (no homoplasy, no polymorphism within terminals, and no missing data) are little affected by choice of method, while homoplasious characters and missing data are sensitive to methods used. Ancestral areas at deep nodes of the phylogeny are substantially different between DIVA and ML and strikingly different between analyses including and excluding at three deepest nodes. These results, while raising caution in making conclusions on trait evolution and historical biogeography using conventional methods, demonstrate a limitation in our current understanding of character evolution and biogeography. The biogeographic history favored by the ML analyses including fossils suggested the origin and early radiation of likely occurred in the late and earliest Tertiary in Europe and intercontinental disjunctions in three lineages involved movements across the North Atlantic Land Bridge (BLB) in the early and mid Tertiary. This result is congruent with the role of NALB for post- migration and in connecting tropical floras in and , and in eastern and South America. However, alternative hypotheses with an origin in eastern Asia and early Trans-Beringia migrations of the cannot be ruled out. Key words AReA, BAYESTRAITS, BEAST, biogeography, chromosome evolution, Cornaceae, fruit and inflores- cence evolution, LAGRANGE, MESQUITE, divergence time.

Reconstructing ancestral states is a powerful hood (ML) (Harvey & Pagel, 1991; Schluter et al., method to understand the pathways and pattern of 1997; Pagel, 1999), Bayesian inference (Huelsenbeck character evolution, which holds the key to our under- et al., 2000, 2001; Ronquist 2004; Pagel et al., 2004), standing of evolutionary processes (Cunningham, and stochastic character mapping (SCM) (Nielsen, 1999; Maddison & Maddison, 1992, 2001; Ronquist, 2002; Huelsenbeck et al., 2003). Until approximately 2004). During the past decade, character mapping on a five ago, the easy use of the parsimony method phylogeny has become more and more a common implemented in MacClade 3.0 and 4.03 (Maddison & component in research articles to test evolutionary Maddison, 1992, 2001) had made it the most useful hypotheses, as well as to understand evolution of approach for tracking evolutionary history of mor- organism traits and behaviors, and more recently of phological characters (see Huelsenbeck et al., 2003; genes, genomes, and gene functions (Blanchette et al., Ronquist, 2004). The recent development of other 2004; Ouzounis, 2005; e.g., Rossnes et al., 2005; methods and their automation in computer packages Renner et al., 2007; Kondo & Omland, 2007; Zhang et provided alternative methods that are more robust al., 2008; Ekman et al., 2008; also see Ronquist, and realistic for users to study character evolution and 2004). A number of analytical methods have been overcome drawbacks of the parsimony method [e.g., developed for reconstructing ancestral states of char- ML and stochastic character mapping in MESQUITE acters, which may be divided into four major catego- 2.01 (Maddison & Maddison, 2007), the ML method ries: parsimony (MP) (Maddison & Maddison, 1992, in BAYESTRAITS 1.0 (Pagel & Meade, 2006), the 2001, 2007; Rossnes et al., 2005), maximum likeli- Bayesian methods in BAYESTRAITS 1.0 (Pagel & Meade, 2006) and SIMMAP (Bollback, 2006)]. Both ——————————— ML and SCM in MESQUITE 2.01 apply stochastic Received: 17 April 2008 Accepted: 4 May 2008 * Author for correspondence. E-mail: [email protected]; models of character state change and can explicitly Tel.: 919-515-2728; Fax: 919-515-3436. accommodate uncertainty in ancestral states. The 350 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Bayesian approach in BAYESTRAITS and SIMMAP character evolution. During the past decade, the estimates the instantaneous rates of character change parsimony-based dispersal-vicariance analysis (DIVA; using maximum likelihood and accommodates phy- Ronquist, 1996, 1997) has been the most widely used logenetic uncertainty by evaluating the ancestral approach in phylogenetic biogeography (e.g., Xiang & character state on sampled from the posterior Soltis, 2001; Donoghue & Smith, 2004; Sanmartín & distribution. In previous practice, most authors applied Ronquist, 2004; Xiang et al., 2006; Hines, 2008) due one or the other method in their studies, and few used to many advantages of the method, e.g., permitting multiple approaches to explore the differences of inference of explicit vicariant and dispersal events, results derived from different methods. Recently, their relative timing, and dispersal directions, as well Pedersen et al. (2007) employed maximum parsimony as the availability of fast and user-friendly software in MacClade 4.03 (Maddison & Maddison, 2001) and (DIVA 1.1, Ronquist, 1997). However, the method Bayesian construction with BAYESTRAITS 1.0 (Pagel inherits the intrinsic principle of parsimony, and & Meade, 2006) in the moss Bryaceae and therefore it may underestimate character transforma- found largely congruent results. In contrast, Ekman et tion events (i.e., dispersal). Although some studies al. (2008) found striking differences among parsi- applied the character model of likelihood methods for mony, maximum likelihood, stochastic character biogeographic studies to estimate the likelihood of mapping, and Bayesian reconstructions in their study ancestral areas (Nepokroeff et al., 2003), the models of the fungi group Ascus (Lecanorales). It is unclear to of character evolution are not appropriate for geo- what extent the congruence or incongruence is af- graphic ranges (see Ree & Smith, 2008). As a re- fected by phylogenetic uncertainty and the nature of sponse to the need of more and better analytical the characters under consideration. Explicit knowl- biogeographic methods, Ree and colleagues recently edge of how reconstructions change under different developed a likelihood-based approach for inference methods (and how this depends on the type of charac- of ancestral geographic areas (Ree et al., 2005; Ree & ter) would be useful in guiding the choice of methods Smith, 2008). The newly developed method represents for a study. It is also particularly important to a significant advance in biogeographic methodology re-evaluate our previous understanding of character by implementing a maximum likelihood statistical evolution based on a single method, especially using model that includes information from biological and parsimony alone. Inconsistent results from different abiotic factors in calculating the likelihood of bio- methods would suggest that many previous studies on geographic pathways for a given phylogenetic and character evolution based on a single method may the distributions of taxa. For example, the rate of need to be reevaluated. This is especially relevant to dispersal and local , lineage surviving time, studies published before the newer methods (e.g., ML, and the probabilities of dispersal between geographic Bayesian, and SCM) became available. In the present ranges at different geological times (Ree et al., 2005; study, we used BAYESTRAITS v 1.0 (Pagel & Moore et al., 2006) can all be used in the estimation. Meade, 2006) to reconstruct evolutionary histories of Their methods model geographic range evolution by six key discrete morphological traits in Cornaceae and stochastic dispersal, local extinction, and speciation in compared the results with those from MP, ML, and a set of discrete areas in continuous time. Unlike the SCM implemented in MESQUITE 2.01(Maddison & character models, a single taxon is allowed to have Maddison, 2007) to gain more insight into the sensi- more than one character state (e.g., a wide distribution tivity of “character mapping” to the choice of meth- in two or more areas), as a result of direct dispersal ods. (Ree et al., 2005). A recent version of the method of Inferring the biogeographic histories (e.g., origin, Ree and Smith (2008) uses instantaneous rates of persistence, dispersal, and extinction) of lineages is dispersal and local extinction to significantly improve analogous to reconstruction of character evolutionary the computation time for parameter estimation previ- histories in many ways (but see below) and is funda- ously done using a simulation approach in Ree et al. mental to understanding the origin and evolution of (2005). This makes it feasible for ancestral area the modern distribution of biodiversity. However, reconstruction using ML optimization analogous to comparing to the study of character histories, there are character mapping. In the present study, we apply the fewer quantitative statistical methods available for methods of Ree et al. (2005) and Ree & Smith (2008) historical biogeography. The present practice of to Cornaceae and compare the results from those tracking biogeographic histories on phylogeny is, in derived from DIVA. Knowledge of congruences and this regard, facing more challenges than studying discordance between the results from DIVA and the XIANG & THOMAS: Evolution & biogeography of Cornaceae 351

ML analysis is particularly useful for evaluating our patterns involving intercontinental disjunctions in previous understanding of lineage biogeographic eastern Asia, North America, Europe, tropical eastern histories and synthesis on global biogeography based Africa, and South America (Fig. 1; Thorne, 1972; on DIVA (Donoghue & Smith, 2004; Sanmartín et al., Boufford & Spongberg, 1983; Wu, 1983; Milne & 2001; Sanmartín & Ronquist, 2004). Abbott, 2002; Donoghue & Smith, 2004; also see The dogwood family Cornaceae is the key mem- Xiang et al., 2006; Xiang et al., in review). With an ber of , a lineage occupying a special position excellent record that includes fruit stones that on the tree of life of angiosperms for being basal in can be reliably identified into subgroups (Eyde, 1988; the Asteridae (APG, 2003; Soltis et al., 2005). Manchester, 1994; Crane et al., 1990; Manchester et The family has been circumscribed differently among al., in press) and a well-resolved, robust phylogeny, authors (Eyde, 1988; Takhtajan, 1987; see Xiang et the genus is ideal for a comprehensive biogeographic al., 1993; APG, 2003; Fan & Xiang, 2003; Soltis et analysis and provides a good model for exploring al., 2005). Here we take the narrow concept proposed the impacts of different analytical methods and fossils by Takhtajan (1987) that Cornaceae contains only in phylogenetic biogeography. Xiang et al. (2006) Cornus L. s. l., based on the following reasons. First, proposed, based on results of DIVA and fossils, that the closest genus to Cornus is Craib, the genus evolved and early diversified into the four based on results of molecular phylogenetic studies major in Europe followed by mutiple intercon- (Chase et al., 1993; Xiang et al., 1998, 2002; Fan & tinental migrations mostly across the North Atlantic Xiang, 2003). Second, Alangium has long been recog- Land Bridge. The migration routes in three of the four nized as a family, Alangiaceae DC. (de Candolle, disjunct lineages however remained to be evaluated 1828). Third, the name of Alangiaceae has been with divergence time information not available in proposed to be conserved (Bullock, 1959; APG, 2003). Xiang et al. (2006). In this study, the biogeographic Cornaceae s.s. consists of approximately 50–55 history of Cornus is reconstructed using the likeli- that are morphologically diverse. Species of hood-based method of Ree et al. (2005) and Ree & the genus exhibit striking variation in inflorescence Smith (2008). The ancestral areas at deep nodes are architecture and display, fruit type, structure, and compared to those inferred from DIVA (Ronquist, color, among other attributes (see Xiang et al., 2006). 1997). The impacts of fossils and different constraints As a consequence of the morphological heterogeneity, on maximum areas are also explored for each method. the species have been grouped in various ways, in Furthermore, the results from DIVA and ML are also multiple genera, subgenera, or sections during the compared with an analysis treating distributions as a past century (see Eyde, 1988; Murrell, 1993, 1996; single, multistate character in BAYESTRAITS. Xiang et al., 1996; Fan & Xiang, 2001; Xiang & The major goal of this study is to understand the Boufford, 2005). Recent molecular phylogenetic morphological evolution of Cornus in space and time studies reconstructed a well resolved, robust species using the most recent and best method available. A phylogeny for Cornus and resolved the species into second goal of the study is to gain new insights on the four major clades (Xiang et al., 2006; Xiang et al., in influences of method choice in reconstructing the review, available upon request). These are: (1) the ancestral state of characters and ancestral areas of blue-or-white fruited group (BW); (2) the cornelian biogeographic distributions. cherries (CC), (3) the big-bracted group (BB), and (4) the dwarf dogwoods (DW). Relationships within and 1 Material and methods among each group are also well resolved, and mostly strongly supported (Xiang et al., 2006; Xiang et al., in 1.1 Estimation of ancestral character states review). This provides a solid basis to track the evolu- Sampling of characters and phylogeny: Six char- tionary histories of morphological characters in the acters mostly from inflorescence architecture and fruit genus and help to understand the tempo and pattern of morphology were chosen for the analyses (Table 1), key morphological changes associated with the dog- including (1) chromosome numbers, (2) inflorescence radiation, such as changes in inflorescence and , (3) inflorescence type, (4) development of fruit architecture and morphology that must have inflorescence bud, (5) fruit type, and (6) fruit color. played important roles in the genus diversification. Five of the six characters are multistate and one is With its highest species diversity in eastern Asia binary (character 5). For all of these characters, all and eastern North America, the dogwood genus also taxa have a single state except for character 6, which exhibits several historically important biogeographic is polymorphic within one terminal taxon (represented 352 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Fig. 1. Approximate geographic distributions of Cornus major clades. BB, big-bracted dogwoods; CC, cornelian cherries; DW, dwarf dogwoods; BW, blue- or white-fruited dogwoods; BW1, C. oblonga and C. peruviana; BW2, C. alternifolia and C. controversa.

by C. racemosa Lam.), of the blue-fruited or white- ford, closely related to C. capitata Wall. Subgenus fruited clade (BW) and the outgroups. The six charac- Kraniopsis consists of ~30 species and was repre- ters represent a range of character conditions that are sented by three species, each representing one of the commonly encountered by investigators, e.g., charac- three subclades indentified by the ITS-matK species ters without polymorphism and homoplasy (character phylogeny that included all of the species (Xiang et 5), characters with missing data and homoplasy al., 2006). Species within each of the subclades are (character 1), characters with homoplasy, but no morphologically highly similar (not variable for the missing data (characters 2, 3, 4), and characters with morphological characters under study), thus missing polymorphism and homoplasy (character 6). The these species will not affect tracking the evolutionary well-resolved molecular phylogenetic trees of Cornus histories of these characters in the genus. derived from six gene regions (rbcL, matK, ndhF, 1.2 Optimization of ancestral character states atpB, ITS, and 26S rDNA; Xiang et al., in review) Among the various methods, the Bayesian ap- were used as the basis for reconstruction of ancestral proach implemented in BAYESTRAITS 1.0 (Pagel & states. These trees contain 22 Cornus species and two Meade, 2006) stands out by not only taking into outgroup taxa, Alangium and Hand. account both branch length and phylogenetic uncer- -Mazz., sister and close relative of Cornus, respec- tainty, but also allowing the user to explore a variety tively (Xiang et al., 1998, 2002; Fan & Xiang, 2003). of models for character transition, defining nodes of The 22 Cornus species represent all the subgenera interest, and a reversible-jump Markov Chain Monte recognized so far by different authors at different Carlo (Pagel & Meade, 2006). We used 1,000 trees times and all the species from each subgenus (see with branch lengths randomly selected from the tree Xiang et al., 2006), with the exception for subgen. pool after burn-in from the Bayesian analysis of the Syncarpea from the Big-bracted group (BB) and six-gene data of Cornus (Xiang et al., in review). The subgen. Kraniopsis from the BW group. The former is 50% majority rule tree of these 1000 trees is identical missing C. multinervosa, closely related to C. kousa to that from all the Bayesian trees after burn-in, Hance, and C. elliptica (Chun) Q. Y. Xiang & Bouf- suggesting that these 1000 trees are likely representative XIANG & THOMAS: Evolution & biogeography of Cornaceae 353

Table 1 Character state of six morphological characters in Cornus generations were applied. These parameters were (character 1–6) and distribution of species (character 7) recommended by the program manual. Two separate Characters 1 2 3 4 5 6 7 analyses with a rate parameter of 2 and 5, respectively C. nuttallii Audubon 0 2 2 1 0 4 3 were performed. Both analyses were commanded to C. florida L. 0 2 2 2 0 4 2 estimate the ancestral state for each character at each C. kousa Hance 0 2 2 3 1 4 1 node of the phylogeny. The nodes were specified C. disciflora Moc. & Sesse ex 0 1 2 2 0 2 34 using the command “addMRCA”. A total of DC. C. capitata Wall. 0 2 2 1 1 4 1 10,000,000 iterations were run for each analysis. C. oligophlebia Merr. 0 0 0 0 0 1 1235 The evolutionary history of each of the six char- C. hongkongensis Hemsley 0 2 2 1 1 4 1 acters was also traced over the 50% majority rule tree C. alternifolia L. f. 1 0 0 3 0 1 2 from Bayesian analysis of Xiang et al. (in review) using parsimony, maximum likelihood (ML), and C. controversa Hemsley 1 0 0 3 0 1 1 Stochastic character mapping (SCM) available in C. peruviana J. F. Macbr. 0 0 0 0 0 1 4 MESQUITE 2.01 (Maddison & Maddison, 2007). For C. walteri Wangerin 0 0 0 0 0 1 1235 ML and SCM analyses, both the “best” tree topology C. racemosa Lam. 0 0 0 0 0 01 23 and a chronogram (the “best” tree with branch length C. oblonga Wall. 0 0 0 0 0 1 1 in time) were used for a comparison. The chronogram C. chinensis Wangerin 2 1 1 2 0 2 1 was derived from the divergence time analysis using C. sessilis Torr. ex Durand 1 1 1 3 0 2 3 the program Multidivtime (Thorne et al., 1998; Ki- C. eydeana QY Xiang & YM 2 1 1 2 0 3 1 shino et al., 2001) from Xiang et al. (in review). In Shui C. mas L. 2 1 1 2 0 4 5 parsimony reconstruction, the character states were C. officinalis Seib. & Zucc. 2 1 1 2 0 4 1 treated as “unordered (i.e., allow free transformation C. volkensii Harms ? 11 2 0 2 7 of a character state to any other states). The ML C. suecica L. 0 2 0 0 0 4 1235 reconstructions were conducted using the MK1 model C. canadensis L. f. 0 2 0 0 0 4 123 of evolution (Schluter et al., 1997; Pagel, 1999). The Mk1 (Markov k-state 1 parameter model) is a k-state C. unalaschkensis Ledeb. 0 2 0 0 0 4 3 generalization of the Jukes-Cantor model, corre- Alangium 0 0 0 0 0 01234 1 or 123567 sponding to Lewis’s (2001) Mk model, which gives Diplopanax 0 0 0 0 0 124 12345 equal probability (or rate) for changes between any Character: character state, score: two character states. The rate of change can be esti- 1. Chromosome number: n=11, 0; n=10, 1; n=9, 2. mated for each individual character based on the data 2. Bracts: rudimentary (minute, early ), distal to inflorescence ESQUITE branches, 0; 4, nonpetaloid at the base of inflorescence, 1; 4 or 6, and branch length by M . The stochastic petaloid at the base of inflorescence, 2. character mapping (Nielsen, 2002; Huelsenbeck et al., 3. Inflorescence type: branched compound cymes, 0; umbel (cymose), 2003) in MESQUITE uses continuous-time Markov 1; glomerule (capitate cyme), 2. models to derive posterior distributions of ancestral 4. Developmental state of winter inflorescence bud: floral buds undeveloped, 0; floral buds developed and unprotected, 1; floral buds state. It simulates realizations of precise histories of developed and protected by inflorescence bracts, 2; floral buds character evolution in a way consistent with Mk1 developed and protected by scales of a mixed bud, 3. model and ancestral states (Maddison & Maddison, 5. Fruit type: simple, 0; compound, 1. 6. Fruit color: white, 0, blue or black, 1; then black, 2; dark purple 2007). Reconstructions of ancestral states for charac- 3; red, 4. ter 6, which has polymorphism for some taxa, were 7. Distributions: eastern Asia, 1; eastern North America, 2; western not allowed for ML and SCM on MESQUITE 2.01. North America, 3; Central and South America, 4; Europe, 5; Austra- lia, 6; Africa, 7. Therefore, only analyses with MP and BAYESTRAIT represents a subclade in areas 2 and 3, and C. were conducted for this character. oligophlebia and C. walteri represent a subclade occurring in areas of 1.3 Optimization of ancestral distributions 1, 2, 3, and 5 (Xiang et al., 2006). These areas were coded for these three species accordingly in the analysis. Diplopanax was coded as 1, The estimation of ancestral distributions was 2, 3, 4, and 5 to incorporate distributions of taxa in the clade it repre- performed using the same phylogenetic trees as for sents (nysssoids, mastixioids, Grubbia and Curtisia) in the Cornales. character ancestral state reconstructions. Three meth-

ods, the parsimony-based dispersal-vicariance method of the entire tree pool. The model of “multistates”, (DIVA) (Ronquist, 1997), the ML-based methods of MCMC mode, an exponential prior seeded from a Ree et al. (2005) and Ree & Smith (2008), and the uniform on the interval 0 to 10 (rjhp exp 0.010), a Bayesian method using a character model (Pagel et al., burn-in of 10,000 and sampling of the chain every 300 2004) were employed. Seven geographic areas were 354 Journal of Systematics and Evolution Vol. 46 No. 3 2008 defined, eastern Asia (EAS), eastern North America tree using a maximum likelihood method by the (ENA), western North America (WNA), Central and program. To reduce computation time, the following South America (CSAM), Europe (EUR), areas were disallowed at all nodes due to their wide (AUS), and Africa (AFR) to cover all the endemic disjunction, which requires prior extinction in their distributional areas of Cornus and its outgroups. The intervening areas or long distance dispersal method of DIVA finds the best biogeographic (EAS-CSAM, EAS-AFR, ENA-AUS, ENA-AFR, pathways given the tree topology and distributions of WNA-AUS, WNA-AFR, CSAM-EUR, EUR-AUS, taxa by minimizing dispersal and extinction events. In EAS-ENA-FR, EAS-WNA-FR, ENA-WNA-AUS, the analysis, each taxon was scored for presence (1) or ENA-WNA-FR, AM-EUR-AUS, EAS-ENA-WNA- absence (0) for each of these areas. The distributions FR, ENA-NA-US-AFR). To explore the impacts of of Cornus racemosa, C. oligophlebia Merr., and C. excluding these areas as a prior and influences of rates walteri Wangerin, each representing a subclade in the of extinction and dispersal, changes of physical con- BW group, were scored according to the occurrence of nection between geographic areas through time, all species in the represented subclade. To explore analyses were ran using AReA v. 2.1 (Smith, 2006; influences of maximum number of areas constrained based on Ree et al., 2005) without excluding areas as a for each node, analyses with constraints of maximum prior using a range of rates of V (dispersal) and L areas of 4 and 2, respectively, were performed for (extinction) including those estimated from LA- comparison. Furthermore, analyses using two different GRANGE as well as the following: V=L=0.005; V= models of area coding for the outgroup Alangium were 0.09, L=0.01; V=0.01, L=0.09; V=0.09, L=0.01; V= also conducted under these two different maximum 0.02, L=0.18; V=0.18, L=0.02; V=0.0047, L=0.0033. area constraints. One model coded Alangium for all The analyses with AReA and LAGRANGE were both areas of its occurrence including fossils (all of the conducted for the chronogram with and without seven areas except Central and South America). The fossils. For analysis with fossils, the branch length in other model coded Alangium for only the root area of time of the fossil lineages in the chronogram was the genus (eastern Asia) that was inferred from bio- arbitrarily estimated as the length of time between the geographic analysis of the genus based on the phy- earliest and latest appearances of the fossil taxa. If logeny using DIVA (Feng et al., in review). For the there was only one report for a fossil taxon, the branch other outgroup Diplopanax, the distribution was length in time of that fossil was arbitrarily estimated scored for occurrence of all taxa of the clade it repre- as the time range of the geological period of its occur- sents in the Cornales (nyssoids, mastixioids, Grubbia rence. For example, the fossil species of BB was P. Bergius and Curtisia (Burm.) G. A. Sm.; Fan & found in the mid Oligocene bed. The branch length in Xiang, 2003; Xiang et al., 2002; Xiang et al., 2007). time of the BB fossil lineage was arbitrarily given as Optimizations were conducted using the same tree 5.1 million years (my), approximately the length of topology as for character ancestral state reconstruc- time of the mid Oligocene. The fossil lineages were tions described above. connected to the stem lineage of the represented ML inferences of geographic range evolution crown group at the time point corresponding to the were conducted for the same distribution matrix under age of the fossils. This analysis was done mainly to the constraints of maximum areas 4 and 2, respec- see the impact of new geographic areas represented by tively, on a chronogram from the Bayesian analysis of fossils, but not occupied by extant species of a line- six gene regions, the same chronogram as the one for age, on optimization of ancestral areas of the perspec- ancestral character state reconstruction. The program tive lineages. Information of age, occurrence, and LAGRANGE (Ree & Smith, 2008) was employed to phylogenetic placements of fossils is from Xiang et al. run the analysis with a simple model of one rate of (2006). The analyses were run for 1,000,000 replicates dispersal and extinction constant over time and among as suggested in the manual. For analyses with AReA, lineages. This program not only finds the most likely the Mesophytic model for probabilities of dispersal ancestral areas at a node and the split of the areas in between geographic areas at different geological time the two descendant lineages, it also calculates the periods was implemented (Smith, 2006). The different probabilities of these most-likely areas (Ree & Smith, models have not been coded into LAGRANGE (Ree, 2008) at each node, a feature lacking in the program pers. comm.). Optimization of ancestral areas using AReA that was based on Ree et al. (2005). The values the Bayesian method in BAYESTRAITS v.1.0 was done of rate of dispersal and extinction implemented in the as described above for character mapping, but using analysis with LAGRANGE were estimated based on the “rjhp exp 0.0 30 and “ratedev 15”. XIANG & THOMAS: Evolution & biogeography of Cornaceae 355

1.4 Estimation of divergence time using BEAST probable state to be 2 for node g (pp=0.38) and 1 for Phylogenetic dating using five-gene data (rbcL, node r (pp=0.5), which implies some degree of uncer- matK, ndhF, atpB, and 26S rDNA) was conducted tainty. For character 4, the MP method did not resolve using r8s (Sanderson, 2002) and Multidivtime (Thorne the ancestral states at g and r nodes (equivocal). et al., 1998; Kishino et al., 2001) in a separate study However, the ML and Stochastic character mapping (Xiang et al., in review). These methods relied on only analyses show that the ancestral states of these two a single phylogeny. Here we estimated the divergence nodes are state 0 (ML, most likely; SCM, 1.0 pp), time using the same data set again using BEAST 1.4.3 while BAYESTRAITS shows that the ancestral state at (Drummond & Rambaut, 2006) for a comparison. this node is most probable state 2 (Fig. 2B; pp: 0.65). BEAST applies Bayesian methods and searches for the For nodes a & b of character 4, ML and SCM sug- optimal phylogeny and estimates divergence time gested alternative states (2 vs. 1), while BAYESTRAITS simultaneously. The divergence time estimated from suggested the two states are nearly equal probable, BEAST was a summary of estimations based on all the similar to MP (Fig. 2B). For nodes c and f of Charac- optimal Bayesian trees found in the analysis, thus ter 4, the ancestral states are equivocal in MP and ML taking the phylogenetic uncertainty into account. The analyses, but SCM and BAYESTRAITS resolved it as analyses with BEAST were performed using the model state 1 (Fig. 2B). In general, the optimal states at the suggested by Modeltest 3.7 (Posada & Crandall, 1998) lower nodes of the phylogeny estimated from BAYE- with an uncorrelated exponential relaxed molecular STRAITS using 1,000 trees are less certain than estima- clock model for 30 million generations. Multiple runs tions from MP, ML, and SCM based on a single, were done to determine the number of generations “best” phylogeny for all characters analyzed. required to allow all node time estimates to reach a When temporal information was taken into ac- stationary distribution. Four nodes were constrained count by using a chronogram for estimation with MP, by fossils, as done for Multidivtime (Xiang et al., ML, and SCM, the results showed a few differences 2005; Xiang et al., in review). for Characters 1, 2 and 4 under the same method. For Character 1, node m (Fig. 2A) was reconstructed as state 1 (green dots) by SCM and as equivocal by ML 2 Results based on the tree topology; whereas in the reconstruc- 2.1 Optimization of ancestral states tions based on a chronogram, this node was inferred Results from ancestral character state optimiza- as state 2 (black dots) by both SCM and ML methods tions from MP, ML, and SCM without temporal (Fig. 3). For Character 2, node r (Fig. 2A) was in- constraints are presented in Figs. 2 and 3 for all nodes. ferred to be equivocal by ML method in the topol- Results from BAYESTRAITS are provided in Table 2 as ogy-based analysis, but in the chronogram-based well as Figs. 2 and 3. These results showed that for analysis, this node changed to be most likely state 0 binary characters without homoplasy, e.g., character 5, (open circle) (Fig. 3). For character 4, the topol- the results at all nodes are identical and certain among ogy-based analysis with the ML method optimized MP, ML, SCM, and Bayesian methods (Fig. 2). For nodes a and b to be mostly likely state 2 (green dots) characters with homoplasy, there are substantial (Fig. 2), but in the chronogram-based ML analysis, differences in the relative certainty of optimal states at state 2 and state 1 (blue dots) at this node were equally some nodes (e.g., nodes m & l for character 1, node r probable (Fig. 3). In the topology-bsed SCM analysis, for character 2, nodes g & r for character 3, nodes f, g, nodes a and b were both certain to be state 1 (blue m, & r for character 4 among the four methods, as dots) (Fig. 2) while in the chronogram-based SCM well as nodes l, p, r, & s for character 6 between MP analysis, both nodes were reconstructed as state 2 and Bayesian inference). There are only a few nodes (green dots) (Fig. 3). In general, discrepancies be- among all characters showing striking difference in tween ML and SCM in topology-based analyses the optimal states inferred from different methods. disappeared in the chronogram-based analyses (com- These are nodes g and r for character 3 (inflorescence pare node m for character 1 and 4 and node r for type) and nodes a, b, c, f, g, r for character 4 (winter character 2 in Figs. 2 & 3). condition of inflorescence bud) (Fig. 2B). For charac- 2.2 Ancestral distributions ter 3, nodes g and r were convincingly resolved as 2.2.1 Between coding models for Alangium The state 0 by MP, ML, and SCM, but BAYESTRAITS results from analyses using AReA and DIVA with resulted in extremely low posterior probabilities for Alangium coded for its root (indicated by a “*” in state 0 at these two nodes and suggested the most Table 3) or for all areas of its occurrence (column 356 Journal of Systematics and Evolution Vol. 46 No. 3 2008

XIANG & THOMAS: Evolution & biogeography of Cornaceae 357

358 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Fig. 2. Evolutionary histories of six key morphological characters on inflorescence and fruit morphology reconstructed using four different methods (MP, maximum parsimony; ML, maximum likelihood; SCM, stochastic character mapping in MESQUITE 2.01, and BAYESTRAITS 1.0). Results for the same character from different methods are presented in the same column. Posterior probabilities of <100% from BAYESTRAITS are presented as numbers. Character state and posterior probability at each node is presented in Table 2. Characters are referred to in Table 1. Species from left to right on each tree: , C. florida, C. disciflora, C. capitata, C. hongkongensis, C. kousa, C. suecica, C. canadensis, C. unalaschkensis, C. sessilis, C. volkensii, C. eydeana, C. chinensis, C. mas, C. officinalis, C. oblonga, C. peruviana, C. racemosa, C. walteri, C. oligophlebia, C. alterni- folia, C. controversa, Alangium, and Diplopanax. A, For characters 1 & 2. B, For characters 3 & 4. C, For characters 5 & 6.

XIANG & THOMAS: Evolution & biogeography of Cornaceae 359

Fig. 3. Evolutionary histories of characters inferred from the chronogram of the “best” tree. Only those different from inferences based on the “best” tree topology are presented. 360 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Table 2 Results of reconstruction of ancestral states for six characters (explained in Table 1) from BAYESTRAITS 1.0 based on 1,000 Bayesian trees Character 1 2 3 4 5 6 node a 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 0.50+0.27 (1) 1.00+0.00 (0) 1.00+0.00 (4) 0.47+0.27 (2) b 1.00+0.00 (0) 0.97+0.10 (2) 1.00+0.00 (2) 0.52+0.35 (1) 0.99+0.02 (0) 1.00+0.00 (4) 0.48+0.35 (2) c 1.00+0.00 (0) 1.00+0.00 (2) 1.00+0.00 (0) 0.96+0.09 (1) 1.00+0.00 (1) 1.00+0.00 (4) d 1.00+0.00 (0) 1.00+0.00 (2) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (4) e 1.00+0.00 (0) 1.00+0.00 (2) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (4) f 1.00+0.00 (0) 1.00+0.00 (2) 1.00 +0.00 (2) 0.42+0.33 (2) 0.98+0.05 (0) 1.00+0.00 (4) 0.57+0.34 (1) g 0.96+0.04 (0) .094+0.07 (2) 0.38+0.19 (2) 0.22+0.48 (1) 0.96+0.04 (0) 0.95+0.05 (4) 0.48+0.19 (0) h 0.99+0.01 (0) 0.99+0.01 (0) 0.99+0.01 (0) 0.99+0.01 (0) 0.99+0.01 (0) 0.99+0.01 (1) i 0.99+0.02 (0) 1.00+0.00 (0) 1.00+0.00 (0) 0.94+0.12 (0) 1.00+0.00 (0) 0.99+0.01 (1) j 1.00+0.00 (2) 1.00+0.00 (1) 1.00+0.00 (1) 1.00+0.00 (2) 1.00+0.00 (0) 0.99+0.01 (4) k 1.00+0.00 (2) 1.00+0.00 (1) 1.00+0.00 (1) 1.00+0.00 (2) 1.00+0.00 (0) 0.50+ 0.01 (2) 0.35+0.29 (4) 0.13+0.18 (3) l 0.70+0.26 (2) 1.00+0.00 (1) 1.00+0.00 (1) 0.99+0.03 (2) 1.00+0.00 (0) 0.87+0.20 (2) m 0.64+0.30 (2) 1.00+0.00 (1) 1.00+0.00 (1) 0.94+0.10 (2) 1.00+0.00 (0) 0.95+0.14 (2) n 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 0.99+0.01 (1) o 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (0) 0.99+0.01 (1) p 0.83+0.13 (0) 1.00+0.00 (0) 1.00+0.00 (0) 0.59+0.28 (0) 1.00+0.00 (0) 1.00+0.00 (1) 0.33+0.28 (3) q 0.99+0.01 (1) 1.00+0.00 (0) 1.00+0.00 (0) 1.00+0.00 (3) 1.00+0.00 (0) 0.99+0.01 (1) r 0.65+0.21 (0) 0.94+0.07 (2) 0.38+0.19 (2) 0.48+0.19 (0) 0.96+0.05 (0) 0.95+0.05 (4) 0.22+0.18 (1) s 0.98+0.05 (0) 0.73+0.12 (0) 0.93+0.10 (0) 0.81+0.21 (0) 1.00+0.00 (0) 0.53+0.30 (1) 0.26+0.25 (2) First number in each cell is the posterior probability and the second number is the standard deviation. Numbers in parenthesis indicate the character state (explained in Table 1). Nodes a–s are marked on the phylogeny in Figs. 2 and 3.

without indicated by a star in Table 3) showed little different constraints (Table 3). These results were also difference for possible optimal ranges at each node, quite consistent, especially at the four lowest nodes, although there were more alternative solutions at some with the analyses from AReA without restriction on nodes from one of the coding models in DIVA and maximum areas if fossils were included. AReA (Table 3). There was also little difference 2.2.3 With and without fossils The reconstruction between optimizations with and without the outgroups of ancestral areas at some nodes were strikingly in a rooted tree. different between including fossils and excluding 2.2.2 Constraints of maximum areas at a node fossils (Table 3). With fossils, the reconstructions at For DIVA, optimal solutions at each node was usually deepest nodes (nodes 30, 41, 28, 42) favored Europe. more numerous when maximum areas were con- This area was not included in any of the solutions at strained as four and contained the solutions from these nodes from analyses without fossils except at maximum area constrained as 2 (Table 3). An excep- node 42 from the ML analysis with LAGRANGE with a tion was found in the analysis without fossils and constraint of maximum area of 4. coded Alangium for its root area only. In this analysis, 2.2.4 Among DIVA, ML, and Bayesian methods solutions at most of the eight compared nodes were When all conditions were identical in the analyses, either identical between the two constraints or more results from DIVA and ML showed differences at inclusive from analyses with constraint of maximum most nodes, and most of the time, DIVA suggested areas equal to 2. For ML analyses from LAGRANGE, more alternatives while ML was able to narrow down the most probable areas with the highest likelihood at to one or two most probable areas with the highest each node were highly consistent among the two likelihood (Table 3). ML and DIVA remarkably XIANG & THOMAS: Evolution & biogeography of Cornaceae 361

362 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Fig. 4. Comparison of divergence times estimated from BEAST with those from Multidivtime and r8s in Xiang et al. (in review). Node numbers correspond to those marked in Fig. 5 (22–42). Estimations from BEAST at nodes 22, 29, 31, 40, and 41 (shaded in green) are significantly younger than estimates from Multidivtime and r8s.

differed in the estimations at nodes 30, 40, and 41 in in review) (Fig. 4). Exceptions were found for five analyses without fossils and with maximum areas nodes (shaded in green in Fig. 5). The estimations a constrained as 2 (Table 3, comparing ML in Column from BEAST were much younger for nodes 22, 31, and a G and DIVA in Column H) and differed at the nodes 40, 41, but older for node 29 from BEAST. Divergence 34, 27, 40, 41, 42 in analyses without fossils and with time estimations indicate that Cornus split from its maximum areas constrained as 4 (Table 3, comparing sister Alangium in the and early MLa in Column L and DIVAa in Column M). The radiation of the genus into several clades occurred also differences between ML and BayesTraits were much during the late Cretaceous and the earliest Tertiary less, however, only at nodes 37 and 40 (Table 3, (Fig. 5). b b comparing ML in column J and BAYESTRAITS in column K). In analyses with fossils, ML and DIVA differed at all of the eight nodes (Table 3, comparing 3 Discussion MLa in column A and DIVAa in column B). At nodes 30 and 28, DIVA favored western North Amer- 3.1 Does choice of methods matter? ica (area 3) while ML favored Europe (area 5). At The comparison of ancestral state reconstruction node 40, DIVA consistently optimized a disjunct for six characters shows that the impact of method ancestral distribution in eastern Asia and Central and choice depends on the nature of characters. For char- South America (areas 14), but ML suggested distribu- acters with no homoplasy, no polymorphism, and no tions involving eastern Asia and eastern North Amer- missing data (e.g., character 5), the reconstruction of ica (areas 12) and a number of alternatives all with the ancestral state was consistent among all methods low posterior probabilities (Table 3, column A). In compared and between topology-based and chrono- general, reconstruction of ancestral distributions at the gram-based analyses (see results & Fig. 2C). This lower nodes of the Cornus phylogeny using different suggests that for perfect and clean data meeting these methods showed substantial differences and uncer- conditions, choice of methods does not affect the tainty. Probabilities of the most likely ancestral area inference of evolutionary trend, although analysis estimated from ML methods at each node were mostly using a chronogram is better in that it provides tem- below 50% (Table 3). poral information on the events of character state 2.3 Divergence time transitions (see Fig. 3). However, for characters with Estimations of divergence time using BEAST homoplasy, but no missing data and no polymorphism were congruent at most nodes with those from previ- (characters 2, 3, 4), character state reconstructed at ous studies using Multidivtime and r8s (Xiang et al., nodes including the taxa carrying the homoplasy (e.g., XIANG & THOMAS: Evolution & biogeography of Cornaceae 363

Fig. 5. Chronogram of Cornus from showing major variation and evolutionary trends of inflorescence and fruit characters, as well as ancestral areas estimated from the maximum likelihood method. The chronogram was derived from Bayesian inference of the phylogeny of the six gene regions rbcL,matK, ndhF, atpB, ITS, and 26S rDNA and divergence time estimation based on rbcL,matK, ndhF, atpB, and 26S rDNA using Multidivtime in Xiang et al. (in review). This topology and chronogram was used for analyses of character evolution and biogeographic history. BB, big-bracted dogwoods; BW, blue- or white-fruited dogwoods; CC, cornelian cherries; DW, dwarf dogwoods. All nodes are strongly supported with 1.00 posterior probability and bootstrap values >90%, except a few terminal nodes, 32, 39, 22, 31 and one deep node, 28. Nodes 42, 34, 30, and 28 were constrained by fossils in all divergence time analyses. Numbers connected by a dash at each node indicate geographic area splitting in the two descendant lineages; the number before the dash represents areas inherited down to the upper daughter lineage while the number after the dash represents areas inherited by the lower daughter. Alternative area combinations at a node are separated by forward slash. Areas in italics have lower likelihood and lower probability (see Table 3). Areas in blue at eight deep nodes are inferred from ML analyses without fossils by LAGRANGE and areas in red are inferred from ML analyses including fossils using LAGRANGE and AReA. 1=eastern Asia, 2=eastern North America, 3=western North America, 4=Central and South America, 5=Europe, 6=Australia, and 7=Africa. The mean divergence time estimation from Multidivtime for each node is indicated by number in parentheses in million years ago (mya). The horizontal axis shows the time scale in million years ago. Nodes numbered from 22–42 correspond to those in Fig. 4. Nodes with divergence times estimated from BEAST that are significantly different from estimations from Multidivtime and r8s (nodes 22, 29, 31, 40, and 41) are shaded in green.

nodes g and r for characters 2, 3, nodes c, f, g, m, and lineage with missing data was strikingly different r for character 4) is method-dependent (Figs. 1, 2). In from that based on the topology only (Figs. 2A, 3), such cases, the certainty on the reconstructed state suggesting that reconstruction of the ancestral state increases from parsimony, to ML, and then to SCM with missing data is sensitive to branch length (or time methods, while the reconstructions by BAYESTRAITS available for evolution). Finally, for characters with often suggested a different character state as being polymorphism and homoplasy, but no missing data, most probable. Furthermore, analyses based on the the inference from parsimony and BAYESTRAITS was best tree topology only or chronogram can result in quite congruent (character 6; Figs. 2C, 3). Comparis- different reconstructions at some of these nodes (see ons of results from ML and SCM for this type of Figs. 2 & 3 for characters 2, 3, 4). For characters with character cannot be made because the ML and SCM in missing data and homoplasy, but no polymorphism MESQUITE cannot perform estimation for data with (character 1), reconstruction of the ancestral state polymorphism within a taxon at the present. Devel- based on a chronogram at the node including the opment of algorithms that can cope with this type of 364 Journal of Systematics and Evolution Vol. 46 No. 3 2008 character is desirable since polymorphism is not interesting. User-defined step matrix for cost of uncommon. Nonetheless, these results indicate that change for multistate characters can only be applied to the incongruence among analyses mostly involves parsimony analysis. An experiment with the ML homoplasious characters, implying difficulty and analysis using the Asymmetrical 2-parameter uncertainty in reconstructing the ancestral state of Markov-k model for the only binary character in this characters with homoplasy. This finding, although study (character 5) with a cost of 0.5 for state 0 (clus- may not be surprising, is particularly important to bear tered ) to state 1 (fused fruits) and cost of 1 for in mind in studies of comparative biology, e.g., when state 1 to state 0 did not change the result. Experi- attempting to demonstrate adaptive evolution of ments with MP analyses using step matrices for species or genes with repeated gain or loss of a certain multistate characters also showed little difference. ecological trait or gene function in different species by Only at node m (common ancestor of CC) of character reconstructing the ancestral state on a phylogeny. It is 1, the reconstruction changed from equivocal to state recommended that when reconstructing the ancestral 1 (x=10), a state inconsistent with the inferences from state for homoplasious characters, sensitivity to BAYESTRAITS, ML, and SCM (Figs. 2A, 3). methods and models of character evolution should be Compared to the ML and SCM methods in explored. For example, one can use BAYESTRAITS to MESQUITE, the full Bayesian influences approach explore the influence of different rates of character implemented in BAYESTRAITS estimates the parame- transition and prior distributions on the reconstruction. ters of a model of discrete trait evolution and infer Careful analyses can help to avoid over confident or ancestral state by combining information from uncer- incorrect conclusions on important biological prob- tainty of phylogeny and uncertainty in the estimate of lems derived by simple estimations using maximum ancestral state (Pagel et al., 2004). It uses the con- parsimony on a single tree topology. tinuous-time Markov model (allowing more than one The results from this study further indicate that, change along a branch) to estimate the maximum for the nodes with phylogenetic uncertainty (e.g., likelihood values and their posterior probability relatively lower bootstrap support), the ML and SCM distribution of rate parameters based on the phylogeny analyses with the simple Mk1 model and a single best and observations on the value of the traits in each tree (no matter whether including or excluding tem- species. The method uses the most recent common poral information) can result in confident reconstruc- ancestor approach to overcome the limitation posed tions that are not favored by BAYESTRAITS (e.g., node from variation in tree topologies (e.g., not all of the r for characters 2, 3, 4; corresponding to node 28 in trees necessarily contain the internal node or nodes of Fig. 5). Furthermore, the reconstructions with ML and interest) (Pagel et al., 2004). In addition, the method SCM at nodes with strong support (e.g., node g, for takes into account branch length (or temporal infor- characters 3, 4; Figs. 2B, 3) can also be overconfident mation) in the estimation of ancestral state. These compared to BAYESTRAITS when some level of features probably make BAYESTRAITS a better choice uncertainty exists at other nodes of the phylogeny. for studying homoplasious character evolution. The The discrepancy between ML, SCM, and BAYE- program is quite user-friendly and allows authors to STRAITS found in this study reflects a combination of explore various parameters for prior distributions and uncertainties in both ancestral state and phylogenetic rate of character transitions. reconstructions. If the Mk1 model implemented in the 3.2 Character evolution in Cornus ML and SCM analyses is not the correct model for the The following discussion on character evolution characters under investigation, the use of this model in in Cornus is based on results from BAYESTRAITS and the ML and SCM analyses could have been responsi- chronogram-based ML and SCM analyses. ble for the incongruence. Unfortunately, there are few 3.2.1 Chromosome numbers In Cornus, most alternative models available for comparison in MES- species have x=11, three species have x=10 and three QUITE 2.01. At present, the Asymmetrical 2-parameter have x=9 (see Table 1). In species with x=9 there are Markov-K model is the only alternative model in two large metacentric chromosomes while in species MESQUITE under the ML and SCM methods. Yet this with x=10 one pair of the large metacentric chro- model is limited to binary data for defining differential mosomes are replaced by two telocentric pairs and in rates for forward and backward changes. There is no species with x=11, the two pairs of large metacentric option in MESQUITE for applying differential rates of chromosomes are replaced by four pairs of telocentric changes between states of multistate morphological chromosomes (Dermen, 1932). Based on this evi- characters, which are probably most common and dence, Dermen (1932) proposed chromosomal XIANG & THOMAS: Evolution & biogeography of Cornaceae 365

Fig. 6. Biogeographic histories of the four intercontinental disjunct lineages of Cornus favored by the ML analysis including fruit fossils. BB, big-bracted dogwoods; CC, cornelian cherries. Numerical numbers represent geographic areas corresponding to those defined in Table 1, Table 3, and Fig. 5.

Robinsonian fission as the mechanism for chromo- these model-based methods is more dynamic. These some number evolution in Cornus. This scenario was methods inferred a change from x=11 to x=9 (rather not supported by the chloroplast DNA phylogeny than 10 as in parsimony) in the common ancestor of (Xiang et al., 1996). Based on the parsimony principle CC (node m) and x=9 to x=10 in C. sessilis Torr. ex and the chloroplast DNA phylogeny, Xiang et al. Durand (Fig. 3). This implies two chromosomal (1996) and Xiang & Eyde (1995) suggested that the fusions (at node m) and one fission (in C. sessilis) in chromosome number in Cornus evolved from x=11 to the CC lineage and one chromosomal fusion in the 10 and 9; with two independent reductions to x=10, pagoda dogwoods (node q) occurred during the evolu- once in the common ancestor of the CC lineage (node tion of Cornus. Furthermore, divergence time dating m) and once in the common ancestor of pagoda and the reconstructions from ML, SCM and BAYE- dogwoods (node q), and a further reduction from x=10 STRAITS suggest that the fusion events in the cornelian to x=9 later within the CC lineage. This implies three cherries occurred back in the Eocene, while the fusion independent chromosomal fusion events. The hy- event in the pagoda dogwoods occurred not earlier pothesis partially agrees with the reconstruction of than the Miocene (Figs. 3, 4). ancestral state in the present study. The ML, SCM, Chromosome fission has been considered a major and BAYESTRAITS all suggested x=11 being ancestral, mechanism leading to speciation and increases chro- but the evolutionary pathway reconstructed using mosome numbers in closely related species that are 366 Journal of Systematics and Evolution Vol. 46 No. 3 2008 derived rapidly (see Godfrey & Masters, 2000; Kol- following . The blue- or white-fruited lineage nicki, 2000), while chromosome fusion is considered (BW) produce large and branched compound cymes the major mechanism causing a decrease in chromo- with one rudimentary, early deciduous (before anthe- some numbers (Murnane, 2006). Both, especially sis) and minute at the branching point of the fusion, have been reported from and animals, lower part of the inflorescence (see Fig. 5). The e.g., fission in flies (Rousselet et al., 2000) and beans buds are not preformed in this group, except probably (Schubert & Rieger, 1990), and fusion in (Ijdo in the pagoda dogwoods (C. alternifolia L.f. and C. et al., 1991 ), moths (Traut & Clarke, 1997), Droso- controversa Hemsley). In these two species, the phila (Yu et al., 1999), and Arabidopsis (Heacock et inflorescence is preformed in the fall and covered by al., 2004). Loss of telomeres, occurring spontaneously several alternatively arranged and imbricate scales that or via exogenous DNA damage, (Murnane, 2006) has also protect the buds. In the fourth lineage, the been considered the major cause for genome instabil- dwarf dogwood group (DW), the inflorescence archi- ity which provide “sticky ends” for chromosome tecture is an intermediate between the BB and BW fusion (Murnane, 2006). In Cornus, it is possible that (see Fig. 5). These species have a highly reduced spontaneous loss of telomeres in acrocentric chromo- branched compound cyme that is not preformed, but somes had led to chromosome fusion and resulted in a subtended by four large petaloid bracts. decrease of chromosome numbers in three lineages at The ancestral form of inflorescence architecture different geological times. in Cornus was inferred to be branched compound 3.2.2 Inflorescence architecture and morphology cymes with rudimentary bracts, like those found in the The different major lineages of Cornus exhibit strik- BW lineage (Fig. 5). Two alternative pathways of ingly different inflorescence architectures (Fig. 5). All subsequent evolution of were sug- species of the big-bracted (BB) lineage bear cymose gested, one revealed by BAYESTRAITS and the other heads (glomerule) subtended by four petaloid bracts by the ML and SCM analyses. The two pathways except in two species. The pacific flowering dogwood mainly differ in the reconstruction of character state at C. nuttallii Audubon usually has six bracts and the the two deep nodes (g and r in Figs. 2, 3, correspond- Mexican flowering dogwood C. disciflora Moc. & ing to nodes 28 and 30 in Fig. 5). The BAYESTRAITS Sesse ex DC. has four, small, broadly ovate, scale-like reconstructions at these two nodes are illustrated with bracts that are deciduous prior to anthesis and before images at the right side of the nodes in Fig. 5, while they expand to become petaloid. The conditions of those by ML and SCM are illustrated with images at inflorescence buds among species of the BB lineage the left of the nodes in Fig. 5. The Bayesian pathway also vary. In some species the bud is preformed in the implies a series of dramatic changes of inflorescence summer or fall and are covered completely by the architecture during the early diversification of the scale-like bracts (e.g., in C. florida L. and C. dis- genus in the late Cretaceous or early Tertiary. Sup- ciflora). These bracts will expand and develop into the pression of inflorescence branching, preformation of large petaloid structure during the next growing inflorescence bud, development of four base bracts season in C. florida, but will fall off before their into broadly ovate scales, and enlargement and peta- expansion in C. disciflora. In two species (e.g., in C. loidy of the scales all occurred together during the kousa and C. multinervosa (Pojark.) Q. Y. Xiang), the intial divergence of the genus, leading to umbels inflorescence bud is covered by the expanded leaf bud subtended by four large petaloid bracts for the com- scales beneath the inflorescence bud (thus the whole mon ancestor of BB , CC, and DW (image at right of bud is mix bud; Xiang, 1987). In a few species (e.g., node 28 in Fig. 5; node r in Figs. 2, 3). During the C. nuttallii, C. hongkongensis Hemsley, C. capitata, divergence between CC and BB+DW lineages, bract & C. elliptica), the inflorescence bud is not protected. expansion and petaloidy were lost in the CC stem The young bracts are non-scaly and only partially lineage (node 27 in Fig. 5; node m in Figs. 2, 3), while cover the bud. In the cornelian cherry group (CC), umbels were replaced by a head via suppression of species bear cymose umbels subtended by four development along with a reversal to a broadly ovate, scale-like bracts (like those in C. non-preformed inflorescence bud in the BB+DW stem disciflora) (see Fig. 5), which are persistent through lineage, resulting in glomerules subtended by four the anthesis and young fruit stage. The inflorescence large petaloid bracts in the common ancestor of BB buds are also preformed in the fall and covered by the and DW (image at right of node 30 in Fig. 5; node f in scale-like bracts, but the bracts never expand to Figs. 2, 3). When DW diverged from BB, a reversal to become petaloid during the growing season of the the branched compound cyme occurred in the DW XIANG & THOMAS: Evolution & biogeography of Cornaceae 367 stem lineage through slight elongation of inflores- form (compound cymes) (images at left of nodes 42, cence branches, leading to the mini-form of branched 28, 30 in Fig. 5). Major changes of inflorescence compound cymes with 4, large petaloid bracts ob- architecture occurred more recently, after the diversi- served in the modern dwarf dogwood species (C. fication into the four major clades. The suppression of canadensis L. f., C. suecica L., and C. unalaschkensis inflorescence branching occurred in CC and BB Ledeb.) (image on the branch of DW in Fig. 5). In the lineages separately. The development of 4-fold bracts BB group, later losses of bract enlargement and and their petaloidy occurred before the condensation petaloidy occurred in C. disciflora in the mid Tertiary, of the inflorescence branches in the BB lineage. This associated with a southward migration in North pathway implies that the umbels with four scale-like America into moister mountains of Mexico and bracts in the cornelian cherries (CC) are an apomor- further to Central America. phy and evolved from the ancestral inflorescence type This scenario, although favored (has the highest (compound cymes), while the minicompound cymes posterior probability) by BAYESTRAITS among the with four showy bracts in the dwarf dogwoods alternatives, has much uncertainty as seen from the (DW) represent a plesiomorphy and a retention of the variable posterior probability values at the two deep ancestral inflorescence (Fig. 5), in contrast to the nodes (all pp < 0.95) (see characters 2, 3, 4 in Figs. 2, inferences from BAYESTRAITS. It is difficult to dis- 3). The uncertainty does not seem too closely corre- tinguish the two hypotheses without additional evi- lated with the phylogenetic uncertainty. First, the dence, e.g., developmental and genetic data, to deter- posterior probabilities at nodes with uncertain recon- mine which pathway is more likely. Given that the structions of ancestral states are not similar to the ML and SCM analysis used the “best” phylogeny posterior probabilities in the phylogenetic trees (Figs. based on six gene regions (ITS, 26S rDNA, matK, 2, 3, 5). Nodes with high pp values in the phylogenetic rbcL, atpB, and ndhF) (Xiang et al., in review), which tree (e.g., nodes f and g) can have low pp values for is also supported by four other low copy nuclear genes the favored state or two alternative favored states of (PI & AP3 genes: Zhang, 2006; Zhang et al., 2008. the inflorescence characters (Figs. 2, 3). Thus, the antR-Cor: Fan et al., 2004, 2007. waxy: Xiang, unpub- uncertainty must come from a combination of phy- lished), this “best” phylogeny may well represent the logenetic uncertainty and ambiguity in ancestral state species phylogeny, adding some support to the ML reconstruction. However, it is noteworthy that this and SCM pathway. In other words, the ancestor of scenario suggested by BAYESTRAITS on inflorescence BB-DW-CC was more likely bearing branched com- evolution is novel. An ancestral inflorescence of pound cymes with early deciduous rudimentary bracts umbels subtended by petaloid bracts has never been and the inflorescence was not preformed, and the proposed and is not found in extant species or fossils. ancestor of BB-DW was more likely to bear a con- If it is true, an umbel subtended by petaloid bracts densed compound cyme with four petaloid bracts, like must have become extinct (due to unknown disadvan- the DW species (Fig. 5). Nonetheless, the ancestral tages). The early Tertiary cornelian cherry fossils (all state of inflorescence architecture at the two deep represented by fruits; see Eyde, 1988; Crane et al., nodes above the root of the genus remains uncertain to 1990; Xiang et al., 2003, 2005) might have been the some extent. Histological studies on the morphogene- lineage bearing this type of inflorescence that has sis of different inflorescence types will be helpful to become extinct. It would be interesting to find out if evaluate which ancestral state at these nodes may be there are any fossils with such morphology from the more likely from a developmental perspective. Cou- sites where the fossil fruit stones were discovered. If pling with a molecular genetic approach, such studies this evolutionary pathway is true, the umbel with four would further unravel the molecular basis underlying scale-like bracts in the modern cornelian cherries (CC) the evolution of inflorescence architectures, a project represents a retention of the ancestral inflorescence now is being on-going in Xiang’s lab. from the common ancestor of CC and BB+DW, while 3.3 Evolution of fruits and potential ecological the compound cyme in the dwarf dogwoods (DW) causes represents a reversal and the four showy bracts in the Fruits of the dogwoods are drupaceous develop- DW species are plesiomorphic. ing from a 2-locular inferior , with a fleshy layer In the alternative pathway suggested by of pulp and a stony endocarp. In most species, the ML-SCM (“best”-phylogeny based) the early ances- fruits are simple, but in the Asian species of the tors of the four major lineages of Cornus (nodes g, r, s big-bracted (BB) lineage, fruits are compound (fused, in Figs. 2, 3) all retained the ancestral inflorescence appearing like a strawberry with stones) (see Fig. 5). 368 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Character ancestral state reconstruction and diver- the occurrence of other colored fruits at lower fre- gence time dating indicate that this dramatic change in quency. A partial list includes effects on avian forag- fruit structure of Cornus, fusion of clustered fruits, ing and against natural enemies, physiological adapta- occurred in the mid or late Miocene, associated with tions, results of selection acting on correlated charac- an isolation of the BB lineage in eastern Asia, and was ters, constraint by metabolic costs, competition for proposed to be a result of selections by monkeys (Fig. dispersal agents, as well as constraint by phylogenetic 5; Eyde, 1985). history (Willson & Whelan, 1990). However, as The fleshy fruits exhibit various colors at the indicated by the authors, there appears to be no known mature stage among species, including blue, black, data to strongly support any of these hypotheses. white, red, red-black and dark purple (Fig. 5). The In Cornus, given that the differentiation in fruit blue and black fruits are found in most species of the color seems to correlate with phylogenetic lineages BW lineage and a few species of the group have white and with changes in inflorescence architecture and fruits. Red fruits characterize the BB and DW line- morphology, it is possible that phylogenetic constraint ages, except for C. disciflora which has red fruits coupled with selection acting on the inflorescences maturing into black color. The fruit colors in the could have contributed to the fruit color divergence cornelian cherries (CC lineage) vary among species among the major lineages of Cornus. However, other from red, red-black, to dark purple. Ancestral state factors, such as avian foraging, defense against reconstructions using parsimony and BayesTraits pathogens, and differences in taste or nutrients might (analyses with ML and SCM methods in MESQUITE have also played important roles in the fruit color were not permitted due to polymorphism) suggested evolution in the genus as discussed below. that the fruit color at the root of Cornus is more likely The fruits of dogwoods are eaten by mostly , to be blue or black, and several evolutionary changes but also by rodents and other mammals (Eyde, 1988). in fruit color occurred during the genus radiation in There is no clear evidence for color preference (e.g., different geological time periods. The early changes white fruits of C. alba Linn. Mant. and black fruits of appeared to be accompanied with the major alterations C. sanguinea L. are both eaten by European redstarts), in inflorescence architecture (Fig. 2C, character 6; Fig. but there seems to be a preference in fruit size and 5). A red color stage in fruits evolved from the blue display positions (on branches vs. on ground). For and black fruits in the early Tertiary (node r; Fig. example, small birds do not swallow and disperse the 2C). Red fruits evolved in lineages with petaloid relatively large cornelian cherry fruits (CC) and some bracts (node g) in the early Tertiary and also in the species have fruits persistent on the branch (e.g., C. ancestor of C. mas L. and C. officinalis Seib. & Zucc. sanguinea with black fruit) while other species have of the cornelian cherry group (CC) in the mid to late fruits fallen to the ground (e.g., C. florida with red Tertiary (node j in Figs. 2C, 5). Red maturing into fruit) (Baird, 1980; Eyde, 1988). In the later case, black fruits also evolved twice, once in C. disciflora in color may make a difference. Red color is easier for the mid Tertiary and once in ancestor of CC lineage birds to find on the ground than blue or black fruits. (node m) or in the ancestor of CC+BB+DW clade The red fruits of the rhizomatous little dwarf dog- (node r) in the very early Tertiary (Fig. 2C). Further- woods (DW) are dispersed mostly by ground-feeding more, dark purple fruit evolved in C. eydeana QY birds and mammals (see Eyde, 1988). The fused red Xiang & YM Shui from red-black fruits in the CC fruits in the Asian big-bracted dogwoods (like the lineage, and white fruits evolved from blue-black fruits in C. kousa) are favored by monkeys that take fruits in the BW clade (Fig. 2C). the sweet pulp and discard the , and were con- Evolution of fruit color in fleshy fruited plants is sidered as the mechanism for the evolution of the an old topic in evolutionary ecology, but our under- fused large fruits (Eyde, 1985). Furthermore, phyto- standing of the underlying ecological causes has been chemistry studies showed that the different colors in very limited (Willson et al., 1989; Willson & Whelan, fruits are a result of production of different kinds of 1990). Most fleshy fruits consumed by vertebrates in anthocyanins (Vareed et al., 2006; Bjorøy et al., 2007) the temperate zones are red and/or black which are and their relative quantities. These studies reported also common in tropical and subtropical fruits (Will- that fresh fruits of C. alternifolia and C. controversa son et al., 1989; Willson & Whelan, 1990). Willson (BW lineages, black fruits) contain mostly delphinidin and Whelan (1990) invoked a wide array of ecological 3-O-glucoside and delphinidin 3-O-rutinoside, as well and evolutionary causes to explain the globally most as a very small amount of cyanidin 3-O-glucoside frequent occurrence of red and black fruits along with (Vareed et al., 2006). Fresh fruits of C. florida and C. XIANG & THOMAS: Evolution & biogeography of Cornaceae 369 kousa (BB lineage, red fruits) contain mostly cyanidin indicate uncertainty in ancestral area reconstructions 3-O-galactoside and cyanidin 3-O-glucoside, with a at lower phylogenetic nodes (Table 3). Inclusion and negligible amount of delphinidin 3-O-glucoside, while exclusion of fossils have a striking effect on the major anthocyanins in C. mas and C. officinalis uncertainty (see Table 3 and Fig. 5). Other factors, fruits (CC lineage, red fruits) are delphinidin 3-O- such as coding models for outgroup distributions, galactoside, cyanidin 3-O-galactoside and pelargo- maximum area constraints, and including or excluding nidin 3-O-galactoside. In fruits (BW, outgroups in the rooted tree influenced the ancestral white), five anthocyanins were detected (delphinidin area estimation to a lesser extent (Table 3). The 3-O-β-galactopyranoside-3′,5′-di-O-β-glucopyranoside, differences among the analyses clearly suggest chal- delphinidin 3-O-β-galactopyranoside-3′-O-β-glucopy- lenge and limitation in estimating lineage bio- ranoside cyanidin 3-O-β-galactopyranoside-3′-O-β- geographic histories. At one hand, adding fossils in glucopyranoside, the 3-O-β-galactopyranosides of the analyses could have biased the results if the fossils delphinidin and cyanidin), with delphinidin 3-O-β- were not correctly placed on the phylogeny and the galactopyranoside-3′,5′-di-O-β-glucopyranoside being branch length leading to the fossil taxa were errone- the most abundant (Bjorøy et al., 2007). The relative ously constrained in the ML analyses. Thus caution amounts of the major anthocyanins within each spe- must be taken to evaluate the fossil data (e.g., taxo- cies also vary. Therefore, the alternations in fruit color nomic and age identifications) and to determine the of Cornus must be related to changes in regulation of phylogenetic affinities of the fossil taxa to lessen this the anthocyanin pathway. These data also indicate that problem. On the other hand, one might take an ex- the red fruits in cornelian cherries and big-bracted treme approach by excluding fossils in the analysis to dogwoods do not contain the same kinds of antho- avoid the problems associated with fossils. The short- cyanins, suggesting the origins of red fruits in the two coming of this approach is obvious. The bio- lineages were mostly likely via different developmen- geographic history inferred with incomplete informa- tal pathways. tion (e.g., without considering the fossils) could also Although functions of anthocyanins in plants are be heavily biased. A cautious and feasible approach still debated (Willson & Whelan, 1990; Gould & would be to compare results from analyses including Lister, 2006), masking of chlorophylls by antho- and excluding fossils. cyanins was found to reduce risk of photo-oxidative Results from this study and those from Xiang et damage to leaf cells in some plants as they senesce, al. (2006) have also shown that DIVA is sensitive to including the white-fruited red osier dogwood (Cor- approaches of geographic area coding for terminal nus sericea L.) and black-fruited Pagoda dogwood taxa of the ingroup. Coding only the ancestral area of () (Feild et al., 2001; Hoch et al., a terminal taxon (recommended by DIVA) vs. coding 2003; Lee et al., 2003). There also have been reports distributional areas of all of the constitutent species of on anti-cancer, anti-inflammatory and antioxidant the terminal taxon could lead to different conclusions. effects of Cornus anthocyanins (Seeram et al., 2002; For example, when the fossil lineage of cornelian Vareed et al., 2006) as well as their uses in treatment cherries was coded for occurrence only in Europe (the of diabetes mellitus-related disorders (Jayaprakasam ancestral area of the lineage estimated from Xiang et et al., 2006; Nair et al., 2006). The pigments del- al., 2005), the root area of Cornus was optimized as phinidin 3-O-β-galactopyranoside-3′,5′-di-O-β-glu- Europe under the maximum area constraint of 2 using copyranoside and delphinidin 3-O-β-galactopyrano- DIVA (Xiang et al., 2006). However, in the present side-3′-O-β-glucopyranoside rich in the black fruits of study, the fossil lineage of cornelian cherries was C. controversa and C. alternifolia were confirmed to coded for all distribution areas of its occurrrence show growth inhibitory activity towards various (Europe and western North America), a case probably human cancer cell lines (Vareed et al., 2006). The most common since the root areas of most lineages evidence suggests that differences in avian foraging, included in an analysis are unknown, the same analy- alteration in nutrient and taste, and defense against sis with DIVA under maxiareas of 2 optimized the pathogens could have played a role in fruit color root area of Cornus to be quite uncertain, most in- evolution in Cornus. cluding western North America (Table 3, Column E). 3.4 Biogeographic history The ancestral areas optimized in Xiang et al. (2006) Ancestral area estimation using parsimony and for these lower phylogenetic nodes with the coding maximum likelihood methods under different condi- approach favored by DIVA (coding for ancestral areas tions show substantial differences at some nodes and for higher hierarchical terminal taxa) are largely 370 Journal of Systematics and Evolution Vol. 46 No. 3 2008 congruent with the results inferred from ML analyses never be ruled out. This again demonstrates the limi- in the present study (see Table 3, column A and D), tation and difficulty in biogeographic analyses. Con- which mostly include Europe. However, without sidering all of the fossil evidence and results from divergence time information in Xiang et al. (2006), different analyses for Cornus, it is likely that the evaluation of alternative intercontinental dispersal genus diversified into major lineages and spread routes for the genus was limited. around the globe via NALB and BLB quickly in the The evolutionary pathway of Cornus in space early Tertiary, soon after its origin in the late Creta- and time favored by ML analyses using LAGRANGE ceous. Taken at face value, the results support including fossils is illustrated in Figures 5 and 6. The post-Eocene migration of plants across the NALB in origin and early diversification of Cornus was sug- the BW lineage (from node 37 to 36) (Xiang et al., gested to be in the Eurasian continent in the late 2005), suggesting that the NALB may have been more Cretaceous and early Tertiary (Fig. 5). Three intercon- important for plant migration during the later Tertiary tinental dispersals out of Europe into eastern North time than previously thought (Tiffney and Manches- America occurred in the BW lineage in different ter, 2001; see Xiang et al., 2005). The results are in geological times, twice in the early Tertiary at ~52 agreement with the role of the NALB in connecting mya and ~46 mya and once in the mid ~22 mya (Fig. the floras in Tropical Asia, America and Africa (Fig. 5). These migrations must have crossed the North 6, CC lineage, C. oblonga-C. peruviana J. F. Macbr.), Atlantic Land Bridge (NALB). Although NALB was as suggested by Xiang et al. (2005, 2006), Lavin et al. believed to have been broken by the early Eocene, (2000), and Davis et al. (2002). migration of temperate plants up to the early Miocene The divergence time between Cornus oblonga by hopping across the island chains was considered Wall. and C. peruviana J. F. Macbr. was estimated to possible (Tiffney, 1985). This result is congruent with be in the Eocene by Multidivtime and r8s and in the post-Eocene Trans-NALB migration of plants (see Miocene by BEAST (node 40 in Fig. 4; Figs. 5, 6). Xiang et al., 2005). The migrations of the BB lineage Based on the ancestral areas reconstructed by LA- from Europe into eastern and western North America, GRANGE and AReA (Table 3), both of these diver- as well as Central America at ~27 mya could have, gence times agree with migration from Europe east- however, occurred via either NALB and the Bering ward to Asia and westward to eastern North America Land Bridge (BLB) which was believed to be avail- via NALB and further southward via long distance able throughout most of the Tertiary (Tiffney & dispersal into South America by birds or by water (see Manchester, 2001). The early dispersal of the CC discussion in Xiang et al., 2006 and references lineage from Europe into western North America at therein). However, a divergence time in the Eocene ~42 mya was more likely to have taken the NALB also agrees with migration along the Tethys seaway route because in the Eocene, Tran-BLB dispersal margins, which was considered to be a viable migra- through Asia was blocked by the Turgai Strait (Tiff- tion route for the thermophilic boreotropical flora in ney & Manchester, 2001). However, the recent dis- the early Eocene (Tiffney, 1985) and was proposed to covery of dogwood leaf fossils from the Paleocene of be used for spread of the tropical family Alangiaceae western North America and northeastern Asia where (Feng et al., in review). fruit fossils of the CC lineage of the same age were In summary, this study showed that reconstruc- also found (Manchester et al., in press) may lend tions of the ancestral state of characters and ancestral support to early Tertiary Trans-BLB migration of the area distributions are affected by both analytical CC lineage at high latitudes. The leaf fossils were not methods and phylogenetic uncertainty. Researchers used in the biogeographic analysis due to their phy- are encouraged to apply the most sophisticated logenetic uncertainty within the genus. method possible available and conduct vigorous It must be noted that the biogeographic history analyses to explore alternative hypotheses before inferred here and shown in Fig. 5 and Fig. 6 is con- reaching a conclusion for evolution of characters that tingent upon the fruit fossil evidence available at the are homoplasious. The substantial differences between present and only the ancestral ranges with the highest DIVA and ML analyses that included and excluded likelihood scores and greatest probabilities. Given that fossils suggest the need of a reevaluation of the global there are alternative ranges with slightly lower likeli- biogeographic patterns previously assembled using hood scores and lower probabilities at many nodes, DIVA and raise special attentions to the use of fossil uncertainty clearly remains with this biogeographic data in biogeographic analyses. Researchers history. Furthermore, long distance dispersal can reconstructing ancestral areas with a phylogeny XIANG & THOMAS: Evolution & biogeography of Cornaceae 371 containing widely distributed higher hierarchical 1993. of plants: an analysis of terminal taxa (e.g., genus, family, etc.) should be nucleotide sequences from the plastid gene rbcL. Annals particularly careful with the choice of methods. of the Botanical 80: 528–580. Crane PR, Manchester SR, Dilcher DL. 1990. A preliminary Acknowledgements The authors thank R. Ree, S. survey of fossil and well-preserved reproductive structures from the Sentinel Butte Formation (Paleocene) Smith, and B. Moore for assistance and guidance in near Almont, North Dakota. Fieldiana Geol. 1418: 1–63. biogeographic analyses using LAGRANGE and AReA. Cunningham C. 1999. Some limitations of ancestral We also thank the Deep Time Research Coordination character-state reconstruction when testing evolutionary Network supported by the National Science Founda- hypotheses. Systematic Biology 48: 665–674. tion (NSF) funded to D. E. Soltis (DEB-0090283), the Davis CC, Bell CD, Mathews S, Donoghue MJ. 2002. Phytogeography of the Northern Hemisphere working Laurasian migration explains Gondwanan disjuncts: evidence from Malpighiaceae. Proceedings of the National group supported by NESCent funded to M. Donoghue Academy Sciences USA 99: 6833–6837. and P. Manos, and the ClockWork group supported by de Candolle AP. 1828. Prodromus Systematis Naturalis Regni NESCent funded to J. Clark and B. Wiegmann for the Vegetabilis 3: 203. workshops, symposia, and discussion with the par- Dermen H. 1932. Cytological studies of Cornus. Journal of ticipants of each group. We also thank the two re- Arnold 13: 401–417. viewers and the journal editors Y-L Qiu and Z-D Chen Donoghue MJ, Smith SA. 2004. Patterns in the assembly of temperate forests around the Northern Hemisphere. for valuable comments to improve this manuscript. Philosophical Transactions of the Royal Society, London B This research was supported by the National Science 359: 1633–1644. Foundation (DEB-0444125) funded to Xiang and Drummond AJ, Rambaut A. 2006. BEAST v1.4 [online]. benefited from a Multidisciplinary grant for faculty Available from http://beast.bio.ed.ac.uk/. research and professional development grant at North Ekman S, Andersen LH, Wedin M. 2008. The limitations of Carolina State University funded to Xiang, Franks and ancestral state reconstruction and the evolution of the Ascus in the Lecanorales (Lichenized Ascomycota). Xie. Systematic Biology 57: 141–156. Eyde RH. 1985. The case for monkey-mediated evolution in big-bracted dogwoods. Arnoldia (Jamaica Plain) 45: 2–9. References Eyde RH. 1988. Comprehending Cornus: Puzzles and progress in the systematics of the dogwoods. Botanical Review APG (Angiosperm Phylogeny Group). 2003. An update of the (Lancaster) 54: 233–351. Angiosperm Phylogeny Group classification for the orders Fan CZ, Purugganan MD, Thomas DT, Wiegmann BM, Xiang and families of flowering plants: APG II. Botanical QY. 2004. Heterogeneous evolution of the Myc-like Journal of Linnean Society 141: 399–436. Anthocyanin regulatory gene and its phylogenetic utility Baird JW. 1980. The selection and use of fruit by birds in an in Cornus L. (Cornaceae). Molecular Phylogenetics and eastern USA forest. Wilson Bulletin 92: 63–73. Evolution 33: 580–594. Bjorøy Ø, Fossen T, Andersen ØM. 2007. Anthocyanin Fan CZ, Xiang QY. 2001. Phylogenetic relationships within 3-galactosides from Cornus alba “Sibirica” with Cornus L. (Cornaceae) based on 26S rDNA sequences. glucosidation of the B-ring. Phytochemistry 68: 640–645. American Journal of Botany 88: 1131–1138. Blanchette M, Green ED, Miller W, Haussler D. 2004. Fan C, Xiang QY. 2003. Phylogenetic analyses of Cornales Reconstructing large regions of an ancestral mammalian based on 26S rRNA and combined 26S rDNA-matK-rbcL genome in silico. Genome Research 14: 2412–2423. sequence data. American Journal of Botany 90: Bollback JP. 2006. SIMMAP: Stochastic character mapping of 1357–1372. discrete traits on phylogenies. BMC Bioinformatics 7: 88. Fan C, Xiang QY, Remington DL, Purugganan MD, Wiegmann Boufford DE, Spongberg SA. 1983. Eastern Asian-eastern BM. 2007. Evolutionary patterns in the antR-Cor gene in North American phytogeographical relationships—a the dwarf dogwood complex (Cornus, Cornaceae). history from the time of Linnaeus to the twentieth century. Genetica 130: 19–34. Annals of the Missouri Botanical Garden 70: 423–439. Feild TS, Lee DW, Holbrook NM. 2001. Why leaves turn red in Bullock AA. 1959. Nomina familiarum conservanda proposita. autumn. The role of anthocyanins I senescing leaves of Taxon 8: 154–181. red-osier dogwood. Plant Physiology 127: 566–574. Chase MC, Soltis DE, Olmstead RG, Morgan D, Les DH, Feng CM, Manchester SR, Xiang QY(J). Phylogeny and Mishler BD, Duvall MR, Price RA, Hills HG, Qiu Y-L, biogeography of Alangiaceae (Cornales) inferred from Kron KA, Rettig JH, Conti E, Palmer JD, Manhart JR, DNA sequences, morphology, and fossils. Molecular Sytsma KJ, Michaels HJ, Kress WJ, Karol KG, Clark WD, Phylogenetics and Evolution. (in review, available from Hedren M, Gaut BS, Jansen RK, Kim K-J, Wimpee CF, the first author upon request) Smith JF, Furnier GR, Strauss SH, Xiang Q-Y, Plunkett Godfrey LR, Masters JC. 2000. Kinetochore reproduction GM, Soltis PS, Swensen S, Williams SE, Gadek PA, theory may explain rapid chromosome evolution Quinn CJ, Eguiarte LE, Golenberg E, Learn GH Jr, Proceedings of National Academy of Sciences USA 97: Graham SW, Barrett SCH, Dayanandan S, Albert VA. 9821–9823. 372 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Gould KS, Lister C. 2006. Flavonoid functions in plants. In: Maddison DR, Maddison WP. 2001. MacClade4: Analysis of Andersen OM, Markham KR eds. Flavonoids: Chemistry, phylogeny and character evolution. Version 4.03. Biochemistry and Applications. Boca Raton: CRC Press. Sunderland, MA: Sinauer Associates. 397–441. Maddison WP, Maddison DR. 2007. Mesquite: a modular Harvey PH, Pagel MD. 1991. The Comparative Method in system for evolutionary analysis [online]. Version 2.01. Evolutionary Biology. Oxford: Oxford University Press. Available from http://mesquiteproject.org. Heacock M, Spangler E, Riha K, Puizina J, Shippen DE. 2004. Manchester SR. 1994. Fruits and seeds of the Middle Eocene Molecular analysis of telomere fusions in Arabidopsis: Nut Beds flora, Clarno Formation, North Central Oregon. multiple pathways for chromosome end-joining. The Palaeontography of America 58: 1–205, 70 pls. EMBO Journal 23: 2304–2313. Manchester SR, Xiang QY, Kodrul TM, Akhmetiev M. Hines HM. 2008. Historical biogeography, divergence times, Leaves of Cornus (Cornaceae) from the paleocene of and diversification patterns of Bumble Bees North American and Asia confirmed by trichome (Hymenoptera: Apidae: Bombus). Systematic Biology 57: characters. International Journal of Plant Sciences. (in 58–75. press) Hoch WA, Singsaas EL, McCown BH. 2003. Resorption Milne RI, Abbott RJ. 2002. The origin and evolution of Tertiary protection. Anthocyanins facilitate nutrient recovery in relict floras. Advances in Botanical Research 38: 282–314, autumn by shielding leaves from potentially damaging pl. 5–7. light levels. Plant Physiology 133: 1296–1305. Moore BR, Smith SA, Donoghue MJ. 2006. Increasing data Huelsenbeck JP, Nielsen R, Bollback JP. 2003. Stochastic transparency and estimating phylogenetic uncertainty in mapping of morphological characters. Systematic Biology supertrees: Approaches using nonparametric 52: 131–158. bootstrapping. Systematic Biology 55: 662–676. Huelsenbeck JP, Rannala B, Masly JP. 2000. Accommodating Murnane JP. 2006. Telomeres and chromosome instability DNA phylogenetic uncertainty in evolutionary studies. Science Repair. Vol. 5. No. 9-10. (8 September 2006). 1082–1092. 288: 2349–2350. Murrell ZE. 1993. Phylogenetic Relationships in Cornus Huelsenbeck JP, Ronquist F, Nielsen R, Bollback JP. 2001. (Cornaceae). Systematic Botany 18: 469–495. Bayesian inference of phylogeny and its impact on Murrell ZE. 1996. A new of Cornus in South and evolutionary biology. Science 294: 2310–2314. Central America. Systematic Botany 21: 273–288. Ijdo JW, Baldini A, Ward DC, Reeders ST, Wells RA. 1991. Nair MG, Jayaprakasam B, Olson LK, Vareed SK. 2006. Insulin Origin of human chromosome 2: an ancestral secretion by anthocyanins and anthocyanidins. US Patent telomere-telomere fusion. Proceedings of the National and Trademark Office, Patent Application Academy of Sciences USA 88: 9051–9055. No:20060025353. Jayaprakasam B, Olson LK, Schutzki RE, Tai MH, Nair MG. Nepokroeff M, Sytsma KJ, Wagner WL, Zimmer EA. 2003. 2006. Amelioration of obesity and glucose intolerance in Reconstructing ancestral patterns of colonization and high-fat-fed C57BL/6 mice by anthocyanins and ursolic dispersal in the Hawaiian understory tree genus Psychotria acid in Cornelian cherry (). Journal of (Rubiaceae): a comparison of parsimony and likelihood Agriculture and Food Chemistry 54: 243–248. approaches. Systematic Biology 52: 820–838. Kishino H, Thorne JL, Bruno WJ. 2001. Performance of a Nielsen R. 2002. Mapping mutations on phylogenies. divergence time estimation method under a probabilistic Systematic Biology 51: 729–739. model of rate evolution. Molecular Biology and Evolution Ouzounis CA. 2005. Ancestral state reconstructions for 18: 352–361. genomes. Current Opinion in Genetics & Development Kolnicki RL. 2000. Kinetochore reproduction in animal 15: 595–600. evolution: Cell biological explanation of karyotypic Pagel M. 1999. The maximum likelihood approach to fission theory. Proceedings of the National Academy of reconstructing ancestral character states of discrete Sciences USA 97: 9493–9497. characters on phylogenies. Systematic Biology 48: Kondo B, Omland KE. 2007. Ancestral state reconstruction of 612–622. migration: multistate analysis reveals rapid changes in Pagel M, Meade A. 2006. Bayesian analysis of correlated new world orioles (Icterus spp.) The Auk 124: 410–419. evolution of discrete characters by reversible-jump Lavin M, Thulin M, Labat JN, Pennington RT. 2000. Africa, the Markov chain Monte Carlo. American Naturalist 167: odd man out: molecular biogeography of Dalbergioid 808–825. legumes (Fabaceae) suggests otherwise. Systematic Pagel M, Meade A, Barker D. 2004. Bayesian estimation of Botany 25: 449–467. ancestral character states on phylogenies. Systematic Lee DW, O’Keefe J, Hobrook NM, Field TS. 2003. Pigment Biology 53: 673–684. dynamics and autumn leaf senescence in a New Pedersen N, Holyoak DT, Newton AE. 2007. Systematics and deciduous forest, eastern USA. Ecological Research 18: morphological evolution within the moss family Bryaceae: 677–694. A comparison between parsimony and Bayesian methods Lewis PO. 2001. A likelihood approach to estimating for reconstruction of ancestral character states. Molecular phylogeny from discrete morphological character data. Phylogenetics and Evolution 43: 891–907. Systematic Biology 50: 913–925. Posada D, Crandall KA. 1998. Modeltest: testing the model of Maddison DR, Maddison WP. 1992. MacClade: Analysis of DNA substitution. Bioinformatics 14: 817–818. phylogeny and character evolution. Version 3.0. Ree RH, Moore BR, Webb CO, Donoghue MJ. 2005. A Sunderland, MA: Sinauer Associates. likelihood framework for inferring the evolution of XIANG & THOMAS: Evolution & biogeography of Cornaceae 373

geographic range on phylogenetic trees. Evolution 59: 243–273. 2299–2311. Tiffney BH, Manchester SR. 2001. The use of geological and Ree RH, Smith SA. 2008. Maximum-likelihood inference of paleontological evidence in evaluating plant geographic range evolution by dispersal, local extinction, phylogeographic hypotheses in the Northern Hemisphere and . Systematic Biology 57: 4–414. tertiary. International Journal of Plant Sciences 162: Renner SS, Beenken L, Grimm GW, Kocyan A, Ricklefs RE. S3–S17. 2007. The evolution of dioecy, heterodichogamy, and Traut W, Clarke CA. 1997. Karyotype evolution by labile sex expression in Acer. Evolution 61: 2701–2719. chromosome fusion in the moth genus Orgyia. Hereditas Ronquist F. 1996. DIVA version 1.1 [online]. Computer 126: 77–84. program and manual available by anonymous FTP from Vareed SK, Reddy MK, Schutzki RE, Muraleedharan G. 2006. Uppsala University (ftp.uu.se or ftp.systbot.uu.se). Anthocyanins in Cornus alternifolia, , Ronquist F. 1997. Dispersal-vicariance analysis: a new . Life Sciences 78: 777–784. approach to the quantification of historical biogeography. Willson MF, Irvine AK, Walsh NG. 1989. Vertebrate dispersal Systematic Biology 46: 195–203. syndromes in some Australian and New Zealand plant Ronquist F. 2004. Bayesian inference of character evolution. communities, with geographic comparisons. Biotropica Trends in Ecology and Evolution 19: 475–481. 21: 133–147. Rossnes R, Eidhammer I, Liberles DA. 2005. Phylogenetic Willson MF, Whelan CJ. 1990. The evolution of fruit color in reconstruction of ancestral character states for gene fleshy-fruited plants. The American Naturalist 136: expression and mRNA splicing data. BMC 790–809. Bioinformatics, 6: 127. Wu ZY. 1983. On the significance of Pacific intercontinental Rousselet J, Monti L, Auger-Rozenberg MA, Parker JS, discontinuity. Annals of the Missouri Botanical Garden Lemeunier F. 2000. Chromosome fission associated with 70: 577–590. growth of ribosomal DNA in Neodiprion abietis Xiang QY(J). 1987. System and synopsis of Cornus subgen. (Hymenoptera: Diprionidae). Proceedings of the Royal Syncarpea (Nakai) Q. Y. Xiang (Cornaceae). Bulletin of Society: Biological Sciences 267: 1819–1823. Botanical Research (Harbin) 7 (2): 33–52. Sanderson MJ. 2002. Estimating absolute rates of molecular Xiang QY(J), Boufford DE. 2005. Cornaceae, Mastixiaceae, evolution and divergence times: A penalized Toricelliaceae, Helwingiaceae, Aucubaceae. In: Wu ZY, likelihood approach. Molecular Biology and Evolution 19: Raven PH eds. Flora of . Beijing: Science Press; St. 101–109. Louis: Missouri Botanical Garden Press. 14: 206–234. Sanmartín I, Enghof H, Ronquist F. 2001. Patterns of animal Xiang QY(J), Brunsfeld SJ, Soltis DE, Soltis PS. 1996. dispersal, vicariance and diversification in the Holarctic. Phylogenetic relationships in Cornus based on chloroplast Biological Journal of Linnean Society 73: 345–390. DNA restriction sites: implications for biogeography and Sanmartín I, Ronquist F. 2004. Southern Hemisphere character evolution. Systematic Botany 21: 515–534. biogeography analyzed with event-based models: Plant Xiang QY(J), Eyde RH. 1995. Chromosome number of Cornus versus animal patterns. Systematic Biology 53: 216–243. sessilis (Cornaceae): phylogenetic affinity and evolution Schluter D, Price T, Mooers AO, Ludwig D. 1997. Likelihood of chromosome numbers in Cornus. Sida 16: 765–768. of ancestor states in adaptive radiation. Evolution 51: Xiang QY(J), Manchester SR, Thomas D, Zhang WH, Fan CZ. 1699–1711. 2005. Phylogeny, biogeography, and molecular dating of Schubert I, Rieger R. 1990. Alteration by centric fission of the cornelian cherries (Cornus, Cornaceae)—tracking Tertiary diploid chromosome number in Vicia faba L. Genetica 81: plant migration. Evolution 59: 1685–1700. 67–69. Xiang QY(J), Moody ML, Soltis DE, Fan CZ, Soltis PS. 2002. Seeram NP, Schutzki R, Chandra A, Nair MG. 2002. Relationships within Cornales and circumscription of Characterization, quantification, and bioactivities of Cornaceae—matK and rbcL sequence data and effects of anthocyanins in Cornus species. Journal of Agricultural outgroups and long branches. Molecular Phylogenetics and Food Chemistry 50: 2519–2523. and Evolution 24: 35–57. Smith SA. 2006. AReA: Ancestral Range Analysis v.2.1 Xiang QY(J), Shui Y, Murrell Z. 2003. Cornus eydeana [online]. Available from http://blackrim.org/programs/ (Cornaceae), A new cornelian cherry from China—Notes area.html. on Systematics and Evolution. Systematic Botany 28: Soltis DE, Soltis PS, Endress PK and Chase MW. 2005. 757–764. Phylogeny and Evolution of Angiosperms. Sunderland, Xiang QY(J), Soltis DE. 2001. Dispersal-Vicariance analyses MA: Sinauer Associates. of intercontinental disjuncts: historical biogeographical Takhtajan AL. 1987. Sistema Magnoliofitov. Izdatelístvo implications for angiosperms in the Northern Hemisphere. ìNauka,î Leningrad. International Journal of Plant Sciences 162: S29–39. Thorne JL, Kishino H, Painter S. 1998. Estimating the rate of Xiang QY(J), Soltis DE, Morgan DR Soltis PS. 1993. evolution of the rate of molecular evolution. Molecular Phylogenetic relationships of Cornus L. sensu lato and Biology and Evolution 15: 1647–1657. putative relatives inferred from rbcL sequence data. Thorne RF. 1972. Major disjunctions in the geographic ranges Annals of the Missouri Botanical Garden 80: 723–734. of seed plants. Quaternary Review in Biology 47: 365–411. Xiang QY(J), Soltis DE, Soltis PS. 1998. Phylogenetic Tiffney BH. 1985. The Eocene North Atlantic land bridge: its relationships of Cornaceae and close relatives inferred importance in Tertiary and modern phytogeography of the from matK and rbcL sequences. American Journal of Northern Hemisphere. Journal of Arnold Arboretum 66: Botany 85: 285–297. 374 Journal of Systematics and Evolution Vol. 46 No. 3 2008

Xiang QY(J), Thomas DT, Zhang W, Manchester SR, Murrell review, available from the first author upon request) Z. 2006. Species level phylogeny of the Dogwood genus Yu Y-C, Linà F-J, Chang H-Y. 1999. Stepwise chromosome Cornus (Cornaceae) based on molecular and evolution in Drosophila albomicans. Heredity 83: 39–45. morphological evidence—implication in and Zhang WH. 2006. Molecular evolution of floral homeotic Tertiary intercontinental migration. Taxon 55: 9–30. B-class genes in the dogwood genus Cornus Xiang QY(J), Thomas DT, Manchester MR. 2007. Phylogeny (Cornaceae)—gene duplication, selection, and and Biogeography of Cornales—Increased Sampling in coevolution. Ph.D. Dissertation. North Carolina State Taxa and Characters, Placement of Hydrostachys, BEAST University. Available from http://www.lib.ncsu.edu/ vs. Multidivtime, and DIVA vs. AreA. Abstract presented theses/available/etd-10142006-234225. at meeting of Botany & Plant Biology. July, 2007, Chicago. Zhang WH, Xiang QY, Thomas DT, Wiegmann BM, Frohlich Xiang QY(J), Thone JL, Seo TK, Zhang WH, Thomas DT, MW, Soltis DE. 2008. Molecular evolution of Ricklefs RE. Rate of nucleotide substitution in Cornus L. PISTILLATA-like genes in the dogwood genus Cornus (Cornaceae)—A remarkable increase in the circumboreal (Cornaceae). Molecular Phylogeny and Evolution 47: lineage. Molecular Phylogenetics and Evolution. (in 175–195.