TRPLSC 1409 No. of Pages 8

Opinion Turnip Time Travels: Age Estimates in Andreas Franzke,1,* Marcus A. Koch,1,2 and Klaus Mummenhoff3

Results of research in life sciences acquire a deeper meaning if they can also be Trends discussed in temporal contexts of evolution. Despite the importance of the The mustard family is one of the most mustard family (Brassicaceae) as a prominent angiosperm model family, a important model families for vir- robust, generally accepted hypothesis for a family-wide temporal framework tually all areas of contemporary plant does not yet exist. The main cause for this situation is a poor fossil record of the sciences. family. We suggest that the few known fossils require a critical re-evaluation of A reliable temporal framework for the phylogenetic and temporal assignments as a prerequisite for appropriate evolutionary history of the family is essential as this unifies evolutionary molecular dating analyses within the family. In addition, (palaeo)biogeographi- hypotheses from various disciplines cal calibrations, not explored so far in the family, should be integrated in a of plant research. synthesis of various dating approaches, with each contributing their specific The conflict of hypotheses on node possibilities and limitations. ages within the family is often not appropriately perceived in the What is Clade Age Estimation About? community. Age estimates based on molecular clock models are among one of the most important and Previous dating methods relying solely fascinating applications in modern evolutionary biology. Early, somewhat simplistic approaches on a few fossils attributed to the family (e.g., relying on constant molecular clock models), have been replaced during the past two and alternative approaches remain unsatisfactory. decades by highly elaborate methods and new concepts. Contemporary approaches for choosing suitable molecular clock models that accommodate different forms of evolutionary rate heterogeneity and methods for handling multilocus data sets and for different calibration techniques (see Glossary) are regularly reviewed (e.g., [1]). Due to advances in methods of estimation and recognition of the problems in measuring the absolute fit between evolutionary models and data, current approaches will be regarded in the near future as being overly simplistic. This also implies that dating analysis for a given group should always be interpreted with a healthy dose of historical criticism. For example, results of different studies might be difficult to compare as taxon sampling can have effects on molecular clock dating analysis and the manner in which palaeontological evidence is used for calibrating trees is often subjective (see [1] and references therein). Last, but not least, the result of molecular age estimates – in the 1 same manner as a result of a phylogenetic reconstruction from a sample of gene loci – is ‘only’ Heidelberg Botanic Garden, Centre for Organismal Studies (COS) one hypothesis regarding clade ages. Too often the jargon used in these respective publications Heidelberg, Heidelberg University, D- sounds all too convincing, as the results of the dating analysis (ages based on molecular clock 69120 Heidelberg, Germany 2 concepts) are presented as hard facts. This opinion article outlines our viewpoints on and Department of Biodiversity and Plant Systematics, Centre for Organismal critique of recent approaches used for age estimates within the mustard family. We believe our Studies (COS) Heidelberg, Heidelberg general thoughts on this to be relevant for a broad spectrum of the scientific community, as many University, D-69120 Heidelberg, Brassicaceae taxa now serve as model systems for numerous and diverse fields in plant German 3Biology Department, Botany, sciences [2] (Boxes 1 and 2). Osnabrück University, D-49069 Osnabrück, Germany Current Problems of Age Estimates in the Brassicaceae Several recent Brassicaceae-focused studies aimed at providing age estimates for and within *Correspondence: the family using relaxed molecular clock approaches, our own work included: Franzke et al. [email protected] [3] estimated rather young ages (e.g., Brassicaceae crown group age of ca 15 My) and have (A. Franzke).

Trends in Plant Science, Month Year, Vol. xx, No. yy http://dx.doi.org/10.1016/j.tplants.2016.01.024 1 © 2016 Elsevier Ltd. All rights reserved. TRPLSC 1409 No. of Pages 8

Box 1. An Introduction to and Brassicaceae Glossary The order Brassicales comprises 16 families and about 4700 species, marked by the presence of specialised cells Biogeography: study of the (myrosin cells) and the production of sulfur-containing metabolites known as glucosinolates (mustard oil glucosides). The distribution of species and Brassicaceae family or Cruciferae (mustards or crucifers) is the most species-rich member of the order Brassicales (ca ecosystems in geographic space and 3700 species) and includes Arabidopsis thaliana as one of the most important model species in plant biology and through geological times. numerous important crop such as cabbage (Brassica oleracea), canola (Brassica napus, Brassica rapa), and Calibration: converting genetic mustard (Sinapis alba, Brassica nigra). Moreover, the family comprises an increasing number of species that serve as distances to absolute times, usually fi [2_TD$IF] study systems in many elds of plant science and evolutionary research. Brassicaceae are readily distinguished from by means of fossils, geologic, or other Brassicales families by a cruciform (cross-shaped) corolla, six stamens (the outer two shorter than the inner four), a biogeographic evidence or nucleotide capsule often with a septum, and a pungent, watery sap. However, the systematics and of the family are very substitution rates. complex. Comprehensive molecular studies revealed 51 monophyletic tribes in four major lineages [8], with Clade: group of organisms (species, [3_TD$IF] Aethionemeae as sister to the remaining Brassicaceae [57,58], and almost every character that has been used for genera, etc.) derived from a common classical taxonomy exhibits substantial homoplasy, especially those of fruits. Radiation of the family most probably ancestor. – started in the Irano Turanian region. Most molecular datings indicate a pre-Miocene origin of the family and evidence Core Brassicaceae: all recent from palaeobotany and palaeoecology favours a Miocene radiation [30] (but see [4] for older age estimates). Never- lineages except the sister tribe ‘ ’ theless, in both cases the radiation of Brassicaceae was largely extrinsically driven by Miocene climate changes that Aethionemeae. created open and drier habitats and these new ecological niches became characteristically occupied by members of Crown group age: age of the clade the family. Poorly resolved early cladogenesis argues for rapid colonisation of the newly formed arid and semiarid areas that includes all recent taxa of a ‘ ’ worldwide. The most important intrinsic motor for the increase of this family of over 3700 extant species was suggested group. fi to be WGDs, which provided the genetic raw material for biological radiation and diversi cation [8,11,22,30,38]. Ecological niche: the ecological role and space that an organism fills in an ecosystem. been quite rightly criticised [4] for relying on only a single secondary calibration (Box 3). Couvreur Geological ages: Miocene (23–5.3 – et al. [5] estimated a crown group age of the family of ca 37.5 My and was criticised [4] using only Mya); Pleistocene (2.5 0.01 Mya); Quaternary (2.5–0 Mya). a single fossil constraint. The work of Beilstein et al. [4] is based on a molecular clock model Homoplasy/homoplasious: calibrated with four fossils including a potentially overlooked Brassicaceae fossil, Thlaspi nonhomologous similarities due to primaevum, as a minimum age constraint for the stem group age of Thlaspi (i.e., a rather convergence or parallel evolution. By contrast, homologous similarity is terminal split in the Brassicaceae phylogeny between Thlaspi arvense and Alliaria petiolata). In inherited through common ancestry. this study, the crown node age of the Brassicaceae was dated at approximately 54 Mya and Irano–Turanian region: one of the clade age estimates are two- to threefold older than previously calculated. More recent richest floristic areas of the Holarctic comprehensive nuclear transcriptome and multigene-locus-based studies provided indepen- Kingdom in Southwest Asia, with most of its species diversity in the dent and convergent evidence for a Brassicaceae crown group age close to the 37.5 My of [5]; Iranian plateau, Anatolian plateau, however, not relying on the abovementioned Thlaspi fossil: 31.8 Mya [6] and 37.1 Mya [7]. and Central Asia. Another recent Brassicaceae-focused whole-plastome-based analysis without the Thlaspi fossil Model plant: reference species in calibration provided a Brassicaceae crown group age of 32.4 My [8]. These latter estimates on furthering detailed understanding of – mechanisms and processes in plants. Brassicales-focused studies [5 8] are all in agreement with other recent angiosperm-wide Ideally, models are diploids, have few studies (e.g., [9–12]). Interestingly, the most extreme and oldest Brassicaceae age estimates and small chromosomes, well- of [4] are preferentially cited, predominantly in context with ages for specific nodes within developed genetics, and rapid life Brassicaceae (e.g., the split between Arabidopsis thaliana and Arabidopsis lyrata). In approxi- cycles, are easily transformed, and have extensive sets of technical mately one-third of these publications the authors cite age estimates from [4] only and draw resources and databases curated by major conclusions with no reference to alternative results, or used the results of this study to infer international resource centres. substitution rates for 18 nuclear and organelle markers frequently used for systematics and Molecular dating: estimating the population genetics [13]. This includes many high-impact publications [14–25]. We believe that divergence dates of two or more lineages by comparing their DNA or the outcome obtained by [4] is debatable at the very least as there are some major discrepancies protein sequence data. between this study and the other abovementioned studies [5–12]. The general methodology for Molecular systematics: use of the the phylogenetic analysis and molecular age estimates employed by [4] is without doubt state of structure of molecules to gain information on evolutionary the art. We appreciate their exemplary practice for a strict evaluation of the fossils used for relationships. calibration, accepting only well-documented vouchers fulfilling minimum requirements including Monophyletic group: comprises a a clear citation record, photographic evidence or accurate reproduction, and a fossil collection last common ancestor and all of its number. Consequently, due to the lack of missing primary literature, the authors excluded a fossil descendants. Node: in a rooted phylogenetic tree, seed (and possibly part of the carpel) that was attributed to Rorippa [4], which had been used in each node with descendants prior studies for calibrating molecular clocks (e.g., [26,27]). However, the significantly older represents the inferred most recent divergence times of [4] are also incongruent with other recent results: Bell et al. [9] estimated common ancestor of the divergence times across the angiosperms (36 calibration points, 567 taxa, relaxed-clock model) descendants. Radiation: increase in taxonomic and calculated the stem node age for the Brassicaceae at ca 32 Mya versus ca 65 Mya in [4] and diversity or morphological disparity. ca 42 Mya versus ca 71 Mya in [4] for the Brassicaceae/Cleomaceae/Capparaceae clade. This

2 Trends in Plant Science, Month Year, Vol. xx, No. yy TRPLSC 1409 No. of Pages 8

Box 2. The Brassicaceae as a Model Plant Family Relaxed molecular clock Thirty years ago, Arabidopsis thaliana emerged as the model organism of choice for research in plant biology. No single approach: analytical methods for species can address the diverse range of ecological and evolutionary questions of current interest, but the relatives of this molecular dating that relax the model species provide experimental opportunities to use the information from and tools of A. thaliana to unravel the assumption of nucleotide substitution molecular mechanisms underlying important and putatively adaptive traits beyond A. thaliana and its shrunken genome rate constancy among lineages. [59]. Thus, in addition to the well-known model plants A. thaliana [60] and Brassica species [61], several other Stem group: organisms close to but Brassicaceae taxa are currently used as study objects in modern plant biology [2]. Here we give a selection of these outside a particular crown group that model species (for the phylogenetic position, see Figure 1 in [30]) and some key words to characterise some of the always lack features present at the addressed research interests: Arabidopsis halleri, Noccaea caerulescens (heavy metal tolerance and hyperaccumulation base of the crown group to which [62]); Arabidopsis lyrata, Arabidopsis suecica (self-incompatibility and genome evolution [63]); Arabis alpina (perennial they are attached. habit [64]); Cardamine hirsuta (leaf architecture [65]); Capsella sp. (self-incompatibility [66,67]); Capsella bursa-pastoris Systematics: study of evolutionary (flowering time, floral architecture [68]); Boechera sp. (apomixis [69] and plant–insect and plant–pathogen interactions relationships between groups of [70]); Diplotaxis sp. (mating system changes [71]); Iberis sp. (flower symmetry [72]); Lepidium sp. (seed physiology [73] organisms (species, genera, etc.). and fruit structure [74]); Eutrema, Thellungiella sp. (salt stress [75]); Brassicaceae sp. (WGDs, genome evolution, and Taxon (plural, taxa): taxonomic rank diversification [8,22,38,52,76–78]); and Brassicales (coevolutionary interactions between plants and butterflies [6] and at any level (e.g., species, , glucosinolate evolution [79]). To evaluate the evolutionary context of these characters there is great demand to classify family, order, division). the processes underlying these characters/traits in a geological time frame. This, however, requires proper calibration of Taxonomy: description, molecular markers and phylogenies. identification, naming, and classification of organisms at various ranks. Tribe: a taxonomic category placed latter node was also very recently dated as being ca 44 My old [10] in the most comprehensive between a subfamily and a genus. Whole-genome duplication [7_TD$IF] angiosperm-wide dating analysis so far, which relied on 137 fossil calibrations and several (WGD): an event creating an molecular markers. It should be noted, however, that Magallón et al. [10] presented this estimate organism with extra copies of the incorrectly as the stem age of the Brassicaceae family s.str., although their analysis did not entire genome (also called include representatives of Cleomaceae, the sister family to the Brassicaceae. The same node polyploidy). WGD events are of different ages and can be caused by fi was dated as being even signi cantly younger (ca 34 Mya) in other recent comprehensive large- hybridisation combining genomes of scale dating analyses [11,12]. Also noteworthy are the unexpectedly high age estimates for splits different species (allopolyploidy) or by within genera in [4] (e.g., Lepidium, ca 16 Mya), implying that many accepted Quarternary the multiplication of the same genome. biogeographical scenarios for Brassicaceae taxa are highly questionable. This would also be true for the split between ecotypes of A. thaliana dated to 4.3 Mya in [4] versus Pleistocene splits in [28]. Arias et al. [29], following a similar approach, used fossil T. primaevum for dating nodes within the tribe Brassiceae. This approach resulted in a Pleistocene origin of Brassica cultivars (ca 90 000 ya), although it is well recognised that cultured plants did not originate (as a consequence of domestication) before the Holocene (11 500–2200[8_TD$IF] ya). The inferred compara- tively high age estimates of [4] suggest, therefore, that the analysis may have been biased towards older age estimates, potentially due to the incorporation of the putative Thlaspi fossil dated to 30 Mya [4].

Current Brassicales Fossil Situation We doubted earlier the attribution of T. primaevum to extant Thlaspi (for details see [30]). It is well known that fruit characters are highly homoplasious throughout the Brassicaceae family [30]; therefore, assignments of (such old) ‘Thlaspi’ fossil fruits to a distinct Brassicaceae taxon might be per se very questionable. Indeed, there is homoplasy in almost every morphological character in the crucifers [30]. Following this train of thought, one should also be sceptical when considering all described oldest macrofossils (fruits, seeds) that have been assigned to the genus level. Fossils assigned to Draba, Sinapis, Thlaspi, Cochlearia,andClypeola are reported as being from the late Pliocene and the late Miocene of Germany, respectively (see references in [30]). The phylogenetic placements of these fossils, however, have never been confirmed.ThisisalsotrueforfossilsofBunias from the Pleistocene of Russia and also the abovementioned Rorippa fossil from the Pliocene of Russia (see [4] and references therein). Fossil fruits from the late Miocene of Germany had been incorrectly determined as Draba and Lepidium, respectively, and also require further studies to determine their affinity (see [4] and references therein). These fossils are characterised by geological boundary intervals of temporal resolution. For example, the abovementioned Rorippa fossil is assigned to the Pliocene, reflecting a time span from 2.5 to 5 Mya. Therefore, in a molecular dating analysis,

Trends in Plant Science, Month Year, Vol. xx, No. yy 3 TRPLSC 1409 No. of Pages 8

Box 3. Dating Approaches Fossil Calibration Calibrations are usually based on fossil evidence, whereby a minimum constraint on the age of a clade is based on the timing of its earliest fossil representative. There has been extensive research into the proper use of fossil data for calibrating molecular trees and clocks and problems associated with the methodology (with uncertainty in fossil age and phylogenetic position representing the two greatest challenges); the reader is referred to [1,31,32,45][4_TD$IF] and the references therein. There is general agreement that the fossil record remains the most reliable source of information for the calibration of phylogenetic trees, although associated assumptions and potential bias must be taken into account.

Secondary Calibration Secondary or indirect calibrations are node ages derived from previous analyses applied to an independent data set currently under study without reference to the original calibrations used to generate them [37]. Secondary calibration represents the most commonly applied age constraint after fossils, despite many problems associated with its use [37]. The primary problem with this approach is that sources of error generated by the first dating analysis become subsumed into new estimates, resulting in divergence dates of increasingly dubious reliability. Thus, the use of secondary calibration should be a last resort; the reader should consult [37] and the references therein for critical discussion of this method.

Geological and (Palaeo)biogeographical Calibration Divergence of species can sometimes be attributed to geophysical isolating mechanisms or the appearance of new habitats (e.g., formation of islands, mountain systems, seaways, deserts, other geological events). This information can be used to calibrate phylogenetic trees and estimates of molecular rates once the timing of such an event is known in setting a maximum age at a node [46]. However, this procedure is prone to some distorting factors, including errors in the estimation of geological ages, the degree of association between geological events and genetic divergences, and the impacts of taxon sampling and lineage extinction [1]. Correlating the age of taxa with that of associated palaeogeo- graphical events is probably one of the most promising methods, but the reader is referred to [1,45–47] and the references therein for details and critical evaluation of the methodology.

Evidence from MA Lines In MA experiments the mutation rates of spontaneous mutations are studied in replicated inbred lines. The resulting rates, independent of any external calibration (e.g., fossils) could then potentially be used for age estimates. However, such experimentally estimated mutation rates could be an order of magnitude or more higher than substitution rates based on measurement over geological timeframes [45][5_TD$IF] .

‘only’ the minimum age of the geological boundary should be used for node calibrations [31,32]. In addition, the Brassicaceae fossils (already) mentioned represent terminal taxa of the Brassicaceae. As it has been shown that calibrations at terminal nodes generally lead to rate and date estimates with higher error and lower precision [1], these fossils might be valuable for dating closely related terminal groups ‘only’. However, a critical re-evaluation of the taxonomic and temporal assignments of these fossils is urgently needed (Figure 1). Brassicaceae fossils for calibrating (deeper) nodes within the family are presently de facto not available. For larger-scale Brassicaceae phylogenies based on conservative-enough markers, it is putatively possible to include at least some external fossils of closely related families (Brassicales), which were also used in the abovementioned larger-scale dating analyses. However, the fossil record outside Brassicaceae (i.e., for the closely related families in the order Brassicales) is also scarce. At present this is as follows:[9_TD$IF] (i) Turonian fossil flowers assigned to Dressiantha bicarpellata dated at ca 89 Mya [33], the oldest known putative Brassicales fossils [4].AsDressiantha could be a stem representative or a member of the Brassicales crown group it should be used conservatively, calibrating the Brassicales stem age [10]. (ii) A leaf fossil from a Palaeocene formation (ca 61.7 Mya [34]) that was assigned to the genus Akania (Akaniaceae) but not formally assigned to a species (Akania sp.). This could be used as a minimum time constraint for the Akaniaceae stem group. (iii) Silicified wood from Neogene sediments (Late Karpatian, 17.0–16.3 Mya [35]) assigned to Capparidoxylon holleisii were clearly attributed to Capparaceae [35]. As this fossil wood is distinguished by only a very few anatomical characters from wood of the extant genus Capparis [35], it has been used to calibrate the crown age of the Capparaceae [4]. As stated above, we do not agree with the

4 Trends in Plant Science, Month Year, Vol. xx, No. yy TRPLSC 1409 No. of Pages 8

Figure 1. Correct Taxonomic Assign- ment of Fossils Is Crucial for Mole- cular Dating Analyses. We therefore suggest that the few known Brassicaceae fossils require critical re-evaluation. This cartoon illustrates our opinion that this is also true for Thlaspi primaevum, a fossil fruit that was recently used for molecular analysis and resulted in relatively old diver- gence times compared with several other published dating analyses. Cartoon drawn by Louis Werner.

attribution of fossil T. primaevum to extant Thlaspi. At present we would therefore suggest including T. primaevum only very conservatively to constrain the Brassicaceae stem/crown age and then to test the influence of this fossil on age estimates. A similar test (T. primaevum fossil included versus not included in dating analyses) was recently conducted by [7] and age estimates with T. primaevum were indeed higher. Interestingly, Magallón et al. [10] used T. primaevum as a constraint for the Arabidopsis–Brassica split, incorrectly designated as the crown node of Brassicaceae as representatives of the tribe Aethionemeae[3_TD$IF] (sister to all remaining Brassicaceae) were not included. However, the authors used ca 23 My as a constraint, corresponding to the upper boundary of the Oligocene, instead of the currently established age of the according fossil flora being ca 32 My old [36]. The inclusion of Brassicales fossils (and corresponding sequences) is putatively valuable only for larger-scale phylogenies, as here only conservative markers can be applied, which allow uncritical align- ments of sequences from Brassicaceae taxa and representatives of other Brassicales. This is also true for secondary calibrations (node ages from previous analyses) derived from the abovementioned large-scale analyses (e.g., [10]).

Alternatives to Primary Fossil Dating Approaches Secondary calibrations are derived from node age estimates from previous studies applied to an independent data set without reference to the original calibrations used to generate them. Therefore, uncertainty in the original analyses (e.g., due to non-critical assessment of fossil evidence) could lead to compounded errors. As reliable (oldest) fossils for primary calibrations within the Brassicaceae are not available at present, secondary calibration, successively applied from more basal nodes to more terminal nodes within the family, is one of the remaining options for dating estimates within the crucifers. Potential bias associated with the original study should then be taken into account and reported [37].

Before discussing geologic and (palaeo)biogeographical calibrations, where we see some unexploited potential for calibrating more terminal nodes within the family, we wish to address two other approaches in the context of age estimates in the family. A key finding is that all Brassicaceae species share a common whole-genome duplication (At-/ WGD) event in their history [11,38]. Thus, the age of the At-/ WGD should coincide with the age of the Brassicaceae and, therefore, should represent an alternative independent approach of estimating the age of the Brassicaceae. Analogically, this should be true for at least four additional independent lineage-specific WGDs within the family (e.g., the Br-/ WGD characterising the tribe Brassi- ceae). However, inferring the exact timing of WGDs is not straightforward, as the general

Trends in Plant Science, Month Year, Vol. xx, No. yy 5 TRPLSC 1409 No. of Pages 8

problems described above also apply for dating WGDs; for example, rate heterogeneity within Outstanding Questions and among lineages and rate variation among genes, even at synonymous sites [39,40]. Is it possible to improve the reliability of However, the most important point in this respect is the fact that the substitution rates used phylogenetic and temporal assignment fi of known Brassicaceae fossils for to nally scale molecular divergence into time were also calibrated, directly or indirectly, with molecular clock calibrations within the fossils; moreover – to the best of our knowledge – in Brassicaceae studies always under the family? We suggest exploiting the syn- assumption of a strict molecular clock. Another fascinating approach for divergence time ergy of transdisciplinary research estimates could be based on (general) mutation rates inferred from mutation accumulation efforts of palaeobotanists and Brassi- caceae (molecular) systematists. (MA) experiments (Box 3). Such an experiment was performed for A. thaliana and the time of divergence between A. thaliana and A. lyrata was calculated as being ca 18 Mya [41]. This result Can (palaeo)biogeographical molecular is in general agreement with the (T. primaevum) fossil-based high age estimate for the same split clock calibration approaches, new for (13 Mya) of [4], which was consequently regarded as independent confirmation of the MA dating the Brassicaceae, improve the tempo- ral framework of the family? Recent approach [42]. However, the MA-based dating approach for this particular split was based on approaches in this developing field also the assumption of a generation time of 1 year. This is true for extant A. thaliana. However, closely correlate demographic events with related Arabidopsis taxa including A. lyrata are biennials, mostly even perennials [43].As geological calibrations. annuality and selfing are commonly associated [44], the lineage of A. thaliana might be annual Is it possible to unify hypotheses from – fi since only very recent times (Middle Pleistocene, ca 0.44 1 Mya), when transition to sel ng in A. various dating approaches for a family- thaliana occurred (see [43] and references therein). This would imply that the MA-based dating wide temporal framework? This might approach for the A. lyrata–A. thaliana split would be at least (unbelievably) 36 My old. Moreover, also deepen our understanding of how MA experiment-derived spontaneous as mutation rates from MA experiments typically exceed long-term substitution rates at least by mutation rates correlate with long-term an order of magnitude (see [45] and references therein), this particular split would ‘realistically’ be substitution rates. 72 My old. In conclusion, much more research is needed to clarify how the mutation rate observed in one species can be assigned to related species to base age estimates on results of MA experiments.

Concluding Remarks: Biogeography as Part of the Solution? Besides results based on critical re-evaluated Brassicaceae fossils and the stepwise sec- ondary calibration approaches mentioned above to infer age estimates within the family, biogeographical calibrations could be an alternative option for mutual comparison between the two methods (see Outstanding Questions). As in all other calibrating methods, this type of external calibration approach carries numerous risks [1,45–48] that we do not want to address here in detail; nevertheless, it would seem to be a promising approach [49].Herewe see unexploited potential in biogeographical calibrations for the Brassicaceae family as – to our knowledge – this approach has not so far been applied. We therefore suggest that geological or palaeoclimate events in the Neogene–Quarternary should be tested to calibrate terminal nodes in Brassicaceae phylogenies; for example, the onset of the Messinian Salinity Crisis and the origin of the Aegean Islands (Ricotia [50]), the formation of Socotra (endemic Brassicaceae [51]), the emergence of the Sahara (South African endemic Brassicaceae [27,52], the uplift of the Southern Alps in New Zealand (Pachycladon [53]), the uplift of the Himalayas and the Tibetan plateau (tribe Arabidae [54]), the longitudinal range split and genetic differentiation (Eurasian steppe plant Clausia aprica [55]), the origin of the Hawaiian islands (Hawaiian endemic Lepidium [27]), or the recolonisation of formerly glaciated areas such as the Arctic or High Alpine regions during Pleistocene glaciations (e.g., Arabis alpina [56]). This approach is no panacea but might be a starting point for a more holistic view on the evolutionary history of the family, where, for example, a Miocene age estimate for typical ‘known’ Quaternary biogeographical patterns should provoke some scepticism.

Acknowledgments Nora Hohmann, Nicolai Nürk, José Ignacio Lucas-Lledó, Diego Salariato, Barıs¸ Ozüdog€ ˘ ru, Lydia Gramzow, Günter Theißen, and Steven Manchester are acknowledged for discussion, Graham Muir and Lucille Schmieding for proofreading, and Louis Werner for drawing the cartoon. The authors thank two anonymous reviewers and Fabien Condamine for constructive advice on an earlier version of the manuscript. A.F., M.A.K., and K.M. were funded by the German Research Foundation (DFG) (grants MU 1137/7-1/2, MU 1137/9-1, and KO 2302/13-1).

6 Trends in Plant Science, Month Year, Vol. xx, No. yy TRPLSC 1409 No. of Pages 8

References 1. Ho, S.Y.W. and Duchêne, S. (2014) Molecular-clock methods 25. Beilstein, M.A. et al. (2015) Evolution of the telomere-associated for estimating evolutionary rates and timescales. Mol. Ecol. 23, protein POT1a in Arabidopsis thaliana is characterized by positive 5947–5965 selection to reinforce protein–protein interaction. Mol. Biol. Evol. – 2. Schmidt, R. and Bancroft, I.,eds (2011) In Genetics and Genomics 32, 1329 1341 of the Brassicaceae. Plant Genetics and Genomics: Crops and 26. Koch, M. et al. (2000) Comparative evolutionary analysis of Models (Vol. 9), Springer chalcone synthase and alcohol dehydrogenase loci in Arabi- 3. Franzke, A. et al. (2009) Arabidopsis family ties: molecular dopsis, Arabis and related genera. Mol. Biol. Evol. 17, – phylogeny and age estimates in the Brassicaceae. Taxon 58, 1483 1498 425–437 27. Mummenhoff, K. et al. (2001) Chloroplast DNA phylogeny and 4. Beilstein, M.A. et al. (2010) Dated molecular phylogenies indicate a biogeography of Lepidium (Brassicaceae). Am. J. Bot. 88, – Miocene origin for Arabidopsis thaliana. Proc. Natl. Acad. Sci. U.S. 2051 2063 A. 107, 18724–18728 28. Beck, J.B. et al. (2008) Native range genetic variation in Arabi- fl 5. Couvreur, T. et al. (2010) Molecular phylogenetics, temporal diver- dopsis thaliana is strongly geographically structured and re ects – sification, and principles of evolution in the mustard family (Bras- Pleistocene glacial dynamics. Mol. Ecol. 17, 902 915 sicaceae). Mol. Biol. Evol. 27, 55–71 29. Arias, T. et al. (2014) Diversification times among Brassica (Bras- 6. Edger, P.A. et al. (2015) The butterfly plant arms-race escalated by sicaceae) crops suggest hybrid formation after 20 million years of – gene and genome duplications. Proc. Natl. Acad. Sci. U.S.A. 112, divergence. Am. J. Bot. 101, 86 91 8362–8366 30. Franzke, A. et al. (2011) Cabbage family affairs: the evolutionary – 7. Huang, C-H. et al. (2015) Resolution of Brassicaceae phylogeny history of Brassicaceae. Trends Plant Sci. 16, 108 116 using nuclear genes uncovers nested radiations and supports 31. Sauquet, H. et al. (2012) Testing the impact of calibration on convergent morphological evolution. Mol. Biol. Evol. Published molecular divergence times using a fossil-rich group: the case online October 29, 2015. http://dx.doi.org/10.1093/molbev/ of Nothofagus (Fagales). Syst. Biol. 61, 289–313 msv226 32. Parham, J. et al. (2012) Best practices for justifying fossil calibra- 8. Hohmann, N. et al. (2015) A time-calibrated road map of Brassi- tions. Syst. Biol. 61, 346–359 caceae species radiation and evolutionary history. Plant Cell 27, 33. Gandolfo, M.A. et al. (1998) A new fossil flower from the Turonian – 2770 2784 of New Jersey: Dressiantha bicarpellata gen. et sp. nov. (Cappar- 9. Bell, C.D. et al. (2010) The age and diversification of the angio- ales). Am. J. Bot. 85, 964–974 – sperms. Am. J. Bot. 97, 1296 1303 34. Iglesias, A. et al. (2007) A Paleocene lowland macroflora from 10. Magallón, S. et al. (2015) A metacalibrated time-tree documents Patagonia reveals significantly greater richness than North Ameri- the early rise of flowering plant phylogenetic diversity. New Phytol. can analogs. Geology 35, 947–950 – 207, 437 453 35. Selmeier, A. (2005) Capparidoxylon holleisii nov. spec., a silicified 11. Tank, D.C. et al. (2015) Nested radiations and the pulse of angio- Capparis (Capparaceae) wood with insect coprolites from the sperm diversification: increased diversification rates often follow Neogene of southern Germany. Zitteliana 45, 199–209 – whole genome duplications. New Phytol. 207, 454 467 36. Lielke, K. et al. (2012) Reconstructing the environment of the 12. Zanne, A.E. et al. (2014) Three keys to the radiation of angio- northern Rocky Mountains during the Eocene/Oligocene transi- sperms into freezing environments. Nature 506, 89–92 tion: constraints from the palaeobotany and geology of south- – 13. Huang, C-C. et al. (2012) Evolutionary rates of commonly used western Montana, USA. Acta Palaeobot. 52, 317 358 nuclear and organelle markers of Arabidopsis relatives (Brassica- 37. Hipsley, C.A. and Müller, J. (2014) Beyond fossil calibrations: ceae). Gene 499, 194–201 realities of molecular clock practices in evolutionary biology. Front. 14. Cao, J. et al. (2011) Whole-genome sequencing of multiple Ara- Genet. 5, 138 bidopsis thaliana populations. Nat. Genet. 43, 956–963 38. Schranz, M.E. et al. (2012) Ancient whole genome duplications, fi 15. Bartish, I.V. et al. (2012) Phylogeny and colonization history of novelty and diversi cation: the WGD Radiation Lag-Time Model. – Pringlea antiscorbutica (Brassicaceae), an emblematic endemic Curr. Opin. Plant Biol. 15, 147 153 from the South Indian Ocean Province. Mol. Phylogenet. Evol. 65, 39. De Bodt, S. et al. (2005) Genome duplication and the origin of 748–756 angiosperms. Trends Ecol. Evol. 20, 591–597 16. Bekaert, P. et al. (2012) Metabolic and evolutionary costs of 40. Van de Peer, Y. et al. (2009) The flowering world: a tale of herbivory defense: systems biology of glucosinolate synthesis. duplications. Trends Plant Sci. 14, 680–688 – New Phytol. 196, 596 605 41. Ossowski, S. et al. (2010) The rate and molecular spectrum of 17. Gschwend, A.R. et al. (2012) Rapid divergence and expansion of spontaneous mutations in Arabidopsis thaliana. Science 327, the X chromosome in papaya. Proc. Natl. Acad. Sci. U.S.A. 109, 92–94 – 13716 13721 42. Guo, Y.L. et al. (2011) Evolution of the S-locus region in Arabi- 18. Lomax, B.H. et al. (2012) An experimental evaluation of the use of dopsis relatives. Plant Physiol. 157, 937–946 d13 d13 C3 C plant tissue as a proxy for the paleoatmospheric CO2 43. Koch, M. et al. (2008) Arabidopsis thaliana's wild relatives: an signature of air. Geochem. Geophys. Geosyst. Published online updated overview on systematics, taxonomy and evolution. Taxon September 20, 2012. http://dx.doi.org/10.1029/2012GC004174 57, 933–943 19. Wang, J. et al. (2012) Sequencing papaya X and Yh chromosomes 44. Barrett, S.C.G. et al. (2014) The demography and population reveals molecular basis of incipient sex chromosome evolution. genomics of evolutionary transitions to self-fertilization in plants. – Proc. Natl. Acad. Sci. U.S.A. 109, 13710 13715 Philos. Trans. R. Soc. Lond. B Biol. Sci. Published online August 5, 20. Hough, J. et al. (2013) Patterns of selection in plant genomes. 2014. http://dx.doi.org/10.1098/rstb.2013.0344 – Annu. Rev. Ecol. Evol. Syst. 44, 31 49 45. Ho, S.Y.W. et al. (2011) Time-dependent rates of molecular evo- 21. Yang, R. et al. (2013) The reference genome of the halophytic plant lution. Mol. Ecol. 20, 3087–3101 Eutrema salsugineum. Front. Plant Sci. 4, 46 46. Heads, M. (2011) Old taxa on young islands: a critique of the use of 22. Kagale, S. et al. (2014) Polyploid evolution of the Brassicaceae island age to date island-endemic clades and calibrate phyloge- during the Cenozoic Era. Plant Cell 26, 2777–2791 nies. Syst. Biol. 60, 204–218 23. Kagale, S. et al. (2014) The emerging biofuel crop Camelina sativa 47. Ho, S.Y.W. et al. (2015) Biogeographic calibrations for the molec- retains a highly undifferentiated hexaploid genome structure. Nat. ular clock. Biol. Lett. 11, 20150194 Commun. 23, 5706 48. Svenson, U. et al. (2012) Are Asteraceae 1.5 billion years old? A 24. Schlaeppi, K. et al. (2014) Quantitative divergence of the bacterial reply to Heads. Syst. Biol. 61, 522–532 root microbiota in Arabidopsis thaliana relatives. Proc. Natl. Acad. 49. Heads, M. (2005) Dating nodes on molecular phylogenies: a – Sci. U.S.A. 111, 585 592 critique of molecular biogeography. Cladistics 21, 62–78

Trends in Plant Science, Month Year, Vol. xx, No. yy 7 TRPLSC 1409 No. of Pages 8

€ 50. Ozüdog˘ ru, B. et al. (2015) Phylogenetic perspectives, diversifica- and an extreme bottleneck. Proc. Natl. Acad. Sci. U.S.A. 106, tion, and biogeographic implications of the eastern Mediterranean 5246–5251 – endemic genus Ricotia L. (Brassicaceae). Taxon 64, 727 740 67. Slotte, T. et al. (2013) The Capsella rubella genome and the 51. Banfield, L.M. et al. (2011) Evolution and biogeography of the flora genomic consequences of rapid mating system evolution. Nat. of the Socotra Archipelago (Yemen). In The Biology of Island Floras Genet. 45, 831–835 – (Bramwell, D., ed.), pp. 197 225, Cambridge University Press 68. Hameister, S. et al. (2009) Genetic differentiation and reproductive 52. Mandáková, T. et al. (2012) Whole-genome triplication and spe- isolation of a naturally occurring floral homeotic mutant within a cies radiation in the southern African tribe Heliophileae (Brassica- wild-type population of Capsella bursa-pastoris (Brassicaceae). ceae). Taxon 61, 989–1000 Mol. Ecol. 18, 2659–2667 53. Heenan, P.B. and McGlone, M.S. (2013) Evolution of New Zealand 69. Mau, M. et al. (2015) Hybrid apomicts trapped in the ecological Alpine and open-habitat plant species during the late Cenozoic. niches of their sexual ancestors. Proc. Natl. Acad. Sci. U.S.A. 112, N. Z. J. Ecol. 37, 105–113 E2357–E2365 54. Koch, M. et al. (2012) Systematics, taxonomy and biogeography 70. Ali, J.G. and Agrawal, A.A. (2012) Specialist versus generalist of three new Asian genera from the Brassicaceae, tribe Arabideae: insect herbivores and plant defense. Trends Plant Sci. 17, an ancient distribution circle around the Asian high mountains. 293–302 – Taxon 61, 955 969 71. Eschmann-Grupe, G. et al. (2004) Extent and structure of 55. Franzke, A. et al. (2004) Quaternary genome and areal splits of genetic variation in two colonising Diplotaxis species (Brassi- Eurasian steppe plants. Mol. Ecol. 13, 2789–2795 caceae) with contrasting breeding systems. Plant Syst. Evol. – 56. Koch, M. et al. (2006) Three times out of Asia Minor – the phy- 244, 31 43 logeography of Arabis alpina L. (Brassiaceae). Mol. Ecol. 15, 72. Busch, A. et al. (2012) Corolla monosymmetry, the evolution of a 825–839 morphological novelty in the Brassicaceae family. Mol. Biol. Evol. – 57. Al-Shehbaz, I.A. (2012) A generic and tribal synopsis of the Bras- 29, 1241 1254 sicaceae (Cruciferae). Taxon 61, 931–954 73. Graeber, K. et al. (2014) DELAY OF GERMINATION 1 mediates a 58. Kiefer, M. et al. (2013) BrassiBase: introduction to a novel knowl- conserved coat dormancy mechanism for the temperature- and edge database on Brassicaceae evolution. Plant Cell Physiol. 55, gibberellin-dependent control of seed germination. Proc. Natl. – e3 Acad. Sci. U.S.A. 111, E3571 E3580 59. Alonso-Blanco, C. et al. (2009) What has natural variation taught 74. Mühlhausen, A. et al. (2013) Evidence that an evolutionary transi- us about plant development, physiology, and adaptation? Plant tion from dehiscent to indehiscent fruits in Lepidium (Brassica- Cell 21, 1877–1896 ceae) was caused by a change in the control of valve margin identity genes. Plant J. 75, 824–835 60. Koornneef, M. and Meinke, D. (2010) The development of Arabi- dopsis as a model plant. Plant J. 61, 909–992 75. Wu, H.J. et al. (2012) Insights into salt tolerance from the genome of Thellungiella salsuginea. Proc. Natl. Acad. Sci. U.S.A. 109, 61. Liu, S. et al. (2014) The Brassica oleracea genome reveals the 12219–12224 asymmetrical evolution of polyploid genomes. Nat. Commun. 5, 3930 76. Mandáková, T. et al. (2010) Fast diploidization in close mesopo- lyploid relatives of Arabidopsis. Plant Cell 22, 2277–2290 62. Hanikenne, M. et al. (2008) Evolution of metal hyperaccumulation required cis-regulatory changes and copy number expansion of 77. Mandáková, T. et al. (2013) The more the merrier: recent HMA4. Nature 453, 391–394 hybridization and polyploidy in Cardamine. Plant Cell 25, 3280–3295 63. Hu, T.T. et al. (2011) The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat. Genet. 43, 476–481 78. Moghe, G.D. et al. (2014) Consequences of whole-genome tripli- cation as revealed by comparative genomic analyses of the wild 64. Wang, R. et al. (2009) PEP1 regulates perennial flowering in Arabis radish Raphanus raphanistrum and three other Brassicaceae alpina. Nature 459, 423–427 species. Plant Cell 26, 1925–1937 65. Vlad, D. et al. (2014) Leaf shape evolution through duplication, 79. Hofberger, J.A. et al. (2013) Whole genome and tandem duplicate regulatory diversification, and loss of a homeobox gene. Science retention facilitated glucosinolate pathway diversification in the 343, 780–783 mustard family. Genome. Biol. Evol. 5, 2155–2173 66. Guo, Y.L. et al. (2009) Recent speciation of Capsella rubella from Capsella grandiflora, associated with loss of self-incompatibility

8 Trends in Plant Science, Month Year, Vol. xx, No. yy