American Journal of Botany 97(5): 856–873. 2010.
PATTERNS AND CAUSES OF INCONGRUENCE BETWEEN PLASTID AND NUCLEAR SENECIONEAE (ASTERACEAE) PHYLOGENIES 1
Pieter B. Pelser 2,8 , Aaron H. Kennedy 3,4 , Eric J. Tepe 5 , Jacob B. Shidler 4 , Bertil Nordenstam 6 , Joachim W. Kadereit 7 , and Linda E. Watson 3
2 University of Canterbury, School of Biological Sciences, Private Bag 4800, Christchurch 8140 New Zealand; 3 Oklahoma State University, Department of Botany, 104 Life Science East, Stillwater, Oklahoma 74078 USA; 4 Miami University, Department of Botany, 316 Pearson Hall, Oxford, Ohio 45056 USA; 5 University of Cincinnati, Department of Biological Sciences, 614 Rieveschl Hall, Cincinnati, Ohio 45221 USA; 6 Swedish Museum of Natural History, Department of Phanerogamic Botany, P.O. Box 50007 SE-104 05 Stockholm, Sweden; and 7 Johannes Gutenberg-Universit ä t Mainz, Institut f ü r Spezielle Botanik und Botanischer Garten, 55099 Mainz Germany
One of the longstanding questions in phylogenetic systematics is how to address incongruence among phylogenies obtained from multiple markers and how to determine the causes. This study presents a detailed analysis of incongruent patterns between plastid and ITS/ETS phylogenies of Tribe Senecioneae (Asteraceae). This approach revealed widespread and strongly supported incongruence, which complicates conclusions about evolutionary relationships at all taxonomic levels. The patterns of incongru- ence that were resolved suggest that incomplete lineage sorting (ILS) and/or ancient hybridization are the most likely explana- tions. These phenomena are, however, extremely diffi cult to distinguish because they may result in similar phylogenetic patterns. We present a novel approach to evaluate whether ILS can be excluded as an explanation for incongruent patterns. This coales- cence-based method uses molecular dating estimates of the duration of the putative ILS events to determine if invoking ILS as an explanation for incongruence would require unrealistically high effective population sizes. For four of the incongruent patterns identifi ed within the Senecioneae, this approach indicates that ILS cannot be invoked to explain the observed incongruence. Alter- natively, these patterns are more realistically explained by ancient hybridization events.
Key words: ancient hybridization; deep coalescence; ETS; incomplete lineage sorting; incongruence; ITS; molecular dating; plastid sequences.
Since the onset of the use of DNA sequences for phylogeny tween phylogenies obtained from the analysis of different reconstruction, molecular systematics has experienced a steady genes, genic regions, and genomes ( Degnan and Rosenberg, increase in the number of DNA regions used to resolve evolu- 2009 ). Incongruence may, among others, indicate differences tionary relationships (D egnan and Rosenberg, 2009) . This de- in the evolutionary histories of the DNA regions employed velopment has signifi cantly contributed to our understanding (i.e., gene tree – species tree discordance), which could result of the evolutionary history of numerous lineages by clarifying from hybridization or incomplete lineage sorting (ILS; Doyle, relationships that were previously unresolved in studies using 1992 ; Maddison, 1997 ; Buckley et al., 2006 ; Liu and Pearl, fewer markers and less data (e.g., Kuzoff and Gasser, 2000 ; 2007 ). ILS is the failure of ancestral polymorphisms to track P ryer et al., 2004; P anero and Funk, 2008) . In addition, multi- speciation events accurately, which may result in incongru- gene studies have enabled a more detailed understanding of ence between gene trees and species trees. Unfortunately, ILS macroevolution by revealing congruence or incongruence be- can result in phylogenetic patterns similar to those observed for hybridization events ( Doyle, 1992 ; Seelanan et al., 1997 ; Holder et al., 2001 ; Buckley et al., 2006 ; Holland et al., 2008 ; 1 Manuscript received 21 September 2009; revision accepted 22 February Joly et al., 2009 ). Therefore ILS and hybridization are often 2010. diffi cult to distinguish. Furthermore, in the absence of an ef- The authors thank A. Aksoy, R. J. Bayer, R. Cairns-Wicks, P. Carillo fective methodology to distinguish between them ( Joly et al., Reyes, G. V. Cron, V. A. Funk, E. B. Knox, R. R. Kowal, A. Marticorena, 2009 ), there are few empirical studies that present a detailed J. L. Panero, T. Sultan Quedensley, T. F. Stuessy, and I. R. Thompson for assessment of incongruent patterns and even fewer in which providing DNA or tissue samples. They are grateful to the curators of B, the specifi c causes for these patterns were studied ( Wiens and BHCB, CANB, E, F, HYO, L, LPB, M, MEL, MJG, MO, MU, P, PRE, PREM, S, TENN, TEX, U, UC, US, WAG, WIS, WU, and XAL for Hollingsworth, 2000 ; Van der Niet and Linder, 2008 ; Morgan permission to include their Senecioneae specimens in their studies. R. H. et al., 2009 ). There is therefore an urgent need for research that Ree kindly provided his python script for coding indels. H. P. Comes, J. H. explores ways to examine incongruent patterns and to deter- Degnan, M. Fishbein, and A. Varsani are acknowledged for helpful mine the causes ( Buckley et al., 2006 ; Holland et al., 2008 ; discussions. P. C. Wood and the Miami University Center for Bioinformatics Degnan and Rosenberg, 2009 ). In this paper, we aim to con- and Functional Genomics provided technical support. The authors greatly tribute to the development of new approaches to the study of appreciate the time and efforts of two anonymous reviewers and their help incongruent phylogenetic patterns by documenting strongly to improve an earlier version of this manuscript. Funding for this project supported topological incongruence in tribe Senecioneae was provided by NSF grant DEB-0542238 to L.E.W. and P.B.P. 8 (Asteraceae) and using a new approach to establish whether Author for correspondence (e-mail: [email protected]) ILS can be excluded as an explanation for incongruent pat- doi:10.3732/ajb.0900287 terns. This coalescent-based method uses estimates of the
American Journal of Botany 97(5): 856–873, 2010; http://www.amjbot.org/ © 2010 Botanical Society of America 856 May 2010] Pelser et al.— Incongruence between Senecioneae phylogenies 857 duration of putative ILS events to determine if invoking ILS as herbarium specimens at B, BHCB, F, L, MO, MU, P, S, TENN, TEX, UC, US, an explanation for incongruence would require unrealistically WAG, and XAL, or from fi eld-collected leaf tissue preserved on silica gel. Phylogenetic analyses were performed with DNA sequences from eight re- high effective population sizes. gions: the ITS (ITS1, 5.8S, ITS2) and ETS of the nuclear genome, plus the The Senecioneae are one of the largest tribes in the Aster- ndhF gene, the trnL intron, and psbA - trnH , 5 and 3 trnK , and trnL-F intergenic aceae (ca. 3100 species and 155 genera) with an almost world- spacers of the plastid genome. To root tribe Senecioneae in subfamily Asteroi- wide distribution, and it exhibits remarkable morphological and deae, partial sequences of the plastid rbcL gene were obtained for representa- ecological diversity. Senecio is the largest genus in the tribe (ca. tives of the major Senecioneae lineages and the other Asteraceae tribes, in 1000 species) and is perceived as taxonomically diffi cult be- addition to the above markers. However, the relationship among outgroups is cause of its size and extensive morphological diversity. In a not discussed since this is not a goal of this paper. recent study, ITS sequence data were used to reconstruct a phy- DNA extraction, PCR amplifi cation, and sequencing — Total genomic logeny for the Senecioneae and to redefi ne a previously poly- DNA was extracted using the Qiagen DNeasy Plant Mini Kit (Qiagen, Valen- phyletic Senecio ( Pelser et al., 2007 ). This study included 186 cia, California, USA). PCR amplifi cation of the ITS, ndhF , psbA-trnH , 5 and Senecio species and 114 of the 150 genera recognized by 3 trnK , trnL , and trnL-F regions followed Pelser et al. (2002 , 2003 , 2007 ) or Nordenstam (2007) . Because the molecular phylogeny placed minor modifi cations thereof. The ETS region was partially amplifi ed with several lineages outside of core S enecio , a new circumscription primer AST1 ( Markos and Baldwin, 2001 ) or a modifi cation of primer ETS2 resulted with several taxa being transferred to other genera ( Bayer et al., 2002 ; 5 -CAA CTT CCA CCT GGC TTA CCT CC-3 ) as forward primers and 18S-ETS ( Baldwin and Markos, 1998 ) as the reverse primer. The within the tribe and others to be described as new genera ( Nor- rbcL gene was amplifi ed in two parts using forward primers 1F and a modifi ca- denstam et al., 2009c ; unpublished data). Furthermore, seven tion of primer 636F ( Fay et al., 1997 ; 5 -GCG TTG GAG AGA CCG TTT genera (A etheolaena , Cadiscus , Culcitium , Hasteola , Iocenes , CT-3 ), and reverse primers 1460R ( Savolainen et al., 2000 ) and a modifi cation Lasiocephalus , and Robinsonia ) were deeply embedded within of primer 724R ( Fay et al., 1997 ; 5 -TCG CAT GTA CCC GCA GTA GC-3 ). Senecio and thus, are being transferred into the genus to arrive PCR products were purifi ed with the Wizard SV Gel and PCR Clean-Up Sys- at a monophyletic delimitation (Pelser et al., 2007 ; Nordenstam tem (Promega, Madison, Wisconsin, USA). Cycle sequencing was carried out with the BigDye Terminator v.3.1 (Applied Biosystems, Foster City, Califor- et al., 2009b ; unpublished data). It is important to note that nia, USA) cycle sequencing kit. The fl uorescently labeled samples were run on while this new delimitation of Senecio is largely based on an an ABI 3130xl or 3730 automated sequencer at the Center for Bioinformatics ITS phylogeny, the major clades are also supported by plastid and Functional Genomics at Miami University (Oxford, Ohio, USA). The pro- sequence data composed of ndhF , the trnL intron, and the psbA- gram Sequencher v.4.8 (Gene Codes, Ann Arbor, Michigan, USA) was used for trnH , 5 and 3 trnK , and trnL-F intergenic spacers ( Pelser et al., trace fi le editing. 2007 ). In addition, the plastid data augmented the utility and effi cacy of the ITS for phylogeny reconstruction in the Sene- DNA sequence alignment and phylogeny reconstruction — DNA sequences were aligned with the program CLUSTALX v.1.83 ( Thompson et al., 1997 ) cioneae by supporting many of the resolved evolutionary rela- using the default penalty settings of the program. This alignment was edited tionships at the species, generic, and subtribal levels in the tribe. using the program Se-Al v.2.0a11 ( Rambaut, 1996 ) by excluding portions of Furthermore, many patterns of relationships in the ITS Sene- sequences that could not be unambiguously aligned (mostly of ITS and ETS cioneae phylogeny were corroborated by morphological, kary- accessions of tribes distantly related to the Senecioneae) and replacing those ological, and/or biogeographic data and supported the taxonomic with a missing data symbol. A Python script (Richard Ree, Field Museum, Chi- conclusions from previous Senecioneae studies (reviewed in cago, Illinois, USA) was used to code indels as binary characters using the simple indel coding method of Simmons and Ochoterena (2000). Pelser et al., 2007 ; Nordenstam et al., 2009a ). Although the For several species, sequences of multiple accessions were available (Ap- plastid data provided additional support for the ITS phylogeny pendix S1). In the fi rst round of heuristic searches performed under maximum with overall congruence between the two topologies observed, parsimony (MP, see below) for each of the DNA regions individually, all avail- some incongruence was present that prevented taxonomic con- able sequences were included. When multiple accessions of the same species clusions for some lineages ( Pelser et al., 2007 ). Because the formed a clade, a consensus sequence was generated in which polymorphisms plastid phylogeny was only based on a subset of the Sene- were coded as ambiguous nucleotide characters, and this consensus sequence was used for subsequent phylogenetic analyses ( Pelser et al., 2007 ). This strat- cioneae taxa that were sampled for the ITS analyses (73 vs. 614 egy was preferred over selecting one conspecifi c accession among those avail- species), it was not possible to suffi ciently resolve these pat- able to avoid subjective decisions in the process of selecting a single examplar terns for a detailed analysis of the extent and understanding of and to be able to include all available data that potentially contribute to the the underlying causes of the incongruence at the generic level. phylogeny reconstruction. In addition, this approach allowed for the inclusion In the current study, we therefore have signifi cantly increased of longer sequence reads for taxa of which individual accessions only resulted the taxon and marker sampling for the plastid and nuclear data in partial sequences. Comparisons to results of parsimony analyses did not re- veal decreased resolution when multiple accessions per species were included, and conducted extensive phylogenetic analyses. The goals were and analyses in which consensus sequences also were included placed these (1) to identify strongly supported relationships that are incon- among the individual accessions of the same taxon. gruent between trees produced with different data sets and (2) to The MP phylogeny was reconstructed with the program TNT 1.0 ( Goloboff explore possible causes of the incongruence. et al., 2008 ) using the Driven Search option with the default settings for Secto- rial Searches (RSS, CSS, and XSS), Ratchet, Tree Drifting, and Tree Fusing, 10 initial random addition sequences, terminating the search after fi nding mini- mum length trees fi ve times. Bootstrap support (BS; Felsenstein, 1985 ) was MATERIALS AND METHODS calculated with Poisson independent reweighting using 1000 replicates. Bayes- ian inference (BI) analyses were performed using the program MrBayes 3.1.2 Taxon and character sampling — The taxa selected for this study were cho- ( Huelsenbeck and Ronquist, 2001 ) on the Redhawk Cluster at Miami Univer- sen from the plastid and ITS phylogenies of Pelser et al. (2007 ) to represent the sity (EM64T Cluster with 128 dual nodes with 4 GB of memory and 150 GB of Senecioneae genera included in that study. In addition, 31 Senecioneae genera disk space per node). Prior to the BI analyses, the Akaike information criterion were added for which material was not previously available to us (Appendix (AIC) in the program MrModeltest 2.2 ( Nylander, 2004 ) was employed to se- S1; see Supplemental Data with the online version of this article) resulting in a lect nucleotide substitution models for each of the DNA regions. These analy- sampling of approximately 94% of all Senecioneae genera. A total of 27 genera ses selected the GTR+I+ model for each of the individual DNA regions and from 26 other Asteraceae tribes was included for outgroup comparisons (Ap- the combinations of DNA regions analyzed, which was therefore used in all BI pendix S1). DNA samples of representatives of the taxa not included in Pelser analyses. Indel characters were included as “restriction type ” data in the BI et al. (2007 ) were obtained from tissue samples taken with permission from analyses. These analyses were performed using two independent simultaneous 858 American Journal of Botany [Vol. 97
runs. The Markov chain Monte Carlo analyses (MCMC; Geyer, 1991 ) were run temperature setting of 0.0001), a user-defi ned starting tree was supplied using with up to 40 chains per analysis, temperature settings ranging between 0.0001 the MP 50% majority rule consensus topology of the combined plastid-ITS/ and 0.2, and one tree per 1000 generations saved. BI analyses were run until the ETS data set. Each chain was then initiated with a slightly perturbed version of average deviation of split frequencies between both simultaneous analyses this starting tree by setting the parameter nperts to 1. This approach resulted in reached a value below 0.01. The burn-in values were determined empirically convergence of the chains. from the likelihood values. Trees were visualized using the program FigTree v.1.2.2 ( Rambaut, 2009 ). ITS orthology/paralogy assessment — To determine if specimens possessed divergent ITS copies, potentially indicating that the incongruence is caused by Identifi cation and localization of incongruence — The incongruence length inaccurate assessment of orthology, we cloned and sequenced the ITS region difference test (ILD; Farris et al., 1995 ) is one of the most widely applied meth- for accessions of strongly incongruent lineages. Because ITS trees were not ods for assessing incongruence ( Darlu and Lecointre, 2002 ; Hipp et al., 2004 ). strongly incongruent with ETS trees and both regions are adjacent, the ETS Its use has, however, been criticized, because of a high false positive rate region was not cloned and sequenced. ITS PCR products were cloned using the ( Cunningham, 1997 ; Darlu and Lecointre, 2002 ; Hipp et al., 2004 ). Because al- TOPO TA Cloning Kit for Sequencing (Invitrogen, Carlsbad, California, USA). ternative methods for testing incongruence (e.g., the Templeton [ Templeton, Between 5 and 10 clones per accession were PCR-amplifi ed directly from 1983] , Kishino – Hasegawa [ Kishino and Hasegawa, 1989], and Shimodaira – plated culture with the manufacturer ’ s supplied M13 plasmid primers. PCR Hasegawa [ Shimodaira and Hasegawa, 1999] tests) may suffer from errors as products were cycle sequenced with M13 plasmid primers. Sequencing and well (Cunningham, 1997 ; Shimodaira and Hasegawa, 1999 ; Buckley et al., alignment followed the protocol outlined above. 2001 ; Shimodaira, 2002 ; Hipp et al., 2004 ), incongruence is probably best stud- ied using a combination of methods ( Hipp et al., 2004 ). In this study, we there- Long-branch attraction — Because long-branch attraction ( Felsenstein, fore explored patterns of phylogenetic incongruence using two approaches: the 1978 ) is a possible explanation for topological incongruence, MP and BI phy- ILD test and an assessment of incongruent patterns that are supported by high lograms were visually inspected for the presence of exceptionally long branches BS or BI values. To reduce the chance of false positives and following the associated with strongly incongruent lineages. A series of phylogenetic analy- recommendations of Cunningham (1997) , we considered only ILD P -values ses was subsequently carried out in which accessions with long branches were below 0.01 as evidence of signifi cant incongruence. individually excluded in turn. This was done to determine whether the exclu- Strongly supported incongruence is here defi ned as incongruent patterns that sion of these accessions resulted in changes in topology, which could indicate are supported by BS values 80% and/or posterior probabilities (PP) 0.95 as the presence of long-branch attraction. well as ILD values of P < 0.01. The ILD tests were conducted in PAUP* with 500 to 10 000 replicates and 1 to 100 random addition sequences, depending on the size and complexity of the data sets analyzed. Invariant characters were re- Incomplete lineage sorting — Topological incongruence caused by hybrid- moved from the data sets prior to performing ILD tests ( Cunningham, 1997 ). ization and ILS can be diffi cult to distinguish, because both phenomena may This methodology revealed strong incongruence between trees obtained from result in similar topological patterns. Coalescent theory, however, predicts that the plastid and ITS/ETS data sets, but not among the plastid markers or between ancestral polymorphisms are likely to coalesce within approximately 5N e gen- the ITS and ETS. Subsequent phylogenetic analyses were therefore focused on erations ( Ne being the effective population size; Rosenberg, 2003 ; Degnan and identifying lineages with strongly supported, but incongruent, topological Rosenberg, 2009 ) and that congruence between gene trees and species trees placements in trees obtained from the plastid vs. the ITS/ETS data sets. becomes highly probable. Information about generation times and estimates of Lineages with strongly supported, yet incongruent, phylogenetic placements the duration of an ILS event can therefore be used to calculate the minimum N e in the plastid vs. ITS/ETS trees were identifi ed using a novel two-step approach that must be assumed to explain incongruence due to ILS. If these calculations designed to examine complex patterns involving multiple incongruent lineages result in N e estimates that are much higher than observed in nature, then ILS can for which some also have internal incongruence. First, plastid and ITS/ETS be excluded, and hybridization is favored as the likely explanation for the ob- trees were visually compared to identify the largest mutually exclusive lineages served incongruence. that form a clade in all or some consensus trees. In this way, lineages that are To calculate Ne estimates, we estimated molecular dates using penalized resolved as closely related in both plastid and ITS/ETS trees were identifi ed. likelihood analyses performed in the program r8s 1.71 ( Sanderson, 2002 , 2003 ) Among these lineages are clades that are retrieved in all trees (e.g., Tussilagini- on the Redhawk Cluster at Miami University. Separate analyses were carried nae s.s.; Figs. 1, 2 ), but also those lineages that form a clade in some, but not all, out for the plastid and the ITS/ETS data sets using the topologies and branch trees. The latter category of lineages were only included when their taxa are in lengths obtained in the BI analyses. Some accessions were pruned from the in- close phylogenetic proximity (e.g., Lachanodes - S. thapsoides group; Figs. 1, put trees to meet the r8s requirement of nonzero branch lengths. Several cali- 2 ). Although Senecio s.s. is not resolved as monophyletic in either the plastid or bration points outside and within the Senecioneae were used based on previous the ITS/ETS trees, support for its nonmonophyly is low ( Figs. 1, 2 ), and it age estimates in Asteraceae ( Wikstr ö m et al., 2001; Kim et al., 2005 ; Hershkovitz comprises a single clade in trees obtained from a combined plastid-ITS/ETS et al., 2006 ), fossil evidence ( Graham, 1996 ), and inferred ages for islands data set (not shown). Senecio s.s. was therefore also identifi ed as one of the or archipelagos with endemic Senecioneae taxa ( Baker et al., 1967 ; McDougall major lineages present in both plastid and ITS/ETS trees. and Schmincke, 1976 ; McDougall et al., 1981 ; Stuessy et al., 1984 ; Cronk, Secondly, these identifi ed major lineages were then examined for the pres- 1987 ; Ancochea et al., 1990 ; Geldmacher et al., 2000 ; Table 1 ). The smoothing ence of strongly supported internal incongruence by evaluating branch support parameters for the penalized likelihood analyses were determined using cross- values and then subjecting them to ILD tests that only included the taxa of the validation tests. lineage under investigation. This method allowed for the identifi cation of In addition to estimating divergence dates using penalized likelihood, mo- strongly incongruent accessions and subclades within each of these major lin- lecular dating studies were performed using a Bayesian approach with the pro- eages without the confounding effects of unrelated incongruence within or gram BEAUti/BEAST v.1.4.7 ( Drummond and Rambaut, 2007 ) on the among other major lineages. In addition, branch support values and ILD tests Redhawk Cluster at Miami University. To compare the dating results of the were employed in a similar fashion to study the incongruence among the major BEAST analyses to those estimated by r8s, we also used the input trees from the lineages. Due to the large size of the data sets, one or two placeholder species r8s analyses as the starting trees in BEAST, and their topology was fi xed. The were included in the latter round of ILD tests to eliminate the confounding ef- BEAST analyses were performed with the GTR+I+ model, the uncorrelated fects of internal incongruence within the major lineages. relaxed lognormal clock, and the Yule tree prior. The calibration points used in After strongly incongruent lineages were identifi ed, MP and BI analyses of the r8s analysis were specifi ed using uniform distributions with an upper and a combined plastid-ITS/ETS data set were performed in which each taxon in a lower bound. Following the instructions in the BEAST manual, the weights of strongly incongruent lineage was included twice: once as a plastid-only acces- the operators for the treeModel were modifi ed to improve the effi ciency of the sion (ITS/ETS characters coded as missing) and once as an ITS/ETS-only ac- MCMC: the upDownOperator, uniformOperator on internalNodeHeights, nar- cession (plastid characters coded as missing; Pirie et al., 2008 ). This approach rowExchangeOperator, and subtreeSlideOperator were set at 115 and the wil- was used to resolve a generic level tree without the confounding effects of taxa sonBaldingOperator and the wideExchangeOperator were set at 23. Nine with strongly supported incongruence between the plastid and ITS/ETS parti- (plastid data) and 11 (ITS/ETS data) independent BEAST runs totaling tions and to examine their alternative phylogenetic placements relative to a 136 687 000 (plastid data) and 318 672 000 (ITS/ETS data) generations were backbone composed of lineages among which relationships are not strongly carried out to ensure that all parameters had an effective sampling size greater incongruent. Because BI analyses of this data set using various temperature than 200 (after a burnin of 10% was removed). Convergence was examined settings and numbers of chains did not reach convergence (40 chains per run, using the program Tracer v.1.4 (Rambaut and Drummond, 2007 ). May 2010] Pelser et al.— Incongruence between Senecioneae phylogenies 859
Table 1. Calibration points for r8s and BEAST molecular dating either data set. Again, the 95% HPD intervals of the BEAST analyses were analyses. used to account for some uncertainty in the data, and these age estimate ranges were extended with the r8s results when the r8s estimates fell outside Age of the ranges calculated with BEAST. Finally, the ranges of age estimates for constraint Age estimates from the onset and the end of the putative ILS event were used to calculate its Calibration point used (Myr) literature (Myr) References minimum and maximum duration. Using these estimates of putative ILS durations and information about Root (fi xed) 35 33 – 36.5 Wikstr ö m et al., 2001; generation times in the Senecioneae, the effective population sizes needed to Kim et al., 2005 ; explain incongruent patterns invoking ILS were calculated. Most Sene- Hershkovitz et al., 2006 cioneae species fl ower in their fi rst or second year, and generation times of MRCA of 24 – 38 24 – 38 Wikstr ö m et al., 2001; one or two years are characteristic for most lineages. To test for robustness Cichorioideae and Kim et al., 2005 ; of our conclusions to violations of this assumption, effective population Asteroideae Hershkovitz et al., 2006 sizes were also calculated using generation times of 5 and 10 yr. Senecioneae Anthemideae 23 Late Oligocene Graham, 1996 populations typically range between several dozen to a few thousand indi- (fossil pollen) viduals (e.g., W id é n and Andersson, 1993; Comes and Abbott, 1998 ; M ü ller- MRCA of Helianthus 13 – 19 13 – 19 Wikstr ö m et al., 2001; Sch ä rer and Fischer, 2001 ). N e values of 20 000 (plastid data) and 40 000 and Tagetes Kim et al., 2005 ; (ITS/ETS data) were therefore selected as conservative estimates of maxi- Hershkovitz et al., 2006 mum population sizes. These coalescent calculations were carried out for MRCA of Senecio 16 – 19 16 – 19 Kim et al., 2005 ; each of the strongly incongruent lineages, but were also performed for sce- and Blennosperma Hershkovitz et al., 2006 narios in which the confl icting phylogenetic positions of multiple taxa were Lachanodesa 14.5 14.5 Baker et al., 1967 ; assumed to be nonindependent (i.e., a single ILS event resulting in incongru- (age of St. Helena) Cronk, 1987 ent positions of multiple lineages). Because species-level sampling within Pladaroxylona 14.5 14.5 Baker et al., 1967 ; Senecio was limited (48/~1000 species) and accurate estimates of the dura- (age of St. Helena) Cronk, 1987 tion of putative ILS events could not be obtained, these calculations were not Pericallis auritaa 14.3 14.3 Geldmacher et al., performed for S enecio species. (age of Porto Santo) 2000 Pericallisa 14.3 14.3 McDougall and (age of Porto Santob ) Schmincke, 1976; Geldmacher et al., RESULTS 2000 Bethencourtiaa 11.6 11.6 Ancochea et al., 1990 Identifi cation and localization of incongruence — Although (age of Tenerifeb ) an ILD test comparing all fi ve plastid markers resulted in P = Lordhoweaa 6.9 6.9 McDougall et al., 1981 0.005, the consensus trees obtained from the individual markers (age of Lord Howe Island) did not reveal well-supported (BS values 80% and/or PP Robinsoniaa 4.2 4.2 Stuessy et al., 1984 0.95) incongruence. Sequence data from all plastid markers (age Juan Fern á ndez were therefore combined into a single data set, which was used Islands) for all subsequent analyses. The ITS and ETS trees (not pre- Robinsonia 2.4 2.4 Stuessy et al., 1984 sented) were mostly congruent, and incongruent clades were masafueraea (age of Masafuera) not well-supported. Further analyses were carried out with a a Island or archipelago endemics. combined ITS/ETS data set, even though an ILD test resulted in b Oldest island in distribution area. P = 0.004. The results of the MP and BI analyses of the com- bined plastid data and the combined ITS/ETS data sets are sum- marized in Table 2 and presented in Figs. 1 and 2 . The ILD test After obtaining age estimates of divergences, we calculated the duration for the plastid vs. the ITS/ETS data sets indicated signifi cant of each putative ILS event as follows. First, the estimated time of onset of an incongruence (P = 0.004), and a visual comparison of their to- ILS event was determined from both the plastid and ITS/ETS BEAST chro- pologies and branch support values revealed well-supported nograms. Although the plastid and ITS/ETS BEAST chronograms resulted incongruence ( Figs. 1, 2 ). Therefore, further studies of incon- in different age estimates for the onset of ILS events, we assume that these gruence within Senecioneae were confi ned to examining the estimates are of the same speciation event in the species tree. To approxi- mate the age of each speciation event that marks the start of an ILS event, we strongly confl icting topologies of the plastid vs. the ITS/ETS used the overlap in the 95% highest posterior density (HPD) intervals of the data sets. Discussion in this paper is limited to relationships plastid and ITS/ETS chronograms. For a few of the strongly incongruent within the Senecioneae only and does not include outgroup lineages, r8s estimates of the onset of putative ILD events were slightly out- relationships. side the ranges obtained from BEAST. In these cases, the ranges of the as- Although all analyses of plastid and ITS/ETS data resolved sumed time interval of the ILS onset calculated from the BEAST chronograms the core of Senecioneae (excluding Doronicum and Abrota- were extended with the r8s ages. Secondly, the ages of the most recent com- mon ancestor (MRCA) of each strongly incongruent lineage and its nonin- nella ) as monophyletic ( Figs. 1, 2 ; BS 100%, PP 0.99 and 1.00) congruent sister group were recorded from the plastid and ITS/ETS and supported Capelio as sister to the well-supported clade chronograms. These age estimates signify the end of a putative ILS event in composed of the remaining genera forming the Senecioneae
Table 2. Character information and clade support data. In the combined plastid-ITS/ETS data set strongly incongruent taxa were included as separate plastid and ITS/ETS accessions (see text).
No. of clades with No. of Gap Variable Informative Data set characters characters characters characters 50% BS 80% BS PP 0.5 PP 0.95
Plastid 12494 1293 3253 (26.0%) 1406 (11.3%) 125 (55.1%) 87 (38.3%) 194 (85.5%) 132 (58.2%) ITS/ETS 3303 914 1605 (48.6%) 1096 (33.2%) 116 (50.5%) 76 (33.0%) 195 (84.8%) 127 (55.2%) Combined plastid-ITS/ETS 15797 207 4861 (30.7%) 2505 (15.9%) 158 (56.8%) 100 (36.0%) 248 (89.2%) 180 (64.7%) 860 American Journal of Botany [Vol. 97
Fig. 1. Bayesian inference (BI) topology from the plastid data set. Bayesian consensus percentages (posterior probabilities 100) are placed above branches, bootstrap support above 50% below branches. Branch color indicates congruence with ITS/ETS BI tree ( Fig. 2 ): green: congruent; yellow: clade absent in ITS/ETS BI tree, but not contradicted; orange: clade incongruent, but not with posterior probabilities 0.95 in both trees; red: incongruent with posterior probabilities 0.95 in both trees. May 2010] Pelser et al.— Incongruence between Senecioneae phylogenies 861
Fig. 2. Bayesian inference (BI) topology from ITS/ETS data set. Bayesian consensus percentages (posterior probabilities 100) are placed above branches, bootstrap support above 50% below branches. Branch color indicates congruence with plastid BI tree ( Fig. 1 ): green: congruent; yellow: clade absent in plastid BI tree, but not contradicted; orange: clade incongruent, but not with posterior probabilities 0.95 in both trees; red: incongruent with posterior probabilities 0.95 in both trees. 862 American Journal of Botany [Vol. 97
(BS 100%, PP 0.99 and 1.00), relationships among many of the teria outlined in the Materials and Methods, we identifi ed the other taxa are obscured by complex patterns of topological in- following taxa and lineages as having strongly incongruent congruence. Among the 29 major lineages of Senecioneae that placements: C aputia , C ineraria , S teirodiscus , P haneroglossa , were identifi ed ( Table 3 ), only two exhibit strongly supported the Lamprocephalus - Oresbia clade, Lordhowea , the gy- internal incongruence: the New World 1 group and Senecio nuroids, Jacobaea , Packera , Senecio otites , Lomanthus , the ( Table 3 ). New World 2 group, S . fl avus , S. hispidissimus , S. psilocar- Within New World 1 group, incongruence is located in pus , S. squarrosus , S. cadiscus , and S. pinnatifolius . The MP clade 3 ( Figs. 1, 2 ) in which Lomanthus and the Caxamarca - and BI analyses of the combined plastid-ITS/ETS data set in Pseudogynoxys clade take incongruent positions ( Table 3 ). which the strongly incongruent lineages were included as sep- An ILD test of the data set that was solely composed of the arate plastid and ITS/ETS accessions resulted in trees that accessions of clade 3 indicated signifi cant incongruence be- generally have slightly higher support than the individual tween the plastid and ITS/ETS data ( P = 0.001). This confl ict plastid and ITS/ETS consensus trees ( Fig. 3 ; Table 2 ) . There is not signifi cant ( P = 0.256) if L omanthus is excluded, but are only a few topological differences between the MP 50% remains statistically signifi cant if only the C axamarca - Pseud- majority rule consensus tree (not shown) and the BI phylog- ogynoxys clade is excluded (P = 0.001). Similarly, in a series eny ( Fig. 3 ), and all of these differences are weakly supported. of MP and BI analyses of a data set in which only the acces- The phylogenetic positions of the plastid and ITS/ETS acces- sions of clade 3 were included, incongruence supported with sions of the strongly incongruent lineages in the combined high BS and PP values only disappeared when L omanthus was plastid-ITS/ETS trees ( Fig. 3 ) correspond well to those in the excluded (not shown). separate plastid and ITS/ETS trees ( Figs. 1, 2 ). Within Senecio , S. fl avus , S. cadiscus , S. pinnatifolius , S. hispidissimus , S. psilocarpus , and S. squarrosus are placed ITS orthology/paralogy assessment — Cloned ITS sequences in strongly supported incongruent positions ( Table 3 ; P = selected from 18 strongly incongruent Senecioneae lineages 0.001). Removing all six taxa from the data sets resulted in (Appendix S1) revealed a relatively high level of polymor- trees without high BS and PP values for incongruent patterns phism. Most specimens had between 3 and 24 (0.4– 3.2%) poly- (not shown) and an insignifi cant ILD value ( P = 0.06). When morphic nucleotide positions; however, the cloned sequences any fi ve of these six species were excluded, ILD tests indi- from Oresbia had higher levels (7.8%) with 59 polymorphic cated signifi cant incongruence (P = 0.001 – 0.007), and incon- nucleotide positions. Despite this high level of polymorphism, gruent patterns were supported with high support values (not sequences of the ITS clones obtained from the same speci- shown). men always formed well-supported clades in MP analyses In addition to documenting strongly supported internal (results not shown) and with the directly sequenced ITS PCR incongruence for two of the 29 major Senecioneae lineages, 13 products. others exhibit well-supported incongruence with regard to their phylogenetic positions relative to the 16 remaining lineages Long-branch attraction — Visual inspection of the plastid among which well-supported incongruence was not identifi ed and ITS/ETS trees revealed several species in the Tussilagini- ( Table 4 ). These 13 incongruent lineages are Caputia , New nae s.s. clade with relatively long branches: Blennosperma , World 2 group, Jacobaea , Lordhowea , gynuroids, Lampro- Crocidium , Ischnea , Robinsonecio , and the Arnoglossum - Bark- cephalus - Oresbia clade, Phaneroglossa , Cineraria , Steirodis- leyanthus - Yermo clade (Appendix S2). Although the relation- cus , Packera , Emilia - Bethencourtia group, Senecio otites , and ships among B lennosperma , Crocidium , and I schnea and Io . An ILD test in which placeholders for each of the 29 major between these and other genera in the Tussilagininae s.s. clade Senecioneae lineages were included indicated signifi cant in- are not affected by well-supported incongruence, the relation- congruence between the plastid and ITS/ETS data (Table 5 : test ship between Robinsonecio and the Arnoglossum - Barkleyan- 1; P = 0.001). Excluding the placeholders for the 13 lineages thus - Yermo clade shows incongruence that is well supported in that have well-supported incongruent phylogenetic positions the BI trees. In contrast to being more distantly related in the still resulted in signifi cant incongruence among the remaining ITS/ETS phylogenies ( Fig. 2 ), plastid data place Robinsonecio lineages ( Table 5 : test 2; P = 0.006). Only when Othonninae sister to the Arnoglossum - Barkleyanthus - Yermo clade ( Fig. 1 ). (i.e., Othonna ) was also excluded, an ILD test did not indicate Reciprocal exclusion of Robinsonecio and the Arnoglossum - signifi cant incongruence ( Table 5 : test 3; P = 0.052). Adding Barkleyanthus - Yermo clade (results not shown) in both the ITS/ placeholders for each of the incongruent major lineages indi- ETS and plastid data sets, however, does not change the rela- vidually to the nonsignifi cant incongruent data set of test 3 in- tionships in clade 4. troduced signifi cant incongruence for all of these groups ( Table Within Senecioninae, Emilia , Jacobaea , and Packera have 5 : tests 4 – 13) except for the Emilia - Bethencourtia group, Sene- branches that are much longer than other genera in this subtribe cio otites , and Io (tests 14 – 16). Although adding all three of (Appendix S2). These three genera form a clade in the ITS/ETS these placeholders (test 17), the Emilia - Bethencourtia group MP trees (not shown); however, Packera is placed more distant and Senecio otites (test 18), or Senecio otites and Io (test 19), to the other two genera in the ITS/ETS BI trees ( Fig. 2 ). Recip- resulted in signifi cant incongruence (Table 5 ), incongruence rocal exclusion of two of these three genera from MP analyses was insignifi cant ( P = 0.013) in an ILD test in which the Emilia - (results not presented) resulted in the phylogenetic positions of Bethencourtia group and Io were added to the insignifi cantly Emilia and Jacobaea remaining unchanged when the two other incongruent selection of placeholder species used in test 3 (test genera were excluded (resp. Jacobaea and Packera , and Emilia 20). Inspection of the BS and PP values in trees obtained from and Packera ). Emilia and Jacobaea were therefore not affected the data sets used in ILD tests 1 – 20 generally confi rmed these by long-branch attraction, but when both are excluded, Packera fi ndings (not shown). occupies a different position in the ITS/ETS trees. Because the In summary, after exploring incongruence within and results of the BI analyses of the ITS/ETS data agree with the among the major Senecioneae lineages and following the cri- plastid trees in indicating a close relationship between the New May 2010] Pelser et al.— Incongruence between Senecioneae phylogenies 863
y-
taxa
ca
hthites (ined.)
oseris - opappus ocephalus phylogenetic ophorbium aujasia Emilia Kleinia Capelio F Othonna Bolandia ephr S. viscosus Caucasalia assocephalum Caxamar T esia, Mikaniopsis S. deltoideus woodii Centr Arrhenec Caputia medle Placeholder Lampr Cr Dendr Daur incongruent ) 1 1 1 1 P ( n.a. n.a. n.a. n.a. 0.97 0.026 0.001 0.114 0.001 0.015 0.022 0.255 0.709 0.157 0.9998 ict ILD test study
to confl P Y Y Y Y N Y Y N N N N N N N N P n.a. n.a. n.a. n.a. S Support for internal N Y N Y N Y N N N N N N N N N B selected n.a. n.a. n.a. n.a.
S. were , s (BS
and form a ,
via S. vestitus S. and group clade (BS o , clade sister (BS 60%, ) - and sister to the , and are most closely ojark ermo major P Y Fig. 2 Pippenalia (BI: PP 0.97) - Xenophyllum glossum the amancalia (BS 54%, PP 0.98) Bethencourtia ed on a clade with clade (BS 75%, PP S. viscosus T , and gynoxys - , of , and stulosus, S. gayanus Robinsonecio clade, Arno yanthus S. ilicifolius acobaea
J S. fi b essea S. psilocarpus anecio 50%, PP 0.99) J Pseudo ermo sister to - , S. pinnatifolius < Ir Y erneria vadensis Psacalium - is placed basal to the core of and , Barkle - ITS/ETS trees ( - W sister to 50%, PP 1.00) , and sister to a clade composed of and sister to S. lastarrianus 0.97) (BS < S. ne sister to ea ca avus Placeholders anaetes sister to the s.s. (BS 100%, PP 1.00) PP yanthus osus ook horrhiza , and adr glossum ae Pippenalia them. 50%, S. triodon jar Arno to 1.00) Psacaliopsis PP 0.96) Roldana Barkle Psacalium Lomanthus Misbr < Caxamar Char 77%, PP 0.97) Senecio fl Senecio related to members of Clade 6 (BS 99%, PP 1.00) S. hispidissimus squarr (BS 91%, PP 1.00) S. cadiscus Dolic clade (BS Senecio lineatus (MP: 93%) or is unresolv Bethencourtia
, within (BS a ,
form
and 50%, - , < (BS
S. clade sister icting relationships within group patterns sister to S.
(BS illasenoria anecio )
V S. viscosus Ir sister to , and elanthophor glossum , are not most alamancalia ermo T and Confl (BS 55%, PP T Y , - S. viscosus Fig. 1 , and , , and Arno S. hollandii , and gynoxys species and form vadensis incongruent clade (BS 54%, PP 1.00) Psacaliopsis yanthus Bethencourtia S. gayanus cibarrigoa essea vadensis horrhiza the S. ne Digitacalia J S. psilocarpus Pseudo ermo sister to and , , S. pinnatifolius Senecio Pittocaulon s.s. and a member of clade Y of Gar Plastid trees ( - , is deeply nested within the Barkle sister to the S. ne - Dolic (BS 100%, PP 1.00) and , sister to a clade formed by and are members of clade 7 (BS 96%, and sister to ca avus anaetes sister to Senecio (BS 77%, PP 1.00) yanthus osus analyses adr glossum obaea Robinsonecio and to PP 0.98) Roldana Nelsonianthus (BS 71%, PP 1.00) 97%, PP 1.00) Dor 1.00) clade 6 (BS 89%, PP 1.00) S. hispidissimus squarr PP 1.00) S. triodon lastarrianus Senecio fl core of composed of S. cadiscus closely related to and clade 6, which instead are more closely related to other clade 7 with these (BS 96%, PP 1.00) Arno Robinsonecio Barkle Lomanthus Char Caxamar Caucasalia a clade (BS 91%, PP 1.00) Senecio lineatus 100%, PP 1.00) and Emilia Y Y Y Y N Y N N N Y Y Y Y Y N Y Y Y lineages n.a. trees ITS/ETS orms clade in ). Y Y Y Y N N Y Y Y Y Y Y Y Y Y N N Y F n.a. trees Plastid Senecioneae able 5
T ) major clade
clade the
- of
htites a Figs. 1, 2
esbia ec c ( Er amma Or - - cation oup gr
s.s. Senecio thapsoides gates - Senecio clade
Meso gre - Identifi
se d
ocephalus
orld 1 gr Bethencourtia orld 2 group gyne - W W hanodes 3.
assocephalum w patterns between them (see ussilagininae Capelio Cr Lampr Bolandia Caputia Lac Emilia Senecio group Senecio Stilpno group T Table 18. 5. 2. New 1. 3. Quadridentates 6. 17. Clade 5 within Major Senecioneae group 8. Brachyglottidinae 7. Ne 4. 9. Othonninae 11. Gynuroids 16. 10. 12. Synotoids 13. Madagascan group 15. 14. 864 American Journal of Botany [Vol. 97
World 2 group and Packera , Packera may very well be a mem-
ber of the New World 2 group.
taxa
a posterior bootstrap glossa aria er o k Io osenecio sodoma c odiscus Incomplete lineage sorting — Molecular dating analyses by dhowea a ericallis acobaea P J P using r8s yielded age estimates that were often older than ages Ciner Lor Steir Cher Senecio otites Phaner Dendr retrieved with BEAST ( Figs. 4 and 5 ) and sometimes outside Placeholder inference the 95% HPD interval of the BEAST estimates (Appendix supported S3). The 95% HPD intervals of the plastid and ITS/ETS anal-
) yses were often overlapping. Estimates of the duration of pu- P Bayesian ( n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. tative ILS events obtained with r8s are within or slightly ict ILD test patterns
with outside the ranges suggested by BEAST. Using these esti- confl P mates and assuming generation times of 1, 2, 5, and 10 yr, P n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a.
. coalescent analyses indicate that for four of the 10 strongly S Support for internal B n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. n.a. incongruent lineages that were examined (C aputia , the L am- supported (incongruent 0.95 procephalus -O resbia clade, the New World 2 generic group and Packera , and Jacobaea ) effective population sizes must clades study be assumed that are higher than the selected thresholds of this S. pinnatifolius N e = 20 000 (plastid data) and 40 000 (ITS/ETS data) to explain )
in their incongruent phylogenetic positions by ILS ( Table 6 ). If , and
Fig. 2 multiple strongly incongruent phylogenetic positions of line- used incongruent ages are assumed to be the result of single ILS events, even = higher effective population sizes of ancestral lineages are re- PP criteria quired (not shown). b S. cadiscus , the trees; ITS/ETS trees ( to osus DISCUSSION 80% or BI posterior probabilities
ITS/ETS Topological incongruence — One of the longstanding and in- S. squarr according , and creasingly prominent questions of phylogenetic systematics is how to address incongruence between phylogenies obtained
plastid from multiple data sets and how to determine its cause (e.g.,
in Sullivan, 1996 ; Wiens and Hollingsworth, 2000 ; Rokas et al., 2003 ; Holland et al., 2008 ; Degnan and Rosenberg, 2009 ). In- incongruence S. psilocarpus icting relationships within group 80%
, )