<<

Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 O:10.1080/10635150701324225 DOI: online 1076-836X / print 1063-5157 ISSN: ety eetn h s ftetr eo h species the below term the of an- use of the regardless extend organisms, We cestry. of assemblage named any Amato, 2004) and Amato, DeSalle and 2000; Russello al., 2004; et conserva- (Goldstein in biology issues tion to and application 1987) increasing field al., its the et impacts (Avise of phylogeography linchpin intraspecific the of is methods re- phylogenetic level accurately species using the and to below relationships Baum ability historical construct the 1988; Furthermore, Donoghue, 1995). and Shaw, Queiroz species phylogenetic (de the concept of versions some define using to ability species Sites our on in impinges (reviewed and 2004) methods Marshall, and analytical boundaries of species variety testing a using and delimiting pre- a for is it requisite because important termi- is groups of monophyletic phy- identification nal between Accurate boundary tokogeny). defined the and pinpoints clade logeny that the node test the (i.e., by to groups 2004) monophyletic Mort, reveal terminal to and and Crawford hypotheses 2001; evolutionary Leverich, 2003; population-level anal- and al., (Schaal phylogenetic et plants of Stuessy seed a bounds the been lower within has the ysis there press years, Graham to 2000; recent al., push In et Qiu 2000). Chaw 2000; 1999; Olmstead, al., and Donoghue, et Bowe and 1999; Mathews al., et 1993; 1966; al., through (Hennig, et made analysis Chase phylogenetic been of has application relationships the plant seed level Copyright Biol. Syst. ntdSae eateto giutr,Arclua eerhSrie ainlCne o eei eore Preservation, Resources Genetic for Center National Service, Research Agricultural Agriculture, of Department States United o h upsso hssuy edfie“aa as “taxa” define we study, this of purposes the For Tr mnospors oad nesadn higher understanding towards progress emendous 62:0–2,2007 56(2):302–320, itnusigTria oohltcGop rmRtclt aa Performance Taxa: Reticulate from Groups Monophyletic Terminal Distinguishing  c est adcp eie rmpicplcodnt aa hwdspro efrac otete-ae n network and tree-based the to against reticulate performance used analysis; coordinate superior when principal .] showed that, network; phylogenetic procedure data, reticulate parsimony; clustering a coordinate introgression; has [Hybridization; nonparametric principal tested. taxon existing from procedures monophyletic an derived a of that landscape (inferring signal application density error phylogenetic novel II a revealing a conflicting type by of present the probability introgression we complex, high Lastly, of a were topologically history). stages in was weighting resulted early methods flow successive the these gene during by reverse revealed when evolution and However, reticulate signal. NeighborNet of performance phylogenetic signature The analysis. phylogenies. conflicting ephemeral bootstrap as an interpretable parsimony identifying easily to of were worse capable that comparable performed topologies networks was spanning yield Minimum NeighborNet not methods. did of network and two methods of taxa sampled use tree-based between the most flow explored gene than we when alternative, justify phylogenetic an to delimiting difficult As or is bifur- possible. purposes, defining standard is conservation of of for purposes populations use the prioritizing the for for that groups or taxa monophyletic conclude species, reticulate We terminal before reconstruction. identify systems phylogeny to natural of methods in required tree-based methods Based be cating tree-based simulation. may using events the detected flow of gene reliably meth- phases of tree-based be thousands some Bifurcating will many performance. during that relative estimate appearing assessing we topologies gradual model, for a incorrect the our metric that on used supported, a We highly hypothesis as hybridization. with null introgressive flow poorly, of gene the performed stages continued of ods with early rejection error the erroneous I during phenetic, type true (the likely of of were was error erosion monophyletic) that performance I not taxa relative type is and examined, interest the history of methods reticulate evaluated taxon all known we In with evolu- sets data, groups. taxa data an between simulated monophyletic to in distinguishing terminal applied Using reticulation for been cause nature. approaches frequently to in network have hybridization and evolution hybridize via tree-based, of to model flow bifurcating known gene strictly for taxa a potential containing on obvious based the methods analytical of tree, spite tionary In groups. other and plants Abstract.— oit fSseai Biologists Systematic of Society yrdzto sawl-ouetd aua hnmnnta scmo tlwtxnmclvl ntehigher the in levels taxonomic low at common is that phenomenon natural well-documented, a is Hybridization fPeei,Te-ae,adNtokProcedures Network and Tree-Based, Phenetic, of otClisClrd 02,UA -al rcadlmrclsaeeu(C.M.R.) [email protected] E-mail: USA; 80521, Collins,Colorado Fort P ATRICK .R A. EE AND EEVES 302 C HRISTOPHER oaaadCadl,20) ait falternative of variety A 2001). Crandall, therein; and citations and Posada 2000, Legendre, Nixon, 1995; and Doyle, Davis 1990; 1992; Wheeler, and Nixon 1990; Dade, Mc- 1979; Wanntorp, and Bremer 1975; been (Sneath, has criticized processes evolutionary reticulate sets from data derived to methods”) (“tree-based trees bifurcating and phylo- reticulate of relationships. production genetic the is taxa gene between of flow consequence One taxa obvious. between is populations alleles weak; as of such be movement to for believed potential generally the level, thus, are species flow nature the gene to in Below barriers 1997). occurs between Arnold, flow 1959; gene hybridization Knobloch, (Stebbins, interspecific 1971; that that (Grant, and known possible 1972) is is species it plant many 1996), al., et taxa. from different individuals two involves that de- reproduction and herein as fined is Goldstein “Hybridization” 1990; others). and Nixon, 2000; and DeSalle, Wheeler its by of argued spite (as in intent original usage, (1966) Hennig’s common from be- deviation the purported We become ancestor. has common this single that lieve a organisms from of descended group and a has Queiroz indicate de to (1988), by the termi- Donoghue expanded use We as as 1988). “monophyletic,” use Donoghue, term and for Queiroz appropriate (de taxa” units nal are inclusive “populations least because the populations, include to level h plcto fmtosta rdc hierarchical produce that methods of application The (Ellstrand debated been has ubiquity its Although .R M. ICHARDS 1111 ot ao Street, Mason South Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 nsm ae fitrpcfi yrdzto n many and hybridization interspecific of cases (e.g., weak some are hybrids in reproduction F1 to barriers over biological predominate when back- will complex individuals However, crossed individuals. outnum- backcrossed may hybrids ber F1 cases hybridization) some in interspecific (e.g., of taxa between flow gene prevent to ate tree. resulting include the (which in taxa individuals, individuals), reticulate) backcrossed (or F1 hybrid of to fate opposed the as consider undesirable and only 2003); they them al., (2) et make (Scotland that reconstruction phylogeny features for other Ellstrand, and and (Rieseberg 1993), hybrids in inheritance of terns pat- unpredictable nonindependence, to prone characters, are morphological which on relied they (1) termi- because his- tory reticulate distinguish a with to taxa from is lineages monophyletic goal nal the provide if studies guidance un- were these incomplete present phenomenon, topology our the tree of to overall derstanding essential on Although effect predictable. its less and hybrid tree the of the par- position the derived in time, most the of the remainder containing The time) clade ent. the the to that of tree basal most-parsimonious (65% was the in often position were a at hybrids resolved F1 charac- artificial morphological ters, considering when 1997) that, 1992, showed (1990, McDade by studies 1985). empirical Funk, Detailed 1980; Platnick, and Nelson and 1979; Wanntorp, (Bremer analysis cladistic tree-based, on bridization solution. satisfactory a methods offer alternative analysis whether of test arguably bias to may (2) an and analysis which conclusions, tree-based in of manner application de- and illegitimate to degree (1) the important therefore termine is It concerns. clearly articulated numerous, despite of sets analysis data the reticulate for potentially used routinely be to tree-based continue Moreover, methods sets. data simulated only known considered nonreticulate but methods with correct network per- some sets the of reveal examined formance (2005) data to al. et to Cassens ability history. reticulate applied their when for relationships evaluated been not networks. using routine from homoplasy signal is reticulate It true distinguish sets. hy- to data difficult as all by virtually caused such in be present processes homoplasy, also routine may reticulate but to recombination or Conflicting due bridization set. be data may underlying signal the phy- conflicting in of signal consequence a within logenetic as Cycles arise set. can data network single a a acceptable of for set solutions (or result the alternative summarize a cycles to are used graphs algorithm contain these the of in that present cycles graphs The “loops”). produce 1994; can 1990; and Legendre Smouse, methods 2000; network (Hein, methods, Xu, and tree-based 2000; Unlike 2002). al., Makarenkov, Excoffier data et Bandelt 1992; 1997; such be Fitch, al., to et analyzing claim Templeton that for developed phyloge- been appropriate have “network” methods) termed netic (collectively procedures 2007 ncssweepszgtcioaigmcaim oper- mechanisms isolating postzygotic where cases In hy- of consequences the explored have studies Past have methods network most promise, their of spite In R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES n hlgntcsga.Dpnigo admgenetic random on conflict- Depending in signal. resulting a phylogenetic taxon, from recipient ing states a into character taxon of donor introgression by followed may be unidi- hybridization taxa, of two case between simplified flow gene the rectional In hybrids). F1 states predictable with readily character not (unlike is of taxa level such distribution of species individuals the among the because at below studies spe- and pose empirical to for likely problems are cial taxa Reticulate understood. not well is reconstruction phylogeny on sequences) or viduals sequence was recombinant sequences single non-recombinant of a considered. tree only larger a of within fate be- the populations natural cause to Crandall extended easily and not Posada are reconstruc- of (2002) phylogeny conclusions of the However, accuracy tion. the less be impact to to character- shown likely with were individuals Sequences back-crossed tree. of true istics the with to reconstruction, relative nonhybrid tree taxa among is erroneous sequence rearrangements analyses topological in the frequent phylogenetic result of in to parent) 50% likely that each hybrids with to suggest F1 sequence belonging of to characteristics recombinant interpreted with in (a taxa be of purpose may inclusion different the study a the par- with each mind, conceived by proportion Although contributed the sequence relation-ent. recombinant and topological the sequences the of “parent” on between dependent in ship is sequences recombinant analysis including as an of well impact as the showed They that hybrids, analyzed. F1 were individuals, of back-crossed characteristics se- with recombinant when quences methods analytical tree-based of recon- the of of accuracy effect phylogeny. the the structed on consider however, sequences not, recombinant with did not They shape, were sequences present. tree recombinant branches when found in internal were shorter than change and branches predictable terminal longer a and analy- caused phylogenetic distance- ses (ML)-based in the likelihood sequences that maximum recombinant found (2000) of reticu- Hein inclusion with and individual Schierup an ancestry. in late many scored if were expected loci be re- would unlinked that assem- single states chimeric character a the of blage because mimics sequence relevant DNA are combinant studies These Crandall, 2002). and Posada 2000; Hein, and (Schierup phylogenetic analyses in sequences DNA recombinant simulated tree- of applicability data. the reticulate to verifying methods based of only is component reconstruction phylogeny one on rare hybrids F1 be the of understanding often effect individuals, will back-crossed hybrids genotypes to F1 relative migrant, Because to 1950). opposed selection (Stebbins, as be because resident, will and favor hybrid population may F1 local rare the a of for members mates the potential because of is This majority hybridization). intraspecific of cases h feto eiuaetx a poe osnl indi- single to opposed (as taxa reticulate of effect The accuracy the examined (2002) Crandall and Posada Two tde aecniee h feto including of effect the considered have studies 303 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 hlgntcsga ndt esdrvdfo reticulate from derived processes. sets historical data in signal 1998), phylogenetic (Trueman, (RSW) atr weighting and methods successive network we two reverse addition, of In performance structure. the tree-like do evaluate a that fit tree- methods data phenetic assume of two not as assessment well an as methods include based We introgres- of hybridization. process continuous sive a during sampled individuals be back-crossed to complex data multiple, multilocus allows a that simulating pre- employ for a We model with reticulation. population-level historical simulated of sets pattern data defined in taxa reticulate from groups monophyletic terminal distinguish to procedures inadequate of artifact an analysis are an that sampling. in conclusions included avoid be to must many taxon that per suggests individuals levels interindividual signal conflicting of in presence variation The indicate reticulation. to signal historical conflicting on rely that problem- methods for potentially among atic is taxon levels single signal a recip- within conflicting individuals the in in Variation individuals taxon. among ient observed donor be pro- the to will the traceable taxon are in that states variation character of states, portion of character fixation and introgressed event the intermediate hybridization the the between during in period Furthermore, present signif- flow. of that gene with spite in icant taxa than identify, to making less difficult potentially histories be reticulate individual, will hybrid F1 taxon individuals an recipient in present a signal of the expected Accordingly, conflicting are taxon. be of recipient that would the magnitude taxon individual in fixed hybrid donor become F1 the to the from in number states small found character a only the selection, of of strength the and drift 304 n rsl,19) h ubro usiuin e site per substitutions of (Rambaut number Seq-Gen The 1997). using Grassly, and sets data 1000-character four the for a history taxon. reticulate within a recipient infer flow to gene ability the of affected direction tree the the whether used between examine was direction, to occurred opposite the in flow but Sim2, gene as taxa same which in between relationship Sim4, topological them. the and hybridization ge- distance of explored netic terms in Sim3 both taxa, and divergent increasingly Sim2 between and taxa between taxa hybridization sister modeled between Sim1 flow considered. gene were distinct unidirectional topologically for Four alternatives evaluated. gene be in involved for can directly tree exchange those than simplest five- other the A taxa relationships topological is between 1a). on flow it gene (Fig. of because effect point the used which starting was the tree as taxon used was D),E) nti td,w vlaeteaiiyo i analytical six of ability the evaluate we study, this In ietxnsatn re eecetdb simulating by created were trees starting Five-taxon ((((A,B),C), form the of tree five-taxon unrooted An ebsdpoeueitne oietf conflicting identify to intended procedure ee-based eeaino iuae aaSets Data Simulated of Generation M TRASAND ATERIALS M ETHODS SYSTEMATIC BI osnu falms asmnosteswsfound was strict PAUP trees a a procedure, parsimonious to tree” most subsequent “best all the of For consensus data. an- to the used alyze were procedures parsimony-based Two als. (in- 1b): (Fig. mating follows The as generations. random proceeded simulation nonoverlapping with fully and selfing), model size, cluding Wright-Fisher population at a constant (available followed authors the Reproduction by written http://lamar.colostate.edu/∼reevesp/Hybridize.html). program a starting the using replicating by 10 to Seq-Gen. from set derived was haplotypes taxon each (N) size of population The drift reproduction. genetic simulated and during haploid, recombination as to rather subject but loci, sequence unlinked DNA linked subsequently a as not treated were characters the evolution, se- data. DNA quence describe binary to developed contained was model matrices this Although al- resulting states the character so two (Jukes only lowed model model with but Jukes-Cantor substitution 1969), the The Cantor, and to branches. equivalent all was for used 0.1 to set was e fdt essmltd(siae optto time computation (estimated simulated = sets data of num- large ber the given methods tractable ML computationally not differences. were character mean cal- were using distances culated Genetic reconstruction. tree for used htteniho-onn N)agrtmi PAUP in algorithm except (NJ) analyses neighbor-joining parsimony the the that as procedures same using per- the performed were were sacrificing analyses replicates Distance-based without formed. bootstrap searches hundred expedite were Five to accuracy. (2000) order Olmstead in and used Debry of settings the “TBR-M” procedure, “bootstrap” the strategy, For repli- on. MULTREES search addition cates, taxon random heuristic 100 swapping, settings: branch TBR following the used .Rpa tp o5s htfierpiaern r per- are runs replicate five that so 5 to 1 steps Repeat that 6. so times 199 additional an 4 to 1 the steps within Repeat mating 5. random of generations 10 Allow 4. both choosing by progeny additional nine Produce 3. selecting, randomly by haplotype progeny a Produce 2. taxon recipient from haplotype one choose Randomly 1. l aast otie v aaad5 individu- 50 and taxa five contained sets data All simulated were drift genetic and Hybridization 38dy)adwr o used. not were and days) 1388 LG VOL. OLOGY omdfrec ftefu trigdt esfrall for sets data flow. starting gene four of the models topological of each for formed are run. hybridiza- generations) per consecutive total acquired 2200 200 during to events (corresponding tion sets data 200 analysis. for set data acquire then taxon, recipient taxon. 2. recipient step in the as haplotypes within progeny from Assemble random at parents 2. Parent either or in 1 present Parent state character taxon the character, donor each for the with hybridized be 2). (Parent to 1) (Parent nlsso iuae aaSets Data Simulated of Analysis ∗ Sofr,19)sac that search 1999) (Swofford, ∗ was 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 u re u fapsil 0( trigdt sets data starting (4 20 possible a consen- proce- of of tree” number out the “best trees as the sus support measured for was bootstrap support clade; percent dure, each the sup- with procedure, as associated “bootstrap” measured the was For port analyses. the ing fsmltdrtclt vlto.Hbiiain hc rdcdasnl 1idvda ntercpettxn a olwdb 0generations 10 by followed was sampled. Model taxon, (b) were recipient taxon. sets the recipient data in to which individual donor after from F1 taxa, flow single gene within a of mating produced direction random which indicate of Arrows Hybridization, internodes. evolution. along reticulate shown simulated is of sets data starting for probability tion 2007 ,C ,E B E C D E C D E D and CD, BE, BD, BC, AE, PAUP AD, from AC, extracted DE, was AB, CE E, D, C, B, n rat 06.Gntcdsacswr acltdus- calculated were distances Genetic 2006). Bryant, Split and 2000). (Huson 4.4b16 SplitsTree al., using et calculated were (Schneider networks cal- 2.0 were Arlequin 1994) using Smouse, culated and Excoffier (MSN; networks elct usprdt e)ta eovdtecaeof clade the resolved that set) data interest. per runs replicate o asmn n Jaaye,spotfrcae A, clades for support analyses, NJ and parsimony For F Two IGURE ewr ehd eeue.Mnmmspanning Minimum used. were methods network .()Tplgclmdl fuiietoa eeflwsmltdwti v-ao re.Ec ao nldd1 niiul.Substitu- individuals. 10 included taxon Each trees. five-taxon within simulated flow gene unidirectional of models Topological (a) 1. R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES ∗ o lsgnrtddur- generated files log × 5 aaercmdlcutrn POM) o h F the For (PCO-MC). clustering modal parametric ss nydnradrcpettx eecniee and considered were taxa recipient and donor only ysis, F named is AB|CDE split possi- (e.g., two AB). names the taxon of of smaller subsets the ble CE only and using nontriv- splits CD, simplicity, here the BE, For named and are BD, analysis. E BC, further AE, for and AD, collected D, were AC, C, DE, B, AB, values splits A, support ial splits and trivial set, the data for per repli- performed bootstrap were hundred cates One settings. splits transformation “EqualAngle” and 2004) Moulton, were and (Bryant transforma- networks tion distances Split “NeighborNet” with method. constructed “UncorrectedP” the ing st Two n rnia oriaeaayi olwdb non- by followed analysis coordinate principal and , oheacia ttsia rcdrswr used: were procedures statistical nonhierarchical st anal- 305 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 a sd S11i ELsrp htfre PAUP forces ap- that script tree-based PERL a a is RSW1.1 using used. 2002) sets Gibbs, was and (Trueman data RSW1.1 program simulated the proach, the in nal θ 306 dnie oflcigsga sfollows: as RSW signal (1998). conflicting Trueman identifies of procedure RSW the implement o vrg,fu lseswudb on nol one only sets). in data starting found simulated be 200 would of out time clusters the four of 99.5% average, sets (on data correct starting the PCO- in predict clusters the to of number expected R because be case, conservative would procedure this is MC (in analyses value for minimum 0.0401) extreme this Using as kernel approximated uniform was density) spherical follows: cluster the estimate to of used de- dimensions (which the R parameter fines options: smoothing value fixed-radius following The the the for CASCADE=1. and in 1993) METHOD=6; MODECLUS Kuo, STANDARD; PROC and using (Sarle The inferred SAS coor- was coordinates. principal data the principal within dinate identified three used clusters first of subsequently number mod- the were analy- program compute EIGEN The for to and sets. used data DCENTER commonly marker ules dominant and are of (Landry and sis data be 1996) multi-locus to argued LaPointe, dis- binary been Jaccard have for they appropriate coordinates, because affect used principal will were tances of coefficient distance calculation of the choice the the dis- using Although Software). (1908) (Exeter calculated NTSYS (Lewis of Jaccard SIMQUAL were module GDA program PCO-MC, haplotypes application between For the tances 2002). using Zaykin, data and haploid for .Asml aasto hrcesta r inconsistent are that characters of set data data (overall) a Assemble complete 2. the for tree optimal an Find 1. .Cnld htoealdt e otiscnitn sig- conflicting contains set data overall that Conclude step in 4. assembled set data the for tree optimal an Find 3. .Fn h mletvlei h ito 0 ausob- values R 200 of list the in value smallest the Find 4. PROC using 2 step from data PCO Analyze 3. 200 the for coordinates principal three first the Obtain 2. sets data starting terminal) (50 five-taxon 200 Simulate 1. p To (CI set. a hntesfo tp3aesgicnl different significantly tree. are 1 3 step step from from trees when nal 2. andi tp3 hsi h au o sdi this in used R for value the is This study. 3. step in tained set. data starting given a inference in correct clusters the five smaller cause of are value that minimum R this of than Values present. which are PCO at clusters R each for four For value minimum values. the R determine set, of data range a and MODECLUS sets. data above. described as (sensu xmn h ee fcnitn hlgntcsig- phylogenetic conflicting of level the examine < )wt refo tp1. step from tree with 1) eradCcehm 94 a calculated was 1984) Cockerham, and Weir SYSTEMATIC BI ∗ to = esdt esgicnl ogr(sn one-tailed individuals a (using taxon longer significantly recipient be and to ceased taxon be- branches be donor of the length to mean tween defined the when was point a history at ap- inferred Reticulate the nontopological used: in a was apparent so proach not networks Topological was spanning used. history minimum was reticulate value, of higher a evidence to return not did J“ette, J“otta, F “bootstrap,” NJ tree,” “bootstrap,” “best ap- parsimony NJ each tree,” using “best analyzed (parsimony were proach flow) gene of models ical with lsescagdfo v ofu wt R (with four inferred to of five number from values. the changed higher which clusters at to point return the PCO-MC, subsequently For not did and 0.95 eeflw nttl 600dt es(0 aast per sets data (200 sets data persistent 16,000 by run total, caused error In I flow. type of gene erosion of measure rate a the therefore of is TIRH met. were criteria be- cutoff required fore events hybridization of number the as sured value. higher a and to 95% return below not dropped did taxon recipient split the trivial containing the for value support bootstrap the inferredwhenever For was history network. one reticulate spanning method, from NeighborNet minimum the distinct the statistically on recip- longer based and donor another no which were at taxa point the ient this identify of to effect was The test taxon. recipient the within individuals hc yeIerr(h roeu eeto ftenull the of rejection erroneous (the H error hypothesis, I type which eedtrie yasnl ersi asmn search parsimony PAUP heuristic using partition single data a by each determined for were topologies optimal “bootstrap” and aforementioned settings, the using performed RSW1.1 by were conducted searches at Bootstrap clades points. all time for all measured be could sec- support levels way, conflicting “significant” this of In find sets. data to simulated all forced in signal to was ondary set RSW1.1 was that RSW1.1 For so in 3. 1% value and percent 1 cutoff steps the from study, trees this percent”), bootstrap “cutoff the in (the present support are bootstrap min- of user-specified level some by imum supported each differ- clades, when ent found is difference “significant” a 4, step For eiuaehsoywsifre hnF when inferred was history reticulate h eiin ao rpe eo 5.FrF For 95%. reticu- below dropped for searches, taxon value recipient support the “bootstrap” the whenever donor For inferred was the history versa. within late vice nested or found taxon, was members more one or whenever taxon recipient the for replicate. reticulate inferred a was given history that a meant criterion for this tree adopting found best practice, In the longer in no monophyletic was be taxon to recipient be the to defined when was inferred history reticulate searches, the tree” For “best inferred. be could history reticulate a that was claimed it which beyond procedure, estab- analytical each was for criterion lished cutoff a occurred, ) lutionary h iet nerdrtclt itr TR)wsmea- was (TIRH) history reticulate inferred to time The nodrt dniyrgosin regions identify to order error.—In I type of Erosion LG VOL. OLOGY × α 5r ≤ piaeruns eplicate .5 hntema egho rnhsamong branches of length mean the than 0.05) ∗ defaults. 0 h eiin ao sntadsic evo- distinct a not is taxon recipient the : vlaino Performance of Evaluation × trigdt sets data starting 4 st C-C S,and MSN, PCO-MC, , st rpe below dropped = × .41 and 0.0401) st topolog- 4 analyses, t -test 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 ro)cudte ecluae o ahtm on (see point time formulas). each for for 1 calculated I Appendix be type to then (equivalent could history error) probability reticulate the of inference and no error), of II a type had i.e., taxon history, probability monophyletic reticulate a the that (determining taxon), failure reticulate of the (cor- signal success identifying conflicting of rectly probability on The based determined. history was levels) reticulate a suggested was have monophyletic (i.e., be to avail- to inferences sets known incorrect taxon data a of when 20 number reticu- the the of for Similarly, inferences able. determined correct was of sampled history and number late percent the cutoff point, each time For found. been to have considered was signal conflicting per- “significant” “cutoff cents,” for these support exceeded splits 85%, bootstrap or When clades 90%, contradictory 50%). 95%, 60%, 65%, (99%, 70%, values 75%, cutoff support bootstrap history. reticulate a to had known monophyletic taxon be a that suggested occurred signal error hy- II conflicting type when null and rejected), the falsely (causing be to present be pothesis not then that was of would evidence hence history such history, and reticulate when reticulate occurred C, error a taxon I Type to have supported. was may it closely A as as taxon was B A taxon taxon to that related conclusion The A. signal then taxon conflicting support, for substantial of bootstrap evidence 98% be would had there AC split and bootstrap 95% support had AB split phylo- if example, conflicting For signal. of genetic magnitude data the RSW indicate con- secondary and partitions) for overall (or the relative splits in clades The contradictory tradictory two collected. for likewise values were support partitions analyses signal corresponding RSW secondary and the of overall for the from values clades simulation Support the collected. across previously were described error splits non-trivial the sup- II for bootstrap values NeighborNet, port type For used. and was course, I time cri- type cutoff anal- of single sensitivity ysis involving a approach, using different reticulation evaluated A of terion. fairly evidence be this However, not set. could data a in sig- conflicting nal exposing by reticulation indicate also can by caused TIRH in differences factors. significant of these variance for analysis the Nested test apportion model. and to used the was into (ANOVA) recombina- built variance of drift processes and stochastic the tion from starting or the flow, set, gene of data model topological could the runs to due between be observed TIRH proce- in given differences a For dure, met. the been had whether criterion cutoff determine relevant to examined and NeighborNet) 2007 nlssi umrzduigrslsfo h parsi- the from results using summarized is analyses oflcigsga a vlae sn different 9 using evaluated was signal Conflicting RSW and signal.—NeighborNet conflicting of Evidence h feto eeflwbtentx ntree-based on taxa between flow gene of effect The oooia osqecso eiuaeEvolution Reticulate of Consequences Topological nTe-ae Analyses Tree-Based on R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES R ESULTS enlbace eevdhg otta upr Fg 4c, (Fig. inset). support bootstrap in- high began), received topological flow branches gene ternal of after soon period internal occurred taxa, (which the to change recipient During subsequent and changes. gradually, donor topological eroded sup- of that average, monophyly shows on the struc- 4 tree for Figure the simulations. port for the support during statistical ture determine to sets within structure hierarchical caused false A. D taxon of taxon donor appearance intro- from A Characters the taxon 2. recipient Figure to for duced described taken points run time Sim3 3 representative at single Figure a above. from trees described pro- shows ordered phases an the showed through all gression runs, replicate between ied sets. data replicate the of any monophyletic in distinct groups as identified longer no were D and sta- (t final, reached the was which at topology point ble the shows 2f taxa Figure D. from and individuals A all un- a containing an of showed polytomy and runs majority resolved 20 A, the the from point, trees clade consensus this strict within After the nested 2f. Figure D like nested clade topology A D, recipi- clade clade included and within topologies donor Alternate distinct taxa. ent retained tree majority-rule recipient and donor taxa. sister because were Sim1 in occurred changes (t (t Sim3 Sim2 for point for this than reach hybridization to of required number was larger events slightly A found Sim3. changes during topological internal simula- 20 all all in completing D tions, t clade to By sister as 2d. recovered was Figure A clade like were or topology ing lentv oooisa t at BC. clade topologies of Alternative had creation A the forcing taxon oc- position, point, new this had a At assumed tree trees. of starting majority the the in curred to relative change topological h regvni iue2.B t By 2c. Figure like in were given or tree ABC polytomy the this contained at either topologies alternative point two time The 2b). (Fig. AB clade urn fe i ylso yrdzto n eei drift genetic oc- and (t hybridization 2a), ob- of (Fig. cycles first six condition after The starting curring the data. from Sim3 change of served analysis parsimony during gene of taxa. effects between topological flow the generalize most to tool are the a and as course determine used time the to over used clades recovered frequently were trees majority- consensus variation, this runs rule Given replicate points. sampling between some at observed were strict the trees of topology consensus the in simi- Differences parsimony. qualitatively to were pro- lar methods Sim4 NJ complex from and more Results the Sim2, in Sim3. seen responses Sim1, the of in subset a examined duced conditions flow topological gene The data. for Sim3 of analysis mony otta upr auswr acltdfraldata all for calculated were values support Bootstrap var- changes topological of timing exact the Although (t point last the shows 2e Figure = occurred that changes topological the shows 2 Figure ) a eraei h rqec frcvr of recovery of frequency the in decrease a was 6), = 9 rSm (t Sim4 or 19) = = 11 6) tti on,tx A taxa point, this At 163). hwdete h start- the either showed = = 11 7.N topological No 17). = Fg c,tefirst the 2c), (Fig. 6 twihthe which at 96) = 3(i.2d), (Fig. 23 = 307 23) Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 hretpt ln h ewr hticue l individ- all included that network the along path shortest (t another one from guishable 3, (Fig. length in t diminished progressively A D taxon taxon t between and 3, branch the (Fig. transition, C topological taxon final to taxon adjacent taxon that became such to B occurred similar change more topological its second lost became a A and D, taxon B as taxon flow, event to gene unique similarity continuing a With as nodes 3). visible adjacent Fig. not in but became 5, A Fig. taxon in (summarized recipient and D donor taxon that such network the of rearrangement a involved t 3; (Fig. flow taxon re- gene the cipient within changes, structure, therefore topological and any variation, to induced shown Prior are run 3. Sim3 Figure single in a and during 5, observed four sum- Figure MSNs in the to the as for occurred used conditions changes is starting Topological possible Sim3 response. study. the this marize in represented MSNs three former were the only random, used at sets chosen were data here starting D-E-C- four the E-D-C-A-B; Because D-E-C-A-B. E-D-C-B-A; B-A; used: topology tree ing sgns oooia eragmnsaogitra rnhsocre onatrtesato yrdzto n eefloe yalonger a by followed were and eroded. hybridization gradually groups of monophyletic start as the taxa recipient separated after and taxa direction soon opposite donor between the occurred the flow in is for Gene branches tree the events, support nodes. internal within where hybridization at (c) “flowed” among of frequency and period clade (b) recipient number rearrangements clade in the Topological in with Arrows as measured labeled rearrangements branch. genes. (t), are topological bold as Time predictable trees a caused change. by consensus internodes marked topological Majority-rule more is next or figure. (A) two the each taxon by during of recipient moved The bottom (f). A the in taxon at shown which shown is to topology position stable tree final, the the and indicate (a), in shown are flow gene 308 = oreulylkl S oooiseitfrtestart- the for exist topologies MSN likely equally Four F 3 6 ni oo n eiin een ogrdistin- longer no were recipient and donor until 96) 23, IGURE oooia osqecso eiuaeEvolution Reticulate of Consequences Topological .Tplgclefc fogig ndrcinlgn o ewe aao asmn nlss h nta oooyaddrcinof direction and topology initial The analysis. parsimony on taxa between flow gene unidirectional ongoing, of effect Topological 2. nNtokAnalyses Network on = ) h rttplgclchange topological first The 6). = 6) tn ieddthe did time no At 163). = 1.Atrthis After 11). SYSTEMATIC BI DadA eeeulywl upre,laigt the to leading supported, well equally splits were Conflicting AB taxa. and recipient AD similar- and growing to the donor relative indicating between decrease E, ity to and C, begun B, had splits D trivial and A splits trivial oywsspotd yt er- his- By the reticulate supported. a phase, was had this tory B At taxon AB. that interpretation split roneous to comparable weight was split a that and values, support bootstrap large nering t 3, be- flow (Fig. gene gan after Soon strictly evolution). a of under model bifurcating simulated were (which sets data those starting (t as 3 such Figure splits, in conflicting shown for values support 27% high sim- was 200 sets in data tree starting starting ulated the with conflicted that splits and (± for length) mean (branch The were weight support. split bootstrap networks by measured the conflicting as of sets, splits, evidence minor data only with starting tree-like, largely 3. the Figure in in shown are Initially, run Sim3 single a to NeighborNet B-C-E-A,D. E-A,D-C-B; observed: conditions were end until possible Two course, met. time was criterion simulation cutoff the the distinguish- of a much remained for group taxon another able from recipient individual the an thus include taxon, also A taxon of uals h pi ewrsrsligfo plcto of application from resulting networks split The LG VOL. OLOGY = ) pi Cbcm oepoiet gar- prominent, more became BC split 6), = )wr cainlyosre in observed occasionally were 0) = 11, h pi egtfrthe for weight split the D otta support bootstrap SD) ± 6.Hwvr very However, 26%. 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 astgtycreae.Tefia,sal oooywas al- topology not stable are t final, by values reached The support correlated. split tightly and ways weight split Thus 2007 sdt eemn fvraini IHaogreplicate among TIRH in variation if determine to used Replicate pattern. identical. similar became a D followed simulations and A taxa and appeared t At point. time this at taxa reticu- suggested reticulate also a were be D have to and may B taxa A However, history. taxon late that interpretation correct ogrta h rnhswti eiin ao .Tefis he rnia oriae r lte o C nlss io aito and/or methods. variation all Minor in analyses. apparent times PCO were 100 for taxon to donor plotted 10 the are typically from coordinates were flow principal branches gene three These by first taxa. taxon recipient The between the connections A. lengths into show taxon introduced branch panels recipient structure Relevant MSN within historical analyses. the in branches NeighborNet are in and the individuals branches parsimony letters; than Bold capital for longer with analyses. shown indicated are MSN are values for (populations) Taxa support shown top. bootstrap are at Relevant indicated numbered. points time are at and sampled lowercase were sets Data taxa. between flow iial erae nteSm elct hw in shown replicate Sim3 t the at always 3, in not Figure did decrease: support similarly bootstrap However, AD. smaller much became than DE for t weight sub- split By the decreased as decrease. had stantially signal to conflicting began of Bootstrap also evidence mea- strong BC. AB and as split AD importance, for to in support relative weight, diminish split to by began sured DE and AB etdaayi fvrac was variance of analysis error.—Nested I type of Erosion F IGURE eaiePromneo hntc Tree-Based, Phenetic, of Performance Relative .Tm eissoigtesgaueo eiuaeeouino oraayia ehd.Cre rosidct ieto fgene of direction indicate arrows Curved methods. analytical four on evolution reticulate of signature the showing series Time 3. = = 6 sbosrpspotfrsltD dis- DE split for support bootstrap as 163 6 pi AD split 96, n ewr Procedures Network and R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES = 0% pi DE split 100%, = 3 splits 23, = 100%. = 96, n h IHvle fietclsmlto usbe- runs simulation identical to): (≥ of methods values tween compar- TIRH by the ranked be ing also could performance Relative h olwn ar fmtoswr o significantly and ( not “bootstrap” NeighborNet parsimony were another: methods one models, from of different topological pairs four following all the across Averaged 6. Figure the method. estimate analytical to given a sufficient for four likely parameter the the were TIRH of each sets to for significant data than runs starting no due replicate rather five because found, drift, largely Moreover, was genetic set. effect is data and starting Sim4 flow the gene and of Sim3, dynamics different within Sim2, found using TIRH in by Sim1, model variation topological Therefore, caused flow. particular gene be a of within to sets effect found data significant starting was No 1). TIRH (Table flow on gene starting define to between used models topological differences between differences and/or to sets data attributable was runs S ( MSN efrac aknsbsdo IHaesonin shown are TIRH on based rankings Performance P = .56,F 0.8596), P = niae etrta requivalent or than better indicates .5) asmn bs re and tree” “best parsimony 0.453), st n J“ette”( tree” “best NJ and P = 0.2284). 309 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 u.Ist b n d hwsmayvle u xadtergo ftplgclcag pae1 o lrt.Sm eut eesmlrto similar were results the Sim1 of clarity. course for the during 1) changed (phase values support change representative clade topological recipient single of and a Sim2. donor to region from only identical values the so nearly support occurred, expand were bootstrap changes results but topological shows Sim4 values no (c) simulation. that support summary Inset except of identical. show erosion Sim3 were gradual genetically and (d) (d) 3, recipient Sim2 change; for and phase and topological taxon; values (b) 1, donor donor Error of phase Insets (c). 4, group taxa: in phase run. sister nonsister only monophyletic inference; between the direction flow 2 is one gene phase taxon in of (reticulate) for consequence indicated recipient a that are inference are value, erroneous that bootstrap 2, phases mean phase characteristic the four from indicate deviation graphs Sim2 standard similar. one as shown bars, error 310 elct simulations. replicate graphs, .Prioy“bootstrap” Parsimony 3. NeighborNet 2. PCO-MC 1. .N bs tree” “best NJ 7. MSN 6. tree” “best Parsimony 5. “bootstrap” NJ 4. F ulations) ulations) ulations) IGURE x ai snme fhbiiaineet ic tr fsmlto.(,b rqec fslce ldsfudi asmn re rm20 from trees parsimony in found clades selected of Frequency b) (a, simulation. of start since events hybridization of number is -axis ≥ .Spotfrslce ldsatrprioyaayi fSm n i3dt.Cieinue o erhssona otm nall In bottom. at shown searches for used Criterion data. Sim3 and Sim2 of analysis parsimony after clades selected for Support 4. J“ette”(0o 0simulations) 80 of (70 tree” “best NJ ≥ egbre 6 f8 simulations) 80 of (64 NeighborNet ≥ ≥ y ≥ Ai snme ftes c )Ma otta upr o eetdclades. selected for support bootstrap Mean d) (c, trees. of number is -Axis asmn bosrp 6 f8 sim- 80 of (62 “bootstrap” Parsimony F asmn bs re 8 f8 sim- 80 of (80 tree” “best Parsimony st 7 f8 simulations) 80 of (77 ≥ ≥ S 4 f8 simulations) 80 of (48 MSN J“otta”(7o 0sim- 80 of (77 “bootstrap” NJ SYSTEMATIC BI egbre,MN asmn bs re”N “best NJ tree,” F “best and tree,” parsimony MSN, 1). (Table criteria NeighborNet, parsi- “bootstrap” PCO-MC, NJ and using “bootstrap,” flow mony gene of models topological ieto fgn o ihntete a oefc on effect no had tree the ( within TIRH. flow used Sim4 gene criteria and of the Sim2 direction of between any found were for TIRH in dif- ferences significant No flow. gene of models between ferences infiatdfeec nTR a on between found was TIRH in difference significant A LG VOL. OLOGY st ehd eeisniie( insensitive were methods y Ai sbosrpspotvle o clarity, For value. support bootstrap is -Axis P > .5.Teeoe the Therefore, 0.05). P > .5 odif- to 0.05) 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 onaddrcinwt n ftopsil nl tbentok hw ttebto fec ae.Arw mhsz rearrangements emphasize in Arrows shown panel. are flow each gene of of bottom model the Sim3 at the shown to networks due stable transitions final, topological possible Major two column. networks. of the each one within of with top direction the downward at a shown is networks starting possible 2007 Total (error) sets data Within models within sets data Between models flow gene Between variation of Source cont’d ANOVA Total (error) sets data Within models within sets data Between models flow gene Between variation of Source cont’d ANOVA Total (error) sets data Within models within sets data Between models flow gene Between variation of Source cont’d ANOVA Total (error) sets data Within models within sets data Between models flow gene Between variation of Source ANOVA h rbblt ficretifrne nrae dramat- increases inferences incorrect of that probability demonstrate results the Sim1 zero than The 7). be Fig. rather must in homoplasy, indicated success (as of routine probability conflicting the to and any due reticulation, Sim1, be oc- in must flow Thus gene signal taxa. when sister possible between not are curs clades or splits Conflict- criterion. ing cutoff of the as probability values different support the bootstrap nine using shows inferences erroneous 7 con- and Figure successful when define significant. to varying was used by flict values evaluated support Neigh- were bootstrap the methods the by RSW revealed and signal borNet conflicting of plications h antd n im- and magnitude signal.—The conflicting of Evidence F IGURE .Tplgclefc fogig ndrcinlgn o ewe aao iiu pnigntokaayi.Oeo four of One analysis. network spanning minimum on taxa between flow gene unidirectional ongoing, of effect Topological 5. R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES T df df df df 79 64 12 79 64 12 79 64 12 79 64 12 ABLE 3 3 3 3 .Nse nlsso ainei TIRH. in variance of analysis Nested 1. 30, 24, 46, 40, 60, 47, 19, 12, 1, 5, 2824.70 2756.10 4657.80 1901.80 6894.10 5762.70 238.01 657.21 856.42 296.81 065.24 408.44 578.22 905.21 610.30 062.7 SS SS SS SS asmn “bootstrap” Parsimony asmn bs tree” “best Parsimony peea ntefc fpritn eeflw o Sim2, the For by flow. measured was (as gene success but of persistent probability flow cumulative of the gene face of the onset in the sig- ephemeral after decreased conflicting soon correct then appeared Thus nal simulation flow. gene the continued in suc- with of early probability increased the 3), cess and (Sim2 visualized be could not should and in error, used. II result sug- be type 95% pattern, of to similar probability 90% increased a below an show values 3 support that and gesting Sim2 95%, signal present. conflicting and correct be for 90% possible to below not is drops it percent when even cutoff the as ically ngn o oesweecretcnitn signal conflicting correct where models flow gene In MSN F st < 0.0001 .8251 .0774 .8205 .3956 .6729 .0602 .7774 P P P P 11, 3, 7 3 8 4 6 1 4 9 1, 3263.90 1248.70 2145.30 3986.30 , , , , , , , , 989.5 370.50 805.0 292.4 541.6 406.8 092.0 735.2 459.7 485.2 845.50 129.00 SS SS SS SS J“bootstrap” NJ J“ettree” “best NJ NeighborNet PCO-MC <.0001 .2354 .1222 .6546 .9011 .9033 .9039 .0222 P P P P 311 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 C aawr rae sadniylnsaeadaaye sn oprmti oa lseigpoeue(C-C,rtclt history reticulate (PCO-MC), procedure clustering modal tested. methods nonparametric network a and using tree-based analyzed the and with than landscape sooner density significantly a inferred as was treated were data PCO h rbblt fa norc neec xeddthe methods. both exceeded for inference inference correct incorrect a of an probability of Sim3, of probability duration success, the the For of failure. probability of probability cumulative high low and of terms in poorly,both performed methods both (Sim3), conditions flow gene complex more conditions under However, RSW. sig- than flow better conflicting nal revealed gene NeighborNet the Sim2, in Under modeled probability 0.5. RSW the whereas exceeded Sim2, 0.9, seldom exceeded in NeighborNet time for of success of periods brief for More- RSW. for over, than was NeighborNet 90% for higher above 57-fold percents to cutoff 5- for curve) the under area 312 rmheacia,bfraigpoess hra F whereas derived processes, sets bifurcating data hierarchical, parsi- to from applied example, appropriately For most is processes. mony genetic population phylogenetic between and continuum the spans evalu- application appropriate methods their aggregate, analytical in because, chosen The in were ated in sets. relationship data performance of simulated relative patterns the their reticulate correct, for revealing meth- evaluated nonhierarchical were two ods and methods, methods, network tree-based Two two taxa). simulated between (i.e., com- hybridization process reticulate was ongoing that an by process) plicated bifurcating generated hierarchical, that a (i.e., from signal phylogenetic containing sets els(nahadSkl 93 oaaadCrandall, and Posada both 1973; span Sokal, 2001). data and when (Sneath use realms for recommended been analysis have ordination and Networks varia- nature. largely in genetic tokogenetic are relationships of where partitioning populations, among the tion describe to designed v-ao re.TR esrdi ubro yrdzto vnssnesato iuain ro asso n tnaddvainaoethe ( above different deviation significantly standard were one procedures show analytical bars of Error simulation. groups of which start indicate since columns events above hybridization of Letters number mean. in measured TIRH trees. five-taxon ipemdlo eeflwwsue ognrt data generate to used was flow gene of model simple A F IGURE .Rltv efrac faayia rcdrsue oifrrtclt itr ne ortplgclmdl fgn o in flow gene of models topological four under history reticulate infer to used procedures analytical of performance Relative 6. D ISCU SSION SYSTEMATIC BI st was aehsoy hsba a rvosybe demonstrated been previously has bias reticu- recipient a This has history. it the late fact in that when group inference monophyletic a erroneous is taxon the NJ in and result parsimony of will use the 2; phase, previous the (phase and t this in 3, arrangement taxa Fig. group 2d, recipient Fig. sister and well-supported donor with a static, remained topol- ogy inferred the where occurred phase second a ments, taxa. but among supported, relationships highly incorrect of inference the in na- result from may sampled ture) (i.e., sets data reticulate replicate real 20 of of analysis tree-based 9 Thus respectively. and NJ, and 20, parsimony of for 5 runs in shown 85% topology exceeded 2c erroneous Figure the in taxon on support branches sister Bootstrap recovered. internal original were for its taxon neither donor the with nor it topolo- placed erroneous that Sim3, of gies 1 phase during structure tree be- begins lineages. flow non-sister gene tween anytime expected recipient be should and restruc- the turing rapid This donor in taxa. sister tree until monophyletic as the genes, appeared within as “flowed” direction rapid taxon opposite recipient a (phase the period Sim4), this 1), During Sim3, flow. the gene following Sim2, of occurred commencement order nodes (e.g., branching more internal tree in or shift starting two the by First, c). separated in 4a, were (Fig. taxa be- taxa whenever flow monophyletic gene previously persistent tween with associated charac- phases four revealed teristic simulations the of course the along olwn h re eido oooia rearrange- topological of period brief the Following underlying the through moved taxon recipient the As values support and topologies tree of Examination LG VOL. OLOGY osqecso eiuaeEvolution Reticulate of Consequences = nTe-ae Analyses Tree-Based on 3.I eldt esaqie during acquired sets data real In 23). P < .0)fo n nte.When another. one from 0.001) 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 ee triga h o ftelfms ouno ahgopo iegah:lf oun 9,9% 0;mdl oun 5,7% 70%; 75%, 85%, reticulate column, a middle has error). error). 90%; taxon II I 95%, (type recipient (type history 99%, history the reticulate reticulate column, a that of has inference left inference taxon no successful graphs: monophyletic of a a nine probability that of inference the of probability erroneous indicate group an areas the of each Gray-shaded indicate probability of the regions indicate column White-shaded regions leftmost 50%. Black-shaded history. 60%, the 65%, of column, top right the at starting were, 1. 2007 o eibeidctrta h ao sadsic evolu- distinct is a tree is taxon bifurcating lineage. the tionary a that a indicator in of reliable group a appearance not monophyletic the a possible, as is taxon reticulation stud- phylogenetic where level ies lower In 1988). (sensu Donoghue, and species defining phylogenetic for delimiting and and purposes conservation for presents lineages also ary bias This species. crop agr of evolution hypotheses the domestication ef- of reticulate) cannot (i.e., multiple sets monophyletic) and data (i.e., single multilocus between distinguish from fectively derived trees who, (2003), NJ that concluded Brown approach, simulation and different a Allaby on by based algorithm NJ the for x F Ai stm,maue nhbiiaineet rm0t 0.Bosrpvle sda h uofcieinfrifrigsgicn conflict significant inferring for criterion cutoff the as used values Bootstrap 200. to 0 from events hybridization in measured time, is -Axis IGURE a ikfrerr nietfigdsic evolution- distinct identifying in errors for risk eat .Sniiiyaayi fcnitn inlrvae yNihoNtadRW nalgraphs, all In RSW. and NeighborNet by revealed signal conflicting of analysis Sensitivity 7. R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES eQueiroz de etcae xetfrdrn h nlsae fthe of stages final the during for recip- the back- except outside complex clade, found simulation, not ient our were individuals In than other crossed parents. places in their tree a with in found be infrequently may oc- structure the tree in of 4). change members (phase further curred became no and taxa clade, recipient same and donor the from monophyletic. individuals all not introgression, was persistent with Lastly, taxon reticulate correctly the to that began suggest trees reconstructed which delayed at phase point This the was c). 4a, taxa Fig. recipient 3; and (phase eroded donor gradually of monophyly the for port cae(92 hwdta 1hbi individuals hybrid F1 that showed (1992) McDade sup- which during occurred phase lengthy a Third, y ai spoaiiy cldfo to 0 from scaled probability, is -axis 313 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 sial ml ouainszsadhg eeflwrates flow gene high and sizes unreal- population the small with istically Even recombination. and exten- flow after gene inferred sive be only could reticulation tree-based methods, using the when However, accepting true. by as the hypothesis taxon reveal null recipient to the of able history ultimately reticulate were used methods based null gene the simulation of commencement accepts flow. the the correctly after of that soonest course the method hypothesis the the over for proba- error lowest cumulative is I the type way, of perform- another bility poorly after Put than sooner methods. flow hypothesis ing gene of null commencement the the reject cease erroneously ef- will error the methods to high-performing I of method: measure type a a the of is ficacy simulations of the erosion in of early ana- apparent rate various The the by methods. rejected lytical erroneously longer lineage) (that evolutionary no distinct hypothesis was a not null is taxon the recipient before the hybridization necessary of number was the events, by measured as flow, gene much how determine to distinct wished We a lineage. be evolutionary to taxon methods recipient all the simulation, identified the erroneously in stages early process very these evolutionary Dur- ing monophyletic. reticulate speaking, strictly longer, a no was of and product taxon a recipient the became soon taxon, was recipient taxon As donor the sets. the into in introgressed data present state starting character the single a in as relationships high branching was the taxa for mono- among and a the taxon for under each Support of simulated phyly evolution. was of that model set bifurcating data starting terminal phylo- reconstructed in trees. history, genetic reticulate in their group, of monophyletic a reticulatespite as a appear infre- of generally members will (with taxa), taxon between taxa flow within gene quent when occurs that, predict mostly results our recombination cases, these In sys- hybrids. over F1 natural predominate of individuals back-crossed studies they where to However, tems applicable data individuals. necessarily cor- molecular not hybrid be are multi-locus, F1 might containing large, results sets to their hand, extended other rectly present the the with On comparable study. directly be their not may so results characters, hypothetical morphological or examined matrices (1993) character Ellstrand and Rieseberg a follow should flow. belong, gene they unidirectional persistent the during which pattern as predictable to well as taxa individuals, clado- reticulate that back-crossed indicate in of results position individuals Our the conclude predicted. hybrid readily F1 not (1993) is of grams Ellstrand placement and the that Rieseberg Funk and occasionally (1983), Humphries were (1985), taxon. donor they the within when nested polytomy, into collapse 314 l tree- methods.— All tree-based of performance Relative 50- five-taxon, a with began flow gene Simulated and (1992), McDade (1985), Funk (1983), Humphries rso fTp Error I Type of Erosion SYSTEMATIC BI ehd sd F used, methods (discussed signal of conflicting below). of possibility evidence using the points 1) time by (phase earlier at mitigated taxa reticulate be identifying correctly cases, some in may, er- I type of parsi- probability ro the The as method. correctly well “bootstrap” as be mony performed to criterion hypothesis This null accepted. the below re- considered dropped the we support of 95%, bootstrap members for When only level taxon. contained cipient support that split bootstrap trivial the the determining by sured the for alone MSNs data. of phylogenetic use of analysis the against which recommend to we Accordingly, inter- degree manner. be meaningful historically the can a that in unclear preted topologies produces is method non- MSN it the in sets, the relationships data of reconstructing Cassens reticulate performance for poor of the method those describe MSN and who results, (2005), our al. et start- upon topology tree four Based starting of 5). simulated (Fig. same possibility the the for MSNs by ing gene complicated reticulate was to More- method flow MSN 6). the (Fig. of performed tree” response the “best MSN over, parsimony the criterion, to within this similar observed poorly, Using length recipient taxon. branch and mean recipient the donor and between taxa mean observed the in length differences re- significant method branch of MSN erosion the the of on performance lied the test to devised begun has flow gene sampled when data (i.e., for 2 recently). occur and 100% still 1 if phases would during error even mono- study, I claiming type this before phyly, of required was results support the bootstrap How- on monophyletic. be based taxon to ever, reticulate asserted a erroneously that be chances will monophyletic the phase consequently, of is duration and, the decrease 2 taxon potential to high terminal very with set a be data required should that real value hypothesize support of bootstrap to analysis the the signal, in reticulate were used method this be If method to “bootstrap.” any recommend parsimony to than reason all other no crite- virtually is tree-based there in the tested, of occurred ria Therefore, but simulations. 80) effect of average (77 an just the not was of “bootstrap” NJ outperformance over method The “bootstrap” parsimony “bootstrap.” NJ than ( better significantly Parsi- performed support. “bootstrap” of character mony resolution ever-diminishing of loss with to clade prior support a occur bootstrap to of expected be erosion would that ( given alone surprising, topology not tree utilized on relied methods that that outperformed Methods significantly support unambiguous. bootstrap were rankings 1000. than greater 600 approximately to reticulate from varied a identified before be could generations taxon of number the modeled, rr fall methods.—Of nonhierarchical of performance Relative mea- was method NeighborNet the of Performance criterion methods.—The network of performance Relative poorly, performed methods tree-based all Although LG VOL. OLOGY mishg sn hsciein u hsdrawback this but criterion, this using high emains st efre h ors,ee when even poorest, the performed P < .01.Ti is This 0.0001). P < 0.0001) 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 l aawr nldd F included, When were hybridizing. taxa be all to known taxa those only culation inl hsi o rtcs falgtmt s fF of use legitimate a of criticism a not is phylogenetic This containing sets signal. data in reticulation tifying sn h eeoscieino nern reticulation inferring of criterion F when generous the using 2007 atgsoe te oheacia lseigmethods. clustering nonhierarchical other over vantages set. data given a stable for R most membership of the and determine number range cluster to a explored be data, should real values PROC analyzing in of When available use number MODECLUS. legitimate cluster of for the four tests permit significance least would at landscape landscape” smoother (where A density variation). “density internal data no exhibited smoother simulated clusters five a the with to than lead realized be should within also variation taxa natural might where sets, method data real using Improve- PCO-MC when error. the I type in error, promote per- II ments type values it promote R values small because R whereas Large parameter I errors. type II useful between type trade-off a and inevitable the is over control R mits for set, precisely data define any to and difficult densities cluster Although evaluate membership. to smooth- used the (R) of parameter estimation ing accurate more by achieved be associations. to statistical dimensions for scrutinized many simultaneously be permits of clustering number modal large of cation a appli- in Subsequent eigenvectors). emerge (or asso- dimensions possible freely such more calculated, may are ciations coordinates principal apparent. becoming When as- from some individuals prevent may among networks) sociations extent, lesser a trees to bifurcating (and, of relation- structure Second, historical low-dimensional synapomorphy. rigid, of the than indicator individuals better among a ships be of coalescent may average independent genes, an many by overall as contributed an calculated distances recombination, measure, of distance presence to genetic the bound- peculiar in factors phylogeny/tokogeny First, two the ary. of across collusion sampled to data due be may ods support. bootstrap and from 100% large, A received remained taxa taxon other A distinguished all and that split D NeighborNet or the C net- taxa spanning minimum between of the work taxon along sister distance monophyletic the the D, taxon as A taxon bootstrap 100% for retained support parsimony when iden- point nearly a were at D tical and A taxa for coordinates principal t at Sim3 in example, gene For of flow. commencement the principal following similar values acquire coordinate rapidly to taxa representing hybridizing clusters show the time over Plots values. TIRH ( param- smoothing value conservative eter a using when even ods, tokogenetically among populations. related variation genetic describing for ofie ifrne ewe ohbiiigtx.Thus, taxa. F nonhybridizing between differences fixed to st oprmti oa lseighsanme fad- of number a has clustering modal Nonparametric might method PCO-MC the of improvement Further meth- other over PCO-MC of outperformance The meth- other all outperformed significantly PCO-MC osntapa ob naporaemti o iden- for metric appropriate an be to appear not does st rpe eo .5adicuigi h cal- the in including and 0.95 below dropped R = .41 htbae eut oadlarge toward results biased that 0.0401) R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES st i o eit rm10due 1.0 from deviate not did = 3(i.3,tefis three first the 3), (Fig. 23 st iuae aawt nw eiuaehistory. reticulate appropriate known using neither with tested data knowledge, formally simulated been our 2002; yet To has al., 2003). method et Double, Raxworthy and 2002; Abbott Trueman, and retic- implemented in- infrequently (Nicholas which although been RSW, in has whereas attractive, sets important, tuitively data be for may used ulation commonly now NeighborNet are as such methods Network 2004). and Moulton, Bryant 1998; (Trueman, recombination hybridization intergenic as or such processes reticulate by caused sets data phylogenetic in signal conflicting identifying for ods in warranted. taxa is sets reticulate data simulated identify and real correctly complex to methods sim- this ordination- based of of ability the results of promising examination further the study on ple indicated Based set. groups data distinct a by evolutionarily identify of to de- is number goal be the the when not essential is need Sec- This priori. clusters used. a of fined be number nonnormal may expected or coordinates) the ondly, principal nonuniform as thus, (such required; data are data distribution underlying of the about assumptions no First, itrtx eg,Sm) hs oflcigsga criteria signal conflicting Thus, Sim1). (e.g., cor- taxa between occurred revealing sister flow gene of when signal incapable conflicting rect were likely. methods more even both Lastly, were inferences incorrect flow gene conditions, complex topologically under re- possible history reticulate mained of phylogenetic inference underlying correct a the Although by in tree. separated internodes taxa more between or re- occurred two whenever has high flow be gene will error cent II nature, type from of sampled probability sets the data reticulate real are to RSW) applied and NeighborNet as conflicting (such identify signal that phylogenetic methods when re- These that, monophyletic. imply retic- was sults a it have fact to in taxon when a history ulate identifying error: II were type RSW to and prone NeighborNet both Sim3), (e.g., taxa lated error. I type in result to still expected would analysis be NeighborNet 2, phase correspond- to period roughly regions a “ero-ing During the criteria. using error” with I obtained type of be overlap sion could not inferences do correct inference where correct a result criteria in signal conflicting where course simu- time the lation of regions the Nevertheless, Sim2). those by “bootstrap” (e.g., modeled processes parsimony reticulate cumulative specific than highly lower error certain for a I have type to of probability expected with be coupled Neigh- might TIRH, is for borNet signal ranking retic- performance conflicting identifying second-place on its correctly based of taxa (Fig. possibility ulate signal the phylogenetic When conflicting 7). his- revealing reticulate by of indications tory correct ongoing producing ca- were of an NeighborNet pable of and RSW 1) both phase process, stages reticulate to early the roughly during corresponding Sim2) (i.e., (e.g., flow gene of models We meth- as proposed been have RSW and NeighborNet hngn o curdbtentodsatyre- distantly two between occurred flow gene When aesonta,udrpriua topological particular under that, shown have vdneo oflcigSignal Conflicting of Evidence 315 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 e a eiuaehistory. data reticulate a within a taxon has specific for set a useful that universally hypothesis the be testing not may reticulate networks of hallmark evolution), (the data signal a conflicting which contains to extent set the of visualization Neighbor- the as In permit such Net 2006). methods Bryant, network while and words, (Huson other topology sup- entirety, tree-like its in a taken set, ports data a which to degree the of examination preliminary for meth- useful primarily network be re- split may ods that signal suggestions conflicting prior support the find- ings Our clear. of entirely not is utility analyses network by in the vealed reticulation sets, generalized in- data more may some of taxa) indication related the distantly duce two between flow unidirectional as gene (such reticulate, process reticulate not simple is signal a history (3) and the conflicting when even significant observed be of may evidence (2) common, were of criteria complexity error” not. I topological type of “erosion the whereas flow, by gene affected clearly were 316 inlgn o.A nraei ouainsz sex- is (e.g., realis- sizes size For population 2). tic population (Appendix TIRH in increase greatly increase multidirec- to pected An or flow. bidirectional gene of gene in effect tional the variation and rate, rates, mutation popula- flow level, include ploidy These size, simulation. tion this in modeled not systems. natural from sets ana- data when complex of important more consequences be lyzing may the that predicting variables for unmodeled foundation ISSR, pro- a RAPD, model simplified vides as this that such believe We SNP. sec- markers or AFLP, of molecular lineages study bacterial binary a conjugating using in observed between be contact might ondary generated what that to producing reality, akin process biological in data the based was (irre- of model data The specifics real it). of the lev- analysis of the signal spective that in conflicting signal encountered conflicting of be of might range range the reticulate wide of generate representative a els, to with rather that sets but process data nature particular in any occur mimic might precisely to not was two trees. between five-taxon within flow taxa gene unidirectional and haploid only was considered it simplified: highly introgressive was of hybridization model The genes. in- of coalescent analysis dependent separate to opposed as loci from unlinked data many of analysis simultaneous the study with The concerned reproduction. is with to applica- taxa barriers between most nonexistent flow or are gene weak intraspecific results of The cases study. to ble present the to ply edmb eelduigbfraigte-ae proce- tree-based bifurcating or using revealed events be would hybrid history seldom (>10,000 reticulate true large modeled that so generations) those >100,000 be as may such TIRH flow here, gene and distance genetic ie ht()gn o ewe itrtx a be may taxa sister between flow gene (1) that Given eea aibe motn nntrlsseswere systems natural in important variables Several however, model, simplified this using for reason The ap- limitations of number a simulations, all with As eeaiyo Findings of Generality N > 00 n eeso starting of levels and 1000) SYSTEMATIC BI es(.. hr oo n eiin aawr o yet NeighborNet, not were PCO-MC, taxa used recipient then We and homogeneous). donor where data (i.e., randomly non-trivial of We sets totality the data. vari- from sets simulated data undertook of our 200 sampled we of performance surrogate, reanalysis the a blind realistic evaluate a As not methods. to is analytical data it ous real observed, use been have to usually not reticulate will events a hybridization from past derived because were Furthermore, process. data the if priori, a know, predict. to clade difficult new is this tree of the position contrast in the in case, unidirectional Thus, ancestry. the to common accumu- and synapomorphies mutation which late: by the than mechanism rather fixation, usual and drift the of via processes flow, stochastic gene to due apparent appear these that would genes Note synapomorphies emerge. exchanging would taxa clade single two a that the in of synapomorphies inclusion apparent the of support assemblage novel branches. A internal for support character an- eroding one by towards position other them starting forcing simultaneously, their tree the from in away exchange under- gene taxa the going of both pull would states less Bidirectional character of case. is flow unidirectional topology the tree in the on than flow predictable taxa, more gene hybridizing bidirectional where the of cases effect separates internode in one However, than decrease. also TIRH so decrease, should to expected is taxa hybridizing two the between With homogeneity to possible. time the flow, is gene bidirectional flow gene multidirectional of or order rectional rank the examined. affect methods to the for likely performance seems relative expected rate nor is rate flow flow mutation gene in gene variation Neither in TIRH. increase decrease to An process minimal. hybrid ongoing be an may for im- TIRH the on so mutation 1998), of al., pact et (Drake replication per nucleotide h aea hc nrgesdcaatr r fixed, 10 of order are the on , characters generally in low are be introgressed to rates thought Mutation which infinity. approaches approaches at TIRH population at a rate rate in the the fixed As TIRH. are increase mutations to which Thus is mutation arise. of taxa effect recipient the (rather and distinguish donor that and homogenize) characters pop- than taxa new a within fixation, into occur to states drift mutations character as However, novel introduces ulation. it that in size. the population to increasing similar of be effect should organisms polyploid from or data diploid analyzing of effect tree-based The supported, analyses. phylogenetic be strongly to on presumed now based taxa plant monophyletic level lower many ruled for be out important cannot history is reticulate that finding suggests this it conse- because However, the bias. ameliorate this may of time quences over dis- fluctuation size nature, finding population In in towards groups. biased monophyletic) (e.g., inherently tinct are that dures hnaayigra aa twl fe edfcl to difficult be often will it data, real analyzing When bidi- populations, structured with systems natural In h feto uaini iia ota fgn flow gene of that to similar is mutation of effect The LG VOL. OLOGY Recommendations −10 per 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 2007 ei aasets. phyloge- data in taxa netic reticulate identifying other for methods no PCO-MC tested provides than the study of this any recommend identified, to be evidence can value R appro- incorrect an priate that an provided Therefore, and time. time, the cor- of the 20% a of result 6% applied, obtained the was likewise of result was rect 6% obtained NeighborNet was When result incorrect time. of an 7% but obtained time, was the result to correct subjected a were analysis, PCO-MC re- parsimony using that inference sets no data in When sulted that alone. over PCO-MC methods using combining achieved by attained be advan- no could Neigh- Furthermore, tage 28%. for and and 34.5%, 32.5%, 37.5%, 3.5%, and borNet, 96%, 3%, these was 64.5%, bootstrap, were errors parsimony frequencies I For respectively. type 0.5%, and and errors, an- II correct of type frequency order swers, the PCO-MC, rank For the same. the NeighborNet), was and each parsimony of for performance crite- 100% cutoff the (R When maximize error. to method II altered type were of ria frequency high a because primarily of methods two other the derperformed un- NeighborNet NeighborNet. and “bootstrap” simony made. been had inferences until the hidden but after known was de- which answer, mono- criteria correct inferences the error” were to our I compared which we type Finally, previously. of and scribed “erosion history the reticulate using phyletic a had taxa which determine to methods “bootstrap” parsimony and eosrcighge ee eainhp mn mono- among relationships level for higher appropriate reconstructing methods, tree-based bifurcating plying n orcl eetdfralmnpyei aa n“norc”ase eutdfo alr omk orc neec o l aaincluded taxa monophyletic. all be for to inference inferred correct were a taxa taxon, make all reticulate to when the failure made for a was accepted from history correctly resulted reticulate was answer of hypothesis inference” “incorrect” null “No An the analysis. taxa. when the monophyletic obtained in all was answer for “correct” rejected A correctly flow. and gene simulated of models four iue8sosta C-Cotefre ohpar- both outperformed PCO-MC that shows 8 Figure hssuydmntae oeo h iflso ap- of pitfalls the of some demonstrates study This F IGURE .Rltv efrac fPOM,prioy“otta, n egbre ehd ndt essmlda admfo all from random at sampled sets data in methods NeighborNet and “bootstrap,” parsimony PCO-MC, of performance Relative 8. = .01frPOM,bosrpcutoff bootstrap PCO-MC, for 2.0001 R C EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES ONC LUSIONS = rusfo eiuaetx ndt esta pnthe span that sets tokogeny. data and phylogeny in between taxa boundary reticulate from groups monophyletic terminal net- distinguishing or for tree-based methods than work useful generally may more PCO-MC) be Nonhier- to as prove evolution. (such reticulate procedures of ordination archical product a are groups II monophyletic type clearly to that histo- finding prone towards reticulate highly error—biased are identify methods to such condi- However, used ries. restrictive successfully very certain, be signal (under tions) phylogenetic may sets, conflicting data reveal in to designed are which RSW, as such methods tree-based or NeighborNet, is purposes, for conservation or questionable. for species, delimiting populations or defining prioritizing of purpose groups the monophyletic for terminal identify in to pro- alone tree-based hybridize of cedures use to the that suggest known results phyloge- Our taxa nature. supported among strongly stud- relationships reveal some netic to of purport conclusions that the ies challenge taxa. results of among These inference relationships the erroneous in supported, result highly may data reticulate substan- of a analysis tree-based has that it show we when Moreover, finding history. even reticulate towards tial monophyletic error—biased be to I taxon type a because to prone methods are tree-based they using identifiable readily relationships. reticulate to We lead have past where may studies, hybridization level lower to taxa, terminal phyletic hn o epu omnso rf eso ftemanuscript. the of version draft a on comments helpful for Zhang hssuysget htntokmtossc as such methods network that suggests study This hwta aawt eiuaehsoismyntbe not may histories reticulate with taxa that show We hn eis sa,Mr imn,DvdTn,adLibing and Tank, David Simmons, Mark Islam, Melissa thank A CK NOWLEDGEMENTS 317 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 318 ic,W .19.Ntok n ia vlto.J o.Evol. Mol. J. evolution. viral and Networks 1997. M. W. Fitch, geo- and frequencies allele Using 1994. Smouse. E. P. and L., Excoffier, Distribution 1996. Rieseberg. H. L. and Whitkus, R. 1998. C., Crow. F. N. J. Ellstrand, and Charlesworth, species D. for Charlesworth, topologies B. tree W., J. allele Drake, of irrelevance The 1995. J. J. Doyle, genet- conservation of expansion The 2004. Amato. G. and R., DeSalle, reduced of study simulation A 2000. Olmstead. G. R. systematics and W., Phylogenetic R. 1988. Debry, Donoghue. J. M. and and K., variation, Queiroz, genetic De Populations, 1992. Nixon. C. K. and I., J. Davis, markers molecular Single-locus 2004. Mort. E. J. M. and D., Crawford, Palmer. D. J. and Vincent, M. T. Cheng, Y. Parkinson, L. C. S.-M., Chaw, Les, H. D. Morgan, D. Olmstead, G. R. Soltis, E. D. W., M. Chase, Evaluating 2005. Milinkovitch. C. M. and Mardulyn, P. I., Cassens, al- agglomerative An NeighborNet: 2004. V. Moulton, in and reticulation D., and Bryant, Hierarchy 1979. Wanntorp. Phylogeny H.-E. and 2000. K., Bremer, DePamphilis. W. C. and Coat, G. M., L. Bowe, the on perspectives Genealogical 1995. Shaw. L. K. and A., D. Baum, networks: Median 2000. Richards. M. and Macaulay, V. H.-J., Bandelt, Neigel, E. J. Lamb, T. Bermingham, E. Ball, M. R. Arnold, J. C., of J. Avise, Uni- origins Oxford the evolution. and and hybridization data Natural 1997. AFLP L. M. 2003. Arnold, Brown. A. T. and G., R. Allaby, and shy of Phylogeography 2003. Double. C. M. and L., C. Abbott, 4Spl) S65–S75. 44(Suppl.): 136:343–359. Genetics parsimony. variance Molec- species: ular a within trees gene reconstruct to subdivision graphic 93:5090– USA Sci. Acad. Natl. Proc. 5093. hybrids. plant spontaneous of 148:1667–1686. Genetics mutation. spontaneous of Rates 20:574–588. Bot. Syst. alternative. non-topological a and delimitation, 5:702–712. Genet. Rev. Nat. ics. 49:171– 179. Biol. Syst. analysis. resampling bootstrap in effort tree-search 4:317–338. problem. species the and 41:421–435. Biol. Syst. species. phylogenetic of delimitation the Observations levels: 53:631–635. Taxon taxonomic comments. lower and at relationships inferring for conifers. and 97:4086–4091. Gnetales USA Sci. of Acad. origin Natl. and Proc. Mono- gymnosperms genomes: extant three all of from phyly inferred phylogeny plant Seed 2000. gene A. V. 80:528–580. nucleotide plastid and of the Dayanandan, analysis S. from an plants: Barrett, sequences seed H. of C. Golenberg, S. E. 1993. Graham, Eguiarte, Albert. E. S. L. Learn, Quinn, Swensen, H. J. M. C. G. S. Gadek, Soltis, A. S. P. P. Furnier, Williams, Plunkett, R. E. M. G. S. G. Smith, S. F. Xiang, B. Q.-Y. J. Hedren, Wimpee, Strauss, F. M. H. C. Sytsma, Clark, S. Kim, J. D. K.-J. W. K. Jansen, Karol, A. Manhart, K. G. K. R. R. K. Qiu, J. Gaut, Kress, -L. Palmer, J. Y. W. D. Hills, Michaels, J. J. G. Conti, H. H. E. Price, Rettig, A. H. R. J. Duvall, Kron, R. M. Mishler, D. B. maxi- global 54:363–372. the Biol. outperform Syst. approach? algorithms parsimony existing mum se- Do simulated data: using quence methods construction “network” intraspecific Biol. Mol. networks. phylogenetic of 21:255–265. Evol. construction the for gorithm 28:624–627. Zool. Syst. systematics. Extant 97:4092–4097. USA compartments: Sci. are Acad. genomic relatives Natl. closest three Proc. Gnetales’ conifers. all and on monophyletic are based gymnosperms plants seed of Stephenson, Louis. G. St. A. Garden, and Botanical Hoch Missouri C. eds.). (P. biosystematics plant to proaches 289–303 Pages problem. species 16:8–28. Evol. Phylogenet. Mol. mtDNA. human two from and studies simulation, case one reduction, greedy and construction Speedy and 18:489–522. genetics Syst. population Ecol. Rev. between Annu. bridge systematics. phylogeography: DNA Intraspecific 1987. mitochondrial Saunders. The C. N. and Reeb, A. C. York. New Press, versity 46:448–453. Genome crops. domesticated se- Molec- 12:2747–2758. taxonomy. DNA Ecology and ular history mitochondrial population for from Implications quences: inferred albatrosses white-capped R EFERENCES in .An isuiBt Garden Bot. Missouri Ann. rbcL. xeietladmlclrap- molecular and Experimental SYSTEMATIC BI uslo .A,adG mt.20.Amlclrpyoeyof phylogeny molecular A 2004. Amato. G. and 2002. A., and M. Russello, Nussbaum. molecular can A. What 1993. R. Ellstrand. and C. N. Forstner, and H., J. L. Rieseberg, R. M. J., C. Raxworthy, the for application An Seq-Gen: 1997. on Grassly. C. recombination N. of and effect A., Rambaut, The 2002. Crandall. A. K. genealogies: and gene D., Intraspecific Posada, 2001. Crandall. A. K. and D., Posada, M. Soltis, S. P. Soltis, E. D. Bernasconi-Quadroni, F. phylo- Lee, the J. of Y.-L., amplification Qiu, An 1990. Wheeler. D. Q. and C., K. Nixon, fam- the of taxonomy The 2002. Trueman. H. W. J. and L., W. Nicholas, cladograms: in branching Multiple 1980. Platnick. I. N. and G., Nelson, Compar- III. systematics. phylogenetic and Hybrids 1997. L. McDade, impact The II. systematics. phylogenetic and Hybrids 1992. L. McDade, of Patterns I. systematics. phylogenetic and Hybrids 1990. L. McDade, phy- angiosperm of root The 1999. Donoghue. J. M. and S., Mathews, Software analysis. data Genetic GDA: 2002. Zaykin. D. and O., biogeo- P. Lewis, of Reconstruction 2002. Makarenkov. V. and P., Legendre, Classif. J. evolution. reticulate on section Special 2000. (ed.) P. Legendre, phylogenet- in problems RAPD 1996. LaPointe. F.-J. and plants. A., P. flowering Landry, in hybridization Intergeneric 1972. W. I. molecules. Knobloch, protein of Evolution 1969. Cantor. R. C. and H., T. Jukes, Bul- florale. distribution la net- sur rescherches phylogenetic Nouvelles of 1908. P. Application Jaccard, 2006. Bryant. D. and H., D. Huson, 89–103 Pages analysis. hybrid in data Primary 1983. J. C. Humphries, Press, Illinois of University systematics. Phylogenetic 1966. W. Hennig, recom- to subject sequences of evolution Reconstructing Conser- 1990. 2000. J. Vogler. P. Hein, A. and Amato, G. DeSalle, R. Z., P. hi- Goldstein, nested species, Phylogenetic 2000. DeSalle. R. New and Press, Z., P. University Goldstein, Columbia . Plant 1971. E. V. Grant, genes chloroplast 17 of Utility 2000. Olmstead. G. R. and W., Mis- S. Graham, Ann. hybridization. and patterns Phylogenetic 1985. A. V. Funk, n osrain o.Pyoee.Eo.30:421–437. Evol. Phylogenet. taxonomy, Mol. biogeography, conservation. parrot and Neotropical for Implications zona: Rev. Crit. 12:213–241. hybridization? Sci. plant Plant about us tell markers morphological 415:784–787. Nature dispersal. oceanic by radiation Chameleon 13:235–238. Biosci. Appl. phyloge- Comput. along trees. evolution netic sequence DNA of simulation Carlo Monte 54:396–402. Evol. Mol. J. estimation. phylogeny of accuracy the Tr 402:404–407. Nature and genomes. plastid 1999. nuclear mitochondrial, Chase. W. from M. Evidence and angiosperms: Savolainen, earliest V. The Chen, Z. Zimmer, A. E. Zanis, 6:211–223. Cladistics concept. species genetic 4:453–470. cladistic Nematology A analysis. Nematoda): (Monhysterida: 1951 Chitwood, Xyalidae ily Two 22:669–683. Bot. Syst. methods. distance with ison 46:1329–1346. Evolution analysis. cladistic on hybrids of cladistic for implications 44:1685–1700. their Evolution and analysis. hybrids in expression character 282:947– 950. Science genes. phytochrome duplicate from inferred logeny authors. the by distributed Biol. Syst. reticulograms. using 51:199–216. networks evolutionary and graphic 17:153–195. 25:283–290. Scripta Zool. ics. 21:97–103. Taxon York. New Press, Academic ed.). 21–132 Pages Soci´et´ la de letin 23:254–267. Evol. Biol. Mol. studies. evolutionary in works Columbia eds.). Funk, York. A. V. New and Press, Platnick University I. (N. Society Hennig Willi in Urbana. 98:185–200. Biosci. Math. parsimony. using bination 14:120–131. Biol. Conserv. boundary. species the at genetics vation 16:364–384. Cladistics fixation. character and erarchies, Bot. York. J. Am. angiosperms. basal the of 87:1712–1730. phylogeny the inferring for 72:681–715. Garden Bot. souri LG VOL. OLOGY e rfigit ewrs rnsEo.Eo.16:37–45. Evol. Ecol. Trends networks. into grafting ees dacsi ldsis rceig fteScn etn fthe of Meeting Second the of Proceedings cladistics: in Advances nepeain.Ss.Zo.29:86–91. Zool. Syst. interpretations. in eV amla rti eaoim I H .Munro, N. (H. III metabolism, protein Mammalian uos e cecsNtrle 44:223–270. Naturelles Sciences des audoise Ama- 56 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 fpplto ieo hoeia iet ooeet.Ae r h aea a,ecp htthe that except (a), as same the are Axes homogeneity. to time theoretical on size population of Tr analy- cladistic A 1992. Sing. F. C. and Crandall, A. K. R., A. Templeton, wfod .L 99 PAUP 1999. L. D. Swofford, Guo, P. Y. Jankowics, J. Mullner, N. A. Tremetsberger, K. F., T. Stuessy, Proceedings evolution. in hybridization of role The 1959. L. G. Uni- Stebbins, Columbia plants. in evolution and Variation 1950. L. G. Stebbins, prin- the taxonomy; Numerical 1973. Sokal. R. R. and A., H. evolution. P. Sneath, reticulate of representation Cladistic 1975. A. H. P. delimiting Sneath, for criteria Operational Phylogeny 2004. 2003. Marshall. C. Bennett. J. R. and W., J. J. Sites, and Olmstead, G. R. W., R. Scotland, software A Arlequin: 2000. Excoffier. L. and Roessli, D. S., on Schneider, recombination of Consequences 2000. Hein. and J. biology and population H., M. Plant Schierup, 2001. Leverich. J. W. SAS and procedure. A., MODECLUS B. The Schaal, 1993. Kuo. A.-H. and S., W. Sarle, 2007 uv niae hoeia iet ooeet o ouainsz N size population for homogeneity to time theoretical indicates Curve ea,J .H 98 ees ucsiewihig yt Biol. Syst. weighting. successive Reverse 1998. H. W. J. ueman, 47:733–737. 132:619–633. Cladogram Genetics III. estimation. data. sequence DNA restric- and from mapping endonuclease inferred tion haplotypes with associations phenotypic of sis Massachusetts. Sunderland, .M az,adR .Sme.20.Temligo systematics of 4:287–296. melding level: genus The populational the from the 2003. examples at Samuel. investigations M. through biogeography R. and and Baeza, M. C. 103:231–251. Society Philosophical American the of York. New Press, versity San Freeman, H. W. classification. numerical Francisco. of practice and ciples 24:360–368. Zool. Syst. 35:199–227. Syst. Evol. Ecol. Rev. Annu. species. 52:539–548. Biol. Syst. morphology. of role The reconstruction: Geneva. of and University Genetics Anthropology, of 2.0.1.1. Department version Lab. analysis, Biometry data genetics population for 156:879–891. Genetics analysis. phylogenetic traditional 50:679–695. Taxon systematics. Carolina. North Cary, Institute, SAS P-256. Report Technical oy( mony F IGURE ∗ n te ehd) eso .b0 iae Associates, Sinauer 4.0b10. version methods), other and 1 a feto triggntcdsac ntetm eurdfrtohbiiigtx obcm eeial indistinguishable. genetically become to taxa hybridizing two for required time the on distance genetic starting of Effect (a) A1. R EE N IHRSDSIGIHN OOHL RMRETICULATION FROM MONOPHYLY RICHARDS—DISTINGUISHING AND EEVES Hypochaeris ∗ hlgntcaayi sn parsi- using analysis Phylogenetic : Atrca) ai pl Ecol. Appl. Basic (Asteraceae). = lt itr a bandfrapriua iepit ie ht20 that point: Given time point. each time for analyzed particular were a sets data for different obtained was history ulate “probability.” indicates “Pr” where any at simulation, Therefore, the set. of data course time incorrect one the and any during correct point for particular both observed is, be could that former inferences nonexclusive, The history. were reticulate outcomes of inference two no inference and incorrect history, an reticulate history, were of reticulate for of outcomes levels inference Three correct signal presented. a are possible: conflicting analyses on RSW erroneous based and and NeighborNet history successful reticulate of of probability inferences the calculating for Formulas soit dtr dinPaterson Adrian Editor: Associate 2006; September 12 returned reviews 2006; June 6 submitted First Biol. Mol. evolution. reticulate under analysis Phylogenetic 2000. S. the Xu, at looking of way Another 1990. Nixon. C. K. the and for D., F-statistics Q. Wheeler, Estimating 1984. Cockerham. C. C. and S., B. Weir, Tr D bevdi hssuy b Effect (b) study. this in observed (±SD) values mean show Points 10. ea,J .H,adM ib.20.RW..Sfwr distributed Software RSW1.1. 2002. Gibbs. M. and H., W. J. ueman, nlacpac 7Nvme 2006 November 27 acceptance final 17:897–907. Evol. Cladistics Donoghue. and Queiroz de to 6:77–81. reply A problem: species 38:1358–1370. Evolution structure. population of analysis authors. the by Let N = Pr(success) h oa ubro aast nwihn neec fretic- of inference no which in sets data of number total the y rn inference) Pr(no + ai si log in is -axis Pr(failure) A PPENDIX + 10 scale. rn inference) Pr(no = 1 /0 (2) N/20. = (1) 1 319 Downloaded By: [USYB - Systematic Biology] At: 21:43 30 April 2007 hw ordc o1 to reduce to shown nrltosisaogptnilyitrreigppltos hc pera pas nteimages. the perspective different in slightly “peaks” a as provides appear coordinates principal which plotted populations, of interbreeding pair potentially Each among data. coordinate relationships principal on of landscape density a as visualized hr sol n orc neec osbeprdt e.Temaximum C The for set. data I value per for possible maximum value inference correct The one only point. is time there particular analyzed a sets data 20 at all across summed inferences incorrect of number Pr(inference)= inference, an yields run a that probability the Accordingly, 320 orc neec rmtettlt fifrne,or: inferences, of totality the from inference correct a Pr(success) yfcoig[1 factoring By [1 is: (1) equation of expansion the Therefore, Likewise: 1 − e C Let − (N/20)] uuu lupulus (Humulus varieties hops American North native of history evolutionary reticulate The (N/20). = > = h oa ubro orc neecsadI and inferences correct of number total the 0bcuemlil norc neecsaepsil.The possible. are inferences incorrect multiple because 20 × Pr(success) rifrne utpidb h rbblt fsampling of probability the by multiplied Pr(inference) − Pr(failure) [C/(C N2) rmtefis w em,Euto 5 seasily is (5) Equation terms, two first the from (N/20)] = + 1. I)] = = + [1 [1 [1 − − − (N/20)] (N/20)] (N/20)]× × × [C/(C [I/(C [I/(C + + + ).(4) I)]. ).(3) I)]. I)] + = N/20 SYSTEMATIC BI = 0because 20 h total the = 1. (5) d[D,R] aat eoegntclyietclcnb prxmtdb h fol- the by approximated hybridizing be d[D,R] two can for equation: identical required lowing genetically time become the to model, taxa haploid present the In eonzda itntlnae sn oto h nltclmethods analytical the of study. most this in using be considered lineages longer no distinct will as taxa interbreeding recognized two before required be hybridization this will of events under thousands many identical sizes, genetically population realistic become For nec- model. to relationship events populations hybridization two the of for shows number essary the A1b and Figure size in formula. population the shown between than the as larger by Therefore, slightly were predicted sampling. either study set values this characters in data introgressed observed to values the A1a, prior Figure of lost all) or not fixed ran- were (but of (10) most generations of mating, number dom fixed A1a). a (Fig. used distance simulation genetic the Because starting identi- become with to logarithmically taxa increases ge- hybridizing cal for data that required to and time prior taxon The neutral, recipient sampling. are the set in states occurred has character fixation produced all until drift is that netic individual event, hybrid flow F1 gene single per a that 1/N. is assumes drift equation genetic via This proportion fixed parental the become and to donor expected 0.5, alleles of is introgressed progeny of proportion hybrid F1 hybridiza- the in of began), present number states flow the character gene as (measured since t-1 events and tion t time at taxa recipient LG VOL. OLOGY t n d[D,R] and t −1 r h eei itne ewe h oo and donor the between distances genetic the are t vars. ≈ A d[D,R] PPENDIX lupuloides, t −1 − 2 (0.5)(d[D,R] and neomexicanus, t −1 (/) where )(1/N), pubescens) 56