Posted on Authorea 1 Oct 2020 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.160157511.10593343 | This a preprint and has not been peer reviewed. Data may be preliminary. sesn pce onaiscnb atclrycalnigfrtx htaentesl morphologically of easily comparisons quantitative not context, are this that In taxa preferences. for habitat challenging similar Wiens particularly superficially 1997; have be or Mayden can morphological, maydistinguished 1963; on boundaries which emphases 1957, species of their 1942, each Assessing in (Mayr species, differ Freudenstein assessments 1998; that distinguishing Queiroz concepts species and de species contradictory divergences 2004; of Loera resolving or plethora 2009; for inconsistent Wilkins a genetic, 2007; information to produce historical, Queiroz led a ecological diverse de has from or by 2004; This suffer affected Wiens genetic, identification are 2003; 2012). Marshall and speciation al. & of delimitation (Sites et processes species factors the in MacDonald stochastic because employed and 2013; part ecological, procedures Magurran in the 2004; limitations, 2019), (Mace of IPBES initiatives number conservation 2017; modern al. of amidst et (Coates consequences units planet political operational our and of the losses economic, comprise biodiversity social, and habitat biological, rapid greater the increasingly has delimitation Species Introduction and ecological combines that delimitation Keywords: species hesperis integrated S. to concepts. of approach delimitation species an related species of S. current reconciles utility of the and the that data populations suggest to demonstrate suggest genomic southern analyses We modelling and and integrity. these northern niche revised, genetic of between be applied their Results flow should then reinforcing gene lineages. We and contemporary genomic to hindering recovered combination contributing expansion. is the a hesperis, range conditions to among ecological attributed flow post-glacial to be gene and may adaptation to which zerene, that barriers hesperis, S. and substantial S. divergence species, recovered within ecological also another we assess but with SNPs, atlantis, introgression S. genomic and past apply Using muddled We hesperis of its S. for discordance. delimitation. between notorious mito-nuclear only species is not and integrated which divergence, complex, variation in species morphological aid multidisciplinary ) to delimitations, new, approach (: data an present atlantis-hesperis species present sequence Speyeria and we the next-generation Here, dynamics, to with otherwise. approach genomics population classify models this both to on niche difficult in focused been ecological advances complex, have research combining that Recent often taxa for to in taxa. and boundaries boon many heterogenous species for a clarifying is challenge for been opportunities speciation a have remained of modelling has process changes ecological boundaries the rapid and species However, amidst of particularly characterization biodiversity, landscapes. robust planet’s natural and our and categorizing conditions and environmental understanding to to essential is delimitation Species Abstract 2020 1, October 3 2 1 Campbell Erin a butterflies using of delimitation group confusing species ecological and genomic Reconciling ffiito o available not Affiliation Entomology of Museum Texas Alberta of University clgclseito,hbttsiaiiy pce eiiain ouaingnmc,introgression genomics, population delimitation, species suitability, habitat speciation, ecological , 1 ahr MacDonald Zachary , tal. et 2016). 1 dGage Ed , 1 2 ad Gage Randy , tal. et 08 Stanton 2018; 3 n ei Sperling Felix and , tal. et 09.Wiespecies While 2019). 3 Posted on Authorea 1 Oct 2020 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.160157511.10593343 | This a preprint and has not been peer reviewed. Data may be preliminary. tts eefe eerdt ste“otenlnae) n hs curn hogottesouthwestern the United throughout the occurring in those Dakota and South lineage”), and “northern Montana the and as Canada to in referred Saskatchewan hereafter and States; Alberta, Columbia, British between as SNPs well in as divergence, genetic (Scott al. (Campbell alone and morphology populations morphological using identify sympatric of reliably between assessments hybridization on Campbell of 2017; based lack species, apparent distinct an Hammond as 1981; Brown taxa and these Miller 1951; to (Grey Mountains 2009). similarity considered Rocky variation Dunford morphological have as the 1975; treatments well Moeck along taxonomic as 1947; south Some areas Grey and some & Columbia in Passos British species (dos each contact between to species species similarity within Manitoba two morphological The from substantial 2019). areas exhibiting (Pelham Colorado, forest Manitoba mixed southeastern in and in recognized Dakota other South taxonomically to is east variation and More America Newfoundland. to Mountains hesperis Rocky S. the in from subspecies America North five include These 2019). Graham 2003; Marshall Jezkova & 2008, (Sites evolutionary Stockman data diverged & genetic recently of Bond among basis 2006, conservatism the Wiens and on divergence delineated potential Elith often niche and 2000; lineages, assess Zimmermann associations, to & habitat Guisan framework niches, 2001; comparative Peterson ecological 1985; quantify (Austin to species al. single used Manel of be 2009; distributions may geographic 2001, (Schluter factors, species environmental of and processes reinforcement inferring to and Balkenhol approach diversification integrative 2003; ecological an facilitated to has contribute (ENMs) these that models of niche ecological combination of due development some either approaches, or al. other introgression, using et historical characterize ambiguity, to (Hohenlohe difficult morphological factors historically (Andrews diversification, been organisms have recent non-model that to in taxa divergences in greatly limits genomic (SNPs), species fine-scale polymorphisms recent, nucleotide al. single detect of et to (Baird thousands capacity RADseq of our as genotyping phylogenetic such extending enabled techniques strengthen have sequencing methods and DNA related high-throughput speciation so. of of done development convincingly modes The not on mutation- have approaches or inferences traditional valuable selection when without provide inferences ecologically- speciation thereby that (e.g., can niche suggest while implicated frameworks speciation), would are ecological processes species (i.e., speciation, order Graham speciation other sister underlying that 2003; between process suggests (Sperling principal divergence has conservatism integrity a niche selection is genomic selection significant divergent species’ divergent events, ecologically-based example, based a whether speciation For of into ongoing reinforcement insight and 2004). and permits historical speciation information to of of (Carstens contributed resolution sources species two sister the these parapatric in of or aid allopatric can for divergences particularly ecological and genomic oc 95 ufr 09.Attlo 6seisaecretyrcgie in recognized evolutionary currently understood Scott are poorly 1947; with species peris Grey complexes 16 & species 1947; Passos of several (dos Grey and total relationships & subspecies A Passos variable (dos morphologically 2009). lineages 110 Dunford component among 1975; relationships Moeck evolutionary ambiguous and variability genus butterfly The 06 01 Phillips 2011; 2006, 21)hv eetyson ae nalmtdnme fseies usata eoi divergence genomic substantial specimens, of number limited a on based shown, recently have (2019) Ewrs 84 and 1864) (Edwards, 06.Sc eoi aahv rvnvlal o lrfigpplto yaisadre-assessing and dynamics population clarifying for valuable proven have data genomic Such 2016). 08 Hinojosa 2018; hc a 1sbpce curn ndirmaosadoe oet hogotwsenNorth western throughout forests open and meadows drier in occurring subspecies 21 has which , tal. et tal. et sensu .hesperis S. tal. et Speyeria 09 Riva 2019; 06.EM,gnrlyprmtrzdb eaigseis curne ogeographic to occurrences species’ relating by parameterized generally ENMs, 2016). oi 02.Icroaino clgclifrainit eei pce delimitation species genetic into information ecological of Incorporation 2012). Nosil tal. et tal et .atlantis S. 03 Wagner 2013; cde,17 Lpdpea ypaia)i elkonfrisphenotypic its for known well is Nymphalidae) (Lepidoptera: 1872 Scudder, ouain curn ot n ato h ok onan (throughout Mountains Rocky the of east and north occurring populations 06 imrane l 00.Hwvr Nsmyas eue na in used be also may ENMs However, 2010). al. et Zimmermann 2006; . 09 Hundsdoerfer 2019; tal. et Ewrs 82 omalrecmlxcnann 6sbpce (Pelham subspecies 26 containing complex large a form 1862) (Edwards, 09 Thompson 2019; .atlantis S. tal. et tal. et .hesperis S. tal. et tal. et 03 Escudero 2013; 98 ufr 09 ehm21) mn these, Among 2019). Pelham 2009; Dunford 1998; htaebodydsrbtdi oie odad across woodlands conifer in distributed broadly are that tal. et 09 Loera 2009, 98 pe arn20) diinly Campbell Additionally, 2005). Warren & Opler 1998; 2 tal. et ob useisof subspecies a be to 09.Aogiegnmcavne,continued advances, genomic Alongside 2019). 09.Hwvr h aarmi icl to difficult remain taxa the However, 2019). tal. et tal. et tal. et 02 Newton 2012; 03.Frhroe h integration the Furthermore, 2013). 04 Vargas 2014; tal. et 03.Rcn okrecognizes work Recent 2013). tal. et .atlantis S. Speyeria tal. et tal. et 07 eMoya de 2017; tal. et 07 Abdelkrim 2017; 2020). swl sover as well as , ae noverall on based tal. et 04 oa & Kozak 2004, 08 and 2008) .hes- S. tal. et tal. et tal. et et et Posted on Authorea 1 Oct 2020 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.160157511.10593343 | This a preprint and has not been peer reviewed. Data may be preliminary. hlgntcaaye o ohthe both for analyses maximum Phylogenetic a to analyses locus genetic per population data and missing Phylogenetic global the reduce further (Danecek to 0.1.14 and 20%. 3% of vcftools of in frequency allele the filtering minor in minimum SNP processing additional final Paris conducted of We during principle output “r80” was (the population locus single any of 80% in in construction catalog the during following: locus the for except settings novo Campbell follows (Poland mitochondrial extraction, DNA processing and of generation data subspecies Molecular 14 in specimens of 113 specimens of outgroup comprised was of dataset subspecies Campbell among total by relationships The analysis in . Structure discordance mito-nuclear SNP-based showed since between additionally admixture, hybridization Gage. putative contemporary E. limited of and indicated collection (2019) ancient reference markings both of personal field for the specimens or test 1975), University to Young (Moeck -20 Brigham addition at information at In collection frozen range Museum Bean or including the ethanol in sources, in multiple preserved Campbell using Following (Bird either made sample. were were each and from subspecies nets extracted aerial was using DNA collected were Specimens species identification stable and to collection Specimen contribute should that 2012), ecological Methods Nosil and their broad 2003) 1990; to demonstrates (Sperling integrity Andersson contributor methods genomic 1976; likely these the Valen delimitations. as a of such (Van is integration concepts, concepts species selection Our species of divergent integrity. reconciliation recovered ecologically-based the genomic to for whether of utility contributed infer reinforcement alternate have to and testing may basis speciation 2003), that genomic (Sperling a factors contact on phylogeographic of and regions diversification patterns delimitation in the genetic species integrity in involved genomic on factors genomic their hypotheses extrinsic strengthen maintain the to that of modelling some ecological populations of clarify use to the the and for of niches boundaries proof-of-concept ecological species a in of provide variation to assessments there thus to is ambiguities, correspond objective taxonomic divergences Our some genomic clarify whether helped assess associations. have habitat to assessments and attempts genetic no these been While have where 2017). is regions Sims western complexity 2013; in phylogenetic declines Further population for significant initiatives including conservation lineage”). complex, for this (“southern implications of important part States considered been United historically between not have the intermediates that species of with interactions regions by provided Basin Great and sn h tcsv . ieie(Catchen pipeline 2.3 v. Stacks the using tal. et .atlantis-hesperis S. tal. et 95 rc afa 03 ufr 09 Warren 2009; Dunford 2003; Kaufman & Brock 1995; 02 irr rprto n eunig n nta iohnra n N aaprocessing data SNP and mitochondrial initial and sequencing, and preparation library 2012) .atlantis, S. . eadtoal s Nst opr clgclnce feouinr iegsidentified lineages evolutionary of niches ecological compare to ENMs use additionally We tal. et .hesperis S. .cbl cybele cybele S. 21,21)ads r o ecie ndti ee Nswr genotyped were SNPs here. detail in described not are so and 2019) (2017, 8seiesi orsbpce of subspecies four in specimens 18 pce ope.W use We complex. species .atlantis S. COI and cstacks COI eeapicto,dRD(Peterson ddRAD amplification, gene n Fbiis 75 TbeS1) (Table 1775) (Fabricius, .zerene S. aaee,wihcnrl h ubro imthstlrtdper tolerated mismatches of number the controls which parameter, and eeadtefitrdgnmcSP eecnutdi IQ-TREE in conducted were SNPs genomic filtered the and gene a e o2isedo ;w nyrtie oita eefound were that loci retained only we 1; of instead 2 to set was , .hesperis S. populations tal. et Bidvl 82 (Campbell 1852) (Boisduval, 3 .zerene S. enovo de .zerene S. 01 Rochette 2011; .hesperis S. eicue eea pcmn of specimens several included we , tal. et rga fSak ordc eoi linkage. genomic reduce to Stacks of program .zerene S. tal. et Nst eoe itntgntccutr of clusters genetic distinct recover to SNPs hc a utpesbpce experiencing subspecies multiple has which , 07;adasnl,rno N rmeach from SNP random single, a and 2017); and tal. et . 21) opooia dnictosto identifications morphological (2019), tal. et and .hesperis S. .hsei,S atlantis, S. hesperis, S. tal. et 02,adcmaio ospecimens to comparison and 2012), n o hlgntcaayi,two analysis, phylogenetic for and , .zerene S. .hesperis, S. tal. et 01 orti nylc iha with loci only retain to 2011) 09 ihdfutparameter default with 2019) tal. et 02 n w-nyeGBS two-enzyme and 2012) n hlgntcanalyses phylogenetic and , oocr(McHugh co-occur 09.Ti a have may This 2019). 9seiesi three in specimens 19 and .zerene S. .zerene S. ° until C tal. et tal. et de to Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. pce eiiain n lent sinet eoee ntecutrn n hlgntcaaye:(i) and analyses: the following phylogenetic model, and hesperis species” clustering “2 the the in (ii) recovered assignments “ alternate the of TESS and and models Structure delimitations delimitation hybridization. the Wespecies contemporary species in Utah). by admixture five biased and genomic tested weren’t from no Colorado, We analyses each or Idaho, these little specimens from that had three (sampled ensure that and analysis to population population, BFD* results central Arizona the the for and and individuals Mexico chose population New Utah the southern from the Dakota) specimens South four and sampled we Colorado, follows: Montana, as Columbia, clusters Colorado) British genetic and major the Ontario, from Using Alberta, specimens analysis. from six the from to up three each between speed six included to we below), 27 described to (results individuals of number total (Grummer Factors of Bayes (Leach´e estimations calculate model different to supported represents used are that which distribution model, posterior delimitation probabilities a (Bryant inferring outputs tree by then SNPs species from and the times change, divergence frequency and sizes, allele population of effective trees, (Leach´e estimate to BFD* (MSC) using testing (Bryant delimitation plug-in species of conducted value We each of analyses introgression runs and 10 delimitation the the Species recommendations, for sampled program score Following and (DIC) Structure), optimal runs. Criterion in the MCMC Information “generations” identified Deviance the to the during (analogous averaged variance sweeps estimation. we and (Durand 200,000 neighbourhood parameter and model Voronoi interaction 50,000 admixture These during spatial of CAR sampling period specimen. the unequal burn-in each using or a for analysis irregular coordinates this with geographic ran regions We the for between correct distances weights Euclidean pairwise computing by optimal the We determine (Pritchard and prior. ) runs a average as to 2015) locations of sampling cluster using northern without each model for admixture runs results. the TESS these Mountains; using influenced Rocky run tested geography the were that of extent analyses northeast the Structure and clarify southwest to sampled compared were populations Structure of non-spatial and sets in genetically over-splitting to inferring population corresponded to for that contribute can component frequency, which genetic allele spatial when (Chen distance, useful programs by in particularly a isolation is changes to This incorporating discontinuities. on correlates geographic additionally structure based from result clustering by may that population Structure populations disparate to population from approach the differs Bayesian of TESS similar representation a visual (Chen take a 2.3.1 programs outputs version which TESS 2015), and Bryant between & relationships (Leigh PopART genetic program the in Mitochondrial (Hoang strapping (Nguyen 1.3.10 .atlantis S. priori a K nonrhr n otenseis i)te“ pce”mdl hc up northern lumps which model, species” “3 the (iv) species; southern and northern into -0wt uni eido 5,0 eeain,7000MM his n 0replicate 10 and chains, MCMC 750,000 generations, 150,000 of period burn-in a with 1-10 = tal. et K oe,floigtecretseisdlmtto for delimitation species current the following model, ” COI .zerene S. n v h 6seis oe,floigthe following model, species” “6 the (v) and ; tal. et tal. et au.W locnutdsprt usrcueaaye for analyses substructure separate conducted also We value. tal. et tal. et 00.W sdTS oinfer to TESS used We 2000). eedt a sdt ul iiu pnighpoyentok(Bandelt network haplotype spanning minimum a build to used was data gene .hesperis S. 05.Mdltsig HaR rnhtsig n 00rpiae futaatboot- ultrafast of replicates 1000 and testing, branch SH-aLRT testing, Model 2015). 07.Campbell 2007). 02 o ES (Bouckaert 2 BEAST for 2012) K 08 eecnutdi h program. the in conducted were 2018) smldfo let,Iao eaa n aiona and California) and Nevada, Idaho, Alberta, from (sampled tal. et stelws au of value lowest the as tal. et 02.BD upt agnllklho siain ME o ahspecies each for (MLE) estimations likelihood marginal outputs BFD* 2012). COI 04.Bcueti rga scmuainlydmnig erdcdthe reduced we demanding, computationally is program this Because 2014). tal. et testing , altps eue tutr eso .. (Pritchard 2.3.4 version Structure used We haplotypes. , 07 oifrpplto tutr nteSPdtst hl both While dataset. SNP the in structure population infer to 2007) tal. et v rmtenrhr lse of cluster northern the from five K K tutr eut;(i)te“ pce”mdl hc splits which model, species” “4 the (iii) results; Structure 2 = 21)soe toggntcsbsrcuigwithin sub-structuring genetic strong showed (2019) - o ah eue LMA .11(Kopelman 1.1 v. CLUMPAK used We each. for 1-5 = .alni,S hesperis, S. atlantis, S. K K K twihteDCsoe stabilized. scores DIC the which at -,adicroae egt nteVrninetwork Voronoi the on weights incorporated and 2-7, = osdrn both considering 4 tal. et , 04.SAPue h utseiscoalescent multispecies the uses SNAPP 2014). n,frtesuhr lsesof clusters southern the for and, K tal. et tutr n ESresults. TESS and Structure 6 = tal. et .zrn,S hesperis, S. zerene, S. Δ K 09 o 0rpiae per replicates 10 for 2009) and K 04 mlmne nteSNAPP the in implemented 2014) .hesperis S. tal. et tutr nlssa guide a as analysis Structure 6 = (Evanno .zerene, S. .zrn,S atlantis, S. zerene, S. 04 n eemn h best the determine and 2014) tal. et smldfo Alberta, from (sampled nomdb existing by informed .atlantis S. 05 n LnPr( and 2005) and K .hesperis S. tal. et .atlantis S. tal. et .hesperis S. .hesperis S. n then and , (sampled K n the and with , 1999) 2000) tal. et S. K ; , Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. sdt vlaeispeitv oe.Mdleauto a opee sn eevroeaigcharacteristic operating receiver using a completed Hijmans withholding was dismo, evaluation package each Model R power. lists, the predictive (via its locality evaluate ENM for to an different allow used fit five to to used lineage, generated data were polygons each localities the and For these of habitat. to of localities coordinates available clipped 20% sample the were locality to different layers around duplicate polygons data within polygons GIS removed generated convex lineages. we were three minimum points the background km) of 10,000 in (100 each and used to buffered variables assigned generated individuals predictor we variables, sequenced the ENMs, predictor of 1. (GIS) fitting sources File systems Before Supplemental and information in derivation provided geographic environmental; are of or analyses heterogeneous set geographic these across a either distributions and as geographic individuals classified potential sequenced and of associations localities habitat predict ecological (Elith species’ as landscapes quantify well to data as presence-only of niches modelling entropy maximum machine-learning uses identifiedwhich ( lineages genomic species major ecological three whether distinct infer to study, (ENMs) this models niche in ecological of series a created We the of by estimation indicated divergence events niche admixture plotted Ecological edges and model, migration tree each the for likelihood between residuals concordance maximum the and the events, of migration on assessment putative an of including significance evidence, statistical of the of lines multiple populations considering between by events determined migration for putatively a support and Ultimately, populations parental two negative between significantly (Reich event a introgression population populations; an admixed mixed of hypothesis and the parental supports specified TreeMixstatistic the of significance re-ran triplets using and between we weight edges admixture the model, migration estimate estimate each plotted calculated to For substantiate additionally jackknifing of tree. to We used blocks the ons and edge. using results, to migration Kozak our replicates edges each 2012; in migration bootstrap of consistency Pritchard 0-5 100 ensure & to added with (Pickrell times sequentially phylogeny model three then likelihood the and maximum of these SNPs, unrooted between score 50 an admixture likelihood of built experiencing the directionality putatively We improve infers as 2018). to program and order the edges by in migration interpreted populations plots are subsequently which TreeMix residuals, flow. highest gene the have that between pairs introgression lation putative testing. assess to model to BFD* 2010) 2012) (Bouckaert during Pritchard and recovered & 2.0.1 (Pickrell trees v. TreeMix of DensiTree used distribution We used posterior additionally the We in discordance tree. topological credibility visualize clade maximum the generate with distribution, our and 200 (R 3.6.1 R of in distribution implemented the 2012) determine gamma (Revell to a 0.6-99 it us phytools BFD* parameter, used package single and data. the a 2017), using Team with missing IQ-TREE Core distribution also some from gamma We output a contained lineages. have tree dataset between to SNP set size our we population which because in prior, λ, analysis tree differences Yule the probable birth-only a reflect in implements to sites MCMC non-polymorphic via included sampled be parameters to mutation the rate set we recommendations, BFD* Following CCcan,2000br-ngnrtos n 4pt apigses ovrec a sesdusing assessed was Convergence steps. sampling path 24 (Rambaut and 1.6.0 generations, v. Tracer burn-in 200,000 chains, MCMC for distribution parameter narrow the to due oenn pcainrt (Leach´e rate speciation governing .atlantis S. β .atlantis S. o10 olwn rga eomnain,w e u aepir oas olwagamma a follow also to priors rate our set we recommendations, program Following 100. to rei ssaGusa oe oetmt rf ewe ouain,adietfispopu- identifies and populations, between drift estimate to model Gaussian a uses TreeMix . tal. et λ =10, 06 01.Frec ftetrelnae,ipt o Nsicue h georeferenced the included ENMs for inputs lineages, three the of each For 2011). 2006, northern , α 1 and =1, tal. et sensu tal. et λ =65.96, 08,adTeAnttr247(rmod&Rmat20)wsue to used was 2007) Rambaut & (Drummond 2.4.7 TreeAnnotator and 2018), oi 02 Nswr tuigMxn otae(Phillips software MaxEnt using fit were ENMs . 2012) Nosil 09 ikel&Picad2012). Pritchard & Pickrell 2009; β .hesperis S. 20(ec´ n ocar 08,adrnec cnrowt million 1 with scenario each ran and 2018), Bouckaert (Leach´e and =250 tal. et α K 2 and =2, .atlantis S. fl nlss o aho hs v it,termiig8%of 80% remaining the lists, five these of each For analysis. -fold 04.W calculated We 2014). n southern and , λ β β and cl aaee ihan with parameter scale 3..T vi oeta apigba yteprogram the by bias sampling potential avoid To =32.9. f and 5 β threepop ttsisfralpsil obntoso populati- of combinations possible all for statistics 3 niae ytedt,w ute eae our relaxed further we data, the by indicated .hesperis S. .hesperis S. omn mlmne nTreeMix. in implemented command λ .hsei,S atlantis, S. hesperis, S. rmtemxmmlklho consensus likelihood maximum the from u and tal. et f aif rtrafrcasfiainas classification for criteria satisfy , v statistics. 3 α 01 n h ihed2%were 20% withheld the and 2011) o1 n loe h coalescence the allowed and 1, to hp aaee f2 hsgave This 2. of parameter shape .zrn,S hesperis, S. zerene, S. and .zerene S. tal et f 2006), . tests 3 tal. et λ was f to 3 Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. euti eirclgn o.Uigti igerssac ufc,w hncluae arieresistance pairwise calculated then could we that surface, contact resistance into single come will this lineages Using two averaged flow. the We gene 2006; of 2018). al. S´anchez-Ram´ırez reciprocal individuals al. et 2010; in dispersing et Crispo 2008, result that southern 2004; al. probability and stretches Orr et the northern & large Thorpe reflecting the (Coyne habitat; 2007; of within dispersal Beier unsuitable surfaces disperse to & resistance through to barriers likely McRae moving significant et more 2006; Wang when pose are 2010; McRae thereby resistance organisms al. habitat that et high unsuitable Storfer assumes experience 2008; of approach the al. and This as et 2020). habitat parameterized Wang al. surfaces 2007; suitable et Beier resistance MacDonald & using 2012; (McRae southern accomplished suitability al. and was habitat northern This predicted habitat. of the and inverse between suitable diversification of dispersal underlying arrangements to process to barriers evolutionary present-day dominant ( are and the lineages vicariance there selecti- identified (i.e., been divergent the dispersal has ecologically-based of to than speciation), reinforcement barriers rather ecological from speciation), (i.e., resulting mutation-order drift on or genetic selection that without possible speciation therefore is It tributions. southern hesperis and S. northern in The flow gene habitat. to available barriers effect in of negative differences significant Assessment from a and arising structure of Within- biases absence in to the lineages. identical attributed model, of models, effects be localities. effects mixed pair “within- linear mixed random each the each linear the Within of for using of above. evaluations compared described 20% those model again different to were between-lineage a scores withholding and AUC each of within- between-lineage lists, localities repeated using locality actual then lineage random the each We of different for five polygons ENMs km convex using null minimum point/10,000 above of buffered used (1 series habitat the a individuals available to in built sequenced confined differences we from localities possibility, arise this generated may investigate randomly divergence To niche ENMs. of fitted parameterize inferences the was to spurious of ENM that divergence. effect each niche possible negative significant which is suggesting significant It for different, a significantly lineage are model, comparison the effects pairwise of the mixed in ID involved linear lineages the each ENMs, Within five effect. lineage’s overlapping random “within- partially each a these from as fit when lineages, resulting included to ENMs nonindependence of was lineage’s used for one pairs localities control of For To power of the evaluation. localities. sets a in lineage’s model reduction using other significant (1) the a fit between-lineage was predicting there models, or whether quantified effects (0) thereby (Brooks mixed within-lineage pair models glmmTMB a each linear package to for R generalized the corresponded “within- scores in using a variables AUC link models, between-lineage dependent logit effects a and as “between-lineage” and Within- used of distribution data. then estimation beta validation were for the lineages data validation of of fitted its as as polygon within used used convex lineage were ( were other minimum lineage points the localities one Background of lineage’s for scores. we localities ENMs other AUC lineages, the this, the of predicted accomplish and pair To ENMs each polygon. models of For convex set fashion. minimum lineage’s pairwise buffered with one a respective 1, which in – quantified with 0 was power suitability from lineages assessed habitat ranging genomic score predict among suitability predictive divergence to habitat Niche a models’ used received the suitability. were cell higher of grid lineage indicating 1-km estimates each values Each inflated higher for area. avoid study within ENMs to entire habitat fitted the estimation only across evaluation, AUC meaning model in polygon, Following available convex power. minimum as buffered defined ( lineage’s was points each Background polygons power. to predictive confined greater indicating were values evaluation higher with 1, (Phillips and scores 0 (AUC) area-under-the-curve and analysis (ROC) vs. ewe-ieg”bnr aibewudidct htpeitdhbttsiaiiyo h two the of suitability habitat predicted that indicate would variable binary between-lineage” vs. ewe-ieg”bnr aibewudidct htosre ih iegne cannot divergences niche observed that indicate would variable binary between-lineage” vs. .hesperis S. ewe-ieg”bnr rdco aiberflce hte ahACscore AUC each whether reflected variable predictor binary between-lineage” n 1,0)ue nteemdleautoswr ofie otebuffered the to confined were evaluations model these in used =10,000) iegsietfidi hssuyaelreyprpti nterdis- their in parapatric largely are study this in identified lineages sensu 2 o ahlineage). each for oi,21) ots o hspsiiiy easse whether assessed we possibility, this for test To 2012). Nosil, .hesperis S. 6 iegst eeaeasnl eitnesurface resistance single a generate to lineages K tal. et fl nlsswscmltda described as completed was analysis -fold 06.ACsoe r on between bound are scores AUC 2006). tal. et .hesperis S. 07.Wti iermixed linear Within 2017). n 1,0)ue nmodel in used =10,000) iegsrelated lineages Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. rmnrhr th dh,suhr otn,adsuhetr ooao;2 otenUa;ad3 New 3. and Utah; southern 2. Colorado); southwestern and Montana, southern Idaho, Utah, northern from southern separated further at peak small eio n rzn)with Arizona) with and Colorado) Mexico, southeastern and California, Dakota, southern South Montana, Alberta, of Columbia, values optimal two suggested data supported SNP of analysis Structure TESS and Structure, network, Haplotype the on monophyly Campbell in with reduction a and discordance mito-nuclear extensive nausicaa of capitanensis h. S. hesperis S. and Colorado, southwestern chitone h. S. contained grade southern Dakota, irene South from h. 1945 S. Grey, & Passos dos Montana and with 1926) southern paraphyletic and northern additionally the was mean for relationship 204, shows polyphyletic depth: a tree indicates locus species max. SNP 8, The depth: locus (min. SNPs informative. 1026 phylogenetically The contained 65.6). dataset depth: SNP locus our filtering, After analyses phylogenetic and construction Dataset southern and northern the Results of that reinforcement suggest would to distance contributes Euclidean out likely hesperis inte- partialling selection genomic after divergent the distance reinforcing lineage ecologically-based mechanism and principal distance from a resistance southern resulting as between and flow out northern gene ruled the reduced be of that cannot grity for suggest (vicariance) would controlling dispersal distance after to Euclidean barriers out between lineage partialling li- exist same after different dispersal distance the to of southern lineage barriers of individuals and significant between individuals whether northern evaluates distances between effectively the resistance those analysis This whether than distance. evaluate greater Euclidean different to significantly to test were or mantel 0) neages partial = Euclidean a (value matrix of used lineage pairwise matrix We same third pairwise the A a 2005). to 1). Bivand generated = belonged & (value also individuals (Pebesma lineages We sp whether 2007), package 1996). indicated in R Beier distance”) al. the individuals) & (“lineage using et sequenced (McRae in- individuals (Chandra of distances all them surface) localities between circuit organized (i.e., distances resistance to and nodes (i.e., analogous 2017) between graph are time Etten a distances commute (van random-walk resistance gdistance expected package These measuring matrix. R pairwise the a using to individuals all between distances .h dorothea h. S. lineages. .h brico h. S. , .hesperis S. .hesperis S. Ewrs 84 rmAioa hlgntccmaio fteSPand SNP the of comparison Phylogenetic Arizona. from 1874) (Edwards, .h dennisi h. S. K Bidvl 89 rmClfri,btddntehbtcnitn egahcsbcutrn.The sub-clustering. geographic consistent exhibit not did but California, from 1869) (Boisduval, rmNwMxc n rzn,adbodysprtdasuhr e eia ouainof population Mexican New southern a separated broadly and Arizona, and Mexico New from Fg 1,wihgopdabodynorthern broadly a grouped which S1), (Fig. 2 = K Ewrs 89 ape nsuhatr Utah, southeastern in sampled 1879) (Edwards, tal. et .h tetonia h. S. .Ti resolved This 6. = oc,14 ape nteSni onan fNwMxc,wihcutrdwith clustered which Mexico, New of Mountains Sandia the in sampled 1947 Moeck, oln,18 ape nteScaet onan rmtemr otenpopulation northern more the from Mountains Sacramento the in sampled 1988 Holland, ieg a itrto sister was lineage lse seiesfo th dh,suhr otn,suhetr ooao New Colorado, southwestern Montana, southern Idaho, Utah, from (specimens cluster (2019). oda ct pmr 98and 1998 Spomer, & Scott Kondla, o ass&Ge,14 rmAlberta, from 1945 Grey, & Passos dos COI .zerene S. .hesperis S. .hesperis S. .h viola h. S. aae osse f68sts(7 nain n 1vrat,5 fwihwere which of 58 variant), 71 and invariant (577 sites 648 of consisted dataset o ass&Ge,14 ape nnrhr thadsuhr Montana, southern and Utah northern in sampled 1945 Grey, & Passos dos .cybele S. .hesperis S. .zrn,S atlantis S. zerene, S. .atlantis, S. Fg a.LnPr( 2a). (Fig. o ass&Ge 95fo dh.Amnpyei ld contained clade monophyletic A Idaho. from 1945 Grey & Passos dos notregorpial-endcutr:1 cnrl (specimens “central” 1. clusters: geographically-defined three into , iegs infiatcreainbtenrssac itneand distance resistance between correlation significant A lineages. .zerene S. .atlantis S. .h ratonensis h. S. iegs lentvl,teasneo infiatcorrelation significant a of absence the Alternatively, lineages. n otie subspecies contained and n tefhdtomjrgorpi ruig.Alarge A groupings. geographic major two had itself and , 7 and , K n northern and , .zerene S. supported ) .h beani h. S. ct,18 rmsuhatr ooao and Colorado, southeastern from 1981 Scott, .h hutchinsi h. S. K .hesperis S. .h electa h. S. .hesperis S. niaigsbtutr ntedata. the in substructure indicating , smnpyei lds(i.1a) (Fig. clades monophyletic as rmBiihColumbia British from COI K .hesperis S. and 6 = .h beani h. S. lse seiesfo British from (specimens cluster re(i.1) agl consistent largely 1b), (Fig. tree iegs h otenlineage northern The lineages. o ass&Ge,14 from 1947 Grey, & Passos dos Ewrs 88 ape in sampled 1878) (Edwards, COI sdsic clusters distinct as Δ Bre Benjamin, & (Barnes K diinlyhda had additionally aaesrecovered datasets .atlantis, S. .h lurana h. S. , n a and , , .h. S. and Δ but S. K Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. otenppltos(i.2) usrcueaayi of analysis Substructure 2b). (Fig. Idaho populations the northern and populations, Utah Colorado southern southwestern Mexico the New Similarly, northern the of Two nausicaa Arizona. and Mexico zerene re oee h eodms omntplg 71 ftesmldtes hw nprl)depicted MCC purple) the in in shown presented trees, topology the sampled the of the as 87.4% of same for the (7.1% accounted was topology which panel), common topology, right most recovered 3a, zerene second major Fig. the The in however blue tree, tree. in between the (shown relationship on trees the sampled species in discordance indicated three indicated additionally remaining results tree the BFD* MCC the of The visualization DensiTree panel). left 3a, northern (Fig. and 0.87 of southern probability the that posterior a 1) with of supported probability (posterior clade northern the LnPr( of both analysis substructuring; Substructure geographic Alberta. and Montana eaieBysFco B) niaigta hsmdlwstebs upre ytedt Tbe1.The 1). (Table data the northern by the supported for support best strong the indicated was tree model (MCC) this credibility that clade indicating maximum ( (BF), model Factor species” Bayes “4 negative the southern BFD*, in and tested , models species five the Californian Of the of analyses rest introgression the and with delimitation consistent Species haplotype a had and California, from instead 2a). hesperis (Fig. S. cluster one single hybrids: a likely supporting into further the S1), populations of Utah (Fig. that southern to and score central DIC the lumped it that given supported and TESS locality, sampling to corresponded that data estimate the in sub-structure meaningful any eaawsietclt h major with the admixed to identical was Nevada third the Interestingly, the from distinct haplogroups. of most consisted the was group containing and group One Utah and Montana, haplogroups. Idaho, distinct Alberta, three with network, Comparatively, “ location. major sampling the in one from observed Manitoba; different not and minimally from distinct Alberta were marginally in Manitoba were sampled Colorado 1940) from each Chermock, specimens for 1998 group haplotype Spomer, & major the and specimens canadensis “distinct” the For between differences species. nucleotide either two within or variation one haplotype distinct little depicted very network was there haplotype spanning minimum The hesperis with admixed was sbslt h te he species three other the to basal as rmClfri htwr o hrdb any by shared not were that California from population. K pcmn perdt emxue ewe h otenNwMxc n eta populations. central and Mexico New southern the between mixtures be to appeared specimens dsPso,13)seiessmldi nai n ubcadthe and Quebec and Ontario in sampled specimens 1935) Passos, (dos ouain u igeseie of specimen single our population, ,w ugs that suggest we 1, = .atlantis, S. .hesperis S. .hesperis S. .hesperis S. .hesperis, S. K .zrn zerene zerene S. .hsei/.zerene hesperis/S. S. .zerene S. .hesperis S. 5 = . hshpoyevrainlreycreae osmln oain-eastern - location sampling to correlated largely variation haplotype this K hsaayi a agl ogun ihthe with congruent largely was analysis This . nteSrcueo ESaaye fSP;tesnl likely single the SNPs; of analyses TESS or Structure the in a h ihs agnllklho siae(L)adtems strongly most the and (MLE) estimate likelihood marginal highest the had ) ESrsls u aldt d n ennflgorpi substructuring geographic meaningful any add to failed but results, TESS 5 = n one and , n lotalteseiessmldhdietclhpoye eadesof regardless haplotypes identical had sampled specimens the all almost and K ape rmsuhr thsae netywt h otenNwMexico New northern the with ancestry shared Utah southern from sampled sotmlfrti nlss ohTS n tutr niae few a indicated Structure and TESS Both analysis. this for optimal as 5 = K , rmClfri a nemdaebtenthe between intermediate was California from .zerene S. u h ld containing clade the but .hesperis S. samr ennflresult. meaningful more a is 1 = .zerene S. K .zerene S. .h electa h. S. . ld iegdsihl ale hntecaecontaining clade the than earlier slightly diverged clade and ) hr oooyacutn o h eann .%o h sampled the of 5.4% remaining the for accounting topology third A .h viola h. S. a uhmr altp iest ntemnmmspanning minimum the in diversity haplotype more much had .hesperis S. altp htwsfudin found was that haplotype Δ altp Fg c,btteeidvdasddntappear not did individuals these but 2c), (Fig. haplotype K rmClfri hrdacsr ihtesuhr Utah southern the with ancestry shared California from niae noptimal an indicated .zerene S. pcmn eeitreit ewe h eta and central the between intermediate were specimens 8 pcmn eeitreit ewe h eta and central the between intermediate were specimens priori a hollandi .h dorothea h. S. .zerene S. ape rmClfri (putatively California from sampled .alni hollandi atlantis S. pcmn ape rmNvd,Ua,Idaho, Utah, Nevada, from sampled specimens .hesperis S. .zerene S. .atlantis S. pce Fg c.I l ae hr eeonly were there cases all In 2c). (Fig. species altp.Ti egahcptenwas pattern geographic This haplotype. ” .zrn,S atlantis, S. zerene, S. .hesperis S. niae eei ieecsin differences genetic indicated K K pcmn n l h Arizona the all and specimens .zerene S. n southern and lse i o niaeadditional indicate not did cluster tutr eut,ecp that except results, Structure 6 = .alni hollandi atlantis S. and f2 oee hsddntproduce not did this however 2, of .zrn gunderi zerene S. and .hesperis S. rmAbraadoefrom one and Alberta from .hesperis S. .alni sorocko atlantis S. .hesperis S. .atlantis S. pcmn ape from sampled specimens K .zerene S. .hesperis S. .zerene S. northern a similar a had 6 = lses however clusters, and and n second a and , Cemc & (Chermock ape from sampled .zerene S. .h irene h. S. Δ yrdwas hybrid specimens. .hesperis S. .atlantis S. .atlantis S. .atlantis S. .atlantis S. K .zerene S. a less was Kondla cannot .h. S. and S. S. ) Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. niae osgicn euto ntepwro ahlnaesnl Nswe rdcigterandom “within- the the predicting of when coefficient ENMs the null Specifically, lineage’s each scores each lineages. AUC to of other resulting confined power the addressing localities the for models generated in generated effects randomly reduction localities mixed with significant Linear built no polygon. ENMs indicated convex null minimum using buffered niches. repeated lineage’s ecological were and associations analyses habitat These their in divergent significantly “within- are the comparison: lineages of pairwise southern coefficient and each northern The for ENMs and lineage’s negative lineages. each two significantly of other was power the northern variable the of predictor in each reduction binary of significant lineage” Contributions localities a was the ENMs. there predicting to the indicated when of contribution models each effects greatest For mixed the lineages. 2. Linear had Table among in days considerably reported degree varied of are convex variables growing permuted, minimum other density and as randomly buffered of cover measured was highest their variable ENMs, land each the to to lineages, after variables adjacent that three scores predictor and AUC environmental and are in within and drop lineages niches, found geographic the genomic of generally ecological contributions three is and Relative the lineage polygon. associations each that for suggests habitat habitat landscape respective suitable study their entire in the divergent across surfaces as suitability polygon, southern habitat convex and minimum 0.063), = buffered lineage’s s.e. scores: each AUC within mean by suitability indicated habitat predicted sufficiently ENMs divergence niche Ecological northern and Mexico/Arizona central the into ( population from admixture 61 the Of 0.01, = weight (migration 0.09, atlantis = weight (migration between events migration significant, statistically still the but than rather 0.07, substantial, branch = less the weight two of (migration Pritchard indicated interior & (Pickrell the also populations along model between unsampled plotted from This edge admixture migration or of a 2012). admixture historical 0); addition indicate = continued may containing p that terminus branch 0.41, suggesting the = similar, weight of The highly migration interior fit. models were improve the for greatly events from not plateau migration did event to point five seemed this 1. this and past but events Fig. four, S2), migration in phylogeny three, (Fig. events SNP for improved migration the indicated plots of with generally addition consistent model migration the largely each with was likelihood for fit maximum residuals S2) population unrooted Fig. pairwise The the 3b, containing species. Heatmaps (Fig. these TreeMix between in introgression for produced test phylogeny to TreeMix used we analyses, between relationship discordant the Given depicted The orange) in (shown trees ouainand in population populations: some between weights models Though 0.017). .atlantis S. mgainwih 0.02, = weight (migration .hesperis S. .zerene S. .zerene S. f et o nrgeso htw optd nytrewr infiat(al 2;oeindicated one S2); (Table significant were three only computed, we that introgression for tests 3 p ) n nte from another and 0), = /northern .atlantis S. .zerene S. .hesperis S. n h ld otiigteNwMxc/rzn n central and Mexico/Arizona New the containing clade the and n h ld otiigtecnrladNwMexico/Arizona New and central the containing clade the and ( β -0.292; = m p .hesperis S. and 4 p a o-infiat(irto egt 0.02, weight: (migration non-significant was .hesperis S. p n h otenUa ouainof population Utah southern the and .4)wr non-significant. were 0.345) = .hesperis S. .3) n between and 0.037), = ouain( population .5) ewe h e Mexico/Arizona New the between 0.058), = .hesperis S. .atlantis S. m .zerene S. p p a iia eiul Fg 2,te niae o-infiatmigration non-significant indicated they S2), (Fig. residuals similar had 5 0.028), = .9) n ewe h eta n northern and central the between and 0.092), = itrrltosi a ossetyrcvrdi ahsmldtopology. sampled each in recovered consistently was relationship sister m ma .0,se .8)(i.4.Vsa npcino predicted of inspection Visual 4). (Fig. 0.085) = s.e. 0.803, = (mean ( β .zerene S. h irto debtenteNwMexico/Arizona New the between edge migration the 4 -0.333; = ma .0,se .8) northern 0.084), = s.e. 0.809, = (mean .zerene S. p ouain notecentral the into populations stesse ao othe to taxon sister the as .0) h hr etsgetdgn o rmteNew the from flow gene suggested test third The 0.003). = .atlantis S. .zerene S. n h e eioAioapplto of population Mexico/Arizona New the and p and .zerene S. .2) oehr hs eut niaeta l three all that indicate results these Together, 0.023). = 9 .alni/.hesperis atlantis/S. S. n southern and otecentral the to and m .atlantis S. oe niae usata migration substantial a indicated model 3 .hesperis S. .atlantis/ S. .hesperis S. .hesperis S. p .hesperis S. .6) n in and 0.169), = .hesperis S. mgainwih 0.07, = weight (migration norpyoeei n BFD* and phylogenetic our in notecentral the into northern .hesperis S. ouain( population ( β .hesperis S. .hesperis S. .hesperis S. -0.6811; = ouain(i.3b; (Fig. population vs. ouain and populations m .hesperis S. between-lineage” -;teresidual the 3-5; ma 0.810, = (mean .atlantis S. m vs. p h edges the 5 populations populations populations .hesperis S. .hesperis S. .hesperis S. 0.035). = p between- < clade. 0.00), p and S. = Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. u N hlgn Fg a.Hwvr h eainhpof on relationship work the phylogenetic However, between molecular recent 1a). as between interesting, (Fig. relationship phylogeny sister SNP a our and between species, history distinct evolutionary as shared complex, zerene a S. suggest analyses genomic Our between introgression historical of contexts for conservation group Evidence in meaningful species approach and confusing This stable more notoriously science. concepts. and be citizen a species quantitatively ultimately ecological and uses should incorporate and that study genetic delimitations to of Our species integration phenotypic difficult supports analytical accounts. only the been an history – demonstrate are have characteristics natural to genes – butterflies ecological to environment by But relegated informed their usually delimitations more available. with are Species increasingly individuals become as species. of have renaissance a interactions a methods makes undergone what analytical have of and data part genetic technology by sequencing informed of sophisticated are present separation that might delimitations geographic that Species habitat the suitable to of attributed arrangements to best less Discussion not is are and lineage this parapatry) dispersal. each lineage, to (i.e., of barriers lineages other individuals significant the the dispersing of of although ranges that, individuals the suggests contact related result to significantly This not likely was distance. lineage) different Euclidean vs. and (same ( distance distance lineage resistance that lineages. to indicated hesperis three test S. the mantel among in partial A habitat flow available gene in to differences barriers from of arising Assessment biases to attributed be cannot niches 0.015; atlantis for S. non-significant was variable predictor binary hl h eut forSPbsdaaye n h ako eunevraini h iohnra haplotype mitochondrial the in variation sequence of lack the between and analyses shared SNP-based our of results and the between While admixture introgression SNP-based probable by indicated to explicitly due more also is but analyses, phylogenetic our southern al. et neto i nrgeso ffr luil n etbehptei htcudepanteosre haplotype observed the explain could that hypothesis testable and plausible a offers introgression throughout via infection haplotypes mitochondrial particular (Werren(McHugh variation of haplotype spread reducing Ahmed the is species, facilitate process and can populations a that such bacteria (Sonsthagen facilitating endosymbiotic haplotypes recently for has mitochondrial event variable bottleneck candidate severe other a A of to leading loss sweep the selective strong of caused a range that the suggesting variability, across genetic SNPs in nuclear mtDNA, to contrast In in and haplotype TreeMix this them; of pervasiveness between the flow gene of entire zerene, the direction throughout the occurs clarify sufficiently from introgression not indicated do they species, p 09.I slkl httercvrdplpyyof polyphyly recovered the that likely is It 2019). ugssta toiiae in originated it that suggests tal. et .zerene S. .1) hs eut niaeta bevddffrne nhbttascain n ecological and associations habitat in differences observed that indicate results These 0.810). = .hesperis S. tal. et F*seisdlmtto upre eaaino otenadsouthern and northern of separation supported delimitation species BFD* . n southern and 2015). 03,btntytwti the within yet not but 2013), .zerene S. and Wolbachia and .hesperis S. r -0.111; = .hesperis S. .zerene S. and .zerene S. .hesperis S. .hesperis S. netoshv enrpre nseveral in reported been have infections / p rsne eei atydet h msino other of omission the to due partly is here presented .atlantis S. .8)atrpriligotterltosi ewe ieg distance lineage between relationship the out partialling after 0.987) = ( .hesperis S. into .zerene S. β 0.020; = ape ag,btol nteNvdnpr fterneof range the of part Nevadan the in only but range, sampled .hesperis S. upr yohsso nrgeso ewe hs non-sister these between introgression of hypothesis a support .hesperis S. d Moya (de .atlantis-hesperis S. .atlantis S. Wolbachia n ovldt t origin. its validate to and p xaddsmln fbt pce snee oclarify to needed is species both of sampling Expanded . .5) n otenadsouthern and northern and 0.751), = 10 .atlantis S. oee u neec htthe that inference our however , Speyeria and .hesperis S. tal. et COI n northern and etg&Wlah 94 maternally-inherited, 1924, Wolbach, & Hertig .hesperis S. .zerene S. .zerene S. altp nlss(i.2 i.3 al S2). Table 3, Fig. 2, (Fig. analyses haplotype 07 Campbell 2017; eeal niae o-itrrelationship non-sister a indicates generally tal. et n northern and ope (Hamm complex ontso h aeovosreduction obvious same the show not do Speyeria 08 Kodandaramaiah 2008; n h itrrltosi between relationship sister the and oteohrcae aid hsis This varied. clades other the to tal. et .hesperis S. .hesperis S. 07 us ign 2005). Jiggins & Hurst 2017; tal. et .hesperis S. .alni,S hesperis, S. atlantis, S. pce,including species, tal. et ( 07 09 Thompson 2019; 2017, β .hesperis S. and -0.012; = .hesperis S. Speyeria 2014). .hesperis S. ossetwith consistent , .zerene, S. tal. et p Wolbachia haplotype pce in species 0.822), = .zerene S. f lineages tests 3 which 2013; ( β and S. = Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. eetv fti eetrneepnin swl sdet e arest ipra n eeflwi the in flow gene and dispersal to barriers few to due northern as of southern well habitat of and as forest habitat and retreat, expansion, prairie/open northern other glacial in range homogenous each structure last recent relatively with population the this genes recovered of exchange following of lack to reflective the habitat continued then available initially accurate, newly is have hypothesis may into lineages region these geographic that this northern from in southern poleward substructure and any northern recover sampled to southern failed of TESS population and northern Structure between Both flow southern 3a). and hesperis northern (Fig. splitting S. for species support clear distinct indicated into SNPs using delimitation species BFD* maintained be Our from should regions. distinct lineage sympatric subspecies evolving the in between independently sorocko, even differentiation species, SNP a. between and S. flow mitochondrial hollandi, little gene relatively limit indicate to also enough results significant likely are Riva associations surveys genetic 2009; other Dunford with 2001; consistent are results these between Canada); Alberta, in (mostly (Thompson ranges their of southern portions the between within admixture clusters of genetic northern any the to that than show consistently results hesperis Our S. in limits species of 4). in divergence Assessment (Fig. geographically-structured lineages clear the demonstrate between landscape associations study southern entire habitat and genomic the to northern across lineages’ barriers between habitat parapatry the with suitable present-day of associated that speciation reinforcement suggest hesperis therefore mutation-order S. and We or diversification vicariance). selection (i.e., to without dispersal dis- contributing speciation selection to divergent process than barriers ecologically-based rather significant principal that integrity, hypothesis any the the detect between to been not support habitat did has further available southern analyses conferring localities in them, resistance-based and between random differences our northern persal from distributions, to although their built Furthermore, attributed in ENMs be allopatric divergence). null cannot niche separation, divergences (spurious geographic niche lineages of observed degree that to indicate between correspond estimated approximately divergences to genomic indica- of hesperis an that S. as these to of also corresponds Comparison but ing divergence, events. between niche speciation greatest was hesperis of ecological divergence strength reflect niche that likely relative suggest ern divergences the associations models genomic of habitat across not estimates measure their coefficient or in a whether differ as of “within- significantly only tor the data, not of genomic magnitude interpreted relative of be The basis niches. the ecological on and delineated the lineages, that tionary indicated ENMs taxa. of these between Comparison relationship historical divergence the niche clarify of further Evidence may which specimens, sampled the in Nevadan between sharing .hesperis S. hc eeapoiaeyeuvln ntersrnt fpiws ih iegne hsorder- This divergence. niche pairwise of strength their in equivalent approximately were which , .atlantis S. sacneune ahrta as,o clgclybsddvretslcin spoetosof projections as selection, divergent ecologically-based of cause, than rather consequence, a is Fg ,Fg ,Fg 3) Fig. 2, Fig. 1, (Fig. tal. et ept ra egahcsmln,adTeMxand TreeMix and sampling, geographic broad despite olwdb northern by followed , 09 Campbell 2019; .hesperis S. and . .hesperis S. .hesperis, S. .atlantis S. and .hesperis S. .zrn gunderi zerene S. tal. et .a canadensis a. S. ntesuhr ok onan.Tehpoyesaigbetween sharing haplotype The Mountains. Rocky southern the in .hesperis S. 09,adterslso u Nsidct htdvrecsi habitat in divergences that indicate ENMs our of results the and 2019), tal. et Fg ,TbeS) hsi loteapoiaebudr ewe the between boundary approximate the also is This S2). Table 2, (Fig. n northern and southern nrgosweete oocr(Bird co-occur they where regions in . .hesperis S. lhuhdge fdvrecsbtenpiso iegsappears lineages of pairs between divergences of degree Although 09.Ps okhsntddffrne nmcohbttpreference micro-habitat in differences noted has work Past 2019). .atlantis, S. and and iegs oi spsil htnorthern that possible is it so lineages, .hesperis S. .hesperis, S. eaiet h te aai hsstudy this in taxa other the to relative .hesperis S. .atlantis S. .hesperis S. .hesperis, S. .hesperis S. s southern vs. 11 northern n eonto fteseissau of status species the of recognition and , and nteasneo otmoaynceradmixture nuclear contemporary of absence the in ieg smr eeial iia to similar genetically more is lineage ieg,btw i o eoe n evidence any recover not did we but lineage, vs. .hesperis S. .zerene, S. ept apigbt pce nsympatric in species both sampling despite .hesperis S. .hesperis S. ewe-ieg”cecetetmt may estimate coefficient between-lineage” .atlantis S. f nlssidctdpoal gene probable indicated analyses 3 atclryi rna h central the near or in particularly n southern and , and tal. et oprdt h oedisjunct more the to compared n otenadsouthern and northern and .atlantis S. .hesperis S. 95 up Shepard & Guppy 1995; . .hesperis S. Thus, .hesperis S. .atlantis S. .zerene S. .hesperis S. s northern vs. .atlantis S. a epartly be may r largely are .hesperis S. .atlantis S. .atlantis S. .atlantis S. s south- vs. expanded .zerene S. fthis If . evolu- san is S. Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. he,M . ruoJr .V,Wlh .J n aaaa .Y 21)Wlahai utrisand butterflies in Wolbachia (2015) Y. 1-9. A. 12, Kawahara, Zoology, and in J. Frontiers frequency. J. infection Welch, in N. V., structure Puillandre, E. geographic Araujo-Jnr, moths: and Z., P, integrative M. Zaharias, an Ahmed, in Y., 4591-4611. sequencing Kantor, 27, RAD Ecology, using A., Conoidea) Molecular (Turridae, Fedosov, framework. gastropods B., marine of Buge, species Delimiting L, (2018) Aznar-Cormando, J., Abdelkrim, well support. and as computational Alberta for support, of References (www.westgrid.ca) sequencing University Westgrid MaxEnt. for the and Universit´e Laval in (www.computecanada.ca) at at Canada modelling Unit Syst`emes (IBIS) Compute niche Services des as to Int´egrative We Biology et related Bean Biologie Molecular IDs. guidance the de the specimen their acknowledge with Institut and for help to collection, Majchrzak to wish Yasmine specimens specimen additionally and museum We with their Peers to help Michael access Lee, thank for James for also Valgardson University Fisher, Young and Kirby Mark Brigham Ferguson, E.O.C. and Simon at to Dupuis, Biodiversity Museum Sperling, Julian in Thomas Grants Campbell, (ACA) Barb Nazari, thank Association Vazrick Conservation to wish Alberta authors Doctoral by to The – and grant Scholarship Z.G.M. Discovery Z.G.M, Graduate (NSERC) to Canada Council D) Bell Research – Graham Engineering (CGS Alexander and NSERC Sciences an Natural (RGPIN-2018-04920), taxa. a F.A.H.S by of funded range was broad work a This in diversification conservation, lineage for of units patterns Acknowledgements biodiversity influence meaningful that delimit factors taxonomies, delimitation extrinsic stable informing species inte- characterize to of distinctiveness, readily contribute and method ecological to species be multidisciplinary to positioned This could for thus divergence and management. useful is genomic biodiversity science, particularly and links citizen habitat be that in of framework should prioritization visibility coalescent-based conservation analyses high delimitations a multispecies ecological or into species and and impacts grated phylogeographic genetic conservation genetic have for in of that priors subsequent integration modelling delimitations in Bayesian an approach ecological inform this Such of on reciprocally selection. build incorporation modelling. to divergent to the used opportunity ecologically-based great particular, be - maintenance is mechanism may In the there likely that attribute a suggest work. to to we us systematic results, lineages enabled our these have to of surfaces addition each In resistance in landscape integrity and genomic ENMs of using analyses between and divergences genomic , species-level substantiate results Our several of status Conclusions conservation between the introgression to detected due also We pursued be where persist. Basin, may Great Pleistocene the and the and California before in diverged particularly assessment, that genomic lineages and sampling study, further this requires that as in genomic to Following sampled referred their name species. be maintain distinct to level, subspecies continue as nonetheless southern should recognized lineage that oldest be northern environment the should the lineages 2019), the two (Pelham and priority the taxonomic that and species recommend northern other therefore We between with integrity. divergences interactions ecological by and shaped evolutionary of recent lineages support southern unambiguously results Our a the likely in to even is revision lineages, Taxonomic selection southern divergent and dispersal. northern ecologically-based to the barriers that of of suggest reinforcement absence and similarly divergence ENMs underlying process that complex. principal emphasizes this and in expansion, boundaries range this to southern prior and northern and .hsei/.nausicaa hesperis/S. S. .nausicaa S. orpeetteetr otenlnae hstxnmcrvso ean hypothesis a remains revision taxonomic This lineage. southern entire the represent to , .hesperis S. .hesperis S. .hesperis S. htmyb oreo ula eete-pce redsodne hsshould This discordance. tree tree-species gene nuclear of source a be may that u loidct htteelnae urnl xs sdsic entities distinct as exist currently lineages these that indicate also but , ute ugssta nrgeso ewe hs pce occurred species these between introgression that suggests further COI 12 .zerene S. sol atal nomtv o lrfigspecies clarifying for informative partially only is .h nausicaa h. S. subspecies. .atlantis S. hudb lvtdt species to elevated be should , , .hesperis S. and , .hesperis S. .nausicaa S. .zerene S. and , Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. ace,J . mrs . oelh,P,Cek,W,adPsltwi,J .(01 tcs uligand building 171-182. Stacks: delimitation. (2011) 1, species H. G3, J. at sequences. Postlethwait, fail short-read and from to W., novo How Cresko, de P., (2013) Hohenlohe, loci D. A., genotyping J. Amores, Satler, M., J. and Catchen, M., phylogeny N. species Reid, 4369-4383. SNP-based A., 22, (2019) Ecology, T. Molecular H. Pelletier, A. C., B. F. Carstens, mito-nuclear Sperling, widespread and demonstrates V., DOI:10.1111/syent.12393 Speyeria) Entomology, R. Nymphalidae: Systematic (Lepidoptera: Gage, discordance. Cross-platform butterflies V., fritillary (2017) E., greater e84- H. Gage, of 6, A. O., Resources, F. Ecology E. Sperling, Molecular Campbell, and genus. K., butterfly nonmodel Muirhead, a R., in J. SNPs e93. Dupuis, novo-aligned S., de NCBI Data C. of butterflies. (2020). compatibility Davis, of H. O., A. group E. F. confusing Sperling, Campbell, a and using V., delimitation R. Gage, Molecular species NNNN-NNNN. V., ecological analysis. E. accessions and coalescent Gage, genomic G., full Reconciling Z. a MacDonald, from: O., in E. trees species Campbell, Inferring [dataset] gene (2012) bypassing 1917-1932. R. 29, markers: A. Evolution, Choudhury, genetic and and biallelic Biology A., from N. Rosenberg, directly M, J., Maechler trees Felsenstein, HJ, R., Bouckaert, Skaug 378–400. D., 9(2), A, Bryant, Journal, Nielsen Generalized R York: CW, Zero-inflated The for New Berg Packages Modeling.” Among A, America, Flexibility Mixed and Magnusson North Linear Speed KJ, Balances of “glmmTMB Benthem (2017). Butterflies van BM Bolker to K, Kristensen Guide ME, Field Brooks Kaufman (2003) K. Compu- Co. Kaufman, PLoS Mifflin Houghton and analysis. P. evolutionary Bayesian J. for Brock, platform software a e1003537. 2: 10, Biology, BEAST tational 1372-1373. (2014) 26, J. Bioinformatics, A. K trees. Drummond, J., phylogenetic Heled, of sets R., of Bouckaert, the sense finding and making divergence species: DensiTree: genetic cohesion (2010) extreme R. delimiting with Bouckaert, for 628-646. spiders method 57(4), trapdoor Biology, integrative Californian Systematic The An of structuring. Butterflies. (2008) group geographic Alberta K. a 1995. in A. H. interface Stockman, A. population-species F. & Sperling, E., and J. M., Canada. Bond, E. Alberta, Pike, Edmonton, G., Alberta, N. of Kondla, Museum J., Provincial G. Hilchie, D., C. Bird, 37-48. 16, Evolution, and Appli- Biology Methods, R Molecular and Concepts, P., Genetics: Forster, H., Landscape Bandelt, (2016) Cresko, L. UK. U., Waits, Sussex, E. A., West Selker, Wiley, Storfer, markers. A., cations. RAD S., Z. sequenced Cushman, Lewis, using L., N., mapping A. Balkenhol, genetic and Shiver, discovery C., SNP e3376. M. Rapid 3, Currey, (2008) ONE, S., A. PLoS E. T. Johnson, Atwood, ecology and D., of A., P. W. review Etter, Annual A., theory. N. niche of Baird, power and the methods, 39-61. Harnessing ordination 16(1), 2016. concept, systematics, P. Continuum and Hohenlohe, (1985) and P. G., M. Luikart, Austin, genetics. R., evolutionary M. and Miller, 375-382. ecological M., 39(3), for Taxon, J. ecology. RADseq Good, and R., concepts K. species Andrews, force: driving The (1990) L. Andersson, het . aga,T,W,C,Xe . uhr,M . abu,A,and A., Rambaut, A., M. Suchard, D., Xie, C., Wu, T., Vaughan, D., uhnert, ¨ h,A 19)Mda-onn ewrsfrifrigitapcfi phylogenies. intraspecific inferring for networks Median-joining (1999) A. ¨ ohl, aueRvesGenetics Reviews Nature 13 , 17 81-92. , Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. ry .P 15)Tesbpcaino pyraalni.TeLpdpeitsNw,5 31-35. 5, News, Lepidopterist’s The atlantis. Speyeria phylogenetics of Integrating 58(8), subspeciation (2004) Evolution, The C. frogs. (1951) Moritz, P. dendrobatid & L. in Grey, J., mechanisms C. speciation Schneider, explore C., concept— to species J. 1781-1793. models the Santos, and niche R., Biodiversity environmental S. (2016). and Ron, T. H., B. C. Sinn, syw098. the Graham, & Biology, using A., Systematic individuals R. enough. Folk, of not B., are clusters M. Lineages of Broe, number 2611-2620. V., the 14, J. Ecology, Freudenstein, Detecting Molecular (2005) study. J. simulation infer and Goudet, A to Phylogenetics Structure: and tool software S., Molecular a Regnaut, (Cyperaceae). as G., Genotyping-by-sequencing Carex Evanno, (2014) in L. study A. case 359-367. Hipp, a 79, M., explanation Evolution, hybridization: statistical Hahn, ancestral A R., (2011). and A. J. phylogeny D. C. 43-57. Eaton, Yates, Novel 17(1), & M., (2006). Distributions, E., Escudero, J. and Y. Li, Diversity Chee, Dud´ık, ecologists. & M., T., for 129-151. . Hastie, MaxEnt . 29(2), . J., Ecography, of S. A., data. Phillips, Guisan, occurrence J., S., from Elith, Ferrier, distributions Dud´ık, M., species’ R., proportions of Anderson, admixture prediction P. improve of C., methods 2611-2620. inference Graham*, 14, Spatial H. Evolution, J., (2009) and Fran¸cois,Elith*, O. Biology and Molecular E., zones. O. contact secondary Gaggiotti, Nym- and F., (Lepidoptera: Jay, literature atlantis- E., relevant the Durand, and and 1-74. images, Scudder 90, Speyeria type genus Mundi, accounts, fritillary Insecta species greater phalidae). with the complexes, of species overview Taxonomic hesperis (2009) BMC C. trees. J. sampling Dunford, by analysis evolutionary 1-30. Bayesian 214. BEAST: 1370, 7: (2007) Novitates, Biology, A. Museum Evolutionary with Rambaut, American Nymphalidae) and J., (Lepidoptera, localities. A. Speyeria type Drummond, of of catalogue fixations Systematic and 879-886. (1947) P. types 56, L. Biology, of Grey, Systematic designations and Delimitation. Press, F., University Species C. Passos, and Oxford dos Concepts speciation. Species and speciation. (2007) of species K. process forms: Queiroz, the Endless de and (Eds.) criteria, H. species S. species, York. Berlocher, of New concept J., lineage D. general and Howard, Interrelationships The IN 2017. (1998) I. K. R. Hill, Queiroz, and de N., Wahlberg, Analysis X., Bao, Genomes C.., 1000 tenney, of 2156-2158. K., . diversification . W. 27, . Savage, Bioinformatics, A., S., of R. VCFtools. M. Moya, DePristo, and influence de E., format relative Banks, call 49-62. A., variant The 15, C. The Ecology, (2006) Albers, (2011) Molecular G., A.P. Group. . Abecasis, Hendry, guppies. A., & Auton, in P., flow M.T. Danecek, gene Kinnison, on geography D.N., and Reznick, MA. selection P., Sunderland, natural Bentzen, Associates, the 165. Sinauer E., with 6, Speciation. dealing Crispo, Evolution, (2004). units: and H.A. conservation Ecology Orr, and in & diversity Frontiers J.A. genomics. Genetic Coyne, of (2018) age C. the Moritz, in and 747-756. continuum M., 7, species-population Notes, Byrne, Ecology J., Molecular study. D. comparison spatial Coates, a ascertaining and algorithms a program clustering computer of Bayesian new (2007) Fran¸cois, resistance a O. and structure: electrical 312-340. F., population Forbes, The 6(4), E., Complexity, 1996. Durand, Computational P. C., times. Tiwari, Chen, cover & and R. commute Smolensy, its W.L., captures Ruzzo, graph P., Raghavan, A.K., Chandra, Argynnis arcu and Fabricius Speyeria cde butteflies. Scudder 14 ytmtcEntomology Systematic , 42 635-649. , Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. oa,K . in,J 20)De ih osraimpooeseito?Acs td nNorth in study case A speciation? promote conservatism niche Does (2006) 2604-2621. J. 60(12), Evolution, Wiens, salamanders. & American H., Ecology Molecular K. program K. a Kozak, across Clumpak: inferences (2015) structure I. Mayrose, population 179-1191. and packaging 5167-5176. 15, A., and 3, Resources, N. Evolution, modes Rosenberg, and clustering M., identifying Ecology Jakobsson, for J., species. Mayzel, butterfly not M., a N. Deceptive is in (2013) Kopelman, DNA markers H. nuclear mitochondrial A. and in F. morphology divergence Sperling, in Wolbachia-associated and reflected N., phylogeography: Wahlberg, and S., taxonomy Bromilow, phylogeography J., single-locus the T. on impacts Simonsen, Pleistocene U., (2009) Kodandaramaiah, ( R. B. mouse Riddle, pocket & desert L., the Z. Marshall, of Intergovernmental H. D´ıaz, R., and J. S. the Jaeger, Settele, Germany. of T., J. Jezkova, Bonn, Brondizio, services secretariat, S. ecosystem IPBES E. Royal (editors). and Services. Ngo the biodiversity Ecosystem population, T. and of on Biodiversity in on report Proceedings marker Platform assessment a Science-Policy symbionts. as Global DNA inherited (2019) mitochondrial IPBES of with effects Problems 1525-1534. the (2005) 272, M. B, studies: 2136-2150. Society F. phylogenetic 11, Jiggins, an Evolution, and reveal and and data Biology phylogeographic D. SNP Genome D. Genome-wide hawkmoths. (2019) G. of M. Hurst, group Mutanen, and a J., in diversity I. Kitching, species M., of rainbow K. overestimation in Ecology, Lee, C. introgression Molecular K., Muhlfeld, of A., sequencing. C., patterns Hundsdoerfer, RAD M. Genomic Boyer, paired-end (2013) overlapping N., G. by Kamps-Hughes, Luikart, 3002-3013. illuminated R., and 22, trout M. A., cutthroat Improving Miller, E. westslope J., Johnson, UFBoot2: and S. W., (2018) Amish, F. S. D., Allendorf, L. 518-522. M. Vinh, C., Day, 35, and Evolution, A., Q., P. and B. Biology Hohenlohe, Ming, Molecular A., approximation. Haeseler, bootstrap von ultrafast O., Thymelicus the Chernomor, butterfly T., the D. in Hoang, discordance mitonuclear https://doi.org/10.1111/mec.15153 striking (2019) DOI: uncover R. Ecology. genomics Vila, Molecular and species: sylvestris. N., at: cryptic Alvarez, Dinc˘a, of V., C., online package mirage Pitteloud, R A Available A., data. Koub´ınov´a, M. raster C., Szenteczki, ‘dismo’. with J. D., modeling Hinojosa, Package and analysis (2011). Geographic raster: http://CRAN.R-project.org/package=raster. J. (2012). 2.0-12. J. version Elith, Etten, van & and R.J., Hijmans, J. the Leathwick, of Journal S., Argynnini). Phillips, http://cran.r-project.org/web/packages/dismo/index.html. compatibility (Nymphalidae: R.J, genomic butterflies of Hijmans, fritillary studies Wolbachia greater Hybridization 263-273. (2014) the (2013) 67, C. Society, W. in C. Lepidopterists’ Bergman, Nice, expression and and phenotypic D., 14:6. A., and Science, J. McCorkle, Fordyce, C., of L., P. Journal M. Hammond, Ecological Forister, concern. ecology. conservation A., of Pike, in Lepidoptera A., models and C. infection distribution Handley, A., habitat C. Predictive British Hamm, (2000). Vancouver, E. Press; 147-186. UBC N. 135(2-3), Columbia. Zimmermann, modelling, British & of A., Butterflies Guisan, (2001) H. J. Shepard, 414p. Columbia. and S., C. Systematic simu- Guppy, Phrynosomatidae). factors: Bayes (Squamata: using group delimitation species Species 119-133. scalarsis (2014) Sceloporus 63, W. Biology, the T. Reeder, to and applications Jr., and W. lations R. Bryson, A., J. Grummer, heoiu penicillatus Chaetodipus 15 .Junlo amlg,9() 306-320. 90(2), Mammalogy, of Journal ). Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. etn .G,Sart,J,Hnrxo,B . ekrbta,S,&Bn,J .(00 nertv species Antro- Integrative (2020) (Araneae: E. unicolor J. Antrodiaetus Bond, Appalachian Ecology. & Molecular southern S., complex. the Derkarabetian, species in E., sub- diaetidae) diversity B. new cryptic Hendrixson, a reveals J., of delimitation Starrett, description G., and America L. records, of Newton, comments, Specialists. butterflies Reprint Speyeria: the Entomological in of Angeles: variability checklist Los Geographic catalogue/ species. (1975) A H. 2: Society. A. No. Lepidopterists’ Moeck, The Memoir 143. (1981) Pp. M. Mexico. F. of Brown, north and 19885-19890. Proceedings D., populations. 104, L. USA, and Miller, Sciences, plant in of flow Academy gene predicts National theory the Circuit of (2007). 1551-1561. P. 60, Beier Evolution, H., B. resistance. zerene McRae, by S. Isolation threatened (2006). the mole- H. of A B. management 1237-1253. (2013) McRae, the 17, G., for Conservation, Binford, implications Insect and its of R., and Journal Buskirk, Speyeria Van hippolyta. of T., analysis Marzulla, MA. phylogenetic C., Cambridge, cular Greever, Press, P., University Bierzychudek, Harvard A. Evolution. McHugh, and Species Washington, Animal Sciences, (1963) of Advancement E. the Mayr, for York. Association New American Press, problem. University species DC. Columbia problem. The species. (1957) species of E. the origin Mayr, of the Hall. saga and & the Systematics Chapman in (1942) 381–424). denouement E. (pp. The Mayr, biodiversity concepts: landscape of species combining units of The genetics: hierarchy Species: 189-197. Landscape A In 18(4), (1997) (2003) Evolution, L. P. & R. Taberlet, Ecology Mayden, & in Trends G., genetics. Luikart, population K., and Sons. ecology M. & Schwartz, Wiley Royal S., John diversity. the Manel, biological of Measuring Transactions (2013). Philosophical 711-719. E. conservation. 359(1444), A. Sciences, species Magurran, Biological in B: taxonomy Series of London. role of and The flow Society Ecology. (2004) Gene Molecular M. (2020). specialist. G. F.A.H. habitat Sperling Mace, vagile S.E., a Nielsen in J.H., variation Acorn, diversity. genetic C.S., butterfly Davis climate-associated J.R., in Dupuis and trends Z.G., richness long-term MacDonald species assessing between 617-629. relationships for 26(3), Negative inadequate conservation, (2017) indices and H. Biodiversity J. diversity Acorn, common & render E., evenness S. Nielsen, G., conser- Z. Niche phylogenetics MacDonald, lands: Molecular arid Ephedra. genus American 437-450. the North in 65(2), in speciation evolution, Diversification explain construction. and (2012). habitat of M. network expansion S. and haplotype Ickert-Bond, divergence & vatism, for V., software Sosa, Full-feature I., 1110-1116. PopART: Loera, 6, (2015) Evolution, and D. Ecology Bryant, genome- in using and Methods delimitation W., Species J. (2014) 534-542. R. Leigh, R. 63, Biology, tutorial Bouchaert, Systematic and a N., data. SNAPP: V. SNP with Minin, wide delimitation K., genomics, M. species speciation Fujita, and common D., and Leach´e, trees is A. population Species admixture on Workshop (2018) Genome-wide example. R. 2018. worked R. D. and Bouckaert, C. and https://doi.org/10.1101/414201 Jiggins, D. doi: and Leach´e, 414201, A. M., bioRxiv, Joron, radiation. O., Heliconius W. the across McMillan, M., K. Kozak, 16 ek Krumlov Cesky ˇ Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. ´ nhzRmıe,S,Rc,Y,Bry .H,Ewrs . al .E,Hnn .T,&Mrh,R W. (Gopherus R. tortoise Murphy, desert & Agassiz’s T., in B. structure Henen, population E., drives and A. flow Karl, gene T., limits Edwards, Landscape H., (2018). K. paired-end for Berry, DOI:10.1111/MEC.15253 methods Y., Ecology, analytical S´anchez-Ram´ırez, Molecular Rico, 2: Stacks genomics. S., (2019) population M. charac- RADseq-based J. integrative improve Catchen, eye”: sequencing and Speyeria). G., Rivera-Col´on, “by (Nymphalidae: A. C., Identification butterflies N. 2019. Rochette, fritillary 259-267. H. greater 24, J. Conservation of Acorn, Insect identification of and field Journal F., regional Carroll, informs assessment O., ter E. Campbell, Methods F., things). Riva, other 217-223. (and 3, biology comparative Evolution, phylogenetic and for package Ecology R population in An Indian phytools: Reconstructing (2012) (2009) J. L. L. Revell, Singh, and in L., summarization 489-494. A. Posterior 461, Price, N., (2018) Nature, Patterson, history. A. K., M. 901-904. Thangaraj, Suchard, 67, D. and Biology, Reich, G., Systematic Baele, 1.7. Tracer D., using Xie, phylogenetics J., Statistical Bayesian A. for Foundation Drummond, R A. computing. Rambaut, statistical for https://www.R-project.org/. environment URL multilocus and Austria. language using Vienna, A structure Computing, R: population (2017) Team. of Core Inference R (2000) 945-959. P. genetic Donnelly, 155, high-density Genetics, and data. of M., genotype Development Stephens, approach. K., (2012) genotyping-by-sequencing J. J. two-enzyme Pritchard, Jannink, novel and a E., using wheat M. genome-wide and e32253. from Sorrells, barley mixtures J., P. and for Brown, splits maps population A., of J. e1002967-17. Inference Poland, 8, (2012) Genetics, K. PLoS J. data. geographic frequency Pritchard, species allele of and modeling complete K., entropy J. a Maximum Pickrell, (2006). with E. 231-259. Canada; R. 190(3-4), Schapire, and modelling, & P., Ecological States R. distributions. USA. United Anderson, CA, J., of the Hills, S. Foundation/Journal Beverly Phillips, of Research 40, Lepidoptera butterflies Volume The Lepidoptera, the literature. the systematic of on and Research catalogue descriptive the A RADseq: of (2019) digest bibliography Double P. (2012) J. E. Pelham, H. Hoekstra, and S., H. Fisher, for One H., method E. Kay, inexpensive N., an J. The Weber, modeling. K., 9–13. niche B. 5(2), ecological Peterson, News, on STACKS. R based for R.” distributions 599-605. geographic in map 103(3), data species’ road Condor, Predicting spatial a (2001). for space: T. methods A. parameter and Peterson, “Classes in (2005). Lost RS 2017. of Bivand M. EJ, Museum J. Pebesma Gillette Catchen, Evolution P. and and C. Ecology R., butterfly in J. the for Methods Stevens, list of P., names Contributions J. Scientific USA. Paris, Mexico. 4. Colorado, America. Collins, of Fort North University, North of State America, Lepidoptera Colorado Diversity, (2005) North D. of A. effecti- Warren, species and and fast A. Press. P. a University Opler, IQ-TREE: Oxford (2015) speciation. Evolution, Ecological Q. and (2012). B. Biology P. Nosil, Molecular Minh, phylogenies. and likelihood A., maximum Haeseler, estimating von 32:268-274. for A., algorithm H. stochastic Schmidt, ve -T., L. Nguyen, , 7 e37135. , enovo de , 8 N icvr n eoyigi oe n o-oe species. non-model and model in genotyping and discovery SNP 1360-1373. , 17 LSOne PLoS PLoS , 7 , Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. arn .D,Dvs .J,Gihn .V,Pla,J . n tnead .M 21)Itrcielisting Interactive (2012) M. spatial E. Stangeland, in and geography P., and J. ecology Pelham, V., of N. roles Grishin, butterflies the J., American Quantifying K. of (2012) Davis, D., B. A. J. correlate Warren, 175-182. to Losos, 16(2), modelling & letters, suitability Ecology E., Habitat R. divergence. (2008) genetic Glor, K. 989-1000. J., L. 23(8), Lin, I. Ecology, & Wang, Landscape L., connectivity. C. and landscape Bridgman, with 787-798. boundaries C., flow 22, species K. Ecology, gene Seehausen, Yang, Molecular of and H., resolution radiation. A., Y. adaptive Sivasundar, unprecedented Wang, cichlid L., provide Victoria Greuter, Lake data S., Mwaiko, the sequence M., in RAD of O. relationships Selz, Genome-wide pattern S., New (2013) a Wittwer, reveal I., O. Diplostephium). Keller, signals E., Astereae: phylogenomic C. Conflicting Wagner, (Asteraceae: (2017) diversification B. high-Andean 1736-1750. B. recent 214, Simpson, a Phytologist, M., in E. evolution 233-239. Ortiz, reticulate Taxon, M., oaks. and O. multispecies, Vargas, Statis- species, of Ecological Journal (1976) Grids.” L. Geographical Valen, on Van Routes and 21. Distances 76(13), gdistance: Software, Package tical “R 6(4). (2017) speciation Genetics, J allopatric PLoS and Etten ecological van archipelago. for 3071- tests island 363, Genetic an (2010) Sci., on H. Johansson, anoles Biol. & in B, Y., Surget-Groba, Society S., Royal R. geographic the Thorpe, and of ecology Transactions of importance Philosophical relative The anoles. 3081. (2008) in 209-224. H. speciation the Johansson, 10, for within & , Y., isolation species radiation. Surget-Groba, true adaptive S., clarify recent R. analyses a Thorpe, Phylogenomic of (2019) evidence I. where despite R. genetics: Speyeria Hill, genus Landscape the and butterfly (2010). J., P. in Baumsteiger, L. delimitation Waits, E., & 3496-3514. Thompson, Species R., 19(17), mill? Ecology, Holderegger, Molecular F., the now?. S. for we Spear, 101-113. grist are A., 20, M. More Genetics, Murphy, Conservation (2019) A., conservation. Storfer, W. for M. implications Pedersen, its Tingskov Bruford, A., and P. and Orozco-terWengel, era M., R., genomic I-R. H. Russo, K., Butterflies Siegismund R. Flight: Waples, C-E., P., Taking Frandsen, Evolution Chicago. G., and Press, W. Ecology D. Chicago Stanton, (Eds). of W. University Watt, phylogenies. systems. level and study higher P. to Erlich, model definitions C., as species Boggs, from IN systematics: molecular 431-458. Butterfly coot Pp. of (2003) Hawaiian H. effects from A. bottleneck 9925-9934. example F. of 7, Sperling, An Evolution, implications populations: and Genetic recovering Ecology (2017) naturally biology. gallinule. G. in systematic Hawaiian J. diversity and in underwood, genetic issue and of E., Renaissance severities R. a differing Wilson, species: A., Delimiting S. 462-470. (2003) Sonsthagen, 18, C. Evolution, J. and Marshall, Ecology and 45. in distinct Trends 8, Jr., are Insects, W., atlantis conservation. J. Speyeria Nymphalidae) Sites, and (Lepidoptera: hesperis Speyeria Speyeria (2017) R. (1998) S. M. Sims, S. 1-32. 8, Spomer, Series, 737-741. and New 323(5915), G., Papilio Science, N. 372-380. species. alternative. Kondla, 16(7), its A., evolution, and J. & speciation Scott, ecology ecological in for Trends Evidence species. (2009) of D. origin Schluter, the and Ecology (2001) D. Schluter, 11231. 8(1), reports, Scientific agassizii). < www.butterfliesofamerica.com 18 > Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. O n G odce aaaaye n rt h nta aucitdat l uhr otiue to contributed authors all draft; Figures manuscript revisions. and draft initial Tables manuscript the and wrote collection, and specimen analyses design, data study conducted ZGM and EOC Campbell citation: Data contributions acceptance. Author manuscript New pending respectively, (2010) SRA, C. NCBI J. and Svenning, & B., COI P. Pearman, H., 985-989. Accessibility Press. C. 33(6), Ecography, California Data Graham, of modelling. C., University distribution (CA): T. 914–923. species Berkeley 163, Jr, in idea. Nat. Edwards trends the E., Am. of N. it? history study A Zimmermann, we Species: should (2009) how S. and J. biology. speciation Wilkins, invertebrate is of What manipulators (2004) master J. Wolbachia: J. (2008) Wiens, E. 741-751. M. 6, Clark, Microbiology, and Reviews L., Nature Baldo, H., J. Werren, eesqec n a dA/w-nyeGSsqecn lswl emd vial nGenBank on available made be will files sequencing GBS ddRAD/two-enzyme raw and sequence gene 19 tal. et (2020). Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. eta ufc sdpce nteright. image: the right correspond on Top depicted colours (A). is haplotype in surface and results ventral sequences, subclustering between and image: differences clustering left nucleotide Structure of the number geographic to the and genetic indicate between (C) correspondence and in show northern to (B) between hesperis in S. divergence map the genetic on The charts major structure. pie as a plotted indicate are analysis consistently (A) results southern TESS and Structure 2: Figure indicate nodes on circles Grey (A). in membership group their to according values coloured bootstrap are (B) in Specimens 1: Figure .hesperis S. aiu ieiodpyoeisbsdo A ula Nsad()mitochondrial (B) and SNPs nuclear (A) on based phylogenies likelihood Maximum egahcassmn fpplto eei lseigof clustering genetic population of assessment Geographic n altp hrn between sharing haplotype and .hsei beani hesperis S. COI > 75%. altp ewr o h aeseies()dpcsalc fgorpi tutr in structure geographic of lack a depicts (C) specimens same the for network haplotype iegs ouain of Populations lineages. h oslwn ufc ssono h etsd fec pcmn n the and specimen, each of side left the on shown is surface wing dorsal The . .hesperis S. .hesperis S. 20 n Nevadan and and .atlantis S. .zerene S. .atlantis S. dnie nthe in identified ace ln h branches the along Hatches . and .alni hollandi atlantis S. .hesperis S. K COI 6Structure =6 sn SNPs. using sequence. lower ; Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. infiatmgaineet hwn uaieitorsinbtenppltosof populations between introgression output putative (left) and showing phylogeny events likelihood the migration maximum in recovered the significant populations the depicts the for (B) using TreeMix support topology. by branch that and depicting poor tree, distribution of MCC posterior source the the on as nodes (= zerene SNPs theta both between of at discordance estimates red zerene genealogical the in the southern indicate indicated visualizes and branches are (A) northern in the values (overlaid supporting along probability tree colour), values (MCC) Posterior (in grey DensiTree credibility by clade species. maximum output distinct a cloud” both as “tree as a visualized as model and black), species” “4 BFD* the depicts 3: Figure .atlantis S. and /southern .alni/.hesperis. atlantis/S. S. F*seisdlmtto n rei nrgeso eut sn Ns h etpnli (A) in panel left The SNPs. using results introgression TreeMix and delimitation species BFD* h oe eiul o hsaayi r niae ytehampo h ih n(B). in right the on heatmap the by indicated are analysis this for residuals model The . .hesperis S. ld nteMCte,adidctstreatraerltosisbetween relationships alternate three indicates and tree, MCC the on clade ecnae bv ahDnirerpeettepooto ftesi the in trees of proportion the represent DensiTree each above Percentages K 21 tutr nlss(i.2,wt he statistically three with 2), (Fig. analysis Structure 6 = N e o ahlnae h ih ae in panel right The lineage. each for ) .zrn,S hesperis, S. zerene, S. .hesperis S. S. S. Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. otnnaiy0005 .600 6.05 7.16 0 9.61 34.62 0.41 26.7 0 0 3.97 4.46 5.34 0 s.e. 0 6.22 41.59 0.18 39.52 1.26 0.38 0 0 4.06 mean 20.12 3.98 0 0.62 0 12.79 0.56 0.17 0 10.5 0 42.71 s.e. 3.02 0.43 0.28 0 35.04 5.1 0 18.74 2.61 0.19 1.9 13.97 0.44 5.44 19.54 0 15.13 1.89 mean 0 54.78 1.47 0.84 4.73 10.07 BF 3.1 s.e. precipitation Aug) MLE to 0 (Jun summer 15.87 6.58 snow as precipitation mean temperature minimum extreme days degree growing days 9.99 degree chilling 0 month warmest continentality the of month temperature coldest mean the of temperature mean cover land load heat ruggedness terrain variable Predictor therefore was model species” “4 the support; model strongest indicate optimal. values considered (BF) Factor Bayes negative 1: Table ” species” ”6 ” species” ”4 ”north species” ”3 species” ”2 tested priori a groups received a Species cell grid suitability. 1-km higher Each model indicating values BFD* area. higher study five with entire the 0–1, of the from average across ( ranging The suitability score points habitat. habitat suitability available predict Background habitat sample to to used evaluation. polygon were convex model minimum models for buffered lineage’s used each localities within occurrence of 20% hesperis 4: Figure n C southern (C) and , F*mdlslcinrsls h ihs agnllklho siain(L)admost and (MLE) estimation likelihood marginal highest The results. selection model BFD* rdce aia utblt ufcsbsdo Nsfr(A) for ENMs on based surfaces suitability habitat Predicted S atlantis” ”S. ” .zerene S. atlantis S. .hesperis S. .hesperis S. ”, ,” ”, .hesperis S. north + ” , S atlantis ”S. .zerene” S. .atlantis S. + .hesperis S. .atlantis S. o ahlnae v Nswr ul,ec ihodn different a withholding each built, were ENMs five lineage, each For . ,”north ”, ”north , ,” ”, .alni .atlantis S. atlantis S. .zerene S. ,”south ”, ”south ”, .hesperis” S. .hesperis S. 22 -13863.6 ” .hesperis S. hesperis S. ,”eta” suhU” N Z 267117587 -22657.1 AZ” & ”NM UT”, ”south ”central”, ”, ”south , ,” ”, + northern .hesperis” S. .zerene S. .zerene S. pyraatlantis Speyeria n .hesperis S. 13751067.8 -14397.5 ” 1848-97.6 -13814.8 ” 000 eegenerated were 10,000) = B northern (B) , northern

1423-802.6 -13462.3 .hesperis S.

S. southern .hesperis S. southern .hesperis S. Posted on Authorea 1 Oct 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160157511.10593343 — This a preprint and has not been peer reviewed. Data may be preliminary. sdfrmdleauto.Tema n tnaddvain(.. frltv otiuin o ahpredictor each for localities reported. contributions occurrence relative are of of ENMs 20% (s.e.) different permuted. five deviation a randomly the standard withholding was and across each variable mean variable built, The each were evaluation. precipitation. after ENMs model Aug) values five for to AUC lineages, used three in (Jun the drop summer of the and warmest each as For snow, and measured as coldest was precipitation the contribution temperature, Relative of minimum temperatures mean extreme cover mean month, days), land 0 between coldest gree and below difference the aspect), degree-days the of and (continentality), month, temperature months slope warmest mean terrain the included on of variables (based temperature predictor load Environmental heat categories). ruggedness, (12 terrain included variables dictor atlantis 2: Table northern , h eaiecnrbto % fgorpi n niomna rdco aibe to variables predictor environmental and geographic of (%) contribution relative The .hesperis S. n southern and , ° .hesperis S. ciln eredy) eredy bv 5 above degree-days days), degree (chilling C 23 clgclncemdl EM) egahcpre- Geographic (ENMs). models niche ecological ° goigde- (growing C Speyeria