Received: 23 April 2020 | Accepted: 26 November 2020 DOI: 10.1111/jbi.14055

RESEARCH ARTICLE

Pleistocene climatic fluctuations promoted alternative evolutionary histories in aequatorialis, an endemic palm from western

Sebastián Escobar1 | Andrew J. Helmstetter2 | Scott Jarvie1,3 | Rommel Montúfar4 | Henrik Balslev1 | Thomas L.P. Couvreur2,4

1Section for Ecoinformatics and Biodiversity, Department of Biology, Aarhus University, Abstract Aarhus C, Denmark Aim: Pleistocene (2.58 Ma–11.7 ka) climatic fluctuations have shaped intraspecific 2 IRD, UMR DIADE, Université de genetic patterns worldwide; however, their impact on in many regions re- Montpellier, Montpellier, France 3Center for Biodiversity Dynamics in a mains unknown. In order to determine the impact of Pleistocene climatic fluctuations Changing World (BIOCHANGE), Department on the tropical rain forests of western Ecuador, we explored the evolutionary history of Biology, Aarhus University, Aarhus C, Denmark of the endemic palm Phytelephas aequatorialis. 4Facultad de Ciencias Exactas y Naturales, Location: Western Ecuador, north-western . Pontificia Universidad Católica del Ecuador, Taxon: Phytelephas aequatorialis (). Quito, Ecuador Methods: One hundred and seventy-six nuclear genes were sequenced in 91 indi- Correspondence viduals for phylogenomic and population structure analyses. The time of divergence Sebastián Escobar, Section for Ecoinformatics and Biodiversity, between identified genetic lineages was estimated using a coalescent phylogenomic Department of Biology, Aarhus University, analysis. Palaeoecological niche modelling analyses were performed to determine DK 8000 Aarhus C, Denmark. Email: [email protected] areas of historical climatic suitability since the Last Glacial Maximum (LGM; 22 ka) that potentially acted as forest refugia during the Pleistocene. A Wilcoxon test and Funding information SENESCYT; Independent Research Fund Pearson correlations were used to explore how current levels of genetic diversity, in Denmark, Grant/Award Number: 9040- terms of expected heterozygosity (Hs), have been shaped by several palaeoclimatic 00136B; Agence Nationale de la Recherche, Grant/Award Number: ANR-15- CE02- and geographic factors. 0002-01; International Mixed Laboratory Results: Phylogenomic and population structure analyses revealed two main ge- BIO_INCA; International Palm Society netic lineages with a north–south distribution, which diverged 1.14 Ma during the Handling editor: Carina Hoorn Pleistocene. Two potential Pleistocene refugia were identified, one along the Pacific coast of Ecuador and one in the Andean foothills of south-western Ecuador. The loca- tion of these refugia agrees with the spatial location of the two genetic lineages. Within the Andean foothills, Hs was lower for the southern lineage than for the northern line- age. Hs significantly increased with decreasing latitude across the species as a whole. Main conclusions: Pleistocene climatic fluctuations promoted intraspecific diver- gence in P. aequatorialis within the rain forests of western Ecuador. The Andean foot- hills of south-western Ecuador could be an important area for rain forest evolution

Henrik Balslev and Thomas L. P. Couvreur should be considered joint senior authors.

This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. © 2021 The Authors. Journal of Biogeography published by John Wiley & Sons Ltd.

Journal of Biogeography. 2021;00:1–15. wileyonlinelibrary.com/journal/jbi | 1 2 | ESCOBAR et al.

because they potentially remained climatically suitable throughout the Pleistocene. Lower genetic diversity in the southern lineage, which apparently remained isolated in the Andean foothills during glacial cycles, adds evidence to the presence of a for- est refugium in south-western Ecuador. The geographic pattern in genetic diversity suggests that P. aequatorialis colonized western Ecuador from the north. This study supports the role of Pleistocene climatic fluctuations in promoting intraspecific diver- gence, and for the first time, we show their impact west of the .

KEYWORDS Arecaceae, ecological niche modelling, genetic diversity, last glacial maximum, north–south genetic discontinuity, phylogenomics, phylogeography, population structure, rain forest

1 | INTRODUCTION decreasing to 2,000 mm to the south (Dodson & Gentry, 1991). In addition to climatic variation, the rain forests of western Ecuador Climatic fluctuations during the Pleistocene (2.58 Ma – 11.7 ka) present a wide altitudinal range from sea level up to 1,500 m.a.s.l. have shaped the current genetic patterns of and animal species They are distributed in the lowlands and along the small cordilleras across species-rich tropical rain forests worldwide. The forest refu- of the Pacific coast, as well as along the adjacent western Andean gium hypothesis stipulates that Pleistocene glacial cycles promoted foothills that gradually narrow towards the south (Cerón et al., 1999; retraction and fragmentation of tropical rain forests, isolating popu- Dodson & Gentry, 1991; Sierra, 2013). Thus, the rain forests of lations within climatic suitable areas enhancing allopatric speciation western Ecuador could have responded differently to Pleistocene (Haffer, 1969). Even though the role of the Pleistocene in promoting climatic fluctuations because of their north/south topographic and speciation across the Neotropics has been questioned (Bush, 1994; climatic heterogeneity. Colinvaux et al., 1996, 2000; Hoorn et al., 2010; Leite et al., 2016), The location of rain forest refugia during the Pleistocene in west- it has played a major role in shaping intraspecific genetic diversity ern Ecuador is still poorly understood. Overall, Neotropical rain for- (Hewitt, 2000). Intraspecific genetic lineages have originated in ests are thought to have retracted leading to isolated forest patches areas where tropical species took refuge during the Pleistocene during glacial cycles where temperatures dropped ~5°C compared to because isolated populations restricted gene flow (Cabanne pre-industrial values (Heine, 2000; Hewitt, 2000). Based on current et al., 2016; Carnaval & Bates, 2007; Carnaval et al., 2009; Diniz- patterns of plant distribution, lowland rain forests are hypothesized Filho et al., 2016; Thomas et al., 2012; Vasconcellos et al., 2019). to have survived the Pleistocene within the Chocó refugium, situ- Additionally, higher levels of genetic diversity are usually observed ated along the Pacific coast in north-western Ecuador and western in palaeoclimatically suitable areas where tropical species persisted (Gentry, 1982; Prance, 1982). Although evidence of the during the Pleistocene (Carnaval et al., 2009; Faye et al., 2016; climatic stability of this region has only been documented for the Mousalli et al., 2009; Walker et al., 2009) because long-term popu- Holocene (10 ka – present; Behling et al., 1998), the high levels of lations tend to accumulate unique genetic diversity when compared precipitation linked to its proximity to the Pacific Ocean may have al- to more recently expanding ones (Excoffier et al., 2009). Within the lowed rain forests to persist in these lowlands during the Pleistocene Neotropics, the intraspecific genetic implications of the Pleistocene (Hooghiemstra & Van Der Hammen, 1998). have not been explored west of the Andes, hampering our under- In addition, the rain or cloud forests that occur on the Andean standing of the recent evolutionary history of this region. foothills of western Ecuador (hereinafter referred as premon- Western Ecuador is a species-rich and heterogeneous region lo- tane rain forests) could have also acted as refugia during the cated west of the Ecuadorian Andes. The rain forests that occur in Pleistocene. Glacial cooling promoted the down-slope displacement western Ecuador form a highly diverse ecoregion in a relatively small of Neotropical premontane and montane rain forests (Colinvaux area of 34,000 km2 (Gentry, 1986; Olson et al., 2001). For instance, et al., 1996; Valencia et al., 2010). However, the moisture of the ad- 870 native plant species have been recorded in just 1 km2 with jacent lowlands could have determined whether they were able to ~20% of endemicity (Dodson & Gentry, 1978). Unfortunately, this displace and spread into the lowlands ('moist forest model"; Ramírez- biodiversity is highly threatened due to widespread deforestation Barahona & Eguiarte, 2013) or whether they contracted into refugia and land conversion for agricultural purposes (Cerón et al., 1999; at mid-elevations by the opposing forces of aridity and cooling ('dry Dodson & Gentry, 1991; Ferrer-Paris et al., 2019; Olson et al., 2001; refugia model'; Ramírez-Barahona & Eguiarte, 2013). The 'moist for- Sierra, 2013). The rain forests of western Ecuador form a limit be- est model' predicts population connectivity in the lowlands, diffuse tween the humid Chocó to the north and the dry Tumbes region to population genetic structure and high genetic diversity because of the south. Therefore, these rain forests present a north/south gra- spatial heterogeneity (Ramírez-Barahona & Eguiarte, 2013). In this dient of annual precipitation, averaging 7,000 mm to the north and sense, the Andean foothills of north-western Ecuador may have not ESCOBAR et al. | 3 acted as Pleistocene refugia because premontane rain forests poten- Ecuador (Table S1). In each locality, we sampled between one and tially moved down-slope into the Pacific coast and remained in the three individuals, which were separated by at least 100 m between Chocó refugium (Hooghiemstra & Van Der Hammen, 1998). The 'dry them to avoid the sampling of closely related individuals. Two pre- refugia model', in contrast, predicts the isolation of populations at viously collected individuals of the sister species P. tumacana from mid-altitude with subsequent genetic structuring and loss of genetic south-western Colombia were used as outgroup in the phylogenomic diversity because of altitudinal contraction and population reduc- analyses. We collected tissue from the youngest of the individu- tion (Ramírez-Barahona & Eguiarte, 2013). Given that the lowlands als, which was then silica gel-dried in plastic bags. We isolated total of south-western Ecuador remained dry during recent glacial cycles genomic DNA from each sample using the MATAB and chloroform (Heusser & Shackleton, 1994), the adjacent Andean foothills could separation methods as described in Mariac et al. (2014), and checked have harboured premontane rain forests during the Pleistocene be- DNA quality and quantity using a NanoDrop™ One spectrophotom- cause they were not able to displace into the arid lowlands. eter (Thermo Scientific™). Here we explore how Pleistocene climatic fluctuations have shaped intraspecific genetic diversity patterns of an endemic plant species from the rain forests of western Ecuador. We reconstruct 2.2 | Library preparation, sequencing, the evolutionary history of the vegetable palm Phytelephas filtering and assembly aequatorialis Spruce, by inferring its intraspecific phylogenomic relationships, population structure, and palaeoclimatic niche dis- Laboratory work was performed at the facilities of the Institut de tribution. This species is an emblematic element of these forests Recherche pour le Développement (IRD) in Montpellier, France. (Acosta-Solís, 1948; Barfod, 1991; Cerón et al., 1999), occurring We prepared genomic libraries using a modified protocol from from sea level to 1,500 m.a.s.l. (Borchsenius et al., 1998). According Rohland and Reich (2012) and captured nuclear markers using to previous molecular dating studies, P. aequatorialis diverged from a bait kit that targets 176 palm-specific nuclear genes (Heyduk its sister species Phytelephas tumacana O.F. Cook approximately 3 et al., 2016). Total DNA for each individual was sheared to a mean – 4 Ma, shortly before the Pleistocene (Barfod et al., 2010; Escobar target size of 350 bp using a Bioruptor® Pico sonication device et al., 2020; Trénel et al., 2007). Speciation in the Phytelephas (Diagenode). We then repaired, ligated and nick filled-in the DNA apparently occurred by the action of Andean uplift (Escobar before amplifying it for 10 cycles as a pre-hybridization step. Next, et al., 2020) followed by shifts in pollination mechanisms and vege- after cleaning and quantifying the DNA, we bulked the samples in tative morphology (Barfod et al., 2010). Thus, posterior intraspecific two libraries and added the biotin-labelled baits to the libraries in diversification in Phytelephas species may have occurred during the order to hybridize the targeted regions of the bait kit used (Heyduk Pleistocene. In this context, P. aequatorialis appears as an adequate et al., 2016). We used streptavidin-coated magnetic beads to im- model species for studying how Pleistocene climatic fluctuations mobilize the hybridized biotin-labelled baits applying a magnetic impacted the evolution of rain forest species in western Ecuador. field, and then discarded the unbounded DNA contained in the su- We used a genomic approach by targeting 176 nuclear genes using pernatant. We amplified the enriched DNA fragments eluted from a custom palm baiting kit (Heyduk et al., 2016) sequenced for 91 the beads in a RT-PCR during 15 cycles to complete adapters. Final individuals sampled across the entire distribution of P. aequatorialis. libraries were sequenced with an Illumina® MiSeq (paired-end, We aimed to test whether intraspecific genetic lineages in P. aequa- length 150 bp) at the facilities of the CIRAD (Montpellier, France) torialis are explained by areas of historical climatic suitability that using two lanes. potentially acted as lowland Pleistocene refugia in north-western High-performance computing analyses were done at the South Ecuador and in the Andean foothills of south-western Ecuador. We Green Platform at the IRD. We followed Couvreur et al. (2019) for also explored how several factors have contributed to shape the cleaning and filtering the reads. We controlled the quality using genetic diversity in P. aequatorialis, including the 'moist forest' and FastQC (http://www.bioin​forma​tics.babrah​ am.ac.uk/proje​cts/ 'dry refugia' models (Ramírez-Barahona & Eguiarte, 2013), palaeo- fastqc), demultiplexed using the 'demultadapt' script (https:// climatic suitability during the Last Glacial Maximum (LGM; 22 ka), github.com/Maillo​ l/demul​tadapt) with a 0-mismatch threshold latitude and longitude. With this study we aim to contribute to the and removed the adapters using Cutadapt v.1.2.1 (Martin, 2011). understanding of the recent evolution of Neotropical rain forests Reads with length <35 bp and quality mean values (Q) <30 were west of the Andes. discarded using a custom script for quality filtering (https:// github.com/South ​G r e e n ​P l a t f ​o r m / a r c a d ​-hts/blob/maste ​r / s c r i p ​t s / arcad_hts_2_Filter_Fastq_On_Mean_Quali​ty.pl). Then, we paired 2 | MATERIALS AND METHODS forward and reverse sequences using a comparison script adapted from TOGGLe (Tranchant-Dubreuil et al., 2018), and checked the 2.1 | Geographic sampling and DNA extraction quality again using FastQC. The last 6 bp of the reverse sequences were trimmed to ensure the removal of barcodes in sequences We sampled and georeferenced 91 adult individuals identified in the <150 bp using the 'fastx' script (https://github.com/agord​on/ field as P. aequatorialis from 46 different localities across western fastx_toolkit). 4 | ESCOBAR et al.

2.3 | Phylogenomic inference processed the DNA sequences of 88 individuals with the pipeline SECAPR v.1.14 (Andermann et al., 2018). This pipeline is useful DNA sequences were processed using HybPiper v.1.2 (Johnson when SNPs need to be called because it creates a pseudo refer- et al., 2016) to identify target exonic regions and off-target introns, ence formed by the consensus sequences of the target loci, with which are usually more variable than exons. As a result, we obtained the exception of paralog loci that were identified and automati- 'supercontigs' which are formed by targeted regions and surround- cally removed during the process. The use of the aforementioned ing off-target sequences. We filtered the dataset by keeping only pseudo reference is more efficient than the use of the bait kit ref- the loci with >75% of their length reconstructed in >75% of the in- erence in the phylogenomic analyses because the latter is made dividuals. The remaining supercontigs were aligned using the '—auto' from distantly related palm species (Helmstetter et al., 2020). option in MAFFT v.7.305 (Katoh & Standley, 2013) and were cleaned We used BWA v.0.7.12 (Li & Durbin, 2009) to map this pseudo using Gblocks v.0.91b (Castresana, 2000) with default parameters reference to our paired and cleaned reads. GATK v.4 (McKenna and all allowed gap positions. et al., 2010) was used to remove duplicates and to call SNPs. We

HybPiper also detects potential paralogous loci, which were ver- filtered the data by keeping only biallelic SNPs with a mapping ified by building exon trees in RAxML v.8.2.9 (Stamatakis, 2014), in- quality >40%, depth >25, and quality by depth >2 using BCFtools cluding all putative paralogs. We used a GTR + G substitution model (Li, 2011). Additionally, we removed SNPs with allele frequency because it is the most common and general model for DNA analyses <0.01 and excluded monomorphic sites. (Stamatakis, 2016), producing similar inferences as other models The number of clusters were determined with a Discriminant (Abadi et al., 2019). We also used the option '-f a' to perform 100 Analysis of Principal Components (DAPC; Jombart et al., 2010) rapid bootstraps as node support. For this purpose, we used a refer- using the package 'adegenet' (Jombart, 2008) in R v.3.5.3 (R Core ence where exons are split instead of being merged into genes. We Team, 2019). We used successive K-means with 100,000 iterations examined the exon trees manually to check if the potential paralog per value of K (number of clusters) up to K = 20. We examined the sequences that were flagged by HybPiper clustered together, which change in the Bayesian Information Criterion (BIC) of K = 1–20 to would mean that the potential paralogs are more closely related to determine the number of clusters within P. aequatorialis. We then potential paralogs in other individuals than to the alternative se- did a cross-validation of the DAPC analysis to test that the number quences in the same individual (Helmstetter et al., 2020). In this of principal components chosen was correct. case, we treated the exon as a paralog. If >50% of exons belonging to a certain gene were determined as paralogs, then the gene was marked as 'true' paralog and removed from further steps. 2.5 | Divergence time estimates We used the resulting alignments to infer the phylogenomic relation- ships within P. aequatorialis using the two individuals of P. tumacana as We estimated the divergence time between the resulting genetic outgroup. We used two approaches to construct the intraspecific tree, lineages by constructing a dated coalescent tree using five random coalescence and maximum likelihood. For the coalescence approach, individuals from each clade/genetic cluster, along with the two in- we used the alignments to infer individual gene trees in RAxML with a dividuals of P. tumacana as outgroup. We calculated the root-to-tip GTR + G model, 100 rapid bootstraps and no partitions. We then used variance of their respective RAxML gene trees and rooted them these gene trees to construct a coalescent tree in ASTRAL-III v.5.6.3 with phyx. We then calculated the variance using SortaDate (https:// (Zhang et al., 2018), collapsing branches with support <10 and annotat- github.com/FePhy​FoFum/​Sorta​Date/) and chose the 32 most clock- ing the local posterior probability (pp) as node support. For the maximum like loci (most suitable for dating) to reduce the highly demanding likelihood approach, we first added empty sequences where individuals computational requirements of further steps (Smith et al., 2018). were missing in our alignments and concatenated them with phyx (Brown The best nucleotide substitution model of evolution was deter- et al., 2017), generating one partition per loci. We built the maximum like- mined for each alignment using ModelTest (https://github.com/ lihood tree in RAxML using a GTr + G model and 100 rapid bootstraps. ddarr​iba/model​test). The corresponding 32 fasta alignments were Phylogenomic trees were visualized in FigTree v.1.4 (http://tree.bio.ed.ac. converted to nexus format using PGDSpider v.2.1.1.5 (Lischer & uk/softw​are/figtr​ee/). Three individuals of P. aequatorialis showed high Excoffier, 2012). node support with P. tumacana in the two intraspecific trees, suggesting The coalescence analysis was run using Bayesian inference close phylogenomic relatedness between these individuals. Therefore, in BEAST v.2.5.2 (Bouckaert et al., 2019). We used the template we used this mixed clade to root our trees and excluded these three indi- 'StarBeastWithSTACEYops' and assigned the individuals to their viduals of P. aequatorialis from further analyses (see Results). respective lineage. We used a strict clock of 0.001 with the 'esti- mate' box checked and the best nucleotide substitution model of

evolution for each locus as determined in ModelTest. The tree was 2.4 | Population structure generated with a Coalescent Constant Population prior because it is particularly suitable for describing intraspecific relationships (Ritchie We studied the population structure of P. aequatorialis by deter- et al., 2017). The tree was constrained with a secondary calibra- mining the number of genetic clusters within its distribution. We tion placed at the stem node between P. aequatorialis and its sister ESCOBAR et al. | 5 species P. tumacana. We used a uniform prior and constrained the was <0.7 (Dormann et al., 2013). We identified variables that could age of the node between the 95% highest posterior density heights potentially limit the distribution of the species. Potential evapo- (95% HPD) of the speciation time between P. aequatorialis and P. tu- transpiration (PET) is one of the strongest predictors of plant diver- macana (see Escobar et al., 2020 for details), which were used as sity patterns at broad scales (Eiserhardt et al., 2011 and references the minimum and maximum values of the prior (3.01–3.76 Ma). We therein). More specifically, low precipitation and low temperatures ran three independent Markov Chain Monte Carlo (MCMC) analyses can constrain the distribution of palms (Eiserhardt et al., 2011). We with 100 million generations sampling every 10,000 trees in BEAST. therefore chose PET of the wettest quarter and PET of the driest We checked that the MCMC runs reached convergence using Tracer quarter from ENVIREM, and precipitation of the driest month (BIO v.1.7 (Rambaut et al., 2018), ensuring that all effective sample sizes 14) and temperature annual range (BIO 7) from WorldClim as our (ESS) >200. The log and tree files of the three runs were combined predictor variables. We did not choose minimum temperature of the with LogCombiner v.2.5.2 (Bouckaert et al., 2019) with a burn-in coldest month (BIO 6) as a predictor variable because it was highly of 10%. TreeAnnotator v.2.5.2 (Bouckaert et al., 2019) was used to correlated with PET of the driest quarter (>0.7). generate a maximum clade credibility (MCC) tree by summarizing the To model the potential current and LGM distributions of P. aequa- remaining 27,003 trees, and to calculate 95% HPD. The tree was vi- torialis, we used the new 'maxnet' R-package to fit a maximum en- sualized in FigTree. tropy (Maxent) model (Phillips et al., 2017). To reduce the effects of sampling bias with this presence-only algorithm, we used a weight- ed-target group approach (Phillips et al., 2009). This approach uses 2.6 | Palaeoecological niche modelling a background sample likely to contain the same sampling bias as the occurrence records for the focal species, allowing the bias to be can- We modelled the ecological niche of the species under current celled out of the fitting and thus more accurate presence-only mod- conditions, and then projected it to the LGM to determine the els can be produced (Baumgartner et al., 2018; Merow et al., 2017; potential effect of Pleistocene glaciations on the distribution of Phillips et al., 2009). For this purpose, we used 1,149 occurrence P. aequatorialis. In addition to occurrence records collected from records of palm species within a 200 km buffer around the records our fieldwork, we downloaded records from four online reposito- of P. aequatorialis. The palm records were downloaded from the ries: the Global Biodiversity Information Facility (GBIF.org, 2020), same sources listed above and cleaned in the same way. The buffer the Berkeley Ecoengine (Ecoengine), the Botanical Information size selected includes environments that are probably accessible to and Ecological Network (BIEN v.4.1) and the Latin American the species given its dispersal limitations and the configuration of Seasonality Dry Tropical Forest Floristic Network (DRYFLOR). natural barriers (Barve et al., 2011). We used the linear, quadratic We accessed occurrence records from the repositories using the and product features in the Maxent model to produce smoother R-packages 'BIEN' (Maitner et al., 2018) and 'spocc' (Chamberlain & response curves that reduce overfitting, which is especially rele- Boettiger, 2017), and direct download (http://www.dryfl​or.info/). vant when projecting to novel conditions (Merow et al., 2013). The We cleaned the downloaded occurrence data before use with the performance of the model was assessed by calculating the average package 'CoordinateCleaner' (Zizka et al., 2019), removing records Continuous Boyce Index (CBI; Hirzel et al., 2006) through a fivefold that met any of the following criteria: recorded prior to 1950; du- systematic spatial blocking technique using the package 'blockCV' plicated records; coordinate uncertainty greater than 5 km; impos- (Valavi et al., 2019). CBI values range between −1 and 1, where sible or unlikely coordinate; within 10 km of a country capital or values >0 indicate the model's output is positively correlated with political unit centroids; and within 100 m of biodiversity institu- the true probability of presence and values <0 indicate it is nega- tions. All occurrence records were spatially thinned at a distance tively correlated with the true probability of presence (Fielding & of at least 10 km to reduce spatial sampling bias using the pack- Bell, 1997). Model fit was inspected by the average omission rate age 'spThin' (Aiello-Lammens et al., 2015). We retained and used based on the minimum training presence value (ORMTP). ORMTP val- a total of 86 records after cleaning and filtration of data, which ues range from 0 for models that are not overfit to 1 for models were transformed to a Behrmann projection to be used along with that are overfit. For the LGM, we averaged the projections of the resampled climatic variables. three General Circulation Models continuous range maps to form a The current (average 1950–2000) and LGM (22 ka) variables were consensus continuous map. Current and LGM continuous suitability downloaded at a spatial resolution of 2.5 arc-minute from WorldClim maps were converted to binary maps indicating suitable/unsuitable 1.4 (https://world​clim.org; Hijmans et al., 2005) and ENVIREM habitat. For this purpose, we used a threshold that maximizes the (http://envir​em.github.io; Title & Bemmels, 2018). Variables were sum of sensitivity and specificity, a frequently recommended ap- then resampled to 5 km resolution using bilinear interpolation with a proach that tends to reflect the prevalence of the modelled species Behrmann projection. Climatic variables from the LGM were down- well (Liu et al., 2013, 2016). We then overlaid the two binary maps loaded for the three General Circulation Models CCSM4 (Gent to show areas of historical climatic suitability that potentially worked et al., 2011), MIROC-ESM (Watanabe et al., 2011) and MPI-ESM-P as refugia during the LGM. We used this overlaid map as an approx- (Giorgetta et al., 2013). To minimize potential issues with collinearity imation of the potential refugia location during previous Pleistocene between predictor variables, we ensured the Pearson's R correlation glacial cycles. 6 | ESCOBAR et al.

2.7 | Determinants of genetic diversity 3 | RESULTS

We explored how palaeoclimatic and geographic factors may 3.1 | Phylogenomic inference have contributed to shape current levels of genetic diversity in P. aequatorialis. For each individual, we obtained the expected het- We obtained a total of 47.167 million reads from the 91 adult in- erozygosity (Hs) using the R-package 'adegenet' (Jombart, 2008) dividuals of P. aequatorialis. We identified 157 loci with >75% of and the index of palaeoclimatic suitability during the LGM its length reconstructed in >75% of individuals. Sixteen loci were from the consensus continuous map using the package 'raster' considered as paralogs and therefore removed, resulting in 141 (Hijmans et al., 2020). We first explored how the 'moist forest' supercontigs. The resulting ASTRAL coalescent tree separated and 'dry refugia' models have potentially shaped genetic diver- most of the P. aequatorialis individuals from the two individuals of sity in the Andean foothills. To do so, we tested whether individ- P. tumacana (Figure 1a). Three individuals sampled in the north- uals from the Andean foothills that acted as Pleistocene refugia ern part of the distribution of P. aequatorialis formed a clade with present lower Hs than the individuals from the foothills that did P. tumacana with high local posterior probability as node support not act as refugia using a Wilcoxon rank sum test in R. We also (pp = 1). The other individuals of P. aequatorialis separated in two performed Pearson's R correlations to regress Hs against values major clades, suggesting the presence of two distinct evolution- of palaeoclimatic suitability during the LGM, latitude and longi- ary units within the species. One of the clades was formed by 79 tude for all individuals. Additionally, genetic differentiation (FST) individuals sampled from northern and central populations (north- between the clusters was calculated with the package 'hierfstat' ern clade), whereas the other clade was formed by nine individu- (Goudet, 2005) using the Nei's method (Thomas et al., 2012). als sampled from southern populations (southern clade). Only the

(a) (b)

FIGURE 1 Intraspecific genetic lineages of the palm Phytelephas aequatorialis in the rain forests of western Ecuador. (a) ASTRAL coalescent tree showing phylogenomic relationships between individuals with local posterior probabilities (pp) as node support. (b) Spatial distribution of the genetic clusters at K = 2 as determined by the DAPC analysis using 88 individuals, and of the individuals used as outgroup in the phylogenomic analyses. The shading represents the Andes and smaller cordilleras in the Pacific coast. The Chimbo River is represented by the light blue line. (c) Membership probability of the 88 individuals at K = 2 as determined by the DAPC analysis ESCOBAR et al. | 7

3.3 | Palaeoecological niche modelling

The mean CBI value was 0.91, indicating that the model perfor- mance is better than a random model and therefore adequate. The

model did not over fit as indicated by the low ORMTP value of 0.02. The model accurately predicted the current known distribution of the species with exception of the northernmost part of the range (Figure 3a). The lowlands of north-western Ecuador appeared un- suitable even though P. aequatorialis has been reported in that area, whereas an area in the Andean foothills of Colombia, where the species has not been reported from, appeared as suitable. Overall, climatic suitability of P. aequatorialis decreased during the LGM and its potential distribution remained in the Pacific coast while decreas- ing on the Andean foothills (Figure 3b). The overlaid map formed FIGURE 2 Dated coalescent tree inferred to estimate the time by the two suitable/unsuitable maps for current and past conditions of divergence between the two intraspecific genetic lineages of the palm Phytelephas aequatorialis in the rain forests of western during the LGM showed two areas of historical climatic suitability Ecuador. The tree was obtained through Bayesian inference (Figure 3c). The first one is located in the Pacific coast while a second in BEAST using a Coalescent Constant Population prior and a smaller area lies on the Andean foothills of south-western Ecuador. StarBeastWithSTACEYops template. Five random individuals of The location of the larger area of historical climatic suitability agrees P. aequatorialis from each genetic lineage were used along with with the spatial distribution of the northern clade/genetic cluster, two individuals of the sister species Phytelephas tumacana as whereas the smaller area agrees with the southern clade/genetic outgroup. The 95% highest posterior density heights (95% HPD) of the time of speciation between P. aequatorialis and P. tumacana cluster. ( 3.01 – 3.76 Ma; Escobar et al., 2020) were used to set a uniform prior to constrain the tree where the arrow indicates. The blue bar represents the 95% HPD of the time estimation and the shade 3.4 | Determinants of genetic diversity represents the time frame of the Pleistocene (2.58 Ma – 11.7 ka)

The nine individuals from the Andean foothills of south-western southern clade showed high node support. The RAxML maximum Ecuador that belong to the southern lineage (Figure 1; Table S1) likelihood tree recovered the same two major clades in P. aequa- showed lower Hs than the 19 individuals from the Andean foot- torialis (Figure S1). Nevertheless, one individual of P. aequatorialis hills of north-western Ecuador that belong to the northern lineage from the northern part of its distribution resolved as sister of the (Wilcoxon rank sum test: W = 11, p = 0.0000278; Figure 4a). We two major clades. The same three individuals of P. aequatorialis did not detect a significant correlation between Hs and the index of 2 from the north formed a clade with P. tumacana with high boot- palaeoclimatic suitability during the LGM (R = 0.0019, p = 0.6854, strap as node support (93). Again, only the southern clade showed t = −0.4064, df = 86; Figure 4b). A significant but weak increase 2 high node support (bootstrap = 100; Figure S1). in Hs with decreasing latitude was inferred (R = 0.099, p = 0.0028, t = 3.0753, df = 86; Figure 4c). Hs was not correlated with longitude 2 (R = 0.0115, p = 0.3183, t = 1.0037, df = 86; Figure 4d). Genetic dif-

3.2 | Population structure and divergence ferentiation (FST) between the two genetic clusters was 0.023. time estimates

A total of 1,394 SNPs were identified from 118 different loci with 4 | DISCUSSION 0% of missing data. We kept 50 principal components after cross- validation, as it was showed appropriate for inferring population Our phylogenomic and population structure analyses revealed structure in DAPC analyses (Figure S2). Two genetic clusters were the presence of two distinct latitudinally oriented evolutionary determined within P. aequatorialis (Figure 1b) based on the values of lineages in the palm species Phytelephas aequatorialis within the BIC and on the analyses of principal components (Figures S3, S4). All rain forests of western Ecuador (Figure 1). Although our intraspe- individuals showed a membership probability to their corresponding cific trees were in general poorly supported, the southern clade cluster >0.9 (Figure 1c). The two resulting clusters are composed by showed high support under the two phylogenomic approaches used the same individuals forming the two major clades in the ASTRAL (Figure 1a; Figure S1). In addition, the individuals from the southern coalescent tree (Figure 1a). The division between the two genetic clade formed a distinct genetic cluster (Figure 1b, 1c), adding more lineages lies around −2.4° in latitude. We estimated that the two ge- evidence to their distinct evolutionary history. A similar pattern of netic clusters diverged during the Pleistocene 1.14 Ma (95% HPD: phylogeographic structure was detected for the widely distributed 1.01 – 2.48; Figure 2). Neotropical palm Oenocarpus bataua, in which two genetic clusters 8 | ESCOBAR et al.

(a) (b) (c)

Climatic suitability Palaeoclimatic suitability Historical climatic suitability

1.0 1.0 2 0.8 0.8 1 0.6 0.6 0 0.4 0.4 0.2 0.2 North 0.0 0.0 South Occurrence records Occurrence records

FIGURE 3 Palaeoecological niche modelling of the palm Phytelephas aequatorialis in the rain forests of western Ecuador using Maxent and a weighted-target group approach for (a) current conditions, and then projected to (b) the Last Glacial Maximum (LGM; 22 ka). Palaeoecological niche during the LGM was modelled under three different General Circulation Models (CCSM4, MIROC-ESM, MPI- ESM-P), which were averaged to construct a consensus continuous map. Dotted line represents the Pacific coastline during the LGM while continuous line represents the current coastline. (c) Areas of historical climatic suitability determined by overlaying binary suitable/ unsuitable maps for current and past conditions (LGM) ranging from 0 (species not present in any of the two periods) to 2 (species present in the two periods). Binary maps were obtained by applying a threshold of maximum specificity and sensitivity to continuous suitability maps. Coordinates of analysed individuals were transformed to Behrmann projection to plot them over the map. Note the overlap of the individuals of the two lineages with areas of historical climatic suitability were also identified in western Ecuador: one located in north-west- The two evolutionary lineages detected for P. aequatorialis di- ern Ecuador/south-western Colombia, and a second one in south- verged 1.14 Ma (95% HPD: 1.01 – 2.48) during the Pleistocene western Ecuador and extending to the east of the Andes (Escobar (Figure 2), a period when glacial cycles impacted the distribution and et al., 2018). Although the genetic structure patterns of both species genetic diversity of species (Hewitt, 2000, 2004). Based on our ge- are not completely identical due to the cross-Andean distribution of netic and palaeoecological niche analyses (Figure 3c), P. aequatorialis O. bataua, they nevertheless suggest the existence of a north–south potentially took refuge in two areas of historical climatic suitability in phylogeographic break in south-western Ecuador. The Chimbo River western Ecuador during the Pleistocene. The spatial distribution of has been proposed as a geographic barrier for some reptiles and am- the recovered genetic lineages overlaps with that of the two inferred phibians in south-western Ecuador (Arteaga et al., 2016), and it lies potential refugia (Figure 3c), indicating that these areas of historical close to the phylogeographical break of P. aequatorialis (Figure 1b). climatic suitability could have played a vicariant role in P. aequato- Nevertheless, individuals that belong to the northern lineage were rialis. Populations from northern and southern lineages apparently sampled south of the river, indicating that the intraspecific diver- became isolated along the narrow strips of the Andean foothills of gence in P. aequatorialis is not caused by this geographic barrier. south-western Ecuador ~1 Ma during a glacial cycle and developed Such north–south phylogeographic breaks have been reported in distinct genetic footprints. Gene flow between northern and south- other tropical rain forests because of differences in past and cur- ern populations could have occurred during the following intergla- rent climate and in geological history, including those from west- cial cycles, hindering speciation between the two genetic lineages as ern Amazonia (Escobar et al., 2018; Jimenez-Vasquez et al., 2017; suggested by their low level of genetic differentiation (FST = 0.023). Reis et al., 2020; Roncal et al., 2015; Symula et al., 2003), eastern The low admixture detected for the three northernmost individuals (Mousalli et al., 2009), central Africa (Bryja et al., 2014; Faye of the southern lineage (Figure 1c) partially supports the existence et al., 2016; Hardy et al., 2013; Helmstetter et al., 2020; Heuertz of gene flow between the two lineages. It is possible that stronger et al., 2014; Ley et al., 2017; Piñeiro et al., 2017) and Australia secondary contact has occurred within areas not sampled here. (Schneider et al., 1998). Analysing the phylogeographic patterns of Therefore, detailed analyses of the populations located between more plant species will help to elucidate whether a phylogeographic the recovered distributions of the two genetic lineages will help break in south-western Ecuador is common for in the region. to elucidate whether gene flow between the lineages has impeded ESCOBAR et al. | 9

(a) (b)

0.45

0.40

0.35 Expected heterozygosity ( Hs ) 0.30 W = 11 R2 = 0.0019 P < 0.0001 P = 0.6854

Andean foothills Andean foothills 0.1 0.200.30.4 .5 0.6 South North Palaeoclimatic suitability

(c) (d)

0.45

0.40

0.35 Expected heterozygosity ( Hs ) 0.30 R2 = 0.099 R2 = 0.0115 P = 0.0028 P = 0.3183

−3 −2 −1 0 1 −80.5−80.0−79.5 −79.0 Latitude Longitude

FIGURE 4 Determinants of genetic diversity of the palm Phytelephas aequatorialis in the rain forests of western Ecuador. (a) Expected heterozygosity (Hs) in the individuals from the Andean foothills of south-western Ecuador, which potentially acted as a Pleistocene refugium, is significantly lower than in the individuals from the northern Andean foothills (Table S1) according to a Wilcoxon rank sum test. Blue and red lines show the median values of Hs. (b) Hs of all individuals is not significantly correlated with the index of palaeoclimatic suitability during the Last Glacial Maximum (22 ka). (c) Hs increases with decreasing latitude, (d) but does not with longitude. Blue circles represent individuals from the northern lineage, whereas red circles represent individuals from the southern lineage, as in Figure 1. Dotted lines show the trend of the correlations speciation. Continuous isolation and secondary contact between populations from the Andean foothills of north-western Ecuador the two lineages probably happened several times during the last migrated down-slope during glacial cycles (Figure 3), in agreement 1 Ma. These contraction/expansion dynamics have promoted intra- with the 'moist forest model' of montane and premontane rain specific divergence in P. aequatorialis during the Pleistocene. This has forest evolution (Ramírez-Barahona & Eguiarte, 2013). Given that been observed in plants and animals from other tropical (Cabanne the Pacific coast remained moist at the time (Hooghiemstra & Van et al., 2016; Carnaval & Bates, 2007; Carnaval et al., 2009; Diniz- Der Hammen, 1998), the populations from the Andean foothills of Filho et al., 2016; Faye et al., 2016; Hardy et al., 2013; Helmstetter north-western Ecuador were able to spread into the lowlands and et al., 2020; de Lima et al., 2014; Thomas et al., 2012; Vasconcellos potentially maintain connectivity with the populations from the et al., 2019) and temperate regions (Blanco-Pastor et al., 2019; Park Chocó refugium. Similarly, palynological data showed that plants & Donoghue, 2019). from the eastern Ecuadorian Andean foothills descended ~1,000 m The northern genetic lineage apparently took refuge along the during glacial cycles (Brunschön & Behling, 2010; Cárdenas humid lowlands and on the small mountain ranges of the Pacific et al., 2011) when western Amazonia remained humid and covered coast, but not along the Andean foothills (Figure 3c). The inferred by rain forest (Colinvaux et al., 2000). High palaeoclimatic suitability refugium for the northern lineage overlaps with the southern sec- in coastal regions has also been detected in the Brazilian Atlantic re- tion of the suggested Chocó refugium (Gentry, 1982; Prance, 1982), gion (Carnaval et al., 2009; Mousalli et al., 2009) and in central Africa which could have extended further south to the Pacific coast of (Faye et al., 2016; Helmstetter et al., 2020), indicating that coastal western Ecuador. Our palaeoclimatic analyses suggest that the areas might have been important refugia for tropical rain forests. 10 | ESCOBAR et al.

We also inferred that the southern genetic lineage potentially climatic fluctuations on intraspecific diversity during the LGM varied took refuge in a narrow strip of Andean foothills in south-western between regions and even was different among closely related spe- Ecuador over 1,000 m.a.s.l. (Figure 3c; Table S1). These populations cies within a same region. Thus, the current levels of genetic diver- could have survived the Pleistocene isolated on the slopes of the sity in P. aequatorialis have been potentially shaped by the isolation Andes because they were unable to migrate down-slope into the of individuals in the Andean foothills during glacial cycles and by the lowlands of south-western Ecuador, which remained arid during re- species' colonization history. cent glacial cycles (Heusser & Shackleton, 1994). Thus, the southern Phylogenomic relationships and geographical limits between P. lineage evolved into a distinct genetic group following isolation as aequatorialis and its sister species, P. tumacana, remain to be elu- shown by our phylogenomic (Figure 1a; Figure S1) and population cidated. Phytelephas tumacana occurs in south-western Colombia structure analyses (Figure 1b, 1c). In contrast with the northern (Bernal & Galeano, 2010) at ~50 km from the three individuals lineage, these results support the 'dry refugia model' (Ramírez- of P. aequatorialis with which it formed a highly supported clade Barahona & Eguiarte, 2013) in the premontane rain forests of (Figure 1a; Figure S1). This suggests that P. tumacana could also south-western Ecuador. Thus, the Pleistocene climatic fluctuations occur in north-western Ecuador and we could have sampled it incor- could have affected contiguous rain forests differently depending rectly as P. aequatorialis. There is also the possibility that we sampled on their latitudinal location. A similar response of the Andes may a hybrid population between the two species, given the close prox- have occurred next to the dry inter-Andean valleys of Colombia imity of their geographic ranges. One of the northernmost individu- where palynological data suggested that the Andean foothills con- als of P. aequatorialis resolved with high support as a sister clade of stituted safe havens for rain forests (Bush et al., 2011; Hooghiemstra the two intraspecific genetic lineages in the maximum likelihood tree & Van Der Hammen, 2004). Rain forests also have remained stable (Figure S1), which adds uncertainty to the identity of P. aequatorialis throughout the Pleistocene in the Eastern Arc Mountains of Kenya populations in north-western Ecuador. In addition, the inability of and Tanzania where the lowlands were suggested to be dry during our palaeoecological niche modelling to predict the current distribu- glacial cycles (Finch et al., 2009; Mumbi et al., 2008). Additional tion of P. aequatorialis in north-western Ecuador suggests that differ- studies on the genetic composition and palaeoclimatic niche of plant ent climatic conditions could be involved in its differentiation with P. and animal species are necessary to determine whether the Andean tumacana. Detailed genetic, morphological and phenological studies foothills of south-western Ecuador were indeed a refugium for rain must be performed to these populations to confirm their taxonomic forest species. identity. The reduced genetic diversity detected in the individuals from the Andean foothills of south-western Ecuador (Figure 4a), an area that potentially acted as a Pleistocene refugium, provides further 5 | CONCLUSIONS evidence for the 'dry refugia model' acting on the Andean foothills of south-western Ecuador. These individuals may present lower This is the first demonstration of the potential effects of genetic diversity compared to the individuals from the northern Pleistocene climatic fluctuations on the distribution and genetic Andean foothills, which did not act as forest refugia, because of patterns of a plant species endemic to the rain forests of west- geographic isolation and potential population reduction during the ern Ecuador using nuclear genomic data. Our results support the Pleistocene (Ramírez-Barahona & Eguiarte, 2013). Nevertheless, it role of Pleistocene climatic fluctuations and areas of historical cli- is also possible that this decrease in genetic diversity is due to dif- matic suitability in promoting intraspecific divergence in tropical ferences in the number of individuals analysed and, therefore, these species. Phytelephas aequatorialis shows a discrete north–south results should remain as exploratory. In addition, genetic diversity distribution of its intraspecific genetic lineages estimated to have follows a geographic pattern by increasing jointly with decreasing originated ~1 Ma. This divergence was potentially caused by the latitude (Figure 4c). This suggests that P. aequatorialis could have col- isolation of populations into two rain forest refugia during glacial onized western Ecuador from the north because genetic diversity cycles, as suggested by the palaeoecological niche of P. aequato- tends to decrease when species expand their ranges (Hewitt, 1999, rialis during the LGM. The Chocó refugium could have extended 2000, 2004; Park & Donoghue, 2019; Petit et al., 2003; Zinck further south to the Pacific coast of western Ecuador and could & Rajora, 2016). In contrast, the palaeoclimatic suitability index have included the populations of P. aequatorialis that descended during the LGM did not influence genetic diversity in P. aequatorialis from the northern Andean foothills. The Andean foothills of (Figure 4b), indicating that areas with more suitable palaeoclimatic south-western Ecuador may be an important area for rain forest conditions do not necessarily harbour higher current genetic diver- evolution because they could have remained climatically suitable sity. A similar pattern was observed in the African rain forests for through the Pleistocene. This promoted intraspecific divergence the palm Podococcus acaulis using unique haplotypes as a measure and decreased genetic diversity in the populations of P. aequatori- of genetic diversity (Faye et al., 2016). However, the sister species alis from that region. Therefore, this study also provides phylogeo- Podococcus barteri showed higher levels of unique genetic diversity graphic and palaeoclimatic evidence for the 'moist forest' and 'dry in areas with high palaeoclimatic suitability during the LGM in the refugia' models acting on the Andean foothills of western Ecuador same rain forests (Faye et al., 2016). This suggests that the impact of (Ramírez-Barahona & Eguiarte, 2013), showing that contiguous ESCOBAR et al. | 11 premontane rain forests could have been affected differently by processing of targeted enriched Illumina sequences, from raw reads to alignments. PeerJ, 2018(7), 1–15. https://doi.org/10.7717/ Pleistocene climatic fluctuations. Our study shows that the rain peerj.5175 forests of south-western Ecuador should be further studied and Arteaga, A., Pyron, R. A., Peñafiel, N., Romero-Barreto, P., Culebras, J., protected to preserve its unique intraspecific genetic diversity Bustamante, L., Yánez-Muñoz, M. H., & Guayasamin, J. M. (2016). that has potentially originated during the Pleistocene. Comparative phylogeography reveals cryptic diversity and repeated patterns of cladogenesis for amphibians and reptiles in Northwestern Ecuador. PLoS One, 11(4), e0151746. https://doi.org/10.1371/journ​ ACKNOWLEDGEMENTS al.pone.0151746 We thank Ecuador's Ministerio del Ambiente for providing the sam- Barfod, A. S. (1991). A monographic study of the subfamily pling permit to undertake this research (MAE-DNB-CM-2018-0082). Phytelephantoideae (Arecaceae). Opera Botanica, 105, 1–73. We also thank Rodrigo Bernal for providing leaf material. This re- Barfod, A. S., Trénel, P., & Borchsenius, F. (2010). Drivers of diversifi- cation in the palms (Arecaceae: , s e a r c h w a s f i n a n c i a l l y s u p p o r te d by t h e S EN E S C Y T (P h D s c h o l a r s h i p Phytelepheae) – Vicariance or adaptive shifts in niche traits? In O. to S.E.), the Independent Research Fund Denmark (9040-00136B Seberg, G. Peterson, A. Barfod, & J. Davis (Eds.), Diversity, Phylogeny, to H.B.), the Agence Nationale de la Recherche (ANR-15- CE02- and Evolution in the (pp. 225–243). 0002-01 to T.L.P.C.) and the International Mixed Laboratory (LMI) Barve, N., Barve, V., Jiménez-Valverde, A., Lira-Noriega, A., Maher, S. P., Peterson, A. T., Soberón, J., & Villalobos, F. (2011). The crucial role of BIO_INCA. We are also very grateful to the generous financial sup- the accessible area in ecological niche modeling and species distribu- port provided by the International Palm Society (IPS) for funding a tion modeling. Ecological Modelling, 222(11), 1810–1819. https://doi. field trip to western Ecuador without which part of the sampling org/10.1016/j.ecolm​odel.2011.02.011 would not have been possible (field grant to S.E.). We dedicate Baumgartner, J. B., Esperón-Rodríguez, M., & Beaumont, L. J. (2018). Identifying in situ climate refugia for plant species. Ecography, 41(11), this article to Dr. Calaway H. Dodson (1928-2020), botanist and 1850–1863. https://doi.org/10.1111/ecog.03431 orchidologist, who worked for many years on the flora of Ecuador Behling, H., Hooghiemstra, H., & Negret, A. J. (1998). Holocene history and authored the five volumes of 'Orchids of Ecuador'. He was also of the Chocó rain forest from Laguna Piusbi, southern Pacific low- the first Honorary Director of the Herbario Nacional del Ecuador lands of Colombia. Quaternary Research, 50(3), 300–308. https://doi. org/10.1006/qres.1998.1998 (QCNE), now part of the Museo Ecuatoriano de Ciencias Naturales Bernal, R., & Galeano, G. (2010). Palmas de Colombia: guía de campo. in Quito, Ecuador. Panamericana Formas e Impresos S.A. Blanco-Pastor, J. L., Manel, S., Barre, P., Roschanski, A. M., Willner, E., DATA AVAILABILITY STATEMENT Dehmer, K. J., & Sampoux, J. P. (2019). Pleistocene climate changes, The fastq sequences (R1 and R2) for all individuals have been submit- and not agricultural spread, accounts for range expansion and admix- ture in the dominant grassland species Lolium perenne L. Journal of ted to Genbank SRA under the BioProject PRJNA660607 https:// Biogeography, 46(7), 1451–1465. https://doi.org/10.1111/jbi.13587 www.ncbi.nlm.nih.gov/sra/PRJNA​660607. Additional data rele- Borchsenius, F., Borgtoft Pedersen, H., & Balslev, H. (1998). Manual to the vant to this study can be found at Dryad https://doi.org/10.5061/ Palms of Ecuador. AAU Reports 37. dryad.98sf7​m0gz. Bioinformatic scripts used for this study can be Bouckaert, R., Vaughan, T. G., Barido-Sottani, J., Duchêne, S., Fourment, M., Gavryushkina, A., Heled, J., Jones, G., Kühnert, D., De Maio, found at https://github.com/ajhel​mstet​ter/afrodyn and at https:// N., Matschiner, M., Mendes, F. K., Müller, N. F., Ogilvie, H. A., du github.com/sesco​barva​squez/​tagua. Plessis, L., Popinga, A., Rambaut, A., Rasmussen, D., Siveroni, I., … Drummond, A. J. (2019). BEAST 2.5: An advanced software platform ORCID for Bayesian evolutionary analysis. PLoS Computational Biology, 15(4), 1–28. https://doi.org/10.1371/journa​ l.pcbi.1006650 Sebastián Escobar https://orcid.org/0000-0002-4918-7967 Brown, J. W., Walker, J. F., & Smith, S. A. (2017). Phyx: Phylogenetic tools Andrew J. Helmstetter https://orcid.org/0000-0003-3761-4981 for unix. Bioinformatics, 33(12), 1886–1888. https://doi.org/10.1093/ Scott Jarvie https://orcid.org/0000-0002-0086-2351 bioin​forma​tics/btx063 Rommel Montúfar https://orcid.org/0000-0001-5309-4889 Brunschön, C., & Behling, H. (2010). Reconstruction and visualization of upper forest line and vegetation changes in the Andean depression Henrik Balslev https://orcid.org/0000-0002-7101-7120 region of southeastern Ecuador since the Last Glacial Maximum – A Thomas L.P. Couvreur https://orcid.org/0000-0002-8509-6587 multi-site synthesis. Review of Palaeobotany and Palynology, 163(1–2), 139–152. https://doi.org/10.1016/j.revpa​lbo.2010.10.005 REFERENCES Bryja, J., Mikula, O., Patzenhauerová, H., Oguge, N. O., Šumbera, R., & Verheyen, E. (2014). The role of dispersal and vicariance in the Abadi, S., Azouri, D., Pupko, T., & Mayrose, I. (2019). Model selection Pleistocene history of an East African mountain rodent, Praomys may not be a mandatory step for phylogeny reconstruction. Nature delectorum. Journal of Biogeography, 41(1), 196–208. https://doi. Communications, 10(1), https://doi.org/10.1038/s4146​7-019-08822​-w org/10.1111/jbi.12195 Acosta-Solís, M. (1948). Tagua or vegetable ivory–a forest product of Bush, M. B. (1994). Amazonian speciation: A necessarily complex model. Ecuador. Economic Botany, 2(1), 46–57. https://doi.org/10.1007/ Journal of Biogeography, 21(1), 5. https://doi.org/10.2307/2845600 BF029​07918 Bush, M. B., Hanselman, J. A., & Hooghiemstra, H. (2011). Andean mon- Aiello-Lammens, M. E., Boria, R. A., Radosavljevic, A., Vilela, B., & tane forests and climate change. In M. Bush, J. Flenley, & W. Gosling Anderson, R. P. (2015). spThin: An R package for spatial thinning (Eds.), Tropical Rainforest Responses to Climatic Change (2nd ed.), (pp. of species occurrence records for use in ecological niche models. 1689–1699). doi: https://doi.org/10.1007/978-3-642-05383​-2 Ecography, 38(5), 541–545. https://doi.org/10.1111/ecog.01132 Cabanne, G. S., Calderón, L., Trujillo Arias, N., Flores, P., Pessoa, Andermann, T., Cano, Á., Zizka, A., Bacon, C., & Antonelli, A. (2018). R., d'Horta, F. M., & Miyaki, C. Y. (2016). Effects of Pleistocene SECAPR-A bioinformatics pipeline for the rapid and user-friendly 12 | ESCOBAR et al.

climate changes on species ranges and evolutionary processes in the diversity and distributions across spatial scales. Annals of Botany, Neotropical Atlantic Forest. Biological Journal of the Linnean Society, 108(8), 1391–1416. https://doi.org/10.1093/aob/mcr146 119(4), 856–872. https://doi.org/10.1111/bij.12844 Escobar, S., Helmstetter, A. J., Montúfar, R., Couvreur, T. L. P., & Balslev, Cárdenas, M. L., Gosling, W. D., Sherlock, S. C., Poole, I., Pennington, R. H. (2020). Phylogenomic relationships and historical biogeography in T., & Mothes, P. (2011). The response of vegetation on the Andean the South American vegetable ivory palms (Phytelepheae). BioRxiv, Flank in Western Amazonia to Pleistocene Climate Change. Science, https://doi.org/10.1101/2020.09.03.280941 333(6051), 1055–1059. https://doi.org/10.1126/scien​ce.1207888 Escobar, S., Pintaud, J., Balslev, H., Bernal, R., Moraes Ramírez, M., Carnaval, A. C., & Bates, J. M. (2007). Amphibian DNA shows marked Millán, B., & Montúfar, R. (2018). Genetic structuring in a Neotropical genetic structure and tracks Pleistocene climate change in palm analyzed through an Andean orogenesis-scenario. Ecology and northeastern Brazil. Evolution, 61(12), 2942–2957. https://doi. Evolution, 8(16), 8030–8042. https://doi.org/10.1002/ece3.4216 org/10.1111/j.1558-5646.2007.00241.x Excoffier, L., Foll, M., & Petit, R. J. (2009). Genetic consequences Carnaval, A. C., Hickerson, M. J., Haddad, C. F. B., Rodrigues, M. T., & of range expansions. Annual Review of Ecology, Evolution, and Moritz, C. (2009). Stability predicts genetic diversity in the Brazilian Systematics, 40, 481–501. https://doi.org/10.1146/annur​ev.ecols​ Atlantic Forest Hotspot. Science, 323(5915), 785–789. https://doi. ys.39.110707.173414 org/10.1126/scien​ce.1166955 Faye, A., Deblauwe, V., Mariac, C., Richard, D., Sonké, B., Vigouroux, Y., Castresana, J. (2000). Selection of conserved blocks from multiple align- & Couvreur, T. L. P. (2016). Phylogeography of the genus Podococcus ments for their use in phylogenetic analysis. Molecular Biology and (Palmae/Arecaceae) in Central African rain forests: Climate stability pre- Evolution, 17(4), 540–552. https://doi.org/10.1093/oxfor​djour​nals. dicts unique genetic diversity. Molecular Phylogenetics and Evolution, 105, molbev.a026334 126–138. https://doi.org/10.1016/j.ympev.2016.08.005 Cerón, C., Palacios, W. A., Valencia, R., & Sierra, R. (1999). Propuesta pre- Ferrer-Paris, J. R., Zager, I., Keith, D. A., Oliveira-Miranda, M. A., liminar de un sistema de clasificación de vegetación para el Ecuador. Rodríguez, J. P., Josse, C., González-Gil, M., Miller, R. M., Zambrana- In R. Sierra (Ed.), Las Formaciones Naturales de la Costa del Ecuador. Torrelio, C., & Barrow, E. (2019). An ecosystem risk assessment of doi: https://doi.org/10.13140/​2.1.4520.9287 temperate and tropical forests of the with an outlook on Chamberlain, S., & Boettiger, C. (2017). R Python, and Ruby clients for future conservation strategies. Conservation Letters, 12(2), 1–10. GBIF species occurrence data. PeerJ Preprints, 5, e3304v1. https:// https://doi.org/10.1111/conl.12623 doi.org/10.7287/peerj.prepr​ints.3304v1 Fielding, A. H., & Bell, J. F. (1997). A review of methods for the assess- Colinvaux, P. A., De Oliveira, P. E., & Bush, M. B. (2000). Amazonian and ment of prediction errors in conservation presence/absence models. neotropical plant communities on glacial time-scales: The failure of Environmental Conservation, 24(1), 38–49. https://doi.org/10.1017/ the aridity and refuge hypotheses. Quaternary Science Reviews, 19(1– S0376​89299​7000088 5), 141–169. https://doi.org/10.1016/S0277​-3791(99)00059​-1 Finch, J., Leng, M. J., & Marchant, R. (2009). Late Quaternary vegeta- Colinvaux, P. A., De Oliveira, P. E., Moreno, J. E., Miller, M. C., & Bush, tion dynamics in a biodiversity hotspot, the Uluguru Mountains M. B. (1996). A long pollen record from Lowland Amazonia: Forest of Tanzania. Quaternary Research, 72(1), 111–122. https://doi. and cooling in glacial times. Science, 274(5284), 85–88. https://doi. org/10.1016/j.yqres.2009.02.005 org/10.1126/scienc​ e.274.5284.85 GBIF.org (November 2020) GBIF Occurrence Download https://doi. Couvreur, T. L. P., Helmstetter, A. J., Koenen, E. J. M., Bethune, K., org/10.15468/​dl.r3aacm Brandão, R. D., Little, S. A., Sauquet, H., & Erkens, R. H. J. (2019). Gent, P. R., Danabasoglu, G., Donner, L. J., Holland, M. M., Hunke, E. C., Phylogenomics of the major tropical plant family Annonaceae using Jayne, S. R., Lawrence, D. M., Neale, R. B., Rasch, P. J., Vertenstein, targeted enrichment of nuclear genes. Frontiers in Plant Science, 9, M., Worley, P. H., Yang, Z.-L., & Zhang, M. (2011). The community cli- https://doi.org/10.3389/fpls.2018.01941 mate system model version 4. Journal of Climate, 24(19), 4973–4991. de Lima, N. E., Lima-Ribeiro, M. S., Tinoco, C. F., Terribile, L. C., & https://doi.org/10.1175/2011J ​C L I 4 0 ​8 3 . 1 Collevatti, R. G. (2014). Phylogeography and ecological niche model- Gentry, A. H. (1982). Neotropical floristic diversity: Phytogeographical ling, coupled with the fossil pollen record, unravel the demographic connections between central and South America, Pleistocene cli- history of a Neotropical swamp palm through the Quaternary. Journal matic fluctuations, or an accident of the Andean Orogeny? Annals of Biogeography, 41(4), 673–686. https://doi.org/10.1111/jbi.12269 of the Missouri Botanical Garden, 69(3), 557–593. https://doi. Diniz-Filho, J. A. F., Barbosa, A. C. O. F., Collevatti, R. G., Chaves, L. J., org/10.2307/2399084 Terribile, L. C., Lima-Ribeiro, M. S., & Telles, M. P. C. (2016). Spatial Gentry, A. H. (1986). Species richness and floristic composition of Chocó autocorrelation analysis and ecological niche modelling allows infer- Region plant communities. Caldasia, 15(71–75), 71–91. https://doi. ence of range dynamics driving the population genetic structure of org/10.15446/​caldasia a Neotropical savanna tree. Journal of Biogeography, 43(1), 167–177. Giorgetta, M. A., Jungclaus, J., Reick, C. H., Legutke, S., Bader, J., https://doi.org/10.1111/jbi.12622 Böttinger, M., Brovkin, V., Crueger, T., Esch, M., Fieg, K., Glushak, K., Dodson, C. H., & Gentry, A. H. (1978). Flora of Río Palenque (Vol. 4; Gayler, V., Haak, H., Hollweg, H.-D., Ilyina, T., Kinne, S., Kornblueh, Selbyana, Ed.). doi: https://www.jstor.org/stabl​e/41759574 L., Matei, D., Mauritsen, T., … Stevens, B. (2013). Climate and carbon Dodson, C. H., & Gentry, A. H. (1991). Biological extinction in Western cycle changes from 1850 to 2100 in MPI-ESM simulations for the Ecuador. Annals of the Missouri Botanical Garden, 78(2), 273–295. coupled model intercomparison project phase 5. Journal of Advances https://doi.org/10.2307/2399563 in Modeling Earth Systems, 5(3), 572–597. https://doi.org/10.1002/ Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, jame.20038 G., Marquéz, J. R. G., Gruber, B., Lafourcade, B., Leitão, P. J., Goudet, J. (2005). Hierfstat, a package for R to compute and test hier- Münkemüller, T., McClean, C., Osborne, P. E., Reineking, B., Schröder, archical F-statistics. Molecular Ecology Notes, 5(1), 184–186. https:// B., Skidmore, A. K., Zurell, D., & Lautenbach, S. (2013). Collinearity: doi.org/10.1111/j.1471-8286.2004.00828.x A review of methods to deal with it and a simulation study eval- Haffer, J. (1969). Speciation in Amazonian forest birds. Science, 165(3889), uating their performance. Ecography, 36(1), 27–46. https://doi. 131–137. org/10.1111/j.1600-0587.2012.07348.x Hardy, O. J., Born, C., Budde, K., Daïnou, K., Dauby, G., Duminil, J., Eiserhardt, W. L., Svenning, J. C., Kissling, W. D., & Balslev, H. (2011). Ewédjé, E.-E., Gomez, C., Heuertz, M., Koffi, G. K., Lowe, A. J., Geographical ecology of the palms (Arecaceae): Determinants of Micheneau, C., Ndiade-Bourobou, D., Piñeiro, R., & Poncet, V. (2013). ESCOBAR et al. | 13

Comparative phylogeography of African rain forest trees: A review Huicungo (Arecaceae) and evaluation of potential anatomical adap- of genetic signatures of vegetation history in the Guineo-Congolian tations. Botanical Journal of the Linnean Society, 185(3), 291–306. region. Comptes Rendus Geoscience, 345(7–8), 284–296. https://doi. https://doi.org/10.1093/botli​nnean/​box060 org/10.1016/j.crte.2013.05.001 Johnson, M. G., Gardner, E. M., Liu, Y., Medina, R., Goffinet, B., Shaw, Heine, K. (2000). Tropical South America during the Last Glacial A. J., Zerega, N. J. C., & Wickett, N. J. (2016). HybPiper: Extracting Maximum: Evidence from glacial, periglacial and fluvial records. coding sequence and introns for phylogenetics from high-through- Quaternary International, 72(1), 7–21. https://doi.org/10.1016/S1040​ put sequencing reads using target enrichment. Applications in Plant -6182(00)00017​-3 Sciences, 4(7), 1600016. https://doi.org/10.3732/apps.1600016 Helmstetter, A. J., Amoussou, B. E. N., Bethune, K., Kandem, N. G., Glèlè Jombart, T. (2008). Adegenet: A R package for the multivariate analysis Kakaï, R., Sonké, B., & Couvreur, T. L. P. (2020). Phylogenomic ap- of genetic markers. Bioinformatics, 24(11), 1403–1405. https://doi. proaches reveal how climate shapes patterns of genetic diversity in org/10.1093/bioin​forma​tics/btn129 an African rain forest tree species. Molecular Ecology, 29(18), 1–14. Jombart, T., Devillard, S., & Balloux, F. (2010). Discriminant analysis of https://doi.org/10.1111/mec.15572 principal components: A new method for the analysis of geneti- Heuertz, M., Duminil, J., Dauby, G., Savolainen, V., & Hardy, O. J. (2014). cally structured populations. BMC Genetics, 11(1), 94. https://doi. Comparative phylogeography in rainforest trees from Lower Guinea, org/10.1186/1471-2156-11-94 Africa. PLoS One, 9(1), e84307. https://doi.org/10.1371/journ​ Katoh, K., & Standley, D. M. (2013). MAFFT multiple sequence align- al.pone.0084307 ment software version 7: Improvements in performance and us- Heusser, L. E., & Shackleton, N. J. (1994). Tropical climatic vatiation on ability. Molecular Biology and Evolution, 30(4), 772–780. https://doi. the Pacific Slopes of the Ecuadorian Andes Based on a 25,000-year org/10.1093/molbe​v/mst010 pollen record from deep-sea sediment core Tri 163–31B. Quaternary Leite, Y. L. R., Costa, L. P., Loss, A. C., Rocha, R. G., Batalha-Filho, Research, 42, 222–225. H., Bastos, A. C., Quaresma, V. S., Fagundes, V., Paresque, R., Hewitt, G. M. (1999). Post-glacial re-colonization of European biota. Passamani, M., & Pardini, R. (2016). Neotropical forest expan- Biological Journal of the Linnean Society, 68, 87–112. https://doi. sion during the last glacial period challenges refuge hypothe- org/10.1111/j.1095-8312.1999.tb011​60.x sis. Proceedings of the National Academy of Sciences of the United Hewitt, G. M. (2000). The genetic legacy of the Quaternary ice ages. States of America, 113(4), 1008–1013. https://doi.org/10.1073/ Nature, 405(6789), 907–913. https://doi.org/10.1038/35016000 pnas.15130​62113 Hewitt, G. M. (2004). Genetic consequences of climatic oscillations in Ley, A. C., Heuertz, M., & Hardy, O. J. (2017). The evolutionary history the Quaternary. Philosophical Transactions of the Royal Society of of central African rain forest plants: Phylogeographical insights from London. Series B: Biological Sciences, 359(1442), 183–195. https://doi. sister species in the climber genus Haumania (Marantaceae). Journal org/10.1098/rstb.2003.1388 of Biogeography, 44(2), 308–321. https://doi.org/10.1111/jbi.12902 Heyduk, K., Trapnell, D. W., Barrett, C. F., & Leebens-Mack, J. (2016). Li, H. (2011). A statistical framework for SNP calling, mutation discovery, Phylogenomic analyses of species relationships in the genus Sabal association mapping and population genetical parameter estimation (Arecaceae) using targeted sequence capture. Biological Journal of the from sequencing data. Bioinformatics, 27(21), 2987–2993. https:// Linnean Society, 117(1), 106–120. https://doi.org/10.1111/bij.12551 doi.org/10.1093/bioin​forma​tics/btr509 Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., & Jarvis, A. (2005). Li, H., & Durbin, R. (2009). Fast and accurate short read alignment with Very high resolution interpolated climate surfaces for global land Burrows-Wheeler transform. Bioinformatics, 25(14), 1754–1760. areas. International Journal of Climatology, 25(15), 1965–1978. https:// https://doi.org/10.1093/bioin​forma​tics/btp324 doi.org/10.1002/joc.1276 Lischer, H. E. L., & Excoffier, L. (2012). PGDSpider: An automated data Hijmans, R. J., van Etten, J., Sumner, M., Cheng, J., Bevan, A., Bevan, R., & conversion tool for connecting population genetics and genomics Greenberg, J. A. (2020). Package “raster”. R Package, 3(3-13), 1–249. programs. Bioinformatics, 28(2), 298–299. https://doi.org/10.1093/ Retrieved from https://cran.r-projec​ t.org/web/packa​ges/raste​r/ras- bioin​forma​tics/btr642 ter.pdf Liu, C., Newell, G., & White, M. (2016). On the selection of thresholds for Hirzel, A. H., Le Lay, G., Helfer, V., Randin, C., & Guisan, A. (2006). predicting species occurrence with presence-only data. Ecology and Evaluating the ability of habitat suitability models to predict spe- Evolution, 6(1), 337–348. https://doi.org/10.1002/ece3.1878 cies presences. Ecological Modelling, 199(2), 142–152. https://doi. Liu, C., White, M., & Newell, G. (2013). Selecting thresholds for the pre- org/10.1016/j.ecolm​odel.2006.05.017 diction of species occurrence with presence-only data. Journal of Hooghiemstra, H., & Van Der Hammen, T. (1998). Neogene and Quaternary Biogeography, 40(4), 778–789. https://doi.org/10.1111/jbi.12058 development of the neotropical rain forest: The forest refugia hypoth- Maitner, B. S., Boyle, B., Casler, N., Condit, R., Donoghue, J., Durán, esis, and a literature overview. Earth Science Reviews, 44(3–4), 147– S. M., & Enquist, B. J. (2018). The BIEN R package: A tool to ac- 183. https://doi.org/10.1016/S0012​-8252(98)00027​-0 cess the botanical information and ecology network (BIEN) data- Hooghiemstra, H., & Van Der Hammen, T. (2004). Quaternary Ice-Age base. Methods in Ecology and Evolution, 9(2), 373–379. https://doi. dynamics in the Colombian Andes: Developing an understand- org/10.1111/2041-210X.12861 ing of our legacy. Philosophical Transactions of the Royal Society B: Mariac, C., Scarcelli, N., Pouzadou, J., Barnaud, A., Billot, C., Faye, Biological Sciences, 359(1442), 173–181. https://doi.org/10.1098/ A., Kougbeadjo, A., Maillol, V., Martin, G., Sabot, F., Santoni, S., rstb.2003.1420 Vigouroux, Y., & Couvreur, T. L. P. (2014). Cost-effective enrich- Hoorn, C., Wesselingh, F. P., ter Steege, H., Bermudez, M. A., Mora, A., ment hybridization capture of chloroplast genomes at deep multi- Sevink, J., Sanmartin, I., Sanchez-Meseguer, A., Anderson, C. L., plexing levels for population genetics and phylogeography stud- Figueiredo, J. P., Jaramillo, C., Riff, D., Negri, F. R., Hooghiemstra, ies. Molecular Ecology Resources, 14(6), 1103–1113. https://doi. H., Lundberg, J., Stadler, T., Sarkinen, T., & Antonelli, A. (2010). org/10.1111/1755-0998.12258 Amazonia through time: Andean uplift, climate change, landscape Martin, M. (2011). Cutadapt removes adapter sequences from evolution, and biodiversity. Science, 330(6006), 927–931. https://doi. high-throughput sequencing reads. EMBnet.Journal, 17(1), 10. org/10.1126/scienc​ e.1194585 https://doi.org/10.14806/​ej.17.1.200 Jimenez-Vasquez, V., Millán, B., Machahua, M., Kahn, F., Ramirez, R., McKenna, A., Hanna, M., Banks, E., Sivachenko, A., Cibulskis, K., Kernytsky, Pintaud, J. C., & Roncal, J. (2017). Dry season characteristics in A., Garimella, K., Altshuler, D., Gabriel, S., Daly, M., & DePristo, M. A. western Amazonia underlie the divergence of Astrocaryum section (2010). The genome analysis toolkit: A MapReduce framework for 14 | ESCOBAR et al.

analyzing next-generation DNA sequencing data. Genome Research, Rohland, N., & Reich, D. (2012). Cost-effective, high-throughput DNA se- 20(9), 1297–1303. https://doi.org/10.1101/gr.107524.110 quencing libraries for multiplexed target capture. Genome Research, Merow, C., Bois, S. T., Allen, J. M., Xie, Y., & Silander, J. A. (2017). Climate 22(5), 939–946. https://doi.org/10.1101/gr.128124.111 change both facilitates and inhibits invasive plant ranges in New Roncal, J., Couderc, M., Baby, P., Kahn, F., Millán, B., Meerow, A. W., & England. Proceedings of the National Academy of Sciences of the United Pintaud, J. C. (2015). Palm diversification in two geologically con- States of America, 114(16), E3276–E3284. https://doi.org/10.1073/ trasting regions of western Amazonia. Journal of Biogeography, 42(8), pnas.16096​33114 1503–1513. https://doi.org/10.1111/jbi.12518 Merow, C., Smith, M. J., & Silander, J. A. (2013). A practical guide to Schneider, C. J., Cunningham, M., & Moritz, C. (1998). Comparative MaxEnt for modeling species' distributions: What it does, and why phylogeography and the history of endemic vertebrates in the Wet inputs and settings matter. Ecography, 36(10), 1058–1069. https:// Tropics rainforests of Australia. Molecular Ecology, 7(4), 487–498. doi.org/10.1111/j.1600-0587.2013.07872.x https://doi.org/10.1046/j.1365-294x.1998.00334.x Mousalli, A., Moritz, C., Williams, S. E., & Carnaval, A. C. (2009). Variable Sierra, R. (2013). Patrones y factores de deforestación en el Ecuador con- responses of skinks to a common history of rainforest fluctua- tinental, 1990–2010. Y un acercamiento a los próximos 10 años. : tion: Concordance between phylogeography and palaeo-distri- Conservación Internacional y Forest Trends. bution models. Molecular Ecology, 18(3), 483–499. https://doi. Smith, S. A., Brown, J. W., & Walker, J. F. (2018). So many genes, so little org/10.1111/j.1365-294X.2008.04035.x time: A practical approach to divergence-time estimation in the ge- Mumbi, C. T., Marchant, R., Hooghiemstra, H., & Wooller, M. J. (2008). nomic era. PLoS One, 13(5), e0197433. https://doi.org/10.1371/journ​ Late Quaternary vegetation reconstruction from the Eastern Arc al.pone.0197433 Mountains. Tanzania. Quaternary Research, 69(2), 326–341. https:// Stamatakis, A. (2014). RAxML version 8: A tool for phylogenetic analysis doi.org/10.1016/j.yqres.2007.10.012 and post-analysis of large phylogenies. Bioinformatics, 30(9), 1312– Olson, D. M., Dinerstein, E., Wikramanayake, E. D., Burgess, N. D., Powell, 1313. https://doi.org/10.1093/bioin​forma​tics/btu033 G. V. N., Underwood, E. C., & Kassem, K. R. (2001). Terrestrial ecore- Stamatakis, A. (2016). The RAxML v8.2.X Manual. Manual/Tutorial, pp. gions of the World: A new map of life on earth. BioScience, 51(11), 1–61. 933. Symula, R., Schulte, R., & Summers, K. (2003). Molecular systemat- Park, B., & Donoghue, M. J. (2019). Phylogeography of a widespread east- ics and phylogeography of Amazonian poison frogs of the genus ern North American shrub. Viburnum Lantanoides. American Journal of Dendrobates. Molecular Phylogenetics and Evolution, 26(3), 452–475. Botany, 106(3), 389–401. https://doi.org/10.1002/ajb2.1248 https://doi.org/10.1016/S1055​-7903(02)00367​-6 Petit, R. J., Aguinagalde, I., De Beaulieu, J. L., Bittkau, C., Brewer, S., Thomas, E., van Zonneveld, M., Loo, J., Hodgkin, T., Galluzzi, G., & van Cheddadi, R., & Vendramin, G. G. (2003). Glacial refugia: Hotspots Etten, J. (2012). Present spatial diversity patterns of Theobroma cacao but not melting pots of genetic diversity. Science, 300(5625), 1563– L. in the Neotropics reflect genetic differentiation in Pleistocene 1565. https://doi.org/10.1126/scienc​ e.1083264 refugia followed by human-influenced dispersal. PLoS One, 7(10), Phillips, S. J., Anderson, R. P., Dudík, M., Schapire, R. E., & Blair, M. E. e47676. https://doi.org/10.1371/journ​al.pone.0047676 (2017). Opening the black box: An open-source release of Maxent. Title, P. O., & Bemmels, J. B. (2018). ENVIREM: An expanded set of bio- Ecography, 40(7), 887–893. https://doi.org/10.1111/ecog.03049 climatic and topographic variables increases flexibility and improves Phillips, S. J., Dudík, M., Elith, J., Graham, C. H., Lehmann, A., Leathwick, performance of ecological niche modeling. Ecography, 41(2), 291– J., & Ferrier, S. (2009). Sample selection bias and presence-only 307. https://doi.org/10.1111/ecog.02880 distribution models: Implications for background and pseudo-ab- Tranchant-Dubreuil, C., Ravel, S., Monat, C., Sarah, G., Diallo, A., Helou, sence data. Ecological Applications, 19(1), 181–197. https://doi. L., & Sabot, F. (2018). TOGGLe, a flexible framework for easily build- org/10.1890/07-2153.1 ing complex workflows and performing robust large-scale NGS anal- Piñeiro, R., Dauby, G., Kaymak, E., & Hardy, O. J. (2017). Pleistocene pop- yses. BioRxiv, , https://doi.org/10.1101/245480. ulation expansions of shade-tolerant trees indicate fragmentation of Trénel, P., Gustafsson, M. H. G., Baker, W. J., Asmussen-Lange, C. B., the African rainforest during the ice ages. Proceedings of the Royal Dransfield, J., & Borchsenius, F. (2007). Mid-Tertiary dispersal, not Society B: Biological Sciences, 284(1866), https://doi.org/10.1098/ Gondwanan vicariance explains distribution patterns in the wax palm rspb.2017.1800 subfamily (Ceroxyloideae: Arecaceae). Molecular Phylogenetics and Prance, G. T. (1982). A review of the Phytogeographic evidences Evolution, 45(1), 272–288. https://doi.org/10.1016/j.ympev.2007.03.018 for Pleistocene climate changes in the Neotropics. Annals of Valavi, R., Elith, J., Lahoz-Monfort, J. J., & Guillera-Arroita, G. (2019). the Missouri Botanical Garden, 69(3), 594–624. https://doi. blockCV: An R package for generating spatially or environmentally org/10.2307/2399085 separated folds for k-fold cross-validation of species distribution R Core Team (2019). R: A Language and Environment for Statistical models. Methods in Ecology and Evolution, 10(2), 225–232. https:// Computing. R Foundation for Statistical Computing. doi.org/10.1111/2041-210X.13107 Rambaut, A., Drummond, A. J., Xie, D., Baele, G., & Suchard, M. A. (2018). Valencia, B. G., Urrego, D. H., Silman, M. R., & Bush, M. B. (2010). From Posterior summarization in Bayesian phylogenetics using Tracer 1.7. ice age to modern: A record of landscape change in an Andean Systematic Biology, 67(5), 901–904. https://doi.org/10.1093/sysbi​o/ cloud forest. Journal of Biogeography, 37(9), 1637–1647. https://doi. syy032 org/10.1111/j.1365-2699.2010.02318.x Ramírez-Barahona, S., & Eguiarte, L. E. (2013). The role of glacial cycles Vasconcellos, M. M., Colli, G. R., Weber, J. N., Ortiz, E. M., Rodrigues, in promoting genetic diversity in the Neotropics: The case of cloud M. T., & Cannatella, D. C. (2019). Isolation by instability: Historical forests during the Last Glacial Maximum. Ecology and Evolution, 3(3), climate change shapes population structure and genomic divergence 725–738. https://doi.org/10.1002/ece3.483 of treefrogs in the Neotropical Cerrado savanna. Molecular Ecology, Reis, C. A., Dias, C., Araripe, J., Aleixo, A., Anciães, M., Sampaio, I., 28(7), 1748–1764. https://doi.org/10.1111/mec.15045 Schneider, H., & Rêgo, P. S. (2020). Multilocus data of a manakin spe- Walker, M. J., Stockman, A. K., Marek, P. E., & Bond, J. E. (2009). Pleistocene cies reveal cryptic diversification moulded by vicariance. Zoologica glacial refugia across the Appalachian Mountains and coastal plain in Scripta, 49(2), 129–144. https://doi.org/10.1111/zsc.12395 the millipede genus Narceus: Evidence from population genetic, phy- Ritchie, A. M., Lo, N., & Ho, S. Y. W. (2017). The impact of the tree prior logeographic, and paleoclimatic data. BMC Evolutionary Biology, 9(1), on molecular dating of data sets containing a mixture of inter- and in- 25. https://doi.org/10.1186/1471-2148-9-25 traspecies sampling. Systematic Biology, 66(3), 413–425. https://doi. Watanabe, S., Hajima, T., Sudo, K., Nagashima, T., Takemura, T., Okajima, org/10.1093/sysbi​o/syw095 H., & Kawamiya, M. (2011). MIROC-ESM 2010: Model description ESCOBAR et al. | 15

and basic results of CMIP5-20c3m experiments. Geoscientific SUPPORTING INFORMATION Model Development, 4(4), 845–872. https://doi.org/10.5194/ Additional supporting information may be found online in the gmd-4-845-2011 Supporting Information section. Zhang, C., Rabiee, M., Sayyari, E., & Mirarab, S. (2018). ASTRAL-III: Polynomial time species tree reconstruction from partially resolved gene trees. BMC Bioinformatics, 19(S6), 153. https://doi.org/10.1186/ s1285​9-018-2129-y How to cite this article: Escobar S, Helmstetter AJ, Jarvie S, Zinck, J. W. R., & Rajora, O. P. (2016). Post-glacial phylogeography and Montúfar R, Balslev H, Couvreur TLP. Pleistocene climatic evolution of a wide-ranging highly-exploited keystone forest tree, fluctuations promoted alternative evolutionary histories in eastern white pine (Pinus strobus) in : Single refugium, multiple routes. BMC Evolutionary Biology, 16(1), 1–17. https://doi. Phytelephas aequatorialis, an endemic palm from western org/10.1186/s1286​2-016-0624-1 Ecuador. J Biogeogr. 2021;00:1–15. https://doi.org/10.1111/ Zizka, A., Silvestro, D., Andermann, T., Azevedo, J., Duarte Ritter, C., jbi.14055 Edler, D., Antonelli, A., Farooq, H., Herdean, A., Ariza, M., Scharn, R., Svantesson, S., Wengström, N., Zizka, V., & Antonelli, A. (2019). CoordinateCleaner: Standardized cleaning of occurrence records from biological collection databases. Methods in Ecology and Evolution, 10(5), 744–751. https://doi.org/10.1111/2041-210X.13152

BIOSKETCH Sebastián Escobar conducted this research as part of his PhD program at the Department of Biology in Aarhus University. He is interested in unravelling the evolutionary history and phyloge- netic relationships of tropical plants to understand the processes that have shaped current patterns of biodiversity.

Author contributions: R.M., H.B., and T.L.P.C conceived the study; S.E., S.J., R.M. and T.L.P.C. collected the data; S.E. performed labo- ratory work; S.E. analysed the data with assistance from A.J.H. and S.J.; S.E. led the writing with contribution from all authors.