Modelling establishment risk for non-indigenous species
using aquarium fish as a case study
Lidia Della Venezia
Department of Biology
McGill University
Montreal, Quebec, Canada
June 2019
A thesis submitted to McGill University in partial fulfillment of the
requirements of the degree of Doctor of Philosophy in Biology
© Lidia Della Venezia, 2019
Table of contents
Dedication ...... v
Acknowledgements ...... vi
Thesis abstract ...... vii
Résumé ...... ix
List of tables ...... xii
List of figures ...... xv
Preface ...... xviii Thesis format and style ...... xviii Contributions of co-authors ...... xix Original contributions to knowledge ...... xx
General introduction ...... 1 Introduction ...... 1 Risk assessment and risk management ...... 2 Non-indigenous species establishment ...... 4 Missing data in ecological datasets ...... 7 Methodological approach ...... 9 Thesis outline ...... 10 References ...... 12
Chapter 1: The rich get richer: invasion risk across North America from the aquarium pathway under climate change ...... 22 1.1 Abstract ...... 23 1.2 Introduction ...... 25 1.3 Methods ...... 28 1.3.1 Model formulation ...... 28 1.3.2 Data and variable choice ...... 29
ii
1.3.3 Model fitting ...... 33 1.4 Results ...... 34 1.5 Discussion ...... 38 1.6 Acknowledgements ...... 43 References ...... 44
Connecting statement ...... 57
Chapter 2: Guiding rapid response to non-indigenous aquarium fish: identifying risk factors for persistent versus casual establishment ...... 58 2.1 Abstract ...... 59 2.2 Introduction ...... 61 2.3 Methods ...... 64 2.3.1 Variable choice ...... 66 2.3.2 Model fitting ...... 67 2.3.3 Multiplicative risk factors ...... 68 2.4 Results ...... 70 2.4.1 Multiplicative risk factors ...... 70 2.4.2 Re-evaluating "risky" species in terms of persistence . . . 72 2.4.3 Comparing establishment sub-stages: casual versus persistent . 73 2.5 Discussion ...... 74 2.6 Acknowledgements ...... 78 References ...... 79
Connecting statement ...... 92
Chapter 3: Filling in FishBase: a more powerful approach to the imputation of missing trait data ...... 93 3.1 Abstract ...... 94 3.2 Introduction ...... 96 3.3 Methods ...... 100 3.3.1 Trait data ...... 100 3.3.2 Comparison of existing imputation methods . . . . 101 3.3.3 Novel imputation protocol: CRU ...... 102
iii
3.3.4 Model averaging and gap-filling ...... 107 3.3.5 Validation procedure ...... 107 3.4 Results ...... 109 3.4.1 Performance of the imputation models . . . . . 109 3.4.2 Validation of the uncertainty estimates . . . . . 110 3.4.3 Filling in FishBase ...... 111 3.5 Discussion ...... 112 3.5.1 Model comparison and ensemble imputation . . . . 112 3.5.2 Uncertainty estimation ...... 113 3.5.3 Caveats ...... 115 3.6 Conclusions ...... 116 3.7 Acknowledgements ...... 117 References ...... 118
General conclusion ...... 132 References ...... 135
Appendices ...... 136 Appendix A: Supplementary material for Chapter 1 . . . . . 137 Appendix B: Supplementary material for Chapter 2 . . . . . 141 Appendix B.1 ...... 142 Appendix B.2 ...... 147 Appendix B.3 ...... 150 Appendix B.4 ...... 153 Appendix B.5 ...... 154 Appendix C: Supplementary material for Chapter 3 . . . . . 156
iv
Dedication
I dedicate this thesis to my entire family, but especially to my parents, Ketty e Livio, you are everything I could ever ask for. To my sister Claudia, my brother Giulio, my nephews Chiara e
Lorenzo, and my dear nonna Noemi, you constantly fill my heart with love and joy. To my best friends Elisabetta, Silvia, Mery and Marina, you always support me and are pure happiness. And to Paolo, your smile brings the best out of me.
v
Acknowledgements
I would like to thank my supervisor Prof. Brian Leung, without whom my work would have not been possible and who has allowed me to grow both academically and personally. I also thank Prof. Gregor Fussmann and Prof. Frédéric Guichard for serving on my supervisory committee and for their guidance, particularly during major direction changes in my research. I am very grateful to my co-author Jason Samson, for providing insightful discussions and strong encouragement. Many thanks are due to my lab mates, Johanna Bradie, Corey Chivers, Kristina
Enciso, Alyssa Gervais, Emma Hudgins, Dat Nguyen, Victoria Reed, Natalie Richards, Anthony
Sardain, Dylan Schneider, and Shriram Varadarajan, for their consistent support and help.
Further, I acknowledge the Canadian Aquatic Invasive Species Network, the Fonds vert of the
Quebec government, McGill University, the National Sciences and Engineering Research
Council, and the Quebec Centre for Biodiversity Science for providing the necessary funding to pursue my PhD research.
I am especially grateful to my closest friends in Montreal, who have seen me through some of the happiest and toughest times of my life. In chronological order, I thank Chiara, Jay, Enrico,
Veronica, Franco, Luca, Estelle, Gianni, Fabrizio, Nunzio, Alessandra, Simone, Salvatore, Raul,
Andrea and Anna. Merci à mes francophones préférés, Céline, Andréa, Mathilde et Mattia.
Finally, my appreciation goes to Mundo Lingo for having been a second home and having allowed me to meet some of the most interesting and charming people I know, the Jarry pool for being my aquatic oasis during the summer, YouTube for always providing the perfect work soundtrack, and my kickboxing instructors and partners for having prevented several breakdowns during the most challenging months of my PhD.
vi
Thesis abstract
Invasive species cause substantial ecological and economic damages. While major modelling improvements have been made in the last decades, predictions are primarily based on single species analyses, look at a single factor at a time (i.e. environmental conditions, species traits, and propagule pressure, individually), and consider quite broad stages (e.g., establishment), which may be usefully resolved into smaller sub-stages. Further, although models arguably provide the most coherent, sophisticated predictions, most quantitative models remain unused in policy-oriented invasive species risk assessments, which largely rely on expert- opinion and simple summation across individual factors believed to influence invasions (i.e. scoring-based approaches). Of course, for quantitative analyses, data limitations typically exist.
To allow quantitative methods to be even more powerful and broadly usable, approaches are needed to alleviate those limitations and optimally use the available information. In this thesis, I advance the field of invasion biology, contributing to each of the three issues identified above.
In Chapter 1, I consider the three main predictors of biological invasions: environment, propagule pressure and species traits, and I integrate these into a coherent multispecies, geographically explicit model. I show the importance of their combination, and forecast that, for the aquarium fish invasion pathway, "the rich get richer" in that the most vulnerable current locations are likely to suffer the greatest increase in new invasions in the future. By employing an integrative approach and a multispecies perspective, this work provides support to decision making for resource managers and policy makers, and a better understanding of non-indigenous species establishment.
In Chapter 2, I recognize that while prevention may be ideal, it is not always achievable,
vii and prioritizing rapid responses is necessary for the effective management of potentially harmful non-indigenous species. To address issues of rapid response, we need to more finely resolve the establishment phase of biological invasions, and determine what happens after a new species has been detected (i.e. whether long-term persistence occurs). In this chapter, I have three objectives:
1) I separate casual (i.e. temporary) establishment and persistence (i.e. lack of subsequent extirpation), using the framework developed in Chapter 1 to generate a multispecies, geographically explicit model to prioritize rapid response efforts; 2) recognizing that mathematical models often remain unused in policy, I convert the model parameters into simple multiplicative risk factors, to facilitate broader adoption outside the modelling field; 3) from a fundamental perspective, I show that propagule pressure is important for casual establishment, but does not help predict subsequent persistence. Species traits are the most important group of predictors for casual establishment, while the environment is most relevant for persistence.
In Chapter 3, I develop an approach to more powerfully use available data (focusing on species traits). From Chapters 1 and 2, it became apparent that about 80% of trait values were missing in the trait database used (FishBase). Thus, I develop a novel, fast, simple method for imputation, based on trait Correlation, taxonomic Relatedness and Uncertainty minimization
(CRU). I compare it against existing cutting edge approaches, and show that CRU was the most accurate. Additionally, I consider and demonstrate that including CRU into an ensemble model combining existing techniques (Phylopars and missForests) yields even greater accuracy. This work fills in a substantial subset of FishBase, but also provides an approach for the imputation of other trait databases. Overall, this thesis advances understanding of ecological processes and informs environmental management while using the best available information.
viii
Résumé
Les espèces envahissantes causent des dommages écologiques et économiques importants.
Bien que des améliorations majeures aient été apportées à la modélisation au cours des dernières décennies, les prédictions reposent principalement sur des analyses par espèce, examinent un facteur à la fois (conditions environnementales, caractéristiques des espèces et pression des propagules, individuellement) et prennent en compte des étapes assez larges (par exemple, l'établissement), qui peuvent être utilement décomposées en sous-étapes plus petites. En outre, bien que les modèles fournissent sans doute les prévisions les plus cohérentes et sophistiquées, la plupart des modèles quantitatifs restent inutilisés dans les évaluations des risques des espèces envahissantes orientées sur l'action, qui reposent largement sur des opinions d’experts et une simple synthèse des facteurs individuels susceptibles d’influencer les invasions (approches fondées sur des scores). Bien sûr, pour les analyses quantitatives, des limitations de données existent généralement. Pour que les approches quantitatives soient encore plus puissantes et largement utilisables, des approches sont nécessaires pour atténuer ces limitations et utiliser de manière optimale les informations disponibles. Dans cette thèse, je fais progresser le domaine de la biologie des invasions, contribuant à chacune des trois questions identifiées ci-dessus.
Au Chapitre 1, je considère les trois principaux facteurs prédictifs des invasions biologiques : environnement, pression propagulaire et traits des espèces, et les intègre dans un modèle cohérent multi-espèce, géographiquement explicite. Je montre l’importance de leur combinaison et prévois que, pour la voie d’invasion par les poissons d’aquarium,
«les riches s’enrichissent», les localités les plus vulnérables étant susceptibles de connaître la plus forte augmentation de nouvelles invasions à l’avenir. En utilisant une approche intégrative
ix et une perspective multi-espèce, ce travail apporte un soutien à la prise de décision pour les gestionnaires de ressources et les décideurs, ainsi qu'une meilleure compréhension de l'établissement d'espèces non indigènes.
Au Chapitre 2, je reconnais que si la prévention peut être idéale, elle n’est pas toujours réalisable et que la priorité donnée aux interventions rapides est nécessaire pour la gestion efficace des espèces non indigènes potentiellement nuisibles. Pour résoudre les problèmes de réponse rapide, nous devons décomposer plus finement la phase d'établissement des invasions biologiques et déterminer ce qui se passe après la détection d'une nouvelle espèce (c'est-à-dire si une persistance à long terme se produit). Dans ce chapitre, j’ai trois objectifs : 1) je sépare l’établissement occasionnel (c’est-à-dire temporaire) et la persistance (c’est-à-dire l’absence d'extirpation ultérieure), en utilisant le cadre développé au Chapitre 1 pour générer un modèle multi-espèces, géographiquement explicite, pour hiérarchiser les efforts d’intervention rapide; 2) reconnaissant que les modèles mathématiques restent souvent inutilisés dans les politiques, je convertis les paramètres du modèle en simples facteurs de risque multiplicatifs, afin de faciliter une adoption plus large en dehors du champ de la modélisation; 3) d'un point de vue fondamental, je montre que la pression propagulaire est importante pour l'établissement occasionnel, mais ne permet pas de prédire la persistance ultérieure. Les traits des espèces constituent le groupe de prédicteurs le plus important pour l'établissement occasionnel, tandis que l'environnement est le plus pertinent pour la persistance.
Au Chapitre 3, je développe une approche pour utiliser plus efficacement les données disponibles (en se concentrant sur les traits des espèces). Dans les Chapitres 1 et 2, il est apparu qu’environ 80% des valeurs de trait manquaient dans la base de données de traits utilisée
(FishBase). Ainsi, je développe une nouvelle méthode d'imputation simple et rapide, fondée sur
x la corrélation de traits, la relation taxonomique et la minimisation d'incertitude (CRU). Je compare cette méthode aux approches de pointe existantes et montre que CRU est la plus précise. De plus, je considère et démontre qu'inclure CRU dans une modèle d'ensemble combinant des techniques existantes (Phylopars et missForests) permet une précision encore plus grande. Ce travail remplit un sous-ensemble substantiel de FishBase mais fournit également une approche pour l'imputation d'autres bases de données de traits. Globalement, cette thèse permet de mieux comprendre les processus écologiques et d’éclairer la gestion de l’environnement tout en utilisant les meilleures informations disponibles.
xi
List of tables
Table 1.1. Results of the forward selection for the joint model (PET) including environmental variables (type BIO), species traits (type TR) and environment-traits interactions (type INT). X̅ indicates the mean, s2 represents the variance, 1st & 2nd denote first and second order terms. The terms included in the final PET model and the corresponding standardized parameter values are indicated with an asterisk, while AIC improvements that were bigger than 2 units in the model selection are denoted by daggers. The last column shows the order of inclusion in the model.
Table 1.2. Top 10 species with the highest likelihood of establishment in the United States currently and in 2050, as predicted by our joint model, along with the state where they pose the highest risk, their propagule pressure, and the values of the traits retained in the PET model as important determinants of establishment, i.e. maximum temperature tolerance (Max T., °C) and maximum length (Max L., cm).
Table 2.1. Species status categories for the aquarium species in our dataset, as described on the
USGS Non-indigenous Aquatic Species website and as categorized in our study (CS/PS).
Table 2.2. AIC, AIC difference from the best model (ΔAIC), AUC, fitted 푐^ parameter and percentage of deviance explained (%dev.exp) by the full casual establishment and persistence model, and their respective submodels. The best model for each dataset is indicated in bold.
xii
Table 2.3. Parameter values for the predictors retained in each sub-stage best model after variable selection. Rank indicates the entry order of each important variable in the corresponding model, either as a first order term only (1st) or including an additional second order term (2nd).
Table 2.4. Selected rules of thumb to quickly quantify risk of casual and persistent establishment. The first column reports the average value of each relevant predictor in the equivalent model, while the other columns identify the variable values corresponding to an OR change of +1000%, +100%, +50%, +25%, -25% and -50%. The means of variables that were significant for both models differ, because the persistence model dataset represents a subset of the casual establishment model dataset (e.g., maximum length).
Table 3.1. Species functional and life-history traits used for the analysis, obtained from
FishBase. The last column indicates the percentage of missing value for each trait in the dataset.
Table 3.2. Measures of variation included in the hierarchical model for uncertainty estimation.
2 Table 3.3. Average R MSE across traits and approaches, obtained by cross-validation. Each row is either a single methodology or an ensemble of methods.
2 Table 3.4. Cross-validation predictive performance (R MSE) of MICE, missForest, Phylopars,
CRU and the best ensemble (missForest-CRU), by trait. For single approaches, the values in bold correspond to the best method for a specific trait, while under the ensemble model, the bold
xiii values indicate where averaging the predictions from CRU and missForest improved the accuracy of the imputation.
xiv
List of figures
Figure 1.1. Distributions of environmental variables (a,b), traits (c,d), interaction term (e) and propagule pressure (f) as included in the model. The plots for the environmental variables and the interaction term also depict the curves representing the distributions of values under current conditions and as forecasted for 2050 in the USA (solid and dashed lines) and in Quebec (dotted and dash-dotted lines). The black dots show the establishments occurred in the USA (see Table
A.2 in Appendix A), and their size is proportional to the number of corresponding species, except for plot (e) and (f), where each dot represents a species-location combination. The interaction term is standardized for simplicity of representation, while propagule pressure values are on a log scale.
Figure 1.2. Average establishment risk across species by state (USA) or administrative region
(Quebec) as predicted by the PET model under current (a) and future (b) climatic conditions.
Darker shades indicate a higher average risk of establishment. Very low probabilities are displayed using scientific notation, e.g. 1E-5 corresponds to one multiplied by 0.00001.
Figure 1.3. Expected numbers of establishments for the United States at highest risk, as predicted by the PET model. Clear grey areas indicate the number of species expected to be established under current conditions, while dark grey denotes the additional species forecasted to establish by the year 2050. Only the states with higher expected numbers of establishments are shown, while the remaining are pooled in a single bar (Rem.).
xv
Figure 1.4. Distribution of the estimated establishment risk in the USA and Quebec. The black dots represent the species that have established in the USA. The solid and the dashed lines correspond to the distributions of establishment probabilities predicted for the USA presently and for 2050 respectively, while the dotted and the dash-dotted lines represent those predicted for
Quebec for the same years. The probability of establishment values are reported on a log scale.
Figure 2.1. Illustrative example of rules of thumb, i.e. odds ratios (OR), derived for two species traits. Each dark dot represents the reference point, i.e. the average trait value across species in the dataset, while each triangular dot corresponds to the species s of interest.
Figure 2.2. Effect of each significant predictor on the likelihood of persistently establishing versus failing, expressed as odds ratio (OR), when gradually varying each predictor. OR equals 1
(dashed line) at each variable's mean value, reported by the corresponding point. The average values for the interaction plot (e) correspond to those of the respective main terms (b,d). The triangles indicate the OR of P. managuensis (traits) and Hawaii (environmental conditions), and their interaction (e). Very high values in (e) coincide with areas of extremely low absolute probability values, so that little probability increases determine very high OR.
Figure 3.1. Possible estimates obtainable for each missing datum using CRU, depending on the amount of information used.
Figure 3.2. P-P plots evaluating each method's performance in estimating uncertainty. The plots compare the observed and the theoretical residuals' percentiles, given the uncertainty estimates
xvi provided by (a) CRU using the HUE algorithm, (b) RMSD, (c) MICE and (d) Phylopars. missForest was excluded as it did not provide an estimate of uncertainty for each imputed value.
The 1:1 line defines the expectation for a perfect match between theoretical and observed percentiles.
Figure 3.3. Average uncertainty by number of species within (a) genus and (b) family.
xvii
Preface
Thesis format and style
This thesis is presented in a manuscript-based format, and consists of three papers. Each chapter focuses on making better use of limited ecological information, with the general scope of improving non-indigenous species establishment risk assessment, using an integrative and more accessible multispecies perspective.
Chapter 1
Della Venezia, L., Samson, J., & Leung, B. (2018). The rich get richer: Invasion risk across
North America from the aquarium pathway under climate change. Diversity and Distributions,
24, 285-296.
Chapter 2
Della Venezia, L., & Leung, B. (under review at Biological Invasions). Guiding rapid response to non-indigenous aquarium fish: identifying risk factors for persistent versus casual establishment.
Chapter 3
Della Venezia, L., & Leung, B. (under review at Ecography). Filling in FishBase: a more powerful approach to the imputation of missing trait data.
xviii
Contribution of co-authors
This thesis is composed of my original work, and I am the primary author for all chapters.
My work has been conducted in close cooperation with my supervisor, Prof. Brian Leung.
Additionally, one chapter has seen the insightful contribution of an additional co-author, who is mentioned below.
Chapter 1: Lidia Della Venezia, Brian Leung and Jason Samson conceived the project. Lidia
Della Venezia and Brian Leung formulated the model, and Lidia Della Venezia led the programming and the analysis of the data, under the consultation of Brian Leung. Lidia Della
Venezia wrote the first draft, and all authors revised the manuscript together, providing corrections and discussing ideas.
Chapter 2: Lidia Della Venezia and Brian Leung conceived the project. Lidia Della Venezia built and analysed the models, and derived the multiplicative risk factors for rapid response, assisted by helpful discussions with Brian Leung. Lidia Della Venezia led the preparation of the manuscript, with input from Brian Leung.
Chapter 3: Lidia Della Venezia and Brian Leung conceived the methodology. Lidia Della
Venezia performed all the statistical analyses, assisted by discussions with Brian Leung. Lidia
Della Venezia wrote the manuscript, with insightful contribution from Brian Leung.
xix
Original contributions to knowledge
In Chapter 1, I present an integrated approach to non-indigenous species establishment risk assessment and I show how incorporating species traits, propagule pressure and local environmental conditions in a pathway-level risk assessment framework considerably improves predictive power. This approach represents the first attempt to incorporate all three categories of predictors in a multispecies framework, allowing both species-specific and geographically explicit predictions. Further, I use this integrative approach to predict non-indigenous species establishments under a climate change scenario for the year 2050. In contrast with other findings in the literature, I demonstrate that climate change might impact more profoundly areas that are already particularly susceptible to non-indigenous species establishment. This methodology is relevant from a management perspective and it is applicable to virtually any introduction pathway.
Chapter 2 presents a multispecies approach to prioritize species and locations, and optimize management decisions in the context of rapid response. I model casual (i.e. temporary) establishment and persistence separately, to identify species that will likely become extirpated after early establishment and those that will manage to persist and potentially cause harm. I then derive practical "rules of thumb" for rapid assessment that can be used to prioritize instances of high risk. I thereby advance invasion ecology in two ways. Firstly, I deepen our understanding of the factors associated with successful establishment and their relative importance. I identify species traits and propagule pressure as most relevant for casual establishment, with the environment being more important for persistence, and propagule pressure having no effect on the latter. Secondly, I provide a quick and effective tool to prioritize the investment of resources
xx for rapid response, while also addressing the need to improve the usability of technical knowledge for policy makers.
Finally, in Chapter 3 I focus on the imputation of missing information in species traits datasets, where lacking data is the norm. Although imputation methods exist, I demonstrate that a novel, relatively simple approach based on trait Correlations, taxonomic Relatedness and
Uncertainty estimation (CRU) can be more effective and provide more reliable error estimates for every single imputed datum, compared to alternative sophisticated tools. Additionally, I explore for the first time the effectiveness of using an ensemble modelling approach to the imputation of missing species traits, and I demonstrate that averaging predictions from CRU and other well-performing methods further improves accuracy. Finally, employing the best multi- model ensemble, I fill in and provide a complete version of a substantial subset of the fish trait database FishBase. This methodology can be applied to other trait datasets, with applications in a variety of ecological fields, such as functional and community ecology, and conservation.
xxi
General introduction
Introduction
Non-indigenous species are species that are encountered outside of the range defined by their natural barriers and dispersal capacities (Mack et al., 2000). Due to increasing economic globalization, over the past few decades the number of non-indigenous species has steadily increased worldwide (Ricciardi, 2007; Seebens et al., 2017), determining what has been called a
"biotic homogenization" (McKinney & Lockwood, 1999; Olden, 2006). In fact, the process by which certain species manage to overcome their natural biogeographical barriers and reach non- native locations is often mediated by human-related activities, and these species continue to be transported around the world both intentionally and unintentionally (Hulme et al., 2008).
While not all non-indigenous species are able to establish self-sustaining populations and most of them do not cause harm (García-Berthou et al., 2005; Ricciardi and Cohen, 2007;
Williamson & Fitter, 1996), some can undergo explosions in population density (Elton, 1958).
Consequently, these species achieve widespread distributions and begin to cause substantial impacts, becoming essentially invasive (Colautti & MacIsaac, 2004). In new habitats, invasive species can interact with the native community and act as competitors or parasites (Lockwood et al., 2013), causing biodiversity loss (Doherty et al., 2016; Millenium Ecosystem Assessment
2005; Sala et al., 2000) and altering ecosystem dynamics (Ehrenfeld, 2010; Pyšek et al., 2012).
Freshwater ecosystems are particularly impacted by non-indigenous species (Lockwood et al.,
2013), which represent one of the major causes of biodiversity reduction, second only to habitat loss (Dextrase & Mandrak, 2006). In fact, freshwater species appear to be five times more prone to extinction than their terrestrial counterparts in North America (Ricciardi & Rasmussen, 1999).
1
In addition, invasive species can cause economic losses for hundreds of billions of dollars, by damaging agriculture, driving the loss of natural resources, affecting recreational activities and threatening human health (Pimentel et al., 2005).
The considerable damages caused by invasive species underlie much of the research that has been devoted to identifying the factors associated with biological invasions, and to understanding how they relate to invasion success, in order to prevent, control, and reduce the impact of harmful species (Byers et al., 2002; Keller et al., 2007; Ricciardi & MacIsaac, 2008).
Risk assessment and risk management
Most of the research currently conducted in the field of invasion ecology can be related to invasive species risk assessments (Lodge et al., 2006; Ricciardi & Rasmussen, 1998; Stohlgren
& Schnase, 2006). Developing tools to predict and evaluate the impact of invasive species is essential to identify effective management strategies and, most importantly, to inform instances where these are needed (Andersen et al., 2004a; Chadès et al., 2011; Kerr et al., 2016). While potentially harmful species should ideally be excluded from intentional introductions in non- native locations, this is hardly feasible without imposing unsustainable restrictions to certain industries (e.g., aquaculture; Naylor et al., 2001). Additionally, many currently invasive species originated from their unintentional movement around the globe, often as accidental hitchhikers of traded goods (Hulme, 2009; Westphal et al., 2008).
Hence, an increasing number of studies have focused on providing sound scientific background to inform management strategies aimed at preventing and reducing impact (Buckley,
2008; Simberloff, 2003). Among the variety of possible management actions, prevention and rapid response have been widely recommended as the most efficient and most cost-effective
(Alvarez & Solis, 2019; Lodge et al., 2006). While prevention is usually more feasible and thus
2 prioritized (Finnoff et al., 2007; Leung et al., 2002), it does not necessarily guarantee success
(Vander Zanden et al., 2010), so that rapid response remains critical (Wittenberg & Cock, 2001).
Moreover, even if many governments worldwide have implemented agendas for the prevention and control of invasive species (McGeoch et al., 2010), the resources invested are often insufficient, if not incorrectly spent (Finoff et al., 2007; Leung et al., 2002). Pinpointing high- risk species and vulnerable ecosystems as priorities for intervention would allow funds to be more efficiently used (Lohr et al., 2017; Papeş et al., 2011) and remains a challenging task of invasion ecology (Stewart-Koster et al., 2015).
One of the main obstacles to the effective management of non-indigenous species is time.
Efficient prevention and control strongly rely on timely responses to new introductions and detections of potentially invasive species (Mehta et al., 2007; Simberloff et al., 2013). However, risk assessments are often performed on a species-by-species basis (Leung et al., 2012), which makes the comparison of several taxa introduced at the same time a daunting task, particularly given the increasing number of organisms moved around the world. This has led to a call for multispecies risk assessments, where tens or even hundreds of species can be assessed simultaneously. Recently, a few multispecies frameworks have been developed successfully
(e.g., Chapman et al., 2017; Singh et al., 2015), often focusing on species belonging to the same invasion pathway (e.g., Bradie & Leung, 2015). For instance, a pathway that has received increasing attention is the aquarium trade, responsible for the introduction and the establishment of several species around the globe (Duggan, 2010; Rixon et al., 2005), including a third of the
100 worst invasive fish (Padilla & Williams, 2004). Pathway-based tools allow to simultaneously estimate risk for species belonging to the same pathway of introduction, shortening the time necessary for preliminary assessments.
3
Another essential factor for efficient risk assessments is the need to evaluate risk in a spatially explicit fashion. In fact, even non-indigenous species that can eventually cause damage are usually able to establish and become harmful only in certain suitable locations (Ricciardi et al., 2013; Williamson & Fitter, 1996). Risk assessment tools that help prioritize not only species but also geographical areas would improve the accuracy of predictions (Andersen et al., 2004b).
Although attempts have already been made over the past decades (e.g., Giljohann et al., 2011;
Pitt et al., 2009; Rouget et al., 2002), spatially explicit frameworks should be used more extensively.
Finally, while quantitative risk assessments would be more rigorous than qualitative and scoring approaches, the latter are often the frameworks used by policy makers (Cook et al.,
2010). This is due mainly to two reasons. Firstly, quantitative tools for risk assessment have started to be developed only in recent years, especially for multiple species, so that expert opinion continues to be preferred (Leung et al., 2012). Secondly, even when sophisticated quantitative tools exist, they often require substantial technical skills. Therefore, researchers should guarantee the usability of technical knowledge also to non-specialists (Cassey et al.,
2018a).
Non-indigenous species establishment
Biological invasions can be described as series of steps, each representing a potential target for management (e.g., transport, introduction, establishment, spread; Blackburn et al., 2011).
However, most studies have focused on the establishment phase (Leung et al., 2012), i.e. the process by which a non-indigenous species founds a self-sustaining population in a novel location (Lockwood et al., 2013). In fact, the earlier stages of invasions are considered critical,
4 since they generally guarantee a higher likelihood of successful management at lower costs
(Hulme, 2006; Puth & Post, 2005; Rejmánek & Pitcairn, 2002).
Among the determinants of successful establishment, the environmental conditions of the receiving location, and climate matching in particular, are essential (e.g., Duncan et al., 2014;
Hayes & Barry, 2008; Mahoney et al., 2015), and relate to the concept of niche, by which a species' spatial distribution is limited by a multivariate set of environmental variables within which the species can maintain a self-sustaining population (Jiménez-Valverde et al., 2011). This concept is at the base of species distribution models (SDM), which have been increasingly used to predict where a species can establish based on local environmental features (e.g., Ficetola et al., 2007; Gallien et al., 2010; Peterson, 2003). Another critical factor influencing establishment success is propagule pressure, i.e. the number of introduced individuals (Lockwood et al., 2005;
2009), which has been widely recognized as a factor of primary importance (Cassey et al.,
2018b; Colautti et al., 2006; Simberloff, 2009) and has been incorporated in SDMs (e.g., Leung
& Mandrak, 2007) to improve predictions. Analogously, species traits have also been identified as important determinants of successful establishments, including, among others, temperature tolerances, size and trophic level (Kolar & Lodge, 2002; Pyšek et al., 2009; Van Kleunen et al.,
2010). Given their relevance, integrating these different classes of predictors into a unified risk assessment model ideally should increase the accuracy of predictions, in comparison with studies that analyze each component individually, and it should avoid redundancy and improve efficiency (Leung et al., 2012). In addition, it might shed light on the relative importance of each factor in the successful establishment of non-indigenous species and provide insight into the processes underlying one of the earlier stages of invasions. In fact, although these predictors are known to be important, their role during establishment has not yet been fully understood
5
(Dawson et al., 2009; Essl et al., 2015; Milbau & Stout, 2008). Finally, incorporating information from each of these classes of variables would have the further advantage of allowing predictions that are both species-specific and location-specific and could potentially be used to tailor management actions, for example by reducing the number of individuals displaced (i.e. propagule pressure), or restricting species based on certain traits or only in specific locations.
Ideally, integrating information about environmental conditions would additionally permit to account for the effect of alternative drivers of global change, which have only recently commenced to be studied in conjunction with invasions (Brook et al., 2008; Didham et al.,
2007). Climate change in particular appears to have opened up several novel opportunities for invaders around the world (Walther et al., 2009), for example making temperate areas, normally considered too cold, accessible for sub-tropical and tropical species (Hellmann et al., 2008;
Rahel & Olden, 2008), potentially exacerbating the impact of invasions.
Unfortunately, the characterization of non-indigenous species establishment is often hampered by the limited availability of adequate information. Data can be missing for any predictor variable, and proxies might be required as surrogate when more detailed information is lacking (e.g., Eschtruth & Battles, 2011; Cook et al., 2019). Even species occurrences are often difficult to categorize. Simple detections of a species in the wild are frequently treated as actual establishment. However, the majority of non-indigenous species detected in the wild later becomes extirpated without anthropic interventions (Blackburn et al., 2011; Williamson & Fitter,
1996). Overlooking these failed establishments can in turn overestimate risk and misestimate which species pose a real threat (Zenni & Nuñez, 2013), with important consequences for management. Arguably, separating establishment into sub-stages and characterizing the species that manage to establish only temporarily from those that manage to persist in the long term
6 would be advantageous from both a theoretical and an applied point of view, improving our understanding of the factors determining successful invasions and avoiding misspending resources on species that would likely become extinct irrespectively of management actions.
Missing data in ecological datasets
Lack of information is an important impediment to the study of invasions, as much as to other branches of ecology (Nakagawa & Freckleton, 2008). This has become even more apparent due to the growing availability of big-scale datasets. The use of data-intensive approaches represents an extraordinary opportunity to address ecological questions that would have been unthinkable a few decades ago (Kelling et al., 2009; Luo et al., 2011), including the aforementioned pathway-level risk assessments for invasive species. Nonetheless, extensive databases are virtually always incomplete (Allison, 2002; Horton & Kleinman, 2007) and consequently part of the information they contain is often considered unusable (Nakagawa &
Freckleton, 2008). It suffices to think that, for example, many regression-based methods provided in a number of statistical programming languages automatically discard incomplete rows of data and restrict the analysis to the so-called "complete cases" (e.g., R; R Core Team,
2018). Alternatively, in-depth examination is required to evaluate how to best use patchy data, and decisions often depend on the specific case (Jones, 1996).
Classic examples of databases with very high levels of missingness are global species trait repositories, such as FishBase for fish (Froese & Pauly, 2018) and TRY for plants (Kattge et al.,
2011). Complete versions of these datasets would allow trait information to be used more efficiently. As a matter of fact, species traits represent versatile tools serving several purposes, including estimating and monitoring biodiversity (Tilman, 2001; Vandewalle et al., 2010), understanding the mechanisms behind ecosystem services (Lavorel et al., 2011), assessing risk
7 of invasions and extinctions (Liu et al., 2017) and informing conservation (Cadotte et al., 2011).
Nevertheless, trait-based metrics tend to be quite sensitive to missing data (Májeková et al.,
2016; Pakeman, 2014).
Until complete information is collected, a solution to deal with missing data is imputation, i.e. the replacement of missing values with best estimates. Reasonable replacement data can be obtained based on different mechanisms (Penone et al., 2014), from simple options like using the average or median of existing values (e.g., Nakagawa et al., 2001), to choices based on ecological hypotheses (Taugourdeau et al., 2014), to more sophisticated methods based on regression or phylogenetics (e.g., Goolsby et al., 2017; Stekhoven & Bühlmann, 2011; van
Buuren & Groothuis-Oudshoorn, 2011). Although imputation tends to outperform complete-case analyses (Nakagawa & Freckleton, 2008; van der Heijden et al., 2006), it is used relatively rarely in ecology (Nakagawa, 2015). Notably, when missing values in trait databases are replaced with plausible ones, alternative methods tend to perform differently depending on the specific trait and on the missingness rate (e.g., Poyatos et al., 2018), and not all existing methods produce a measure of the reliability (i.e. uncertainty) of their predictions (Penone et al., 2014). Thus, the choice of the appropriate method remains an open question.
Cases in which alternative methods perform differently have sometimes been tackled using a so-called ensemble modelling approach (Bates & Granger, 1969; Clemen, 1989; Palm &
Zellner, 1992; Winkler, 1989). Essentially, multiple models with desirable characteristics are combined to obtain ameliorated predictions, based on the assumption that the noise from each model would "cancel out", while a reliable signal would emerge more clearly (Bates & Granger,
1969). Ecologists have successfully used ensemble models (e.g., SDMs; Araújo & New, 2007;
Guo et al., 2015). Considering that each imputation methodology presents advantages and
8 disadvantages, averaging predictions from multiple approaches in an ensemble modelling perspective might be an avenue worth exploring to improve missing value forecasts. This would also entail the development of additional imputation algorithms that further ameliorate predictions, while accounting for uncertainty.
Methodological approach
In this thesis, I essentially adopt an ecological modelling approach. Statistical models are extensively used in ecology, where they mainly serve two purposes: inference and prediction.
Here, I predominantly focus on the predictive side of models. Nonetheless, in Chapter 1 and 2, ecological modelling and model comparison are also used to derive insight into the relevance of certain categories of biological predictors to the study of non-indigenous species establishment, and into the extent to which such predictors contribute to success or failure.
More specifically, I adapt existing modelling approaches to novel needs and modify them to incorporate additional predictors, to improve the accuracy of forecasts. From an ecological perspective, I concentrate on generating frameworks that can be applied to several species simultaneously and that can translate different sources of information into species-specific and location-specific predictions. To this aim, integration is essential to this thesis on several levels, and it includes combining different modelling approaches, diverse types of predictors, alternative drivers of environmental change, and predictions from different algorithms.
While ecological modelling represents a powerful tool for research, I recognize the importance of making sure that the rationale of the statistical approaches used is supported by ecological hypotheses and that the results are analyzed considering the underlying biological mechanisms. Therefore, the assumptions of each model are acknowledged throughout this work, and the limitations are addressed whenever possible.
9
Thesis outline
In this thesis, I address the limitations identified above. Specifically, I investigate modelling approaches to improve the understanding and management of non-indigenous species establishment, and examine the optimal use of limited ecological information.
In Chapter 1, I extend an existing modelling framework for pathway-level risk assessment of non-indigenous species establishment (Bradie & Leung, 2015) which included propagule pressure and species traits data. Focusing on freshwater fish species introduced in the USA via the aquarium pathway, I additionally incorporate information on environmental conditions and trait-environment interactions, to obtain geographically explicit, species-specific predictions.
Including the interaction terms was also necessary to simultaneously tackle the so-called 4th corner problem (Legendre et al., 1997), which aims at understanding the role played by traits in the way a species interacts with the surrounding environment. Through the inclusion of environmental information, I evaluated the effect of a climate change scenario forecasted for the year 2050 on the establishment likelihood of aquarium fish, which unveiled the need to prioritize management actions in the southernmost regions of the country for this introduction pathway.
In Chapter 2, I provide tools for rapid response to non-indigenous species detections.
Given that most non-indigenous species encountered in the wild later become extinct without intervention, I model casual (i.e. temporary) and persistent establishment using the framework developed in Chapter 1, to serve two objectives. Firstly, I derive simple "rules of thumb" for rapid assessment that will help efficiently define management priorities, by differentiating instances in which action is needed from those in which a non-indigenous species will likely go extinct without intervention. Secondly, I better characterize the establishment phase by identifying the relevant predictors of each sub-stage and their relative importance, showing that
10 while species traits and propagule pressure are most influential for achieving initial, casual establishment, the environment is critical for persistence, on which instead propagule pressure has no effect.
Finally, I address the issue of the high amount of missing data in species trait databases, which was encountered in Chapter 1 and 2. Concretely, in Chapter 3 I develop a novel imputation technique that uses information from trait Correlations, taxonomic Relatedness and
Uncertainty minimization (CRU). Uncertainty is estimated using a novel algorithm that provides error measures for each imputed datum. Using a validation approach on data available for 20 functional traits across more than 30,000 species from the FishBase database (Froese & Pauly,
2018), I demonstrate that, despite its relative simplicity, CRU performs better than more sophisticated approaches. Further, I show that the algorithm for uncertainty estimation predicts the error (i.e. the deviation between true and predicted values) better than the methods provided by other imputation approaches. Finally, I investigate the use of an ensemble modelling approach to missing data imputation, and I demonstrate that averaging predictions from alternative methods that perform differently depending on the specific trait increases the overall accuracy of imputation. Then, I use the best ensemble of imputation models to generate a complete version of a substantial subset of the FishBase dataset.
Overall, this thesis contributes a better understanding of one of the fundamental phases of biological invasions, along with tools to facilitate management practices, to prioritize resources, and to integrate information to make the best use of the available data.
11
References
Allison, P. D. (2002). Missing data. Thousand Oaks, CA: Sage Publications.
Alvarez, S., & Solis, D. (2019). Rapid Response Lowers Eradication Costs of Invasive Species: Evidence from Florida. Choices, 33, 1.
Andersen, M. C., Adams, H., Hope, B., & Powell, M. (2004a). Risk assessment for invasive species. Risk Analysis: An International Journal, 24, 787-793.
Andersen, M. C., Adams, H., Hope, B., & Powell, M. (2004b). Risk analysis for invasive species: general framework and research needs. Risk Analysis: An International Journal, 24, 893-900.
Araújo, M. B., & New, M. (2007). Ensemble forecasting of species distributions. Trends in ecology & evolution, 22, 42-47.
Bates, J. M., & Granger, C. W. (1969). The combination of forecasts. Journal of the Operational Research Society, 20, 451-468.
Blackburn, T. M., Pyšek, P., Bacher, S., Carlton, J. T., Duncan, R. P., Jarošík, V., ... & Richardson, D. M. (2011). A proposed unified framework for biological invasions. Trends in ecology & evolution, 26, 333-339.
Bradie, J., & Leung, B. (2015). Pathway‐level models to predict non‐indigenous species establishment using propagule pressure, environmental tolerance and trait data. Journal of applied ecology, 52, 100-109.
Brook, B. W., Sodhi, N. S., & Bradshaw, C. J. (2008). Synergies among extinction drivers under global change. Trends in ecology & evolution, 23, 453-460.
Buckley, Y. M. (2008). The role of research for integrated management of invasive species, invaded landscapes and communities. Journal of Applied Ecology, 45, 397-402.
Byers, J. E., Reichard, S., Randall, J. M., Parker, I. M., Smith, C. S., Lonsdale, W. M., ... & Hayes, D. (2002). Directing research to reduce the impacts of nonindigenous species. Conservation Biology, 16, 630-640.
Cadotte, M. W., Carscadden, K., & Mirotchnick, N. (2011). Beyond species: functional diversity and the maintenance of ecological processes and services. Journal of applied ecology, 48, 1079-1087.
Cassey, P., Delean, S., Lockwood, J. L., Sadowski, J., & Blackburn, T. M. (2018b). Dissecting the null model for biological invasions: A meta-analysis of the propagule pressure effect. PLoS biology, 16, e2005987.
12
Cassey, P., García-Díaz, P., Lockwood, J. L., & Blackburn, T. M. (2018a). Invasion Biology: Searching for Predictions and Prevention, and Avoiding Lost Causes. Invasion Biology: Hypotheses and Evidence, 1.
Chadès, I., Martin, T. G., Nicol, S., Burgman, M. A., Possingham, H. P., & Buckley, Y. M. (2011). General rules for managing and surveying networks of pests, diseases, and endangered species. Proceedings of the National Academy of Sciences, 108, 8323-8328.
Chapman, D., Purse, B. V., Roy, H. E., & Bullock, J. M. (2017). Global trade networks determine the distribution of invasive non‐native species. Global Ecology and Biogeography, 26, 907-917.
Clemen, R. T. (1989). Combining forecasts: A review and annotated bibliography. International journal of forecasting, 5, 559-583.
Colautti, R. I., Grigorovich, I. A., & MacIsaac, H. J. (2006). Propagule pressure: a null model for biological invasions. Biological Invasions, 8, 1023-1037.
Colautti, R. I., & MacIsaac, H. J. (2004). A neutral terminology to define ‘invasive’ species. Diversity and distributions, 10, 135-141.
Cook, C. N., Hockings, M., & Carter, R. B. (2010). Conservation in the dark? The information used to support management decisions. Frontiers in Ecology and the Environment, 8, 181-186.
Cook, G., Jarnevich, C., Warden, M., Downing, M., Withrow, J., & Leinwand, I. (2019). Iterative Models for Early Detection of Invasive Species across Spread Pathways. Forests, 10, 108.
Dawson, W., Burslem, D. F., & Hulme, P. E. (2009). Factors explaining alien plant invasion success in a tropical ecosystem differ at each stage of invasion. Journal of Ecology, 97, 657- 665.
Dextrase, A. J., & Mandrak, N. E. (2006). Impacts of alien invasive species on freshwater fauna at risk in Canada. Biological Invasions, 8, 13-24.
Didham, R. K., Tylianakis, J. M., Gemmell, N. J., Rand, T. A., & Ewers, R. M. (2007). Interactive effects of habitat modification and species invasion on native species decline. Trends in ecology & evolution, 22, 489-496.
Doherty, T. S., Glen, A. S., Nimmo, D. G., Ritchie, E. G., & Dickman, C. R. (2016). Invasive predators and global biodiversity loss. Proceedings of the National Academy of Sciences, 113, 11261-11265.
Dormann, C. F., Calabrese, J. M., Guillera‐Arroita, G., Matechou, E., Bahn, V., Bartoń, K., ... & Guelat, J. (2018). Model averaging in ecology: a review of Bayesian, information‐theoretic, and tactical approaches for predictive inference. Ecological Monographs.
13
Duggan, I. C. (2010). The freshwater aquarium trade as a vector for incidental invertebrate fauna. Biological invasions, 12, 3757-3770.
Elton, C.S. (1958). The ecology of invasions by animals and plants. Methuen, London, United Kingdom.
Duncan, R. P., Blackburn, T. M., Rossinelli, S., & Bacher, S. (2014). Quantifying invasion risk: the relationship between establishment probability and founding population size. Methods in Ecology and Evolution, 5, 1255-1263.
Ehrenfeld, J. G. (2010). Ecosystem consequences of biological invasions. Annual review of ecology, evolution, and systematics, 41, 59-80.
Eschtruth, A. K., & Battles, J. J. (2011). The importance of quantifying propagule pressure to understand invasion: an examination of riparian forest invasibility. Ecology, 92, 1314-1322.
Essl, F., Dullinger, S., Moser, D., Steinbauer, K., & Mang, T. (2015). Macroecology of global bryophyte invasions at different invasion stages. Ecography, 38, 488-498.
Ficetola, G. F., Thuiller, W., & Miaud, C. (2007). Prediction and validation of the potential global distribution of a problematic alien invasive species—the American bullfrog. Diversity and distributions, 13, 476-485.
Finnoff, D., Shogren, J. F., Leung, B., & Lodge, D. (2007). Take a risk: preferring prevention over control of biological invaders. Ecological Economics, 62, 216-222.
Froese, R. and D. Pauly. Editors. 2018. FishBase. World Wide Web electronic publication: www.fishbase.org.
Gallien, L., Münkemüller, T., Albert, C. H., Boulangeat, I., & Thuiller, W. (2010). Predicting potential distributions of invasive species: where to go from here?. Diversity and Distributions, 16, 331-342.
García-Berthou, E., Alcaraz, C., Pou-Rovira, Q., Zamora, L., Coenders, G., & Feo, C. (2005). Introduction pathways and establishment rates of invasive aquatic species in Europe. Canadian Journal of Fisheries and Aquatic Sciences, 62, 453-463.
Giljohann, K. M., Hauser, C. E., Williams, N. S., & Moore, J. L. (2011). Optimizing invasive species control across space: willow invasion management in the Australian Alps. Journal of Applied Ecology, 48, 1286-1294.
Goolsby, E. W., Bruggeman, J., & Ané, C. (2017). Rphylopars: fast multivariate phylogenetic comparative methods for missing data and within‐species variation. Methods in Ecology and Evolution, 8, 22-27.
14
Guo, C., Lek, S., Ye, S., Li, W., Liu, J., & Li, Z. (2015). Uncertainty in ensemble modelling of large-scale species distribution: effects from species characteristics and model techniques. Ecological modelling, 306, 67-75.
Hayes, K. R., & Barry, S. C. (2008). Are there any consistent predictors of invasion success?. Biological invasions, 10, 483-506.
Hellmann, J. J., Byers, J. E., Bierwagen, B. G., & Dukes, J. S. (2008). Five potential consequences of climate change for invasive species. Conservation biology, 22, 534-543.
Horton, N. J., & Kleinman, K. P. (2007). Much ado about nothing: A comparison of missing data methods and software to fit incomplete data regression models. The American Statistician, 61, 79-90.
Hulme, P. E. (2006). Beyond control: wider implications for the management of biological invasions. Journal of Applied Ecology, 43, 835-847.
Hulme, P. E. (2009). Trade, transport and trouble: managing invasive species pathways in an era of globalization. Journal of applied ecology, 46, 10-18.
Hulme, P. E., Bacher, S., Kenis, M., Klotz, S., Kühn, I., Minchin, D., ... & Pyšek, P. (2008). Grasping at the routes of biological invasions: a framework for integrating pathways into policy. Journal of Applied Ecology, 45, 403-414.
Jiménez-Valverde, A., Peterson, A. T., Soberón, J., Overton, J. M., Aragón, P., & Lobo, J. M. (2011). Use of niche models in invasive species risk assessments. Biological invasions, 13, 2785-2797.
Jones, M. P. (1996). Indicator and stratification methods for missing explanatory variables in multiple linear regression. Journal of the American statistical association, 91, 222-230.
Kattge, J., Diaz, S., Lavorel, S., Prentice, I. C., Leadley, P., Bönisch, G., ... & Cornelissen, J. H. C. (2011). TRY–a global database of plant traits. Global change biology, 17, 2905-2935.
Keller, R. P., Lodge, D. M., & Finnoff, D. C. (2007). Risk assessment for invasive species produces net bioeconomic benefits. Proceedings of the National Academy of Sciences, 104, 203-207.
Kelling, S., Hochachka, W. M., Fink, D., Riedewald, M., Caruana, R., Ballard, G., & Hooker, G. (2009). Data-intensive science: a new paradigm for biodiversity studies. BioScience, 59, 613- 620.
Kerr, N. Z., Baxter, P. W., Salguero‐Gómez, R., Wardle, G. M., & Buckley, Y. M. (2016). Prioritizing management actions for invasive populations using cost, efficacy, demography and expert opinion for 14 plant species world‐wide. Journal of applied ecology, 53, 305-316.
15
Kolar, C. S., & Lodge, D. M. (2002). Ecological predictions and risk assessment for alien fishes in North America. Science, 298, 1233-1236.
Lavorel, S., Grigulis, K., Lamarque, P., Colace, M. P., Garden, D., Girel, J., ... & Douzet, R. (2011). Using plant functional traits to understand the landscape distribution of multiple ecosystem services. Journal of Ecology, 99, 135-147.
Legendre, P., Galzin, R., & Harmelin-Vivien, M. L. (1997). Relating behavior to habitat: solutions to the fourth-corner problem. Ecology, 78, 547-562.
Leung, B., Lodge, D. M., Finnoff, D., Shogren, J. F., Lewis, M. A., & Lamberti, G. (2002). An ounce of prevention or a pound of cure: bioeconomic risk analysis of invasive species. Proceedings of the Royal Society of London B: Biological Sciences, 269, 2407-2413.
Leung, B., & Mandrak, N. E. (2007). The risk of establishment of aquatic invasive species: joining invasibility and propagule pressure. Proceedings of the Royal Society of London B: Biological Sciences, 274, 2603-2609.
Leung, B., Roura‐Pascual, N., Bacher, S., Heikkilä, J., Brotons, L., Burgman, M. A., ... & Sol, D. (2012). TEASIng apart alien species risk assessments: a framework for best practices. Ecology Letters, 15, 1475-1493.
Liu, C., Comte, L., & Olden, J. D. (2017). Heads you win, tails you lose: Life‐history traits predict invasion and extinction risk of the world's freshwater fishes. Aquatic Conservation: Marine and Freshwater Ecosystems, 27, 773-779.
Lockwood, J. L., Cassey, P., & Blackburn, T. (2005). The role of propagule pressure in explaining species invasions. Trends in Ecology & Evolution, 20, 223-228.
Lockwood, J. L., Cassey, P., & Blackburn, T. M. (2009). The more you introduce the more you get: the role of colonization pressure and propagule pressure in invasion ecology. Diversity and Distributions, 15, 904-910.
Lockwood, J. L., Hoopes, M. F., & Marchetti, M. P. (2013). Invasion ecology. John Wiley & Sons.
Lodge, D. M., Williams, S., MacIsaac, H. J., Hayes, K. R., Leung, B., Reichard, S., ... & Carlton, J. T. (2006). Biological invasions: recommendations for US policy and management. Ecological applications, 16, 2035-2054.
Lohr, C. A., Hone, J., Bode, M., Dickman, C. R., Wenger, A., & Pressey, R. L. (2017). Modeling dynamics of native and invasive species to guide prioritization of management actions. Ecosphere, 8.
16
Luo, Y., Ogle, K., Tucker, C., Fei, S., Gao, C., LaDeau, S., ... & Schimel, D. S. (2011). Ecological forecasting and data assimilation in a data‐rich era. Ecological Applications, 21, 1429-1442.
Mack, R. N., Simberloff, D., Mark Lonsdale, W., Evans, H., Clout, M., & Bazzaz, F. A. (2000). Biotic invasions: causes, epidemiology, global consequences, and control. Ecological applications, 10, 689-710.
Mahoney, P. J., Beard, K. H., Durso, A. M., Tallian, A. G., Long, A. L., Kindermann, R. J., ... & Mohn, H. E. (2015). Introduction effort, climate matching and species traits as predictors of global establishment success in non‐native reptiles. Diversity and Distributions, 21, 64-74.
Májeková, M., Paal, T., Plowman, N. S., Bryndová, M., Kasari, L., Norberg, A., ... & Le Bagousse-Pinguet, Y. (2016). Evaluating functional diversity: missing trait data and the importance of species abundance structure and data transformation. PloS one, 11, e0149270.
McGeoch, M. A., Butchart, S. H., Spear, D., Marais, E., Kleynhans, E. J., Symes, A., ... & Hoffmann, M. (2010). Global indicators of biological invasion: species numbers, biodiversity impact and policy responses. Diversity and Distributions, 16, 95-108.
McKinney, M. L., & Lockwood, J. L. (1999). Biotic homogenization: a few winners replacing many losers in the next mass extinction. Trends in ecology & evolution, 14, 450-453.
Mehta, S. V., Haight, R. G., Homans, F. R., Polasky, S., & Venette, R. C. (2007). Optimal detection and control strategies for invasive species management. Ecological Economics, 61, 237-245.
Milbau, A., & Stout, J. C. (2008). Factors associated with alien plants transitioning from casual, to naturalized, to invasive. Conservation Biology, 22, 308-317.
Millennium Ecosystem Assessment (MA) (2005). Ecosystems and Human Well-being: Current State and Trends. Volume 1, Island Press, Washington, DC.
Nakagawa, S. (2015). Missing data: mechanisms, methods and messages. Ecological statistics: Contemporary theory and application, 81-105.
Nakagawa, S., & Freckleton, R. P. (2008). Missing inaction: the dangers of ignoring missing data. Trends in Ecology & Evolution, 23, 592-596.
Nakagawa, S., Waas, J. R., & Miyazaki, M. (2001). Heart rate changes reveal that little blue penguin chicks (Eudyptula minor) can use vocal signatures to discriminate familiar from unfamiliar chicks. Behavioral Ecology and Sociobiology, 50, 180-188.
Naylor R.L., Williams S.L., & Strong D.R. (2001). Aquaculture - a gateway for exotic species. Science, 294, 1655-1656.
17
Olden, J. D. (2006). Biotic homogenization: a new research agenda for conservation biogeography. Journal of Biogeography, 33, 2027-2039.
Padilla, D. K., & Williams, S. L. (2004). Beyond ballast water: aquarium and ornamental trades as sources of invasive species in aquatic ecosystems. Frontiers in Ecology and the Environment, 2, 131-138.
Pakeman, R. J. (2014). Functional trait metrics are sensitive to the completeness of the species' trait data?. Methods in Ecology and Evolution, 5, 9-15.
Palm, F. C., & Zellner, A. (1992). To combine or not to combine? Issues of combining forecasts. Journal of Forecasting, 11, 687-701.
Papeş, M., Sällström, M., Asplund, T. R., & Vander Zanden, M. J. (2011). Invasive species research to meet the needs of resource management and planning. Conservation Biology, 25, 867-872.
Penone, C., Davidson, A. D., Shoemaker, K. T., Di Marco, M., Rondinini, C., Brooks, T. M., ... & Costa, G. C. (2014). Imputation of missing data in life‐history trait datasets: which approach performs the best?. Methods in Ecology and Evolution, 5, 961-970.
Peterson, A. T. (2003). Predicting the geography of species’ invasions via ecological niche modeling. The quarterly review of biology, 78, 419-433.
Pimentel, D., Zuniga, R., & Morrison, D. (2005). Update on the environmental and economic costs associated with alien-invasive species in the United States. Ecological economics, 52, 273-288.
Pitt, J. P., Worner, S. P., & Suarez, A. V. (2009). Predicting Argentine ant spread over the heterogeneous landscape using a spatially explicit stochastic model. Ecological Applications, 19, 1176-1186.
Poyatos, R., Sus, O., Badiella, L., Mencuccini, M., & Martínez-Vilalta, J. (2018). Gap-filling a spatially explicit plant trait database: comparing imputation methods and different levels of environmental information. Biogeosciences, 15, 2601-2617.
Puth, L. M., & Post, D. M. (2005). Studying invasion: have we missed the boat?. Ecology letters, 8, 715-721.
Pyšek, P., Jarošík, V., Hulme, P. E., Pergl, J., Hejda, M., Schaffner, U., & Vilà, M. (2012). A global assessment of invasive plant impacts on resident species, communities and ecosystems: the interaction of impact measures, invading species' traits and environment. Global Change Biology, 18, 1725-1737.
18
Pyšek, P., Jarošík, V., Pergl, J., Randall, R., Chytrý, M., Kühn, I., ... & Sádlo, J. (2009). The global invasion success of Central European plants is related to distribution characteristics in their native range and species traits. Diversity and Distributions, 15, 891-903.
R Core Team (2018). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
Rahel, F. J., & Olden, J. D. (2008). Assessing the effects of climate change on aquatic invasive species. Conservation biology, 22, 521-533.
Rejmánek, M., & Pitcairn, M. J. (2002). When is eradication of exotic pest plants a realistic goal. Turning the tide: the eradication of invasive species, 249-253.
Ricciardi, A. (2007). Are modern biological invasions an unprecedented form of global change?. Conservation Biology, 21, 329-336.
Ricciardi, A., & Cohen, J. (2007). The invasiveness of an introduced species does not predict its impact. Biological Invasions, 9, 309-315.
Ricciardi, A., Hoopes, M. F., Marchetti, M. P., & Lockwood, J. L. (2013). Progress toward understanding the ecological impacts of nonnative species. Ecological Monographs, 83, 263- 282.
Ricciardi, A., & MacIsaac, H. J. (2008). In Retrospect: The book that began invasion ecology. Nature, 452, 34.
Ricciardi, A., & Rasmussen, J. B. (1998). Predicting the identity and impact of future biological invaders: a priority for aquatic resource management. Canadian Journal of Fisheries and Aquatic Sciences, 55, 1759-1765.
Ricciardi, A., & Rasmussen, J. B. (1999). Extinction rates of North American freshwater fauna. Conservation biology, 13, 1220-1222.
Rixon, C. A., Duggan, I. C., Bergeron, N. M., Ricciardi, A., & Macisaac, H. J. (2005). Invasion risks posed by the aquarium trade and live fish markets on the Laurentian Great Lakes. Biodiversity & Conservation, 14, 1365-1381.
Rouget, M., Richardson, D. M., Nel, J. L., & Van Wilgen, B. W. (2002). Commercially important trees as invasive aliens–towards spatially explicit risk assessment at a national scale. Biological Invasions, 4, 397-412.
Sala, O. E., Chapin, F. S., Armesto, J. J., Berlow, E., Bloomfield, J., Dirzo, R., ... & Leemans, R. (2000). Global biodiversity scenarios for the year 2100. science, 287, 1770-1774.
19
Seebens, H., Blackburn, T. M., Dyer, E. E., Genovesi, P., Hulme, P. E., Jeschke, J. M., ... & Bacher, S. (2017). No saturation in the accumulation of alien species worldwide. Nature communications, 8, 14435.
Simberloff, D. (2003). How much information on population biology is needed to manage introduced species?. Conservation Biology, 17, 83-92.
Simberloff, D. (2009). The role of propagule pressure in biological invasions. Annual Review of Ecology, Evolution, and Systematics, 40, 81-102.
Simberloff, D., Martin, J. L., Genovesi, P., Maris, V., Wardle, D. A., Aronson, J., ... & Pyšek, P. (2013). Impacts of biological invasions: what's what and the way forward. Trends in ecology & evolution, 28, 58-66.
Singh, S. K., Ash, G. J., & Hodda, M. (2015). Keeping ‘one step ahead’of invasive species: using an integrated framework to screen and target species for detailed biosecurity risk assessment. Biological invasions, 17, 1069-1086.
Stekhoven, D. J., & Bühlmann, P. (2011). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28, 112-118.
Stewart‐Koster, B., Olden, J. D., & Johnson, P. T. (2015). Integrating landscape connectivity and habitat suitability to guide offensive and defensive invasive species management. Journal of Applied Ecology, 52, 366-378.
Stohlgren, T. J., & Schnase, J. L. (2006). Risk analysis for biological hazards: what we need to know about invasive species. Risk Analysis: An International Journal, 26, 163-173.
Taugourdeau, S., Villerd, J., Plantureux, S., Huguenin‐Elie, O., & Amiaud, B. (2014). Filling the gap in functional trait databases: use of ecological hypotheses to replace missing data. Ecology and Evolution, 4, 944-958.
Thuiller, W., Richardson, D. M., Pyšek, P., Midgley, G. F., Hughes, G. O., & Rouget, M. (2005). Niche‐based modelling as a tool for predicting the risk of alien plant invasions at a global scale. Global Change Biology, 11, 2234-2250.
Tilman, D. (2001). Functional diversity. Encyclopedia of biodiversity. Academic Press, San Diego, 109-120. van Buuren, S., & Groothuis-Oudshoorn, K. (2011). mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software, 45. van der Heijden, G. J., Donders, A. R. T., Stijnen, T., & Moons, K. G. (2006). Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example. Journal of clinical epidemiology, 59, 1102-1109.
20
Van Kleunen, M., Weber, E., & Fischer, M. (2010). A meta‐analysis of trait differences between invasive and non‐invasive plant species. Ecology letters, 13, 235-245.
Vander Zanden, M. J., Hansen, G. J., Higgins, S. N., & Kornis, M. S. (2010). A pound of prevention, plus a pound of cure: early detection and eradication of invasive species in the Laurentian Great Lakes. Journal of Great Lakes Research, 36, 199-205.
Vandewalle, M., De Bello, F., Berg, M. P., Bolger, T., Doledec, S., Dubs, F., ... & Da Silva, P. M. (2010). Functional traits as indicators of biodiversity response to land use changes across ecosystems and organisms. Biodiversity and Conservation, 19, 2921-2947.
Walther, G. R., Roques, A., Hulme, P. E., Sykes, M. T., Pyšek, P., Kühn, I., ... & Czucz, B. (2009). Alien species in a warmer world: risks and opportunities. Trends in ecology & evolution, 24, 686-693.
Westphal, M. I., Browne, M., MacKinnon, K., & Noble, I. (2008). The link between international trade and the global distribution of invasive alien species. Biological Invasions, 10, 391-398.
Williamson, M., & Fitter, A. (1996). The varying success of invaders. Ecology, 77, 1661-1666.
Winkler, R. L. (1989). Combining forecasts: A philosophical basis and some current issues. International Journal of Forecasting, 5, 605-609.
Wittenberg, R., & Cock, M. J. (Eds.). (2001). Invasive alien species: a toolkit of best prevention and management practices. CABI.
Zenni, R. D., & Nuñez, M. A. (2013). The elephant in the room: the role of failed invasions in understanding invasion biology. Oikos, 122, 801-815.
21
Chapter 1
The rich get richer: invasion risk across North America from the aquarium pathway under
climate change.
Authors: Lidia Della Venezia, Jason Samson and Brian Leung
A version of this chapter has been published in the journal Diversity and Distributions 24(3),
285-296. It is reprinted here with permission from John Wiley & Sons, Inc.
22
1.1 Abstract
Aim: To evaluate how the establishment risk of freshwater fish species from the aquarium trade will change under a climate change scenario forecast for the year 2050.
Location: North America
Methods: In order to estimate changes in the magnitude of risk across geography and across different species in the aquarium pathway, we considered an integrated approach to modelling the probability of establishment, which simultaneously included proxies of propagule pressure, environmental variables, species traits and interactions between environment and traits. We then used the parameters of our model to predict how the risk of establishment will change under a scenario of climate change forecast for the year 2050.
Results: Our joint model performed better than submodels, suggesting that combining all components is worthwhile. The most predictive factors were precipitation, maximum temperature tolerance, maximum fish length and minimum temperature. Our joint model forecast a 40% increase in the average risk of establishment by 2050 in the United States. In contrast to our expectations, the absolute establishment risk associated with this pathway remained very low for the entire suite of species in the aquarium trade in northern regions, such as Quebec, Canada.
Instead, Florida, which has one of the highest current risks of establishment, was also forecasted to have the greatest absolute risk increase.
Main conclusions: Our methodology for risk assessment allows invasive species management strategies to consider entire suites of species at a time, and to forecast establishment risk for each species and location. While the aquarium pathway is likely to become more important for the
USA, the Quebec government should prioritize other pathways of introduction in its exotic
23 invasive species strategy. Our approach can be extended to be applied to different sets of species pertaining to the same or different pathways.
24
1.2 Introduction
Over the past few decades, invasive species have received increasing attention from the scientific community due to their potential consequences, environmentally (e.g. biodiversity loss,
Doherty et al., 2016; Mack et al., 2000; Sala et al., 2000) and economically (Pimentel et al.,
2001). In new habitats, invasive species can act as competitors, predators or parasites of species belonging to the native community (Lockwood et al., 2013), and can cause huge economic losses by interfering with agriculture, livestock, and human health (Pimentel et al., 2005). Specifically, freshwater fauna appears to be very sensitive to invasions, being characterized by extinction rates five times higher than for terrestrial fauna in North America (Ricciardi & Rasmussen, 1999;
Strayer & Dudgeon, 2010).
Many invasive species are introduced via trade, leading to the release and dispersal of potentially detrimental non-indigenous organisms (Hulme, 2009). Specifically, the aquarium commerce has been identified as an important source of propagule pressure (number of individuals introduced), and has led to the invasion and spread of various aquatic organisms in the United States, whereas fewer have established in Canada (Duggan et al., 2006; Gertzen et al.,
2008), potentially because of harsh climate conditions. The high propagule pressure from the aquarium commerce can be explained by the popularity of the aquarium hobby in North
America, with over 10% of households possessing some ornamental fish. Out of those, 96% of the volume of fish imported is represented by freshwater species (Chapman et al., 1997; Ramsey,
1985). The aquarium trade has thus been identified as a significant source of potentially invasive species both in the United States and Canada (Rixon et al., 2005) and it is important to analyze it explicitly (Gertzen et al., 2008).
25
While strides have been made predicting different phases of the invasion process, these have only recently been examined in conjunction with other drivers such as climate change
(Hobbs & Mooney, 2005; Holzapfel & Vinebrooke, 2005; Muhlfeld et al., 2014). Climate change can affect invasive species and their impact at different stages of the invasion process
(initial introduction, establishment, impact) and through different mechanisms (Hulme, 2016;
Walther et al., 2009). For example, milder winters might mean a higher chance of establishment for species adapted to warm environments, such as sub-tropical species in temperate areas
(Hellmann et al., 2008). Additionally, established non-indigenous species could expand their existing range given novel environmental conditions, potentially having an impact on a larger scale (e.g. Morrison et al., 2005). A recent study has shown the potential for several species to move from the Atlantic to the Pacific ocean and vice versa under climate change, with potential important consequences on entire communities (Wisz et al., 2015). For these reasons, climate change should be explicitly addressed to inform proper invasive species management (Mainka &
Howard, 2010). This is especially true given the fact that synergistic interactions between biological invaders and climate change have been observed and classified as an important threat to biodiversity (Brook et al., 2008). The influence of climate change on aquatic species invasiveness and impacts is difficult to predict but it will likely exacerbate the issue for freshwater aquarium species, which tend to be mainly tropical (Rahel & Olden, 2008). For example, a reduction in ice cover and an increase in oxygen conditions during the winter season, in addition to warmer water in summertime, could increase the chances of survival and reproduction for freshwater species (Rahel & Olden, 2008). Normally hostile environments for tropical fish like those in much of Canada, where harsh winters are generally a limiting factor for the survival of aquarium species, might become more suitable for establishment in a warming
26 climate. As such, there is interest in assessing changing establishment risk levels for aquarium species in northern regions, making pathway level analysis of the aquarium trade under climate change of direct and immediate policy relevance.
Notably, environmental suitability is not the only determinant of invasive species establishment. Species functional traits and their native range, propagule pressure, reproductive behaviours, taxonomy, genetics and time elapsed since a species was release, have been proven to be important predictors of establishment and impact (Heger & Trepl, 2003; Van Kleunen et al., 2010; Williamson & Fitter, 1996). Including species traits (Syphard & Franklin, 2010) and propagule pressure in species distribution models (SDM) (Leung & Mandrak, 2007) has proven effective. By combining these three components, we can model entire pathways (e.g. Bradie et al., 2013), while also considering both the geographical context, as well as the effects of changing environmental conditions such as climate change. Additionally, focusing on pathways of invasion can be advantageous when single species models cannot be built due to absence or scarcity of data.
Here, we hypothesized that the establishment risk posed by aquarium fish species across
North America would increase over time due to climate change. We also hypothesized that aquarium-mediated invasions will become an emerging threat in parts of Canada with climate change, even though these regions have not historically suffered from aquarium-mediated invasions. To estimate geographically-dependent, pathway-level risk of establishment, we used a combination of propagule pressure proxies, environment, and species traits (PET) of freshwater fish species from the aquarium trade pathway in the United States. We then projected the potential future establishment risk in the USA and in the Canadian province of Quebec under a climate change scenario forecasted for the year 2050, thus extrapolating over different
27 geographical and temporal scenarios. We chose to extrapolate predictions to Quebec as our study originated by a project in collaboration with the Quebec Ministry of Forest, Wildlife and Parks
(Ministère des Forêts, de la Faune et des Parcs (MFFP); http://mffp.gouv.qc.ca). The MFFP was interested into assessing the probability of new invasive aquatic species because the normally harsh winter conditions in Quebec prevent the establishment of aquarium species, but this barrier may no longer be effective in the future as climate change may open up opportunities for these species to establish, especially in the southernmost regions of the province.
1.3 Methods
1.3.1 Model formulation
The probability of establishment P(E) of a species from the aquarium pathway, as modelled by Bradie et al. (2013), was determined using the following equation: