DISENTANGLING THE GENETICS OF COEVOLUTION IN POTAMOPYRGUS
ANTIPODARUM AND MICROPHALLUS SP.
By
CHRISTINA E JENKINS
A dissertation submitted in partial fulfillment of the requirements for the degree of
DOCTOR OF PHILOSOPHY
WASHINGTON STATE UNIVERSITY School of Biological Sciences
JULY 2016
© Copyright by CHRISTINA E JENKINS, 2016 All Rights Reserved
© Copyright by CHRISTINA E JENKINS, 2016 All Rights Reserved
To the Faculty of Washington State University:
The members of the Committee appointed to examine the dissertation of CHRISTINA E
JENKINS find it satisfactory and recommend that it be accepted.
Mark Dybdahl, Ph.D., Chair
Scott Nuismer, Ph.D.
Joanna Kelley, Ph.D.
Jeb Owen, Ph.D.
ii Acknowledgement
First and foremost, I need to thank my committee, Mark Dybdahl, Scott Nuismer, Joanna
Kelley and Jeb Owen. They have put in a considerable amount of time helping me grow and learn as a scientist, and have consistently challenged me to be better during my Ph.D. studies. I cannot find words to thank them enough, so for now, “thank you” will need to suffice. I especially thank Mark and Scott; coadvising was an adventure and one I embarked on gladly.
Thank you for all the input and effort, even when it made all three of us cranky.
I need to thank the undergraduates and field assistants that have worked for and with me to collect data, process samples, plan field seasons and generally make my life easier. Thanks to
Jared and Caitlin for their tireless work (seriously, hours upon hours of their time) running flow cytometry to answer questions about polyploidy. Thank you to Meredith and Jordan for collecting snails, through sand flies, rain, hangovers, and occasionally hypothermia. And especially to Jordan: it is not easy traveling with someone for 6 weeks at a time on the far side of the world, but he made our trips productive and fun.
I need to thank all of my lab mates, from both the Nuismer lab at University of Idaho:
Ailene McPherson, ET Thornquiste, Anahi Espindola, Virginie Poullain and Bob Week; and from the Dybdahl lab at Washington State University: Jennifer Madrid Thorson, Jon Finger
(DIJON!) and Mark Smithson. Being in two different labs is a lot like having two families.
Neither of them knows much about each other, and there isn’t much interaction as a whole unit.
But similar to having two families, I cannot imagine going through the last six years without the endless support from every one of them.
I need to thank my friends in the graduate student and postdoc community on the
Palouse. Six years is a long time to slog through a Ph.D. and I have met some amazing people,
iii many of whom I now consider family. The non-exhaustive list of people I need to thank: Emily
Jones, Katie Shine, Diego Morales, Simon Uribe-Convers, Kayla Hardwick, Travis Hagey, Tim
McGuin, Natalie Gage, Hannah Marx, Roxy Hickey, Matt Pennell, Matt Singer, Daniel Beck,
Tyler Heather, Gen Metzger, TATE (just Tate), Wesly Loftie-Eaton, Thibault Stadler, Amy
Worthington, Maribeth Latvis, Sarah Jacobs, Marius Myrvold, Urs Weber, Andy Kramer, Ben
Weideback, Chloe Stenkamp-Strahm, Erin Weise and many more. Thank you for supporting me, for taking care of me, for encouraging me, and for believing in me. Especially when I wasn’t able to do any of these things for myself.
I need to thank a few people who have acted in the role of “partner” over the last few years. First, I need to thank Bobbi Johnson. I don’t know how I convinced the coolest person on the planet to be friends with me, but I’m sure glad I did. If I had half of the life skills that she does, I could take over the world, and I’m amazed every day that she hasn’t done so already.
Thank you for taking care of all the things I am entirely incapable of.
Kimberly Lackey and I undertook the impossible and succeeded. We were asked to teach a course for which there was no course. We wrote all the labs, the quizzes, the lectures, the exams and any other relevant material. We met constantly for two years, spoke many times a day, produced a cohesive class, and published a text book together. I could not have done this with anyone else, and I will forever be grateful for our partnership.
Finally, I need to thank my “domestic partner” Hannah Marx. We have a wine club membership, a CSA, a storage unit together and have the keys to each other’s apartment. We have done “romantic weekends” in Paris, McCall, Seattle, and many more. I can’t imagine not having you in my life, and can’t wait for our adventures in the future.
iv A Ph.D. takes a long time, and I simply would not have been able to undertake it without the support of my family and friends, who I would like to thank a million times. Especially my baby niece, Annie Grace, who makes me smile a little bit every day.
v DISENTANGLING THE GENETICS OF COEVOLUTION IN POTAMOPYRGUS
ANTIPODARUM AND MICROPHALLUS SP.
Abstract
by Christina E. Jenkins, Ph.D. Washington State University July 2016
Chair: Mark Dybdahl
Host-parasite coevolution is potentially important for many evolutionary transitions such as the evolution of sexual reproduction, ploidy, and the evolution of mating systems according to mathematical models of coevolution. In these models, host-parasite interaction is characterized by host resistance and parasites’ infectivity which is assumed to be based on a matrix of genotype by genotype specificity (GxG). Importantly, a recent trend has demonstrated that the
GxG matrix assumed in a given theoretical model will drastically alter the outcome of the above evolutionary transitions. Consequently, determining the form of genetic interaction matrices in natural populations is crucial to both understanding coevolution and the resulting evolutionary transitions. In this dissertation, I explore the genetics of host-parasite interactions in three different ways. First, using the New Zealand snail, Potamopyrgus antipodarum and its undescribed trematode parasite Microphallus sp., I tested the fit of different genetic models by comparing the resistance of triploid and tetraploid hosts, which differ in gene dosage, heterozygosity, and abundance of novel alleles. In my second chapter, I explored the molecular and genetic basis of traits associated with infection by assembling and annotating a transcriptome for Microphallus sp. and comparing the results with other similar parasites. First, to facilitate
vi comparisons, I determined the phylogenetic placement of Microphallus sp. among the trematode parasites. Second, I compared the genes expressed in Microphallus sp. with those of other well- studied trematode parasites. Finally, because trematodes infect both snail and vertebrate hosts in their life cycle, I used further comparative analyses to determine whether Microphallus sp. are expressing genes to evade the vertebrate or invertebrate immune system. Finally, in my third chapter, I developed a technique for finding the genomic regions involved in coevolution. Using genomic data, we can look for SNPs that covary spatially between the host and parasite, as these will likely mark regions involved in local adaptation. I tested the efficacy of this technique using simulated populations of hosts and parasites with coevolving and neutral loci. I altered models of infection, evolutionary and statistical parameters to determine when we are able to detect coevolving loci.
vii TABLE OF CONTENTS
Page Abstract ...... vi
TABLE OF CONTENTS ...... viii
LIST OF TABLES ...... x
LIST OF FIGURES ...... xi
Dedication ...... xiii
Introduction ...... 1
The role of ploidy in host resistance ...... 8
Abstract: ...... 8
Introduction ...... 9
Materials and Methods ...... 12
Results ...... 16
Discussion ...... 17
Acknowledgements ...... 19
The Microphallus sp. transcriptome and an analysis of its taxonomic relationship to other
Digenea parasites ...... 24
Abstract ...... 24
Introduction ...... 26
Materials and Methods ...... 29
Results ...... 33
Discussion ...... 37
Acknowledgements ...... 41
Identifying genomic hot spots of coevolution in host-parasite systems ...... 61
viii Abstract ...... 61
Introduction ...... 63
Overview of Approach ...... 65
Method testing ...... 67
Discussion ...... 74
Literature Cited ...... 82
ix LIST OF TABLES
Table 1 - Five host source populations with the respective percentage of 3N and 4N individuals within each sample and the 3 populations from which parasites were collected and the ploidy of their hosts. The ploidy of the hosts at each parasite population are included...... 22
Table 2- Results of a generalized linear model predicting host infection based on ploidy level, parasite source, host source and all possible interactions...... 23
Table 3- Information from both 454 and Illumina reads, and the subsequent hybrid assembly. .. 45
Table 4- Population specific data for reads included in the reference assembly and the percentage of reads that mapped to the reference transcriptome...... 46
Table 5 – GO terms with their associated descriptions of the annotated contigs from the
Microphallus sp. transcriptome...... 47
Table 6 – Taxanomic information, NCBI accession numbers, and references for all samples included in our 28s nrDNA phylogeny...... 49
Table 7 – Results of reciprocal blast species of Microphallus sp. transcriptome to the assembled transcriptomes of the nematode C. elegans, and the trematode parasites C. sinensis, E caproni, F. hepatica, O. viverrini, S. mansoni and T. regent...... 55
Table 8 – Comparison of Microphallus sp. transcriptome with EST libraries of the five stages of the S. mansoni life cycle...... 56
Table 9 – Candidate genes with GO annotations from the SwissProt database that are associated with the Microphallus sp. metacercaria, S. mansoni adult worms (found within vertebrates), and
S. mansoni sporocysts (found within invertebrates)...... 57
Table 10 - A summary of the evolutionary conditions needed to detect spatially covarying loci, across all eight parameters and four genetic interactions...... 81
x LIST OF FIGURES
Figure 1- Proportion of 3N (white) and 4N (black) individuals infected in each host population with each parasite source. Parasite source populations are columns; host source populations are rows...... 20
Figure 2- Proportion (+/- 1 sd) of triploid (3N; white) and tetraploid (4N; black) snail individuals infected, pooling infection rates across host populations for each parasite source. Lake Kaniere parasites infect significantly more 4N individuals, Lake Rotoroa parasites infect significantly more 3N individuals and there is no difference in infection between 3N and 4N when inoculated with Lake Poerua parasites...... 21
Figure 4 - Assembled phylogeny for Microphallodidea, including the species Microphallus sp. from New Zealand, colored red...... 42
Figure 5- Top blast hits by species...... 43
Figure 6- GO terms associated with annotated genes within the Microphallus sp. Metacercaria, S. mansoni adult worms (resides within vertebrates), and S. mansoni sporocysts (reside within invertebrates)...... 44
Figure 7 - The four different genetic interactions that were tested. The interactions can be broadly grouped into two categories: escalation and matching, continuous and discrete. The first category addresses how phenotypes will interact with each other, either based on the host matching the immune system, matching, or an arms race dynamic, escalation. The second category refers to how the infection phenotype is translated from the underlying genotype, either in an additive polygenic fashion, continuous, or with each infection locus pair explicitly interacting, discrete...... 78
xi Figure 8 - The evolutionary conditions under which the spatial covariance is strong enough to be detected are those that have an impact on local adaptation. We examined the relationship between local adaptation and type II error rate, and found a significant negative relationship. To address the sampling conditions an empiricist might need to find coevolving genomic regions, we found a significant negative relationship between number of populations sampled and type II error rate. Therefore, in order to find coevolving genomic regions, the population samples must be locally adapted and have many populations sampled...... 79
Figure 9 - Under four different models of interaction based on matching, one can determine for each given amount of local adaptation, how many populations need to be sampled to have a low type II error rate...... 80
xii
Dedication
This dissertation is dedicated to my mom and dad who provided both emotional and financial
support through the long years of my education, as well as all of my DNA. Especially to my mom, who is not only serves as my biggest fan, but is also the best editor a girl could ask for.
xiii Introduction
Coevolution between hosts and parasites has been shown to drive a number of evolutionary transitions such as the evolution of ploidy (Nuismer and Otto 2004), the evolution of outcrossing
(Morran et al. 2012; 2014), increased mutation rates (M'Gonigle et al. 2009), and the evolution and maintenance of sexual reproduction (Otto and Nuismer 2004). These theoretical predictions are all based on a fundamental assumption: within a population, each host genotype is resistant to certain parasite genotypes, and each parasite genotype is only able to infect certain host genotypes. These genotype by genotype (GxG) interactions are often characterized in coevolutionary theory as infection matrices.
Two such matrices are commonly used: matching allele model (MAM) (Frank 1991;
2000) and the gene-for-gene model (GFG) (Flor 1956; Thompson and Burdon 2002). MAM is based on self/non-self recognition molecules such as major histocompatibility complex (MHC molecules) in the vertebrate immune system (Grosberg and Hart 2000). Under this model of infection, parasites that match the host will be viewed by the host immune system as “self” and will not be eliminated. The GFG model was first conceived by observing agricultural plants and their fungi. It is based on an escalatory model of infection, where virulence alleles arise in the parasite populations and correspondingly resistance alleles arise in the host population (Flor
1971). Over time, this generates more and more resistant hosts and virulent parasites. These two often used matrices are an example of many that are based on an idea of how the immune system may work, and the validity and correct usage of each has long been debated (Parker 1994; Frank
1994; Parker 1996; Frank 1996). However, a commonality for both of these systems is a lack of concrete examples within the immunological literature that they are correct representations of how infection may occur (Dybdahl et al. 2014a).
1 What’s more, recent studies have demonstrated that the choice of matrices will alter the predicted evolutionary outcome. For example, under the GFG model of infection, sexual reproduction does not evolve, while under the MAM model, it does (Agrawal and Lively n.d.).
The MAM model leads to negative frequency dependent selection, where it is advantageous for the host genotype to be different than common genotypes within the population (Koskella and
Lively 2009). Because sexual reproduction consistently produces novel genotypes, under the
MAM model, sexual reproduction can evolve (Agrawal 2009). However, because GFG leads to more resistant hosts and more virulent parasites, sexual reproduction will not evolve (Parker
1994; Agrawal and Lively n.d.). Moreover, both of these matrices were originally conceived assuming both host and parasite are haploid. The diploid MAM models that account for heterozygote genotypes are varied in both their conception and their results (Otto and Nuismer
2004; Nuismer and Otto 2005). In this one evolutionary transition that relies on host-parasite coevolution, the variation in the underlying matrix of infection, and the variation between different diploid matrices change the outcome. This is just one example of many where theoretical results have demonstrated that understanding the genetics that underlie coevolutionary relationships are important to understanding the impact of host-parasite coevolution in natural populations.
Despite the importance of finding and understanding the genomic regions that underlie coevolution, it has proven to be extremely difficult to do so. There are a number of reasons why this is the case. One problem is that there could be many genes of small effect that are undetectable using traditional genomic measures of selection (Luikart et al. 2003; Storz 2005).
This is not an unreasonable assumption for genomic regions involved in coevolution given that both known immune proteins and virulence proteins have been demonstrated to have multiple
2 loci (Dausset 1981; Hughes and Yeager 2003). Another potential difficulty lies in the deluge of data that is gathered from next-generation sequencing experiments like RNA-seq and sorting through the variation associated with coevolution may be constrained by the wealth of variation associated with other sources. Some studies have looked for differentially expressed genomic regions in infected vs. uninfected hosts in hopes that within those differentially expressed genes lie the genomic regions of interest to coevolution (Portillo et al. 2013; Tanaka et al. 2013; Foth et al. 2014; Blomström et al. 2015; Videvall et al. 2015). However, the majority of the genes expressed in infected individuals will not be related to coevolutionary interaction, but rather appear as a result of being infected (i.e., stress response genes, or starvation genes). Without an annotated reference, it is difficult to be able to sufficiently sort through the genomic regions that are present but involved with response to something besides coevolution. Therefore, despite the increasing availability of a wide array of genomic material in non-model organisms, very few studies have characterized the underlying genetics of host parasite coevolution in natural populations. There are some systems for which we understand either the genomic regions involved in host resistance (Labrie et al. 2010; Perry et al. 2015) or the parasite’s ability to infect
(Barrett et al. 2009; Dy et al. 2014; Burmeister et al. 2015), but not the key combination of both host and parasite genes that determine the outcome of coevolution.
In this dissertation, I aim to fill this obvious gap in our scientific knowledge by developing new techniques to pinpoint coevolving genes, and by exploring the genomics of infection and resistance in a very important system for studying coevolution, the freshwater snail
Potamopygrus antipodarum and its trematode parasite, Microphallus sp. P. antipodarum is native to New Zealand and has a number of genetic and ecological characteristics that make it an ideal system to study coevolution. First, Microphallus sp. has been shown to impose strong
3 enough negative frequency selection on P. antipodarum to select for the maintenance of sexual reproduction (Dybdahl and Lively 1995b; 1998; Koskella and Lively 2007). It has therefore become one of the primary systems that has consistently demonstrated the Red Queen hypothesis
(Salathé et al. 2008). Additionally, P. antipodarum and Microphallus are found in a wide variety of different lakes and streams and much work has been done on the ecology of the hosts and how it is affected by parasites. Finally, there are diploid sexuals and both triploid and tetraploid asexuals, allowing us to examine the role of sex and ploidy in host-parasite coevolution (Neiman et al. 2011). However, despite the importance of this system for studying host-parasite coevolution, the genetics of this interaction remain entirely unknown. To better understand the genetics of this system, and better inform the consequences of host-parasite coevolution across systems, I sought to start disentangling the underlying genetics of coevolution between P. antipodarum and Microphallus.
First, I sought to determine if increased gene dosage, heterozygosity, or abundance of novel alleles alters the resistance of P. antipodarum to Microphallus sp. by comparing infection rates of tetraploids with that of triploids. As mentioned above, the infection matrices that underlie host-parasite coevolution were all conceived under the assumption that both host and parasite are haploid. When considering instead how a diploid host or parasite would behave under these same infection matrices, how resistant the heterozygote host and parasites are will drastically alter the outcome of coevolution (Nuismer and Otto 2005; Agrawal and Otto 2006).
Because polyploids have higher heterozygosity than diploids, how ploidy changes resistance to parasitic infection could give us insight into understanding how the genotypic infection matrix treats heterozygotes.
4 We collected P. antipodarum snails from five populations that are known to have asexual hosts that are either triploid or tetraploid (Neiman et al. 2011). We then exposed these snails to parasites from three different populations of parasites, and after a three-month incubation period, determined infection status. Finally, we determined ploidy of each snail using flow-cytometry.
Most importantly, there was no overall increase in resistance associated with increased ploidy, contrary to the general view that polyploids are more robust due to increased gene dosage, heterozygosity, or abundance of novel alleles. Overall, this suggests that heterozygotes are either
1) not more resistant or more susceptible than homozygotes or 2) in this system, increasing ploidy does not increase heterozygosity. However we found a significant interaction between parasite population and ploidy, and speculate that it could be due to a two step infection process, such that heterozygosity increases resistance in the first step and decreases resistance in the second step. Under coevolutionary cycling, we could then expect that parasites from populations which are fixed for the first locus, heterozygotes will be more infective, and therefore, triploids will be proportionally infected less. If instead, the parasite comes from a population where the second locus is fixed, then heterozygotes will be favored and tetraploids will be infected less.
To better facilitate this endeavor, I sought to find genomic regions involved in infection in the parasite, Microphallus sp., by examining the transcriptome of the metacercariae stage of the parasite. While there is transcriptomic data available for P. antipodarum (Wilton et al.
2012a), little to no data exists for Microphallus sp.. I first sequenced and annotated the transcriptome, providing the first transcriptome for the Microphallus sp. parasite. I found key gene ontology (GO) terms that are potentially associated with host-parasite interactions. I then looked for similarity between Microphallus sp. and other well studied or medically important parasites by isolating the 28s nrDNA genomic and used this gene to construct a phylogenetic tree
5 of Microphilidae and Digenea trematodes. I also compared the transcriptome of Microphallus sp. with the SwissProt database, and other trematodes in an effort to uncover potentially similar genes involved with infection. I found high similarity with F. hepatica, S. mansoni, and C. sinensis, for which there is some awareness of the genetic basis of infection. This similarity between species will allow us to look for similar infection associated genes within the
Microphallus sp. transcriptome. Finally, I did stage specific comparisons to determine which immune response the metacercaria stage of Microphallus sp. may be trying to evade, vertebrate or invertebrate. To this end, I compared the Microphallus sp. transcriptome assembled to the five stage of the S. mansoni life cycle. I found that there is high similarity between my transcriptome and the miracidia stage (found in snails) within S. mansoni. However, when analyzing the GO terms in the vertebrate vs. invertebrate stages in S. mansoni, I found that the Microphallus sp metacercaria is qualitatively more similar to the S. mansoni stage within their vertebrate host.
This suggests that while the metacercaria are evading their snail host immune system, they are also preparing to invade the vertebrate host. The availability of this assembled transcriptome and my subsequent analyses will facilitate future research in the genomics of host-parasite interaction in this system, and thus start disentangling the genetics of host-parasite coevolution.
Finally, I developed a technique for finding the genomic regions involved in host-parasite coevolution and tested this theory using simulated populations of coevolving hosts and parasites.
Theory has demonstrated that the biotic component of coevolution is the sum of the spatial covariance of host and parasite genotypes (Nuismer and Gandon 2008). Large spatial covariances are generated by local coevolutionary selection. For example, a mutation providing greater infectivity arising in a parasite population may cause selection on a mutation for resistance in the host population and vice versa. When examined across populations, genetic
6 marker polymorphisms associated with genomics regions that are responding to reciprocal selection will be swept to high frequency together with the beneficial mutations. We can therefore find the genomic regions involved in host-parasite coevolution by looking for single nucleotide polymorphisms (SNPs) that spatially covary between hosts and parasites. The SNPs that have the highest spatial covariance are those that contribute the most to the biotic component of local adaptation, and therefore, coevolution. I tested the infection matrices, and parameters under which this technique successfully detected the loci coevolving in simulated populations of hosts and parasites. These results not only demonstrate whether this technique works, but also how robust it is under different parameters that could increase or decrease local adaptation. This technique will provide an important tool for finding genomic regions involved in host-parasite coevolution and therefore facilitated research in understanding the maintenance of genetic variation, disease and organismal interactions in general.
7 The role of ploidy in host resistance
Authors: Christina E. Jenkins, Scott Nuismer and Mark Dybdahl
*Corresponding Author
Affiliations:
1 School of Biological Sciences, Washington State University, Pullman WA
2 Department of Biology, University of Idaho, Moscow ID
Abstract:
Polyploidy is common across a wide variety of taxa, which is striking given the many barriers to polyploid establishment. A number of hypotheses compete to explain its prevalence but one intriguing possibility is that an increase in ploidy results in increased resistance to parasitic infection. We expect polyploids to be more resistant to parasites because they could have increased dosage of immune proteins, higher heterozygosity and novel alleles. We tested whether an increase in ploidy is associated with increased resistance using the freshwater snail system Potamopyrgus antipodarum and its trematode parasite Microphallus sp.. Using experimental inoculations, we found that ploidy had no overall effect on infection rates, indicating that polyploids are not consistently more or less resistant to parasites. Instead, our results demonstrated a significant interaction between parasite source population and snail ploidy suggesting that higher ploidy increases snail resistance to parasites from some lakes but decreases snail resistance to parasites from other lakes.
Keywords: Polyploidy, Host-parasite coevolution, Potamopyrgus antipodarum
8 Introduction
Polyploidy, or whole genome duplication, is particularly widespread in plants but also occurs regularly within some animal lineages (Zhang and King 1993; Masterson 1994;
Beukeboom et al. 1998; Otto and Whitton 2000; Wendel 2000; Langston et al. 2001; Seoighe
2003; D'Souza et al. 2005; Duchemin et al. 2007; Soltis et al. 2009; Ching et al. 2009). The success of polyploid lineages is somewhat surprising given the many hurdles to their establishment including competition with their diploid progenitors (Baack 2005), initial decrease in fertility after polyploidization (Ramsey and Schemske 2002; Rausch and Morgan 2005), and minority cytotype exclusion (Levin 1975; Husband 2000). As a consequence, a range of hypotheses have been developed to explain why polyploids might have an advantage over their diploid progenitors. For instance, it has been hypothesized that polyploids may be more evolutionarily flexible and likely to innovate because they have a redundant set of genes that can potentially diverge without loss of function (reviewed in (Adams and Wendel 2005; Comai
2005). It has also been suggested that polyploidy may facilitate masking of deleterious mutations
(Otto and Whitton 2000). Another intriguing possibility is that genome duplication increases resistance to parasites (Levin 1983). Due to the ubiquitous nature of parasites, this is a tempting explanation.
An increase in ploidy is associated with many genetic changes (Adams and Wendel 2005;
Chen 2007), some of which could cause an increase in resistance to parasites. For example, increased ploidy has been associated with gene redundancy, which produces new proteins or protein subunits resulting from mutational variants in redundant copies (Comai 2005; Birchler
2012). This could potentially produce novel defense proteins and thus could increase capacity to recognize and mount an immune response to a greater diversity of pathogens. Another reason
9 polyploids could have consistently higher resistance is an increase in gene dosage; with more copies of each gene, polyploids may have increased circulating defense proteins and thus greater resistance (reviewed in (Wertheim et al. 2013; Dybdahl et al. 2014a). Polyploids could also be more resistant if there is strong coevolution and polyploids have a more rapid response to selection because they are more adaptable under particular patterns of dominance (Otto and
Whitton 2000; Choleva and Janko 2013). Finally, heterozygosity has been consistently demonstrated to be greater in populations with higher ploidy, all else being equal (White 1970;
Stenberg and Saura 2013; Tayalé and Parisod 2013; Mable et al. 2015), and under some models of infection, heterozygotes are more resistant (Nuismer and Otto 2004; Agrawal and Otto 2006).
For example, under the inverse matching allele (IMA) model, host defense involves an array of recognition molecules (e.g., antibodies) that are able to recognize specific antigens and resist parasites carrying those antigens (Frank 1994). Therefore, because heterozygotes are able to recognize more parasite genotypes, they are more resistant than homozygous hosts, and thus polyploids would be more resistant than diploids. If any of these genetic changes result from polyploidization, then individuals with higher ploidy should have higher resistance.
Although increased ploidy could result in increased resistance to parasites, other genetic changes associated with an increase in ploidy could result in a consistent decrease in resistance.
For example, under some models of coevolution, heterozygotes are less resistant than homozygotes. In the matching allele (MA) model of coevolution, resistance is based upon a system of self/non-self recognition (Mode 1958). Under this model, heterozygous hosts will be susceptible to both homozygote parasite genotypes (Nuismer and Otto 2004; Agrawal and Otto
2006), and thus polyploids with more heterozygosity should be more susceptible to parasites under an MA model. Additionally, it is possible that polyploids are less resistant due to the
10 dramatic genomic rearrangement that often occurs after polyploidization (Hufton and
Panopoulou 2009), that could make newly formed polyploids less resistant (Reviewed in (Hufton and Panopoulou 2009; King et al. 2012). Under each of these scenarios, increased ploidy would be associated with consistently lower resistance.
Finally, it is also possible that polyploids are sometimes more and sometimes less resistant to parasites. For example, gene frequency fluctuations under coevolutionary cycling can result in transiently greater or lower resistance to parasite infection. Under this scenario, resistance depends on the frequencies of resistant and infective host genotypes and parasite genotypes that are coevolved to those specific genotypes within each population. If the parasite is at some point during parasite-driven allele frequency fluctuations where it favors a specific genotype, then it will be able to infect hosts of that genotype irrespective of ploidy. Another possibility is that the genetic model of infection differs among populations. Under this scenario, ploidy will not necessarily be predictive of resistance.
To date, empirical efforts in plant polyploids have shown genome duplication can have mixed consequences for species interactions. For instance, some studies have found polyploid plants to be more resistant than diploids (Busey et al. 1992; Zhao et al. 2005; Vleugels et al.
2013), whereas others have found polyploids to be less resistant (Thompson et al. 1997;
Munzbergova 2006; Kao 2008). In other cases, no differences in resistance were detected across ploidies (Burdon and Marshall 1981; Schoen et al. 1992; Ohberg et al. 2005; Yli-Mattila et al.
2009; Gottula et al. 2014) or polyploids were found to be more resistant to some parasites but less resistant to others (Nuismer and Thompson 2001; Arvanitis et al. 2007; Halverson et al.
2007). Taken together, these studies suggest that increased ploidy has no consistent impact on resistance to parasites in plant populations. Because almost no empirical investigations of
11 animals have been conducted to date, we do not yet know whether the impacts of polyploidy are similar (Guégan and Morand 1996).
We investigated the impact of increased ploidy on resistance to the parasitic trematode species, Microphallus sp in the snail, Potamopyrgus antipodarum. Within New Zealand, snail populations are comprised of sexual diploid and asexual triploid or tetraploid individuals, with some lakes containing a mix of asexual triploid and tetraploid snails (Neiman et al. 2011). We exposed triploid and tetraploid individuals of P. antipodarum from 4 populations to their coevolving parasitic trematode, Microphallus sp., from three allopatric populations of parasites, and asked whether tetraploid individuals were more resistant to parasite infection. Unlike other studies addressing the impact of increased ploidy in this system (Parsons et al. 1986; Lively et al.
2004a; Osnas and Lively 2006; Duchemin et al. 2007), we held mating system constant by studying only asexual populations of this species that differ in ploidy. Further, by challenging snails with trematodes from allopatric lakes, we eliminated the impact of coevolutionary history.
Thus, our study was designed to investigate whether increasing ploidy, per se, influences levels of parasite resistance in this system.
Materials and Methods
Study System
Potamopygrus antipodarum is a small gastropod commonly found in New Zealand lakes and streams (Talbot and Ward 1987; Jokela and Lively 1995), and diploid, triploid, and tetraploid individuals coexist across the species range (Neiman et al. 2011). The predominant parasite of P. antipodarum is an undescribed species of Microphallus (Trematoda:
Microphallidae; (Lively 1987)). Mature Microphallus produces eggs in waterfowl, which pass
12 out of the bird in the feces. The eggs are ingested by P. antipodarum and then develop and encyst in the snail, becoming infective metacercariae approximately 3 months after exposure.
During the maturation process, the parasite sterilizes its snail host, rendering it unable to reproduce. When an infected snail is then eaten by a bird host, the parasite life cycle is completed. Decades of empirical study have shown that lake populations of snails and parasites coevolve, and that parasites are consistently adapted to local lake populations of their hosts; these coevolutionary dynamics are broadly consistent with the Red Queen hypothesis (Lively
1987, Dybdahl and Lively 1998, Jokela et al. 2009, Koskela and Lively 2009).
Host and parasite sampling
Our goal was to expose P. antipodarum snails to Microphallus sp. eggs from different, allopatric populations of Microphallus sp. because we wanted to test resistance in P. antipodarum while eliminating the effects of past or historical coevolution and adaptation to specific clonal genotypes. Our snail sampling focused on lakes that contain asexual triploids and tetraploids. Snails were collected from five such lakes on the South Island of New Zealand during January 2014 (See Table 1). One of our sampled lakes, Lake Poerua, was a source for both snails and parasites; we therefore did not expose Lake Poerua snails to sympatric Lake
Poerua parasites.
We used parasites collected from three different lakes as sources of allopatric parasites; the three populations differ in the ploidy of their hosts (Table 1). Parasite eggs were collected from wild duck feces using previously established protocol (King et al. 2011b). Feces were washed with water and the mixture was then filtered through 1 mm mesh. After two weeks of
13 twice daily washing to remove organic materials and toxins from the feces, the parasite eggs were added to the snail samples.
Experimental Inoculations
We exposed samples of the five different snail populations to Microphallus sp. eggs from three different allopatric parasite populations. All experimental inoculations were conducted at the University of Canterbury in Christchurch, New Zealand. Eggs from each parasite population were equally divided among snail containers housing snails from each host population in a full factorial design with the exception of the sympatric cross between Lake Poerua hosts and parasites (Table 1). Snails were housed in 2 liter containers for the duration of the inoculation.
Temperatures were held a constant 20 C, with a 12 hour light-dark cycle. Water was changed weekly, using filtered water. Snails were fed spirulina, a standard laboratory diet. After two weeks, the snails were moved to parasite free water. Experimental snails were returned to
Washington State University and dissected three months post exposure to determine infection status. Two inoculated populations experienced high mortality rates in transit back to
Washington State University: Lake Gunn snails that were exposed to Lake Rotoroa parasites, and Lake Rotoiti snails that were exposed to Lake Poerua parasites. These two populations were excluded from all analyses. The heads of all individuals were flash frozen for ploidy determination and stored at -80 C.
Flow Cytometry
We determined ploidy by staining snail DNA with propidium iodide (PI) and quantifying
PI fluorescence with flow cytometry. For each sample, the frozen P. antipodarum heads were
14 ground in a solution containing 10 ul 1% TritonX, 0.2 ul 0.5M EDTA, 1 ml PBS, and 50 ul propidium iodide per each sample. Propidium iodide binds to DNA and fluoresces, allowing us to treat levels of fluorescence as a proxy for amount of DNA. The solution was allowed to settle and the cells were separated from larger intact tissue by pipetting off the supernatant of the solution. Chicken (Gallus gallus) red blood cells were included in each sample as an internal positive control because diploid P. antipodarum contain approximately the same amount of
DNA as chicken red blood cells, while triploid snails have 50% more DNA, and tetraploid snails have 100% more DNA (Neiman et al. 2011). As a result, we determined ploidy for each sample by comparing fluorescence levels of snail cells to that of chicken red blood cells. Fluorescence was measured by running all prepared samples through a BD FACS Calibur flow cytometer to measure fluorescence of PI, using the FL1 channel.
Statistical Analyses
In order to distinguish among our three hypotheses, we developed a generalized linear model that allowed us to compare infection rate across combinations of snail and trematode populations. Specifically, we fit a model with infection status as the dependent variable, and ploidy, host population, parasite population and their interactions as independent variables.
� = � + � + � + � ∗ � + � ∗ � + � ∗ � + � ∗ � ∗ �
Here, � is the infection status, � , is the ploidy, � is the source population of the host, and � is the source population of the parasite.
Because � is a binomial variable, we used a generalized linear model with binomial link function. The generalized linear model is like ordinary linear regressions, but it does not rely on the assumption that the dependent variable is normally distributed; rather, error structure is
15 designated by the user, binomial in this case. In a binomial model, the sign of the B-coefficient
(i.e., the slope) for each explanatory variable within the model determines whether it is positively or negatively associated with our dependent variable, here individual infection status. All statistical analyses were conducted using the R programing platform (R Development Core
Team, 2008).
Results
Analysis of our full model revealed that ploidy had no overall impact on individual infection status (p = 0.899; Table 2). Thus, our results provide no evidence that polyploidy is consistently associated with greater or lower resistance to parasites. Additionally, we found no significant effect of host population (p = 0.871; Table 2)(Figure 1), so it does not appear that some host populations are simply more resistant than others. However, there is a significant effect of parasite (p = 0.0036; Table 2), suggesting that some parasite populations are better able to infect than other parasite populations.
Only one of the interaction terms, between parasite source and ploidy, was statistically significant (p = 0.002, Figure 2) such that parasites from some source populations infected triploids at a higher rate than tetraploids, while parasites from other source populations infected tetraploids at a higher rate than triploids. Fisher’s Exact post-hoc tests were used to determine how parasite source populations differed in their capacity to infect triploids versus tetraploids across all host populations. Within the snails of all five populations exposed to Lake Kaniere
2 parasites, a higher percentage of tetraploids were infected than triploids (χ = 5.3932; p =0.0202).
Conversely, Lake Rotoroa parasites infected a higher percentage of triploids than tetraploids (χ2
16 = 7.5367; p = 0.0061). Finally, parasites from Lake Poerua did not differ in their ability to infect triploids and tetraploids (χ2 = 0.0052; p = 0.9424) (Fig. 3).
Discussion
The possible greater resistance of polyploids to ubiquitous parasites might explain their relative abundance in the diversity of plants and animals. However, the effect of elevated host ploidy on resistance is uncertain because of the many possible genetic and phenotypic changes that can occur due to polyploidization (Comai 2005; Sémon and Wolfe 2007), and because these changes interact with the various and largely unknown genetic models determining infection and coevolution (REFS). We studied variation in resistance of asexual triploids and tetraploids of the fresh water snail Potamopyrgus antipodarum to its coevolved trematode parasite Microphallus sp.. Because we exclusively used asexual snails from populations where ploidy varies (Neiman et al. 2011), we were able to isolate the effects of increased ploidy independent of reproductive mode, a traditional problem due to the association in animals between increased ploidy and asexual reproduction (Otto 2007).
There are number of potential mechanisms that could have resulted in higher ploidy in P. antipodarum altering resistance to Microphallus sp.. For example, in snail defense against macroparasites like trematodes, snails with higher ploidy should express more gene copies and produce more effector molecules that attack the parasite and counteract their defenses. Increased heterozygosity due to an increase in ploidy could either increase or decrease resistance, dependent on the underlying model of infection. When the host recognizes and attacks the parasite (IMA), greater heterozygosity leads to resistance to a great diversity of parasite genotypes. On the other hand, when the host is attacked by parasites that match the host
17 genotype (MA), greater heterozygosity leads to greater susceptibility to a larger diversity of parasites. In contrast, we found no significant main effect of host ploidy on their likelihood of infection, suggesting that there is no overall consistent effect of ploidy on resistance. Because we did not see an overall effect of ploidy, we can conclude that gene redundancy, gene dosage, or heterozygosity does not affect resistance in this system. However, whether these genetic changes are happening and do not affect resistance, or whether these changes do not occur due to polyploidization in this system is largely unknown.
Previous studies in our system have considered the role of host ploidy with respect to parasites but were hindered by population reproductive mode. For example, a large meta-analysis of local adaptation studies in this system (Lively et al. 2004a) tested for a difference in resistance between diploids and triploids. While they found some effect of ploidy, the largest predictor of whether a host was infected was commonness of the genotype (Dybdahl and Lively 1998).
Because triploids are asexual, and therefore more common than sexually reproducing diploids, triploids tended to be infected more often, independent of ploidy. That the present study showed no effect of ploidy concurs with previous work in this system. However, because we exclusively used asexual snails from populations where ploidy varies, we are able to definitively say ploidy is not related to infection rate in these snails.
Despite the lack of overall effect of ploidy, we did find that Microphallus sp. parasites are either better or worse at infecting tetraploids depending on their parasite population of origin.
Without specific genotypic data within our system to determine what is specifically mediating the genotype by genotype interaction between host and parasite, and how this is altered by increased ploidy, we are unable to say why some parasites are better at infecting triploids or tetraploids. We considered a number of different models of infection, selection regimes and
18 coevolutionary cycling scenarios, however, we are unable to adequately explain why we see this pattern. Previous work in plant systems has demonstrated a similar effect, where some seed predators preferentially attack diploids and some prefer tetraploids (Nuismer and Thompson
2001), with a similar inability to explain why we might see this pattern. Additionally, when addressing the role of increased ploidy on plant populations, results are generally mixed. While our result does not confirm this hypothesis specifically, it does suggest that it might be the case in the P. antipodarum-Microphallus sp. system.
Thus, although it is tempting to think that the reason polyploidy is so prevalent in natural populations is because they are more resistant, the general conclusion is that there is no general pattern of resistance associated with either plant or P. antipodarum polyploids.
Acknowledgements
We would like to thank our field assistants, Jordan Erlenbach and Meredith Kee. I would like to thank the National Science Foundation IGERT and the Washington State University Elling
Fellowship for funding.
19 Lake Kaniere Lake Poerua Lake Rotoroa 0.6 Lake Gunn 0.4
0.2
0.0 0.6 Lake Haupiri 0.4
0.2
0.0 0.6 Lake Mavora 0.4
Individual Infected 0.2
0.0
% Individuals Infected 0.6 Lake Poerua 0.4
Proportion Proportion of 0.2
0.0 0.6 Lake Rotoiti 0.4
0.2
0.0 3N 4N 3N 4N 3N 4N PloidyPloidy
Figure 1- Proportion of 3N (white) and 4N (black) individuals infected in each host population with each parasite source. Parasite source populations are columns; host source populations are rows.
20 Lake Kaniere Lake Poerua Lake Rotoroa
0.5
0.4
0.3 Individual Infected
% Individuals Infected 0.2 Proportion Proportion of
0.1
0.0
3N 4N 3N 4N 3N 4N Ploidy Ploidy
Figure 2- Proportion (+/- 1 sd) of triploid (3N; white) and tetraploid (4N; black) snail individuals infected, pooling infection rates across host populations for each parasite source. Lake Kaniere parasites infect significantly more 4N individuals, Lake Rotoroa parasites infect significantly more 3N individuals and there is no difference in infection between 3N and 4N when inoculated with Lake Poerua parasites.
21 Table 1 - Infection rates of triploids, tetraploids and the average for each of the three parasite source populations (ploidy of their hosts shown in parentheses), and the five host populations with the frequency of triploid and tetraploid individuals within each sample.
%3N %4N % Parasite % 3N % 4N Infected Infected Infection Source Host Source Lake Gunn 52.94 47.05 33.33 43.75 38.23
Lake Haupiri 62.66 18.96 42.55 59.09 45.68
Lake Kaniere Lake Mavora 29.23 9.09 30 40 30.90 (2N Hosts) Lake Poerua 47.23 28.70 23.37 48.38 30.55
Lake Rotoiti 31.80 45.75 20.48 32.85 26.14
Lake Gunn 15.81 45.16 29.41 28.57 29.03
Lake Poerua Lake Mavora 31.25 11.76 30 25 29.41 (3N/4N Hosts) Lake Haupiri 59.09 14.47 29.23 27.27 28.94
Lake Haupiri 47.65 13.82 35.80 15.38 32.97
Lake Mavora 39.69 24.76 37.97 19.23 33.33 Lake Rotoroa (2N/3N Hosts) Lake Poerua 25.92 26.31 26.19 13.33 22.80
Lake Rotoiti 32.11 32.69 54.28 29.41 46.15
22 Table 2- Results of a generalized linear model predicting host infection based on ploidy level, parasite source, host source and all possible interactions. df: degrees of freedom, resid.dev: residual deviance.
Factor df resid. dev p-value Ploidy 1 0.02590 0.899 Host population 4 1.24140 0.871 Parasite source 2 11.95687 0.0036 Host source x Parasite source (interaction) 5 1.38714 0.9304 Ploidy x Host Source (interaction) 4 1.34272 0.854 Ploidy x Parasite Source (interaction) 2 12.42784 0.0020 Host Source x Parasite Source x Ploidy (interaction) 5 0.79313 0.9774
23 The Microphallus sp. transcriptome and an analysis of its taxonomic
relationship to other Digenea parasites
Authors: Christina E. Jenkins1,2,*, Diego Morales2, Daniel D. New3, Mark Dybdahl1, and Joanna
L. Kelley1
*Corresponding Author
Affiliations:
1 School of Biological Sciences, Washington State University, Pullman WA
2 Department of Biology, University of Idaho, Moscow ID
3 IBEST Genomics Resources Core, University of Idaho, Moscow ID
Abstract
The genetics of host-parasite interactions are important in coevolutionary biology. However, the traits and genes under coevolutionary selection in natural populations are largely unknown.
Arguably one of best natural systems for studying host-parasite coevolution, the fresh water snail
Potamopyrgus antipodarum and its trematode parasite Microphallus sp. is no exception. Here we used RNA-seq data to assemble and annotate the first transcriptome of Microphallus sp. to aid in identifying the genes underlying the host-parasite interaction, and facilitate future research in coevolutionary genetics. We sequenced, assembled, and annotated the Microphallus sp. transcriptome using samples of metacercaria dissected from the snail host, providing the first transcriptome for this species. To explore the genes associated with successful infection, we studied our transcriptome and designed comparisons with other well-studied trematodes. To facilitate comparative analysis with related trematodes, we first isolated 28S nrRNA and developed a phylogeny to place our undescribed species. To see which genes are expressed in the snail host, we first examined gene ontology (GO) terms. We found Microphallus sp.
24 transcripts that are associated with immunological terms, and with other pathogen terms. We then looked for similarity in the transcriptomes of our parasite and other related trematode parasites. We found high similarity with other digenean trematodes; specifically Schistosoma mansoni, Fasciola hepatica and Clonorchis sinensis all demonstrated high similarity with
Microphallus sp.. Finally, because the parasite metacercaria form in the snail host but also infect the vertebrate final host, we sought to determine if genes expressed were related to evasion of the vertebrate or invertebrate immune system. We found that although there is high similarity between our transcriptome and the transcripts from the snail portion of the S. mansoni lifecycle, the GO terms are qualitatively similar to the portion of the S. mansoni found within the vertebrate host. This work represents the first genomic and taxonomic data on Microphallus sp., which will facilitate future work on host-parasite interactions. We found putative genes associated with infection that merit further investigation. The transcriptome is similar to other parasitic helminth parasites, which is promising to look for other parasitic genes associated with infection within Microphallus sp.. Additionally, the metacercaria stage shows similarity with both the vertebrate and invertebrate portion of the S. mansoni lifecycle. This work will facilitate work on the genotypic interaction that drives the coevolutionary process.
Keywords: Microphallus sp., RNA-seq, Microphallidae taxonomy
25 Introduction
The molecular and genetic basis of host-parasite interaction is important in coevolutionary biology (Hamilton 1980; Agrawal and Lively 2001; Nuismer and Otto 2004).
Studies using RNA-seq of a few key host or parasite species of medical importance reveal putative genes involved in resistance and suggest mechanisms by which genotype by genotype interactions (GxG) are mediated (Gasnier et al. 2000a; Hurtrez-Bousses et al. 2001; Young et al.
2011; Mitta et al. 2012). But we know of no studies that have looked at both host and parasite transcriptomes and none that have done so in a coevolutionary context. The molecular mechanisms of infection and the underlying genetics are barely known for any natural system of host-parasite coevolution (Barribeau et al. 2014). It is critical that we find the genomic regions involved in host-parasite coevolution to confirm the validity of theoretical predictions of coevolution and their applicability in naturally coevolving populations.
In order to facilitate coevolutionary analysis, we need genomic resources available for both the host and parasite in a system where we understand the coevolutionary process. One classic natural system to study coevolution is the fresh water snail, Potamopyrgus antipodarum, and its trematode parasite, Microphallus sp.. In this system, we know infection is based partially on the genetic identity of the host. Researchers utilize this system to examine negative frequency dependent selection — where the most common host genotype is consistently the most infected (Dybdahl and Lively 1995b; 1998). This genetic specificity is robust to the effect of variation in the physiological condition of the host (Krist et al. 2004). The parasite populations are adapted to their local host populations and this local adaptation is the result of genotype-specific tracking by the parasite (Lively and Dybdahl 2000; Lively et al. 2004b). This system has been seminal in demonstrating the evolution and maintenance of sexual reproduction
26 (Lively 1989; Koskella and Lively 2007; 2009; King and Lively 2009; King et al. 2011a).
However, what little is known of the specific molecular or genetic mechanisms of the interaction comes from experimental infection studies and indirect inference (Dybdahl et al. 2014b).
Genomic resources have been developed for the snail host P. antipodarum as part of an ongoing characterization of their molecular and the evolutionary history. Allozymes have been developed and used to track the genotype frequencies of P. antipodarum within populations over time (Dybdahl and Lively 1995a). The phylogenetic relationship and cladal composition of snails has been determined using cytochrome B sequence data, which showed that the spread of the snail across New Zealand corresponds to the receding glaciers in the last ice age (Neiman and
Lively 2004; Dybdahl and Drown 2010). Additionally, the transcriptome of P. antipodarum has been sequenced (Wilton et al. 2012b).
On the other hand, despite 30 years of extensive research in this coevolutionary system, very little genetic or genomic data are available to address questions about the evolutionary history or infection genetics for the parasite Microphallus sp.. Allozyme variation has been used to estimate gene flow among populations (Dybdahl and Lively 1996). We know that genetic interactions are important because of widespread local parasite adaptation, and the breakdown of infection in F1 hybrids between parasite populations (Dybdahl et al. 2008). But the molecular traits and specific genes that determine infectivity are unknown. However, a comparative approach is possible because of the expanding genomic information on helminth parasites, and specifically of other trematode parasites (Gasnier et al. 2000b; Roger et al. 2008; Han et al. 2009;
Consortium et al. 2010; Howe et al. 2016).
A comparative study of genes and proteins involved in infection in Microphallus sp. is hampered by a lack of taxonomic placement of the parasite; we do not know its evolutionary
27 history, so we are unable to robustly compare it to other parasite genomes. In fact, the parasite populations in New Zealand represent an undescribed taxon that was given the genus name
Microphallus by a personal communication (Lively and McKenzi 1991). Because of the extensive work in this system done by Curt Lively, it has been suggested that this parasite be called “Microphallus livelyi” (Hechinger 2012), but this work failed to describe the morphology or phylogenetic placement of Microphallus sp. Although this is an excellent system to study host-parasite coevolution, the lack of genomic resources and taxonomic affiliation for the parasite is a barrier to understanding the underlying genetic basis of infection.
We remedied this gap in coevolutionary research by examining actively transcribed genes in its metacercaria stage found within the snail host, and comparing expressed regions to other well studied trematode transcriptomes in order to identify active genes. We first sequenced and assembled the transcriptome of Microphallus sp. To facilitate our comparative approach, we isolated the 28S nrRNA gene and used it to determine the phylogenetic placement of this undescribed parasite to other well-studied trematode parasites. We then annotated the transcriptome, to look for proteins that may be involved in the host-parasite interaction. We looked for similarity between our parasite and similar well studied digenean trematode parasites.
We then looked for similarity between our parasite and medically relevant trematode parasites using NCBI Blast. Finally, we sought to determine if Microphallus sp. metacercaria is evading the vertebrate or invertebrate immune system by comparing our transcriptome to transcripts from the five Schistosoma mansoni life cycle stages. This work represents the first molecular and taxonomic characterization of these evolutionarily important parasites and will facilitate future studies into the genetics of host-parasite coevolution in this system.
28 Materials and Methods
Sampling and Sequencing
Microphallus sp. has a complex lifecycle, infecting both a vertebrate (bird) host and an invertebrate (snail) host. Eggs containing miracidia are shed from the bird into the environment through feces. Snails ingest these eggs during regular feeding. The eggs develop into sporocysts which asexually reproduce into metacercaria within the snail host. Metacercaria are ingested by the duck when they ingest the snail, and the adults undergo sexual reproduction to produce eggs
(Galakionov et al. 2012). Due to ease of sampling, we isolated metacercaria by dissecting infected snails.
All of the Microphallus sp. parasites used in this study were isolated by dissecting P. antipodarum hosts from populations that are known to have high rates of infection (Lively
1987). We isolated Microphallus cysts from four different populations on the South Island of
New Zealand (Table 1). We collected parasites from five snails from each of the four populations.
RNA was extracted from each of the parasites using a phenol-chloroform RNA extraction protocol (Gasic et al. 2004). All individuals within populations were pooled and the samples were purified by selective depletion of the ribosomal RNA transcripts from total RNA
(RiboMinus Eukaryote Kit for RNA-Seq). We then removed any genomic DNA contamination using TURBO DNA-free kit (Ambion). We ran an RNA picochip analysis on the Agilent 2100 for quality control and found the RNA to be of sufficient quality to make into libraries. Lake
Rotoroa was of lower quality because of RNA degradation, but was of sufficient quality to include. We then made cDNA using both random hexamers and oligo-dT primers to minimize positional bias in the transcriptome data (Chapalamadugu et al. 2014). Approximately half of the
29 mRNA from each population was prepared for sequencing on a 454 sequencer. We used the protocol from the 454 sequencing cDNA rapid library prep manual to prepare the 454 libraries
(Roche Diagnostics GmbH 2010). The other half of the cDNA was made into Illumina libraries using the Apollo 324 library prep Truseq (Integenx). The samples were then run on an Illumina miSeq platform using a v2 500 cycle kit (2x257 cycles).
To remove any remaining rRNA contamination prior to assembly, the reads were filtered in silico. We created a Microphalloidea ribosomal database by searching the NCBI database for ribosomal sequences within this family of trematodes. Using Bowtie 2 (Langmead and Salzberg
2012), all reads were aligned to our ribosomal database, and reads that did not map (those that were not ribosomal) were collected in a separate fasta file. Of the 5,614,799 raw reads across both sequencing platforms, 63.9% were removed. The resulting 2,022,980 reads were used in transcriptome assembly.
Assembling Reference Transcriptomes
The reads from both the Illumina and 454 platforms from all populations were combined and assembled using the MIRA Hybrid assembler (Chevreux et al. 1999). MIRA has an internal trimming algorithm that removes the adapters from all reads prior to assembly. Additionally,
MIRA removes excess copies of any given site through digital normalization. Although
2,022,980 reads were read into the assembler, only 109,831 reads were used in the reference transcriptome, largely due to normalization (Table 3).
To assess quality, all pooled reads were then mapped back onto the reference transcriptome using bwa aligner (Li and Durbin 2009; Li et al. 2009). Additionally, reads from
30 each population were independently mapped to the reference transcriptome to determine if each population was adequately represented in the reference (Table 4).
Phylogenetics
The large ribosomal subunit, 28S nrRNA gene, which is widely used in phylogeny reconstruction, has also been specifically informative in determining relationships between species and genera in the Microphalloidea family (Tkach et al. 2003b; Galaktionov et al. 2012;
Kudlai et al. 2015). We isolated this genomic region from our reference transcriptome by comparing a generic Microphallus sp. 28S nrRNA sequence to our transcriptome and pulling out the hit with the highest similarity score (length= 1785 bp, similarity=94%). We included species from Microphallidae, Prosthogonomidae, Pleurogenidae, and Lecithodendriidae subfamilies.
Additionally we included a number of undescribed Microphallus species from Russia, Australia, and Japan (Kakui 2011; Galaktionov et al. 2012; Kudlai et al. 2015). All 28S nrRNA sequences were downloaded from the NCBI nucleotide database (Table 5).
Sequences were aligned using MAFFT v7.037b (Katoh and Standley 2013). The best-fit models of sequence evolution for the gene region was determined using a decision-theory (DT) approach (Minin et al. 2003), which was implemented in PAUP* 4.01a147 (Swofford 2002). The maximum likelihood analysis was conducted with RAxML v8.0.3 (Stamatakis 2014) using the
GTR + G model. One hundred searches for the best tree were performed and clade support was assessed with 1,000 bootstrap replicates. Bayesian inference analysis was conducted with
MrBayes v3.2.6 (Ronquist et al. 2012) on the CIPRES portal (Miller et al. 2010). Each analysis consisted of four independent runs with four Monte Carlo Markov Chains (MCMC) for 50 million generation and tree sampling every 1,000th generation, using the rate variation selected
31 by DT approach and using reversible-jump Markov Chain Monte Carlo (rjMCMC) to allow sampling across the entire substitution rate model space (nst = mixed). Convergence of the four independent MCMC runs was assessed using Tracer 1.6 (Rambaut and Drummond 2003). A
50% majority rule consensus tree was generated and posterior probability (PP) calculated after removing the first 10% of sampled trees.
Transcriptome Annotation
We used BLASTx (Camacho et al. 2009) to compare our reference transcriptome to the
NCBI non-redundant database (Pruitt 2004). We also compared the transcriptome to the
SwissProt database, but did not find hits that were not captured by the NCBI nr database BLAST results. We limited our BLAST search to hits with e-value < 10-5 and matched the resulting
NCBI entrez ID and with UniprotKB gene ontology (GO) terms (Table 6).
We found that our parasite was similar to a number of other helminth parasites, and so we compared our transcriptome to trematodes assembled transcriptomes from the WormBase database (Howe et al. 2016). Reciprocal similarity searches were conducted using blastn with an e-value threshold of 0.0001. We then calculated the percent coverage as the number of unique hits, divided by the number of sequences within the database. The higher the percent coverage, the more similar the two transcriptomes.
To determine whether Microphallus sp. metacercaria is expressing genes that might be involved with evasion of the vertebrate or invertebrate immune system, we blasted our transcriptome to ESTs from each of the five stages of the S. mansoni lifecycle. Sequences were downloaded from the NCBI EST database, and assembled into a blast database using the NCBI blast toolkit. Similarity searches were conducted using blastn with an e-value threshold of
32 0.0001. However, while blast comparisons show how similar proteins are between our parasite and the different stages of S. mansoni, it fails to capture what those similarities might be, and whether they are related to evading the immune system. As a result, we also compared the GO terms from Microphallus sp. metacercaria with ESTs found of the snail stage of S. mansoni and the vertebrate stage. We specifically focused on the GO terms related to the immune system, and immune response to determine if the metacercaria stage is evading the invertebrate or vertebrate immune system.
Results
Assessing the quality of the Transcriptome
Across all four populations and both sequencing platforms, we generated a total of
2,022,980 reads, 320,415 from the 454 platform and 1,702,565 reads from the Illumina platform.
After quality checks, a total of 109,831 reads were used in the reference assembly, out of the
2,022,980 total reads that were imported into MIRA. In total, 18,000 contigs were assembled with a mean length of 504 bp, ranging in size from 4,446 bp to 362 bp. The general quality of a de novo transcriptome assembly can be assessed by quantifying the proportion of reads that map back to the transcriptome, where greater than 50% is considered high quality (O'Neil and Emrich
2013). Of the combined trimmed 454 and Illumina reads, 83% mapped to our assembled transcriptome — signifying high quality. We then mapped the trimmed reads from each population to the reference transcriptome. The percentage of reads mapped ranged from 18.63%
(Lake Rotoroa) to 83.7% (Lake Kaniere) (Table 4).
33 Putative proteins involved in infection
For annotation, putative gene sequences were first searched using the BLASTx tool against the SwissProt database using a cut-off e-value of 10-5 and limiting our search to one result per contig. Using this approach, 1864 genes (10.4% of all contigs assembled) returned an above the cut-off BLAST result (see supplementary file). GO assignments were used to classify the function of the predicted Microphallus sp. genes (Table 5). In each of the main categories
(biological process, cellular component and molecular function) of the GO classification,
“cellular process”, “cell” and “binding” are the most represented, respectively. Not surprisingly, these are all housekeeping processes and are likely not specific to host-parasite interactions.
Interestingly, a few contigs linked to GO terms that may be associated with capacity to infect. The “leukocyte activation” GO term (0.37% of our contigs) is of some interest.
Leukocyte activation is defined as a change in morphology and behavior of a leukocyte resulting from exposure to a specific antigen. Extracellular parasites have been known to evade the host immune system through mimicking surface cell receptors. As another example, we found ~1% of our contigs linked with GO terms that are linked with viruses “viral entry into host cell” and
“virion attachment to host cell,” both of which are involved with a viral pathogen infecting the host. While our parasite is far removed from viruses, it is possible that these contigs are linked to similar immune evasion proteins and thus merit further investigation. Finally, ~ 1% of contigs were linked to immune effector process, which can be either immune response of Microphallus sp. to infection by other organisms, or could be used to evade the immune system of its host.
While these are not definitely involved in infection, it does give a suite of genes that are worth investigating.
34 Finding similar parasites
One possible avenue to find genomic regions involved in the interaction between
Microphallus and P. antipodarum is to look for genes involved in infection in similar parasites.
To that end, we sought to understand how closely related our parasite is to other digenean trematodes and other Microphallidea parasites by generating a phylogeny. Our tree contains representative species from all families of the suborder Digenia (Table 6). Our parasite is found within the Microphallidea clade, comprising four major clades with strong support (Figure 4).
The first clade consisted of the Lecithodendriidae subfamily (7 species), the second clade contained the Pleurogenidae subfamily (8 species), and the third clade contained the
Prosthogonomidae subfamily (12 species). The final clade contained all the undescribed species of Microphallus, including our species, collected from Potamopyrgus antipodarum, and largely contained the Microphallidae subfamily. Our species is a close sister taxa with an undescribed
Microphallidae species collected from the Brisbane River in Australia (Kudlai et al. 2015), and
Microphallus fusiformis. Together these three form a clade distinct from the Maritrema genus clade, and either sister to or part of the larger clade Microphallus. While our tree has them within the Microphallus clade, the nodal support is too low to definitively place them within this genus
(Figure 3 and 4).
Previous work has placed the two other species within our identified clade, Microphallus fusiformis and the Microphallidae species from Brisbane, closer to the in the Maritrema clade
(Kudlai et al. 2015), as opposed to the Microphallus clade. To determine if these three species are within the Maritrema clade, we did a topology test. Briefly, we conducted ML searches with a constraining topology to include the three species within the Maritrema clade. Then the difference in likelihood scores between the constrained and unconstrained ML trees were
35 calculated as the test statistic, and a null distribution for this test statistic was generated by simulating new datasets using the topology and parameter estimates from the constrained likelihood search. After 100 simulations, we rejected the hypothesis that these three trematode species are within the Maritrema clade (p= 0.009).
We looked for similarity with other related parasites, to determine whether we could expect to find proteins known to be involved other well studied host-parasite interactions within our system. Within the Microphallidea clade, there are no sequenced and annotated transcriptomes. However, our blast results demonstrated that our parasite was similar to several medically important, and therefore well studied parasites (Figure 5). We compared our transcriptome to other digenean trematodes for which a full transcriptome is available,
(Clonorchis sinensis, Echinostoma caproni, Fasciola hepatica, Opisthorchis viverrini,
Schistosoma mansoni and Trichobilharzia regenti) as well as the well described nematode,
Caenorhabditis elegans. While the species that were similar are not taxonomically close, they are very well characterized. Knowing how similar our parasite is to these well studied trematodes may allow us to find similar proteins involved in the host-parasite interaction from these systems within our system. We used reciprocal blast searches, and analyzed the number of similar contigs (unique hits) vs. the number of unique contigs to determine percent coverage or similarity of each trematode with Microphallus sp. (Table 7). We found that there was high similarity with S. mansoni (66%), C. sinenesis (51%), and F. hepatica (45%). The genetic basis of infection in both S. mansoni and F. hepatica has been investigated for the vertebrate host, and in S. mansoni, the molecular basis of infection has been discovered in the snail, Biomaphlaria glabrata.
36 Evading the vertebrate or invertebrate immune system
Our final approach to identifying genes that might be important to host parasite interactions was to determine whether expressed transcripts map to genes that might be involved in immune evasion in the invertebrate or vertebrate host. Microphallus sp. has a multistage life cycle involving both a vertebrate host (water fowl) and an invertebrate host (P. antipodarum). To determine the genetics of infection with the snail, we needed to discern if it is primarily expressing genes to evade the vertebrate immune or invertebrate immune system in the collected metacercaria stage. We compared our transcriptome to ESTs from each of the five stages in the similar parasite, S. mansoni. The stages can be categorized as being within the vertebrate
(humans) or the invertebrate (B. glabrata). We found that our transcriptome had the most similarity with the miracidia portion of the lifecycle (51.76%), which is the portion of the S. mansoni lifecycle where the trematode is infecting the snail (Table 7). However, the blast similarity does not tell us the function of the similar genes. Therefore, we considered the GO terms from the stage that lives within the vertebrate (adult worms) and within the invertebrate
(sporocysts), compared to the GO terms associated with our metacercaria transcriptome (Figure
6). Qualitatively, the adult stage of S. mansoni is more similar to our parasite than the sporocyst stage (Table 8).
Discussion
Over the past 30 years, this trematode parasite Microphallus sp. has been used extensively in studies of the evolutionary ecology of host-parasite interactions. Despite the extensive research done in this system, the genetics of host-parasite interaction remain unknown.
37 Here we present the first sequenced and assembled transcriptome for this parasite, and build a foundation for determining the genetic basis of host-parasite interaction.
We examined the GO terms for potential genomic regions that are involved in infection. The majority of the GO terms were comprised of general housekeeping genes; however there were some interesting exceptions. Notably, there were genes associated with immune function, and virion capsid and binding. The first group, immune function genes, could be one of two things.
The trematodes themselves could be expressing immune proteins to prevent infection of bacteria or other pathogens. However, these proteins could also be to evade the host immune system, and, given that some are associated with the vertebrate immune system (leukocyte activation), then these represent possible candidates for host-parasite interaction. Additionally, virion associated genes, especially those associated with virion capsid and attachment, are genes potentially involved in infection. While our parasite does not have an intracellular stage, the capsid proteins could be used to evade the snail or vertebrate immune systems in a similar fashion as virion proteins. These make up a promising suite of proteins that may be involved in the host-parasite interaction.
In addition to not knowing the genetics of infection, the evolutionary history and taxonomy of our parasite is entirely unknown. Hechinger (2012) described this species as “Microphallus
‘livelyi’”, but his analysis did not provide any molecular or morphological evidence to place this species within the Microphallus genus. Finding taxonomically similar parasites will allow us to look for infection associated proteins from other parasites within our parasite. Our phylogenetic analysis demonstrates that while this parasite clearly falls within the Microphallidae clade, it is not clear that it falls within the Microphallus genus. Rather it is closely related to another undescribed species of Microphallidae found in Australia and Microphallus fusiformis. These
38 three either make up a separate clade of Microphallidae, or are part of the Microphallus genus.
Previous work, which also utilized the 28S nrRNA gene, determined that the Brisbane River
Microphallidae and Microphallus fusiformis were an independent clade, but placed them closer to Maritrema rather than Microphallus (Kudlai et al. 2015). However, support for this relationship was low; other studies have found Microphallus fusiformis to be closer to the
Microphallus clade (Tkach et al. 2003b). Our topology test demonstrated that Microphallus sp.,
Microphallidae Brisbane and Microphallus fusiformis are definitively not within the Maritrema clade, but we cannot confirm that it is within the Microphallus clade. In order to better resolve the position of these three species, a few key pieces of data are needed. First, we need to include more than the one genomic region used to construct our phylogeny. However, the majority of previous phylogenies examining the Microphallidae taxonomy was constructed using only the
28S nrRNA gene region (Olson et al. 2003; Tkach et al. 2003a; Galaktionov et al. 2012), as this is the only genomic region available for most Microphalloidea. While we sequenced and assembled a transcriptome for Microphallus sp., to our knowledge, this is the first and only full transcriptome available for this clade. The paucity of transcriptomes for other Microphalloidea species, or even of data from gene regions other than 28S nrRNA, is impeding the generation of a tree that may ultimately resolve the evolutionary history of our parasite.
In addition, our phylogenetic analysis demonstrated that our parasite is related to other trematode parasites that are medically relevant and well-studied. Utilizing our blast search of the
SwissProt database, we found that our parasite had strong similarity with other helminth parasites such as Schistosoma mansoni, Echinococcus granulosa, Oxytricha trifallax, Tricuris trichuria and Hymenolypsis microstoma. We further looked for similarity between our parasite and medically relevant parasites using reciprocal blast, and found that our parasite shares high
39 similarity with S. mansoni, F. hepatica and C. senensis. The genetic basis of infection in both S. mansoni and F. hepatica has been investigated for the vertebrate host, and in S. mansoni, the molecular basis of infection has been discovered in the snail, Biomaphlaria glabrata. Therefore, the similarity between Microphallus sp. and these well studied trematodes allows us to look for similar proteins associated with infection within Microphallus sp., and could lead to finding the genomic region involved in infection.
We also examined whether the metacercaria stage used in this study is evading the invertebrate immune system (within the snail), or preparing to evade the vertebrate immune system (within the waterfowl). We found that our transcriptome has high similarity with the miracidia stage in S. mansoni, which is the portion of the S. mansoni lifecycle that penetrates and actively infects the snail Biomaphlaria glabrata. However, qualitatively, the GO terms associated with the adult worm were more similar to the Microphallus sp. metacercaria than the sporocysts. Given that the sporocysts reside for multiple generations within their snail host, and our transcript shares similarity with another portion of the S. mansoni lifecycle within the snail, this is surprising. It appears that during the metacercaria stage, the parasite is expressing genes to evade the snail immune system, due to the similarity with the miracidia, but that it is also preparing to evade the vertebrate immune system. It is worth examining the transcriptomic differences between different stages of the Microphallus sp. life cycle, to determine the difference between stages and the potential role those differences play in the genomic basis of host-parasite interaction.
Overall, this work represents the first molecular data available for the trematode parasite
Microphallus sp. of the fresh water snail P. antipodarum. From these sequences, we can look at variation within and between populations to understand the evolutionary history of the parasite
40 population, and further studies of the genetic basis of host-parasite coevolution within this system.
Acknowledgements
Data collection and analyses were performed by the IBEST Genomics Resources Core at the
University of Idaho and were supported in part by NIH COBRE grant P30GM103324.
41 Microphallus sp. Wellers Rock New 1.00 93 0.72 Microphallus sp. IBC 2010 Russia Microphallus basodactylophallus 0.99 56 Microphallus primas 1.00 100 0.60 Microphallus abortivus 1.00 90 Microphallidae sp. Okinawa Microphallus triangulatus 1.00 99 1.00 Microphallus minutus 0.86 98 60 Microphallus similis Microphallus sp. New Zealand 1.00 100 1.00 Microphallidae gen. sp. Australia 96 Microphallus fusiformis Microphallidae Maritrema poulini 0.60 56 1.00 Maritrema brevisacciferum 94 1.00 1.00 Maritrema deblocki 66 66 Maritrema neomi 0.55 0.51 Maritrema novaezealandense 56 1.00 1.00 Maritrema eroliae 72 100 Maritrema heardi
1.00 Microphalloidea 88 Maritrema oocysta Maritrema prosthometra 1.00 1.00 100 100 0.63 Maritrema arenaria Maritrema subdolum Pycnoporus megacotyle 1.00 100 Pycnoporus heteroporus 1.00 77 0.79 Prosthodendrium hurkovaae 0.81 61 Lecithodendrium linstowi Lecithodendriidae Prosthodendrium longiforme 1.00 1.00 1.00 100 100 Prosthodendrium chilostomum 100 1.00 78 Prosthodendrium parvouterus Ophiosacculus mehelyi Lecithopyge rastellus 1.00 100 1.00 Haplometra cylindracea 100 Plagiorchis vespertilionis 1.00 76 Haematoloechus varioplexus 1.00 Prosthogonomidae 1.00 100 100 0.77 Haematoloechus longiplexus 51 Telorchis assula Brachycoelium salamandrae Schistogonimus rarus 0.62 1.00 100 1.00 Prosthogonimus ovatus 100 Prosthogonimus cuneatus Parabascus semisquamosus 1.00 1.00 Pleurogenidae 98 84 1.00 Parabascus joannae 98 0.88 Parabascus duboisi 52
1.00 Allassogonoporus amphoraeformis 100 Loxogenes macrocirra Pleurogenes claviger 1.00 99 1.00 Candidotrema loossi 99 Pleurogenoides medians Prosotocus confusus Brandesia turgida 0.005
Figure 3 - Assembled phylogeny for Microphalloidea, including the species Microphallus sp. from New Zealand, colored red.
42 Schistosoma mansoni
Hungatella hathewayi
Stylonychia lemnae
Crassostrea gigas
Trichuris trichiura
Nematostella vectensis Species Names Echinococcus granulosus
Medicago truncatula
Hymenolepis microstoma
Oxytricha trifallax
0 100 200 300 400 Number of Blast Hits Figure 4- Top blast hits by species.
43 Metacercariae Adult Sporocysts Biological Process Cellular Component Molecular Function
activation of immune response chemorepellent activity leukocyte activation nutrient reservoir activity somatic diversification of immune receptors
antigen processing and presentation circadian sleep/wake cycle leukocyte homeostasis organelle structural molecule activity
antioxidant activity detoxification leukocyte mediated cytotoxicity organelle part supramolecular fiber
apolipoprotein A−I receptor activity developmental growth leukocyte migration other organism synapse
behavior developmental process localization other organism part synapse part
beta selection electron carrier activity locomotion plasmodesma T cell costimulation
binding erythrocyte differentiation macromolecular complex polymeric cytoskeletal fiber T cell selection
biological adhesion extracellular matrix membrane presynaptic process involved in synaptic transmission transcription cofactor activity
biological regulation extracellular matrix component membrane part production of molecular mediator of immune response transcription factor activity, protein binding
cargo receptor activity extracellular region membrane−enclosed lumen protein tag transcription factor activity, sequence−specific DNA binding
cartilage condensation extracellular region part metabolic process receptor activity transcription factor activity, transcription factor binding
catalytic activity growth metallochaperone activity reproduction translation regulator activity
cell hematopoietic or lymphoid organ development mitochondrial nucleoid reproductive process transmembrane receptor activity
cell adhesion hemocyte differentiation molecular function regulator response to stimulus transporter activity
cell junction hemocyte proliferation multi−organism process rhythmic process virion
cell killing immune effector process multicellular organismal process signal transducer activity virion attachment to host cell
cell part immune response myeloid cell homeostasis signaling pattern recognition receptor activity virion part
cellular component organization or biogenesis immune system development neurotransmitter secretion signaling receptor activity virus receptor activity
cellular process immune system process nucleic acid binding transcription factor activity single organism signaling
chemoattractant activity laminin receptor activity nucleoid single−organism process
Figure 5- GO terms associated with annotated genes within the Microphallus sp. Metacercaria, S.
mansoni adult worms (resides within vertebrates), and S. mansoni sporocysts (reside within
invertebrates).
44 Table 3- Information from both 454 and Illumina reads, and the subsequent hybrid assembly.
Library Information
Total number of reads 2,022,980
Total number of reads after trimming 1,719,533
Total number of reads assembled 98,259
Average trimmed 454 read length 396 bp
Shortest trimmed 454 read 246
Longest trimmed 454 read 706
Average trimmed Illumina read length 228
Shortest trimmed Illumina read 79
Longest trimmed Illumina read 256
Total number of contigs 18,000
Mean length of contigs 504 bp
Contig N50 492 bp
45 Table 4- Population specific data for reads included in the reference assembly and the percentage of reads that mapped to the reference transcriptome.
Parasite 454 Illumina Total number of Reads mapped to Population Trimmed Trimmed Trimmed Reads Reference Reads Reads Transcriptome Lake Rotoroa 91,941 437,469 529,410 18.63%
Lake Kaniere 85,008 279,225 364,233 62.36%
Lake Alexandrina 93,588 517,072 610,660 78.18%
Lake Ianthe 49,878 468,799 518,677 83.70%
46 Table 5 – GO terms with their associated descriptions of the annotated contigs from the
Microphallus sp. transcriptome.
GO Category GO Term Percentage cellular process 17.89 metabolic process 15.41 single-organism process 14.26 biological regulation 9.51 response to stimulus 6.00 localization 5.88 cellular component organization or biogenesis 5.63 multicellular organismal process 5.28 developmental process 5.13 single organism signaling 2.75 multi-organism process 2.13 reproduction 1.31 reproductive process 1.29 locomotion 1.29 immune system process 1.08 biological adhesion 1.04 cell adhesion 0.98 immune response 0.55 Biological Processes behavior 0.46 growth 0.43 immune system development 0.39 immune effector process 0.20 rhythmic process 0.20 activation of immune response 0.18 leukocyte activation 0.17 leukocyte migration 0.12 detoxification 0.10 myeloid cell homeostasis 0.08 antigen processing and presentation 0.07 virion attachment to host cell 0.07 presynaptic process involved in synaptic transmission 0.04 leukocyte mediated cytotoxicity 0.03 somatic diversification of immune receptors 0.03 production of molecular mediator of immune response 0.03 cartilage condensation 0.01 T cell selection 0.01
47 cell 18.54 cell part 18.49 organelle 15.61 organelle part 9.85 membrane 8.12 macromolecular complex 7.36 membrane part 5.35 extracellular region 5.00 extracellular region part 3.94 membrane-enclosed lumen 3.32 Cellular Component cell junction 1.23 polymeric cytoskeletal fiber 0.94 synapse 0.63 extracellular matrix 0.47 synapse part 0.42 other organism 0.16 other organism part 0.16 virion 0.15 extracellular matrix component 0.12 virion part 0.09 mitochondrial nucleoid 0.06 binding 44.42 catalytic activity 28.32 structural molecule activity 7.86 transporter activity 5.45 molecular function regulator 3.99 receptor activity 1.92 signal transducer activity 1.54 electron carrier activity 1.39 transcription factor activity, sequence- specific DNA binding 1.09 transcription factor activity, transcription factor binding 1.02 Molecular Function signaling receptor activity 0.98 transmembrane receptor activity 0.64 cargo receptor activity 0.56 antioxidant activity 0.26 signaling pattern recognition receptor activity 0.19 translation regulator activity 0.11 virus receptor activity 0.08 protein tag 0.08 laminin receptor activity 0.04 apolipoprotein A-I receptor activity 0.04 chemorepellent activity 0.04
48 Table 6 – Taxonomic information, NCBI accession numbers, and references for all samples included in our 28s nrDNA phylogeny.
Accession Taxonomic Group Species Number Source Class Trematoda Subclass Aspidogastrea Order Aspidogastrida Family Aspidogastridae Aspidogaster conchicola AY222162 Olson 2003 Cotylaspis sp AY222165 Olson 2003 Cotylogaster basiri AY222164 Olson 2003 Lobatostoma manteri AY157177 Olson 2003 Multicotyle purvisi AY222166 Olson 2003 Family Multicalycidae Multicalyx elegans AY222163 Olson 2003 Order Stichocotylida Family Rugogastridae Rugogaster hydrolagi AY157176 Olson 2003 Subclass Digenea Order Echinostomida Superfamily Echinostomoidea Family Atractotrematidae Atractotrema sigani AY222267 Olson 2003 Family Echinostomatidae Echinostoma revolutum AY222246 Olson 2003 Euparyphium melis AF151941 Olson 2003 Family Fasciolidae Fasciola gigantica AY222245 Olson 2003 Fasciola hepatica AY222244 Olson 2003 Family Haploporidae Hapladena nasonis AY222265 Olson 2003 Pseudomegasolena ishigakiense AY222266 Olson 2003 Family Hymenocotta mulli AY222239 Olson 2003 Haplosplanchnidae Skrjabinoeces similis AY222279 Olson 2003 Family Philophthalmidae Cloacitrema narrabeenensis AY222248 Olson 2003 Unidentified philophthalmid AY222247 Olson 2003 Family Psilostomidae Psilochasmus oxyurus AF151940 Olson 2003 Superfamily Heronimoidea Family Heronimidae Heronimus mollis AY116878 Olson 2003 Superfamily Paramphistomoidea Family Cladorchiidae Solenorchis travassosi AY222213 Olson 2003 Superfamily Pronocephaloidea Family Labicolidae Labicola elongata AY222221 Olson 2003 Family Notocotylidae Notocotylus sp AY222219 Olson 2003 Family Opisthotrematidae Lankatrema mannarense AY222222 Olson 2003 Family Pronocephalidae Macrovestibulum obtusicaudum AY116877 Olson 2003 Family Rhabdiopoeidae Rhabdiopoeus taylori AY222218 Olson 2003 Taprobanella bicaudata AY222112 Olson 2003 Superfamily Microscaphidioidea
49 Family Mesometridae Mesometra sp AY222216 Olson 2003 Family Microscaphidiidae Hexangium sp AY222215 Olson 2003 Order Plagiorchiida Gaevskajatrema halosauropsi AY222207 Olson 2003 Superfamily Allocreadioidea Macvicaria macassarensis AY222208 Olson 2003 Family Opecoelidae Peracreadium idoneum AY222209 Olson 2003 Family Opistholebetidae Maculifer sp. AY222211 Olson 2003 Opistholebes amplicoelus AY222210 Olson 2003 Superfamily Lepocreadioidea Family Acanthocolpidae Cableia pudica AY222251 Olson 2003 Stephanostomum baccatum AY222256 Olson 2003 Family Apocreadiidae Homalometron armatum AY222241 Olson 2003 Homalometron synagris AY222243 Olson 2003 Neoapocreadium splendens AY222242 Olson 2003 Schistorchis zancli AY222240 Olson 2003 Family Brachycladiidae Zalophotrema hepaticum AY222255 Olson 2003 Family Enenteridae Enenterum aureum AY222232 Olson 2003 Koseiria xishaense AY222233 Olson 2003 Family Gorgocephalidae Gorgocephalus kyphosi AY222234 Olson 2003 Family Gyliauchenidae Paragyliauchen arusettae AY222235 Olson 2003 Family Lepocreadiidae Preptetos caballeroi AY222236 Olson 2003 Preptetos trulla AY222237 Olson 2003 Superfamily Microphalloidea Family Microphallidae Microphallidae gen.(Australia) KT355820 Kudlai 2015 Microphallidae sp. (Okinawa) AB974360 Kakui 2011 Microphallus abortivus AY220626 Galaktionov 2012 Microphallus basodactylophallus AY220628 Galaktionov 2012 Microphallus fusiformis AY220633 Galaktionov 2012 Microphallus primas AY220627 Galaktionov 2012 Microphallus similis AY220625 Galaktionov 2012 Microphallus sp. (NZ) KJ868217 O'Dwyer 2014 Microphallus sp. (Russia) HM584142 Galaktionov 2012 Maritrema arenaria AY220629 Galaktionov 2012 Maritrema neomi AF151927 Galaktionov 2012 Maritrema oocysta AY220630 Galaktionov 2012 Maritrema prosthometra AY220631 Galaktionov 2012 Maritrema subdolum AF151926 Galaktionov 2012 Floridatrema heardi AY220632 Galaktionov 2012 Superfamily Opisthorchioidea Family Cryptogonimidae Caecincola parvulus AY222231 Olson 2003 Siphodera vinaledwardsii AY222230 Olson 2003 Mitotrema anthostomatum AY222229 Olson 2003 Family Heterophyidae Cryptocotyle lingua AY222228 Olson 2003 Galactosomum lacteum AY222227 Olson 2003
50 Haplorchoides sp. AY222226 Olson 2003 Family Opisthorchiidae Amphimerus ovalis AY116876 Olson 2003 Superfamily Plagiorchioidea Family Auridistomidae Auridistomum chelydrae AY116872 Olson 2003 Family Brachycoeliidae Brachycoelium salamandrae AF151935 Olson 2003 Mesocoelium sp. AY222277 Olson 2003 Family Cephalogonimidae Cephalogonimus retusus AY222276 Olson 2003 Family Choanocotylidae Choanocotyle hobbsi AY116865 Olson 2003 Family Dicrocoeliidae Brachylecithum lobatum AY222260 Olson 2003 Lyperosomum collurionis AY222259 Olson 2003 Dicrocoelium dendriticum AY222261 Olson 2003 Family Encyclometridae Encyclometra colubrimurorum AF184254 Olson 2003 Family Gorgoderidae Degeneria halosauri AY222257 Olson 2003 Gorgodera cygnoides AY222264 Olson 2003 Nagmia floridensis AY222262 Olson 2003 Xystretrum sp AY222263 Olson 2003 Family Lecithodendriidae Lecithodendrium linstowi AF151919 Olson 2003 Prosthodendrium longiforme AF151921 Olson 2003 Family Omphalometridae Rubenstrema exasperatum AY222275 Olson 2003 Family Pachypsolidae Pachypsolus irroratus AY222274 Olson 2003 Family Lecithodendriidae Lecithodendrium linstowi AF151919 Tkach 2003 Ophiosacculus mehelyi AF480167 Tkach 2003 Prosthodendrium chilostomum AF151920 Tkach 2003 Prosthodendrium hurkovaae AF151922 Tkach 2003 Prosthodendrium longiforme AF151921 Tkach 2003 Prosthodendrium parvouterus AY220617 Tkach 2003 Pycnoporus heteroporus AF151918 Tkach 2003 Pycnoporus megacotyle AF151917 Tkach 2003 Family Plagiorchiidae Plagiorchis vespertilionis AF151931 Tkach 2003 Lecithopyge rastellus AF151932 Tkach 2003 Haematoloechus varioplexus AF387798 Tkach 2003 Haematoloechus longiplexus AY222280 Olson 2003 Glypthelmins quieta AY222278 Olson 2003 Family Pleurogenidae Parabascus duboisi AY220618 Tkach 2003 Parabascus joannae AY220619 Tkach 2003 Parabascus semisquamosus AF151923 Tkach 2003 Brandesia turgida AY220622 Tkach 2003 Loxogenes macrocirra AY220624 Tkach 2003 Pleurogenes claviger AF151925 Olson 2003 Pleurogenoides medians AF433670 Olson 2003 Family Prosthogonimidae Prosthogonimus cuneatus AY220634 Tkach 2003 Prosthogonimus ovatus AF151928 Olson 2003 Schistogonimus rarus AY116869 Olson 2003
51 Family Telorchiidae Opisthioglyphe ranae AF151929 Olson 2003 Telorchis assula AF151915 Olson 2003 Superfamily Renicoloidea Family Renicolidae Renicola sp AY116871 Olson 2003 Superfamily Troglotrematoidea Family Orchipedidae Orchipedum tracheicola AY222258 Olson 2003 Family Paragonimidae Paragonimus iloktsuenensis AY116875 Olson 2003 Paragonimus westermani AY116874 Olson 2003 Family Troglotrematidae Nephrotrema truncatum AF151936 Olson 2003 Nanophyetus salminicola AY116873 Olson 2003 Superfamily Zoogonoidea Family Faustulidae Antorchis pomacanthi AY222268 Olson 2003 Trigonocryptus conus AY222270 Olson 2003 Bacciger lesteri AY222269 Olson 2003 Family Lissorchiidae Lissorchis kritskyi AY222250 Olson 2003 Family Monorchiidae Ancylocoelium typicum AY222254 Olson 2003 Provitellus turrum AY222253 Olson 2003 Diplomonorchis leiostomi AY222252 Olson 2003 Family Zoogonidae Deretrema nahaense AY222273 Olson 2003 Lecithophyllum botryophorum AY222205 Olson 2003 Diphterostomum sp. AY222272 Olson 2003 Zoogonoides viviparus AY222271 Olson 2003 Order Strigeida Superfamily Azygioidea Family Azygiidae Otodistomum cestoides AY222187 Olson 2003 Superfamily Bivesiculoidea Family Bivesiculidae Bivesicula claviformis AY222182 Olson 2003 Bivesicula unexpecta AY222181 Olson 2003 Bivesiculoides fusiformis AY222183 Olson 2003 Superfamily Brachylaimoidea Family Brachylaimidae Brachylaima sp. AY222167 Olson 2003 Brachylaima thompsoni AF184262 Olson 2003 Zeylanurotrema spearei AY222170 Olson 2003 Family Leucochloridiidae Leucochloridium perturbatum AY222169 Olson 2003 Urogonimus macrostomus AY222168 Olson 2003 Family Bucephalidae Prosorhynchoides gracilescens AY222224 Olson 2003 Rhipidocotyle galeata AY222225 Olson 2003 Superfamily Clinostomoidea Family Clinostomidae Clinostomum sp. AY222175 Olson 2003 Clinostomum sp. AY222176 Olson 2003 Superfamily Cyclocoeloidea Family Cyclocoelidae Family Eucotylidae Tanaisia fedtschenkoi AY116870 Olson 2003 Superfamily Diplostomoidea
52 Family Diplostomidae Alaria alata AF184263 Olson 2003 Diplostomum phoxinib AY222173 Olson 2003 Family Strigeidae Apharyngostrigea cornu AF184264 Olson 2003 Cardiocephaloides longicollis AY222171 Olson 2003 Ichthyocotylurus erraticus AY222172 Olson 2003 Superfamily Gymnophalloidea Family Callodistomidae Prosthenhystera obesa AY222206 Olson 2003 Family Fellodistomidae Fellodistomum fellis AY222282 Olson 2003 Olssonium turneri AY222283 Olson 2003 Proctoeces maculatus AY222284 Olson 2003 Steringophorus margolisi AY222281 Olson 2003 Family Tandanicolidae Prosogonarium angelae AY222285 Olson 2003 Superfamily Hemiuroidea Family Accacoeliidae Accacoelium contortum AY222190 Olson 2003 Family Derogenidae Derogenes varicus AY222189 Olson 2003 Hemiperina manteri AY222196 Olson 2003 Family Didymozoidae Unidentified didymozoid AY222192 Olson 2003 Unidentified didymozoid AY222193 Olson 2003 Unidentified didymozoid AY222194 Olson 2003 Didymozoon scombri AY222195 Olson 2003 Family Hemiuridae Dinurus longisinus AY222202 Olson 2003 Lecithochirium caesionis AY222200 Olson 2003 Lecithocladium excisum AY222203 Olson 2003 Machidatrema chilostoma AY222197 Olson 2003 Merlucciotrema praeclarum AY222204 Olson 2003 Opisthadena dimidia AY222198 Olson 2003 Plerurus digitatus AY222201 Olson 2003 Family Lecithasteridae Lecithaster gibbosus AY222199 Olson 2003
Family Sclerodistomidae Prosogonotrema bilabiatum AY222191 Olson 2003 Superfamily Schistosomatoidea Family Sanguinicolidae Unidentified sanguinicolid AY157174 Olson 2003 Aporocotyle spinosicanalis AY222177 Olson 2003 Chimaerohemecus trondheimensis AY157239 Olson 2003 Neoparacardicola nasonis AY222179 Olson 2003 Plethorchis acanthus AY222178 Olson 2003 Sanguinicola inermis AY222180 Olson 2003 Family Schistosomatidae Austrobilharzia terrigalensisb AY157249 Olson 2003 Bilharziella polonica AY157240 Olson 2003 Dendritobilharzia pulverulenta AY157241 Olson 2003 Gigantobilharzia huronensis AY157242 Olson 2003 Heterobilharzia americana AY157246 Olson 2003 Ornithobilharziella AY157248 Olson 2003
53 canaliculata Schistosoma japonicum AY157607 Olson 2003 Schistosoma mansoni AY157173 Olson 2003 Schistosomatium douthitti AY157247 Olson 2003 Schistosoma haematobium AY157263 Olson 2003 Family Spirorchiidae Spirorchis scripta AY222174 Olson 2003
Superfamily Transversotrematoidea Family Crusziella formosa AY222185 Olson 2003 Transversotrematidae Prototransversotrema steeri AY222184 Olson 2003 Transversotrema haasi AY222186 Olson 2003
54 Table 7 – Results of reciprocal blast species of Microphallus sp. transcriptome to the assembled transcriptomes of the nematode C. elegans, and the trematode parasites C. sinensis, E caproni, F. hepatica, O. viverrini, S. mansoni and T. regent.
Microphallus sp. to focal species Focal species to Microphallus sp.
Unique Unique % Unique Unique % Microphall hits in Coverage hits in Microphall Coverage us sp. focal in focal focal us sp. in focal transcripts species species species transcripts species Caenorhabditis 460 31191 1.45 400 17600 2.27 elegans Clonorchis 6911 6723 50.69 1462 16538 8.12 sinensis Echinostoma 144 18463 0.77 141 17859 0.78 caproni Fasciola 7016 8723 44.57 1713 16287 9.52 hepatica Opisthorchis 515 15841 3.15 507 17493 2.82 viverrini Schistosoma 7831 3997 66.21 2414 15586 20.41 mansoni Trichobilharzia 1015 21170 4.58 538 17462 2.99 regenti
55 Table 8 – Comparison of Microphallus sp. transcriptome with EST libraries of the five stages of the S. mansoni life cycle.
Host Life Cycle Stage Unique Hits Unique Microphallus % Coverage in in species Schistosoma to stage specific sp. transcripts each stage mansoni database Eggs 4831 13169 26.84 B. Miracidia 9316 8684 51.76 glabrata Sporocysts 4286 13714 23.81 H. sapien Cercariae 5714 12286 31.74 Adult Worm 4831 13169 26.84
56 Table 9 – Candidate genes with GO annotations from the SwissProt database that are associated with the Microphallus sp. metacercaria, S. mansoni adult worms (found within vertebrates), and
S. mansoni sporocysts (found within invertebrates).
GO Microphallus sp. S. mansoni S. mansoni Category metacercaria Adult worms Sporocysts GO Term % GO Term % GO Term % cellular process 17.89 cellular process 21.44 cellular process 29.19 metabolic process 15.41 metabolic process 19.87 metabolic process 27.36 single-organism single-organism single-organism process 14.26 process 15.56 process 9.14 biological biological biological regulation 9.51 regulation 8.26 regulation 6.19 cellular cellular component component response to organization or organization or stimulus 6 biogenesis 6.20 biogenesis 5.35 response to response to localization 5.88 stimulus 5.61 stimulus 4.01 cellular component
organization or biogenesis 5.63 localization 5.59 localization 3.70 multicellular organismal multicellular multi-organism process 5.28 organismal process 3.71 process 2.58 developmental developmental developmental process 5.13 process 3.66 process 2.50 multicellular single organism single organism organismal signaling 2.75 signaling 2.29 process 2.32 multi-organism multi-organism immune system process 2.13 process 1.46 process 1.52 Biological Processes Biological reproduction 1.31 reproduction 1.01 reproduction 1.25 reproductive reproductive reproductive process 1.29 process 0.99 process 1.20 immune system single organism locomotion 1.29 process 0.85 signaling 0.80 antigen immune system processing and process 1.08 locomotion 0.60 presentation 0.53 hematopoietic or biological lymphoid organ adhesion 1.04 immune response 0.41 development 0.53 cell adhesion 0.98 detoxification 0.32 immune response 0.49 immune response 0.55 behavior 0.32 locomotion 0.45 activation of behavior 0.46 biological adhesion 0.31 immune response 0.27 growth 0.43 growth 0.27 rhythmic process 0.27 immune system 0.39 immune system 0.23 neurotransmitter 0.09
57 development development secretion immune effector circadian process 0.2 rhythmic process 0.19 sleep/wake cycle 0.09 immune effector developmental rhythmic process 0.2 process 0.15 growth 0.09 activation of activation of erythrocyte immune response 0.18 immune response 0.13 differentiation 0.04 leukocyte antigen processing leukocyte activation 0.17 and presentation 0.12 migration 0.04 leukocyte myeloid cell migration 0.12 homeostasis 0.10 leukocyte detoxification 0.1 activation 0.10 presynaptic process involved in myeloid cell synaptic homeostasis 0.08 transmission 0.10 antigen processing and leukocyte presentation 0.07 migration 0.06 virion attachment leukocyte to host cell 0.07 homeostasis 0.03 presynaptic process involved in synaptic transmission 0.04 cell killing 0.02 leukocyte mediated cartilage cytotoxicity 0.03 condensation 0.01 somatic diversification of immune receptors 0.03 T cell costimulation 0.01 production of molecular somatic mediator of diversification of immune response 0.03 immune receptors 0.01 production of molecular mediator cartilage of immune condensation 0.01 response 0.01 T cell selection 0.01 differentiation cell 18.54 cell 20.55 cell part 23.61111111
cell part 18.49 cell part 20.47 organelle 21.46164021 macromolecular organelle 15.61 organelle 15.65 complex 20.3042328 organelle part 9.85 organelle part 10.10 organelle part 13.26058201 macromolecular membrane 8.12 complex 8.41 membrane 4.96031746 macromolecular membrane- complex 7.36 membrane 7.39 enclosed lumen 4.728835979 extracellular membrane part 5.35 membrane part 4.82 region 3.538359788 extracellular membrane- extracellular
Cellular Component Cellular region 5 enclosed lumen 3.89 region part 3.472222222 extracellular 3.94 extracellular region 3.12 cell junction 2.149470899
58 region part membrane- extracellular region polymeric enclosed lumen 3.32 part 2.67 cytoskeletal fiber 0.992063492 cell junction 1.23 cell junction 1.01 membrane part 0.760582011 polymeric supramolecular cytoskeletal fiber 0.94 fiber 0.67 plasmodesma 0.562169312 synapse 0.63 synapse 0.39 synapse part 0.099206349 extracellular matrix 0.47 synapse part 0.35 synapse 0.099206349 synapse part 0.42 plasmodesma 0.16 other organism 0.16 extracellular matrix 0.14 other organism part 0.16 nucleoid 0.06 extracellular matrix virion 0.15 component 0.05 extracellular matrix component 0.12 other organism 0.03 virion part 0.09 other organism part 0.03 mitochondrial nucleoid 0.06 virion 0.02 virion part 0.01 structural binding 44.42 binding 43.94 molecule activity 42.29 catalytic activity 28.32 catalytic activity 35.22 binding 39.43 structural structural molecule molecule activity 7.86 activity 7.86 catalytic activity 15.52 transporter transporter activity 5.45 transporter activity 5.09 activity 1.14 molecular molecular function signal transducer function regulator 3.99 regulator 2.60 activity 0.38 electron carrier translation receptor activity 1.92 activity 1.11 regulator activity 0.38
signal transducer signal transducer molecular activity 1.54 activity 0.87 function regulator 0.38 electron carrier virus receptor activity 1.39 antioxidant activity 0.77 activity 0.19 transcription factor activity, transcription factor sequence-specific activity, protein transcription DNA binding 1.09 binding 0.67 cofactor activity 0.19 transcription factor activity, transcription Molecular Function Molecular factor binding 1.02 receptor activity 0.52 protein tag 0.10 nucleic acid binding signaling receptor transcription factor activity 0.98 activity 0.52 transmembrane signaling receptor receptor activity 0.64 activity 0.28 cargo receptor transmembrane activity 0.56 receptor activity 0.18 antioxidant translation activity 0.26 regulator activity 0.09 signaling pattern 0.19 protein tag 0.08
59 recognition receptor activity translation cargo receptor regulator activity 0.11 activity 0.08 virus receptor laminin receptor activity 0.08 activity 0.05 virus receptor protein tag 0.08 activity 0.03 laminin receptor metallochaperone activity 0.04 activity 0.02 apolipoprotein A- nutrient reservoir I receptor activity 0.04 activity 0.01 chemorepellent apolipoprotein A-I activity 0.04 receptor activity 0.01 chemoattractant activity 0.01
60 Identifying genomic hot spots of coevolution in host-parasite systems
Authors: Christina E. Jenkins1,2,*, Mark Dybdahl1, and Scott L. Nuismer1
*Corresponding Author
Affiliations:
1 School of Biological Sciences, Washington State University, Pullman WA
2 Department of Biology, University of Idaho, Moscow ID
Keywords: Coevolution, molecular ecology
Abstract
The genetics of host-parasite interactions are important in coevolutionary biology. However, finding genomic regions that contribute to coevolution has proven prohibitively difficult. One promising avenue is to find loci that are involved in local adaptation, which theory has demonstrated should spatially covary between the host and parasite. Here, we formalize and evaluate the power of using spatial covariances to identify genomic regions involved in coevolution. We used individual-based simulations to test the robustness of this technique by determining our ability to detect coevolving loci under different models of infection (escalating models and matching models), when multiple loci contribute to infection, and under varying quantities of local adaptation. Under escalating models of infection, we were not able to find coevolving regions. However, when interactions are mediated by the matching models of infection, we found that when the populations that are sampled were locally adapted (high selection and low migration), and we sampled many populations, we were consistently able to detect coevolving loci. Additionally, the combination of high local adaptation and high number of populations sampled allowed us to detect coevolving regions when multiple loci are involved
61 in the infective or resistant phenotype. Overall, we developed a powerful tool to find the genomic regions involved in host-parasite coevolution.
62 Introduction
Host–parasite coevolution has the potential to drive many evolutionary transitions: from asexual to sexual (Hamilton 1980; Hamilton et al. 1990), from haploid to diploid (Nuismer and
Otto 2004; Oswald and Nuismer 2007), and from selfing to outcrossing (Agrawal and Lively
2001). However, one emerging theme from theoretical studies is that the outcome of these evolutionary transitions is dependent on the genetics of the interaction. Theoretical models investigate a small handful of genetic models of interaction between host and parasite which are designed to mimic the genes that underlie molecular interactions between host defense and parasite attack. However, the extent to which these theoretical genotype x genotype interactions
(G x G) are realistic or how they alter coevolution in natural populations is subject to debate
(Dybdahl et al 2014). In order to understand how host – parasite coevolution can cause evolutionary transitions in natural populations, we need a deeper understanding of the genes and the genetic models that mediate coevolution.
Despite the importance of coevolving genes in evolutionary biology, finding them has been prohibitively difficult. Standard approaches focus typically on either host or parasite but rarely both (Barribeau et al. 2014). One problem is that there could be many loci of small effect that together make up the resistant or infective genotype. If this were the case, then the genomic regions involved in coevolution would be undetectable using traditional genomic measures of selection (Visscher et al. 2012; Ben Lehner 2013). Given that known immune proteins often have multiple loci contributing to each protein, this is not an unreasonable barrier (Klein 1986;
Kasahara 1999). Additionally, the large number of candidate loci generated in a RNA-seq experiment poses the problem of separating out the genomic regions responding to coevolutionary selection from those responding to other factors (Romero et al. 2012). However,
63 the greatest challenge is identifying the specific loci contributing to GxG interactions and driving coevolution in both the host and parasite. Standard approaches find genes in the host (Hacquard et al. 2011) or in the parasite (Barrett et al. 2009; Sperschneider et al. 2015) independently, but not the key combinations of genes in both the host and parasite that work together to determine the outcome of the GxG interactions. Without connecting both sides of the coevolutionary interaction, our understanding of the outcomes of coevolution in natural populations remains unpredictable at best.
One promising area of research to find genotypes associated with phenotypes is to look for genomic regions that are adapted to their local environment (Coop et al. 2010; Keller et al.
2012). One definition of local adaptation is the covariance between a genotype and its environment, and therefore genotypes that covary with an environmental variable across populations are locally adapted to that variable (Blanquart et al. 2013). This principle can be extended to coevolutionary local adaptation, because parasites within a population are often locally adapted to their hosts due to reciprocal selection (Kawecki and Ebert 2004; Greischar and
Koskella 2007). Theoretically, the biotic component of local adaptation has been described as the spatial covariance between host and parasite genotypes and this theoretical prediction suggests a potentially powerful approach to detecting functionally linked loci (Nuismer and Gandon 2008).
Utilizing locally adapted populations, we can look for genomic regions (i.e. SNPs or microsatellites) that spatially covary between the host and parasite, as these will be those that are responding to coevolutionary selection. Thus, established theory suggests scanning the genomes of host and parasites for regions that covary across space, which should allow us to isolate the genomic regions involved in coevolution and provides a powerful alternative to conventional approaches.
64 Here, we formalize and evaluate the power of using spatial covariances to identify covarying genomic regions of coevolution. To this end, we addressed the following specific questions: 1) Is the ability to detect the loci undergoing coevolution affected by the genotypic model of infection? 2) Are we able to detect multiple coevolving loci that contribute to the coevolving genotype? and 3) Does the strength of local adaptation alter our ability to detect coevolving genomic regions? Overall, we determined how robust this technique is and developed a powerful new calculation to find the genomic regions involved in host-parasite coevolution.
Overview of Approach
Our Approach
Our technique utilizes spatial covariance to determine which loci are involved in host- parasite coevolution. Because of the reciprocity of coevolutionary selection, as an allele in the host for resistance increases in frequency within the population, in response an allele for resistance will also increase with the same population. Thus, the coevolving alleles will have similar allele frequencies within populations, and will spatially covary across populations
(Nuismer and Gandon 2008). We can therefore look for genomic regions that significantly spatially correlate, as these will be candidate loci for those involved in coevolution.
The reciprocal selection that defines coevolution can generate local adaptation, which ultimately underlies our technique. While an organism can, and will, adapt to both biotic and abiotic conditions, for simplicity we focus here on the GxG interactions that drive biotic local adaptation. We know that this biotic component of local adaptation, Δ , can be expressed as: