Flaviviruses Versus the Host Cell, and Evolution in the Primate Interferon Response
by
Alison Ruth Gilchrist
B.Sc., University of California at San Diego
Athesissubmittedtothe
Faculty of the Graduate School of the
University of Colorado in partial fulfillment
of the requirements for the degree of
Doctor of Philosophy
Department of Molecular Cellular and Developmental Biology
2020
Committee Members:
Sara L. Sawyer, Chair
Robert L. Garcea
Rushika Perera
Robin D. Dowell
Sabrina L. Spencer ii
Gilchrist, Alison Ruth (Ph.D., Molecular Cellular and Developmental Biology)
Flaviviruses Versus the Host Cell, and Evolution in the Primate Interferon Response
Thesis directed by Prof. Sara L. Sawyer
Long-term interactions between viruses and their hosts often develop into genetic arms races, which result in fast-evolving proteins (i.e. proteins evolving under positive natural selection), especially in immune proteins. A bioinformatic screen of proteins in a component of the primate innate immune response, the interferon system, demonstrated that proteins farther downstream of interferon induction are more likely to be evolving under positive natural selection compared to proteins in interferon induction pathways. One of the proteins under positive selection in this screen, STING, is a known target of proteases from a group of viruses called flaviviruses. The human haplotypes of STING (three of which are studied in this work) demonstrate a range of phenotypes of antagonism and interferon induction that may help explain the evolutionary history of this crucial immune protein.
The cleavage of STING demonstrates that the dengue virus protease targets host proteins for cleavage as well as viral proteins. In an attempt to identify novel targets of the dengue virus protease, a machine learning screen was used to predict possible motifs based on known motifs. This resulted in the identification of DGAT2, a newly described target of flavivirus proteases. The ability to cleave DGAT2, a host protein involved in maintaining lipid homeostasis, improves dengue virus replication, and is a conserved property of all flavivirus proteases tested. DGAT2 is not evolving under positive selection, making it a host-virus antagonistic interaction that has not resulted in an evolutionary arms race. However, identifying and describing this host-virus interaction helps us understand how dengue virus and other flaviviruses alter the host lipid environment during replication. Dedication
For my parents.
”To know how much there is to know is the beginning of learning to live.”
-Dorothy West (The Richer, the Poorer) iv
Acknowledgements
To Sara: thank you for setting up the support systems that propped me and my science up when we needed it, and for caring about the lab safety and organization that made being a bench scientist so fun. And thank you to Bob, Rushika, Robin, Sabrina, and one-time committee member
David for being unfailingly kind, supportive, and helpful.
Thank you to Nicholas Meyerson for being an unwavering hero. You helped me so much, for so long, that I will understand if you never want to speak to me again (but please do). Alex Stabell and Maryska Kaczmarek: thank you so much for welcoming me in to the lab, and for your help and friendship over the years. Thank you to Vanessa Bauer for being an incredible lab manager;
I will never forgive you for setting an impossible standard. Thank you to Elena Judd for being a wonderful technician, and then an even more wonderful undergraduate researcher—you’re going to do great things in the future. Everyone else I met and worked with in the Sawyer Lab, including but not limited to: Cody Warren, Qing Yang, Camille Paige, Kyle Clark, Will Fattor, Arturo
Barbachano-Guerrero, Joe Timpona, Emma Worden-Sapper, Sharon Wu, Obaiah Dirasantha: I’ll never forget any of you! And I can’t wait to check in on everyone over the coming years.
Thanks to my mom for letting me cry over the phone so many times, and also for bragging about me so much, making it extremely hard to drop out of grad school. Thanks to my dad for helping me climb mountains, physical and metaphorical, my whole life. Thanks to Andrew and
Daniel for nothing related to research, but everything related to sibling solidarity, and to Lena
Meyer for being my not-a-sister sister, and an endless well of support and love when I needed it.
Thank you to MCDB for a wonderful six years. To the entering MCDB class of 2014: I can’t v believe I was so lucky to spend my PhD with such wonderful classmates. To Adrian, Graycen,
Julie, Daniel, Brad, Kate, and Abby: an extra special thank you for being part of what kept me sane. For every Science Bu↵, Nerd-Niter, and ComSciCon-RMW committee member I spent time with in the last six years: I hope I learned even a small portion of what you all have to teach scientists. To every other friend and colleague that I don’t have the space to thank: know that I appreciated you dearly and you helped get me through grad school. vi
Contents
Chapter
1 Introduction 1
1.1 Viruses and hosts in evolutionary combat ...... 1
1.2 Positiveselectionintheinterferonpathway ...... 4
1.3 STING: an immune protein that is a target of flavivirus proteases ...... 5
1.4 Using machine learning to predict new targets of flavivirus proteases ...... 6
1.5 DGAT2: a novel target of flavivirus proteases ...... 7
1.6 Thesis organization ...... 9
2 Positive selection in the interferon pathway 10
2.1 Positive natural selection ...... 10
2.2 Positiveselectioninimmunepathways ...... 11
2.3 Theinterferonresponse ...... 12
2.4 Screening for positive selection in interferon pathways ...... 14
2.5 Characterization of multiple sequence alignments ...... 17
2.6 Interferon-stimulated genes experience more intense positive selection than interferon-
induction genes or randomly-selected genes ...... 19
2.7 Discussion...... 24
2.8 Methods...... 25 vii
3 STING: an immune protein that is a target of flavivirus proteases 30
3.1 Flaviviruses, and dengue viruses in particular, are world-wide pathogens with major
consequences for human health ...... 30
3.2 Human STING and the interferon response ...... 32
3.3 Human STING, but not human 78Q STING, is cleaved by multiple flaviviruses . . . 33
3.4 RodentSTINGisunderpositiveselection ...... 41
3.5 The interferon response is stimulated by STING transfection ...... 42
3.6 Active dengue virus protease inhibits interferon production ...... 44
3.7 Small di↵erences in virus replication in cell lines expressing di↵erent STING alleles . 47
3.8 Not all flavivirus proteases inhibit the STING-dependent interferon response . . . . 49
3.9 Discussion...... 49
3.10Methods...... 53
4 DGAT2: a novel target of flavivirus proteases identified by machine learning 57
4.1 Predicting targets of the dengue virus protease by machine learning ...... 57
4.2 DGATiscleavedbythedenguevirusprotease ...... 63
4.3 Mutation of the DGAT2 cleavage motif reduces viral infection...... 65
4.4 Confirmation with DGAT2 KO A549s ...... 73
4.5 ChemicallyinhibitingDGAT2activity ...... 74
4.6 Cleavage of DGAT2 is conserved in the flavivirus family...... 77
4.7 The cleavage of DGAT2: discussion ...... 79
4.8 Attempted further validation of the machine learning approach to predicting targets
offlavivirusproteases ...... 81
4.9 Methods...... 92
5 Conclusion 100 viii
Bibliography 102
Appendix
A Genes Analyzed in the Interferon Positive Selection Screen 113
B Host Proteins Predicted by Machine Learning 126 ix
Tables
Table
2.1 Many interferon genes are known to be evolving under positive selection...... 13
2.2 Genes in the interferon induction pathway and genes stimulated by interferon evolv-
ingunderpositiveselection...... 26 x
Figures
Figure
1.1 An example of the arms race between virus and host proteins ...... 3
1.2 DGAT2Biochemistry ...... 8
2.1 Simplified diagram of the interferon response...... 16
2.2 Quality and equity metrics for the three families of multiple sequence alignments
compared...... 18
2.3 Characterization of multiple species alignments ...... 21
2.4 Interferon-stimulated genes have a higher whole-gene dN/dS value than other genes,
and have more codons under positive selection than other genes...... 23
3.1 DengueviruscleavesSTINGduringinfection ...... 34
3.2 The SNPs of HAQ STING are geographically distinct ...... 36
3.3 The 78Q SNP prevents cleavage of human STING ...... 38
3.4 DENV2 cleaves STING and HAQ STING, but not 78Q STING ...... 39
3.5 Most flaviviruses cleave STING and HAQ STING, but not 78Q STING ...... 40
3.6 Rodent Sting1 is under positive selection, but only in the Hystricomorpha clade. . . 43
3.7 Transfecting STING induces interferon production ...... 45
3.8 Transfecting STING and DENV2 protease dampens interferon production...... 46
3.9 STING SNPs and protease antagonism...... 48
3.10 Non-cleavable STING reduces DENV2 replication in A549s...... 50 xi
3.11 STING alleles and flavivirus protease antagonism...... 51
3.12 STING alleles and flavivirus protease antagonism, part 2...... 52
4.1 The dengue virus polyprotein is cleaved by the dengue virus protease...... 60
4.2 Schematic of the machine learning protocol...... 62
4.3 DGAT2 motif identified and the structure of DGAT2...... 64
4.4 DGAT2iscleavedbythedenguevirusprotease...... 66
4.5 DGAT2 cleavage product not significantly degraded by the proteasome ...... 67
4.6 Wild-type A549 cells complemented with mutant DGAT2 did not inhibit dengue
virusreplication...... 69
4.7 Endogenous DGAT2 is significantly reduced after transfection of DGAT2-targeting
siRNA...... 70
4.8 Non-cleavable DGAT2 reduces DENV2 replication...... 72
4.9 The presence of non-cleavable DGAT2 inhibits dengue replication in DGAT2 knock-
outcells...... 75
4.10 DGAT2 inhibition in DGAT2 mutant cell lines did not result in significantly higher
denguevirusreplication...... 76
4.11 DGAT2 cleaved by all tested flavivirus proteases ...... 78
4.12 Proposed model for flavivirus cleavage of DGAT2 ...... 80
4.13 Example screening of predicted host proteins by western blotting...... 83
4.14 The vast majority of predicted proteins were not cleaved by the dengue virus protease. 84
4.15 Quantifying western blots led to misleading results...... 86
4.16 Mass spectrometry schematic and subcellular fractionation of Huh7 cells transfected
withdenguevirusprotease...... 88
4.17 SLC25A6 is not cleaved by the dengue virus protease...... 90
4.18 Accuracy of machine learning algorithm trained on dengue motif training data de-
creaseswithphylogeneticdistance...... 91 Chapter 1
Introduction
Viruses and their hosts have been locked in evolutionary combat for millennia. Throughout the course of my thesis research I studied the evolutionary signatures in primate immune proteins that may have been caused by viruses over the evolution of primates, and two specific interactions between a family of viruses (flaviviruses) and host proteins (STING and DGAT2). In this intro- duction I will give an overview of the questions that drove my PhD research and describe briefly the work I completed.
1.1 Viruses and hosts in evolutionary combat
Animals and viruses have been locked in evolutionary conflict for as long as animals have been evolving. Viral ”fossils”—elements of viral genomes integrated into host genomes—tell us that viruses are an ancient and persistent threat [73, 41, 20]. If a virus is capable of killing the host, it is a source of intense natural selection on the host population.
Natural selection is a theory proposed by Charles Darwin in the 19th century [16]. This theory proposes that organisms better adapted to their environment tend to survive and produce more o↵spring. In the context of viruses, the theory of natural selection suggests that a virus that is more suited to infecting and replicating in host cells will produce more progeny than viruses that are less suited to infecting host cells. Likewise, animals that are less susceptible to infection are less likely to be infected by a virus, less likely to experience the negative health impacts of virus infection, and therefore more likely to survive and produce o↵spring. Therefore viruses and their 2 hosts are in a slow battle of evolving selective traits through evolutionary time, a battle that often plays out on the frontlines: proteins [65].
As a classic example of where and how this battle might play out, picture a virus that needs to enter a host cell by binding a host receptor protein. A virus that can bind and enter may be able to create more infectious progeny, and therefore the genes that encode that binding protein will be passed on. But this places the host under selective pressure: if a mutation in a host appears that changes the receptor such that the virus can no longer bind, that host individual will resist infection and will be more fit in its environment. The gene that encodes the receptor in that individual is more likely to be passed on to future generations. Then the virus will be under selective pressure, and on and on ad infinitum [65]. This back-and-forth evolutionary pressure sometimes results in a flip-flopping of amino acids between a few states, a phenotype that gave rise to the ”Red
Queen Hypothesis” [113]. In Lewis Carroll’s Through the Looking Glass, the Red Queen explains to Alice that in her world: “it takes all the running you can do, to keep in the same place” [11].
Similarly, two proteins locked in an evolutionary back-and-forth keep ”running in place” between the same two states (Figure 1.1). In one form of Red Queen Dynamics, the direct opposition of these proteins may result in successive fixation of advantageous mutations, which represents an ”arms race” evolutionary signature. An evolutionary ”arms race” sometimes results in progressively more extreme phenotypes in the two opposing proteins, rather than the same di↵erence again and again
[83].
The type of evolution described above—selection for change, as compared to selection that maintains a sequence as the status quo—is sometimes called ”positive selection” and is characterized by a higher rate of nonsynonymous mutations (dN ) than synonymous (dS) mutations. In other words, if dN /dS is greater than 1, then nonsynonymous mutations may have been selected for in this gene. This mode of evolution is in contrast to a situation in which a gene’s sequence is under evolutionary pressure to stay constant. In this case, nonsynonymous mutations would be selected against, and dN /dS would be less than 1. Most genes in primate genomes are under this kind of selection: purifying selection [65]. Therefore, genes that are under positive selection interesting 3
Figure 1.1: Adaptation via natural selection, or positive selection, is identifiable due to the changes a↵ecting the amino acid sequences of proteins In this cartoon example, a protein receptor and a viral spike protein alternately evolve mutations that break or re-create the sequence that allows their binding.
virus mutates
virus
host
host mutates 4 targets of study, because this situation leads to questions such as: do these genes encode proteins that interact directly with pathogens? Are the regions of the genes that are evolving rapidly the sites of contact?
1.2 Positive selection in the interferon pathway
Immunity proteins are often evolving under positive selection, being an essential site of inter- action between virus and host proteins and therefore heavily influenced by the selection pressure exerted by viruses [20]. Therefore, we might expect to see an increased amount of proteins under positive selection is in the interferon pathway. The interferon response is a component of the innate immune system (a component of the immune system that is always present in cells, and does not have to ”adapt” to new pathogens) and plays an important role in defending human cells against viruses [23]. Because viruses replicate within cells of the host, their nucleic acids and proteins are exposed, to varying degrees, to the cellular environment. To exploit this vulnerability, hosts have evolved numerous intracellular sensors that recognize viral nucleic acids and proteins [23].
When cellular sensors detect one of these virus-specific structures, a signaling cascade is activated which ultimately leads to the production and secretion of one or more of several possible interferon proteins [63, 61]. Interferons then produce transcriptional changes in the infected cell, inducing expression of hundreds of host genes (called “interferon-stimulated genes,” or ISGs) that collec- tively act to limit viral replication[84]. Almost any aspect of the viral life-cycle can be targeted by interferon stimulated proteins [23, 84]. The resulting interferon-stimulated proteins act with a diversity of mechanisms to halt viral replication. Interferons do not just a↵ect these changes in the infected cell, they also signal to neighboring (even uninfected) cells and induce the same transcriptional changes in those cells [119]. In solid tissues, this signalling produces a “firewall’” of protected cells around the infected ones, making cell-to-cell spread of the virus di cult.
Viruses are known to target proteins that are both up- and downstream of the production of interferon molecules themselves [63]. Viruses sometimes inhibit interferon altogether by neutralizing the sensors and signaling pathways that lead to interferon production, while other times viral 5 antagonists are directed at specific down-stream e↵ects produced by interferon-stimulated genes.
When considering the many proteins involved in this pathway, I initially hypothesized that the genes responsible for inducing the production of interferon would be antagonized more often by viruses than genes that are turned on as a result of interferon production. Therefore I expected to
find a greater proportion of genes under positive selection in the upstream pathways of interferon and ISG induction. Counter to this hypothesis, I found the interferon-stimulated genes, and not interferon-pathway induction genes, are evolving significantly more rapidly than a random set of genes. These results are more fully described in Chapter 2.
1.3 STING: an immune protein that is a target of flavivirus proteases
Stimulator of interferon genes (STING) is an example protein in the interferon response that is under positive selection. STING is part of the well-categorized DNA-sensing pathway of cells.
Double-stranded DNA is sensed by the protein cyclic-GMP-AMP synthase (cGAS), which catalyzes the formation of cyclic 2’3’-GMP-AMP (2’3’-cGAMP). 2’3’-cGAMP binds directly to STING, the activation of which ultimately results in increased interferon production [109].
STING has been shown to be under positive selection in primates, and I and others have independently confirmed this finding using updated datasets with a greater number of primate sequences [69]. STING is also targeted by virus proteins. For example, many flaviviruses encode proteins that target STING for degradation or inactivation [127, 18, 99, 71, 17, 1]. I showed that
STING is cleaved during dengue virus infection by the dengue virus protease. Moreover, I showed that multiple flavivirus proteases can cleave human STING, and that when amino acid position 78 of STING is altered, these proteases cannot cleave STING.
There is a minor allele of human STING in which the cleavage motif is mutated [43]. There is also a major allele of STING that does not have a mutated cleavage motif, but that acts significantly di↵erently in how it stimulates the interferon response [39]. This allele, the HAQ allele, does not induce interferon as robustly as the other two alleles. I was interested in how these three di↵erent alleles are targeted by the dengue virus protease, and how infection of dengue virus is a↵ected by 6 the expression of these alleles. This work is described more in depth in Chapter 3.
1.4 Using machine learning to predict new targets of flavivirus proteases
The fact that the dengue virus protease can cleave STING, as well as other host proteins, leads to an interesting experimental challenge. The question is: what other human or host proteins can the dengue virus protease target and cleave? And further, how do we identify those proteins?
The motifs recognized by the dengue virus protease are diverse, making it di cult to identify pos- sible cleavage motifs by eye. High-throughput biological methods such as peptide-library scanning vary in their specificity and accuracy and can be costly or time consuming [50]. High-throughput computational methods can be influenced heavily by the most common known motifs, and do not necessarily identify the great diversity of motifs that could possibly be cleaved by the protease [98].
A previous graduate student in the Sawyer Lab, Alexander Stabell, attempted to design a computational screening approach based on machine learning [100]. In this method, an algorithm is trained on motifs that are known to be cleaved by the dengue protease. The hope is it can then recognize motifs that may be cleaved by the protease in the human proteome. He initially curated training data from available virus genomes and ran several iterations of this machine learning algorithm, changing parameters such as which features of the amino acids were considered, length of motifs, and the inclusion of the STING motif. One of the consistently predicted proteins from these initial screens was diacylglycerol O-acyltransferase 2 (DGAT2), a protein which we demonstrated was indeed cleaved by the protease in co-transfection experiments. I further screened approximately
100 proteins generated by these initial screens for cleavage by the dengue virus protease, none of which were cleaved.
In a later formalized iteration of the computational component of this project, I curated motifs from over 3000 dengue virus genomes (serotypes 1-4) and worked with postdoctoral researcher
Jacob Stanley to recreate this machine learning algorithm with more formal mathematical and biological requirements [101]. This algorithm generated a di↵erent list of human proteins predicted to be cleaved by the dengue virus protease. However, after testing many of the proteins predicted 7 by machine learning from the various iterations of the computational screen, DGAT2 is the only validated positive result. Future work can be done to systematically clone and further test the output of this algorithm in cleavage assays. For my thesis, I focussed on further understanding the cleavage of DGAT2 by flavivirus proteases. However, I do write briefly about some further screening techniques I explored, as well as future possibilities for screening of these predicted targets of the dengue virus protease.
1.5 DGAT2: a novel target of flavivirus proteases
DGAT2 is a diacylglycerol O-acyltransferase that catalyzes the terminal and only committed step in triacylglycerol synthesis by using diacylglycerol and fatty acyl CoA as substrates (Figure
1.2)[12]. It is an endoplasmic reticulum-resident protein that is encoded by the DGAT2 gene on chromosome 11[12]. By machine learning, we identified a possible cleavage site at amino acid position 123, and went on to show that DGAT2 is indeed cleaved at this position.
The cleavage of DGAT2 had immediately interesting implications for dengue virus replica- tion, as the process by which flaviviruses replicate in host cells is intimately connected to host lipid membranes. Lipid homeostasis under normal cellular conditions maintains a pool of lipids (includ- ing diacylgycerols) that can be converted to phospholipids for membrane synthesis, while storing some lipid content as triacylglycerols for later energy use[14]. This conversion of diacylglycerols to triacylglycerols is partly accomplished by DGAT2[12]. By cleaving DGAT2, the cellular home- ostasis would be shifted towards phospholipid synthesis and membrane synthesis. This hypothesis tracks with the observation that local surface area of lipid membranes is increased during dengue infection[118]. The cleavage of DGAT2 may be a part of the process that creates this improved environment for dengue replication.
To test how the ability to cleave DGAT2 impacts dengue replication, I created cell lines ex- pressing either wild-type (cleavable) DGAT2 or mutant (non-cleavable) DGAT2 and showed that the presence of non-cleavable DGAT2 inhibited dengue virus genomic RNA and viral progeny pro- duction. I also showed that DGAT2 is cleaved by the proteases of multiple flaviviruses, suggesting 8
Figure 1.2: A) DGAT1 and DGAT2 convert diacylglycerols to triacylglycerols in the endoplasmic reticulum. Triacylglycerols are stored in lipid droplets as part of the energy storing mechanism of animal cells. B) In the conversion of diacylglycerols to triacylglycerols, fatty-acyl CoA is used as a co-factor.
A.
DGAT1 DGAT2
Phospholipid Fatty Acyl CoA Diacylglycerol Triacylglycerol
B. Fatty-acyl CoA
O
VVVVVVV V C O CoA O O CH C CH C 2 VVVVVVVV 2 VVVVVVVV O O CH C CH C 2 VVVVVVVV VVVVVVVV DGAT2 O CH OH CH C 2 2 VVVVVVVV
1,2-Diacylglycerol Triacylglycerol 9 that the cleavage of DGAT2 is a conserved mechanism by which flaviviruses replicate. I describe the work I completed examining this hypothesis in Chapter 4.
1.6 Thesis organization
Though I hope this introduction provides an adequate road map for how this thesis is orga- nized, and for a retroactive discussion of how my dissertation projects progressed, the remainder of this thesis is organized such that each chapter can be read as a standalone story, with methods included. Therefore some of the information above is recapped and expanded in later chapters. Chapter 2
Positive selection in the interferon pathway
2.1 Positive natural selection
Natural selection is the Darwinian theory that advantageous traits in a population will be selected for[16]. Mutations occur randomly due to DNA damage or faulty replication by host polymerases, but fixation or loss of those mutations depends on selection on a population level[2].
A house-keeping gene (i.e. a gene for which it is advantageous that the function stay consistent) is more likely to be conserved over long evolutionary periods, as mutations are selected against[130].
A small percentage of genes have more flexibility—mutations that arise in these genes may become
fixed in the population because they impose a selectable advantage.
For protein coding genes, studying how the protein sequence has changed over the course of evolutionary time can reveal the signatures of selection. One useful measure of a changing sequence over time requires knowing the DNA sequence and the protein sequence. With this information it is possible to identify where a fixed mutation is a synonymous (amino acid conserving) or nonsynonymous (amino acid changing) mutation. With multiple sequences of the same gene from di↵erent animals, it is possible to approximate rates of synonymous and nonsynonymous
fixation events over the period of evolutionary time those animals represent[124]. This calculation is accomplished by calculating the number of synonymous changes and normalizing to the number of possible synonymous changes (dS) and similarly, calculating the number of nonsynonymous changes normalized to the number of possible nonsynonymous changes (dN ). The normalization to possible changes is necessary because of the degenerate nature of the codon table: because nonsynonymous 11 mutations occur more often than synonymous mutations by random chance, computational models have been developed that use statistical frameworks to account for these unequal substitution rates.
A gene that is highly conserved among many animals, i.e. a gene that has a higher dS than dN, has most likely experienced selection against nonsynonymous mutations. Housekeeping genes generally fall into this category of genes that are under ”purifying selection”. In contrast, a gene that has a significantly higher dN than dS has probably experienced some selection to diversify. Positive natural selection, herein shortened to simply ”positive selection”, refers to this latter scenario. The other explanation for a high dN /dS ratio is that there is no selection either way, and the gene is evolving ”neutrally.” Neutral evolution is usually only detected in non-coding genes, but it must be disproved before we can claim with confidence that a gene is under positive selection.
Genes that are evolving under positive selection can give us insight into the selection pressures placed on animals in the past. These rapidly-evolving genes are often evolving in direct opposition to pathogens, which can evolve in turn to evade host adaptations. This antagonism places pressure anew on the host, resulting in the ”tit-for-tat” evolution that leads to the signatures of positive selection.
2.2 Positive selection in immune pathways
The genes that encode immune proteins are often under selective pressure imposed by patho- gens, resulting in the signatures of positive selection[20]. Immune proteins are more likely to be interacting directly with pathogenic elements, and are also less likely to be housekeeping proteins, and therefore more able to tolerate nonsynonymous mutations. Examples of immune proteins under positive selection in primates include pattern recognition sensors and their downstream signalling molecules (e.g. MAVS, TRIM5, RNASEL), cytokines and cytokine receptors (e.g. IL3, CXCR2,
CASP1), marker molecules (e.g. CD4, CD5, HLA-DPA1), complement proteins (e.g. C5, C8B,
C9), and antimicrobial function proteins (e.g. TF, LTF)[114]. Primates have likely experienced pressure in the form of pathogenic antagonism that selects for nonsynonymous mutations in the genes encoding these proteins. As a result, the proteins rapidly change, allowing them to evade or 12 bind even as viruses evolve the ability to attack or evade in turn.
2.3 The interferon response
One part of the human innate immune system, the interferon response, plays an important role in defending human cells against viruses[23]. Hosts have evolved numerous intracellular sensors that recognize viral nucleic acids and proteins[23]. When cellular sensors detect one of these virus- specific structures, a signaling cascade is activated which ultimately leads to the production and secretion of one or more of several possible interferon proteins [63, 61].
There are three types of interferon classes: types I, II, and III. The three types display distinct expression patterns and have many roles in innate and adaptive immunity[80]. In humans, the primary anti-viral interferon proteins, and the ones generally produced after infection by viruses, fall under Type I. This type includes proteins encoded by 13 IFN-↵ genes, and single genes for IFN-