10634–10648 Nucleic Acids Research, 2017, Vol. 45, No. 18 Published online 11 August 2017 doi: 10.1093/nar/gkx704 Genome-scale detection of positive selection in nine primates predicts human-virus evolutionary conflicts Robin van der Lee1,*, Laurens Wiel1,2, Teunis J.P. van Dam1 and Martijn A. Huynen1
1Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands and 2Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
Received May 02, 2017; Revised July 26, 2017; Editorial Decision July 30, 2017; Accepted August 02, 2017
ABSTRACT able at https://github.com/robinvanderlee/positive- selection. Hotspots of rapid genome evolution hold clues about human adaptation. We present a compara- tive analysis of nine whole-genome sequenced pri- INTRODUCTION mates to identify high-confidence targets of posi- Conservation of structure and sequence often indicate bi- tive selection. We find strong statistical evidence ological function. Rapidly evolving sequence features may for positive selection in 331 protein-coding genes however also indicate function, as they may reveal molecu- (3%), pinpointing 934 adaptively evolving codons lar adaptations to new selection pressures. But what drives (0.014%). Our new procedure is stringent and re- these rapid genetic changes during evolution? Can these changes explain the specific phenotypes of species or indi- veals substantial artefacts (20% of initial predic- viduals, such as a differential susceptibility to viruses? tions) that have inflated previous estimates. The fi- Immunity genes contain the strongest signatures of nal 331 positively selected genes (PSG) are strongly rapid evolution due to positive Darwinian selection (1–13). enriched for innate and adaptive immunity, secreted Pathogens continuously invent new ways to evade, coun- and cell membrane proteins (e.g. pattern recognition, teract and suppress the immune response of their hosts, complement, cytokines, immune receptors, MHC, thereby acting as major drivers of the observed adaptive Siglecs). We also find evidence for positive selection evolution of immune systems (14,15). In line with this, the in reproduction and chromosome segregation (e.g. structural interfaces of human proteins directly involved centromere-associated CENPO, CENPT), apolipopro- in virus interactions show accelerated evolution (16). Nu- teins, smell/taste receptors and mitochondrial pro- merous proteins involved in the virus–host interaction have teins. Focusing on the virus–host interaction, we been demonstrated to be in genetic conflict with their inter- acting viral proteins, a phenomenon that has been likened retrieve most evolutionary conflicts known to influ- to a virus–host ‘arms race’ (15). Such studies have generally ence antiviral activity (e.g. TRIM5, MAVS, SAMHD1, focused on a single gene or gene family of interest sequenced tetherin) and predict 70 novel cases through integra- across a large number of species. Evolutionary analyses can tion with virus–human interaction data. Protein struc- then predict which genes and codons may be involved in ture analysis further identifies positive selection in virus interactions. For example, mitochondrial antiviral sig- the interaction interfaces between viruses and their naling protein (MAVS) is a central signaling hub in the cellular receptors (CD4-HIV; CD46-measles, ade- RIG-I-like receptor (RLR) pathway, which recognizes in- noviruses; CD55-picornaviruses). Finally, primate fections of a wide range of viruses from the presence of their PSG consistently show high sequence variation in RNA in the cytosol. Analysis of the MAVS gene in 21 pri- human exomes, suggesting ongoing evolution. Our mates identified several positions that have evolved under curated dataset of positive selection is a rich source recurrent strong positive selection and turned out to be crit- ical for resisting cleavage by Hepatitis C virus (17). Other ex- for studying the genetics underlying human (an- amples of immunity genes showing evolutionary divergence tiviral) phenotypes. Procedures and data are avail- that directly impacts the ability to restrict viral replication include TRIM5␣ (18), PKR (19)andMxA(20).
*To whom correspondence should be addressed. Tel: +1 604 875 2345 (Ext 5273); Email: [email protected] Present addresses: Robin van der Lee, Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, BC Children’s Hospital Research Institute, University of British Columbia, Vancouver, BC, Canada. Teunis J.P. van Dam, Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, The Netherlands.