Unique Features Concerning the Proximal Origin of SARS-Cov-2

Total Page:16

File Type:pdf, Size:1020Kb

Unique Features Concerning the Proximal Origin of SARS-Cov-2 Unique features concerning the proximal origin of SARS-CoV-2 Murat Seyran1,2, Damiano Pizzol3, Parise Adadi4, Tarek Mohamed Abd El-Aziz5,6, Sk. Sarif Hassan7, Antonio Soares5, Ramesh Kandimalla8,9, Kenneth Lundstrom10, Murtaza Tambuwala11, Alaa A. A. Aljabali 12, Amos Lal13, Gajendra Kumar Azad14, Pabitra Pal Choudhury15, Vladimir N. Uversky16, Samendra P. Sherchan17, Bruce D. Uhal18, Nima Rezaei19,20,*, Adam M. Brufsky21,* 1Doctoral studies in natural and technical sciences (SPL 44), University of Vienna, Austria. 2Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Austria. [email protected], [email protected] 3Italian Agency for Development Cooperation - Khartoum, Sudan Street 33, Al Amarat, Sudan. [email protected] 4Department of Food Science, University of Otago, Dunedin 9054, New Zealand. [email protected], [email protected] 5Department of Cellular and Integrative Physiology, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Dr, San Antonio, TX 78229-3900, USA. [email protected], [email protected] 6Zoology Department, Faculty of Science, Minia University, El-Minia 61519, Egypt. 7Department of Mathematics, Pingla Thana Mahavidyalaya, Maligram, Paschim Medinipur, 721140, West Bengal, India. [email protected] 8CSIR-Indian Institute of Chemical Technology Uppal Road, Tarnaka, Hyderabad-500007, Telangana State, India. 9Kakatiya Medical College/MGM-Hospital, DME/TSPSC, Hyderabad, Warangal, Telangana State- 506007, India. [email protected], [email protected] 10PanTherapeutics, Rte de Lavaux 49, CH1095 Lutry, Switzerland. [email protected] 11School of Pharmacy and Pharmaceutical Science, Ulster University, Coleraine BT52 1SA, Northern Ireland, UK. [email protected] 12Department of Pharmaceutical Sciences, Yarmouk University—Faculty of Pharmacy, Irbid 566, Jordan 1 13Division of Pulmonary and Critical Care Medicine, Mayo Clinic, Rochester, Minnesota, USA. [email protected] 14Department of Zoology, Patna University, Patna-800005, Bihar, India. [email protected] 15Applied Statistics Unit Indian Statistical Institute, 203 B T Road, Kolkata 700108, West Bengal, India. [email protected] 16Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612, USA. [email protected] 17Department of Environmental Health Sciences, Tulane University, New Orleans, LA, 70112, USA. [email protected] 18Department of Physiology, Michigan State University, East Lansing, MI 48824, USA. [email protected] 19Center for Immunodeficiencies, Pediatrics Center of Excellence, Children’s Medical Center, Tehran University of Medical Sciences, Tehran, Iran. 20Network of Immunity in Infection, Malignancy and Autoimmunity (NIIMA), Universal Scientific Education and Research Network (USERN), Tehran, Iran. [email protected], [email protected] 21University of Pittsburgh School of Medicine, Department of Medicine, Division of Hematology/Oncology, UPMC Hillman Cancer Center, Pittsburgh, PA, USA. [email protected] *Corresponding author Abstract There is a consensus that SARS-CoV-2 originated naturally from BatCoVs. Although based on phylogenetic analysis SARS-CoV-2 seems to be related to BatCoVs RaTG13 or RmYN02, its exact origin of lineage is not clear. In fact, the flat and non-sunken surface of the sialic acid-binding domain of SARS-CoV-2 spike protein conflicts with the general adaptation and survival pattern observed for all other coronaviruses (CoVs). Unlike RaTG13, for SARS-CoV-2 a recombination apparently occured between the S1/S2 domains of spike protein that enabled furin protease utilization. Interestingly, the clinical strains of SARS-CoV-2 do not have any further recombination 2 that would be in conflict with the recombination models of CoVs. Despite millions of SARS-CoV-2 infections, the spike protein receptor-binding domain (RBD) has not accumulated a high-frequency substitution, differentiating SARS-CoV-2 from other CoVs that have positive selection sites on their RBDs, calling into question the sources of the origin of the apparently optimized angiotensin- converting enzyme 2 (ACE2) binding of SARS-CoV-2. The SARS-CoV-2 host tropism pattern has these major discrepancies compared to other CoVs, raising questions concerning the proximal origin of SARS-CoV-2. Keywords: Severe Acute Respiratory Syndrome Coronavirus 2, Canyon Hypothesis, Template- switching model, Receptor Binding Domain, Positive Selection Site Human Pathogenic CoVs and SARS-CoV-2 The possible natural origin of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) from Bat CoV RaTG13 was explained by Andersen and colleagues 1. However, another Bat CoV (RmYN02) was recently claimed to be more genetically related to the SARS-CoV-2 than RaTG13, based on the whole genome phylogenetic analysis 1,4. Therefore, there is no genomic consensus on the exact lineage origin of SARS-CoV-2 1,4. SARS-CoV-2 is the seventh CoV capable of infecting humans, but the first and only human coronavirus (HCoV) with pandemic potential 5. During the host tropism, bat or rodent CoVs have demonstrated certain changes in their S protein receptor- binding domain (RBD) and glycan-binding N-terminal domain (NTD) 3,6. SARS-CoV-2, unlike other CoVs, does not have those signature changes. SARS-CoV-2 NTD composition conflicting with the ″Canyon Hypothesis″ The formation of canyons, depression zones, or cavities on the surfaces of the influenza virus, human rhinovirus, and Meningo virus is explained by the "Canyon Hypothesis" suggesting that the host cell receptor attachment site on a viral surface is hidden from the immune surveillance in deep surface depressions or "canyons" 7. In CoVs (except SARS-CoV-2), the S protein NTD glycan- binding domain has several sugar-binding modes, with a common feature being the hidden localization of the corresponding binding sites in cavities to limit their access to antibodies and immune cells 6. This pattern of CoVs was considered as an evolutionary measure to limit recognition of these active sites by host immune system 3. 3 The genomic comparative analysis of six HCoVs with their parent bat or rodent CoVs indicated the presence of several deletions, inserts, and recombination in their S protein NTD to evade from the host glycan-binding immune receptors, being compatible with the "Canyon Hypothesis" 3,6,8. HCoV- 229E S protein, being compared to its parent virus bat CoV, has a 187 amino acid deletion within the NTD that aims to provide means to evade the host immune C-type lectin receptors (CLRs) detecting glycan-binding viruses 8. Similarly, in the MERS-CoV, sialic acid binding site of the S Protein NTD is located 280 Å2 beneath the protein surface 9. However, the flat pattern of the SARS-CoV-2 S protein NTD is conflicting with the evolutionary host tropism strategy, not only regarding the HcoVs, but also many other human viral pathogens 2,3,6,7. SARS-CoV-2 recombination pattern is not compatible with the Template Switching (Copy- Choice) mechanism The high rate of RNA recombination in CoVs can be explained by a template-switching (copy- choice) mechanism 10. CoV RNA show discontinuous and non-progressive replication in its host cells, with replication pause at specific RNA sites creating blocks of free RNA that could fuse into another set of CoV replication, leading to the formation of different recombinant CoV strains 10. These pauses in RNA replication of CoVs could allow recombination as a result of mixed infection with other CoVs, other viruses, or even the host RNA sequences. As an example, mouse hepatitis coronavirus S protein NTD sialic acid binding domain is considered to have originated from recombination human galectin RNA sequences 14. HCoV-HKU1, when compared to the genetically related HCoV-OC43, shows a high rate of recombination in specific ORFs; e.g., P65 to NSP10, suggesting the presence of several RNA replications stops in these domains 11. Recombination can also occur with deletions in the CoVs RNA. Both HCoV-229E and the genetically related Alpaca CoV show a range from 185 to 404 nucleotide deletions in the S1 region compared to genetically related BatCoV 12. Strikingly, ORF8 was deleted in HCoV-229E compared to related alpaca and parent bat viruses, suggesting that the alpaca might serve as the first inter-host of the bat virus 12. The SARS-CoV-2 S Protein S1/S2 site corresponding to the furin recognition motif does not exist in other ″lineage B″ beta-coronavirus; e.g., Pangolin-CoV or RaTG13, suggesting that S protein S1/ S2 is not a recombination hot spot or RNA termination based on the template switching (copy- choice) model 1,10. Additionally, clinical isolates of SARS-CoV-2 S protein have not indicated any 4 recombination in this area, suggesting that the S1/S2 site furin recognition motif insertion represents a one-time event of special recombination. RmYN02, which is genetically the closest related virus to SARS-CoV-2, was proposed to have an insert like the SARS-CoV-2 S1/S2 S protein 4. However, based on the RBD, RmYN02 is phylogenetically not related to SARS-CoV-2, and the insert between the S1 and S2 domains is not the PRRA found in SARS-CoV-2, but P(deletion)AA, and the function of this insertion in RmYN02 as a furin cleavage site is not clear 4. In addtition, another genetically closest related SARS-CoV-2 virus, RmYN01, does not have that insert in the S1/S2 region 4. The S Protein of RmYN02 also has many deletions (e.g., residues 473 to 480 and 485 to 490) that do not exist in the genetically closest related SARS-CoV-2 4. SARS-CoV-2 RBD is not high frequency positive selection site The CoVs genomes have hot spots with a high frequency of amino acid substitutions, also called positive selection sites that favor host tropism, antibody resistance, or immune evasion 5. For example, analysis of the S protein RBD in HCoV-229E strains revealed the presence of 36 amino acid substitutions suggesting the positive selection site of RBD and highlighting the viral attempt of low-pathogenicity for adaptation 13.
Recommended publications
  • Multiple Recombination Events and Strong Purifying Selection at the Origin of SARS-Cov-2 Spike Glycoprotein Increased Correlated Dynamic Movements
    International Journal of Molecular Sciences Article Multiple Recombination Events and Strong Purifying Selection at the Origin of SARS-CoV-2 Spike Glycoprotein Increased Correlated Dynamic Movements Massimiliano S. Tagliamonte 1,2,† , Nabil Abid 3,4,†, Stefano Borocci 5,6 , Elisa Sangiovanni 5, David A. Ostrov 2 , Sergei L. Kosakovsky Pond 7, Marco Salemi 1,2,*, Giovanni Chillemi 5,8,*,‡ and Carla Mavian 1,2,*,‡ 1 Emerging Pathogen Institute, University of Florida, Gainesville, FL 32608, USA; mstagliamonte@ufl.edu 2 Department of Pathology, Immunology and Laboratory Medicine, University of Florida, Gainesville, FL 32610, USA; [email protected]fl.edu 3 Laboratory of Transmissible Diseases and Biological Active Substances LR99ES27, Faculty of Pharmacy, University of Monastir, Rue Ibn Sina, 5000 Monastir, Tunisia; [email protected] 4 Department of Biotechnology, High Institute of Biotechnology of Sidi Thabet, University of Manouba, BP-66, 2020 Ariana-Tunis, Tunisia 5 Department for Innovation in Biological, Agro-food and Forest Systems (DIBAF), University of Tuscia, via S. Camillo de Lellis s.n.c., 01100 Viterbo, Italy; [email protected] (S.B.); [email protected] (E.S.) 6 Institute for Biological Systems, National Research Council, Via Salaria, Km 29.500, 00015 Monterotondo, Rome, Italy 7 Department of Biology, Temple University, Philadelphia, PA 19122, USA; [email protected] 8 Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Giovanni Amendola, 122/O, 70126 Bari, Italy * Correspondence: [email protected]fl.edu (M.S.); [email protected] (G.C.); cmavian@ufl.edu (C.M.) † Authors contributed equally as first to this work. ‡ Authors contributed equally as last to this work.
    [Show full text]
  • Inverted Repeats in Coronavirus SARS-Cov-2 Genome and Implications in Evolution
    Inverted repeats in coronavirus SARS-CoV-2 genome and implications in evolution Changchuan Yin ID ∗ Department of Mathematics, Statistics, and Computer Science University of Illinois at Chicago Chicago, IL 60607 USA Stephen S.-T. Yau ID y Department of Mathematical Sciences Tsinghua University Beijing 100084 China Abstract The coronavirus disease (COVID-19) pandemic, caused by the coronavirus SARS- CoV-2, has caused 60 millions of infections and 1.38 millions of fatalities. Genomic analysis of SARS-CoV-2 can provide insights on drug design and vaccine develop- ment for controlling the pandemic. Inverted repeats in a genome greatly impact the stability of the genome structure and regulate gene expression. Inverted repeats involve cellular evolution and genetic diversity, genome arrangements, and diseases. Here, we investigate the inverted repeats in the coronavirus SARS-CoV-2 genome. We found that SARS-CoV-2 genome has an abundance of inverted repeats. The inverted repeats are mainly located in the gene of the Spike protein. This result suggests the Spike protein gene undergoes recombination events, therefore, is essential for fast evolution. Comparison of the inverted repeat signatures in human and bat coronaviruses suggest that SARS-CoV-2 is mostly related SARS-related coronavirus, SARSr-CoV/RaTG13. The study also reveals that the recent SARS- related coronavirus, SARSr-CoV/RmYN02, has a high amount of inverted repeats in the spike protein gene. Besides, this study demonstrates that the inverted repeat distribution in a genome can be considered as the genomic signature. This study highlights the significance of inverted repeats in the evolution of SARS-CoV-2 and presents the inverted repeats as the genomic signature in genome analysis.
    [Show full text]
  • Exploring the Natural Origins of SARS-Cov-2
    bioRxiv preprint doi: https://doi.org/10.1101/2021.01.22.427830; this version posted January 22, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Exploring the natural origins of SARS-CoV-2 1 1 2 3 1,* Spyros Lytras ,​ Joseph Hughes ,​ Wei Xia ,​ Xiaowei Jiang ,​ David L Robertson ​ ​ ​ ​ ​ 1 MRC-University​ of Glasgow Centre for Virus Research (CVR), Glasgow, UK. 2 National​ School of Agricultural Institution and Development, South China Agricultural University, Guangzhou, China. 3 Department​ of Biological Sciences, Xi'an Jiaotong-Liverpool University (XJTLU), Suzhou, China. * Correspondence:​ [email protected] Summary. The lack of an identifiable intermediate host species for the proximal animal ​ ancestor of SARS-CoV-2 and the distance (~1500 km) from Wuhan to Yunnan province, where the closest evolutionary related coronaviruses circulating in horseshoe bats have been identified, is fueling speculation on the natural origins of SARS-CoV-2. Here we analyse SARS-CoV-2’s related horseshoe bat and pangolin Sarbecoviruses and confirm ​ Rhinolophus affinis continues to be the likely reservoir species as its host range extends ​ across Central and Southern China. This would explain the bat Sarbecovirus recombinants ​ in the West and East China, trafficked pangolin infections and bat Sarbecovirus recombinants linked to Southern China. Recent ecological disturbances as a result of changes in meat consumption could then explain SARS-CoV-2 transmission to humans through direct or indirect contact with the reservoir wildlife, and subsequent emergence towards Hubei in Central China.
    [Show full text]
  • WHO-Convened Global Study of Origins of SARS-Cov-2: China Part
    WHO-convened Global Study of Origins of SARS-CoV-2: China Part Joint WHO-China Study 14 January-10 February 2021 Joint Report 1 LIST OF ABBREVIATIONS AND ACRONYMS ARI acute respiratory illness cDNA complementary DNA China CDC Chinese Center for Disease Control and Prevention CNCB China National Center for Bioinformation CoV coronavirus Ct values cycle threshold values DDBJ DNA Database of Japan EMBL-EBI European Molecular Biology Laboratory and European Bioinformatics Institute FAO Food and Agriculture Organization of the United Nations GISAID Global Initiative on Sharing Avian Influenza Database GOARN Global Outbreak Alert and Response Network Hong Kong SAR Hong Kong Special Administrative Region Huanan market Huanan Seafood Wholesale Market IHR International Health Regulations (2005) ILI influenza-like illness INSD International Nucleotide Sequence Database MERS Middle East respiratory syndrome MRCA most recent common ancestor NAT nucleic acid testing NCBI National Center for Biotechnology Information NMDC National Microbiology Data Center NNDRS National Notifiable Disease Reporting System OIE World Organisation for Animal Health (Office international des Epizooties) PCR polymerase chain reaction PHEIC public health emergency of international concern RT-PCR real-time polymerase chain reaction SARI severe acute respiratory illness SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2 SARSr-CoV-2 Severe acute respiratory syndrome coronavirus 2-related virus tMRCA time to most recent common ancestor WHO World Health Organization WIV Wuhan Institute of Virology 2 Acknowledgements WHO gratefully acknowledges the work of the joint team, including Chinese and international scientists and WHO experts who worked on the technical sections of this report, and those who worked on studies to prepare data and information for the joint mission.
    [Show full text]
  • Science Journals
    RESEARCH CORONAVIRUS ticles showed enhanced heterologous binding and neutralization properties against hu- Mosaic nanoparticles elicit cross-reactive immune man and bat SARS-like betacoronaviruses (sarbecoviruses). responses to zoonotic coronaviruses in mice We used a study of sarbecovirus RBD re- ceptor usage and cell tropism (38) to guide Alexander A. Cohen1, Priyanthi N. P. Gnanapragasam1, Yu E. Lee1, Pauline R. Hoffman1, Susan Ou1, our choice of RBDs for co-display on mosaic Leesa M. Kakutani1, Jennifer R. Keeffe1, Hung-Jen Wu2, Mark Howarth2, Anthony P. West1, particles. From 29 RBDs that were classified Christopher O. Barnes1, Michel C. Nussenzweig3, Pamela J. Bjorkman1* into distinct clades (clades 1, 2, 1/2, and 3) (38), we identified diverse RBDs from SARS- Protection against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and SARS-related emergent CoV, WIV1, and SHC014 (clade 1); SARS-CoV-2 zoonotic coronaviruses is urgently needed. We made homotypic nanoparticles displaying the receptor binding (clade 1/2); Rs4081, Yunnan 2011 (Yun11), and domain (RBD) of SARS-CoV-2 or co-displaying SARS-CoV-2 RBD along with RBDs from animal betacoronaviruses Rf1 (clade 2); and BM-4831 (clade 3). Of these, that represent threats to humans (mosaic nanoparticles with four to eight distinct RBDs). Mice immunized SARS-CoV-2 and SARS-CoV are human coro- with RBD nanoparticles, but not soluble antigen, elicited cross-reactive binding and neutralization responses. naviruses and the rest are bat viruses originat- Mosaic RBD nanoparticles elicited antibodies with superior cross-reactive recognition of heterologous RBDs ing in China or Bulgaria (BM-4831). We also relative to sera from immunizations with homotypic SARS-CoV-2–RBD nanoparticles or COVID-19 convalescent included RBDs from the GX pangolin clade 1/2 human plasmas.
    [Show full text]
  • The SARS-Cov-2 Arginine Dimers
    The SARS-CoV-2 arginine dimers Antonio R. Romeu ( [email protected] ) University Rovira i Virgili Enric Ollé University Rovira i Virgili Short Report Keywords: SARS-CoV-2, Sarbecovirus, Arginine dimer, Arginine codons usage, Bioinformatics Posted Date: August 2nd, 2021 DOI: https://doi.org/10.21203/rs.3.rs-770380/v1 License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License The SARS-CoV-2 arginine dimers Antonio R. Romeu1 and Enric Ollé2 1: Chemist. Professor of Biochemistry and Molecular Biology. University Rovira i Virgili. Tarragona. Spain. Corresponding author. Email: [email protected] 2: Veterinarian, Biochemist. Associate Professor of the Department of Biochemistry and Biotechnology. University Rovira i Virgili. Tarragona. Spain. Email: [email protected] Abstract Arginine is present, even as a dimer, in the viral polybasic furin cleavage sites, including that of SARS-CoV-2 in its protein S, whose acquisition is one of its characteristics that distinguishes it from the rest of the sarbecoviruses. The CGGCGG sequence encodes the SARS-CoV-2 furin site RR dimer. The aim of this work is to report the other SARS-CoV-2 arginine pairs, with particular emphasis in their codon usage. Here we show the presence of RR dimers in the orf1ab related non structural proteins nsp3, nsp4, nsp6, nsp13 and nsp14A2. Also, with a higher proportion in the structural roteinp nucleocapsid. All these RR dimers were strictly conserved in the sarbecovirus strains closest to SARS-CoV-2, and none of them was encoded by the CGGCGG sequence. Key words SARS-CoV-2, Sarbecovirus, Arginine dimer, Arginine codons usage, Bioinformatics Introduction Arginine (R) is a polar and non-hydrophobic amino acid, with a positive charged guanidine group, a physiological pH, linked a 3-hydrocarbon aliphatic chain.
    [Show full text]
  • To Stop the Next Pandemic, We Need to Unravel the Origins of COVID-19 David A
    OPINION OPINION To stop the next pandemic, we need to unravel the origins of COVID-19 David A. Relmana,b,c,d,1 We find ourselves ten months into one of the most to have been collected from bats in 2013 and 2019, catastrophic global health events of our lifetime and, respectively, in Yunnan Province, China (1). COVID-19 disturbingly, we still do not know how it began. What’s was first reported in December 2019 more than 1,000 even more troubling is that despite the critical impor- miles away in Wuhan City, Hubei Province, China. tance of this question, efforts to investigate the origins Beyond these facts, the “origin story” is missing many of the severe acute respiratory syndrome coronavirus key details, including a plausible and suitably detailed 2 (SARS-CoV-2) virus and of the associated disease, recent evolutionary history of the virus, the identity coronavirus disease 2019 (COVID-19), have become and provenance of its most recent ancestors, and sur- mired in politics, poorly supported assumptions and prisingly, the place, time, and mechanism of transmis- assertions, and incomplete information. sion of the first human infection. Even though a SARS-CoV-2 is a betacoronavirus whose apparent definitive answer may not be forthcoming, and even closest relatives, RaTG13 and RmYN02, are reported though an objective analysis requires addressing To avoid or mitigate the dire consequences of this and future pandemics (here, people in PPE bury a victim in Delhi, India in June), unraveling the origins of SARS-CoV-2 and COVID-19 will be essential—even though a definitive answer may be elusive, and an objective analysis means broaching some uncomfortable possibilities.
    [Show full text]
  • The SARS-Cov-2 Genome: Variation, Implication and Application
    26 AUGUST 2020 The SARS-CoV-2 genome: variation, implication and application This rapid review describes the severe acute respiratory syndrome coronavirus-2 (SARS- CoV-2) genome, its relationship to other coronaviruses, the variation that has occurred since SARS-CoV-2 emerged in Wuhan in late 2019, the implications of these changes and how knowledge of these changes may be utilised. This pre-print from the Royal Society is provided to assist in the understanding of COVID-19. Executive summary • SARS-CoV-2 emerged in late 2019 in Wuhan, China. The • Whole genome sequencing is a valuable addition to test, genome of many separate virus isolates from early in the track and trace, and is encouraged. For instance it: Wuhan outbreak are very closely related showing the virus – has revealed >1350 separate introductions of SARS- emerged recently in humans. CoV-2 into the UK from mid-February to mid-March • The SARS-CoV-2 genome is sufficiently different to all 2020 arising very largely from Spain, France and known coronaviruses to refute the assertion that the Italy, and not from China. COVID-19 pandemic arose by deliberate or accidental – can be used to follow transmission within specific release of a known virus and make it highly improbable communities such as hospitals, schools or factories, that the virus arose by artificial construction in a laboratory. and combined with epidemiological data enables • SARS-CoV-2 is most closely related to bat coronaviruses routes of transmission to be identified and barriers from China, but even the closest of these viruses are too to transmission implemented.
    [Show full text]
  • Genetic Spectrum and Distinct Evolution Patterns of SARS-Cov-2
    medRxiv preprint doi: https://doi.org/10.1101/2020.06.16.20132902; this version posted August 5, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license . Genetic spectrum and distinct evolution patterns of SARS-CoV-2 Sheng Liu1,2 ¶, Jikui Shen3 ¶, Shuyi Fang4, Kailing Li4, Juli Liu5, Lei Yang5, Chang-Deng Hu6,7, Jun Wan1,2,4,8 * 1. Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, Indiana, USA 2. Collaborative Core for Cancer Bioinformatics (C3B) shared by Indiana University Simon Comprehensive Cancer Center and Purdue University Center for Cancer Research, Indiana USA 3. The Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA 4. Department of BioHealth Informatics, Indiana University School of Informatics and Computing, Indiana University – Purdue University Indianapolis, Indianapolis, Indiana, USA 5. Department of Pediatrics, Indiana University School of Medicine, Indianapolis, Indiana USA 6. Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana, USA 7. Purdue University Center for Cancer Research, Purdue University, West Lafayette, Indiana USA 8. The Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, Indiana, USA ¶ These authors have equal contributions. * To whom correspondence should be addressed. Tel: 1-317-2786445; Fax: 1-317-278-9217; Email: [email protected] Keywords SARS-CoV-2, single nucleotide variant (SNV), clustering, co-infection, evolution. Abstract Four signature groups of frequently occurred single-nucleotide variants (SNVs) were identified in over twenty-eight thousand high-quality and high-coverage SARS-CoV-2 complete genome sequences, representing different viral strains.
    [Show full text]
  • WHO-Convened Global Study of the Origins of SARS-Cov-2: Terms of References for the China Part
    Terms of references – FINAL DRAFT WHO-convened Global Study of the Origins of SARS-CoV-2: Terms of References for the China Part 31 July 2020 Contents Background ............................................................................................................................................ 2 Approach ................................................................................................................................................ 2 Current knowledge supporting origin tracing work ............................................................................... 3 Phylogenetic work.............................................................................................................................. 3 Animal susceptibility and animal surveys .......................................................................................... 4 Persistence of the virus on food and surfaces ................................................................................... 4 The early cluster in Wuhan ................................................................................................................ 5 Implementation plan ............................................................................................................................. 6 Phase 1: Short term studies ............................................................................................................... 6 a) Descriptive Epidemiological Study ........................................................................................
    [Show full text]
  • Evidence for SARS-Cov-2 Related Coronaviruses Circulating in Bats and Pangolins in Southeast Asia
    ARTICLE https://doi.org/10.1038/s41467-021-21240-1 OPEN Evidence for SARS-CoV-2 related coronaviruses circulating in bats and pangolins in Southeast Asia Supaporn Wacharapluesadee1,10, Chee Wah Tan 2,10, Patarapol Maneeorn3,10, Prateep Duengkae4, Feng Zhu 2, Yutthana Joyjinda1, Thongchai Kaewpom1, Wan Ni Chia2, Weenassarin Ampoot1, Beng Lee Lim2, Kanthita Worachotsueptrakun1, Vivian Chih-Wei Chen 2, Nutthinee Sirichan4, Chanida Ruchisrisarod1, Apaporn Rodpan1, Kirana Noradechanon3, Thanawadee Phaichana3, Niran Jantarat3, Boonchu Thongnumchaima3, Changchun Tu 5,6, Gary Crameri7, Martha M. Stokes8, ✉ ✉ Thiravat Hemachudha1,11 & Lin-Fa Wang 2,9,11 1234567890():,; Among the many questions unanswered for the COVID-19 pandemic are the origin of SARS- CoV-2 and the potential role of intermediate animal host(s) in the early animal-to-human transmission. The discovery of RaTG13 bat coronavirus in China suggested a high probability of a bat origin. Here we report molecular and serological evidence of SARS-CoV-2 related coronaviruses (SC2r-CoVs) actively circulating in bats in Southeast Asia. Whole genome sequences were obtained from five independent bats (Rhinolophus acuminatus) in a Thai cave yielding a single isolate (named RacCS203) which is most related to the RmYN02 isolate found in Rhinolophus malayanus in Yunnan, China. SARS-CoV-2 neutralizing antibodies were also detected in bats of the same colony and in a pangolin at a wildlife checkpoint in Southern Thailand. Antisera raised against the receptor binding domain (RBD) of RmYN02 was able to cross-neutralize SARS-CoV-2 despite the fact that the RBD of RacCS203 or RmYN02 failed to bind ACE2. Although the origin of the virus remains unresolved, our study extended the geographic distribution of genetically diverse SC2r-CoVs from Japan and China to Thailand over a 4800-km range.
    [Show full text]
  • Did Circoviruses Intermediate the Recombination Between Bat and Pangolin Coronaviruses, Yielding SARS-Cov-2? Nabil Abid1,2*, Giovanni Chillemi3,4, Ahmed Rebai5
    Did circoviruses intermediate the recombination between bat and pangolin coronaviruses, yielding SARS-CoV-2? Nabil Abid1,2*, Giovanni Chillemi3,4, Ahmed Rebai5. 1Laboratory of Transmissible Diseases and Biological Active Substances LR99ES27, Faculty of Pharmacy, University of Monastir, Rue Ibn Sina, 5000, Monastir, Tunisia. 2High Institute of Biotechnology of Sidi Thabet, Department of Biotechnology, University of Manouba, BP-66, 2020, Ariana-Tunis, Tunisia. [email protected] 3Department for Innovation in Biological, Agro-food and Forest systems, DIBAF, University of Tuscia, via S. Camillo de Lellis s.n.c., 01100 Viterbo, Italy. [email protected] 4Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, IBIOM, CNR, Via Giovanni Amendola, 122/O, 70126, Bari, Italy. 5Laboratory of Molecular and Cellular Screening Processes, Centre of Biotechnology of Sfax, University of Sfax, PoBox 1177, 3018, Sfax, Tunisia. [email protected] *Correspondence to: Dr Nabil Abid. High Institute of Biotechnology of Sidi Thabet, Department of Biotechnology, University of Manouba, BP-66, 2020, Ariana-Tunis, Tunisia. [email protected] Abstract: Since the first reports of a coronavirus (CoV) disease 2019 (COVID-19) caused by severe acute respiratory syndrome virus (SARS-CoV-2) in Wuhan, Hubei province, China, scientists are working around the clock to find sound answers to the issue of its origin. While the number of scientific articles on SARS-CoV-2 is increasing, there are still many gaps as to its origin. All studies failed to find a coronavirus in other animals that is more similar to human SARS-COV-2 than the bat virus, considered to be the primary reservoir.
    [Show full text]