An Updated Analysis of Variations in SARS-Cov-2 Genome

An Updated Analysis of Variations in SARS-Cov-2 Genome

Turkish Journal of Biology Turk J Biol (2020) 44: 157-167 http://journals.tubitak.gov.tr/biology/ © TÜBİTAK Research Article doi:10.3906/biy-2005-111 An updated analysis of variations in SARS-CoV-2 genome 1,2 3 1, Osman Mutluhan UGUREL , Oguz ATA , Dilek TURGUT-BALIK * 1 Department of Bioengineering, Faculty of Chemical and Metallurgical Engineering, Yıldız Technical University, İstanbul, Turkey 2 Department of Basic Sciences, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey 3 Department of Software Engineering, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey Received: 30.05.2020 Accepted/Published Online: 10.06.2020 Final Version: 21.06.2020 Abstract: A novel pathogen, named SARS-CoV-2, has caused an unprecedented worldwide pandemic in the first half of 2020. As the SARS-CoV-2 genome sequences have become available, one of the important focus of scientists has become tracking variations in the viral genome. In this study, 30366 SARS-CoV-2 isolate genomes were aligned using the software developed by our group (ODOTool) and 11 variations in SARS-CoV-2 genome over 10% of whole isolates were discussed. Results indicated that, frequency rates of these 11 variations change between 3.56%–88.44 % and these rates differ greatly depending on the continents they have been reported. Despite some variations being in low frequency rate in some continents, C14408T and A23403G variations on Nsp12 and S protein, respectively, observed to be the most prominent variations all over the world, in general, and both cause missense mutations. It is also notable that most of isolates carry C14408T and A23403 variations simultaneously and also nearly all isolates carrying the G25563T variation on ORF3a, also carry C14408T and A23403 variations, although their location distributions are not similar. All these data should be considered towards development of vaccine and antiviral treatment strategies as well as tracing diversity of virus in all over the world. Key words: SARS-CoV-2, COVID-19, genomic diversity, codon bias, multiple sequence alignment 1. Introduction species from different genus including mammals, avians COVID-19 (novel COronaVIrus Disease 2019) is caused and reptiles (Gorblenya et al., 2006; Wu et al., 2020; by a novel pathogen that is pursued closely by all over the Gorblenya et al., 2020). Until December 2019, there were world after the worldwide pandemic was declared by WHO 6 coronavirus species known as human pathogen. Four on March 11th. It has become one of the most important of them (229E, OC43, NL63, and HKU1) have caused health problems by causing nearly 5M confirmed cases and common cold and others 2 worldwide outbreaks (SARS over 300K deaths from 213 countries/regions in a couple 2003; MERS 2012) over the last 20 years (Su et al., 2016; of months (WHO, 2020a)1. More than 17,000 papers by the Zhu et al., 2020). Recently, a 7th coronavirus species has query “COVID-19” indexed in PubMed/NCBI and nearly been discovered in December 2019 and first named as 200 of them, as of 29th May 2020, about genome of severe 2019 novel Coronavirus (2019 nCoV) and then as SARS- acute respiratory syndrome-COronaVirus-2 (SARS- CoV-2 (Gorblenya et al., 2020). CoV-2) which is the agent responsible for the disease. 1.2. The outbreaks of CoVs 1.1. Taxonomy of SARS-CoV-2 An outbreak of severe acute respiratory syndrome (SARS) SARS-CoV-2, is an enveloped, +ssRNA virus belonging was reported in November 2002. Despite patients carried to the Coronaviridae family that are classified into 4 symptoms of a viral infection, no pathogen causing major genera: Alphacoronavirus, Betacoronavirus, pneumonia was identified, and in a few months, it was Gammacoronavirus, and Deltacoronavirus by revealed that a novel CoV had caused this syndrome (Peiris phylogenetic studies and classified in the Betacoronavirus et al., 2003). The filiation studies about SARS showed that genus, Sarbecovirus and grouped as SARS-Like CoVs the early cases were mostly seen among restaurant workers (Gorblenya et al., 2020). CoVs can infect many of animal in Guandong Province, China and this information led 1WHO (2020a). Coronavirus disease (COVID-2019) situation re- researchers to suspect that transmission source of the ports. [online]. Website: https://www.who.int/emergencies/diseases/ virus might be an animal like bat or civet, that frequently novel-coronavirus-2019/situation-reports accessed 17 May 2020]. consumed in that province (Zhong et al., 2003; Guan et * Correspondence: [email protected] 157 This work is licensed under a Creative Commons Attribution 4.0 International License. UGUREL et al. / Turk J Biol al., 2003; Su et al., 2016). This was the first-known CoV studying viruses hosted by these animals (Ge et al., 2013; outbreak that caused 8096 cases and 774 deaths in 37 Li, 2016; Cui et al., 2019; Liu et al., 2019). Despite RBD countries or areas until July 2003 (WHO, 2004)2. of MERS-CoV-2 binds to dipeptidyl peptidase 4 (Raj et In the summer of 2012, 9 years after SARS outbreak al.,2013), it has been revealed that RBD of SARS-CoV-2 was controlled, a novel coronavirus disease called Middle binds ACE2, like in SARS-CoV (Tai et al., 2020). Although East Respiratory Syndrome (MERS) was reported in Saudi it has been lasted about 1 year to resolve this mechanism Arabia (WHO, 2019)3. A 60-year-old man with severe fever for the SARS-CoV (Li et al., 2003), this timing was much and cough, was diagnosed as pneumoniae and, as in SARS, shorter, as nearly 3 months, with the research ability that no pathogen causing pneumonia was detected, however, have been gained with the developing DNA sequencing fragments that amplified from some of PCR assays for the technologies and over years of experience in this area (Tai detection of coronaviruses were sequenced and analysis of et al., 2020; Ou et al., 2020). the results indicated a novel coronavirus named MERS- 1.4. Genomic studies on CoVs CoV relative with HKU4 and HKU5 (Zaki et al., 2012; de There are more than 7,000 complete genome entry Groott al., 2013; Yin and Wunderink, 2013). Despite the uploaded to Nucleotide/NCBI databases from transmission from animal to human has not been clearly Coronaviridae family between 2002–2020, more than half verified, the dromedary camels have been proposed as the of them is being SARS-CoV-2 (www.ncbi.nlm.nih.gov/ main reservoir host for the virus (Azhar et al., 2014; WHO, nuccore/, accessed 29th May 2020) and it is increasing 2019). This epidemic was mostly effective in Saudi Arabia in parallel to the development of sequencing technology. and in the gulf countries, caused nearly 2500 confirmed The coronavirus genome structure has been characterised cases with 35% death rate (WHO, 2019). by various studies (Kocherhans et al., 2001; Brian and Transmission mechanism of both SARS-CoV and Baric, 2005; Gorblenya et al., 2006; Yang and Leibowitz, MERS-CoV from animal to human was reported to be 2015; Madhugiri et al., 2016) with the genome size the direct contact with host animals or consumption around 30kb. CoVs are referred to have the largest known of raw milk, meat or urine (Yin and Wunderink, 2013; genome size among RNA viruses (Brian and Baric, 2005). WHO, 2019). The studies listed the 3 major causes of All CoV genomes contain a large gene region (named SARS epidemic termination: the public health strategies, ORF) encoding nonstructural proteins (Nsp) which are scientific development in biology and medicine, and responsible for mostly replication and the genes encoding hygiene practices (Chew, 2007). spike (S) glycoprotein, envelope (E) protein, membrane 1.3. Evolution of SARS-CoV-2 (M) glycoprotein and nucleocapsid (N) protein have also It was already known that the Spike (S) glycoprotein been found in common (Brian and Baric, 2005; Gorblenya plays a determining role in CoV infections (Sanchez et et al., 2006). al., 1999) and is effective in viral entry and pathogenesis The genome sequence of SARS-CoV-2 was first (Gallagher and Buchmeiert, 2001). During the ongoing characterised by Wu et al. in December 2019 (Wu et al, SARS outbreak, it has been discovered that the novel virus 2020) and appointed as reference genome of SARS-CoV-2 has developed a new viral entry mechanism by binding (NC_045512.2; 1-29870); consist of 11 gene regions; S protein to angiotensin-converting enzyme 2 (ACE2) ORF1ab (266-21555), S (21563-25384), ORF3a (25393- receptors (Li et al., 2003), as the identification mark of 26220), E (26245-26472), M (26523-27191), ORF6 (27202- SARS-CoV adaptation which has been caused by the 27387), ORF7a (27394-27759), ORF7b (27756-27887), mutations on S protein residues between 318–510 that ORF8 (27894-28259), N (8274-29533), ORF10 (29558- named as receptor binding protein (RBD) (Wong et al., 29674) by NCBI (Genbank, 2020). 2004; Li et al., 2005). After the SARS outbreak, animals Although this reference genome sequence is commonly such as bats, civets, and pangolins, have been seen as the used in most studies, it is extremely important to monitor main reservoir hosts of the SARS-related CoVs that has the variations in the virus genome to understand the similar entry mechanism and scientists have focused on evolution and spread of the virus and also to use this information in the development possible treatments 2WHO (2004). Summary of probable SARS cases with onset of illness from 1 November 2002 to 31 July 2003. [online]. Website: https:// and vaccines accordingly. For this purpose, GISAID www.who.int/csr/sars/country/table2004_04_21/en/ [accessed 17 (GISAID, 2020; Elbe and Buckland-Merrett, 2017; Shu, May 2020]. and McCauley, 2017) which collects and shares genome 3WHO (2019).

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    11 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us