Template for Taxonomic Proposal to the ICTV Executive Committee

To create a new Order

Code† 2005.200G.04 To create a new Order

† Code 2005.201G.04 To name the new order

† Code 2005.202G.04 To designate the following families as part of the new order: • Picornaviridae • • the unassigned genus Iflavirus • Sequiviridae • the unassigned genus • the unassigned genus • Comoviridae † Assigned by ICTV officers

Author(s) with email address(es) of the Taxonomic Proposal • Peter Christian ([email protected]) • Claude Fauquet ([email protected]) • Alexander Gorbalenya ([email protected]) • Andrew King ([email protected]) • Nick Knowles ([email protected]) • Olivier Le Gall ([email protected]) • Glyn Stanway ([email protected])

Old Taxonomic Order Order None • Family Picornaviridae • Family Dicistroviridae • Family Marnaviridae • unassigned genus Iflavirus • Family Sequiviridae • unassigned genus Cheravirus • unassigned genus Sadwavirus • Family Comoviridae

New Taxonomic Order Order Picornavirales • Family Picornaviridae • Family Dicistroviridae • Family Marnaviridae • unassigned genus Iflavirus • Family Sequiviridae • unassigned genus Cheravirus • unassigned genus Sadwavirus • Family Comoviridae

ICTV-EC comments and response of the SG

Argumentation to create a new order:

Introduction: When the first full sequences of the genomes of RNA became available, it was noticed that some plant viruses had features very similar to those of animal, including human, viruses now classified within the family Picornaviridae (Argos et al., 1984; Franssen et al., 1984; Goldbach, 1986, 1987; Goldbach and Wellink, 1988): (i) icosahedral particles about 30nm in diameter packaging a single stranded RNA genome; (ii) the genome 3’ polyadenylated and linked to a 5’-terminal protein, VPg; (iii) proteins expressed as polyprotein precursor(s), processed to mature proteins by one or more a -encoded proteinases; (iv) similarities in the sequences of the domains encoding NTPase and putative helicase (Hel), proteinase (Pro) and polymerase (Pol) domains, and (v) with these domains arranged in the same order, Hel-(VPg)-Pro-Pol, along the polyprotein. Two main differences existed between these groups of viruses: (i) Some plant viruses were known to be bipartite, i.e. the genome is divided into two parts, each carried by a separate virus particle (Le Gall et al., 2005, 2005, 2005), whereas animal viruses are invariably monopartite. (ii) Their capsids were composed of either one capsid protein (CP; about 55kDa in typical ) or two smaller CPs (35 and 20 kDa in and fabaviruses) subunits, whereas the had 3 major CPs (and sometimes an additional smaller one). In 3-D structure, however, the capsids of comoviruses and picornaviruses proved to be strikingly similar, since both were assembled from three distantly related jelly-roll domains arranged with pseudo T=3 (P3) icosahedral symmetry (Hogle et al., 1985; Rossmann et al., 1985; Chen et al., 1989; Chandrasekar and Johnson, 1998; Tate et al., 1999) (Liljas et al. 59- 84; Rossmann and Johnson, 1987). This similarity was all the more remarkable because such a structure had never previously been observed. Further complete genome sequences have revealed several more families that share many, or all, of these features. This has led to the now commonly used concept of “picorna-like viruses” and picorna-like “superfamily/supergroup” (Franssen et al., 1984; Goldbach, 1986, 1987; Goldbach and Wellink, 1988; Gorbalenya and Koonin, 1993; Koonin and Dolja, 1993). Figure 1 shows the genomic organisations of all the picorna-like viruses for which genome sequences are available. The list includes, not only the proposed members of the new order, but also more distantly related picorna-like families that, for reasons discussed below, are considered too divergent to merit inclusion. Until now no formal proposal has been made to the ICTV to create a higher order taxon. This proposal is to create an Order, named the Picornavirales, to comprise the following taxonomic groupings: Picornaviridae, Dicistroviridae, Marnaviridae, Sequiviridae, Comoviridae and the unassigned genera Iflavirus, Cheravirus and Sadwavirus. Invariant properties of the proposed order: It should be noted that not every member of the Picornavirales has been demonstrated to conform to all the taxonomic criteria listed below; for example, to confirm a jelly-roll protein structure “directly” would require the crystal structure to be determined. Nevertheless, direct evidence is available for a wide range of virus types, enabling such properties to be inferred in other cases from comparative sequence analysis, and other, evidence. • Each virus particle comprises a non-enveloped capsid containing a single species of genomic RNA. • The capsid, ~25-30 nM in diameter, is made up of 60 identical protomers, each protomer consisting of three similarly sized and distantly related domains each folded into an 8-stranded beta-barrel (jelly-roll fold). The capsid possesses T=1 icosahedral symmetry relative to protomers although the arrangement of the 3x60=180 constituent domains approximates to that of a T=3 icosahedron. The resulting pseudo-T3 (P3) symmetry is unique to members of this taxon. • The 5’ terminus of the genome, where this has been identified, is covalently attached to a 3-4 kDa virus- encoded protein (VPg). • The genomic RNA also serves as the virus messenger RNA (mRNA); i.e. there are no subgenomic mRNAs. • Each virus-encoded open reading frame (ORF) encoding multiple domains is expressed as a single polyprotein which is processed to the mature virus proteins exclusively by virus-encoded proteinase(s). • Conserved order of the non-structural protein domains (defined below) in polyprotein, Hel-VPg-Pro-Pol. • Conserved order of the three structural domains in polyprotein, their locations in the capsid being as follows: the N-terminal and middle domains (“VP2” and “VP3”, respectively, using nomenclature) are located either side of the 3-fold icosahedral axis, the C-terminal (“VP1”) at the 5-fold. Other properties which, while not invariant, are characteristic of the order: • Each virus particle typically contains a single molecule of genomic RNA, although in some plant viruses with a bipartite genome two molecules of the small genome component can be encapsidated in a single virion. • The genomic RNA is typically 3’ polyadenylated, the sole exception being the genus (family Sequiviridae), where this feature appears to be absent (Turnbull-Ross et al., 1993). • Viruses are typically monopartite (i.e. a single particle is infectious), the sole exception being several taxa of plant viruses (Comoviridae, Cheravirus, Sadwavirus), which have a bipartite genome. • Each virus RNA molecule normally encodes a single multidomain ORF, the sole exception being the Dicistroviridae where, as the name suggests, each molecule encodes two ORFs. • Individual domains of the capsid are normally cleaved proteolytically from each other to generate the three mature proteins typical of the Picornavirales. In the case of the family Comoviridae and (by analogy) probably the unassigned genus Sadwavirus, however, the processing of the CP precursor is not as extensive. This results in the jelly-roll domains remained fused in a single CP subunit: two in the large CP subunit (genera and within the family Comoviridae; probably also unassigned genus Sadwavirus (Iwanami et al., 1999) or three in the single CP subunit (genus ). • the Hel, VPg, Pro, and Pol domains of the eighth groups that make up the Picornavirales can be characterized as follows: o Hel: a superfamily III helicase domain (Gorbalenya et al., 1990), with an “A” motif (GxxGxGK(S/T)), followed about 40 amino-acids downstream by a “B” motif (DD), followed about 30 amino-acids downstream by a “C” motif (KGxxxxSxxxxx(S/T)(S/T)N). This domain occupies the central part of 2C protein in the Picornaviridae and 2C-like protein in other viruses of the Picornavirales and it is involved in NTP-binding and hydrolysis (Mirzayan and Wimmer 176-87;Pfister and Wimmer 1611-19) o VPg: a 3-4 kDa VPg with little or no characteristic sequence features except for the presence near its N-terminus of an amino-acid with a free hydroxyl group to which the RNA 5’ end is covalently linked by a phosphodiester bond o Between Hel and VPg: an hydrophobic domain which may, or may not, be cleaved proteolytically from the Hel moiety o Pro: a proteinase resembling trypsin-like serine proteases in structure and having substrate specificity resembling that of the prototype picornavirus 3C proteinase (3C-like proteinase) . It contains a catalytic triad of histidine, aspartate/glutamate and typically cysteine, in place of the canonical serine of “serine” proteases, which serves as the catalytic nucleophile (Gorbalenya et al., 1986; Bazan and Fletterick, 1988; Gorbalenya et al., 1989; Allaire et al., 1994; Matthews et al., 1994). Interestingly, the only member of the family Marnaviridae to have been sequenced so far (Lang et al., 2004) appears to possess a serine at this position (AEG, unpublished observation). The catalytic cysteine/serine is located near the C-terminus of the Pro domain and is generally followed 10-20 amino-acids downstream by a GΦH motif (where Φ is a polar residue) that appears important for substrate specificity (Bazan and Fletterick, 1988; Cheah et al., 1990; Matthews et al., 1994; Hemmer et al., 1995; Miyashita et al., 1995). o Pol: a Type I Pol domain (Kamer and Argos, 1984; Koonin, 1991) with eight conserved motifs, motifs I (KxE), V (PSGΦΦΦTΦΦΦN(S/T)) and VI (YGDD) being the most conserved ones. The motifs V and VI are part of the catalytic center of this enzyme (Hansen, Long, and Schultz 1109- 22) Phylogenetic evidence: The taxa listed above as candidate members of the Order Picornavirales appear to form a monophyletic lineage when the sequence similarity of the non-structural proteins is used to reconstruct in a hierarchical clustering analysis (Figures 3 and 5). Divergent picorna-like viruses, not assigned to the order Picornavirales: The level of structural, organisational and sequence divergence in the families , and Hypoviridae were judged too high for them to be included in the order Picornavirales, for the following reasons: • The family Caliciviridae have several characteristics atypical of the Picornavirales, as follows: (i) Genome encodes only a single jelly-roll domain. As a result, the icosahedral capsid of 180 jelly rolls is only superficially picorna-like, it exhibits true T3 symmetry (i.e. the 180 domains are identical to each other), rather than P3 symmetry. (ii) Each jelly-roll domain is linked to second, outwardly protruding, domain that is not found among the members of the Picornavirales. (iii) A major divergence can be seen in their strategy of gene expression, which involves the use of sub-genomic mRNAs. For these reasons, it was judged appropriate, on balance, to adopt a somewhat restrictive set of taxonomic criteria that exclude the Caliciviridae. • The family Potyviridae has the following major deviations (i) the members employ a single capsid protein unrelated to the jelly-roll domain that forms capsid exhibiting helical, rather than icosahedral, symmetry ; (Shukla et al., 1994) and (ii) they have a Hel domain of superfamily II rather than III found in the Picornavirales viruses (Gorbalenya and Koonin 419-29). • The members of the family Hypoviridae have the following specifics: they lack virions and 3C-like proteinase, have an atypical (double-stranded) RNA genome and a Hel domain of superfamily II, like the Potyviridae members with whom they phylogenically cluster. Discussion: The picornavirus supergroup comprises the eleven virus types listed in Fig 1, including three distantly related families, the Caliciviridae, Potyviridae and Hypoviridae, which differ markedly from the other eight. It should also be noted that there are other virus families, e.g. Sobemoviridae, Astroviridae and the order , which share certain properties with the Picornavirales viruses. The differences between these viruses and the Picornavirales are especially recurrent in relation to capsid proteins and virions (if any) they made. The caliciviruses appear the most picornavirus-like but, even in this case, there are major differences in the capsid organization, as well as the genome expression strategy, that separate this family from the other eight forming the Picornavirales (see above). As for the Potyviridae and Hypoviridae, these have deviated even more profoundly, including the lack of the icosahedral capsid. Despite these clear differences between the viruses included and not in the Order, we acknowledge that the choice of taxa to be assigned to the order Picornavirales fully depends on the criteria used. In our proposal, we have designed criteria using most conserved properties that characterize the prototype Picornaviridae and found that at least seven other virus groups pass this check. To include only one extra group, the closest Caliciviridae, in the Order, as was proposed before (Koonin and Dolja, 1993) two major criteria related to the virion architecture and genome expression would have needed to relax. We feel that this revision of the criteria would significantly “dilute” the common basis of the Order and, thus, it may not be justified for a relatively minor order expansion. We hope that the creation of the new Order, as defined in this proposal, will facilitate research on picorna-like viruses and knowledge dissemination. We also hope that this proposal will trigger further insights that in due course may lead to an appropriate higher order classification for other viruses of the supergroup that left outside the Picornavirales. Origin of the proposed order name • Derived from the name of the family Picornaviridae, to reflect the common expression “picorna-like virus”. • The etymology of “Picorna” derives from “pico” (small) and “RNA”, to recall that these are small RNA viruses. This is a common property of all viruses in the proposed order.

References Allaire M, Chernaia MM, Malcolm BA, James MN (1994) Picornaviral 3C cysteine proteinases have a fold similar to chymotrypsin-like serine proteinases. Nature 369: 72-76 Argos P, Kamer G, Nicklin MJ, Wimmer E (1984) Similarity in gene organization and homology between proteins of animal picornaviruses and a plant comovirus suggest common ancestry of these virus families. Nucleic Acids Res 12: 7251-7267 Bazan JF, Fletterick RJ (1988) Viral cysteine proteases are homologous to the trypsin-like family of serine proteases: structural and functional implications. Proc Natl Acad Sci U S A 85: 7872-7876 Chandrasekar V, Johnson JE (1998) The structure of tobacco ringspot virus: a link in the evolution of icosahedral capsids in the picornavirus superfamily. Structure 6: 157-171 Cheah KC, Leong LE, Porter AG (1990) Site-directed mutagenesis suggests close functional relationship between a human 3C cysteine protease and cellular trypsin-like serine proteases. J Biol Chem 265: 7180-7187 Chen ZG, Stauffacher C, Li Y, Schmidt T, Bomu W, Kamer G, Shanks M, Lomonossoff G, Johnson JE (1989) Protein-RNA interactions in an icosahedral virus at 3.0 A resolution. Science 245: 154-159 Franssen H, Leunissen J, Goldbach R, Lomonossoff GP, Zimmern D (1984) Homologous sequences in nonstructural proteins from cowpea mosaic virus and picornaviruses. Embo J 3: 855-861 Goldbach R (1986) Molecular evolution of plant RNA viruses. Annual Review of Phytopathology 24: 289-310 Goldbach R (1987) Genome similarities between plant and animal RNA viruses. Microbiological Sciences 4: 197-202 Goldbach R, Wellink J (1988) Evolution of plus-strand RNA viruses. Intervirology 29: 260-267 Gorbalenya AE, Blinov VM, Donchenko AP (1986) Poliovirus-encoded proteinase 3C: a possible evolutionary link between cellular serine and cysteine proteinase families. FEBS Lett 194: 253-257 Gorbalenya AE, Donchenko AP, Blinov VM, Koonin EV (1989) Cysteine proteases of positive strand RNA viruses and chymotrypsin-like serine proteases. A distinct protein superfamily with a common structural fold. FEBS Lett 243: 103-114 Gorbalenya AE, Koonin EV (1993) Comparative analysis of amino acid sequences of key enzymes of replication and expression of positive-strand RNA viruses. Validity of approach and functional and evolutionary implications. Sov. Sci. Rev. D. Physicochem. Biol. 11: 1-84 Gorbalenya, A. E. and E. V. Koonin. "Helicases: amino acid sequence comparisons and structure-function relationships." Current Opinion in Structural Biology 3 (1993): 419-29. Gorbalenya AE, Koonin EV, Wolf YI (1990) A new superfamily of putative NTP-binding domains encoded by genomes of small DNA and RNA viruses. FEBS Lett 262: 145-148 Hansen, J. L., A. M. Long, and S. C. Schultz. "Structure of the RNA-dependent RNA polymerase of poliovirus." Structure (Camb.) 5.8 (1997): 1109-22. Hemmer O, Greif C, Dufourcq P, Reinbolt J, Fritsch C (1995) Functional characterization of the proteolytic activity of the tomato black ring nepovirus RNA-1-encoded polyprotein. Virology 206: 362-371 Hogle JM, Chow M, Filman DJ (1985) Three-dimensional structure of poliovirus at 2.9 A resolution. Science 229: 1358-1365 Iwanami T, Kondo Y, Karasev AV (1999) Nucleotide sequences and of Satsuma dwarf virus. Journal of General Virology 80: 793-797 Kamer G, Argos P (1984) Primary structural comparison of RNA-dependent polymerases from plant, animal and bacterial viruses. Nucleic Acids Res 12: 7269-7282 Koonin EV (1991) The phylogeny of RNA-dependent RNA polymerases of positive-strand RNA viruses. J Gen Virol 72 ( Pt 9): 2197-2206 Koonin EV, Dolja VV (1993) Evolution and taxonomy of positive-strand RNA viruses: implications of comparative analysis of amino acid sequences. Critical Reviews in Biochemistry and Molecular Biology 28: 375-430 Lang AS, Culley AI, Suttle CA (2004) Genome sequence and characterization of a virus (HaRNAV) related to picorna-like viruses that infects the marine toxic bloom-forming alga . Virology 320: 206-217 Le Gall O, Iwanami T, Jones AT, Lehto K, Sanfacon H, Wellink J, Wetzel T, Yoshikawa N (2005) Cheravirus. In CM Fauquet, MA Mayo, J Maniloff, U Desselberger, LA Ball, eds, Virus Taxonomy, VIIIth Report of the ICTV. Elsevier/Academic Press, London, pp 803- 805 Le Gall O, Iwanami T, Jones AT, Lehto K, Sanfacon H, Wellink J, Wetzel T, Yoshikawa N (2005) Comoviridae. In CM Fauquet, MA Mayo, J Maniloff, U Desselberger, LA Ball, eds, Virus Taxonomy, VIIIth Report of the ICTV. Elsevier/Academic Press, London, pp 807-818 Le Gall O, Iwanami T, Jones AT, Lehto K, Sanfacon H, Wellink J, Wetzel T, Yoshikawa N (2005) Sadwavirus. In CM Fauquet, MA Mayo, J Maniloff, U Desselberger, LA Ball, eds, Virus Taxonomy, VIIIth Report of the ICTV. Elsevier/Academic Press, London, pp 799-802 Liljas, L. et al. "Evolutionary and taxonomic implications of conserved structural motifs between picornaviruses and insect picorna-like viruses." Archives of Virology 147.1 (2002): 59- 84. Matthews DA, Smith WW, Ferre RA, Condon B, Budahazi G, Sisson W, Villafranca JE, Janson CA, McElroy HE, Gribskov CL, et al. (1994) Structure of human rhinovirus 3C protease reveals a trypsin-like polypeptide fold, RNA-binding site, and means for cleaving precursor polyprotein. Cell 77: 761-771 Mirzayan, C. and E. Wimmer. "Biochemical studies on poliovirus polypeptide 2C: evidence for ATPase activity." Virology 199 (1994): 176-87. Miyashita K, Utsumi R, Utsumi T, Komano T, Satoh N (1995) Mutational analysis of the putative substrate-binding site of 3C proteinase of coxsackievirus B3. Biosci Biotechnol Biochem 59: 121-122 Murphy JF, Rhoads RE, Hunt AG, Shaw JG (1990) The VPg of tobacco etch virus RNA is the 49-kDa proteinase or the N-terminal 24-kDa part of the proteinase. Virology 178: 285- 288 Pfister, Thomas and Eckard Wimmer. "Polypeptide p41 of a Norwalk-Like Virus Is a Nucleic Acid-Independent Nucleoside Triphosphatase." Journal of Virology 75.4 (2001): 1611- 19. Rossmann MG, Arnold E, Erickson JW, Frankenberger EA, Griffith JP, Hecht HJ, Johnson JE, Kamer G, Luo M, Mosser AG, et al. (1985) Structure of a human common cold virus and functional relationship to other picornaviruses. Nature 317: 145-153 Rossmann, M. G. and J. E. Johnson. "Icosahedral RNA virus structure." Annual Review of Biochemistry (1987). Shukla DD, Ward CW, Brunt AA (1994) The Potyviridae. CAB International, Wallingford, UK Tate J, Liljas L, Scotti P, Christian P, Lin T, Johnson JE (1999) The crystal structure of cricket paralysis virus: the first view of a new virus family. 6: 765-774 Turnbull-Ross AD, Mayo MA, Reavy B, Murant AF (1993) Sequence analysis of the parsnip yellow fleck virus polyprotein: evidence of affinities with picornaviruses. J Gen Virol 74: 555-561

Annexes:

Figure 1: Genomic organization of the picorna-like viruses. The abbreviations are Hel: Helicase; Pro: Proteinase; Pol: Polymerase; CP: Capsid Protein; CP: Virion Protein; MP: Movement Protein. The figure includes, not only the members of the proposed order Pixcornaviraless (boxed), but also the, more divergent, families not currently proposed for inclusion.

Figure 2. Unrooted phenogram showing the relationships of the CrPV-like viruses to representatives of the Picornaviridae, Comoviridae, Sequiviridae and the floating genus Iflavirus. The phenogram was constructed from an amino acid similarity matrix of the replicase (RdRP) region of the non-structural proteins using the neighbour-joining method. A bootstrap analysis was performed and the percentage values are indicated at the branhing points. Viruses other than dicistroviruses included in the analysis, abbreviation ( ) and accession numbers [ ] are; Cowpea severe mosaic virus (CPSMV) [M83830], Encephalomyocarditis virus (EMCV) [M81861], Foot-and-mouth disease virus (FMDV) [X00871], Hepatitis A virus (HAV) [M14707], Infectious flacherie virus (IFV) [AB000906], Parsnip yellow fleck virus (PYFV) [D14066], Perina nuda virus (PnV) [AF323747], Poliovirus (PV) [J02281], Rice tungro spherical virus (RTSV) [M95497], Sacbrood virus (SBV) [AF469603]. Branch lengths are drawn to scale.

Figure taken from the VIIIth ICTV report.

Figure 3: Phylogenetic analysis of picorna-like RdRp domain protein sequences. CLUSTAL_X alignments were done with residues 1362-1619 of the HaRNAV polyprotein that represent the conserved regions I-VIII (defined in Koonin and Dolja (1993)) and the corresponding regions from the other viruses included. The tree is based on maximum likelihood distances generated with TREE-PUZZLE. The sequence from the Carnation mottle virus (CarMV) was used as an outgroup. Support values based on 10,000 puzzling steps are shown above the branches. Bootstrap values (based on 1,000 replicates) for branches that are supported by >50% by neighbor-joining analysis are labeled below the branches (a dash indicates there was no corresponding branch in the neighbor-joining tree). The maximum likelihood scale bar is shown.

Figure taken from the VIIIth ICTV report.

Figure 4. Unrooted phenogram showing the relationships of the RdRps of the Nidovirales lineages with virus of the families of the “Picornavirus-like” supergroup, Tetraviridae and . The most conserved part of RdRps from representative viruses in the Picornaviridae, Dicistroviridae, Sequiviridae, Comoviridae, Caliciviridae, Potyviridae, , Ronivirida, , Birnaviridae, Tetraviridae and unclassified insect viruses was aligned. The RdRps of Thosea asigna virus (TaV), Euprosterna elaeasa virus (EeV) and the birnaviruses were converted into the canonical ABC form before the analysis. Using an extended, gap-free version of the alignment containing 332 informative characters, an unrooted neighbor-joining tree was inferred by the ClustalX1.81 program. All bifurcations with support in > 700 out of 1000 bootstraps are indicated. Different groups of viruses are highlighted. Note that the relative positions of members of the families Arteriviridae and Roniviridae within the Nidovirales are not resolved in this tree.

Virus families and groups, viruses included in the analysis, abbreviations ( ) and the NCBI protein (unless other specified) IDs [ ] are as follows: Picornaviridae: Human poliovirus type 3 Leon strain (PV-3L) [130503] and 1 (HpeV-1) [6174922]; Iflavirus: Infectious flacherie virus (InFV) [3025415] and Unclassified insect viruses: Acyrthosiphon pisum virus (APV) [7520835]; Dicistroviridae: Drosophila C virus (DCV) [2388673]; Sequiviridae: Rice tungro spherical virus (RTSV) [9627951] and Parsnip yellow fleck virus (PYFV) [464431]; Comoviridae: Cowpea severe mosaic virus (CPSMV) [549316] and Tobacco ringspot virus (TRSV) [1255221]; Caliciviridae: Feline calicivirus F9 (FCV-F9) [130538] and Lordsdale virus (LORDV) [1709710]; Potyviridae: Tobacco vein mottling virus (TVMV) [8247947] and Barley mild mosaic virus (BaMMV) [1905770]; Coronaviridae: Human coronavirus 229E (HCoV) [12175747] and Berne (BEV) [94017]; Arteriviridae: Equine arteritis virus (EAV) [14583262]; Roniviridae: Gill-associated virus (GAV) [9082018]; Tetraviridae: Thosea asigna virus (TaV) [AF82930; nt sequence] and Euprosterna elaeasa virus (EeV) [AF461742; nt sequence]; Birnaviridae: Infectious pancreatic necrosis virus (IPNV) [133634] and Infectious bursal disease virus (IBDV) [1296811]. (Tree was modified from Gorbalenya et al., (2002), with permission).

Figure taken from the VIIIth ICTV report.

Figure 5: Unrooted phenogram derived from the RdRp domain of the viral non-structural proteins showing the relationships of representative picornaviruses, sequiviruses, dicistroviruses and the three members of the genus Iflavirus (IFV, PnV and SBV). Taxa used (with sequence accession numbers shown in brackets [ ]) were Cowpea severe mosaic virus (CPSMV) [M83830], Drosophila C virus (DCV) [AF014388], Encephalomyocarditis virus (EMCV) [M81861], Foot-and-mouth disease virus (FMDV) [X00871], Hepatitis A virus (HAV) [M14707], Poliovirus (PV) [J02281], Parsnip yellow fleck virus (PYFV) [D14066], Plautia stali intestine virus (PSIV), Rhopalosiphum padi virus (RhPV) [AF022937], and Rice tungro spherical virus (RTSV) [M95497]. A bootstrap analysis was performed and values obtained are shown next to the branching points. Branch lengths are proportional to distance.

Figure taken from the VIIIth ICTV report. Figure 6: Figure trypsin C-terminusandthe status aregiven directly inthe dendro each sequence, the GenBank accession number, the virus sequences were aligned using Clustal using the Neighbor viridae Calici Potyviridae

N N N

P P N

Y P NP051035 PECV Calici-Sapo P P

8 0 0 N

5 N

6 0 N

N 0 6 P

9 N Y 6 6 P

2 8 P

Hierarchical clustering basedon theregion betwee P

4 P

P 3 2 1 4

8

2 6 N 0

7 4

N

6 3 8 Y

A 2 N N

0 0 6 6

5 P

8 7 8

P 5 1 P

A 8 5 P

2 P

6

8 F 5 5 3 9

T 4 F 9

P Y

0

2 2

E 6

9 9 1 6

M 6

0 3 M

P F R 4 2

5 5 0

9 9

B 9 2

1 1 5

4

D 4

i M 6 5

D H 4

0 N

0 6 0

7 4 9

c H

B 9 0

V 8 4

2 P D 6

9 D V 0

F V

7

F S

F

Y

2 9

o -

2 W

3 5 2

-

M 7

M A V V E

0

77 0 C 1

C V

6 M

5

F

4

r s -

S

Y

D C

D

Y

V

C

A N B M S

V 7 i 6 C

n A a P O M

V

V V V P

M

7

a P

A F D V

1 P i a C M

- Y

P -

4

a c V l 0

B M S O C M V M

i i l

a

V

N o C i 1

c o

2 V P c Y

6

- a c a

M

v D P

l

V

P 5

S o r i P t P W

1 i a i - i

t n P

c y c

l

V - P

2 1 o

L

9 r i

V

a i

i 6 l

o L

a c P o

0

-

c i

- n ic 0 S

N

o r 5 a

t t -

5

S P B

a

i 2

r y

3 o - t

A a 6 V P

2 o

M - P t

y

i 3 n A i g

5

a i y

-

g

A r - y

- V d

c t

0 e o -

P n a o A

E N B

1

y

V

t A p N 0 m

A o -

7 1 o u e

t

g

i a - s 0 B

A R -

4 P

h y

4 c p o y n

a 5 r A

B o P

si M

- i 7

AA46547 P n 47

y 7

A o t m -

A h r a 5

E p o

I

o

9 y

a o H i r m V

e V t

s

p

c 7

p o

2 R n h o 2

t

- m N

M s

0

o o

y

9 o

P a A h t 7

A P

P

i

P o 7

- r m o

g

i - p t V

T o R T na 2

1 V c A o

4

n

h h o 7 3

t r

o M P

p 7

P y

e

i 5 V N

- t

t

7

r o B

Men A h o

9 - V

i

i P

n d

m 5

3 P c R C t t

p

a

N 2

o o y 0

i P

y M

E c - ht o

P 2 - r 5

go A o

o m M n R

6

V

6 o Z

r a p t

7

C y y

5 n o

Y N

- h P

P 5 m

3 V a A - P

M

R

N t 8

1 ic o

Tobacco mosaic mosaic Tobacco - - o

p o

P 4 B C

y 6 V t

or P

3 h y 1 m

6 a

P R

t -

P

5 P n o 9 r P

S i o N

3 a d 6

T c o

o

P

6

0 o -C i V

V o t

7 t y

r 7

0

y

n P

7 -

N P a 4 L

a P

P E i o

r 0

c - M

d o t

8

R o C N

7 y

io V

t

0

4 B P

y

r a -

n 7 0 P

V P

r 0

25 a

d P

o

5

P - i o

P 6

T t

7 o t

y i y

7

B B c e V

-

A o s 5

A

K P P

B r c 9

A

V n o

6 o

h P

a O

t

2 P o t V y

y

N 8 -E 8

i Y P 3

8 co -

P

6

6 9 r

b P

6

5 A r o o na 1

o

3 t

i Outgroup y

V

1 t

P

4 - y

P K Q

V

5 -

P

0

P ic o Y

2

B o

E o bu

9 P

gram. The outgroup used is a chimeric A r t y

A V- n 6

o

0 A a 3 t

0 y

- P A

K N

1 P -

A V

P

6 i o P

N Y

8 c

0

o b o 8

0

H t

u P

2

r 7

R n y 7

o 8

V a 6

4 t

0

N - - y 4

A E -

C P -

P

n P

6 P A

t o V

5 i e A

2 Y

3 c t

r y

1 o o 4

6

4 r P

9 n 8

o

a 8

S t

- y

E R T

-

r

P

V h y

N - i p

A n o

P s t

o i

y 0 P n

4 i -

1 c T 00 or M n V 9 HR a-E N V-B nter P 04 Pic o 2242 orna HEV -Rhi -A Pi no corna CAA -Ente 74807 ro PEV-B Picorn a-Ente NP 04 ro 5756 BEV Picorna- RdRpN-terminus. Entero NP 040958 HEV- 0.1 B Picorna-Entero acronym andashortened desc -Joining approach,andp-dist NP 040760 HEV-D Picorna-Entero BAB18312 BBWV2 Como-Faba

n thePro (CG)andthe Pol -Entero NP 041277 PV Picorna AAD3921 7 BBWV2 Como-Faba -Entero

HEV-C Pico

AAQ02675 BAA34928

N B

P BW

6 V

4 2 C

7 o 6 m

0 o

2 -Fa

L ba V

P

i c

o N

r

n P

a

- 6

P 4

a 7

r 5

e 9 c

A 2

h o

A P

A aM

7 M

2 V

2 C

9 o

1 m

H o-

P A Fa

e A b

V D a

P 3

i 8

c 1

N

o 5

r P

n 2

a

B

-

0 B P

4 W

a

1

r

e V 0

c 2 0

h

7

o N C

o

H P m

A 1 o-

V 4 F

N

9 a

P b

P 0 i a

c 1

o 2 65

r

n B

a 31 B

- W

H B

5

e A V p

1 2 a

B A Y

t C

o 4

E

P 0 om

V 4

o

0 3

P -

1 9 F

i a

c

5 B b o

6 B a

r B

9

n A W a

6

- D

N

H V

K 9

e 2

P

a

p 9 C

a

V 0 8

t o

o 0

5

I m f N 1

l

3 o

a P G -

5 F

6

Y 9 e ab 0 M

P 4 a

V D 5

1

W N 1 C

4 3 o

sequence reconstituted from thebovine

V P

5 4 m

7

I B o

N 6 f

9

l B -

a 2 F P

1 0 W a

b

V N

0 6 V a

D 5

4 P 7 1 V

9 6 C 1

3 1 S o

7 q m I

N 3

f

4 M

l N o

a 2 P

V -

S P 8 F

B a

6 3 C b

2 6

V C o

N a

0

2 P m I

I P

f

5 0 o

l M

a N

f 5 4 -

9 V C

9 P 6

1 o

l

N 8 C

I m 9

a F 6

P P R o o

0

V 1 m

2 3 C 23

2

I o 9

v f 6 M

7 -

l

a 3 4 C E V

7 1 9 o o

i 0 2 C m V

r 6 A B

N o o

1 C P

I A m

P

u f

ances arerepresented. For

l M

N

P B a P o

(GDD) sequence motifs. The

94 V -

n N S

s P 6 C ription of thetaxonomical

V

N C 9 M

C o

4

6 8

P m

I V o

7 N f

2 N 0 6

l m o

7 a

0 7 C

6 0

P P o 6

5 3 o

2 - P

N N

6 9 C 1

H 6 m

0 R

2 P 5 o

P 9 4

a o

5 M m

N 8 3

R -

9

5 T

6 C o

6 Y 8 V

0 5 N

P

r N B o

V 1 1 4

5 P

A R C m

P P 9 4 7 N 0

7

D

V S o o

S 7

4 0

6 P 6 T m i

T

B V M 0 c

I

8 5 B 6

V N 2 o S

i 5

A a

s

1 6 3 R C

N

2 0 -

N

V P t D r N

G A o r 1 9

n

4 V

C 6

P A o

e P

i

a 2 m

D 9

1 c

7 1 C

- A 6 p

r

A C

Y C

i A 5 o Y

P 6 o

i 6

4 9 M N

s 1 M

5

A

c N A

N o

r A D 8 - C t

P

V 5

2 A

P i A

N N i C 2 r

9 C

B

p V m

s D

N

o P

P 9

A

3

0 5 e

r

a N D 6

a

9

t P

D

P N

- o

0 M

r

22

7 P 5 C p N

5

C 0 D 0 0 G

o

0 -

P 6 6 S

i 5 o

V

r 8 Comoviridae N

6

c 0 0 V o

r

- 6 4

6 P N

8 B

5 F i n 2 2

C 0

1

V 0

i

73 m e

p

6 6 5 s

D 4

6

1 6

o L

B 0 C

0 0 r

a 6 p

4

4

t

a 9 5 0 o

i 9 C

S 9

i 1 I

r V

p o

6 4 R 5 7 c 5 o S 6

o 2

M 9 -

4 0

a o B

v 7 9

D N

i T

6 6 4 5 m -

7 2 C 5 M V s 7

4 m

V

C

D 3

7 T e o i V 8 5 t

1

0 o o

C r 4 S

r T

o T o p

r 4 C

1 o R

o

S -

i 7 m

i i o

A 8

S -

T

p T

V o R L N

R 6

c -

d R o N

a

P S B

a

C 7 o o

a

S o R

R R

S m e L e i d

S S

M

a V

Y -

r

s d R R p

L

w S p

S R T V V

i o N

a V w

p o

F

e o

t D

V S

S C

S a

V V

- e a

r d R

C A B a

C N V

C M VC p

w

V o o V

C S

S C o e

h o

o m

a

S

v V

h m p a V

e C S o A

m o

e o

i e d

r o m o

e o

r S

a C o - C q r

i w m

N q - m a

d e o

u - N

o a u N e

o -

q

i o e a m

- i N

- e po u

- - S p

N e W N o e

i p o

e - e C - p e o

virus Sadwa W A

q a N p o p C

virus Chera

u i

e o

a C

o k

i i C p

a C k virida Sequi

o a

A e