Supporting Information
Total Page:16
File Type:pdf, Size:1020Kb
Supporting Information Kapoor et al. 10.1073/pnas.0807979105 SI Text quence with sequence order randomized controls (8). In contrast Determination of 5 UTR Structure. In the deduced type II IRES to its detection in aphthoviruses, erboviruses, cardiovirus, and stem loops H, I, K, and L (as classified in ref. 1) and unpaired teschoviruses (mean MFEDs for coding regions 4.1–8.5%), regions showed nucleotide sequence similarity or identity to there was no sequence order-dependent structure in the coding sequences from the other four most closely related picornavirus region of the HCoSV-A1 genome (mean MFED Ϫ1.2%), a genera (Fig. S3B gray boxes, labeled 1–14). HCoSV-A1 addi- finding shared with the sole member of other genus related to tionally contained the short polypyrimidine (Py) tract (Fig. S3 A HCoSV-A1, SVV (mean MFED 0.1%). Absence of GORS and B, gray box labeled 15) immediately downstream of the last suggests that HCoSV-A1 infections may be acute and resolving, stem loop, L, and showed a similar spacing upstream from the reducing the likelihood that chronic infections with HCoSV-A1 predicted start codon of the HCoSV-A1 long ORF. Although might occur (8). there was little or no sequence identity in the duplex stems on these structures between virus groups (Fig. S3B), the H, I, K, and Prevalence of HEV Infections. We tested 41 AFP cases and all 41 L stem loops were of very similar size and relative spacing. healthy controls for the presence of HEV by using RT-nested The main structural difference between HCoSV-A1 and a PCR with primers targeting the 5Ј UTR. All positive PCRs were typical type II IRES was the existence of an extremely long, confirmed as HEV-B or HEV-non B by sequencing; 31/41 AFP bifurcated stem loop between the I and K structures (Fig. S3A). and 25/41 control stools were HEV positive, showing an equal In other type II IRESs, this region, designated stem-loop J (or rate of HEV detection in both groups (P ϭ 0.15). Coinfections domain 4–1) contains a single stem loop with lengths ranging with HEV and HCoSV were no more frequent in AFP cases than from 37–50 bases in FMDV and ERAV, 55 bases in erboviruses, controls. Significantly more HEV than HCoSV infections were and 37–43 in cardioviruses. The length of the corresponding detected in AFP (76% v 49%; P ϭ 0.008) but not in healthy region in HCoSV-A1 was 140 bases. No detectable sequence children (61% v 44%; P ϭ 0.12). HEV-positive children were homology in the terminal loop or duplex stem sequences of younger than HEV-negative children (45.3 months versus 76.3 HCoSV-A1 was found with equivalent regions in other picor- months, P ϭ 0.04) and more likely to have fever at onset of AFP naviruses. However, HCoSV-A1 retains a short region of nu- (25/31 with fever versus 6/31 without, P ϭ 0.023) and had the cleotide sequence homology in the duplex region in the proximal same male/female ratio. Whether HEV infected or not AFP stem (labeled 10 and 11) and the A-bulge (conserved region 13; cases showed the same clinical profile relative to paralytic Fig. S3B), shown to be the specific regions that bind to the asymmetry, progression at 2 weeks follow-up or total number of ribosomal eIF4B component (2, 3). oral poliovirus vaccine doses previously received. We deter- Although structures I–L represent the main conserved ele- mined whether HCoSV and HEV coinfections were more fre- ments in type II IRESs, HCoSV-A1 showed evidence for quent than expected assuming independent events. Infection additional RNA secondary structures upstream of the predicted with 1 virus genus was significantly associated with infection by IRES, including putative homologues of stem loops F and G, the other virus (P ϭ 0.01) possibly reflecting related transmis- identified in cardioviruses (4) and a possible homologue of stem sion routes. loop D upstream from a short, second interrupted predicted unpaired polypyrimidine tract (positions 551–575). These Picornavirus Sequences for Phylogenetic Analysis. For determina- HCoSV-A1 structures showed similar spacings upstream from tion of sequence divergences between cosaviruses and those of structure I in cardioviruses and contained identical (D) or other picornaviruses, sequences were aligned from 2 or more similar (F) terminal unpaired regions (tetraloops and pentaloops representative serotypes from each species within each of the 9 respectively). Sequence variation in both strands of the F stem currently classified genera of picornaviruses, members of the maintained absolute complementarity supporting the proposed proposed ‘‘Sapelovirus’’ genus, and the currently unclassified F stem loop structure. The small stem loop upstream of H viruses duck hepatitis virus (DHV) (9), Seneca Valley virus showed an identical pentaloop sequence to a structure in the (SVV), and seal picornavirus (SePV-1) (10). Sequences used for equivalent position in cardioviruses (position 398). Where de- the comparison comprised the following: Genus Aphthovirus, termined, the region encompassing stem loops D–G does not foot-and-mouth disease (FMDV) NC011450, FMDVALF, contribute to IRES function and is known to be highly variable FDI320488, NC004004, NC002554, FMV7572, NC004915, between different genera. No comparable structures are iden- AY687334, NC003992, FDI251473, AY593843, NC011451, tifiable in erbovirus or ERAV sequences (5, 6), whereas in AY593853, NC011452, equine rhinitis A virus (ERAV) FMDV, the region contains a large stem loop that forms the NC003982, ERVPOLY; genus Enterovirus Simian enterovirus cis-replicating element (7). NC003988, human enterovirus, species C NC002058, The 3Јend of the HCoSV-A1 genome was polyadenylated HPO132960, POL2LAN, POL544513, POL3L37, HPO293918, (A29). Excluding this, the 3ЈUTR was 95 bases in length and NC 001428, CXA24CG, species B NC 002347, NC 001360, contained a consecutive series of 13 U2–4 sequences interspersed NC 000881, NC 001657, NC 002601, NC 001342, NC 001656, with almost invariably purines (R1–3). Although this region is NC001472, species A NC001612, ETU22522, species D known to structurally variable between picornavirus genera and NC001430, AY426531, bovine enterovirus BEVVG527, precludes the type of structure modeling carried out on the AY508697, human rhinoviruses NC001617, DQ473500, 5ЈUTR, MFOLD did identify a thermodynamically stable large EF173418, DQ473490, EF173420, EF186077, EF077279, terminal stem loop from position 7522 to 7611, containing a EF077280, NC001490; genus Hepatovirus: avian encephalomyo- single long interrupted duplex that incorporated the stop codon carditis virus (AEV) NC003990, AJ225173, AY517471, at position 7537 (data not shown). AY275539, simian hepatitis A virus (HAV) SHVAGM27, hu- Finally, the HCoSV-A1 sequence was analyzed for evidence of man HAV HAVRNAGBM, NC001489; genus Cardiovirus: genome-scale ordered RNA structure (GORS) by comparison of encephalomyocarditis virus (EMCV) NC001479, XXEVCG, folding energies of consecutive fragments of nucleotide se- Theiler’s virus NC001366, AB090161, Saffold virus EF165067, Kapoor et al. www.pnas.org/cgi/content/short/0807979105 1of8 AM922293; genus Erbovirus:NC003077, NC003983; genus POL3L37; POL2LAN; V01148; AY790926; AB204853; Parechovirus: Ljungan virus AF538689, AF327921, AF327922, AB192877; AY421763; AY421764; NC001430; AY426531, human parechoviruses (HPeV) ECHPICORN, NC001897, NC001656, NC002347, EV11VPCD, DQ534205 and AB084913, AF055846, NC008286, AB252582; genus Kobuvirus: NC001342. All alignments were numbered by using POL3L37 as bovine kobuvirus NC004421 and Aichi virus NC001918 and a reference sequence. DQ028632; Genus Teschovirus: PEN011380, AF296088, AF296107, AF296112, AF296090, AF296092, AF296093, Picornavirus Sequences for Genome-Scale Ordered RNA Structure AF296094, AF296119, AF296117; currently unclassified viruses Comparisons. The following complete genome sequences were DHV: EF382778, DQ249301, DQ249300, DQ249299, used: for the genus Aphthovirus, GenBank accession nos. EF093502, SVV: DQ641257, SePV-1: NC009891 and members AF154271, FMDVALF, NC002554, NC003982, NC003992, of the provisionally assigned genus ‘‘Sapelovirus’’: porcine en- NC004004, and NC002527 for FMDV; for the genus Cardio- terovirus A NC003987, simian picornavirus 1 AY064708 and virus, accession nos. NC001479 and MNG POLY for encepha- duck picornavirus NC006553. lomyocarditis virus and NC001366 for Theiler’s virus; for the For comparison with human enterovirus diversity, the follow- genus Enterovirus, accession nos. NC001428, NC001430, ing datasets were used: HEV species B: AY302539, NC000873, NC001472, NC001490, NC001612, NC001617, NC001752, DQ246620, DQ534205, AY302558, AF465517, NC001360, NC001859, NC002058, NC003986, NC003988, POL3L37, and AF311939, EF174468, EF174469, AF085363, NC000881, SVDMPS; for the genus Erbovirus, accession no. NC003983; for CXU57056, AY752945, AF231763, AY673831, AY875692, the genus Hepatovirus, accession nos. NC003990 for avian AF114383, NC001342, AY167104, AY167105, E11577590, encephalomyocarditis virus, SHVAGM27 for simian HAV, and E11577589, E11577594, AY167106, E11276224, EV11VPCD, NC001489 for human HAV; for the genus Kobuvirus, accession AY036578, AY302559, AF524867, AF524866 and NC001656; nos.s NC004421 for bovine kobuvirus and NC001918 for Aichi for interspecies comparisons and intergenus comparisons (with virus; and for the genus Teschovirus, accession nos. AB038528, the sapelovirus sequences listed above), the following sequences AF231769, AF296087, AF296091,