Computational Exploration of Diversity on Transcriptomic Datasets

Digital appendix to the dissertation submitted for the doctoral degree (Dr. rer. nat.) of the Faculty of Mathematics and Natural Sciences of the Rheinische Friedrich-Wilhelms-Universität Bonn

submitted by Simon Käfer from Andernach

Bonn 2019


Table of Contents

1 Preliminary Work - Phylogenetic Tree Reconstruction
1.1 Non-segmented RNA Viruses
1.2 Segmented RNA Viruses
1.3 Flavivirus-like Superfamily
1.4 Picornavirus-like Viruses
1.5 Togavirus-like Superfamily
1.6 -like Viruses

2 TRAVIS - True Positive Details
2.1 INSnfrTABRAAPEI-14
2.2 INSnfrTADRAAPEI-16
2.3 INSnfrTAIRAAPEI-21
2.4 INSnfrTAORAAPEI-35
2.5 INSnfrTATRAAPEI-43
2.6 INSnfrTBERAAPEI-19
2.7 INSytvTABRAAPEI-11
2.8 INSytvTALRAAPEI-35
2.9 INSytvTBORAAPEI-47
2.10 INSswpTBBRAAPEI-21
2.11 INSeqtTAHRAAPEI-88
2.12 INShkeTCLRAAPEI-44
2.13 INSeqtTBNRAAPEI-11
2.14 INSeqtTCJRAAPEI-20
2.15 INSeqtTCZRAAPEI-47
2.16 INSeqtTDXRAAPEI-19
2.17 INSlupTBDRAAPEI-17
2.18 INSlupTBKRAAPEI-31
2.19 INSlupTBMRAAPEI-34
2.20 INSlupTBURAAPEI-45
2.21 INSlupTAFRAAPEI-44
2.22 INSntgTABRAAPEI-216
2.23 INSlupTASRAAPEI-89
2.24 INSqiqTBFRAAPEI-61
2.25 INSqiqTBLRAAPEI-83
2.26 INSqiqTBNRABPEI-90
2.27 INSqiqTCTRAAPEI-75

2.28 INSlupTATRAAPEI-90
2.29 INSqiqTCXRAAPEI-90
2.30 INSqiqTDLRAAPEI-72
2.31 INSobdTDTRAAPEI-18
2.32 INSobdTDYRAAPEI-30
2.33 INSerlTCGRAAPEI-32
2.34 INSkzdTABRAAPEI-136
2.35 INSkzdTACRAAPEI-171
2.36 INSofmTBLRAAPEI-71
2.37 INSofmTCYRAAPEI-79
2.38 INSqiqTDDRABPEI-136
2.39 INSfkjTBIRAAPEI-202
2.40 INSerlTAKRAAPEI-83
2.41 INSfkjTBMRAAPEI-206
2.42 INSofmTCERAAPEI-22
2.43 INSofmTCFRAAPEI-26
2.44 INSinlTAARABPEI-43
2.45 INSinlTAPRAAPEI-33
2.46 INSinlTAWRAAPEI-44
2.47 RINSinlTCARAAPEI-55
2.48 RINSinlTCNRAAPEI-33
2.49 RINSymlTABRAAPEI-202
2.50 RINSwvkTAURAAPEI-56
2.51 ANIsrmTAAWRAAPEI-225
2.52 WHANIsrmTMAFRAAPEI-14
2.53 WHANIsrmTMCHRAAPEI-56
2.54 INSeqtTBBRAAPEI-75
2.55 INSobdTDIRAAPEI-84

1 Preliminary Work - Phylogenetic Tree Reconstruction

1.1 Non-segmented RNA Viruses

[Phylogenetic tree. Panel title: non-segmented RNA virus (-). Scale: 0.3. Tips are 1KV_mono_* sequences together with reference sequences labeled by accession number and virus name. Labeled clades: I to XIII. Labeled groups: Anphevirus, Lyssavirus, Vesiculovirus, Crustavirus, Chengtivirus, Arlivirus, Wastrivirus, Novirhabdovirus.]

Figure 1: Non-segmented RNA Viruses


1.2 Segmented RNA Viruses

[Phylogenetic tree. Panel title: segmented RNA virus (-). Scale: 0.3. Tips are 1KV_bunya_* and 1KV_orthomyxo_* sequences together with reference sequences labeled by accession number and virus name. Labeled clades: I to XVII. Labeled groups: Thogoto, Influenza, Quaranja, Isa, Herbe, Emara, Orthobunya, Tospo, Hanta, Reptarena, Mammarena, Phasma, Fera, Jon, Nairo, Gouko, Phlebo, Tenui.]

Figure 2: Segmented RNA Viruses

1.3 Flavivirus-like Superfamily

[Phylogenetic tree. Panel title: Flavivirus-like superfamily (+). Scale: 0.3. Tips are 1KV_flavi_*, 1KV_mono_*, 1KV_picorna_* and 1KV_toga_* sequences together with reference sequences labeled by accession number and virus name. Labeled clades: I to XIII. Labeled groups: Jingmen, Pesti, Flavi, Pegi, Hepaci, Tombus.]

Figure 3: Flavivirus-like Superfamily

1.4 Picornavirus-like Viruses

[Phylogenetic tree. Panel title: Picornavirus-like virus (+). Scale: 0.3. Tips are 1KV_picorna_*, 1KV_nido_*, 1KV_flavi_* and 1KV_mono_* sequences together with reference sequences labeled by accession number and virus name. Clades are labeled with Roman numerals; labeled groups include Hypoviridae.]

NC_022896-Sclerotinia sclerotiorum hypovirus 2 NC_006559-Solenopsis_invicta_virus 1KV_picorna_0006961KV_picorna_000695 1KV_nido_000071 1KV_flavi_000003 1KV_flavi_000163 NC_001616-Potato virus Y

1KV_picorna_000342 NC_001814-Ryegrass mosaic virus NC_004807-Kashmir_bee_virus 1KV_picorna_000351 NC_008558-Blackberry virus Y NC_002548-Acute_bee_paralysis_virus-TS 1KV_flavi_000006 1KV_picorna_000772 1KV_flavi_000155 NC_001886-Wheat streak mosaic virus

1KV_flavi_000064 NC_003797-Sweet potato mild mottle virus NC_001834-Drosophila_C_virus NC_002990-Barley yellow mosaic virus 1KV_nido_000004 1KV_picorna_000778NC_003924-Cricket_paralysis_virus-TS 1KV_picorna_000989 1KV_picorna_000642 1KV_picorna_000072 MMU58771-Maclura mosaic virus

1KV_mono_000284 XVII 1KV_picorna_000132 NC_012931-Wheat yellow dwarf virus 1KV_picorna_000581 NC_014793-Mud_crab_dicistrovirus 1KV_picorna_000698 NC_001747-Potato leafroll virus 1KV_picorna_000480 1KV_picorna_000552 NC_003005-Taura_syndrome_virus 1KV_picorna_000344NC_018570-Macrobrachium_rosenbergii_Taihu_virus (strippe... 1KV_picorna_000343 NC_003629-Pea1KV_picorna_000288 enation mosaic virus 1 1KV_picorna_000765 1KV_picorna_000986 1KV_picorna_000027 1KV_picorna_000055 1KV_picorna_000692 1KV_picorna_0004641KV_picorna_000595 1KV_picorna_000716 JF423196-Big Sioux River virus 1KV_picorna_000096NC_004365-Aphid_lethal_paralysis_virus 1KV_picorna_000862 1KV_picorna_000699 1KV_picorna_000715 1KV_picorna_000554

NC_001874-Phopalosiphum_padi_virus 1KV_picorna_000067 1KV_picorna_000844 1KV_picorna_000735 1KV_picorna_000387

1KV_picorna_000954 1KV_picorna_000475

Luteoviridae 1KV_flavi_000171 JQ898334-Picalivirus A

1KV_picorna_000684 JQ898336-Picalivirus C 1KV_picorna_000795 KF478837-Picalivirus D 1KV_picorna_000776

1KV_picorna_000289 JQ898335-Picalivirus B 1KV_picorna_000293 1KV_nido_000069 1KV_nido_000070 XVI 1KV_mono_000303 1KV_picorna_000531

1KV_picorna_000366 1KV_picorna_000987 GU017972-Solenopsis invicta virus 3 KJ420969-Arivirus 1 1KV_picorna_000521 Dicistro- 1KV_picorna_000522 1KV_mono_000301 1KV_picorna_000545 1KV_picorna_000946 1KV_picorna_000511 1KV_picorna_000106

1KV_picorna_001055 NC_003787-Apple_latent_spherical_virus KM017736-Fesavirus 1 1KV_picorna_001060 VIII NC_023162-Carp picornavirus NC_001632-Rice_tungro_spherical_virus-TS 1KV_picorna_000369 1KV_picorna_000905 NC_023437-FatheadNC_018506-Bluegill minnow picornavirus picornavirus NC_025479-Carrot_torradovirus_1 NC_003785-Satsuma_dwarf_virus-TS 1KV_picorna_000207 NC_003976-Ljungan virus 1KV_picorna_000935 viridae NC_022332-Eel picornavirus 1 1KV_picorna_000769 NC_003799-Squash_mosaic_virus NC_006271-cherry_rasp_leaf_virus-TS NC_021482-Sebokele virus 1 NC_016443-Chocolate_lily_virus_A 1KV_nido_000007 1KV_picorna_000322 NC_003549-Cowpea_mosaic_virus-TS NC_003003-Broad_bean_wilt_virus_2 1KV_picorna_000541 NC_025474-Crohivirus NC_003628-Parsnip_yellow_fleck_virus-TS NC_009013-Tomato_torrado_virus-TS 1KV_mono_000278 NC_003626-Maize_chlorotic_dwarf_virus NC_003496-Bean_mottle_virus

DQ675190-Lettuce mottle virus 1KV_picorna_000163 NC_015492-Grapevine_Bulgarian_latent_virus NC_003445-Strawberry_mottle_virus NC_010709-Radish_mosaic_virus

NC_023016-Lamium_mild_mosaic_virusEF063641-Tomato apex necrosis virus 1KV_nido_000020

NC_008250-Duck_hepatitis_A_virus-TS NC_020898-Arracacha_virus_B NC_006964-Strawberry_latent_ringspot_virus

NC_015414-Cherry_leaf_roll_virus

NC_003509-Blackcurrant_reversion_virus

AB518485-Melon_mild_mottle_virus

NC_003840-Tomato_ringspot_virus

1KV_picorna_000405 NC_001489-Hepatitis_A_virus-TS 1KV_picorna_000802 NC_001897-Human_parechovirus-TS 1KV_picorna_000041 1KV_nido_000021 1KV_picorna_000845 NC_005289-Broad_bean_wilt_virus-TS NC_003615-Grapevine_fanleaf_virus NC_024766-Chicken picornavirus 2 1KV_mono_000248 (str i pped) 1KV_picorna_000754 AB649296-Blueberry_latent_spherical_virus KF006989-Ferret parechovirus 1KV_picorna_000236 1KV_picorna_000446 1KV_picorna_000764

1KV_picorna_000952 NC_024489-Asterionellopsis_glacialis_RNA_virus (stripped... 1KV_picorna_000284 1KV_picorna_000494 NC_025432-Chicken orivirus 1 NC_024767-Chicken picornavirus 3 KC904083-Mulberry_mosaic_roll_leaf-associated_virus (str... 1KV_picorna_000394 1KV_picorna_000229 1KV_picorna_000074 NC_005266-Rasberry_ringspot_virus KC614703-Turkey 1KV_nido_000005 1KV_picorna_000854 NC_023985-Duck picornavirusKC935379-Kunsagivirus GL/12 1 1KV_picorna_000199 1KV_picorna_000359 NC_025836-Fisavirus 1 1KV_picorna_000719 1KV_picorna_000771 NC_003791-Cycas_necrotic_stunt_virus

NC_003990-Avian_encephalomyelitis_virus-TS NC_003622-Grapevine_chrome_mosaic_virus

1KV_picorna_000559 NC_018613-Rhizosolenia_setigera_RNA_virus 1KV_picorna_000246 NC_009891-Seal_picornavirus-TS NC_012957-Salivirus NG-J1

1KV_picorna_000993 1KV_picorna_000235 1KV_picorna_000730

KM259923-Pasivirus_A-TS AB375474-Choetoceros_tenuissimus_RNA_virus

1KV_picorna_000891 (str i pped) 1KV_nido_000010

1KV_picorna_000069 1KV_nido_000023 1KV_picorna_000940 NC_018400-Turkey_gallivirus-TS

NC_023638-Posavirus 2 NC_007522-Schizochytrium_ss_RNA_virus

XV C rhinovirus -Human NC_009996 NC_023637-Posavirus 1 NC_014411-Turdivirus_1-TS NC_012212-Chaetoceros_socialis_f._radians_RNA_virus (str... NC_023861-Sicinivirus_1-TS

1KV_picorna_000406 C3 rhinovirus EF186077-Human NC_023987- A2

NC_001617-Human rhinovirus 89

NC_003988-Simian enterovirus A enterovirus NC_003988-Simian NC_014412-Turdivirus_2-TS B picornavirus NC_015626-Pigeon 1KV_picorna_000703 NC_001918-Aichi_virus-TS 1KV_picorna_000146

NC_002058-Poliovirus-TS 1KV_picorna_000421

FJ445113-Human rhinovirus 8 strain 8 FJ445113-Humanrhinovirus 1KV_picorna_000547 NC_016403-Quail picornavirus NC_016403-Quail

KC811837-Pigeon mesivirus 1KV_picorna_000409 NC_022802-Feline_Sakobuvirus_A-TS 1KV_picorna_0001451KV_picorna_000149 NC_005097-Tobacco_ringspot_virus-TS

1KV_picorna_000164 NC_024070-Rosavirus_2-TS NC_003982-Equine rhinitis A virus NC_015934-Bat picornavirus 3 NC_001490-Human rhinovirus 14 rhinovirus NC_001490-Human NC_025890-Tortoise picornavirus 1KV_picorna_0001661KV_picorna_000408 NC_024120-Duck megrivirus

1KV_picorna_000165

NC_001859-Bovine enterovirus NC_001859-Bovine NC_001612-Human enterovirus A enterovirus NC_001612-Human NC_023988-Tortoise Rafivirus A 1KV_picorna_000178 NC_026314-RabovirusA 1KV_picorna_000753 NC_010354-Bovine rhinitis B virus

NC_005281-Heterosigma akashiwo RNA virus JN936206-Bovine rhinitis A NC_012800-Cosavirus_A-TS

NC_006553-Avian sapelovirus NC_001366-Theilovirus

NC_021201-Turkey_hepatitis_virus-TS HM480374-Calhevirus 1 HM480374-Calhevirus

NC_026315-Lesavirus 1

NC_004451-Simian sapelovirus 1 NC_015940-Bat picornavirus 1 NC_021178-Canine_picodicistrovirus-TS NC_004004-Foot-and-mouth_disease_virus-TS Labyrnavirus IX

NC_026470-African bat icavirusJQ864242-Boone A cardiovirus NC_025961- JMY-2014 NC_016156-Feline picornavirus JX291115-Human rhinovirus C isolate JAL-1 isolate C rhinovirus JX291115-Human

NC_001479-Encephalomyocarditis_virus-TS 1KV_picorna_000450 XIII NC_003987-Porcine_sapelovirus-TS 1KV_picorna_000928 XIV NC_016964-Canine picornavirus 1KV_picorna_000643

JQ814851-Miniopterus_schreibersii_picornavirus_1-TS (str... 1KV_picorna_000311

NC_003985-Porcine_teschovirus-TS NC_026249-Bovine picornavirus

Marnaviridae NC_018668-Bovine-hungarovirus_1-TS

NC_011349-Seneca_valley_virus-TS

1KV_picorna_000315 1KV_picorna_000956 KM017738-Fesavirus 3 KM017738-Fesavirus

PicornaviridaeNC_003983-Equine_rhinitis_B_virus-TS Bacillarnavirus

X 1KV_picorna_000957

XII XI

Figure 4: Picornavirus-like Viruses

1.5 Togavirus-like Superfamily

[Phylogenetic tree image: Togavirus-like superfamily (+), scale bar 0.4. Tip labels are sequences labelled 1KV_* and GenBank/RefSeq reference sequences; clades are numbered I to XXI. Group annotations legible in the extracted figure text: Cile- & Higrevirus, Nelorpivirus, Sandewavirus, Rubivirus, Benyviridae and Closteroviridae.]

Figure 5: Togavirus-like Superfamily

1.6 Nidovirales-like Viruses

[Phylogenetic tree image: Nidovirales-like virus, scale bar 0.3. Tip labels are sequences labelled 1KV_* (some with the suffix _rdrp_orf) and GenBank/RefSeq reference sequences; clades are numbered I to VI. Group annotations legible in the extracted figure text: Coronavirinae, Torovirinae, Mesoniviridae, Roniviridae and Totiviridae. Two curved labels, apparently "Nidovirales (ssRNA pos)" and a dsRNA virus label, are garbled in the extraction.]

Figure 6: Nidovirales-like Viruses

2 TRAVIS - True Positive Details

2.1 INSnfrTABRAAPEI-14

This transcriptome contained the start region of the RdRp segment and the end region of a capsid protein. It also contained a sequence similar to a virus cell attachment protein. This match is questionable, however, because it belongs to a group of sequences that caused many of the false positives (see chapter 3.2.2).

Table 1: Sample Information of INSnfrTABRAAPEI-14.

Filename: 120107_I247_FCD0KMHACXX_L8_INSnfrTABRAAPEI-14.free.fas
Assembly ID: INSnfrTABRAAPEI-14
Order: Hemiptera
Order details: Heteroptera
Family: Pleidae
Family details: NA
Species: Plea minutissima
Number of specimens: 60
Stage: adult
Sample location: Germany, Lower Saxony, Lüchow-Dannenberg, Höhbeck, Pevestorf
Sample date: Aug-2011
Blood-feeding: no
Suspicious sequences: 18
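The sample tables in this chapter list a filename and an assembly ID, and in the listed examples the assembly ID is the part of the filename between the lane token and the `.free.fas` suffix. The following is a minimal sketch of that relation, not part of the original workflow. The interpretation of the leading tokens (run date, instrument, flowcell, lane) and the function name `parse_filename` are assumptions made for illustration.

```python
import re

# Assumed layout (inferred from the filenames in the sample tables only):
# <6-digit run date>_<instrument>_<flowcell>_<lane>_<assembly ID>.free.fas
FILENAME_PATTERN = re.compile(
    r"^(?P<run_date>\d{6})_(?P<instrument>[^_]+)_(?P<flowcell>[^_]+)"
    r"_(?P<lane>L\d+)_(?P<assembly_id>.+)\.free\.fas$"
)


def parse_filename(filename: str) -> dict:
    """Split an assembly filename into its (assumed) components."""
    match = FILENAME_PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unexpected filename layout: {filename!r}")
    return match.groupdict()


if __name__ == "__main__":
    parts = parse_filename("120107_I247_FCD0KMHACXX_L8_INSnfrTABRAAPEI-14.free.fas")
    print(parts["assembly_id"])
```

A filename that does not follow the assumed layout raises a `ValueError` instead of returning a partial result, so deviations from the pattern are noticed rather than silently mis-parsed.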

Table 2: Suspicious Sequences in INSnfrTABRAAPEI-14. 2 of 18 sequences were true positives, 2 were questionable and 14 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID | ORF | Match | Identity | Completeness
(?) C129227_a_11_0_l_843 | ORF_005 | 1. sigma 1, Mammalian Orthoreovirus (JQ412761) | 18% | partial (start)
 | | 2. segment 1, Nelson bay reovirus (AF218360) | 26% | partial (start)
C133938_a_8_0_l_1034 | ORF_003 | 1. RdRp, Rice gall dwarf virus (DQ494209) | 36% | partial (start)
 | | 2. RdRp, Homalodisca vitripennis reovirus (GU362064) | 34% | partial (start)
C138664_a_11_0_l_1299 | ORF_006 | 1. minor outer capsid, Rice gall dwarf virus (AY556484) | 27% | partial (end)
 | | 2. segment 2, Homalodisca vitripennis reovirus (GU369683, GU384984) | 27% | partial (end)
(?) C143329_a_10_0_l_1751 | ORF_005 | 1. segment 10, Dendrolimus punctatus cypovirus (YP_009111319) | 30% | partial
 | | 2. poly ADP-ribose glycohydrolases (GBGD01000886, XM_012431649) | 40-50% | partial

Figure 7: Sequence Organization of INSnfrTABRAAPEI-14.

2.2 INSnfrTADRAAPEI-16

This transcriptome did not contain a valid true positive RdRp. However, a sequence similar to Hubei chuvirus-like virus 3 and Lishi Spider Virus 1 was found, with a similar sequence organization.

Table 3: Sample Information of INSnfrTADRAAPEI-16.

Filename: 120107_I247_FCD0KMHACXX_L8_INSnfrTADRAAPEI-16.free.fas
Assembly ID: INSnfrTADRAAPEI-16
Order: Coleoptera
Order details: NA
Family: Dytiscidae
Family details: NA
Species: Cybister lateralimarginalis
Number of specimens: 1
Stage: adult
Sample location: Germany, Lower Saxony, Lüchow-Dannenberg, Höhbeck, Pevestorf
Sample date: 27-Aug-2011
Blood-feeding: no
Suspicious sequences: 16

Table 4: Suspicious Sequences in INSnfrTADRAAPEI-16. 1 of 16 sequences was true positive and 15 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID | ORF | Match | Identity | Completeness
C127169_a_60_0_l_4158 | ORF_001 | 1. glycoprotein, Hubei chuvirus-like virus 3 (NC_033015) | 47% | full
 | | 2. glycoprotein, Lishi Spider Virus 1 (KM817596) | 36% | full
C127169_a_60_0_l_4158 | ORF_002 | 1. hypothetical protein, Hubei chuvirus-like virus 3 (NC_033015) | 41% | full
 | | 2. nucleoprotein, Lishi Spider Virus 1 (KM817596) | 39% | full

Figure 8: Sequence Organization of INSnfrTADRAAPEI-16.

2.3 INSnfrTAIRAAPEI-21

In this transcriptome, a full RdRp was detected as well as seven other fragmentary segments. One of the other segments (VP2) might also be nearly complete: it was recovered as two contigs, one matching the start and one matching the end of VP2, so that only a central connecting part is missing.
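The reasoning about two contigs separated by a missing central part can be illustrated with a small interval check. This is a sketch under stated assumptions, not the procedure used in this work: the hit coordinates are hypothetical, positions are taken as 1-based inclusive coordinates on the reference protein, and `central_gap` is an illustrative helper name.

```python
from typing import Optional, Tuple

Interval = Tuple[int, int]  # (start, end), 1-based inclusive, on the reference protein


def central_gap(first: Interval, second: Interval) -> Optional[Interval]:
    """Return the reference region left uncovered between two hits.

    Returns None when the hits overlap or are directly adjacent,
    i.e. when there is no missing connecting part.
    """
    left, right = sorted([first, second])
    if right[0] <= left[1] + 1:
        return None
    return (left[1] + 1, right[0] - 1)


if __name__ == "__main__":
    # Hypothetical coordinates: one contig covers the start, one the end.
    print(central_gap((1, 400), (650, 1200)))
```

Two hits that leave a gap like this are compatible with a single segment that was assembled into two pieces; overlapping hits would instead need a closer look at the alignment.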

Table 5: Sample Information of INSnfrTAIRAAPEI-21.

Filename: 120107_I247_FCD0KMHACXX_L8_INSnfrTAIRAAPEI-21.free.fas
Assembly ID: INSnfrTAIRAAPEI-21
Order: Collembola
Order details: NA
Family: Neanuridae
Family details: NA
Species: Anurida maritima
Number of specimens: ca. 100
Stage: adult
Sample location: Netherlands, North Holland, Texel, Ferry Bay
Sample date: 01-Sep-2011
Blood-feeding: no
Suspicious sequences: 20

Table 6: Suspicious Sequences in INSnfrTAIRAAPEI-21. 8 of 20 sequences were true positives, 1 was questionable and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID | ORF | Match | Identity | Completeness
C87378_a_10_0_l_1411 | ORF_013 | VP3, Eyach virus (KU163323, NC_003698) | 26% | partial (start-mid)
(?) C88412_a_5_0_l_1508 | ORF_003 | 1. Dendrolimus punctatus cypovirus (NC_025838) | 36% | partial (end)
 | | 2. hypothetical poly ADP-ribose glycohydrolases (XM_012814485, XM_014007017, XM_018466349) | 33% | partial (end)
C91152_a_9_0_l_1820 | ORF_003 | segment 10, Eyach virus (NC_003705, KU163330) | 25% | full
C91298_a_4_0_l_1840 | ORF_005 | VP2, Eyach virus (EU789380) | 22% | partial (start)
C91402_a_6_0_l_1853 | ORF_003 | VP2, Eyach virus (EU789378, EU789378, KU163322, NC_003697) | 30% | partial (end)
C92366_a_11_0_l_1992 | ORF_004 | segment 3, Eyach virus (EU789381, KU163323, NC_003698) | 25% | partial (start-mid)
C94214_a_4_0_l_2312 | ORF_005 | segment 4, Eyach virus (KU163324, NC_003699) | 30% | full
C98610_a_10_0_l_4248 | ORF_016 | RdRp, Eyach virus (NC_003696) | 34% | full
s368_L_360_0_a_7_6_l_809 | ORF_003 | sigma-c like virus cell attachment protein, Cangyuan orthoreovirus (NC_025806) | 26% | partial (start)
s6362_L_12907_0_a_8_4_l_1869 | ORF_004 | segment 6, Dendrolimus punctatus cypovirus (NC_025850) | 21% | full

Figure 9: Sequence Organization of INSnfrTAIRAAPEI-21.

2.4 INSnfrTAORAAPEI-35

In this transcriptome, four small fragments of different segments were detected. Three of them are potentially related to Operophtera brumata reovirus.

Table 7: Sample Information of INSnfrTAORAAPEI-35.

Filename: 120126_I283_FCD0L80ACXX_L1_INSnfrTAORAAPEI-35.free.fas
Assembly ID: INSnfrTAORAAPEI-35
Order: Hemiptera
Order details: Heteroptera
Family: Veliidae
Family details: NA
Species: Velia caprai
Number of specimens: 20
Stage: adult
Sample location: Germany, Lower Saxony, Höhbeck, Pevestorf
Sample date: 12-Aug-2011
Blood-feeding: no
Suspicious sequences: 13

Table 8: Suspicious Sequences in INSnfrTAORAAPEI-35. 4 of 13 sequences were true positives and 9 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID | ORF | Match | Identity | Completeness
C46324_a_5_0_l_227 | ORF_001 | segment 3, Operophtera brumata reovirus (NC_007561) | 33% | partial
C50775_a_3_0_l_247 | ORF_001 | 1. segment 6, Dendrolimus punctatus cypovirus (NC_025850) | 33% | partial
 | | 2. glycoprotein, Changping Tick Virus 2 (NC_028260) | 38% | partial
C67213_a_3_0_l_384 | ORF_002 | RdRp, Operophtera brumata reovirus (NC_007559) | 42% | partial
C94877_a_63_0_l_1501 | ORF_005 | 1. Hubei diptera virus 21 (KX884697) | 29% | partial
 | | 2. segment 2, Operophtera brumata reovirus (NC_007560) | 28% | partial

Figure 10: Sequence Organization of INSnfrTAORAAPEI-35.

2.5 INSnfrTATRAAPEI-43

In this transcriptome, only a single small, fragmentary RdRp-matching sequence has been identified. It matches several other RdRps at a similar location with about 50-55% identity at the amino acid level and is therefore considered a true positive.
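To illustrate what an identity value such as the 50-55% quoted here expresses, the following Python sketch computes the percentage of identical residues between two aligned amino acid sequences. It is a generic illustration and not the procedure used by TRAVIS or its underlying search tools, which may define the denominator differently. The function name `percent_identity` and the toy sequences are assumptions made for this example only.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns that hold the same residue.

    Both inputs must come from the same pairwise alignment and therefore
    have equal length. Columns in which both sequences carry a gap are
    ignored; a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b)
        for a, b in zip(aligned_a, aligned_b)
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    identical = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * identical / len(columns)


# Hypothetical toy alignment (not taken from the dataset).
print(round(percent_identity("MKT-LLV", "MRTALLI"), 1))
```

Under this definition a short fragment can reach a high identity by chance, which is why the text also takes the matching location within the reference RdRps into account.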

Table 9: Sample Information of INSnfrTATRAAPEI-43.

Filename: 120126_I283_FCD0L80ACXX_L1_INSnfrTATRAAPEI-43.free.fas
Assembly ID: INSnfrTATRAAPEI-43
Order: Neuroptera
Order details: NA
Family: Hemerobiidae
Family details: NA
Species: Micromus variegatus
Number of specimens: 12
Stage: adult
Sample location: Germany, North Rhine-Westphalia, Bonn, Zoological Research Museum A Koenig ZFMK
Sample date: Jun-2011
Blood-feeding: no
Suspicious sequences: 19

Table 10: Suspicious Sequences in INSnfrTATRAAPEI-43. 1 of 19 sequences was true positive and 18 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C94914_a_4_0_l_237 ORF_001 1. RdRp, Morris orbivirus (KX907618) 55% partial (mid)

Figure 11: Sequence Organization of INSnfrTATRAAPEI-43.

2.6 INSnfrTBERAAPEI-19

In this transcriptome, two small fragments probably related to Rice dwarf virus have been detected.

Table 11: Sample Information of INSnfrTBERAAPEI-19.

Filename: 120126_I283_FCD0L80ACXX_L2_INSnfrTBERAAPEI-19.free.fas
Assembly ID: INSnfrTBERAAPEI-19
Order: Coleoptera
Order details: NA
Family: Gyrinidae
Family details: NA
Species: Gyrinus marinus
Number of specimens: 12
Stage: adult
Sample location: Germany, Lower Saxony, Höhbeck, Pevestorf
Sample date: 11-Aug-2011
Blood-feeding: no
Suspicious sequences: 12

Table 12: Suspicious Sequences in INSnfrTBERAAPEI-19. 2 of 12 sequences were true positives and 10 sequences were false positives similar to the false positives listed in chapter 3.2.2. The two true positive sequences match two distinct areas of the reference RdRps, suggesting that they belong together.

Sequence ID ORF Match Identity Completeness

C92930_a_4_0_l_564 ORF_002 RdRp, Rice dwarf virus (NC_009248) 39% partial (start)

C98326_a_8_0_l_737 ORF_001 RdRp, Rice dwarf virus (NC_009248) 33% partial (mid)

Figure 12: Sequence Organization of INSnfrTBERAAPEI-19.

2.7 INSytvTABRAAPEI-11

Table 13: Sample Information of INSytvTABRAAPEI-11.

Filename: 120429_I266_FCC0HG0ACXX_L7_INSytvTABRAAPEI-11.free.fas
Assembly ID: INSytvTABRAAPEI-11
Order: Hymenoptera
Order details: NA
Family: Mutillidae
Family details: NA
Species: Smicromyrme rufipes
Number of specimens: 21
Stage: adult
Sample location: Germany, Rhineland-Palatinate, Birkenheide
Sample date: 22-May-2011
Blood-feeding: no
Suspicious sequences: 23

Table 14: Suspicious Sequences in INSytvTABRAAPEI-11. 2 of 23 sequences were questionable and 21 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C64493_a_4_0_l_495 ORF_001 RdRp, Hubei reo-like virus 14 (KX884702) 49% partial (mid)

(?) C84625_a_5_0_l_1012 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% partial (start)

Figure 13: Sequence Organization of INSytvTABRAAPEI-11.

2.8 INSytvTALRAAPEI-35

In this transcriptome four full segments similar to Hubei reo-like virus 6 have been identified.

Table 15: Sample Information of INSytvTALRAAPEI-35.

Filename: 120429_I266_FCC0HG0ACXX_L8_INSytvTALRAAPEI-35.free.fas
Assembly ID: INSytvTALRAAPEI-35
Order: Hemiptera
Order details: Sternorrhyncha
Family: Triozidae
Family details: NA
Species: Acanthocasuarina muellerianae
Number of specimens: few
Stage: not determined
Sample location: Australia, South Australia, Kangaroo Island, Sedden Conservation Park
Sample date: 09-Feb-2012
Blood-feeding: no
Suspicious sequences: 10

Table 16: Suspicious Sequences in INSytvTALRAAPEI-35. 4 of 10 sequences were true positives, 1 questionable and 5 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

C203002_a_61_0_l_2753 ORF_002 hypothetical protein, Hubei reo-like virus 6 (KX884719) 26% full

C203574_a_61_0_l_3410 ORF_003 hypothetical protein, Hubei reo-like virus 6 (KX884718) 26% full

C203812_a_56_0_l_4203 ORF_001 RdRp, Hubei reo-like virus 6 (KX884716) 30% full

C203826_a_57_0_l_4320 ORF_003 hypothetical protein, Hubei reo-like virus 6 (KX884717) 25% full

(?) s10555_L_43618_0_a_47_0_l_7342 ORF_003 hypothetical protein, Dendrolimus punctatus cypovirus 22 (NC_025850) 25% full

Figure 14: Sequence Organization of INSytvTALRAAPEI-35.
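The counts stated in the table captions of this chapter (true positives, questionable sequences and false positives out of a total) can be cross-checked mechanically. The following Python sketch is an illustration only and not part of TRAVIS; the function names `caption_counts` and `is_consistent` and the regular expressions are assumptions made for this example. It extracts the total and the per-category counts from a caption and tests whether the categories add up to the total, using the caption of Table 16 as input.

```python
import re

# A number followed (without an intervening digit or full stop) by a category name.
CATEGORY = re.compile(r"(\d+)[^\d.]*?(true positive|questionable|false positive)")


def caption_counts(caption):
    """Parse an 'N of TOTAL sequences ...' caption into (total, {category: count})."""
    head = re.search(r"\d+ of (\d+) sequences", caption)
    if head is None:
        raise ValueError("caption does not state 'N of TOTAL sequences'")
    total = int(head.group(1))
    # Drop the 'of TOTAL' part so that TOTAL is not mistaken for a category count.
    text = caption.replace(" of " + head.group(1), "", 1)
    counts = {}
    for number, category in CATEGORY.findall(text):
        counts[category] = counts.get(category, 0) + int(number)
    return total, counts


def is_consistent(caption):
    """True if the per-category counts add up to the stated total."""
    total, counts = caption_counts(caption)
    return sum(counts.values()) == total


caption = (
    "Table 16: Suspicious Sequences in INSytvTALRAAPEI-35. 4 of 10 sequences "
    "were true positives, 1 questionable and 5 sequences were false positives "
    "similar to the false positives listed in chapter 3.2.2."
)
print(caption_counts(caption), is_consistent(caption))
```

The pattern is deliberately tied to the caption wording used in this chapter; captions phrased differently would need an adapted expression.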

2.9 INSytvTBORAAPEI-47

In this transcriptome, two fragments similar to Bloomfield virus have been identified. In addition, two small fragments similar to segment 6 of Dendrolimus punctatus cypovirus 22 were classified as questionable hits.

Table 17: Sample Information of INSytvTBORAAPEI-47.

Filename: 120521_I249_FCC0U4RACXX_L7_INSytvTBORAAPEI-47.free.fas
Assembly ID: INSytvTBORAAPEI-47
Order: Hymenoptera
Order details: NA
Family: Melittidae
Family details: NA
Species: Macropis fulvipes
Number of specimens: 8
Stage: adult
Sample location: Germany, Rhineland-Palatinate, Albersweiler
Sample date: 2011
Blood-feeding: no
Suspicious sequences: 15

Table 18: Suspicious Sequences in INSytvTBORAAPEI-47. 2 of 15 sequences were true positives, 2 were questionable and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C31726_a_6_0_l_364 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 36% partial (end)

(?) C39651_a_7_0_l_539 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 35% partial (mid)

C46583_a_3_0_l_801 ORF_002 RdRp, Bloomfield virus (KP714090) 31% partial (end)

C62529_a_11_0_l_2839 ORF_007 RdRp, Bloomfield virus (KP714090) 35% partial (start-mid)

Figure 15: Sequence Organization of INSytvTBORAAPEI-47.

2.10 INSswpTBBRAAPEI-21

This transcriptome contained a large part of the RdRp segment similar to Hubei reo-like virus 14.

Table 19: Sample Information of INSswpTBBRAAPEI-21.

Filename: 120707_I249_FCD111GACXX_L3_INSswpTBBRAAPEI-21.free.fas
Assembly ID: INSswpTBBRAAPEI-21
Order: Hymenoptera
Order details: NA
Family: Apidae
Family details: NA
Species: Epeolus variegatus
Number of specimens: 3
Stage: adult
Sample location: Italy, Sardinia, SW Santa Teresa Gallura
Sample date: 06-Sep-2011
Blood-feeding: no
Suspicious sequences: 14

Table 20: Suspicious Sequences in INSswpTBBRAAPEI-21. 1 of 14 sequences was true positive and 13 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C95104_a_11_0_l_3069 ORF_012 RdRp, Hubei reo-like virus 14 (KX884607) 37% partial (start-mid)

Figure 16: Sequence Organization of INSswpTBBRAAPEI-21.

2.11 INSeqtTAHRAAPEI-88

This transcriptome contained many small ORFs similar to segment 6 of Dendrolimus punctatus cypovirus 22 but no true positive segment similar to an RdRp.

Table 21: Sample Information of INSeqtTAHRAAPEI-88.

Filename: 121010_I249_FCD1C4BACXX_L6_INSeqtTAHRAAPEI-88.free.fas
Assembly ID: INSeqtTAHRAAPEI-88
Order: Hymenoptera
Order details: NA
Family: Chalcididae
Family details: NA
Species: Brachymeria minuta
Number of specimens: 4
Stage: adult
Sample location: Germany, Hessen, Osthofen
Sample date: 28-Jun-2012
Blood-feeding: no
Suspicious sequences: 42

Table 22: Suspicious Sequences in INSeqtTAHRAAPEI-88. 16 of 42 sequences were questionable and 26 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C106753_a_3_0_l_390 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% partial (start)

(?) C113935_a_11_0_l_454 ORF_002 segment 8, Kadipiro virus chromosome (NC_004208) 22% partial (start)

(?) C117709_a_23_0_l_494 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid)

(?) C127403_a_4_0_l_631 ORF_002 hypothetical protein, Hubei diptera virus 21 (KX884697) 38% partial (mid)

(?) C127695_a_3_0_l_636 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 29% partial (end)

(?) C133384_a_8_0_l_761 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 26% partial (start)

(?) C136012_a_8_0_l_834 ORF_001 hypothetical protein, Hubei diptera virus 21 (KX884697) 34% partial (mid)

(?) C142742_a_10_0_l_1115 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 26% partial (end)

(?) C144414_a_9_0_l_1213 ORF_006 hypothetical protein, Hubei diptera virus 21 (KX884697) 33% partial (mid)

(?) C68927_a_4_0_l_205 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 27% partial (mid)

(?) C76527_a_6_0_l_233 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 27% partial (start)

(?) C77495_a_5_0_l_235 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 38% partial (start)

(?) C82049_a_3_0_l_246 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 40% partial (mid)

(?) C90211_a_17_0_l_280 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 43% partial (mid)

(?) C91643_a_21_0_l_288 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (end)

(?) C95097_a_3_0_l_310 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 37% partial (start)

Figure 17: Sequence Organization of INSeqtTAHRAAPEI-88.

2.12 INShkeTCLRAAPEI-44

Table 23: Sample Information of INShkeTCLRAAPEI-44.

Filename: 121014_I189_FCC173YACXX_L1_INShkeTCLRAAPEI-44.free.fas
Assembly ID: INShkeTCLRAAPEI-44
Order: Blattodea
Order details: NA
Family: Blattidae
Family details: NA
Species: Deropeltis erythrocephala
Number of specimens: 1
Stage: adult
Sample location: Germany, Lab culture with Samples originating from Germany, private breeder
Sample date: 13-Mar-2011
Blood-feeding: no
Suspicious sequences: 6

The sequence listed below is questionable because it does not form a continuous ORF.

Table 24: Suspicious Sequences in INShkeTCLRAAPEI-44. 1 of 6 sequences was questionable and 5 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) s2137_L_2837_0_a_4_2_l_945 ORF_003 RdRp, Yunnan orbivirus (NC_007656) 31% partial (mid)

Figure 18: Sequence Organization of INShkeTCLRAAPEI-44.

2.13 INSeqtTBNRAAPEI-11

This transcriptome contained several near full segments similar to Rice dwarf virus.

Table 25: Sample Information of INSeqtTBNRAAPEI-11.

Filename: 121030_I251_FCC19KWACXX_L1_INSeqtTBNRAAPEI-11.free.fas
Assembly ID: INSeqtTBNRAAPEI-11
Order: Odonata
Order details: Zygoptera
Family: Lestidae
Family details: NA
Species: Indolestes peregrinus
Number of specimens: 2
Stage: adult
Sample location: Japan, Nagano, Ueda
Sample date: 27-May-2012
Blood-feeding: no
Suspicious sequences: 13

Table 26: Suspicious Sequences in INSeqtTBNRAAPEI-11. 6 of 13 sequences were true positives and 7 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C106661_a_58_0_l_2619 ORF_001 minor core structural protein, Rice dwarf virus (NC_009247) 30% full

C107435_a_53_0_l_3182 ORF_015 major core structural protein, Rice dwarf virus (NC_009243) 21% full

C107553_a_48_0_l_3308 ORF_007 RdRp, Rice dwarf virus (NC_003773) 31% partial (mid-end)

C107715_a_60_0_l_3560 ORF_001 minor core structural protein, Rice dwarf virus (NC_003774) 25% full

C91752_a_32_0_l_905 ORF_002 RdRp, Rice dwarf virus (NC_003773) 29% partial (start)

C96118_a_13_0_l_1114 ORF_001 nonstructural protein, Rice dwarf virus (NC_003766) 24% full

Figure 19: Sequence Organization of INSeqtTBNRAAPEI-11.

2.14 INSeqtTCJRAAPEI-20

This transcriptome contained several partial fragments similar to Bloomfield virus.

Table 27: Sample Information of INSeqtTCJRAAPEI-20.

Filename: 121030_I251_FCC19KWACXX_L1_INSeqtTCJRAAPEI-20.free.fas
Assembly ID: INSeqtTCJRAAPEI-20
Order: Raphidioptera
Order details: NA
Family: Raphidiidae
Family details: NA
Species: Ornatoraphidia flavilabris
Number of specimens: 3
Stage: larva
Sample location: Austria, Lab culture with Samples originating from Greece, Arcadia Mainalon, Mountains Levidi, Kardaras
Sample date: 20-May-2012
Blood-feeding: no
Suspicious sequences: 33

Table 28: Suspicious Sequences in INSeqtTCJRAAPEI-20. 5 of 33 sequences were true positives, 1 questionable and 27 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

C234593_a_7_0_l_724 ORF_003 RdRp, Bloomfield virus (KP714090) 34% partial (end)

C251492_a_47_0_l_2048 ORF_005 minor core structural protein, Bloomfield virus (KP714094) 24% partial (mid-end)

(?) C251660_a_4_0_l_2101 ORF_003 1. glycoprotein, Lishi Spider Virus 1 (KM817596) 34% partial (start) 2. hypothetical protein, Dendrolimus punctatus cypovirus 22 (NC_025850) 28% partial (start)

C252938_a_26_0_l_2905 ORF_007 RdRp, Bloomfield virus (KP714090) 36% partial (start-mid)

C253318_a_53_0_l_3951 ORF_009 major core protein, Bloomfield virus (KP714091) 26% full

s15123_L_40031_0_a_13_9_l_1642 ORF_001 glycoprotein, Wuchang Cockroach Virus 3 (KM817605) 34% partial (mid-end)

Figure 20: Sequence Organization of INSeqtTCJRAAPEI-20.

2.15 INSeqtTCZRAAPEI-47

This transcriptome contained several small fragments similar to Liao ning virus. However, a near full RdRp segment could be assembled from three fragments.

Table 29: Sample Information of INSeqtTCZRAAPEI-47.

Filename: 121030_I251_FCC19KWACXX_L1_INSeqtTCZRAAPEI-47.free.fas
Assembly ID: INSeqtTCZRAAPEI-47
Order: Plecoptera
Order details: NA
Family: Nemouridae
Family details: Nemourinae
Species: Protonemura ausonia
Number of specimens: ca. 7
Stage: nymph
Sample location: Italy, Viterbo, Arlena River
Sample date: 01-Mar-2007
Blood-feeding: no
Suspicious sequences: 26

Table 30: Suspicious Sequences in INSeqtTCZRAAPEI-47. 12 of 26 sequences were true positives, 3 were questionable and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C145258_a_4_0_l_204 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 51% partial (mid)

C148370_a_3_0_l_209 ORF_001 segment 2, Liao ning virus (AY317100) 43% partial (mid)

C194142_a_3_0_l_243 ORF_001 segment 2, Liao ning virus (NC_007737) 29% partial (mid)

C266489_a_15_0_l_406 ORF_003 segment 5, Liao ning virus (NC_007740) 33% partial (end)

C267864_a_3_0_l_412 ORF_001 segment 2, Liao ning virus (NC_007737) 33% partial (mid)

C269215_a_3_0_l_418 ORF_001 segment 4, Liao ning virus (NC_007739) 35% partial (start)

C269977_a_6_0_l_422 ORF_002 RdRp, Liao ning virus (NC_007736) 40% partial (end)

(?) C280721_a_11_0_l_481 ORF_002 segment 8, Kadipiro virus (NC_004208) 28% partial (start)

(?) C305375_a_8_0_l_784 ORF_001 1. segment 11, Liao ning virus (NC_007746) 33% partial (start) 2. R2D2, Nilaparvata lugens (KC316044) 44% partial (start)

C307017_a_6_0_l_823 ORF_001 segment 3, Liao ning virus (NC_007738) 37% partial (end)

C311087_a_10_0_l_946 ORF_003 segment 5, Liao ning virus (NC_007740) 27% partial (start)

C315239_a_6_0_l_1127 ORF_002 segment 3, Liao ning virus (NC_007738) 46% partial (start)

C315997_a_5_0_l_1176 ORF_001 RdRp, Liao ning virus (NC_007736) 45% partial (mid)

C319123_a_7_0_l_1437 ORF_004 segment 6, Liao ning virus (NC_007741) 25% partial (start-mid)

C321585_a_7_0_l_1930 ORF_010 RdRp, Liao ning virus (NC_007736) 34% partial (start-mid)

Figure 21: Sequence Organization of INSeqtTCZRAAPEI-47.

2.16 INSeqtTDXRAAPEI-19

This transcriptome contained a partial segment similar to Hubei diptera virus 21 and a very small fragment of an RdRp similar to the same virus.

Table 31: Sample Information of INSeqtTDXRAAPEI-19.

Filename: 121030_I251_FCC19KWACXX_L3_INSeqtTDXRAAPEI-19.free.fas
Assembly ID: INSeqtTDXRAAPEI-19
Order: Neuroptera
Order details: NA
Family: Hemerobiidae
Family details: NA
Species: Hemerobius marginatus
Number of specimens: 3
Stage: adult
Sample location: Austria, Lower Austria, Vienna-Surroundings, Klosterneuburg
Sample date: May-2012
Blood-feeding: no
Suspicious sequences: 14

Table 32: Suspicious Sequences in INSeqtTDXRAAPEI-19. 3 of 14 sequences were true positives, 1 questionable and 10 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

C102680_a_3_0_l_570 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 46% partial (mid)

C109650_a_7_0_l_734 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 45% partial (mid)

(?) C83898_a_5_0_l_334 ORF_001 RdRp, Hubei diptera virus 21 (KX884696) 35% partial (end)

C89115_a_3_0_l_381 ORF_002 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 43% partial (start)

Figure 22: Sequence Organization of INSeqtTDXRAAPEI-19.

2.17 INSlupTBDRAAPEI-17

This transcriptome contained a near full RdRp-segment similar to Hubei reo-like virus 14.

Table 33: Sample Information of INSlupTBDRAAPEI-17.

Filename: 121221_I260_FCC1GFFACXX_L3_INSlupTBDRAAPEI-17.free.fas
Assembly ID: INSlupTBDRAAPEI-17
Order: Hymenoptera
Order details: NA
Family: Crabronidae
Family details: NA
Species: Mellinus arvensis
Number of specimens: 2
Stage: adult
Sample location: Germany, Rhineland-Palatinate, Osthofen
Sample date: 21-Jun-2012
Blood-feeding: no
Suspicious sequences: 19

Table 34: Suspicious Sequences in INSlupTBDRAAPEI-17. 1 of 19 sequences was true positive and 18 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

s5948_L_19400_0_a_33_1_l_3436 ORF_022 RdRp, Hubei reo-like virus 14 (KX884702) 39% full

Figure 23: Sequence Organization of INSlupTBDRAAPEI-17.

2.18 INSlupTBKRAAPEI-31

This transcriptome contained several near full segments similar to Rice black streaked dwarf virus.

Table 35: Sample Information of INSlupTBKRAAPEI-31.

Filename: 121221_I260_FCC1GFFACXX_L3_INSlupTBKRAAPEI-31.free.fas
Assembly ID: INSlupTBKRAAPEI-31
Order: Hemiptera
Order details: Auchenorrhyncha Fulgoromorpha
Family: Cixiidae
Family details: NA
Species: Tachycixius pilosus
Number of specimens: 6
Stage: adult
Sample location: Germany, Thuringia, Jena
Sample date: May-2012
Blood-feeding: no
Suspicious sequences: 14

Table 36: Suspicious Sequences in INSlupTBKRAAPEI-31. 6 of 14 sequences were true positives and 8 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C136255_a_5_0_l_1616 ORF_001 segment 10, Rice black streaked dwarf virus (NC_003733) 23% full

C136503_a_9_0_l_1696 ORF_008 segment 7, Rice black streaked dwarf virus (NC_003730) 25% partial (start-mid)

C137679_a_54_0_l_3342 ORF_005 core capsid protein, Rice black streaked dwarf virus (NC_003728) 23% full

C137715_a_25_0_l_3809 ORF_003 major core protein, Rice black streaked dwarf virus (NC_003734) 21% full

s6698_L_26721_0_a_19_3_l_3499 ORF_001 segment 4, Rice black streaked dwarf virus (NC_003735) 18% full

s7010_L_30471_0_a_63_6_l_4223 ORF_011 RdRp, Rice black streaked dwarf virus (NC_003729) 34% full

Figure 24: Sequence Organization of INSlupTBKRAAPEI-31.

2.19 INSlupTBMRAAPEI-34

This transcriptome contained three near full segments similar to Equine encephalosis virus.

Table 37: Sample Information of INSlupTBMRAAPEI-34.

Filename: 121221_I260_FCC1GFFACXX_L3_INSlupTBMRAAPEI-34.free.fas
Assembly ID: INSlupTBMRAAPEI-34
Order: Hemiptera
Order details: Heteroptera
Family: Aphelocheiridae
Family details: NA
Species: Aphelocheirus aestivalis
Number of specimens: 3
Stage: adult
Sample location: Germany, Thuringia, Maua
Sample date: 01-Aug-2012
Blood-feeding: no
Suspicious sequences: 14

Table 38: Suspicious Sequences in INSlupTBMRAAPEI-34. 3 of 14 sequences were true positives and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C148506_a_38_0_l_2007 ORF_003 segment 4, Equine encephalosis virus (AB811638) 29% full

C149424_a_61_0_l_3917 ORF_003 RdRp, Equine encephalosis virus (AB811635) 26% full

s7155_L_42109_0_a_61_0_l_3092 ORF_006 segment 3, Equine encephalosis virus (AB811637) 21% full

Figure 25: Sequence Organization of INSlupTBMRAAPEI-34.

2.20 INSlupTBURAAPEI-45

This transcriptome contained a small fragment of an RdRp similar to Reptilian orthoreovirus.

Table 39: Sample Information of INSlupTBURAAPEI-45.

Filename: 121221_I260_FCC1GFFACXX_L8_INSlupTBURAAPEI-45.free.fas
Assembly ID: INSlupTBURAAPEI-45
Order: Mantodea
Order details: NA
Family: Hymenopodidae
Family details: NA
Species: Acromantis sp
Number of specimens: 1
Stage: adult
Sample location: Germany, Lab culture with Samples originating from Hong Kong
Sample date: Aug-2012
Blood-feeding: no
Suspicious sequences: 8

Table 40: Suspicious Sequences in INSlupTBURAAPEI-45. 1 of 8 sequences was questionable and 7 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C142720_a_3_0_l_484 ORF_001 RdRp, Reptilian orthoreovirus (KT696549) 31% partial (mid)

Figure 26: Sequence Organization of INSlupTBURAAPEI-45.

2.21 INSlupTAFRAAPEI-44

This transcriptome contained a small fragment that might be similar to an RdRp of Rice black streaked dwarf virus and two other segments that are related to Liao ning virus and Umatilla virus.

Table 41: Sample Information of INSlupTAFRAAPEI-44.

Filename: 130112_I269_FCC1M19ACXX_L8_INSlupTAFRAAPEI-44.free.fas
Assembly ID: INSlupTAFRAAPEI-44
Order: Hemiptera
Order details: Auchenorrhyncha Cicadomorpha
Family: Aphrophoridae
Family details: NA
Species: Aphrophora alni
Number of specimens: ca. 7
Stage: adult
Sample location: Germany, Thuringia, Jena
Sample date: 30-Jul-2012
Blood-feeding: no
Suspicious sequences: 12

Table 42: Suspicious Sequences in INSlupTAFRAAPEI-44. 1 of 12 sequences was true positive, 3 questionable and 8 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C116451_a_8_0_l_385 ORF_001 RdRp, Rice black streaked dwarf virus (NC_003729) 42% partial (mid)

(?) C127494_a_7_0_l_494 ORF_003 1. segment 11, Liao ning virus (NC_007746) 29% partial (start-mid) 2. transcribed RNA sequence, Homalodisca liturata (GECU01029485) 54% partial (start-mid)

C137751_a_3_0_l_666 ORF_002 segment 5, Umatilla virus (NC_024507) 28% partial (end)

(?) C139507_a_7_0_l_712 ORF_004 1. segment 11, Liao ning virus (NC_007746) 34% partial (start) 2. transcribed RNA sequence, Homalodisca liturata (GECU01023362) 85% partial (start-mid)

Figure 27: Sequence Organization of INSlupTAFRAAPEI-44.

2.22 INSntgTABRAAPEI-216

This transcriptome contained a fragment similar to the RdRp of Hubei reo-like virus 6.

Table 43: Sample Information of INSntgTABRAAPEI-216.

Filename: 130125_I266_FCC1MY6ACXX_L3_INSntgTABRAAPEI-216.free.fas
Assembly ID: INSntgTABRAAPEI-216
Order: Neuroptera
Order details: NA
Family: Coniopterygidae
Family details: NA
Species: Coniopteryx sp
Number of specimens: 10
Stage: adult
Sample location: Austria, Lower Austria, Krems-Land, Duernstein
Sample date: 27-May-2012
Blood-feeding: no
Suspicious sequences: 12

Table 44: Suspicious Sequences in INSntgTABRAAPEI-216. 1 of 12 sequences was questionable and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C119964_a_3_0_l_381 ORF_002 RdRp, Hubei reo-like virus 6 (KX884716) 47% partial (mid)

Figure 28: Sequence Organization of INSntgTABRAAPEI-216.

2.23 INSlupTASRAAPEI-89

This transcriptome contained several near full segments of a virus similar to Fiji disease virus.

Table 45: Sample Information of INSlupTASRAAPEI-89.

Filename: 130206_I238_FCC1LVUACXX_L1_INSlupTASRAAPEI-89.free.fas
Assembly ID: INSlupTASRAAPEI-89
Order: Hemiptera
Order details: Auchenorrhyncha Fulgoromorpha
Family: Dictyopharidae
Family details: NA
Species: Dictyophara europaea
Number of specimens: ca. 10
Stage: adult
Sample location: Germany, Thuringia, Jena
Sample date: 31-Jul-2012
Blood-feeding: no
Suspicious sequences: 12

Table 46: Suspicious Sequences in INSlupTASRAAPEI-89. 8 of 12 sequences were true positives and 4 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C233648_a_60_0_l_1298 ORF_002 segment 7, Fiji disease virus (NC_007163) 25% full

C234606_a_26_0_l_1429 ORF_006 segment 9, Fiji disease virus (NC_007156) 27% full

C236042_a_45_0_l_1734 ORF_004 major outer capsid protein, Fiji disease virus (NC_007162) 22% full

C236682_a_38_0_l_1998 ORF_004 segment 6, Fiji disease virus (NC_007161) 24% full

C237738_a_43_0_l_4257 ORF_001 RdRp, Fiji disease virus (NC_007159) 33% full

s11280_L_56553_0_a_56_1_l_3431 ORF_005 segment 4, Fiji disease virus (NC_007155) 23% full

s11335_L_57448_0_a_64_5_l_3564 ORF_005 segment 3, Fiji disease virus (NC_007158) 25% full

s11389_L_58703_0_a_46_0_l_3754 ORF_001 segment 2, Fiji disease virus (NC_007154) 20% full

Figure 29: Sequence Organization of INSlupTASRAAPEI-89.

2.24 INSqiqTBFRAAPEI-61

This transcriptome contained a partial RdRp similar to the one of Hubei tetragnatha maxillosa virus 9.

Table 47: Sample Information of INSqiqTBFRAAPEI-61.

Filename: 130206_I238_FCC1LVUACXX_L1_INSqiqTBFRAAPEI-61.free.fas
Assembly ID: INSqiqTBFRAAPEI-61
Order: Hymenoptera
Order details: NA
Family: Chrysididae
Family details: Chrysidinae
Species: Praestochrysis megerlei
Number of specimens: 2
Stage: adult
Sample location: Italy, Emilia-Romagna, Parma Oriano
Sample date: 29-Aug-2012
Blood-feeding: no
Suspicious sequences: 11

Table 48: Suspicious Sequences in INSqiqTBFRAAPEI-61. 1 of 11 sequences was true positive and 10 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

s7034_L_44580_0_a_6_7_l_2092 ORF_005 RdRp, Hubei tetragnatha maxillosa virus 9 (KX884675) 30% partial (mid-end)

Figure 30: Sequence Organization of INSqiqTBFRAAPEI-61.

2.25 INSqiqTBLRAAPEI-83

This transcriptome contained a near full RdRp similar to the one of Hubei tetragnatha maxillosa virus 9 and two other segments similar to Dendrolimus punctatus cypovirus 22 and Wuchang Cockroach Virus 3.

Table 49: Sample Information of INSqiqTBLRAAPEI-83.

Filename: 130206_I238_FCC1LVUACXX_L2_INSqiqTBLRAAPEI-83.free.fas
Assembly ID: INSqiqTBLRAAPEI-83
Order: Neuroptera
Order details: NA
Family: Myrmeleontidae
Family details: NA
Species: Myrmeleon formicarius
Number of specimens: 1
Stage: adult
Sample location: Japan, Ibaraki, Tsukuba
Sample date: 25-Jun-2012
Blood-feeding: no
Suspicious sequences: 20

Table 50: Suspicious Sequences in INSqiqTBLRAAPEI-83. 3 of 20 sequences were true positives and 17 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C101877_a_3_0_l_1449 ORF_008 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 32% full

C105769_a_8_0_l_2999 ORF_010 glycoprotein, Wuchang Cockroach Virus 3 (KM817605) 31% full

C106019_a_29_0_l_3543 ORF_011 RdRp, Hubei tetragnatha maxillosa virus 9 (KX884675) 35% full

Figure 31: Sequence Organization of INSqiqTBLRAAPEI-83.

2.26 INSqiqTBNRABPEI-90

This transcriptome contained three fragments similar to the RdRp of Hubei diptera virus 21 as well as several fragments similar to other segments of the same virus. Additionally, some fragments similar to different segments of Dendrolimus punctatus cypovirus 22 were identified.

Table 51: Sample Information of INSqiqTBNRABPEI-90.

Filename: 130206_I238_FCC1LVUACXX_L3_INSqiqTBNRABPEI-90.free.fas
Assembly ID: INSqiqTBNRABPEI-90
Order: Hymenoptera
Order details: NA
Family: Torymidae
Family details: NA
Species: Bootanomyia dorsalis
Number of specimens: ca. 20
Stage: adult
Sample location: Germany, Baden-Wuerttemberg, Stuttgart, Rosensteinpark
Sample date: 19-Jul-2012
Blood-feeding: no
Suspicious sequences: 42

Figure 32: Sequence Organization of INSqiqTBNRABPEI-90.

Table 52: Suspicious Sequences in INSqiqTBNRABPEI-90. 6 of 42 sequences were true positives, 21 questionable and 15 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C32191_a_3_0_l_222 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 28% partial (mid)

(?) C32667_a_10_0_l_226 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 25% partial (mid)

(?) C35078_a_3_0_l_243 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 36% partial (mid)

(?) C38974_a_5_0_l_2633 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (start)

(?) C39945_a_4_0_l_267 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (end)

(?) C40125_a_3_0_l_269 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (start)

(?) C43181_a_6_0_l_289 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 42% partial (mid)

C46563_a_38_0_l_319 ORF_001 RdRp, Hubei diptera virus 21 (KX884696) 35% partial (start)

(?) C46813_a_3_0_l_321 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 29% partial (end)

(?) C46971_a_6_0_l_323 ORF_002 hypothetical protein 2, Hubei diptera virus 21 (KX884698) 51% partial (mid)

(?) C48774_a_7_0_l_339 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 49% partial (mid)

C51752_a_4_0_l_370 ORF_003 RdRp, Hubei diptera virus 21 (KX884696) 53% partial (mid)

(?) C54619_a_3_0_l_401 ORF_003 segment 10, Dendrolimus punctatus cypovirus 22 (NC_025838) 35% partial (mid-end)

(?) C55252_a_9_0_l_410 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 23% partial (end)

(?) C57419_a_8_0_l_440 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 21% partial (start)

C59361_a_3_0_l_469 ORF_001 RdRp, Hubei diptera virus 21 (KX884696) 53% partial (mid)

(?) C60811_a_3_0_l_493 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 44% partial (mid)

(?) C61091_a_5_0_l_496 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 26% partial (start)

(?) C61837_a_4_0_l_509 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 26% partial (mid)

(?) C63597_a_6_0_l_539 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (end)

(?) C69161_a_3_0_l_666 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 28% partial (mid)

(?) C69279_a_4_0_l_670 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (mid)

(?) C70559_a_5_0_l_703 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 33% partial (mid)

C74861_a_6_0_l_852 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 55% partial (mid)

C84762_a_5_0_l_1517 ORF_006 hypothetical protein 5, Hubei diptera virus 21 (KX884701) 47% full

s3488_L_10393_0_a_6_4_l_545 ORF_001 hypothetical protein 2, Hubei diptera virus 21 (KX884698) 66% partial (end)

(?) s4274_L_16530_0_a_3_6_l_882 ORF_001 segment 10, Dendrolimus punctatus cypovirus 22 (NC_025838) 32% partial (mid-end)

2.27 INSqiqTCTRAAPEI-75

Table 53: Sample Information of INSqiqTCTRAAPEI-75.

Filename: 130206_I238_FCC1LVUACXX_L3_INSqiqTCTRAAPEI-75.free.fas
Assembly ID: INSqiqTCTRAAPEI-75
Order: Mantodea
Order details: NA
Family: Mantidae
Family details: NA
Species: Omomantis zebrata
Number of specimens: 1
Stage: juvenile
Sample location: Germany, lab culture with samples originating from South Africa
Sample date: Dec-2012
Blood-feeding: no
Suspicious sequences: 6

Table 54: Suspicious Sequences in INSqiqTCTRAAPEI-75. 1 of 6 sequences was true positive and 5 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C137185_a_7_0_l_1730 ORF_004 RdRp, Fiji disease virus (NC_007159) 25% partial (start)

Figure 33: Sequence Organization of INSqiqTCTRAAPEI-75.

2.28 INSlupTATRAAPEI-90

This transcriptome contained a near full RdRp similar to the one of Hubei diptera virus 21 and three other segments similar to the same virus.

Table 55: Sample Information of INSlupTATRAAPEI-90.

Filename: 130206_I238_FCC1LVUACXX_L1_INSlupTATRAAPEI-90.free.fas
Assembly ID: INSlupTATRAAPEI-90
Order: Hymenoptera
Order details: NA
Family: Torymidae
Family details: NA
Species: Podagrion pachymerum
Number of specimens: 52
Stage: adult
Sample location: Slovakia, Rudno nad Hronom
Sample date: Jan-2012
Blood-feeding: no
Suspicious sequences: 128

Table 56: Suspicious Sequences in INSlupTATRAAPEI-90. 6 of 128 sequences were true positives and 122 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C92943_a_62_0_l_3472 ORF_003 minor capsid protein, Hubei diptera virus 21 (KX884698) 41% full

C93031_a_37_0_l_3587 ORF_001 structural protein, Hubei diptera virus 21 (KX884699) 26% full

C93187_a_43_0_l_3841 ORF_016 hypothetical protein, Hubei diptera virus 21 (KX884697) 31% full

C93315_a_47_0_l_4221 ORF_007 RdRp, Hubei diptera virus 21 (KX884693) 39% full

s6000_L_27906_0_a_30_4_l_3801 ORF_006 RdRp, Hubei rhabdo-like virus 6 (KX884421) 53% partial (start)

s6000_L_27906_0_a_30_4_l_3801 ORF_005 hypothetical protein, Hubei rhabdo-like virus 6 (KX884421) 42% partial (start)

Figure 34: Sequence Organization of INSlupTATRAAPEI-90.

2.29 INSqiqTCXRAAPEI-90

This transcriptome contained a few questionable fragments but no true positive RdRp-like sequence.

Table 57: Sample Information of INSqiqTCXRAAPEI-90.

Filename: 130206_I238_FCC1LVUACXX_L4_INSqiqTCXRAAPEI-90.free.fas
Assembly ID: INSqiqTCXRAAPEI-90
Order: Neuroptera
Order details: NA
Family: Hemerobiidae
Family details: NA
Species: Hemerobius humulinus
Number of specimens: 10
Stage: NA
Sample location: Italy, Pordenone, Barcis Prescudin
Sample date: 02-Aug-2012
Blood-feeding: no
Suspicious sequences: 17

Table 58: Suspicious Sequences in INSqiqTCXRAAPEI-90. 4 of 17 sequences were questionable and 13 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C103846_a_3_0_l_322 ORF_002 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 44% partial (mid)

(?) C130385_a_6_0_l_631 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 26% partial (end)

(?) C136315_a_3_0_l_794 ORF_002 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 50% partial (mid)

(?) C145349_a_13_0_l_1352 ORF_006 glycoprotein, Dendrolimus punctatus cypovirus 22 (NC_025838) 34% full

(?) C74377_a_3_0_l_201 ORF_001 hypothetical protein 1, Hubei diptera virus 21 (KX884697) 50% partial (mid)

Figure 35: Sequence Organization of INSqiqTCXRAAPEI-90.

2.30 INSqiqTDLRAAPEI-72

This transcriptome contained a potential glycoprotein but no true positive RdRp-like sequence. However, matches to these viruses should be treated with caution.

Table 59: Sample Information of INSqiqTDLRAAPEI-72.

Filename: 130206_I238_FCC1LVUACXX_L4_INSqiqTDLRAAPEI-72.free.fas
Assembly ID: INSqiqTDLRAAPEI-72
Order: Phasmatodea
Order details: NA
Family: Pseudophasmatidae
Family details: Xerosomatinae Hesperophasmatini
Species: Creoxylus spinosus
Number of specimens: 1
Stage: adult
Sample location: Germany, lab culture
Sample date: 16-Oct-2012
Blood-feeding: no
Suspicious sequences: 8

Table 60: Suspicious Sequences in INSqiqTDLRAAPEI-72. 1 of 8 sequences was true positive and 7 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C176836_a_14_0_l_2093 ORF_017 glycoprotein, Wuchang Cockroach Virus 3 (KM817605) 32% full

Figure 36: Sequence Organization of INSqiqTDLRAAPEI-72.

2.31 INSobdTDTRAAPEI-18

This transcriptome contained five near full segments similar to several different viruses.

Table 61: Sample Information of INSobdTDTRAAPEI-18.

Filename: 130314_I269_FCC1KFEACXX_L7_INSobdTDTRAAPEI-18.free.fas
Assembly ID: INSobdTDTRAAPEI-18
Order: Phasmatodea
Order details: NA
Family: Phylliidae
Family details: Phylliini tribe
Species: Phyllium philippinicum
Number of specimens: 1
Stage: adult
Sample location: Germany, lab culture
Sample date: 13-Nov-2012
Blood-feeding: no
Suspicious sequences: 11

Table 62: Suspicious Sequences in INSobdTDTRAAPEI-18. 5 of 10 sequences were true positives and 5 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C143807_a_61_0_l_1898 ORF_002 minor structural protein, Fiji disease virus (NC_007161) 21% full

C147136_a_60_0_l_3324 ORF_006 segment 4, Mal de Rio Cuarto virus (NC_008729) 21% full

C147356_a_62_0_l_3744 ORF_006 hypothetical protein, Hubei diptera virus 20 (KX884685) 28% full

C147412_a_62_0_l_3919 ORF_006 major core protein, Wuhan heteroptera virus 3 (NC_032510) 20% full

C147496_a_62_0_l_4272 ORF_021 RdRp, Mal de Rio Cuarto virus (NC_008733) 26% full

Figure 37: Sequence Organization of INSobdTDTRAAPEI-18.

2.32 INSobdTDYRAAPEI-30

This transcriptome contained a near full RdRp similar to Hubei reo-like virus 12. There are several questionable fragments similar to segment 6 of Dendrolimus punctatus cypovirus 22 and a near full segment similar to segment 10 of Dendrolimus punctatus cypovirus 22.

Table 63: Sample Information of INSobdTDYRAAPEI-30.

Filename: 130501_I249_FCC1UW6ACXX_L1_INSobdTDYRAAPEI-30.free.fas
Assembly ID: INSobdTDYRAAPEI-30
Order: Diptera
Order details: Brachycera
Family: Rhagionidae
Family details: NA
Species: Chrysopilus thoracicus
Number of specimens: 1
Stage: adult
Sample location: USA, North Carolina, Wake County, Raleigh
Sample date: 28-May-2012
Blood-feeding: no
Suspicious sequences: 45

Figure 38: Sequence Organization of INSobdTDYRAAPEI-30.

Table 64: Suspicious Sequences in INSobdTDYRAAPEI-30. 1 of 45 sequences was true positive, 16 were questionable and 28 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C101978_a_3_0_l_243 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 39% partial (mid)

(?) C111852_a_8_0_l_277 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid)

(?) C112940_a_6_0_l_282 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 38% partial (mid)

(?) C126394_a_3_0_l_364 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 25% partial (end)

(?) C129516_a_28_0_l_389 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 37% partial (start)

(?) C132062_a_5_0_l_413 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% partial (end)

(?) C134838_a_7_0_l_442 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 36% partial (mid)

(?) C137952_a_20_0_l_479 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% partial (end)

(?) C148614_a_6_0_l_673 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (start)

(?) C152290_a_5_0_l_780 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 35% partial (mid)

(?) C156110_a_5_0_l_944 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 32% partial (mid-end)

(?) C159321_a_4_0_l_1131 ORF_006 segment 10, Dendrolimus punctatus cypovirus 22 (NC_025838) 24% full

C169853_a_60_0_l_4548 ORF_010 RdRp, Hubei reo-like virus 12 (KX884634) 40% full

(?) s10802_L_38510_0_a_38_6_l_3377 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 24% full

(?) s383_L_329_0_a_3_0_l_375 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (start)

(?) s3885_L_4850_0_a_23_1_l_1267 ORF_006 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 36% partial (end)

(?) s4657_L_6430_2_a_8_3_l_598 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% partial (end)

2.33 INSerlTCGRAAPEI-32

This transcriptome contained two near full sequences similar to the glycoprotein of Dendrolimus punctatus cypovirus 22 but no RdRp-like true positive sequence.

Table 65: Sample Information of INSerlTCGRAAPEI-32.

Filename: 130608_I189_FCD20KDACXX_L3_INSerlTCGRAAPEI-32.free.fas
Assembly ID: INSerlTCGRAAPEI-32
Order: Raphidioptera
Order details: NA
Family: Inocelliidae
Family details: NA
Species: Fibla maclachlani
Number of specimens: 1
Stage: larva
Sample location: Austria, labstock culture originating from Italy, Sardinia, Sassari
Sample date: 22-Feb-2013
Blood-feeding: no
Suspicious sequences: 27

Table 66: Suspicious Sequences in INSerlTCGRAAPEI-32. 2 of 27 sequences were questionable and 9 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) s5144_L_8051_0_a_48_6_l_2295 ORF_003 glycoprotein, Dendrolimus punctatus cypovirus 22 (NC_025838) 30% full

(?) s5883_L_10241_0_a_25_9_l_3513_a ORF_011 glycoprotein, Dendrolimus punctatus cypovirus 22 (NC_025838) 32% full

Figure 39: Sequence Organization of INSerlTCGRAAPEI-32.

2.34 INSkzdTABRAAPEI-136

This transcriptome contained a fragment of an RdRp-like sequence similar to the RdRp of Dendrolimus punctatus cypovirus 22.

Table 67: Sample Information of INSkzdTABRAAPEI-136.

Filename: 130720_I246_FCD23MRACXX_L4_INSkzdTABRAAPEI-136.free.fas
Assembly ID: INSkzdTABRAAPEI-136
Order: Psocodea
Order details: Psocomorpha
Family: Lachesillidae
Family details: NA
Species: Lachesilla abiesicola
Number of specimens: NA
Stage: adult
Sample location: Mexico, Cumbres del Ajusco National Park
Sample date: 27-Feb-2013
Blood-feeding: no
Suspicious sequences: 8

Table 68: Suspicious Sequences in INSkzdTABRAAPEI-136. 1 of 8 sequences was questionable and 7 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C60185_a_5_0_l_422 ORF_001 RdRp, Dendrolimus punctatus cypovirus 22 (NC_025847) 24% partial (mid)

Figure 40: Sequence Organization of INSkzdTABRAAPEI-136.

2.35 INSkzdTACRAAPEI-171

This transcriptome contained two small fragments similar to the RdRp of Mal de Rio Cuarto virus.

Table 69: Sample Information of INSkzdTACRAAPEI-171.

Filename: 130720_I246_FCD23MRACXX_L4_INSkzdTACRAAPEI-171.free.fas
Assembly ID: INSkzdTACRAAPEI-171
Order: Odonata
Order details: Zygoptera
Family: Pseudostigmatidae
Family details: NA
Species: Megaloprepus caerulatus
Number of specimens: 1
Stage: adult
Sample location: Panama, Barro Colorado Island
Sample date: 2011
Blood-feeding: no
Suspicious sequences: 4

Table 70: Suspicious Sequences in INSkzdTACRAAPEI-171. 3 of 4 sequences were true positives, 1 questionable and 9 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

C143832_a_26_0_l_483 ORF_002 RdRp, Mal de Rio Cuarto virus (NC_008733) 29% partial (mid)

C157093_a_4_0_l_696 ORF_001 RdRp, Mal de Rio Cuarto virus (NC_008733) 36% partial (mid)

(?) C168499_a_19_0_l_1118 ORF_005 segment 10, Dendrolimus punctatus cypovirus 22 (NC_025838) 32% full

Figure 41: Sequence Organization of INSkzdTACRAAPEI-171.

2.36 INSofmTBLRAAPEI-71

This transcriptome contained five near full segments similar to Colorado tick fever virus, including an RdRp segment.

Table 71: Sample Information of INSofmTBLRAAPEI-71.

Filename: 130728_I263_FCD23HKACXX_L3_INSofmTBLRAAPEI-71.free.fas
Assembly ID: INSofmTBLRAAPEI-71
Order: Odonata
Order details: Anisoptera
Family: Aeshnidae
Family details: NA
Species: Anax imperator
Number of specimens: 1
Stage: adult
Sample location: Germany, Rhineland-Palatinate, Steinfeld
Sample date: 22-Jun-2012
Blood-feeding: no
Suspicious sequences: 15

Table 72: Suspicious Sequences in INSofmTBLRAAPEI-71. 5 of 15 sequences were true positives and 10 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C83693_a_58_0_l_1857 ORF_003 segment 10, Colorado tick fever virus (NC_004189) 27% full

C86843_a_61_0_l_2465 ORF_006 segment 4, Colorado tick fever virus (NC_004184) 26% full

C89155_a_61_0_l_3382 ORF_010 segment 3, Colorado tick fever virus (NC_004183) 32% full

C89824_a_61_0_l_3972 ORF_006 segment 2, Colorado tick fever virus (NC_004182) 26% full

C89996_a_62_0_l_4241 ORF_012 RdRp, Colorado tick fever virus (NC_004181) 32% full

Figure 42: Sequence Organization of INSofmTBLRAAPEI-71.

2.37 INSofmTCYRAAPEI-79

This transcriptome contained a near full RdRp segment similar to the one of Rice gall dwarf virus and a partial capsid protein similar to the same virus.

Table 73: Sample Information of INSofmTCYRAAPEI-79.

Filename: 130728_I263_FCD23HKACXX_L4_INSofmTCYRAAPEI-79.free.fas
Assembly ID: INSofmTCYRAAPEI-79
Order: Neuroptera
Order details: NA
Family: Coniopterygidae
Family details: NA
Species: Coniopteryx pygmaea
Number of specimens: 18
Stage: not determined
Sample location: Austria, Lower Austria, Wien-Surroundings, Klosterneuburg
Sample date: 28-Apr-2013
Blood-feeding: no
Suspicious sequences: 17

Table 74: Suspicious Sequences in INSofmTCYRAAPEI-79. 4 of 17 sequences were true positives and 13 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C107296_a_6_0_l_729 ORF_001 minor outer capsid protein, Rice gall dwarf virus (NC_009244) 27% partial (end)

C123576_a_32_0_l_3800 ORF_007 RdRp, Rice gall dwarf virus (NC_003773) 32% full

C80679_a_4_0_l_263 ORF_001 RdRp, Rice gall dwarf virus (NC_009244) 32% partial (end)

Figure 43: Sequence Organization of INSofmTCYRAAPEI-79.

2.38 INSqiqTDDRABPEI-136

This transcriptome contained two small fragments of an RdRp similar to the one of Colorado tick fever virus.

Table 75: Sample Information of INSqiqTDDRABPEI-136.

Filename: 130728_I263_FCD23HKACXX_L4_INSqiqTDDRABPEI-136.free.fas
Assembly ID: INSqiqTDDRABPEI-136
Order: Archaeognatha
Order details: NA
Family: Machilidae
Family details: NA
Species: Petridiobius arcticus
Number of specimens: 8
Stage: NA
Sample location: USA, labstock Alaska, US Fish and Wildlife Service, Kenai National Wildlife Refuge, Soldotna
Sample date: Mar-2012
Blood-feeding: no
Suspicious sequences: 15

Table 76: Suspicious Sequences in INSqiqTDDRABPEI-136. 2 of 15 sequences were true positives, 1 questionable and 12 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

C188009_a_5_0_l_439 ORF_002 RdRp, Colorado tick fever virus (NC_004181) 42% partial (mid)

(?) C225936_a_6_0_l_1366 ORF_003 segment 10, Dendrolimus punctatus cypovirus 22 (NC_025838) 30% full

s11273_L_59364_0_a_3_0_l_1125 ORF_001 RdRp, Colorado tick fever virus (NC_004181) 37% partial (mid)

Figure 44: Sequence Organization of INSqiqTDDRABPEI-136.

2.39 INSfkjTBIRAAPEI-202

This transcriptome contained a fragment of an RdRp-like sequence similar to Rice dwarf virus.

Table 77: Sample Information of INSfkjTBIRAAPEI-202.

Filename: 130815_I162_FCD2DLYACXX_L2_INSfkjTBIRAAPEI-202.free.fas
Assembly ID: INSfkjTBIRAAPEI-202
Order: Hymenoptera
Order details: NA
Family: Platygastridae
Family details: NA
Species: Inostemma sp
Number of specimens: 51
Stage: adult
Sample location: Germany, North Rhine-Westphalia, Bonn, garden of the ZFMK
Sample date: Jun-2013
Blood-feeding: no
Suspicious sequences: 21

Table 78: Suspicious Sequences in INSfkjTBIRAAPEI-202. 2 of 21 sequences were questionable and 19 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C171416_a_6_0_l_469 ORF_001 glycoprotein, Lishi Spider Virus 1 (KM817596) 33% partial (start)

(?) C191441_a_36_0_l_928 ORF_003 RdRp, Rice dwarf virus (NC_003773) 32% partial (start)

Figure 45: Sequence Organization of INSfkjTBIRAAPEI-202.

2.40 INSerlTAKRAAPEI-83

This transcriptome contained a glycoprotein similar to the one of Wuchang Cockroach Virus 3 but no RdRp-like sequence.

Table 79: Sample Information of INSerlTAKRAAPEI-83.

Filename: 130608_I189_FCD20KDACXX_L1_INSerlTAKRAAPEI-83.free.fas
Assembly ID: INSerlTAKRAAPEI-83
Order: Zygentoma
Order details: NA
Family: Lepismatidae
Family details: NA
Species: Ctenolepisma lineata
Number of specimens: 4
Stage: not determined
Sample location: Portugal, Faro, near Salir
Sample date: 04-Jan-2013
Blood-feeding: no
Suspicious sequences: 27

Table 80: Suspicious Sequences in INSerlTAKRAAPEI-83. 1 of 27 sequences was true positive and 26 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C184308_a_31_0_l_2127 ORF_003 glycoprotein, Wuchang Cockroach Virus 3 (KM817605) 28% full

Figure 46: Sequence Organization of INSerlTAKRAAPEI-83.

2.41 INSfkjTBMRAAPEI-206

This transcriptome contained several near full segments similar to Hubei diptera virus 21 and one segment similar to segment 5 of Operophtera brumata reovirus.

Table 81: Sample Information of INSfkjTBMRAAPEI-206.

Filename: 130831_I260_FCC2BWAACXX_L1_INSfkjTBMRAAPEI-206.free.fas
Assembly ID: INSfkjTBMRAAPEI-206
Order: Hymenoptera
Order details: NA
Family: Encyrtidae
Family details: NA
Species: Metaphycus flavus
Number of specimens: ca. 40
Stage: adult
Sample location: Lab culture of unknown geographical origin
Sample date: 03-May-2013
Blood-feeding: no
Suspicious sequences: 19

Table 82: Suspicious Sequences in INSfkjTBMRAAPEI-206. 8 of 19 sequences were true positives and 11 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C63072_a_16_0_l_1537 ORF_006 hypothetical protein 5, Hubei diptera virus 21 (KX884701) 32% full

C63130_a_26_0_l_1543 ORF_002 RdRp, Hubei diptera virus 21 (KX884696) 37% partial (start)

C65354_a_18_0_l_1884 ORF_005 segment 5, Operophtera brumata reovirus (NC_007563) 19% full

C65544_a_25_0_l_1922 ORF_006 hypothetical protein 4, Hubei diptera virus 21 (KX884700) 24% full

C67533_a_11_0_l_2759 ORF_012 RdRp, Hubei diptera virus 21 (KX884696) 38% partial (mid-end)

C67888_a_13_0_l_3447 ORF_001 hypothetical protein 2, Hubei diptera virus 21 (KX884698) 41% full

C67916_a_19_0_l_3593 ORF_004 hypothetical protein 3, Hubei diptera virus 21 (KX884699) 26% full

C67944_a_11_0_l_3775 ORF_003 major core capsid protein, Hubei diptera virus 21 (KX884697) 30% full

Figure 47: Sequence Organization of INSfkjTBMRAAPEI-206.

2.42 INSofmTCERAAPEI-22

This transcriptome contained a questionable fragment of an RdRp similar to the one of Rice ragged stunt virus and two other questionable sequences from viruses that are related to many false positives.

Table 83: Sample Information of INSofmTCERAAPEI-22.

Filename: 130919_I247_FCC2V7VACXX_L1_INSofmTCERAAPEI-22.free.fas
Assembly ID: INSofmTCERAAPEI-22
Order: Mantodea
Order details: NA
Family: Hymenopodidae
Family details: NA
Species: Harpagomantis tricolor
Number of specimens: 1
Stage: NA
Sample location: Germany, lab culture with samples originating from South Africa, Johannesburg
Sample date: 2012
Blood-feeding: no
Suspicious sequences: 10

Table 84: Suspicious Sequences in INSofmTCERAAPEI-22. 4 of 10 sequences were questionable and 6 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C129377_a_3_0_l_312 ORF_001 RdRp, Rice ragged stunt virus (NC_003771) 45% partial (start)

(?) C142385_a_6_0_l_382 ORF_003 segment 11, Liao ning virus (NC_007746) 34% partial (start)

(?) C186124_a_8_0_l_1235 ORF_002 sigma 1, Mammalian Orthoreovirus (JQ412761) 20% full

Figure 48: Sequence Organization of INSofmTCERAAPEI-22.

2.43 INSofmTCFRAAPEI-26

This transcriptome contained three questionable fragments similar to the RdRp of Equine encephalosis virus.

Table 85: Sample Information of INSofmTCFRAAPEI-26.

Filename: 130919_I247_FCC2V7VACXX_L1_INSofmTCFRAAPEI-26.free.fas
Assembly ID: INSofmTCFRAAPEI-26
Order: Blattodea
Order details: NA
Family: Corydiidae
Family details: Tiviinae
Species: Tivia sp
Number of specimens: 1
Stage: adult
Sample location: Namibia, Otjozondjupa, Waterberg
Sample date: 05-Apr-2013
Blood-feeding: no
Suspicious sequences: 3

Table 86: Suspicious Sequences in INSofmTCFRAAPEI-26. 2 of 3 sequences were questionable and 1 sequence was false positive similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C429509_a_3_0_l_558 ORF_002 RdRp, Equine encephalosis virus (AB811635) 32% partial (mid)

(?) s11468_L_28221_0_a_5_6_l_982 ORF_001 RdRp, Equine encephalosis virus (AB811635) 39% partial (mid)

(?) s11468_L_28221_0_a_5_6_l_982 ORF_005 RdRp, Equine encephalosis virus (AB811635) 39% partial (mid)

Figure 49: Sequence Organization of INSofmTCFRAAPEI-26.

2.44 INSinlTAARABPEI-43

This transcriptome contained a small fragment of an RdRp similar to the one of Hubei diptera virus 10.

Table 87: Sample Information of INSinlTAARABPEI-43.

Filename: 130919_I247_FCC2V7VACXX_L3_INSinlTAARABPEI-43.free.fas
Assembly ID: INSinlTAARABPEI-43
Order: Orthoptera
Order details: Caelifera
Family: Pyrgomorphidae
Family details: NA
Species: Psedna nana
Number of specimens: 1
Stage: adult
Sample location: Australia, Western Australia, Toodyay
Sample date: Dec-2011
Blood-feeding: no
Suspicious sequences: 1

Table 88: Suspicious Sequences in INSinlTAARABPEI-43. 1 of 1 sequences was questionable.

Sequence ID ORF Match Identity Completeness

(?) s4225_L_7567_0_a_3_0_l_271 ORF_001 RdRp, Hubei diptera virus 20 (KX884693) 56% partial (mid)

Figure 50: Sequence Organization of INSinlTAARABPEI-43.

2.45 INSinlTAPRAAPEI-33

This transcriptome contained a fragmentary RdRp similar to the one of Hubei rhabdo-like virus 6. Additionally, another hypothetical ORF showed similarity to the same virus.

Table 89: Sample Information of INSinlTAPRAAPEI-33.

Filename: 130919_I247_FCC2V7VACXX_L6_INSinlTAPRAAPEI-33.free.fas
Assembly ID: INSinlTAPRAAPEI-33
Order: Hymenoptera
Order details: NA
Family: Agaonidae
Family details: NA
Species: Courtella sp
Number of specimens: 50
Stage: adult
Sample location: South Africa, Western Cape, Kirstenhof, Waterford Circle
Sample date: 21-Apr-2013
Blood-feeding: no
Suspicious sequences: 8

Table 90: Suspicious Sequences in INSinlTAPRAAPEI-33. 1 of 8 sequences was true positive and 7 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C111078_a_9_0_l_661 ORF_005 RdRp, Hubei rhabdo-like virus 6 (KX884421) 43% partial (mid)

Figure 51: Sequence Organization of INSinlTAPRAAPEI-33.

2.46 INSinlTAWRAAPEI-44

This transcriptome contained a fragmentary RdRp similar to the one of Diaphorina citri associated C virus.

Table 91: Sample Information of INSinlTAWRAAPEI-44.

Filename: 130919_I247_FCC2V7VACXX_L6_INSinlTAWRAAPEI-44.free.fas
Assembly ID: INSinlTAWRAAPEI-44
Order: Diptera
Order details: Brachycera
Family: Lonchopteridae
Family details: NA
Species: Lonchoptera bifurcata
Number of specimens: 10
Stage: adult
Sample location: USA, North Carolina, Wake County, Raleigh, Schenck forest
Sample date: 25-May-2013
Blood-feeding: no
Suspicious sequences: 7

Table 92: Suspicious Sequences in INSinlTAWRAAPEI-44. 2 of 7 sequences were true positives and 5 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

s2259_L_4066_0_a_5_0_l_449 ORF_001 RdRp, Diaphorina citri associated C virus (KX235518) 46% partial (mid)

s2260_L_4066_1_a_3_3_l_878 ORF_001 RdRp, Diaphorina citri associated C virus (KX235518) 45% partial (mid)

Figure 52: Sequence Organization of INSinlTAWRAAPEI-44.

2.47 RINSinlTCARAAPEI-55

This transcriptome contained two fragments of an RdRp similar to the one of Hubei odonate virus 15.

Table 93: Sample Information of RINSinlTCARAAPEI-55.

Filename: 130919_I247_FCC2V7VACXX_L7_RINSinlTCARAAPEI-55.free.fas
Assembly ID: RINSinlTCARAAPEI-55
Order: Diptera
Order details: Brachycera
Family: Tachinidae
Family details: NA
Species: Euthera bicolor
Number of specimens: 1
Stage: adult
Sample location: USA, Mississippi, Noxubee County, Sam D Hamilton, Noxubee National Wildlife Refuge
Sample date: 19-May-2013
Blood-feeding: no
Suspicious sequences: 4

Table 94: Suspicious Sequences in RINSinlTCARAAPEI-55. 2 of 4 sequences were true positives and 2 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C61403_a_15_0_l_1491 ORF_004 RdRp, Hubei odonate virus 15 (KX884664) 46% partial (mid)

s3320_L_16467_0_a_24_6_l_1796 ORF_008 RdRp, Hubei odonate virus 15 (KX884664) 49% partial (start)

Figure 53: Sequence Organization of RINSinlTCARAAPEI-55.

2.48 RINSinlTCNRAAPEI-33

This transcriptome contained several full segments similar to Cimodo virus. The contig s10827_L_38271_0_a_47_1_l_7816 appeared to be a misassembly and was trimmed to the matching region.

Table 95: Sample Information of RINSinlTCNRAAPEI-33.

Filename: 130928_I232_FCC2UV4ACXX_L1_RINSinlTCNRAAPEI-33.free.fas
Assembly ID: RINSinlTCNRAAPEI-33
Order: Hymenoptera
Order details: NA
Family: Eulophidae
Family details: NA
Species: Tamarixia radiata
Number of specimens: ca. 131
Stage: adult
Sample location: Pakistan, lab culture with samples originating from Pakistan, Punjab
Sample date: May-2013
Blood-feeding: no
Suspicious sequences: 44

Table 96: Suspicious Sequences in RINSinlTCNRAAPEI-33. 4 of 44 sequences were true positives and 40 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C135862_a_52_0_l_1820 ORF_008 segment 6, Cimodo virus (NC_024920) 27% full

C143103_a_33_0_l_3239 ORF_003 segment 3, Cimodo virus (NC_024917) 32% full

C144637_a_50_0_l_4075 ORF_004 RdRp, Cimodo virus (NC_023420) 41% full

s10827_L_38271_0_a_47_1_l_7816 ORF_013 segment 2, Cimodo virus (NC_024916) 32% full

Figure 54: Sequence Organization of RINSinlTCNRAAPEI-33.

2.49 RINSymlTABRAAPEI-202

This transcriptome contained a full RdRp similar to the one of Hubei odonate virus 15.

Table 97: Sample Information of RINSymlTABRAAPEI-202.

Filename: 131212_I249_FCC39K4ACXX_L6_RINSymlTABRAAPEI-202.free.fas
Assembly ID: RINSymlTABRAAPEI-202
Order: Diptera
Order details: Brachycera
Family: Oestridae
Family details: NA
Species: Cuterebra austeni
Number of specimens: 1
Stage: adult
Sample location: USA, New Mexico, Grant County, Silver City, Gomez Park
Sample date: 27-May-2013
Blood-feeding: yes-larvae
Suspicious sequences: 3

Table 98: Suspicious Sequences in RINSymlTABRAAPEI-202. 1 of 3 sequences was true positive and 2 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C40647_a_62_0_l_4009 ORF_011 RdRp, Hubei odonate virus 15 (KX884664) 45% full

Figure 55: Sequence Organization of RINSymlTABRAAPEI-202.

2.50 RINSwvkTAURAAPEI-56

This transcriptome contained a glycoprotein similar to the one of Wuchang Cockroach Virus 3 but no RdRp-like true positive.

Table 99: Sample Information of RINSwvkTAURAAPEI-56.

Filename: 140430_I162_FCC4EFCACXX_L1_RINSwvkTAURAAPEI-56.free.fas
Assembly ID: RINSwvkTAURAAPEI-56
Order: Neuroptera
Order details: NA
Family: Chrysopidae
Family details: NA
Species: Chrysopa formosa
Number of specimens: 3
Stage: adult
Sample location: Austria, Lower Austria, Krems-Land, Duernstein
Sample date: 27-Jul-2013
Blood-feeding: no
Suspicious sequences: 13

Table 100: Suspicious Sequences in RINSwvkTAURAAPEI-56. 1 of 13 sequences was true positive and 12 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

s3755_L_6696_1_a_79_3_l_2180 ORF_001 glycoprotein, Wuchang Cockroach Virus 3 (KM817605) 39% partial

Figure 56: Sequence Organization of RINSwvkTAURAAPEI-56.

2.51 ANIsrmTAAWRAAPEI-225

This transcriptome contained a fragmentary RdRp similar to the one of Morris orbivirus.

Table 101: Sample Information of ANIsrmTAAWRAAPEI-225.

Filename: 140710_I812_FCH8K85ADXX_L2_ANIsrmTAAWRAAPEI-225.free.fas
Assembly ID: ANIsrmTAAWRAAPEI-225
Order: Lepidoptera
Order details: NA
Family: Nepticulidae
Family details: NA
Species: Stigmella atricapitella
Number of specimens: 1
Stage: adult
Sample location: Germany, Rhineland-Palatinate, Oberhausen an der Nahe, Rabenfelsen
Sample date: 28-Jun-2011
Blood-feeding: no
Suspicious sequences: 14

Table 102: Suspicious Sequences in ANIsrmTAAWRAAPEI-225. 1 of 14 sequences was true positive and 13 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C128088_a_7_0_l_1169 ORF_004 RdRp, Morris orbivirus (KX907618) 42% partial (mid)

Figure 57: Sequence Organization of ANIsrmTAAWRAAPEI-225.

2.52 WHANIsrmTMAFRAAPEI-14

This transcriptome contained a small fragment of an RdRp similar to the one of Equine encephalosis virus.

Table 103: Sample Information of WHANIsrmTMAFRAAPEI-14.

Filename: 140813_I652_FCC4L86ACXX_L1_WHANIsrmTMAFRAAPEI-14.free.fas
Assembly ID: WHANIsrmTMAFRAAPEI-14
Order: Coleoptera
Order details: NA
Family: Coccinellidae
Family details: Scymninae
Species: Cryptolaemus montrouzieri
Number of specimens: 1
Stage: adult
Sample location: Australia, Australian Capital Territory, Australian National Insect Collection, Acton 2601, Black Mountain
Sample date: 13-Oct-2013
Blood-feeding: no
Suspicious sequences: 18

Table 104: Suspicious Sequences in WHANIsrmTMAFRAAPEI-14. 1 of 18 sequences was questionable and 17 sequences were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C100756_a_7_0_l_408 ORF_004 RdRp, Equine encephalosis virus (AB811635) 37% partial (mid)

Figure 58: Sequence Organization of WHANIsrmTMAFRAAPEI-14.

2.53 WHANIsrmTMCHRAAPEI-56

This transcriptome contained a very small fragment of an RdRp similar to that of Hubei insect virus 2, as well as another segment encoding a putative major core capsid protein similar to that of the same virus.

Table 105: Sample Information of WHANIsrmTMCHRAAPEI-56.

Filename: 140813_I652_FCC4L86ACXX_L4_WHANIsrmTMCHRAAPEI-56.free.fas
Assembly ID: WHANIsrmTMCHRAAPEI-56
Order: Lepidoptera
Order details: NA
Family: Epipyropidae
Family details: NA
Species: Epipomponia nawai
Number of specimens: 1
Stage: NA
Sample location: South Korea, Ulsan City, Ulju, Mount Ganweolsan
Sample date: 28-Aug-2012
Blood-feeding: no
Suspicious sequences: 18

Table 106: Suspicious Sequences in WHANIsrmTMCHRAAPEI-56. 2 of 18 sequences were true positives and 16 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C149031_a_5_0_l_342 ORF_001 RdRp, Hubei insect virus 2 (NC_032167) 45% partial (mid)

C200126_a_3_0_l_1375 ORF_001 major core capsid protein, Hubei insect virus 2 (NC_032161) 25% partial (mid)

Figure 59: Sequence Organization of WHANIsrmTMCHRAAPEI-56.

2.54 INSeqtTBBRAAPEI-75

This transcriptome contained several nearly full-length segments of a virus with similarities to Operophtera brumata reovirus. In addition, many small ORFs yielded hits for segment 6 of Dendrolimus punctatus cypovirus 22.

Table 107: Sample Information of INSeqtTBBRAAPEI-75.

Filename: 121010_I249_FCD1C4BACXX_L7_INSeqtTBBRAAPEI-75.free.fas
Assembly ID: INSeqtTBBRAAPEI-75
Order: Hymenoptera
Order details: NA
Family: Diapriidae
Family details: NA
Species: Trichopria drosophilae
Number of specimens: ca. 80
Stage: adult
Sample location: Lab culture with samples originating from France, 60 km south of Lyon, Sablons, Isère district
Sample date: 2012
Blood-feeding: no
Suspicious sequences: 48

Figure 60: Sequence Organization of INSeqtTBBRAAPEI-75.

Table 108: Suspicious Sequences in INSeqtTBBRAAPEI-75. 11 of 48 sequences were true positives, 21 were questionable, and 16 were false positives similar to the false positives listed in chapter 3.2.2. Questionable sequences are marked with (?).

Sequence ID ORF Match Identity Completeness

(?) C17507_a_3_0_l_204 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 32% partial (start)

(?) C18879_a_3_0_l_223 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (start)

(?) C21267_a_3_0_l_244 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 38% partial (start)

(?) C23307_a_3_0_l_257 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 58% partial (mid)

(?) C24999_a_4_0_l_273 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 35% partial (end)

(?) C27963_a_3_0_l_324 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 37% partial (mid)

C36977_a_47_0_l_528 ORF_002 RdRp, Operophtera brumata reovirus (NC_007559) 38% partial (end)

(?) C37285_a_4_0_l_537 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 43% partial (end)

(?) C37453_a_6_0_l_542 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 38% partial (mid)

(?) C40235_a_5_0_l_643 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 37% partial (end)

C41097_a_12_0_l_677 ORF_002 segment 6, Operophtera brumata reovirus (NC_007564) 37% partial (end)

(?) C42503_a_6_0_l_744 ORF_003 segment 11, Liao ning virus (NC_007746) 21% full

(?) C43717_a_4_0_l_802 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 40% partial (end)

(?) C44833_a_3_0_l_866 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid)

C47520_a_10_0_l_1047 ORF_001 segment 3, Operophtera brumata reovirus (NC_007561) 40% partial (mid)

C49320_a_9_0_l_1189 ORF_001 segment 6, Operophtera brumata reovirus (NC_007564) 24% partial (mid-end)

C49430_a_19_0_l_1203 ORF_005 segment 3, Operophtera brumata reovirus (NC_007561) 29% partial (end)

C49840_a_20_0_l_1238 ORF_003 segment 8, Operophtera brumata reovirus (NC_007566) 27% partial (mid-end)

C50472_a_8_0_l_1305 ORF_002 segment 3, Operophtera brumata reovirus (NC_007561) 25% partial (start)

(?) C52230_a_7_0_l_1503 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 32% partial (mid-end)

(?) C52824_a_3_0_l_1571 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (start-mid)

C56798_a_53_0_l_2229 ORF_001 segment 5, Operophtera brumata reovirus (NC_007563) 19% full

C59458_a_42_0_l_3125 ORF_006 segment 4, Operophtera brumata reovirus (NC_007562) 32% full

C60500_a_30_0_l_3935 ORF_012 segment 2, Operophtera brumata reovirus (NC_007560) 27% full

C60856_a_44_0_l_4409 ORF_001 RdRp, Operophtera brumata reovirus (NC_007559) 35% full

(?) s1322_L_1611_0_a_3_0_l_526 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid)

(?) s1726_L_2289_0_a_15_5_l_3299 ORF_001 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 30% full

(?) s1831_L_2457_0_a_5_7_l_646 ORF_004 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid)

(?) s2132_L_3130_2_a_31_5_l_832 ORF_005 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (start)

(?) s406_L_427_0_a_6_1_l_767 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 45% partial (end)

(?) s407_L_427_1_a_5_9_l_767 ORF_003 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 34% partial (end)

(?) s924_L_1022_0_a_3_8_l_1718 ORF_010 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 31% partial (mid-end)
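
The counts stated in the caption of Table 108 can be cross-checked mechanically. The following is a minimal sketch and not part of TRAVIS. It assumes that the table rows are available as plain-text lines and that a leading "(?)" is the only marker separating questionable rows from true positives. The names ROWS, classify and tally are hypothetical, and only four rows are copied here for brevity.

```python
from collections import Counter

# Four rows copied from Table 108 for illustration (the table lists 32 rows).
ROWS = [
    "(?) C17507_a_3_0_l_204 ORF_002 segment 6, Dendrolimus punctatus cypovirus 22 (NC_025850) 32% partial (start)",
    "C36977_a_47_0_l_528 ORF_002 RdRp, Operophtera brumata reovirus (NC_007559) 38% partial (end)",
    "(?) C42503_a_6_0_l_744 ORF_003 segment 11, Liao ning virus (NC_007746) 21% full",
    "C60856_a_44_0_l_4409 ORF_001 RdRp, Operophtera brumata reovirus (NC_007559) 35% full",
]


def classify(row: str) -> str:
    """Assumption: a row starting with '(?)' is questionable, any other row is a true positive."""
    return "questionable" if row.lstrip().startswith("(?)") else "true positive"


def tally(rows):
    """Count the rows per class."""
    return Counter(classify(row) for row in rows)


counts = tally(ROWS)

# Caption of Table 108: 11 true positives + 21 questionable + 16 false positives = 48.
caption_total = 11 + 21 + 16
```

Applied to all listed rows of a table, the tally should reproduce the true positive and questionable counts given in its caption. The false positives are not listed individually, so their number follows only from the total.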

2.55 INSobdTDIRAAPEI-84

This transcriptome contained several full-length segments similar to those of Operophtera brumata reovirus.

Table 109: Sample Information of INSobdTDIRAAPEI-84.

Filename: 130314_I269_FCC1KFEACXX_L8_INSobdTDIRAAPEI-84.free.fas
Assembly ID: INSobdTDIRAAPEI-84
Order: Hymenoptera
Order details: NA
Family: Pteromalidae
Family details: NA
Species: Lariophagus distinguendus
Number of specimens: 70
Stage: adult
Sample location: Germany, lab culture with samples originating from Germany, Berlin
Sample date: Nov-2012
Blood-feeding: no
Suspicious sequences: 29

Table 110: Suspicious Sequences in INSobdTDIRAAPEI-84. 6 of 29 sequences were true positives and 23 sequences were false positives similar to the false positives listed in chapter 3.2.2.

Sequence ID ORF Match Identity Completeness

C74954_a_59_0_l_1852 ORF_001 segment 6, Operophtera brumata reovirus (NC_007564) 25% full

C75590_a_61_0_l_1947 ORF_003 segment 5, Operophtera brumata reovirus (NC_007563) 20% full

C79547_a_47_0_l_3588 ORF_003 segment 3, Operophtera brumata reovirus (NC_007561) 25% full

C79689_a_62_0_l_3813 ORF_001 segment 2, Operophtera brumata reovirus (NC_007560) 30% full

C79803_a_56_0_l_4127 ORF_001 RdRp, Operophtera brumata reovirus (NC_007559) 34% full

s4658_L_21876_0_a_45_2_l_3350 ORF_007 segment 4, Operophtera brumata reovirus (NC_007562) 36% full
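
For further processing, rows of the suspicious-sequence tables can be split into their five columns. The sketch below is one possible approach under stated assumptions, not the format definition used by TRAVIS. It assumes that each row follows the pattern "Sequence ID, ORF, Match (accession), Identity, Completeness" and that accessions look like the ones appearing in these tables (for example NC_007559 or KX907618). Row, ROW_RE and parse_row are hypothetical names.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    questionable: bool      # True if the row carries the "(?)" marker
    seq_id: str
    orf: str
    match: str
    accession: str
    identity_percent: int
    completeness: str


# Assumed row layout; the accession pattern covers only the forms seen in these tables.
ROW_RE = re.compile(
    r"^(?P<flag>\(\?\)\s*)?"
    r"(?P<seq_id>\S+)\s+"
    r"(?P<orf>ORF_\d+)\s+"
    r"(?P<match>.+?)\s+"
    r"\((?P<accession>[A-Z]{1,2}_?\d+)\)\s+"
    r"(?P<identity>\d+)%\s+"
    r"(?P<completeness>.+)$"
)


def parse_row(line: str) -> Optional[Row]:
    """Return the parsed row, or None if the line does not fit the assumed layout."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return Row(
        questionable=m.group("flag") is not None,
        seq_id=m.group("seq_id"),
        orf=m.group("orf"),
        match=m.group("match"),
        accession=m.group("accession"),
        identity_percent=int(m.group("identity")),
        completeness=m.group("completeness"),
    )


example = parse_row(
    "C79803_a_56_0_l_4127 ORF_001 RdRp, Operophtera brumata reovirus (NC_007559) 34% full"
)
```

Lines that do not fit the assumed layout, such as the column header row, yield None and can be skipped.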

Figure 61: Sequence Organization of INSobdTDIRAAPEI-84.