<<

bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license.

1 Evolution of and Reproductive :

2 Insight from Two Novel Strains that Co-infect Cat Fleas

3

4 Timothy P. Driscoll a, Victoria I. Verhoeve b, Cassia Brockway a, Darin L. Shrewsberry a,

5 Mariah L. Plumer b, Spiridon E. Sevdalis b, John F. Beckmann c, Laura M. Krueger

6 Prelesnik d, Kevin R. Macaluso e, Abdu F. Azad b, Joseph J. Gillespie b,#

7

8 a Department of Biology, West Virginia University, Morgantown, West Virginia.

9 b Department of Microbiology and Immunology, University of Maryland School of

10 Medicine, Baltimore, Maryland.

11 c Department of Entomology and Plant Pathology, Auburn University, Auburn, Alabama.

12 d Orange County Mosquito and District, Garden Grove, .

13 e Department of Microbiology and Immunology, University of South Alabama, Mobile,

14 Alabama.

15

16 Running Head: Mechanisms of Evolution in Wolbachia-host associations

17

18 # Address correspondence to: Joe Gillespie, [email protected]

19 Timothy P. Driscoll and Victoria I. Verhoeve contributed equally to this work. Author

20 order is alphabetical. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 2

21 Abstract

22 Wolbachiae are obligate intracellular that infect and certain

23 . Usually maternally inherited, they may provision nutrients to (mutualism) or

24 alter sexual biology of (reproductive parasitism) their invertebrate hosts. We report the

25 assembly of closed genomes for two novel wolbachiae, wCfeT and wCfeJ, found co-

26 infecting cat fleas (Ctenocephalides felis) of the Elward Laboratory colony (Soquel, CA).

27 wCfeT is basal to nearly all described Wolbachia supergroups, while wCfeJ is related to

28 supergroups C, D and F. Both genomes contain laterally transferred genes that inform

29 on the evolution of Wolbachia host associations. wCfeT carries the Biotin synthesis

30 Operon of Obligate intracellular Microbes (BOOM); our analyses reveal five

31 independent acquisitions of BOOM across the Wolbachia tree, indicating parallel

32 evolution towards mutualism. Alternately, wCfeJ harbors a toxin-antidote operon

33 analogous to the wPip cinAB operon recently characterized as an inducer of

34 cytoplasmic incompatibility (CI) in flies. wCfeJ cinB and immediate-5’ end genes are

35 syntenic to large modular toxins encoded in CI-like operons of certain Wolbachia strains

36 and , signifying that CI toxins streamline by fission of larger toxins.

37 Remarkably, the C. felis genome itself contains two CI-like antidote genes, divergent

38 from wCfeJ cinA, revealing episodic reproductive parasitism in cat fleas and evidencing

39 mobility of CI loci independent of WO-phage. Additional screening revealed

40 predominant co-infection (wCfeT/wCfeJ) amongst C. felis colonies, though occasionally

41 wCfeJ singly infects fleas in wild populations. Collectively, genomes of wCfeT, wCfeJ,

42 and their cat flea host supply instances of lateral gene transfers that could drive

43 transitions between parasitism and mutualism. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 3

44 Importance

45 Many and certain species are infected with wolbachiae which are

46 intracellular bacteria well known for reproductive parasitism (RP). Like other RP

47 strategies, Wolbachia-induced cytoplasmic incompatibility, CI, increases prevalence and

48 frequency in host populations. Mutualism is another strategy employed by wolbachiae

49 to maintain host infection, with some strains synthesizing and supplementing certain B

50 vitamins (particularly biotin) to invertebrate hosts. Curiously, we discovered two novel

51 Wolbachia strains that co-infect cat fleas (Ctenocephalides felis): wCfeT carries biotin

52 synthesis genes, while wCfeJ carries a CI-inducing toxin-antidote operon. Our analyses

53 of these genes highlight their mobility across the Wolbachia phylogeny and source to

54 other intracellular bacteria. Remarkably, the C. felis genome also carries two CI-like

55 antidote genes divergent from the wCfeJ antidote gene, indicating episodic RP in cat

56 fleas. Collectively, wCfeT and wCfeJ inform on the rampant dissemination of diverse

57 factors that mediate Wolbachia strategies for persisting in invertebrate host populations.

58

59 Key words

60 Wolbachia, Ctenocephalides felis, cat flea, reproductive parasitism, mutualism, lateral

61 gene transfer, cytoplasmic incompatibility, biotin operon

62

63 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 4

64 Introduction

65 Wolbachiae (: : ) comprise Gram-

66 negative, obligate intracellular bacteria that infect over half the world’s described

67 species as well as certain parasitic nematodes (1). Unlike other notable rickettsial

68 genera that contain human pathogens (e.g., Rickettsia, Orientia, Neorickettsia,

69 Anaplasma, and Ehrlichia), wolbachiae do not infect vertebrates (2). A single species,

70 Wolbachia pipientis, is formally recognized with numerous members designated as

71 strains within 15 reported supergroups (3–9). Genomic divergence indicates further

72 species names are warranted (10), though increasing diversity and community

73 consensus suggest caution regarding further Wolbachia classification at the species

74 level (11, 12).

75 Like other obligate intracellular microbes, wolbachiae are metabolic parasites that

76 complement a generally reduced metabolism with pilfering of host metabolites (13, 14)).

77 Their ability to survive and flourish is also heavily influenced by the acquisition of key

78 functions through lateral gene transfer (LGT). Several described Wolbachia strains

79 demonstrate characteristics of limited mutualism with their invertebrate partner (15, 16),

80 through the synthesis and provisioning of riboflavin (17) and biotin (18, 19). While

81 riboflavin biosynthesis genes are highly conserved in wolbachiae (20), biotin

82 biosynthesis genes are rare and likely originated via LGT with taxonomically divergent

83 intracellular bacteria (21). Still other strains of Wolbachia exert varying degrees of

84 reproductive parasitism (RP) on their insect host (22), influencing host sexual

85 via processes such as male-killing, feminization, and

86 cytoplasmic incompatibility (CI) (23). Wolbachia genes underpinning CI and male bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 5

87 killing have been characterized (24–29) and occur predominantly in the eukaryotic

88 association module (EAM) of Wolbachia prophage genomes (30). These genes

89 highlight the role of LGT in providing wolbachiae with factors facilitating mutualism or

90 RP, both of which are highly successful strategies for increasing infection frequency in

91 invertebrate host populations.

92 Compared to reproductive parasites, Wolbachia mutualists appear to form more

93 stable, long-term relationships with their hosts, as supported by Wolbachia-host

94 codivergence in certain filarial nematodes (31) and Nomada bees (32). In contrast to

95 the stability of mutualists, relationships of reproductive parasites appear more

96 ephemeral. RP can be a strong mechanism to increase infection frequency, and CI

97 inducing strains can replace populations without infections (33, 34). Despite this, CI is

98 prone to neutralization through the evolution of host suppression (23, 35, 36) and

99 purifying selection on the host doesn’t preserve CI (36). A weakened CI background

100 might be a ripe setting for invasions to begin. Whether invasion occurs at the level of

101 alternate wolbachiae, WO-phages, or CI operons themselves is an area of active

102 evolutionary research, but a clear result of this evolutionary complexity is that RP

103 inducing Wolbachia phylogenies are discordant with those of their hosts (37–39).

104 Horizontal transmissions, which can occur through direct host interactions,

105 environmental contacts (shared habitat, food sources, etc.) or predator/parasitoid

106 delivery, contribute to the discordance (40) and possibly drive episodic invasions and

107 replacements.

108 Horizontal transmission is also evident from hosts infected with multiple Wolbachia

109 strains (41–43), though little is known about the dynamics and stability of Wolbachia co- bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 6

110 infections (44). For instance, in host populations showing variability in single- versus

111 double-infections, does co-infection reflect a transitory phase (i.e. one replacing

112 another) or does it reflect stability of both strains in a population? A single infection may

113 be a consequence of closely related strains carrying different arsenals of RP-inducing

114 genes, and hence battling for control of host reproduction (38). A stable co-infection

115 may reflect more divergent strains that utilize different strategies (i.e. nutrient

116 provisioning versus CI) to inhabit different micro-niches in the same host. Such a case

117 is in part witnessed in doubly-infected cochineals (Dactylopius coccus), wherein a

118 Supergroup B strain (wDacB) carries an arsenal of RP-inducing genes but a

119 Supergroup A strain (wDacA) carries none (41). Unfortunately, few systems with

120 divergent co-infecting Wolbachia have been characterized sufficiently to dissect

121 contrasting mechanisms that enable long-term host co-infections.

122 We recently sequenced the genome of the cat flea (Ctenocephalides felis) and

123 concurrently closed the genomes of two novel wolbachiae, wCfeT and wCfeJ, that co-

124 infect fleas of the Elward Laboratory (Soquel, California; hereafter EL fleas) (45).

125 Genome-based phylogeny estimation placed wCfeT on an ancestral divergent branch,

126 while wCfeJ was found to subtend Wolbachia Supergroup C. Both genomes show

127 evidence of past WO phage infection, with LGT hotspots indicating recent acquisition of

128 factors predicted to either underpin mutualism (wCfeT) or mediate CI (wCfeJ).

129 Additionally, we describe two loci within the C. felis genome itself that support repeated

130 acquisitions of CI anti-toxin genes by the host. Infection surveys of cat flea populations

131 and colonies from across the US indicate extensive spread of wCfeT among wild

132 populations, and a strong bias for wCfeT/wCfeJ co-infection in colonies, highlighting the bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 7

133 complex ecological relationship that drives this system. Finally, we apply an informatics

134 approach including phylogenomics, synteny analysis, and functional alignment to

135 construct a comprehensive picture revealing the recurrent evolution of nutritional

136 symbiosis and RP across the Rickettsiales.

137

138 Results and Discussion

139 Identification of novel Wolbachia parasites.

140 Recently, we assembled and annotated closed genomes from two divergent

141 Wolbachia strains, named wCfeT and wCfeJ, that were captured during genome

142 sequencing of the cat flea (EL Colony) (45). The 16S rDNA sequences of these two

143 strains are identical to those previously identified in another cat flea colony (Louisiana

144 State University) (46–48), which historically has been replenished with EL colony fleas.

145 wCfeT and wCfeJ are broadly similar to other closed Wolbachia genomes in terms of

146 length, GC content, number of protein coding genes, and coding density (Table 1),

147 although it is interesting to note that wCfeT exhibits one of the highest pseudogene

148 counts (second only to wCle) while wCfeJ boasts one of the lowest (second only to

149 wOv). Coupled with other recently sequenced genomes for non-Supergroup A and B

150 strains (e.g., wFol and wVulC), these cat flea-associated wolbachiae blur the lines

151 demarcating genomic traits previously used to distinguish ancestral Wolbachia lineages

152 from those of Supergroups A and B strains (44).

153 Prior reports have indicated the presence of multiple Wolbachia “types” infecting C.

154 felis, with phylogenies inferred from partial gene sequences placing these unnamed

155 Wolbachia in supergroups B or F, or in ancestral lineages (49–51). Our robust genome- bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 8

156 based phylogeny estimation reveals that both wCfeT and wCfeJ are early branching

157 wolbachiae (Fig. 1; Table S1). wCfeT is similar to undescribed C. felis-associated

158 strains that branch ancestrally to most other Wolbachia lineages (51–53), while wCfeJ is

159 similar to undescribed C. felis-associated strains closely related to Wolbachia

160 supergroups C, D and F (49, 51). The substantial divergence of wCfeT and wCfeJ from

161 a Wolbachia supergroup B strain associated with C. felis from Germany (wCte)

162 indicates a diversity of wolbachiae that are capable of infecting cat fleas. Importantly,

163 the of host-microbe relationships for any of these C. felis-associated wolbachiae

164 is unknown.

165

166 wCfeT is equipped with biotin synthesis genes.

167 The enzymatic steps for bacterial biosynthesis of biotin from malonyl-CoA are largely

168 conserved (54) and usually involve six bio enzymes and the fatty acid biosynthesis

169 machinery (55) (Fig. 2A). We previously identified the first plasmid-borne bio gene

170 operon, which occurs on plasmid pREIS2 of Rickettsia buchneri, an of

171 the black-legged tick (21). We observed a rare gene order for the pREIS2 bio operon

172 shared only with bio operons encoded on the chromosomes of obligate intracellular

173 pathogens Neorickettsia spp. and Lawsonia intracellularis (Deltaproteobacteria). These

174 unique bio operons formed a clade in estimated phylogenies, indicating LGT of the

175 operon across divergent intracellular species. Subsequent studies, using our same

176 dataset for phylogeny estimation, discovered this bio operon in certain Wolbachia (19,

177 32, 56), (57, 58) and Legionella (59) species. We refer hereafter to this bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 9

178 intriguing LGT of bio genes as the Biotin synthesis Operon of Obligate intracellular

179 Microbes (BOOM).

180 wCfeT carries a complete BOOM with greater similarity to Wolbachia BOOM than

181 those of other bacteria (Fig. 2B). Phylogeny estimation of BOOM and other diverse bio

182 gene sets indicates wCfeT BOOM is ancestral to other Wolbachia BOOM (Fig. 2C, Fig.

183 S1). However, as the divergence pattern for other Wolbachia BOOM is discordant with

184 genome-based phylogeny (Fig. 1), it is clear that multiple independent BOOM

185 acquisitions have occurred in wolbachiae irrespective of lineage divergences. This is

186 supported by a previous study hypothesizing recent BOOM acquisition in wNfla and

187 wNleu (32), as well as the nature of BOOM flanking regions in the genomes of wCfeT,

188 wVulC, wCle, wStr, wLug, which collectively lack synteny and are riddled with

189 transposases, recombinases and phage-related elements (Table S3). Furthermore,

190 analysis of wCfeT BOOM flanking genes reveals hotspots for LGT, exemplified by an S-

191 adenosylmethionine importer (EamA) gene (Fig. S1B) that is a signature among many

192 diverse obligate intracellular bacteria (21, 60–62).

193 As some BOOM-containing Wolbachia strains have been shown to provide biotin to

194 their insect hosts (18, 19), we posit that wCfeT has established an obligate mutualism

195 with C. felis mediated by biotin-provisioning. This is consistent with a WO prophage in

196 the wCfeT genome that is highly colinear to the WOVitA1 prophage (30), except for the

197 absence of a EAM and hence no RP-inducing loci (Fig. S2). Wolbachia strains

198 associated with bedbug (wCle), planthopper (wStr, wLug), and termites (wCtub) with

199 evidence for BOOM (63) reside in bacteriocytes of their insect hosts, structures

200 associated with nutritional symbiosis in many arthropods (64). Interestingly, C. felis is bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 10

201 not known to contain bacteriocytes, and the ability of wCfeT to synthesize and

202 supplement biotin to cat fleas awaits experimentation.

203

204 wCfeJ lends insight on the evolution of reproductive parasitism.

205 wCfeJ carries a toxin-antidote (TA) operon that is architecturally similar to the

206 wPip_Pel CinA/B TA operon (Fig. 3A), which was previously characterized as a CI

207 inducer in flies (24). Grouped into three types (26, 65), CinB toxins harbor dual

208 nuclease (NUC) domains in place of the deubiquitinase (DUB) domain of CidB,

209 another CI inducer in flies (26–28). Chimeric NUC-DUB toxins, termed CndB toxins

210 (66), also occur as operons (cndAB) in certain Wolbachia and Rickettsia genomes (27,

211 67). Although their functions remain unknown, these larger CndB toxins belong to an

212 extraordinarily diverse array of toxins with highly modular architectures that occur in

213 divergent intracellular bacteria (67). Curiously, despite a size similar to the CinB toxins,

214 wCfeJ’s CinB toxin has much higher sequence identity to larger CndB and NUC-

215 containing toxins found mostly in Wolbachia Supergroups A and B (Fig. 3B). A similar

216 pattern was found for wCfeJ’s CinA antidote, indicating the genes were likely acquired

217 as an operon.

218 In contrast to wCfeT, wCfeJ harbors a highly degraded WO prophage genome with

219 only a partial EAM, the CinA/B operon, and several other genes encoding hypothetical

220 proteins and transposases (Fig. 3C). This is reminiscent of the wRec parasite, which

221 lacks a functional WO phage yet possesses TA modules within remnant EAMs (68).

222 Synteny analysis with other Wolbachia phage revealed wCfeJ cinB and three

223 downstream genes are colinear with a large (3707 aa) wDacB cndB toxin (Fig. 3D). bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 11

224 This provides strong evidence for the fission of a large modular CndB toxin into a

225 smaller CinB toxin. This interpretation is also consistent with our prior hypothesis

226 stating that more streamlined RP-inducing toxins (i.e., cinB and cidB) originate from

227 larger modular toxins (i.e., cndB and others) that are widespread in the intracellular

228 mobilome, particularly Wolbachia phage and Rickettsia plasmids and integrative and

229 conjugative elements (ICEs) (48, 67).

230 wCfeJ is unique among described early-branching wolbachiae by containing a CI-

231 like TA operon. In fact, the Wolbachia strains in closely related Supergroups C, F and D

232 are considered mutualists with their nematode and arthropod hosts. wFol, another

233 ancestral Wolbachia lineage (Fig. 1), does carry a myriad of large modular candidate

234 RP-inducing toxins but does not have cin, cid or cnd TA operons (69, 70). Collectively

235 this indicates acquisition of a WO prophage by wCfeJ, possibly from another

236 Supergroup A or B Wolbachia strain that also infects cat fleas (e.g., wCte), with viral or

237 Wolbachia level selection streamlining a cndAB locus into a cinAB operon. Given that

238 the genomes of many Wolbachia reproductive parasites harbor diverse arrays of

239 CinA/B-and CidA/B-like operons (66, 67), we posit that wCfeJ’s CinA/B TA operon might

240 function in CI or some other form of RP; however, further study is required to evaluate

241 wCfeJ as a cat flea reproductive parasite. Furthermore, an intriguing speculation is that

242 functions of the larger modular toxins, exemplified by wDacB CndB, whatever they be,

243 are likely the primal functions that gave rise to CI.

244

245 Lateral transfer of Wolbachia CI-like antidotes to the cat flea genome. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 12

246 Our previous work identified CI-like toxin and antidote genes on scaffolds in several

247 arthropod genome assemblies, although antidote genes were found more frequently

248 possibly due to utility in suppressing microbial invaders implementing RP (67). Many of

249 these genomes contain multiple divergent antidote genes, though none were found to

250 harbor large-scale Wolbachia gene transfer that is known to occur in certain

251 invertebrates (71–77). In light of this observation, we originally speculated that CI-like

252 genes are mobilized from wolbachiae to host genomes possibly through WO phage

253 introgression; however, the absence of other WO phage genes in these genomes made

254 this assertion tenuous. Instead, more recent work describing the first WO prophage

255 carrying two sets of CI loci provides compelling evidence for transposon-mediated LGT

256 of CI loci between divergent wolbachiae (38). This phage-independent mechanism for

257 RP gene purging and replenishment in wolbachiae may result in the inadvertent capture

258 of these genes by host genomes.

259 When we analyzed the cat flea genome for CI-like genes, we discovered that the

260 largest C. felis scaffold (185.5 Mb) was found to harbor two CI-like antidote genes (Fig.

261 4A). Remarkably, neither antidote has any significant similarity to CinA of wCfeJ (Fig.

262 4B), indicating that these CI antidotes originated from other, as yet undescribed

263 wolbachiae. The C. felis CinA-like CDS (XP_026474240.1) has greater similarity to

264 CinA proteins than CidA proteins, while the CidA-like CDS (not annotated in the current

265 C. felis assembly) has much greater similarity to CidA antidotes (27, 28, 78–80). The C.

266 felis cinA-like gene is found as an exon within a larger flea gene model

267 (LOC113377978), and is well supported (18X coverage, 7.2 TPM) by transcripts from

268 the 1KITE project (81). Since these transcripts were generated in fleas from a different bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 13

269 colony (Dr. Michael Dryden, Kansas State University), we conclude that this gene

270 transfer occurred during an historical infection predating the establishment of these two

271 colonies. Finally, the C. felis CinA- and CidA-like antidotes each possess the conserved

272 motifs we previously defined for all CI antidotes (67), with the exception of a truncation

273 of motifs 5 and 6 in the CinA-like protein (Fig. 4D). As other arthropod-encoded

274 antidotes share this C-terminal truncation, it may signify an expendable domain in the

275 host (i.e., a bacterial secretion signal, a motif involved in cognate toxin activation or

276 delivery, etc.).

277 The incorporation of a Wolbachia antidote into an exon of a host gene provides one

278 explanation for how such a laterally transferred gene would remain functional in the

279 absence of the native regulatory elements encoded on WO prophage genomes. We

280 previously hypothesized that CI antidote genes would be under strong selection in

281 arthropod genomes if their expression could result in rescuing CI induction by

282 reproductive parasites (67). In support of this supposition, it has been shown that: host

283 genotypes modulate CI (68, 82–84); some hosts show weak (Dmel and D. suzukii (35,

284 85, 86)) or strong ( pipiens and D. simulans (87, 88)) CI; and a Wolbachia strain’s

285 CI can be suppressed when transfected across different hosts (84). While none of the

286 arthropod-encoded CI antidotes have been characterized, we propose they impart

287 immunity to toxins secreted by intracellular parasites, curtailing chronic parasite drive

288 into populations. At least four independent evolutionary mechanisms might lead to

289 suppression of CI. Gene family expansion of, and/or overexpression of key factors

290 capable of counteracting CI mechanisms is possible; similarly, it was postulated that

291 evolution in Cid interacting proteins like karyopherin-α or P32 could lead to target site bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 14

292 resistance (25). A third possibility contributing to weakening CI, though not suppression

293 per se, is simply the mutational collapse of the operons themselves due to lack of

294 purifying selection (89). We supply evidence of a fourth possibility whereby CI might be

295 counteracted by germline expression of acquired CI antidotes. Significantly, our finding

296 raises awareness of a potential barrier to Wolbachia-mediated pathogen biocontrol

297 measures.

298

299 Characterization of wCfeT and wCfeJ infection of cat fleas.

300 Coupled with earlier reports indicating multiple early-branching wolbachiae lineages

301 infecting C. felis (49, 51), our characterization of two divergent Wolbachia strains

302 infecting cat fleas in the EL colony (45) as well as fleas maintained for a decade at LSU

303 (46–48) prompted us to test individual fleas for double infection. In addition to EL fleas,

304 three other colony strains and three geographically diverse wild cat flea populations

305 were also surveyed (Fig. 5A). Aside from the Modesto Wild strain, which is historically

306 replenished with wild-caught cat fleas (Dr. Bill Donahue, personal communication),

307 strong wCfeJ-wCfeT co-infection was found in colony fleas. Conversely, single wCfeT

308 infection dominated wild cat fleas, with only a few fleas from a large sampling

309 throughout Orange Co. CA harboring wCfeJ. While far greater sampling of both colony

310 and wild populations is necessary, our results hint at a strong selection for wCfeT over

311 wCfeJ in nature. The ability of wCfeJ to highly infect colony fleas may arise as a

312 consequence of artificial bottlenecks within colonies, which are known to strengthen CI

313 and maternal transmission (36). Nonetheless, the ability of wCfeT and wCfeJ to co-

314 infect individual cat fleas provides a system to study contrasting forces (nutritional bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 15

315 symbiosis and RP) that potentially drive multiple Wolbachia infections in a single

316 invertebrate host.

317 To gain more insight into the nature of the wCfeT-wCfeJ co-infection, we analyzed

318 the distribution of wCfeT and wCfeJ in EL flea tissues (Fig. 5B). Surmising that both

319 Wolbachia strains are maternally transmitted, we dissected out the of unfed

320 virgin female fleas and compared Wolbachia density in these tissues versus others

321 (primarily midgut). Counter to what we expected, wCfeT predominantly localizes in flea

322 ovaries, while the CI harbinger, wCfeJ, was predominantly in somatic tissues. Although

323 the two are not mutually exclusive and they do co-localize to some extent. The

324 significance of this distribution is unclear. wCfeJ may relocate to ovaries upon a mating

325 cue, or perhaps exert CI via some undefined somatic cell mechanism. Transmission of

326 wCfeJ may also be predominantly horizontal via environmental or direct flea contact

327 (per (40)). Sampling co-infected fleas at specific timepoints during development, as well

328 as determining wCfeJ tissue localization in the absence of wCfeT infection, will be

329 required to unravel the intertwined transmission dynamics of these Wolbachia strains in

330 cat fleas.

331

332 Evolution of Wolbachia Mutualism and Reproductive Parasitism

333 Order Rickettsiales is a highly diverse group of obligate intracellular bacteria (2, 62,

334 90). A rapid pace of newly identified lineages (60, 91–104) provides a rich source for

335 evaluating mechanisms of evolution within constraints of the eukaryotic cell. Phage,

336 plasmids, transposons, ICEs, and other insertion sequences are variably present across

337 rickettsial genomes and mobilize numerous genes to offset reductive genome evolution bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 16

338 and provide lifestyle altering traits (21). Our analyses of Wolbachia BOOM and RP

339 genes prompted us to evaluate the recurrent evolution of nutritional-mediated mutualism

340 and reproductive parasitism across diverse rickettsial lineages (Fig. 6).

341 Nutritional-mediated mutualism. Phylogenetic estimation supports three

342 independent acquisitions of biotin biosynthesis genes by rickettsial species (Fig. 6A,

343 Fig. S1A). The largest clade (group 1) contains all BOOM-containing species, though

344 only Neorickettsia spp., certain wolbachiae, R. buchneri, and the Peranema

345 trichophorum endosymbiont carry the conserved BOOM gene arrangement (highlighted

346 yellow in Fig. 6A). Other species in this group, as well as Groups 2 and 3, show

347 different gene arrangements indicating recombination and/or possible replacement of

348 specific bio genes through additional LGT (Fig. S3). BOOM and other bio gene clusters

349 have repeatedly lost bioH, which encodes the methyl esterase generating pimeloyl-CoA,

350 the last step in the synthesis of the pimelate moiety of biotin. Bacteria have evolved

351 numerous strategies for catalyzing this reaction (54, 105–112). Interestingly, a novel

352 methyl esterase of Ehrlichia chaffeensis (BioU) was recently shown to complement an

353 E. coli DbioH mutant (113); however, we determined that BioU orthologs are present in

354 many rickettsial species, including some that harbor a BioH already and others with no

355 bio synthesis genes at all (Fig. S4A-C). This indicates that while BioU may generate

356 pimeloyl-CoA in certain rickettsial species – namely, those with complete bio synthesis

357 modules except for BioH – it is likely to have a broad range of functions in bacteria (Fig.

358 S4D). Furthermore, some species lack the genes for both BioH and BioU, indicating

359 that alternative pathways for generating pimeloyl-CoA await discovery. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 17

360 Analysis of the genomic distribution of biotin synthesis genes, the biotin transporter

361 BioY, the biotin ligase BirA, and the enzymes involved in malonyl-CoA synthesis (biotin-

362 dependent propionyl-CoA carboxylase complex and FabD) revealed several strategies

363 for biotin utilization in Rickettsiales (Fig. 6A). Species that harbor a complete set of bio

364 genes (as discussed above) also contain ligase and utilization enzymes, although (not

365 surprisingly) most do not carry the BioY importer. Genera Wolbachia and Rickettsia

366 largely rely on parasitic scavenging of host biotin to fuel fatty acid biosynthesis: they are

367 dominated by species that lack bio genes but carry the BioY importer, BirA ligase, and

368 utilization enzymes. Two species of Transitional Group Rickettsiae (R. felis and R.

369 hoogstraalii) may be able to utilize imported dethiobiotin as well, as they retain the biotin

370 synthase BioB; alternatively, this may represent the frayed end of a decaying metabolic

371 pathway similar to the glycerol phospholipid and folate biosynthesis pathways in

372 Rickettsiae (13, 114). While characterized as two-component transporters (with

373 BioM/N), BioY subunits alone can efficiently transport biotin into cells (115) and also

374 participate with host transporters to relocate cytosolic biotin to vacuoles occupied by

375 pathogens (116). However, certain rickettsial species lacking synthesis genes also lack

376 BioY importers yet retain the ligase and utilization enzymes, indicating the presence of

377 alternative transport mechanisms. Remarkably, Orientia spp. lack all genes involved in

378 biotin synthesis, import, ligation and utilization, exhibiting the ultimate form of parasitism

379 for host-dependent fatty acid synthesis: these Rickettsiae must pilfer malonyl-CoA from

380 the host (117), using a transport system yet to be described.

381 The acquisition of BOOM by certain wolbachiae and R. buchneri provides a

382 theoretical framework for how LGT facilitates a transition from metabolite thievery to bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 18

383 nutritional-mediated symbiosis. However, it is important to consider that these

384 endoparasites may be utilizing the BOOM primarily for selfish purposes, as almost all

385 bacteria require biotin. For example, Neorickettsia spp. and the deltaproteobacterium

386 Lawsonia intracellularis are pathogens that also harbor the BOOM, suggesting a more

387 complex ecological role surrounding biotin acquisition. If Neorickettsia spp. do indeed

388 actively provision biotin to their trematode and insect vectors, as has been shown for

389 wCle (19), such host-specific supplementation would necessitate as-yet unidentified

390 genes encoding export factors as well as stringent regulation of the BOOM.

391 Canonical regulation of biotin synthesis is accomplished via a repressor, fused to the

392 N-terminal domain of the biotin ligase gene, which down-regulates transcription of the

393 bio gene cluster in response to high cellular biotin (118, 119). Interestingly, all rickettsial

394 BirA biotin ligases lack this repressor domain, indicating that alternative mechanisms for

395 regulating biotin synthesis and host supplementation are utilized by these bacteria.

396 “Candidatus Aquarickettsia rohweri” and the Trichoplax sp. H2 endosymbiont

397 (Midichloriaceae) lack BirA ligases altogether (Fig. 6A), and no other biotin ligases were

398 found in these genomes (data not shown). Regarding the Trichoplax sp. H2

399 endosymbiont, our prior analysis of the closely related Rickettsiales endosymbiont of

400 Trichoplax adhaerens (RETA) discovered two biotin-protein ligase N-terminal domain-

401 containing proteins (BPL_N) encoded in the T. adhaerens genome (62). These host-

402 encoded genes contain introns and eukaryotic regulatory elements yet are strongly

403 supported as rickettsial in origin. Homologs of these T. adhaerens proteins are also

404 present in the Trichoplax sp. H2 genome (120) (NCBI acc. nos. RDD42697 and

405 RDD42436), supporting their cross-domain LGT. This raises the possibility for a bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 19

406 metabolic interdependence involving biotin synthesis and utilization in placozoans and

407 their rickettsial symbionts, similar to the mosaic metabolic networks underpinning

408 nutritional symbioses in other systems (121–123) . For example, the nested tripartite

409 symbiosis between Planococcus citri mealybugs, Tremblaya princeps, and Moranella

410 endobia shows evidence for LGT of several bio genes to the mealybug genome (124);

411 these genes share highest similarity to Wolbachia BOOM counterparts (data not

412 shown). This is reminiscent of the pea aphid-Buchnera symbiosis, wherein several

413 rickettsial LGTs have been implicated in facilitating metabolic interdependence between

414 host and symbiont (125). For rickettsial of placozoans, host acquisition

415 of BPL_N genes (among others (62)) may have established an inextricable obligatory

416 relationship (126, 127) and may hint at the evolutionary fate of other BOOM-containing

417 rickettsiae. However, while our analyses of the BOOM and other bio genes in rickettsial

418 genomes are suggestive, in the absence of experimental validation we caution against

419 implicating these genes as factors underpinning nutritional-mediated mutualism.

420 Reproductive parasitism. Among other bacterial reproductive parasites of

421 arthropods (e.g., species of Cardinium, Arsenophonus and ) wolbachiae

422 are perhaps the best-studied for their roles in RP induction (23). Yet within the

423 Rickettsiales, wolbachiae do not walk alone in inducing RP in their hosts. Based on a

424 survey of the literature and analysis of genomic data for the ever-growing pool of RP-

425 inducing genes, we identify at least five strategies for RP induction that have evolved

426 throughout rickettsial evolution, with four independent gains in the

427 coupled to the widespread RP seen in wolbachiae (Fig. 6B). We posit that this model is

428 more parsimonious than RP as an ancestral rickettsial trait, given that RP-inducing bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 20

429 genes are mostly found embedded within mobile genetic elements (e.g., WO prophage

430 in Wolbachia, RAGE ICEs and plasmids in Rickettsia spp.).

431 RP-inducing genes are found in greater numbers and diversity in wolbachiae,

432 undoubtedly due to their strong association with EAMs of WO prophage genomes and

433 mobile elements (30, 65). In our previous work, we uncovered a link between the

434 characterized CI-inducing CinA/B and CidA/B operons of wolbachiae and a putative TA

435 operon encoded on the pLbAR plasmid of Rickettsia felis str. LSU-Lb, an obligate

436 endosymbiont of the booklouse Liposcelis bostrychophila (48). We posited that this TA

437 operon induces parthenogenesis in the booklouse: pLbAR is unknown in other strains of

438 R. felis which do not induce parthenogenesis, in booklice or other arthropods (48), and

439 booklice lacking R. felis str. LSU-Lb reproduce sexually (128, 129). Aside from being a

440 CndB toxin (harboring both the NUC and DUB domains of wolbachiae CinB and CidB

441 toxins, respectively) the large pLbAR toxin (3,241 aa) was found to carry several other

442 domains present in other Wolbachia proteins, most of which are also EAM-associated.

443 These domains, together with the large central domain (LCD) of the pLbAR toxin,

444 were utilized to identify an extraordinarily diverse array of candidate RP-inducing toxins

445 in a wide-range of intracellular bacteria, some of which are known reproductive

446 parasites (67). Importantly, because this large assemblage of proteins was identified

447 using sequence characteristics of the pLbAR toxin, we consider the collective family

448 linked within the portion of the intracellular mobilome driving the recurrent evolution of

449 RP in invertebrate-borne bacteria.

450 Aside from CI induction by CinA/B and CidA/B operons (24–27), virtually nothing is

451 known about the functions of related candidate RP-inducing genes. The function of the bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 21

452 CinA/B-like operon of O. tsutsugamushi is unclear. O. tsutsugamushi is the pathogen

453 causing scrub typhus and is capable of inducing parthenogenesis in Leptotrombidium

454 (130, 131). Given this fact, exploration of its corresponding CinA/B module is

455 warranted. Though the connection to RP in mites is unclear given that CinA/B only

456 occurs in the Boryong strain; a myriad of other proteins encoding the pLbAR LCD are

457 amplified in all Orientia genomes. Genome sequences are needed (but not yet

458 available) to determine toxin profiles for Rickettsia species either in ancestral lineages

459 (132–134) or Transitional Group rickettsiae (132, 135, 136) that induce parthenogenesis

460 in eulophid or trogiid booklice. Genome sequences are also lacking for

461 Rickettsia species from ancestral lineages known to induce male-killing in buprestid and

462 ladybird beetles (137–140). We previously mined an unpublished genome of the male

463 killer Rickettsia parasite of the ladybird beetle (141) and found a toxin

464 highly similar in architecture to the Spiroplasma male killer toxin (142). While both

465 toxins carry an ovarian tumor (OTU) cysteine protease domain (143), we found a

466 hodgepodge of toxin architectures carrying OTU domains from bacteria known to induce

467 other RP phenotypes, such as parthenogenesis (wFol (69)) and feminization (wVulC

468 (144)). Thus, it is clear that additional high-quality (closed) genomes are sorely needed

469 to correlate RP-inducing protein domains and RP phenotypes. New correlations will

470 lead to downstream functional analyses of target’s abilities to induce RP.

471 A large number of candidate proteins from the pLbAR-derived pool occur in species

472 that are not known as reproductive parasites. These include CndB toxins, NUC- and

473 DUB-domain containing proteins that are much larger than characterized CinB and CidB

474 toxins, diverse OTU-domain containing toxins, and especially proteins harboring the bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 22

475 LCD that may or may not exhibit toxin-like profiles (67). Some of these candidate toxins

476 appear to have cognate antidotes, and some genomes carry antidotes only, highlighting

477 the complex diversity of this biological function. Recently, an EAM-encoded

478 transcriptional factor in wMel termed WO-mediated killing (wmk) was shown to induce

479 male-specific lethality during early embryogenesis when expressed transgenically in

480 fruit flies (29). This male-killer gene was not detected as part of our pLbAR-derived

481 pool, indicating that RP-inducing gene arsenals are probably much larger than have

482 been described thus far, particularly considering that cin, cid and cnd loci are not

483 present in the genomes of Cardinium and Arsenophonus reproductive parasites (66, 67,

484 145) for which the mechanisms for RP induction remain unknown (146, 147).

485 Candidate RP-inducing toxins with similar features to the pLbAR toxin (large,

486 modular, and associated with mobile genetic elements) are found in several Wolbachia

487 genomes (e.g., wFol, wStr, wDacB, wVulC, wCon, wCed, wCauA). They are often in

488 conjunction with other cin and/or cid loci, yet not always associated with WO prophage.

489 Consistent with prior reports identifying LGT between Rickettsia and Wolbachia

490 genomes (21, 48, 148, 149), similar toxins in Rickettsia genomes (R. felis str. LSU-Lb,

491 R. gravesii, male killer of Adalia bipunctata) indicate a dynamic platform for

492 disseminating RP-inducing genes across diverse rickettsial species that includes

493 Rickettsia plasmids and WO prophage. Growing numbers of identified Rickettsia

494 plasmids (48), proliferative RAGE ICEs in Rickettsia and Orientia genomes (21, 117,

495 150–153), and the recent discovery of the first Wolbachia plasmid (154), attest to a

496 highly dynamic mobilome not previously realized for obligate intracellular bacteria. The

497 wCfeJ fission of a CndB-like toxin into a CinB toxin described in the current study (Fig. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 23

498 3D) supports the premise that larger toxins recombine into WO prophage and undergo

499 streamlining into discrete inducers of CI or other RP phenotypes. As with the evolution

500 of biotin utilization, Rickettsiales provides a framework to study the nature of RP in a

501 diverse array of bacteria that enjoy different relationships with their invertebrate and

502 vertebrate hosts.

503

504 Conclusion

505 Fleas are understudied vectors of several human and diseases (155). While

506 wolbachiae (mostly unnamed strains) have been detected in many flea species (50, 51,

507 156), it is pressing from a biocontrol perspective to know if these bacteria affect the

508 ability of fleas to vector pathogens. The cat flea transmits pathogenic bacteria such as

509 Rickettsia typhi (murine typhus), R. felis (murine typhus-like illness), spp.

510 (e.g., cat-scratch disease and various other bartonelloses), and to a lesser extent the

511 plague agent Yersinia pestis (157–160). Cat fleas are also intermediate hosts of the

512 tapeworm Dipylidium caninum and the filarial nematode Acanthocheilonema

513 reconditum, both of which can infect humans (161–163). Other organisms observed in

514 C. felis include undescribed Baculoviridae, amoebae, trypanosomatids, cephaline

515 gregarines (Apicomplexa), and microsporidia (164). However, nearly all epidemiological

516 studies have focused on determining the frequency and distribution of only certain

517 pathogens (e.g., Bartonella spp., R. typhi, R. felis), with few documenting wolbachiae

518 co-occurrence with these pathogens (165, 166).

519 The remarkable opposing features of wCfeT and wCfeJ offer testable hypotheses for

520 contrasting host relationships (nutritional symbiosis versus RP, respectively). They also bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 24

521 provide a framework to study the dynamics of Wolbachia co-infections, both in colonies

522 and wild populations. The wCfeT and wCfeJ genomes yield lessons on the role of LGT

523 in shaping host associations, highlighting the ephemeral nature of nutritional-based

524 mutualism and RP over a general rickettsial platform of obligate intracellular parasitism.

525 Host genome sequences provide clues to on-going mutualism and historical RP, and

526 our hypothesis for host CI-gene capture leading to suppression of RP illuminates

527 possible limitations for deployment of Wolbachia reproductive parasites to combat

528 arthropod transmission of human pathogens. In summary, our robust characterization

529 of two divergent strains of C. felis-associated wolbachiae, as well as the C. felis genome

530 itself (45), provides essential information for determining the impact of these bacteria on

531 cat flea biology and pathogen vectoral dynamics.

532

533 Materials and Methods

534 Assembly and annotation of wCfeT and wCfeJ. Our recent report detailed 1) the

535 assembly of Wolbachia reads generated from C. felis genome sequencing, 2) genome

536 circularization and naming of two novel strains (wCfeT and wCfeJ), and 3) complete

537 genome annotation (45). All relevant information pertaining to wCfeT and wCfeJ is

538 available at NCBI under Bioproject PRJNA622233.

539

540 Wolbachiae Phylogenomics. Protein sequences (n=84,836) for 66 sequenced

541 Wolbachia genomes plus 5 additional Anaplasmataceae (Neorickettsia helminthoeca

542 str. Oregon, Anaplasma centrale Israel, A. marginale Florida, Ehrlichia chaffeensis

543 Arkansas, and E. ruminantium Gardel) were either downloaded directly from NCBI bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 25

544 (n=48), retrieved as genome sequences from the NCBI Assembly database (n=13),

545 contributed via personal communication (n=8; Michael Gerth, Oxford Brookes

546 University), or sequenced here (n=2) (Table S1). For genomes lacking functional

547 annotations (n=15), gene models were predicted using the RAST v2.0 server (n=12) or

548 GeneMarkS-2 v1.10_1.07 (n=3; (167)). Ortholog groups (n=3,038) were subsequently

549 constructed using FastOrtho, an in-house version of OrthoMCL (168), using an expect

550 threshold of 0.01, percent identity threshold of 30%, and percent match length threshold

551 of 50% for ortholog inclusion. A subset of single-copy families (n=12) conserved across

552 at least 60 of the 66 genomes were independently aligned with MUSCLE v3.8.31 (169)

553 using default parameters, and regions of poor alignment were masked with trimal

554 v1.4.rev15 (170) using the "automated1" option. All modified alignments were

555 concatenated into a single data set (2,803 positions) for phylogeny estimation using

556 RAxML v8.2.4 (171), under the gamma model of rate heterogeneity and estimation of

557 the proportion of invariant sites. Branch support was assessed with 1,000 pseudo-

558 replications. Final ML optimization likelihood was -52350.085098.

559

560 BOOM characterization. Phylogeny estimation. Genomes (n=93) containing at

561 least 4 of the 6 biotin synthesis genes were identified using Blastp searches against the

562 NCBI nr database (e-value < 0.01; accessed January 9, 2020). Protein sequences for

563 these loci were downloaded from NCBI, aligned with MUSCLE v3.8.31 (169) using

564 default parameters, and regions of poor alignment masked with trimal v1.4.rev15 (170)

565 using the "automated1" option. All modified alignments were concatenated into a single

566 data set (1,457 positions) for phylogeny estimation using RAxML v8.2.4 (171), under the bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 26

567 gamma model of rate heterogeneity and estimation of the proportion of invariant sites.

568 Branch support was assessed with 1,000 pseudo-replications. Final ML optimization

569 likelihood was -127140.955066. See Table S2 for NCBI protein accession IDs for all

570 sequences included in this analysis.

571 Gene neighborhood analysis of wCfeT BOOM. Genes in a 25-gene window around

572 the wCfeT BOOM cluster were used to query (Blastp) the NCBI nr database (accessed

573 January 9, 2020). The top 50 matches to each sequence (e-value < 1, BLOSUM45

574 matrix) were binned by taxonomic assignment as "Wolbachia", "other Rickettsiales", or

575 "other (non-Rickettsiales) taxa." Other (non-Rickettsiales) taxa were further binned by

576 major taxonomic group to assess the boundaries and diversity of the wCfeT BOOM

577 MGE. Synteny analysis of biotin genes in wolbachiae. Rickettsiales genomes with at

578 least 5 of the 6 biotin synthesis genes (n=28) were downloaded from NCBI and ortholog

579 groups (n=2,527) constructed with FastOrtho, an in-house version of OrthoMCL (168),

580 using an expect threshold of 0.01, percent identity threshold of 30%, and percent match

581 length threshold of 50% for ortholog inclusion. OGs containing biotin synthesis genes

582 were identified using Rickettsia buchneri str. Wikel sequences as seeds. Genome

583 locations for all genes were retrieved from NCBI in gene file format (gff) and used to

584 order the orthologs in each genome.

585 Phylogenetic estimation of the EamA protein of wCfeT. Taxa were selected from

586 Blastp searches against the NCBI Rickettsiales and nr (excluding Rickettsiales)

587 databases using the wCfeT EamA protein (WP_168464633.1) as a query. Proteins

588 were aligned with MUSCLE v3.8.31 (169) using default parameters, with regions of poor

589 alignment masked using Gblocks (172). Phylogeny was estimated with the WAG bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 27

590 substitution model using RAxML v8.2.4 (171) under the gamma model of rate

591 heterogeneity and estimation of the proportion of invariant sites.

592

593 CI gene characterization. Wolbachiae. wCfeJ CinA and CinB were compared to

594 other RP-inducing TA operons following our previous approach (67). Another novel

595 CndAB operon was identified on a small scaffold (NCBI Nucleotide ID

596 CABPRJ010001055.1) in the aphid cedri, which has an associated Wolbachia

597 symbiont, wCed (173), and was also characterized in a similar manner. Briefly, EMBL’s

598 Simple Modular Architecture Research Tool (SMART) (174) and /or the Protein

599 Homology/analogY Recognition Engine V 2.0 (Phyre2) (175) were used to predict the

600 following domains: OTU cysteine protease (Pfam OTU, PF02338) (143), endoMS-like

601 (176) and NucS-like (177) PD-(D/E)XK nuclease (Pfam PDDEXK_1, PF12705) (178),

602 CE clan protease (Pfam Peptidase_C1, PF00112) (179), Latrotoxin-CTD (180), and

603 ankyrin repeats. Individual protein schemas were generated using Illustrator of

604 Biological Sequences (181) with manual adjustment.

605 C. felis. A cidA-like gene was identified in the Ctenocephalides felis genome during

606 initial genome decontamination (45). Briefly, a contig composed of a mosaic of flea-

607 and Wolbachia-like sequence was identified by our comparative BLAST-based pipeline

608 and flagged for manual inspection. Gene models on this contig were predicted using

609 the RAST v2.0 server. The resulting proteins were used to create a custom Blast

610 database that was queried using Blastp with CinAB sequences from Wolbachia

611 endosymbiont of Culex quinquefasciatus Pel (CAQ54390, CAQ54391), and CidAB

612 sequences Wolbachia pipientis wAlbB (CCE77512, CCE77513). The contig was bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 28

613 subsequently confirmed as belonging to the largest scaffold (NW_020538040) in the C.

614 felis assembly. A second, cinA-like gene was identified by querying all CDS in the

615 published C. felis assembly (NCBI accession ASM342690v1) with the same set of

616 Wolbachia CinAB and CidAB sequences.

617

618 Screening colonies and wild populations of cat fleas for wCfeT and wCfeJ.

619 Upon receipt from collaborators, cat fleas were stored at -20°C in EtOH until processing

620 for DNA extraction and qPCR. Fleas were sexed under a stereomicroscope and

621 surfaced sterilized for 5 min with 1% bleach solution, followed by 5 min with 70% EtOH,

622 and finally with three washes using molecular grade water. Individual fleas were flash-

623 frozen in liquid N2 and ground to powder with sterile pestle in a 1.5ml microcentrifuge

624 tube. DNA was extracted from dissected tissues using the Zymo Quick-DNA Miniprep

625 plus kit following the solid tissue protocol (Zymo Research; Irvine, CA). The presence

626 of wCfeT, wCfeJ, and C. felis DNA was determined via qPCR using the following primer

627 sets: Cfe#18S|179, 5’-TGCTCACCGTTTGACTTGG-3’ and 5’-

628 GTTTCTCAGGCTCCCTCTCC-3’ (182); wCfeT#ApaG|75: 5’-

629 GCCGTCACTGGCAGGTAATA-3’ and 5’-GCTGTTCTCCAATAACGCCA-3’,

630 wCfeJ#CinA|76, 5’-AGCAACACCAACATGCGATT-3’ and 5’-

631 GAACCCCAGAGTTGGAAGGG-3’. Each primer set amplicon was sequenced before

632 use to ensure specificity. qPCR was performed using a QuantStudio 3 with the

633 PowerUp SYBR green mastermix in a 20µl reaction volume containing 2µl of sample

634 DNA with 400nM of each primer. Cycling parameters were consistent with the “fast

635 cycling mode” including a 2 min cycle at 50°C, a 2 min cycle at 95°C, 40 cycles with 3 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 29

636 sec at 95°C and 30 seconds at 60°C followed by a melt curve analysis. Standard

637 curves were generated to quantify the number of DNA copies present in each reaction.

638 Results were expressed as the ratio of Wolbachia amplicons to C. felis 18S rDNA

639 copies to normalize for varying efficiency of DNA extractions. Fleas were considered

640 positive for wolbachiae if the flea 18S rDNA Ct value is 30 or lower and the Wolbachia

641 Ct value is 37 or lower.

642

643 Determining wCfeT and wCfeJ localization in C. felis tissues. Elward

644 Laboratories (EL; Soquel, CA) cat fleas stored at -80°C were used to assay tissue

645 localization of wCfeT and wCfeJ. Ovaries and midguts of individual female fleas were

646 dissected under a stereomicroscope. DNA extraction and qPCR were performed as

647 described above.

648

649 Data availability. All relevant information pertaining to wCfeT and wCfeJ is

650 available at NCBI under Bioproject PRJNA622233. Complete genome sequences are

651 available in the NCBI RefSeq database at accession identifiers NZ_CP051156.1

652 (wCfeT) and NZ_CP051157.1 (wCfeJ).

653

654

655 SUPPLEMENTAL MATERIAL

656 Data as well as custom analysis scripts generated for this project that are not published

657 elsewhere are provided on GitHub in the "wCfe_genomes" repository available at

658 https://www.github.com/wvuvectors/wCfe_genomes. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 30

659

660 TABLE S1. Information supporting the Wolbachia genome-based phylogeny estimation.

661

662 TABLE S2. Information supporting the phylogeny estimation of biotin genes.

663

664 TABLE S3. Information supporting the comparative analysis of BOOM gene

665 neighborhoods in wolbachiae genomes.

666

667 FIG S1. Phylogeny estimations of biotin synthesis enzymes and EamA transporters.

668 (A) Complete phylogeny estimation of BOOM and other bio gene sets from diverse

669 bacteria. Tree was estimated from the concatenation of six bio enzymes (BioC, BioH,

670 BioF, BioA, BioD, and BioB) or subsets in certain cases (see Table S2 for all sequence

671 information and Materials and Methods for details on dataset processing and tree

672 estimation). Branch support was assessed with 1,000 pseudo-replications. Final ML

673 optimization likelihood was -127140.955066. In taxon color scheme, [R] denotes

674 Rickettsiales, with Holosporaceae [H] as a revised family of Rhodospirillales (101). The

675 families of Rickettsiales are similarly noted: Anaplasmataceae [A], Midichloriaceae [M],

676 Rickettsiaceae [R], and “Candidatus Deianiaeaceae” [D], with the latter considered

677 provisional (91).

678 (B) Estimated phylogeny of EamA transporters. See Materials and Methods for details

679 on dataset processing and tree estimation. Branch support was assessed with 1,000

680 pseudo-replications. Final ML optimization likelihood was -23058.797227. Taxon color

681 scheme as described in panel A. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 31

682

683 FIG S2. In silico characterization of the wCfeT prophage genome.

684 Schema shows the comparison of the wCfeT and WOVitA1 prophage genomes, with

685 gene descriptions following prior studies (26, 30, 148, 183). wCfeT genes within the

686 typical Eukaryotic Association Module (EAM) were used as queries in Blastp searches

687 against the NCBI nr protein database, with summary statistics provided for top subjects

688 (dashed box).

689

690 FIG S3. Synteny analysis of biotin synthesis genes in Rickettsiales genomes.

691 Rickettsiales taxa with at least 5 of 6 biotin synthesis genes (n=28) were analyzed for

692 co-linearity of their biotin genes. Ortholog groups (n=2,527) were constructed from

693 these genomes as described for Fig. 1 (see Materials and Methods) and OGs

694 containing biotin synthesis genes identified. Genome locations were retrieved from

695 NCBI in gene file format (gff) and used to order the orthologs in each genome. For

696 unclosed genomes, all contigs were included in OG construction; only a single genome

697 (Candidatus Aquarickettsia rohweri) contained biotin synthesis genes on multiple

698 contigs. Biotin genes are colored according to the ideal BOOM shown at the top. Black

699 boxes indicate stretches of unrelated genes. Taxa are grouped into 3 classes according

700 to the synteny of biotin genes in their genomes: Class I taxa demonstrate completely

701 conserved gene order; Class II taxa contain only a single gene out of order (Class IIA:

702 BioA; Class IIB: BioB); Class III taxa exhibit little or no conservation of gene order. All

703 blocks are anchored on BioB for ease of comparison, except Class IIB which are

704 anchored on BioF instead. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 32

705

706 FIG S4. Bioinformatics analysis of BioU and related methyl esterases.

707 (A) Alignment of Ehrlichia chaffeensis and Anaplasma centrale BioU proteins with

708 orthologs from other Rickettsiales species. BioU orthologs were retrieved from Blastp

709 searches against the NCBI Rickettsiales databse using E. chaffeensis str. Sapulpa BioU

710 (EAM85518.1) as a query. The catalytic residues (Ser–Asp–His) characteristic of

711 methyl esterases are highlighted yellow (184). Sequences were aligned with MUSCLE

712 v3.8.31 (169) using default parameters. NCBI protein accession numbers are listed in

713 panel C.

714 (B) Percent identity matrix calculated from the alignment shown in panel A.

715 (C) Estimated phylogeny of proteins shown in panel A. Two alphaproteobacterial

716 outgroups and two hymenopteran mitochondrial methyl esterases are included.

717 Phylogeny was estimated using RAxML v8.2.4 (171) under the gamma model of rate

718 heterogeneity and estimation of the proportion of invariant sites. Branch support was

719 assessed with 1,000 pseudo-replications. Final ML optimization likelihood was -

720 5448.767799.

721 (D) Analysis of eight rickettsial genomes that contain biotin synthesis genes except for

722 the BioH methyl esterase. Blastp searches against the rickettsial genomes were

723 conducted using the following queries, all of which have been shown to orthogonally

724 substitute for BioH: BioV of Helicobacter sp. L8b (TSA87024.1); BioG of Haemophilus

725 influenzae str. CGSHiCZ412602 (AIB45959.1); BtsA of Moraxella catarrhalis str. CCUG

726 353 (KZR94829.1); BioJ of marina str. E103-15 (QEO56965.1); BioZ of bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 33

727 Mesorhizobium japonicum str. R7A (AAG47795.1); EstN1 of Nitrososphaera viennensis

728 str. EN76 (AIC14436.1); BioK of Prochlorococcus marinus str. SS51 (KGG31655.1).

729

730 ACKNOWLEDGMENTS

731 We acknowledge Dr. Michael Gerth (University of Liverpool) for sharing unpublished

732 contigs from sequencing projects for Wolbachia endosymbiont of Zootermopsis

733 nevadensis (wZoo), Wolbachia endosymbiont of Mengenilla moldrzyki (wMen), and

734 Wolbachia endosymbiont of Ctenocephalides felis (wCte). We are grateful to Dr.

735 Michael Dryden (Kansas State University), Dr. Bill Donahue (Sierra Research

736 Laboratories, Modesto, CA), Dr. Lance Durden (Georgia Southern University), and Eric

737 Dotseth (Division of Infectious Disease Epidemiology, West Virginia Department of

738 Health and Human Resources) for contributing cat fleas.

739 This work was supported with funds from the National Institutes of Health National

740 Institute of Allergy and Infectious Diseases grants (R21AI26108 and R21AI146773 to

741 JJG, R01AI017828 and R01AI126853 to AFA, and R01AI122672 to KRM). TPD and

742 VIV were supported by start-up funding provided to TPD by West Virginia University.

743 JFB was supported by start-up funds from Auburn University, USDA Hatch Grant

744 (1015922) and an Alabama Agricultural Experiment Station SEED grant. The content is

745 solely the responsibility of the authors and does not necessarily represent the official

746 views of the funding agencies. The funders had no role in study design, data collection

747 and analysis, decision to publish, or preparation of the manuscript.

748

749 REFERENCES bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 34

750 1. Hilgenboecker K, Hammerstein P, Schlattmann P, Telschow A, Werren JH. 2008.

751 How many species are infected with Wolbachia? - A statistical analysis of current

752 data. FEMS Microbiol Lett 281:215–220.

753 2. Gillespie JJ, Nordberg EK, Azad AA, Sobral BW. 2012. Phylogeny And

754 Comparative Genomics: The Shifting Landscape In The Genomics Era, p. 84–

755 141. In A.F. Azad and G.H. Palmer (ed.), Intracellular Pathogens II: Rickettsiales.

756 American Society of Microbiology, Boston.

757 3. Augustinos AA, Santos-Garcia D, Dionyssopoulou E, Moreira M, Papapanagiotou

758 A, Scarvelakis M, Doudoumis V, Ramos S, Aguiar AF, Borges PA V, Khadem M,

759 Latorre A, Tsiamis G, Bourtzis K. 2011. Detection and characterization of

760 Wolbachia infections in natural populations of aphids: is the hidden diversity fully

761 unraveled? PLoS One 6:e28695.

762 4. Baldo L, Werren JH. 2007. Revisiting Wolbachia supergroup typing based on

763 WSP: spurious lineages and discordance with MLST. Curr Microbiol 55:81–7.

764 5. Bing XL, Xia WQ, Gui JD, Yan GH, Wang XW, Liu SS. 2014. Diversity and

765 evolution of the Wolbachia endosymbionts of Bemisia (Hemiptera: Aleyrodidae)

766 whiteflies. Ecol Evol 4:2714–2737.

767 6. Glowska E, Dragun-Damian A, Dabert M, Gerth M. 2015. New Wolbachia

768 supergroups detected in quill mites (Acari: Syringophilidae). Infect Genet Evol

769 30:140–146.

770 7. Haegeman A, Vanholme B, Jacob J, Vandekerckhove TTM, Claeys M, Borgonie

771 G, Gheysen G. 2009. An endosymbiotic bacterium in a plant-parasitic nematode:

772 member of a new Wolbachia supergroup. Int J Parasitol 39:1045–54. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 35

773 8. Lo N, Casiraghi M, Salati E, Bazzocchi C, Bandi C. 2002. How Many Wolbachia

774 Supergroups Exist? Mol Biol Evol 19:341–346.

775 9. Ros VID, Fleming VM, Feil EJ, Breeuwer JAJ. 2009. How diverse is the

776 Wolbachia? Multiple-gene sequencing reveals a putatively new Wolbachia

777 supergroup recovered from mites (Acari: Tetranychidae). Appl Environ

778 Microbiol 75:1036–43.

779 10. Chung M, Munro JB, Tettelin H, Dunning Hotopp JC. 2018. Using Core Genome

780 Alignments To Assign Bacterial Species. mSystems 3.

781 11. Lindsey ARI, Bordenstein SR, Newton ILG, Rasgon JL. 2016. Wolbachia pipientis

782 should not be split into multiple species: A response to Ramírez-Puebla et al. Syst

783 Appl Microbiol 39:220–222.

784 12. Ramírez-Puebla ST, Servín-Garcidueñas LE, Ormeño-Orrillo E, Vera-Ponce de

785 León A, Rosenblueth M, Delaye L, Martínez J, Martínez-Romero E. 2015.

786 Species in Wolbachia? Proposal for the designation of ‘Candidatus Wolbachia

787 bourtzisii’, ‘Candidatus Wolbachia onchocercicola’, ‘Candidatus Wolbachia

788 blaxteri’, ‘Candidatus Wolbachia brugii’, ‘Candidatus Wolbachia taylori’,

789 ‘Candidatus Wolbachia collembol. Syst Appl Microbiol 38:390–399.

790 13. Driscoll TP, Verhoeve VI, Guillotte ML, Lehman SS, Rennoll SA, Beier-Sexton M,

791 Rahman MS, Azad AF, Gillespie JJ. 2017. Wholly Rickettsia ! Reconstructed

792 Metabolic Profile of the Quintessential Bacterial Parasite of Eukaryotic Cells. MBio

793 8:e00859-17.

794 14. Jiménez NE, Gerdtzen ZP, Olivera-Nappa Á, Salgado JC, Conca C. 2019. A

795 systems biology approach for studying Wolbachia metabolism reveals points of bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 36

796 interaction with its host in the context of arboviral infection. PLoS Negl Trop Dis

797 13:e0007678.

798 15. Zug R, Hammerstein P. 2015. Bad guys turned nice? A critical assessment of

799 Wolbachia mutualisms in arthropod hosts. Biol Rev Camb Philos Soc.

800 16. Hosokawa T, Koga R, Kikuchi Y, Meng XY, Fukatsu T. 2010. Wolbachia as a

801 bacteriocyte-associated nutritional mutualist. Proc Natl Acad Sci U S A 107:769–

802 774.

803 17. Moriyama M, Nikoh N, Hosokawa T, Fukatsu T. 2015. Riboflavin Provisioning

804 Underlies Wolbachia’s Fitness Contribution to Its Insect Host. MBio 6:e01732-15.

805 18. Ju J-F, Bing X-L, Zhao D-S, Guo Y, Xi Z, Hoffmann AA, Zhang K-J, Huang H-J,

806 Gong J-T, Zhang X, Hong X-Y. 2019. Wolbachia supplement biotin and riboflavin

807 to enhance reproduction in planthoppers. ISME J 1–12.

808 19. Nikoh N, Hosokawa T, Moriyama M, Oshima K, Hattori M, Fukatsu T. 2014.

809 Evolutionary origin of insect-Wolbachia nutritional mutualism. Proc Natl Acad Sci

810 U S A 111:10257–62.

811 20. Newton ILG, Rice DW. 2020. The Jekyll and Hyde symbiont: Could Wolbachia be

812 a nutritional mutualist? J Bacteriol. American Society for Microbiology.

813 21. Gillespie JJ, Joardar V, Williams KP, Driscoll TP, Hostetler JB, Nordberg E,

814 Shukla M, Walenz B, Hill CA, Nene VM, Azad AF, Sobral BW, Caler E. 2012. A

815 Rickettsia genome overrun by mobile genetic elements provides insight into the

816 acquisition of genes characteristic of an obligate intracellular lifestyle. J Bacteriol

817 194:376–94.

818 22. Werren JH, Baldo L, Clark ME. 2008. Wolbachia: master manipulators of bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 37

819 invertebrate biology. Nat Rev Microbiol 6:741–51.

820 23. Hurst GDD, Frost CL. 2015. Reproductive parasitism: maternally inherited

821 symbionts in a biparental world. Cold Spring Harb Perspect Biol 7:a017699.

822 24. Chen H, Ronau JA, Beckmann JF, Hochstrasser M. 2019. A Wolbachia Nuclease

823 and Its Binding Partner Comprise a Novel Mechanism for Induction of

824 Cytoplasmic Incompatibility.

825 25. Beckmann JF, Sharma GD, Mendez L, Chen H, Hochstrasser M. 2019. The

826 wolbachia cytoplasmic incompatibility enzyme CIDB targets nuclear import and

827 protamine-histone exchange factors. Elife 8.

828 26. LePage DP, Metcalf JA, Bordenstein SR, On J, Perlmutter JI, Shropshire JD,

829 Layton EM, Funkhouser-Jones LJ, Beckmann JF, Bordenstein SR. 2017.

830 Prophage WO genes recapitulate and enhance Wolbachia-induced cytoplasmic

831 incompatibility. Nature 543:243–247.

832 27. Beckmann JF, Ronau JA, Hochstrasser M. 2017. A Wolbachia deubiquitylating

833 enzyme induces cytoplasmic incompatibility. Nat Microbiol 2:17007.

834 28. Beckmann JF, Fallon AM. 2013. Detection of the Wolbachia protein WPIP0282 in

835 mosquito spermathecae: Implications for cytoplasmic incompatibility. Insect

836 Biochem Mol Biol 43:867–878.

837 29. Perlmutter JI, Bordenstein SR, Unckless RL, LePage DP, Metcalf JA, Hill T,

838 Martinez J, Jiggins FM, Bordenstein SR. 2019. The phage gene wmk is a

839 candidate for male killing by a bacterial endosymbiont. PLoS Pathog 15.

840 30. Bordenstein SR, Bordenstein SR. 2016. Eukaryotic association module in phage

841 WO genomes from Wolbachia. Nat Commun 7:13155. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 38

842 31. Bandi C, Anderson TJC, Genchi C, Blaxter ML. 1998. Phylogeny of Wolbachia in

843 filarial nematodes. Proc R Soc B Biol Sci 265:2407–2413.

844 32. Gerth M, Bleidorn C. 2017. Comparative genomics provides a timeframe for

845 Wolbachia evolution and exposes a recent biotin synthesis operon transfer. Nat

846 Microbiol 2:16241.

847 33. Turelli M, Hoffmann AA. 1991. Rapid spread of an inherited incompatibility factor

848 in California . Nature 353:440–442.

849 34. Schmidt TL, Barton NH, Rašić G, Turley AP, Montgomery BL, Iturbe-Ormaetxe I,

850 Cook PE, Ryan PA, Ritchie SA, Hoffmann AA, O’Neill SL, Turelli M. 2017. Local

851 introduction and heterogeneous spatial spread of dengue-suppressing Wolbachia

852 through an urban population of . PLoS Biol 15.

853 35. Cooper BS, Ginsberg PS, Turelli M, Matute DR. 2017. Wolbachia in the

854 Drosophila yakuba Complex: Pervasive Frequency Variation and Weak

855 Cytoplasmic Incompatibility, but No Apparent Effect on .

856 Genetics 205:333–351.

857 36. Turelli M. 1994. EVOLUTION OF INCOMPATIBILITY-INDUCING MICROBES

858 AND THEIR HOSTS. Evolution 48:1500–1513.

859 37. Raychoudhury R, Baldo L, Oliveira DCSG, Werren JH. 2009. Modes of acquisition

860 of Wolbachia: Horizontal transfer, hybrid introgression, and codivergence in the

861 species complex. Evolution (N Y) 63:165–183.

862 38. Cooper BS, Vanderpool D, Conner WR, Matute DR, Turelli M. 2019. Wolbachia

863 Acquisition by Drosophila yakuba -Clade Hosts and Transfer of Incompatibility

864 Loci Between Distantly Related Wolbachia. Genetics 212:1399–1419. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 39

865 39. Turelli M, Cooper BS, Richardson KM, Ginsberg PS, Peckenpaugh B, Antelope

866 CX, Kim KJ, May MR, Abrieux A, Wilson DA, Bronski MJ, Moore BR, Gao JJ,

867 Eisen MB, Chiu JC, Conner WR, Hoffmann AA. 2018. Rapid Global Spread of

868 wRi-like Wolbachia across Multiple Drosophila. Curr Biol 28:963-971.e8.

869 40. Pietri JE, DeBruhl H, Sullivan W. 2016. The rich somatic life of Wolbachia.

870 Microbiologyopen. Blackwell Publishing Ltd.

871 41. Ramirez-Puebla ST, Ormeno-Orrillo E, Vera-Ponce de Leon A, Lozano L,

872 Sanchez A, Rosenblueth M, Martinez-Romero E. 2016. Genomes of Candidatus

873 Wolbachia Bourtzisii wDacA and Candidatus Wolbachia Pipientis wDacB from the

874 Cochineal Insect Dactylopius coccus (Hemiptera: Dactylopiidae). G3&#58;

875 Genes|Genomes|Genetics 6:3343–3349.

876 42. Gotoh T, Noda H, Ito S. 2007. Cardinium symbionts cause cytoplasmic

877 incompatibility in spider mites. Heredity (Edinb) 98:13–20.

878 43. Chu C, Hoffmann M, Braswell WE, Pelz-Stelinski KS. 2019. Genetic variation and

879 potential coinfection of Wolbachia among widespread Asian citrus psyllid (

880 Diaphorina citri Kuwayama) populations. Insect Sci 26:671–682.

881 44. Landmann F. 2019. The Wolbachia Endosymbionts. Microbiol Spectr 7.

882 45. Driscoll TP, Verhoeve VI, Gillespie J, Johnston JS, Guillotte ML, Rennoll-Bankert

883 KE, Rahman MS, Hagen D, Elsik CG, Macaluso K, Azad A. 2020. Cat fleas in

884 flux: Rampant gene duplication, genome size plasticity, and paradoxical

885 Wolbachia infection. BMC Biol.

886 46. Pornwiroon W, Kearney MT, Husseneder C, Foil LD, Macaluso KR. 2007.

887 Comparative microbiota of Rickettsia felis-uninfected and -infected colonized cat bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 40

888 fleas, Ctenocephalides felis. ISME J 1:394–402.

889 47. Sunyakumthorn P, Bourchookarn A, Pornwiroon W, David C, Barker SA,

890 Macaluso KR. 2008. Characterization and growth of polymorphic Rickettsia felis in

891 a tick cell line. Appl Environ Microbiol 74:3151–8.

892 48. Gillespie JJ, Driscoll TP, Verhoeve VI, Utsuki T, Husseneder C, Chouljenko VN,

893 Azad AF, Macaluso KR. 2015. Genomic Diversification in Strains of Rickettsia

894 felis Isolated from Different Arthropods. Genome Biol Evol 7:35–56.

895 49. Casiraghi M, Bordenstein SR, Baldo L, Lo N, Beninati T, Wernegreen JJ, Werren

896 JH, Bandi C. 2005. Phylogeny of Wolbachia pipientis based on gltA, groEL and

897 ftsZ gene sequences: clustering of arthropod and nematode symbionts in the F

898 supergroup, and evidence for further diversity in the Wolbachia tree. Microbiology

899 151:4015–4022.

900 50. Dittmar K, Whiting MF. 2004. New Wolbachia endosymbionts from Nearctic and

901 Neotropical fleas (Siphonaptera). J Parasitol 90:953–7.

902 51. Gorham CH, Fang QQ, Durden LA. 2003. Wolbachia endosymbionts in fleas

903 (Siphonaptera). J Parasitol 89:283–9.

904 52. González-Álvarez VH, de Mera IGF, Cabezas-Cruz A, de la Fuente J, Ortega-

905 Morales AI, Almazán C. 2017. Molecular survey of Rickettsial organisms in

906 ectoparasites from a dog shelter in Northern Mexico. Vet Parasitol Reg Stud

907 Reports 10:143–148.

908 53. Vasconcelos EJR, Billeter SA, Jett LA, Meinersmann RJ, Barr MC, Diniz PPVP,

909 Oakley BB. 2018. Assessing Cat Flea Microbiomes in Northern and Southern

910 California by 16S rRNA Next-Generation Sequencing. Vector-Borne Zoonotic Dis bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 41

911 18:491–499.

912 54. Lin S, Cronan JE. 2011. Closing in on complete pathways of biotin biosynthesis.

913 Mol Biosyst.

914 55. Lin S, Hanson RE, Cronan JE. 2010. Biotin synthesis begins by hijacking the fatty

915 acid synthetic pathway. Nat Chem Biol 6:682–688.

916 56. Balvín O, Roth S, Talbot B, Reinhardt K. 2018. Co- in bedbug

917 Wolbachia parallel the pattern in nematode hosts. Sci Rep 8:8797.

918 57. Penz T, Schmitz-Esser S, Kelly SE, Cass BN, Müller A, Woyke T, Malfatti SA,

919 Hunter MS, Horn M. 2012. Comparative Genomics Suggests an Independent

920 Origin of Cytoplasmic Incompatibility in Cardinium hertigii. PLoS Genet

921 8:e1003012.

922 58. Zeng Z, Fu Y, Guo D, Wu Y, Ajayi OE, Wu Q. 2018. Bacterial endosymbiont

923 Cardinium cSfur genome sequence provides insights for understanding the

924 symbiotic relationship in Sogatella furcifera host. BMC Genomics 19:688.

925 59. Ríhová J, Nováková E, Husník F, Hypša V. 2017. Legionella Becoming a

926 Mutualist: Adaptive Processes Shaping the Genome of Symbiont in the Louse

927 Polyplax serrata. Genome Biol Evol 9:2946–2957.

928 60. Klinges JG, Rosales SM, McMinds R, Shaver EC, Shantz AA, Peters EC, Eitel M,

929 Wörheide G, Sharp KH, Burkepile DE, Silliman BR, Vega Thurber RL. 2019.

930 Phylogenetic, genomic, and biogeographic characterization of a novel and

931 ubiquitous marine invertebrate-associated Rickettsiales parasite, Candidatus

932 Aquarickettsia rohweri, gen. nov., sp. nov. ISME J 13:2938–2953.

933 61. Haferkamp I, Penz T, Geier M, Ast M, Mushak T, Horn M, Schmitz-Esser S. 2013. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 42

934 The Endosymbiont Amoebophilus asiaticus Encodes an S-Adenosylmethionine

935 Carrier That Compensates for Its Missing Methylation Cycle. J Bacteriol

936 195:3183–3192.

937 62. Driscoll TP, Gillespie JJ, Nordberg EK, Azad AF, Sobral BW. 2013. Bacterial DNA

938 sifted from the Trichoplax adhaerens (Animalia: Placozoa) genome project

939 reveals a putative rickettsial endosymbiont. Genome Biol Evol 5:621–45.

940 63. Hellemans S, Kaczmarek N, Marynowska M, Calusinska M, Roisin Y, Fournier D.

941 2019. Bacteriome-associated Wolbachia of the parthenogenetic termite

942 Cavitermes tuberosus. FEMS Microbiol Ecol 95.

943 64. Douglas AE. 2015. Multiorganismal : diversity and function of resident

944 . Annu Rev Entomol 60:17–34.

945 65. Lindsey ARI, Rice DW, Bordenstein SR, Brooks AW, Bordenstein SR, Newton

946 ILG. 2018. Evolutionary genetics of cytoplasmic incompatibility genes cifA and

947 cifB in prophage WO of Wolbachia. Genome Biol Evol.

948 66. Beckmann JF, Bonneau M, Chen H, Hochstrasser M, Poinsot D, Merçot H, Weill

949 M, Sicard M, Charlat S. 2019. The Toxin–Antidote Model of Cytoplasmic

950 Incompatibility: Genetics and Evolutionary Implications. Trends Genet.

951 67. Gillespie JJ, Driscoll TP, Verhoeve VI, Rahman MS, Macaluso KR, Azad AF.

952 2018. A Tangled Web: Origins of Reproductive Parasitism. Genome Biol Evol

953 10:2292–2309.

954 68. Metcalf JA, Jo M, Bordenstein SR, Jaenike J, Bordenstein SR. 2014. Recent

955 genome reduction of Wolbachia in Drosophila recens targets phage WO and

956 narrows candidates for reproductive parasitism. PeerJ 2:e529. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 43

957 69. Faddeeva-Vakhrusheva A, Kraaijeveld K, Derks MFL, Anvar SY, Agamennone V,

958 Suring W, Kampfraath AA, Ellers J, Le Ngoc G, van Gestel CAM, Mariën J, Smit

959 S, van Straalen NM, Roelofs D. 2017. Coping with living in the soil: the genome of

960 the parthenogenetic Folsomia candida. BMC Genomics 18:493.

961 70. Kampfraath AA, Klasson L, Anvar SY, Vossen RHAM, Roelofs D, Kraaijeveld K,

962 Ellers J. 2019. Genome expansion of an obligate parthenogenesis-associated

963 Wolbachia poses an exception to the symbiont reduction model. BMC Genomics

964 20.

965 71. Leclercq S, Thézé J, Chebbi MA, Giraud I, Moumen B, Ernenwein L, Grève P,

966 Gilbert C, Cordaux R. 2016. Birth of a W sex chromosome by horizontal transfer

967 of Wolbachia bacterial symbiont genome. Proc Natl Acad Sci U S A 113:15036–

968 15041.

969 72. Hotopp JCD, Clark ME, Oliveira DCSG, Foster JM, Fischer P, Torres MCM,

970 Giebel JD, Kumar N, Ishmael N, Wang S, Ingram J, Nene R V., Shepard J,

971 Tomkins J, Richards S, Spiro DJ, Ghedin E, Slatko BE, Tettelin H, Werren JH.

972 2007. Widespread Lateral Gene Transfer from Intracellular Bacteria to

973 Multicellular Eukaryotes. Science (80- ) 317:1753–1756.

974 73. Aikawa T, Anbutsu H, Nikoh N, Kikuchi T, Shibata F, Fukatsu T. 2009. Longicorn

975 beetle that vectors pinewood nematode carries many Wolbachia genes on an

976 autosome. Proc R Soc B Biol Sci 276:3791–3798.

977 74. Kondo N, Nikoh N, Ijichi N, Shimada M, Fukatsu T. 2002. Genome fragment of

978 Wolbachia endosymbiont transferred to X chromosome of host insect. Proc Natl

979 Acad Sci U S A 99:14280–14285. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 44

980 75. Dhaygude K, Nair A, Johansson H, Wurm Y, Sundström L. 2019. The first draft

981 genomes of the ant Formica exsecta, and its Wolbachia endosymbiont reveal

982 extensive gene transfer from endosymbiont to host. BMC Genomics 20.

983 76. McNulty SN, Foster JM, Mitreva M, Hotopp JCD, Martin J, Fischer K, Wu B, Davis

984 PJ, Kumar S, Brattig NW, Slatko BE, Weil GJ, Fischer PU. 2010. Endosymbiont

985 DNA in endobacteria-free filarial nematodes indicates ancient horizontal genetic

986 transfer. PLoS One 5.

987 77. Werren JH, Richards S, Desjardins CA, Niehuis O, Gadau J, Colbourne JK,

988 Beukeboom LW, Desplan C, Elsik CG, Grimmelikhuijzen CJP, Kitts P, Lynch JA,

989 Murphy T, Oliveira DCSG, Smith CD, Van Zande L De, Worley KC, Zdobnov EM,

990 Aerts M, Albert S, Anaya VH, Anzola JM, Barchuk AR, Behura SK, Bera AN,

991 Berenbaum MR, Bertossa RC, Bitondi MMG, Bordenstein SR, Bork P, Bornberg-

992 Bauer E, Brunain M, Cazzamali G, Chaboub L, Chacko J, Chavez D, Childers CP,

993 Choi JH, Clark ME, Claudianos C, Clinton RA, Cree AG, Cristino AS, Dang PM,

994 Darby AC, De Graaf DC, Devreese B, Dinh HH, Edwards R, Elango N, Elhaik E,

995 Ermolaeva O, Evans JD, Foret S, Fowler GR, Gerlach D, Gibson JD, Gilbert DG,

996 Graur D, Gründer S, Hagen DE, Han Y, Hauser F, Hultmark D, Hunter Iv HC,

997 Hurst GDD, Jhangian SN, Jiang H, Johnson RM, Jones AK, Junier T, Kadowaki

998 T, Kamping A, Kapustin Y, Kechavarzi B, Kim J, Kim J, Kiryutin B, Koevoets T,

999 Kovar CL, Kriventseva E V., Kucharski R, Lee H, Lee SL, Lees K, Lewis LR,

1000 Loehlin DW, Logsdon JM, Lopez JA, Lozado RJ, Maglott D, Maleszka R,

1001 Mayampurath A, Mazur DJ, McClure MA, Moore AD, Morgan MB, Muller J,

1002 Munoz-Torres MC, Muzny DM, Nazareth L V., Neupert S, Nguyen NB, Nunes bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 45

1003 FMF, Oakeshott JG, Okwuonu GO, Pannebakker BA, Pejaver VR, Peng Z, Pratt

1004 SC, Predel R, Pu LL, Ranson H, Raychoudhury R, Rechtsteiner A, Reese JT,

1005 Reid JG, Riddle M, Hugh M Robertson II, Romero-Severson J, Rosenberg M,

1006 Sackton TB, Sattelle DB, Schlüns H, Schmitt T, Schneider M, Schüler A, Schurko

1007 AM, Shuker DM, Simões ZLP, Sinha S, Smith Z, Solovyev V, Souvorov A,

1008 Springauf A, Stafflinger E, Stage DE, Stanke M, Tanaka Y, Telschow A, Vattathil

1009 CTS, Eveline C Verhulst II, Viljakainen L, Wanner KW, Waterhouse RM, Whitfield

1010 JB, Wilkes TE, Williamson M, Willis JH, Wolschin F, Wyder S, Yamada T, Yi S V.,

1011 Zecher CN, Zhang L, Gibbs RA. 2010. Functional and evolutionary insights from

1012 the genomes of three parasitoid nasonia species. Science (80- ) 327:343–348.

1013 78. Bonneau M, Landmann F, Labbé P, Justy F, Weill M, Sicard M. 2018. The cellular

1014 phenotype of cytoplasmic incompatibility in in the light of cidB

1015 diversity. PLOS Pathog 14:e1007364.

1016 79. Madhav M, Parry R, Morgan JAT, James P, Asgari S. 2020. Wolbachia

1017 Endosymbiont of the Horn Fly (Haematobia irritans irritans): a Supergroup A

1018 Strain with Multiple Horizontally Acquired Cytoplasmic Incompatibility Genes. Appl

1019 Environ Microbiol 86.

1020 80. Morrow JL, Schneider DI, Klasson L, Janitz C, Miller WJ, Riegler M. 2020. Parallel

1021 sequencing of Wolbachia wCer2 from donor and novel hosts reveals multiple

1022 incompatibility factors and genome stability after host transfers. Genome Biol

1023 Evol.

1024 81. Misof B, Liu S, Meusemann K, Peters RS, Donath A, Mayer C, Frandsen PB,

1025 Ware J, Flouri T, Beutel RG, Niehuis O, Petersen M, Izquierdo-Carrasco F, bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 46

1026 Wappler T, Rust J, Aberer AJ, Aspock U, Aspock H, Bartel D, Blanke A, Berger S,

1027 Bohm A, Buckley TR, Calcott B, Chen J, Friedrich F, Fukui M, Fujita M, Greve C,

1028 Grobe P, Gu S, Huang Y, Jermiin LS, Kawahara AY, Krogmann L, Kubiak M,

1029 Lanfear R, Letsch H, Li Y, Li Z, Li J, Lu H, Machida R, Mashimo Y, Kapli P,

1030 McKenna DD, Meng G, Nakagaki Y, Navarrete-Heredia JL, Ott M, Ou Y, Pass G,

1031 Podsiadlowski L, Pohl H, von Reumont BM, Schutte K, Sekiya K, Shimizu S,

1032 Slipinski A, Stamatakis A, Song W, Su X, Szucsich NU, Tan M, Tan X, Tang M,

1033 Tang J, Timelthaler G, Tomizuka S, Trautwein M, Tong X, Uchifune T, Walzl MG,

1034 Wiegmann BM, Wilbrandt J, Wipfler B, Wong TKF, Wu Q, Wu G, Xie Y, Yang S,

1035 Yang Q, Yeates DK, Yoshizawa K, Zhang Q, Zhang R, Zhang W, Zhang Y, Zhao

1036 J, Zhou C, Zhou L, Ziesmann T, Zou S, Li Y, Xu X, Zhang Y, Yang H, Wang J,

1037 Wang J, Kjer KM, Zhou X. 2014. Phylogenomics resolves the timing and pattern

1038 of insect evolution. Science (80- ) 346:763–767.

1039 82. Hornett EA, Charlat S, Duplouy AMR, Davies N, Roderick GK, Wedell N, Hurst

1040 GDD. 2006. Evolution of male-killer suppression in a natural population. PLoS

1041 Biol 4:e283.

1042 83. Hornett EA, Moran B, Reynolds LA, Charlat S, Tazzyman S, Wedell N, Jiggins

1043 CD, Hurst GDD. 2014. The Evolution of Distorter Suppression Affects a

1044 25 cM Genomic Region in the Butterfly . PLoS Genet

1045 10:e1004822.

1046 84. Bordenstein SR, Uy JJ, Werren JH. 2003. Host genotype determines cytoplasmic

1047 incompatibility type in the haplodiploid genus Nasonia. Genetics 164:223–33.

1048 85. Conner WR, Blaxter ML, Anfora G, Ometto L, Rota-Stabelli O, Turelli M. 2017. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 47

1049 Genome comparisons indicate recent transfer of wRi-like Wolbachia between

1050 sister species Drosophila suzukii and D. subpulchrella. Ecol Evol 7:9391–9404.

1051 86. Hamm CA, Begun DJ, Vo A, Smith CCR, Saelao P, Shaver AO, Jaenike J, Turelli

1052 M. 2014. Wolbachia do not live by reproductive manipulation alone: infection

1053 polymorphism in Drosophila suzukii and D. subpulchrella. Mol Ecol 23:4871–

1054 4885.

1055 87. Poinsot D, Charlat S, Merçot H. 2003. On the mechanism of Wolbachia -induced

1056 cytoplasmic incompatibility: Confronting the models with the facts. BioEssays

1057 25:259–265.

1058 88. Merçot H, Charlat S. 2004. Wolbachia Infections in and

1059 D. Simulans: Polymorphism and Levels of Cytoplasmic Incompatibility. Genetica

1060 120:51–59.

1061 89. Meany MK, Conner WR, Richter S V., Bailey JA, Turelli M, Cooper BS. 2019.

1062 Loss of cytoplasmic incompatibility and minimal fecundity effects explain relatively

1063 low Wolbachia frequencies in Drosophila mauritiana. Evolution (N Y) 73:1278–

1064 1295.

1065 90. Darby AC, Cho N-H, Fuxelius H-H, Westberg J, Andersson SGE. 2007.

1066 Intracellular pathogens go extreme: genome evolution in the Rickettsiales. Trends

1067 Genet 23:511–520.

1068 91. Castelli M, Sabaneyeva E, Lanzoni O, Lebedeva N, Floriano AM, Gaiarsa S,

1069 Benken K, Modeo L, Bandi C, Potekhin A, Sassera D, Petroni G. 2019.

1070 Deianiraea, an extracellular bacterium associated with the ciliate Paramecium,

1071 suggests an alternative scenario for the evolution of Rickettsiales. ISME J. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 48

1072 92. Tashyreva D, Prokopchuk G, Votýpka J, Yabuki A, Horák A, Lukeš J. 2018. Life

1073 Cycle, Ultrastructure, and Phylogeny of New Diplonemids and Their

1074 Endosymbiotic Bacteria. MBio 9.

1075 93. Kamm K, Osigus HJ, Stadler PF, DeSalle R, Schierwater B. 2019. Genome

1076 analyses of a placozoan rickettsial endosymbiont show a combination of

1077 mutualistic and parasitic traits. Sci Rep 9:17561.

1078 94. Gruber-Vodicka HR, Leisch N, Kleiner M, Hinzke T, Liebeke M, McFall-Ngai M,

1079 Hadfield MG, Dubilier N. 2019. Two intracellular and cell type-specific bacterial

1080 symbionts in the placozoan Trichoplax H2. Nat Microbiol. Nature Publishing

1081 Group.

1082 95. Martijn J, Schulz F, Zaremba-Niedzwiedzka K, Viklund J, Stepanauskas R,

1083 Andersson SGE, Horn M, Guy L, Ettema TJG. 2015. Single-cell genomics of a

1084 rare environmental alphaproteobacterium provides unique insights into

1085 Rickettsiaceae evolution. ISME J 9:2373–85.

1086 96. Schulz F, Martijn J, Wascher F, Lagkouvardos I, Kostanjšek R, Ettema TJG, Horn

1087 M. 2016. A Rickettsiales symbiont of amoebae with ancient features. Environ

1088 Microbiol 18:2326–2342.

1089 97. Yurchenko T, Ševčíková T, Přibyl P, El Karkouri K, Klimeš V, Amaral R,

1090 Zbránková V, Kim E, Raoult D, Santos LMA, Eliáš M. 2018. A gene transfer event

1091 suggests a long-term partnership between eustigmatophyte algae and a novel

1092 lineage of endosymbiotic bacteria. ISME J 12:2163–2175.

1093 98. Schrallhammer M, Ferrantini F, Vannini C, Galati S, Schweikert M, Görtz H-D,

1094 Verni F, Petroni G. 2013. “Candidatus Megaira polyxenophila” gen. nov., sp. nov.: bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 49

1095 considerations on evolutionary history, host range and shift of early divergent

1096 rickettsiae. PLoS One 8:e72581.

1097 99. Montagna M, Sassera D, Epis S, Bazzocchi C, Vannini C, Lo N, Sacchi L,

1098 Fukatsu T, Petroni G, Bandi C. 2013. “Candidatus Midichloriaceae” fam. Nov.

1099 (Rickettsiales), an ecologically: Widespread clade of intracellular

1100 alphaproteobacteria. Appl Environ Microbiol 79:3241–3248.

1101 100. Boscaro V, Husnik F, Vannini C, Keeling PJ. 2019. Symbionts of the ciliate

1102 Euplotes : diversity, patterns and potential as models for bacteria–eukaryote

1103 endosymbioses . Proc R Soc B Biol Sci 286:20190693.

1104 101. Muñoz-Gómez SA, Hess S, Burger G, Franz Lang B, Susko E, Slamovits CH,

1105 Roger AJ. 2019. An updated phylogeny of the alphaproteobacteria reveals that

1106 the parasitic rickettsiales and holosporales have independent origins. Elife 8.

1107 102. Mediannikov O, Nguyen T-T, Bell-Sakyi L, Padmanabhan R, Fournier P-E, Raoult

1108 D. 2014. High quality draft genome sequence and description of Occidentia

1109 massiliensis gen. nov., sp. nov., a new member of the family Rickettsiaceae.

1110 Stand Genomic Sci 9:9.

1111 103. Szokoli F, Castelli M, Sabaneyeva E, Schrallhammer M, Krenek S, Doak TG,

1112 Berendonk TU, Petroni G. 2016. Disentangling the of Rickettsiales and

1113 Description of Two Novel Symbionts ("Candidatus Bealeia

1114 paramacronuclearis" and "Candidatus Fokinia cryptica") Sharing

1115 the Cytoplasm of the Ciliate Protist Paramecium biaurelia. Appl Environ Microbiol

1116 82:7236–7247.

1117 104. Prokopchuk G, Tashyreva D, Yabuki A, Horák A, Masařová P, Lukeš J. 2019. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 50

1118 Morphological, Ultrastructural, Motility and Evolutionary Characterization of Two

1119 New Hemistasiidae Species. Protist 170:259–282.

1120 105. Shapiro MM, Chakravartty V, Cronan JE. 2012. Remarkable Diversity in the

1121 Enzymes Catalyzing the Last Step in Synthesis of the Pimelate Moiety of Biotin.

1122 PLoS One 7.

1123 106. Bi H, Zhu L, Jia J, Cronan JE. 2016. A Biotin Biosynthesis Gene Restricted to

1124 Helicobacter. Sci Rep 6.

1125 107. Feng Y, Napier BA, Manandhar M, Henke SK, Weiss DS, Cronan JE. 2014. A

1126 Francisella virulence factor catalyses an essential reaction of biotin synthesis. Mol

1127 Microbiol 91:300–314.

1128 108. Zeng Q, Yang Q, Jia J, Bi H. 2020. A Moraxella Virulence Factor Catalyzes an

1129 Essential Esterase Reaction of Biotin Biosynthesis. Front Microbiol 11:148.

1130 109. Chow J, Danso D, Ferrer M, Streit WR. 2018. The Thaumarchaeon N. gargensis

1131 carries functional bioABD genes and has a promiscuous E. coli ΔbioH-

1132 complementing esterase EstN1. Sci Rep 8.

1133 110. Shi J, Cao X, Chen Y, Cronan JE, Guo Z. 2016. An Atypical α/β-Hydrolase Fold

1134 Revealed in the Crystal Structure of Pimeloyl-Acyl Carrier Protein Methyl Esterase

1135 BioG from Haemophilus influenzae. Biochemistry 55:6705–6717.

1136 111. Estrada P, Manandhar M, Dong SH, Deveryshetty J, Agarwal V, Cronan JE, Nair

1137 SK. 2017. The pimeloyl-CoA synthetase BioW defines a new fold for adenylate-

1138 forming enzymes. Nat Chem Biol 13:668–674.

1139 112. Sullivan JT, Brown SD, Yocum RR, Ronson CW. 2001. The bio operon on the

1140 acquired symbiosis island of Mesorhizobium sp. strain R7A includes a novel gene bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 51

1141 involved in pimeloyl-CoA synthesis. Microbiology 147:1315–1322.

1142 113. Hang X, Zeng Q, Zeng L, Jia J, Bi H. 2019. Functional Replacement of the BioC

1143 and BioH Proteins of Escherichia coli Biotin Precursor Biosynthesis by Ehrlichia

1144 chaffeensis Novel Proteins. Curr Microbiol 76:626–636.

1145 114. Audia JP. 2012. Rickettsial Physiology And Metabolism In The Face Of Reductive

1146 Evolution, p. 221–242. In Palmer, GH, Azad, AF (eds.), Intracellular Pathogens II:

1147 Rickettsiales. American Society for Microbiology (ASM).

1148 115. Hebbeln P, Rodionov DA, Alfandega A, Eitinger T. 2007. Biotin uptake in

1149 prokaryotes by solute transporters with an optional ATP-binding cassette-

1150 containing module. Proc Natl Acad Sci 104:2909–2914.

1151 116. Fisher DJ, Fernández RE, Adams NE, Maurelli AT. 2012. Uptake of Biotin by

1152 Chlamydia Spp. through the Use of a Bacterial Transporter (BioY) and a Host-Cell

1153 Transporter (SMVT). PLoS One 7:e46052.

1154 117. Nakayama K, Yamashita A, Kurokawa K, Morimoto T, Ogawa M, Fukuhara M,

1155 Urakami H, Ohnishi M, Uchiyama I, Ogura Y, Ooka T, Oshima K, Tamura A,

1156 Hattori M, Hayashi T. 2008. The Whole-genome sequencing of the obligate

1157 intracellular bacterium Orientia tsutsugamushi revealed massive gene

1158 amplification during reductive genome evolution. DNA Res 15:185–99.

1159 118. Barker DF, Campbell AM. 1981. Genetic and biochemical characterization of the

1160 birA gene and its product: evidence for a direct role of biotin holoenzyme

1161 synthetase in repression of the biotin operon in Escherichia coli. J Mol Biol

1162 146:469–92.

1163 119. Satiaputra J, Shearwin KE, Booker GW, Polyak SW. 2016. Mechanisms of biotin- bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 52

1164 regulated gene expression in microbes. Synth Syst Biotechnol. KeAi

1165 Communications Co.

1166 120. Kamm K, Osigus HJ, Stadler PF, DeSalle R, Schierwater B. 2018. Trichoplax

1167 genomes reveal profound admixture and suggest stable wild populations without

1168 bisexual reproduction. Sci Rep 8:11168.

1169 121. Bublitz DAC, Chadwick GL, Magyar JS, Sandoz KM, Brooks DM, Mesnage S,

1170 Ladinsky MS, Garber AI, Bjorkman PJ, Orphan VJ, McCutcheon JP. 2019.

1171 Peptidoglycan Production by an Insect-Bacterial Mosaic. Cell 179:703-712.e7.

1172 122. McCutcheon JP. 2010. The bacterial essence of tiny symbiont genomes. Curr

1173 Opin Microbiol.

1174 123. Wilson ACC, Duncan RP. 2015. Signatures of host/symbiont genome coevolution

1175 in insect nutritional endosymbioses. Proc Natl Acad Sci U S A 112:10255–10261.

1176 124. Husnik F, Nikoh N, Koga R, Ross L, Duncan RP, Fujie M, Tanaka M, Satoh N,

1177 Bachtrog D, Wilson ACC, von Dohlen CD, Fukatsu T, McCutcheon JP. 2013.

1178 from diverse bacteria to an insect genome enables a

1179 tripartite nested mealybug symbiosis. Cell 153:1567–78.

1180 125. Nikoh N, McCutcheon JP, Kudo T, Miyagishima SY, Moran NA, Nakabachi A.

1181 2010. Bacterial genes in the aphid genome: Absence of functional gene transfer

1182 from Buchnera to its host. PLoS Genet 6:e1000827.

1183 126. McCutcheon JP. 2016. From microbiology to cell biology: When an intracellular

1184 bacterium becomes part of its host cell. Curr Opin Cell Biol. Elsevier Ltd.

1185 127. McCutcheon JP, Boyd BM, Dale C. 2019. The Life of an Insect Endosymbiont

1186 from the Cradle to the Grave. Curr Biol. Cell Press. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 53

1187 128. Yang Q, Kučerová Z, Perlman SJ, Opit GP, Mockford EL, Behar A, Robinson WE,

1188 Stejskal V, Li Z, Shao R. 2015. Morphological and molecular characterization of a

1189 sexually reproducing colony of the booklouse Liposcelis bostrychophila

1190 (Psocodea: Liposcelididae) found in Arizona. Sci Rep 5:10429.

1191 129. Mockford LE, Krushelnycky DP. 2008. New species and records of Liposcelis

1192 Motschulsky (Psocoptera: Liposcelididae) from Hawaii with first description of the

1193 male of Liposcelis bostrychophila Badonnel. Zootaxa 1766:53–68.

1194 130. Roberts LW, Rapmund G, Cadigan FC. 1977. Sex ratios in rickettsia

1195 tsutsugamushi-infected and noninfected colonies of Leptotrombidium (Acari:

1196 trombiculidae). J Med Entomol 14:89–92.

1197 131. Takahashi M, Urakami H, Yoshida Y, Furuya Y, Misumi H, Hori E, Kawamura A,

1198 Tanaka H. 1997. Occurrence of high ratio of males after introduction of

1199 minocycline in a colony of Leptotrombidium fletcheri infected with Orientia

1200 tsutsugamushi. Eur J Epidemiol 13:79–86.

1201 132. Perotti MA, Clarke HK, Turner BD, Braig HR. 2006. Rickettsia as obligate and

1202 mycetomic bacteria. FASEB J 20:2372–4.

1203 133. Gualtieri L, Nugnes F, Nappo AG, Gebiola M, Bernardo U. 2017. Life inside a gall:

1204 closeness does not favour horizontal transmission of Rickettsia between a gall

1205 and its parasitoid. FEMS Microbiol Ecol 93.

1206 134. Giorgini M, Bernardo U, Monti MM, Nappo AG, Gebiola M. 2010. Rickettsia

1207 Symbionts Cause Parthenogenetic Reproduction in the Pnigalio

1208 soemius (Hymenoptera: Eulophidae). Appl Environ Microbiol 76:2589–2599.

1209 135. Hagimori T, Abe Y, Date S, Miura K. 2006. The first finding of a Rickettsia bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 54

1210 bacterium associated with parthenogenesis induction among insects. Curr

1211 Microbiol 52:97–101.

1212 136. Nugnes F, Gebiola M, Monti MM, Gualtieri L, Giorgini M, Wang J, Bernardo U.

1213 2015. Genetic Diversity of the Invasive Gall Wasp Leptocybe invasa

1214 (Hymenoptera: Eulophidae) and of its Rickettsia Endosymbiont, and Associated

1215 Sex-Ratio Differences. PLoS One 10:e0124660.

1216 137. Werren JH, Hurst GD, Zhang W, Breeuwer JA, Stouthamer R, Majerus ME. 1994.

1217 Rickettsial relative associated with male killing in the ladybird beetle (Adalia

1218 bipunctata). J Bacteriol 176:388–94.

1219 138. von der Schulenburg JH, Habig M, Sloggett JJ, Webberley KM, Bertrand D, Hurst

1220 GD, Majerus ME. 2001. Incidence of male-killing Rickettsia spp. (alpha-

1221 ) in the ten-spot ladybird beetle Adalia decempunctata L.

1222 (Coleoptera: Coccinellidae). Appl Environ Microbiol 67:270–7.

1223 139. Lawson ET, Mousseau TA, Klaper R, Hunter MD, Werren JH. 2001. Rickettsia

1224 associated with male-killing in a buprestid beetle. Heredity (Edinb) 86:497–505.

1225 140. Majerus TM, Majerus ME. 2010. Discovery and identification of a male-killing

1226 agent in the Japanese ladybird Propylea japonica (Coleoptera: Coccinellidae).

1227 BMC Evol Biol 10:37.

1228 141. Murray GGR, Weinert LA, Rhule EL, Welch JJ. 2016. The Phylogeny of Rickettsia

1229 Using Different Evolutionary Signatures: How Tree-Like is Bacterial Evolution?

1230 Syst Biol 65:265–79.

1231 142. Harumoto T, Lemaitre B. 2018. Male-killing toxin in a bacterial symbiont of

1232 Drosophila. Nature 557:252–255. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 55

1233 143. Makarova KS, Aravind L, Koonin E V. 2000. A novel superfamily of predicted

1234 cysteine proteases from eukaryotes, viruses and Chlamydia pneumoniae. Trends

1235 Biochem Sci 25:50–2.

1236 144. Badawi M, Moumen B, Giraud I, Grève P, Cordaux R. 2018. Investigating the

1237 Molecular Genetic Basis of Cytoplasmic Sex Determination Caused by Wolbachia

1238 Endosymbionts in Terrestrial Isopods. Genes (Basel) 9:290.

1239 145. Gebiola M, Giorgini M, Kelly SE, Doremus MR, Ferree PM, Hunter MS. 2017.

1240 Cytological analysis of cytoplasmic incompatibility induced byCardiniumsuggests

1241 convergent evolution with its distant cousinWolbachia. Proceedings Biol Sci

1242 284:20171433.

1243 146. Siozios S, Pilgrim J, Darby AC, Baylis M, Hurst GDD. 2019. The draft genome of

1244 strain cCpun from biting midges confirms insect Cardinium are not a monophyletic

1245 group and reveals a novel gene family expansion in a symbiont. PeerJ 2019.

1246 147. Duron O, Bouchon D, Boutin S, Bellamy L, Zhou L, Engelstädter J, Hurst GD.

1247 2008. The diversity of reproductive parasites among arthropods: Wolbachia do

1248 not walk alone. BMC Biol 6.

1249 148. Ishmael N, Hotopp JCD, Loanidis P, Biber S, Sakamoto J, Siozios S, Nene V,

1250 Werren J, Boutriz K, Bordenstein SR, Tettelin H. 2009. Extensive genomic

1251 diversity of closely related wolbachia strains. Microbiology 155:2211–2222.

1252 149. Baldridge GD, Markowski TW, Witthuhn BA, Higgins L, Baldridge AS, Fallon AM.

1253 2016. The Wolbachia WO proteome in the

1254 C/wStr1 cell line: evidence for lytic activity? Vitr Cell Dev Biol - Anim 52:77–88.

1255 150. Batty EM, Chaemchuen S, Blacksell S, Richards AL, Paris D, Bowden R, Chan C, bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 56

1256 Lachumanan R, Day N, Donnelly P, Chen S, Salje J. 2018. Long-read whole

1257 genome sequencing and comparative analysis of six strains of the human

1258 pathogen Orientia tsutsugamushi. PLoS Negl Trop Dis 12:e0006566.

1259 151. Cho N-H, Kim H-R, Lee J-H, Kim S-Y, Kim J, Cha S, Kim S-Y, Darby AC, Fuxelius

1260 H-H, Yin J, Kim JH, Kim J, Lee SJ, Koh Y-S, Jang W-J, Park K-H, Andersson

1261 SGE, Choi M-S, Kim I-S. 2007. The Orientia tsutsugamushi genome reveals

1262 massive proliferation of conjugative type IV secretion system and host cell

1263 interaction genes. Proc Natl Acad Sci 104:7981–7986.

1264 152. Nakayama K, Kurokawa K, Fukuhara M, Urakami H, Yamamoto S, Yamazaki K,

1265 Ogura Y, Ooka T, Hayashi T. 2010. Genome comparison and phylogenetic

1266 analysis of Orientia tsutsugamushi strains. DNA Res 17:281–91.

1267 153. Hagen R, Verhoeve VI, Gillespie JJ, Driscoll TP. 2018. Conjugative transposons

1268 and their cargo genes vary across natural populations of Rickettsia buchneri

1269 infecting the tick Ixodes scapularis. Genome Biol Evol 10:3218–3229.

1270 154. Reveillaud J, Bordenstein SR, Cruaud C, Shaiber A, Esen ÖC, Weill M,

1271 Makoundou P, Lolans K, Watson AR, Rakotoarivony I, Bordenstein SR, Eren AM.

1272 2019. The Wolbachia mobilome in Culex pipiens includes a putative plasmid. Nat

1273 Commun 10.

1274 155. Rust M. 2017. The Biology and Ecology of Cat Fleas and Advancements in Their

1275 Pest Management: A Review. Insects 8:118.

1276 156. Lawrence AL, Hii S-F, Chong R, Webb CE, Traub R, Brown G, Šlapeta J. 2015.

1277 Evaluation of the bacterial microbiome of two flea species using different DNA-

1278 isolation techniques provides insights into flea host ecology. FEMS Microbiol Ecol bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 57

1279 91:fiv134.

1280 157. Glickman LT, Moore GE, Glickman NW, Caldanaro RJ, Aucoin D, Lewis HB.

1281 2006. Purdue University-Banfield National Companion Animal Surveillance

1282 Program for emerging and zoonotic diseases. Vector Borne Zoonotic Dis 6:14–23.

1283 158. Nogueras MM, Pons I, Ortuño A, Miret J, Pla J, Castellà J, Segura F. 2013.

1284 Molecular detection of Rickettsia typhi in cats and fleas. PLoS One 8:e71386.

1285 159. Angelakis E, Mediannikov O, Parola P, Raoult D. 2016. Rickettsia felis: The

1286 Complex Journey of an Emergent Human Pathogen. Trends Parasitol 32:554–

1287 564.

1288 160. Leulmi H, Socolovschi C, Laudisoit A, Houemenou G, Davoust B, Bitam I, Raoult

1289 D, Parola P. 2014. Detection of Rickettsia felis, Rickettsia typhi, Bartonella

1290 Species and Yersinia pestis in Fleas (Siphonaptera) from Africa. PLoS Negl Trop

1291 Dis 8.

1292 161. Fourie JJ, Crafford D, Horak IG, Stanneck D. 2012. Prophylactic treatment of flea-

1293 infested cats with an imidacloprid/flumethrin collar to forestall infection with

1294 Dipylidium caninum. Parasit Vectors 5:151.

1295 162. Brianti E, Gaglio G, Napoli E, Giannetto S, Dantas-Torres F, Bain O, Otranto D.

1296 2012. New insights into the ecology and biology of Acanthocheilonema

1297 reconditum (Grassi, 1889) causing canine subcutaneous filariosis. Parasitology

1298 139:530–6.

1299 163. Traversa D. 2013. Fleas infesting pets in the era of emerging extra-intestinal

1300 nematodes. Parasit Vectors 6:59.

1301 164. Beard CB, Butler JF, Hall DW. 1990. Prevalence and biology of endosymbionts of bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 58

1302 fleas (Siphonaptera: Pulicidae) from dogs and cats in Alachua County, Florida. J

1303 Med Entomol 27:1050–61.

1304 165. Tay ST. 2013. Wolbachia endosymbionts, Rickettsia felis and Bartonella species,

1305 in Ctenocephalides felis fleas in a tropical region. J Vector Ecol 38:200–202.

1306 166. Oteo JA, Portillo A, Portero F, Zavala-Castro J, Venzal JM, Labruna MB. 2014.

1307 `Candidatus Rickettsia asemboensis’ and Wolbachia spp. in Ctenocephalides felis

1308 and Pulex irritans fleas removed from dogs in Ecuador. Parasit Vectors 7:455.

1309 167. Lomsadze A, Gemayel K, Tang S, Borodovsky M. 2018. Modeling leaderless

1310 transcription and atypical genes results in more accurate gene prediction in

1311 prokaryotes. Genome Res 28:1079–1089.

1312 168. Li L, Stoeckert CJ, Roos DS. 2003. OrthoMCL: identification of ortholog groups for

1313 eukaryotic genomes. Genome Res 13:2178–89.

1314 169. Edgar RC. 2004. MUSCLE: Multiple sequence alignment with high accuracy and

1315 high throughput. Nucleic Acids Res 32:1792–1797.

1316 170. Capella-Gutiérrez S, Silla-Martínez JM, Gabaldón T. 2009. trimAl: a tool for

1317 automated alignment trimming in large-scale phylogenetic analyses.

1318 Bioinformatics 25:1972–3.

1319 171. Stamatakis A. 2014. RAxML version 8: A tool for phylogenetic analysis and post-

1320 analysis of large phylogenies. Bioinformatics 30:1312–1313.

1321 172. Talavera G, Castresana J. 2007. Improvement of phylogenies after removing

1322 divergent and ambiguously aligned blocks from protein sequence alignments.

1323 Syst Biol 56:564–77.

1324 173. Gómez-Valero L, Soriano-Navarro M, Pérez-Brocal V, Heddi A, Moya A, García- bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 59

1325 Verdugo JM, Latorre A. 2004. Coexistence of Wolbachia with Buchnera

1326 aphidicola and a secondary symbiont in the aphid Cinara cedri. J Bacteriol

1327 186:6626–33.

1328 174. Letunic I, Bork P. 2017. 20 years of the SMART protein domain annotation

1329 resource. Nucleic Acids Res.

1330 175. Kelley LA, Sternberg MJE. 2009. Protein structure prediction on the Web: a case

1331 study using the Phyre server. Nat Protoc 4:363–371.

1332 176. Nakae S, Hijikata A, Tsuji T, Yonezawa K, Kouyama K-I, Mayanagi K, Ishino S,

1333 Ishino Y, Shirai T. 2016. Structure of the EndoMS-DNA Complex as Mismatch

1334 Restriction Endonuclease. Structure 24:1960–1971.

1335 177. Ren B, Kühn J, Meslet-Cladiere L, Briffotaux J, Norais C, Lavigne R, Flament D,

1336 Ladenstein R, Myllykallio H. 2009. Structure and function of a novel endonuclease

1337 acting on branched DNA substrates. EMBO J 28:2479–89.

1338 178. Kinch LN, Ginalski K, Rychlewski L, Grishin N V. 2005. Identification of novel

1339 restriction endonuclease-like fold families among hypothetical proteins. Nucleic

1340 Acids Res 33:3598–3605.

1341 179. Pruneda JN, Durkin CH, Geurink PP, Ovaa H, Santhanam B, Holden DW,

1342 Komander D. 2016. The Molecular Basis for Ubiquitin and Ubiquitin-like

1343 Specificities in Bacterial Effector Proteases. Mol Cell 63:261–76.

1344 180. Zhang D, de Souza RF, Anantharaman V, Iyer LM, Aravind L. 2012. Polymorphic

1345 toxin systems: Comprehensive characterization of trafficking modes, processing,

1346 mechanisms of action, immunity and ecology using comparative genomics. Biol

1347 Direct 7:18. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 60

1348 181. Liu W, Xie Y, Ma J, Luo X, Nie P, Zuo Z, Lahrmann U, Zhao Q, Zheng Y, Zhao Y,

1349 Xue Y, Ren J. 2015. IBS: an illustrator for the presentation and visualization of

1350 biological sequences. Bioinformatics 31:3359–61.

1351 182. Reif KE, Stout RW, Henry GC, Foil LD, Macaluso KR. 2008. Prevalence and

1352 infection load dynamics of Rickettsia felis in actively feeding cat fleas. PLoS One

1353 3:e2805.

1354 183. Kent BN, Funkhouser LJ, Setia S, Bordenstein SR. 2011. Evolutionary genomics

1355 of a temperate bacteriophage in an obligate intracellular bacteria (Wolbachia).

1356 PLoS One 6.

1357 184. Polgár L. 2005. The catalytic triad of serine peptidases. Cell Mol Life Sci.

1358 185. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K,

1359 Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL,

1360 Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich

1361 C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST

1362 Server: Rapid Annotations using Subsystems Technology. BMC Genomics 9:75.

1363 186. Perlman SJ, Hunter MS, Zchori-Fein E. 2006. The emerging diversity of

1364 Rickettsia. Proc Biol Sci 273:2097–106.

1365

1366

1367 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 61

1368 Table 1. Summary statistics of selected Wolbachia genomes.

Protein Length GC Pseudo- Mean gene Coding density Taxon RefSeq ID SG coding (nt) (%) genes length (nt) (coding/total nt) genes

wFol NZ_CP015510.2 E 1801626 34.4 1537 85 976 0.880

wCfeT NZ_CP051156.1 ? 1495538 35.2 1424 201 890 0.727 wOv NZ_HG810405.1 C 960618 32.1 650 45 967 0.700

wOo NC_018267.1 C 957990 32.1 639 73 968 0.719 wCfeJ NZ_CP051157.1 ? 1201647 35.6 1116 51 917 0.813

wCle NZ_AP013028.1 F 1250060 36.3 1012 210 818 0.829 wBm NZ_CP034333.1 D 1080064 34.2 803 178 868 0.789

wBm NC_006833.1 D 1080084 34.2 839 167 854 0.795 wMau NZ_CP034334.1 B 1273527 34.0 1045 136 944 0.876

wMau NZ_CP034335.1 B 1273530 34.0 1047 131 946 0.876 wNo NC_021084.1 B 1301823 34.0 1066 133 952 0.878 wAlbB NZ_CP041923.1 B 1482279 34.5 1190 195 917 0.858 FL2016 wAlbB NZ_CP041924.1 B 1483853 34.4 1200 187 917 0.858 HN2016 wAlbB NZ_CP031221.1 B 1484007 34.4 1197 191 916 0.858 wMeg NZ_CP021120.1 B 1376868 34.0 1136 125 948 0.869

wHa NC_021089.1 A 1295804 35.1 1123 104 899 0.852 wCauA NZ_CP041215.1 A 1449344 35.0 1260 134 861 0.860

wAna NZ_CP042904.1 A 1401460 35.2 1218 107 893 0.857 wRi NC_012416.1 A 1445873 35.2 1245 122 892 0.859

wAu NZ_LK055284.1 A 1268461 35.2 1099 125 878 0.855 wMel NC_002978.6 A 1267782 35.2 1116 129 863 0.857

wMel I23 NZ_CP042444.1 A 1269137 35.2 1119 128 862 0.857 wMel N25 NZ_CP042446.1 A 1267781 35.2 1120 125 863 0.857

wMel ZH26 NZ_CP042445.1 A 1267436 35.2 1117 125 865 0.857 wIrr NZ_CP037426.1 A 1352354 35.3 1232 115 827 0.855 1369

1370

1371 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 62

1372 Figure Legends

1373

1374 Figure 1. Wolbachia genome-based phylogeny estimation.

1375 Wolbachia supergroups are within colored ellipses. Ctenocephalides felis-associated

1376 Wolbachiae are within black boxes, with red (wCfeT) and blue (wCfeJ) stars depicting

1377 the two novel Wolbachia parasites infecting C. felis. Gray inset: wCfeT and wCfeJ

1378 genome sequences and assembly statistics. White inset: color scheme for nematode

1379 and arthropod hosts. Phylogeny was estimated for 66 Wolbachia and five outgroup

1380 genomes (see Materials and Methods for more details). Branch support was assessed

1381 with 1,000 pseudoreplications. Final ML optimization likelihood was -52350.085098.

1382 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Neorickettsia helminthoeca str. Oregon Anaplasma centrale str. Israel branch lengths Anaplasma marginale str. Florida other Anaplasmataceae reduced by 200% Ehrlichia chaffeensis str. Arkansas Ehrlichia ruminantium str. Gardel wPpe (Pratylenchus penetrans) wPni (Pentalonia nigronervosa) wFol (Folsomia candida) wCfeT (Ctenocephalidis felis) B wOv () wOo (Onchocerca ochengi) C wDim () wCfeJ (Ctenocephalidis felis) bootstrap wCle () B .91-1 wMen (Mengenilla moldrzyki) F .81-.90 wOc (Osmia caerulescens) .71-.80 wLs (Litomosoides sigmodontis) Fig. 1 .61-.70 wBm () .51-.60 D < .50 wWb (Wucheria bancrofti) wVulC-f (Armadillidium vulgare) B wCon (Cylisticus convexus) wStr (Laodelphax striatella) B wLug (Nilaparvata lugens) B wDi (Diaphorina citri) 0.1 sub./site wBta (Bemisia tabaci) (Wolbachiae only) wMau (Drosophila mauritiana) wNo () wDacB (Dactylopius coccus) wCoc1 (Dactylopius coccus) wVitB () wLcl (Leptopilina clavipes) wAlbB (Aedes albopictus) Biotin synthesis Operon wAlbB-2 (Aedes albopictus) B of Obligate intracellular wCte (Ctenocephalidis felis) wTpre ( pretiosum) Microbes (BOOM) wPip_Pel (Culex quinquefasciatus) wPip_Mol (Culex molestus) wPip_JHB (Culex quinquefasciatus) lineages outside of wOb_Wba (Operophtera brumata) Supergroups A and B wAus (Plutella australiana) wBol1-b (Hypolimnas bolina) lacking complete wMeg (Chrysomya megacephala) B Cid or Cin operons wZoo (Zootermopsis nevadensis) wDacA (Dactylopius coccus) wFex (Formica exsecta) wNfe (Nomada ferruginata) wNfla (Nomada flava) B wNleu (Nomada leucophthalma) B Arthropoda wNpa (Nomada panzeri) wVitA (Nasonia vitripennis) Crustacea wUni_MUJL () wUni_ACFP (Muscidifurax uniraptor) Collembola wGmm ( morsitans) Hemiptera wNon (Nasonia oneida) wHa (Drosophila simulans) Blattodea wCauA ( sasakii) Hymenoptera wSuzi (Drosophila suzukii) Hexapoda wSpc (Drosophila subpulchrella) Strepsiptera wAna () wSim (Drosophila simulans) wRi (Drosophila simulans) Siphonaptera wAu (Drosophila simulans) wWil (Drosophila willistoni) Diptera wRec (Drosophila recens) wInc_Cu (Drosophila incompta) Nematoda wInc_SM (Drosophila incompta) wMel (Drosophila melanogaster) Tylenchoidea (plant pathogen) wMelPop (Drosophila melanogaster) Filaroidea (Spirurida) wDya (Drosophila yakuba) wDsa (Drosophila santomea) wDte (Drosophila teissieri) A bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 64

1384 FIG 2. wCfeT carries the Biotin synthesis Operon of Obligate intracellular

1385 Microbes (BOOM).

1386 (A) Metabolism of biotin from malonyl-CoA (55).

1387 (B) Comparisons of wCfeT BOOM to equivalent loci in Cardinium endosymbiont of

1388 Encarsia pergandiella (cEper1, CCM10336-CCM10341) and Wolbachia endosymbiont

1389 of Cimex lectularius (wCle, BAP00143-BAP00148). Red shading and numbers indicate

1390 % identity across pairwise protein alignments (Blastp). Gene colors correspond to

1391 enzymatic steps depicted in panel A, with all proteins drawn to scale (as a reference,

1392 wCfeT BioB is 316 aa).

1393 (C) Phylogeny of BOOM and other bio gene sets from diverse bacteria estimated from

1394 the concatenation of six bio enzymes (BioC, BioH, BioF, BioA, BioD, and BioB); see

1395 Table S2 for all sequence information, and Figure S1A for complete tree estimated

1396 under Maximum Likelihood. In taxon color scheme, [R] denotes Rickettsiales, with

1397 Holosporaceae as a revised family of Rhodospirillales (101).

1398 (D) Analysis of genes flanking wCfeT BOOM. See Table S3 for comparison with other

1399 BOOM-containing wolbachiae. Graph depicts top 50 Blastp subjects (e-value < 1,

1400 BLOSUM45 matrix) using the wCfeT proteins as queries against the NCBI nr database

1401 (January 9, 2020). Colors distinguish three taxonomic groups (see inset). Black

1402 columns: no significant similarity (nss) in the NCBI nr database. Tnp, predicted

1403 transposase. Brown, pseudogenization. See Figure S1B for EamA phylogeny

1404 estimation. Pie charts across the bottom delineate taxonomic diversity of hits to “Other

1405 taxa,” for wCfeT proteins with significant Blastp matches outside of the Rickettsiales.

1406 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NCBioB 4.0 InternationalBioF licenseBioH. BioC BioD BioA A Malonyl-CoA B cEper1 .83 .74 .60 .60 .67 .71 BioC SAM SAH wCfeT .86 .85 .69 .71 .80 .83 Malonyl-CoA methyl ester wCle

fatty acid Fig. 2 synthesis (2 cycles) C “Candidatus Paracaedibacter_symbiosus” Pimeloyl-CoA methyl ester “Candidatus Odyssella thessalonicensis” “Candidatus Deianiraea vastatrix” “Candidatus mitochondrii” str. IricVA BioH H 2 O CH3OH Occidentia massiliensis Neorickettsia spp. (5) Pimeloyl-CoA Rickettsia buchneri (2) Lawsonia intracellularis (2) “Candidatus Fokinia solitaria” BioF L-Ala ACP + CO2 Rickettsiales endosymbiont (Peranema trichophorum) Holospora spp. (4) KAPA Legionella endosymbiont (Polyplax serrata) Cadinium endosymbionts (3) AMTOB SAH wCfeT BioA bootstrap wLug .91-1 wStr DAPA .81-.90 wNfla .71-.80 wNleu .61-.70 wVulC CO2 + ATP AMP + PPi BioD .51-.60 sp. KTCN < .50 sp. SYDL Cimex- Dethiobiotin sp. TIH associated 0.1 sub./site wCle BioB SAM 5’-DOA sp. TUA “Cand. Deianiaeaceae” [R] Midichloriaceae [R] Holosporaceae Legionellales Biotin Anaplasmataceae [R] Rickettsiaceae [R] Deltaproteobacteria Bacteroidetes

D EamA BOOM nsstnp tnp nsstnp tnp tnp nss bf h c d a nss 1.0

0.8

0.6

Wolbachia Other Rickettsiales Normalized bitscore 0.4 Other taxa

0.2 -25 -20 -15 -10 -5 5 10 15 20

Alphaproteobacteria Gammaproteobacteria PVC Group Other Bacteria non-Bacteria

Holosporaceae Chromotiales Chlamydiae Actinobacteria Arthropoda Rhizobiales Legionellales Others Cyanobacteria Rhodospirillales Oceanospirillales Others Others Thiotrichales Other taxa Others (non-Rickettsiales) bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 66

1408 FIG 3. wCfeJ lends insight on the evolution of reproductive parasitism.

1409 (A) Comparison of known and predicted RP-inducing toxin-antidote operons of diverse

1410 rickettsial species. Light green: CidA/B of wPip_Pel (CAQ54390, CAQ54391); grey:

1411 CndA/B of pLbAR_36/38 (KHO02168, KHO02170), Rickettsia gravesii

1412 (WP_017442886, WP_024547315), wDacB (OAM06111, OAM06112), wStr

1413 (OAB79364, OAB79365), wVulC (OJH31528, OJH31527), wCon (RDD34163,

1414 RDD34164); light brown, CinA/B of wCfeJ (blue star, WP_168456002 &

1415 WP_168456001), Orientia tsutsugamushi (CAM80637, CAM80639), wRi (ACN95432,

1416 ACN95431), wVita (ONI57866, ONI57867), and wPip_Pel (CAQ54402, CAQ54403).

1417 Type I-IV CI operons of Wolbachia parasites (26) are distinguished. Protein domains as

1418 follows: green, CE clan protease; brown, PD-(D/E)XK nuclease; orange, OTU cysteine

1419 protease. Red shading and numbers indicate % identity across pairwise alignments.

1420 Proteins are drawn to scale, with additional C-terminal sequence for some CndB toxins

1421 depicted in dark gray boxes.

1422 (B) Sequence similarity profiles for wCfeJ’s CinA antidote (top) and CinA toxin

1423 (bottom). Subjects are from a Blastp search against the NCBI ‘Wolbachia’ database.

1424 Size, amino acids. Antidotes with cognate toxins (operon partners) also recovered in

1425 searches are noted with asterisks. Top seven hits are shown, with matches (where

1426 significant) to proteins of types II-IV TA operons shown below dashed lines. Toxin

1427 domains are from our prior report or predicted newly here using the EMBL’s Simple

1428 Modular Architecture Research Tool (SMART) (174). Only NUC and DUB domains are

1429 listed, though larger toxins contain many other predicted domains. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 67

1430 (C) Comparison of wCfeJ and wDacB degraded WO prophage genomes captures

1431 streamlining of a CI-like TA operon from a Cnd operon. WO prophage coordinates:

1432 wCfeJ, WP_168455994-WP_168456003; wDacB, OAM06111-OAM06118

1433 (LSYY01000098). Schema follows the description of the WOVitA prophage (30).

1434 (D) wCfeJ’s CinB-like toxin evolved from gene fission of a larger CndB-like toxin. NCBI

1435 protein accession numbers: wCfeJ (WP_168455999, WP_168456000, and

1436 WP_168456001); wDacB (OAM06111, OAM06112). Red shading and numbers

1437 indicate % identity across pairwise alignments. Proteins are drawn to scale; as a

1438 reference, wDacB CndB is 3707 aa.

1439 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Cognate A wPip_Pel CidA CidB type I B Subject Acc. no. Size %ID toxin 22 22 36 Cid R. felis 1387 wCed VVC35096 434 77.0 VVC35097 * wDacB OAM06474 437 70.0 None 19 23 26 wStr OAB79341 430 63.1 OAB79342 * R. gravesii 900 wCauA QDH18678 451 53.0 QDH18679 * 24 26 wVulC-f OJH31528 415 49.3 OJH31527 * wDacB 1784 wCon RDD34511 424 48.0 None

82 73 CinA antitode wCon RDD34438 421 48.0 None Cnd wStr wRi ACN95432 456 NS ACN95431

57 45 38 64 CfeJ wVita ONI57866 490 NS ONI57867 wVulC 939 w wPip_Pel CAQ54402 445 34.0 CAQ54403 74 77 wCon Subject Acc. no. Size %ID Domains 54 48 wCfeJ wCed VVC35097 2500 65.0 NUC (2) 27 25 wCauA QDH18679 3644 49.0 NUC (2) O. tsutsugamushi wStr OAB79342 3941 50.0 NUC (2) wCon RDD34164 1351 48.0 DUB-NUC (2) 21 30 35 wCed VVC35051 1964 46.0 NUC (2) wRi type II

Cin wVulC-f OJH31527 2770 47.8 OTU-NUC (2) CinB toxin OTU 33 36 wCon RDD33817 875 45.0 NUC (2) DUB wVita type III

CfeJ wRi ACN95431 456 26.0 NUC (2) 29 31 NUC w wVita ONI57867 490 27.0 NUC (2) wPip_Pel CinA CinB type IV wPip_Pel CAQ54403 445 34.0 NUC (2)

C toxin gene fission Syntenic with WOVitA1 Hypothetical Protein (HP) wCfeJ Transposase CndA/B and fission products Eukaryotic Association Module wDacB Recombinase MutL HTH RadC HP HTH HTH Pat D CinA CinB HP HP HP (Latrotoxin-CTD) wCfeJ

32 55 37 65 35 38 34

wDacB CndA CndB Ankyrin-repeat region Fig. 3 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 69

1441 FIG 4. Identification of two CI-like antidote genes in the cat flea genome.

1442 (A) The largest C. felis scaffold (NW_020538040) contains a CidA-like gene, predicted

1443 using RAST (185) (left), and a CinA-like exon encoded within a larger gene model with

1444 transcriptional support (right). Red, CI-like antidote genes. Blue, C. felis genes:

1445 113377302 (RING finger and SPRY domain-containing protein 1-like); 113377491

1446 (negative elongation factor B); 113377978 (uncharacterized). The CinA-like exon

1447 (XP_026474240) contains a corresponding transcript (dark blue, XM_026618455)

1448 generated from a study (81) independent of the cat flea genome sequencing (45).

1449 (B) Sequence similarity profiles for the C. felis CidA-like (left) and CinA-like (right)

1450 antidotes. Subjects above the black dashed line are the top hits from a Blastp search

1451 against the NCBI nr protein database; subjects below are weaker matches to CinA (left)

1452 and CidA (right) proteins; matches to wCfeJ CinA are below the orange line. Size,

1453 amino acids.

1454 (C) Alignment of C. felis CidA-like and CinA-like antidotes to CidA, CndA and CinA

1455 proteins from selected Rickettsiaceae species and Wolbachia strains. Schema at top

1456 depicts the wCfeJ CinA antidote and location of six conserved regions shown in the

1457 alignment below. Amino acid coloring as follows: black, hydrophobic; red, negatively

1458 charged; green, hydrophilic; purple, aromatic; blue, positively charged. Alignment

1459 generated using MUSCLE, default parameters (169).

1460

1461 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. A XM_026618455 Predicted with RAST XP_026474240

73 Mb 113377302 113377491 113377978

B C. felis CidA-like C. felis CinA-like

Subject Acc. no. Size %ID Supergroup Subject Acc. no. Size %ID Supergroup

wCer2 NGZ19963 480 85.1 A wNleu, wNfla WP_065095361 463 76.3 B wIrr QGT16094 483 76.8 A wCon RDD34742 467 75.3 ? wPip (Buckeye) AGR50404 491 76.3 B wAna EAL58408 443 67.9 A Cid Cin wPip_Pel (type I) CAQ54390 451 75.6 B wRi (type II) ACN95432 456 67.9 B wPipI_Utique QED12394 415 75.3 B wCon RDD34842 464 62.9 ?

wRi (type II) ACN95432 456 45.6 A wRi ACN95317 474 56.4 A wVita (type III) ONI57866 490 35.9 A wMel (WD0631) AAS14330 490 55.9 A Cid

Cin wPip_Pel (type IV) CAQ54402 445 29.0 B wPip_Pel (type I) CAQ54390 451 52.0 B

wCfeJ no acc. ID 431 24.1 - Cin wCfeJ XP_026474239 431 32.1 -

1 2 3 4 5 6 C CinA 431 aa (wCfeJ)

1 2 3 4 5 6 C. felis CidA-like [ 24] W [27] KYEMACSYCVVNKI [45] VFDFW [28] AEYFYNHL [27] ILDFCVRN [63] Y [176] 427

Cid wPip_Pel [ 69] W [27] KYEMACSYCVVDKI [45] VFDFW [28] VEYFYNHL [27] ILDFCVNK [61] Y [197] 491 R. felis [132] W [29] LYHIACQFCLEDEI [26] LKIFW [49] VEFFWNKL [26] IAAFCVEQ [82] Y [255] 614 R. gravesii [104] W [25] LYKFACNYCLEEYT [43] LIKYF [33] VKYFWNRL [33] IFAFLIEE [46] Y [454] 775 wDacB [ 60] W [37] RYKIACWCCFEDDI [27] LMIYW [34] LEFFWDKL [31] MVDFCLRY [51] Y [134] 411

Cnd wStr [ 60] W [37] RYRIACWCCFEDEI [27] LMIYW [34] LEFFWNKL [31] MVDFCLRH [51] Y [135] 412 wVulC [ 59] W [30] SYDIACRYCLEEDI [33] LIVFW [34] VEFFWNKI [34] IFEFCFSQ [51] Y [141] 419 wCon [ 14] - [ 6] -YDHLVRCCGFDLL [ 4] LAQFW [34] VEFFWNKI [34] IFEFCFGQ [51] Y [146] 320 wCfeJ [ 56] W [37] RYQIACWCCLEEDI [36] MMQFW [37] VEFFWDKV [32] VFEFCLSQ [51] Y [145] 431 O. tsutsugamushi [185] W [27] KYRLACNYCLEDSI [22] MVYFW [57] IEYIWNRL [20] FRSLLLSQ [48] F [466] 863 wRi [ 65] W [27] IYDMACRYCVIDKI [47] VFSFW [29] VEYFYSLL [29] TLDFCLSK [59] Y [163] 456

Cin wVita [ 87] W [24] IYKIACIECVESEI [47] VFNFW [29] VEHFYNRL [33] TLSSLLGK [63] Y [170] 490 wPip_Pel [ 66] W [31] RYYLACKYCLEDKI [38] IEAFW [35] IEFFYNKL [30] TIEFCLSK [49] Y [159] 445 C. felis CinA-like [105] W [27] RYNMACRYCVTDKI [47] VFTFW [29] VEYFYRRM [ 2] ------[ 0] - [ 0] 239

Fig. 4 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 71

1463 FIG 5. Characterization of wCfeJ and wCfeJ in cat fleas.

1464 (A) Screening for wolbachiae in geographically diverse cat flea colonies and wild

1465 populations. Cat fleas from six locations (including two strains from Sierra Research

1466 Laboratories) were assessed for the presence of wCfeT and wCfeJ. Wolbachia

1467 infection of individual fleas was assessed by qPCR (see Materials and Methods for

1468 details).

1469 (B) Localization of wCfeT and wCfeJ in cat flea tissues. Dissected ovaries and other

1470 tissues (mostly midguts) from individual cat fleas (EL Colony) were assayed by qPCR

1471 see Materials and Methods for details). Five fleas were assayed for each tissue type.

1472

1473 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. A Colonies B 125 Elward Kansas St. Sierra Research Labs Two-way ANOVA Laboratory University (Riner) (Modesto Wild) Factor Pr (n = 20) (n = 20) (n = 20) (n = 20) 100 Target (apaG, cinA) 0.0789 Tissue (, non-ovary) 0.0389 Target:Tissue 0.0001 75 flea 18S rDNA 5 50 Wild Populations

Statesboro Monroe Orange Co. wCfeT Georgia West Virginia California 25

wCfeJ Copies/10 (n = 38) (n = 13) (n = 216) Both None 0 Ovarian Other Ovarian Other

wCfeT wCfeJ

Fig. 5 bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 73

1475 FIG 6. Recurrent evolution of nutritional-mediated mutualism and reproductive

1476 parasitism across diverse rickettsial lineages.

1477 (A) Diverse strategies for biotin biosynthesis, acquisition and utilization in Rickettsiales.

1478 Top, biotin synthesis genes (BOOM arrangement), the BioY biotin transporter, the biotin

1479 ligase BirA, and biotin utilization enzymes (biotin-dependent propionyl-CoA carboxylase

1480 complex and FabD) involved in malonyl-CoA synthesis. The distribution of these genes

1481 across major rickettsial lineages is summarized, with phylogeny of Rickettsiales drawn

1482 as a consensus from recent studies (13, 91, 101, 103). The acquisition of biotin

1483 synthesis operons is depicted by yellow, blue and orange highlighting over branches

1484 (see estimated phylogeny in Figure 1A). Taxa highlighted yellow indicate the

1485 conservation of BOOM gene arrangement (see Figure S3). Question marks indicate

1486 missing BioH enzymes in otherwise complete biotin synthesis pathways. These

1487 genomes were scanned for other functionally equivalent methyl esterases and found to

1488 contain none (Figure S4D). Inset highlights the gatekeeper role of BioH in shunting

1489 Pimeloyl-CoA methyl ester (metabolite 2) away from fatty acid biosynthesis for the

1490 production of biotin. All other metabolites in the biotin synthesis pathway are described

1491 in Figure 2A. X, complete loss of biotin utilization in O. tsutsugamushi.

1492 (B) Phylogenomic analysis of RP in Rickettsiales. Phylogeny estimation is further

1493 simplified relative to the tree in panel A. RP is considered an acquired trait, with origins

1494 shown as yellow circles. Spotted Fever Group rickettsiae are shown in gray since RP is

1495 unknown, though CI-like genes are present in some species. Information on rickettsial

1496 hosts/vectors (2) and known RP (67, 186) are compiled from numerous reports. Check

1497 marks reflect the compilations of groups of RP-inducing genes from our prior report, or bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Mechanisms of Evolution in Wolbachia-host associations 74

1498 the recent identification of the WO-mediated killing (wmk) gene implicated in Wolbachia

1499 male-killing (29). Helix-turn-helix domains of certain XRE-family like proteins across

1500 Rickettsiales were not considered related to wmk-encoded proteins. ‘Other’ refers to

1501 additional proteins presented in our prior report (67). bioRxiv preprint doi: https://doi.org/10.1101/2020.06.01.128066; this version posted June 2, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. A SYNTHESIS (BOOM) IMPORT LIGATION UTILIZATION BioB BioF BioH BioC BioD BioA BioY BirA2 PccA PccB FabD

BioB 6 Biotin BioD “Cand. Deianiraea vastatrix” ? PccA BirA A “Cand. Xenolissoclinum pacificiensis” BCCP Neorickettsia spp. 5 ATP “Cand. Neoehrlichia lotoris” Anaplasma spp. U BioA Biotin- Ehrlichia spp. U PccABCCP other Wolbachia 4 ATP Wolbachia (w/ BOOM; see Fig. 2B) - PccA HCO “Cand. Fokinia solitaria” BioF 3 Trichoplax sp. H2 endosymbiont ? 3 “Cand. Aquarickettsia rohweri” ? Carboxy-biotin- M “Cand. Midichloria mitochondrii” PccA BCCP Peranema trichophorum endosymbiont BioH “Cand. Jidaibacter acanthamoeba” Acetyl- PccB Stachyamoeba lipophora endosymbiont 2 CoA Rickettsiales bacterium str. Ac37b ? BioC R “Cand. Arcanobacter lacustris” Malonyl-CoA “Cand. Phycorickettsia trachydisci” 1 “Cand. Occidentia massiliensis” ? Holo- FabD X Orientia spp. ACP “Cand. Megaira polyxenophila” Group 1 “Cand. Sneabacter namystus” Malonyl-ACP Group 2 R. felis and R. hoogstraalii Group 3 R. buchneri (two BOOM copies) fatty acid all other Rickettsia spp. synthesis FabH

BioCBioHBioFBioABioDBioBBioY BPLXccAXccBFabD

B Hosts/vectors RP RP-inducing genes phenotypes cinAB cidAB cndAB other wmk “Cand. Deianiraeaceae” A Xenolyssoclinum ~66% of insect species; male killing; Neorickettsia many other arthropods; CI; PI; Wolbachia some filarial worms feminization M other Anaplasmataceae Midichloriaceae ancient Rickettsiaceae Mites (chiggers) PI; CI? Occidentia Orientia Beetles; wasps; male killing; R Megaira / Sneabacter booklice PI; CI? ancestral lineages Transitional Group Wasps; booklice PI; CI? Typhus Group Rickettsia Spotted Fever Group Ticks unknown Fig. 6