bioRxiv preprint doi: https://doi.org/10.1101/2020.12.29.424491; this version posted December 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
1 Ticks convert pathogenic Coxiella into endosymbionts
2
3 Amanda E. Brenner1, Sebastián Muñoz-Leal2, Madhur Sachan1, Marcelo B. Labruna3, Rahul
4 Raghavan1*
5
6 1Department of Biology and Center for Life in Extreme Environments, Portland State
7 University, Portland, Oregon, 97201, USA.
8 2Departamento de Patología y Medicina Preventiva, Facultad de Ciencias Veterinarias,
9 Universidad de Concepción, Chillán, Ñuble, Chile.
10 3Departamento de Medicina Veterinária Preventiva e Saúde Animal, Faculdade de Medicina
11 Veterinária e Zootecnia, Universidade de São Paulo, São Paulo, Brazil.
12
13
14 *Author for correspondence:
15 Rahul Raghavan | Phone: 503-725-3872 | Email: [email protected]
16
1 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.29.424491; this version posted December 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
17 ABSTRACT
18 Both symbiotic and pathogenic bacteria in the family Coxiellaceae cause morbidity and mortality
19 in humans and animals. For instance, Coxiella-like endosymbionts (CLEs) improve the
20 reproductive success of ticks — a major disease vector, while Coxiella burnetii is the etiological
21 agent of human Q fever and uncharacterized coxiellae cause infections in both animals and
22 humans. To better understand the evolution of pathogenesis and symbiosis in this group of
23 intracellular bacteria, we sequenced the genome of a CLE present in the soft tick Ornithodoros
24 amblus (CLEOA) and compared it to the genomes of other bacteria in the order Legionellales.
25 Our analyses confirmed that CLEOA is more closely related to C. burnetii, the human pathogen,
26 than to CLEs in hard ticks, and showed that most clades of CLEs contain both endosymbionts
27 and pathogens, indicating that several CLE lineages have evolved independently from
28 pathogenic Coxiella. We also determined that the last common ancestor of CLEOA and C.
29 burnetii was equipped to infect macrophages, and that even though HGT contributed
30 significantly to the evolution of C. burnetii, most acquisition events occurred primarily in
31 ancestors predating the CLEOA-C. burnetii divergence. These discoveries clarify the evolution
32 of C. burnetii, which previously was assumed to have emerged when an avirulent tick
33 endosymbiont recently gained virulence factors via HGT. Finally, we identified several metabolic
34 pathways, including heme biosynthesis, that are likely critical to the intracellular growth of the
35 human pathogen but not the tick symbiont, and show that the use of heme analogs is a
36 promising approach to controlling C. burnetii infections.
37
2 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.29.424491; this version posted December 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
38 INTRODUCTION
39 A bacterium’s genome size and gene content signal both the degree of its dependence on the
40 host and the length of the bacterium-host relationship. For example, a bacterium that has
41 established a long-term, obligate symbiosis would have a tiny genome filled with protein-coding
42 genes (Wernegreen et al. 2000). Conversely, the genome of a bacterium that is in early stages of
43 symbiosis is usually large and contains numerous pseudogenized genes, which, as the relationship
44 progresses, would eventually be lost, resulting in a tiny genome (Moran 2002; McCutcheon and
45 Moran 2011). The genomes of Coxiella-like endosymbionts (CLEs) found in ticks (Acari:
46 Ixodida) fall into both categories: Some ticks, e.g., Amblyomma americanum and A. sculptum,
47 contain small-genomed CLEs (~0.6 Mbp) that have few pseudogenes, indicating that they
48 represent an ancient lineage of tick endosymbionts (Smith et al. 2015). In contrast, CLEs in
49 Rhipicephalus turanicus (CRt), and R. sanguineus have large genomes (~1.7 Mbp) filled with
50 pseudogenes, denoting that the bacteria are in early stages of symbioses (Gottlieb et al. 2015;
51 Tsementzi et al. 2018). While most ticks contain CLEs, a few have Francisella-like
52 endosymbionts (FLEs) instead (Gerhart et al. 2016, 2018; Duron et al. 2017). All FLEs studied
53 to date have large genomes (~1.5 Mbp) with hundreds of pseudogenes, including inactivated
54 virulence genes, indicating that FLEs evolved recently from pathogenic ancestors (Gerhart et al.
55 2016, 2018; Duron et al. 2018b). Irrespective of their age, CLEs and FLEs improve the
56 reproductive fitness of their hosts by likely providing metabolites missing in vertebrate blood,
57 ticks’ sole nutritional source (Gottlieb et al. 2015; Smith et al. 2015; Gerhart et al. 2016, 2018;
58 Duron et al. 2017, 2018b; Tsementzi et al. 2018).
59 C. burnetii, the causative agent of human Q fever, has also been detected in ticks; in fact,
60 the intracellular pathogen was first isolated from hard ticks Dermacentor andersoni and
3 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.29.424491; this version posted December 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
61 Haemaphysalis humerosa (Cox 1938; Smith and Derrick 1940). In addition, transstadial
62 transmission and fecal excretion of C. burnetii occur in laboratory-raised ticks (Eldin et al. 2017;
63 Körner et al. 2020). However, it is not clear whether ticks play any meaningful role in the natural
64 spread of C. burnetii (Duron et al. 2015b); instead, Q fever generally occurs following inhalation
65 of C. burnetii-contaminated aerosols originating from infected farm animals (Maurin and Raoult
66 1999; Eldin et al. 2017). Within the human lungs, C. burnetii infects alveolar macrophages and
67 generates a large replicative vacuole, termed the Coxiella-containing vacuole (CCV), by
68 subverting host responses through a Dot/Icm Type IVB secretion system (T4BSS). This
69 secretion system is essential to the pathogenicity of both C. burnetii and Legionella pneumophila,
70 the two established pathogens in the order Legionellales (Segal et al. 1999; Chen et al. 2010;
71 Beare et al. 2011; Newton et al. 2014; Burstein et al. 2016). Genes for T4BSS, which evolved
72 from conjugation machinery, have spread across the bacterial kingdom via horizontal gene
73 transfer (HGT), a process through which organisms gain foreign genes, allowing them to quickly
74 adapt to a new environment (Ochman et al. 2000; Lerat et al. 2005).
75 The closest relatives of C. burnetii are CLEs present in ticks (Almeida et al. 2012; Duron
76 et al. 2015a; Smith et al. 2015), leading to the notion that the human pathogen emerged when
77 an avirulent tick endosymbiont gained pathogenicity genes, probably via HGT (Duron et al.
78 2015a; Gerhart et al. 2016). Contrary to this hypothesis, by sequencing the genome of a CLE in
79 Ornithodoros amblus (henceforth referred to as CLEOA), we show that a common virulent
80 ancestor gave rise to both C. burnetii and CLEOA. The potentially-pathogenic ancestor
81 contained genes for most virulence factors, including T4BSS, indicating that the erstwhile
82 bacterium was likely capable of infecting mammalian macrophages. In CLEOA, homologs of
83 most virulence-associated genes have been rendered non-functional, but its genome is enriched
4 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.29.424491; this version posted December 30, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
84 for B vitamin and cofactor biosynthesis pathways, suggesting that a virulent bacterium has
85 morphed into a nutrient-provisioning tick endosymbiont. Similar to this pathogen-to-
86 endosymbiont transformation, we found that several other tick endosymbionts likely evolved
87 from pathogenic ancestors, indicating that the blood-sucking arthropod is proficient at
88 “domesticating” virulent bacteria into endosymbionts. Finally, by inhibiting C. burnetii growth
89 using a synthetic analog of heme, a metabolite produced by C. burnetii but not by CLEOA, we
90 demonstrate how knowledge gained through comparative genomics could be applied to
91 developing novel strategies to control Q fever, which is difficult to treat with currently available
92 antibiotics.
93
94 RESULTS
95 CLEOA arose from a pathogenic ancestor
96 Phylogenetic trees based mainly on 16S rDNA have previously indicated that the closest relatives
97 of C. burnetii are CLEs present in Orniothodoros and Argas soft ticks (family Argasidae) (Almeida
98 et al. 2012; Duron et al. 2015a; Smith et al. 2015); however, all CLE genomes available to date
99 are from CLEs in hard ticks (family Ixodidae) (Gottlieb et al. 2015; Smith et al. 2015;
100 Tsementzi et al. 2018; Guizzo et al. 2017), stymieing earlier efforts to understand C. burnetii
101 evolution. Here, by sequencing the first soft-tick CLE genome, we were able to build a more
102 definitive phylogenomic tree, which confirmed that CLEOA is a sister taxon of C. burnetii (Fig.
103 1; Supplemental Table S1). In addition, the presence of pseudogenized T4BSS genes in
104 CLEOA indicates that the tick-symbiont evolved from a pathogenic ancestor with a functional
105 T4BSS (Fig. 2; Supplemental Table S2).
5 106 To resolve the ancestry of pathogenicity in Legionellales we determined the prevalence of
107 T4BSS, which is an essential virulence factors in this group of bacteria that includes human
108 pathogens (C. burnetii and L. pneumophila), opportunistic pathogens (Rickettsiella), and
109 symbionts (CLEs). Our analyses revealed that the secretion system is intact in all members of
110 this order with the exception of CLEs, which only contained remnants of the T4BSS (Figs. 1,
111 2). The most parsimonious explanation for this phyletic pattern is that T4BSS was present in the
112 common ancestor of all Legionellales and was later lost in lineages that gave rise to CLEs,
113 including CLEOA.
T4BSS Francisella tularensis FrancisellaFrancisellaFrancisella tularensis tularensistularensis FrancisellaFrancisellaFrancisellaFrancisella tularensis tularensistularensis tularensis ThiomicrospiraThiomicrospiraThiomicrospira aerophila aerophilaaerophila ThiomicrospiraThiomicrospiraThiomicrospiraThiomicrospira aerophila aerophilaaerophila aerophila BerkiellaBerkiellaBerkiella aquae aquaeaquae BerkiellaBerkiellaBerkiellaBerkiella aquae aquaeaquae aquae BerkiellaBerkiellaBerkiella cookevillensis cookevillensiscookevillensis BerkiellaBerkiellaBerkiellaBerkiella cookevillensis cookevillensiscookevillensis cookevillensis TatlockiaTatlockiaTatlockia micdadei micdadeimicdadei TatlockiaTatlockiaTatlockiaTatlockia micdadei micdadeimicdadei micdadei LegionellaLegionellaLegionella israelensis israelensisisraelensis LegionellaLegionellaLegionellaLegionella israelensis israelensisisraelensis israelensis LegionellaLegionellaLegionella pneumophila pneumophilapneumophila subsp. subsp.subsp. pascullei pasculleipascullei LegionellaLegionellaLegionellaLegionella pneumophila pneumophilapneumophila pneumophila subsp. subsp.subsp. subsp. pascullei pasculleipascullei pascullei LegionellaLegionellaLegionella pneumophila pneumophilapneumophila subsp. subsp.subsp. pneumophila pneumophilapneumophila str. str.str.Legionella ATCC ATCCATCCLegionellaLegionellaLegionella 43290 4329043290 pneumophila pneumophilapneumophila pneumophila subsp. subsp.subsp. subsp. pneumophila pneumophilapneumophila pneumophila str. str. str.ATCCstr. ATCCATCC ATCC 43290 4329043290 43290 LegionellaLegionellaLegionella pneumophila pneumophilapneumophila subsp. subsp.subsp. pneumophila pneumophilapneumophila str. str.str.Legionella Philadelphia1 Philadelphia1Philadelphia1LegionellaLegionellaLegionella pneumophila pneumophilapneumophila pneumophila subsp. subsp.subsp. subsp. pneumophila pneumophilapneumophila pneumophila str. str. str.Philadelphia1str. Philadelphia1Philadelphia1 Philadelphia1 FluoribacterFluoribacterFluoribacter bozemanae bozemanaebozemanae FluoribacterFluoribacterFluoribacterFluoribacter bozemanae bozemanaebozemanae bozemanae FluoribacterFluoribacterFluoribacter gormanii gormaniigormanii FluoribacterFluoribacterFluoribacterFluoribacter gormanii gormaniigormanii gormanii FluoribacterFluoribacterFluoribacter dumoffii dumoffiidumoffii str. str.str. NY23 NY23NY23 FluoribacterFluoribacterFluoribacterFluoribacter dumoffii dumoffiidumoffii dumoffii str. str. str.NY23str. NY23NY23 NY23 N1 FluoribacterFluoribacterFluoribacter dumoffii dumoffiidumoffii str. str.str. TexKL TexKLTexKL FluoribacterFluoribacterFluoribacterFluoribacter dumoffii dumoffiidumoffii dumoffii str. str. str.TexKLstr. TexKLTexKL TexKL AquicellaAquicellaAquicella lusitana lusitanalusitana AquicellaAquicellaAquicellaAquicella lusitana lusitanalusitana lusitana AquicellaAquicellaAquicella siphonis siphonissiphonis AquicellaAquicellaAquicellaAquicella siphonis siphonissiphonis siphonis DiplorickettsiaDiplorickettsiaDiplorickettsia massiliensis massiliensismassiliensis DiplorickettsiaDiplorickettsiaDiplorickettsiaDiplorickettsia massiliensis massiliensismassiliensis massiliensis RickettsiellaRickettsiellaRickettsiella viridis viridisviridis RickettsiellaRickettsiellaRickettsiellaRickettsiella viridis viridisviridis viridis RickettsiellaRickettsiellaRickettsiella grylli grylligrylli RickettsiellaRickettsiellaRickettsiellaRickettsiella grylli grylligrylli grylli RickettsiellaRickettsiella isopodorumisopodorum Rickettsiella isopodorum N2 RickettsiellaRickettsiella isopodorumisopodorum RickettsiellaRickettsiellaRickettsiella isopodorum isopodorumisopodorum CoxiellaCoxiella(CLEAA)Coxiella endosymbiont endosymbiontendosymbiont of ofof Amblyomma AmblyommaAmblyomma americanum americanumamericanum(CLEAA)(CLEAA)(CLEAA)(CLEAA) CoxiellaCoxiella(CRm)Coxiella endosymbiont endosymbiontendosymbiont of ofof Rhipicephalus RhipicephalusRhipicephalus microplus microplusmicroplus(CRm)(CRm)(CRm)(CRm) CoxiellaCoxiella(CRt)Coxiella endosymbiont endosymbiontendosymbiont of ofof Rhipicephalus RhipicephalusRhipicephalus turanicus turanicusturanicus(CRt)(CRt)(CRt)(CRt) Confidence N3 CoxiellaCoxiella(CLEOA)Coxiella endosymbiont endosymbiontendosymbiont of ofof Ornithodoros OrnithodorosOrnithodoros amblus amblusamblus(CLEOA)(CLEOA)(CLEOA)(CLEOA) 90-100 CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. Dugway DugwayDugway CoxiellaCoxiellaCoxiellaCoxiella burnetii burnetiiburnetii burnetii str. str. str.Dugwaystr. DugwayDugway Dugway 80-89 Coxiella burnetii str. CbuK Q154 70-79 N4 CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. CbuK CbuKCbuK Q154 Q154Q154 CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str. str.CbuK CbuKCbuK Q154 Q154Q154 CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. MSU MSUMSU Goat GoatGoat CoxiellaCoxiellaCoxiellaCoxiella burnetii burnetiiburnetii burnetii str. str. str.MSUstr. MSUMSU MSU Goat GoatGoat Goat N5 T4BSS Status CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. PE15890 PE15890PE15890 CoxiellaCoxiellaCoxiellaCoxiella burnetii burnetiiburnetii burnetii str. str. str.PE15890str. PE15890PE15890 PE15890 Coxiella burnetii str. ATCC 3262 Functional CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. ATCC ATCCATCC 3262 32623262 CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str. str.ATCC ATCCATCC 3262 32623262 Pseudogenized CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. NMII NMIINMII CoxiellaCoxiellaCoxiellaCoxiella burnetii burnetiiburnetii burnetii str. str. str.NMIIstr. NMIINMII NMII Absent CoxiellaCoxiellaCoxiella burnetii burnetiiburnetii str. str.str. RSA RSARSA493RSA 493 493493 CoxiellaCoxiellaCoxiellaCoxiella burnetii burnetiiburnetii burnetii str. str. str.RSA493str. RSA493RSA493 RSA493 114 115 Figure 1. CLEOA is the closet relative of C. burnetii. Maximum likelihood and Bayesian trees 116 built using 117 single-copy protein-coding genes were combined to generate the shown 117 phylogenomic tree. Bootstrap support and posterior probabilities agreed at all branchpoints and
6 118 are depicted as a single confidence value. The Dot/Icm Type IVB secretion system (T4BSS), 119 which is critical to pathogenesis, is found in all members of the order Legionellales, but has been 120 pseudogenized in CLEs. Nodes N1-N5 mark major branching points in the evolution of C. 121 burnetii. 122
CLEOA C. burnetti RSA493 CRt
IcmX IcmX IcmX Intact T4BSS Gene IcmW IcmW IcmW IcmV IcmV Pseudogenized DotA DotA T4BSS Gene
Intact Non-T4BSS Gene
DotB DotB Pseudogenized DotC DotC Non-T4BSS Gene DotD DotD IcmS IcmS IcmT Intact IcmQ Transposase Gene IcmP IcmQ IcmO Scale: 1.0 Kb IcmP IcmN IcmO IcmL2 IcmL1 IcmN IcmK IcmL2 IcmL IcmL1 IcmE IcmK IcmK IcmG IcmE IcmC
IcmG IcmC IcmD IcmD IcmJ IcmD IcmJ IcmB IcmB IcmB
IcmH IcmH 123
124 Figure 2. CLEs contain nonfunctional T4BSS. Comparison of T4BSS genes in CLEOA, C. 125 burnetii RSA493 and CRt indicate that the secretion system has been rendered nonfunctional in 126 tick endosymbionts. Filled blocks represent intact genes. Outlined blocks represent 127 pseudogenized genes. 128
129
7 130 Multiple CLEs have evolved from pathogens
131 Coxiella detected in ticks are classified into four clades, three of which contain intermingled
132 pathogens and endosymbionts (Fig. 3; Supplemental Table S3; Duron et al. 2015a). Clade A
133 includes C. burnetii — the human pathogen, and CLEOA, which arose from a pathogenic
134 ancestor, as discussed above. In Clade B, CLEs of Haemaphysalis ticks are present along with a
135 presumably pathogenic Coxiella that caused horse infection (Seo et al. 2016). Clade C has CRt,
136 a pathogen-derived endosymbiont, along with strains that caused opportunistic human skin
137 infections (Gottlieb et al. 2015; Angelakis et al. 2016; Guimard et al. 2017; Tsementzi et al.
138 2018; Ben-Yosef et al. 2020). Only Clade D, which contains small-genomed CLEs (e.g.,
139 CLEAA), has no known pathogenic representatives. This phylogenetic pattern of
140 endosymbionts clustering with pathogens indicate that, similar to the pathogenic ancestry of
141 CLEOA and CRt, CLEs of several other ticks have also evolved from pathogenic coxiellae.
142 Thus, based on phylogenetic and T4BSS distribution patterns, we surmise that Coxiella strains
143 that infect vertebrates (e.g., humans, horses, birds) and invertebrates (e.g., crayfish) are
144 widespread across the globe (Fig. 3), and ticks have converted several of them into beneficial
145 endosymbionts.
8 Legionella pneumophila subspecies pneumophila strain Philadelphia1 Coxiella endosymbiont of Ornithodoros marocanus isolate Omaro1 Coxiella endosymbiont of Amblyomma geayi isolate CoxAgeay1 Coxiella endosymbiont of Amblyomma naponense isolate CoxAnapo1 Coxiella endosymbiont of Haemaphysalis punctata isolate Haepun3 Coxiella species H-GG-175 Uncultured Coxiella species clone S40 Coxiella cheraxi strain TO-98 Coxiella endosymbiont of Ornithodoros erraticus isolate Oer1 CoxiellaLegionella endosymbiont pneumophila of subspecies Amblyomma pneumophila coelebs isolate strain CoxAcoel1 Philadelphia1 CoxiellaLegionella endosymbiont pneumophila of subspecies Amblyomma pneumophila calcaratum strain isolate Philadelphia1 CoxAcalca Coxiella endosymbiont of DermacentorOrnithodoros silvarummarocanus isolate isolate Dsilv1 Omaro1 Coxiella endosymbiont of AmblyommaDermacentorOrnithodoros geayi marginatusmarocanus isolate isolate CoxAgeay1 Dmar1Omaro1 Coxiella endosymbiont of Amblyomma naponensegeayiscalpturatum isolate isolate CoxAgeay1isolate CoxAnapo1 CoxAscalpt1 CoxiellaLegionellaLegionella endosymbiont pneumophilapneumophila of subspeciessubspecies AmblyommaHaemaphysalis pneumophilapneumophila naponenselatepunctatum punctata strainstrainisolate isolate isolate Philadelphia1Philadelphia1 CoxAnapo1 Haepun3 CoxAlate1* Human Pathogen LegionellaRickettsiellaCoxiella speciesendosymbiont pneumophila Clade H-GG-175 of subspecies OrnithodorosHaemaphysalis pneumophila sonrai punctata isolate strain isolate Oson1 Philadelphia1 Haepun3 LegionellaCoxiellaUnculturedCoxiella endosymbiontspeciesendosymbiont pneumophila Coxiella H-GG-175 species of ofsubspecies OrnithodorosOrnithodoros clone S40 pneumophila marocanusoccidentalismarocanus strain isolate isolateisolate Philadelphia1 Omaro1 Omaro1Oocci1 CoxiellaUnculturedCoxiella endosymbiontcheraxiendosymbiont Coxiella strain species ofTO-98of AmblyommaOrnithodorosAmblyomma clone* 1358S40 geayi geayimarocanus isolateisolate isolate CoxAgeay1CoxAgeay1 Omaro1 Crayfish Infection CoxiellaUnculturedCoxiella endosymbiont cheraxiendosymbiont Coxiella strain species ofTO-98of AmblyommaOrnithodorosAmblyomma clone 1357 geayinaponense naponensemarocanuserraticus isolate isolate isolate isolate isolateCoxAgeay1 Oer1 CoxAnapo1 CoxAnapo1Omaro1 CoxiellaCoxiella endosymbiont endosymbiont of of Amblyomma HaemaphysalisOrnithodorosHaemaphysalis naponensecajennensegeayicoelebs erraticus punctatapunctata isolate isolate isolate isolate isolate isolateisolateCoxAgeay1 CoxAcoel1 Oer1 CoxAnapo1 CoxAcaj1 Haepun3Haepun3 CoxiellaCoxiella endosymbiontspeciesspecies H-GG-175H-GG-175 of AmblyommaHaemaphysalis naponensecoelebssculptumcalcaratum punctata isolate isolate isolate isolateisolate CoxAcoel1 CeAS-UFV CoxAnapo1 CoxAcalcaHaepun3 CoxiellaUnculturedUncultured speciesendosymbiont CoxiellaCoxiella H-GG-175 speciesspecies of AmblyommaDermacentorHaemaphysalis* cloneclone S40S40 cajennense calcaratumsilvarum punctata isolate isolateisolate Dsilv1 CoxAcalcaAcajen1Haepun3 Equine Infection UnculturedCoxiellaCoxiella speciesendosymbiont cheraxicheraxi Coxiella strainH-GG-175strain species TO-98ofTO-98 DermacentorAmblyomma clone S40 americanummarginatussilvarum isolate isolateisolate Dsilv1 Dmar1 C904 CoxiellaUnculturedCoxiella endosymbiont cheraxiendosymbiont Coxiella strain species TO-98of of Rhipicephalus DermacentorOrnithodorosAmblyommaOrnithodoros clone S40 scalpturatummarginatus erraticus erraticusmicroplus isolateisolate isolateisolate isolate Oer1Oer1 Rhmicro1Dmar1 CoxAscalpt1 CoxiellaCoxiella endosymbiontcheraxiendosymbiont strain of TO-98of Rhipicephalus AmblyommaOrnithodorosAmblyomma coelebslatepunctatumscalpturatum coelebserraticus microplus isolateisolate isolate isolate isolate isolate CoxAcoel1CoxAcoel1 Oer1 CLERM CoxAscalpt1 CoxAlate1 CoxiellaCoxiella endosymbiont endosymbiont of ofof RhipicephalusAmblyommaOrnithodoros Amblyomma coelebs calcaratumlatepunctatumcalcaratumsonraierraticus australis isolateisolate isolateisolate isolateisolate isolateOson1CoxAcoel1 Oer1Rhaus1 CoxAcalcaCoxAcalca CoxAlate1 CoxiellaCoxiella endosymbiont endosymbiont of ofof Rhipicephalus AmblyommaDermacentorOrnithodorosDermacentor coelebs calcaratum silvarumsonrai occidentalissilvarumannulatus isolateisolate isolateisolate isolate isolate Oson1CoxAcoel1 Dsilv1Dsilv1 CoxAcalcaRhannu1 Oocci1 CoxiellaUnculturedCoxiella endosymbiont endosymbiont Coxiella species of of DermacentorRhipicephalus AmblyommaOrnithodorosDermacentor clone 1358 calcaratum silvarummarginatusoccidentalis marginatusdecoloratus isolate isolate isolate isolate isolateisolate Dsilv1 CoxAcalca Dmar1 Dmar1 Oocci1Rhdeco1 CoxiellaUnculturedCoxiella endosymbiont endosymbiont Coxiella species of of RhipicephalusDermacentor AmblyommaAmblyomma clone 13571358 scalpturatummarginatussilvarumscalpturatum evertsi isolateisolate isolate isolateisolate Rhever1Dsilv1 Dmar1 CoxAscalpt1CoxAscalpt1 * Avian Pathogen CoxiellaUnculturedCoxiella endosymbiont endosymbiont Coxiella species of of RhipicephalusAmblyommaDermacentor Amblyomma clone 1357 cajennense latepunctatumscalpturatummarginatuslatepunctatum *bursa isolate isolateisolate isolate isolateRhbursa1isolate CoxAcaj1Dmar1 CoxAscalpt1 CoxAlate1CoxAlate1 CoxiellaCoxiella endosymbiont endosymbiont of ofof RhipicephalusAmblyomma OrnithodorosOrnithodoros cajennenselatepunctatumsculptumscalpturatum sonraisonraisanguineus isolateisolate isolate isolate isolate isolate Oson1Oson1 CeAS-UFV CoxAcaj1 Rhsa1CoxAscalpt1 CoxAlate1 CoxiellaLegionellaCoxiellaceaeCoxiella endosymbiont endosymbiont pneumophila bacterium of ofAL07subspecies AmblyommaOrnithodoros Ornithodoros pneumophila cajennenselatepunctatum sculptumsonrai occidentalisoccidentalis isolate isolate strain isolate isolate isolateisolateOson1 Philadelphia1CeAS-UFV Acajen1 Oocci1 Oocci1CoxAlate1 CoxiellaUnculturedUncultured endosymbiont CoxiellaCoxiella speciesspecies ofof RhipicephalusAmblyommaOrnithodoros cloneclone 13581358 cajennense americanumsonraioccidentalis sanguineus isolate isolate isolateisolate Oson1 Acajen1 C904Oocci1CRS-CAT UnculturedCoxiellaCoxiellaceaeUncultured endosymbiont CoxiellaCoxiella bacterium speciesspecies of AL08 RhipicephalusOrnithodorosAmblyomma cloneclone 135713581357 americanummarocanusoccidentalis microplus isolate isolateisolate Rhmicro1Omaro1 C904Oocci1 UnculturedCoxiellaCoxiella endosymbiontendosymbiont Coxiella species ofof AmblyommaRhipicephalusAmblyomma clone 13571358 cajennensegeayicajennense microplusturanicus isolate isolate isolate isolateCoxAgeay1 Rhtura1 CoxAcaj1CLERMRhmicro1CoxAcaj1 CLADE D CoxiellaUnculturedCoxiella endosymbiont endosymbiont Coxiella species of of RhipicephalusAmblyomma Amblyomma clone 1357 cajennensenaponense sculptumsculptum australismicroplusturanicus isolate isolate isolate isolateisolate CeAS-UFV CeAS-UFV Rhaus1 CoxAnapo1CoxAcaj1CRtCLERM • CoxiellaCoxiella endosymbiontendosymbiont of of Rhipicephalus AmblyommaHaemaphysalisAmblyomma cajennensesculptumcajennense annulatusaustralispusillus punctata isolate isolate isolateisolate CeAS-UFVRhpusi1 Rhaus1 CoxAcaj1Acajen1Rhannu1Haepun3Acajen1 CLEs of Amblyomma CoxiellaCoxiella endosymbiontspecies endosymbiont H-GG-175 of of RhipicephalusAmblyommaOrnithodoros AmblyommaAmblyomma cajennensesculptum americanumamericanum peruvianusannulatusdecoloratus isolate isolate isolate isolate isolateisolate CeAS-UFV Acajen1Rhannu1 OpC2 C904 C904Rhdeco1 • No Known Pathogens CoxiellaUnculturedCoxiella endosymbiontendosymbiont Coxiella species of of AmblyommaRhipicephalus OrnithodorosRhipicephalus clone S40 cajennenseamericanum spheniscus evertsi microplusdecoloratusmicroplus isolate isolate isolateisolate isolate Rhever1 Acajen1 Rhmicro1 Rhmicro1Osphe1C904 Rhdeco1 CoxiellaCoxiella endosymbiontcheraxiendosymbiont strain ofTO-98ofof RhipicephalusAmblyommaOrnithodorosRhipicephalus americanum amblus bursamicroplusevertsimicroplus isolate isolate isolate isolateisolate Rhbursa1 AB428Rhever1 CLERMRhmicro1 CLERMC904 CoxiellaCoxiella endosymbiontendosymbiont ofof RhipicephalusOrnithodorosRhipicephalus capensiserraticus sanguineusaustralismicroplusbursaaustralis isolate isolate isolate isolateisolate isolate Rhbursa1 Oca1Oer1 Rhaus1 Rhaus1CLERMRhmicro1 Rhsa1 CoxiellaCoxiellaceaeCoxiella endosymbiont endosymbiont bacterium of ofAL07 Rhipicephalus AmblyommaOrnithodorosRhipicephalus coelebs maritimus australis annulatussanguineusmicroplusannulatus isolate isolate isolate isolateisolate isolate CoxAcoel1 Rhaus1 Omari1 Rhannu1CLERMRhannu1 Rhsa1 CoxiellaCoxiellaceaeCoxiella endosymbiont endosymbiont bacterium of ofAL07 Rhipicephalus ArgasAmblyommaRhipicephalus monachus calcaratum annulatusaustralissanguineus decoloratusdecoloratus isolate isolate isolateAmo01 isolate isolateisolate Rhaus1 CoxAcalcaRhannu1 CRS-CAT Rhdeco1Rhdeco1 CoxiellaCoxiellaceaeCoxiella endosymbiontburnetiiendosymbiont bacterium strain of Ammassalik ofAL08 Rhipicephalus DermacentorRhipicephalus silvarum annulatus evertsisanguineusdecoloratusevertsi isolateisolate isolate isolateisolate Rhever1Dsilv1Rhever1 Rhannu1 CRS-CATRhdeco1 Coxiella endosymbiontburnetii strain ofATCC RhipicephalusDermacentor 3262 marginatus evertsituranicus isolate isolate isolate Rhever1 Rhtura1 Dmar1 CLADE C CoxiellaCoxiellaceaeCoxiella endosymbiontendosymbiont bacterium of ofAL08 RhipicephalusRhipicephalus bursaevertsidecoloratusbursa isolate isolateisolate isolate Rhbursa1 Rhbursa1Rhever1 Rhdeco1 • CoxiellaCoxiella endosymbiontburnetiiendosymbiont strain ofNMIIof RhipicephalusAmblyommaRhipicephalus scalpturatum sanguineusbursaevertsituranicussanguineus isolate isolate isolate isolateisolate Rhbursa1 Rhever1 Rhtura1CRt Rhsa1CoxAscalpt1Rhsa1 CLEs of Rhipicephalus CoxiellaCoxiellaceaeCoxiellaceae endosymbiont bacteriumbacterium ofof AL07AL07 RhipicephalusAmblyommaRhipicephalusOrnithodoros* latepunctatum brasiliensis sanguineusbursapusillusturanicus isolate isolate isolate isolateisolate isolateRhbursa1 Rhpusi1 CRt Obr01Rhsa1 CoxAlate1 • Human Skin Infection Strains CoxiellaceaeCoxiellaCoxiella endosymbiont burnetiiendosymbiont bacterium strain of Dugwayof ofAL07 Rhipicephalus OrnithodorosRhipicephalus sonrai peruvianussanguineus pusillussanguineus isolate isolate isolate isolate isolateOson1 Rhpusi1 OpC2 Rhsa1 CRS-CATCRS-CAT CoxiellaCoxiellaceaeCoxiellaceae endosymbiontburnetii bacterium bacterium strain ofPE-15890 ofAL07 AL08AL08 RhipicephalusOrnithodoros* spheniscusoccidentalis peruvianussanguineus isolate isolate OpC2Osphe1 CRS-CATOocci1 CoxiellaceaeUnculturedCoxiellaCoxiella endosymbiontburnetiiendosymbiont Coxiella bacterium strain species of ofZ3055ofAL08 Rhipicephalus OrnithodorosRhipicephalus clone 1358 spheniscus amblussanguineus turanicusturanicus isolate isolate isolateisolate isolate AB428 Rhtura1 Rhtura1Osphe1 CRS-CAT CoxiellaUnculturedCoxiellaceaeCoxiella endosymbiontburnetii endosymbiont Coxiella bacterium strain species CbuKof of ofAL08 Rhipicephalus OrnithodorosRhipicephalus clone Q154 1357 capensis amblusturanicus turanicus isolate isolate isolate isolate AB428 Oca1 Rhtura1 CRtCRt CoxiellaCoxiella endosymbiontendosymbiont ofof RhipicephalusAmblyommaOrnithodorosRhipicephalus cajennense capensismaritimus rostratus pusillusturanicuspusillus isolateisolate isolateisolate Rhpusi1 Oca1Rhpusi1 Orost1 Omari1Rhtura1CRtCoxAcaj1 CoxiellaBacteriumCoxiella endosymbiont endosymbiont symbiont of Haemaphysalisof of RhipicephalusArgasAmblyomma OrnithodorosOrnithodoros monachus sculptumlagrangei maritimus peruvianusperuvianuspusillusturanicus isolate isolateclone isolateAmo01isolate isolateisolate HLSD3 CeAS-UFVRhpusi1 Omari1CRt OpC2OpC2 CoxiellaCoxiella endosymbiont burnetiispeciesendosymbiont strainH-JJ-10 ofAmmassalik of *RhipicephalusAmblyomma ArgasOrnithodorosOrnithodoros monachus cajennense spheniscusperuvianusspheniscuspusillus isolate isolate Amo01isolate isolate isolate Rhpusi1 Acajen1 OpC2 Osphe1Osphe1 CoxiellaUnculturedCoxiella endosymbiont burnetiiendosymbiont Coxiella strain species AmmassalikofATCC of OrnithodorosAmblyomma Ornithodoros clone 3262 CYP-1 americanum spheniscusperuvianus amblusamblus isolateisolate isolateisolate AB428AB428 OpC2Osphe1C904 CoxiellaBacteriumCoxiella endosymbiont burnetiiendosymbiont symbiont strain of HaemaphysalisofNMII ATCCof Rhipicephalus OrnithodorosOrnithodoros 3262 longicornis capensisspheniscusamblus capensismicroplus isolate isolateisolate isolateisolate AB428 Oca1Oca1 Rhmicro166Osphe1 CoxiellaUnculturedCoxiella endosymbiont burnetiiendosymbiont bacterium strain clone ofNMII ofof RhipicephalusOrnithodoros Ornithodoros CH1 capensis maritimusbrasiliensisamblusmaritimusmicroplus isolate isolate isolateisolate isolate AB428 Oca1 Omari1CLERMOmari1 Obr01 CoxiellaBacteriumCoxiella endosymbiont burnetiiendosymbiont symbiont strain of Haemaphysalisof Dugwayofof Rhipicephalus ArgasOrnithodorosArgas monachusmonachus shimogacapensismaritimus brasiliensisaustralis isolateisolate clone isolate isolate isolate Amo01 Amo01isolate HSKY-3 Oca1 Rhaus1Omari1 Obr01 CoxiellaCoxiella endosymbiont burnetiiburnetii strainstrain of AmmassalikPE-15890DugwayAmmassalik RhipicephalusHaemaphysalisArgasOrnithodoros monachus maritimus annulatus obesa isolate isolate isolateAmo01isolate TPSD8 Omari1Rhannu1 CoxiellaCoxiella endosymbiontburnetii burnetii strain strainstrain ofAmmassalik ATCCPE-15890Z3055ATCC IxodesRhipicephalusArgas 32623262 monachushexagonus decoloratus isolate isolate Amo01 Ihexa1 isolate Rhdeco1 CLADE B CoxiellaCoxiella endosymbiont burnetiiburnetii strainstrain ofCbuKNMIIAmmassalikATCCZ3055NMII RhipicephalusIxodes Q1543262 ricinus isolateevertsi Iric1isolate Rhever1 • CLEs of Haemaphysalis CoxiellaCoxiella endosymbiontburnetii endosymbiont strain ofNMIICbuK ATCCofof RhipicephalusIxodes OrnithodorosOrnithodoros Q1543262 uriae isolate brasiliensisrostratusbrasiliensisbursa Iuriae1 isolate isolate isolateisolate Rhbursa1 Orost1 Obr01Obr01 • Equine Infection Strain CoxiellaBacteriumCoxiella endosymbiontburnetii burnetii symbiont strain strain of HaemaphysalisofNMII DugwayofDugway RhipicephalusAmblyommaOrnithodoros splendidumlagrangei brasiliensisrostratussanguineus cloneisolate isolate isolateisolate HLSD3 Orost1 CoxAsplendi1 Obr01Rhsa1 CoxiellaCoxiellaceaeBacteriumCoxiella burnetiiendosymbiontspeciesburnetii symbiont bacterium strain H-JJ-10strain of HaemaphysalisofDugway PE-15890PE-15890AL07 AmblyommaOrnithodoros variegatumlagrangei brasiliensis clone isolate isolate HLSD3 Avar1 Obr01 CoxiellaUnculturedCoxiella endosymbiontburnetii speciesburnetii Coxiella strain H-JJ-10strain species ofPE-15890Dugway Z3055Z3055 IxodesRhipicephalus clone species CYP-1 sanguineus isolate Isp1iso1 isolate CRS-CAT CoxiellaCoxiellaceaeBacteriumUnculturedCoxiella burnetiiendosymbiontburnetii symbiont Coxiella bacterium strainstrainstrain speciesof CbuKHaemaphysalisofPE-15890 Z3055CbuKAL08 Amblyomma clone Q154Q154 CYP-1 thollonilongicornis isolate isolate CoxAthol1 66 Geographic Origin CoxiellaUnculturedBacteriumCoxiella burnetiiendosymbiont endosymbiont symbiont bacterium strainstrain of clone CbuKofHaemaphysalis Z3055of RhipicephalusAmblyomma OrnithodorosOrnithodoros CH1 Q154 loculosum longicornis rostratusrostratusturanicus isolate isolate isolateisolate Orost1Aloc1Orost1 Rhtura166 North America CoxiellaBacteriumUnculturedBacterium burnetiiendosymbiont symbiontsymbiontbacterium strain ofof clone HaemaphysalisCbuKofHaemaphysalis OrnithodorosRhipicephalus CH1 Q154 lagrangeishimoga kairouanensislagrangeirostratus turanicus clone cloneisolateclone isolate isolateHSKY-3 HLSD3HLSD3 Orost1 CRt Oka1 BacteriumCoxiellaCoxiella endosymbiont speciesspecies symbiont H-JJ-10H-JJ-10 of Haemaphysalisofof RhipicephalusHaemaphysalisOrnithodoros lagrangeishimoga rupestris rostratuspusillus obesa cloneisolate isolatecloneisolate HSKY-3 HLSD3 TPSD8 Rhpusi1Orupes1Orost1 South America CoxiellaBacteriumUnculturedUncultured endosymbiontspecies symbiont CoxiellaCoxiella H-JJ-10 speciesofspecies Haemaphysalisof IxodesHaemaphysalisOrnithodoros cloneclone hexagonus CYP-1CYP-1 lagrangei peruvianus obesa isolate isolateclone isolateIhexa1 HLSD3 TPSD8 OpC2 Europe UnculturedCoxiellaBacteriumBacterium endosymbiontspecies symbiontCoxiellasymbiont H-JJ-10 species ofof HaemaphysalisofHaemaphysalis IxodesOrnithodoros clone hexagonusricinus CYP-1 longicornisspheniscus longicornisisolate isolate Iric1 isolateisolateIhexa1 66Osphe166 Asia BacteriumCoxiellaUnculturedUncultured endosymbiont symbiont Coxiella bacteriumbacterium speciesof clone cloneHaemaphysalisof IxodesOrnithodoros CH1cloneCH1 uriaericinus CYP-1 isolatelongicornis amblusisolate Iuriae1 isolateIric1 isolate AB428 66 UnculturedCoxiellaBacteriumBacterium endosymbiont symbiontsymbiontbacterium ofof clone HaemaphysalisofHaemaphysalis IxodesOrnithodorosAmblyomma CH1 uriae isolatesplendidum shimogalongicorniscapensisshimoga Iuriae1 clone cloneisolate isolate HSKY-3HSKY-3 Oca1 CoxAsplendi166 Africa BacteriumCoxiellaUnculturedCoxiella endosymbiontendosymbiont symbiont bacterium of clone Haemaphysalis ofof HaemaphysalisAmblyommaOrnithodorosHaemaphysalis CH1 variegatumsplendidum shimogamaritimus obesaobesa clone isolateisolate isolate isolateisolate HSKY-3 TPSD8TPSD8 Omari1 Avar1CoxAsplendi1 Australia CoxiellaBacteriumCoxiella endosymbiontendosymbiont symbiont of Haemaphysalis ofof IxodesHaemaphysalisArgasAmblyommaIxodes monachus hexagonusspecieshexagonus variegatumshimoga isolateobesa isolate isolateisolate cloneisolateIsp1iso1 Amo01isolate Ihexa1Ihexa1 HSKY-3 TPSD8 Avar1 CoxiellaCoxiella endosymbiont burnetiiendosymbiont strain ofAmmassalik of IxodesHaemaphysalis AmblyommaIxodes hexagonusspecies ricinusricinus* tholloni isolate isolateobesa isolate isolate Iric1isolateIric1Isp1iso1 Ihexa1 CoxAthol1 TPSD8 CLADE A CoxiellaCoxiella endosymbiontburnetiiendosymbiont strain ofATCCof IxodesAmblyommaIxodes 3262 uriaehexagonusricinusuriae* isolateloculosumtholloniisolate isolate isolate Iuriae1 Iuriae1isolate Iric1 isolate Ihexa1 CoxAthol1 Aloc1 • CLEs of Ornithodoros Confidence CoxiellaCoxiella endosymbiontburnetii endosymbiont strain ofNMII of Ixodes OrnithodorosAmblyommaAmblyomma uriae isolate splendidumloculosumsplendidumkairouanensis Iuriae1 isolate isolateisolate isolate Aloc1 CoxAsplendi1CoxAsplendi1 Oka1 Coxiella endosymbiont of Ixodes* ricinus isolate Iric1 • Human Pathogen C. burnetii 90-100 CoxiellaCoxiella endosymbiont endosymbiont of of Ixodes AmblyommaOrnithodorosAmblyomma uriae isolate variegatumsplendidum variegatumkairouanensisbrasiliensisrupestris Iuriae1 isolate isolateisolate isolateisolate isolate Orupes1 Avar1CoxAsplendi1 Avar1Obr01 Oka1 CoxiellaCoxiella endosymbiontburnetiiendosymbiont strain ofDugwayofof IxodesAmblyommaOrnithodorosIxodes speciesspecies variegatumsplendidum rupestris isolateisolate Isp1iso1Isp1iso1isolate isolateisolate Orupes1 Avar1CoxAsplendi1 80-89 * CoxiellaCoxiella endosymbiont burnetiiendosymbiont strain of PE-15890of IxodesAmblyomma Amblyomma species* variegatum thollonitholloni isolate isolateisolate Isp1iso1 isolate CoxAthol1CoxAthol1 Avar1 70-79 CoxiellaCoxiella endosymbiont burnetiiendosymbiont strain of Z3055of Ixodes AmblyommaAmblyomma* species tholloniloculosumloculosum isolate isolate Isp1iso1 isolateisolate CoxAthol1 Aloc1Aloc1 CoxiellaCoxiella burnetii endosymbiontendosymbiont strain CbuK ofof OrnithodorosAmblyommaOrnithodoros Q154* loculosumtholloni kairouanensiskairouanensis isolate isolate isolateCoxAthol1isolate Aloc1 Oka1Oka1 Pathogen* CoxiellaCoxiella endosymbiont endosymbiont of of OrnithodorosAmblyomma Ornithodoros loculosum kairouanensis rupestrisrostratusrupestris isolateisolateisolate isolate Orupes1Orost1Aloc1Orupes1 Oka1 CoxiellaBacterium endosymbiont symbiont of Haemaphysalisof Ornithodoros lagrangeikairouanensisrupestris cloneisolate isolate HLSD3 Orupes1 Oka1 146 Coxiella endosymbiontspecies H-JJ-10 of Ornithodoros rupestris isolate Orupes1 Uncultured Coxiella species clone CYP-1 Bacterium symbiont of Haemaphysalis longicornis isolate 66 147 Figure 3. CLE clades contain both tickUncultured endosymbionts bacterium clone CH1 and pathogens. A 16S rDNA-based Bacterium symbiont of Haemaphysalis shimoga clone HSKY-3 Coxiella endosymbiont of Haemaphysalis obesa isolate TPSD8 148 phylogenetic tree is shown. Bootstrap Coxiellasupport endosymbiont and posterior of Ixodes hexagonus probabilities isolate Ihexa1 are labeled above and Coxiella endosymbiont of Ixodes ricinus isolate Iric1 Coxiella endosymbiont of Ixodes uriae isolate Iuriae1 149 below branchpoints, respectively. NodesCoxiella with endosymbiont ≤70% ofbootstrap Amblyomma splendidum support isolate were CoxAsplendi1 collapsed to Coxiella endosymbiont of Amblyomma variegatum isolate Avar1 Coxiella endosymbiont of Ixodes species isolate Isp1iso1 150 polytomies. Taxa colors represent the continentCoxiella endosymbiont from of Amblyommawhich the tholloni host isolate was CoxAthol1 derived. Established Coxiella endosymbiont of Amblyomma loculosum isolate Aloc1 Coxiella endosymbiont of Ornithodoros kairouanensis isolate Oka1 151 pathogens are marked with asterisks. CladesCoxiella endosymbiont A-D were of Ornithodoros originally rupestris defined isolate Orupes1 by Duron et al. 2015a. 152
153 HGT was a major contributor to gene accumulation in C. burnetii’s ancestors
154 In order to better understand the evolution of C. burnetii, we traced the ancestry of its protein-
155 coding genes by determining whether their orthologs — either functional or pseudogenized —
156 were present in other Legionellales members. Out of 1,530 protein-coding genes whose
157 ancestries we could trace, 790 were deemed to be ancestral, meaning it was present in the
9 158 ancestor that diverged from Legionella (Node 1), and an additional 585 genes originated in
159 Nodes 2-4 (Fig. 4; Supplemental Table S4). This data demonstrates that the common ancestor
160 of C. burnetii and CLEOA contained most of the genes, including virulence factors, present in
161 C. burnetii, and was hence well equipped to infect mammals.
162 A major impediment to unspooling the evolutionary history of C. burnetii is the sparse
163 availability of Coxiellaceae genomes, which makes it difficult to ascertain whether genes were
164 gained by C. burnetii’s ancestors at a Nodes 2-5 or were instead lost in other bacteria represented
165 at each node. To overcome this difficulty, we calculated each C. burnetii gene’s nucleotide
166 composition (%GC) and Codon Adaptation Index (CAI), two measures known to distinguish
167 foreign-origin genes from ancestral ones (Fig. 4; Supplemental Table S5; Sharp and Li 1987;
168 Lawrence and Ochman 1997; Jansen et al. 2003; Raghavan et al. 2012). Both %GC and CAI
169 values for genes that originated in Nodes 3-5 were significantly different from those of ancestral
170 (Node 1) genes, indicating that a considerable portion of these genes were acquired horizontally.
171 [Node 2 genes were excluded from this analysis due to small sample size (n=13).] Interestingly,
172 %GC and CAI values for Node 5 genes were not significantly different from those gained at
173 Node 4, suggesting that many of the genes currently found only in the human pathogen were
174 present in the common ancestor of C. burnetii and CLEOA, and were later lost in the tick
175 endosymbiont. However, it is clear that HGT has contributed to the accumulation of genes at
176 Node 5 as well because 88 out of 155 genes in this category showed phylogenetic patterns
177 consistent with HGT (Supplemental Fig. S1, Table S6). Cumulatively, our results validate the
178 important role HGT has played in assembling C. burnetii’s protein repertoire (Moses et al.
179 2017), and show that this process occurred principally in ancestors that preceded the C. burnetii-
180 CLEOA split.
10 Codon Adaptation Index %GC Codon Adaptation Index %GC A B C
0.8 0.8 N3 N2 0.5 N5 0.5
0.7 0.7 N1 CAI 0.4 CAI 0.4 %GC %GC N4 0.6 0.6 0.3 0.3 0.5 0.5 Model N1 N3 N4 N5 (n) N1 N3 N4 N5 (n) Model N1 N3 N4 N5 N1 N3 N4 N5 (790) N1 (22) Model (148) N3 **** (768) N1 ns P-Value Key **** <0.0001 **** * (148) N3 ns ** (424) N4 *** <0.001 (155) N5 **** ** ns (424) N4 **** **** **** ** <0.01 (155) N5 **** **** **** ns * <0.05 181
182 Figure 4. HGT was a major contributor to gene accumulation in C. burnetii’s ancestors. Nodes 183 N1-N5 (as labeled in Fig. 1) represent major branchpoints in the evolution of C. burnetii. (A) 184 and (B) depict %GC and Codon Adaptation Index (CAI) distributions, respectively, for genes 185 originating at each node. Boxes illustrate each distribution’s interquartile range while the black 186 line dividing the box represents the median. Whiskers represent minimum and maximum values, 187 excluding outliers (black diamonds) which were determined using the Tukey method. P-values 188 shown in tables are for pairwise T-tests (pooled SD, BKY adjusted). Genes gained at N2 were 189 excluded due to small sample size (n=13). (C) C. burnetii genome composition based on nodes of 190 gene origin: N1: 43.9%, N2: 0.7%, N3: 8.2%, N4: 22.6%, N5: 8.6%. The unlabeled portion 191 represents potentially spurious genes (n=224) as well as genes with undefined nodes of origin 192 (n=44). 193
194 CLEOA potentially provides O. amblus with nutrients missing in vertebrate blood
195 Similar to other hematophagic organisms (Duron and Gottlieb 2020), ticks likely obtain
196 nutrients missing in blood from endosymbiotic bacteria such as CLEs and FLEs (Gottlieb et al.
197 2015; Smith et al. 2015; Gerhart et al. 2016, 2018; Duron et al. 2018b; Tsementzi et al. 2018).
198 An analysis of the CLEOA genome revealed that a large number of genes have been inactivated
199 in the endosymbiont as part of its adaptation to an obligately host-dependent lifestyle (Table 1).
200 Nevertheless, the bacterium has retained complete pathways for the synthesis of several vitamins
201 and cofactors, including those for the biosynthesis of biotin (vitamin B7), folic acid (B9),
202 riboflavin (B2), FAD, CoA, and lipoic acid. These pathways are also conserved in all CLEs and
11 203 FLEs (Fig. 5), suggesting that CLEOA and other tick endosymbionts provide one or more of
204 these metabolites to their respective hosts.
205 In addition to the vitamin and cofactor biosynthesis genes, CLEOA contains 91 genes
206 that have been deactivated or lost in C. burnetii (Supplemental Table S7). It is likely that many
207 of these CLEOA-specific genes have functions in the tick ecosystem but are not useful during
208 mammalian infections. Collectively, based on its genomic features (Table 1), recent loss of
209 T4BSS (Fig. 2), and presence of intact pathways for synthesizing B vitamins and cofactors (Fig.
210 5), we conclude that a pathogenic Coxiella was recruited to function as a nutrient-provisioning
211 endosymbiont in O. amblus.
212
213
214 Figure 5. CLEs and FLEs encode B vitamin and cofactor biosynthetic pathways. CLEOA, 215 CRt, CLEAA, FLE-Om (Francisella endosymbiont of Ornithodoros moubata; LVCE00000000), 216 FLE-Am (Francisella endosymbiont of Amblyomma maculatum; LNCT00000000), and F. persica 217 (FLE in Argas arboreus; NZ_CP013022) contain intact pathways for the synthesis of B vitamins 218 and cofactors. Enzymes catalyzing each step are labeled with gene names and EC numbers. 219 Blocks with no color indicate that a functional copy of a gene was not detected in that genome. 220 Depictions of metabolic pathways modified from Gerhart et al. 2018.
12 221 Table 1. Genome characteristics of CLEOA and C. burnetii. 222 CLEOA C. burnetii RSA493 NCBI Accession VFIV00000000 AE016828.3 Length (bp) 1,561,173 1,995,488 %GC 40.6 42.7 rRNA (5S, 16S, 23S) 1,1,1 1,1,1 tRNA 42 42 Functional genes 889 1,798 Pseudogenes 660 197 Single copy genesa 106/111 106/111 223 a Albertson et al. 2013 224
225 CLEOA has not retained genes that allow C. burnetii to withstand the harsh environs of CCV
226 Loss of T4BSS and associated genes. Because C. burnetii diverged from CLEOA recently, it
227 provided us an opportunity to perform a thorough comparison of the human pathogen’s genome
228 to that of a closely-related non-pathogen. Our analysis identified 980 functional genes in C.
229 burnetii whose orthologs have been either pseudogenized or deleted in CLEOA (Supplemental
230 Table S8). Among them are genes for T4BSS and for effectors secreted through T4BSS (Fig. 2;
231 Supplemental Table S2) that enable C. burnetii to generate its intracellular replicative niche,
232 termed Coxiella-containing vacuole (CCV) (Chen et al. 2010; Beare et al. 2011; Martinez et al.
233 2014, 2020; Newton et al. 2014). In addition, genes for PmrAB and EirA that control T4BSS
234 activity, and for eight T4BSS effectors present on C. burnetii’s QpH1 plasmid have been
235 inactivated in CLEOA (Maturana et al. 2013; Beare et al. 2014; Kuba et al. 2020). Therefore,
236 the secretion system, which is critical to the intra-macrophage growth of C. burnetii, is clearly
237 not required for CLEOA to grow within tick cells.
238
13 239 Loss of transporters of antibacterial molecules. To protect itself from noxious molecules
240 produced by the host, C. burnetii likely depends on transport proteins that efflux harmful
241 substances out of its cytoplasm. For instance, macrophages increase Cu2+ concentration within
242 phagosomes to kill intracellular bacteria (Neyrolles et al. 2015), and C. burnetii probably utilizes a
243 P-1B type ATPase to export copper from its cytoplasm to sustain intracellular growth (Rowland
244 and Niederweis 2012). This ATPase, along with 18 others of unknown function, have been
245 pseudogenized or deleted in CLEOA (Supplemental Table S9). In addition, C. burnetii encodes
246 25 putative transporters that could facilitate the pathogen’s growth within CCV, out of which,
247 19 have become non-functional in CLEOA (Supplemental Table S9), suggestive of the mild
248 nature of the tick endosymbiont’s intracellular compartment where antibacterial molecules are
249 probably not a major threat.
250
251 Diminished pH regulation. A defining feature of CCV is its acidity (pH ~4.75) (Vallejo
252 Esquerra et al. 2017; Samanta et al. 2019). To compensate, C. burnetii utilizes several
253 mechanisms to maintain its cytoplasmic pH close to neutral (Hackstadt 1983), many of which
254 have been rendered non-functional in CLEOA. For example, the enzyme carbonic anhydrase
3- 255 that catalyzes the production of bicarbonate (HCO ) buffer from CO2 (Vallejo Esquerra et al.
256 2017; Bury-Moné et al. 2008) has been pseudogenized in CLEOA (Supplemental Table S10).
257 Another pH-regulating strategy used by C. burnetii is to remove excess protons in its cytoplasm
258 via proton-antiporters such as the multi-protein Mrp antiporter, a pair of Na+/H+ antiporters,
259 and a K+/H+ antiporter. C. burnetii also encodes a pair of glutamate/gamma-aminobutyrate
260 (GABA) antiporters that export GABA in exchange for glutamate, thereby reducing the
261 cytoplasmic proton content. The Mrp antiporter, one of the two Na+/H+ antiporters, and both
14 262 glutamate/GABA antiporters have been pseudogenized in CLEOA (Supplemental Table S10).
263 A hallmark of C. burnetii is its unusually high number of basic proteins (~46% of proteins have
264 pI values ³9; average pI 8.22) that could function as a “proton sink,” which allows the pathogen
265 to maintain its cytoplasmic pH close to neutral (Seshadri et al. 2003). In contrast, only ~39% of
266 proteins in CLEOA have pI values ³9 (average pI 8.0), again illustrating a lack of acidic stress
267 within CLEOA’s intracellular vacuole. Collectively, our data suggest that the endosymbiont does
268 not face the constant threat of excess protons entering its cytoplasm, probably because its
269 intracellular niche, unlike C. burnetii’s, has a pH closer to neutral.
270
271 Loss of cell membrane and cell wall genes. In gram-negative bacteria, inner and outer
272 membranes along with peptidoglycan play important roles in stress response (Rowlett et al.
273 2017). In CLEOA, the gene that encodes PlsC, which converts lysophosphatidic acid into
274 phosphatidic acid (PA), a universal intermediate in the biosynthesis of membrane phospholipids,
275 has been pseudogenized. The plsC gene is essential in Escherichia coli, and a transposon insertion
276 in this gene in C. burnetii caused severe intracellular growth defect (Coleman 1990; Martinez et
277 al. 2014); hence, it is not clear how CLEOA is able to build its membranes without a functional
278 plsC, but one possibility is that the endosymbiont utilizes PA obtained from its host. Another
279 membrane-associated loss of function in CLEOA is the pseudogenization of the pldA gene that
280 encodes phospholipase A (PldA), which is critical to C. burnetii’s outer membrane function and
281 for optimal growth within macrophages (Stead et al. 2018). As for its peptidoglycan, CLEOA
282 contains intact genes for D, D-transpeptidases (also known as penicillin-binding proteins) that
283 catalyze 4-3 peptide cross-links between D-alanine and diaminopimelate; however, all L, D-
15 284 transpeptidase genes (annotated as “enhanced entry proteins”) have been pseudogenized,
285 indicating that the tick symbiont does not have the ability to generate 3–3 cross-links between
286 diaminopimelate molecules in its peptidoglycan. These nonclassical cross-links contribute to C.
287 burnetii’s environmental stability (Sandoz et al. 2016), and are probably not critical to CLEOA
288 because the endosymbiont is passed vertically from one generation to next. Collectively, as
289 observed in other endosymbionts (Nakabachi et al. 2006; McCutcheon and Moran 2010; Chong
290 and Moran 2018), CLEOA lacks numerous proteins that are typically considered integral to the
291 optimal functioning of bacterial cell membrane and cell wall.
292
293 Loss of antioxidant genes. An intricate network of antioxidants allows C. burnetii to thrives in a
294 phagolysosome-derived intracellular vacuole (Mertens and Samuel 2012). In contrast to CCV,
295 oxidative stress appears to be minimal in CLEOA’s intracellular vacuole because the
296 endosymbiont contains only a streamlined version of C. burnetii’s antioxidant defense system. For
297 instance, C. burnetii contains two superoxide dismutases (SODs), but CLEOA has retained the
298 cytoplasmic Fe-containing SodB, but not SodC, the periplasmic Cu/Zn-SOD. OxyR, the
299 master regulator of peroxide stress, along with a catalase, a peroxidase (AhpC2), a methionine
300 sulfoxide reductase, a hemerythrin-like protein, and a glutathione transferase that together help
301 mitigate oxidative stress have also been deactivated in CLEOA (Supplemental Table S11). In
302 addition, C. burnetii, but not CLEOA, has the ability to synthesize queuine, a guanine analog,
303 found in the first anticodon position of several post-transcriptionally modified tRNAs (Iwata-
304 Reuyl 2003). The precise functions of queuine is not understood, but it is thought to promote
305 the activity of antioxidant enzymes, including catalase, superoxide dismutase, and glutathione
16 306 transferase, most of which, as mentioned above, have lost their functionality in CLEOA (Koh
307 and Sarin 2018).
308 C. burnetii utilizes both cytochrome bd (encoded by genes cydABX) and cytochrome o
309 (encoded by genes cyoABCD) as terminal oxidases, but CLEOA has only retained cytochrome o
310 genes. Cytochrome bd, which also functions as a quinol peroxidase that prevents the buildup of
311 oxidative free radicals (Endley et al. 2001; Omsland and Heinzen 2011), has become
312 nonfunctional in the tick endosymbiont. In addition, CLEOA does not encode genes for an acid
313 phosphatase and two sterol reductases that likely modify host proteins and cholesterol,
314 respectively, to protect C. burnetii from host-induced oxidative stress (Seshadri et al. 2003; Gilk
315 et al. 2010; Hill and Samuel 2011; Gilk 2012). Finally, C. burnetii is thought to compensate for
316 the lack of the oxidative branch of Pentose Phosphate Pathway (PPP)— a major source of
317 NADPH, by utilizing alternative NADPH-regenerating enzymes such as short chain
318 dehydrogenases and sterol reductases, and by salvaging NAD+ from the host (Bitew et al. 2018,
319 2020). In CLEOA, all four short chain dehydrogenases, the two eukaryote-like sterol reductases,
320 and the nicotinate-salvaging protein have become nonfunctional. In total, while the human
321 pathogen contains several mechanisms to defend against oxidative stress, most of these
322 antioxidant systems have been lost in CLEOA, most likely due to minimal oxidative stress
323 experienced by the bacterium within tick cells. Collectively, the loss of T4BSS, transporters, pH
324 regulation, cell wall modification, and antioxidant defense in CLEOA show that its intracellular
325 vacuole is a less stressful place to live than the phagolysosome-derived CCV occupied by C.
326 burnetii.
327
328 Heme analog inhibits C. burnetii growth
17 329 Cytochromes require heme as a cofactor, but CLEOA does not contain a functional heme
330 biosynthesis pathway, which is present in C. burnetii (Supplemental Table S12). The only intact
331 heme biosynthesis gene in CLEOA is ctaB, which encodes an enzyme that converts heme b to
332 heme o, a component of cytochrome o — the sole terminal cytochrome oxidase present in
333 CLEOA (Saiki et al. 1992). Based on this evidence, the endosymbiont appears to import heme b
334 from the tick hemocoel (vertebrate hemoglobin contains heme b) and converts it to heme o using
335 the ctaB-encoded protoheme IX farnesyltransferase. Additionally, while C. burnetii can import
336 ferrous iron released from iron-containing host molecules such as ferritin and transferrin
337 (Sanchez and Omsland 2020), free Fe2+ does not seem to be important for CLEOA’s
338 intracellular growth because the iron transporter FeoB has been pseudogenized, suggesting that
339 host-derived heme b serves as the tick endosymbiont’s heme and iron source.
340 The heme biosynthesis pathway, while absent in CLEOA, is conserved in all strains of C.
341 burnetii, probably because the iron-protoporphyrin molecule is critical to the pathogen’s ability to
342 grow within human macrophages (Moses et al. 2017). We tested C. burnetii’s dependence on
343 heme by treating both axenically grown and intracellular C. burnetii with gallium protoporphyrin
344 IX (GaPPIX), which can replace heme in cytochromes and other heme-containing enzymes
345 (Hijazi et al. 2017, 2018). As shown in Figure 6, ³250 nM of GaPPIX caused significant
346 inhibition of C. burnetii growth in ACCM-2, and treatment with ³2µM of GaPPIX resulted in
347 significant growth impairment of C. burnetii within THP-1 cells. Reassuringly, only GaPPIX
348 concentrations of ³512µM caused cytotoxicity in THP-1 cells (Supplemental Fig. S2),
349 indicating that gallium compounds could potentially be used to treat C. burnetii infections.
350
18 A 1.0 1.0 * ***
0.8 *** ** **
1.0 *** *** *** ***
0.8 *** ***
0.6 *** 0.8 *** *** 0.6 *** *** ***
0.4 *** 0.4 0.6 0.2 Relative Flourescence Intensity 0.2 0.4 Relative Flourescence Intensity 0.0 0.0 0 125 250 500 1000 2000 4000 8000
Relative Flourescence Intensity 0.2 0 125 250 500GaPPIX (nM)1000 2000 4000 8000 GaPPIX (nM) 0.0 0 125 250 500 1000 2000 4000 8000 GaPPIXB (nM) 08h post-treatment 1.0
08h24h post-treatment post-treatment 1.0 ** P-Values24h48h post-treatment post-treatment08h post-treatment 1.0 ***
*** <0.0001 *** 48h72h post-treatment post-treatment24h post-treatment 0.5 ** <0.001 72h post-treatment48h post-treatment * <0.01 0.5
72h post-treatment Difference Fold 0.5
Fold Difference Fold 0.0 0 2 8 32 Fold Difference Fold 0.0 GaPPIX (uM) 0 2 8 32 0.0 351 GaPPIX (uM)0 2 8 32 GaPPIX (uM) 352 Figure 6. A heme analog reduces C. burnetii growth. (A) Bacteria growing in ACCM-2 were 353 exposed to concentrations of GaPPIX shown in x-axis and were quantified using PicoGreen at 354 8h, 24h, 48h, and 72h post-treatment. Data shown are mean fluorescence intensity (± SE) 355 compared to the vehicle control (0nM). Statistical significance was analyzed using two-way 356 repeated measures ANOVA followed by Dunnett's test (n=5). (B) At 72h post-treatment, 357 bacterial growth within THP-1 cells was quantified using qPCR. Data shown as mean fold 358 difference (± SE) compared to control. Statistical significance was analyzed using one-way 359 ANOVA followed by Dunnett's test (n=3). 360
361 DISCUSSION
362 Although symbiotic and pathogenic coxiellae associated with ticks are found across the globe, it
363 is not clear how pathogenesis and symbiosis evolved in this group of bacteria. Here we show that
364 CLEOA, a soft-tick symbiont, and C. burnetii, a human pathogen, evolved recently from a
365 common ancestor that contained genes necessary to infect macrophages. Additionally, while
366 HGT contributed significantly to the evolution of C. burnetii, it occurred in ancestors prior to
19 367 the divergence of CLEOA and C. burnetii lineages. These discoveries clarify the evolution of C.
368 burnetii, which previously was thought to have evolved from an avirulent tick endosymbiont by
369 gaining virulence factors via HGT. We further show that the pathogenic ancestry of C. burnetii
370 and CLEOA fits into a general pattern of tick endosymbionts originating from erstwhile
371 pathogens, thereby revealing the ability of ticks to convert pathogenic bacteria into beneficial
372 endosymbionts. Lastly, by comparing the genomes of C. burnetii and CLEOA, we were able to
373 gain new insights into the intracellular biology of both bacteria, and show that metabolic
374 pathways retained only in the human pathogen are promising targets for the development of new
375 treatments against Q fever.
376 Emergence of tick-symbionts from virulent ancestors. Coxiella species related to CLEs infect a
377 wide range of animals (Shivaprasad et al. 2008; Woc-Colburn et al. 2008; Angelakis et al. 2016;
378 Seo et al. 2016; Guimard et al. 2017; Elliman and Owens 2020; Needle et al. 2020), but these
379 infectious strains are not the closest relatives of C. burnetii; instead, the human pathogen’s closest
380 relative is the soft-tick symbiont CLEOA. Akin to the CLEOA-C. burnetii relationship, CRt,
381 the endosymbiont in R. turanicus, is closely related to pathogenic Coxiella (termed “Candidatus
382 Coxiella massiliensis”) isolated from human skin infections, and a strain of Coxiella isolated from
383 horse blood is closely related to CLEs present in Haemaphysalis ticks (Angelakis et al. 2016; Seo
384 et al. 2016; Guimard et al. 2017). In addition to these pathogens, bacteria related to CLEs have
385 repeatedly caused fatal bird and crayfish infections (Fig. 3; Shivaprasad et al. 2008; Woc-
386 Colburn et al. 2008; Elliman and Owens 2020; Needle et al. 2020). Microscopic and histological
387 data from avian infections demonstrated that the bacteria have the ability to generate CCV-like
388 compartments within macrophages, and both avian and human skin infection strains have
20 389 “small-cell” and “large-cell” morphologies — two distinct characteristics of C. burnetii —
390 suggesting that the bacteria are genuine vertebrate pathogens (Shivaprasad et al. 2008; Woc-
391 Colburn et al. 2008; Angelakis et al. 2016; Guimard et al. 2017; Needle et al. 2020). Further
392 research, including sequencing their genomes, is required to elucidate the biology and
393 pathogenicity of these infective strains and to understand why only one, i.e., C. burnetii, among
394 several virulent lineages have evolved into a bona fide human pathogen.
395 Tick endosymbionts are ephemeral. Phylogenies of only a few CLEs are congruent with those of
396 their hosts (Duron et al. 2017; Binetruy et al. 2020), probably because older CLEs get replaced
397 by newer CLEs derived from distantly-related coxiellae. In a similar fashion, FLEs seem to have
398 replaced older CLEs in several tick lineages (Gerhart et al. 2016, 2018; Duron et al. 2017,
399 2018b). This ephemeral nature of CLEs is surprising because hematophagic arthropods typically
400 need a reliable partner to gain nutrients that are scarce in vertebrate blood. Insects such as
401 bedbugs and body lice that face similar nutrient scarcity have evolved stable long-term
402 relationships with endosymbionts (Perotti et al. 2007; Hosokawa et al. 2010). It is not clear why
403 that is not the case in ticks, but one possibility is that ticks do not need to establish long-term
404 relationships because they frequently encounter bacteria that are predisposed to becoming
405 nutrient-provisioning endosymbionts. For instance, both Coxiella and Francisella have the
406 metabolic wherewithal to provide nutrients missing in vertebrate blood, thereby improving tick
407 fitness and promoting the establishment of symbiotic relationships. Additionally, both bacteria
408 are capable of infecting tick ovaries, which ensures their vertical transfer, and thus the
409 maintenance of symbiotic relationships. Furthermore, tick behavior such as hyperparasitism,
410 wherein an unfed tick steals blood by piercing the integument of a newly-fed tick, could promote
21 411 the horizontal spread and eventual establishment of a new endosymbiont in a population (Helmy
412 et al. 1983; Labruna et al. 2007; Llanos-Soto et al. 2019). The plentiful supply of potential
413 endosymbionts in its environment could also obviate the need for ticks to evolve specialized
414 symbiont-carrying organs (Perotti et al. 2007; Hosokawa et al. 2010, 2012; Rio et al. 2012;
415 Simonet et al. 2018; Husnik et al. 2020), thereby explaining the absence bacteriomes in ticks.
416 Alternatively, malpighian tubules, which extends into most parts of a tick’s body cavity, and to
417 which CLEs and FLEs show strong tissue tropism, could be functioning as a stand-in for
418 bacteriome (Klyachko et al. 2007; Machado-Ferreira et al. 2011; Lalzar et al. 2014; Duron et al.
419 2018b).
420 Another reason for the unstable nature of CLE-tick relationships could be that the
421 constant turnover of endosymbionts protects ticks from being dependent on an endosymbiont
422 with reduced nutrient-provisioning capability. For instance, when a free-living bacterium evolves
423 into an obligate endosymbiont, its genome undergoes massive gene loss due to a combination of
424 factors, including reduced selection to maintain superfluous genes, genetic drift and possibly
425 adaptive evolution (Moran 1996; Clayton et al. 2016). This degenerative evolution, as observed
426 in CLEOA, is a convergent feature of heritable symbionts and could result in the irreversible loss
427 of fitness-improving genes, including genes that form the primary basis for symbiosis (Lambert
428 and Moran 1998; Andersson and Andersson 1999; McCutcheon and Moran 2010). If no
429 compensatory mechanisms are undertaken, this progressive gene loss could result in the
430 extinction of both the tick and the endosymbiont. One strategy to avoid this fate is to
431 periodically replace resident endosymbionts with new bacteria that have the needed metabolic
432 capabilities (Russell et al. 2017; Bennett and Moran 2020).
433 Gaining new symbionts via horizontal transmission could also protect ticks from
22 434 becoming dependent on a degraded endosymbiont. While the mechanistic details of this process
435 are not understood, phylogenetic patterns of CLE and FLE distribution strongly indicate the
436 occurrence of horizontal transmission of bacteria between ticks (Gerhart et al. 2016, 2018;
437 Duron et al. 2017; Binetruy et al. 2020). It is not clear whether there is direct tick-to-tick
438 transfer or whether unrelated ticks acquire similar bacteria from common environmental sources
439 (Duron et al. 2018a). In either case, tick-endosymbiont relationships appear to be much more
440 short-lived and dynamic than most insect-endosymbiont relationships. It should be noted
441 however that not all tick-endosymbionts are short-lived. Ticks that carry CLEs belonging to
442 Clade D (Fig. 3) appear to have established long-term relationships with their endosymbionts.
443 For instance, CLEs in A. americanum and A. sculptum have highly reduced (~0.60 Mbp) genomes
444 that are similar in size to Buchnera, which established its symbiosis with aphids more than 200
445 million years ago (Moran et al. 1993). Putting all this information together, it appears that a
446 combination of vertical inheritance, horizontal transmission, and periodic replacement of old
447 symbionts with new pathogen-derived symbionts underlies the complex distribution pattern of
448 endosymbionts observed in ticks (Gottlieb et al. 2015; Smith et al. 2015; Gerhart et al. 2016,
449 2018; Duron et al. 2017; Tsementzi et al. 2018; Binetruy et al. 2020).
450
451 Functions of CLEs and FLEs. While the exact functions of FLEs and CLEs have not been fully
452 characterized, previous studies have shown that they are often the sole bacterium present in long-
453 term laboratory tick colonies, an indication that the bacteria are essential to ticks’ wellbeing. In
454 addition, removal of the resident endosymbiont via antibiotic treatment reduced tick fitness,
455 which was reversed when ticks were provided with B vitamins (Zhong et al. 2007; Smith et al.
456 2015; Gerhart et al. 2016; Guizzo et al. 2017; Zhang et al. 2017; Duron et al. 2018b; Li et al.
23 457 2018; Ben-Yosef et al. 2020). Our genome analyses support a B vitamin-provisioning role for
458 tick endosymbionts because the genes required to synthesize B7 (biotin), B9 (folic acid), and B2
459 (riboflavin), and three coenzymes (FAD, CoA, and lipoic acid) are conserved in all CLEs and
460 FLEs. Future experiments should clarify whether any or all of these nutrients form the basis for
461 the CLE-tick and FLE-tick symbioses.
462
463 Pathogen-specific metabolic processes are potential targets to control Q fever. Genetic and
464 physiological capabilities accumulated by a bacterium are critical to its ability to adapt to new
465 environments, especially ones such as intra-macrophage vacuoles that do not facilitate the gain of
466 new genes via HGT. In accordance with this idea, our analyses showed that virulence factors and
467 metabolic genes utilized by C. burnetii to grow within CCV were present in the common
468 ancestor of C. burnetii and CLEOA. Befitting its obligate endosymbiotic lifestyle, many of these
469 genes have become non-functional in CLEOA, allowing us to identify metabolic processes that
470 are likely critical to C. burnetii’s intracellular growth. One metabolite that is exclusively produced
471 by the pathogen is heme, the iron-protoporphyrin required for oxidative phosphorylation, among
472 other functions. To test the importance of heme to C. burnetii, we exposed the bacterium to
473 GaPPIX, a Ga(III) complex of protoporphyrin IX. Ga(III) inhibits bacterial growth because it
474 binds to biological complexes that normally binds to Fe(III), but under physiological conditions
475 Ga(III) is not reduced to Ga(II), thereby disrupting essential redox-driven biological processes
476 (Bernstein 1998). We chose GaPPIX over other gallium-based formulations because it could
477 replace heme in cytochromes, is known to be bactericidal, and is not toxic to primary human
478 fibroblasts and established cell lines (Stojiljkovic et al. 1999; Arivett et al. 2015; Hijazi et al.
479 2018). C. burnetii lacks homologs of known heme transporters (Moses et al. 2017), hence, it is
24 480 not clear how GaPPIX enters into the pathogen, but our growth assays clearly demonstrated that
481 the heme analog is very effective at inhibiting both axenic and intracellular growth of C. burnetii
482 (Fig. 6). Encouragingly, a recent human trial showed that Ga could improve lung function in
483 people with cystic fibrosis and chronic Pseudomonas aeruginosa lung infections, and that the
484 molecule worked synergistically with other antibiotics to inhibit bacterial growth (Goss et al.
485 2018). Thus, Ga, which has been approved by FDA for intravenous administration (Bonchi et al.
486 2014), and its derivatives such as GaPPIX, hold great promise as new therapeutic tools. In
487 conclusion, this study provide novel insights into the evolution and biology of pathogenesis and
488 symbiosis in Coxiella, and demonstrate that the new information could be applied to developing
489 innovative therapeutic strategies.
490
491 METHODS
492 Genome sequencing and assembly. An O. amblus female, collected from soil underneath rocks
493 near a Spheniscus humbolti (Humboldt penguin) nesting area in Isla Grande de Atacama, Chile,
494 was identified as described in Clifford et al. 1980. DNA was extracted from the tick using
495 DNeasy Blood & Tissue kit (Qiagen) and was submitted to Yale Center for Genome Analysis
496 for Illumina (NovaSeq) sequencing. The resulting 150 bp paired-end reads were trimmed using
497 Trimmomatic resulting in approximately 220 million read pairs of suitable quality (Bolger et al.
498 2014). The reads were assembled into contigs using metaSPAdes (Nurk et al. 2017), and open
499 reading frames (ORFs) were identified using Prodigal (Hyatt et al. 2010). Contigs containing
500 Coxiella genes were tentatively identified using BLASTn and BLASTp by comparing to a
501 database of all publicly available sequences from Coxiellacea members. CONCOCT (Alneberg
25 502 et al. 2014) was used for binning contigs based on coverage and k-mer composition, and these
503 finding were merged with BLAST-based binning results. Approximately 20 million paired reads
504 that mapped to contigs identified as containing Coxiella genes were used for a final metaSPAdes
505 assembly resulting in a total of 101 contigs. The final collection of contigs was verified using
506 hmmsearch (Potter et al. 2018) to identify essential single-copy genes (Albertsen et al. 2013), as
507 well as RNammer (Lagesen et al. 2007) and tRNAscan-SE (Chan and Lowe 2019) to identify
508 ribosomal and transfer RNAs, respectively. We were unable to stitch the contigs together into a
509 closed chromosome because the contigs were filled with repeat sequences; however, the presence
510 of 106 out of 111 highly conserved single-copy genes in both CLEOA and C. burnetii indicate
511 that most of the CLEOA genome is represented in the assembled contigs. The final set of 101
512 contigs were submitted to NCBI (accession VFIV00000000) and annotated using the
513 Prokaryotic Genome Pipeline.
514
515 Phylogenetic analysis. Orthofinder (Emms and Kelly 2015) was utilized to identify 205 single-
516 copy genes present in 52 representative species from the order Legionellales (Supplemental
517 Tables S1, S13) in order to build the comprehensive phylogenomic tree Supplemental Figure S3.
518 A subset of 117 genes conserved in 30 species (Supplemental Tables S1, S13) were used to
519 generate Figure 1. For both trees, nucleotide sequences were aligned individually using global
520 MAFFT (Katoh and Standley 2013) and were then concatenated. GBlocks (Talavera and
521 Castresana 2007) was used to cull ambiguously aligned regions and jModelTest2 (Darriba et al.
522 2012) was used to select the appropriate model (GTR+I+G). Maximum Likelihood trees were
523 generated using RaxML and Bayesian trees were produced using MrBayes (Ronquist et al. 2012;
524 Stamatakis 2014). The 16s rDNA trees were built using the same process as above, with the final
26 525 tree based on 1203 nucleotide positions, and nodes with less than 70% support collapsed. To
526 confirm the HGT origin of Node 5 genes, homologs were identified via BLASTp (NCBI nr
527 database, e-value ≤10e-5, identity ≥30% identity, coverage ≥70%). The nucleotide sequences of
528 the homologs were collected into a database, and the Phylomizer pipeline
529 (https://github.com/Gabaldonlab/phylomizer) was used to generate individual Maximum
530 Likelihood trees using the 75 most closely related homolog sequences. Each tree was then
531 compared to an NCBI Taxonomy-based tree to validate HGT (Supplemental Fig. S1).
532
533 Determination of nodes of gene origin. The presence of functional homologs of C. burnetii
534 RSA493 (AE016828.3) genes in other members of the order Legionellales was determined using
535 BLASTp (identity ≥30%, coverage ≥70%, e-value ≤10e-5), and pseudogenized homologs were
536 detected using tBLASTn (identity ≥30%, coverage ≥50%, e-value ≤10e-5). The presence/absence
537 profile was utilized in Gain and Loss Mapping Engine (GLOOME) (Cohen et al. 2010) to
538 determine the posterior probability of each gene’s presence at nodes N1-N5. For each C. burnetii
539 gene, the node of origin was marked as the oldest node at which posterior probability was ≥0.7,
540 with all subsequent nodes also having posterior probability of ≥0.7, as described previously (Peer
541 and Margalit 2014).
542
543 Calculation of CAI and pI. We identified 22 highly conserved single-copy protein-coding genes
544 in C. burnetii that were highly expressed in both ACCM-2 and within human macrophages
545 based on previous RNA-seq data (Supplemental Table S5; Warrier et al. 2014; Wachter et al.
546 2019). CodonW (http://codonw.sourceforge.net) was used to generate CAI values for the 22
547 genes in order to generate a model for optimal codon usage in C. burnetii, which was then
27 548 compared to CAI values of sets of genes acquired at each node. All 22 genes used to build the
549 model belonged to Node 1, and were not included in this analysis. Potentially spurious genes
550 (n=224) that did not have any detectable homologs outside of C. burnetii, as well as genes with
551 undetermined nodes of origin (n=44) were excluded from this analysis (Supplemental Table
552 S14). Isoelectric points (pI) for all proteins in CLEOA and C. burnetii RSA493 (AE016828.3)
553 were calculated using IPC (Kozlowski 2016).
554
555 GaPPIX susceptibility assay
556 A 10mM GaPPIX (Frontier Scientific) solution was prepared in dimethyl sulfoxide (DMSO)
557 and was stored at 4°C under dark conditions until further use. C. burnetii was cultured in
7 558 ACCM-2 for 2 days at 37°C, 5% CO2 and 2.5% O2, and ~2x10 genome equivalents were
559 resuspended in fresh ACCM-2 containing 125nM, 250nM, 500nM, 1mM, 2mM, 4mM, or
560 8mM GaPPIX in 96-well black-bottom microplates (Greiner Bio-One). Bacterial growth was
561 measured using PicoGreen (Invitrogen) as described previously (Moses et al. 2017).
562 THP-1 human monocytes (ATCC, TIB-202) were cultured in sterile RPMI-1640
563 medium (Gibco) supplemented with 1mM sodium pyruvate, 0.05 mM beta-mercaptoethanol,
564 1% Pen-Strep, and 4500 mg/L glucose with 10% heat-inactivated fetal bovine serum (FBS) at
565 37°C, 5% CO2 in 6-well tissue culture plates. Prior to infection, cells were differentiated into
566 macrophages by treating with 30 nM phorbol 12-myristate 13-acetate (PMA) for 24h, followed
567 by resting in PMA-free RPMI for 24h. Infection of THP-1 cells with C. burnetii was carried out
568 using a 7d bacterial culture at a multiplicity of infection of 25. After briefly washing the cells
569 with PBS, bacteria-containing medium was added to each well and gently centrifuged for 10
570 minutes followed by incubation at 37°C, 5% CO2 for two hours. To remove extracellular
28 571 bacteria, cells were washed three times with PBS, and replaced with antibiotic-free RPMI and
572 were incubated for 48h before treating with GaPPIX- (2uM, 8uM, and 32uM) or DMSO- (as
573 control) containing media. After 72h, cells were washed three times with PBS and intracellular
574 bacterial load was measure using qPCR, as we described previously (Moses et al. 2017). Potential
575 cytotoxicity of GaPPIX was determined by measuring the levels of released lactate
576 dehydrogenase (LDH) in cell supernatants using an LDH Cytotoxicity Assay Kit (Invitrogen).
577
578 DATA ACCESS
579 The CLEOA genome generated in this study has been submitted to the NCBI Assembly
580 database (https://www. ncbi.nlm.nih.gov/assembly/) under Whole Genome Shotgun (WGS)
581 accession prefix VFIV, BioSample SAMN12040594, and BioProject PRJNA548565.
582
583 ACKNOWLEDGEMENTS
584 We thank Samantha Fancher, Andrew Ashford and Jim Archuleta for technical help, and Daniel
585 González-Acuña for assisting in tick collection. This project was supported in part by NIH
586 grants AI123464, AI126385, and AI133023 to R.R.
587
29 588 REFERENCES
589 Albertsen M, Hugenholtz P, Skarshewski A, Nielsen KL, Tyson GW, Nielsen PH. 2013. 590 Genome sequences of rare, uncultured bacteria obtained by differential coverage binning 591 of multiple metagenomes. Nat Biotechnol 31: 533–538.
592 Almeida AP, Marcili A, Leite RC, Nieri-Bastos FA, Domingues LN, Martins JR, Labruna 593 MB. 2012. Coxiella symbiont in the tick Ornithodoros rostratus (Acari: Argasidae). Ticks 594 Tick-Borne Dis 3: 203–206.
595 Alneberg J, Bjarnason BS, de Bruijn I, Schirmer M, Quick J, Ijaz UZ, Lahti L, Loman NJ, 596 Andersson AF, Quince C. 2014. Binning metagenomic contigs by coverage and 597 composition. Nat Methods 11: 1144–1146.
598 Andersson JO, Andersson SG. 1999. Insights into the evolutionary process of genome 599 degradation. Curr Opin Genet Dev 9: 664–671.
600 Angelakis E, Mediannikov O, Jos S-L, Berenger J-M, Parola P, Raoult D. 2016. Candidatus 601 Coxiella massiliensis infection. Emerg Infect Dis 22: 285–288.
602 Arivett BA, Fiester SE, Ohneck EJ, Penwell WF, Kaufman CM, Relich RF, Actis LA. 2015. 603 Antimicrobial activity of gallium protoporphyrin IX against Acinetobacter baumannii 604 strains displaying different antibiotic resistance phenotypes. Antimicrob Agents Chemother 605 59: 7657–7665.
606 Beare PA, Gilk SD, Larson CL, Hill J, Stead CM, Omsland A, Cockrell DC, Howe D, Voth 607 DE, Heinzen RA. 2011. Dot/Icm type IVB secretion system requirements for Coxiella 608 burnetii growth in human macrophages. mBio 2: e00175-00111.
609 Beare PA, Sandoz KM, Larson CL, Howe D, Kronmiller B, Heinzen RA. 2014. Essential role 610 for the response regulator PmrA in Coxiella burnetii type 4B secretion and colonization of 611 mammalian host cells. J Bacteriol 196: 1925–1940.
612 Ben-Yosef M, Rot A, Mahagna M, Kapri E, Behar A, Gottlieb Y. 2020. Coxiella-like 613 Endosymbiont of Rhipicephalus sanguineus is required for physiological processes during 614 ontogeny. Front Microbiol 11: 493.
615 Bennett GM, Moran NA. 2015. Heritable symbiosis: The advantages and perils of an 616 evolutionary rabbit hole. Proc Natl Acad Sci U S A 112: 10169–10176.
617 Bernstein LR. 1998. Mechanisms of therapeutic activity for gallium. Pharmacol Rev 50: 665– 618 682.
30 619 Binetruy F, Buysse M, Lejarre Q, Barosi R, Villa M, Rahola N, Paupy C, Ayala D, Duron O. 620 2020. Microbial community structure reveals instability of nutritional symbiosis during 621 the evolutionary radiation of Amblyomma ticks. Mol Ecol 29: 1016–1029.
622 Bitew MA, Hofmann J, De Souza DP, Wawegama NK, Newton HJ, Sansom FM. 2020. SdrA, 623 an NADP(H)-regenerating enzyme, is crucial for Coxiella burnetii to resist oxidative 624 stress and replicate intracellularly. Cell Microbiol 22: e13154.
625 Bitew MA, Khoo CA, Neha N, De Souza DP, Tull D, Wawegama NK, Newton HJ, Sansom 626 FM. 2018. De novo NAD synthesis is required for intracellular replication of Coxiella 627 burnetii, the causative agent of the neglected zoonotic disease Q fever. J Biol Chem 293: 628 18636–18645.
629 Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence 630 data. Bioinformatics 30: 2114–2120.
631 Bonchi C, Imperi F, Minandri F, Visca P, Frangipani E. 2014. Repurposing of gallium-based 632 drugs for antibacterial therapy: Gallium-based antibacterials. BioFactors 40: 303–312.
633 Burstein D, Amaro F, Zusman T, Lifshitz Z, Cohen O, Gilbert JA, Pupko T, Shuman HA, 634 Segal G. 2016. Genomic analysis of 38 Legionella species identifies large and diverse 635 effector repertoires. Nat Genet 48: 167–175.
636 Bury-Moné S, Mendz GL, Ball GE, Thibonnier M, Stingl K, Ecobichon C, Avé P, Huerre M, 637 Labigne A, Thiberge J-M, et al. 2008. Roles of alpha and beta carbonic anhydrases of 638 Helicobacter pylori in the urease-dependent response to acidity and in colonization of the 639 murine gastric mucosa. Infect Immun 76: 497–509.
640 Chan PP, Lowe TM. 2019. TRNAscan-SE: Searching for tRNA genes in genomic sequences. 641 Methods Mol Biol 1962: 1–14.
642 Chen C, Banga S, Mertens K, Weber MM, Gorbaslieva I, Tan Y, Luo Z-Q, Samuel JE. 2010. 643 Large-scale identification and translocation of type IV secretion substrates by Coxiella 644 burnetii. Proc Natl Acad Sci U S A 107: 21755–21760.
645 Chong RA, Moran NA. 2018. Evolutionary loss and replacement of Buchnera, the obligate 646 endosymbiont of aphids. ISME J 12: 898–908.
647 Clayton AL, Jackson DG, Weiss RB, Dale C. 2016. Adaptation by deletogenic replication 648 slippage in a nascent symbiont. Mol Biol Evol 33: 1957–1966.
649 Clifford CM, Hoogstraal H, Radovsky FJ, Stiller D, Keirans JE. 1980. Ornithodoros 650 (Alectorobius) amblus (Acarina: Ixodoidea: Argasidae): identity, marine bird and human 651 hosts, virus infections, and distribution in Peru. J Parasitol 66: 312-323.
31 652 Cohen O, Ashkenazy H, Belinky F, Huchon D, Pupko T. 2010. GLOOME: gain loss mapping 653 engine. Bioinformatics 26: 2914–2915.
654 Coleman J. 1990. Characterization of Escherichia coli cells deficient in 1-acyl-sn-glycerol-3- 655 phosphate acyltransferase activity. J Biol Chem 265: 17215–17221.
656 Cox HR. 1938. A filter-passing infectious agent isolated from ticks: III. Description of organism 657 and cultivation experiments. Public Health Rep 53: 2270–2276.
658 Darriba D, Taboada GL, Doallo R, Posada D. 2012. jModelTest 2: more models, new heuristics 659 and parallel computing. Nat Methods 9: 772–772.
660 Duron O, Binetruy F, Noël V, Cremaschi J, McCoy KD, Arnathau C, Plantard O, Goolsby J, 661 Pérez de León AA, Heylen DJA, et al. 2017. Evolutionary changes in symbiont 662 community structure in ticks. Mol Ecol 26: 2905–2921.
663 Duron O, Doublet P, Vavre F, Bouchon D. 2018a. The importance of revisiting Legionellales 664 diversity. Trends Parasitol 34: 1027–1037.
665 Duron O, Gottlieb Y. 2020. Convergence of nutritional symbioses in obligate blood feeders. 666 Trends Parasitol 36: 816–825.
667 Duron O, Morel O, Noël V, Buysse M, Binetruy F, Lancelot R, Loire E, Ménard C, Bouchez 668 O, Vavre F, et al. 2018b. Tick-bacteria mutualism depends on B vitamin synthesis 669 pathway. Curr Biol 28: 1896-1902.e5.
670 Duron O, Noël V, McCoy KD, Bonazzi M, Sidi-Boumedine K, Morel O, Vavre F, Zenner L, 671 Jourdain E, Durand P, et al. 2015a. The recent evolution of a maternally-inherited 672 endosymbiont of ticks led to the emergence of the Q fever pathogen, Coxiella burnetii. 673 PLoS Pathog 11: e1004892.
674 Duron O, Sidi-Boumedine K, Rousset E, Moutailler S, Jourdain E. 2015b. The importance of 675 ticks in Q fever transmission: What has (and has not) been demonstrated? Trends 676 Parasitol 31: 536–552.
677 Eldin C, Mélenotte C, Mediannikov O, Ghigo E, Million M, Edouard S, Mege J-L, Maurin 678 M, Raoult D. 2017. From Q fever to Coxiella burnetii infection: A paradigm change. Clin 679 Microbiol Rev 30: 115–190.
680 Elliman JR, Owens L. 2020. Confirmation that candidatus Coxiella cheraxi from redclaw crayfish 681 (Cherax quadricarinatus) is a close relative of Coxiella burnetii, the agent of Q-fever. Lett 682 Appl Microbiol 71: 320–326.
683 Emms DM, Kelly S. 2015. OrthoFinder: solving fundamental biases in whole genome 684 comparisons dramatically improves orthogroup inference accuracy. Genome Biol 16: 157.
32 685 Endley S, McMurray D, Ficht TA. 2001. Interruption of the cydB locus in Brucella abortus 686 attenuates intracellular survival and virulence in the mouse model of infection. J Bacteriol 687 183: 2454–2462.
688 Gerhart JG, Dutcher HA, Brenner AE, Moses AS, Grubhoffer L, Raghavan R. 2018. Multiple 689 acquisitions of pathogen-derived Francisella endosymbionts in soft ticks. Genome Biol 690 Evol 10: 607–615.
691 Gerhart JG, Moses AS, Raghavan R. 2016. A Francisella-like endosymbiont in the Gulf Coast 692 tick evolved from a mammalian pathogen. Sci Rep 6: 33670.
693 Gilk SD. 2012. Role of lipids in Coxiella burnetii infection. Adv Exp Med Biol 984: 199–213.
694 Gilk SD, Beare PA, Heinzen RA. 2010. Coxiella burnetii expresses a functional Δ24 sterol 695 reductase. J Bacteriol 192: 6154–6159.
696 Goss CH, Kaneko Y, Khuu L, Anderson GD, Ravishankar S, Aitken ML, Lechtzin N, Zhou 697 G, Czyz DM, McLean K, et al. 2018. Gallium disrupts bacterial iron metabolism and 698 has therapeutic effects in mice and humans with lung infections. Sci Transl Med 10: 699 eaat7520.
700 Gottlieb Y, Lalzar I, Klasson L. 2015. Distinctive genome reduction rates revealed by genomic 701 analyses of two Coxiella-like endosymbionts in ticks. Genome Biol Evol 7: 1779–1796.
702 Guimard T, Amrane S, Prudent E, El Karkouri K, Raoult D, Angelakis E. 2017. Case report: 703 Scalp eschar and neck lymphadenopathy associated with bacteremia due to Coxiella-like 704 bacteria. Am J Trop Med Hyg 97: 1319–1322.
705 Guizzo MG, Parizi LF, Nunes RD, Schama R, Albano RM, Tirloni L, Oldiges DP, Vieira RP, 706 Oliveira WHC, Leite M de S, et al. 2017. A Coxiella mutualist symbiont is essential to 707 the development of Rhipicephalus microplus. Sci Rep 7: 17554.
708 Hackstadt T. 1983. Estimation of the cytoplasmic pH of Coxiella burnetii and effect of substrate 709 oxidation on proton motive force. J Bacteriol 154: 591–597.
710 Helmy N, Khalil GM, Hoogstraal H. 1983. Hyperparasitism in Ornithodoros erraticus. J Parasitol 711 69: 229–233.
712 Hijazi S, Visaggio D, Pirolo M, Frangipani E, Bernstein L, Visca P. 2018. Antimicrobial 713 activity of gallium compounds on ESKAPE pathogens. Front Cell Infect Microbiol 8: 316.
714 Hijazi S, Visca P, Frangipani E. 2017. Gallium-protoporphyrin IX inhibits Pseudomonas 715 aeruginosa growth by targeting cytochromes. Front Cell Infect Microbiol 7: 12.
33 716 Hill J, Samuel JE. 2011. Coxiella burnetii acid phosphatase inhibits the release of reactive oxygen 717 intermediates in polymorphonuclear leukocytes. Infect Immun 79: 414–420.
718 Hosokawa T, Koga R, Kikuchi Y, Meng X-Y, Fukatsu T. 2010. Wolbachia as a bacteriocyte- 719 associated nutritional mutualist. Proc Natl Acad Sci 107: 769–774.
720 Hosokawa T, Nikoh N, Koga R, Satô M, Tanahashi M, Meng X-Y, Fukatsu T. 2012. 721 Reductive genome evolution, host–symbiont co-speciation and uterine transmission of 722 endosymbiotic bacteria in bat flies. ISME J 6: 577–587.
723 Husnik F, Hypsa V, Darby A. 2020. Insect-symbiont gene expression in the midgut 724 bacteriocytes of a blood-sucking parasite. Genome Biol Evol 12: 429–442.
725 Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. 2010. Prodigal: 726 prokaryotic gene recognition and translation initiation site identification. BMC 727 Bioinformatics 11: 119.
728 Iwata-Reuyl D. 2003. Biosynthesis of the 7-deazaguanosine hypermodified nucleosides of 729 transfer RNA. Bioorganic Chem 31: 24–43.
730 Jansen R, Bussemaker HJ, Gerstein M. 2003. Revisiting the codon adaptation index from a 731 whole-genome perspective: analyzing the relationship between gene expression and 732 codon occurrence in yeast using a variety of models. Nucleic Acids Res 31: 2242–2251.
733 Katoh K, Standley DM. 2013. MAFFT multiple sequence alignment software version 7: 734 improvements in performance and usability. Mol Biol Evol 30: 772–780.
735 Klyachko O, Stein BD, Grindle N, Clay K, Fuqua C. 2007. Localization and visualization of a 736 Coxiella-type symbiont within the lone star tick, Amblyomma americanum. Appl Environ 737 Microbiol 73: 6584–6594.
738 Koh CS, Sarin LP. 2018. Transfer RNA modification and infection – Implications for 739 pathogenicity and host responses. Biochim Biophys Acta Gene Regul Mech 1861: 419–432.
740 Körner S, Makert GR, Mertens-Scholz K, Henning K, Pfeffer M, Starke A, Nijhof AM, Ulbert 741 S. 2020. Uptake and fecal excretion of Coxiella burnetii by Ixodes ricinus and Dermacentor 742 marginatus ticks. Parasit Vectors 13: 75.
743 Kozlowski LP. 2016. IPC – Isoelectric Point Calculator. Biol Direct 11: 55.
744 Kuba M, Neha N, Newton P, Lee YW, Bennett-Wood V, Hachani A, De Souza DP, Nijagal 745 B, Dayalan S, Tull D, et al. 2020. EirA is a novel protein essential for intracellular 746 replication of Coxiella burnetii. Infect Immun 88.
34 747 Labruna MB, Ahid SMM, Soares HS, Suassuna ACD. 2007. Hyperparasitism in Amblyomma 748 rotundatum (Acari: Ixodidae). J Parasitol 93: 1531–1532.
749 Lagesen K, Hallin P, Rødland EA, Stærfeldt H-H, Rognes T, Ussery DW. 2007. RNAmmer: 750 consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35: 3100– 751 3108.
752 Lalzar I, Friedmann Y, Gottlieb Y. 2014. Tissue tropism and vertical transmission of Coxiella in 753 Rhipicephalus sanguineus and Rhipicephalus turanicus ticks: Tissue tropism of tick 754 symbionts. Environ Microbiol 16: 3657–3668.
755 Lambert JD, Moran NA. 1998. Deleterious mutations destabilize ribosomal RNA in 756 endosymbiotic bacteria. Proc Natl Acad Sci U S A 95: 4458–4462.
757 Lawrence JG, Ochman H. 1997. Amelioration of bacterial genomes: rates of change and 758 exchange. J Mol Evol 44: 383–397.
759 Lerat E, Daubin V, Ochman H, Moran NA. 2005. Evolutionary origins of genomic repertoires 760 in bacteria. PLoS Biol 3: e130.
761 Li L-H, Zhang Y, Zhu D. 2018. Effects of antibiotic treatment on the fecundity of 762 Rhipicephalus haemaphysaloides ticks. Parasit Vectors 11.
763 Llanos-Soto S, Muñoz-Leal S, González-Acuña D. 2019. Hyperparasitism in the seabird tick 764 Ornithodoros amblus (Acari: Argasidae). Syst Appl Acarol 24: 525.
765 Machado-Ferreira E, Dietrich G, Hojgaard A, Levin M, Piesman J, Zeidner NS, Soares CAG. 766 2011. Coxiella symbionts in the Cayenne tick Amblyomma cajennense. Microb Ecol 62: 767 134–142.
768 Martinez E, Cantet F, Fava L, Norville I, Bonazzi M. 2014. Identification of OmpA, a Coxiella 769 burnetii protein involved in host cell invasion, by multi-phenotypic high-content 770 screening. PLoS Pathog 10: e1004013.
771 Martinez E, Huc-Brandt S, Brelle S, Allombert J, Cantet F, Gannoun-Zaki L, Burette M, 772 Martin M, Letourneur F, Bonazzi M, et al. 2020. The secreted protein kinase CstK from 773 Coxiella burnetii influences vacuole development and interacts with the GTPase- 774 activating host protein TBC1D5. J Biol Chem 295: 7391–7403.
775 Maturana P, Graham JG, Sharma UM, Voth DE. 2013. Refining the plasmid-encoded type IV 776 secretion system substrate repertoire of Coxiella burnetii. J Bacteriol 195: 3269–3276.
777 Maurin M, Raoult D. 1999. Q Fever. Clin Microbiol Rev 12: 518–553.
35 778 McCutcheon JP, Moran NA. 2011. Extreme genome reduction in symbiotic bacteria. Nat Rev 779 Microbiol 10: 13–26.
780 McCutcheon JP, Moran NA. 2010. Functional convergence in reduced genomes of bacterial 781 symbionts spanning 200 My of evolution. Genome Biol Evol 2: 708–718.
782 Mertens K, Samuel JE. 2012. Defense mechanisms against oxidative stress in Coxiella burnetii: 783 adaptation to a unique intracellular niche. Adv Exp Med Biol 984: 39–63.
784 Moran NA. 1996. Accelerated evolution and Muller’s rachet in endosymbiotic bacteria. Proc Natl 785 Acad Sci U S A 93: 2873–2878.
786 Moran NA. 2002. Microbial minimalism: genome reduction in bacterial pathogens. Cell 108: 787 583–586.
788 Moran NA, Munson MA, Baumann P, Ishikawa H. 1993. A molecular clock in endosymbiotic 789 bacteria is calibrated using the insect hosts. Proc Biol Sci 253: 167–171.
790 Moses AS, Millar JA, Bonazzi M, Beare PA, Raghavan R. 2017. Horizontally acquired 791 biosynthesis genes boost Coxiella burnetii’s physiology. Front Cell Infect Microbiol 7: 174.
792 Nakabachi A, Yamashita A, Toh H, Ishikawa H, Dunbar HE, Moran NA, Hattori M. 2006. 793 The 160-kilobase genome of the bacterial endosymbiont Carsonella. Science 314: 267.
794 Needle DB, Agnew DW, Bradway DS, Nordhausen RW, Garner MM. 2020. Avian coxiellosis 795 in nine psittacine birds, one black-browed barbet, and one paradise tanager. Avian Pathol 796 J WVPA 49: 268–274.
797 Newton HJ, Kohler LJ, McDonough JA, Temoche-Diaz M, Crabill E, Hartland EL, Roy CR. 798 2014. A screen of Coxiella burnetii mutants reveals important roles for Dot/Icm effectors 799 and host autophagy in vacuole biogenesis. PLOS Pathog 10: e1004286.
800 Neyrolles O, Wolschendorf F, Mitra A, Niederweis M. 2015. Mycobacteria, metals, and the 801 macrophage. Immunol Rev 264: 249–263.
802 Nurk S, Meleshko D, Korobeynikov A, Pevzner PA. 2017. metaSPAdes: a new versatile 803 metagenomic assembler. Genome Res 27: 824–834.
804 Ochman H, Lawrence JG, Groisman EA. 2000. Lateral gene transfer and the nature of bacterial 805 innovation. Nature 405: 299–304.
806 Omsland A, Heinzen RA. 2011. Life on the outside: the rescue of Coxiella burnetii from its host 807 cell. Annu Rev Microbiol 65: 111–128.
36 808 Peer A, Margalit H. 2014. Evolutionary patterns of Escherichia coli small RNAs and their 809 regulatory interactions. RNA 20: 994–1003.
810 Potter SC, Luciani A, Eddy SR, Park Y, Lopez R, Finn RD. 2018. HMMER web server: 2018 811 update. Nucleic Acids Res 46: W200–W204.
812 Perotti MA, Allen JM, Reed DL, Braig HR. 2007. Host-symbiont interactions of the primary 813 endosymbiont of human head and body lice. FASEB J 21: 1058–1066.
814 Raghavan R, Kelkar YD, Ochman H. 2012. A selective force favoring increased G+C content in 815 bacterial genes. Proc Natl Acad Sci U S A 109: 14504–14507.
816 Rio RVM, Symula RE, Wang J, Lohs C, Wu Y-N, Snyder AK, Bjornson RD, Oshima K, Biehl 817 BS, Perna NT, et al. 2012. Insight into the transmission biology and species-specific 818 functional capabilities of tsetse (Diptera: glossinidae) obligate symbiont Wigglesworthia. 819 MBio 3: 00240-11.
820 Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, 821 Suchard MA, Huelsenbeck JP. 2012. MrBayes 3.2: efficient Bayesian phylogenetic 822 inference and model choice across a large model space. Syst Biol 61: 539–542.
823 Rowland JL, Niederweis M. 2012. Resistance mechanisms of Mycobacterium tuberculosis against 824 phagosomal copper overload. Tuberc Edinb Scotl 92: 202–210.
825 Rowlett VW, Mallampalli VKPS, Karlstaedt A, Dowhan W, Taegtmeyer H, Margolin W, 826 Vitrac H. 2017. Impact of membrane phospholipid alterations in Escherichia coli on 827 cellular function and bacterial stress adaptation. J Bacteriol 199: e00849-16.
828 Russell JA, Oliver KM, Hansen AK. 2017. Band-aids for Buchnera and B vitamins for all. Mol 829 Ecol 26: 2199–2203.
830 Saiki K, Mogi T, Anraku Y. 1992. Heme O biosynthesis in Escherichia coli: the cyoE gene in the 831 cytochrome bo operon encodes a protoheme IX farnesyltransferase. Biochem Biophys Res 832 Commun 189: 1491–1497.
833 Samanta D, Clemente TM, Schuler BE, Gilk SD. 2019. Coxiella burnetii type 4B secretion 834 system-dependent manipulation of endolysosomal maturation is required for bacterial 835 growth. PLoS Pathog 15: e1007855.
836 Sanchez SE, Omsland A. 2020. Critical role for molecular iron in Coxiella burnetii replication 837 and viability. mSphere 5: e00458-20.
838 Sandoz KM, Popham DL, Beare PA, Sturdevant DE, Hansen B, Nair V, Heinzen RA. 2016. 839 Transcriptional profiling of Coxiella burnetii reveals extensive cell wall remodeling in the 840 small cell variant developmental form. PloS One 11: e0149957.
37 841 Segal G, Russo JJ, Shuman HA. 1999. Relationships between a new type IV secretion system 842 and the icm/dot virulence system of Legionella pneumophila. Mol Microbiol 34: 799–809.
843 Seo M-G, Lee S-H, VanBik D, Ouh I-O, Yun S-H, Choi E, Park Y-S, Lee S-E, Kim JW, 844 Cho G-J, et al. 2016. Detection and genotyping of Coxiella burnetii and Coxiella-like 845 bacteria in horses in South Korea. PloS One 11: e0156710.
846 Seshadri R, Paulsen IT, Eisen JA, Read TD, Nelson KE, Nelson WC, Ward NL, Tettelin H, 847 Davidsen TM, Beanan MJ, et al. 2003. Complete genome sequence of the Q-fever 848 pathogen Coxiella burnetii. Proc Natl Acad Sci U S A 100: 5455–5460.
849 Sharp PM, Li WH. 1987. The codon adaptation index--a measure of directional synonymous 850 codon usage bias, and its potential applications. Nucleic Acids Res 15: 1281–1295.
851 Shivaprasad HL, Cadenas MB, Diab SS, Nordhausen R, Bradway D, Crespo R, Breitschwerdt 852 EB. 2008. Coxiella-like infection in psittacines and a toucan. Avian Dis 52: 426–432.
853 Simonet P, Gaget K, Balmand S, Ribeiro Lopes M, Parisot N, Buhler K, Duport G, Vulsteke V, 854 Febvay G, Heddi A, et al. 2018. Bacteriocyte cell death in the pea aphid/Buchnera 855 symbiotic system. Proc Natl Acad Sci 115: E1819–E1828.
856 Smith DJW, Derrick EH. 1940. Studies in the epidemiology of Q fever: 1. The isolation of six 857 strains of Rickettsia burneti from the tick Haemaphysalis humerosa. Aust J Exp Biol Med Sci 858 18: 1–8.
859 Smith TA, Driscoll T, Gillespie JJ, Raghavan R. 2015. A Coxiella-like endosymbiont is a 860 potential vitamin source for the Lone Star tick. Genome Biol Evol 7: 831–838.
861 Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of 862 large phylogenies. Bioinformatics 30: 1312–1313.
863 Stead CM, Cockrell DC, Beare PA, Miller HE, Heinzen RA. 2018. A Coxiella burnetii 864 phospholipase A homolog pldA is required for optimal growth in macrophages and 865 developmental form lipid remodeling. BMC Microbiol 18: 33.
866 Stojiljkovic I, Kumar V, Srinivasan N. 1999. Non-iron metalloporphyrins: potent antibacterial 867 compounds that exploit haem/Hb uptake systems of pathogenic bacteria. Mol Microbiol 868 31: 429–442.
869 Talavera G, Castresana J. 2007. Improvement of phylogenies after removing divergent and 870 ambiguously aligned blocks from protein sequence alignments. Syst Biol 56: 564–577.
871 Tsementzi D, Castro Gordillo J, Mahagna M, Gottlieb Y, Konstantinidis KT. 2018. 872 Comparison of closely related, uncultivated Coxiella tick endosymbiont population
38 873 genomes reveals clues about the mechanisms of symbiosis. Environ Microbiol 20: 1751– 874 1764.
875 Vallejo Esquerra E, Yang H, Sanchez SE, Omsland A. 2017. Physicochemical and nutritional 876 requirements for axenic replication suggest physiological basis for Coxiella burnetii niche 877 restriction. Front Cell Infect Microbiol 7: 190.
878 Wachter S, Bonazzi M, Shifflett K, Moses AS, Raghavan R, Minnick MF. 2019. A CsrA- 879 binding, trans-acting sRNA of Coxiella burnetii is necessary for optimal intracellular 880 growth and vacuole formation during early infection of host cells. J Bacteriol 201: e00524- 881 19.
882 Warrier I, Hicks LD, Battisti JM, Raghavan R, Minnick MF. 2014. Identification of novel small 883 RNAs and characterization of the 6S RNA of Coxiella burnetii. PLoS One 9: e100147.
884 Wernegreen JJ, Ochman H, Jones IB, Moran NA. 2000. Decoupling of genome size and 885 sequence divergence in a symbiotic bacterium. J Bacteriol 182: 3867–3869.
886 Woc-Colburn AM, Garner MM, Bradway D, West G, D’Agostino J, Trupkiewicz J, Barr B, 887 Anderson SE, Rurangirwa FR, Nordhausen RW. 2008. Fatal coxiellosis in Swainson’s 888 Blue Mountain Rainbow Lorikeets (Trichoglossus haematodus moluccanus). Vet Pathol 45: 889 247–254.
890 Zhang C-M, Li N-X, Zhang T-T, Qiu Z-X, Li Y, Li L-W, Liu J-Z. 2017. Endosymbiont 891 CLS-HI plays a role in reproduction and development of Haemaphysalis longicornis. Exp 892 Appl Acarol 73: 429–438.
893 Zhong J, Jasinskas A, Barbour AG. 2007. Antibiotic treatment of the tick vector Amblyomma 894 americanum reduced reproductive fitness. PLoS One 2: e405.
895
39 896 SUPPLEMENTAL FIGURE LEGENDS
897 Supplemental Figure S1. HGT contributed to C. burnetii evolution. 88 Node 5 genes have
898 phylogenies (left) inconsistent with established taxonomic relationships (right), indicating that
899 they were acquired via HGT. Branches and species titles are colored by taxonomy as specified on
900 Page 1. (A) Maximum likelihood gene trees with bootstrap values labeled at branchpoints. C.
901 burnetii RSA493 genes are indicated by their locus tags and are highlighted in blue. Duplicate
902 genes are shown together in single trees. (B) Taxonomy-based trees depict the established
903 relationships among the taxa featured in A. Taxa sharing the same NCBI taxonomic identifier
904 number are displayed only once even if multiple representatives are present in A.
905
906 Supplemental Figure S2. GaPPIX is not toxic to macrophages at levels that significantly inhibit
907 C. burnetii growth. At 24h post-GaPPIX treatment of THP-1 derived macrophages, lactate
908 dehydrogenase (LDH) activity was determined by measuring the level of resorufin formation
909 using an LDH cytotoxicity assay. The cytotoxicity was reported as the percentage LDH released
910 compared to the maximum LDH activity. Data are shown as mean ± SEM percentage LDH
911 released, and statistical significance was analyzed using one-way ANOVA followed by Dunnett's
912 test (n=3).
913
914 Supplemental Figure S3. Legionellales phylogenomic tree. Maximum likelihood and Bayesian
915 trees were generated using 205 single-copy protein-coding genes conserved in all represented
916 taxa (Supplemental Tables S1, S13). Bootstrap support and posterior probabilities are labeled
917 above and below each node respectively. This tree was used in GLOOME (Cohen et al. 2010) to
918 determine the nodes of gene origin.
40