bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

1 Evolution of the cytochrome-bd type oxygen reductase superfamily and the

2 function of cydAA’ in

3 Ranjani Murali1*, Robert B. Gennis2, James Hemp3* 4 1Division of Biology and Biological Engineering, Institute of Technology 5 2 Department of Biochemistry, University of Illinois, Urbana-Champaign 6 3School of Medicine, University of Utah 7 *Correspondence can be addressed to: [email protected], [email protected] 8 9 Abstract

10 Cytochrome bd-type oxygen reductases (cytbd) belong to one of three enzyme superfamilies that catalyze

11 oxygen reduction to water. They are widely distributed in Bacteria and Archaea, but the full extent of

12 their biochemical diversity is unknown. Here we used phylogenomics to identify 3 families and several

13 subfamilies within the cytbd superfamily. The core architecture shared by all members of the superfamily

14 consists of four transmembrane helices that bind two active site hemes, which are responsible for oxygen

15 reduction. While previously characterized cytochrome bd-type oxygen reductases use quinol as an

16 electron donor to reduce oxygen, sequence analysis shows that only one of the identified families has a

17 conserved quinol binding site. The other families are missing this feature, suggesting that they use an

18 alternative electron donor. Multiple gene duplication events were identified within the superfamily,

19 resulting in significant evolutionary and structural diversity. The CydAA’ cytbd, found exclusively in

20 Archaea, is formed by the co-association of two superfamily paralogs. We heterologously expressed

21 CydAA’ from maquilingensis and demonstrated that it performs oxygen reduction with quinol

22 as an electron donor. Strikingly, CydAA’ is the first isoform of cytbd containing only b-type hemes

23 shown to be active when isolated, demonstrating that oxygen reductase activity in this superfamily is not

24 dependent on heme d.

25 Introduction

Murali et al. ISMEJ-21-00087A 1

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

26 The predominance of oxygen in our atmosphere determines the bioenergetic importance

27 of oxygen as an electron acceptor and the prevalence of aerobic respiratory chains. There are

28 only three enzyme superfamilies capable of acting as terminal respiratory oxygen reductases -

29 heme-copper oxygen reductases, alternative oxidases and cytochrome bd-type oxygen reductases

30 (cytbd)1. While enzymes from this superfamily have been characterized from a number of

31 Bacteria, their role in archaeal respiration has not yet been determined. Archaeal aerobic

32 respiratory chains share some similarities with bacterial respiratory chains, however they often

33 differ in their composition of respiratory enzymes and are adapted to use different cofactors such

34 as methanophenazine and F420. Complexes that are typically involved in bacterial respiration

2 3 35 such as succinate-quinone oxidoreductases , cytochrome bc1 complexes and heme-copper

36 oxygen reductases from archaea have been previously characterized4,5, while NADH:quinone

37 oxidoreductases and alternative complex III are absent from this domain6. The presence of

38 cytochrome bd-type oxygen reductases has been noted in archaeal genomes7, metagenomes and

39 metaproteomes8–11 but, no functional member of the cytbd superfamily in archaea has ever been

40 demonstrated.

41 Cytochrome bd-type oxygen reductase is a respiratory enzyme that converts oxygen to

42 water using three hemes, unlike the heme-copper oxygen reductases which have two hemes and

43 a copper in the active sites1. Purified cytbd accepts electrons from quinols using a low-spin heme

44 b558 and transfers these electrons to a di-heme active site containing two high-spin hemes. In

45 some of the characterized cytochrome bd enzymes, these active site hemes were shown to be

46 heme b595 and heme d, but some other isoforms were shown to contain only hemes b. Those

47 cytochrome bd family members that contain only hemes b are usually referred to as cyanide

48 insensitive oxidases (CIO) or cytochrome bb’–type oxygen reductase, and have been identified

Murali et al. ISMEJ-21-00087A 2

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

49 in Pseudomonas aeruginosa, Bacillus subtilis and others12–15. It is unclear whether the presence

50 of only hemes b has a physiological implication but it has been suggested that these enzymes are

51 less sensitive to inhibition by cyanide13. There is no sequence signature that distinguishes those

52 enzymes in the superfamily that only contain heme b. No CIO has ever been isolated and

53 characterized.

54 The canonical cytochrome bd oxygen reductases contain a minimum of two subunits,

55 cydA and cydB, but often contain additional “auxiliary” subunits 16–18 such as CydX, a single-

56 transmembrane subunit that is associated with cytochrome bd-I from E. coli that has been

57 implicated in the stability of the enzyme19. Cytochrome bd-type oxygen reductases have a high

58 affinity for oxygen20 and the previously characterized cytbds have been associated with roles in

59 oxygen detoxification, respiratory protection of nitrogenases and as part of sulfide oxidizing

60 respiratory chains21–25. Cytochrome bd catalytic turnover generates a proton motive force by

61 translocation of protons using a conserved proton channel from the cytoplasm to the site of

62 oxygen reduction located near the periplasmic side (electrically positive) of the membrane26. Yet,

63 cytochrome bd is not as energetically efficient as the heme-copper oxygen reductases which

64 pump protons in addition to translocating “chemical” protons from the cytoplasm to the

65 periplasmic active site27,28. Expression of cytochrome bd has often been associated with

66 microoxic conditions where a high-affinity oxygen reductase would be required29.

67 In this work, we used phylogenomics to determine the diversity and distribution of this

68 high affinity oxygen reductase in Archaea and Bacteria. We determined that there are three

69 distinct families of cytbd – one of which contained the quinol binding characteristics present in

70 the structures of cytbd from Escherichia coli and Geobacillus thermodenitrificans30–32 and two

71 which do not – and discussed their evolutionary relationships. The distribution of these families

Murali et al. ISMEJ-21-00087A 3

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

72 even within Archaea involve significant variation and include the two distinct isoforms CydAB

73 and CydAA’, the latter of which appears to have been created by gene duplication. We evaluate

74 the relative distribution of the CydAA’ and CydAB within the archaea and consider the

75 likely role of CydAA’ variants within their ecological context. In addition, we show that the

76 CydAA’ from Caldivirga maquilingensis is a highly active oxygen reductase with unique

77 biochemical and structural characteristics. This combined phylogenomic and experimental

78 approach has significantly expanded our knowledge of the evolutionary and biochemical

79 diversity within the superfamily, which has important implications for the role of the cytbd

80 superfamily in novel respiratory pathways.

81 Results

82 Diversity of cytochrome bd-type oxygen reductases

83 The molecular structures of cytochrome bd-type oxygen reductases from Escherichia coli

84 and Geobacillus thermodenitrificans have been determined, and showed that cytbd typically has

85 two conserved subunits cydA and cydB, along with a third subunit, cydX or cydS which is a

86 single transmembrane subunit that is not well conserved or found along with cydA and cydB in

87 the genome30–33. Of the two main subunits, cydA is better conserved in all known cytochrome

88 bd-type oxygen reductases while cydB is very divergent and is hypothesized to have evolved at

89 faster rates than cydA34. The cydA subunit is made of nine transmembrane helices and contains

90 almost all the conserved amino acids known to be important for catalyzing oxygen reduction and

91 proton translocation, including the ligands for three hemes and the residues forming a proton

92 channel35,36. The first four helices of cydA contain all of the amino acids that form the active site.

93 These include the proton channel and ligands to bind the active site heme b595 and heme d (these

Murali et al. ISMEJ-21-00087A 4

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

94 ligands have only been verified in the isoforms containing hemes b and d, and not the ones

95 containing only hemes b). The other five helices (V- IX) form the quinol binding site in the

96 biochemically characterized bd-type oxygen reductases and include the ligands to heme b558, the

97 point of entry for electrons from quinols30–32. With this structural framework in mind, we

98 performed a sequence analysis of cytochrome bd-type oxygen.

99 An analysis of 24706 genomes available in the Genome Database

100 (release89)37,38, revealed the presence of 17852 cydA homologs. Of these, 13007 genomes

101 contained at least one cydA homolog, suggesting that this enzyme family is widely distributed

102 and important (Supplementary Table1). Phylogenomic analysis of cydA homologs revealed 15

103 clades of cydA that could be distinguished on the basis of unique sequence characteristics.1

104 (Supplementary Figure S1, Supplementary Tables1,2). Four of these clades contain the

105 features that are considered part of the quinol binding site – for e.g., conserved residues Lys252,

106 Glu257 (E.coli cydA numbering) while the remaining do not. We inferred that the former four

107 cydA clades were quinol:O2 oxidoreductases and we named them qOR1, qOR2, qOR3 and

108 qOR4a. While cydA of the families qOR1, qOR2 and qOR3 associate with cydB to form cydAB,

109 cydAA’ is formed by the co-association of two distinct cydA clades, qOR4a and qOR4b. qOR4b

110 does not possess quinol binding site characteristics and is likely the result of a gene duplication

111 event. Phylogenetic clustering of cydA sequences from all 15 cydA clades demonstrated that 2 of

112 the remaining clades are missing quinol binding sites and instead contain a number of heme c

113 binding motifs (CxxCH) (Supplementary figure S1, Supplementary multiple sequence

114 alignments MSA1, MSA3). We named these enzymes OR-C1a and OR-C1b because of the

115 presence of heme c binding motifs. Similar to qOR4a/4b, the ‘a’ and ‘b’ attachment to the names

116 signifies that their genomic context suggests that they co-associate to form one enzyme OR-C1.

Murali et al. ISMEJ-21-00087A 5

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

117 Eight of the remaining cydA clades were related and named OR-N1, OR-N2, OR-N3a, OR-N3b,

118 OR-N4a, OR-N4b, OR-N5a and OR-N5b. OR-N is named for Nitrospirota because of

119 predominance of these enzymes in that phylum (Supplementary Figure S1, Supplementary

120 multiple sequence alignments MSA1, MSA2, MSA3). Their close relationship is also

121 supported by the likely structure of the of which they are a part and their genomic

122 context (Figure 1). We have attempted to develop a nomenclature for the cytochrome bd-type

123 oxygen reductase family that can be easily expanded upon. We designate 3 large families of

124 cytochrome bd-type oxygen reductases – qOR, OR-C and OR-N - based on their phylogenetic

125 placement, presence/absence of biochemical signatures such as quinol and heme c binding site

126 features, genomic operon context and taxonomic origin. We have designated subfamilies

127 numerically starting from 1 and attached an ‘a’ or ‘b’ subscript if it is likely that two cydA

128 subfamilies co-associate to form one enzyme. Most of the ‘a’-type subfamilies include the proton

129 channel residues E99 and E107 (E. coli numbering) while ‘b’-type subfamilies do not. It appears

130 that ‘a’ and ‘b’-type subfamilies are also the result of multiple independent gene duplication

131 events within this superfamily. We will discuss the unusual number of gene duplication events

132 within the cytbd superfamily and the OR-C and OR-N families later in the text and in

133 Supplementary Material but begin with the quinol oxidizing qOR family. Sequences from this

134 family contain all the amino acids that were previously identified as forming the heme ligands,

135 proton channel, oxygen reduction site and quinol binding site30–32,36.

136 Evolution of the quinol-oxidizing cytochrome bd-type oxygen reductases (qOR)

137 To explore the evolutionary relationship between the families qOR1, qOR2, qOR3 and

138 qOR4a, which are true orthologs, we generated a maximum likelihood phylogenetic tree using

139 RAxML with the OR-C and OR-N family sequences as outgroup (Figure 2). Sequence features

Murali et al. ISMEJ-21-00087A 6

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

140 can be identified to distinguish these families and to validate the above identified monophyletic

141 clades as meaningfully distinct; some of which are outlined below while the remaining features

142 are mentioned in Supplementary Table 6. All cydA sequences from qOR1 subfamily have 7

143 amino acids between the two conserved glutamates in the proton channel Glu99 and Glu107 such

144 as in Escherichia coli cydA 26,39 while cydA sequences from qOR2, qOR3 and qOR4a typically

145 have 6 amino acids between the two conserved glutamates (ex. as between Glu101 and Glu108

146 in qOR3-subfamily cytbd from Geobacillus thermodenitrificans30). This insertion/deletion has

147 been hypothesized to lead to a reversal in the position of hemes from the qOR1-bd in

148 Escherichia coli to the qOR3-bd in Geobacillus thermodenitrificans although further research is

149 required to establish whether the reversal of heme positions is universal (further discussion on

150 the insertion/deletion in the proton channel and the Q-loop is included in Supplementary

151 Material). Sequence features which distinguish the qOR4a-subfamily cytbd are insertions

152 between helices V and VI, as well as insertions in helix VIII (Supplementary alignment MSA1,

153 Supplementary Table 6) Conserved tyrosines (Tyr115 and Tyr117 Geobacillus

154 thermodenitrificans cydA numbering) are present in qOR2 and qOR3 families but not in the

155 qOR4a-subfamily, which is consistent with the close evolutionary relationship between the

156 qOR2 and qOR3-subfamilies observed in the tree topology. Other conserved sequence features,

157 unique to each family are listed in Supplementary alignment MSA2 and Supplementary

158 Table 6.

159 Comparing the cydA phylogenetic tree and the distribution of cytochrome bd-type

160 oxygen reductases across Archaea and Bacteria provides some insight into the relative age of

161 these families. The qOR1 subfamily, which includes the Escherichia coli enzyme, at present

162 count seems to be the most widely distributed with enzymes in over 60 bacterial phyla,

Murali et al. ISMEJ-21-00087A 7

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

163 (Supplementary Table 2, 4) but it is only sparsely distributed in Archaea. In fact, there are only

164 a very few representatives in and Asgardarchaeota (Figure 2). It is only widely

165 distributed in Halobacterota, whose oxidative metabolism is expected to have evolved relatively

166 late40(Figure 3). This strongly suggests that the qOR1 subfamily is the oldest of the extant

167 families and that it is likely that cytochrome bd-type oxygen reductases originated in Bacteria.

168 While cydA from the qOR2 subfamily is also fairly well-distributed and found in over 20

169 bacterial phyla, the qOR3-subfamily enzymes are almost exclusive to the Firmicutes and

170 Firmicutes_I phyla with a few enzymes in Archaea. The qOR4a-subfamily enzymes appear to be

171 specific to the Archaea (Supplementary Tables 2, 4). A close evolutionary relationship between

172 the qOR2 and qOR3-subfamilies is suggested by cydA tree topology and identifiable sequence

173 characteristics but other trees we inferred have modelled a closer relationship between the qOR2

174 and qOR1 subfamilies (data not shown). Furthermore, the qOR1 subfamily has 7 amino acids

175 between the conserved glutamates in the proton channel, while the qOR4a, qOR2 and qOR3

176 subfamilies consistently have 6 amino acids. Lastly, enzymes from the qOR4a, qOR2 and qOR3

177 subfamilies are almost completely absent from Proteobacteria. This suggests that the qOR2,

178 qOR3 and qOR4a subfamilies diverged from the qOR1 family, Before proteobacteria diverged

179 from other bacterial phyla. While our dataset and phylogenetic analysis is consistent with the

180 above discussion, it must be noted that many lateral gene transfers have been observed within the

181 cytochrome bd-type oxygen reductase1 which complicate evolutionary analysis.

182 As mentioned above, most cydA subfamilies are widely distributed within Bacteria and

183 Archaea, but the qOR4a-subfamily is unique in having sequences that belong only to Archaea. In

184 addition, the qOR4a-subfamily is unique in having a completely different subunit II (cydA’),

185 while the qOR1-, qOR2- and qOR3- subfamily members appear to have cydB homologs as their

Murali et al. ISMEJ-21-00087A 8

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

186 subunit II. cydB is either not homologous to cydA’ or is evolutionarily distant. The unique

187 ancestry of the qOR4a-subfamily enzymes which is specific to archaea raises a question about its

188 distribution within that domain.

189 Distribution of cytochrome-bd type oxygen reductases in archaea

190 To investigate the distribution of cytochrome bd-type oxygen reductases within Archaea

191 and to contextualize the evolution of cydA within archaeal evolution, we mapped the presence of

192 qOR4a, qOR1, qOR2 and qOR3 subfamilies of cydA onto a phylogenetic tree of all archaea,

193 using a concatenated gene alignment made from the archaeal genomes in GTDB37 using

194 Anvi’o41, (Figure 2). It is clear from this representation that most of the qOR4a-subfamily or

195 cydAA’ belong to the class within the phylum with a few cydAA’

196 in Nitrosphaeria, Thermoplasmatota and Archaeoglobi. Within the Thermoprotei, almost all

197 members of the order contain cydAA’ and family Acidilobaceae contain

198 cydAA’ (Supplementary Table 3).

199 To place cydAA’ into an ecological context we looked at their environmental distribution

200 (Supplementary Table 5). Microbes containing cydAA’ are largely found in solfataric fields,

201 hot springs and deep-sea vents, suggesting that cydAA’ might only be utilized by thermophiles,

202 such as from the , Caldivirga, and Thermocladium42.

203 Within Yellowstone National Park (YNP), a number of these genera are found in hypoxic,

204 sulfur/iron-rich ecosystems, although and Thermofilum have also been found in

205 more oxygenated environments. It has been suggested that members of the Thermoproteales

206 which are found in aerobic environments have a heme-copper oxygen reductase and are more

207 likely to be using aerobic respiration as their primary energetic pathway42. This is consistent with

208 what we observe in Thermoproteales – organisms which do not have cydAA’ have heme-copper

Murali et al. ISMEJ-21-00087A 9

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

209 oxygen reductases instead (Supplementary Table 3). However, of the 8 Pyrobaculum genomes

210 in the GTDB database, the three genomes that have cydAA’, but are missing a heme-copper

211 oxygen reductase are capable of aerobic respiration43,44. This is suggestive of an based

212 on oxygen availability in the environment resulting in a trade-off between the greater energetic

213 efficiency and higher oxygen affinity of HCOs and bd respectively20,28.

214 Expression of the cydAA’ genes have been demonstrated in the hot springs and sulfur-

215 rich/iron-rich environments within Yellowstone National Park, using RT-PCR10 and

216 metatranscriptomics (Table 1). While the hot springs were typically hypoxic and sulfur-rich, the

217 iron oxide mats had higher oxygen concentration at the surface and had <0.3 μM concentrations

218 of O2 within 1 mm. Nitrosphaeria, Acidilobaceae, Thermoproteales and Thermoplasmatota

219 expressed cydAA’ in these environments, however it is not clear whether these microorganisms

220 were exposed to high O2 concentrations. In fact, the Acidolobaceae are expected to be found in

10,45 221 the middle and bottom layers of this mat where O2 concentrations are lower . All of the above

222 observations are consistent with the presence of cydAA’ in microaerobic and hypoxic

223 environments. The obvious question that needed to be addressed is whether cydAA’ actually

224 functions as an oxygen reductase. This was accomplished by biochemically characterizing the

225 CydAA’ from C. maquilingensis.

226 Partial purification and spectroscopic characterization of the cydAA’ from C.

227 maquilingensis

228 The cydAA’ operon from Caldivirga maquilingensis consists of two genes – cydA and

229 cydA’. There are no additional subunits encoded within the operon corresponding to cydX/cydY

230 or cydS, which are associated, respectively, with E.coli cydAB and G.thermodenitrificans

231 cydAB. Homologues of these subunits are not apparent in the C. maquilingensis genome. We

Murali et al. ISMEJ-21-00087A 10

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

232 cloned the operon into the pET22b vector and expressed it in an Escherichia coli strain in which

233 both bd-I and bd-II were deleted (CBO - C43, ΔcydA ΔappB)17. The enzyme, cytochrome bb’

234 oxygen reductase from Caldivirga was engineered to have numerous different tags –

235 6xHistidine, FLAG, GST and GFP. None of these tags were successful, either because of a poor

236 yield of or because of the inability of the affinity-tagged proteins to bind to columns with

237 their corresponding epitopes. A GFP-tagged protein was used to verify the expression in E.coli

238 of CydAA’ from Caldivirga maquilingensis. The presence of the protein could be observed by

239 following the fluorescence of the protein under UV light. Since subunit II was tagged with GFP,

240 it confirms the presence of subunit II in the preparation (Supplementary Figure 3). In addition,

241 the purified protein was verified by mass spectrometry with many peptides recovered from

242 subunit I. (Supplementary Figure 4). Gel electrophoresis of a partially purified CydAA’ shows

243 two bands of the sizes expected for CydA and CydA’ (Supplementary Figure 5).

244 A UV-visible spectrum of CydAA’ in the reduced-minus-oxidized state reveals the

245 absence of the heme d absorbance peak. The presence of heme b595 is also not apparent in the

246 spectrum since the maxima at 595 nm and the Soret peak at 440 nm are also missing. This could

247 indicate that heme b595 is low spin in this preparation. Hemes were extracted from CydAA’ of

248 Caldivirga maquilingensis as described previously46. Only b-type hemes are present in the

249 enzyme (Figure 4). This was verified by analyzing the hemes in the protein by LC-MS (data not

250 shown).

251 CydAA’ from Caldivirga maquilingensis has oxygen reduction activity

252 The oxygen reduction activity of CydAA’ was tested using a Clark electrode, with

253 reduced coenzyme Q1 (reduced using DTT) as the electron donor. (Table 2, Figure 4) At 37 ŀ

254 the specific activity is ~330 e-/s (/heme b). While this is not as high as the activity of E. coli bd at

Murali et al. ISMEJ-21-00087A 11

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

255 the same temperature (over 1000 e-/s), the enzymatic activity is substantial, particularly

256 considering the fact that the source of the enzyme is a thermophilic whose growth is

257 optimum at 65 ŀ. The oxygen reductase activity of CydAA’ is insensitive to the presence of 250

258 µM KCN, a concentration of cyanide that would completely inhibit the activity of heme-copper

259 oxygen reductases1. Since CydAA’ was expressed in the bd-deletion mutant, E.coli strain CBO,

47 260 the only other potential oxygen reductase in this preparation is bo3 ubiquinol oxygen reductase

261 so the lack of cyanide sensitivity confirms that our purification protocol has separated the two

262 enzymes. The enzyme is also susceptible to Aurachin AC1-10, a known inhibitor of cytochrome

263 bd at concentrations as low as 250 nM48. We did not test for other possible functions for cydAA’

264 such as catalase activity49 or peroxidase activity50.

265 We previously noted that cydAA’ is typically found in organisms that perform sulfur-

266 based chemistry such as sulfur reduction and sulfate reduction (Supplementary Table 5) and

267 use DMSO-reductase like enzymes which use molybdopterin as a cofactor10. Combining the

268 above observation with the demonstrated oxygen reductase activity of CydAA’, it is likely that

269 the role of CydAA’ is to detoxify oxygen to protect oxygen-sensitive enzymes involved in sulfur

270 metabolism. This is similar to its expected role in Desulfovibrio24 and in the protection of

271 nitrogenases during the process of nitrogen fixation22.

272 It is striking that oxygen reduction is conserved in the qOR4a-subfamily despite the

273 replacement of CydB with CydA’ and therefore, it is worth considering the similarities and

274 differences between E.coli CydAB and C.maquilingensis CydAA’.

275 Structural differences between CydAB and CydAA’ inferred from homology models

Murali et al. ISMEJ-21-00087A 12

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

276 To aid in the understanding of differences between CydAA’ and other CydAB, we used

277 multiple sequence alignments (Supplementary Figure 2) and structural models of cydA and

278 cydA’ from Caldivirga maquilingensis (Supplementary Figure 7, pdb files are available in

279 supplementary material). The most drastic difference between the E.coli and C.maquilingensis

280 enzymes is the absence of cydB. cydB in E.coli was shown to contain the oxygen diffusion

281 channel31,32 and an additional proton channel leading to heme d, bound to subunit I30–32. In C.

282 maquilingensis the second subunit is cydA’ which is 26 % similar to cydA. Only the first two

283 helices are well conserved between these subunits in C. maquilingensis whereas other qOR4a-

284 type cydA’ and qOR4b-type cydA, such as in A. fulgidus are similar in the first 4 helices. To

285 substitute for the proton channel that exists in cydB, conserved residues in cydA’ such as Thr71,

286 Thr74 and His126 might form a different proton channel. cydA’ probably hosts an oxygen

287 diffusion channel to substitute for the loss of the one in cydB but it is not possible to tell from the

288 sequence alignment or structural model where in the subunit this might be. Interestingly, cydA’

289 retains His19 which has been implicated as a ligand to heme d and heme b595 in E. coli and G.

290 thermodenitrificans cytbd respectively, which might suggest that an additional heme might bind

291 to the cydA’ subunit but we cannot verify or refute this from our protein preparation. A number

292 of mutations are observed around the binuclear-active site in subunit I, which might affect the

293 midpoint potential of the heme or the proton-coupled electron transfer mechanisms.

294 Evolution of cydA’ and other cydA homologs missing the quinol binding site

295 As mentioned earlier, a phylogenomic analysis of cydA homologs revealed two new

296 families, OR-C and OR-N that share the first four helices containing the oxygen reduction site.

297 The cydA subunit of OR-C bd-type oxygen reductases typically has eight transmembrane helices

298 and an extended C-terminal periplasmic portion that binds hemes c, strongly suggesting that a

Murali et al. ISMEJ-21-00087A 13

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

299 cytochrome c could be an electron donor to this family. Adjacent to the OR-C1a-type cydA is

300 OR-C1b which also has 8 transmembrane helices. The OR-N3a/b, -N4a/b and -N5a/b family

301 cydA typically have 10 helices while the OR-N2 and -N1 have 14 transmembrane helices. OR-N

302 enzymes have been previously noted in Nitrospira51 and Chloroflexi (N5a/b)52. They were

303 recently shown to be expressed in manganese oxidizing autotrophic microorganism, Candidatus

304 manganitrophus noduliformans (N2/N1)from the phylum Nitrospira53 and is implicated in

305 oxygen reduction. Greater details on the OR-C and OR-N families, including distribution,

306 alignments and conserved amino acids are found in Supplementary Material. The OR-C and

307 OR-N families are widely distributed in Bacteria. OR-C is present only in Bacteria, while OR-N

308 has very few representatives in Archaea (Supplementary Table 2, Supplementary Table 4).

309 A phylogenetic tree of all cydA clades suggest that the OR-C and OR-N families are

310 more closely related to qOR4b than the other qOR subfamilies (Figure 1). There are also

311 conserved sequence features that suggest that the OR-C and OR-N families are more closely

312 related to the qOR3 and qOR4a families than the qOR1 family including the deletion in the

313 proton channel between the conserved glutamates E101 and E108 like in G.thermodenitrificans

314 cytbd, and the presence of nearby conserved tyrosines (Y123 and Y125 in the

315 G.thermodenitrificans cytbd numbering). (Supplementary table 6). This suggests that the OR-C

316 and OR-N families diverged from either of these two families and evolved after the qOR

317 reductases. The evolutionary analysis within this family is complicated by the high number of

318 independent gene duplication events. It appears that qOR4b, OR-N5b, OR-N3b, OR-N4b and

319 OR-C1b subfamilies were the result of gene duplication events (Figure 1). In fact, OR-3a and

320 OR-3b cydA share 50 % sequence similarity. Additionally, the presence of OR-N1-type and OR-

321 N2-type cydA in the same operon in some Bacteria and the extent of similarity between them (up

Murali et al. ISMEJ-21-00087A 14

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

322 to 40%) suggest that they were part of yet another gene duplication. The importance of gene

323 duplication in protein evolution and functional diversification is well-known54. The nature this

324 process has taken in the cytochrome bd-type oxygen reductases is interesting - a majority of the

325 cydA paralogs have maintained the architecture associated with oxygen reduction and all of them

326 have maintained the His19 ligand to the active site heme d (as per the E. coli structure).

327 Additionally, all the above-mentioned duplication events appear to have resulted in a complex of

328 multiple cydA-like proteins with the possible exception of OR-N1 and OR-N2. OR-N2 is often

329 found in operons without another cydA-like protein (Figure 1). His19 and heme d is found near

330 the interface of subunit I and subunit II in the cydAB structures and the complete conservation of

331 these features with a change in their interacting partner, is suggestive of the process of

332 duplication and interface evolution recently investigated in hemoglobin55. Future work in the

333 biochemical and structural characterization of the various cytbd families will help us develop

334 insight into the driving forces behind the evolution of this superfamily. Presently, it is clear that

335 the defining characteristic of the cytbd superfamily is the di-heme oxygen reduction site found in

336 the first four helices of cydA homologs. Our analysis suggests that the bd protein scaffold was

337 diversified multiple times to perform O2 chemistry in unique environments, possibly to function

338 with different electron donors.

339 Conclusions

340 The superfamily of cytochrome bd-type oxygen reductases is one of only two oxygen reductase

341 superfamilies that are widely distributed in Bacteria and Archaea. In the current work we have

342 demonstrated the large diversity of this superfamily using phylogenomics. In addition, we

343 biochemically characterized the CydAA’ from C. maquilingensis showing that cydAA’ is a

344 robust oxygen reductase. The isolated CydAA’ contained only b hemes and no heme d. Hence,

Murali et al. ISMEJ-21-00087A 15

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

345 C. maquilingensis CydAA’ is a bb’-type oxygen reductase and is the first such enzyme to be

346 purified and demonstrated to be a functional oxygen reductase. Finally, we demonstrate that

347 significant diversification of the cydA has occurred with the conserved oxygen reduction site

348 being adapted to multiple functions within various ecological niches.

349 Materials and Methods

350 Phylogenomic analysis of cytochrome bd sequences in the GTDB database

351 In order to reconcile the protein phylogeny of cytochrome bd oxygen reductases with

352 taxonomy, we identified and mapped all cytbd to their respective species within the GTDB

353 database release8937. All cydA sequences were extracted from GTDB genomes using BLAST56 with an

354 e-value of 1e-1. The sequences were then aligned using muscle57 using the optional maxiters cut-off of 2.

355 The alignment was visualized on Jalview58 and sequences were filtered to remove cydA sequences

356 without characteristics of the quinol binding site or the proton channel. This filtration step was used to

357 remove subunits II but also resulted in the loss of a few subunits I within qOR1 that appear to have lost

358 the proton channel. The filtered set of cydA sequences were then classified using a Hidden Markov

359 Model (HMM)-based classifier trained to identify the families – qOR1, qOR2, qOR3, qOR4a, OR-C and

360 OR-N. The HMMs for those subfamilies and families are available in the supplementary material. The

361 presence or absence of cytbd in each species was tabulated and is available as Supplementary Table 2.

362 The all archaea species tree used to analyze the distribution of cytochrome bd oxygen reductases in

363 archaea was generated using Anvi’o41. A multiple sequence alignment was created by extracting all

364 ribosomal proteins from archaeal genomes using the HMM source Archaea_76. This alignment was then

365 used to generate a phylogenetic tree using FastTree as per Anvi’o’s default settings. This tree was

366 annotated using the data available in Supplementary Table 4 on the iTOL server59.

Murali et al. ISMEJ-21-00087A 16

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

367 The protein phylogeny of cytbd sequences was inferred using sequences of cytbd subunit I, cydA. These

368 were extracted from a taxonomically diverse set of genomes and metagenomes from IMG60, filtered with

369 UCLUST61 using a percentage identity cut-off of 0.6 and aligned using MUSCLE. The multiple sequence

370 alignment was used to infer a phylogenetic tree using RAxML62 on the CIPRES Science Gateway63 with

371 the PROTGAMMA substitution model, DAYHOFF matrix specification and a bootstrap analysis with

372 100 iterations.

373 Preparation of construct for of cytochrome bd oxidase from Escherichia coli

374 The genes encoding the bb’ oxygen reductase (Gene Object ID: 641276193-4) from C.

375 maquilengensis were PCR amplified using primers purchased from Integrated DNA Technology.

376 The genes were cloned into pET22b (Invitrogen) using 5’ NdeI and 3’ XhoI cut sites. The

377 inherent 6-Histidine tag in the vector was used to purify the protein. The vector was engineered

378 to use EGFP, GST or FLAG tags alternatively. The tag was added to subunit II in case of EGFP

379 and FLAG; a tag on both subunit I and II was attempted for the His-tag and GST tag. The

380 expression vector, along with pRARE (Novagen) was then transformed into (CBO

381 ΔcydBΔappC::kan) for protein expression.

382 Cell Growth and Protein Purification

383 A single colony was inoculated into 5 ml of LB (yeast extract and tryptone were

384 purchased from Acumedia and NaCl from Sigma-Aldrich) with 100 µg/ml Ampicillin and

385 incubated with shaking at 37 ŀ. The following day, the 5 ml culture was inoculated in 300 ml

386 LB with 100 µg/ml Ampicillin and grown overnight at 37 ŀ. On the third day, 10 ml of the

387 secondary culture was inoculated into twenty four of 2L flasks containing 1 L LB with 100

388 µg/ml Ampicillin, each. The flasks were incubated at 37 ŀ while shaking at 200 rpm, until the

Murali et al. ISMEJ-21-00087A 17

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

389 OD600 of the culture reached 0.6. The temperature was then lowered to 30 ŀ, and the culture

390 was incubated for 8 hrs or overnight.

391 The fully-grown cultures were then pelleted by spinning down at 8000 rpm for 8 minutes,

392 in 500 ml centrifuge bottles. The harvested cells were then resuspended in 100 mM Tris-HCl, 10

393 mM MgS04, pH 8 with DNaseI and a protease inhibitor cocktail from Sigma. The cells were

394 then homogenized using a Bamix Homogenizer, and passed through a Microfluidizer cell at 100

395 psi, three times, to lyse the cells. The soluble fraction of the lysate was then separated from the

396 insoluble by spinning down the lysate at 8000 rpm. Membranes were extracted from the soluble

397 fraction by centrifuging the soluble fraction at 42000 rpm for 4 hours.

398 Membranes were resuspended in 20 mM Tris, 300 mM NaCl, pH 8 and then solubilized

399 with 1% DDM or 1% SML. The solubilized membranes were spun down at 42000 rpm for 45

400 minutes to remove unsolubilized membranes. The supernatant was stirred with Ni-NTA resin for

401 1 hr and then loaded onto a column. The flow through was shown to contain the cydAA’ because

402 of its poor affinity for the nickel column. The flow through was then diluted in buffer to contain

403 50 mM salt and then loaded onto a DEAE column equilibriated with 20 mM Tris, pH 8, 0.05%

404 DDM. An elution gradient was run between 0-500 mM NaCl and cydAA’ was partially purified

405 from the fraction with higher absorbance at A412nm, corresponding to the soret peak for heme b

406 and used for assays and spectroscopy. This is similar to the first step for purification of

407 Escherichia coli.

408 UV-visible spectroscopy

409 Spectra of the protein were obtained using an Agilent DW-2000 Spectrophotometer in the

410 UV-visible region. The cuvette used has a pathlength of 1cm. The oxidized spectrum was taken

Murali et al. ISMEJ-21-00087A 18

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

411 of the air-oxidized protein. The enzyme was reduced with dithionite to obtain a reduced

412 spectrum.

413 Collection of Pyridine Hemochrome spectra and Heme Analysis

414 For the wildtype or mutants enzymes, 35 μl of the enzyme solution was mixed with an

415 equal volume of 40% pyridine with 200 mM NaOH. The oxidized spectra was measured in the

416 presence of ferricyanide and the reduced in the presence of dithionite. The values of heme b were

417 calculated according to the matrix suggested in46. The concentration of heme d was estimated

-1 -1 418 using the coefficient ε(629-670nm) = 25 mM cm .

419 Measurement of oxygen reductase activity

420 Oxygen reductase activity was measured using the Mitocell Miniature Respirometer

421 MT200A (Harvard Apparatus). 5 mM DTT and 350 µM Q1 were used as electron donors to

422 measure oxygen reduction by C.maquilingensis cydAA’ and E.coli cytochrome bd. 150-250 µM

423 KCN was used to test the cyanide sensitivity of the enzymes.

424 Structural modelling of cydAA’ from Caldivirga maquilingensis

425 Sequences of subunit I from Geobacillus thermodenitrificans and Caldivirga

426 maquilingensis were aligned using a larger alignment comprising many hundreds of bb’

427 sequences made with the software MUSCLE. This alignment was used as to create a model of

428 subunit I from Caldivirga using the Geobacillus subunit I as a template on the Swiss Model

429 server. A model of subunit II was also created using subunit I as a template. (The alignments are

430 provided as Supplementary Figures S4 and S5) The model was then visualized using VMD

431 1.9.2beta1.

Murali et al. ISMEJ-21-00087A 19

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

432

433 References

434 1. Borisov, V. B., Gennis, R. B., Hemp, J. & Verkhovsky, M. I. The cytochrome bd respiratory oxygen

435 reductases. Biochim. Biophys. Acta BBA - Bioenerg. 1807, 1398–1413 (2011).

436 2. Moll, R. & Schäfer, G. Purification and characterisation of an archaebacterial succinate

437 dehydrogenase complex from the plasma membrane of the thermoacidophile

438 acidocaldarius. Eur. J. Biochem. 201, 593–600 (1991).

439 3. Kabashima, Y. & Sakamoto, J. Purification and biochemical properties of a cytochrome bc complex

440 from the aerobic hyperthermophilic archaeon pernix. BMC Microbiol. 11, 52 (2011).

441 4. S, A. & G, S. Cytochrome aa3 from Sulfolobus acidocaldarius. A single-subunit, quinol-oxidizing

442 archaebacterial terminal oxidase. Eur. J. Biochem. 191, 297–305 (1990).

443 5. Bandeiras, T. M., Pereira, M. M., Teixeira, M., Moenne-Loccoz, P. & Blackburn, N. J. Structure and

444 coordination of CuB in the ambivalens aa3 quinol oxidase heme–copper center. JBIC J.

445 Biol. Inorg. Chem. 10, 625–635 (2005).

446 6. Marreiros, B. C. et al. Exploring membrane respiratory chains. Biochim. Biophys. Acta BBA -

447 Bioenerg. 1857, 1039–1067 (2016).

448 7. Blank, C. E. Low Rates of Lateral Gene Transfer among Metabolic Genes Define the Evolving

449 Biogeochemical Niches of Archaea through Deep Time. Archaea 2012, 1–23 (2012).

450 8. Inskeep, W. P. et al. Phylogenetic and Functional Analysis of Metagenome Sequence from High-

451 Temperature Archaeal Habitats Demonstrate Linkages between Metabolic Potential and

452 Geochemistry. Front. Microbiol. 4, (2013).

453 9. Beam, J. P., Jay, Z. J., Kozubal, M. A. & Inskeep, W. P. Niche specialization of novel

454 to oxic and hypoxic acidic geothermal springs of Yellowstone National Park. ISME

455 J. 8, 938–951 (2014).

Murali et al. ISMEJ-21-00087A 20

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

456 10. Jay, Z. J. et al. Predominant -Like Populations from Geothermal Environments in

457 Yellowstone National Park Exhibit Similar Metabolic Potential in Different Hypoxic Microbial

458 Communities. Appl. Environ. Microbiol. 80, 294–305 (2014).

459 11. Jay, Z. J. et al. Pyrobaculum yellowstonensis Strain WP30 Respires on Elemental Sulfur and/or

460 Arsenate in Circumneutral Sulfidic Geothermal Sediments of Yellowstone National Park. Appl.

461 Environ. Microbiol. 81, 5907–5916 (2015).

462 12. Azarkina, N. et al. A Cytochrome bb′-type Quinol Oxidase inBacillus subtilis Strain 168. J. Biol.

463 Chem. 274, 32810–32817 (1999).

464 13. Cunningham, L., Pitt, M. & Williams, H. D. The cioAB genes from Pseudomonas aeruginosa code

465 for a novel cyanide-insensitive terminal oxidase related to the cytochrome bd quinol oxidases. Mol.

466 Microbiol. 24, 579–591 (1997).

467 14. Sootsuwan, K., Lertwattanasakul, N., Thanonkeo, P., Matsushita, K. & Yamada, M. Analysis of the

468 respiratory chain in Ethanologenic Zymomonas mobilis with a cyanide-resistant bd-type ubiquinol

469 oxidase as the only terminal oxidase and its possible physiological roles. J. Mol. Microbiol.

470 Biotechnol. 14, 163–175 (2008).

471 15. Essential role of cytochrome bd-related oxidase in cyanide resistance of Pseudomonas

472 pseudoalcaligenes CECT5344. - Abstract - Europe PMC.

473 https://europepmc.org/article/med/17574992.

474 16. Sun, Y.-H. et al. The small protein CydX is required for function of cytochrome bd oxidase in

475 Brucella abortus. Front. Cell. Infect. Microbiol. 2, (2012).

476 17. Hoeser, J., Hong, S., Gehmann, G., Gennis, R. B. & Friedrich, T. Subunit CydX of Escherichia coli

477 cytochrome bd ubiquinol oxidase is essential for assembly and stability of the di-heme active site.

478 FEBS Lett. 588, 1537–1541 (2014).

479 18. Chen, H., Luo, Q., Yin, J., Gao, T. & Gao, H. Evidence for the requirement of CydX in function but

480 not assembly of the cytochrome bd oxidase in Shewanella oneidensis. Biochim. Biophys. Acta BBA -

481 Gen. Subj. 1850, 318–328 (2015).

Murali et al. ISMEJ-21-00087A 21

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

482 19. The Escherichia coli CydX Protein Is a Member of the CydAB Cytochrome bd Oxidase Complex and

483 Is Required for Cytochrome bd Oxidase Activity | Journal of Bacteriology. https://jb-asm-

484 org.clsproxy.library.caltech.edu/content/195/16/3640.

485 20. D’mello, R., Hill, S. & Poole, R. K. The cytochrome bd quinol oxidase in Escherichia coli has an

486 extremely high oxygen affinity and two oxygen-binding haems: implications for regulation of activity

487 in vivo by oxygen inhibition. Microbiology, 142, 755–763 (1996).

488 21. Kaminski, P. A., Kitts, C. L., Zimmerman, Z. & Ludwig, R. A. Azorhizobium caulinodans uses both

489 cytochrome bd (quinol) and cytochrome cbb3 (cytochrome c) terminal oxidases for symbiotic N2

490 fixation. J. Bacteriol. 178, 5989–5994 (1996).

491 22. Juty, N. S. & Hill, S. The KIebsieIIa pneumoniae cytochrome bd‘ terminal oxidase complex and its

492 role in microaerobic nitrogen fixation. 11 (1997).

493 23. Wakai, S., Kikumoto, M., Kanao, T. & Kamimura, K. Involvement of sulfide:quinone oxidoreductase

494 in sulfur oxidation of an acidophilic iron-oxidizing bacterium, Acidithiobacillus ferrooxidans NASF-

495 1. Biosci. Biotechnol. Biochem. 68, 2519–2528 (2004).

496 24. Lamrabet, O. et al. Oxygen reduction in the strict anaerobe Desulfovibrio vulgaris Hildenborough:

497 characterization of two membrane-bound oxygen reductases. Microbiology 157, 2720–2732 (2011).

498 25. Lemos, R. S. et al. The ‘strict’ anaerobe Desulfovibrio gigas contains a membrane-bound oxygen-

499 reducing respiratory chain. FEBS Lett. 496, 40–43 (2001).

500 26. Borisov, V. B., Belevich, I., Bloch, D. A., Mogi, T. & Verkhovsky, M. I. Glutamate 107 in Subunit I

501 of Cytochrome bd from Escherichia coli Is Part of a Transmembrane Intraprotein Pathway

502 Conducting Protons from the Cytoplasm to the Heme b595/Heme d Active Site. Biochemistry 47,

503 7907–7914 (2008).

504 27. Bertsova, Y. V., Bogachev, A. V. & Skulachev, V. P. Generation of protonic potential by the bd -type

505 quinol oxidase of Azotobacter vinelandii. FEBS Lett. 414, 369–372 (1997).

506 28. Borisov, V. B. et al. Aerobic respiratory chain of Escherichia coli is not allowed to work in fully

507 uncoupled mode. Proc. Natl. Acad. Sci. 108, 17320–17324 (2011).

Murali et al. ISMEJ-21-00087A 22

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

508 29. Stolper, D. A., Revsbech, N. P. & Canfield, D. E. Aerobic growth at nanomolar oxygen

509 concentrations. Proc. Natl. Acad. Sci. 107, 18755–18760 (2010).

510 30. Safarian, S. et al. Structure of a bd oxidase indicates similar mechanisms for membraneintegrated

511 oxygen reductases. Science 352, 583–586 (2016).

512 31. Safarian, S. et al. Active site rearrangement and structural divergence in prokaryotic respiratory

513 oxidases. Science 366, 100–104 (2019).

514 32. Theßeling, A. et al. Homologous bd oxidases share the same architecture but differ in mechanism.

515 Nat. Commun. 10, 5138 (2019).

516 33. Miller, M. J. & Gennis, R. B. The purification and characterization of the cytochrome d terminal

517 oxidase complex of the Escherichia coli aerobic respiratory chain. J. Biol. Chem. 258, 9159–9165

518 (1983).

519 34. Voggu, L. et al. Microevolution of Cytochrome bd Oxidase in Staphylococci and Its Implication in

520 Resistance to Respiratory Toxins Released by Pseudomonas. J. Bacteriol. 188, 8079–8086 (2006).

521 35. Osborne, J. & Gennis, R. Sequence analysis of cytochrome bd oxidase suggests a revised topology

522 for subunit I. Biochim. Biophys. Acta BBA - Bioenerg. 1410, 32–50 (1999).

523 36. Mogi, T. et al. Probing the Ubiquinol-Binding Site in Cytochrome bd by Site-Directed Mutagenesis.

524 Biochemistry 45, 7924–7930 (2006).

525 37. Parks, D. H. et al. A standardized bacterial taxonomy based on genome phylogeny substantially

526 revises the tree of life. Nat. Biotechnol. 36, 996–1004 (2018).

527 38. Rinke, C. et al. A rank-normalized archaeal taxonomy based on genome phylogeny resolves

528 widespread incomplete and uneven classifications. bioRxiv 2020.03.01.972265 (2020)

529 doi:10.1101/2020.03.01.972265.

530 39. Mogi, T., Endou, S., Akimoto, S., Morimoto-Tadokoro, M. & Miyoshi, H. Glutamates 99 and 107 in

531 transmembrane helix III of subunit I of cytochrome bd are critical for binding of the heme b595-d

532 binuclear center and enzyme activity. Biochemistry 45, 15785–15792 (2006).

533 40. Davín, A. A. et al. Gene transfers can date the tree of life. Nat. Ecol. Evol. 2, 904–909 (2018).

Murali et al. ISMEJ-21-00087A 23

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

534 41. Eren, A. M. et al. Anvi’o: an advanced analysis and visualization platform for ‘omics data. PeerJ 3,

535 e1319 (2015).

536 42. Jay, Z. J. et al. The distribution, diversity and function of predominant Thermoproteales in high-

537 temperature environments of Yellowstone National Park. Environ. Microbiol. 18, 4755–4769 (2016).

538 43. Cozen, A. E. et al. Transcriptional Map of Respiratory Versatility in the Hyperthermophilic

539 Crenarchaeon . J. Bacteriol. 191, 782–794 (2009).

540 44. Nunoura, T., Sako, Y., Wakagi, T. & Uchida, A. Regulation of the aerobic respiratory chain in the

541 facultatively aerobic and hyperthermophilic archaeon Pyrobaculum oguniense. Microbiology, 149,

542 673–688 (2003).

543 45. Bernstein, H. C., Beam, J. P., Kozubal, M. A., Carlson, R. P. & Inskeep, W. P. In situ analysis of

544 oxygen consumption and diffusive transport in high-temperature acidic iron-oxide microbial mats.

545 Environ. Microbiol. 15, 2360–2370 (2013).

546 46. Berry, E. A. & Trumpower, B. L. Simultaneous determination of hemes a, b, and c from pyridine

547 hemochrome spectra. Anal. Biochem. 161, 1–15 (1987).

548 47. Gennis, R. B. Some recent advances relating to prokaryotic cytochrome c reductases and cytochrome

549 c oxidases. Biochim. Biophys. Acta BBA - Bioenerg. 1058, 21–24 (1991).

550 48. Meunier, B., Madgwick, S. A., Reil, E., Oettmeier, W. & Rich, P. R. New inhibitors of the quinol

551 oxidation sites of bacterial cytochromes bo and bd. Biochemistry 34, 1076–1083 (1995).

552 49. Borisov, V. B. et al. Cytochrome bd oxidase from Escherichia coli displays high catalase activity: an

553 additional defense against oxidative stress. FEBS Lett. 587, 2214–2218 (2013).

554 50. Al-Attar, S. et al. Cytochrome bd Displays Significant Quinol Peroxidase Activity. Sci. Rep. 6,

555 (2016).

556 51. Lücker, S. et al. A Nitrospira metagenome illuminates the physiology and evolution of globally

557 important nitrite-oxidizing bacteria. Proc. Natl. Acad. Sci. U. S. A. 107, 13479–13484 (2010).

558 52. Soo, R. M., Hemp, J., Parks, D. H., Fischer, W. W. & Hugenholtz, P. On the origins of oxygenic

559 photosynthesis and aerobic respiration in Cyanobacteria. Science 355, 1436–1440 (2017).

Murali et al. ISMEJ-21-00087A 24

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

560 53. Yu, H. & Leadbetter, J. R. Bacterial chemolithoautotrophy via manganese oxidation. Nature 583,

561 453–458 (2020).

562 54. Ohno, S. Evolution by gene duplication. (1970).

563 55. Pillai, A. S. et al. Origin of complexity in haemoglobin evolution. Nature 581, 480–485 (2020).

564 56. Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search

565 tool. J. Mol. Biol. 215, 403–410 (1990).

566 57. Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space

567 complexity. BMC Bioinformatics 5, 113 (2004).

568 58. Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2—a

569 multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009).

570 59. Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display

571 and annotation. Bioinforma. Oxf. Engl. 23, 127–128 (2007).

572 60. Chen, I.-M. A. et al. IMG/M v.5.0: an integrated data management and comparative analysis system

573 for microbial genomes and microbiomes. Nucleic Acids Res. 47, D666–D677 (2019).

574 61. Edgar, R. C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460–

575 2461 (2010).

576 62. Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large

577 phylogenies. Bioinformatics 30, 1312–1313 (2014).

578 63. M. A. Miller, W. Pfeiffer, & T. Schwartz. Creating the CIPRES Science Gateway for inference of

579 large phylogenetic trees. in 2010 Gateway Computing Environments Workshop (GCE) 1–8 (2010).

580 doi:10.1109/GCE.2010.5676129.

581

582

583

584

Murali et al. ISMEJ-21-00087A 25

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

585 List of Tables

586 Table 1. Expression of cydAA’ in the environment estimated using publicly available

587 metatranscriptomes. cydA homologs from the metatranscriptomic data available on the IMG website

588 were extracted using a BLASTP cutoff of 1e-5. The short fragments found in metatranscriptomic data

589 were matched with the full corresponding protein sequence based on the best hit in the NCBI database.

590 Table 2. Oxygen reduction activity of E. coli cydAB and C. maquilingensis cydAA’ in the presence

591 of 350 µM coenzyme Q1 and 5 mM DTT.

592 Supplementary Table 1. Total number of various cydA families and subfamilies in the GTDB. All

593 cydA sequences were extracted from GTDB genomes using BLAST with an e-value of 1e-1. The

594 sequences were then filtered to remove cydA sequences without characteristics of the quinol binding site

595 and then classified using a Hidden Markov Model (HMM)-based classifier trained to identify the families

596 – qOR1, qOR2, qOR3, qOR4a, OR-C, OR-N. The total number of cydA sequences in each of these

597 families and subfamilies were summed to generate this table.

598 Supplementary Table 2. Distribution of cydA subfamilies by genome in GTDB. All cydA sequences

599 were extracted from GTDB genomes using BLAST with an e-value of 1e-1. The sequences were then

600 filtered to remove cydA sequences without characteristics of the quinol binding site and then classified

601 using a Hidden Markov Model (HMM)-based classifier trained to identify the families – qOR1, qOR2,

602 qOR3, qOR4a, OR-C, OR-N. Labelled cydA sequences were then mapped back to each species to

603 generate this table.

604 Supplementary Table 3. Distribution of cydA subfamilies by phyla in GTDB. All cydA sequences

605 were extracted from GTDB genomes using BLAST with an e-value of 1e-1. The sequences were then

606 filtered to remove cydA sequences without characteristics of the quinol binding site and then classified

607 using a Hidden Markov Model (HMM)-based classifier trained to identify the families – qOR1, qOR2,

Murali et al. ISMEJ-21-00087A 26

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

608 qOR3, qOR4a, OR-C, OR-N. Labelled cydA sequences were then mapped back to each species and the

609 cydA sequences from each family/subfamily were summed across each phyla.

610 Supplementary Table 4. Distribution of quinol oxidizing cytbd in archaeal genomes found in the

611 GTDB. All cydA sequences were extracted from GTDB genomes using BLAST with an e-value of 1e-1.

612 The sequences were then filtered to remove cydA sequences without characteristics of the quinol binding

613 site and then classified using a Hidden Markov Model (HMM)-based classifier trained to identify the

614 families – qOR1, qOR2, qOR3, qOR4a. Labelled cydA sequences were then mapped back to each species

615 within the domain archaea to generate this table.

616 Supplementary Table 5. Growth conditions and temperature for organisms containing cydAA’.

617 Growth conditions, sensitivity to oxygen and temperature are detailed with references in this table to see

618 if there is a pattern to the conditions under which cydAA’ is typically found.

619 Supplementary Table 6. Conserved features residues identified in cytbd families without quinol

620 oxidizing features - qOR4b and subfamilies OR-C1a, OR-N1, OR-N2, OR-N3a, OR-N3b, OR-N4a,

621 OR-N4b, OR-N5a and OR-N5b. Conserved residues were identified using multiple sequence alignments

622 of cydA sequences from the above families. The presence or absence of the conserved residues for the

623 three hemes, proton channel, quinol binding site are marked.

624 Supplementary Table 7. Number of heme c binding sites in cytbd sequences from OR-C1a/OR-C1b

625 subfamilies. Number of heme c binding sites were counted using a python script that identifies the

626 number of CXXCH motifs in each protein sequence. The OR-C1a sequences from Desulfovibrionia

627 appear to have the greatest number of heme c binding sites – up to 8.

628

629

630

Murali et al. ISMEJ-21-00087A 27

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

631 List of multiple sequence alignments

632 Supplementary multiple sequences alignment MSA1. Multiple sequence alignment of sequences from

633 cydA subfamilies qOR1, qOR2, qOR3, qOR4a, qOR4b and families OR-C1a and OR-N. OR-N is not

634 split into subfamilies in this alignment. Various families and subfamilies are grouped when visualized in

635 Jalview and amino acids are colored using a ClustalX algorithim with a greater than 90 % identity. This

636 alignment was manually curated to improve the alignment and reduce the number of gaps.

637 Supplementary multiple sequences alignment MSA2. Multiple sequence alignment of sequences from

638 cydA subfamilies qOR1, qOR2, qOR3, qOR4a, qOR4b. OR-C1a, OR-N1, OR-N2. OR-N3a/3b, OR-

639 N4a/4b, OR-N5a and OR-N5b. Various families and subfamilies are grouped when visualized in Jalview

640 and amino acids are colored using a ClustalX algorithim with a greater than 90 % identity.

641 Supplementary multiple sequences alignment MSA3. Multiple sequence alignment of sequences from

642 all 15 cydA subfamilies qOR1, qOR2, qOR3, qOR4a, qOR4b. OR-C1a, OR-C1b OR-N1, OR-N2. OR-

643 N3a/3b, OR-N4a/4b, OR-N5a and OR-N5b.

644

645 List of Figures

646 Figure 1. Families within the cytochrome bd-type oxygen reductase superfamily. The cytochrome-bd

647 type oxygen reductase superfamily is divided into 3 families – qOR, OR-C and OR-N, primarily defined

648 by the presence of the quinol binding site in the first, the presence of heme c binding site in the second

649 and the abundance of OR-N enzymes in nitrospirota. The above schematic represents the various

650 subfamilies within each family, which are defined by the phylogenetic clustering shown in Supplementary

651 Figure 1). The operon context and putative complex arrangement of each cydA-containing enzyme is also

652 shown with a reference protein accession number and source microorganism. The potential gene

Murali et al. ISMEJ-21-00087A 28

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

653 duplication events are highlighted in yellow. A legend is also provided to mark the related conserved

654 domains in the same colors and redox co-factors such hemes and iron-sulfur clusters.

655 Figure 2. Phylogeny of quinol-oxidizing cytochrome bd-type oxygen reductases. Sequences of cytbd

656 subunit I, cydA were extracted from a taxonomically diverse set of genomes and metagenomes from

657 IMG, filtered with UCLUST using a percentage identity cut-off of 0.6 and aligned using MUSCLE. The

658 multiple sequence alignment was used to infer a phylogenetic tree using RAxML. The RAxML tree

659 topology was similar to that inferred by PhyML and Mr. Bayes. The cydA sequences which do not

660 contain the quinol binding site, from the OR-C and OR-N families as well as qOR4b, were used as the

661 outgroup. At least four monophyletic clades of typical cydA sequences that contain the O2- and quinol

662 binding site could be defined – qOR1, qOR2, qOR3 and qOR4a. The long branch within the qOR1 clade

663 comprises a number of cytbd which are highly similar to enzymes from this clade but are missing the

664 proton channel. Subunit I of cydAA’ is from the qOR4a-family.

665 Figure 3. Distribution of cytochrome bd-type oxygen reductases in Archaea. Concatenated gene

666 alignments were made from the archaea genomes in GTDB using Anvi’o. A phylogenetic tree was made

667 from the concatenated gene alignments using FastTree. All cydA sequences were extracted from GTDB

668 genomes using BLAST with an e-value of 1e-1. The sequences were then filtered to remove cydA

669 sequences without characteristics of the quinol binding site and then classified using a Hidden Markov

670 Model (HMM)-based classifier trained to identify the families – qOR1, qOR2, qOR3 and qOR4a. cydA

671 sequences from each family were then mapped back to each species, and visualized along with the species

672 tree on the iTOL server. Most phyla of the domain archaea were distinguished by color and a few classes

673 of the phylum Crenarchaeota were labelled to emphasize the presence of cydAA’. It is clear that cydAA’

674 is almost exclusive to the order Thermoproteales and .

675 Figure 4. Biochemical characteristics of cydAA’ from Caldivirga maquilingensis. (a and b.) UV-

676 visible spectra of cytochrome bd-type oxygen reductase purified from Escherichia coli and Caldivirga

677 maquilingensis, respectively. C. Pyridine hemochrome spectra of cydAA’ from Caldivirga

Murali et al. ISMEJ-21-00087A 29

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

678 maquilingensis revealing the absence of heme d in the partially purified enzyme. D. Oxygen reductase

679 activity of cydAA’ from C. maquilingensis shows that it is highly active and cyanide insensitive. It is

680 sensitive to Aurachin C1-10, a quinol binding site inhibitor which also inhibits E. coli cytochrome bd.

681 List of supplementary figures

682 Supplementary Figure 1. Phylogenetic clustering of all cydA-like sequences. cydA sequences were

683 extracted from a taxonomically diverse set of genomes and metagenomes from IMG and aligned using

684 MUSCLE. The multiple sequence alignment was used to infer a phylogenetic tree using RAxML. The

685 RAxML tree topology was similar to that inferred by PhyML. The three families, qOR, OR-C and OR-N

686 are clearly separated, and 15 subfamilies were designated based on the clustering observed and

687 identifiable sequence characteristics.

688 Supplementary Figure 2. Sequence characteristics of qOR4a-cydA. a. An unrooted phylogenetic tree

689 of cydA sequences from archaea including both qOR4a-type cydA and cydA of the qOR1, qOR2 and

690 qOR3 (in red) types was generated using RaxML. qOR4a type cydA have some internal clusters,

691 identified with a shaded box in green, blue and purple. Characteristics unique to the cluster, when

692 identifiable were indicated. For e.g., the presence of the insertion in the proton channel. b. A multiple

693 sequence alignment of the sequences present in the above clusters, the background of each sequence

694 cluster shaded in red, green, blue and purple according to the colors in the tree in a. Conserved residues

695 corresponding to the ligands, proton channel and a few residues expected to participate in proton-coupled

696 electron transfer are marked. Significantly, several qOR4a-type cydA have a lysine substituted for M393

697 (E. coli numbering) in the active site suggesting an alteration of the midpoint potential of heme b558 in

698 those enzymes.

699 Supplementary Figure 3. GFP-tagged cydAA’ from Caldivirga maquilingensis. The presence of

700 cydAA’ during protein purification protocol was verified by looking at elution fractions under UV-light.

701 Three glass vials containing (from leftmost) elution buffer, an elution fraction containing GFP-tagged

Murali et al. ISMEJ-21-00087A 30

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

702 cydAA’ and a fraction without cydAA’ are compared. The green fluorescence in the cydAA’ containing

703 fraction is easily distinguishable.

704 Supplementary Figure 4. Mass spectrometric identification of subunit I of cydAA’ from Caldivirga

705 maquilingensis. Partially purified cydAA’ was digested with Chymotrypsin and the digested peptides

706 were separated by HPLC and infused into a Thermo LTQ Velos ETD Pro Mass Spectrometer. The mass

707 fragments recovered after MS/MS fragmentation were subject to analysis by Mascot Distiller and Mascot

708 version 2.4. The analysis revealed peptides from subunit I of cydAA’ in the protein preparation with a

709 MASCOT score of 85.

710 Supplementary Figure 5. SDS-PAGE gel electrophoresis of partially purified cydAA’. A. Cell lysate

711 was loaded onto a Ni-NTA column (Lane 1). The flow-through was loaded onto a Q-sepharose column

712 and subject to elution by changing the salt concentration from 0-500 mM NaCl. Three elution peaks

713 (Lanes 2,3,4) which absorbed at 412 nm were pooled, concentrated and diluted to 50 mM NaCl and then

714 loaded onto a DEAE-Sepharose column and subject to elution under a salt gradient from 0-500 mM NaCl.

715 The elution fraction (Lane 6) which absorbed at 412 nm was pooled and concentrated and used to identify

716 electrophoresis patterns. Assays and spectra were obtained with a sample that was subject to a simpler

717 purification protocol – Ni-NTA followed by DEAE-sepharose because the yield was poor from the 3-step

718 purification protocol. Lane 5 was the Precision Plus Dual Color Standard (Bio-Rad). Subunits I and II are

719 similar in size to the subunits from E.coli cytbd (B). B. Electrophoresis patterns of purified E.coli cytbo3

720 and cytbd.

721 Supplementary Figure 6. Sequence characteristics of qOR4b-cydA from Caldivirga maquilingensis

722 (also referred to as cydA’). a. A topological representation of cydA’ using HMMTOP. The amino acids

723 conserved above 90% identity are shaded in black. b. A multiple sequence alignment of sequences from

724 qOR4a-cydA and qOR4b-cydA family. The former sequences are highlighted with the purple background

725 while the latter are highlighted with a gray background. The absence of the proton channel residues E99

726 and E109 is apparent. The ligand to heme d, His19 and the proton channel residue H126 are completely

Murali et al. ISMEJ-21-00087A 31

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

727 conserved. The ligands to heme b558 (H186 and M393) and other amino acids typically associated with the

728 quinol binding site in helices V-VIII are not well conserved. Two threonines Thr71 and Thr74 in

729 C.maquilingensis which take the place of Leu71 and Glu74 are completely conserved.

730 Supplementary Figure 7. Structural model of subunits I from Geobacillus thermodenitrificans

731 and Caldivirga maquilingensis respectively. The homology model of cydA from Caldivirga

732 maquilingensis was generated using the Swiss PDB viewer and visualized using VMD. The boxed

733 regions reveal more polar residues in Geobacillus, represented by red and blue, while aromatic

734 residues are colored in green.

Murali et al. ISMEJ-21-00087A 32 bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Figure 1. Diversity of the cytochrome bd oxygen reductase superfamily. The cytochrome bd oxygen reductase superfamily is divided into 3 families based on phylogenetics and structure – qOR, OR-C and OR-N. qOR is defined by the presence of the quinol binding site in subunit I (cydA). OR-C is missing the quinol binding site but has a heme c binding site in subunit I. OR-N is also missing the quinol binding site and is commonly found in operons containing alternative electron donors. Various subfamilies within each family are also shown (Supplementary Figure 1). The operon context and putative protein complex arrangement of each cydA-containing enzyme is also shown with a reference protein accession number and source microorganism. The potential gene duplication events are highlighted in yellow. A legend is also provided to mark the related conserved domains in the same colors and redox co-factors such hemes and iron-sulfur clusters. A more detailed explanation of the figure including a description of the various subunits and characteristics of the families and subfamilies is provided in Supplementary Material.

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Figure 2. Phylogeny of quinol-oxidizing cytochrome bd-type oxygen reductases. At least four clades of quinol-oxidizing cytochrome bd-type oxygen reductases could be identified – qOR1, qOR2, qOR-3 and qOR-4a. The long branch within the qOR1 clade (red star) is comprised of sequences missing the proton channel that is conserved in all other quinol-oxidizing cytochrome bd-type oxygen reductases. Subunit I of cydAA’ is from the qOR4a family. The cytochrome bd- type oxygen reductases that do not contain the quinol binding site (OR-C, OR-N, and qOR-4b families) were used as the outgroup.

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Distribution of cytbd anoarch in Archaea N aeota a ot e c_Asgardarchaeota 5 5 2 7 a 8 6 2 0 0 p s

0 5 S R A

5 7 8 8 5 8 1 6 0 2 8 0 8 0 6 p 2 s 0 3 h 0 0 p 1 s S 9 R 0 A 3 1 5 S 1 R 5 3 A 2 3 5 7 83 3 8 7 1 5 8 2 0 8 6 6 0 2 2 7 p 7 00 2 s 9 0 4 sp 0 4 2 p -1 0 0 s 9 0 8 1

5 p 1 R

3 -3 s 1 A

6 2 S -

c 2 0 2 R 1

9 - 9 A 1

7 1 A 0

2 G B 2

0

5 0 C U W

7 p 5 G

0 s 1 5

3 0 9 5

6 2 6 1 9

7 -3 9 6 5 3 4 8 6 4

02 5 2 6 1 5

0 -3 0 2 8 2

2 0 0 8 5 2

sp 0 p 0 6 8 0

- s p 2 0 0 r 6 1 s 0 3 p 7 G 3 3 0 6 s

- 7 2 p 7 1

4 C A S s 2 7

-3 B 5 0 -

0 U R 8 0 4

1 A 2 p 8

- 1 s 5 4

0 S 1 5 4

- 0 3 X

4 R Y 6 E

-1 A 8

5 1 C 6 4 F 2 4 5 P 0 0 8 0 9 1 p 6 5 2 s 2 1 7 1 0 0 0 8 4 8 3 p 8 0 1 1 5 s 0 6 0 S 8 0 2 p R 4 1 0 s A 5 S 0 1 a 2 R p 2 u 2 s - 6

A 0 2 6

00 6 - 3 9

p 1 2 9

1 0 5 p

s S - 1 s 5

1 1 0 3 1

3 R G 6 1 1 1 - 5 A 0

4 8 C 8 R 7 3

8 1 0 -A 5 2

4 9 0 1 1 0

4 9 0 1 4 0

X 5 p 0 6 p

E 4 5 s 2 8 s 02 1 1 7

3 0 W 1 6

0 6 G 0 6 p 7 Y 0 4 5 s 2 K p 1 8

H 4 0 W s 5 P 9

8 0 J 5 N 9

u M 2 p 26 6 8

b A s - 7 6

e ic 5 1 3 9 2

ra ra 2 B 0 3 8 0

r 1 U - 2 0

y c rc 8 U 3 0 p

h h 9 C -1 0 s

a a 4 F p 6

e e 2 5 P G s 6

u u 0 1 B 7 4

m m 0 1 R 1 1

p 3 5 N P

c s 6 2 I

r a 5 7 2 B N

y c 2 5 A

s id 2 0 0 3

t i 5 0 4 S

a M p A p 0

le h B s 0 M

n ia il 7 0 B

s 1 u U 2 p

e m - s

4 1

s 3 1

p - 0

r 0 2 5 P

0 -0 M

2 1 9 S

2 3

1 G 6 A

4 C 8 7 C 1 1

6 0

5 5 5 0

0 8 p 7 u 2 s c_Bathyarchaeia 3 7 0 1

0 0 8 0

5 2 7

2 0 2 N

0 W

0 1 0 F

U 0 p p

B p s s 5 M

s 5 1 03

A 3 2 0

5 5 5 3

8 1 X 6

1 A V 7

A B E 2

s B U P 0 p U 0 %

0 P p

0 a s 7

I 2 r 1

a 5 v 0 u in 0 a u u 2 a 1 r 4 6 C

r 7 c 0 0 D 3

c 7 h 2 1 F . %

h 5 a 0 P

a e 0 1

e u 1 0 2

u m p s p

G m s

C a 4 0 6

a c 0 5 3

A n i 4 5 0 7

- d d 2 9 9 2 r .

2 o 0 5 2 1

6 e p 1 0 6 A %

8 rs h A 5 8 0

8 U o il B 2 6 B 9

0 B n u 0 2 U

3 A i U u 0 0

5 i 4 p 0 1 %

4 P 9 s p %

s 9 a 1 s 9 9

p 3 r 0 9 % 1

0 s v 1 5 .

0 p U a 1 1 9 5 5

2 0 B r sp A u 2 1 7 0

6 0 c 6 9 4

8 2 A U h 9 B u 6 8 2 9 .

8 4 a U 1 6 6

0 7 9 B e R 8 2 2 7

3 6 3 A u -A 3 0 - 2 9 0

5 2 4 m 6 1 A 0 .

9 sp 9 11 9 p C 0 7 3 a 5 p s p e U 5 0 c N N 5 0 7 s 6 G s 0

B 0 s i a a 6 2 2 6 1

2 p d 7 u 5 -9

E A 4 0 ip n n 2 W 8 1 2 2 8 2

1 9 0 h o o 8 G 6 7 9 0 7

0 9 2 p p N 8 2 1 1 1 5 -3 % %

2 5 6 ilu u u a 6 0 1 0 A 9 0

1 0 8 m s s n 2 5 A 5 B u 5 1 4

9 5 7 il il o 0 4 0 B 1 9 9 1 - 6

7 lu lu p 0 8 p p 2 U 0 . 6

s 3 s s u p 2 s U s 0 5 0 -

p 5 s s 6 5 1 3 4 0 4 . %

1 a s i 7 7 0 2 p -1 0

0 c te l l 5 2 2 5 8 1 s 1 0 6

2 id t u 6 7 0 p 8

1 U 1 i te s 2 0 8 5 2 0 s 7 5 2

9 B - lo s 8 0 1 0 - 8

u A 1 b ri t 8 p 6 1 A p 5 2 3 5 % .

4 i _ e 6 s -2 4 B s 1 4 2 8

1 -0 A tt 2 1 1 5 3 6 5 1

0 - E e - 0 A 7 U 1 8 9 M 8 0

2 1 X r A C 8 5 8 A S 6

0 0 4 i B 1 R 3 6 B 2 7

5 - 4 C Y 5 G 0 A 3 2 0

4 8 5 G C 1 0 - 3 0 U 0 6

sp 5 4 9 1 p 1 6 0 p % . %

- 4 P 2 s 1 7 p s

1 2 5 5 9 0 2 s 7

0 9 2 2 7 4 5 u 2 0 5 6 0

2 s U 7 2 -4 u 0 0 2 5

0 p G sp00 2686215 2 0 7 3 7 u 0 W p 9 8 9 % n 2

G 5 0 C B 0 2 0 7 2 G s 4 5

W u A 0 0 -5 9 1 2 0 A 8 4 3 0 A 4 p p 2 7 0 0 1 8 6 9 .

2 U 2 0 3 s s u 0 7 1 2 1 B 2 .

0 7 0 1 U 5 5 9 - 2 p 1 sp - U - 9 %

1 U B 7 B ARS 21 6 1 1 8 1 0 s 7 A 0

0 1 B A 8 2 s 6 3 1 9 0 p 0 -4 5 C 0 5

- - A 1 4 2 p A 3 6 2 9 G p 7 s 0 2 0 9

1 A 2 5 5 0 5 u p C s 0 1 2 0 7 G .

4 R 9 2 5 4 0 8 S 8 8 9 s 1 1 0 - 2 5 85 1

- 1 5 4 2 3 R 5 6 9 7 3 0 1 1 6 5 0

0 0 s 7 1 5 2 5 2 9 2 9 -2 1 R 7 1 1 8

- p 6 5 0 s 0 8 0 1 0 8 4 A -A A G 2 6 7

2 s 0 s . 0 5 p 3 7 0 1 -0 9 B C 0 8 .

0 p 0 p 1 5 0 6 2 p p 9 -4 B 11 U 5 0 8 6

-3 0 8 0 7 s s A 0 U 7 0 0 2 0

0 0 2 2 . 2 2 2 6 C 5 A 8 0 4 p 0 0 %

- 0 4 6 3 5 5 0 7 8 8 9 B - 2 7 s 0 0

1 8 9 0 0 2 9 9 G 7 U 0 W 6 4 p p 2

6 3 9 1 6 p 0 9 9 2 - 8 1 s s

0 4 u 6 3 s 0 1 1 6 4 G - 5

s 2 0 6 p 1 1 7 1 2 0 8 8 3

p 0- 7 5 1 5 4 s A A 2 - 0 2 1 1

0 1 2 B B 0 0 - R 8

0 4 5 - 8 0 u p 8 A 8 7 .

2 - U % 2 9 U U 0 6 s - 0 -

G 7 0 B U 3 9 p 7 1 8 1 6

C 7 - - 1 s 5 1 1 -2 % 0

9 2 A B 0 1 2 1 5 0 0

A 0 0 9 U A -1 A 1 1 G 2 A %

- 7 -3 3 B P 1 B 5 - p M C 1

0 5 4 E 0 -0 2 7 s B C W G 9

0 s A C 1 4 U 9 3

2 -1 p 8 Z G 1 1 8 - 6 G 5

8 2 0 4 Q 7 - 8 0 7 9 9

0 7 C 1 6 1 a 4 0 5 0

1 s 2 1 0 7 s 2 - 1 8 5

1 p 7 G 1 - p 0 0 1 8

0 0 9 s 1 s la 1 0 - A 6 .

5 0 2 p - p n 0 p 4 2

s 2 9 0 0 0 d 1 s -1 B 0 0

p 7 1 0 2 0 1 1 U 0

0 7 5 1 - 2 s 7 5 p

0 9 8 4 p u 2 s

2 0 1 7 7 7 0 9

8 6 1 -4 G 6 0 8 5

4 5 5 0 3 2 5 8 9

% 3 C 2 7 5 6 0

11 5 s A 6 7 8 2 u 8

0 p - 5 9 6 - 6 8

0 2 8 A 3 6

5 0 6 5 6 1 -2

1 8 9 C

8 8 5 2 G 5 0 A %

7 9 0 8 1 C %

U U 0 p 1

1 B 6 p 9 s G %

B U 4 A 5 s 2 7 1

A B 7 s 5 6 1 1

A 1 5 1 p 5 7 R

0 A 0 0 2

l t 1 1 1 0 8 0 % -A 5

ia 0 6 6 0

r 9 2 1 2 5 8 5 p 1 11 5

c 1 1 6 7 5 6 7 s u 3

h s 0 s 8 7 9 2 7 0 0 8

a p p 8 7 7 - 8 3 2 4 8 e 1 s 1 9 7 A -1 W 6 6 r

u 0 p 0 6 8 8 C 8 4 9 2

m 0 U 1 1 5 6 6 G p 0

3 B 0 6 2 6 G 2 -4 s 0 %

s 6 2 1 0 2 0 0 p

p u A 1 u 0 0 0 1 0 s 1

0 9 0 p 0 p - 4 5

0 7 u U s p s 0 6

2 s 5 - 9 5 5 5

0 s p B 1 5 7 4 A 1 83 1

8 A 0 1 B 3 8

3 0 8 F 9 7 - 0 8 7

9 0 4 B 7 8 1 U 5 3 6 8

8 1 8 U Z 5 7 8 5 1 8 -2 6

5 8 0 0 B 6 8 6 2 5 0 2 5

E 7 - N % 8 6 2 2 0 A 0 8

X 1 s 1 A 2 2 - 8 0 0 C 0 5

A 4 5 p 4 1 6 - A 7 4 p G p 5

4 9 0 -0 0 1 7 A C 8 0 s s 0

l t 8 5 0 - 2 2 C 6 0 5

ia 1 2 1 0 G 2 0 0 5 2

r 4 8 0 4 0 G 0 p 2 1 0

c - 9 7 - p 0 s R 8 0

h 6 1 5 s s p A 7 p c a 4 9 p s 7 - 8 s %

e s - 1 2 3 1 1 6

u p 9 1 0 -1 0 G 1 2 9 6

m 0 5 1 2 - 0 - 8

0 s 1 3 4 1 2 A 4

s 2 p 4 -3 1 1 C A 0

p 2 0 u 2 S 0 W B

0 5 1 0 N 0 R A G G 8

0 4 - 2 a - A U

1 4 1 7 n 1 A .

8 0 4 6 o -A

7 5 - a G 0 %

3 0 3 R r C C

8 - 3 B c 5 G 2

4 1 4 G h 9 C

5 0 5 G a 3

-3 W - e 8 S 3

1 0 1 u 8

- 2 6 m 6 8

3 0 -4 2 %

4 . 1 0 .

1 9 e 0

s 5 - -1 q p 2 0

p 0 G A 0 u s %

0 9 W R it 9

0 s a % 3

2 . 2 5 p n 0 2 7 2 7 0 s 0 3 4 7 c_Nitrososphaeria 6 1 p 0 1 . 2

S 3 1 1 1 S

C 2 4 % - 0 7 1

2 A 1 8 R 0 4 G 5 2 R 5 4 A . h C 6 2

- 5 4 3 .

A % s u 5 0

0 A p 5 2

A 0 0 3

0 8

. 2 0 % 8 3 5 6

2 8 2

-I 0 4 0

7 1 6 0

5 1 p 1

5 1 3 s .

s 5 25

p 0 5 8 7 % 0 7 0 0 . . 1 % 4

0 % 1 9 4 4 S 1

0 0 % 5 % 0

2 1 C 7 R 0

7 G 5 A 0

7 % 3 A

5 6 1 8 1 C 0 8 8 2 . G

2 9 6 . 5

% 3 7 rd

8 . 9 5 2 a

- 7 . 0 . 8 g

1 2 s

4 0 9 7 A

. 0 0 2 9 5 0 s 0 3 p 0 u 8

0 0 A 1 9 . 0 C 4 2 8 9 2 9 0 7 G 8 0 a 8 d p s p 6 9 r s

6 1 % % a 1 0 2 5 3 g 5 4 N

5 % 5 % s 1 4 9 I

U U 1 % A . 2 8 B 8 5 % B 5 4 A R 1

B 3 2 .1 5 7 B H 4

A A 4 5 6 1 7 U 4 9

5 5 4 0 0 0

4 5 3 0 9 6 4 0 0 . 0 9 p 0 5

3 s 7 4 01 s s p 6

s p . 9 9 0 1 1 5 4 5

p 0 0 . 1 4 0 0 9

0 0 0 A 9 C 9 4

0 2 0 0 C 8 O 9 5 2

2 6 % 0 A 4 0 0

5 8 A G B AW 2 s p 7 C d U 00

0 9 7 G r 5 p 5 7

E 2 3 a 5 8 s A

X 1 5 rd s g 6 4 5 7 B

3 3 7 5 4 5 U

4 5 U g a A 8 9 4 1 5 BA

4 1 B 8 s 1 4 9 6 U

8 5 9 2 4 3 1

4 % A . A 74 1 0 0 9

- 5 % 8 0 0 5 0 0 6 5

2 5 0 1 0 p 8 3 5 p 0 2 9 5

s % 9 s p s 5 3 7 s 0 6 7

p s 9 1 6 6 5 3 1 1 5 p 1

0 p 5 5 0 -6 3- 0 6 7 5 U- 2 0 s 0 0

0 0 6 9 9 0 3 5 5 8 2 31 T 5 3 57 9 2 0 2 p 5 - 1 2 6 0 1 0 O 0 0 1 A s p e 2 2 % . . 1 5 s - -2 9 7 0 0 m I- 9 2 7 B 55 5 a

5 4 9 5 5 1 6 -2 1 2 p u p u G 9 0 U lu 0 2 5 r a le

4 9 0 4 4 5 3 - M M 0 0 s 5 2 s e J 4 0 hi 5 8 7 9 5 9 e r ta

5 W 5 1 6 0 5 0 3 C 0 0 5 1 8 9 n 02 s p p 8 7 9 7 9 a t s o

6 O 4 9 5 5 8 0 -5 C 0 sp p 6 8 - 1 8 8 5 ra 0 7 e o 3 1 0 2 4 1 n ro 85 5 6 . 0 5 3 p -2 4 0 4 s 5 9 D 1 -9 6 er sp 5 5 o l 7 2 0 0 2 5 1 9 5 4 5 v a it 8 R 5 2 9 s -6 5 5 7 A sp A 5 t 5 5 7 A s 0 s p 0 3 9 0 7 0 e N 98 - 5 1 2 2 5 7 M 3 6 1 7 S 7 u b 5 64 5 B u s i 00 a e i s s p 0 p 0 3 6 7 7 a d 26 S 0 . 0 0 1 8 C 5 8 0 1 55 L 3 s 3 1 3 U r e ic n p c u is 92 s 4 1 le 0 M % 75 0 0 0 -9 0 2- 6 7 6 0 8 5 A 0 0 9 8 A 3 1 a m k e s o a e n u 1 e a 0 1 0 0 ta p0 E 2 A s p 0 4 - 2 7 0 1 5 9 P 5 0 u m 8 B 0 q u o s u is c l o t ten 0 0 t al 9 0 9 s o s 1 W 4 p N M 7 1 p 1 9 1 5 p e 2 U 0 a c g b n u is s o p o p s p o us -S X 7 4 C 5 s E 6 5 C -2 0 s A 4 2 65 8 s a 0 s p x s o 5 5 n t e u tr o o s a s os s a it r i c O 4 1 G .1 0 5 5 F 9 A 0 9 B 6 1 1 3 h 0 e o 9 9 5 h u o n i t r le i tr e a l e N ag C R 4 3 5 -2 2 5 8 4 0 C p 7 - U 1 3 1 4 5 2 rc s p 6 8 us it r 6 1 8 c o s o te N Ni t a N a l t a el G 8 7 0 rd 3 4 B 9 6 G s 5 3 0 3 0 8 -N ia 1 3 i c N 1 7 0 um is itr s s o o t s o op - -S 4 0 a 5 8 5 7 5 9 - 0 0 0 1 1 ld 0 - 1 m 3 5 s u N ro o o s ro os A - % 8 g 1 5 4 7 7 1 8 7- 6 p 0 s p 0 1 6 a N A s 0 02 io e n it it r it r i t i tr W M 2 A s . 2 4 0 2 5 - 1 s p 0 2 C I S c o p 0 0 b o t N N N N N s 1 2 . % C A 5 8 X 0 1 0 9 - 0 91 s 0 2 A 5 B L o s s p y m s

IM p O - 4 5 G 5 2 . 1 E 0 p 91 6 G 9 9 9 2 0 5 A A 70 HR PA os 2 2 s it ro 0 S 1 7 0 5 p s 1 -1 B 9 - 9 p 0 7 0 tr 4 5 5 m N

C 0 R C s 0 d 0 0 6 s 0 0 G R - G - s 1 8 Ni 0 0 4 u

4 1 - p 8 r 4 A 4 2 0 G O N 4 1 3 1 1 a e

7 S G 0 % ga 1 9 5 -9 0- p B O B E -1 0 0 0 A A h

s 2 M s 0 6 s 0 C 2 8 5 s R B F R 02 0 5 U B B rc

p 3 1 p 2 6 0 G 8 -3 - 6 F 0 p 1 U a

0 8 - 0 2 . A d 2 1 1 3 6 - D p s 4 8 i e n

0 4 S 5 A r 0 . . 1 - 1 3 J s 5 6 s C

1 5 C 0 4 0 6 C a 0 1 5 5 - - 5 3 0 5 1 e n 7 1 0 5 g A 3 G G 2 - 1 -B 8 0 3 n

4 G 7 4 % 1 G s C 5 5 3 B B - R 1 5 6 0 e n i 2 2 . 5 d A .1 2 3 R R M 25 F 7 0 p % vi s 7 s 3 8 . r G 5 8 6 C 6 d 4 4 s a e n u % 8 p 8 0 % a d 0 2 5 5 0 3 J A 0 69 1 e r d % 6 5 0 5 g r 7 0 1 3 4 3 A 0 0 3 a la 5 1 o 1 0 5 0 2 0 s a 0 0 0 0 0 1 - A p 5 1 5 ph rg 4 8 5 1 8 A sg 4 11 03 C s 2 0 G- 9 s v e s p % 8 9 7 9 A A 0 0 G 5 5 4 8 s o e 4 6 9 02 2 4 0 0 4 % 9 1 A 1 C C 2 5 p C 1 9 O 5 9 ro a 8 1 5 . 51 3 . . 0 G 0 6 s S 5 - J 4 B 1 4 i t er 5 . 8 5 8 1 8 2 % 5 0 G 0 8 6 0 6 0 2 3 02 N a 36 A 0 00 3 % . . 9 % 2 A rd rd 5 p 3 8 5 1 1 0 0 95 0 p h 3 0 B 7 s p 5 1 9 . 0 3 a a 9 s 9 5 -9 5 5 0 p 4 s p s 4 4 U % . 5 5 s 5 1 3 C g g 9 1 15 5 5 3 A 2 8 2 0 s 2 4 5 a s o 1 0 0 23 ic u 8 6 G s s 5 1 0 2 3 S 6 2 0 3 0 s i 8 er r o 0 9 6 7 27 i g 2 6 0 % % 1 5 A A 7 - 0 6 3 L 5 0 0 I -0 1 8 p 0 n 0 1 a it p 0 % 7 4 6 ev l a 8 6 0 rd 1 7 R p 9 2 7 A 3 0 p G A s g e 5 p h N s 8 0 01 br pe 9 . 4 0 a F s 0 1 P 0 s J B 1 ar 2 s - 1 1 9 0 0 p0 s s o 7 4 g 00 d 4 3 5 0 % 0 1 U 4 g 0 0 s o 1 . % s p 5 s ic u tr o 5 A s p J 2 0 3 0 p 2 1 a p o P s 28 us g i % % 0 % % 9 . C A s B p 0 2 p 1 s -1 BA 5 e r s it r CS 0 1 u 9 i c ela N % . 8 G s 6 55 s 1 4 U 6 5 a ra N g ic 70 ag p 6 . 3 % 9 5 -3 G 8 0 5 8 p h a e l a 02 el s o 8 8 0 rd -1 - 2 3 0 9 2 M 44 6 9 s h % e 0 op t ro 0 8 6 -L 0 7 3 - X 6 5 2 8 s o p o p s p s Ni % 9 0 g a 4 % 0 7 8 - % 1 3 E 5 0 2 0 i o o s 1 o s s 5 i tr o . 9 3 s 6 - IN p 2 Z 2 3 3 0 n s i tr o s 8 it r ic u 0 7 N 0 . A 1 9 B s 0 G 3 N 0 1 sp r e N t r N g 2 9 . - 2 - 1 p 0 S 0 7 4 u Ni 1 la 4 0 0 G 9 L s 3 s p 02 0 c . pe 0 5 . 2 0 % 0 B . 0 - 9 0 0 I N a s s o 0 0 3 45 6 IN - 4 6 s p B v 0 ro s p 49 74 R 0 9 7 . 1 R c a i t s 48 3 2 1 . B - 4 A % 1 3 H s N c u 0 27 6 % % 0 % B 2 u % g i 00 0 a % . 1 % % -8 0 U 8 A ld 1 la s p p0 ni 5 % 0 1 0 6 B ca p e s 5 s li m 66 0 9 5 0 - 9 U o o c u 80 um 6 3 1 6 3 8 1 4 9 9 o s r o s gi 81 ae um 50 . % - t r i t l a 7 h ae si 02 B . % 4 6 .1 0 9 . Ni N pe 02 r c ch en p0 M 1 8 4 9 5 1 2 . 0 o p0 oa ar e s S 0 9 5 % 0 6 os s os so or lu s 9 6 9 6 % 0 itr _A i tr ro k m i 1 1 % . 0 t 3 . 0 . 7 % % N % m N it um i u M A . 3 % . . 4 6 % eu N ae in o p B S 9 0 9 1 5 1 5 a h i m os M e t I Z 8 . % 6 9 0 0 1 5 7 % % 5 c h arc 5 ed i tr e h M D N U 4 6 0 0 0 9 1 9 ar s o 51 s N t h a e G 1 1 0 0 1 1 9 s o ro 8 us a n t 6 A % 8 1 A . 5 8 0 1 . tr o it 78 il n M o h -3 - 3 3 . 7 . 2 0 i N 02 um o e c o a 3 s p 2 % % C 0 9 0 N ia p 0 op f a t c n 0 3 5 9 G 4 0 0 % l ar s os s h c o s 6 % 0 . 1 % d % 5 p a us i tr tid an u c o p 0 s % r 2 2 5 s 5 s s il N 5 i o s c 0 2 p % % a 9 2 0 7 lu um 21 o c M m c 0 9 0 0 8 7 1 g 0 5 5 -0 % % 9 m i op 75 su o e u 1 2 0 3 . s 0 p 0 7 5 25 R 9 % pu os 31 5 5 m c t a s 5 3 3 0 A s 8 2 9 F 1 1 . 1 o i tr 00 2 45 c h ri m 1 2 2 % % 9 9 3 2 0 d r os N s p 76 37 s u a p 5 H 1 3 . 8 % 0 9 8 5 0 1 J 0 i t s 5 43 27 p s n a l a 1 a 5 0 B 1 . 1 1 p0 0 N ilu 75 27 01 00 00 v oc u rip 8 d 3 M 8 8 9 - 2 0 s 02 % m ic u 49 10 p0 sp 1 o M o d a 5 5 8 4 . 4 0 5 0 pu at 48 15 s s s 35 5 lt a e c is l T ar 5 B S 7 . 2 % 0 8 p 35 4 p 8 s o dri 00 00 i lu ilu 25 05 a % % 8 e t c _ u h c M 3 4 s 1 2 s 3 % tr o a 0 s p m m 19 69 7 ha u A d e h S B % 5 0 4 3 - 7 3 % 7 1 9 Ni l us sp s pu pu 54 02 6 n s is r a 1 1 0 9 4 X 2 4 0 3 1 0 3 2 mi us il u so oso 01 p0 A 7 o m _ m e 3 s 0 9 6 . E B 8 5 2 % . 9 u i l um t ro i tr p0 s e 5 c C o u B p% . 0 4 4 0 2 A . 9 op um op Ni N s il us n M o a c m 0 . % 0 B 3 0 . os op s lu s m ig e c r ip o s p 0 7 0 EX 45 p U % 0 0 i tr os it ro mi pu i na m t c a c y 0 3 9 % 0 4 8 s 1 % N it r N pu so tal h u l c e 2 . % 3 3 4 N s o tr o c a ar a s u u l 0 2 2 8 8 3 9 3 9 7 % t ro Ni s c n m d s lo 9 . 8 5 2 7 % 1 Ni il u h o a i s s w 8 9 0 1 A . .9 9 % um a e co r _ P p s 9 3 1 7 6 0 B 8 % 8 op u c ip D 0 t 7 5 0 0 U 0 . % 0 9 os m c a yr 0 o 9 0 s p 9 3 9 % 8 i tr u l u o 2 n 5 % 7 9 5 0 9 . % 9 N 5 s s d P c 2 e 5 . % . 6 3 0 4 9 . 01 u v i y o 1 n . 8 9 0 7 % 8 0 43 b a r o c 4 s 2 0 B 5 9 9 5 . 5 8 3 7 25 15 t e M n c c 5 e % 0 1 2 5 3 79 . 9 . 0 3 00 r r n 1 o u 2 T 9 3 1 9 0 0 . 7 0 73 sp a e ie c s T 5 h 1 7 1 45 3 39 0 9 9 % % 02 us n th l % c h h e 0 9 2 9 9 0 3 0 . 8 4 p0 il e u a ii P u o e rm % 7 % % 0 5 5 p 1 0 9 9 s s um 5 m n s r T r 7 9 01 1 s 0 . il u op 79 o y a ik T T h m o . 1 4 0 0 1 0 9 0 m os 17 t h ro b o h h e o c % 7 2 p 0 A s p . % pu it r 23 e 0 P P c y s e T c o 6 0 s p B 0 s o N 00 M r y P o s h T e r h rm o c T % 0 1 s 2 % % % % 1 i tr o sp e m . ro yr y c s ii h rm m e c c h 9 . %s p - -1 A % 1 3 N lu s t h o 6 c o ro c i e o r o c T u e % 9 6 6 B 1 5 8 2 i a c o c c u r m o c m c u h s rm H 0 6 -2 2 2 3 9 5 pum n o c o o s T c o o s e T r 6 . 4 B B % % . 6 s o o cc 1 P c c c s h T o o c o T cc c r h a o y 7 1 2 . 5 . tr o s i f a u y u c u c p e c cc c co h u e m e d c d % 0 B % 7 8 0 0 N i en s s 5 r s u 0 r m h o u u e s l o r io o r o . % 0 r e ti _ o s s s 0 e c s s cc rm e c m t c t 8 1 9 % 6 % % ko imu d A % c p c k 0 o rm c u p r o o o c h 0 1 5 1 lu s r it i 65 io a o 0 h u 2 c T u s c e s o ro c c le u e 1 9 % 9 1 .9 7 mi ma ns 67 su e 1 c 0 ito k 1 1 o T h o s p l c f c o r s rm % . 3 pu s ne 74 5 m o c 0 u T c h e c g 0 e p o u u c a c % . 1 0 . s o ilu i ra 00 8 M li % u s 2 no lk 4 7 h cu e rmo a 0 ri a c n s c n le a 0 0 % it ro um s p p0 467 e M M s c 6 a e s r c m T T 0 c c c d u r 0 1 0 N op i lu s s 07 t h e e p u fu 3 ph n 5 T rm T T m o c u m h h 2 rei f u u th s fte c % 7 ros um il u 00 a n M t t 0 r 7 a ii h h z h o c s e e 2 s ic s io b n h . % % 9 Nit op u m sp 5 o h h 0 P io 3 g e o e il e c o a r r 1 c u s r a 7 1 9 os op us 68 s e t ana n 15 y s 5 u r c rm li g T r m o c s to m m 1 e ic e a s i e 1 . % it r r os mi l 46 p h h o o 8 r u m o i h c c p l o 8 n u d ro u 9 0 N i t u 07 a a f f 7 o o c o i e o c u 0 e c o 5 l u s m 0 0 % 1 N sop 00 e r n o a s as5 M M c o c cu c rm c u s 0 ra o co i c s 9 6 % % 5 i tr o sp a f t t 9 c 0 o s o o s s 0 n c e ii p 3 % % % N lu s s a id id 5 e e t T c c c o cc n p 1 c cc n r . 8 1 8 % 1 % m i p s i o io t h h h u c k c c u a 9 5 u u o 0.982% 9 7 % 1 7 pu 0 0 tid s s a a e s . u o u s o s u 0 1 s s fu % 9 . . .9 2 3 3 oso 3 i u u n n rm y 9 s d c t 0 2 p n % 9 6 0 % 7 9 it r 2 os m m o o a p a g cue il 1 0 i o d 7 0 9 . 5 N 6 6 u th th o y 9 e ka o s u i 9 5 e n i 0 4 0 7 % .7 0 . 0 m sp s p e e c a p r r y 8 z n 9 6 0 % 7 5 0 0 r rm oc n t re go g t 8 o u 0 4 . % 4 4 0 1 2 % % s 0 0 m c o 6 o n u h 3 p ri 6 . . 7 3 3 % p 9 1 1 o o 0 u s 0 0 n s na a e 5 h n 0 9 0 3 1 5 1 9 5 5 c co T s ii % o i r y rm ilu e . % 0 1 % 0 % % . 9 9 5 8 8 o c . h _ . . p iu m u 9 4 9 3 0 . .8 . 0 7 7 c c c 9 e A 9 h a a 0 0 0 . . . 6 1 0 0 0 u 7 5 u u rm 9 il s l i 5 9 157 5 s s 9 s i 0 u e 1 . 0 . 1 4 0 % .9 _ t o b 8 8 n . 9 0 9 0 A h c ir 0 s 0 0 1 . 7 9 % % 1 o e 9 o c ic . 8 6 i % 0 % 9 k rm c u 56 . 1 9 87 5 1 % . 1 i o % u %. 0 0 . . n a l s 6 % 9 0 9 3 % 2 M w it _ . 9 8 8 1 % 0 e h o A 3 5. 7 . 3 0 th e 5% 6 % % 54 % 7 % a n s tro sp 0 9 . . n i p 0 3 4 9 8 % 0 7 M M o h 0 0 . 8 0 1 9 2 % % e e c i 1 T % 6 % % 1 0 0 t t a cu 3 h . 0 0 7 1 0 % . 5 4 1 % 0 h h a M ld 1 e 9 . 09 9 % . a n e o P 7 r . . 0%% 7 8 5 % 1 . 1 n o t c a 3 T m 9 % .% 2 % % . 9 o c c h a o l a 4 h o 9 9 3 9 8 4 a a n c c e T 5 e c . % 9 1 0 8 ld ld o u o h r o 9 9 05 9 1 6 . 7 % 5 o o c s c e r m c 4 5 0 3 c c a b P o m o c 8 0 6 0 4 1 5 o o ld a 1 a c T o c u % . 4 % 6 % . 9 . % % c c c o t l c h o s 6 7 % %0 c u u c h % a u e r c o c _ % %. 1 0 5 9 0 5 s s M o oa e s m c c B 7 5 % 9 1 6 M _ _ e c r T o fe o c u b % . . . 85 . A A t cu d h co r c u s a 0 1 % 2 70 e 9 v M h s e e c r o s _ r 7 8 3 0 0 0 . 50 M t h i n il e a n s s c r m c o p c _ B o % % 1 02 e a 6 fe lo M t o p e u h c A T p p % % 1 6 . 0 p0 t n r s e h a c 0 n o s i u s h a h . 0 8 s h a ob 6 nuu t h n a 0 c _ lu s p e r r ilu % 7 8 16 n r a o ld 0 o A _ 0 m a 4 1 BA M o e % n c o 0 c c p A 0 o l v 1 6 6 % U b v o a c 2 u a s 0 in 7 % % % M e re ib c ld o 5 s c p 4 co e % % 5 0 1 e th v ac a o c 52 _ if 0 3 c l 7 . 1 % th a ib t ld c c u 5 C i c 0 0 c l a 0 1 % a M n o a e o o s u 1 4 u e % n e b c r_ c c c 1 s 4 8 s % 3 0 9 5 o b t h r t e A o u j a p 0 8 5 _ . % r M a e M r s c c s n % 0 4 A 0 7 9 6 c_Thermoprotei M e n v i _ p u f n 2 6 l 2 1 e vi e t o b e A 9 s e a s M M 0 0 8 5 it o . . 9 .9 t b h M br a c th s 0 v rv c 0 1r 6 . % h a c a e t a p 0 u e h e e 9 a 1 0 9 0 a n ten o et v i e r n 903 2 lc n ii t h t h . 9 li 3 0 0 o r b h b _ o 0 0 a a a 6 7 % 1 7 8 M b M _ r a n a A b r 3 5 n n n 5 % . um e r e e A e o c t g e 1 1 iu o o 5 0 % % fi l t v t h s vi b e o v 6 5 1 to to % 4 0 pt o ha ib a p bare r_ tti b 89 rr r 8 . 1 % cr y n a n 9 0 c v A s a 5 i s ri m M o c o 0 te ib s c h ct % s 2 % 1 eu e b te b 3 r a p a e fo ig % 0 0 % ha th r e r _ r e 2 _ c 0 lk r_ r n 1 arc a M v A v i 0 A te 0 ii A 0 m e 0. 7 Kor n o M ib M b 9 5 s r 1 t ic u . 1 s e t e M a e s p a 5 p _ 5 4 h . i c 2 9 % 5 p h th e ct th M 9 ct 9 0 A 8 a u . 5 53 45 h a a n a t h e a e 0 e 0 s 6 u e 7 62 9 % 4 % 19 ax 973 e o M n M a r _ n th 0 r _ 3 p97 r 0 30 t en 024 r a b e obe n A o a 3 A 1 0 5 i 7 1 0 9 7 00 s p0 s re th r th ob s b r n 1 9 s 8 0 1 9 0 . 1 % sp ot eu s s M p M v a e a r p e o b 9 p 0 3 . 5 8 - 1 pr t eu e 0 e ib n v i n o e 9 v r 8 9 351 9 %% % . . 7 1 H- B rm o pro t h 0 3 t h a o b b a b v i 00ib e v 5 0 0 4 % 0 0 . 6 E C he r m o a 2 a c t r c r e M b a 3 a i 3 61 5 1 9 % % . 5 T he 75 M n o 6 n er M e t e v e c 1 c t ba 1 5 0 1 0 088 T 770 e t s 6 0 M o _ e v i r ib th M t e 7 e c 4 % % 8 0 1 0 96 20 h a p 6 e b r B th b a ru a a e r 8 r_ te 6 . % 0 1 000 00 n h a 5 th e s a c m c n t h _ 6 5 A r 9 8 % 95 sp sp o e a v i p n te i t e ob a A m _ 5 0 3 0 4 195 us teus M sp ra n b a 9 0 o r n a r M re M n o s i A 7 . 0 1 % 30 r ote si pr o M e h s obc 0 b o n s p v e b p 9 lle o . 0 % 9 % 00 m op ie n mo e t h a p r te 3 re lle ti 9 e t ib t r 0 r ra 0 9% 1 . . 1 sp st u r m zon her th a e r 0 evr 1 U v y u 0 h a hae 0 a li 0 0 . -D ig e he s u T a n o a 0 3 M ib _B4 6 B ib a m 0 a n c n v i 3 e . % 9 % 9 9 9 9 % BE od lu m T t eu m n o s s p 2 e a 0 A a c e 3 2 o te o b a 1 9 9 . . 0 es abu p ro dic u m s p 0 6 t h c b 5 4 t e 1 b r _ b c 5 0 0 9 3 9 1 s m us m o la n i lu p h h a 0 8 0 a teo v 12r 9 re A r e t e 3 085 9 9 0 5 . . % lic u ol e her i s roph M a e 3 0 n r_ is s 9 v v r 5 . hy us T lu m ut e t e ra 2 6 5 o b B k s p p 5 ib spib _ A . 5 .51 9 8 . % 9 1 6 0 2 et l ic acu ne 5 h a r a s 6 0 r s or 0 90 a 0 a 08 5 9 0 i nom t hy hu r ob ulu m 27 n M M s p 1 4 . e v p e 0 0 c 0 ct sm 3 %6 5 % 9 9 on ha me r op Py ac 189 nti o e e t p 1 1 5 8 i b 0 an 2 4 3 t e 2 4 e .9 1 % % . ar b et n o di t r ob 001 dif o spt h h 1 0 9 a 0 2 i 9 1 r _ 9 r _ i th 5 04 % % 9 2 9 rac M t ha oxy Py sp al i se M h a a n 2 2 1 8 c 2 5 4 6 A 6 A i 4 9 9 . et Me ar b lu m m c nie n e t a e n o o 4 2 u te 0 6 3 w0 s i_ 0 6 .% 3 s p c cu c ulu gu h a r s s p u % r _ 8 8 5 5 o 6 m A % % 0 % 7 0 % at u lu m oba ba m o um n a p h 0 B 6 2 e 5 it . 9 % sur en ofi Pyr y r o ul u ati c o M s p haa w 5 s h 4 . % 3 no nd rm P bac en b a e 0 e e r . o ei ii 8 9 9 9 t ha pe The e yro ar s c t M M t h 03ra a 8 l i 8 % % 1 9 Me um 45 ens P lu m e r e et a 2 s s t n 0 0 6 7 % ofi l 57 on % a cu i u t h han 6 p a 9 ii 1 % 9 rm 285 uz 2 r ob n m anM n o s 6 1 0 0 d t . 03 . % . h e 00 _A 9 Py ce _ o e t o p 0 1 m % 9 9 1 1 9 0 T sp lu m 8 r edu B b a h a M sph a 5 72a n .% _A of i . er r i s p c n e h e 9 a 7 9 % 3 % lu m er m % 0 m f 0 te o b th ae ra 9 e 1 4 0 % ofi Th 2 cul u 03 ri a a n M r s 6 5 % % 9 1 8 r m 8 ba il um 1 um ct o e t a p 1 6 3 The .9 yro oph 6 2 _ e r b a h a s p 0 0 9 0 % . . 6 0 % P aer M 6 B i u c n 9 2 % % 9 % 6 l um i u M M e 55 s m te o 00 50 0 %. 0 9 7 9 acu 5 est e e t h M p 0 _ ri s p 3 9 0 9 8 % . 7 9 ro b 658 m od tha th ane t 0 B um h 22 0 M . % . 9 0. Py 151 um n a n o h a 3 1 s p _ a e 1 9 5 e t 9 . 9 3 1 5 0 0 % p00 l adi o o b a n 5 0 B r 2 h 9 9 % 1 m s oc ba b a c o b 1 0 s a c 5 0 a n 9 % % . 3 2 d iu her m 15 c c t e a 533 1 p 0 u . o 1Me t 9 0 % 0 0 5 cla T 083 t e r t e riu ct 5 5 0 n 9 b 0 9 % 0 9 .8 rmo 0 25 iu ri m e r 813 ic M Mre % 9 . 0 The sp0 M m u m _ i u 1 1 6 ul 9 e v i h anob. % .7 . . 3 % um U e t h _ C _ B m 5 4 i th e t b a M 8 0 % 1 7 9 1 la di B a C la _ B 41 0 2 a h a c e 5 0 % oc B A4 n s p p c 5 n o n te th rev7 . 8 % 3 her m o 1 U o b 0 a l u s p . % b o r a 7 % 1 5 7 1 % % T g - 9 B a 0 2 u d 0 9 r e b r _ C n i 5 8 9 3 % .5 9 8 7 5 2 2 s p A4 cte 4 i 0 0 1 v e obb act 3 % % . 1 0 97 7 633 n si s B 0 U 1 r 9 8 7 0 ib v i c u r % 1 % 8 % 9 . .4 016 g e N p o 0 2 B 9 i um3 4 4 0 4 a c b a ti ever 6 4 0 0 0 sp0 ui lin 15 at Na 00g - 4 A s p 8 5 4 . t c c ib % 1 % 4 r ga maq 65 rine t rin 3 229 4 41 0 _C 5 5 .9 09 % e r teu l a _ C % . ld i vi r ga 0250 ma em M M 1 6 s 4 9 9 02 c 0 5 _ D r_a r c 8 % C a l div i sp0 sac a m e e 1 p 0 5 s 4 o . 9 . D i tea rb % . 1 Ca r ga cha ahi t h M t h 8 1 0 p 0 9 5 n g 8 78 cu f r_ 0 % l div i N rev i a M e t a 5 3 1 0 6 o 0 9 r il Co riph 0 % 8 Ca at rin i tan n o et h a no 6 2 5 25l e 4 0 2% v a i fo a 1 % % 9 em b a h a n o b a 63 0 n s .9 % . tu rm r 1 1 .9 75 a p M c n b c 3 28 e 7 96 boi _ s lu 8 5 0 964 N ell ir et te ob acte 5 5 5% % i r % % 024 na at r ubr har iu a t e r iu 5 0 2 i p 0 6 5 sp0 nia i nem um n m c te ri m 7 hiA aeta a sou aje 0 ob fo r i u m s .9 3 lu . 9 5 ani s saet N a o tg .5 Me t a r u m p 0 % 0 0 . % ulc ani trin al i 8 c tem i f or0 5 % 9 . 1 895 V Vulc N em 0 4 hanori c i s u m 0 3 . 0 254 at r a pa .8 % umc u b t i c 0 6 9 2 . 0 p002 a i nem l lid u 4 b s m e r i c u 98 0 8 0 % 64 s phi l a a m 9 ac p r a m 6 5 .% 8 % 9 72 - r mo 2 85 lt un 1 % ter 00 n e _ 9 3 . 9 %X45 the ki a 1316 N a ens % i 0 u A 6 % 9 1 E t a_B ovs p00 D trin e u m 49 m 9 % 9 3 is ae outn et a s ut a_ Nat r em a s 97 0 1 % % . l can t a m nis a i strib 5 i nem ga p 6 0 9 0 Vu is ae ul ca t a d 624 a v ri 0 0245 .9 % 1 4 2 l can V is ae 0131 ers 0 0 1 . Vu l can sp 0 t a_C N a i for m M 968 7 9 % 3 9 % 2 Vu a eta trib u trin e .92 .77 e % . % anis a di s em a 0 2 5 t ha M 05 8 1 4 3 1 % V ulc is aet Na Na sal a no e % 0 5 l can B t rin t rine cia .9 % % b th M M % 1 % 3 8 . 1 Vu ut a_ e ma ma e 05 ac an eth e % dis t r ib 16265 sp eji n .78 t e ob a t h % % 0 % eta 013 0025 or en %7 M riu a no an 0 1 9 4 9 ani sa a sp0 725 se 4 et hm c te ba ob . 1 1 . .9 Vulc i sae t Nat 25 % a _ ri c a 8 % % 5 0 lc an rinem 0 no D um te c t % 0 0 uta V u t a_E N a a l .87 b s p _ riuer 0 9 % is t rib t r ib u trin e imi c ac 00 D m iu m 0 .9 0 . e ta d ta di s m a h ola 0. 8% ter 25 sp_D _1 .1 0 1% 1% .949 nis a ni sae i sp iu 0 0 D 9% 9 1 0 . u lc a ulc a anic a 072.88 m 5702 bry %v 0 V V H 4% _ D6 50 a et 8 1 % 8 % alo t e 7 s5 9 nt er u 9 % 0 . % 41 % Hal o r rig p0 66ii m 0 1 9 0 te rri ena % 0 5 % 0 . 8 0. 9 % N a gena sal in 0 24 . . 9 0.99 .97 %. 971 tron tur a U 94 9 9 0 % 0 2 % Na or ub kme 0. 9 .99 B 8 4 1 9 % 0 7330 89735 trono rum nic a 9 9 A 185 0 9 . 1 6 1 0. % 0003 N r ubr tib e t 1 9 % 8 0 % 1 sp at ro n um t ense 9% % 7 s % . 8 .067 % 7%Acd N oru exco p % 3 at ro n br um con U 00 9 % % 0 % 0 8 99 orubr sedi ense 0 BA 24 9 0 2 1 .11 0. um m i n .9 U3 9 7 . 0 u daqi n i 1 3 B49 47 8 . % ald ar i N gen % 1% A s 85 0 0 0 % 47 % i doc atron si 34p0 % 0 8 1 555 bus ac _A or ub 9 02 . 1 . 6 0. 03086 ul fol o l dari us N a rum s p50 8 8 sp0 S doca trono b ang 1 UB 009 9 % 2 % odai i F N01 us aci 5 r ubr ens % A 2 74 5 1 tok QE fo lo b 1912 Nat ro um th e 0. U B5883 5 0% haer a S ul p0017 at ar ic u nor u i ooxi A 7 97 . % 0 l fur is p s_A s A sol f 16085 br um dan 1% 0. 447 58 sp 05 9 Su ol obu bu s_ alli cu 0013 Natr sulf 8 0 0 . Sul f Sulf olo B met s_B sp i alb a id i fac 778 % sp02 6 0 % bus_ fol obu chah i en 0 50 .9 3 9 di cu S ulf olo Sul N atri ann % 02 4 . 819 isla n al ba s aoens 5072 1 9 9 bus_A 8 45 p001 i 0 0 905 76 % % % % . ulf ol o 02495 8 613 . 65 . 9% 8 9 792 0 S B sp0 Nat r 55 5 4 6 lo bus_ i alb a 9% % 9 0 0. S ulfo N mag a 0.99 M e 9 4% atria l dii th Me % si ba h u 6% an t h % . 4 % % ahuen Nat lu nbe 0 o Mea 1 7 9 1 1 us cop rialb a i rens . 90 the t hno % . .9 ci dia n N asi a i 0 9 r m ant h 0 A si at r ia l t i ca . 2 % 0. o ot erm 2 0 anzaen ba t a 62 99 ba he1o nus m iwan e % 1 c t rmba 8 Acid i a Nat r nsi 0. % er o%c 2% Nat r ia lb a a 99 th bat er 0 0 % 99 onoli m egypt 8 Mer m ct sp 0 % 8% 7% 1 0. it ali Hal ob nobi u ia % et a er 0 . . % 0. 96 hosp or an if or ma s ba e haut m00 9 2 . 6 1 i dia nus sul fidiv halo rhuen 0.8 noot ar 82 5 . 8% Ac i anus H alo t er rest s i 95% 0.19 t hr o bu85 8 8 0 . 99 % 0. 99 Acid bi for m ri 1% 8% er phr g 7 3 2 0 0 4 % % 0 . 28 % a laci m oi c uen5 % sals i Mb s i % % . 1 2 0 1 55 Halo b 0 Uet ac 8 . 4 .8 % 0056 52 ifo rma Bhat er 0 3 9 us sp0 r ie r le yi ni trat ir .846 A1no w . 9 0 1 A cidia n i anus b n ensi Nat ro educe 2 th oel 9 % 4 A cid llo wsto nensi N a no bact n % 51 r fmei 9 0 3 haer a ye era hako tronob er iu m 1% 5 s oi 6 . t all osp l lo sp ha act er iu g regor p0ba % 9 % 5 9 % Me M eta a cupr in a m _A te yi 01c t . lo spha er xcoco 0 50er 0 % 37 M etal nense .961 07_.A 0 5 0. 4 1% edula 95 te 0 haer a s H % 59ne % etal lo sp Hal o alo pi ge 0.8 0 7b.r 9 M Nat pig er _ r xana 24 . 85 M 03ar % ro noli mn A gole i duen si % 9 eth um8 93 obi us massi l 0 % a.n9%5 0 1 % 0. 9 _B in ne i ensi .988 ot 8 % .9 % .984 Na r m ong he 0 t ron ococ oli cu % r m2 0 0 cus jeo 0.21 us% 66 % N tgal i 7% f . . % 09 atronoc erv 9 3 occus i d % 1 0. 9 % Nat r occult u 0.63 u 5 94 % 1 1% 0.93% onoc occ 9 us am y 8% . 8 lo l yticu 0.014 4 0 0. 9 1% . 936% us font i 4 vid i cocc Nat r 0. 98 % 0. 1% % 0 1% Fer 55065 i nem a sa % 4 sp00 22 l if odin a % 0. 047 83% 4484- 20 Ha e 317 % 0.7 EX l opig er _ 1% % 5 B 1-Br 10- B dje lfi m 00289980 E2g2 sp0 a ssili ens RK- 14 sp 01563 8 i A 5 85 0.90.4 p00311685 B 6803 % 03056265 A G1 s 1- Br 10- E2 % % 0 7 V N01 sp0 an B 1 g2 sp001 1 40 NB aer a aggr eg -Br10- E2 563805 0. .9 0. I gnis ph g 2 sp0021 993 % 56 705 % 9 H a H alo s lo st agni col t agnic ola k 6 _A hospit ali la nd ic u a sp0006 a mekur a % gni coccus n ic occus is 91505 e I Ig Halo 0.9 0 s tagni col 36% .112 a la r senii % 877855 Tc 0 ni AV -19 sp002 - Br 11 sp001 5 .97 r opyrum cami Z 64285 3% Ae i x o_Sulfolobales Tc- Br 11 sp opy rum per n 001 564275 1% A er la gunen si 0. 34 % 1% Cald is phaer a Tc- Br 11-E 5% 1 1 2g1_A sp0 % 01564115 era sp002506 595 Tc- Br 11- Cal dis pha E 2g1 sp001 0 564255 1% .987 % % 35 H alo vi vax % 1 0.609 bu s sp00049 57 asi aticu 1% 8% Acid i lo 1% % 0.72 us sacch arovor an H alo vi vax ru 0. 1 Aci dil ob ber 926% 1% 6 425 H alo ar cul a ja po Hal oar chae U BA16 2 sp00249 nic a obiu s ir anensi H alo 1 ar c ula s p00148 557 1% % 5 % Halo a . 888 rcul a mar is m or tui 0.90 0 0 0.179% 9% H alo a 1% . 827% 0.99 1% 1% 0.085% r cula ar gent in ensi 0. 791 1 46% % 0.9 P yrol obus fum ari i H alo ar cul a hi spanic a % e rmus butyl ic u 0.764% Hyper th Ha 0.92% ic tiu m dela ne yi lo arcul a val li smor t i Pyr od Hal o 0. 75% rodi ctium occul tum ar cula ru bripro mont or ii .85% P y 0. 884% 0 4484- 217- 1 sp0 02254745 Halo arcul a taiw anensi E X 1% 85% EX4 484-5 8 sp002 254665 H alo 0.138% 1% 0.9 ar cul a am ylo l ytica St aphyl other m us mar in u QH - 8-67- 36 sp00 1% 3021755 69% hel le ni cu Q S- 6-64- 34 sp00302092 5 1% 0.9 1% St aphyl other m us H alo m ic r o Thermogl adiu s cald erae bi um muk ohataei 0.999% 0.646% 1% Hal omi crob iu m kat esii 0. 92% 1% U BA28 5 sp0024 95025 Ha lo m ic r obu i m zhouii 1% T herm osphaer a aggre gan Ha l oi r enta lis p e r s ic u 0. 995% 0.841% 0. 63% 0.965% D esulf ur ococcus am ylo ly ticu H alo r ie ntal is regu la r i M et hanopyr us kand le r i 0.969% 0.886% D esulf ur ococcus muc Hal ori enta lis sp00198 9615 1% 0. 675% 1% M et hanopyr us sp00 2201895 1% osu H alo r habdu s tiam atea 0. 961 G CA- 0028 SW -10- 69- 26 s p003009755 Hal orh abdus ut ahensi 1% 0.997% % 38935 sp 002838935 Hal apr ic um sal in um . 991% 0 .952% 1% 0. 982% 1% U BA921 2 sp9212u 0 0.678 UB A9212 sp9293u Ha lo venus sp0030210 15 % JdFR- 43 sp002011 UB A2 355 Hal ovenus sp003022845 0.976% 1% 0. 775% 02 sp002502 685 venus sp003022725 0. 971% 0. 0 S G8- E X45 Hal o 1% 0. 994% 0.55 97% .915% 52- 3 sp00 1595915 72- 165 sp002254885 nus s p003022085 78% 2% M Hal ove 0030 21285 0.4 SM 1- 50 sp 9B 1D sp002900 H alo venu s sp % 00 2506745 535 0.959 1% SM 1- 50 lo ven us ar anensi 1% sp002 496355 H a .995% 0 S 0 .781 M 1- 50 sp0 01 % 0 S M 595945 .991% 1- 50 s p0029 enus sp 000416085 71% % 0 00555 Hal ov sp001564135 0.7 0.995 .735 E X4 Hal ovenus % 484- 36 sp 002254765 % 0.8 UBA 1 0.53 68% 92% 1% 86 sp0 02494 0.485% 0.9 0 765 UB A sp003 021335 8% .21 0 287 sp002495 Halo v enus 3485 0. 83 9% .991 235 1 sp00 3058365 -68- 16 sp00302 % HAR CE L SW -1 0 1% 0. 03022745 2 - 9-67- 25 sp0 785 5 RB G- 16 SW -33 sp003023 36% 9 0 - 68- 12 sp86 Q S- 5- 68 0.7 0.977% 59% % 0.9 1% . 982% 95u 3021305 999% 0.1 76 0.9 7- 22 sp0 0 0. % 82% RBG -1 QH- 10- 6 p003021655 6 -68- 12 sp S-4 -69- 31 s 001800825 Q R BG- 16 3023165 % - 68-1 2 sp0 pl ex sp0 0 1 UBA 01 80074 H alo si m e 68% 147 sp0 02 5 ar ls badens 0. 4% 0.2 U B 499085 S G8- 5 sim ple x c 0.6 88% A1 47 sp sp00159 Hal o 5 UB 00249 638 5885 0040364 1685 A 9653 s 5 haeum sp 0 m sp00 302 37% 0.7 0. p9653u Sal in ar c mi cr obiu 0.9 22 906 U Hal o % % 0 BA108 3 % 0 .92 4 sp1083 1 . 1% 6% CO 4 u 565 99 MB O- 56 0030 23 1 C O - 21 sp - 68- 17 sp % M BO -5 00180 06 QS- 9 1% 6- 21 sp0 75 1 018008 23195 % 15 sp0030 254 45 5 - 7-71 -33 s sp0016 % 1% UB A472 0030228 6 SW adapt atu ilu 815 % .45 1% % 1% U sp8940 71-33 sp H al i halo ph 0. 1 0 .99 17 % B A472 u SW - 7- t us pauc 0 0. 6 75 s p0024 adap ta 0. 7 97075 H al s ci bariu ad aptat u 465 0 M e FE N- 6 Hal i tor eu 03%298 5 . 81 th ano 5 sp00 pt atus l 22 sp10 3023 46 8% 0. mas s i lii FEN- 3 3 15389 Hal ada H sp00 94 c oc c u 3 sp00 5 6- 65-1 5 1% s _A in t e 313593 SW - 1% s tinali 5 0. Me 21085 2 t hanom p0030 in ense 6 assil ii 8- 20 s phi lip p 1 coccus Q S-1- 6 haeum % M eth an sp002 noar c 1% omas 5085 4 Nat r o 0 si li ic oc 5 .9 cus lu 2 DT m in yen 9 U0 08 s si % p002 m 0 50 4525 phil u .62 um hal o % 0 3 D l orubr um 95 .9 0 % T U0 Ha li poly t ic 0. 5 % 9 . 9 08 sp0 D br um 47 9 1 13 01421 TU 008 alo r u ocurii 0.7 % 0 % % % 185 sp0 0 H br um k 81 15129 alo r u .6 . UB A 65 H rsic um 0 3% 0 9915 r um pe 95 9 .4 sp991 D TU l orub di 0. % 5 1% 5u 008 s Ha pr ofun 56 8 3 p0024 la cus .7 % % 9828 ubru m 6985 0 9 8 R um 5 H alo r 00228 . 98 0 En- M m sp m 0 % . 2 sp0 lo r ubr u r ovoru % 5 0 Rum 01421 H a ccha 4 5 . 0. En- M 175 um sa . 93 4 7 8 0 2 sp0 lo r ubr 5 0 % % 7 3 .3 R um 02506 H a 6191 6 6 3 87 E n- 255 p00 12 97 % % 0 1 % M 2 sp um s 5 0. %9% .9 % U 0 02 l or ubr 5275 63.9 9 BA71 50296 Ha p0022 se 0. 0 % 9% sp00 5 um s ng en 7% 2504 lo r ubr aid i 7 7 405 Ha brum .9 al or u 05 0 8 0 0. H 5642 6 205 .9 98 p00 1 0074 .9 7% 6 rum s sp0 % UB or ub br um 0 A 71 Hal al oru m 0. 0. UB sp00 H ib ut u % % % EX 0 98 87 A 71 s 25 04 di str 52 % 2 87 % 44 .8 7 5 p00 495 br um se . 9 2 75 3 84- 6 4 % % 25064 lo ru nie n 0 84 %0. 0. 1 sp 2 25 Ha alif or 615 0. 4 9 % 0 02 % I r um c 029 6 .97 % 2 543 SO 4- ISO 4 r ub sp00 0 63 85 G 1 - G1 H alo rum m 7 3 sp002 sp00 l oru b nic u 0. % 1 50 6 156 Ha trapa cale 5 5 1 % 0. 175 3305 r um r opi 74 . % 16 1% lo r ub r um t 0. 3 H a lo r ub e % % UBA3 Ha le ns 72 % % 0 2 8 m ou 9 77 8 sp00 Met zze 0. 7 .9 2506 han um e se 0. 0 90 5 opla s orubr r ie n 7% m a te Hal m co e 9 % r mi ubr u hens % .%9 8 tum alo r ui c 96 02 9 H benq iu m . 97 .9 1 m te st en 0 0. 0 1 1 % V ubru och % Ac J % adin alo r um h % % A id u dFR A C A1 H rub r 05 e 1 0 cid l ip r -45 RK 1 sp Vad Hal o 234 ens 3 ul ip of un sp - 15 00 24 in C A 0030 dom % . r ofu dum 0020 sp0 0 9836 11 s sp so 34 1% 8 9 ndu s 11 028 . 1 Va 5 p00 2 bru m brum ur an 9 3 % % 0 8 m b p00 395 781 4 Va din C 498 lo r u l oru lo d m 0.0.4 .96 .21 . 8 0 oo 032 35 4% di nC A 11 605 H a Ha m ha pi cu 0 n ei 750 A11 sp00 ubr u t hio 0 0 0 % .6 5 0 M et sp0 250 lo r ae 7% 28 .98 h an 025 940 Ha br um 5 5 % . % 3% ome 0534 5 lo r u 735 9 t hyl 5 Ha 328 9 3 ophi l p00 8%% 0. .55 % 0 us s m s 5 8 p00 ubru 9 617 t um 9 1 0 8 0 . 1560 lo r 04 ola i cu 0.9 92 0. 3 . 87 98% Me 915 H a sp00 acu pers 3 than um um v it us 0. 24 % Me om e ubr ubr en A % % Met t ha thy a lo r al or alo p cus_ hi i % 0 han nom l ophi H H H rsi ade nt i .79 ome eth l us s pe ekz di me 2 0 t hyl yl oph sp00 nit u m al se 0. % ophi ilu s 035 ope t us um . 9 lu s sp0 030 Ha l eni parv % 9 8 0 alv u 025 5 l op al o 1 3 % .7 035 Ha H % 8 3 M e 45 7 % % 3 than % . om e _B . 8954146%98 1% 1% 1% th y ic a 0 3 UB l oph l ag 0 . 3 .9 1 A18 il us pe ic a 0 0. 0 0. 4 s UB sp0 i na el ag % p00 A18 014 lo l am a p 9 1 249 4 sp M 812 9 Ha mi n 8 % 518 00 et 5 lo l a i 8% 0 5 25 03 h ano Ha i n % % 9 8 me 55 di m 1 0 .9 5 t hyl ubra 252 a se 0 6 ophi a r 020 i n 75 5 . 0 4 GC lu s mi n sp0 ol am 44 . % 0 A -0 sp00 lo l a na al 022 1 .9 01 24 H a m i H p00 9 4 . 5 856 9532 al ola a s nse 7 8 4 825 5 H in r re 9 3 0 % sp ola m mo . 8 . 001 al go % 6 4 85 H lu m nsi 4 1 3 682 acu 5 ell e 5 0 5 3 5 lo b 94 t yrr 8 % % H a 020 m . % 03 ulu % % sp0 bac 0 1 % % % 0.9 um a lo 235 6 ul H 96 % 1 8 9 1 bac 004 76 % 8% % 0.9 a lo p0 75 8 4 1 1 9 H 9 s 698 . 4 3 0 % 8 U R5 014 0 7 % 4 % BA 7H p0 285 . 4 % 9 . 582 J0 x s 82 0 9 3 . . sp er a 14 ii 7 0 4 00 l of p00 an . 8 % 0 Th 250 Ha x s vo lc 0 8 erm 270 era x si 00. 29%4 % 1 % 1 og 5 lo f f era l ie n 09 7 0 % Th ymn H a lo ssi n 4 U B erm o m H a ma f ic a .90. .99 4% % % % A1 op ona x i tri 0 4 % . 02 T la s s a f era en nt i 6 4 % sp her ma cid alo ax d rifo %0 . 1 1 8 % 2 00 T mo vol ic ol H f er lf u 58 0% .9 1 9 18 he pla can a a lo su si i 9 9 75 rmo sma i um H r ax on .9 1 0 9 42 pla ac ofe ib b 0 78 . 9 0 5 P i sm id o al x g e 0 . 3 cr op a s phi H er a ns ei 0. % . 9 1% A hi p00 lu m lo f ove an % ci di lu s 32 Ha ah err 0 0 9 pla osh 052 pr dit 5 6 % 4 sm im a 35 r ax me um 86 6 5 .9 % a ae e fe ax os 69 9 8 5 ol al o fer uc 14 % . . 8 ic um H a lo x m 00 1 % 0 % % % 0 0 . % H r a sp % 1 U B U lo f e r ax n 9 2 5 3 A8 BA H a fe ga % 61 .7 94 08 % % 8 0 7 4 88 U B 88 al o l on nii % 6 A8 86 H x e se . 0 . 1 .9 % sp 8 sp ra la r % 1 0 0 .4 1 7 002 86 00 lo fe ax 9 4 2 UB 50 sp0 26 B-D F e a f er 9 0 2 7 4 0 6 A8 U 91 02 87 KE rro H al o .9 % 0 . 0 . % .9 % 88 B A 65 73 355 U GC sp pla s H 1 6 88 19 BA A 00 Fe m 0 % % . CG sp0 86 05 50 -96 22 rro a sp 31 76 % % 4 0 0 7 7 6 0 .4 - E 0 3 sp 9 s 574 047 pla 00 % pi1 1 9 00 p0 5 s 05 F sm 25 8 7 1 s 38 27 024 p0 er r o a a 051 . 9 . 1 7 9 % 5 p0 15 31 98 00 pl a cid 85 0 8 3 %. 1 CG 02 C 65 84 965 sm ar m 0 % . 9 2 0 9 0 % -E 724 G- 5 5 U 74 a a an 5 % 0 . 9 . . p i1 7 Ep B A 5 cid u % i 8 0 8 s 75 i1 447 ip h 1 i m 9 1 9 % 0 p0 sp U B s ilu % s l . % % . 3 0 5 02 00 A1 p00 m 8 ll u fu 0 5 0 71 25 02 25 9 be ru % 1 9 % 89 06 sp 03 7 alo lu s atu 7 9 % CG 15 485 00 205 . H el v 9 3 % . - 25 0 l ob cla . 9 1 % 4 Ep 09 a us 0 9 0 i 1 22 H ell . 7 6 s p0 5 lo b % 0 % 7 % 1 01 i _A a m i 1 6 1 % 8 by H li fum u 9 5 753 als i um ru sc 7 . . 1 % 0 4 w s by um ri c m nse v e % 0 5 t um al i _B ll id et i c u ue um us 7 .5 0 .9 ra w by pa om t r q ic um n 3 % % G ad t um al s m ge me r in l yt um um br pl a n . 0 1 83 C CA qu r a w c u lo eo bo lo br t ic r u l o ata 1 1 G - 0 alo a d m tr i Ha og m u my r u ily m Ha n 0 % % % - Ep 00 H qu atu me al i cu ng a m ti n 95 ri nu us 5 % i1 496 al o dr eo H t r lo um nu la 41 b r a n 1 7 0 sp 13 H qua og me iu s an ra ge 84 sru nig pla 1 1 00 5 s lo al eo ag gr og m 02 ri l i l o % 92 24 p0 Ha H og el lo al nu p0 m a Sa H a 9 30 97 00 al l op u Ha H r a s s 5 . 0 0 % 68 496 H a at og m du 95 6 0 p % 5 13 H in al nu un 37 7 s 9 Cu 5 r d H r a of 6 9 % 1 9 % ni in o lo g pr 15 . 8- 6 1 0 cul 0 5 us a alo 00 0 8 - 6 % 5 5 . ip la 60 gi H H p 9 -1 9 % 1 sm 41 ela 5 a s S 3 . 4 9 7 a d 00 p 6 s t e 5 9 Q % 9 0 9 . 6 i v p0 alo 39 ha i a 21 . 7 . 0 ulg s H 56 lo el d 6 % 0 76 . 0 % atu t um 01 a f i 49 1 0 .3 0 . m r a p0 H t ch 00 0 % 1% 1 ad s li 0 % .7 5 87 qu 29 sta sp 6 % 7 alo g a us 2 0 5 % % H E2 lo h ti e 8 1% 0- a no ns % 5 % 1 r1 H lo ie % 9 . 0 M - B Ha be 5 . 1 . G L u 5 4 0 72 I Ia M P h 53 4 0 1 - I s G 95 um 85 9 7 % % 2 p0 I Ia 1 ri 14 s e 9 . 0 % 02 - I 96 c t e 0 en . 7 2 % % U 49 s p 04 a p0 ai 0 % 9 0 . U .B9 8 00 0 l ob s 55 nt 0 % 4 B A28 20 2 7 p0 a m 9 ji la m 4 5 9 3 1 1 8 A2 26 M 5 M 20 s H ri u 30 r u 6 % . % 26 s5 G G 09 70 t e 02 iu m n a 8 . 0 9 8 s p0% M IIa I Ia 5 B ac 0 r a li m 4 % 1 . % % p0 02 G -I - I s 7H ob p0 c te s n i lu %% 9 0 1 02 71 IIa sp p0 A0 al s ba m c e i p h . 6 % 50 94 MG -I s 00 02 H m l o r iu u id 8 1 0 9 0 % % 59 75 I p0 26 69 ri u a te ed c . 8 35 Ia- 02 96 95 te H c ri r a 1 % 1 6 I s 6 3 1 ac b a lf u m 8 0 5 p0 92 15 5 ob l o u e u . % 9 % 02 46 al Ha s h a 9 9 72 5 H um rc 0 1 3 9 02 e la 2 5 5 9 . 9 75 ha a 3 81 % . rc H 5 21 2 2 0 5 0 9 . 1 oa 4 0 0 9 1 0 % % er 2 9 5 0 3 3 2 2 9 0 1 a 2 9 6 0 0 0 0 . 1 an 3 0 0 sp p 0 3 % u li 1 % % UB al m 0 0 0 2 0 0 s 0 % il g a A H ic u p 3 -7 7 s p 4 h t % 0 60 i c s 00 9 9 - 0 1 o p e o 5 s m 7 0 s p 5 -6 -6 7 9 al j 5 % p0 f or 9 - 0 0 0 - 1 -1 9 - ih s 8 4 02 -6 - 7 21 n i S S - 6 .9 c c u 34 % 50 um - 1 9 0 o Q Q - 1 % u c 0 1 7 33 ae S -6 0 3 r a S 0 p a c o 3 0 95 h Q - 1 0 h a Q 1 s li 0 9 2 rc 5 S sp p u k a p 0 . .5 r a 5 3 Q 0 a s si 0 c c a l s 9 M lfu 3 -7 n n 9 o l m 1 % % 2 G s u 02 9 o p e 5 . i c a i lu 0 9 0 I I e 3 -6 m la 7 a l H h D . 3 7 5 a- od 0 0 - 1 no o o 5 1 9 0 %lk p K % % K1 al p S r o m 9 5 0 2 l a o Z 0 8 % s H 0 s Q a t s 5 5 2 0 3 0 1a rm P 9 4 2 p0 -7 N a 7 8 5 0 2 0 H e % 9 02 9 45 o n 1 9 5 3 0 0 th % 6 8 68 - 6 7 m 2 0 1 7 0 s p 7 5 . . 93 M -1 2 1 o 3 0 0 1 1 p 0 7 % m % 4 1 4 G I S 0 o n 0 s p 2 s - 1 u % % 9 9 1 0 0 % 5 I a- MG Q 0 3 t r p0 4 3 0 5 0 1 a e 4 4 . %. K1 MG I I 0 a s 3 0 -1 5 - 7 % h 5 9 % 0 0 s II a- s p N 4 11 p 0 0 9 8 -5 r c 26 7 . 0 8 0 % . p0 a- K2 0 1 3 A s - 7 0 S 9 % a 8. % % . 7 0 K2 s - 7 1 B 4 - 5 0 2 Q al 89 . 0 3 %6 % 0 27 s p0 6 9 BA C 1 3 S 3 9 1 h . 0 1 3 % % 7 1 06 p0 02 1 - C 1 Q 0 9 o 8 0 6 9 9 % 1 1 4 % 61 02 69 - A p 0 . % n .0 . 7 8 9 0% 5 71 94 S B s 7 a 99 9 % 1 0 . M 9 2 Q 5 C 9 0 % th 0 . 90 9 9 . 9 G 39 5 1 8 - 1 7 e 9 . % 9 9 % 5 3 I I 5 3 7 0 % 8 M . 0 9 8 a- 0 2 - 8 9 . 0 % 4 9 9 9 0 7 K1 3 -4 . 7 5 0 . 9 % % 3 % s 0 S 4 u m 6 0 %5 6 . 0 M p0 p 0 Q 0 9 i l B 5 % 9 0 0 . % G 02 s 9 . h i_ 96 u 0 . % 1 9 9 I Ia 72 3 4 a . o p e r 4 0 % 1 6 8 6 % % % - K M 7 1 s i o l 0 k r i 5 2 i 2 . % 0 1 9 . 5 1 G 51 1 n ic 0 rm r e ta 8 0 ae 0 1 3 7 7 9 2 4 1 sp I Ia 5 BA d e in e b a rk l a 5 4 0 el A 11 . 0 5 1 1 . 0 M 00 - K C e n m 5 th a o 0 sp p i_ p 5 9 . % G 2 1 a il a di 4 na b c u 5 s r s . 0 9 % % 0 8 . 9 3 I Ia 71 s p h u a e u 5 38 m i a a 2 n a a k e a 7 . % 0 0 - M 00 00 rr th s tic 4 0 e u r c i n v 0 i n r n 0 % % 4 0 . 5 M M K1 G 1 27 o s u s ly 10 5 a s a c a p 0 r c ci ba c i e n 9 0 1 4 7 0 8 0 9 .79 G G s I Ia 5 07 m u c r o 2 02 h o ar in s a r r c u . % . . 0 M I Ia I I p0 - K 3 s cc o c a 3 0 0 rc n s c a o s s a a s a s % 8 8 2 8 1 % 9 . %8 GI - K a- 02 1 35 c u o c i u a e c h 0 p a a o r n n o in o e 9 0 5 % 9 . 8 4 M Ia 1 K1 6 s c o c lo n s t ic in c 0 s n th n sa ci a n rc n a v 1 0 4 5 % 4 5 7 0 5 7 6 GI - K s s 94 p0 co a l a e ly d s a s p % 8 2 ro e h a o r th a a a f l 5 8 00 % 9 9 0 % % Ia 1 p0 p0 24 02 lo H H lin r i ifo 1 3 a t M t n s a e th s t h a 4 p 7 % 9 5 4 . . 6 M - K s p 02 02 5 50 a e a a l u s 3 4 2 % n e a o M e n o e n 4 5 s 9 . 9 % 4 % . 4 J 0 0 % G 1 00 72 7 7 H m a g s c 9 - 1 o M th n M a M ci 9 5 a 01 u 9 7 . 9 0 5 d I I sp 2 0 01 42

H a s s oc 6 A n e a h r la 9 5 4 in 0 1 0 8 0 6 9 8 F a - 12 68 11 14 5 3 % . .9 h u u c 4 - B a M th t a i 4 8 9 c 0 3 0 9 0 6 R 7 K1 2 95 5 5 u s cc c c lo - U 6 th e e os p h 02 5 4 6 7 a r i p 9 . %9 % 3 . % 2 9 s 07 6 c o o a S 25 e M M n o 0 2 1 0 9 s n s s p . 1 5 % 4 0 8 % -2 0 9 p 0 u 5 o c c o c H Q 9 9 M a p 3 7 0 o e 4 s 4 % J 1 1 M 0 c a lo 85 a l 2 . th r m s 9 1 0 ri n b P 4 0 50 3. 7 % 0 7 d 1 0 . % % G 2 7 lo H 6 H 0 2 e e a 0 0 p s t a o T P 2 4 5 . s . 5 IIa 1 a 2 3 0 M th in 5 0 s u th n M T % 9 6 . 3 . U 3 F p 0 % 7 0 M -K 5 7 H 0 2 0 a c 0 2 sp a c e ro 0 B % R 0 2 GI 1 2 5 3 p 0 n r 0 in la M o i n M % 9 0 9 3 - 0 . 0 6 Ia s p 0 s c i sa p a c h e a 1 . 9 8 % 0 A 7 4 2 6 . 5 % M - K 0 p 0 5 2 r o s in r a z r % % . 1 2 0 7 0 2 G 1 0 2 s 2 3 - 3 sa n a rc s a in n a a v o 1 % 3 9 7 4 s 1 % M I Ia s p 7 2 s 5 1 5 o a in a o 5 r c c i m ti 1 0 1 0 9 . . 8 % 0 p 1 8 . 0 G - K 00 3 c u 70 2 - 6 n th c s n 6 a r a e e 1 . 0 1 9 4 I I 1 2 11 c 1 3 0 8 a e r o a 9 s a in c ia 9 s . 0 0 6 % 9 a - s 50 5 c o 2 0 - t h M sa n th 9 o s c a i l 5 % 0 0 . p 5 % K p 0 6 l o 3 0 0 QS e o h a e 9 6 a n o r a ic 0 0 4 . 2 9 M M 1s 0 2 70 a 0 s p M n t M 0 h an sa in s 1 0 0 G G p 7 5 H 0 5 0 a e 0 t h o c a 9 0 2 9 1 J % IIa II 00 1 0 s p 7 2 - 2 th 5 M 0 e t n a r i n . .4 2 0 d -K a - 2 7 2 6 1 4 % e 5 p M e a s c 9 9 3 F 0 1 K 0 0 5 1 2 - 6 M 47 s M th o r 0 % 5 6 0 s 1 s 9 7 4 - 3 0 6 6 6 a e n s a n u % 8 5 R .9 p 0 p 0 6 0 - 1 i n a o e 9 1 9 1 1 - 0 0 0 5 3 - 0 H 5 3 c M th n i t 3 _A 8 % 1 4 0 2 4 2 7 - sp Q 0 5 ar e a n l u 9 7 5 9 9 8 2 S 9 0 3 s h a y 7 a % 8 % % s 4 .9 1 7 6 Q 26 . s p 2 o M e t i 5 lc th A p ce 8 p 8 5 9 4- 7 n d 0 u e _ s u a 1 % 0 % 8 0 5 6 0 n a 1 5 a M n 2 v s 9 7 n S 0 . 8- i 3 t h f u 7 s m n 3 9 i 9 % y 2 9 - r c 0 e o 0 5 e s te 9 D 2 d . n 2 % M H a 0 M r 2 5 3 i d e u 7 _ 9 n 2 tr M 0 0 9 G Q s p p 0 5 6 o id l A a p ru o 1 7 M I no s i u s 0 6 8 c o h y ii 9 5 B e s a 0 p e 0 1 . G M Ia- % a a r u p n 5 0 c c t n c 4 h t 0 9 % I Ia G L h in a i b s a 7 5 o c e o 3 5 U a _ A h o h 7 % 0 5 -L I I 4 s 4 t c d n lo s r 7 2 c o m rt 0 in x A e 5 2 a - p e a r in c a o u le 5 0 5 o c u 5 45 d r i _ 4 a 0 rm . 1 0 9 sp L 4 0 0 4 M s t u l n b to 1 p0 2 a n o e s b 2 4 n h ix . % B rc 1 0 s 2 o s v a o o 0 s a 7 h n d s % 0 4 u t r % o h . i % . % 0 2 p 4 9 .6 n u t h o l r 0 i c 9 6 t h a i e 0 9 r o th g S a c o 1 7 M M 4 0 0 5 a b s e n h lu p s d 5 e t o d % p 1 a n 0 1 - e 9 A 0 6 G G 9 6 2 4 0 0 % th lo bu M a y c h i s b u n 2 1 8 M e cc i 1 s 3 A n h a no a 1 3 y r cc 5 II I 6 7 1 5 e o o h s p s o l a 4 0 M o co 5 3 0 _ ce A th a il % B 8 n u 9 c u . 2 a - Ia- 3 5 81 3 M n l t p o u l l 8 0 c c 0 0 s 5 _ e h o tr m h % 9 L 2 L 9 . h a no e s r 5 b o o 0 p o o 20 p n du 9 a x th % p 9 ii g sp o _ 1 a s 5 % M s 2 5 t a M u h 9 lo n h 5 s n c 0 s e 5 1 e ri M e o d B - p A 9 e s G p s p a 0 e h b yc 6 o h a s 0 2 s a o 9 A 3 c re 4 6 c h M 3 t % a M o 3 0 h o h 4 IIa 0 0 85 M t l o s 1 n t n 0 n t h n . 1 B 0 u o 5 2 n a t 5 ce 8 r g 8 0 B o b % g e -L 2 M 7 e o p 0 a e ra p a e a . U 2 5 d itr 9 9 i 5 o 6 n e - s 3 o a u % l n % M 2 4 9 G 4 u % M n s 5 t h M o s r th 0 1 re n 3 2 d 4 B n 7 2 oa 8 1 o t 3 p 1 g r t o g G s 9 I Ia a u 2 e v o M e A 0 o 8 5 0 n 6 _ a 1 c a h 8 0 6 - c a b l 0 M IIa p 1 9 - L 4 h b 00 M lo n s o v 0 B 6 tr s 5 2 2 5 0 ru 2 a th 0 9 m . l a 2 3 h n 0 u ie 1 G -L 0 0 5 2 e t lo p y a l M U 0 i n 5 5 0 8 3 p a 0 e e 5 r % lla o n s 0 1 8 a iv s n . IIa 2 27 s 1 o s h r y 5 n e 3 0 0 4 1 s h 6 c 2 . e ic o p 3 7 e o _ s 9 % - s 0 p 0 M n s e t o th 2 s d 7 9 p 5 5 1 a M 0 th % 0 e d c 0 1 5 sp u r . C i 0 L 2 p 0 6 0 0 9 a u v e 0 n re 8 4 s 2 5 1 A 0 in 0 0 1 c e 0 5 m a 9 9 s 0 6 2 7 . h b m lo m 0 e e 4 0 A 2 2 R _ 0 d p B 5 o lu ll 3 8 0 n v . p 0 25 5 2 % et lo o y o i i lu m p d p 2 1 _ 0 2 H ix p n s _ n a a 1 2 0 c e J 7 9 0 0 6 8 0 n t h n s h ii iu e u s e o 0 3 s 0 0 r s u 4 i x % a p _ 3 7 3 a 9 n d 2 2 2 4 8 M no a e a n p h b n a a t 3 r n 0 0 n p 0 th r 0 r 5 % % th A 9 5 1 l e F 8 6 9 1 5 a th h le o a lo li 0 e a p 0 e s p o A a th 1 la 8 7 d f R % 0 M 8 5 9 h e o m e t a a l m a i ti g 2 p h s p d 4 s n _ h 2 o e l a 5 0 a 9 ic 1 G 1 4 t M n c h h zh s A n o et s s r e u 4 8 a ix A A n % % 7 1 6 M e rv 5 9 r u -2 % . IIa 5 9 e a M t u s u e B n s 6 - 3 h r _ B a . c 3 iu J 9 - . M h r s ilu e m v h a M e n pe 3 2 1 t th ix U h 1 o o 5 % d 2 9 L 3 t o ilu h s u e U t d e 5 7 - e o r 5 t 7 9 n ry 0 s s 0 % e p h p u s e e d no 0 5 4 M n h 1 5 e 0 a z F p 0 8 p 1 M s p lo il al m M r re a 1 4 8 a o t 6 3 M 9 th a 0 1 . R 0 1 1 l u o h s iu e h p X 4 h 2 3 5 e - 0 0 . % 8 i l ha p o b p p e t s E 4 t a n 6 6 % . e . % 3 3 2 9 9 2 h a o l o n o o o e 6 X e 1 0 0 M 7 0 1 u p oh n a a al n n M 3 E M th 3 5 9 s 1 . 8 l o n a h h h a a 5 e 0 2 . 1 0 0 p 0 7 0 1 % a a h o t o t h h 0 0 0 6 0 h h t n e n e t 1 M p 0 0 4 0 0 2 % o t e a M a e ii s p 0 4 . n e M h h M M A n 7 s % 5 2 5 9 4 a M t t B - % 0 1 % th e e U ge n 4 % . 1 1 4 e M M n e 11 4 % % 9 0 % % 0 P F 2 U 3 o 9 M e h A 1 Q 1 B . s B 8 7 8 5 A % 0 9 e P 9 so U % 8 E 0 id o o s l % % X 1 . % 9 9 . 9 n e ix Z F 3 0 9 . ia i d 0 r 6 2 . 9 0 e 5 9 70 s on 5 t h 9 1 . 7 1 r 2 5 p i 0 o 0 . r % 1 0 0 %. 0 0 a 0 8 5 n 9 8 s o Po 2 a lp a 3 9 0 5 9 a % 0 2 p g s . 8 0 s 7 2 h . 0 5 h 9 0 . 0 lo p % 3 0 8 e 6 a 8 .3 6 t 6 0 b 8 % . i d 4 e 4 5 2 5 5 e . . % 8 3 u 9 2 8 .8 4 o n 9 5 2 M 8 1 0 2 M % 0 1 s 1 % P ia 6 G 9 2 0 6 5 8 1 0 8 o s 6 I 0 p0 0 8 0 % 7 8 0 p u P s e p % % Ia 3 0 s 0 . l A % 4 . 6 o i 0 - L 0 p 4 % 7 a r 0 0 % 9 s e d o 0 2 M 3 0 5 ix s % 4 0 % 0 5 c c 0 % id P n 4 G s p p 2 r % 5 id h . . 0 o o ia 9 4 I 1 s 4 th ix 5 8 9 9 7 u A a 9 0 9 .1 n s s 6 1 Ia 2 2 -9 1 5 o r 1 % 4 r e 5 ia e i p 0 4 -L 0 o 3 7 6 3 5 n th . . c o 9 .9 0 1 3 s d o 0 5 % 3 6 -4 1 0 a o 7 9 6 5 h g 0 7 . 1 % Po p 0 n 2 7 M sp u - 4 0 2 h n 0 . % 6 4 5 a lo 0 0 5 8 8 s 0 ia 2 G 0 p 0 2 t a 7 % 8 0 3 A e b . % 0 % e i 2 s 2 0 IIa 0 W s 0 e h 5 N o . 7 . 2 3 % P d 7 1 p 69 - 2 7 S 9 00 M t . 2 0 1 % . 6 2 M G g u 2 1 . os on 4 0 0 5 . L 3 2 G p e % % . % 1 l s 2 0 % 2 5 e i a 30 2 5 s 3 6 s M 0 8 0 4 E e o _ 3 M P id s 5 5 0 9 M p 3 % S m 8 7 1 % % 4 %. 0 1 1 o b A 1 . 9 % o s o p 2 G 00 5 t u 1 0 a g G u s 0 9 G 0 9 e n i 0 2 5 I I 2 4 a r 9 9 2 0 5 lo J s p 2 2 II 0 1 id a 0 2 95 M G a - 4 9 5 5 n 9 . 8 6 . 0 p % 9 s b e d _ 0 0 . % b - . 3 P o n P s p 7 % I L 3 6 9 9 7 li % % s 3 p o A F B 9 % Q 9 . 0 7 % o i o 8 04 I a 84 9 7 38 s a 5 i 9 5 5 6 0 u g 0 % 1 0 9 s a s e 3 5 M -L sp 5 9 6 5 5 o % 9 0 1 9 r 0 0 1 9 9 0 s l rc R p 2 . 5 . % 1 e sp id 3 9 1 G 3 0 . 0 0 5 0 9 9 n . 1 82 t 1 0 u 4 3 a o A h - r 0 9 s p 3 . 4 5 i d 0 o u 5 IIa s 0 2 0 3 1 3 9 3 a 8 s 9 7 3 D 2 2 1 G h b 4 o 1 0 9 % on 0 n - p 0 7 0 0 0 6 6 N 1 0 9 3 lu % . 0 9 1 0 5 9 6 a u rc a 1 fu 1 8 7 0 % 3 7 ia 2 4 ia L 3 0 3 1 p 0 p 5 5 % . 0 a n % . 5 U 9 0 6 4 0 n s h e s n 2 2 9 s 9 s s 2 6 3 s s 1 1 5 p a 3 2 5 K 9 9 9 4 A s a o p d 1 5 0 % 6 U % 8 p 5 p 0 p 9 9 5 1 9 0 0 00 9 0 2 % t 0 0 0 3 L p p 1 2 N g p e g 0 u 5 8 5 B 5 0 M 0 6 4 0 0 0 4 - 1 p p 4 . 0 la a 5 0 1 . s s 0 a 0 o lo 8 A 0 % G 0 5 2 2 5 8 4 3 2 g s s 0 u in % 6 p 3 s 5 5 M ri 0 g b 0 % . 9 5 % . I I 24 6 9 7 2 5 - U 9 6 9 p r 1 % 9 .% s 0 0 s u 2 E 2 l u 2 A A 8 5 9 . 9 a 9 4 2 -7 - 1 1 % % 0 s e im 8 lu l G o s 0 r r s 5 0 -L 7 7 5 1 W 1 0 2 g 2 g % 7 a 1 l u 6 5 u u 0 1 e 4 b 1 c ch 0 5 p 1 7 1 6 2 5 35 S r 9 1 h s 9 % 09 il 0 2 c lc 0 o 9 u f 0 h M 0 3 . M M s 5 -B -U -U . 9 7 6 p lli 4 h 2 0 l a p sp g 4 s u 2 a a 2 0 1 % 2 G G p 0 M 1 10 1 0 4 s o . 5 % p A 0 a c s 0 l 7 lg 8 e e . G 25 % 0 II I 0 G B r r 3 A o f % 5 o p % c o a o 2 s i 5 o o 9 % M I I 0 2 % a Ia 2 I -B - B 0 6 B n u 5 o 7 0 1 B s o n e 0 b 5 p d g g G b 5 . - L -L 4 9 Ia P o 1 L . a c 1 n 2 . 7 7 m 5 U n a 3 u 0 u lo 1 5 I -P 4 0 7 8 M 1 M 1 9 - L P s B P 7 U h li 3 a 5 r 6 7 1 % a lin 1 s 0 A lo b Ib 9 G s G s 5 1 o e . t o 8 h . 8 e 9 0 h th o 9 2 r b u % 5 -P M s p 5 . % 7 I I p 0 I p 4 5 s s i d 0 e n 9 t 9 0 9 5 th 7 2 1 t e 4 a 5 c u G 0 8 0 a M 0 I a 0 p 0 e i o n 0 5 M a 4 e . . 0 6 9 A e n 4 c 0 h s s_ % s 0 M 8 -L G 2 -L 02 0 d o ia 5 2 M 0 s 4 B % M M a 3 e 7 a A _ p I I 2 9 . 1 I I 5 1 1 2 n 4 2 th 0 5 5 i u 2 6 th 5 ti 5 e r A A M 00 b - 7 G 9 % s a 0 6 s 7 4 9 i a s p 2 5 8 a 5 e 0 0 0 7 2 s le 0 U e v 4 o c s 0 G 2 P 2 2 6 IIa M 5 M p0 -L 2 p 03 9 s 0 2 ic 7 5 p 0 5 l 0 4 o 5 g h s p 7 s 5 0 - G G 0 1 7 0 1 0 p 0 0 3 c 0 0 l is s 5 0 5 n u p 9 M ra lo a u 0 . IIb 0 p 9 M % L1 II 2 I 2 s 5 0 2 5 1 5 0 0 2 7

b 8 6 9 l 9 5 4 e c s . 0 5 i e lf - M 1 0 5 G a M Ia 5 p 4 2 2

2 2 m 3 9 fo is 3 5 p 1 9 rg o 5 n b o a 2 9 P 9 0 I 0 .4 s p - L -L 0 2 0 0 9 7 5 2 0 0 r 5 1 9 o ll 8 5 s 5 7 9 u n s u g ti 0 M s G 6 5 24 Ib M 0 1 % G 1 6 2 9 7 2 8 7 0 0 f o 7 3 4 o 2 6 s 0 6 4 a u 2 0 s l c 9 G p II 5 - . G M 0 s I I s 0 6 0 0 5 5 p p 0 0 2 5 n f 5 0 1 u 7 9 2 o h le _ o a 1 I 0 b 7 Q 6 7 I G 2 p 0 a - p 5 87 5 3

4 5 0 s s la 0 0 0 2 a o 6 0 e 8 8 b t l . 5 A b ll 1 Ib 0 - P 1 1 Ia M I I 6 0 L 9 0 5

1 6 a a u 7 p 0 2 th n 5 0 p 0 ll 0 1 0 s e u 0 s u id 2 1 -P 2 9 s 3 4 -L a 97 2 1 5 1 7 4 9 l 5 l g 1 s 5 p 5 e a 6 8 s B 5 u 5 0 u M c 0 8 p s u 3 7 2 sp 5 M p M 1 G -L 7 1 s p 3 5

6 3 u 3 u e 3 9 s 5 9 th 8 2 c 2 0 p e o 0 _ 5 s 4 0 G 0 G II 1 0 7 M 0 u o % 8 % 1 5 8 g 3 g r 0 l a 4 3 4 M e 1 i 9 s i _ 0 o 0 0 s ll n 4 0 A p 8 0 0 2 0 M sp a- s 5 23 G 0 0 2 5 re 1 r e o 0 u 9 3 2 i in 4 u r 0 n p s u m a 9 0 0 1 2 II 7 % G IIa 0 L p 5 2 0 0 u 9 o 4 o n p g 9 4 2 0 M i 9 r 2 le ig 5 p a 0 s u c u h 1 2 s 0 2 5 5 b- 2 . I - 0 1 00 5 I Ia 49 s p 0 3 4 n 5 1 n a i s e u 50 9 1 0 s 6 ig i m 0 l n 4 s h sp s e o ic t 0 4 p 1 4 0 Q 5 6 Ia L 1 27 s p 2 -L 5 p 0 8 a 9 3 5 h e r 3 9 4 u A n 0 n d 0 u s 6 s t u ll n h e 0 9 0 9 5 1 4 0 - 2 0 7 1 5 2 1 s 8 0 h 8 0 7 ha t n la o 2 2 A 1 4 sp _ e 0 s e p c ri 9 u e s e u a p M p 4 0 % 0 8 6 8 s 7 5 0 L 1 sp 7 0 2 s 35 5 g 1 5 t 7 0 6 t e o u n 0 0 is b 0 ri s s o a 3 u ll c o 5 6 2 4 5 p . 0 4 2 9 0 p 1 2 l a 1 2 e 5 p 5 9 e M o 5 g a 4 0 B 0 38 8 s o p a s n 8 lle M e u o th tr 7 s 2 0 . 5 0 7 1 1 s p 0 8 7 9 0 9 - U u p 0 1 s 6 3 b 3 e h 1 p U 5 8 e n s s u a m 2 ll c n e 3 5 1 9 1 5 0 0 0 2 5 0 2 5 M 0 6 3 0 g s 0 M 3 0 8 M 7 r t 1 s s p 8 1 2 a n o s m u e th s 0 cu u o a o 8 m 0 2 1 % % 0 4 9 G 0 2 5 r 1 e a p 0 la 5 2 la 2 o e s p a 4 p1 A n e r u l e ll e u 0 o c n h M n ci 9 lu 1 9 7 2 . 2 96 4 I 4 1 B r l s 0 5 u 9 0 u 0 n M e 8 9 i u o o s l u p o t e u 9 0 % 7 7 M 8 50 9 5 Ia 6

0 0 1 - 7 5 o u p 5 g 4 0 g 5 a a n D 0 s B te c g h lle u u c M lle s n n a e g i a 4 c 5 6 G 6 5 - L 7

p 3 n g la s 5 0 e 2 p e 2 th e li 4 U a u le c a a th o a r m 2 s . 7 7 0 % 4 0 5 1 35 s B 0 a re u 3 5 r 0 s r 0 e 5 n o S 5 8 a l A ik s cu l o o c u s th h e M r l a u 0 u 8 5 IIa 2 s 1 2 2 th o g la 3 9 o 0 o 0 0 li n u 2 3 g i _ h u o u n n o u e t d o c v 0 p % . - 9 5 p

g 2 0 e e u 4 4 n p la n p M 1 o a 0 1 n m e c le c a a le e M y ic m r p r L 8

2 5 0 n r g 5 2 a s u a s 6 n a h 1 6 0 1 u lu t l n o th th n l M h m u a s o 0 0 5 9 1 % 1

U 3 0 M a o e 0 t h a g h 0 a d t p ii h il a s u a n e a u M o im iu p c s 6 0

0 - 0 s p t h n r 1 0 e l e t la 5 h r e 5 s A s ir g u c th a e th c m l n m o . 3 8 p 0 u 1 4 a e a o 03 p u r e u 2 t ta M 0 8 B m p n le o e th M M e o r s n a m l u n . 2 0 r 5 6 n h n s M g o M g 0 e 1 U m lu s u l n n e u e re a 0 9 % 6 2 t

-B 1 li M t a 0 a re n e 0 M a p D ta il o h u a M e M a h a ri n g b lu u h 4 M

1 0 s a e h p l a r p e s S ir n c th M h t ri e a o u c t . 1 0 % 9 G B p 0 o M t s u o th o s in 9 s p a m o e t s a t l n la c s 5 e 7 0 0 5 9 M I I s n e la g n e n a l 4 s h u n e u e n p a s u 8 9 3 . 4 G M a - 1 a M e a a o 9 m o t ll a M M e l y o h m u rp 0 M 7 . . 5 M G L 2 N u o r th M t h e n 9 llu n e ri th ll o a n t u p o 6 0 2 % 7 9 II G I 1 g g e in a i i u tr p a e l r 0 1 a- I Ia s 2 re n e ol th A ir a M p e c e h M u o c 5 % 0 7 L I a - p a -U o a M M n e B p t h s M o p ia t le c c o 2 1 % % 0 1 -L L 1 9 3 0 6 0 n th U s e o n n e i s o n 0 6 s 1 5 r1 a e a M o n a ia i M b u n a 0 % 0 . . 4 p s s p 4 B h th n M a c o p a th 9 . % 9 9 5 p 8 u 1- t M e a h th in l a r h e p 9 % 9 0 7 B e h t e c o m co t s . 9 1 7 6 02 2 M M t e la n e M 9 4 u 7 5 e M M o a m o M m 9 % M 2 u M iu n lu 0 8 8 9 M G M 1

n th b a u G I 0

a e o th c % % Ib G 85 th r s 0 . % % I I - M I I e M ic e u 9 0 b- N G a M p 1 0 M N 2 -L M m r . . G 2 s I Ia 1 o o 4 9 0 1 9 % . M p - s n c 0 0 9 I I G sp 0 L p a o 9 . % 8 b - 0 0 S 1 0 n 9 9 0 . N II 0 2 P s 0 th a 1 . 1 5 2 b- 2 4 9 7 p 2 6 e % . 4 9 N M 5 7 9 8 7 8 th 3 % 2 1 3 5 M sp 2 0 9 s 3 8 M e 0 % 1 G 0 s G M 6 0 p 4 8 % % M M % I 0 p II 8 5 0 u 2 M 0 G . 7 3 8 Ib 2 0 b G 7 5 0 5 M 9 G - 5 0 -N I 2 G II 4 I % M N 0 2 2 Ib 7 1 0 1 . I b 8 Ib % G 2 3 7 -N 5

s a 8 I -O % - M M I 0 1 s 5 % 0 b- M N G I b p 4 5 8 2 p0 2 4 . 0. 9 O 1 3 M 1 M G - N 0 1 0 s 5 4 . M 1 s G G s G M II I 0 5 2 p 9 p % I I p I G b - Ib 2 27 7 0 0 2 2 G s 1 M Ib Ib 0 I b I N -N s 0 1 2 3 0 I M p 2 G - - 0 M - I b M M 2 p 8 3 7 c % 8 I b G 0 5 N N 2 G N - G G 2 0 6 5 1 % . 01 - I 02 7 II 1 1 5 I 1 N I sp s 0 2 9 8 2 i 1 8 O Ib 4 0 u b - s s 05 Ib s 1 I b IIb 0 p 7 5 5 2 s 9 1 - 5 N p p 4 - p s - 0 0 0 8 n 1 0 s M O 7 M 1 0 0 5 N 0 p N -N 2 0 8 5 e % 7 % p G 1 5 0 0 5 1 0 0 1 2 5 2 0 o % 0 I 9 G s 2 2 s 2 0 s 0 70 1 g % . 3 0 Ib sp M 5 II p 0 7 4 p 5 2 p sp 2 2 5 u 9 2 - 0 G b 0 1 9 0 0 4 0 0 6 9

k 5 O 0 - O 2 2 7 0 5 9 0 0 2 8

i 0 4 % 0 1 2 I I 5 6 2 2 6 5 2 2 5 5

h M 2 4 b 1 0 4 95 5 9 5 67 4 6

c 3 sp 9 - O s M 4 5 0 5 99 9 s . 1 G 6 8 7 p G 8 7 5 7 u 9 I 5 6 8 1 0 I 4 1 8 1 e % I b 8 9 s 0 Ib 5 7 5 0 ll 0 - 4 5 p 2 - 5 5 u 9 M O M u 0 4 N

c 0 M M G 1 G 0 9 1

o 1 9 . G G I I 2 87 s n 9 Ib sp I 4 2 p

a . II II - 0 b - 9 5 0

% 9 % b b O 0 O 8 0 th 1 6 1 -O -O 1 2 1 5 2 2 e 4 5 7

9 0 0 07 1 1 s 9 s 1 M 1 % % s M s p0 7 p 7 3 . p p 0 0 8 0 . 1 .% 0 G 0 02 2 0 3 % 9 0 4 0 II 0 7 5 2 4 5

% 2 b 2 0

. % 0 9 9 5 - O 4 8 5 7 9 8 M 0 9 3 5 0 0 . 0 G 2 1 6 8 5 5 9 2 3 1 s 9 5 5 t 1 M II 7 p 0 0 . . 0 . 0 3 % % G b 5 0 5 . 9 % 6 8 -O 0 0 8 0 2 II 0 2 . 8 2 . 4 1 b 2 6 5 1 % . 9 - O s 8 . 7 3 9 M M p . 5 9 0 9 % 8 % G 2 0 6 4 4 % 0 G s 0 1 1 . 5 3 % % M M I I II p 2 5 5 3 9 0 . 1 b b 0 4 % 9 M G G - -O 0 9

7 6 . M M % M I I O 2 9 9 0 % 0 0 7 G M Ib Ib 2 2 4 7 0 M % 9 9 I G G G - - s s 9 8 % m % . Ib I I G M M I O O p p 4 5 G

. 9 . 8 8 M - Ib Ib I G G Ib 2 2 0 0 9 . I I

8 9 8 G O - - Ib - 0 0 7 9 b

% 5 O O - II II O s s 2 2 5 -

1 0 5 0 % II 5 5 O b b 2 p0 p 4 6 8 O

7 7 b s 5 -O - O 0 9 8 3 e % -O p s s p s 0 2 0 7 6 0 s 8 6 T . . 0 p s 2 2 p 2 2 5 7 p

9 h 8 9 0 5 0 0 00 p s s 0 6 5 4 2 M 0

% a 0 s 2 0 2 0 p p 0 85 0 5 5 . % 0

0 % % l . p 5 2 0 0 0 2 4 9 G 2

a 0 . 4 9 9 0 0 7 71 2 0 0 4 3 1 9 M I 5

T s T 0 4 2 4 2 2 9 0 G I b 0

. h sa h 3 8 9 2 4 6 9 9 7 4 5 5 5 8 - 3

9 a a . 9 M 7 3 2 6 7 3 9 5 II O 2

la r c la 8 9 1 5 7 3 9 0 8 2 8 b 3 8

s T T h s % % M G 8 5 5 8 7 9 5 - O s 5 3 s T h h a s 8 I 9 5 7 8 M p a a a e a 6 3 G Ib 9 5 5 % 3 0 T r h l u r % I - 5 G s 0

6 h c a a T T l a m c 9 % I b O I p 2

0 a h la s s h h s h M - 5 I b 0 4

% l a s a a a s s a G O - 0 5

1 a e s r la l a p e % M 5 s M O 2 7

. s u a c s a r 0 u II p G 3 7 1

7 s m r h s s ch 0 m G b s 0 M I 3 4

% a c a a s a 2 II M -O p 0 G I b s 1 5

rc s h e r a 7 s M b G M 0 2 I - p 1

0 1 h p a u c rc e 1 p M - M 5 0 4 I b O 0 9

a 0 e m h h u 4 0 G O II M G s 2 9 - 3 0 5

4 e 0 u a a m 7 0 I G 5 b I G p 4 8 O 2

. u 2 m b e e 4 2 Ib I -O G Ib I 0 9 9 3 s 17

0 0 4 e u u s 6 - Ib s I - Ib 0 6 2 p

% M m 5 s t m m p 5 9 O - p 5 Ib O - 2 7 5 s 0 23

7 p a 0 8 5 O 0 s - O 7 2 p 0 s

3 . G s 6 0 e s s 0 2 5 0 p O 5 5 3 5 0 2 75

0 9 I M p0 0 0 p p 2 2 s 2 0 5 s 0 0 7

Ib 2 0 0 7 5 p s 5 0 p s 0 2 1

0 9 - G 0 5 4 0 0 2 0 p 0 2 s 0 M p 9 7 2 r . 5 O I 2 9 2 7 0 0 2 p 0 G 0 1 5 % 9 3 I b 5 5 5 2 2 2 0 0 49 0 2 0 5 9 7

. - 0 7 0 50 7 4 2 9 9 0 5 IIb 2 6 5

4 s O 5 3 1 5 9 5 5 2 0 5 1

9 9 M p 3 3 5 8 3 9 0 3 4 6 -O 0 5

% 0 2 0 8 1 2 9 8 1

9 G 0 s 5 85 5 6 8 5 9 2 5 6 M

5 I 2 p 5 5 0 3 5 s 0 G

I 7 0 5 4 p 5 II 7 % b 2 0 5 0 a 0 - O 8 2 0 -L 0 5 7 2 2 % 3 6 2 5 s . s 5 2 0 p . 9 p 7 5 0 9 0 3 7 0 0 0 5 7 27 5 2 3 2 2 1 7 6 0 2 1

1 1 M 5 5 % 3 o . % G 1

9 % I 5 Ib 0 - 8 O M 3 M a . 1 G M

8 s G G

II p I

% 0 M b 0 I I

5 -O 0 Ib b

. G 2 - -O

I 3 7 O T

5 6 Ib 1 3 3 h

- s 3 s a

% % O p 1 s p l

3 0 8 p 0 a

M 0 5 0 0 M s

G s 2 0 1 sa

p 7 2 6 G

M II 0 2 5 2 I r c M M b 0 0 0 9 Ib h l G - M 2 0 6 - a

G G O 1 5 7 2 O e

I I II 3 G 7 5 5 0 3 u

Ib Ib b I M 2 5 5 m

- M - -O s Ib G 1 s

O G O p - 8 p s

M M 3 3 0 O II 5 0 p

3 I s 0 3 b 0 0

G G s Ib s p 1 - 2 0

I p - p 6 s O 4

II Ib 0 O 0 0 2 p 3 9 2

b - 0 3 0 0 9 0 6 5

-O O 2 1 1 2 0 s 4 0 t 3 7 s 6 6 3 2 p 8 7 3 p 2 2 5 7 0 5 1

s s 3 0 9 8 1 0 2

p p 0 0 2 4 8 2 5

0 4 2 8 6 7

0 0 4 6 5 5 9 1

0 2 5 8 5 6

2 7 3 5 0

4 2 1 8

5 3 5 5

7 0 5 1

5 4 a 5 5 p o Cyt bd rm e families Th qOR4a qOR3 qOR1 qOR2 Figure 3. Distribution of cytochrome bd-type oxygen reductases in Archaea. Cytochrome bd-type oxygen reductases are sporatically distributed throughout the Archaea. The qOR4a family (cydAA’) is predominantly found within the Thermoproteales and Desulfurococcales orders of Crenarchaeota. bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Figure 4. Biochemical characteristics of cytochrome bb’ (cytbb’) from Caldivirga maquilingensis. (A and B.) UV-visible spectra of cytochrome bd-type oxygen reductases purified from Escherichia coli and Caldivirga maquilingensis, respectively. C. Pyridine hemochrome spectra of Caldivirga maquilingensis cytbb’ reveals the absence of heme d in the partially purified enzyme. D. Oxygen reductase activity of cytbb’ from C. maquilingensis shows that it is highly active and cyanide insensitive. It is sensitive to Aurachin C1-10, a quinol binding site inhibitor which also inhibits E. coli cytochrome bd.

bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Table 1. cydAA’ is expressed in many environments. Protein expression is estimated based on read counts in metatranscriptomes. Genome_ID Genome_Name Locus_Tag Reads Best BLAST Hit NCBI Organism Ga0040881_1063211 246 ESQ23817.1 Acidilobus sp. OSP8 Ga0040881_1110491, 106 AMD31390.1 Acidilobus sp. 7A Ga0099831_11874701 42 Ferric microbial mat communities from Yellowstone Ga0040881_1111231 27 WP_081246098.1 sp. CP80 National Park, Wyoming, USA - One Hundred 3300003719 Ga0040881_1134581 25 WP_066793619.1 Caldivirga sp. MU80 Spring Plain (OSP_B) (Metagenome Metatranscriptome) (*) (MER-FS) (assembled) Ga0040881_1142191, 2174 NAY81522.1 Thaumarchaeota archaeon Ga0040881_1205091 12 Ga0040881_1202971 5 KUO89797.1 Caldivirga sp. CIS_19 Ga0040881_1228021, 8 KUO80886.1 Vulcanisaeta sp. JCHS 4 Ga0040873_1002501 4784 AMD31390.1 Acidilobus sp. 7A Hypersaline microbial mat communities from Ga0040873_1003761 1921 EQB65596.1 Yellowstone National Park, Wyoming, USA - 3300003723 Ga0040873_1011521 2174 KUO93148.1 sp. ECH_B Beowulf (BE_B) (Metagenome Metatranscriptome) Ga0040873_1013971 707 PMP91269.1 sp. (*) (MER-FS) (assembled) Ga0040873_1028371 244 NAY81522.1 Thaumarchaeota archaeon Hypersaline microbial mat communities from Ga0040875_1014081 163 AMD31390.1 Acidilobus sp. 7A Yellowstone National Park, Wyoming, USA - 3300003709 Grendel (GRN_C) (Metagenome Ga0040875_1034841 11 KUO93148.1 Thermocladium sp. ECH_B Metatranscriptome) (*) (MER-FS) (assembled)

Ga0099835_1035811 192 NAY81522.1 Thaumarchaeota archaeon Iron oxide microbial mat communities from Ga0099835_1427011, 4 KUO93148.1 Thermocladium sp. ECH_B Yellowstone National Park, Wyoming, USA - 3300007166 Ga0099835_1431661 3 BED_top_diel_T=1 metaT (Metagenome Ga0099835_1456681 2 PMP93815.1 Nitrosphaera sp. Metatranscriptome) (*) (MER-FS) (assembled) Ga0099835_1510461 17 KUO89797.1 Caldivirga sp. CIS_19

Iron oxide microbial mat communities from Ga0099836_1305913 60 NAY81522.1 Thaumarchaeota archaeon Yellowstone National Park, Wyoming, USA - Ga0099836_1428691 2 WP_066793619.1 Caldivirga sp. MU80 3300007164 BED_top_diel_T=3 metaT (Metagenome Ga0099836_1428931, 3 KUO93148.1 Thermocladium sp. ECH_B Metatranscriptome) (*) (MER-FS) (assembled) Ga0099836_1472421 3 Iron oxide microbial mat communities from Ga0099838_1606901 3 KUO93148.1 Thermocladium sp. ECH_B Yellowstone National Park, Wyoming, USA - 3300007168 BED_top_diel_T=7 metaT (Metagenome Ga0099838_1754862 40 NAY81522.1 Thaumarchaeota archaeon Metatranscriptome) (*) (MER-FS) (assembled)

Iron oxide microbial mat communities from Ga0099839_1320821 2 NAZ28310.1 Caldivirga sp. Yellowstone National Park, Wyoming, USA - 3300007161 Ga0099839_1444841, 19 NAY81522.1 Thaumarchaeota archaeon BED_top_diel_T=8 metaT (Metagenome Ga0099839_1444842, 4 Metatranscriptome) (*) (MER-FS) (assembled) Ga0099839_1513781 Ga0099840_1194611, 6 KUO93148.1 Thermocladium sp. ECH_B Iron oxide microbial mat communities from Ga0099840_1228901 Yellowstone National Park, Wyoming, USA - Ga0099840_1208591, 29 NAY81522.1 Thaumarchaeota archaeon 3300007574 ECH_B_top_diel_T=1 metaT (Metagenome Ga0099840_1293341 27 Metatranscriptome) (*) (MER-FS) (assembled) Ga0099840_1307751, 7 Ga0099840_1212061 16 AMD31390.1 Acidilobus sp. 7A Ga0099841_1138611 5 WP_117355195.1 Acidilobus sp. 7A Iron oxide microbial mat communities from Ga0099841_1140711, 23 NAY81522.1 Thaumarchaeota archaeon Yellowstone National Park, Wyoming, USA - 3300007486 Ga0099841_1141101 12 ECH_B_top_diel_T=5 metaT (Metagenome Ga0099841_1256151 7 NAZ28310.1 Caldivirga sp. Metatranscriptome) (*) (MER-FS) (assembled) Ga0099841_1293981 15 KUO93148.1 Thermocladium sp. ECH_B Iron oxide microbial mat communities from Ga0099844_1215141, 10 NAY81522.1 Thaumarchaeota archaeon Yellowstone National Park, Wyoming, USA - Ga0099844_1308701 10 3300007575 ECH_C_top_diel_T=7 metaT (Metagenome Metatranscriptome) (*) (MER-FS) (assembled) Metatranscriptome of extremophilic microbial mat Ga0187838_10229991 154 WP_014289867.1 Pyrobaculum ferrireducens communities from Yellowstone National Park, 3300021851 Wyoming, USA - CONBC_RNA (Metagenome Ga0187838_11515921 155 KUK06933.1 Archaeoglobus fulgidus Metatranscriptome) (*) (MER-FS) (assembled) Ga0401895_061319_2_36 HGD34321.1 Candidatus Korarchaetoa Metatranscriptome of soil microbial communities 1 archaeon from Old Woman Creek estuary, Ohio, United 3300037625 - States - Aug_M1_C1_D5_B (Metagenome Metatranscriptome) (*) (MER-FS) (assembled)

Metatranscriptome of tropical peat soil microbial Ga0187794_17837561 5 PMP74969.1 Aciduliprofundum sp. communities from peatlands in Department of 3300019273 Meta, Colombia - 0116_SJ02_MP02_20_MT (Metagenome Metatranscriptome) (*) (MER-FS) (assembled) Metatranscriptome of tropical peat soil microbial Ga0187796_12966161 31 HHN53578.1 Nitrosphaera archaeon communities from peatlands in Department of 3300019264 Meta, Colombia - 0116_SJ02_MP15_20_MT (Metagenome Metatranscriptome) (*) (MER-FS) Ga0187796_15754521 8 PMP74969.1 Aciduliprofundum sp. (assembled) Metatranscriptome of tropical peat soil microbial Ga0187799_12787231 6 HHN53578.1 Nitrosphaera archaeon communities from peatlands in Department of 3300019211 Meta, Colombia - 0216_BV02_MP12_10_MT (Metagenome Metatranscriptome) (*) (MER-FS) (assembled) Metatranscriptome of tropical peat soil microbial Ga0187800_11075051 3 PMP74969.1 Aciduliprofundum sp. communities from peatlands in Department of 3300019278 Meta, Colombia - 0216_BV02_MP12_20_MT (Metagenome Metatranscriptome) (*) (MER-FS) (assembled) Thermal spring microbial communities from Beowulf Ga0040879_1097101 63 NAY81522.1 Thaumarchaeota archaeon Spring, Yellowstone National Park, Wyoming, USA - 3300003730 Beowulf (BE_D) (Metagenome Metatranscriptome) Ga0040879_1273451 7 NAZ28310.1 Caldivirga sp. (*) (MER-FS) (assembled) bioRxiv preprint doi: https://doi.org/10.1101/2021.01.16.426971; this version posted January 17, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

Table 2. Oxygen reduction activity of E. coli cytbd and C. maquilingensis cytbb’ in the presence of 350 µM coenzyme Q1 and 5 mM DTT.

Oxygen reduction activity Protein (e-/s)

Caldivirga maquilingensis cytbb’ 333 Ə 20 (cydAA’)

Escherichia coli cytbd (cydAB) 1065 Ə 73