bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

1 Conserved bacterial genomes from two geographically distinct peritidal

2 stromatolite formations shed light on potential functional guilds

3

4 Running Title: Functional bacterial guilds in stromatolites

5

6 Samantha C. Waterwortha, Eric W. Isemongerb, Evan R. Reesa, Rosemary A.

7 Dorringtonb and Jason C. Kwana#

8 a Division of Pharmaceutical Sciences, University of Wisconsin, 777 Highland Ave.,

9 Madison, Wisconsin 53705, USA

10 b Department of Biochemistry and Microbiology, Rhodes University, Grahamstown,

11 South Africa

12 #Corresponding author. Email: [email protected]; Phone: 608-262-3829

13

14 ORIGINALITY-SIGNIFICANCE

15 Peritidal stromatolites are unique among stromatolite formations as they grow at the

16 dynamic interface of calcium carbonate-rich groundwater and coastal marine waters.

17 The peritidal space forms a relatively unstable environment and the factors that

18 influence the growth of these peritidal structures is not well understood. To our

19 knowledge, this is the first comparative study that assesses conservation within

20 the microbial communities of two geographically distinct peritidal stromatolite

21 formations. We assessed the potential functional roles of these communities using

22 genomic bins clustered from metagenomic sequencing data. We identified several

23 conserved bacterial species across the two sites and hypothesize that their genetic

1

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

24 functional potential may be important in the formation of pertidal stromatolites. We

25 contrasted these findings against a well-studied site in Shark Bay, Australia and show

26 that, unlike these hypersaline formations, archaea do not play a major role in peritidal

27 stromatolite formation. Furthermore, bacterial nitrogen and phosphate metabolisms of

28 conserved species may be driving factors behind lithification in peritidal stromatolites.

29

30 SUMMARY

31 Stromatolites are complex microbial mats that form lithified layers and ancient forms are

32 the oldest evidence of life on earth, dating back over 3.4 billion years. Modern

33 stromatolites are relatively rare but may provide clues about the function and evolution

34 of their ancient counterparts. In this study, we focus on peritidal stromatolites occurring

35 at Cape Recife and Schoenmakerskop on the southeastern South African coastline.

36 Using assembled shotgun metagenomic data we obtained 183 genomic bins, of which

37 the most dominant taxa were from the Cyanobacteriia class (Cyanobacteria phylum),

38 with lower but notable abundances of classified as ,

39 Gammaproteobacteria and Bacteroidia. We identified functional gene sets in bacterial

40 species conserved across two geographically distinct stromatolite formations, which

41 may promote carbonate precipitation through the reduction of nitrogenous compounds

42 and possible production of calcium ions. We propose that an abundance of extracellular

43 alkaline phosphatases may lead to the formation of phosphatic deposits within these

44 stromatolites. We conclude that the cumulative effect of several conserved bacterial

45 species drives accretion in these two stromatolite formations.

46

2

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

47 INTRODUCTION

48 Stromatolites are organo-sedimentary structures that date back more than 3.4 billion

49 years, forming the oldest fossils of living organisms on Earth (Dupraz et al., 2009). A

50 more recent discovery of stromatolite-like structures in Greenland suggests that the

51 structures may date as far back as 3.7 - 3.8 billion years (Nutman et al., 2016),

52 however, their biogenic origin remains under debate (Witze, 2016). The emergence of

53 Cyanobacteria in stromatolites approximately 2.3 billion years ago initiated the Great

54 Oxygenation Event that fundamentally altered the Earth’s redox potential and resulted in

55 an explosion of oxygen-based and multicellular biological diversity (Soo et al., 2017).

56 Ancient stromatolites could provide insight into how microorganisms shaped early

57 eukaryotic evolution. Unfortunately ancient microbial mats are not sufficiently preserved

58 for identification of these microbes and individual bacteria cannot be classified more

59 specifically than phylum Cyanobacteria due to morphological conservatism (Awramik,

60 1992; Dupraz et al., 2009). The study of extant stromatolite analogs may therefore help

61 to elucidate the biological mechanisms that led to the formation and evolution of their

62 ancient ancestors. Modern stromatolites are formed through a complex combination of

63 both biotic and abiotic processes. The core process revolves around the carbon cycle

64 where bacteria transform inorganic carbon into bioavailable organic carbon for

65 respiration. Bacterial respiration in turn results in the release of inorganic carbon, which,

66 under alkaline conditions, will bind cations and precipitate primarily as calcium

67 carbonate (Dupraz et al., 2009). This carbonate precipitate, along with sediment grains,

68 can then become trapped within bacterial biofilms forming the characteristic lithified

69 layers.

3

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

70

71 Alteration of the pH and subsequently, the solubility index (SI), may promote

72 mineralization or dissolution of carbonate minerals through microbial cycling of redox

73 sensitive compounds such as phosphate, nitrogen, sulfur and other nutrients within the

74 biofilm. This in turn regulates the rate of carbonate accretion and stromatolite growth.

75 Particularly, photosynthesis and sulfate reduction have been demonstrated to increase

76 alkalinity thereby promoting carbonate accretion, resulting in the gradual formation of

77 lithified mineral layers (Dupraz et al., 2009). In some stromatolite formations such as

78 those of Shark Bay, Australia, there is abundant genetic potential for both dissimilatory

79 oxidation of sulfur (which may promote dissolution under oxic conditions and

80 precipitation under anoxic conditions) and dissimilatory reduction of sulfate (which

81 promotes precipitation) (Gallagher et al., 2012; Casaburi et al., 2016; Wong et al.,

82 2018).

83

84 The biogenicity of stromatolites has been studied extensively in the hypersaline and

85 marine formations of Shark Bay, Australia and Exuma Cay, Bahamas, respectively

86 (Khodadad and Foster, 2012; Babilonia et al., 2018). The presence of Archaea has

87 been noted in several microbial mat and stromatolite systems (Casaburi et al., 2016;

88 Balci et al., 2018; Medina-Chávez et al., 2019), particularly in the stromatolites of Shark

89 Bay, where they are hypothesized to potentially fulfill the role of nitrifiers and

90 hydrogenotrophic methanogens (Wong et al., 2017). Although Cyanobacteria,

91 and Bacteroidetes appear to be abundant in both marine and

92 hypersaline systems, Cyanobacteria are proposed to be particularly vital to these

4

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

93 formations through the combined effect of biofilm formation, carbon fixation, nitrogen

94 fixation and endolithic (boring) activity (Macintyre et al., 2000; Khodadad and Foster,

95 2012; Casaburi et al., 2016; Babilonia et al., 2018).

96

97 Peritidal tufa stromatolite systems are found along the southeastern coastline of South

98 Africa (SA). They are geographically isolated, occurring at coastal dune seeps

99 separated by stretches of coastline (Smith et al., 2018). In these systems, stromatolite

100 formations extend from freshwater to intertidal zones and are dominated by

101 Cyanobacteria, Bacteroidetes and Proteobacteria (Perissinotto et al., 2014). The

102 stromatolites are impacted by fluctuating environmental pressures caused by periodic

103 inundation by seawater, which affects the nutrient concentrations, temperature and

104 chemistry of the system (Rishworth et al., 2016). These formations are characterized by

105 their proximity to the ocean, where stromatolites in the upper formations receive

106 freshwater from the inflow seeps, middle formation stromatolites withstand a mix of

107 freshwater seepage and marine over-topping, and lower formations are in closest

108 contact with the ocean (Perissinotto et al., 2014). The stromatolite formations at Cape

109 Recife and Schoenmakerskop are exposed to both fresh and marine water that has little

110 dissolved inorganic phosphate and decreasing levels of dissolved inorganic nitrogen

111 (Cape Recife: 82 - 9 µM, Schoenmakerskop: 424 - 14 µM) moving from freshwater to

112 marine influenced formations (Rishworth, Perissinotto, Bird, et al., 2017). Microbial

113 communities within these levels therefore likely experience distinct environmental

114 pressures, including fluctuations in salinity and dissolved oxygen (Rishworth et al.,

115 2016). While carbon predominantly enters these systems through cyanobacterial carbon

5

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

116 fixation, it is unclear how other members of the stromatolite-associated bacterial

117 consortia influence mineral stratification resulting from the cycling of essential nutrients

118 such as nitrogen, phosphorus and sulfur. Since peritidal stromatolites exist in constant

119 nutritional and chemical flux with varying influence from the fresh and marine water

120 sources, they present an almost ideal in situ testing ground for investigating which

121 microbes are consistently present despite fluctuations in their environment. Identification

122 of conserved bacterial species across both time and space and across varied

123 environmental pressures would suggest that these bacteria are not only robust but likely

124 play important roles within the peritidal stromatolite consortia.

125

126 Previous studies of stromatolites collected from Lake Clifton, Pavilion Lake, Clinton

127 Creek, Highborne Cay and Pozas Azules have investigated the overall bacterial

128 composition using 16S rRNA gene analysis and/or overall potential gene function of

129 unbinned metagenomic datasets per bacterial taxa using tools such as MG-RAST

130 (Keegan et al., 2016) to gain insight into the potential roles of the different bacterial

131 groups (Mobberley et al., 2015; Centeno et al., 2016; Gleeson et al., 2016; Ruvindy et

132 al., 2016; Warden et al., 2016; White et al., 2016). However, the presence of all genes

133 required for a complete functional pathway within a collection of bacteria does not

134 necessarily mean that the cycle can take place since they may not be present within a

135 single organism. Therefore, in order to assess the potential for functional roles,

136 individual bacterial genomes must be investigated. To date, only two studies have

137 successfully binned individual bacterial genomes from culture-independent,

138 metagenomic data originating from stromatolites from Shark Bay (Australia) (Wong et

6

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

139 al., 2018) and Socompa Lake (Argentina) (Kurth et al., 2017). The latter study obtained

140 four high-quality bins (in accordance with MIMAG standards defined by (Bowers et al.,

141 2017)) and analyzed three which were believed to have carried genes from several

142 closely-related genomes. Extracted 16S rRNA sequences were in conflict with whole-

143 genome taxonomic classifications (Kurth et al., 2017). The Shark Bay study obtained a

144 total of 550 binned genomes, of which 87 (15.8%) were of medium to high quality

145 (Bowers et al., 2017; Wong et al., 2018) and the data provided information on a number

146 of potentially important processes that may contribute to the formation and maintenance

147 of the hypersaline stromatolites. However, the study did not identify key microbial

148 architects within these biogenic structures.

149

150 Using a metagenomic approach, we have sought to gain insight into the foundational

151 bacteria responsible for metabolic processes that potentially result in formation of

152 pertidal South African stromatolites. We obtained and annotated 183 putative bacterial

153 metagenome-assembled genomes (MAGs), (of which 112 (61%) were of medium to

154 high quality) from samples of two geographically isolated sites near Port Elizabeth,

155 South Africa. We identified several temporally and spatially conserved bacterial species

156 and functional gene sets, that are likely central in establishing and maintaining peritidal

157 stromatolite microbial communities.

158

159 RESULTS AND DISCUSSION

160 Two geographically isolated peritidal stromatolite sites, Cape Recife and

161 Schoenmakerskop, which are 2.82 km apart, were chosen for this study. These two

7

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

162 sites have been extensively characterized with respect to their physical structure,

163 nutrient and chemical environment (Perissinotto et al., 2014; Rishworth et al., 2016,

164 2019; Rishworth, Perissinotto, Bornman, et al., 2017; Dodd et al., 2018). The sites

165 experience regular shifts in salinity due to tidal overtopping and groundwater seepage

166 (Rishworth et al., 2019). Comparison to a site with groundwater seepage but no

167 stromatolite growth, has shown that the growth of peritidal stromatolites is promoted

168 within this region by decreased levels of wave action, higher water alkalinity and

169 decreased calcite and aragonite saturation (Dodd et al., 2018). Furthermore,

170 stromatolite growth is inhibited by increased levels of salinity, as lithified structures are

171 not observed in both the marine waters and in deeper portions of formation pools, with

172 higher salinity levels (Dodd et al., 2018).

173

174 Stromatolite formations at both sites begin at a freshwater inflow and end before the

175 subtidal zone (Fig. 1) and are exposed to different levels of tidal disturbance. Samples

176 for this study were collected from the upper stromatolites at Cape Recife and

177 Schoenmakerskop in January and April 2018 for comparisons over time and geographic

178 space. Additional samples were collected in April 2018 from middle and lower

179 formations for extended comparison across the two sites (Fig. 1). For a detailed

180 perspective of depth and the differentiation of the sampled zones a 3-dimensional

181 rendering of the sample sites was constructed by a 3rd party (Caelum Technologies)

182 using a combination of drone-based photogrammetry and geographic mapping using

183 differential GPS (Fig. 1). Stromatolite formations were classified according to their tidal

184 proximity as defined in previous studies (Perissinotto et al., 2014; Rishworth et al.,

8

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

185 2016; Rishworth, Perissinotto, Bornman, et al., 2017; Dodd et al., 2018). Upper

186 formations occur in the supratidal zone where they are constantly exposed to fresh

187 water flowing from dune seeps. These formations will only be exposed to seawater

188 during spring tides or extreme storm surges. The water from upper formations feeds into

189 large pools, which in turn feed into the lower portion of the system. In the upper-middle

190 intertidal zone, the formations form a slope into large pools where they will only receive

191 seawater during peak high tide. Lower formations occur in areas where saline or

192 brackish conditions are predominant. In Schoenmakerskop the lower zone is located in

193 a semi stagnant pool which is frequently overtopped, whilst in the lower zone of Cape

194 Recife, formations create a low flowing slope which ends at the subtidal zone. All

195 sampling was conducted at low tide. Sample abbreviations and MAG prefixes used

196 throughout this study correspond to the site and region from which they were sampled

197 e.g. “SU” identifies the sample as originating from Schoenmakerskop, Upper formation

198 in April (Table S1). Samples prefixed with a “C” identify samples collected in January

199 e.g. “CSU” identifies the sample as originating from Schoenmakerskop, Upper formation

200 in January (Table S1).

201

202 Phylogenetic distribution of microbial communities

203 We assessed the diversity and structure of the bacterial communities in triplicate

204 samples taken from upper, middle and lower formations at Cape Recife and

205 Schoenmakerskop using 16S rRNA gene amplicon sequence analysis. All communities

206 were dominated by Cyanobacteria, Bacteroidetes, Alphaproteobacteria,

207 Gammaproteobacteria and other unclassified bacteria (Fig. 2A), in agreement with a

9

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

208 previous study at Schoenmakerskop (Perissinotto et al., 2014). Bacterial communities at

209 each sampling site were statistically different from one another (Fig. 2B; ANOSIM: R =

210 0.976, p = 0.01 permutations = 999), whilst pairwise Kendall rank correlation tests

211 showed that replicates were not statistically different from one another (pairwise p-

212 values < 2.2 × 10-16).

213

214 Trends observed in metagenomes of stromatolite samples from Cape Recife and

215 Schoenmakerskop

216 To characterize the metabolic potential within stromatolites, we generated shotgun

217 metagenomic libraries from 8 samples representative of the upper, middle and lower

218 formations at both sites in April 2018, as well as an additional two samples from the

219 upper formations of each site, collected 4 months previously in January 2018 (Table

220 S1). Following assembly of raw sequence reads, gene coverages were normalized to

221 the length-weighted average coverage per sample, revealing a high abundance of

222 genes encoding phosphate transport (pstSCAB), phosphate uptake regulation

223 (phoURBP) and alkaline phosphatases (phoADX) were observed across the board (Fig.

224 3). Additionally, genes involved in phosphonate metabolism (phnCDEFGHIJKLM) were

225 abundant in three of the four middle formations but were absent in upper formations.

226 The concentration of soluble phosphorus, although relatively low, is highest in water

227 surrounding the middle formations, as both fresh seep water and ocean overtopping

228 contribute to the total phosphate (Dodd et al., 2018). It is thus unsurprising then, that the

229 greatest abundance of phosphate-metabolism genes are found in the middle

230 formations. The high abundance of genes encoding phosphate-metabolizing enzymes,

10

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

231 may also be indicative of how stromatolite communities cope with low dissolved

232 inorganic phosphorus in their environment. Genes encoding alkaline phosphatase

233 phoX, were the most abundant among the phosphatases observed in these stromatolite

234 samples, and represent a calcium-dependent enzyme that can function at low substrate

235 concentrations on a broad range of C-O-P substrates (Zaheer et al., 2009). The

236 phosphonate transporter genes (phnCDE) are more prevalent than the genes encoding

237 the C-P lyase (phnGHIJLM) required for phosphonate degradations (Fig. 3). These

238 transporters have also been implicated in the transport of inorganic phosphate, in

239 addition to phosphonates (Stasi et al., 2019), and the discrepancy between transporters

240 and metabolic genes may suggest an additional role of phosphate uptake in these

241 systems. Overall gene abundances indicated negligible presence of canonical

242 dissimilatory sulfate reduction/oxidation via aprAB and dsrAB encoded enzymes.

243 Similarly, there were low abundances of genes associated with sulfonate metabolism.

244 There was an abundance of genes associated with assimilatory sulfate reduction, but

245 the low abundance or absence of cysC, a gene encoding a key enzyme in this pathway,

246 indicated that this pathway may not be complete. Genes associated with assimilatory

247 nitrate reduction (narB, nirA) were the most prevalent markers of nitrogen metabolism.

248 All sites also contained a number of genes associated with dissimilatory nitrate

249 reduction where cytoplasmic NADH-dependent nitrate reductase nirBD appeared to be

250 favored over periplasmic cytochrome c nitrate reductase nrfAH. Interestingly nitrogen

251 fixation genes (nifDHK) appear to be more abundant in the middle stromatolite

252 formations of Schoenmakerskop (Fig. 3).

253

11

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

254 Binning and phylogenetic classification of putative genomes

255 The 10 assembled metagenomes were binned using Autometa (Miller et al., 2019),

256 resulting in a total of 183 bacterial genome bins (Table S1). Using relative coverage per

257 sample as a proxy for abundance, we found that genomes classified within the

258 Cyanobacteriia class were consistently dominant in all collection points, while

259 Alphaproteobacteria, Gammaproteobacteria and Bacteroidia were less abundant but

260 notable bacterial classes (Fig. 4A). This distribution appears to be approximately

261 congruent with abundances observed in the 16S rRNA gene amplicon analyses (Fig. 2).

262

263 Temporal and spatial conservation of bacterial species

264 We calculated pairwise average nucleotide identity (ANI) between all binned genomes

265 and defined conserved species as genomes sharing more than 97% ANI in two or more

266 of the sampled regions. We identified 16 conserved taxa across the 10 sampled

267 regions, with several species identified in 3 - 5 samples, across Schoenmakerskop and

268 Cape Recife, however no single species was common to all sampled sites (Table S2,

269 Fig. 4A). Conserved species were commonly the most abundant taxa present in each of

270 the samples, accounting for approximately 30 - 80% of the species abundance (Table

271 S2, Fig. 4A). Cyanobacterial species within the Acaryochloris and Hydrococcus genera

272 and Absconditabacterales family were conserved across upper formations (Table 1, Fig.

273 4B, Table S3). Seven other species were conserved across the middle formations,

274 including species classified within the Rivularia genus and Phormidesmiaceae and

275 Spirulinaceae families (Table 1 and S3, Fig. 4B). Two distinct species within family

276 Elainellaceaee were also conserved: species A was detected only in the upper pools of

12

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

277 Cape Recife, whilst species B was conserved across the middle formations of both sites

278 (Table 1 and S3, Fig. 4B).

279

280 Also, of interest was the presence of conserved bacterial species (order

281 Absconditabacterales), which are classified under the Patescibacteria phylum.

282 Patescibacteria are unusually small bacteria found in groundwater that produce large

283 surface proteins hypothesized to help them attach to, and exploit the ability of other

284 microorganisms performing nitrogen, sulfur and iron cycling (Herrmann et al., 2019).

285 The presence of these conserved bacteria suggests that the inflow water seeps may

286 originate from groundwater.

287

288 Conserved Rivularia, Elainellaceae, Phormidesmiaceae and Chloroflexaceae species

289 were identified in the lower formation of Schoenmakerskop, but none of the genomes

290 identified in the lower formation of Cape Recife had greater than 97% shared ANI with

291 any other genome bin. The distribution of conserved taxa, wherein the upper and middle

292 formations appear to harbor distinct conserved taxa suggests that the differing nutrient

293 and physical characteristics between upper, middle and lower regions of the

294 stromatolite formation elicit specialization of the conserved bacterial community. The

295 lack of conserved bacterial species across the lower formation of Cape Recife may be

296 due to the choice of sampling site: The lower formation sample taken from Cape Recife

297 is in closer contact with the ocean than the lower formation sample taken from

298 Schoenmakerskop (Fig. 1 C - D). The proximity of the Schoenmakerskop lower

299 formation sample to the ocean was initially thought to be sufficient to delimit it as a

13

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

300 lower formation, but following this study it may be reclassified as a middle formation, as

301 the bacterial community composition corresponds more closely with the other four

302 middle formation samples (Fig. 4B).

303

304 Metabolic potential of binned genomes

305 Oxygenic photosynthesis by cyanobacteria results in rapid fixation of carbon dioxide

306 and an increase in alkalinity (Pace et al., 2018). Carbonate ions bind cations such as

307 calcium and are precipitated under alkaline conditions, promoting the growth of

308 stromatolite structures (Dupraz et al., 2009). Given their numerical dominance in Cape

309 Recife and Schoenmakerskop stromatolites, and their predicted role in other

310 stromatolites (Dupraz et al., 2009), cyanobacteria likely perform carbon sequestration.

311 However, the identity of the bacteria that cycle redox-sensitive sulfur, phosphate,

312 nitrogen and calcium, and subsequently affect the alkalinity and solubility index enabling

313 carbonate precipitation in these stromatolites remains unknown. We inspected

314 PROKKA and KEGG annotations within all stromatolite-associated bacterial genome

315 bins to identify potential metabolic pathways that may promote mineral deposition and

316 accretion (Dupraz et al., 2009). The results presented here are summarized in Table 1.

317

318 Sulfur metabolism. Reduction of sulfate has previously been shown to promote the

319 precipitation of carbonates in the form of micritic crusts in Bahamian and Australian

320 stromatolites (Reid et al., 2000; Wong et al., 2018) and it has been suggested that

321 microbial cycling of sulfur played an important role in ancient Australian stromatolites,

322 even prior to the emergence of Cyanobacteria (Bontognali et al., 2012; Allen, 2016).

14

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

323 Amongst the Cape Recife and Schoenmakerskop stromatolite-associated bacteria, the

324 potential capacity for sulfate reduction was confined to only a few genomes (Fig. 5 and

325 Fig. S1). The complete set of genes required for assimilatory sulfate reduction

326 (sat/met3, cysC, cysH and sir genes) (Santos et al., 2015) were recovered in four

327 genomes (Fig. S1), three of which were conserved Acaryochloris species (Fig. 5). The

328 abundance of genes associated with assimilatory sulfate reduction in the conserved

329 Acaryochloris genome bins accounted for 22 - 100% of gene counts observed in their

330 respective metagenomes (Fig.5). This was calculated as abundance of geneX in binX,

331 as a percentage of geneX abundance in the sample from which the bin was derived

332 (Fig. 5). The greatest abundance genes for uptake and desulfonation of

333 alkanesulfonates (ssuABCDE) (Aguilar-Barajas et al., 2011; Ellis, 2011) were detected

334 exclusively in conserved Hydrococcus species (Fig. 5), which account for 23 - 100% of

335 gene abundance in the respective metagenomes. All genes required for a complete

336 pathway found exclusively in bin RU2_2, but both SU_1_0 and CRU1_1 appear to be

337 missing the ssuE gene. The ssuE gene is not required for growth using aliphatic

338 sulfonate or methionine substrates, but is required for arylsulfonate metabolism

339 (Kahnert et al., 2000). This suggests that these Hydrococcus strains are all capable of

340 some form of sulfonate metabolism, but not all can metabolize arylsulfonates (Bin

341 CSU_1_8 is only 37% complete, and therefore of low quality and may be missing gene

342 due to incompleteness). In both cases, these trends are in agreement with the patterns

343 observed in the metabolic potential of the overall metagenome (Fig.3). Hydrococcus

344 and Acaryochloris species are dominant in upper formations (Fig. 4 and Table S3) and

345 the potential for cumulative removal of hydrogen by sulfate reduction by these species

15

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

346 may aid in the creation of an alkaline environment within the system. Seep waters

347 feeding both Cape Recife and Schoenmakerskop have relatively high levels of sulfate

348 (Dodd et al., 2018), and it is possible that the Hydrococcus and Acaryochloris species

349 utilize this nutrient in the upper formations (closest to the seeps) as a selective

350 advantage, simultaneously increasing the alkalinity of the surrounding environment

351 through hydrogen assimilation. There was no evidence for the potential for dissimilatory

352 sulfate reduction in conserved genome bins (Fig. 5). Similarly, amongst all 183 genome

353 bins, only genomes representative of a Thioiploca sp. from the Schoenmakerskop lower

354 formation and a Desulfobacula sp. from the Schoenmakerskop middle formation

355 included all genes required for dissimilatory sulfate reduction (Fig. S1). The lack of

356 genes associated with conserved or dominant bacterial taxa in the middle and lower

357 pools suggest that either sulfate is not required by the consortia that inhabit the

358 middle/lower formations and is required only by bacteria exposed to ground water, or

359 that insufficient amounts of the sulfate continues into the lower pools where the middle

360 and lower formations are found. This suggests that calcite formation in these peritidal

361 stromatolites may be influenced by processes other than dissimilatory sulfate reduction,

362 in contrast to stromatolite formations in the Cayo Coco lagoonal network, Highborne

363 Cay and Eleuthera Island in the Bahamas (Visscher et al., 2000; Dupraz et al., 2004;

364 Pace et al., 2018).

365

366 Nitrogen metabolism. The reduction of nitrogen, nitrates and nitrites can lead to calcite

367 precipitation (Rodriguez-Navarro et al., 2003; Wei et al., 2015; Konopacka-Łyskawa et

368 al., 2017; Wong et al., 2018; Lee and Park, 2019), and the released NH3 can react with

16

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

4+ 2- 369 CO2 and H2O, to form 2NH + CO3 (Konopacka-Łyskawa et al., 2017). Nitrogen

370 metabolism is proposed to have emerged at approximately the same time as sulfur

371 metabolism, dated to 3.4 billion years ago (Stüeken, 2016), as both processes share

372 similar redox states (Thomazo et al., 2011), and ammonium availability during this time

373 may have sustained developing microbial life (Yang et al., 2019). Therefore, bacteria

374 that can fix nitrogen or produce ammonia from nitrates/nitrites could potentially promote

375 the growth of stromatolites (Visscher and Stolz, 2005) and may have added to the

376 formation of ancient analogs. Similarly, denitrification will likely lead to mineral

377 dissolution and retardation of stromatolite growth (Visscher and Stolz, 2005). We found

378 that several conserved bacteria carried the genes associated with ferredoxin-dependent

379 assimilatory nitrate reduction (nirA-narB genes) (Moreno-Vivián et al., 1999), nitrogen

380 fixation (nifDHK genes) and dissimilatory nitrite reduction (nirBD genes) (Griffith, 2016),

381 all of which result in the production of ammonia (Fig. 5).

382

383 The potential for assimilatory nitrate reduction was detected in several genomes,

384 primarily from the middle formations, with particularly high gene abundances of nirA-

385 narB genes conserved Rivularia sp. (Fig. 5), which are the dominant species in the

386 middle formations (Fig. 4). The potential for dissimilatory reduction of nitrites, via either

387 cytoplasmic nirBD or membrane-bound nrfAH nitrite reductases, was detected in

388 several genomes across both the upper and middle formations (Fig. S1). It appears that

389 the gene abundances associated with assimilatory nitrate reduction in Rivularia sp.

390 account for the majority (29 - 75%, Fig. 5) of nirA-narB genes observed in their

391 respective metagenomes (Fig. 3). Among conserved bacteria, the potential for

17

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

392 dissimilatory nitrate reduction is relatively low, with Hydrococcus sp. carrying the

393 greatest abundance of nirBD genes (Fig. 5) which encode cytoplasmic nitrite

394 reductases. Non-conserved Thioploca, Cyclobacteriaceae and Limnothrix species

395 appear to account for the remainder of the nirBD genes observed in their respective

396 metagenomes (Fig. S1). Very few genomes included the nrfAH genes which encode

397 membrane-bound nitrite reductases.

398

399 Nighttime nitrogen fixation has previously been shown to be a driver of carbonate

400 precipitation in stromatolites (Dupraz et al., 2009) and nif genes were identified in

401 several conserved bacteria associated with both upper and middle formations:

402 Hydrococcus species across both upper formations, Microcoleus in the

403 Schoenmakerskop upper formations and Phormidiaceae, Spirulinaceae and

404 Chloroflexaceae species the middle formations (Fig. 5). In addition to the conserved

405 bacteria carrying nif genes, non-conserved Blastochloris species appeared to account

406 for the majority (10% - 75%) of identified nifDHK genes (Fig. S1). The genetic capacity

407 for nitrogen fixation being retained in Schoenmakerskop and Cape Recife was

408 unexpected, given the high concentrations of dissolved inorganic nitrogen (DIN) ranging

409 from 95 - 450 mM at the two sites (Rishworth et al., 2016).

410

411 Phosphate and phosphonate metabolism

412 Phosphatic structures that closely resemble fossilized phosphatic stromatolites have

413 recently been observed within Cape Recife stromatolites (Buttner et al., 2019,

414 unpublished). Hydroxyapatite (Ca5(PO4)3(OH)) is more easily precipitated than calcite

18

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

3- 415 (CaCO3) and the release of PO4 into the biofilm by alkaline phosphatase activity could

416 increase the potential for apatite mineralization, the rate of which may be increased in

417 the presence of an alkaline environment (Gallagher et al., 2013). Copies of alkaline

418 phosphatase genes phoD and phoX are variable within bacterial genomes but the

419 majority of bacteria carry only 1 copy of phoD and 1 copy of phoX (Ragot et al., 2015,

420 2017). Conserved Acaryochloris and Rivularia species were particularly notable as they

421 encoded between 2–4 copies of phoX (Fig. 5), and carried all genes required for

422 phosphate transport (pstSCAB) (Fig. 5). Since the Cape Recife and Schoenmakerskop

423 stromatolites experience limited inorganic phosphate availability (Rishworth et al., 2016,

424 2018; Rishworth, Perissinotto, Bird, et al., 2017; Dodd et al., 2018), associated bacteria

425 may generate bioavailable phosphate from trapped sediments in biofilms resulting in

426 increased concentrations, similar to cyanobacteria living in phosphate-poor rivers

3- 2+ 427 (Wood et al., 2015). An increase in both PO4 and Ca can result in rapid precipitation

428 of apatite in marine phosphorites, freshwater lakes and in some soil bacteria (Danen-

429 Louwerse et al., 1995; Guang-Can et al., 2008; Cosmidis et al., 2015) and this could

430 account for the phosphatic deposits observed within the SA stromatolites (Buttner et al.,

431 2019, unpublished).

432

433 Finally, conserved Rivularia species in the middle and lower formations and other non-

434 conserved bacterial species harbored the 11 genes required for the transport (phnCDE)

435 and lysis (phnFGHIJKLM) of phosphonate compounds (Fig. 5) (Metcalf and Wanner,

436 1993). Phosphonates are characterized by the presence of a carbon-phosphorus bond

437 and are biosynthesized by many organisms (White and Metcalf, 2007). C-P lyase

19

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

438 (phnGHIJLKM) breaks the C-P bond within phosphonate substrates resulting in a

439 hydrocarbon and inorganic phosphate (White and Metcalf, 2007). The inorganic

440 phosphate may then be used within the bacterial cell or released aiding in accretion

441 through increased ion concentration (Rott et al., 2018). Conversely, phosphonates can

442 prevent precipitation of calcite by binding to crystal growth sites (Kan et al., 2005) and

443 the degradation of these compounds within stromatolite formations may prevent

444 chemical inhibition of stromatolite growth. Given the low availability of phosphate in

445 these systems, phosphonates may also provide an auxiliary source of bioavailable

446 phosphate. The potential for phosphonate degradation was also observed in 8 genomes

447 from the Shark Bay stromatolites (Wong et al., 2018). The conservation of phosphonate

448 metabolism across stromatolites would indicate that these bacterial processes may be

449 important within the generalized stromatolite system.

450

451 Archaeal genomes in Cape Recife and Schoenmakerskop stromatolites

452 Genome-resolved studies of hypersaline stromatolites in Shark Bay revealed that a

453 large proportion of the microbial community consisted of archaeal species (Wong et al.,

454 2017, 2018). Scrutiny of the contigs from Cape Recife and Schoenmakerskop showed

455 that none of the datasets comprised more than 1.5% archaeal genes. Assessment of

456 contigs clustered into the Archaeal kingdom bins from the upper and middle formations

457 showed little evidence for the presence of Archaea. However, two low-quality genomes

458 were obtained from the two lower formations SL_Arch_1 and RL_Arch_2, which were

459 classified as Woesearchaeales (order) and Nitrosoarchaeum (genus) respectively

460 (Table S3). Alignment of the 16S rRNA gene obtained from SL_Arch_1 (Table S3)

20

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

461 against the nr database showed that it shared the greatest sequence homology with an

462 uncultured archaeon clone GWA2 (87% identity). Similarly, analysis of the 16S rRNA

463 gene sequence from RL_Arch_2 (Table S3) indicated that it shared greatest sequence

464 homology with an uncultured archaeon clone 027 (99.79% identity). The coverage of

465 these genomes is among the lowest in each of the lower formation samples (Table S3)

466 and would suggest a low numerical abundance of these archaea. It is compelling that

467 the only Archaea detected in this study are derived from the formations most influenced

468 by saline waters, considering that the hypersaline stromatolites of Shark Bay comprise

469 several archaeal species (Wong et al., 2017, 2018), and it would be intriguing to identify

470 whether salinity has an effect on the Archaeal composition of stromatolite-associated

471 microbiota. However, as there are only two, low quality, non-conserved, relatively low-

472 abundance genomes derived from Archaea, it is likely that this kingdom is less

473 important in the stromatolites found at Cape Recife and Schoenmakerskop.

474

475 Key bacteria for stromatolite formation at Cape Recife and Schoenmakerskop

476 Several bacterial groups may be key players in the formation of stromatolites in Cape

477 Recife and Schoenmakerskop, with conserved bacterial species potentially acting as

478 major contributors (Table 1). There were distinct functional abilities between the

479 bacterial communities associated between the upper and middle formations. The

480 consortia in the upper formations were dominated by conserved Hydrococcus and

481 Acaryochloris species, which appeared to be capable of sulfate and sulfonate

482 metabolism, whilst the middle formations were dominated primarily by conserved

483 Rivularia species, which carry the genetic capacity to metabolize various forms of

21

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

484 phosphate substrates and potentially reduce nitrate. The lack of all genes required for

485 phosphonate metabolism in Rivularia species in Schoenmakerskop Middle 2 (SM2) was

486 unexpected but may be explained by the lower abundance of Rivularia in this sample,

487 resulting in lower quality of genome (Table S3). Similarly, Hydrococcus bin CSU_1_8

488 was of low quality (Table S3) and as a result appeared to be missing 3 of the genes

489 required.

490

491 We propose the redox potential and solubility index in these stromatolites is potentially

492 influenced by conserved bacteria generating sulfide (Acaryochloris and Hydrococcus

493 species) and ammonia ions (Rivularia species) for the rapid precipitation of carbonate

494 compounds and subsequent growth of stromatolite structures. The presence of

495 Microcoleus species (Phormidiaceae family) and an unidentified genus of

496 Phormidiaceae in both upper and middle formations is notable, as previous studies

497 have found that lamellar precipitation of carbonate occurs on the filaments of cultured

498 Phormidiaceae bacteria isolated from freshwater tufa structures (Payandi-Rolland et al.,

499 2019). We further propose an abundance of alkaline phosphatases in several

500 conserved species, as well as the abundance of genes associated with phosphonate

501 degradation in Rivularia species, may result in increased local inorganic phosphate

502 concentrations. This free inorganic phosphate may be incorporated into the phosphatic

503 crusts observed in these stromatolites (Buttner et al., 2019, unpublished).

504

505 The identification of conserved bacteria performing potentially important roles within

506 stromatolites of both Cape Recife and Schoenmakerskop may provide insight into what

22

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

507 cyanobacterial species may have played a key role in the formation of ancient

508 phosphatic stromatolites that formed in shallow marine and peritidal environments (Misi

509 and Kyle, 1994; Drummond et al., 2015; Buttner et al., 2019, unpublished; Shiraishi et

510 al., 2019). However, transcriptomics, nutrient uptake and additional hydrochemistry

511 would be required to determine if the proposed role of the conserved bacteria is valid.

512 Similarly, future studies will be needed to determine whether physical (e.g. flow rate) or

513 chemical factors (e.g. nutrient availability) result in our observed difference in functional

514 potential between the different formations.

515

516 EXPERIMENTAL PROCEDURES

517 Sample collection and DNA isolation. Samples were collected from

518 Schoenmakerskop (34°02'28.2"S 25°32'18.6"E) and Cape Recife (34°02'42.1"S

519 25°34'07.5"E) at low tide, from the water surface level. Samples approximately 1cm

520 deep were collected for 16S rRNA analysis in July 2019, from upper, middle and lower

521 stromatolite formations (Fig. 1) in triplicate, approximately 1 cm apart. Sample cores for

522 metagenomic shotgun sequencing were collected in April 2018, approximately 1cm

523 deep, from upper, middle and lower stromatolite formations. Additional samples were

524 collected in January 2018 from only the upper formations for metagenomic sequencing.

525 All samples were stored in RNAlater and flash-frozen until delivery to the lab at which

526 point, they were stored at -20 °C. DNA was extracted from ~1g of sample using Zymo

527 quick DNA Fecal/Soil Microbe Miniprep Kit (Zymo Research, Cat No. D6010) according

528 to the manufacturer's instructions.

23

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

529 Amplicon sequence analysis. Kapa HiFi Hotstart DNA polymerase (Roche, Cat No.

530 KK2500) was used to generate amplicon libraries of the V4-V5 region of the 16s rRNA

531 ribosomal subunit gene with the primer pair E517F (5’-GTAAGGTTCYTCGCGT-3’) and

532 E969-984 (5’-CAGCAGCCGCGGTAA-3’) (Matcher et al., 2011) from triplicate samples

533 using the following cycling parameters: Initial denaturation at 98 °C; 5 cycles (98 °C for

534 45 seconds, 45 °C for 45 seconds and 72 °C for 1 minute); 18 cycles (98 °C for 45

535 seconds, 50 °C for 30 seconds and 72 °C for 1 minute); final elongation step at 72 °C

536 for 5 minutes. PCR products were purified using the Bioline Isolate II PCR and Gel kit

537 (Bioline, Cat. No. BIO-52060). Samples were sequenced using the Illumina Miseq

538 platform. Amplicon library datasets were processed and curated using the Mothur

539 software platform (Schloss et al., 2009). Sequences shorter than 200 nucleotides or

540 containing ambiguous bases or homopolymeric runs greater than 7 were discarded.

541 Sequences were classified using Naive-Bayesian classifier against the Silva database

542 (v132) and VSEARCH software (Rognes et al., 2016) was used to remove chimeras.

543 Reads that were 97% similar were combined into operational taxonomic units (OTUs)

544 using the Opticlust method (Westcott and Schloss, 2017). OTU abundance values were

545 converted to relative values and the dataset was then transformed by square root and

546 statistically analyzed using the Primer-e (V7) software package (Gorley and Clarke,

547 2015). Reads were not classified against the Greengenes database, as the creators of

548 Mothur warn against this practice, citing poor alignment quality in the variable regions.

549 Metagenomic binning. As 16S rRNA sequence analysis indicated that there was no

550 statistical difference between triplicate samples, and that sample regions are therefore

551 homogenous, a single sample from sites CRU, RU, RM1, RM2, RL, CSU, SU, SM1,

24

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

552 SM2 and SL (Fig.1) collected in January and April 2018 was used to prepare shotgun

553 DNA libraries that were sequenced using IonTorrent Ion P1.1.17 Chip technology

554 (Central Analytical Facilities, Stellenbosch University, South Africa). Adapters were

555 trimmed and trailing bases (15 nts) were iteratively removed if the average quality score

556 for the last 30 nts was lower than 16, which resulted in approximately 30–45 million

557 reads per sample. Resultant metagenomic datasets were assembled into contiguous

558 sequences (contigs) with SPAdes version 3.12.0 (Bankevich et al., 2012) using the --

559 iontorrent and --only-assembler options with kmer values of 21,33,55,77,99,127. The --

560 iontorrent option enables a read error correction step performed by IonHammer,

561 correcting homopolymeric runs inherent to IonTorrent sequencing chemistry (Ershov et

562 al., 2019).

563 Quantification of metabolism-associated genes. All raw data, count tables and

564 scripts used to process the data can be accessed here:

565 https://github.com/samche42/Conserved_Stromatolite_bacteria_manuscript.git

566 Contigs in each of the 10 samples were clustered into kingdom bins using Autometa

567 (Miller et al., 2019). Genes on all contigs within these bins were identified using Prodigal

568 v.2.6.3 (Hyatt et al., 2010). Genes were then annotated against the KEGG database

569 using kofamscan (Aramaki et al., 2019) with output in mapper format. Coverage

570 corrected KO annotation counts were collected using kegg_parser.py found in the

571 GitHub repo listed above. Briefly, a dictionary of collection sites was made with all KO

572 numbers supplied in a KEGG annotation query list (i.e. functional group). Kofamscan

573 output per collection site was parsed and each time an entry matching that within the

574 query list was found, a count for that annotation was increased by the coverage of the

25

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

575 contig on which it was located, resulting in a coverage-corrected count table of

576 functional group KO annotations per collection site. In order to assess differences

577 between collection sites, these values were transformed relative to average coverage

578 per sample (weighted by contig length) using weighted_contig_coverage_calculator.py

579 (see GitHub repo) according to Eq.1, where i is contig length and j is contig coverage.

580 The abundances were then transformed using log2 (Fig. 3).

581 Equation 1. Contig length weighted coverage = ∑( (i / ∑(i)) x j )

582 Heatmaps for visualization of the data were created in R using the tidyr, ggplot2 and

583 viridis packages. Scripts can be found in the GitHub repo.

584 Clustering of metagenomic data in genome bins. Contigs were then clustered in

585 putative genomic bins using Autometa

586 (https://bitbucket.org/jason_c_kwan/autometa/src/master/, Master branch, commit:

587 a344c28) (Miller et al., 2019). Bins were manually curated and resulted in 183 genomic

588 bins. CheckM (Parks et al., 2015) was used to assess bin purity and completion using

589 default settings (Table S3). Approximately 37 genomes were of high quality, 75 were of

590 medium quality and 71 were of low quality, in accordance with MIMAG standards

591 defined by (Bowers et al., 2017) (Table S3).

592 Identification of conserved taxa. Conservation of bacterial taxa was calculated using

593 average nucleotide identities (ANIs) of all genomic bins, which were calculated in a

594 pairwise manner using FastANI (Jain et al., 2018). All genomic pairs sharing > 97% ANI

595 were subset and considered conserved taxa. Percentage of mapped regions used to

596 calculate ANI is reported in Table S2. Percentage of mapped regions is calculated as

26

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

597 the relative number of bidirectional fragments mapping to the total number of query

598 fragments.

599 Genome taxonomic classification. Genomes were classified using the standalone

600 GTDB-Tk tool (version 0.3.2) using the classify workflow and Genome

601 Database version 89 (Parks et al., 2019). The tool is unable to classify genomes less

602 than 10% complete, as indicated in Table S3. The GTDB-Tk tool uses the Genome

603 Taxonomy Database as a reference for classification, which is based on phylogeny

604 inferred from concatenated protein alignments. This approach enabled the removal of

605 polyphyletic groups and assignment of taxonomy from evolutionary divergence. The

606 resulting taxonomy incorporates substantial changes in comparison to the NCBI

607 taxonomy (Parks et al., 2018). Equivalent NCBI taxonomic classifications have been

608 provided for clarity in Table S3.

609 Genome taxonomic clustering. Phylogeny of bacterial genomes was inferred using

610 JolyTree (Criscuolo, 2019) with a sketch size of 10 000 (Fig. 5 and Fig. S1). JolyTree

611 infers phylogeny through computation of dissimilarity of kmer sketches, which is then

612 transformed for the estimation of substitution events of the genomes’ evolution

613 (Criscuolo, 2019).

614 Genome annotation. Manually curated putative genomic bins were annotated using

615 Prokka version 1.13 (Seemann, 2014), with GenBank compliance enabled. Protein-

616 coding amino-acid sequences from genomic bins were annotated against the KEGG

617 database using kofamscan (Aramaki et al., 2019) with output in mapper format. KEGG

618 orthologs were counted and processed as performed quantification of metabolism-

27

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

619 associated genes using kegg_parser.py. Relative percentage of genes in individual

620 genomes was calculated by dividing the contig-coverage corrected gene abundance per

621 gene by the total contig-coverage corrected gene abundance for the sample from which

622 it was binned (Fig. 5 and Fig. S1). E.g. The contig-coverage corrected gene abundance

623 of phoU in genome bin CRU1_1 was 3.52, and the total contig-coverage corrected gene

624 abundance of phoU in the CRU metagenome sample was 29.90253 (See dataset

625 Calculating_rel_perc.xlsx in GitHub repo). Therefore, the relative abundance of phoU in

626 genome bin CRU1_1 was: 3.52/29.90 X 100 = 11.79% of total sample gene abundance.

627 Archaeal genome binning and identification. Contigs are classified within kingdom

628 bins during Autometa binning. All contigs classified within the Archaea kingdom were

629 given putative taxonomic assignments based on single-copy markers and clustered into

630 bins. No bins could be acquired from the upper and middle formations from either

631 collection site due to few or no contigs being of archaeal origin. Two genomes were

632 obtained from the lower formation of each site respectively (Table S3). Genome quality

633 was assessed using CheckM as described for bacterial bins. 16S rRNA and 23S rRNA

634 sequences were extracted from each genome using barrnap 0.9

635 (https://github.com/tseemann/barrnap) using the archaeal databases (23S: SILVA-LSU-

636 Arc, 16S: RF01959) as reference. These sequences were then aligned against the nr

637 database using BLASTn (Johnson et al., 2008) for putative identification.

638 Data availability. Raw 16S rRNA gene amplicon sequence files were uploaded to the

639 NCBI sequence read archive (SRA) database in BioProject PRJNA574289. Raw reads

640 and binned genomes will be deposited in GenBank and respective accession numbers

641 will be included in the accepted version of this manuscript.

28

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

642 ACKNOWLEDGEMENTS

643 The authors acknowledge Caro Damarjanan who conducted a pilot study that informed

644 this work. We also wish to thank Karthik Anantharaman (University of Wisconsin-

645 Madison) for his helpful critique.

646

647 The authors acknowledge funding from the Gordon and Betty Moore Foundation (Grant

648 number 6920) (awarded to R.A.D and J.C.K.), and grants awarded to R.A.D by the

649 South African National Research Foundation (UID: 87583 and 109680). Development of

650 Autometa in J.C.K.’s laboratory and contributions by E.R.R. were supported by the U.S.

651 National Science Foundation (DBI-1845890). This research was performed in part using

652 the computer resources and assistance of the UW-Madison Center for High Throughput

653 Computing (CHTC) in the Department of Computer Sciences. The CHTC is supported

654 by UW-Madison, the Advanced Computing Initiative, the Wisconsin Alumni Research

655 Foundation, Wisconsin Institutes for Discovery, and the National Science Foundation

656 and is an active member of the Open Science Grid, which is supported by the National

657 Science Foundation and the U.S. Department of Energy’s Office of Science. The

658 authors also acknowledge the Center for High Performance Computing (CHPC, South

659 Africa) for providing computing facilities for bioinformatics data analysis.

660

661 The authors declare no conflict of interests.

662

663

664 REFERENCES

29

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

665 Aguilar-Barajas, E., Díaz-Pérez, C., Ramírez-Díaz, M.I., Riveros-Rosas, H., and

666 Cervantes, C. (2011) Bacterial transport of sulfate, molybdate, and related oxyanions.

667 Biometals 24: 687–707.

668 Allen, J.F. (2016) A Proposal for Formation of Archaean stromatolites before the advent

669 of oxygenic photosynthesis. Front Microbiol 7: 1784.

670 Aramaki, T., Blanc-Mathieu, R., Endo, H., Ohkubo, K., Kanehisa, M., Goto, S., and

671 Ogata, H. (2019) KofamKOALA: KEGG ortholog assignment based on profile HMM and

672 adaptive score threshold. bioRxiv. DOI: https://doi.org/10.1101/602110

673 Awramik, S.M. (1992) The History and Significance of Stromatolites. In, Schidlowski,M.,

674 Golubic,S., Kimberley,M.M., McKirdy,D.M., and Trudinger,P.A. (eds), Early Organic

675 Evolution: Implications for Mineral and Energy Resources. Berlin, Heidelberg: Springer

676 Berlin Heidelberg, pp. 435–449.

677 Babilonia, J., Conesa, A., Casaburi, G., Pereira, C., Louyakis, A.S., Reid, R.P., and

678 Foster, J.S. (2018) Comparative metagenomics provides insight into the ecosystem

679 functioning of the Shark Bay Stromatolites, Western Australia. Front Microbiol 9: 1359.

680 Balci, N., Gunes, Y., Sertcetin, E., and Kurt, M.A. (2018) Non-cyanobacterial community

681 shape the stromatolites of Lake Salda, SW, Turkey. American Geophysical Union, Fall

682 Meeting 2018, abstract #B43J-2972.

683 Bankevich, A., Nurk, S., Antipov, D., Gurevich, A.A., Dvorkin, M., Kulikov, A.S., et al.

684 (2012) SPAdes: A new genome assembly algorithm and its applications to single-cell

685 sequencing. J Comput Biol 19: 455–477.

686 Bontognali, T.R.R., Sessions, A.L., Allwood, A.C., Fischer, W.W., Grotzinger, J.P.,

687 Summons, R.E., and Eiler, J.M. (2012) Sulfur isotopes of organic matter preserved in

30

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

688 3.45-billion-year-old stromatolites reveal microbial metabolism. Proc Natl Acad Sci U S

689 A 109: 15146–15151.

690 Bowers, R.M., Kyrpides, N.C., Stepanauskas, R., Harmon-Smith, M., Doud, D., Reddy,

691 T.B.K., et al. (2017) Minimum information about a single amplified genome (MISAG)

692 and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat

693 Biotechnol 35: 725–731.

694 Casaburi, G., Duscher, A.A., Pamela Reid, R., and Foster, J.S. (2016) Characterization

695 of the stromatolite microbiome from Little Darby Island, The Bahamas using predictive

696 and whole shotgun metagenomic analysis. Environ Microbiol 18: 1452–1469.

697 Centeno, C.M., Mejía, O., and Falcón, L.I. (2016) Habitat conditions drive phylogenetic

698 structure of dominant bacterial phyla of microbialite communities from different locations

699 in Mexico. Rev Biol Trop 64: 1057–1065.

700 Chaumeil, P.-A., Mussig, A.J., Hugenholtz, P., and Parks, D.H. (2019) GTDB-Tk: A

701 toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics.

702 btz848.

703 Cosmidis, J., Benzerara, K., Guyot, F., Skouri-Panet, F., Duprat, E., Férard, C., et al.

704 (2015) Calcium-phosphate biomineralization induced by alkaline phosphatase activity in

705 Escherichia coli: Localization, kinetics, and potential signatures in the fossil record.

706 Front Earth Sci 3: 84.

707 Criscuolo, A. (2019) A fast alignment-free bioinformatics procedure to infer accurate

708 distance-based phylogenetic trees from genome assemblies. Riogrande Odontol 5:

709 e36178.

710 Danen-Louwerse, H.J., Lijklema, L., and Coenraats, M. (1995) Coprecipitation of

31

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

711 phosphate with calcium carbonate in Lake Veluwe. Water Res 29: 1781–1785.

712 Dodd, C., Anderson, C.R., Perissinotto, R., du Plooy, S.J., and Rishworth, G.M. (2018)

713 Hydrochemistry of peritidal stromatolite pools and associated freshwater inlets along the

714 Eastern Cape coast, South Africa. Sediment Geol 373: 163–179.

715 Drummond, J.B.R., Pufahl, P.K., Porto, C.G., and Carvalho, M. (2015) Neoproterozoic

716 peritidal phosphorite from the Sete Lagoas Formation (Brazil) and the Precambrian

717 phosphorus cycle. Sedimentology 62: 1978–2008.

718 Dupraz, C., Reid, R.P., Braissant, O., Decho, A.W., Norman, R.S., and Visscher, P.T.

719 (2009) Processes of carbonate precipitation in modern microbial mats. Earth-Sci Rev

720 96: 141–162.

721 Dupraz, C., Visscher, P.T., Baumgartner, L.K., and Reid, R.P. (2004) Microbe-mineral

722 interactions: Early carbonate precipitation in a hypersaline lake (Eleuthera Island,

723 Bahamas). Sedimentology 51: 745–765.

724 Ellis, H.R. (2011) Mechanism for sulfur acquisition by the alkanesulfonate

725 monooxygenase system. Bioorg Chem 39: 178–184.

726 Ershov, V., Tarasov, A., Lapidus, A., and Korobeynikov, A. (2019) IonHammer:

727 Homopolymer-space hamming clustering for IonTorrent read error correction. J Comput

728 Biol 26: 124–127.

729 Gallagher, K.L., Braissant, O., Kading, T.J., Dupraz, C., and Visscher, P.T. (2013)

730 Phosphate-related artifacts in carbonate mineralization experiments. J Sediment Res

731 83: 37–49.

732 Gallagher, K.L., Kading, T.J., Braissant, O., Dupraz, C., and Visscher, P.T. (2012)

733 Inside the alkalinity engine: The role of electron donors in the organomineralization

32

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

734 potential of sulfate-reducing bacteria. Geobiology 10: 518–530.

735 Gleeson, D.B., Wacey, D., Waite, I., O’Donnell, A.G., and Kilburn, M.R. (2016)

736 Biodiversity of living, non-marine, thrombolites of Lake Clifton, Western Australia.

737 Geomicrobiol J 33: 850–859.

738 Gorley, R.N. and Clarke, K.R. (2015) PRIMER v7: User Manual/Tutorial, Massey

739 University, Albany Campus.

740 Griffith, J.C. (2016) Insights into the soil microbial communities in New Zealand

741 indigenous tussock grasslands. (Thesis, Doctor of Philosophy). University of Otago.

742 Retrieved from http://hdl.handle.net/10523/6906

743 Guang-Can, T.A.O., Shu-Jun, T., Miao-Ying, C.A.I., and Guang-Hui, X.I.E. (2008)

744 Phosphate-solubilizing and-mineralizing abilities of bacteria isolated from soils.

745 Pedosphere 18: 515–523.

746 Herrmann, M., Wegner, C.-E., Taubert, M., Geesink, P., Lehmann, K., Yan, L., et al.

747 (2019) Predominance of Cand. Patescibacteria in groundwater is caused by their

748 preferential mobilization from soils and flourishing under oligotrophic conditions. Front

749 Microbiol 10: 1407.

750 Hyatt, D., Chen, G.-L., Locascio, P.F., Land, M.L., Larimer, F.W., and Hauser, L.J.

751 (2010) Prodigal: Prokaryotic gene recognition and translation initiation site identification.

752 BMC Bioinformatics 11: 119.

753 Jain, C., Rodriguez-R, L.M., Phillippy, A.M., Konstantinidis, K.T., and Aluru, S. (2018)

754 High throughput ANI analysis of 90K prokaryotic genomes reveals clear species

755 boundaries. Nat Commun 9: 5114.

756 Johnson, M., Zaretskaya, I., Raytselis, Y., Merezhuk, Y., McGinnis, S., and Madden,

33

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

757 T.L. (2008) NCBI BLAST: A better web interface. Nucleic Acids Res 36: W5–9.

758 Kahnert, A., Vermeij, P., Wietek, C., James, P., Leisinger, T., and Kertesz, M.A. (2000)

759 The ssu locus plays a key role in organosulfur metabolism in Pseudomonas putida S-

760 313. J Bacteriol 182: 2869–2878.

761 Kan, A.T., Fu, G., and Tomson, M.B. (2005) Adsorption and precipitation of an

762 aminoalkylphosphonate onto calcite. J Colloid Interface Sci 281: 275–284.

763 Keegan, K.P., Glass, E.M., and Meyer, F. (2016) MG-RAST, a metagenomics service

764 for analysis of microbial community structure and function. Microbial Environmental

765 Genomics (MEG) 1399: 207–233.

766 Khodadad, C.L.M. and Foster, J.S. (2012) Metagenomic and metabolic profiling of

767 nonlithifying and lithifying stromatolitic mats of Highborne Cay, the Bahamas. PLoS One

768 7: e38229.

769 Konopacka-Łyskawa, D., Kościelska, B., Karczewski, J., and Gołąbiewska, A. (2017)

770 The influence of ammonia and selected amines on the characteristics of calcium

771 carbonate precipitated from calcium chloride solutions via carbonation. Mater Chem

772 Phys 193: 13–18.

773 Kurth, D., Amadio, A., Ordoñez, O.F., Albarracín, V.H., Gärtner, W., and Farías, M.E.

774 (2017) Arsenic metabolism in high altitude modern stromatolites revealed by

775 metagenomic analysis. Sci Rep 7: 1024.

776 Lee, Y.S. and Park, W. (2019) Enhanced calcium carbonate-biofilm complex formation

777 by alkali-generating Lysinibacillus boronitolerans YS11 and alkaliphilic Bacillus sp.

778 AK13. AMB Express 9: 49.

779 Macintyre, I.G., Prufert-Bebout, L., and Reid, R.P. (2000) The role of endolithic

34

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

780 cyanobacteria in the formation of lithified laminae in Bahamian stromatolites.

781 Sedimentology 47: 915–921.

782 Matcher, G.F., Dorrington, R.A., Henninger, T.O., and Froneman, P.W. (2011) Insights

783 into the bacterial diversity in a freshwater-deprived permanently open Eastern Cape

784 estuary, using 16S rRNA pyrosequencing analysis. Water SA 37.3: 381 - 390.

785 Medina-Chávez, N.O., Viladomat-Jasso, M., Olmedo-Álvarez, G., Eguiarte, L.E., Souza,

786 V., and De la Torre-Zavala, S. (2019) Diversity of Archaea domain in Cuatro Cienegas

787 Basin: Archaean Domes. bioRxiv 766709. DOI: https://doi.org/10.1101/766709

788 Metcalf, W.W. and Wanner, B.L. (1993) Evidence for a fourteen-gene, phnC to phnP

789 locus for phosphonate metabolism in Escherichia coli. Gene 129: 27–32.

790 Miller, I.J., Rees, E.R., Ross, J., Miller, I., Baxa, J., Lopera, J., et al. (2019) Autometa:

791 Automated extraction of microbial genomes from individual shotgun metagenomes.

792 Nucleic Acids Res 47: e57.

793 Misi, A. and Richard Kyle, J. (1994) Upper Proterozoic carbonate stratigraphy,

794 diagenesis, and stromatolitic phosphorite formation, Irece Basin, Bahia, Brazil. J

795 Sediment Res 64: 299–310.

796 Mobberley, J.M., Khodadad, C.L.M., Visscher, P.T., Reid, R.P., Hagan, P., and Foster,

797 J.S. (2015) Inner workings of thrombolites: Spatial gradients of metabolic activity as

798 revealed by metatranscriptome profiling. Sci Rep 5: 12601.

799 Moreno-Vivián, C., Cabello, P., Martínez-Luque, M., Blasco, R., and Castillo, F. (1999)

800 Prokaryotic nitrate reduction: Molecular properties and functional distinction among

801 bacterial nitrate reductases. J Bacteriol 181: 6573–6584.

802 Nutman, A.P., Bennett, V.C., Friend, C.R.L., Van Kranendonk, M.J., and Chivas, A.R.

35

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

803 (2016) Rapid emergence of life shown by discovery of 3,700-million-year-old microbial

804 structures. Nature 537: 535–538.

805 Pace, A., Bourillot, R., Bouton, A., Vennin, E., Braissant, O., Dupraz, C., et al. (2018)

806 Formation of stromatolite lamina at the interface of oxygenic-anoxygenic

807 photosynthesis. Geobiology 16: 378–398.

808 Parks, D.H., Chuvochina, M., Chaumeil, P.-A., Rinke, C., Mussig, A.J., and Hugenholtz,

809 P. (2019) Selection of representative genomes for 24,706 bacterial and archaeal

810 species clusters provide a complete genome-based taxonomy. bioRxiv 771964. DOI:

811 https://doi.org/10.1101/771964

812 Parks, D.H., Chuvochina, M., Waite, D.W., Rinke, C., Skarshewski, A., Chaumeil, P.-A.,

813 and Hugenholtz, P. (2018) A standardized bacterial taxonomy based on genome

814 phylogeny substantially revises the tree of life. Nat Biotechnol 36: 996–1004.

815 Parks, D.H., Imelfort, M., Skennerton, C.T., Hugenholtz, P., and Tyson, G.W. (2015)

816 CheckM: Assessing the quality of microbial genomes recovered from isolates, single

817 cells, and metagenomes. Genome Res 25: 1043–1055.

818 Payandi-Rolland, D., Roche, A., Vennin, E., Visscher, P.T., Amiotte-Suchet, P.,

819 Thomas, C., and Bundeleva, I.A. (2019) Carbonate precipitation in mixed cyanobacterial

820 biofilms forming freshwater microbial tufa. Minerals 9: 409.

821 Perissinotto, R., Bornman, T.G., Steyn, P.-P., Miranda, N.A.F., Dorrington, R.A.,

822 Matcher, G.F., et al. (2014) Tufa stromatolite ecosystems on the South African south

823 coast. S Afr J Sci 110: 01–08.

824 Ragot, S.A., Kertesz, M.A., and Bünemann, E.K. (2015) phoD Alkaline phosphatase

825 gene diversity in soil. Appl Environ Microbiol 81: 7281–7289.

36

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

826 Ragot, S.A., Kertesz, M.A., Mészáros, É., Frossard, E., and Bünemann, E.K. (2017) Soil

827 phoD and phoX alkaline phosphatase gene diversity responds to multiple environmental

828 factors. FEMS Microbiol Ecol 93: fiw212

829 Reid, R.P., Visscher, P.T., Decho, A.W., Stolz, J.F., Bebout, B.M., Dupraz, C., et al.

830 (2000) The role of microbes in accretion, lamination and early lithification of modern

831 marine stromatolites. Nature 406: 989–992.

832 Rishworth, G.M., Cawthra, H.C., Dodd, C., and Perissinotto, R. (2019) Peritidal

833 stromatolites as indicators of stepping-stone freshwater resources on the Palaeo-

834 Agulhas Plain landscape. Quat Sci Rev 105704.

835 Rishworth, G.M., van Elden, S., Perissinotto, R., Miranda, N.A.F., Steyn, P.-P., and

836 Bornman, T.G. (2016) Environmental influences on living marine stromatolites: Insights

837 from benthic microalgal communities. Environ Microbiol 18: 503–513.

838 Rishworth, G.M., Perissinotto, R., Bird, M.S., and Pelletier, N. (2018) Grazer responses

839 to variable macroalgal resource conditions facilitate habitat structuring. R Soc Open Sci

840 5: 171428.

841 Rishworth, G.M., Perissinotto, R., Bird, M.S., Strydom, N.A., Peer, N., Miranda, N.A.F.,

842 and Raw, J.L. (2017) Non-reliance of metazoans on stromatolite-forming microbial mats

843 as a food resource. Sci Rep 7: 42614.

844 Rishworth, G.M., Perissinotto, R., Bornman, T.G., and Lemley, D.A. (2017) Peritidal

845 stromatolites at the convergence of groundwater seepage and marine incursion:

846 Patterns of salinity, temperature and nutrient variability. J Mar Syst 167: 68–77.

847 Rodriguez-Navarro, C., Rodriguez-Gallego, M., Ben Chekroun, K., and Gonzalez-

848 Muñoz, M.T. (2003) Conservation of ornamental stone by Myxococcus xanthus-induced

37

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

849 carbonate biomineralization. Appl Environ Microbiol 69: 2182–2193.

850 Rognes, T., Flouri, T., Nichols, B., Quince, C., and Mahé, F. (2016) VSEARCH: A

851 versatile open source tool for metagenomics. PeerJ 4: e2584.

852 Rott, E., Steinmetz, H., and Metzger, J.W. (2018) Organophosphonates: A review on

853 environmental relevance, biodegradability and removal in wastewater treatment plants.

854 Sci Total Environ 615: 1176–1191.

855 Ruvindy, R., White, R.A., 3rd, Neilan, B.A., and Burns, B.P. (2016) Unravelling core

856 microbial metabolisms in the hypersaline microbial mats of Shark Bay using high-

857 throughput metagenomics. ISME J 10: 183–196.

858 Santos, A.A., Venceslau, S.S., Grein, F., Leavitt, W.D., Dahl, C., Johnston, D.T., and

859 Pereira, I.A.C. (2015) A protein trisulfide couples dissimilatory sulfate reduction to

860 energy conservation. Science 350: 1541–1545.

861 Schloss, P.D., Westcott, S.L., Ryabin, T., Hall, J.R., Hartmann, M., Hollister, E.B., et al.

862 (2009) Introducing mothur: Open-source, platform-independent, community-supported

863 software for describing and comparing microbial communities. Appl Environ Microbiol

864 75: 7537–7541.

865 Seemann, T. (2014) Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30:

866 2068–2069.

867 Shiraishi, F., Ohnishi, S., Hayasaka, Y., Hanzawa, Y., Takashima, C., Okumura, T., and

868 Kano, A. (2019) Potential photosynthetic impact on phosphate stromatolite formation

869 after the Marinoan glaciation: Paleoceanographic implications. Sediment Geol 380: 65–

870 82.

871 Smith, A., Cooper, A., Misra, S., Bharuth, V., Guastella, L., and Botes, R. (2018) The

38

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

872 extant shore platform stromatolite (SPS) facies association: A glimpse into the

873 Archean? Biogeosciences 15: 2189–2203.

874 Soo, R.M., Hemp, J., Parks, D.H., Fischer, W.W., and Hugenholtz, P. (2017) On the

875 origins of oxygenic photosynthesis and aerobic respiration in Cyanobacteria. Science

876 355: 1436–1440.

877 Stasi, R., Neves, H.I., and Spira, B. (2019) Phosphate uptake by the phosphonate

878 transport system PhnCDE. BMC Microbiol 19: 79.

879 Stüeken, E.E. (2016) Nitrogen in ancient mud: A biosignature? Astrobiology 16: 730–

880 735.

881 Thomazo, C., Ader, M., and Philippot, P. (2011) Extreme 15N-enrichments in 2.72-Gyr-

882 old sediments: Evidence for a turning point in the nitrogen cycle. Geobiology 9: 107–

883 120.

884 Visscher, P.T., Pamela Reid, R., and Bebout, B.M. (2000) Microscale observations of

885 sulfate reduction: Correlation of microbial activity with lithified micritic laminae in modern

886 marine stromatolites. Geology 28: 919–922.

887 Visscher, P.T. and Stolz, J.F. (2005) Microbial mats as bioreactors: Populations,

888 processes, and products. In, Noffke,N. (ed), Geobiology: Objectives, Concepts,

889 Perspectives. Amsterdam: Elsevier, pp. 87–100.

890 Warden, J.G., Casaburi, G., Omelon, C.R., Bennett, P.C., Breecker, D.O., and Foster,

891 J.S. (2016) Characterization of microbial mat microbiomes in the modern thrombolite

892 ecosystem of Lake Clifton, Western Australia using shotgun metagenomics. Front

893 Microbiol 7: 1064.

894 Wei, S., Cui, H., Jiang, Z., Liu, H., He, H., and Fang, N. (2015) Biomineralization

39

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

895 processes of calcite induced by bacteria isolated from marine sediments. Braz J

896 Microbiol 46: 455–464.

897 Westcott, S.L. and Schloss, P.D. (2017) OptiClust, an improved method for assigning

898 amplicon-based sequence data to operational taxonomic units. mSphere 2: e00073-17.

899 White, A.K. and Metcalf, W.W. (2007) Microbial metabolism of reduced phosphorus

900 compounds. Annu Rev Microbiol 61: 379–400.

901 White, R.A., Chan, A.M., Gavelis, G.S., Leander, B.S., Brady, A.L., Slater, G.F., et al.

902 (2016) Metagenomic analysis suggests modern freshwater microbialites harbor a

903 distinct core microbial community. Front Microbiol 6: 1531.

904 Witze, A. (2016) Claims of Earth’s oldest fossils tantalize researchers. Nature News.

905 DOI:10.1038/nature.2016.20506

906 Wong, H.L., Visscher, P.T., White, R.A., III, Smith, D.-L., Patterson, M.M., and Burns,

907 B.P. (2017) Dynamics of archaea at fine spatial scales in Shark Bay mat microbiomes.

908 Sci Rep 7: 46160.

909 Wong, H.L., White, R.A., Visscher, P.T., Charlesworth, J.C., Vázquez-Campos, X., and

910 Burns, B.P. (2018) Disentangling the drivers of functional complexity at the

911 metagenomic level in Shark Bay microbial mat microbiomes. ISME J 12: 2619–2639.

912 Wood, S.A., Depree, C., Brown, L., McAllister, T., and Hawes, I. (2015) Entrapped

913 sediments as a source of phosphorus in epilithic cyanobacterial proliferations in low

914 nutrient rivers. PLoS One 10: e0141063.

915 Yang, J., Junium, C.K., Grassineau, N.V., Nisbet, E.G., Izon, G., Mettam, C., et al.

916 (2019) Ammonium availability in the Late Archaean nitrogen cycle. Nature Geoscience

917 12: 553–557.

40

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

918 Zaheer, R., Morton, R., Proudfoot, M., Yakunin, A., and Finan, T.M. (2009) Genetic and

919 biochemical properties of alkaline phosphatases phoX family protein found in many

920 bacteria. Environ Microbiol 11: 1572–1587.

921

922 TABLE AND FIGURE LEGENDS

923 Figure 1. Stromatolites were collected from four different points at two different sites

924 along the SA coastline. (A) Aerial view of sampling locations within the

925 Schoenmakerskop site and (B) the Cape Recife site. Contour lines represent elevation

926 at 5cm increments. Samples were collected from upper, middle and lower formations.

927 The boundaries of which are indicated with a dotted blue line. (C - D) A 3-dimensional

928 rendering of the sample sites at Schoenmakerskop and Cape Recife respectively

929 constructed by a 3rd party (Caelum Technologies) using a combination of drone-based

930 photogrammetry and geographic mapping using differential GPS. Samples were

931 collected from upper, middle and lower formations; the boundaries of which are

932 indicated with a dotted blue line. Abbreviations are as follows: CSU: Schoenmakerskop

933 Upper Jan SU: Schoenmakerskop Upper April, CRU: Cape Recife Upper (Jan), RU:

934 Cape Recife Upper (April), SM1: Schoenmakerskop Middle 1 (April), SM2:

935 Schoenmakerskop Middle 2 (April), RM1: Cape Recife Middle 1 (April), RM2: Cape

936 Recife Middle 1 (April), SL: Schoenmakerskop Lower (April), RL: Cape Recife Lower

937 (April).

938

939 Figure 2. Distribution and abundance of bacterial taxa in stromatolite formations based

940 on 16S rRNA gene fragment amplicon libraries. (A) Phylogenetic classification and

41

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

941 average relative abundance (n=3) of dominant phyla in different sample sites indicated

942 that all stromatolite samples are dominated by Cyanobacteria, Bacteroidetes, Alpha-

943 and Gammaproteobacteria. (B) OTU abundance was used to cluster stromatolite

944 biological replicates using Bray-Curtis non-dimensional scaling and showed statistically

945 significant clustering of replicate samples from each of the sampled regions. Samples

946 were isolated from upper (green), middle (red) and lower (blue) stromatolites formations

947 in Cape Recife (triangles) and Schoenmakerskop (circles).

948

949 Figure 3. Summary of phosphate, nitrogen and sulfate transport and metabolism genes

950 in overall metagenomic data from stromatolites in Schoenmakerskop and Cape Recife

951 upper, middle and lower formations respectively. Gene abundance is expressed relative

952 to the length-weighted average contig coverage per sample and transformed using log2

953 scaling. Note: On the color scale, grey indicates genes which were not detected in the

954 respective sample. Abbreviations are as follows: CSU: Schoenmakerskop Upper Jan

955 SU: Schoenmakerskop Upper April, CRU: Cape Recife Upper (Jan), RU: Cape Recife

956 Upper (April), SM1: Schoenmakerskop Middle 1 (April), SM2: Schoenmakerskop Middle

957 2 (April), RM1: Cape Recife Middle 1 (April), RM2: Cape Recife Middle 1 (April), SL:

958 Schoenmakerskop Lower (April), RL: Cape Recife Lower (April).

959

960 Figure 4. Taxonomic classification of putative genome bins in stromatolites collected

961 from upper/inflow, middle and lower/marine formations of Schoenmakerskop and Cape

962 Recife. Coverage per genome has been used as a proxy for abundance and used to

963 scale the size of individual genome bars, expressed as a percentage of the sum of all

42

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

964 genome bin coverages. (A) The coverage of conserved (diagonal lines) and non-

965 conserved (solid colors) bacterial genomes. Taxonomic classification of each genome is

966 indicated by color. (B). Taxonomic classifications of conserved bacterial species in each

967 of the samples, generated with GTDB-Tk (Chaumeil et al., 2019).

968

969 Figure 5. Summary of phosphate, nitrogen and sulfate transport and metabolism genes

970 in conserved genomes from bacteria associated with stromatolites in Schoenmakerskop

971 and Cape Recife upper, middle and lower formations respectively. Relative gene

972 abundance per sample is indicated using a gradient color scale. Relative gene

973 abundance was calculated by dividing coverage-corrected gene abundance in each

974 genome by the coverage-corrected gene abundance from the metagenomic sample

975 from which it came. CSU: Schoenmakerskop Upper Jan SU: Schoenmakerskop Upper

976 April, CRU: Cape Recife Upper (Jan), RU: Cape Recife Upper (April), SM1:

977 Schoenmakerskop Middle 1 (April), SM2: Schoenmakerskop Middle 2 (April), RM1:

978 Cape Recife Middle 1 (April), RM2: Cape Recife Middle 1 (April), SL: Schoenmakerskop

979 Lower (April), RL: Cape Recife Lower (April).

980

981 Figure S1. Relative abundance of phosphate, nitrogen and sulfate transport and

982 metabolism genes per genome relative to total gene abundance per sample from

983 Schoenmakerskop and Cape Recife upper, middle and lower formations. Relative gene

984 abundance per sample is indicated using a gradient color scale. Relative gene

985 abundance was calculated by dividing coverage-corrected gene abundance in each

986 genome by the coverage-corrected gene abundance from the metagenomic sample

43

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

987 from which it came. CSU: Schoenmakerskop Upper Jan SU: Schoenmakerskop Upper

988 April, CRU: Cape Recife Upper (Jan), RU: Cape Recife Upper (April), SM1:

989 Schoenmakerskop Middle 1 (April), SM2: Schoenmakerskop Middle 2 (April), RM1:

990 Cape Recife Middle 1 (April), RM2: Cape Recife Middle 1 (April), SL: Schoenmakerskop

991 Lower (April), RL: Cape Recife Lower (April).

992

993 Table 1. Conserved bacterial species (ANI > 97%) and their functional potential across

994 upper, middle and lower stromatolite formations at Cape Recife and Schoenmakerskop.

995

996 Table S1. Summary of genomes binned per collected stromatolite sample

997

998 Table S2. Conserved bacterial species defined by shared ANI greater than 97% in

999 stromatolite formations from Cape Recife and Schoenmakerskop.

1000

1001 Table S3. Summary of characteristics and taxonomic classifications of genomes binned

1002 from shotgun metagenomic data from sampled upper, middle and lower stromatolite

1003 formations from Cape Recife and Schoenmakerskop. Bacterial bins are listed first, with

1004 the two archaeal bins listed at the bottom.

44

bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/818625; this version posted March 14, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.