bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

1 In situ relationships between microbiota and potential pathobiota in Arabidopsis thaliana

2 Claudia Bartoli1¶, Léa Frachon1¶, Matthieu Barret2, Mylène Rigal1, Carine Huard-Chauveau1,

3 Baptiste Mayjonade1, Catherine Zanchetta3, Olivier Bouchez3, Dominique Roby1, Sébastien

4 Carrère1, Fabrice Roux1*

5

6 Affiliations:

7 1 LIPM, Université de Toulouse, INRA, CNRS, Castanet-Tolosan, France

8 2 IRHS, INRA, AGROCAMPUS-Ouest, Université d’Angers, SFR4207 QUASAV, 42, rue

9 Georges Morel, 49071 Beaucouzé, France

10 3 INRA, GeT-PlaGe, Genotoul, Castanet-Tolosan, France

11 ¶These authors contributed equally to this work.

12

13 Corresponding author: [email protected]

1

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

14 ABSTRACT

15 A current challenge in microbial pathogenesis is to identify biological control agents that may

16 prevent and/or limit host invasion by microbial pathogens. In natura, hosts are often infected

17 by multiple pathogens. However, most of the current studies have been performed under

18 laboratory controlled conditions and by taking into account the interaction between a single

19 commensal species and a single pathogenic species. The next step is therefore to explore the

20 relationships between host-microbial communities (microbiota) and microbial members with

21 potential pathogenic behavior (pathobiota) in a realistic ecological context. In the present

22 study, we investigated such relationships within root and leaf associated bacterial

23 communities of 163 ecologically contrasted Arabidopsis thaliana populations sampled across

24 two seasons in South-West of France. In agreement with the theory of the invasion paradox,

25 we observed a significant humped-back relationship between microbiota and pathobiota α-

26 diversity that was robust between both seasons and plant organs. In most populations, we also

27 observed a strong dynamics of microbiota composition between seasons. Accordingly, the

28 potential pathobiota composition was explained by combinations of season-specific

29 microbiota OTUs. This result suggests that the potential biomarkers controlling pathogen's

30 invasion are highly dynamic.

2

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

31 INTRODUCTION

32 In the last decade, a conspicuous effort has been made to characterize and understand

33 the role of host-associated microorganisms. Animals and plants are currently considered as

34 holobionts associated with the commensals that inhabit and evolve with them (Youle et al.,

35 2013; Bordenstein and Theis, 2015; Vandenkoornhuyse et al., 2015). However, how

36 commensal microbes interact with pathogens remains largely unknown. It has been recently

37 reported that the human gut microbiota can protect against the potential overgrowth of

38 indigenous opportunistic pathobionts (i.e. pathogenic species that inhabit the host at a low

39 bacterial population size) (Vayssier-Taussat et al., 2014) or pathogenic invaders (Kamada et

40 al., 2013) by niche competition and/or induction of the host immune system (Belkaid and

41 Hand, 2014). In plants, both rhizosphere- and phyllosphere-associated can directly or

42 indirectly increase host resistance to phytopathogenic microorganisms via production of

43 antimicrobial compounds or elicitation of plant defences (Mendes et al., 2013; Berg et al.,

44 2014; Bodenhausen et al., 2014; Haney et al., 2015; Santhanam et al., 2015; Ritpitakphong et

45 al., 2016). For example, plant-associated strains belonging to the Sphingomonas genus can

46 protect the model annual plant Arabidopsis thaliana from infections caused by the causal

47 agent of bacterial spot Pseudomonas syringae (Innerebner et al., 2011). Although these

48 studies support the importance of certain microbes in protecting plants against infections, they

49 mainly focused on the interaction between a single commensal species and a single

50 pathogenic strain. Nevertheless, as previously observed in animals (Singer, 2010; Short et al.,

51 2014), plants are often infected in natura by multiple pathogens (Bartoli et al., 2016;

52 Lamichhane and Venturi, 2015; Roux and Bergelson, 2016). Therefore, the current challenge

53 in plant pathology is to investigate the relationships between resident microbial communities

54 and pathogenic microbes at the community level, i.e. between microbiota (defined here as the

55 microbial communities inhabiting plants) and pathobiota (defined here as the complex of

3

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

56 microorganisms with the potential to cause disease on a given plant host; Bartoli and Roux,

57 2017).

58 Understanding how the native cortege of beneficial/commensal microbes interacts to

59 protect plants against the invasion of a cortege of pathogenic species requires testing for

60 community ecological theories (Mallon, et al., 2018). In plant community ecology, substantial

61 efforts have been made in the last decades to understand the relationships between α-diversity

62 of resident species and α-diversity of invasive species (Fridley et al. 2007) by both applying

63 spatial pattern studies and constructed community studies (Naeem et al. 2000; Tomasetto et

64 al. 2013). In a large number of studies conducted at large spatial scale, species richness of

65 both resident and invasive species were found to be positively correlated with the quality of

66 the abiotic environment, thereby leading to a positive relationship between resident species

67 diversity and invader success (Eisenhauer et al., 2013a) (Supplementary Figure S1). On the

68 other hand, as predicted by Elton's theory (1958), constructed community studies performed

69 in homogenous abiotic environments revealed a negative relationship between resident

70 species diversity and invader success (Naeem et al., 2000). This negative relationship is

71 explained by the negative impact of the increased resident species diversity on resource

72 availability, which is in turn detrimental for the establishment of the invasive species

73 (Supplementary Figure S1). These two contrasted phenomena lead to the invasion paradox

74 (humped-back relationship between the diversity of invaders and the diversity of resident

75 species, Supplementary Figure S1), which is therefore predicted to mainly result from the

76 interplay between species diversity, resource availability and niche dimensionality in

77 ecologically relevant conditions (Mallon et al., 2015a).

78 Testing for the invasion paradox between microbiota and pathobiota therefore requires

79 a characterization of the microbial communities across the range of native habitats

80 encountered by a given plant species (Roux & Bergelson 2016). Accordingly, recent studies

4

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

81 highlighted that the ecology of the habitats where both plants and microorganisms co-exist

82 and evolve is a crucial variable to take into account when investigating the relationships

83 between a host and its microbes participating in the holobiont system (Agler et al. 2016,

84 Wagner et al. 2016). In addition, in native habitats, plants are naturally exposed to pathogens

85 whose attacks are influenced by local abiotic/biotic conditions (Bartoli et al., 2016).

86 In this study, we aimed to investigate the in situ relationships between bacterial

87 communities and the potential bacterial pathobiota in 163 natural A. thaliana populations

88 collected in southwest of France and inhabiting ecologically contrasted habitats. In particular,

89 we aimed (i) to test for the invasion paradox and (ii) to identify the combinations of microbial

90 species that can prevent and/or limit pathogen invasion. Because bacterial communities

91 associated with plants can rapidly change within the host life cycle (Chaparro et al., 2014;

92 Schlaeppi et al., 2014; Copeland et al., 2015; Agler et al., 2016) and can largely differ

93 between plant compartments (Bodenhausen et al., 2013; Coleman-Derr et al., 2016; Wagner

94 et al., 2016), we described both microbiota and potential pathobiota in the leaf and root

95 compartments across two seasons within a single life cycle of A. thaliana.

5

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

96 MATERIAL AND METHODS

97 Identification of A. thaliana populations

98 In this study, we focused on 163 natural A. thaliana populations identified in May

99 2014 in the Midi-Pyrénées region (Figure 1a, Supplementary Table 1). These populations

100 were chosen to maximize the diversity of habitats (such as climate, soil type, vegetation type

101 and degree of anthropogenic perturbation) encountered by A. thaliana (Figure 1b). Because

102 the 163 populations strongly differed in their main germination cohort in autumn 2014 (early

103 November vs early December; Supplementary Text), we defined three seasonal groups

104 hereafter named (i) ‘autumn’ corresponding to 84 populations collected in

105 November/December 2014, (ii) ‘spring with autumn’ corresponding to 80 populations already

106 sampled in autumn and additionally sampled in early-spring (February/March 2015), and (iii)

107 ‘spring without autumn’, corresponding to 79 populations only sampled in early-spring

108 (February/March 2015) (Supplementary Text).

109 To avoid a confounding effect between the sampling date and geographical origin,

110 populations were randomly collected during the sampling periods in autumn 2014 and early-

111 spring 2015.

112

113 Sampling, generation of the gyrB amplicons and sequences

114 The bacterial communities were characterized through amplification of a fraction of

115 gyrB gene encoding for the bacterial gyrase β subunit . This molecular marker has a deeper

116 taxonomic resolution than other molecular markers designed on the hypervariable regions of

117 the 16S rRNA gene (Barret et al., 2015), thereby allowing to distinguish bacterial OTUs at the

118 species level. Furthermore, this single-copy gene limits the overestimation of taxa carrying

6

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

119 multiple copies of rrn operons. In this study, based on synthetic communities, we confirmed a

120 better taxonomic resolution of the gyrB gene compared with the 16S rRNA gene

121 (Supplementary Text, Supplementary Data 1).

122 To characterize the bacterial communities, ~4 individuals per population and per

123 season were sampled in situ at the rosette stage resulting in a total number of 1,912 leaf and

124 root samples (Supplementary Text). The epiphytic and endophytic bacterial components of

125 either leaf or root samples were not separated. For each plant compartment, we therefore

126 extracted the total DNA of both epiphytic and endophytic microbes (Supplementary Text).

127 The amplicon gyrB was amplified with some modification of the protocol described in Barret

128 et al. (2015). In particular, three internal tags were added at each 5' and 3' of the original

129 primers to allow the multiplexing of three 96-well plates (Supplementary Text).

130 For each sample, PCR amplifications were repeated three times and technical

131 replicates were pooled in a unique PCR plate. PCR products were purified by using

132 Agencourt®AMPure® magnetic beads following manufacturer's instructions and purified

133 amplicons were quantified with Nanodrop and appropriately diluted to obtain an equimolar

134 concentration. Two µl of equimolar PCR purified products were used for a second PCR with

135 the Illumina adaptors. The second PCR amplicons were then purified and quantified as

136 described above to obtain a unique equimolar pool. The latter was quantified by RQ-PCR and

137 then sequenced with Illumina MiSeq 2X250 v3 (Illumina Inc., San Diego, CA, USA) in the

138 GeT-PlaGe Platform (Toulouse, France). MS-102-3003 MiSeq Reagent Kit v3 600 cycle was

139 used for this purpose.

140

141

142

143

7

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

144 Bioinformatics analysis and data curation

145 Reads were demultiplexed by considering the three internal tags. After demultiplexing,

146 the average number of sequences per sample was of 46,135 ± 19,820. Prior further analysis,

147 the negative controls – consisted by the sterilized water used to clean up the samples during

148 field sampling, the sterilized DNA free water used to elute the DNA after phenol/chloroform

149 extraction and the DNA free water used for PCR amplifications – were checked for

150 presence/absence of amplicons by blasting them against the gyrB database composed by

151 30,627 sequences (Barret et al. 2015; Supplementary Text). Because negative controls

152 showed no trace of entire gyrB sequences, they were removed before clustering. Samples

153 showing low sequence quality were also removed before clustering, resulting in a total of

154 1,903 samples. Taxonomic affiliation of gyrB sequences was performed by using a Bayesian

155 classifier (Wang et al., 2007) implemented in the classify.seqs command of mothur (Schloss

156 et al., 2009) against an in-house gyrB database containing 30,627 representative sequences

157 with an 80% bootstrap confidence score. Clustering of sequences into Operational Taxonomic

158 Units (OTUs) was performed with Swarm (Mahé et al., 2015) by using a clustering threshold

159 (d) = 1. Only the OTUs that were composed by a minimum of 5 sequences across all samples

160 were kept, resulting in a total of 278,336 OTUs. Then, we applied two steps of filtering. First,

161 to control for sampling limitation within each sample, we estimated a Good’s coverage score

162 for each sample (Schloss et al., 2009). Based on the distribution of the Good’s coverage score,

163 only samples with a score more than 0.5 were considered. Second, only OTUs showing a

164 minimum relative abundance of 1% in at least one sample were selected by using a home-

165 made perl program. The final data set corresponded to a matrix of 1,655 samples by 6,627

166 OTUs. The 6,627 OTUs obtained after the filtering of the data represented 2.4 % of the

167 totality of the OTUs but 55.6 % of the totality of the reads (Supplementary Figure S2). In

168 addition, the mean number of reads per OTU maintained after filtering was 51.3 fold higher

8

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

169 than the one of the discarded OTUs (Supplementary Figure S2). This final data set was used

170 to determine the matrix composed by the potential pathogenic species. More precisely, the

171 potential pathobiota in both leaves and roots of the 163 A. thaliana populations was

172 determined by using a list of the phytopathogenic bacteria established by the International

173 Society of Plant Pathology Committee on the of Plant Pathogenic Bacteria (ISPP-

174 CTPPB; Supplementary Data 2 ) (Bull et al., 2010, 2014). This potential pathobiota list

175 (Supplementary Data Set 2) was composed by 199 bacterial species that were filtered on the

176 microbiota matrix. The potential pathobiota matrix resulted in 11 bacterial species that were

177 further curated. More precisely,we only considered: i) the bacterial species already reported to

178 colonize A. thaliana (Pseudomonas syringae, Pseudomonas viridiflava, Pantoea

179 agglomerans, Sphingomonas melonis and Xanthomonas campestris) (Jakob et al., 2002,

180 Kniskern et al., 2007), ii) the bacterial species pathogenic on Brassicaceae species

181 (Pseudomonas marginalis, Streptomyces scabei and Xanthomonas perforans) (Charron &

182 Sams, 2002, Lerat et al., 2009) , iii) the causal agent of lamb’s lettuce spot

183 valerianellae reported to be widely distributed in France (Gardan et al., 2003). The bacterial

184 species Diaphorobacter oryzae and Janthinobacterium agaricidamnosum were removed from

185 the potential pathobiota matrix because they have been reported previously only in non-plant

186 habitats (Pham et al., 2009) and mushrooms (Lincoln et al., 1999), respectively. The resulted

187 potential pathobiota matrix was composed by 1203 samples and 29 OTUs characterizing the

188 nine bacterial species listed above (i.e. a given bacterial species can be included in more than

189 one OTU). The 29 potential pathogenic OTUs were removed from the microbiota matrix.

190 Therefore, the final microbiota matrix used for further analysis was composed by 6,598

191 OTUs.

192

193

9

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

194 Analysis of the α and β-diversity and characterization of the potential pathobiota

195 Shannon diversity and observed species richness were estimated on the final OTU

196 matrix by using the summary.single function of mothur (Schloss et al., 2009). Indexes of α-

197 diversity were also calculated by sample rarefaction of 300/600/900 iters. Results between

198 non-rarefied and rarefied samples were similar and the complete non-rarefied data set was

199 used for the calculation of microbiota diversity.

200 Due to sparsity of the OTU matrix, a Hellinger transformation (Legendre and

201 Gallagher, 2001) was performed by using the vegan R package (Jari et al., 2009) and the

202 relative Hellinger distance was inferred with the decostand command in the vegan R package

203 prior β-diversity analyses. Following Ramette (2007), the resulting Hellinger distance matrix

204 was reduced by running a Principal Coordinates Analysis (PCoA) with the ape R package

205 (Paradis et al., 2004). Because PCoA performed on Hellinger distance matrix based on

206 rarefied data and on Jaccard similarity coefficient matrix led to similar patterns of ordination

207 (Supplementary Figure S3), the PCoA coordinates from the non-rarefied Hellinger distance

208 matrix were retrieved and used for statistical analysis described below.

209 Non-metric multidimensional scaling (NMDS) was also run on the Hellinger distance

210 matrix. However, the values of stress were 0.431 and 0.293 for 2D and 3D NMDS ordination

211 space, respectively. These stress values suggest a lack of fit between the ranks on the NMDS

212 ordination configuration and the ranks in the original distance matrix (Ramette, 2007).

213

214

215

216

10

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

217 Statistical analysis

218 Natural variation for the eight descriptors of microbiota (i.e. species richness, α-diversity

219 Shannon index, first and second PCoA axes) and potential pathobiota microbiota (i.e. species

220 richness, α-diversity Shannon index, first and second PCoA axes) was explored using

221 different mixed models (Supplementary Text). A correction for the number of tests was

222 performed to control the FDR at a nominal level of 5%.

223 In order to study the relationship between microbiota α-diversity and potential

224 pathobiota α-diversity, linear and non-linear regressions were fitted using the ‘lm’ and ‘nls’

225 functions implemented in the R environment, respectively (Supplementary Text). Using a

226 paired sample t-test, model selection was performed by comparing the goodness of fit

227 between linear and non-linear models across the ‘diversity estimate x plant compartment x

228 seasonal group’ combinations. In order to confirm the significance of the humped-back curve

229 observed between diversity estimates (species richness and Shannon α-diversity) of the

230 microbiota and the potential pathobiota, the parameters of the non-linear model were

231 compared to a null distribution of those parameters obtained by creating 100 random

232 microbiota OTU matrices paired with 100 random pathobiota OTU matrices (Supplementary

233 Text).

234 Although pathogen-focused network analysis has been already used for investigating

235 the relationships between whole microbiota and microbial pathogens, this method appears

236 only suitable for studying monospecific interactions between a single pathogenic species and

237 the rest of the microbial community members (Poudel et al., 2016). In order to include higher-

238 order interactions in the study of the relationship between microbiota and potential pathobiota

239 composition, a sparse Partial Least Square Regression (sPLSR) (Lê Cao et al., 2008;

240 Carrascal et al., 2009) was therefore adopted to maximize the covariance between linear

11

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

241 combinations of relative abundances of OTUs from the microbiota (matrix X) and linear

242 combinations of relative abundances of species from the potential pathobiota (matrix Y)

243 (Supplementary Text). Significance of the OTUs included in the linear combinations was

244 estimated by a Jackknife resampling approach by leaving out 10% of the samples 1,000 times

245 (Supplementary Text).

246

247 Data availability

248 The raw FastQ reads were deposited in the Sequence Read Archive (SRA) of NCBI

249 https://trace.ncbi.nlm.nih.gov/Traces/sra/?study=SRP096011 under the study number

250 SRP096011. The filtered matrix for both microbiota and potential pathobiota are available in

251 Supplementary Data 4 and Supplementary Data 5 respectively. Raw values for both α and β-

252 diversity used for statistical analysis are available in Supplementary Data 6.

12

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

253 RESULTS

254 Characterization of the A. thaliana microbiota and potential pathobiota

255 We sampled 163 natural A. thaliana populations chosen to maximize the diversity of

256 habitats encountered by A. thaliana in the Midi-Pyrénées region (Figure 1). Due to differences

257 in germination timing in autumn, about half of the populations were sampled both in autumn

258 and in spring, whereas the other half of populations were sampled only in spring, thereby

259 leading to three seasonal groups of populations, that is ‘autumn’, ‘spring with autumn’ and

260 ‘spring without autumn’.

261 We obtained 18,610,383 high-quality reads across 1,655 samples, with on average

262 ~10,136 reads per sample. After data filtering, we identified 6,627 non-singleton bacterial

263 OTUs. A large amount of these OTUs were specific to roots or leaves, as only ~8.1% OTUs

264 (n = 540 OTUs) were shared between both plant compartments. However, the relative

265 abundance of OTUs shared between leaf and root samples was 20.2 and 16.0 higher than the

266 relative abundance of leaf and root specific OTUs. This suggests that generalists OTUs are

267 dominant members of the Arabidopsis thaliana microbiota.

268 As commonly observed in A. thaliana and other plant species (Lundberg et al., 2012;

269 Bulgarelli et al., 2013; Horton et al., 2014; Coleman-Derr et al., 2016; Wagner et al., 2016),

270 bacterial communities were largely dominated by (> 80%). At the order level,

271 (29.3%) and Sphingomonodales (27.9%) were dominant (Figure 2a). In

272 comparison with autumn, samples collected in spring were enriched for Burkholderiales and

273 depleted for Sphingomonodales in the root compartment (Figure 2a). Germination timing in

274 autumn did not impact the relative abundance of bacterial order in plants collected in spring

275 (Figure 2a).

276 Across all samples, we identified 9 bacterial species described as phytopathogenic by

277 the International Society of Plant Pathology Committee on the Taxonomy of Plant Pathogenic

13

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

278 Bacteria (Bull et al., 2010, 2014) (Figure 2b). Among them, the three most abundant species

279 representing more than 72% of the whole potential pathobiota were X. campestris (31.9%), P.

280 viridiflava (29.2%) and P. agglomerans (11.7%) (Figure 2b). For the seasonal groups

281 ‘autumn’ and ‘spring w/autumn’, we observed a depletion of X. campestris and an enrichment

282 of S. scabiei in the root compartment in comparison with the leaf compartment (Figure 2b). In

283 the leaf compartment, an enrichment of P. viridiflava between autumn and spring was

284 associated with a depletion of X. campestris (Figure 2b). Similarly to the microbiota, no effect

285 of germination timing in autumn was observed on the relative abundance of the potential

286 pathogenic species for plants collected in spring (Figure 2b).

287 It is noteworthy that the relationship between the total relative abundance among

288 samples and the prevalence among populations was weak for some potential pathogenic

289 bacterial species (Figure 2c). For example, although the total relative abundance of P.

290 syringae and S. scabiei among all samples ranged from 3.4% to 5.5% respectively (Figure

291 2b), these two species were present on average in more than 31% of the populations (Figure

292 2c). Visual inspection of original data suggests that this pattern is mainly explained by the

293 presence of few highly infected plants in many populations.

294 To confirm the pathogenic behaviour of the potential pathobiota, representative

295 bacterial strains of the three most abundant species were tested for their pathogenicity on their

296 natural host A. thaliana and on a non-host plant (tobacco). For the P. syringae complex

297 (including both P. syringae sensu stricto and P. viridiflava; Supplementary Text), we isolated

298 97 strains (74 P. viridiflava strains and 23 P. syringae sensu stricto strains; Supplementary

299 Text, Supplementary Data 3, Supplementary Figure S4). Their pathogenicity was assessed

300 based on in planta bacterial growth and disease symptoms in A. thaliana and a Hypersensitive

301 Response (HR) test on tobacco (Supplementary Text). We found that: i) 84 strains of the P.

302 syringae complex induced a HR on tobacco (Supplementary Text, Supplementary Data 3), ii)

14

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

303 all the four strains tested for in planta growth were able to reach a population size of 106

304 CFU.cm-2 7 days post inoculation on A. thaliana (Supplementary Figure S5a, Supplementary

305 Table S2), and iii) seven out of eight strains tested were able to induce disease on at least one

306 of the eight A. thaliana local accessions tested (Supplementary Text, Supplementary Figure

307 S5b, S6, S7, Supplementary Table S3). For X. campestris, 62 strains were isolated

308 (Supplementary Text, Supplementary Figure S8) and all of them induced disease symptoms

309 on the A. thaliana Kas-1 accession (Supplementary Figure S9), which is susceptible to most

310 X. campestris strains isolated from crops (Huard-Chauveau et al., 2013). In addition, 58 of the

311 62 X. campestris induced a HR on tobacco (Supplementary Text, Supplementary Data 3). For

312 P. agglomerans, we isolated a single strain (Supplementary Text, Supplementary Figure S10)

313 that was able to induce disease symptoms on all the 23 A. thaliana local accessions tested

314 (Supplementary Text, Supplementary Figure S11, S12, Supplementary Table S4). Taken

315 together, these results support that most of the strains identified here as part of the A. thaliana

316 potential pathobiota have a pathogenic behavior on A. thaliana. In addition, the relative

317 abundance of the potential pathobiota was significantly higher in A. thaliana individuals with

318 visible disease symptoms (4.5%) than in asymptomatic A. thaliana individuals (1.6%) when

319 sampled in situ (general linear model, F = 26.05, P < 0.001) (Supplementary Figure S13),

320 strengthening the potential pathogenic behavior of the pathobiota identified in this study.

321

322 Alpha-diversity of the A. thaliana microbiota and potential pathobiota

323 We investigated whether the level of α-diversity (species richness and Shannon index)

324 estimated for each sample was driven by the effects of season, plant compartment and

325 population. Statistical results globally led to similar biological conclusions between species

326 richness and Shannon index (Supplementary Tables S5 - S12). Bacterial communities of the

327 microbiota were on average less diverse in roots than in leaves, in particular in autumn

15

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

328 (Figure 3a, Supplementary Tables S5, S6, S7). For the potential pathobiota, a similar pattern

329 was observed in autumn but not in spring where the potential pathogenic communities were as

330 diverse in roots as in leaves (Figure 3b, Supplementary Tables S5, S6, S7). More importantly,

331 strong differences among populations in the dynamics of α-diversity between autumn and

332 spring were observed for the microbiota and to a lesser extent for the potential pathobiota

333 (Figure 4, Supplementary Tables S5, S6).

334 Whatever the seasonal group considered, variation in α-diversity of the microbiota was

335 first explained by differences among populations (~16.8% for species richness and ~26.2%

336 for Shannon index) (Figure 4, Supplementary Tables S8 -S11). In contrast, the main source of

337 variation in the α-diversity of the potential pathobiota largely differs between the two seasons

338 (Supplementary Tables S8 -S11). Variation in α-diversity of the pathobiota in autumn and

339 spring was first explained by the factor ‘plant compartment’ (~12.5%) and the factor

340 ‘population’ (~13.8%), respectively (Supplementary Table S11). At the ‘plant compartment ×

341 seasonal group’ level, the level of differentiation among populations for Shannon index was

342 on average almost twice higher for the microbiota than for the potential pathobiota (~38.3%

343 vs ~14.5% of variance explained by the factor ‘population’) (Supplementary Table S12) .

344 No effect of germination timing in autumn was observed on the microbiota and

345 pathobiota α-diversity of plants collected in spring (Supplementary Table S7, S11).

346

347 Relationships between microbiota and potential pathobiota α-diversity: testing for the

348 invasion paradox

349 By considering all samples, we observed a highly significant humped-back

350 relationship between the species richness of the potential pathobiota and the species richness

351 of the microbiota (Figure 5a, Supplementary Table S13, Supplementary Figure S14). This

352 humped-back relationship was robust whatever the ‘seasonal group × plant compartment’

16

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

353 considered (Figure 5b, 5c, Supplementary Table S13). A similar humped-back relationship

354 was observed when considering Shannon index instead of species richness, with the exception

355 of the root compartment in the seasonal group ‘spring w/ autumn’ (Figure 5d, 5e, 5f,

356 Supplementary Table S13). This humped-back relationship indicates that a poorly diversified

357 potential pathobiota was associated with either a highly or a poorly diversified microbiota,

358 whereas a highly diversified potential pathobiota was found in presence of microbiota with an

359 intermediate level of diversity. Altogether, these results are in line with theoretical

360 expectations of the invasion paradox (Supplementary Figure S1).

361

362 Composition and structure of A. thaliana microbiota and potential pathobiota

363 For the microbiota, the first two PCoA axes explained 20% of the β-diversity (Figure

364 6). The microbiota was structured according to a pattern of two perpendicular branches

365 (Figure 6a). The abundance variation of the most abundant OTU, i.e. an OTU belonging to the

366 genus Sphingomonas, explained up to 20.7% of the variation along the first branch (Figure

367 6b).The abundance variation of the second most abundant OTU (an OTU belonging to the

368 family Oxalobacteraceae), was related to the variation along the second branch (Figures 6c).

369 For the potential pathobiota structured according to a pattern of three branches (Figure 6d),

370 the first two PCoA axes explained 37% of the β-diversity (Figure 6d). The variation along two

371 of these branches was related strongly but independently to the abundance variations of P.

372 viridiflava and X. campestris (Figures 6e, 6f).

373 We observed strong differences among populations for the dynamics of β-diversity of

374 the microbiota between autumn and spring, with the factor ‘season × population’ explaining

375 53.8% and 43.2% of the variation along the first and second PCoA axes, respectively

376 (Supplementary Figure S15, Supplementary Tables S5,S6). The dynamics of β-diversity of

377 the potential pathobiota between autumn and spring was also dependent on the considered

17

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

378 population but to a much lesser extent than what was observed for the microbiota β-diversity

379 (i.e. 9.8% and 7.6% of the variation along the first and second PCoA axes, respectively)

380 (Supplementary Tables S5, S6).

381 Within each seasonal group, the variation of microbiota along the PCoA axes was

382 largely explained by differences among populations (up to ~81%) (Supplementary Figure

383 S16, Supplementary Tables S8 – S11), while the variation of the potential pathobiota β-

384 diversity was first explained by the factors ‘population’ (~16.2%) and ‘plant compartment’

385 (~12.6%) when considering the first and second PCoA axis, respectively (Supplementary

386 Table S11). At the ‘plant compartment × seasonal group’ level, the level of differentiation

387 among populations for β-diversity was on average higher for the microbiota than for the

388 potential pathobiota (~76.5% vs ~10.8% of the first PCoA axis variance explained by the

389 factor ‘population’) (Supplementary Table S12, Supplementary Figure S16).

390 An effect of germination timing in autumn on the β-diversity of plants collected in

391 spring was not observed on microbiota but on potential pathobiota. In the leaf compartment,

392 variation along the first PCoA axis was driven by differences among populations with early

393 autumn germinants (i.e. ‘spring with autumn’ season group), while variation along the second

394 PCoA axis was driven by differences among populations with late autumn germinants (i.e.

395 ‘spring without autumn’ season group) (Supplementary Table S12, Supplementary Figure

396 S16).

397

398 Relationship between microbiota and potential pathobiota β-diversity

399 For each ‘season group × plant compartment’ combination, a non-negligible

400 percentage of variation of the potential pathobiota composition (16.9% on average) was

401 explained by a minor fraction of the variation of the microbiota composition (3.3% on

402 average) (Figure 7). The identity of the microbiota OTUs associated with variation of the

18

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

403 potential pathobiota composition largely differs between seasons, plant compartment (in

404 particular in autumn) and germination cohorts in autumn (Figures 7 and 8). Among the 17

405 candidate OTUs of the microbiota, the fifth most abundant microbiota OTU (Pseudomonas

406 moraviensis) was the most prevalent microbiota OTU across the six ‘season group × plant

407 compartment’ combinations. However, a large fraction of the candidate microbiota OTUs

408 (58.8%) corresponds to unclassified OTUs at the order level.

409

410 DISCUSSION

411 The characterization of the A. thaliana potential pathobiota in an ecological context

412 Several evidences suggest that the potential pathobiota was well characterized in our

413 set of 163 natural populations of A. thaliana, thereby allowing investigating the bacterial

414 microbiota-potential pathobiota relationships. First, according to the gyrB community

415 profiling, X. campestris and P. viridiflava were the two most abundant species of the A.

416 thaliana potential pathobiota. These observations are congruent with a previous bacterial

417 community profiling obtained with a 16S rRNA region of 1.4kb from a field experiment on

418 wild-type and mutant lines of A. thaliana (Kniskern et al., 2007). Second, we detected in situ

419 a strong relationship between the presence of disease symptoms and the relative abundance of

420 the potential pathobiota. Third, pathogenicity tests on both host and non-host plants confirmed

421 that most of the strains belonging to the three most abundant species - composing almost three

422 quarters of the whole potential pathobiota - have a pathogenic behavior. Finally, we found a

423 significant and strong positive relationship between relative abundance and α-diversity of the

424 pathobiota (Supplementary Table S14). The latter result suggests that plants are more often

425 infected by consortia of pathogenic species than by a single pathogenic lineage, which is in

426 line with similar observations obtained in humans and animals (Faust et al., 2012). For

19

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

427 example, multiple closely related Borrelia genospecies have been found to display a positive

428 co-occurrence in ticks (Herrmann et al., 2013). Studies on plant pathogens also suggest that

429 co-infection is a frequent process mediated by niche functionality of the colonizing species

430 (Lamichhane and Ventury, 2015). More broadly, primary infecting pathogenic lineages can

431 open the route for opportunistic lineages, as previously demonstrated for brassica black rot

432 disease in which leaf infection by Xanthomonas is followed by the infection of a soft-rotting

433 bacterium responsible for the rotting disease (Williams, 1980).

434

435 Putting the characterization of bacterial communities in an ecological genomics

436 framework

437 In this study, we found that the in situ microbiota and potential pathobiota of A.

438 thaliana were affected by the combined effects of season, plant compartment and population.

439 In particular, we observed a strong dynamics in community succession of the microbiota

440 between seasons, reinforcing the need to study the dynamics of bacterial communities over

441 the entire plant life cycle (from seed to seed) across a large range of native habitats (Roux and

442 Bergelson, 2016). In addition, our study revealed that seasonal community succession of the

443 microbiota largely differed among the 163 populations. Because α- and β-diversity varied at a

444 very small geographic scale (Supplementary Figure S17), understanding among-population

445 variation of seasonal community succession will require a thorough characterization of abiotic

446 and biotic factors known to influence microbiota variation in plant species, such as soil

447 conditions (Lundberg et al., 2012; Bulgarelli et al., 2013), micro-local climatic conditions

448 (Classen et al., 2015) and plant community composition (Aleklett et al., 2015; Geremia et al.,

449 2016).

20

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

450 In contrast to the microbiota, we found that plant compartment mainly influenced the

451 composition of the potential pathobiota as well as the relative abundance of the most abundant

452 pathogenic species. Our results are in accordance with previous studies reporting that

453 pathogenic species such as P. syringae sensu lato and X. campestris evolved specific

454 strategies (e.g. entry by stomata, hydatodes and wounds) to infect the leaf compartment in a

455 wide range of crops (Mansfield et al., 2012). On the other hand, the most abundant OTUs of

456 the microbiota were shared between leaves and roots. In agreement, the use of a gnotobiotic

457 A. thaliana plant system allows to demonstrate potential reciprocal relocation between root

458 and leaf microbiota members (Bai et al., 2015). In the same study, whole-genome sequencing

459 and functional analysis of bacteria associated with both leaves and roots of A. thaliana

460 highlighted a clear taxonomy and functional overlap of the bacterial populations inhabiting

461 this plant species (Bai et al., 2015). Altogether, these results are in contrast with previous

462 studies on human microbiota demonstrating a remarkable organization of microbes into body

463 site niches (Faust et al., 2012; Belkaid and Hand, 2014). This discrepancy might originate

464 from a lack of specialized plant tissues constituting strictly specialized niches for commensal

465 bacteria as observed in human organs.

466

467 The invasion paradox is mediated by distinct microbiota composition between seasons

468 and plant organs

469 In agreement with theoretical expectations on the invasion paradox (Levine and

470 D’Antonio, 1999; Fridley et al., 2007; Tomasetto et al., 2013), we observed a humped-back

471 relationship between potential pathobiota α-diversity and microbiota α-diversity. Such a

472 pattern may have been observed due to the large range of habitats where A. thaliana plants

473 have been collected, thereby increasing the range of dimensionality of in planta ecological

21

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

474 niches available for microbes. Besides the interplay between diversity and niche

475 dimensionality, other hypotheses can contribute to the explanation of the humped-back

476 relationship. An increase in microbiota diversity can be associated with an increase of

477 antagonistic (e.g. production of anti-microbial compounds) and predator (e.g. grazing) species

478 (Mallon et al., 2015b), leading to a negative relationship between potential pathobiota α-

479 diversity and microbiota α-diversity. Concerning the positive relationship between poor

480 diversified potential pathobiota and microbiota, several non-exclusive hypotheses can be

481 advanced. Firstly, ecological disturbance can increase the random establishment of a single

482 pathogenic species with negative consequences on native species diversity (Kinnunen et al.,

483 2016). Secondly, by the producing virulence proteins, a pathogen can exclusively invade a

484 given plant compartment (Hacquard et al., 2017). Thirdly, both microbiota and potential

485 pathobiota can be poorly diversified because other microbial communities (such as fungal and

486 oomycete communities) exploit most of the resources available in the plant, leading to niche

487 inter-kingdom competition (Agler et al., 2016).

488 Interestingly, the pattern of invasion paradox was robust between compartments but

489 also between seasons, despite a strong seasonal community succession of the microbiota in

490 most populations. While this observation reinforces the importance of α-diversity in A.

491 thaliana populations to maintain a microbiota balance to prevent pathogen occurrence, it also

492 suggests that the potential pathobiota composition was associated with season-specific

493 combinations of microbiota OTUs. Accordingly, the identity of the microbiota OTUs

494 associated with variation of the potential pathobiota composition largely differs between

495 seasons. This dynamics in the potential biomarkers (i.e. specific bacteria taxa preventing

496 pathogen spread) controlling pathogen’s invasion may result from niche overlap and other

497 mechanisms regulating microbe-microbe interactions that drastically influence the bacterial

498 composition. A better understanding of the processes underlying the observed invasion

22

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

499 paradox will require the isolation of a large number of strains representative of the microbiota

500 found in the 163 A. thaliana populations. This step will especially be relevant for the

501 numerous microbiota OTUs that are unclassified at the order level but associated with

502 variation of the potential pathobiota composition.

503 However, we should acknowledge that a substantial fraction of the variation of the A.

504 thaliana pathobiota was not explained by the microbiota variation. This fraction of

505 unexplained pathobiota variation may result from (i) habitat-specific combinations of

506 bacterial species controlling the potential pathobiota, (ii) the control of bacterial pathogen

507 species by fungal or oomycete microbes (Agler et al., 2016), and (iii) the genetics of A.

508 thaliana shaping natural variation in both pathogen’s abundance and prevalence (Roux &

509 Bergelson, 2016). Teasing apart the relative roles of these putative factors will require a

510 thorough complementary ecological and genomic characterization of our natural populations

511 of A. thaliana.

512

513 ACKNOWLEDGEMENTS

514 We are also grateful to the staff of the LIPM greenhouse for their assistance during the growth

515 chamber experiments. This work was funded by the Région Midi-Pyrénées (CLIMARES

516 project), the LABEX TULIP (ANR-10-LABX-41, ANR-11-IDEX-0002-02) and the

517 Métaprogramme MEM (INRA, Metabar programme).

518

519 CONFLICT OF INTEREST

520 The authors declare no conflict of interest.

23

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

521 REFERENCES

522 Agler MT, Ruhe J, Kroll S, Morhenn C, Kim S-T, Weigel D et al. (2016). Microbial hub taxa 523 link host and abiotic factors to plant microbiome wariation. PLoS Biol 14: e1002352.

524 Aleklett K, Leff JW, Fierer N, Hart M (2015). Wild plant species growing closely connected 525 in a subalpine meadow host distinct root-associated bacterial communities. Peer J 3: e804.

526 Bai Y, Müller DB, Srinivas G, Garrido-Oter R, Potthoff E, Rott M et al. (2015). Functional 527 overlap of the Arabidopsis leaf and root microbiota. Nature 528: 364–369.

528 Barret M, Briand M, Bonneau S, Préveaux A, Valière S, Bouchez O et al. (2015). Emergence 529 shapes the structure of the seed microbiota. Appl Environmen Microbiol 81: 1257–1266.

530 Bartoli C, Roux F (2017). Genome-wide association studies in plant pathosystems: toward an 531 ecological genomics approach. Front Plant Sci 8: 763.

532 Bartoli C, Roux F, Lamichhane JR (2016). Molecular mechanisms underlying the emergence 533 of bacterial pathogens: an ecological perspective. Mol Plant Pathol 17: 303-310.

534 Belkaid Y, Hand TW (2014). Role of the microbiota in immunity and inflammation. Cell 157: 535 121–141.

536 Berg G, Grube M, Schloter M, Smalla K (2014). Unraveling the plant microbiome: looking 537 back and future perspectives. Front Microbiol 5: 148.

538 Bodenhausen N, Bortfeld-Miller M, Ackermann M, Vorholt JA (2014). A synthetic 539 community approach reveals plant genotypes affecting the phyllosphere microbiota. PLoS 540 Genet 10: e1004283.

541 Bodenhausen N, Horton MW, Bergelson, J (2013). Bacterial communities associated with the 542 leaves and the roots of Arabidopsis thaliana. PloS One. 8: e56329.

543 Bordenstein SR, Theis KR (2015). Host biology in light of the microbiome: ten principles of 544 holobionts and hologenomes. PLoS Biol 13: 1–23.

545 Bulgarelli D, Schlaeppi K, Spaepen S, Ver Loren van Themaat E, Schulze-Lefert P (2013). 546 Structure and functions of the bacterial microbiota of plants. Annu Rev Plant Biol 64: 807– 547 838.

548 Bull CT, De Boer SH, Denny TP, Firrao G, Fischer-Le Saux et al. (2010). Comprehensive list 549 of names of plant pathogenic bacteria, 1980–2007. J Plant Pathol 92: 551–592.

550 Bull CT, Coutinho TA, Denny TP, Firrao G, Fischer-Le Sau M, Li X et al. (2014). List of 551 new names of plant pathogenic bacteria (2011-2012). J Plant Pathol 96: 223–226.

552 Chaparro JM, Badri DV, Vivanco JM (2014). Rhizosphere microbiome assemblage is 553 affected by development. ISME J 8: 790–803.

554 Classen AT, Sundqvist MK, Henning JA, Newman GS, Moore, JAM, Cregger MA, et al. 555 (2015). Direct and indirect effects of climate change on soil microbial and soil microbial-plant 556 interactions: what lies ahead? Ecosphere 6: 130.

557 Carrascal LM, Galván I, Gordo O (2009). Partial least squares regression as an alternative to

24

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

558 current regression methods used in ecology. Oikos 118: 681–690.

559 Charron CS, Sams CE (2002). Impact of glucosinolate content in broccoli (Brassica oleracea 560 (italic group)) on growth of Pseudomonas marginalis, a causal agent of bacterial soft rot. 561 Plant Dis 86: 629-632.

562 Coleman-Derr D, Desgarennes D, Fonseca-Garcia C, Gross S, Clingenpeel S, Woyke T et al. 563 (2016). Plant compartment and biogeography affect microbiome composition in cultivated 564 and native Agave species. New Phytol 209: 798–811.

565 Copeland JK, Yuan L, Layeghifard M, Wang PW, Guttman DS (2015). Seasonal community 566 succession of the phyllosphere microbiome. Mol Plant Microbe Interact 28: 274–285.

567 Eisenhauer N, Dobies T, Cesarz S, Hobbie SE, Meyer RJ, Worm K et al. (2013b). Plant 568 diversity effects on soil food webs are stronger than those of elevated CO2 and N deposition 569 in a long-term grassland experiment. Proc Natl Acad Sci USA 110: 6889–6894.

570 Eisenhauer N, Schulz W, Scheu S, Jousset A (2013a). Niche dimensionality links biodiversity 571 and invisibility of microbial communities. Funct Ecol 27: 282–288.

572 Faust K, Sathirapongsasuti JF, Izard J, Segata N, Gevers D, Raes J et al. (2012). Microbial co- 573 occurrence relationships in the human microbiome. Plos Computat Biol 8: e1002606.

574 Fridley JD, Stachowicz JJ, Naeem S, Sax DF, Seabloom EW, Smith MD et al. (2007). The 575 invasion paradox: reconciling pattern and process. Ecology 88: 3–17.

576 Gardan L, Stead DE, Dauga C, Gillis M (2003). Acidovorax valerianellae sp. nov., a novel 577 pathogen of lamb’s lettuce [Valerianella locusta (L.) Latter]. Int J Syst Evol Microbiol 53: 578 795-800.

579 Geremia RA, Pușcaș M, Zinger L, Bonneville J-M, Choler P (2016). Contrasting microbial 580 biogeographical patterns between anthropogenic subalpine grasslands and natural alpine 581 grasslands. New Phytol 209: 1196–1207.

582 Hacquard S, Spaepen S, Garrido-Oter R, Schulze-Lefert P (2017). Interplay between innate 583 immunity and the plant microbiota. Annu Rev Phytopathol 55: 565-589.

584 Haney CH, Samuel BS, Bush J, Ausubel FM (2015). Associations with rhizosphere bacteria 585 can confer an adaptive advantage to plants. Nature Plants 1: 15051.

586 Herrman C, Gern L, Voordouw MJ (2013). Species co-occurrence patterns among lyme 587 borreliosis pathogens in the tick vector Ixodes ricinus. Appl Environm Microbiol 79: 7273- 588 7280.

589 Horton MW, Bodenhausen N, Beilsmith K, Meng D, Muegge BD, Subramanian S et al. 590 (2014). Genome-wide association study of Arabidopsis thaliana leaf microbial community. 591 Nature Comm 5: 5320.

592 Huard-Chauveau C, Perchepied L, Debieu M, Rivas S, Kroj T, Kars I, Bergelson J, Roux F, 593 Roby D (2013). An atypical kinase under balancing selection confers broad-spectrum disease 594 resistance in Arabidopsis. Plos Genet 9: e1003766.

595 Innerebner G, Knief C, Vorholt, JA (2011). Protection of Arabidopsis thaliana against leaf- 596 pathogenic Pseudomonas syringae by Sphingomonas strains in a controlled model system.

25

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

597 Appl Environmen Microbiol 77: 3202–3210.

598 Jakob K, Goss EM, Araki H, Van T, Kreitman M, Bergelson J (2002). Pseudomonas 599 viridiflava and P. syringae – natural pathogens of Arabidopsis thaliana. Mol Plant Microb 600 Interact 15: 1195-1203.

601 Jari Oksanen F, Blanchet G, Kindt R, Legendre P, O’Hara RG, Simpson G, et al. (2009). 602 Vegan: community ecology package. R package version 1.16-0. Available at: citeulike- 603 article-id:6657659.

604 Kamada N, Chen GY, Inohara N, Nunez G (2013). Control of pathogens and pathobionts by 605 the gut microbiota. Nature Immunol 14: 685–690.

606 Kniskern JM, Traw MB, Bergelson J (2007). Salicylic acid and jasmonic acid signalling 607 defense pathways reduce natural bacterial diversity on Arabidopsis thaliana. Mol Plant 608 Microb Inter 12: 1512-1522.

609 Kinnunen M, Dechesne A, Proctor C, Hammes F, Johnson D, Quintela-Baluja M, et al. 610 (2016). A conceptual framework for invasion in microbial communities. ISME J 10: 2773– 611 2775.

612 Lamichhane JR, Venturi V (2015). Synergisms between microbial pathogens in plant disease 613 complexes: a growing trend. Front Plant Sci 6: 385.

614 Lê Cao KA, Rossouw D, Robert-Granié C, Besse P (2008). Sparse PLS: variable selection 615 when integrating omics data. Stat Appl Gen Mol Biol 7: 35.

616 Legendre P, Gallagher ED (2001). Ecologically meaningful transformations for ordination of 617 species data. Oecologia 129: 271–280.

618 Lerat S, Simao-Beaunoir AM, Beaulieu C (2009). Genetic and physiological determinants of 619 Stremptomyces scabei pathogenicity. Mol Plant Pathol 10: 579-55.

620 Levine JM, D’Antonio CM (1999). Elton revisited: a review of evidence linking diversity and 621 invasibility. Oikos 87: 15–26.

622 Lincoln SP, Fermor TR, Tindall BJ (1999). Janthinobacterium agaracidamnosum sp. nov., a 623 soft rot pathogen of Agaricus bisporus. Int J Syst Bacteriol 4: 1577-1589.

624 Lundberg DS, Lebeis SL, Paredes SH, Yourstone S, Gehring J, Malfatti S, et al. (2012). 625 Defining the core Arabidopsis thaliana root microbiome. Nature 488: 86–90.

626 Mahé F, Rognes T, Quince C, De Vargas C, Dunthorn M (2015). Swarm v2: highly-scalable 627 and high-resolution amplicon clustering. Peer J 3: e1420.

628 Mallon CA, van Elsas JD, Salles JF (2015a). Microbial invasions: the process, patterns, and 629 mechanisms. Trends Microbiol. 23: 719–729.

630 Mallon CA, Poly F, Le Roux X, Marring I, van Elsas JD, Salles JF (2015b). Resource pulses 631 can alleviate the biodiversity–invasion relationship in soil microbial communities. Ecology, 632 96: 915–926.

633 Mallon CA, Le Roux X, van Doorn GS, Dini-Andreote F, Poly F, Salles JF (2018). The 634 impact of failure: unsuccessful bacterial invasions steer the soil microbial community away 635 from the invader’s niche. ISME J (doi.org/10.1038/s41396-017-0003-y).

26

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

636 Mansfield J, Genin S, Magori S, Citovsky V, Sriariyanum M, Ronald et al. (2012) Top 10 637 plant pathogenic bacteria in molecular plant pathology. Mol Plant Pathol 13: 614–629.

638 Mendes R, Garbeva P, Raaijmakers JM (2013). The rhizosphere microbiome: significance of 639 plant beneficial, plant pathogenic, and human pathogenic microorganisms. FEMS Microbiol 640 37: 634–663.

641 Naeem S, Knops JMH, Tilman D, Howe KM, Kennedy T, Gale S (2000). Plant diversity 642 increases resistance to invasion in the absence of covarying extrinsic factors. Oikos 91: 97– 643 108.

644 Paradis E, Claude J, Strimmer K (2004). APE: Analyses of phylogenetics and evolution in R 645 language. Bioinformatics 20: 289–290.

646 Pham VH, Park SJ, Roh Y, Roh DH, Rhee SK (2009). Diaphorobacter oryzae sp. nov., 647 isolated from a thiosulfate-oxidizing enrichment culture. Int J Syst Evol Microbiol 59: 218- 648 221.

649 Poudel R, Jumpponen A, Schlatter DC, Paulitz TC, Gardener BB, Kinkel LL et al. (2016). 650 Microbiome networks: a systems framework for identifying candidate microbial assemblages 651 for disease management. Phytopathology 106: 1083–1096.

652 Ramette A (2007). Multivariate analyses in microbial ecology. FEMS Microbiol Ecol 62: 653 142–160.

654 Ritpitakphong U, Falquet L, Vimoltust A, Berger A, Métraux J-P, L’Haridon F (2016). The 655 microbiome of the leaf surface of Arabidopsis protects against a fungal pathogen. New Phytol 656 210: 1033–1043.

657 Roux F, Bergelson J (2016). The Genetics underlying natural variation in the biotic 658 interactions of Arabidopsis thaliana: the challenges of linking evolutionary genetics and 659 community ecology. Curr Top Dev Biol 119: 111–156.

660 Santhanam R, Luu VT, Weinhold A, Goldberg J, Oh Y, Baldwin IT (2015). Native root- 661 associated bacteria rescue a plant from a sudden-wilt disease that emerged during continuous 662 cropping. Proc Natl Acad Sci USA 112: 5013-5020.

663 Schlaeppi K, Dombrowski N, Oter RG, Ver Loren van Themaat E, Schulze-Lefert P (2014). 664 Quantitative divergence of the bacterial root microbiota in Arabidopsis thaliana relatives. 665 Proc Natl Acad Sci USA 111: 585–592.

666 Schloss, P. D., Westcott, S. L., Ryabin, T., Hall, J. R., Hartmann, M., Hollister, E. B., et al. 667 (2009). Introducing mothur: open-source, platform-independent, community-supported 668 software for describing and comparing microbial communities. Appl Environ Microbiol 75: 669 7537–7541.

670 Short FL, Murdoch SL, Ryan RP (2014). Polybacterial human disease: the ills of social 671 networking. Trends Microbiol 22: 508–516.

672 Singer M (2010). Pathogen-pathogen interaction. Virulence. 1: 10–18.

673 Tomasetto F, Duncan RP, Hulme PE (2013). Environmental gradients shift the direction of 674 the relationship between native and alien plant species richness. Divers Distrib 19: 49–59.

27

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

675 Vandenkoornhuyse P, Quaiser A, Duhamel M, Le Van A, Dufresne A (2015). The importance 676 of the microbiome of the plant holobiont. New Phytol 206: 1196–1206. doi: 677 10.1111/nph.13312.

678 Vayssier-Taussat M, Albina E, Citti C, Cosson JF, Jacques MA, Lebrun MH et al. (2014). 679 Shifting the paradigm from pathogens to pathobiome: new concepts in the light of meta- 680 omics. Front Cell Infect Microbiol 4: 29.

681 Wagner MR, Lundberg DS, del Rio TG, Tringe SG, Dangl JL, Mitchell-Olds T (2016). Host 682 genotype and age shape the leaf and root microbiomes of a wild perennial plant. Nature 683 Comm 7: 12151.

684 Wang Q, Garrity GM, Tiedje JM, Cole JR (2007). Naïve Bayesian classifier for rapid 685 assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol 73: 686 5261–5267.

687 Williams PH (1980). Black rot: a continuing. Plant Disease 64: 736-742.

688 Youle M, Knowlton N, Rohwer F, Gordon J, Relman DA (2013). Superorganisms and 689 holobionts. Microbe Mag 8: 152–153.

690

28

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

691 FIGURE LEGENDS

692

693 Figure 1. Plant material. (a) Location of the 163 A. thaliana populations across the Midi-

694 Pyrénées region (southwest of France). The average distance among populations was 99.9 km

695 (median = 92.6 km, SD = 55.4 km). Blue dots represent the 80 populations collected in both

696 autumn and spring, red dots represent the 79 populations collected in spring only and orange

697 dots represent the 4 populations collected in autumn only. The map clearly shows that the

698 range of sampling across the Midi-Pyrénées region was homogeneous during the two

699 sampling seasons. (b) Diversity of habitats encountered by A. thaliana in the Midi-Pyrénées

700 region.

701 Figure 2. Stacked bar plots of the relative abundances of the major bacterial phyla and

702 bacterial species for both microbiota and potential pathobiota of the A. thaliana populations

703 collected in the Midi-Pyrénées region. (a) Stacked barplots representing the relative

704 abundance for the ten most abundant bacterial orders of the microbiota. ‘All samples’ n

705 =1655; Leaf: ‘autumn’ n= 314, ‘spring w/ autumn’ n = 245, ‘spring wo/ autumn’ n = 262;

706 Root: ‘autumn’ n= 309, ‘spring w/ autumn’ n = 267, ‘spring wo/ autumn’ n = 258. In

707 comparison with autumn, samples in spring were enriched for Burkholderiales (χ² = 7.93, P <

708 0.01) and depleted for Sphingomonodales (χ² = 18.18, P < 0.001) but only in the root

709 compartment. (b) Stacked barplots representing the relative abundance of the nine bacterial

710 species representing the potential pathobiota. ‘All samples’ n = 1203; Leaf: ‘autumn’ n = 273,

711 ‘spring w/ autumn ‘ n = 209, ‘spring wo/ autumn’ n = 213; Root: ‘autumn’ n= 180, spring w/

712 autumn’ n = 171, ‘spring wo/autumn’ n = 157. For the seasonal groups ‘autumn’ and ‘spring

713 w/autumn’, we observed a depletion of X. campestris (‘autumn’, χ² = 15.67, P < 0.01; ‘spring

714 with autumn’, χ² = 8.26, P < 0.05) and an enrichment of S. scabiei (‘autumn’, χ² = 18.61 P <

715 0.001; ‘spring with autumn’, χ² = 9.31, P < 0.05) in the root compartment in comparison with

29

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

716 the leaf compartment. In the leaf compartment, an enrichment of P. viridiflava between

717 autumn and spring (χ² =15.20, P < 0.01) was associated with a depletion of X. campestris (χ²

718 = 11.50, P < 0.01). A correction for the number of tests was performed to control the False

719 Discovery Rate at a nominal level of 5%. c) Prevalence (number of populations in which the

720 potential pathogen species was detected) of the nine bacterial species characterizing the

721 potential pathobiota.

722 Figure 3. Violin plots (i.e. box-and-whisker plot overlaid with a kernel density plot)

723 representing the seasonal variation of the α-diversity inferred as Shannon's index for both (a)

724 microbiota and (b) potential pathobiota. Leaf and root samples are represented with a red and

725 blue color scale, respectively. Microbiota: ‘Autumn – Leaf’ n= 314 , ‘Autumn – Root’ n =

726 309, ‘Spring w/ Autumn - Leaf’ n= 245, ‘Spring w/ Autumn - Root’ n= 267,‘Spring w/o

727 Autumn - Leaf’ n= 262, ‘Spring w/o Autumn - Root’ n= 258. Pathobiota: ‘Autumn – Leaf’ n=

728 273 , ‘Autumn – Root’ n = 180, ‘Spring w/ Autumn - Leaf’ n= 209, ‘Spring w/ Autumn -

729 Root’ n= 171,‘Spring w/o Autumn - Leaf’ n= 213, ‘Spring w/o Autumn - Root’ n= 157.

730 Figure 4. Variation among populations in the dynamics of α-diversity between autumn and

731 spring. Each dot corresponds to the mean α-diversity (estimated as BLUPs) of a population.

732 ‘leaf’ n = 74 populations, ‘root’ n = 62 populations.

733 Figure 5. Humped-back relationships between potential pathobiota α-diversity and microbiota

734 α-diversity. (a) Species richness when considering all samples. (b) Species richness when

735 considering samples from the leaf compartment. (c) Species richness when considering

736 samples from the root compartment. (d) Shannon’s index when considering all samples. (e)

737 Shannon’s index when considering samples from the leaf compartment. (f) Shannon’s index

738 when considering samples from the root compartment. The red lines indicate a significant

30

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

739 quadratic relationship, according to the following non-linear model: pathobiota’s diversity ~

740 k*microbiota’s diversity – q*microbiota’s diversity*microbiota’s diversity.

741 Figure 6. Bacterial composition and structure of A. thaliana illustrated by Principal

742 coordinates (PCoA) plots of Hellinger dissimilarity matrices for microbiota (left panels) and

743 potential pathobiota (right panels). n = 1655 samples for the microbiota matrix and n = 1203

744 for the potential pathobiota matrix. (a) PCoA plot of microbiota for all samples indicates that

745 bacterial composition is mainly structured along two axes. The (b) and (c) plots indicate the

746 relationships between the microbiota composition and the abundance of the two most

747 abundant OTUs, i.e. Sphingomona ssp., and Oxalobacteraceae unclassified sp., respectively.

748 (d) PCoA plot of potential pathobiota for all samples indicates that bacterial composition is

749 mainly structured along three axes. The (e) and (f) plots indicate the relationships between the

750 potential pathobiota composition and the abundance of the two most abundant potential

751 pathogenic OTUs, i.e. Pseudomonas viridiflava and Xanthomonas campestris, respectively.

752 Red-to-blue color gradient indicates high-to-low OTU abundance.

753 Figure 7. Relationships between microbiota β-diversity and potential pathobiota β-diversity

754 based on a Sparse Partial Least Square Regression (sPLSR). Only OTUs with a loading value

755 above 0.2 in more than 75% of the 1,000 Jackknife resampled matrices were considered as

756 significant. The color gradient indicates the strength of the loading values for both microbiota

757 OTUs and potential pathobiota OTUs (yellow: 0.2 < loadings < 0.35, orange: 0.35 < loadings

758 < 0.5, red: loadings > 0.5). ‘All’, ‘Aut’, ‘Sw/ A’ and ‘Sw/o A’ stand for all samples and

759 samples from the three seasonal groups (i.e. ‘autumn’ populations, ‘spring w/ autumn’

760 populations and ‘spring w/o autumn’ populations), ‘L’ and ‘R’ stand for leaf and root,

761 respectively. ‘All’ n = 1655; ‘Aut - L’ n= 314, ‘Sw/ A - L’ n = 245, ‘Sw/o A - L’ n = 262;

762 ‘Aut – R’ n= 309, ‘Sw/ A - R’ n = 267, ‘Sw/o A – R’ n = 258.

31

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

763 Figure 8. 2D density plots illustrating the relationships between the relative abundance of the

764 two most abundant bacterial species of the potential pathobiota and the total relative

765 abundance of the microbiota OTUs identified by sPLSR. (a) Example with X. campestris in

766 the seasonal group ‘spring with autumn’ in the leaf compartment (Pearson’s r = -0.15, P =

767 0.0187). (b) Example with P. viridiflava in the seasonal group ‘spring with autumn’ in the

768 root compartment (Pearson’s r = -0.17, P = 0.0006). Red-to-blue color gradient represents

769 high-to-low density gradient.

770

32

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

771 SUPPLEMENTARY FILES 772 773 Supplementary Text: supplementary material and methods and supplementary results. 774 Supplementary Figure S1. The invasion paradox. (A) Species richness of both native and 775 alien species is positive correlated with the quality of the environment. Abiotic factors often 776 have similar effects on biodiversity and invader success. (B) The increasing quality of the 777 environment leads to a positive relationship between species richness of alien species and 778 species richness of native species. (C) Elton’s hypothesis describing a negative association 779 between the increasing of the biodiversity and the success of the invaders. The interplay 780 between resource availability and diversity in determining invasion resistance leads to the 781 well-known phenomenon called ‘invasion paradox’. (D), (E) and (F) indicate application of 782 the theories to microbial communities.

783 Supplementary Figure S2. Distribution of the number of reads (expressed in log10) for the 784 6,627 OTUs obtained after data filtering (top panel) and for the remaining 271,706 OTUs 785 discarded with the filtering (bottom panel). The mean number of reads per OTU maintained 786 after filtering was 51.3 fold higher than the one of the discarded OTUs. 787 Supplementary Figure S3. PCoA perfomed on a Hellinger distance matrix based on rarefied 788 data (top panel) and on a Jaccard similarity coefficient matrix distance (bottom panel). 789 Supplementary Figure S4. Phylogenetic tree inferred with a Neighbor Joining (NJ) model 790 (3000 bootstrap repetitions) and based on the cts sequences (350bp) of the 97 P. syringae 791 strains isolated from the 163 A. thaliana populations. Bootstrap values are shown at each node 792 and names of the strain at each branch. All the strains belong to the P. syringae complex and 793 are mainly distributed in the phylogroups 7, 9, 1, 2, 11 and 13. Reference P. syringae strains 794 representative of the 13 phylogroups are labeled in green. Phylogroup affiliation was based on 795 the previous work from Berge et al. (2014). 796 Supplementary Figure S5. Pathogenicity of Pseudomonas sp. strains isolated from the 163 797 A. thaliana populations of the region Midi-Pyrénées. (A) Mean bacterial growth across eight 798 accessions collected in the region Midi-Pyrénées for two strains of P. viridiflava (0114-Psy- 799 NAUV-BL and 0124-Psy-SAUB-AL) and two strains of P. syringae sensu stricto (0132-Psy- 800 BAZI-AL and 0143-Psy-THOM-AL). (B) Illustration of symptoms observed three days after 801 inoculation for two strains of P. syringae. Presence and absence of symptoms was observed 802 on each of the eight accessions collected in the region Midi-Pyrénées for the strains 0111-Psy- 803 RAYR-BL and 0117-Psy-NAZA-AL, respectively. D0, D3, D5 and D7 indicate the number 804 of days after inoculation.

33

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

805 Supplementary Figure S6. Genetic variation among five accessions from the Midi-Pyrénées 806 region for the response to the P. viridiflava strain 0124-Psy-SAUB-AL, three days after 807 infiltration. Mock: infiltration with water. 808 Supplementary Figure S7. Heatmap for the symptoms observed three days after inoculation 809 with P. syringae strains. The heatmap shows the interactions between eight natural accessions 810 of A. thaliana from the region Midi-Pyrénées and eight natural strains belonging to the P. 811 syringae complex collected in the same populations than the eight natural accessions. 812 Supplementary Figure S8. Phylogenetic tree inferred with a Neighbor Joining (NJ) model 813 (3000 bootstrap repetitions) and based on the gyrB sequences (280 bp) of the 62 Xanthomonas 814 campestris strains isolated from the 163 A. thaliana populations. Bootstrap values are shown 815 at each node and names of the strain at each branch. All the strains belong to the X. 816 campestris group. Reference Xanthomonas strains are labeled in blue. Sequences for the 817 reference strains were downloaded on GenBank.

818 Supplementary Figure S9. Distribution of disease symptoms observed on the accession Kas- 819 1 among the 62 Xanthomonas campestris strains isolated from the 163 A. thaliana 820 populations. The arrows indicate disease symptoms for the accessions Col-0 and Kas-1 and 821 the mutant rks1-1 with the X. campestris control strain Xcc568 (Huard-Chauveau et al., 2013).

822 Supplementary Figure S10. Phylogenetic tree inferred with a Neighbor Joining (NJ) model 823 (3000 bootstrap repetitions) and based on the gyrB sequences (290 bp) of the Pantoea 824 agglomerans strain (labeled in green) isolated from the MONTI-DL population. Bootstrap 825 values are shown at each node and names of the strain at each branch. All the strains belong to 826 the X. campestris group. Reference Pantoea sequences were obtained from JGI.

827 Supplementary Figure S11. Pathogenicity of a strain of P. agglomerans (0001-Pag-MONTI- 828 DL) isolated from the MONTI-D population of A. thaliana population located in the Midi- 829 Pyrénées region. Left panel: Illustration of symptoms observed five days after inoculation on 830 the reference accession Col-0. The two leaves on the left were inoculated on both sides of the 831 midrib while only one side of the midrib has been inoculated for the three leaves on the right. 832 Right panel: genetic variation observed among natural accessions collected in the Midi- 833 Pyrénées region. Symptoms showed in the picture were obtained five days after inoculation of 834 the P. agglomerans strain on three A. thaliana accessions of the Midi-Pyrénées region 835 (MONTI-A-17, NAUV-B-14 and VILLEM-A-12). As illustrated, the 0001-Pag-MONTI-DL 836 was able to induce disease symptoms but with different severity among the three accessions.

34

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

837 Supplementary Figure S12. Dynamics of disease symptoms across 23 accessions of the 838 region Midi-Pyrénées inoculated with the strain 0001-Pag-MONTI-DL of P. agglomerans. 839 dpi: days post-inoculation.

840 Supplementary Figure S13. Box-and-whisker plot illustrating the significant difference of 841 the relative abundance of the pathobiota in the leaf compartment between plants with visible 842 disease symptoms - represented by the blue box (n = 424 samples) - and asymptomatic plants 843 - represented by the red box (n = 396 samples) - when sampled in situ.

844 Supplementary Figure S14. Comparison of the parameters ‘k’ and ‘q’ of the non-linear model 845 (pathobiota’s diversity ~ k*microbiota’s diversity – q*microbiota’s diversity*microbiota’s 846 diversity) run on the raw data (red arrows) to a null distribution of those parameters obtained by 847 creating 100 random microbiota OTU matrices paired with 100 random pathobiota OTU 848 matrices.

849 Supplementary Figure S15. Variation among populations in the dynamics of β-diversity 850 (PCoA, first axis) between autumn and spring. Each dot corresponds to the mean β-diversity 851 (estimated as BLUPs) of a population. ‘leaf’ n = 74 populations, ‘root’ n = 78 populations..

852 Supplementary Figure S16. Percentage of variance of the β-diversity among populations. 853 Red and blue bars correspond to microbiota and potential pathobiota, respectively. A 854 correction for the number of tests was performed to control the FDR at a nominal level of 5%: 855 ns non-significant, ** 0.01>P> 0.001, *** P < 0.001. 856 Supplementary Figure S17. Geographic variation of α-diversity (inferred as Shannon's 857 index) and β -diversity (when considering the first axis of the PCoA) for the microbiota. For 858 each ‘seasonal group x plant compartment’ combination, blue, green, yellow and red dots 859 correspond to populations from the 1st, 2nd, 3rd and 4th quartiles of either the α-diversity of the 860 β -diversity distribution (based on population BLUPs). Values indicate the percentage of α- 861 diversity and β -diversity variation among populations. Significance after a FDR correction at 862 a nominal level of 5%: ** 0.01 > P > 0.001, *** P < 0.001. Number of populations: ‘Autumn 863 – Leaf’ n= 82, ‘Autumn – Root’ n = 82, ‘Spring w/ Autumn - Leaf’ n= 76, ‘Spring w/ Autumn 864 - Root’ n= 79,‘Spring w/o Autumn - Leaf’ n= 80, ‘Spring w/o Autumn - Root’ n= 80.

865 Supplementary Table S1. Names and GPS coordinates (expressed in degrees) of the 163 866 populations.

35

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

867 Supplementary Table S2. In planta bacterial growth of four natural Pseudomonas strains in 868 four corresponding local natural populations of A. thaliana, each represented by two 869 accessions. 870 Supplementary Table S3. Natural interactions between eight local natural A. thaliana 871 accessions and eight corresponding local natural Pseudomonas syringae strains for disease 872 symptom evolution. 873 Supplementary Table S4. Genetic variation among 23 accessions of the region Midi- 874 Pyrénées for the response to the strain 0001-Pag-MONTI-DL of P. agglomerans at four time 875 points after inoculation (dpi, days post-inoculation). H²: broad-sense heritability. 876 Supplementary Table S5. Natural variation of α-diversity and β-diversity estimators of 877 microbiota and potential pathobiota in natural populations of A. thaliana collected in both 878 autumn and spring. 879 Supplementary Table S6. Percentage of variance of α-diversity and β-diversity estimators of 880 microbiota and potential pathobiota explained by the factors ‘seasons’, ‘plant compartment’, 881 ‘population’ and their interactions in natural populations of A. thaliana collected both in 882 autumn and spring. 883 Supplementary Table S7. Natural variation of α-diversity and β-diversity estimators of 884 microbiota and potential pathobiota in all the natural populations of A. thaliana collected in 885 spring. 886 Supplementary Table S8. Natural variation of α-diversity and β-diversity estimators of 887 microbiota and potential pathobiota in the natural populations of A. thaliana collected in the 888 seasonal group ‘autumn’. 889 Supplementary Table S9. Natural variation of α-diversity and β-diversity estimators of 890 microbiota and potential pathobiota in the natural populations of A. thaliana collected in the 891 seasonal group ‘spring with autumn’. 892 Supplementary Table S10. Natural variation of α-diversity and β-diversity estimators of 893 microbiota and potential pathobiota in the natural populations of A. thaliana collected in the 894 seasonal group ‘spring without autumn’ 895 Supplementary Table S11. Percentage of variance of α-diversity and β-diversity estimators 896 of microbiota and pathobiota explained by the factors ‘population’, ‘plant compartment’ and 897 their interactions in natural populations of A. thaliana collected either in autumn or ins spring. 898 Supplementary Table S12. Natural variation of α-diversity and β-diversity estimators of 899 microbiota and potential pathobiota for each ‘seasonal group x plant compartment’ 900 combination.

36

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

901 Supplementary Table S13. Comparison of the fit between a linear model and a non-linear 902 model on the relationship between the α-diversity estimators (species richness and Shannon 903 index) of the microbiota and the α-diversity estimators (species richness and Shannon index) 904 of the potential pathobiota. 905 Supplementary Table S14. Relationships between the relative abundance of pathobiota and 906 α-diversity estimators (species richness and Shannon index) of the microbiota and potential 907 pathobiota. 908 909 Supplementary Data 1. Validation of the gyrB gene marker. 910 Supplementary Data 2. List of pathogenic bacteria used to determine the potential 911 pathobiota of the 163 A. thaliana populations. 912 Supplementary Data 3. Strains belonging to the P. syringae complex, X. campestris and P. 913 agglomerans isolated from the 163 A. thaliana populations for the validation of the potential 914 pathobiota. Housekeeping gene sequences used for phylogeny are available in the data set. 915 Supplementary Data 4. Abundance matrix obtained after data filtering for the whole 916 microbiota. 917 Supplementary Data 5. Abundance matrix for the potential pathobiota. 918 Supplementary Data 6. Raw values for both α and β diversity used in the statistical analysis 919 of natural variation of microbiota and potential pathobiota.

37

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

920 921

922 923 924 925 926 Figure 1 927 928 929 930 931 932

38

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

933

934 935 936 937 938 939 Figure 2

39

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

940 941

942 943 944 945 946 Figure 3 947 948 949 950 951 952 953 954 955 956 957 958 959 960 961

40

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

962

963 964 965 Figure 4 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983

41

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

984 985 986 Figure 5 987 988 989 990 991 992

42

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

993 994 995 Figure 6

43

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

996

a. microbiota Aut S w/ A S w/o A All Order Family OTU L R L R L R Pseudomonadales Pseudomonadaceae Otu0000005 unclassified unclassified Otu0000006 unclassified unclassified Otu0000009 unclassified unclassified Otu0000011 unclassified unclassified Otu0000016 unclassified unclassified Otu0000017 unclassified unclassified Otu0000027 Pseudomonadales Pseudomonadaceae Otu0000028 unclassified unclassified Otu0000051 unclassified unclassified Otu0000063 Actinomycetales Geodermatophilaceae Otu0000083 Burkholderiales Otu0000093 unclassified unclassified Otu0000100 Nitrospirales Nitrospiraceae Otu0000225 Burkholderiales unclassified Otu0000346 unclassified unclassified Otu0000399 Burkholderiales Oxalobacteraceae Otu0000757 Portion of variance explained (%) 5.88 1.64 1.44 3.97 4.44 3.49 4.95

b. potential pathobiota Aut S w/ A S w/o A All Species name L R L R L R Acidovorax valerianellae Pantoea agglomerans Pseudomonas syringae Pseudomonas viridiflava Sphingomonas melonis Streptomyces scabiei Xanthomonas campestris 997 Portion of variance explained (%) 17.11 14.90 15.40 17.60 19.34 17.31 16.93 998 999 1000 Figure 7 1001 1002 1003 1004

44

bioRxiv preprint doi: https://doi.org/10.1101/261602; this version posted February 7, 2018. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.

1005

1006 1007 1008 Figure 8

45