Population Genomics of Sinorhizobium Medicae Based On
Total Page:16
File Type:pdf, Size:1020Kb
Population genomics of Sinorhizobium medicae based on low-coverage sequencing of sympatric isolates Xavier Bailly, Elisa Giuntini, Connor M Sexton, Ryan Pj Lower, Peter W Harrison, Nitin Kumar, J Peter W Young To cite this version: Xavier Bailly, Elisa Giuntini, Connor M Sexton, Ryan Pj Lower, Peter W Harrison, et al.. Popu- lation genomics of Sinorhizobium medicae based on low-coverage sequencing of sympatric isolates. ISME Journal, Nature Publishing Group, 2011, 5 (11), pp.1722-1734. 10.1038/ismej.2011.55. hal- 02652397 HAL Id: hal-02652397 https://hal.inrae.fr/hal-02652397 Submitted on 29 May 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. The ISME Journal (2011) 5, 1722–1734 & 2011 International Society for Microbial Ecology All rights reserved 1751-7362/11 www.nature.com/ismej ORIGINAL ARTICLE Population genomics of Sinorhizobium medicae based on low-coverage sequencing of sympatric isolates Xavier Bailly1, Elisa Giuntini, M Connor Sexton, Ryan PJ Lower, Peter W Harrison, Nitin Kumar and J Peter W Young Department of Biology, University of York, York, UK We investigated the genomic diversity of a local population of the symbiotic bacterium Sinorhizobium medicae, isolated from the roots of wild Medicago lupulina plants, in order to assess genomic diversity, to identify genomic regions influenced by duplication, deletion or strong selection, and to explore the composition of the pan-genome. Partial genome sequences of 12 isolates were obtained by Roche 454 shotgun sequencing (average 5.3 Mb per isolate) and compared with the published sequence of S. medicae WSM 419. Homologous recombination appears to have less impact on the polymorphism patterns of the chromosome than on the chromid pSMED01 and megaplasmid pSMED02. Moreover, pSMED02 is a hot spot of insertions and deletions. The whole chromosome is characterized by low sequence polymorphism, consistent with the high density of housekeeping genes. Similarly, the level of polymorphism of symbiosis genes (low) and of genes involved in polysaccharide synthesis (high) may reflect different selection. Finally, some isolates carry genes that may confer adaptations that S. medicae WSM 419 lacks, including homologues of genes encoding rhizobitoxine synthesis, iron uptake, response to autoinducer-2, and synthesis of distinct polysaccharides. The presence or absence of these genes was confirmed by PCR in each of these 12 isolates and a further 27 isolates from the same population. All isolates had rhizobitoxine genes, while the other genes were co-distributed, suggesting that they may be on the same mobile element. These results are discussed in relation to the ecology of Medicago symbionts and in the perspective of population genomics studies. The ISME Journal (2011) 5, 1722–1734; doi:10.1038/ismej.2011.55; published online 12 May 2011 Subject Category: evolutionary genetics Keywords: population genomics; genome evolution; horizontal gene pool; population structure; Sinorhizobium medicae Introduction sample of a defined local population. Comparison of such genomes has revealed extensive polymorphism There have been many studies of genetic diversity affecting both genome content and gene sequences within a bacterial species, and the majority have (Tettelin et al., 2005; Lefebure and Stanhope, 2007). adopted one of three approaches. Innumerable ‘clas- Based on such observations, the pan-genome of a sical’ studies have characterized isolates by their species has been defined as the total gene repertoire genotypes at a small number of marker loci. Recent of the species, including both core genes, which are advances in sequencing technology have facilitated present in the genome of (almost) all individuals two additional approaches: sequencing the entire within the species, and accessory genes, which have genomes of multiple isolates or sequencing ran- a sparser distribution. Theoretical studies suggest domly from an entire bacterial community (meta- that the frequency of a gene in a bacterial population genomics). While there are now many species for would depend on parameters related to its inheri- which several isolates have been sequenced, these tance mechanism and its fitness effect on the reci- isolates have generally been chosen from the global pient genome (Berg and Kurland, 2002; Novozhilov diversity of the species, and do not represent a et al., 2005), as would allele frequency at a given locus (Mes, 2008). Population genomics approaches Correspondence: JPW Young, Department of Biology, University using full genomes are giving insights into the para- of York, YO10 5DD, York, UK. meters that shape the diversity of bacterial species E-mail: [email protected] (Didelot et al., 2007), but estimates of diversity and 1Current address: INRA (Institut National de la Recherche recombination are not easily interpreted when the Agronomique), UR346 Epide´miologie Animale, 63122 Saint Gene`s Champanelle, France isolates are sparsely sampled from a global distribu- Received 25 January 2011; revised 28 March 2011; accepted 28 tion. By contrast, metagenomic studies do provide March 2011; published online 12 May 2011 samples of a local population, and gene-based Population genomics of Sinorhizobium medicae X Bailly et al 1723 estimates of diversity, but the partitioning of these plasmid called pSMED03. Although S. meliloti sequences into discrete genomes is not known. This 1021 does not have any comparable plasmid, hampers studies on recombination and selection, additional plasmids have also been observed in although some interesting analysis is still possible some other strains of this species (Mercado-Blanco (Allen et al., 2007; Eppley et al., 2007). These consi- and Olivares, 1993, 1994; Stiens et al., 2006, 2007; derations raise questions about the sampling strate- Kuhn et al., 2008). Based on sequencing of several loci, gies employed to study the genomic diversity of S. medicae is less diverse than S. meliloti, especially at bacteria (Rocha, 2008). Furthermore, it has been chromosomal loci (Bailly et al., 2006; van Berkum proposed that gene and allele frequencies obtained et al., 2006). Indications of directional and balancing from genomic data might be used in top-down selection have been identified for the nod gene region approaches (similar to association mapping) to on pSMED02/pSymA and the region involved identify, without a priori knowledge, genes that in polysaccharide synthesis on pSMED01/pSymB, could be involved in bacterial adaptations (Falush respectively, in sympatric populations of S. meliloti and Bowden, 2006). and S. medicae (Bailly et al., 2006). Such diversity In this context, we have acquired genomic data for patterns suggest that recombination has an important rhizobia, which are ecologically important bacteria role in shaping the diversity of both species. Within that have been extensively studied by both func- each species, linkage disequilibrium analyses suggest tional and evolutionary approaches, to assess the that homologous recombination occurs preferentially results that can be obtained from a population geno- at loci located on the chromid and megaplasmid, mic survey. Rhizobia constitute a functional group rather than the chromosome (Bailly et al., 2006). that includes both Alphaproteobacteria and Beta- The diversity of the accessory genome of S. meliloti proteobacteria able to form a nitrogen-fixing endo- has been studied using various approaches. Novel symbiosis with a legume host plant. To establish genomic sequences have been described both from this interaction, bacteria trigger the organogenesis of the cryptic plasmids of the strain SM11 (Stiens nodules, which are usually located on the roots of et al., 2006, 2007) and also from a representational the host. The association between plants belonging difference analysis based on the strain ATCC 9930 to the genus Medicago, such as alfalfa (Medicago (Guo et al., 2005). These new sequences occur with a sativa) or barrel medic (Medicago truncatula), and wide range of frequencies in natural populations of their specific symbionts, Sinorhizobium meliloti S. meliloti (Guo et al., 2005; Kuhn et al., 2008). Some and Sinorhizobium medicae, is one of the best- of this genetic material might have an important role studied systems. Bacterial genetic and post-genomic in the adaptation of Medicago symbionts. Finally, approaches have demonstrated the critical role the analysis of the genome content of four natural played by genes involved in the synthesis of isolates of S. meliloti using comparative genomic nodulation factors (that is, nod genes) and surface hybridization revealed that at least 12% of the polysaccharides (for example, exo or rkp genes) in genes of the model strain S. meliloti 1021 have been providing signals for the recognition of S. meliloti recently duplicated, gained or lost during the and S. medicae by their host (Jones et al., 2007). diversification of the species (Giuntini et al., While most functional studies have used S. meliloti 2005). Altogether, these data suggest that high rates as a model