REVIEWS Nature Reviews Genetics | AOP, published online 12 February 2008; doi:10.1038/nrg2319

Learning how to live together: genomic insights into symbioses

Andrés Moya*‡, Juli Peretó*§, Rosario Gil*‡ and Amparo Latorre*‡ Abstract | Our understanding of prokaryote–eukaryote symbioses as a source of evolutionary innovation has been rapidly increased by the advent of , which has made possible the biological study of uncultivable endosymbionts. Genomics is allowing the dissection of the evolutionary process that starts with host invasion then progresses from facultative to obligate and ends with replacement by, or coexistence with, new symbionts. Moreover, genomics has provided important clues on the mechanisms driving the -reduction process, the functions that are retained by the endosymbionts, the role of the host, and the factors that might determine whether the association will become parasitic or mutualistic.

Symbiosis Prokaryotic microorganisms are widespread in all the symbioses during the origin and evolution of eukaryotic 7 From the Greek, sym ‘with’ and environments on Earth. Given their ecological ubiquity, cells, although controversies about the details persist . biosis ‘living’. A long-term it is not surprising to find many prokaryotic species in In addition, on the basis of its wide distribution across association between two or close relationships with members of many eukaryotic major phylogenetic taxa (BOX 1), symbiosis could have more organisms of different species that is integrated at the taxa, often establishing a persistent association, which an important role in the evolution of species that are behavioural, metabolic or is known as symbiosis (BOX 1). According to the fitness involved in such partnerships. In all the major biological genetic level. According to effects on the members of the symbiotic relationship, phenomena that are classified as symbiotic, new cellular the level of dependence on the associations can be referred to as parasitism, mutualism structures and/or metabolic capabilities emerge as a host, symbiosis can be obligate or commensalism and, depending on the location of the result of evolutionary forces, favouring the maintenance or facultative. The term was 8 introduced by Anton de Bary symbiont with respect to host cells, as ectosymbiosis or of the association . The interplay of both partners or, in and Albert Bernard Frank when endosymbiosis. Among intracellular symbioses, there are some cases, between a single host and more than one discussing lichens and differences regarding the extent of dependence between symbiont, forms an evolving biological community that mycorrhizae, respectively, at the animal host and the symbiont and the age of the asso- changes throughout time. the end of the 1870s. ciation, leading to the dichotomic classification between In most cases of symbioses between and obligate primary endosymbionts (P-endosymbionts) eukaryotes, the relationship between host and symbiont *Institut Cavanilles de and facultative secondary symbionts (S-symbionts). is so close that the microorganisms cannot be cultured, Biodiversitat i Biologia P‑endosymbionts generally have long evolutionary histo- making them difficult to study. However, in the past Evolutiva, Universitat de ries with their hosts, whereas S‑symbionts seem to have few years, genome sequencing, transcriptomics and València, Apartado de correos metagenomics 22085. 46071 València and established more recent associations, retaining the ability the recently emerged field of — which 1–4 CIBER de Epidemiología y to return to a free-living condition . In some cases, an avoid having to culture microbial cells — have offered Salud Pública, Spain. S‑symbiont can evolve to become an obligate partner new opportunities for symbiosis research. Whole ‡Departament de Genètica and and, if a P‑endosymbiont is already present, microbial of several intracellular bacterial symbionts §Departament de Bioquímica i Biologia Molecular, consortia can be established. have been sequenced, allowing comparisons among Universitat de València, For several decades, the idea that microbial associa- the evolutionary innovations of these , from Dr. Moliner, 50. 46100 tions might be central to eukaryote evolution remained being free living to various stages of integration with Burjassot (València), Spain. controversial. However, the work of Lynn Margulis since their respective hosts9–23 (TABLE 1). Such genomics Correspondence to A.M. the late 1960s not only contributed to the establishment approaches to the study of symbiosis will allow several e-mail: [email protected] 5 doi:10.1038/nrg2319 of a symbiotic theory for cell evolution , but also revived questions to be tackled, contributing to an understand- 6 Published online earlier proposals that were forgotten by biologists . ing of the evolutionary relevance of this phenomenon. 12 February 2008 Today, there is a wide consensus on the essential role of In particular, these questions relate to the nature of the

218 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

Box 1 | Spread of symbiosis in the biosphere Stable symbioses have independently evolved many times in diverse recycling. In the best-studied cases — those of symbioses involving groups of eukaryotes. Although the scope of this review is restricted to insects97 — the presence of such associations throughout most of prokaryotic partners that are associated with animal hosts, there are many evolutionary history suggests that symbiosis has been a driving force in other remarkable examples of symbiosis between eukaryotic organisms, the diversification of this group. as exemplified by fungal associations with other fungi, protists, plants The figure shows the phylogenetic distribution of symbioses, indicating (that is, mycorrhizae), algae (that is, lichens) and (for an overview, the bacterial and archaeal classes within which there are associations see ref. 90). Most symbioses have a proven biochemical foundation. In with eukaryotic hosts. Data were collected from the literature and are some cases, one of the partners benefits from compounds that are the result of a long tradition of studies that have used ecological, produced by the other; in other cases, the waste products of one partner developmental, morphological, biochemical and genomic approaches to (especially nitrogen compounds) are recycled by the other. investigate symbiosis (a list of examples is given in the Supplementary The eukaryotic nucleocytoplasm has limited metabolic capabilities. The information S2 (table)). The uneven distribution of symbiosis highlights mitochondrial respiratory chain and oxygenic photosynthesis in the diverse interests and motivations of the scientific community. the chloroplast are two examples of metabolic functions that have been Advances in genomics have had a large impact on the research into acquired through symbiotic associations with prokaryotic partners microbial symbioses, which has implications for biotechnology (for during early eukaryotic evolution8. Countless symbiotic relationships example, -associated prokaryotes98), agriculture (for example, have emerged more recently, or are in the process of developing, and nitrogen fixation in plant-associated bacteria91) and biomedicine (for they complement the limited metabolic networks of most eukaryotes example, the human intestine microbiome99). Metagenomic methods with several prokaryotic metabolic capabilities, such as dinitrogen have notably increased our knowledge of the biodiversity of non- fixation91, methanogenesis92, chemolithoautotrophy93, nitrogen cultivable bacteria in symbiotic consortia, including the description of assimilation94 and essential-nutrient anabolism95,96. In particular, animal completely new candidate phyla — Endomicrobia100 and Poribacteria101. heterotrophic metabolism is relatively narrow and some amino acids, Postulated symbiotic events leading to the evolutionary origin of vitamins and fatty acids must be obtained from an external source. Thus, organelles (mitochondria and chloroplasts) are indicated. The branching symbiotic associations with microorganisms have allowed animals to order in the eukaryotic phylogenetic tree has been adapted from refs adapt to specialized feeding behaviours by providing the nutrients that 102,103. The phylogenetic tree of animals is an adaptation from ref. 104, are deficient in their restricted diets. In fact, all intracellular mutualistic modified by recent molecular phylogenies105,106. The lengths of branches symbioses between bacteria and animals that have been analysed at the are not to scale. An asterisk indicates that complete genomes are genomic level9–23 (TABLE 1) are related to nutrient provision and waste available (see TABLE 1).

Cy Cy Rhizaria Cy, α, β, γ, δ, M a nt ozoa ra a rc fe at ko Ce Excavata Fi, S, M ol ro ramini a ve Fo Metamonada Al Hete Cy, , M β Euglenozoa, Radiolari A, Cy, α, β, γ Heterolobosea Viridiplantae Malawimonas Amoebozoa Rhodophyta Plantae E Oxymonada Discosea Glaucophyta A, B, Ch, Fi, α, β, γ, ε Variosea Jakobidae Ch, γ Lobosea Chloroplast Mycetozoa Cryptophyta M Archamoebae Haptophyta Opisthokonta Cristidiscoidea B, F, Fi, γ, M Centrohelid heliozoa Vertebrata Cy Fungi Cy, γ Urochordata Choanozoa Apusozoa Cephalochordata Animalia Mitochondria Echinodermata Eukaryotes Hemichordata Xenoturbellida Prokaryotes Priapulida α*, γ Nematoda Archaea Arthropoda Methanobacteria M A, B*, Fu, S, α, β, γ*, δ, ε, M Thermoprotei T B, α, γ, δ, ε, S Annelida Bacteria γ Ectoprocta Sipunculida Ac N γ*, ε Mollusca A α- α Rotifera B β-Proteobacteria β Platyhelminthes Ch γ-Proteobacteria γ Ctenophora Cl δ-Proteobacteria δ Cy, α, β Cnidaria Cy ε-Proteobacteria ε F S Porifera Fi V Ac, A, Cl, Cy, Fi, G, N, α, β, γ, δ, S, V, P, T Fu Ca. Endomicrobia E G Ca. Poribacteria P Nature Reviews | Genetics nature reviews | genetics advance online publication | 219 © 2008 Nature Publishing Group

REVIEWS

Table 1 | Genomic data for mutualistic symbionts of animals Organism Host Metabolic Genome GC content CDS rRNAs tRNAs Pseudogenes Accession mode size (kb) (%) number Buchnera Acyrthosiphon pisum Heterotroph 652 26.24 574 3 32 12 BA000003, aphidicola BAp* (aphid)‡ AP001070, AP001070 Buchnera Schizaphis graminum Heterotroph 653 26.3 556 3 32 33 AE013218, aphidicola BSg* (aphid) AF041836, Z21938 Buchnera Baizongia pistaciae (aphid) Heterotroph 618 25.3 507 3 32 9 AE016826, aphidicola BBp* AF492591 Buchnera Cinara cedri (aphid) Heterotroph 422 20.2 362 3 31 3 CP000263, aphidicola BCc* AY438025 Blochmannia Camponotus floridanus Heterotroph 706 27.4 583 3 37 4 BX248583 floridanus*§ (carpenter ant) Blochmannia Camponotus Heterotroph 792 29.6 610 3 39 4 CP000016 pennsylvanicus*§ pennsylvanicus (carpenter ant) Wigglesworthia Glossina brevipalpis Heterotroph 698 22.5 617 6 34 14 BA000021, glossinidia* (tsetse ) AB063523 Sodalis Glossina morsitans Heterotroph 4,171 54.7 2,516 7 69 972 AP008232, glossinidius* (tsetse fly)‡ (Secondary) AP008233, AP008234, AP008235 Baumannia Homalodisca coagulata Heterotroph 686 33.2 595 6 39 9 CP000238 cicadellinicola*§ (sharpshooter) Sulcia muelleri§|| Homalodisca coagulata Heterotroph 245 22.4 227 3 31 – CP000770 (sharpshooter) Carsonella ruddii*§ Pachypsylla venusta (psyllid) Heterotroph 160 16.6 182 3 28 – AP009180 Wolbachia wBm¶ Brugia malayi (nematode)# Heterotroph 1,080 34 805 3 34 98 AE017321 Ruthia magnifica*§ Calyptogena magnifica Autotroph 1,200 34.0 976 3 36 – CP000488 (Deep-sea clam) Vesicomyosocius Calyptogena okutanii Autotroph 1,000 31.6 937 3 35 – AP009247 okutanii*§ (Deep-sea clam) Nitratiruptor sp** Deep-sea-vent animals Autotroph 1,878 39.7 ~1,118 9 45 ~739‡‡ AP009178 Sulfurovum sp** Deep-sea-vent animals Autotroph 2,563 43.8 ~1,218 9 44 ~1,248‡‡ AP009179 The data shown are for those symbionts that have been sequenced as of January 2008. Data were retrieved from the National Center for Biotechnology Information (NCBI). *γ-Proteobacteria. ‡Genome sequence in progress. §These bacteria are called Candidatus. ||Bacteroidetes. ¶α-Proteobacteria. #Complete genome available, see ref. 78. **ε-Proteobacteria; although it is unclear whether these two isolates are epibiotic symbionts or another variation of symbionts, many genome features strongly support their symbiotic lifestyle. ‡‡Estimated from ref. 20. CDS, coding sequences; rRNA, ribosomal RNA.

genome-reduction process, the type of that are aspects such as metabolic interdependencies between retained by the endosymbiont, the molecular pathways partners and the survival of the prokaryote in the face that are used by the host to control the endosymbiont of the host immune response. population and the complex set of factors that deter- mine whether the final outcome of the association is Genomics coverage of prokaryotic endosymbionts parasitic or mutualistic. Obligate endosymbionts of animals. In recent years, Here we focus on the study of mutualistic endosym- the genomes of several bacterial P‑endosymbionts of Parasitism (TABLE 1) Symbioses in which one bioses of animals for which genomic studies have been animals have been completely sequenced . species is increasing its fitness carried out, providing new biological and evolutionary P‑endosymbionts are inherited by strict vertical trans- while the fitness of the other insights. Although only one host genome has been mission and, in most cases, reside in specialized host species is adversely affected. sequenced, complete genomic sequences for several cells called bacteriocytes. Functional analyses have cor-

Mutualism bacterial symbionts are available. Comparisons with roborated the nutritional role of the associations; each Symbioses in which both free-living relatives have revealed dramatic changes in bacterium provides the nutrients that are deficient in species increase their fitness. genome size and content, and have begun to reveal the the diet of its animal-host, whereas the bacterium gains mechanisms by which these changes take place and their a permanent supply of a wide range of metabolites that Commensalism functional consequences. Furthermore, the mechanisms are provided by the host. Symbioses in which one partner is increasing its fitness that are involved in the establishment, maintenance and Most of the endosymbionts that have been studied without affecting the other evolution of mutualistic associations are being scruti- are γ‑proteobacteria that live in obligate association species. nized; this has led to the emergence of new insights into with , and they provide various functions to

220 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

their host, including: the provision of essential amino cedar-aphid endosymbionts (FIG. 2), led Pérez-Brocal acids9,10,12–14,16,21,23, vitamins and cofactors9–12,15 that are and co-workers21 to conclude that S. symbiotica SCc absent in the host diet; nitrogen recycling and storage13,14; might have the potential to replace B. aphidicola BCc. and the provision of metabolic factors that are required In a second example, genomic sequencing has been for survival and fertility19. Whereas most of these species carried out for Baumannia cicadellinicola and the are heterotrophic endosymbionts, two recently sequenced Bacteroidetes species Sulcia muelleri, the co-resident Ectosymbiosis endosymbionts that are harboured by deep-sea clams of P‑endosymbionts of the xylem-feeding sharpshooter A symbiosis in which the the genus Calyptogena17,18 are chemolithoautotrophic, and Homalodisca coagulata15,23. Genomic analysis has symbiont lives on the body surface of the host, including provide almost all nutrients to the host by fixing CO2 revealed that the two symbionts have complementary internal surfaces such as the and using H2S as a source of energy and reducing power biosynthetic capabilities, which are needed to provide lining of the digestive tube and (FIG. 1 and Supplementary information S1 (figure)). their host with nutrients that are lacking in xylem sap. the ducts of glands. Phylogenetic analysis revealed that Sulcia was ancestrally Facultative symbionts. Occasionally, hosts tolerate present in a host lineage that acquired Baumannia at the Endosymbiosis Symbioses in which a S‑symbionts that coexist with a P‑endosymbiont and same approximate time as the switch to xylem feeding, 24 prokaryote symbiont lives that can be either deleterious or beneficial to the host . consistent with the view that its nutrient-provisioning inside a eukaryotic cell. In aphids, a number of facultative symbionts that reside capabilities were a requirement for the host to evolve in multiple host tissues, cells surrounding primary towards this lifestyle41. As in the case of Buchnera– Primary endosymbiont (P-endosymbiont). Obligate bacteriocytes or in their own bacteriocytes have been Serratia, the two symbionts live in close proximity within 25 15 bacterial endosymbionts that described . Although they are normally vertically trans- the host bacteriome . live inside specialized animal mitted, their distribution patterns suggest that sporadic Metagenomic approaches have also been used to host cells called bacteriocytes. horizontal transmission among host individuals and host analyse more complex consortia, including the four co- The association is obligate for species must have occurred26. Among the possible occurring symbionts from the segmented worm Olavius both partners. beneficial roles that have been identified, some of these algarvensis42. In this instance, the host belongs to a group 27–29 Secondary symbiont S‑symbionts can rescue the host from heat damage , of oligochaetes that lack a mouth, gut and anus, and (S-symbiont). Facultative provide defence against natural enemies (parasitoids and that are unique among annelids in having reduced their bacterial endosymbiont pathogens)30–32 and participate in host specialization33–36. nephridial excretory system. O. algarvensis harbours that coexists with a γ ( P‑endosymbiont. Often In other cases, facultative symbionts can spread among at least four symbionts, two -proteobacteria sulphur located in syncitial cells near lineages without conferring a benefit, by manipulating oxidizing) and two δ-proteobacteria (sulphate reducing), ref. 37 the bacteriocyte and in various host reproduction (reviewed in ). For example, which fix CO2 and provide the host with nutrients. Almost other insect tissue types. in Wolbachia infections of , the symbiont all amino acids and several vitamins can be synthesized by Secondary symbionts are not undergoes transfer among host lineages. Remarkably, the symbionts and then provided to their host, probably essential for host survival and are transferred horizontally Wolbachia can also be a P‑endosymbiont in filarial through controlled digestion of the bacteria as suggested 19 43 among individuals of both the nematodes , indicating that the same prokaryotic line- by the vacuole localization of the host . The symbionts host species and other species. age can take part in more than one type of symbiotic are also likely to be involved in host-waste recycling. association. Metagenomics Genomic changes during endosymbiont evolution The application of genomic At present, the only completely sequenced genome analyses to uncultured of an S‑symbiont is that of Sodalis glossinidius (TABLE 1). Genomic changes experienced by endosymbionts. A gen- microorganisms. Also referred This bacterium coexists in the gut lumen of tsetse eral feature of intracellular pathogenic and mutualistic to as environmental genomics. with the P‑endosymbiont Wigglesworthia glossinidia, bacteria is that they have smaller genomes with a higher although they occupy different areas of the gut and can AT content than their free-living relatives44. The causes Vertical transmission 22 The endosymbionts are be found both intra- and extracellularly . It has been of reductive-genome evolution are related to both the maternally transferred, that suggested that S. glossinidius has an important role in genetic information that is needed in the new environ- is, directly from a host to the acquisition of trypanosome infections38. ment and to population dynamics, although the relative its offspring. importance of these factors has not been determined45,46.

Bacteriocytes Endosymbiotic consortia. Although the importance of The change from a free-living environment to one that is Specialized cells of the host syntrophy between unrelated organisms has been recog- intracellular and protected implies that many genes are species in which symbiotic nized in several cases39, the new field of metagenomics rendered unnecessary, whereas others become redun- bacteria live. provides the ability to study bacterial endosymbionts dant because their functions can be supplied by the host. that coexist in the same host, enabling the discovery Therefore, some genetic material can be lost without a Heterotrophy The metabolic mode in which of complex and stable associations, and analysis of the detrimental effect, and can accumulate mutations owing the carbon source is organic contribution of each partner to the relationship. to the lack of effective natural selection to purge them. In matter. By extension, this is a One endosymbiotic consortium that has recently addition, owing to the strict vertical transmission of obli- metabolic mode in which been reported involves Buchnera aphidicola BCc gate endosymbionts, only a few bacteria are transmitted, organic matter is the source of carbon, electrons and energy and Serratia symbiotica SCc, which coexist in the which generates continuous bottlenecks that favour the 40 (chemoorganoheterotrophy). bacteriome of the aphid Cinara cedri . S. symbiotica is action of random genetic drift. The combination of these a facultative symbiont in other aphid species, but has factors allows the accumulation of mutations in genes Chemolithoautotrophy been found in all the cedar-aphid populations that that can be beneficial but that are not essential — such The metabolic mode in have been analysed, casting doubts over its faculta- as genes involved in DNA repair and recombination which CO2 is the carbon source and an inorganic tive status in this species. Functional and evolutionary (see below) — further increasing the repertoire of genes chemical reaction is both the comparative analyses of the sequenced B. aphidicola that can be lost, and reducing the possibility of genetic electron and energy source. genomes, as well as microscopic analysis of the two exchange by homologous recombination. Furthermore,

nature reviews | genetics advance online publication | 221 © 2008 Nature Publishing Group

REVIEWS

a CO2

+ + + + Nucleus + Ion transport PTS Nutrient of host transport cell to the host Mitochondria Candidatus Nitrogen Ruthia magnifica metabolism Reductive Cell-wall pentose pathway biosynthesis Membrane- phospholipid Inner and outer biosynthesis Amino-acid biosynthesis export ATP (except Thr and Ile) stem bacterial membranes sy

Sulphur Sec metabolism Fatty-acid De novo biosynthesis nucleotide biosynthesis H2S and salvage Cofactor biosynthesis pathways Electron (except pantothenate and UQ) transport + H ATP H+ Membrane- O 2 associated Cytoplasm of Vacuole of the ATP synthesis the host cell host cell

b Reduced carbon, non-essential amino acids

+ + + + PTS Nucleus + Ion transport of host ? Nutrient cell transport Glycolysis to the host Mitochondria Buchnera Nitrogen aphidicola BCc metabolism Non-oxidative pentose pathway Protein Cell-wall export

biosynthesis Membrane- stem phospholipid Amino-acid biosynthesis sy biosynthesis Inner and outer (except Trp) Sec bacterial membranes

Sulphur De novo metabolism Fatty-acid nucleotide biosynthesis? biosynthesis Flagella and salvage export Cofactor biosynthesis pathways apparatus Electron (except riboflavin) transport H+ Membrane- O2 Horizontal transmission Cytoplasm of associated ATP synthesis Some endosymbionts retain a the host cell Serratia symbiotica generalized ability to colonize Vacuole of the host cell and persist in multiple hosts, that is, their transmission is Figure 1 | Comparative biochemistry of symbiont–host interdependence. As a study case, we use information from between individuals of the Nature Reviews | Genetics same or different host species, complete genome sequences to infer and compare the metabolisms that are encoded by the reduced genomes of an rather than from parent to autotrophic and a heterotrophic endosymbiont. Blue indicates the presence of a metabolic process, orange indicates its 17 offspring. absence. a | Candidatus Ruthia magnifica uses electrons from H2S to drive an autotrophic metabolism based on the reductive pentose phosphate pathway (Calvin–Benson cycle), supplies essential nutrients (including amino acids and Syntrophy cofactors) to its host, and recycles nitrogen. As is the case in the Calyptogena okutanii endosymbiont18, it has the potential Emergence of new metabolic for anabolism of all essential biomolecules except threonine (Thr), isoleucine (Ile) and ubiquinone (UQ). The missing steps capabilities as a result of in those pathways could be explained by the function of an unidentified enzyme, or by complementation with the host symbiosis, it is often metabolism. b | Buchnera aphidicola BCc uses reduced-carbon molecules and non-essential amino acids to fuel its essential for the survival heterotrophic metabolism and synthesize essential nutrients, with the exception of tryptophan (Trp) and riboflavin21. of the consortium. These molecules could be provided by a second co-existing endosymbiont, Serratia symbiotica. The self-sufficiency of Bacteriome R. magnifica sharply contrasts with the incomplete metabolic abilities of B. aphidicola BCc. Among the few similarities An organ-like structure formed between both metabolisms, note the absence of nutrient-transport systems to the host. A more detailed version can be by bacteriocytes. found in the Supplementary information S1 (figure). PTS, phosphotransferase system; Sec system, secretory system.

222 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

the intracellular environment isolates the symbiont from elements. The increase in frequency of these elements in other bacterial species and thus reduces, and can even the newly established P‑endosymbiont must be due to eliminate, the possibility of gaining new genetic mate- an increase in the replicative transposition of elements rial by horizontal transfer. Thus, gene losses can that were resident at the onset of symbiosis24. All analysed become irreversible. endosymbiotic bacteria that are involved in older associa- In addition to having reduced genomes, an increased tions possess genomes that are free of mobile elements. AT content is another general feature of the endosymbi- Presumably, therefore, the expansion of mobile ont genomes that have been sequenced so far. The bias elements must at some point have become deleterious towards AT is probably due to the loss of genes for DNA and they must have been removed as part of the process of repair, leading to a mutational GC to AT pressure47 (for genome degradation. The difference in the abundance a metabolic hypothesis see ref. 48). Such a bias has a of transposable elements in two strains of Wolbachia notable effect on many genes by altering the structure with different lifestyles and genome sizes supports this and function of the corresponding . In fact, it statement. In Wolbachia pipientis wMel57, a reproductive has been proposed that GroEL, a chaperonin that is parasite of Drosophila melanogaster, 14% of the 1.27-Mb constitutively overexpressed in B. aphidicola, helps to symbiont genome is occupied by repetitive DNA and post-translationally correct the altered structure of many mobile elements, whereas in W. pipientis wBm19, the obli- proteins that is caused by this bias49. A by-product of the gate endosymbiont of the nematode Brugia malayi, these base-composition bias of these genomes is the loss of elements account for just 5.4% of the 1.08-Mb genome. the codon-usage bias that is typical of free-living bacte- During the second stage of the genome-reduction ria: it is almost absent from B. aphidicola, and is highly process, genome shrinkage seems to have mostly reduced in P‑endosymbionts with larger genomes and occurred through a process of gradual gene loss, scat- in S‑symbionts50,51. tered along the genome. Such loss seems to follow The degree of both genome-size reduction and a pattern that starts with the inactivation of a gene increase in AT content vary among endosymbionts, (pseudogenization) by single-nucleotide mutations, and correlate with the age of the association. P‑endosymbionts that are partners in old associations a have generally smaller genomes and an AT content higher than 70%, whereas S‑symbionts and P‑endosymbionts 10 µm S that are part of younger associations have genome sizes and AT contents intermediate with respect to older n 2,25,50 P‑endosymbionts and free-living relatives . P P n The genome-reduction process examined. To understand the different stages of the genome-reduction process it is necessary to analyse a range of genome sequences, from S endosymbionts that still have genomes similar in size to those of their free-living relatives, to those that have undergone the most dramatic reductions. Such compara- tive analyses suggest that the process of genome shrink- P age might have taken place in two separate stages. S A massive gene loss must have occurred soon after the establishment of the obligate symbiosis, probably by b means of large deletions that would cause the elimination of a series of contiguous genes52. The accumulation of mobile elements, representing a source for chromosomal rearrangements and gene inactivation, seems to have an m important role at this first stage. This is suggested by m the fact that mobile elements are relatively abundant in bacteria that have recently acquired an obligate, intra- cellular way of life53,54. Mobile elements, such as bacte- rial insertion sequences (IS), are fairly abundant in the S‑symbiont S. glossinidius22, but even more so (estimated Figure 2 | Bacteriocytes of the aphidNatur Cinarae Reviews cedri | Genetics. at more than 25% of genome content) in its close rela- a | A 1.5 mm semi-thin section that shows both primary tives SOPE (Sitophilus oryzae primary endosymbiont) and secondary bacteriocytes, easily identifiable by their different tones under toluidine blue staining. and SZPE (Sitophilus zeamais primary endosymbiont). b | Electron micrographs of the same individual that These species are P‑endosymbionts of the rice and maize show the round shape of Buchnera aphidicola and weevils, respectively, and both have recently established Serratia symbiotica, respectively, in their bacteriocytes. obligate associations with their hosts55,56 (sequencing of P, primary symbiont (B. aphidicola); S, secondary symbiont the SOPE genome is in progress). (S. symbiotica); n, nuclei of both bacteriocytes; These data indicate that a common symbiotic ancestor m, mitochondria. Electron micrographs courtesy of of these two lineages must already have possessed these A. Lamelas, Universitat de València, Spain.

nature reviews | genetics advance online publication | 223 © 2008 Nature Publishing Group

REVIEWS

and continues with a rapid reduction in length until the required for host survival, as we discuss later. Another original gene is completely eroded58,59. Even in advanced general feature of all sequenced endosymbiont genomes stages of the reductive process, genome-length reduc- is the loss of most DNA repair and transcriptional regula- tion is mainly due to the loss of protein-coding genes, tory mechanisms. Furthermore, losses affect most genes not to their shortening21. Regarding intergenic regions, of the latter category, indicating that there is no need there is a slight size reduction in the four sequenced for transcriptional regulation in a stable environment. B. aphidicola strains when compared with their close This hypothesis is supported by the results of microarray relative Escherichia coli60. The shortening of intergenic experiments in B. aphidicola62,63, which show constitu- regions is more evident in extremely reduced genomes, tive gene expression of metabolic genes, regardless of as observed in Carsonella ruddii16, in which this has changes in environmental conditions. been so extreme as to have led to overlapping genes. Other features of the functional content of endo- In C. ruddii, the annotated ORFs are also consider- symbiont genomes are more dependent on their spe- ably shorter than their orthologues in other bacteria, cific lifestyles. For example, the bacterial cell envelope although the functionality of these shortened ORFs is seems to be highly simplified, being less structured in questionable61. bacteria living inside host-derived vesicles (for example, B. aphidicola) than in endosymbionts that live free in Functional changes in hosts and symbionts the cytosol of bacteriocytes (for example, Blochmannia The first step towards the establishment of an obligate floridanus and W. glossinidia)13. The simplification can endosymbiosis takes place when a free-living bacterium be extreme; for example, B. aphidicola BCc has lost all infects the host. Then, both organisms co-evolve to adapt the genes that are involved in the biosynthesis of the to the new situation. On the bacterial side, genomics bacterial cell wall21. studies have revealed that the endosymbiont genome gets smaller during this adaptive process, owing to the Metabolic changes and nutrient transport. The bigger loss of genes that are rendered unnecessary in the new endosymbiont genomes, belonging to more recently environment; however, a symbiotic bacterium must still established partnerships, have retained many genes that retain a number of functions to allow its continued sur- are involved in several intermediary metabolic path- vival in the host. By contrast, our understanding of how ways, but most of these have been lost in the smaller the animal host is affected at the genetic level, and how it genomes of well-established endosymbionts (FIG. 1; has solved the problem of having a bacterium (or more TABLE 1). The loss of metabolic genes is also affected to a than one) inside its cells, is poorly understood. However, certain extent by the presence of other bacterial species host specialization implies that it is now dependent on in the intracellular environment of the symbiont, in that the bacteria to survive and reproduce. Adaptive changes the simultaneous presence of S‑symbionts can compen- in the host include the development of specialized cells sate for the metabolic deficiencies of the P‑endosymbiont, where bacteria reside, adequate systems to control bacte- as well as those of the host64. rial populations, plus the modification of its immune Even though each endosymbiont (or the combina- response against the intruder cells. tion of symbionts within a consortium) has specifically retained the pathways that are necessary to synthesize General changes in endosymbiont gene content. The the nutrients that respective hosts are unable to provide catalogue of individual genes that are shared by all for themselves, endosymbionts exhibit limited transport analysed endosymbionts suggests that the molecular capabilities65 (FIG. 1). This situation has been observed in mechanisms that are necessary for survival in an intra- B. aphidicola, the two deep-sea-clam endosymbionts cellular environment share broad commonalities across that were recently analysed17,18 and the endosymbionts of all endosymbiotic associations13. Only about one-third O. algarvensis 42. The subcellular localization of endo- of the coding capacity of each endosymbiont genome symbionts inside vacuole-like compartments66,67, and that has been sequenced so far is devoted to the require- the presence of lysozyme in the host cells68,69, strongly ments of the specific symbiosis, with these genes mainly suggest that, in these systems, the host nutritional needs reflecting differences in host lifestyle, nutritional needs could be satisfied by controlled weakening or killing of and location within host cells. symbiont cells — a case of a necrotrophic host–symbiont Across the sequenced endosymbiont genomes, genes relationship. that are involved in essential functions — such as those involved in DNA replication, transcription and transla- Invasion strategies: have pathogens been domesticated? tion — are more likely to be retained than are genes Recent studies of the mechanisms that are used by endo- of other functional classes, and sometimes account for symbionts to establish and maintain their infection of more than one-third of the genome21. Chaperone sys- host tissues indicate that invasion strategies are based on tems and all essential components of the protein trans- the same molecular tools that are used by well-studied location machinery are also retained in these genomes, pathogenic bacteria (reviewed in refs 24,47). These ensuring the correct folding and localization of protein mechanisms include the use of various secretion systems components9,49. for the attachment and invasion of host cells64 and the Although gene losses affect all functional categories, utilization of the same quorum-sensing mechanisms they are nonrandom. The most dramatic losses affect for the regulation of virulence or mutualistic traits, genes that are involved in metabolism but are not depending on the type of association17,18.

224 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

Cap violaceum, respectively. Interestingly, both regions have a FliD complete set of genes encoding a functional T3SS export apparatus, but they lack many genes encoding effector proteins in the orthologous pathogenic islands. Moreover, Hook–filament junction Filament the only putative effector proteins that are preserved are a FlgK FlgL FliC specifically involved in the cytoskeletal rearrangements Hook necessary for bacterial cell invasion24. FliK FlgD FlgE Both parasitic and mutualistic Wolbachia lack a T3SS, but they do possess a complete set of genes for another bac- FlgH ring L Rod FliE FlgB FlgC FlgF FlgG terial secretion system, T4SS, which is used by many path- Outer membrane ogens to secrete proteins in order to invade host cells19,57. FlgI ring P B. aphidicola and W. glossinidia — well-established Cell wall Basal P‑endosymbionts in an advanced stage of genome reduc- body Stator MotA MotB Inner membrane tion and with highly simplified cell envelopes — also lack Export apparatus a canonical T3SS, but still retain surface mechanisms for FliF MS ring FlhA FlhB FliOP FliQ FliR exporting proteins, such as a simplified flagellar appara- tus71 (FIG. 3). Because these structures are quite abundant ATPase Rotor (ring C) Flil FliH FliM FliN FliG at the B. aphidicola surface — even though these are non- FliJ motile bacteria72 — and only those elements homologous FlgN to the T3SS have been retained in all of these reduced FlgA genomes21 it has been proposed that these structures b FlgH FlgF must be involved in invading new bacteriocytes, ovaries and embryos to ensure transmission to host offspring9. FlgI The immune response and control of bacterial inva- sion. The molecular mechanisms that are involved in the host immune response to bacterial invasion remain FlhA FlhB FliOP FliQ FliR FliF poorly understood73 but, clearly, hosts have adapted to maintain rather than eliminate endosymbionts. But how FliI FliH FliN FliG does the host immune system perceive these bacteria and control their growth and invasion without complete Figure 3 | The flagellum, an example of genome reduction at theNatur structurale Reviews level. | Genetics a | The Escherichia coli flagellum. Protein and structural components are indicated: cap, bacterial clearance? filament (formed by ~20,000 copies of flagellin), hook–filament junction, hook and basal Hosts have developed molecular systems to recognize body. The basal body is formed by a set of rings for anchorage to the inner and outer conserved microbial cell-envelope motifs (for example, membranes and to the cell wall, the rod, the stator, the rotor and the export apparatus peptidoglycan) through receptors such as those that were (including one ATPase). b | Vestiges of the flagellar apparatus in Buchnera aphidicola identified following the whole-genome sequencing of BCc. Elements that are involved in the formation of the flagellar filament and hook, the D. melanogaster74. Endosymbionts that have an ancient stator/motor and the transcriptional regulators are lost. None of them is necessary for a relationship with their hosts have highly simplified cell non-motile bacterium. Many elements that are involved in the anchorage of the export envelopes and, therefore, these old associations are not apparatus to the cell envelope have also been lost, consistent with the absence of a well- good models to study this type of immune control. structured outer membrane and cell wall. However, the MS‑ring protein (FliF), two out of However, analyses of younger endosymbiotic associa- three components of the C‑ring (FliN and FliG), the P‑ring protein (FlgI) and L‑ring protein (FlgH) are present. Only one protein for the rod formation (FlgF) is preserved. tions provide insights into the early stages of the interplay All the retained proteins are homologous to the components of the type III secretion between host immunity and bacterial virulence, for exam- system, supporting the hypothesis that this structure is used by the bacterium as a ple, endosymbionts of weevils from the genus Sitophilus. protein export system to help in the invasion of new host cells71. Heddi et al. carried out an in vivo broad characterization of the transcriptional response of the bacteriocyte to intracellular bacteria, using SZPE as a model organism75. Sitophilus spp. P‑endosymbionts, and their close They found that the bacteriome expresses a peptidog- relative the S‑symbiont S. glossindius, encode presum- lycan recognition protein (PGRP) that is homologous ably functional genes of the type III secretion system to the PGRP-LB protein of D. melanogaster, a catalytic (T3SS)70, an indication that the presence of the secretion member of the PGRP family. The amidase activity of system pre-dates the origin of the symbiotic association. PGRP-LB reduces the biological activity of peptidoglycan, The T3SS is a crucial virulence factor of many gram- and therefore downregulates the host immune response negative pathogens of plants and animals, including against gram-negative bacteria76. The specific role of the humans. However, it also seems to be a beneficial factor, weevil PRPG is still under investigation, but the fact that Pathogenic island enabling the establishment of a mutualistic, intracel- it contains the amino-acid residues that are responsible A part of a genome, for lular insect–bacteria association. In the S. glossinidius for amidase activity, and that PRPG transcripts only which there is evidence of genome there are three regions that contain distinct accumulate when endosymbiotic bacteria are present its acquisition by horizontal 77 transfer, that encodes genes T3SS systems. The best characterized are SSR‑1 and in the bacteriome , indicate that it might work in the that contribute to the virulence SSR‑2, which contain genes that are related to genes same way, thus — reducing the host’s defence against of a pathogen. found in enterica and Chromobacterium the endosymbiont to allow the long-term interaction.

nature reviews | genetics advance online publication | 225 © 2008 Nature Publishing Group

REVIEWS

Only one sequenced genome is available for a eukary- the intracellular bacteria to their host. This scenario otic host of an endosymbiont that also has a sequenced was initially proposed in the 1930s by Lwoff 81 and has genome — the genome of B. malayi 78, the nematode host been confirmed by genomic studies (see previous sec- of Wolbachia wBm. Consistent with a mitigated immune tion). However, the bacterial genome, no matter how response to control rather than eliminate the endosym- small, must retain the genes that are essential for the biont, the immune system of B. malayi has lost the ability maintenance of the symbiotic relationship (for example, to encode the small antibacterial peptides described in essential nutrient provision), and a reduced repertoire of other sequenced nematodes79. The genome sequence genes for self-maintenance, self-reproduction and evo- of the aphid Acyrthosiphon pisum will soon be avail- lution. Otherwise, genome reduction might eventually able and will provide us with additional clues into how lead these bacteria towards extinction. hosts control bacteria in the later stages of the symbiotic In time, a second symbiont can join the consortium. association. Although initially this new association is likely to be One additional insight into the control of endosymbi- facultative, a new stable relationship can be established, onts by their hosts comes from the finding that endosym- in which case the two bacterial species will co-evolve. bionts that do not reside in host-derived vesicles, but that As the genome-reductive process continues, and genes live free inside the cytosol, have lost the gene dnaA11,13. that are shared with the second endosymbiont become This gene encodes a protein that is involved in the ini- redundant, two possible outcomes can occur. The tiation of DNA replication, which might indicate that actual outcome is a matter of chance, depending on there is a direct control of bacterial growth by the host. which genome is affected by the loss of genes that are In the case of B. aphidicola, which does reside in host- needed for the synthesis of molecules that are essential derived vesicles, the high level of lysozyme expression for fitness. detected in bacteriocytes68,69 might be an important fac- One possibility is that the presence of a second symbi- tor involved in the host control of bacterial abundance, ont might accelerate the degenerative process of the first, as this enzyme catalyses the hydrolysis of peptidoglycan leading to its extinction and replacement by the formerly — a system normally used by animals as a constitutive facultative symbiont, which will then become obligatory. defence against bacterial pathogens. This replacement has been reported in symbioses of some weevils56 and might also be taking place in the case of The evolution of specialized host cells. Using transcrip- the Buchnera–Serratia consortium that has been estab- tomic analysis, a pioneering study was carried out into lished in the aphid C. cedri 21. Unlike other sequenced the development and evolution of aphid cells that are B. aphidicola strains, BCc has partially lost its symbiotic destined to become bacteriocytes. Several transcription role, as it cannot synthesize tryptophan and riboflavin factors were found to be expressed in these cells early (see Supplementary information S1(figure)), which need on in development — before colonization by the symbi- to come from another source, not only for the survival ont population acquired from the mother — following of the host, but also for that of B. aphidicola. Genes from a pattern that has not been described in any other insect S. symbiotica SCc that are involved in the biosynthesis cells80. Even when bacteria are experimentally removed of tryptophan have been identified, revealing that this by antibiotic treatment, bacteriocytes are still specified species might be the source of this essential amino acid. and maintained. The main conclusion of this work is The complete sequence of S. symbiotica SCc (currently that bacteriocytes represent an evolutionarily novel cell in progress) will tell us whether this bacterium is able to fate, which is developmentally determined independ- perform all the metabolic functions needed to maintain ently of the presence of bacteria. Regarding the role its host’s fitness, or whether it has also lost some path- of the bacteriocyte itself in the symbiotic relationship, ways. In the first case, extinction of the P‑endosymbiont the identification of a number of bacteriocyte-specific, (Buchnera) and replacement by Serratia would be the highly expressed genes in the aphid A. pisum by tran- most plausible hypothesis. scriptome analysis69 revealed that most of them were Alternatively — highlighting the second possibil- involved in several functions: non-essential amino-acid ity for the evolutionary outcome of a second symbiont metabolism, consistent with this process being among establishing itself in a host — the obligate biochemical the most important in symbiotic systems; transport, interdependence between two endosymbionts can evo- indicating that the bacteriocyte mediates exchange lutionarily seal the bacterial metabolic complementa- of metabolites and substrates; and production of lys- tion, and the establishment of a stable consortium would ozyme, possibly involved in nutrient uptake by the be the expected evolutionary outcome. This appears host, as already mentioned. A similar study that was to be the case in the sharpshooter H. coagulata, with carried out in S. zeamais revealed that the bacteriocyte two complementary endosymbionts, B. cicadellinicola also allows increased sugar uptake and metabolism, and and S. muelleri15,23. Whereas B. cicadellinicola is mainly various anti-stress systems that control the resultant devoted to the biosynthesis of vitamins, S. muelleri oxidative stress75. encodes the enzymes that are involved in the biosynthe- sis of most essential amino acids. The complementarity Evolutionary outcomes of symbioses is striking. For example, S. muelleri has lost the pathway Once an association has been established and the for the biosynthesis of histidine, which is the only bio- genome-reduction process has started, the loss of some synthetic pathway for essential amino acids retained in metabolic abilities, and other functions, irreversibly ties B. cicadellinicola.

226 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

Box 2 | Minimal cells and synthetic biology elucidated whether C. ruddii is on its way to becoming an organelle, or if it is nearing extinction and replace- The phenomenon of genome downsizing that has been observed in endosymbionts, ment by an unidentified symbiont61. The transfer of intracellular parasites and organelles has inspired a research programme on minimal genes from an organelle to the nucleus has been demon- life, based on the hypothesis that genomes must retain essential genes that are ref. 83 involved in housekeeping functions, and a minimum number of metabolic strated under laboratory conditions (see and ref- transactions for cellular survival and replication. Apart from shedding light on this erences therein). There is also evidence of gene transfer fundamental topic, the search for minimal genomes is of much value in the context from Wolbachia to the nuclear genome of several insect of synthetic biology. In fact, one of the aims of this emerging field is the definition and nematode hosts84. These remarkable cases of lateral and chemical synthesis of a minimal genome, and its incorporation and expression gene transfer should be a caution for those carrying out in a suitable chassis, either derived from a cell (top-down strategy) or starting from a eukaryotic genome sequencing, and should add new simple chemical system, such as a liposome (bottom-up approach). The top-down insights of relevance to the debate about the differences 107 approach, also called genome-driven cell engineering or the minimal cell between cell organelles and endosymbionts85. project108, is expected to provide methods for fabricating engineered cells, which will have a big impact both on biotechnology and on our basic understanding of Conclusions and future directions living systems109. The first attempt to define a minimal genome based on comparative genomics used The advent of genomics has significantly contributed the genomes of two human parasitic bacteria: Haemophilus influenzae and to reinforcing the view of symbiosis as a widespread genitalium110. Owing to their parasitic lifestyles, these two bacteria have biological phenomenon, especially through the genome reduced genomes when compared with their closest free-living relatives. The sequencing of uncultivable organisms. Symbiosis is a analysis led to the proposal of a minimal gene set composed of just 256 genes, most dynamic process in which the prokaryotic symbiont, of them involved in genetic-information storage and processing, protein while providing the host with new metabolic capa- chaperoning and a limited metabolic capability. Later, a combined study of all bilities, experiences many genotypic and phenotypic published research using computational or experimental methods, including the changes compared with its free-living relatives in order comparison of reduced genomes from insect endosymbionts, was used to define to adapt to its new lifestyle, as revealed by the specific the minimal core of essential genes for a free-living bacterium thriving in a differences observed in the completely sequenced chemically rich environment111. The main difference between this study and previous efforts (see ref. 112 and references therein) was the emphasis on the functional genomes of several endosymbiotic bacteria. completeness of the minimal metabolism encoded by the proposed gene repertoire Many questions remain open for future research. On (involving 62 protein-coding genes out of a minimal set of 208 genes). This aspect has the host side, a better understanding of how these organ- been explored further, demonstrating the stoichiometric consistency of the minimal isms control and/or domesticate prokaryotic symbionts metabolic network, as well as providing insights into some of its architectural is needed. Genomic and transcriptomic studies of model properties, such as its size, clustering and robustness113. eukaryotic hosts that have well known and stable sym- It must be stressed that these latter studies present just one possible form of a biotic associations will be valuable. Until now, only the minimal metabolism. Metabolic complexity is ecologically dependent; it is a genome of B. malayi, the nematode host of W. pipiensis function of the chemical richness of the environment and the primary energy wBm, has been fully sequenced78, but the sequence of the source(s) available to the living system (FIG. 1). Different versions of minimal gene aphid A. pisum is on its way. Comparative analysis with sets exist, that is, different combinations of metabolic pathways that can perform the essential biological functions of self-maintenance, self-reproduction and model organisms that have already been sequenced, 86 evolution114 under the same external conditions. However, it is possible to define such as the nematode Caenorhabditis elegans and the 87 which functions should be performed in any living cell in a specified niche, and list D. melanogaster , will give new insights the genes that would be necessary to maintain such functions. In this context, into the changes that are experienced by the host after comparative and evolutionary genomics of endosymbionts provide insights into the the establishment of the symbiosis. On the bacterial different routes that endosymbionts have taken to solve the challenges of their own side, comparative studies of the molecular processes maintenance under the diverse selective pressures and environmental constraints governing the nature of symbiotic associations, either that are the result of different host lifestyles. mutualistic or parasitic, will provide important clues to understand why microorganisms evolve toward one lifestyle or the other. The genome degeneration process seen in endosymbi- Symbiosis is not always a matter of two organisms: onts can also be accompanied by massive gene transfer to we are gaining increasing evidence that well-established the host cell nucleus82. In order for a true cell organelle prokaryotic endosymbionts also maintain some meta- to be established, several processes must be acquired: new bolic interplay with new facultative symbionts. Such gene-regulatory mechanisms; specific target sequences relationships could finish with the extinction of the for proteins that are encoded by the host nucleus but P‑endosymbiont (as described in insects from the fam- function in the symbiont; and a protein import appa- ily Dryopthoridae, to which Sitophilus spp. belongs56, or ratus that functions at the symbiont surface. These the Buchnera–Serratia consortium in cedar aphids21), processes occurred in the α‑proteobacterial ancestor or the symbiotic consortia could continue over time of mitochondria and in the cyanobacterial ances- (as described in the Baumannia–Sulcia consortium in 15 Minimal genome tor that gave rise to chloroplast lineages. Is there any sharpshooters ). Metagenomics and systems biology The smallest set of genes that system that is currently en route from endosymbiont are new approaches that can be used in the analysis is necessary and sufficient to to organelle? At present, the smallest sequenced endo- of complex symbiotic consortia. For example, the sustain a living cell in the most symbiont genome belongs to C. ruddii, which only metagenomes of sponge microbial communities have favourable conditions; that is, in the presence of adequate codes for approximately 180 proteins. However, a care- been shown to contain genes and gene clusters for the nutrients and in the absence ful analysis revealed that many genes that are involved biosynthesis of biologically active natural products. of stress factors. in essential functions are absent. It remains to be There is evidence to suggest that a mutually beneficial

nature reviews | genetics advance online publication | 227 © 2008 Nature Publishing Group

REVIEWS

Synthetic biology relationship exists, at least between some of the bacte- new insights into symbiotic associations, including 88 The design and fabrication of ria and the themselves . Another interesting those that are highly relevant to biotechnology and artificial biological systems, example is the symbiotic gut microbiome of mammals, biomedicine. with the aim of either in which we are starting to have a clearer picture of the Finally, synthetic biology will benefit from the new optimizing their performance in the context of their role of the symbiotic microbial consortia in the meta- insights gained by comparative genomic analyses 89 technological utility or of bolic phenotype of the mammalian host . In this case, of extremely reduced genomes (BOX 2). This type of deepening our understanding an extensive microbial–mammalian co-metabolism research makes important contributions to the concep- of the naturally occurring exists. As new complete genomes of eukaryote hosts tually intriguing and biologically fundamental question organisms. become available, we will be able to make new meta- of the minimal genomic basis for sustaining cellular life. bolic inferences to understand the interplay between Many efforts in this area have attracted a great deal of the host and its associated . Genomics and public attention regarding both biosecurity risks and metagenomics are becoming essential tools to provide potential commercialization.

1. Hypsa, V. & Dale, C. In vitro culture and phylogenetic 20. Nakagawa, S. et al. Deep-sea vent ε-proteobacterial 37. McGraw, E. A. & O’Neill, S. L. Wolbachia pipientis: analysis of ‘Candidatus Arsenophonus triatominarum’, genomes provide insights into emergence of intracellular infection and pathogenesis in Drosophila. an intracellular bacterium from the triatomine bug, pathogens. Proc. Natl Acad. Sci. USA 104, Curr. Opin. Microbiol. 7, 67–70 (2004). Triatoma infestans. Int. J. Syst. Bacteriol. 47, 12146–12150 (2007). 38. Welburn, S. C. & Maudlin, I. Tsetse–trypanosome 1140–1144 (1997). 21. Pérez-Brocal, V. et al. A small microbial genome: interactions: rites of passage. Parasitol. Today 15, 2. Dale, C. & Maudlin, I. Sodalis gen. nov. and the end of a long symbiotic relationship? Science 314, 399–403 (1999). Sodalis glossinidius sp. nov., a microaerophilic 312–313 (2006). 39. Moran, N. A. Symbiosis as an adaptive process and secondary endosymbiont of the tsetse fly Glossina This paper, together with reference 16, shows that source of phenotypic complexity. Proc. Natl Acad. Sci. morsitans morsitans. Int. J. Syst. Bacteriol. 49, the genomes of B. aphidicola BCc and C. ruddii USA 104 (Suppl 1), 8627–8633 (2007). 267–275 (1999). represent the lower limits of genome size in insect 40. Gomez-Valero, L. et al. Coexistence of Wolbachia with 3. Dale, C., Beeton, M., Harbison, C., Jones, T. & endosymbionts. Further research in these systems Buchnera aphidicola and a secondary symbiont in the Pontes, M. Isolation, pure culture, and might shed light on the evolutionary fate of aphid Cinara cedri. J. Bacteriol. 186, 6626–6633 characterization of ‘Candidatus Arsenophonus endosymbionts. (2004). arthropodicus’, an intracellular secondary 22. Toh, H. et al. Massive genome erosion and functional 41. Takiya, D. M., Tran, P. L., Dietrich, C. H. & Moran, N. A. endosymbiont from the hippoboscid louse fly adaptations provide insights into the symbiotic Co-cladogenesis spanning three phyla: leafhoppers Pseudolynchia canariensis. Appl. Environ. lifestyle of Sodalis glossinidius in the tsetse host. (Insecta: : Cicadellidae) and their dual Microbiol. 72, 2997–3004 (2006). Genome Res. 16, 149–156 (2006). bacterial symbionts. Mol. Ecol. 15, 4175–4191 (2006). 4. Darby, A. C., Chandler, S. M., Welburn, S. C. & 23. McCutcheon, J. P. & Moran, N. A. Parallel genomic 42. Woyke, T. et al. Symbiosis insights through Douglas, A. E. Aphid-symbiotic bacteria cultured evolution and metabolic interdependence in an metagenomic analysis of a microbial consortium. in insect cell lines. Appl. Environ. Microbiol. 71, ancient symbiosis. Proc. Natl Acad. Sci. USA 104, Nature 443, 950–955 (2006). 4833–4839 (2005). 19392–19397 (2007). This paper and reference 15 are two pioneering 5. Sagan, L. On the origin of mitosing cells. J. Theor. Biol. 24. Dale, C. & Moran, N. A. Molecular interactions works on the application of genomic analysis to 14, 255–274 (1967). between bacterial symbionts and their hosts. uncultivable microbial consortia, which opened up 6. Sapp, J. Evolution by Association. A History of Cell 126, 453–465 (2006). a new field of research on symbiosis. Symbiosis (Oxford University Press, New York, 1994). 25. Baumann, P. Biology of bacteriocyte-associated 43. Giere, O. & Erséus, C. and new bacterial This book is the first historical report on the endosymbionts of plant sap-sucking insects. symbioses of gutless marine Tubificidae (Annelida, symbiosis concept and its central role in shaping Annu. Rev. Microbiol. 59, 155–189 (2005). Oligochaeta) from the island of Elba (Italy). contemporary evolutionary ideas about the origin 26. Russell, J. A., Latorre, A., Sabater-Munoz, B., Org. Divers. Evol. 2, 289–297 (2002). and diversification of life. Moya, A. & Moran, N. A. Side-stepping secondary 44. Wernegreen, J. J. Genome evolution in bacterial 7. de Duve, C. The origin of eukaryotes: a reappraisal. symbionts: widespread horizontal transfer across endosymbionts of insects. Nature Rev. Genet. 3, Nature Rev. Genet. 8, 395–403 (2007). and beyond the Aphidoidea. Mol. Ecol. 12, 850–861 (2002). 8. Margulis, L. Symbiosis in Cell Evolution. Microbial 1061–1075 (2003). 45. Moran, N. A. Accelerated evolution and Muller’s Communities in the Archaean and Proterozoic Eons 27. Chen, D.‑Q., Montllor, C. B. & Purcell, A. H. Fitness rachet in endosymbiotic bacteria. Proc. Natl Acad. Sci. (W. H. Freeman and Co., New York, 1993). effects of two facultative endosymbiotic bacteria on USA 93, 2873–2878 (1996). 9. Shigenobu, S., Watanabe, H., Hattori, M., Sakaki, Y. & the pea aphid, Acyrthosiphon pisum, and the blue 46. Itoh, T., Martin, W. & Nei, M. Acceleration of genomic Ishikawa, H. Genome sequence of the endocellular alfalfa aphid, A. kondoi. Entomol. Exp. Appl. 95, evolution caused by enhanced mutation rate in bacterial symbiont of aphids Buchnera sp. APS. 315–323 (2000). endocellular symbionts. Proc. Natl Acad. Sci. USA 99, Nature 407, 81–86 (2000). 28. Montllor, C. B., Maxmen, A. & Purcell, A. H. 12944–12948 (2002). 10. Tamas, I. et al. 50 million years of genomic stasis in Facultative bacterial endosymbionts benefit pea 47. Wernegreen, J. J. For better or worse: genomic endosymbiotic bacteria. Science 296, 2376–2379 aphids Acyrthosiphon pisum under heat stress. consequences of intracellular mutualism and parasitism. (2002). Ecol. Entomol. 27, 189–195 (2002). Curr. Opin. Genet. Dev. 15, 572–583 (2005). 11. Akman, L. et al. Genome sequence of the endocellular 29. Russell, J. A. & Moran, N. A. Costs and benefits 48. Rocha, E. P. & Danchin, A. Base composition bias obligate symbiont of tsetse flies, Wigglesworthia of symbiont infection in aphids: variation among might result from competition for metabolic resources. glossinidia. Nature Genet. 32, 402–407 (2002). symbionts and across temperatures. Proc. Biol. Sci. Trends Genet. 18, 291–294 (2002). 12. van Ham, R. C. et al. Reductive genome evolution in 273, 603–610 (2006). 49. Fares, M. A., Ruiz-Gonzalez, M. X., Moya, A., Elena, S. F. Buchnera aphidicola. Proc. Natl Acad. Sci. USA 100, 30. Oliver, K. M., Moran, N. A. & Hunter, M. S. Variation & Barrio, E. Endosymbiotic bacteria: GroEL buffers 581–586 (2003). in resistance to parasitism in aphids is due to against deleterious mutations. Nature 417, 398 (2002). 13. Gil, R. et al. The genome sequence of Blochmannia symbionts not host genotype. Proc. Natl Acad. Sci. 50. Moya, A., Latorre, A., Sabater-Munoz, B. & Silva, F. J. floridanus: Comparative analysis of reduced genomes. USA 102, 12795–12800 (2005). Comparative molecular evolution of primary Proc. Natl Acad. Sci. USA 100, 9388–9393 (2003). 31. Oliver, K. M., Russell, J. A., Moran, N. A. & (Buchnera) and secondary symbionts of aphids 14. Degnan, P. H., Lazarus, A. B. & Wernegreen, J. J. Hunter, M. S. Facultative bacterial symbionts in aphids based on two protein-coding genes. J. Mol. Evol. 55, Genome sequence of Blochmannia pennsylvanicus confer resistance to parasitic wasps. Proc. Natl Acad. 127–137 (2002). indicates parallel evolutionary trends among bacterial Sci. USA 100, 1803–1807 (2003). 51. Rispe, C., Delmotte, F., van Ham, R. C. & Moya, A. mutualists of insects. Genome Res. 15, 1023–1033 32. Scarborough, C. L., Ferrari, J. & Godfray, H. C. Mutational and selective pressures on codon and (2005). Aphid protected from pathogen by endosymbiont. amino acid usage in Buchnera, endosymbiotic bacteria 15. Wu, D. et al. Metabolic complementarity and Science 310, 1781 (2005). of aphids. Genome Res. 14, 44–53 (2004). genomics of the dual bacterial symbiosis of 33. Tsuchida, T., Koga, R. & Fukatsu, T. Host plant 52. Moran, N. A. & Mira, A. The process of genome sharpshooters. PLoS Biol. 4, e188 (2006). specialization governed by facultative symbiont. shrinkage in the obligate symbiont Buchnera 16. Nakabachi, A. et al. The 160-kilobase genome of the Science 303, 1989 (2004). aphidicola. Genome Biol. 2, RESEARCH0054 (2001). bacterial endosymbiont Carsonella. Science 314, 267 34. Simon, J. C. et al. Host-based divergence in 53. Moran, N. A. & Plague, G. R. Genomic changes (2006). populations of the pea aphid: insights from following host restriction in bacteria. Curr. Opin. 17. Newton, I. L. et al. The Calyptogena magnifica nuclear markers and the prevalence of facultative Genet. Dev. 14, 627–633 (2004). chemoautotrophic symbiont genome. Science 315, symbionts. Proc. Biol. Sci. 270, 1703–1712 54. Plague, G. R., Dunbar, H. E., Tran, P. L. & Moran, N. A. 998–1000 (2007). (2003). Extensive proliferation of transposable elements in 18. Kuwahara, H. et al. Reduced genome of the 35. Leonardo, T. E. & Muiru, G. T. Facultative symbionts heritable bacterial symbionts. J. Bacteriol. 190, thioautotrophic intracellular symbiont in a deep-sea are associated with host plant specialization in pea 777–779 (2007). clam, Calyptogena okutanii. Curr. Biol. 17, 881–886 aphid populations. Proc. Biol. Sci. 270 (Suppl 2), 55. Heddi, A., Charles, H., Khatchadourian, C., Bonnot, G. (2007). S209–S212 (2003). & Nardon, P. Molecular characterization of the 19. Foster, J. et al. The Wolbachia genome of Brugia 36. Leonardo, T. E. & Mondor, E. B. Symbiont modifies principal symbiotic bacteria of the weevil Sitophilus malayi: endosymbiont evolution within a human host life-history traits that affect gene flow. Proc. Biol. oryzae: a peculiar G+C content of an endocytobiotic pathogenic nematode. PLoS Biol. 3, e121 (2005). Sci. 273, 1079–1084 (2006). DNA. J. Mol. Evol. 47, 52–61 (1998).

228 | advance online publication www.nature.com/reviews/genetics © 2008 Nature Publishing Group

REVIEWS

56. Lefevre, C. et al. Endosymbiont phylogenesis in the 79. Kato, Y. & Komatsu, S. ASABF, a novel cysteine-rich 103. Rodriguez-Ezpeleta, N. et al. Toward resolving Dryophthoridae weevils: evidence for bacterial antibacterial peptide isolated from the nematode the eukaryotic tree: the phylogenetic positions replacement. Mol. Biol. Evol. 21, 965–973 (2004). Ascaris suum. Purification, primary structure, and of jakobids and cercozoans. Curr. Biol. 17, 57. Wu, M. et al. Phylogenomics of the reproductive molecular cloning of cDNA. J. Biol. Chem. 271, 1420–1425 (2007). parasite Wolbachia pipientis wMel: a streamlined 30493–30498 (1996). 104. Lecointre, G. & Le Guyader, H. The Tree of Life. genome overrun by mobile genetic elements. PLoS 80. Braendle, C. et al. Developmental origin and evolution A Phylogenetic Classification (Harvard University Biol. 2, e69 (2004). of bacteriocytes in the aphid–Buchnera symbiosis. Press Reference Library) (Belknap Press, 58. Silva, F. J., Latorre, A. & Moya, A. Genome size PLoS Biol. 1, e21 (2003). Cambridge, 2006). reduction through multiple events of gene disintegration This paper was the first study of the development 105. Delsuc, F., Brinkmann, H., Chourrout, D. & Philippe, H. in Buchnera APS. Trends Genet. 17, 615–618 (2001). and evolution of a bacteriocyte containing Tunicates and not cephalochordates are the closest 59. Gomez-Valero, L., Silva, F. J., Simon, J. C. & Latorre, A. endosymbiotic bacteria. living relatives of vertebrates. Nature 439, 965–968 Genome reduction of the aphid endosymbiont 81. Lwoff, A. L’évolution physiologique. Étude des pertes (2006). Buchnera aphidicola in a recent evolutionary time des fonctions chez les microorganismes (Hermann, 106. Bourlat, S. J. et al. Deuterostome phylogeny scale. Gene 389, 87–95 (2007). Paris, 1944). reveals monophyletic chordates and the new 60. Gomez-Valero, L., Latorre, A. & Silva, F. J. The 82. Timmis, J. N., Ayliffe, M. A., Huang, C. Y. & Martin, W. phylum Xenoturbellida. Nature 444, 85–88 evolutionary fate of nonfunctional DNA in the bacterial Endosymbiotic gene transfer: organelle genomes (2006). endosymbiont Buchnera aphidicola. Mol. Biol. Evol. forge eukaryotic chromosomes. Nature Rev. Genet. 5, 107. O’Malley, M., Powell, A., Davies, J. & Calvert, J. 21, 2172–2181 (2004). 123–135 (2004). Knowledge-making distinctions in synthetic biology. 61. Tamames, J. et al. The frontier between cell and 83. Martin, W. Gene transfer from organelles to the Bioessays 30, 57–65 (2007). organelle: genome analysis of Candidatus Carsonella nucleus: frequent and in big chunks. Proc. Natl Acad. 108. Luisi, P. L. Chemical aspects of synthetic biology. ruddii. BMC Evol. Biol. 7, 181 (2007). Sci. USA 100, 8612–8614 (2003). Chem. Biodivers. 4, 603–621 (2007). 62. Wilson, A. C. et al. A dual-genome microarray for the 84. Hotopp, J. C. et al. Widespread lateral gene transfer 109. Peretó, J. & Català, J. The renaissance of synthetic pea aphid, Acyrthosiphon pisum, and its obligate from intracellular bacteria to multicellular eukaryotes. biology. Biol. Theor. 2, 128–130 (2007). bacterial symbiont, Buchnera aphidicola. BMC Science 317, 1753–1756 (2007). 110. Mushegian, A. R. & Koonin, E. V. A minimal gene set Genomics 7, 50 (2006). 85. Bhattacharya, D., Archibald, J. M., Weber, A. P. M. & for cellular life derived by comparison of complete 63. Wilcox, J. L., Dunbar, H. E., Wolfinger, R. D. & Reyes-Prieto, A. How do endosymbionts become bacterial genomes. Proc. Natl Acad. Sci. USA 93, Moran, N. A. Consequences of reductive evolution organelles? Understanding early events in plastid 10268–10273 (1996). for gene expression in an obligate endosymbiont. evolution. Bioessays 29, 1239–1246 (2007). 111. Gil, R., Silva, F. J., Pereto, J. & Moya, A. Mol. Microbiol. 48, 1491–1500 (2003). 86. C. elegans Sequencing Consortium. Genome sequence Determination of the core of a minimal 64. Koga, R., Tsuchida, T. & Fukatsu, T. Changing of the nematode Caenorhabditis elegans: a platform bacterial gene set. Microbiol. Mol. Biol. Rev. 68, partners in an obligate symbiosis: a facultative for investigating biology. Science 282, 2012–2018 518–537 (2004). endosymbiont can compensate for loss of the essential (1998). This Review proposes a minimal genetic endosymbiont Buchnera in an aphid. Proc. Biol. Sci. 87. Myers, E. W. et al. A whole-genome assembly of repertoire for a hypothetical heterotrophic 270, 2543–2550 (2003). Drosophila. Science 287, 2196–2204 (2000). bacterium, based on comparative genomics 65. Ren, Q. & Paulsen, I. T. Comparative analyses of 88. Kennedy, J., Marchesi, J. R. & Dobson, A. D. of endosymbionts, experimentally determined fundamental differences in membrane transport Metagenomic approaches to exploit the gene essentiality and the functions of a coherent capabilities in prokaryotes and eukaryotes. biotechnological potential of the microbial consortia metabolism. PLoS Comput. Biol. 1, e27 (2005). of marine sponges. Appl. Microbiol. Biotechnol. 75, 112. Klasson, L. & Andersson, S. G. Evolution of 66. Fiala-Médioni, A. & Métivier, C. Ultrastructure of the 11–20 (2007). minimal‑gene‑sets in host-dependent bacteria. gill of the hydrothermal vent bivalve Calyptogena 89. Martin, F. P. et al. A top-down systems biology view Trends Microbiol. 12, 37–43 (2004). magnifica, with a discussion of its nutrition. Mar. Biol. of microbiome–mammalian metabolic interactions 113. Gabaldon, T. et al. Structural analyses of 90, 215–222 (1986). in a mouse model. Mol. Syst. Biol. 3, 112 (2007). a hypothetical minimal metabolism. Philos. Trans. 67. Baumann, P., Moran, N. A. & Baumann, L. in The 90. Paracer, S. & Ahmadjian, V. Symbiosis. R. Soc. Lond., B, Biol. Sci. 362, 1751–1762 Prokaryotes (ed. Dworkin, M.) 1–67 (Springer, New An Introduction to Biological Associations (2007). York, 2000). (Oxford University Press, New York, 2000). 114. Ruiz-Mirazo, K., Pereto, J. & Moreno, A. 68. Fiala-Médioni, A., Michalski, J. C., Jollès, J., Alonso, C. 91. Kneip, C., Lockhart, P., Voss, C. & Maier, U. G. A universal definition of life: autonomy and & Montreuil, J. Lysosomic and lysozyme activities in Nitrogen fixation in eukaryotes — new models for open-ended evolution. Orig. Life Evol. Biosph. 34, gills of bivalves from deep hydrothermal vents. symbiosis. BMC Evol. Biol. 7, 55 (2007). 323–346 (2004). C. R. Acad. Sci. Paris 317, 239–244 (1994). 92. Schink, B. Energetics of syntrophic cooperation in 69. Nakabachi, A. et al. Transcriptome analysis of the methanogenic degradation. Microbiol. Mol. Biol. Rev. Acknowledgements aphid bacteriocyte, the symbiotic host cell that harbors 61, 262–280 (1997). Financial support was provided by grants GV/2007/050 from an endocellular mutualistic bacterium, Buchnera. Proc. 93. Stewart, F. J., Newton, I. L. & Cavanaugh, C. M. Generalitat Valenciana, Spain, to R.G and BFU2006‑06003 Natl Acad. Sci. USA 102, 5477–5482 (2005). Chemosynthetic endosymbioses: adaptations to from Ministerio de Educación y Ciencia (MEC), Spain, to A.L. 70. Dale, C., Young, S. A., Haydon, D. T. & Welburn, S. C. oxic–anoxic interfaces. Trends Microbiol. 13, R.G is a recipient of a contract in the ‘Ramón y Cajal’ pro- The insect endosymbiont Sodalis glossinidius utilizes a 439–448 (2005). gramme from the MEC, Spain. Our thanks to H. Escrivá type III secretion system for cell invasion. Proc. Natl 94. Minic, Z. & Herve, G. Biochemical and enzymological (CNRS, Banyuls sur Mer), P. López (CNRS, Orsay) and D. Acad. Sci. USA 98, 1883–1888 (2001). aspects of the symbiosis between the deep-sea Moreira (CNRS, Orsay) for their advice in the design of the 71. Young, G. M., Schmiel, D. H. & Miller, V. L. A new tubeworm Riftia pachyptila and its bacterial eukaryotic phylogenetic tree in BOX 1. pathway for the secretion of virulence factors by endosymbiont. Eur. J. Biochem. 271, 3093–3102 bacteria: The flagellar export apparatus functions as a (2004). protein-secretion system. Proc. Natl Acad. Sci. USA 95. Zientz, E., Dandekar, T. & Gross, R. Metabolic DATABASES 96, 6456–6461 (1999). interdependence of obligate intracellular bacteria and Entrez Genome Project: 72. Maezawa, K. et al. Hundreds of flagellar basal bodies their insect hosts. Microbiol. Mol. Biol. Rev. 68, http://www.ncbi.nlm.nih.gov/sites/entrez?db=genomeprj cover the cell surface of the endosymbiotic bacterium 745–770 (2004). Acyrthosiphon pisum | Baumannia cicadellinicola | Buchnera aphidicola sp. strain APS. J. Bacteriol. 188, 96. Croft, M. T., Lawrence, A. D., Raux-Deery, E., Blochmannia floridanus | Brugia malayi | Buchnera aphidicola | 6539–6543 (2006). Warren, M. J. & Smith, A. G. Algae acquire vitamin Caenorhabditis elegans | Carsonella ruddii | Chromobacterium 73. McFall-Ngai, M. Adaptive immunity: care for the B12 through a symbiotic relationship with bacteria. violaceum | Drosophila melanogaster | Sodalis glossinidius | community. Nature 445, 153 (2007). Nature 438, 90–93 (2005). Sulcia muelleri | Wigglesworthia glossinidia | 74. Hoffmann, J. A. The immune response of Drosophila. 97. Buchner, P. Endosymbiosis of Animals with Plant Wolbachia pipientis Nature 426, 33–38 (2003). Microorganisms (Interscience, New York, 1965). 75. Heddi, A. et al. Molecular and cellular profiles of 98. Taylor, M. W., Radax, R., Steger, D. & Wagner, M. FURTHER INFORMATION insect bacteriocytes: mutualism and harm at the initial Sponge-associated microorganisms: evolution, Andrés Moya’s homepage: evolutionary step of symbiogenesis. Cell. Microbiol. 7, ecology, and biotechnological potential. Microbiol. www.uv.es/cavanilles/personal/Andres_Moya 293–305 (2005). Mol. Biol. Rev. 71, 295–347 (2007). Juli Peretó’s homepage: 76. Zaidman-Remy, A. et al. The Drosophila amidase 99. Backhed, F., Ley, R. E., Sonnenburg, J. L., Peterson, D. A. http://www.uv.es/cavanilles/personal/Juli_Pereto PGRP-LB modulates the immune response to bacterial & Gordon, J. I. Host–bacterial mutualism in the Rosario Gil’s homepage: infection. Immunity 24, 463–473 (2006). human intestine. Science 307, 1915–1920 (2005). www.uv.es/cavanilles/personal/Rosario_Gil 77. Anselme, C., Vallier, A., Balmand, S., Fauvarque, M.‑O. 100. Stingl, U., Radek, R., Yang, H. & Brune, A. Amparo Latorre’s homepage: http://www.uv.es/cavanilles/ & Heddi, A. Host PGRP gene expression and bacterial ‘Endomicrobia’: cytoplasmic symbionts of termite gut personal/Amparo_Latorre/amparo_latorre.htm release in endosymbiosis of the weevil Sitophilus protozoa form a separate phylum of prokaryotes. Cavanilles Institute web page: www.uv.es/cavanilles zeamais. Appl. Environ. Microbiol. 72, 6766–6772 Appl. Environ. Microbiol. 71, 1473–1479 (2005). Genomes OnLine Database: http://www.genomesonline.org (2006). 101. Fieseler, L., Quaiser, A., Schleper, C. & Hentschel, U. Microbial Genome Database for Comparative Analysis: 78. Ghedin, E. et al. Draft genome of the filarial nematode Analysis of the first genome fragment from the marine http://mbgd.genome.ad.jp parasite Brugia malayi. Science 317, 1756–1760 sponge-associated, novel candidate phylum The Sanger Institute Glossina Genome Project: (2007). Poribacteria by environmental genomics. http://www.sanger.ac.uk/Projects/G_morsitans The complete genome sequence of the nematode Environ. Microbiol. 8, 612–624 (2006). B. malayi, together with the previously reported 102. Moreira, D. et al. Global eukaryote phylogeny: SUPPLEMENTARY INFORMATION genome sequence of its Wolbachia endosymbiont, Combined small- and large-subunit ribosomal DNA See online article: S1 (figure) | S2 (table) will enable a systems approach for the study of trees support monophyly of Rhizaria, Retaria and All links are active in the online pdf host–symbiont interactions. Excavata. Mol. Phylogenet. Evol. 44, 255–266 (2007).

nature reviews | genetics advance online publication | 229 © 2008 Nature Publishing Group