<<

Critical Reviews in Microbiology, 31:101–135, 2005 Copyright c Taylor & Francis Inc. ISSN: 1040-841X print / 1549-7828 online DOI: 10.1080/10408410590922393

Protein Signatures Distinctive of Alpha and Its Subgroups and a Model for α-Proteobacterial Evolution

Radhey S. Gupta Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada

on the distribution patterns of these signatures, it is now possi- Alpha (α)proteobacteria comprise a large and metabolically ble to logically deduce a model for the branching order among diverse group. No biochemical or molecular feature is presently α-proteobacteria, which is as follows: Rickettsiales → Rhodo- known that can distinguish these from other groups. The spirillales-Sphingomonadales → Rhodobacterales-Caulobacterales evolutionary relationships among this group, which includes nu- → Rhizobiales (Rhizobiaceaea-Brucellaceae-Phyllobacteriaceae, merous pathogens and agriculturally important microbes, are also and Bradyrhizobiaceae). The deduced branching order is also con- not understood. Shared conserved inserts and deletions (i.e., indels sistent with the topologies in the 16 rRNA and other phylogenetic or signatures) in molecular sequences provide a powerful means trees. Signature sequences in a number of other proteins provide ev- for identification of different groups in clear terms, and for evo- idence that α-proteobacteria is a late branching taxa within Bacte- lutionary studies (see www.bacterialphylogeny.com). This review ria, which branched after the δ,-subdivisions but prior to the β,γ- describes, for the first time, a large number of conserved indels in proteobacteria. The shared presence of many of these signatures in broadly distributed proteins that are distinctive and unifying char- the mitochondrial (eukaryotic) homologs also provides evidence of acteristics of either all α-proteobacteria, or many of its constituent the α-proteobacterial ancestry of mitochondria. subgroups (i.e., orders, families, etc.). These signatures were iden- tified by systematic analyses of proteins found in the Rickettsia Keywords Bacterial Phylogeny; Alpha Proteobacteria Trees; Pro- prowazekii (RP) genome. Conserved indels that are unique to α- tein Signatures; Rickettsiales; Rhodobacterales; Branch- proteobacteria are present in the following proteins: Cytochrome ing Order; Mitochondrial Origin; Rickettsia prowazekii; c oxidase assembly protein Ctag, PurC, DnaB, ATP synthase α- Rhizobiales subunit, exonuclease VII, prolipoprotein phosphatidylglycerol transferase, RP-400, FtsK, puruvate phosphate dikinase, cyto- For personal use only. chrome b, MutY,and homoserine dehydrogenase. The signatures in succinyl-CoA synthetase, cytochrome oxidase I, alanyl-tRNA syn- INTRODUCTION thetase, and MutS proteins are found in all α-proteobacteria, ex- The alpha (α) proteobacteria comprise an important group cept the Rickettsiales, indicating that this group has diverged prior to the introduction of these signatures. A number of proteins con- within Bacteria, which has contributed seminally to many as- tain conserved indels that are specific for Rickettsiales (XerD inte- pects of the history of life (Margulis 1970; Kersters et al. 2003). grase and leucine aminopeptidase), Rickettsiaceae (Mfd, ribosomal It is now established that mitochondria, which enable eukary- protein L19, FtsZ, Sigma 70 and exonuclease VII), or Anaplasmat- otic cells to produce energy via oxidative phosphorylation, are aceae (Tgt and RP-314), and they distinguish these groups from the result of endosymbitotic capture of an α-proteobacteria by all others. Signatures in DnaA, RP-057, and DNA ligase A are commonly shared by various Rhizobiales, Rhodobacterales, and the primitive eukaryotic cell (Margulis 1970; Falah & Gupta Caulobacter, suggesting that these groups shared a common an- 1994; Viale & Arakaki 1994; Andersson et al. 1998; Gray et al. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University cestor exclusive of other α-proteobacteria. A specific relationship 1999; Karlin & Brocchieri 2000; Emelyanov 2001a; Esser et al. between Rhodobacterales and Caulobacter is indicated by a large 2004). There is also strong evidence indicating that the ances- insert in the Asn-Gln amidotransferase. The Rhizobiales group tral eukaryotic cell itself may have originated via a fusion, or of species are distinguished from others by a large insert in the α Trp-tRNA synthetase. Signature sequences in a number of other long-term symbiotic association, event between one or more - proteins (viz. oxoglutarate dehydogenase, succinyl-CoA synthase, proteobacteria and an archaebacteria (or Archaea) (Gupta et al. LytB, DNA gyrase A, LepA, and Ser-tRNA synthetase) serve to 1994; Lake & Rivera 1994; Gupta & Golding 1996; Margulis distinguish the Rhizobiaceae, Brucellaceae, and Phyllobacteriaceae 1996; Gupta 1998; Martin & Muller 1998; Ribeiro & Golding families from Bradyrhizobiaceae and Methylobacteriaceae. Based 1998; Andersson et al. 1998; Karlin et al. 1999; Lang et al. 1999; Kurland & Andersson 2000; Emelyanov 2001a, 2003b). The symbiosis between α-proteobacteria (viz. Rhizobiaceae species) and plant root nodules plays a central role in the fixation of at- Received 20 December 2004; accepted 8 December 2005. Address correspondence to Radhey S. Gupta, Department of Bio- mospheric nitrogen by plants (Sadowsky & Graham 2000; Van chemistry and Biomedical Sciences, McMaster University, Hamilton, Sluys et al. 2002; Kersters et al. 2003; Sawada et al. 2003). Ad- Ontario, Canada L8N 3Z5. E-mail: [email protected] ditionally, many α-proteobacterial species (viz. Rickettsiales, 101 102 R. S. GUPTA

Brucella, Bartonella) are adapted to intracellular life style and ettsia conorii, Ri. prowazekii, Ri. typhi, and Wolbachia sp. are major human and animal pathogens (Moreno & Moriyon (Drosophila endosymbiont)) have become available (Anders- 2001; Kersters et al. 2003; Yu & Walker 2003). son et al. 1998; Kaneko et al. 2000, 2002; Nierman et al. 2001; The α-proteobacteria exhibit enormous diversity in terms of Wood et al. 2001; Ogata et al. 2001; Galibert et al. 2001; their morphological and metabolic characteristics and they in- DelVecchio et al. 2002; Paulsen et al. 2002; Larimer et al. 2004; clude numerous phototrophs, chemolithotrophs and chemoorgan- McLeod et al. 2004). These provide valuable resources for iden- otrophs (Stackebrandt et al. 1988; De Ley 1992; Kersters et al. tifying novel molecular features that are likely distinctive char- 2003). This group also harbors all known aerobic photoheterotro- acteristics of α-proteobacteria and its various subgroups, and hic bacteria, which contain bacteriochlorophyll a,but are unable which may prove helpful in clarifying the evolutionary rela- to grow photosynthetically under anaerobic conditions (Yurkov tionships among them. This article, describes for the first time, & Beatty 1998). These bacteria are abundant in the upper layers a large number of conserved indels in widely distributed pro- of oceans (Kolber et al. 2001). The α-proteobacterial species are teins that are either uniquely shared by all α-proteobacteria, or presently recognized on the basis of their branching pattern in which are shared by only particular subgroups (i.e., families or the 16S rRNA trees, where they form a distinct clade within the orders) of this Class. These signatures provide novel and defini- proteobacterial phylum (Woese et al. 1984; Stackebrandt et al. tive molecular means for distinguishing α-proteobacteria and 1988; Olsen et al. 1994; Gupta 2000; Kersters et al. 2003). This many of its subgroups from all other bacteria. The distribution group has been given the rank of a Class or subdivision within of these signatures in different α-proteobacteria also enables one the Proteobacteria phylum (Stackebrandt et al. 1988; Murray to logically deduce the relative branching orders and interrela- et al. 1990; De Ley 1992; Stackebrandt 2000; Ludwig & Klenk tionships among different α-proteobacteria subgroups. Phylo- 2001; Garrity & Holt 2001; Kersters et al. 2003). Other than genetic studies have also been carried out based on 16S rRNA their distinct branching in the 16S rRNA or other phylogenetic and a number of proteins sequences. Based on this informa- trees (De Ley 1992; Viale et al. 1994; Eisen 1995; Gupta et al. tion, a detailed model for the evolutionary relationships among 1997; Gupta 2000; Stepkowski et al. 2003; Emelyanov 2003a; α-proteobacteria has been developed. Battistuzzi et al. 2004), there is no reliable phenotypic or molec- ular characteristic known at present that is uniquely shared by PHYLOGENETIC TREE FOR ALPHA PROTEOBACTERIA different α-proteobacteria which distinguish them from all other BASED ON 16S rRNA SEQUENCES bacteria (Kersters et al. 2003). On the basis of 16S rRNA trees the Although α-Proteobacteria comprise a major group within α-proteobacteria have been divided into seven main subgroups Bacteria (Garrity & Holt 2001) with >5200 sequences in the or orders (viz. Caulobacterales, Rhizobiales, Rhodobacterales, Ribosomal Database Project II (Maidak et al. 2001), there is no Rhodospirillales, Rickettsiales, Sphingomondales, and Parvu- detailed review or article that discusses the evolutionary relation-

For personal use only. larucales) (Maidak et al. 2001; Garrity & Holt 2001; Kersters ships among this group (i.e. indicating the relationships among et al. 2003). However, the branching order and interrelation- different subgroups and orders within this Class) (Kersters et al. ships among these subgroups are presently not resolved and no 2003). Most of the articles on α-Proteobacteria are aimed at clar- distinctive features that can distinguish these groups from each ifying the phylogenetic placement of particular species at either other are known (Kersters et al. 2003). genus or family levels (Dumler et al. 2001; Gaunt et al. 2001; In our recent work, we have been utilizing a new approach Young et al. 2001; Taillardat-Bisch et al. 2003; van Berkum based on identification of conserved indels (also referred to as et al. 2003; Broughton 2003; Stepkowski et al. 2003; Sawada signatures) in proteins sequences that is proving very useful in et al. 2003). The second edition of Bergey’s Manual (Ludwig & identifying different groups within Bacteria in clear molecu- Klenk 2001) and the third edition of Prokaryotes (Kersters et al. lar terms and clarifying evolutionary relationships among them 2003) present condensed phylogenetic trees for the α-Proteo-

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University (see www.bacterialphylogeny.com) (Gupta 1998, 2003, 2004; bacteria (or Proteobacteria) as a whole to indicate presumed Griffiths & Gupta 2002, 2004a; Gupta & Griffiths 2002; Gupta relationships among different subgroups comprising this sub- et al. 2003). We have previously described many protein sig- division. However, most of these trees do not show any boot- natures that are distinctive characteristics of the proteobacte- strap scores or even individual species (Ludwig & Klenk 2001; rial phylum and which also provided information regarding its Kersters et al. 2003), making it difficult to get a clear sense of the branching position relative to other bacterial groups (Gupta 1998, reliability of the observed (or indicated) relationships. Hence, 2000; Griffiths & Gupta 2004b). This review focuses on ex- as an initial step toward understanding the evolutionary rela- amining the evolutionary relationships among α-proteobacteria tionships among α-Proteobacteria, a phylogenetic tree based on using the signature sequence as well as traditional phyloge- 16S rRNA sequences was constructed from 65 α-proteobacterial netic approaches. In recent years, complete genomes of sev- species, covering its major subgroups. The resulting neighbor- eral α-proteobacteria (viz. Bartonella henselae, Bart. quintana, joining bootstrapped consensus tree is presented in Figure 1. Bradyrhizobium japonicum, Brucella melitensis, Bru. suis, The tree shown was rooted using the 16S rRNA sequences from Caulobacter crescentus, Mesorhizobium loti, Sinorhizobium loti, epsilon proteobacteria, which show deeper branching than the Rhodopseudomonas palustris, Agrobacterium tumefaciens, Rick- α-subdivision in the rRNA as well as various other trees (Olsen PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 103 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 1. A neighbor-joining bootstrap consensus tree for α-proteobacteria based on 16S rRNA sequences. The tree was bootstrapped 100 times and bootstrap scores which were >60 are indicated on the nodes. The tree was rooted using H. pylori.However, the tree topologies was not altered on rooting with other deep branching bacteria (e.g., Aq. aeolicus). The groups of species corresponding to some of the main subgroups within α-proteobacteria are marked. ∗indicates anomalous branching in the tree. 104 R. S. GUPTA

et al. 1994; Viale et al. 1994; Eisen 1995; Gupta 1998). The A. Signature Sequences That are Common to All bootstrap scores for all nodes, which were >60 (out of 100) are α-Proteobacteria indicated on the tree. Signature sequences in the following proteins are uniquely In the resulting tree a number of different clades are either shared by different α-proteobacteria. Cytochrome c oxidase clearly (>90% bootstrap score) or reasonably well resolved. (CoxI) is an integral component of the respiratory chain in mi- These included the clades corresponding to group of species tochondria and various aerobic bacteria, and it serves as the which are recognized as major orders within the α-Proteobacteria terminal electron acceptor (Stryer 1995; Andersson et al. 1998; (Rhizobiales, Rhodospirillales, Caulobacterales, Sphingomon- Emelyanov 2003a). This membrane-associated complex requires adales, Rhodobacterales, and Rickettsiales) (Ludwig & Klenk the association of several protein subunits and the formation of 2001; Garrity & Holt 2001; Kersters et al. 2003). Within Rhi- many different metal centers. One of the proteins involved in zobiales, the Bradyrhizobiaceae family of species was clearly its assembly is Ctag (Cox11), which is required for the for- separated from some of the other families within this order mation of CuB and magnesium centers of Cox I (Hiser et al. (viz. Rhizobiaceae, Brucellaceae, and Phyllobacteriaceae) 2000). In the Ctag protein,a5aainsert in a conserved region is (Wang et al. 1998; Sadowsky & Graham 2000; Dumler et al. present in all α-proteobacteria, but not found in any other bacte- 2001; van Berkum et al. 2003; Stepkowski et al. 2003). Within ria (Figure 2). Within bacteria the homologs of this protein are α-Proteobacteria, the deepest branching was observed for the mainly restricted to α, β, and γ -subdivisions of proteobacteria. Rickettsiales group of species. Within the Rickettsiales, the Rick- Although a protein which carries out a similar function (also ettsia, and Orientia genera, which form part of the Rickttsi- known as Ctag) is present in gram-positive bacteria, it does not aceae family, were clearly resolved from the Anaplasmataceae show any sequence similarity to the proteobacterial homologs family comprised of Ehrlichia, Wolbachia, Anaplasma, and Ne- (Bengtsson et al. 2004). The observed insert in the Ctag protein orickettsia species (Dumler et al. 2001; Yu & Walker 2003). In is also present in various eukaryotic homologs, supporting their contrast to these well-resolved clades or relationships, various derivation from α-proteobacteria. nodes indicating the interrelationships among different orders Another conserved insert that is specific for α-proteobacteria had lower bootstrap scores (<60%), indicating that interrela- is present in the enzyme 5-phosphoribosyl-5-aminoimidazole- tionships among them were not resolved. The relationships ob- 4-N-succinocarboxamide (SAICAR or PurC) synthetase, which served here are very similar to those reported in earlier studies carries out the seventh step in the de novo purine biosynthetic (Olsen et al. 1994; Sadowsky & Graham 2000; Dumler et al. pathway (Hui & Morrison 1993; Stryer 1995). The enzyme is en- 2001; Yu et al. 2001; Kersters et al. 2003; Yu & Walker 2003; coded by the purC gene and it is broadly distributed in bacteria. Stepkowski et al. 2003). The tree shown here will serve as a A3aa insert in this protein is present in various α-proteobacteria useful reference for determining the evolutionary significance (Figure 3), but not in any other bacteria, indicating that it is a For personal use only. of various signature sequences. distinctive characteristic of the group. The eukaryotic homologs of the SAICAR synthetase do not contain this insert, but their SIGNATURE SEQUENCES DISTINCTIVE OF overall similarity in this region is limited (not shown). In ad- α-PROTEOBACTERIA dition to α-proteobacteria, a 2 aa insert is also present in this To identify conserved indels that might be distinctive of α- position in Magnetococcus sp. MC-1, suggesting that it may be Proteobacteria, or a particular groups of species within this distantly related to this group. The phylogenetic assignment of Class, multiple sequence alignments of all proteins that are found Magnetococcus sp. MC-1 is presently uncertain (Garrity & Holt in the genome of Ri. prowazekii (Andersson et al. 1998) were cre- 2001). ated using the CLUSTAL X program (Jeanmougin et al. 1998). Two other signatures that are specific for α-proteobacteria are These alignments were visually inspected for any conserved in- present in the replicative DNA helicase (DnaB) and the α subunit

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University dels that were mainly restricted to the α-proteobacterial species. of ATP synthase complex. DnaB helicase is a multifunctional The indels that we focussed on were generally of defined size enzyme involved in the DNA replication process (Soni et al. and they were present in the same position in a given protein. 2003). It interacts with a number of proteins involved in DNA The indels of interest were also required to be flanked on both replication and exhibits multiple enzymatic activities including sides by conserved regions to ensure that the sequence align- helicase, ATP hydrolysis and DNA binding. An insert of between ment in the region was reliable and that the indel under con- 8 and 14 aa is present in a conserved region of DnaB, which is sideration was not resulting from any alignment artefact (Gupta unique to various α-proteobacteria (Figure 4). Most of the Rhizo- 1998, 2000; Rokas & Holland 2000; Gupta & Griffiths 2002). biales as well as Rhodobacter and Caulobacter species are found The indels that appeared unique to other groups of bacteria, or to contain the 14 aa insert, whereas a smaller insert is present in which were present in only a single α-proteobacterial species, various Rickettsiales and certain other α-proteobacteria. In Mag. were not further investigated. This has led to identification of magnetotacticum, three different homologs of DnaB are found many conserved indels that are specific for α-proteobacteria or and they all contained the 14 aa insert. Based upon the fact that its subgroups. A brief description of these signatures as well as this insert is present in the same position in all α-proteobacteria, of the proteins in which they are found is given below. it is likely that it was introduced only once in a common ancestor PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 105 For personal use only.

FIG. 2. Partial sequence alignment for Cytochrome c oxidase assembly protein (Ctag) showinga5aainsert (boxed), which is specific for α-proteobacteria. Dashes in this sequence alignment as well as all others indicate identity with the amino acid on the top line. The position of this sequence in the proteinismarked

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University on the top. The accession numbers of various sequences are shown in the second column. Only representative sequences from different bacteria are shown. The identified insert is also present in various eukaryotic (mitochondrial) homologs indicating their derivation from an α-proteobacterial ancestor. The abbreviations used in the species names are listed at the end of this review.

of the group and that subsequent genetic changes led to the ob- is commonly present in all α-proteobacteria, but which is not served variation in its length. The DnaB homologs are not found found in any other proteobacterial species (Figure 5). Besides, in eukaryotic species. α-proteobacteria, inserts of variable lengths are also present in The synthesis of ATP in different organisms is carried out this position in various Actinobacteria and Bacteriodetes (not by F1F0ATP synthase, a multisubunit complex located in the shown). In phylogenetic tree based on ATP synthase (α), these cytoplasmic membrane of bacteria or inner membrane of mito- latter groups do not show any affinity for each other or to the α- chondria (Stryer 1995; Leyva et al. 2003). The F1 portion of this proteobacteria (Gupta 2004), indicating that these inserts have complex is a heteromer made up of five subunits, α, β, γ , δ and likely been introduced independently. The observed insert in  with the stoichiometry α3β3γδ. The α subunit of ATP syn- ATP synthase α is also present in various eukaryotic homologs thase contains an 8 aa insert in a highly conserved region that providing evidence of their α-proteobacterial ancestry. 106 R. S. GUPTA For personal use only.

FIG. 3. Partial sequence alignment of PurC (SAICAR synthetase) showinga3aainsert that is specific for various α-proteobacteria.

The enzyme exonuclease VII degrades single-stranded DNA supported by the phylogenetic analysis. The evolutionary stages bidirectionally and processively (Chase et al. 1986). In the large where different identified signatures have likely been introduced subunit of exonuclease VII, encoded by the xseA gene, a number in this gene/protein are marked on the tree (Figure 7). of useful signatures for α-proteobacteria are present. These sig- Another insert that is specific for α-proteobacteria is present natures includea3aainsert anda1aadeletion that are present in the enzyme prolipoprotein-phosphatidylglycerol (PLPG) trans- in all known α-proteobacterial homologs (Figure 6A), but not ferase, which carries out the first committed step in the pathway Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University found in any other groups of bacteria. In the same position where leading to synthesis of lipid modified proteins (Figure 6B) (Qi the 3 aa insert is found, an additional 3 aa insert is present in et al. 1995). The indicated 3 aa insert in PLPG- transferase is all α-proteobacteria except the Rickettsiales. Elsewhere in this unique to α-proteobacteria and not found in other bacteria. The protein, a 1 aa deletion (at position 141 in the Ri. prowazekii se- homologs of exonuclease VII and PLPG-transferase were not quence) that is unique to various Rickettsia species is also found detected in eukaryotes. (not shown). In a phylogenetic tree based on exonuclease VII se- Two other proteins where α-proteobacteria-specific inserts quences (Figure 7), all of the α-proteobacterial homologs formed are found are, RP-400 and puryvuate phosphate dikinase (PPDK). a well-defined clade, which was strongly supported by bootstrap The first of these is a protein of unknown function present in the scores. Similar to the rRNA tree, the Rickettsiales species formed Ri. prowazekii genome (RP-400), which is distantly related to the earliest branching group within the α-proteobacteria, and murein transglycosylases. This protein contains a 4–6 aa insert both the Rickettsiales clade, as well as a clade comprising of the in a conserved region that is a distinctive characteristic of var- remainder of the α-proteobacteria were clearly resolved in this ious α-proteobacteria, except Zymomonas mobilis (Figure 8). tree. The inferences from signature sequences are thus strongly The absence of this insert in the latter species could result from PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 107 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 4. Partial sequence alignment of replicative DNA helicase, DnaB, showing an 8–14 aa insert that is specific for various α-proteobacteria.

either selective loss, or exchange of this gene from some other found in various other α-proteobacteria (Figure 9). Interestingly, species lacking the insert. PPDK is a key enzyme in photosyn- an insert of 10 aa is also present in the same position in various thesis, which catalyzes the reversible conversion of phospho- δ-proteobacteria, suggesting a distant relationship of this group enolpyruvate to pyruvate (Ku et al. 1996). This enzyme is not to the α-proteobacteria. Because the insert sequence in vari- found in mammalian cells but it is broadly distributed in bacteria ous species appears to be related, it is possible that this insert and plants. A conserved insert in PPDK provides an informative was originally introduced in a common ancestor of the α- and δ- signature for α-proteobacteria. Rickettsiales species contain a proteobacteria. The varying lengths of the inserts in Rickettsiales 5aainsert in this position, whereas a larger insert of 12 aa is and other α-proteobacteria could then result from subsequent 108 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 5. Sequence alignment of ATP synthase α subunit showing an 8 aa insert in a highly conserved region that is present in various α-proteobacterial homologs, but not found in any other proteobacteria. The shared presence of this insert in various eukaryotic homologs provides evidence of their α-proteobacterial ancestry.

genetic changes in the branches leading to these groups. Alter- divergent and it is possible that this may have been acquired by natively, the inserts of different lengths could have been inde- means of lateral gene transfer (LGT) from other bacteria. pendently introduced in these groups and the observed sequence Two different proteins, FtsK and Cytochrome b (PetB) con- similarity may be a consequence of their related function. It is tain deletions which are mainly limited to the α-proteobacterial of interest that in contrast to other α-proteobacteria, which con- species. In the FtsK protein, which plays a central role in cell tain only a single PPDK homolog, two homologs of this protein division and chromosome segregation in bacteria (Capiaux et al. are found in Bradyrhizobium (Brad.) japonicum. Of these, only 2002; Espeli et al. 2003), two 1 aa deletions are present in con- one contained the insert. The homolog lacking the insert is quite served regions that are largely distinctive of α-proteobacteria PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 109 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 6. Signature sequence in exonucleaseVII (A) and prolipoprotein-phosphatidylglycerol (PLPG) transferase (B) proteins that are distinctive of α- proteobacterial species. Exonuclease VII contains a 3 aa insert and a 1 aa deletion that is unique to all α-proteobacteria. The presence of an additional 3 aa insert distinguishes Rickettsiales species from other α-proteobacteria. The enzyme PLPG-transferase also contains a 3 aa insert that is specific for α-proteobacteria. 110 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University FIG. 7. A neighbor-joining bootstrap consensus tree based on exonucleaseVII sequences. Bootstrap scores >50% are indicated on various nodes. All inserts and deletions were excluded from the sequence alignment used for phylogenetic analysis. The α-proteobacteria formed a well-defined clade in this tree, however, their branching position relative to other groups was not resolved. The Rickettsiales order formed the deepest branch within α-proteobacteria and they were also clearly resolved from other α-proteobacteria. The arrows mark the suggested positions where the identified signatures were introduced in this protein.

(Figure 10). One of these deletions is a distinctive characteris- position corresponding to aa 513–520 in Ri. prowazekii protein). tic of all α-proteobacteria and not found in any other bacteria. Since the region where this insert is found exhibits variability The other deletion, in addition to the α-proteobacteria, is in other bacteria, this signature is not shown. The FtsK protein also commonly present in the two Desulfovibrio species has also been previously shown to contain an 8–9 aa insert in a (δ-proteobacteria), suggesting a distant relationship of this group different region of the protein that is a distinctive characteristic to α-proteobacteria, as also seen with the PPDK protein (Figure of various Bacteriodetes and Chlorobium species (Gupta 2004). 9). In addition to these deletions, the FtsK protein also con- The FtsK homologs are not found in most eukaryotic organisms. tains a 5–6 aa insert that is unique to various α-proteobacteria However, a homolog of this protein is present in Plasmodium in comparison to the other groups of proteobacteria (present in yoelii (Genebank accession number 23485217). The origin and PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 111

FIG. 8. Partial sequence alignment of RP-400 protein showing a 4–6 aa insert that is specific for various α-proteobacteria, except Z. mobilis. For personal use only. possible significance of this gene/protein is presently unclear. branching clade within α-proteobacteria (see Figures 1 and 7) A1aa deletion that is specific for various α-proteobacteria is (Dumler et al. 2001; Gaunt et al. 2001; Yu et al. 2001; Kersters also present in the Cytochrome b (Cyt b; PetB) protein (Fig- et al. 2003; Yu & Walker 2003; Stepkowski et al. 2003). We ure 11), which is a subunit of the cytochrome reductase, which have identified several signatures that are present in various α- is an integral part of the electron transport chain (Daldal et al. proteobacteria, except the Rickettsiales. These signatures are 1987; Stryer 1995; Emelyanov 2003a). This indel is not present described below. in other bacteria including that from Aquifex aeolicus, indicat- The enzyme succinyl CoA-synthetase, which is part of the ing that it is a deletion in α-proteobacteria, rather than an insert citric acid cycle, carries out cleavage of the thioester bond in in other bacteria. Cyt b is one of the 13 proteins that is still succinyl-CoA in a coupled reaction to generate succinate and encoded by mitochondrial DNA (Lang et al. 1999). Sequence producing GTP (Bridger et al. 1987; Stryer 1995). It is the Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University information for Cyt b is available from a large number (>500) only step in the citric acid cycle that directly leads to the for- of mitochondrial genomes and phylogenetic studies based on mation of a high-energy phosphate bond. The beta subunit of this protein provides evidence for the origin of mitochondria this protein contains a conserved insert of 10 aa, that is com- from within the Rickettsiaceae (Sicheritz-Ponten et al. 1998; monly present in all other α-proteobacteria, except the Rick- Emelyanov 2003a). Similar to the α-proteobacteria, Cyt b from ettsiales (Figure 12). Surprisingly, this insert is also present all eukaryotic mitochondrial homologs was found to lack this 1 in Ral. metallidurans (a β-proteobacterium), but not in any aa indel, providing evidence of their specific relationship to the other β-proteobacteria, including the closely related species Ral. α-proteobacteria. solanacearum. This suggests that the Succ-CoA synthetase gene in Ral. metallidurans has likely originated by non-specific means such as LGT. A smaller unrelated insert in this region, which is B. Signature Sequences Distinguishing Rickettsiales from presumably of independent origin, is also present in Cytophaga Other α-Proteobacteria and Rhodopirellula species (not shown). It is of interest that a In phylogenetic trees based on 16S rRNA, as well as many 7–8 aa insert is also present in this position in various eukary- protein sequences, the Rickettsiales are found to form the deepest otic homologs. It is unclear at present, whether this latter insert 112 R. S. GUPTA For personal use only.

FIG. 9. Excerpt from sequence alignment for pyruvate phosphate dikinase (PPDK) protein showing a signature for α-proteobacteria. The Rickettsiales species contain a 5 aa long insert, where all other α-proteobacteria have a 12 aa insert in the same position. Two different homologs of PPDK are found in Brad. japonicum, only one of which is found to contain the insert. A smaller conserved insert of 10 aa is also present in this position in various δ-proteobacteria suggesting that they may be specifically, but distantly, related to the α-proteobacteria.

has originated from an α-proteobacterial ancestor or it is of in- Another signature showing a similar distribution pattern has dependent origin. If these inserts are of common origin, then been identified in cytochrome oxidase polypeptide I (Cox I). this would suggest that the eukaryotic homologs of Succ-CoA- In this case,a5aainsert in a conserved region is commonly

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University synthetase have originated from an α-proteobacterial ancestor present in various α-proteobacterial species except the Rick- other than the Rickettsiales. This observation will be at variance ettsiales (Figure 13). It should be noted that α-proteobacteria with other evidence pointing to a closer relationship of mito- contain two different related proteins. One of these, which har- chondria to the Rickettsiales species (Viale & Arakaki 1994; bors this insert seems to correspond to Cox I, whereas the other Gupta 1995; Andersson et al. 1998; Sicheritz-Ponten et al. 1998; homologs lacking the insert are mainly those from Cytochrome Gray et al. 1999; Lang et al. 1999; Emelyanov 2001a, 2001b, o ubiquinol oxidase (Davidson & Daldal 1987). However, all 2003a). Emelyanov (2001a, 2001b) has observed a closer rela- Rickettsiales species contain only a single homolog of this pro- tionship of mitochondrial homologs to certain rickettsial species tein, corresponding to Cox I. The observed insert in both Succ- (e.g. Holospora obtusa, Caedibactera caryophila), for which CoA-synthetase and Cox I were thus likely introduced in a com- sequence information for this protein is lacking at present. It mon ancestor of the remainder of the α-proteobacteria after the is possible that Succ-CoA synthetase from these species may branching of Rickettsiales. Similar to the Cyt b, the Cox I in eu- contain this insert. Presently, the possibility that the insert in karyotic cells is also encoded by mitochondrial DNA (Andersson eukaryotic homologs was independently introduced also cannot et al. 1998; Gray et al. 1999) and sequence information for be excluded. this protein is available from a large number of mitochondrial PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 113 For personal use only.

FIG. 10. Partial sequence alignments of FtsK protein showing two different signatures (1 aa deletions) that are informative characteristics of α-proteobacteria. The deletion on the left is unique to various α-proteobacteria, whereas the one on the right is also commonly shared by two Desulfovibrio species (δ-proteobacteria) suggesting their relatedness to the α-proteobacteria.

genomes. The eukaryotic homologs of Cox I do not contain for these signatures is that they were introduced in an ancestral

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University the identified insert (results not shown) indicating their possible α-proteobacterial lineage, after the branching of Rickettsiales derivation from Rickettsiales (Emelyanov 2003a). (and also possibly Mag. magnetotacticum). The observed vari- Two other proteins were found to contain inserts of variable ations in the lengths of these inserts have presumably resulted lengths in highly conserved regions in various α-proteobacterial from subsequent genetic changes. species, with the exception of Rickettsiales (Figure 14). In We have also identified a number of α-proteobacteria-specific alanyl-tRNA synthetase (AlaRS), which is ubiquitously found signatures in proteins for which no homologs are found in all organisms, an insert of between 5–11 aa is present in in the Rickettsiales.Inthe MutY protein, which is an A-G a highly conserved region in various α-proteobacteria, except specific DNA glycosylase involved in DNA repair (Parker & the Rickettsiales (and also Mag. magnetotacticum) (Figure 14A). Eshleman 2003; Martins-Pinheiro et al. 2004), a 4–9 aa in- Another signature showing a similar distribution pattern is found sert in a conserved region is present in various α-proteobacteria in the MutS protein, which is involved in the DNA mismatch re- (Figure 15A). An insert of similar length is also present in most pair (Sixma 2001; Martins-Pinheiro et al. 2004). In this case, a eukaryotic homologs (with the exception of Anopheles gambiae) conserved insert of 2–5 aa is present in various α-proteobacteria indicating their possible derivation from α-proteobacteria. An- (Figure 14B), but not in Rickettsiales. The simplest explanation other signature showing similar species distribution is present in 114 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 11. Partial sequence alignment for Cyt b protein showing a 1 aa deletion that is specific for various α-proteobacteria. This deletion is also present in all mitochondrial homologs (Cyt b is encoded by mitochondrial DNA) providing strong evidence of their α-proteobacterial ancestry.

the protein homoserine dehydrogenase (Figure 15B). This indel observed inserts in these genes could have been introduced in a consists ofa1aainsert in a conserved region that is present common ancestor of the α-proteobacteria, either before or after in various α-proteobacteria, but not any other proteobacteria. the loss of these genes in Rickettsiales. The homologs of both these proteins were not detected in the Several proteins contain conserved inserts that are either unique Rickettsiales species and their absence is very likely due to selec- for the Rickettsiales or for the two main families, Rickettsi- tive loss of these genes in a common ancestor of the Rickettsiales aceae and Anaplasmataceae, comprising this order (Dumler (Martins-Pinheiro et al. 2004), presumably due to the intracel- et al. 2001; Yu& Walker 2003). The Rickettsiales-specific signa- lular life-style of these organisms (Boussau et al. 2004). The tures are present in the proteins XerD and leucine aminopeptidase PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 115 For personal use only.

FIG. 12. Partial sequence alignment of Succ-CoA synthase showing a 10 aa insert that is present in various α-proteobacteria, except the Rickettsiales. This insert is not found in other bacteria, except Ral. metallidurans, which has likely acquired it by non-specific means. A smaller insert is also present in this position in various eukaryotic homologs. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University (Figure 16). XerD protein (Figure 16B) is a part of the XerCD (Figure 17B) anda1aainsert in the FtsZ protein (Figure 17C). integrase/recombinase that is involved in the cell division pro- Two additional Rickettsia-specific signatures consisting of a 1 cess and decatenation of DNA duplexes (Ip et al. 2003).A7aa aa insert in the major sigma factor-70 (at position 141 in the R. insert is present in a conserved region of this protein which is prowazekii sequence) anda1aadeletion in exouclease VII (at uniquely shared by all Rickettsiales and not found in any other position 137 in the Ri. prowazekii homolog) were also identi- bacteria (Figure 16A). Another 2 aa insert that is specific for fied, but they are not shown here. The identified signatures in Rickettsiales is present in leucine aminopeptidase (Figure 16A), these proteins are present only in various Rickettsiaceae species which is an exopeptidase that selectively releases N-terminal and not found in other Rickettsiales (viz. Ehrlichia, Wolbachia, amino acids from peptides and proteins (Gonzales & Robert- Anaplasma)orother groups of bacteria. Within eukaryotes, a Baudouy 1996). The signatures that are specific for Rickettsia homolog of the transcription repair-coupling factor is only de- include a 4 aa insert in a highly conserved region of the tran- tected in Arabidoposis thaliana and it lacks the identified insert scription repair coupling factor (Mfd) (Martins-Pinheiro et al. (results not shown). The homologs of ribosomal protein L19 are 2004) (Figure 17A), a 10 aa insert in ribosomal protein L19 found in various plants and algae but not in any of the animal 116 R. S. GUPTA

FIG. 13. Partial sequence alignment of Cox I showing a 5 aa insert that is present in various α-proteobacteria, except Rickettsiales. The other α-proteobacteria also contains a second more distantly related homolog that lacks this insert.

species. Of these, an 8 aa insert in the same position is present et al. 2003; Emelyanov 2003a). These signatures were likely in- For personal use only. only in the homolog from Cyanophora paradox (not shown). The troduced in a common ancestor of the Anaplasmataceae family, significance and possible origin of this insert is not clear. Simi- which now includes all Ehrlichia, Anaplasma, Cowdria, Wol- lar to the ribosomal protein L19, FtsZ homologs are also found bachia, and Neorickettsia species (Dumler et al. 2001; Yu & only in plants but not in animals. These homologs also lacked the Walker 2003). insert that is present in Rickettsiaceae. The plant homologs of these proteins likely correspond to those of the plastids, which because of their cyanobacterial ancestry (Gray 1989; Morden C. Signature Sequences for Other Subgroups of et al. 1992; Margulis 1993; Gupta et al. 2003) are expected to α-Proteobacteria and Providing Information Regarding be lacking Rickettsia-specific signatures. Their Interrelationships We have also identified two large inserts that are commonly Signature sequences in a number of other proteins are use- Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University shared by the Ehrlichia, Wolbachia, and Anaplasma species but ful in distinguishing other subgroups of α-proteobacteria and not found in any of the Rickettsia species or other bacteria. These they also provide information clarifying the interrelationships signatures include a 15 aa insert in the HlyD family of secretory among them. In the DnaA protein involved in chromosomal protein (Figure 18A) and a 10–11 aa insert in the tRNA guanine replication (Messer 2002),a5aainsert is present in various Rhi- transglycosylase (Tgt) protein (Figure 18B), involved in the syn- zobiales and Caulobacter/Rhodobacter species (Figure 19A). thesis of hypermodified nucleoside queousine (Reuter & Ficner However, this insert is not found in any of the Rickettsiales,as 1995). The eukaryotic homologs of Tgt do not contain this insert well most α-proteobacterial species belonging to the orders Sph- providing evidence against their origin from Anaplasmatacaea ingomonadales and Rhodospirillales. The species Mag. mag- family of species (results not shown). The homologs of HlyD are netotacticum contains two different homologs of this protein, not found in eukaryotes. These signatures point to a close rela- only one of which is found to contain the insert. Another insert tionship between Ehrlichia, Wolbachia, and Anaplasma species, showing a similar distribution pattern is present in the protein which is also seen in phylogenetic trees based on many other se- RP057, which is a homolog of the glucose-inhibited division quences (Dumler et al. 2001; Gaunt et al. 2001; Yu et al. 2001; protein B (Romanowski et al. 2002). This protein contains a 3 aa Taillardat-Bisch et al. 2003; Yu & Walker 2003; Stepkowski insert that is common to the same subgroups of α-proteobacteria PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 117 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 14. Signature sequences in alanyl-tRNA synthetase (AlaRS) and MutS proteins that are informative for the α-proteobacteria. In AlaRS (upper panel) an insert of variable length in a highly conserved region is present in various α-proteobacteria, except the Rickettsiales and Mag. magentotacticum. The DNA mismatch repair protein MutS (lower panel) also contains a 3–5 aa insert in various α-proteobacteria, except Rickettsiales. The inserts lengths in this case also serve to differentiate Rhodospirillales and Sphingomonadales species from the Rickettsiales, Rhodobacterales, and Caulobacterales. 118 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 15. Partial sequence alignments of MutY (upper panel) and homoserine dehydrogenase (lower panel) proteins showing inserts (boxed) in conserved regions that are specific for α-proteobacteria. The homologs of both these proteins are not found in the Rickettsiales. For MutY, an insert of approximately similar length is also present in various eukaryotic homologs, with the exception of Anopheles gambiae.

as the insert in the DnaA protein, but which is not found in α-proteobacteria. Based on the distribution patterns of these the Rickettsiales or Rhodospirillales/Sphingomonadales species signatures, these inserts were likely introduced in a common (Figure 19B). The variable length inserts are also present in ancestor of the Rhizobiales and Caulobacter/Rhodobacter this position in other bacteria (not shown). However, within after the branching of Rickettsiales and Rhodospirillales/ proteobacteria this insert is limited to the above subgroups of Sphingomonadales orders (Figure 19C). PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 119 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 16. Signature sequences in XerD integrase (upper panel) and leucine aminopeptidase (lower panel) that are distinctive of the Rickettsiales order and not found in other α-proteobacteria or other bacteria. 120 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 17. Signature sequences in transcription repair coupling factor Mfd (A), Ribosomal protein L19 (B), and FtsZ (C) proteins that are distinctive of Rickettsia species and not found in other α-proteobacteria including Anaplasmataceae family (e.g., Wolbachia, Ehrlichia, Anaplasma)ofspecies. Two additional signatures showing similar distribution are found in the sigma factor-70 and exonuclease VII proteins. PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 121 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 18. Signature sequences in RP-314 (A) and tRNA guanine transglycosylase (Tgt) (B) proteins that are distinctive of the Anaplasmataceae family of species and not found in Rickettsia or various other bacteria.

The protein DNA ligase (NAD dependent; Lig A) contains a ingomonadales (Z. mobilis, Novo. armoaticivorans). The ab- 12 aa insert in a highly conserved region that is commonly shared sence of this insert in the Mesorhizobium sp. BNC1,issome- by various Rhizobiales as well as Rhodobacterales species what surprising, but it could result from non-specific mecha- (Figure 20A), but which is not found in C. crescentus, Rho- nisms. This signature suggests that Rhizobiales species may dospirillales (Rhodo. rubrum, Mag. magnetotacticum), and Sph- be more closely related to Rhodobacterales in comparison to 122 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 19. Partial sequence alignments of DnaA (panel A) and RP-057 (panel B) proteins showing inserts in conserved regions (boxed) that are only present in various Rhizobiales, Rhodobacterales, and Caulobacter,but not found in other α-proteobacteria or bacteria. These inserts were likely introduced in a common ancestor of the above groups after the branching of Rickettsiales, Rhodospirillales, and Sphingomonadales as indicated in panel C.

Caulobacter and other α-proteobacteria. However, another promi- a distinct clade in the tree, but they consisted of only certain nent insert (11 aa) in a highly conserved region of the protein Rhizobiaceace and Caulobacter species (Martins-Pinheiro et al. aspargine-glutamine amidotransferase points to a specific re- 2004). To fully understand the evolutionary significance of these lationship between Rhodobacterales and Caulobacter species signatures, it would be necessary to obtain sequence information (Figure 20B), to the exclusion of all other α-proteobacteria. for these proteins from additional Caulobacterales. Martins-Pinheiro et al. (2004) have reported phylogenetic anal- We have also identified many conserved inserts that are ysis based on LigA sequences. The α-proteobacteria formed specific for species belonging to the Rhiziobiales order. The PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 123 For personal use only.

FIG. 20. Signature sequences in DNA ligase A and (upper panel) and Asn-Gln amidotransferase (lower panel) that are informative for α-proteobacteria. The signature in DNA ligase is commonly shared by various Rhizobiales as well as C. crescentus species, while that in the Asn-Gln amidotransferase is uniquely shared by Rhodobacterales and Caulobacter, indicating a specific relationship between these subgroups.

Trp-tRNA synthetase (TrpRS) contains a large insert in a in a common ancestor of the Rhizobiales and subsequent modifi-

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University highly conserved region which is uniquely shared by various cation has led to the observed length variation. The distinctness Rhizobiales species (Figure 21A), but not found in any of the of Bradyrhizobium and Rhodopseudomonas from other Rhizo- other α-proteobacteria or other groups of bacteria (results for biales is also supported by a signature (3 aa insert) in Seryl-tRNA other groups of bacteria not shown). The absence of this insert in synthetase (SerRS), which is uniquely present in these species various Rickettsiales, Rhodospirillales, Sphingomonadales, and (Figure 21B) and it serves to distinguish them from other Rhizo- Rhodobacterales as well as Caulobacter provides evidence that biales as well as other α-proteobacteria. A schematic diagram these groups have branched off prior to the introduction of this indicating the suggested positions where signatures described in insert (Figure 21A). The length of the insert in TrpRS also serves Figures 20 to 23 have been introduced is presented in Figure 21C. to distinguish the Rhizobiaceae, Brucellaceae, and Phyllobacte- We have also identified several signatures that are uniquely riaceae family of species from those belonging to Bradyrhizobi- present in the Rhizobiaceae, Brucellaceae, and Phyllobacteri- aceae and Methylobacteriaceae. The insert in the former group aceae families of species, but not found in other α-proteobacteria of species is 19 aa long, whereas the latter species contain only a including Bradyrhizobium and Rhodopseudomonas. These sig- 9–10 aa insert. Because the insert sequence in all of these species natures include a 7 aa insert in Oxoglutarate dehydrogenease is conserved, it is likely that the insert was introduced only once (Figure 22A),a5aainsert in Succ-CoA synthase (Figure 22B), 124 R. S. GUPTA For personal use only.

FIG. 21. Signature sequences in Trp-tRNA synthetase (upper panel) and Ser-tRNA synthetase (lower panel) that are informative for α-proteobacteria. The first of these signatures is specific for Rhizobiales. The insert length in this signature also distinguishes Bradyrhizobiaceae and Methylobacteriaceae species from other Rhizobiales. The insert in the Ser-tRNA synthetase is specific for the Bradyrhizobiaceae species and distinguishes this family from other Rhizobiales.

a3aainsert in LytB metalloproteinase (Figure 23A) anda2aa well as Ehrlichia and a few other proteobacteria, has also been insert in DNA gyrase A subunit (Figure 23B). A smaller insert described by Stepkowski et al. (2003). in oxoglutarate dehydrogenase is also present in Novosphingob- acteria,but since its sequence is unrelated, it is either of in- D. Signature Sequences Indicating the Phylogenetic Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University dependent origin or could have resulted from LGT. In addition Placement of α-Proteobacteria to these proteins, a 1–2 aa insert that is specific for Rhizobi- A number of signatures described in earlier work have indi- aceae is also found in a conserved region of the LepA protein cated that proteobacteria is a late branching phylum in compar- (Figure 23C). The evolutionary positions where these signa- ison to other main groups within Bacteria (Gupta 1998, 2000, tures have been introduced are indicated in Figure 21C. It is 2003; Gupta & Griffiths 2002; Griffiths & Gupta 2004b). These of interest that in contrast to other Rhizobiaceae species, which signatures includeda4aainsert in alanyl-tRNA synthetase, an contain only 1 aa inserts, Sinorhizobium meliloti and Agrobac- insert of >100 aa in RNA polymerase β (RpoB) subunit, a 10 terium tumefacienes are found to contain 2 aa inserts in the LepA aa insert in CTP synthase, a 2 aa insert in inorganic pyrophos- protein (Figure 23C). This observation points to a specific rela- phatase, anda2aainsert in Hsp70 protein. The identified sig- tionship between these two Rhizobiaceae species, as has been natures in these proteins were present in all proteobacterial ho- suggested based on other lines of evidences (Young et al. 2001). mologs, but they were absent from most other bacterial phyla A2aainsert in the DnaK protein, which is commonly shared (viz. Firmicutes, Actinobacteria, Thermotogae, Deinococcus- by species belonging to Rhizobium and Sinrhizobium genera, as Thermus, Cyanobacteria, Spirochetes). In a number of cases, PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 125 For personal use only.

FIG. 22. Signature sequences in Oxoglutarate dehydrogenase (upper panel) and Succ-CoA synthase (lower panel) proteins that are commonly shared by only certain Rhizobiales families (e.g., Rhizobiaceae, Brucellaceae, and Phyllobacteriaceae), and not found in Bradyrhizobiaceae or other α-proteobacteria. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University where the corresponding proteins were present in Archaea (viz. Figure 24 shows the excerpt from a sequence alignment for RpoB, Hsp70, AlaRS), the archael homologs also lacked the in- the transcription termination factor Rho, which is an RNA- dicated inserts, indicating that the absence of these indels con- binding protein that plays a central role in the RNA chain stitute the ancestral states and that these signatures were intro- termination (Opperman & Richardson 1994). This protein is duced after branching of the groups lacking these indels (Gupta present in all main groups of bacteria, except cyanobacteria & Griffiths 2002; Gupta 2003; Griffiths & Gupta 2004b). A num- (Gupta & Griffiths 2002; Gupta 2003), where RNA chain termi- ber of identified signatures (7 aa insert in SecA, 1 aa deletion in nation presumably occurs via a Rho-independent mechanism. the Lon protease) were uniquely shared by only the α, β, and A3aa insert is present in a highly conserved region of Rho, γ -proteobacteria, providing evidence of the later branching of which is a distinctive characteristic of all α, β, and γ -proteo- these subdivisions (Gupta 2000, 2001, 2003). Two additional bacteria. The length of this insert is 2–3 aa longer in various signatures that are helpful in understanding the phylogenetic Rickettsiales species, which suggests an additional insert in this placement of α-proteobacteria are described in the following group of bacteria. In contrast to the α, β, and γ -proteobacteria, section. this insert is not present in δ, ε-proteobacteria or any other 126 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 23. Signature sequences in LytB (A), DNA gyrase A (B) and LepA proteins that are distinctive characteristics of only certain Rhizobiales families (e.g., Rhizobiaceae, Brucellaceae, and Phyllobacteriaceae), but not found in Bradyrhizobiaceae or other α-proteobacteria. PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 127 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 24. Partial sequence alignment of Rho protein showing a conserved insert that is commonly shared by various α, β, and γ -proteobacteria, but not found in any other groups of bacteria including the δ,-proteobacteria and all other phyla of gram-positive and gram-negative bacteria. This insert was likely introduced in a common ancestor of the α, β, and γ -proteobacteria after the branching of other bacterial phyla (see Figure 26). Many other signatures showing similar distribution pattern and supporting the indicated branching position of α, β, and γ -proteobacteria have been described in earlier work.

groups of Gram-negative and Gram-positive bacteria. This is present in various β and γ -proteobacteria, but it is not signature provides evidence that the groups consisting of α, found in any α-proteobacteria or other groups of bacteria (Fig- β, and γ -proteobacteria have branched off late in compari- ure 25). The absence of this insert in various other bacteria as son to the other groups of bacteria. Another novel signature well as archael homologs provides evidence that it was intro- that is useful in understanding the branching position of α- duced in a common ancestor of the β and γ -proteobacteria af- proteobacteria is present in the ATP synthase alpha subunit. ter the divergence of other bacteria, including α-proteobacteria In this case, an 11 aa insert in a highly conserved region (Figure 26). 128 R. S. GUPTA For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 25. Partial sequence alignment of ATP synthase α-subunit showing a highly conserved insert that is commonly shared by various β and γ -proteobacteria, but not found in any other groups of bacteria including the α- and δ,-proteobacteria and all other phyla of Gram-positive and Gram-negative bacteria. This insert is also not present in archael or eukaryotic homologs indicating that it was introduced in a common ancestor of the β and γ -proteobacteria after the branching of all other groups including α-proteobacteria. Other signatures showing similar relationships have been described in earlier work (Gupta 1998, 2000, 2001, 2003). PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 129 For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University

FIG. 26. Evolutionary relationships among α-proteobacteria based on signature sequences in different proteins. The branching position of α-proteobacteria relative to other groups of bacteria is based on signature sequences such as those shown in Figures 24 and 25. The evolutionary stages where these signatures have been introduced are indicated by thick arrows. Many other signatures that are helpful in resolving the branching order of other groups have been described in our earlier work (Gupta 1998, 2000, 2001, 2003, 2004; Gupta & Griffiths 2002; Griffiths & Gupta 2004b (see also www.bacterialphylogeny.com)). The evolutionary relationship among α-proteobacteria shown here was deduced based on the distribution patterns of different signatures described in this review. The long thin arrows mark the positions where the signature sequences in various proteins have likely been introduced. 130 R. S. GUPTA

CONCLUSIONS guish them from all other bacteria. The unique presence of these The α-proteobacteria are a morphologically and metaboli- signatures in various α-proteobacteria, which is a very diverse cally very diverse group of organisms, which are presently rec- group (Kersters et al. 2003), strongly suggests that these indels ognized as a distinct group solely on the basis of their branching should be functionally important for this group of organisms. pattern in the 16S rRNA tree (Woese et al. 1984; Stackebrandt Hence, studies examining their functional effects should be of et al. 1988; Murray et al. 1990; De Ley 1992; Ludwig & Klenk much interest. 2001; Kersters et al. 2003). No biochemical, molecular or other Signature sequences in other proteins are helpful in defining features are presently known, which are uniquely shared by many of the α-proteobacteria subgroups and in clarifying evolu- various α-proteobacteria and that can clearly distinguish this tionary relationships among them. A number of proteins, which group from all others. The evolutionary relationships within include, Succ-CoA synthetase, Cox I, AlaRS, and MutS, contain this group of bacteria are also presently not understood. This conserved inserts that are shared by all other α-proteobacteria, review describes many novel signatures consisting of conserved except the Rickettsiales. The homologs of these proteins from inserts and deletions in widely distributed proteins that provide other bacteria also lack these indels providing evidence that these definitive means for defining the α-proteobacteria and many of signatures were introduced in a common ancestor of other α- its subgroups, and for understanding evolutionary relationships proteobacteria after the divergence of Rickettsiales. The Rick- among them. Because of the rarity and highly specific nature of ettsiales order also consistently forms the deepest branching these genetic changes, the possibility of their arising indepen- lineage in 16S rRNA and various protein trees (Dumler et al. dently by either convergent or parallel evolution is low (Gupta 2001; Gaunt et al. 2001; Kersters et al. 2003; Yu & Walker 1998; Rokas & Holland 2000). The simplest and most parsi- 2003; Stepkowski et al. 2003). Signature sequences in a num- monious explanation for such rare genetic changes, when re- ber of proteins were found to be specific for either the Rick- stricted to a particular clade(s), is that they were introduced ettsiales order (viz. XerD integrase and leucine aminopeptidase) only once in common ancestors of the particular group(s) and or the two main families, Rickettsiaceae (viz. transcription repair then passed on to various descendants. The signature approach coupling factor, ribosomal protein L19, and FtsZ proteins) and has proven very useful in the past in clarifying a number of Anaplasmataceae (RP-314 and Tgt proteins). These signatures important evolutionary relationships, which could not be reli- were likely introduced in the common ancestors of these groups. ably resolved based on phylogenetic trees (Rivera & Lake 1992; These groups are also clearly distinguished in the phylogenetic Baldauf & Palmer 1993). Our earlier work has identified many trees based on 16S rRNA (Figure 1) (Dumler et al. 2001; Yu & signatures that are either specific for particular groups of bac- Walker 2003) and various proteins (Figure 7) (Stepkowski et al. teria (viz. chlamydiae, cyanobacteria, Bacteroidetes-Chlorobi- 2003; Emelyanov 2003a). Fibrobacter, Deinococcus-Thermus, Proteobacteria) (Gupta Signature sequences in a number of proteins (viz. chromoso- For personal use only. 2000, 2004; Griffiths & Gupta 2002, 2004a; Gupta et al. 2003), mal replication factor, RP-057 and DNA ligase) were commonly or which are commonly shared by certain bacterial phyla provid- shared by various Rhizobiales, Rhodobacterales, and in most ing information regarding their interrelationships (Gupta 1998, cases Caulobacterales (currently represented by only C. cres- 2003; Gupta & Griffiths 2002; Griffiths & Gupta 2004b). centus),but they were not present in Rickettsiales, Rhodospir- A summary of the different signatures that were described illales as well as Sphingomonadales species. These results pro- in this review and the overall picture of α-proteobacterial evo- vide evidence that the groups lacking these signatures diverged lution that emerges based upon them is presented in Figure 26. prior to the introduction of these signatures. A unique signature Most of the signatures described here were unique for either all has also been identified for the Rhizobiales order (viz. TrpRS), α-proteobacteria or certain of its subgroups, and except for a few and one which is commonly shared by Rhodobacterales and C. isolated instances, they were not found in other bacteria. These crescentus. The latter signature suggests a specific relationship

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University finding provides evidence that the genes containing these signa- between the Rhodobacterales and Caulobacter groups. The re- tures have not been laterally transferred from α-proteobacteria lationships indicated by these signatures are also generally sup- to other bacteria, although LGT for certain other genes have ported by the phylogenetic trees based on 16S rRNA and various been previously reported (Wolf et al. 1999). A large number proteins (Gaunt et al. 2001; Kersters et al. 2003; Stepkowski et al. of these signatures, present in broadly distributed proteins (cy- 2003; Emelyanov 2003a). Signatures sequences in a number tochrome assembly protein Ctag, SAICAR synthetase, DnaB, of other proteins (viz. oxoglutarate dehydrogenase, Succ-CoA ATP synthase α,exonuclease VII, PLPG transferase, RP-400, synthase, DNA gyrase A, LepA, and LytB), are able to distin- puruvate phosphate dikinase, FtsK, and Cyt b) were distinctive guish the Rhizobiaceae, Brucellaceae, and Phyllobacteriaceae characteristics of all α-proteobacteria. Two additional proteins, families from the Bradyrhizobiaceae species. The distinctness MutY and homoserine dehydrogenase, also contain signatures of Bradyrhizobiaceae from other Rhizobiales is also clearly that were specific for α-proteobacteria. However, the homologs indicated by a signature sequence in seryl-tRNA synthetase of these proteins were not found in Rickettsiales. These signa- that is specific for this group. These signatures are also con- tures, for the first time, describe molecular characteristics that sistent with the observation that Bradyrhizobiaceae species are unify all α-proteobacteria, and provide means to clearly distin- only distantly related to other Rhizobiales (viz. Rhizobeaceae, PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 131

Brucellaceae, and Phyllobacteriaceae) (Figure 1) (Sadowsky within Bacteria (Woese et al. 1985; Stackebrandt 2000; Ludwig & Graham 2000; Gaunt et al. 2001; van Berkum et al. 2003; & Klenk 2001; Gupta & Griffiths 2002; Gupta 2002). Based on Kersters et al. 2003; Stepkowski et al. 2003; Moulin et al. 2004). the observations that α-proteobacteria can now be clearly dis- A specific relationship between Sinorhizobium and Agrobac- tinguished from all other bacteria based upon a large number of terium species was also indicated by the signature sequence in molecular characteristics, and that this group also branches dis- the LepA protein. tinctly from all other groups of bacteria including the β, γ - and On the basis of 16S rRNA or various genes/proteins trees, it δ,-proteobacteria, it is suggested that α-proteobacteri should be has proven difficult to reliably determine the interrelationships recognized as a main group or phylum within Bacteria, rather among different α-proteobacterial subgroups (Ludwig & Klenk than as a subdivision or class of the Proteobacteria (Gupta 2000, 2001; Kersters et al. 2003). However, based upon the distribu- 2004; Gupta & Griffiths 2002). Signature sequences in a few tion patterns of various signatures, it is now possible to logically proteins (viz. PPDK and FtsK) indicate that α-proteobacteria deduce the branching order of the main α-proteobacterial sub- might have shared a distant ancestry with the δ-proteobacteria groups (Figure 26). The model for α-proteobacterial evolution, exclusive of other bacteria, but this relationship needs to be fur- which has been developed here is based upon a large number of ther investigated and confirmed. proteins, which are involved in different functions. This model The α-proteobacteria have also given rise to mitochondria is internally highly consistent and it is difficult to logically ex- (Margulis 1970; Gray & Doolittle 1982; Andersson et al. 1998; plain the observed distributions of these signatures by alternate Sicheritz-Ponten et al. 1998; Gray et al. 1999; Gupta 2000; means. The model developed here is also consistent with the rela- Emelyanov 2001a, 2003a, 2003b) and very likely played a cen- tionships, which are resolved in the 16S rRNA or other phyloge- tral role in the origin of the ancestral eukaryotic cell (Gupta netic trees (viz. deep branching and distinctness of Rickettsiales, & Singh 1994; Gupta & Golding 1996; Margulis 1996; Gupta a closer relationship between Rhizobiaceae, Brucellaceae, and 1998; Martin & Muller 1998; Lopez-Garcia & Moreira 1999; Phyllobacteriaceae as compared to Bradyrhizobiaceae;acloser Karlin et al. 1999; Lang et al. 1999; Emelyanov 2003b; Rivera relationship between Rhodobacterales and Caulobacterales; dis- & Lake 2004). Many of the α-proteobacteria specific signa- tinctness of Rickettsiaceae from Anaplasmataceae species; dis- tures identified in the present work are also present in the mito- tinctness of Rhizobiales order containing various root nodule chondrial/eukaryotic homologs, providing additional evidence bacteria, etc.) (Sadowsky & Graham 2000; Dumler et al. 2001; of their derivation from an α-proteobacterial ancestor. In a few Kersters et al. 2003; Yu & Walker 2003; Moulin et al. 2004). cases, the α-proteobacterial signatures are present in genes which Afew minor inconsistencies seen at present (e.g., phyloge- are encoded by the mitochondrial DNA (viz. Cox I and Cyt b). netic placement of Ca. crescentus) should be clarified when The shared presence of these signatures in the mitochondrial ho- sequence information from additional species becomes avail- mologs provides further strong evidence for the α-proteobacterial

For personal use only. able. In this context, it is important to acknowledge that se- ancestry of mitochondria, as previously shown by phylogenetic quence information is available at present from only a limited analysis (Andersson et al. 1998; Sicheritz-Ponten et al. 1998; number of α-proteobacterial species. Although, these species Emelyanov 2003a). The current evidence suggests that within include representatives from different α-proteobacterial orders, α-proteobacteria, the Rickettsiales group of species are the clos- it is necessary to obtain sequence information for many other est relatives of mitochondria (Gupta 1995; Andersson et al. 1998; species from different genera and families to test and validate this Sicheritz-Ponten et al. 1998; Gray et al. 1999; Lang et al. 1999; model. Emelyanov 2001a, 2001b). However, this view is supported by Signature sequences in a number of proteins, a few of which only some of the identified signatures and further work is needed are described here, also provide evidence that α-proteobacteria to clarify this aspect. is a late diverging group within Bacteria (Gupta 1998, 2000, δ Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University 2003; Gupta & Griffiths 2002). Within proteobacteria, and -subdivisions are indicated to have branched prior to α-proteo- LIST OF ABBREVIATIONS bacteria, whereas β and γ -subdivisions are indicated as later AlaRS, alanyl-tRNA synthetase; CFBG, Chlamydia- branching groups (see also www.bacterialphylogeny.com). The Fibrobacter-Bacteroidetes-Green sulfur bacteria; Cyt., Cyto- branching of α-proteobacteria in this position is also supported chrome; Cox I, Cytochrome oxidase polypeptide I; LGT, lat- by the16S rRNA and various protein trees (Olsen et al. 1994; eral gene transfer; PLPG, Prolipoprotein-phosphatidylgycerol; Viale et al. 1994; Eisen 1995; Kersters et al. 2003). The α- PPDK, pyruvate phosphate dikinase; RP, Rickettsia prowazekii; proteobacteria, which is a very large group within Bacteria SerRS, serine-tRNA synthetase; Succ-CoA, Succinyl-CoA; Tgt, (>5000 entries in the RDP-II database) (Maidak et al. 2001), tRNA-guanine transglycosylase; TrpRS, tryptophanyl-tRNA are presently recognized as a Class within the Proteobacteria synthetase; Abbreviations in the species names are: A., Agrobac- phylum (Woese et al. 1984; Stackebrandt et al. 1988; Murray terium; Ana., Anaplasma; Aqu., Aquifex; Azo., Azotobacter; et al. 1990; Ludwig & Schleifer 1999; Boone et al. 2001; Azospir.,Azospirillum; Bac., Bacillus; Bact., Bacteroides; Bart., Kersters et al. 2003). However, presently there are no clearly de- Bartonella; Bdello., Bdeollovibrio; Bif., Bifidobacterium; Bor., fined criteria for the higher taxa (viz. Phylum, Class, Order, etc.) Borrelia; Bord., Bordetella; Brad. Bradyrhizobium; Bru., 132 R. S. GUPTA

Brucella; Buch., Buchnera; Burk., Burkholderia; Ca., Caulobac- Boone, D.R., Castenholz, R.W., and Garrity, G.M. 2001. Bergey’s Manual of ter; Camp., Campylobacter; Cb., Chlorobium; Cfx., Chlorof- Systematic Bacteriology. Springer, New York. lexus; Chl., Chlamydia; Chlam, Chlamydophila; Chromo., Boussau, B., Karlberg, E.O., Frank, A.C., Legault, B.A., and Andersson, S.G. 2004. Computational inference of scenarios for alpha-proteobacterial genome Chromo-bacterium; Clo., Clostridium; Cor., Cornyebacterium; evolution. Proc. Natl. Acad. Sci. USA 101, 9722–9727. Cox., Coxiella; Cyt., Cytophaga; Dei., Deinococcus; Dechloro., Bridger, W.A., Wolodko, W.T., Henning, W., Upton, C., Majumdar, R., and Dechloromonas; Des., Desulfovibrio; Desulf., Desulfito- Williams, S.P. 1987. The subunits of succinyl-coenzyme A synthetase— bacterium; Dros. endo., Drosophila endosymbiont; E., Esche- function and assembly. Biochem. Soc. Symp. 103–111. richia; Ent., Enterococcus; Fuso., Fusobacterium; Geo., Geo- Broughton, W. J. 2003. Roses by other names: of the Rhizobiaceae. J. Bacteriol. 185, 2975–2979. bacter; H., Haemophilus; Hel., Helicobacter; Lac., Lactococ- Capiaux, H., Lesterlin, C., Perals, K., Louarn, J.M., and Cornet, F. 2002. A dual cus; Lactobac., Lactobacillus; Lep., Leptospira; Lis., Listeria; role for the FtsK protein in Escherichia coli chromosome segregation. EMBO Leg., Legionella; Mag., Magnetococcus; Meso., Mesorhizobium; Rep. 3, 532–536. Methano., Methanobacterium; Methyl., Methylobacillus; Mi- Chase, J.W., Rabin, B.A., Murphy, J.B., Stone, K.L., and Williams, K.R. 1986. crobul., Microbulbifer; Myc., Mycobacterium; Myx., Myxococ- Escherichia coli exonuclease VII. Cloning and sequencing of the gene encod- ing the large subunit (xseA). J. Biol. Chem. 261, 14929–14935. cus; Nei., Neisseria; Nit., Nitrosomonas; Nitro., Nitrosospira; Daldal, F., Davidson, E., and Cheng, S. 1987. Isolation of the structural genes Novo., Novosphingobacterium; Olig., Oligotropha; Para., Para- for the Rieske Fe-S protein, cytochrome b and cytochrome c1 all components coccus; Pas., Pasteurella; Photobac., Photobacterium; Por., of the ubiquinol: Cytochrome c2 oxidoreductase complex of Rhodopseu- Porphyromonas; Pse., Pseudomonas; Ral., Ralstonia; Rhi., Rhi- domonas capsulata. J. Mol. Biol. 195, 1–12. zobium; Rho., Rhodobacter; Rhodo., Rhodospirillum; Rhodo- Davidson, E., and Daldal, F. 1987. Primary structure of the bc1 complex of Rhodopseudomonas capsulata. Nucleotide sequence of the pet operon encod- pseud., Rhodopseudomonas; Ri., Rickettsia; Shew., Shewanella; ing the Rieske cytochrome b, and cytochrome c1 apoproteins. J. Mol. Biol. Sino., Sinorhizobium; Sta., Staphylococcus; Str., Streptomyces; 195, 13–24. Strep., Streptococcus; Syn., Synechococcus; Sulfo., Sulfolobus; De Ley, J. 1992. The Proteobacteria: Ribosomal RNA cistron similarities T., Thermotoga; Thermoan., Thermoanaerobacter; Thermosyn., and bacterial taxonomy. In The Prokaryotes, eds. A. Balows, H.G. Tr¨uper, Thermosynechococcus; Tre., Treponema; Vib., Vibrio; Xan., M. Dworkin, W. Harder, and K.H. Schleifer, 2111–2140. Springer-Verlag, New York. Xanthomonas; Thiobac., Thiobacillus; Wol., Wolinella; Xyl., DelVecchio, V.G., Kapatral, V., Redkar, R.J., Patra, G., Mujer, C., Los, T., Xylella; Yer., ; Z., Zymomonas. Ivanova, N., Anderson, I., Bhattacharyya, A., Lykidis, A., Reznik, G., Jablonski, L., Larsen, N., D’Souza, M., Bernal, A., Mazur, M., Goltsman, E., Selkov, E., Elzer, P.H., Hagius, S., O’Callaghan, D., Letesson, J. J., Haselkorn, ACKNOWLEDGMENTS R., and Kyrpides, N. 2002. The genome sequence of the facultative intracel- The competent technical assistance of Pinay Kanth, Jeveon lular pathogen Brucella melitensis. Proc. Natl. Acad. Sci. USA 99, 443–448. Clements, Larissa Shamseer, and Adeel Mahmood in creating Dumler, J.S., Barbet, A.F., Bekker, C.P., Dasch, G.A., Palmer, G.H., Ray, S.C., Rikihisa, Y., and Rurangirwa, F.R. 2001. Reorganization of genera in the

For personal use only. sequence alignments of proteins from Rickettsia prowazekii and families Rickettsiaceae and Anaplasmataceae in the order Rickettsiales: Uni- other genomes is thankfully acknowledged. I am also thankful to fication of some species of Ehrlichia with Anaplasma, Cowdria with Ehrlichia YanLifor developing certain computer programs that facilitated and Ehrlichia with Neorickettsia, descriptions of six new species combinations the creation of signature sequence files and for help in setting up and designation of Ehrlichia equi and ‘HGE agent’ as subjective synonyms Int. J. Syst. Evol. Microbiol. the bacterial signatures website (www.bacterialphylogeny.com). of Ehrlichia phagocytophila. 51, 2145–2165. Eisen, J.A. 1995. The RecA protein as a model molecule for molecular systematic Thanks are also due to Emma Griffiths and Pinay Kanth for studies of bacteria: Comparison of trees of RecAs and 16S rRNAs from the helpful comments on the manuscript. The work on signature same species. J. Mol. Evol. 41, 1105–1123. sequences described here was mostly completed by August Emelyanov, V.V. 2001a. Evolutionary relationship of Rickettsiae and mitochon- 2004. This work was supported by a research grant from the dria. FEBS Letters 501, 11–18. rickettsia National Science and Engineering Research Council of Canada Emelyanov, V.V. 2001b. Rickettsiaceae, -like endosymbionts, and the origin of mitochondria. Biosci. Rep. 21, 1–17. and the Canadian Institute of Health Research.

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University Emelyanov, V.V.2003a. Common evolutionary origin of mitochondrial and rick- ettsial respiratory chains. Arch. Biochem. Biophys. 420, 130–141. Emelyanov, V.V. 2003b. Mitochondrial connection to the origin of eukaryotic REFERENCES cell. Eur. J. Biochem. 270, 1599–1618. Andersson, S.G., Zomorodipour, A., Andersson, J.O., Sicheritz-Ponten, T., Espeli, O., Lee, C., and Marians, K.J. 2003. A physical and functional interaction Alsmark, U.C., Podowski, R.M., Naslund, A.K., Eriksson, A.S., Winkler, between Escherichia coli FtsK and topoisomerase IV. J. Biol. Chem. 278, H.H., and Kurland, C.G. 1998. The genome sequence of Rickettsia prowazekii 44639–44644. and the origin of mitochondria. Nature 396, 133–140. Esser, C., Ahmadinejad, N., Wiegand, C., Rotte, C., Sebastiani, F., Gelius- Baldauf, S.L., and Palmer, J.D. 1993. Animals and fungi are each other’s closest Dietrich, G., Henze, K., Kretschmann, E., Richly, E., Leister, D., Bryant, relatives: Congruent evidence from multiple proteins. Proc. Natl. Acad. Sci. D., Steel, M.A., Lockhart, P.J., Penny, D., and Martin, W. 2004. A Genome USA 90, 11558–11562. Phylogeny for Mitochondria Among {alpha}-Proteobacteria and a Predom- Battistuzzi, F.U., Feijao, A., and Hedges, S.B. 2004. A genomic timescale of inantly Eubacterial Ancestry of Yeast Nuclear Genes. Mol. Biol. Evol. 21, prokaryote evolution: Insights into the origin of methanogenesis, phototrophy, 1643–1660. and the colonization of land. BMC. Evol. Biol. 4, 44. Falah, M., and Gupta, R.S. 1994. Cloning of the hsp70 (dnaK) genes from Bengtsson, J., von Wachenfeldt, C., Winstedt, L., Nygaard, P., and Hederstedt, Rhizobium meliloti and Pseudomonas cepacia:Phylogenetic analyses of mi- L. 2004. CtaG is required for formation of active cytochrome c oxidase in tochondrial origin based on a highly conserved protein sequence. J. Bacteriol. Bacillus subtilis. Microbiology 150, 415–425. 176, 7748–7753. PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 133

Galibert, F., Finan, T.M., Long, S.R., Puhler, A., Abola, P., Ampe, F., Barloy- Gupta, R.S., Pereira, M., Chandrasekera, C., and Johari, V. 2003. Molecular Hubler, F., Barnett, M. J., Becker, a., Boistard, P., Bothe, G., Boutry, M., signatures in protein sequences that are characteristic of Cyanobacteria and Bowser, L., Buhrmester, J., Cadieu, E., Capela, D., Chain, P., Cowie, A., plastid homologues. Int. J. Syst. Evol. Microbiol. 53, 1833–1842. Davis, R. W., Dreano, s., Federspiel, N. A., Fisher, R. F., Gloux, S., godrie, T., Gupta, R.S., and Singh, B. 1994. Phylogenetic analysis of 70 kD heat shock Goffeau, A., Golding, B., Gouzy, J., Gurjal, M., Hernandez-Lucas, I., Hong, protein sequences suggests a chimeric origin for the eukaryotic cell nucleus. A., Huizar, L., Hyman, R. W., Jons, T., Kahn, D., Kahn, M. L., Kalman, S., Curr. Biol. 4, 1104–1114. Keating, D. H., Kiss, E., Komp, c., Lelaure, v., Masuy, d., Palm, C., Peck, Hiser, L., Di Valentin,M., Hamer, A.G., and Hosler, J.P.2000. Cox11p is required M. C., Pohl, T. M., Portetelle, d., Purnelle, B., Ramsperger, U., Surzycki, for stable formation of the Cu(B) and magnesium centers of cytochrome c r., Thebault, P., Vandenbol, M., Vorholter, F. J., Weidner, S., Wells, D. H., oxidase. J. Biol. Chem. 275, 619–623. Wong, K., Yeh, K. C., and Batut, J. 2001. The composite genome of the Hui, F.M., and Morrison, D.A. 1993. Identification of a purC gene from Strep- legume symbiont Sinorhizobium meliloti. Science 293, 668–672. tococcus pneumoniae. J. Bacteriol. 175, 6364–6367. Garrity, G.M., and Holt, J.G. 2001. The road map to the manual. In Bergey’s Ip, S.C., Bregu, M., Barre, F.X., and Sherratt, D.J. 2003. Decatenation of DNA Manual of Systematic Bacteriology, eds. D. R. Boone and R. W. Castenholz, circles by FtsK-dependent Xer site-specific recombination. EMBO J. 22, 119–166. Springer-Verlag, Berlin. 6399–6407. Gaunt, M.W., Turner, S.L., Rigottier-Gois, L., Lloyd-Macgilp, S.A., and Young, Jeanmougin, F., Thompson, J.D., Gouy, M., Higgins, D.G., and Gibson, T.J. J.P. 2001. Phylogenies of atpD and recA support the small subunit rRNA- 1998. Multiple sequence alignment with Clustal x. Trends Biochem. Sci. 23, based classification of rhizobia. Int. J. Syst. Evol. Microbiol. 51, 2037– 403–405. 2048. Kaneko, T., Nakamura, Y., Sato, S., Asamizu, E., Kato, T., Sasamoto, S., Gonzales, T., and Robert-Baudouy, J. 1996. Bacterial aminopeptidases: Proper- Watanabe, a., Idesawa, K., Ishikawa, a., Kawashima, K., Kimura, t., Kimura, ties and functions. FEMS Microbiol. Rev. 18, 319–344. T., Kishida, Y., Kiyokawa, c., Kohara, M., Matsumoto, M., Matsuno, a., Gray, M.W. 1989. The evolutionary origins of organelles. Trends in Genet. 5, Mochizuki, Y., Nakayama, S., Nakazaki, N., Shimpo, S., Sugimoto, M., 294–299. Takeuchi, C., Yamada, M., and tabata, S., Complete genome structure of Gray, M.W., Burger, G., and Lang, B.F. 1999. Mitochondrial evolution. Science the nitrogen-fixing symbiotic bacterium Mesorhizobium loti. DNA Res. 7, 283, 1476–1481. 331–338. Gray, M.W., and Doolittle, W.F. 1982. Has the endosymbiont hypothesis been Kaneko, T., Nakamura, Y., Sato, S., Minamisawa, K., UCHIUMI, T., Sasamoto, proven?. Microbiol. Rev. 46, 1–42. s., Watanabe, A., Idesawa, K., Iriguchi, M., Kawashima, K., Kohara, M., Griffiths, E., and Gupta, R.S. 2002. Protein signatures distinctive of chlamy- Matsumoto, M., Shimpo, S., Tsuruoka, H., Wada, T., Yamada, M., and Tabata, dial species: Horizontal transfer of cell wall biosynthesis genes glmU from S., 2002. Complete genomic sequence of nitrogen-fixing symbiotic bacterium Archaebacteria to Chlamydiae, and murA between Chlamydiae and Strepto- Bradyrhizobium japonicum USDA110. DNA Res. 9, 189–197. myces. Microbiology 148, 2541–2549. Karlin, S., and Brocchieri, L. 2000. Heat shock protein 60 sequence comparisons: Griffiths, E., and Gupta, R.S. 2004a. Distinctive protein signatures pro- Duplications, lateral transfer, and mitochondrial evolution. Proc. Natl. Acad. vide molecular markers and evidence for the monophyletic nature of the Sci. USA 97, 11348–11353. Deinococcus-Thermus phylum. J. Bacteriol. 186, 3097–3107. Karlin, S., Brocchieri, L., Mrazek, J., Campbell, A.M., and Spormann, A.M. Griffiths, E., and Gupta, R.S. 2004b. Signature sequences in diverse proteins 1999. A chimeric prokaryotic ancestory of mitochondria and primitive eu- provide evidence for the late divergence of the order Aquificales. International karyotes. Proc. Natl. Acad. Sci. USA 96, 9190–9195. Microbiol. 7, 41–52. Kersters, K., Devos, P., Gillis, M., Vandamme, P., and Stackebrandt, E. 2003.

For personal use only. Gupta, R.S. 1995. Evolution of the chaperonin families (Hsp60, Hsp10 and Tcp- Introduction to the Proteobacteria. In The Prokaryotes: An Evolving Elec- 1) of proteins and the origin of eukaryotic cells. Mol. Microbiol. 15, 1–11. tronic Resource for the Microbiological Community, ed. M. e. al. Dworkin, Gupta, R.S. 1998. Protein phylogenies and signature sequences: a reappraisal of Springer-Verlag, New York. evolutionary relationships among archaebacteria, eubacteria, and eukaryotes. Kolber, Z.S., Plumley, F.G., Lang, A.S., Beatty, J.T., Blankenship, R.E., Microbiol. Mol. Biol. Rev. 62, 1435–1491. VanDover, C.L., Vetriani, C., Koblizek, M., Rathgeber, C., and Falkowski, Gupta, R.S. 2000. The phylogeny of Proteobacteria: Relationships to other eu- P.G. 2001. Contribution of aerobic photoheterotrophic bacteria to the carbon bacterial phyla and eukaryotes. FEMS Microbiol. Rev. 24, 367–402. cycle in the ocean. Science 292, 2492–2495. Gupta, R.S. 2001. The branching order and phylogenetic placement of species Ku, M.S., Kano-Murakami, Y., and Matsuoka, M. 1996. Evolution and expres- from completed bacterial genomes, based on conserved indels found in various sion of C4 photosynthesis genes. Plant Physiol. 111, 949–957. proteins. Inter. Microbiol. 4, 187–202. Kurland, C.G., and Andersson, S.G. 2000. Origin and evolution of the mito- Gupta, R.S. 2002. Phylogeny of Bacteria: Are we now close to understanding chondrial proteome. Microbiol. Mol. Biol. Rev. 64, 786–820. it?. ASM News. 68, 284–291. Lake, J.A., and Rivera, M.C. 1994. Was the nucleus the first endosymbiont?

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University Gupta, R.S. 2003. Evolutionary relationships among photosynthetic bacteria. Proc. Natl. Acad. Sci. USA 91, 2880–2881. Photosynth. Res. 76, 173–183. Lang, B.F., Gray, M.W., and Burger, G. 1999. Mitochondrial genome evo- Gupta, R.S. 2004. The phylogeny and signature sequences characteristics of lution and the origin of eukaryotes. Annual Review of Genetics 33, 351– Fibrobacters, Chlorobi and Bacteroidetes. Crit. Rev. Microbiol. 30, 123– 397. 143. Larimer, F.W., Chain, P., Hauser, L., Lamerdin, J. Malfatti, S., Do, L., Land, Gupta, R.S., Aitken, K., Falah, M., and Singh, B. 1994. Cloning of Giardia M. L., Pelletier, D. A., Beatty, J. t., Lang, A. S., Tabita, F. R., Gibson, J. L., lamblia heat shock protein HSP70 homologs: Implications regarding origin Hanson, T. E., Bobst, C., Torres, J. L., Peres, C., Harrison, F. H., Gibson, of eukaryotic cells and of endoplasmic reticulum. Proc. Natl. Acad. Sci. USA J., and Harwood, C. S., 2004. Complete genome sequence of the metaboli- 91, 2895–2899. cally versatile photosynthetic bacterium Rhodopseudomonas palustris. Nat. Gupta, R.S., Bustard, K., Falah, M., and Singh, D. 1997. Sequencing of heat Biotechnol. 22, 55–56. shock protein 70 (DnaK) homologs from Deinococcus proteolyticus and Ther- Leyva, J.A., Bianchet, M.A., and Amzel, L.M. 2003. Understanding ATP syn- momicrobium roseum and their integration in a protein-based phylogeny of thesis: Structure and mechanism of the F1-ATPase (Review). Mol. Membr. prokaryotes. J. Bacteriol. 179, 345–357. Biol. 20, 27–33. Gupta, R.S., and Golding, G.B. 1996. The origin of the eukaryotic cell. Trends Lopez-Garcia, P., and Moreira, D. 1999. Metabolic symbiosis at the origin of Biochem. Sci. 21, 166–171. eukaryotes. Trends Biochem. Sci. 24, 88–93. Gupta, R.S., and Griffiths, E. 2002. Critical issues in bacterial phylogenies. Ludwig, W., and Klenk, H.-P. 2001. Overview: A phylogenetic backbone and Theor. Popul. Biol. 61, 423–434. taxonomic framework for prokaryotic systamatics. In Bergey’s Manual of 134 R. S. GUPTA

Systematic Bacteriology, eds. D. R. Boone and R. W. Castenholz, 49–65. Ayodeji, B., Kraul, M., Shetty, J., Malek, J., Van Aken, S. E., Reidmuller, S., Springer-Verlag, Berlin. Tettelin, H., Gill, S. R., White, O., Salzberg, S. L., Hoover, D. L., Lindler, L. Ludwig, W., and Schleifer, K.H. 1999. Phylogeny of Bacteria beyond the 16S E., Halling, s. M., Boyle, S. M., and Fraser, C. M., 2002. The Brucella suis rRNA Standard. ASM News 65, 752–757. genome reveals fundamental similarities between animal and plant pathogens Maidak, B.L., Cole, J.R., Lilburn, T.G., Parker, C.T., Jr., Saxman, P.R., Far- and symbionts. Proc. Natl. Acad. Sci. USA 99, 13148–13153. ris, R.J., Garrity, G.M., Olsen, G.J., Schmidt, T.M., and Tiedje, J.M. 2001. Qi, H.Y., Sankaran, K., Gan, K., and Wu, H.C. 1995. Structure-function rela- The RDP-II (Ribosomal Database Project). Nucleic Acids Res. 29, 173– tionship of bacterial prolipoprotein diacylglyceryl transferase: Functionally 174. significant conserved regions. J. Bacteriol. 177, 6820–6824. Margulis, L. 1970. Origin of Eukaryotic cells. YaleUniversity Press, New Haven, Reuter, K., and Ficner, R. 1995. Sequence analysis and overexpression of the CT. Zymomonas mobilis tgt gene encoding tRNA-guanine transglycosylase: Pu- Margulis, L. 1993. Symbiosis in Cell Evolution. W.H. Freeman and Company, rification and biochemical characterization of the enzyme. J. Bacteriol. 177, New York. 5284–5288. Margulis, L. 1996. Archaeal-eubacterial mergers in the origin of Eukarya: Phy- Ribeiro, S., and Golding, G.B. 1998. The mosaic nature of the eukaryotic nu- logenetic classification of life. Proc. Natl. Acad. Sci. USA 93, 1071–1076. cleus. Mol. Biol. Evol. 15, 779–788. Martin, W., and Muller, M. 1998. The hydrogenosome hypothesis for the first Rivera, M.C., and Lake, J.A. 1992. Evidence that eukaryotes and eocyte prokary- eukaryote. Nature 392, 37–41. otes are immediate relatives. Science 257, 74–76. Martins-Pinheiro, M., Galhardo, R.S., Lage, C., Lima-Bessa, K.M., Aires, K.A., Rivera, M.C., and Lake, J.A. 2004. The ring of life provides evidence for a and Menck, C.F. 2004. Different patterns of evolution for duplicated DNA genome fusion origin of eukaryotes. Nature 431, 152–155. repair genes in bacteria of the Xanthomonadales group. BMC. Evol. Biol. 4, Rokas, A., and Holland, P.W. 2000. Rare genomic changes as a tool for phylo- 29. genetics. Trends Ecol. Evol. 15, 454–459. McLeod, M.P., Qin, X., Karpathy, S.E., Gioia, J. Highlander, S. K., Fox, G. E., Romanowski, M.J., Bonanno, J.B., and Burley, S.K. 2002. Crystal structure of McNeill, T. Z., Jiang, H., Muzny, d., Jacob, L. S., Hawes, A. C., Sodergren, the Escherichia coli glucose-inhibited division protein B (GidB) reveals a E., Gill, R., Hume, J., Morgan, M., Fan, G., Amin, A. G., Gibbs, R. A., methyltransferase fold. Proteins 47, 563–567. Hong, C., Yu, X. J., Walker, D. H., and Weinstock, G. M., 2004. Complete Sadowsky, M.J. and P.H. Graham. 2000. Root and Stem Nodule Bacteria of genome sequence of Rickettsia typhi and comparison with sequences of other Legumes. In The Prokaryotes: An Evolving Electronic Resource for the Mi- rickettsiae. J. Bacteriol. 186, 5842–5855. crobiological Community, ed. M. e. al. Dworkin. Springer-Verlag, New York. Messer, W. 2002. The bacterial replication initiator DnaA. DnaA and oriC, the Sawada, H., Kuykendall, L.D., and Young, J.M. 2003. Changing concepts in bacterial mode to initiate DNA replication. FEMS Microbiol. Rev. 26, 355– the systematics of bacterial nitrogen-fixing legume symbionts. J. Gen. Appl. 374. Microbiol. 49, 155–179. Morden, C.W., Delwiche, C.F., Kuhsel, M., and Palmer, J.D. 1992. Gene phy- Sicheritz-Ponten, T., Kurland, C.G., and Andersson, S.G. 1998. A phylogenetic logenies and the endosymbiotic origin of plastids. Biosystems 28, 75–90. analysis of the cytochrome b and cytochrome c oxidase I genes supports an Moreno, E., and Moriyon, I. 2001. The Genus Brucella. The Prokaryotes: An origin of mitochondria from within the Rickettsiaceae. Biochim. Biophys. Evolving Electronic Resource for the Microbiological Community.Ined. Acta. 1365, 545–551. M. e. al. Dworkin. Springer-Verlag, New York. Sixma, T.K. 2001. DNA mismatch repair: MutS structures bound to mismatches. Moulin, L., Bena, G., Boivin-Masson, C., and Stepkowski, T. 2004. Phylogenetic Curr. Opin. Struct. Biol. 11, 47–52. analyses of symbiotic nodulation genes support vertical and lateral gene co- Soni, R.K., Mehra, P., Choudhury, N.R., Mukhopadhyay, G., and Dhar, S.K.

For personal use only. transfer within the Bradyrhizobium genus. Mol. Phylogenet. Evol. 30, 720– 2003. Functional characterization of Helicobacter pylori DnaB helicase. 732. Nucleic Acids Res. 31, 6828–6840. Murray, R.G.E., Brenner, D.J., Colwell, R.R., De Vos, P., Goodfellow, M., Stackebrandt, E. 2000. Defining Taxonomic Ranks. In The Prokaryotes: An Grimont, P.A.D., Pfennig, N., Stackebrandt, E., and Zavarzin, G.A. 1990. Evolving Electronic Resource for the Microbiological Community, ed. M. e. Report of the Ad Hoc Committee on approaches to taxonomy within the al. Dworkin. Springer-Verlag, New York. Proteobacteria. Int. J. Syst. Bacteriol. 40, 213–215. Stackebrandt, E., Murray, R.G.E., and Tr¨uper, H.G. 1988. Proteobacteria classis Nierman, W.C., Feldblyum, T.V., Laub, M.T., Paulsen, I.T., Nelson, K.E., Eisen, nov., a name for the phylogenetic taxon that includes the “Purple bacteria and J., Heidelberg, J.F., Alley, M.R., Ohta, N., Maddock, J.R., Potocka, I., Nelson, their Relatives.” Int. J. Syst. Bacteriol. 38, 321–325. W.C., Newton, A., Stephens, C., Phadke, N.D., Ely, B., DeBoy, R.T., Dodson, Stepkowski, T., Czaplinska, M., Miedzinska, K., and Moulin, L. 2003. The R.J., Durkin, A.S., Gwinn, M.L., Haft, D.H., Kolonay, J.F., Smit, J., Craven, variable part of the dnaK gene as an alternative marker for phylogenetic M.B., Khouri, H., Shetty, J., Berry, K., Utterback, T., Tran, K., Wolf, A., Va- studies of rhizobia and related alpha Proteobacteria. Syst. Appl. Microbiol. mathevan, J., Ermolaeva, M., White, O., Salzberg, S.L., Venter, J.C., Shapiro, 26, 483–494.

Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University L., and Fraser, C.M. 2001. Complete genome sequence of Caulobacter cres- Stryer, L. 1995. Biochemistry. W.H. Freeman and Co., New York. centus. Proc. Natl. Acad. Sci. USA 98, 4136–4141. Taillardat-Bisch, A.V., Raoult, D., and Drancourt, M. 2003. RNA polymerase Ogata, H., Audic, S., Renesto-Audiffren, P., Fournier, P.E., Barbe, V., Samson, beta-subunit-based phylogeny of Ehrlichia spp., Anaplasma spp., Neorick- D., Roux, V., Cossart, P., Weissenbach, J., Claverie, J.M., and Raoult, D. 2001. ettsia spp. and Wolbachia pipientis. Int. J. Syst. Evol. Microbiol. 53, 455– Mechanisms of evolution in Rickettsia conorii and R. prowazekii. Science 293, 458. 2093–2098. van Berkum, P., Terefework, Z., Paulin, L., Suomalainen, S., Lindstrom, K., and Olsen, G. J., Woese, C. R., and Overbeek, R. 1991. The winds of (evolutionary) Eardly, B.D. 2003. Discordant phylogenies within the rrn loci of Rhizobia. change: Breathing new life into microbiology. J. Bacteriol. 176, 1–6. J. Bacteriol. 185, 2988–2998. Opperman, T., and Richardson, J.P. 1994. Phylogenetic analysis of sequences Van Sluys, M.A., Monteiro-Vitorello, C.B., Camargo, L.E., Menck, C.F., da from diverse bacteria with homology to the Escherichia coli rho gene. J. Bac- Silva, A.C., Ferro, J.A., Oliveira, M.C., Setubal, J.C., Kitajima, J.P., and teriol. 176, 5033–5043. Simpson, A.J. 2002. Comparative genomic analysis of plant-associated bac- Parker, A.R., and Eshleman, J.R. 2003. Human MutY: Gene structure, protein teria. Annu. Rev. Phytopathol. 40, 169–189. functions and interactions, and role in carcinogenesis. Cell Mol. Life Sci. 60, Viale, A.M., and Arakaki, A.K. 1994. The chaperone connection to the origins 2064–2083. of the eukaryotic organelles. FEBS Letters 341, 146–151. Paulsen, I.T., Seshadri, R., Nelson, K.E., Eisen, J.A. Heidelberg, J. F., Read, T. Viale, A.M., Arakaki, A.K., Soncini, F.C., and Ferreyra, R.G. 1994. Evolutionary D., Dodson, R. J., Umayam, L., Brinkac, L. M., Beanan, M. J., Daugherty, s. relationships among eubacterial groups as inferred from GroEL (chaperonin) C., DeBoy, R. T., Durkin, A. S., Kolonay, J. F., Madupu, r., Nelson, W. C., sequence comparisons. Int. J. Syst. Bacteriol. 44, 527–533. PHYLOGENY AND SIGNATURES DISTINCTIVE OF α-PROTEOBACTERIA 135

Wang, E.T., van Berkum, P., Beyene, D., Sui, X.H., Dorado, O., Chen, W.X., P., Jung, M., Krespan, W., Perry, M., Gordon-Kamm, B., Lioa, L., Kim, S., and Martinez-Romero, E. 1998. Rhizobium huautlense sp. nov., a symbiont of Hendrick, C., Zhao, Z. Y., Dolan, M., Chumley, F., Tingey, S. V., Tomb, Sesbania herbacea that has a close phylogenetic relationship with Rhizobium J. F., Godon, M. P., Olson, M. V., and Nester, E. W., 2001. The genome of the galegae. Int. J. Syst. Bacteriol. 48 Pt. 3, 687–699. natural genetic engineer Agrobacterium tumefaciens C58. Science 294, 2317– Woese, C.R., Stackebrandt, E., Macke, R.J., and Fox, G.E. 1985. A phylogenetic 2323. definition of the major eubacterial taxa. System. Appl. Microbiol. 6, 143– Young, J.M., Kuykendall, L.D., Martinez-Romero, E., Kerr, A., and Sawada, 151. H. 2001. A revision of Rhizobium Frank 1889, with an emended description Woese, C.R., Stackebrandt, E., Weisburg, W.G., Paster, B.J., Madigan, M.T., of the genus, and the inclusion of all species of Agrobacterium Conn 1942 Fowler, C.M.R., Hahn, C.M., Blanz, P., Gupta, R., Nealson, K.H., and Fox, and Allorhizobium undicola de Lajudie et al. 1998 as new combinations: G.E. 1984The phylogeny of purple bacteria: The alpha subdivision. System. Rhizobium radiobacter, R. rhizogenes, R. rubi, R. undicola and R. vitis. Int. Appl. Microbiol. 5, 315–326. J. Syst. Evol. Microbiol. 51, 89–103. Wolf, Y.I., Aravind, L., and Koonin, E.V. 1999. Rickettsiae and Chlamydiae— Yu, X.J. and D. H. Walker. 2003. The Order Rickettsiales. In The Prokaryotes: evidence of horizontal gene transfer and gene exchange. Trends Genet 15, An Evolving Electronic Resource for the Microbiological Community, ed. 173–175. M. e. al. Dworkin. Springer-Verlag, New York. Wood, D.W., Setubal, J.C., Kaul, R., Monks, D.E. Kitajima, J. P., Okura, V. K., Yu, X.J., Zhang, X.F., McBride, J.W., Zhang, Y., and Walker, D.H. 2001. Phylo- Zhou, Y., Chen, L., Wood, G. E., Almeida, N. F., Jr., Woo, L., Chen, Y.,Paulsen, genetic relationships of Anaplasma marginale and ‘Ehrlichia platys’ to other I. T., Eisen, J. A., Karp, P. D., Bovee, D., Sr., Chapman, P., Clendenning, Ehrlichia species determined by GroEL amino acid sequences. Int. J. Syst. J., Deatherage, G., Gillet, W., Grant, c., Kutyavin, T., Levy, R., Li, M. J., Evol. Microbiol. 51, 1143–1146. McClelland, E., Palmieri, A., Raymond, C., Rouse, G., Saenphimmachak, Yurkov, V.V., and Beatty, J.T. 1998. Aerobic anoxygenic phototrophic bacteria. C., Wu, Z., Romero, P., Gordon, D., Zhnag, S., Yoo, H., Tao, Y., Biddle, Microbiol. Mol. Biol. Rev. 62, 695–724. For personal use only. Critical Reviews in Microbiology Downloaded from informahealthcare.com by Cornell University