Experimental Parasitology 123 (2009) 236–243

Contents lists available at ScienceDirect

Experimental Parasitology

journal homepage: www.elsevier.com/locate/yexpr

Babesia bovis: A comprehensive phylogenetic analysis of plastid-encoded genes supports green algal origin of apicoplasts

Audrey O.T. Lau a,*, Terry F. McElwain a, Kelly A. Brayton a, Donald P. Knowles a,b, Eric H. Roalson c a Program in Genomics, Department of Veterinary Microbiology & Pathology, School for Global Animal Health, College of Veterinary Medicine, Washington State University, Pullman, WA 99164-7040, USA b Animal Diseases Research Unit, United States Department of Agriculture–Agricultural Research Service, Washington State University, Pullman, WA 99164-7030, USA c School of Biological Sciences and Center for Integrated Biotechnology, Washington State University, Pullman, WA 99164-4236, USA article info abstract

Article history: Apicomplexan parasites commonly contain a unique, non-photosynthetic plastid-like organelle termed Received 30 April 2009 the apicoplast. Previous analyses of other plastid-containing organisms suggest that apicoplasts were Received in revised form 23 July 2009 derived from a red algal ancestor. In this report, we present an extensive phylogenetic study of apicoplast Accepted 24 July 2009 origins using multiple previously reported apicoplast sequences as well as several sequences recently Available online 29 July 2009 reported. Phylogenetic analysis of amino acid sequences was used to determine the evolutionary origin of the organelle. A total of nine plastid genes from 37 species were incorporated in our study. The data Keywords: strongly support a green algal origin for apicoplasts and Euglenozoan plastids. Further, the nearest green Apicoplasts algae lineage to the Apicomplexans is the parasite Helicosporidium, suggesting that apicoplasts may have Apicomplexans Bayesian inference originated by lateral transfer from green algal parasite lineages. The results also substantiate earlier find- Euglenozoa ings that plastids found in Heterokonts such as Odontella and Thalassiosira were derived from a separate Red and green algae secondary endosymbiotic event likely originating from a red algal lineage. Ó 2009 Elsevier Inc. All rights reserved.

1. Introduction 1992; Delwiche et al., 1995; Yoon et al., 2002). However, the iden- tity of the secondary endosymbiont remains controversial and the Since the first genome report in Plasmodium (Gardner et al., phylogenetic origin of the apicoplast has been contested (Blanchard 2002), apicoplast genomes have been detected in Theileria, Eimeria, and Hicks, 1999; Cai et al., 2003; Fast et al., 2001; Funes et al., 2002; Toxoplasma, and through complete genome sequencing ef- Kohler et al., 1997; Waller et al., 2003; Waller and McFadden, 2005; forts (Abrahamsen et al., 2004; Brayton et al., 2007; Dunn et al., Williamson et al., 1994; Zhang et al., 1999). 1998; Gardner et al., 2002, 2005; Toso and Omoto, 2007a,b; Xu Most of the genes encoded by the ancestral photosynthetic et al., 2004). Notably, genome sequencing has failed to detect the plastid genome have been lost or have migrated to the nucleus, presence of an apicoplast genome for Cryptosporidium spp. (Abra- resulting in much reduced genome sizes (Gray, 1992, 1993; Medlin hamsen et al., 2004; Xu et al., 2004), and ultrastructural studies et al., 1995). Therefore, the apicoplast genome typically encodes indicate that the more distantly related Gregarines(Toso and Omot- less than 1% of the total number of chromosomal genes. Plastid o, 2007a,b) and Archigregarines (Simdyanov and Kuvardina, 2007) and nuclear-encoded genes such as tufA, rpo (B and C), cox2a and do not appear to contain an apicoplast. While the specific role of cox2b suggest that apicoplasts originated from green algae (Cai the apicoplast in the Apicomplexan life cycle is for the most part un- et al., 2003; Funes et al., 2002; Kohler et al., 1997; Williamson clear, in Plasmodium falciparum, the causative agent of , the et al., 1994) while studies using gene order, other apicoplast- apicoplast has been demonstrated to be involved in de novo fatty (Blanchard and Hicks, 1999) or nuclear-encoded genes whose acid synthesis (Waller et al., 2003). This biosynthetic pathway, protein products are translocated to the plastid (Fast et al., 2001; which is considered a novel chemotherapeutic target (Gornicki, Waller and McFadden, 2005) suggest that this organelle originates 2003), is identical to those utilized in plant chloroplasts and bacte- from red algae. Many of these studies were limited in scope due to ria. Acquisition of the multi-walled apicoplast must have involved a paucity of genetic information from a diverse selection of Api- at least two endosymbiotic events (Keeling, 2004), and complexan organisms. Additional genome sequences that have phylogenetic evidence indicates that a bacterium, probably a recently become available include (Gardner et al., cyanobacterium, was the primary endosymbiont (Cavalier-Smith, 2005), T. annulata (Pain et al., 2005), Babesia bovis (Brayton et al., 2007), and Thalassiosira pseudonana (Armbrust et al., 2004) and have considerably increased the sampling density of plastid- * Corresponding author. Fax: +1 509 335 8529. E-mail address: [email protected] (A.O.T. Lau). encoded genes. We have utilized comprehensive, multi-gene

0014-4894/$ - see front matter Ó 2009 Elsevier Inc. All rights reserved. doi:10.1016/j.exppara.2009.07.007 A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243 237

Bayesian inference analyses to determine that the apicoplast gen- rpl14, rpl16, rpoB, rpoC1, rpoC2, rps3, rps11, rps12, and tufA. The omes originated from a green algal lineage. In addition, our results large and small subunit rRNA were excluded for several reasons. substantiate earlier findings that plastids found in Heterokonts First, the widely discussed issues of alignment of rRNA genes at such as Odontella and Thalassiosira were derived from a separate this phylogenetic depth (Dacks et al., 2002; Gribaldo and Philippe, secondary endosymbiotic event likely originating from a red algal 2002) make these genes poor candidates for the current phyloge- lineage. Odontella and Thalassiosira are types of marine diatoms netic analysis. Second, many plastid genomes have more than with worldwide distribution. Our findings differ from the current one copy of the LSU and SSU while some, including Babesia and chromalveolate hypothesis, which states that chromists and alveo- Theileria, have only one of each, complicating assessment of their lates retain their plastids from a red algal ancestry. homology. After initial alignments of the clpC genes, the data indi- cated that there were issues of orthology of this gene complicated by several apparent transfers of the orthologs to the nuclear com- 2. Materials and methods partment. Therefore, the clpC gene was also excluded from the analyses. Finally, some genes are missing for a few taxa, including Complete plastid sequences from a diversity of Cyanobacteria Anthoceros: tufA; Chlamydomonas: rpoC1; Oryza: tufA; and Thalass- and were collected from GenBank with their individual iosira: rpl14, rpl16, rps3, rps11, and rps12. In all, this represents accession numbers provided in Table 1. All completely sequenced eight missing gene copies from 252 total possible copies for this plastids from Eukaryotes were included with the exception of the matrix (3%). land plants, where a subset of complete plastids were included. Amino acid sequence alignments were performed using a two- Gene overlap between these samples and Apicomplexan apicop- step process. First, amino acid sequences were compiled and lasts were found to include the genes clpC, LSU rRNA, SSU rRNA, aligned using Clustal X 1.83 (Huelsenbeck and Bollback, 2001; Thompson et al., 1997) with the Gonnet 250 cost matrix applied to pairwise alignments and the Gonnet series applied to the multi- Table 1 Taxa sampled and their corresponding plastid GenBank accession number. ple alignments. Multiple amino acid alignment models were com- pared and these different alignment options had little effect on Cyanobacteria preliminary analyses (data not shown). Alignment results sug- Gloeobacter violaceus BA000045 Nostoc anabaena PCC7120 BA000019 gested that some regions of the genes were much less conserved Prochlorococcus marinus MIT9313 BX548175 than others, with significantly greater amounts of inferred indel Synechococcus sp. WH8102 BX548020 events in these regions. Due to the uncertainty of the alignments Thermosynechococcus elongatus BP-1 BA000039 in these gene regions, Gblocks (Castresana, 2000; Talavera and Cas- Apicomplexans tresana, 2007) was used to select those regions of the aligned se- Babesia bovis T2BO AAXT00000000 quences that are confidently aligned for analysis. Gblocks Eimeria tenella AY217738 Plasmodium falciparum 3D7 X95275/X95276 eliminates poorly aligned positions and divergent regions of an Theileria parva AAGK01000009 alignment of DNA or protein sequences and selects sequence seg- Toxoplasma gondii U87145 ments that lack large segments of contiguous non-conserved posi- Euglenozoans tions, lack of gap positions and high conservation of flanking Astasia longa AJ294725 positions. Euglena gracilis Z Z11874 Maximum likelihood (ML) analyses of the complete and trun- Haptophyceae cated nucleotide matrices were performed using PAUP* 4.0b10 Emiliana huxleyi AY741371 and heuristic searches were employed with the starting tree ob- Cercozoa tained via neighbor-joining (NJ) and using the tree-bisection- Bigelowiella natans DQ851108 reconnection (TBR) branch swapping algorithm (Swofford et al., Viridiplantae 2001). Clade support was estimated using 100 heuristic bootstrap Anthoceros formosae AB086179 replicates using a reduced data set (four Viridiplantae, one Hapto- Arabidopsis thaliana AP00423 phyceae and one Cercozoa were omitted as compared to the final Chaetosphaeridium globosum AF494278 Chlamydomonas reinhardtii BK000554 set of taxa included in the Bayesian analysis). Results from the Chlorella vulgaris C-27 AB001684 ML were congruent with the final Bayesian results, thus none of Helicosporidium sp. ex Simulium jonesii DQ398104 the ML data were shown to avoid redundancy in the report. Bayes- Leptosira terrestris EF506945 ian inference analysis was performed on the Gblocks individual Mesostigma viride AF166114 Nephroselmis olivacea AF137379 and combined-gene matrices using MrBayes v.3.0 (Huelsenbeck Oltmannsiellopsis viridis DQ291132 and Bollback, 2001). Seven and a half million generations were Oryza nivara AP006728 run with four chains (Markov Chain Monte Carlo), the heating Pseudenoclonium akinetum AY835431 parameter set at 0.05, and a tree was saved every 1000 generations. Scenedesmus obliqus DQ396875 Priors for all analyses included the mixed amino acid model imple- Stigeoclonium helvetiucum DQ630521 menting a covarion model, as applied in MrBayes. The covarion Heterokonts model allows for rates to change across the topology (Galtier, Odontella sinensis Z67753 Thalassiosira pseudonana EF067921 2001; Huelsenbeck et al., 2002; Tuffley and Steel, 1998). In order to test for the occurrence of stationarity, convergence, and mixing Rhodophytes Cyanidioschyzon merolae AB002583 within 7.5 million generations, multiple analyses were started Cyanidium caldarium RK1 AF022186 from different random locations in tree space. The posterior prob- Gracilaria tenuistipitata var. liui AY673996 ability distributions from these separate replicates were compared Porphyra purpurea U38804 for convergence to the same posterior probabilities across Cryptophytes branches. Majority rule consensus trees of those sampled in Bayes- Guillardia theta AF041468 ian inference analyses yielded probabilities that the clades are Rhodomonas salina EF508371 monophyletic (Lewis, 2001). The trees from the MrBayes analysis Glaucocystophytes were loaded into PAUP* 4.0b10 (Swofford et al., 2001), discarding Cyanophora paradoxa cyanelle U30821 the trees generated within the first 2,000,000 generations (those 238 A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243 sampled during the ‘‘burnin” of the chain (Huelsenbeck and Ron- gies, although often with less resolution and lower posterior quist, 2001), to only include trees after stationarity was estab- probability support for branches. The statistical support of these lished. Posterior probability values (pp) are presented on a relationships is very high as evidenced by the number of branches sample tree from the post-stationarity distribution of Bayesian with posterior probability values greater than 95%. Independent trees in order to demonstrate branch lengths. MrBayes analyses that were performed converged on the same posterior probability distribution of trees, suggesting that conver- gence and mixing were occurring in these analyses. Shorter analy- 3. Results and discussion ses (5 million generations) with 8 chains also converged similarly and fully resolved the consensus topology. Therefore, our results Babesia bovis, an Apicomplexan hemoparasite, is one of the most suggest that (i) the plastids in the Heterokonts Odontella and Tha- prevalent tick-borne pathogens of worldwide. Results from lassiosira originated separately from the Apicomplexans and are the B. bovis genome sequencing project revealed the presence of a likely derived from a red algal lineage (Fig. 1) and (ii) that the api- circular 33 kbp plastid-like genome (Brayton et al., 2007). Although coplast and the Euglenozoan plastids were similarly derived from the function of apicoplasts is not well established, an investigation green plant lineages. into its origin will no doubt provide insight into Apicomplexans’ evo- Overall patterns found here propose that the roots of the Eukary- lution and gene loss in parasites (Keeling, 2004). B. bovis apicoplast otic plastids are in the vicinity of the Glaucocystophyceae and red al- genome together with the recent availability of Theileria apicoplast gae. Strong branch support is found for the placement of the and Helicosporidium plastid sequences allowed us to conduct a com- Apicomplexans with the Euglenozoa and Viridiplantae (Chlorophyta prehensive phylogenetic analysis of apicoplast/plastid genomes and Streptophyta). Further, the Guillardia plastid is found to origi- using nine genes common to all apicoplast/plastid genomes, includ- nate from the red algae, as previously suggested (Hagopian et al., ing rpl14, rpl16, rpoB, rpoC1, rpoC2, rps3, rps11, rps12, and tufA. Anal- 2004) and finally, the Chlorophytes and Streptophytes, as tradition- yses were conducted on deduced amino acid sequence alignments, ally delimited, do not resolve into monophyletic groups (Fig. 1). and these were analyzed individually as well as together in a com- Previous analysis of the phylogenetic position of the Plasmo- bined analysis (Figs. 1 and 2). All of the genes are located on non- dium apicoplast suggested that some data supported a red algal recombining plastids, and therefore, share the same history. These origin, but that a combined analysis of all genes supported a green genes were analyzed for five Cyanobacteria, one Glaucocystophyte, algal origin (Blanchard and Hicks, 1999). The phylogenetic hypoth- two Cryptophyte, four red algae, two Heterokonts, 14 green plants eses presented in that study did not include estimates of branch (five Streptophytes and nine Chlorophytes), two Euglenozoans, one support and, therefore, how strongly one of these topologies was Haptophyceae, one Cercozoa, and five Apicomplexans (Table 1). supported over the other was unclear. Analysis of the individual Although the Apicomplexan, Sarcocystis muris, has been reported genes in our study generally did not provide strong support for to contain an apicoplast, sequences for this organelle are not cur- many of the internal branches of the trees, regardless of the rently available. Cryptosporidium hominis and C. parvum appear to topology found. These results differ from a recent review of plas- have lost their apicoplast and the associated genome, as genes of tid origins (Keeling, 2004) in which it is suggested that both the plastid origin were detected in the nuclear genome but no contigu- Heterokonts and Apicomplexans are derived through secondary ous apicoplast sequence was found in the complete genome (Abra- endosymbiosis from red algae (the chromalveolate hypothesis), hamsen et al., 2004; Wilson et al., 1996; Xu et al., 2004; Zhu et al., whereas the Euglenozoan plastid is derived from a green algal 2000). Gregarina niphandrodes and Selenidium orientale also appear source. Within the Apicomplexans, Babesia and Theileria are sis- to have lost their apicoplast genomes (Toso and Omoto, 2007a,b). ter lineages, as would be expected given their similar plastid It has been suggested that sequences from the apicoplast gen- genome organization, and these taxa together are sister to Plas- ome itself should not be used to determine its phylogenetic posi- modium (Fig. 1). Eimeria and Toxoplasma form a sister clade to tion due to a high AT content that drives long branch attraction this group of three. Furthermore, the parasitic green algae Heli- towards otherwise distantly related lineages (Keeling, 2004). How- cosporidium is strongly placed as sister to the Apicomplexans ever, where amino acid (or DNA) changes have been properly mod- in the combined analysis (Fig. 1), and this relationship shows eled and analyzed under a likelihood framework, these influences support in the rps3, rps11, rps12, rpoB, and rpoC1 individual-gene should be minimized. In addition, we used a covarion model that analyses (Fig. 2). specifically corrects for rate variation across the tree, further min- Results from several studies have been used as a basis for the imizing any potential for a long branch attraction effect from AT chromalveolate hypothesis. For example, red algal origins of the content variation. The suggestion that high AT content in apicop- apicoplast have been suggested using phylogenetic analysis of nu- last genomes improperly forces this lineage towards the green al- clear-encoded, plastid-targeted GAPDH genes (Fast et al., 2001). gae (Morton, 1999) is not supported by the similar AT content in Three aspects of that study should be noted. First, the authors both red and green algae. Last but not least, additional analyses use a complex model of derivation of these nuclear-encoded genes excluding the Euglenozoa plastid genes, which have a highly to explain why the plastid-targeted copies are more closely related biased AT content, were also conducted in our study (data not to eukaryotic cytosolic copies than to plastid copies from plants, shown) and resulted in the same topologies as in Fig. 1. Conse- red or green algae. It is not clear why these results favor red algal quently, we consider the combined data set results in these analy- origins, and while it appears that both the Apicomplexans and ses as the best estimate of relationships of eukaryotic plastids Dinoflagellates have undergone gene replacement, this does not di- without high AT content skewing the overall outcome. rectly address the origins of the plastids to which their nuclear-en- In this study, the analysis of 2826 amino acid (AA) combined- coded gene products are targeted. Second, the branching structure gene with Gblocks matrix resulted in a robust phylogenetic of the phylogeny presented had very low statistical support for hypothesis for apicoplast origins and so this combined data set is most branches, limiting confidence that the presented tree reflects used to represent our consensus hypothesis (Fig. 1). Individual- true branching relationships. Third, since nuclear genomes evolve gene analysis with Gblocks matrices varied in length from 115 to at different rates than those of the plastid (Lynch, 1997; Martin, 854 amino acids (rpl14 – 115 AA; rpl16 – 132 AA; rpoB – 854 AA; 1999), the accuracy of nuclear copies of formerly plastid-encoded rpoC1 – 433 AA; rpoC2 – 509 AA; rps3 – 147 AA; rps11 – 116 AA; genes to represent plastid origins is questionable. rps12 – 121 AA; and tufA – 399 AA). Bayesian analysis of these indi- Gene order and plastid structure have also been used to ad- vidual matrices (Fig. 2A–I) resulted in generally congruent topolo- dress the phylogenetic position of the apicoplast (Blanchard A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243 239

Fig. 1. Bayesian inference analysis of the combined-gene Gblocks amino acid alignment. Relationships represented by one of the post-burnin topologies in order to represent branch lengths. Posterior probabilities are denoted at each node when 50% or greater. Branches marked by ‘‘//” have been reduced in scale by 50% in order to fit the page.

and Hicks, 1999). These comparisons are difficult to interpret, Theileria is further complicated by the change from a double- particularly given the extreme levels of gene loss in most of stranded coding structure seen in most plastids to a single- the secondary endosymbionts. Using gene order as a tool for stranded coding arrangement for all genes in the apicoplast gen- comparison of the origin of the apicoplasts from Babesia and omes of these two taxa. 240 A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243

Fig. 2. Bayesian inference analyses of individual-gene Gblocks amino acid alignments. Posterior probabilities are denoted at each node when 50% or greater. (A) rpl14 consensus tree. (B) rpl16 consensus tree. (C) rps3 consensus tree. (D) rps11 consensus tree. (E) rps12 consensus tree. (F) rpoB consensus tree. (G) rpoC1 consensus tree. (H) rpoC2 consensus tree. (I) tufA consensus tree.

Other studies have supported a green algal origin for apicop- Apicomplexan plastids were not included in this analysis, lasts. Cai et al.’s analysis of rpoB, rpoC1, and rpoC2 genes using branching structure in the Gracilaria study was strongly sup- maximum likelihood and Bayesian inference methods similarly ported and the inferred relationships of photosynthetic taxa provide strong statistical support for this using a smaller sam- were very similar to our results (Hagopian et al., 2004) ple of Apicomplexans and Heterokonts (Cai et al., 2003). A re- (Fig. 1). If Apicomplexan sequences were improperly placed in cent paper on the phylogenetic position of the red algae the current study, we would expect larger perturbations of Gracilaria tenuistipitata var. liui analyzed the relationships the overall tree structure. Several lines of evidence also showed among photosynthetic organisms using 41 plastid protein-cod- the inconsistency of a single origin of the plastid in all Chromi- ing genes (Hagopian et al., 2004). While the Euglenozoa and sta and Alveolata (Bodyl, 2004). A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243 241

Fig. 2 (continued)

The invertebrate pathogen, Helicosporidium, is a green alga of this green alga to the Apicomplexans further strengthens the which has retained a non-photosynthetic plastid (Tartar and Bou- argument that the apicoplast is of green algal origin. Interestingly, cias, 2004; Tartar et al., 2002). This taxon groups very strongly with the association of the apicoplast with a non-photosynthetic green Euglenozoans and Apicomplexans (Fig. 1), and resolves as the sis- alga raises the question of when the plastid became non- ter lineage of the Apicomplexans. The close phylogenetic position photosynthetic. 242 A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243

pling within the Trebouxiophyceae green algae may lead to a better understanding of which green algal lineage contributed the plastid to Apicomplexans, and will help to better define when the apicoplast became non-photosynthetic. Nonetheless, this anal- ysis is the most comprehensive to date, including the most taxa and plastid genes, and uses rigorous Bayesian inference analyses of all data sets. These analyses provide strong multi-gene statistical support for the green algal hypothesis. Understanding the origins of plastids across eukaryotic lineages is critical to understanding overall patterns of diversification, mechanisms of innovation in these lineages, and may play an important role in understanding ecological roles of (and possibly biological control of) these cryptic ‘‘protists”.

Acknowledgments

This work was supported by USDA-ARS SCA58-5348-2-683, SCA5348-32000-020-01S and CRIS project 5348-32000-020-00D, and the Animal Health Research Center, College of Veterinary Med- icine, Washington State University.

References

Abrahamsen, M.S., Templeton, T.J., Enomoto, S., Abrahante, J.E., Zhu, G., Lancto, C.A., Deng, M., Liu, C., Widmer, G., Tzipori, S., Buck, G.A., Xu, P., Bankier, A.T., Dear, P.H., Konfortov, B.A., Spriggs, H.F., Iyer, L., Anantharaman, V., Aravind, L., Kapur, V., 2004. Complete genome sequence of the apicomplexan, Cryptosporidium parvum. Science 304, 441–445. Fig. 2 (continued) Armbrust, E.V., Berges, J.A., Bowler, C., Green, B.R., Martinez, D., Putnam, N.H., Zhou, S., Allen, A.E., Apt, K.E., Bechner, M., Brzezinski, M.A., Chaal, B.K., Chiovitti, A., Davis, A.K., Demarest, M.S., Detter, J.C., Glavina, T., Goodstein, D., Hadi, M.Z., Hellsten, U., Hildebrand, M., Jenkins, B.D., Jurka, J., Kapitonov, V.V., Kroger, N., Chromera velia, was recently reported to be closely related to Lau, W.W., Lane, T.W., Larimer, F.W., Lippmeier, J.C., Lucas, S., Medina, M., the Apicomplexans although it retains a photosynthetic plastid Montsant, A., Obornik, M., Parker, M.S., Palenik, B., Pazour, G.J., Richardson, P.M., Rynearson, T.A., Saito, M.A., Schwartz, D.C., Thamatrakoln, K., Valentin, K., Vardi, (Moore et al., 2008). This conclusion was based on analyzing the A., Wilkerson, F.P., Rokhsar, D.S., 2004. The genome of the diatom Thalassiosira nuclear large subunit rDNA sequences and the psbA gene. Since this pseudonana: ecology, evolution, and metabolism. Science 306, 79–86. study utilized nuclear-encoded genes to infer relationship, we cau- Blanchard, J.L., Hicks, J.S., 1999. The non-photosynthetic plastid in malarial parasites and other apicomplexans is derived from outside the green plastid lineage. J. tion that nuclear genes cannot be routinely used to predict the rate Eukaryot. Microbiol. 46, 367–375. of evolution of an organelle, as evolution rates of nuclear and api- Bodyl, A., 2004. Evolutionary origin of a preprotein translocase in the periplastid coplast genes can be very different as their genome evolution are membrane of complex plastids: a hypothesis. Plant Biol. (Stuttg.) 6, 513–518. Brayton, K.A., Lau, A.O., Herndon, D.R., Hannick, L., Kappmeyer, L.S., Berens, S.J., governed by different events (Lynch, 1997; Martin, 1999). Thus, Bidwell, S.L., Brown, W.C., Crabtree, J., Fadrosh, D., Feldblum, T., Forberger, H.A., nuclear genes that were once encoded by the apicoplast could Haas, B.J., Howell, J.M., Khouri, H., Koo, H., Mann, D.J., Norimine, J., Paulsen, I.T., evolve at different rates than those that remained in the apicoplast. Radune, D., Ren, Q., Smith Jr., R.K., Suarez, C.E., White, O., Wortman, J.R., The C. velia study also reported the UGA-Trp usage in psbA and Knowles Jr., D.P., McElwain, T.F., Nene, V.M., 2007. Genome sequence of Babesia bovis and comparative analysis of apicomplexan hemoprotozoa. PLoS Pathog. 3, claimed to be a feature also found in apicoplasts of coccidians 1401–1413. (Lang-Unnasch and Aiello, 1999) and mitochondria (Ralph et al., Cai, X., Fuller, A.L., McDougald, L.R., Zhu, G., 2003. Apicoplast genome of the 2004). This observation of the unusual UGA-Trp codon usage only coccidian Eimeria tenella. Gene 321, 39–46. Castresana, J., 2000. Selection of conserved blocks from multiple alignments for holds true for Neospora caninum and T. gondii. UGG-Trp is still pref- their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552. erentially used in E. tenella (a coccidian), and other Apicomplexans Cavalier-Smith, T., 1992. The number of symbiotic origins of organelles. Biosystems such as, T. parva and B. bovis (data not shown). Last but not least, 28, 91–106. discussion 107–108. Dacks, J.B., Marinets, A., Ford Doolittle, W., Cavalier-Smith, T., Logsdon Jr., J.M., 2002. the study reported the detection of an isofucoxanthin isomer in Analyses of RNA Polymerase II genes from free-living protists: phylogeny, long C. velia and this implies that its plastid is of red algal origin. This branch attraction, and the eukaryotic big bang. Mol. Biol. Evol. 19, 830–840. last finding is intriguing and adds to the already contested debate Delwiche, C.F., Kuhsel, M., Palmer, J.D., 1995. Phylogenetic analysis of tufA sequences indicates a cyanobacterial origin of all plastids. Mol. Phylogenet. of the apicoplast origin. Evol. 4, 110–128. Based on our analysis, we conclude that there is strong support Dunn, P.P., Stephens, P.J., Shirley, M.W., 1998. Eimeria tenella: two species of for a green algal origin of Apicomplexan and Euglenozoan plastids, extrachromosomal DNA revealed by pulsed-field gel electrophoresis. Parasitol. Res. 84, 272–275. in contrast to the plastid origins of Heterokonts, which were likely Fast, N.M., Kissinger, J.C., Roos, D.S., Keeling, P.J., 2001. Nuclear-encoded, plastid- derived from secondary endosymbiosis of red algae (Fig. 1). It is, targeted genes suggest a single common origin for apicomplexan and however, plausible that our conclusion of green algal origin of api- dinoflagellate plastids. Mol. Biol. Evol. 18, 418–426. coplast could be due to the possibility that the ancestral host Funes, S., Davidson, E., Reyes-Prieto, A., Magallon, S., Herion, P., King, M.P., Gonzalez-Halphen, D., 2002. A green algal apicoplast ancestor. Science 298, which gave rise to the (red) apicoplast contained some green plas- 2155. tid genes (Cai et al., 2003). It should also be noted that sampling Galtier, N., 2001. Maximum-likelihood phylogenetic analysis under a covarion-like density of taxa can have a large influence on phylogenetic infer- model. Mol. Biol. Evol. 18, 866–873. Gardner, M.J., Hall, N., Fung, E., White, O., Berriman, M., Hyman, R.W., Carlton, J.M., ences, and while we have substantially increased the sampling Pain, A., Nelson, K.E., Bowman, S., Paulsen, I.T., James, K., Eisen, J.A., Rutherford, density of Apicomplexans and Heterokonts in our study, under- K., Salzberg, S.L., Craig, A., Kyes, S., Chan, M.S., Nene, V., Shallom, S.J., Suh, B., standing the precise patterns of secondary endosymbiotic events Peterson, J., Angiuoli, S., Pertea, M., Allen, J., Selengut, J., Haft, D., Mather, M.W., Vaidya, A.B., Martin, D.M., Fairlamb, A.H., Fraunholz, M.J., Roos, D.S., Ralph, S.A., in all plastid-containing organisms will require much more de- McFadden, G.I., Cummings, L.M., Subramanian, G.M., Mungall, C., Venter, J.C., tailed sampling of the green and red algal lineages. Further sam- Carucci, D.J., Hoffman, S.L., Newbold, C., Davis, R.W., Fraser, C.M., Barrell, B., A.O.T. Lau et al. / Experimental Parasitology 123 (2009) 236–243 243

2002. Genome sequence of the human malaria parasite Plasmodium falciparum. N.D., Rutter, S., Saunders, D., Seeger, K., Shah, T., Squares, R., Squares, S., Tivey, Nature 419, 498–511. A., Walker, A.R., Woodward, J., Dobbelaere, D.A., Langsley, G., Rajandream, M.A., Gardner, M.J., Bishop, R., Shah, T., de Villiers, E.P., Carlton, J.M., Hall, N., Ren, Q., McKeever, D., Shiels, B., Tait, A., Barrell, B., Hall, N., 2005. Genome of the host- Paulsen, I.T., Pain, A., Berriman, M., Wilson, R.J., Sato, S., Ralph, S.A., Mann, D.J., cell transforming parasite Theileria annulata compared with T. parva. Science Xiong, Z., Shallom, S.J., Weidman, J., Jiang, L., Lynn, J., Weaver, B., Shoaibi, A., 309, 131–133. Domingo, A.R., Wasawo, D., Crabtree, J., Wortman, J.R., Haas, B., Angiuoli, S.V., Ralph, S.A., Van Dooren, G.G., Waller, R.F., Crawford, M.J., Fraunholz, M.J., Foth, B.J., Creasy, T.H., Lu, C., Suh, B., Silva, J.C., Utterback, T.R., Feldblyum, T.V., Pertea, M., Tonkin, C.J., Roos, D.S., McFadden, G.I., 2004. Tropical infectious diseases: Allen, J., Nierman, W.C., Taracha, E.L., Salzberg, S.L., White, O.R., Fitzhugh, H.A., metabolic maps and functions of the Plasmodium falciparum apicoplast. Nat. Morzaria, S., Venter, J.C., Fraser, C.M., Nene, V., 2005. Genome sequence of Rev. Microbiol. 2, 203–216. Theileria parva, a bovine pathogen that transforms lymphocytes. Science 309, Simdyanov, T.G., Kuvardina, O.N., 2007. Fine structure and putative feeding 134–137. mechanism of the archigregarine Selenidium orientale (: Gornicki, P., 2003. Apicoplast fatty acid biosynthesis as a target for medical Gregarinomorpha). Eur. J. Protistol. 43, 17–25. intervention in apicomplexan parasites. Int. J. Parasitol. 33, 885–896. Swofford, D.L., Waddell, P.J., Huelsenbeck, J.P., Foster, P.G., Lewis, P.O., Rogers, J.S., Gray, M.W., 1992. The endosymbiont hypothesis revisited. Int. Rev. Cytol. 141, 233– 2001. Bias in phylogenetic estimation and its relevance to the choice between 357. parsimony and likelihood methods. Syst. Biol. 50, 525–539. Gray, M.W., 1993. Origin and evolution of organelle genomes. Curr. Opin. Genet. Talavera, G., Castresana, J., 2007. Improvement of phylogenies after removing Dev. 3, 884–890. divergent and ambiguously aligned blocks from protein sequence alignments. Gribaldo, S., Philippe, H., 2002. Ancient phylogenetic relationships. Theor. Popul. Syst. Biol. 56, 564–577. Biol. 61, 391–408. Tartar, A., Boucias, D.G., 2004. The non-photosynthetic, pathogenic green alga Hagopian, J.C., Reis, M., Kitajima, J.P., Bhattacharya, D., de Oliveira, M.C., 2004. Helicosporidium sp. has retained a modified, functional plastid genome. FEMS Comparative analysis of the complete plastid genome sequence of the red alga Microbiol. Lett. 233, 153–157. Gracilaria tenuistipitata var. liui provides insights into the evolution of Tartar, A., Boucias, D.G., Adams, B.J., Becnel, J.J., 2002. Phylogenetic analysis rhodoplasts and their relationship to other plastids. J. Mol. Evol. 59, 464–477. identifies the invertebrate pathogen Helicosporidium sp. as a green alga Huelsenbeck, J.P., Bollback, J.P., 2001. Empirical and hierarchical Bayesian (Chlorophyta). Int. J. Syst. Evol. Microbiol. 52, 273–279. estimation of ancestral states. Syst. Biol. 50, 351–366. Thompson, J.K., Rubio, J.P., Caruana, S., Brockman, A., Wickham, M.E., Cowman, A.F., Huelsenbeck, J.P., Ronquist, F., 2001. MRBAYES: Bayesian inference of phylogenetic 1997. The chromosomal organization of the Plasmodium falciparum var. gene trees. Bioinformatics 17, 754–755. family is conserved. Mol. Biochem. Parasitol. 87, 49–60. Huelsenbeck, J.P., Larget, B., Miller, R.E., Ronquist, F., 2002. Potential applications Toso, M.A., Omoto, C.K., 2007a. Gregarina niphandrodes may lack both a plastid and pitfalls of Bayesian inference of phylogeny. Syst. Biol. 51, 673–688. genome and organelle. J. Eukaryot. Microbiol. 54, 66–72. Keeling, P.J., 2004. Reduction and compaction in the genome of the apicomplexan Toso, M.A., Omoto, C.K., 2007b. Ultrastructure of the Gregarina niphandrodes nucleus parasite Cryptosporidium parvum. Dev. Cell 6, 614–616. through stages from unassociated trophozoites to gamonts in syzygy and the Kohler, S., Delwiche, C.F., Denny, P.W., Tilney, L.G., Webster, P., Wilson, R.J., Palmer, syzygy junction. J. Parasitol. 93, 479–484. J.D., Roos, D.S., 1997. A plastid of probable green algal origin in apicomplexan Tuffley, C., Steel, M., 1998. Modeling the covarion hypothesis of nucleotide parasites. Science 275, 1485–1489. substitution. Math. Biosci. 147, 63–91. Lang-Unnasch, N., Aiello, D.P., 1999. Sequence evidence for an altered genetic code Waller, R.F., McFadden, G.I., 2005. The apicoplast: a review of the derived plastid of in the Neospora caninum plastid. Int. J. Parasitol. 29, 1557–1562. apicomplexan parasites. Curr. Issues Mol. Biol. 7, 57–79. Lewis, P.O., 2001. Phylogenetic systematics turns over a new leaf. Trends Ecol. Evol. Waller, R.F., Keeling, P.J., van Dooren, G.G., McFadden, G.I., 2003. Comment on ‘‘A 16, 30–37. green algal apicoplast ancestor”. Science 301, 49. author reply 49. Lynch, M., 1997. Mutation accumulation in nuclear, organelle, and prokaryotic Williamson, D.H., Gardner, M.J., Preiser, P., Moore, D.J., Rangachari, K., Wilson, R.J., transfer RNA genes. Mol. Biol. Evol. 14, 914–925. 1994. The evolutionary origin of the 35 kb circular DNA of Plasmodium Martin, A.P., 1999. Substitution rates of organelle and nuclear genes in sharks: falciparum: new evidence supports a possible rhodophyte ancestry. Mol. Gen. implicating metabolic rate (again). Mol. Biol. Evol. 16, 996–1002. Genet. 243, 249–252. Medlin, L.K., Cooper, A., Hill, C., Wrieden, S., Wellbrock, U., 1995. Phylogenetic Wilson, R.J., Denny, P.W., Preiser, P.R., Rangachari, K., Roberts, K., Roy, A., Whyte, A., position of the Chromista plastids based on small subunit rRNA coding regions. Strath, M., Moore, D.J., Moore, P.W., Williamson, D.H., 1996. Complete gene map Curr. Genet. 28, 560–565. of the plastid-like DNA of the malaria parasite Plasmodium falciparum. J. Mol. Moore, R.B., Obornik, M., Janouskovec, J., Chrudimsky, T., Vancova, M., Green, D.H., Biol. 261, 155–172. Wright, S.W., Davies, N.W., Bolch, C.J., Heimann, K., Slapeta, J., Hoegh-Guldberg, Xu, P., Widmer, G., Wang, Y., Ozaki, L.S., Alves, J.M., Serrano, M.G., Puiu, D., Manque, O., Logsdon, J.M., Carter, D.A., 2008. A photosynthetic closely related to P., Akiyoshi, D., Mackey, A.J., Pearson, W.R., Dear, P.H., Bankier, A.T., Peterson, apicomplexan parasites. Nature 451, 959–963. D.L., Abrahamsen, M.S., Kapur, V., Tzipori, S., Buck, G.A., 2004. The genome of Morton, B.R., 1999. Strand asymmetry and codon usage bias in the chloroplast Cryptosporidium hominis. Nature 431, 1107–1112. genome of Euglena gracilis. Proc. Natl. Acad. Sci. USA 96, 5123–5128. Yoon, H.S., Hackett, J.D., Pinto, G., Bhattacharya, D., 2002. The single, ancient origin Pain, A., Renauld, H., Berriman, M., Murphy, L., Yeats, C.A., Weir, W., Kerhornou, A., of chromist plastids. Proc. Natl. Acad. Sci. USA 99, 15507–15512. Aslett, M., Bishop, R., Bouchier, C., Cochet, M., Coulson, R.M., Cronin, A., de Zhang, Z., Green, B.R., Cavalier-Smith, T., 1999. Single gene circles in dinoflagellate Villiers, E.P., Fraser, A., Fosker, N., Gardner, M., Goble, A., Griffiths-Jones, S., chloroplast genomes. Nature 400, 155–159. Harris, D.E., Katzer, F., Larke, N., Lord, A., Maser, P., McKellar, S., Mooney, P., Zhu, G., Marchewka, M.J., Keithly, J.S., 2000. Cryptosporidium parvum appears to lack Morton, F., Nene, V., O’Neil, S., Price, C., Quail, M.A., Rabbinowitsch, E., Rawlings, a plastid genome. Microbiology 146 (Pt 2), 315–321.