Comparative Analyses of Chloroplast Genome Data Representing Nine Green Algae in Sphaeropleales (Chlorophyceae, Chlorophyta)

Comparative Analyses of Chloroplast Genome Data Representing Nine Green Algae in Sphaeropleales (Chlorophyceae, Chlorophyta)

Data in Brief 7 (2016) 558–570 Contents lists available at ScienceDirect Data in Brief journal homepage: www.elsevier.com/locate/dib Data article Comparative analyses of chloroplast genome data representing nine green algae in Sphaeropleales (Chlorophyceae, Chlorophyta) Karolina Fučíková n, Louise A. Lewis, Paul O. Lewis Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA article info abstract Article history: The chloroplast genomes of green algae are highly variable in their Received 16 February 2016 architecture. In this article we summarize gene content across Received in revised form newly obtained and published chloroplast genomes in Chlor- 25 February 2016 ophyceae, including new data from nine of species in Sphaer- Accepted 1 March 2016 opleales (Chlorophyceae, Chlorophyta). We present genome Available online 9 March 2016 architecture information, including genome synteny analysis across two groups of species. Also, we provide a phylogenetic tree obtained from analysis of gene order data for species in Chlor- ophyceae with fully sequenced chloroplast genomes. Further analyses and interpretation of the data can be found in “Chlor- oplast phylogenomic data from the green algal order Sphaer- opleales (Chlorophyceae, Chlorophyta) reveal complex patterns of sequence evolution” (Fučíková et al., In review) [1]. & 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). Specifications Table Subject area Biology More specific sub- Phylogenomics ject area Type of data Table, Figures, text file, tree file DOI of original article: http://dx.doi.org/10.1016/j.ympev.2016.01.022 n Corresponding author. E-mail address: [email protected] (K. Fučíková). http://dx.doi.org/10.1016/j.dib.2016.03.014 2352-3409/& 2016 The Authors. Published by Elsevier Inc. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). K. Fučíková et al. / Data in Brief 7 (2016) 558–570 559 How data were Phylogenetic analysis of gene order in Badger, Mauve analysis, Geneious genome acquired map Data format Analyzed Experimental Genomic data were collected using Illumina HiSeq and annotated in Geneious factors Experimental Gene order data analyzed in Badger, Synteny maps in Mauve plugin in Geneious features Data source Storrs, CT, U.S.A. location Data accessibility Data is within this article and at NCBI: http://www.ncbi.nlm.nih.gov accessions GenBank: KT199248, GenBank: KT199249, GenBank: KT199250, GenBank: KT199251, GenBank: KT199252, GenBank: KT199253, GenBank: KT199254, GenBank: KT199255, GenBank: KT199256 Value of the data Chloroplast genomes of green algae in the order Sphaeropleales are currently sparsely studied, thus new data from nine additional species expands knowledge of the structural variation within this order of algae. Table summarizes the features present in chloroplast genomes of green algae in Sphaeropleales, useful for comparison to other species in future analyses. Figures of nine assembled chloroplast genomes of green algae illustrate the features and their arrangements in these species. New gene order data can be used in future phylogenetic analyses that include information from additional species. 1. Data In this article we present chloroplast genome structural data for nine species of green algae (GenBank: KT199248, GenBank: KT199249, GenBank: KT199250, GenBank: KT199251, GenBank: KT199252, GenBank: KT199253, GenBank: KT199254, GenBank: KT199255, GenBank: KT199256; NCBI: http://www.ncbi.nlm.nih.gov), including comparison of gene and intron content (Fig. 1). Chloroplast genome maps with all annotated features are presented in Figs. 2–10. Synteny comparisons were performed for two sets of species (Figs. 11 and 12). Gene order data were scored for 15 taxa in the class Chlorophyceae (GeneOrder.txt). A phylogenetic analysis of gene order using Badger is presented in (Fig. 13, GeneOrder.tre). 2. Experimental design, materials and methods Full chloroplast genome sequences were obtained for 9 species of green algae in Sphaeropleales [1]. From these, genome maps were prepared in Geneious (version 6) (Figs. 2–10). Synteny maps were produced using the Mauve plugin in Geneious, which was also used to estimate the numbers of genomic rearrangements among taxa [2] (Figs. 11–12). Gene order data were analyzed phylogeneti- cally using Badger [3] for a total of 15 species in Chlorophyceae with fully sequenced cp genomes (Fig. 13). Only single-copy genes that were present in all included genomes (83 genes) were used for the analysis. Genes present in inverted repeats were counted once and inverted repeats were oriented to have the rRNA genes on the positive strand. For the trans-spliced psaA gene, only the first exon was considered. Badger was run for 10,000,000 generations, sampling every 100, with other settings set to default. The first 1000 samples were discarded as burnin. 560 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Fig. 1. Comparison of gene presence and intron content in chloroplast genomes of algae in Chlorophyceae. K. Fučíková et al. / Data in Brief 7 (2016) 558–570 561 Fig. 2. Schematic map of the chloroplast genome of Ankyra judayi (KT199255, SAG 17.84). 562 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Fig. 3. Schematic map of the chloroplast genome of Bracteacoccus aerius (KT199254, UTEX 1250). K. Fučíková et al. / Data in Brief 7 (2016) 558–570 563 Fig. 4. Schematic map of the chloroplast genome of Bracteacoccus minor (KT199253, UTEX B 66). 564 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Fig. 5. Schematic map of the chloroplast genome of Chlorotetraedron incus (KT199252, SAG 43.81). K. Fučíková et al. / Data in Brief 7 (2016) 558–570 565 Fig. 6. Schematic map of the chloroplast genome of Chromochloris zofingiensis (KT199251, UTEX 56). 566 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Fig. 7. Schematic map of the chloroplast genome of Kirchneriella aperta (KT199250, SAG 2004). Fig. 8. Schematic map of the chloroplast genome of Mychonastes homosphaera (KT199249, CAUP H 6502). K. Fučíková et al. / Data in Brief 7 (2016) 558–570 567 Fig. 9. Schematic map of the chloroplast genome of Neochloris aquatica (KT199248, UTEX 138). 568 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Fig. 10. Schematic map of the chloroplast genome of Pseudomuriella schumacherensis (KT199256, SAG 2137). Bracteacoccus aerius Bracteacoccus minor Fig. 11. Analysis of synteny between the chloroplast genomes of two species of Bracteacoccus, B. aerius and B. minor, using MAUVE alignments. Colored and outlined blocks surround regions of the genome sequence of one genome that aligned to a corresponding part of the second genome, and lines connect blocks of putative homology. Within the blocks the colored bars indicate the level of sequence similarities. K. Fučíková et al. / Data in Brief 7 (2016) 558–570 569 Chlorotetraedron incus Neochloris aquatica Acutodesmus obliquus Fig. 12. Synteny analysis among the chloroplast genomes of three species, Acutodesmus obliquus, Neochloris aquatica, and Chlorotetraedron incus. Colored and outlined blocks surround regions of the genome sequence that aligned to a corresponding part of the second genome, and lines connect blocks of putative homology. Within the blocks the colored bars indicate the level of sequence similarities. 0.9986 Chlorotetraedron incus 0.9548 Neochloris aquatica 0.6258 Acutodesmus obliquus 0.5726 Mychonastes homosphaera Chromochloris zofingiensis 1 0.8677 Pseudomuriella schumacherensis 0.5 Kirchneriella aperta 0.9813 Bracteacoccus aerius 0.6642 Bracteacoccus minor Ankyra judai 0.8437 Chlamydomonas reinhardtii 0.9998 Gonium pectorale 0.9979 Pleodorina starrii 0.5 Volvox carteri 0.7 inversions Dunaliella salina Fig. 13. Badger phylogenetic tree inferred from gene order data present in completely sequenced chloroplast genomes of Sphaeropleales. Scale bar represents the number of inversions inferred to have occurred along a particular branch. 570 K. Fučíková et al. / Data in Brief 7 (2016) 558–570 Acknowledgments Analyses were supported with funding from NSF grants DEB-1036448 and DEB-1354146 awarded to LAL and POL at University of Connecticut. Analyses were carried out at the Bioinformatics Facility of the University of Connecticut Biotechnology and Bioservices Center (BBC). Appendix A. Supplementary material Supplementary data associated with this article can be found in the online version at http://dx.doi. org/10.1016/j.dib.2016.03.014. References [1] K. Fučíková, P.O. Lewis, L.A. Lewis, Chloroplast phylogenomic data from the green algal order Sphaeropleales (Chlor- ophyceae, Chlorophyta) reveal complex patterns of sequence evolution, Mol. Phylogenet. Evol. (2016) (In press). [2] A.E. Darling, B. Mau, N.T. Perna, progressiveMauve: Multiple genome alignment with gene gain, loss and rearrangement, PloS One 5 (6) (2010) e11147. [3] D. Simon, B. Larget, Bayesian analysis to describe genomic evolution by rearrangement (BADGER), version 1.02 beta, Department of Mathematics and Computer Science , Duquesne University, 2004..

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    13 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us