bioRxiv preprint doi: https://doi.org/10.1101/2021.07.25.453713; this version posted July 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Eukaryotic genomes from a global metagenomic dataset illuminate trophic modes and biogeography of ocean plankton Harriet Alexander1,*, Sarah K. Hu2, Arianna I. Krinos1,3, Maria Pachiadaki1, Benjamin J. Tully4, Christopher J. Neely5, and Taylor Reiter6 1Biology Department, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543 2Marine Chemistry and Geochemistry, Woods Hole Oceanographic Institution, Woods Hole, MA, USA, 02543 3MIT-WHOI Joint Program in Oceanography, Cambridge and Woods Hole, MA, 02540 4Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089 5Department of Computational and Quantitative Biology, University of Southern California, Los Angeles, CA 90089 6Population Health and Reproduction, University of California, Davis, Davis, CA, 95616 *Correspondence;
[email protected] Abstract Molecular and genomic approaches that target mixed microbial communities (e.g., metagenomics or metatranscriptomics) provide insight into the ecological roles, evolutionary histories, and phys- iological capabilities of the microorganisms and the processes in the environment. Computational tools that harness large-scale sequence surveys have become a valuable resource for characterizing 5 the genetic make-up of the bacterial and archaeal component of the marine microbiome. Yet, fewer studies have focused on the unicellular eukaryotic fraction of the community. Here, we developed the EukHeist automated computational pipeline, to retrieve eukaryotic and prokaryotic metagenome assembled genomes (MAGs).