INSIGHT

GENOME We are not so special New sequence data from improves our understanding of the genetic changes that occurred along the branch of the evolutionary tree that gave rise to .

ZACHARY R LEWIS AND CASEY W DUNN

then it will appear that the gene is specific to Related research article Richter DJ, group M and younger than it actually is. Fozouni P, Eisen M, King N. 2018. Gene Now, in eLife, Daniel Richter, Parinaz Fozouni, family innovation, conservation and loss on Michael Eisen and Nicole King report their work the stem lineage. eLife 7:e34226. to reduce sequencing bias by sampling many DOI: 10.7554/eLife.34226 more genes in the sister group to animals, the choanoflagellates (Richter et al., 2018). They generated transcriptomic data for 19 species of choanoflagellates and analyzed them in combi- nation with previously published metazoan (ani- he most recent common ancestor of ani- mal), and other eukaryote mals lived more than 600 million years genomes. In addition to presenting new data, T ago, so we cannot sequence its genome. Richter et al. – who are based at UC Berkeley, Nevertheless, we can identify a minimal set of UCSF, the Gladstone Institutes and Station gene families that were present in this long- Biologique de Roscoff – applied new probabilis- dead ancestor by comparing genomic data tic methods to minimize the chance that a gene across animals and their closest relatives. In family would be predicted to be present in a tax- addition to being interesting in its own right, onomic group based on the spurious assignment this helps us identify which genes were gained of unrelated genes to the same family. and lost before the origin of animals and, like- In related work at the universities of Essex wise, which genes were gained and lost as ani- and Oxford, Jordi Paps and Peter Holland have mals diversified. reported an interesting analysis of gene gain The challenge, though, is that there are and loss in early animal evolution (Paps and Hol- strong sampling biases that can compromise land, 2018). The studies agree on some key these analyses. Genome sequencing has focused points. Both recovered a relatively large number on species that are medically relevant, experi- of gene family gains along the ‘animal stem’ (the mentally tractable, and easy to sequence branch of the evolutionary tree that uniquely (del Campo et al., 2014). Left unaddressed, gives rise to animals; shown in blue in these biases can frustrate efforts to reconstruct Figure 1B). However, while Paps and Holland the genomes of our ancient ancestors. Take, for estimate that the number of gains was much example, the simple case of three groups of higher than the number of losses, which they organisms called O, C and M, and a gene that interpreted as evidence for an accelerated Copyright Lewis and Dunn. This originated along the branch that gave rise to C expansion of gene families along the Metazoa article is distributed under the terms and M (Figure 1A). If more sequencing effort stem, Richter et al. estimate approximately of the Creative Commons Attribution has been invested in group M than in group C, equal numbers of gains and losses (Figure 1C). License, which permits unrestricted the gene is more likely to be found in group M This means that Richter et al. find evidence for use and redistribution provided that the original author and source are than in group C. And if the gene is found in M accelerated churn of gene families along the credited. but not in C, even though it is present in both, Metazoa stem, not a burst of expansion. This

Lewis and Dunn. eLife 2018;7:e38726. DOI: https://doi.org/10.7554/eLife.38726 1 of 3 Insight Genome Evolution We are not so special

Figure 1. Genes lost and gained. (A) Example of biased sampling (left): although a gene was gained (first green line) before group C and group M diverged, biased sampling means that it is only detected in group M, which leads to the incorrect inference (second green line) that the gene arose after the groups diverged. With uniform sampling (right), the gene gain is correctly inferred (third green line). Groups C, M and O could be Choanoflagellata, Metazoa and Outgroups. (B) Cladogram showing the evolutionary relationships of the clades in question, with the Choanoflagellata stem shown in red and the Metazoa stem shown in blue. refers to the clade Choanoflagellata + Metazoa (Brunet and King, 2017). (C) The number of gene groups gained (y-axis) plotted against the number of gene groups lost (x-axis) along various branches leading to the nodes shown in panel B, based on the data in four studies (Fairclough et al., 2013; Paps and Holland, 2018; Richter et al., 2018; Suga et al., 2013). The gray dashed line indicates equal gene group gain and loss. Note that the four studies use different methodologies to define groupings of genes. Data and analyses are available at https://github.com/dunnlab/gene_inventory_2018 (Lewis and Dunn, 2018; copy archived at https://github.com/elifesciences- publications/gene_inventory_2018).

incongruence is likely related to Paps and Hol- as the Hippo pathway; Sebe´-Pedro´s et al., 2012). land analyzing two choanoflagellate species, Other changes included the expansion of a core compared to the 21 analyzed by Richter et al. set of transcription factors (de Mendoza et al., Another difference is that Paps and Holland 2013), and increased cis-regulatory complexity did not estimate gene gain and loss along the (Sebe´-Pedro´s et al., 2016). Choanoflagellata stem, whereas Richter et al. Comparative gene content analyses refine did. This revealed more gene family gain and our understanding of what makes metazoans less gene family loss along the Choanoflagellata unique, and in the process we are learning about stem than along the Metazoa stem (Figure 1C). the underappreciated biology of our close non- So, Richter et al. do find a burst of gene family metazoan relatives (Sebe´-Pedro´s et al., 2017). expansion, but in Choanoflagellata rather than For instance, Richter et al. identified homologs Metazoa. It will be critical to further test the of Toll-like receptors in most choanoflagellates. findings of both studies with improved sampling These genes were thought to be an animal-spe- of other closely related groups, which could cific innovation for innate immunity. Future change how the gains and losses are appor- research could investigate if these genes have tioned to these two stems. The results presented by Richter et al. agree in immune-like roles in non-animals. important ways with other recent work (King et al., It is impossible to know how special animals 2008; Suga et al., 2013). These analyses reveal really are without also knowing something about that the genetic changes on the Metazoa stem our closest relatives. The more we learn about included the evolution of new intercellular signal- these relatives, the less special we seem to be. ing pathways (Fairclough et al., 2013) and the integration of new ligands and receptors into intra- cellular pathways that were already present (such

Lewis and Dunn. eLife 2018;7:e38726. DOI: https://doi.org/10.7554/eLife.38726 2 of 3 Insight Genome Evolution We are not so special

Zachary R Lewis is in the Department of Ecology and I, Marr M, Pincus D, Putnam N, Rokas A, Wright KJ, Evolutionary Biology, Yale University, New Haven, Zuzow R, Dirks W, Good M, Goodstein D, Lemons D, United States et al. 2008. The genome of the choanoflagellate https://orcid.org/0000-0003-0160-4722 Monosiga brevicollis and the origin of metazoans. Nature 451:783–788. DOI: https://doi.org/10.1038/ Casey W Dunn is in the Department of Ecology and nature06617, PMID: 18273011 Evolutionary Biology, Yale University, New Haven, Lewis ZR, Dunn CW. 2018. Gene Inventory 2018. United States Github. 18b945b. https://github.com/dunnlab/gene_ [email protected] inventory_2018 http://orcid.org/0000-0003-0628-5150 Paps J, Holland PWH. 2018. Reconstruction of the ancestral metazoan genome reveals an increase in Competing interests: The authors declare that no genomic novelty. Nature Communications 9:1730. competing interests exist. DOI: https://doi.org/10.1038/s41467-018-04136-5, Published 03 July 2018 PMID: 29712911 Richter DJ, Fozouni P, Eisen M, King N. 2018. Gene References family innovation, conservation and loss on the animal stem lineage. eLife 7:e34226. DOI: https://doi.org/10. Brunet T, King N. 2017. The origin of animal 7554/eLife.34226, PMID: 29848444 multicellularity and cell differentiation. Developmental Sebe´ -Pedro´ s A, Ballare´C, Parra-Acero H, Chiva C, Cell 43:124–140. DOI: https://doi.org/10.1016/j. Tena JJ, Sabido´E, Go´mez-Skarmeta JL, Di Croce L, devcel.2017.09.016, PMID: 29065305 Ruiz-Trillo I. 2016. The dynamic regulatory genome of de Mendoza A, Sebe´-Pedro´s A, Sˇ estak MS, Matejcic Capsaspora and the origin of animal multicellularity. M, Torruella G, Domazet-Loso T, Ruiz-Trillo I. 2013. Cell 165:1224–1237. DOI: https://doi.org/10.1016/j. Transcription factor evolution in eukaryotes and the cell.2016.03.034, PMID: 27114036 assembly of the regulatory toolkit in multicellular Sebe´ -Pedro´ s A, Degnan BM, Ruiz-Trillo I. 2017. The lineages. PNAS 110:E4858–E4866. DOI: https://doi. origin of Metazoa: a unicellular perspective. Nature org/10.1073/pnas.1311818110, PMID: 24277850 Reviews Genetics 18:498–512. DOI: https://doi.org/10. del Campo J, Sieracki ME, Molestina R, Keeling P, 1038/nrg.2017.21, PMID: 28479598 Massana R, Ruiz-Trillo I. 2014. The others: our biased Sebe´ -Pedro´ s A, Zheng Y, Ruiz-Trillo I, Pan D. 2012. perspective of eukaryotic genomes. Trends in Ecology Premetazoan origin of the Hippo signaling pathway. & Evolution 29:252–259. DOI: https://doi.org/10.1016/ Cell Reports 1:13–20. DOI: https://doi.org/10.1016/j. j.tree.2014.03.006, PMID: 24726347 celrep.2011.11.004, PMID: 22832104 Fairclough SR, Chen Z, Kramer E, Zeng Q, Young S, Suga H, Chen Z, de Mendoza A, Sebe´-Pedro´s A, Robertson HM, Begovic E, Richter DJ, Russ C, Brown MW, Kramer E, Carr M, Kerner P, Vervoort M, Westbrook MJ, Manning G, Lang BF, Haas B, Sa´nchez-Pons N, Torruella G, Derelle R, Manning G, Nusbaum C, King N. 2013. Premetazoan genome Lang BF, Russ C, Haas BJ, Roger AJ, Nusbaum C, Ruiz- evolution and the regulation of cell differentiation in Trillo I. 2013. The Capsaspora genome reveals a the choanoflagellate Salpingoeca rosetta. Genome complex unicellular prehistory of animals. Nature Biology 14:R15. DOI: https://doi.org/10.1186/gb- Communications 4:2325. DOI: https://doi.org/10. 2013-14-2-r15, PMID: 23419129 1038/ncomms3325, PMID: 23942320 King N, Westbrook MJ, Young SL, Kuo A, Abedin M, Chapman J, Fairclough S, Hellsten U, Isogai Y, Letunic

Lewis and Dunn. eLife 2018;7:e38726. DOI: https://doi.org/10.7554/eLife.38726 3 of 3