A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution Across the Poales
Total Page:16
File Type:pdf, Size:1020Kb
GBE A Phylogenomic Assessment of Ancient Polyploidy and Genome Evolution across the Poales Michael R. McKain1,2,*, Haibao Tang3,4,JoelR.McNeal5,2, Saravanaraj Ayyampalayam2, Jerrold I. Davis6, Claude W. dePamphilis7, Thomas J. Givnish8,J.ChrisPires9, Dennis Wm. Stevenson10,and James H. Leebens-Mack2 1Donald Danforth Plant Science Center, St. Louis, Missouri Downloaded from https://academic.oup.com/gbe/article-abstract/8/4/1150/2574085 by guest on 17 January 2019 2Department of Plant Biology, University of Georgia 3Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, Fujian Province, China 4School of Plant Sciences, iPlant Collaborative, University of Arizona 5Department of Ecology, Evolution, and Organismal Biology, Kennesaw State University 6L. H. Bailey Hortorium and Department of Plant Biology, Cornell University 7Department of Biology and Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, Pennsylvania 8Department of Botany, University of Wisconsin-Madison 9Division of Biological Sciences, University of Missouri, Columbia 10New York Botanical Garden, Bronx, New York *Corresponding author: E-mail: [email protected]. Accepted: March 7, 2016 Data deposition: Alignments, trees, and analyses for this project have been deposited at Dryad under the accession doi:10.5061/dryad.305s0. DNA sequences have been deposited at GenBank under the accession PRJNA313089. Abstract Comparisons of flowering plant genomes reveal multiple rounds of ancient polyploidy characterized by large intragenomic syntenic blocks. Three such whole-genome duplication (WGD) events, designated as rho (r), sigma (s), and tau (t), have been identified in the genomes of cereal grasses. Precise dating of these WGD events is necessary to investigate how they have influenced diversification rates, evolutionary innovations, and genomic characteristics such as the GC profile of protein-coding sequences. The timing of these events has remained uncertain due to the paucity of monocot genome sequence data outside the grass family (Poaceae). Phylogenomic analysis of protein-coding genes from sequenced genomes and transcriptome assemblies from 35 species, including representatives of all families within the Poales, has resolved the timing of rho and sigma relative to speciation events and placed tau prior to divergence of Asparagales and the commelinids but after divergence with eudicots. Examination of gene family phylogenies indicates that rho occurred just prior to the diversification of Poaceae and sigma occurred before early diversification of Poales lineages but after the Poales-commelinid split. Additional lineage-specific WGD events were identified on the basis of the transcriptome data. Gene families exhibiting high GC content are underrepresented among those with duplicate genes that persisted following these genome duplications. However, genome du- plications had little overall influence on lineage-specific changes in the GC content of coding genes. Improved resolution of the timing of WGD events in monocot history provides evidence for the influence of polyploidization on functional evolution and species diversification. Key words: whole-genome duplication, grasses, monocots, GC content. Introduction Taylor and Raes 2004; Soltis et al. 2009; De Smet and Van de Paleopolyploidy or ancient whole-genome duplication (WGD) Peer 2012). WGDs have been especially widespread through- events have occurred across the eukaryotic tree of life and out flowering plant history (Cui et al. 2006; Vanneste et al. have been hypothesized to have had major impacts on life 2014), and ancient polyploidy has been associated with the history innovations and organismal diversification (Levin 1983; origin of seed plants (spermatophyta) and angiosperms (Jiao ß The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] 1150 Genome Biol. Evol. 8(4):1150–1164. doi:10.1093/gbe/evw060 Advance Access publication March 17, 2016 Polyploidy and Genome Evolution Across Poales GBE et al. 2011). In fact, multiple rounds of WGD have been in- polyploid events occurring over a relatively short period of ferred for many plant lineages (Tang et al. 2010; Jiao et al. time (Estep et al. 2014). 2011, 2012; McKain et al. 2012; Yang et al. 2015) including Early analyses of the rice genome revealed a WGD event those leading to the model plant species Arabidopsis thaliana designated as rho (Paterson et al. 2004; Yu et al. 2005)in- (Blanc and Wolfe 2004), the tree species Populus trichocarpa ferred to have predated the divergence of the BOP and (Tuskan et al. 2006), legumes (Fabaceae; [Cannon et al. PACMAD clades within the grass phylogeny (Grass 2015]), and grasses (Poaceae; [Paterson et al. 2004; Tang Phylogeny Working Group 2001; Blanc and Wolfe 2004; et al. 2010]). Though many recent polyploid lineages have Paterson et al. 2004; Schlueter et al. 2004; Yu et al. 2005; relatively lower net diversification rates than diploid lineages, Wu et al. 2008; Grass Phylogeny Working Group II 2012; Downloaded from https://academic.oup.com/gbe/article-abstract/8/4/1150/2574085 by guest on 17 January 2019 genome duplications have undoubtedly contributed to molec- Soreng et al. 2015). The rho event has been implicated ular, ecological, and phylogenetic diversification during angio- as contributing to the success of Poaceae through the sperm evolution (Mayrose et al. 2011; Soltis et al. 2014). role of duplicated MADS-box genes in the development However, a mechanistic understanding of how polyploidy of the spikelet (Preston and Kellogg 2006; Preston et al. has contributed to evolutionary innovations and diversification 2009) and the role of duplicated genes involved in the devel- in angiosperm history requires precise phylogenetic placement opment of starch-rich seeds (Wu et al. 2008; Comparot-Moss of the nature and timing of these ancient WGDs (Soltis et al. and Denyer 2009). Despite the rho event’s impact on 2009). functional and phylogenetic diversification of the Polyploid can occur through either somatic genome dou- grasses, the absence of genomic-scale analyses for early di- bling within meristematic tissue, zygotes, or young embryos, verging grass lineages (pre-divergence of BOP and or with the fusion of unreduced gametes (Ramsey and PACMAD clades) and nongrass graminids has left the precise Schemske 1998). Both autopolyploidy (multiple copies of the placement of rho uncertain (Paterson et al. 2004; Soltis et al. 2009). same genome) and allopolyploidy (multiple copies of different Later comparative genomic analyses performed with re- genomes, usually of different species) have prevalence in an- fined computational tools elucidated an even older genome giosperm history (Barker et al. 2016). The formation of stable, duplication in grass genomes designated as sigma that was sexually reproducing polyploid populations is thought to in- estimated to have occurred sometime prior to the diversifica- volve two steps: 1) mating between individuals with reduced tion of Poaceae (Paterson et al. 2009; Tang et al. 2010). and unreduced gametes, sometimes producing individuals Analyses of the Musa genome identified three lineage-specific with odd polyploidy levels, the so-called triploid bridge WGDs within Zingiberales and determined that the rho and (Mallet 2007), and 2) subsequent fusion of unreduced gam- sigma events occurred within the Poales lineage but prior to etes producing a stable tetraploid, or higher even-level poly- the divergence of the PACMAD and BOP clades (D’Hont et al. ploid, such as hexaploid wheat (Levin 2002). Morphological, 2012). physiological, and ecological diversification among polyploid Based on synteny analysis of the rice and sorghum ge- populations can follow as a consequence of differential gene nomes, Tang et al. (2010) hypothesized a third paleopolyploid loss or subfunctionalization and neofunctionalization of re- event prior to sigma but such an event was not inferred in the tained duplicates (i.e., homeologs). Musa genome analyses (D’Hont et al. 2012). Comparative Polyploidy is hypothesized to have played a major role in synteny analyses of oil palm (Elaeis guineensis) and eudicot the evolution of key innovations through the proliferation of genomes have recently placed this event, designated tau,as novel genes and gene interactions (Ohno 1970; Levin 1983; early in the evolutionary history of monocots (Jiao et al. 2014), VandePeeretal.2009). Polyploidy was recently shown to be but after the divergence of the Alismatales (Ming et al. 2015). a driver of innovation and novelty in secondary metabolites in Clearly, polyploidy has contributed to molecular variation and the mustard family (Edger et al. 2015), demonstrating a con- evolution within grass and other monocot lineages, but a full nection between genome duplication, adaptation, and diver- understanding of the impact of the rho, sigma,andtau events sification. Additionally, polyploidy has been implicated as requires more precise estimation of the timing of these spurring epigenetic changes (Yoo et al. 2014), immediate genome duplications relative to branching points in the