Using Mustard Genomes to Explore the Genetic Basis of Evolutionary
Total Page:16
File Type:pdf, Size:1020Kb
Available online at www.sciencedirect.com ScienceDirect Using mustard genomes to explore the genetic basis of evolutionary change Lachezar A Nikolov and Miltos Tsiantis Recent advances in sequencing technologies and gene phylogenetic framework, which is a prerequisite for the manipulation tools have driven mustard species into the comparative approach, is coming into focus and currently spotlight of comparative research and have offered powerful comprises 49 tribes [3] and several monophyletic sub- insight how phenotypic space is explored during evolution. familial groupings [4]. Brassicaceae is the plant family Evidence emerged for genome-wide signal of transcription with the highest number of species with sequenced factors and gene duplication contributing to trait divergence, genomes. Many Brassicaceae species are genetically e.g., PLETHORA5/7 in leaf complexity. Trait divergence is often and biochemically tractable. Collectively, these qualities manifested in differential expression due to cis-regulatory create a powerful platform to link genotype to phenotype divergence, as in KNOX genes and REDUCED COMPLEXITY, and address how diverse traits evolve. Here we focus on and can be coupled with protein divergence. Fruit shape in recent studies employing comparative genomics to sum- Capsella rubella results from anisotropic growth during three marize patterns of genomic variation and to understand distinct phases. Brassicaceae exhibit novel fruit dispersal the genetic basis of phenotypic change in mustards. strategy, explosive pod shatter, where the rapid movement depends on slow build-up of tension and its rapid release Genomic advances at different evolutionary facilitated by asymmetric cell wall thickenings. scales Mustards on the map: intra-specific and inter-specific Address variation Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne 50829, Germany After the completion of the reference sequence of A. thaliana (Col-0) [5], much effort has been invested Corresponding author: Tsiantis, Miltos ([email protected]) to re-sequence accessions throughout its geographic range to gain insight into the genomic variation and Current Opinion in Plant Biology 2017, 36:119–128 evolutionary history of the species. The most recent report included 1135 naturally inbred lines and featured This review comes from a themed issue on Genome studies and the densest variant map available for any complex eukary- molecular genetics ote [6 ]. It provided unprecedented resolution of the Edited by Ian Henderson and Korbinian Schneeberger pattern of polymorphisms, insight into the demographic history of the species, and highlighted the formative role of climate (in particular, the last glacial period) and http://dx.doi.org/10.1016/j.pbi.2017.02.005 geography (Iberian Peninsula as a hotspot of relict popu- 1369-5266/ã 2017 Published by Elsevier Ltd. lations surviving since the last ice age) in shaping the current genetic variation of A. thaliana. The study revealed that although the relict populations continue to inhabit ancestral habitats, the descendants of one as- yet unidentified ancestral population have expanded broadly and this may have resulted in rapid fixation Introduction of genetic variants due to founder effects. Subsequent The mustard family (Brassicaceae) holds a special place in divergence was reinforced by species-wide genetic the plant kingdom as it contains Arabidopsis thaliana, a incompatibilities. NLR immune receptor genes were key reference species for the green evolutionary lineage among the most influential players in such incompatibili- [1]. Reference species provide powerful functional ties, and may have caused gene flow barriers through insight that, when put in a comparative perspective, autoimmunity-driven hybrid necrosis in intraspecific can also illuminate novel aspects of biology not observed hybrids [7]. Other immunity-related genes, the homologs in the model alone. The mustards offer excellent oppor- of NON-EXPRESSOR OF PR-GENES1 and RECOGNI- tunities to address the genetic basis of phenotypic change TION OF PERENOSPORA PARASITICA 5, interacted at a phylogenetic scale where unique biological questions epistatically to cause an autoimmune response and gene not accessible in A. thaliana can be studied with mecha- flow barriers when combined in hybrids of different nistic rigor using resources pioneered for this important Capsella species [8 ]. model over the last four decades. The Brassicaceae exhibit considerable diversity at the morphological, phys- At a higher taxonomic level, coding sequence variation in iological, and biochemical level [2] (Figure 1). A robust the genus Arabidopsis was harnessed to reveal support for www.sciencedirect.com Current Opinion in Plant Biology 2017, 36:119–128 120 Genome studies and molecular genetics Figure 1 Capsell a rubella self-compa tibility, speciation Capsell a grandiflora demographic history Camelina sativa oilseed crop Arabidopsis halleri metal hyperaccumulator Arabidopsis lyrata self-compa tibility, local adaptation Arabidopsis thali ana wide range of biological questions Cardamin e hirsuta leaf and fruit morphology Roripp a aquatica hygrophyte, heterophylly Leavenworthi a alabamica mating systems transitions Boec hera stricta biotic interactions, phytochemistry Boec hera retrofracta apomixis, chromosome evolution Lepidiu m meyenii high altitude adaptations Brassica rapa major crops, diverse morphotypes Brassica oleracea major crops, diverse morphotypes Sisymbriu m irio invasive weed Schre nkiella parvula extremophil e Thlaspi arvense biofuel crop Eutrema salsugineum halophyte Noccaea caerulescens phytoremedia tor Arabis alpin a perenniality, biotic interactions Aethionema arabicum fruit and seed dimorphism Current Opinion in Plant Biology Phylogenetic relationships of Brassicaceae species with abundant sequencing resources. Specific questions addressed using these models are listed in blue. Panels depict diversity in leaf and fruit morphology of selected species. seven clusters corresponding to the currently recognized more recently than previously thought and its placement seven Arabidopsis species [9 ]. The relationship among may reflect fixation of genetic variation derived from species based on individual gene trees varied significantly ancient admixture. In this case, the unique genomic likely due to extensive hybridization, and only A. thaliana features of A. thaliana, like the basic chromosome number could be reliably placed as a sister to the rest of the genus of 5, and its derived traits, such as self-compatibility and [10]. Interestingly, allele distribution test (ABBA-BABA) annual habit, must have arisen more rapidly than previ- taking into account population structure showed that A. ously thought. This study also identified a set of 129 genes thaliana was more similar to Arabidopsis lyrata than to containing shared ancestral polymorphisms in at least two Arabidopsis arenosa, which is unexpected for its sister of the three most common Arabidopsis species, which position. This suggests that A. thaliana became reproduc- were enriched for viral-related functions and could indi- tively isolated from the outcrossing Arabidopsis species cate balancing selection. Current Opinion in Plant Biology 2017, 36:119–128 www.sciencedirect.com Genetic basis of evolutionary change Nikolov and Tsiantis 121 Gene discovery by de novo assembly overrepresentation of transcription factors and tandemly Although single nucleotide polymorphism data provide a duplicated genes among the differentially expressed powerful window into the distribution of small-scale genes between these two species [19 ]. A. thaliana and sequence variation, some genomic features, like collinear C. hirsuta exhibit markedly different leaf shapes and such and rearranged variation, require high-quality genomes changes may have contributed to trait divergence assembled without reference guidance. The genome of between these two species. This comparative transcrip- another commonly used ecotype of A. thaliana, Landsberg tomic approach led to the discovery of novel leaf shape erecta, was assembled from short-reads and PacBio regulators, the stem cell fate transcription factors PLETH- sequencing data using a genetic map scaffold [11]. The ORA5 (PLT5) and PLT7, which have not been previously complete assembly revealed large-scale rearrangements implicated in interspecific divergence. PLT5/7 were including 1.2-Mb inversion on chromosome 4 and several shown to be necessary (and PLT7—sufficient in A. thali- hundred unique genes, along with genes exhibiting acces- ana) to increase leaf complexity in the complex leaved C. sion-specific duplication or an asymmetric loss of a para- hirsuta [19 ]. Clade-specific expansions with potentially log. Such dynamic gene gain and loss processes have been adaptive roles were detected in the metal hyperaccumu- documented broadly throughout Brassicaceae and under- lation pathways [20], stress-tolerance genes [21], the score the importance of ‘total evidence’ in genomics: the photoperiod regulator CONSTANS [22], and some gluco- pangenome as the sum of core genes shared by all sinolate synthesis genes [23]. In the heavy metal hyper- accessions and accession-specific genes. In A. thaliana accumular Noccaea caerulescens, there is also evidence that in particular, lineage-specific genes arose after duplica- structural rearrangements may have played a role to group tion and subsequent divergence, retrotransposition, and together functionally similar