Using Mustard Genomes to Explore the Genetic Basis Of
Total Page:16
File Type:pdf, Size:1020Kb
Available online at www.sciencedirect.com ScienceDirect Using mustard genomes to explore the genetic basis of evolutionary change Lachezar A Nikolov and Miltos Tsiantis Recent advances in sequencing technologies and gene physiological, and biochemical level [2] (Figure 1). A manipulation tools have driven mustard species into the robust phylogenetic framework, which is a prerequisite spotlight of comparative research and have offered powerful for the comparative approach, is coming into focus and insight how phenotypic space is explored during evolution. currently comprises 49 tribes [3] and several monophy- Evidence emerged for genome-wide signal of transcription letic subfamilial groupings [4]. Brassicaceae is the plant factors and gene duplication contributing to trait divergence, family with the highest number of species with e.g., PLETHORA5/7 in leaf complexity. Trait divergence is often sequenced genomes. Many Brassicaceae species are manifested in differential expression due to cis-regulatory genetically and biochemically tractable. Collectively, divergence, as in KNOX genes and REDUCED COMPLEXITY, these qualities create a powerful platform to link geno- and can be coupled with protein divergence. Fruit shape in type to phenotype and address how diverse traits evolve. Capsella rubella results from anisotropic growth during three Here we focus on recent studies employing comparative distinct phases. Brassicaceae exhibit novel fruit dispersal genomics to summarize patterns of genomic variation and strategy, explosive pod shatter, where the rapid movement to understand the genetic basis of phenotypic change in depends on slow build-up of tension and its rapid release mustards. facilitated by asymmetric cell wall thickenings. Genomic advances at different evolutionary Address scales Department of Comparative Development and Genetics, Max Planck Mustards on the map: intra-specific and inter-specific Institute for Plant Breeding Research, Cologne 50829, Germany variation Corresponding author: Tsiantis, Miltos ([email protected]) After the completion of the reference sequence of A. thaliana (Col-0) [5], much effort has been invested to re-sequence accessions throughout its geographic Current Opinion in Plant Biology 2017, 36:119–128 range to gain insight into the genomic variation and This review comes from a themed issue on Genome studies and evolutionary history of the species. The most recent molecular genetics report included 1135 naturally inbred lines and featured Edited by Ian Henderson and Korbinian Schneeberger the densest variant map available for any complex eukary- For a complete overview see the Issue and the Editorial ote [6 ]. It provided unprecedented resolution of the pattern of polymorphisms, insight into the demographic Available online 8th March 2017 history of the species, and highlighted the formative role http://dx.doi.org/10.1016/j.pbi.2017.02.005 of climate (in particular, the last glacial period) and 1369-5266/ã 2017 Published by Elsevier Ltd. geography (Iberian Peninsula as a hotspot of relict popu- lations surviving since the last ice age) in shaping the current genetic variation of A. thaliana. The study revealed that although the relict populations continue to inhabit ancestral habitats, the descendants of one as- Introduction yet unidentified ancestral population have expanded broadly and this may have resulted in rapid fixation The mustard family (Brassicaceae) holds a special place in of genetic variants due to founder effects. Subsequent the plant kingdom as it contains Arabidopsis thaliana, a divergence was reinforced by species-wide genetic key reference species for the green evolutionary lineage incompatibilities. NLR immune receptor genes were [1]. Reference species provide powerful functional among the most influential players in such insight that, when put in a comparative perspective, incompatibilities, and may have caused gene flow barriers can also illuminate novel aspects of biology not observed through autoimmunity-driven hybrid necrosis in intraspe- in the model alone. The mustards offer excellent oppor- cific hybrids [7]. Other immunity-related genes, the tunities to address the genetic basis of phenotypic change homologs of NON-EXPRESSOR OF PR-GENES1 and at a phylogenetic scale where unique biological questions RECOGNITION OF PERENOSPORA PARASITICA 5, not accessible in A. thaliana can be studied with mecha- interacted epistatically to cause an autoimmune response nistic rigor using resources pioneered for this important and gene flow barriers when combined in hybrids of model over the last four decades. The Brassicaceae different Capsella species [8 ]. exhibit considerable diversity at the morphological, www.sciencedirect.com Current Opinion in Plant Biology 2017, 36:119–128 120 Genome studies and molecular genetics Figure 1 Capsell a rubella self-compa tibility, speciation Capsell a grandiflora demographic history Camelina sativa oilseed crop Arabidopsis halleri metal hyperaccumulator Arabidopsis lyrata self-compa tibility, local adaptation Arabidopsis thali ana wide range of biological questions Cardamin e hirsuta leaf and fruit morphology Roripp a aquatica hygrophyte, heterophylly Leavenworthi a alabamica mating systems transitions Boec hera stricta biotic interactions, phytochemistry Boec hera retrofracta apomixis, chromosome evolution Lepidiu m meyenii high altitude adaptations Brassica rapa major crops, diverse morphotypes Brassica oleracea major crops, diverse morphotypes Sisymbriu m irio invasive weed Schre nkiella parvula extremophil e Thlaspi arvense biofuel crop Eutrema salsugineum halophyte Noccaea caerulescens phytoremedia tor Arabis alpin a perenniality, biotic interactions Aethionema arabicum fruit and seed dimorphism Current Opinion in Plant Biology Phylogenetic relationships of Brassicaceae species with abundant sequencing resources. Specific questions addressed using these models are listed in blue. Panels depict diversity in leaf and fruit morphology of selected species. At a higher taxonomic level, coding sequence variation in position. This suggests that A. thaliana became reproduc- the genus Arabidopsis was harnessed to reveal support for tively isolated from the outcrossing Arabidopsis species seven clusters corresponding to the currently recognized more recently than previously thought and its placement seven Arabidopsis species [9 ]. The relationship among may reflect fixation of genetic variation derived from species based on individual gene trees varied significantly ancient admixture. In this case, the unique genomic likely due to extensive hybridization, and only A. thaliana features of A. thaliana, like the basic chromosome number could be reliably placed as a sister to the rest of the genus of 5, and its derived traits, such as self-compatibility and [10]. Interestingly, allele distribution test (ABBA-BABA) annual habit, must have arisen more rapidly than previ- taking into account population structure showed that A. ously thought. This study also identified a set of thaliana was more similar to Arabidopsis lyrata than to 129 genes containing shared ancestral polymorphisms Arabidopsis arenosa, which is unexpected for its sister in at least two of the three most common Arabidopsis Current Opinion in Plant Biology 2017, 36:119–128 www.sciencedirect.com Genetic basis of evolutionary change Nikolov and Tsiantis 121 species, which were enriched for viral-related functions expanded or unique gene families compared to seven and could indicate balancing selection. other mustard species. Comparative leaf transcriptomes of A. thaliana and C. hirsuta revealed similar overrepre- Gene discovery by de novo assembly sentation of transcription factors and tandemly duplicated Although single nucleotide polymorphism data provide a genes among the differentially expressed genes between powerful window into the distribution of small-scale these two species [19 ]. A. thaliana and C. hirsuta exhibit sequence variation, some genomic features, like collinear markedly different leaf shapes and such changes may and rearranged variation, require high-quality genomes have contributed to trait divergence between these two assembled without reference guidance. The genome of species. This comparative transcriptomic approach led to another commonly used ecotype of A. thaliana, Landsberg the discovery of novel leaf shape regulators, the stem cell erecta, was assembled from short-reads and PacBio fate transcription factors PLETHORA5 (PLT5) and PLT7, sequencing data using a genetic map scaffold [11]. The which have not been previously implicated in interspe- complete assembly revealed large-scale rearrangements cific divergence. PLT5/7 were shown to be necessary (and including 1.2-Mb inversion on chromosome 4 and several PLT7—sufficient in A. thaliana) to increase leaf complex- hundred unique genes, along with genes exhibiting acces- ity in the complex leaved C. hirsuta [19 ]. Clade-specific sion-specific duplication or an asymmetric loss of a para- expansions with potentially adaptive roles were detected log. Such dynamic gene gain and loss processes have been in the metal hyperaccumulation pathways [20], stress- documented broadly throughout Brassicaceae and under- tolerance genes [21], the photoperiod regulator score the importance of ‘total evidence’ in genomics: the CONSTANS [22], and some glucosinolate synthesis genes pangenome as the sum of core genes shared by all [23]. In the heavy metal hyperaccumular Noccaea accessions and accession-specific genes. In A. thaliana caerulescens,