Transcriptomic Analysis of Eruca Vesicaria Subs
Total Page:16
File Type:pdf, Size:1020Kb
Huang et al. BMC Plant Biology (2019) 19:419 https://doi.org/10.1186/s12870-019-1997-2 RESEARCH ARTICLE Open Access Transcriptomic analysis of Eruca vesicaria subs. sativa lines with contrasting tolerance to polyethylene glycol-simulated drought stress Bang-Lian Huang1,2, Xuan Li1, Pei Liu1, Lan Ma1, Wenhua Wu1, Xuekun Zhang3, Zaiyun Li4 and Bangquan Huang1* Abstract Background: Eruca vesicaria subsp. sativa is one of the Cruciferae species most tolerant to drought stress. In our previous study some extremely drought-tolerant/sensitive Eruca lines were obtained. However little is known about the mechanism for drought tolerance in Eruca. Methods: In this study two E. vesicaria subs. sativa lines with contrasting drought tolerance were treated with liquid MS/PEG solution. Total RNA was isolated from 7-day old whole seedlings and then applied to Illumina sequencing platform for high-throughput transcriptional sequencing. Results: KEGG pathway analysis indicated that differentially expressed genes (DEGs) involved in alpha-Linolenic acid metabolism, Tyrosine metabolism, Phenylalanine, Tyrosine and tryptophan biosynthesis, Galactose metabolism, Isoquinoline alkaloid biosynthesis, Tropane, Piperidine and pyridine alkaloid biosynthesis, Mineral absorption, were all up-regulated specifically in drought-tolerant (DT) Eruca line under drought stress, while DEGs involved in ribosome, ribosome biogenesis, Pyrimidine metabolism, RNA degradation, Glyoxylate and dicarboxylate metabolism, Aminoacyl-tRNA biosynthesis, Citrate cycle, Methane metabolism, Carbon fixation in photosynthetic organisms, were all down-regulated. 51 DEGs were found to be most significantly up-regulated (log2 ratio ≥ 8) specifically in the DT line under PEG treatment, including those for ethylene-responsive transcription factors, WRKY and bHLH transcription factors, calmodulin-binding transcription activator, cysteine-rich receptor-like protein kinase, mitogen- activated protein kinase kinase, WD repeat-containing protein, OPDA reductase, allene oxide cyclase, aquaporin, O- acyltransferase WSD1, C-5 sterol desaturase, sugar transporter ERD6-like 12, trehalose-phosphate phosphatase and galactinol synthase 4. Eight of these 51 DEGs wre enriched in 8 COG and 17 KEGG pathways. Conclusions: DEGs that were found to be most significantly up-regulated specifically in the DT line under PEG treatment, up-regulation of DEGs involved in Arginine and proline metabolism, alpha-linolenic acid metabolism and down-regulation of carbon fixation and protein synthesis might be critical for the drought tolerance in Eruca. These results will be valuable for revealing mechanism of drought tolerance in Eruca and also for genetic engineering to improve drought tolerance in crops. Keywords: Eruca vesicaria subs. Sativa, Genotypes, Drought tolerance, RNA-seq, Gene expression * Correspondence: [email protected] 1Hubei Collaborative Innovation Center for Green Transformation of Bio-Resources, College of Life Science, Hubei University, Wuhan 430062, China Full list of author information is available at the end of the article © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Huang et al. BMC Plant Biology (2019) 19:419 Page 2 of 11 Background Table 1 Illumina RNA-Seq reads and de novo assembly statistics Drought is one of the most important environmental of Eruca transcriptome stresses in the world [1–3]. Previous studies indicated that Total number of raw reads 314,322,210 plant responses to drought stress is quite complicated and Total number of clean reads 254,660,002 exploring drought tolerance mechanism in plants is still a Mean length of reads (bp) 90 challenge [4–7]. Next generation sequencing (NGS) using Number of total unigenes 430,166 Illumina HiSeq2000 covers the transcriptome many folds and allows quantification of the detected transcripts [8, 9]. Mean length of unigenes (bp) 511.56 By using next generation sequencing the gene expression is Minimum unigene length (bp) 201 quantified, making it possible for quantitative comparisons Maximum unigene length (bp) 16,517 [10]. Many researchers have used transcriptome sequencing N50 (bp) 638 – to study the responses to drought stress in crops [10 16]. Unigenes annotated to Nr 139,090 Biological and genetic diversity exists among and within Unigenes annotated to KEGG 40,463 plant species regarding drought tolerance, however many of these adaptive mechanisms are not completely under- Unigenes annotated to COG 105,052 stood. Genome-wide identification of drought-responsive − transcriptions using genotypes with contrasting drought- value < 10 5)(Fig.1). Among the unigenes 83,392 had tolerance will help revealing interactions among metabolic identity between 22.86–80% and 55,698 higher than 80%. pathways in response to drought stress [12]. Eruca vesi- Regarding the species distribution, 83,034 unigene caria subsp. sativa is one of the Cruciferae species most (59.70%) had similarity between 19.32–80%, 56, 056 uni- tolerant to drought stress [17, 18]. In our previous study gene (40.30%) had similarity higher than 80%. Of the 139, some extremely drought-tolerant/sensitive Eruca lines were 090 unigenes annotated to Nr, 36,254 (26.06%) had obtained [19]. In this study, Illumina sequencing technology matches to sequences from Brassica napus, followed by was used to identify the differentially expressed genes that from Brassica rapa (27,894, 20.05%), Eutrema salsugi- (DEGs) and their biochemical pathways related to drought neum (5366, 3.86%), Hordeum vulgare (3578, 2.57%), Ara- tolerance in drought tolerant (DT) versus drought- bidopsis thaliana (3391, 2.44%), Physcomitrella patens susceptible (DS) Eruca genotypes. Expression level of some (3289, 2.36%), Camelina sativa (3249, 2.34%) (Fig. 2). selected drought-responsive genes was validated by using quantitative RT-PCR. To our knowledge,thisisthefirstre- COG annotation and enrichment port about the whole transcriptome analysis in E. vesicaria In total 105,052 unigenes were anotated to COG (Clusters subsp. sativa. The results provide transcriptome data valu- of Orthologous Groups of proteins) database and assigned able for understanding drought tolerance mechanism in to 25 COG functional clusters (Fig. 3), among which the Eruca and the differentially expressed genes could be valu- “general function prediction only” cluster comprised the able for improving drought tolerance in crops. highest number of unigenes (15,472, 14.73%), followed by the cluster “translation, ribosomal structure and biogen- Results esis” (14,062, 13.39%) and “Posttranslational modification, Overview of Illumina RNA sequencing and de novo protein turnover, chaperones” (10,244, 9.75%). By contrast, assembly of E. vesicaria subs. sativa transcriptome only 110 unigenes were classified into “Cell motility”,98 In total 314,322,210 raw reads after sequencing produced unigenes classified into “Nuclear structure” and 3 uni- 254,660,002 clean reads including 430,166 unique genes with genes classified into “Extracellular structures” (Fig. 3). N50 value of 638 bp and average length of 511.56 bp. The unigene length ranged from 201 to 16, 517 bp (Table 1). The unigenes included 94,463 clusters and 335,703 sin- GO anotation and enrichment gletons. Among the unigenes there were 308,310 (71.67%) GO (Gene ontology) with Nr annotation indicated that with length from 200 to 500 bp, 76,255 (17.73%) with 4953 unigenes were classified into 69 groups that could length from 500 bp to 1000 bp, 24,662 (5.73%) with length be categorized into three main classifications: “biological from 1000 bp to 1500 bp, 11,229 (2.61%) with length from process” (2859), “cellular component” (643) and “mo- 1500 bp to 2000 bp, 9709 (2.26%) with length longer than lecular function” (1451). In the “molecular function” 2000 bp, and no unigenes shorter than 200 bp (Table 2). class the largest number of unigenes (984) were involved Based on Nr annotation and the E-value distribution, it in “catalytic activity”; in the “cellular component” class was found that among the annotated unigenes 51,486 the largest number of unigenes (168) were involved in − (37.02%) shared very strong homology (E-value < 10 60), “cell part”; in the “biological process” class the largest − 31,700 (22.80%) had strong homology (10 60 <Evalue < number of unigenes (534) were involved in “metabolic − − 10 30) and 55, 904 (40.19%) showed homology (10 30 <E- process” (Additional file 1). Huang et al. BMC Plant Biology (2019) 19:419 Page 3 of 11 Table 2 Length distribution of the Unigenes Length (nt) Total < 200 200–500 500–1000 1000–1500 1500–2000 ≥2000 Number of Unigenes 430,166 0 308,310 76,255 24,662 11,229 9709 % 100 0.00 71.67 17.73 5.73 2.61 2.26 KEGG pathway annotation and unigene enrichment metabolism, 39 in Plant hormone signal transduction, 21 In total 28,295 unigenes were annotated to 344 KEGG path- in Phenylpropanoid biosynthesis, 13 in alpha-Linolenic ways (Additional file 2). The number of unigenes in different acid metabolism and 41 in D-Glutaminehe