<<

Botany

Transcriptomic analysis of the heat-stress response in Boechera depauperata and reveals a distinct and unusual heat-stress response in Boechera.

Journal: Botany

Manuscript ID cjb-2020-0014.R2

Manuscript Type: Article

Date Submitted by the 21-May-2020 Author:

Complete List of Authors: Pierce, Ian; San Diego State University, Biology Halter, Gillian; San Diego State University, Biology Waters, ElizabethDraft R. ; San Diego State University, Biology Keyword: Arabidopsis, heat-stress, transcriptomics, RNA-seq

Is the invited manuscript for consideration in a Special Not applicable (regular submission) Issue? :

https://mc06.manuscriptcentral.com/botany-pubs Page 1 of 37 Botany

Title: Transcriptomic analysis of the heat stress response in Boechera depauperata and

Arabidopsis reveals a distinct and unusual heat stress response in Boechera.

Authors: Ian Piercea, Gillian Haltera, and Elizabeth R. Watersa, *

Affiliations:

a Department of Biology, San Diego State University, San Diego, CA. 92182

* Corresponding author: [email protected] Draft

1 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 2 of 37

Abstract: Global surface temperatures are expected to rise throughout the 21st century

and will negatively impact growth and reproduction. Thus, it is imperative that we

deepen our understanding of plant thermotolerance. The examination of native plant

species that have evolved tolerance to high temperatures can provide crucial

information on how can adapt to climate change. Boechera (), a

large that is native to North America, is highly thermotolerant and can maintain

photosynthetic activity at high temperatures. Here we report results of transcriptomic

studies that seek to reveal possible thermotolerance mechanisms in B. depauperata.

Analysis of RNA-seq datasets from heat stressed B. depauperata and Arabidopsis

thaliana plants identified significant differences in how each of these species responds

to identical heat stress conditions. TheDraft most highly upregulated heat-stress in A. thaliana includes the well-characterized heat-shock genes. In contrast, the Boechera heat stress response is composed of: novel genes that lack orthologs in other genomes, genes coding for of uncharacterized function, and genes coding for proteins associated with the unfolded and ER stress responses. In addition, genes that are protective of photosynthetic capacity are also differentially upregulated in B. depauperata.

Key words: Arabidopsis, Boechera, heat stress, transcriptomics, RNA-seq, orphan genes.

2 https://mc06.manuscriptcentral.com/botany-pubs Page 3 of 37 Botany

Introduction

Plants are sessile organisms, and this means that wherever a seed germinates

the growing plant must survive to produce the next generation or that lineage becomes

a dead-end. Thus, tolerance to stress is a crucial aspect of plant success. As our

climate changes the severity and duration of heatwaves are expected to increase

(Stocker et al. 2013). Soon, intensifying heat stress will have a greater impact on plant

growth; and this stress will challenge the extent of plant adaptation and tolerance to

heat-stress. This, in turn, puts both crop productivity and the survival of native plant

species at risk. Thermotolerance is a complex and crucial trait for plant growth and

reproduction and its importance in plants has motivated a large number of studies in

crop and model plant species. The best-studiedDraft model species is .

Much of what we know about thermotolerance, the heat shock response, and the heat

shock proteins in plants is based on studies in Arabidopsis. The enormous body of

literature on Arabidopsis and heat stress is impossible to cite here. However, there have

been a few seminal papers that provide a mechanistic understanding of

thermotolerance in Arabidopsis, in particular, and in all plants, in general (Larkindale et

al. 2005; Kotak et al. 2007; Larkindale and Vierling 2008; Saidi et al. 2011; Mittler et al.

2012). What we know about heat stress in Arabidopsis (and for most plants as well) is

that exposure to a short-term (few hours) of moderate heat stress at 38°C is not lethal

and that this stress induces the expression of the heat shock proteins (HSPs) (Vierling

1991; Waters et al. 1996; Queitsch et al. 2000; Kotak et al. 2007). The HSPs are

molecular chaperones and as such, they protect cells from heat-induced damage (

Hendrick and Hartl 1993; Haslbeck and Buchner 2002; Haslbeck et al. 2005; Hilton et

3 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 4 of 37

al. 2013; Kim et al. 2013). However, there is an upper limit of heat stress above which

(for most plants including Arabidopsis this is above 44° or 45°C) even the HSPs cannot provide protection and cell death occurs (Larkindale et al. 2005; Kotak et al. 2007).

While it is recognized that the heat shock proteins are a crucial component of thermotolerance and are one the best studied responses to stress, it is now widely understood that the heat shock proteins (HSPs) comprise only one component of thermotolerance (Hanson et al. 1999; Sharkey 2000; Larkindale et al. 2005; Kotak et al.

2007; Larkindale and Vierling 2008; Sharkey and Zhang 2010) see also (Mittler et al.

2012; Ohama et al. 2017; Scheepens et al. 2018). Most of the thermotolerance studies are conducted using standard thermotolerance methodologies as described in (Yeh et al. 2012) and use Arabidopsis as theirDraft study species.

Recently, the importance of understanding thermotolerance for plant conservation has been acknowledged and this has stimulated thermotolerance studies in a wider range of plants (French et al. 2017). Further, it is clear that detailed knowledge of thermotolerance in naturally thermotolerant species will greatly increase our understanding of the physiological and biochemical mechanisms that underlie this important trait. The value of studies in Arabidopsis is that they benefit from a well- curated genome and decades of past studies that provide an important foundation for molecular and physiological studies. However, by only focusing on just one species we risk considerable bias in our understanding of the possible range of thermotolerance levels and in the stress response itself. In our previous studies (Gallas and Waters

2015; Halter et al. 2017; Kannan et al. 2018) we have examined the thermotolerance and stress responses of a group of native California plants in the Boechera genus

4 https://mc06.manuscriptcentral.com/botany-pubs Page 5 of 37 Botany

(Brassicaceae). Boechera is a large genus over 100 species that are native to North

America (Al-Shehbaz 2003; Alexander et al. 2013). The Boechera are found from cool

moist coastal environments to high elevation mountains, to hot and dry deserts (Gallas

and Waters 2015; Halter et al 2017). A phylogenetic analysis of the Brassicaceae

family identified three major lineages (I, II, III) (Franzke et al. 2011). Both the

Arabidopsis and Boechera genera are members of the well-supported Lineage I within

the Brassicaceae. Thus, studies of the Boechera provide both the value of being related

to the best-studied model plant and with it the massive body of knowledge of the

Arabidopsis stress responses, as well as the value of examining plants that have

adapted to varied and extreme natural environments. In our studies, we have found

considerable diversity in levels of abioticDraft stress tolerance among the Boechera species

(Gallas and Waters 2015; Halter et al. 2017). Notably, we found that all the Boechera

species studied were more thermotolerant than Arabidopsis and that among the

Boechera species examined, Boechera depauperata is the most thermotolerant.

Further, our studies utilizing qPCR of the well-studied heat shock genes have shown

that the high levels of thermotolerance in the Boechera are not correlated with high

levels of HSP expression (Gallas and Waters 2015; Halter et al 2017). In addition,

we found that when compared to A. thaliana, B. depauperata has a much higher ability

to protect photosynthetic capacity during heat stress (Halter et al 2017).

In order to fully understand B. depauperata’ high level of thermotolerance, we

have analyzed the heat stress transcriptomes of Boechera depauperata and the related

model plant species Arabidopsis thaliana. Here we report significant differences in the

heat stress response between these two related species. Most of the most highly

5 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 6 of 37

expressed genes in A. thaliana are the well-known heat shock genes. However, the most highly expressed genes in B. depauperata included novel genes not found in A. thaliana or in other genomes. In addition, B. depauperata differentially upregulates genes that are involved in the ER stress response. Finally, there are also significant differences in the patterns of differential expression of the photosynthetic genes between the highly thermotolerant B. depauperata and the less tolerant, but better studied A. thaliana.

Materials and Methods

Plant growth and heat stress experiments:

Boechera depauperata and Arabidopsis thaliana (Columbia ecotype) were grown from seed at control conditions and Draftthen subjected to heat shock. Plant growth, heat stress, and RNA isolation methods were as previously described (Gallas and Waters

2015; Halter et al 2017). Over the last 20-30 years, numerous thermotolerance experiments with Arabidopsis have been conducted. Almost all of these studies have examined either Basal Heat stress, Acquired Heat stress, or both (Larkindale et al.

2005; Larkindale and Vierling 2008; Yeh et al 2012). Basal Heat stress (BHS) is defined as a sudden stress from control (approx. 22°C) to heat stress conditions (38°C or above) (Yeh et al. 2012). This type of stress measures the inherent tolerance to stress, i.e., stress tolerance that is not induced but always present. The second major type of stress tolerance is Acquired Heat stress or (AHS). This is examined by providing a moderate heat stress, usually, 38°C for one to two hours, followed by an hour or two of recovery, followed by more extreme stress (above 42°C) for two to four hours (Yeh et al.

2012). AHS experiments examine the impact of the induction of the heat stress

6 https://mc06.manuscriptcentral.com/botany-pubs Page 7 of 37 Botany

response genes on the ability of plants to tolerate higher temperatures. In all plants

studied to date, there is a higher tolerance to AHS than to a comparable BHS stress.

That is, a BHS of 43°C might be lethal, but an AHS treatment of 43°C that follows a pre-

treatment of 38°C is not lethal. The conditions chosen for this study were based on

previous studies of Boechera and Arabidopsis basal and acquired thermotolerance

(Gallas and Waters 2015; Halter et al. 2017). The BHS temperature (38°C) was chosen

because this is the most common BHS used and it is known to induce the protective

HSPs in Arabidopsis. The AHS temperature was chosen because it is known to induce

stress in Arabidopsis and in some species of Boechera (Gallas and Waters 2015; Halter

et al. 2017). These temperatures are often used in Arabidopsis studies. One of our

goals is to compare the HS responseDraft of Boechera to Arabidopsis and using

temperatures that are standard for this type of study allows this study to be more easily

compared to those in the thermotolerance literature.

Briefly, two heat shock protocols were applied. 1) basal heat stress (BHS) in

which plants were moved from control temperatures (22°C) and then subjected to 38°C

for two hours. 2) Acquired heat stress (AHS) in which plants were subjected to 38°C for

one hour and then returned to control conditions (22°C) for a two-hour recovery period,

and then finally subjected to heat stress at 43°C for two hours (Halter et al. 2017). Leaf

tissue was harvested and flash-frozen in liquid nitrogen RNA immediately following the

heat stress treatments. Experiments were conducted in triplicate. RNA was isolated as

previously described (Halter et al. 2017). Purified mRNA was then used to construct

RNAseq libraries. These libraries were then sequenced and analyzed as described

below.

7 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 8 of 37

Transcriptomic analysis of Arabidopsis heat stress samples was compared to the known Arabidopsis genome (TAIR 10). All gene and protein names are based on the

Arabidopsis thaliana naming conventions found at The Arabidopsis Information

Resource site (https://www.arabidopsis.org/). However, in order to analyze the RNAseq datasets from Boechera depauperata, a reference transcriptome was generated. The transcriptome assembly RNAseq libraries were prepared from a range of life stages at control conditions, and from both control and heat stressed (basal and acquired) tissue with the Illumina TruSeq protocol and normalized with Illumina Duplex-Specific

Thermostable Nuclease (P/N 15014673 Rev. C). The library was sequenced on an

Illumina Hi-Seq 2000 producing 2 x 94 million100bp paired-end reads. RNA-seq libraries were constructed with IlluminaDraft TruSeq (Illumina, San Diego, CA, USA) libraries.

Barcoded libraries from Arabidopsis thaliana and Boechera depauperata were sequenced in a single lane on separate runs of an Illumina Hi-Seq 2000. Sequencing was performed at The Scripps Research Institute (La Jolla, California, USA). All sequences generated and analyzed here are deposited at the NCBI GEO database under accession number (GSE107820).

In summary, there were two sets of RNAseq sequences analyzed: A) RNAseq libraries from B. depauperata tissue (control, heat stress from both seedling and mature plants) that were used to build a de novo B. depauperata transcriptome, and B) RNAseq libraries from B. depauperata and A. thaliana seedlings for control, after BHS, and after

AHS that were used to examine gene expression induced by heat stress.

Statistical and sequence analysis

8 https://mc06.manuscriptcentral.com/botany-pubs Page 9 of 37 Botany

A) Boechera depauperata transcriptome assembly: Raw reads were evaluated with

FastQC (v0.11) trimmed with Skewer (v0.1.120) (Jiang et al. 2014) until a minimum

Phred Q score > 5 was reached. Reads longer than 25bp and with average quality

score Q > 15 were retained. The trimmed paired reads were then assembled de novo

using Trinity (release 2014_07_17) to produce the transcriptome FASTA file. The

completeness of the assembled transcriptome was evaluated with the Core Eukaryotic

Genes Mapping Approach (CEGMA) (Parra et al. 2007). CEGMA identified highly

conserved eukaryotic genes and identified a set of 248 core eukaryotic genes expected

to be expressed in any eukaryotic genome. The CEGMA database was developed from

six model organisms (Homo sapiens, Drosophila melanogaster, Arabidopsis

thaliana, Caenorhabditis elegans, SaccharomycesDraft cerevisiae and Saccharomyces

pombe). The core genes or CEGs were classed into four groups based on the degree of

conservation expected across diverse taxa. The CEGs from group one are expected to

have the lowest level of conservation across taxa and those from group four are

expected to have the highest. A high percentage of coverage of core genes in a de

novo assembly indicates the assembly has complete or near-complete coverage of the

organism’s known biological transcriptional requirements.

Functional annotation of the de novo transcriptome assembly was done using

Blast2GO (version 3.0) (Conesa and Gotz 2008). Blast2GO annotates novel sequences

with terms derived from BLAST hits to annotated non-redundant protein

sequences at NCBI. Top 20 hits (by lowest E-value) with a minimum E-Value < 1E-3

were retained. The transcriptome FASTA file and the BLAST query results were

imported to Blast2GO and the transcriptome was annotated with the following

9 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 10 of 37

parameters: E-value hit filter < 1E-3, annotation cut-off (default =55), HSP-hit coverage cutoff (default = 0), and GO weight (default =5). In addition, InterProScan (Jones et al.

2014) was run from within Blast2GO to identify conserved domains in sequences and the retrieved GO terms were merged. ANNEX (Myhre et al. 2006) was run from within

Blast2GO to apply additional GO terms based on known relationships between

Molecular Function annotations and corresponding Biological Process and Cellular

Component annotations. Annotations were validated using the Validate Annotations tool and GO terms were filtered to remove sequences incompatible with land plants (taxa:

3193, Embryophyta). The gene names for Boechera depauperata are based on the de novo transcriptome and have the format “cXXXX-gXX”. Whenever possible we have annotated the Boechera depauperataDraft sequences with the A. thaliana gene number or that of other known genes in NCBI. Genes that have the annotation, “no known homolog” did not have a homolog in the NCBI database. Genes that are

“uncharacterized” have a homolog, but no function can be assigned at this time.

B) Transcriptomics of heat stressed tissue: In order to evaluate differential heat stress gene expression in Boechera and Arabidopsis RNA-Seq reads were mapped using CLC Genomics Workbench. The RNA-seq reads from plants at control, basal heat shock, and acquired heat shock were imported and mapped to the appropriate reference. For Boechera depauperata the Blast2GO annotated FASTA file was imported and reads were mapped to the de novo transcriptome. Arabidopsis thaliana reads were mapped to the TAIR10 reference genome (https://www.arabidopsis.org). For both species, the default read map settings were used: mismatch cost = 2, insertion

10 https://mc06.manuscriptcentral.com/botany-pubs Page 11 of 37 Botany

cost = 3, deletion cost = 3, length fraction = 0.8, similarity fraction = 0.8, global

alignment = no, strand-specific = both, maximum number of hits for a read = 10,

expression value = total counts and calculate RPKM for genes without exons. More than

98% of Arabidopsis reads that survived quality control mapped to the reference

Arabidopsis genome, including approximately 15% that mapped to intergenic regions.

More than 96% of Boechera reads that survived quality control mapped to the de novo

transcriptome.

Mapped reads from control, basal HS, and acquired HS were tested for

differential expression using the “Empirical Analysis of DGE” tool with parameters: total

count filter cutoff = 5.0, estimate tagwise dispersions, exact test comparisons against all

pairs, and include FDR corrected P-values.Draft Each test was also performed using edgeR

(v3.16.5) (Nikolayeva and Robinson 2014; Robinson et al. 2010) which uses the same

algorithm as the CLC Workbench and the results were identical. A Benjamini-Hochberg

false discovery rate (Reiner et al. 2003) (FDR) P-value < 0.05 was chosen as the

threshold to identify differential expression.

The software GoSeq (version 1.22.0) (Oshlack et al. 2010) was used to test for

GO term enrichment in differentially expressed genes in each experiment as previously

identified with the EDGE test. For the probability weighting function the length of the

transcript (B. depauperata), or the gene model coding sequence length (A. thaliana)

were used for the probability weight. The Sampling method was used with 1000

iterations. Gene ontology terms with an FDR p-value < 0.05 were accepted as over- or

under-enriched. Because of the extremely sensitive nature of transcriptomics

experiments, a minimum threshold number of reads was imposed. Genes that were

11 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 12 of 37

highly upregulated in terms of fold change but not expressed with the minimum read count were not considered.

Previous studies have shown that Boechera is better able to protect photosynthesis during heat stress than is Arabidopsis. In order to examine differential expression of the photosynthetic pathways during heat stress in these two species, we used the programs Mapman (V. 3.51) and Mercator (Lohse et al. 2014; Bolger et al.

2014). To ensure that we could compare the results across these species we used

Mercator to annotate each Boechera gene by its Arabidopsis homolog. The Boechera transcriptome with Arabidopsis gene ids and the Arabidopsis genome were used as the reference genome data for the pathway analysis within Mapman. Draft Results

Boechera transcriptome assembly:

The Boechera transcriptome was assembled into 82,733 contigs containing

80,628,428 bases. The average contig length was 974 base pairs (bp), the assembly

N50 length was 1,548 bp, and GC content was 43.6% (Table 1). The recovery of core genes in the Trinity assembly was high with 245 of 248 genes at least partially matched

(Table 2). The high rate of recovery of the core genes in the transcriptome indicates that it is a good representation of the genes expressed in Boechera under both control and heat stress conditions. The genes in group one are expected to have the lowest level of conservation, and group four is expected to have the highest. Here (Table 2) we see that all groups have high conservation (above 90%). The two missing groups include inorganic Ion transport and GTPases XPA which interacts with DNA repair protein XPA.

These genes are not known to play a significant role in plant heat stress.

12 https://mc06.manuscriptcentral.com/botany-pubs Page 13 of 37 Botany

Differential gene expression during heat stress:

Analysis of RNAseq data indicates similar patterns in the number of genes that

are up and down-regulated during both basal and acquired heat stress in A. thaliana

and B. depauperata (Figure 1). After BHS, A. thaliana upregulated 3,926 genes and

2,012 of these genes were also upregulated after AHS. In A. thaliana 1,906 genes were

only upregulated during BHS. After AHS 2,793 genes were upregulated.

This analysis also revealed that compared to control gene expression patterns 3,832 A.

thaliana genes were down-regulated after BHS and that 2,909 genes were down-

regulated after AHS. Draft

A similar but not identical pattern of gene expression after both heat stress

treatments was seen in B. depauperata. After BHS 2,488 genes were significantly

upregulated in B. depauperata and 3,620 genes were significantly upregulated after

AHS (Figure 1). There were fewer genes significantly down-regulated after BHS

compared to A. thaliana (1,477 genes). However, more genes were significantly down-

regulated in B. depauperata after AHS compared to A. thaliana (2,455 genes). In B.

depauperata only 718 genes were significantly down-regulated after both basal and

acquired HS. This is less than one half the number of genes significantly down-

regulated in A. thaliana after both stresses.

Important differences in the HS responses of A. thaliana and B. depauperata

become apparent when the specific genes that are up and down-regulated are

examined. The analysis of the genes upregulated in response to BHS (Table 3)

13 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 14 of 37

indicates that, as expected, many of the genes induced by BHS in the model plant A. thaliana are the well-studied heat shock genes. For example, of the 50 most upregulated BHS genes (Table 3) 26 are heat shock genes. A distinct and species- specific pattern of heat stress gene expression is found in the highly thermotolerant B. depauperata. Analysis reveals far fewer of the most highly differentially expressed genes induced by BHS in B. depauperata are the heat shock genes (Table 3). Only 11 of the B. depauperata genes are classified as HS genes and most of those are not shared with A. thaliana. A similar pattern can be seen when the genes induced after

AHS are examined (Table 4). From this analysis, it is clear that in A. thaliana19 of the

50 most upregulated AHS genes are heat shock genes (Table 4). Only (6/50) of B. depauperata genes induced after AHSDraft are HSPs (Table 4).

Our data reveals that the vast majority of the genes induced by both basal and acquired heat stress in B. depauperata are not members of the well-studied and characterized families (Tables 3, 4). Most importantly, many of the genes induced by both BHS and AHS in B. depauperata lack homologs in Arabidopsis or other plant species and those that do have homologs have no known function (Table

3 and 4). It is also interesting that one gene that is shared across both HS treatments in both species is galactinol synthase. Galactinol is an osmoprotectant and the gene for galactinol synthase is known to be induced by a variety of stresses including heat

(Sengupta et al. 2015; Jing et al. 2018). Notably, many of the genes that are induced by heat stress in Boechera are involved in the ER-stress response (i.e., e3 ufm1-protein ligase 1; tunicamycin induced protein 1).

14 https://mc06.manuscriptcentral.com/botany-pubs Page 15 of 37 Botany

The response of the photosynthetic pathway genes in B. depauperata to both

basal (Figure 2) and acquired HS (Figure 3) is quite distinct from that of A. thaliana

(Figure 2). In Figure 2 it is clear that while BHS results in the downregulation of many A.

thaliana photosynthetic genes. In B. depauperata after BHS most genes either have no

change in expression or exhibit increases in expression (Fig. 2). For example, two

photosystem II genes (homologs of At1g05385 and At1g5140) have higher gene

expression after BHS in B. depauperata. Further, the B. depauperata genes for CPN60

(homologs of At2g28000, At3g13470) also have much higher gene expression after

BHS. A. thaliana does not share this pattern of photosynthetic gene expression after

BHS (Fig. 2). The upregulation of these genes could play a pivotal role in the ability of

B. depauperata to protect photosynthesisDraft during heat stress.

Examination of the changes in gene expression of photosynthetic genes after

AHS shows similar species-specific patterns (Figure 3). AHS results in the

downregulation of a wide array of A. thaliana photosynthetic genes. It is notable that

after AHS in A. thaliana a large number of genes involved in the light reactions exhibit

significantly decreased gene expression (Fig. 3A). However, after AHS, many B.

depauperata genes involved in both the light and dark reactions are significantly

upregulated after AHS (Fig. 3B). The genes for Photosystem BP-2 (homolog of

At2g30790) have increased expression after AHS. Perhaps most notably, the genes

coding for the CPN60Beta proteins (homologs of At3g13470, At1g55490, At3g13470)

exhibit significantly much higher gene expression compared to control (Figure 3B).

Discussion

15 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 16 of 37

This work was undertaken to better understand previous findings that B. depauperata possesses high organismal thermotolerance and can protect photosynthesis at high temperatures (Gallas and Waters 2015; Halter et al. 2017).

Experiments with A. thaliana and B. depauperata were conducted at the same time in the same chambers with identical methods. This was done to allow direct comparisons of B. depauperata with the well-studied A. thaliana. We have found that under these identical heat stress conditions Arabidopsis thaliana and Boechera depauperata, have distinct gene expression patterns. Not unexpectedly, in A. thaliana a wide array of the heat shock genes are expressed under heat stress (Tables 3-4). In particular, a number of the well-studied small heat shock protein genes (Waters 2013) are upregulated during both basal and acquired heatDraft stress in A. thaliana. Notably, the patterns of gene expression in B. depauperata are largely different from that of A. thaliana (Tables 3-4).

Our findings here, that the heat shock genes including the well-characterized small heat shock genes do not play an important role in the heat shock response of B. depauperata, while surprising considering the large body of research conducted in A. thaliana, is consistent with our previous work in Boechera (Gallas and Waters 2015;

Halter et al. 2017). From this, we can conclude that when exposed to identical conditions B. depauperata is not activating the heat shock transcription factors to the same extent as is A. thaliana. This raises the question of how Boechera “feels the heat”.

The cellular signals that transmit the heat stress signal and alter gene expression have been highly studied in Arabidopsis (Mittler et al. 2012) and our findings indicate

Boechera has a distinct and different genomic response to heat stress. If B. depauperata were activating the HSFs we would expect to see a large number of heat

16 https://mc06.manuscriptcentral.com/botany-pubs Page 17 of 37 Botany

shock protein genes expressed, i.e. the pattern of gene expression that is demonstrated

here and elsewhere for A. thaliana (Scharf et al. 2012). That the HSPs are not strongly

upregulated suggests that in response to these temperatures B. depauperata might be

activating a distinct set of transcription factors. Future detailed studies of transcriptional

networks during heat stress will be necessary to fully understand the stress-induced

transcriptional networks active in B. depauperata.

Novel genes and genes coding for uncharacterized proteins dominate the heat

shock response of B. depauperata. Many of the genes upregulated in response to both

basal and acquired HS in B. depauperata are either found in other plant genomes, but

are of unknown function; or are novel or orphan genes with no known homologs in other

plant genomes (Tables 3, 4). It is wellDraft established that novel genes can evolve from

non-coding regions, and that these novel or orphan genes play important roles in

lineage-specific adaptations to novel environments (Tautz and Domazet-Loso 2011;

Arendsee et al 2014; Schlötterer 2015). The finding here that the heat shock response

of the highly thermotolerant Boechera depauperata includes numerous novel genes

suggests that these genes are crucial for B. depauperata’s thermotolerance. When the

complete genome of B. depauperata and it’s close relatives are available a more

complete analysis of the origin of these novel genes can take place. Functional analysis

will be needed to determine if the novel genes are responsible for the high

thermotolerance found in B. depauperata (Halter et al. 2017).

Not all of the heat-induced B. depauperata genes are novel or uncharacterized.

Some of the B. depauperata heat-induced genes that have a known function are part of

the unfolded protein and ER- stress response pathways. One of the most highly

17 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 18 of 37

upregulated genes in B. depauperata is a gene for UFM1 ligase (an ortholog of

At3g46220). This is an intriguing finding. UFM1 is a protein with a -like fold.

However, very little is known about the targets or role of this protein in plants (Vierstra

2012). Work done in human cells suggests that UFM1 has a role in apoptosis and in ER stress (Tatsumi et al. 2010; Lemaire et al. 2011). Another gene that is highly heat- induced in B. depauperata codes for Tunicamycin-induced protein (TIN1, an ortholog of

AT5G64510). This gene is known to be induced by ER stress and other abiotic stresses

such as salt (Crosti et al. 2001; Iwata et al. 2010; Iwata and Koizumi 2012; Ozgur et al.

2014). The induction of both UFM1-ligase and TIN1 suggests that in B. depauperata

parts of the ER-stress and Unfolded protein response (UFR) pathways may be

important for the acquisition of thermotoleranceDraft in B. depauperata.

ER stress and the UFR pathways can be induced by a buildup of unfolded

proteins and this can be caused by a variety of stresses including abiotic stresses such

as heat stress (Che et al. 2010; Deng et al. 2013). The UFR is a complex response that

increases both protein degradation and folding capacity in the ER, thus reducing the

number of unfolded proteins in the cell (Vitale and Boston 2008; Howell 2013; Bao and

Howell 2017). ER stress can result in autophagy. During autophagy cell components

including organelles are sent to the lysosome for degradation (Liu and Howell 2016;

Yang et al. 2016). This process recycles damaged cell components and has been

described as a “cell-sparing process” (Howell 2013). Recently a number of publications

have focused on the roles of diverse E3 ligases on the control of chloroplast quality and

autophagy (Woodson et al. 2015; Woodson 2016, 2019; Otegui 2018; Zhang et al.

2018; Peng et al. 2019). Here we have demonstrated that components of both the ER

18 https://mc06.manuscriptcentral.com/botany-pubs Page 19 of 37 Botany

stress and UFR pathway are upregulated in B. depauperata during heat stress. While

only parts of these pathways are upregulated, our findings suggest that it is possible

that chloroplast autophagy may play a role in the HS response of B. depauperata. At

this time our data is only suggestive, but it does point a direction towards future studies.

Further experimental studies will need to be conducted to determine the physiological

importance of the transcriptomic data presented here.

The photosynthesis pathways displayed in (Figure 2,3) reveal major differences

between the heat stress response in Arabidopsis and B. depauperata. It has been

previously reported that B. depauperata can maintain high levels of chlorophyll fluorescence during heat stress (GallasDraft and Waters 2015; Halter et al. 2017). It is well established that photosynthesis is highly sensitive to heat stress and that both

Photosystem II and Rubisco Activase are two of the most heat-sensitive components of

the photosynthetic machinery (Salvucci et al. 2001; Sharkey 2005; Zhang and Sharkey

2009; Sharkey and Zhang 2010). When genes associated with the carbon-fixation

reactions were examined it is clear that CPN60BETA2 ( 60) is highly

upregulated in Boechera under both 38°C and 43C stress conditions (Fig. 2B, 3B). We

know that Rubisco Activase (RCA) is highly sensitive to heat stress and may be one of

the limiting factors in photosynthetic activity during heat stress (Salvucci et al. 2001).

Further, it is known that CPN60 has been shown to protect RCA during heat stress

(Ciarbelli et al. 2008). The genes for CPN60 were not differentially regulated in

Arabidopsis under either basal or acquired heat stress (Fig. 2A, 3A). This suggests that

Boechera is using CPN60 to differentially protect RCA during heat stress which in turn

may allow photosynthesis to continue during heat stress conditions.

19 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 20 of 37

Boechera depauperata is also differentially regulating genes associated with the

light reactions of photosynthesis during heat stress. Specifically, we have shown that

during both basal and acquired HS B. depauperata upregulates the gene for

Photosystem II subunit P-2 (PSBP-2: At2g30790) (Figure 3). As with CPN60, the genes

for PSBP-2 are not differentially expressed during heat stress in A. thaliana (Fig. 2A,

Fig. 3A). It is well known that PsbP2 is a crucial component of Photosystem II and that it

is required for the proper formation of thylakoid membranes and for photoautotrophy (Yi

et al. 2007; Ido et al. 2009; Roose et al. 2016). PSII is a multi-subunit complex that

initiates electron transport in the chloroplasts and is highly sensitive to damage due to

light (photoinhibition). When this occurs PSII repair requires new components and thus

increased gene expression (NickelsenDraft and Rengstl 2013; Xu et al. 2020). The results

presented here strongly suggest that further detailed studies of the photosynthetic

machinery in Boechera during high-temperature stress should be focused on the process of PSII repair during heat stress.

In conclusion, analysis of RNAseq libraries generated from basal and acquired heat stressed tissue from Arabidopsis thaliana and Boechera depauperata revealed that these two species (both members of the Brassicaceae) have highly distinct and largely non-overlapping heat stress responses. As expected the heat stress response of A. thaliana is largely composed of the heat shock proteins, especially the small heat shock proteins. The heat stress response of B. depauperata is dominated by novel genes,

genes of unknown function, PSBP-2 genes, CPN-60 genes, as well as genes that are

part of the ER and/or unfolded protein responses. Together these findings suggest that

B. depauperata has evolved a unique response to heat stress that combines novel

20 https://mc06.manuscriptcentral.com/botany-pubs Page 21 of 37 Botany

genes and parts of existing pathways. This novel combination may be responsible for B.

depauperata’s ability to withstand high-temperatures and protect photosynthesis during

heat stress. Further study of the heat stress response in Boechera depauperata could

open new pathways towards our understanding of how plants can adapt to high

temperature stress and may assist in developing heat-resistant crops.

Acknowledgements

We would like to thank the editors of Botany and two anonymous reviewers who

provided extremely helpful comments and suggestions on previous versions of this

manuscript. This work was supported by a grant from the National Science Foundation

(IOS 0920611) to E.R. Waters. Ian DraftPierce was partially supported by a National

Science Foundation (DUE 0966391) Bioinformatics Statistical Informatics Scholarship.

21 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 22 of 37

Figure 1. Overlap genes that are differentially expressed under basal and acquired heat-stress. Values are shown for A. Arabidopsis thaliana (A.t) and B. Boechera depauperata (B.d). Basal Heat-stress Up-regulated Genes (BHS-U). Basal Heat-stress Down-regulated Genes (BHS-D). Acquired Heat-stress Up-regulated Genes (AHS-U). Acquired Heat-stress Down-regulated Genes (AHS-D).

Figure 2. Impact of basal heat-stress on the expression of genes in photosynthetic Pathways. Each box represents a differentially expressed genes. Boxes shaded blue are upregulated genes, red are downregulated. The relative intensity of the colors indicates the intensity of differential expression compared to control. Plotted values are EDGE test adjusted fold changes, the scale is in the top left corner. A. Arabidopsis thaliana. B. Boechera depauperata.

Figure 3. Impact of acquired heat-stress on the expression of genes in photosynthetic pathways. Each box represents a differentially expressed genes. Boxes shaded blue are upregulated genes, red are downregulated. The relative intensity of the colors indicates the intensity of differential expression compared to control. Plotted values are EDGE test adjusted fold changes, the scale is in the top left corner. A. Arabidopsis thaliana. B. Boechera depauperata. Draft

22 https://mc06.manuscriptcentral.com/botany-pubs Page 23 of 37 Botany

References

Al-Shehbaz, I.A. 2003. Transfer of most North American species of to Boechera (Brassicaceae). Novon, 13: 381-391.

Alexander, P.J., Windham, M.D., Beck, J.B., Al-Shehbaz, I.A., Allphin, L., and Bailey, C.D. 2013. Molecular Phylogenetics and of the Genus Boechera and Related Genera (Brassicaceae: Boechereae). Syst. Bot. 38(1): 192-209. doi: 10.1600/036364413x661917.

Bao, Y., and Howell, S.H. 2017. The Unfolded Protein Response Supports Plant Development and Defense as well as Responses to Abiotic Stress. Front. Plant Sci. 8: 344. doi: 10.3389/fpls.2017.00344.

Bolger, A.M., Lohse, M., and Usadel, B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics, 30(15): 2114-2120. doi: 10.1093/bioinformatics/btu170.

Che, P., Bussell, J.D., Zhou, W., Estavillo, G.M., Pogson, B.J., and Smith, S.M. 2010. Signaling from the endoplasmic reticulum activates brassinosteroid signaling and promotes acclimation to stress in Arabidopsis. Sci. Signal. 3(141): ra69. doi: 10.1126/scisignal.2001140. Draft Ciarbelli, A.R., Ciolfi, A., Salvucci, S., Ruzza, V., Possenti, M., Carabelli, M., et al. 2008. The Arabidopsis homeodomain-leucine zipper II gene family: diversity and redundancy. Plant Mol. Biol. 68(4-5): 465-478. doi: 10.1007/s11103-008-9383-8.

Conesa, A., and Gotz, S. 2008. Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int. J. Plant Genomics, 2008: 619832. doi: 10.1155/2008/619832.

Crosti, P., Malerba, M., and Bianchetti, R. 2001. Tunicamycin and Brefeldin A induce in plant cells a programmed cell death showing apoptotic features. Protoplasma, 216(1-2): 31-38.

Deng, Y., Srivastava, R., and Howell, S.H. 2013. Endoplasmic reticulum (ER) stress response and its physiological roles in plants. Int. J. Mol. Sci. 14(4): 8188-8212. doi: 10.3390/ijms14048188.

Franzke, A., Lysak, M.A., Al-Shehbaz, I.A., Koch, M.A., and Mummenhoff, K. 2011. Cabbage family affairs: the evolutionary history of Brassicaceae. Trends Plant Sci. 16(2): 108-116. doi: 10.1016/j.tplants.2010.11.005.

French, K., Robinson, S.A., and Lia, J. 2017. Thermotolerance capacities of native and exotic coastal plants will lead to changes in species composition under increased heat waves. Conserv. Physiol. 5(1): cox029. doi: 10.1093/conphys/cox029.

Gallas, G., and Waters, E.R. 2015. Boechera species exhibit species-specific responses to combined heat and high light stress. Plos One, 10(6): e0129041. doi: 10.1371/journal.pone.0129041.

23 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 24 of 37

Halter, G., Simonetti, N., Suguitan, C., Helm, K., Soroksky, J., and Waters, E.R. 2017. Patterns of thermotolerance, chlorophyll fluorescence, and heat shock gene expression vary among four Boechera species and Arabidopsis thaliana. Botany, 95(1): 9-27. doi: 10.1139/cjb- 2016-0158.

Hanson, D.T., Swanson, S., Graham, L.E., and Sharkey, T.D. 1999. Evolutionary significance of isoprene emission from mosses. Am. J. Bot. 86(5): 634-639.

Haslbeck, M., and Buchner, J. 2002. function of sHsps. In Small Stress Proteins. Progress in Molecular and Subcellular Biology, vol 28. Edited by A.P. Arrigo and W.E.G. Müller. Springer, Berlin, Heidelberg. pp. 37-59.

Haslbeck, M., Franzmann, T., Weinfurtner, D., and Buchner, J. 2005. Some like it hot: the structure and function of small heat-shock proteins. Nat. Struct. Mol. Biol. 12(10): 842-846.

Hendrick, J.P., and Hartl, F.-U. 1993. Molecular Chaperone Functions of Heat-Shock Proteins. Annu. Rev. Biochem. 62: 349-384.

Hilton, G., Lioe, H., Stengel, F., Baldwin, A., and JL, B. 2013. Small heat-shock proteins: paramedics of the cell. In Molecular Chaperones.Draft Topics in Current Chemistry, vol. 328. Edited by S. Jackson. Springer, Berlin, Heidelberg. pp. 69-98. doi: 10.1007/128_2012_324.

Howell, S.H. 2013. Endoplasmic reticulum stress responses in plants. Annu. Rev. Plant Biol. 64: 477-499. doi: 10.1146/annurev-arplant-050312-120053.

Ido, K., Ifuku, K., Yamamoto, Y., Ishihara, S., Murakami, A., Takabe, K., Miyake, C., and Sato, F. 2009. Knockdown of the PsbP protein does not prevent assembly of the dimeric PSII core complex but impairs accumulation of photosystem II supercomplexes in tobacco. Bba- Bioenergetics, 1787(7): 873-881. doi: 10.1016/j.bbabio.2009.03.004.

Iwata, Y., and Koizumi, N. 2012. Plant transducers of the endoplasmic reticulum unfolded protein response. Trends Plant Sci. 17(12): 720-727. doi: DOI 10.1016/j.tplants.2012.06.014.

Iwata, Y., Nishino, T., Takayama, S., and Koizumi, N. 2010. Characterization of a plant- specific gene induced by endoplasmic reticulum stress in Arabidopsis thaliana. Biosci. Biotechnol. Biochem. 74(10): 2087-2091. doi: 10.1271/bbb.100487.

Jiang, H., Lei, R., Ding, S.W., and Zhu, S. 2014. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics, 15: 182. doi: 10.1186/1471-2105-15-182.

Jing, Y., Lang, S., Wang, D., Xue, H., and Wang, X.F. 2018. Functional characterization of galactinol synthase and raffinose synthase in desiccation tolerance acquisition in

24 https://mc06.manuscriptcentral.com/botany-pubs Page 25 of 37 Botany

developing Arabidopsis seeds. J. Plant Physiol. 230: 109-121. doi: 10.1016/j.jplph.2018.10.011.

Jones, P., Binns, D., Chang, H.Y., Fraser, M., Li, W., McAnulla, C., et al. 2014. InterProScan 5: genome-scale protein function classification. Bioinformatics, 30(9): 1236-1240. doi: 10.1093/bioinformatics/btu031.

Kannan, S., Halter, G., Renner, T., and Waters, E.R. 2018. Patterns of alternative splicing vary between species during heat stress. Aob Plants, 10(2). doi: ARTN ply013 10.1093/aobpla/ply013.

Kim, Y.E., Hipp, M.S., Bracher, A., Hayer-Hartl, M., and Hartl, F.U. 2013. Molecular chaperone functions in and proteostasis. Annu. Rev. Biochem. 82: 323-355. doi: 10.1146/annurev-biochem-060208-092442.

Kotak, S., Larkindale, J., Lee, U., von Koskull-Doring, P., Vierling, E., and Scharf, K.D. 2007. Complexity of the heat stress response in plants. Curr. Opin. Plant Biol. 10(3): 310-316. doi: S1369-5266(07)00046-5 [pii]10.1016/j.pbi.2007.04.011.

Larkindale, J., Hall, J.D., Knight, M.R., and Vierling, E. 2005. Heat stress phenotypes of Arabidopsis mutants implicate multipleDraft signaling pathways in the acquisition of thermotolerance. Plant Physiol. 138(2): 882-897.

Larkindale, J., and Vierling, E. 2008. Core genome responses involved in acclimation to high temperature. Plant Physiol. 146(2): 748-761. doi: 10.1104/pp.107.112060.

Lemaire, K., Moura, R.F., Granvik, M., Igoillo-Esteve, M., Hohmeier, H.E., Hendrickx, N., et al. 2011. Ubiquitin fold modifier 1 (UFM1) and its target UFBP1 protect pancreatic beta cells from ER stress-induced apoptosis. Plos One, 6(4): e18517. doi: 10.1371/journal.pone.0018517.

Liu, J.X., and Howell, S.H. 2016. Managing the protein folding demands in the endoplasmic reticulum of plants. New Phytol. 211(2): 418-428. doi: 10.1111/nph.13915.

Lohse, M., Nagel, A., Herter, T., May, P., Schroda, M., Zrenner, R., et al. 2014. Mercator: a fast and simple web server for genome scale functional annotation of plant sequence data. Plant Cell Environ. 37(5): 1250-1258. doi: 10.1111/pce.12231.

Mittler, R., Finka, A., and Goloubinoff, P. 2012. How do plants feel the heat? Trends Biochem. Sci. 37(3): 118-125. doi: 10.1016/J.Tibs.2011.11.007.

Myhre, S., Tveit, H., Mollestad, T., and Laegreid, A. 2006. Additional gene ontology structure for improved biological reasoning. Bioinformatics, 22(16): 2020-2027. doi: 10.1093/bioinformatics/btl334.

25 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 26 of 37

Nickelsen, J., and Rengstl, B. 2013. Photosystem II assembly: from cyanobacteria to plants. Annu. Rev. Plant Biol. 64: 609-635. doi: 10.1146/annurev-arplant-050312-120124.

Nikolayeva, O., and Robinson, M.D. 2014. edgeR for differential RNA-seq and ChIP-seq analysis: an application to stem cell biology. Methods Mol. Biol. 1150: 45-79. doi: 10.1007/978-1-4939-0512-6_3.

Ohama, N., Sato, H., Shinozaki, K., and Yamaguchi-Shinozaki, K. 2017. Transcriptional Regulatory Network of Plant Heat Stress Response. Trends Plant Sci. 22(1): 53-65. doi: 10.1016/j.tplants.2016.08.015.

Otegui, M.S. 2018. Vacuolar degradation of chloroplast components: autophagy and beyond. J. Exp. Bot. 69(4): 741-750. doi: 10.1093/jxb/erx234.

Oshlack, A., Robinson, M.D., and Young, M.D. 2010. From RNA-seq reads to differential expression results. Genome Biol. 11(12): 220. doi: 10.1186/gb-2010-11-12-220.

Ozgur, R., Turkan, I., Uzilday, B., and Sekmen, A.H. 2014. Endoplasmic reticulum stress triggers ROS signalling, changes the redox state, and regulates the antioxidant defence of Arabidopsis thaliana. J. Exp. Bot. 65(5):Draft 1377-1390. doi: 10.1093/jxb/eru034. Parra, G., Bradnam, K., and Korf, I. 2007. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics, 23(9): 1061-1067. doi: 10.1093/bioinformatics/btm071.

Peng, L., Wan, X., Huang, K., Pei, L., Xiong, J., Li, X., and Wang, J. 2019. AtPUB48 E3 ligase plays a crucial role in the thermotolerance of Arabidopsis. Biochem. Biophys. Res. Commun. 509(1): 281-286. doi: 10.1016/j.bbrc.2018.12.123.

Queitsch, C., Hong, S.W., Vierling, E., and Lindquist, S. 2000. Heat shock protein 101 plays a crucial role in thermotolernace in Arabidopsis. The Plant Cell, 12: 479-492.

Reiner, A., Yekutieli, D., and Benjamini, Y. 2003. Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics, 19(3): 368-375.

Robinson, M.D., McCarthy, D.J., and Smyth, G.K. 2010. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 26(1): 139- 140. doi: 10.1093/bioinformatics/btp616.

Roose, J.L., Frankel, L.K., Mummadisetti, M.P., and Bricker, T.M. 2016. The extrinsic proteins of photosystem II: update. Planta, 243(4): 889-908. doi: 10.1007/s00425-015-2462-6.

Saidi, Y., Finka, A., and Goloubinoff, P. 2011. Heat perception and signalling in plants: a tortuous path to thermotolerance. New Phytol. 190(3): 556-565. doi: Doi 10.1111/J.1469- 8137.2010.03571.X.

26 https://mc06.manuscriptcentral.com/botany-pubs Page 27 of 37 Botany

Salvucci, M.E., Osteryoung, K.W., Crafts-Brandner, S.J., and Vierling, E. 2001. Exceptional sensitivity of Rubisco activase to thermal denaturation in vitro and in vivo. Plant Physiol. 127(3): 1053-1064.

Scharf, K.D., Berberich, T., Ebersberger, I., and Nover, L. 2012. The plant heat stress transcription factor (Hsf) family: Structure, function and evolution. Biochim. Biophys. Acta, Gene Regul. Mech. 1819(2): 104-119. doi: Doi 10.1016/J.Bbagrm.2011.10.002.

Scheepens, J.F., Deng, Y., and Bossdorf, O. 2018. Phenotypic plasticity in response to temperature fluctuations is genetically variable, and relates to climatic variability of origin, in Arabidopsis thaliana. Aob Plants, 10(4). doi: ARTN ply04310.1093/aobpla/ply043.

Sengupta, S., Mukherjee, S., Basak, P., and Majumder, A.L. 2015. Significance of galactinol and raffinose family oligosaccharide synthesis in plants. Front. Plant Sci. 6: 656. doi: 10.3389/fpls.2015.00656.

Sharkey, T.D. 2000. Plant biology - Some like it hot. Science, 287(5452): 435-+. doi: Doi 10.1126/Science.287.5452.435.

Sharkey, T.D. 2005. Effects of moderate heat stress on photosynthesis: Importance of thylakoid reactions, rubisco deactivation,Draft reactive oxygen species, and thermotolerance provided by isoprene. Plant Cell Environ. 28: 269-277.

Sharkey, T.D., and Zhang, R. 2010. High Temperature Effects on Electron and Proton Circuits of Photosynthesis. J. Integr. Plant Biol. 52(8): 712-722.

Stocker, T.F., Qin, D., Plattner, G.-K., Tignor, M.M.B., Allen, S.K., Boschung, J., Nauels, A., Xia, Y., Bex, V., and Midgley, P.M. (Editors) 2013. Climate Change 2013: The Physical Science Basis. Cambrige University Press, Cambridge, UK.

Tatsumi, K., Sou, Y.S., Tada, N., Nakamura, E., Iemura, S., Natsume, T., et al. 2010. A novel type of E3 ligase for the Ufm1 conjugation system. J. Biol. Chem. 285(8): 5417-5427. doi: 10.1074/jbc.M109.036814.

Vierling, E. 1991. The heat shock response in plants. Annu. Rev. Plant Physiol. Plant Mol. Biol. 42: 579-620.

Vierstra, R.D. 2012. The expanding universe of ubiquitin and ubiquitin-like modifiers. Plant Physiol. 160(1): 2-14. doi: 10.1104/pp.112.200667.

Vitale, A., and Boston, R.S. 2008. Endoplasmic reticulum quality control and the unfolded protein response: insights from plants. Traffic, 9(10): 1581-1588. doi: 10.1111/j.1600- 0854.2008.00780.x.

Waters, E.R. 2013. The evolution, function, structure, and expression of the plant sHSPs. J. Exp. Bot. 64(2): 391-403. doi: 10.1093/jxb/ers355.

27 https://mc06.manuscriptcentral.com/botany-pubs Botany Page 28 of 37

Waters, E.R., Lee, G., and Vierling, E. 1996. Evolution, structure and function of the small heat shock proteins in plants. J. Exp. Bot. 47: 325-338.

Woodson, J.D. 2016. Chloroplast quality control - balancing energy production and stress. New Phytol. 212(1): 36-41. doi: 10.1111/nph.14134.

Woodson, J.D. 2019. Chloroplast stress signals: regulation of cellular degradation and chloroplast turnover. Curr. Opin. Plant Biol. 52: 30-37. doi: 10.1016/j.pbi.2019.06.005.

Woodson, J.D., Joens, M.S., Sinson, A.B., Gilkerson, J., Salome, P.A., Weigel, D., Fitzpatrick, J.A., and Chory, J. 2015. Ubiquitin facilitates a quality-control pathway that removes damaged chloroplasts. Science, 350(6259): 450-454. doi: 10.1126/science.aac7444.

Xu, H.-F., Dai, G.-Z., Ye, D.-M., Shang, J.-L., Song, W.-Y., Shi, H., and Qiu, B.-S. 2020. Dehydration-Induced DNAK2 Chaperone is Involved in PSII Repair of a Dessication- Tolerant Cyanobacterium. Plant Physiol. 182: 1991-2005.

Yang, X., Srivastava, R., Howell, S.H., and Bassham, D.C. 2016. Activation of autophagy by unfolded proteins during endoplasmic reticulum stress. Plant J. 85(1): 83-95. doi: 10.1111/tpj.13091. Draft

Yeh, C.H., Kaplinsky, N.J., Hu, C., and Charng, Y.Y. 2012. Some like it hot, some like it warm: Phenotyping to explore thermotolerance diversity. Plant Sci. 195: 10-23. doi: Doi 10.1016/J.Plantsci.2012.06.004.

Yi, X.P., Hargett, S.R., Liu, H.J., Frankel, L.K., and Bricker, T.M. 2007. The PsbP protein is required for photosystem II complex assembly/stability and photoautotrophy in Arabidopsis thaliana. J. Biol. Chem. 282(34): 24833-24841. doi: DOI 10.1074/jbc.M705011200.

Zhang, R., and Sharkey, T.D. 2009. Photosynthetic electron transport and proton flux under moderate heat stress. Photosynth. Res. 100(1): 29-43.

Zhang, S., Wang, S., Lv, J., Liu, Z., Wang, Y., Ma, N., and Meng, Q. 2018. SUMO E3 Ligase SlSIZ1 Facilitates Heat Tolerance in Tomato. Plant Cell Physiol. 59(1): 58-71. doi: 10.1093/pcp/pcx160.

28 https://mc06.manuscriptcentral.com/botany-pubs Page 29 of 37 Botany

Table 1. Summary statistics of B. depauperata transcriptome de novo assembly

Parameter Trinity assembly

Input reads 94,662,505

Mapped reads 73,734,639 Reads not mapped 20,929,437Draft Number of contigs 82,733

Contig N50 (base pairs) 1,548

Contig average length 974 (base pairs)

Total assembled bases 80,628,428

Percent GC 43.6

https://mc06.manuscriptcentral.com/botany-pubs Botany Page 30 of 37

Table 2. Core Eukaryotic Genes in B. depauperata transcriptome. CEG Groups Genes recovered Percent recovered

Group 1 (n=661) 65 97%

Group 2 (n=56*) 52 98%

Group 3 (n=61*) 60 93%

Group 4 (n=65*) 64 98%

Total (n=2482) 241 97%

1The total number of Core Eukaryotic Genes (CEGs)Draft in each group. 2All 248 CEGs are characterized as house-keeping gene families and are expected to be present in all eukaryotic genomes.

_

https://mc06.manuscriptcentral.com/botany-pubs Page 31 of 37 Botany

Table 3. Genes with highest change in expression after a Basal Heat Stress (BHS).

Arabidopsis thaliana Boechera depauperata Gene Annotation TAIR ID Fold Change Gene ID Fold Change Gene Annotation AT3G46230 16.23 heat shock protein 17.4 c33426_g1_i5 53.11 galactinol synthase 1 AT4G10250 16.05 HSP20-like chaperones superfamily protein c35686_g1_i8 47.24 e3 ufm1-protein ligase 1 homolog AT1G52560 15.62 HSP20-like chaperones superfamily protein c30323_g1_i3 35.82 no known homolog AT4G27670 15.59 heat shock protein 21 c26432_g1_i8 34.95 no known homolog AT5G12030 15.38 heat shock protein 17.6A c35686_g1_i7 34.25 e3 ufm1-protein ligase 1 homolog AT1G59860 15.38 HSP20-like chaperones superfamily protein c34550_g3_i1 31.91 developmentally-regulated g-protein 2 AT5G59720 15.32 heat shock protein 18.2 c32523_g1_i1 29.13 -binding protein 1-like AT2G29500 15.12 HSP20-like chaperones superfamily protein c26432_g1_i9 27.56 no known homolog AT5G48570 14.82 FKBP-type peptidyl-prolyl cis-trans isomerase family protein c33010_g1_i1 27.42 no known homolog AT5G25450 14.78 Cytochrome ubiquinol oxidase c29978_g1_i2 27.00 l-ascorbate peroxidase cytosolic-like AT1G07400 14.78 HSP20-like chaperones superfamily proteinDraftc29978_g1_i11 25.44 l-ascorbate peroxidase cytosolic-like AT1G71000 14.76 Chaperone DnaJ-domain superfamily protein c31317_g1_i7 24.88 hsp70- organizing protein 3-like AT1G53540 14.75 HSP20-like chaperones superfamily protein c22536_g1_i2 24.49 uncharacterized protein LOC104722666 AT1G17870 14.69 ethylene-dependent gravitropism-deficient yellow-green-like 3 c34173_g4_i2 24.49 heat shock protein 83-like AT1G16030 14.65 heat shock protein 70B c32384_g1_i13 24.20 pentatricopeptide repeat-containing protein AT5G54165 14.60 no known homolog c26432_g1_i6 22.90 no known homolog AT3G24500 14.56 multiprotein bridging factor 1C c34777_g1_i4 22.38 zinc finger protein zpr1-like AT1G72660 14.52 P-loop containing nucleoside triphosphate hydrolases c30928_g1_i4 22.33 no known homolog AT3G12580 14.52 heat shock protein 70 c32515_g1_i10 22.30 ultraviolet-b receptor uvr8-like AT2G20560 14.52 DNAJ heat shock family protein c32962_g1_i5 21.95 atp-dependent zinc metalloprotease ftsh chloroplastic AT5G37670 14.52 HSP20-like chaperones superfamily protein c31317_g1_i1 20.59 hsp70-hsp90 organizing protein 3 AT1G54050 14.49 HSP20-like chaperones superfamily protein c26737_g1_i7 20.25 synaptotagmin-4-like isoform x1 AT5G51440 14.47 HSP20-like chaperones superfamily protein c26050_g1_i2 20.07 inactive rhomboid protein 1-like AT1G74310 14.46 heat shock protein 101 c30491_g1_i2 19.81 no known homolog AT4G25200 14.46 mitochondrion-localized small heat shock protein 23.6 c29245_g1_i2 19.59 protein tunicamycin induced 1 AT2G32120 14.45 heat-shock protein 70T-2 c30904_g5_i3 19.14 sHSP class I heat shock AT5G12020 14.37 17.6 kDa class II heat shock protein c24839_g1_i1 19.12 sHSP heat shock protein AT2G47180 13.92 galactinol synthase 1 c34777_g1_i3 18.99 zinc finger protein zpr1-like

https://mc06.manuscriptcentral.com/botany-pubs Botany Page 32 of 37

AT4G12400 13.89 stress-inducible protein c34655_g1_i1 18.83 peptidyl-prolyl cis-trans isomerase fkbp65 AT2G46240 13.87 BCL-2-associated-6 c34843_g1_i3 18.75 chaperone protein 1 AT5G07330 13.82 no known homolog c32962_g1_i8 18.48 atp-dependent zinc metalloprotease ftsh chloroplastic-like AT2G26150 13.75 heat shock transcription factor A2 c26050_g1_i3 18.45 inactive rhomboid protein 1-like AT5G52640 13.50 heat shock protein 90.1 c26572_g1_i2 18.18 cytochrome b-c1 complex subunit 7-2 AT5G12110 13.48 Glutathione S-transferase c32505_g1_i9 18.17 calcyclin-binding AT1G21550 13.40 Calcium-binding EF-hand family protein c34173_g4_i1 18.12 heat shock protein 83-like AT1G03070 13.01 Bax inhibitor-1 family protein c32505_g1_i2 17.86 calcyclin-binding AT3G10020 12.58 no known homolog c28228_g1_i1 17.80 small heat shock chloroplastic-like AT3G09350 12.56 Fes1A c22147_g1_i1 17.42 sHSP kda heat shock peroxisomal AT1G30070 12.50 SGS domain-containing protein c27663_g1_i1 17.41 heat stress transcription factor a-7a-like AT5G01180 12.38 peptide transporter -5 c35686_g1_i4 17.26 e3 ufm1-protein ligase 1 homolog AT5G47830 12.34 no known homolog c22392_g1_i2 17.10 no known homolog AT5G09590 12.26 mitochondrial HSP70-2 c30904_g5_i2 17.09 sHSP class I heat shock AT4G23493 12.21 no known homolog Draftc9792_g1_i1 17.08 sHSP class I heat shock protein AT5G10695 12.15 no known homolog c35686_g1_i3 17.03 e3 ufm1-protein ligase 1 homolog AT3G08970 11.96 DNAJ heat shock N-terminal domain-containing protein c33819_g1_i3 16.99 ubiquitin-associated translation elongation factor ef1b protein AT2G19310 11.93 HSP20-like chaperones superfamily protein c29978_g1_i6 16.89 l-ascorbate peroxidase cytosolic AT2G37180 11.78 Aquaporin-like superfamily protein c30904_g5_i1 16.87 class I heat shock AT3G53230 11.67 ATPase c21290_g1_i1 16.77 pentatricopeptide repeat-containing protein AT1G67360 11.60 Rubber elongation factor protein (REF) c30904_g5_i4 16.65 sHSP class I heat shock AT3G28210 11.43 zinc finger (AN1-like) family protein c25637_g1_i1 16.63 sHSP heat shock mitochondrial

A Benjamini-Hochberg false discovery rate (Reiner, Yekutieli et al. 2003) (FDR) p-value < 0.05 was chosen as the threshold to identify differential expression. Genes in bold are members of the well-studied heat shock protein families.

https://mc06.manuscriptcentral.com/botany-pubs Page 33 of 37 Botany

Table 4. Genes with highest change in expression after an Acquired Heat Stress (AHS).

Arabidopsis thaliana Boechera depauperata Fold Gene Annotation Fold TAIR ID Gene ID Gene Annotation Change Change AT1G56600 23.00 galactinol synthase 2 c35942_g1_i4 200.42 retrovirus-related pol polyprotein AT5G07330 22.22 no known homolog c19645_g1_i2 164.85 cation h(+) antiporter 18-like AT5G52630 18.56 mitochondrial RNAediting factor 1 c33426_g1_i7 146.75 galactinol synthase 1 AT1G71000 18.04 Chaperone DnaJ-domain superfamily protein c35942_g1_i2 130.87 no known homolog AT5G05410 17.66 DRE-binding protein 2A c29988_g4_i1 119.85 uncharacterized protein AT4G37990 17.42 elicitor-activated gene 3-2 c31317_g1_i7 119.64 hsp70-hsp90 organizing protein 3-like AT5G37670 17.02 HSP20-like chaperones superfamily protein c30471_g1_i4 119.14 probable glucose-1-phosphate AT1G52560 16.22 HSP20-like chaperones superfamily protein c29455_g1_i1 111.98 heat shock 70 kda protein 8 AT4G27657 16.02 no known homolog c34655_g1_i4 99.92 peptidyl-prolyl cis-trans isomerase fkbp62-like AT5G15960 15.32 stress-responsive protein (KIN1) / stress-induced protein (KIN1) c19645_g1_i1 94.80 cation h(+) antiporter 18-like AT5G59720 14.05 heat shock protein 18.2 Draftc33410_g1_i2 94.37 uncharacterized vacuolar membrane protein yml018c AT5G12030 13.87 heat shock protein 17.6A c33426_g1_i1 94.32 galactinol synthase 1-like AT2G29500 13.75 HSP20-like chaperones superfamily protein c31166_g1_i4 89.05 no known homolog AT2G22240 13.46 myo-inositol-1-phosphate synthase 2 c27782_g2_i1 86.42 gdsl esterase lipase 3-like isoform x2 AT5G47610 13.03 RING/U-box superfamily protein c34655_g1_i9 85.45 peptidyl-prolyl cis-trans isomerase fkbp62-like AT3G28550 12.63 Proline-rich extensin-like family protein c34173_g4_i2 79.98 heat shock protein 83-like AT1G33350 12.59 Pentatricopeptide repeat (PPR) superfamily protein c24401_g1_i1 77.69 no known homolog AT2G20560 12.38 DNAJ heat shock family protein c33496_g1_i6 74.28 probable zinc metallopeptidase chloroplastic AT5G03210 12.06 no known homolog c28380_g1_i1 71.35 no known homolog AT5G62020 11.86 heat shock transcription factor B2A c32523_g1_i1 70.90 hsp70-binding protein 1-like AT2G46240 11.54 BCL-2-associated-6 c34655_g1_i1 66.95 peptidyl-prolyl cis-trans isomerase fkbp65 AT5G43170 11.41 zinc-finger protein 3 c26211_g1_i1 66.79 PREDICTED: uncharacterized protein LOC104768939 AT3G54580 11.29 Proline-rich extensin-like family protein c28278_g1_i1 62.99 PREDICTED: uncharacterized protein LOC104728675 AT5G59820 11.24 C2H2-type zinc finger family protein c33426_g1_i5 62.87 galactinol synthase 1 AT5G15250 11.12 FTSH protease 6 c34274_g2_i5 62.01 atp-dependent 6-phosphofructokinase 7-like AT3G03270 11.11 Adenine nucleotide alpha hydrolases-like superfamily protein c40391_g1_i1 61.76 no known homolog AT4G36990 10.97 heat shock factor 4 c35686_g1_i7 58.95 e3 ufm1-protein ligase 1 homolog AT1G53540 10.94 HSP20-like chaperones superfamily protein c34655_g1_i11 58.76 peptidyl-prolyl cis-trans isomerase fkbp65

https://mc06.manuscriptcentral.com/botany-pubs Botany Page 34 of 37

AT5G35320 10.82 no known homolog c22000_g1_i1 56.55 no known homolog AT5G54165 10.78 no known homolog c22682_g2_i2 55.38 no known homolog AT3G50970 10.66 dehydrin family protein c13684_g1_i1 55.12 transposon tf2-1 polyprotein isoform x1 AT4G27670 10.64 heat shock protein 21 c34334_g1_i5 53.40 heat shock cognate protein 80-like AT4G36988 10.26 conserved peptide upstream open reading frame 49 c29978_g1_i2 53.33 l-ascorbate peroxidase cytosolic-like AT4G10250 10.17 HSP20-like chaperones superfamily protein c31317_g1_i1 50.88 hsp70-hsp90 organizing protein 3 AT3G12580 9.93 heat shock protein 70 c32384_g1_i13 50.77 pentatricopeptide repeat-containing protein mitochondrial AT3G46230 9.92 heat shock protein 17.4 c35340_g2_i1 50.10 protein sgt1 homolog a-like AT3G15340 9.76 proton pump interactor 2 c34655_g1_i7 48.14 70 kda peptidyl-prolyl isomerase-like AT3G24500 9.62 multiprotein bridging factor 1C c34655_g1_i2 47.51 peptidyl-prolyl cis-trans isomerase fkbp65 AT1G54050 9.56 HSP20-like chaperones superfamily protein c19310_g1_i1 47.04 no known homolog AT1G59860 9.54 HSP20-like chaperones superfamily protein c32962_g1_i6 46.16 vacuolar amino acid transporter 1-like AT1G16030 9.53 heat shock protein 70B c32505_g1_i2 45.69 calcyclin-binding AT5G10695 9.45 no known homolog c32670_g1_i1 45.31 dehydration-responsive element-binding protein 2a-like AT1G07400 9.43 HSP20-like chaperones superfamily proteinDraftc26327_g1_i2 45.27 f1f0-atpase inhibitor protein AT4G25200 9.38 mitochondrion-localized small heat shock protein 23.6 c31166_g1_i3 45.17 no known homolog AT1G74310 9.28 heat shock protein 101 c18072_g1_i1 44.82 mediator of rna polymerase ii transcription subunit 37c AT3G41762 9.26 no known homolog c30973_g2_i1 44.79 protein polar localization during asymmetric division AT4G12400 9.13 stress-inducible protein c35956_g1_i3 44.64 no known homolog AT1G61340 9.13 F-box family protein c23392_g1_i2 44.63 phd finger-like domain-containing protein 5b AT3G10020 9.11 no known homolog c30578_g1_i1 44.11 no known homolog AT1G20440 9.08 cold-regulated 47 c24026_g1_i2 44.10 no known homolog

A Benjamini-Hochberg false discovery rate (Reiner, Yekutieli et al. 2003) (FDR) p-value < 0.05 was chosen as the threshold to identify differential expression. Genes in bold are members of the well-studied heat shock protein families.

https://mc06.manuscriptcentral.com/botany-pubs Page 35 of 37 Botany

Draft

Figure 1. Overlap genes that are differentially expressed under basal and acquired heat-stress. Values are shown for A. Arabidopsis thaliana (A.t) and B. Boechera depauperata (B.d). Basal Heat-stress Up-regulated Genes (BHS-U). Basal Heat-stress Down-regulated Genes (BHS-D). Acquired Heat-stress Up-regulated Genes (AHS-U). Acquired Heat-stress Down-regulated Genes (AHS-D).

215x279mm (220 x 220 DPI)

https://mc06.manuscriptcentral.com/botany-pubs Botany Page 36 of 37

Figure 2. Impact of basal heat-stress on the expression of genes in photosynthetic Pathways. Each box represents a differentially expressed genes. Boxes shaded blue are upregulated genes, red are downregulated. The relative intensity of the colors indicates the intensity of differential expression compared to control. Plotted values are EDGE test adjusted fold changes, the scale is in the top left corner. A. Arabidopsis thaliana. B. Boechera depauperata.

272x99mm (600 x 600 DPI) Draft

https://mc06.manuscriptcentral.com/botany-pubs Page 37 of 37 Botany

Figure 3. Impact of acquired heat-stress on the expression of genes in photosynthetic pathways. Each box represents a differentially expressed genes. Boxes shaded blue are upregulated genes, red are downregulated. The relative intensity of the colors indicates the intensity of differential expression compared to control. Plotted values are EDGE test adjusted fold changes, the scale is in the top left corner. A. Arabidopsis thaliana. B. Boechera depauperata.

246x88mm (600 x 600 DPI) Draft

https://mc06.manuscriptcentral.com/botany-pubs