Identification and Characterization of and

Involved in Arabinogalactan-Protein Glycosylation

A dissertation presented to

the faculty of

the College of Arts and Sciences of Ohio University

In partial fulfillment

of the requirements for the degree

Doctor of Philosophy

Yan Liang

August 2012

© 2012 Yan Liang. All Rights Reserved.

2

This dissertation titled

Identification and Characterization of Galactosyltransferases and Fucosyltransferases

Involved in Arabinogalactan-Protein Glycosylation

by

YAN LIANG

has been approved for

the Department of Environmental and Plant Biology

and the College of Arts and Sciences by

Allan M. Showalter

Professor of Environmental and Plant Biology

Howard Dewald

Interim Dean, College of Arts and Sciences

3

ABSTRACT

LIANG, YAN, Ph.D., August 2012, Molecular and Cellular Biology

Identification and Characterization of Galactosyltransferases and Fucosyltransferases

Involved in Arabinogalactan-Protein Glycosylation

Director of Dissertation: (Allan M. Showalter)

Arabinogalactan-proteins (AGPs) are highly glycosylated hydroxyproline-rich (HRGPs) that are frequently characterized by the presence of the repetitive dipeptide motifs [Ala-Hyp] or [Hyp-Ala]. Hydroxyproline (Hyp) residues in the dipeptide motifs are sites for the attachment of arabinogalactan (AG) side chains. Fucose residues are found in some dicot AGPs, and AGP fucosylation is developmentally regulated. AGP (GalT) and (FUT) activities were investigated in three ways in this study. First, an in vitro AGP GalT assay was developed, which used permeabilized microsomal membranes from tobacco (Nicotiana tobacum) or

Arabidopsis (Arabidopsis thaliana) suspension-cultured cells as the source and

14 UDP-[ C] Gal as the sugar donor. Two model AGP peptides, [Ala-Hyp]7 or [AO]7 and

deglycosylated [Ala-Hyp]51 or d[AO]51, were used as substrate acceptors. Product

analysis indicated that the [AO]7:GalT assay and the d[AO]51:GalT assay mainly detected

GalT activities that added the first and the second Gal residues in the Hyp-AG side chain.

Examination of the Hyp:GalT activity using various acceptor substrates, including two extensin sequences and a [AP]7 peptide, indicated this activity was specific for Hyp

dipeptide motifs in AGPs. The [AO]7:GalT and d[AO]51:GalT activities were localized to

the endomembrane system of Arabidopsis suspension cultured cells following sucrose

4

density gradient centrifugation. Second, five candidate AGP GALT genes (GALT1:

At1g26810; GALT3: At3g06440; GALT4: At1g27120; GALT5: At1g74800; GALT6:

At5g62620) were expressed in Pichia pastoris and tobacco suspension-cultured cells and tested for AGP GalT activity using the [AO]7:GalT and d[AO]51:GalT assay systems.

Preliminary results indicate that heterologously expressed GalT3, GalT5 and GalT6 have

AGP GalT activities. Furthermore, GalT3 was shown to be Golgi localized in a tobacco

leaf expression system. Third, functions of the FUT4 and FUT6 genes were investigated

using Arabidopsis , and fut4/fut6 mutant plants. Biochemical analysis indicated

that FUT4 was required for fucosylation of leaf AGPs, while both FUT4 and FUT6

contributed to fucosylation of root AGPs. In addition, glycome profiling indicated that

fucosylated AGPs may regulate intermolecular interactions between AGPs and other wall

components. Finally, a model of HRGP biosynthesis is proposed which highlights the

involved in this process.

Approved: ______

Allan M. Showalter

Professor of Environmental and Plant Biology

5

ACKNOWLEDGMENTS

First and foremost, I am thankful to my advisor, Dr. Allan Showalter, the one who provided guidance throughout my PhD study; the one who offered the research assistant fellowship to allow me to concentrate on my research; the one who encouraged and trusted me in trying out my own ideas in experiments. I especially thank Dr. Ahmed Faik, who shared his expertise in enzyme biochemistry, his ideas of experimental design and his lab equipment, which were indispensable for the progress of my project. I also thank

Dr. Marcia Kieliszewski, who is a critical thinker in science and at the same time impressed me with the feeling that doing science is an enjoyable process. I am grateful to

Dr. Sarah Wyatt, not only for the information she taught me, but also for her guidance and help in becoming socialized in the academic society.

I thank Dr. Wen-liang Xu for his contribution to the RNA analysis and HPAEC analysis in the FUT project and Ms. Alexandra Venetos and Ms. Rebecca Vondrell for their help in phenotypic analysis of the fut mutants. Mr. Xiao Liu and Mr. Wuda Wang helped with the bacterium colony screening in the expression construct cloning of GALT genes. Dr. Wei Zeng tutored me in the techniques of microsomal protein extraction, making sucrose gradient density, HPAEC and MS analysis and Pichia protein expression.

Dr. Li Tan provided the d[AO]51 peptide substrate and helped to solve the technical problems with the RP-HPLC system. Ms. Debarati Basu helped with the GC/MS analysis and shared her experience in Pichia protein expression and activity tests. Dr. Michael

Held helped to set up the tobacco leaf expression system and advised on the use of the confocal microscope. Dr. Betsy Briju, Ms. Laura Cristea and Mr. Vijayanand Nadella

6

provided suggestions in tobacco BY2 transformation. Mr. Yuning Chen and Mr. Dening

Ye helped me with the liquid chromatography techniques. Dr. Glen Jackson and Dr. Hao

Chen assisted with the ESI-MS/MS analysis. Dr. Jie Yang and Dr. Yizhu Zhang provided many helpful suggestions for both my research and my PhD study. The administrative and coordination work done by Ms. Connie Pollard, Ms. Martha Bishop and Dr. Melanie

Schori was fantastic. Also, the previous and current members of the Showalter lab and

Faik lab, Dr. Harjinder Sardar, Ms. Mohor Chatterjee, Mr. Nan Jiang, Mr. Brian Keppler,

Mr. Richard Wiemel and Ms. Rebekah Whitley were such supportive and nice people to be around. Without the help and kindness from all the above people, my research would not have gone as smoothly and my study at OU would not have been so enjoyable.

I thank the Department of Environmental and Plant Biology and the Molecular and

Cellular Program for providing teaching assistant scholarships and funding for me to attend national meetings.

Last but not least, I am indebted to my family and friends, who walked me through difficult times and provide me with endless love and support.

7

TABLE OF CONTENTS

Page

Abstract ………………………………………………………………………………….3

Acknowledgments...... 5

List of Tables ……………………………………………………………………………14

List of Figures ……………………………………………………………………………16

List of Abbreviations ...... 20

Chapter 1: Introduction ...... 23

1.1. Plant Cell Walls ...... 23

1.1.1. Biological and economic importance ...... 23

1.1.2. Structural model and compositions ...... 24

1.1.3. Biosynthesis of cell wall polysaccharides ...... 31

1.2. Hydroxyproline Rich-Glycoproteins (HRGPs) ...... 42

1.2.1. Classification ...... 42

1.2.2. Three major HRGP subfamilies: PRPs, EXTs and AGPs ...... 44

1.2.3. HRGP Glycosylation ...... 47

1.3. Arabinogalactan-Proteins (AGPs) ...... 52

1.3.1. Classification ...... 52

8

1.3.2. Post-translational modifications ...... 53

1.3.3. Biological functions ...... 60

1.4. Specific Aims of the Research ...... 62

Chapter 2: Identification and Characterization of in vitro Galactosyltransferase Activities

Involved in Arabinogalactan-Protein Glycosylation in Tobacco and

Arabidopsis ...... 64

2.1. Introduction ...... 64

2.2. Material and Methods ...... 68

2.2.1. Suspension culture of Arabidopsis cells ...... 68

2.2.2. Preparation of microsomal membranes from tobacco and Arabiodopsis

suspension cultured cells...... 68

2.2.3. Standard assay for galactosyltransferase (GalT) activities ...... 69

2.2.4. Analysis of the GalT assay products by reverse phase-high performance

liquid chromatography (RP-HPLC) ...... 70

2.2.5. Monosaccharide composition analysis of the GalT assay product ...... 70

2.2.6. Hyp [14C]Galactoside profile analysis ...... 71

2.2.7. Characterization of the Arabidopsis [AO]7:GalT activity ...... 72

2.2.8. Electrospray Ionization-Tandem Mass Spectrometry (ESI-MS/MS) of

the [AO]7:GalT assay product ...... 73

9

2.2.9. Continuous sucrose gradient centrifugation ...... 74

2.3. Results ...... 75

2.3.1 Development of a GalT assay system with AGP-like peptides as acceptor

substrates ...... 75

2.3.2. Characterization of the GalT assay products by Reverse-Phase High

Performance Liquid Chromatography (RP-HPLC) analysis ...... 76

2.3.3. Extent of AGP peptide galactosylation ...... 79

2.3.4. Enzymatic characteristics of the [AO]7:GalT activity in Arabidopsis

suspension-cultured cells ...... 83

2.3.5. Identification of the Arabidopsis [AO]7:GalT reaction product by

Electrospray Ionization-Tandem Mass Spectrometry (ESI-MS/MS)

analysis ...... 87

2.3.6. Subcellular localization of [AO]7:GalT and d[AO]51:GalT activities in

Arabidopsis suspension cultured cells ...... 89

2.4. Discussion ...... 91

Chapter 3: Heterologous Expression of Putative AGP Galactosyltransferases in Pichia

Pastoris and Tobacco BY2 Cells and Their Functional Characterization ...... 96

3.1. Introduction ...... 96

3.2. Material and Methods ...... 102

3.2.1. Obtaining full length cDNA clones ...... 102

10

3.2.2. Sequence analysis of the putative GalTs ...... 103

3.2.3. Expression construct cloning of GALT1, GALT3, GALT4, GALT5 and

GALT6 for expression in Pichia ...... 103

3.2.4. Pichia transformation ...... 104

3.2.5. Expression construct cloning of GALT1, GALT3, GALT4, GALT5 and

GALT6 for expression in tobacco BY2 suspension-cultured cells...... 105

3.2.6. Agrobacterium transformation ...... 106

3.2.7. Transformation of tobacco BY2 suspension-cultured cells ...... 106

3.2.8. Isolation of microsomal membranes from Pichia ...... 107

3.2.9. Preparation of microsomal membranes from tobacco BY2 cell lines .. 108

3.2.10. Western blotting analysis ...... 109

3.2.11. AGP GalT activity test ...... 110

3.2.12. Transient expression of GALT3-YELLOW FLUORESCENT PROTEIN

(YFP) in tobacco leaves ...... 110

3.3. Results ...... 112

3.3.1. Comparison of the GALT sequences of the corresponding cDNA clones

and the GALT sequences from the Arabidopsis Information Resource

(TAIR) database ...... 112

3.3.2. Expression construct cloning and yeast/tobacco BY2 cell transformation

...... 113

11

3.3.3. Protein expression analyses and AGP GalT activity tests of the

microsomal membranes isolated from Pichia strains transformed with

the PAOX:6xHis-GALT genes ...... 120

3.3.4. Protein expression analyses and AGP GalT activity tests of the

microsomal membranes isolated from tobacco BY2 cell lines

transformed with the Phsp:6xHis-GALT genes ...... 129

3.3.5. Protein expression analyses and AGP GalT activity tests of the

microsomal membranes isolated from tobacco BY2 cell lines

transformed with the P2x35S:6xHis-GALTs genes ...... 132

3.3.6. Subcellular localization of GalT3 in tobacco leaf epidermal cells ...... 135

3.4. Discussion ...... 137

Chapter 4: Biochemical and Physiological Characterization of fut4 and fut6 Mutants

Defective in Arabinogalactan-Protein Fucosylation in Arabidopsis ...... 142

4.1. Introduction ...... 142

4.2. Materials and Methods ...... 145

4.2.1. Plant materials and growth conditions ...... 145

4.2.2. Mutant confirmation by PCR and RT-PCR ...... 145

4.2.3. Phenotypic analysis ...... 146

4.2.4. Eel lectin staining ...... 146

4.2.5. Monosaccharide composition analysis by GC-MS ...... 147

12

4.2.6. Monosaccharide composition analysis by HPAEC ...... 148

4.2.7. Monoclonal antibodies ...... 148

4.2.8. Preparation of AIR (Alcohol Insoluble Residue) extracts and

fractionation ...... 148

4.2.9. Total sugar estimation and ELISA ...... 149

4.2.10. Expression analysis of FUT4 and FUT6 genes based on microarray data

from Genevestigator...... 150

4.3. Results ...... 150

4.3.1. Isolation of T-DNA insertion lines of fut4 and fut6 and generation of the

fut4/fut6 double mutant ...... 150

4.3.2. Phenotypic analysis of fut4, fut6 and fut4/fut6 mutants under

physiological growth conditions ...... 153

4.3.3. Eel lectin shows a different staining pattern in roots of fut6 and fut4/fut6

mutant plants compared to fut4 and wild type plants ...... 157

4.3.4. fut4/fut6 mutants are defective in AGP fucosylation in roots and leaves

...... 158

4.3.5. Glycome profiling of root and leaf cell walls of wild type, fut4, fut6 and

fut4/fut6 mutant plants ...... 161

4.4. Discussion ...... 165

Chapter 5: Conclusions ...... 176

13

References………………………………………………………………………………189

Appendix A: Sequences and Locations of Primers Used in This Work ...... 210

Appendix B: Partial Sequence of the Final Expression Constructs ...... 215

Appendix C: Expanded List of All Plant Cell Wall Glycan-Directed Monoclonal

Antibodies (MAbs) Used in This Study for Glycome Profiling ...... 235

14

LIST OF TABLES

Page

Table 1.1 Polysaccharide synthases and GTs involved in plant cell wall biosynthesis from

Arabidopsis thaliana and other plants.a ...... 33

14 Table 2.1 Incorporation of [ C] radiolabel into the [AO]7 acceptor substrate when

various nucleotide were used as the sugar donor...... 87

Table 3.1 Structural features of the 20 putative Arabidopsis β-(13)-GalTsa ...... 98

Table 3.2 Expression constructs and transgenic Pichia or tobacco BY2 cell lines obtained

in this research...... 114

Table 3.3 Summary of the signal bands observed in microsomal membrane proteins from

transgenic and control Pichia and tobacco BY2 cells in western blotting

analysis ...... 123

Table 3.4 Results of the tests for AGP GalT activities in microsomal membranes from

transgenic strains using the in vitro [AO]7:GalT and d[AO]51:GalT assay

systems...... 128

Table 3.5 AGP GalT activities of the upper membrane layer from wild type BY2 cells

and BY2 lines transformed with the Phsp:6xHis-GALT or the P2x35S:6xHis-

GALT genes...... 132

Table 4.1 Flowering time and silique numbers of wild type, fut4, fut6 and fut4/fut6

mutant plants grown in soil under physiological growth conditions ...... 155

Table 4.2 Neutral monosaccharide content of purified AGPs from roots and leaves of

wild type, fut4, fut6 and fut4/fut6 mutant plants...... 160

15

Table 4.3 Expression analysis of the FUT4 and FUT6 genes in response to biotic and

abiotic stimuli based on microarray data from Genevestigator

(https://www.genevestigator.com/gv/)...... 172

Table 5.1 GT activities involved in AGP glycosylation...... 185

16

LIST OF FIGURES

Page

Figure 1.1 Protein sequences of a classical AGP molecule, an EXT molecule and a hybrid

PRP molecule identified in Arabidopsis...... 43

Figure 1.2 A simplified HRGP model illustrating the Hyp contiguity hypothesis for a

model AGP protein sequence and a model EXT protein sequence...... 49

Figure 2.1 AG structure of an AGP molecule...... 66

Figure 2.2 Total [14C] radiolabel incorporation into the GalT reaction product in the

absence or presence of [AO]7 or d[AO]51 acceptor substrate...... 76

Figure 2.3 RP-HPLC fractionation of the [AO]7:GalT and d[AO]51:GalT reaction

products on a PRP-1 reverse-phase column...... 77

Figure 2.4 Monosaccharide analysis of the RP-HPLC purified tobacco d[AO]51:GalT

reaction product (Panel A) and the Arabidopsis [AO]7:GalT reaction product

(Panel B)...... 79

Figure 2.5 Bio-gel P2 fractionation of the RP-HPLC purified GalT reaction product. .... 81

Figure 2.6 Bio-gel P2 fractionation of free Hyp amino acid (Panel A) or Hyp amino acid

produced following base hydrolysis of the [AO]7 acceptor substrate (Panel B)

...... 82

Figure 2.7 Biochemical characteristics of the Arabidopsis [AO]7:GalT activity...... 85

Figure 2.8 Incorporation of [14C] radiolabel into acceptor substrates containing various

Hyp motifs...... 86

Figure 2.9 ESI-MS/MS analysis of the Arabidopsis [AO]7:GalT reaction product...... 88

17

Figure 2.10 Fractionation of mixed-membranes of Arabidopsis on a sucrose gradient in

the presence of 1 mM EDTA...... 90

Figure 3.1 Schematic domain structure of the two classes of putative Arabidopsis β-

(13)-GalTs...... 102

Figure 3.2 PCR analysis of Pichia transformed with pPICZA-6xHis-GALT1, pPICZA-

6xHis-GALT3, pPICZA-6xHis-GALT4, pPICZB-6xHis-GALT5, pPICZB-

6xHis-GALT6...... 116

Figure 3.3 Verification of the constructs of pMDC30-6xHis-GALT by diagnostic

digestion with the AscI and SacI restriction ...... 117

Figure 3.4 PCR analysis of the Agrobacterium strains transformed with the pMDC30-

6xHis-GALT constructs...... 117

Figure 3.5 PCR analysis of tobacco BY2 cell lines transformed with the hsp:6xHis-GALT

genes...... 118

Figure 3.6 Verification of the constructs of pMDC32-6xHis-GALTs by diagnostic

digestion and PCR...... 119

Figure 3.7 PCR analysis of the Agrobacterium strains transformed with the pMDC32-

6xHis-GALT constructs...... 120

Figure 3.8 PCR analysis of the tobacco BY2 cell lines transformed with the P2x35S:6xHis-

GALT genes...... 120

Figure 3.9 Western blotting analysis of microsomal proteins isolated from Pichia

transformants...... 122

18

Figure 3.10 RP-HPLC fractionation of the [AO]7 substrate acceptor and the [AO]7:GalT

reaction products on a PRP-1 reverse-phase column...... 127

Figure 3.11 Western blotting analysis of microsomal proteins isolated from the tobacco

BY2 lines transformed with the Phsp:6xHis-GALT genes and the wild type

BY2 lines...... 130

Figure 3.12 Western blotting analysis of proteins from microsomal membranes isolated

from tobacco BY2 lines transformed with the P2x35S:6xHis-GALT genes and

wild type BY2 lines...... 134

Figure 3.13 Subcellular localization of GalT3 in tobacco leaf epidermal cells...... 136

Figure 4.1 Identification of T-DNA insertion lines of Arabidopsis fut4, fut6, and fut4/fut6

mutants by PCR...... 152

Figure 4.2 RNA transcript levels of FUT4 and FUT6 genes in homozygous fut4, fut6,

fut4/fut6 double mutants and wild type seedlings...... 153

Figure 4.3 Phenotypic analysis of Arabidopsis fut4, fut6 and fut4/fut6 mutant plants

compared to wild type plants under physiological growth conditions...... 154

Figure 4.4 Germination rate of Arabidopsis fut4, fut6 and fut4/fut6 double mutant and

wild type plants...... 155

Figure 4.5 Root growth of Arabidopsis wild type, fut4, fut6 and fut4/fut6 mutant plants.

...... 156

Figure 4.6 Eel lectin staining of roots from WT, fut4, fut6 and fut4/fut6 mutant plants. 158

Figure 4.7 Monosaccharide composition analysis of root AGPs from WT, fut4, fut6 and

fut4/fut6 mutant plants by HPAEC...... 159

19

Figure 4.8 Glycome profiling of sequential cell wall extracts from roots of WT, fut4, fut6

and fut4/fut6 double mutant plants...... 163

Figure 4.9 Glycome profiling of sequential cell wall extracts from leaves of WT, fut4,

fut6 and fut4/fut6 double mutant plants...... 164

Figure 5.1 A model for HRGP hydroxylation and Hyp glycosylation through the

secretary pathway...... 183

Figure 5.2 Predicted GT activities involved in the formation of a 15-sugar unit for AGP

glycosylation...... 186

20

LIST OF ABBREVIATIONS

[AO]7: [alanine-hydroxyproline]7 AG: arabinogalactan AGP: arabinogalactan-protein AIR: alcohol insoluble residue Ala: alanine AOX: alcohol oxidase Ara: arabinose Arabidopsis: Arabidopsis thaliana AraT: Arg: arginine Asn: asparagine Asp: aspartic acid CAZy: the Carbohydrate Active enZymes (CAZy) database CCRC: the Complex Carbohydrate Research Center CID: collision-induced dissociation Cys: cystine d[AO]51: deglycosylated [alanine-hydroxyproline]51 dd[AO]51: double deglycosylated d[AO]51 peptide DP: degree of polymerization EDTA: ethylenediaminetetraacetic acid ElISA: enzyme-linked immunosorbent assay ER: endoplasmic reticulum ESI-MS: Electrospray Ionization-Tandem Mass Spectrometry EST: expressed sequence tag EXT: extensin ExtP: extensin peptide FITC: fluorescein isothiocyanate FLA: fasciclin-like AGP

21

FUC XG: fucosylated xyloglucan Fuc: fucose FUT: fucosyltransferase G unit: guaiacyl unit of lignin monomer GABA: γ-aminobutyric acid GAGP: gum arabic Gal: Gal: galactosyltransferase GalA: galacturonic acid Galectin domain: galactose-binding lectin domain GalNAcT: acetylgalactosaminyltransferases GAUT: galacturonosyltransferase GAX: glucuronoarabinoxylan GC-MS: gas chromatography-mass spectrometry GFP: green fluorescent protein Glc: glucose GlcA: glucuronic acid GlcA: glucuronyltransferase GlcT: Gln: glutamine Glu: glutamic acid Gly: glycine GPI: glycosylphosphatidylinositol GT: H unit: hydroxyphenyl unit of lignin monomer HF: hydrogen fluoride HG: homogalacturonan His: histidine HPAEC: High pH Anion Exchange Chromatography

22

HRGP: Hydroxyproline rich-glycoproteins Hyp: hydroxyproline Idt: isodityrosine Lys: lysine mAb: monoclonal antibody Man: mannose MeClcA: O-methyl-GlcA N-glycan: N-linked NMR: magnetic resonance NON-FUC XG: nonfucosylated xyloglucan P4H: proline 4-hydroxylase Pfam: database PMSF: phenylmethylsulphonyl fluoride Pro: proline PRP: proline rich protein RG-I: rhamnogalacturonan I RG-II: rhamnogalacturonan I Rha: rhamnose RP-HPLC: reverse phase-high performance liquid chromatography S unit: syringyl unit of lignin monomer Ser: serine ST: TFA: trifluoroacetic acid Thr: threonine Tyr: tyrosine Val: valine Xyl: xylose XylT:

β-Yariv reagent: (β-D-galactosyl)3 or (β-D-glucosyl)3 Yariv phenylglycosides

23

CHAPTER 1: INTRODUCTION

1.1. Plant Cell Walls

1.1.1. Biological and economic importance

Plant cell walls are a major feature of plant cells. Plant cell walls, with their rigid

structure, define plant cell shapes and provide mechanical support for the plant body.

Born at the cell plates during cell division, plant cell walls are involved in plant

embryogenesis and organ formation. In differentiated cells, cell wall structures acquire

heterogeneity at the tissue and individual cell levels, or even across a single cell wall to

accommodate the diverse functions of plant cells (Knox, 2008; Burton et al., 2010).

Lying as the outermost layer of plant cells, the plant cell wall is also an active player in

plant response to biotic and aboitic factors in the environment (Vorwerk et al., 2004).

Plant cell walls are not only of biological importance for plants; they are the major

components of feedstock and used for the production of textile, constructional materials

and paper in industry (Carpita and McCann, 2000). In agriculture, cell wall compositions

directly affect food and forage properties such as digestibility and shelf life. Some

essential food content, like dietary fiber, comes from the plant cell wall. Specific cell wall

components, such as gum arabic and pectins, are widely-used food ingredients and serve

as emulsifiers, stabilizers or carriers for flavoring. Furthermore, cell wall biomass is one

of the potential sources for biofuel production to meet the urgent need of the world for sustainable energy.

24

Given the biological and economic importance of plant cell walls, cell wall biology is of particular interest in plant biology. Acquiring knowledge about cell wall structure, regulatory mechanisms, biosynthesis and degradation is essential in the attempt to manipulate plants for better agricultural and industrial uses.

1.1.2. Structural model and compositions

The architecture of plant cell walls differ depending on plant species, developmental stages of a plant or even differentiated types of cells within a plant (Carpita and Gibeaut,

1993; Knox, 2008). All plant cells have primary cell walls, which are deposited at the cell plate and grow accordingly during cell expansion. Some specialized cell types (e.g. xylem cells) also build a secondary cell wall after cessation of cell growth, between the primary cell walls and the plasma membrane.

The primary cell walls of flowering plants are classified into two types (Carpita and

Gibeaut, 1993; Carpita and McCann, 2000). The type I cell wall is present in dicot plants and most of the monocot plants whereas the type II cell wall is found in commelinoid monocots. The type I cell walls are composed of cellulose microfibrils interconnected by the xyloglucan. This load-bearing framework is embedded in a network of pectin polymers, which constitute the wall matrix, controlling wall porosity and many other physiological properties. The type I cell walls have substantial amount of protein components playing structural roles in muro. In type II cell walls, glucuronoarabinoxylan

(GAX) replaces xyloglucan as an intermolecular linker for cellulose microfibrils, with the exception of plants of the order Poales, for which mixed-linkage glucans serve as the major cross-linking hemicellulose. The type II cell wall is low in pectin polymers as well

25

as structural protein components but rich in phenolic substances. Compared to primary

cell walls, secondary cell walls have higher content in cellulose and lower content in pectin and protein components (Albersheim et al., 2011). Secondary cell walls often feature the presence of lignin polymers, which enhance the mechanical strength of cell walls and serve as barriers against water diffusion and pathogen invasion.

The cell wall model is a developing model with the incorporation of new concepts from the advances of researchers. For example, one of the major cell wall glycoproteins,

extensins (EXTs), was originally considered to form an independent wall domain, functioning to lock microfibrils in position when cells cease in growth (Lamport, 1986;

Carpita and Gibeaut, 1993). Recent findings showed that EXT molecules are capable of self-assembly and are likely to self-assemble into a scaffold which directs the assembly of newly synthesized wall materials at the cell plate during cytokinesis (Cannon et al., 2008;

Lamport et al., 2011)

1.1.2.1. Celluloses

Cellulose is the principle structural component of plant cell walls. It composes 20-30%

(w/w) of both the type I and type II cell walls and accounts for up to 50% (w/w) of secondary cell walls (Albersheim et al., 2011). Unlike most of the other cell wall polysaccharides, which have sugar backbones frequently substituted by monosaccharides or larger sugar side chains, cellulose molecules consist of unsubstituted β-D-(14)- glucan chains. Two to three dozen of the glucan chains align in parallel and assemble into cellulose microfibrils (Carpita, 2011). With the glucan chains bound tightly through hydrogen bonds (Nishiyama et al., 2002; Nishiyama et al., 2003), cellulose microfibrils

26 are present as a crystalline structure, endowing them with strength and stiffness. The length of glucan chains varies from several hundred to several thousand glucose (Glc) units (measuring up to a few micrometers in length) depending on wall type (Brett, 2000;

Brown et al., 2005). Individual glucan chains are thought to begin and end at different places within the microfibril to constitute longer cellulose fibrils (Carpita and McCann,

2000).

1.1.2.2. Xyloglucans

Xyloglucans constitute 10-20% (w/w) of Type I cell walls (Fry, 1989) as the principle cross-linking molecules. Xyloglucan molecules have a β-D-(14)-glucan backbone, with sugar substitutions occurring at regular positions. Substitutions on Glc residues in xyloglucan are limited to a few types, each of which is given a one-letter code as an abbreviation. Whereas “G” denotes unsubstituted Glc residues; “X” denotes Glc residues substituted by a single α-D-xylose (Xyl) at the O-6 position; “L” and “A” denote

O-6 of the Glc residues substituted by β-D-galactose (Gal) -(12)-α-D-Xyl and α-L- arabinose (Ara) -(12)-α-D-Xyl, respectively. If the Gal on the “L” chain is further substituted by an α-L-fucose (Fuc) residue, the chain is denoted as “F” (Fry et al., 1993).

With these regular sugar substitutions, xyloglucan can be viewed as a polymer constructed of repeating oligosaccharide blocks. The predominant xyloglucan, the type I cell wall xyloglucan, is built up mainly of XXXG and XXFG blocks, although α-L-Ara is added at some places along the glucan chain (Carpita and McCann, 2000). Other forms of xyloglucan exist. For solanaceous species and peppermint, xyloglucan is a mixture of

AXGG, XAGG and AAGG subunits (York et al., 1996). In commelinoid monocots, the

27

small amount of xyloglucan present is modified by xylose residues at random places on

the backbone (Kato et al., 1982).

1.1.2.3. Xylans

Xylans refer to a family of polysaccharides having a β-D-(14)-xylan backbone in common (Scheller and Ulvskov, 2011). Modifications of the Xyl residues mainly occur at

C-2 and C-3 positions with the attachment of α-Ara residues, α-glucuronic acid (GlcA) or

O-methyl-α-GlcA (MeGlcA) residues. Different species of xylan molecules vary in the types of substitutions and are present in different plant species and tissues. Secondary cell walls of dicot plants are composed of up to 35% (w/w) glucuronoxylans, which have little or no Ara but have approximately 10% to 20% (w/w) of the Xyl residues substituted with MeGlcA residues (Zeng et al., 2008). In commelinoid monocots, GAX serves as the principle hemicellulose, making up to 20% (w/w) of the cell wall. GAX is substituted with α-Ara at the C-3 and, less frequently, single GlcA at the C-2 of the xylosyl units

(Carpita and Gibeaut, 1993). The percentage of substituted Xyl residues ranges from 10%

to 90% in GAX molecules. The higher degree of GAX substitution prevents bonding between GAX to itself and to cellulose, and so favors the need of dividing and elongating cells (Carpita and Gibeaut, 1993). GAX of commelinoid plants have ferulic acid esters formed at O-5 of some of the Ara residues. Feruloylated GAX may form intra- and inter- molecular linkages within the GAX group or to lignin molecules, which is predicted to contribute to the recalcitrant nature of cell walls (Scheller and Ulvskov, 2011).

28

1.1.2.4. Mannans and glucomannans

Mannans and glucomannans are β-(14)-linked polysaccharides containing mannose (Man) in their backbones (Scheller and Ulvskov, 2011). Mannans and galactomannans have backbones composed entirely of Man residues. Mannans have little if any substitutions on the backbone while galactomannans are substituted by α-1,6- linked Gal to various extents (Reid et al., 1995). The backbones of glucomannans and galactoglucomannans contain Glc and Man residues connected in a nonrepeating pattern

(Scheller and Ulvskov, 2011). D-Gal is bonded as single-unit side chains to the glucomannan chain through α-(16)-linkages (Willfor et al., 2003). Mannans and galactomannans are well known to be prevalent in the endosperm of legume seeds.

Galactoglucomannans are major components of the secondary cell walls in gymnosperms

(Ebringerová et al., 2005). In other cell types of higher plants, this group of molecules is present in low abundance but may play essential roles (Goubet et al., 2003).

1.1.2.5. Mixed-linkage glucans

Mixed-linkage glucans mainly consist of β-(14)-linked glucotriose and glucotetraose units interspersed by single β-(13)-linkages, although oligomers of five or more contiguous β-(14)-linked Glc units also occur (Carpita and McCann, 2000).

Mixed-linkage glucans are found throughout the order of Poales but not in dicot plants. In

Poales, mixed-linkage glucan content is developmentally regulated and plays roles in cell expansion and cellulose cross-linking (Scheller and Ulvskov, 2011).

29

1.1.2.6. Pectins

Pectins are a family of cell wall polysaccharides rich in galacturonic acid (GalA).

GalA residues are linked at O-1 and O-4 positions and comprise approximately 70% of

the pectin composition (Mohnen, 2008). The three major components, homogalacturonan

(HG), rhamnogalacturonan I (RG-I) and rhamnogalacturonan II (RG-II), compose around

65%, 20%, and 10% of the pectin polymers, respectively. HG is an unbranched α-(14)-

linked GalA chain. HG is partially methylesterified at the C-6 position and may be

acetylated at O-2 or O-3. RG-II has a HG backbone substituted by four types of

oligosaccharide and polysaccharide side chains. The four types of RG-II side chains in

total are comprised of 12 types of sugars in over 20 different linkages. The highly

complex structure of RG-II side chains is conserved across species, implying functional

importance of these structures (Carpita and McCann, 2000). RG-II is found to form

dimers via apiosyl residues in the side chains (O'Neill et al., 2004). RG-I contains a

backbone of a repeating disaccharide unit [-α-D-GalA-(12)-α-L-Rha-(14)-]. In contrast to RG-II, the side chains of RG-I exhibit a high degree of structural variation depending on cell types and developmental stages. Studies suggest that HG, RG-I and

RG-II are linked via their backbones to form pectic polymers (Ishii and Matsunaga, 2001;

Nakamura et al., 2002). Pectic polymers are cross-linked by RG-II dimerization into a macromolecular pectin network (Fleischer et al., 1999). The negatively charged HG

domains interact with each other in the presence of Ca2+ ions, forming the gel structure of

pectin. Bridging between HGs through Ca2+ is blocked by HG methylesterification,

which can be reversed by pectin methylesterases in the cell wall.

30

1.1.2.7. Lignin

Lignin is a complex aromatic biopolymer based on 4-hydroxyphenylpropanoids

(Simmons et al., 2010). Three major lignin monomers are hydroxyphenyl unit (H units),

guaiacyl unit (G units), and syringyl unit (S units), which are predominantly

interconnected through ether linkages and carbon-carbon linkages (Albersheim et al.,

2010). Lignins are deposited predominantly in secondary cell walls, reinforcing the

mechanical properties of the wall and serving as a barrier to prevent water diffusion in vascular tissues (Vanholme et al., 2010). Biosynthesis of lignin is affected by biotic and abiotic stresses, indicating a role of lignin in plant defense and resistance to unfavorable

conditions (Moura et al., 2010).

1.1.2.8. Plant cell wall proteins

Plant cell wall proteins refer to proteins located outside of the plasma membrane or

in the wall as either bound components or free molecules. Based on genomic and proteomic data, it has been estimated that hundreds of cell wall proteins are present in

organ and tissue specific patterns (Robertson et al., 1997). According to a recent survey,

281 plant cell wall proteins (including predicted ones) are grouped into polysaccharide

modifying proteins, oxido-reductases, structural proteins, signaling proteins, proteases,

proteins with interacting domains, miscellaneous proteins, and proteins with unknown

function (Jamet et al., 2006). Plant cell wall proteins play essential roles in the wall,

including modifications of cell wall components, constitution of wall structure, signaling,

interactions between the wall and plasma membrane, and responses to environmental

31 stimuli (Carpita and McCann, 2000). Although a few cell wall proteins have been studied extensively, e.g. expansins (Cosgrove, 2000) and EXTs (Lamport et al., 2011), the exact functions and regulatory mechanisms of most cell wall proteins remain to be confirmed and further explored.

1.1.3. Biosynthesis of cell wall polysaccharides

The biosynthesis of plant cell walls involves enzymes to synthesize nucleotide sugars as building blocks, the polysaccharide synthases and glycosyltransferases (GTs) to assemble the polysaccharides as well as multiple modifying enzymes, such as glycohydrolases, to complete the products. Polysaccharide synthases catalyze the synthesis of the backbones of wall polysaccharides (Perrin et al., 2001). Backbones of polysaccharides usually consist of repeating units of a single type of monosaccharide.

Accordingly, polysaccharide synthases are processive, i.e. the sugar-transfer product could be used continuously as the substrate for the same enzyme (delCardayre and Raines,

1994). GTs catalyze the formation of sugar side chains which are diverse in their composition and architecture (Perrin et al., 2001). GTs are non-processive. A basic difference between sugar synthesis and nucleic acid or protein synthesis is that sugar synthesis does not involve templates. The accurate assembly of the complex structures of polysaccharides relies on the specificity of GTs that recognize specific substrate structures and generate unique glycosyl linkages in products. Given that plant cell wall polymers consist of at least 14 different types of monosaccharides connected by over 100 distinct glycosyl linkages (Keegstra and Raikhel, 2001), there is no surprise that over 400 genes have been predicted as putative GTs in the Carbohydrate Active enZymes (CAZy)

32 database (http://www.cazy.org/GlycosylTransferases.html) for the Arabidopsis

(Arabidopsis thaliana) genome alone. The CAZy database classifies GTs into families on the basis of amino acid sequence similarities, which reflects the intrinsic structural and mechanistic features of the enzymes (Campbell et al., 1998; Coutinho et al., 2003).

Polysaccharide synthases and GTs are usually membrane proteins and are present in low amounts in plant cells, making their purification notoriously difficult. However, substantial progress has been made in recent years in the identification of genes for the biosynthesis of cell wall polymers. A variety of strategies have been developed and are usually applied in combination, including homolog identification based on the polysaccharide synthases or the GTs known in other organisms; conventional biochemical purification methods coupled with proteomic methods; forward genetic and reverse genetic methods; and gene mining based on expression data of expressed sequence tags (ESTs) or microarray data. Table 1 summarizes the polysaccharide synthases and GTs identified unambiguously for the synthesis of specific wall polymers to date, with the activity tests and mutant phenotypes included if such information is available.

Table 1.1 Polysaccharide synthases and GTs involved in plant cell wall biosynthesis from Arabidopsis thaliana and other plants.a

Species and CAZy Activity Mutant Mutant Wall Major Phenotype (s) Reference(s) Gene GT (Mutagen) Family AtCESA1 2 n.d. rsw1(EMS) Reduced cellulose Swollen root, abnormal dark (Beeckman et al., morphology, reduced growth of 2002; Reiter, 2002) hypocotyls and roots (rsw1-10) Antisense Slightly reduced cellulose Stunted growth (Burn et al., 2002) in leaves

AtCESA3 2 n.d. ixr1 Unaltered cellulose Semidominant resistance to (Scheible et al., (sd, EMS) isoxaben 2001; Cano- Delgado et al., 2003) cev1 (EMS) Reduced cellulose Stunted root growth, induction (Ellis et al., 2002) of defense responses eli1 (EMS) Reduced cellulose, ectopic Stunted root growth, induction (Cano-Delgado et lignin of defense responses al., 2003) Antisense Slightly reduced cellulose Stunted growth (Burn et al., 2002) in leaves

AtCESA4 2 n.d. irx5 Reduction of secondary Irregular xylem vessels (Taylor et al., 2003) (Ds, EMS) cellulose 33

34

Table 1.1 (continued) AtCES6 2 n.d prc1 Reduced cellulose, Reduced growth of dark-grown (Reiter, 2002; (T-DNA) incomplete cell walls hypocotyls and roots Doblin et al., 2003) irx2 Unaltered cellulose Semidominant resistance to (Desprez et al., (sd, EMS) isoxaben 2002)

AtCESA7 2 n.d. irx3 (EMS) Reduction of secondary Collapsed xylem, weak stem (Reiter, 2002; cellulose Doblin et al., 2003) fra5 Reduction of secondary Fragile fibers, weak stem, (Zhong et al., 2003) (sd, EMS), fra5 cellulose extremely thin fiber walls, (OX) collapsed xylem vessels

AtCESA8 2 n.d. irx1 (EMS) Reduction of secondary Weak stem, collapsed xylem (Reiter, 2002; cellulose vessels Doblin et al., 2003) fra6 (EMS) Reduction of secondary Reduced fiber wall thickness (Zhong et al., 2003) cellulose

AtGSL5 2 n.d. pmr4 (EMS, Lack of wound-induced Powdery mildew resistance, (Jacobs et al., 2003; /CalS12 RNAi) callose induction of defense response, Nishimura et al., (At4g03550) callose synthesis 2003)

34

35

Table 1.1 (continued) AtGSL6 2 n.d. Heterologous More callose accumulated n.d. (Hong et al., 2001) /CalS1 expression in at the cell plate (At1g05570) tobacco

AtCSLC4 2 β-(14)-glucan Heterologous Production of soluble β– n.d. (Cocuron et al., (At3g28180) synthase expression in glucan; long β-glucan 2007) Pichia produced in lines coexpressing AtCSLC4 and AtXT1

AtXT1 34 Xyloglucan Heterologous changed distribution of No morphological change, (Faik et al., 2002; (At3g62720) α-(16)-XylT expression in XyG and slight decrease of slightly changed distribution Cavalier and Pichia and XyG amount patterns of xyloglucan epitopes Keegstra, 2006; Drosophila Cavalier et al., xxt1 2008) (T-DNA)

AtXT2 34 Xyloglucan Heterologous Absence of XyG in Aberrant root hairs and (Cavalier and (At4g02500) α-(16)-XylT. expression in xxt1/xxt2 changed mechanical properties Keegstra, 2006; Drosophila of cell wall in xxt1/xxt2 Cavalier et al., xxt2 2008) (T-DNA) 35

36

Table 1.1 (continued) AtMUR3 47 Xyloglucan mur3 (EMS) Absence of fucogalactosyl- Altered trichome papillae, (Madson et al., (At2g20370) β-(12)- GalT for the side-cain in xyloglucan reduced wall strength in 2003; Ryden et al., 3rd Xyl in XXXG hypocotyls 2003)

AtFUT1 37 Xyloglucan mur2 98% reduction of Altered trichome papillae (Perrin et al., 1999; (At2g03220) α-(12)- FUT (EMS) xyloglucan fucosylation, Vanzin et al., 2002; lower acetylation level on Perrin et al., 2003) (T-DNA) Gal over-expression increase in xyloglucan O- No visible phenotype (Perrin et al., 2003) acetylation

PsFUT1 37 Xyloglucan antisense No change Wrinkled, collapsed root cell (Faik et al., 2000; (Q9M5Q1) α-(12)-FUT surface under electron Wen et al., 2008) microscope

AtIRX9 43 Xylan backbone irx9 55% less xylose in stem; Irregular xylem; develop (Bauer et al., 2006; (At2g37090) synthase; microsome (T-DNA) decrease of xylan; decrease slower; dark green; narrower Brown et al., 2007; from irx9 is defect in in cellulose increase in the leaves; almost sterile Lee et al., 2007; Lee incorporating 14C-Xyl proportion of xylan et al., 2012)

to β-(14)-Xyl6 branching by Me-GlcUA relative to GlcUA

36

37

Table 1.1 (continued) AtIRX14 43 Xylan backbone irx14 Decrease of Xyl in stem; Irregular xylem; stem thinner (Brown et al., 2007; (At4g36890) synthase; microsome (T-DNA) decrease of xylan; decrease and shorter; has less well- Keppler and from irx14 is defect in in cellulose; increase in the developed silique and fewer Showalter, 2010; incorporating 14C-Xyl proportion of xylan seeds; drought-tolerant Lee et al., 2012)

to β-(14)-Xyl6 branching by Me-GlcUA phenotype relative to GlcUA

AtIRX14L 43 n.d. irx14l No Xyl deduction in stem No visible phenotype for irx14l. (Keppler and (At5g67230) (T-DNA) irx14/irx14L double mutant has Showalter, 2010) severe growth defect, is dwarf; fail to produce inflorescence stem

AtIRX10 47 Xylan backbone irx10 Decrease of Xyl in stem Minor irregular xylem (Brown et al., 2005; /GUT2 synthase (T-DNA) phenotype Brown et al., 2009; (At1g27440) Wu et al., 2009)

37

38

Table 1.1 (continued) AtIRX10L 47 Xylan backbone irx10l No visible phenotype for No visible phenotype for irx10l. (Brown et al., 2009; (At5g61840) synthase; microsome (T-DNA) irx14l. irx14/irx14L double irx10/irx10L double mutant is Wu et al., 2009) from irx10/irx10l is mutant showed large dwarf; develop slowly; dark defect in incorporating reduction of xylan in leaves; sterile; has decreased 14C-Xyl to β-(14)- secondary cell wall and secondary cell wall formation

Xyl6 decrease in cellulose

AtGUXT1 8 xylan MeGlcAT and guxt1 Reduced [Me]GlcA Stem strength is reduced and (Mortimer et al., (At3g18660) GlcAT (T-DNA) substitution of improved extractability of 2010) glucuronoxylan guxt1 and xylan from cell wall of

AtGUXT2 8 xylan MeGlcAT and guxt2 guxt2 in single mutant; guxt1/guxt2 double mutant; (Mortimer et al., (At4g33330) GlcAT (T-DNA) almost absence of though no growth defects found 2010) substitution of xylan backbone in guxt1/guxt2 double mutant

TaXAT1 61 xylan RNAi RNAi line decreases α- No visible phenotype (Anders et al., 2012) α-(13)-AraT Heterologous (13)–linked expression in arabinosyl substitution of Arabidopsis xylan; Arabidopsis transgenic line obtain arabinosylated xylan 38

39

Table 1.1 (continued) TfManS 2 Mannan mannosylT over-expression Two-fold increase in No visible phenotype (Dhugga et al., (AAR23313) (putative mannan mannan 2004) synthase) EST identified

TfGMGT 34 Galactomannan α- heterologous Higher substitution level of n.d. (Edwards et al., (CAB52246) (16)-GalT for the 3rd expression in Gal for mannan 1999; Edwards et Man from nonreducing tobacco al., 2002; Reid et end al., 2003)

AtFUT4 37 AGP α-(12)- FUT over-expression 33% increase in fucose, No visible phenotype (Wu et al., 2010) (At2g15390) change in other MS in AIR extract from leaves

AtFUT6 37 AGP α-(12)- FUT n.d. n.d. n.d. (Wu et al., 2010) (At1g14080)

XEG113 77 n.d. xeg113 Underarabinosylation of Elongated hypocotyls and (Gille et al., 2009) (At2g35610) (T-DNA) extensin longer petioles of the rosette leaves in the presence of xyloglucanase

39

40

Table 1.1 (continued) AtGAUT1 8 HG:α-(14)-GalAT heterologous n.d. n.d. (Sterling et al., (At3g61130) expression in 2006; Caffall et al., human cells 2009)

AtGAUT7 8 Form an active gaut7 No significant change No visible phenotype (Sterling et al., (At2g38650) complex with (T-DNA) 2006; Atmodjo et AtGAUT1, but no al., 2011) activity itself expressed in human cells

RGXT1 77 RG-II: α-(13)-XylT rgx1 No significant difference No visible phenotype (Egelund et al., (At4g01770) (T-DNA) compare to WT 2006)

RGXT2 77 RG-II: α-(13)-XylT rgx2 No significant difference No visible phenotype (Egelund et al., (At4g01750) (T-DNA) compare to WT 2006) aData listed were modified from Scheible and Pauly (2004) and updated with recent findings. n.d., not determined, indicating the enzyme activities have not been proved by available in vitro GT assays. 40

41

Most cell wall polysaccharides are synthesized in the secretory pathway except for cellulose and callose which are assembled at the outer layer of the plasma membrane

(Scheible and Pauly, 2004). Emerging evidence supports the idea that individual biosynthetic enzymes may organize into functional complexes for the biosynthesis of some cell wall polysaccharides. The cellulose synthase complex has been proven to be composed of three or more different subunits (Gardiner et al., 2003; Taylor et al., 2003;

Wang et al., 2008). The formation of the xyloglucan backbone requires coordinated actions of β-(14)- (GlcTs) and α-(12)-xylosyltransferases

(XylTs), very likely in a complex (Gordon and Maclachlan, 1989; Cocuron et al., 2007).

Very recently, a protein complex with GAX synthesizing activity has been shown to contain three putative GTs in wheat (Zeng et al., 2010). In addition, the Arabidopsis galacturonosyltransferase 1 (GAUT1) functions in a protein complex with the homologous GAUT7 for pectin synthesis (Atmodjo et al., 2011). In contrast, backbone formation and substitutions with GlcA or MeGlcA residues for glucuronoxylan synthesis are apparently two independent processes (Mortimer et al., 2010).

The study of cell wall biosynthesis is still at an early stage. The actual functions of the majority of putative GTs remain to be identified. Many other fundamental questions also remain open. How are the polysaccharides synthesized in the secretory pathway integrated into the cell wall? Do the modifications of wall polymers occur in the secretory pathway or at the cell surface? How are the cell wall biosynthetic enzymes regulated to construct heterologous wall structures in differentiated plant cells? What signals and how are wall modifications achieved in response to environmental stimuli? Answers to these

42 questions will not only unravel the mystery associated with the cell wall biosynthetic machinery, but will also provide a basis for attempts to manipulate cell wall structure for agricultural and industrial applications.

1.2. Hydroxyproline Rich-Glycoproteins (HRGPs)

HRGPs are a family of plant cell wall proteins that are rich in hydroxyproline (Hyp) residues in the protein backbone and often undergo glycosylation modifications. In addition, the HRGP family is characterized by the presence of repetitive motifs in the protein backbone (Kieliszewski and Lamport, 1994).

1.2.1. Classification

Plant HRGPs can be subdivided into proline (Pro) rich proteins (PRPs), extensins

(EXTs), arabinogalactan-proteins (AGPs), gum arabic glycoprotein (GAGP), solanaceous lectins and the hybrid and chimeric HRGPs subfamilies (Showalter, 1993; Kieliszewski and Lamport, 1994; Seifert and Roberts, 2007). HRGP subfamilies vary in the characteristic motifs of their protein backbones, glycosylation patterns and cross-linking properties. Hybrid HRGPs are composed of modules of different HRGP subfamilies, and chimeric HRGPs are defined as HRGPs containing one or more HRGP modules as well as non-HRGP protein sequences (Showalter et al., 2010). While GAGP and solanaceous lectins are restricted to certain plant species, EXTs, PRPs and AGPs are widely present in plants (Showalter, 1993). Protein sequences of a chimeric PRP molecule, a classical EXT molecule and a classical AGP molecule are shown in Fig. 1.1. Although HRGP subfamilies have different characteristic motifs, common rules for Pro hydroxylation and

43

Hyp glycosylation apply to the whole HRGP superfamily. Thus, HRGP subfamilies are

better viewed as structurally and evolutionarily related glycoprotein groups, rather than

discrete classes (Kieliszewski and Lamport, 1994).

A classical AGP

>At1g35230 (AGP5) MASKSVVVFL FLALVASSVV AQAPGPAPTI SPLPATPTPSQSPRATAPAPSPSANPPPSAPTTAPPVS QPPTESPPAPPTSTSPSGAPGTNVPSGEAGPAQSPLSGSPNAAAVSRVSLVGTFAGVAVIAA LLL

An EXT

>At1g21310 (EXT3) MASLVATLLVLTISLTFVSQSTANYFYSSPPPPVKHYTPP VKHYSPPPVYHSPPPPKKHYEYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKKHYVYKSPPPP VKHYSPPPVYHSPPPPKEKYVYKSPPPP PVHHYSPPHHPYLYKSPPPPYHY

A chimeric PRP

>At3g62680 (PRP3) MAITRSSLAICLILSLVTITTADYYSPSSPPVYKSPEHKPTLPSPVYTPPVYKPTLSPPVYTKPTIPPPVYTP PVYKHTPSPPVYTKPTIPPPVYTPPVYKPTLSPPVYTKPTIPPPVYTPPVYKPTPVYT KPTIPPPVYTPPVYKPTPSPPVYKKSPSYSSPPPPYVPKPTYTPTTKPYVPEILKAVDGIILCKNGYETYPIL GAKIQIVCSDPASYGKSNTEVVIYSNPTDSKGYFHLSLTSIKDLAYCRVKLYLSPVETCKNPTNVNKGLTG VPLALYGYRFYPDKNLELFSVGPFYYTGPKAAPATPKY

Figure 1.1 Protein sequences of a classical AGP molecule, an EXT molecule and a hybrid PRP molecule identified in Arabidopsis. Colored sequences at the N and C termini indicate predicted signal peptide (green) and GPI anchor (light blue) addition sequences if present. The sequence of AGP5 (Schultz et al., 2000; Showalter et al., 2010), a classical AGP, is mainly composed of Pro-dipeptide sequences (highlighted in yellow) but also contains a few Pron motifs (highlighted in pink). The sequence of EXT3 (Hall and Cannon, 2002), representing a typical EXT sequence, contains the YVYK motif (highlighted in dark purple) and the Pron motif

44

located in longer repetitive motifs (underlined). The PRP3 (Fowler et al., 1999) is a chimeric PRP with an N terminal PRP domain composed of PRP motifs (highlighted in red) located in longer repetitive motifs (underlined) and a C terminal region rich in Pro, Val, Lys and cysteine (Cys) residues.

1.2.2. Three major HRGP subfamilies: PRPs, EXTs and AGPs

PRPs have repetitive structures with a high content of Pro and Hyp residues and are suggested to be involved in normal plant development as well as defense (Liu and Mehdy,

2007). A subclass of PRPs may function in plant nodule formation (Showalter, 1993).

One of the common motifs of PRPs is Pro-Pro-Xaa-Yaa-lysine (Lys), where Xaa is valine

(Val), histidine (His), threonine (Thr) or alanine (Ala) and Yaa is tyrosine (Tyr), Thr,

glutamic acid (Glu) or Pro (Sommer-Knudsen et al., 1998). One or both of the two Pro

residues may be modified into Hyp residues (Francisco and Tierney, 1990; Kieliszewski

et al., 1992). Expanded versions or variations of the pentapeptide are also found, such as

(Lys)-Pro-Pro-Xaa-Yaa-Lys-(Pro-Pro), Lys-Pro-Pro-Xaa-Lys-(Pro-Pro), where Xaa is

normally glutamine (Gln), Val, Thr or Ala. Individual PRPs may have one or two such

motifs or consist almost entirely of the repetitive motifs (Sommer-Knudsen et al., 1998).

PRPs are normally lightly glycosylated and contain 2-27% (w/w) sugar (Shpak et al.,

1999). Glycosylation of PRPs occurs mainly as arabinosylation of Hyp residues. Gal

attachment or N-glycosylation are found in some PRPs (Sommer-Knudsen et al., 1998).

However, two recent studies characterized chimeric PRPs from tobacco (Nicotiana alata) and Arabidopsis with a carbohydrate content of up to 75%, highlighting the structural diversity and complexity of HRGPs (Sommer-Knudsen et al., 1996; Hijazi et al., 2012).

45

EXTs are periodic amphiphiles composed of hydrophilic motifs of serine (Ser) -

Hyp4 and hydrophobic motifs of Tyr-Xaa-Tyr-Lys (where Xaa is Lys or Tyr) and/or Val-

Tyr-Lys (Kieliszewski and Lamport, 1994; Cannon et al., 2008). Among HRGPs, EXTs

are moderately glycosylated and have sugar components counting for ~50% of the

molecular weight. Glycosylation of EXTs occurs on Ser-Hyp4 motifs, with Ara

(commonly DP1 to DP4) attached to the Hyp residues and frequently,

single Gal residues attached to the Ser residues (Kieliszewski et al., 1992; Kieliszewski

and Lamport, 1994). Contiguous Hyp residues in Ser-Hyp4 form a rigid poly-Pro II

structure, which is stabilized by the attached sugar side chains as well as by isodityrosine

(Idt) formation between the two Tyr residues in the Tyr-Xaa-Tyr-Lys motif (Kieliszewski and Lamport, 1994; Velasquez et al., 2011). While Idt functions in intramolecular cross- linking (Epstein and Lamport, 1984), two EXT monomers are cross-linked by forming di-

Idt (Held et al., 2004) or pulcherosine, i.e. cross-linking between two Idt residues or between one Idt and one Tyr residue from the Val-Tyr-Lys motif (Schnabelrauch et al.,

1996). The discovery of EXT peroxidase (Everdeen et al., 1988), together with the visualization of the in vitro self-assembly of EXT monomers into dendritic structures

(Cannon et al., 2008) provides evidence that soluble EXT monomers are likely to self- assemble and cross-link into an insoluble EXT network in vivo. Formation of the EXT network likely contributes to wall architecture and defense, consistent with previous studies which showed induced expression of EXTs by wounding and pathogen attack

(Esquerre-Tugaye and Mazau, 1974; Esquerre-Tugaye and Lamport, 1979; Mouly et al.,

1992; Merkouropoulos et al., 1999). An essential role of EXT in cell plate formation was

46

illustrated by an aberrant cross wall assembly in the Arabidopsis ext3 mutant and

suggested that the EXT network may serve as a scaffold to template cell wall assembly

through interactions with pectin polymers (Hall and Cannon, 2002; Cannon et al., 2008).

Among HRGPs, AGPs are the least periodic but undergo glycosylation to the

highest extent (Tan et al., 2003). In Arabidopsis, a recent bioinformatic study identified

85 putative AGPs, which are highly variable in sequence (Showalter et al., 2010). A

common feature of AGP protein backbones is the biased amino acid compositions, which

are rich in Pro, Ala, Ser, Thr and glycine (Gly) residues (Showalter, 1993; Nothnagel,

1997). For example, classical AGPs are considered to have Pro, Ala, Ser and Thr residues composing greater than 50% of their protein backbones (Showalter et al., 2010). Rather

than the longer protein motifs present in PRPs or EXTs, dipeptide sequences, such as

Ala-Hyp, Hyp-Ala, Ser-Hyp, Thr-Hyp, are frequently found in AGPs in clusters or

sometimes in a more dispersed pattern (Gao et al., 1999; Showalter, 2001). The protein

backbone typically counts for less than 10% of the molecular mass, whereas the majority of the AGP molecule is composed of carbohydrates (Nothnagel, 1997). O-glycosylation of Hyp residues with arabinogalactan (AG) polysaccharides is the dominant form of AGP glycosylation, though there may be arabino-oligosaccharides attached to Hyp residues,

O-glycosylation of Ser or Thr residues, and N-glycosylation in some AGPs (Gao et al.,

1999; Qin et al., 2001). Specific binding of AGPs to (β-D-galactosyl)3 and (β-D- glucosyl)3 Yariv phenylglycosides (β-Yariv reagent) is considered a characteristic feature

of AGPs. This binding is likely to involve both the protein backbone and polysaccharide

components (Nothnagel, 1997). AGPs are proposed to play a variety of roles in plant

47

growth and development (Seifert and Roberts, 2007; Ellis et al., 2010), which will be

further discussed in Chapter 1.3.3.

1.2.3. HRGP Glycosylation

Glycosylation is an important feature of HRGPs. Hyp arabinosylation has been suggested to stabilize the secondary structures of EXTs (Velasquez et al., 2011). For

HRGPs with extensive glycosylation, like EXTs and AGPs, it is conceivable that

glycosylation plays a crucial role in the molecular function by determining the interactive

surface of the molecule (Kieliszewski, 2001). In addition, glycosylation has been

suggested to facilitate the secretion of HRGPs into the plant cell wall (Xu et al., 2008).

HRGP glycosylation mainly occurs through O-glycosylation of Hyp residues, which involves two basic steps: Pro hydroxylation by Pro-4-hydroxylases and subsequent Hyp glycosylation by GTs. In the protein backbones of HRGPs, some but not all Pro residues are converted to Hyp residues; the resulting Hyp residues may remain non-glycosylated, be glycosylated by arabino-oligosaccharides or be glycosylated by AG-polysaccharides

(Kieliszewski, 2001). These selective processes of Pro hydroxylation and Hyp glycosylation have inspired the query for potential rules involved.

Kieliszewski and colleagues first proposed rules for Pro hydroxylation and Hyp glycosylation based on protein sequence data and glycosylation patterns of isolated EXT and PRP molecules (Kieliszewski and Lamport, 1994). Hydroxylation of Pro residues is likely to be specified by amino acid sequences involving the Pro residue and its neighboring amino acids. While dipeptide Pro-Val is always hydroxylated, Lys-Pro, Tyr-

Pro, and Phe-Pro are never hydroxylated. Gly-Pro may be or may not be hydroxylated.

48

Furthermore, the extent of Hyp glycosylation was predicted to be determined by the

contiguity of Hyp residues, as summarized by the following Hyp contiguity hypothesis.

While contiguous Hyp residues are the sites for glycosylation by arabino- oligosaccharides, clustered, noncontiguous Hyp residues are the sites for glycosylation by

AG polysaccharides (Kieliszewski and Lamport, 1994; Kieliszewski and Shpak, 2001).

Contiguous Hyp residues can be symbolized as Hypn, where n is between 2 and 5 in

general. For example, the SerHyp4 motif in EXTs represents a typical contiguous Hyp

motif. The motif containing clustered, noncontiguous Hyp residues may be symbolized as

[Hyp-Xaa-Hyp-Xaa]n, where Xaa represents one to a few amino acids and is most

frequently Ser, Thr or Ala as in AGPs. A simplified model illustrating the Hyp contiguity

hypothesis is shown in Fig. 1.2.

49

Figure 1.2 A simplified HRGP model illustrating the Hyp contiguity hypothesis for a model AGP protein sequence and a model EXT protein sequence. The Hyp contiguity hypothesis was proposed by Kieliszewski and colleagues (Kieliszewski and Lamport, 1994; Kieliszewski and Shpak, 2001). The AG structure is modified from Tan et al. (2004; 2010). The arabino-oligosaccharide structure is modified from Velasquez et al. (2011). Monosaccharide symbols used here are based on the Symbol and Text Nomenclature for Representation of Glycan Structure as proposed by the Consortium for Functional Glycomics (http://glycomics.scripps.edu/CFGnomenclature.pdf).

50

While structures of isolated EXTs and PRPs support the Hyp contiguity hypothesis,

the link between AGP backbone sequences and the glycosylation pattern was obscure due

to difficulties in purification of a single AGP molecule to homogeneity as well as in

separation of the arabinosylated Hyp motifs and the AG-glycosylated Hyp motifs present

in the same AGP molecule. To overcome these difficulties, a novel strategy was applied

that involved the expression of synthetic genes encoding Hyp motifs in fusion with a

green fluorescent protein (GFP) tag in tobacco (Nicotiana tabacum) BY2 cells and

subsequent purification and analysis of the expressed glycoproteins. Information obtained

from such a synthetic gene expression strategy indeed supports the Hyp contiguity

hypothesis. For example, the expressed [Ser-Pro]32 motif is post-translationally modified

into [Ser-Hyp]32 (representing a clustered, non-contiguous Hyp motif) and subsequently

glycosylated by AG side chains (Shpak et al., 1999); while an expressed [Ser-Pro4]25 motif is modified into [Ser-Hyp4]25 (representing a contiguous Hyp motif), and

glycosylated exclusively by arabino-oligosaccharide side chains (Shpak et al., 2001). The

Hyp contiguity hypothesis also receives supports from the glycosylation pattern of a plant

AGP, tomato LeAGP1, when expressed in BY2 cells (Zhao et al., 2002). Further

investigation with the synthetic gene expression strategy indicates that the identities of

“Xaa” residues in the [Hyp-Xaa-Hyp-Xaa]n motif influence the extent of Pro

hydroxylation and Hyp glycosylation (Tan et al., 2003).

The Hyp contiguity hypothesis is well supported; however, there are exceptional

cases. For example, the gymnosperm PRP motif: Lys-Pro-Hyp-Val-Hyp-Val-Ile-Pro-Pro-

Hyp-Val-Val-Lys-Pro-Hyp-Hyp-Val-Tyr contains clustered noncontiguous Hyp motifs

51

(underlined), that are non-glycosylated or arabinosylated instead of arabinogalactosylated

(Kieliszewski et al., 1992; Kieliszewski et al., 1995; Tan et al., 2003). Refinement of the

Hyp contiguity hypothesis, perhaps taking into account what is the minimum number of noncontiguous Hyp residues for arabinogalactosylation, may help to explain these exceptional cases. Interestingly, the same synthetic motif, [Ser-Hyp4]25 is similarly hydroxylated and glycosylated when expressed in tobacco and Arabidopsis suspension cultured cells, but obtains a different glycosylation pattern when expressed in

Arabidopsis plants (Shpak et al., 2001; Estevez et al., 2006; Xu et al., 2008). This indicates that even though the same Hyp glycosylation code is shared between tobacco and Arabidopsis, the tissue or organ specific activities of P4H and GTs need to be taken into account when using the Hyp glycosylation code to predict the glycosylation pattern of HRGPs.

The hydroxylation of Pro to 4-Hyp is catalyzed by the Pro-4-hydroxylase (P4H) activities. P4Hs identified in Chlamydomonas reinhardtii and Arabidopsis compose a multi-gene family (Yuasa et al., 2005; Keskiaho et al., 2007), in support of the previous speculation that sequence-specific P4Hs are involved in the hydroxylation of HRGPs

(Kieliszewski and Lamport, 1994). AtP4H1 showed broader substrate specificity compared to AtP4H2 when expressed in insect cells. Although both AtP4H1 and AtP4H2 use EXT-like peptides as substrates, only AtP4H1 hydroxylates collagen-like peptides and hypoxia-inducible transcription factors from vertebrates (Hieta and Myllyharju, 2002;

Tiainen et al., 2005). It would be interesting to compare the efficiencies of P4Hs in using

EXT-like peptides versus AGP-like peptides as a substrate. In addition, the 13 P4Hs in

52

Arabidopsis showed distinct expression profiles based on in silica analysis (Velasquez et al., 2011). Much remains to be explored with respect to how the P4H family (i.e. 13 predicted members in Arabidopsis) coordinates hydroxylation within the much larger

HPRG family (i.e. 166 predicted members in Arabidopsis, (Showalter et al., 2010)) in planta. Even less is known about the GTs involved in HRGP glycosylation. To date, one

EXT ARABINOSYLTRANSFERASE (AraT) (Gille et al., 2009) and two AGP

FUCOSYLTRANSFERASES 4 and 6 (FUT4 and FUT6) (Wu et al., 2010) are the only genes identified for glycosylation of HRGPs.

1.3. Arabinogalactan-Proteins (AGPs)

1.3.1. Classification

Based on their protein backbone structures, AGPs are classified into classical AGPs,

AG peptides, hybrid AGPs and chimeric AGPs (Schultz et al., 2002). As discussed before

(Chapter 1.2.2), the characteristic sequence of an AGP protein backbone is rich in Hyp,

Ser, Thr, Ala and Gly amino acids, which are often present in repetitive SO, TO and AO motifs (Showalter, 1993). The backbone of classical AGPs has a domain mainly composed of AGP motifs, sandwiched by a signal peptide and often a site for GPI-anchor attachment located at the amino and carboxyl termini, respectively. The lengths of protein backbones of classical AGPs vary from 80 to hundreds of amino acids (Schultz et al.,

2000). AG peptides are similar to classical AGPs but have an AGP domain shorter in length. The definitive length of the AGP domain for an AGP peptide is considered

53

differently by different researchers, ranging from below 20 amino acids to up to 90 amino

acids (Schultz et al., 2004; Showalter et al., 2010).

In contrast to classical AGPs and AG peptides, hybrid AGPs and chimeric AGPs

contain non-AGP domains in the core protein sequence. Hybrid AGPs have domains

composed of motifs commonly found in other HRGPs, i.e. PRP motifs, EXT motifs, in

addition to AGP motifs. If the additional structural domains are irrelevant to HRGPs, the

AGPs are referred to as chimeric AGPs (Schultz et al., 2002). Structural domains found

in chimeric AGPs include the fasciclin-like domain, Cys-rich domain, plastocyanin

domain, etc. (Showalter, 2001; Showalter et al., 2010).

1.3.2. Post-translational modifications

AGPs undergo extensive post-translational modifications, including the removal of the N-terminal signal peptide, C-terminal processing involving the addition of the glycosylphosphatidylinositol (GPI) anchor, Pro hydroxylation to Hyp and glycosylation of the protein backbone. In addition, some AGPs may be subjected to degradation by proteases and glycosidases.

1.3.2.1. Removal of N-terminal signal peptides

All of the cDNAs isolated from confirmed AGPs encode N-terminal signal peptides

(Nothnagel, 1997). The presence of an N-terminal signal peptide has been used as a prerequisite factor for the identification of putative AGPs by bioinformatic methods

(Showalter et al., 2010). The N-terminal signal peptides direct the co-translational translocation of AGP core backbones into the endoplasmic reticulum (ER) lumen, an

54

entry point into the secretory pathway from ER to the Golgi apparatus and then to the cell

surface. The signal peptides are cleaved in the ER lumen as evidenced by the absence of

the signal sequences in mature AGPs or AG peptides (Mau et al., 1995; Schultz et al.,

2004).

1.3.2.2. Addition of C-terminal GPI anchors

In eukaryotes, the GPI anchor provides an alternative way for the attachment of

proteins to the plasma membrane other than via transmembrane domains. The GPI anchor

has a core oligosaccharide structure, i.e., D-Man-α-(12)-D-Man-α-(16)-D-Man-α-

(14)-D-GlcN-inositol. The C-terminus of a GPI-anchored protein is linked to the nonreducing end of the core oligosaccharide via ethanolamine phosphate. The inositol phospholipid moiety on the reducing end of the core oligosaccharide is embedded in the outer leaflet of the plasma membrane so as to “anchor” the attached protein to the cell surface (Oxley and Bacic, 1999).

GPI-anchored proteins in plants present similar characteristics with respect to the sites for processing and addition of GPI anchors as in other eukaryotes (Takos et al.,

2000). The amino acid residue at the site of GPI addition is designated ω, which is Ser, asparagine (Asn), Ala, Gly, aspartic acid (Asp), or Cys, whereas the ω+ 2 residue is

generally Ala, Gly, Thr, or Ser. The ω+1 residue is less critical. A terminal 14 to 18 amino acid hydrophobic tail is often found following a four to eight amino acid spacer region, which often contains a basic residue [e. g. arginine (Arg) or Lys]. The amino acid sequence after the ω residue is cleaved before the preformed GPI anchor is transferred en

55 bloc to the ω residue on the backbone of a GPI-anchored protein by transamidase activity

(Takos et al., 2000; Showalter, 2001).

To date, AGPs with known GPI anchors include PcAGP1 from Pyrus communis

(Youl et al., 1998), NaAGP1 from Nicotiana alata (Gilson et al., 2001), AtAGP10 from

Arabidopsis (Schultz et al., 2000), LeAGP1 from Lycopersicon esculentum (Sun et al.,

2004) and 12 AG-peptides from Arabidopsis (Schultz et al., 2004). Fifty six out of 85

AGPs, including those experimentally confirmed and those newly identified by a bioinformatic strategy, are predicted to be GPI-anchored in Arabidopsis (Showalter et al.,

2010). Many potential functions of AGPs related to their GPI-anchored characteristics have been proposed. For example, the attachment of AGPs to the surface of the plasma membrane through GPI anchors may facilitate the interaction between AGPs and other membrane proteins (Kohorn, 2000). Another intriguing hypothesis is that phospholipases, which specifically cleave the linkage between AGPs and their GPI anchors, might control the release of AGPs to the cell wall where AGPs may function as structural components or as signal molecules (Borner et al., 2002). However, these functional speculations require experimental proof.

1.3.2.3. Glycosylation

Classical AGPs may contain both contiguous Hyp motifs and clustered, non- contiguous Hyp motifs for glycosylation by arabino-oligosaccharides and AG polysaccharides, respectively (Gao et al., 1999; Seifert and Roberts, 2007). However,

Hyp O-glycosylation with AG polysaccharide is the major defining structure for AGPs and will be the focus of discussion here. Besides Hyp O-glycosylation, single Gal

56

attached to Ser (Tsumuraya et al., 1984), unidentified O-glycosylation through Thr

residues (Tsumuraya et al., 1987) and N-glycosylation on Asn-X-Ser/Thr motifs (Wang

et al., 1993) are also reported for some AGPs.

AG polysaccharides found in AGPs are type II AGs, which are characterized by a

β-(3)-galactan backbone branched with β-(16)-galactan side chains where

oligosaccharides containing arabinosyl residues are attached. Except for the above key

features, type II AGs were suggested to be highly heterogeneous. The polysaccharide size

ranges from 30-150 sugar residues (Gaspar et al., 2001; Showalter, 2001); the ratio of Gal:

Ara varies among different AGP preparations; besides the Gal and Ara residues, AGPs

from different species may contain different sugars, including rhamnose (Rha), GlcA,

GalA, Fuc, Glc or monosaccharides modified by acetylation or O-methylation (Fincher et al., 1983; Serpe and Nothnagel, 1995; Goodrum et al., 2000; Hashimoto, 2000). The strategy of expressing synthetic genes encoding Hyp motifs fused to a GFP tag not only helped with deciphering the Hyp glycosylation code but also helped with the elucidation of the AG glycan structure. The AG polysaccharide attached to the [Ala-Pro]51-EGFP fusion protein expressed in tobacco BY2 cells was isolated and analyzed by nuclear magnetic resonance (NMR), which presented the first complete primary structure of a type II AG glycan (Tan et al., 2004) (Fig. 1.2). In a subsequent study, an interferon α2b-

[Ser-Hyp]20 fusion protein was expressed in tobacco BY2 cells. The AG polysaccharides isolated from the transgenically expressed peptides with sizes ranging from 14 to 22

residues were much smaller in size compared to naturally occurring AG polysaccharides

(Tan et al., 2004; Tan et al., 2010). It is possible that the expression of the polypeptides

57

[Ala-Pro]51-EGFP and α2b-[Ser-Hyp]20 driven by the 35S promoter may produce a large

pool of substrates which exceeded the capacity of the endogenous GTs to perform

complete glycosylation. Although varying in sizes, the AG polysaccharides from the

[Ala-Pro]51-EGFP and α2b-[Ser-Hyp]20 peptides are similar in structure to each other and

to naturally occurring AG polysaccharides. Structural similarities in the AG

polysaccharides from the [Ala-Pro]51-GFP and the interferon α2b-[Ser-Hyp]20 fusion proteins and naturally occurring AG polysaccharides led to the suggestion that a ~15 sugar unit may constitute the fundamental building blocks for the assembly of type II AG

(Tan et al., 2010). The 15 sugar unit has a β-(13) linked trigalactosyl main chain and bifurcated sidechains linked through Gal residues 6-linked to Gal-1 and Gal-2 of the main chain. The trigalactosyl main chains are repeatedly connected through β-(16) linkages whereas Gal side chains may extend with Ara oligosaccharides, Rha, GlcUA or other sugar residues depending on the plant species. The 15 sugar unit hypothesis is consistent with previous studies showing that AGPs are composed of repeated sugar blocks separated by periodate-sensitive residues (Churms et al., 1981; Fincher et al., 1983).

AGPs in plants with differentiated tissues and organs tend to be more heterogeneous in nature. However, studies using the tobacco expression system may provide a simple solution to dissect the seemingly complicated AG structure.

To date, little is known about the enzymatic machinery involved in AGP glycosylation, or more broadly, in HRGP glycosylation. In protein N-glycosylation, an oligosaccharide core structure is formed on a lipid intermediate before being transferred en bloc onto the protein backbone (Varki et al., 1999). If complex AG polysaccharides

58

are composed of units of simpler sugar building blocks (Churms et al., 1981; Fincher et

al., 1983; Tan et al., 2010), it is reasonable to predict that the sugar building blocks are

preformed before assembly into AG polysaccharides. Potential lipid intermediate

formation in AG glycosylation similar to N-glycosylation is supported by the

identification of a containing [14C]Gal in a pea membrane preparation

incubated with UDP- [14C]Gal (Hayashi and Maclachlan, 1984). However, the lack of

detailed information of the glycolipid structure or any intermediate glycosylation products makes this idea speculative.

Given the specificity of GTs (Scheible and Pauly, 2004), different GTs are likely to be involved in putting on specific sugars at specific positions of the AG side chain.

Membrane preparation from rygrass (Lolium multiflorum) cell cultures showed galactosyltransferase (GalT) activity that transferred radiolabeled Gal to endogenous acceptors with type II AG structures (Mascara and Fincher, 1982; Schibeci et al., 1984).

GalT activities involved in the formation of the β-(16)-galactosyl linkage preferably on

β-(13)-galacto-oligomers were identified and characterized in a membrane preparation from radish roots (Raphanus sativus L.) (Kato et al., 2003). Suspension-cultured tobacco cells pulse-labeled with [14C]Ara were shown to accumulate AGPs with [14C]Ara in

Golgi membranes (Kawasaki, 1987). Misawa et al. (1996) identified FUT activity that added Fuc to the L-Ara-α-(1,3)-D-Gal‐β-(16)-D-Gal-pyridylamino acceptor through an

α-(12)-linkage and was distinct from FUT activity for xyloglucan fucosylation in the

primary roots of radish. In addition, glucuronyltransferase (GlcAT) activities were also

detected in radish roots using linear β-(13)-galactans or branched β-(16)-galactans as

59

the acceptors (Endo et al., 2004). These studies used endogenous acceptors or

oligosaccharides mimicking partial sequences of the AG side chains as acceptors,

preventing a precise determination of the positions of sugar addition by the AGP GT

activities. However, valuable information on the characteristics of AGP GTs was

ascertained, for example, the β-D-GalT activities from both rygrass and radish roots and

the α-L-FUT activity from radish root all require Mn2+ divalent ions but have slightly

different pH optima, ranging from 6 to 7.5 (Mascara and Fincher, 1982; Misawa et al.,

1996; Kato et al., 2003). Furthermore, subcellular localization data indicated that β-D-

GalT, α-L-FUT and GlcAT for AGP glycosylation all reside in the endomembrane system (Mascara and Fincher, 1982; Schibeci et al., 1984; Misawa et al., 1996; Kato et al.,

2003; Endo et al., 2004). Recently, progress has been made in the identification of AGP

GT genes with the help of bioinformatics strategies. Qu et al. (2008) identified 20

Arabidopsis homologues to the mammalian -(13)-GalT genes as candidate AGP

GalTs in the CAZy GT 31 family. Although one of the twenty genes was proven to be a

-(13)-GalT involved in protein N-glycosylation (Strasser et al., 2007), another candidate gene demonstrated activity to transfer Gal to an O-methylated Gal--(13)-

Gal disaccharide, an analogue to a partial structure of AGP side chains (Qu et al., 2008).

Two Arabidopsis genes, AtFUT4 and AtFUT6, were identified from the CAZy GT37 and proven biochemically to encode AGP specific α-(12)-FUTs (Wu et al., 2010). In addition, Oka et al. (2010) utilized chemically synthesized AGP peptides conjugated to fluorescein isothiocyanate (FITC) tags as acceptors to study the GalT activity for adding

Gal residues to the AGP protein backbones.

60

1.3.3. Biological functions

Because AGPs are usually present in mixtures and are heterogeneous in nature, it is

often not possible to isolate pure, individual molecules for functional elucidation.

Exceptions include the purification of the tobacco transmitting tissue-specific protein

(TTS) (Cheung et al., 1995) and the Zinnia elegans xylogen (Motose et al., 2004), which

are highly enriched in specific plant organs or cell lines. The use of the β-Yariv reagent

and monoclonal antibodies (mAb) against AGPs have greatly facilitated the

investigations of AGPs (Nothnagel, 1997; Showalter, 2001). Besides being utilized as

molecular probes for the identification and localization of AGPs, β-Yariv reagent and

mAbs against AGP epitopes were applied to bind to and interfere with AGPs in living

plants or cells, thus providing cues to deduce AGP function. However, both β-Yariv and

most AGP mAbs are estimated to bind to 50-100 different AGPs, and thus provide

information on the distribution, expression and function of an AGP population rather than

individual molecules (Ellis et al., 2010). In recent years, advances in molecular genetic

techniques and the completion of genome sequencing of model plants, has allowed for

the study of individual AGP molecules by generating and characterizing their genetic

mutants. In addition, advances in bioinformatics (Schultz et al., 2002; Ma and Zhao, 2010;

Showalter et al., 2010) and the accumulation of microarray data (Du et al., 2009; Bosch

et al., 2011; Lin et al., 2011) provide new tools to guide the identification and functional

studies of AGPs.

With the above strategies, AGPs were demonstrated to be involved in various

aspects of plant growth and development (Showalter, 2001; Seifert and Roberts, 2007;

61

Ellis et al., 2010). Treatment of the suspension-cultured cells with β-Yariv reagent indicated that AGPs are involved in cell devision (Serpe and Nothnagel, 1994) and programmed cell death (Gao and Showalter, 1999). In embryogenesis, AGP epitopes serve as markers for differentiating cells (Pennell et al., 1991). The isolation of xylogen, a chimeric AGP with a nonspecific lipid-transfer domain, provided direct evidence that

AGPs function to induce tracheary element differentiation (Motose et al., 2001, 2004).

AGPs are involved in shoot and root growth, as indicated by shorter shoots and multifaceted defects in root growth when plant seedlings are grown in the presence of β-

Yariv reagent (Willats and Knox, 1996; Ding and Zhu, 1997). Consistently, AGPs were shown to be over-represented in the root hair morphogenesis transcriptome (Jones et al.,

2006). A null mutant of the Arabidopsis FASCICLIN-LIKE AGP (FLA) gene, fla1, was defective in shoot regeneration in tissue cultures (Johnson et al., 2011). In secondary cell wall biosynthesis, two Arabidopsis genes, FLA11 and FLA12, were suggested to modulate cellulose deposition and thus affect the mechanical properties of stems

(MacMillan et al., 2010). AGPs also have established roles in plant reproduction.

Different AGP molecules were suggested to function in female gametogenesis (Acosta-

Garcia and Vielle-Calzada, 2004), pollen tube guidance (Clarke et al., 1979; Cheung et al.,

1995), pollen incompatibility (Lind et al., 1994; Cheung et al., 1995; Wu et al., 2000; Lee et al., 2008) and pollen grain development (Levitin et al., 2008; Coimbra et al., 2009). In addition to the multifaceted roles of AGPs under physiological conditions, AGPs are implicated in plant resistance to stress, like salt stress (Shi et al., 2003; Lamport et al.,

62

2006) and possibly play a role in plant-microbe interactions (Nam et al., 1999; Gaspar et al., 2004).

Given the large number of AGP family members predicted by bioinformatics (e.g.

85 AGPs in Arabidopsis (Showalter et al., 2010) and 69 AGPs in rice (Ma and Zhao,

2010)), the precise function of the majority of AGP molecules remains unknown. The

AGP family is likely to divide into subgroups for functional specialization. Classification of AGPs based on the protein backbone structure is not necessarily correlated to the functional specialization. For example, the relatively well-characterized Lys-rich AGP and the fasciclin-like AGP families are both composed of members with diverse expression patterns and functions (Shi et al., 2003; Acosta-Garcia and Vielle-Calzada,

2004; Gaspar et al., 2004; MacMillan et al., 2010). The molecular mechanism of AGP function remains elusive dispite knowledge of the protein backbone sequence, the addition of the GPI-anchor and the glycosylation patterns that have been suggested to play a role in the functionalities of AGPs (Ellis et al., 2010).

1.4. Specific Aims of the Research

The goal of this research is to extend our understanding of the AGP glycosylation machinery. The research focuses on the identification and characterization of the GalTs and the FUTs for AGP glycosylation with the following specific aims.

 Development of an AGP GalT assay system and characterization of AGP GalT

activities in tobacco and Arabidopsis suspension-cultured cells

To date, none of the GalT genes involved in AGP glycosylation have been conclusively identified. The first specific aim of this research is to develop an in vitro

63

assay to facilitate the specific detection of AGP GalT activity. The AGP GalT assay, once developed, will be applied to characterize AGP GalT activity in Arabidopsis suspension-

cultured cells to provide insight to the AGP biosynthetic machinery.

 Heterologous expression and functional characterization of putative GalT genes

for AGP galactosylation in Pichia pastoris and tobacco suspension cultured

cells

Using a bioinformatics method, six genes in CAZy GT31 were predicted to encode

putative GalTs for AGP galactosylation (Qu et al., 2008; Egelund et al., 2011). Five of the

six genes with cDNA clones commercially available will be expressed heterologously in

yeast and tobacco suspension-cultured cell lines with an expression tag for functional

characterization. Subcellular localization of the GalT genes will be also studied using a

transient expression system in tobacco leaves to support the functional characterization

work.

 Characterization of the fut4 and fut6 mutant plants in Arabidopsis

Arabidopsis FUT4 and FUT6 have been identified as AGP-specific FUTs using a

biochemical approach (Wu et al., 2010). The functions of the FUT4 and FUT6 genes as

well as the physiological roles of AGP fucosylation will be studied by characterizing the

fut4, fut6 and the fut4/ fut6 mutant plants in Arabidopsis.

64

CHAPTER 2: IDENTIFICATION AND CHARACTERIZATION OF IN VITRO

GALACTOSYLTRANSFERASE ACTIVITIES INVOLVED IN

ARABINOGALACTAN-PROTEIN GLYCOSYLATION IN TOBACCO AND

ARABIDOPSIS

This work has been published in the following manuscript.

Liang Y, Faik A, Kieliszewski M, Tan L, Xu WL, Showalter AM (2010)

Identification and characterization of in vitro galactosyltransferase activities involved in

arabinogalactan-protein glycosylation in tobacco and Arabidopsis. Plant Physiol 154:

632-642

2.1. Introduction

Arabinogalactan-proteins (AGPs) are highly glycosylated hydroxyproline-rich glycoproteins (HRGPs) implicated in many physiological processes including plant somatic embryogenesis, programmed cell death, wound responses, and hormone signaling pathways (Seifert and Roberts, 2007; Ellis et al., 2010). AGPs are defined by the presence of arabinogalactan (AG) polysaccharides and reside mainly at the plasma membrane-cell wall interface and in plant exudates (Langan and Nothnagel, 1997; Oxley and Bacic, 1999; Sherrier et al., 1999; Svetek et al., 1999; Lamport et al., 2006; Seifert

65

and Roberts, 2007). These AG polysaccharides are added via O-glycosylation, which is

widespread in plants and includes monogalactosylation of Ser residues and extensive

modification of hydroxyproline (Hyp), which ranges from addition of oligoarabinosides

to AG polysaccharide addition. The Hyp contiguity hypothesis predicts contiguous Hyp

residues as sites of HRGP arabinosylation while clustered non-contiguous Hyp residues

are sites of galactosylation that give rise to AG polysaccharides on AGPs (Kieliszewski

and Lamport, 1994; Kieliszewski et al., 1995; Tan et al., 2003). Tests of the hypothesis in

gum arabic glycoprotein (Goodrum et al., 2000) and tobacco (Nicotiana tobacum) (Shpak et al., 1999; Shpak et al., 2001; Zhao et al., 2002; Tan et al., 2003; Held et al., 2004) using naturally occurring AGPs and synthetic genes encoding only clustered non- contiguous Hyp or contiguous Hyp confirmed that AG polysaccharide was added only to clustered, non-contiguous Hyp, while arabinosylation (oligosaccharides composed of ~4

Ara residues) occurred on contiguous Hyp blocks. The structure of a well-characterized

Hyp-AG isolated from tobacco was recently elucidated (Tan et al., 2004) (Fig. 2.1).

Based on the structure of this AG polysaccharide and given the specificity of

glycosyltransferases (GTs), there may be as many as fifteen activities

involved in the synthesis of this AG polysaccharide, namely one peptidyl Hyp--

galactosyltransferase, one -(1,5)arabinosyltransferase, possibly four -

(1,3)arabinosyltransferases, three -(1,3)galactosyltransferases, three -

(1,6)galactosyltransferases that add the three branch sites on the AG core, two -

(1,6)glucuronyltransferases, and one -(1,4)rhamnosyltransferase. In other species,

66

additional are possible, as in Arabidopsis (Arabidopsis thaliana) where -

(1,2)fucosyltransferase is involved in AGP glycosylation (van Hengel and Roberts, 2002).

Figure 2.1 AG structure of an AGP molecule. The protein backbone containing a clustered non-contiguous Hyp motif is shown. Although the two Hyp residues in the protein backbone are both glycosylated with AG when expressed in tobacco cells, only one AG side chain is shown here for simplicity. The focus of the present study is the GalT enzyme (shown in red) that adds the first Gal residue onto the AGP peptide backbone. Monosaccharide symbols used here are based on the Symbol and Text Nomenclature for Representation of Glycan Structure as proposed by the Consortium for Functional Glycomics (http://www.functionalglycomics.org/glycomics/molecule/jsp/carbohydrate/carbMolecule Home.jsp). Modified from Tan et al.2004.

Despite the fact that plant cell walls contain substantial amounts of AGPs, there is an embarrassing lack of knowledge on the enzymology of AGP (and HRGP) biosynthesis

(Ellis et al., 2010). (Karr, 1972) partially characterized a microsomal fraction that arabinosylated extensin peptides and Bolwell (Bolwell, 1986) suggested a lipid-linked intermediate might be involved in HRGP glycosylation. In addition, galactosyltransferase

67

(GalT) activity associated with AGPs was identified in ryegrass (Mascara and Fincher,

1982) and pea (Hayashi and Maclachlan, 1984) membrane preparations; moreover, this

activity in ryegrass was restricted to Golgi-derived membranes (Schibeci et al., 1984).

Others identified an -fucosyltransferase activity (Misawa et al., 1996) and a -

glucuronyltransferase activity (Endo et al., 2003) that might be involved in biosynthesis

of radish root AGPs. The major reason for this lack of research progress is the difficulty

in making appropriate acceptor substrates for these enzymes.

Very recently, however, some progress on the enzymes responsible of AGP

glycosylation was made. A bioinformatics study looking for Arabidopsis homologs to

mammalian -(1,3)GalTs identified 20 putative Arabidopsis -(1,3)GalTs (Qu et al.,

2008), one of which was previously identified as a -(1,3)GalT involved in the

biosynthesis of protein-bound N-linked oligosaccharide (N-glycan) (Strasser et al., 2007).

Additionally, Oka et al. (2010) utilized an in vitro assay system to detect and localize

Hyp:GalT activity in the endoplasmic reticulum (ER) of Arabidopsis using a chemically

synthesized AGP peptide and variants thereof conjugated to fluorescein isothiocyanate

(FITC) via a γ-aminobutyric acid (GABA) as acceptor substrates. Finally, Wu et al. (2010)

identified and characterized two α-(1,2)fucosyltransferases encoded by AtFUT4 and

AtFUT6 in Arabidopsis that are specific to AGPs.

With this as a background, we report here on the identification and characterization

of in vitro GalT activities that are likely involved in the initial steps of AGP glycosylation in both tobacco and Arabidopsis. This work was achieved by developing an in vitro AGP

GalT assay using synthetic and transgenically produced AGP peptides as acceptor

68

substrates, UDP-[14C]Gal as the sugar donor, and permeablized microsomal membranes

from tobacco BY2 and Arabidopsis suspension-cultured cells.

2.2. Material and Methods

2.2.1. Suspension culture of Arabidopsis cells

The tobacco (Nicotiana tabacum, BY2) and Arabidopsis suspension cultured cells

(Arabidopsis thaliana, ecotype Columbia) were maintained in liquid NT-1 media

(Arabidopsis Biological Resource Center, http://abrc.osu.edu/cell_culture_handling.html)

on a rotary shaker (120 rpm) at 24˚C. NT-1 media (pH 5.8) contains 4.3 g/L Murashige

and Skoog salts (Caisson), 30 g/L Suc, 180 mg/L KH2PO4, 100 mg/L myo-inositol,

1mg/L thiamine HCl and 0.44 mg/L 2,4-dichlorophenoxacetic acid. Cells were subcultured weekly (1:10 v/v) into fresh culture media. The Arabidopsis cells were

obtained from and established by Axelos et al. (1992).

2.2.2. Preparation of microsomal membranes from tobacco and Arabiodopsis

suspension cultured cells

Microsomal membranes were prepared as described earlier (Zeng et al., 2008). The

method was modified for the use of suspension cultured cells as the starting material.

Briefly, 140 g of mid-log phase cells (7 day cultures) were harvested and thoroughly

washed with double distilled water (ddH2O) at 4˚C and resuspended in 70 mL extraction

buffer (0.1 M Hepes–KOH pH 7, 0.4 M sucrose, 0.1% BSA, 1 mM DTT, 5 mM MgCl2, 5 mM MnCl2, 1 mM phenylmethylsulphonyl fluoride (PMSF), 1 tablet of Roche complete

69

protease inhibitor cocktail). Cells were disrupted in a Brinkmann Homogenizer (Model:

CPU11, Brinkmann Instruments Corp.) at speed 9 for 3 min (stopping for 30 s every 30 s

during disruption). The homogenate was filtered through two layers of miracloth, and the

filtrate was centrifuged at 3,000 x g for 20 min. The resulting supernatant was layered

over a 1.8 M sucrose cushion buffer and centrifuged at 100,000 x g for 60 min. The

uppermost layer was discarded without disturbing the membrane containing interphase

layer. A discontinuous sucrose gradient was implemented by sequentially layering 1.1 M

and 0.25 M sucrose solutions onto the interphase layer and centrifuging at 100,000 x g

for 60 min. The microsomal membranes enriched at the 0.25/1.1 M sucrose interphase

were collected and pelleted by another centrifugation at 100,000 x g for 30 min. The

pellet was resuspended in 700 μL extraction buffer and stored at -80ºC until use.

2.2.3. Standard assay for galactosyltransferase (GalT) activities

The standard GalT reaction mixture (75 μL) consisted of permeabilized microsomal

membranes (~150 μg total protein), acceptor substrate peptide (50 μg) and UDP-[14C]Gal

(~3 μM, 90,000 cpm, 465 cpm/pmol, PerkinElmer Life Science Inc.). To achieve permeabilization, 50 μL of the microsome preparations were treated with 1% Triton X-

100 (15 min, 4ºC), followed by ultracentrifugation at 100,000 x g for 45 min. The pellet was resuspended in 50 μL extraction buffer (BSA eliminated). The [AO]7 acceptor

substrate was a chemically synthesized peptide (GenScript Corp.). The sequence of

[AO]7 was [Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp]. The

d[AO]51 peptide was obtained from the [AO]51-EGFP fusion protein expressed in tobacco

BY-2 cells as described previously (Tan et al., 2003). In the control reaction, no acceptor

70 substrate was included. Reactions were incubated at room temperature for 2 h and terminated by mixing with 400 μL anion-exchange resin (DOWEX 1X8-100 resin,

Sigma-Aldrich) (1:1 v/v in ddH2O). The resin mixture was loaded on a Zeba spin column

(Pierce) and centrifuged at 15,000 x g for 1 min. While the unreacted UDP-[14C]Gal was retained by the ion-exchange resin beads, the flowthrough contained the incorporated

[14C] radiolabeled product and was analyzed with an LS6500 Multi-purpose Scintillation

Counter (Beckman).

2.2.4. Analysis of the GalT assay products by reverse phase-high performance liquid chromatography (RP-HPLC)

The GalT assay product purified by the ion-exchange resin was loaded onto a polymeric reverse-phase column (PRP-1, 5 µm, 4.1 × 150 mm, Hamilton) equilibrated with buffer A (2% acetonitrile 0.1% trifluoroacetic acid). Reaction products were eluted with isotonic buffer A for 10 min followed by a linear gradient of buffer B (100% acetonitrile with 0.1% trifluoroacetic acid) from 2 to 100% in 90 min at a flow rate of 0.5 mL/min. Chromatography was monitored by absorption at 220 nm. The eluate was collected in 0.5 mL fractions and counted for radioactivity in cpm.

2.2.5. Monosaccharide composition analysis of the GalT assay product

The RP-HPLC fractions containing [14C] radiolabeled peptides were collected and freeze dried. The dried powder was dissolved in ddH2O and subjected to total acid hydrolysis [2 M trifluoroacetic acid (TFA), 121°C, 60 min]. Excess TFA was removed by a freeze drying-dissolving (with ddH2O) cycle repeated three times. The product from

71

total acid hydrolysis was dissolved in deionized water and analyzed on a CarboPac PA10

column (4 x 250mm, Dionex) in a BioLC system using pulsed amperometric detection

(ED50 electrochemical detector, Dionex). The column was equilibrated at a flow rate of

0.5 mL/min with 200 mM NaOH for 10 min, ddH2O for 10 min and 2.5 M NaOH for 30

min. The sample was eluted with 2.5 M NaOH at a flow rate of 0.5 mL/min. Each

fraction (0.5 mL) was counted for radioactivity in cpm. Fucose, glucose, galactose,

arabinose and xylose monosaccharide (Acros Organics) were used as standards.

2.2.6. Hyp [14C]Galactoside profile analysis

The RP-HPLC fractions containing [14C] radiolabeled peptides were pooled and

freeze dried. The dried powder was dissolved in ddH2O and analyzed by size exclusion

chromatography before or after base hydrolysis (0.44 M NaOH, 18 h, 105°C). Size

exclusion chromatography was performed with a Bio-gel P2 column (90 x 1.5 cm) under gravity with degassed ddH2O. Each fraction (2.3 mL) was counted for radioactivity in

cpm. The base hydrolysate of [AO]7 as well as pure Hyp amino acid (Sigma-Aldrich) was

eluted on Bio-gel P2 under the same elution conditions as described above for

comparison. Hyp fractions were analyzed using a colorimetric method described

previously (Lamport and Miller, 1971). The column system was calibrated with high-Mr dextran (V0), galactose (Vt), xylo-oligosaccharides (DP2-6), and xyloglucan-

oligosaccharides (DP7-9) (Megazyme).

72

2.2.7. Characterization of the Arabidopsis [AO]7:GalT activity

The standard GalT assay was modified for the enzymatic characterization tests using

[AO]7 peptide as the acceptor substrate. The assay product from each reaction was

fractionated on RP-HPLC to specifically measure incorporated [14C] radiolabel into

acceptor substrates.

To test the effect of pH, permeabilized microsomal membranes were dissolved in test buffers at a final concentration of 100 mM and used for the GalT assay. Test buffers

included MES-KOH buffer at pH 4, 5, 6, and 7; HEPES-KOH buffer at pH 6, 7, and 8;

Tris-HCl buffer at pH 8 and 9; CAPS-KOH buffer at pH 10.

To test the effect of divalent ions, microsomal membranes were extracted with

divalent ions eliminated from the extraction buffer. Permeabilized microsomal

membranes were dissolved in extraction buffer with divalent ions eliminated and

distributed into aliquots. MnCl2, MgCl2, CaCl2, CuCl2, NiCl2 and ZnSO4 at a final

concentration of 5 mM was added when the GalT assays were performed. Deionized distilled H2O instead of divalent ions were added in the control reaction.

To test the enzyme specificity for different acceptor substrates, the standard GalT

assay was performed with 2 µg of various acceptor substrates. The [AO]14, [AO]7 and

ExtP acceptor substrates were chemically synthesized peptides (GenScript Corp). The sequence of [AO]14 is [Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-

Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp-Ala-Hyp]. The sequence of

[AP]7 is [Ala-Pro-Ala-Pro-Ala-Pro-Ala-Pro-Ala-Pro-Ala-Pro-Ala-Pro]. The sequence of

ExtP is [Ser-Hyp-Hyp-Hyp-Hyp-Tyr-Val-Tyr-Ser-Ser-Hyp-Hyp-Hyp-Hyp-Tyr]. The

73 dExtP2 was isolated from tomato suspension cultured cells and deglycosylated as described previously (Smith et al., 1986). dExtP2 has a highly periodic structure which is composed of di-and octapeptides, with sequences of [Tyr-Lys] and [Ser-Hyp-Hyp-Hyp-

Hyp-Val-Tyr-Lys], respectively.

To analyze the enzyme specificity for sugar donors, the standard activity assay was

14 performed with [AO]7 as the acceptor substrate and various [ C]-labeled nucleotide sugars (90,000 cpm) as the sugar donor. The nucleotide sugars tested included UDP-

[14C]Glc (MP Biomedicals), UDP-[14C]Xyl (PerkinElmer Life Sciences) and GDP-

[14C]Fuc (PerkinElmer Life Sciences).

2.2.8. Electrospray Ionization-Tandem Mass Spectrometry (ESI-MS/MS) of the

[AO]7:GalT assay product

The GalT assay was conducted under standard assay conditions except that UDP-

[14C]Gal was replaced by non-radioactive (cold) UDP-Gal (12 µM). The cold GalT assay products were fractionated by RP-HPLC, and fractions were collected according to the retention time of the glycopeptide peak as determined by a preceding run with radiolabeled products. RP-HPLC purified products were lyophilized and dissolved in 50% methanol containing 0.1% acetic acid. The sample was introduced into the ESI source at

3 µL/min using a Cole-Parmer 74900 syringe pump. ESI-MS spectra were acquired on an

Esquire 6000 Ion Trap analyzer (Bruker Daltonics) operated in positive ion mode with a capillary voltage of 4000 V, drying gas temperature of 300°C, drying gas flow rate of 5

L/min and a nebulizer pressure of 10 psi. Nitrogen was used as both the nebulizing gas and drying gas. Samples from the control or experimental assay product were subjected

74

to a full scan from m/z 600 to 3000. In MS/MS analysis, precursor ions with the target

m/z values were isolated from the full scan spectrum and fragmented by collision-induced dissociation (CID) with helium as the collision gas and a collision voltage of 1 V. Target m/z values for precursor ions were set as 1470, 1632 and 1794, corresponding to the theoretical m/z values of protonated [AO]7 with one, two and three Gal residues attached.

Instrument control and data acquisition were performed with Esquire 5.0 software.

2.2.9. Continuous sucrose gradient centrifugation

Linear 30-45% (w/v) sucrose gradients (20 mL) were poured over 4 mL cushions of

62% sucrose (w/v) in a centrifugation tube by Auto Densi-Flow (Labconco). All

solutions are made up in 0.1 M Hepes–KOH (pH 7) in the presence of 1 mM EDTA.

Microsomal membranes from Arabidopsis suspension cultured cells (1 mL) were layered

on top of the sucrose gradient. After centrifugation (100,000 x g, 16 h, 4°C, Beckman

Coulter SW32 rotor), 20 equal fractions were collected from the top of the gradient

downward with the electric gradient fractionator Auto Densi-Flow (Labconco).

The cytochrome C reductase (an ER marker), inosine diphosphatase (a Golgi marker)

and glycan synthase II (a plasma membrane marker) in each gradient fraction were

measured according to procedures described previously (Shore and Maclachlan, 1975;

Gibeaut and Carpita, 1990; Dhugga and Ray, 1994).

75

2.3. Results

2.3.1 Development of a GalT assay system with AGP-like peptides as acceptor

substrates

An in vitro GalT assay system was developed with UDP-[14C]Gal as the sugar donor and permeablized microsomal membranes from tobacco or Arabidopsis suspension- cultured cells as the enzyme resource. Two peptides with repetitive [AO] units, representing clustered noncontiguous Hyp motifs of AGPs, were used as acceptor substrates. One of the peptides, denoted as d[AO]51, contained an [AO] motif repeated 51

times. The d[AO]51 peptide was obtained from the [AO]51-EGFP fusion protein expressed

in transgenic tobacco BY-2 cells; the EGFP tag was removed by trypsin digestion and the

AG side chains were removed by deglycosylation with hydrogen fluoride (HF). The other

peptide, namely [AO]7, was chemically synthesized and contained seven [AO] repeats. A

control reaction was set up with the peptide acceptors excluded. After the reaction,

unreacted UDP-[14C]Gal was removed with an ion-exchange resin column and

radioactivity in the reaction solution was counted. As shown in Fig. 2.2, when tobacco

permeablized microsomal membranes were used as the enzyme resource, radioactivity

detected in the d[AO]51:GalT reactions was higher than the control reaction as well as the

[AO]7:GalT reactions, which displayed a similar level of radioactivity as the control reactions. Compared to the tobacco reactions, the Arabidopsis reactions showed much lower background levels and high, roughly equivalent levels of incorporation into both d[AO]51 and [AO]7.

76

Figure 2.2 Total [14C] radiolabel incorporation into the GalT reaction product in the absence or presence of [AO]7 or d[AO]51 acceptor substrate. Permeablized microsomal membranes from tobacco and Arabidopsis suspension cultured cells served as the enzyme source. Reactions were done in triplicate and mean values are presented.

2.3.2. Characterization of the GalT assay products by Reverse-Phase High

Performance Liquid Chromatography (RP-HPLC) analysis

To confirm that [14C] radiolabel was transferred to the acceptors, reaction products

were first fractionated by RP-HPLC. RP-HPLC fractionation of the tobacco and

Arabidopsis control reaction products resolved a single radioactive peak with a low retention time (Peak I in Fig. 2.3 A and D). Peak I was also detected in the experimental

reactions with [AO]7 and d[AO]51 acceptor substrates (Fig. 2.3 B, C, E and F). The

14 identity of the components in this [ C] labeled fraction is unknown. The small A220nm peaks appearing at the corresponding retention time were solvent peaks and did not represent protein components. It is possible that this fraction is composed of oligosaccharides with [14C]Gal incorporated into endogenous sugar acceptors during the

77

GalT assay. Alternatively, the fraction may represent free [14C]Gal residues released

during the assay by an endogenous galactosidase (Kato et al., 2003).

Figure 2.3 RP-HPLC fractionation of the [AO]7:GalT and d[AO]51:GalT reaction products on a PRP-1 reverse-phase column. Radioactive Peak II coeluted with the [AO]7 or d[AO]51 acceptor substrate and was used for subsequent product analysis and in monitoring enzyme activity.

Another radioactive product peak co-eluting with the [AO]7 and d[AO]51 acceptor substrates appeared only in the experimental reactions (Peak II in Fig. 2.3.B, C, E and F).

Control reactions, lacking the peptide acceptors, contained no such radioactive product peak. RP-HPLC fractionation provided evidence for incorporation of the [14C] label from

14 UDP-[ C]Gal into the [AO]7 and d[AO]51 peptide acceptors in both the tobacco and

78

Arabidopsis assay systems, including the [AO]7:GalT reaction in tobacco, which showed

no substantial increase in radioactivity in the crude product analysis (Fig. 2.2).

To confirm that the [14C] radiolabel remained associated with Gal, RP-HPLC

14 fractions containing the [ C] labeled GalT reaction product coeluting with the d[AO]51

peptide acceptor in the tobacco assay were pooled and subjected to total acid hydrolysis,

which degraded the product into free amino acids and monosaccharides. The resulting

[14C] labeled monosaccharide eluted as Gal following High pH Anion Exchange

Chromatography (HPAEC) on a CarboPac PA-10 column (Dionex) (Fig. 2.4.A).

14 Similarly, [ C]-labeled monosaccharide released from the Arabidopsis [AO]7:GalT reaction product eluted as Gal (Fig. 2.4.B). Such analysis confirmed that [14C]Gal was

incorporated into the d[AO]51 and [AO]7 acceptors without any significant conversion

into other monosaccharides.

79

Figure 2.4 Monosaccharide analysis of the RP-HPLC purified tobacco d[AO]51:GalT reaction product (Panel A) and the Arabidopsis [AO]7:GalT reaction product (Panel B). The [14C]-labeled monosaccharides were analyzed by High-Performance Anion- Exchange Chromatography (HPAEC) on a CarboPac PA-10 column. Elution times of monosaccharide standards are as indicated with arrows at the top.

2.3.3. Extent of AGP peptide galactosylation

In order to investigate how many Gal residues were incorporated into the reaction products, RP-HPLC fractions containing the [14C] labeled GalT reaction products coeluting with the peptide acceptors were pooled and subjected to base hydrolysis. Base hydrolysis degrades peptide bonds but keeps Hyp-glycosidic bonds intact (Kieliszewski and Shpak, 2001). The molecular sizes of the resulting [14C] labeled base hydrolysates were analyzed by gel filtration chromatography on a Bio-gel P2 column (Fig. 2.5). The

80 rationale here was the extent of Gal addition would be reflected in the sizes of the resulting Hyp-glycosides. As shown in Fig. 2.6.A and B, Hyp generated from base hydrolysis of the [AO]7 peptide eluted as a degree of polymerization (DP) 3 size on the

P2 column, which was identical to the elution profile of commercially available free Hyp amino acid.

14 Before base hydrolysis treatment, intact [ C] labeled d[AO]51 and [AO]7 peptides eluted at the void volume (V0) of the P2 column, as shown in the upper portion of each panel in Fig. 2.5. When tobacco permeabilized microsomal membranes were used as the enzyme source, the base hydrolysate of the d[AO]51 reaction product mainly eluted as

DP5 (Fig. 2.5.A), while the major base hydrolysate released from [AO]7 reaction product eluted as DP4 (Fig. 2.5.B). Given that Hyp eluted as DP3 (Fig. 2.6) under the same conditions, the results indicated that Hyp residues in [AO]7 were glycosylated with one

Gal residue, while Hyp residues in d[AO]51 had glycosyl side chains corresponding to two Gal residues. Note that Hyp, but not Ala residues, are known sites of glycosylation for AGPs.

81

Figure 2.5 Bio-gel P2 fractionation of the RP-HPLC purified GalT reaction product. Elution profiles of the d[AO]51:GalT reaction product before and after base hydrolysis are shown for tobacco reactions (Panel A) and Arabidopsis reactions (Panel D); elution profiles of the [AO]7:GalT reaction product before and after base hydrolysis are shown for tobacco reactions (Panel B) and Arabidopsis reactions (Panel E); elution profiles of the dd[AO]51:GalT reaction product before and after base hydrolysis are shown for tobacco reactions (Panel C) and Arabidopsis reactions (Panel F); the elution position of free Hyp amino acid (corresponding to DP3) is shown with an arrow in each panel. The column was calibrated with high-Mr dextran (V0), galactose (Vt ), xylo-oligosaccharides

82

with degree of polymerization (DP) 2 to 5 and xyloglucan-oligosaccharides (DP6-9); their elution positions are indicated with arrows at the top of the figure.

Figure 2.6 Bio-gel P2 fractionation of free Hyp amino acid (Panel A) or Hyp amino acid produced following base hydrolysis of the [AO]7 acceptor substrate (Panel B). The column was calibrated as described in Fig. 2.5.

In the case of the d[AO]51 acceptor, it was possible that two monosaccharide sugars

were added to one Hyp residue in the GalT assay. However, it could not be excluded that

one Gal residue might pre-exist on the Hyp residues of d[AO]51 because of incomplete

deglycosylation during the preparation of this acceptor substrate. To test this possibility,

double deglycosylated d[AO]51 (dd[AO]51) was produced by subjecting d[AO]51 to a

second HF deglycosylation reaction and used as an acceptor in the GalT assay. The base

hydrolysate generated from the tobacco dd[AO]51 reaction products eluted as DP4 on the

Bio-gel P2 column (Fig. 2.5.C), indicating not only that d[AO]51 was incompletely

83

deglycoslyated, but also providing evidence for the presence of two GalT activities in

tobacco microsomal membranes. One activity, a Hyp:GalT activity, transfers the initial

Gal onto Hyp residues in dd[AO]51 and [AO]7, while the second activity, a likely

Gal:GalT activity, transfers the next Gal residue onto galactosylated Hyp in the

incompletely deglycosylated d[AO]51.

When Arabidopsis permeablized microsomal membranes were used as the enzyme

source, a similar picture emerged. Here, the base hydrolysate from the [AO]7:GalT

reaction product eluted as DP4 on the Bio-gel P2 column (Fig. 2.5.E). Moreover, the base

hydrolysate of the dd[AO]51:GalT product eluted mainly as DP4 with a small amount of

DP5, whereas the base hydrolysate of the d[AO]51:GalT product eluted with considerably more DP5 relative to DP4 (Fig. 2.5.D and F). The result indicated that Arabidopsis, like tobacco, contained two GalT activities; however, the relative activity of the putative

Gal:GalT responsible for the DP5 peak was considerably greater in tobacco than in

Arabidopsis.

2.3.4. Enzymatic characteristics of the [AO]7:GalT activity in Arabidopsis suspension-cultured cells

Since Arabidopsis material will be a better source for identification of the AGP

GalT enzyme(s) using proteomic strategies given its complete genome sequence,

Arabidopsis [AO]7:GalT activity was further characterized as described below. With a

total of 150 µg of microsomal proteins in the assay system, [AO]7:GalT activity

approached saturation when 2 µg of [AO]7 was included in the reaction mix (Fig. 2.7).

14 With 2 µg [AO]7 in the GalT assay system, incorporation of [ C]Gal increased

84

proportionally with respect to the amount of microsomal protein up to 150 µg using an

incubation time of 2 h (Fig. 2.7.B). The [AO]7:GalT activity had a pH optimum of 7 (Fig.

2.7.C) and a temperature optimum of 40°C (data not shown). The [AO]7:GalT activity

increased approximately 4 fold in the presence of Mn2+ but was unchanged with Mg2+, in comparison to the control (Fig. 2.7.D). In addition, the presence of Ca2+, Cu2+, Zn2+ and

Ni2+ divalent ions inhibited activity to different extents (Fig. 2.7.D).

85

Figure 2.7 Biochemical characteristics of the Arabidopsis [AO]7:GalT activity. Data are the average of triplicate assays. A. Relationship between [AO]7 concentration 14 and incorporation of [ C]Gal into [AO]7. B. Relationship between microsomal protein 14 concentration and incorporation of [ C]Gal into [AO]7. C. Effect of pH on enzyme activity. D. Effect of different divalent ions (5 mM) on enzyme activity.

The specificity of the Arabidopsis Hyp:GalT activity was investigated using various acceptor substrates (Fig. 2.8). Namely, [AO]7, [AO]14 and dd[AO]51 were used as AGP peptides with repetitive [AO] units of various lengths. Incorporation of [14C] radiolabel decreased with increasing lengths of these [AO] acceptor substrates. In contrast, [AP]7 containing seven repetitive [Alanine-Proline] units did not serve as an acceptor substrate, indicating that peptidyl Hyp was required for galactosylation. In addition, two extensin sequences were used as potential acceptor substrates, a chemically synthesized extensin

86

peptide (ExtP) designed based on a repetitive sequence (i.e., SOOOOYVYSSOOOOY)

found in several Arabidopsis extensins (e.g., At4g08410) and a deglycosylated tomato P2 extensin (dExtP2) containing numerous SOOOOVYK and YK repeat units (Smith et al.,

1986). Both of these extensin sequences, which contained contiguous peptidyl Hyp as well as other amino acid sequences, failed to act as acceptors, indicating this Hyp:GalT activity was specific for AGP sequences containing non-contiguous peptidyl Hyp.

Figure 2.8 Incorporation of [14C] radiolabel into acceptor substrates containing various Hyp motifs. [AO]7, [AO]14 and dd[AO]51 contain seven, 14 and 51 [AO] units, respectively. A chemically synthesized extensin peptide (ExtP) and tomato deglycosylated extensin P2 (dExtP2) contain repetitive SO4 units. [AP]7 contains seven [AP] units. Enzyme reactions were done in triplicate and mean values are presented.

The requirement of a specific nucleotide sugar donor for [AO]7 glycosylation was

also tested. Arabidopsis microsomal membranes specifically incorporated [14C]Gal, but

not [14C]glucose, [14C]xylose or [14C]fucose from their corresponding nucleotide sugars,

into the [AO]7 acceptor substrate (Table 2.1).

87

14 Table 2.1 Incorporation of [ C] radiolabel into the [AO]7 acceptor substrate when various nucleotide sugars were used as the sugar donor.

14 Sugar Donor [ C] Incorporation (pmol/h/mg )

14 UDP-[ C]Gal 11.50 + 0.19

14 UDP-[ C]Glc 0.15 + 0.03

14 UDP-[ C]Xyl 0.01 + 0.02

14 GDP-[ C]Fuc 0.03 + 0.00

2.3.5. Identification of the Arabidopsis [AO]7:GalT reaction product by

Electrospray Ionization-Tandem Mass Spectrometry (ESI-MS/MS) analysis

When non-radioactive (cold) UDP-Gal was used in the GalT reactions as the sugar donor, the resulting RP-HPLC purified [AO]7:GalT reaction product was analyzed by

ESI-MS/MS analysis. No product peak was visible in either the [AO]7:GalT product or the control product by ESI-MS under full scan mode (data not shown), due to a low signal to noise ratio. In MS/MS analysis, however, a mass/charge ratio (m/z) 1470.7 ion corresponding to the putative galactosylated [AO]7 with one Gal attached was isolated in the [AO]7 assay product (Fig. 2.9.A). The identity of the 1470.7 ion was confirmed by ion fragmentation using the collision-induced dissociation (CID) technique, which generated a series of peaks with their m/z matching the theoretical fragments from [AO]7 with either a single Gal attached or not (Fig. 2.9.B). The fragmentation pattern of the m/z 1470.7 ion

88

also indicated that the attachment site of the Gal residue was at Hyp 12 or Hyp 14.

Although an ion with a m/z of 1471.0 was isolated from the control assay product (Fig.

2.9.C), the fragmentation pattern of the m/z 1471.0 ion was not related to the [AO]7

substrate or to galactosylated [AO]7 (Fig. 2.9.D). Moreover, we were unable to detect or

isolate ions corresponding to [AO]7 peptide with more than one Gal attached from either

the [AO]7 assay product or control product.

Figure 2.9 ESI-MS/MS analysis of the Arabidopsis [AO]7:GalT reaction product. The ion with m/z of 1470.7 corresponding to glycosylated [AO]7 with one Gal attached was isolated from the [AO]7:GalT reaction product (Panel A). The fragments generated from ion 1470.7 following collision-induced dissociation (Panel B) were matched with the [AO]7 fragments or galactosylated [AO]7 fragments (b and y ions with asterisks) based on their calculated m/z. In ion 1470.7, the Gal attachment site was deduced to be the Hyp residue at position 12 or position 14 (i.e., Hyp residues with asterisks in the

89

peptide). A peak with m/z of 1471.0 was isolated from the control reaction (Panel C). However, fragmentation of ion 1471.0 did not show peaks relevant to [AO]7 fragments or the galactosylated [AO]7 product (Panel D).

2.3.6. Subcellular localization of [AO]7:GalT and d[AO]51:GalT activities in

Arabidopsis suspension cultured cells

A continuous sucrose density gradient was employed to fractionate microsomal

membranes, followed by marker enzyme assays to identify subcellular membrane

systems. Although various combinations of gradient conditions were used, including

different sucrose gradients, with or without the addition of EDTA (1 mM or 5 mM) or

Mg2+ (1 mM or 5 mM), it was not possible to completely resolve ER membranes from

Golgi membranes. The best separation of ER and Golgi membranes was achieved with a

30-45% (w/v) sucrose gradient in the presence of 1 mM EDTA (Fig. 2.10.A), in which both NADH cytochrome C reductase (an ER marker) and inosine diphosphatase (a Golgi marker) showed two activity peaks (Fig. 2.10.B). However, NADH cytochrome C reductase showed higher activity in the peak of lower density (fraction 2 to 5), while inosine diphosphatase had greater activity in the higher density peak (fraction 7 to 16).

The plasma membrane exhibited a broad distribution with a single peak around fraction

13, as indicated by using glucan synthase II as a marker (Fig. 2.10.A). The gradient fractions were used for GalT assays. With both [AO]7 and d[AO]51 as acceptor substrates,

two GalT activity peaks corresponding to the ER and Golgi marker enzyme activities

were detected (Fig. 2.10.C and D).

90

Figure 2.10 Fractionation of mixed-membranes of Arabidopsis on a sucrose gradient in the presence of 1 mM EDTA. Each fraction was assayed for: A. density and glucan synthase II (a plasma membrane marker); B. activities of antimycin A-insensitive NADH cytochrome C reductase (an endoplasmic reticulum marker) and IDPase (a Golgi apparatus marker); C. [AO]7:GalT activity, and D. d[AO]51:GalT activity.

91

2.4. Discussion

Considering GTs are linkage-specific (Keegstra and Raikhel, 2001), the complex

AG structure implies the involvement of multiple GalTs in AG biosynthesis. Using

chemically synthesized and transgenically produced AGP peptides as acceptor substrates

and UDP-[14C]Gal as the sugar donor, an in vitro GalT assay was developed to detect

GalT activities involved in AGP glycosylation in tobacco and Arabidopsis. Product

analysis by RP-HPLC and monosaccharide analysis showed that [14C]Gal was

incorporated into AGP peptides in the GalT assay (Figs. 2.3 and 2.4). In examining the

extent of AGP peptide galactosylation in the [AO]7, d[AO]51 and dd[AO]51 reaction products (Fig. 2.5), evidence for two distinct GalT activities was detected. The first activity, a Hyp:GalT activity, catalyzes the addition of Gal onto peptidyl Hyp residues, as demonstrated in reactions with [AO]7 and dd[AO]51 acceptor substrates. The second

activity, a probable Gal:GalT activity, extends the sugar side chain with a second Gal as

demonstrated in reactions with the d[AO]51 acceptor substrate and was more pronounced

in tobacco compared to Arabidopsis.

The Arabidopsis Hyp:GalT activity observed here is similar to that reported by

Oka et al. (2010) with respect to its properties (pH optimum, temperature optimum,

requirement for Mn2+, specificity for UDP-Gal) (Fig. 2.7, Table 2.1) and localization to

the endomembrane system (Fig. 2.10). However, further analysis of the Hyp:GalT activity here revealed that it was specific for AGP sequences and not for other related protein sequences, including extensins with their characteristic SO4 repeat units. This

finding is consistent with the Hyp contiguity hypothesis which states that non-contiguous

92

Hyp residues are sites of arabinogalactan polysaccharide addition, whereas contiguous

Hyp residues are sites for the addition of arabinose oligosaccharides (Kieliszewski and

Lamport, 1994; Kieliszewski et al., 1995; Tan et al., 2003). In addition, ESI-MS/MS

analysis of the [AO]7 product demonstrated that only one galactose was added per peptide

molecule to the C-terminal or pentultimate Hyp residue of this peptide. Whether this

observation reflects the enzymatic mechanism associated with this Hyp:GalT activity or

the in vitro nature of this work awaits further investigation. It should also be noted that

Oka et al. used a different in vitro assay system involving a chemically synthesized

AGP14 peptide and variants thereof conjugated to fluorescein isothiocyanate (FITC) via a γ-aminobutyric acid (GABA) linker as acceptor substrates. Regardless, such chemically synthesized peptides, whether it be [AO]7 or AGP14, were only useful in demonstrating

the addition of the first Gal residue to Hyp and provide no evidence for a second

Gal:GalT activity, which could only be detected when deglycosylated AGP sequences

were used as acceptor substrates. These observations also highlight the need to develop

appropriate acceptor substrates to detect specific enzyme activities in the AG biosynthetic

pathway as the small amount of product formed in these assays is apparently insufficient

to serve as an effective substrate for the next glycosylation reaction.

In ryegrass, AGP GalT activity was reported to incorporate [14C]Gal into

endogenous acceptor substrates, forming 66% ethanol insoluble products with (1,6)-

linked galactosyl residues (Mascara and Fincher, 1982). AGP GalT activity was also

detected in radish roots and catalyzed addition of Gal residues onto exogenous β-(1,3)-

galactotriose acceptors through β-(1,6)-linkages (Kato et al., 2003). In these two studies,

93

AGP GalT activities were involved in elongation of AG side chains and so are likely to be different from the Hyp:GalT activity identified here. Indeed, the Hyp:GalT activity exhibited distinct features from the rygrass GalT and radish GalT activity. For example,

Mg2+ did not activate the Arabidopsis Hyp:GalT activity as it did for ryegrass GalT

activities. The optimal temperature of Arabidopsis Hyp:GalT (40°C) is higher than that of

radish GalT (30°C) and of ryegrass GalT (between 10 and 25°C).

N-glycosylation and O-glycosylation are the two major types of protein

glycosylation. The biosynthesis of N-glycans involves assembling an oligosaccharide

precursor structure linked to lipid carriers. The oligosaccharide precursor is then

transferred en block from lipid carriers to the protein N-glycosylation sites (Varki et al.,

1999). In contrast, O-glycosylation is often viewed as the sequential addition of single

sugar residues to the polypeptide, as exemplified by two well defined processes, O-

mannosylation (Goto, 2007) and mucin type O-glycosylation (Hanisch, 2001). The

identification of Hyp:GalT activity and likely Gal:GalT activity for the formation of a

disaccharide galactosyl chain on Hyp as well as the lack of detection of en block transfer

in this study is consistent with the notion that Gal residues are transferred sequentially to

the protein backbone in AGP glycosylation. In addition, the recent identification and

characterization of two AGP fucosyltransferases which have the ability to fucosylate

AGPs lacking terminal fucose residues is also consistent with this notion of sequential

sugar addition (Wu et al., 2010).

Protein N-glycosylation is initiated in ER but completed in Golgi apparatus. Oka et

al. (2010) proposed a hypothetical model for AG biosynthesis in which the first Gal

94 residue is added to peptidyl Hyp in an AGP backbone by GalT in the ER with further AG side chain elongation occurring in the Golgi apparatus. The subcellular localization of the

Hyp:GalT and Gal:GalT activities here is consistent with this model in that these enzyme activities were localized in the endomembrane system (Fig. 2.10). Cloning and tagging of the Hyp:GalT and Gal:GalT enzymes in the future should allow for more precise subcellular localizations. Moreover, the subcellular localization of AGP GalT activities in rygrass (Mascara and Fincher, 1982; Schibeci et al., 1984) and radish root (Kato et al.,

2003), AGP fucosyltransferase activity in radish root (Misawa et al., 1996), an AGP fucosyltransferase enzyme in tobacco leaves (Wu et al., 2010), and AGP arabinosyltransferase activity in tobacco cultured cells (Kawasaki, 1987) likewise are consistent with involvement of the endomembrane system, particularly the Golgi apparatus, in AG biosynthesis.

Although a large number of GTs are predicted to be involved with the biosynthesis of plant cell wall components, only a few GTs are unambiguously identified and functionally characterized (Keegstra and Raikhel, 2001). One of the main obstacles for

GT identification is the lack of specific enzyme activity assays. Here, an in vitro assay was developed which allows for the detection of two distinct AGP GalT activities in tobacco and Arabidopsis microsomal membranes. Clearly, this radioactive, in vitro assay to detect GalT activity using AGP peptide and glycopeptide acceptor substrates provides a valuable tool for the identification of AGP GalT proteins/genes and serves as an entry point for the elucidation of AG biosynthesis for AGPs. Finally, this assay can be readily

95 modified by using different AG acceptor substrates along with appropriate radiolabeled sugar nucleotides to detect other GT activities involved with AG biosynthesis.

96

CHAPTER 3: HETEROLOGOUS EXPRESSION OF PUTATIVE AGP

GALACTOSYLTRANSFERASES IN PICHIA PASTORIS AND TOBACCO BY2

CELLS AND THEIR FUNCTIONAL CHARACTERIZATION

3.1. Introduction

A coupling of bioinformatics prediction and experimental validation becomes a potent tool for the functional characterization of genes in the post-genomic era. The use of bioinformatics tools as a guide to identify cell wall biosynthetic enzymes has proven successful in many cases, i.e. the identification of Arabidopsis XYLOGLUCAN XylTs

(XXT1 and XXT2) (Faik et al., 2002; Cavalier and Keegstra, 2006; Cavalier et al., 2008),

AGP FUT4 and FUT6 (Sarria et al., 2001; Wu et al., 2010), XYLAN GlcATs (GUX1 and

GUX2) (Mortimer et al., 2010), and XYLAN AraTs (XAT1 and XAT2) (Anders et al.,

2012). In these studies, putative GTs were identified by searching for homologues of genes with known functions or based on the expression of genes with the event of cell

wall polymer deposition. The functional confirmation of candidate GTs usually involves

three strategies (Farrokhi et al., 2006): i) the gene is expressed heterologously, usually in

fusion with a molecular tag, and the gene product isolated and tested for the proposed GT

activity in vitro; ii) the loss-of-function mutants showing physiological or biochemical

phenotypes provide evidence of the gene function in vivo; iii) function of genes may be

demonstrated via various gain-of-function systems.

In the type II AG polysaccharides of AGPs, Gal residues are mainly connected

through β-(13)- and β-(16)-linkages. To identify candidate AGP β-(13)-GalTs, Qu

et al. (2008) conducted a BLAST search of the Arabidopsis database with three reference

97

protein sequences from Homo sapiens, GalT1, GalT2 and GalT4. Human GalT1 and

GalT2 both encode a UDP-galactose:β-N-acetylglucosamine β-(13)-GalT, while GalT4

encodes a UDP-galactose:β-N-acetyl-galactosamine β-(13)-GalT. This study identified

20 putative Arabidopsis β-(13)-GalTs, only one of which has a known function as a β-

(13)-GalT for the biosynthesis of the Lewisa epitope of N-glycans; it was named

Arabidopsis GalT1 (Strasser et al., 2007) (Table 3.1). All of the twenty genes belong to

the CAZy GT31 family. The CAZy GT31 family contains other genes with known functions mainly as β-(13)-GTs, including β-(13)-GlcTs, β-(13)-GlcNAcTs, β-

(13)-GalTs, β-(13)-GalNAcTs and one β-(14)-GalNAcT, for the protein

glycosylation in mammals and bacteria (Qu et al., 2008; Egelund et al., 2011).

98

Table 3.1 Structural features of the 20 putative Arabidopsis β-(13)-GalTsa

Name in Gene Cladeb Lengthc Position of Position of Subcellular this Accession (a.a.) the GalT the Galectin Localizatione Dissertation Domaind Domain rd GalT1 At1g26810 7 643 426-628 182-383 Golgi

GalT2 At4g21060 7 684 450-639 190-404 Golgi

GalT3 At3g06440 7 619 407-609 183-365 Golgi

GalT4 At1g27120 7 673 471-660 198-421 Golgi

GalT5 At1g74800 7 672 468-657 195-418 Golgi

GalT6 At5g62620 7 681 446-632 186-400 Golgi

- At1g11730 10 384 129-325 - Golgi

- At1g05170 10 404 149-346 - Golgi

- At2g32430 10 409 154-351 - Golgi

- At4g26940 10 407 153-350 - Golgi

- At1g22015 10 398 140-336 - Golgi

- At1g32930 10 399 144-341 - Golgi

- At1g77810 10 393 167-381 - Golgi

- At1g33430 10 395 138-334 - Golgi

- At2g25300 10 346 167-344 - Golgi

- At4g32120 10 345 133-326 - Golgi

- At5g53340 10 338 159-330 - Golgi

- At2g26100 10 331 158-355 - Post-Golgi

- At3g14960 10 343 99-292 - Post-Golgi

- At1g53290 10 345 134-331 - Post-Golgi

aThe 20 putative Arabidopsis β-(13)-GalTs were identified by Qu et al. (2008). bThe clade number is based on the phylogenetic grouping of Arabidopsis GT31 family members (Egelund et al., 2011). cProtein lengths (a.a.) are calculated based on the translated sequences of the open reading frame of the cDNA clones for GalT1 to GalT6. For the other genes, the data for protein length (a.a.) is from the TAIR database (http://www.arabidopsis.org/).

99 dThe positions of the GalT domain and the Galectin domain are predicted by the Pfam database (http://pfam.sanger.ac.uk/). eThe subcellular localization information is from the study of Qu et al. (2008) based on data from the Golgi predictor program (http://csbl1.bmb.uga.edu/GolgiP/).

On a basis of their phylogenetic relationship, genes in the CAZy GT31 family were classified into 11 clades (Egelund et al., 2011). The 20 candidate β-(13)-GalTs identified by Qu et al. (2008) are located in clade 7 and clade 10, two clades containing exclusively plant GTs. All of the twenty candidate β-(13)-GalTs contain a GalT domain (Pfam: PF01762) as identified by the Protein Family (Pfam) database

(http://pfam.sanger.ac.uk/), which is not present in the remaining 13 Arabidopsis genes in

CAZy GT31. Within the GalT domain, six conserved motifs are present, including a conserved hydrophobic region, an RxxxRxT/SW motif, an FxxG/A motif, a DXD motif, a GxxYxxS motif, an E/DDV plus GxW/C motif. The six conserved motifs are involved in sugar donor binding, sugar acceptor binding, catalysis, and may affect the specificities of the enzymes for different donor substrates (Qu et al., 2008; Egelund et al., 2011). The presence of these conserved motifs supports that the Arabidopsis accessions encode β-

(13)-GTs because none of the six conserved motifs, with the exception of the DxD motif, occur in either β-(14)-GTs or α-(13)-GTs (Qu et al., 2008).

Besides the conserved GalT domain, all of the twenty putative β-(13)-GalTs were predicted to be Gogi or post-Golgi localized, with the majority of the genes containing a transmemberane (TM) domain close to the N terminus of the protein (Qu et al., 2008).

Such predicted subcellular localization of the putative GalT genes is consistent with the

100

AGP GalT activities identified experimentally in various plant species (see Chapter

1.3.2.3).

The six putative Arabidopsis GalTs in GT31 clade 7 differ from those in clade 10 by the addition of a galectin domain (Pfam: 00337). A schematic summarizing the domain structure of the putative Arabidopsis β-(13)-GalTs in clade 7 and clade 10 is shown in Fig. 3.1. A galectin domain is defined by a conserved carbohydrate recognition domain that binds exclusively to β-galactosides (reviewed by Egelund et al. 2011). In

Homo sapiens, polypeptide N-acetylgalactosaminyltransferases (GalNAcT) containing galectin domains catalyze the addition of the first GalNAc to initiate protein O- glycosylations. It has been hypothesized that the Arabidopsis GalTs in clade 7 may transfer the first Gal onto the HRGP backbone in a similar mechanism as human

GalNAcTs (Qu et al., 2008; Egelund et al., 2011). To test this hypothesis, I plan to adopt the heterologous expression strategy to characterize the functions of five putative GalTs in the GT31 clade 7. For simplicity, the five putative GalTs are named GalT1, GalT3,

GalT4, GalT5 and GalT6 in this research (Table 3.1). Five out of the six candidate GALT genes were chosen to study at the beginning because cDNA clones of the five genes were available from public cDNA libraries. Later, the cDNA clone of GALT2 was obtained as a gift from Dr. Richard Strasser (Institute of Applied Genetics and Cell Biology, BOKU,

University of Natural Resources and Applied Life Sciences) and the function of GALT2 is being characterized by Ms. Debarati Basu in our lab using a similar heterologous expression strategy as described below. Specifically, the five putative GalTs will be expressed with an N-terminal 6xHis tag in two heterologous expression systems,

101

including a methanotrophic yeast (Pichia pastoris) system and tobacco BY2 suspension-

cultured cells. For Pichia expression, GALT genes with the N-terminal 6xHis tag are

cloned into pPICZA/B vector behind the alcohol oxidase promoter (PAOX). After

transformation, the PAOX:6xHis-GALTs are integrated into Pichia genome at the site of the genomic PAOX through homologous recombination. For tobacco expression, GALT

genes with the N-terminal 6xHis tag will be cloned into expression vectors containing an

inducible heat shock protein promoter (Phsp) or a constitutive 2x35S promoter (P2x35S).

The Phsp:6xHis-GALTs or P2x35S:6xHis-GALTs are integrated into random positions of

tobacco genome through an Agrobacterium mediated transformation. The expressed

proteins will be analyzed for AGP GalT activities using the in vitro GalT assays developed in Chapter 2. In addition, subcellular localization of the putative GalTs is planned to be studied by transiently expressing tagged versions of these genes in tobacco leaves to corroborate functional speculations of the GalTs.

102

Figure 3.1 Schematic domain structure of the two classes of putative Arabidopsis β- (13)-GalTs. A. Classical β-(13)-GalTs as presented by Arabidopsis genes in GT31 clade 10. B. Galectin-containing domain β-(13)-GalTs as presented by the six Arabidopsis genes in GT31 clade 7. The figure is drawn to scale. The red triangle indicates the position of the DXD motif in each gene. TM, transmembrane region as predicted by the TMHMM server version 2.0 (http://www.cbs.dtu.dk/services/TMHMM/). Galectin domain and GalT domain are predicted by Pfam (http://pfam.sanger.ac.uk/) database. GalT1 and GalT3 were predicted to contain no TM by TMHMM server but were predicted to be Golgi localized by Golgi predictor (http://bioinformatics.oxfordjournals.org/content/26/19/2464.full).

3.2. Material and Methods

3.2.1. Obtaining full length cDNA clones

Full length cDNA clones of RAFL09-24-L19 (At1g26810, GALT1), RAFL09-12-

K06 (At3g06440, GALT3), RAFL19-55-I21 (Ag1g27120, GALT4), and RAFL09-69-H18

(At1g74800, GALT5) were ordered from the RIKEN Bioresource Center

(http://www.brc.riken.jp/lab/epd/Eng/species/arabidopsis.shtml). The full length cDNA

103

clone of At5g62620 (GALT6, NCBI Accession BX831498) was ordered from the

Arabidopsis thaliana GSLT EST libraries (http://www.genoscope.cns.fr).

3.2.2. Sequence analysis of the putative GalTs

Each of the cDNA clone sequences was extracted from the RIKEN Bioresource

Center (http://www.brc.riken.go.jp/lab/epd/catalog/cdnaclone.html) for GALT1, GALT3,

GALT4 and GalT5 with the accession numbers of pda09678, pda01627, pda11283, pda08691, respectively, and the Arabidopsis thaliana EST libraries

(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?CMD=search&DB=nucest) for GALT6

with the accession number of BX831498. The sequences from the cDNA libraries were

then compared to the predicted cDNA sequences from the TAIR database

(http://www.arabidopsis.org/) by the BLAST2 Sequences program

(http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE_TYPE=BlastSearch&PROG_DEF=blastn

&BLAST_PROG_DEF=megaBlast&BLAST_SPEC=blast2seq). Translations of the cDNAs were performed using the ExPASy Translate Tool

(http://web.expasy.org/translate/). ESTs for GALT4 were identified from a search of the following NCBI UniGene database

(http://www.ncbi.nlm.nih.gov/UniGene/clust.cgi?ORG=At&CID=16267).

3.2.3. Expression construct cloning of GALT1, GALT3, GALT4, GALT5 and GALT6 for expression in Pichia

The N-terminal 6xHis-tag and restriction sites were incorporated into the putative

GalT genes by PCR. The PCR product was first cloned into the pCR4-TOPO vector

104

(Invitrogen) and then subcloned into the Pichia expression vector pPICZA/B (Invitrogen).

Specifically, GALT1, GALT3 and GALT4 were subcloned from the pCR4-TOPO vector

into the pPICZA vector by a SacII and ApaI restriction digestion and ligation; GALT5 and

GALT6 were subcloned from the pCR4-TOPO vector into the pPICZB vector by a SacII

and XbaI restriction digestion and ligation. E.coli strains hosting the pCR4-GALTs were

selected by kanamycin resistance and E.coli strains hosting pPICZA/B-GALTs were

selected by Zeocin resistance. Both the pCR4-GALTs and the pPICZA/B-GALTs

constructs were first examined by diagnostic digestion or PCR and then sequenced to insure that correct in-frame sequences were obtained for the N-terminal 6xHis-tag and the

GALT coding regions. The primer sequences for GalT subcloning and sequencing are

listed in Appendix A Table 1 and Fig. 1. Sequencing results for the final expression

constructs were assembled into contigs (Appendix B Part I) using an open source

software, Cap3 (http://pbil.univ-lyon1.fr/cap3.php) as described by Huang and Madan

(1999).

3.2.4. Pichia transformation

The constructs of pPICZA-GALT1, pPICZA-GALT3, pPICZA-GALT4, pPICZB-

GALT5 and pPICZB-GALT6 were transformed individually into Pichia strain X-33 by

electroporation using the Gene Pulser II system (Bio-Rad). Preparation of competent X-

33 cells and the electroporation were performed according to the manufacturer’s

instruction for the EasySelect™Pichia Expression Kit (Invitrogen). The transformants

were selected on the yeast peptone dextrose sorbitol (YPDS) medium plates containing

100mg/L Zeocin antibiotic. Positive clones were inoculated into the liquid yeast peptone

105

dextrose (YPD) medium containing 100mg/L Zeocin. Genomic DNA (gDNA) from

transgenic Pichia was isolated and verified by PCR analysis as described in Chapter 3.3.1.

Protocols for making the YPDS and YPD medium are described in the EasySelect™

Pichia Expression Kit (Invitrogen). The protocol for gDNA isolation from Pichia was

provided by Dr. Tomohiko Sugiyama (Department of Biological Sciences, Ohio

University). Briefly, a 3 mL overnight Pichia culture was pelleted by repeating centrifugation (10,000 xg, 1 min) in 1.5 mL microcentrifuge tubes. The culture medium

was discarded and 200 µL Blue Buffer (10 mM Tris-HCl buffer pH 8.0, containing 2%

Triton X-100, 1% SDS, 100 mM NaCl, 1mM EDTA), 200 µL

phenol/chloroform/isoamyl alcohol solution (125:24:1,v/v, Fisher), and 0.3 g glass beads

( 425-600 µm, Sigma) were added to the cell pellet. The mixture was vortexed for 1 min

to break the Pichia cells and then centrifuged at 10,000 x g for 2 min. gDNA was

recovered from the supernatant by ethanol precipitation.

3.2.5. Expression construct cloning of GALT1, GALT3, GALT4, GALT5 and GALT6

for expression in tobacco BY2 suspension-cultured cells

The ‘CACC’ Kozak sequence followed by a 6xHis tag was incorporated into the N-

termini of the putative GalT genes by PCR. The PCR products were cloned into the pENTR/D-TOPO Gateway vector by Topo cloning (Invitrogen). GALTs were then transferred from the pENTR/D-TOPO vector digested with MluI to the pMDC30 or pMDC32 vectors by an LR Clonase (Invitrogen) reaction. The pMDC30 differs from the pMDC32 vector in that the former carries a heat shock promoter and the later carries a

2x35S promoter to drive the gene expression in plant cells. E.coli strains hosting the

106

pENTR/D-GALTs, pMDC30-GALTs and pMDC32-GALTs were selected by kanamycin

resistance. The pENTR/D-GALTs, pMDC30-GALTs and pMDC32-GALTs and the pPICZA/B-GALTs constructs were first examined by diagnostic digestion and PCR and then sequenced to make sure that correct in-fram sequences were obtained for the N- terminal 6xHis-tag and the GALT coding regions. The primer sequences for GalT subcloning and sequencing are listed in Appendix A Table 1 and Fig. 1. Sequencing results for the final expression constructs were assembled into contigs and shown in

Appendix B Part II and III.

3.2.6. Agrobacterium transformation

The constructs of the pMDC30-GalTs and the pMDC32-GalTs were transformed

into the Agrobacterium strain EHA105 by electroporation using the Gene Pulser II

system (Bio-Rad). The settings for electroporation with a 2 mm cuvette were: capacitance:

25 µF; voltage: 2.4 kV; resistance: 200 Ohm; pulse length: 5 msec. The transformed

Agrobacterium strains were selected using LB medium containing 50 mg/L kanamycin

and cultured at 28 °C. The transformed colonies were confirmed by direct PCR of the

Agrobacterium liquid cultures using GalT gene specific primers and the untransformed

Agrobacterium strain as the control.

3.2.7. Transformation of tobacco BY2 suspension-cultured cells

Tobacco (Nicotiana tabacum, BY2) suspension-cultured cells were maintained

under the same conditions as described in Chapter 2.2.1.

107

Overnight-grown Agrobacterium cultures were centrifuged at 3,000 xg for 5 min to

remove the LB medium and washed twice with the NT-1 medium. The Agrobacterium pellet was then resuspended in 200 µL NT-1 medium and co-cultivated with 4-days-old tobacco BY2 cells in a petri plate covered with aluminum foil at 25°C with slow agitation at 40 rpm. After 3 days, the BY2 cells were transferred to a 50 mL conical tube and the co-cultivation medium was removed with a pipette. The BY2 cells were washed by 3 repeating cycles of adding 50 mL NT-1 medium to the tube, inverting the tube to resuspend the cells, letting the cells settle and removing the medium with a pipette. The cells were washed for a fourth time with NT-1 medium containing 400 mg/L timentin before being plated onto a NT-1 plate containing 400 mg/L timentin and 20 mg/L hygromycin B to form a thin layer of cells. Calli grown after three weeks were subcultured onto new plates containing 400 mg/L timentin and 20 mg/L hygromycin B.

After three rounds of selection on the petri plates, the transgenic BY2 callus were transferred and maintained in liquid NT-1 medium containing 50 mg/L hygromicin B.

3.2.8. Isolation of microsomal membranes from Pichia

Pichia strains transformed with the PAOX:6xHis-GalT genes or the empty pPICZB

vector were induced for protein expression following the instructions in the EasySelect™

Pichia Expression Kit (Invitrogen). The Pichia culturing temperature was 28°C, and 0.5%

methanol was applied to the cell culture media for the induction of protein expression.

Pichia cells were harvested on the fifth day after induction by centrifugation at 3,000 xg

for 10 min and preserved at -80C until use. Microsomal membranes were isolated from

108

Pichia strains with a method described previously (Tan et al., 2006) with some modifications. Briefly, 2 mL Pichia cells were mixed with 2 mL glass beads (425-600 µm,

Sigma) and 4 mL extraction buffer (0.1 M HEPES-KOH, pH 7, 0.4 M Suc, 0.1% BSA, 1 mM dithiothreitol, 5 mM MgCl2, 5 mM MnCl2, 1 mM phenylmethylsulfonylfluoride, 2.4 mg Roche complete protease inhibitor and 66 µL Protease Inhibitor Cocktail IV from the

Research Product International Corp). The mixture was repeatedly vortexed for 30 s for 7 times with 30 s intervals on ice and then centrifuged at 3,000 x g for 15 min at 4°C. The supernatant was recovered and subjected to ultracentrifugation at 100,000 x g for 90 min.

The resultig pellet was resuspended in 300 µL extraction buffer and preserved at -80C until use.

3.2.9. Preparation of microsomal membranes from tobacco BY2 cell lines

The preparation of microsomal membranes from the transgenic tobacco BY2 cell lines followed the same protocol for the preparation of microsomal membranes from the wild type BY2 cell lines (Chapter 2.2.2) except for the following two modifications. First, the preparation was scaled down to allow for the processing of multiple tobacco cell lines.

Five to 12 g of tobacco cells were used in each preparation with the volume of extraction buffer reduced accordingly. Second, the microsomal membranes enriched at the 0.25/1.1

M sucrose interface and at the 1.1/1.8 M sucrose interface were both collected and subjected to the final ultracentrifugation step. The pellets obtained were resuspended separately and named as the upper membrane layer and the lower membrane layer as discussed in Chapter 3.3.4. The Phsp:6xHis-GalT transgenic BY2 lines were subjected to a

109 heat shock treatment before the isolation of microsomal membranes. For the heat shock treatment, 75 mL culture media of the transgenic BY2 line or control wild type line were transferred to a 1 L flask to form a thin layer of cells. The 1 L flask was incubated in a water bath at 40°C with agitation at 125 rpm for 2 h. These conditions were chosen based on conditions used in a previous study on heat shock induced proteins in the tobacco Wisconsin 38 cell line (Barnett et al., 1979) and a study which utilized the same heat shock promoter for the induction of protein expression in tobacco leaves (Schoffl et al., 1989).

3.2.10. Western blotting analysis

Microsomal membrane samples were quantified by the Bradford method using a

Bio-Rad Protein Assay system before being subjected to western blotting analysis. Pichia microsomal membranes containing 30 µg protein and BY2 microsomal membranes containing either 5 µg or 2 µg protein were treated at 100°C for 10 min in the Laemmli

Sample Buffer (Bio-Rad) supplemented with 5% β-mercaptoethanol and run on a 10%

SDS-PAGE gel before being electro-blotted to a PVDF membrane. Ten microliters of the ten times diluted Precision Plus Protein Kaleidoscope standard (Bio-Rad) was used.

Electrophoresis and transfer of proteins were carried out using the Mini-PROTEAN 3

Electrophoresis Cell and Mini Trans-Blot Electrophoretic Transfer Cell (Bio-Rad). For immunoblotting, the membrane was treated with the blocking buffer (5% non-fat dry milk in PBS) overnight at 4 °C and then incubated with the 6xHis Monoclonal Antibody

(Clontech) for 1 h at room temperature. Membranes were then washed in the washing

110

buffer (5% non-fat dry milk in PBS containing 0.1% Tween 20) for 45 min at room

temperature before incubation with Goat anti Mouse IgG (H&L) horseradish peroxidase-

conjugated secondary antibodies (Genscript Corp) for 1 h at room temperature. The primary and secondary antibodies were diluted 10,000 and 20,000 times respectively in the washing buffer.

3.2.11. AGP GalT activity test

The standard assay for GalT activity (Chapter 2.2.3) was used to detect AGP GalT activities in the transgenic Pichia or tobacco cell lines with the following modifications.

Microsomal membranes containing 250 µg proteins from transgenic Pichia or tobacco cell lines were permeabilized and used as the enzyme source. [AO]7 alone, d[AO]51 alone, or [AO]7 and d[AO]51 together was used as the substrate acceptor in the GalT assay.

Based on the experimental result that [AO]7:GalT activity approached saturation when 2

µg of [AO]7 was included in the reaction mix (Chapter 2.3.4), 2 µg instead of 50 µg of

each substrate acceptor was used in the AGP GalT assay.

3.2.12. Transient expression of GALT3-YELLOW FLUORESCENT PROTEIN (YFP)

in tobacco leaves

To construct the pVKH18En6-GALT3 vector, a single nucleotide mutation from T to

C at nucleotide position 1446 was introduced to the GALT3 coding region to eliminate an

XbaI recognition site without affecting the amino acid codons. The mutagenesis was

performed using the QuikChange II-E Site-Directed Mutagenesis Kit (Agilent

Technology). The introduction of an XbaI restriction site to the 5’ end, a SalI restriction

111

site to the 3’ end of GALT3 coding region, and the removal of the stop codon of GALT3

were achieved through one PCR. The resulting PCR product was gel purified and then

cloned into the pVKH18En6 vector by a cut and paste method using the XbaI and the SalI

restriction sites. The construct of pVKH18En6-GALT3 was verified by diagnostic

digestion and PCR analysis and sequenced to insure an accurate sequence for the GALT3

coding region and that the YELLOW FLUORESCENT PROTEIN (YFP) gene at the C terminus of the GALT3 gene was in frame. Sequencing results for the final expression construct were assembled one contig and shown in Appendix B Part IV. The primer sequences for GALT subcloning and sequencing are listed in Appendix A Table 1 and Fig.

1. The pVKH18En6-GALT3 construct was then transformed into Agrobacterium strain

GV3101 as described in 3.2.6. Transformed Agrobacterium was selected in LB medium

containing 50 mg/L kanamycin, 50 mg/L and 25 mg/L gentamicin and verified

by PCR analysis using the GALT3 gene specific primers. The Agrobacterium strain with the expression construct for the fusion protein of α-2,6-SIALYLTRANFERASE-GREEN

FLUORESCENT PROTEIN (ST-GFP) was provided by Dr. Michael Held (Department of

Chemistry and Biochemistry, Ohio University). The Agrobacterium strains hosting the

GALT3-YFP and ST-GFP constructs were co-infiltrated into tobacco leaves as described

previously (Sparkes et al., 2006). After two days of infiltration, the epidermal cell layers

were observed for fluorescence using a Zeiss LSM 510 laser-scanning confocal

microscope. For co-localization of GFP and YFP in epidermal cells, a 488- and 514-nm

argon ion laser was used. Fluorescence signals were separated using a dichromic beam

splitter NFT; the emission signal was 458 nm for GFP and 514 nm for YFP. ZEN 2007

112

software was used to operate the system. Images of tobacco cells expressing GALT3-YFP or ST-GFP alone were taken to serve as a control to indicate that no cross-talk occurred between the two fluorochromes before image acquisition.

3.3. Results

3.3.1. Comparison of the GALT sequences of the corresponding cDNA clones and the

GALT sequences from the Arabidopsis Information Resource (TAIR) database

cDNA sequences of the GALT1, GALT3, GALT4 and GALT5 genes were retrieved from the website of the RIKEN Bioresource Center

(http://www.brc.riken.go.jp/lab/epd/catalog/cdnaclone.html), which hosts the cDNA

library, and compared to the cDNA sequences of GALT genes predicted from the

Arabidopsis genome sequence retrieved from the TAIR database. The sequences for

GALT3 and GALT5 in the cDNA clones are identical to the corresponding cDNA sequences from TAIR. For GALT1, a single nucleotide variation at position 1206 (relative

to the first nucleotide of the start codon designated as position 1) is found between the

sequences from the cDNA library database and from the TAIR database without affecting

the amino acid sequences encoded. A single nucleotide variation at nucleotide position

1379 for the GALT4 gene in the cDNA database compared to the sequence from TAIR results in a change in the amino acid sequence at position 460 (relative to the first amino acid designated as position 1) from Val to Ala. Amino acid position 460 is located in the region between the Galectin domain and the GalT domain of the GalT4 protein (Table

3.1). The currently available overexpression sequence tags (ESTs) are not available to

113

cover nucleotide position 1379 in GALT4, making it difficult to determine the authentic

gene sequence at this specific position. The GALT6 cDNA clone was obtained from the

Arabidopsis thaliana EST libraries

(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?CMD=search&DB=nucest); however,

the nucleotide sequence retrieved from this website with the accession number of

BX831498 is apparently wrong as it contains multiple stop codons in the translated sequence. Based on the above sequence analysis, cDNA sequences from RIKEN

Bioresource Center were utilized as the reference sequences for the GALT1, GALT3,

GALT4 and GALT5 genes, while the predicted cDNA sequence from the TAIR website

was used as the reference sequence for the GALT6 gene for sequence analysis during

expression construct cloning.

3.3.2. Expression construct cloning and yeast/tobacco BY2 cell transformation

All of the five putative GALT genes, GALT1, GALT3, GALT4, GALT5 and GALT6,

with a 5’ sequence encoding an N terminal 6xHis tag, were subcloned into the pPICZA/B

vectors for their expression in Pichia under the control of the PAOX promoter. Also, all of

the five GALT genes were subcloned into the pMDC30 and pMDC32 vectors for their

expression in tobacco BY2 cells under the control of the Phsp promoter and the P2x35S promote, respectively (Table 3.2).

114

Table 3.2 Expression constructs and transgenic Pichia or tobacco BY2 cell lines obtained in this research.

Purpose of Expression System Promoter Constructs Number of Expression Completed Independent Transformation Events

GalT activity test pPICZA/B vector Alcohol PAOX:6xHis-GALT1 Eight events for each

for Pichia oxidase PAOX:6xHis-GALT3 gene construct

transformation and promoter PAOX:6xHis-GALT4

gene expression PAOX:6xHis-GALT5

PAOX:6xHis-GALT6

GalT activity test pMDC30 vector for Heat shock Phsp:6xHis-GALT1 One event for each of

transformation and protein Phsp:6xHis-GALT3 the GALT1, GALT3

gene expression in promoter Phsp:6xHis-GALT4 and GALT6 gene

tobacco BY2 Phsp:6xHis-GALT5 constructs

suspension-cultured Phsp:6xHis-GALT6 cells

GalT activity test pMDC32 vector for 2 x 35S P2x35S:6xHis-GALT1 Five events for the

transformation and P2x35S:6xHis-GALT3 GALT5gene construct

gene expression in P2x35S:6xHis-GALT4 and two events for the

tobacco BY2 P2x35S:6xHis-GALT5 GALT6 gene construct

suspension-cultured P2x35S:6xHis-GALT6 cells

a Subcellular pVKH18En6 vector 35S P35S:GALT3-YFP N.A. localization of for transformation GalT proteins and gene expression in tobacco leaf epidermis

aYFP, yellow fluorescent protein. N.A., not applicable.

115

The expression constructs of pPICZA-6xHis-GALT1/GALT3/GALT4 and pPICZB-

6xHis-GALT5/GALT6 were sequenced for full length coding regions of the 6xHis-GALTs and partial sequences of the promoter and terminator regions (Appendix B Part I) before transformation into Pichia competent cells. After transformation, the PAOX:6xHis-GALT genes integrate into the Pichia genome through homologous recombination at the locus close to the ALCOHOL OXIDASE 1 (AOX1) gene (Tschopp et al., 1987) (Fig. 3.2). Eight

Pichia colonies representing eight independent transformation events for each GALT gene construct were selected by genotyping of the transgenic Pichia lines by PCR. The

PCR analysis of one transformation event for each PAOX:6xHis-GALT transformation is shown in Fig. 3.2.

116

Figure 3.2 PCR analysis of Pichia transformed with pPICZA-6xHis-GALT1, pPICZA-6xHis-GALT3, pPICZA-6xHis-GALT4, pPICZB-6xHis-GALT5, pPICZB- 6xHis-GALT6. A. Schemetic diagram of the genome sequence of the control Pichia strain tranformed with the empty pPICZA/B vector. The Zeosinr gene is integrated into the control Pichia genome. B. Schemetic diagram of the genome sequence of Pichia strains transformed with the pPICZA/B-6xHis-GALT or pPICZA-XT1 (XT1 stands for xyloglucan XylT1) r constructs. The Zeosin as well as the PAOX:6xHis-GALT or the PAOX:6xHis-XT1 genes are integrated into the Pichia genome for protein expression. C. Genotyping of the Pichia strains transformed with the PAOX:6xHis-GALT genes, the PAOX:6xHis-XT1 gene or the empty pPICZB vector by PCR using the AOX primer pair, the positions of which are shown in A with arrows. In addition to a common band for the AOX1 gene of approximately 2200 bp, each lane shows a gene specific band with a size of the coding region of each gene plus 325 bp from the pPICZA vector or 323 bp from the pPICZB vector, i.e. GalT1: 2257 bp; GalT4: 2347 bp; GalT5: 2342 bp; GalT6: 2369 bp; XT: 1706bp; pPICZB, no additional band amplified. The sizes of the GalT3 specific band (2185 bp) and the AOX1 gene band (2200 bp) are too close to be discriminated. M, DNA size maker.

The expression constructs of the pMDC30-6xHis-GALTs were verified by diagnostic

digestion analysis (Fig. 3.3). The verified constructs were sequenced for the full length coding regions of the 6xHis-GALTs and partial sequences of the promoter and terminator

117

regions (Appendix B Part II) before transformation into Agrobacterium. The

Agrobacterium strain carrying each of the expression constructs was selected by PCR

(Fig. 3.4) and used for the transformation of tobacco BY2 cells. One transformation event

in BY2 cells was obtained for the Phsp:6xHis-GALT1, Phsp:6xHis-GALT3 and Phsp:6xHis-

GALT6 genes, respectively. Genotyping of the transgenic BY2 lines was performed by

PCR with wild type BY2 as a control, the result of which is shown in Fig. 3.5.

Figure 3.3 Verification of the constructs of pMDC30-6xHis-GALT by diagnostic digestion with the AscI and SacI restriction enzymes. The estimated product sizes of each construct after AscI/SacI double digestion are as follows: GalT1: 9977 bp, 2020 bp; GalT3: 9985 bp, 1515 bp, 400 bp; GalT4: 9977 bp, 2112 bp; GalT5: 9977 bp, 1606 bp, 500 bp; GalT6: 9977 bp, 1019 bp, 802 bp, 300 bp. M, DNA size maker.

Figure 3.4 PCR analysis of the Agrobacterium strains transformed with the pMDC30-6xHis-GALT constructs. Gene specific primers were used to amplify the coding region of each GALT gene from the Agrobacterium transformants with the pMDC30-6xHis-GALT constructs (T) or the

118

empty pMDC30 vector (NC) by PCR. The primer sequences and locations in the cDNA coding region are shown in Appendix A Table 1 and Fig. 1. M, DNA size maker.

Figure 3.5 PCR analysis of tobacco BY2 cell lines transformed with the hsp:6xHis- GALT genes. Gene specific primers were used to amplify the coding region of each GALT gene from genomic DNA isolated from the BY2 tranformants with the Phsp:6xHis-GALT genes (T) or from a wild type BY2 cell line (NC) by PCR. The primer sequences and locations in the cDNA coding region are shown in Appendix A Table 1 and Fig. 1. M, DNA size maker.

The expression constructs of pMDC32-6xHis-GALTs were verified by diagnostic

digestion and PCR analysis (Fig. 3.6). The verified constructs were sequenced for the

full-length cDNA coding region of the 6xHis-GALTs and partial sequences of the

promoter and terminator regions (Appendix B Part III) before transformation into

Agrobacterium. The Agrobacterium strain carrying each expression construct was

selected by PCR (Fig. 3.7) and used for the transformation of tobacco BY2 cells. Five and two transformation events in BY2 cell lines were obtained for the P2x35S:6xHis-

GALT5 gene and the P2x35S:6xHis-GALT6 gene, respectively. Genotyping of the

transgenic BY2 cell lines was performed by PCR with wild type BY2 cells as a control,

the result of which is shown in Fig. 3.8.

119

Figure 3.6 Verification of the constructs of pMDC32-6xHis-GALTs by diagnostic digestion and PCR. A. Diagnostic digestion of the pMDC32-GALT constructs and the empty pMDC32 vector with the AscI and SacI restriction enzymes. The estimated product sizes of each construct after AscI/SacI double digestion are as follows: GalT1: 9985 bp, 2020 bp, 78 bp; GalT3: 9985 bp, 1507 bp, 441 bp, 78 bp; GalT4: 9977 bp, 2112 bp, 86 bp; GalT5: 9985 bp, 1598 bp, 509 bp, 78 bp; GalT6: 9985 bp, 1019 bp, 810 bp, 305 bp, 78 bp; pMDC32: 9985bp, 1767 bp. B. Gene specific primers were used to amplify the coding region of each GALT gene from the corresponding pMDC32-6xHis-GALT constructs (T) and from the empty pMDC32 vector (NC) by PCR. The primer sequences and locations in the cDNA coding region are shown in Appendix A Table 1 and Fig. 1. M, DNA size maker.

120

Figure 3.7 PCR analysis of the Agrobacterium strains transformed with the pMDC32-6xHis-GALT constructs. Gene specific primers were used to amplify the coding region of each GALT gene from the Agrobacterium strains transformed with the corresponding pMDC32-6xHis-GALT constructs (T) or with the empty pMDC32 vector (NC) by PCR. The primer sequences and locations in the cDNA coding region are shown in Appendix A Table 1 and Fig. 1. M, DNA size maker.

Figure 3.8 PCR analysis of the tobacco BY2 cell lines transformed with the P2x35S:6xHis-GALT genes. Gene specific primers were used to amplify the coding region of GALT5 or GALT6 from the P2x35S:6xHis-GALT5 transformants number 1 to number 5 (5-1 to 5-5), the P2x35S:6xHis-GALT6 transformants number 1 and number 2 (6-1 and 6-2) and the wild type BY2 cell line by PCR. The primer sequences and locations in the cDNA coding region are shown in Appendix A Table 1 and Fig. 1. M, DNA size maker.

3.3.3. Protein expression analyses and AGP GalT activity tests of the microsomal

membranes isolated from Pichia strains transformed with the PAOX:6xHis-GALT genes

Microsomal membranes were isolated from the transgenic Pichia strains carrying the PAOX:6xHis-GALT genes or the control Pichia strain transformed with the empty

pPICZB vector and analyzed by western blotting analysis using an anti-6xHis antibody

121

(Fig. 3.9; Table 3.3). Double bands at approximately 75 kD, close to the predicted sizes

of 6xHis-GalT proteins were detected in the microsomal membrane proteins of all the

PAOX:6xHis-GALT transgenic Pichia strains analyzed so far, including the PAOX:6xHis-

GALT3 transformants number 1, number 3, and number 4, the PAOX:6xHis-GALT4

transformants number 1 to number 5, the PAOX:6xHis-GALT5 transformants number 1 and

number 2 and the PAOX:6xHis-GALT6 transformants number 1 to number 5. The bands

close to 75 kD in the GALT transgenic strains likely indicated the expression of the

6xHis-GalT fusion proteins because in the control Pichia strain, bands at 75 kD, if any,

are significantly weaker compared to the GalT transgenic strains. Given that the protein

sequences of GalT3, GalT4, GalT5 and GalT6 all contain two N-glycosylation sites (data

not shown), the 6xHis-GalT fusion proteins may undergo N-glycosylation and were thus

present as double bands or smears. Double bands with the size of approximately 50 kD

were present in most of the GALT transgenic strains as well as the control strain with a

lower signal intensity. The 50 kD bands may due to partial degradation of the 6xHis-

GalT fusion proteins. Although less likely, it cannot be excluded that the 75 kD and 50

kD bands represent non-specific binding of the anti-6xHis antibody to endogenous Pichia

proteins whose expression was induced in the GalT transgenic lines. Western blotting

analysis with GalT specific antibodies will allow more accurate detection of the

expression of the GalT proteins.

122

Figure 3.9 Western blotting analysis of microsomal proteins isolated from Pichia transformants. Pichia strains analyzed include the PAOX:6xHis-GALT3 transformants number 1, 3 and 4 (3-1, 3-3, 3-4), the PAOX:6xHis-GALT4 transformants number 1 to 5 (4-1 to 4-5 ), the PAOX:6xHis-GALT5 transformants number 1 and 2 (5-1 and 5-2), the PAOX:6xHis-GALT6 transformants number 1 to 5 (6-1 to 6-5) and the control Pichia strain transformed with the empty pPICZB vector. The primary anti-6xHis antibody and the secondary goat anti mouse antibody were used at 1:10,000 and 1:20,000 dilutions, respectively.

Table 3.3 Summary of the signal bands observed in microsomal membrane proteins from transgenic and control Pichia and tobacco BY2 cells in western blotting analysis

Fusion Observed Sizes of Protein Observed Sizes of Observed Sizes of Observed Sizes of Protein Observed Sizes of Proteins Bands in Microsomal Protein Bands in Protein Bands in Bands in Upper Layer Protein Bands in

(Predicted Membranes from PAOX:6xHis- Upper Layer Lower Layer Membrane Proteins from Lower Layer

Sizes) GALT Transformed Pichia Membrane Proteins Membrane Proteins P2x35S:6xHis-GALT Membrane Proteins

from Phsp:6xHis- from Phsp:6xHis- Transformed BY2 cells from P2x35S:6xHis- GALT Transformed GALT Transformed GALT Transformed BY2 cells BY2 cells BY2 cells 6xHis- n.d. 1-1 showed bands at 1-1 showed bands at n.d. n.d. GalT1 approximately 75 kD, approximately 50 kD, (72 kD) 60 kD, 50 kD and 40 kD and below 30 kD below 30 kD

6xHis- 3-1, 3-2 and 3-3 showed two 3-1 showed signals for 3-1 showed bands at n.d. n.d. GalT3 bands at 70 to 80 kD and two almost the whole approximately 50 kD (71 kD) bands at 50 to 60 kD sample lane including and below 30 kD bands at 75 kD

123

124

Table 3.3 (continued) 6xHis- 4-1, 4-2, 4-3, 4-4 and 4-5 showed n.d. n.d. n.d. n.d. GalT4 two bands at 75 to 85 kD and (78 kD) two bands at 45 to 55 kD. The upper bands at approximately 85 kD is more intense in 4-2, 4-3, and 4-4 lines compared to 4-1. 4- 5 showed bands at approximately 125 kD.

6xHis- 5-1 and 5-2 showed two bands at n.d. n.d. 5-1, 5-2, 5-3, 5-4 showed 5-1, 5-2, 5-3, 5-4 GalT5 75 to 85 kD and one band at faint bands at approximately showed bands at 50 kD (78 kD) approximately 50 kD 50 kD and strong signals and below 30 kD below 30 kD. 5-1 and 5-2 showed a smear at 75 to 120 kD. 5-3 and 5-4 showed a weaker smear at 75 to 85 kD

124

125

Table 3.3 (continued) 6xHis- 6-1 and 6-4 showed two bands at 6-1 showed signals for 6-1 showed bands at 6-1 showed a smear of 6-1, 6-2 showed bands GalT6 70 to 80 kD, one band at almost the whole approximately 50 kD, signals from 50 kD to above at 50 kD and below 30 (79 kD) approximately 60 kD and two sample lane including 40 kD and below 30 kD 250 kD and multiple bands kD bands at approximately 45 to 55 bands at 75 kD below 50 kD. 6-2 showed a kD. 6-2, 6-3 and 6-5 showed two faint smear at 75 kD, a band bands at 75 to 85 kD and two at 50 kD and below 30 kD. bands at 45 to 55 kD

Negative Pichia transformed with empty WT showed signals for WT showed bands at WT showed a smear from 70 n.d. Controls pPICZB vector showed faint almost the whole approximately 150 kD, to 110 kD and bands below band at approximately 70 kD and sample lane including 50 kD and below 30 kD 30 kD. two faint bands at 45 to 50 kD bands at 75 kD n.d. not determined. 125

126

Microsomal membranes from transgenic Pichia strains were analyzed for AGP GalT activities with the in vitro AGP GalT assay system developed using Arabidopsis and tobacco microsomal membranes (Chapter 2.3). Based on the structural analysis of the

Arabidopsis AGP GalT products, the assay using [AO]7 as the substrate acceptor mainly

detects the GalT activity ([AO]7:GalT activity) that transfers the first Gal to Hyp residues

in the peptide backbone; while the assay using d[AO]51 as the substrate acceptor mainly

detects the GalT activity (d[AO]51:GalT activity) that adds a second Gal to the first Gal

already attached to the peptide backbone (Chapter 2.4). The incorporation of [14C]Gal from UDP-[14C]Gal into the substrate acceptors is indicated by the coelution of the [14C] radioactive peak and the elution peak of the [AO]7 or the d[AO]51 substrate acceptor upon

HPLC fractionation of the assay product. Unfortunately, none of the transgenic Pichia

strains analyzed displayed [AO]7:GalT or d[AO]51:GalT activity in their microsomal membranes given that no radioactive peak coeluting with the substrate acceptors upon

HPLC fractionation of the reaction products was observed. A representative HPLC elution profile of the Pichia [AO]7:GalT assay product is shown in Fig. 3.10.B. The

[AO]7:GalT and d[AO]51:GalT assay results for individual transgenic Pichia strains are summarized in Table 3.4.

127

Figure 3.10 RP-HPLC fractionation of the [AO]7 substrate acceptor and the [AO]7:GalT reaction products on a PRP-1 reverse-phase column. A. Elution profile of the [AO]7 substrate acceptor. B. Representative elution profile of the [AO]7:GalT reaction product using microsomal membranes from the Pichia strains tranformed with the PAOX:6xHis-GALT genes. C. Representative elution profile of the [AO]7:GalT reaction product using microsomal membranes from the tobacco BY2 lines transformed with the Phsp: 6xHis-GALT genes. The presence or absence of the Radioactive Peak II coeluting with the [AO]7 acceptor substrate indicates the presence or absence of the [AO]7:GalT activities in the microsomal membranes. The [AO]7:GalT activity is quantified by combining the radioactive counts present in Radioactive Peak II.

128

Table 3.4 Results of the tests for AGP GalT activities in microsomal membranes from transgenic strains using the in vitro [AO]7:GalT and d[AO]51:GalT assay systems.

Pichia Strain [AO]7:GalT Activity d[AO]51:GalT Activity Non GALT transformant Noneb Noneb

PAOX:6xHis-GALT1-1 n.d. n.d.

PAOX:6xHis-GALT1-2 n.d. n.d.

PAOX:6xHis-GALT1-3 n.d. n.d.

PAOX:6xHis-GALT1-4 n.d. n.d.

PAOX:6xHis-GALT1-5 n.d. n.d. a PAOX:6xHis-GALT3-1 None n.d.

PAOX:6xHis-GALT3-2 n.d. n.d. a PAOX:6xHis-GALT3-3 None n.d.

PAOX:6xHis-GALT3-4 n.d. n.d.

PAOX:6xHis-GALT3-5 n.d. n.d. a a PAOX:6xHis-GALT4-1 None None a a PAOX:6xHis-GALT4-2 None None

P :6xHis-GALT4-3 Noneb n.d. AOX b PAOX:6xHis-GALT4-4 None n.d. b PAOX:6xHis-GALT4-5 None n.d.

P :6xHis-GALT5-1 Nonea n.d. AOX

PAOX:6xHis-GALT5-2 n.d. n.d. a a PAOX:6xHis-GALT5-3 None None a a PAOX:6xHis-GALT5-4 None None

a a PAOX:6xHis-GALT5-5 None None

PAOX:6xHis-GALT6-1 n.d. n.d. a PAOX:6xHis-GALT6-2 None n.d.

P :6xHis-GALT6-3 Nonea AOX n.d.

PAOX:6xHis-GALT6-4 n.d. n.d. a PAOX:6xHis-GALT6-5 None n.d. aTests with one biological repeat. bTests with two biological repeats. None, No detected [AO]7:GalT or d[AO]51:GalT activities, as indicated by no 14 14 incorporation of the [ C] radiolabel from UDP-[ C]Gal into the [AO]7 or the d[AO]51

129

substrate acceptor on the HPLC fractionation of the GalT assay product. n.d., not determined.

3.3.4. Protein expression analyses and AGP GalT activity tests of the microsomal

membranes isolated from tobacco BY2 cell lines transformed with the Phsp:6xHis-

GALT genes

The tobacco BY2 cell lines transformed with the Phsp:6xHis-GALT genes were

subjected to a heat shock treatment before being analyzed for the 6xHis-GalT expression

and the AGP GalT activities. Wild type BY2 cells were treated in the same manner as the control. Total microsomal membranes were isolated from the treated tobacco BY2 cell

lines and fractionated on a discontinuous sucrose density gradient as described in Chapter

2.2.2. Microsomal membranes located at the interface of the 0.25M and the 1.1M sucrose

solutions were collected and named as the upper membrane layer while microsomal

membranes located at the interface of the 1.1M and the 1.8M sucrose solution were collected and named as the lower membrane layer (Fig. 3.11.A). Western blotting analysis of the proteins in the upper membrane layer and the lower membrane layer using an anti 6xHis-tag antibody showed similar signal patterns for wild type BY2 cells and the

Phsp:6xHis-GALT transformants (Fig. 3.11.B; Table 3.3). Specifically, protein bands with

sizes less than 50 kD were detected in the lower membrane layer from both the wild type

and the Phsp:6xHis-GALT transgenic BY2 lines. Multiple antibody reactive bands of

different sizes, including 75 kD bands, were observed for the upper layer membrane

proteins from both the wild type and the Phsp:6xHis-GALT transgenic BY2 lines.

130

Figure 3.11 Western blotting analysis of microsomal proteins isolated from the tobacco BY2 lines transformed with the Phsp:6xHis-GALT genes and the wild type BY2 lines. The transgenic and wild type BY2 cells were treated with a heat shock treatment at 40°C for 2 h before being used for microsome preparation. A. An image showing the separation of the upper membrane layer and the lower membrane layer of microsomal membranes by a discontinuous sucrose gradient. B. Western blotting analysis of the proteins in the two membrane layers as defined in A. U, upper membrane layer. L, lower membrane layer. The primary anti-6xHis antibody and the secondary goat anti mouse antibody were used at 1:10,000 and 1:20,000 dilutions, respectively.

131

The upper membrane layer from the Phsp:6xHis-GALT transgenic BY2 lines, were

analyzed for AGP GalT activities using [AO]7 as the substrate acceptor. The [AO]7:GalT

assay products were fractionated by HPLC. A representative HPLC elution profile of the

[AO]7:GalT assay product with upper layer membranes from transgenic BY2 cell lines is

shown in Fig. 3.10.C. The total amount of radioactivity in the radioactive peak II

coeluting with the substrate acceptor on HPLC was used as the measure for [AO]7:GalT activity. The BY2 cell lines transformed with the Phsp:6xHis-GALT3 and Phsp:6xHis-

GALT6 genes showed higher [AO]7:GalT activities compared to the wild type BY2 lines after heat shock treatment (Table 3.5). In addition, higher [AO]7:GalT activities in the

upper membrane layer was detected when the Phsp:6xHis-GALT3 transgenic cell line was

subjected to heat shock treatment when compared to the same cell line cultured in normal

conditions.

132

Table 3.5 AGP GalT activities of the upper membrane layer from wild type BY2 cells and BY2 lines transformed with the Phsp:6xHis-GALT or the P2x35S:6xHis-GALT genes.

BY2 lines Heat Substrate Incorporation of the [14C] shock acceptorsa radiolabel into substrate treatment acceptorsb (cpm)

Phsp:6xHis-GALT1-1 Yes [AO]7 503 + Phsp:6xHis-GALT3-1 Yes [AO]7 883 241 + Phsp:6xHis-GALT3-1 No [AO]7 614 42

Phsp:6xHis-GALT6-1 Yes [AO]7 830

WT BY2 Yes [AO]7 454

P2x35S:6xHis-GALT5-1 No [AO]7+d[AO]51 916

P2x35S:6xHis-GALT5-2 No [AO]7+d[AO]51 1473

P2x35S:6xHis-GALT5-3 No [AO]7+d[AO]51 n.d.

P2x35S:6xHis-GALT5-4 No [AO]7+d[AO]51 546 + P2x35S:6xHis-GALT6-1 No [AO]7+d[AO]51 1478 78

P2x35S:6xHis-GALT6-2 No [AO]7+d[AO]51 1529 + WT BY2 No [AO]7+d[AO]51 1350 272

a [AO]7 alone or [AO]7 together with d[AO]51 was used as substrate acceptors in the in vitro GalT assays. Incorporation of the [14C] radiolabel into substrate acceptors was measured by the amount of total radioactivity coeluting with the substrate acceptors on HPLC fractionation of the assay products. bMost of the test results have one biological repeat. The test results with two biological repeats were calculated for standard deviations shown as the number after the + symbol. n.d., not determined.

3.3.5. Protein expression analyses and AGP GalT activity tests of the microsomal

membranes isolated from tobacco BY2 cell lines transformed with the P2x35S:6xHis-

GALTs genes

In contrast to the Phsp:6xHis-GALTs genes whose expression requires the induction

by a heat shock treatment, the expression of the P2x35S:6xHis-GALTs genes is constitutive

133

in theory. Microsomal membranes were isolated from the BY2 cell lines transformed

with the P2x35S:6xHis-GALTs genes and fractionated into the upper and lower membrane

layers as described in Chapter 3.3.4. Western blotting analysis of the isolated proteins

with an anti-6xHis antibody was performed. Microsomal membranes containing 2 µg

proteins from the upper or lower membrane layer from the P2x35S:6xHis-GALTs transgenic

BY2 cells were loaded for each sample instead of 5 µg proteins as loaded for the analysis for the Phsp:6xHis-GALTs transgenic BY2 cells. Because of the lower sample loading,

signals for non-specific antibody bindings were reduced in the western blotting analysis

for the P2x35S:6xHis-GALTs transformants (Fig. 3.12; Table 3.3) compared to previous

analysis for the Phsp:6xHis-GALTs transformants (Fig. 3.11). For the lower layer membrane protein samples, only one band at around 50 kD was observed for all the samples. For the upper layer membrane protein samples, protein bands at the size of 75 kD or above 75 kD were present in both P2x35S:6xHis-GALTs transformants and wild type

samples. Proteins signals at around 75 kD or above 75 kD in the upper layer were stronger in the P2x35S:6xHis-GALT5-1, GALT5-2, GALT6-1 lines, but weaker in the

P2x35S:6xHis-GALT5-3, GALT5-4, GALT6-2 lines compared to wild type.

134

Figure 3.12 Western blotting analysis of proteins from microsomal membranes isolated from tobacco BY2 lines transformed with the P2x35S:6xHis-GALT genes and wild type BY2 lines. The upper layer membrane proteins (U) and the lower layer membrane proteins (L) are defined as in Fig. 3.11.A. The primary anti-6xHis antibody and the secondary goat anti- mouse antibody were used at 1:10,000 and 1:20,000 dilutions, respectively.

AGP GalT activities were measured in the upper layer membranes from the

P2x35S:6xHis-GALT5 and the P2x35S:6xHis-GALT6 transformants and the wild type BY2 cells (Table 3.5). Both the [AO]7 and the d[AO]51 substrate acceptors were included in the

135 same GalT activity assay. The assay product was fractionated by HPLC and the radioactive peak coeluting with the substrate acceptors was combined and tested for radioactivity. In the assays completed, P2x35S:6xHis-GALT5-2, GALT6-1 and GALT6-2 showed slightly higher [14C] radiolabel incorporation into the peptide substrate acceptors compared to wild type, while P2x35S:6xHis-GALT5-1, GALT5-4 had lower [AO]7

/d[AO]51:GalT activity compared to wild type (Fig. 3.10.C). Of note, the higher or lower

[AO]7 /d[AO]51:GalT activities do not correlate to the higher or lower signal intensities at around 75 kD in the western blotting analysis for upper membrane layer from the

P2x35S:6xHis-GALT transformants and the wild type line.

3.3.6. Subcellular localization of GalT3 in tobacco leaf epidermal cells

To study the subcellular localization of GalT3, GalT3-yellow fluorescent protein

(GalT3-YFP) fusion protein was constructed and expressed using a transient expression system in tobacco leaf epidermal cells (Sparkes et al., 2006). The expression of GalT3-

YFP alone showed punctate structures typical of a Golgi-localized staining pattern (Fig.

3.13). In addition, the staining pattern of GalT3-YFP was distinct from the expression of an ER marker HDEL sequence fused with green fluorescent protein (HDEL-GFP) but overlapped with the expression pattern of a Golgi marker protein, α-2,6-sialyltransferase

(Saint-Jore et al., 2002), fused with GFP (ST-GFP). Thus, the subcellular localization experiments indicate that GalT3 is Golgi-localized.

136

Figure 3.13 Subcellular localization of GalT3 in tobacco leaf epidermal cells. The expression of GalT3-YFP is shown as red dots and the expression of transferase (ST, a Golgi marker)-GFP and HDEL (a ER Marker)-GFP fusion proteins is shown as green dots. The arrows indicate artificial signals.

137

3.4. Discussion

For the purpose of protein expression for functional characterization, many

heterologous expression systems or cell free expression systems are currently available.

However, these expression systems all have advantages and disadvantages. There is no

guarantee which system will be more preferable than another for the expression of a

specific protein in its active form without experimental trials (Farrokhi et al., 2009). For

this reason, two different heterologous expression systems were tested for expression of

the five putative Arabidopsis GALTs, GALT1, GALT3, GALT4, GALT5 and GALT6.

Foreign protein expression in a methanotrophic Pichia strain is driven by an alcohol

oxidase promoter, PAOX. The commercially available Pichia strains have the advantages of high transformation efficiency and a short turnaround time for protein expression.

Notably, Pichia has the basic cellular machinery for posttranslational modifications that are usually required for activity of an expressed protein of eukaryotic origin. Furthermore,

Pichia strains are likely to have low background activity for expression of plant specific enzymes. Nonetheless, Pichia strains may lack some of the protein modification machineries or cofactors that are required for maintaining the activity of some plant proteins. The plant expression systems of tobacco BY2 suspension-cultured cells were utilized here to compensate for the deficiencies of the Pichia expression system. A major

concern for the expression and activity test of the Arabidopsis GALT genes in tobacco

cell cultures is the interference from endogenous GalT activities in tobacco, which were

detected as discussed in Chapter 2. Thus, two types of promoters were used for the

expression of the Arabidopsis GALT genes in tobacco. While the tandem 35S promoter

138

(P2x35S) is supposed to drive strong, constitutive expression of the gene downstream; the

heat shock promoter (Phsp) promotes gene expression specifically under heat shock

conditions.

Constructs for the expression of the five GALTs in both expression systems were successfully accomplished as shown by construct verification with PCR or diagnostic digestion analysis and sequence analysis. A sequence encoding a 6xHis tag was incorporated to the 5’ end of the open reading frame of the GALT genes to facilitate downstream protein detection and purification procedures. Pichia transformants with each of the five 6xHis-GALTs were obtained successfully. Due to time limitations and technical difficulties, transformants of only some of the Phsp:6xHis-GALTs and

P2x35S:6xHis-GALTs in tobacco BY2 cells were obtained. A first screening for Pichia and

BY2 transformants expressing active 6xHis-GalTs was performed by western blotting

analysis and in vitro GalT assays using [AO]7 and d[AO]51 as substrate acceptors. It

should be noted that most of the western blotting analysis and the GalT activity tests were conducted once or in duplicate, thus the data presented should be considered as preliminary and used as a guide for future investigations.

Although western blotting showed that 6xHis-GalT3, 6xHis-GalT4, 6xHis-GalT5 and 6xHis-GalT6 are being expressed in Pichia cells, none of the Pichia transformants tested so far demonstrated the [AO]7:GalT activity or the d[AO]51:GalT activity. The

protein expression analysis of the tobacco BY2 cells transformed with the Phsp:6xHis-

GALT genes and the P2x35S:6xHis-GALT genes is obscured by non-specific protein signals

present in the wild type BY2 cells upon western blotting. However, in [AO]7:GalT

139

activity assays, the upper layer membranes from the Phsp:6xHis-GALT3 and the

Phsp:6xHis-GALT6 transgenic BY2 lines showed higher GalT activity compared to the

wild type BY2 line. Intriguingly, the upper membrane layer from the same Phsp:6xHis-

GALT3 transgenic cell line showed higher [AO]7:GalT activity when the cells were

treated with the heat shock treatment compared to the untreated cells, which suggests that

the increase in the [AO]7:GalT activity is due to the expression of the 6xHis-GALT3

driven by the heat shock promoter. For the P2x35S:6xHis-GALT transgenic cell lines,

slightly higher GalT activity in assays using both [AO]7 and d[AO]51 as the substrate

acceptors was detected in P2x35S:6xHis-GALT5 transgenic line number 2, and the two

P2x35S:6xHis-GALT6 transgenic lines compared to wild type.

The discrepancy between the yeast expression result and the BY2 expression result of 6xHis-GalTs may be attributable to many reasons. Firstly, expression analyses and GalT assays need to be repeated to confirm that higher GalT activity is present in the transgenic BY2 lines compared to wild type lines. In these tests, the [AO]7:GalT assay

and the d[AO]51:GalT assay should be performed separately to discriminate the two GalT activities in the transgenic BY2 lines. Additionally, the d[AO]51:GalT assays for

microsomal membranes from most of the transgenic Pichia strains have not been

performed and need to be completed. Secondly, given that the differences in GalT

activities between the transgenic BY2 cells and the wild type BY2 cells are subtle, the

6xHis-GalTs expressed in the P2x35S:6xHis-GALT transgenic lines may need to be purified

to show that the GalT activity is due to the expressed 6xHis-GalTs rather than

endogenous GalTs in BY2 cells. The Phsp:6xHis-GALT transgenic lines have the

140 advantage that genetic variation is eliminated when GalT activity is tested for the same cell line under heat shock treatment versus no heat shock treatment. Nevertheless, the conditions for the heat shock treatment may be optimized for a more definitive result.

Thirdly, parameters for Pichia protein expression may be optimized with respect to the temperature for Pichia culturing, the methanol concentration for the induction of protein expression, and the length of time for the induction of protein expression. The same methodologies used in this dissertation have been applied to express the 6xHis-GALT2 gene in Pichia, which showed detectable [AO]7:GalT activity in the microsomal extracts

(unpublished data from Ms. Debarati Basu in the Showalter lab at Ohio University).

Although the methodologies are proven to be valid for GALT2 expression, different genes may vary in their requirements for expression conditions. Fourthly, the western blotting analysis in this study utilized an anti 6xHis-tag antibody which detects non-specific signals with similar sizes to the estimated sizes for the 6xHis-GalT fusion proteins at a low level in the control Pichia lines and at a high level in the control BY2 lines. The generation and use of GalT specific antibodies may be more efficient for the identification of Pichia strains and BY2 lines with high expression levels of the GALT genes for the activity test. Fifthly, the GalT proteins may be inactive in Pichia for various reasons, such as low protein expression, incorrect folding or improper secondary modifications (Petersen et al., 2009). An example is the Arabidopsis xyloglucan XylT 2

(AtXT2), which showed xyloglucan XylT activity when expressed in Drosophila S2

(Schneider 2) cells but not in Pichia cells (Faik et al., 2002; Cavalier and Keegstra, 2006).

Lastly, if a GT functions in an enzyme complex, it may not show activity when expressed

141 individually in heterologous systems. The last situation is exemplified by the recent discovery of the biological function of the Arabidopsis

GALACTURONOSYLTRANSFERASE 7 (GAUT7), which did not show galacturonosyltransferase activity when expressed alone however is an essential component of the biosynthetic complex of HG (Atmodjo et al., 2011).

Because microsomal proteins from the Phsp:6xHis-GALT3 transgenic line showed promising [AO]7:GalT activities, the subcellular localization of GalT3 in fusion with YFP was studied using a transient expression system in tobacco leaves. The subcellular localization result suggests that GalT3 is Golgi-localized, which is consistent with the expected localization of an AGP GalT.

142

CHAPTER 4: BIOCHEMICAL AND PHYSIOLOGICAL CHARACTERIZATION

OF FUT4 AND FUT6 MUTANTS DEFECTIVE IN ARABINOGALACTAN-

PROTEIN FUCOSYLATION IN ARABIDOPSIS

This work has been submitted to the journal of Plant Physiology for review.

Liang Y, Pattathil S, Xu W-L, Basu D, Venetos A, Faik A, Hahn MG,

Showalter AM (6-5-2012 submitted) Biochemical and physiological characterization of

fut4 and fut6 mutants defective in arabinogalactan-protein fucosylation in Arabidopsis.

4.1. Introduction

Primary plant cell walls are a composite of complex carbohydrates and protein components. In dicots, cellulose microfibers cross-linked by xyloglucan hemicellulose constitute the load-bearing framework, which is embedded in a matrix of pectic polymers made of homogalacturonan (HG), rhamnogalacturonan I (RG-I), rhamnogalacturonan II

(RG-II), and protein components (Carpita and Gibeaut, 1993). The hydroxyproline-rich glycoprotein (HRGP) family represents a major group of plant cell wall proteins. The three subfamilies of HRGPs, proline-rich proteins (PRPs), extensins (EXTs) and arabinogalactan-proteins (AGPs), share the common feature of having hydroxyproline

(Hyp) residues in their protein backbones and undergo glycosylation to various extents

143

(Showalter, 1993; Shpak et al., 2001). PRPs are the least glycosylated and suggested to

insert into and stabilize mature wall structure (Carpita and McCann, 2000). EXTs are

moderately glycosylated, characterized by motifs for intra- and inter- molecular cross-

linking, and are active players for wall self-assembly and plant defense (Memelink et al.,

1993; Wei and Shirsat, 2006; Cannon et al., 2008; Lamport et al., 2011). AGPs, having

the greatest number of family members and the highest level of glycosylation of all the

HRGPs, are implicated in various aspects of plant growth and development (Nothnagel,

1997; Showalter, 2001; Seifert and Roberts, 2007; Ellis et al., 2010). As their name

implies, AGPs are extensively glycosylated with type II arabinogalactan (AG)

polysaccharides, which are mainly composed of galactose (Gal) and arabinose (Ara)

residues, but may also contain other sugars, including rhamnose (Rha), glucuronic acid

(GlcA), galacturonic acid (GalA), and fucose (Fuc) (Nothnagel, 1997; Tan et al., 2004;

Ellis et al., 2010). Given that the sugar side chains typically account for > 90% of the molecular mass of AGPs, they are likely to define the interactive surface of the molecule and hence its function.

Recently, 85 AGP genes were identified in Arabidopsis (Arabidopsis thaliana) using bioinformatics (Showalter et al., 2010). However, the precise functions and mechanisms of action of most AGPs remain elusive. To address the function and regulation of AGPs, it is necessary to understand the mechanism underlying AGP glycosylation. Although

several glycosyltransferase activities were reported in in vitro assays for AGP

glycosylation (Mascara and Fincher, 1982; Hayashi and Maclachlan, 1984; Schibeci et al.,

1984; Misawa et al., 1996; Liang et al., 2010; Oka et al., 2010), there are to date, only

144

two Arabidopsis genes, FUCOSYLTRANSFERASE 4 and 6 (FUT4 and FUT6), identified

to encode glycosyltransferases specific for AGPs (Wu et al., 2010). FUT4 and FUT6

belong to the Carbohydrate Active enZymes glycosyltransferase (GT) family 37 (CAZy

GT37) (http://www.cazy.org/GlycosylTransferases.html), the members of which are

thought to encode α-1,2-FUTs (Sarria et al., 2001), based on the characterized function of

FUT1, which specifically fucosylates xyloglucans in pea and Arabidopsis (Perrin et al.,

1999; Faik et al., 2000). AGPs with terminal Fuc residues were mainly studied in cruciferous plants (Hashimoto, 2000). Monosaccharide composition analysis of AGPs from different organs of radish plants showed that AGP fucosylation was organ-specific and developmentally regulated (Tsumuraya et al., 1984; Tsumuraya et al., 1987;

Tsumuraya et al., 1988). In Arabidopsis, Fuc was reported to be present in root AGPs

(van Hengel and Roberts, 2002).

Previously, we demonstrated that the Arabidopsis FUT4 and FUT6 proteins fucosylate AGPs in biochemical assays in vitro and in a tobacco BY-2 expression system, but no in planta data was provided (Wu et al., 2010). Although, van Hengel and Roberts

(2002) have shown that reduction in Fuc in Arabidopsis roots affected their development,

no direct link to the genes involved in AGPs fucosylation was demonstrated. Here we

demonstrate that the FUT4 and FUT6 genes are responsible for AGP fucosylation in

Arabidopsis plants. Specifically, we focus on the biochemical and physiological

characterization of Arabidopsis fut4, fut6 and fut4/fut6 double mutants, to corroborate our

previous findings and to obtain insight to the physiological functions of AGP

fucosylation.

145

4.2. Materials and Methods

4.2.1. Plant materials and growth conditions

Arabidopsis (Arabidopsis thaliana) wild type and fut4, fut6 mutant plants were of

the Columbia-0 ecotype. Seeds of T-DNA insertion lines of fut4 and fut6 were obtained

from the Arabidopsis Biological Research Centre (http://abrc.osu.edu/) in Columbus,

Ohio. Plants were grown in soil for mutant screening, seed harvesting and growth stage

phenotypic analysis. For root harvesting, plants were grown hydroponically in water supplemented with Liquid All-Pro 7-7-7 Plant Food (Dyna-gro). The Arabidopsis hydroponic growth system was as described previously (Gibeaut et al., 1997). For the phenotypic analysis of root growth, plants were grown on MS medium (Murashige and

Skoog salts, Caisson) containing 0.5% sucrose and 1 g/L Phytogel. All plants were grown under long-day conditions (16 h of light /8 h of dark, 22°C, 55% humidity) in growth chambers or growth rooms.

4.2.2. Mutant confirmation by PCR and RT-PCR

Genomic DNA isolation from fut4, fut6 and fut4/fut6 mutant leaves and subsequent

PCR analysis was carried out using Extract-N-Amp™ Plant Kits (Sigma). The primer locations are indicated in Fig.1, and the corresponding primer sequences are listed in

Appendix A Table 1. For sequencing purposes, PCR products were purified by gel extraction with QIAquick Gel Extraction Kit (Qiagen) and sequenced by the Ohio

University Genomics Facility (http://www.dna.ohio.edu/). To analyze transcript levels of

FUT4 and FUT6, total RNA was isolated from seedlings of wild type and mutant plants

146

15 DAG using the RNeasy Plant Mini Kit (Qiagen) and the RNase-Free DNase Set

(Qiagen). Primer sequences for RT-PCR are listed in Appendix A Table 1 and Fig. 2.

4.2.3. Phenotypic analysis

A growth stage based phenotypic analysis method was adopted from Boyes et al.

(2001). Plant growth parameters including rosette perimeters (at 29 DAG), plant height

(at 43 DAG), branch numbers (at 43 DAG) and plant weight (at 49 DAG) were measured

and compared among wild type, fut4, fut6 and fut4/fut6 double mutant plants (15 plants

for each line). For the measurement of germination rate, mutant and wild type seeds (over

50 seeds for each line) were sown on the same MS plate. Germinated seeds were counted

under the light microscope every 12 h after sowing the seeds. For root morphology

analysis, mutant and wild type plants were grown on MS plates (over 24 plants for each

line). The lengths of primary roots were recorded every 24 h from 3 DAG to 10 DAG,

when the primary roots reach the edges of the petri plates. Statistical analysis was

performed using the Two-Sample Independent t Test for continuous variables obtained

from an open source software, OpenEpi

( http://www.openepi.com/OE2.3/Menu/OpenEpiMenu.htm).

4.2.4. Eel lectin staining

Roots were harvested from two-week old wild type, fut4, fut6 and fut4/fut6 double

mutant seedlings grown on MS plates. Freshly harvested roots were incubated in a

solution of eel lectin conjugated to Texas-Red (EY Laboratories) dissolved in 20 µg mL-1 in 10 mM phosphate buffered saline (pH 7.3) for 3 h in the dark at room temperature.

147

After incubation, roots were rinsed with 10 mM phosphate buffered saline (pH 7.3) and observed under a Motic BA400 EPI-Fluorescence Upright Biological Microscope (Beith

Dekel) using a Texas Red/Cy3.5 filter set. Images were captured by Motic Images Plus

2.0 software (Motic Instruments Inc.).

4.2.5. Monosaccharide composition analysis by GC-MS

AGPs were isolated from leaves and roots of wild type, fut4, fut6 and fut4/fut6 double mutant plants as reported previously (Schultz et al., 2000). AGP samples were quantified using a Yariv precipitation method (Xu et al., 2008) and gum arabic as standards. Samples were analyzed by both in-house GC-MS instruments at Ohio

University and by analytical services at Complex Carbohydrate Research Center (CCRC) at the University of Georgia (http://www.ccrc.uga.edu/). Briefly, 100 µg AGP samples were hydrolyzed in 2 M trifluoroacetic acid (2 h in sealed tube at 121°C), reduced with

NaBD4, and acetylated using acetic anhydride/trifluoroacetic acid. Inositol, 20 µg, was added as an internal standard to all samples. For the in-house GC-MS system, alditol acetates were analyzed on a Trace GC Ultra interfaced to a DSQII mass spectrometer

(Thermo Scientific). Separation was performed on a 30 m FactorFOUR VF-23 ms capillary column (Varian). At the CCRC, alditol acetates were analyzed on a 7890A GC interfaced to a 5975C mass spectrometer (Agilent). Separation was performed on a 30 m

Supelco 2330 capillary column (Sigma).

148

4.2.6. Monosaccharide composition analysis by HPAEC

Root AGP samples (1 µg each) from wild type, fut4, fut6 and fut4/fut6 double mutant plants were hydrolyzed by total acid hydrolysis and fractionated on a CarboPac

PA20 column (Dionex) as described previously (Liang et al., 2010). Fuc, Glc, Gal, Ara, and Xyl monosaccharides (Acros Organics) were used as standards and treated under the same conditions.

4.2.7. Monoclonal antibodies

Monoclonal antibodies were obtained as hybridoma cell culture supernatants from laboratory stocks at the Complex Carbohydrate Research Center [available from

CarboSource Services (http://www.carbosource.net)]. A detailed list of the antibodies used, grouped according to the polysaccharide primarily recognized by the antibodies

(Pattathil et al. 2010), is provided in Appendix C, which also includes links to a database,

WallMabDB (http://www.wallmabdb.net), containing more detailed information about each antibody.

4.2.8. Preparation of AIR (Alcohol Insoluble Residue) extracts and fractionation

Plant tissues were isolated and ground to a fine powder using liquid nitrogen and a mortar and pestle. The powder was then suspended in 80% (v/v) ethanol, vortexed and centrifuged at 3000 g for 5 min. The residue was then suspended in absolute ethanol and centrifuged as above. Subsequently, the residue was suspended in chloroform:methanol

(1:1 [v/v]) and stirred for one hour at room temperature. This suspension was centrifuged

149

again and the AIR extract was washed with acetone. All the above steps were repeated,

and the final residue was air-dried at room temperature.

AIR fractionation was done in order to obtain an overall picture of the glycan epitope composition of the walls and gain insight into how tightly these epitopes are bound into the wall matrix. For this purpose, AIR samples were sequentially extracted with 50 mM ammonium oxalate, pH 5.0, 50 mM sodium carbonate, pH 10 (containing

0.5% [w/v] sodium borohydride), 1 M KOH (containing 1% [w/v] sodium borohydride), and 4 M KOH (containing 1% [w/v] sodium borohydride). Each treatment was done for a period of 24 h, after which time the samples were centrifuged at 4000 g for 15 min and the supernatants decanted. The pellets were washed with water and centrifuged as before; the water wash was discarded and the pellet retained for the next extraction. Each wall extract was neutralized (if necessary), dialyzed extensively against water, and lyophilized for further analysis.

4.2.9. Total sugar estimation and ELISA

Cell wall extracts were dissolved in de-ionized water (0.2 mg mL−1) and total sugar

contents of cell wall extracts were estimated using the phenol-sulfuric acid method

(DuBois et al., 1956; Masuko et al., 2005). All solubilized fractions were adjusted to an

equal amount of total carbohydrate content prior ELISA assay. Cell wall extracts (60 μg

sugar mL−1) were applied to the wells of 96-well ELISA plates (Costar 3598) at 50 μL per well and allowed to evaporate to dryness overnight at 37°C. A Biotek robotic system

(Biotek) was used to perform fully automated ELISAs using a series of 152 monoclonal antibodies directed against diverse plant cell wall carbohydrate epitopes (Pattathil et al.,

150

2012). ELISA data are presented as heatmaps (glycome profiles) in which the antibody

order and groupings are based on a hierarchical clustering analysis that groups the

antibodies based on similarities in their binding patterns to a panel of diverse plant

glycans (Pattathil et al., 2010).

4.2.10. Expression analysis of FUT4 and FUT6 genes based on microarray data from

Genevestigator

Expression data for the FUT4 and FUT6 genes were acquired from the

Genevestigator web site (https://www.genevestigator.com/) using the ‘perturbations’

condition search. Filter options were used to select for fold changes in expression levels (>

2 or < -2) and for P < 0.05.

4.3. Results

4.3.1. Isolation of T-DNA insertion lines of fut4 and fut6 and generation of the

fut4/fut6 double mutant

Arabidopsis fut4 (SAIL_284_B05) and fut6 (SALK_099500) single mutants with T-

DNA insertions in the exons of FUT4 or FUT6 genes were ordered from the Arabidopsis

Biological Resource Center (ABRC, Columbus, OH) and used to obtain homozygous

mutants. Genotyping of the homozygous mutant plants was performed by PCR analysis

using wild type controls (Fig. 4.1). Flanking regions of left borders of the T-DNA insertions were amplified and sequenced with T-DNA left border (LB) primers and

FUT4- or FUT6-specific right primers (RP). DNA sequencing showed that the insertion

151

sites were located at nucleotide position 1438 (relative to the first nucleotide of the start

codon designated as position 1) for fut4 and nucleotide position 31 for fut6, which were

consistent with the records of insertion sites of the mutant lines on the Salk Institute

Genomic Analysis Laboratory web site (SIGnAL, http://signal.salk.edu/cgi-

bin/tdnaexpress). To confirm that the flanking regions of the right borders of the T-DNA

insertions were also restricted to the fut4 and fut6 genes, PCRs with T-DNA right border

(RB) primers and FUT4- or FUT6-specific left primers (LP) were performed, but no

amplifications were obtained. Instead, specific bands were amplified using primer pairs

of LB and LP (data not shown). Sequencing results of the product amplified with LB and

LP indicated that a second T-DNA insertion was located at nucleotide position 1477 in

fut4, downstream and in the opposite direction of the insertion at position 1438 (Fig. 4.1).

Similarly in fut6, sequencing showed that at least two T-DNA insertions, in opposite

directions, interrupted FUT6 between nucleotide position -4 and position 31. The

occurrence of such T-DNA insertions as tandem repeats is not infrequent (Ponce et al.,

1998). Absence of the FUT4 and FUT6 transcripts in the corresponding single mutant lines was confirmed by RT-PCR (Fig. 4.2). However, it should be noted that the possibility that truncated versions of FUT4 and FUT6 transcripts are present cannot be

excluded based on the locations of the primers used in these experiments (Appendix A

Table 1 and Fig. 2).

152

Figure 4.1 Identification of T-DNA insertion lines of Arabidopsis fut4, fut6, and fut4/fut6 mutants by PCR. A, Schematic diagrams of mutant fut4 and fut6 genes with T-DNA insertions. Black box, grey box and white box represent exons, introns and untranslated regions, respectively. Both the fut4 (SAIL_284_B05) and fut6 (SALK_099500) lines were identified to contain double T-DNA insertions (black triangles), as discussed in the text. B, Genotyping of wild type (W), fut4 (4), fut6 (6) and fut4/fut6 (4/6) double mutant plants by PCR. Positions of the PCR primers are shown in A with arrows. The expected sizes amplified with primer pairs of F4LP and F4RP (1026 bp), LB3+F4RP (450 bp), F6LP and F6RP (450 bp), LBa1 and F6RP (808 bp) are indicated with arrows in B. The 2-Log DNA ladder (New England Biolabs) was used as a molecular weight marker (M).

153

Figure 4.2 RNA transcript levels of FUT4 and FUT6 genes in homozygous fut4, fut6, fut4/fut6 double mutants and wild type seedlings. RT-PCR of total RNAs isolated from Arabidopsis seedlings 15 days after germination (DAG) were performed using gene-specific primers (See Appendix A Table 1 and Fig. 2). ACTIN was used as a control for loading.

Homozygous fut4 and fut6 single mutants were crossed, and the F1 generation was selfed before screening the F2 generation to obtain homozygous fut4/fut6 double mutants

(Fig. 4.1). As expected, no FUT4 or FUT6 transcripts were observed in the double mutant line (Fig. 4.2).

4.3.2. Phenotypic analysis of fut4, fut6 and fut4/fut6 mutants under physiological growth conditions

Growth stage-based phenotypic analysis (Boyes et al., 2001) was performed to evaluate growth and reproduction of the fut4, fut6 and fut4/fut6 mutants compared to wild type Arabidopsis plants. However, no significant differences were observed among the fut4, fut6 and fut4/fut6 mutants and wild type plants in terms of rosette size, plant height, weight and stem branching numbers (Fig. 4.3). The fut4, fut6 and fut4/fut6 mutants did not show abnormalities in reproduction as indicated by similar flowering times, seed yields, and seed germination rates compared to wild type plants under physiological growth conditions (Table. 4.1; Fig. 4.4).

154

Figure 4.3 Phenotypic analysis of Arabidopsis fut4, fut6 and fut4/fut6 mutant plants compared to wild type plants under physiological growth conditions. A, Whole plant images of wild type, fut4, fut6 and fut4/fut6 mutant plants grown in soil at 32 DAG. B, The rosette perimeters (at 29 DAG), plant height (at 43 DAG), branch numbers (at 43 DAG) and plant weight (at 49 DAG) compared among wild type, fut4, fut6 and fut4/fut6 mutant plants. Data and error bars represent mean + SD (n = 15). No significant differences were identified between wild type and any of the mutant lines for the parameters measured, as indicated by P > 0.05 in student t tests for continuous variables.

155

Table 4.1 Flowering time and silique numbers of wild type, fut4, fut6 and fut4/fut6 mutant plants grown in soil under physiological growth conditions

Wild Type fut4 fut6 fut4/fut6 a Flowering Time 24.3 + 0.7 24.0 + 0.0 24.0 + 0.0 24.2 + 0.6 (days + SD) b Number of Siliques 181 + 71 221 + 50 229 + 69 224 + 118 (number per plant + SD) a b Flowering time was recorded when the first flower buds were visible. Number of siliques was counted at 49 DAG (n = 15). No significant differences were identified between wild type and any of the mutant lines for flowering time and number of siliques, as indicated by P > 0.05 in student t tests for continuous variables.

Figure 4.4 Germination rate of Arabidopsis fut4, fut6 and fut4/fut6 double mutant and wild type plants. Plant seeds were sown on MS plates containing 0.5% sucrose. Over 50 seeds were analyzed for each line.

To examine root growth of the mutant plants, fut4, fut6 and fut4/fut6 mutant and wild type plants were grown on media plates containing Murashige and Skoog (MS) salts supplemented with 0.5% sucrose. No significant differences in root morphology or root

156

growth rates were observed among wild type, fut4, fut6 and fut4/fut6 mutant plants (Fig.

4.5). In addition, transverse sections of roots from wild type and fut4/fut6 double mutant

plants were stained with toluidine blue, observed under a light microscope, and found to

be histologically identical (data not shown).

Figure 4.5 Root growth of Arabidopsis wild type, fut4, fut6 and fut4/fut6 mutant plants. Plants were grown on MS plates containing 0.5% sucrose. A, Light microscope images of wild type, fut4, fut6 and fut4/fut6 roots at 15 DAG. Scale bars, 10 µm. B, Root length was measured daily as a function of time from 3-10 DAG. Values shown were averages of data from over 24 individual plants per line, with standard errors shown as error bars.

157

4.3.3. Eel lectin shows a different staining pattern in roots of fut6 and fut4/fut6 mutant plants compared to fut4 and wild type plants

Screening for and characterization of Fuc residues on AGPs from cruciferous plants were performed using eel lectin (Hashimoto, 2000), a reagent that binds specifically to terminal α-L-Fuc residues (Watkins and Morgan, 1952; Springer and Desai, 1971). van

Hengel and Roberts (2002) showed that eel lectin recognizes α-L-Fuc attached to α-L-

Ara residues in Arabidopsis root AGPs, but not α-L-Fuc-(12)-β-D-Gal in RG-I. The binding specificity of eel lectin is also different from the specificity of the monoclonal antibody, CCRC-M1, which recognizes the α-(12)-linked fucosyl epitope on xyloglucan (Puhlmann et al., 1994; van Hengel and Roberts, 2002). When whole roots of

14-day old Arabidopsis plants were subjected to staining with eel lectin conjugated to

Texas-Red, the wild type and fut4 mutant plants showed a bright, patchy staining pattern on the root surface, which was absent in the fut6 and fut4/fut6 mutant roots (Fig. 4.6). No significant differences were observed for the leaves from wild type, fut4, fut6 and fut4/fut6 mutants stained with eel lectin conjugated to Texas-Red (data not shown).

158

Figure 4.6 Eel lectin staining of roots from WT, fut4, fut6 and fut4/fut6 mutant plants. Eel lectin conjugated to Texas-Red was used to stain the roots of Arabidopsis plants grown on MS media containing 0.5% sucrose for 14 days. The stained roots were observed under fluorescent light (FL) and white light (WL) as indicated. Scale bars, 10 µm.

4.3.4. fut4/fut6 mutants are defective in AGP fucosylation in roots and leaves

Because FUT6 was mostly expressed in root while FUT4 showed the greatest

expression in leaf but was also expressed in root (Sarria et al., 2001), leaf and root AGPs were purified using β-Yariv reagent from wild type, fut4, fut6 and fut4/fut6 mutant plants.

The purified root AGPs were acid hydrolyzed to monosaccharides and analyzed by High pH Anion Exchange Chromatography (HPAEC). As expected, HPAEC detected Ara and

Gal as the two dominant peaks in all of the AGP samples. Glucose (Glc) residues were also detected in each root sample as a minor peak. Intriguingly, Fuc was detected in root

AGP samples from wild type plants as well as the fut4 and fut6 plants but not in fut4/fut6 root AGPs (Fig. 4.7). To confirm this HPAEC analysis result, AGP samples from both roots and leaves were subjected to monosaccharide composition analysis using gas

159

chromatography-mass spectrometry (GC-MS). Consistent with the HPAEC result, the

GC-MS data showed that AGPs from all samples were mainly composed of Gal and Ara

residues, with a ratio of approximately 3:1 for root samples and approximately 2:1 for

leaf samples in both wild type and mutant lines with subtle variations (Table 4.2.).

Compared to wild type, both fut4 and fut6 showed a decrease of Fuc content by

approximately 50% in root AGPs, while the fut4/fut6 double mutant contained no

detectable Fuc in their root AGPs. Fuc was also not detected in AGPs from fut4/fut6 and fut4 mutant leaves. In contrast, the Fuc content in leaf AGPs from fut6 was increased by approximately 1.5 fold compared to wild type.

Figure 4.7 Monosaccharide composition analysis of root AGPs from WT, fut4, fut6 and fut4/fut6 mutant plants by HPAEC. Peaks 1-3 are noise signals; peaks 4-7 represent Fuc, Ara, Gal and Glc respectively, as determined by retention times of a set of sugar standards including Fuc, Ara, Gal, Glc and xylose (Xyl). Representative profiles of two biological replicates of each sample were shown.

Table 4.2 Neutral monosaccharide content of purified AGPs from roots and leaves of wild type, fut4, fut6 and fut4/fut6 mutant plants. Root AGPsa Leaf AGPsa

Wild Type fut4 fut6 fut4/fut6 Wild Type fut4 fut6 fut4/fut6

+ + + + + + + + Ara 23.8 2.4 23.2 1.0 24.9 2.5 26.4 2.0 34.3 1.1 38.1 1.3 35.8 0.4 39.2 0.6

Gal + + + + + + + + 70.6 1.0 70.1 1.9 73.0 0.5 72.0 0.1 61.6 1.1 61.9 1.3 61.6 1.1 60.8 0.6

+ + + + + Fuc 1.8 0.2 0.8 0.1 1.0 0.1 ND 2.1 0.1 ND 3.5 0.6 ND

+ + + + Glc 2.2 0.2 5.9 0.8 1.1 1.0 2.3 1.6 ND ND ND ND

a AGP samples were derivatized by the alditol acetate method and analyzed using GC-MS. The molar percentage of each sugar is presented. The values are averages of two biological replicates. The standard deviations are indicated. ND, not detected. 160

161

4.3.5. Glycome profiling of root and leaf cell walls of wild type, fut4, fut6 and

fut4/fut6 mutant plants

To examine alterations in the compositions and interactive properties of cell wall

polysaccharides in fut4, fut6 and fut4/fut6 mutants compared to wild type plants, total cell

walls from leaf and root materials were sequentially extracted and the solubilized glycans

were subjected to glycome profiling. Glycome profiling provides both a qualitative and

quantitative measure of cell wall glycan epitopes present in the walls using a high-

throughput enzyme-linked immunosorbent assay (ELISA) method (Pattathil et al., 2012).

Approximately, one hundred fifty monoclonal antibodies, which recognize 19 groups of

glycan epitopes present in most major classes of cell wall glycans, were used in the

ELISAs. The output of these ELISAs is typically shown as a heatmap, where the

antibodies are grouped according to their reactivities with a diverse panel of plant cell

wall glycans (Pattathil et al., 2010).

Glycome profiles of four sequential extractions of cell wall materials from roots and

leaves are shown in Fig. 4.8 and Fig. 4.9, respectively. The four sequential extractions

included an oxalate extraction, followed by alkaline extractions, with progressively

increasing strengths. Oxalate binds to and depletes Ca2+ from cell walls, during and after

which, the most loosely associated pectic polysaccharides as well as AGPs are solubilized

(Gorshkova et al., 1996; Giunter et al., 2004). Alkali solutions of different strengths

extract mainly hemicellulose (Fry, 1988), but also solubilize more tightly bound pectins

and AGPs. For the wild type root sample, consistent with the extraction method, pectic and AG epitopes, including epitopes for RG-I/AG and AG-1, 2, 3, 4, showed the

162

strongest signals in the oxalate extract, while most epitopes for , including epitopes for nonfucosylated xyloglucans (NON-FUC XG), fucosylated xyloglucans (FUC

XG), and xylans (recognized by XYLAN-1, 2, 3, 4 groups of antibodies) were present in the 1 M KOH and 4 M KOH extracts (Fig. 4.8). It should be noted that fractionation by no means separates cell wall components into pure polymers (Fry, 1988) and that epitopes recognized by the same monoclonal antibodies may be present on different wall polymers (Pattathil et al., 2010). Furthermore, the different sub-fractions of wall glycans are bound into the wall matrix to different degrees. Thus, while the glycome profiles of wild type roots showed abundant pectic and AG epitopes in the oxalate extract, these

epitopes were also observed in decreasing amounts in the carbonate, 1 M KOH, and 4 M

KOH extracts (Fig. 4.8).

163

Figure 4.8 Glycome profiling of sequential cell wall extracts from roots of WT, fut4, fut6 and fut4/fut6 double mutant plants. The presence of cell wall glycan epitopes in each extract (as indicated at the bottom of each column) was determined by ELISAs using 152 glycan-directed monoclonal antibodies. The data are presented as heat maps, with bright yellow indicating strongest binding and black indicating no binding. The altered patterns of mAb binding affinity in the mutant fractions compared to wild type fractions were highlighted in green (increased binding) or red (decreased binding) rectangles.

164

Figure 4.9 Glycome profiling of sequential cell wall extracts from leaves of WT, fut4, fut6 and fut4/fut6 double mutant plants. The presence of cell wall glycan epitopes in each extract was detected as described in Fig. 4.8.

Glycome profiles of cell wall extracts from roots of fut4, fut6 and fut4/fut6 mutants

were different to varying extents from the corresponding extracts prepared from wild type

roots (Fig. 4.8). In the oxalate extract, epitopes for the HG BACKBONE and AG-3

165 groups of antibodies showed decreased abundance in the fut6 and fut4/fut6 lines, respectively. The carbonate extract of fut4 roots exhibited signals for hemicellulose and pectin backbone epitopes, i.e. NON-FUC XG, FUC XG, XYLAN-1, 3, HG BACKBONE and RG-I BACKBONE, which were largely absent in the same extract of wild type samples. The fut6 and fut4/fut6 lines showed decreased abundance for epitopes for NON-

FUC XG and FUC XG in 1 M KOH and 4 M KOH extracts compared to wild type.

Interestingly, in the fut6 and fut4/fut6 lines, epitopes for RG-I/AG and AG-4 had decreased signal intensities in the 1M KOH and 4M KOH extracts, but increased signal intensities in the carbonate extract. Similarly, in all three mutant lines, epitopes for

XYLAN-3 showed reduced signal intensity in the 1 M KOH extract, but epitope signal intensity increased in the carbonate extract.

Glycome profiles of leaves from fut4, fut6, fut4/fut6 and wild type plants showed similar patterns to one another in the carbonate and 4 M KOH extracts (Fig. 4.9). In the oxalate extract, increased signals for pectic AG epitopes (RG-I/AG) were observed in all three mutant lines and increased AG epitopes (AG-1, 2, 4) were observed in the fut4 and fut4/fut6 mutant lines compared to the wild type line. In addition, the 1 M KOH extract in the fut4/fut6 double mutant showed increased signals for multiple epitopes, including

XYLAN-3, XYLAN-4, RG-I BACKBONE, LINSEED MUCILAGE RG-I, and RG-I/AG.

4.4. Discussion

Over 1000 Arabidopsis genes have been predicted to encode genes involved in cell wall biosynthesis (Yong et al., 2005). However, to date, only about two dozen genes have been identified unambiguously as GTs or glycan synthases responsible for the

166

biosynthesis of specific cell wall molecules. Identification of cell wall biosynthetic

enzymes through a biochemical purification route is often hampered by the low

abundance of the enzymes or difficulties in maintaining enzymatic activities during the

solubilization/purification process. Genetic strategies are also potentially hampered by

functional redundancy of cell wall synthetic genes or the plasticity of cell wall structure,

which is often insensitive to the disruption of single genes (Reiter et al., 1997).

Alternatively, bioinformatics provides a means to identify candidate genes involved in the

biosynthesis of particular wall molecules, although additional research is required to

verify the function of the genes identified in this manner. Using a bioinformatic strategy,

nine Arabidopsis genes, namely FUT2-FUT9 (Sarria et al., 2001), were identified as homologues to Arabidopsis FUT1, which has a defined function as an α-1,2-FUT specific

for xyloglucan fucosylation in all organs of Arabidopsis plants (Perrin et al., 1999; Faik et

al., 2000). We previously demonstrated that the FUT4 and FUT6 genes are AGP FUTs by

introducing the FUT4 and FUT6 genes into tobacco BY2 cells, which normally do not

contain fucosylated AGPs, and showing that AGP FUT activities are acquired by the transgenic BY2 cells (Wu et al., 2010). In this report, fut4 and fut6 mutants as well as a

fut4/fut6 double mutant were utilized to corroborate our previous biochemical findings

using molecular genetics and to obtain insight to the physiological functions of FUT4 and

FUT6 in Arabidopsis plants.

T-DNA insertion sites in fut4 and fut6 mutants were identified and confirmed by

PCR and sequencing. RT-PCR using primers across the T-DNA insertion sites showed the absence of FUT4 or/and FUT6 transcripts in the corresponding fut4, fut6 and fut4/fut6

167

mutants. Further RT-PCR analysis using primers located at the other regions of the FUT4 and FUT6 genes may be performed to determine whether the fut mutant lines are truely null mutants or instead contain truncated versions of FUT4 or FUT6. Expression analysis

has shown that FUT6 is expressed nearly exclusively in the root with only trace

expression in flowers, while FUT4 is expressed more ubiquitously in root, stem, leaf and

flower and has the highest expression in leaf (Sarria et al., 2001). Monosaccharide

composition analyses of the AGPs isolated from the fut mutants examined in this study

(Table 4.2) are consistent with these previous gene expression studies. Specifically, Fuc

residues are present in wild type root and leaf AGPs, but absent in root and leaf AGPs

from the fut4/fut6 mutant. Moreover, the fut4 and fut6 single mutants had reduced Fuc

content in their root AGPs. Fuc residues were absent in fut4 leaf AGPs, but present in fut6

leaf AGPs (Table 4.2). The fucosylation patterns of AGPs in wild type and mutant lines,

together with the expression patterns of FUT4 and FUT6, clearly indicate that AGP

fucosylation is attributable to both FUT4 and FUT6 genes in roots, but only to the FUT4

gene in leaves. It would be interesting to investigate the role of FUT4 in AGP

fucosylation in stems and flowers, where the expression of FUT4 was also abundant at

least at a transcriptional level (Sarria et al., 2001). The fact that FUT3, FUT5 and FUT7

to FUT10 were all expressed in leaves at different levels (Sarria et al., 2001), but did not

compensate for the lost function of FUT4 in the fut4 mutant, indicates that these genes

might not be involved in AGP fucosylation and instead may fucosylate other non-AGP

molecules. Alternatively, these genes may fucosylate AGPs, but only under certain non-

physiological conditions, which were not examined in this study. FUT4 and FUT6 were

168

shown to differentially fucosylate AGPs, i.e. adding Fuc residues to different sites on AG

side chains (Wu et al., 2010). Given the extensive heterogeneity of polysaccharide

structures present in AGP populations, some AG polysaccharides may lack Fuc, while

others may contain Fuc added by one or both of the FUT enzymes. Moreover, FUT6 likely adds a terminal Fuc residue to AG polysaccharides that is particularly reactive with eel lectin, since the fut6 mutant roots contained Fuc in AGPs but were not stained by eel lectin (Fig. 4.6; Table 4.2). It is likely that epitopes recognized by eel lectin were synthesized by FUT6 and were abolished in fut6 roots, whereas FUT4 still functioned in

AGP fucosylation, but added the Fuc at a different site compared to FUT6.

AGPs are implicated in the regulation of root growth and root epidermal cell expansion, as treatment of Arabidopsis roots with β-Yariv reagent, which specifically aggregates some AGPs, results in a reduction of root length and epidermal cell length, as well as produces a “bulging” phenotype of epidermal cells (Willats and Knox, 1996;

Ding and Zhu, 1997). The Arabidopsis mur1 mutant exhibited root phenotypes of shortened root length, which were attributed to shortened, swollen root hair cells presented in periodic zones of cells, interspersed by zones containing cells of normal length (van Hengel and Roberts, 2002). In addition, the length of the tip of mur1 roots, measured from root cap to the first initiated root hair has a 40% reduction in length compared to wild type roots. The mur1 mutant was deficient in synthesizing GDP-L-Fuc

(Bonin et al., 1997), the sugar donor substrate for the biosynthesis of Fuc containing polymers. Considering that AGPs from mur1 roots had a reduced Fuc content as compared to wild type root and that eel lectin treatment of Arabidopsis plants

169 phenocopies the mur1 mutant in root, van Hengel and Roberts (2002) postulated that under-fucosylated AGPs were the cause of the mur1 root phenotype. Consistent with the study of van Hengel and Roberts, Fuc residues were detected in AGPs from wild type

Arabidopsis roots (Table 4.2). However, the fut4/fut6 double mutant, in which Fuc residues are completely eliminated from root AGPs, did not show changes in the root phenotype (Table 4.2; Fig. 4), indicating that the absence of Fuc residues in AGPs per se, does not result in an abnormality of root growth. Besides AGPs, common Fuc-containing cell wall polymers include xyloglucan, pectin polymers and N-glycans. Similar to the fut4/fut6 double mutant reported here, mutants deficient in the fucosylation of xyloglucan or N-glycan had normal root growth, indicating the fucosylation status of xyloglucan or

N-glycan alone did not affect root morphogenesis (van Hengel and Roberts, 2002; Vanzin et al., 2002). Nonetheless, it cannot be excluded that the additive effects of under- fucosylation of multiple cell wall polymers resulted in the defects in root growth observed in the mur1 mutant.

Compared to wild type plants, fut4, fut6 and fut4/fut6 mutants showed distinct glycome profiles for root and leaf cell wall extracts (Fig. 4.8 and Fig. 4.9). Glycome profiling provides information about both the glycan epitope composition of cell walls, as well as how tightly those epitopes are bound into the wall matrix. To date, no antibody specifically directed at a fucosylated AGP epitope has been generated or characterized.

The glycome profiles of the fut mutants examined here, not surprisingly, do not show the complete absence of a particular class of cell wall glycan epitopes. However, the glycome profiles of the fut mutants do show changes in the epitope compositions of individual

170

extracts, suggesting that the mutations have affected the extractability of the glycan

epitopes. For example in the glycome profiles of cell wall extracts from roots (Fig. 4.8),

many epitopes, including epitopes for NON-FUC XG, FUC XG, XYLAN-1, 3, HG

BACKBONE and RG-I BACKBONE, showed increased signals in the carbonate extract

from the fut4 mutant, indicating that some xyloglucan and pectin polymers become more extractable in the fut4 mutant root by carbonate. On the other hand, the coupling of a decrease of signals in the more alkaline fractions with an increase of signals in the less alkaline fractions for RG-I/AG, AG-4 and XYLAN-3 epitopes in the fut mutant root, indicates the corresponding polymers are more extractable in the mutants compared to wild type. In contrast, epitopes for NON-FUC XG and FUC XG showed decreased signals in the 1 M KOH and 4 M KOH extracts for fut6 and fut4/fut6 lines compared to wild type, indicating decreased extractability of some xyloglucan polymers in fut6 and fut4/fut6 roots. At a molecular level, AGPs have long been suggested to interact with other wall polymers or wall modifying enzymes to modulate cell wall architectures (Roy et al., 1998; Seifert and Roberts, 2007). Strong and specific binding between β-Yariv reagent and some AGPs implies that such AGPs may interact with β-glycan in vivo

(Seifert and Roberts, 2007). Interactions between AGPs and pectic polymers were often suggested based on co-purification of the two polymers (Carpita, 1989; Iraki et al., 1989;

Immerzeel et al., 2006). Binding between AGPs and pectin fractions was also shown in an in vitro binding experiment (Baldwin et al., 1993). In the current study, the changes of the extractability of wall polymers as suggested by the variation of epitope abundance in different wall extracts, in essence, reflects alterations in the interactions among cell wall

171 polymers (i.e. AGPs, pectin, hemicellulose, and cellulose) in the fut4, fut6 and fut4/fut6 mutants. Thus, Fuc residues on AGPs might play a critical role in mediating such interactions among wall polymers.

It is interesting that changes in cell wall structure in the fut mutants were not manifested as morphological phenotypes at the organismal level. Perhaps morphological phenotypes will be observed at the organismal or cell wall level with more rigorous examination e.g. structural examination by electron microscopy, or monitoring plant growth under a wide range of non-physiological conditions. As an initial attempt to look for potential roles of FUT4 and FUT6 under non-physiological conditions, expression profiles of the two genes were examined based on public microarray data, which show that the two genes are differentially affected by several treatments (Table 4.3). The microarray data, although requiring experimental verification, provide a guideline for the selection of growth conditions or treatments to further investigate the physiological roles of FUT4 and FUT6 in Arabidopsis.

Table 4.3 Expression analysis of the FUT4 and FUT6 genes in response to biotic and abiotic stimuli based on microarray data from Genevestigator (https://www.genevestigator.com/gv/).

FUT4 a FUT6 a

Categories Stimulus Fold Change P Value Fold Change P Value

Both FUT4 and FUT6 showed increase or decrease in expression for more than two folds under stimulus conditions

Biotic P.syringae pv. maculicola (Col-0)/ mock treated leaf samples (Col-0) 22.1 <0.001 6.89 0.002

Stress hypoxia study 6 (Col-0)/untreated plant samples (Col-0) 2.11 0.048 2.49 0.025

FUT4 but not FUT6 showed increase or decrease in expression for more than two folds under stimulus conditions

Biotic P.syringae pv. syringae study 2 (Col-0)/ non-infected leaf samples (Col-0) 23.65 0.01 1.45 0.387

Biotic P.parasitica (6h)/ non-infected root samples (Col-0) 4.31 0.004 -2.11 0.22

Biotic P.syringae pv. tomato study 12 (Col-0)/ untreated leaf tissue samples (Col-0) 3.94 0.023 1 0.933

Biotic B.graminis (Col-0)/non-infected rosette leaf samples 3.33 <0.001 1.2 0.016

Biotic P.syringae pv.phaseolicola (24h)/mock inoculated leaf samples (24h) 2.73 <0.001 1.37 0.04

Biotic P.syringae pv.tomato study 3 (DC3000 hrC-)/mock inoculated leaf samples (24h) 2.39 <0.001 1.16 0.19

Chemical cycloheximide/mock treated seedlings 10.37 0.016 -1.76 0.011

Chemical syringolin study 3 (late)/ solvent treated leaf samples (Col-0; late) 4.53 0.002 -1.01 0.955

Chemical Chitin/mock treated seedlings 3.98 <0.001 -1.35 0.042

Chemical ozone/air treated seedlings 2.74 0.044 1.09 0.704 172

173

Table 4.3 (continued)

Chemical benzothiadiazole study 3 (Col-0)/ untreated (Col-0) plant samples 2.36 0.013 1.02 0.898

Chemical AgNO3/mock treated seedlings 2.05 0.031 -1.05 0.73

Chemical phenathrene/ untreated Col plant samples 2.02 0.003 1.6 0.028

Elicitor HrpZ (4h)/H2O treated leaf samples (4h) 10.06 0.004 1.03 0.655

Elicitor FLG22 study 2 (1h)/ H2O treated Col-0 seedlings (1h) 5.73 <0.001 1.05 0.89

Elicitor FLG22 study 4 (Col-0)/ untreated leaf disc samples (Col-0) 5.6 0.006 1.13 0.397

Elicitor GST-PP1(4h) 4.12 <0.001 1 0.817

Elicitor OGs (1h)/ H2O treated Col-0 seedlings (1h) 4.01 0.002 1.14 0.582

Elicitor FLG22 (1h)/H2O treated leaf samples (1h) 3.98 <0.001 1.06 0.495

Elicitor FLG22 study 5 (Col-0)/ untreated leaf disc samples (Col-0) 3.62 0.057 1.2 0.308

Hormone NAA + FLG22 (1h)/ untreated leaf disc samples (Col-0) 4.93 0.008 1.05 0.687

Hormone salicylic acid study 4 (Col-0)/ mock treated seedlings 3.15 0.01 1.62 0.245

Hormone IAA study 9 (Col-0)/ untreated seedling samples (Col-0) 1.92 0.009 -1.09 0.32

Nutrient N depletion (Col-0)/ seedlings grown under N-replete condition (Col-0) 2.65 <0.001 1.8 <0.001

Stress cold study 6 (Col-0)/ 20C/18C treated rosette samples (Col-0) 2.21 0.21 -1.01 0.979

Stress Osmotic (late)/untreated green tissue samples (late) 2.08 0.009 1.26 0.056

173

174

Table 4.3 (continued)

FUT6 but not FUT4 showed increase or decrease in expression for more than two folds under stimulus conditions

Hormone ABA study 6 (Col-0)/untreated plant samples (Col-0) -1.07 0.491 -2.46 0.003

Light light study 2/dark grown Col-0 seedlings -1.03 0.719 -2.08 0.03

Light light study 6 (Col-0)/dark grown Col-0 seedlings -1.8 0.029 -3.48 0.003

Nutrient nitrate starvation/untreated seedlings 1.84 0.031 2.22 <0.001

Nutrient iron deficiency (LZ4)/mock treated root samples (LZ4) -1.28 0.64 2.1 0.01

Stress salt study 2(late)/untreated root samples (late) 1.07 0.448 -2.01 0.008

Stress anoxia study 2/dark grown Col-0 seedling samples -1.55 0.246 -2.26 0.004

Stress osmotic study 2 (late)/untreated plant samples (Col-0) -1.4 0.002 -2.71 <0.001

Stress heat study 3/dark grown Col-0 seedling samples -1.75 0.178 -2.81 0.001

Stress heat/anoxia/dark grown Col-0 seedling samples -1.39 0.612 -3.48 0.033

aExpression data of FUT4 and FUT6 genes for Arabidopsis (Col-0 ecotype ) were shown. 174

175

In conclusion, the characterization of the fut4, fut6 and fut4/fut6 mutants reported here further supports our previous biochemical study of FUT4 and FUT6 (Wu et al.,

2010). Furthermore, we show here that FUT4 fucosylates AGPs in leaves and that FUT4 and FUT6 fucosylate AGPs in roots of Arabidopsis. Non-fucosylated AGPs present in the mutants did not affect normal root growth, but nonetheless resulted in biochemical changes in overall cell wall structure in fut4, fut6 and fut4/fut6 mutants as indicated by glycome profiling. Based on this profiling work, it is likely that the Fuc residues present in AGPs function to regulate interactions between AGPs and other cell wall molecules and thus contribute to overall cell wall architecture. The current work also exemplifies the possibilities of refinement of cell wall structures by manipulation of a single or a few cell wall biosynthetic genes. Indeed, plants with such modified wall structures may display altered biomass recalcitrance and thus have the potential to enhance biofuel production.

176

CHAPTER 5: CONCLUSIONS

AGPs represent a plant-cell-surface glycoprotein family which includes a large

number of family members that play diverse and important roles in plant growth and

development. As an initial attempt to decipher the glycosylation machinery involved in

AGP biosynthesis, this dissertation research focused on the identification and

characterization of the GalTs that incorporate the first one or two Gal residues and the

FUTs that add the terminal Fuc residues in the AG polysaccharide side chains.

An in vitro AGP GalT assay was developed, which used permeabilized microsomal

membranes from either tobacco or Arabidopsis suspension-cultured cells as the enzyme

source and UDP-[14C]Gal as the sugar donor. When a chemically synthesized peptide,

[AO]7, was used as the substrate acceptor, the assay mainly detected a GalT activity that

transferred the first Gal onto Hyp residues of the protein backbone. When a

transgenically expressed and HF deglycosylated AGP polypeptide, d[AO]51, was used as

the substrate acceptor, the assay mainly detected the GalT activity that added the second

Gal to Gal residues existing on the protein backbone (Fig. 2.5). The incorporation of

[14C]Gal into the peptide substrate acceptors was detected by RP-HPLC fractionation of

the assay products after removal of unreacted UDP-[14C]Gal by an ion-exchange column

(Fig. 2.3). After RP-HPLC, the purified assay product was subjected to total acid

hydrolysis coupled with an HPAEC analysis to determine the sugar transfer specificity

(Fig. 2.4) and to base hydrolysis coupled with the Bio-gel P2 size exclusion

chromatography to determine the extent of glycosylation in the GalT assay (Fig. 2.5).

14 Moreover, when cold UDP-Gal was used in place of UDP-[ C]Gal, the [AO]7:GalT

177

assay product was precisely identified by mass spectrometry analysis (Fig. 2.9). The sequences of the peptide acceptors mimic the Ala-Hyp dipeptide motifs in naturally occurring AGP molecules; thus the assay is most likely to detect AGP specific GalT activities. Indeed, the substrate specificity test showed that the enzyme activity only used

[AO]n (where n=7, 14 or 51 as tested in the current research) as the substrate acceptors

but not peptides representing an EXT protein backbone sequence (Fig. 2.8). This in vitro

AGP GalT assay can be used to monitor AGP GalT activities during purification of the

GalT enzyme, or alternatively, the assay can be (and was) utilized for the functional

verification of candidate AGP GalT genes expressed in heterologous expression systems.

The Hyp contiguity hypothesis predicted that clustered non-contiguous Hyp residues are sites for the attachment of AG polysaccharides and contiguous Hyp residues are sites of HRGP arabinosylation (Kieliszewski and Lamport, 1994; Kieliszewski et al., 1995).

While the [AO]n peptides represented a clustered, non-contiguous Hyp motif, the EXT

peptides represented a contiguous Hyp motif. The identification of the Hyp:GalT activity

in Arabidopsis microsomal membranes specifically utilizing the [AO]n peptides but not

the EXT peptides as the substrate acceptors provided further support for the Hyp contiguity hypothesis.

The enzymatic characterization of the [AO]7:GalT and d[AO]51:GalT activities in

Arabidopsis microsomal membranes provided insights into AGP GalT activities.

Enzymatic characterization showed that the [AO]7:GalT activity significantly increased

in the presence of Mn2+ divalent ions (Fig. 2.7). The nucleotide sugar-dependent GTs

adopt either a GT-A or a GT-B structural fold (Lairson et al., 2008). For the GTs of the

178

GT-A type, an essential divalent cation, coordinated by the DXD motif, facilitates departure of the nucleoside diphosphate leaving group by electrostatically stabilizing the developing negative charge. In contrast, GTs of the GT-B type use a metal ion- independent method to facilitate leaving group departure. Eukaryotic members possessing the GT-A fold are typically type II membrane proteins, i.e. having a short N- terminal cytoplasmic domain that is followed by transmembrane and stem regions leading to the globular catalytic region (Breton and Imberty, 1999). Thus, the enzyme responsible for the [AO]7:GalT activity is likely to be a type II membrane protein processing the GT-A fold in its catalytic domain. Previous studies suggested that AGP

GalT activities for the extension of AG side chains are localized to the Golgi apparatus

(Mascara and Fincher, 1982; Schibeci et al., 1984; Kato et al., 2003). Oka et al. (2010) proposed that the first GalT activity for AGP glycosylation, namely Hyp:GalT activity

(an equivalent activity to the [AO]7:GalT activity), is located in the ER and the GalT activities after the initiation step are located in the Golgi. Using a similar sucrose density gradient strategy but different organelle markers as Oka et al. (2010), the ER marker and the Golgi marker could not be completely resolved on a sucrose density gradient in the current study (Fig. 2.10). Moreover, the [AO]7:GalT and d[AO]51:GalT activities have similar distribution profiles to each other and to both ER and Golgi markers on the sucrose density gradient. Results from sucrose density gradient experiments indicate that

AGP GalT activities are located to the endomembrane system but are not sufficient to provide precise localization information of the enzymes. First, sucrose density gradients are limited in achieving complete separation of ER and Golgi organelles in Arabidopsis

179

and tobacco. Second, marker enzymes used to identify Golgi or ER fractions may not be

representative of all the sub-compartments of ER and Golgi organelles.

Using a bioinformatics strategy, 20 Arabidopsis genes were predicted to encodeβ-

(13)-GalTs for AGP glycosylation. Six of the 20 genes were further assigned as candidate GalTs that function in the initiation of AGP glycosylation by transferring the first Gal residue to the protein backbone (Qu et al., 2008; Egelund et al., 2011). All of the six candidate GalTs were predicted to be Golgi-localized (Table 3.1) and to possess a

GT-A fold (Coutinho et al., 2003), which is consistent with the proposed localization and conformation of the enzymes corresponding to the [AO]7:GalT activity. I have referred to

the six candidate GalTs in this dissertation as GalT1 to GalT6 and initiated the functional

characterization of these GalTs by expressing five of the six genes (GalT1, GalT3,

GalT4, GalT5 and GalT6) in heterologous expression systems. Genetic constructs for the

expression of all of the five genes in two expression systems were successfully built. The

preliminary data indicated that all of the five genes were expressed in Pichia cells but

none of the Pichia expressed proteins tested so far showed [AO]7:GalT activity or

d[AO]51:GalT activity. However, GalT3 and GalT6, when expressed in tobacco BY2

suspension cultured cells under the control of a heat shock promoter, displayed higher

[AO]7:GalT activity compared to the wild type BY2 cells after a heat shock treatment

(Table 3.4). Some of the BY2 cell lines, transformed with GalT5 or GalT6 under the control of a 2x35S promoter, displayed higher GalT activity compared to wild type BY2 cells in GalT assays containing both the [AO]7 and d[AO]51 substrate acceptors.

Nonetheless, the preliminary results of activity tests of heterologously expressed GalTs

180

require confirmation by further tests with optimized expression conditions. In addition,

subcellular localization of GalT3 with a transient expression system confirmed the Golgi

localization of GalT3 as predicted by bioinformatics methods (Table 3.1; Fig. 3.13).

Future work needs to determine subcellular localization of all the candidate AGP GalTs as well as the substrate specificity of the GalTs, i.e. whether the GalTs are specific for the

[AO]7 or d[AO]51 substrate acceptor, or utilize both of the substrate acceptors. The precise subcellular localization of AGP glycosylation will be determined only when the

GalT localization data and the GalT specificity data are both available.

To study the biological functions of the FUT4 and FUT6 genes previously

characterized by a biochemical approach as AGP FUTs (Wu et al., 2010), fut4, fut6 and

fut4/fut6 genetic mutants in Arabidopsis were obtained and characterized. Sugar

composition analysis of AGPs from the fut mutant plants compared to wild type plants

indicated that FUT4 was required for fucosylation of leaf AGPs while both FUT4 and

FUT6 contributed to fucosylation of root AGPs (Table 4.2). Non-fucosylated AGPs

present in the mutants did not affect the phenotype of Arabidopsis grown under

physiological conditions, but nonetheless resulted in biochemical changes in overall cell

wall structure in the fut4, fut6 and fut4/fut6 mutants as indicated by glycome profiling

(Figs. 4.8 and 4.9). Based on this profiling work, I propose that the Fuc residues present

in AGPs function to regulate interactions between AGPs and other cell wall molecules

and thus contribute to overall cell wall architecture. Previous biochemical

characterization of the heterologously expressed FUT4 and FUT6 proteins suggested that

the two enzymes differentially fucosylate AGPs (Wu et al., 2010). In the current study,

181

fut4 mutant roots showed an eel lectin-staining pattern which was similar to wild type

roots but this staining pattern was absent in fut6 and fut4/fut6 mutant roots. The cell wall

fractions from the fut4 mutant roots also showed a distinct glycome profile compared to

the glycome profiles of cell wall fractions from the fut6 and fut4/fut6 mutant roots. In

addition, the expression of the FUT4 and FUT6 genes was differentially affected by non-

physiological treatments. Taken together, FUT4 and FUT6 are likely to catalyze the

addition of Fuc residues to different sites on the AG side chains as suggested by Wu et al.

(2010). Moreover, differentially fucosylated AGPs catalyzed by FUT4 and FUT6 may

play different physiological roles in plants.

Based on the concept of the Hyp contiguity hypothesis (Kieliszewski and Lamport,

1994; Kieliszewski and Shpak, 2001) and the GalT and FUT activities studied in this

research, a model is proposed for HRGP hydroxylation and Hyp glycosylation as depicted in Fig. 5.1. In the model, the N-terminal signal peptides direct the co-

translational translocation of HRGP core backbones into the ER lumen. In the ER,

approximately 65 % of the AGP molecules are predicted to be modified by the addition

of a GPI anchor to the C terminus of the protein backbone and become attached to the ER

membrane. The AGP molecules without a GPI anchor attachment site remain soluble in

ER lumen. Prolyl residues in the AGP backbone sequence are recognized by membrane

bound prolyl-4-hydroxylases (P4Hs) in the ER and modified into clustered, non-

contiguous Hyp motifs which are recognized by AGP Hyp:GalTs located in the ER or

Golgi to initiate AGP glycosylation. The AGP Hyp:GalTs are likely to contain a galectin

domain, which is absent in the GTs for the extension of AG polysaccharides. The

182

structural model of AG sidechains on AGPs from tobacco BY2 cells indicates that a ~15

sugar unit constitutes the building block of AG polysaccharides. AGP glycosylation in

Golgi may involve GTs for the formation of the ~15-sugar unit, which is repeatedly assembled to form AG polysaccharides. Given the specificity of GTs, at least eleven GT activities are required to synthesize the 15-sugar unit (Table 5.1 and Fig. 5.2). In species

other than tobacco, additional transferases are possible, as in Arabidopsis, FUT4 and

FUT6 add terminal Fuc residues to different positions of AG polysaccharides.

183

Figure 5.1 A model for HRGP hydroxylation and Hyp glycosylation through the secretary pathway.

184

Figure 5.1 (continued) The N-terminal signal peptides direct the co-translational translocation of HRGP core backbones into the ER lumen. In the ER, some of the AGP molecules are modified by the addition of a GPI-anchor to the C terminus and become attached to ER membrane. The other AGP molecules, EXT molecules and PRP molecules remain soluble in the ER lumen. Pro residues in the HRGP protein backbones are modified into Hyp residues by prolyl-4-hydroxylases (P4Hs) bound to the ER membrane. Contiguous Hyp motifs in AGP, EXT and PRP molecules are recognized and glycosylated by AraTs located in Golgi apparatus. Clustered, non-contiguous Hyp motifs in AGPs are recognized by β- (1,3)-GalTs with a galectin domain in addition to the GalT catalytic domain to initiate

185 glycosylation. The initiation of AG-type glycosylation may occur in the ER or Golgi while the extention of AG side chains mainly occurs in Golgi. AG polysaccharides may be considered as an assembly of ~15-sugar units. A set of eleven GTs may form an enzyme complex to construct AG polysaccharides by repeatedly forming and extending the 15-sugar units. The AG polysaccharides may be further decorated by GTs not included in the AGP GT complex, such as FUTs. Fully glycosylated AGP, EXT and PRP molecules are transported to the plant cell surface through the secretory pathway. At the plant cell surface, AGP molecules may attach to the plasma membrane through GPI anchors or be released as free molecules for AGPs without GPI anchors or after GPI anchors are processed by phospholipases. Membrane attached or free AGPs may interact with other cell wall polymers to affect the cell wall structure or perform other biological functions. The secreted EXT monomers self-assemble into insoluble networks by forming intra- and inter-molecular cross-links. The EXT network may serve as a scaffold for the deposition of newly-formed cell wall polymers. Glycosylated PRPs may form cross-links with EXT polymers to further lock the cell wall polymers in place. CW, cell wall. PM, plasma memberane. ER, endoplasmic rediculum.

Table 5.1 GT activities involved in AGP glycosylation.

GTa GT activity Functions in the Formation of the 15 AG Structural Unitb # 1 peptidyl Hyp-β-GalT Initiation of the trigalactosyl main chain # 2 β-(13)-GalT Addition of the 2nd Ga residue in the trigalactosyl main chain # 3 β-(13)-GalT Addition of the 3nd Ga residue in the trigalactosyl main chain # 4 β-(16)-GalT Formation of the linkage between the trigalactosyl main chains # 5 β-(16)-GalT Addition of Gal residues to Gal-1of the main chain # 6 β-(16)-GalT Addition of Gal residues to Gal-2 of the main chain # 7 α-(13)-AraT Substitution of the sidechain GalT residue # 8 β-(16)-GlcAT Substitution of the sidechain GalT residue # 9 α-(13)-AraT Extension of the Ara side chain # 10 α-(15)-AraT Extension of the Ara side chain # 11 α-(14)-RhaT Substitution of the terminal GlcA residue FUT4 α-(12)-FUT Addition of terminal Fuc residues FUT6 α-(12)-FUT Addition of terminal Fuc residues at the sites different from FUT4 aGT#1 to #11 are predicted GT activities involved in the formation of the core 15-sugar unit in AG polysaccharides. FUT4 and FUT6 are two FUTs identified for AGP fucosylation in Arabidopsis. bLinkages in the AG side chain formed by the predicted GT activities were shown in Fig. 5.2.

186

Figure 5.2 Predicted GT activities involved in the formation of a 15-sugar unit for AGP glycosylation. The predicted GT activities (from GT#1 to GT#11) were listed in table 5.1. The sites of the addition of terminal Fuc residues catalyzed by the Arabidopsis FUT4 and FUT6 are hypothetical.

The GTs are most likely to be type II membrane proteins, however, they may also adopt other strategies to be retained in the Golgi for their functionality. For example, GTs without a transmembrane domain may be anchored in the Golgi by associating with a GT which contains a transmembrane domain (Atmodjo et al., 2011). The GTs may form a complex for AGP glycosylation, thus individually expressed GalTs showed either no (as for GalT1, GalT3, GalT4, GalT5 and GalT6) or low GalT activity (as for GalT2) in the

187

Pichia expression system. In the tobacco BY2 expression system, heterologously expressed Arabidopsis GalTs may interact with tobacco GT partners and thus showed detectable GalT activities. However, FUT4 and FUT6 are apparently not essential

components of the AGP GT complex because AGPs from the fut4/fut6 double mutant

showed minimal sugar composition changes except for the absence of Fuc residues.

Different types of AGPs, including AGPs with or without a GPI anchor and hybrid and

chimeric AGPs with non-AGP domains, may be modified through a common

glycosylation pathway and be transported to the plant cell surface where AGPs interact

with other cell wall polymers to affect the cell wall structure or perform other biological

functions.

In contrast to AGP molecules, Pro containing motifs in EXT and PRP molecules are

modified into contiguous Hyp motifs by P4Hs in ER. In Golgi, the contiguous Hyp

motifs are glycosylated with arabino-tetrasaccharide sidechains probably by an AraT

complex, which involves one peptidyl Hyp-AraT, two β-(12)-AraTs and one α-(13)-

AraT. EXT and PRP monomers are soluble in the ER and Golgi. After being transported

to the plant cell surface, glycosylated EXT monomers self-assemble into insoluble

networks by forming intra- and inter-molecular cross-links through the Tyr-Xaa-Tyr-Lys

and Val-Tyr-Lys motifs. The EXT network may serve as a scaffold for the deposition of

newly-formed cell wall polymers. Glycosylated PRPs also form cross-links with each

other and with EXT polymers to further lock cell wall polymers in place.

The functions of HRGP subfamilies are achieved by a combination of factors,

including the various functional domains of the protein backbone, the glycosylation

188 pattern of the Hyp motifs and some potential structural modifications at the cell surface, such as partial degradation of sugar sidechains, a release of AGP molecules from the GPI anchor by phospholipases and cross-linking of EXT and PRP molecules. The expression of HRGP backbones, specific GTs and the sugar modifying enzymes may all be subjected to spacial, temporal and environmental controls. This model is proposed to summarize the current knowledge and speculations about HRGP glycosylation, which require further experiments for its validation.

189

REFERENCES

Acosta-Garcia G, Vielle-Calzada JP (2004) A classical arabinogalactan protein is essential for the initiation of female gametogenesis in Arabidopsis. Plant Cell 16: 2614-2628

Albersheim P, Darvill A, Roberts K, Sederoff R, Staehelin A (2010) Plant cell walls- from chemistry to biology. Garland Science, New York

Albersheim P, Darvill A, Roberts K, Sederoff R, Staehelin A (2011) Plant cell walls- from chemistry to biology. Garland Science, New York

Anders N, Wilkinson MD, Lovegrove A, Freeman J, Tryfona T, Pellny TK, Weimar T, Mortimer JC, Stott K, Baker JM, Defoin-Platel M, Shewry PR, Dupree P, Mitchell RAC (2012) Glycosyl transferases in family 61 mediate arabinofuranosyl transfer onto xylan in grasses. Proc Natl Acad Sci USA

Atmodjo MA, Sakuragi Y, Zhu X, Burrell AJ, Mohanty SS, Atwood JA, III, Orlando R, Scheller HV, Mohnen D (2011) Galacturonosyltransferase (GAUT)1 and GAUT7 are the core of a plant cell wall pectin biosynthetic homogalacturonan: galacturonosyltransferase complex. Proc Natl Acad Sci USA 108: 20225-20230

Baldwin TC, McCann MC, Roberts K (1993) A novel hydroxyproline-deficient arabinogalactan protein secreted by suspension-cultured cells of daucus carota (purification and partial characterization). Plant Physiol 103: 115-123

Barnett T, Altschuler M, McDaniel CN, Mascarenhas JP (1979) Heat shock induced proteins in plant cells. Dev Genet 1: 331-340

Bauer S, Vasu P, Persson S, Mort AJ, Somerville CR (2006) Development and application of a suite of polysaccharide-degrading enzymes for analyzing plant cell walls. Proc Natl Acad Sci USA 103: 11417-11422

Beeckman T, Przemeck GKH, Stamatiou G, Lau R, Terryn N, De Rycke R, Inze D, Berleth T (2002) Genetic complexity of cellulose synthase A gene function in Arabidopsis embryogenesis. Plant Physiol 130: 1883-1893

Bolwell GP (1986) Microsomal arabinosylation of polysaccharide and elicitor-induced carbohydrate-binding glycoprotien in french bean. Phytochemistry 25: 1807-1813

Bonin CP, Potter I, Vanzin GF, Reiter W-D (1997) The MUR1 gene of Arabidopsis thaliana encodes an isoform of GDP-D-mannose-4,6-dehydratase, catalyzing the first step in the de novo synthesis of GDP-L-fucose. Proc Natl Acad Sci USA 94: 2085-2090

190

Borner GHH, Sherrier DJ, Stevens TJ, Arkin IT, Dupree P (2002) Prediction of glycosylphosphatidylinositol-anchored proteins in Arabidopsis. A genomic analysis. Plant Physiol 129: 486-499

Bosch M, Mayer CD, Cookson A, Donnison IS (2011) Identification of genes involved in cell wall biogenesis in grasses by differential gene expression profiling of elongating and non-elongating maize internodes. J Exp Bot 62: 3545-3561

Boyes DC, Zayed AM, Ascenzi R, McCaskill AJ, Hoffman NE, Davis KR, Gorlach J (2001) Growth stage-based phenotypic analysis of Arabidopsis: a model for high throughput functional genomics in plants. Plant Cell 13: 1499-1510

Breton C, Imberty A (1999) Structure/function studies of glycosyltransferases. Curr Opin Struct Biol 9: 563-571.

Brown DM, Goubet F, Vicky WWA, Goodacre R, Stephens E, Dupree P, Turner SR (2007) Comparison of five xylan synthesis mutants reveals new insight into the mechanisms of xylan synthesis. Plant J 52: 1154-1168

Brown DM, Zeef LAH, Ellis J, Goodacre R, Turner SR (2005) Identification of novel genes in Arabidopsis involved in secondary cell wall formation using expression profiling and reverse genetics. Plant Cell 17: 2281-2295

Brown DM, Zhang ZN, Stephens E, Dupree P, Turner SR (2009) Characterization of IRX10 and IRX10-like reveals an essential role in glucuronoxylan biosynthesis in Arabidopsis. Plant J 57: 732-746

Burn JE, Hocart CH, Birch RJ, Cork AC, Williamson RE (2002) Functional analysis of the cellulose synthase genes CesA1, CesA2, and CesA3 in Arabidopsis. Plant Physiol 129: 797-807.

Burton RA, Gidley MJ, Fincher GB (2010) Heterogeneity in the chemistry, structure and function of plant cell walls. Nat Chem Biol 6: 724-732

Caffall KH, Pattathil S, Phillips SE, Hahn MG, Mohnen D (2009) Arabidopsis thaliana T-DNA mutants implicate GAUT genes in the biosynthesis of pectin and xylan in cell walls and seed testa. Molecular Plant 2: 1000-1014

Campbell JA, Davies GJ, Bulone V, Henrissat B (1998) A classification of nucleotide- diphospho-sugar glycosyltransferases based on amino acid sequence similarities. Biochem J 329: 719-719

191

Cannon MC, Terneus K, Hall Q, Tan L, Wang Y, Wegenhart BL, Chen L, Lamport DT, Chen Y, Kieliszewski MJ (2008) Self-assembly of the plant cell wall requires an extensin scaffold. Proc Natl Acad Sci USA 105: 2226-2231

Cano-Delgado A, Penfield S, Smith C, Catley M, Bevan M (2003) Reduced cellulose synthesis invokes lignification and defense responses in Arabidopsis thaliana. Plant J 34: 351-362

Carpita N, McCann M (2000) The cell wall. In BB Buchanan, W Gruissem, R Jones, eds, Biochemistry and Molecular Biology of Plants, Ed 1. American Society of Plant Physiologists, Rockville, MD, pp 52-109

Carpita NC (1989) Pectic polysaccharides of maize coleoptiles and proso millet cells in liquid culture. Phytochemistry 28: 121-125

Carpita NC, Gibeaut DM (1993) Structural models of primary-cell walls in flowering plants - consistency of molecular-structure with the physical-properties of the walls during growth. Plant J 3: 1-30

Cavalier DM, Keegstra K (2006) Two xyloglucan xylosyltransferases catalyze the addition of multiple xylosyl residues to cellohexaose. J Biol Chem 281: 34197- 34207

Cavalier DM, Lerouxel O, Neumetzler L, Yamauchi K, Reinecke A, Freshour G, Zabotina OA, Hahn MG, Burgert I, Pauly M, Raikhel NV, Keegstra K (2008) Disrupting two Arabidopsis thaliana xylosyltransferase genes results in plants deficient in xyloglucan, a major primary cell wall component. Plant Cell 20: 1519- 1537

Cheung AY, Wang H, Wu HM (1995) A floral transmitting tissue-specific glycoprotein attracts pollen tubes and stimulates their growth. Cell 82: 383-393

Churms SC, Stephen AM, Siddiqui IR (1981) Evidence for repeating sub-units in the molecular structure of the acidic arabinogalactan from rapeseed (Brassica compestris). Carbohydrate Research 94: 119-122

Clarke A, Gleeson P, Harrison S, Knox RB (1979) Pollen-stigma interactions - identification and characterization of surface components with recognition potential. Proc Natl Acad Sci USA 76: 3358-3362

Cocuron JC, Lerouxel O, Drakakaki G, Alonso AP, Liepman AH, Keegstra K, Raikhel N, Wilkerson CG (2007) A gene from the cellulose synthase-like C family encodes a beta-1,4 glucan synthase. Proc Natl Acad Sci USA 104: 8550-8555

192

Cosgrove DJ (2000) New genes and new biological roles for expansins. Curr Opin Plant Biol 3: 73-78.

Coutinho PM, Deleury E, Davies GJ, Henrissat B (2003) An evolving hierarchical family classification for glycosyltransferases. J Mol Biol 328: 307-317

delCardayre SB, Raines RT (1994) Structural determinants of enzymic processivity Biochemistry 33: 6031-6037

Desprez T, Vernhettes S, Fagard M, Refregier G, Desnos T, Aletti E, Py N, Pelletier S, Hofte H (2002) Resistance against herbicide isoxaben and cellulose deficiency caused by distinct mutations in same cellulose synthase isoform CESA6. Plant Physiol 128: 482-490

Dhugga KS, Barreiro R, Whitten B, Stecca K, Hazebroek J, Randhawa GS, Dolan M, Kinney AJ, Tomes D, Nichols S, Anderson P (2004) Guar seed beta-mannan synthase is a member of the cellulose synthase super gene family. Science 303: 363- 366.

Dhugga KS, Ray PM (1994) Purification of 1,3-beta-D-glucan synthase activity from pea tissue - 2 polypeptides of 55 kDa and 70 kDa copurify with enzyme-activity. Eur J Biochem 220: 943-953

Ding L, Zhu JK (1997) A role for arabinogalactan-proteins in root epidermal cell expansion. Planta 203: 289-294

Doblin M, Vergara C, Read S, Newbigin E, Bacic A (2003) Plant cell wall synthesis: making the bricks. In R JCK, ed, Annual Plant Reviews, Vol 8. Sheiffield: Blackwell Publishing – CRC Press, pp 183-222

Du J, Mansfield SD, Groover AT (2009) The Populus homeobox gene ARBORKNOX2 regulates cell differentiation during secondary growth. Plant J 60: 1000-1014

DuBois M, Gilles KA, Hamilton JK, Rebers PA, Smith F (1956) Colorimetric method for determination of sugars and related substances. Analytical Chemistry 28: 350- 356

Ebringerová A, Hromádková Z, Heinze T (2005) Hemicellulose. In T Heinze, ed, Polysaccharides I, Structure, Characterization and Use, Advances in Polymer Science, Vol 186. Springer Berlin / Heidelberg, pp 1-67

Edwards ME, Dickson CA, Chengappa S, Sidebottom C, Gidley MJ, Reid JSG (1999) Molecular characterisation of a membrane-bound galactosyltransferase of plant cell wall matrix polysaccharide biosynthesis. Plant J 19: 691-697

193

Edwards ME, Marshall E, Gidley MJ, Reid JSG (2002) Transfer specificity of detergent-solubilized fenugreek galactomannan galactosyltransferase. Plant Physiol 129: 1391-1397

Egelund J, Ellis M, Doblin M, Qu Y, Bacic A (2011) Genes and enzymes of the GT31 family: towards unravelling the function(s) of the plant glycosyltransferase family members. In P Ulvskov, ed, Annual Plant Reviews, Vol 41. Wiley-Blackwell, pp 213-234

Egelund J, Petersen BL, Motawia MS, Damager I, Faik A, Olsen CE, Ishii T, Clausen H, Ulvskov P, Geshi N (2006) Arabidopsis thaliana RGXT1 and RGXT2 encode Golgi-localized (1,3)-alpha-D-xylosyltransferases involved in the synthesis of pectic rhamnogalacturonan-II. Plant Cell 18: 2593-2607

Ellis C, Karafyllidis I, Wasternack C, Turner JG (2002) The Arabidopsis mutant cev1 links cell wall signaling to jasmonate and ethylene responses. Plant Cell 14: 1557- 1566

Ellis M, Egelund J, Schultz CJ, Bacic A (2010) Arabinogalactan-proteins: Key regulators at the cell surface? Plant Physiol 153: 403-419

Endo M, Konishi T, Kotake T, Kimura K, Tsumuraya Y (2004) Beta- from radish primary roots involved in the synthesis of arabinogalactan-proteins. Glycobiology 14: 1122-1122

Endo M, Maruyama Y, Kotake T, Tsumuraya Y (2003) Beta-glucuronosyltransferase from radish primary roots involving in the synthesis of arabinogalactan-proteins. Plant Cell Phiol 44: S49-S49

Epstein L, Lamport DTA (1984) An intramolecular linkage involving isodityrosine in extensin. Phytochemistry 23: 1241-1246

Esquerre-Tugaye MT, Lamport DTA (1979) Cell surfaces in plantmicroorganism interactions. I. A structural investigation of cell wall hydroxyproline-rich glycoproteins which accumulate in fungusinfected plants. Plant Physiol 64: 314-319 Esquerre-Tugaye MT, Mazau D (1974) Effect of a fungal disease on extensin, plant- cell wall glycoprotein. J Exp Bot 25: 509-513

Estevez JM, Kieliszewski MJ, Khitrov N, Somerville C (2006) Characterization of synthetic hydroxyproline-rich proteoglycans with arabinogalactan protein and extensin motifs in Arabidopsis. Plant Physiol 142: 458-470

194

Everdeen DS, Kiefer S, Willard JJ, Muldoon EP, Dey PM, Li XB, Lamport DT (1988) Enzymic cross-linkage of monomeric extensin precursors in vitro. Plant Physiol 87: 616-621.

Faik A, Bar-Peled M, DeRocher AE, Zeng WQ, Perrin RM, Wilkerson C, Raikhel NV, Keegstra K (2000) Biochemical characterization and molecular cloning of an alpha-1,2-fucosyltransferase that catalyzes the last step of cell wall xyloglucan biosynthesis in pea. J Biol Chem 275: 15082-15089

Faik A, Price NJ, Raikhel NV, Keegstra K (2002) An Arabidopsis gene encoding an alpha-xylosyltransferase involved in xyloglucan biosynthesis. Proc Natl Acad Sci USA 99: 7797-7802

Farrokhi N, Burton RA, Brownfield L, Hrmova M, Wilson SM, Bacic A, Fincher GB (2006) Plant cell wall biosynthesis: genetic, biochemical and functional genomics approaches to the identification of key genes. Plant Biotechnol J 4: 145- 167

Farrokhi N, Hrmova M, Burton RA, Fincher GB (2009) Heterologous and cell free protein expression systems. Methods Mol Biol 513: 175-198

Fincher GB, Stone BA, Clarke AE (1983) Arabinogalactan-proteins - structure, biosynthesis, and function. Annu Rev Plant Physiol Plant Mol Biol 34: 47-70

Fleischer A, O'Neill MA, Ehwald R (1999) The pore size of non-graminaceous plant cell walls is rapidly decreased by borate ester cross-linking of the pectic polysaccharide rhamnogalacturonan II. Plant Physiol 121: 829-838

Fowler TJ, Bernhardt C, Tierney ML (1999) Characterization and expression of four proline-rich cell wall protein genes in Arabidopsis encoding two distinct subsets of multiple domain proteins. Plant Physiol 121: 1081-1091

Francisco SM, Tierney ML (1990) Isolation and characterization of a proline-rich cell wall protein from soybean seedlings. Plant Physiol 94: 1897-1902.

Fry SC (1988) The Growing Plant Cell Wall: Chemical and Metabolic Analysis. Longman Scientific & Technical, Essex

Gao M, Kieliszewski MJ, Lamport DT, Showalter AM (1999) Isolation, characterization and immunolocalization of a novel, modular tomato arabinogalactan-protein corresponding to the LeAGP-1 gene. Plant J 18: 43-55

195

Gao M, Showalter AM (1999) Yariv reagent treatment induces programmed cell death in Arabidopsis cell cultures and implicates arabinogalactan protein involvement. Plant J 19: 321-331

Gardiner JC, Taylor NG, Turner SR (2003) Control of cellulose synthase complex localization in developing xylem. Plant Cell 15: 1740-1748

Gaspar Y, Johnson KL, McKenna JA, Bacic A, Schultz CJ (2001) The complex structures of arabinogalactan-proteins and the journey towards understanding function. Plant Mol Biol 47: 161-176

Gaspar YM, Nam J, Schultz CJ, Lee L-Y, Gilson PR, Gelvin SB, Bacic A (2004) Characterization of the Arabidopsis lysine-rich arabinogalactan-protein ATAGP17 mutant (rat1) that results in a decreased efficiency of Agrobacterium transformation. Plant Physiol 135: 2162-2171

Gibeaut DM, Carpita NC (1990) Separation of membranes by flotation centrifugation for in vitro synthesis of plant-cell wall polysaccharides. Protoplasma 156: 82-93

Gibeaut DM, Hulett J, Cramer GR, Seemann JR (1997) Maximal biomass of Arabidopsis thaliana using a simple, low-maintenance hydroponic method and favorable environmental conditions. Plant Physiol 115: 317-319

Gille SFa, Hansel UF, Ziemann MF, Pauly M (2009) Identification of plant cell wall mutants by means of a forward chemical genetic approach using . Proceedings of the National Academy of Sciences, USA 106: 14699-14704

Gilson P, Gaspar YM, Oxley D, Youl JJ, Bacic A (2001) NaAGP4 is an arabinogalactan protein whose expression is suppressed by wounding and fungal infection in Nicotiana alata. Protoplasma 215: 128-139

Giunter EA, Popeĭko OV, Ovodov Iu S (2004) Isolation of polysaccharides from callus culture of Lemna minor L. Prikl Biokhim Mikrobiol 40: 94-97

Goodrum LJ, Patel A, Leykam JF, Kieliszewski MJ (2000) Gum arabic glycoprotein contains glycomodules of both extensin and arabinogalactan-glycoproteins. Phytochemistry 54: 99-106

Gordon R, Maclachlan G (1989) Incorporation of UDP-[14C]glucose into xyloglucan by pea membranes. Plant Physiol 91: 373-378

Gorshkova TA, Wyatt SE, Salnikov VV, Gibeaut DM, Ibragimov MR, Lozovaya VV, Carpita NC (1996) Cell-wall polysaccharides of developing flax plants. Plant Physiol 110: 721-729

196

Goto M (2007) Protein O-glycosylation in fungi: diverse structures and multiple functions. Biosci, Biotechnol, Biochem 71: 1415-1427

Goubet F, Misrahi A, Park SK, Zhang ZN, Twell D, Dupree P (2003) AtCSLA7, a cellulose synthase-like putative glycosyltransferase, is important for pollen tube growth and embryogenesis in Arabidopsis. Plant Physiol 131: 547-557

Hall Q, Cannon MC (2002) The cell wall hydroxyproline-rich glycoprotein RSH is essential for normal embryo development in Arabidopsis. Plant Cell 14: 1161-1172 Hanisch FA (2001) O-glycosylation of the mucin type. Biol Chem 382: 143-149

Hashimoto Y (2000) Structure and biosynthesis of L-fucosylated arabinogalactan- proteins in cruciferous plants In EA Nothnagel, A Bacic, AE Clarke, eds, Cell and Developmental Biology of Arabinogalactan-Proteins. Kluwer Academic/Plenum Publishers, New York, pp 51-60

Hayashi T, Maclachlan G (1984) and glycoproteins formed from UDP- galactose by pea membranes. Phytochemistry 23: 487-492

Held MA, Tan L, Kamyab A, Hare M, Shpak E, Kieliszewski MJ (2004) Di- isodityrosine is the intermolecular cross-link of isodityrosine-rich extensin analogs cross-linked in vitro. J Biol Chem 279: 55474-55482

Hieta R, Myllyharju J (2002) Cloning and characterization of a low molecular weight prolyl 4-hydroxylase from Arabidopsis thaliana. J Biol Chem 277: 23965-23971

Hijazi M, Durand J, Pichereaux C, Pont F, Jamet E, Albenne C (2012) Characterization of the Arabinogalactan Protein 31 (AGP31) of Arabidopsis thaliana: new advances on the Hyp-O-glycosylation of the pro-rich domain. J Biol Chem 287: 9623-9632. Epub 2012 Jan 9623.

Hong ZL, Delauney AJ, Verma DPS (2001) A cell plate specific callose synthase and its interaction with phragmoplastin. Plant Cell 13: 755-768

Huang X, Madan A (1999) CAP3: A DNA sequence assembly program. Genome Res 9: 868-877

Immerzeel P, Eppink MM, De Vries SC, Schols HA, Voragen AGJ (2006) Carrot arabinogalactan proteins are interlinked with pectins. Physiol Plant 128: 18-28

Iraki NM, Singh N, Bressan RA, Carpita NC (1989) Cell-walls of tobacco cells and changes in composition associated with reduced growth upon adaptation to water and saline stress. Plant Physiol 91: 48-53

197

Ishii T, Matsunaga T (2001) Pectic polysaccharide rhamnogalacturonan II is covalently linked to homogalacturonan. Phytochemistry 57: 969-974

Jacobs AK, Lipka V, Burton RA, Panstruga R, Strizhov N, Schulze-Lefert P, Fincher GB (2003) An Arabidopsis callose synthase, GSL5, is required for wound and papillary callose formation. Plant Cell 15: 2503-2513

Jamet E, Canut H, Boudart G, Pont-Lezica RF (2006) Cell wall proteins: a new insight through proteomics. Trends Plant Sci 11: 33-39

Jensen JK, Sorensen SO, Harholt J, Geshi N, Sakuragi Y, Moller I, Zandleven J, Bernal AJ, Jensen NB, Sorensen C, Pauly M, Beldman G, Willats WGT, Scheller HV (2008) Identification of a xylogalacturonan xylosyltransferase involved in pectin biosynthesis in Arabidopsis. Plant Cell 20: 1289-1302

Johnson KL, Kibble NAJ, Bacic A, Schultz CJ (2011) A fasciclin-like arabinogalactan-protein (FLA) mutant of Arabidopsis thaliana, fla1, shows defects in shoot regeneration. Plos One 6

Jones MA, Raymond MJ, Smirnoff N (2006) Analysis of the root-hair morphogenesis transcriptome reveals the molecular identity of six genes with roles in root-hair development in Arabidopsis. Plant J 45: 83-100

Karr AL (1972) Isolation of an enzyme system which will catalyze the glycosylation of extensin. Plant Physiol 50: 275-282

Kato H, Takeuchi Y, Tsumuraya Y, Hashimoto Y, Nakano H, Kovac P (2003) In vitro biosynthesis of galactans by membrane-bound galactosyltransferase from radish (Raphanus sativus L.) seedlings. Planta 217: 271-282

Kato Y, Ito S, Iki K, Matsuda K (1982) Xyloglucan and rβ-D-glucan in cell walls of rice seedlings. Plant and Cell Physiology 23: 351-364

Kawasaki S (1987) Synthesis of arabinose-containing cell-wall precursors in suspension- cultured tobacco cells .4. Differential subfractionation and comparison of the acidic precursors. Plant and Cell Physiology 28: 925-935

Kawasaki S (1987) Synthesis of arabinose-containing cell-wall precursors in suspension- cultured tobacco cells. 3. Purification and some properties of the major component Plant Cell Phiol 28: 187-197

Keegstra K, Raikhel N (2001) Plant glycosyltransferases. Curr Opin Plant Biol 4: 219- 224

198

Keppler BD, Showalter AM (2010) IRX14 and IRX14-LIKE, two glycosyl transferases involved in glucuronoxylan biosynthesis and drought tolerance in Arabidopsis. Molecular Plant 3: 834-841

Keskiaho K, Hieta R, Sormunen R, Myllyharju J (2007) Chlamydomonas reinhardtii has multiple prolyl 4-hydroxylases, one of which is essential for proper cell wall assembly. Plant Cell 19: 256-269

Kieliszewski M, de Zacks R, Leykam JF, Lamport DT (1992) A repetitive proline-rich protein from the gymnosperm Douglas fir is a hydroxyproline-rich glycoprotein. Plant Physiol 98: 919-926

Kieliszewski MJ (2001) The latest hype on Hyp-O-glycosylation codes. Phytochemistry 57: 319-323

Kieliszewski MJ, Lamport DTA (1994) Extensin - repetitive motifs, functional sites, posttranslational codes, and phylogeny. Plant J 5: 157-172

Kieliszewski MJ, Oneill M, Leykam J, Orlando R (1995) Tandem mass-spectrometry and structural elucidation of glycopeptides from a hydroxyproline-rich plant-cell wall glycoprotein indicate that contiguous hydroxyproline residues are the major sites of hydroxyproline O-arabinosylation. J Biol Chem 270: 2541-2549

Kieliszewski MJ, Shpak E (2001) Synthetic genes for the elucidation of glycosylation codes for arabinogalactan-proteins and other hydroxyproline-rich glycoproteins. Cell Mol Life Sci 58: 1386-1398

Knox JP (2008) Revealing the structural and functional diversity of plant cell walls. Curr Opin Plant Biol 11: 308-313

Kohorn BD (2000) Plasma membrane-cell wall contacts. Plant Physiol 124: 31-38

Lairson LL, Henrissat BF, Davies GJ, Withers SG (2008) Glycosyltransferases: structures, functions, and mechanisms. Annu Rev Biochem 77: 521-555

Lamport DT, Kieliszewski MJ, Showalter AM (2006) Salt stress upregulates periplasmic arabinogalactan proteins: using salt stress to analyse AGP function. New Phytol 169: 479-492

Lamport DTA (1986) The primary cell wall: a new model. In RA Young, RM Rowell, eds, Cellulose: Structure, Modification and Hydrolysis. John Wiley, New York, pp 77-90

199

Lamport DTA, Kieliszewski MJ, Chen Y, Cannon MC (2011) Role of the extensin superfamily in primary cell wall architecture. Plant Physiol 156: 11-19

Lamport DTA, Miller DH (1971) Hydroxyproline arabinosides in the plant kingdom. Plant Physiol 48: 454-456

Langan KJ, Nothnagel EA (1997) Cell surface arabinogalactan-proteins and their relation to cell proliferation and viability. Protoplasma 196: 87-98

Lee C, Zhong R, Ye ZH (2012) Arabidopsis family GT43 members are xylan xylosyltransferases required for the elongation of xylan backbone. Plant Cell Physiol 53: 135-143

Lee CB, Swatek KN, McClure B (2008) Pollen proteins bind to the C-terminal domain of Nicotiana alata pistil arabinogalactan proteins. J Biol Chem 283: 26965-26973

Lee CH, O'Neill MA, Tsumuraya Y, Darvill AG, Ye ZH (2007) The irregular xylem 9 mutant is deficient in xylan xylosyltransferase activity. Plant and Cell Physiology 48: 1624-1634

Liang Y, Faik A, Kieliszewski M, Tan L, Xu WL, Showalter AM (2010) Identification and characterization of in vitro galactosyltransferase activities involved in arabinogalactan-protein glycosylation in tobacco and Arabidopsis. Plant Physiol 154: 632-642

Lin WD, Liao YY, Yang TJW, Pan CY, Buckhout TJ, Schmidt W (2011) Coexpression-based clustering of Arabidopsis root genes predicts functional modules in early phosphate deficiency signaling. Plant Physiol 155: 1383-1402

Lind JL, Bacic A, Clarke AE, Anderson MA (1994) A style-specific hydroxyproline- rich glycoprotein with properties of both extensins and arabinogalactan proteins. Plant J 6: 491-502

Liu CG, Mehdy MC (2007) A nonclassical arabinogalactan protein gene highly expressed in vascular tissues, AGP31, is transcriptionally repressed by methyl jasmonic acid in Arabidopsis. Plant Physiol 145: 863-874

Ma HL, Zhao J (2010) Genome-wide identification, classification, and expression analysis of the arabinogalactan protein gene family in rice (Oryza sativa L.). J Exp Bot 61: 2647-2668

MacMillan CP, Mansfield SD, Stachurski ZH, Evans R, Southerton SG (2010) Fasciclin-like arabinogalactan proteins: specialization for stem biomechanics and cell wall architecture in Arabidopsis and Eucalyptus. Plant J 62: 689-703

200

Madson M, Dunand C, Li XM, Verma R, Vanzin GF, Calplan J, Shoue DA, Carpita NC, Reiter WD (2003) The MUR3 gene of Arabidopsis encodes a xyloglucan galactosyltransferase that is evolutionarily related to animal exostosins. Plant Cell 15: 1662-1670

Mascara T, Fincher GB (1982) Biosynthesis of arabinogalactan-protein in Lolium multiflorum (ryegrass) endosperm cells. 2. In vitro incorporation of galactosyl residues from UDP-galactose into polymeric products. Aust J Plant Physiol 9: 31-45

Masuko T, Minami A, Iwasaki N, Majima T, Nishimura S-I, Lee YC (2005) Carbohydrate analysis by a phenol-sulfuric acid method in microplate format. Anal Biochem 339: 69-72

Mau SL, Chen CG, Pu ZY, Moritz RL, Simpson RJ, Bacic A, Clarke AE (1995) Molecular-cloning of cDNAs encoding the protein backbones of arabinogalactan- proteins from the filtrate of suspension-cultured cells of Pyrus communis and Nicotiana alata. Plant J 8: 269-281

Memelink J, Swords KMM, Dekam RJ, Schilperoort RA, Hoge JHC, Staehelin LA (1993) Structure and regulation of tobacco extensin. Plant J 4: 1011-1022

Merkouropoulos G, Barnett DC, Shirsat AH (1999) The Arabidopsis extensin gene is developmentally regulated, is induced by wounding, methyl jasmonate, abscisic and salicylic acid, and codes for a protein with unusual motifs. Planta 208: 212-219

Misawa H, Tsumuraya Y, Kaneko Y, Hashimoto Y (1996) alpha-L-fucosyltransferases from radish primary roots. Plant Physiol 110: 665-673

Mohnen D (2008) Pectin structure and biosynthesis. Curr Opin Plant Biol 11: 266-277

Mortimer JC, Miles GP, Brown DM, Zhang Z, Segura MP, Weimar T, Yu X, Seffen KA, Stephens E, Turner SR, Dupree P (2010) Absence of branches from xylan in Arabidopsis gux mutants reveals potential for simplification of lignocellulosic biomass. Proc Natl Acad Sci USA 107: 17409-17414

Motose H, Sugiyama M, Fukuda H (2001) An arabinogalactan protein(s) is a key component of a fraction that mediates local intercellular communication involved in tracheary element differentiation of zinnia mesophyll cells. Plant and Cell Physiology 42: 129-137

Motose H, Sugiyama M, Fukuda H (2004) A proteoglycan mediates inductive interaction during plant vascular development. Nature 429: 873-878

201

Mouly A, Rumeau D, Esquerretugaye MT (1992) Differential accumulation of hydroxyproline-rich glycoprotein transcripts in sunflower plants infected with sclerotinia-sclerotiorum or treated with oxalic-acid. Plant Sci 85: 51-59

Moura J, Bonine CAV, Viana JDF, Dornelas MC, Mazzafera P (2010) Abiotic and biotic stresses and changes in the lignin content and composition in plants. Journal of Integrative Plant Biology 52: 360-376

Nakamura A, Furuta H, Maeda H, Takao T, Nagamatsu Y (2002) Structural studies by stepwise enzymatic degradation of the main backbone of soybean soluble polysaccharides consisting of galacturonan and rhamnogalacturonan. Bioscience Biotechnology and Biochemistry 66: 1301-1313

Nam J, Mysore KS, Zheng C, Knue MK, Matthysse AG, Gelvin SB (1999) Identification of T-DNA tagged Arabidopsis mutants that are resistant to transformation by Agrobacterium. Mol Gen Genet 261: 429-438

Nishimura MT, Stein M, Hou BH, Vogel JP, Edwards H, Somerville SC (2003) Loss of a callose synthase results in salicylic acid-dependent disease resistance. Science 301: 969-972

Nothnagel EA (1997) Proteoglycans and related components in plant cells. In International Review of Cytology - a Survey of Cell Biology, Vol 174, pp 195-291

O'Neill MA, Ishii T, Albersheim P, Darvill AG (2004) Rhamnogalacturonan II: Structure and function of a borate cross-linked cell wall pectic polysaccharide. Annu Rev Plant Biol 55: 109-139

Oka T, Saito F, Shimma YI, Yoko-O T, Nomura Y, Matsuoka K, Jigami Y (2010) Characterization of endoplasmic reticulum-localized UDP-D-galactose: hydroxyproline O-galactosyltransferase using synthetic peptide substrates in Arabidopsis. Plant Physiol 152: 332-340

Oxley D, Bacic A (1999) Structure of the glycosylphosphatidylinositol anchor of an arabinogalactan protein from Pyrus communis suspension-cultured cells. Proc Natl Acad Sci USA 96: 14246-14251

Pattathil S, Avci U, Baldwin D, Swennes AG, McGill JA, Popper Z, Bootten T, Albert A, Davis RH, Chennareddy C, Dong R, O'Shea B, Rossi R, Leoff C, Freshour G, Narra R, O'Neil M, York WS, Hahn MG (2010) A comprehensive toolkit of plant cell wall glycan-directed monoclonal antibodies. Plant Physiol 153: 514-525

202

Pattathil S, Avci U, Miller JS, Hahn MG (2012) Immunological approaches to plant cell wall and biomass characterization: glycome profiling. In M Himmel, ed, Biomass Conversion: Methods and Protocols, Methods in Molecular Biology, Vol 908. Humana Press (In press), DOI 10.1007/978-1-61779-956-36

Pennell RI, Janniche L, Kjellbom P, Scofield GN, Peart JM, Roberts K (1991) Developmental regulation of a plasma-membrane arabinogalactan protein epitope in oilseed rape flowers. Plant Cell 3: 1317-1326

Perrin R, Wilkerson C, Keegstra K (2001) Golgi enzymes that synthesize plant cell wall polysaccharides: finding and evaluating candidates in the genomic era. Plant Mol Biol 47: 115-130

Perrin RM, DeRocher AE, Bar-Peled M, Zeng WQ, Norambuena L, Orellana A, Raikhel NV, Keegstra K (1999) Xyloglucan fucosyltransferase, an enzyme involved in plant cell wall biosynthesis. Science 284: 1976-1979

Perrin RM, Jia ZH, Wagner TA, O'Neill MA, Sarria R, York WS, Raikhel NV, Keegstra K (2003) Analysis of xyloglucan fucosylation in Arabidopsis. Plant Physiol 132: 768-778

Petersen BL, Egelund J, Damager I, Faber K, Jensen JK, Yang Z, Bennett EP, Scheller HV, Ulvskov P (2009) Assay and heterologous expression in Pichia pastoris of plant cell wall type-II membrane anchored glycosyltransferases. Glycoconjugate Journal 26: 1235-1246

Ponce MR, Quesada V, Micol JL (1998) Rapid discrimination of sequences flanking and within T-DNA insertions in the Arabidopsis genome. Plant J 14: 497-501.

Puhlmann J, Bucheli E, Swain MJ, Dunning N, Albersheim P, Darvill AG, Hahn MG (1994) Generation of monoclonal-antibodies against plant cell-wall polysaccharides. I. characterization of a monoclonal-antibody to a terminal alpha- (12)-linked fucosyl-containing epitope. Plant Physiol 104: 699-710

Qin XM, Yamauchi R, Aizawa K, Inakuma T, Kato K (2001) Structural features of arabinogalactan-proteins from the fruit of Lycium chinense Mill. Carbohydrate Research 333: 79-85

Qu YM, Egelund J, Gilson PR, Houghton F, Gleeson PA, Schultz CJ, Bacic A (2008) Identification of a novel group of putative Arabidopsis thaliana beta-(1,3)- galactosyltransferases. Plant Mol Biol 68: 43-59

Reid JSG, Edwards M, Gidley MJ, Clark AH (1995) Enzyme specificity in galactomannan biosynthesis. Planta 195: 489-495

203

Reid JSG, Edwards ME, Dickson CA, Scott C, Gidley MJ (2003) Tobacco transgenic lines that express fenugreek galactomannan galactosyltransferase constitutively have structurally altered galactomannans in their seed endosperm cell walls. Plant Physiol 131: 1487-1495

Reiter WD (2002) Biosynthesis and properties of the plant cell wall. Curr Opin Plant Biol 5: 536-542

Reiter WD, Chapple C, Somerville CR (1997) Mutants of Arabidopsis thaliana with altered cell wall polysaccharide composition. Plant J 12: 335-345

Robertson D, Mitchell GP, Gilroy JS, Gerrish C, Bolwell GP, Slabas AR (1997) Differential extraction and protein sequencing reveals major differences in patterns of primary cell wall proteins from plants. J Biol Chem 272: 15841-15848

Roy S, Jauh GY, Hepler PK, Lord EM (1998) Effects of Yariv phenylglycoside on cell wall assembly in the lily pollen tube. Planta 204: 450-458

Ryden P, Sugimoto-Shirasu K, Smith AC, Findlay K, Reiter WD, McCann MC (2003) Tensile properties of Arabidopsis cell walls depend on both a xyloglucan cross-linked microfibrillar network and rhamnogalacturonan II-borate complexes. Plant Physiol 132: 1033-1040

Saint-Jore CM, Evins J, Batoko H, Brandizzi F, Moore I, Hawes C (2002) Redistribution of membrane proteins between the Golgi apparatus and endoplasmic reticulum in plants is reversible and not dependent on cytoskeletal networks. Plant J 29: 661-678

Sarria R, Wagner TA, O'Neill MA, Faik A, Wilkerson CG, Keegstra K, Raikhel NV (2001) Characterization of a family of Arabidopsis genes related to xyloglucan fucosyltransferase1. Plant Physiol 127: 1595-1606

Scheible WR, Eshed R, Richmond T, Delmer D, Somerville C (2001) Modifications of cellulose synthase confer resistance to isoxaben and thiazolidinone herbicides in Arabidopsis Ixr1 mutants. Proc Natl Acad Sci USA 98: 10079-10084

Scheible WR, Pauly M (2004) Glycosyltransferases and cell wall biosynthesis: novel players and insights. Curr Opin Plant Biol 7: 285-295

Scheller HV, Ulvskov P (2011) Hemicelluloses. In S Merchant, WR Briggs, D Ort, eds, Annu Rev Plant Biol, Vol 61. Annual Reviews, Palo Alto, pp 263-289

204

Schibeci A, Pnjak A, Fincher GB (1984) Biosynthesis of arabinogalactan-protein in Lolium multiflorum (Italian ryegrass) endosperm cells - subcellular-distribution of galactosyltransferases. Biochem J 218: 633-636

Schnabelrauch LS, Kieliszewski M, Upham BL, Alizedeh H, Lamport DT (1996) Isolation of pl 4.6 extensin peroxidase from tomato cell suspension cultures and identification of Val-Tyr-Lys as putative intermolecular cross-link site. Plant J 9: 477-489

Schoffl F, Rieping M, Baumann G, Bevan M, Angermuller S (1989) The function of plant heat shock promoter elements in the regulated expression of chimaeric genes in transgenic tobacco. Mol Gen Genet 217: 246-253.

Schultz CJ, Ferguson KL, Lahnstein J, Bacic A (2004) Post-translational modifications of arabinogalactan-peptides of Arabidopsis thaliana - Endoplasmic reticulum and glycosylphosphatidylinositol-anchor signal cleavage sites and hydroxylation of proline. J Biol Chem 279: 45503-45511

Schultz CJ, Johnson KL, Currie G, Bacic A (2000) The classical arabinogalactan protein gene gamily of Arabidopsis. Plant Cell 12: 1751-1768

Schultz CJ, Rumsewicz MP, Johnson KL, Jones BJ, Gaspar YM, Bacic A (2002) Using genomic resources to guide research directions. The arabinogalactan protein gene family as a test case. Plant Physiol 129: 1448-1463

Seifert GJ, Roberts K (2007) The biology of arabinogalactan proteins. Annu Rev Plant Biol 58: 137-161

Serpe M, Nothnagel E (1994) Effects of Yariv phenylglycosides on rosa cell suspensions: Evidence for the involvement of arabinogalactan-proteins in cell proliferation. Planta 193: 542-550

Serpe MD, Nothnagel EA (1995) Fractionation and structural characterization of arabinogalactan-proteins from the cell wall of rose cells. Plant Physiol 109: 1007- 1016.

Sherrier DJ, Prime TA, Dupree P (1999) Glycosylphosphatidylinositol-anchored cell- surface proteins from Arabidopsis. Electrophoresis 20: 2027-2035

Shi HZ, Kim Y, Guo Y, Stevenson B, Zhu JK (2003) The Arabidopsis SOS5 locus encodes a putative cell surface adhesion protein and is required for normal cell expansion. Plant Cell 15: 19-32

205

Shore G, Maclachlan GA (1975) The site of cellulose synthesis. Hormone treatment alters the intracellular location of alkali-insoluble beta-1,4-glucan (cellulose) synthetase activities. J Cell Biol 64: 557-571

Showalter AM (1993) Structure and function of plant cell wall proteins. Plant Cell 5: 9- 23

Showalter AM (2001) Arabinogalactan-proteins: structure, expression and function. Cell Mol Life Sci 58: 1399-1417

Showalter AM, Keppler B, Lichtenberg J, Gu DZ, Welch LR (2010) A bioinformatics approach to the identification, classification, and analysis of hydroxyproline-rich glycoproteins. Plant Physiol 153: 485-513

Shpak E, Barbar E, Leykam JF, Kieliszewski MJ (2001) Contiguous hydroxyproline residues direct hydroxyproline arabinosylation in Nicotiana tabacum. J Biol Chem 276: 11272-11278

Shpak E, Leykam JF, Kieliszewski MJ (1999) Synthetic genes for glycoprotein design and the elucidation of hydroxyproline-O-glycosylation codes. Proc Natl Acad Sci USA 96: 14736-14741

Simmons BA, Logue D, Ralph J (2010) Advances in modifying lignin for enhanced biofuel production. Curr Opin Plant Biol 13: 313-320

Smith JJ, Muldoon EP, Willard JJ, Lamport DTA (1986) Tomato extensin precursors P1 and P2 are highly periodic structures. Phytochemistry 25: 1021-1030

Sommer-Knudsen J, Bacic A, Clarke AE (1998) Hydroxyproline-rich plant glycoproteins. Phytochemistry 47: 483-497

Sommer-Knudsen J, Clarke AE, Bacic A (1996) A galactose-rich, cell-wall glycoprotein from styles of Nicotiana alata. Plant J 9: 71-83

Sparkes IA, Runions J, Kearns A, Hawes C (2006) Rapid, transient expression of fluorescent fusion proteins in tobacco plants and generation of stably transformed plants. Nat Protoc 1: 2019-2025.

Springer GF, Desai PR (1971) Monosaccharides as specific precipitinogens of eel antihuman blood-group H(O) antibody. Biochemistry 10: 3749-3761

Sterling JD, Atmodjo MA, Inwood SE, Kolli VSK, Quigley HF, Hahn MG, Mohnen D (2006) Functional identification of an Arabidopsis pectin biosynthetic

206

homogalacturonan galacturonosyltransferase. Proc Natl Acad Sci USA 103: 5236- 5241

Strasser R, Bondili JS, Vavra U, Schoberer J, Svoboda B, Glossl J, Leonard R, Stadlmann J, Altmann F, Steinkellner H, Macha L (2007) A unique beta 1,3- galactosyltransferase is indispensable for the biosynthesis of N-glycans containing lewis a structures in Arabidopsis thaliana. Plant Cell 19: 2278-2292

Sun WX, Zhao ZD, Hare MC, Kieliszewski MJ, Showalter AM (2004) Tomato LeAGP-1 is a plasma membrane-bound, glycosylphosphatidylinositol-anchored arabinogalactan-protein. Physiol Plant 120: 319-327

Svetek J, Yadav MP, Nothnagel EA (1999) Presence of a glycosylphosphatidylinositol lipid anchor on rose arabinogalactan proteins. J Biol Chem 274: 14724-14733

Takos AM, Dry IB, Soole KL (2000) Glycosyl-phosphatidylinositol-anchor addition signals are processed in Nicotiana tabacum. Plant J 21: 43-52

Tan L, Leykam JF, Kieliszewski MJ (2003) Glycosylation motifs that direct arabinogalactan addition to arabinogalactan-proteins. Plant Physiol 132: 1362-1369 Tan L, Qiu F, Lamport DT, Kieliszewski MJ (2004) Structure of a hydroxyproline (Hyp)-arabinogalactan polysaccharide from repetitive Ala-Hyp expressed in transgenic Nicotiana tabacum. J Biol Chem 279: 13156-13165

Tan L, Varnai P, Lamport DT, Yuan C, Xu J, Qiu F, Kieliszewski MJ (2010) Plant O-hydroxyproline arabinogalactans are composed of repeating trigalactosyl subunits with short bifurcated side chains. J Biol Chem 285: 24575-24583

Tan W, Gou DM, Tai E, Zhao YZ, Chow LMC (2006) Functional reconstitution of purified chloroquine resistance membrane transporter expressed in yeast. Archives of Biochemistry and Biophysics 452: 119-128

Taylor NG, Howells RM, Huttly AK, Vickers K, Turner SR (2003) Interactions among three distinct CesA proteins essential for cellulose synthesis. Proc Natl Acad Sci USA 100: 1450-1455

Tiainen P, Myllyharju J, Koivunen P (2005) Characterization of a second Arabidopsis thaliana prolyl 4-hydroxylase with distinct substrate specificity. J Biol Chem 280: 1142-1148

Tschopp JF, Brust PF, Cregg JM, Stillman CA, Gingeras TR (1987) Expression of the lacZ gene from two methanol-regulated promoters in Pichia pastoris. Nucleic Acids Res 15: 3859-3876.

207

Tsumuraya Y, Hashimoto Y, Yamamoto S (1987) An L-arabino-D-galactan and an L- arabino-D-galactan-containing proteoglycan from radish (Raphanus sativus) seeds. Carbohydrate Research 161: 113-126

Tsumuraya Y, Hashimoto Y, Yamamoto S, Shibuya N (1984) Structure of L-arabino- D-galactan-containing glycoproteins from radish leaves. Carbohydrate Research 134: 215-228

Tsumuraya Y, Ogura K, Hashimoto Y, Mukoyama H, Yamamoto S (1988) Arabinogalactan-proteins from primary and mature roots of radish (Raphanus sativus L.). Plant Physiol 86: 155-160

van Hengel AJ, Roberts K (2002) Fucosylated arabinogalactan-proteins are required for full root cell elongation in Arabidopsis. Plant J 32: 105-113

Vanholme R, Demedts B, Morreel K, Ralph J, Boerjan W (2010) Lignin Biosynthesis and Structure. Plant Physiol 153: 895-905

Vanzin GF, Madson M, Carpita NC, Raikhel NV, Keegstra K, Reiter WD (2002) The mur2 mutant of Arabidopsis thaliana lacks fucosylated xyloglucan because of a lesion in fucosyltransferase AtFUT1. Proc Natl Acad Sci USA 99: 3340-3345

Varki A, Cummings R, Esko J, Freeze H, Hart G, Marth J (1999) O-glycans, Ed 1. Cold Spring Harbor, New York

Velasquez SM, Ricardi MM, Dorosz JG, Fernandez PV, Nadra AD, Pol-Fachin L, Egelund J, Gille S, Harholt J, Ciancia M, Verli H, Pauly M, Bacic A, Olsen CE, Ulvskov P, Petersen BL, Somerville C, Iusem ND, Estevez JM (2011) O- Glycosylated cell wall proteins are essential in root hair growth. Science 332: 1401- 1403

Vorwerk S, Somerville S, Somerville C (2004) The role of plant cell wall polysaccharide composition in disease resistance. Trends Plant Sci 9: 203-209

Wang H, Wu HM, Cheung AY (1993) Development and pollination regulated accumulation and glycosylation of a stylar transmitting tissue-specific proline-rich protein. Plant Cell 5: 1639-1650

Wang J, Elliott JE, Williamson RE (2008) Features of the primary wall CESA complex in wild type and cellulose-deficient mutants of Arabidopsis thaliana. J Exp Bot 59: 2627-2637

Watkins WM, Morgan WT (1952) Neutralization of the anti-H agglutinin in eel serum by simple sugars. Nature 169: 825-826.

208

Wei G, Shirsat AH (2006) Extensin over-expression in Arabidopsis limits pathogen invasiveness. Mol Plant Pathol 7: 579-592

Wen FS, Celoy RM, Nguyen T, Zeng WQ, Keegstra K, Immerzeel P, Pauly M, Hawes MC (2008) Inducible expression of Pisum sativum xyloglucan fucosyltransferase in the pea root cap meristem, and effects of antisense mRNA expression on root cap cell wall structural integrity. Plant Cell Rep 27: 1125-1135

Willats WGT, Knox JP (1996) A role for arabinogalactan-proteins in plant cell expansion: evidence from studies on the interaction of beta-glucosyl Yariv reagent with seedlings of Arabidopsis thaliana. Plant J 9: 919-925

Willfor S, Sjoholm R, Laine C, Roslund M, Hemming J, Holmbom B (2003) Characterisation of water-soluble galactoglucomannans from Norway spruce wood and thermomechanical pulp. Carbohydrate Polymers 52: 175-187

Wu AM, Rihouey C, Seveno M, Hornblad E, Singh SK, Matsunaga T, Ishii T, Lerouge P, Marchant A (2009) The Arabidopsis IRX10 and IRX10-LIKE glycosyltransferases are critical for glucuronoxylan biosynthesis during secondary cell wall formation. Plant J 57: 718-731

Wu HM, Wong E, Ogdahl J, Cheung AY (2000) A pollen tube growth-promoting arabinogalactan protein from Nicotiana alata is similar to the tobacco TTS protein. Plant J 22: 165-176

Wu YY, Williams M, Bernard S, Driouich A, Showalter AM, Faik A (2010) Functional identification of two nonredundant Arabidopsis (1,2)fucosyltransferases specific to arabinogalactan proteins. J Biol Chem 285: 13638-13645

Xu JF, Tan L, Lamport DTA, Showalter AM, Kieliszewski MJ (2008) The O-Hyp glycosylation code in tobacco and Arabidopsis and a proposed role of Hyp-glycans in secretion. Phytochemistry 69: 1631-1640

Yong WD, Link B, O'Malley R, Tewari J, Hunter CT, Lu CA, Li XM, Bleecker AB, Koch KE, McCann MC, McCarty DR, Patterson SE, Reiter WD, Staiger C, Thomas SR, Vermerris W, Carpita NC (2005) Genomics of plant cell wall biogenesis. Planta 221: 747-751

York WS, Kumar Kolli VS, Orlando R, Albersheim P, Darvill AG (1996) The structures of arabinoxyloglucans produced by solanaceous plants. Carbohydr Res 285: 99-128

209

Youl JJ, Bacic A, Oxley D (1998) Arabinogalactan-proteins from Nicotiana alata and Pyrus communis contain glycosylphosphatidylinositol membrane anchors. Proc Natl Acad Sci USA 95: 7921-7926

Yuasa K, Toyooka K, Fukuda H, Matsuoka K (2005) Membrane-anchored prolyl hydroxylase with an export signal from the endoplasmic reticulum. Plant J 41: 81-94 Zeng W, Chatterjee M, Faik A (2008) UDP-xylose-stimulated glucuronyltransferase activity in wheat microsomal membranes: characterization and role in glucurono(arabino) xylan biosynthesis. Plant Physiol 147: 78-91

Zeng W, Jiang N, Nadella R, Killen TL, Nadella V, Faik A (2010) A glucurono(arabino)xylan synthase complex from wheat contains members of the GT43, GT47, and GT75 families and functions cooperatively. Plant Physiol 154: 78- 97

Zhao ZD, Tan L, Showalter AM, Lamport DT, Kieliszewski MJ (2002) Tomato LeAGP-1 arabinogalactan-protein purified from transgenic tobacco corroborates the Hyp contiguity hypothesis. Plant J 31: 431-444

Zhong RQ, Morrison WH, Freshour GD, Hahn MG, Ye ZH (2003) Expression of a mutant form of cellulose synthase AtCesA7 causes dominant negative effect on cellulose biosynthesis. Plant Physiol 132: 786-795

210

APPENDIX A: SEQUENCES AND LOCATIONS OF PRIMERS USED IN THIS

WORK

Appendix A Table 1. Sequence and annealing temperatures of the primers used in this work

Purpose Name Primer Sequences (5’3’) Annealing

Cloning of GALT1 to pPICZA GalT1 yeast F CGCCGCGGATGCATCATCATCATCATCACA 54°C TGAAGAGATTTTATGGAGGGCTT

GalT1 yeast R GGGCCCTTACCATTCGCGGCAGCA

Cloning of GALT3 to pPICZA GalT3 yeast F CGCCGCGG 54°C ATGCATCATCATCATCATCACATGAAGCAAT TCATGTCAG TGGT

GalT3 yeast R GGGCCCTTATTCGCAGCAAATAGATTGGTTC

Cloning of GALT4 to pPICZA GalT4 yeast F CGCCGCGGATGCATCATCATCATCATCACAT 54°C GAAGAAGTCTAAACTCGATAATTC

GalT4 yeast R GGGCCCTCATCTCATGTTGCAGCATTG

Cloning of GALT5 to pPICZA GalT5 yeast F CGCCGCGGATGCATCATCATCATCATCACAT 54°C GAAAAAACCCAAATTGTCG

GalT5 yeast R TCTAGATCATCTCATGTTACAACACTC

Cloning of GALT6 to pPICZA GalT6 yeast F CGCCGCGGATGCATCATCATCATCATCACAT 54°C GAGGAAGCCCAAGTTGTCA

GalT6 yeast R TCTAGATCATCTCATGTTGCAGCACTG

Cloning of GALT1 to pMDC30 GalT1BY2F2 CACCATGCATCATCATCATCATCACAAGAGA 54°C /pMDC32; verification of TTTTATGGAGGGCTTC constructs or Agrobacterium or BY2 cells containing GALT1 galt1R by2 TTACCATTCGCGGCAGCAAA

Cloning of GALT3 to pMDC30 GalT3BY2F2 CACCATGCATCATCATCATCATCACAAGCAA 54°C /pMDC32; verification of TTCATGTCAGTGGT constructs or Agrobacterium or BY2 cells containing GALT3 galt3R by2 TTATTCGCAGCAAATAGATTGGTTC

Cloning of GALT4 to pMDC30 GalT4BY2F2 CACCATGCATCATCATCATCATCACAAGAAG 54°C /pMDC32; verification of TCTAAACTCGATAATTC constructs or Agrobacterium or BY2 cells containing GALT4 galt4R by2 TCATCTCATGTTGCAGCATTG

Cloning of GALT5 to pMDC30 GalT5BY2F2 CACCATGCATCATCATCATCATCACAAAAAA 54°C /pMDC32; verification of CCCAAATTGTCG constructs or Agrobacterium or BY2 cells containing GALT5 galt5R by2 TCATCTCATGTTACAACACTCAG

211

Appendix A Table 1 (continued)

Cloning of GALT6 to pMDC30 GalT6BY2F2 CACCATGCATCATCATCATCATCACAGGAAG 54°C /pMDC32; verification of CCCAAGTTGTCA constructs or Agrobacterium or BY2 cells containing GALT6 galt6R by2 TCATCTCATGTTGCAGCACTG

GALT1 construct sequencing GalT1site544 GTTCCTCCATCACAGTTATTGG 55°C

GalT1site651R CACAATGATGGGTGGATCAG 55°C

GalT1site220R CACCAGATATCGCTTCTTGAG 55°C

GALT3 construct sequencing GalT3 site462 GCATTTGATAAGGATTTGTCTGG 55°C

GalT3site622R GACGAGTTTCTCCTGATAATCC 55°C

GALT4 construct sequencing GalT4 site586 GATCTCACATTACCGTCGTG 55°C

GalT4site717R AGGCGGATCTTCACCATC 55°C

GalT4site222R ATTAGCTTCCCTACTACCACC 55°C

GALT5 construct sequencing GalT5 site395F GGTCAGTGGAGCTTCACAAG 55°C

GalT5 site586 CCGTGTGGTTTGACATTGG 55°C

GalT5site726R ATCCTCTCCTTCAACAGTCTTC 55°C

GalT5site222R GGGCTCTTGCTCGTTATTG 55°C

GALT6 GalT6 site592 GGTCACATATTACAGTGGTTGG 55°C construct sequencing GalT6site722R TTCAAACCCTGAAGCTCCAAC 55°C

GalT6site227R CTCTCTTGTAACTCTCTCTGAC 55°C

GalT6site855R TTCACATCTTTGTGCTGAGCC 55°C

GalT6site1939F GCATAGAGAATTACTTGACGGCG 55°C

pMDC30/pMDC32 vector pMDC45PacI CCGGCAACAGGATTCAATCTTAAG 55°C sequencing

GALT3 site 1446 T to C t1446c GATGAGCTCCTATCAAGCCTAGAAGAAAGA 55°C CCGTC

t1446c- GACGGTCTTTCTTCTAGGCTTGATAGGAGCT antisense CATC

Cloning of GALT3 to GalT3-C-F CAGGACGTCTAGATGAAGCAATTCATGTCA 62°C pVKH18En6 GTGGTGAGATTC

GalT3-C-R CATGACCGTCGACTTTTCGCAGCAAATAGAT TGGTTCTC

212

Appendix A Table 1 (Continued) pVKH18En6-GALT3 sequencing MHP-23 GAACTTCAGGGTCAGCTTGCC 58.5°C

pVKH18En6-GALT3 sequencing MHP-22 AGTCCGCCCTGAGCAAAG 57.8°C

Absence of insertion of fut4 F4SLLP CCATGTAGTTACATTCCCAACCG 54°C

F4SLRP1 CCACGTCGATGGAGCCTTGTTT

Presence of insertion 1 in fut4 TAGCATCTGAATTTCATAACCAATCTCGATA 49°C LB3 CAC

F4SLRP2 GAACATGTTTTCAGAGCGAGC

Presence of insertion 2 in fut4 LB3 TAGCATCTGAATTTCATAACCAATCTCGATA 54°C CAC

F4SLLP CCATGTAGTTACATTCCCAACCG

Absence of insertion of fut6 F6LP2 CACATCTTTCAGATCTCCAGCG 55°C

F6RP CTTTCTTGTAAGCATCCGTGC

Presence of insertion 1 in fut6 LBa1 TGGTTCACGTAGTGGGCCATCG 54°C

F6RP CTTTCTTGTAAGCATCCGTGC

Presence of insertion 2 in fut6 LBa1 TGGTTCACGTAGTGGGCCATCG 54°C

F6LP1 TGTTTTGCTAGCAACGAAAGC

RT -PCR for fut4 46RTL1 TGGAGGATTAAAGCCATGGTTAC 52°C

FUT4RP1 AATGAAAAAGGAAGATATATAACA

RT -PCR for fut6 FUT6brtlp TCCAGCGAAGTTTTCAGAGC 52°C

Fut6brtrp CGCAACCCACACAATGTATC

213

Appendix A Figure 1. Schematic diagrams showing the positions of primers used for expression cloning and verification of transgenic bacteria, Agrobacterium or tobacco BY2 cells.

214

Appendix A Figure 2 Schematic diagram showing the positions of primers used for RT-PCR analysis of fut4, fut4/fut6 and fut4/fut6 mutant plants.

215

APPENDIX B: PARTIAL SEQUENCE OF THE FINAL EXPRESSION

CONSTRUCTS

Part I. Sequencing results of the pPICZA-6xHis-GalT constructs.

The expression constructs of pPICZA-6xHis-GalT1/GalT3/GalT4 and pPICZB-6xHis - GalT5/GalT6 were sequenced for full length coding regions of the 6xHis-GalT genes and partial sequences of the promoter and terminator regions. Color codes: blue: pPICZA/B vector sequences; orange: SacII restriction site; yellow: ApaI restriction site; green: XbaI restriction site; purple: 6xHis tag; black: GalT coding region; red: nucleotide mutation without affecting the amino acid encoded.

>pPICZ-GalT1 GATCAAAAAACAACTAATTATTCGAAACGAGGAATTCACGTGGCCCAGCCGGCCGTCTCG GATCGGTACCTCGAGCCGCGGATGCATCATCATCATCATCACATGAAGAGATTTTATGGA GGGCTTCTTGTTGTATCAATGTGTATGTTCTTGACGGTGTATAGATATGTAGACTTGAAC ACTCCTGTTGAAAAGCCTTATATTACTGCTGCTGCTTCTGTTGTTGTTACTCCTAACACC ACTCTTCCTATGGAATGGCTGCGGATTACTCTCCCTGACTTTATGAAAGAAGCGAGGAAT ACTCAAGAAGCGATATCTGGTGATGATATCGCTGTTGTCTCGGGTTTGTTTGTTGAGCAG AATGTGTCTAAAGAAGAGAGAGAGCCTTTGCTTACTTGGAACCGTTTGGAAAGTCTTGTT GATAATGCACAGAGTTTAGTTAATGGAGTTGATGCTATTAAGGAAGCTGGCATTGTTTGG GAGAGTCTTGTGTCTGCTGTTGAAGCTAAGAAACTAGTTGATGTTAATGAAAATCAGACG AGGAAAGGAAAAGAGGAGCTTTGTCCTCAGTTTCTAAGCAAAATGAATGCTACTGAAGCT GATGGGAGTAGTCTGAAGTTGCAAATTCCTTGTGGTTTGACTCAGGGTTCCTCCATCACA GTTATTGGCATCCCAGATGGTCTTGTTGGTAGTTTTCGGATTGATCTAACGGGACAACCG CTTCCAGGGGAGCCTGATCCACCCATCATTGTGCATTATAATGTTAGGCTTCTTGGTGAC AAATCGACGGAAGACCCTGTGATTGTTCAAAACAGCTGGACGGCATCTCAGGACTGGGGA GCTGAGGAACGCTGTCCAAAATTTGATCCTGATATGAATAAGAAAGTGGATGACTTGGAT GAATGCAACAAGATGGTTGGTGGAGAAATTAACCGAACTTCTTCAACTAGCTTGCAGTCC AACACTTCAAGGGGAGTTCCAGTAGCCAGGGAAGCATCTAAACATGAAAAATACTTTCCT TTCAAGCAGGGTTTCTTATCGGTTGCTACACTTAGGGTGGGAACAGAGGGAATGCAGATG ACAGTTGATGGGAAACATATAACTTCATTTGCTTTCCGCGATACACTGGAACCGTGGCTT GTTAGTGAAATACGGATTACAGGTGACTTTAGGTTAATATCCATTCTTGCCAGCGGTTTG CCCACATCAGAAGAATCAGAGCACGTTGTTGATCTAGAGGCACTAAAATCACCTACCCTT TCACCATTAAGGCCACTGGATCTCGTTATTGGTGTTTTCTCCACTGCGAACAATTTTAAA AGACGGATGGCTGTGAGGAGAACATGGATGCAGTATGATGATGTAAGATCTGGAAGAGTT GCAGTACGCTTTTTTGTTGGCCTTCACAAAAGTCCTCTTGTTAACTTGGAACTCTGGAAC GAGGCTCGGACTTACGGTGATGTTCAGCTAATGCCCTTTGTTGATTATTACAGTCTCATC AGTTGGAAAACACTAGCCATCTGCATCTTCGGGACAGAGGTTGACTCAGCCAAGTTCATC

216

ATGAAAACGGATGATGACGCCTTTGTTCGTGTAGATGAAGTGTTACTTTCTTTATCAATG ACCAACAACACTCGCGGGTTAATATACGGACTGATCAATTCCGACTCTCAACCTATTCGA AACCCTGATAGCAAATGGTACATCAGTTATGAGGAATGGCCTGAAGAGAAATATCCACCA TGGGCGCATGGTCCAGGCTACATTGTATCTCGTGACATAGCAGAATCGGTTGGTAAGCTT TTCAAAGAAGGAAACCTAAAGATGTTTAAGCTAGAAGATGTGGCAATGGGGATATGGATA GCTGAGCTGACAAAACATGGACTCGAGCCTCATTACGAAAACGATGGAAGGATCATTAGT GATGGATGCAAGGATGGTTATGTGGTTGCTCATTACCAAAGCCCTGCCGAAATGACTTGC CTTTGGCGTAAATACCAAGAAACCAAACGCTCTCTTTGCTGCCGCGAATGGTAAGGGCCC GAACAAAAACTCATCTCAGAAGAGGATCTGAATAGCGCCGTCGACCATCATCATCATCAT CATTGAGTTT

>pPICZA-GalT3 AGATCAAAAAACAACTAATTATTCGAAACGAGGAATTCACGTGGCCCAGCCGGCCGTCTC GGATCGGTACCTCGAGCCGCGGATGCATCATCATCATCATCACATGAAGCAATTCATGTC AGTGGTGAGATTCAAATTTGGTTTCACTTCAGTCAGAATGAGGGATTGGTCGGTGGGAGT CTCCATTATGGTTCTTACATTGATCTTCATCATCCGTTATGAACAATCTGATCACACTCA CACTGTGGATGATTCTAGTATAGAAGGAGAGAGTGTTCATGAACCCGCAAAGAAGCCACA TTTTATGACTTTGGAAGATCTTGATTATCTATTTTCAAACAAGAGCTTTTTTGGAGAAGA AGAAGTGTCCAATGGAATGCTTGTATGGTCTCGAATGCGTCCATTTCTTGAAAGGCCAGA TGCTTTGCCAGAAACTGCTCAAGGGATAGAAGAAGCTACATTGGCAATGAAAGGTTTGGT TTTAGAAATCAATAGAGAGAAGAGAGCTTATTCTTCTGGTATGGTCTCTAAGGAAATTAG AAGAATCTGTCCGGATTTTGTCACTGCATTTGATAAGGATTTGTCTGGTTTAAGTCATGT ACTTCTTGAGCTTCCTTGTGGTTTAATTGAAGATTCTTCAATAACTTTGGTTGGTATTCC TGATGAACATTCTAGTAGCTTCCAGATTCAGCTCGTTGGCTCGGGATTATCAGGAGAAAC TCGTCGGCCAATAATCTTGCGTTACAATGTGAACTTTTCTAAACCATCGATAGTGCAAAA TACATGGACAGAGAAGCTAGGTTGGGGAAACGAAGAGCGATGCCAATATCATGGATCATT GAAAAATCATTTAGTTGATGAACTTCCTCTCTGCAACAAACAGACCGGTAGAATCATTTC GGAAAAGAGTTCCAACGATGATGCAACTATGGAACTTTCTCTTTCAAATGCTAATTTTCC ATTTCTCAAAGGGAGTCCCTTCACTGCCGCATTGTGGTTTGGCTTAGAAGGTTTTCATAT GACGATAAATGGGCGGCACGAGACTTCATTTGCTTACAGGGAGAAGCTCGAGCCATGGTT AGTCAGTGCAGTCAAAGTCTCAGGTGGTTTGAAAATTTTATCTGTCTTAGCCACAAGACT GCCCATTCCCGATGACCATGCATCTTTAATCATAGAAGAGAAACTTAAAGCTCCATCTCT TTCCGGGACAAGAATAGAACTATTGGTGGGTGTTTTCTCCACTGGAAATAATTTTAAGCG GCGTATGGCATTGAGAAGATCTTGGATGCAATACGAGGCAGTAAGATCTGGCAAAGTAGC TGTTCGATTTCTCATTGGCCTTCACACAAATGAAAAAGTCAATTTAGAGATGTGGAGAGA ATCTAAGGCATATGGAGACATTCAGTTTATGCCATTTGTTGACTACTATGGTTTACTTAG CTTGAAAACAGTTGCGCTTTGCATTCTCGGGACCAAAGTCATCCCAGCAAAATACATAAT GAAGACGGATGATGATGCGTTTGTACGGATTGATGAGCTCCTATCAAGTCTAGAAGAAAG

217

ACCGTCTAGTGCCCTTCTGTACGGTTTGATCTCATTTGATTCATCACCGGACCGTGAACA AGGCAGCAAATGGTTTATCCCTAAAGAGGAATGGCCTTTAGATTCATACCCTCCATGGGC ACATGGCCCTGGCTACATCATCTCTCATGATATAGCGAAATTTGTGGTGAAGGGTCACCG TCAAAGAGATCTTGGACTTTTCAAGCTGGAAGATGTGGCGATGGGGATATGGATTCAACA ATTCAACCAGACGATAAAAAGAGTGAAGTACATCAATGACAAAAGATTTCATAACAGTGA TTGTAAATCAAATTACATTCTTGTTCATTACCAAACTCCTAGACTAATTTTGTGTCTTTG GGAGAAGCTGCAAAAAGAGAACCAATCTATTTGCTGCGAATAAGGGCCCGAACAAAAACT CATCTCAGAAGAGGATCTGAATAGCGCCGTCGACCATCATCATCATCATCATTGAGTTT

>pPICZA-GalT4 GATCAAAAAACAACTAATTATTCGAAACGAGGAATTCACGTGGCCCAGCCGGCCGTCTCG GATCGGTACCTCGAGCCGCGGATGCATCATCATCATCATCACATGAAGAAGTCTAAACTC GATAATTCTTCTTCACAGATTCGATTCGGGCTTGTTCAGTTCTTATTAGTTGTTCTGCTT TTTTACTTCCTCTGCATGAGCTTCGAGATCCCATTCATCTTCAGAACCGGGTCTGGGTCC GGGTCTGATGATGTTTCATCTTCTTCTTTTGCTGACGCATTACCGAGACCAATGGTTGTT GGTGGTGGTAGTAGGGAAGCTAATTGGGTTGTCGGAGAAGAAGAAGAAGCAGACCCACAT CGACATTTCAAGGATCCGGGTCGGGTACAGCTTCGGTTACCGGAGCGGAAAATGAGGGAA TTTAAGTCCGTCTCTGAGATTTTCGTCAACGAGAGCTTCTTCGACAATGGCGGATTCAGC GATGAATTCTCAATCTTTCACAAAACAGCGAAGCATGCGATTTCAATGGGTCGAAAAATG TGGGACGGACTCGATTCGGGTTTAATCAAACCCGATAAAGCTCCGGTTAAGACCCGGATT GAGAAATGTCCGGATATGGTTTCGGTTTCTGAGTCGGAGTTTGTGAACCGGAGTCGGATC TTGGTTTTGCCGTGTGGGTTAACGTTAGGATCTCACATTACCGTCGTGGCTACGCCGCAT TGGGCTCACGTTGAGAAAGATGGTGATAAGACGGCGATGGTGAGTCAGTTCATGATGGAG TTACAAGGATTAAAGGCGGTGGATGGTGAAGATCCGCCTCGGATACTTCATTTTAACCCG AGGATTAAAGGTGATTGGAGTGGAAGACCAGTGATTGAGCAAAACACTTGTTATCGAATG CAATGGGGCTCAGGTTTACGTTGTGATGGTCGTGAATCTAGTGATGATGAAGAATATGTT GATGGAGAGGTGAAATGTGAGAGGTGGAAGAGAGATGATGATGATGGTGGTAATAATGGT GATGATTTTGATGAATCAAAGAAGACATGGTGGTTGAATAGGTTGATGGGTCGGAGGAAG AAGATGATAACACATGATTGGGATTATCCTTTTGCTGAAGGGAAGCTTTTTGTTCTTACA CTTCGAGCTGGGATGGAAGGTTATCATATTAGTGTGAATGGAAGACATATCACATCTTTT CCTTATAGAACGGGGTTTGTTTTGGAGGATGCCACTGGATTAGCAGTCAAAGGAAACATT GATGTGCATTCTGTATATGCTGCCTCGTTACCTTCTACAAATCCTAGTTTTGCACCGCAG AAGCATCTCGAGATGCAAAGGATATGGAAAGCTCCTTCGTTACCTCAGAAGCCTGTAGAG TTGTTCATTGGAATTCTTTCTGCTGGTAATCATTTTGCAGAGAGAATGGCAGTGAGGAAG TCATGGATGCAGCAGAAGCTGGTCAGATCATCGAAAGTTGCTGCCCGGTTCTTTGTGGCA TTGCATGCAAGAAAGGAAGTCAATGTGGATTTAAAGAAAGAAGCTGAGTACTTTGGTGAT ATTGTCATAGTACCGTACATGGATCATTATGACCTTGTTGTGCTCAAGACAGTTGCCATC TGCGAATATGGGGTGAACACAGTGGCGGCAAAGTACGTTATGAAATGTGACGATGATACA TTTGTGCGTGTGGATGCTGTGATCCAGGAAGCAGAAAAGGTTAAGGGAAGAGAGAGCCTT

218

TATATTGGAAACATTAATTTTAACCATAAGCCATTGCGTACCGGGAAATGGGCTGTGACA TTCGAGGAATGGCCAGAAGAGTATTATCCTCCATATGCAAATGGTCCGGGTTACATCTTG TCATATGATGTAGCTAAGTTCATTGTCGATGATTTTGAACAAAAGCGATTAAGATTATTC AAGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGAAGTTCAACGAGACTAGACCAGTG GCAGTGGTTCACAGCCTCAAGTTCTGTCAGTTTGGTTGCATAGAAGACTACTTCACCGCT CATTATCAGTCGCCTCGCCAGATGATTTGCATGTGGGATAAGCTGCAGAGACTCGGGAAG CCCCAATGCTGCAACATGAGATGAGGGCCCGAACAAAAACTCATCTCAGAAGAGGATCTG AATAGCGCCGTCGACCATCATCATCATCATCATTGAGTTTTAGCCTTAGACATGACTGTT CCTCAGTTCAAGTTGGGCACTTACGAGAAGACC

>pPICZB-GalT5 GATCAAAAAACAACTAATTATTCGAAACGAGGAATTCACGTGGCCCAGCCGGCCGTCTCG GATCGGTACCTCGAGCCGCGGATGCATCATCATCATCATCACATGAAAAAACCCAAATTA TCGAAAGTGGAAAAAATCGACAAGATTGATCTGTTCTCTTCACTATGGAAGCAGAGATCG GTTCGTGTAATAATGGCAATAGGGTTTCTCTATCTTGTAATTGTCTCAGTAGAGATACCT CTCGTTTTCAAATCCTGGTCCAGCAGCTCCGTGCCTCTTGATGCTCTTTCTCGACTCGAG AAGCTCAATAACGAGCAAGAGCCCCAAGTCGAGATTATCCCTAATCCTCCATTGGAGCCA GTTTCGTACCCGGTTTCGAACCCGACCATTGTTACCCGGACGGACCTTGTTCAGAACAAG GTCCGTGAACATCATCGGGGTGTTCTCTCAAGTTTGAGATTTGATTCGGAAACTTTCGAC CCGAGTAGTAAAGACGGGTCAGTGGAGCTTCACAAGTCAGCTAAGGAAGCTTGGCAGCTA GGTCGCAAGCTATGGAAGGAGCTTGAATCTGGAAGGCTTGAGAAACTAGTGGAGAAGCCA GAGAAGAACAAACCGGATTCATGTCCACATTCTGTTTCGCTAACCGGGTCTGAGTTTATG AACCGGGAGAACAAATTGATGGAGCTGCCGTGTGGTTTGACATTGGGTTCACACATAACT TTGGTGGGGAGGCCGAGGAAAGCTCATCCCAAGGAAGGAGATTGGTCTAAGTTGGTGTCT CAGTTTGTGATAGAGCTTCAAGGTTTGAAGACTGTTGAAGGAGAGGATCCTCCTAGGATT CTGCATTTCAATCCGAGGCTTAAGGGAGATTGGAGCAAAAAACCGGTGATTGAGCAGAAT AGTTGCTATAGGATGCAATGGGGACCTGCACAACGTTGCGAAGGATGGAAGTCAAGAGAT GATGAAGAGACTGTTGATAGTCATGTGAAGTGTGAAAAATGGATTCGTGATGATGACAAT TACTCAGAAGGGTCGAGGGCAAGATGGTGGTTGAATAGACTTATAGGAAGGAGGAAAAGG GTCAAAGTAGAATGGCCGTTTCCTTTTGTGGAAGAGAAGCTGTTCGTTCTAACTCTTAGC GCCGGTTTAGAGGGTTACCATATCAATGTTGATGGAAAGCATGTTACTTCTTTCCCTTAT CGCACTGGTTTCACCCTTGAGGATGCAACAGGGCTAACAGTAAACGGAGACATTGATGTC CATTCTGTTTTTGTTGCCTCTCTGCCAACATCACATCCTAGTTTTGCTCCCCAAAGGCAT CTCGAATTGTCAAAGAGATGGCAGGCTCCTGTAGTTCCCGATGGGCCTGTGGAGATCTTT ATAGGCATTCTTTCCGCAGGCAATCATTTCTCTGAGCGGATGGCTGTGAGGAAATCCTGG ATGCAGCATGTTCTTATTACATCTGCAAAAGTTGTTGCTCGTTTCTTTGTGGCGCTGCAT GGGAGGAAGGAGGTGAATGTGGAATTGAAGAAAGAAGCGGAGTATTTTGGGGACATTGTA CTTGTTCCTTACATGGATAGCTATGATCTTGTCGTGCTGAAAACTGTTGCCATATGTGAA CACGGAGCTCTTGCATTCTCTGCAAAGTACATAATGAAGTGTGACGATGATACATTTGTA

219

AAACTTGGCGCGGTGATCAATGAAGTGAAAAAAGTACCCGAAGGTAGAAGCCTGTACATT GGTAACATGAATTATTACCACAAACCTCTCCGTGGGGGTAAATGGGCAGTCACATATGAG GAATGGCCAGAGGAGGACTATCCGCCCTACGCAAATGGACCCGGATATGTTCTATCTTCT GACATTGCGCGCTTCATCGTGGACAAGTTTGAGAGACATAAATTACGGCTGTTCAAGATG GAGGACGTGAGTGTGGGAATGTGGGTTGAGCATTTCAAGAACACAACAAACCCAGTGGAT TACAGACACAGTCTGAGATTCTGCCAGTTTGGTTGTGTTGAGAACTACTACACAGCTCAT TACCAGTCGCCAAGACAGATGATATGCTTATGGGATAAGCTCTTAAGACAGAACAAGCCT GAGTGTTGTAACATGAGATGATCTAGAACAAAAACTCATCTCAGAAGAGGATCTGAATAG CGCCGTCGACCATCATCATCATCATCATTGAGTTTGTAGCCTTAGACATGACTGTTCCTC AGTTCAAG

>pPICZB-GalT6 ACGACACTTGAGAGATCAAAAAACAACTAATTATTCGAAACGAGGAATTCACGTGGCCCA GCCGGCCGTCTCGGATCGGTACCTCGAGCCGCGGATGCATCATCATCATCATCACATGAG GAAGCCCAAGTTGTCAAAACTCGAGAGGTTAGAGAAATTCGACATTTTCGTTTCTCTAAG CAAGCAGAGATCGGTTCAGATACTAATGGCGGTTGGGTTACTCTACATGCTTCTTATCAC ATTCGAAATCCCTTTCGTCTTCAAAACCGGGCTTAGTTCTTTATCTCAGGATCCGTTAAC CCGACCCGAGAAGCACAATAGTCAGAGAGAGTTACAAGAGAGACGAGCTCCGACTCGACC TTTAAAGAGTCTGCTTTACCAGGAATCACAATCGGAATCACCGGCTCAGGGTTTAAGAAG AAGGACTCGGATCCTTTCTAGTTTGAGATTCGACCCGGAAACGTTTAACCCGAGTAGCAA AGATGGGTCTGTGGAGCTTCATAAATCTGCTAAGGTAGCTTGGGAAGTTGGTCGAAAGAT ATGGGAAGAGCTTGAGTCTGGGAAAACGTTGAAAGCTTTGGAGAAGGAGAAGAAGAAGAA GATTGAGGAACATGGGACAAACTCGTGTTCTCTCTCTGTTTCCTTAACCGGGTCTGATCT TTTGAAACGTGGGAATATCATGGAGCTTCCATGTGGTTTAACTCTTGGGTCACATATTAC AGTGGTTGGGAAGCCACGAGCTGCTCATTCGGAGAAGGACCCTAAGATATCGATGTTAAA GGAAGGAGATGAAGCTGTGAAGGTTTCACAGTTTAAGTTGGAGCTTCAGGGTTTGAAAGC AGTGGAAGGAGAAGAGCCACCTCGGATACTCCACTTGAATCCAAGGCTTAAGGGTGATTG GAGTGGTAAGCCTGTGATTGAGCAGAACACTTGCTATAGAATGCAATGGGGCTCAGCACA AAGATGTGAAGGATGGAGATCTAGGGATGATGAAGAGACTGTTGATGGTCAGGTTAAGTG CGAGAAATGGGCTCGGGATGATAGCATTACATCTAAAGAAGAAGAGTCTAGCAAGGCGGC TTCATGGTGGCTTAGTCGATTAATAGGTCGGAGCAAGAAAGTAACTGTTGAATGGCCATT TCCATTCACAGTTGACAAGCTTTTCGTGCTTACTCTTAGTGCTGGATTGGAAGGCTACCA TGTTAGTGTCGATGGGAAGCATGTCACTTCCTTTCCATACCGAACTGGATTTACGCTTGA GGATGCTACTGGTCTAACCATTAACGGGGACATAGATGTTCACTCTGTTTTCGCTGGCTC TCTCCCAACCTCGCATCCTAGTTTTTCTCCTCAGAGGCATCTTGAGCTCTCGAGCAATTG GCAAGCCCCATCACTTCCTGATGAGCAAGTTGATATGTTCATTGGTATCCTTTCTGCTGG TAACCATTTTGCTGAGAGGATGGCTGTGAGGAGGTCGTGGATGCAACATAAACTCGTTAA ATCTTCCAAAGTAGTGGCTCGGTTCTTTGTTGCACTGCACTCAAGGAAAGAAGTAAATGT GGAGCTAAAGAAGGAAGCTGAATTCTTTGGGGACATAGTTATAGTCCCTTACATGGACAG

220

TTATGACCTTGTCGTCCTCAAAACCGTTGCAATTTGCGAGTACGGGGCTCATCAACTTGC AGCTAAATTCATCATGAAGTGTGATGACGATACATTTGTACAAGTGGATGCGGTTCTTAG TGAAGCAAAGAAAACACCCACAGATAGAAGTCTATACATTGGCAACATCAATTATTATCA CAAACCACTTCGCCAGGGTAAATGGTCTGTTACATATGAGGAATGGCCAGAGGAAGACTA TCCACCTTATGCTAATGGCCCCGGATACATATTATCAAACGATATATCTCGCTTTATCGT GAAAGAGTTTGAGAAACACAAATTAAGGATGTTCAAAATGGAAGATGTAAGCGTGGGAAT GTGGGTAGAACAATTCAACAATGGTACAAAACCGGTGGACTACATTCACAGCCTCAGGTT TTGTCAGTTTGGTTGCATAGAGAATTACTTGACGGCGCATTATCAGTCGCCGAGACAGAT GATTTGCTTGTGGGATAAGCTGGTGTTGACAGGCAAACCTCAGTGCTGCAACATGAGATG ATCTAGAACAAAAACTCATCTCAGAAGAGGATCTGAATAGCGCCGTCGACCATCATCATC ATCATCATTGAGTTTGTAGCCTTAGACATGACTGTTCCTCAGTTCAAGTTGGGCACTTAC GAGAAGACC

221

Part II. Sequencing results of the pMDC30-6xHis-GalT constructs.

The expression constructs of pMDC30-6xHis-GalTs were sequenced for full length coding regions of the 6xHis-GalT genes and partial sequences of the promoter and terminator regions. Color codes: blue: pMDC30 vector sequences; orange: the remaining pENTR/D vector sequence after recombination; purple: 6xHis tag; black: GalT coding region without ATG.

> pMDC30-6xHis-GalT1 CAAGCTCAAGCTGCTCTAGCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATC GGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATT AAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGCCAA GCTTGCATGCCTGCAGGTCGACGGATCCCCCCTGGTAGACCAATCCTAACCAATGTCTGG TTAAGATGGTCCAATCCCGAAACTTCTAGTTGCGGTTCGAAGAAGTCCAGAATGTTTCTG AAAGTTTCAGAAAATTCTAGTTTTGAGATTTTCAGAAGTACGGCATGATGATGCATAACA AGGACTTTCTCGAAAGTACTATATTGCTCCTCTACATCATTTTAAATACCCCATGTGTCC TTTGAAGACACATCACAGAAAGAAGTGAAGGCATCGTTAGCAGTTTTGTAGATTCAACCT CAATTTGCAGAGTTACGTTCTAATATATTTACACAAGACTGGGGATCCTCTAGAGGATCC CCGGGTACCGGGCCCCCCCTCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAAAAAG CAGGCTCCGCGGCCGCCCCCTTCACCATGCATCATCATCATCATCACAAGAGATTTTATG GAGGGCTTCTTGTTGTATCAATGTGTATGTTCTTGACGGTGTATAGATATGTAGACTTGA ACACTCCTGTTGAAAAGCCTTATATTACTGCTGCTGCTTCTGTTGTTGTTACTCCTAACA CCACTCTTCCTATGGAATGGCTGCGGATTACTCTCCCTGACTTTATGAAAGAAGCGAGGA ATACTCAAGAAGCGATATCTGGTGATGATATCGCTGTTGTCTCGGGTTTGTTTGTTGAGC AGAATGTGTCTAAAGAAGAGAGAGAGCCTTTGCTTACTTGGAACCGTTTGGAAAGTCTTG TTGATAATGCACAGAGTTTAGTTAATGGAGTTGATGCTATTAAGGAAGCTGGCATTGTTT GGGAGAGTCTTGTGTCTGCTGTTGAAGCTAAGAAACTAGTTGATGTTAATGAAAATCAGA CGAGGAAAGGAAAAGAGGAGCTTTGTCCTCAGTTTCTAAGCAAAATGAATGCTACTGAAG CTGATGGGAGTAGTCTGAAGTTGCAAATTCCTTGTGGTTTGACTCAGGGTTCCTCCATCA CAGTTATTGGCATCCCAGATGGTCTTGTTGGTAGTTTTCGGATTGATCTAACGGGACAAC CGCTTCCAGGGGAGCCTGATCCACCCATCATTGTGCATTATAATGTTAGGCTTCTTGGTG ACAAATCGACGGAAGACCCTGTGATTGTTCAAAACAGCTGGACGGCATCTCAGGACTGGG GAGCTGAGGAACGCTGTCCAAAATTTGATCCTGATATGAATAAGAAAGTGGATGACTTGG ATGAATGCAACAAGATGGTTGGTGGAGAAATTAACCGAACTTCTTCAACTAGCTTGCAGT CCAACACTTCAAGGGGAGTTCCAGTAGCCAGGGAAGCATCTAAACATGAAAAATACTTTC CTTTCAAGCAGGGTTTCTTATCGGTTGCTACACTTAGGGTGGGAACAGAGGGAATGCAGA TGACAGTTGATGGGAAACATATAACTTCATTTGCTTTCCGCGATACACTGGAACCGTGGC TTGTTAGTGAAATACGGATTACAGGTGACTTTAGGTTAATATCCATTCTTGCCAGCGGTT TGCCCACATCAGAAGAATCAGAGCACGTTGTTGATCTAGAGGCACTAAAATCACCTACCC TTTCACCATTAAGGCCACTGGATCTCGTTATTGGTGTTTTCTCCACTGCGAACAATTTTA

222

AAAGACGGATGGCTGTGAGGAGAACATGGATGCAGTATGATGATGTAAGATCTGGAAGAG TTGCAGTACGCTTTTTTGTTGGCCTTCACAAAAGTCCTCTTGTTAACTTGGAACTCTGGA ACGAGGCTCGGACTTACGGTGATGTTCAGCTAATGCCCTTTGTTGATTATTACAGTCTCA TCAGTTGGAAAACACTAGCCATCTGCATCTTCGGGACAGAGGTTGACTCAGCCAAGTTCA TCATGAAAACGGATGATGACGCCTTTGTTCGTGTAGATGAAGTGTTACTTTCTTTATCAA TGACCAACAACACTCGCGGGTTAATATACGGACTGATCAATTCCGACTCTCAACCTATTC GAAACCCTGATAGCAAATGGTACATCAGTTATGAGGAATGGCCTGAAGAGAAATATCCAC CATGGGCGCATGGTCCAGGCTACATTGTATCTCGTGACATAGCAGAATCGGTTGGTAAGC TTTTCAAAGAAGGAAACCTAAAGATGTTTAAGCTAGAAGATGTGGCAATGGGGATATGGA TAGCTGAGCTGACAAAACATGGACTCGAGCCTCATTACGAAAACGATGGAAGGATCATTA GTGATGGATGCAAGGATGGTTATGTGGTTGCTCATTACCAAAGCCCTGCCGAAATGACTT GCCTTTGGCGTAAATACCAAGAAACCAAACGCTCTCTTTGCTGCCGCGAATGGTAAAAGG GTGGGCGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCA

> pMDC30-6xHis-GalT3 TAATATATTTACACAAGACTGGGGATCCTCTAGAGGATCCCCCGGGTACCGGGCCCCCCC TCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCC CTTCACCATGCATCATCATCATCATCACAAGCAATTCATGTCAGTGGTGAGATTCAAATT TGGTTTCACTTCAGTCAGAATGAGGGATTGGTCGGTGGGAGTCTCCATTATGGTTCTTAC ATTGATCTTCATCATCCGTTATGAACAATCTGATCACACTCACACTGTGGATGATTCTAG TATAGAAGGAGAGAGTGTTCATGAACCCGCAAAGAAGCCACATTTTATGACTTTGGAAGA TCTTGATTATCTATTTTCAAACAAGAGCTTTTTTGGAGAAGAAGAAGTGTCCAATGGAAT GCTTGTATGGTCTCGAATGCGTCCATTTCTTGAAAGGCCAGATGCTTTGCCAGAAACTGC TCAAGGGATAGAAGAAGCTACATTGGCAATGAAAGGTTTGGTTTTAGAAATCAATAGAGA GAAGAGAGCTTATTCTTCTGGTATGGTCTCTAAGGAAATTAGAAGAATCTGTCCGGATTT TGTCACTGCATTTGATAAGGATTTGTCTGGTTTAAGTCATGTACTTCTTGAGCTTCCTTG TGGTTTAATTGAAGATTCTTCAATAACTTTGGTTGGTATTCCTGATGAACATTCTAGTAG CTTCCAGATTCAGCTCGTTGGCTCGGGATTATCAGGAGAAACTCGTCGGCCAATAATCTT GCGTTACAATGTGAACTTTTCTAAACCATCGATAGTGCAAAATACATGGACAGAGAAGCT AGGTTGGGGAAACGAAGAGCGATGCCAATATCATGGATCATTGAAAAATCATTTAGTTGA TGAACTTCCTCTCTGCAACAAACAGACCGGTAGAATCATTTCGGAAAAGAGTTCCAACGA TGATGCAACTATGGAACTTTCTCTTTCAAATGCTAATTTTCCATTTCTCAAAGGGAGTCC CTTCACTGCCGCATTGTGGTTTGGCTTAGAAGGTTTTCATATGACGATAAATGGGCGGCA CGAGACTTCATTTGCTTACAGGGAGAAGCTCGAGCCATGGTTAGTCAGTGCAGTCAAAGT CTCAGGTGGTTTGAAAATTTTATCTGTCTTAGCCACAAGACTGCCCATTCCCGATGACCA TGCATCTTTAATCATAGAAGAGAAACTTAAAGCTCCATCTCTTTCCGGGACAAGAATAGA ACTATTGGTGGGTGTTTTCTCCACTGGAAATAATTTTAAGCGGCGTATGGCATTGAGAAG ATCTTGGATGCAATACGAGGCAGTAAGATCTGGCAAAGTAGCTGTTCGATTTCTCATTGG CCTTCACACAAATGAAAAAGTCAATTTAGAGATGTGGAGAGAATCTAAGGCATATGGAGA

223

CATTCAGTTTATGCCATTTGTTGACTACTATGGTTTACTTAGCTTGAAAACAGTTGCGCT TTGCATTCTCGGGACCAAAGTCATCCCAGCAAAATACATAATGAAGACGGATGATGATGC GTTTGTACGGATTGATGAGCTCCTATCAAGTCTAGAAGAAAGACCGTCTAGTGCCCTTCT GTACGGTTTGATCTCATTTGATTCATCACCGGACCGTGAACAAGGCAGCAAATGGTTTAT CCCTAAAGAGGAATGGCCTTTAGATTCATACCCTCCATGGGCACATGGCCCTGGCTACAT CATCTCTCATGATATAGCGAAATTTGTGGTGAAGGGTCACCGTCAAAGAGATCTTGGACT TTTCAAGCTGGAAGATGTGGCGATGGGGATATGGATTCAACAATTCAACCAGACGATAAA AAGAGTGAAGTACATCAATGACAAAAGATTTCATAACAGTGATTGTAAATCAAATTACAT TCTTGTTCATTACCAAACTCCTAGACTAATTTTGTGTCTTTGGGAGAAGCTGCAAAAAGA GAACCAATCTATTTGCTGCGAATAAAAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAA AGTGGTTCGATAATTCTTAATTAACTAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCG AATTCCCCGATCGTCAACA

> pMDC30-6xHis-GalT4 GAAGACACATCACAGAAAGAAGTGAAGGCATCGTTAGCAGTTTTGTAGATTCAACCTCAA TTTGCAGAGTTACGTTCTAATATATTTACACAAGACTGGGGATCCTCTAGAGGATCCCCG GGTACCGGGCCCCCCCTCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAAAAAGCAG GCTCCGCGGCCGCCCCCTTCACCATGCATCATCATCATCATCACAAGAAGTCTAAACTCG ATAATTCTTCTTCACAGATTCGATTCGGGCTTGTTCAGTTCTTATTAGTTGTTCTGCTTT TTTACTTCCTCTGCATGAGCTTCGAGATCCCATTCATCTTCAGAACCGGGTCTGGGTCCG GGTCTGATGATGTTTCATCTTCTTCTTTTGCTGACGCATTACCGAGACCAATGGTTGTTG GTGGTGGTAGTAGGGAAGCTAATTGGGTTGTCGGAGAAGAAGAAGAAGCAGACCCACATC GACATTTCAAGGATCCGGGTCGGGTACAGCTTCGGTTACCGGAGCGGAAAATGAGGGAAT TTAAGTCCGTCTCTGAGATTTTCGTCAACGAGAGCTTCTTCGACAATGGCGGATTCAGCG ATGAATTCTCAATCTTTCACAAAACAGCGAAGCATGCGATTTCAATGGGTCGAAAAATGT GGGACGGACTCGATTCGGGTTTAATCAAACCCGATAAAGCTCCGGTTAAGACCCGGATTG AGAAATGTCCGGATATGGTTTCGGTTTCTGAGTCGGAGTTTGTGAACCGGAGTCGGATCT TGGTTTTGCCGTGTGGGTTAACGTTAGGATCTCACATTACCGTCGTGGCTACGCCGCATT GGGCTCACGTTGAGAAAGATGGTGATAAGACGGCGATGGTGAGTCAGTTCATGATGGAGT TACAAGGATTAAAGGCGGTGGATGGTGAAGATCCGCCTCGGATACTTCATTTTAACCCGA GGATTAAAGGTGATTGGAGTGGAAGACCAGTGATTGAGCAAAACACTTGTTATCGAATGC AATGGGGCTCAGGTTTACGTTGTGATGGTCGTGAATCTAGTGATGATGAAGAATATGTTG ATGGAGAGGTGAAATGTGAGAGGTGGAAGAGAGATGATGATGATGGTGGTAATAATGGTG ATGATTTTGATGAATCAAAGAAGACATGGTGGTTGAATAGGTTGATGGGTCGGAGGAAGA AGATGATAACACATGATTGGGATTATCCTTTTGCTGAAGGGAAGCTTTTTGTTCTTACAC TTCGAGCTGGGATGGAAGGTTATCATATTAGTGTGAATGGAAGACATATCACATCTTTTC CTTATAGAACGGGGTTTGTTTTGGAGGATGCCACTGGATTAGCAGTCAAAGGAAACATTG ATGTGCATTCTGTATATGCTGCCTCGTTACCTTCTACAAATCCTAGTTTTGCACCGCAGA AGCATCTCGAGATGCAAAGGATATGGAAAGCTCCTTCGTTACCTCAGAAGCCTGTAGAGT

224

TGTTCATTGGAATTCTTTCTGCTGGTAATCATTTTGCAGAGAGAATGGCAGTGAGGAAGT CATGGATGCAGCAGAAGCTGGTCAGATCATCGAAAGTTGCTGCCCGGTTCTTTGTGGCAT TGCATGCAAGAAAGGAAGTCAATGTGGATTTAAAGAAAGAAGCTGAGTACTTTGGTGATA TTGTCATAGTACCGTACATGGATCATTATGACCTTGTTGTGCTCAAGACAGTTGCCATCT GCGAATATGGGGTGAACACAGTGGCGGCAAAGTACGTTATGAAATGTGACGATGATACAT TTGTGCGTGTGGATGCTGTGATCCAGGAAGCAGAAAAGGTTAAGGGAAGAGAGAGCCTTT ATATTGGAAACATTAATTTTAACCATAAGCCATTGCGTACCGGGAAATGGGCTGTGACAT TCGAGGAATGGCCAGAAGAGTATTATCCTCCATATGCAAATGGTCCGGGTTACATCTTGT CATATGATGTAGCTAAGTTCATTGTCGATGATTTTGAACAAAAGCGATTAAGATTATTCA AGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGAAGTTCAACGAGACTAGACCAGTGG CAGTGGTTCACAGCCTCAAGTTCTGTCAGTTTGGTTGCATAGAAGACTACTTCACCGCTC ATTATCAGTCGCCTCGCCAGATGATTTGCATGTGGGATAAGCTGCAGAGACTCGGGAAGC CCCAATGCTGCAACATGAGATGAAAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAAAG TGGTTCGATAATTCTTAATTAACTAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCGAA TTCCCCGATCGTCAAACA

> pMDC30-6xHis-GalT5 TACATCATTTTAAATACCCCATGTGTCCTTNGAAGACACATCACAGAAAGAAGTGAAGGC ATCGTTAGCAGTTTTGTAGATTCAACCTCAATTTGCAGAGTTACGTTCTAATATATTTAC ACAAGACTGGGGATCCTCTAGAGGATCCCCGGGTACCGGGCCCCCCCTCGAGGCGCGCCA AGCTATCAAACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCACCATGCAT CATCATCATCATCACAAAAAACCCAAATTGTCGAAAGTGGAAAAAATCGACAAGATTGAT CTGTTCTCTTCACTATGGAAGCAGAGATCGGTTCGTGTAATAATGGCAATAGGGTTTCTC TATCTTGTAATTGTCTCAGTAGAGATACCTCTCGTTTTCAAATCCTGGTCCAGCAGCTCC GTGCCTCTTGATGCTCTTTCTCGACTCGAGAAGCTCAATAACGAGCAAGAGCCCCAAGTC GAGATTATCCCTAATCCTCCATTGGAGCCAGTTTCGTACCCGGTTTCGAACCCGACCATT GTTACCCGGACGGACCTTGTTCAGAACAAGGTCCGTGAACATCATCGGGGTGTTCTCTCA AGTTTGAGATTTGATTCGGAAACTTTCGACCCGAGTAGTAAAGACGGGTCAGTGGAGCTT CACAAGTCAGCTAAGGAAGCTTGGCAGCTAGGTCGCAAGCTATGGAAGGAGCTTGAATCT GGAAGGCTTGAGAAACTAGTGGAGAAGCCAGAGAAGAACAAACCGGATTCATGTCCACAT TCTGTTTCGCTAACCGGGTCTGAGTTTATGAACCGGGAGAACAAATTGATGGAGCTGCCG TGTGGTTTGACATTGGGTTCACACATAACTTTGGTGGGGAGGCCGAGGAAAGCTCATCCC AAGGAAGGAGATTGGTCTAAGTTGGTGTCTCAGTTTGTGATAGAGCTTCAAGGTTTGAAG ACTGTTGAAGGAGAGGATCCTCCTAGGATTCTGCATTTCAATCCGAGGCTTAAGGGAGAT TGGAGCAAAAAACCGGTGATTGAGCAGAATAGTTGCTATAGGATGCAATGGGGACCTGCA CAACGTTGCGAAGGATGGAAGTCAAGAGATGATGAAGAGACTGTTGATAGTCATGTGAAG TGTGAAAAATGGATTCGTGATGATGACAATTACTCAGAAGGGTCGAGGGCAAGATGGTGG TTGAATAGACTTATAGGAAGGAGGAAAAGGGTCAAAGTAGAATGGCCGTTTCCTTTTGTG GAAGAGAAGCTGTTCGTTCTAACTCTTAGCGCCGGTTTAGAGGGTTACCATATCAATGTT

225

GATGGAAAGCATGTTACTTCTTTCCCTTATCGCACTGGTTTCACCCTTGAGGATGCAACA GGGCTAACAGTAAACGGAGACATTGATGTCCATTCTGTTTTTGTTGCCTCTCTGCCAACA TCACATCCTAGTTTTGCTCCCCAAAGGCATCTCGAATTGTCAAAGAGATGGCAGGCTCCT GTAGTTCCCGATGGGCCTGTGGAGATCTTTATAGGCATTCTTTCCGCAGGCAATCATTTC TCTGAGCGGATGGCTGTGAGGAAATCCTGGATGCAGCATGTTCTTATTACATCTGCAAAA GTTGTTGCTCGTTTCTTTGTGGCGCTGCATGGGAGGAAGGAGGTGAATGTGGAATTGAAG AAAGAAGCGGAGTATTTTGGGGACATTGTACTTGTTCCTTACATGGATAGCTATGATCTT GTCGTGCTGAAAACTGTTGCCATATGTGAACACGGAGCTCTTGCATTCTCTGCAAAGTAC ATAATGAAGTGTGACGATGATACATTTGTAAAACTTGGCGCGGTGATCAATGAAGTGAAA AAAGTACCCGAAGGTAGAAGCCTGTACATTGGTAACATGAATTATTACCACAAACCTCTC CGTGGGGGTAAATGGGCAGTCACATATGAGGAATGGCCAGAGGAGGACTATCCGCCCTAC GCAAATGGACCCGGATATGTTCTATCTTCTGACATTGCGCGCTTCATCGTGGACAAGTTT GAGAGACATAAATTACGGCTGTTCAAGATGGAGGACGTGAGTGTGGGAATGTGGGTTGAG CATTTCAAGAACACAACAAACCCAGTGGATTACAGACACAGTCTGAGATTCTGCCAGTTT GGTTGTGTTGAGAACTACTACACAGCTCATTACCAGTCGCCAAGACAGATGATATGCTTA TGGGATAAGCTCTTAAGACAGAACAAGCCTGAGTGTTGTAACATGAGATGAAAGGGTGGG CGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCTTAATTAACTAGTTCTAG AGCGGCCGCCACCGCGGTGGAGCTCGAATTCCCCGATCGTCAACA

> pMDC30-6xHis-GalT6 CCAAGCTCAAGCTGCTNTAGCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGAT CGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGAT TAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGCCA AGCTTGCATGCCTGCAGGTCGACGGATCCCCCCTGGTAGACCAATCCTAACCAATGTCTG GTTAAGATGGTCCAATCCCGAAACTTCTAGTTGCGGTTCGAAGAAGTCCAGAATGTTTCT GAAAGTTTCAGAAAATTCTAGTTTTGAGATTTTCAGAAGTACGGCATGATGATGCATAAC AAGGACTTTCTCGAAAGTACTATATTGCTCCTCTACATCATTTTAAATACCCCATGTGTC CTTTGAAGACACATCACAGAAAGAAGTGAAGGCATCGTTAGCAGTTTTGTAGATTCAACC TCAATTTGCAGAGTTACGTTCTAATATATTTACACAAGACTGGGGATCCTCTAGAGGATC CCCGGGTACCGGGCCCCCCCTCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAAAAA GCAGGCTCCGCGGCCGCCCCCTTCACCATGCATCATCATCATCATCACAGGAAGCCCAAG TTGTCAAAACTCGAGAGGTTAGAGAAATTCGACATTTTCGTTTCTCTAAGCAAGCAGAGA TCGGTTCAGATACTAATGGCGGTTGGGTTACTCTACATGCTTCTTATCACATTCGAAATC CCTTTCGTCTTCAAAACCGGGCTTAGTTCTTTATCTCAGGATCCGTTAACCCGACCCGAG AAGCACAATAGTCAGAGAGAGTTACAAGAGAGACGAGCTCCGACTCGACCTTTAAAGAGT CTGCTTTACCAGGAATCACAATCGGAATCACCGGCTCAGGGTTTAAGAAGAAGGACTCGG ATCCTTTCTAGTTTGAGATTCGACCCGGAAACGTTTAACCCGAGTAGCAAAGATGGGTCT GTGGAGCTTCATAAATCTGCTAAGGTAGCTTGGGAAGTTGGTCGAAAGATATGGGAAGAG CTTGAGTCTGGGAAAACGTTGAAAGCTTTGGAGAAGGAGAAGAAGAAGAAGATTGAGGAA

226

CATGGGACAAACTCGTGTTCTCTCTCTGTTTCCTTAACCGGGTCTGATCTTTTGAAACGT GGGAATATCATGGAGCTTCCATGTGGTTTAACTCTTGGGTCACATATTACAGTGGTTGGG AAGCCACGAGCTGCTCATTCGGAGAAGGACCCTAAGATATCGATGTTAAAGGAAGGAGAT GAAGCTGTGAAGGTTTCACAGTTTAAGTTGGAGCTTCAGGGTTTGAAAGCAGTGGAAGGA GAAGAGCCACCTCGGATACTCCACTTGAATCCAAGGCTTAAGGGTGATTGGAGTGGTAAG CCTGTGATTGAGCAGAACACTTGCTATAGAATGCAATGGGGCTCAGCACAAAGATGTGAA GGATGGAGATCTAGGGATGATGAAGAGACTGTTGATGGTCAGGTTAAGTGCGAGAAATGG GCTCGGGATGATAGCATTACATCTAAAGAAGAAGAGTCTAGCAAGGCGGCTTCATGGTGG CTTAGTCGATTAATAGGTCGGAGCAAGAAAGTAACTGTTGAATGGCCATTTCCATTCACA GTTGACAAGCTTTTCGTGCTTACTCTTAGTGCTGGATTGGAAGGCTACCATGTTAGTGTC GATGGGAAGCATGTCACTTCCTTTCCATACCGAACTGGATTTACGCTTGAGGATGCTACT GGTCTAACCATTAACGGGGACATAGATGTTCACTCTGTTTTCGCTGGCTCTCTCCCAACC TCGCATCCTAGTTTTTCTCCTCAGAGGCATCTTGAGCTCTCGAGCAATTGGCAAGCCCCA TCACTTCCTGATGAGCAAGTTGATATGTTCATTGGTATCCTTTCTGCTGGTAACCATTTT GCTGAGAGGATGGCTGTGAGGAGGTCGTGGATGCAACATAAACTCGTTAAATCTTCCAAA GTAGTGGCTCGGTTCTTTGTTGCACTGCACTCAAGGAAAGAAGTAAATGTGGAGCTAAAG AAGGAAGCTGAATTCTTTGGGGACATAGTTATAGTCCCTTACATGGACAGTTATGACCTT GTCGTCCTCAAAACCGTTGCAATTTGCGAGTACGGGGCTCATCAACTTGCAGCTAAATTC ATCATGAAGTGTGATGACGATACATTTGTACAAGTGGATGCGGTTCTTAGTGAAGCAAAG AAAACACCCACAGATAGAAGTCTATACATTGGCAACATCAATTATTATCACAAACCACTT CGCCAGGGTAAATGGTCTGTTACATATGAGGAATGGCCAGAGGAAGACTATCCACCTTAT GCTAATGGCCCCGGATACATATTATCAAACGATATATCTCGCTTTATCGTGAAAGAGTTT GAGAAACACAAATTAAGGATGTTCAAAATGGAAGATGTAAGCGTGGGAATGTGGGTAGAA CAATTCAACAATGGTACAAAACCGGTGGACTACATTCACAGCCTCAGGTTTTGTCAGTTT GGTTGCATAGAGAATTACTTGACGGCGCATTATCAGTCGCCGAGACAGATGATTTGCTTG TGGGATAAGCTGGTGTTGACAGGCAAACCTCAGTGCTGCAACATGAGATGAAAGGGTGGG CGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCTTAATTAACTAGTTCTAG AGCGGCCGCCACCGCGGTGGAGCTCGAATTCCCCGATCGTCAA

227

Part III. Sequencing results of the pMDC32-6xHis-GalT constructs.

The expression constructs of pMDC32-6xHis-GalTs were sequenced for full length coding regions of the 6xHis-GalT genes and partial sequences of the promoter and terminator regions. Color codes: blue: pMDC32 vector sequences; orange: the remaining pENTR/D vector sequence after recombination; purple: 6xHis tag; black: GalT coding region without ATG.

>pMDC32-6xHis-GalT1 AAAAGGAAGGTGGCTCCTACAAATGCCATCATTGCGATAAAGGAAAGGCCATCGTTGAAG ATGCCTCTGCCGACAGTGGTCCCAAAGATGGACCCCCACCCACGAGGAGCATCGTGGAAA AAGAAGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGATAACATGGTGGAGC ACGACACACTTGTCTACTCCAAAAATATCAAAGATACAGTCTCAGAAGACCAAAGGGCAA TTGAGACTTTTCAACAAAGGGTAATATCCGGAAACCTCCTCGGATTCCATTGCCCAGCTA TCTGTCACTTTATTGTGAAGATAGTGGAAAAGGAAGGTGGCTCCTACAAATGCCATCATT GCGATAAAGGAAAGGCCATCGTTGAAGATGCCTCTGCCGACAGTGGTCCCAAAGATGGAC CCCCACCCACGAGGAGCATCGTGGAAAAAGAAGACGTTCCAACCACGTCTTCAAAGCAAG TGGATTGATGTGATATCTCCACTGACGTAAGGGATGACGCACAATCCCACTATCCTTCGC AAGACCCTTCCTCTATATAAGGAAGTTCATTTCATTTGGAGAGGACCTCGACTCTAGAGG ATCCCCGGGTACCGGGCCCCCCCTCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAA AAAGCAGGCTCCGCGGCCGCCCCCTTCACCATGCATCATCATCATCATCACAAGAGATTT TATGGAGGGCTTCTTGTTGTATCAATGTGTATGTTCTTGACGGTGTATAGATATGTAGAC TTGAACACTCCTGTTGAAAAGCCTTATATTACTGCTGCTGCTTCTGTTGTTGTTACTCCT AACACCACTCTTCCTATGGAATGGCTGCGGATTACTCTCCCTGACTTTATGAAAGAAGCG AGGAATACTCAAGAAGCGATATCTGGTGATGATATCGCTGTTGTCTCGGGTTTGTTTGTT GAGCAGAATGTGTCTAAAGAAGAGAGAGAGCCTTTGCTTACTTGGAACCGTTTGGAAAGT CTTGTTGATAATGCACAGAGTTTAGTTAATGGAGTTGATGCTATTAAGGAAGCTGGCATT GTTTGGGAGAGTCTTGTGTCTGCTGTTGAAGCTAAGAAACTAGTTGATGTTAATGAAAAT CAGACGAGGAAAGGAAAAGAGGAGCTTTGTCCTCAGTTTCTAAGCAAAATGAATGCTACT GAAGCTGATGGGAGTAGTCTGAAGTTGCAAATTCCTTGTGGTTTGACTCAGGGTTCCTCC ATCACAGTTATTGGCATCCCAGATGGTCTTGTTGGTAGTTTTCGGATTGATCTAACGGGA CAACCGCTTCCAGGGGAGCCTGATCCACCCATCATTGTGCATTATAATGTTAGGCTTCTT GGTGACAAATCGACGGAAGACCCTGTGATTGTTCAAAACAGCTGGACGGCATCTCAGGAC TGGGGAGCTGAGGAACGCTGTCCAAAATTTGATCCTGATATGAATAAGAAAGTGGATGAC TTGGATGAATGCAACAAGATGGTTGGTGGAGAAATTAACCGAACTTCTTCAACTAGCTTG CAGTCCAACACTTCAAGGGGAGTTCCAGTAGCCAGGGAAGCATCTAAACATGAAAAATAC TTTCCTTTCAAGCAGGGTTTCTTATCGGTTGCTACACTTAGGGTGGGAACAGAGGGAATG CAGATGACAGTTGATGGGAAACATATAACTTCATTTGCTTTCCGCGATACACTGGAACCG TGGCTTGTTAGTGAAATACGGATTACAGGTGACTTTAGGTTAATATCCATTCTTGCCAGC GGTTTGCCCACATCAGAAGAATCAGAGCACGTTGTTGATCTAGAGGCACTAAAATCACCT

228

ACCCTTTCACCATTAAGGCCACTGGATCTCGTTATTGGTGTTTTCTCCACTGCGAACAAT TTTAAAAGACGGATGGCTGTGAGGAGAACATGGATGCAGTATGATGATGTAAGATCTGGA AGAGTTGCAGTACGCTTTTTTGTTGGCCTTCACAAAAGTCCTCTTGTTAACTTGGAACTC TGGAACGAGGCTCGGACTTACGGTGATGTTCAGCTAATGCCCTTTGTTGATTATTACAGT CTCATCAGTTGGAAAACACTAGCCATCTGCATCTTCGGGACAGAGGTTGACTCAGCCAAG TTCATCATGAAAACGGATGATGACGCCTTTGTTCGTGTAGATGAAGTGTTACTTTCTTTA TCAATGACCAACAACACTCGCGGGTTAATATACGGACTGATCAATTCCGACTCTCAACCT ATTCGAAACCCTGATAGCAAATGGTACATCAGTTATGAGGAATGGCCTGAAGAGAAATAT CCACCATGGGCGCATGGTCCAGGCTACATTGTATCTCGTGACATAGCAGAATCGGTTGGT AAGCTTTTCAAAGAAGGAAACCTAAAGATGTTTAAGCTAGAAGATGTGGCAATGGGGATA TGGATAGCTGAGCTGACAAAACATGGACTCGAGCCTCATTACGAAAACGATGGAAGGATC ATTAGTGATGGATGCAAGGATGGTTATGTGGTTGCTCATTACCAAAGCCCTGCCGAAATG ACTTGCCTTTGGCGTAAATACCAAGAAACCAAACGCTCTCTTTGCTGCCGCGAATGGTAA AAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCTTAATTAAC TAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCGAATTTCCCCGATCGTTCAAACATTT GGCAATAAAGTTTCTTAAGATTGAATCCTGTTGCCGGTCTTGCGATGATTATCATATAAT TTCTGTTGAATTACGTTAAGCATGTAATAATTAACATGTAATGCATGACGTTATTTATGA GATGGGTTTTTATGATTAGAGTCCCGCAATTATACNATTAATACGCGATAGAAAACAAAA TATAGCGCGCAAACTAGGATAACANG

>pMDC32-6xHis-GalT3 CTCTGCCGACAGTGGTCCCAAAGATGGACCCCCACCCACGAGGAGCATCGTGGAAAAAGA AGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGATATCTCCACTGACGTAAG GGATGACGCACAATCCCACTATCCTTCGCAAGACCCTTCCTCTATATAAGGAAGTTCATT TCATTTGGAGAGGACCTCGACTCTAGAGGATCCCCGGGTACCGGGCCCCCCCTCGAGGCG CGCCAAGCTATCAAACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCACCA TGCATCATCATCATCATCACAAGCAATTCATGTCAGTGGTGAGATTCAAATTTGGTTTCA CTTCAGTCAGAATGAGGGATTGGTCGGTGGGAGTCTCCATTATGGTTCTTACATTGATCT TCATCATCCGTTATGAACAATCTGATCACACTCACACTGTGGATGATTCTAGTATAGAAG GAGAGAGTGTTCATGAACCCGCAAAGAAGCCACATTTTATGACTTTGGAAGATCTTGATT ATCTATTTTCAAACAAGAGCTTTTTTGGAGAAGAAGAAGTGTCCAATGGAATGCTTGTAT GGTCTCGAATGCGTCCATTTCTTGAAAGGCCAGATGCTTTGCCAGAAACTGCTCAAGGGA TAGAAGAAGCTACATTGGCAATGAAAGGTTTGGTTTTAGAAATCAATAGAGAGAAGAGAG CTTATTCTTCTGGTATGGTCTCTAAGGAAATTAGAAGAATCTGTCCGGATTTTGTCACTG CATTTGATAAGGATTTGTCTGGTTTAAGTCATGTACTTCTTGAGCTTCCTTGTGGTTTAA TTGAAGATTCTTCAATAACTTTGGTTGGTATTCCTGATGAACATTCTAGTAGCTTCCAGA TTCAGCTCGTTGGCTCGGGATTATCAGGAGAAACTCGTCGGCCAATAATCTTGCGTTACA ATGTGAACTTTTCTAAACCATCGATAGTGCAAAATACATGGACAGAGAAGCTAGGTTGGG GAAACGAAGAGCGATGCCAATATCATGGATCATTGAAAAATCATTTAGTTGATGAACTTC

229

CTCTCTGCAACAAACAGACCGGTAGAATCATTTCGGAAAAGAGTTCCAACGATGATGCAA CTATGGAACTTTCTCTTTCAAATGCTAATTTTCCATTTCTCAAAGGGAGTCCCTTCACTG CCGCATTGTGGTTTGGCTTAGAAGGTTTTCATATGACGATAAATGGGCGGCACGAGACTT CATTTGCTTACAGGGAGAAGCTCGAGCCATGGTTAGTCAGTGCAGTCAAAGTCTCAGGTG GTTTGAAAATTTTATCTGTCTTAGCCACAAGACTGCCCATTCCCGATGACCATGCATCTT TAATCATAGAAGAGAAACTTAAAGCTCCATCTCTTTCCGGGACAAGAATAGAACTATTGG TGGGTGTTTTCTCCACTGGAAATAATTTTAAGCGGCGTATGGCATTGAGAAGATCTTGGA TGCAATACGAGGCAGTAAGATCTGGCAAAGTAGCTGTTCGATTTCTCATTGGCCTTCACA CAAATGAAAAAGTCAATTTAGAGATGTGGAGAGAATCTAAGGCATATGGAGACATTCAGT TTATGCCATTTGTTGACTACTATGGTTTACTTAGCTTGAAAACAGTTGCGCTTTGCATTC TCGGGACCAAAGTCATCCCAGCAAAATACATAATGAAGACGGATGATGATGCGTTTGTAC GGATTGATGAGCTCCTATCAAGTCTAGAAGAAAGACCGTCTAGTGCCCTTCTGTACGGTT TGATCTCATTTGATTCATCACCGGACCGTGAACAAGGCAGCAAATGGTTTATCCCTAAAG AGGAATGGCCTTTAGATTCATACCCTCCATGGGCACATGGCCCTGGCTACATCATCTCTC ATGATATAGCGAAATTTGTGGTGAAGGGTCACCGTCAAAGAGATCTTGGACTTTTCAAGC TGGAAGATGTGGCGATGGGGATATGGATTCAACAATTCAACCAGACGATAAAAAGAGTGA AGTACATCAATGACAAAAGATTTCATAACAGTGATTGTAAATCAAATTACATTCTTGTTC ATTACCAAACTCCTAGACTAATTTTGTGTCTTTGGGAGAAGCTGCAAAAAGAGAACCAAT CTATTTGCTGCGAATAAAAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTC GATAATTCTTAATTAACTAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCGAACANG

>pMDC32-6xHis-GalT4 ACGTCTTCAAAGCAAGTGGATTGATGTGATATCTCCACTGACGTAAGGGATGACGCACAA TCCCACTATCCTTCGCAAGACCCTTCCTCTATATAAGGAAGTTCATTTCATTTGGAGAGG ACCTCGACTCTAGAGGATCCCCGGGTACCGGGCCCCCCCTCGAGGCGCGCCAAGCTATCA AACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCACCATGCATCATCATCA TCATCACAAGAAGTCTAAACTCGATAATTCTTCTTCACAGATTCGATTCGGGCTTGTTCA GTTCTTATTAGTTGTTCTGCTTTTTTACTTCCTCTGCATGAGCTTCGAGATCCCATTCAT CTTCAGAACCGGGTCTGGGTCCGGGTCTGATGATGTTTCATCTTCTTCTTTTGCTGACGC ATTACCGAGACCAATGGTTGTTGGTGGTGGTAGTAGGGAAGCTAATTGGGTTGTCGGAGA AGAAGAAGAAGCAGACCCACATCGACATTTCAAGGATCCGGGTCGGGTACAGCTTCGGTT ACCGGAGCGGAAAATGAGGGAATTTAAGTCCGTCTCTGAGATTTTCGTCAACGAGAGCTT CTTCGACAATGGCGGATTCAGCGATGAATTCTCAATCTTTCACAAAACAGCGAAGCATGC GATTTCAATGGGTCGAAAAATGTGGGACGGACTCGATTCGGGTTTAATCAAACCCGATAA AGCTCCGGTTAAGACCCGGATTGAGAAATGTCCGGATATGGTTTCGGTTTCTGAGTCGGA GTTTGTGAACCGGAGTCGGATCTTGGTTTTGCCGTGTGGGTTAACGTTAGGATCTCACAT TACCGTCGTGGCTACGCCGCATTGGGCTCACGTTGAGAAAGATGGTGATAAGACGGCGAT GGTGAGTCAGTTCATGATGGAGTTACAAGGATTAAAGGCGGTGGATGGTGAAGATCCGCC TCGGATACTTCATTTTAACCCGAGGATTAAAGGTGATTGGAGTGGAAGACCAGTGATTGA

230

GCAAAACACTTGTTATCGAATGCAATGGGGCTCAGGTTTACGTTGTGATGGTCGTGAATC TAGTGATGATGAAGAATATGTTGATGGAGAGGTGAAATGTGAGAGGTGGAAGAGAGATGA TGATGATGGTGGTAATAATGGTGATGATTTTGATGAATCAAAGAAGACATGGTGGTTGAA TAGGTTGATGGGTCGGAGGAAGAAGATGATAACACATGATTGGGATTATCCTTTTGCTGA AGGGAAGCTTTTTGTTCTTACACTTCGAGCTGGGATGGAAGGTTATCATATTAGTGTGAA TGGAAGACATATCACATCTTTTCCTTATAGAACGGGGTTTGTTTTGGAGGATGCCACTGG ATTAGCAGTCAAAGGAAACATTGATGTGCATTCTGTATATGCTGCCTCGTTACCTTCTAC AAATCCTAGTTTTGCACCGCAGAAGCATCTCGAGATGCAAAGGATATGGAAAGCTCCTTC GTTACCTCAGAAGCCTGTAGAGTTGTTCATTGGAATTCTTTCTGCTGGTAATCATTTTGC AGAGAGAATGGCAGTGAGGAAGTCATGGATGCAGCAGAAGCTGGTCAGATCATCGAAAGT TGCTGCCCGGTTCTTTGTGGCATTGCATGCAAGAAAGGAAGTCAATGTGGATTTAAAGAA AGAAGCTGAGTACTTTGGTGATATTGTCATAGTACCGTACATGGATCATTATGACCTTGT TGTGCTCAAGACAGTTGCCATCTGCGAATATGGGGTGAACACAGTGGCGGCAAAGTACGT TATGAAATGTGACGATGATACATTTGTGCGTGTGGATGCTGTGATCCAGGAAGCAGAAAA GGTTAAGGGAAGAGAGAGCCTTTATATTGGAAACATTAATTTTAACCATAAGCCATTGCG TACCGGGAAATGGGCTGTGACATTCGAGGAATGGCCAGAAGAGTATTATCCTCCATATGC AAATGGTCCGGGTTACATCTTGTCATATGATGTAGCTAAGTTCATTGTCGATGATTTTGA ACAAAAGCGATTAAGATTATTCAAGATGGAAGATGTGAGCATGGGAATGTGGGTGGAGAA GTTCAACGAGACTAGACCAGTGGCAGTGGTTCACAGCCTCAAGTTCTGTCAGTTTGGTTG CATAGAAGACTACTTCACCGCTCATTATCAGTCGCCTCGCCAGATGATTTGCATGTGGGA TAAGCTGCAGAGACTCGGGAAGCCCCAATGCTGCAACATGAGATGAAAGGGTGGGCGCGC CGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCTTAATTAACTAGTTCTAGAGCGG CCGCCACCGCGGTGGAGCTCGAATTCCCCGATCGTCAACA

>pMDC32-6xHis-GalT5 AAGTTCATTTCATTTGGAGAGGACCTCGACTCTAGAGGATCCCCGGGTACCGGGCCCCCC CTCGAGGCGCGCCAAGCTATCAAACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCC CCTTCACCATGCATCATCATCATCATCACAAAAAACCCAAATTGTCGAAAGTGGAAAAAA TCGACAAGATTGATCTGTTCTCTTCACTATGGAAGCAGAGATCGGTTCGTGTAATAATGG CAATAGGGTTTCTCTATCTTGTAATTGTCTCAGTAGAGATACCTCTCGTTTTCAAATCCT GGTCCAGCAGCTCCGTGCCTCTTGATGCTCTTTCTCGACTCGAGAAGCTCAATAACGAGC AAGAGCCCCAAGTCGAGATTATCCCTAATCCTCCATTGGAGCCAGTTTCGTACCCGGTTT CGAACCCGACCATTGTTACCCGGACGGACCTTGTTCAGAACAAGGTCCGTGAACATCATC GGGGTGTTCTCTCAAGTTTGAGATTTGATTCGGAAACTTTCGACCCGAGTAGTAAAGACG GGTCAGTGGAGCTTCACAAGTCAGCTAAGGAAGCTTGGCAGCTAGGTCGCAAGCTATGGA AGGAGCTTGAATCTGGAAGGCTTGAGAAACTAGTGGAGAAGCCAGAGAAGAACAAACCGG ATTCATGTCCACATTCTGTTTCGCTAACCGGGTCTGAGTTTATGAACCGGGAGAACAAAT TGATGGAGCTGCCGTGTGGTTTGACATTGGGTTCACACATAACTTTGGTGGGGAGGCCGA GGAAAGCTCATCCCAAGGAAGGAGATTGGTCTAAGTTGGTGTCTCAGTTTGTGATAGAGC

231

TTCAAGGTTTGAAGACTGTTGAAGGAGAGGATCCTCCTAGGATTCTGCATTTCAATCCGA GGCTTAAGGGAGATTGGAGCAAAAAACCGGTGATTGAGCAGAATAGTTGCTATAGGATGC AATGGGGACCTGCACAACGTTGCGAAGGATGGAAGTCAAGAGATGATGAAGAGACTGTTG ATAGTCATGTGAAGTGTGAAAAATGGATTCGTGATGATGACAATTACTCAGAAGGGTCGA GGGCAAGATGGTGGTTGAATAGACTTATAGGAAGGAGGAAAAGGGTCAAAGTAGAATGGC CGTTTCCTTTTGTGGAAGAGAAGCTGTTCGTTCTAACTCTTAGCGCCGGTTTAGAGGGTT ACCATATCAATGTTGATGGAAAGCATGTTACTTCTTTCCCTTATCGCACTGGTTTCACCC TTGAGGATGCAACAGGGCTAACAGTAAACGGAGACATTGATGTCCATTCTGTTTTTGTTG CCTCTCTGCCAACATCACATCCTAGTTTTGCTCCCCAAAGGCATCTCGAATTGTCAAAGA GATGGCAGGCTCCTGTAGTTCCCGATGGGCCTGTGGAGATCTTTATAGGCATTCTTTCCG CAGGCAATCATTTCTCTGAGCGGATGGCTGTGAGGAAATCCTGGATGCAGCATGTTCTTA TTACATCTGCAAAAGTTGTTGCTCGTTTCTTTGTGGCGCTGCATGGGAGGAAGGAGGTGA ATGTGGAATTGAAGAAAGAAGCGGAGTATTTTGGGGACATTGTACTTGTTCCTTACATGG ATAGCTATGATCTTGTCGTGCTGAAAACTGTTGCCATATGTGAACACGGAGCTCTTGCAT TCTCTGCAAAGTACATAATGAAGTGTGACGATGATACATTTGTAAAACTTGGCGCGGTGA TCAATGAAGTGAAAAAAGTACCCGAAGGTAGAAGCCTGTACATTGGTAACATGAATTATT ACCACAAACCTCTCCGTGGGGGTAAATGGGCAGTCACATATGAGGAATGGCCAGAGGAGG ACTATCCGCCCTACGCAAATGGACCCGGATATGTTCTATCTTCTGACATTGCGCGCTTCA TCGTGGACAAGTTTGAGAGACATAAATTACGGCTGTTCAAGATGGAGGACGTGAGTGTGG GAATGTGGGTTGAGCATTTCAAGAACACAACAAACCCAGTGGATTACAGACACAGTCTGA GATTCTGCCAGTTTGGTTGTGTTGAGAACTACTACACAGCTCATTACCAGTCGCCAAGAC AGATGATATGCTTATGGGATAAGCTCTTAAGACAGAACAAGCCTGAGTGTTGTAACATGA GATGAAAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAAAGTGGTTCGATAATTCTTAA TTAACTAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCGAANTCCCCGATCGTCAAACA

>pMDC32-6xHis-GalT6 TGCCGACCNGTGGTCCCAAAAGATGGACCCCCCNNCCCACGAGGAGCATCGTGGAAAAAG NAAGACGTTCCAACCACGTCTTCAAAGCAAGTGGATTGATGTGATATCTCCNCTGACGTA AGGGATGACGCACAATCCCACTATCNTTCGCAAGACCCTTCCTCTATATAAGGAAGTTCA TTTCATTTGGAGAGGACCTCGACTCTAGAGGATCCCCGGGTACCGGGCCCCCCCTCGAGG CGCGCCAAGCTATCAAACAAGTTTGTACAAAAAAGCAGGCTCCGCGGCCGCCCCCTTCAC CATGCATCATCATCATCATCACAGGAAGCCCAAGTTGTCAAAACTCGAGAGGTTAGAGAA ATTCGACATTTTCGTTTCTCTAAGCAAGCAGAGATCGGTTCAGATACTAATGGCGGTTGG GTTACTCTACATGCTTCTTATCACATTCGAAATCCCTTTCGTCTTCAAAACCGGGCTTAG TTCTTTATCTCAGGATCCGTTAACCCGACCCGAGAAGCACAATAGTCAGAGAGAGTTACA AGAGAGACGAGCTCCGACTCGACCTTTAAAGAGTCTGCTTTACCAGGAATCACAATCGGA ATCACCGGCTCAGGGTTTAAGAAGAAGGACTCGGATCCTTTCTAGTTTGAGATTCGACCC GGAAACGTTTAACCCGAGTAGCAAAGATGGGTCTGTGGAGCTTCATAAATCTGCTAAGGT AGCTTGGGAAGTTGGTCGAAAGATATGGGAAGAGCTTGAGTCTGGGAAAACGTTGAAAGC

232

TTTGGAGAAGGAGAAGAAGAAGAAGATTGAGGAACATGGGACAAACTCGTGTTCTCTCTC TGTTTCCTTAACCGGGTCTGATCTTTTGAAACGTGGGAATATCATGGAGCTTCCATGTGG TTTAACTCTTGGGTCACATATTACAGTGGTTGGGAAGCCACGAGCTGCTCATTCGGAGAA GGACCCTAAGATATCGATGTTAAAGGAAGGAGATGAAGCTGTGAAGGTTTCACAGTTTAA GTTGGAGCTTCAGGGTTTGAAAGCAGTGGAAGGAGAAGAGCCACCTCGGATACTCCACTT GAATCCAAGGCTTAAGGGTGATTGGAGTGGTAAGCCTGTGATTGAGCAGAACACTTGCTA TAGAATGCAATGGGGCTCAGCACAAAGATGTGAAGGATGGAGATCTAGGGATGATGAAGA GACTGTTGATGGTCAGGTTAAGTGCGAGAAATGGGCTCGGGATGATAGCATTACATCTAA AGAAGAAGAGTCTAGCAAGGCGGCTTCATGGTGGCTTAGTCGATTAATAGGTCGGAGCAA GAAAGTAACTGTTGAATGGCCATTTCCATTCACAGTTGACAAGCTTTTCGTGCTTACTCT TAGTGCTGGATTGGAAGGCTACCATGTTAGTGTCGATGGGAAGCATGTCACTTCCTTTCC ATACCGAACTGGATTTACGCTTGAGGATGCTACTGGTCTAACCATTAACGGGGACATAGA TGTTCACTCTGTTTTCGCTGGCTCTCTCCCAACCTCGCATCCTAGTTTTTCTCCTCAGAG GCATCTTGAGCTCTCGAGCAATTGGCAAGCCCCATCACTTCCTGATGAGCAAGTTGATAT GTTCATTGGTATCCTTTCTGCTGGTAACCATTTTGCTGAGAGGATGGCTGTGAGGAGGTC GTGGATGCAACATAAACTCGTTAAATCTTCCAAAGTAGTGGCTCGGTTCTTTGTTGCACT GCACTCAAGGAAAGAAGTAAATGTGGAGCTAAAGAAGGAAGCTGAATTCTTTGGGGACAT AGTTATAGTCCCTTACATGGACAGTTATGACCTTGTCGTCCTCAAAACCGTTGCAATTTG CGAGTACGGGGCTCATCAACTTGCAGCTAAATTCATCATGAAGTGTGATGACGATACATT TGTACAAGTGGATGCGGTTCTTAGTGAAGCAAAGAAAACACCCACAGATAGAAGTCTATA CATTGGCAACATCAATTATTATCACAAACCACTTCGCCAGGGTAAATGGTCTGTTACATA TGAGGAATGGCCAGAGGAAGACTATCCACCTTATGCTAATGGCCCCGGATACATATTATC AAACGATATATCTCGCTTTATCGTGAAAGAGTTTGAGAAACACAAATTAAGGATGTTCAA AATGGAAGATGTAAGCGTGGGAATGTGGGTAGAACAATTCAACAATGGTACAAAACCGGT GGACTACATTCACAGCCTCAGGTTTTGTCAGTTTGGTTGCATAGAGAATTACTTGACGGC GCATTATCAGTCGCCGAGACAGATGATTTGCTTGTGGGATAAGCTGGTGTTGACAGGCAA ACCTCAGTGCTGCAACATGAGATGAAAGGGTGGGCGCGCCGACCCAGCTTTCTTGTACAA AGTGGTTCGATAATTCTTAATTAACTAGTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCG AACANG

233

Part IV. The sequencing result of the pVKH18En6-GalT3-YFP construct

The expression construct of pVKH18En6-6xHis-GalT3-YFP were sequenced for full length coding region of the 6xHis-GalT3 gene and partial sequences of the vector sequence. Color codes: blue: pVKH18En6 vector sequences; black: GalT coding region without the stop codon TAA; red: single nucleotide mutation from T to C at nucleotide position 1446 for cloning. The amino acid encoded is not affected by the mutation. The YFP on the pVKH18En6 construct is in frame with the GalT3 gene.

CAACAACATTACATTTTACATTCTACAACTACATCTAGATGAAGCAATTCATGTCAGTGG TGAGATTCAAATTTGGTTTCACTTCAGTCAGAATGAGGGATTGGTCGGTGGGAGTCTCCA TTATGGTTCTTACATTGATCTTCATCATCCGTTATGAACAATCTGATCACACTCACACTG TGGATGATTCTAGTATAGAAGGAGAGAGTGTTCATGAACCCGCAAAGAAGCCACATTTTA TGACTTTGGAAGATCTTGATTATCTATTTTCAAACAAGAGCTTTTTTGGAGAAGAAGAAG TGTCCAATGGAATGCTTGTATGGTCTCGAATGCGTCCATTTCTTGAAAGGCCAGATGCTT TGCCAGAAACTGCTCAAGGGATAGAAGAAGCTACATTGGCAATGAAAGGTTTGGTTTTAG AAATCAATAGAGAGAAGAGAGCTTATTCTTCTGGTATGGTCTCTAAGGAAATTAGAAGAA TCTGTCCGGATTTTGTCACTGCATTTGATAAGGATTTGTCTGGTTTAAGTCATGTACTTC TTGAGCTTCCTTGTGGTTTAATTGAAGATTCTTCAATAACTTTGGTTGGTATTCCTGATG AACATTCTAGTAGCTTCCAGATTCAGCTCGTTGGCTCGGGATTATCAGGAGAAACTCGTC GGCCAATAATCTTGCGTTACAATGTGAACTTTTCTAAACCATCGATAGTGCAAAATACAT GGACAGAGAAGCTAGGTTGGGGAAACGAAGAGCGATGCCAATATCATGGATCATTGAAAA ATCATTTAGTTGATGAACTTCCTCTCTGCAACAAACAGACCGGTAGAATCATTTCGGAAA AGAGTTCCAACGATGATGCAACTATGGAACTTTCTCTTTCAAATGCTAATTTTCCATTTC TCAAAGGGAGTCCCTTCACTGCCGCATTGTGGTTTGGCTTAGAAGGTTTTCATATGACGA TAAATGGGCGGCACGAGACTTCATTTGCTTACAGGGAGAAGCTCGAGCCATGGTTAGTCA GTGCAGTCAAAGTCTCAGGTGGTTTGAAAATTTTATCTGTCTTAGCCACAAGACTGCCCA TTCCCGATGACCATGCATCTTTAATCATAGAAGAGAAACTTAAAGCTCCATCTCTTTCCG GGACAAGAATAGAACTATTGGTGGGTGTTTTCTCCACTGGAAATAATTTTAAGCGGCGTA TGGCATTGAGAAGATCTTGGATGCAATACGAGGCAGTAAGATCTGGCAAAGTAGCTGTTC GATTTCTCATTGGCCTTCACACAAATGAAAAAGTCAATTTAGAGATGTGGAGAGAATCTA AGGCATATGGAGACATTCAGTTTATGCCATTTGTTGACTACTATGGTTTACTTAGCTTGA AAACAGTTGCGCTTTGCATTCTCGGGACCAAAGTCATCCCAGCAAAATACATAATGAAGA CGGATGATGATGCGTTTGTACGGATTGATGAGCTCCTATCAAGCTAGAAGAAAGACCGT CTAGTGCCCTTCTGTACGGTTTGATCTCATTTGATTCATCACCGGACCGTGAACAAGGCA GCAAATGGTTTATCCCTAAAGAGGAATGGCCTTTAGATTCATACCCTCCATGGGCACATG GCCCTGGCTACATCATCTCTCATGATATAGCGAAATTTGTGGTGAAGGGTCACCGTCAAA GAGATCTTGGACTTTTCAAGCTGGAAGATGTGGCGATGGGGATATGGATTCAACAATTCA ACCAGACGATAAAAAGAGTGAAGTACATCAATGACAAAAGATTTCATAACAGTGATTGTA

234

AATCAAATTACATTCTTGTTCATTACCAAACTCCTAGACTAATTTTGTGTCTTTGGGAGA AGCTGCAAAAAGAGAACCAATCTATTTGCTGCGAAAAG TCG ACT GTG AGC AAG GGC GAG GAG CTG TTC ACC GGG GTG GTG CCC ATC CTG GTC GAG CTG GAC GGC GAC GTA AAC GGC CAC AAG T

235

APPENDIX C: EXPANDED LIST OF ALL PLANT CELL WALL GLYCAN-

DIRECTED MONOCLONAL ANTIBODIES (MABS) USED IN THIS STUDY

FOR GLYCOME PROFILING

The groupings of antibodies are based on a hierarchical clustering analysis of all

mAbs screened against a panel of plant polysaccharide preparations (Pattathil et al., 2010;

2012) that groups the mAbs according to the polysaccharides that they predominantly

recognize. The majority of listings link to the WallMabDB plant cell wall monoclonal antibody database (http://www.wallmabdb.net) that provides detailed descriptions of each

mAb, including immunogen, antibody isotype, epitope structure (to the extent known), supplier information, and related literature citations.

Glycan Group Recognized mAb Name

CCRC-M54

CCRC-M48

CCRC-M49

CCRC-M96

CCRC-M50

CCRC-M51

CCRC-M53 CCRC-M100 CCRC-M103 Non-Fucosylated

CCRC-M58 Xyloglucan

CCRC-M86

CCRC-M55

CCRC-M52

CCRC-M99

CCRC-M95 CCRC-M101 CCRC-M104

CCRC-M89

CCRC-M93

236

CCRC-M87

CCRC-M88

CCRC-M57

CCRC-M90 CCRC-M102

CCRC-M39 Fucosylated CCRC-M106 Xyloglucan

CCRC-M84

CCRC-M1 CCRC-M111 Xylan 1/XG CCRC-M108 CCRC-M109

CCRC-M119 CCRC-M115 CCRC-M110 CCRC-M105 CCRC-M117 CCRC-M113 Xylan 2 CCRC-M120 CCRC-M118 CCRC-M116 CCRC-M114 CCRC-M154 CCRC-M150 CCRC-M160 CCRC-M137 CCRC-M152 CCRC-M149 Xylan 3 CCRC-M144 CCRC-M146 CCRC-M145 CCRC-M155 CCRC-M153 CCRC-M151 CCRC-M148 Xylan 4 CCRC-M140 CCRC-M139 CCRC-M138

Seed CCRC-M75

Galactomannan CCRC-M70

237

CCRC-M74 CCRC-M131

CCRC-M38

Homogalacturonan JIM5

Backbone JIM136

JIM7

CCRC-M34

CCRC-M69

CCRC-M35

Rhamnogalacturonan I CCRC-M36

Backbone CCRC-M14 CCRC-M129

CCRC-M72 Linseed Mucilage

CCRC-M40 Rhamnogalacturonan I

Physcomitrella CCRC-M98

Pectin CCRC-M94

CCRC-M5 Rhamnogalacturonan IA

CCRC-M2

CCRC-M23

CCRC-M17

CCRC-M19 Rhamnogalacturonan IB

CCRC-M18

CCRC-M56

CCRC-M16

JIM137

JIM101 Rhamnogalacturonan IC

CCRC-M61

CCRC-M30

CCRC-M60

CCRC-M41

CCRC-M80

CCRC-M79

CCRC-M44

Rhamnogalacturonan I CCRC-M33

Arabinogalactan CCRC-M32

CCRC-M13

CCRC-M42

CCRC-M24

CCRC-M12

CCRC-M7

238

CCRC-M77

CCRC-M25

CCRC-M9 CCRC-M128 CCRC-M126 CCRC-M134 CCRC-M125 CCRC-M123 CCRC-M122 CCRC-M121 CCRC-M112

CCRC-M21

JIM131

CCRC-M22

JIM132

JIM1

CCRC-M15

CCRC-M8

MH4.3E5

JIM16

JIM93

JIM94

Arabinogalactan 1 JIM11

MAC204

JIM20

JIM14

MAC207

JIM19 Arabinogalactan 2

JIM12 CCRC-M133 CCRC-M107

JIM4

CCRC-M31

JIM17

CCRC-M26

JIM15 Arabinogalactan 3

JIM8

CCRC-M85

CCRC-M81

MAC266

PN 16.4B4

239

JIM133

JIM13

Arabinogalactan 4 CCRC-M92

CCRC-M91

CCRC-M78

MAC265 Unidentified

CCRC-M97

! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! !

Thesis and Dissertation Services ! !