Functional and Structural Characterization Reveals Novel
FBXW7 Biology
by
Tonny Chao Huang
A thesis submitted in conformity with the requirements for the degree of Master of Science
Department of Medical Biophysics
University of Toronto
© Copyright by Tonny Chao Huang 2018
Functional and Structural Characterization Reveals Novel FBXW7
Biology
Tonny Chao Huang Master of Science
Department of Medical Biophysics
University of Toronto
2018
Abstract
This thesis aims to examine aspects of FBXW7 biology, a protein that is frequently mutated in a variety of cancers. The first part of this thesis describes the characterization of FBXW7 isoform and mutant substrate profiles using a proximity-dependent biotinylation assay. Isoform-specific substrates were validated, revealing the involvement of FBXW7 in the regulation of several protein complexes. Characterization of FBXW7 mutants also revealed site- and residue-specific consequences on the binding of substrates and, surprisingly, possible neo-substrates.
In the second part of this thesis, we utilize high-throughput peptide binding assays and statistical modelling to discover novel features of the FBXW7-binding phosphodegron. In contrast to the canonical motif, a possible preference of FBXW7 for arginine residues at the +4 position was discovered. I then attempted to validate this feature in vivo and in vitro on a novel substrate discovered through BioID.
ii
Acknowledgments
The past three years in the Department of Medical Biophysics have defied expectations. I not only had the opportunity to conduct my own independent research, but also to work with distinguished collaborators and to explore exciting complementary fields. I experienced the freedom to guide my own academic development, as well as to pursue my extracurricular interests. Perhaps most significantly, the amount of personal development I have experienced during this journey has significantly changed my views of my myself and my place within the world.
For this, there are those who I will be forever grateful for their guidance, mentorship, and friendship. I would like to first thank my supervisor, Brian. His knowledge and expertise helped guide my research, but it was his kindness and support that allowed me the freedom to complete my graduate studies in my own way. I would also like to thank all members of the Raught Lab that I have had the privilege to cross paths with. The camaraderie shared between the graduate students in the Lab has helped me greatly in completing my degree, so for that I would like to thank Meg, Aaron, Deb, Diana, and Adam. I would also like to thank Étienne, Estelle, and Faith for their technical support and for helping me to get started working in the Lab. I have also befriended many wonderful people within the Department who have helped me along the way, including Nina, Parasvi, Justin, Stanley, Javier, and Danton, to name a few. Lastly, I would like to thank my family for their unending support, without which none of this would have been possible.
If any regrets were to be had, it would be that I was ultimately unsuccessful in what I had sought out to accomplish at the outset of my studies. While my goals and ambitions now lie elsewhere, I remain hopeful that the contents contained within this thesis may one day be shared beyond the limits of this document.
iii
Table of Contents
Acknowledgments.......................................................................................................................... iii Table of Contents........................................................................................................................... iv List of Figures............................................................................................................................... vii List of Tables ................................................................................................................................. ix Abbreviations...................................................................................................................................x List of Appendices ........................................................................................................................ xii Chapter 1 – Introduction ..................................................................................................................1
Introduction.................................................................................................................................2 1.1 The ubiquitin-proteasome system........................................................................................2
1.1.1 Ubiquitin-mediated proteolysis................................................................................3 1.1.2 The ubiquitylation cascade ......................................................................................9 1.1.3 Ubiquitin E3 ligase types.......................................................................................12
1.2 The FBXW7 protein ..........................................................................................................18
1.2.1 F-box proteins and the SCF complex.....................................................................18 1.2.2 Structure and organization of the FBXW7 protein................................................26 1.2.3 FBXW7 substrates in health and disease...............................................................30
1.3 Proteomic approaches in studying protein-protein interactions.........................................35
1.3.1 Strategies for the study of protein-protein interactions in vivo..............................35 1.3.2 Proximity-dependent biotinylation assays .............................................................40 1.3.3 Protein identification via mass spectrometry.........................................................45
1.4 Thesis motivation and outline............................................................................................48
Chapter 2 – Elucidation of substrate profiles of FBXW7 and mutants through BioID.................49
Elucidation of substrate profiles of FBXW7 isoforms and mutants through BioID.................50 2.1 Chapter overview...............................................................................................................50 2.2 Contributions......................................................................................................................50
iv
2.3 Materials and methods .......................................................................................................52
2.3.1 Plasmids.................................................................................................................52 2.3.2 Cell lines ................................................................................................................52 2.3.3 BioID and biotin-streptavidin affinity purification................................................52 2.3.4 Mass spectrometry .................................................................................................53 2.3.5 Immunoblotting......................................................................................................54 2.3.6 Substrate validation via cycloheximide chase .......................................................54 2.3.7 Immunofluorescence imaging................................................................................55 2.3.8 Data analysis and visualization..............................................................................55
2.4 Results................................................................................................................................56
2.4.1 Expression and localization of FlagBirA-FBXW7 isoforms.................................56 2.4.2 FBXW7 isoforms exhibit distinct substrate profiles..............................................59 2.4.3 CHX chase reveals novel FBXW7 interactors.......................................................64 2.4.4 Effect of hotspot mutations on substrate binding is site- and residue-specific......70
2.5 Discussion..........................................................................................................................72
Chapter 3 – Discovery of novel FBXW7 phosphodegron features ...............................................77
Discovery of novel FBXW7 phosphodegron features ..............................................................78 3.1 Chapter overview...............................................................................................................78 3.2 Contributions......................................................................................................................78 3.3 Materials and methods .......................................................................................................79
3.3.1 Generation of peptide-binding models...................................................................79 3.3.2 Peptide array synthesis and binding.......................................................................79 3.3.3 Mutant phosphodegron cloning and CHX chase ...................................................80 3.3.4 Fluorescence polarization ......................................................................................80
3.4 Results................................................................................................................................81
3.4.1 Performance of the Cdc4 model ............................................................................81
v
3.4.2 Analysis of the FBXW7 peptide-binding array .....................................................84 3.4.3 Evaluation of phosphodegron features...................................................................88
3.5 Discussion..........................................................................................................................92
References......................................................................................................................................94 Appendices...................................................................................................................................112
vi
List of Figures
Figure 1.1: Comparison of lysosomal and proteasomal protein degradation ..................................4 Figure 1.2: Structure and function of the 26S proteasome ..............................................................7 Figure 1.3: Generalized overview of the ubiquitylation cascade.....................................................9 Figure 1.4: Overview of three primary types of E3 ligases ...........................................................13 Figure 1.5: Examples of different RING type E3 ligases. .............................................................15 Figure 1.6: The SCF E3 ligase complex and types of FBPs..........................................................20 Figure 1.7: Generalized dynamics of the SCF complex in substrate ubiquitylation .....................23 Figure 1.8: Structural organization and localization of the FBXW7 isoforms..............................28 Figure 1.9: Missense mutations in FBXW7 in the MSK-IMPACT sequencing cohort ................31 Figure 1.10: Comparison of two common strategies used to study PPIs.......................................37 Figure 1.11: Overview of the BioID assay ....................................................................................41 Figure 1.12: Generalized workflow of a bottom-up mass spectrometry experiment. ...................46 Figure 2.1: Expression and knockdown of FlagBirA-tagged FBXW7 isoforms in Flp-In
TREx 293 cells..........................................................................................................................56
Figure 2.2: Epifluorescence and confocal microscopy of FlagBirA-tagged FBXW7 isoforms reveal isoform-specific localization..........................................................................................57
Figure 2.3: Bait-bait Pearson correlation coefficients reveal relationship between FBXW7
BioID datasets...........................................................................................................................61
Figure 2.4: FBXW7 isoforms exhibit distinst substrate profiles ...................................................63 Figure 2.5: Visualization of nucleoplasmic, cytoplasmic, and nucleolar FBXW7 isoform interactor profiles ......................................................................................................................65
Figure 2.6: Putative substrates identified by BioID are stabilized with FBXW7 knockdown ......67 Figure 2.7: Components of the SAGA, ATAC, ASCOM, and DREAM complexes discovered by BioID are stabilized in FBXW7 knockdown conditions......................................................68
Figure 2.8: Effect of hotspot mutation on nucleoplasmic FBXW7 substrate binding is mutantspecific ......................................................................................................................................71
vii
Figure 3.1: Visualization of Cdc4 model performance in the human proteome for the training dataset........................................................................................................................................82
Figure 3.2: Binding intensities in separate peptide-binding assays are generally reproducible ....85 Figure 3.3: Sequence logos of peptides with the top 200 highest and lowest intensities in
FBXW7 peptide binding arrays ................................................................................................87
Figure 3.5: Evaluation of TAF6L phosphodegrons .......................................................................91
viii
List of Tables
Table 2.1: BioID identifies validated substrates and interactors of FBXW7 ................................62 Table 2.2: BioID identifies components of the SAGA, ATAC, ASCOM, and DREAM complex components as putative substrates..............................................................................66
Table 3.1: Cdc4 peptide-binding model outperforms other strategies for finding FBXW7- binding peptides in the human proteome for the training dataset.............................................81
Table 3.2: Binding of select validated FBXW7 phosphodegrons in the training dataset..............83 Table 3.3: Binding of select validated FBXW7 phosphodegrons in proteome-wide screening....86 Table 3.4: Structural feature prediction of TAF6L candidate phosphodegrons ............................90
ix
Abbreviations
AP
Affinity purification
APC/C APEX ASCOM ATAC ATP
Anaphase-promoting complex/cyclosome Engineered ascorbate peroxidase Activating signal co-integrator 2 (complex) Ada Two A-containing (complex) Adenosine triphosphate
BET BiFC BSR CHX CML co-IP CPD
Bromodomain and extra-terminal motif Bimolecular fluorescence complementation Bimolecular sensor/reporter Cycloheximide Chronic myeloid leukemia Co-immunoprecipitation Cdc4 phosphodegron
CRL
Cullin-RING ligase
DD
Dimerization domain
DREAM DSB
Dimerization partner, Rb-like, E2F and multivulval class B (complex) Double-strand break
DUB
Deubiquitylating enzyme
ESI
Electrospray ionization
FBP
F-box protein
FBXL FBXO FBXW FRET HECT HERC IBR
F-box/LRR protein F-box only protein F-box/WD40 repeat-containing protein Förster resonance energy transfer Homologous to E6AP C-terminus HECT domain and RCC1-like domain-containing In between RING (domain) Interferon
IFN LC
Liquid chromatography
LIC
Leukemia-initiating cells
LRR
Leucine rich repeats
m/z
Mass to charge ratio
MALDI MSK- IMPACT NICD NLS
Matrix-assisted laser desorption/ionization Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets Notch intracellular domain Nuclear localization signal
OE-PCR PCA PCR
Overlap-extension polymerase chain reaction Protein fragment complementation assay Polymerase chain reaction
x
PLA
Proximity ligation assay
PMA POI
Phorbol 12-myristate 13-acetate Protein of interest
PPI
Protein-protein interaction
PSM PTM RBR RING SAGA SAINT Sc
Peptide-spectrum match Post-translational modification RING-between-RING Really interesting new gene Spt-Ada-Gcn5 acetyltransferase (complex) Significance analysis of interactome Saccharomyces cerevisiae gene name SKP1-CUL1-FBP complex Support vector machine T-cell acute lymphocytic leukemia Tandem affinity purification Tyrosine kinase binding (domain) Trans-Proteomic Pipeline
SCF SVM T-ALL TAP TKB TPP UBC UFD UPS v-ATPase WD40 Y2H
Ubiquitin conjugating (domain) Ubiquitin fold domain Ubiquitin-proteasome system Vacuolar-type proton-ATPase Tryptophan-aspartic acid repeats (domain) Yeast two-hybrid
xi
List of Appendices
Table S1: A curated overview of validated FBXW7 substrates and phosphodegrons ................113 Table S2: List of primers used for the generation of FlagBirA-tagged or 3xHA-tagged expression vectors...................................................................................................................117
Table S3: List of antibodies used in immunoblotting (IB) and immunofluorescence microscopy (IF).......................................................................................................................119
Figure S1: Binding intensities in separate peptide-binding assays are generally reproducible...120
xii
Chapter 1 – Introduction
1
2
Introduction
The ubiquitin-proteasome system is a major mode of selective protein degradation in eukaryotic organisms. Within this system, the F-box and WD repeat-containing protein 7 (FBXW7) has been implicated in the regulation of many important processes. This thesis focuses on the discovery of novel FBXW7 isoform-specific substrates and other aspect of FBXW7 biology to better characterize its role in health and disease.
1.1 The ubiquitin-proteasome system
From cell division to apoptosis, proteins facilitate essentially all processes in biological systems. However, prior to the mid-1900s, it was widely assumed that proteins were largely stable and rarely replaced (Ciechanover 2005). This paradigm began to shift when radioactive amino acids were found to be rapidly incorporated into the tissues of rats that consumed them, instead of being fully metabolized and excreted (Schoenheimer et al. 1939). Today, protein turnover is understood to be an important process in biological systems. With a median half-life of 46 h, proteins may often outlast their usefulness in shorter processes like cell cycling, which generally take less than half as long in humans (Schwanhäusser et al. 2011; Eden et al. 2011). Errors in transcription, translation, or physical stress like oxidation or heat can also result in misfolded proteins that not only lose their functions, but can form aggregates that are toxic to the cell (Amm et al. 2014). Protein degradation, or proteolysis, is therefore crucial not only for the regulation of biological processes, but also for protein quality control, which includes the rapid destruction of upwards of 30% of newly-translated proteins that fail to fold properly (Schubert et al. 2000). The ubiquitin-proteasome system (UPS) is one of the primary pathways of targeted proteolysis and, unsurprisingly, its dysregulation underlies many human diseases. In cancers, mutations in UPS machinery can lead to inappropriately stabilized oncoproteins, an example of which can be found within the interaction between the von Hippel-Lindau disease tumour suppressor (VHL) and the hypoxia-inducible factor 1-alpha (HIF1A) in highly vascularized tumours such as clear cell renal cell carcinoma (Baldewijns et al. 2010). Neurodegenerative
disorders such as Huntington’s disease and Alzheimer’s disease produce protein aggregates that
impair the function of the UPS, resulting in abnormal synaptic function (Davies et al. 2007; Upadhya and Hegde 2007). The UPS has also been implicated in a variety of other disorders,
including cystic fibrosis, Paget’s disease of bone, viral oncogenesis, muscle wasting disorders,
3
hereditary hypertension, and diabetes (Turnbull et al. 2007; Layfield and Shaw 2007; Shackelford and Pagano 2007; Nury et al. 2007; Rotin 2008; Wing 2008). Given the increasing awareness of its role in health and disease, pharmacological intervention in the UPS has also been a topic of interest. The following sections will provide an overview of key processes in the UPS and their involvement in health and disease.
1.1.1 Ubiquitin-mediated proteolysis
Intracellular proteolysis in mammalian cells is primarily mediated via lysosomes or proteasomes. Lysosomal proteolysis relies on proteases that are active within the acidic environment of the lysosome. In contrast, proteasomal proteolysis is facilitated by the proteasome, a 33-component multiprotein complex found throughout the nucleus and cytoplasm. Both proteolytic pathways utilize ubiquitin, a highly-conserved 76 residue protein that exists in all eukaryotes that can be covalently linked to substrate proteins as a post-translational modification (PTM) in a process referred to as ubiquitylation (as well as ubiquitination). Ubiquitin may also be linked to other ubiquitin molecules at its N-terminal methionine or one of its seven lysine residues, producing
ubiquitin oligomers (or “chains”) of various lengths and branching patterns. The mode in which
the substrate protein is ubiquitylated dictates the effect on the substrate, a review of which can be found in Yau & Rape, 2016. In general, homotypic chains linked through lysine 48 (K48) or K11 target substrates for proteasomal degradation, while other forms of mono- and polyubiquitylation are involved in non-proteasome related functions, including lysosomal degradation. The following section provides an overview of the role of ubiquitin in these proteolytic pathways.